RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6893
         (988 letters)



>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score =  331 bits (849), Expect = 1e-98
 Identities = 154/365 (42%), Positives = 207/365 (56%), Gaps = 40/365 (10%)

Query: 541 DIVDDFWPENEK----EKQLQARITKLSKILSGEVSIDLHLQFLIRSNKTDMLILKNTKE 596
           D+ D    E  +    E   Q     +  ILSGE +      FL+  N TD   L ++K 
Sbjct: 249 DLEDSASQEILEILVTELVAQGYDQAVMSILSGEFTKKYLGAFLLEKNNTDFKFLNSSKS 308

Query: 597 SI--RVSICHTATIIANAFMHSGTTSDQFLRDNLEWLARATNWAKLTATASLGVIHRGHE 654
           S+  + S  H A  +AN+FM+ GT++D F R+NL+WL +A+NWAK TATASLGVIH G+ 
Sbjct: 309 SLARKFSRFHYAVSLANSFMNLGTSNDSFYRNNLDWLGKASNWAKFTATASLGVIHLGNS 368

Query: 655 TEALAHMQSYLPRETGPTSGYYEGGGLYALGLIHANHGAVINDYLLGQVKDATNEM---V 711
                 ++ YLP E   +S   EGG LYALGLI A  G    +YLL    D  +E+   +
Sbjct: 369 NPGYEILKPYLPSEVA-SSRQKEGGALYALGLIKAGFGREDTEYLLEYFLDTEDELTPEL 427

Query: 712 RHGGCLGLGLAAMGSARDDIYEQLKFNLYQDDAVTGEAAGIAMGLVLLGSKSESAIQDMV 771
            +G CLG+GL  MGSA  +IYE+LK  L  D A+ GEAA   MGL++LG+ S  AI+DM 
Sbjct: 428 AYGVCLGIGLINMGSANREIYEKLKELLKNDRALLGEAAVYGMGLLMLGTWSVEAIEDMR 487

Query: 772 AYAKETQHEKILRGLAVGISFIMYGQLENADDLVRDLCEEKDAILRRSGMYTIAMAYCGS 831
            YA ETQHE+I RGL +G + I+YG+ E ADD + +L  +K                   
Sbjct: 488 TYAGETQHERIKRGLGIGFALILYGRQEMADDYINELLYDK------------------- 528

Query: 832 GSNVATKRLLHDAILRRSGMYTIAMAYCGSGSNVATKRLLHVAVSDVNDDVRREAVTALG 891
                      D+ILR +G++++A+AY G+G+      LLH AVSD NDDVRR AV ALG
Sbjct: 529 -----------DSILRYNGVFSLALAYVGTGNLGVVSTLLHYAVSDGNDDVRRAAVIALG 577

Query: 892 FILFR 896
           F+   
Sbjct: 578 FVCCD 582



 Score =  312 bits (801), Expect = 9e-92
 Identities = 164/498 (32%), Positives = 225/498 (45%), Gaps = 133/498 (26%)

Query: 43  LAMSSIQSRSFRAQILRTLVRLYHSLSVPDYVNMAQCWIYLDDPRSVANLLGKLVAGSAD 102
           LA++ ++   FR +ILR L  +      PDY  + +  +YL+D      L+ KLV    +
Sbjct: 181 LAITLVEEEGFRKEILRMLAEIGPGKPKPDYFYVIKAVVYLNDAEKAKALIEKLV--KEN 238

Query: 103 DDLMAAQIGFDLYESATQAFLASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAP 162
           D L+ AQ+ FDL +SA+Q  L                         E   T         
Sbjct: 239 DLLLYAQVAFDLEDSASQEIL-------------------------EILVTELVAQGYDQ 273

Query: 163 TPAPGTESGPSSTTPGPETKTLESLNEKEKQLQARITKLSKILSGEVSIDLHLQFLIRSN 222
                                                 +  ILSGE +      FL+  N
Sbjct: 274 -------------------------------------AVMSILSGEFTKKYLGAFLLEKN 296

Query: 223 KTDMLILKNTKESI--RVSICHTATIIANAFMHSGTTSDQFLRDNLEWLARATNWAKLTA 280
            TD   L ++K S+  + S  H A  +AN+FM+ GT++D F R+NL+WL +A+NWAK TA
Sbjct: 297 NTDFKFLNSSKSSLARKFSRFHYAVSLANSFMNLGTSNDSFYRNNLDWLGKASNWAKFTA 356

Query: 281 TASLGVIHRGHETEALAHMQSYLPRETGPTSGYYEGGGLYALGLIHANHGAVINDYLLGQ 340
           TASLGVIH G+       ++ YLP E   +S   EGG LYALGLI A  G    +YLL  
Sbjct: 357 TASLGVIHLGNSNPGYEILKPYLPSEVA-SSRQKEGGALYALGLIKAGFGREDTEYLLEY 415

Query: 341 VKDATNE-----------------------------------DDAVTGEAAGIAMGLVLL 365
             D  +E                                   D A+ GEAA   MGL++L
Sbjct: 416 FLDTEDELTPELAYGVCLGIGLINMGSANREIYEKLKELLKNDRALLGEAAVYGMGLLML 475

Query: 366 GSKSESAIQDMVAYAKETQHEKILRGLAVGISFIMYGQLENADDLVRDLCEEKDAILRRS 425
           G+ S  AI+DM  YA ETQHE+I RGL +G + I+YG+ E ADD + +L  +KD+ILR  
Sbjct: 476 GTWSVEAIEDMRTYAGETQHERIKRGLGIGFALILYGRQEMADDYINELLYDKDSILR-- 533

Query: 426 GMYTIAMAYCVSYLYFDLFSDLTYRIVKNRSGMYTIAMAYCGSGSNVATKRLLHVAVSDV 485
                                         +G++++A+AY G+G+      LLH AVSD 
Sbjct: 534 -----------------------------YNGVFSLALAYVGTGNLGVVSTLLHYAVSDG 564

Query: 486 NDDVRREAVTALGFILFR 503
           NDDVRR AV ALGF+   
Sbjct: 565 NDDVRRAAVIALGFVCCD 582



 Score = 34.5 bits (79), Expect = 0.33
 Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 5/64 (7%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           K   R K+K  EK    K+ K E+  +  +     +K++++     K   A  I   KKK
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREE-----KGIDAPAILNVKKK 846

Query: 958 KKKK 961
           K  K
Sbjct: 847 KPYK 850



 Score = 33.8 bits (77), Expect = 0.63
 Identities = 13/59 (22%), Positives = 23/59 (38%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           K    A++  K+KG    + K +      +     I ++++K          KKKK  K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850



 Score = 30.3 bits (68), Expect = 7.1
 Identities = 12/64 (18%), Positives = 28/64 (43%), Gaps = 5/64 (7%)

Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK-----KKKKKKEAEKM 979
           +    A++  K+K   +K  K  +  +  + ++   K++++K          KKK+  K+
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851

Query: 980 KKKT 983
              T
Sbjct: 852 DNMT 855



 Score = 29.9 bits (67), Expect = 9.1
 Identities = 13/59 (22%), Positives = 25/59 (42%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           K A + ++K K+K   +K  K +      + ++   K +++           KKKK  K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 77.1 bits (189), Expect = 3e-14
 Identities = 36/98 (36%), Positives = 50/98 (51%), Gaps = 6/98 (6%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK--KKDAEKMKKKGAEKIGKK 954
            KK+    KK       + +    EAE   +K  A + KK+  KK A+  KKK  EK    
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394

Query: 955  KKKKKKKKKKKK----KKKKKKKKEAEKMKKKTEEAEK 988
            + KKK ++ KKK    KK    KK+A++ KKK EE +K
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432



 Score = 74.4 bits (182), Expect = 3e-13
 Identities = 34/95 (35%), Positives = 56/95 (58%), Gaps = 5/95 (5%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK---KKDAEKMKKKGAEKIGK 953
            +K+   + KK AE+ +K  + KK+AE+ +KK  A+  KKK    K A +  K  AE    
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKK--ADAAKKKAEEAKKAAEAAKAEAEAAAD 1357

Query: 954  KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + +  ++K +  +KKK++ KK+A+  KKK EE +K
Sbjct: 1358 EAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392



 Score = 72.5 bits (177), Expect = 8e-13
 Identities = 40/102 (39%), Positives = 63/102 (61%), Gaps = 11/102 (10%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK----KGAEKMKKK---KKDAEKMKKKGAE 949
            KK+    KKK A++++K    KK+A++ +KK    K A++ KKK    K A++ KKK  E
Sbjct: 1397 KKKAEEDKKK-ADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455

Query: 950  KIGKKKKKKKKKKKKKK---KKKKKKKKEAEKMKKKTEEAEK 988
                ++ KKK ++ KK    KKK ++ K+A++ KKK EEA+K
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK 1497



 Score = 72.5 bits (177), Expect = 1e-12
 Identities = 42/103 (40%), Positives = 65/103 (63%), Gaps = 12/103 (11%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK-----KGAEKMKKK---KKDAEKMKKKGA 948
            +K+    KKK A+  +KK ++KK+A++ +KK     K A+++KK    KK A++ KKK  
Sbjct: 1370 EKKKEEAKKK-ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428

Query: 949  EKIGK---KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            EK      KKK ++ KK  + KKK ++ K+AE+ KKK EEA+K
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471



 Score = 71.7 bits (175), Expect = 2e-12
 Identities = 39/97 (40%), Positives = 59/97 (60%), Gaps = 7/97 (7%)

Query: 896  RKKRTRRKK----KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
            +K    +KK    KK AE+ +K  + KK+AE+ +K   A+K  ++ K AE+ KKK  E  
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEE-- 1468

Query: 952  GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              KK  + KKK ++ KK  + KK+AE+ KKK +EA+K
Sbjct: 1469 -AKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKK 1504



 Score = 71.3 bits (174), Expect = 2e-12
 Identities = 40/122 (32%), Positives = 69/122 (56%), Gaps = 13/122 (10%)

Query: 879  NDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKK--EAEKMRKKKGAEKMKKK 936
            N+++R+     +     R+   + ++ + A++++K ++KKK  EA+K  +KK A++ KKK
Sbjct: 1251 NEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310

Query: 937  ---KKDAEKMKKKGAEKIGKK----KKKKKKKKKK---KKKKKKKKKKEAEKMKKKTEEA 986
                K A++ KKK AE+  KK    KKK ++ KK     K + +    EAE  ++K E A
Sbjct: 1311 AEEAKKADEAKKK-AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAA 1369

Query: 987  EK 988
            EK
Sbjct: 1370 EK 1371



 Score = 70.6 bits (172), Expect = 3e-12
 Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 19/111 (17%)

Query: 897  KKRTRRKKKKGAEKIRKKKK--------KKKEAEKMRKK----KGAEKMKKK----KKDA 940
             K         AE   +K +         KK+A+  +KK    K A++ KKK    KK A
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407

Query: 941  EKMKKKGAEKIGK---KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +++KK  A K      KKK ++KKK  + KKK ++ K+A++ KKK EEA+K
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458



 Score = 70.2 bits (171), Expect = 5e-12
 Identities = 37/95 (38%), Positives = 54/95 (56%), Gaps = 6/95 (6%)

Query: 897  KKRTRRKKKKGAEKIRKK---KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            KK+    KKK A+  +KK    KK  EA K   +  A++ +  ++ AE  +KK  E   K
Sbjct: 1321 KKKAEEAKKK-ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA--K 1377

Query: 954  KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK    KKK ++KKK  + KK+AE+ KKK +E +K
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412



 Score = 69.8 bits (170), Expect = 6e-12
 Identities = 37/106 (34%), Positives = 55/106 (51%), Gaps = 16/106 (15%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK---KKGAEKMKKK----------KKDAEK 942
            +     + + + A    +  ++K EA + +K   KK A+  KKK          KK AE+
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE 1402

Query: 943  MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             KKK  E    KK    KKK  + KKK ++KK+A++ KKK EEA+K
Sbjct: 1403 DKKKADE---LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK 1445



 Score = 69.8 bits (170), Expect = 6e-12
 Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 11/103 (10%)

Query: 896  RKKRTRRKKK--KGAEKIRKKKKKKKEAEKMRK----KKGAEKMKKK----KKDAEKMKK 945
            +K   ++K    K AE+ +K  + KK+AE+ +K    KK AE+ KKK    KK AE+  K
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE-AK 1342

Query: 946  KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K AE    + +    + +  ++K +  +K+ E+ KKK + A+K
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385



 Score = 69.4 bits (169), Expect = 8e-12
 Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 6/95 (6%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK---KKGAEKIGK 953
            KK+    KKK  E  +  +  K EAE       AE  ++K + AEK K   KK A+   K
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEA--AADEAEAAEEKAEAAEKKKEEAKKKADAA-K 1384

Query: 954  KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK ++KKK  + KKK ++ KK+A+++KK     +K
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419



 Score = 69.0 bits (168), Expect = 1e-11
 Identities = 32/92 (34%), Positives = 50/92 (54%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
             K+    KKK  E  +K    KK+AE+ +K   A K + +    E    +   +  +KKK
Sbjct: 1314 AKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373

Query: 957  KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            ++ KKK    KKK ++KK+A++ KKK EE +K
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405



 Score = 68.2 bits (166), Expect = 2e-11
 Identities = 40/101 (39%), Positives = 64/101 (63%), Gaps = 10/101 (9%)

Query: 898  KRTRRKKKKGAEKIRKKKKKKKEAEKMRK----KKGAEKMKKK---KKDAEKMKKKGAEK 950
            K+   +KKK  E  +K ++ KK+A++++K    KK A++ KKK   KK A++ KKK  E 
Sbjct: 1384 KKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEA 1443

Query: 951  IGK---KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                  KKK ++ KK ++ KKK ++ K+A++ KKK EEA+K
Sbjct: 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK 1484



 Score = 67.9 bits (165), Expect = 3e-11
 Identities = 35/92 (38%), Positives = 58/92 (63%), Gaps = 3/92 (3%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            +K+   + KK AE+ +K  + KK+AE+ +K + A+K  ++ K A++ KKK  E    KK 
Sbjct: 1429 EKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE---AKKA 1485

Query: 957  KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             + KKK ++ KKK  + K+A + KKK +EA+K
Sbjct: 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKK 1517



 Score = 64.0 bits (155), Expect = 4e-10
 Identities = 40/101 (39%), Positives = 59/101 (58%), Gaps = 11/101 (10%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK----KGAEKMKKK---KKDAEKMKKKGAE 949
            KK+    KK  A++ +KK ++ K+AE+ +KK    K A++ KKK    K A++ KKK  E
Sbjct: 1437 KKKAEEAKK--ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEE 1494

Query: 950  KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK--KKTEEAEK 988
               K  + KK  + KKK  + KK +EA+K    KK EEA+K
Sbjct: 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535



 Score = 63.2 bits (153), Expect = 7e-10
 Identities = 44/100 (44%), Positives = 65/100 (65%), Gaps = 9/100 (9%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-----KKGAEKMKKK---KKDAEKMKKKGA 948
            KK     K K AE+ +K ++ KK+AE+ +K     KK AE +KK+    K AE++KKK A
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA 1712

Query: 949  EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            E+  KK ++ KK +++ K K ++ KKEAE+ KKK EEA+K
Sbjct: 1713 EEK-KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751



 Score = 60.9 bits (147), Expect = 3e-09
 Identities = 37/96 (38%), Positives = 62/96 (64%), Gaps = 7/96 (7%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAE--KMRKKKGAEKMKKKKKDAEKMKK-----KGAE 949
            KK+  + KKK AE+ +K ++ KK  E  K++  + A+K ++ KK AE+ KK     K A 
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAA 1691

Query: 950  KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            +  KK+ ++ KK ++ KKK+ ++KK+AE++KK  EE
Sbjct: 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727



 Score = 60.9 bits (147), Expect = 3e-09
 Identities = 34/94 (36%), Positives = 59/94 (62%), Gaps = 5/94 (5%)

Query: 898  KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK---KKKDAEKMKKKGAEKIGKK 954
             +   + KK AE+ +K ++ +K+A +  KK+  E  K    KKK+AE+ KK  AE++ K 
Sbjct: 1667 AKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKK--AEELKKA 1724

Query: 955  KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +++ K K ++ KK+ ++ KK+AE+ KK  EE +K
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758



 Score = 60.2 bits (145), Expect = 5e-09
 Identities = 40/105 (38%), Positives = 64/105 (60%), Gaps = 14/105 (13%)

Query: 898  KRTRRKKKKGAEKIRKKKKKKKEAEKMRK----KKGAEKMKKK----KKDAEKMKKK--- 946
            K    +KK  AE+ +K ++ K +AE+++K    KK  E++KKK    KK AE++KK    
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658

Query: 947  ---GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                A +  KK ++ KKK ++ KK ++ +KK AE +KK+ EEA+K
Sbjct: 1659 NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703



 Score = 58.2 bits (140), Expect = 2e-08
 Identities = 33/93 (35%), Positives = 58/93 (62%), Gaps = 4/93 (4%)

Query: 898  KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK--KKDAEKMKKKGAEKIGKKK 955
            K+   +KKK  +  +K+ ++KK+AE+++K +   K+K     K AE+ KKK  E   KK 
Sbjct: 1626 KKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE--AKKA 1683

Query: 956  KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            ++ +KK  +  KK+ ++ K+AE++KKK  E +K
Sbjct: 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK 1716



 Score = 58.2 bits (140), Expect = 2e-08
 Identities = 34/101 (33%), Positives = 62/101 (61%), Gaps = 10/101 (9%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRK---KKGAEKMKK--KKKDAEKMKK----KG 947
            KK+    KKK A++ +K  + KK+A++ +K    K A++ KK  + K A++ KK    K 
Sbjct: 1489 KKKAEEAKKK-ADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547

Query: 948  AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            A+++ K ++ KK ++KKK ++ KK +++     +K EEA+K
Sbjct: 1548 ADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588



 Score = 57.5 bits (138), Expect = 3e-08
 Identities = 36/105 (34%), Positives = 61/105 (58%), Gaps = 15/105 (14%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK-----KGAEKMKKK---KKDAEKMKKKGA 948
            KK+    KK  A++ +KK ++ K+A++ +KK     K A++ KK    KK A++ KK   
Sbjct: 1463 KKKAEEAKK--ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520

Query: 949  EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK-----KTEEAEK 988
             K   + KK ++ KK  + KK ++KK+A+++KK     K EE +K
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565



 Score = 57.5 bits (138), Expect = 4e-08
 Identities = 33/92 (35%), Positives = 62/92 (67%), Gaps = 1/92 (1%)

Query: 898  KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK-KDAEKMKKKGAEKIGKKKK 956
            K+    +KK AE ++K+ ++ K+AE+++KK+  EK K ++ K AE+  K  AE+  K+ +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740

Query: 957  KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + KKK ++ KK +++KKK A   K++ ++AE+
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772



 Score = 53.2 bits (127), Expect = 8e-07
 Identities = 33/99 (33%), Positives = 57/99 (57%), Gaps = 9/99 (9%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK--KKKDAEKMKKKGAEKIGKK 954
            ++R   + +K  E       +++ A K  + + A+++KK  +KK A++ KK  AE+  K 
Sbjct: 1247 EERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKK--AEEKKKA 1304

Query: 955  KKKKKKKKKKK-----KKKKKKKKKEAEKMKKKTEEAEK 988
             + KKK ++ K     KKK ++ KK+A+  KKK EEA+K
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343



 Score = 52.8 bits (126), Expect = 9e-07
 Identities = 35/95 (36%), Positives = 56/95 (58%), Gaps = 6/95 (6%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK---KKDAEKMKKKGAEKIGK 953
            + R     K   E+ + K ++ K+AE+ + K  AE++KK    KK  E++KKK AE+  K
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIK--AEELKKAEEEKKKVEQLKKKEAEEKKK 1648

Query: 954  KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             ++ KK +++ K K  ++ KK  E  KKK EEA+K
Sbjct: 1649 AEELKKAEEENKIKAAEEAKKAEED-KKKAEEAKK 1682



 Score = 52.8 bits (126), Expect = 1e-06
 Identities = 34/112 (30%), Positives = 60/112 (53%), Gaps = 19/112 (16%)

Query: 896  RKKRTRRKKK--KGAEKIRKKKKKKKEAEKMR-----------------KKKGAEKMKKK 936
            RK    +K +  K AE+ +K  ++ K+AE+ R                 +++ A K ++ 
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEA 1277

Query: 937  KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K  E  K +  +K  + KK ++KKK  + KKK ++ K+A++ KKK EEA+K
Sbjct: 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1329



 Score = 52.1 bits (124), Expect = 2e-06
 Identities = 32/92 (34%), Positives = 55/92 (59%), Gaps = 6/92 (6%)

Query: 899  RTRRKKKKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKK--KKKDAEKMKKKGAEKIGKK 954
            +   + KK A++ +K  + KK  EA+K  + K A++ KK  +KK A+++KK  AE++ K 
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK--AEELKKA 1560

Query: 955  KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            ++KKK ++ KK ++ K       +  KK EEA
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEA 1592



 Score = 51.3 bits (122), Expect = 3e-06
 Identities = 40/99 (40%), Positives = 60/99 (60%), Gaps = 10/99 (10%)

Query: 897  KKRTRRKKKKGAEKIRKK----KKKKKEAEKMRK-KKGAEKMKK--KKKDAEKMKKKGAE 949
            KK+    KK  A++ +KK    KKK  EA+K  + KK A++ KK  + K A++ KK    
Sbjct: 1476 KKKAEEAKK--ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEA 1533

Query: 950  KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K   + KK ++KKK  + KK ++ K+AE+ KKK EEA+K
Sbjct: 1534 KKADEAKKAEEKKKADELKKAEELKKAEE-KKKAEEAKK 1571



 Score = 50.9 bits (121), Expect = 4e-06
 Identities = 35/97 (36%), Positives = 56/97 (57%), Gaps = 8/97 (8%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK----KKKKDAEKMKKKGAEKIG 952
            +K+   + KK  E      +K +EA+K  + +  E MK    +KK  AE+ KK    KI 
Sbjct: 1562 EKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIK 1621

Query: 953  ----KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
                KK +++KKK ++ KKK+ ++KK+AE++KK  EE
Sbjct: 1622 AEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658



 Score = 50.5 bits (120), Expect = 4e-06
 Identities = 28/95 (29%), Positives = 58/95 (61%), Gaps = 2/95 (2%)

Query: 896  RKKRTRRKKK--KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            +K   ++K    K AE+++K ++KKK  E  + ++      +K ++A+K ++   E++ K
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599

Query: 954  KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              +++KK K ++ KK ++ K +AE++KK  EE +K
Sbjct: 1600 LYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634



 Score = 46.7 bits (110), Expect = 8e-05
 Identities = 32/93 (34%), Positives = 54/93 (58%), Gaps = 6/93 (6%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK----KKDAEKMKKKGAEKIG 952
             K+    KKK AE+ +K ++ KK  E+  K K AE+ KK+    KK AE+ KK   EK  
Sbjct: 1701 AKKAEELKKKEAEEKKKAEELKKA-EEENKIK-AEEAKKEAEEDKKKAEEAKKDEEEKKK 1758

Query: 953  KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
                KK+++KK ++ +K+K+    E++ ++ E+
Sbjct: 1759 IAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEK 1791



 Score = 46.3 bits (109), Expect = 8e-05
 Identities = 34/112 (30%), Positives = 58/112 (51%), Gaps = 19/112 (16%)

Query: 896  RKKRTRRKK---------KKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKK--------K 936
            +K    +KK          K A++ +K  + KK  EA+K  +KK A+++KK        +
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562

Query: 937  KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK AE+ KK   +K    +K ++ KK ++ + ++  K   E+ K K EEA+K
Sbjct: 1563 KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKK 1614



 Score = 46.3 bits (109), Expect = 1e-04
 Identities = 32/90 (35%), Positives = 51/90 (56%), Gaps = 6/90 (6%)

Query: 905  KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK--KKKDAEKMKKKGAEKIGKKKKKKKKKK 962
            KK  EK + ++ KK E +K    + AE+ KK  + +  E MK    EK  K ++ KK ++
Sbjct: 1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEE 1617

Query: 963  KKKK----KKKKKKKKEAEKMKKKTEEAEK 988
             K K    KK +++KK+ E++KKK  E +K
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKKEAEEKK 1647



 Score = 45.9 bits (108), Expect = 1e-04
 Identities = 37/99 (37%), Positives = 62/99 (62%), Gaps = 9/99 (9%)

Query: 896  RKKRTRRKKK----KGAEKIRKKK-KKKKEAEKMRKKKGAEKMK--KKKKDAEKMKKKGA 948
            +K     KK+    K AE+++KK+ ++KK+AE+++K +   K+K  + KK+AE+ KKK  
Sbjct: 1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE 1747

Query: 949  EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            E   KK +++KKK    KK+++KK +E  K K+   E E
Sbjct: 1748 E--AKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEE 1784



 Score = 45.1 bits (106), Expect = 2e-04
 Identities = 30/104 (28%), Positives = 55/104 (52%), Gaps = 11/104 (10%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKK-------KEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
            + +  +    + AE+ +K ++ +        E EK  K + A+K ++ K  AE++KK   
Sbjct: 1571 KAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEE 1630

Query: 949  EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK----KKTEEAEK 988
            EK   ++ KKK+ ++KKK ++ KK +E  K+K     K  E +K
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674



 Score = 42.1 bits (98), Expect = 0.002
 Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 1/90 (1%)

Query: 899  RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
            +   K   G ++  K   K  + +     +  E  ++    AE+ KK    K  + +K +
Sbjct: 1059 KAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAE 1118

Query: 959  KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + KKK +  +K ++ ++AE   +K EEA K
Sbjct: 1119 EAKKKAEDARKAEEARKAEDA-RKAEEARK 1147



 Score = 39.0 bits (90), Expect = 0.018
 Identities = 31/108 (28%), Positives = 54/108 (50%), Gaps = 3/108 (2%)

Query: 877  DVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK 936
            D  +D R +  T   F    + +     K  AE+ RK ++ KK+AE  RK + A K +  
Sbjct: 1082 DAKEDNRADEATEEAFGKAEEAKKTETGK--AEEARKAEEAKKKAEDARKAEEARKAEDA 1139

Query: 937  KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
            +K AE+ +K    K  +  +K +  +K ++ +K +  K+AE  +K  E
Sbjct: 1140 RK-AEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEE 1186



 Score = 38.6 bits (89), Expect = 0.022
 Identities = 25/84 (29%), Positives = 51/84 (60%), Gaps = 1/84 (1%)

Query: 901  RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
            + +  + AE++RK ++ +K AE  RK + A K ++++K  E  K + A+K    KK ++ 
Sbjct: 1177 KAEAARKAEEVRKAEELRK-AEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEA 1235

Query: 961  KKKKKKKKKKKKKKEAEKMKKKTE 984
            KK  ++ KK ++++  E+++K  E
Sbjct: 1236 KKDAEEAKKAEEERNNEEIRKFEE 1259



 Score = 38.2 bits (88), Expect = 0.025
 Identities = 27/93 (29%), Positives = 52/93 (55%), Gaps = 1/93 (1%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            R +  R+ +     +  +K +  K+AE  RK +   K ++ +K AE  +K  A +  +++
Sbjct: 1153 RVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRK-AEDARKAEAARKAEEE 1211

Query: 956  KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K ++ +K +  KK +  K+AE+ KK  EEA+K
Sbjct: 1212 RKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKK 1244



 Score = 37.8 bits (87), Expect = 0.038
 Identities = 24/87 (27%), Positives = 48/87 (55%), Gaps = 5/87 (5%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            KK     K K  E  ++ ++ KK+AE+ +K +     KKK    +K ++K AE+I  +K+
Sbjct: 1722 KKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE---KKKIAHLKKEEEKKAEEI--RKE 1776

Query: 957  KKKKKKKKKKKKKKKKKKEAEKMKKKT 983
            K+   +++  ++ +K++ E +K  K  
Sbjct: 1777 KEAVIEEELDEEDEKRRMEVDKKIKDI 1803



 Score = 37.8 bits (87), Expect = 0.038
 Identities = 45/145 (31%), Positives = 65/145 (44%), Gaps = 44/145 (30%)

Query: 884  REAVTALGFILFRKKRTRRKKKKGAEKIRK--------KKKKKKEAEKMRKKKGAEKMKK 935
            ++A  A      RK    RK    AE  RK        +++K +EA K    K AE +KK
Sbjct: 1176 KKAEAARKAEEVRKAEELRK----AEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKK 1231

Query: 936  ---KKKDAEKMKK------------------------KGAEKIGKKKK-----KKKKKKK 963
                KKDAE+ KK                        + A K  + +K     K ++KKK
Sbjct: 1232 AEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKK 1291

Query: 964  KKKKKKKKKKKEAEKMKKKTEEAEK 988
              + KK ++KK+A++ KKK EEA+K
Sbjct: 1292 ADEAKKAEEKKKADEAKKKAEEAKK 1316



 Score = 37.0 bits (85), Expect = 0.068
 Identities = 27/84 (32%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 901  RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
            + +  K AE  RK ++ +K AE++RK + A K +  +K  E+ K + A K    KK +  
Sbjct: 1171 KAEDAKKAEAARKAEEVRK-AEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAV 1229

Query: 961  KKKKKKKKKKKKKKEAEKMKKKTE 984
            KK ++ KK  ++ K+AE+ +   E
Sbjct: 1230 KKAEEAKKDAEEAKKAEEERNNEE 1253



 Score = 36.3 bits (83), Expect = 0.097
 Identities = 22/80 (27%), Positives = 45/80 (56%), Gaps = 2/80 (2%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
             K +    KK+  E  +K ++ KK+ E+  KKK A   K+++K AE+++K+    I ++ 
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEE--KKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785

Query: 956  KKKKKKKKKKKKKKKKKKKE 975
             ++ +K++ +  KK K   +
Sbjct: 1786 DEEDEKRRMEVDKKIKDIFD 1805



 Score = 35.1 bits (80), Expect = 0.24
 Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 3/104 (2%)

Query: 884  REAVTALGFILFRKKRTRRKKK--KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE 941
            R+A  A    + RK    RK +  + AE  +K +  +K AE++RK +   K +  +K   
Sbjct: 1146 RKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARK-AEEVRKAEELRKAEDARKAEA 1204

Query: 942  KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
              K +   K  + +K +  KK +  KK ++ KK+AE+ KK  EE
Sbjct: 1205 ARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEE 1248



 Score = 30.9 bits (69), Expect = 4.1
 Identities = 27/105 (25%), Positives = 53/105 (50%), Gaps = 12/105 (11%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKK--------KEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
            RK    +KK + A K  + +K +        ++AE  ++ + A K +  +K  E  K + 
Sbjct: 1115 RKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAED 1174

Query: 948  AEKI--GKKKKKKKKKKKKKKKKKKKKKKEAEKMK--KKTEEAEK 988
            A+K    +K ++ +K ++ +K +  +K + A K +  +K EEA K
Sbjct: 1175 AKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARK 1219


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 66.9 bits (163), Expect = 4e-11
 Identities = 53/245 (21%), Positives = 106/245 (43%), Gaps = 22/245 (8%)

Query: 611 NAFMHSGTTSDQFLRDNLEWLARATNWAKLTATASLGVIHRGHETEALAHMQSYLPRETG 670
           N  ++ G  +D  +  + EW+ +      ++A AS+GVI   +  + L  +  YL  +  
Sbjct: 368 NDPINLGYENDSLIPLDDEWIYKCKVPGLISAFASIGVIESWNSDKGLETLDKYLYAD-- 425

Query: 671 PTSGYYEGGGLYALGLIH-ANHG------AVINDYLLGQVKDATNEMVRHGGCLGLGLAA 723
               Y + G L  +GL             A++++YL      +++        LGLG A 
Sbjct: 426 --ESYRKAGALLGIGLSGLRVFEERPPALALLSNYL-----QSSSSKHVIAAILGLGAAF 478

Query: 724 MGSARDDIYEQLKFNLYQDDAVTGEA--AGIAMGLVLLGSKSE----SAIQDMVAYAKET 777
            G+  +++ E L+  ++  D+       A + +G V +G+ +       +Q  V   K  
Sbjct: 479 SGTQAEEVLELLQPIMFSTDSPIEVVFFASLTLGSVFVGTCNGDLTSLILQTFVERGKIE 538

Query: 778 QHEKILRGLAVGISFIMYGQLENADDLVRDLCEEKDAILRRSGMYTIAMAYCGSGSNVAT 837
              +  R LA+G++ + YG+ +  DD+   +   + A+ +   +      Y G+G  +  
Sbjct: 539 SETQWFRFLALGLASLFYGRKDQVDDVEETIMAIEGALSKHEEILVKGCQYVGTGDVLVI 598

Query: 838 KRLLH 842
           + LLH
Sbjct: 599 QSLLH 603



 Score = 39.1 bits (91), Expect = 0.014
 Identities = 52/287 (18%), Positives = 99/287 (34%), Gaps = 75/287 (26%)

Query: 249 NAFMHSGTTSDQFLRDNLEWLARATNWAKLTATASLGVIHRGHETEALAHMQSYLPRETG 308
           N  ++ G  +D  +  + EW+ +      ++A AS+GVI   +  + L  +  YL  +  
Sbjct: 368 NDPINLGYENDSLIPLDDEWIYKCKVPGLISAFASIGVIESWNSDKGLETLDKYLYAD-- 425

Query: 309 PTSGYYEGGGLYALGLIH-ANHG------AVINDYL--------------LGQVKDATNE 347
               Y + G L  +GL             A++++YL              LG     T  
Sbjct: 426 --ESYRKAGALLGIGLSGLRVFEERPPALALLSNYLQSSSSKHVIAAILGLGAAFSGTQA 483

Query: 348 DDAV---------------TGEAAGIAMGLVLLGSKSE----SAIQDMVAYAKETQHEKI 388
           ++ +                   A + +G V +G+ +       +Q  V   K     + 
Sbjct: 484 EEVLELLQPIMFSTDSPIEVVFFASLTLGSVFVGTCNGDLTSLILQTFVERGKIESETQW 543

Query: 389 LRGLAVGISFIMYGQLENADDLVRDLCEEKDAILRRSGMYTIAMAYCVSYLYFDLFSDLT 448
            R LA+G++ + YG+ +  DD+   +   + A+ +      I +  C             
Sbjct: 544 FRFLALGLASLFYGRKDQVDDVEETIMAIEGALSKHE---EILVKGC------------- 587

Query: 449 YRIVKNRSGMYTIAMAYCGSGSNVATKRLLHVAVSDVNDDVRREAVT 495
                           Y G+G  +  + LLHV      D ++ E   
Sbjct: 588 ---------------QYVGTGDVLVIQSLLHVKDEFTGDTLKNEEAL 619


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 63.7 bits (155), Expect = 1e-10
 Identities = 32/99 (32%), Positives = 53/99 (53%), Gaps = 6/99 (6%)

Query: 896 RKKRTRRKK-----KKGAEKIRKKKK-KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
           R+K   ++       K AE+  K+ + K+K+AE+ + K+ AE   K + +AEK  K+ A+
Sbjct: 90  RQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAK 149

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K  +++ K K   + KKK  + KKK   + K K E   K
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAK 188



 Score = 59.5 bits (144), Expect = 3e-09
 Identities = 36/102 (35%), Positives = 46/102 (45%), Gaps = 10/102 (9%)

Query: 897 KKRTRRKKKKGAE---KIRKKKKKKKEAEKMRKKKGAEKMK----KKKKDAEKMKKKGAE 949
            K+   K+K+  E   K   + K K EAE  +K K   K +     K K A + KKK AE
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170

Query: 950 KIGKKKKKKKKKKKKKKKKK---KKKKKEAEKMKKKTEEAEK 988
              K + + K K + K K K    K K EA K K   E A K
Sbjct: 171 AKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAK 212



 Score = 57.2 bits (138), Expect = 2e-08
 Identities = 27/89 (30%), Positives = 51/89 (57%), Gaps = 6/89 (6%)

Query: 903 KKKKGAEKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           +K++ AE+ R+K+   +   E    + ++ A++ ++K+K AE+ K K A +    K K +
Sbjct: 81  EKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE---AKAKAE 137

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + +KK K++ KK+ E E   K   EA+K
Sbjct: 138 AEAEKKAKEEAKKQAEEEAKAKAAAEAKK 166



 Score = 56.4 bits (136), Expect = 3e-08
 Identities = 37/104 (35%), Positives = 47/104 (45%), Gaps = 11/104 (10%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKK---KKEAEKMRKKKGAEKMKKK----KKDAEKMKKKGA 948
            K +   + K  AE   +KK K   KK+AE+  K K A + KKK    KK AE   K  A
Sbjct: 124 AKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183

Query: 949 EKIGKKKKKKKKKK----KKKKKKKKKKKKEAEKMKKKTEEAEK 988
           E   K K ++ K K    K K   +   K EAE       EAE+
Sbjct: 184 EAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAER 227



 Score = 55.6 bits (134), Expect = 5e-08
 Identities = 31/100 (31%), Positives = 61/100 (61%), Gaps = 11/100 (11%)

Query: 898 KRTRRKKKKGAEKIRKKKKK-KKEAEKMRKKKGAE-----KMKKKKKDAEKMKKKGAEKI 951
            R +++KK  A+K ++++KK +++AE+  K++ AE     +++++    +  K+  AE+ 
Sbjct: 53  NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQ--AEQA 110

Query: 952 GKKKKKKKK---KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K+ ++K+K   + K K+  + K K EAE  KK  EEA+K
Sbjct: 111 AKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150



 Score = 51.8 bits (124), Expect = 1e-06
 Identities = 25/96 (26%), Positives = 52/96 (54%), Gaps = 5/96 (5%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK--KKKKDAEKMKKKGAEK---IG 952
            +  ++++K  E+  ++ +K++ AE+ R+K+  ++    K  K AE+  K+  EK     
Sbjct: 63  AKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAE 122

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + K K+  + K K + + +KK + E  K+  EEA+ 
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158



 Score = 42.1 bits (99), Expect = 9e-04
 Identities = 20/82 (24%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK-KKK 966
           A +I+++KK   + E+ R+KK  ++ ++ +K     + +  E   +   +K  K+  +  
Sbjct: 52  ANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAA 111

Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
           K+ ++K+K+AE+ K K     K
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAK 133



 Score = 41.7 bits (98), Expect = 0.002
 Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 6/82 (7%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            K +   + KK A + +KK + + +A+   K K   +  K K +A K       K   + 
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAK------AKAAAEA 209

Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
             K + +       + ++K  E
Sbjct: 210 AAKAEAEAAAAAAAEAERKADE 231



 Score = 36.0 bits (83), Expect = 0.081
 Identities = 16/75 (21%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK--KKK 973
                 +  R ++  +   KK+++ +K  ++ AE+  K++  ++ ++K+ +++    K  
Sbjct: 45  PGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA 104

Query: 974 KEAEKMKKKTEEAEK 988
           K+AE+  K+ EE +K
Sbjct: 105 KQAEQAAKQAEEKQK 119


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 57.7 bits (140), Expect = 9e-10
 Identities = 37/91 (40%), Positives = 57/91 (62%), Gaps = 8/91 (8%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           +  RR++ +  E + +K KK+ E E+ ++K+     +KK+KD EK  KK A    K++KK
Sbjct: 42  RALRRREYERLELMDEKWKKETEDEEFQQKR----EEKKRKDEEKTAKKRA----KRQKK 93

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+KKKKKKK KK  KK+E E  K   E +++
Sbjct: 94  KQKKKKKKKAKKGNKKEEKEGSKSSEESSDE 124



 Score = 47.7 bits (114), Expect = 2e-06
 Identities = 29/87 (33%), Positives = 49/87 (56%), Gaps = 16/87 (18%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           +K+ +  E  +K+++KK++ E+   KK A++ KKK                +KKKKKKK 
Sbjct: 60  KKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKK----------------QKKKKKKKA 103

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK  KK++K+  K +E+   + EE E+
Sbjct: 104 KKGNKKEEKEGSKSSEESSDEEEEGEE 130



 Score = 44.7 bits (106), Expect = 3e-05
 Identities = 23/78 (29%), Positives = 47/78 (60%), Gaps = 3/78 (3%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           F++KR  +K+K   E+   KK+ K++ +K +KKK  +K KK  K  EK   K +E+   +
Sbjct: 68  FQQKREEKKRK--DEEKTAKKRAKRQKKK-QKKKKKKKAKKGNKKEEKEGSKSSEESSDE 124

Query: 955 KKKKKKKKKKKKKKKKKK 972
           +++ ++ K+++  +  +K
Sbjct: 125 EEEGEEDKQEEPVEIMEK 142


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 61.4 bits (149), Expect = 1e-09
 Identities = 26/93 (27%), Positives = 55/93 (59%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +++   +++ K  +K +K+K K++  ++  K++  EK   K+K+ EK KK    +  +++
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK+++ + K + KK  KKK   K K+  EE ++
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196



 Score = 58.7 bits (142), Expect = 1e-08
 Identities = 27/87 (31%), Positives = 49/87 (56%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           ++ +KG  K    K K  +  K    K  EK K++ K+ +K KK+  ++  K +K K++ 
Sbjct: 78  KRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+K+  K+K+K+KE +  + +  E EK
Sbjct: 138 KEKRPPKEKEKEKEKKVEEPRDREEEK 164



 Score = 58.4 bits (141), Expect = 1e-08
 Identities = 32/93 (34%), Positives = 51/93 (54%), Gaps = 4/93 (4%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK-KGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +K+ +++K K   K RK K++ KE    ++K K  EK  ++ +D E+ KK+   +    K
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVR---AK 172

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + KK  KKK   KKK+  E EK ++   EA K
Sbjct: 173 SRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVK 205



 Score = 56.0 bits (135), Expect = 6e-08
 Identities = 22/94 (23%), Positives = 58/94 (61%), Gaps = 2/94 (2%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
               +T+  K+   E  ++++K+K++ ++ +KKK  ++  K++    K K++  EK   K
Sbjct: 87  GPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKK--KEKPKEEPKDRKPKEEAKEKRPPK 144

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +K+K+K+KK ++ + ++++K+ E+++ K+   + 
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKP 178



 Score = 52.6 bits (126), Expect = 8e-07
 Identities = 27/91 (29%), Positives = 50/91 (54%), Gaps = 7/91 (7%)

Query: 896 RKKRTRRKKKKGAEKIRKKKK-KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
            + + R+ K++  EK   K+K K+KE     KK    + ++++K  E+++ K   K    
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKE-----KKVEEPRDREEEKKRERVRAKSRPK-KPP 179

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           KKK   KKK+  +++K+++   E +K K EE
Sbjct: 180 KKKPPNKKKEPPEEEKQRQAAREAVKGKPEE 210



 Score = 49.9 bits (119), Expect = 6e-06
 Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 1/76 (1%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           K  + K    EA K R +KG  K    K    K  K  + K  +K+K++ K++KKKKK+K
Sbjct: 65  KCAESKLSSDEAVK-RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEK 123

Query: 970 KKKKKEAEKMKKKTEE 985
            K++ +  K K++ +E
Sbjct: 124 PKEEPKDRKPKEEAKE 139



 Score = 48.0 bits (114), Expect = 2e-05
 Identities = 23/94 (24%), Positives = 45/94 (47%), Gaps = 4/94 (4%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMR---KKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           KR  ++K+K  EK + ++ + +E EK R   + K   K   KKK   K K+   E+  ++
Sbjct: 140 KRPPKEKEKEKEK-KVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQ 198

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             ++  K K ++    +++++ E   K  E    
Sbjct: 199 AAREAVKGKPEEPDVNEEREKEEDDGKDRETTTS 232



 Score = 41.0 bits (96), Expect = 0.003
 Identities = 17/67 (25%), Positives = 38/67 (56%)

Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
            K+   +  ++++K        K K A++   +  K+++K+K++ K++KKKKKE  K + 
Sbjct: 69  SKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEP 128

Query: 982 KTEEAEK 988
           K  + ++
Sbjct: 129 KDRKPKE 135



 Score = 41.0 bits (96), Expect = 0.003
 Identities = 20/94 (21%), Positives = 46/94 (48%), Gaps = 4/94 (4%)

Query: 896 RKKRTRRKKKKG--AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            K++ ++ ++     E+ ++++ + K   K   KK      KKK+  E+ K++ A +   
Sbjct: 147 EKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPP--NKKKEPPEEEKQRQAAREAV 204

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K K ++    ++++K++   K+ E      EE E
Sbjct: 205 KGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDE 238



 Score = 39.5 bits (92), Expect = 0.008
 Identities = 20/99 (20%), Positives = 39/99 (39%), Gaps = 6/99 (6%)

Query: 896 RKKRTRRKKKKGAEKIRKKKK---KKKEAEKMRKKKGAEKMKKKKKDAEKM---KKKGAE 949
            +K+  R + K   K   KKK   KKKE  +  K++ A +   K K  E     +++  E
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE 221

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             GK ++      ++ + ++  +         K  +   
Sbjct: 222 DDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSP 260



 Score = 37.9 bits (88), Expect = 0.028
 Identities = 18/91 (19%), Positives = 44/91 (48%), Gaps = 1/91 (1%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK-KKKDAEKMKKKGAEKIGKK 954
            ++   R+++K  E++R K + KK  +K    K  E  ++ K++ A +   KG  +    
Sbjct: 154 VEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDV 213

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            ++++K++   K ++       E   +++ E
Sbjct: 214 NEEREKEEDDGKDRETTTSPMEEDESRQSSE 244



 Score = 37.2 bits (86), Expect = 0.050
 Identities = 17/61 (27%), Positives = 35/61 (57%)

Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K AE      +  ++++K G++    K K  K+ K +  K+++K+K++ ++ KKK +E  
Sbjct: 65  KCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKP 124

Query: 988 K 988
           K
Sbjct: 125 K 125



 Score = 31.8 bits (72), Expect = 2.3
 Identities = 17/91 (18%), Positives = 33/91 (36%), Gaps = 5/91 (5%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG-----AEKMKKKKKDAEKMKKKGAEK 950
             K+    KKK   +  K+++  +EA K + ++       EK +   KD E       E 
Sbjct: 178 PPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEED 237

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
             ++  +  ++     KK       A    +
Sbjct: 238 ESRQSSEISRRSSSSLKKPDPSPSMASPETR 268



 Score = 30.6 bits (69), Expect = 4.7
 Identities = 12/89 (13%), Positives = 40/89 (44%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            KK+   KKK+  E+ ++++  ++  +   ++    + ++K++D  K ++     + + +
Sbjct: 179 PKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDE 238

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
            ++  +  ++     KK   +  M     
Sbjct: 239 SRQSSEISRRSSSSLKKPDPSPSMASPET 267


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 59.5 bits (144), Expect = 3e-09
 Identities = 28/91 (30%), Positives = 38/91 (41%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           K+  +     +   +   K K  A+K+ KKK A+     KK A+   K   +     KK 
Sbjct: 200 KKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKA 259

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK  K  KK  KK  K A K  K   +A K
Sbjct: 260 LKKAAKAVKKAAKKAAKAAAKAAKGAAKATK 290



 Score = 55.7 bits (134), Expect = 5e-08
 Identities = 28/93 (30%), Positives = 38/93 (40%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            K   ++   K     +K  KKK     +  KK A+   K  K A+K  KK  +K  K  
Sbjct: 208 AKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAV 267

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK  KK  K   K  K   +A K K K ++   
Sbjct: 268 KKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAG 300



 Score = 55.7 bits (134), Expect = 5e-08
 Identities = 30/90 (33%), Positives = 36/90 (40%), Gaps = 1/90 (1%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K     KK   +K  K     K+A K   K  A+K KK  K A K   K  +K  KK  
Sbjct: 217 AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKA-AKKAKKTAKKALKKAAKAVKKAAKKAA 275

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K   K  K   K  K K +A+K   K   A
Sbjct: 276 KAAAKAAKGAAKATKGKAKAKKKAGKKAAA 305



 Score = 52.2 bits (125), Expect = 6e-07
 Identities = 29/87 (33%), Positives = 34/87 (39%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           K+   K    A+K  K   K  +  K   KK  +K  K  K A K   K A K  K   K
Sbjct: 228 KKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAK 287

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTE 984
             K K K KKK  KK     K K   +
Sbjct: 288 ATKGKAKAKKKAGKKAAAGSKAKATAK 314



 Score = 50.7 bits (121), Expect = 2e-06
 Identities = 29/87 (33%), Positives = 37/87 (42%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +   + +K  K A K  KK KK  +    +  K  +K  KK   A     KGA K  K K
Sbjct: 233 KTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGK 292

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKK 982
            K KKK  KK     K K  A+  K+ 
Sbjct: 293 AKAKKKAGKKAAAGSKAKATAKAPKRG 319



 Score = 49.9 bits (119), Expect = 4e-06
 Identities = 30/93 (32%), Positives = 39/93 (41%), Gaps = 2/93 (2%)

Query: 897 KKRTRRKKKKGAEKIRKKK--KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
             +   KKK     +  KK  K   +A K  KK   + +KK  K  +K  KK A+   K 
Sbjct: 222 AAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKA 281

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            K   K  K K K KKK  K+A    K    A+
Sbjct: 282 AKGAAKATKGKAKAKKKAGKKAAAGSKAKATAK 314



 Score = 48.7 bits (116), Expect = 7e-06
 Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 2/87 (2%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K  ++ KK   + ++K  K  K+A K +  K A K  K    A K K K  +K GKK  
Sbjct: 246 AKAAKKAKKTAKKALKKAAKAVKKAAK-KAAKAAAKAAKGAAKATKGKAKAKKKAGKKAA 304

Query: 957 KKKKKKKK-KKKKKKKKKKEAEKMKKK 982
              K K   K  K+  K K+A+K+ KK
Sbjct: 305 AGSKAKATAKAPKRGAKGKKAKKVTKK 331



 Score = 46.4 bits (110), Expect = 4e-05
 Identities = 30/89 (33%), Positives = 36/89 (40%), Gaps = 6/89 (6%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
              A+K  K    K  A+K   K  +   K  KK A K     A+K  K   K  KK KK
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAK-TAVSAKKAAKTAAKAAKKAKK 254

Query: 965 K-----KKKKKKKKKEAEKMKKKTEEAEK 988
                 KK  K  KK A+K  K   +A K
Sbjct: 255 TAKKALKKAAKAVKKAAKKAAKAAAKAAK 283



 Score = 46.4 bits (110), Expect = 5e-05
 Identities = 31/88 (35%), Positives = 37/88 (42%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
           +T    KK A+   K  KK K+  K   KK A+ +KK  K A K   K A+   K  K K
Sbjct: 233 KTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGK 292

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            K KKK  KK     K     K     A
Sbjct: 293 AKAKKKAGKKAAAGSKAKATAKAPKRGA 320



 Score = 43.3 bits (102), Expect = 5e-04
 Identities = 25/79 (31%), Positives = 32/79 (40%), Gaps = 1/79 (1%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           +     KK       K    +   K K  A+K+ KK A K     KK  K   K  KK K
Sbjct: 194 VGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAK 253

Query: 971 KK-KKEAEKMKKKTEEAEK 988
           K  KK  +K  K  ++A K
Sbjct: 254 KTAKKALKKAAKAVKKAAK 272



 Score = 38.7 bits (90), Expect = 0.011
 Identities = 22/70 (31%), Positives = 28/70 (40%), Gaps = 1/70 (1%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K  ++  KK A+   K  K   +A K  K K  +K  KK     K K          K 
Sbjct: 264 AKAVKKAAKKAAKAAAKAAKGAAKATK-GKAKAKKKAGKKAAAGSKAKATAKAPKRGAKG 322

Query: 957 KKKKKKKKKK 966
           KK KK  KK+
Sbjct: 323 KKAKKVTKKR 332


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 56.2 bits (136), Expect = 6e-09
 Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           ++KKK+ AE+I    K KKE E+  K+K   K KK KK  +K K K  +K  K  K +KK
Sbjct: 61  KKKKKELAEEIE---KVKKEYEE--KQKWKWKKKKSKKKKDKDKDKKDDK--KDDKSEKK 113

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            +K+ + K +   K   +      E +
Sbjct: 114 DEKEAEDKLEDLTKSYSETLSTLSELK 140



 Score = 53.2 bits (128), Expect = 6e-08
 Identities = 26/57 (45%), Positives = 36/57 (63%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K+  ++ EK+KK+  EK   K KKKK KKKK K K KK  K+ +K +KK E+  +
Sbjct: 63  KKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119



 Score = 53.2 bits (128), Expect = 6e-08
 Identities = 27/70 (38%), Positives = 42/70 (60%), Gaps = 1/70 (1%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK-KKKKKEAEK 978
              K +KK+ AE+++K KK+ E+ +K   +K   KKKK K K KK  KK  K +KK+ ++
Sbjct: 58  TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKE 117

Query: 979 MKKKTEEAEK 988
            + K E+  K
Sbjct: 118 AEDKLEDLTK 127



 Score = 52.8 bits (127), Expect = 9e-08
 Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 3/81 (3%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            E  +KKK+  +E EK++K+   ++  K KK   K KKK  +K   KK  KK  K +KK 
Sbjct: 58  TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSK-KKKDKDK--DKKDDKKDDKSEKKD 114

Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
           +K+ + K  +  K  +E    
Sbjct: 115 EKEAEDKLEDLTKSYSETLST 135



 Score = 46.6 bits (111), Expect = 1e-05
 Identities = 23/59 (38%), Positives = 37/59 (62%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             + KKKKK+  +  +K  ++  +K+K K KKKK KKKK K K K+ +K   K+E+ ++
Sbjct: 57  YTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115



 Score = 45.5 bits (108), Expect = 3e-05
 Identities = 22/95 (23%), Positives = 45/95 (47%), Gaps = 7/95 (7%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           ++K   +KKK   +K + K KK  + +   +KK  ++ + K +D  K   +    + + K
Sbjct: 81  KQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140

Query: 956 KKKKKKKK-------KKKKKKKKKKKEAEKMKKKT 983
            +K    K        +K++ +  KKEAE+++   
Sbjct: 141 PRKYALHKDIYQSRLDRKRRAEVAKKEAEELQNPG 175



 Score = 40.1 bits (94), Expect = 0.002
 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 6/89 (6%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           +K+K   K +K KKKK + +   KK   +  K +KKD ++ + K  E + K   +     
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKD--KKDDKKDDKSEKKDEKEAEDK-LEDLTKSYSETLSTL 136

Query: 963 KKKKKKKKKKKKEAEKM---KKKTEEAEK 988
            + K +K    K+  +    +K+  E  K
Sbjct: 137 SELKPRKYALHKDIYQSRLDRKRRAEVAK 165



 Score = 35.1 bits (81), Expect = 0.079
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 14/55 (25%)

Query: 948 AEKIGKKKKKK--------------KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           AE    KKKKK              +K+K K KKKK KKKK+ +K KK  ++ +K
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDK 109


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 55.1 bits (133), Expect = 2e-08
 Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 7/71 (9%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
            + + +EK     K +KEAE   ++K  +K KK+ K  +K KK       KK+K  + K 
Sbjct: 130 SESETSEK-ETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKD------KKEKMVEPKG 182

Query: 963 KKKKKKKKKKK 973
            KKKKKKKKKK
Sbjct: 183 SKKKKKKKKKK 193



 Score = 52.0 bits (125), Expect = 2e-07
 Identities = 22/66 (33%), Positives = 40/66 (60%), Gaps = 2/66 (3%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
            + + +EK    K   + + + ++ EK +KK  +++ K+KK+KK KK+K  + K  KKK+
Sbjct: 130 SESETSEKETTAK--VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKK 187

Query: 976 AEKMKK 981
            +K KK
Sbjct: 188 KKKKKK 193



 Score = 49.7 bits (119), Expect = 1e-06
 Identities = 25/63 (39%), Positives = 39/63 (61%), Gaps = 1/63 (1%)

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           + + +EK    K + E   ++  EK  KKKKK+ KK+KK+KK KK+K  E +  KKK ++
Sbjct: 131 ESETSEKETTAKVEKEAEVEE-EEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189

Query: 986 AEK 988
            +K
Sbjct: 190 KKK 192



 Score = 48.9 bits (117), Expect = 2e-06
 Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 10/68 (14%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           +K T  K +K AE   ++KK+KK+ ++++K+K  +K KK+K    K            KK
Sbjct: 136 EKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPK----------GSKK 185

Query: 957 KKKKKKKK 964
           KKKKKKKK
Sbjct: 186 KKKKKKKK 193



 Score = 47.4 bits (113), Expect = 7e-06
 Identities = 23/60 (38%), Positives = 39/60 (65%)

Query: 929 GAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           G+E    +K+   K++K+   +  +KK+KKKKK+ KK+KK+KK KKE     K +++ +K
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188



 Score = 43.9 bits (104), Expect = 1e-04
 Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 24/79 (30%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
              +  ++ +   E+ ++KKKKK+  ++ ++KK                         KK
Sbjct: 139 TTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKK------------------------DKK 174

Query: 956 KKKKKKKKKKKKKKKKKKK 974
           +K  + K  KKKKKKKKKK
Sbjct: 175 EKMVEPKGSKKKKKKKKKK 193


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 55.0 bits (133), Expect = 2e-08
 Identities = 38/100 (38%), Positives = 60/100 (60%), Gaps = 11/100 (11%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA---EKMK--KKKKDAEKMKKKGAEK 950
            KK+ ++KK    + ++K + +KK+ E++ + K A   EK K  K    AE +K K  EK
Sbjct: 83  LKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVKVKDDEK 142

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKE------AEKMKKKTE 984
           + KK  K+K+K+KKK KK+ K++KE      AE+ KK+ E
Sbjct: 143 LLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREE 182



 Score = 51.9 bits (125), Expect = 3e-07
 Identities = 35/96 (36%), Positives = 55/96 (57%), Gaps = 9/96 (9%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA---EKMK--KKKKDAEKMKKKGA 948
           L +KK+ ++   K   K  + +KKK E   + + K A   EK K  K    AE +K K  
Sbjct: 83  LKKKKKKKKTDYKQLLKKLEARKKKLEE--LDEDKAAEIEEKEKWTKALAKAEGVKVKDD 140

Query: 949 EKIGKK--KKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           EK+ KK  K+K+K+KKK KK+ K++K+K  +K  ++
Sbjct: 141 EKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAER 176



 Score = 51.2 bits (123), Expect = 5e-07
 Identities = 29/78 (37%), Positives = 44/78 (56%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
             +    K K  EK+ KK  K+KE +K + KK  ++ K+K +  +  ++K  E+  KK+K
Sbjct: 129 LAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRK 188

Query: 957 KKKKKKKKKKKKKKKKKK 974
             KK KKKKK KKK +  
Sbjct: 189 DDKKNKKKKKAKKKGRIL 206



 Score = 50.4 bits (121), Expect = 9e-07
 Identities = 37/111 (33%), Positives = 48/111 (43%), Gaps = 18/111 (16%)

Query: 896 RKKRTRRKKKKGAE-KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK-------G 947
           R+KR +RK +K  + K  KKK+  +++E    K    K KKK    E  +          
Sbjct: 12  RRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEF 71

Query: 948 AEKIGKKKKKKKKKKKKKKKK----------KKKKKKEAEKMKKKTEEAEK 988
           A+    KK  K KKKKKKKK            +KKK E     K  E  EK
Sbjct: 72  ADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEK 122



 Score = 44.6 bits (106), Expect = 7e-05
 Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 27/119 (22%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEA-EKMRKKKGAEKMKKKKKDAEK------------ 942
           R++R  RKK+K  E  +K+  +K EA E   ++  ++K     ++AE             
Sbjct: 15  REQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEFADG 74

Query: 943 -MKKKGAEKIGKKKKKKK---------KKKKKK----KKKKKKKKKEAEKMKKKTEEAE 987
              KK  +   KKKKKK          + +KKK     + K  + +E EK  K   +AE
Sbjct: 75  EQAKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAE 133



 Score = 42.3 bits (100), Expect = 5e-04
 Identities = 31/94 (32%), Positives = 48/94 (51%), Gaps = 14/94 (14%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           + KK  +  +KKKKKK +      K+  +K++ +KK  E++ +  A +I +K+K  K   
Sbjct: 76  QAKKDLKLKKKKKKKKTDY-----KQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALA 130

Query: 963 K---KKKK------KKKKKKKEAEKMKKKTEEAE 987
           K    K K      KK  K+KE +K K K E  E
Sbjct: 131 KAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKE 164



 Score = 34.6 bits (80), Expect = 0.15
 Identities = 12/40 (30%), Positives = 28/40 (70%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           E + +++++K++++K +KK+K+K+ K+ E  +K   E  K
Sbjct: 5   EALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVK 44



 Score = 34.2 bits (79), Expect = 0.20
 Identities = 21/57 (36%), Positives = 33/57 (57%), Gaps = 7/57 (12%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMR-----KKKGAEKMKKKKKDAEKMKKKG 947
           ++K+ ++ KK+  E+  K +KKK E +K R     K+K  +K KKKKK   K K + 
Sbjct: 151 KEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKK--AKKKGRI 205



 Score = 33.8 bits (78), Expect = 0.25
 Identities = 11/40 (27%), Positives = 28/40 (70%)

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
              + ++++K++++K +KK+K+K+ KK+ +  K + EE +
Sbjct: 5   EALLEQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVK 44



 Score = 32.3 bits (74), Expect = 0.96
 Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 5/54 (9%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKK-----KKKKKKEAEKMKKKTEEAEK 988
            ++ +K+   K  KK+K+K+ KKK+  +K      K ++ +++K     E AE 
Sbjct: 9   EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEG 62



 Score = 29.6 bits (67), Expect = 6.3
 Identities = 12/57 (21%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + +++K++  K +KK  +K  + KKK+  +K + ++ K ++ K  +K          
Sbjct: 9   EQRRRKREQRKARKK--QKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGN 63


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 56.9 bits (138), Expect = 3e-08
 Identities = 38/132 (28%), Positives = 56/132 (42%), Gaps = 29/132 (21%)

Query: 872 HVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGA-------------EKIR---KKK 915
           HV    +N D+ R A T     L R  RT R  +KG               KI    ++ 
Sbjct: 316 HV----INFDMPRSADT----YLHRIGRTGRAGRKGTAISLVEAHDHLLLGKIERYIEEP 367

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
            K +  +++R K  A   KK  K ++K+  K AEK  K+K+K K KK+ +  K   K++ 
Sbjct: 368 LKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRR- 426

Query: 976 AEKMKKKTEEAE 987
               K      E
Sbjct: 427 ----KPSGTSEE 434



 Score = 44.2 bits (105), Expect = 3e-04
 Identities = 15/44 (34%), Positives = 25/44 (56%)

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K  +EK   K  KK   K+ +KK+K+K+K + +K  + T+   K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGK 424



 Score = 33.0 bits (76), Expect = 0.84
 Identities = 15/34 (44%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K   +KK  K  KK   K AEK K+K +E  K
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEK-KEKEKEKPK 411


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 56.1 bits (136), Expect = 6e-08
 Identities = 38/135 (28%), Positives = 76/135 (56%), Gaps = 20/135 (14%)

Query: 864 NVATK--RLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKK--------KKGAEKIRK 913
           ++A K    LH +   V    RRE +  L  I+F+       +        ++  E +  
Sbjct: 350 SIAKKIAEKLHTSKRKV----RREVLPFLS-IIFKHNPELAARLAAFLELTEEEIEFLTG 404

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            KK  K+ +K+      EK +KK+++ +K KKK A    KK+++++++K+KK+++K++++
Sbjct: 405 SKKATKKIKKI-----VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459

Query: 974 KEAEKMKKKTEEAEK 988
           +EAE+ K++ EE +K
Sbjct: 460 EEAEEEKEEEEEKKK 474



 Score = 55.3 bits (134), Expect = 9e-08
 Identities = 26/71 (36%), Positives = 48/71 (67%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
             KK KK  EK  KK+  EK +KKKK     KK+  E+  K+KK+++K++++++ +++K+
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467

Query: 973 KKEAEKMKKKT 983
           ++E +K K+ T
Sbjct: 468 EEEEKKKKQAT 478



 Score = 55.3 bits (134), Expect = 1e-07
 Identities = 26/71 (36%), Positives = 50/71 (70%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
            +K  K  +KI +K +KK+E EK  KKK A   KKK+++ E+ K+K  E+  +++++ ++
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464

Query: 961 KKKKKKKKKKK 971
           +K+++++KKKK
Sbjct: 465 EKEEEEEKKKK 475



 Score = 54.5 bits (132), Expect = 2e-07
 Identities = 29/131 (22%), Positives = 61/131 (46%), Gaps = 35/131 (26%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKK--------------------KKKKKEAEKM-------- 924
           +L + K  R  +   A+KI +K                    K   + A ++        
Sbjct: 337 LLSKTKEKRETRDSIAKKIAEKLHTSKRKVRREVLPFLSIIFKHNPELAARLAAFLELTE 396

Query: 925 -------RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
                    KK  +K+KK  + AEK +++  ++  KK    KKK+++++++K+KK++E E
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456

Query: 978 KMKKKTEEAEK 988
           + +++ EE ++
Sbjct: 457 EEEEEAEEEKE 467



 Score = 54.2 bits (131), Expect = 2e-07
 Identities = 21/73 (28%), Positives = 46/73 (63%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           T  KK     K   +K +KK  E+ ++KK      KKK++ E+ +K+  E+  ++++++ 
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462

Query: 960 KKKKKKKKKKKKK 972
           +++K+++++KKKK
Sbjct: 463 EEEKEEEEEKKKK 475



 Score = 51.8 bits (125), Expect = 1e-06
 Identities = 19/72 (26%), Positives = 47/72 (65%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
              +  KK      + +KK+++E ++ +KK  A K K+++++ EK KK+  ++  +++ +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 958 KKKKKKKKKKKK 969
           ++K+++++KKKK
Sbjct: 464 EEKEEEEEKKKK 475



 Score = 51.8 bits (125), Expect = 1e-06
 Identities = 19/77 (24%), Positives = 49/77 (63%)

Query: 890 LGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
           + F+   KK T++ KK   +  +K++++KKE +K       ++ +++++  +K ++K  E
Sbjct: 399 IEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458

Query: 950 KIGKKKKKKKKKKKKKK 966
           +   +++K+++++KKKK
Sbjct: 459 EEEAEEEKEEEEEKKKK 475



 Score = 51.5 bits (124), Expect = 2e-06
 Identities = 20/76 (26%), Positives = 51/76 (67%), Gaps = 1/76 (1%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
            L   K+  +K KK  EK  KK++++K+ +K +   G +K ++++++ ++ K++  E+  
Sbjct: 401 FLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKK-EEEEEEEKEKKEEEKEEEE 459

Query: 953 KKKKKKKKKKKKKKKK 968
           ++ +++K+++++KKKK
Sbjct: 460 EEAEEEKEEEEEKKKK 475


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 52.5 bits (126), Expect = 7e-08
 Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 2/64 (3%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           +K++KE  K  ++K A K KK+KK  EK +KK AEK+ KKK  K  K   KK  KK   K
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKK--EKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153

Query: 975 EAEK 978
           + E 
Sbjct: 154 KEEG 157



 Score = 50.1 bits (120), Expect = 5e-07
 Identities = 26/64 (40%), Positives = 38/64 (59%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           +EK +K+  K+ E ++  K K  +K KK+KK AEK+ KK + K  K   KK  KK   KK
Sbjct: 95  SEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKK 154

Query: 968 KKKK 971
           ++ K
Sbjct: 155 EEGK 158



 Score = 44.8 bits (106), Expect = 4e-05
 Identities = 19/50 (38%), Positives = 32/50 (64%), Gaps = 4/50 (8%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           ++EK +K+ +    K+ ++K+  K KK+KK+KK+KK AEK+ KK      
Sbjct: 94  ESEKEQKEVS----KETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTT 139



 Score = 43.2 bits (102), Expect = 1e-04
 Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 2/68 (2%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
           E  +  +K+ +++ ++K+    K KK+  EK  KK  +K  KKK  K  K   KK  +K 
Sbjct: 93  EESEKEQKEVSKETEEKEA--IKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKT 150

Query: 980 KKKTEEAE 987
             K EE +
Sbjct: 151 TTKKEEGK 158



 Score = 42.4 bits (100), Expect = 2e-04
 Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           E+ +K++K+  K  ++  E I  KK+KK+KK+KK  +K  KKK          +  +K
Sbjct: 93  EESEKEQKEVSKETEE-KEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149



 Score = 37.0 bits (86), Expect = 0.015
 Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 2/57 (3%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKK--KKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K++K+ +++ ++K A K  K+KK+KK+KK  +K  KKK  K  +    K   +   K
Sbjct: 97  KEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153



 Score = 37.0 bits (86), Expect = 0.017
 Identities = 19/62 (30%), Positives = 32/62 (51%), Gaps = 3/62 (4%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK---KKKKKKKKKKKKKEAEKMKKKTEEA 986
             + ++K+   E  +K+  +   +KK+KK+KK   K  KKK  K  K   +K  KKT   
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153

Query: 987 EK 988
           ++
Sbjct: 154 KE 155



 Score = 35.9 bits (83), Expect = 0.039
 Identities = 21/54 (38%), Positives = 29/54 (53%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
            K T  K+   A+K +K+KK+KK AEK+ KKK  +  K   K A K      E+
Sbjct: 103 SKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEE 156


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 51.9 bits (125), Expect = 7e-08
 Identities = 24/74 (32%), Positives = 45/74 (60%), Gaps = 8/74 (10%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           KK   K+ K+  ++KE +K RK+ G     KK++ A    +   E+  ++ +K+K+ +K 
Sbjct: 65  KKHNAKVEKELLREKEKKKKRKRPG-----KKRRIA---LRLRRERTKERAEKEKRTRKN 116

Query: 965 KKKKKKKKKKEAEK 978
           ++KK K+++KE EK
Sbjct: 117 REKKFKRRQKEKEK 130



 Score = 43.8 bits (104), Expect = 4e-05
 Identities = 15/57 (26%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           KK   K++K+    ++ KKK   + GKK++   + ++++ K++ +K+K   K ++K 
Sbjct: 65  KKHNAKVEKELLREKEKKKKRK-RPGKKRRIALRLRRERTKERAEKEKRTRKNREKK 120



 Score = 43.0 bits (102), Expect = 7e-05
 Identities = 21/71 (29%), Positives = 42/71 (59%), Gaps = 5/71 (7%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           KK   K+ K+   EK KKKK+     K+ G ++    + ++++ K++ +K+K+ +K   +
Sbjct: 65  KKHNAKVEKELLREKEKKKKR-----KRPGKKRRIALRLRRERTKERAEKEKRTRKNREK 119

Query: 978 KMKKKTEEAEK 988
           K K++ +E EK
Sbjct: 120 KFKRRQKEKEK 130


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 54.8 bits (132), Expect = 1e-07
 Identities = 32/98 (32%), Positives = 37/98 (37%), Gaps = 7/98 (7%)

Query: 897 KKRTRRKKKKGAEKIRKKKKK--KKEAEKMRKKKGAEKMKKK-----KKDAEKMKKKGAE 949
           KK     KKK   +  KK     KK+AE     K A + KKK     KK A    KK A 
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
              K    K   + K   +K    K AEK       AE
Sbjct: 221 AEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258



 Score = 52.1 bits (125), Expect = 8e-07
 Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 1/94 (1%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +++   +++KK AE+  K+   K++  +    K A   K K +   K     A+K   + 
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEA 167

Query: 956 KKKKKKKKKKKKKKKKKKK-EAEKMKKKTEEAEK 988
           KKK + +  KK   + KKK EAE   K   EA+K
Sbjct: 168 KKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKK 201



 Score = 50.2 bits (120), Expect = 3e-06
 Identities = 31/86 (36%), Positives = 39/86 (45%), Gaps = 7/86 (8%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK-----KKGAEKIGKKKKKKKKKK 962
           A+K   + KKK EAE    KK A + KKK +     K     KK AE   KKK   + KK
Sbjct: 160 AKKAAAEAKKKAEAEA--AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKK 217

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K   + K    K A + K   E+A  
Sbjct: 218 KAAAEAKAAAAKAAAEAKAAAEKAAA 243



 Score = 49.4 bits (118), Expect = 6e-06
 Identities = 36/111 (32%), Positives = 47/111 (42%), Gaps = 19/111 (17%)

Query: 897 KKRTRRKKKKGAEKIRKKK----KKKKEAEKMR----KKKGAEKMKKK------KKDAEK 942
             +    K+K AE+   K     K K EAE  R     KK A + KKK      KK A +
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182

Query: 943 MKKKGAEKIGKK-----KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KKK   +   K     KKK + + KKK   + KKK  AE      + A +
Sbjct: 183 AKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAE 233



 Score = 49.0 bits (117), Expect = 8e-06
 Identities = 27/91 (29%), Positives = 42/91 (46%), Gaps = 4/91 (4%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMR----KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           R K+   E++  +++KK+  E  +    K+K AE+   K   A K K +   K      K
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAK 161

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K   + KKK + +  KK A + KKK E    
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAA 192



 Score = 47.5 bits (113), Expect = 2e-05
 Identities = 25/95 (26%), Positives = 38/95 (40%), Gaps = 3/95 (3%)

Query: 897 KKRTRRKKKKGAEKIRKKKK---KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            KR     KK A + +KK +    KK A + +KK  AE   K   +A+K  +  A+K   
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAA 212

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + KKK   + K    K   +     +K       
Sbjct: 213 AEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAA 247



 Score = 47.5 bits (113), Expect = 2e-05
 Identities = 32/95 (33%), Positives = 41/95 (43%), Gaps = 9/95 (9%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK-----KKKDAEKMKKKGAEKIGKKKKK 957
           K K  AE  R     KK A + +KK  AE  KK     KKK   +   K A +  KK + 
Sbjct: 146 KAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEA 205

Query: 958 KKKKKKKKKKKKK----KKKKEAEKMKKKTEEAEK 988
           + KKK   + KKK     K   A+   +    AEK
Sbjct: 206 EAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEK 240



 Score = 44.4 bits (105), Expect = 2e-04
 Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 7/93 (7%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAE----KMKKKKKDAEKMKKKGAEKIGKKKKKK 958
           ++K+ AE+ R K+ +K+      +KK AE    +   K+K AE+   K A     K + +
Sbjct: 93  QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152

Query: 959 KKK---KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K+     KK   + KKK EAE  KK   EA+K
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKK 185



 Score = 44.0 bits (104), Expect = 3e-04
 Identities = 32/99 (32%), Positives = 54/99 (54%), Gaps = 12/99 (12%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKK-KGAEKMKK--KKKDAEKMKKKGAEKIGKKKKKK 958
           +K  K AE+ RKKK++++  E  +K+    E++K+  K++ A + +KK AE+  K+   K
Sbjct: 71  QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130

Query: 959 KKKKKKKKKKKK---KKKKEAEKM------KKKTEEAEK 988
           +K+ ++   K     K K EAE        KK   EA+K
Sbjct: 131 QKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169



 Score = 30.5 bits (69), Expect = 4.5
 Identities = 16/61 (26%), Positives = 22/61 (36%), Gaps = 4/61 (6%)

Query: 896 RKKRTRRKKKKGAEKIRKK----KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
           +KK     KKK A + +KK     K          K  AEK    K   +    K A ++
Sbjct: 200 KKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259

Query: 952 G 952
            
Sbjct: 260 D 260


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 55.0 bits (133), Expect = 1e-07
 Identities = 17/91 (18%), Positives = 35/91 (38%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
            +++ K     E+I++  + KK+  +   +         K   +  +    +K  K   K
Sbjct: 25  AKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAK 84

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
               K   KKK K +   ++K +KK    + 
Sbjct: 85  AAAAKAPAKKKLKDELDSSKKAEKKNALDKD 115



 Score = 53.1 bits (128), Expect = 5e-07
 Identities = 20/95 (21%), Positives = 36/95 (37%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           L  ++  ++K KK A K + K    KE  K   +   +  ++  +    +     +    
Sbjct: 10  LAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDA 69

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +    KKK K   K    K  A+K  K   ++ K
Sbjct: 70  TESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104



 Score = 50.4 bits (121), Expect = 3e-06
 Identities = 20/109 (18%), Positives = 38/109 (34%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
           ++ +++          +   T+ + K+  E  +K  ++  +          +     + D
Sbjct: 14  EEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESD 73

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             K K K A K    K   KKK K +    KK +K+    K       K
Sbjct: 74  IPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVK 122



 Score = 47.3 bits (113), Expect = 3e-05
 Identities = 18/96 (18%), Positives = 34/96 (35%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
            L  KK+T  +  +    +    K   +A +    K   K   K   A+   KK  +   
Sbjct: 41  ALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDEL 100

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              KK +KK    K       K+ + + +  ++ + 
Sbjct: 101 DSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDD 136



 Score = 47.3 bits (113), Expect = 3e-05
 Identities = 15/92 (16%), Positives = 28/92 (30%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K+  +   +   +   +  +       M K           K   K   K A      KK
Sbjct: 35  KEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKK 94

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K +    KK +KK   + +      ++ + 
Sbjct: 95  KLKDELDSSKKAEKKNALDKDDDLNYVKDIDV 126



 Score = 45.0 bits (107), Expect = 2e-04
 Identities = 20/93 (21%), Positives = 34/93 (36%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            +     + KK  +K+  K K K    K   K+  E  KK  +  +++    +  +    
Sbjct: 8   AELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTD 67

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              +    KKK K   K   A+   KK  + E 
Sbjct: 68  DATESDIPKKKTKTAAKAAAAKAPAKKKLKDEL 100



 Score = 40.7 bits (96), Expect = 0.003
 Identities = 21/96 (21%), Positives = 32/96 (33%), Gaps = 20/96 (20%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK--------------- 957
           K +   +E  K + KK A K K K    ++  K+  E   K  ++               
Sbjct: 7   KAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDT 66

Query: 958 -----KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                    KKK K   K    +A   KK  +E + 
Sbjct: 67  DDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDS 102



 Score = 38.4 bits (90), Expect = 0.017
 Identities = 14/59 (23%), Positives = 24/59 (40%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
               K +    E+ KKK  +   K K K    K++ K+  + KKK  E++ +       
Sbjct: 3   TASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSG 61



 Score = 38.4 bits (90), Expect = 0.018
 Identities = 17/123 (13%), Positives = 34/123 (27%), Gaps = 3/123 (2%)

Query: 866 ATKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMR 925
            TK  +  A+        +     +      K      +    +K  K   K   A+   
Sbjct: 33  ITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPA 92

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           KKK  +++   KK     KK   +K       K      +         + +      ++
Sbjct: 93  KKKLKDELDSSKKAE---KKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDD 149

Query: 986 AEK 988
            + 
Sbjct: 150 DDD 152



 Score = 38.4 bits (90), Expect = 0.020
 Identities = 15/123 (12%), Positives = 29/123 (23%)

Query: 866 ATKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMR 925
             K    +    +      +           KK+T+   K  A K   KKK K E +  +
Sbjct: 45  KKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK 104

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K +    + K              +                        E +      ++
Sbjct: 105 KAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDD 164

Query: 986 AEK 988
            + 
Sbjct: 165 VDD 167



 Score = 35.7 bits (83), Expect = 0.11
 Identities = 18/52 (34%), Positives = 24/52 (46%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              + K +    E+  KK KK   K K K    K++ KEA + KKKT E   
Sbjct: 2   TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQID 53



 Score = 35.7 bits (83), Expect = 0.13
 Identities = 17/71 (23%), Positives = 27/71 (38%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
              + K       E  KK KK A K K KG     + K+  + KKK  ++  +     + 
Sbjct: 2   TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSG 61

Query: 978 KMKKKTEEAEK 988
            +K   +  E 
Sbjct: 62  MVKDTDDATES 72



 Score = 31.1 bits (71), Expect = 3.3
 Identities = 14/56 (25%), Positives = 24/56 (42%)

Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           M      AE   ++ A+K  KK   K K K    K++ K+  E++K   +  +   
Sbjct: 1   MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVL 56



 Score = 30.3 bits (69), Expect = 5.2
 Identities = 11/69 (15%), Positives = 26/69 (37%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
                 K + A + + KKK  +   K  ++    K++ K+  + KKK  ++  +      
Sbjct: 1   MTTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLS 60

Query: 980 KKKTEEAEK 988
               +  + 
Sbjct: 61  GMVKDTDDA 69



 Score = 30.0 bits (68), Expect = 7.3
 Identities = 14/99 (14%), Positives = 36/99 (36%), Gaps = 8/99 (8%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK--------MKKKGA 948
            K   +KK K      KK +KK   +K       + +    +  +         +     
Sbjct: 88  AKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDI 147

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           +     +   +         + ++KKEA++++K +++ +
Sbjct: 148 DDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDD 186


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 54.4 bits (131), Expect = 3e-07
 Identities = 31/110 (28%), Positives = 47/110 (42%), Gaps = 7/110 (6%)

Query: 881 DVRREAVT----ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK 936
           +VR EAV      + F L   +R  R   K A   R+K KK    +K  K++   K    
Sbjct: 707 EVRVEAVNMDERKIDFSLISSERAPRNVGKTA---REKAKKGDAGKKGGKRRQVGKKVNF 763

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           + D+    +K A+    KK  +K KK   K +K     +A++  KK    
Sbjct: 764 EPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813



 Score = 37.0 bits (86), Expect = 0.050
 Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 14/73 (19%)

Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK---------KKKKKKKKKKKKKE 975
           R  +   K  +     EK KK  A K G K+++  KK         + +KK K K  KK+
Sbjct: 729 RAPRNVGKTAR-----EKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKD 783

Query: 976 AEKMKKKTEEAEK 988
           A K KK + + +K
Sbjct: 784 ARKAKKPSAKTQK 796


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 53.8 bits (129), Expect = 3e-07
 Identities = 32/98 (32%), Positives = 55/98 (56%), Gaps = 5/98 (5%)

Query: 896 RKKRTRR-----KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
           RKK+  +     K K+ AE+ R K+ +K+  +   ++K AE+ +K+ +  +K +++ A K
Sbjct: 81  RKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARK 140

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              ++KKK +  K K   +  K K A + KKK EEA K
Sbjct: 141 AAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAK 178



 Score = 52.3 bits (125), Expect = 8e-07
 Identities = 26/93 (27%), Positives = 42/93 (45%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           + K     KKK  E  +  ++ K +AE    KK AE   K   +  K + +   K  KK 
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKA 221

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +   ++K   +KKK   K +A+K     + AE+
Sbjct: 222 EAAAEEKAAAEKKKAAAKAKADKAAAAAKAAER 254



 Score = 51.5 bits (123), Expect = 1e-06
 Identities = 29/102 (28%), Positives = 44/102 (43%), Gaps = 9/102 (8%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMR------KKKGAEKMKKKKKDAE---KMKKK 946
           + +  + K    A K++   + KK+AE+        K K      KKK +AE     +K 
Sbjct: 148 KAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKA 207

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            AE   K K +KK +   ++K   +KKK A K K     A  
Sbjct: 208 KAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAA 249



 Score = 48.8 bits (116), Expect = 9e-06
 Identities = 29/105 (27%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 897 KKRTRRKKKKGAEKIRK---KKKKKKEAEKMRKKKGAEKMK----------KKKKDAEKM 943
           +K+ + ++K+  E+ RK   ++KKK EA K +    A K+K          +  K AE+ 
Sbjct: 124 EKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEA 183

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K      KKK + + K   +K K + + K   + K +    EK
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEK 228



 Score = 48.4 bits (115), Expect = 1e-05
 Identities = 29/92 (31%), Positives = 43/92 (46%), Gaps = 5/92 (5%)

Query: 898 KRTRRKKKKGAEKIRKK---KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           K+   +  K AE+ + K      KK+AE   K   AEK K + +   K +KK AE   ++
Sbjct: 170 KKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAA-AEKAKAEAEAKAKAEKK-AEAAAEE 227

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K   +KKK   K K  K    A+  ++K   A
Sbjct: 228 KAAAEKKKAAAKAKADKAAAAAKAAERKAAAA 259



 Score = 46.5 bits (110), Expect = 4e-05
 Identities = 26/93 (27%), Positives = 41/93 (44%), Gaps = 1/93 (1%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +KK+    K K A +   K K   EA+K  ++      + K K      KK AE   K  
Sbjct: 145 QKKKAEAAKAKAAAE-AAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAA 203

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K K + + K K +KK + A + K   E+ + 
Sbjct: 204 AEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKA 236



 Score = 44.9 bits (106), Expect = 1e-04
 Identities = 24/87 (27%), Positives = 44/87 (50%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
            ++K+  E  ++ + ++K+ E+  +K  AE+ KK +    K   + A+     + KKK +
Sbjct: 115 EQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAE 174

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +  K  ++ K K EA   KKK E   K
Sbjct: 175 EAAKAAEEAKAKAEAAAAKKKAEAEAK 201



 Score = 44.6 bits (105), Expect = 2e-04
 Identities = 25/104 (24%), Positives = 52/104 (50%), Gaps = 11/104 (10%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK-----KKGAEK 950
           ++K+    +K+   + ++++++ ++A   +KKK      K   +A K+K     KK AE+
Sbjct: 116 QQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEE 175

Query: 951 IGKKKKKKKKK------KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             K  ++ K K      KKK + + K   ++A+   +   +AEK
Sbjct: 176 AAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEK 219



 Score = 43.0 bits (101), Expect = 5e-04
 Identities = 27/98 (27%), Positives = 54/98 (55%), Gaps = 5/98 (5%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-KKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           +K   +RKKK+       K K+  E E++++ +K   K ++++K AE+ +K+   +  ++
Sbjct: 75  KKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQ 134

Query: 955 KKKKKK----KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +++ +K    +KKK +  K K   EA K+K   E  +K
Sbjct: 135 EEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKK 172



 Score = 41.1 bits (96), Expect = 0.002
 Identities = 24/93 (25%), Positives = 58/93 (62%), Gaps = 3/93 (3%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           R +  +   KKG ++  +KKK+++ AE+++ K+ AE+ + K+ + E++K +  +K   ++
Sbjct: 66  RIQSQQSSAKKGEQQ--RKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQK-QAEE 122

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K+ + ++K+++++ +K  AE+ KK      K
Sbjct: 123 AEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAK 155



 Score = 38.8 bits (90), Expect = 0.012
 Identities = 18/64 (28%), Positives = 28/64 (43%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            KK+   + K  AEK + + + K +AEK  +    EK   +KK A    K        K 
Sbjct: 192 AKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKA 251

Query: 956 KKKK 959
            ++K
Sbjct: 252 AERK 255



 Score = 38.4 bits (89), Expect = 0.016
 Identities = 18/80 (22%), Positives = 50/80 (62%), Gaps = 2/80 (2%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA--EKIGKKKKKKKKKKKKKKKK 968
           + ++  + +  +   KK   ++ KK+++ AE++K K A  ++  K+ +K++ K ++++K+
Sbjct: 60  VVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQ 119

Query: 969 KKKKKKEAEKMKKKTEEAEK 988
            ++ +K+A+  +K+ EE  +
Sbjct: 120 AEEAEKQAQLEQKQQEEQAR 139


>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain.  This presumed domain is
           found at the C-terminus of a family of FtsJ-like
           methyltransferases. Members of this family are involved
           in 60S ribosomal biogenesis.
          Length = 212

 Score = 51.9 bits (125), Expect = 3e-07
 Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 7/84 (8%)

Query: 905 KKGAE-KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
           KK AE K RKK +  K  EK +KK  AE + + +  +E+ K K  +K+ KK KKKKKK+K
Sbjct: 111 KKVAEAKARKKMRALKRLEKAKKK--AEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRK 168

Query: 964 KK----KKKKKKKKKEAEKMKKKT 983
           K     KK  K K    + +K K 
Sbjct: 169 KTYVVAKKGNKGKAGRPKGVKGKY 192



 Score = 43.4 bits (103), Expect = 2e-04
 Identities = 30/94 (31%), Positives = 40/94 (42%), Gaps = 14/94 (14%)

Query: 902 RKKKKGAEKIRKKKKK---------KKEAEKMR-----KKKGAEKMKKKKKDAEKMKKKG 947
           RKK +  +++ K KKK           E EK +      KK  +K KK+KK     KK  
Sbjct: 119 RKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGN 178

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
             K G+ K  K K K    + KK  +    K KK
Sbjct: 179 KGKAGRPKGVKGKYKVVDPRMKKDLRALKRKAKK 212



 Score = 32.3 bits (74), Expect = 0.83
 Identities = 23/68 (33%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 900 TRRKKKKGAEKIRKK--KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           + R+K K  +K+ KK  KKKKK  +     K   K K  +    K K K  +   KK  +
Sbjct: 145 SEREKAKQIKKLYKKAKKKKKKRKKTYVVAKKGNKGKAGRPKGVKGKYKVVDPRMKKDLR 204

Query: 958 KKKKKKKK 965
             K+K KK
Sbjct: 205 ALKRKAKK 212


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 52.4 bits (126), Expect = 4e-07
 Identities = 28/85 (32%), Positives = 51/85 (60%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           R+K + A K R  K KK+ AE            ++    E+ KK+  E   KK K++++K
Sbjct: 139 RQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQK 198

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEA 986
           ++++++K++KK++E E+ K+K EEA
Sbjct: 199 REEERRKQRKKQQEEEERKQKAEEA 223



 Score = 47.8 bits (114), Expect = 1e-05
 Identities = 30/136 (22%), Positives = 64/136 (47%), Gaps = 29/136 (21%)

Query: 882 VRREAVTALGFILFRKKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAEK----MKK 935
           + R+A  A    L  K+  R+KK  K   EK +++++K++E  ++R++   EK     ++
Sbjct: 81  LERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQ 140

Query: 936 KKKDAEKMKK--------KGAEKIGKKKKKKKKK---------------KKKKKKKKKKK 972
           K + A K +         + A        K ++                KK K++++K++
Sbjct: 141 KAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKRE 200

Query: 973 KKEAEKMKKKTEEAEK 988
           ++  ++ KK+ EE E+
Sbjct: 201 EERRKQRKKQQEEEER 216



 Score = 38.9 bits (91), Expect = 0.008
 Identities = 16/73 (21%), Positives = 40/73 (54%), Gaps = 7/73 (9%)

Query: 897 KKRTRRKKKKGAEKIRKK-----KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
               + ++    E+ +K+      KK K+ ++ R+++  ++ KK+++  E+ +K+ AE+ 
Sbjct: 166 SGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQE--EEERKQKAEEA 223

Query: 952 GKKKKKKKKKKKK 964
            +K  K  K++ K
Sbjct: 224 WQKWMKNVKQRPK 236



 Score = 38.6 bits (90), Expect = 0.010
 Identities = 17/74 (22%), Positives = 42/74 (56%), Gaps = 1/74 (1%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK-KKKK 974
           +  KE +  R+ + A +     K A++ KK       K+K++++K++++ + +++  K+K
Sbjct: 74  RSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133

Query: 975 EAEKMKKKTEEAEK 988
             E  ++K ++A K
Sbjct: 134 YEEWCRQKAQQAAK 147



 Score = 37.0 bits (86), Expect = 0.034
 Identities = 21/91 (23%), Positives = 51/91 (56%), Gaps = 11/91 (12%)

Query: 896 RKKRTRRKKKKGAEK-----IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
            K+RT + KK+ AE          K ++  +++  KK+  E   KK K  ++ +++    
Sbjct: 146 AKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREE---- 201

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
             +++K++KK+++++++K+K ++   + MK 
Sbjct: 202 --ERRKQRKKQQEEEERKQKAEEAWQKWMKN 230



 Score = 35.1 bits (81), Expect = 0.14
 Identities = 19/80 (23%), Positives = 45/80 (56%), Gaps = 4/80 (5%)

Query: 913 KKKKKKKEA-EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK---K 968
           K +++ +EA E     K A++ KK +K  E+ +K+  EK  ++ + +++  K+K +   +
Sbjct: 80  KLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCR 139

Query: 969 KKKKKKEAEKMKKKTEEAEK 988
           +K ++   ++  K  +EA +
Sbjct: 140 QKAQQAAKQRTPKHKKEAAE 159


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 52.0 bits (125), Expect = 8e-07
 Identities = 27/83 (32%), Positives = 52/83 (62%), Gaps = 7/83 (8%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK---KKGAEKI 951
              K  R K+K  A+  R K+K++KE E+  K++  +++KKK     ++K   K+ A+K 
Sbjct: 268 PINKPVRPKRKTKAQ--RNKEKRRKELEREAKEE--KQLKKKLAQLARLKEIAKEVAQKE 323

Query: 952 GKKKKKKKKKKKKKKKKKKKKKK 974
             + +KK+++K++ +KKK K++K
Sbjct: 324 KARARKKEQRKERGEKKKLKRRK 346



 Score = 45.1 bits (107), Expect = 1e-04
 Identities = 21/103 (20%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---------------MKK 945
             KK++  E++ +KK +K   E  R  + +E + ++  D  +                + 
Sbjct: 209 AEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEP 268

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                  K+K K ++ K+K++K+ +++ KE +++KKK  +  +
Sbjct: 269 INKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLAR 311



 Score = 40.5 bits (95), Expect = 0.004
 Identities = 19/83 (22%), Positives = 45/83 (54%), Gaps = 3/83 (3%)

Query: 907 GAEKIRKKK-KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
                +  + K+K +A++ ++K+   K  +++   EK  KK   ++ + K+  K+  +K+
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKR--RKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKE 323

Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
           K + +KK++  E+ +KK  +  K
Sbjct: 324 KARARKKEQRKERGEKKKLKRRK 346



 Score = 33.5 bits (77), Expect = 0.59
 Identities = 21/78 (26%), Positives = 36/78 (46%), Gaps = 12/78 (15%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK---------KKKKKK 970
           E E + K     + K+K K     +K+  E   + K++K+ KKK           K+  +
Sbjct: 265 EYEPINKPV---RPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQ 321

Query: 971 KKKKEAEKMKKKTEEAEK 988
           K+K  A K +++ E  EK
Sbjct: 322 KEKARARKKEQRKERGEK 339



 Score = 30.8 bits (70), Expect = 3.6
 Identities = 32/120 (26%), Positives = 49/120 (40%), Gaps = 17/120 (14%)

Query: 872 HVAVSDVN---DDVRREAVTALGFILFRKKR-------TRRKKKKGAEKIRKKKKKK--- 918
           ++ ++DV    ++ R E +T  G +L  K         T   +    +K RKK KK    
Sbjct: 11  NIDITDVEQFLEEQRDERIT--GGVLAEKPDEDLFVVDTTGDEATIKKKRRKKLKKPLKS 68

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
            E  + R K  A  +KK+     K  KK   K   +K+  + KK  + KK  K K     
Sbjct: 69  DEILENRSKVPAPSVKKRHVK--KKSKKKKSKQVSRKELLRLKKLARGKKGGKAKLLNAS 126


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 50.8 bits (122), Expect = 1e-06
 Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 2/74 (2%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKK--KDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           K   K     E + + K     K KK++  +D E  + +  +K   K++ KKKK K+ K 
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268

Query: 968 KKKKKKKEAEKMKK 981
            K  KK  A+ MKK
Sbjct: 269 VKALKKVVAKGMKK 282



 Score = 48.1 bits (115), Expect = 1e-05
 Identities = 21/74 (28%), Positives = 37/74 (50%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           +++ K     +     +  K ++K K++ ++  +  +  AEK  K K++ KKKK K+ K 
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268

Query: 969 KKKKKKEAEKMKKK 982
            K  KK   K  KK
Sbjct: 269 VKALKKVVAKGMKK 282



 Score = 44.6 bits (106), Expect = 1e-04
 Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 3/76 (3%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
           +   K     E     K  K+  +K R+ +   +  + +    + K+K  E+I KKK K+
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRA---EKKRKSKEEIKKKKPKE 265

Query: 959 KKKKKKKKKKKKKKKK 974
            K  K  KK   K  K
Sbjct: 266 SKGVKALKKVVAKGMK 281



 Score = 42.7 bits (101), Expect = 5e-04
 Identities = 24/99 (24%), Positives = 42/99 (42%), Gaps = 16/99 (16%)

Query: 875 VSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK 934
            S +  D+ +E + +L    F+      K+     K +K++ ++             + +
Sbjct: 200 CSYLPPDLYKELLKSLLIPEFKPLDKYLKE----SKKKKRETEEDVEAAE------SRAE 249

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           KK+K  E++KKK      K K+ K  K  KK   K  KK
Sbjct: 250 KKRKSKEEIKKK------KPKESKGVKALKKVVAKGMKK 282



 Score = 35.0 bits (81), Expect = 0.16
 Identities = 17/57 (29%), Positives = 34/57 (59%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + K  K+++K K++  E +   + + +KK+K K++ KKKK KE++ +K   +   K
Sbjct: 222 PLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAK 278


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 51.3 bits (123), Expect = 2e-06
 Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 3/92 (3%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
              +  + K      +K KK+ K  E  +K +  +K KK+KK+    K +G  K+G K  
Sbjct: 40  STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEP---KSEGETKLGFKTP 96

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK KK KKK  K K  +       K  E AEK
Sbjct: 97  KKSKKTKKKPPKPKPNEDVDNAFNKIAELAEK 128



 Score = 50.5 bits (121), Expect = 3e-06
 Identities = 25/84 (29%), Positives = 36/84 (42%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
            K       +++ K      +K K  +K  + KK +EK KKK  EK   K + + K   K
Sbjct: 35  SKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFK 94

Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
             KK KK KK+  K K   +    
Sbjct: 95  TPKKSKKTKKKPPKPKPNEDVDNA 118



 Score = 43.6 bits (103), Expect = 4e-04
 Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 6/90 (6%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI-GKKKKKK 958
            + K +  +          KE           + + K       K K  +K    KKK +
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTF-----SEEENKVATTSTKKDKKEDKNNESKKKSE 71

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KKKKKKK+KK+ K + E +   K  ++++K
Sbjct: 72  KKKKKKKEKKEPKSEGETKLGFKTPKKSKK 101



 Score = 39.7 bits (93), Expect = 0.006
 Identities = 21/81 (25%), Positives = 31/81 (38%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
                +K K +  +          K        E+ K         KK+ K  + KKK +
Sbjct: 12  FSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSE 71

Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
           KKKKKKKE ++ K + E    
Sbjct: 72  KKKKKKKEKKEPKSEGETKLG 92



 Score = 36.2 bits (84), Expect = 0.090
 Identities = 19/72 (26%), Positives = 29/72 (40%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K    KKK   +K +KK+KK+ ++E   K       K KK   +  K K  E +     
Sbjct: 61  DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFN 120

Query: 957 KKKKKKKKKKKK 968
           K  +  +K    
Sbjct: 121 KIAELAEKSNVY 132



 Score = 34.3 bits (79), Expect = 0.28
 Identities = 18/69 (26%), Positives = 28/69 (40%), Gaps = 3/69 (4%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +KK  ++KKKK   K +K+ K + E +   K     K  KKK    K  +       K  
Sbjct: 67  KKKSEKKKKKK---KEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIA 123

Query: 956 KKKKKKKKK 964
           +  +K    
Sbjct: 124 ELAEKSNVY 132


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 49.7 bits (119), Expect = 2e-06
 Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 17/109 (15%)

Query: 894 LFRKKRTRRKKKK---GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
           L R++R ++KK+      ++  KKKKKK        K  A + KKK +            
Sbjct: 78  LQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSP 137

Query: 951 IGK--------------KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
             K              ++ K+++ ++KK + K +K+KE +K K+ T+E
Sbjct: 138 RRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQE 186



 Score = 49.3 bits (118), Expect = 3e-06
 Identities = 30/115 (26%), Positives = 51/115 (44%), Gaps = 25/115 (21%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK------------- 942
           R K+ +R K K  ++  KKKKKK        K  A + KKK +                 
Sbjct: 83  RLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSS 142

Query: 943 ----MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE------AEKMK--KKTEE 985
               ++ K A     K+++ ++KK + K +K+K+KK+       E++   K+TE 
Sbjct: 143 RSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETER 197



 Score = 46.2 bits (110), Expect = 3e-05
 Identities = 24/102 (23%), Positives = 39/102 (38%), Gaps = 20/102 (19%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
            +K  ++  + KKKK+   K  K+      KKKKKD    K   A     KKK ++    
Sbjct: 74  GEKELQREERLKKKKRVKTKAYKEP---TKKKKKKDPTAAKSPKAAAPRPKKKSERISWA 130

Query: 964 -----------------KKKKKKKKKKKEAEKMKKKTEEAEK 988
                            + K+   ++ KE E  +KK +   +
Sbjct: 131 PTLLDSPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKAR 172



 Score = 42.0 bits (99), Expect = 6e-04
 Identities = 17/66 (25%), Positives = 28/66 (42%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
           E E  ++ +  E++KKKK+   K  K+  +K  KK     K  K    + KKK +     
Sbjct: 71  EEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWA 130

Query: 980 KKKTEE 985
               + 
Sbjct: 131 PTLLDS 136



 Score = 38.1 bits (89), Expect = 0.014
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKK-KKKKKKKKKKEAEKMKKKTEEAEK 988
           D E+  +K  ++  + KKKK+ K K  K+  KKKKKK+    K     A +
Sbjct: 69  DDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPR 119


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 51.1 bits (123), Expect = 2e-06
 Identities = 21/54 (38%), Positives = 39/54 (72%), Gaps = 1/54 (1%)

Query: 930 AEKMKKK-KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
            +++K++  K  E++K+K  +   KK+++KK +K+KKKKK+KKK K+ +K  +K
Sbjct: 361 GDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 46.1 bits (110), Expect = 6e-05
 Identities = 24/65 (36%), Positives = 40/65 (61%), Gaps = 11/65 (16%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            ++++++  K+ E     K+K  +  KKK++  EK  +K      +KKKKK+KKK KK+K
Sbjct: 361 GDELKEELNKRIE---EIKEKYPKPPKKKRE--EKKPQK------RKKKKKRKKKGKKRK 409

Query: 968 KKKKK 972
           KK +K
Sbjct: 410 KKGRK 414



 Score = 43.8 bits (104), Expect = 4e-04
 Identities = 18/48 (37%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           E++ K+  E I +K  K  KKK+++KK +K+KKK+  K K K  + + 
Sbjct: 366 EELNKRIEE-IKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKG 412



 Score = 40.3 bits (95), Expect = 0.004
 Identities = 23/78 (29%), Positives = 34/78 (43%), Gaps = 32/78 (41%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            KR    K+K  +  +KK+++KK                                 +K+K
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKP--------------------------------QKRK 396

Query: 957 KKKKKKKKKKKKKKKKKK 974
           KKKK+KKK KK+KKK +K
Sbjct: 397 KKKKRKKKGKKRKKKGRK 414



 Score = 33.0 bits (76), Expect = 0.75
 Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 1/32 (3%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK 927
           +KKR  +K +K  +K +K+KKK K+ +K  +K
Sbjct: 384 KKKREEKKPQKRKKK-KKRKKKGKKRKKKGRK 414



 Score = 32.2 bits (74), Expect = 1.2
 Identities = 19/43 (44%), Positives = 28/43 (65%), Gaps = 4/43 (9%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
           ++K  +  KKK  EK  +K+KKKK+    RKKKG ++ KK +K
Sbjct: 376 KEKYPKPPKKKREEKKPQKRKKKKK----RKKKGKKRKKKGRK 414


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 48.0 bits (114), Expect = 2e-06
 Identities = 31/93 (33%), Positives = 54/93 (58%), Gaps = 1/93 (1%)

Query: 896 RKKRTRRKKKKGAEK-IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
            KK TR  K K  ++ + KK + KKE  K+ +K+G    +K+  + +    K   K  KK
Sbjct: 12  GKKFTREYKVKEIQRNLTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKK 71

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           KK  +KK+  K++K+++++KE  K +K+ E+ E
Sbjct: 72  KKLDEKKEIAKQRKREQREKELAKRQKELEKIE 104



 Score = 40.6 bits (95), Expect = 7e-04
 Identities = 28/97 (28%), Positives = 60/97 (61%), Gaps = 7/97 (7%)

Query: 894 LFRKKRTRRK-----KKKGAEKIRKKKKKK--KEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           L +K R +++     +K+G     K+  +K  K +++ RK +  +K+ +KK+ A++ K++
Sbjct: 28  LTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKRE 87

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
             EK   K++K+ +K +  KKK+K++++  +K+ KKT
Sbjct: 88  QREKELAKRQKELEKIELSKKKQKERERRRKKLTKKT 124



 Score = 40.6 bits (95), Expect = 9e-04
 Identities = 24/75 (32%), Positives = 48/75 (64%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           K +  K+ K +++ RK +KKKK  EK    K  ++ +++K+ A++ K+    ++ KKK+K
Sbjct: 52  KESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQK 111

Query: 958 KKKKKKKKKKKKKKK 972
           ++++++KK  KK K 
Sbjct: 112 ERERRRKKLTKKTKS 126



 Score = 37.9 bits (88), Expect = 0.006
 Identities = 22/92 (23%), Positives = 51/92 (55%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            +  ++  KK   + + K+ ++   +K R KK   K+ +K+  A   K+   +++   K+
Sbjct: 5   DQNQKKNGKKFTREYKVKEIQRNLTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKE 64

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K +KKKK  +KK+  K+ ++ +++ E A++
Sbjct: 65  DRKFEKKKKLDEKKEIAKQRKREQREKELAKR 96



 Score = 36.8 bits (85), Expect = 0.017
 Identities = 24/89 (26%), Positives = 48/89 (53%), Gaps = 11/89 (12%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K     +K +  +K+ +KK+  K+    RK     + +++K+ A++ K+   EKI   KK
Sbjct: 60  KSSKEDRKFEKKKKLDEKKEIAKQ----RK-----REQREKELAKRQKEL--EKIELSKK 108

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K+K++++++KK  KK K     M  +  +
Sbjct: 109 KQKERERRRKKLTKKTKSGQPLMGPRIND 137



 Score = 29.9 bits (67), Expect = 3.8
 Identities = 15/50 (30%), Positives = 32/50 (64%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK 945
           +K+  +++K++  EK   K++K+ E  ++ KKK  E+ +++KK  +K K 
Sbjct: 77  KKEIAKQRKREQREKELAKRQKELEKIELSKKKQKERERRRKKLTKKTKS 126


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 48.6 bits (116), Expect = 3e-06
 Identities = 24/78 (30%), Positives = 34/78 (43%), Gaps = 7/78 (8%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           FR KR  R      E  + K  K K   ++  K      + K  +  K+  K       K
Sbjct: 7   FRAKRVVRAL----ESEKYKANKDKGNPEIYNK---INSQDKAIEKFKLLIKAQMAERVK 59

Query: 955 KKKKKKKKKKKKKKKKKK 972
           K   ++KK++KKK KKKK
Sbjct: 60  KLHSQEKKEEKKKPKKKK 77



 Score = 39.0 bits (91), Expect = 0.005
 Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 7/69 (10%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           L  +K    K K   E   K   + K  EK        K+  K + AE++KK  +++  +
Sbjct: 16  LESEKYKANKDKGNPEIYNKINSQDKAIEK-------FKLLIKAQMAERVKKLHSQEKKE 68

Query: 954 KKKKKKKKK 962
           +KKK KKKK
Sbjct: 69  EKKKPKKKK 77



 Score = 32.5 bits (74), Expect = 0.86
 Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 1/60 (1%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK-TEEAEK 988
           A+++ +  +  +    K         K   + K  +K K   K + AE++KK  ++E ++
Sbjct: 9   AKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKE 68



 Score = 32.1 bits (73), Expect = 0.95
 Identities = 13/55 (23%), Positives = 24/55 (43%), Gaps = 3/55 (5%)

Query: 936 KKKD---AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K++D   A+++ +    +  K  K K   +   K   + K  E  K+  K + AE
Sbjct: 2   KRQDFFRAKRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAE 56



 Score = 31.3 bits (71), Expect = 1.7
 Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 1/64 (1%)

Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           R K+    ++ +K  A K K    E   K   + K  +K K   K +  +  +K+  + +
Sbjct: 8   RAKRVVRALESEKYKANKDKGN-PEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEK 66

Query: 985 EAEK 988
           + EK
Sbjct: 67  KEEK 70



 Score = 31.3 bits (71), Expect = 1.8
 Identities = 13/65 (20%), Positives = 28/65 (43%)

Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           +R  +  +    K K   ++  K   +    +K K   K +  ++ KK   + +K +KK 
Sbjct: 13  VRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKK 72

Query: 984 EEAEK 988
            + +K
Sbjct: 73  PKKKK 77



 Score = 29.4 bits (66), Expect = 7.0
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEA 976
           K  K   KK KK KKKKKKK +
Sbjct: 196 KGDKSSAKKDKKSKKKKKKKRS 217



 Score = 29.4 bits (66), Expect = 8.2
 Identities = 13/23 (56%), Positives = 14/23 (60%)

Query: 952 GKKKKKKKKKKKKKKKKKKKKKK 974
            K  K   KK KK KKKKKKK+ 
Sbjct: 195 YKGDKSSAKKDKKSKKKKKKKRS 217



 Score = 29.0 bits (65), Expect = 9.3
 Identities = 16/68 (23%), Positives = 29/68 (42%), Gaps = 3/68 (4%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           K+  +  E  + K   +K   +  +    + K    I K K   K +  ++ KK   ++K
Sbjct: 10  KRVVRALESEKYKANKDKGNPEIYNKINSQDK---AIEKFKLLIKAQMAERVKKLHSQEK 66

Query: 975 EAEKMKKK 982
           + EK K K
Sbjct: 67  KEEKKKPK 74


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 50.4 bits (120), Expect = 4e-06
 Identities = 24/90 (26%), Positives = 43/90 (47%), Gaps = 5/90 (5%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
            +  A+K R + ++K++  K   K       K+ K   + +K+  EK   + KK  ++  
Sbjct: 239 AQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEAL 298

Query: 964 KKKKKK----KKKKKEAEKM-KKKTEEAEK 988
           K K  K    K++ K +EK  + K  EA+K
Sbjct: 299 KAKDHKAFDLKQESKASEKEAEDKELEAQK 328



 Score = 42.3 bits (99), Expect = 0.001
 Identities = 19/82 (23%), Positives = 36/82 (43%), Gaps = 1/82 (1%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM-KKKGAEKIGKKK 955
            K+    +K+  EK + + KK  E     K   A  +K++ K +EK  + K  E   K++
Sbjct: 272 DKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331

Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
              +  +K K + + +     E
Sbjct: 332 PVAEDLQKTKPQVEAQPTSLNE 353



 Score = 41.9 bits (98), Expect = 0.001
 Identities = 23/101 (22%), Positives = 52/101 (51%), Gaps = 9/101 (8%)

Query: 896 RKKRTRRKKKKGAEKIRKK--------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
           +++   R+K++ A+ + K          K+  E +K   +K   ++KK  ++A K K   
Sbjct: 245 KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHK 304

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKE-AEKMKKKTEEAE 987
           A  + ++ K  +K+ + K+ + +KK++  AE ++K   + E
Sbjct: 305 AFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVE 345



 Score = 39.6 bits (92), Expect = 0.008
 Identities = 24/115 (20%), Positives = 54/115 (46%), Gaps = 21/115 (18%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKM------------RKKKGAEKMKK----KKK 938
           FR+  T  K+++  E  ++ ++ K+E +K               +  A+K +     K++
Sbjct: 196 FRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQ 255

Query: 939 DAEKMKKKGAEKIGKKKKK----KKKKKKKKKKKKKKKKKEAEKMK-KKTEEAEK 988
           +A+ + K       K+ K+    +K++ +K + + KK  +EA K K  K  + ++
Sbjct: 256 EAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQ 310



 Score = 36.1 bits (83), Expect = 0.087
 Identities = 18/85 (21%), Positives = 39/85 (45%), Gaps = 2/85 (2%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
            K+    + R+ +K + E +K    + A K K  K    K + K +EK  + K+ + +KK
Sbjct: 272 DKQVAENQKREIEKAQIEIKK--NDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKK 329

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAE 987
           ++   +  +K K   + +  +   +
Sbjct: 330 REPVAEDLQKTKPQVEAQPTSLNED 354



 Score = 35.4 bits (81), Expect = 0.17
 Identities = 15/78 (19%), Positives = 40/78 (51%), Gaps = 9/78 (11%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA- 976
           KK  E +R+    EK    ++D   +K++ +++  K+ ++ K++  KK+    K +++A 
Sbjct: 180 KKVVEALREDN--EKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKAD 237

Query: 977 ------EKMKKKTEEAEK 988
                 +K + +  + ++
Sbjct: 238 FAQDNADKQRDEVRQKQQ 255


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 47.7 bits (114), Expect = 4e-06
 Identities = 24/79 (30%), Positives = 41/79 (51%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +  ++R K  K  ++ R+  K+ K+ EK   K+G +    KK + +K+  K      KK 
Sbjct: 89  QSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKS 148

Query: 956 KKKKKKKKKKKKKKKKKKK 974
           K+  K  +KK+KK   K+K
Sbjct: 149 KQLDKALEKKRKKNAGKEK 167



 Score = 47.7 bits (114), Expect = 4e-06
 Identities = 26/77 (33%), Positives = 40/77 (51%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K R +  K K  E+   K+ KK+E E +++ K    +KK +     +KKK  E    K+ 
Sbjct: 92  KSRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQL 151

Query: 957 KKKKKKKKKKKKKKKKK 973
            K  +KK+KK   K+KK
Sbjct: 152 DKALEKKRKKNAGKEKK 168



 Score = 46.9 bits (112), Expect = 9e-06
 Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K+  +  K +   K  K K +++E  K  KK+  E +K+ KK    +KK   +K+  KKK
Sbjct: 85  KRTLQSMKSR--LKTLKNKDREREILKEHKKQEKELIKEGKK-PYYLKKSEIKKLVLKKK 141

Query: 957 KKKKKKKKKKKK--KKKKKKEAEKMKK 981
             + KK K+  K  +KK+KK A K KK
Sbjct: 142 FDELKKSKQLDKALEKKRKKNAGKEKK 168



 Score = 46.5 bits (111), Expect = 1e-05
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 6/89 (6%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRK-----KKGAEKMKKKKKDAEKMKK-KGAEKIGK 953
            R K+ +  EK  KK K  +E E++++     K   + +K K ++ E +K+ K  EK   
Sbjct: 60  YREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELI 119

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           K+ KK    KK + KK   KK+ +++KK 
Sbjct: 120 KEGKKPYYLKKSEIKKLVLKKKFDELKKS 148



 Score = 45.3 bits (108), Expect = 3e-05
 Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 2/86 (2%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           +K  E++ K  KK K++E+  + K   +  K +    K K +  E + + KK++K+  K+
Sbjct: 62  EKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKE 121

Query: 965 KKKKKKKKKKEAEKM--KKKTEEAEK 988
            KK    KK E +K+  KKK +E +K
Sbjct: 122 GKKPYYLKKSEIKKLVLKKKFDELKK 147



 Score = 43.0 bits (102), Expect = 2e-04
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 11/109 (10%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKK--KKKKKEAEKMRKKKGAEKMKKKK 937
           DD R + +  L      K   + K  +  E++++     K +      K +  E +K+ K
Sbjct: 58  DDYREKEIEEL-----EKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHK 112

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKK----KKKKKKKKEAEKMKKK 982
           K  +++ K+G +    KK + KK   KKK    KK K+  K  EK +KK
Sbjct: 113 KQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKK 161



 Score = 41.1 bits (97), Expect = 7e-04
 Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 11/72 (15%)

Query: 896 RKKRTRRK-KKKGAEKIRKKKK----KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
           R++   ++ KK+  E I++ KK    KK E +K+  KK  +++KK K+  + ++KK    
Sbjct: 103 REREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKK---- 158

Query: 951 IGKKKKKKKKKK 962
             +KK   K+KK
Sbjct: 159 --RKKNAGKEKK 168



 Score = 29.9 bits (68), Expect = 3.6
 Identities = 22/100 (22%), Positives = 44/100 (44%), Gaps = 14/100 (14%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK--------KKKDAEKMKKKGA 948
           KK   RK+     + +K K +    + +  K   +K +K        ++K+ E+++K   
Sbjct: 14  KKPVSRKRDIIEVEKKKPKVRDPRFDSLSGKLNLDKFRKNYKFLDDYREKEIEELEKA-- 71

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                KK K  ++K++ K+  +  K   + +K K  E E 
Sbjct: 72  ----LKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREI 107


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 49.6 bits (119), Expect = 5e-06
 Identities = 20/77 (25%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
            L  +    +   RK  K  E+  +K  K  E E+  + +  ++ KKK++   K+ K   
Sbjct: 247 KLANLKLSPE-VLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSP 305

Query: 949 EKIGKKKKKKKKKKKKK 965
           E+  K ++K++KK+ +K
Sbjct: 306 EEQRKLEEKERKKQARK 322



 Score = 46.9 bits (112), Expect = 3e-05
 Identities = 19/80 (23%), Positives = 41/80 (51%), Gaps = 16/80 (20%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK- 968
           K+  +  +K   +K R++   E+   K  + E+      E+  +KK++KKK++++ K   
Sbjct: 252 KLSPEVLRK--VDKTREE--EEEKILKAAEEER-----QEEAQEKKEEKKKEEREAKLAK 302

Query: 969 ------KKKKKKEAEKMKKK 982
                 +K ++KE +K  +K
Sbjct: 303 LSPEEQRKLEEKERKKQARK 322



 Score = 41.9 bits (99), Expect = 0.001
 Identities = 13/55 (23%), Positives = 36/55 (65%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +  +  +K +++  EKI K  +++++++ ++KK++KKK++   K+ K + E ++
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQR 309



 Score = 40.7 bits (96), Expect = 0.003
 Identities = 17/63 (26%), Positives = 40/63 (63%)

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
             +   K+ K +++ E+   K AE+  +++ ++KK++KKK++++ K  K + + ++K EE
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313

Query: 986 AEK 988
            E+
Sbjct: 314 KER 316


>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region.  Remorins are
           plant-specific plasma membrane-associated proteins. In
           tobacco remorin co-purifies with lipid rafts. Most
           remorins have a variable, proline-rich C-half and a more
           conserved N-half that is predicted to form coiled coils.
           Consistent with this, circular dichroism studies have
           demonstrated that much of the protein is alpha-helical.
           Remorins exist in plasma membrane preparations as
           oligomeric structures and form filaments in vitro. The
           proteins can bind polyanions including the extracellular
           matrix component oligogalacturonic acid (OGA). In vitro,
           remorin in plasma membrane preparations is
           phosphorylated (principally on threonine residues) in
           the presence of OGA and thus co-purifies with a protein
           kinases(s). The biological functions of remorins are
           unknown but roles as components of the
           membrane/cytoskeleton are possible.
          Length = 112

 Score = 46.2 bits (110), Expect = 5e-06
 Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 6/84 (7%)

Query: 909 EKIRKKKKKKKEA----EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
            KI  K ++++      E  +K K   ++KK ++  EK K + AEK+  K K     KK 
Sbjct: 19  AKINNKYQREEAKIQAWENKKKAKAEAELKKIERKLEKKKAEAAEKL--KNKLAAAHKKA 76

Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
           ++K+   + K  E+  K  E+A K
Sbjct: 77  EEKRATAEAKRGEEEAKAEEKAAK 100



 Score = 36.5 bits (85), Expect = 0.010
 Identities = 21/73 (28%), Positives = 34/73 (46%), Gaps = 4/73 (5%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           +K K   + ++   K  A + KKK K   ++KK       K +KKK +  +K K K    
Sbjct: 17  EKAKINNKYQREEAKIQAWENKKKAKAEAELKKIER----KLEKKKAEAAEKLKNKLAAA 72

Query: 973 KKEAEKMKKKTEE 985
            K+AE+ +   E 
Sbjct: 73  HKKAEEKRATAEA 85



 Score = 33.5 bits (77), Expect = 0.13
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 4/50 (8%)

Query: 902 RKKKKGAEKIRKKKKK-KKEAEKMRK---KKGAEKMKKKKKDAEKMKKKG 947
           +KK + AEK++ K     K+AE+ R     K  E+  K ++ A K++  G
Sbjct: 56  KKKAEAAEKLKNKLAAAHKKAEEKRATAEAKRGEEEAKAEEKAAKIRATG 105


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 48.5 bits (116), Expect = 7e-06
 Identities = 16/77 (20%), Positives = 30/77 (38%)

Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
             E    K + ++E  K +KKK  +K KK        +   +  +  +      +  + K
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225

Query: 967 KKKKKKKKEAEKMKKKT 983
           K KKK+    +  K + 
Sbjct: 226 KLKKKRSIAPDNEKSEV 242



 Score = 45.8 bits (109), Expect = 5e-05
 Identities = 18/71 (25%), Positives = 30/71 (42%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           ++E E ++ +   E+ KKKKK  +K  KK        +         +      +  EA+
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225

Query: 978 KMKKKTEEAEK 988
           K+KKK   A  
Sbjct: 226 KLKKKRSIAPD 236



 Score = 44.3 bits (105), Expect = 1e-04
 Identities = 18/70 (25%), Positives = 30/70 (42%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
            K + E E+ +KKK  +K K KK +A     +         +      +  + KK KKK+
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKR 231

Query: 975 EAEKMKKKTE 984
                 +K+E
Sbjct: 232 SIAPDNEKSE 241



 Score = 43.5 bits (103), Expect = 3e-04
 Identities = 18/72 (25%), Positives = 29/72 (40%), Gaps = 2/72 (2%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
            + R ++++  +K +KKKKK K+         AE         E     G     KK KK
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNNA--TGSSAEATVSSAVPTELSSGAGQVGEAKKLKK 229

Query: 958 KKKKKKKKKKKK 969
           K+      +K +
Sbjct: 230 KRSIAPDNEKSE 241



 Score = 42.7 bits (101), Expect = 5e-04
 Identities = 20/69 (28%), Positives = 31/69 (44%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           K +   E+ +KKKKKKK+  K     G+                GA ++G+ KK KKK+ 
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRS 232

Query: 963 KKKKKKKKK 971
                +K +
Sbjct: 233 IAPDNEKSE 241


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 47.0 bits (112), Expect = 8e-06
 Identities = 23/90 (25%), Positives = 57/90 (63%), Gaps = 13/90 (14%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKM----RKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           KK   +K+ K  EK  + +++++EAE+     RKK   ++  ++K++ E  +++      
Sbjct: 1   KKIGAKKRAKLEEK--QARRQQREAEEEEREERKKLEEKREGERKEEEELEEER------ 52

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
            +KKK+++++K+++++ +K+++E EK+K  
Sbjct: 53  -EKKKEEEERKEREEQARKEQEEYEKLKSS 81



 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 19/75 (25%), Positives = 50/75 (66%), Gaps = 2/75 (2%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            KK+ K  EK  +++  E  +++++  E+ K +   +  +K++++ +++++KKK+++++K
Sbjct: 5   AKKRAKLEEKQARRQQREAEEEERE--ERKKLEEKREGERKEEEELEEEREKKKEEEERK 62

Query: 974 KEAEKMKKKTEEAEK 988
           +  E+ +K+ EE EK
Sbjct: 63  EREEQARKEQEEYEK 77



 Score = 39.7 bits (93), Expect = 0.003
 Identities = 19/75 (25%), Positives = 54/75 (72%), Gaps = 5/75 (6%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           KK   K+  K+ +K+     ++++++AE+ +++  +K+ ++K++ ++K++++ +++++KK
Sbjct: 1   KKIGAKKRAKLEEKQA----RRQQREAEEEEREERKKL-EEKREGERKEEEELEEEREKK 55

Query: 974 KEAEKMKKKTEEAEK 988
           KE E+ K++ E+A K
Sbjct: 56  KEEEERKEREEQARK 70



 Score = 32.7 bits (75), Expect = 0.54
 Identities = 16/64 (25%), Positives = 38/64 (59%), Gaps = 2/64 (3%)

Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK 942
           +REA         + +  R  ++K  E++ ++++KKKE E+ +++   E+ +K++++ EK
Sbjct: 20  QREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKER--EEQARKEQEEYEK 77

Query: 943 MKKK 946
           +K  
Sbjct: 78  LKSS 81


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 47.6 bits (113), Expect = 2e-05
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           K   K +  ++KK  A+K    K   K KK  K       K A K+ K K  K   KK  
Sbjct: 5   KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVT 64

Query: 966 KKKKKKKKKEAEKMKKKTEEAE 987
            K +K +  + E + KKT + E
Sbjct: 65  VKFEKTESVKKESVAKKTVKKE 86



 Score = 46.5 bits (110), Expect = 3e-05
 Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 1/93 (1%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           KK T++K  +  +   KK    KE  K +K       K  KK A+  K K  +   KK  
Sbjct: 5   KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVT 64

Query: 957 KKKKKKKKKKKKKKKKKKEA-EKMKKKTEEAEK 988
            K +K +  KK+   KK    E +  +  EA  
Sbjct: 65  VKFEKTESVKKESVAKKTVKKEAVSAEVFEASN 97



 Score = 44.9 bits (106), Expect = 1e-04
 Identities = 24/88 (27%), Positives = 35/88 (39%), Gaps = 1/88 (1%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           +++    KK    K   K KK  +    +  K A K+KK K   +   KK   K  K + 
Sbjct: 14  EEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKS-VKTTTKKVTVKFEKTES 72

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
            KK+   KK  KK+    E  +   K  
Sbjct: 73  VKKESVAKKTVKKEAVSAEVFEASNKLF 100



 Score = 41.1 bits (96), Expect = 0.002
 Identities = 28/101 (27%), Positives = 42/101 (41%), Gaps = 10/101 (9%)

Query: 897 KKRTRRKKKKGAEKIRKKK-------KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
           KK   +K     E  + KK       K  K+A K++K K + K   KK   +  K +  +
Sbjct: 16  KKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTK-SVKTTTKKVTVKFEKTESVK 74

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKK--KEAEKMKKKTEEAEK 988
           K    KK  KK+    +  +   K  K   K+ KK   +EK
Sbjct: 75  KESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115



 Score = 40.3 bits (94), Expect = 0.003
 Identities = 21/82 (25%), Positives = 32/82 (39%), Gaps = 3/82 (3%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
               KK  KK+  + +K    +    K+    K   K       KK  K KK K  K   
Sbjct: 1   MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60

Query: 970 KK---KKKEAEKMKKKTEEAEK 988
           KK   K ++ E +KK++   + 
Sbjct: 61  KKVTVKFEKTESVKKESVAKKT 82


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 48.8 bits (116), Expect = 2e-05
 Identities = 25/100 (25%), Positives = 49/100 (49%), Gaps = 1/100 (1%)

Query: 890 LGFILFRKKRTRRKKKKGAEKIRKKKKK-KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
           L     R    +   +   E+I   K++ +KE E + +     K ++K+K  ++ + K  
Sbjct: 233 LKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLL 292

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K  ++ K +  K +++K   ++K KE+EK  KK E+  K
Sbjct: 293 AKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELK 332



 Score = 46.9 bits (111), Expect = 6e-05
 Identities = 27/93 (29%), Positives = 52/93 (55%), Gaps = 2/93 (2%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
             + ++++ +K  E + +  K+ KE EK +K +  E+  K     E+  K    K+ ++K
Sbjct: 253 EIESSKQELEKEEEILAQVLKENKEEEKEKKLQ--EEELKLLAKEEEELKSELLKLERRK 310

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              ++K K+ +K+ KK +KE +K K++ EE EK
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEK 343



 Score = 45.3 bits (107), Expect = 2e-04
 Identities = 28/96 (29%), Positives = 59/96 (61%), Gaps = 2/96 (2%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           IL +  +  ++++K  +K+++++ K    E+   K    K++++K D E+ K K +EK  
Sbjct: 267 ILAQVLKENKEEEK-EKKLQEEELKLLAKEEEELKSELLKLERRKVDDEE-KLKESEKEL 324

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK +K+ KK+K++ ++ +K+ KE E  ++  EE E+
Sbjct: 325 KKLEKELKKEKEEIEELEKELKELEIKREAEEEEEE 360



 Score = 43.8 bits (103), Expect = 5e-04
 Identities = 21/96 (21%), Positives = 53/96 (55%), Gaps = 5/96 (5%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK-----KKKDAEKMKKKGAEKI 951
            K+   K+++   ++ K+ K++++ +K+++++     K+      +    + +K   E+ 
Sbjct: 257 SKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEK 316

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            K+ +K+ KK +K+ KK+K++ +E EK  K+ E   
Sbjct: 317 LKESEKELKKLEKELKKEKEEIEELEKELKELEIKR 352



 Score = 41.1 bits (96), Expect = 0.003
 Identities = 24/104 (23%), Positives = 49/104 (47%), Gaps = 8/104 (7%)

Query: 893 ILFRKKRTRRKKKKG--------AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
               KK    ++K          AE+  K +K K E EK  K K  E+  +  ++  K +
Sbjct: 755 KSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEE 814

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +  E+     ++++K K+++ ++   + KE +K++K  EE  +
Sbjct: 815 AELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELE 858



 Score = 41.1 bits (96), Expect = 0.003
 Identities = 17/88 (19%), Positives = 46/88 (52%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
             K +K   +  K++K K + E++R  +   K + +  + E++  +  EKI +++ ++  
Sbjct: 781 EEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELA 840

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + K+++K +K  +E  +  ++    E+
Sbjct: 841 LELKEEQKLEKLAEEELERLEEEITKEE 868



 Score = 41.1 bits (96), Expect = 0.003
 Identities = 18/91 (19%), Positives = 53/91 (58%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
            + +  + K  E+++  ++K KE E+  +K   +K +++++ +E   K+      ++K +
Sbjct: 726 DKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTE 785

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K +++K++K K +++E   ++++ +E  +
Sbjct: 786 KLKVEEEKEEKLKAQEEELRALEEELKEEAE 816



 Score = 40.3 bits (94), Expect = 0.006
 Identities = 20/88 (22%), Positives = 46/88 (52%), Gaps = 4/88 (4%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
             +K K  E+  +K + KKE E+  K + + K K+  ++ EK +K   E+    +K++K 
Sbjct: 742 LEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEE----EKEEKL 797

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K ++++ +  +++ + E    + E+   
Sbjct: 798 KAQEEELRALEEELKEEAELLEEEQLLI 825



 Score = 40.0 bits (93), Expect = 0.007
 Identities = 20/116 (17%), Positives = 49/116 (42%), Gaps = 8/116 (6%)

Query: 881 DVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEA--------EKMRKKKGAEK 932
                           ++     KK   EK      K +EA        + + +K   ++
Sbjct: 691 AKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKE 750

Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +++K   +K +++  +     K+K+  ++++K +K K ++++ EK+K + EE   
Sbjct: 751 EEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRA 806



 Score = 39.2 bits (91), Expect = 0.012
 Identities = 19/112 (16%), Positives = 45/112 (40%), Gaps = 2/112 (1%)

Query: 878 VNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMR-KKKGAEKMKKK 936
             +++    +      L   K   + KK   E  + K+K + E E +          ++ 
Sbjct: 182 ETENLAELIIDLEELKLQELKLKEQAKKA-LEYYQLKEKLELEEENLLYLDYLKLNEERI 240

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
               E ++ +  E    K++ +K+++   +  K+ K++E EK  ++ E    
Sbjct: 241 DLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLL 292



 Score = 38.8 bits (90), Expect = 0.016
 Identities = 21/101 (20%), Positives = 41/101 (40%), Gaps = 1/101 (0%)

Query: 889 ALGFILFRKKRTRRKKK-KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
           A G    RKK+ R KK  +  E + +     +E +    K   +  K  +    K K + 
Sbjct: 163 AAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLEL 222

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            E+        K  +++    ++  + E E+++   +E EK
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEK 263



 Score = 38.4 bits (89), Expect = 0.025
 Identities = 22/94 (23%), Positives = 48/94 (51%), Gaps = 2/94 (2%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA-EKMKKKGAEKIGKK 954
           R+K    +K K +EK   KK +K+  ++  + +  EK  K+ +   E  +++  +    +
Sbjct: 308 RRKVDDEEKLKESEK-ELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQ 366

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +K ++ +++   KKK + ++ +   K K EE E 
Sbjct: 367 EKLEQLEEELLAKKKLESERLSSAAKLKEEELEL 400



 Score = 36.1 bits (83), Expect = 0.13
 Identities = 11/68 (16%), Positives = 30/68 (44%), Gaps = 3/68 (4%)

Query: 909  EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
            E+ R K+    + E       A    ++K++     +    K  + +++KK+  ++  ++
Sbjct: 959  EEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDE---LKKERLEEEKKELLREIIEE 1015

Query: 969  KKKKKKEA 976
              ++ KE 
Sbjct: 1016 TCQRFKEF 1023



 Score = 34.2 bits (78), Expect = 0.39
 Identities = 17/90 (18%), Positives = 50/90 (55%), Gaps = 3/90 (3%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK---KKKDAEKMKKKGAEKIGKKKKKK 958
           +++ +  EK  K+ + K+EAE+  +++  +  +K    +++    KK  +E++    K K
Sbjct: 335 KEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLK 394

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +++ + K +++K+ K   E  +++ +  ++
Sbjct: 395 EEELELKNEEEKEAKLLLELSEQEEDLLKE 424


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 45.4 bits (108), Expect = 2e-05
 Identities = 16/68 (23%), Positives = 42/68 (61%), Gaps = 1/68 (1%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG-AEKIGKKKKKKKKKKKKKKKKKKKK 972
           K +KK        KK  E+ ++++ + E++ ++   +++ +K+  K K++K+++ ++K+K
Sbjct: 85  KWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144

Query: 973 KKEAEKMK 980
           +   E+MK
Sbjct: 145 EILKEQMK 152



 Score = 41.9 bits (99), Expect = 3e-04
 Identities = 17/70 (24%), Positives = 42/70 (60%), Gaps = 1/70 (1%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           ++ +KK +K     +K+K  E+ ++ + +    +++  E + K+  K K++K+++ ++K+
Sbjct: 84  LKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQ 143

Query: 971 KKK-KEAEKM 979
           K+  KE  KM
Sbjct: 144 KEILKEQMKM 153



 Score = 37.3 bits (87), Expect = 0.013
 Identities = 18/77 (23%), Positives = 48/77 (62%), Gaps = 9/77 (11%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K R + +K  G +K  K++++++E E        E++ ++++  E ++K+ A+   K++K
Sbjct: 85  KWRKKVRKLLGLDKKEKEEEEEEEVEV-------EELDEEEQIDELLEKELAKL--KREK 135

Query: 957 KKKKKKKKKKKKKKKKK 973
           +++ ++K+K+  K++ K
Sbjct: 136 RRENERKQKEILKEQMK 152



 Score = 31.5 bits (72), Expect = 0.96
 Identities = 11/61 (18%), Positives = 36/61 (59%), Gaps = 6/61 (9%)

Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
           +K+RK  G +K +K++++ E+       ++ +  ++++  +  +K+  K K+++  + ++
Sbjct: 88  KKVRKLLGLDKKEKEEEEEEE------VEVEELDEEEQIDELLEKELAKLKREKRRENER 141

Query: 982 K 982
           K
Sbjct: 142 K 142



 Score = 30.3 bits (69), Expect = 2.7
 Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 20/62 (32%)

Query: 942 KMKKKGAEKIGKKKKKKKKKK--------------------KKKKKKKKKKKKEAEKMKK 981
           K +KK  + +G  KK+K++++                    K+  K K++K++E E+ +K
Sbjct: 85  KWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQK 144

Query: 982 KT 983
           + 
Sbjct: 145 EI 146


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 43.9 bits (104), Expect = 2e-05
 Identities = 21/46 (45%), Positives = 30/46 (65%)

Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K+K KG +   KKKKKKKKKK K K++   +K+E EK   +++  E
Sbjct: 9   KLKLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKE 54



 Score = 42.0 bits (99), Expect = 7e-05
 Identities = 20/35 (57%), Positives = 23/35 (65%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K KK   KKKKKKKKKK K  E++  + EE EK
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEK 45



 Score = 41.6 bits (98), Expect = 9e-05
 Identities = 19/35 (54%), Positives = 21/35 (60%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K KK   KKKKKKKKKK K KE    +K+ EE   
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSS 47



 Score = 40.9 bits (96), Expect = 2e-04
 Identities = 19/36 (52%), Positives = 23/36 (63%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K KK   KKKKKKKKKK K + E + +K EE + 
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKS 46



 Score = 40.1 bits (94), Expect = 3e-04
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K     + KKKKKKKKK K K++   +K++E +   +   +  +
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGE 56



 Score = 38.9 bits (91), Expect = 0.001
 Identities = 21/80 (26%), Positives = 42/80 (52%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           K K     ++KKKKKKK+  K +++   EK +++K  AE   K+G E    +K ++++  
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDG 70

Query: 963 KKKKKKKKKKKKEAEKMKKK 982
               + ++  ++   K  K+
Sbjct: 71  MNLTEAERAFEEAQRKRLKE 90



 Score = 34.7 bits (80), Expect = 0.030
 Identities = 17/46 (36%), Positives = 27/46 (58%)

Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K+K K  +   KKKKKKKK K K++   +K+++E    +   +E E
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGE 56



 Score = 33.5 bits (77), Expect = 0.071
 Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 14/59 (23%)

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K     +KKKKK              KKKKK K K++   +K++++K  AE   K+ EE
Sbjct: 13  KGKKIDVKKKKK--------------KKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEE 57



 Score = 32.8 bits (75), Expect = 0.13
 Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           +  +  K K K +   ++KKK   K KKKK  +++      E+  K   +   K+ ++ +
Sbjct: 3   SNVVGGKLKLKGKKIDVKKKK---KKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDE 59

Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
             +K ++E + M     EAE+
Sbjct: 60  DNEKIEQEEDGMNLT--EAER 78



 Score = 29.7 bits (67), Expect = 1.6
 Identities = 18/73 (24%), Positives = 40/73 (54%), Gaps = 1/73 (1%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
            ++KKKK  +K + K++   E E+  +K  AE   K+ ++ E  +K   E+ G    + +
Sbjct: 19  VKKKKKKKKKKNKSKEEVVTEKEE-EEKSSAESDLKEGEEDEDNEKIEQEEDGMNLTEAE 77

Query: 960 KKKKKKKKKKKKK 972
           +  ++ ++K+ K+
Sbjct: 78  RAFEEAQRKRLKE 90


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 44.8 bits (106), Expect = 2e-05
 Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 31/120 (25%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEK-------------MRKKKGAEKMKKKKKDAEKMK 944
           K  R K ++    I++KK   KE  +             M+  K    +K  ++  EK K
Sbjct: 1   KSLRSKWRRKMRAIKRKKNAPKELARLKKALGLIDGNVLMKDVKDIATVKPAEEVKEKKK 60

Query: 945 KKGAEKI---------GKKKKKKKKKKKKK---------KKKKKKKKKEAEKMKKKTEEA 986
           K+G E            K   + KK  + +         +++ KK KK+ EK KKK + A
Sbjct: 61  KEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKKGA 120


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 47.9 bits (114), Expect = 2e-05
 Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 1/92 (1%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMR-KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
              +R   K  + I   + K K A+K +   K     K + K + K          K   
Sbjct: 158 ANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNI 217

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                KKK K+KK+KK+     +K+++EE   
Sbjct: 218 MSSFFKKKTKEKKEKKEASESTVKEESEEESG 249



 Score = 45.6 bits (108), Expect = 9e-05
 Identities = 16/77 (20%), Positives = 37/77 (48%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           +K K+KK++ E        E  ++  K    ++ + AE  G  + + + + K   ++   
Sbjct: 224 KKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDS 283

Query: 972 KKKEAEKMKKKTEEAEK 988
           +++  EK K+K +  +K
Sbjct: 284 EEETEEKEKEKRKRLKK 300



 Score = 45.6 bits (108), Expect = 1e-04
 Identities = 22/87 (25%), Positives = 29/87 (33%), Gaps = 1/87 (1%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMR-KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
             K  A   R   K  K       K K A+K +   K+    K +G   +     K+   
Sbjct: 153 ALKPTANGKRPSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPP 212

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK        KKK  EK +KK      
Sbjct: 213 KKSNIMSSFFKKKTKEKKEKKEASEST 239



 Score = 41.7 bits (98), Expect = 0.002
 Identities = 24/97 (24%), Positives = 45/97 (46%), Gaps = 6/97 (6%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK----KGAEKMKKKKKDAEKMKKKGAEK 950
           F KK+T+ KK+K        K++ +E    R      + AE     + + E   K   E+
Sbjct: 221 FFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGER 280

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
                +++ ++K+K+K+K+ KK  E E   ++ E   
Sbjct: 281 --SDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVP 315



 Score = 40.2 bits (94), Expect = 0.004
 Identities = 16/92 (17%), Positives = 32/92 (34%), Gaps = 3/92 (3%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEK---MRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            K T  +K +G   ++    K+   +K   M      +  +KK+K          E   +
Sbjct: 188 SKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEE 247

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
             K+    + +  +     + E E   K + E
Sbjct: 248 SGKRDVILEDESAEPTGLDEDEDEDEPKPSGE 279



 Score = 38.3 bits (89), Expect = 0.020
 Identities = 19/92 (20%), Positives = 32/92 (34%), Gaps = 15/92 (16%)

Query: 902 RKKKKGAEKIRK-----KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K  K  +   K     K + K   +    K+   K K     +            KK K
Sbjct: 178 VKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPK-KSNIMSSFF---------KKKTK 227

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +KK+KK+  +   K++ +E    +    E E 
Sbjct: 228 EKKEKKEASESTVKEESEEESGKRDVILEDES 259



 Score = 36.0 bits (83), Expect = 0.090
 Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 8/80 (10%)

Query: 897 KKRTRRKKKKGAEKIRKKKK-KKKEAEK-------MRKKKGAEKMKKKKKDAEKMKKKGA 948
           +++          + R +++  KK+  K        +K    E   + + +    K K  
Sbjct: 337 EEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKPKPK 396

Query: 949 EKIGKKKKKKKKKKKKKKKK 968
                     KK K  KKKK
Sbjct: 397 VSTPAVPAAAKKPKAPKKKK 416



 Score = 32.1 bits (73), Expect = 1.3
 Identities = 22/86 (25%), Positives = 35/86 (40%), Gaps = 15/86 (17%)

Query: 913 KKKKKKKEAEK---MRKKKGAEKMKKKK--KDAEK--MKKKGA--------EKIGKKKKK 957
           KK+++K+E        +++G  ++ KKK  KD E   + KK          E      K 
Sbjct: 334 KKEEEKEEVTVSPDGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKP 393

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKT 983
           K K          KK K  +K K+ +
Sbjct: 394 KPKVSTPAVPAAAKKPKAPKKKKQSS 419



 Score = 32.1 bits (73), Expect = 1.4
 Identities = 19/130 (14%), Positives = 40/130 (30%), Gaps = 41/130 (31%)

Query: 900 TRRKKKKGAEKIRK------KKKKK-------KEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           T  K+K+  ++++K      + ++         E E+  + +     KK+++  E     
Sbjct: 287 TEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSP 346

Query: 947 GAEKIGKKKKKKKKK----------------------------KKKKKKKKKKKKKEAEK 978
              +   +++  KKK                              K K K       A  
Sbjct: 347 DGGRRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKPKPKVSTPAVPAAA 406

Query: 979 MKKKTEEAEK 988
            K K  + +K
Sbjct: 407 KKPKAPKKKK 416


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 47.5 bits (113), Expect = 2e-05
 Identities = 21/83 (25%), Positives = 52/83 (62%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
            + +K +   + +R ++K+K+  E+  KK+  ++ KKKK+ A K+K++       +K ++
Sbjct: 266 PSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEANDPAQKLQE 325

Query: 959 KKKKKKKKKKKKKKKKEAEKMKK 981
           ++K++  K +K++K++E E  ++
Sbjct: 326 ERKEQLAKLRKEEKEREKEYEQE 348



 Score = 43.2 bits (102), Expect = 5e-04
 Identities = 22/103 (21%), Positives = 47/103 (45%), Gaps = 16/103 (15%)

Query: 896 RKKRTRRKKKKGAEKI----RKKKKKKKEAEKMRKKKG------------AEKMKKKKKD 939
           R+K+ R KK     K+     K +KK++EAE  +K K              E M++ ++ 
Sbjct: 14  REKKKREKKSIRQSKLEEELNKLEKKEEEAECKKKFKANPVPASVKLPLYEEIMEQNEER 73

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
            E++++K    +   +K     +++++KK    +K      ++
Sbjct: 74  REEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSER 116



 Score = 39.0 bits (91), Expect = 0.009
 Identities = 19/93 (20%), Positives = 41/93 (44%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            K  +R    +      ++  K    +     K  E + K  +   K K+   ++  K+ 
Sbjct: 237 HKSSSRTYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKEL 296

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +++KKKKK+   K K++ +  +  +K  EE ++
Sbjct: 297 EQRKKKKKEMAPKVKQRFEANDPAQKLQEERKE 329



 Score = 37.8 bits (88), Expect = 0.021
 Identities = 15/79 (18%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           F +   + K+ K  ++ ++ +++KK+ ++M     A K+K++ +  +  +K   ++  K+
Sbjct: 277 FLRTERKEKEAKEQQEKKELEQRKKKKKEM-----APKVKQRFEANDPAQK--LQEERKE 329

Query: 955 KKKKKKKKKKKKKKKKKKK 973
           +  K +K++K+++K+ +++
Sbjct: 330 QLAKLRKEEKEREKEYEQE 348



 Score = 37.8 bits (88), Expect = 0.021
 Identities = 19/65 (29%), Positives = 38/65 (58%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           ++ K ++++   K+R +  A+++ +  +   +M K  A++   KKKK+ +KK K KKK  
Sbjct: 135 LKDKMQEEELKRKIRVQMRAQELLQSSRLPPRMAKHEAQERLTKKKKRGQKKSKYKKKTF 194

Query: 971 KKKKE 975
           K K+ 
Sbjct: 195 KPKRA 199



 Score = 36.3 bits (84), Expect = 0.064
 Identities = 19/90 (21%), Positives = 44/90 (48%), Gaps = 11/90 (12%)

Query: 903 KKKKGAEKIRKKKKKKKEA-----EKMRKKKGA------EKMKKKKKDAEKMKKKGAEKI 951
           +  K   +      KK E+        RK+K A      ++++++KK  ++M  K  ++ 
Sbjct: 255 ENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRF 314

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
                 +K ++++K++  K +K+E E+ K+
Sbjct: 315 EANDPAQKLQEERKEQLAKLRKEEKEREKE 344



 Score = 34.8 bits (80), Expect = 0.20
 Identities = 14/88 (15%), Positives = 37/88 (42%), Gaps = 4/88 (4%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKK-EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
            ++R    ++K    +   +K       + +KK    +  +      + KK  A+ + K 
Sbjct: 70  NEERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKS 129

Query: 955 KKK---KKKKKKKKKKKKKKKKKEAEKM 979
                 K K ++++ K+K + +  A+++
Sbjct: 130 IYIPLLKDKMQEEELKRKIRVQMRAQEL 157



 Score = 33.6 bits (77), Expect = 0.52
 Identities = 15/82 (18%), Positives = 36/82 (43%), Gaps = 2/82 (2%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK--KKK 966
           E+ R++ ++K +A  +  +K     +++++    + +K      +++ KK K K   K  
Sbjct: 71  EERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKSI 130

Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
                K K  E+  K+    + 
Sbjct: 131 YIPLLKDKMQEEELKRKIRVQM 152



 Score = 32.1 bits (73), Expect = 1.4
 Identities = 15/33 (45%), Positives = 25/33 (75%)

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K++KKK++KK  ++ K ++E  K++KK EEAE 
Sbjct: 13  KREKKKREKKSIRQSKLEEELNKLEKKEEEAEC 45



 Score = 30.9 bits (70), Expect = 3.3
 Identities = 12/83 (14%), Positives = 34/83 (40%), Gaps = 6/83 (7%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
            + R+++ ++K    +   +      ++++  + +  +       +++ KK K K   K 
Sbjct: 70  NEERREEVREKSKAILLSSQKPFSFYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKS 129

Query: 969 ------KKKKKKEAEKMKKKTEE 985
                 K K ++E  K K + + 
Sbjct: 130 IYIPLLKDKMQEEELKRKIRVQM 152



 Score = 29.7 bits (67), Expect = 7.0
 Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 15/81 (18%)

Query: 894 LFRKKRTRRKKKKGA---------------EKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
           L RK R + + ++                 E++ KKKK+ ++  K +KK    K  K   
Sbjct: 144 LKRKIRVQMRAQELLQSSRLPPRMAKHEAQERLTKKKKRGQKKSKYKKKTFKPKRAKSIP 203

Query: 939 DAEKMKKKGAEKIGKKKKKKK 959
           D EK+ +K  +++ +KKK K+
Sbjct: 204 DFEKLHEKFQKQLAEKKKSKR 224


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 44.6 bits (106), Expect = 2e-05
 Identities = 18/83 (21%), Positives = 43/83 (51%), Gaps = 7/83 (8%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           + ++      K+  K    EK + +K   K ++KK +AEK+ K  AE+    K + + +K
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEK---KQEEKKSEAEKLAKMSAEE----KAEYELEK 53

Query: 963 KKKKKKKKKKKKEAEKMKKKTEE 985
            +K+ ++ + +    ++K + ++
Sbjct: 54  LEKELEELEAELARRELKAEAKK 76



 Score = 41.9 bits (99), Expect = 2e-04
 Identities = 17/67 (25%), Positives = 33/67 (49%), Gaps = 1/67 (1%)

Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK-KKKKKKKKKKKKKKKKKKEAEKMK 980
           E+ +     E  K   K+  K +KK  EK  + +K  K   ++K + + +K +KE E+++
Sbjct: 3   EEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELE 62

Query: 981 KKTEEAE 987
            +    E
Sbjct: 63  AELARRE 69



 Score = 40.7 bits (96), Expect = 5e-04
 Identities = 18/78 (23%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K  T ++  K   K + K +KK+E +K   +K   KM  ++K   +++K   +++ + + 
Sbjct: 6   KTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK-LAKMSAEEKAEYELEKL-EKELEELEA 63

Query: 957 KKKKKKKKKKKKKKKKKK 974
           +  +++ K + KK   +K
Sbjct: 64  ELARRELKAEAKKMLSEK 81



 Score = 36.9 bits (86), Expect = 0.010
 Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 5/69 (7%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           K+K   EK  K+++KK EAEK+ K    E   K + + EK++K+  E   +  +++ K +
Sbjct: 19  KEKAKWEK--KQEEKKSEAEKLAKMSAEE---KAEYELEKLEKELEELEAELARRELKAE 73

Query: 963 KKKKKKKKK 971
            KK   +K 
Sbjct: 74  AKKMLSEKG 82



 Score = 34.5 bits (80), Expect = 0.072
 Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 2/67 (2%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K   + +KK  EK  + +K  K + + + +   E  K +K+  E   +    ++  + K
Sbjct: 18  AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEY--ELEKLEKELEELEAELARRELKAEAK 75

Query: 957 KKKKKKK 963
           K   +K 
Sbjct: 76  KMLSEKG 82



 Score = 31.5 bits (72), Expect = 0.78
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +++K    K+  K   K+K K  +K ++K  EAEK
Sbjct: 2   PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEK 37



 Score = 29.5 bits (67), Expect = 3.6
 Identities = 10/60 (16%), Positives = 30/60 (50%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           + ++ + +KK  AEK+ K   ++K   ++ K +   +  + +    ++K +  + + +K 
Sbjct: 23  KWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLSEKG 82


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 48.1 bits (115), Expect = 2e-05
 Identities = 31/120 (25%), Positives = 58/120 (48%), Gaps = 32/120 (26%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKK-----------------------GAEKMK 934
           KR  ++ K+  EK RK +K+ +E EK+ KK+                         E+++
Sbjct: 462 KRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELE 521

Query: 935 KKKKDAEKMKKK---------GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           KK ++ EK+K+K           +K  +K ++ KKK  + +KK  + ++E  ++ K+ EE
Sbjct: 522 KKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEE 581



 Score = 45.4 bits (108), Expect = 1e-04
 Identities = 27/109 (24%), Positives = 65/109 (59%), Gaps = 3/109 (2%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
           +++ +E    L  +L        +  +  E++ K +K+ KE E+++++   E+++K+ + 
Sbjct: 192 EELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEE--IEELEKELES 249

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            E  K+K  EKI ++ +++ ++ KK+ ++ ++K KE +++K+K EE  K
Sbjct: 250 LEGSKRKLEEKI-RELEERIEELKKEIEELEEKVKELKELKEKAEEYIK 297



 Score = 44.7 bits (106), Expect = 2e-04
 Identities = 27/96 (28%), Positives = 54/96 (56%), Gaps = 4/96 (4%)

Query: 895 FRKKRTRRKKKKG--AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           +  +    +K+     E+I   +++ KE E+  K++  E++KKK K+ EK  ++  E+  
Sbjct: 305 YLDELREIEKRLSRLEEEINGIEERIKELEE--KEERLEELKKKLKELEKRLEELEERHE 362

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             ++ K KK++ ++ KK+      EK++K+ EE EK
Sbjct: 363 LYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEK 398



 Score = 44.3 bits (105), Expect = 3e-04
 Identities = 19/101 (18%), Positives = 57/101 (56%), Gaps = 9/101 (8%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA--------EKMKKKKKDAEKMKKKG 947
           + +    + ++  ++I + ++K KE +++++K           E+   + ++ EK   + 
Sbjct: 260 KIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRL 319

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            E+I   +++ K+ ++K+++ ++ KKK  ++++K+ EE E+
Sbjct: 320 EEEINGIEERIKELEEKEERLEELKKKL-KELEKRLEELEE 359



 Score = 43.5 bits (103), Expect = 6e-04
 Identities = 32/128 (25%), Positives = 62/128 (48%), Gaps = 20/128 (15%)

Query: 875 VSDVNDDVRREAVTALGFILFRKKRTRRKK---KKGAEKIRKKKKKKKEAEKMRKKKGA- 930
           + ++ +   R     L  +  + K   ++    ++  E   + K KK+E E+++K+    
Sbjct: 330 IKELEEKEER-----LEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGL 384

Query: 931 --EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK------ 982
             EK++K+ ++ EK K++  E+I  K   +  + KK+ K+ KK  +E +K K K      
Sbjct: 385 TPEKLEKELEELEKAKEEIEEEI-SKITARIGELKKEIKELKKAIEELKKAKGKCPVCGR 443

Query: 983 --TEEAEK 988
             TEE  K
Sbjct: 444 ELTEEHRK 451



 Score = 43.5 bits (103), Expect = 6e-04
 Identities = 24/99 (24%), Positives = 60/99 (60%), Gaps = 6/99 (6%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKK-KKEAEKMRKK-KGAEKMKKKKKDAEKMKKKGA----E 949
            + +   +K++  E+++KK K+ +K  E++ ++ +  E+ K KK++ E++KK+      E
Sbjct: 328 ERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPE 387

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+ K+ ++ +K K++ +++  K      ++KK+ +E +K
Sbjct: 388 KLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKK 426



 Score = 43.1 bits (102), Expect = 7e-04
 Identities = 18/92 (19%), Positives = 52/92 (56%), Gaps = 4/92 (4%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMR--KKKGAEKMKKKKKDAEKMKKKGAE--KIGKKKK 956
            ++ +      RK ++K +E E+     KK  E++++K K+ +++K+K  E  K+ +  +
Sbjct: 244 EKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE 303

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +   + ++ +K+  + ++E   ++++ +E E+
Sbjct: 304 EYLDELREIEKRLSRLEEEINGIEERIKELEE 335



 Score = 41.2 bits (97), Expect = 0.003
 Identities = 23/97 (23%), Positives = 55/97 (56%), Gaps = 7/97 (7%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           R  R   +     E+I++ ++K++  E+++KK   ++++K+ ++ E+  +   E+   KK
Sbjct: 315 RLSRLEEEINGIEERIKELEEKEERLEELKKKL--KELEKRLEELEERHEL-YEEAKAKK 371

Query: 956 KKKKKKKKKKK----KKKKKKKKEAEKMKKKTEEAEK 988
           ++ ++ KK+      +K +K+ +E EK K++ EE   
Sbjct: 372 EELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS 408



 Score = 40.0 bits (94), Expect = 0.007
 Identities = 24/105 (22%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 894 LFRKKRTRRKKKKGAEKIRKK--------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK 945
           L R+++  +K ++  +K  ++        ++ +KE E++ KK   E+ ++ +   E+  +
Sbjct: 614 LEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELR---EEYLE 670

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKK--KKKKKEAEKMKKKTEEAEK 988
              E  G + + ++ +K++++ KK  +K K+E E+ +K  +E EK
Sbjct: 671 LSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEK 715



 Score = 39.7 bits (93), Expect = 0.009
 Identities = 20/105 (19%), Positives = 56/105 (53%), Gaps = 12/105 (11%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG---AEKMKKKKKDAEKMKKKGAEKIG 952
           + +  ++K  +  +K+ + +++  E  K  ++ G    E+++++ K+ E    +  E   
Sbjct: 550 KLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKD 609

Query: 953 KKKKKKKKKKKKKKKKKK---------KKKKEAEKMKKKTEEAEK 988
            +K+ ++++K+ KK +++         + +K  E+++K+ EE EK
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEK 654



 Score = 39.3 bits (92), Expect = 0.011
 Identities = 21/90 (23%), Positives = 52/90 (57%), Gaps = 5/90 (5%)

Query: 902 RKKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
            K+ K  E+++++ ++ +KE E +   K   K+++K ++ E+  ++  ++I + ++K K+
Sbjct: 227 EKEVKELEELKEEIEELEKELESLEGSKR--KLEEKIRELEERIEELKKEIEELEEKVKE 284

Query: 961 KKKKKKKKKK--KKKKEAEKMKKKTEEAEK 988
            K+ K+K ++  K  +  E+   +  E EK
Sbjct: 285 LKELKEKAEEYIKLSEFYEEYLDELREIEK 314



 Score = 38.5 bits (90), Expect = 0.019
 Identities = 18/83 (21%), Positives = 47/83 (56%), Gaps = 3/83 (3%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           K AEK  ++++K+ +  +    K  E++ + +K  E+++K+  E    +KK  +++ ++ 
Sbjct: 608 KDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEE---LEKKYSEEEYEEL 664

Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
           +++  +  +E   ++ + EE EK
Sbjct: 665 REEYLELSRELAGLRAELEELEK 687



 Score = 38.1 bits (89), Expect = 0.027
 Identities = 24/117 (20%), Positives = 53/117 (45%), Gaps = 24/117 (20%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK----KGA--- 948
           +K+ T    +K  +++ + +K K+E E+   K  A   + KK+  E  K     K A   
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGK 437

Query: 949 -----------------EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                            E+   + K+ +K+ K+ ++K++K +KE  +++K  ++  +
Sbjct: 438 CPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESE 494



 Score = 38.1 bits (89), Expect = 0.028
 Identities = 20/93 (21%), Positives = 45/93 (48%), Gaps = 3/93 (3%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMR-KKKGAEKM-KKKKKDAEKMKKKGAE-KIG 952
             K     K++  E  ++ +  +    K+  K +  E+  ++ KK+ E++++K  E K  
Sbjct: 229 EVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKEL 288

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K+K ++  K  +  ++   + +E EK   + EE
Sbjct: 289 KEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321



 Score = 37.0 bits (86), Expect = 0.057
 Identities = 30/125 (24%), Positives = 63/125 (50%), Gaps = 19/125 (15%)

Query: 881 DVRREAVTALGFILFRKKRTRRKKKKGAEKIR--KKKKKKKEAEKM-RKKKGAEKMKKK- 936
            +R+E       +   K+    K K+ AE+++  ++K KK   E++ +K +  EK+K+K 
Sbjct: 477 KLRKELRELEKVLK--KESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534

Query: 937 ----------KKDAEK---MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
                     KK+ EK   +KKK AE   K  + +++  +  K+ ++   +  E+++++ 
Sbjct: 535 IKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERL 594

Query: 984 EEAEK 988
           +E E 
Sbjct: 595 KELEP 599



 Score = 36.2 bits (84), Expect = 0.099
 Identities = 15/89 (16%), Positives = 48/89 (53%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
            +  +++ +  EK  K+ +  +E  K ++K+  E +++  + + ++ +   E    +K+ 
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K+ ++ K++ ++ +K+ E+ +  K+  E 
Sbjct: 231 KELEELKEEIEELEKELESLEGSKRKLEE 259



 Score = 35.8 bits (83), Expect = 0.14
 Identities = 25/118 (21%), Positives = 56/118 (47%), Gaps = 10/118 (8%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRK---KKKKKKEAEKMRKKKGAEKMKKK 936
             +  E   A   +   +KR    +K+  E  +K   ++ ++   E +   +    ++ +
Sbjct: 622 KKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAE 681

Query: 937 KKDAEKMK---KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA---EKMKK-KTEEAE 987
            ++ EK +   KK  EK+ ++ ++++K KK+ +K +K  ++     EK+KK K    E
Sbjct: 682 LEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKE 739



 Score = 34.7 bits (80), Expect = 0.27
 Identities = 19/99 (19%), Positives = 49/99 (49%), Gaps = 8/99 (8%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK-----KGAEKIG 952
               ++ K+  E++ K  K+ +  E++ K+K  E  +  ++  E   +     +  EK+ 
Sbjct: 168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLE 227

Query: 953 KKKKKKKKKKKK---KKKKKKKKKKEAEKMKKKTEEAEK 988
           K+ K+ ++ K++    +K+ +  +    K+++K  E E+
Sbjct: 228 KEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEE 266



 Score = 33.9 bits (78), Expect = 0.46
 Identities = 21/90 (23%), Positives = 47/90 (52%), Gaps = 6/90 (6%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +  R +  + + ++   + + +E EK R+  KK  EK+K++    E+ +K   +++ K +
Sbjct: 662 EELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEEL---EEREKA-KKELEKLE 717

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K  ++ ++ ++K KK K    E+   K  E
Sbjct: 718 KALERVEELREKVKKYKALLKERALSKVGE 747



 Score = 32.3 bits (74), Expect = 1.7
 Identities = 22/86 (25%), Positives = 47/86 (54%), Gaps = 7/86 (8%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           K   E I++ K++ +  EK  K+   E +++  K+ EK   +  E + +  +   +  + 
Sbjct: 165 KNLGEVIKEIKRRIERLEKFIKRT--ENIEELIKEKEK---ELEEVLREINEISSELPEL 219

Query: 965 KKK--KKKKKKKEAEKMKKKTEEAEK 988
           +++  K +K+ KE E++K++ EE EK
Sbjct: 220 REELEKLEKEVKELEELKEEIEELEK 245



 Score = 30.4 bits (69), Expect = 6.3
 Identities = 28/131 (21%), Positives = 54/131 (41%), Gaps = 38/131 (29%)

Query: 896 RKKRTRRKKKKGAEKIR----KKKKKKKEAEKMRKKKG------------------AEKM 933
            K+    +  K   +I     + K+ KK  E+++K KG                   E  
Sbjct: 399 AKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYT 458

Query: 934 KKKKKDAEKMKKKGA--EKIGKKKKKKKKKKKKKKK--------------KKKKKKKEAE 977
            + K+  +++K+      K+ K+ ++ +K  KK+ +              ++K KK   E
Sbjct: 459 AELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLE 518

Query: 978 KMKKKTEEAEK 988
           +++KK EE EK
Sbjct: 519 ELEKKAEEYEK 529



 Score = 30.4 bits (69), Expect = 6.8
 Identities = 18/102 (17%), Positives = 54/102 (52%), Gaps = 9/102 (8%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK----KGAEKMKKKKKDAEKMKKKGAEKI 951
             +  +  ++ G E + + +++ KE E    +    K AEK  ++++   K  ++  +K 
Sbjct: 572 LAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKA 631

Query: 952 GKK---KKKKKKKKKKKKKKKKKK--KKEAEKMKKKTEEAEK 988
            ++    +K+ ++ +K+ ++ +KK  ++E E+++++  E  +
Sbjct: 632 FEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSR 673


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 47.0 bits (112), Expect = 3e-05
 Identities = 23/70 (32%), Positives = 36/70 (51%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           K  EK++  K   K  KK     +M      K+  K ++  +K   + K+KKK+KK+ EK
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340

Query: 979 MKKKTEEAEK 988
            KK+ E  E+
Sbjct: 341 KKKQIERLEE 350



 Score = 46.2 bits (110), Expect = 6e-05
 Identities = 23/85 (27%), Positives = 40/85 (47%), Gaps = 5/85 (5%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           EK+++K K  K   K  KK             +   K   +      + K+KKK+KKK++
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEE 339

Query: 969 KKKKK-----KEAEKMKKKTEEAEK 988
           KKKK+     +  EK++ +  + E+
Sbjct: 340 KKKKQIERLEERIEKLEVQATDKEE 364



 Score = 44.6 bits (106), Expect = 2e-04
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 8/86 (9%)

Query: 906 KGAEKIRKKKKKKKEAEKM-----RKKKGAEKMKKK-KKDAEKMKKKGAEKIGKKKKKKK 959
           K  EKI+  K + K  +KM            K+K K ++D EK+  +  EK  KK+KKK+
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEK--KKEKKKE 338

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEE 985
           +KKKK+ ++ +++ ++ E      EE
Sbjct: 339 EKKKKQIERLEERIEKLEVQATDKEE 364



 Score = 35.4 bits (82), Expect = 0.12
 Identities = 18/64 (28%), Positives = 34/64 (53%), Gaps = 4/64 (6%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
              K+  +   ++  +K +AE   KKK  +K +KKKK  E+++    E+I K + +   K
Sbjct: 307 SDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLE----ERIEKLEVQATDK 362

Query: 962 KKKK 965
           ++ K
Sbjct: 363 EENK 366



 Score = 35.4 bits (82), Expect = 0.13
 Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK-KDAEKMKKKGAEKIGKKK 955
            KR ++           K+K K + E+  +K  AE  +KKK K  E+ KKK  E++ ++ 
Sbjct: 293 LKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERI 352

Query: 956 KKKKKKKKKKKKKK 969
           +K + +   K++ K
Sbjct: 353 EKLEVQATDKEENK 366



 Score = 35.0 bits (81), Expect = 0.20
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 4/54 (7%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK----KKGAEKMKKKKKDAEKMK 944
            + K  R  +K  AE   KKK+KKKE +K ++    ++  EK++ +  D E+ K
Sbjct: 313 LKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366


>gnl|CDD|220121 pfam09135, Alb1, Alb1.  Alb1 is a nuclear shuttling factor involved
           in ribosome biogenesis.
          Length = 112

 Score = 43.6 bits (103), Expect = 3e-05
 Identities = 23/89 (25%), Positives = 37/89 (41%), Gaps = 4/89 (4%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG---AEKIGKKKKK 957
            R K+   ++       K E     KK+  +K K+  +   K +KKG   AE +     K
Sbjct: 25  PRAKRTDRKRPGILAASKNEGVSK-KKQRKKKPKRLTRKQRKRQKKGLERAEIVTDTLSK 83

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K+ K   + K  KK+K   +    K E+ 
Sbjct: 84  KRAKSIGRAKYVKKRKATWDDTNAKIEDE 112



 Score = 32.1 bits (73), Expect = 0.35
 Identities = 16/67 (23%), Positives = 32/67 (47%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           R+      + +K          +  +K    +     E + KKK++KKK K+  +K++K+
Sbjct: 7   RRAPSPSIDLDKSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKR 66

Query: 972 KKKEAEK 978
           +KK  E+
Sbjct: 67  QKKGLER 73



 Score = 29.0 bits (65), Expect = 4.0
 Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 4/58 (6%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKK----KKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K       + K+   ++ G     K     KKK++KKK K+  +K+ ++ KK  E AE
Sbjct: 18  KSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKRQKKGLERAE 75


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 43.8 bits (104), Expect = 4e-05
 Identities = 22/84 (26%), Positives = 47/84 (55%), Gaps = 8/84 (9%)

Query: 913 KKKKKKKEAEKMRKKKGAE---KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK--- 966
           K+++++KE ++   +   +    + KK +  +  +K+       +K+KKKKKKK+ K   
Sbjct: 32  KEEEEEKEEKEKEAEPDEDGWVTVTKKGRRPKTARKESVAAKAAEKEKKKKKKKELKNFY 91

Query: 967 --KKKKKKKKEAEKMKKKTEEAEK 988
             + ++ K+ E  +++KK EE +K
Sbjct: 92  RFQIRESKRNELAELRKKFEEDKK 115


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 46.9 bits (112), Expect = 4e-05
 Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 23/126 (18%)

Query: 880 DDVRREAVTALGFILFRK---------------KRTRRKKKKGAEKIRKKKKKKKEAEKM 924
           D +R E +  LG  +  K               +R + +K+   E+ R +K KK+E    
Sbjct: 521 DKLRDEWLPNLGIRIEDKPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEE--- 577

Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK-----KKKKKKKKKKKKKKKKKKEAEKM 979
           +KKK  EK++K K    +  K+  +K     +          ++  KK++KK  KE +K 
Sbjct: 578 KKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDADGEEISKKERKKLSKEYDKQ 637

Query: 980 KKKTEE 985
            K  EE
Sbjct: 638 AKLHEE 643



 Score = 40.4 bits (95), Expect = 0.005
 Identities = 16/43 (37%), Positives = 30/43 (69%)

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K  +K   +++K++K+  K++K+ +K KK+ EK KK+ E+ EK
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588



 Score = 32.3 bits (74), Expect = 1.3
 Identities = 12/44 (27%), Positives = 27/44 (61%)

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K     + K   K++ +++K++K+  K++K   K+KK+ E+ +K
Sbjct: 538 KPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKK 581


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 47.0 bits (112), Expect = 5e-05
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 897 KKRTRRKKKKGAEKI--RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           +   + KK KGA+    R+  + K+        K A +  + KK  E+++++  E+   K
Sbjct: 367 RYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLK 426

Query: 955 KKKKKKKKKKKKKK 968
            KKKK+KKK+  +K
Sbjct: 427 SKKKKRKKKEWFEK 440



 Score = 38.5 bits (90), Expect = 0.019
 Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 19/104 (18%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKM--RKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
            K  R     G E+I+ +  K K   +   R  K  +K+K  K + ++   +  E I   
Sbjct: 337 LKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYY 396

Query: 955 KKKKKKKKKKK-----------------KKKKKKKKKEAEKMKK 981
           +  K   +K +                  K KKKK+K+ E  +K
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440



 Score = 30.4 bits (69), Expect = 4.8
 Identities = 12/44 (27%), Positives = 25/44 (56%)

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           +K   +KI +   + +KK +K+ KK + K ++ E   ++ E+A 
Sbjct: 273 EKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAA 316



 Score = 30.4 bits (69), Expect = 5.3
 Identities = 24/131 (18%), Positives = 50/131 (38%), Gaps = 20/131 (15%)

Query: 878 VNDDVRREAVTALGFILFRKKRTRRKK-KKGAEKIRKKKKKKKEAEKMRKKKG------A 930
           +++   R+ +  L   L +K     KK +   EK   + ++ ++A +  ++KG       
Sbjct: 271 LDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANL 330

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKK-------------KKKKKKKKKKKKKKKEAE 977
           + +++  K        G E+I  +  K K             KK K  K    ++  E +
Sbjct: 331 QLIEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELK 390

Query: 978 KMKKKTEEAEK 988
           +     E A+ 
Sbjct: 391 EAIAYYESAKT 401


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 46.6 bits (111), Expect = 6e-05
 Identities = 25/90 (27%), Positives = 35/90 (38%), Gaps = 8/90 (8%)

Query: 897 KKRTRRKKKKGAEKIRKK-KKKKKEAEKMR-------KKKGAEKMKKKKKDAEKMKKKGA 948
           +     K +K A K+ K+ +K +KE E+          K  +   K  KK   K K    
Sbjct: 460 EDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSK 519

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
                 K  K   K KKKKKK+K     + 
Sbjct: 520 LDKAANKISKAAVKVKKKKKKEKSIDLDDD 549



 Score = 41.2 bits (97), Expect = 0.003
 Identities = 22/106 (20%), Positives = 38/106 (35%), Gaps = 14/106 (13%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEA-------------EKMRKKKGAEKMKKKKKDAEK 942
             K  +   K      + +K++++E                   KK   K K   K  + 
Sbjct: 464 EAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKA 523

Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             K  ++   K KKKKKK+K          ++++ K+    EE E 
Sbjct: 524 ANKI-SKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDED 568



 Score = 37.3 bits (87), Expect = 0.048
 Identities = 24/93 (25%), Positives = 42/93 (45%), Gaps = 6/93 (6%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRK------KKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +K KK  +   K+KK+  E E++        +K A K+ K+ + A+K +++         
Sbjct: 437 KKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPW 496

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K      K  KK+  KKK + K+ K   +  K
Sbjct: 497 LKTTSSVGKSAKKQDSKKKSSSKLDKAANKISK 529



 Score = 35.4 bits (82), Expect = 0.18
 Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 20/107 (18%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE------- 949
           K RT  + +     I ++     + +K+      E+++ KK   E+ K + AE       
Sbjct: 158 KPRTELESE--IASILEESGLASDEKKLTP---FEELELKKLSPEEAKARRAELRKMREL 212

Query: 950 ---KIGKKKKKKKKKKKK-----KKKKKKKKKKEAEKMKKKTEEAEK 988
              +  K K+ KK K KK     KK+K K++ KE E++ K   EA  
Sbjct: 213 LFREEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEAAL 259



 Score = 34.3 bits (79), Expect = 0.36
 Identities = 22/107 (20%), Positives = 37/107 (34%), Gaps = 19/107 (17%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE---------KMKKKKKDAEKMKKKG 947
           K   RRK      +   + KK K+  K   K+  E         + + K +       K 
Sbjct: 418 KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKR 477

Query: 948 AEKIGKKKKK----------KKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           +EK  K++++          K      K  KK+  KK++     K  
Sbjct: 478 SEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAA 524



 Score = 34.3 bits (79), Expect = 0.41
 Identities = 24/108 (22%), Positives = 51/108 (47%), Gaps = 19/108 (17%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-------------KKGAEKMKKKKKDA 940
           L   K  +R     AE  RKK++   E E++R+             ++ ++K   ++K  
Sbjct: 373 LLSMKFMQR-----AEA-RKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFG 426

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +  +K AE    KK+ K + K+KK+  ++++ ++ E+ K +    + 
Sbjct: 427 PENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKL 474



 Score = 33.9 bits (78), Expect = 0.46
 Identities = 18/92 (19%), Positives = 37/92 (40%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K   + KK+   E+  + +++ K  +   K     +  +K+++ E++ ++          
Sbjct: 444 KNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSV 503

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K  KK+  KKK   K  +A     K     K
Sbjct: 504 GKSAKKQDSKKKSSSKLDKAANKISKAAVKVK 535



 Score = 32.3 bits (74), Expect = 1.5
 Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 17/106 (16%)

Query: 893 ILFRK-KRTRRKKK---KGAEKIRKKKKKK---KEAEKMRKKKGAEKMKKKKKDAEKMKK 945
           +LFR+  + +R KK   K   ++ KK+K K   KE E++ K           ++ EK+++
Sbjct: 212 LLFREEAKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEA----ALEELEKLER 267

Query: 946 KGA-EKIGKKKKKKKKKKKKKKKKKKKKK-----KEAEKMKKKTEE 985
           + A E++  K K   K  K   K  + K      K  E+  ++ EE
Sbjct: 268 RRAEERMTLKHKNTSKWAKSMLKTGRAKYDEEARKALEEQLRQGEE 313



 Score = 32.0 bits (73), Expect = 2.1
 Identities = 28/114 (24%), Positives = 47/114 (41%), Gaps = 27/114 (23%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE------------------- 941
             +KK G  K  +  ++      M+  + AE  KK++ DAE                   
Sbjct: 353 MLRKKLGKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEEN 412

Query: 942 --------KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
                     +K G E   K+ + KK KK+ K + K+KK+ + E+  +  EEA+
Sbjct: 413 EEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAK 466



 Score = 31.2 bits (71), Expect = 3.0
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           KK+KKK+K+K++   K +  KKE  K KK 
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKL 656



 Score = 30.8 bits (70), Expect = 4.5
 Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 16/66 (24%)

Query: 930 AEKMKKKKKDAEKMKKK---------------GAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           AE  K+KK+  E+   K               G +K  KK+K+K++   K +  KK+K+K
Sbjct: 593 AEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGVKKEKRK 652

Query: 975 EAEKMK 980
           +  K+K
Sbjct: 653 DK-KLK 657


>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
          Length = 1052

 Score = 46.5 bits (110), Expect = 7e-05
 Identities = 30/80 (37%), Positives = 39/80 (48%), Gaps = 4/80 (5%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM--KKKGAEKIGK 953
            RK   R  KK  A +   KKK  K+A    KK  A+K   KK  A+K   KK  A+K+ +
Sbjct: 975  RKTVRRSVKKAAATRAAMKKKVAKKAPA--KKAAAKKAAAKKAAAKKKVAKKAPAKKVAR 1032

Query: 954  KKKKKKKKKKKKKKKKKKKK 973
            K   KK  KK  +K   +K 
Sbjct: 1033 KPAAKKAAKKPARKAAGRKA 1052



 Score = 43.8 bits (103), Expect = 4e-04
 Identities = 33/111 (29%), Positives = 46/111 (41%), Gaps = 4/111 (3%)

Query: 874  AVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKM 933
            A     + V   A  A+  +    ++       G EK+ +  +K     +  KK  A + 
Sbjct: 933  AQPAQENTVVESAEKAVAAVSEAAQQAATAVASGIEKVAEAVRKT--VRRSVKKAAATRA 990

Query: 934  KKKKKDAEK--MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
              KKK A+K   KK  A+K   KK   KKK  KK   KK  +K A K   K
Sbjct: 991  AMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKAPAKKVARKPAAKKAAK 1041



 Score = 40.4 bits (94), Expect = 0.005
 Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 10/87 (11%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
             +K  RR  KK A        +    +K+ KK  A+K   KK  A+K   K      KK 
Sbjct: 974  VRKTVRRSVKKAA------ATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKV--AKKA 1025

Query: 956  KKKKKKKKKKKKKKKKK--KKEAEKMK 980
              KK  +K   KK  KK  +K A +  
Sbjct: 1026 PAKKVARKPAAKKAAKKPARKAAGRKA 1052



 Score = 30.3 bits (68), Expect = 6.9
 Identities = 22/63 (34%), Positives = 27/63 (42%), Gaps = 3/63 (4%)

Query: 929  GAEKMKK-KKKDAEKMKKKGAEK--IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            G EK+ +  +K   +  KK A      KKK  KK   KK   KK   KK A K K   + 
Sbjct: 966  GIEKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKKA 1025

Query: 986  AEK 988
              K
Sbjct: 1026 PAK 1028


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 45.4 bits (108), Expect = 7e-05
 Identities = 44/156 (28%), Positives = 59/156 (37%), Gaps = 49/156 (31%)

Query: 875 VSDVNDDVRRE---------AVTALGFILFRKKR--TRRKKKKGAE---------KIRKK 914
           + DV DD +RE         AV      L +K     +R     AE         K+RKK
Sbjct: 65  IEDVEDDFKRELAFYKQALAAVLE-ARKLLKKLGVPFKRPDDYFAEMVKSDEHMQKVRKK 123

Query: 915 ----KKKKKEAEKMRKKKGAEKMKK-----KKKDAEKMKKKGAEKIGKKKKKKK------ 959
                  KK +E+ RK++  +K  K     K ++  K KK   EKI K KKK+K      
Sbjct: 124 LLAEAAAKKASEEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEG 183

Query: 960 -------------KKKKKKKKKKKKKKKEAEKMKKK 982
                          KKK +K    + K   K K K
Sbjct: 184 DLFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAK 219



 Score = 44.6 bits (106), Expect = 1e-04
 Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 6/86 (6%)

Query: 895 FRKKRTRRK--KKKGAEKIRKKKKKKKEAE----KMRKKKGAEKMKKKKKDAEKMKKKGA 948
            RK+R  +K  K+   EK++++ K+KK+      K++KK+     +    D         
Sbjct: 137 ARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGDLFDFLVDDAAKG 196

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKK 974
               K +K    + K   K+K K  K
Sbjct: 197 GSKKKGRKGGAARGKPNAKRKAKDAK 222



 Score = 42.3 bits (100), Expect = 7e-04
 Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 19/93 (20%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK--KKG------------- 947
             KK +E+ RK+++ KK  ++++ +K  E+ K+KK   EK+K  KK              
Sbjct: 129 AAKKASEEARKQRELKKFGKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGDLFDF 188

Query: 948 ----AEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
               A K G KKK +K    + K   K+K K+A
Sbjct: 189 LVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDA 221



 Score = 33.9 bits (78), Expect = 0.32
 Identities = 24/101 (23%), Positives = 38/101 (37%), Gaps = 26/101 (25%)

Query: 898 KRTRRKKKKGAEKIRK---------KKKKKKEAEKMRKKKGAEKMKKKKKDAEKM----K 944
           K+ ++K+K G ++            K   KK+  K    +G    K+K KDA K     K
Sbjct: 170 KKLKKKRKGGGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDA-KFGFGGK 228

Query: 945 KKGAEK------------IGKKKKKKKKKKKKKKKKKKKKK 973
           K+G++               KK K      K   K+  K +
Sbjct: 229 KRGSKSNTAESSNDLSGFSVKKMKGGGGAGKGGNKRPGKSR 269


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 44.0 bits (104), Expect = 8e-05
 Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 4/63 (6%)

Query: 926 KKKGAEKMKKKK----KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
           KKK   K KK +       E        K  +KK KKKK +  K++KKKKK+K+ +K + 
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169

Query: 982 KTE 984
             E
Sbjct: 170 SPE 172



 Score = 43.7 bits (103), Expect = 1e-04
 Identities = 22/64 (34%), Positives = 36/64 (56%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           +  +  KKK   K +K +  + + ++     +  K   +K  KKK +  K++KKKKK+KK
Sbjct: 104 MHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKK 163

Query: 971 KKKK 974
           KKKK
Sbjct: 164 KKKK 167



 Score = 41.4 bits (97), Expect = 6e-04
 Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 7/75 (9%)

Query: 895 FRKKRTRRKKKKGAEKIRKKK-KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           +R    +  KKK   K +K + +     E     +G +  +KK K      KK  E   +
Sbjct: 101 YRLMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHK------KKKHEDDKE 154

Query: 954 KKKKKKKKKKKKKKK 968
           +KKKKK+KKKKKK+ 
Sbjct: 155 RKKKKKEKKKKKKRH 169



 Score = 39.8 bits (93), Expect = 0.002
 Identities = 20/59 (33%), Positives = 29/59 (49%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           KK K K +  + +     E     +      KK   +K    K++KKKKK+KKKKKK+ 
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169



 Score = 38.3 bits (89), Expect = 0.007
 Identities = 18/37 (48%), Positives = 23/37 (62%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
           +K KKKK E +K RKKK  EK KKKK+ + +    G 
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHPGVGF 178



 Score = 34.0 bits (78), Expect = 0.21
 Identities = 19/67 (28%), Positives = 31/67 (46%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           L   +  ++K K   +K R +    +E     +     + K KKK  E  K++  +K  K
Sbjct: 103 LMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEK 162

Query: 954 KKKKKKK 960
           KKKKK+ 
Sbjct: 163 KKKKKRH 169



 Score = 32.9 bits (75), Expect = 0.46
 Identities = 19/65 (29%), Positives = 32/65 (49%), Gaps = 1/65 (1%)

Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           M  +   +K K K K   + +    E+     +  K  +KK KKKK +  KE +K KK+ 
Sbjct: 104 MHIQPPKKKHKHKHK-KHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEK 162

Query: 984 EEAEK 988
           ++ +K
Sbjct: 163 KKKKK 167



 Score = 32.5 bits (74), Expect = 0.67
 Identities = 13/31 (41%), Positives = 21/31 (67%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE 931
           ++ KKK  E  +++KKKKKE +K +K+   E
Sbjct: 142 KKHKKKKHEDDKERKKKKKEKKKKKKRHSPE 172


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 45.9 bits (110), Expect = 8e-05
 Identities = 33/122 (27%), Positives = 62/122 (50%), Gaps = 20/122 (16%)

Query: 886 AVTALGFILFRKKRTRRKKKKGAEKIRKK--KKKKKEAEKMRKKK---GAEKMKKKKKDA 940
            V A+     RKK     K K AE+  K+  ++ KKEAE ++K+      E++ K + + 
Sbjct: 15  VVGAVIGYFVRKKIAE-AKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEF 73

Query: 941 EKMKKKGAEKIGKKKKKKKKKK---KKKKKKKKKKKKEAEK-----------MKKKTEEA 986
           EK  ++   ++ K +K+  +K+    +K +  +K+++E EK           ++KK EE 
Sbjct: 74  EKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEEL 133

Query: 987 EK 988
           E+
Sbjct: 134 EE 135



 Score = 37.1 bits (87), Expect = 0.042
 Identities = 22/114 (19%), Positives = 62/114 (54%), Gaps = 10/114 (8%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEK---IRKKKKKKKEAEKMRKKKGAEKMKKK 936
           ++ ++EA       L   K    K +   EK    R+ + +K E +++ +K+  E + +K
Sbjct: 45  EEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLE-KRLLQKE--ENLDRK 101

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK----KTEEA 986
            +  EK +++  +K  + ++K+++ +KK+++ ++  +++ +++++      EEA
Sbjct: 102 LELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEA 155


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 46.1 bits (110), Expect = 8e-05
 Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 9/82 (10%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           ++++ E   +   +  +K++KK++ AEK  +K   +    KKK +   KK K    + KK
Sbjct: 397 EEEEGENGNLSPAE-RKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455

Query: 975 E-----AEKMKKKT---EEAEK 988
                  EK+ +     EEA K
Sbjct: 456 VDPDPLGEKLARTEDPLEEAMK 477



 Score = 44.2 bits (105), Expect = 3e-04
 Identities = 24/87 (27%), Positives = 38/87 (43%), Gaps = 4/87 (4%)

Query: 902 RKKKKGAEKIRKK---KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
            K   GA +I  K   K    E E+   + G     ++KK  +K +K   +K  K++ +K
Sbjct: 374 FKAALGAIEIYLKLHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKA-EKKAEKEEAEK 432

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEE 985
              KKK +   KK K    + KK   +
Sbjct: 433 AAAKKKAEAAAKKAKGPDGETKKVDPD 459



 Score = 37.2 bits (87), Expect = 0.040
 Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 1/55 (1%)

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK-KKEAEKMKKKTEEAEK 988
           K      + ++     +   ++KK +KK++K +KK +K + E    KKK E A K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444



 Score = 36.8 bits (86), Expect = 0.058
 Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           RKK   RKK++ AEK + +K++ ++A    KKK     KK K    + KK   + +G+K
Sbjct: 411 RKKL--RKKQRKAEK-KAEKEEAEKAAA--KKKAEAAAKKAKGPDGETKKVDPDPLGEK 464



 Score = 35.7 bits (83), Expect = 0.13
 Identities = 13/55 (23%), Positives = 28/55 (50%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             + ++ E      +    KK +KK++K +KK +K++ +K  A+K  +   +  K
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAK 447



 Score = 33.8 bits (78), Expect = 0.57
 Identities = 15/46 (32%), Positives = 21/46 (45%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
           L +K+R   KK +  E  +   KKK EA   + K    + KK   D
Sbjct: 414 LRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPD 459


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 44.7 bits (106), Expect = 9e-05
 Identities = 25/90 (27%), Positives = 46/90 (51%), Gaps = 2/90 (2%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
            L  ++  R +  +  +     +K+KK++ K ++K    K  K++K+ EK  KK  +++ 
Sbjct: 148 ELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKL--YKELKERKEREKKLKKVEQRLE 205

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
            +++  KK K KKKK  K K  +     KK
Sbjct: 206 LQRELMKKGKGKKKKIVKDKDGKVVYKWKK 235



 Score = 42.0 bits (99), Expect = 7e-04
 Identities = 23/96 (23%), Positives = 41/96 (42%), Gaps = 15/96 (15%)

Query: 879 NDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
              + + ++          K+ R+  K+  E+  ++KK KK  +++  ++  E MKK K 
Sbjct: 159 ISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQR--ELMKKGKG 216

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
                         KKK  K K  K   K KK++K+
Sbjct: 217 K-------------KKKIVKDKDGKVVYKWKKERKR 239



 Score = 33.9 bits (78), Expect = 0.32
 Identities = 20/79 (25%), Positives = 40/79 (50%), Gaps = 1/79 (1%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
            +   + R  + +K      ++KK + K K+K     K +K+  +K+ KK +++ + +++
Sbjct: 151 DRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKL-KKVEQRLELQRE 209

Query: 965 KKKKKKKKKKEAEKMKKKT 983
             KK K KKK+  K K   
Sbjct: 210 LMKKGKGKKKKIVKDKDGK 228



 Score = 32.3 bits (74), Expect = 0.86
 Identities = 20/89 (22%), Positives = 41/89 (46%), Gaps = 13/89 (14%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA--EKIGKKKK-----------KKK 959
            +++ +    ++ K    ++ +KKK   +K K      E+  ++KK           ++ 
Sbjct: 151 DRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQREL 210

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK K KKKK  K K+ + + K  +E ++
Sbjct: 211 MKKGKGKKKKIVKDKDGKVVYKWKKERKR 239



 Score = 32.0 bits (73), Expect = 1.1
 Identities = 25/90 (27%), Positives = 39/90 (43%), Gaps = 12/90 (13%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEK--------MKKKKKDAEKMKKKGAEKIGKKK 955
           KKK  +   K   KK++  K  K+K AE+        M   K D      K  EK    K
Sbjct: 21  KKKDYKLRAKDYHKKEKTLKALKEKAAERNPDEFYFGMISSKTDDGVHILKREEKEVLSK 80

Query: 956 KKKKKKKKKKKK----KKKKKKKEAEKMKK 981
           ++ K  K +       K++ + K+ EK+K+
Sbjct: 81  EQVKLLKTQDLNYVRTKRQAEAKKIEKLKE 110


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 46.0 bits (110), Expect = 1e-04
 Identities = 24/96 (25%), Positives = 43/96 (44%), Gaps = 9/96 (9%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            +    R  +  A + + KK  KK+     KKK  ++   +     K  K    + G+++
Sbjct: 99  AEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRR 158

Query: 956 KKKKKKKKKKKKKKKKK---------KKEAEKMKKK 982
           ++KKKKK+K  +K  ++          + AEKM  K
Sbjct: 159 RRKKKKKQKPTEKIPREVVIPETITVAELAEKMAVK 194



 Score = 36.0 bits (84), Expect = 0.12
 Identities = 15/73 (20%), Positives = 40/73 (54%), Gaps = 7/73 (9%)

Query: 923 KMRKKKGAEKMKKKKKDA-EKMKKKGAEK------IGKKKKKKKKKKKKKKKKKKKKKKE 975
           K+R  + A+++    K+  EK+K+ G E       + +++ +K++ K++ +++ K + +E
Sbjct: 3   KVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEE 62

Query: 976 AEKMKKKTEEAEK 988
           A   + + E   +
Sbjct: 63  AAAAEAEEEAKAE 75



 Score = 35.2 bits (82), Expect = 0.22
 Identities = 19/75 (25%), Positives = 31/75 (41%), Gaps = 17/75 (22%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD---------------- 939
           + K+ + K+K      R K  K +   + R+++  +K K+K  +                
Sbjct: 127 KPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETITVAE 186

Query: 940 -AEKMKKKGAEKIGK 953
            AEKM  K AE I K
Sbjct: 187 LAEKMAVKAAEVIKK 201


>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease.  The potyviridae
           family positive stand RNA viruses with genome encoding a
           polyprotein. members include zucchini yellow mosaic
           virus, and turnip mosaic viruses which cause
           considerable losses of crops worldwide. This family
           consists of a C terminus region from various plant
           potyvirus P1 proteins (found at the N terminus of the
           polyprotein). The C terminus of P1 is a serine-type
           protease responsible for autocatalytic cleavage between
           P1 and the helper component protease pfam00851. The
           entire P1 protein may be involved in virus-host
           interactions.
          Length = 245

 Score = 44.6 bits (106), Expect = 1e-04
 Identities = 26/98 (26%), Positives = 44/98 (44%), Gaps = 12/98 (12%)

Query: 888 TALGFILFRKKR-TRRKKKKGAEKIRKKKKK--KKEAEKMRKKKGAEKMKKKKKDAEKMK 944
             +   L RK+    +++KKG   +RK       K+ EK+++++  E+   +   A  + 
Sbjct: 5   AKVAERLLRKEMSKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQGAYASIVS 64

Query: 945 K---------KGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           K            E +  +    K+  KK KKKKKKKK
Sbjct: 65  KITPIGTDKVSKTESVSFRTPYYKRTTKKMKKKKKKKK 102



 Score = 41.9 bits (99), Expect = 8e-04
 Identities = 22/91 (24%), Positives = 34/91 (37%), Gaps = 6/91 (6%)

Query: 891 GFILFRKKRT----RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
             I   KK      +    + A+K  K K++++E  +  +   A  + K          K
Sbjct: 17  SKIKQEKKGRIILRKLSPAQVAKKREKLKREEREERQFLQGAYASIVSKITPIGTDKVSK 76

Query: 947 GA--EKIGKKKKKKKKKKKKKKKKKKKKKKE 975
                      K+  KK KKKKKKKK    +
Sbjct: 77  TESVSFRTPYYKRTTKKMKKKKKKKKVVMSD 107


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 43.5 bits (102), Expect = 1e-04
 Identities = 19/92 (20%), Positives = 63/92 (68%), Gaps = 2/92 (2%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKM--KKKKKDAEKMKKKGAEKIGK 953
           ++++ RR++++     R++ K++   E++R+++ A +   ++ ++  EK K+K  E+  +
Sbjct: 41  QEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQ 100

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           +++++++ +K+K++ + + ++EAE+M+ + E+
Sbjct: 101 EQEEQERIQKQKEEAEARAREEAERMRLEREK 132



 Score = 43.2 bits (101), Expect = 1e-04
 Identities = 23/93 (24%), Positives = 64/93 (68%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           R++  RR++++ A +  +K K+K E E+ ++++  E+++K+K++AE   ++ AE++  ++
Sbjct: 71  REEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLER 130

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +K  ++ ++++ ++KK+ +E  K  +K+E + +
Sbjct: 131 EKHFQQIEQERLERKKRLEEIMKRTRKSEVSPQ 163



 Score = 40.1 bits (93), Expect = 0.002
 Identities = 20/95 (21%), Positives = 65/95 (68%), Gaps = 5/95 (5%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           +L  K+R  R+++   E+  +++++++E +++ +++   +  +++   E+  ++  E+  
Sbjct: 27  LLAEKRRQAREQR---EQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEE-- 81

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           + ++K++K K+K ++++K++++E E+++K+ EEAE
Sbjct: 82  RAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAE 116



 Score = 29.3 bits (65), Expect = 6.1
 Identities = 20/74 (27%), Positives = 44/74 (59%), Gaps = 7/74 (9%)

Query: 902 RKKKKGAEKIRKKK-----KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           +++++  E+I+K+K     + ++EAE+MR ++  EK  ++ +     +KK  E+I K+ +
Sbjct: 99  KQEQEEQERIQKQKEEAEARAREEAERMRLER--EKHFQQIEQERLERKKRLEEIMKRTR 156

Query: 957 KKKKKKKKKKKKKK 970
           K +   + KK+  K
Sbjct: 157 KSEVSPQVKKEDPK 170


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 44.6 bits (106), Expect = 1e-04
 Identities = 17/49 (34%), Positives = 25/49 (51%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           D E+ ++K  EK     K K KK  K K ++K+K K  ++ K   E  E
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86



 Score = 42.0 bits (99), Expect = 7e-04
 Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            EK ++K K A K K K        K K ++K+K K++K++K  +E E+   + E AEK
Sbjct: 43  EEKEEEKAKVAAKAKAKK-----ALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEK 96



 Score = 38.5 bits (90), Expect = 0.010
 Identities = 12/56 (21%), Positives = 31/56 (55%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           ++ K+++K     K K  + +  K ++K+K K++K++K  ++ +E     +  E+ 
Sbjct: 42  DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97



 Score = 36.2 bits (84), Expect = 0.050
 Identities = 14/37 (37%), Positives = 17/37 (45%), Gaps = 3/37 (8%)

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           EK+K   AEK  K  KKKK K K K         + +
Sbjct: 193 EKLK---AEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226



 Score = 35.4 bits (82), Expect = 0.10
 Identities = 10/36 (27%), Positives = 20/36 (55%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            ++K+++K K   K K KK  K   + K+K +  ++
Sbjct: 42  DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKE 77



 Score = 34.2 bits (79), Expect = 0.23
 Identities = 11/36 (30%), Positives = 15/36 (41%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K +K  K  KKKK K K K          ++ + 
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDG 229



 Score = 33.5 bits (77), Expect = 0.45
 Identities = 14/66 (21%), Positives = 21/66 (31%), Gaps = 17/66 (25%)

Query: 920 EAEKMRKKKGA------EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            +E ++K          EK+K +K              G KKKK K K K          
Sbjct: 176 SSEDIKKVSSTLNVLINEKLKAEKAAK-----------GGKKKKGKAKAKLNVGGANDDD 224

Query: 974 KEAEKM 979
            + +  
Sbjct: 225 DDYDGY 230



 Score = 32.3 bits (74), Expect = 0.97
 Identities = 9/34 (26%), Positives = 21/34 (61%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + ++K+++K K   K K KK  +   ++ E+A++
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKR 74



 Score = 31.2 bits (71), Expect = 2.2
 Identities = 14/57 (24%), Positives = 32/57 (56%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           K+++ A+   K K KK    K+ +K+ A++ K++K   E  +    +++ +K + +K
Sbjct: 45  KEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101



 Score = 30.0 bits (68), Expect = 5.5
 Identities = 11/58 (18%), Positives = 32/58 (55%), Gaps = 3/58 (5%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEK---MKKKKKDAEKMKKKGAEK 950
            +K     K K  + ++ K ++K++A++ +++KG  +      + + AEK++ +  ++
Sbjct: 47  EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRKLQE 104



 Score = 30.0 bits (68), Expect = 5.8
 Identities = 12/60 (20%), Positives = 27/60 (45%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
            ++K  EK +   K K +     K +  EK K++K++    + +      +  +K + +K
Sbjct: 42  DEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 42.8 bits (101), Expect = 1e-04
 Identities = 20/34 (58%), Positives = 24/34 (70%)

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           K   K+K K+KK+KKKKKKKKKKK   +  KKK 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 40.8 bits (96), Expect = 6e-04
 Identities = 22/40 (55%), Positives = 25/40 (62%), Gaps = 6/40 (15%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           K+  K+  K KK+      KKKKKKKKKKK  KK  KKKK
Sbjct: 102 KQVLKEKAKQKKQ------KKKKKKKKKKKTSKKAAKKKK 135



 Score = 40.4 bits (95), Expect = 7e-04
 Identities = 20/33 (60%), Positives = 26/33 (78%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
           + + +K K+KK+KKKKKKKKKKK  K+A K KK
Sbjct: 103 QVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 40.4 bits (95), Expect = 7e-04
 Identities = 20/38 (52%), Positives = 28/38 (73%)

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           +K  +++ K+K K+KK+KKKKKKKKKKK  +    KKK
Sbjct: 98  QKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 39.3 bits (92), Expect = 0.002
 Identities = 19/33 (57%), Positives = 23/33 (69%)

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           K+   EK  +KK+KKKKKKKKKKK  KK  K+ 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 39.3 bits (92), Expect = 0.002
 Identities = 20/33 (60%), Positives = 21/33 (63%)

Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           K   K   K  K+KKKKKKKKKKK  KK  KKK
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 38.5 bits (90), Expect = 0.003
 Identities = 19/37 (51%), Positives = 27/37 (72%)

Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           K+ K+  ++  K+KK+KKKKKKKKKKK  KK  + +K
Sbjct: 99  KIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 34.7 bits (80), Expect = 0.060
 Identities = 25/62 (40%), Positives = 28/62 (45%), Gaps = 25/62 (40%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           KI K+  K+K            K KK+KK              KKKKKKKKK  KK  KK
Sbjct: 99  KIVKQVLKEK-----------AKQKKQKK--------------KKKKKKKKKTSKKAAKK 133

Query: 970 KK 971
           KK
Sbjct: 134 KK 135



 Score = 34.3 bits (79), Expect = 0.084
 Identities = 15/31 (48%), Positives = 24/31 (77%)

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+  K+K K+KK+KKKK+ +K KK +++A K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAK 132



 Score = 33.5 bits (77), Expect = 0.19
 Identities = 15/32 (46%), Positives = 23/32 (71%)

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+  K+K K+KK+KKKKK+ +K K   + A+K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKK 133



 Score = 32.4 bits (74), Expect = 0.43
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+  K+K K+KK+KKKKKK+ +K   K    +K
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKK 134



 Score = 31.2 bits (71), Expect = 0.98
 Identities = 18/47 (38%), Positives = 31/47 (65%), Gaps = 4/47 (8%)

Query: 892 FILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
           F++  +K  ++  K+ A    K+KK+KK+ +K +KKK ++K  KKKK
Sbjct: 93  FLVEEQKIVKQVLKEKA----KQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 14/34 (41%), Positives = 20/34 (58%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
           K   K + K+KK+K+ +K +KKK   K   KKK 
Sbjct: 102 KQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 43.8 bits (104), Expect = 1e-04
 Identities = 15/52 (28%), Positives = 30/52 (57%), Gaps = 2/52 (3%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
            K++ ++  AE+  KK      K+K  ++++K K K  +KK+K  A+K + +
Sbjct: 108 AKVQAQR--AEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQ 157



 Score = 43.0 bits (102), Expect = 2e-04
 Identities = 18/74 (24%), Positives = 42/74 (56%), Gaps = 8/74 (10%)

Query: 915 KKKKKEA-EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           +K+ +EA  K++ ++ AE+  KK++ A + +K        ++++K K K  +KK+K + +
Sbjct: 100 RKQLEEAKAKVQAQR-AEQQAKKREAAGEKEKA------PRRERKPKPKAPRKKRKPRAQ 152

Query: 974 KEAEKMKKKTEEAE 987
           K   +    ++ +E
Sbjct: 153 KPEPQHTPVSDISE 166



 Score = 42.2 bits (100), Expect = 5e-04
 Identities = 12/53 (22%), Positives = 26/53 (49%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K   +  ++ A+K     +K+K  ++++K K K  +K+ +   +K E    
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHT 159



 Score = 39.9 bits (94), Expect = 0.003
 Identities = 16/72 (22%), Positives = 31/72 (43%), Gaps = 6/72 (8%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           K K + + AE+  KK+ A   K+K    E+  K       K  +KK+K + +K + +   
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKP------KAPRKKRKPRAQKPEPQHTP 160

Query: 973 KKEAEKMKKKTE 984
             +  ++     
Sbjct: 161 VSDISELTVGQA 172



 Score = 34.9 bits (81), Expect = 0.13
 Identities = 10/64 (15%), Positives = 29/64 (45%), Gaps = 1/64 (1%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-KIGKK 954
           ++   + KK++ A +  K  +++++ +    +K  +   +K +         +E  +G+ 
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172

Query: 955 KKKK 958
            K K
Sbjct: 173 VKVK 176



 Score = 34.1 bits (79), Expect = 0.22
 Identities = 14/64 (21%), Positives = 28/64 (43%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           ++ ++ A+K     +K+K   + RK K     KK+K  A+K + +        +    + 
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172

Query: 962 KKKK 965
            K K
Sbjct: 173 VKVK 176



 Score = 33.4 bits (77), Expect = 0.38
 Identities = 13/68 (19%), Positives = 28/68 (41%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           + + + AE+  KK++   E EK  +++   K K  +K  +   +K   +        +  
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELT 168

Query: 962 KKKKKKKK 969
             +  K K
Sbjct: 169 VGQAVKVK 176



 Score = 33.4 bits (77), Expect = 0.41
 Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 6/55 (10%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKK------KKKKKKKKKKKKEAEKMKKKTEEAEK 988
           A K  ++   K+  ++ +++ KK      K+K  ++++K K     KK+   A+K
Sbjct: 99  ARKQLEEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 45.5 bits (108), Expect = 2e-04
 Identities = 25/88 (28%), Positives = 41/88 (46%), Gaps = 12/88 (13%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            KK  ++KK      I   +KK  E E  R ++  E   + + +++K KK   E +   ++
Sbjct: 1500 KKINQKKKGF----IPSNEKKSIEIEN-RNQEEKEPAGQGELESDKEKKGNLESVLSNQE 1554

Query: 957  K-------KKKKKKKKKKKKKKKKKEAE 977
            K       +   KK+K KK+ K   EAE
Sbjct: 1555 KNIEEDYAESDIKKRKNKKQYKSNTEAE 1582



 Score = 44.3 bits (105), Expect = 3e-04
 Identities = 18/88 (20%), Positives = 36/88 (40%), Gaps = 4/88 (4%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG-AEKMKKKKKDAEKMKKKGAEKIGKK 954
            +KK      +K + +I  + +++KE     + +   EK    +      +K   E   + 
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564

Query: 955  KKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
              KK+K    KK+ K   + E +   K+
Sbjct: 1565 DIKKRKN---KKQYKSNTEAELDFFLKR 1589



 Score = 36.6 bits (85), Expect = 0.080
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
             MKK  ++     K    KI    ++K KKKKKK+KKK+++ K  EK + +  EA
Sbjct: 719 GLMKKIFRNW--NGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEA 772



 Score = 36.2 bits (84), Expect = 0.12
 Identities = 15/74 (20%), Positives = 34/74 (45%)

Query: 891  GFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
            GFI   +K++   + +  E+     + + E++K +K      +  ++K+ E+   +   K
Sbjct: 1508 GFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIK 1567

Query: 951  IGKKKKKKKKKKKK 964
              K KK+ K   + 
Sbjct: 1568 KRKNKKQYKSNTEA 1581



 Score = 33.9 bits (78), Expect = 0.53
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 6/53 (11%)

Query: 941 EKMKKKGAEKIGKKKKKK------KKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             MKK      GK  + K      +K KKKKKK+KKK+++   + K + E AE
Sbjct: 719 GLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771



 Score = 32.8 bits (75), Expect = 1.2
 Identities = 35/115 (30%), Positives = 46/115 (40%), Gaps = 34/115 (29%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKK------------EAE----KMRKKKG-----AEKM 933
           F  K   R K + + K R KKKKKK            E E      RK+        +++
Sbjct: 863 FYLKPWHRSKLRSSHKDRMKKKKKKKNDFCFLTVWGMETELPFGSPRKRPSFFEPIFKEL 922

Query: 934 KKKKKDAEKMKKKG--AEKIGKKKKKKKKKKKKKKKK--------KKKKKKEAEK 978
           KKK +   K KKK     KI K++ K   K  K+ KK         K+  KE  K
Sbjct: 923 KKKIR---KFKKKYFLVLKILKERTKLFLKVSKETKKWIIKSFLFLKRIIKELSK 974


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 45.1 bits (107), Expect = 2e-04
 Identities = 23/88 (26%), Positives = 38/88 (43%), Gaps = 7/88 (7%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-----KIGKKKKK 957
           KK      +        EAEK R++   +K  K    +               + +  K 
Sbjct: 218 KKSDNLFTLDSGGSTDDEAEKKRQEV--KKKLKINNVSLDDDSTETPASDYYDVSEMVKF 275

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           KK KKKKKKKKK++K  + ++++ + E 
Sbjct: 276 KKPKKKKKKKKKRRKDLDEDELEPEAEG 303



 Score = 42.8 bits (101), Expect = 0.001
 Identities = 23/90 (25%), Positives = 44/90 (48%), Gaps = 7/90 (7%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            +   +RK+++  EKI K ++K++   K+   K   +      D +         + K K
Sbjct: 47  EEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAW-------LKKSK 99

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K++KKK+ ++KK     +KE E+  + T E
Sbjct: 100 KRQKKKEAERKKALLLDEKEKERAAEYTSE 129



 Score = 42.1 bits (99), Expect = 0.001
 Identities = 22/84 (26%), Positives = 45/84 (53%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           ++K++   + K+  E++R+K  K +E  +   K G  K   +  D +   K   +K  K+
Sbjct: 42  WKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKR 101

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEK 978
           +KKK+ ++KK     +K+K+ A +
Sbjct: 102 QKKKEAERKKALLLDEKEKERAAE 125



 Score = 39.4 bits (92), Expect = 0.010
 Identities = 21/98 (21%), Positives = 39/98 (39%), Gaps = 19/98 (19%)

Query: 910 KIRKK--------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           K+R K           K  + K  +   A +  KK+++  + K+K  E   K  K ++K+
Sbjct: 10  KLRAKLGLKPLPVPGAKPGSTKESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKR 69

Query: 962 KKKKKKKKKKKKKEAE-----------KMKKKTEEAEK 988
           ++  K    K   E +           K KK+ ++ E 
Sbjct: 70  ERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKEA 107



 Score = 39.4 bits (92), Expect = 0.011
 Identities = 23/110 (20%), Positives = 48/110 (43%), Gaps = 23/110 (20%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K  + KKKK     +KKKK++K+ ++   +  AE +      + K  ++   ++    K
Sbjct: 273 VKFKKPKKKK-----KKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPK 327

Query: 957 KKKKK------------------KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+K++                  K+++  +KK+KK   E + ++  E   
Sbjct: 328 KRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERS 377



 Score = 39.0 bits (91), Expect = 0.015
 Identities = 24/84 (28%), Positives = 44/84 (52%), Gaps = 9/84 (10%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK-IGKKKKK--------KKKKKKK 964
           KK++++   K ++++  EK+ K ++  E+  K G  K +G+            KK KK++
Sbjct: 43  KKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQ 102

Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
           KKK+ ++KK      K+K   AE 
Sbjct: 103 KKKEAERKKALLLDEKEKERAAEY 126



 Score = 34.0 bits (78), Expect = 0.40
 Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 12/84 (14%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK--- 964
             ++ K KK         KKK  +K K++K   E   +  AE +G      +K  ++   
Sbjct: 269 VSEMVKFKK--------PKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENA 320

Query: 965 -KKKKKKKKKKEAEKMKKKTEEAE 987
             +   KK+K+E E      ++ +
Sbjct: 321 RLEDSPKKRKEEQEDDDFVEDDDD 344



 Score = 32.4 bits (74), Expect = 1.3
 Identities = 18/102 (17%), Positives = 47/102 (46%), Gaps = 9/102 (8%)

Query: 896 RKKRTRRKKKK---------GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           +KK+ ++K++K          AE +       ++  +    +  +  KK+K++ E     
Sbjct: 280 KKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDDFV 339

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             +   +    K+++  +KK+KK + +  A ++ ++  E E+
Sbjct: 340 EDDDDLQASLAKQRRLAQKKRKKLRPEDIARQIAEERSEDEE 381



 Score = 29.7 bits (67), Expect = 8.1
 Identities = 18/88 (20%), Positives = 32/88 (36%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
             K+K       KKKK   + +   K      + K  ++ E  KKK              
Sbjct: 174 VEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTD 233

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + +KK+++ KKK +   +    +  E 
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTET 261


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 42.2 bits (100), Expect = 2e-04
 Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 8/90 (8%)

Query: 892 FILFR----KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG----AEKMKKKKKDAEKM 943
           FIL      K     K     +K    K+ +++A K  KK G    A++  K + +  K 
Sbjct: 41  FILKHLSTLKFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQ 100

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           +KK   K  K+++K++K++ K++KKK K +
Sbjct: 101 EKKKRSKEKKEEEKERKRQLKQQKKKAKHR 130



 Score = 41.5 bits (98), Expect = 3e-04
 Identities = 24/80 (30%), Positives = 47/80 (58%), Gaps = 3/80 (3%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD--AEKMKKKGAEKIGKKKKKKKKKK 962
           K         KK+KK   K  +++ A+++KK      A++  K   E+  K++KKK+ K+
Sbjct: 50  KFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERN-KQEKKKRSKE 108

Query: 963 KKKKKKKKKKKKEAEKMKKK 982
           KK+++K++K++ + +K K K
Sbjct: 109 KKEEEKERKRQLKQQKKKAK 128



 Score = 35.7 bits (83), Expect = 0.029
 Identities = 19/75 (25%), Positives = 42/75 (56%), Gaps = 9/75 (12%)

Query: 923 KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK---------KKKKKKKKKKKKKKKKKK 973
           K  +       K+KK + ++++++ A+++ K           K + ++ K++KKK+ K+K
Sbjct: 50  KFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEK 109

Query: 974 KEAEKMKKKTEEAEK 988
           KE EK +K+  + +K
Sbjct: 110 KEEEKERKRQLKQQK 124



 Score = 34.1 bits (79), Expect = 0.11
 Identities = 22/81 (27%), Positives = 48/81 (59%), Gaps = 7/81 (8%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           A K+  KK+KK   ++++++  A K  KK   + K     A++  K + ++ K++KKK+ 
Sbjct: 54  AVKVSIKKQKKINPKRLQRQ--AAKEVKKPGISTK-----AQQALKLEHERNKQEKKKRS 106

Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
           K+KK++++  K + K ++ + 
Sbjct: 107 KEKKEEEKERKRQLKQQKKKA 127



 Score = 31.5 bits (72), Expect = 0.74
 Identities = 12/34 (35%), Positives = 23/34 (67%), Gaps = 4/34 (11%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG 929
           +KKR++ KK++  E+ R+ K++KK+     K +G
Sbjct: 102 KKKRSKEKKEEEKERKRQLKQQKKK----AKHRG 131


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 41.8 bits (99), Expect = 2e-04
 Identities = 22/98 (22%), Positives = 55/98 (56%), Gaps = 9/98 (9%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---------MKKKG 947
           K+R   + +   +  R++ ++++E  K R+++  +K ++ ++   K          K++ 
Sbjct: 5   KRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRR 64

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           AEK  +++KK +K+K+++ K+ K + +E +   +K EE
Sbjct: 65  AEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEE 102



 Score = 33.0 bits (76), Expect = 0.19
 Identities = 19/82 (23%), Positives = 47/82 (57%), Gaps = 9/82 (10%)

Query: 916 KKKKEAEKM------RKKKGAEKMKKKKKDAEKMKKKGAE---KIGKKKKKKKKKKKKKK 966
           +K++E E++      ++++   + +  K+  E+++KK  E    + K  K  K+ + K++
Sbjct: 4   EKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRR 63

Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
           + +KK ++E +  K+K EE ++
Sbjct: 64  RAEKKAEEEKKLRKEKEEEIKE 85


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 43.6 bits (103), Expect = 2e-04
 Identities = 21/110 (19%), Positives = 47/110 (42%), Gaps = 35/110 (31%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKK--------KKKDAEKMKKKGAEKIGK------- 953
           E+ R+++K+K+   + R+ + A+ M K        + +  ++ +K  A K  K       
Sbjct: 106 EENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERLVAEA 165

Query: 954 --------------------KKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
                               +K+K++KKK K+ K+++K++K    +    
Sbjct: 166 REHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEKRMAALVAAA 215



 Score = 32.5 bits (74), Expect = 0.82
 Identities = 14/63 (22%), Positives = 35/63 (55%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
            +  +E  + +K+K A +  ++ + A+ M K        + +K+K+++K +  K++K++ 
Sbjct: 102 NQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERL 161

Query: 975 EAE 977
            AE
Sbjct: 162 VAE 164



 Score = 30.5 bits (69), Expect = 3.3
 Identities = 16/82 (19%), Positives = 38/82 (46%), Gaps = 13/82 (15%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKG------AEKMKKKKKDAEKMKKKGAEKIGKK 954
            +K+K+  +    K++K++   + R+  G        + ++  +  EK +KK       K
Sbjct: 142 AQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKK-------K 194

Query: 955 KKKKKKKKKKKKKKKKKKKKEA 976
            K+ K+++K++K+        A
Sbjct: 195 VKEAKRREKEEKRMAALVAAAA 216


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 43.4 bits (103), Expect = 2e-04
 Identities = 23/59 (38%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           + KK   E++KKK+ D ++ +    +K     +++KKKKKKK  KKKK KK A    K 
Sbjct: 149 LLKK--VERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 20/56 (35%), Positives = 28/56 (50%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            K+ E+++KK+   K  +  +D +K      EK  KKKK  KKKK KK      K 
Sbjct: 150 LKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMKA 205



 Score = 38.1 bits (89), Expect = 0.010
 Identities = 22/63 (34%), Positives = 28/63 (44%), Gaps = 7/63 (11%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
            KK  E+++KK+   KEAE       A   KK     E+ KKK  +   KKK KK     
Sbjct: 150 LKK-VERLKKKELDIKEAEA------ARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVG 202

Query: 964 KKK 966
            K 
Sbjct: 203 MKA 205



 Score = 30.7 bits (70), Expect = 3.1
 Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 1/54 (1%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
           +R ++K+    E    + KKK    +  KKK   K   KKK  +K+   G + I
Sbjct: 154 ERLKKKELDIKEAEAARDKKKSNNAEEEKKKKK-KKSAKKKKLKKVAAVGMKAI 206


>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain.  This
           presumed domain is found at the N-terminus of Ribosomal
           L30 proteins and has been termed RL30NT or NUC018.
          Length = 71

 Score = 39.9 bits (94), Expect = 3e-04
 Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 7/54 (12%)

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK+K  EK++ K       + KK+  KK  +KKK+K   K AEK  K+   AE+
Sbjct: 4   KKRKRNEKLRAK-------RAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAER 50



 Score = 33.4 bits (77), Expect = 0.057
 Identities = 19/68 (27%), Positives = 34/68 (50%), Gaps = 3/68 (4%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           KK+K+ E  + ++ K     K  +K   K+  K AEK  K+ +  ++ + + K++ KKK 
Sbjct: 4   KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAKKKG 63

Query: 974 ---KEAEK 978
                AE 
Sbjct: 64  NFYVPAEP 71



 Score = 30.7 bits (70), Expect = 0.53
 Identities = 11/35 (31%), Positives = 19/35 (54%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             KK+K+ +K + K+ KK+  +    KKK +   K
Sbjct: 2   LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFK 36



 Score = 30.3 bits (69), Expect = 0.73
 Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 5/49 (10%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK---KGAEKMKKKKKDAEK 942
           KKR R +K +   K  KK+  KK A K ++K   K AEK  K+ + AE+
Sbjct: 4   KKRKRNEKLR--AKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAER 50



 Score = 29.1 bits (66), Expect = 1.8
 Identities = 17/55 (30%), Positives = 34/55 (61%), Gaps = 4/55 (7%)

Query: 934 KKKKKDAE----KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           KK+K++ +    + KK+ A+K  +KKK+K   K+ +K  K+ +  E ++++ K E
Sbjct: 4   KKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKRE 58



 Score = 29.1 bits (66), Expect = 2.1
 Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 3/39 (7%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE---EAEK 988
             KK+K+ +K + K+ KK+  K+A + KK+      AEK
Sbjct: 2   LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEK 40


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 42.0 bits (99), Expect = 3e-04
 Identities = 27/91 (29%), Positives = 41/91 (45%), Gaps = 17/91 (18%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK--------------IGKKKKKKKK 960
           K + K   K  + K A K KKK+ ++        E                G+KK K+K 
Sbjct: 13  KNEPKWKSKRSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKL 72

Query: 961 KKKKKKKKKK---KKKKEAEKMKKKTEEAEK 988
           ++ K K KK+   K+K++ E+  K   EAEK
Sbjct: 73  RRDKLKAKKEEAEKEKEKEERFMKALAEAEK 103



 Score = 41.2 bits (97), Expect = 4e-04
 Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           +KK  EK+R+ K   K  K++ + EK K++   K   + +K++ + +KKK + K  K+E 
Sbjct: 65  RKKAKEKLRRDKLKAK--KEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEK 122

Query: 977 EKMKKKTE 984
           + M   T 
Sbjct: 123 KIMFADTS 130



 Score = 38.1 bits (89), Expect = 0.006
 Identities = 12/56 (21%), Positives = 30/56 (53%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
             K K ++   +  K++  ++  K+++  K   + +K++ + +KK+AE    K E+
Sbjct: 67  KAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEEK 122



 Score = 37.7 bits (88), Expect = 0.007
 Identities = 23/63 (36%), Positives = 40/63 (63%), Gaps = 4/63 (6%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           RKK K+K   +K++ KK  E+ +K+K+  E+  K  AE   +K++ + +KKK + K  K+
Sbjct: 65  RKKAKEKLRRDKLKAKK--EEAEKEKEKEERFMKALAE--AEKERAELEKKKAEAKLMKE 120

Query: 972 KKK 974
           +KK
Sbjct: 121 EKK 123



 Score = 36.6 bits (85), Expect = 0.018
 Identities = 25/65 (38%), Positives = 38/65 (58%), Gaps = 6/65 (9%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           RKK K   +  K K KK+EAEK ++K+  E+  K   +AEK +     ++ KKK + K  
Sbjct: 65  RKKAKEKLRRDKLKAKKEEAEKEKEKE--ERFMKALAEAEKERA----ELEKKKAEAKLM 118

Query: 962 KKKKK 966
           K++KK
Sbjct: 119 KEEKK 123



 Score = 36.2 bits (84), Expect = 0.025
 Identities = 16/46 (34%), Positives = 29/46 (63%), Gaps = 1/46 (2%)

Query: 944 KKKGAEKIGKKKKKKKKKKK-KKKKKKKKKKKEAEKMKKKTEEAEK 988
           +KK  EK+ + K K KK++  K+K+K+++  K   + +K+  E EK
Sbjct: 65  RKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEK 110



 Score = 33.9 bits (78), Expect = 0.14
 Identities = 16/51 (31%), Positives = 31/51 (60%), Gaps = 6/51 (11%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK-KKKKKEAEKMKKKTEE 985
           +KK  EK+++        K KK++ +K+K+K+++  K   EAEK + + E+
Sbjct: 65  RKKAKEKLRRDK-----LKAKKEEAEKEKEKEERFMKALAEAEKERAELEK 110



 Score = 33.5 bits (77), Expect = 0.21
 Identities = 16/51 (31%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK--KGAEKMKKKKKDAEKMKK 945
           +K  R K K   E+  K+K+K++   K   +  K   +++KKK +A+ MK+
Sbjct: 70  EKLRRDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKE 120


>gnl|CDD|216739 pfam01851, PC_rep, Proteasome/cyclosome repeat. 
          Length = 35

 Score = 38.6 bits (91), Expect = 4e-04
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 355 AAGIAMGLVLLGSKSESAIQDMVAYAKETQHEKI 388
            A +A+GL+  GS +E A++ ++ Y  +T +E  
Sbjct: 1   GAILALGLIHAGSGNEEALELLLPYLSDTSNEVR 34



 Score = 38.6 bits (91), Expect = 4e-04
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 749 AAGIAMGLVLLGSKSESAIQDMVAYAKETQHEKI 782
            A +A+GL+  GS +E A++ ++ Y  +T +E  
Sbjct: 1   GAILALGLIHAGSGNEEALELLLPYLSDTSNEVR 34



 Score = 36.6 bits (86), Expect = 0.002
 Identities = 10/35 (28%), Positives = 12/35 (34%)

Query: 714 GGCLGLGLAAMGSARDDIYEQLKFNLYQDDAVTGE 748
           G  L LGL   GS  ++  E L   L         
Sbjct: 1   GAILALGLIHAGSGNEEALELLLPYLSDTSNEVRA 35



 Score = 31.6 bits (73), Expect = 0.082
 Identities = 14/35 (40%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 679 GGLYALGLIHANHG-AVINDYLLGQVKDATNEMVR 712
           G + ALGLIHA  G     + LL  + D +NE+  
Sbjct: 1   GAILALGLIHAGSGNEEALELLLPYLSDTSNEVRA 35



 Score = 28.9 bits (66), Expect = 0.95
 Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 317 GGLYALGLIHANHG-AVINDYLLGQVKDATNE 347
           G + ALGLIHA  G     + LL  + D +NE
Sbjct: 1   GAILALGLIHAGSGNEEALELLLPYLSDTSNE 32



 Score = 27.0 bits (61), Expect = 3.6
 Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 1/26 (3%)

Query: 280 ATASLGVIHRGHETE-ALAHMQSYLP 304
           A  +LG+IH G   E AL  +  YL 
Sbjct: 2   AILALGLIHAGSGNEEALELLLPYLS 27



 Score = 27.0 bits (61), Expect = 3.6
 Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 1/26 (3%)

Query: 642 ATASLGVIHRGHETE-ALAHMQSYLP 666
           A  +LG+IH G   E AL  +  YL 
Sbjct: 2   AILALGLIHAGSGNEEALELLLPYLS 27


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 43.7 bits (103), Expect = 4e-04
 Identities = 18/93 (19%), Positives = 34/93 (36%), Gaps = 18/93 (19%)

Query: 897 KKRTRRKKK---------------KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE 941
              T R+ +               + AE I+K +  +K   K RKK      K++  + E
Sbjct: 304 FLWTLRRNRLRMIITPWRAPELHAENAE-IKKTRTAEKNEAKARKK--EIAQKRRAAERE 360

Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
             ++   E+     + + ++   K KKK     
Sbjct: 361 INREARQERAAAMARARARRAAVKAKKKGLIDA 393



 Score = 37.9 bits (88), Expect = 0.024
 Identities = 16/100 (16%), Positives = 44/100 (44%), Gaps = 18/100 (18%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKK----------------EAEKMRKKKGAEKMKKK-KK 938
            ++   R+K+K  +K      ++                 E  +++K + AEK + K +K
Sbjct: 289 EQRAQYREKQK-EKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARK 347

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
                K++ AE+   ++ ++++     + + ++   +A+K
Sbjct: 348 KEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387



 Score = 35.2 bits (81), Expect = 0.19
 Identities = 19/92 (20%), Positives = 38/92 (41%), Gaps = 13/92 (14%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
             K+TR  +K  A     K +KK+ A+   K++ AE+   ++   E+     A  + + +
Sbjct: 331 EIKKTRTAEKNEA-----KARKKEIAQ---KRRAAEREINREARQER-----AAAMARAR 377

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            ++   K KKK        E    + +  +  
Sbjct: 378 ARRAAVKAKKKGLIDASPNEDTPSENEESKGS 409



 Score = 31.7 bits (72), Expect = 2.1
 Identities = 17/79 (21%), Positives = 41/79 (51%), Gaps = 2/79 (2%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
             ++K+K+K+A     ++   +M      A ++  + AE I K +  +K + K +KK+  
Sbjct: 293 QYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAE-IKKTRTAEKNEAKARKKEIA 351

Query: 971 KKKKEAE-KMKKKTEEAEK 988
           +K++ AE ++ ++  +   
Sbjct: 352 QKRRAAEREINREARQERA 370


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 41.8 bits (99), Expect = 4e-04
 Identities = 33/108 (30%), Positives = 46/108 (42%), Gaps = 20/108 (18%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK-----------DAEKMKKKGAE 949
           ++KK          K KKK  E++  +    K KKK K           + +   K  A+
Sbjct: 1   KKKKSSPKRSKGMAKSKKKTREELDAEARERKRKKKHKGLKSGSRHNEGNTQSKGKGQAQ 60

Query: 950 K----IGKKKK-----KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K    IG KK      ++K K KKKK K KK K   E+  +K E  E+
Sbjct: 61  KKDPRIGSKKPIPLGVEEKVKPKKKKPKSKKPKLSPEQELEKLENDER 108


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 43.8 bits (104), Expect = 4e-04
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 9/78 (11%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK------ 962
            +I + KK+ K+ E+  ++  AE  +KK+++ EK K K  +K  KK  KK          
Sbjct: 95  AQIEELKKELKKLEEKIEQLEAEI-EKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALK 153

Query: 963 --KKKKKKKKKKKKEAEK 978
               KK  K+K  KE + 
Sbjct: 154 GLNYKKNFKEKLLKELKS 171



 Score = 35.8 bits (83), Expect = 0.15
 Identities = 17/72 (23%), Positives = 33/72 (45%), Gaps = 12/72 (16%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           ++  E E   ++      KK+ K  E       EKI + + + +KK+++ +K K K   +
Sbjct: 88  EENIEIEAQIEEL-----KKELKKLE-------EKIEQLEAEIEKKEEELEKAKNKFLDK 135

Query: 976 AEKMKKKTEEAE 987
           A K   K  ++ 
Sbjct: 136 AWKKLAKKYDSN 147



 Score = 34.6 bits (80), Expect = 0.27
 Identities = 22/93 (23%), Positives = 36/93 (38%), Gaps = 10/93 (10%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            KK   + ++  AE I KK+++ ++A+     K  +K+ KK         KG        
Sbjct: 104 LKKLEEKIEQLEAE-IEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGL------- 155

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
               KK  K+K  K+ K          + E  K
Sbjct: 156 --NYKKNFKEKLLKELKSVILNASSLLSLEELK 186


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 43.9 bits (103), Expect = 4e-04
 Identities = 21/89 (23%), Positives = 45/89 (50%), Gaps = 1/89 (1%)

Query: 897 KKRTRRKKKKGAEKIRKKK-KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           KK+  R  K+  +K   +  +  +E  ++   + A   K+ KK   + ++K A      +
Sbjct: 70  KKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVE 129

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           ++K +KK +K++K KK  ++ E    ++E
Sbjct: 130 EEKTEKKVRKRRKVKKMDEDVEDQGSESE 158



 Score = 40.0 bits (93), Expect = 0.006
 Identities = 27/126 (21%), Positives = 52/126 (41%), Gaps = 28/126 (22%)

Query: 890 LGFILFRKKRTRRKKKKGAEKIRKK--------------------KKKKKEAEKMRKKKG 929
           LGF  FR +R R + K       +K                    KK  +  ++  KK  
Sbjct: 26  LGFNKFRGERWRLQNKFLLAAGSRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTV 85

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK--------KKKKKKKKKKEAEKMKK 981
            E  +  ++ +E +  + A    + KK  ++ ++K        +++K +KK ++  K+KK
Sbjct: 86  VEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKK 145

Query: 982 KTEEAE 987
             E+ E
Sbjct: 146 MDEDVE 151



 Score = 30.4 bits (68), Expect = 6.2
 Identities = 13/54 (24%), Positives = 29/54 (53%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
             +RTRRK    +  + ++K +KK  ++ + KK  E ++ +  ++E    + +E
Sbjct: 113 TPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESE 166


>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
           This family consists of several eukaryotic ribosome
           biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
           protein that is essential for the maturation of 25 S
           rRNA and the 60 S ribosomal subunit assembly in
           Saccharomyces cerevisiae.
          Length = 164

 Score = 41.9 bits (99), Expect = 4e-04
 Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 11/79 (13%)

Query: 904 KKKGAEKIRKKKKKKK---EAEKMRKKKGAEKMKKKKKDAE-----KMKKKGAEKIGKKK 955
           KKKG   I+K+KK K    EA      +   K   K  + +     K      E    KK
Sbjct: 88  KKKG---IKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKK 144

Query: 956 KKKKKKKKKKKKKKKKKKK 974
           +++KK++  K +K++ K K
Sbjct: 145 RREKKERVAKNEKRELKNK 163



 Score = 36.1 bits (84), Expect = 0.033
 Identities = 22/86 (25%), Positives = 37/86 (43%), Gaps = 18/86 (20%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK----------- 962
           K K   + EK  KKKG +K KK K   ++   +   + G K+  K  +K           
Sbjct: 76  KPKPPTKWEKFAKKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDD 135

Query: 963 -------KKKKKKKKKKKKEAEKMKK 981
                  KK+++KK++  K  ++  K
Sbjct: 136 PKEDPFAKKRREKKERVAKNEKRELK 161


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 43.4 bits (103), Expect = 4e-04
 Identities = 17/93 (18%), Positives = 54/93 (58%), Gaps = 14/93 (15%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            K+  ++K ++  ++  +++ ++KE EK R+++   K+               E   +K 
Sbjct: 118 EKREKQKKLREEIDEFNEERIERKEEEKEREREEELKI--------------LEYQREKA 163

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +++++++ +++++K++K++E  +++ + EEAE 
Sbjct: 164 EREEEREAERRERKEEKEREVARLRAQQEEAED 196



 Score = 41.4 bits (98), Expect = 0.001
 Identities = 18/98 (18%), Positives = 60/98 (61%), Gaps = 5/98 (5%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           +++++  R++++ AE+  +K++K++E  ++R ++   + ++++ D  +      E   K+
Sbjct: 158 YQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKE 217

Query: 955 KKKKKKKKKKKKKKKKKKKKE-----AEKMKKKTEEAE 987
           ++K+K++ +K++++K++ ++       EK ++  EE  
Sbjct: 218 RQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERA 255



 Score = 39.5 bits (93), Expect = 0.006
 Identities = 14/95 (14%), Positives = 49/95 (51%), Gaps = 11/95 (11%)

Query: 896 RKKRTRRKKKKGAEKIR----KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
            K+   ++++   E  R    +K+ + +E E+   +K   K  + +++ E+        I
Sbjct: 245 EKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQ-------I 297

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            +K++++  +++++ ++ ++ ++E  + + + EE 
Sbjct: 298 EEKEERRAAEREEELEEGERLREEEAERQARIEEE 332



 Score = 39.1 bits (92), Expect = 0.010
 Identities = 18/97 (18%), Positives = 58/97 (59%), Gaps = 9/97 (9%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK---KKKKDAEKMKKKGAEKIGKKKKK 957
           R++++K  E+  K++++K+E ++ R+++  EK +   +++ + E  +++  EK  + ++ 
Sbjct: 215 RKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEEL 274

Query: 958 KKKKKKKKKKKKKKKKKEAEKM------KKKTEEAEK 988
           +++  +K++ K+ + ++E E+       ++  E  E+
Sbjct: 275 EQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEE 311



 Score = 38.7 bits (91), Expect = 0.011
 Identities = 17/82 (20%), Positives = 51/82 (62%), Gaps = 3/82 (3%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           ++K+ R +  +  E+  ++K+++KE E+  + K  E  ++K   AE+ +++ AE+  +K+
Sbjct: 122 KQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREK---AEREEEREAERRERKE 178

Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
           +K+++  + + ++++ + +  E
Sbjct: 179 EKEREVARLRAQQEEAEDEREE 200



 Score = 38.4 bits (90), Expect = 0.016
 Identities = 19/100 (19%), Positives = 66/100 (66%), Gaps = 8/100 (8%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMR---KKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           ++R +R++++  E+++++++  +  E+++   + +  EK +K+KK  E++ +   E+I +
Sbjct: 81  EEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIER 140

Query: 954 KKKKKKKKKKKKKK-----KKKKKKKEAEKMKKKTEEAEK 988
           K+++K+++++++ K     ++K +++E  + +++  + EK
Sbjct: 141 KEEEKEREREEELKILEYQREKAEREEEREAERRERKEEK 180



 Score = 38.4 bits (90), Expect = 0.016
 Identities = 17/118 (14%), Positives = 56/118 (47%), Gaps = 25/118 (21%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK--------------KKKKDAE 941
            K   + ++++  E + +++ K    E+ R++K  E+ +              K++++  
Sbjct: 32  IKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEY 91

Query: 942 KMKKKGAEKIG-----------KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + + +  E++             + ++K++K+KK +++  +  +E  + K++ +E E+
Sbjct: 92  EERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKERER 149



 Score = 38.4 bits (90), Expect = 0.017
 Identities = 20/93 (21%), Positives = 53/93 (56%), Gaps = 3/93 (3%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           RK+  R  +    E+I +++K+++E  + R ++  +  +  ++  E+ + +  EK  K+K
Sbjct: 65  RKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQK 124

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K +++  +  +++ ++K++E E+ +   EE  K
Sbjct: 125 KLREEIDEFNEERIERKEEEKERER---EEELK 154



 Score = 38.0 bits (89), Expect = 0.019
 Identities = 15/82 (18%), Positives = 48/82 (58%), Gaps = 10/82 (12%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK---------KKKKKKK 965
           KK+ K  EK  +++  E M++++  A   +++  E+  K+++++         ++++K++
Sbjct: 29  KKRIKAEEKEEERRIDEMMEEERLKALAEEEE-RERKRKEERREGRAVLQEQIEEREKRR 87

Query: 966 KKKKKKKKKEAEKMKKKTEEAE 987
           +++ +++ +E E+M +  E  +
Sbjct: 88  QEEYEERLQEREQMDEIIERIQ 109



 Score = 33.3 bits (77), Expect = 0.65
 Identities = 16/118 (13%), Positives = 59/118 (50%), Gaps = 29/118 (24%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKE-----------------AEKMRKKKGAEKMKKKKK 938
            ++  + +K++   ++R ++++ ++                   K R+K+  E  K++++
Sbjct: 172 ERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQ 231

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKK-----------KKKKKKKKKKKKEAEKMKKKTEE 985
             +++++   E+I +K+++ ++++           +K+ + ++ +++ AEK + K  E
Sbjct: 232 -KQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLE 288



 Score = 33.0 bits (76), Expect = 0.75
 Identities = 20/104 (19%), Positives = 57/104 (54%), Gaps = 20/104 (19%)

Query: 902 RKKKKGAEKIRKKKKKKKEA---------------EKMRKKKGAEKMKKKKKDAEKMKKK 946
            ++ K   +  ++++K+KE                EK R+++  E+++++++  E +++ 
Sbjct: 49  EERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERI 108

Query: 947 GAEKIGKKKKKKKKKKKKKK-----KKKKKKKKEAEKMKKKTEE 985
             E   + ++K++K+KK ++      +++ ++KE EK +++ EE
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEE 152



 Score = 31.4 bits (72), Expect = 2.2
 Identities = 18/83 (21%), Positives = 52/83 (62%), Gaps = 4/83 (4%)

Query: 909 EKIRKKKKKKKEAE--KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK-KKKK 965
           E++R+  +K + A+  K R  +  EK K+ K + ++ +++  E + +++ K   + ++++
Sbjct: 4   EELRELNEKLRAAKVNKERDAQIEEK-KRIKAEEKEEERRIDEMMEEERLKALAEEEERE 62

Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
           +K+K+++++    ++++ EE EK
Sbjct: 63  RKRKEERREGRAVLQEQIEEREK 85


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 43.1 bits (101), Expect = 4e-04
 Identities = 33/110 (30%), Positives = 45/110 (40%), Gaps = 6/110 (5%)

Query: 117 SATQAFLASVLQELKTCAPI--PDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSS 174
           + TQA  A+     K  AP   P +T V        A TP +   EAP   P   +  S+
Sbjct: 23  TTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSST 82

Query: 175 TTPGPETKTLES----LNEKEKQLQARITKLSKILSGEVSIDLHLQFLIR 220
               P++ T +     +N K K L+A  TK S     E+ I L     IR
Sbjct: 83  KVETPQSPTTKQVPTEINPKFKDLRAYYTKPSLEFKNEIGIILKKWTTIR 132


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 42.5 bits (100), Expect = 5e-04
 Identities = 18/89 (20%), Positives = 32/89 (35%), Gaps = 2/89 (2%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
            + + +   E+ +   + +   E    K   ++  K +K  +K K K   K   K K K 
Sbjct: 56  PQPEPEPPEEQPKPPTEPETPPEPTPPKP--KEKPKPEKKPKKPKPKPKPKPKPKPKVKP 113

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + K KK   K   K  A   +     +  
Sbjct: 114 QPKPKKPPSKTAAKAPAAPNQPARPPSAA 142



 Score = 40.9 bits (96), Expect = 0.002
 Identities = 21/84 (25%), Positives = 32/84 (38%), Gaps = 6/84 (7%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            E+ + + +  +E  K   +    +   +    +  +K   EK  KK K K K K K K 
Sbjct: 53  TEEPQPEPEPPEEQPKPPTE---PETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP 109

Query: 968 K---KKKKKKEAEKMKKKTEEAEK 988
           K   + K KK   K   K   A  
Sbjct: 110 KVKPQPKPKKPPSKTAAKAPAAPN 133



 Score = 37.5 bits (87), Expect = 0.024
 Identities = 17/74 (22%), Positives = 27/74 (36%), Gaps = 2/74 (2%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK--KKKKK 974
              E  +   +   E+ K   +     +    +   K K +KK KK K K K   K K K
Sbjct: 51  APTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPK 110

Query: 975 EAEKMKKKTEEAEK 988
              + K K   ++ 
Sbjct: 111 VKPQPKPKKPPSKT 124



 Score = 34.8 bits (80), Expect = 0.17
 Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 1/72 (1%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
            K       + +   +  +++       +   E    K K+K K +KK KK K K K + 
Sbjct: 46  AKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPK- 104

Query: 977 EKMKKKTEEAEK 988
            K K K +   K
Sbjct: 105 PKPKPKVKPQPK 116



 Score = 32.5 bits (74), Expect = 0.89
 Identities = 15/81 (18%), Positives = 23/81 (28%), Gaps = 8/81 (9%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
              +      K+K K ++  K  K K   K K K K   + K K        K   K   
Sbjct: 75  TPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPK--------KPPSKTAA 126

Query: 963 KKKKKKKKKKKKEAEKMKKKT 983
           K      +  +  +       
Sbjct: 127 KAPAAPNQPARPPSAASASGA 147


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 43.7 bits (104), Expect = 5e-04
 Identities = 28/95 (29%), Positives = 50/95 (52%), Gaps = 19/95 (20%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK-------- 954
              K AEK++++ ++KKE  +  + K  E+ +K+ + A K  KK A++I K+        
Sbjct: 541 ALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGG 600

Query: 955 -----------KKKKKKKKKKKKKKKKKKKKEAEK 978
                       +K+  K  +KK+KKKKK+KE ++
Sbjct: 601 YASVKAHELIEARKRLNKANEKKEKKKKKQKEKQE 635



 Score = 41.4 bits (98), Expect = 0.003
 Identities = 28/96 (29%), Positives = 52/96 (54%), Gaps = 12/96 (12%)

Query: 897 KKRTRRKKKKGAEKIRK----KKKKKKEAEKMRK------KKGAEKMKKKKKDAEKMKKK 946
            K   + K++  EK  K    + K  +EAEK  +      KK A+++ K+ +  +K    
Sbjct: 543 LKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYA 602

Query: 947 G--AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
              A ++ + +K+  K  +KK+KKKKK+K++ E++K
Sbjct: 603 SVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638



 Score = 39.4 bits (93), Expect = 0.010
 Identities = 19/73 (26%), Positives = 44/73 (60%), Gaps = 4/73 (5%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK----KKE 975
           E  +   ++ AE+ +   K+AEK+K++  EK  K ++++ K  ++ +K+ ++     KKE
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKE 585

Query: 976 AEKMKKKTEEAEK 988
           A+++ K+  + +K
Sbjct: 586 ADEIIKELRQLQK 598



 Score = 37.1 bits (87), Expect = 0.051
 Identities = 33/89 (37%), Positives = 42/89 (47%), Gaps = 19/89 (21%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRK-----------KKGAEKMKKKKKDAE------KM 943
           R++  K  EK  KKKKK+KE ++  K           +KG        K+A       KM
Sbjct: 613 RKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIMKM 672

Query: 944 KKKGA--EKIGKKKKKKKKKKKKKKKKKK 970
           K   +  EKI K KKKKKKK K  K K +
Sbjct: 673 KVPLSDLEKIQKPKKKKKKKPKTVKPKPR 701



 Score = 34.8 bits (81), Expect = 0.25
 Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 26/111 (23%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE-------KMKKKG- 947
           ++ R  +K    + K  +  + +K   K  +KK  +K K+K+K  E       K    G 
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQ 650

Query: 948 -AEKIGKKKKKK-----------------KKKKKKKKKKKKKKKKEAEKMK 980
             E +     K+                 +K +K KKKKKKK K    K +
Sbjct: 651 KGEVLSIPDDKEAIVQAGIMKMKVPLSDLEKIQKPKKKKKKKPKTVKPKPR 701


>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein.  This is a novel repeat
           in Pneumocystis carinii Major surface glycoprotein (MSG)
           some members of the alignment have up to nine repeats of
           this family, the repeats containing several conserved
           cysteines. The MSG of P. carinii is an important protein
           in host-pathogen interactions. Surface glycoprotein A
           from Pneumocystis carinii is a main target for the host
           immune system, this protein is implicated in the
           attachment of Pneumocystis carinii to the host alveolar
           epithelial cells, alveolar macrophages, host surfactant
           and possibly accounts in part for the hypoxia seen in
           Pneumocystis carinii pneumonia (PCP).
          Length = 81

 Score = 39.3 bits (92), Expect = 5e-04
 Identities = 19/77 (24%), Positives = 37/77 (48%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           +  K ++K  E     +  E+     K+  +  K+   K+ K  K ++   K+ K + KK
Sbjct: 1   KDFKLEEKCKELEEYCEFLEEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKK 60

Query: 972 KKKEAEKMKKKTEEAEK 988
           KKK  + +K+K  + +K
Sbjct: 61  KKKCKKALKEKCTKLKK 77



 Score = 35.1 bits (81), Expect = 0.016
 Identities = 21/61 (34%), Positives = 31/61 (50%), Gaps = 3/61 (4%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
              K++ EK+++K       KK    E++  K  +   KKKKK KK  K+K  K KK+  
Sbjct: 24  PLLKEKCEKLKEKC---YKLKKDLKLEELLLKELKGELKKKKKCKKALKEKCTKLKKESN 80

Query: 975 E 975
           E
Sbjct: 81  E 81


>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p), ribosomal RNA-processing protein 7 homolog A
           (Rrp7A), and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
           YCL031C gene from Saccharomyces cerevisiae. It is an
           essential yeast protein involved in pre-rRNA processing
           and ribosome assembly, and is speculated to be required
           for correct assembly of rpS27 into the pre-ribosomal
           particle. Rrp7A, also termed gastric cancer antigen
           Zg14, is the Rrp7p homolog mainly found in Metazoans.
           The cellular function of Rrp7A remains unclear
           currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 118

 Score = 40.3 bits (95), Expect = 6e-04
 Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 24/88 (27%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAE-------KMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           K+++E ++  K+   E       D +         +K    +    + K K+K+KKKKKK
Sbjct: 32  KREEEEKEEAKEARNEP------DEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKK 85

Query: 969 KK-----------KKKKEAEKMKKKTEE 985
           K+           KKK+E  +++KK EE
Sbjct: 86  KELEDFYRFQIREKKKEELAELRKKFEE 113



 Score = 29.9 bits (68), Expect = 2.4
 Identities = 16/50 (32%), Positives = 27/50 (54%), Gaps = 9/50 (18%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKM---------RKKKGAEKMKKK 936
           RK +T R++   A+   K+KKKKK+ E           +KK+   +++KK
Sbjct: 61  RKGKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQIREKKKEELAELRKK 110


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 43.2 bits (102), Expect = 6e-04
 Identities = 18/74 (24%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
           G ++    K+  K+ +   KK G  K    + DA +  +  A++   K+++ +KK+K++ 
Sbjct: 364 GDKRQELLKEYNKKLQDYTKKLGEVK---DETDASEEAEAKAKEEKLKQEENEKKQKEQA 420

Query: 967 KKKKKKKKEAEKMK 980
            + K+K+++ E+ K
Sbjct: 421 DEDKEKRQKDERKK 434



 Score = 42.1 bits (99), Expect = 0.001
 Identities = 16/70 (22%), Positives = 40/70 (57%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
            K  E +++  KK ++  K   +   E    ++ +A+  ++K  ++  +KK+K++  + K
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424

Query: 965 KKKKKKKKKK 974
           +K++K ++KK
Sbjct: 425 EKRQKDERKK 434



 Score = 37.9 bits (88), Expect = 0.026
 Identities = 17/63 (26%), Positives = 34/63 (53%), Gaps = 4/63 (6%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           KK +   K   + K + +A +  +     K K++K   E+ +KK  E+  + K+K++K +
Sbjct: 376 KKLQDYTKKLGEVKDETDASEEAE----AKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431

Query: 963 KKK 965
           +KK
Sbjct: 432 RKK 434



 Score = 36.3 bits (84), Expect = 0.081
 Identities = 18/76 (23%), Positives = 40/76 (52%), Gaps = 8/76 (10%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           R++  +   KK  +  +K  + K E +    ++   K K++K   E+ +KK      +K+
Sbjct: 367 RQELLKEYNKKLQDYTKKLGEVKDETDA--SEEAEAKAKEEKLKQEENEKK------QKE 418

Query: 956 KKKKKKKKKKKKKKKK 971
           +  + K+K++K ++KK
Sbjct: 419 QADEDKEKRQKDERKK 434



 Score = 35.2 bits (81), Expect = 0.16
 Identities = 20/84 (23%), Positives = 43/84 (51%), Gaps = 8/84 (9%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKK--------KKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           K  ++ K    +   K  E +K+         KK  E   +  A +  + K K++K K++
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410

Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
           + +KK+K++ + +K K++ +E +K
Sbjct: 411 ENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 42.7 bits (101), Expect = 7e-04
 Identities = 22/81 (27%), Positives = 32/81 (39%), Gaps = 6/81 (7%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEA--EKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
            F ++R      K  E+++K K KK +   +           K   K   K KKK A   
Sbjct: 317 NFNEERVE----KYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGAS 372

Query: 952 GKKKKKKKKKKKKKKKKKKKK 972
              K +  ++ K   KKK KK
Sbjct: 373 AVPKSETSQEAKSSGKKKVKK 393



 Score = 36.9 bits (86), Expect = 0.053
 Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 1/72 (1%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           K+K   + R +K  E++KK K    + +         KK  KK   K K KKKKKK   +
Sbjct: 314 KEKNFNEERVEKYIERLKKAKTKKTQTRLDSFFTA-TKKPIKKSNSKAKLKKKKKKAGAS 372

Query: 977 EKMKKKTEEAEK 988
              K +T +  K
Sbjct: 373 AVPKSETSQEAK 384


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 43.1 bits (101), Expect = 7e-04
 Identities = 35/115 (30%), Positives = 53/115 (46%), Gaps = 8/115 (6%)

Query: 865 VATKRLLHVAVSDVNDD-----VRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKK 919
               R  HVA  + +DD      ++E       I + +     +  K   K + KK+K  
Sbjct: 480 FEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETS---EADKDVNKSKNKKRKVD 536

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           E E+ +K K      K+KK  +KMK   A+K  + +  KKKKK+  K+KK   KK
Sbjct: 537 EEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591



 Score = 37.7 bits (87), Expect = 0.032
 Identities = 27/76 (35%), Positives = 36/76 (47%), Gaps = 3/76 (3%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           K +E     K   K   K RK    E+ KK K      K+K   K+ KK K    KK+++
Sbjct: 514 KYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQK---KLYKKMKYSNAKKEEQ 570

Query: 966 KKKKKKKKKEAEKMKK 981
            +  KKKKK+  K KK
Sbjct: 571 AENLKKKKKQIAKQKK 586



 Score = 36.6 bits (84), Expect = 0.067
 Identities = 22/70 (31%), Positives = 37/70 (52%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           K +E     K   K K KK+  ++ +++   K+     K+KK  KK K    KK+++AE 
Sbjct: 514 KYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAEN 573

Query: 979 MKKKTEEAEK 988
           +KKK ++  K
Sbjct: 574 LKKKKKQIAK 583



 Score = 30.0 bits (67), Expect = 7.7
 Identities = 15/81 (18%), Positives = 35/81 (43%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
            T+R  ++      +  + +  A +       E   +K+ + E    K +E     K   
Sbjct: 467 ETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVN 526

Query: 959 KKKKKKKKKKKKKKKKEAEKM 979
           K K KK+K  +++++K+ + +
Sbjct: 527 KSKNKKRKVDEEEEEKKLKMI 547


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 42.8 bits (101), Expect = 8e-04
 Identities = 20/63 (31%), Positives = 37/63 (58%), Gaps = 1/63 (1%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK-KKKKKKKKEAE 977
           KE EK    +  E+ K++KK+ E+      E++ ++++K++KKKK KK K+   + +   
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88

Query: 978 KMK 980
           K K
Sbjct: 89  KTK 91



 Score = 33.6 bits (77), Expect = 0.57
 Identities = 17/60 (28%), Positives = 34/60 (56%), Gaps = 4/60 (6%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            EK  +K+   +E E+ ++    EK ++++K  +K ++   E+  ++KKKK KK K+   
Sbjct: 27  VEKEVEKEVPDEEEEEEKE----EKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTT 82



 Score = 30.1 bits (68), Expect = 6.0
 Identities = 14/49 (28%), Positives = 33/49 (67%), Gaps = 1/49 (2%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           + E+ +K+  ++  +K   K+++  ++++K++KKKK  +K+K+ T E E
Sbjct: 38  EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKT-KKVKETTTEWE 85


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 42.5 bits (100), Expect = 8e-04
 Identities = 27/95 (28%), Positives = 42/95 (44%), Gaps = 10/95 (10%)

Query: 897 KKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           KK     K  KK  EK +K+++ ++   + R+ K      K  + AE    +  +K  + 
Sbjct: 321 KKYGSGAKHVKKEREKEQKEREGRQSEWEARQAKREGGDAKAGRAAEPTGSRTQQKGDRP 380

Query: 955 KKKKKKKKKKKKKKK--------KKKKKEAEKMKK 981
           K+ +KKK KK    K        KKK KE +   K
Sbjct: 381 KRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKANAK 415



 Score = 41.7 bits (98), Expect = 0.002
 Identities = 39/151 (25%), Positives = 54/151 (35%), Gaps = 30/151 (19%)

Query: 868 KRLLHVAVSDVNDDVRR-----EAVTALGFILFRKKRTRRKKKKGAEK-----------I 911
           +R+ +      ND + R     EA+  L      KK    K  K                
Sbjct: 26  RRIKNAEKKSKNDKLARIEEEIEALKTLDLDQLAKKYLFSKLIKVITSKPTKAWAESPEF 85

Query: 912 RKKKKKKK--------EAEKMRKKKGAEKMK------KKKKDAEKMKKKGAEKIGKKKKK 957
           R+    K         EAE + K K    +       K  K+  +   +G +KI   + K
Sbjct: 86  RQIINDKSHKANPSRSEAEVVEKLKSLNNVVSRLLNSKPVKNVLESIMEGLDKILGIETK 145

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK K KKK KK KKK+  E   K  EE  +
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESE 176



 Score = 37.9 bits (88), Expect = 0.022
 Identities = 28/114 (24%), Positives = 48/114 (42%), Gaps = 28/114 (24%)

Query: 896 RKKRT-RRKKKKGAEKIRKKKKK--KKEAEKMRKKKGAEKMKKKKKDAE----------- 941
           RK R  +R ++   EK      K  KKE EK       E+ +++ + +E           
Sbjct: 306 RKNRRGQRARQAIWEKKYGSGAKHVKKEREK-------EQKEREGRQSEWEARQAKREGG 358

Query: 942 -KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM------KKKTEEAEK 988
                + AE  G + ++K  + K+ +KKK KK    + +      KKK +E + 
Sbjct: 359 DAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKA 412



 Score = 34.4 bits (79), Expect = 0.28
 Identities = 23/82 (28%), Positives = 32/82 (39%), Gaps = 15/82 (18%)

Query: 910 KIRKKKKKKKE----AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           ++   K  K       E + K  G E   KK K              K KK KKK+ K+ 
Sbjct: 118 RLLNSKPVKNVLESIMEGLDKILGIETKAKKGKAK-----------KKTKKSKKKEAKES 166

Query: 966 KKKKKKKKKEAEKMKKKTEEAE 987
             K  +++ E+E   K  E AE
Sbjct: 167 SDKDDEEESESEDESKSEESAE 188


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 38.8 bits (91), Expect = 8e-04
 Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 16/84 (19%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA-----------EKIGKKKKKKKK 960
            + K +KK AE  RK +     K+ KK+     KK             E+I ++ ++KK+
Sbjct: 2   LRYKIEKKVAEHHRKLR-----KEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKR 56

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTE 984
           K++++K+++K+ +K      +K  
Sbjct: 57  KQEEEKERRKEARKAERAEARKRG 80



 Score = 36.1 bits (84), Expect = 0.007
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 23/78 (29%)

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK----------KKK-----KKKKKKK 970
           +KK AE  +K +K+A+K            K KKKK          K++     ++KK+K+
Sbjct: 7   EKKVAEHHRKLRKEAKKN--------PTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQ 58

Query: 971 KKKKEAEKMKKKTEEAEK 988
           +++KE  K  +K E AE 
Sbjct: 59  EEEKERRKEARKAERAEA 76



 Score = 34.9 bits (81), Expect = 0.021
 Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 14/62 (22%)

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK---------KKK-----EAEKMKKKTEEA 986
            K++KK AE   K +K+ KK    K KKKK          K++     E +K K++ E+ 
Sbjct: 4   YKIEKKVAEHHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKE 63

Query: 987 EK 988
            +
Sbjct: 64  RR 65



 Score = 30.3 bits (69), Expect = 0.92
 Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 22/88 (25%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD-----AEKMKKKGAEK 950
           +K                 +K +KEA+K    K      KKKKD     +   K++  E+
Sbjct: 8   KKVAEH------------HRKLRKEAKKNPTWK-----SKKKKDPGIPNSFPFKEEILEE 50

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           I +KK+K++++K+++K+ +K ++ EA K
Sbjct: 51  IEEKKRKQEEEKERRKEARKAERAEARK 78



 Score = 29.9 bits (68), Expect = 1.0
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKM 979
           + + K +KK  +  +K +KEA+K 
Sbjct: 1   RLRYKIEKKVAEHHRKLRKEAKKN 24



 Score = 29.1 bits (66), Expect = 2.0
 Identities = 9/33 (27%), Positives = 17/33 (51%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            + K +KK  +  +K +K+ KK      KK ++
Sbjct: 2   LRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKKD 34



 Score = 28.8 bits (65), Expect = 3.3
 Identities = 16/69 (23%), Positives = 34/69 (49%), Gaps = 16/69 (23%)

Query: 901 RRKKKKGAEK---IRKKKKK----------KKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
            RK +K A+K    + KKKK          K   E++ ++   +K K++++   + + + 
Sbjct: 14  HRKLRKEAKKNPTWKSKKKKDPGIPNSFPFK---EEILEEIEEKKRKQEEEKERRKEARK 70

Query: 948 AEKIGKKKK 956
           AE+   +K+
Sbjct: 71  AERAEARKR 79


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 43.1 bits (102), Expect = 8e-04
 Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 12/89 (13%)

Query: 909  EKIRKK-KKKKKEAEKMRK-----------KKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            EK+  + +KK+KE EK++             K  E ++++++  EK   K      K K 
Sbjct: 1105 EKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKG 1164

Query: 957  KKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            K  K +K K KKK+KKKK++   K K   
Sbjct: 1165 KASKLRKPKLKKKEKKKKKSSADKSKKAS 1193



 Score = 40.4 bits (95), Expect = 0.005
 Identities = 27/84 (32%), Positives = 42/84 (50%)

Query: 905  KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
            K+ A++ R K K K +A K+RK K  +K KKKKK +    KK +     K+    +K+K 
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKL 1209

Query: 965  KKKKKKKKKKEAEKMKKKTEEAEK 988
              K   KK   +   ++  EE + 
Sbjct: 1210 DDKPDNKKSNSSGSDQEDDEEQKT 1233



 Score = 40.0 bits (94), Expect = 0.007
 Identities = 24/100 (24%), Positives = 39/100 (39%), Gaps = 11/100 (11%)

Query: 900  TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
             +R K K   K  K +K K + ++ +KKK +    KK       K+  +++  K   K  
Sbjct: 1155 EQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPD 1214

Query: 960  KKK-----------KKKKKKKKKKKKEAEKMKKKTEEAEK 988
             KK           +++K K KK   +  K KK       
Sbjct: 1215 NKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSS 1254



 Score = 39.3 bits (92), Expect = 0.014
 Identities = 23/88 (26%), Positives = 40/88 (45%)

Query: 901  RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
               ++K   K ++ K K K      +K   +K +KKKK +   K K A  +G  K+    
Sbjct: 1145 EEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSD 1204

Query: 961  KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K+K   K   KK  +    ++ +E +K
Sbjct: 1205 EKRKLDDKPDNKKSNSSGSDQEDDEEQK 1232



 Score = 38.1 bits (89), Expect = 0.030
 Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 2/93 (2%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
             K +    +  ++ + K+ K K   +++   + +     K+ D E           KKK 
Sbjct: 1251 SKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKV 1310

Query: 957  KKKKKKKKKKKKKKKKKKE--AEKMKKKTEEAE 987
            KK+ +      KKKKK ++  A K K KT   +
Sbjct: 1311 KKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQ 1343



 Score = 37.7 bits (88), Expect = 0.038
 Identities = 17/91 (18%), Positives = 31/91 (34%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            +    +       K    +           KK  +K  +    A K KKK  +K  +KKK
Sbjct: 1277 RVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKK 1336

Query: 957  KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             K + K+    +  +  +   K K  +   +
Sbjct: 1337 SKTRVKQASASQSSRLLRRPRKKKSDSSSED 1367



 Score = 37.3 bits (87), Expect = 0.050
 Identities = 20/93 (21%), Positives = 38/93 (40%), Gaps = 1/93 (1%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            ++    ++ K   +    K +K K  +K +KKK +   K KK  +     K  +   K+K
Sbjct: 1150 KEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKA-SVVGNSKRVDSDEKRK 1208

Query: 956  KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
               K   KK       ++ + E+  K  + + K
Sbjct: 1209 LDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVK 1241



 Score = 36.6 bits (85), Expect = 0.079
 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 13/86 (15%)

Query: 901  RRKKKKGAEKIRKK-------------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
              KK+K  EK++               ++  +E E++ +K+ A++ + K K   K  K  
Sbjct: 1111 LEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLR 1170

Query: 948  AEKIGKKKKKKKKKKKKKKKKKKKKK 973
              K+ KK+KKKKK    K KK     
Sbjct: 1171 KPKLKKKEKKKKKSSADKSKKASVVG 1196



 Score = 36.6 bits (85), Expect = 0.080
 Identities = 25/97 (25%), Positives = 45/97 (46%), Gaps = 4/97 (4%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK----GAEKI 951
             K R  + KKK  +K +    K K+A  +   K  +  +K+K D +   KK    G+++ 
Sbjct: 1167 SKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQE 1226

Query: 952  GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              +++K K KK   K+ K KK   ++  +   E +  
Sbjct: 1227 DDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSD 1263



 Score = 36.6 bits (85), Expect = 0.081
 Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 3/93 (3%)

Query: 898  KRTRRKKKKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
             +    +  G  K     KKK KK  E        +K K +KK A K K K   K     
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKK-KSEKKTARKKKSKTRVKQASAS 1347

Query: 956  KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +  +  ++ +KKK     ++ +  +    E E 
Sbjct: 1348 QSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDED 1380



 Score = 36.6 bits (85), Expect = 0.093
 Identities = 26/116 (22%), Positives = 48/116 (41%), Gaps = 8/116 (6%)

Query: 880  DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKK-KKKKEAEKMRKKKGAEK------ 932
            ++        +      K +T+ K  K  +   KKK KKKK++   + KK +        
Sbjct: 1142 EEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV 1201

Query: 933  -MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
               +K+K  +K   K +   G  ++  +++K K KK   K+ K  +    K+ E  
Sbjct: 1202 DSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDN 1257



 Score = 32.3 bits (74), Expect = 1.7
 Identities = 19/94 (20%), Positives = 29/94 (30%), Gaps = 3/94 (3%)

Query: 895  FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK---DAEKMKKKGAEKI 951
             +K   +R K K     +  +   + +     K+G  K   K+                 
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDG 1294

Query: 952  GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
                  K     KKK KK+ +   A   KKK  E
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSE 1328



 Score = 32.3 bits (74), Expect = 1.8
 Identities = 20/88 (22%), Positives = 31/88 (35%), Gaps = 8/88 (9%)

Query: 895  FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
              K+   +   K    ++       +        G+    K     +K  KK  E     
Sbjct: 1265 LSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGS----KPSSPTKKKVKKRLEGSLAA 1320

Query: 955  KKKKKKKKK----KKKKKKKKKKKEAEK 978
             KKKKK +K    KKK K + K+  A +
Sbjct: 1321 LKKKKKSEKKTARKKKSKTRVKQASASQ 1348



 Score = 31.6 bits (72), Expect = 2.5
 Identities = 20/90 (22%), Positives = 31/90 (34%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K+   +K           +    +  K  K K A K     + +     K  +       
Sbjct: 1241 KRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGS 1300

Query: 957  KKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            K     KKK KK+ +    A K KKK+E+ 
Sbjct: 1301 KPSSPTKKKVKKRLEGSLAALKKKKKSEKK 1330



 Score = 31.2 bits (71), Expect = 4.1
 Identities = 19/90 (21%), Positives = 30/90 (33%), Gaps = 3/90 (3%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
             +  +    K+G  K   K+    +       K  +             KK  +K  +  
Sbjct: 1258 DEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGS 1317

Query: 956  KKKKKKKKKKKKK---KKKKKKEAEKMKKK 982
                KKKKK +KK   KKK K   ++    
Sbjct: 1318 LAALKKKKKSEKKTARKKKSKTRVKQASAS 1347


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 42.7 bits (100), Expect = 0.001
 Identities = 29/110 (26%), Positives = 48/110 (43%), Gaps = 19/110 (17%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKM-----------------RKKKGAEKMKKKKK 938
           +K + + KK+K  E+ + KKK+ +  + +                  +   A  +     
Sbjct: 200 KKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAP 259

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKK--KKKKKKEAEKMKKKTEEA 986
           D+E  + K AE    KK  K KKKK++K+K  KKKKKK        ++  
Sbjct: 260 DSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGG 309



 Score = 40.4 bits (94), Expect = 0.005
 Identities = 37/139 (26%), Positives = 55/139 (39%), Gaps = 51/139 (36%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKG------------------AEKM--------- 933
           RR +++  +  R+KKK++KE    R+                      E+M         
Sbjct: 88  RRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALPSDE 147

Query: 934 ----------------KKKKKDAEKM---KKKGAEKIGKKKKK-----KKKKKKKKKKKK 969
                            K   D+EK+   K + AE     +K      +KK KK KKK+K
Sbjct: 148 DDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEK 207

Query: 970 KKKKKEAEKMKKKTEEAEK 988
           K+K+KE +K KKK  E  K
Sbjct: 208 KEKEKERDKDKKKEVEGFK 226



 Score = 31.6 bits (71), Expect = 2.1
 Identities = 31/124 (25%), Positives = 49/124 (39%), Gaps = 45/124 (36%)

Query: 903 KKKKGAEKIRKKKKKKKEA--EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK------- 953
           +K + AE  +  +K    A  +K +K K  EK K+K+K+ +K KKK  E           
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEK-KEKEKERDKDKKKEVEGFKSLLLALDD 234

Query: 954 -----------------------------------KKKKKKKKKKKKKKKKKKKKKEAEK 978
                                              + ++ KK  K KKKK++K+K+E +K
Sbjct: 235 SPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKK 294

Query: 979 MKKK 982
            KK 
Sbjct: 295 KKKH 298



 Score = 31.2 bits (70), Expect = 3.6
 Identities = 9/37 (24%), Positives = 26/37 (70%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           E+  +++ +K K++KKK++K+K+ ++    +  +++E
Sbjct: 87  ERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDE 123



 Score = 31.2 bits (70), Expect = 3.6
 Identities = 23/112 (20%), Positives = 53/112 (47%), Gaps = 23/112 (20%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK------KGAEKMKKKKKDAE------ 941
           +F K+  R  K +  E   K+  +++EA K  +       K +   +K+ +D        
Sbjct: 1   IFAKEEQRHSKPRHTEADEKELARRREARKQEQANNPFYIKSSPSPQKRYQDPPGVEHIP 60

Query: 942 --------KMKKKG---AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
                    +K  G   +++  K +++++ +++ +K K++KKK+E EK  ++
Sbjct: 61  VVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRR 112



 Score = 30.8 bits (69), Expect = 4.5
 Identities = 8/34 (23%), Positives = 23/34 (67%)

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           +++  +++ K K++KKK++K+K+ +++      E
Sbjct: 87  ERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTE 120



 Score = 30.0 bits (67), Expect = 8.2
 Identities = 9/33 (27%), Positives = 22/33 (66%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           K + E+  ++  EK  ++KKK++K+K+ +++  
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHH 115



 Score = 29.6 bits (66), Expect = 8.6
 Identities = 8/38 (21%), Positives = 26/38 (68%)

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K+ ++++ +++ +K K++KKK++K++  + +  +   E
Sbjct: 83  KLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTE 120


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 42.9 bits (102), Expect = 0.001
 Identities = 11/46 (23%), Positives = 17/46 (36%)

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
              +  K +       K    KK   K  +K K++   +EK  K T
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKAT 859



 Score = 42.1 bits (100), Expect = 0.002
 Identities = 15/47 (31%), Positives = 23/47 (48%), Gaps = 3/47 (6%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           AEK +K +  A+   K  A   GKK   K  +K K++    +K  +A
Sbjct: 815 AEKPEKLRYLADAPAKDPA---GKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 40.5 bits (96), Expect = 0.004
 Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 3/47 (6%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           AEK +K    +       K    KK   K  +K K+     +K  +A
Sbjct: 815 AEKPEKL---RYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 35.9 bits (84), Expect = 0.11
 Identities = 7/41 (17%), Positives = 14/41 (34%)

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            EK+        K    KK   K  +K  ++     ++ + 
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEKDGKA 858



 Score = 35.9 bits (84), Expect = 0.12
 Identities = 11/49 (22%), Positives = 18/49 (36%), Gaps = 5/49 (10%)

Query: 921 AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           AEK  K +       K    +K   K +     +K K++    +K  K 
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFS-----RKTKQQYVASEKDGKA 858



 Score = 34.0 bits (79), Expect = 0.45
 Identities = 9/40 (22%), Positives = 16/40 (40%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +  +K +       K    KK   K + K K++   +EK
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK 854



 Score = 33.2 bits (77), Expect = 0.73
 Identities = 11/51 (21%), Positives = 22/51 (43%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K ++      + A ++  +K +K +       K    KK A K  +KT++ 
Sbjct: 798 KGRETRAPTVEEALELLAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQ 848



 Score = 32.1 bits (74), Expect = 1.7
 Identities = 14/68 (20%), Positives = 24/68 (35%), Gaps = 7/68 (10%)

Query: 895 FRKKRTRR--KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
             K R  R    ++  E + +K +K +       K         KK A K  +K  ++  
Sbjct: 796 LPKGRETRAPTVEEALELLAEKPEKLRYLADAPAKD-----PAGKKAAVKFSRKTKQQYV 850

Query: 953 KKKKKKKK 960
             +K  K 
Sbjct: 851 ASEKDGKA 858


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 40.8 bits (96), Expect = 0.001
 Identities = 24/103 (23%), Positives = 49/103 (47%), Gaps = 7/103 (6%)

Query: 885 EAVTALGFILFRKK-RTRRKKKKGAEKIRKKKKKK-KEAEKMRKKKGAEKMKKKKKDAEK 942
           +A   +  +   +  +   + KK +  +  + KK+ KE +KM+K+    K  K + D + 
Sbjct: 22  QAAPKIAIVNLGRIFQESPQAKKVSADLESEFKKRQKELQKMQKE-LKAKEAKLQDDGKM 80

Query: 943 MKKKGAEKIGKKKKKKKK----KKKKKKKKKKKKKKEAEKMKK 981
                  K   + KK+K      KK+++ +K   ++EAE+ +K
Sbjct: 81  EALSDRAKAEAEIKKEKLVNAFNKKQQEYEKDLNRREAEEEQK 123



 Score = 33.9 bits (78), Expect = 0.20
 Identities = 14/65 (21%), Positives = 30/65 (46%), Gaps = 1/65 (1%)

Query: 925 RKKKGAEKMK-KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           + KK +  ++ + KK  ++++K   E   K+ K +   K +    + K + E +K K   
Sbjct: 41  QAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVN 100

Query: 984 EEAEK 988
              +K
Sbjct: 101 AFNKK 105



 Score = 31.2 bits (71), Expect = 1.5
 Identities = 20/93 (21%), Positives = 49/93 (52%), Gaps = 8/93 (8%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            K+ +++ +K  ++++ K+ K ++  KM          K + + +K K   A      KK
Sbjct: 53  FKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRA---KAEAEIKKEKLVNAF----NKK 105

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEE-AEK 988
           +++ +K   +++ ++++K  EK+++  E  AEK
Sbjct: 106 QQEYEKDLNRREAEEEQKLLEKIQRAIESVAEK 138


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 42.4 bits (100), Expect = 0.001
 Identities = 14/61 (22%), Positives = 31/61 (50%), Gaps = 2/61 (3%)

Query: 917 KKKEAEKMRKK--KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
            + +   +R++  K  EK+K+K        K   +K  +  + ++KKK+KK K +++  +
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394

Query: 975 E 975
            
Sbjct: 395 N 395



 Score = 38.1 bits (89), Expect = 0.018
 Identities = 14/57 (24%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 930 AEKMKKK-KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
              ++++ +K  EK+K+K  +   K K ++ KK++  + ++KKK+K+A+  ++  + 
Sbjct: 339 GISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 36.6 bits (85), Expect = 0.053
 Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 11/75 (14%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            F  +      +   E++ K+ +K KE           K   K K     K++      K
Sbjct: 332 AFSGEPDGISLR---EELEKRIEKLKEKP--------PKPPTKAKPERDKKERPGRYRRK 380

Query: 954 KKKKKKKKKKKKKKK 968
           KK+KK K +++  + 
Sbjct: 381 KKEKKAKSERRGLQN 395



 Score = 35.8 bits (83), Expect = 0.10
 Identities = 14/56 (25%), Positives = 29/56 (51%), Gaps = 3/56 (5%)

Query: 936 KKKDAEKMKK---KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + D   +++   K  EK+ +K  K   K K ++ KK++  +   K K+K  ++E+
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSER 390


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 42.3 bits (99), Expect = 0.001
 Identities = 26/112 (23%), Positives = 41/112 (36%), Gaps = 20/112 (17%)

Query: 896 RKKRTRRKKKKGAEKIRKKK-------------------KKKKEAEKMRKKKGA-EKMKK 935
            K R +R K K  +  R K+                   K+ K+  K  ++ GA  K   
Sbjct: 201 AKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALA 260

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             K   + K K   +          KKK K+  KK  K E + +K   ++A+
Sbjct: 261 ALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKKAIKNAAKDAD 312



 Score = 36.2 bits (83), Expect = 0.081
 Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 14/102 (13%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE---------KMKKKKKDAEKMKKKGAEKI 951
           R +K+    K R+K+ K K  +  R K+  +         K  K+++   K  +K   + 
Sbjct: 193 RDRKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREA 252

Query: 952 GKKKKKKKKKKKKKKKKKK-----KKKKEAEKMKKKTEEAEK 988
           G + K     K K + K K     +    A  +KKK +E  K
Sbjct: 253 GARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMK 294


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 42.4 bits (98), Expect = 0.001
 Identities = 28/84 (33%), Positives = 51/84 (60%), Gaps = 1/84 (1%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           ++K  A+K  +K+ ++ EAE  ++    E+ K+ K + E+++K+  +K    KK + +KK
Sbjct: 89  EQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKK 148

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEA 986
           K+K  K +K +K AEK KK +  A
Sbjct: 149 KEKAVKAEKAEK-AEKTKKASTPA 171



 Score = 41.2 bits (95), Expect = 0.002
 Identities = 39/129 (30%), Positives = 66/129 (51%), Gaps = 10/129 (7%)

Query: 870 LLHVAVSDVNDDVRREAVTALGFILFRK---KRTRRKKKKGAEKIRK------KKKKKKE 920
           LL   + +  +D R+EA  +L      +   K  + +KKK  ++I K      K   KK 
Sbjct: 38  LLTAVLPNSINDQRKEAFASLELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKA 97

Query: 921 AEKMRKKKGAEKMKKKKKDAE-KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
           AEK  ++  AE  K+  ++ E K  K   E+I K+++KK+   KK + +KKK+K    + 
Sbjct: 98  AEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEK 157

Query: 980 KKKTEEAEK 988
            +K E+ +K
Sbjct: 158 AEKAEKTKK 166



 Score = 35.0 bits (79), Expect = 0.19
 Identities = 24/81 (29%), Positives = 40/81 (49%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           K+   K+ + AE   KK+  ++E  K  K +     K+++K    +KK  AEK  +K  K
Sbjct: 95  KKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVK 154

Query: 958 KKKKKKKKKKKKKKKKKEAEK 978
            +K +K +K KK       E+
Sbjct: 155 AEKAEKAEKTKKASTPAPVEE 175



 Score = 32.3 bits (72), Expect = 1.5
 Identities = 29/103 (28%), Positives = 44/103 (42%), Gaps = 25/103 (24%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK-------- 961
           K  KK KKK E+E        E++ ++ K   +   + A    KK KK K+K        
Sbjct: 214 KKNKKNKKKSESEATAAPASVEQVVEQPKVVTEEPHQQAAPQEKKNKKNKRKSESENVPA 273

Query: 962 -----------------KKKKKKKKKKKKKEAEKMKKKTEEAE 987
                            + +KK KK KKK E+EK+ ++  +AE
Sbjct: 274 ASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQAE 316



 Score = 32.0 bits (71), Expect = 1.7
 Identities = 30/108 (27%), Positives = 52/108 (48%), Gaps = 18/108 (16%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKM--------KKKKKD-------- 939
           +KK  + KKK  +EK+  ++  + EA K +K    + M        K++ KD        
Sbjct: 293 QKKNKKDKKKSESEKV-VEEPVQAEAPKSKKPTADDNMDFLDFVTAKEEPKDEPAETPAA 351

Query: 940 -AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
             E++ +   E + +K       + KKK KK KKK E+EK+ ++  E+
Sbjct: 352 PVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSESEKVTEQPVES 399


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 41.7 bits (98), Expect = 0.001
 Identities = 20/87 (22%), Positives = 33/87 (37%), Gaps = 11/87 (12%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK---------KKKDAEKMKKKGAEKI 951
           R+ K+ G    RK++KK +E+    +   +              K K+    + K A K+
Sbjct: 130 RKPKEPG--LKRKQRKKAQESATSPESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKL 187

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEK 978
                   +K  K K   KK  K+  K
Sbjct: 188 SSTASSGDEKSPKSKAAPKKAGKKMRK 214



 Score = 40.5 bits (95), Expect = 0.003
 Identities = 23/111 (20%), Positives = 37/111 (33%), Gaps = 23/111 (20%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK---------- 944
            +K+    K  +  E+ +K KK      + + K+   K K++KK  E             
Sbjct: 101 SKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTP 160

Query: 945 -------------KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
                        K+G  +  KK  K         +K  K K   +K  KK
Sbjct: 161 NSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKK 211



 Score = 30.5 bits (69), Expect = 4.4
 Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 1/70 (1%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-KIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           KE++K  K   A +  ++ K ++K      E K  +   K+K++KK ++     +   + 
Sbjct: 99  KESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSS 158

Query: 978 KMKKKTEEAE 987
                     
Sbjct: 159 TPNSSRPSTP 168


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 42.0 bits (99), Expect = 0.001
 Identities = 20/85 (23%), Positives = 37/85 (43%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           + +   +K + K+K  K+  K  KK   +      +   + ++   ++  KKKKKKKKKK
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAE 987
           KKK   +         +   +   +
Sbjct: 67  KKKNLGEAYDLAYDLPVVWSSAAFQ 91



 Score = 41.6 bits (98), Expect = 0.002
 Identities = 23/77 (29%), Positives = 38/77 (49%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
            + K++K++ ++   KK  +K KK   D +    +   +  + + K+  KKKKKKKKKKK
Sbjct: 8   NEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKK 67

Query: 972 KKKEAEKMKKKTEEAEK 988
           KK   E      +    
Sbjct: 68  KKNLGEAYDLAYDLPVV 84



 Score = 32.4 bits (74), Expect = 1.4
 Identities = 23/76 (30%), Positives = 33/76 (43%), Gaps = 19/76 (25%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK-------------------KKKKKKKKK 972
            M +  ++  K +K+  ++ G KKK KK KK                     K+  KKKK
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60

Query: 973 KKEAEKMKKKTEEAEK 988
           KK+ +K KK   EA  
Sbjct: 61  KKKKKKKKKNLGEAYD 76


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 40.5 bits (95), Expect = 0.001
 Identities = 23/85 (27%), Positives = 33/85 (38%), Gaps = 4/85 (4%)

Query: 905 KKGA--EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
            KGA  E   + K +  + EK  K    +    K K A K     AEK  K  + + +  
Sbjct: 90  AKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEK--KVNEARAEAV 147

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAE 987
            +KK  +       E    + EEAE
Sbjct: 148 AEKKAAEAAAVAAEEAAAAEEEEAE 172



 Score = 36.3 bits (84), Expect = 0.035
 Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 7/69 (10%)

Query: 927 KKGA--EKMKKKKKDAEKMKKKG-----AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
            KGA  E   + K +A K +K+G      +K+ K KK  KK   + +KK  + + EA   
Sbjct: 90  AKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAE 149

Query: 980 KKKTEEAEK 988
           KK  E A  
Sbjct: 150 KKAAEAAAV 158



 Score = 32.1 bits (73), Expect = 0.87
 Identities = 13/42 (30%), Positives = 19/42 (45%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
           KK K ++  +  KK   EAEK   +  AE + +KK       
Sbjct: 117 KKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAV 158



 Score = 30.1 bits (68), Expect = 4.3
 Identities = 19/63 (30%), Positives = 27/63 (42%)

Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           + KK     + K      + + K      +K+ K   KK K  K KK  KK A + +KK 
Sbjct: 80  LLKKHLLGGVAKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKV 139

Query: 984 EEA 986
            EA
Sbjct: 140 NEA 142


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 38.6 bits (90), Expect = 0.002
 Identities = 22/63 (34%), Positives = 25/63 (39%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           K K K +  EK  K    E   K K    K KK GA        +   +K K KK KK  
Sbjct: 4   KNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDC 63

Query: 973 KKE 975
           KK 
Sbjct: 64  KKG 66



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 15/55 (27%), Positives = 24/55 (43%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK K   + K+KG +   K+   K K K+ K KK      +    +  TE+ + 
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKG 56



 Score = 37.8 bits (88), Expect = 0.003
 Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 1/67 (1%)

Query: 909 EKIRKKKKKKKEAE-KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            K  K K + KE   K  KK+   K K K+  A+K      +  G +   +K K KK KK
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 968 KKKKKKK 974
             KK  K
Sbjct: 62  DCKKGCK 68



 Score = 34.3 bits (79), Expect = 0.043
 Identities = 15/74 (20%), Positives = 27/74 (36%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
            KK K   + ++K      K+     +  + K  +      +    +   +K K KK KK
Sbjct: 2   NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61

Query: 975 EAEKMKKKTEEAEK 988
           + +K  K      K
Sbjct: 62  DCKKGCKWEGNTCK 75



 Score = 33.6 bits (77), Expect = 0.082
 Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 7/75 (9%)

Query: 903 KKKKGAEKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           KK K   + ++K     KK++  K + K+G    K KK  A   +  G E   +K K KK
Sbjct: 3   KKNKTKTECKEKGCKWDKKEDDGKCKPKEG----KAKKNGAPVTQTAGTETTTEKCKGKK 58

Query: 960 KKKKKKKKKKKKKKK 974
            KK  KK  K +   
Sbjct: 59  DKKDCKKGCKWEGNT 73


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 41.4 bits (98), Expect = 0.002
 Identities = 16/62 (25%), Positives = 34/62 (54%)

Query: 921 AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
           A++ + +KG  ++    +  ++MK++    +  KK+ K   K +KKK+K++ K    K K
Sbjct: 30  AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89

Query: 981 KK 982
            +
Sbjct: 90  PR 91



 Score = 37.5 bits (88), Expect = 0.026
 Identities = 15/61 (24%), Positives = 29/61 (47%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            ++KK +  ++      E+ K+ K++ +       E     K +KKK+K++ K  K K K
Sbjct: 30  AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89

Query: 974 K 974
            
Sbjct: 90  P 90



 Score = 33.7 bits (78), Expect = 0.42
 Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 5/66 (7%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK-KKDAEKMKKKGAEKIGKKKKK 957
             R+K +KG  ++    ++ KE   M+++  A  + KK  K   K +KK  EK   K  K
Sbjct: 30  AQRKKSQKGELEVTNLNEQYKE---MKEELKAALLDKKELKAWHKAQKK-KEKQEAKAAK 85

Query: 958 KKKKKK 963
            K K +
Sbjct: 86  AKSKPR 91


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 40.3 bits (95), Expect = 0.002
 Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 5/89 (5%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
              K K   EK+ KK+ ++ +      +   EK+KK +++ E    +  E + + K+ +K
Sbjct: 63  ALNKLKTRLEKL-KKELEELKQRIAELQAQIEKLKKGREETE----ERTELLEELKQLEK 117

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + KK K + +K +K + E+++K  EE + 
Sbjct: 118 ELKKLKAELEKYEKNDPERIEKLKEETKV 146



 Score = 31.4 bits (72), Expect = 1.3
 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 5/72 (6%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEK----MRKKKGAEK-MKKKKKDAEKMKKKGAEK 950
             +  +++  +   +I K KK ++E E+    + + K  EK +KK K + EK +K   E+
Sbjct: 77  ELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPER 136

Query: 951 IGKKKKKKKKKK 962
           I K K++ K  K
Sbjct: 137 IEKLKEETKVAK 148


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 41.3 bits (97), Expect = 0.002
 Identities = 19/86 (22%), Positives = 44/86 (51%), Gaps = 3/86 (3%)

Query: 905 KKGAEKIRKKKKKKKEAEKMR---KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           +  AE I KK++  K  E+++   +KK  E ++ +++  E+   +    +  K+ +++++
Sbjct: 135 QNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRE 194

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAE 987
            K    KK K++ E    + + E  E
Sbjct: 195 SKWAILKKLKRRAELGSQQVQGEALE 220



 Score = 36.7 bits (85), Expect = 0.056
 Identities = 21/101 (20%), Positives = 47/101 (46%), Gaps = 11/101 (10%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK--KDAEKMKKKGAEKIG 952
            R +    KKK+    + + +++    +K +K    E++K+ +     E  K +  E + 
Sbjct: 54  KRAQYRSAKKKE----LSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLN 109

Query: 953 KKKKKK-----KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K K+     K+  +  K+ +K+ +  AE ++KK E  + 
Sbjct: 110 LEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKN 150



 Score = 35.9 bits (83), Expect = 0.10
 Identities = 17/88 (19%), Positives = 41/88 (46%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
             K +       +K K+ +  EK   +   E  K+ +  AE ++KK      +++ K + 
Sbjct: 99  IAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFEN 158

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +KK ++  + +++K  E++ +   + E 
Sbjct: 159 EKKLEESLELEREKFEEQLHEANLDLEF 186



 Score = 35.9 bits (83), Expect = 0.10
 Identities = 15/95 (15%), Positives = 44/95 (46%), Gaps = 10/95 (10%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM-------KKKGAEKIGKKK 955
           K K+     ++  +  KE +K  +   AE ++KK+++ +         +KK  E +  ++
Sbjct: 112 KDKELELLEKELDELSKELQKQLQNT-AEIIEKKRENNKNEERLKFENEKKLEESLELER 170

Query: 956 KK--KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +K  ++  +     + K+ +++ E      ++ ++
Sbjct: 171 EKFEEQLHEANLDLEFKENEEQRESKWAILKKLKR 205



 Score = 35.2 bits (81), Expect = 0.17
 Identities = 13/86 (15%), Positives = 41/86 (47%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           + +   +    K +  +     + K+     K+  + +++++K+        +KK++  K
Sbjct: 90  QFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNK 149

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +++ K + +KK  E ++ + E+ E+
Sbjct: 150 NEERLKFENEKKLEESLELEREKFEE 175



 Score = 32.1 bits (73), Expect = 1.5
 Identities = 13/76 (17%), Positives = 36/76 (47%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           +     +E EK   +K A+    KKK+  +++++   +  ++K    ++ K+ +   + +
Sbjct: 39  RILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDE 98

Query: 973 KKEAEKMKKKTEEAEK 988
             + E ++    E +K
Sbjct: 99  IAKLEALELLNLEKDK 114



 Score = 31.3 bits (71), Expect = 2.7
 Identities = 16/95 (16%), Positives = 44/95 (46%), Gaps = 1/95 (1%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA-EKMKKKGAEKIGK 953
            R   T  + +K A + R + +  K+ E  + ++     KK++K+   +  K+    +  
Sbjct: 38  SRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQD 97

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +  K +  +    +K K+ +   +++ + ++E +K
Sbjct: 98  EIAKLEALELLNLEKDKELELLEKELDELSKELQK 132



 Score = 30.9 bits (70), Expect = 3.2
 Identities = 16/95 (16%), Positives = 42/95 (44%), Gaps = 8/95 (8%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK---- 956
              +K+   +  KKK+  +  E++  +K  +K    ++  +  +    ++I K +     
Sbjct: 50  EANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQF-ELALQDEIAKLEALELL 108

Query: 957 ---KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              K K+ +  +K+  +  K+  ++++   E  EK
Sbjct: 109 NLEKDKELELLEKELDELSKELQKQLQNTAEIIEK 143



 Score = 30.6 bits (69), Expect = 5.2
 Identities = 14/95 (14%), Positives = 43/95 (45%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           L + +     +KK  + +  ++ K+ E     +    E ++    + +K  +   +++ +
Sbjct: 66  LSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDE 125

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             K+ +K+ +   +  +KK++  +  ++   E EK
Sbjct: 126 LSKELQKQLQNTAEIIEKKRENNKNEERLKFENEK 160


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 39.4 bits (93), Expect = 0.002
 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 2/69 (2%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            +KKK +  ++      E+ K  K+  E       E   K  +K +KK +K K K +KKK
Sbjct: 32  AQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKEL--KAWEKAEKKAEKAKAKAEKKK 89

Query: 974 KEAEKMKKK 982
            + E+ K +
Sbjct: 90  AKKEEPKPR 98



 Score = 36.4 bits (85), Expect = 0.026
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK+ K  +K +KK +K K +AEK K K EE + 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 33.3 bits (77), Expect = 0.23
 Identities = 23/93 (24%), Positives = 37/93 (39%), Gaps = 20/93 (21%)

Query: 887 VTALGFILFRKKRTRRKKKKGAEKIR--------KKKKKKKEAEKMRKKKGAEKMKKKKK 938
           V A+  ++       +KKK    ++         K  K+  EA  + KK+     K +KK
Sbjct: 18  VVAIIAVVGLIVALAQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKK 77

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
                         K K K +KKK KK++ K +
Sbjct: 78  AE------------KAKAKAEKKKAKKEEPKPR 98


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 41.7 bits (98), Expect = 0.002
 Identities = 22/94 (23%), Positives = 41/94 (43%), Gaps = 1/94 (1%)

Query: 896 RKKRTRRKKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           +K +   K     +  + K K+KKK  +K+      +     K ++++     A  + +K
Sbjct: 58  KKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRK 117

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K K +K KKK    K   K+ +K+    +E  K
Sbjct: 118 PKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIK 151



 Score = 39.4 bits (92), Expect = 0.011
 Identities = 20/84 (23%), Positives = 30/84 (35%), Gaps = 8/84 (9%)

Query: 910 KIRKKKKKKKEAE--------KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           K+++KKK KK+             K    +          +  K   EK+ KK    K  
Sbjct: 76  KLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKST 135

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEE 985
            KKKKK    K +  +    K + 
Sbjct: 136 NKKKKKVLSSKDELIKYDNNKPKS 159



 Score = 34.8 bits (80), Expect = 0.26
 Identities = 24/86 (27%), Positives = 35/86 (40%), Gaps = 7/86 (8%)

Query: 897 KKRTRRKKKKGAEKIRKK------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK-GAE 949
           KK   ++KKK  +K+            K  +++         M+K K   EK+KKK    
Sbjct: 73  KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKE 975
           K   KKKKK    K +  K    K +
Sbjct: 133 KSTNKKKKKVLSSKDELIKYDNNKPK 158



 Score = 31.3 bits (71), Expect = 3.4
 Identities = 16/56 (28%), Positives = 22/56 (39%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
           F   +   K+  G   I   +K K + EK++KK    K   KKK      K    K
Sbjct: 96  FFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIK 151



 Score = 30.2 bits (68), Expect = 6.5
 Identities = 24/93 (25%), Positives = 37/93 (39%), Gaps = 7/93 (7%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKM---RKKKGAEKMKKKKK----DAEKMKKKGAEKIGKKK 955
             KK  EK+ KK KK  + + +   +K K  +K K KKK    D             +  
Sbjct: 48  INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFA 107

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                   +K K K +K K+   + K T + +K
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKK 140


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 41.5 bits (98), Expect = 0.002
 Identities = 14/62 (22%), Positives = 22/62 (35%), Gaps = 5/62 (8%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
            K+KK+K+   ++   KG        K AEK+K  G E +             K      
Sbjct: 766 EKQKKEKESKSELEALKGVGA-----KTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSA 820

Query: 972 KK 973
            +
Sbjct: 821 DR 822



 Score = 41.5 bits (98), Expect = 0.003
 Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 5/80 (6%)

Query: 907 GAEKIRKKKKKKKEAE--KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           G   +R K     E       K    EK KK+K+   +++     K    K  +K K   
Sbjct: 740 GLPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEA---LKGVGAKTAEKLKDAG 796

Query: 965 KKKKKKKKKKEAEKMKKKTE 984
            +        + + +  K +
Sbjct: 797 VETVTDLTAADPDAVAAKVD 816



 Score = 38.1 bits (89), Expect = 0.024
 Identities = 14/75 (18%), Positives = 23/75 (30%), Gaps = 6/75 (8%)

Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK-KKKKDAEKMKKKG 947
            LG     K     K+KK  E   + +  K         K AEK+K    +    +    
Sbjct: 753 ELGCCNNPKCNYTEKQKKEKESKSELEALKGV-----GAKTAEKLKDAGVETVTDLTAAD 807

Query: 948 AEKIGKKKKKKKKKK 962
            + +  K       +
Sbjct: 808 PDAVAAKVDGVSADR 822



 Score = 30.4 bits (69), Expect = 5.8
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            +K+KK+K+ K + +  K    K AEK+K    E
Sbjct: 765 TEKQKKEKESKSELEALKGVGAKTAEKLKDAGVE 798


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 40.8 bits (96), Expect = 0.002
 Identities = 30/88 (34%), Positives = 38/88 (43%), Gaps = 3/88 (3%)

Query: 904 KKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEK--MKKKGAEKIGKKKKKKKK 960
           KK+  E I K  K+KK E  K R     +K   K    EK          I KK K++ K
Sbjct: 89  KKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKEESK 148

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK  K  +K K KK  +K   KT   + 
Sbjct: 149 KKNSKNIEKLKIKKYFKKCNFKTNLKKD 176



 Score = 36.2 bits (84), Expect = 0.066
 Identities = 27/90 (30%), Positives = 40/90 (44%), Gaps = 8/90 (8%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD--------AEKMKK 945
           L + K+ + K+K     +   K K+ E +  +KK+  +  +K  +          EK K 
Sbjct: 192 LIKNKKLKNKQKTLKNILYNLKDKELEKKGYKKKQIEKFFQKIYEKYKNKPHFIIEKNKY 251

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           K   KI  K KK  KKKKK  KK  +  K 
Sbjct: 252 KDLNKIIGKFKKSFKKKKKNSKKNYENIKI 281



 Score = 35.0 bits (81), Expect = 0.14
 Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 12/98 (12%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE----KMKKKKKDAEKMKKKGAEKIG 952
           K+ +++K  K  EK++ KK  KK   K   KK  +    K+ K +++     KK   K  
Sbjct: 144 KEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQK 203

Query: 953 K--------KKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
                    K K+ +KK  KKK+ +K  +K  EK K K
Sbjct: 204 TLKNILYNLKDKELEKKGYKKKQIEKFFQKIYEKYKNK 241



 Score = 33.9 bits (78), Expect = 0.35
 Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 14/105 (13%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEK---MRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
           F++K+  R KK+     +KK  K    EK   +  K      K K++  +K  K   +  
Sbjct: 100 FKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEKLK 159

Query: 952 GKKKKKK-------KKKKKKKKKKKKKKKK----EAEKMKKKTEE 985
            KK  KK       KK  K K  K  K ++    + +K+K K + 
Sbjct: 160 IKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKT 204



 Score = 32.7 bits (75), Expect = 0.74
 Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 8/67 (11%)

Query: 903 KKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           KKK+  +  +K  +K K +   + +K        K KD  K+  K  +   KKKK  KK 
Sbjct: 223 KKKQIEKFFQKIYEKYKNKPHFIIEKN-------KYKDLNKIIGKFKKSFKKKKKNSKKN 275

Query: 962 KKKKKKK 968
            +  K  
Sbjct: 276 YENIKIN 282



 Score = 32.7 bits (75), Expect = 0.92
 Identities = 21/79 (26%), Positives = 35/79 (44%), Gaps = 2/79 (2%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
             KKK  K  EK++ KK  +K   K    + +K K  + I  ++    K KK K K+K  
Sbjct: 146 ESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKTL 205

Query: 972 KK--KEAEKMKKKTEEAEK 988
           K      +  + + +  +K
Sbjct: 206 KNILYNLKDKELEKKGYKK 224



 Score = 31.2 bits (71), Expect = 2.6
 Identities = 27/81 (33%), Positives = 36/81 (44%), Gaps = 8/81 (9%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK-----KKKKKKK 964
           K++ K+K  K      K K  EK   KKK  EK  +K      K K K     +K K K 
Sbjct: 197 KLKNKQKTLKNILYNLKDKELEKKGYKKKQIEKFFQK---IYEKYKNKPHFIIEKNKYKD 253

Query: 965 KKKKKKKKKKEAEKMKKKTEE 985
             K   K KK  +K KK +++
Sbjct: 254 LNKIIGKFKKSFKKKKKNSKK 274


>gnl|CDD|225620 COG3078, COG3078, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 169

 Score = 39.4 bits (92), Expect = 0.003
 Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 16/106 (15%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAE-----KMRKKKGAEKMKKKKKDAEKMKKKGAE- 949
           R K+TRR + K  +K R+K +++ +AE     + +K+KG     +     E    K    
Sbjct: 3   RSKKTRRPRSKADKKARRKTREELDAEARDRKRQKKRKGLASGSRHSGGNENSGNKQQNQ 62

Query: 950 ----KIGKKKK------KKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
               +IG KK       +K  K+ K K +K K   +AE    + +E
Sbjct: 63  KKDPRIGSKKPIPLGVTEKVTKQHKPKSEKPKLSPQAELELLENDE 108


>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7).  RRP7
           is an essential protein in yeast that is involved in
           pre-rRNA processing and ribosome assembly. It is
           speculated to be required for correct assembly of rpS27
           into the pre-ribosomal particle.
          Length = 131

 Score = 38.7 bits (91), Expect = 0.003
 Identities = 20/79 (25%), Positives = 41/79 (51%), Gaps = 7/79 (8%)

Query: 891 GFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA-----EKMKKKKKDAEKMKK 945
           G+    +   +RK      K  ++++K KE +K +KK+       +  +KKK++  +++K
Sbjct: 51  GWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELENFYRFQIREKKKEELAELRK 110

Query: 946 KGAEKIGKKKKKKKKKKKK 964
           K  E   KK+ ++ K  +K
Sbjct: 111 KFEED--KKRIEQLKAARK 127



 Score = 36.0 bits (84), Expect = 0.021
 Identities = 23/89 (25%), Positives = 39/89 (43%), Gaps = 26/89 (29%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAE------KMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           K++E  K   K  +E       D +      +  +K      + K  ++++K K+KKKKK
Sbjct: 32  KREEEAKEEAKARSEP------DEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKK 85

Query: 971 K--------------KKKEAEKMKKKTEE 985
           K              KK+E  +++KK EE
Sbjct: 86  KKELENFYRFQIREKKKEELAELRKKFEE 114


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 38.8 bits (91), Expect = 0.003
 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 4/75 (5%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE----KI 951
           RK+   ++  KG  +  +++K+ +E EK + K  A     +  D E  +++  +    + 
Sbjct: 30  RKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKEQERWDDPMAQF 89

Query: 952 GKKKKKKKKKKKKKK 966
            +KKK+K  KK K  
Sbjct: 90  LRKKKEKTDKKGKPL 104



 Score = 38.4 bits (90), Expect = 0.004
 Identities = 22/96 (22%), Positives = 47/96 (48%), Gaps = 19/96 (19%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG---AEKMKKKKKDAEKMKKKG------ 947
           +     +K++  ++  K++K++KE ++    KG    E+ +K+ ++ EK K K       
Sbjct: 10  RIIDIEEKRE-EKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYAD 68

Query: 948 AEKIGKKKKKKKK---------KKKKKKKKKKKKKK 974
            E   ++ K++++         +KKK+K  KK K  
Sbjct: 69  DEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKPL 104



 Score = 35.7 bits (83), Expect = 0.037
 Identities = 16/77 (20%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           K  +     EK  +K+  ++ K++K++ EK   KG  +  +++K+ ++ +K K K   + 
Sbjct: 7   KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARY 66

Query: 973 K--KEAEKMKKKTEEAE 987
              ++ ++  K+ E  +
Sbjct: 67  ADDEDYDEELKEQERWD 83



 Score = 34.6 bits (80), Expect = 0.077
 Identities = 14/53 (26%), Positives = 35/53 (66%), Gaps = 2/53 (3%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +  D E+ +++  ++  K++K++K++K+K+  K   +K+E EK  ++ E+A+ 
Sbjct: 10  RIIDIEEKREE--KEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 41.4 bits (97), Expect = 0.003
 Identities = 21/64 (32%), Positives = 33/64 (51%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           +KK   E E  RKKK  EK K+K+    K  +K A+   + ++        KK +KK +K
Sbjct: 9   EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRK 68

Query: 974 KEAE 977
           ++ E
Sbjct: 69  RDVE 72



 Score = 39.1 bits (91), Expect = 0.015
 Identities = 20/64 (31%), Positives = 36/64 (56%), Gaps = 4/64 (6%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           KK    E++ +KKKK+++A    K+K  +K+K  +K+A+   +      G    KK +KK
Sbjct: 10  KKILTEEELERKKKKEEKA----KEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65

Query: 963 KKKK 966
            +K+
Sbjct: 66  SRKR 69



 Score = 38.3 bits (89), Expect = 0.024
 Identities = 21/70 (30%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK--KKKKKKEAE 977
           E+E  +K    E++++KKK  EK K+K  +K+   +K+ K K + ++        KK  +
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64

Query: 978 KMKKKTEEAE 987
           K +K+  E E
Sbjct: 65  KSRKRDVEDE 74



 Score = 34.1 bits (78), Expect = 0.44
 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 2/65 (3%)

Query: 913 KKKKKKKEAEKMRKK--KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           KK   ++E E+ +KK  K  EK  KK K A+K  K   +           KK +KK +K+
Sbjct: 10  KKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKR 69

Query: 971 KKKKE 975
             + E
Sbjct: 70  DVEDE 74



 Score = 33.7 bits (77), Expect = 0.54
 Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           +   RKKKK  +   K+ KK K A+K  K   A+   ++  D   + KK  +K  K+
Sbjct: 16  EELERKKKKEEKAKEKELKKLKAAQKEAK---AKLQAQQASDGTNVPKKSEKKSRKR 69



 Score = 33.3 bits (76), Expect = 0.86
 Identities = 15/43 (34%), Positives = 28/43 (65%), Gaps = 1/43 (2%)

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKK-KKKEAEKMKKKTEEAE 987
              E   +KK   +++ ++KKKK++K K+KE +K+K   +EA+
Sbjct: 2   SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44



 Score = 32.6 bits (74), Expect = 1.5
 Identities = 16/50 (32%), Positives = 28/50 (56%)

Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
             + + +AEK      E   KKKK++K K+K+ KK K  +K+   K++ +
Sbjct: 1   GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQ 50



 Score = 31.4 bits (71), Expect = 2.8
 Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAE-KMRKKKGAEKMKKKKKDAEKMKKKGAE 949
           L RKK+   K K+   K  K K  +KEA+ K++ ++ ++     KK  +K +K+  E
Sbjct: 18  LERKKKKEEKAKEKELK--KLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVE 72



 Score = 30.6 bits (69), Expect = 4.9
 Identities = 12/35 (34%), Positives = 20/35 (57%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           ++KKKK++K K+K+ KK K  + E   K   +   
Sbjct: 19  ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQAS 53



 Score = 30.3 bits (68), Expect = 7.2
 Identities = 14/36 (38%), Positives = 26/36 (72%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           ++ ++KKKK++K K+K+ KK K A+K  K   +A++
Sbjct: 16  EELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQ 51


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 40.4 bits (95), Expect = 0.003
 Identities = 21/87 (24%), Positives = 57/87 (65%), Gaps = 3/87 (3%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           T  ++++  +KI++ +K+ ++A+K  ++   EK+K+ K + +++KKK A +I +K ++  
Sbjct: 129 TPEEERELVQKIKELRKELEDAKKALEEN--EKLKELKAEIDELKKK-AREIHEKIQELA 185

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            + ++  ++  K  +EA++++K+ +E 
Sbjct: 186 NEAQEYHEEMIKLFEEADELRKEADEL 212



 Score = 37.0 bits (86), Expect = 0.043
 Identities = 21/112 (18%), Positives = 56/112 (50%), Gaps = 20/112 (17%)

Query: 895 FRKKRT--RRKKKKGAEKIRKKKKK----KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
            +KK      K ++ A + ++  ++     +EA+++RK+  A+++ ++  +  K   +  
Sbjct: 170 LKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKE--ADELHEEFVELSKKIDELH 227

Query: 949 EKIGKKKKK------------KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           E+    + +             K+K  K+++K+++ K+ AE++ +K +  EK
Sbjct: 228 EEFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFKRGEK 279



 Score = 36.2 bits (84), Expect = 0.063
 Identities = 25/119 (21%), Positives = 52/119 (43%), Gaps = 26/119 (21%)

Query: 896 RKKRTRRKKK-----KGAEKIRKKKKKKKEAEKMRKK---------KGAEKMKKKKKDAE 941
           RK+    KK      K  E   +  + KK+A ++ +K         +  E+M K  ++A+
Sbjct: 144 RKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEAD 203

Query: 942 KMKKKG------AEKIGKKKKKKKKKKKKKKK------KKKKKKKEAEKMKKKTEEAEK 988
           +++K+         ++ KK  +  ++ +  +       KK K  +  EK  K+ E+ E+
Sbjct: 204 ELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREKREE 262



 Score = 34.3 bits (79), Expect = 0.26
 Identities = 24/118 (20%), Positives = 55/118 (46%), Gaps = 22/118 (18%)

Query: 893 ILFRKKRTRRKKKKGAEKI-RKKKKKKKEAEKMRKK---------------KGAEKMKKK 936
            L ++ R  ++K+       R  K  ++E E++ KK               +  ++++K+
Sbjct: 87  ELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKE 146

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK------KEAEKMKKKTEEAEK 988
            +DA+K  ++  +    K +  + KKK ++  +K ++      +  E+M K  EEA++
Sbjct: 147 LEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADE 204



 Score = 32.7 bits (75), Expect = 0.78
 Identities = 19/84 (22%), Positives = 45/84 (53%), Gaps = 7/84 (8%)

Query: 903 KKKKGAEKIRKKKKK-KKEAEKMRKKKGA--EKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           K  + A+++RK+  +  +E  ++ KK     E+ +  + +  +++KK      K+K  K+
Sbjct: 197 KLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKR 256

Query: 960 KKK----KKKKKKKKKKKKEAEKM 979
           ++K    K++ ++  +K K  EK+
Sbjct: 257 REKREELKERAEEIYEKFKRGEKL 280



 Score = 32.3 bits (74), Expect = 1.1
 Identities = 19/100 (19%), Positives = 51/100 (51%), Gaps = 7/100 (7%)

Query: 895 FRKKRTRRKK-KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            ++ + +R +     +++RK+ ++ KE        G    K  +++ E+++KK    +  
Sbjct: 71  VQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSI-KSLEREIERLEKKQQTSVLT 129

Query: 954 KKKKKK-----KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +++++     K+ +K+ +  KK  +E EK+K+   E ++
Sbjct: 130 PEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDE 169



 Score = 32.0 bits (73), Expect = 1.5
 Identities = 21/116 (18%), Positives = 52/116 (44%), Gaps = 31/116 (26%)

Query: 901 RRKKKKGAEKIRKKKKKKKE-----------AEKMRKK------KGAEKMKKKKK-DAEK 942
           R ++ +  E++++ K+K+ E             ++++K       G      +++ +  +
Sbjct: 61  REERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLE 120

Query: 943 MK-----------KKGAEKIGKKKKKKKKKKKKKKKKKKKK--KKEAEKMKKKTEE 985
            K           ++  +KI + +K+ +  KK  ++ +K K  K E +++KKK  E
Sbjct: 121 KKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKARE 176



 Score = 31.6 bits (72), Expect = 1.7
 Identities = 17/97 (17%), Positives = 47/97 (48%), Gaps = 7/97 (7%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE----KMKKKGAEKI 951
           ++   R++  + AEK  +   K +E   +R+K    + ++ + + E    K K+      
Sbjct: 28  KRDELRKEASELAEKRDELNAKVRE---LREKAQELREERDEINEEVQELKEKRDEINAK 84

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            ++ +K+ ++ K+K+ +     +  + ++++ E  EK
Sbjct: 85  LQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEK 121



 Score = 31.6 bits (72), Expect = 1.9
 Identities = 18/94 (19%), Positives = 44/94 (46%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
             K      K+K  ++  ++ K+K++  +    + AEK  +      ++++K  E   ++
Sbjct: 5   LDKLDELELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREER 64

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +  ++ ++ K+K+ +   K  E  K+  E  EK
Sbjct: 65  DEINEEVQELKEKRDEINAKLQELRKEYRELKEK 98



 Score = 30.8 bits (70), Expect = 3.7
 Identities = 22/113 (19%), Positives = 50/113 (44%), Gaps = 25/113 (22%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAE------KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
             K ++  EK ++ ++++ E        K ++ +   K+++ +K+  ++K+K  E     
Sbjct: 47  NAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGG 106

Query: 955 KKKKKKKKKKKK-------------------KKKKKKKKEAEKMKKKTEEAEK 988
           +  K  +++ ++                   +K K+ +KE E  KK  EE EK
Sbjct: 107 RSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEK 159


>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1.  This family consists
           of several bacterial histone H1-like Hc1 proteins. In
           Chlamydia, Hc1 is expressed in the late stages of the
           life cycle, concomitant with the reorganisation of
           chlamydial reticulate bodies into elementary bodies.
           This suggests that Hc1 protein plays a role in the
           condensation of chromatin during intracellular
           differentiation.
          Length = 123

 Score = 38.2 bits (88), Expect = 0.003
 Identities = 30/84 (35%), Positives = 40/84 (47%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
           RT   K +   K+ +K+  K E   + KKK A+K  K  K  +K  K  A+K      KK
Sbjct: 35  RTDSIKLEKIAKLYRKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKK 94

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKK 982
            K  K K   KK   K+ +KM KK
Sbjct: 95  AKAVKAKPATKKTAAKKVKKMSKK 118



 Score = 34.7 bits (79), Expect = 0.059
 Identities = 29/75 (38%), Positives = 33/75 (44%), Gaps = 2/75 (2%)

Query: 902 RKKKKGAEKIRKKKKK--KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           RK+   AEK    KKK  KK  +  +  K A K   KK  A   KK  A K     KK  
Sbjct: 49  RKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTA 108

Query: 960 KKKKKKKKKKKKKKK 974
            KK KK  KK+   K
Sbjct: 109 AKKVKKMSKKRSTAK 123



 Score = 32.4 bits (73), Expect = 0.34
 Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---MKKKGAEKIGKKKKKKKKKKKK 964
           AEK  K   ++   + ++ +K A+  +K+   AEK   MKKK A+K  K  K  KK  K 
Sbjct: 23  AEKGNKAAAQRVRTDSIKLEKIAKLYRKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKA 82

Query: 965 KKKK---KKKKKKEAEKMKKKTEEA 986
             KK      KK +A K K  T++ 
Sbjct: 83  PAKKAAAAPAKKAKAVKAKPATKKT 107



 Score = 30.1 bits (67), Expect = 2.5
 Identities = 26/76 (34%), Positives = 34/76 (44%), Gaps = 2/76 (2%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK-KKKKKKKKKKKKKKKKKKK 972
           K +K  +A   R +  + K++K  K   K   K AEK G  KKK  KK  K  K  KK  
Sbjct: 22  KAEKGNKAAAQRVRTDSIKLEKIAKLYRKESIK-AEKSGLMKKKPAKKAPKAAKAPKKAA 80

Query: 973 KKEAEKMKKKTEEAEK 988
           K  A+K      +  K
Sbjct: 81  KAPAKKAAAAPAKKAK 96



 Score = 28.9 bits (64), Expect = 5.0
 Identities = 26/77 (33%), Positives = 34/77 (44%), Gaps = 6/77 (7%)

Query: 894 LFRKKRTRRKK-----KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE-KMKKKG 947
           L+RK+  + +K     KK A+K  K  K  K+A K   KK A    KK K  + K   K 
Sbjct: 47  LYRKESIKAEKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKK 106

Query: 948 AEKIGKKKKKKKKKKKK 964
                 KK  KK+   K
Sbjct: 107 TAAKKVKKMSKKRSTAK 123


>gnl|CDD|218585 pfam05424, Duffy_binding, Duffy binding domain.  This domain is
           found in Plasmodium Duffy binding proteins. Plasmodium
           vivax and Plasmodium knowlesi merozoites invade human
           erythrocytes that express Duffy blood group surface
           determinants. The Duffy receptor family is localised in
           micronemes, an organelle found in all organisms of the
           phylum Apicomplexa.
          Length = 276

 Score = 40.0 bits (94), Expect = 0.003
 Identities = 26/60 (43%), Positives = 34/60 (56%), Gaps = 5/60 (8%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKK-----KKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +++KK  EK+KKK  EK   K   KK  K     KK KK    KKKE +K KKK ++ +K
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258



 Score = 38.9 bits (91), Expect = 0.010
 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 915 KKKKKEAEKMRKK-KGAEKMKKKKKDAEKMK---KKGAEKIGKKKKKKKKKKKKKKKKKK 970
           +++KK+ EK++KK K  +  K   K   K K   KK  + I   KKK+  K+KKK KK K
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWI-DNKKKEWDKQKKKYKKYK 257

Query: 971 KKKKEAEKMKKKTEEAEK 988
           K+  +      K + A++
Sbjct: 258 KRNNKKNYSDIKDKYAKE 275



 Score = 36.5 bits (85), Expect = 0.053
 Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 9/77 (11%)

Query: 902 RKKKKGAEKIRKK-KKKKKEAEKMRKKKG--------AEKMKKKKKDAEKMKKKGAEKIG 952
           R++KK  EK++KK K+KK      +K            + +  KKK+ +K KKK  +   
Sbjct: 199 RERKKKLEKLKKKCKEKKCNKYCSKKCTKCKNACKKYKKWIDNKKKEWDKQKKKYKKYKK 258

Query: 953 KKKKKKKKKKKKKKKKK 969
           +  KK     K K  K+
Sbjct: 259 RNNKKNYSDIKDKYAKE 275


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 39.6 bits (93), Expect = 0.004
 Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 12/92 (13%)

Query: 878 VNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK 937
           +  DVR++    L F L + +R   +K+   E+ R  +  K  +E+ R K+ A++M  +K
Sbjct: 108 ITSDVRKQ----LRF-LEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEM--QK 160

Query: 938 KDAEKMKKKGAE-----KIGKKKKKKKKKKKK 964
           ++ E+M+ + A       IG +KKKK++   +
Sbjct: 161 EEDEEMRHRAANATALAAIGGRKKKKRRITMR 192



 Score = 35.4 bits (82), Expect = 0.097
 Identities = 11/49 (22%), Positives = 32/49 (65%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           + ++++ EK  ++  E++ +  K + ++ + K+K K+ +K+E E+M+ +
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHR 169


>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
           Validated.
          Length = 860

 Score = 40.6 bits (95), Expect = 0.004
 Identities = 22/51 (43%), Positives = 24/51 (47%), Gaps = 2/51 (3%)

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           K  G  K   +KK   K KKK A   GK  K  KKK K K KK  KK+   
Sbjct: 738 KYAGTPKTPYEKK--TKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGP 786



 Score = 37.1 bits (86), Expect = 0.057
 Identities = 15/35 (42%), Positives = 15/35 (42%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           EK  K KKK    K K  K  KKK K   K   K 
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKK 782



 Score = 35.6 bits (82), Expect = 0.16
 Identities = 16/50 (32%), Positives = 21/50 (42%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           DA   K  G  K   +KK K KKK    K K  K  + +   K  +  +K
Sbjct: 733 DAVITKYAGTPKTPYEKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKK 782



 Score = 34.8 bits (80), Expect = 0.29
 Identities = 14/44 (31%), Positives = 16/44 (36%)

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K K  +K    K K  K  KKK K K KK  +       T    
Sbjct: 750 KTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPA 793



 Score = 32.9 bits (75), Expect = 1.1
 Identities = 11/35 (31%), Positives = 15/35 (42%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
           K KKK  + K +  K  +K  K K      K+ G 
Sbjct: 752 KAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGP 786



 Score = 32.1 bits (73), Expect = 1.7
 Identities = 21/90 (23%), Positives = 35/90 (38%), Gaps = 11/90 (12%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA------EKMKKKGAEKIGKKKK 956
           KK K  +K    K K  +  K + K  ++K  KK+               GAE +G+ + 
Sbjct: 749 KKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRAGPLYTPSPALAAMIGAEPVGRGEA 808

Query: 957 KKK-----KKKKKKKKKKKKKKKEAEKMKK 981
            KK     K+   +  + KK      K++ 
Sbjct: 809 TKKVWDYIKEHGLQSPENKKLIIPDSKLQG 838


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
           CCCH-type Zn-finger protein [General function prediction
           only].
          Length = 299

 Score = 40.1 bits (93), Expect = 0.004
 Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 9/74 (12%)

Query: 924 MRKKKGAEKMKKKKKDAEK-MKKKGAEK-IGKKKKKKKKKKKKKKK-------KKKKKKK 974
           M  KK A+K ++  K A + M+K+  +K  G K K +  K +   K       K++ +KK
Sbjct: 1   MPPKKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKK 60

Query: 975 EAEKMKKKTEEAEK 988
           E  +M++K  E EK
Sbjct: 61  EKMRMEEKRREPEK 74



 Score = 39.7 bits (92), Expect = 0.006
 Identities = 17/77 (22%), Positives = 38/77 (49%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           +K+++  K+A +  +K+  +K    K      K +   K  +    K++ +KK+K + ++
Sbjct: 8   KKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEE 67

Query: 972 KKKEAEKMKKKTEEAEK 988
           K++E EK   +     K
Sbjct: 68  KRREPEKQVIRAGVDPK 84



 Score = 33.5 bits (76), Expect = 0.44
 Identities = 25/103 (24%), Positives = 43/103 (41%), Gaps = 17/103 (16%)

Query: 897 KKRTRRKKKKGAEK---IRKKKKKKKEAEKMRK---KKGAEKMKKKKKDAEKMKKKGAEK 950
           KK TR  +K+  +K   ++ K +  K    +++       E+++KK+K   + K++  EK
Sbjct: 15  KKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEKMRMEEKRREPEK 74

Query: 951 ----IGKKKK-------KKKKKKKKKKKKKKKKKKEAEKMKKK 982
                G   K         K   K    K    K+EA K +K 
Sbjct: 75  QVIRAGVDPKTVVCALFLNKTCAKGDACKFAHGKEEARKTEKP 117


>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7.  TMA7 plays
           a role in protein translation. Deletions of the TMA7
           gene results in altered protein synthesis rates.
          Length = 63

 Score = 36.0 bits (83), Expect = 0.005
 Identities = 22/57 (38%), Positives = 30/57 (52%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           KKK  +  +K+KK+  E +   K+K  E+ K  K+ A K K KG    G  KK  KK
Sbjct: 7   KKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGKK 63



 Score = 28.7 bits (64), Expect = 2.3
 Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 2/57 (3%)

Query: 914 KKKKKKEAEKMRKKKGAEKM--KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           KKK  K  +K +K+   E +  K+K+K+  K  K+ A K   K        KK  KK
Sbjct: 7   KKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGKK 63



 Score = 26.7 bits (59), Expect = 9.7
 Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 8/44 (18%)

Query: 947 GAEKIGKKKKKKKKK--------KKKKKKKKKKKKKEAEKMKKK 982
           G +K  K  KK+KK+        K+K+K++ K  K+ A K K K
Sbjct: 6   GKKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGK 49


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 37.4 bits (87), Expect = 0.005
 Identities = 23/75 (30%), Positives = 48/75 (64%), Gaps = 2/75 (2%)

Query: 897 KKRTRRKKKKGAEKIRKKK-KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +KR  ++ ++ A K R+K+ K +KEAE+ R+ +  ++ +  K++ E+ +K  A K+  KK
Sbjct: 31  EKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKM-AAKMHAKK 89

Query: 956 KKKKKKKKKKKKKKK 970
            ++ K+++K+ K  K
Sbjct: 90  VERLKRREKRNKLLK 104



 Score = 37.0 bits (86), Expect = 0.006
 Identities = 24/86 (27%), Positives = 50/86 (58%), Gaps = 7/86 (8%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           KK  R K K  + + R +K+ +++A K R     EK  K +K+AE+ ++   + I +++ 
Sbjct: 18  KKAFRPKSKLTSWEKRMEKRLEQQAIKAR-----EKELKDEKEAERQRRI--QAIKERRA 70

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKK 982
            K++K++ +K   K   K+ E++K++
Sbjct: 71  AKEEKERYEKMAAKMHAKKVERLKRR 96



 Score = 32.0 bits (73), Expect = 0.41
 Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 8/63 (12%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           L  +K   R+++  A K R+  K     EK R +K A KM  KK   E++K++  EK  K
Sbjct: 50  LKDEKEAERQRRIQAIKERRAAK----EEKERYEKMAAKMHAKK--VERLKRR--EKRNK 101

Query: 954 KKK 956
             K
Sbjct: 102 LLK 104



 Score = 30.1 bits (68), Expect = 1.6
 Identities = 16/82 (19%), Positives = 44/82 (53%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           K  KG     K  K +K+A + + K  + + + +K+  ++  K   +++  +K+ +++++
Sbjct: 2   KASKGMRVNGKNWKTEKKAFRPKSKLTSWEKRMEKRLEQQAIKAREKELKDEKEAERQRR 61

Query: 963 KKKKKKKKKKKKEAEKMKKKTE 984
            +  K+++  K+E E+ +K   
Sbjct: 62  IQAIKERRAAKEEKERYEKMAA 83


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 40.4 bits (94), Expect = 0.005
 Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 2/72 (2%)

Query: 903 KKKKGAEKIRKKKKKK--KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
            KK  A +  K +  +  K  E+ +KK+     KK    A+K +K G EK+    KK   
Sbjct: 541 YKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSS 600

Query: 961 KKKKKKKKKKKK 972
               K ++ K  
Sbjct: 601 AFFAKLQETKAV 612



 Score = 37.7 bits (87), Expect = 0.032
 Identities = 25/82 (30%), Positives = 34/82 (41%), Gaps = 9/82 (10%)

Query: 909 EKIRKKKK--KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
           E+I KK K  +K + E  R  K  E+ KKK+K     K          K KK++K  K+K
Sbjct: 538 EEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAAKKA-------VTKAKKERKIGKEK 590

Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
                KK  +    K  E    
Sbjct: 591 VDGVAKKSSSAFFAKLQETKAV 612



 Score = 34.2 bits (78), Expect = 0.37
 Identities = 17/62 (27%), Positives = 28/62 (45%), Gaps = 3/62 (4%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
             R  K +   + RKKK+K++ A+K   K   E+   K+K     KK  +    K ++ K
Sbjct: 554 LDRTDKNR---ERRKKKRKRRAAKKAVTKAKKERKIGKEKVDGVAKKSSSAFFAKLQETK 610

Query: 959 KK 960
             
Sbjct: 611 AV 612


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 37.2 bits (86), Expect = 0.005
 Identities = 17/49 (34%), Positives = 23/49 (46%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
              K    K KK    K G  K KKK  KKKKK++ ++     E++  K
Sbjct: 1   MAPKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDK 49



 Score = 29.5 bits (66), Expect = 2.5
 Identities = 14/57 (24%), Positives = 29/57 (50%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
             K    K +K +K K    K +KK   +K K++ + A  + ++  +KI K+  +++
Sbjct: 2   APKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRKEVPRER 58



 Score = 29.1 bits (65), Expect = 3.6
 Identities = 12/37 (32%), Positives = 18/37 (48%)

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
              +  K+KK +K K    K KKK   +K K++   A
Sbjct: 3   PKSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRA 39


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 37.1 bits (86), Expect = 0.006
 Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 2/56 (3%)

Query: 935 KKKKDAEKMKKKGAEKI--GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K++K  EK  K+  + I    KK+KKKKK+  K  + K  KK  +K KKK E  E 
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEF 56



 Score = 35.9 bits (83), Expect = 0.013
 Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 3/70 (4%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           K++K+ EK  K+   +     KK  +K K+       K  KK +KK KKK +  +  +  
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPE-- 58

Query: 976 AEKMKKKTEE 985
            E  +++TE+
Sbjct: 59  -ESKRRRTED 67



 Score = 33.6 bits (77), Expect = 0.086
 Identities = 17/61 (27%), Positives = 27/61 (44%), Gaps = 4/61 (6%)

Query: 932 KMKKKKKDAEKMK----KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K +KK +  +K          +K  KKKK+  K  + K  KK +KK + +    +  E  
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60

Query: 988 K 988
           K
Sbjct: 61  K 61



 Score = 29.0 bits (65), Expect = 3.2
 Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 5/76 (6%)

Query: 914 KKKKKKEAEKMRK----KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           K++KK E    R         +K KKKKK   K  +  A K   +KK KKK +  +  ++
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKK-GQKKDKKKDEFPEFPEE 59

Query: 970 KKKKKEAEKMKKKTEE 985
            K+++  + +   TEE
Sbjct: 60  SKRRRTEDGLPIYTEE 75



 Score = 28.2 bits (63), Expect = 6.4
 Identities = 14/38 (36%), Positives = 19/38 (50%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
           +KKKKK+ A+  R K   +  KK KK  E  +     K
Sbjct: 24  RKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 39.9 bits (93), Expect = 0.006
 Identities = 16/58 (27%), Positives = 32/58 (55%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
             +K+  +  A+   ++ +D+E+ +++  E+ G   KK KK KK K KK    K +++
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSD 357



 Score = 37.6 bits (87), Expect = 0.032
 Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 5/95 (5%)

Query: 888 TALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
           TA GF L   K  +       ++  + + +K         +G +   K  +  ++     
Sbjct: 174 TANGFQLMMMKAAKNGPAAFGDEDEETEGEKGG-----GGRGKDLKIKDLEGDDEDDGDE 228

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           ++K G+   ++K KKKKKK  K KKK + +K  K+
Sbjct: 229 SDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKR 263



 Score = 37.6 bits (87), Expect = 0.032
 Identities = 18/110 (16%), Positives = 48/110 (43%), Gaps = 17/110 (15%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG-----------------AEKMKKKKK 938
           +KK+  + KKK  +  + K+    +A++     G                  +  +++ K
Sbjct: 244 KKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDK 303

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + ++  K   +  +  ++ +++K +++    KK K+ +K+K K    +K
Sbjct: 304 LSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDK 353



 Score = 35.3 bits (81), Expect = 0.16
 Identities = 17/90 (18%), Positives = 32/90 (35%), Gaps = 21/90 (23%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAE---KMKKKKKDAEKMKKKGAEKIGKKKKK- 957
           R+ K   E   K + ++ E  +  +++  E    + KK K  +K+K K            
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359

Query: 958 -----------------KKKKKKKKKKKKK 970
                              KK+K+ KK++ 
Sbjct: 360 DDSDDSDIDGEDSVSLVTAKKQKEPKKEEP 389



 Score = 34.6 bits (79), Expect = 0.27
 Identities = 16/71 (22%), Positives = 28/71 (39%), Gaps = 4/71 (5%)

Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
           G +   ++ K   E       K   +  +  +++E+ K +    + KK KK KK K KK 
Sbjct: 294 GNDPEEREDKLSPEIPA----KPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKN 349

Query: 967 KKKKKKKKEAE 977
              K      +
Sbjct: 350 GLDKDDSDSGD 360


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 37.2 bits (87), Expect = 0.006
 Identities = 15/82 (18%), Positives = 41/82 (50%), Gaps = 9/82 (10%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK-KKKKKKK 966
            EK+RK+     + +++ +    E+ ++ + + E+++ +      + +  K+  K KKKK
Sbjct: 15  KEKLRKRGVDVLDVDELLEL--DEERRELQVELEELQAE------RNELSKEIGKAKKKK 66

Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
           +  +    E +++K + +  E 
Sbjct: 67  EDAEALIAEVKELKDELKALEA 88



 Score = 33.7 bits (78), Expect = 0.090
 Identities = 16/89 (17%), Positives = 41/89 (46%), Gaps = 10/89 (11%)

Query: 894 LFRKK-RTRRKKKKGAEKI----RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
             ++K R R       +++     ++++ + E E+++ ++      +  K+  K KKK  
Sbjct: 13  AVKEKLRKRGVDVLDVDELLELDEERRELQVELEELQAER-----NELSKEIGKAKKKKE 67

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           +      + K+ K + K  + + ++ EAE
Sbjct: 68  DAEALIAEVKELKDELKALEAELRELEAE 96



 Score = 30.3 bits (69), Expect = 1.7
 Identities = 15/63 (23%), Positives = 33/63 (52%), Gaps = 9/63 (14%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
           D+ RRE       +    +  + ++ + +++I K KKKK++AE +  +   +++K + K 
Sbjct: 35  DEERRE-------LQVELEELQAERNELSKEIGKAKKKKEDAEALIAE--VKELKDELKA 85

Query: 940 AEK 942
            E 
Sbjct: 86  LEA 88


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 38.7 bits (90), Expect = 0.007
 Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 5/85 (5%)

Query: 897 KKRTRRKKKKGAEKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           ++R +R+K++  +K+  K       +   K+R K         K   E+ K         
Sbjct: 128 RERYQREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPS--PDKALVEETKIISKAWSEL 185

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEK 978
            + KKKK   K KK K++      +
Sbjct: 186 DESKKKKYIDKYKKLKEEYDSFYPE 210



 Score = 33.3 bits (76), Expect = 0.41
 Identities = 22/91 (24%), Positives = 33/91 (36%), Gaps = 11/91 (12%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK------ 962
           EK    K+   + E+ +++K     K   K       +   KI  K +     K      
Sbjct: 116 EKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEET 175

Query: 963 KKKKKK-----KKKKKKEAEKMKKKTEEAEK 988
           K   K      + KKKK  +K KK  EE + 
Sbjct: 176 KIISKAWSELDESKKKKYIDKYKKLKEEYDS 206


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 40.2 bits (94), Expect = 0.007
 Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 3/79 (3%)

Query: 896  RKKRTR---RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
            R+K  R   R +   A+K+ ++  KK   +K  KK    +  ++   +  M+ +   ++ 
Sbjct: 1179 REKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVV 1238

Query: 953  KKKKKKKKKKKKKKKKKKK 971
            K K +   KKK     K+K
Sbjct: 1239 KPKGRAGAKKKAPAAAKEK 1257



 Score = 39.1 bits (91), Expect = 0.016
 Identities = 24/99 (24%), Positives = 41/99 (41%), Gaps = 13/99 (13%)

Query: 901  RRKKKKGAEKIRKKKKK----------KKEAEKMRKK--KGAEKMKKKKKDAEKMKKKGA 948
            R K     E ++K   K          +KE +K+ K+  K  E  +K ++ A + +   A
Sbjct: 1135 RDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAA 1194

Query: 949  EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            +K+  ++  KK   KK  KK  + +   E       E E
Sbjct: 1195 KKV-SRQAPKKPAPKKTTKKASESETTEETYGSSAMETE 1232



 Score = 35.6 bits (82), Expect = 0.17
 Identities = 21/89 (23%), Positives = 37/89 (41%), Gaps = 10/89 (11%)

Query: 909  EKIRKKKKKKKEAEKMRKK-------KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
            E    + K   E E ++K        K  + ++K+    +K   K  E   K ++   + 
Sbjct: 1130 ELCADRDKLNIEVEDLKKTTPKSLWLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARG 1189

Query: 962  KKK--KKKKKKKKKKEA-EKMKKKTEEAE 987
            +    KK  ++  KK A +K  KK  E+E
Sbjct: 1190 ESGAAKKVSRQAPKKPAPKKTTKKASESE 1218



 Score = 35.2 bits (81), Expect = 0.24
 Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 14/83 (16%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            KK +R+  KK A   +K  KK  E+E   +  G+  M + +  AE +K KG         
Sbjct: 1195 KKVSRQAPKKPAP--KKTTKKASESETTEETYGSSAM-ETENVAEVVKPKG--------- 1242

Query: 957  KKKKKKKKKKKKKKKKKKEAEKM 979
              +   KKK     K+K+E +++
Sbjct: 1243 --RAGAKKKAPAAAKEKEEEDEI 1263



 Score = 34.8 bits (80), Expect = 0.31
 Identities = 25/89 (28%), Positives = 37/89 (41%), Gaps = 5/89 (5%)

Query: 902  RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
            R KKKG  K     KK  +     KK+G   ++  +K   +M  K AE IG   +KK +K
Sbjct: 1329 RLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEM-LKPAEAIGISPEKKVRK 1387

Query: 962  KKKKKKKKKK----KKKEAEKMKKKTEEA 986
             +     KK      +    K  + +E  
Sbjct: 1388 MRASPFNKKSGSVLGRAATNKETESSENV 1416



 Score = 32.5 bits (74), Expect = 1.5
 Identities = 16/92 (17%), Positives = 37/92 (40%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            + +  R K ++ A +      KK   +  +K    +  KK  +     +  G+  +  + 
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETEN 1233

Query: 956  KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
              +  K K +   KKK    A++ +++ E  +
Sbjct: 1234 VAEVVKPKGRAGAKKKAPAAAKEKEEEDEILD 1265


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 39.6 bits (93), Expect = 0.008
 Identities = 17/90 (18%), Positives = 24/90 (26%), Gaps = 3/90 (3%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            +     K     E +       K +    K +             +  K  A    K  
Sbjct: 522 EELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEAVDEVKFL 581

Query: 956 KKKKKKKKKK---KKKKKKKKKEAEKMKKK 982
           KKKK            K KK K   + KK+
Sbjct: 582 KKKKGILDAGAFESTLKDKKNKVLPEAKKR 611


>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
            NSA2 appears to be a protein required for the
           maturation of 27S pre-rRNA in yeast; it has been
           characterized in mammalian cells as a nucleolar protein
           that might play a role in the regulation of the cell
           cycle and in cell proliferation.
          Length = 257

 Score = 38.8 bits (91), Expect = 0.008
 Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 18/89 (20%)

Query: 894 LFRKKRTRR------KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
           L RK+  RR      K+KK A   R+  K+ K+A+K+R  K   K+  KK+  EK++   
Sbjct: 6   LHRKRHGRRLDHEERKRKKEA---REPHKRSKKAQKLRGLKA--KLYNKKRYKEKIQM-- 58

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
                KK  K  +++  K+K   K  + A
Sbjct: 59  -----KKTIKMHEERNVKQKVDDKVPEGA 82



 Score = 36.8 bits (86), Expect = 0.039
 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 11/78 (14%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM---------KKKGAEKIGKKKKKKK 959
           E  RK+  ++ + E+ ++KK A +  K+ K A+K+         KK+  EKI  + KK  
Sbjct: 5   ELHRKRHGRRLDHEERKRKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKI--QMKKTI 62

Query: 960 KKKKKKKKKKKKKKKEAE 977
           K  +++  K+K   K  E
Sbjct: 63  KMHEERNVKQKVDDKVPE 80


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 39.5 bits (93), Expect = 0.008
 Identities = 14/84 (16%), Positives = 46/84 (54%), Gaps = 5/84 (5%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM---KKKGAEKIGKKKKKKKKKKKK 964
            EK++K+     + +K+ +    E+ +K  ++ E++   + + +++IG+  K+ +   ++
Sbjct: 15  REKLKKRGGDALDVDKLLELD--EERRKLLRELEELQAERNELSKEIGRALKRGEDDAEE 72

Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
              + K+ K++ ++++   +E E 
Sbjct: 73  LIAEVKELKEKLKELEAALDELEA 96



 Score = 38.3 bits (90), Expect = 0.017
 Identities = 10/89 (11%), Positives = 38/89 (42%), Gaps = 5/89 (5%)

Query: 894 LFRKK-RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
             R+K + R       +K+ +  +++++      ++  E   ++ + ++++ +       
Sbjct: 13  AVREKLKKRGGDALDVDKLLELDEERRK----LLRELEELQAERNELSKEIGRALKRGED 68

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
             ++   + K+ K+K K+ +    E   +
Sbjct: 69  DAEELIAEVKELKEKLKELEAALDELEAE 97



 Score = 32.6 bits (75), Expect = 1.1
 Identities = 12/86 (13%), Positives = 36/86 (41%), Gaps = 9/86 (10%)

Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK 942
           R      +  +L   +  R+  ++  E   ++ +  KE            +K+ + DAE+
Sbjct: 21  RGGDALDVDKLLELDEERRKLLRELEELQAERNELSKE--------IGRALKRGEDDAEE 72

Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKK 968
           +  +  +++ +K K+ +    + + +
Sbjct: 73  LIAEV-KELKEKLKELEAALDELEAE 97


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 38.0 bits (89), Expect = 0.009
 Identities = 19/83 (22%), Positives = 47/83 (56%), Gaps = 4/83 (4%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           KK A+K +K+K+K++   K  +K   +   + K+ AE+ K + AE+  ++  ++++ + +
Sbjct: 14  KKKAKKAKKEKRKQR---KQARKGADDGDDELKQAAEEAKAEKAER-DRELNRQRQAEAE 69

Query: 965 KKKKKKKKKKEAEKMKKKTEEAE 987
           +K    + K+  E+ +   ++ E
Sbjct: 70  QKAIVAQIKQLIEQNRIDRKDGE 92



 Score = 35.3 bits (82), Expect = 0.078
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KKK KK KK+K+K++K+ +K   +   +  + AE+
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEE 48



 Score = 34.5 bits (80), Expect = 0.14
 Identities = 18/45 (40%), Positives = 28/45 (62%), Gaps = 11/45 (24%)

Query: 953 KKKKKKKKKKKKKKKKKKKK---------KKEAEKMKKKTEEAEK 988
           KKK KK KK+K+K++K+ +K         K+ AE  + K E+AE+
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQAAE--EAKAEKAER 56



 Score = 33.7 bits (78), Expect = 0.25
 Identities = 17/63 (26%), Positives = 34/63 (53%), Gaps = 14/63 (22%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKK--------KKKKKKKKKKKKKKEAEKMKKKTEE 985
           KKK K A+K K+K      ++K+ +K         K+  ++ K +K +++ E  +++  E
Sbjct: 14  KKKAKKAKKEKRK------QRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAE 67

Query: 986 AEK 988
           AE+
Sbjct: 68  AEQ 70


>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
           non-specific RNA binding region part 2.  This is a
           region found N terminal to the catalytic domain of
           glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
           but not in Escherichia coli. This region is thought to
           bind RNA in a non-specific manner, enhancing
           interactions between the tRNA and enzyme, but is not
           essential for enzyme function.
          Length = 83

 Score = 35.8 bits (83), Expect = 0.009
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           K  A+ + KKKKKKKKK +      K KK  AE + +   
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56



 Score = 35.4 bits (82), Expect = 0.014
 Identities = 17/49 (34%), Positives = 21/49 (42%), Gaps = 6/49 (12%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K +A+ +KKK      KKKKKKK +      K KK   E         E
Sbjct: 17  KTEADLVKKK------KKKKKKKAEDTAATAKAKKATAEDVSEGAMFME 59



 Score = 34.3 bits (79), Expect = 0.040
 Identities = 13/35 (37%), Positives = 17/35 (48%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             KKKKKKKKKK +      K +    +  +E A 
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKATAEDVSEGAM 56



 Score = 34.3 bits (79), Expect = 0.040
 Identities = 13/32 (40%), Positives = 17/32 (53%)

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             KKKKKKKKKK +     A+  K   E+  +
Sbjct: 22  LVKKKKKKKKKKAEDTAATAKAKKATAEDVSE 53



 Score = 33.5 bits (77), Expect = 0.057
 Identities = 17/38 (44%), Positives = 20/38 (52%), Gaps = 1/38 (2%)

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +G K +    KKKKKKKKKK +   A   K K   AE 
Sbjct: 14  LGPKTEADLVKKKKKKKKKKAEDTAATA-KAKKATAED 50



 Score = 33.5 bits (77), Expect = 0.066
 Identities = 16/36 (44%), Positives = 18/36 (50%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K +    KKKKKKKKKK +      K KK T E   
Sbjct: 17  KTEADLVKKKKKKKKKKAEDTAATAKAKKATAEDVS 52



 Score = 30.0 bits (68), Expect = 1.1
 Identities = 13/48 (27%), Positives = 18/48 (37%), Gaps = 4/48 (8%)

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K    + KK      KKKKKK +      K KK   ++  +     E 
Sbjct: 17  KTEADLVKKKK----KKKKKKAEDTAATAKAKKATAEDVSEGAMFMEG 60


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 38.9 bits (91), Expect = 0.010
 Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 4/84 (4%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK-- 960
           K KK   K  K        E M KK  A    +      K KKK  +K+  KK K KK  
Sbjct: 219 KYKKQFSKYIKAGVGPDSLEDMYKK--AHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLT 276

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTE 984
            K++K + K KK +  E+++KK +
Sbjct: 277 GKQRKARVKAKKAQRRERLQKKIK 300



 Score = 38.1 bits (89), Expect = 0.016
 Identities = 36/101 (35%), Positives = 43/101 (42%), Gaps = 15/101 (14%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK------------KDAEKMKKKGAEK 950
           K    AE  R +   K  AE M++ K  +  K KK               E M KK    
Sbjct: 188 KDSYDAEVHRDRIFGKHVAEYMKQLKEEDPDKYKKQFSKYIKAGVGPDSLEDMYKKAHAA 247

Query: 951 IGKKKKKKKKKKKKKKK---KKKKKKKEAEKMKKKTEEAEK 988
           I     K KKKKKKKKK   KK K KK   K +K   +A+K
Sbjct: 248 IRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKK 288



 Score = 30.0 bits (68), Expect = 5.6
 Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 9/61 (14%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           R   ++ KKK       KKKKKK   +K + KK   K +K +  A+K +++  E++ KK 
Sbjct: 249 RANPSKVKKK-------KKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRR--ERLQKKI 299

Query: 956 K 956
           K
Sbjct: 300 K 300


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 39.3 bits (92), Expect = 0.011
 Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 6/89 (6%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           K    K  + AE+I K  K  ++  K RKK    K     +D E+ KK+  E I KK  K
Sbjct: 156 KGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNK-----QDLEEYKKR-LEAIKKKDIK 209

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
              + +K K++  K KKE +  K+K + A
Sbjct: 210 NPLELQKIKEEFDKLKKEGKADKQKIKSA 238


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 39.4 bits (92), Expect = 0.012
 Identities = 26/87 (29%), Positives = 44/87 (50%), Gaps = 3/87 (3%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK-KKKKKK 961
            +K+  +K    +K  KE EK+  KK  E+  ++ K+ E+ KK   EK  ++  K  KK+
Sbjct: 523 LEKELEQKNEHLEKLLKEQEKL--KKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +   ++ K+KK    K  K  E+  K
Sbjct: 581 VESIIRELKEKKIHKAKEIKSIEDLVK 607



 Score = 38.6 bits (90), Expect = 0.016
 Identities = 24/96 (25%), Positives = 46/96 (47%), Gaps = 8/96 (8%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK--------MKK 945
           L   ++   +K +  EK+ K+++K K+  +   ++  E+ + KK + EK        +KK
Sbjct: 520 LSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKK 579

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
           +    I + K+KK  K K+ K  +   K +  K K 
Sbjct: 580 EVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKI 615



 Score = 38.6 bits (90), Expect = 0.019
 Identities = 26/91 (28%), Positives = 48/91 (52%), Gaps = 4/91 (4%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K     K  K  EK++K+ +++ E  K R++    +++K+ ++A K  KK  E I ++ K
Sbjct: 530 KNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELK 589

Query: 957 KKKKKKKKKKKK----KKKKKKEAEKMKKKT 983
           +KK  K K+ K      K K+ + +  +K T
Sbjct: 590 EKKIHKAKEIKSIEDLVKLKETKQKIPQKPT 620



 Score = 36.7 bits (85), Expect = 0.071
 Identities = 25/97 (25%), Positives = 48/97 (49%), Gaps = 15/97 (15%)

Query: 901 RRKKKKGAEK------IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           + K   G  K      I K    +KE E+       E ++K  K+ EK+KK+  +++ + 
Sbjct: 501 QAKTFYGEFKEEINVLIEKLSALEKELEQK-----NEHLEKLLKEQEKLKKELEQEMEEL 555

Query: 955 KKKKKKKKKKKKK----KKKKKKKEAEKMKKKTEEAE 987
           K++++ KK + +K      K  KKE E + ++ +E +
Sbjct: 556 KERERNKKLELEKEAQEALKALKKEVESIIRELKEKK 592



 Score = 32.5 bits (74), Expect = 1.3
 Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 9/85 (10%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-KIGKKKKKKKKKKK 963
           ++  E   +++ KK E E    K+  E +K  KK+ E + ++  E KI K K+ K  +  
Sbjct: 550 QEMEELKERERNKKLELE----KEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDL 605

Query: 964 KKKKKKKKKKKEAEKMKKKTEEAEK 988
            K K+ K+K  +    K    +A+K
Sbjct: 606 VKLKETKQKIPQ----KPTNFQADK 626



 Score = 32.5 bits (74), Expect = 1.5
 Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 10/87 (11%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK---------K 961
           I ++ K      K       EK+   +K+ E+ K +  EK+ K+++K KK         K
Sbjct: 498 IIEQAKTFYGEFKEEINVLIEKLSALEKELEQ-KNEHLEKLLKEQEKLKKELEQEMEELK 556

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           ++++ KK + +K+  E +K   +E E 
Sbjct: 557 ERERNKKLELEKEAQEALKALKKEVES 583



 Score = 30.6 bits (69), Expect = 5.9
 Identities = 20/86 (23%), Positives = 41/86 (47%), Gaps = 8/86 (9%)

Query: 910 KIRKKKKKKK------EAEKMRKKKGAEKMK--KKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           K R++ KK +      EA K  KK+    ++  K+KK  +  + K  E + K K+ K+K 
Sbjct: 556 KERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKI 615

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAE 987
            +K    +  K  +  +++   ++ +
Sbjct: 616 PQKPTNFQADKIGDKVRIRYFGQKGK 641



 Score = 29.8 bits (67), Expect = 9.5
 Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 10/96 (10%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK----KKKDAEKMKKKGAEKIGKKKKK 957
           +  KK  E I ++ K+KK   K ++ K  E + K    K+K  +K     A+KIG K + 
Sbjct: 575 KALKKEVESIIRELKEKKI-HKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVRI 633

Query: 958 KKKKKKKKKKKKKKKKKEAE-----KMKKKTEEAEK 988
           +   +K K  +     K        +MK    E EK
Sbjct: 634 RYFGQKGKIVQILGGNKWNVTVGGMRMKVHGSELEK 669


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 36.1 bits (84), Expect = 0.012
 Identities = 19/103 (18%), Positives = 55/103 (53%), Gaps = 13/103 (12%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAE-------------KMRKKKGAEKMKKKKKD 939
           ++ + ++ +++ +  A++I++ + + KE E             K+ K  G   +K+ K++
Sbjct: 3   LINKFQQLQQQLQLLAQQIKQLETQLKEIELVLEELELLDEDTKVYKLIGDVLVKQDKEE 62

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
            ++  ++  E + K+ K  +K+ +K +K+ ++ K+E  K+  +
Sbjct: 63  VKEELEERKETLEKEIKTLEKQLEKLEKELEELKEELYKLFGQ 105


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 37.8 bits (88), Expect = 0.013
 Identities = 20/87 (22%), Positives = 38/87 (43%), Gaps = 4/87 (4%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK- 957
           R  +K KKG   I     K++      K+    ++   ++  ++ K +   K  +     
Sbjct: 126 RKGKKPKKGFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPKREKARVITKEERAFNAY 185

Query: 958 ---KKKKKKKKKKKKKKKKKEAEKMKK 981
              ++ K   K   KK+KKK+A++ KK
Sbjct: 186 RTLRQAKLNAKFVGKKEKKKQAKEKKK 212



 Score = 33.2 bits (76), Expect = 0.51
 Identities = 22/93 (23%), Positives = 37/93 (39%), Gaps = 11/93 (11%)

Query: 891 GFILFRKKRTRRKK----------KKGAEKIRKKKKKKKEAEKMRKKKGAEK-MKKKKKD 939
             +LF +K  + KK          K+    +  K+ K  E    ++    EK     K++
Sbjct: 120 KLVLFPRKGKKPKKGFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPKREKARVITKEE 179

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
                 +   +     K   KK+KKK+ K+KKK
Sbjct: 180 RAFNAYRTLRQAKLNAKFVGKKEKKKQAKEKKK 212


>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020).  This
           family of fungal proteins is conserved towards the
           C-terminus of HMG domains. The function is not known.
          Length = 49

 Score = 34.5 bits (79), Expect = 0.014
 Identities = 16/48 (33%), Positives = 22/48 (45%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             + KKK  E   ++ K    + + KKK KKK  KE  K K    + E
Sbjct: 1   NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDDE 48



 Score = 31.0 bits (70), Expect = 0.19
 Identities = 13/46 (28%), Positives = 23/46 (50%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
             ++KKK  E   ++  +   + + KKK KKK  K+  K+K    +
Sbjct: 1   NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYID 46



 Score = 29.5 bits (66), Expect = 0.85
 Identities = 15/48 (31%), Positives = 24/48 (50%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
             + KKK ++A   + K  +   + KKK KKK  K+  K+K    + E
Sbjct: 1   NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDSKEKDAYIDDE 48



 Score = 26.8 bits (59), Expect = 7.5
 Identities = 13/35 (37%), Positives = 20/35 (57%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           R++KKK +EA   R K    + + KKK  +K  K+
Sbjct: 2   RERKKKWREANSERNKDNDLRSRVKKKAKKKFGKE 36


>gnl|CDD|218481 pfam05177, RCSD, RCSD region.  Proteins contain this region include
           C.elegans UNC-89. This region is found repeated in
           UNC-89 and shows conservation in prolines, lysines and
           glutamic acids. Proteins with RCSD are involved in
           muscle M-line assembly, but the function of this region
           RCSD is not clear.
          Length = 101

 Score = 36.0 bits (82), Expect = 0.014
 Identities = 22/85 (25%), Positives = 42/85 (49%), Gaps = 5/85 (5%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           + ++ +  E+ +  +KK+K  EK+       K K   K  +  +K   E++    KK+K 
Sbjct: 22  KTEEDRATEEAKSPEKKEKSPEKVD-----GKPKSPTKKEKSPEKSATEEVKSPTKKEKS 76

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEE 985
            +K ++K     KKE  + K+K+ E
Sbjct: 77  PEKVEEKPASPTKKERVQKKEKSPE 101


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 39.0 bits (90), Expect = 0.014
 Identities = 21/86 (24%), Positives = 44/86 (51%), Gaps = 2/86 (2%)

Query: 897  KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
             KR+     +  E     + ++K   K R  K   K +++K+  E +++   E+IGKK+K
Sbjct: 992  DKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEK 1051

Query: 957  KKKKKKKKKKKK--KKKKKKEAEKMK 980
            +++++ +K      K+  KK  +K +
Sbjct: 1052 EREQRIRKTIHDNYKEMAKKRLKKKR 1077



 Score = 34.3 bits (78), Expect = 0.41
 Identities = 14/80 (17%), Positives = 38/80 (47%), Gaps = 10/80 (12%)

Query: 913  KKKKKKKEAEKMRKKKGAEK----------MKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
             K+     + ++      E           +K++ KD E+ ++  + +  K+++  KK+K
Sbjct: 992  DKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEK 1051

Query: 963  KKKKKKKKKKKKEAEKMKKK 982
            +++++ +K      ++M KK
Sbjct: 1052 EREQRIRKTIHDNYKEMAKK 1071



 Score = 33.2 bits (75), Expect = 0.82
 Identities = 16/68 (23%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 896  RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
             +++   K +   E+I+ +++K++  E +++ K  E  KK+K+  ++++K   +   +  
Sbjct: 1011 CREKHEIKDRIVKERIKDQEEKERM-ESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMA 1069

Query: 956  KKKKKKKK 963
            KK+ KKK+
Sbjct: 1070 KKRLKKKR 1077



 Score = 30.5 bits (68), Expect = 6.6
 Identities = 12/79 (15%), Positives = 41/79 (51%)

Query: 910  KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
             + ++ + K   +K      + +++       + K +  ++I K++ K +++K++ +  +
Sbjct: 980  SLPREIESKLPLDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQ 1039

Query: 970  KKKKKEAEKMKKKTEEAEK 988
            + K++E  K +K+ E+  +
Sbjct: 1040 RAKEEEIGKKEKEREQRIR 1058


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this family
           (Kri1p) is found to be required for 40S ribosome
           biogenesis in the nucleolus.
          Length = 99

 Score = 35.7 bits (83), Expect = 0.015
 Identities = 9/34 (26%), Positives = 24/34 (70%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K+++K +++++ K+ K  K+++  EK++K  + A
Sbjct: 4   KEEEKAQREEELKRLKNLKREEIEEKLEKIKKVA 37



 Score = 32.2 bits (74), Expect = 0.30
 Identities = 14/53 (26%), Positives = 28/53 (52%), Gaps = 5/53 (9%)

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K++K+ EK +++  E     K+ K  K+++ ++K +K KK A        E +
Sbjct: 1   KERKEEEKAQRE-EEL----KRLKNLKREEIEEKLEKIKKVAGLRGADLSEED 48



 Score = 31.1 bits (71), Expect = 0.67
 Identities = 10/32 (31%), Positives = 23/32 (71%), Gaps = 2/32 (6%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKD--AEKMKK 945
           K++KE EK ++++  +++K  K++   EK++K
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32



 Score = 30.7 bits (70), Expect = 1.0
 Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 10/45 (22%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA 940
           RK+  + +++   E  R K  K++E E        EK++K KK A
Sbjct: 3   RKEEEKAQRE--EELKRLKNLKREEIE--------EKLEKIKKVA 37



 Score = 28.4 bits (64), Expect = 5.0
 Identities = 8/32 (25%), Positives = 23/32 (71%)

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K++K+++K +++++ K+ +  K ++  E+ EK
Sbjct: 1   KERKEEEKAQREEELKRLKNLKREEIEEKLEK 32


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 36.4 bits (84), Expect = 0.016
 Identities = 15/22 (68%), Positives = 16/22 (72%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKE 975
           K KKKKKK KK KK KK  KK+
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 36.0 bits (83), Expect = 0.017
 Identities = 15/23 (65%), Positives = 15/23 (65%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKK 971
           EK  KKKKK KK KK KK  KK 
Sbjct: 98  EKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 36.0 bits (83), Expect = 0.022
 Identities = 15/22 (68%), Positives = 15/22 (68%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKK 974
           K KKKKKK KK KK KK  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 33.7 bits (77), Expect = 0.12
 Identities = 14/21 (66%), Positives = 16/21 (76%)

Query: 958 KKKKKKKKKKKKKKKKKEAEK 978
           K KKKKKK KK KK KK ++K
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119



 Score = 32.9 bits (75), Expect = 0.21
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
           K KKKKKK KK KK KK  + +
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 32.2 bits (73), Expect = 0.39
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 957 KKKKKKKKKKKKKKKKKKEAEK 978
           K KKKKKK KK KK KK   + 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 30.2 bits (68), Expect = 1.8
 Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 960 KKKKKKKKKKKKKKKEAEKMKK 981
           K KKKKKK KK KK +    K 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 3.5
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 959 KKKKKKKKKKKKKKKKEAEKMK 980
           K KKKKKK KK KK K++ K  
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 3.9
 Identities = 13/22 (59%), Positives = 14/22 (63%)

Query: 961 KKKKKKKKKKKKKKEAEKMKKK 982
           K KKKKKK KK KK  +  KK 
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.1 bits (65), Expect = 4.9
 Identities = 11/21 (52%), Positives = 16/21 (76%)

Query: 965 KKKKKKKKKKEAEKMKKKTEE 985
           K KKKKKK K+ +K KK +++
Sbjct: 99  KPKKKKKKSKKTKKPKKSSKK 119


>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
          Length = 801

 Score = 38.6 bits (90), Expect = 0.016
 Identities = 18/63 (28%), Positives = 29/63 (46%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           K+ K +E E  R+K    +  +  +      KK A+++ KK      KKK+ KK     K
Sbjct: 570 KELKDEEVEAYREKFAGSQADRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGK 629

Query: 974 KEA 976
            +A
Sbjct: 630 SKA 632



 Score = 32.1 bits (73), Expect = 2.1
 Identities = 14/57 (24%), Positives = 25/57 (43%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           + KK A++++KK       +K  KK G     K   + E    +   ++G   K +K
Sbjct: 599 EAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAEREIDVSRLDIRVGLIVKAEK 655



 Score = 29.7 bits (67), Expect = 9.4
 Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 5/60 (8%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           R+K  G++  R  +     AE    KK A+++KKK       KK+G +  G  K K   +
Sbjct: 581 REKFAGSQADRAAR-----AEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAE 635


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 38.1 bits (89), Expect = 0.017
 Identities = 37/153 (24%), Positives = 59/153 (38%), Gaps = 22/153 (14%)

Query: 833 SNVATKRLLHDAILRRSGMYTIAMAYCGSGSNVATKRLLHVAVSDVNDDVRREAVTALGF 892
           S +AT R+L  A   +     I       G + A    L     D  +    E V A   
Sbjct: 175 SELATTRILTPADFAK-----IQELRLEKGVDKALGGKLKRRDKDAPERHSDELVDADDI 229

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEK---MRKKKGAE------KMKKKKKDAEKM 943
               KK+    K+   E+I   K+ +++ EK    + KK  E      K K +KK+    
Sbjct: 230 EGPAKKK----KQTKEERIATAKEGREDREKFGSRKGKKDKEGKSTTNKEKARKKNFMMT 285

Query: 944 KKKGAEKIGKKKKK---KKKKKKKKKKKKKKKK 973
             K   + GK+K+    K+K  +    ++KK  
Sbjct: 286 LHKKRVR-GKQKRSLRDKQKVLRAHILRQKKGG 317



 Score = 37.7 bits (88), Expect = 0.023
 Identities = 25/109 (22%), Positives = 44/109 (40%), Gaps = 22/109 (20%)

Query: 896 RKKRTRRKKKKGAEK---------IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
            K+R +   ++ +++           KKKK+ KE      K+G E  +K      K  K+
Sbjct: 208 LKRRDKDAPERHSDELVDADDIEGPAKKKKQTKEERIATAKEGREDREKFGSRKGKKDKE 267

Query: 947 GAEKIGKKKKKKK-------------KKKKKKKKKKKKKKKEAEKMKKK 982
           G     K+K +KK             K+K+  + K+K  +    + KK 
Sbjct: 268 GKSTTNKEKARKKNFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKG 316



 Score = 30.0 bits (68), Expect = 6.2
 Identities = 11/89 (12%), Positives = 40/89 (44%), Gaps = 11/89 (12%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG-----------KKKKK 957
           E+ + +  KK + +   +    ++ +  +++  + +K+ A ++            K ++ 
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPADFAKIQEL 194

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           + +K   K    K K+++ +  ++ ++E 
Sbjct: 195 RLEKGVDKALGGKLKRRDKDAPERHSDEL 223


>gnl|CDD|131972 TIGR02926, AhaH, ATP synthase archaeal, H subunit.  he A1/A0 ATP
           synthase is homologous to the V-type (V1/V0, vacuolar)
           ATPase, but functions in the ATP synthetic direction as
           does the F1/F0 ATPase of bacteria. The hydrophilic A1
           "stalk" complex (AhaABCDEFG) is the site of ATP
           generation and is coupled to the membrane-embedded
           proton translocating A0 complex. It is unclear precisely
           where AhaH fits into these complexes.
          Length = 85

 Score = 35.2 bits (81), Expect = 0.018
 Identities = 22/84 (26%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
            + KK  E   +  ++ +E  K R  +  E+ ++  ++AE+   K  E+I K+ +++ + 
Sbjct: 2   EEIKKAEEDAEELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIE- 60

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEE 985
            K+ +K +++ +KE E MK K +E
Sbjct: 61  -KEAEKIREEGEKEIEAMKSKAKE 83



 Score = 32.5 bits (74), Expect = 0.17
 Identities = 21/68 (30%), Positives = 40/68 (58%), Gaps = 4/68 (5%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
            +K++ AE   + ++  +EAE+   K G E +K+ +++ EK     AEKI ++ +K+ + 
Sbjct: 21  ERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEKE----AEKIREEGEKEIEA 76

Query: 962 KKKKKKKK 969
            K K K+K
Sbjct: 77  MKSKAKEK 84



 Score = 27.9 bits (62), Expect = 6.8
 Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 2/74 (2%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKK--KEAEKMRKKKGAEKMKKKK 937
           +++  EA       +   +   R+  + AE+   K  ++  KEAE+  +K+  +  ++ +
Sbjct: 12  EELIEEAEEERKQRIAEAREEARELLEEAEEEASKLGEEIIKEAEEEIEKEAEKIREEGE 71

Query: 938 KDAEKMKKKGAEKI 951
           K+ E MK K  EKI
Sbjct: 72  KEIEAMKSKAKEKI 85


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 37.0 bits (86), Expect = 0.019
 Identities = 20/79 (25%), Positives = 43/79 (54%), Gaps = 7/79 (8%)

Query: 917 KKKEAEKMRKKKGAEK-MKKKK---KDAEKMKKKGAEKIGKKKKKKKKKKKKK---KKKK 969
           K  + +K+ ++KG EK + +K+   ++ +    +   KI + +++    K+KK    K+ 
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161

Query: 970 KKKKKEAEKMKKKTEEAEK 988
            + K EA K+K++  +AE 
Sbjct: 162 ARLKSEASKIKQELVDAEL 180


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 38.5 bits (90), Expect = 0.020
 Identities = 20/100 (20%), Positives = 58/100 (58%), Gaps = 9/100 (9%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAEK----- 950
            + + +K++   K+ + ++  +  E + +  +K  EK++++ + AE+ ++  AE      
Sbjct: 168 SKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL 227

Query: 951 --IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             +  K K+ +K+ ++ +++  + ++E E+++++ EEAEK
Sbjct: 228 ALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEK 267



 Score = 36.6 bits (85), Expect = 0.085
 Identities = 16/96 (16%), Positives = 55/96 (57%), Gaps = 4/96 (4%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           +L   ++   K ++ AEK  + ++ K E  ++       K+K+ +K+ E+++    E++ 
Sbjct: 194 LLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELE----EELS 249

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + +++ ++ +++ ++ +K+ ++   ++++  EE E+
Sbjct: 250 RLEEELEELQEELEEAEKEIEELKSELEELREELEE 285



 Score = 34.3 bits (79), Expect = 0.35
 Identities = 15/94 (15%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
             + R   ++ +      K++ ++ E E    +   E++++ + + E+++++  E   K 
Sbjct: 276 LEELREELEELQEELLELKEEIEELEGEISLLR---ERLEELENELEELEERLEELKEKI 332

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +  K++ ++++   ++ ++  AE  + K E  EK
Sbjct: 333 EALKEELEERETLLEELEQLLAELEEAKEELEEK 366



 Score = 31.6 bits (72), Expect = 3.0
 Identities = 12/60 (20%), Positives = 37/60 (61%), Gaps = 1/60 (1%)

Query: 929 GAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           G  K K++K++AE+  ++  E + ++ +   ++ +K+ +K +++ ++AE+ ++   E  +
Sbjct: 166 GVSKYKERKEEAERKLERTEENL-ERLEDLLEELEKQLEKLERQAEKAERYQELKAELRE 224


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 36.5 bits (85), Expect = 0.024
 Identities = 17/79 (21%), Positives = 49/79 (62%), Gaps = 4/79 (5%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK---KKGAEKIGKKKKKKK 959
           ++ K A+ + + ++ K+EA+ +   +  +++++ ++ A ++    KK AE+I ++ K + 
Sbjct: 38  RQAKIADDLAEAERLKEEAQAL-LAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEA 96

Query: 960 KKKKKKKKKKKKKKKEAEK 978
           +++ ++ K+  + + EAEK
Sbjct: 97  EEELERIKEAAEAEIEAEK 115



 Score = 33.4 bits (77), Expect = 0.26
 Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 9/80 (11%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAE-KMKKKKKDAEKMKKK--------GAEKIGKKK 955
           KK AE+I ++ K + E E  R K+ AE +++ +K+ A +  +          AEK+  KK
Sbjct: 82  KKEAEQIAEEIKAEAEEELERIKEAAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKK 141

Query: 956 KKKKKKKKKKKKKKKKKKKE 975
             +  +K        +  + 
Sbjct: 142 VDEAAQKDLIDAFIAELGEN 161



 Score = 33.4 bits (77), Expect = 0.27
 Identities = 16/98 (16%), Positives = 45/98 (45%), Gaps = 5/98 (5%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKK-----KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
           R K   +      E+  ++ +++     ++A+K  ++   E   + +++ E++K+    +
Sbjct: 51  RLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEAE 110

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           I  +K++  ++ + +  +      E    KK  E A+K
Sbjct: 111 IEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQK 148



 Score = 31.5 bits (72), Expect = 0.95
 Identities = 16/79 (20%), Positives = 44/79 (55%), Gaps = 8/79 (10%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA-EKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           E+  K      EAE+++++  A+ +  + +   E+ +++ +E I    ++ KK+ ++  +
Sbjct: 37  ERQAKIADDLAEAERLKEE--AQALLAEYEQELEEAREQASEII----EQAKKEAEQIAE 90

Query: 968 KKKKK-KKEAEKMKKKTEE 985
           + K + ++E E++K+  E 
Sbjct: 91  EIKAEAEEELERIKEAAEA 109


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 36.2 bits (84), Expect = 0.024
 Identities = 25/90 (27%), Positives = 54/90 (60%), Gaps = 13/90 (14%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
           + ++ KKKG E++   K+ +KE        G  K K++++      KK  E+   K+K++
Sbjct: 1   KKKKAKKKGVEEVVFDKEARKEY-----LTGFHKRKQQRR------KKAQEEA--KEKER 47

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +++ +++K+ ++++K+E EK  K+ +EA K
Sbjct: 48  EERIEERKRIREERKQELEKQLKERKEALK 77



 Score = 34.2 bits (79), Expect = 0.11
 Identities = 21/69 (30%), Positives = 42/69 (60%), Gaps = 12/69 (17%)

Query: 879 NDDVRREAVTALGFILFRK-KRTRRKKKKGAEKIRKKKKKKKEAE--KMR--KKKGAEKM 933
           + + R+E +T      F K K+ RRKK +  E+ ++K+++++  E  ++R  +K+  EK 
Sbjct: 16  DKEARKEYLTG-----FHKRKQQRRKKAQ--EEAKEKEREERIEERKRIREERKQELEKQ 68

Query: 934 KKKKKDAEK 942
            K++K+A K
Sbjct: 69  LKERKEALK 77


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 36.0 bits (83), Expect = 0.025
 Identities = 14/58 (24%), Positives = 17/58 (29%), Gaps = 3/58 (5%)

Query: 127 LQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTL 184
           L       P+   T    + P    PTP    T A  PAP       +  P P     
Sbjct: 82  LTPPAPPEPVTPPT---AQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQ 136


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 36.5 bits (85), Expect = 0.025
 Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 2/51 (3%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K   E ++ K    I   K+K K+   ++KK      +  EK++KK  E E
Sbjct: 86  KSSEELVQMKK--AISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAEQE 134



 Score = 31.5 bits (72), Expect = 1.3
 Identities = 11/47 (23%), Positives = 22/47 (46%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
            E ++ KK  +   +K   +   +KK      +  +K +KK  ++EA
Sbjct: 89  EELVQMKKAISALKQKIKRDSAERKKAASSGSRAIEKLRKKLAEQEA 135



 Score = 30.4 bits (69), Expect = 3.2
 Identities = 13/44 (29%), Positives = 18/44 (40%), Gaps = 3/44 (6%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
            KK   A K K K       ++KK      +  +K +KK  E E
Sbjct: 94  MKKAISALKQKIKRDSA---ERKKAASSGSRAIEKLRKKLAEQE 134



 Score = 29.6 bits (67), Expect = 5.0
 Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 2/43 (4%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           +E  +M+K   A K K K+  AE+  KK A    +  +K +KK
Sbjct: 89  EELVQMKKAISALKQKIKRDSAER--KKAASSGSRAIEKLRKK 129


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
           conserved protein active in cell division in a number of
           Proteobacteria. The N-terminal 30 residue region tends
           to by Lys/Arg-rich, and is followed by a
           membrane-spanning region. This is followed by an acidic
           low-complexity region of variable length and a
           well-conserved C-terminal domain of two tandem regions
           matched by pfam05036 (Sporulation related repeat), found
           in several cell division and sporulation proteins. The
           role of FtsN as a suppressor for other cell division
           mutations is poorly understood; it may involve cell wall
           hydrolysis [Cellular processes, Cell division].
          Length = 298

 Score = 37.4 bits (86), Expect = 0.026
 Identities = 23/97 (23%), Positives = 39/97 (40%), Gaps = 4/97 (4%)

Query: 896 RKKRTRRKKKKGA--EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
              R   K    A  E+    + +K+ AEK  +K    + +K   + EK+  K  E   K
Sbjct: 127 ADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQK 186

Query: 954 KK--KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +K   K+  + +   K  +   K  +  K K +  EK
Sbjct: 187 QKALPKQTAETQSNSKPIETAPKADKADKTKPKPKEK 223



 Score = 35.1 bits (80), Expect = 0.15
 Identities = 22/80 (27%), Positives = 35/80 (43%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           K     EKI  K K+ K+ +K   K+ AE     K      K   A+K   K K+K ++ 
Sbjct: 168 KTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKADKADKTKPKPKEKAERA 227

Query: 963 KKKKKKKKKKKKEAEKMKKK 982
              +      K++AE ++ K
Sbjct: 228 AALQCGAYANKEQAESVRAK 247



 Score = 34.3 bits (78), Expect = 0.31
 Identities = 16/86 (18%), Positives = 40/86 (46%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           ++++  E+++   +  ++       +    ++ +K+ AEK  +K      +K   + +K 
Sbjct: 117 EQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKI 176

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
             K K+ K+K+K   K   +T+   K
Sbjct: 177 ASKVKEAKQKQKALPKQTAETQSNSK 202



 Score = 31.2 bits (70), Expect = 2.9
 Identities = 19/76 (25%), Positives = 30/76 (39%), Gaps = 2/76 (2%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
           KK    +  + +K   E EK+  K   ++ K+K+K   K   +        +   K  K 
Sbjct: 156 KKPQKARTAEAQKTPVETEKIASK--VKEAKQKQKALPKQTAETQSNSKPIETAPKADKA 213

Query: 964 KKKKKKKKKKKEAEKM 979
            K K K K+K E    
Sbjct: 214 DKTKPKPKEKAERAAA 229


>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region.  This
           domain is the C-terminal end of ribosomal L18/L5
           proteins.
          Length = 93

 Score = 34.8 bits (81), Expect = 0.027
 Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 3/50 (6%)

Query: 939 DAEKMKKKGAEKIGK---KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           D E+M KK    I     +KKK+KK+ K + K+   KK   E+ K +  +
Sbjct: 43  DIEEMYKKAHAAIRADPSRKKKEKKEVKAESKRYNAKKLTLEQRKARVAQ 92



 Score = 27.9 bits (63), Expect = 6.9
 Identities = 9/33 (27%), Positives = 16/33 (48%)

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
            +K  KK+ K + K+   KK   +++K     K
Sbjct: 61  RKKKEKKEVKAESKRYNAKKLTLEQRKARVAQK 93


>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain.  This family
           contains a C-terminal presumed domain in Sas10 which
           hash been identified as a regulator of chromatin
           silencing.
          Length = 76

 Score = 34.5 bits (80), Expect = 0.028
 Identities = 15/64 (23%), Positives = 29/64 (45%), Gaps = 7/64 (10%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           TR++KK    +  + KK+KK  + +++ K     ++  +  E+    G E  G K    +
Sbjct: 20  TRKRKKD--RRNPRVKKRKKYEKAVKRLKS---QRRVVR--EEDGGYGGETTGIKTNVVR 72

Query: 960 KKKK 963
             K 
Sbjct: 73  SVKL 76



 Score = 28.4 bits (64), Expect = 3.6
 Identities = 12/43 (27%), Positives = 25/43 (58%)

Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            K KG  +  KK ++  + KK+KK +K  K+ ++++   + E+
Sbjct: 14  EKNKGLTRKRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVREED 56



 Score = 28.0 bits (63), Expect = 4.5
 Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 11/63 (17%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
            K+K + ++ K KG  + +KK             +  + KK+KK +K  K+ K +++   
Sbjct: 5   GKRKISYQIEKNKGLTRKRKKD-----------RRNPRVKKRKKYEKAVKRLKSQRRVVR 53

Query: 976 AEK 978
            E 
Sbjct: 54  EED 56



 Score = 27.2 bits (61), Expect = 8.2
 Identities = 16/61 (26%), Positives = 30/61 (49%), Gaps = 3/61 (4%)

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K+K + +++K K    K KK    +   + KK+KK +K  K+ K +++   E+      E
Sbjct: 6   KRKISYQIEKNKGLTRKRKKD--RRN-PRVKKRKKYEKAVKRLKSQRRVVREEDGGYGGE 62

Query: 986 A 986
            
Sbjct: 63  T 63


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 38.0 bits (89), Expect = 0.028
 Identities = 26/98 (26%), Positives = 42/98 (42%), Gaps = 9/98 (9%)

Query: 899 RTRRKKKKGAE--KIRKKKK-----KKKEAEKMRKKKGAEKMKKKKKDA--EKMKKKGAE 949
           R   ++KK AE  K R + +     ++K A + R KK AE    K KDA    + +  A+
Sbjct: 439 RAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAK 498

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K    +    K   +         +EA K + +  +AE
Sbjct: 499 KAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAE 536



 Score = 37.6 bits (88), Expect = 0.036
 Identities = 12/83 (14%), Positives = 31/83 (37%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
            R+ R  R+K     + +K  + +   +K        ++K KK  A +     A      
Sbjct: 457 ARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDN 516

Query: 955 KKKKKKKKKKKKKKKKKKKKEAE 977
                 ++ +K + + ++ ++  
Sbjct: 517 SAVIAAREARKAQARARQAEKQA 539



 Score = 31.5 bits (72), Expect = 2.4
 Identities = 16/82 (19%), Positives = 33/82 (40%), Gaps = 5/82 (6%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           R+K  R  + K A + R  K K   A  + + K      KK    + +  K   +     
Sbjct: 464 REKAAREARHKKAAEARAAKDKDAVAAALARVKA-----KKAAATQPIVIKAGARPDNSA 518

Query: 956 KKKKKKKKKKKKKKKKKKKEAE 977
               ++ +K + + ++ +K+A 
Sbjct: 519 VIAAREARKAQARARQAEKQAA 540



 Score = 30.7 bits (70), Expect = 5.0
 Identities = 23/105 (21%), Positives = 38/105 (36%), Gaps = 27/105 (25%)

Query: 910 KIRKKKKKKKEAE--------------------KMRKKKGAEKMKKKKKDA-------EK 942
           +IR  +++KK+AE                    + R KK AE    K KDA        K
Sbjct: 437 EIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVK 496

Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            KK  A +    K   +         ++ +K +A   + + + A 
Sbjct: 497 AKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAA 541


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 37.9 bits (89), Expect = 0.029
 Identities = 25/101 (24%), Positives = 43/101 (42%), Gaps = 7/101 (6%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
             KK+   K  K  EK      K        K++  +   +K+++ EK+ K+  E + K 
Sbjct: 138 HLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKL 197

Query: 955 KKKKKKKKKKKKKKKKK-------KKKEAEKMKKKTEEAEK 988
           + +KKK+ K    K+          K+E   MK+  EE  +
Sbjct: 198 EAEKKKQSKNFDPKEGPVQIGKKIDKEEITPMKEINEEERR 238



 Score = 34.8 bits (81), Expect = 0.26
 Identities = 15/78 (19%), Positives = 35/78 (44%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            + ++KK   K   +  +   G  K+  +  D+++  +K   +  ++ +K  K+  +  K
Sbjct: 136 RDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMK 195

Query: 968 KKKKKKKEAEKMKKKTEE 985
           K + +KK+  K     E 
Sbjct: 196 KLEAEKKKQSKNFDPKEG 213



 Score = 30.2 bits (69), Expect = 6.4
 Identities = 22/83 (26%), Positives = 36/83 (43%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           K   +I +   KKK   K+ K+         K D E    K   +  + +K+++ +K  K
Sbjct: 129 KVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAK 188

Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
           +  +  KK EAEK K+      K
Sbjct: 189 EALEAMKKLEAEKKKQSKNFDPK 211


>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
           [Transcription].
          Length = 182

 Score = 36.4 bits (84), Expect = 0.029
 Identities = 22/96 (22%), Positives = 41/96 (42%), Gaps = 4/96 (4%)

Query: 894 LFRKKRTRRKKKKGAEK----IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
           L R ++T R   KG  K    +R+  +K KE      +   EK++K+        +   +
Sbjct: 71  LGRTEQTVRNHLKGETKAGQLVRETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKK 130

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
             G+     K+  ++K K +  K +  E++  K E 
Sbjct: 131 IKGEHFPMDKELLEEKLKGEVIKGEPLEEVIDKLEY 166



 Score = 36.4 bits (84), Expect = 0.030
 Identities = 17/90 (18%), Positives = 38/90 (42%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           + T  K K+G     + + +K E E ++ +   E +KK K +   M K+  E+  K +  
Sbjct: 93  RETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVI 152

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K +  ++   K +   +    +  + +   
Sbjct: 153 KGEPLEEVIDKLEYPVESPADLLHEIKIHL 182


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 35.6 bits (83), Expect = 0.032
 Identities = 23/85 (27%), Positives = 51/85 (60%), Gaps = 7/85 (8%)

Query: 905 KKGAEKIRKKKKK-KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
           K   +++ K+ KK + E EK+ K+     ++K K+  +K     +E   ++KK+K+ +KK
Sbjct: 17  KAAQKQLEKEFKKRQAELEKLEKE-----LQKLKEKLQKDAATLSEAA-REKKEKELQKK 70

Query: 964 KKKKKKKKKKKEAEKMKKKTEEAEK 988
            ++ ++K++K + +  K++ EE +K
Sbjct: 71  VQEFQRKQQKLQQDLQKRQQEELQK 95


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 35.3 bits (82), Expect = 0.033
 Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 15/85 (17%)

Query: 916 KKKKEAEKMRKKKGAE-----------KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
            +++E ++ RK+   E             +K     E+  K   E+  +K+KKKKKKK+ 
Sbjct: 32  AREEERKRERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKEL 91

Query: 965 KK----KKKKKKKKEAEKMKKKTEE 985
           +     + ++KKK+E   + KK EE
Sbjct: 92  EDFYRFQLREKKKEEQADLLKKFEE 116



 Score = 30.3 bits (69), Expect = 2.1
 Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 14/82 (17%)

Query: 891 GFILF-RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEK--------MKKKKKDAE 941
           GF+   R  R      + A K  +++KK+KE +K +KK   E          +KKK++  
Sbjct: 52  GFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKK---ELEDFYRFQLREKKKEEQA 108

Query: 942 KMKKKGAEKIGKKKKKKKKKKK 963
            + KK  E   K++ +K K+K+
Sbjct: 109 DLLKKFEED--KERVEKMKEKR 128



 Score = 29.2 bits (66), Expect = 4.8
 Identities = 27/98 (27%), Positives = 47/98 (47%), Gaps = 15/98 (15%)

Query: 896 RKKRTRRKKKKGAEK--------IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
            ++R R +K+   E         +R  +K     E+  K    EK +K+KK   K KKK 
Sbjct: 34  EEERKRERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKK---KKKKKE 90

Query: 948 AEKIGKKKKKKKKKKK----KKKKKKKKKKKEAEKMKK 981
            E   + + ++KKK++     KK ++ K++ E  K K+
Sbjct: 91  LEDFYRFQLREKKKEEQADLLKKFEEDKERVEKMKEKR 128


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 36.6 bits (85), Expect = 0.035
 Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 4/69 (5%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE-KIGKKKKKKKKKKKKKK 966
            E +RKK+K+KKE  K++KK+    + +K    E + K   E  +G+ +   +  +  KK
Sbjct: 158 LELLRKKEKEKKEQLKIQKKQ---SLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKK 214

Query: 967 KKKKKKKKE 975
           KK K K +E
Sbjct: 215 KKAKSKYEE 223



 Score = 31.6 bits (72), Expect = 1.4
 Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK-KKGAEKIGKKKKKKK 959
           R+K+K+  E+++ +KK+      + K  G E + K  K+    + +   E     KKKK 
Sbjct: 162 RKKEKEKKEQLKIQKKQS----LLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKKA 217

Query: 960 KKK 962
           K K
Sbjct: 218 KSK 220


>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI).  YihI activates the
           GTPase activity of Der, a 50S ribosomal subunit
           stability factor. The stimulation is specific to Der as
           YihI does not stimulate the GTPase activity of Era or
           ObgE. The interaction of YihI with Der requires only the
           C-terminal 78 amino acids of YihI. A yihI deletion
           mutant is viable and shows a shorter lag period, but the
           same post-lag growth rate as a wild-type strain. yihI is
           expressed during the lag period. Overexpression of yihI
           inhibits cell growth and biogenesis of the 50S ribosomal
           subunit. YihI is an unusual, highly hydrophilic protein
           with an uneven distribution of charged residues,
           resulting in an N-terminal region with high pI and a
           C-terminal region with low pI.
          Length = 169

 Score = 36.2 bits (84), Expect = 0.036
 Identities = 30/96 (31%), Positives = 43/96 (44%), Gaps = 25/96 (26%)

Query: 902 RKKKKGAEKIRKKKKKK---------KEAEKMRKKKG-------AEKMKKKKKDAEKMKK 945
           RK  K   K+  K KKK         +E ++ +K+KG        E+ + +K+     KK
Sbjct: 1   RKSGKNGPKLAPKGKKKTRYELDQEARERKRKKKRKGLKSGSRHNEESESQKQKGAAQKK 60

Query: 946 KGAEKIGKKKK-------KKKKKKKKKKKKKKKKKK 974
               +IG KK        K K +K+  KKKK KK K
Sbjct: 61  D--PRIGSKKPVPLIVEFKVKPEKQAPKKKKVKKPK 94


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 35.6 bits (82), Expect = 0.037
 Identities = 23/114 (20%), Positives = 52/114 (45%), Gaps = 10/114 (8%)

Query: 884 REAVTALGFILFR--------KKRTRRKKKKGAEKIRKKKKKKK-EAEKMRKKKGAEKMK 934
           R  V   G++L R        K + +  +K+ AE+   ++KK K     +R    A    
Sbjct: 19  RIIVIVGGYLLIRPYFIKLGAKAQEKEHEKERAEREEAREKKAKISPNALRGGATAGH-G 77

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           ++  D E+ ++  A      +  KK +K+++K  +K  + E +  + + ++ ++
Sbjct: 78  EEDTDDEEDEEDFATPSAVPQWGKKARKRQRKVIRKLLEAEEQLREDQYDDEDE 131


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 34.5 bits (80), Expect = 0.038
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
            + ++++++   K K       GKKKKK+K KKK+ +  
Sbjct: 26  KDYLEEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64



 Score = 30.7 bits (70), Expect = 0.83
 Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           ++K+ K+K  +     KKKKK++A+K + +   A 
Sbjct: 33  EEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPAS 67



 Score = 29.5 bits (67), Expect = 2.1
 Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 3/46 (6%)

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           E+ ++K  ++   K  +     KKKKK+K KKK++         EA
Sbjct: 30  EEQEEKELKQ---KADEGNNSGKKKKKRKAKKKRDEAGPASTAAEA 72



 Score = 29.1 bits (66), Expect = 2.6
 Identities = 14/53 (26%), Positives = 29/53 (54%), Gaps = 6/53 (11%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK------KGAEKMKKKKKDAEKM 943
           +K  ++K  +G    +KKKK+K + ++          +  +KM +KKK ++K+
Sbjct: 34  EKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQKKKFSKKI 86


>gnl|CDD|150240 pfam09495, DUF2462, Protein of unknown function (DUF2462).  This
           protein is highly conserved, but its function is
           unknown. It can be isolated from HeLa cell nucleoli and
           is found to be homologous with Leydig cell tumour
           protein whose function is unknown.
          Length = 82

 Score = 34.3 bits (79), Expect = 0.038
 Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 3/44 (6%)

Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKK---KKKKKKKKEAEKMKKK 982
           K+K KG   +   KKK+K  KK      K KK +  +A+K+KK 
Sbjct: 7   KVKAKGPAAVKAGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKV 50



 Score = 30.5 bits (69), Expect = 0.87
 Identities = 18/53 (33%), Positives = 23/53 (43%)

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           K KG   +K  KK  +  KK G   I  KK +  + +K KK       K  EK
Sbjct: 9   KAKGPAAVKAGKKKQKGPKKAGPRVIKPKKAQAVQAQKLKKVLTSGLNKSTEK 61



 Score = 28.5 bits (64), Expect = 4.1
 Identities = 18/63 (28%), Positives = 32/63 (50%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           KK+ +  KK G   I+ KK +  +A+K++K   +   K  +K       +G  ++ K  +
Sbjct: 20  KKKQKGPKKAGPRVIKPKKAQAVQAQKLKKVLTSGLNKSTEKLIASRASEGHLELLKGVR 79

Query: 957 KKK 959
           KKK
Sbjct: 80  KKK 82


>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
          Length = 158

 Score = 35.9 bits (83), Expect = 0.040
 Identities = 16/82 (19%), Positives = 34/82 (41%), Gaps = 7/82 (8%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK-------KKKKKKKK 966
           KK+ +   E ++KK   E++   K  AE   K   + +   KK            ++   
Sbjct: 67  KKEFESAVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSN 126

Query: 967 KKKKKKKKEAEKMKKKTEEAEK 988
           + K  ++  +++  K+ + A K
Sbjct: 127 ETKLTQESSSDESPKEVKLATK 148



 Score = 34.7 bits (80), Expect = 0.090
 Identities = 15/87 (17%), Positives = 32/87 (36%), Gaps = 8/87 (9%)

Query: 907 GAEKIRKKKK-----KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
             E ++KK K       K   +   K   + ++  KK  E+           ++   + K
Sbjct: 73  AVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDT---IPNHLNEEVSNETK 129

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             ++    +  K+     K KT++ +K
Sbjct: 130 LTQESSSDESPKEVKLATKNKTKKHDK 156


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 36.6 bits (85), Expect = 0.042
 Identities = 16/83 (19%), Positives = 42/83 (50%), Gaps = 4/83 (4%)

Query: 908 AEKIRKKKK--KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
             +I++ +K  KK +AE     K  E ++ + +D E    +   +I  ++ +++ K+ ++
Sbjct: 23  EPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEI--QEIRERIKRAEE 80

Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
           K    K ++E   +  + + A++
Sbjct: 81  KLSAVKDERELRALNIEIQIAKE 103



 Score = 33.9 bits (78), Expect = 0.28
 Identities = 20/116 (17%), Positives = 49/116 (42%), Gaps = 11/116 (9%)

Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRKKKKK-KKEAEKMRKK--KGAEKMKKKKKDA 940
           R+A+      L    +     +   E +  +  + + E +++R++  +  EK+   K + 
Sbjct: 30  RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDER 89

Query: 941 EKMKKKGAEKIGKKKKKKK--------KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           E        +I K++            ++ +K +K+ +  K+  E+++K   EAE 
Sbjct: 90  ELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEA 145



 Score = 31.2 bits (71), Expect = 2.1
 Identities = 14/87 (16%), Positives = 39/87 (44%), Gaps = 6/87 (6%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           R++ K AE+     K ++E   +  +    K +    + E        ++ ++ +K +K+
Sbjct: 72  RERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDE------LAELMEEIEKLEKE 125

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +  K++ ++ +K   + + + EE   
Sbjct: 126 IEDLKERLERLEKNLAEAEARLEEEVA 152



 Score = 29.6 bits (67), Expect = 7.6
 Identities = 14/96 (14%), Positives = 41/96 (42%), Gaps = 3/96 (3%)

Query: 893 ILFRKKRTRRKKKKGAEKIR-KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
              R +   ++ +K  +K + + +   K  E +  +   E ++ +    E   ++  E+I
Sbjct: 18  EKDRLEPRIKEIRKALKKAKAELEALNKALEALEIEL--EDLENQVSQLESEIQEIRERI 75

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            + ++K    K +++ +    + +  K +  + E E
Sbjct: 76  KRAEEKLSAVKDERELRALNIEIQIAKERINSLEDE 111


>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
          Length = 381

 Score = 37.0 bits (86), Expect = 0.043
 Identities = 25/82 (30%), Positives = 37/82 (45%), Gaps = 2/82 (2%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           R  KKK A++  +KK   K      +   A K  K +K A   ++K   K  +K K ++ 
Sbjct: 302 RPAKKKPAKRRVQKKNPLKNRRVRLRLNPAAK--KLRKLAVLAQEKATAKGAQKVKNRRA 359

Query: 961 KKKKKKKKKKKKKKEAEKMKKK 982
           +KK KK +  K  K   K  K 
Sbjct: 360 RKKAKKARLAKVAKALYKSIKA 381



 Score = 34.7 bits (80), Expect = 0.24
 Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 4/79 (5%)

Query: 914 KKKKKKEAEKMRKKKGAEK----MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           +  KKK A++  +KK   K      +    A+K++K       K   K  +K K ++ +K
Sbjct: 302 RPAKKKPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARK 361

Query: 970 KKKKKEAEKMKKKTEEAEK 988
           K KK    K+ K   ++ K
Sbjct: 362 KAKKARLAKVAKALYKSIK 380



 Score = 34.7 bits (80), Expect = 0.25
 Identities = 21/85 (24%), Positives = 34/85 (40%), Gaps = 10/85 (11%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K   RR +KK   K R+ + +   A K  +K      +K            A+   K K 
Sbjct: 307 KPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKAT----------AKGAQKVKN 356

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKK 981
           ++ +KK KK +  K  K   + +K 
Sbjct: 357 RRARKKAKKARLAKVAKALYKSIKA 381


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 37.2 bits (87), Expect = 0.044
 Identities = 19/100 (19%), Positives = 53/100 (53%), Gaps = 3/100 (3%)

Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKK---GAEKMKKKKKDAEKMKK 945
            +G++L ++   ++         R  ++ KKEAE ++K+      E++ K + + E+  K
Sbjct: 13  IIGYLLRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELK 72

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           +   ++ + +++  ++++   +K +   K+ E ++KK +E
Sbjct: 73  ERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKE 112



 Score = 33.7 bits (78), Expect = 0.49
 Identities = 28/113 (24%), Positives = 56/113 (49%), Gaps = 7/113 (6%)

Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKK--KKKKEAEKMRKKKGAE--KMKKKKK 938
           +R A   LG      KR   + KK AE ++K+   + K+E  K+R +   E  + + + +
Sbjct: 20  KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQ 79

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKK---KKKKKKKKKKKEAEKMKKKTEEAEK 988
             E+   +  E + +K +   KK++   KK+K+   K+K  ++ +++ EE   
Sbjct: 80  RLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIA 132



 Score = 32.6 bits (75), Expect = 1.3
 Identities = 23/103 (22%), Positives = 54/103 (52%), Gaps = 6/103 (5%)

Query: 889 ALGFILFR--KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
            +G I+    +KR   KK   AE     K+  +EA+K  +    E + + K++  K++ +
Sbjct: 9   IVGLIIGYLLRKRIAEKKLGSAE--ELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAE 66

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKK-KKEAEKMKKKTEEAEK 988
             E+  K+++ + ++ +++  ++++   ++ E + KK E  EK
Sbjct: 67  -LERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEK 108



 Score = 29.9 bits (68), Expect = 7.7
 Identities = 22/111 (19%), Positives = 57/111 (51%), Gaps = 4/111 (3%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
           ++ ++EA T     L   K    K +   E+  K+++ + +  + R  +  E + +K + 
Sbjct: 39  EEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMES 98

Query: 940 AEKMKK---KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT-EEA 986
            +K ++   K  +++  K+K   +K+++ ++   ++++E E++   T EEA
Sbjct: 99  LDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEA 149


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 37.1 bits (85), Expect = 0.045
 Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 9/68 (13%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           KK  G +KI  +K+K+ +    + + G  ++K KK+ AEK         G ++++  +  
Sbjct: 564 KKTGGMKKIMDEKRKRLKNNIEQTQDGKPELKIKKRKAEK---------GDQRQELDRIV 614

Query: 963 KKKKKKKK 970
           K  K+  K
Sbjct: 615 KSIKRSGK 622



 Score = 35.1 bits (80), Expect = 0.20
 Identities = 27/135 (20%), Positives = 49/135 (36%), Gaps = 28/135 (20%)

Query: 876 SDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK 935
           SD  ++   E           +  +  K +   EK+ +K ++++E  K R+      +KK
Sbjct: 402 SDDGENGEMEDTFTSHLPASNESESDDKLETTIEKLDRKLRERQENRKERQ------LKK 455

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK----------------------K 973
            K D++   K   E I KK KK K   ++    K++                       K
Sbjct: 456 TKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELIKADDEDDEQLDHFDMKSILK 515

Query: 974 KEAEKMKKKTEEAEK 988
            E  K  +K ++   
Sbjct: 516 AEKFKKNRKLKKKAS 530



 Score = 31.3 bits (70), Expect = 3.4
 Identities = 23/99 (23%), Positives = 39/99 (39%), Gaps = 20/99 (20%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK------ 950
           K   + ++++   K R+ KK K +++   K K     KK KK    +++  A K      
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELI 495

Query: 951 --------------IGKKKKKKKKKKKKKKKKKKKKKKE 975
                         +    K +K KK +K KKK    +E
Sbjct: 496 KADDEDDEQLDHFDMKSILKAEKFKKNRKLKKKASNLEE 534


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 33.8 bits (78), Expect = 0.048
 Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 8/50 (16%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK------KKKEAEKMKKK 982
             E  ++  A K+ +  ++ ++KKKKKKK KKK      +KK  E+M+++
Sbjct: 20  GEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK--EQMRRR 67



 Score = 33.0 bits (76), Expect = 0.097
 Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 5/69 (7%)

Query: 859 CGSGSNVATKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKK 918
            G          L     + ++ +    V     I  R +R ++KKKK  +K ++   +K
Sbjct: 4   GGPSVPEQLPNGLGEEGEEDDEQIDASKVR---RISQRNRRKKKKKKKLKKKSKEWILRK 60

Query: 919 KEAEKMRKK 927
           K  E+MR++
Sbjct: 61  K--EQMRRR 67



 Score = 31.5 bits (72), Expect = 0.37
 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE-KMKKK 982
                E  +        K ++  ++ ++KKKKKKK KKK  E  ++KK
Sbjct: 14  NGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKK 61



 Score = 30.3 bits (69), Expect = 0.90
 Identities = 16/47 (34%), Positives = 30/47 (63%), Gaps = 7/47 (14%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG 947
             K ++ +++ R+KKKKKK   K++KK     ++KK    E+M+++G
Sbjct: 29  ASKVRRISQRNRRKKKKKK---KLKKKSKEWILRKK----EQMRRRG 68



 Score = 28.8 bits (65), Expect = 2.8
 Identities = 17/45 (37%), Positives = 30/45 (66%), Gaps = 2/45 (4%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           +A K+R+     + KKKKK  +K+KKK  E I +KK++ +++ K+
Sbjct: 28  DASKVRRISQRNRRKKKKK--KKLKKKSKEWILRKKEQMRRRGKE 70


>gnl|CDD|223625 COG0551, TopA, Zn-finger domain associated with topoisomerase type
           I [DNA replication, recombination, and repair].
          Length = 140

 Score = 35.2 bits (81), Expect = 0.048
 Identities = 20/74 (27%), Positives = 30/74 (40%), Gaps = 1/74 (1%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
            KG   ++K +  K         K     +K K   +K  K G+ K+ +KK+KKK    +
Sbjct: 67  GKGLLVLKKGRFGKNFLGCSNYPKC-RFTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLE 125

Query: 965 KKKKKKKKKKEAEK 978
              K    K E  K
Sbjct: 126 CSNKPCCLKVENLK 139



 Score = 33.7 bits (77), Expect = 0.14
 Identities = 24/113 (21%), Positives = 38/113 (33%), Gaps = 22/113 (19%)

Query: 896 RKKRTRRKKKKGA---EKIRKKKKKKKEAEKMRKK-----KGAEKMKKKKKD-------- 939
            KK  +     G     K    + +K  AEK   K     KG   +KK +          
Sbjct: 28  VKKFGKYGIFLGCSNYPKCDYYEPEKAIAEKTGVKCPKCGKGLLVLKKGRFGKNFLGCSN 87

Query: 940 ------AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
                  EK K K  +      +K  +KK+KKK    +   +   +K +  + 
Sbjct: 88  YPKCRFTEKPKPKEKKCPKCGSRKLVEKKEKKKGVFLECSNKPCCLKVENLKN 140


>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin.  The structure of
           protocadherins is similar to that of classic cadherins
           (pfam00028), but particularly on the cytoplasmic domains
           they also have some unique features. They are expressed
           in a variety of organisms and are found in high
           concentrations in the brain where they seem to be
           localised mainly at cell-cell contact sites. Their
           expression seems to be developmentally regulated.
          Length = 223

 Score = 36.4 bits (84), Expect = 0.050
 Identities = 15/38 (39%), Positives = 23/38 (60%)

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           ++KK  +  KK+  +     ++ K+KKKKK KKKK  K
Sbjct: 67  ERKKGYQAGKKETEDWFSPNQENKQKKKKKDKKKKSPK 104



 Score = 33.7 bits (77), Expect = 0.30
 Identities = 16/38 (42%), Positives = 19/38 (50%), Gaps = 2/38 (5%)

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           KKG +  KK+ +D          K  KKKK KKKK  K
Sbjct: 69  KKGYQAGKKETEDW--FSPNQENKQKKKKKDKKKKSPK 104



 Score = 30.2 bits (68), Expect = 4.6
 Identities = 16/34 (47%), Positives = 22/34 (64%), Gaps = 5/34 (14%)

Query: 945 KKGAEKIGKKKKKK----KKKKKKKKKKKKKKKK 974
           KKG +  GKK+ +      ++ K+KKKKK KKKK
Sbjct: 69  KKGYQA-GKKETEDWFSPNQENKQKKKKKDKKKK 101


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 37.4 bits (86), Expect = 0.050
 Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 20/108 (18%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK----- 955
           RR+     E+ RK   +K++ + +      EK   +    +K + +GA    K       
Sbjct: 573 RREVFPVVEQRRKLAPRKRKEDFVTPSTSLEKSMDRILHGQKKRAEGAVVFEKPLEATEN 632

Query: 956 ---------------KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                          KKK  ++ ++ K+ KKK  E E  ++    +EK
Sbjct: 633 FNPWLDRKMRRIKRIKKKAYRRIRRDKRLKKKMPEEENTQENHLGSEK 680



 Score = 32.8 bits (74), Expect = 1.0
 Identities = 16/97 (16%), Positives = 37/97 (38%), Gaps = 6/97 (6%)

Query: 897 KKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKK----KKKDAEKMKKKGAEK 950
           +K   R    +K   +     +K  EA +        KM++    KKK   ++++    K
Sbjct: 603 EKSMDRILHGQKKRAEGAVVFEKPLEATENFNPWLDRKMRRIKRIKKKAYRRIRRDKRLK 662

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
               +++  ++     +KK+        +K+   E +
Sbjct: 663 KKMPEEENTQENHLGSEKKRHGGVPDILLKEIEVEDD 699



 Score = 31.6 bits (71), Expect = 2.8
 Identities = 20/123 (16%), Positives = 58/123 (47%), Gaps = 4/123 (3%)

Query: 866 ATKRLLHVAVSDVNDDVRREAVTALGF---ILFRKKRTRRKKKKGAEKIRKKKKKKKEAE 922
           A K+   +A++ ++ +   E    L F   ++FR++R  ++  K   K  +K +K ++ +
Sbjct: 354 ALKKQEELALNKLSVEEVAERTRQLRFMRELMFREERKAKRVAKIKSKTYRKIRKNRKEK 413

Query: 923 KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           +M     +E ++ +K +  +  ++  ++  K      +K  ++    +  ++   +  +K
Sbjct: 414 EMALIPKSEDLENEKSEEARALERMTQR-HKNTSSWTRKMLERASHGEGTREAVNEQIRK 472

Query: 983 TEE 985
            +E
Sbjct: 473 GDE 475



 Score = 30.4 bits (68), Expect = 5.3
 Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 2/76 (2%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK--GAEKIGKKKKKKKKKKKKKKKKKKKK 972
              KK+ E    K   E++ ++ +    M++     E+  K+  K K K  +K +K +K+
Sbjct: 353 SALKKQEELALNKLSVEEVAERTRQLRFMRELMFREERKAKRVAKIKSKTYRKIRKNRKE 412

Query: 973 KKEAEKMKKKTEEAEK 988
           K+ A   K +  E EK
Sbjct: 413 KEMALIPKSEDLENEK 428


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 34.9 bits (81), Expect = 0.050
 Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 3/59 (5%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG-AEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           K K+A+ +      ++   KK   E+ +K    +K+ K K+++  KKK++K K K KKK
Sbjct: 81  KAKDADVI--NAAFKEFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKKERKDKVKNKKK 137



 Score = 32.6 bits (75), Expect = 0.31
 Identities = 34/131 (25%), Positives = 61/131 (46%), Gaps = 37/131 (28%)

Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKK-----------------KKEAEK------MR 925
           AL  +L   K+ +++   G + ++K  K+                 K+EA+K      ++
Sbjct: 7   ALKAVLKDIKKQKQQPPHGKQSLKKLLKQGRGLSNIEITDENLKKFKREAKKYGVDYAVK 66

Query: 926 KKKGAEKMKK----KKKDAEKM----KKKGAEKIGKKKKKKK------KKKKKKKKKKKK 971
           K K     K     K KDA+ +    K+  A+K+ K++ +K       KK K+++  KKK
Sbjct: 67  KDKNNGPPKYLVFFKAKDADVINAAFKEFTAKKLKKEQDEKPSVRKKLKKFKEEQAAKKK 126

Query: 972 KKKEAEKMKKK 982
           ++K+  K KKK
Sbjct: 127 ERKDKVKNKKK 137


>gnl|CDD|179522 PRK02998, prsA, peptidylprolyl isomerase; Reviewed.
          Length = 283

 Score = 36.5 bits (84), Expect = 0.051
 Identities = 36/144 (25%), Positives = 70/144 (48%), Gaps = 9/144 (6%)

Query: 853 TIAMAYCGSGSNVATKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIR 912
            +A++ CGS  NV T ++ ++   +++ ++R++   +  + +   K    K K   E+ +
Sbjct: 16  VLALSACGSSDNVVTSKVGNITEKELSKELRQKYGESTLYQMVLSKALLDKYKVSDEEAK 75

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKD--AEKMKKKGA-EKIGKKKKKKKKKKKKKKKKK 969
           K+ ++ K+      K   E++  K +D   EKMK + A EK  K    +K  K   K + 
Sbjct: 76  KQVEEAKDKMGDNFKSTLEQVGLKNEDELKEKMKPEIAFEKAIKATVTEKDVKDNYKPEM 135

Query: 970 K------KKKKEAEKMKKKTEEAE 987
           K      K +K A+++K+K    E
Sbjct: 136 KVSHILVKDEKTAKEVKEKVNNGE 159


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 37.2 bits (87), Expect = 0.051
 Identities = 25/87 (28%), Positives = 41/87 (47%), Gaps = 18/87 (20%)

Query: 918 KKEAEKMRKKKGAEKMK---KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
             E E+M K   AE+     KK+K+  + K +  E +   +K  K++  K  +  KKK +
Sbjct: 503 DDEIERMVKD--AEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVE 560

Query: 975 EA-------------EKMKKKTEEAEK 988
           EA             E+++ KTEE +K
Sbjct: 561 EAIEWLKEELEGEDKEEIEAKTEELQK 587


>gnl|CDD|234465 TIGR04092, LTA_DltD, D-alanyl-lipoteichoic acid biosynthesis
           protein DltD.  Members of this protein family are DltD,
           part of the DltABCD system widely distributed in the
           Firmicutes for D-alanylation of lipoteichoic acids. The
           most common form of LTA, as in Staphylococcus aureus,
           has a backbone of polyglycerolphosphate.
          Length = 383

 Score = 36.8 bits (86), Expect = 0.052
 Identities = 31/109 (28%), Positives = 46/109 (42%), Gaps = 25/109 (22%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKK------KKKKEAEKMRKKKGAEKMKKKKK------DAE 941
           L +K     KK    +K+  +       +K+ +     K K  +K+KK  K      D E
Sbjct: 180 LLKKLYANGKKLSKFDKVYLEPYYRNLLEKEDDLFSRFKLKYKKKIKKAAKKLPETYDWE 239

Query: 942 KMKKKGAEKIGKKKK------------KKKKKKKKKKKKKKKKKKEAEK 978
           ++ KK AEK GKK+              K+ KKK KK K  +K  +  K
Sbjct: 240 ELDKK-AEKEGKKRTTNNPFGIDNKYYNKRIKKKLKKLKGSQKNFDYLK 287



 Score = 36.0 bits (84), Expect = 0.083
 Identities = 27/104 (25%), Positives = 44/104 (42%), Gaps = 9/104 (8%)

Query: 894 LFRKKRTRRKKKKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
           L   K +   K+  A+++ +    K  K  E + KK  A   K  K D   ++      +
Sbjct: 148 LKNPKISTETKRYAAKRLLELSVVKSDKILENLLKKLYANGKKLSKFDKVYLEPYYRNLL 207

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEA------EKMKKK-TEEAEK 988
            K+     + K K KKK KK  K+       E++ KK  +E +K
Sbjct: 208 EKEDDLFSRFKLKYKKKIKKAAKKLPETYDWEELDKKAEKEGKK 251



 Score = 35.7 bits (83), Expect = 0.12
 Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 3/79 (3%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           +   +K+    KK  + +K+  +     + +K+ D   +  +   K  KK KK  KK  +
Sbjct: 178 ENLLKKLYANGKKLSKFDKVYLEPYYRNLLEKEDD---LFSRFKLKYKKKIKKAAKKLPE 234

Query: 965 KKKKKKKKKKEAEKMKKKT 983
               ++  KK  ++ KK+T
Sbjct: 235 TYDWEELDKKAEKEGKKRT 253


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 37.0 bits (85), Expect = 0.056
 Identities = 15/58 (25%), Positives = 43/58 (74%), Gaps = 2/58 (3%)

Query: 929 GAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            + K+ KK+++A +  K+ AE+  K +++++++K+K+K++++++++EAE+  K +  +
Sbjct: 573 ASSKLAKKREEAVEKAKREAEQ--KAREEREREKEKEKEREREREREAERAAKASSSS 628



 Score = 33.1 bits (75), Expect = 0.85
 Identities = 16/61 (26%), Positives = 36/61 (59%), Gaps = 1/61 (1%)

Query: 886 AVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK-KGAEKMKKKKKDAEKMK 944
           A T L F      +  +K+++  EK +++ ++K   E+ R+K K  E+ ++++++AE+  
Sbjct: 563 ARTDLYFTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAA 622

Query: 945 K 945
           K
Sbjct: 623 K 623


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 35.9 bits (83), Expect = 0.058
 Identities = 19/66 (28%), Positives = 23/66 (34%), Gaps = 1/66 (1%)

Query: 920 EAEKMRKKKGAEKMKKKKKD-AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
            A+     K   K + K    A K   K  E I  KK KK     K    K     +   
Sbjct: 143 AAQAAAAPKAPAKPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKPTADKTATPAKKTT 202

Query: 979 MKKKTE 984
            KKKT+
Sbjct: 203 KKKKTK 208



 Score = 33.6 bits (77), Expect = 0.29
 Identities = 18/63 (28%), Positives = 20/63 (31%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
                K   K R K      K   K  E +  K A+K     K    K     KK  KKK
Sbjct: 146 AAAAPKAPAKPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKPTADKTATPAKKTTKKK 205

Query: 974 KEA 976
           K  
Sbjct: 206 KTK 208



 Score = 29.4 bits (66), Expect = 7.3
 Identities = 16/67 (23%), Positives = 18/67 (26%), Gaps = 3/67 (4%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
                 K   K R K  +          K  E +  KK        K          KK 
Sbjct: 145 QAAAAPKAPAKPRAKSPRPAAKAA---PKPTETITAKKAKKTAAAPKPTADKTATPAKKT 201

Query: 960 KKKKKKK 966
            KKKK K
Sbjct: 202 TKKKKTK 208


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 36.2 bits (84), Expect = 0.059
 Identities = 18/47 (38%), Positives = 25/47 (53%)

Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K +K   E   +  K+ K KKK  KKK+K   K AE   K+  +AE+
Sbjct: 5   KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAER 51



 Score = 35.4 bits (82), Expect = 0.093
 Identities = 21/74 (28%), Positives = 35/74 (47%), Gaps = 1/74 (1%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           E + KK+K ++E      K+   K KK  K   K+  K AE   K+ +K +++  + K+ 
Sbjct: 1   ELLLKKRKARQELAVQVAKQAKAK-KKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRL 59

Query: 969 KKKKKKEAEKMKKK 982
            KK  K     + K
Sbjct: 60  AKKPGKFYVPAEHK 73



 Score = 34.3 bits (79), Expect = 0.23
 Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 13/70 (18%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK-------------KKKKKKKKEAEK 978
            +KK+K   E   +   +   KKK  KKK+K   K             ++  + K+ A+K
Sbjct: 3   LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKK 62

Query: 979 MKKKTEEAEK 988
             K    AE 
Sbjct: 63  PGKFYVPAEH 72



 Score = 33.9 bits (78), Expect = 0.34
 Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 6/67 (8%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           KK+K  +++  +  K+ +A+K   KK   K K   K AE   K   E    +++  + K+
Sbjct: 5   KKRKARQELAVQVAKQAKAKKKANKK---KRKIYFKRAESFVK---EYRKAEREIIRLKR 58

Query: 963 KKKKKKK 969
             KK  K
Sbjct: 59  LAKKPGK 65



 Score = 32.7 bits (75), Expect = 0.73
 Identities = 18/73 (24%), Positives = 35/73 (47%), Gaps = 1/73 (1%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           +L +K++ R++      K  K KKK  + ++    K AE   K+ + AE+ +    +++ 
Sbjct: 2   LLLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAER-EIIRLKRLA 60

Query: 953 KKKKKKKKKKKKK 965
           KK  K     + K
Sbjct: 61  KKPGKFYVPAEHK 73



 Score = 31.9 bits (73), Expect = 1.1
 Identities = 20/81 (24%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 884 REAVTALGFILFRKKRTRRKKK---KGAEKI----RKKKKKKKEAEKMRKKKGAEKMKKK 936
             AV        +KK  ++K+K   K AE      RK +++    +++ KK G   +  +
Sbjct: 12  ELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGKFYVPAE 71

Query: 937 KKDAEKMKKKGAEKIGKKKKK 957
            K    ++ KG   I  K +K
Sbjct: 72  HKLVFVIRIKGINGIPPKPRK 92


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 36.4 bits (83), Expect = 0.070
 Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 916 KKKKEAEKMR---KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           + KKEAE  R    K G E  ++++K  ++ +K   E I     + K +++K+K +++K+
Sbjct: 132 QAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQ 191

Query: 973 KKEAEKMKKKTEE 985
           K E EK K     
Sbjct: 192 KTEQEKQKTSNIA 204



 Score = 35.3 bits (80), Expect = 0.18
 Identities = 23/87 (26%), Positives = 45/87 (51%), Gaps = 6/87 (6%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           R  K G E  ++++K ++E +K  K+       + K + EK K +      ++K+K +++
Sbjct: 143 RANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTE------QEKQKTEQE 196

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+K      K   E E+ K+KTE  ++
Sbjct: 197 KQKTSNIANKNAIELEQEKQKTENEKQ 223



 Score = 33.3 bits (75), Expect = 0.57
 Identities = 22/83 (26%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 897 KKRTRRKKKKGAEKIR-KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +K  + K+K   E I     + K E EK + ++  +K +++K+    +  K A ++ ++K
Sbjct: 156 QKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEK 215

Query: 956 KKKKKKKKKKKKKKKKKKKEAEK 978
           +K + +K+   K++K   KEAE+
Sbjct: 216 QKTENEKQDLIKEQKDFIKEAEQ 238



 Score = 31.0 bits (69), Expect = 3.5
 Identities = 16/77 (20%), Positives = 41/77 (53%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           E+  +++K+K E E +       K +++K+  E+ K+K  ++  K      K   + +++
Sbjct: 155 EQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQE 214

Query: 969 KKKKKKEAEKMKKKTEE 985
           K+K + E + + K+ ++
Sbjct: 215 KQKTENEKQDLIKEQKD 231


>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752).  This
           domain family is found in eukaryotes, and is typically
           between 140 and 163 amino acids in length.
          Length = 148

 Score = 34.7 bits (80), Expect = 0.071
 Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK---KKKKKKKKKK 971
           K K+ + E +  + GA         A K  K+ A K+ +  +KK+ K   ++ +KK+KKK
Sbjct: 42  KAKRLQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKK 101

Query: 972 KKKEAEK 978
           KK+E   
Sbjct: 102 KKEEEND 108



 Score = 28.5 bits (64), Expect = 10.0
 Identities = 21/77 (27%), Positives = 34/77 (44%), Gaps = 10/77 (12%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK---DAEK-MKKKGAEKI 951
           R K   RK K+  EK R K   ++  +K +KKK  E+     +   D EK +       +
Sbjct: 70  RDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKEEENDDPSRRPFDREKDL------AV 123

Query: 952 GKKKKKKKKKKKKKKKK 968
           G K  K ++++   K  
Sbjct: 124 GGKISKAQRRELINKAA 140


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 36.5 bits (84), Expect = 0.072
 Identities = 26/95 (27%), Positives = 43/95 (45%), Gaps = 4/95 (4%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAE----KMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
           + K     K K A   ++K++  +E         K K A   K K     K K++G E++
Sbjct: 94  KAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEV 153

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            +++++  K+K K K     K K A   K+K  EA
Sbjct: 154 TEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEA 188



 Score = 35.0 bits (80), Expect = 0.20
 Identities = 25/100 (25%), Positives = 42/100 (42%), Gaps = 16/100 (16%)

Query: 904 KKKGAEKIRKK-----KKKKKEAEKMR-------KKKGAEKMKKKKKDAEKMKKKGAEKI 951
           K++ A   + K     K+K++  E++        K K A   K K     K K++G E++
Sbjct: 60  KRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEV 119

Query: 952 GKKKKKKKKKKK----KKKKKKKKKKKEAEKMKKKTEEAE 987
            +++K   K K     K K     K+K     +   EE E
Sbjct: 120 TEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEE 159



 Score = 33.0 bits (75), Expect = 0.84
 Identities = 25/104 (24%), Positives = 41/104 (39%)

Query: 885 EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
           E VT       + K     K K A   ++K++  +E  +  ++   EK K K   A K K
Sbjct: 117 EEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAK 176

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                K    +  +  ++  +++K K K K A   K K     K
Sbjct: 177 AAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAK 220



 Score = 33.0 bits (75), Expect = 0.88
 Identities = 22/104 (21%), Positives = 36/104 (34%), Gaps = 13/104 (12%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
            + +R+  +   +  K   K K A   + K  A   +K++   E  +++      K K K
Sbjct: 109 AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAK 168

Query: 958 KKKKKKKKKKKKKKKK-------------KEAEKMKKKTEEAEK 988
                K K     K+K             +E  K K K   A K
Sbjct: 169 AAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAK 212



 Score = 31.9 bits (72), Expect = 1.8
 Identities = 26/96 (27%), Positives = 46/96 (47%), Gaps = 5/96 (5%)

Query: 896 RKKRTRRKKKKGAEKIR--KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           +     ++K++G E++   +K   K +A    K K A   K+K++  E++ ++  E   +
Sbjct: 104 KAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKE 163

Query: 954 KKKKKKKK--KKKKKKKKKKKKKEA-EKMKKKTEEA 986
           K K K     K K     K+K  EA E  ++ TEE 
Sbjct: 164 KAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEE 199



 Score = 31.5 bits (71), Expect = 2.4
 Identities = 17/85 (20%), Positives = 36/85 (42%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +     ++K++G E++ +++++  + +   K   A K K      +K  + G       +
Sbjct: 138 KAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTE 197

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMK 980
           ++K K K K     K K     K K
Sbjct: 198 EEKAKAKAKAAAAAKAKAAALAKQK 222



 Score = 30.3 bits (68), Expect = 4.9
 Identities = 26/104 (25%), Positives = 43/104 (41%)

Query: 885 EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
            A  A    L ++KR   ++    E+   K+K K +A    K K A   K+K  +A +  
Sbjct: 133 AAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGT 192

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           ++  E+   K K K     K K     K+K ++      +E  K
Sbjct: 193 EEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAK 236


>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2.  This family
           contains the bacterial histone H1-like nucleoprotein HC2
           (approximately 200 residues long), which seems to be
           found mostly in Chlamydia. HC2 functions in DNA
           condensation, although it has been suggested that it
           also has other roles.
          Length = 187

 Score = 35.5 bits (81), Expect = 0.075
 Identities = 30/90 (33%), Positives = 35/90 (38%), Gaps = 2/90 (2%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           KK   RK         KK   KK A+K   KK   K    KK   K  K  A+K+  KK 
Sbjct: 41  KKPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAK--KATAKKVAAKKV 98

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
             KK   KK   KK   K+A   K    + 
Sbjct: 99  VAKKTVAKKAAAKKPAAKKAVAKKAVARKP 128



 Score = 35.1 bits (80), Expect = 0.093
 Identities = 34/98 (34%), Positives = 45/98 (45%), Gaps = 5/98 (5%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM----KKKGAEKI 951
           +KKR+ +K       +RK   KK  A+K   +K A K    +K   K     KK  A+K 
Sbjct: 5   QKKRSSKKTAAKKAAVRKPAAKKAAAKKTVVRKVAAKKPAARKTVAKKTVAAKKPAAKKA 64

Query: 952 GKKK-KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK   KK   KK   KK   KK  A+K+  K   A+K
Sbjct: 65  AKKAVAKKVVAKKPVAKKAVAKKATAKKVAAKKVVAKK 102



 Score = 32.8 bits (74), Expect = 0.55
 Identities = 34/87 (39%), Positives = 38/87 (43%), Gaps = 7/87 (8%)

Query: 907 GAEKIRKKKK---KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK--KKKKK 961
           GA+K R  KK   KK    K   KK A K    +K A   KK  A K   KK    KK  
Sbjct: 3   GAQKKRSSKKTAAKKAAVRKPAAKKAAAKKTVVRKVA--AKKPAARKTVAKKTVAAKKPA 60

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK  KK   KK  A+K   K   A+K
Sbjct: 61  AKKAAKKAVAKKVVAKKPVAKKAVAKK 87



 Score = 32.4 bits (73), Expect = 0.66
 Identities = 35/92 (38%), Positives = 43/92 (46%), Gaps = 7/92 (7%)

Query: 904 KKKGAEKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKM--KKKGAEKIGKKKKKK 958
           +K  A+K   +K   KK   A+K   KK A+K   KK  A+K   KK  A+K   KK   
Sbjct: 36  RKVAAKKPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAKKATAKKVAA 95

Query: 959 KK--KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK   KK   KK   KK  A+K   K   A K
Sbjct: 96  KKVVAKKTVAKKAAAKKPAAKKAVAKKAVARK 127



 Score = 32.1 bits (72), Expect = 0.83
 Identities = 28/92 (30%), Positives = 33/92 (35%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           +K   +K       +RK   KK  A K   KK     K   K A K          K   
Sbjct: 21  RKPAAKKAAAKKTVVRKVAAKKPAARKTVAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVA 80

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK   KK   KK   KK  A+K   K   A+K
Sbjct: 81  KKAVAKKATAKKVAAKKVVAKKTVAKKAAAKK 112



 Score = 31.7 bits (71), Expect = 1.2
 Identities = 32/83 (38%), Positives = 40/83 (48%), Gaps = 4/83 (4%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMR--KKKGAEKMKKKKKDAEKM--KKKGAEKIGK 953
           K+T   KK  A+K  KK   KK   K    KK  A+K   KK  A+K+  KK  A+K   
Sbjct: 51  KKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAKKATAKKVAAKKVVAKKTVAKKAAA 110

Query: 954 KKKKKKKKKKKKKKKKKKKKKEA 976
           KK   KK   KK   +K   K+A
Sbjct: 111 KKPAAKKAVAKKAVARKPAAKKA 133


>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
           TRaffic INducer (NOSTRIN).  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Nitric Oxide Synthase TRaffic
           INducer (NOSTRIN) is expressed in endothelial and
           epithelial cells and is involved in the regulation,
           trafficking and targeting of endothelial NOS (eNOS).
           NOSTRIN facilitates the endocytosis of eNOS by
           coordinating the functions of dynamin and the
           Wiskott-Aldrich syndrome protein (WASP). Increased
           expression of NOSTRIN may be correlated to preeclampsia.
           NOSTRIN contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           The F-BAR domain of NOSTRIN is necessary and sufficient
           for its membrane association and is responsible for its
           subcellular localization.
          Length = 239

 Score = 35.8 bits (83), Expect = 0.080
 Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 3/94 (3%)

Query: 898 KRTRRKKKKGAEKIRKKKKK---KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           K  +      +E+I+ KKK     +E EK++ +    K    K+      KK AE   K+
Sbjct: 113 KAAKLLTDWRSEQIKVKKKLHGLARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKE 172

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K + K+KK ++ + K + E      + E    
Sbjct: 173 DEKLEAKRKKGEESRLKAENEYYTCCVRLERLRL 206



 Score = 33.1 bits (76), Expect = 0.59
 Identities = 20/85 (23%), Positives = 43/85 (50%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
           + KKK     R+ +K + + E  ++    +KM  K K + +++ K  EK+  K+KK ++ 
Sbjct: 127 KVKKKLHGLARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKEDEKLEAKRKKGEES 186

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEA 986
           + K + +        E+++ + E A
Sbjct: 187 RLKAENEYYTCCVRLERLRLEWESA 211


>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain.  This region has been
           identified as the binding site of the SRP72 protein to
           SRP RNA.
          Length = 57

 Score = 32.4 bits (74), Expect = 0.082
 Identities = 15/29 (51%), Positives = 20/29 (68%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           A+K KK   +K G ++KKKK  KKK+K K
Sbjct: 2   AKKPKKIAKKKPGTEQKKKKSAKKKRKPK 30



 Score = 32.0 bits (73), Expect = 0.12
 Identities = 15/28 (53%), Positives = 19/28 (67%)

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           K  +KI KKK   ++KKKK  KKK+K K
Sbjct: 3   KKPKKIAKKKPGTEQKKKKSAKKKRKPK 30



 Score = 28.6 bits (64), Expect = 1.9
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
            A+K  K  KKK   ++KKKK  KKK+K
Sbjct: 1   EAKKPKKIAKKKPGTEQKKKKSAKKKRK 28



 Score = 27.8 bits (62), Expect = 4.5
 Identities = 13/28 (46%), Positives = 16/28 (57%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           KK  +  KKK   +  KKK  KKK+K K
Sbjct: 3   KKPKKIAKKKPGTEQKKKKSAKKKRKPK 30


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 34.3 bits (79), Expect = 0.090
 Identities = 27/107 (25%), Positives = 48/107 (44%), Gaps = 17/107 (15%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD---AEKMKKKGAE 949
           IL R K+  + KKK   +    K +KK   ++R +K     K + K     +  K++   
Sbjct: 23  ILSRSKKLLKAKKKLKSE----KLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKERRLR 78

Query: 950 KIGKK----------KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K+ +K            +KK +K  K+  KK + K  E +K+ ++E 
Sbjct: 79  KVAQKGVVKLFNAVRAAQKKTEKAVKEAGKKARVKREELLKEVSKEK 125



 Score = 31.9 bits (73), Expect = 0.49
 Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 4/53 (7%)

Query: 932 KMKKKKKDAEKMKK----KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
           K+   K  A K K     +  + +  KKK K +K +KK K++ + +K     K
Sbjct: 8   KILASKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60



 Score = 31.2 bits (71), Expect = 0.98
 Identities = 13/53 (24%), Positives = 23/53 (43%), Gaps = 2/53 (3%)

Query: 913 KKKKKKKEAEKM--RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
            K    K  + +  R KK  +  KK K +  + K K   +  K++  +K + K
Sbjct: 12  SKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVK 64



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 15/53 (28%), Positives = 24/53 (45%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           A+   K         K+    + + KK  K KKK K +K +KK K   + +K+
Sbjct: 3   ADAFAKILASKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKR 55


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 34.6 bits (79), Expect = 0.090
 Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           M++    E + K++K++ K + +        K  KKKK+K  +KK K KKKE EK   K 
Sbjct: 1   MKETSYEEALAKQRKESGKTQSQ--------KSDKKKKEKVSEKKGKSKKKE-EKPNGKI 51

Query: 984 EEAE 987
            E E
Sbjct: 52  PEHE 55



 Score = 31.9 bits (72), Expect = 0.89
 Identities = 17/46 (36%), Positives = 28/46 (60%)

Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           MK+   E+   K++K+  K + +K  KKKK+K +EK  K  ++ EK
Sbjct: 1   MKETSYEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46



 Score = 31.1 bits (70), Expect = 1.2
 Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 10/49 (20%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           K++KE+ K + +K      KKKK+      K +EK GK KKK++K   K
Sbjct: 12  KQRKESGKTQSQKS----DKKKKE------KVSEKKGKSKKKEEKPNGK 50



 Score = 28.8 bits (64), Expect = 9.6
 Identities = 15/37 (40%), Positives = 24/37 (64%), Gaps = 4/37 (10%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
           K++K + K + +K  KK+ EK+ +KKG    K KKK+
Sbjct: 12  KQRKESGKTQSQKSDKKKKEKVSEKKG----KSKKKE 44


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 34.3 bits (79), Expect = 0.091
 Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 3/92 (3%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA-EKMKKKGAEKIGKKK 955
            K  + +KK+     RK+ +K K    + +   AE+ K++++ A E+  +K  EK G + 
Sbjct: 27  IKAAKEEKKQAKLSERKENRKPKYIGSLLEA--AERRKREREIAEERKLQKEREKEGDEF 84

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             K+K      KK+ ++ ++ E+ +K+ EE E
Sbjct: 85  ADKEKFVTSAYKKQLEENRKLEEEEKEREELE 116



 Score = 33.2 bits (76), Expect = 0.19
 Identities = 15/88 (17%), Positives = 42/88 (47%), Gaps = 14/88 (15%)

Query: 912 RKKKKKKKEAEK--------------MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           +K +K+ ++A +              ++  K  +K  K  +  E  K K    + +  ++
Sbjct: 1   KKARKEAQKALEEDPSIYDYDEVYDSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAER 60

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           +K++++  +++K +K++E E  +   +E
Sbjct: 61  RKREREIAEERKLQKEREKEGDEFADKE 88



 Score = 29.7 bits (67), Expect = 3.5
 Identities = 12/56 (21%), Positives = 27/56 (48%), Gaps = 9/56 (16%)

Query: 941 EKMKKKGAEKIGKKKKKKKKKKK-------KKKKKKKKKKKEA--EKMKKKTEEAE 987
           + +K    EK   K  ++K+ +K        +  +++K+++E   E+  +K  E E
Sbjct: 25  DSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKE 80



 Score = 29.3 bits (66), Expect = 4.3
 Identities = 15/64 (23%), Positives = 32/64 (50%), Gaps = 10/64 (15%)

Query: 934 KKKKKDAEKMKKKGA---------EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           KK +K+A+K  ++           + I K  K++KK+ K  ++K+ +K K    + +  E
Sbjct: 1   KKARKEAQKALEEDPSIYDYDEVYDSI-KAAKEEKKQAKLSERKENRKPKYIGSLLEAAE 59

Query: 985 EAEK 988
             ++
Sbjct: 60  RRKR 63


>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein.  This family consists of several
           eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
           expression of functional F1-ATPase requires two proteins
           encoded by the ATP11 and ATP12 genes. Atp11p is a
           molecular chaperone of the mitochondrial matrix that
           participates in the biogenesis pathway to form F1, the
           catalytic unit of the ATP synthase.
          Length = 250

 Score = 35.4 bits (82), Expect = 0.096
 Identities = 27/91 (29%), Positives = 37/91 (40%), Gaps = 8/91 (8%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK--KKKKKKK 963
           K   K+ +K K+       R KK  +   +KK+ + K    G  K   K K  K  K   
Sbjct: 3   KYRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLKPPKPAD 62

Query: 964 KKKKKKKKKKK-----EAEKMKKKT-EEAEK 988
           KKK   K  K      + EK+K  + EE E 
Sbjct: 63  KKKPFDKPFKPLSSILDVEKIKDLSAEEIEF 93


>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
           module that binds membranes and detects membrane
           curvature.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions including organelle
           biogenesis, membrane trafficking or remodeling, and cell
           division and migration. Mutations in BAR containing
           proteins have been linked to diseases and their
           inactivation in cells leads to altered membrane
           dynamics. A BAR domain with an additional N-terminal
           amphipathic helix (an N-BAR) can drive membrane
           curvature. These N-BAR domains are found in amphiphysins
           and endophilins, among others. BAR domains are also
           frequently found alongside domains that determine lipid
           specificity, such as the Pleckstrin Homology (PH) and
           Phox Homology (PX) domains which are present in beta
           centaurins (ACAPs and ASAPs) and sorting nexins,
           respectively. A FES-CIP4 Homology (FCH) domain together
           with a coiled coil region is called the F-BAR domain and
           is present in Pombe/Cdc15 homology (PCH) family
           proteins, which include Fes/Fes tyrosine kinases, PACSIN
           or syndapin, CIP4-like proteins, and srGAPs, among
           others. The Inverse (I)-BAR or IRSp53/MIM homology
           Domain (IMD) is found in multi-domain proteins, such as
           IRSp53 and MIM, that act as scaffolding proteins and
           transducers of a variety of signaling pathways that link
           membrane dynamics and the underlying actin cytoskeleton.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The I-BAR domain
           induces membrane protrusions in the opposite direction
           compared to classical BAR and F-BAR domains, which
           produce membrane invaginations. BAR domains that also
           serve as protein interaction domains include those of
           arfaptin and OPHN1-like proteins, among others, which
           bind to Rac and Rho GAP domains, respectively.
          Length = 194

 Score = 35.1 bits (81), Expect = 0.10
 Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 7/73 (9%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            KK  KE +K RKK     + K + D +  ++K   K  +KKKK   K  + +++ ++ K
Sbjct: 89  LKKDLKEIKKRRKK-----LDKARLDYDAAREK--LKKLRKKKKDSSKLAEAEEELQEAK 141

Query: 974 KEAEKMKKKTEEA 986
           ++ E+++++  E 
Sbjct: 142 EKYEELREELIED 154



 Score = 30.5 bits (69), Expect = 2.7
 Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK  K+ +K +KK  +K        ++K KK +KKKK   K AE  ++  E  EK
Sbjct: 90  KKDLKEIKKRRKK-LDKARLDYDAAREKLKKLRKKKKDSSKLAEAEEELQEAKEK 143


>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
          Length = 862

 Score = 36.2 bits (84), Expect = 0.10
 Identities = 20/89 (22%), Positives = 28/89 (31%), Gaps = 20/89 (22%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA-----EKMKKKKKDAEKMKKKGAE 949
           F+  RT R  K               A  +++  G      E  + K   A+K   K   
Sbjct: 788 FKSSRTGRNFK---------------AYLVKQPDGKIGFEFEPREPKAAAAKKTAAKATA 832

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
               K +K   KK   KK   KK    + 
Sbjct: 833 AAATKAEKAAAKKAPAKKTAAKKTAARKT 861


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 36.1 bits (83), Expect = 0.10
 Identities = 23/73 (31%), Positives = 30/73 (41%), Gaps = 3/73 (4%)

Query: 117 SATQAFLASVLQELKTCA-PIPDATHVVVK-VPVEGAPTPATPDTEAP-TPAPGTESGPS 173
           S T   LA  ++ L T A   P    +    VPV  +   +TP   AP  P     S PS
Sbjct: 587 SDTAEALAGAIETLLTQASAQPAGLSLPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPS 646

Query: 174 STTPGPETKTLES 186
             T  P+ K + S
Sbjct: 647 LETSLPQQKPVLS 659


>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
           [Translation, ribosomal structure and biogenesis].
          Length = 335

 Score = 35.7 bits (83), Expect = 0.10
 Identities = 18/89 (20%), Positives = 36/89 (40%), Gaps = 9/89 (10%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRK----KKG--AEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
            KKG   I       K  E+++     KKG   + +KK  K +   ++K   ++G    +
Sbjct: 10  AKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERK---EVGALINE 66

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            KK+ +    +   + + A   ++   E 
Sbjct: 67  LKKEVEDAITELTPELEAAGLWERLAFEK 95



 Score = 33.0 bits (76), Expect = 0.74
 Identities = 14/65 (21%), Positives = 26/65 (40%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
           + +   KKG      KK  K    + RK+ GA   + KK+  + + +   E       ++
Sbjct: 31  KVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINELKKEVEDAITELTPELEAAGLWER 90

Query: 959 KKKKK 963
              +K
Sbjct: 91  LAFEK 95


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 34.0 bits (78), Expect = 0.11
 Identities = 22/98 (22%), Positives = 49/98 (50%), Gaps = 19/98 (19%)

Query: 897 KKRTRR----------KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           KKR RR          K  +   ++  K+K++ E E    +K  E+ K+++   E+++K+
Sbjct: 6   KKRNRRMFGLLLGTLQKFSQEESRLTSKEKRRAEIE----QKLEEQEKQER---EELRKE 58

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEA--EKMKKK 982
             E   ++++K+ + +K ++K + +K +E   E     
Sbjct: 59  KRELFEERRRKQLELRKLEQKMEDEKLQETWHEHNLAL 96



 Score = 30.9 bits (70), Expect = 1.4
 Identities = 18/94 (19%), Positives = 45/94 (47%), Gaps = 2/94 (2%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK--KGAEKIGKK 954
            + T ++K++   + + ++++K+E E++RK+K     ++++K  E  K   K  ++  ++
Sbjct: 28  SRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDEKLQE 87

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              +         + K K     +  + T E EK
Sbjct: 88  TWHEHNLALANFIRTKTKPHLYYRPWRLTPETEK 121


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 34.1 bits (79), Expect = 0.11
 Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 6/71 (8%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
             +  + + E+       EK+ KK K+ E +KKK  +  GK KK   KK +K      K 
Sbjct: 55  HARIDRLDEERYDI---EEKVAKKDKEIEDLKKKVNDLRGKFKKPTLKKVRKSADAMLKA 111

Query: 973 ---KKEAEKMK 980
               K    M 
Sbjct: 112 LLGSKHKVSMD 122


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 35.7 bits (83), Expect = 0.11
 Identities = 15/58 (25%), Positives = 32/58 (55%)

Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
              +  EK++K +KD  K+++K  +    KKK K  + +++    +K+  EA+++  K
Sbjct: 243 YLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAK 300


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 36.2 bits (84), Expect = 0.11
 Identities = 20/97 (20%), Positives = 39/97 (40%), Gaps = 13/97 (13%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG--------- 947
             + R KK   AEK+ +     K+ E+  K+   E+   +++ A                
Sbjct: 864 INQYRAKKLDTAEKLEELYILAKK-EEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENG 922

Query: 948 ---AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
               EK+ K+K  ++K K   K K++ + K     ++
Sbjct: 923 TELVEKLAKEKSLREKNKDDWKAKEEVEAKLKAFFRR 959


>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962).  This
           eukaryotic family of proteins has no known function.
          Length = 155

 Score = 34.2 bits (79), Expect = 0.12
 Identities = 15/73 (20%), Positives = 30/73 (41%), Gaps = 23/73 (31%)

Query: 910 KIRKKKKKKKEA--EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           K+ K   KKK+    K RK                     A+++ +   +  K +KKK +
Sbjct: 1   KVTKNIAKKKKVLHPKSRK---------------------AKQLNRATLRDDKLEKKKAE 39

Query: 968 KKKKKKKEAEKMK 980
           + +K+  E +++ 
Sbjct: 40  RAEKRSLELDRLF 52


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 34.8 bits (80), Expect = 0.13
 Identities = 17/33 (51%), Positives = 21/33 (63%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            KKKK   +KKK  KK KKKKKE E++ +   E
Sbjct: 51  SKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83



 Score = 34.0 bits (78), Expect = 0.23
 Identities = 23/68 (33%), Positives = 38/68 (55%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           +I  KK  KK+    RKKK  +K KKKKK+ E++ +  AE++   ++ ++  KK   +  
Sbjct: 44  EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELP 103

Query: 970 KKKKKEAE 977
           K +   AE
Sbjct: 104 KVQNTAAE 111



 Score = 34.0 bits (78), Expect = 0.24
 Identities = 17/35 (48%), Positives = 20/35 (57%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K  KKKK   +KKK  KK KKK+ EK +     AE
Sbjct: 49  KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAE 83



 Score = 33.6 bits (77), Expect = 0.26
 Identities = 16/39 (41%), Positives = 22/39 (56%)

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +I  KK  KKKK   +KKK  KK K+ +K K++  E   
Sbjct: 44  EIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAA 82



 Score = 33.6 bits (77), Expect = 0.31
 Identities = 18/39 (46%), Positives = 22/39 (56%)

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K  KKKK   +KKK  KK KKKKK++ E  +   EE   
Sbjct: 49  KTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSD 87



 Score = 32.1 bits (73), Expect = 1.0
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           D E   KK ++K  K   +KKK  KK KKKKK+K++  E   ++  ++E+
Sbjct: 42  DVEIPSKKTSKK-KKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEE 90


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 35.7 bits (82), Expect = 0.13
 Identities = 20/85 (23%), Positives = 46/85 (54%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           ++K K+  + + K+ K + ++    +K+ AE+ K+KK++ E   +  A+    +++  + 
Sbjct: 471 KQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAES 530

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEE 985
            K+ K+  + + KK    +K K EE
Sbjct: 531 LKQAKQDLEMEIKKLEHDLKLKEEE 555



 Score = 34.9 bits (80), Expect = 0.26
 Identities = 22/95 (23%), Positives = 47/95 (49%), Gaps = 3/95 (3%)

Query: 896 RKKRTRRKKKKGAEKI---RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           ++ ++    +  AEK     KK+KK++E    R    A   +++  ++ K  K+  E   
Sbjct: 483 KRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEI 542

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           KK +   K K+++ +  +K+ +E  K ++  +E E
Sbjct: 543 KKLEHDLKLKEEECRMLEKEAQELRKYQESEKETE 577


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 35.7 bits (83), Expect = 0.14
 Identities = 16/44 (36%), Positives = 28/44 (63%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           E MK K+     ++KK   KI    ++K KK++KK+K K+++K+
Sbjct: 635 ELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 34.6 bits (80), Expect = 0.30
 Identities = 15/78 (19%), Positives = 32/78 (41%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
             KK K+  E E+  ++   E     +    K +++G+ +       ++K+   K +   
Sbjct: 222 FTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281

Query: 971 KKKKEAEKMKKKTEEAEK 988
           K +   EK  ++    EK
Sbjct: 282 KLEILKEKKDEELFWFEK 299



 Score = 31.5 bits (72), Expect = 2.8
 Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 2/78 (2%)

Query: 902 RKKKKGAEKIRKKKK-KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
            KK K   +  ++++    E E   + KG ++ ++   + E       EK    K +   
Sbjct: 223 TKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTE-EDPSLFSEEKEDPDKTEDLD 281

Query: 961 KKKKKKKKKKKKKKEAEK 978
           K +  K+KK ++    EK
Sbjct: 282 KLEILKEKKDEELFWFEK 299



 Score = 31.1 bits (71), Expect = 3.5
 Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 7/65 (10%)

Query: 931 EKMKKKK----KDAEKMKKKGAEK--IGKKKK-KKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           ++++K        +E MK K   +  I KKK+ K     ++K KK++KK+K   + K++ 
Sbjct: 621 DRIEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680

Query: 984 EEAEK 988
           E AE 
Sbjct: 681 EIAET 685


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 35.6 bits (82), Expect = 0.14
 Identities = 18/83 (21%), Positives = 34/83 (40%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           + KK    I+ +KK    ++ +  KK     +  K           +    KKK  +  K
Sbjct: 369 ENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNK 428

Query: 963 KKKKKKKKKKKKEAEKMKKKTEE 985
           ++  KK++  +KE  K  +K  +
Sbjct: 429 EQDIKKEELLEKEFVKKSEKIPK 451


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 34.6 bits (80), Expect = 0.14
 Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 15/101 (14%)

Query: 903 KKKKGAEKIRKK-----KKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +++K  E+ +KK     KK+++  +++ +  K   E +KK    A     K  + + K+K
Sbjct: 30  EQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLKKEK 89

Query: 956 KKKKKKKKKKKKKK--------KKKKKEAEKMKKKTEEAEK 988
           +  +K +++ K  K        KK KK+A+++ K  +E  K
Sbjct: 90  ESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYK 130



 Score = 32.3 bits (74), Expect = 0.94
 Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 7/84 (8%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           KK  EKI + KK KK+A+++      + MK++ K  +K+ K   + +  +K+  +  K K
Sbjct: 103 KKYIEKI-EDKKLKKKAKQL-----VKVMKERYKSYDKLYKAYKKALNLEKELYEYLKDK 156

Query: 965 KKKKKKKKKKEAEKMKKKTEEAEK 988
               K+  +K  + + +  E+ +K
Sbjct: 157 DLTLKELDEK-IKAVNQSYEKIQK 179



 Score = 31.5 bits (72), Expect = 1.6
 Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 5/93 (5%)

Query: 901 RRKKKKGAEK--IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK-MKKKGA--EKIGKKK 955
           +R+K    EK  I K +++ K A+K  +K   +K+KKK K   K MK++    +K+ K  
Sbjct: 80  KREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYKSYDKLYKAY 139

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK    +K+  +  K K    +++ +K +   +
Sbjct: 140 KKALNLEKELYEYLKDKDLTLKELDEKIKAVNQ 172



 Score = 30.4 bits (69), Expect = 3.3
 Identities = 24/83 (28%), Positives = 40/83 (48%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
             +K    A     K++K  + EK   +K  E+ K  KK  EK++ K  +K  K+  K  
Sbjct: 66  AIKKLSDQALANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVM 125

Query: 960 KKKKKKKKKKKKKKKEAEKMKKK 982
           K++ K   K  K  K+A  ++K+
Sbjct: 126 KERYKSYDKLYKAYKKALNLEKE 148



 Score = 30.0 bits (68), Expect = 4.4
 Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 13/93 (13%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI---GKKKKKKKK 960
            KK  E I ++ +K  E EK       E+ +KK ++ EK +++  ++I   GK   +  K
Sbjct: 14  GKKPEESIYEQLEKAAEQEKP-----FEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIK 68

Query: 961 KKKKK-----KKKKKKKKKEAEKMKKKTEEAEK 988
           K   +      K++K  KKE E ++K  EE + 
Sbjct: 69  KLSDQALANVDKREKLLKKEKESIEKSEEEFKS 101



 Score = 29.2 bits (66), Expect = 7.1
 Identities = 18/71 (25%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA--EKIGKKKKKKKK 960
           K KK A+++ K  K++ ++     K   + +  +K+  E +K K    +++ +K K   +
Sbjct: 113 KLKKKAKQLVKVMKERYKSYDKLYKAYKKALNLEKELYEYLKDKDLTLKELDEKIKAVNQ 172

Query: 961 KKKKKKKKKKK 971
             +K +K KKK
Sbjct: 173 SYEKIQKAKKK 183


>gnl|CDD|216134 pfam00816, Histone_HNS, H-NS histone family. 
          Length = 92

 Score = 32.6 bits (75), Expect = 0.14
 Identities = 15/64 (23%), Positives = 26/64 (40%), Gaps = 8/64 (12%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG--AEKIGKKKKKKKKKKKKKK 966
             I K +K+ + A K  +     +++      E   + G   E++     KKK KKK+  
Sbjct: 5   ALIEKLEKEIEAARKAERAAALAEIR------ELAAEYGLTLEELLGGAAKKKAKKKRAP 58

Query: 967 KKKK 970
              K
Sbjct: 59  VPAK 62


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 34.0 bits (78), Expect = 0.15
 Identities = 13/55 (23%), Positives = 29/55 (52%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
                + E+ +++  ++  ++   + K + K+KKK++  K + EK K KTE  + 
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146



 Score = 34.0 bits (78), Expect = 0.17
 Identities = 13/50 (26%), Positives = 25/50 (50%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             E+   +  ++   + K + K+KKK++  K K +KE  K + K  +  K
Sbjct: 101 TEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150



 Score = 32.4 bits (74), Expect = 0.58
 Identities = 16/71 (22%), Positives = 35/71 (49%), Gaps = 4/71 (5%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           KI ++     + E+  +++  ++ +++     K + K      KK++  K K +K+K K 
Sbjct: 86  KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEK----KKREVPKPKTEKEKPKT 141

Query: 970 KKKKKEAEKMK 980
           + KK +  K K
Sbjct: 142 EPKKPKPSKPK 152


>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
           All proteins in this family for whcih functions are
           known are G-T or G-U mismatch glycosylases that function
           in base excision repair. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University). Used 2pf model [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 328

 Score = 35.0 bits (80), Expect = 0.15
 Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 5/87 (5%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---MKKKGAEKIGKKKK 956
             +++  G       ++  +EA K RK+K       +  + +K    KK G     K+K+
Sbjct: 35  EEQEETSGVPAKAPTQEPSEEAPKFRKRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQ 94

Query: 957 KK--KKKKKKKKKKKKKKKKEAEKMKK 981
           +K   K K K+K  +     EAE + K
Sbjct: 95  EKITDKFKVKRKVDRFNGVSEAELLTK 121


>gnl|CDD|212010 cd11620, HR1_PKC-like_2_fungi, Second Protein kinase C-related
           kinase homology region 1 (HR1) Rho-binding domain of
           fungal Protein Kinase C-like proteins.  This subfamily
           is composed of fungal PKC-like proteins including Pkc1p
           from Saccharomyces cerevisiae, and Pck1p and Pck2p from
           Schizosaccharomyces pombe. The yeast PKC-like proteins
           play a critical role in regulating cell wall
           biosynthesis and maintaining cell wall integrity. They
           contain two HR1 domains, C2 and C1 domains, and a kinase
           domain. This model characterizes the second HR1 domain.
           HR1 domains are anti-parallel coiled-coil (ACC) domains
           that bind small GTPases from the Rho family. The HR1
           domains of Pck1p and Pck2p interact with GTP-bound Rho1p
           and Rho2p.
          Length = 72

 Score = 31.9 bits (73), Expect = 0.16
 Identities = 17/54 (31%), Positives = 29/54 (53%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
           K++ +K+ K+  EKM +   AE  K+   DAE  + +  +KI   KK  K+ + 
Sbjct: 17  KLQVEKQYKEGIEKMARLYQAEGDKRSIADAENKRVESEQKIQLLKKALKRYQD 70


>gnl|CDD|225468 COG2916, Hns, DNA-binding protein H-NS [General function prediction
           only].
          Length = 128

 Score = 33.6 bits (77), Expect = 0.16
 Identities = 15/69 (21%), Positives = 31/69 (44%), Gaps = 2/69 (2%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG--AEKIGKKKKKKKKKKKKK 965
            EK  +  ++++E E     +  E+ +K     E + K G   E++   +  K ++K K+
Sbjct: 24  LEKEEQVVQERQEEEAAAIAEIEERQEKYGTIRELLIKDGITPEELLNTEVTKPRRKAKR 83

Query: 966 KKKKKKKKK 974
             +  K K 
Sbjct: 84  AARPAKYKY 92


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 35.4 bits (82), Expect = 0.16
 Identities = 20/103 (19%), Positives = 34/103 (33%), Gaps = 24/103 (23%)

Query: 903 KKKKGAEKIRKKKKKKKEAEK--MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK- 959
              K  +KI+K     KE E   ++ ++  EK+  K +  EK+  K  + I +   + K 
Sbjct: 474 AVLKLLQKIKKNNDFIKEFETDALKLQEILEKVDSKSEKLEKISAK-IDNIKELFDESKM 532

Query: 960 --------------------KKKKKKKKKKKKKKKEAEKMKKK 982
                                 K   K K+    K   K   +
Sbjct: 533 SILQEILQPALHHIELNIARIYKLNIKDKEDFLNKLYIKAHLE 575



 Score = 35.0 bits (81), Expect = 0.23
 Identities = 18/86 (20%), Positives = 35/86 (40%), Gaps = 6/86 (6%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           KG  ++  K+  +K  +K + K   + +       EK   K  +KI K     K+ +   
Sbjct: 437 KGTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDA 496

Query: 966 KKKKK------KKKKEAEKMKKKTEE 985
            K ++       K ++ EK+  K + 
Sbjct: 497 LKLQEILEKVDSKSEKLEKISAKIDN 522



 Score = 35.0 bits (81), Expect = 0.24
 Identities = 26/114 (22%), Positives = 43/114 (37%), Gaps = 21/114 (18%)

Query: 888 TALGFILFRKKRTRRKKKKGAEKIRKKKKKK-----------KEAEKMRKKKGAEKM--K 934
           T   + LFR+K          E I   K K+           K   ++  K+  EK+  K
Sbjct: 403 TDSVWNLFREKFE--------EDIANYKDKEFYNCTEGGARIKGTIELPFKEVCEKLLDK 454

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K K   K+         K   K  +K KK     K+ + +A K+++  E+ + 
Sbjct: 455 DKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEILEKVDS 508



 Score = 31.2 bits (71), Expect = 3.0
 Identities = 15/68 (22%), Positives = 33/68 (48%), Gaps = 4/68 (5%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK- 971
           ++K   K  +K++K    + +K+ + DA K  ++  EK+  K +K +K   K    K+  
Sbjct: 471 QEKAVLKLLQKIKKNN--DFIKEFETDALK-LQEILEKVDSKSEKLEKISAKIDNIKELF 527

Query: 972 KKKEAEKM 979
            + +   +
Sbjct: 528 DESKMSIL 535



 Score = 30.8 bits (70), Expect = 4.5
 Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 15/85 (17%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           K+  EK+  K K K   + +        +       + + K        +K KK     K
Sbjct: 445 KEVCEKLLDKDKNKPFIKLI--------LLSNNSQEKAVLKL------LQKIKKNNDFIK 490

Query: 965 KKKKKKKKKKEA-EKMKKKTEEAEK 988
           + +    K +E  EK+  K+E+ EK
Sbjct: 491 EFETDALKLQEILEKVDSKSEKLEK 515


>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family.  This protein
           is plasmid encoded and found to be essential for plasmid
           replication.
          Length = 279

 Score = 35.0 bits (81), Expect = 0.16
 Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)

Query: 886 AVTALGFILF----RKKRTRRKKKKGAEKIRKKKKKKK-----EAEKMRKKKGAEK--MK 934
            +T L F+L      K  + RK++   E  + KKK        EA +  K+   E+    
Sbjct: 152 TLTPLFFMLLGISEEKLNSARKQQLEWENKKLKKKGLIPLTLDEARRRAKEFHIERAFSY 211

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           + ++ A   K++ A K+ K  +K  +KK      K+  K E  KM  K
Sbjct: 212 RTERKAFGKKRRRARKLAKLDEKDIRKKILNALVKEYSKGELTKMGLK 259


>gnl|CDD|222331 pfam13707, RloB, RloB-like protein.  This family includes the RloB
           protein that is found within a bacterial restriction
           modification operon. This family includes the AbiLii
           protein that is found as part of a plasmid encoded phage
           abortive infection mechanism. Deletion within abiLii
           abolished the phage resistance. The family includes some
           proteins annotated as CRISPR Csm2 proteins.
          Length = 176

 Score = 34.3 bits (79), Expect = 0.16
 Identities = 12/58 (20%), Positives = 24/58 (41%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
           E       +   A+K+  KKG    KK K   +++ ++    I   K+ +K  ++   
Sbjct: 104 EDTTAALSRDAYAKKLLLKKGLPDYKKGKNLFKELLERLDTAIQNAKRLRKDNEEVNP 161



 Score = 29.2 bits (66), Expect = 7.3
 Identities = 23/131 (17%), Positives = 38/131 (29%), Gaps = 36/131 (27%)

Query: 892 FILFRKKRTRRKKKKGAEKIRKKKKKK--KEAEKMRKKKGAE------------KMKKKK 937
           F   +        K           +   +EA+K+RKK   E              KKK 
Sbjct: 15  FEHLKDLLRLSTLKIKVVPAGGGSPESLVEEAKKIRKKNANEYDEVWCVFDRDDFKKKKL 74

Query: 938 KDAEKMKKKGAEKIG----------------------KKKKKKKKKKKKKKKKKKKKKKE 975
            +A K  K+    +                       +    KK   KK     KK K  
Sbjct: 75  DEAIKKAKENGINLILSNPCFELWLLLHFEDTTAALSRDAYAKKLLLKKGLPDYKKGKNL 134

Query: 976 AEKMKKKTEEA 986
            +++ ++ + A
Sbjct: 135 FKELLERLDTA 145


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 35.7 bits (82), Expect = 0.16
 Identities = 18/89 (20%), Positives = 41/89 (46%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           + +R+ R           K K+K + +    KK  +  K+ K+++EK K + +    +  
Sbjct: 203 KMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRIL 262

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           +   K+ KK +K      + +  +K+K++
Sbjct: 263 RSYTKENKKDEKNVVSGIRSSVLLKRKSQ 291



 Score = 35.4 bits (81), Expect = 0.21
 Identities = 17/72 (23%), Positives = 37/72 (51%), Gaps = 6/72 (8%)

Query: 915 KKKKKEAEKMRKKKGA------EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           + K+   ++++K+         +KM++  ++   +K    +K  +K K+K K  KK +  
Sbjct: 180 RNKEISNKQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDG 239

Query: 969 KKKKKKEAEKMK 980
            K+ K+ +EK K
Sbjct: 240 DKQTKRNSEKSK 251



 Score = 31.9 bits (72), Expect = 1.9
 Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 5/69 (7%)

Query: 925 RKKKGAEKMKKKKKDA-----EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
            K+   +++KK+         +KM++        K     K K+K K+K K  KK+ +  
Sbjct: 181 NKEISNKQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGD 240

Query: 980 KKKTEEAEK 988
           K+    +EK
Sbjct: 241 KQTKRNSEK 249



 Score = 30.0 bits (67), Expect = 7.7
 Identities = 18/86 (20%), Positives = 35/86 (40%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           KK+T        +K+R+  +     +     K  EK K+K K+ +K +    +     +K
Sbjct: 190 KKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEK 249

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKK 982
            K +      +  +   KE +K +K 
Sbjct: 250 SKVQNSHFDVRILRSYTKENKKDEKN 275


>gnl|CDD|233421 TIGR01453, grpIintron_endo, group I intron endonuclease.  This
           model represents one subfamily of endonucleases
           containing the endo/excinuclease amino terminal domain,
           pfam01541 at its amino end. A distinct subfamily
           includes excinuclease abc subunit c (uvrC). Members of
           pfam01541 are often termed GIY-YIG endonucleases after
           conserved motifs near the amino end. This subfamily in
           This model is found in open reading frames of group I
           introns in both phage and mitochondria. The closely
           related endonucleases of phage T4: segA, segB, segC,
           segD and segE, score below the trusted cutoff for the
           family.
          Length = 214

 Score = 34.7 bits (80), Expect = 0.17
 Identities = 20/91 (21%), Positives = 31/91 (34%), Gaps = 15/91 (16%)

Query: 896 RKKRTRRKKKK----GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
             K +   K K     + K       K  +E+ + K     + K K              
Sbjct: 96  GYKHSEETKAKMSKLFSGKKNNPWYGKTHSEETKAK-----ISKNKLGE------NNPFF 144

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           GKK  ++ KKK  +K+K  KK    +K    
Sbjct: 145 GKKHSEETKKKISEKEKGAKKVNVYDKNNGL 175


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 33.1 bits (76), Expect = 0.17
 Identities = 14/54 (25%), Positives = 16/54 (29%), Gaps = 8/54 (14%)

Query: 134 APIPDATHVVVKVPVEG--------APTPATPDTEAPTPAPGTESGPSSTTPGP 179
            P+ D    VV  P              PA P  +   P PG    PS   P  
Sbjct: 41  PPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPA 94


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 33.9 bits (78), Expect = 0.17
 Identities = 19/97 (19%), Positives = 42/97 (43%), Gaps = 6/97 (6%)

Query: 898 KRTRRK--KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM----KKKGAEKI 951
            R  R    K    ++       K  +K+ K    E++  +     +M    + +  +++
Sbjct: 40  ARVERMEEIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKEL 99

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +     +   ++ KK + K KKE ++++KK +E EK
Sbjct: 100 EEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEK 136



 Score = 32.8 bits (75), Expect = 0.36
 Identities = 17/78 (21%), Positives = 41/78 (52%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           K+  + ++  ++++   E     +   +++ + K   +     +E + ++ KK + K KK
Sbjct: 63  KRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKLKK 122

Query: 965 KKKKKKKKKKEAEKMKKK 982
           ++K+ +KK  E EK +KK
Sbjct: 123 EQKELRKKLDELEKEEKK 140



 Score = 31.6 bits (72), Expect = 0.84
 Identities = 16/58 (27%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
            +   E +   KK+  E     +   E++KK   +K+ K++K+ +KK  + +K++KKK
Sbjct: 85  NEMLAELQDQLKKELEEANGDSEGLLEELKKH-RDKLKKEQKELRKKLDELEKEEKKK 141


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 34.5 bits (80), Expect = 0.17
 Identities = 15/61 (24%), Positives = 32/61 (52%), Gaps = 7/61 (11%)

Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
           + ++K+ GA     +K D   + +         + +K+KKK   K++KK  K+E +K+++
Sbjct: 70  KILKKEGGAIIKDLEKCDFTPIYEY-------FEAEKEKKKAMSKEEKKAIKEEKDKLEE 122

Query: 982 K 982
            
Sbjct: 123 P 123



 Score = 29.5 bits (67), Expect = 7.3
 Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 10/59 (16%)

Query: 895 FRKKRTRRKKKKGAE-------KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           FRK   +       +        I +  + +KE +K   K   EK K  K++ +K+++ 
Sbjct: 68  FRKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSK--EEK-KAIKEEKDKLEEP 123


>gnl|CDD|237034 PRK12278, PRK12278, 50S ribosomal protein L21/unknown domain fusion
           protein; Provisional.
          Length = 221

 Score = 34.4 bits (79), Expect = 0.18
 Identities = 10/48 (20%), Positives = 13/48 (27%)

Query: 929 GAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           GA+    K        +  AE    K KK+   K             A
Sbjct: 105 GADASGVKAATGAGKVEVAAEAAPAKAKKEAAPKAAPAPAAAAAPPAA 152


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 35.2 bits (81), Expect = 0.19
 Identities = 15/45 (33%), Positives = 19/45 (42%)

Query: 138 DATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETK 182
           +A  V V  P    PT A P    PTPAP T    ++    P  +
Sbjct: 357 EALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKE 401


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 34.0 bits (78), Expect = 0.19
 Identities = 11/78 (14%), Positives = 27/78 (34%), Gaps = 6/78 (7%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           L R++    K++  AE+  +            +        +K+ + +   +K      K
Sbjct: 65  LLRERE--LKEEAEAEEEGEVDASPDAGAVAGESSA----DRKEAEQQGAAQKRKSCRDK 118

Query: 954 KKKKKKKKKKKKKKKKKK 971
           ++K  K  +   +    K
Sbjct: 119 ERKSAKDPRGGTQDVVDK 136



 Score = 32.8 bits (75), Expect = 0.50
 Identities = 10/73 (13%), Positives = 32/73 (43%), Gaps = 4/73 (5%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
             + ++++ K+EAE     +   ++           +  A++   +++   +K+K  + K
Sbjct: 63  HALLRERELKEEAEA----EEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDK 118

Query: 969 KKKKKKEAEKMKK 981
           ++K  K+     +
Sbjct: 119 ERKSAKDPRGGTQ 131


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 34.9 bits (79), Expect = 0.20
 Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 3/79 (3%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           K  ++K+++ AE++  K+ K    E+ R K   ++++K++KD E+ +++ A      K+K
Sbjct: 146 KNEKKKERQLAEQLAAKRLKD---EQHRHKARKQELRKREKDRERARREDAAAAAAAKQK 202

Query: 958 KKKKKKKKKKKKKKKKKEA 976
              KK      KK  K  A
Sbjct: 203 AAAKKAAAPSGKKSAKAAA 221



 Score = 34.2 bits (77), Expect = 0.37
 Identities = 24/95 (25%), Positives = 53/95 (55%), Gaps = 1/95 (1%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           IL R  R  R++KK    I ++   K +    R K+   +   K K+ +K +++ AE++ 
Sbjct: 101 ILRRLLRKYREEKKIDRHIYRELYVKAKGNVFRNKRNLMEHIHKVKNEKKKERQLAEQLA 160

Query: 953 KKK-KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
            K+ K ++ + K +K++ +K++K+ E+ +++   A
Sbjct: 161 AKRLKDEQHRHKARKQELRKREKDRERARREDAAA 195


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 34.9 bits (81), Expect = 0.21
 Identities = 17/70 (24%), Positives = 38/70 (54%), Gaps = 4/70 (5%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-KKGA---EKMKKKKKDAEKMKKKGAEK 950
           + +++  R +++ A   +++K+  KE E +R+ K  A   +K K + K  EK++ + AE+
Sbjct: 238 YLEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEE 297

Query: 951 IGKKKKKKKK 960
              ++ K   
Sbjct: 298 RPVEEGKPLA 307



 Score = 30.7 bits (70), Expect = 4.5
 Identities = 14/73 (19%), Positives = 28/73 (38%), Gaps = 13/73 (17%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           ++K E  +       ++ K+  K+ E +             ++ K    K KK K + K 
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWI-------------RRGKAAASKAKKAKSRIKR 286

Query: 976 AEKMKKKTEEAEK 988
            EK++ +  E   
Sbjct: 287 LEKLEARLAEERP 299


>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
           This is a region of 120 amino acids that is conserved in
           a family of proteins found from plants to fungi. The
           function is not known.
          Length = 114

 Score = 33.0 bits (76), Expect = 0.21
 Identities = 30/97 (30%), Positives = 53/97 (54%), Gaps = 10/97 (10%)

Query: 897 KKRTRR-----KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK-KGAEK 950
           KK+ R      KKK     +R +K++  EA K   +  AE+ KK+KK A+K    +  E+
Sbjct: 3   KKQIRDIERLLKKKDLPADVRVEKERALEALKKELED-AERRKKRKKMAKKYHMVRFFER 61

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
              KK  ++ K+ +KK ++   +KE + +K++  +AE
Sbjct: 62  ---KKATRRLKQLEKKLEETSDEKEIKDLKEQLSKAE 95



 Score = 31.8 bits (73), Expect = 0.49
 Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 14/101 (13%)

Query: 867 TKRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK 926
            +RLL     D+  DVR E           K+R     KK  E   ++KK+KK A+K   
Sbjct: 9   IERLL--KKKDLPADVRVE-----------KERALEALKKELEDAERRKKRKKMAKKYHM 55

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            +  E+ KK  +  ++++KK  E   +K+ K  K++  K +
Sbjct: 56  VRFFER-KKATRRLKQLEKKLEETSDEKEIKDLKEQLSKAE 95



 Score = 31.4 bits (72), Expect = 0.57
 Identities = 23/81 (28%), Positives = 47/81 (58%), Gaps = 7/81 (8%)

Query: 915 KKKKKEAEKMRKKKG--AEKMKKKKKDAEKMKKKGAEKIGKKKKKK--KKKKKKK---KK 967
           KK+ ++ E++ KKK   A+   +K++  E +KK+  +   +KK+KK  KK    +   +K
Sbjct: 3   KKQIRDIERLLKKKDLPADVRVEKERALEALKKELEDAERRKKRKKMAKKYHMVRFFERK 62

Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
           K  ++ K+ EK  ++T + ++
Sbjct: 63  KATRRLKQLEKKLEETSDEKE 83


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 34.9 bits (81), Expect = 0.22
 Identities = 17/87 (19%), Positives = 46/87 (52%), Gaps = 13/87 (14%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           E   + K+ ++E +++      E+++K KK  +K+ KK   ++  +K+++ +K  + K+K
Sbjct: 331 ELKEELKELEEELKELE-----EELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEK 385

Query: 969 --------KKKKKKEAEKMKKKTEEAE 987
                   +++ K+  E+++    E +
Sbjct: 386 LSEELEELEEELKELKEELESLYSEGK 412



 Score = 30.3 bits (69), Expect = 5.7
 Identities = 15/94 (15%), Positives = 47/94 (50%)

Query: 890 LGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
           +G      K   ++ ++  +++ ++ +K K+  K   KK   ++  +K++  +   +  E
Sbjct: 325 VGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKE 384

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           K+ ++ ++ +++ K+ K++ +    E +    KT
Sbjct: 385 KLSEELEELEEELKELKEELESLYSEGKISVNKT 418


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 35.1 bits (81), Expect = 0.22
 Identities = 18/90 (20%), Positives = 55/90 (61%), Gaps = 4/90 (4%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK- 959
             K ++   +IR++ ++ +  E++ +K   E++++ +++ E+++++  E + K  + ++ 
Sbjct: 660 EEKVEELEAEIRRELQRIENEEQLEEKL--EELEQLEEELEQLREELEELLKKLGEIEQL 717

Query: 960 -KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            ++ + +K + ++ KKE EK++K  E  E+
Sbjct: 718 IEELESRKAELEELKKELEKLEKALELLEE 747



 Score = 34.0 bits (78), Expect = 0.55
 Identities = 18/84 (21%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           +  ++K ++ EAE  R+ +  E  ++ ++  E++++   E    +++ ++  KK  + ++
Sbjct: 657 EELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQ 716

Query: 970 -----KKKKKEAEKMKKKTEEAEK 988
                + +K E E++KK+ E+ EK
Sbjct: 717 LIEELESRKAELEELKKELEKLEK 740



 Score = 32.8 bits (75), Expect = 1.2
 Identities = 18/92 (19%), Positives = 47/92 (51%), Gaps = 1/92 (1%)

Query: 895 FRKKRTRRKK-KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            R  RTR+++ ++  E++++ KKK KE E+   +        +  +AE   ++  E++  
Sbjct: 578 LRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELES 637

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           + +K   + + ++  +   ++  EK+++   E
Sbjct: 638 ELEKLNLQAELEELLQAALEELEEKVEELEAE 669



 Score = 31.3 bits (71), Expect = 3.7
 Identities = 11/81 (13%), Positives = 42/81 (51%), Gaps = 1/81 (1%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
               ++  E   K + ++ + E  + +   +++K+  ++   ++ +  E++ + +++ K+
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRK-EELEELRERLKE 597

Query: 961 KKKKKKKKKKKKKKEAEKMKK 981
            KKK K+ +++  +  E ++ 
Sbjct: 598 LKKKLKELEERLSQLEELLQS 618



 Score = 30.1 bits (68), Expect = 7.4
 Identities = 18/118 (15%), Positives = 57/118 (48%), Gaps = 4/118 (3%)

Query: 875 VSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA--EK 932
           +    +++ + A+  L   +   +   R++ +  E   + ++K +E E++ ++     E+
Sbjct: 644 LQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREE 703

Query: 933 MKKKKKDAEKMKK--KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +++  K   ++++  +  E    + ++ KK+ +K +K  +  ++  EK+ K    A+ 
Sbjct: 704 LEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEELREKLGKAGLRADI 761


>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1.  The
           overall function of the full-length Med25 is efficiently
           to coordinate the transcriptional activation of RAR/RXR
           (retinoic acid receptor/retinoic X receptor) in higher
           eukaryotic cells. Human Med25 consists of several
           domains with different binding properties, the
           N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
           from residues 229-381, a PTOV(B) or ACID domain from
           395-545, an SD2 domain from residues 564-645 and a
           C-terminal NR box-containing domain (646-650) from
           646-747. This The function of the SD domains is unclear.
          Length = 168

 Score = 33.7 bits (76), Expect = 0.23
 Identities = 13/55 (23%), Positives = 20/55 (36%), Gaps = 2/55 (3%)

Query: 124 ASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPG 178
           AS L  + T +P      V+ +   +  P      +  PTPA G    P  +   
Sbjct: 116 ASQLSLVTTVSPGSGLAPVLTQQ--QVPPQQPQQPSMVPTPALGGVQPPQPSQQQ 168


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 34.8 bits (80), Expect = 0.24
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +KKKKKKK K+KKKKK +EA K     E  E+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIHALENIEE 715



 Score = 31.4 bits (71), Expect = 3.3
 Identities = 14/23 (60%), Positives = 19/23 (82%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKE 975
           +KKKKKKK K+KKKKK ++  K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706



 Score = 30.2 bits (68), Expect = 7.8
 Identities = 13/22 (59%), Positives = 15/22 (68%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEK 923
           RKKKK  +   KKKKK +EA K
Sbjct: 684 RKKKKKKKSKEKKKKKNREASK 705


>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
          Length = 405

 Score = 34.8 bits (80), Expect = 0.24
 Identities = 21/87 (24%), Positives = 34/87 (39%), Gaps = 10/87 (11%)

Query: 911 IRKKKKKKKEAEKMR---KKKGA-EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
           +   KK  K A   +   K   A  K+    K A +M         K KK+K  KK+K+ 
Sbjct: 307 VNPIKKDVKRAPLKKNPLKNLNAMLKLNPYAKTARRMALLAEAARVKAKKEKLAKKRKQV 366

Query: 967 KKKKKKKKEA------EKMKKKTEEAE 987
            K++    +A      + M   ++  E
Sbjct: 367 SKEEAAAIKAAGKAWYKTMISDSDYTE 393


>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
           topoisomerase IA [General function prediction only].
          Length = 298

 Score = 34.3 bits (79), Expect = 0.24
 Identities = 19/47 (40%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           AE+  K G  K  K  KK    K KK   KK   K+A K KK  ++A
Sbjct: 253 AERRAKGGPGK--KPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKKA 297



 Score = 32.8 bits (75), Expect = 0.73
 Identities = 18/47 (38%), Positives = 20/47 (42%), Gaps = 1/47 (2%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           AE+  K     +K  KK      KK   KK   KK  K KK  KK A
Sbjct: 253 AERRAKGGP-GKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKKAA 298



 Score = 32.8 bits (75), Expect = 0.87
 Identities = 21/45 (46%), Positives = 23/45 (51%), Gaps = 1/45 (2%)

Query: 921 AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           AE+ R K G  K   KK  A K KK  A+K   KK  K KK  KK
Sbjct: 253 AER-RAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296



 Score = 32.0 bits (73), Expect = 1.4
 Identities = 16/42 (38%), Positives = 17/42 (40%)

Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
           R+ KG    K  KK      KK   K    KK  K KK  KK
Sbjct: 255 RRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAKK 296



 Score = 29.3 bits (66), Expect = 8.9
 Identities = 27/105 (25%), Positives = 37/105 (35%), Gaps = 19/105 (18%)

Query: 903 KKKKGAEKIRKKK--KKKKEAEKMRKKKGAEK--MKKKKKDAEKMKKKGAEKI------- 951
           K+ +G      K+     +  + +  K G     +K  K +A   K K    I       
Sbjct: 191 KRSRGRSAAALKELGTHPEGGKPITVKDGRFGPYVKDGKTNATLPKGKDPASITLEEALE 250

Query: 952 -----GKKKKKKKKKKKKKKKKKKKK---KKEAEKMKKKTEEAEK 988
                  K    KK  KK    K KK   KK A K   KT++A K
Sbjct: 251 LLAERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKKAAKTKKAAK 295


>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen.  The
           hepatitis delta virus (HDV) encodes a single protein,
           the hepatitis delta antigen (HDAg). The central region
           of this protein has been shown to bind RNA. Several
           interactions are also mediated by a coiled-coil region
           at the N terminus of the protein.
          Length = 194

 Score = 33.7 bits (77), Expect = 0.27
 Identities = 27/115 (23%), Positives = 57/115 (49%), Gaps = 27/115 (23%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEK-----------MRKKK---GAEKMKKKKKDAEKM--- 943
           R+K+++  + +RK  KK K+ E+           +RK K   GA   K+ + D  ++   
Sbjct: 24  RKKREELEKDLRKANKKIKKLEEENPWLGNVVGIIRKDKDGEGAPPAKRPRTDQMEVDSG 83

Query: 944 --KKKGAEKIGKKKKKKKKKKKKKKKKKKK--------KKKEAEKMKKKTEEAEK 988
             K+  A     ++++  +++K  + KKK+         ++E E++++ TEE E+
Sbjct: 84  PGKRPHAGGFTDQERRDHRRRKALENKKKQLSSGGKHLSREEEEELRRLTEEDEE 138


>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11.  The WW domain is a
           small protein module with a triple-stranded beta-sheet
           fold. This is a family of WW domain binding proteins.
          Length = 78

 Score = 31.5 bits (72), Expect = 0.28
 Identities = 21/81 (25%), Positives = 47/81 (58%), Gaps = 13/81 (16%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           R+++KK     ++ KK K E +  R+       K  KK+ ++++++  +K+ K +++  K
Sbjct: 11  RKEQKK-----KELKKNKAERQARREA------KLAKKNPDRLQRE-LDKL-KAQEESGK 57

Query: 961 KKKKKKKKKKKKKKEAEKMKK 981
            +  +K+K KK +K+ + +KK
Sbjct: 58  LRPVEKQKLKKLEKDLDAVKK 78



 Score = 30.7 bits (70), Expect = 0.66
 Identities = 11/34 (32%), Positives = 24/34 (70%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +K++KKK+ KK K +++ ++EA+  KK  +  ++
Sbjct: 11  RKEQKKKELKKNKAERQARREAKLAKKNPDRLQR 44



 Score = 28.4 bits (64), Expect = 4.4
 Identities = 14/66 (21%), Positives = 30/66 (45%), Gaps = 9/66 (13%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           KK+  +K K   +  R         E    KK  ++++++    +  ++ G  +  +K+K
Sbjct: 15  KKKELKKNKAERQARR---------EAKLAKKNPDRLQRELDKLKAQEESGKLRPVEKQK 65

Query: 957 KKKKKK 962
            KK +K
Sbjct: 66  LKKLEK 71



 Score = 27.2 bits (61), Expect = 9.2
 Identities = 8/36 (22%), Positives = 24/36 (66%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK+ KK K +++ +++ K  KK  ++++++ ++ + 
Sbjct: 16  KKELKKNKAERQARREAKLAKKNPDRLQRELDKLKA 51


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This
           family includes gbp a protein from Soybean that binds to
           GAGA element dinucleotide repeat DNA. It seems likely
           that the this domain mediates DNA binding. This putative
           domain contains several conserved cysteines and a
           histidine suggesting this may be a zinc-binding DNA
           interaction domain.
          Length = 301

 Score = 34.1 bits (78), Expect = 0.28
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKK--KKKKKKKKEAEKMKKKTE 984
           +A+++KK    +  K  K  K KK KKK     +  K      + K +
Sbjct: 143 EAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPD 190



 Score = 31.0 bits (70), Expect = 2.6
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           A+++ K KK +  K  K  K KK KKK +   +        
Sbjct: 144 AKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGID 184



 Score = 29.5 bits (66), Expect = 8.8
 Identities = 13/54 (24%), Positives = 19/54 (35%), Gaps = 6/54 (11%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K  KK +  +  K        K KK KKK     +  K        K   K+++
Sbjct: 148 KKPKKGQSPKVPKAP------KPKKPKKKGSVSNRSVKMPGIDPRSKPDWKSQD 195


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 34.8 bits (81), Expect = 0.29
 Identities = 17/72 (23%), Positives = 43/72 (59%), Gaps = 4/72 (5%)

Query: 911 IRKKKKKKKEAEKMR----KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
            RK+KK K+E + ++    KK      ++K ++ EK ++   +   K K K+++++++K+
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549

Query: 967 KKKKKKKKEAEK 978
           +K K+ +++  +
Sbjct: 550 EKDKETEEDEPE 561


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 34.8 bits (80), Expect = 0.29
 Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 6/75 (8%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           ++K   + KE +   +K+ ++K  K+KKDA    K       KK K  +         +K
Sbjct: 183 VKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKS------KKAKTDRDVSTSTAASQK 236

Query: 971 KKKKEAEKMKKKTEE 985
           K      K++ +++E
Sbjct: 237 KSSDLESKLEAQSKE 251



 Score = 31.7 bits (72), Expect = 2.2
 Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 5/68 (7%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKK----GAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
           KK     K  K +++K+E++K  K+K    G +K KK K D   +    A    K    +
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTD-RDVSTSTAASQKKSSDLE 242

Query: 959 KKKKKKKK 966
            K + + K
Sbjct: 243 SKLEAQSK 250


>gnl|CDD|138835 PRK12279, PRK12279, 50S ribosomal protein L22/unknown domain fusion
           protein; Provisional.
          Length = 311

 Score = 34.3 bits (78), Expect = 0.29
 Identities = 28/142 (19%), Positives = 59/142 (41%), Gaps = 24/142 (16%)

Query: 868 KRLLHVAVSDVNDDVRREAVTALGFILFR--------KKRTRRKKKKGAEKIRKK----- 914
            +LL+ A+S+V  +   + +      +++         KRT  + K  A+++ K+     
Sbjct: 48  LKLLNSAISNVQHN--SKDMDPSKLYIYKIVANQGPTMKRTLPRAKGSADQLFKRTTHLE 105

Query: 915 ---------KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
                    ++K+  A K +K K    ++ K K   K   K ++   K  +K +    + 
Sbjct: 106 IVLSDDVNEREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDPEL 165

Query: 966 KKKKKKKKKEAEKMKKKTEEAE 987
            K++++  K  EK   + EE  
Sbjct: 166 LKREQQVLKVVEKTASQKEEET 187


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 34.5 bits (80), Expect = 0.30
 Identities = 19/48 (39%), Positives = 32/48 (66%)

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +++K+  A+ +G   ++K+KKK+KKK KKKK K    K+ K  +EA +
Sbjct: 383 QELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATR 430


>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
          Length = 362

 Score = 34.4 bits (79), Expect = 0.30
 Identities = 14/67 (20%), Positives = 31/67 (46%), Gaps = 3/67 (4%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
            E E +  ++  E+ K +K    K    G  ++ +K   K++K+ +K + +  K+  A +
Sbjct: 235 AEVEAIANQRMIERCKNRK---AKRDPNGTFQVARKAAMKRRKRNRKLRARNAKELAAMR 291

Query: 979 MKKKTEE 985
           M+     
Sbjct: 292 MEANQIR 298


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 33.8 bits (78), Expect = 0.31
 Identities = 16/47 (34%), Positives = 29/47 (61%), Gaps = 3/47 (6%)

Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           K AE+   K K+AEK ++K  +   KK ++ +  KK +K+ +K++ K
Sbjct: 148 KSAER---KLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191



 Score = 33.1 bits (76), Expect = 0.53
 Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK 950
           K AE+  K+ +K++E ++   +K  E+ +  KK  EK  +K   K
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKK-IEKEVEKRQAK 191



 Score = 32.7 bits (75), Expect = 0.69
 Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 1/49 (2%)

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K AE+ K K AEK  +K+++  +KK ++ +  KK +KE EK + K  EA
Sbjct: 148 KSAER-KLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEA 195



 Score = 29.6 bits (67), Expect = 7.5
 Identities = 18/62 (29%), Positives = 30/62 (48%), Gaps = 11/62 (17%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           K AE  RK K AEK         + +K+      K ++ +  KK +K+ +K++ K    K
Sbjct: 148 KSAE--RKLKEAEK---------QEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSEAK 196

Query: 979 MK 980
           +K
Sbjct: 197 LK 198


>gnl|CDD|212008 cd00089, HR1, Protein kinase C-related kinase homology region 1
           (HR1) domain that binds Rho family small GTPases.  The
           HR1 domain, also called the ACC (anti-parallel
           coiled-coil) finger domain or Rho-binding domain binds
           small GTPases from the Rho family. It is found in Rho
           effector proteins including PKC-related kinases such as
           vertebrate PRK1 (or PKN) and yeast PKC1 protein kinases
           C, as well as in rhophilins and Rho-associated kinase
           (ROCK). Rho family members function as molecular
           switches, cycling between inactive and active forms,
           controlling a variety of cellular processes. HR1 domains
           may occur in repeat arrangements (PKN contains three HR1
           domains), separated by a short linker region.
          Length = 68

 Score = 31.1 bits (71), Expect = 0.31
 Identities = 14/51 (27%), Positives = 27/51 (52%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           K+ K+ K ++ AE + K     K+KK   + +   K+  EKI   K++ ++
Sbjct: 13  KLEKELKIREGAENLLKLYSNPKVKKDLAEVQLNLKESKEKIDLLKRQLER 63


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 33.6 bits (77), Expect = 0.32
 Identities = 16/42 (38%), Positives = 22/42 (52%), Gaps = 2/42 (4%)

Query: 949 EKIGKKKKK--KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K  KKK+    +K    KK KKK K+K  E+ KK   + +K
Sbjct: 48  PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDK 89



 Score = 33.2 bits (76), Expect = 0.43
 Identities = 18/48 (37%), Positives = 24/48 (50%)

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K  KK      +K    KK KKK K+K  +++K+ E    KTEE E 
Sbjct: 48  PKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENEN 95



 Score = 29.7 bits (67), Expect = 5.5
 Identities = 12/46 (26%), Positives = 24/46 (52%), Gaps = 4/46 (8%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK 942
            K+ ++K+        +K    K+++K  K+K  E+ KK + D +K
Sbjct: 48  PKQPKKKRPT----TPRKPATTKKSKKKDKEKLTEEEKKPESDDDK 89


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 33.9 bits (78), Expect = 0.32
 Identities = 16/108 (14%), Positives = 33/108 (30%), Gaps = 16/108 (14%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK---------------KDAE 941
           KK    K+    +K+      + E E       ++K +K                 K A 
Sbjct: 157 KKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAAL 216

Query: 942 KMKKKGAEKI-GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + +K  AE    +  K++   K   +++   +            E+  
Sbjct: 217 EEQKALAEAAAAEAAKQEAAAKAAAQEQAALQAAATAAQPSAVTESAS 264



 Score = 29.7 bits (67), Expect = 6.9
 Identities = 17/83 (20%), Positives = 32/83 (38%), Gaps = 3/83 (3%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           KK  E+ ++ KK  +E +   + K  E +   + + E        +  K +K        
Sbjct: 147 KKILEQQKEDKKSLEEKQAALEDK-LETLVALQNELETQLNSLNSQ--KAEKNALIAALA 203

Query: 965 KKKKKKKKKKEAEKMKKKTEEAE 987
            K+     +K A + +K   EA 
Sbjct: 204 AKEASALGEKAALEEQKALAEAA 226


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 34.4 bits (79), Expect = 0.32
 Identities = 23/102 (22%), Positives = 48/102 (47%), Gaps = 11/102 (10%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK---KDAEKMKKKGAEKIGKK 954
           K   +++ K   + ++++K++ + +K+  +    + K      K+AE  +++G   I  +
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424

Query: 955 KKKKKKKKKKKKKKKKK--------KKKEAEKMKKKTEEAEK 988
           K ++     + K  KK          KKE EK K++ E  EK
Sbjct: 425 KSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEK 466



 Score = 31.3 bits (71), Expect = 2.5
 Identities = 17/78 (21%), Positives = 37/78 (47%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           I K  + K+E  K+  K    +      + + +KK       ++ KK+K+K+K++ +  +
Sbjct: 406 ILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVE 465

Query: 971 KKKKEAEKMKKKTEEAEK 988
           K  +  +  K  T + +K
Sbjct: 466 KVLELFKGSKIITYKKKK 483


>gnl|CDD|197773 smart00527, HMG17, domain in high mobilty group proteins HMG14 and
           HMG 17. 
          Length = 88

 Score = 31.8 bits (72), Expect = 0.33
 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 3/86 (3%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           K+K  A +   K + K+ + ++  K    K + K K A   KK         K KK  K 
Sbjct: 2   KRKVAAGEGDDKDEPKRRSARLSAKPAPPKPEPKPKKAAAKKKSEN---VAAKGKKGAKG 58

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+ ++  K++   AE  + K+EEA  
Sbjct: 59  KQTEEAGKEENNPAENGETKSEEAPA 84


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 33.3 bits (76), Expect = 0.33
 Identities = 19/89 (21%), Positives = 45/89 (50%), Gaps = 6/89 (6%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK----KDAEKMKKKGAEK 950
           F  K+    +K+  ++   +  K KE E+  + + AEK K+      +D  + ++K    
Sbjct: 31  FIIKK--ADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIA 88

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
             ++K+++ +  K+++    K+KK  ++M
Sbjct: 89  QEEEKEEEAEDVKQQEVFSFKRKKPFKEM 117


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 34.4 bits (79), Expect = 0.35
 Identities = 24/82 (29%), Positives = 32/82 (39%), Gaps = 1/82 (1%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
               A    K  KKKK      K   AE +   ++DA K K K  +K G+ K  K     
Sbjct: 409 SSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPK-KGKKNQKKGRDKSSKVPSDS 467

Query: 964 KKKKKKKKKKKEAEKMKKKTEE 985
           K   KK+  K + +      EE
Sbjct: 468 KAGGKKESVKSQEDNNNIPPEE 489



 Score = 32.9 bits (75), Expect = 1.1
 Identities = 18/73 (24%), Positives = 26/73 (35%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           I   +K     E       ++K  KKKK      K G  +     ++   KK KK +KK 
Sbjct: 397 IGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKG 456

Query: 971 KKKKEAEKMKKKT 983
           + K        K 
Sbjct: 457 RDKSSKVPSDSKA 469


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 34.6 bits (79), Expect = 0.38
 Identities = 30/87 (34%), Positives = 43/87 (49%), Gaps = 7/87 (8%)

Query: 909  EKIRKKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAE-----KIGKKKKKKKKK 961
            E I K K  K EA+K  K  +K  E  ++ KKD  ++  K +      K  K KK  +  
Sbjct: 1490 EHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDSEII 1549

Query: 962  KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             K+ K   KK   EAEK ++K +E +K
Sbjct: 1550 IKEIKDAHKKFILEAEKSEQKIKEIKK 1576


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
           production and conversion].
          Length = 108

 Score = 32.0 bits (73), Expect = 0.39
 Identities = 14/92 (15%), Positives = 41/92 (44%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           KK      ++  E   + ++  KEA +  ++   E  ++ +K A+++ ++  E+  ++ +
Sbjct: 13  KKAEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAE 72

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +   + +K+      K  E + ++    E   
Sbjct: 73  EILAEAEKEASAILSKAAEGKVVEAALSEFLA 104



 Score = 28.2 bits (63), Expect = 8.7
 Identities = 12/78 (15%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
             ++   E   + ++  +EAE+  +K   E +++ +++AE+  ++   +  +K+      
Sbjct: 29  EAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAEEILAE-AEKEASAILS 87

Query: 962 KKKKKKKKKKKKKEAEKM 979
           K  + K  +    E   +
Sbjct: 88  KAAEGKVVEAALSEFLAI 105


>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Proteins.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Vetebrates contain two
           Proline-Serine-Threonine Phosphatase-Interacting
           Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
           mainly expressed in hematopoietic cells and are involved
           in the regulation of cell adhesion and motility.
           Mutations in PSTPIPs have been shown to cause
           autoinflammatory disorders. PSTPIP1 contains an
           N-terminal F-BAR domain, PEST motifs, and a C-terminal
           SH3 domain, while PSTPIP2 contains only the N-terminal
           F-BAR domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 239

 Score = 33.6 bits (77), Expect = 0.40
 Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 13/81 (16%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDA-----EKMKKKGAEKIGKKKKKKKKKKKKK---- 965
           +  ++EAEK+ + +  +K ++KK +      +K KK+  +K  K KK  ++K ++K    
Sbjct: 81  QSLREEAEKLEEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKDKAE 140

Query: 966 ----KKKKKKKKKEAEKMKKK 982
               K     + KEAEK+KKK
Sbjct: 141 QAYEKSSSGAQPKEAEKLKKK 161



 Score = 30.5 bits (69), Expect = 3.2
 Identities = 29/98 (29%), Positives = 48/98 (48%), Gaps = 12/98 (12%)

Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEK--MRKKKGAEKMKKKKKDA 940
           R EA     F    K++  RKK +  + +++ +K KKE  K  M+ KK  E+  ++K  A
Sbjct: 84  REEAEKLEEF--REKQKEERKKTE--DIMKRSQKNKKELYKKTMKAKKSYEQKCREKDKA 139

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           E+  +K           + K+ +K KKK  + K  AE+
Sbjct: 140 EQAYEK------SSSGAQPKEAEKLKKKAAQCKTSAEE 171



 Score = 30.5 bits (69), Expect = 4.0
 Identities = 27/94 (28%), Positives = 45/94 (47%), Gaps = 12/94 (12%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           FR+K+   +KK      R +K KK+  +K  K K + + K ++KD        AE+    
Sbjct: 93  FREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKD-------KAEQA--- 142

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              +K     + K+ +K KK+A + K   EEA+ 
Sbjct: 143 --YEKSSSGAQPKEAEKLKKKAAQCKTSAEEADS 174


>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103).  This
           family consists of several repeats of around 30 residues
           in length which are found specifically in
           mature-parasite-infected erythrocyte surface antigen
           proteins from Plasmodium falciparum. This family often
           found in conjunction with pfam00226.
          Length = 215

 Score = 33.3 bits (75), Expect = 0.40
 Identities = 20/46 (43%), Positives = 31/46 (67%)

Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           +G ++  K+K D +K K  G+EK  K+ K+K +K+ K+K KKK KK
Sbjct: 5   EGDKEDVKEKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKK 50


>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
          Length = 306

 Score = 33.9 bits (77), Expect = 0.40
 Identities = 24/109 (22%), Positives = 36/109 (33%), Gaps = 1/109 (0%)

Query: 105 LMAAQIGFDLYESATQAFLASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTP 164
           ++  Q      +      LA   Q+ +         HVV + P + AP        A   
Sbjct: 32  VIQPQAPVQPGQPGAPQQLAIPTQQPQPVPTSAMTPHVVQQAPAQPAPAAPPAAGAALPE 91

Query: 165 APGTESGPSSTTPGPETKTLESLNEKEKQLQARITKLSKILSGEVSIDL 213
           A      P+ T  G    TL    E + QL   ++ L    SG   +D 
Sbjct: 92  ALEVPPPPAFTPNGEIVGTLAGNLEGDPQLAPSVSYLEAF-SGLDKLDT 139


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 34.3 bits (79), Expect = 0.41
 Identities = 29/127 (22%), Positives = 48/127 (37%), Gaps = 22/127 (17%)

Query: 873 VAVSDVNDDVRR-EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE 931
           V  S +   VRR EAVT    I +  ++  + K+            K    ++ KK   E
Sbjct: 679 VKESGIAAGVRRIEAVTGNAAIEYLNEQEDKLKELADIL-------KVTPNELPKK--IE 729

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK------------KKKKKKKKKKEAEKM 979
           +  ++ K  +K  +    KI  +K  + K K +              K  K  K  A+++
Sbjct: 730 RFFEEWKALQKELESLKLKIADQKIDELKDKAETINGVKVLVEVVDAKDMKSLKTMADRL 789

Query: 980 KKKTEEA 986
           K K   A
Sbjct: 790 KSKLGSA 796



 Score = 31.2 bits (71), Expect = 3.4
 Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 867 TKRLLHVAVSDVNDDVRR--EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKM 924
              +L V  +++   + R  E   AL     +K+    K K   +KI + K K +    +
Sbjct: 713 LADILKVTPNELPKKIERFFEEWKAL-----QKELESLKLKIADQKIDELKDKAETINGV 767

Query: 925 R---KKKGAEKMKKKKKDAEKMKKKGAEKI 951
           +   +   A+ MK  K  A+++K K    I
Sbjct: 768 KVLVEVVDAKDMKSLKTMADRLKSKLGSAI 797


>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
           repair].
          Length = 480

 Score = 33.9 bits (78), Expect = 0.41
 Identities = 20/70 (28%), Positives = 37/70 (52%), Gaps = 4/70 (5%)

Query: 907 GAEKIRKKK----KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
            + K++KK+    + + E E  + +K  EK+K ++ + EK  +  AEKI K     +  +
Sbjct: 274 LSYKLQKKEEQTLRTRIEKELEKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLE 333

Query: 963 KKKKKKKKKK 972
              K KKK++
Sbjct: 334 VDFKSKKKRE 343


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 33.2 bits (76), Expect = 0.44
 Identities = 18/101 (17%), Positives = 42/101 (41%), Gaps = 9/101 (8%)

Query: 887 VTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           V    F            ++ A+K   ++  + E  K  +K+ A          +K  K 
Sbjct: 30  VAYQLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAA-------NSEDKEDKG 82

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            AEK  + ++ +++ +++ ++   + +KE E+  +   E E
Sbjct: 83  DAEK--EDEESEEENEEEDEESSDENEKETEEKTESNVEKE 121


>gnl|CDD|233160 TIGR00869, sec62, protein translocation protein, Sec62 family.
           Members of the NSCC2 family have been sequenced from
           various yeast, fungal and animals species including
           Saccharomyces cerevisiae, Drosophila melanogaster and
           Homo sapiens. These proteins are the Sec62 proteins,
           believed to be associated with the Sec61 and Sec63
           constituents of the general protein secretary systems of
           yeast microsomes. They are also the non-selective cation
           (NS) channels of the mammalian cytoplasmic membrane. The
           yeast Sec62 protein has been shown to be essential for
           cell growth. The mammalian NS channel proteins has been
           implicated in platelet derived growth factor(PGDF)
           dependent single channel current in fibroblasts. These
           channels are essentially closed in serum deprived
           tissue-culture cells and are specifically opened by
           exposure to PDGF. These channels are reported to exhibit
           equal selectivity for Na+, K+ and Cs+ with low
           permeability to Ca2+, and no permeability to anions
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 232

 Score = 33.3 bits (76), Expect = 0.44
 Identities = 14/25 (56%), Positives = 16/25 (64%)

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEA 976
            K K   KKK K KK KKK+ K+EA
Sbjct: 207 EKDKYSYKKKLKSKKLKKKQAKREA 231


>gnl|CDD|219555 pfam07753, DUF1609, Protein of unknown function (DUF1609).  This
           region is found in a number of hypothetical proteins
           thought to be expressed by the eukaryote Encephalitozoon
           cuniculi, an obligate intracellular microsporidial
           parasite. It is approximately 200 residues long.
          Length = 230

 Score = 33.2 bits (76), Expect = 0.45
 Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 7/63 (11%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK--KKDAEKMKKKGAEKIGK 953
            KK+  +KK KG     K  K+      +R +K  EK+K++  K   EK K +  E+I +
Sbjct: 22  AKKKGGKKKSKGGRHCYKIHKRV-----LRWRKSPEKIKEELDKGSEEKWKGRSIEEIKE 76

Query: 954 KKK 956
           +K 
Sbjct: 77  QKV 79


>gnl|CDD|237100 PRK12429, PRK12429, 3-hydroxybutyrate dehydrogenase; Provisional.
          Length = 258

 Score = 33.3 bits (77), Expect = 0.46
 Identities = 26/79 (32%), Positives = 31/79 (39%), Gaps = 12/79 (15%)

Query: 307 TGPTSGYYEGGGLYALGLIHANHGA--VINDYLLGQVKDATNEDDAVTGEAAGIAMGLVL 364
           TG  SG     GL  + L  A  GA  VI D        A        G+A G+AM +  
Sbjct: 10  TGAASGI----GL-EIALALAKEGAKVVIADLNDEAAAAAAEALQKAGGKAIGVAMDV-- 62

Query: 365 LGSKSESAIQDMVAYAKET 383
                E AI   + YA ET
Sbjct: 63  ---TDEEAINAGIDYAVET 78


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 34.3 bits (78), Expect = 0.47
 Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 1/75 (1%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            +K ++E E++   K     KK +     +K+K     G  K  + K +  K    K+K+
Sbjct: 918 LEKDQQEKEELISSKETSN-KKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKE 976

Query: 974 KEAEKMKKKTEEAEK 988
            E   +  + EE EK
Sbjct: 977 TELNTVNAQLEECEK 991



 Score = 31.9 bits (72), Expect = 1.9
 Identities = 20/106 (18%), Positives = 48/106 (45%), Gaps = 7/106 (6%)

Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK--KGAEKMKKKKKDAE 941
           R+    LG  +  KK    KK+   E+++   K+ ++ E    +  +  ++++K +++  
Sbjct: 432 RDEKKGLGRTIELKKEILEKKQ---EELKFVIKELQQLEGSSDRILELDQELRKAERELS 488

Query: 942 KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           K +K         KK+ K  + +K    +K +K  ++M++      
Sbjct: 489 KAEKNSL--TETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTT 532


>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17. 
          Length = 93

 Score = 31.3 bits (71), Expect = 0.50
 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 5/88 (5%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           R+ + K A K+ K++ K++ A    K    +   K KK A K KK        K KK  K
Sbjct: 3   RKGEAKDAAKVTKQEPKRRSARLSAKPAPPKPEPKPKKAAAKKKKPAV-----KGKKGAK 57

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            KK+ K+++ K++  AE  + KTEEA  
Sbjct: 58  GKKETKQEEAKEENPAENGETKTEEAPA 85



 Score = 29.4 bits (66), Expect = 2.0
 Identities = 23/69 (33%), Positives = 32/69 (46%)

Query: 916 KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           K+K EA+   K    E  ++  + + K      E   KK   KKKK   K KK  K KKE
Sbjct: 2   KRKGEAKDAAKVTKQEPKRRSARLSAKPAPPKPEPKPKKAAAKKKKPAVKGKKGAKGKKE 61

Query: 976 AEKMKKKTE 984
            ++ + K E
Sbjct: 62  TKQEEAKEE 70


>gnl|CDD|226059 COG3528, COG3528, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 330

 Score = 33.7 bits (77), Expect = 0.51
 Identities = 12/41 (29%), Positives = 16/41 (39%), Gaps = 4/41 (9%)

Query: 261 FLRDNLEWLARATNWAKLTATASLGVIHR---GHETEALAH 298
           +L   L W +    W +      LGV      G E + LAH
Sbjct: 115 YLHTGLNWNSLHPTWMRYFLL-DLGVTGPDALGQEVQNLAH 154



 Score = 33.7 bits (77), Expect = 0.51
 Identities = 12/41 (29%), Positives = 16/41 (39%), Gaps = 4/41 (9%)

Query: 623 FLRDNLEWLARATNWAKLTATASLGVIHR---GHETEALAH 660
           +L   L W +    W +      LGV      G E + LAH
Sbjct: 115 YLHTGLNWNSLHPTWMRYFLL-DLGVTGPDALGQEVQNLAH 154


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 33.8 bits (77), Expect = 0.51
 Identities = 15/74 (20%), Positives = 23/74 (31%), Gaps = 6/74 (8%)

Query: 113 DLYESATQAFLASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGP 172
            L  SA +   A V        P P           E  PTP+T  +   T  P   +  
Sbjct: 189 ALPLSAPRLGPADVFV---PATPRPTPR---TTASPETTPTPSTTTSPPSTTIPAPSTTI 242

Query: 173 SSTTPGPETKTLES 186
           ++   G   +   +
Sbjct: 243 AAPQAGTTPEAEGT 256



 Score = 31.8 bits (72), Expect = 2.0
 Identities = 24/95 (25%), Positives = 33/95 (34%), Gaps = 14/95 (14%)

Query: 99  GSADDDL--MAAQIG-FDLYESATQAFLASVLQELKTCAPIPDATHVVVKVPVEGAPTPA 155
           GS D  L   A ++G  D++  AT           +T       T          + T A
Sbjct: 184 GSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIA 243

Query: 156 TP----------DTEAPTPAPGTESGPSSTTPGPE 180
            P              PTP  G E+ P++ TP PE
Sbjct: 244 APQAGTTPEAEGTPAPPTPGGG-EAPPANATPAPE 277


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 33.9 bits (77), Expect = 0.51
 Identities = 21/92 (22%), Positives = 48/92 (52%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           K +    ++   E +  K K  +        +GA+    K+ +  K K++ A    ++ K
Sbjct: 161 KPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELK 220

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KK+++++K  +++++++K+ E  +K  EE EK
Sbjct: 221 KKREERRKVLEEEEQRRKQEEADRKSREEEEK 252


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 33.6 bits (77), Expect = 0.52
 Identities = 19/87 (21%), Positives = 39/87 (44%), Gaps = 8/87 (9%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE---KMKKKKKDAEKMKKKGAEKIGK 953
           +KRT R+      K+R  + K  +   + +    E   K+ +K          G ++  +
Sbjct: 289 QKRTTRR-----VKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDE 343

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMK 980
            +  ++ K++ +KK+K KKK    K+ 
Sbjct: 344 DEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 33.5 bits (78), Expect = 0.53
 Identities = 12/99 (12%), Positives = 39/99 (39%), Gaps = 22/99 (22%)

Query: 894 LFRKKRTRRKKKKGAEKI----RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
             ++   +R      +++     ++++ + E E+++ ++                   ++
Sbjct: 13  AVKEALAKRGFPLDVDELLELDEERRELQTELEELQAER----------------NALSK 56

Query: 950 KIGKKKKKKKKKKKKKK--KKKKKKKKEAEKMKKKTEEA 986
           +IG+ K+K +  +      K+ K++ K  E    + E  
Sbjct: 57  EIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAE 95



 Score = 32.3 bits (75), Expect = 1.4
 Identities = 15/72 (20%), Positives = 35/72 (48%), Gaps = 9/72 (12%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD 939
           D+ RRE  T L      + +  R     +++I + K+K ++AE +  +   +++K++ K 
Sbjct: 34  DEERRELQTELE-----ELQAERNAL--SKEIGQAKRKGEDAEALIAE--VKELKEEIKA 84

Query: 940 AEKMKKKGAEKI 951
            E    +   ++
Sbjct: 85  LEAELDELEAEL 96


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 32.6 bits (75), Expect = 0.53
 Identities = 8/49 (16%), Positives = 24/49 (48%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            E        ++  K  + KK+K + + + K+ + +A++ + +++ A  
Sbjct: 61  LELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALH 109


>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD.  This
           family consists of several putative bacterial stage IV
           sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
           is known to be essential for efficient sporulation
           although its exact function is unknown.
          Length = 383

 Score = 33.5 bits (77), Expect = 0.57
 Identities = 18/96 (18%), Positives = 36/96 (37%), Gaps = 10/96 (10%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK----MKKKGAE 949
           L         +K     +    KK      +  K    + +K  K+ ++    +K K   
Sbjct: 258 LKTNFEVLTGEKITKHYLSIGGKK------LPLKFRKNEFEKYDKEEDEKPLFLKWKLPI 311

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            I ++   + + K  K  K++  +K  +  KKK E+
Sbjct: 312 SIVRETYYEVQDKVVKLTKEEAVEKGKKLAKKKLEK 347


>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1.  RNA
           polymerases catalyze the DNA dependent polymerisation of
           RNA. Prokaryotes contain a single RNA polymerase
           compared to three in eukaryotes (not including
           mitochondrial. and chloroplast polymerases). This
           domain, domain 1, represents the clamp domain, which a
           mobile domain involved in positioning the DNA,
           maintenance of the transcription bubble and positioning
           of the nascent RNA strand.
          Length = 330

 Score = 33.4 bits (77), Expect = 0.59
 Identities = 24/93 (25%), Positives = 43/93 (46%), Gaps = 3/93 (3%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           + K ++++ KK     +KK    K  E      G +  +   K   K+ K GAE I    
Sbjct: 121 KGKNSKKRLKKINNLCKKKSICSKCGEDNG---GLKAFEGCGKYQPKISKDGAEAIKALL 177

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K  + ++ K+K +K   +K  +  KK ++E E+
Sbjct: 178 KNIEIEELKEKLRKLNPEKVLQIFKKISKEDEE 210


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 32.6 bits (75), Expect = 0.60
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
           +K+KKK   K++KK  K++K+K  E   
Sbjct: 97  EKEKKKAMSKEEKKAIKEEKEKLEEPYG 124



 Score = 32.2 bits (74), Expect = 0.95
 Identities = 14/54 (25%), Positives = 25/54 (46%), Gaps = 10/54 (18%)

Query: 941 EKMKKKGAEKIGKKKK----------KKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           + + K+    I K  K          +++K+KKK   K++KK  + EK K +  
Sbjct: 69  KILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 29.5 bits (67), Expect = 6.5
 Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 926 KKKGAEKMKK-KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
            K+    +KK  K D   + +   E+  KKK   K++KK  K++K+K ++ 
Sbjct: 72  TKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain.  This alpha
           helical domain is found in a set of bacterial plasmid
           replication proteins. The domain is found to the
           C-terminus of the primase/polymerase domain. Mutants of
           this domain are defective in template binding,
           dinucleotide formation and conformation change prior to
           DNA extension.
          Length = 135

 Score = 31.7 bits (72), Expect = 0.64
 Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 2/63 (3%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           KK+++   E   K K  E+M K  +   K  +   E++ KK KK  ++ K+KKK      
Sbjct: 3   KKEEEDSEEDFEKLK--EEMAKYDRFKGKTVEAIREEVCKKIKKSLEELKEKKKAILNTA 60

Query: 974 KEA 976
           K  
Sbjct: 61  KGV 63


>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
          Length = 103

 Score = 31.3 bits (71), Expect = 0.66
 Identities = 30/83 (36%), Positives = 55/83 (66%), Gaps = 8/83 (9%)

Query: 911 IRKKKKKKKEAE-KMRKKKGAEK--MKKKKKDAEKMKK---KGAEKIGKK--KKKKKKKK 962
           I++  K +KEAE ++ K K   K  ++K K++A+K+++   K AE+  +K  +KKKK+ +
Sbjct: 5   IKEIVKAEKEAEERIEKAKEEAKKIIRKAKEEAKKIEEEIIKKAEEEAQKLIEKKKKEGE 64

Query: 963 KKKKKKKKKKKKEAEKMKKKTEE 985
           ++ KK  ++ +KE E++K K EE
Sbjct: 65  EEAKKILEEGEKEIEELKVKAEE 87


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 33.5 bits (77), Expect = 0.66
 Identities = 19/108 (17%), Positives = 53/108 (49%), Gaps = 15/108 (13%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAE------KMKKKKKDAEKMKKKGAE 949
           R +     +++   +I   +K+ K+ E+  ++   E      ++++ K++ EK++ +   
Sbjct: 402 RPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELER 461

Query: 950 KIGKKKKKKKKKKK---------KKKKKKKKKKKEAEKMKKKTEEAEK 988
              + + K +K ++         + +K+ ++KKK  E++++K  E  K
Sbjct: 462 FRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRK 509


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
           protein sorting and transport from the endosome to the
           vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
           play an important role in the degradation of both lipids
           and cellular proteins. In order to perform this
           degradative function, vacuoles/lysosomes contain
           numerous hydrolases which have been transported in the
           form of inactive precursors via the biosynthetic pathway
           and are proteolytically activated upon delivery to the
           vacuole/lysosome. The delivery of transmembrane
           proteins, such as activated cell surface receptors to
           the lumen of the vacuole/lysosome, either for
           degradation/downregulation, or in the case of
           hydrolases, for proper localisation, requires the
           formation of multivesicular bodies (MVBs). These late
           endosomal structures are formed by invaginating and
           budding of the limiting membrane into the lumen of the
           compartment. During this process, a subset of the
           endosomal membrane proteins is sorted into the forming
           vesicles. Mature MVBs fuse with the vacuole/lysosome,
           thereby releasing cargo containing vesicles into its
           hydrolytic lumen for degradation. Endosomal proteins
           that are not sorted into the intralumenal MVB vesicles
           are either recycled back to the plasma membrane or Golgi
           complex, or remain in the limiting membrane of the MVB
           and are thereby transported to the limiting membrane of
           the vacuole/lysosome as a consequence of fusion.
           Therefore, the MVB sorting pathway plays a critical role
           in the decision between recycling and degradation of
           membrane proteins. A few archaeal sequences are also
           present within this family.
          Length = 169

 Score = 32.2 bits (74), Expect = 0.67
 Identities = 15/57 (26%), Positives = 28/57 (49%), Gaps = 4/57 (7%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK----KKKKEAEKMKKKTE 984
            ++K  ++ +K ++   +KI K + + KK  KK  K       K+KK  EK   + +
Sbjct: 5   SLRKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRYEKQLDQLD 61


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 33.3 bits (76), Expect = 0.71
 Identities = 17/56 (30%), Positives = 19/56 (33%), Gaps = 2/56 (3%)

Query: 135 PIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLESLNEK 190
           P P A HV    P         P       APG  +   +  PGP   T   LN K
Sbjct: 231 PPPGAGHVHRGGPGPPER-DDAPVVPIRPSAPGPLAAQPAPAPGPGEPT-ARLNPK 284


>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit.  This family
           represents the eukaryotic vacuolar (H+)-ATPase
           (V-ATPase) G subunit. V-ATPases generate an acidic
           environment in several intracellular compartments.
           Correspondingly, they are found as membrane-attached
           proteins in several organelles. They are also found in
           the plasma membranes of some specialised cells.
           V-ATPases consist of peripheral (V1) and membrane
           integral (V0) heteromultimeric complexes. The G subunit
           is part of the V1 subunit, but is also thought to be
           strongly attached to the V0 complex. It may be involved
           in the coupling of ATP degradation to H+ translocation.
          Length = 105

 Score = 31.0 bits (71), Expect = 0.72
 Identities = 23/71 (32%), Positives = 41/71 (57%), Gaps = 4/71 (5%)

Query: 896 RKKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           RK+R +R K  K+ AEK  ++ + ++EAE   K+  AE    + +  +K++K+  EKI +
Sbjct: 24  RKRRAKRLKQAKEEAEKEIEEYRAQREAE--FKEFEAEHSGSRGELEKKIEKETEEKIDE 81

Query: 954 KKKKKKKKKKK 964
            K+   K K+ 
Sbjct: 82  LKRSFNKNKEA 92


>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
          Length = 147

 Score = 32.0 bits (74), Expect = 0.72
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKK 936
           KG  K  + ++ + EA+   +   AE + +K
Sbjct: 41  KGNLKQLEARRAELEAKAAEELAEAEALAEK 71



 Score = 30.8 bits (71), Expect = 1.8
 Identities = 7/33 (21%), Positives = 17/33 (51%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            K+ + ++ + + K  ++  EAE + +K E   
Sbjct: 44  LKQLEARRAELEAKAAEELAEAEALAEKLEGLT 76


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 32.5 bits (74), Expect = 0.74
 Identities = 13/27 (48%), Positives = 14/27 (51%)

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAE 977
             KKK K KK K K KK + KKK    
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 32.5 bits (74), Expect = 0.75
 Identities = 16/34 (47%), Positives = 19/34 (55%)

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           +K     + KKKK K KK K K KK + KKK  E
Sbjct: 82  RKPATTAVEKKKKGKSKKNKLKGKKDEDKKKARE 115



 Score = 31.7 bits (72), Expect = 1.1
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           ++K A    +KKKK K KK K K KK + KK+A
Sbjct: 81  RRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKA 113



 Score = 31.3 bits (71), Expect = 1.8
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTE 984
           KKKK K KK K K KK+ +K K +  
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 31.0 bits (70), Expect = 2.2
 Identities = 13/27 (48%), Positives = 16/27 (59%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           EK  K K KK K K KK + KKK ++ 
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKAREG 116



 Score = 31.0 bits (70), Expect = 2.3
 Identities = 14/26 (53%), Positives = 15/26 (57%)

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEE 985
           KKKK K KK K K K+ E  KK  E 
Sbjct: 91  KKKKGKSKKNKLKGKKDEDKKKAREG 116


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 32.2 bits (74), Expect = 0.76
 Identities = 24/93 (25%), Positives = 45/93 (48%), Gaps = 8/93 (8%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
            KK     +K  AE  ++ K+  +  +K  ++   E+++KK KD EK      +K   K 
Sbjct: 39  MKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEV--EELRKKLKDYEK------DKQSLKN 90

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K + K+ +K+ K  K + E  + + +  E E+
Sbjct: 91  LKARLKELEKELKNLKWESEVLEQRFEKVERER 123



 Score = 30.3 bits (69), Expect = 3.6
 Identities = 20/74 (27%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           K  K+E  +M+K    E  +K   +  +  K+  E + KK +++ ++ +KK K  +K K+
Sbjct: 30  KSLKEEIAEMKKN--EEHNEKLMAEIAQENKRLVEPL-KKAEEEVEELRKKLKDYEKDKQ 86

Query: 975 EAEKMKKKTEEAEK 988
             + +K + +E EK
Sbjct: 87  SLKNLKARLKELEK 100


>gnl|CDD|236272 PRK08475, PRK08475, F0F1 ATP synthase subunit B; Validated.
          Length = 167

 Score = 31.9 bits (73), Expect = 0.76
 Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 5/76 (6%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           EK+++ K+KK++A K       E+ K+K +   +  KK A  + +K +K+ K   +   K
Sbjct: 67  EKLKESKEKKEDALKK-----LEEAKEKAELIVETAKKEAYILTQKIEKQTKDDIENLIK 121

Query: 969 KKKKKKEAEKMKKKTE 984
             ++  E E  K + E
Sbjct: 122 SFEELMEFEVRKMERE 137



 Score = 29.6 bits (67), Expect = 5.0
 Identities = 16/50 (32%), Positives = 29/50 (58%), Gaps = 3/50 (6%)

Query: 940 AEKMK---KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           A+ +K   K    KI K+ ++ ++K K+ K+KK+   K+ E+ K+K E  
Sbjct: 44  AKPLKNFYKSRINKISKRLEEIQEKLKESKEKKEDALKKLEEAKEKAELI 93


>gnl|CDD|234385 TIGR03897, lanti_2_LanM, type 2 lantibiotic biosynthesis protein
           LanM.  Members of this family are known generally as
           LanM, a multifunctional enzyme of lantibiotic
           biosynthesis. This catalysis by LanM distinguishes the
           type 2 lantibiotics, such as mersacidin, cinnamycin, and
           lichenicidin, from LanBC-produced type 1 lantibiotics
           such as nisin and subtilin. The N-terminal domain
           contains regions associated with Ser and Thr
           dehydration. The C-terminal region contains a pfam05147
           domain, which catalyzes the formation of the lanthionine
           bridge [Cellular processes, Toxin production and
           resistance].
          Length = 931

 Score = 33.4 bits (77), Expect = 0.76
 Identities = 31/165 (18%), Positives = 50/165 (30%), Gaps = 43/165 (26%)

Query: 269 LARATNWAKLTATASLGVIHRGHETEALAHMQSYLPRET-----GPTSGYY-EGGGLYAL 322
           LA  T   +    A           +AL  ++ YL  +           +   GG +YAL
Sbjct: 608 LAALTGKKRYRDLAR----------KALQPLRKYLETQLRLAKSMGLGAFSGLGGIIYAL 657

Query: 323 GLI--------HANHGAVINDYLLGQVKDATNEDDAVTGEAAGIAMGLVLLGSKSESAI- 373
             +          N    I + L   +       D + G AAG  + L+ L   +     
Sbjct: 658 AHLGQLLNDSELLNDAKRILNRLEELIIKDEEFLDLIGG-AAGAILVLLNLYEVTGDPEV 716

Query: 374 -----------------QDMVAYAKETQHEKILRGLAVGISFIMY 401
                            ++     K  Q EK L G + G + I +
Sbjct: 717 LELAIACGEHLLKQAVKEEGGIAWKTPQDEKPLTGFSHGAAGIAW 761


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 32.9 bits (75), Expect = 0.77
 Identities = 15/50 (30%), Positives = 20/50 (40%), Gaps = 4/50 (8%)

Query: 134 APIPDATHVVVKVP----VEGAPTPATPDTEAPTPAPGTESGPSSTTPGP 179
           A +  A+ V + +P     + AP P  P T A  PAP     P      P
Sbjct: 21  AAVAGASAVAIALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPP 70


>gnl|CDD|216568 pfam01555, N6_N4_Mtase, DNA methylase.  Members of this family are
           DNA methylases. The family contains both N-4
           cytosine-specific DNA methylases and N-6
           Adenine-specific DNA methylases.
          Length = 219

 Score = 32.4 bits (74), Expect = 0.78
 Identities = 11/68 (16%), Positives = 22/68 (32%), Gaps = 9/68 (13%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK-KKKKKKKKKKKKKK 970
            K KK                +K    + +K+KK+G  +   K         + +  +K+
Sbjct: 105 SKTKKYYTFNYD--------AIKVPYDEKDKLKKRGDSEPNGKPIGDVWDFPRVQPSEKE 156

Query: 971 KKKKEAEK 978
                 +K
Sbjct: 157 SGGHPTQK 164


>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family.  This is a family of
           sequences found in both bacteria and bacteriophages.
           This region is approximately 130 residues long and in
           some cases is found as part of the PVL (Panton-Valentine
           leukocidin) group of genes, which encode a member of the
           leukocidin group of bacterial toxins that kill
           leukocytes by creation of pores in the cell membrane.
           PVL appears to be a virulence factor associated with a
           number of human diseases.
          Length = 118

 Score = 31.3 bits (71), Expect = 0.78
 Identities = 22/84 (26%), Positives = 44/84 (52%), Gaps = 11/84 (13%)

Query: 910 KIRKKKKK--KKEAEKMRKKK-GAEKMKKKKKDAEKMKK--------KGAEKIGKKKKKK 958
           KIR K  K  + E E M+K       ++K+ K   ++ +        +  E    KK + 
Sbjct: 3   KIRIKPYKFTEFENELMKKYGITPGIVRKRVKGGWELSEALDAPYGMRLKEYREIKKSEN 62

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKK 982
            ++++K+++ ++K++KEAE  +KK
Sbjct: 63  IEQERKERELERKRRKEAELRRKK 86


>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755).  This
           family is predominated by ORFs from Circoviridae. The
           function of this family remains to be determined.
          Length = 122

 Score = 31.2 bits (71), Expect = 0.78
 Identities = 14/61 (22%), Positives = 30/61 (49%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           +K+  KE +K  +     +  ++ +   K ++K ++   KK+KK +    +  KK   KK
Sbjct: 41  QKQLLKECQKTSQLIHLSQQLQQPQRKRKEQQKSSQTHKKKRKKSRHVSSRSAKKISAKK 100

Query: 974 K 974
           +
Sbjct: 101 R 101


>gnl|CDD|235263 PRK04243, PRK04243, 50S ribosomal protein L15e; Validated.
          Length = 196

 Score = 32.2 bits (74), Expect = 0.79
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEK 923
           +K R  RKK KG EK+R   +  +   K
Sbjct: 169 KKGRGLRKKGKGTEKVRPSIRANERRGK 196


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 32.8 bits (75), Expect = 0.80
 Identities = 16/89 (17%), Positives = 46/89 (51%), Gaps = 9/89 (10%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGA-----EKMKKKKKDA----EKMKKKGAEKIGKKKKKK 958
            E+ R + + +   ++ +K+        E++ K ++++    E M++   E+I + +++ 
Sbjct: 104 QEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLRQEAMRRATEEEILEMRRET 163

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            +++ + +++  + K EAE   +  EE E
Sbjct: 164 IEEEAELERENIRAKIEAEARGRAKEERE 192


>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
           replication, recombination, and repair / Transcription /
           Translation, ribosomal structure and biogenesis].
          Length = 513

 Score = 33.2 bits (76), Expect = 0.81
 Identities = 16/84 (19%), Positives = 28/84 (33%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
           I F  +    KK K  EK  ++K        + + + A+ +K  +   E+      E   
Sbjct: 374 ISFVTEEEEVKKLKRIEKRLERKLPSAVLLPLDEPEDAKLLKTTRPGLEEESDISDEIKK 433

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEA 976
            K  KK   +    +    K    
Sbjct: 434 LKSSKKALLRGLGVRFTLSKLLAN 457



 Score = 30.5 bits (69), Expect = 5.7
 Identities = 6/74 (8%), Positives = 16/74 (21%), Gaps = 12/74 (16%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKM----------RKKKGAEKMKKKKK--DAEK 942
            +        +    K+     K+                R    A+ ++   K      
Sbjct: 438 KKALLRGLGVRFTLSKLLANLGKEIPGAGDAVTIDPELERRSPNSADDIEYILKGLSYRA 497

Query: 943 MKKKGAEKIGKKKK 956
            ++    +    K 
Sbjct: 498 EERTAKNEAANIKL 511


>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein. 
          Length = 105

 Score = 30.8 bits (70), Expect = 0.82
 Identities = 15/35 (42%), Positives = 17/35 (48%)

Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           M  KK+  +  K   A   GK KKKK  K K K K
Sbjct: 1   MPPKKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35



 Score = 30.4 bits (69), Expect = 1.4
 Identities = 15/32 (46%), Positives = 16/32 (50%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           KK  +K  K  A   G K KKKK  K K K K
Sbjct: 4   KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35



 Score = 28.1 bits (63), Expect = 9.0
 Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 953 KKKKKKKKKKKKKKK--KKKKKKKEAEKMKKK 982
           KK++KK  K        K KKKK    K+K K
Sbjct: 4   KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35



 Score = 27.7 bits (62), Expect = 9.7
 Identities = 13/32 (40%), Positives = 14/32 (43%)

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           K  +K   K A      K KKKK  K K K K
Sbjct: 4   KKQQKKAAKAAAASAGGKAKKKKWSKGKVKDK 35


>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
           structure and biogenesis].
          Length = 67

 Score = 30.0 bits (68), Expect = 0.82
 Identities = 14/46 (30%), Positives = 30/46 (65%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           K++K+    E  +++  EK  +K+K+KK   +K+K K+ +K+++ E
Sbjct: 22  KVEKEGILREMKEREFYEKPSEKRKRKKAAARKRKFKRLRKEQQRE 67



 Score = 27.7 bits (62), Expect = 5.5
 Identities = 12/33 (36%), Positives = 19/33 (57%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEK 932
              K+++  EK  +K+K+KK A + RK K   K
Sbjct: 30  REMKEREFYEKPSEKRKRKKAAARKRKFKRLRK 62



 Score = 27.3 bits (61), Expect = 7.1
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEK 923
            + K   +RK+KK A + RK K+ +KE ++
Sbjct: 37  FYEKPSEKRKRKKAAARKRKFKRLRKEQQR 66


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
           describes the F1/F0 ATP synthase b subunit in bacteria
           only. Scoring just below the trusted cutoff are the
           N-terminal domains of Mycobacterial b/delta fusion
           proteins and a subunit from an archaeon, Methanosarcina
           barkeri, in which the ATP synthase homolog differs in
           architecture and is not experimentally confirmed. This
           model helps resolve b from the related b' subunit.
           Within the family is an example from a
           sodium-translocating rather than proton-translocating
           ATP synthase [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 147

 Score = 31.6 bits (72), Expect = 0.83
 Identities = 22/86 (25%), Positives = 51/86 (59%), Gaps = 3/86 (3%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           ++KK A+ +   ++ KKEA    +KK    +K+ K +A+++ +  A K G +  ++ K +
Sbjct: 27  RQKKIADGLASAERAKKEAALA-QKKAQVILKEAKDEAQEIIEN-ANKRGSEILEEAKAE 84

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +++++K K +  AE ++ + E+A +
Sbjct: 85  AREEREKIKAQARAE-IEAEKEQARE 109



 Score = 28.9 bits (65), Expect = 7.1
 Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 5/82 (6%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           + R+KK     A   R KK A   +KK +   K  K  A++I   +   K+  +  ++ K
Sbjct: 25  ETRQKKIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEI--IENANKRGSEILEEAK 82

Query: 970 KKKKKEAEKMKKKTE---EAEK 988
            + ++E EK+K +     EAEK
Sbjct: 83  AEAREEREKIKAQARAEIEAEK 104


>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1.  This protein
           family shows weak but suggestive similarity to
           translation initiation factor SUI1 and its prokaryotic
           homologs.
          Length = 173

 Score = 32.2 bits (73), Expect = 0.83
 Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
             EAE +   +  E+  K +KD   ++K    + G KKKKK  +K   K++ + K+K
Sbjct: 48  TAEAEAVTIGEAQEE--KGEKDLLGIQKAQEAREGGKKKKKLPQKVTIKREPRTKRK 102


>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
           This family consists of several plant proteins of
           unknown function. Several sequences in this family are
           described as being "myosin heavy chain-like".
          Length = 484

 Score = 33.0 bits (75), Expect = 0.86
 Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 12/108 (11%)

Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM 943
           +EAV+  G I   K    +         R ++   KE E M  ++ A K+KK+ + AEK 
Sbjct: 8   KEAVSLFGGIADWKAHLPQ---------RSERVLVKETELMLAQEEANKLKKELEVAEKE 58

Query: 944 KKKGAEKIGKKKKKK---KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K +  +++   K+     K K +K +K++++ K+++E  K + EE E+
Sbjct: 59  KLQVLKELESTKRTVEDLKLKLEKAEKEEQQAKQDSELAKLRAEELEQ 106


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 33.3 bits (76), Expect = 0.88
 Identities = 18/70 (25%), Positives = 25/70 (35%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           K   K+  +   EK+    K    + KK   +     K     KK    K  +K K+ EK
Sbjct: 645 KNNYKVNFQNFLEKIFGGPKHIFAISKKLINEAKIYWKAINNTKKSVVIKPLEKPKKYEK 704

Query: 979 MKKKTEEAEK 988
                E  EK
Sbjct: 705 TSVSKENLEK 714



 Score = 31.3 bits (71), Expect = 3.0
 Identities = 16/55 (29%), Positives = 22/55 (40%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           KK   + +   K     +K    K  EK KK +K +   +    E  G K  KKK
Sbjct: 671 KKLINEAKIYWKAINNTKKSVVIKPLEKPKKYEKTSVSKENLEKEIFGDKFTKKK 725



 Score = 30.6 bits (69), Expect = 4.6
 Identities = 17/86 (19%), Positives = 27/86 (31%), Gaps = 3/86 (3%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---MKKKGAEKIGKKKKKKK 959
             +   E   + K KK E   +      EK  + + D E     K        K  K K 
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKN 414

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEE 985
                K +         +++ +KT E
Sbjct: 415 SSFINKTENILTNSPLKDELLEKTTE 440



 Score = 30.2 bits (68), Expect = 7.3
 Identities = 19/57 (33%), Positives = 23/57 (40%), Gaps = 2/57 (3%)

Query: 915 KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           KK   EA+   K     K     K  EK KK   EK    K+  +K+    K  KKK
Sbjct: 671 KKLINEAKIYWKAINNTKKSVVIKPLEKPKK--YEKTSVSKENLEKEIFGDKFTKKK 725


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 32.5 bits (74), Expect = 0.89
 Identities = 21/119 (17%), Positives = 42/119 (35%), Gaps = 19/119 (15%)

Query: 880 DDVRREAVTALGFILFRKKRTRRKKKKGAEK----------------IRKKKKKKKEAEK 923
           D  + +   A G IL  K  T   +     +                +   K   +   +
Sbjct: 73  DGGKCKTYFARGSILCSKSITSISQNTEGRERLLTVSHENGKLQVTYVTHSKVAHEAEIR 132

Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK---KKKKEAEKM 979
                   +  +K+  A   KK   E   +  K+K+K++ +  +K K   +KK+E ++ 
Sbjct: 133 NLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRR 191



 Score = 32.1 bits (73), Expect = 1.3
 Identities = 14/77 (18%), Positives = 37/77 (48%), Gaps = 1/77 (1%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
            +K+ G +K +++ +  ++ ++ R+KK  E+ ++  +D       G+         K + 
Sbjct: 159 HRKRSGKQKEKRRVEDSQKHKEDRRKK-QEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEP 217

Query: 962 KKKKKKKKKKKKKEAEK 978
            K+K++K    ++  E 
Sbjct: 218 PKEKRQKHHDPERRLEP 234



 Score = 31.8 bits (72), Expect = 1.7
 Identities = 16/75 (21%), Positives = 37/75 (49%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           K   + E   +   K AEK +K+ +   + K     +    K+K+K++ +  +K K+ ++
Sbjct: 124 KVAHEAEIRNLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRR 183

Query: 974 KEAEKMKKKTEEAEK 988
           K+ E+ ++  E+   
Sbjct: 184 KKQEEKRRNDEDKRP 198



 Score = 29.5 bits (66), Expect = 6.8
 Identities = 13/51 (25%), Positives = 28/51 (54%)

Query: 925 RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           RKK   E  K+  K  EK + + ++K  + ++KK+++K++  + K+     
Sbjct: 152 RKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGG 202



 Score = 29.5 bits (66), Expect = 8.0
 Identities = 19/81 (23%), Positives = 39/81 (48%), Gaps = 4/81 (4%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            EK  +   +KK  ++ RK+ G +K K++ +D++K K    E   KK+++K++  + K+ 
Sbjct: 143 FEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHK----EDRRKKQEEKRRNDEDKRP 198

Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
                    +      +E  K
Sbjct: 199 GGGGGSSGGQSGLSTKDEPPK 219


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 30.0 bits (68), Expect = 0.90
 Identities = 20/59 (33%), Positives = 36/59 (61%), Gaps = 1/59 (1%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +K+ +K +   ++ ++G EK+ KK+ K     KK + K K+ +KE  ++KKK E+ EK
Sbjct: 4   EKKLAEKDEQIAQLMEEG-EKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEK 61


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 32.7 bits (75), Expect = 0.91
 Identities = 20/94 (21%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKK---GAEKMKKKKKDAEKMKKKG---AEKIGKK 954
           R+  +   + +++  K +  +  +  +K     ++ KK   + E+++ K    +++IGK 
Sbjct: 4   RKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKA 63

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K +KK K ++ KK+ K+ K+E  ++    +  E 
Sbjct: 64  KGQKKDKIEEIKKELKELKEELTELSAALKALEA 97


>gnl|CDD|220400 pfam09785, Prp31_C, Prp31 C terminal domain.  This is the C
           terminal domain of the pre-mRNA processing factor Prp31.
           Prp31 is required for U4/U6.U5 tri-snRNP formation. In
           humans this protein has been linked to autosomal
           dominant retinitis pigmentosa.
          Length = 124

 Score = 31.2 bits (71), Expect = 0.91
 Identities = 22/87 (25%), Positives = 40/87 (45%), Gaps = 4/87 (4%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK-IGKKKKKKKKK 961
           KKK+G  + RK K++    E +RK     +M+  K++ E          +G        +
Sbjct: 16  KKKRGGRRFRKMKERFAMTE-LRKA--QNRMEFGKEEEEVGYDFDEGVGLGMLGSGGGGR 72

Query: 962 KKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +  +K  K K K ++KMKK+ + +  
Sbjct: 73  IRLSQKDAKTKAKLSKKMKKRLQSSGA 99


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
           Members of this family are the S-layer-forming
           halobacterial major cell surface glycoprotein. The
           highest scores below model cutoffs are fragmentary
           paralogs to actual members of the family. Modifications
           include at N-linked and O-linked glycosylation, a
           C-terminal diphytanylglyceryl modification, and probable
           cleavage of the PGF-CTERM tail.
          Length = 782

 Score = 32.9 bits (75), Expect = 0.94
 Identities = 15/55 (27%), Positives = 19/55 (34%), Gaps = 2/55 (3%)

Query: 138 DATHVVVKVPVEGAPTPATPDTEAPTPAPGTESG-PSSTTPGPETKTLESLNEKE 191
           D  +V V V     P   T +    T  P T     ++ T  P T T E   E  
Sbjct: 701 DRVNVEV-VEETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETT 754


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 33.0 bits (76), Expect = 0.94
 Identities = 25/128 (19%), Positives = 55/128 (42%), Gaps = 23/128 (17%)

Query: 873 VAVSDVNDDVRR-EAVTALGFILFRKKRTRRKKKKGA---------EKIR----KKKKKK 918
           ++   +   VRR EAVT    + + +++    K+  +          K+     + K+ +
Sbjct: 686 ISEEGIAAGVRRIEAVTGEAALEYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELE 745

Query: 919 KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           KE E+++KK  A ++     +A+  +  G + +          K+      K+ ++ A+ 
Sbjct: 746 KELERLKKKLAAAELADLLSNAKAEEIGGVKVLA---------KEVDGADMKELREIADD 796

Query: 979 MKKKTEEA 986
           +KKK   A
Sbjct: 797 LKKKLGSA 804


>gnl|CDD|225949 COG3415, COG3415, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 138

 Score = 31.6 bits (72), Expect = 0.94
 Identities = 11/81 (13%), Positives = 34/81 (41%), Gaps = 1/81 (1%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
             K  +  +R++       E +  ++ A++         +  ++  E  G     K +K 
Sbjct: 1   MAKPFSNDLRERVVDAVVGEGLSCREAAKRFGVSISTVYRWVRRYRET-GLDLPPKPRKG 59

Query: 963 KKKKKKKKKKKKEAEKMKKKT 983
           + +K  +++ +   E++++K 
Sbjct: 60  RPRKLSEEQLEILLERLREKD 80


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 33.0 bits (75), Expect = 1.0
 Identities = 14/46 (30%), Positives = 16/46 (34%), Gaps = 10/46 (21%)

Query: 132  TCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTP 177
              AP P A              PATP   A    P T +GP +  P
Sbjct: 2736 PAAPAPPAV----------PAGPATPGGPARPARPPTTAGPPAPAP 2771



 Score = 33.0 bits (75), Expect = 1.0
 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 1/45 (2%)

Query: 134 APIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPG 178
           AP+P +       PV  A  P  P T  P+  PG++ GP+     
Sbjct: 409 APVPASVPTPAPTPVP-ASAPPPPATPLPSAEPGSDDGPAPPPER 452



 Score = 31.1 bits (70), Expect = 3.9
 Identities = 10/35 (28%), Positives = 14/35 (40%)

Query: 147  PVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPET 181
                 P+P  PDT AP P P + S  ++       
Sbjct: 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPP 2643



 Score = 30.7 bits (69), Expect = 4.9
 Identities = 15/51 (29%), Positives = 15/51 (29%), Gaps = 1/51 (1%)

Query: 135  PIPDATHVVVKVPVEGAPTPATPDTE-APTPAPGTESGPSSTTPGPETKTL 184
            P P         P    P P  P    APT  P     PS   P P    L
Sbjct: 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL 2967



 Score = 30.3 bits (68), Expect = 8.0
 Identities = 11/52 (21%), Positives = 13/52 (25%), Gaps = 1/52 (1%)

Query: 134  APIPDATHVVVKVPVEGAPTPATPDT-EAPTPAPGTESGPSSTTPGPETKTL 184
            AP                  P T     AP P     +GP      P   +L
Sbjct: 2740 APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 32.8 bits (76), Expect = 1.0
 Identities = 15/59 (25%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           E          E+E    +K  EK K   K  +K++K   +K+  +  + KK  K ++K
Sbjct: 202 EDENDDSLAADESE--LPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKLREK 258


>gnl|CDD|217318 pfam03011, PFEMP, PFEMP DBL domain.  PfEMP1 (Plasmodium falciparum
           erythrocyte membrane protein) has been identified as the
           rosetting ligand of the malaria parasite P. falciparum.
           Rosetting is the adhesion of infected erythrocytes with
           uninfected erythrocytes in the vasculature of the
           infected organ, and is associated with severe malaria.
           PfEMP1 interacts with Complement Receptor One on
           uninfected erythrocytes to form rosettes.
          Length = 150

 Score = 31.5 bits (72), Expect = 1.1
 Identities = 24/104 (23%), Positives = 37/104 (35%), Gaps = 25/104 (24%)

Query: 903 KKKKGAEKIR----KKKKKKKEA----------EKMRKKKGAEKMKKKKKDAEKMKKKGA 948
           KKKK  EKI+    K+ K               E +  +   +K     K+ +K+K    
Sbjct: 49  KKKKEWEKIKEHFNKQYKIDDGRDEYNVLESFLEDLLFQIDIKKAYGDAKELKKIK---- 104

Query: 949 EKIGKKKKKKKKKKKKKKKKK-------KKKKKEAEKMKKKTEE 985
           E +            K  + K          +K+A+K KKK  E
Sbjct: 105 ELLKCNGCNNSANSAKNGENKDAIDCLLDHLQKKAKKCKKKHSE 148



 Score = 28.8 bits (65), Expect = 7.3
 Identities = 21/102 (20%), Positives = 31/102 (30%), Gaps = 21/102 (20%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAE--KMKKKKKDAEKMKKKG------------- 947
              KG       KKK +  EK  +KK  E  K+K+      K+                 
Sbjct: 25  NNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKEHFNKQYKIDDGRDEYNVLESFLEDL 84

Query: 948 ------AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
                  +  G  K+ KK K+  K          A+  + K 
Sbjct: 85  LFQIDIKKAYGDAKELKKIKELLKCNGCNNSANSAKNGENKD 126



 Score = 28.8 bits (65), Expect = 8.8
 Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 13/53 (24%)

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK---------KKKKEAEKMKK 981
           +D+ K +KK    +       K K  K   KKK         KKKKE EK+K+
Sbjct: 11  EDSIKWRKK----LKHCINNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKE 59


>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 155

 Score = 31.6 bits (72), Expect = 1.1
 Identities = 9/55 (16%), Positives = 15/55 (27%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K  K +        A+     +    KKKK   +    +             AE+
Sbjct: 99  KPSKLELFNAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEE 153


>gnl|CDD|201294 pfam00542, Ribosomal_L12, Ribosomal protein L7/L12 C-terminal
           domain. 
          Length = 68

 Score = 29.8 bits (68), Expect = 1.1
 Identities = 15/36 (41%), Positives = 22/36 (61%)

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           +G K+ K   +   K  K+   K+EAE++KKK EEA
Sbjct: 26  LGLKEAKDLVEGAPKTIKEGVSKEEAEEIKKKLEEA 61


>gnl|CDD|179302 PRK01558, PRK01558, V-type ATP synthase subunit E; Provisional.
          Length = 198

 Score = 32.0 bits (73), Expect = 1.1
 Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 4/57 (7%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKK----KKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           K KK   E+ ++   E I + K++ +    K +++ K+ K K +KEA   K+   EA
Sbjct: 11  KIKKDGLEEAERLANEIILEAKEEAEEIIAKAEEEAKELKAKAEKEANDYKRHALEA 67


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 30.7 bits (70), Expect = 1.1
 Identities = 12/33 (36%), Positives = 21/33 (63%), Gaps = 1/33 (3%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK 927
           F++++ ++KKK   E    +KK+KKE  + RK 
Sbjct: 37  FKREKKKKKKKSEGE-FFAEKKEKKEVSEQRKA 68



 Score = 30.3 bits (69), Expect = 1.4
 Identities = 11/35 (31%), Positives = 22/35 (62%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           K++ +K KKK   +   +KK+KK+  +++K  +K 
Sbjct: 38  KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72



 Score = 29.1 bits (66), Expect = 3.2
 Identities = 13/24 (54%), Positives = 17/24 (70%)

Query: 964 KKKKKKKKKKKEAEKMKKKTEEAE 987
           K++KKKKKKK E E   +K E+ E
Sbjct: 38  KREKKKKKKKSEGEFFAEKKEKKE 61



 Score = 29.1 bits (66), Expect = 3.9
 Identities = 15/35 (42%), Positives = 24/35 (68%), Gaps = 3/35 (8%)

Query: 955 KKKKKKKKKKKKK---KKKKKKKEAEKMKKKTEEA 986
           K++KKKKKKK +     +KK+KKE  + +K  ++A
Sbjct: 38  KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72



 Score = 28.7 bits (65), Expect = 4.8
 Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 3/35 (8%)

Query: 952 GKKKKKKKKKKKK---KKKKKKKKKKEAEKMKKKT 983
            ++KKKKKKK +     +KK+KK+  E  K  +K 
Sbjct: 38  KREKKKKKKKSEGEFFAEKKEKKEVSEQRKADQKA 72


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 32.8 bits (75), Expect = 1.1
 Identities = 18/80 (22%), Positives = 29/80 (36%), Gaps = 3/80 (3%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK---GAEKIGKKKKKKKKKK 962
           K  E    K++K   +          K ++K  D     +    G  +  K+K +K K  
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271

Query: 963 KKKKKKKKKKKKEAEKMKKK 982
           K+K KK        + M K 
Sbjct: 272 KEKAKKLGISIILFDDMTKN 291



 Score = 31.6 bits (72), Expect = 2.2
 Identities = 18/95 (18%), Positives = 31/95 (32%), Gaps = 14/95 (14%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
            L + K     K++            K      ++KG +   + +  +    +   EK+ 
Sbjct: 209 TLIKSKEININKEEKNNGSNVNNNGNKN--NKEEQKGNDLSNELEDISLGPLEYDKEKLE 266

Query: 953 KKKKKKKKKKK------------KKKKKKKKKKKE 975
           K K  K+K KK            K K    K + E
Sbjct: 267 KIKDLKEKAKKLGISIILFDDMTKNKTTNYKIQNE 301


>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
          Length = 553

 Score = 32.7 bits (75), Expect = 1.1
 Identities = 10/45 (22%), Positives = 16/45 (35%)

Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
            K   K   K K+   EK  K++ K        K + +K     +
Sbjct: 9   SKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAADD 53



 Score = 30.7 bits (70), Expect = 5.0
 Identities = 11/48 (22%), Positives = 23/48 (47%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            E + K   +K  K K+ +++K  K++ K       A+   +K+  A+
Sbjct: 5   GEPLSKNALKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAAAD 52



 Score = 30.0 bits (68), Expect = 6.7
 Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 2/38 (5%)

Query: 896 RKKRTRRKK--KKGAEKIRKKKKKKKEAEKMRKKKGAE 931
            KKR + K+  ++ A K   K      A K R +K A 
Sbjct: 13  LKKRLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRKSAA 50


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 32.3 bits (74), Expect = 1.1
 Identities = 23/67 (34%), Positives = 39/67 (58%), Gaps = 9/67 (13%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK---KKKKKKKKKKKKKKKKKKKKK 974
           K+E EKM KK     +KK +K+    K+K  EK  K   +  ++++K+ K+ +K KKK  
Sbjct: 243 KQEMEKMSKK-----IKKLEKENLAWKRK-WEKSNKALLEMAEERQKRDKEIEKLKKKID 296

Query: 975 EAEKMKK 981
           + EK+ +
Sbjct: 297 KLEKLCR 303



 Score = 30.0 bits (68), Expect = 5.4
 Identities = 24/64 (37%), Positives = 37/64 (57%), Gaps = 8/64 (12%)

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK--KKKKKKKKKKKEAEKMKKKTE 984
           K+  EKM KK K  EK      E +  K+K +K  K   +  ++++K+ KE EK+KKK +
Sbjct: 243 KQEMEKMSKKIKKLEK------ENLAWKRKWEKSNKALLEMAEERQKRDKEIEKLKKKID 296

Query: 985 EAEK 988
           + EK
Sbjct: 297 KLEK 300


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 31.9 bits (73), Expect = 1.1
 Identities = 9/37 (24%), Positives = 25/37 (67%)

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           A  + ++K ++KK+K + + ++++ K   EK+K +++
Sbjct: 31  ALALKREKAQEKKRKAEAQAERRELKARKEKLKTRSD 67



 Score = 30.0 bits (68), Expect = 3.8
 Identities = 11/49 (22%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           A  +K++K      ++K  +   + ++++ K +K+K K +    KEA+ 
Sbjct: 31  ALALKREKA-----QEKKRKAEAQAERRELKARKEKLKTRSDWLKEAQA 74


>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2.  This entry is the
           first approximately 250 residues of cortactin-binding
           protein 2. In addition to being a positional candidate
           for autism this protein is expressed at highest levels
           in the brain in humans. The human protein has six
           associated ankyrin repeat domains pfam00023 towards the
           C-terminus which act as protein-protein interaction
           domains.
          Length = 193

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 16/73 (21%), Positives = 39/73 (53%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           AEK ++K   + E EK +  +  +K        E+ +++  + + ++K ++ KK+++ +K
Sbjct: 108 AEKRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQERERLKKLLEQEKSQQAKKEQEHRK 167

Query: 968 KKKKKKKEAEKMK 980
                ++E  K+K
Sbjct: 168 LLATLEEELGKLK 180


>gnl|CDD|240396 PTZ00388, PTZ00388, 40S ribosomal protein S8-like; Provisional.
          Length = 223

 Score = 32.0 bits (73), Expect = 1.2
 Identities = 24/84 (28%), Positives = 36/84 (42%), Gaps = 6/84 (7%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEK------IGKKKK 956
            KK+ AEK   KK  K   EK  K+K  EK+      +  + ++   +      + K+K+
Sbjct: 14  NKKRYAEKAEMKKTIKAHEEKDVKEKVPEKVPDGAVPSYLLDREQVNRAKVLSNMLKQKR 73

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMK 980
           K+K  K      K K   E E  K
Sbjct: 74  KEKAGKWDVPIPKVKAMNEDEMFK 97


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 13/54 (24%), Positives = 24/54 (44%)

Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
            K+ K            +  + +K  A K   K  ++++ +K +K+ KK KK E
Sbjct: 72  IKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 9/51 (17%), Positives = 21/51 (41%)

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            K  +         E   ++K + +K+  K  ++++ +K +    K K  E
Sbjct: 75  GKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 8/63 (12%), Positives = 23/63 (36%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           +                   +   E+  +   E +   ++++ +K +K+ KK KK +   
Sbjct: 69  EDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEPVQ 128

Query: 977 EKM 979
            ++
Sbjct: 129 GQL 131



 Score = 29.7 bits (67), Expect = 3.5
 Identities = 13/54 (24%), Positives = 25/54 (46%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            K  K    ++      E  +++K +A K   K  ++   +K +K+ KK KK +
Sbjct: 72  IKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125



 Score = 28.9 bits (65), Expect = 5.9
 Identities = 8/52 (15%), Positives = 24/52 (46%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + D +  K            +  +++K + +K+  K  + E+++K  + ++K
Sbjct: 69  EDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKK 120



 Score = 28.5 bits (64), Expect = 7.8
 Identities = 9/34 (26%), Positives = 18/34 (52%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKK 938
           ++   + RK+  K  + E++RK +   K  KK +
Sbjct: 92  EEEKAEARKEALKAYQQEELRKIQKRSKKSKKAE 125



 Score = 28.5 bits (64), Expect = 8.3
 Identities = 13/58 (22%), Positives = 24/58 (41%), Gaps = 4/58 (6%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           I+  K            +  E+ K + +  E +K    E+    +K +K+ KK KK +
Sbjct: 72  IKVGKPINCRVTVNHTVELTEEEKAEAR-KEALKAYQQEE---LRKIQKRSKKSKKAE 125


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 32.5 bits (74), Expect = 1.2
 Identities = 17/79 (21%), Positives = 31/79 (39%), Gaps = 2/79 (2%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           E+  K K  KK  E    + G  +  KK     +     +E+   K +K  +  +    K
Sbjct: 42  EQSEKAKPPKKPKEA--SRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPK 99

Query: 969 KKKKKKEAEKMKKKTEEAE 987
            K    E+E+ ++  E  +
Sbjct: 100 SKAPSTESEEEEEPEETPD 118



 Score = 32.1 bits (73), Expect = 1.5
 Identities = 18/90 (20%), Positives = 32/90 (35%), Gaps = 8/90 (8%)

Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKM--------KKKKKDAEKMKKKGAEKIGKKKKKK 958
             EK   KK KK +A K   +   + +         +K K  +K K+       +  KK 
Sbjct: 9   PDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKS 68

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K  +      ++K  +  K  + T  +  
Sbjct: 69  SKPTESSAASSEEKPAKPRKSAESTRSSHP 98


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 32.9 bits (75), Expect = 1.2
 Identities = 26/103 (25%), Positives = 46/103 (44%), Gaps = 8/103 (7%)

Query: 885  EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
            +  T L    F     + K  K  E+I+  ++K +   K        K+ + K    ++K
Sbjct: 1053 KMKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALLK--------KIDENKNKLIEIK 1104

Query: 945  KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             K  E +    K+K K+ +   KKKK  +K  ++M+K  +E E
Sbjct: 1105 NKSHEHVVNADKEKNKQTEHYNKKKKSLEKIYKQMEKTLKELE 1147



 Score = 31.3 bits (71), Expect = 3.3
 Identities = 26/93 (27%), Positives = 39/93 (41%), Gaps = 8/93 (8%)

Query: 901  RRKKKKGAEKIRKKKKKKKEAEKMRKKKGA----EKMKKKKKDAEKMKKKGAEK----IG 952
            +  K+KG E   K  +     EKM+ K  +      +KK K    K + K  E+    + 
Sbjct: 1031 KLIKEKGKEIEEKVDQYISLLEKMKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALL 1090

Query: 953  KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            KK  + K K  + K K  +    A+K K K  E
Sbjct: 1091 KKIDENKNKLIEIKNKSHEHVVNADKEKNKQTE 1123


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 32.5 bits (74), Expect = 1.2
 Identities = 14/75 (18%), Positives = 29/75 (38%), Gaps = 2/75 (2%)

Query: 913 KKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           K+ +KK E +      K   +   K+  +    K     K  +       +KK    K K
Sbjct: 32  KQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNK 91

Query: 971 KKKKEAEKMKKKTEE 985
           ++ ++ +   KK ++
Sbjct: 92  EEIEKPKDEPKKPDK 106



 Score = 29.8 bits (67), Expect = 7.5
 Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 6/87 (6%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
               +++ +KK    I+  +   K  +K   K+ +E    K  +  K      E      
Sbjct: 27  TSSNSKQPEKKPE--IKPNENTPKIPKKPDNKEPSENNNNKSNNENK----DEENPSSTN 80

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKK 982
            +KK    K K++ +K K E +K  KK
Sbjct: 81  PEKKPDPSKNKEEIEKPKDEPKKPDKK 107


>gnl|CDD|217754 pfam03836, RasGAP_C, RasGAP C-terminus. 
          Length = 139

 Score = 31.1 bits (71), Expect = 1.2
 Identities = 13/42 (30%), Positives = 20/42 (47%)

Query: 941 EKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
             ++KK      K   KK +KK  KK K    K  A+++ +K
Sbjct: 86  STLQKKKKFSKQKFSVKKLQKKHGKKPKFGSYKYSAKQLYEK 127


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 32.8 bits (76), Expect = 1.2
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 16/83 (19%)

Query: 918 KKEAEKMRK--KKGAEKMKKKKKD-------------AEKMKKKGAEKIGKKKKKKKKKK 962
            +E E+M K  +  AE+ KK+K+               EK  K+  +K+   +K+K +  
Sbjct: 502 DEEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAA 561

Query: 963 KKKKKKKKKKKKEAEKMKKKTEE 985
            K+ K+  K + + E +K KTEE
Sbjct: 562 IKELKEALKGE-DKEAIKAKTEE 583


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 12/49 (24%), Positives = 32/49 (65%), Gaps = 2/49 (4%)

Query: 921 AEKMRKKKGAE-KMKKKKKDAEKMKKKG-AEKIGKKKKKKKKKKKKKKK 967
           AE+  K +  E ++ +++++ ++ + KG A+KI KK++K  + + + ++
Sbjct: 66  AERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114



 Score = 28.4 bits (64), Expect = 6.5
 Identities = 10/46 (21%), Positives = 27/46 (58%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
            K+++K+++  + +++  E   K    K +KK++K  + + + +EA
Sbjct: 70  AKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQEA 115



 Score = 28.4 bits (64), Expect = 8.2
 Identities = 9/25 (36%), Positives = 16/25 (64%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKK 927
           + K  A+KI KK++K  EA+   ++
Sbjct: 90  QAKGDADKIEKKQRKLAEAQAELQE 114


>gnl|CDD|227453 COG5124, COG5124, Protein predicted to be involved in meiotic
           recombination [Cell division and chromosome partitioning
           / General function prediction only].
          Length = 209

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 6/98 (6%)

Query: 897 KKRTRRKKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           K +T +K    +E ++KK ++ K++    +++   EK  ++KK  E  K    E + +K+
Sbjct: 73  KSQTLQKLYDSSELLKKKIQEVKQDIATYKEEIDKEKATRRKKFTEGQKNYNREALLEKR 132

Query: 956 KKKKKKKKKKKKKKKKKKKEA-----EKMKKKTEEAEK 988
           KK++ + KKK    +K +         + KKK     K
Sbjct: 133 KKEQDEIKKKLNSLQKIEPIRWDAAKIQEKKKKVHLNK 170


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 32.6 bits (74), Expect = 1.2
 Identities = 22/104 (21%), Positives = 32/104 (30%), Gaps = 23/104 (22%)

Query: 106 MAAQIGFDLYESATQAFLAS--VLQELKTCAP--IPDATHVVVKV--------------- 146
           + A+ G D Y  A Q  + S  +L + KT     I +   VVV V               
Sbjct: 30  IEAEQGKDAYPVAQQKLIYSGKILSDDKTVKEYKIKEKDFVVVMVSKPKTGTGKVAPPAA 89

Query: 147 ----PVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLES 186
                    P+P        + AP +     S +    T T   
Sbjct: 90  TPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPE 133


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 31.8 bits (72), Expect = 1.2
 Identities = 16/76 (21%), Positives = 31/76 (40%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           RK K+K +E  K + +   EK+   K++ +   K+      K    +    +++  +  K
Sbjct: 95  RKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGK 154

Query: 972 KKKEAEKMKKKTEEAE 987
             +E E      E A 
Sbjct: 155 VHEETEDSYHVEETAS 170



 Score = 29.1 bits (65), Expect = 9.1
 Identities = 15/96 (15%), Positives = 31/96 (32%), Gaps = 8/96 (8%)

Query: 901 RRKKKKGAEKIRKK----KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
            +         R++     K  +E E     +     + K+   EK  ++  E   +  +
Sbjct: 135 DKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVE 194

Query: 957 KKKKKKKKKKK----KKKKKKKEAEKMKKKTEEAEK 988
           K ++ K +          ++    E  K   EE  K
Sbjct: 195 KAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAK 230


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 32.3 bits (73), Expect = 1.2
 Identities = 22/104 (21%), Positives = 47/104 (45%), Gaps = 1/104 (0%)

Query: 884 REAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKM 943
           ++ +T L   L + +R    K+ G   +RK K  ++E  + RK   A K +K  +  E M
Sbjct: 108 KQRLTKLTQYLLKARRLALFKEHGEYMVRKPKMNRREKLRERKALNAAKFEKNIEK-ELM 166

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            +  +   G      ++    K   +++K+++ ++   + EE  
Sbjct: 167 LRLKSGIYGDTPLNVREHLWNKAATEREKRQDEKERYVEEEEES 210


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 32.6 bits (74), Expect = 1.2
 Identities = 13/57 (22%), Positives = 29/57 (50%)

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           KKG      +K      KK+ ++   +K   K K+K K  ++  K + + +++++K+
Sbjct: 716 KKGVPPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKS 772



 Score = 30.3 bits (68), Expect = 6.0
 Identities = 19/86 (22%), Positives = 33/86 (38%), Gaps = 6/86 (6%)

Query: 906 KGAEKIRKKKKK-----KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI-GKKKKKKK 959
           K     +  +K      KK+  K   +K + K K+K K+ E+  K  ++K   ++K    
Sbjct: 716 KKGVPPKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSC 775

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEE 985
                         KE+ K  +  EE
Sbjct: 776 SPSSSSSHHHSSSNKESRKSSRNKEE 801


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 31.4 bits (72), Expect = 1.2
 Identities = 15/34 (44%), Positives = 25/34 (73%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           +KKKKKKKKK+ +  ++KKKK    + +K+ ++A
Sbjct: 83  RKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 31.0 bits (71), Expect = 1.6
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           K+K  +K KKK  E   +KKKK    +++K+ +K 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116



 Score = 31.0 bits (71), Expect = 1.6
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+KKKKKKKKK+ +   EK KK     E+
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEER 110



 Score = 30.6 bits (70), Expect = 1.9
 Identities = 14/34 (41%), Positives = 25/34 (73%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K+KKKKKKKKK+ +  ++KK+   + ++ +E +K
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQK 115



 Score = 29.8 bits (68), Expect = 4.2
 Identities = 16/32 (50%), Positives = 23/32 (71%), Gaps = 2/32 (6%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEA--EKMKKK 982
           K+KKKKKKKKK+ +  ++KKKK    E+ K+ 
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKED 113



 Score = 29.4 bits (67), Expect = 5.7
 Identities = 14/31 (45%), Positives = 23/31 (74%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           K+KKKKKKKKK+ +  ++KKK+    +++ E
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKE 112


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 32.4 bits (74), Expect = 1.2
 Identities = 20/110 (18%), Positives = 37/110 (33%), Gaps = 8/110 (7%)

Query: 883 RREAVTALGFI--LFRKKRTRRKKKKGA-EKIRKKKKKKKEAEKMRK-----KKGAEKMK 934
           RR  V     +  LFR+++   ++++   E + K      E E+ R+         ++  
Sbjct: 163 RRADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEED 222

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
                + K       K     KK   K+ +     K KK      K +  
Sbjct: 223 NDNTPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDA 272


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 31.4 bits (72), Expect = 1.3
 Identities = 17/83 (20%), Positives = 45/83 (54%), Gaps = 6/83 (7%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           +K  + ++ K +E  ++ KK ++E E++ K+       + K +A  ++K+  E+   ++K
Sbjct: 113 RKALQAEQGK-SELEQEIKKLEEEKEELEKRV---AELEAKLEA--IEKREEEERQIEEK 166

Query: 957 KKKKKKKKKKKKKKKKKKEAEKM 979
           +   +    KK+ ++ K + E++
Sbjct: 167 RHADEIAFLKKQNQQLKSQLEQI 189



 Score = 30.6 bits (70), Expect = 2.3
 Identities = 12/46 (26%), Positives = 28/46 (60%), Gaps = 2/46 (4%)

Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           M+K  A +  + K + +++ KK +++K++ +K   +++ K E  EK
Sbjct: 112 MRK--ALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK 155


>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
          Length = 866

 Score = 32.5 bits (74), Expect = 1.3
 Identities = 20/86 (23%), Positives = 28/86 (32%), Gaps = 14/86 (16%)

Query: 134 APIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPE---------TKTL 184
              P         P   AP PA     A  PAP   + P++  P P+            L
Sbjct: 8   KEEPAKEEATPPAPAASAPAPA-----AAAPAPVAAAAPAAAGPRPDDEPFKASDALHAL 62

Query: 185 ESLNEKEKQLQARITKLSKILSGEVS 210
            +L  K++  Q       + L G  S
Sbjct: 63  VALKLKKRIDQIEALDSIEDLVGGKS 88


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 32.4 bits (74), Expect = 1.3
 Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 6/73 (8%)

Query: 919 KEAEKMRKK---KGAEKMKKKK-KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           K   K+ KK   K A    +KK K A +  K+  ++  K+    KK+    K+K+   K 
Sbjct: 325 KNDSKLNKKYILKIANDWARKKVKTAYEAIKENRKR--KELALAKKQSSTTKRKENLPKW 382

Query: 975 EAEKMKKKTEEAE 987
              + KK T + E
Sbjct: 383 SNPEYKKTTSQEE 395



 Score = 30.1 bits (68), Expect = 6.0
 Identities = 24/99 (24%), Positives = 42/99 (42%), Gaps = 6/99 (6%)

Query: 890 LGFILFRKKRTRRKK--KKGAEKI-RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           L ++L +      KK   K A    RKK K   EA K  +K+    + KK+    K K+ 
Sbjct: 319 LDYVLLKNDSKLNKKYILKIANDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKEN 378

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKK---KKKEAEKMKKK 982
             +    + KK   ++++ K +  K    K+     +K+
Sbjct: 379 LPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQKR 417


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 32.4 bits (73), Expect = 1.3
 Identities = 17/82 (20%), Positives = 32/82 (39%), Gaps = 7/82 (8%)

Query: 885 EAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK 944
           E     G  + +++ TR  K +      ++  K+K ++         K  K+    + + 
Sbjct: 483 EIRLKNGVAMSKEELTREDKNR-----LRRALKRKRSKA--NLPNVNKRSKRNDVVDTLS 535

Query: 945 KKGAEKIGKKKKKKKKKKKKKK 966
           K     I +K +KK    K KK
Sbjct: 536 KANITVINQKGEKKDVSGKTKK 557


>gnl|CDD|239500 cd03406, Band_7_3, A subgroup of the band 7 domain of flotillin
           (reggie) like proteins. This subgroup contains proteins
           similar to stomatin, prohibitin, flotillin, HlfK/C and
           podicin.  Many of these band 7 domain-containing
           proteins are lipid raft-associated.  Individual proteins
           of this band 7 domain family may cluster to form
           membrane microdomains which may in turn recruit
           multiprotein complexes.  Microdomains formed from
           flotillin proteins may in addition be dynamic units with
           their own regulatory functions.  Flotillins have been
           implicated in signal transduction, vesicle trafficking,
           cytoskeleton rearrangement and are known to interact
           with a variety of proteins.  Stomatin interacts with and
           regulates members of the degenerin/epithelia Na+ channel
           family in mechanosensory cells of Caenorhabditis elegans
           and vertebrate neurons and participates in trafficking
           of Glut1 glucose transporters. Prohibitin may act as a
           chaperone for the stabilization of mitochondrial
           proteins.  Prokaryotic HflK/C plays a role in the
           decision between lysogenic and lytic cycle growth during
           lambda phage infection. Flotillins have been implicated
           in the progression of prion disease, in the pathogenesis
           of neurodegenerative diseases such as Parkinson's and
           Alzheimer's disease and, in cancer invasion and
           metastasis. Mutations in the podicin gene give rise to
           autosomal recessive steroid resistant nephritic
           syndrome.
          Length = 280

 Score = 32.1 bits (73), Expect = 1.3
 Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 4/74 (5%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG---AEKIGKKKKK 957
           R  K K  E IR+  +   EAEK +     +K K  +K+AE  +KK    AEK+ +  K 
Sbjct: 151 RVTKPKIPEAIRRNYELM-EAEKTKLLIAIQKQKVVEKEAETERKKAVIEAEKVAQVAKI 209

Query: 958 KKKKKKKKKKKKKK 971
              +K  +K+ +K+
Sbjct: 210 LFGQKVMEKETEKR 223


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 31.9 bits (73), Expect = 1.4
 Identities = 18/96 (18%), Positives = 50/96 (52%), Gaps = 16/96 (16%)

Query: 908 AEKIRKKKKKKKEAEKM-------RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK--- 957
            EK  + ++ K EA +        ++K+  + M+ +++  ++  K+  EK+  +++K   
Sbjct: 202 KEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLA 261

Query: 958 ------KKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
                 + K +++++  K+  K EAE ++K+ ++ +
Sbjct: 262 EQERMLEHKLQEQEELLKEGFKTEAESLQKEIQDLK 297



 Score = 29.9 bits (68), Expect = 6.2
 Identities = 17/80 (21%), Positives = 42/80 (52%), Gaps = 9/80 (11%)

Query: 914 KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK----KKKKKKKKKKKK 969
            K+K  EAE+ +    AE  + +++   + +K+  + +  +++      K+  +K + ++
Sbjct: 201 AKEKAIEAERAK----AEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAER 256

Query: 970 KKKKKEAEKM-KKKTEEAEK 988
           +K   E E+M + K +E E+
Sbjct: 257 EKLLAEQERMLEHKLQEQEE 276


>gnl|CDD|153337 cd07653, F-BAR_CIP4-like, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Cdc42-Interacting Protein
           4 and similar proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. This subfamily is composed of
           Cdc42-Interacting Protein 4 (CIP4), Formin Binding
           Protein 17 (FBP17), FormiN Binding Protein 1-Like
           (FNBP1L), and similar proteins. CIP4 and FNBP1L are
           Cdc42 effectors that bind Wiskott-Aldrich syndrome
           protein (WASP) and function in endocytosis. CIP4 and
           FBP17 bind to the Fas ligand and may be implicated in
           the inflammatory response. CIP4 may also play a role in
           phagocytosis. Members of this subfamily typically
           contain an N-terminal F-BAR domain and a C-terminal SH3
           domain. In addition, some members such as FNBP1L contain
           a central Cdc42-binding HR1 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 251

 Score = 31.8 bits (73), Expect = 1.4
 Identities = 21/71 (29%), Positives = 40/71 (56%), Gaps = 4/71 (5%)

Query: 921 AEKMRK---KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           AE +     K+    + + +++ +K   +G+ K+ +K +   K+ +K KK  +K  KEAE
Sbjct: 85  AENLNSNVCKELKTLISELRQERKKHLSEGS-KLQQKLESSIKQLEKSKKAYEKAFKEAE 143

Query: 978 KMKKKTEEAEK 988
           K K+K E+A+ 
Sbjct: 144 KAKQKYEKADA 154


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 32.5 bits (74), Expect = 1.4
 Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 7/90 (7%)

Query: 905 KKGAEKIRK-KKKKKKEAEKMRKKKGAE---KMKKKKKDAEKMKKKGAE---KIGKKKKK 957
           K+   ++++ KK+KK+E +K+ +++ A     M  K K   K   +  E      K  + 
Sbjct: 59  KREKARLKELKKQKKQEIQKILEQQNAAIDADMNNKGKGRLKYLLQQTEIFAHFAKGDQS 118

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
              KK K + +   K  E E+ ++  +E E
Sbjct: 119 ASAKKAKGRGRHASKLTEEEEDEEYLKEEE 148



 Score = 30.9 bits (70), Expect = 3.8
 Identities = 22/98 (22%), Positives = 38/98 (38%), Gaps = 21/98 (21%)

Query: 906 KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK- 964
           + AE   K +  K+E  ++++ K     K+KK++ +K+ ++    I      K K + K 
Sbjct: 47  EEAESPAKAEISKREKARLKELK-----KQKKQEIQKILEQQNAAIDADMNNKGKGRLKY 101

Query: 965 ---------------KKKKKKKKKKEAEKMKKKTEEAE 987
                          +    KK K       K TEE E
Sbjct: 102 LLQQTEIFAHFAKGDQSASAKKAKGRGRHASKLTEEEE 139


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 32.3 bits (74), Expect = 1.4
 Identities = 20/106 (18%), Positives = 45/106 (42%), Gaps = 1/106 (0%)

Query: 881 DVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDA 940
           +   +    L  +      TR + K   EK+ K K++  E ++   +   E  +  ++ A
Sbjct: 364 EELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELA 423

Query: 941 E-KMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           +      G E    + +++K+ K  + KK++ K ++      K E+
Sbjct: 424 DLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQ 469



 Score = 32.0 bits (73), Expect = 2.2
 Identities = 20/91 (21%), Positives = 49/91 (53%), Gaps = 2/91 (2%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAE-KMRKKKGAEKMKKKK-KDAEKMKKKGAEKIGKKKKK 957
            ++ K  G E+  + K+K  E E ++   + +   K+++ +DAE+   K   +I K   +
Sbjct: 278 NKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAE 337

Query: 958 KKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            ++ +++ ++++K++ K  E+  +  EE E 
Sbjct: 338 IEELEREIEEERKRRDKLTEEYAELKEELED 368



 Score = 30.8 bits (70), Expect = 4.3
 Identities = 20/96 (20%), Positives = 53/96 (55%), Gaps = 9/96 (9%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKM--RKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           RKK+K  E++ + ++  +  + +   K++  E+++++++ AE+ +    EK   +  +  
Sbjct: 170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELL 229

Query: 960 KKKKKKKKKKKK-------KKKEAEKMKKKTEEAEK 988
           K+K+  +++K+         ++E EK+ ++  E EK
Sbjct: 230 KEKEALERQKEAIERQLASLEEELEKLTEEISELEK 265



 Score = 30.4 bits (69), Expect = 6.0
 Identities = 19/97 (19%), Positives = 52/97 (53%), Gaps = 4/97 (4%)

Query: 893 ILFRKKRTRRKK----KKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
           ++  +KR + ++    ++ AE+ +   K+K+E E     K  E ++++K+  E+      
Sbjct: 191 LIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLE 250

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           E++ K  ++  + +K+ ++ ++  ++  +K+K   EE
Sbjct: 251 EELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEE 287



 Score = 29.7 bits (67), Expect = 9.4
 Identities = 15/88 (17%), Positives = 42/88 (47%), Gaps = 3/88 (3%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           ++  +   EK   +K+ ++  E+    K   K  +K+ +    KK   E++ ++ ++ + 
Sbjct: 819 QKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKK---EELEEELEELEA 875

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             +  + +    KKE ++++ +  E E+
Sbjct: 876 ALRDLESRLGDLKKERDELEAQLRELER 903


>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
           [DNA replication, recombination, and repair].
          Length = 663

 Score = 32.6 bits (75), Expect = 1.4
 Identities = 21/85 (24%), Positives = 43/85 (50%), Gaps = 4/85 (4%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIG----KKKKKKKKKKK 963
           A+KI    +K  +  + R++      ++     + +KKK  + +     + + K K +KK
Sbjct: 554 ADKITDSMQKAIDETERRREIQMAYNEEHGITPQTIKKKIRDILDGEYEEDEYKAKIEKK 613

Query: 964 KKKKKKKKKKKEAEKMKKKTEEAEK 988
             K  KK+ +K  +K++K+ +EA K
Sbjct: 614 ASKMSKKELEKLIKKLEKEMKEAAK 638


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 32.3 bits (74), Expect = 1.4
 Identities = 19/83 (22%), Positives = 45/83 (54%), Gaps = 1/83 (1%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKK-KEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           +R ++    K  G   +++  + +  E E+  KKK  EK+++K+K  E++++   E+  +
Sbjct: 287 YRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKE 346

Query: 954 KKKKKKKKKKKKKKKKKKKKKEA 976
              K ++ +KK +  +K+ +K  
Sbjct: 347 LNSKLEEIQKKLEDLEKRLEKLK 369



 Score = 31.1 bits (71), Expect = 2.6
 Identities = 20/83 (24%), Positives = 45/83 (54%), Gaps = 7/83 (8%)

Query: 909 EKIRKKK---KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           E  R +K    K      +++   A    +  ++  ++KKK  EKI +K+K+ ++ ++  
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEA----RLNEEERELKKKFTEKIREKEKRLEELEQNL 340

Query: 966 KKKKKKKKKEAEKMKKKTEEAEK 988
            +++K+   + E+++KK E+ EK
Sbjct: 341 IEERKELNSKLEEIQKKLEDLEK 363


>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
          Length = 145

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 13/29 (44%), Positives = 15/29 (51%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           K   KK KKK KK  K  KK  +  K+K 
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKV 29



 Score = 30.8 bits (70), Expect = 1.7
 Identities = 14/30 (46%), Positives = 17/30 (56%)

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           K  A+K  KK KK  K  KK  K KK+K +
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKVR 30



 Score = 30.4 bits (69), Expect = 1.9
 Identities = 18/57 (31%), Positives = 28/57 (49%)

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           K  A+K KKK K A K  KKG +   +K +   +  + K  +  KK K   ++K  +
Sbjct: 1   KAPAKKAKKKAKKAAKAAKKGVKVKKRKVRTSVRFFRPKTLRLPKKPKYPRRVKPAS 57


>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD.  YqfD is part of
           the sigma-E regulon in the sporulation program of
           endospore-forming Gram-positive bacteria. Mutation
           results in a sporulation defect in Bacillus subtilis.
           Members are found in all currently known
           endospore-forming bacteria, including the genera
           Bacillus, Symbiobacterium, Carboxydothermus,
           Clostridium, and Thermoanaerobacter [Cellular processes,
           Sporulation and germination].
          Length = 382

 Score = 32.3 bits (74), Expect = 1.4
 Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 11/94 (11%)

Query: 896 RKKRTRRKKKKGAEKIRKK----KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
           R+ RT +K  +    I  K    KK  KE  K   ++       K K     +     +I
Sbjct: 259 REVRTGKKIVRYYLNIGGKKIYLKKPPKEFAKYDVEE-------KTKKFGLWRWSLPIEI 311

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
            K+   + K+K +K  K++  +K  EK +K+ ++
Sbjct: 312 VKETYYEVKEKVEKVTKEEAIEKAKEKAEKELKK 345


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 23/76 (30%), Positives = 45/76 (59%), Gaps = 9/76 (11%)

Query: 922 EKMRKKKGAEKMKKKKKDAEK-------MKK--KGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           E++RK +  E++K+K ++A+K       +K+  K  EK   + KK++ ++ K +  KKKK
Sbjct: 12  EELRKAQSLEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKK 71

Query: 973 KKEAEKMKKKTEEAEK 988
           K + EK+  + +  +K
Sbjct: 72  KFKKEKVDVRVKVVKK 87


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 31.6 bits (72), Expect = 1.5
 Identities = 13/43 (30%), Positives = 20/43 (46%)

Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           E+ RK++ A   KKK K  +  K+   + + K  K      KK
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198



 Score = 30.0 bits (68), Expect = 4.2
 Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 1/46 (2%)

Query: 937 KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK-KKKKKKEAEKMKK 981
            K  EK K++ A    KK K +K  K+ +KK   K  K     +KK
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198



 Score = 29.2 bits (66), Expect = 8.6
 Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEA-EKMKKKTEEAEK 988
            K+ +K+K++     KKK K +K  KEA +K   K  +A  
Sbjct: 153 AKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIV 193


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 32.0 bits (73), Expect = 1.5
 Identities = 12/79 (15%), Positives = 35/79 (44%), Gaps = 1/79 (1%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            E  ++ + K +   K  + +    ++++    EK   +  E+  +    + +K  ++KK
Sbjct: 340 GESFQEWRSKLQALAK-VESRSLLSVEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKK 398

Query: 968 KKKKKKKEAEKMKKKTEEA 986
           K +++  E  + +  T  +
Sbjct: 399 KLREQADEEYQQRHATRAS 417


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 31.0 bits (70), Expect = 1.5
 Identities = 24/99 (24%), Positives = 51/99 (51%), Gaps = 18/99 (18%)

Query: 903 KKKKGAEKIRKKK-----KKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKK 957
           K+ +G E   +++     +  +EAE+ RK     K +K +++ E M++   +K G KKK+
Sbjct: 23  KEDEGDESDAEEEDEEIQEALREAEEERK----AKHRKMEEEREVMRQGIRDKYGIKKKE 78

Query: 958 KKKKKKKKK---------KKKKKKKKEAEKMKKKTEEAE 987
           + +++ +           +KKK  ++ A +  ++ EE E
Sbjct: 79  EDEEEPQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEE 117


>gnl|CDD|217787 pfam03910, Adeno_PV, Adenovirus minor core protein PV. 
          Length = 336

 Score = 32.1 bits (73), Expect = 1.5
 Identities = 16/43 (37%), Positives = 25/43 (58%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           K+K  E+M +  A +I   +  K + K +K K+ KK+KK  EK
Sbjct: 3   KRKIKEEMLQILAPEIYGPRPVKDEAKPRKIKRVKKRKKREEK 45



 Score = 30.2 bits (68), Expect = 5.0
 Identities = 15/45 (33%), Positives = 25/45 (55%)

Query: 933 MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           M K+K   E ++    E  G +  K + K +K K+ KK+KK+E +
Sbjct: 1   MSKRKIKEEMLQILAPEIYGPRPVKDEAKPRKIKRVKKRKKREEK 45


>gnl|CDD|238706 cd01418, Ribosomal_L19e_A, Ribosomal protein L19e, archaeal.  L19e
           is found in the large ribosomal subunit of eukaryotes
           and archaea. L19e is distinct from the ribosomal subunit
           L19, which is found in prokaryotes. It consists of two
           small globular domains connected by an extended segment.
           It is located toward the surface of the large subunit,
           with one exposed end involved in forming the
           intersubunit bridge with the small subunit.  The other
           exposed end is involved in forming the translocon
           binding site, along with L22, L23, L24, L29, and L31e
           subunits.
          Length = 145

 Score = 30.8 bits (70), Expect = 1.6
 Identities = 28/111 (25%), Positives = 52/111 (46%), Gaps = 13/111 (11%)

Query: 874 AVSDVNDDVRREAVTAL---GFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA 930
            + +V + + R+ + AL   G I       + K KKG  + R K++ +K  +  R+  G+
Sbjct: 25  RLEEVAEAITRDDIRALIKEGVI-------KAKPKKGISRGRLKERHEKRKKGRRRGPGS 77

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
              +K KK A   KK+   K  +  ++  K+ + K K  KK  ++  +  K
Sbjct: 78  ---RKGKKGARTPKKERWIKTIRALRRYLKELRDKGKIDKKTYRKLYRKAK 125


>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region.  This family
           contains sequences that are similar to the N-terminal
           region of Red protein. This and related proteins contain
           a RED repeat which consists of a number of RE and RD
           sequence elements. The region in question has several
           conserved NLS sequences and a putative trimeric
           coiled-coil region, suggesting that these proteins are
           expressed in the nucleus. The function of Red protein is
           unknown, but efficient sequestration to nuclear bodies
           suggests that its expression may be tightly regulated of
           that the protein self-aggregates extremely efficiently.
          Length = 238

 Score = 31.7 bits (72), Expect = 1.6
 Identities = 10/33 (30%), Positives = 18/33 (54%)

Query: 956 KKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           KKKKK    +K+++  +K+   K + +  E  K
Sbjct: 1   KKKKKYAYLRKQEENAEKEINPKYRDRARERRK 33


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 31.0 bits (70), Expect = 1.6
 Identities = 12/83 (14%), Positives = 45/83 (54%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           ++  RR++ +  E+ R+++ + +   + R+ +   + + + +   + + +  E+    ++
Sbjct: 23  RRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERDKDARE 82

Query: 957 KKKKKKKKKKKKKKKKKKEAEKM 979
            KK++++K  K++  + K  E++
Sbjct: 83  PKKRERQKLIKEEDLEGKSDEEV 105


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 32.2 bits (74), Expect = 1.6
 Identities = 11/40 (27%), Positives = 18/40 (45%)

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
            EK+ K  +K     K        KKKE E+  ++ +E +
Sbjct: 140 VEKLEKFGEKVGCGVKILYYHSSLKKKEKEEFLERLKEGD 179


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
           protein involved in ribosome biogenesis.
          Length = 173

 Score = 31.3 bits (71), Expect = 1.6
 Identities = 22/94 (23%), Positives = 39/94 (41%), Gaps = 5/94 (5%)

Query: 899 RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKG----AEKMKKKKKDAEKMKKKG-AEKIGK 953
           R ++K +    K+ +K+ K+K+ +K     G    A+   KKK  A+  K+ G A     
Sbjct: 2   RKKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGLAADPNT 61

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             +     + K   + KK +   E+  K     E
Sbjct: 62  AARTDLLIQDKPIPELKKGEARIERDDKGNPLDE 95


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 31.6 bits (72), Expect = 1.6
 Identities = 10/36 (27%), Positives = 19/36 (52%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           K K K K K K  KK +++ K+E + ++ +     +
Sbjct: 98  KPKPKPKPKPKPVKKVEEQPKREVKPVEPRPASPFE 133



 Score = 29.7 bits (67), Expect = 6.6
 Identities = 11/48 (22%), Positives = 19/48 (39%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           E  K+      K + K   K   K K  KK +++ K++ K  +     
Sbjct: 83  EPPKEAPVVIPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEPRPAS 130



 Score = 29.7 bits (67), Expect = 7.6
 Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            K + K K K K K K  KK  E+ K++ +  E 
Sbjct: 93  PKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126



 Score = 29.7 bits (67), Expect = 8.0
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           I K + K K K K K K  KK +++ ++  K  E 
Sbjct: 92  IPKPEPKPKPKPKPKPKPVKKVEEQPKREVKPVEP 126


>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
          Length = 290

 Score = 31.6 bits (72), Expect = 1.6
 Identities = 11/43 (25%), Positives = 17/43 (39%)

Query: 923 KMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKK 965
           K + K     +K+     EK  +KG       K+     +KKK
Sbjct: 246 KAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288



 Score = 31.6 bits (72), Expect = 1.9
 Identities = 10/42 (23%), Positives = 14/42 (33%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
              K K    K      EK  +K  +     K+     +KKK
Sbjct: 247 AKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288



 Score = 31.2 bits (71), Expect = 2.1
 Identities = 10/45 (22%), Positives = 16/45 (35%)

Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           K   K K K    ++   K  +   K  +     K+     +KKK
Sbjct: 244 KPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKK 288



 Score = 30.9 bits (70), Expect = 3.4
 Identities = 10/44 (22%), Positives = 16/44 (36%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
            K K K K      K+   K+ K  +K  +     K+     +K
Sbjct: 243 AKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRK 286


>gnl|CDD|234773 PRK00461, rpmC, 50S ribosomal protein L29; Reviewed.
          Length = 87

 Score = 29.7 bits (67), Expect = 1.7
 Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 21/86 (24%)

Query: 914 KKKKKKEAEKMRKKKGAE-----------------KMKKKKKDAEKMKKKGAEKIGKKKK 956
           +KK  +E EK+  +  AE                 K+K+ +KD  ++       + +++ 
Sbjct: 6   RKKSVEELEKLVIELKAELFTLRFKNATGSLDQTHKIKEIRKDIARILTI----LNEREL 61

Query: 957 KKKKKKKKKKKKKKKKKKEAEKMKKK 982
           ++K+  K+ KK  KK  K A K  K+
Sbjct: 62  EEKENNKEPKKNTKKAVKLASKAGKQ 87


>gnl|CDD|191748 pfam07464, ApoLp-III, Apolipophorin-III precursor (apoLp-III).
           This family consists of several insect
           apolipoprotein-III sequences. Exchangeable
           apolipoproteins constitute a functionally important
           family of proteins that play critical roles in lipid
           transport and lipoprotein metabolism. Apolipophorin III
           (apoLp-III) is a prototypical exchangeable
           apolipoprotein found in many insect species that
           functions in transport of diacylglycerol (DAG) from the
           fat body lipid storage depot to flight muscles in the
           adult life stage.
          Length = 155

 Score = 31.0 bits (70), Expect = 1.7
 Identities = 19/83 (22%), Positives = 43/83 (51%), Gaps = 10/83 (12%)

Query: 914 KKKKKKEAEKMRKK-----KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           ++  ++ AE++RK      + A++++ K + A +   + ++K+ K+     +   +K   
Sbjct: 69  RQNIEETAEELRKAHPEVEQQAQELRDKLQAAVQSTVQESQKLAKEVASNVEGTNEKLAP 128

Query: 969 KKKKK-----KEAEKMKKKTEEA 986
           K K+      K AE+++KK  EA
Sbjct: 129 KIKQAYDDFVKNAEEVQKKLHEA 151


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 31.9 bits (73), Expect = 1.8
 Identities = 19/49 (38%), Positives = 28/49 (57%)

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            E++K+  AEK   + +KK  +KK KK  KK K +EA   +    EAE+
Sbjct: 75  EEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123



 Score = 31.1 bits (71), Expect = 2.9
 Identities = 16/63 (25%), Positives = 35/63 (55%), Gaps = 2/63 (3%)

Query: 919 KEAEKMRKKKGAEKMK--KKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           +E E+  +++  E++K  + +K+  + +KKG EK  KK  KK K ++   ++  + + E 
Sbjct: 64  EEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEE 123

Query: 977 EKM 979
            + 
Sbjct: 124 PEP 126


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 11/74 (14%), Positives = 41/74 (55%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           +K+  +    K A+K  +K+  +   + +K  ++++K+  +   +     ++ +K K+++
Sbjct: 24  QKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQE 83

Query: 969 KKKKKKEAEKMKKK 982
            ++K++E ++ ++ 
Sbjct: 84  LQQKQQELQQKQQA 97


>gnl|CDD|236078 PRK07726, PRK07726, DNA topoisomerase III; Provisional.
          Length = 658

 Score = 32.1 bits (74), Expect = 1.8
 Identities = 13/40 (32%), Positives = 16/40 (40%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
            M K K    KM      + GK+K   KK   +    KKK
Sbjct: 619 PMLKVKGKNGKMLVCQDRECGKRKNVSKKTNARCPNCKKK 658


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 31.2 bits (71), Expect = 1.9
 Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 7/52 (13%)

Query: 943 MKKKGAEKIGKKKK-------KKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           M K+    I    K          K +K++KK   K++K+A K +K+  E E
Sbjct: 71  MTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122



 Score = 30.0 bits (68), Expect = 5.4
 Identities = 10/28 (35%), Positives = 18/28 (64%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
           +K++KK   K++KK  K +K+K  E+  
Sbjct: 97  QKEEKKAMSKEEKKAIKAEKEKLEEEYG 124



 Score = 29.6 bits (67), Expect = 7.3
 Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           K +K++KK   K++KK  K +KE  + +  
Sbjct: 95  KAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 31.6 bits (72), Expect = 1.9
 Identities = 10/33 (30%), Positives = 23/33 (69%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
             + ++ KKK++ + KKKK+ +E + ++KK ++
Sbjct: 84  SLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQ 116



 Score = 30.9 bits (70), Expect = 2.9
 Identities = 15/44 (34%), Positives = 24/44 (54%)

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
           KD    +K   EK       + ++ KKK++ + KKKKE E++K 
Sbjct: 66  KDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKA 109


>gnl|CDD|219911 pfam08572, PRP3, pre-mRNA processing factor 3 (PRP3).  Pre-mRNA
           processing factor 3 (PRP3) is a U4/U6-associated
           splicing factor. The human PRP3 has been implicated in
           autosomal retinitis pigmentosa.
          Length = 222

 Score = 31.2 bits (71), Expect = 1.9
 Identities = 15/68 (22%), Positives = 36/68 (52%), Gaps = 3/68 (4%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKK-KDAEKMKKKGAEKIGKK 954
             KR +R+++    +  +K K  ++AE++R++   E+ + +  ++A+    K A  + +K
Sbjct: 11  EAKRKKRRRRGL--QFIEKGKFSQQAEELRREAKLEEEEARISEEAQNAGLKSATDLAEK 68

Query: 955 KKKKKKKK 962
             KK+   
Sbjct: 69  IPKKELPD 76


>gnl|CDD|234054 TIGR02904, spore_ysxE, spore coat protein YsxE.  Members of this
           family are homologs of the Bacillus subtilis spore coat
           protein CotS. Members of this family, designated YsxE,
           are found only in the family Bacillaceae, from among the
           endospore-forming members of the Firmicutes branch of
           the Bacteria. As a rule, the ysxE gene is found
           immediately downstream of spoVID, a gene necessary for
           spore coat assembly. The protein has been shown to be
           part of the spore coat [Cellular processes, Sporulation
           and germination].
          Length = 309

 Score = 31.7 bits (72), Expect = 2.0
 Identities = 25/105 (23%), Positives = 43/105 (40%), Gaps = 18/105 (17%)

Query: 529 IELKEFSLKKLNDIVDDFWPENEKEKQLQARITKLSKILSGEVSIDLHLQFLIRSNKTDM 588
           ++  EF+  KL D     W E  KEK+   R      ++ G++S+  H  +    ++T  
Sbjct: 171 MQAAEFAQNKLED-----WHEATKEKE-SWRTV----LVHGKLSLS-HFLY----DETRG 215

Query: 589 LILKNTKESIRVSICHTATIIANAFMHSGTTSDQFLRDNLEWLAR 633
               N +++   S     T +A  F  S  T  Q + +  EW   
Sbjct: 216 GYFINFEKA---SFASPLTDLAAFFRRSFGTYPQPIDEMFEWFYT 257


>gnl|CDD|145004 pfam01632, Ribosomal_L35p, Ribosomal protein L35. 
          Length = 61

 Score = 28.8 bits (65), Expect = 2.0
 Identities = 15/50 (30%), Positives = 28/50 (56%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
           KMK  K  A++ K+  + KI ++K  K+    KK  K+K++ ++   + K
Sbjct: 2   KMKTVKAAAKRFKRTASGKIKRRKAGKRHLLTKKSTKRKRRLRKKVLVSK 51


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 30.4 bits (69), Expect = 2.0
 Identities = 13/80 (16%), Positives = 29/80 (36%), Gaps = 2/80 (2%)

Query: 910 KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
            I       +E  ++ K+    + +KK    +  +K   E++  +   +   +    K  
Sbjct: 35  DIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKAT 94

Query: 970 KKK--KKEAEKMKKKTEEAE 987
           KK+    E E+    +E   
Sbjct: 95  KKQLFSSEYEQTSSSSESTS 114


>gnl|CDD|220357 pfam09716, ETRAMP, Malarial early transcribed membrane protein
           (ETRAMP).  These sequences represent a family of
           proteins from the malaria parasite Plasmodium
           falciparum, several of which have been shown to be
           expressed specifically in the ring stage as well as the
           rodent parasite Plasmodium yoelii. A homologue from
           Plasmodium chabaudi was localized to the parasitophorous
           vacuole membrane. Members have an initial hydrophobic,
           Phe/Tyr-rich, stretch long enough to span the membrane,
           a highly charged region rich in Lys, a second putative
           transmembrane region and a second highly charged, low
           complexity sequence region. Some members have up to 100
           residues of additional C-terminal sequence. These genes
           have been shown to be found in the sub-telomeric regions
           of both Plasmodium falciparum and P. yoelii chromosomes.
          Length = 82

 Score = 29.1 bits (66), Expect = 2.0
 Identities = 13/29 (44%), Positives = 14/29 (48%)

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
                  KK KK KK     +KKKK KKK
Sbjct: 25  NNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 28.4 bits (64), Expect = 4.3
 Identities = 13/31 (41%), Positives = 15/31 (48%)

Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           +      K  K KK KK     +KKKK KKK
Sbjct: 23  LNNNVVAKKKKLKKLKKIDDDLEKKKKNKKK 53



 Score = 28.0 bits (63), Expect = 6.0
 Identities = 12/25 (48%), Positives = 14/25 (56%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKK 960
            KK   K  KK  + + KKKK KKK
Sbjct: 29  AKKKKLKKLKKIDDDLEKKKKNKKK 53


>gnl|CDD|238028 cd00073, H15, linker histone 1 and histone 5 domains; the basic
           subunit of chromatin is the nucleosome, consisting of an
           octamer of core histones, two full turns of DNA, a
           linker histone (H1 or H5) and a variable length of
           linker DNA; H1/H5 are chromatin-associated proteins that
           bind to the exterior of nucleosomes and dramatically
           stabilize the highly condensed states of chromatin
           fibers; stabilization of higher order folding occurs
           through electrostatic neutralization of the linker DNA
           segments, through a highly positively charged carboxy-
           terminal domain known as the AKP helix (Ala, Lys, Pro);
           thought to be involved in specific protein-protein and
           protein-DNA interactions and play a role in suppressing
           core histone tail domain acetylation in the chromatin
           fiber.
          Length = 88

 Score = 29.5 bits (67), Expect = 2.0
 Identities = 32/93 (34%), Positives = 43/93 (46%), Gaps = 30/93 (32%)

Query: 885 EAVTALGFILFRKKRTRRKKKKGA--EKIRK--KKKKKKEAEKMRK------KKGAEKMK 934
           EA+ AL            K++KG+  + I+K  + K K + E   K      KKG  K K
Sbjct: 13  EAIKAL------------KERKGSSLQAIKKYIEAKYKVDDENFNKLLKLALKKGVAKGK 60

Query: 935 KKKKDAEKMKKKGAE---KIGKKKKKKKKKKKK 964
             +     +K  GA    K+ KKKKKKKK  KK
Sbjct: 61  LVQ-----VKGTGASGSFKLSKKKKKKKKPAKK 88


>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
           [Translation, ribosomal structure and biogenesis].
          Length = 98

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKK 974
           I   K  +KK+K  +KK++ KK K
Sbjct: 75  IPPDKLIRKKRKLPRKKRRPKKPK 98



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           K   +D +     G + +     K  +KK+K  +KK++ KK 
Sbjct: 57  KPDDRDVKLESLVG-DTLYIPPDKLIRKKRKLPRKKRRPKKP 97


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 31.6 bits (72), Expect = 2.0
 Identities = 15/61 (24%), Positives = 45/61 (73%), Gaps = 4/61 (6%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE-AEK 978
           E  ++R+++   +  +++++AE  + +  E+  ++++++K+++KK+ K++K+++KE AEK
Sbjct: 153 ELAELRQQQ---RQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209

Query: 979 M 979
           +
Sbjct: 210 I 210



 Score = 29.6 bits (67), Expect = 7.1
 Identities = 14/74 (18%), Positives = 44/74 (59%), Gaps = 7/74 (9%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           E++ + ++++++ E+ R    AE  + ++ +  + +++  EK   +++KK+ K++K+++K
Sbjct: 152 EELAELRQQQRQFEQRRN---AELAETQRLEEAE-RRRREEK---ERRKKQDKERKQREK 204

Query: 969 KKKKKKEAEKMKKK 982
           +  +K  A    + 
Sbjct: 205 ETAEKIAARAFAQG 218


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 31.7 bits (72), Expect = 2.1
 Identities = 11/44 (25%), Positives = 26/44 (59%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
            ++  K +K+ +K   K ++ + + K++K++K+KK KK+     
Sbjct: 120 LDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPVL 163


>gnl|CDD|235132 PRK03577, PRK03577, acid shock protein precursor; Provisional.
          Length = 102

 Score = 29.6 bits (66), Expect = 2.1
 Identities = 19/57 (33%), Positives = 29/57 (50%)

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
            K     KK+ K A + K + A+K  K KK++K  ++K +  KK  KK + K   K 
Sbjct: 40  AKTTHHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAKP 96



 Score = 28.1 bits (62), Expect = 8.7
 Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 1/50 (2%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           KK+ KK   +K +  K   K KK++K A + K + A+K  KK   K   K
Sbjct: 47  KKQHKKAPEQKAQAAKKHHKNKKEQK-APEQKAQAAKKHAKKHSHKTAAK 95


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 31.8 bits (72), Expect = 2.1
 Identities = 19/94 (20%), Positives = 42/94 (44%), Gaps = 6/94 (6%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD--AEKMKKKGAEK 950
           + +R +R          +  K  +K+ +  +M  K   EK+  KK++   + +     E+
Sbjct: 1   MAYRNRREANINNNDRMQ-EKDDEKQDQKNRMELK---EKVLDKKEEVVTDNVDSPVKEQ 56

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
             ++  K   + KK  K++ K+  E  K K++ +
Sbjct: 57  SSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQ 90


>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin. 
          Length = 137

 Score = 30.6 bits (69), Expect = 2.1
 Identities = 14/54 (25%), Positives = 22/54 (40%), Gaps = 6/54 (11%)

Query: 925 RKKKG-AEKMKKK-----KKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
            +KKG  EK+K+K         +          G   +   ++KK K  K K+K
Sbjct: 77  ERKKGIKEKIKEKLPGGHGGKGQATTTTTTGTGGGYGQHGHEEKKGKMDKIKEK 130


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 32.0 bits (72), Expect = 2.1
 Identities = 12/34 (35%), Positives = 14/34 (41%)

Query: 147 PVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPE 180
           P   AP  A P   AP  A    + P + TP P 
Sbjct: 746 PPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779


>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
           protein; Provisional.
          Length = 230

 Score = 31.1 bits (70), Expect = 2.1
 Identities = 12/75 (16%), Positives = 33/75 (44%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK 963
           KK  ++       K K+       +     + ++K + + +K  + ++ ++K   + + +
Sbjct: 114 KKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNEKVLSPELSEEKSDSELETQ 173

Query: 964 KKKKKKKKKKKEAEK 978
            +K + K+KK   E 
Sbjct: 174 PQKTQLKEKKPSIEH 188


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 30.6 bits (69), Expect = 2.2
 Identities = 22/109 (20%), Positives = 49/109 (44%), Gaps = 12/109 (11%)

Query: 875 VSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK 934
           +   N+DV+ E          +++    ++++  E+I + +  + E E +  ++  E+ +
Sbjct: 36  IIKENEDVKDE----------KQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDE 85

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKK--KKKKKKKKKKKKEAEKMKK 981
           +   D + ++KK    I    +    +    K  KK +K KK AE + K
Sbjct: 86  EDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKTAEDIVK 134


>gnl|CDD|185511 PTZ00199, PTZ00199, high mobility group protein; Provisional.
          Length = 94

 Score = 29.4 bits (66), Expect = 2.2
 Identities = 25/89 (28%), Positives = 45/89 (50%), Gaps = 5/89 (5%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK-----KKGAEKIGKKKKKK 958
           KK+G   +RK K+KKK+    ++   A     K+K AE +       K    +GK   + 
Sbjct: 3   KKQGKVLVRKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEA 62

Query: 959 KKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
             K  +++K   +KK + +K++ + E+AE
Sbjct: 63  WNKLSEEEKAPYEKKAQEDKVRYEKEKAE 91


>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor.  The ribosome recycling
           factor (RRF / ribosome release factor) dissociates the
           ribosome from the mRNA after termination of translation,
           and is essential bacterial growth. Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.
          Length = 165

 Score = 30.5 bits (70), Expect = 2.2
 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 10/79 (12%)

Query: 909 EKIRKKKKK--KKEAEKMR------KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
           E+ RK+  K  KK AE+ +      ++   +K+KK +KD  K   +   K  +K+ +K  
Sbjct: 89  EERRKELVKQAKKLAEEAKVAIRNIRRDANDKLKKLEKD--KEISEDEVKRAEKEIQKLT 146

Query: 961 KKKKKKKKKKKKKKEAEKM 979
            K  KK  +  KKKE E M
Sbjct: 147 DKYIKKIDELLKKKEKEIM 165



 Score = 30.5 bits (70), Expect = 2.5
 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 16/74 (21%)

Query: 901 RRKK-----KKGAEK-------IRKKKKKKKEAEKMRKKKGAEK--MKKKKKDAEKMKKK 946
           RRK+     KK AE+       IR+    K   +K+ K K   +  +K+ +K+ +K+  K
Sbjct: 91  RRKELVKQAKKLAEEAKVAIRNIRRDANDK--LKKLEKDKEISEDEVKRAEKEIQKLTDK 148

Query: 947 GAEKIGKKKKKKKK 960
             +KI +  KKK+K
Sbjct: 149 YIKKIDELLKKKEK 162


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 30.3 bits (69), Expect = 2.3
 Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 10/94 (10%)

Query: 890 LGFILFRKKRTRRKKKKGAEKIRKK-KKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
           LG IL  +K       K AE+  K+      EAE+        ++ + + +A ++    A
Sbjct: 24  LGKILDERKEKIANNIKEAEERLKQAAALLAEAEQ--------QLAQARAEASEIINN-A 74

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           +K  +K K++   + +K  ++  +   AE  ++K
Sbjct: 75  KKEAQKLKEEILAEAQKDAERLLESARAEIEQEK 108


>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694).  Family of
           hypothetical bacterial proteins.
          Length = 278

 Score = 31.0 bits (71), Expect = 2.4
 Identities = 16/47 (34%), Positives = 28/47 (59%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           ++K+    +     +   + ++  K+KKKKKKKKKKK++  E M +K
Sbjct: 63  QEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWMCEK 109



 Score = 30.7 bits (70), Expect = 4.0
 Identities = 15/43 (34%), Positives = 26/43 (60%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           +K+K + E  +        ++  K+KKKKKKKKKKK++ ++  
Sbjct: 64  EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREWM 106



 Score = 29.5 bits (67), Expect = 7.6
 Identities = 16/42 (38%), Positives = 25/42 (59%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           EK K   +  E  ++    +   K+KKKKKKKKKKK++ ++ 
Sbjct: 64  EKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEELREW 105


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 31.8 bits (72), Expect = 2.4
 Identities = 12/44 (27%), Positives = 32/44 (72%)

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           ++  EK+ ++++++K++ K K ++++K K+EA K ++  E A++
Sbjct: 6   RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQR 49


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 31.8 bits (72), Expect = 2.4
 Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 3/71 (4%)

Query: 915 KKKKKEAEKMRKKKGA--EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           KK   + EK+  KK +  +K +KK K+   ++ K   +  ++K       +   KK +++
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170

Query: 973 KKE-AEKMKKK 982
            KE  E  K+K
Sbjct: 171 FKEYLENFKRK 181



 Score = 31.1 bits (70), Expect = 3.3
 Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 2/84 (2%)

Query: 905 KKGAEKIRKKKKKKKE--AEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           KK  + + K + KK+    +  +K K    ++ K +  EK +K       +   KK +++
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEA 986
            K+  +  K+KK   K+ K+ E A
Sbjct: 171 FKEYLENFKRKKFKRKILKEFENA 194


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 30.4 bits (69), Expect = 2.4
 Identities = 22/98 (22%), Positives = 52/98 (53%), Gaps = 18/98 (18%)

Query: 909 EKIRKKKKKKKEAEKMRK--KKGAEKMKKK----KKDAEKMKKKGAE----------KIG 952
           E+ +  K+ +KE E++ +  K+  EK++++     +   K  +   +          KI 
Sbjct: 16  ERTQLLKRHEKELEQLERQQKRTIEKLEQRQTQELRRLPKRIRAEQKTRLKMFKESLKIE 75

Query: 953 KKKKKKKKKKKKKKKKKKKK--KKEAEKMKKKTEEAEK 988
           KK+ K++ +K  + ++++KK  K E E+ ++K ++ E+
Sbjct: 76  KKELKQEVEKLPRFQEQEKKRMKAEKEEQEQKHQKQER 113



 Score = 29.2 bits (66), Expect = 5.2
 Identities = 18/83 (21%), Positives = 47/83 (56%), Gaps = 9/83 (10%)

Query: 914 KKKKKKEAEKMRKKKGAEKM---KKKKKDAEK-MKK-----KGAEKIGKKKKKKKKKKKK 964
           ++++K+  EK+ +++  E     K+ + + +  +K      K  +K  K++ +K  + ++
Sbjct: 32  ERQQKRTIEKLEQRQTQELRRLPKRIRAEQKTRLKMFKESLKIEKKELKQEVEKLPRFQE 91

Query: 965 KKKKKKKKKKEAEKMKKKTEEAE 987
           ++KK+ K +KE ++ K + +E E
Sbjct: 92  QEKKRMKAEKEEQEQKHQKQERE 114



 Score = 28.8 bits (65), Expect = 6.3
 Identities = 21/95 (22%), Positives = 49/95 (51%), Gaps = 10/95 (10%)

Query: 883 RREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK 942
           R E  T L   +F++  + + +KK  ++  +K  + +E EK R K   E+ ++K +  E+
Sbjct: 58  RAEQKTRL--KMFKE--SLKIEKKELKQEVEKLPRFQEQEKKRMKAEKEEQEQKHQKQER 113

Query: 943 MKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
                 E + K+++  ++  ++ + +K+ +  E E
Sbjct: 114 ------EFLAKQEENLEEALQQLQNEKRHELAETE 142



 Score = 28.8 bits (65), Expect = 8.2
 Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 12/88 (13%)

Query: 903 KKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           ++++  E  R  K+ + E +  R K   E +K +KK+ ++      EK+    + ++++K
Sbjct: 43  EQRQTQELRRLPKRIRAEQKT-RLKMFKESLKIEKKELKQEV----EKL---PRFQEQEK 94

Query: 963 KKKKKKKK----KKKKEAEKMKKKTEEA 986
           K+ K +K+    K +K+  +   K EE 
Sbjct: 95  KRMKAEKEEQEQKHQKQEREFLAKQEEN 122


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 31.8 bits (73), Expect = 2.4
 Identities = 13/69 (18%), Positives = 20/69 (28%), Gaps = 1/69 (1%)

Query: 134 APIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTT-PGPETKTLESLNEKEK 192
           A    A           AP PA P   AP   P   +  ++   P        +      
Sbjct: 52  AAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAA 111

Query: 193 QLQARITKL 201
            ++  +T L
Sbjct: 112 AVEDEVTPL 120


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 30.9 bits (70), Expect = 2.5
 Identities = 15/63 (23%), Positives = 25/63 (39%), Gaps = 6/63 (9%)

Query: 922 EKMRKKKGAEK--MKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK----KKKKKKKKKKE 975
             M   K A K     K K    +  +  E I +K    +KK K+       ++ +K +E
Sbjct: 118 SVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE 177

Query: 976 AEK 978
            E+
Sbjct: 178 EEE 180


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 28.8 bits (65), Expect = 2.5
 Identities = 15/66 (22%), Positives = 37/66 (56%), Gaps = 7/66 (10%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           ++KK +    + M       K+K KKK+   +K +      K++++++ +K++++K+++K
Sbjct: 1   KQKKDEPVTLKDMLNADLLAKLKAKKKE---LKAEEE----KREEEEEARKREERKEREK 53

Query: 972 KKKEAE 977
            K   E
Sbjct: 54  NKSFEE 59


>gnl|CDD|237582 PRK14011, PRK14011, prefoldin subunit alpha; Provisional.
          Length = 144

 Score = 30.1 bits (68), Expect = 2.5
 Identities = 17/55 (30%), Positives = 32/55 (58%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
            E  KK  ++ +K KK+G +KI +  K+  K +K+ +K+ +  ++   +MK KT 
Sbjct: 90  IEDFKKSVEELDKTKKEGNKKIEELNKEITKLRKELEKRAQAIEQRQAQMKPKTN 144


>gnl|CDD|203471 pfam06518, DUF1104, Protein of unknown function (DUF1104).  This
           family consists of several hypothetical proteins of
           unknown function which appear to be found largely in
           Helicobacter pylori.
          Length = 93

 Score = 29.2 bits (66), Expect = 2.5
 Identities = 14/62 (22%), Positives = 30/62 (48%)

Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKT 983
           + K+ G  K K+ K      K    + + K   + ++K +K+ +++  KK +A  +K+  
Sbjct: 30  IDKRLGKMKEKEAKNFKAGFKAAMRKNLSKMSVEDRRKFRKEVREEMNKKVDAMSVKEAR 89

Query: 984 EE 985
           E 
Sbjct: 90  EL 91


>gnl|CDD|191968 pfam08213, DUF1713, Mitochondrial domain of unknown function
           (DUF1713).  This domain is found at the C terminal end
           of mitochondrial proteins of unknown function.
          Length = 34

 Score = 27.7 bits (62), Expect = 2.6
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKK 981
            +K+++K KK K KK +K+ +    K+ K
Sbjct: 6   LRKRRRKMKKHKYKKLRKRTRALRRKLDK 34


>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
           dihydrolipoamide acetyltransferase, long form.  This
           model describes a subset of pyruvate dehydrogenase
           complex dihydrolipoamide acetyltransferase specifically
           close by both phylogenetic and per cent identity (UPGMA)
           trees. Members of this set include two or three copies
           of the lipoyl-binding domain. E. coli AceF is a member
           of this model, while mitochondrial and some other
           bacterial forms belong to a separate model [Energy
           metabolism, Pyruvate dehydrogenase].
          Length = 546

 Score = 31.4 bits (71), Expect = 2.6
 Identities = 14/80 (17%), Positives = 29/80 (36%), Gaps = 8/80 (10%)

Query: 142 VVVKVPVEGAPTPATPDT----EAPTPAPGTESGPSSTTPGPETKTLESLNEKEKQLQAR 197
           +++ + V G+ TPAT       +    +P       +  P        +  +   Q  A+
Sbjct: 185 LILTLSVAGS-TPATAPAPASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGTQNPAK 243

Query: 198 I---TKLSKILSGEVSIDLH 214
           +       + L+ E  +DL 
Sbjct: 244 VDHAAPAVRRLAREFGVDLS 263


>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal.  This domain
           corresponds to the alpha helical C terminal domain of
           phospholipase C beta.
          Length = 181

 Score = 30.5 bits (69), Expect = 2.6
 Identities = 26/91 (28%), Positives = 54/91 (59%), Gaps = 2/91 (2%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK-MKKKGAEKIGKKKKKK 958
            +RK++   E+I+K  +  +E +  + KK  E  +K+KK+ +K + +K  E+I + K K+
Sbjct: 28  KKRKEQHLKEQIQKLTELAREKQAAQLKKLKEISEKEKKELKKKLDRKRLERIQEAKTKE 87

Query: 959 KKKKKKKKKKKKKKK-KEAEKMKKKTEEAEK 988
           K  ++++K +  +   +E  +  K+ EEA+K
Sbjct: 88  KHAQEREKTEINRSHIQEVVQSIKRLEEAQK 118


>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
           Autographa californica nuclear polyhedrosis virus
           (AcMNPV) pp31 is a nuclear phosphoprotein that
           accumulates in the virogenic stroma, which is the viral
           replication centre in the infected-cell nucleus, binds
           to DNA, and serves as a late expression factor.
          Length = 267

 Score = 31.2 bits (71), Expect = 2.6
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
              +K+K  KK  KK    KK+K+K +   
Sbjct: 222 SPSRKRKAPKKVAKKVAAAKKRKQKRSPPP 251


>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
          Length = 466

 Score = 31.4 bits (71), Expect = 2.7
 Identities = 10/29 (34%), Positives = 12/29 (41%)

Query: 151 APTPATPDTEAPTPAPGTESGPSSTTPGP 179
            PTP         PA   +SGP  + P P
Sbjct: 51  TPTPTRGKGTPTGPASPPKSGPPKSPPAP 79


>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding.  The F-actin binding
           domain forms a compact bundle of four antiparallel
           alpha-helices, which are arranged in a left-handed
           topology. Binding of F-actin to the F-actin binding
           domain may result in cytoplasmic retention and
           subcellular distribution of the protein, as well as
           possible inhibition of protein function.
          Length = 179

 Score = 30.4 bits (68), Expect = 2.7
 Identities = 24/79 (30%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 135 PIPDATHVVVKVPVEGAPTPATPDTEAPTPAP-GTESGPSSTTPGPETKTLESLNEKEKQ 193
           P P +T   V  P    P P+T    +P+P+     + PSS    P   T  SL  K +Q
Sbjct: 12  PKPQSTAKPVGTPPSPVPLPST----SPSPSKMANGTQPSSAAFIPLISTRVSL-RKTRQ 66

Query: 194 LQARITKLSKILSGEVSID 212
              RI   S  ++  V +D
Sbjct: 67  PPERIA--SGKITKGVVLD 83


>gnl|CDD|213527 TIGR00390, hslU, ATP-dependent protease HslVU, ATPase subunit.
           This model represents the ATPase subunit of HslVU, while
           the proteasome-related peptidase subunit is HslV.
           Residues 54-61 of the model contain a P-loop ATP-binding
           motif. Cys-287 of E. coli (position 308 in the seed
           alignment), studied in MEDLINE:98389714, is Ser in other
           members of the seed alignment [Protein fate, Protein
           folding and stabilization].
          Length = 441

 Score = 31.3 bits (71), Expect = 2.8
 Identities = 25/96 (26%), Positives = 37/96 (38%), Gaps = 12/96 (12%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGA----------EKMKKKKKDAEKMKKKGAEKIGK 953
            K     + + ++++   E  RKK             E   K     E M   G E++  
Sbjct: 136 AKNQWGNMEEIQEEESVREAFRKKLREGELDDKEIEIEVSAKMPSGIEIMAPPGMEEMTM 195

Query: 954 KKKKKKKKKKKKKKKKKKKK-KEAEKMKKKTEEAEK 988
             +   +     KKKK+K K KEA+K     EEA K
Sbjct: 196 NLQSLFQNLGPDKKKKRKLKIKEAKKALIA-EEAAK 230


>gnl|CDD|222242 pfam13584, BatD, Oxygen tolerance.  This family of proteins carries
           up to three membrane spanning regions and is involved in
           tolerance to oxygen in in Bacteroides spp.
          Length = 488

 Score = 31.5 bits (72), Expect = 2.8
 Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 9/57 (15%)

Query: 886 AVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK--KGAEKMKKKKKDA 940
            +  +  ++  +KR  R              + K+A K+ KK  K A+K     KDA
Sbjct: 439 LLLFVLLLILYRKRRARNAD-------VAGNRTKKANKLAKKRLKEAKKALAGNKDA 488


>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins, some contain Zn-finger domain; signature
           gene for I-A subtype; also known as TM1802 family.
          Length = 579

 Score = 31.6 bits (72), Expect = 2.8
 Identities = 13/59 (22%), Positives = 23/59 (38%), Gaps = 1/59 (1%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              +K  +   +K+KK   EK     + KK+  K K  K   +K   + +K   +    
Sbjct: 87  TYLLKTTEPLNKKIKK-WFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQDLKPG 144


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 30.3 bits (69), Expect = 2.8
 Identities = 13/65 (20%), Positives = 30/65 (46%), Gaps = 12/65 (18%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           +K E  + R+K+  ++ ++              K G K   KK +K++++  + +K+   
Sbjct: 40  RKMEKYQKREKEIQKRARE------------LRKNGDKLSPKKFEKRQEELMEDQKEMMM 87

Query: 977 EKMKK 981
           + MK 
Sbjct: 88  DMMKP 92



 Score = 30.3 bits (69), Expect = 3.2
 Identities = 10/51 (19%), Positives = 24/51 (47%), Gaps = 3/51 (5%)

Query: 936 KKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK---KKKEAEKMKKKT 983
           +K +  + ++K  +K  ++ +K   K   KK +K++    + + E M    
Sbjct: 40  RKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDMM 90


>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
          Length = 1121

 Score = 31.4 bits (72), Expect = 2.8
 Identities = 16/45 (35%), Positives = 26/45 (57%)

Query: 930 AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           A K  K  K  +K+K  G E + +    KK+K+++++KKK K K 
Sbjct: 250 ALKAPKILKYVDKLKIDGWEWLEELISGKKEKEEEEEKKKIKPKD 294


>gnl|CDD|221805 pfam12848, ABC_tran_2, ABC transporter.  This domain is related to
           pfam00005.
          Length = 85

 Score = 28.7 bits (65), Expect = 2.9
 Identities = 18/70 (25%), Positives = 36/70 (51%), Gaps = 9/70 (12%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRK-KKGAEKMKK---KKKDAEKMKKKGAEK 950
           + +++  R +++     +++K+ KK  E + + +  A K K+   + K  EKM     E+
Sbjct: 18  YLEQKEERLEQQEKAYEKQQKEIKKLEEFIDRFRAKASKAKQAQSRIKALEKM-----ER 72

Query: 951 IGKKKKKKKK 960
           I K  +KK K
Sbjct: 73  IEKPVEKKPK 82



 Score = 27.5 bits (62), Expect = 8.6
 Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 12/72 (16%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           ++KE    +++K  EK +K+ K  E+        I      + + K  K K+ + + K  
Sbjct: 20  EQKEERLEQQEKAYEKQQKEIKKLEEF-------I-----DRFRAKASKAKQAQSRIKAL 67

Query: 977 EKMKKKTEEAEK 988
           EKM++  +  EK
Sbjct: 68  EKMERIEKPVEK 79


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 31.1 bits (70), Expect = 2.9
 Identities = 12/78 (15%), Positives = 27/78 (34%), Gaps = 8/78 (10%)

Query: 919 KEAEKMRKKKGAEKMKKKKKDAE--------KMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
            EA    ++ G + +    ++ +           K   EK+   +K     + K    + 
Sbjct: 181 GEAPITTRRIGVDGISLSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQA 240

Query: 971 KKKKEAEKMKKKTEEAEK 988
             K+E +  K +  +A  
Sbjct: 241 LTKEELDAKKAELSKALA 258


>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
          Length = 132

 Score = 30.0 bits (68), Expect = 3.0
 Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKKEAEK--MKKK 982
           I KKK  +K +K+ K ++KK + KE  K   KKK
Sbjct: 99  IKKKKAGRKARKELKNRRKKVRGKEKTKVSGKKK 132



 Score = 29.6 bits (67), Expect = 3.3
 Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 9/57 (15%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKK 974
           KK   K R  +    + KKKK   K +K+        K ++KK + K+K K   KKK
Sbjct: 85  KKFEPKYRLIR--LGLIKKKKAGRKARKE-------LKNRRKKVRGKEKTKVSGKKK 132



 Score = 29.2 bits (66), Expect = 4.8
 Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           KKKK  +K +K+ K ++KK +  EK K   +
Sbjct: 100 KKKKAGRKARKELKNRRKKVRGKEKTKVSGK 130


>gnl|CDD|215038 PLN00040, PLN00040, Protein MAK16 homolog; Provisional.
          Length = 233

 Score = 30.9 bits (70), Expect = 3.0
 Identities = 24/101 (23%), Positives = 37/101 (36%), Gaps = 22/101 (21%)

Query: 899 RTRRKKKKGAEKI--RKKKKKKKEAEKMRK-KKGAEKMKKKKKDAEKMKKKG-------- 947
           R R+   K  EKI    +K  K+E  +  K +K A+  K  +K+  +  K G        
Sbjct: 121 RMRKLALKTREKIVTTPRKLLKRERRRESKAQKAAQLEKSIEKELLERLKSGTYGDIYNF 180

Query: 948 -----------AEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
                       E    +++  K  K    KKK +   E E
Sbjct: 181 PSKSYNKVLEMEEVEEAEEELPKSDKNPNSKKKSRVHVEIE 221


>gnl|CDD|227818 COG5531, COG5531, SWIB-domain-containing proteins implicated in
           chromatin remodeling [Chromatin structure and dynamics].
          Length = 237

 Score = 30.9 bits (70), Expect = 3.1
 Identities = 24/107 (22%), Positives = 35/107 (32%), Gaps = 5/107 (4%)

Query: 879 NDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK-KKK 937
            D    + V  L F  +  +   R K        K     +E   +RK+K     K   K
Sbjct: 20  RDTNNEKDVGKLLFSEWTVRLEGRFKDNNDLIRDKFDSLAEEPRVLRKEKYNITRKTTGK 79

Query: 938 KDAEKMKKKGAEKIGK----KKKKKKKKKKKKKKKKKKKKKEAEKMK 980
            D  K +        +      + K+  KKKK     K     EK+K
Sbjct: 80  NDLPKEEDSSLPSSKETENGDTEGKETDKKKKSSTISKNSPSGEKVK 126


>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit.  All proteins in this
           family for wich functions are known are DNA helicases
           that function in the nucleotide excision repair and are
           endonucleases that make the 3' incision next to DNA
           damage. They are part of a pathway requiring UvrA, UvrB,
           UvrC, and UvrD homologs. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 655

 Score = 31.1 bits (71), Expect = 3.1
 Identities = 19/81 (23%), Positives = 39/81 (48%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           A+KI    +K  E  + R+K      ++     + ++K   + +  + K+K+   KKKKK
Sbjct: 550 ADKITDSMQKAIEETERRRKIQMAYNEEHGITPQTIRKPIRDILDIELKEKEDAAKKKKK 609

Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
            +       +++KK  ++ EK
Sbjct: 610 GEDLSDLSKKELKKLIKQLEK 630


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 31.6 bits (72), Expect = 3.1
 Identities = 19/101 (18%), Positives = 32/101 (31%), Gaps = 11/101 (10%)

Query: 899  RTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKD-AEKMKKKGAEKIGKKKKK 957
                 K K   +  +   KK         K   ++K  KK+ AE  K+ GA +   K+ K
Sbjct: 960  HEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLK 1019

Query: 958  KKK----------KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            +            K    +  +    K  +K+K        
Sbjct: 1020 ELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLENN 1060


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 31.5 bits (72), Expect = 3.2
 Identities = 11/39 (28%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 951 IGKKKKKKKKKKKKKKKKK---KKKKKEAEKMKKKTEEA 986
             +KKK KK  +K+ +  +    KK+KE + + +K    
Sbjct: 83  FREKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHV 121



 Score = 31.1 bits (71), Expect = 3.8
 Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           ++KKK KK  +K+ +  +    +K K+    +EK
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEK 117


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 30.3 bits (69), Expect = 3.2
 Identities = 18/46 (39%), Positives = 23/46 (50%), Gaps = 3/46 (6%)

Query: 944 KKKGAEKIGKKKKKKKK---KKKKKKKKKKKKKKEAEKMKKKTEEA 986
            K     +GKKKK KK+   KK KKK +    K E  K   K ++A
Sbjct: 114 AKHYGIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKA 159



 Score = 29.2 bits (66), Expect = 7.5
 Identities = 18/50 (36%), Positives = 26/50 (52%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           DA   K+  A+  G    KKKK KK+++ KK KKK E+   K +  +   
Sbjct: 105 DAAPFKQWYAKHYGIDLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNH 154


>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function.  This domain
           family is found in eukaryotes, and is typically between
           96 and 116 amino acids in length.
          Length = 106

 Score = 29.2 bits (66), Expect = 3.2
 Identities = 18/48 (37%), Positives = 21/48 (43%), Gaps = 3/48 (6%)

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
           KKKG +K+       EK      + I K KK     KKK KK   KK 
Sbjct: 1   KKKGLKKLDCY---CEKKIFDKIDYIDKLKKDPNIDKKKFKKIIFKKY 45


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
           family consists of several phage minor structural
           protein GP20 sequences of around 180 residues in length.
           The function of this family is unknown.
          Length = 156

 Score = 30.0 bits (68), Expect = 3.3
 Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 7/81 (8%)

Query: 909 EKIRKKKKKKKEAEKMRKK--KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
            K+ K  +++   +K  K   K  E +KKK KD E+++K    KI K K++ K  K++ +
Sbjct: 20  SKLDKANEERDSLKKQLKNRDKQIEDLKKKVKDNEELQK----KIEKLKQQNKTAKEEYE 75

Query: 967 KKKKKKKKE-AEKMKKKTEEA 986
            K    K   A ++  K  +A
Sbjct: 76  AKLADTKLNNAIELALKKAKA 96


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 30.4 bits (69), Expect = 3.3
 Identities = 19/78 (24%), Positives = 41/78 (52%), Gaps = 4/78 (5%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
           AE+IR + +++ E EK    + AEK   K ++  + K+K AE + K+      K + + K
Sbjct: 10  AEEIRAEAEEEFEIEKAEAVEEAEK---KIEEIYEKKEKQAE-MEKQIIISNAKNEARLK 65

Query: 968 KKKKKKKEAEKMKKKTEE 985
               +++  + + ++ +E
Sbjct: 66  VLNAREELLDSVFEEAKE 83


>gnl|CDD|221161 pfam11661, DUF2986, Protein of unknown function (DUF2986).  This
           family of proteins has no known function.
          Length = 44

 Score = 27.6 bits (62), Expect = 3.3
 Identities = 14/36 (38%), Positives = 17/36 (47%)

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKM 979
           KKK  +K+ KK KK   K     K K   K E  K+
Sbjct: 4   KKKINQKLKKKAKKANAKLHPSNKPKYISKAERAKL 39


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 31.2 bits (71), Expect = 3.4
 Identities = 10/54 (18%), Positives = 20/54 (37%), Gaps = 4/54 (7%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK 962
           E+      K+   E+ +K     + K K+  +   K+   E+  +   K   K 
Sbjct: 525 EEQGTNFGKRNSKERYKK----NEDKIKEFASALGKEILKEEPTENSSKNILKL 574



 Score = 30.9 bits (70), Expect = 4.0
 Identities = 9/34 (26%), Positives = 17/34 (50%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           E+ G    K+  K++ KK + K K+  +   K+ 
Sbjct: 525 EEQGTNFGKRNSKERYKKNEDKIKEFASALGKEI 558



 Score = 30.1 bits (68), Expect = 8.5
 Identities = 8/31 (25%), Positives = 12/31 (38%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
                K+  K++ KK + K KE      K  
Sbjct: 528 GTNFGKRNSKERYKKNEDKIKEFASALGKEI 558


>gnl|CDD|153355 cd07671, F-BAR_PSTPIP1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Protein 1.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Proline-Serine-Threonine
           Phosphatase-Interacting Protein 1 (PSTPIP1), also known
           as CD2 Binding Protein 1 (CD2BP1), is mainly expressed
           in hematopoietic cells. It is a binding partner of the
           cell surface receptor CD2 and PTP-PEST, a tyrosine
           phosphatase which functions in cell motility and Rac1
           regulation. It also plays a role in the activation of
           the Wiskott-Aldrich syndrome protein (WASP), which
           couples actin rearrangement and T cell activation.
           Mutations in the gene encoding PSTPIP1 cause the
           autoinflammatory disorder known as PAPA (pyogenic
           sterile arthritis, pyoderma gangrenosum, and acne)
           syndrome. PSTPIP1 contains an N-terminal F-BAR domain,
           PEST motifs, and a C-terminal SH3 domain. F-BAR domains
           form banana-shaped dimers with a positively-charged
           concave surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 242

 Score = 30.7 bits (69), Expect = 3.4
 Identities = 23/93 (24%), Positives = 45/93 (48%), Gaps = 6/93 (6%)

Query: 895 FR-KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
           FR +++ +RKK +   +  +K K     + M  KK  E+  ++  +AE    +  E+   
Sbjct: 93  FRERQKEQRKKYEAVMERVQKSKVSLYKKTMESKKTYEQRCREADEAE----QTFERSSS 148

Query: 954 KKKKKKKKKKKKKKKK-KKKKKEAEKMKKKTEE 985
               K+ +K + K K+ +    EAE++ K+  E
Sbjct: 149 TGNPKQSEKSQNKAKQCRDAATEAERVYKQNIE 181


>gnl|CDD|233390 TIGR01385, TFSII, transcription elongation factor S-II.  This model
           represents eukaryotic transcription elongation factor
           S-II. This protein allows stalled RNA transcription
           complexes to perform a cleavage of the nascent RNA and
           restart at the newly generated 3-prime end.
          Length = 299

 Score = 31.0 bits (70), Expect = 3.4
 Identities = 17/89 (19%), Positives = 40/89 (44%), Gaps = 8/89 (8%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKK------KGAEKMKKKKKDAEKMKKKGAEKIGK 953
            + K+    E++ ++ K   +  K+RK       K A+K+ K  K    + K  ++  G 
Sbjct: 28  HQLKEFPPTEELLQETKVGVKVNKLRKHPNEDISKLAKKIIKSWKKV--VDKNKSDHPGG 85

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
             + K    +     K++ K +++K+++ 
Sbjct: 86  NPEDKTTVGESVNSVKQEAKSQSDKIEQP 114


>gnl|CDD|180550 PRK06382, PRK06382, threonine dehydratase; Provisional.
          Length = 406

 Score = 31.0 bits (70), Expect = 3.4
 Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 790 ISFIMYGQLENADDLVRDLCEEKDAILRRSGMYTIAMAYCGSGSNV 835
           +S I+Y +LEN   LVR  C   D   R   +Y IA A   +G N+
Sbjct: 316 MSKIIYKELENLGQLVRIECNIPD---RPGNLYRIANAIASNGGNI 358


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 31.1 bits (71), Expect = 3.4
 Identities = 25/86 (29%), Positives = 48/86 (55%), Gaps = 16/86 (18%)

Query: 918 KKEAEKMRK--KKGAEKMKKKKKD-------------AEKMKKKGAEKIGKKKKKKKKKK 962
           ++E E+M K  +  AE+ KK+K++             AEK  K+  +K+  ++K+K +K 
Sbjct: 500 EEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIEKA 559

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
             + K+  K + + E++K KTEE ++
Sbjct: 560 VAELKEALKGE-DVEEIKAKTEELQQ 584


>gnl|CDD|233626 TIGR01897, cas_MJ1666, CRISPR-associated protein, MJ1666 family.
           CRISPR is a term for Clustered, Regularly Interspaced
           Short Palidromic Repeats. A number of protein families
           appear only in association with these repeats and are
           designated Cas (CRISPR-Associated) proteins. This model
           describes a Cas protein about 400 residues in length,
           found mostly in the Archaea but also in Aquifex [Mobile
           and extrachromosomal element functions, Other].
          Length = 410

 Score = 30.8 bits (70), Expect = 3.4
 Identities = 18/84 (21%), Positives = 33/84 (39%), Gaps = 3/84 (3%)

Query: 893 ILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK---MKKKGAE 949
             F ++      +    ++   K + K+AE++   K  E  K   +  EK   +   G E
Sbjct: 326 TFFDEEYISVYIEIIKNELDNIKNRSKKAEELALIKATEIYKVSGEGFEKRNFLAHAGFE 385

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKK 973
               + KK+K K   +   K +K 
Sbjct: 386 YNVVEVKKEKDKIYVRYGDKIEKI 409


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
           factor dissociates the posttermination complex, composed
           of the ribosome, deacylated tRNA, and mRNA, after
           termination of translation.  Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.  RRF is believed to bind the ribosome at the
           A-site in a manner that mimics tRNA, but the specific
           mechanisms remain unclear.  RRF is essential for
           bacterial growth.  It is not necessary for cell growth
           in archaea or eukaryotes, but is found in mitochondria
           or chloroplasts of some eukaryotic species.
          Length = 179

 Score = 30.3 bits (69), Expect = 3.5
 Identities = 14/58 (24%), Positives = 31/58 (53%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           E+ K+  KDA+K+ ++    I   ++    K KK +K+K+  + E +K ++  ++   
Sbjct: 104 ERRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTD 161



 Score = 29.5 bits (67), Expect = 6.2
 Identities = 18/72 (25%), Positives = 36/72 (50%), Gaps = 6/72 (8%)

Query: 896 RKKRTRRKKKKGAE------KIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAE 949
           RK+  +  KK   E       IR+    K +  +  K+   +++KK ++D +K+  +  +
Sbjct: 106 RKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDEYIK 165

Query: 950 KIGKKKKKKKKK 961
           KI +  K K+K+
Sbjct: 166 KIDELLKSKEKE 177


>gnl|CDD|153357 cd07673, F-BAR_FCHO2, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of FCH domain Only 2
           protein.  F-BAR domains are dimerization modules that
           bind and bend membranes and are found in proteins
           involved in membrane dynamics and actin reorganization.
           The specific function of FCH domain Only 2 (FCHO2) is
           still unknown. It contains an N-terminal F-BAR domain
           and a C-terminal domain of unknown function named SAFF
           which is also present in FCHO1 and endophilin
           interacting protein 1. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
          Length = 269

 Score = 30.8 bits (69), Expect = 3.5
 Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 19/102 (18%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRK------------KKGAEKMKKKKKDAEKMKKKGA--- 948
           +K G E+++  KK K+E     +            +K  E    K  + E++KK+GA   
Sbjct: 98  QKYGEEQVKSHKKTKEEVAGTLEAVQNIQSITQALQKSKENYNAKCLEQERLKKEGATQR 157

Query: 949 --EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
             EK   K KK  +  K   +K    K + E  +K TE A+K
Sbjct: 158 EIEKAAVKSKKATESYKLYVEKYALAKADFE--QKMTETAQK 197


>gnl|CDD|237803 PRK14724, PRK14724, DNA topoisomerase III; Provisional.
          Length = 987

 Score = 31.1 bits (70), Expect = 3.5
 Identities = 24/88 (27%), Positives = 28/88 (31%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
           F   RTRR  K      ++  K   E E    K    K    K  A      G   +   
Sbjct: 821 FVSMRTRRAFKAFLAWDKEAGKVNFEFEPRESKFPPRKTAAAKAGAASAAFGGTVAVKAA 880

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           K  KK   KK   K    K   +  KKK
Sbjct: 881 KPAKKAAAKKVAAKTAAAKTPRKAAKKK 908


>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
           transcription initiation/nucleotide excision repair
           factor TFIIH, subunit TFB3 [Cell division and chromosome
           partitioning / Transcription / DNA replication,
           recombination, and repair].
          Length = 314

 Score = 30.8 bits (69), Expect = 3.5
 Identities = 16/71 (22%), Positives = 32/71 (45%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
               +    K K  E+M +        +       GK+K   +K+ K KK++ +++ +E 
Sbjct: 123 LIDVSLTEEKVKKYEEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLERQIEEE 182

Query: 977 EKMKKKTEEAE 987
           E+M K+  + E
Sbjct: 183 ERMNKEWTKKE 193


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 27.7 bits (62), Expect = 3.5
 Identities = 12/35 (34%), Positives = 23/35 (65%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           +K+ +  KK ++K+K  + +K+EAE  +K+ EE  
Sbjct: 8   EKRAEFDKKLEEKEKALEAEKEEAEARQKEEEEEA 42


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
           TopJ.  This model describes TopJ (MG_200, CbpA), a DnaJ
           homolog and probable assembly protein of the Mycoplasma
           terminal organelle. The terminal organelle is involved
           in both cytadherence and gliding motility [Cellular
           processes, Chemotaxis and motility].
          Length = 871

 Score = 31.3 bits (70), Expect = 3.6
 Identities = 13/33 (39%), Positives = 16/33 (48%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
              KKKKKKKKK     K K+  K  +  E  +
Sbjct: 105 TPSKKKKKKKKKGWFWAKSKQESKTIETEEIID 137


>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
          Length = 64

 Score = 27.9 bits (63), Expect = 3.6
 Identities = 12/39 (30%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 945 KKGAEKIGKKKKKKKKKK-KKKKKKKKKKKKEAEKMKKK 982
           K+  EK G  ++ ++++  +K  +K+K+KK  A K ++K
Sbjct: 20  KRKVEKAGILRELRRREFYEKPSEKRKRKKAAARKRRRK 58



 Score = 27.5 bits (62), Expect = 5.0
 Identities = 8/20 (40%), Positives = 15/20 (75%)

Query: 956 KKKKKKKKKKKKKKKKKKKE 975
           K+K+KK   +K+++KK  +E
Sbjct: 44  KRKRKKAAARKRRRKKLARE 63


>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
          Length = 211

 Score = 30.2 bits (68), Expect = 3.6
 Identities = 19/65 (29%), Positives = 26/65 (40%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           +   T+R K KG  K RK  K    AE   +K   E++    KD +K   KG  +     
Sbjct: 51  KLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVEREELGNPCKDLDKDIIKGKRRPRLTL 110

Query: 956 KKKKK 960
           K    
Sbjct: 111 KNTNN 115



 Score = 29.4 bits (66), Expect = 6.4
 Identities = 10/51 (19%), Positives = 18/51 (35%)

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
            + +K   +       K    K+ K K   K +K  K     +   E+ E+
Sbjct: 35  PNRKKSAPEQVPTGKNKLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVER 85


>gnl|CDD|237512 PRK13800, PRK13800, putative oxidoreductase/HEAT repeat-containing
           protein; Provisional.
          Length = 897

 Score = 31.0 bits (70), Expect = 3.7
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 4/90 (4%)

Query: 409 DLVRDLCEEKDAILRRSGMYTIAMAYCVSYLYFDLFSDLTYRIVKNRSGMYTIAMAYCGS 468
           D VR L  + D ++R + +  +A   C         + L     + R G    A A  G+
Sbjct: 778 DAVRALTGDPDPLVRAAALAALAELGCPPDDVAAATAALRASAWQVRQGA---ARALAGA 834

Query: 469 GSNVATKRLLHVAVSDVNDDVRREAVTALG 498
            ++VA    L  A++D + DVR+ AV AL 
Sbjct: 835 AADVAVP-ALVEALTDPHLDVRKAAVLALT 863


>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
           Provisional.
          Length = 134

 Score = 29.5 bits (66), Expect = 3.8
 Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 3/68 (4%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG---AEKIGKKKKKKKKKKKKK 965
           EK R   K+++E E+ ++++ AE+ +K     E MK  G    E +G       +  KK+
Sbjct: 31  EKFRVVTKERREEEEQQQRELAERQEKINTWLELMKADGINPEELLGNSSAAAPRAGKKR 90

Query: 966 KKKKKKKK 973
           + +  K +
Sbjct: 91  QPRPAKYR 98


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 28.8 bits (65), Expect = 4.1
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
           E   K K+KK+++++ K++  ++ KKE E  +     + 
Sbjct: 47  EVTDKVKEKKEEREEDKEELIERIKKEEETFEDVDLGSA 85


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 31.0 bits (70), Expect = 4.1
 Identities = 13/75 (17%), Positives = 26/75 (34%), Gaps = 7/75 (9%)

Query: 920 EAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK-------KKKK 972
             E + + K     K+K  + +          GKK   K KK++K KK          ++
Sbjct: 112 LPESLSETKQVTVSKRKGSELDSDSDSSDSSKGKKLVNKPKKRQKYKKATIQSPTSLTEE 171

Query: 973 KKEAEKMKKKTEEAE 987
           +    ++       +
Sbjct: 172 ENYNTEICTVAPTDQ 186


>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family. 
          Length = 296

 Score = 30.6 bits (68), Expect = 4.1
 Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 1/46 (2%)

Query: 139 ATHVVVKVPVEGAPT-PATPDTEAPTPAPGTESGPSSTTPGPETKT 183
           A+    K P E  P  PA+P       A  TE      TP P ++T
Sbjct: 87  ASTEPDKEPAEAEPAEPASPAEAEGEAATSTEKAEDGATPSPSSET 132


>gnl|CDD|185404 PTZ00028, PTZ00028, 40S ribosomal protein S6e; Provisional.
          Length = 218

 Score = 30.2 bits (68), Expect = 4.2
 Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 5/91 (5%)

Query: 898 KRTRRKKKKGAEKIRK--KKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK-KGAEKIGKK 954
           +R RR   K A KIRK     K  +  K   ++  E   K K  A K+++    +++ +K
Sbjct: 128 ERPRRLGPKRASKIRKLFNLSKSDDVRKFVVRRKIEGKNKTK--APKIQRLVTPQRLQRK 185

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           ++ +K  KKK  + +++ K+  + +++   E
Sbjct: 186 RRYRKTLKKKVLQSREEAKEYQKLLEEYKNE 216


>gnl|CDD|205286 pfam13105, DUF3959, Protein of unknown function (DUF3959).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 260 amino acids in length.
          Length = 242

 Score = 30.2 bits (68), Expect = 4.2
 Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 7/75 (9%)

Query: 912 RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK--KKKKKKKKKKKKKKK 969
           RK+K   KE   + K+K     KK+KK + K++     K+  K  K  +K    K  +K 
Sbjct: 152 RKEKDDNKENINLFKQK-----KKRKKLSFKIRLPRLPKLKMKLFKFGRKPSNSKDPRKY 206

Query: 970 KKKKKEAEKMKKKTE 984
           +   +E   M   TE
Sbjct: 207 EHNYEENVPMYHMTE 221


>gnl|CDD|238146 cd00238, ERp29c, ERp29 and ERp38, C-terminal domain; composed of
           the protein disulfide isomerase (PDI)-like proteins
           ERp29 and ERp38. ERp29 (also called ERp28) is a
           ubiquitous endoplasmic reticulum (ER)-resident protein
           expressed in high levels in secretory cells. It contains
           a redox inactive TRX-like domain at the N-terminus. The
           expression profile of ERp29 suggests a role in secretory
           protein production, distinct from that of PDI. It has
           also been identified as a member of the thyroglobulin
           folding complex and is essential in regulating the
           secretion of thyroglobulin. The Drosophila homolog,
           Wind, is the product of windbeutel, an essential gene in
           the development of dorsal-ventral patterning. Wind is
           required for correct targeting of Pipe, a Golgi-resident
           type II transmembrane protein with homology to
           2-O-sulfotransferase. ERp38 is a P5-like protein, first
           isolated from alfalfa (the cDNA clone was named G1),
           which contains two redox active TRX domains at the
           N-terminus, like human P5. However, unlike human P5,
           ERp38 also contains a C-terminal domain with homology to
           the C-terminal domain of ERp29. It may be a
           glucose-regulated protein. The function of the
           all-helical C-terminal domain of ERp29 and ERp38 remains
           unclear. The C-terminal domain of Wind is thought to
           provide a distinct site required for interaction with
           its substrate, Pipe.
          Length = 93

 Score = 28.8 bits (65), Expect = 4.2
 Identities = 15/66 (22%), Positives = 32/66 (48%), Gaps = 1/66 (1%)

Query: 907 GAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKK 966
             E+    +K K+  EK+++ +     K   K  EK+ +KG + + K+  + ++  +KK 
Sbjct: 16  DEERKELLEKVKEAVEKLKEAEAKYA-KYYVKVMEKILEKGEDYVEKELARLERLLEKKG 74

Query: 967 KKKKKK 972
              +K 
Sbjct: 75  LAPEKA 80


>gnl|CDD|222855 PHA02529, O, capsid-scaffolding protein; Provisional.
          Length = 278

 Score = 30.5 bits (69), Expect = 4.3
 Identities = 14/81 (17%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKK 967
            EK++K   +KKE + M  ++ A+  +  +  AEK+    A ++   +  +  ++ K  +
Sbjct: 175 FEKVKKLFSRKKEDDAMTDEQFADVHEAVEGVAEKIDHTSA-QVATTETAEPPQEGKDTE 233

Query: 968 KKKKKKKEAEKMKKKTEEAEK 988
           +  +  ++   +K+  ++   
Sbjct: 234 ENGETAEQFSALKETLDKLAS 254


>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein.  This family
           consists of the p50 and variable adherence-associated
           antigen (Vaa) adhesins from Mycoplasma hominis. M.
           hominis is a mycoplasma associated with human urogenital
           diseases, pneumonia, and septic arthritis. An adhesin is
           a cell surface molecule that mediates adhesion to other
           cells or to the surrounding surface or substrate. The
           Vaa antigen is a 50-kDa surface lipoprotein that has
           four tandem repetitive DNA sequences encoding a periodic
           peptide structure, and is highly immunogenic in the
           human host. p50 is also a 50-kDa lipoprotein, having
           three repeats A,B and C, that may be a tetramer of
           191-kDa in its native environment.
          Length = 353

 Score = 30.4 bits (68), Expect = 4.3
 Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 5/76 (6%)

Query: 918 KKEAEKMRK-KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE- 975
           K E  ++    KG E++KK   +  +      + I + +KK K   K  K++ KK K E 
Sbjct: 227 KNELTEINSIIKGVEELKKLSHEISEFSNSVKKTISELEKKFKIDDKTNKEEAKKFKNEL 286

Query: 976 ---AEKMKKKTEEAEK 988
              A+++  K+ E +K
Sbjct: 287 ENFADQLLNKSHEIDK 302


>gnl|CDD|129586 TIGR00495, crvDNA_42K, 42K curved DNA binding protein.  Proteins
           identified by This model have been identified in a
           number of species as a nuclear (but not nucleolar)
           protein with a cell cycle dependence. Various names
           given to members of this family have included cell cycle
           protein p38-2G4, DNA-binding protein GBP16, and
           proliferation-associated protein 1. This protein is
           closely related to methionine aminopeptidase, a
           cobolt-binding protein [Unknown function, General].
          Length = 389

 Score = 30.6 bits (69), Expect = 4.3
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 955 KKKKKKKKKKKKKKKKKKKKEAEKMK 980
           K+KK+KKK KK  K  +   E E M+
Sbjct: 364 KRKKQKKKAKKASKTGEAATEGETME 389


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 30.7 bits (70), Expect = 4.3
 Identities = 17/85 (20%), Positives = 35/85 (41%), Gaps = 6/85 (7%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKG---AEKIGKKKKKK 958
           RK +    K+ +   K +           EK K   K  E++ K      EKI K+ K+ 
Sbjct: 46  RKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKEL 105

Query: 959 KKKKKK---KKKKKKKKKKEAEKMK 980
           +++  +   + K+ +++ +  E   
Sbjct: 106 EEEISELENEIKELEQEIERLEPWG 130



 Score = 30.7 bits (70), Expect = 4.8
 Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 13/69 (18%)

Query: 522 SLLDDPMIELKEFSLKKLNDIVDDFWPENEKE--------KQLQARITKLSKILS----- 568
           + L +   ++   SL++L   V++   + EKE         +L+  I +L + +      
Sbjct: 70  NPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPW 129

Query: 569 GEVSIDLHL 577
           G   +DL L
Sbjct: 130 GNFDLDLSL 138


>gnl|CDD|219401 pfam07404, TEBP_beta, Telomere-binding protein beta subunit (TEBP
           beta).  This family consists of several telomere-binding
           protein beta subunits which appear to be specific to the
           family Oxytrichidae. Telomeres are specialised
           protein-DNA complexes that compose the ends of
           eukaryotic chromosomes. Telomeres protect chromosome
           termini from degradation and recombination and act
           together with telomerase to ensure complete genome
           replication. TEBP beta forms a complex with TEBP alpha
           and this complex is able to recognise and bind ssDNA to
           form a sequence-specific, telomeric nucleoprotein
           complex that caps the very 3' ends of chromosomes.
          Length = 375

 Score = 30.5 bits (68), Expect = 4.4
 Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 1/63 (1%)

Query: 926 KKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           ++KG + + K     +  K KG  K GK K   K  K KK   KK     +  ++K  ++
Sbjct: 222 QEKGKDALNKAADHTDVAKVKGGAK-GKGKAAAKSAKGKKLSAKKGDSASSADVRKSVDK 280

Query: 986 AEK 988
             K
Sbjct: 281 IVK 283


>gnl|CDD|153294 cd07610, FCH_F-BAR, The Extended FES-CIP4 Homology (FCH) or F-BAR
           (FCH and Bin/Amphiphysin/Rvs) domain, a dimerization
           module that binds and bends membranes.  F-BAR domains
           are dimerization modules that bind and bend membranes
           and are found in proteins involved in membrane dynamics
           and actin reorganization. F-BAR domain containing
           proteins, also known as Pombe Cdc15 homology (PCH)
           family proteins, include Fes and Fer tyrosine kinases,
           PACSINs/Syndapins, FCHO, PSTPIP, CIP4-like proteins and
           srGAPs. Many members also contain an SH3 domain and play
           roles in endocytosis. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           These tubules have diameters larger than those observed
           with N-BARs. The F-BAR domains of some members such as
           NOSTRIN and Rgd1 are important for the subcellular
           localization of the protein.
          Length = 191

 Score = 30.0 bits (68), Expect = 4.5
 Identities = 30/118 (25%), Positives = 54/118 (45%), Gaps = 21/118 (17%)

Query: 890 LGFILFRKKRTRRKKK--------KGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAE 941
           LG  L +  R   KK+        K  +K+ KK  KK E+ K         ++++ + A 
Sbjct: 11  LGLDLLKDLREFLKKRAAIEEEYAKNLQKLAKKFSKKPESGKTSLGTSWNSLREETESAA 70

Query: 942 KMKKKGAEKIGKK--------KKKKKKKKKK-----KKKKKKKKKKEAEKMKKKTEEA 986
            + ++ +EK+ +         K+ K++ +KK     +K KKK ++  A+  KK  EE 
Sbjct: 71  TVHEELSEKLSQLIREPLEKVKEDKEQARKKELAEGEKLKKKLQELWAKLAKKADEEY 128


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 31.0 bits (70), Expect = 4.5
 Identities = 12/48 (25%), Positives = 18/48 (37%)

Query: 140 THVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLESL 187
           T VV+   V    T  +  +   T +P   S  ++TT  P   T    
Sbjct: 802 TSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGG 849


>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal.  The yeast member
           of this family (Kri1p) is found to be required for 40S
           ribosome biogenesis in the nucleolus. This is the
           C-terminal domain of the family.
          Length = 93

 Score = 28.7 bits (65), Expect = 4.5
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKE 975
           K++++K KKK  KK + ++ KK+
Sbjct: 69  KEERRKDKKKYGKKARLREWKKK 91



 Score = 27.9 bits (63), Expect = 6.6
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 960 KKKKKKKKKKKKKKKEAEKMKKKT 983
           K++++K KKK  KK    + KKK 
Sbjct: 69  KEERRKDKKKYGKKARLREWKKKV 92


>gnl|CDD|185443 PTZ00097, PTZ00097, 60S ribosomal protein L19; Provisional.
          Length = 175

 Score = 29.7 bits (67), Expect = 4.6
 Identities = 19/74 (25%), Positives = 36/74 (48%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           +R+  +K + A+K+ +    E   K K +  K K+   E I K K +K K+KK + + + 
Sbjct: 100 LRRLLRKYRAAKKIDRHMYHEFYLKSKGNQFKNKRVLIEAIHKTKNEKVKEKKIQDQLEA 159

Query: 971 KKKKEAEKMKKKTE 984
           ++ K      K+  
Sbjct: 160 RRAKAKALRNKRKA 173


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 29.8 bits (67), Expect = 4.6
 Identities = 13/71 (18%), Positives = 31/71 (43%), Gaps = 2/71 (2%)

Query: 918 KKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
               E+M ++   +  K  K +A+    K   ++ K +   +  +K +++  K   K  E
Sbjct: 121 MSIVEEMLEENAKKGGKIDKMEADSTDLK--ARLRKAQILLEGLQKNQEELFKLLDKYNE 178

Query: 978 KMKKKTEEAEK 988
             ++  +E+ K
Sbjct: 179 LREQVQKESSK 189


>gnl|CDD|130217 TIGR01147, V_ATP_synt_G, vacuolar ATP synthase, subunit G.  This
           model describes the vacuolar ATP synthase G subunit in
           eukaryotes and includes members from diverse groups
           e.g., fungi, plants, parasites etc. V-ATPases are
           multi-subunit enzymes composed of two functional
           domains: A transmembrane Vo domain and a peripheral
           catalytic domain V1. The G subunit is one of the
           subunits of the catalytic domain. V-ATPases are
           responsible for the acidification of endosomes and
           lysosomes, which are part of the central vacuolar system
           [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 113

 Score = 29.0 bits (65), Expect = 4.7
 Identities = 20/67 (29%), Positives = 36/67 (53%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKK 955
           RK++T+R K+   E  ++ +K K++ EK  K+  A+ +       EK + +   KI + K
Sbjct: 26  RKRKTKRLKQAKEEAQKEVEKYKQQREKEFKEFEAKHLGGNGAAEEKAEAETQAKIREIK 85

Query: 956 KKKKKKK 962
           K  +K K
Sbjct: 86  KAVQKNK 92



 Score = 28.3 bits (63), Expect = 8.3
 Identities = 21/78 (26%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 904 KKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEK-MKKKGAEKIGKKKKKKKKKK 962
           +K+ AEK+ + +K+K +  K  K++  ++++K K+  EK  K+  A+ +G     ++K +
Sbjct: 15  EKRAAEKVSEARKRKTKRLKQAKEEAQKEVEKYKQQREKEFKEFEAKHLGGNGAAEEKAE 74

Query: 963 KKKKKKKKKKKKEAEKMK 980
            + + K ++ KK  +K K
Sbjct: 75  AETQAKIREIKKAVQKNK 92


>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
           (DMAP1).  DNA methylation can contribute to
           transcriptional silencing through several
           transcriptionally repressive complexes, which include
           methyl-CpG binding domain proteins (MBDs) and histone
           deacetylases (HDACs). The chief enzyme that maintains
           mammalian DNA methylation, DNMT1, can also establish a
           repressive transcription complex. The non-catalytic
           amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
           DNMT1 associated protein), and can mediate
           transcriptional repression. DMAP1 has intrinsic
           transcription repressive activity, and binds to the
           transcriptional co-repressor TSG101. DMAP1 is targeted
           to replication foci through interaction with the far N
           terminus of DNMT1 throughout S phase, whereas HDAC2
           joins DNMT1 and DMAP1 only during late S phase,
           providing a platform for how histones may become
           deacetylated in heterochromatin following replication.
          Length = 175

 Score = 29.8 bits (67), Expect = 4.8
 Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 2/62 (3%)

Query: 917 KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEA 976
           +K EA K  ++K A+ ++K    A++     +    + +K++KK  KKK  +K    KE 
Sbjct: 10  RKIEARKKEREKKAQDLQKLITAADQQAAGFST--AEARKREKKLPKKKIPQKPGPSKED 67

Query: 977 EK 978
            K
Sbjct: 68  SK 69


>gnl|CDD|225651 COG3109, ProQ, Activator of osmoprotectant transporter ProP [Signal
           transduction mechanisms].
          Length = 208

 Score = 29.8 bits (67), Expect = 4.9
 Identities = 12/51 (23%), Positives = 25/51 (49%)

Query: 896 RKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKK 946
           RK+    K +  A++  ++ KK++EA    +K  AE+     +   K  ++
Sbjct: 100 RKQLAEAKARVQAQRAEQQAKKREEAPAAGEKPTAERPATAARPKPKAPRE 150


>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
           This family consists of the N-terminal region of the
           prokaryotic fibronectin-binding protein. Fibronectin
           binding is considered to be an important virulence
           factor in streptococcal infections. Fibronectin is a
           dimeric glycoprotein that is present in a soluble form
           in plasma and extracellular fluids; it is also present
           in a fibrillar form on cell surfaces. Both the soluble
           and cellular forms of fibronectin may be incorporated
           into the extracellular tissue matrix. While fibronectin
           has critical roles in eukaryotic cellular processes,
           such as adhesion, migration and differentiation, it is
           also a substrate for the attachment of bacteria. The
           binding of pathogenic Streptococcus pyogenes and
           Staphylococcus aureus to epithelial cells via
           fibronectin facilitates their internalisation and
           systemic spread within the host.
          Length = 447

 Score = 30.3 bits (69), Expect = 5.0
 Identities = 19/73 (26%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 894 LFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGK 953
            ++K +  ++  +  E+    ++ K+E E +   +   +  +  +D E+++++  E+   
Sbjct: 377 YYKKYKKLKRAVEAVEEQ--IEETKEEIEYLESVEAQLENAESLEDLEEIREELIEQGYL 434

Query: 954 KKKKKKKKKKKKK 966
           K+KK+KKKKKKKK
Sbjct: 435 KEKKRKKKKKKKK 447


>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain.  FH proteins control
           rearrangements of the actin cytoskeleton, especially in
           the context of cytokinesis and cell polarisation.
           Members of this family have been found to interact with
           Rho-GTPases, profilin and other actin-assoziated
           proteins. These interactions are mediated by the
           proline-rich FH1 domain, usually located in front of FH2
           (but not listed in SMART). Despite this cytosolic
           function, vertebrate formins have been assigned
           functions within the nucleus. A set of Formin-Binding
           Proteins (FBPs) has been shown to bind FH1 with their WW
           domain.
          Length = 392

 Score = 30.4 bits (69), Expect = 5.0
 Identities = 11/77 (14%), Positives = 31/77 (40%), Gaps = 6/77 (7%)

Query: 892 FILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
           F  F +      K        ++  KK+E E+ R+KK  ++  + ++ + + K++     
Sbjct: 320 FKDFNEFLKEFSKA------AEENIKKEEEEEERRKKLVKETTEYEQSSSRQKERNPSMD 373

Query: 952 GKKKKKKKKKKKKKKKK 968
            + ++          ++
Sbjct: 374 FEVERDFLGVLDSLLEE 390


>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
           carrier protein subunit; Validated.
          Length = 153

 Score = 29.4 bits (66), Expect = 5.0
 Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 3/65 (4%)

Query: 134 APIPDATHVVVKVP---VEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLESLNEK 190
           + + +        P   V  APTP  P   AP PA   E+  ++  PG   + L    ++
Sbjct: 46  SAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENVVTAPMPGKILRILVREGQQ 105

Query: 191 EKQLQ 195
            K  Q
Sbjct: 106 VKVGQ 110


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 30.4 bits (69), Expect = 5.1
 Identities = 13/63 (20%), Positives = 34/63 (53%), Gaps = 6/63 (9%)

Query: 915 KKKKKEAEKMRKK-----KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           ++++ E + +RK+     K   K+K  K+DA ++  +  +++ K+  +K+ + ++ K   
Sbjct: 43  RQRQFELDSLRKEFNKLNKEVAKLKIAKEDATELIAE-TKELKKEITEKEAEVQEAKAAL 101

Query: 970 KKK 972
             K
Sbjct: 102 DAK 104


>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family.  Members of this family are
           short proteins that are rich in aspartate, glutamate,
           lysine and arginine. Although the function of these
           proteins is unknown, they are found to be ubiquitously
           expressed.
          Length = 38

 Score = 26.8 bits (60), Expect = 5.1
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 947 GAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
           G ++   +++  KK  KKKKK  K + K A K  
Sbjct: 4   GNQREQARERNAKKAAKKKKKGAKSQLKAAAKAL 37


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 30.5 bits (68), Expect = 5.1
 Identities = 17/57 (29%), Positives = 21/57 (36%), Gaps = 1/57 (1%)

Query: 123 LASVLQELKTCAPIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGP 179
           + S L +L     +P        VP   AP PA     AP PA  T + P      P
Sbjct: 86  VGSFLAQLPAMPSMPGLLGAAAPVP-APAPAPAAAPPAAPAPAADTPAAPIPDAVQP 141


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
           Provisional.
          Length = 418

 Score = 30.4 bits (69), Expect = 5.1
 Identities = 11/57 (19%), Positives = 23/57 (40%), Gaps = 2/57 (3%)

Query: 147 PVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLESLNEKEKQLQARITKLSK 203
           P   APTP  P    PTP    +    +    P    +   + +E ++   ++++ +
Sbjct: 145 PKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARADPRETRV--PMSRMRQ 199


>gnl|CDD|191382 pfam05837, CENP-H, Centromere protein H (CENP-H).  This family
           consists of several eukaryotic centromere protein H
           (CENP-H) sequences. Macromolecular
           centromere-kinetochore complex plays a critical role in
           sister chromatid separation, but its complete protein
           composition as well as its precise dynamic function
           during mitosis has not yet been clearly determined.
           CENP-H contains a coiled-coil structure and a nuclear
           localisation signal. CENP-H is specifically and
           constitutively localised in kinetochores throughout the
           cell cycle. CENP-H may play a role in kinetochore
           organisation and function throughout the cell cycle.
           This the C-terminus of the region, which is conserved
           from fungi to humans.
          Length = 106

 Score = 28.8 bits (65), Expect = 5.1
 Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 6/58 (10%)

Query: 909 EKIRKKKK--KKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
             + K++   K+K  E   +    E  KKK+   E M+ K  E++ K +   KK K K
Sbjct: 20  SDLEKERLQLKQKNVELALELL--ELTKKKESWREDMELK--EQLEKLEADLKKSKAK 73


>gnl|CDD|220764 pfam10454, DUF2458, Protein of unknown function (DUF2458).  This a
           is family of uncharacterized proteins.
          Length = 155

 Score = 29.5 bits (66), Expect = 5.1
 Identities = 9/48 (18%), Positives = 21/48 (43%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
            ++  ++   E+   K  E + +K++ +K+ KKK        K  +  
Sbjct: 30  RRLIIEQHRHERQWWKEREALIQKQEARKEDKKKLLPNGTDSKLSSVG 77


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
           family consists of several special lobe-specific silk
           protein SSP160 sequences which appear to be specific to
           Chironomus (Midge) species.
          Length = 758

 Score = 30.5 bits (68), Expect = 5.2
 Identities = 15/42 (35%), Positives = 22/42 (52%)

Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKK 969
           K  EK  K+K   EKM+ K    +G + K   KK+ + + KK
Sbjct: 379 KSGEKKVKRKARLEKMRAKCRRAVGNRMKGSMKKRVRSRAKK 420


>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats.  This family includes multiple
           HEAT repeats.
          Length = 88

 Score = 28.1 bits (63), Expect = 5.3
 Identities = 25/95 (26%), Positives = 33/95 (34%), Gaps = 13/95 (13%)

Query: 802 DDLVRDLCEEKDAILRRSGMYTIAMAYCGSGSNVATKRLLH-----DAILRRSGMYTIAM 856
           + L+  L  + D  +R       A A    G   A   LL      D  +RR      A 
Sbjct: 2   EALLEALLSDPDPEVRA----AAARALGELGDPEALPALLELLKDPDPEVRR----AAAE 53

Query: 857 AYCGSGSNVATKRLLHVAVSDVNDDVRREAVTALG 891
           A    G   A   LL +   D +  VR  A +AL 
Sbjct: 54  ALGKLGDPEALPALLELLQDDDDAVVRAAAASALA 88


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
           protein.  This domain family is found in eukaryotes, and
           is typically between 212 and 238 amino acids in length.
           The family is found in association with pfam01805. There
           are two completely conserved residues (W and H) that may
           be functionally important. PRP21 is required for
           assembly of the prespliceosome and it interacts with U2
           snRNP and/or pre-mRNA in the prespliceosome. This family
           also contains proteins similar to PRP21, such as the
           mammalian SF3a. SF3a also interacts with U2 snRNP from
           the prespliceosome, converting it to its active form.
          Length = 230

 Score = 30.1 bits (68), Expect = 5.4
 Identities = 20/84 (23%), Positives = 33/84 (39%), Gaps = 6/84 (7%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK------KGAEKMKKKKKDAEKMKKKGAEK 950
             R+  +K K  E+       ++E E   ++      K  E  KK     ++MK   A +
Sbjct: 85  IYRSLEQKSKALEEPDAAAAPEEEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGE 144

Query: 951 IGKKKKKKKKKKKKKKKKKKKKKK 974
              K +K       K + +K KKK
Sbjct: 145 SRLKIRKDYVPSGNKPRAEKPKKK 168


>gnl|CDD|227313 COG4980, GvpP, Gas vesicle protein [General function prediction
           only].
          Length = 115

 Score = 28.9 bits (65), Expect = 5.5
 Identities = 26/103 (25%), Positives = 45/103 (43%), Gaps = 10/103 (9%)

Query: 886 AVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKK 945
           A  AL F     K  R+K KK  + + +  + K     M   K        K+ AE +K 
Sbjct: 19  AAAALLFAPKSGKELRKKLKKSGDALFELAEDKGTDILMIADK------LSKESAETLKD 72

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +G    G+ K+  KK K+  + + ++ K E E +++   +  K
Sbjct: 73  QG----GEIKESVKKWKEDIQPEIERLKSEIEDLQEAISDETK 111


>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
           beta and delta phosphatidylinositol 3-kinases (PI3Ks).
           PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
           regulate cell processes such as cell growth,
           differentiation, proliferation, and motility.  PI3Ks
           work on phosphorylation of phosphatidylinositol,
           phosphatidylinositide (4)P (PtdIns (4)P),2 or
           PtdIns(4,5)P2. Specifically they phosphorylate the D3
           hydroxyl group of phosphoinositol lipids on the inositol
           ring. There are 3 classes of PI3Ks based on structure,
           regulation, and specificity. All classes contain a C2
           domain, a PIK domain, and a kinase catalytic domain.
           The members here are class I, beta and delta isoforms of
           PI3Ks and contain both a Ras-binding domain and a
           p85-binding domain.  Class II PI3Ks contain both of
           these as well as a PX domain, and a C-terminal C2 domain
           containing a nuclear localization signal.  C2 domains
           fold into an 8-standed beta-sandwich that can adopt 2
           structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions.  Members have a type-I
           topology.
          Length = 173

 Score = 29.6 bits (67), Expect = 5.5
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 952 GKKKKKKKKKKKKKKKKKKKKK 973
              KK K K+ +K + KKKKKK
Sbjct: 84  EVSKKAKGKRSRKNQTKKKKKK 105



 Score = 29.6 bits (67), Expect = 5.9
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 951 IGKKKKKKKKKKKKKKKKKKK 971
           + KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105



 Score = 28.8 bits (65), Expect = 8.2
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 954 KKKKKKKKKKKKKKKKKKKKK 974
             KK K K+ +K + KKKKKK
Sbjct: 85  VSKKAKGKRSRKNQTKKKKKK 105


>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
           production and conversion].
          Length = 194

 Score = 29.6 bits (67), Expect = 5.5
 Identities = 12/80 (15%), Positives = 40/80 (50%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK 968
           +KI ++ +++ E      ++ AEK+K++ K   +   +   +  +K+ ++++++      
Sbjct: 9   KKILREAEEEAEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKEAERERQRIISSAL 68

Query: 969 KKKKKKEAEKMKKKTEEAEK 988
            + ++K  E  ++  E   +
Sbjct: 69  LEARRKLLEAKEEILESVFE 88


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 30.1 bits (68), Expect = 5.6
 Identities = 13/81 (16%), Positives = 47/81 (58%), Gaps = 2/81 (2%)

Query: 909 EKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMK--KKGAEKIGKKKKKKKKKKKKKK 966
           +++ ++ ++ +E  +  +K+  E  +   +  EK+K  ++ A+ + KK  + +++ ++ +
Sbjct: 8   QELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLE 67

Query: 967 KKKKKKKKEAEKMKKKTEEAE 987
           ++    ++E E+++ + +EA 
Sbjct: 68  EEAAASEEERERLEAEVDEAT 88


>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein.  The PcfJ-like protein family
           includes the E. faecalis PcfJ protein, which is
           functionally uncharacterized. It is found in bacteria
           and viruses, and is typically between 159 and 170 amino
           acids in length. There is a conserved HCV sequence
           motif.
          Length = 169

 Score = 29.6 bits (67), Expect = 5.6
 Identities = 10/32 (31%), Positives = 20/32 (62%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
            + KK+K+K +K +K+  + +K   ++K K E
Sbjct: 38  YQAKKEKEKAEKMRKRYPELEKIFRELKSKFE 69


>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
           peptidyl-prolyl isomerase.  This family is only found at
           the amino terminus of pfam00254. This domain is of
           unknown function.
          Length = 124

 Score = 28.7 bits (65), Expect = 5.7
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKK 958
           +  +KK  A++  K +K AE+ K  G   + +  KK+
Sbjct: 75  QAFQKKLQAKQQAKAEKKAEENKAAGEAFLAENAKKE 111


>gnl|CDD|212012 cd11622, HR1_PKN_1, First Protein kinase C-related kinase homology
           region 1 (HR1) Rho-binding domain of Protein Kinase N.
           PKN, also called Protein-kinase C-related kinase (PRK),
           is a serine/threonine protein kinase that can be
           activated by the small GTPase Rho, and by fatty acids
           such as arachidonic and linoleic acids. It is involved
           in many biological processes including cytoskeletal
           regulation, cell adhesion, vesicle transport, glucose
           transport, regulation of meiotic maturation and
           embryonic cell cycles, signaling to the nucleus, and
           tumorigenesis. In some vertebrates, there are three PKN
           isoforms from different genes (designated PKN1, PKN2,
           and PKN3), which show different enzymatic properties,
           tissue distribution, and varied functions. PKN proteins
           contain three HR1 domains, a C2 domain, and a kinase
           domain. This model characterizes the first HR1 domain of
           PKN. HR1 domains are anti-parallel coiled-coil (ACC)
           domains that bind small GTPases from the Rho family.
          Length = 66

 Score = 27.6 bits (62), Expect = 5.7
 Identities = 16/41 (39%), Positives = 21/41 (51%), Gaps = 3/41 (7%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
           IRK+ K K+ AE +RK   A   KK     E + KK   K+
Sbjct: 15  IRKELKIKEGAENLRK---ATTDKKSLAHVESILKKSNRKL 52


>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
           function prediction only].
          Length = 386

 Score = 30.2 bits (68), Expect = 5.8
 Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 5/39 (12%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAE 977
           DA   K +       +K+K  +K    K KK  KK +AE
Sbjct: 348 DAYPKKAEA-----ARKRKGDRKGVSHKAKKGGKKNQAE 381


>gnl|CDD|216061 pfam00687, Ribosomal_L1, Ribosomal protein L1p/L10e family.  This
           family includes prokaryotic L1 and eukaryotic L10.
          Length = 200

 Score = 29.5 bits (67), Expect = 5.9
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           K  +  A++ K  GA+ +G +  K+K K KK +K  K 
Sbjct: 54  KDPEAKAKEAKDAGADVVGGEDLKEKYKIKKGRKLAKD 91


>gnl|CDD|153370 cd07686, F-BAR_Fer, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Fer (Fes related)
           tyrosine kinase.  F-BAR domains are dimerization modules
           that bind and bend membranes and are found in proteins
           involved in membrane dynamics and actin reorganization.
           Fer (Fes related) is a cytoplasmic (or nonreceptor)
           tyrosine kinase expressed in a wide variety of tissues,
           and is found to reside in both the cytoplasm and the
           nucleus. It plays important roles in neuronal
           polarization and neurite development, cytoskeletal
           reorganization, cell migration, growth factor signaling,
           and the regulation of cell-cell interactions mediated by
           adherens junctions and focal adhesions. Fer kinase also
           regulates cell cycle progression in malignant cells. It
           contains an N-terminal F-BAR domain, an SH2 domain, and
           a C-terminal catalytic kinase domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 234

 Score = 29.6 bits (66), Expect = 5.9
 Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 5/76 (6%)

Query: 911 IRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
           I+ K++ KK    + ++  AE  K  K + EK+K    +       K+    K+K K   
Sbjct: 100 IKDKQQVKKSYIGVHQQIEAEMYKVTKTELEKLKCSYRQLT-----KEVNSAKEKYKDAV 154

Query: 971 KKKKEAEKMKKKTEEA 986
            K KE EK +++ ++A
Sbjct: 155 AKGKETEKARERYDKA 170


>gnl|CDD|225074 COG2163, RPL14A, Ribosomal protein L14E/L6E/L27E [Translation,
           ribosomal structure and biogenesis].
          Length = 125

 Score = 28.9 bits (65), Expect = 5.9
 Identities = 12/80 (15%), Positives = 33/80 (41%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
             ++R   K  +  +K+    +   + +  +  + A  + K  K A+ ++ K   +    
Sbjct: 45  VPRRRINIKHLEPTDKVIDLVRGASDEKVKKANEAAGVLAKLDKSAKNLETKKVREPLTD 104

Query: 955 KKKKKKKKKKKKKKKKKKKK 974
            ++ K  K  K+++ +  K 
Sbjct: 105 AERFKVMKLVKEERLQVGKA 124


>gnl|CDD|191868 pfam07839, CaM_binding, Plant calmodulin-binding domain.  The
           sequences featured in this family are found repeated in
           a number of plant calmodulin-binding proteins, and are
           thought to constitute the calmodulin-binding domains.
           Binding of the proteins to calmodulin depends on the
           presence of calcium ions. These proteins are thought to
           be involved in various processes, such as plant defence
           responses and stolonisation or tuberization.
          Length = 115

 Score = 28.5 bits (64), Expect = 6.0
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           +K  ++ KK+  K +K+K K     K+   +K+  +  EK
Sbjct: 4   KKEERRSKKEDPKPEKEKPKSWSNLKKVILLKRFVKSLEK 43


>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
          Length = 650

 Score = 30.2 bits (69), Expect = 6.0
 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 9/62 (14%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           KK KKK EA     K   +K++ K ++         ++I  + KKK  K  K KKK++K+
Sbjct: 198 KKGKKKFEASFYGYKG--KKIELKNEED-------VKEILAELKKKDFKVSKVKKKERKR 248

Query: 973 KK 974
             
Sbjct: 249 NP 250


>gnl|CDD|236658 PRK10153, PRK10153, DNA-binding transcriptional activator CadC;
           Provisional.
          Length = 517

 Score = 30.4 bits (69), Expect = 6.0
 Identities = 15/76 (19%), Positives = 22/76 (28%), Gaps = 19/76 (25%)

Query: 137 PDATHVVVKVPVEG-------------------APTPATPDTEAPTPAPGTESGPSSTTP 177
            D    +V VP  G                   A +P       P  A  +ES  +    
Sbjct: 84  EDNPEYIVTVPKRGYKLVAPVIWYSEEEGEEILASSPPPIPEAVPATAEPSESANAQFPA 143

Query: 178 GPETKTLESLNEKEKQ 193
            P T+ +     K K+
Sbjct: 144 PPLTRAVCQSPAKSKR 159


>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain. 
          Length = 225

 Score = 29.7 bits (67), Expect = 6.0
 Identities = 13/64 (20%), Positives = 32/64 (50%), Gaps = 10/64 (15%)

Query: 935 KKKKDAEKMKKKGAEKIGKKKKKKKKKK----------KKKKKKKKKKKKEAEKMKKKTE 984
           ++++D    ++  A +  K++  +K +K          +KK+K K + +KEAE+   + +
Sbjct: 118 RERRDLRIEERDEASEKKKEELIEKAQKEIDDFYENYNEKKEKTKAQNRKEAEQFLAERD 177

Query: 985 EAEK 988
           +   
Sbjct: 178 DFSS 181


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 29.4 bits (67), Expect = 6.1
 Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 23/122 (18%)

Query: 889 ALGFI---LFRKKRTRRKKKKGAEKIRKK--KKKKKEAEKMRKKK---GAEKMKKKKKDA 940
            +GF      RK      K   AE+  K+  ++ KKEAE ++K+      E++ K + +A
Sbjct: 11  VVGFAIGYFVRKSIAE-AKISSAEEEAKQIIEEAKKEAEALKKEALLEAKEEIHKLRAEA 69

Query: 941 EKMKK------KGAEK--------IGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           E+  K      +  EK        + +K +  +KK++  ++K+K+     +++++K EE 
Sbjct: 70  ERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEEL 129

Query: 987 EK 988
           E+
Sbjct: 130 EE 131


>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
           Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the heterodimeric topo I from
           Leishmania donvanni. Topo I enzymes are divided into:
           topo type IA (bacterial) and type IB (eukaryotic). Topo
           I relaxes superhelical tension in duplex DNA by creating
           a single-strand nick, the broken strand can then rotate
           around the unbroken strand to remove DNA supercoils and,
           the nick is religated, liberating topo I. These enzymes
           regulate the topological changes that accompany DNA
           replication, transcription and other nuclear processes. 
           Human topo I is the target of a diverse set of
           anticancer drugs including camptothecins (CPTs). CPTs
           bind to the topo I-DNA complex and inhibit re-ligation
           of the single-strand nick, resulting in the accumulation
           of topo I-DNA adducts. In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topo I play putative roles in
           organizing the kinetoplast DNA network unique to these
           parasites.  This family may represent more than one
           structural domain.
          Length = 212

 Score = 29.8 bits (67), Expect = 6.2
 Identities = 13/28 (46%), Positives = 20/28 (71%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEAEKMK 980
           +K+KKK + K++KK  K++K KEAE   
Sbjct: 94  EKEKKKSRTKEEKKALKEEKDKEAEPYM 121


>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
          Length = 160

 Score = 29.2 bits (66), Expect = 6.2
 Identities = 21/99 (21%), Positives = 39/99 (39%), Gaps = 20/99 (20%)

Query: 889 ALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGA 948
           A+G I+ ++ R R  +K+   +I          E +RK +  E   K+    +       
Sbjct: 71  AVGVIVNKRVRGRIVRKRIHVRI----------EHVRKSRCREDFLKRVAANDA------ 114

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAE 987
                K + KKK +    K++  + + A  +K K  E  
Sbjct: 115 ----IKAEAKKKGELPSTKRQPAQPRPAGFVKPKPTEVV 149


>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
           subunit.
          Length = 1044

 Score = 30.3 bits (68), Expect = 6.2
 Identities = 16/55 (29%), Positives = 25/55 (45%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           + K K ++K KK     +G  KK+KKK         KK  ++  +     EE E+
Sbjct: 608 RGKSKKSKKGKKGRRSLLGGLKKRKKKGGGGGSMMGKKYGRKKSQAVFDLEEIEE 662


>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
           (Spore_III_AF).  This family represents the stage III
           sporulation protein AF (Spore_III_AF) of the bacterial
           endospore formation program, which exists in some but
           not all members of the Firmicutes (formerly called
           low-GC Gram-positives). The C-terminal region of these
           proteins is poorly conserved.
          Length = 185

 Score = 29.6 bits (67), Expect = 6.3
 Identities = 19/88 (21%), Positives = 45/88 (51%), Gaps = 10/88 (11%)

Query: 908 AEKIRKK-KKKKKEAEKMRKKK----GAEKMKKKKKDAEKMK---KKGAEKIGKKKKKKK 959
           A+++ K+ +KK KE   ++ K       E ++    D +++    K+ +++  K K +  
Sbjct: 85  AKQLEKQVEKKLKEEYGVKVKDVEVEIDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPV 144

Query: 960 --KKKKKKKKKKKKKKKEAEKMKKKTEE 985
               +  K K+++++ +EAEK+K    +
Sbjct: 145 VIDTQTSKPKEEEEESEEAEKIKNFLAD 172


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.1 bits (68), Expect = 6.3
 Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 5/86 (5%)

Query: 908 AEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKK----- 962
           AE+I   K   K+   +R +  AE+ +     +E+  ++       +++KK   +     
Sbjct: 160 AERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSEL 219

Query: 963 KKKKKKKKKKKKEAEKMKKKTEEAEK 988
              +KK ++ +    ++K +   AE 
Sbjct: 220 SADQKKLEELRANESRLKNEIASAEA 245



 Score = 30.1 bits (68), Expect = 6.4
 Identities = 10/93 (10%), Positives = 30/93 (32%), Gaps = 4/93 (4%)

Query: 896 RKKRTRRKKKKGAEKI----RKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKI 951
               + ++ ++         RKK   +  +E    +K  E+++  +   +          
Sbjct: 188 TTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAA 247

Query: 952 GKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
            K ++     +    + +  + K   +  K T 
Sbjct: 248 AKAREAAAAAEAAAARARAAEAKRTGETYKPTA 280


>gnl|CDD|153289 cd07605, I-BAR_IMD, Inverse (I)-BAR, also known as the IRSp53/MIM
           homology Domain (IMD), a dimerization module that binds
           and bends membranes.  Inverse (I)-BAR (or IMD) is a
           member of the Bin/Amphiphysin/Rvs (BAR) domain family.
           It is a dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions in the
           opposite direction compared to classical BAR and F-BAR
           domains, which produce membrane invaginations. IMD
           domains are found in Insulin Receptor tyrosine kinase
           Substrate p53 (IRSp53), Missing in Metastasis (MIM), and
           Brain-specific Angiogenesis Inhibitor 1-Associated
           Protein 2-like (BAIAP2L) proteins. These are
           multi-domain proteins that act as scaffolding proteins
           and transducers of a variety of signaling pathways that
           link membrane dynamics and the underlying actin
           cytoskeleton. Most members contain an N-terminal IMD, an
           SH3 domain, and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus, exccept for MIM which does not
           carry an SH3 domain. Some members contain additional
           domains and motifs. The IMD domain binds and bundles
           actin filaments, binds membranes and produces membrane
           protrusions, and interacts with the small GTPase Rac.
          Length = 223

 Score = 29.6 bits (67), Expect = 6.7
 Identities = 17/62 (27%), Positives = 30/62 (48%), Gaps = 2/62 (3%)

Query: 927 KKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEA 986
           +   + + K +KD +K  K+  E +   K + + KK +KK +K    K  EK+ +  EE 
Sbjct: 104 ELDQKVINKFEKDYKKEYKQKREDL--DKARSELKKLQKKSQKSGTGKYQEKLDQALEEL 161

Query: 987 EK 988
             
Sbjct: 162 ND 163


>gnl|CDD|223747 COG0675, COG0675, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 364

 Score = 30.0 bits (67), Expect = 6.7
 Identities = 11/65 (16%), Positives = 25/65 (38%)

Query: 895 FRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
            +K+ +R+K +    K  K K  +       ++K   K+ KK  +   +       +   
Sbjct: 212 AQKRLSRKKSRSKRRKKAKLKLARLRERIRNRRKDFHKLAKKLVELADLVGVETLVVEDL 271

Query: 955 KKKKK 959
            K++ 
Sbjct: 272 VKRRS 276


>gnl|CDD|214787 smart00721, BAR, BAR domain. 
          Length = 239

 Score = 29.7 bits (67), Expect = 6.7
 Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 8/46 (17%)

Query: 951 IGKKKKKKKKKK--------KKKKKKKKKKKKEAEKMKKKTEEAEK 988
           I K +KK ++K         K KK KK K+KK+ EK+ K  EE  K
Sbjct: 136 IKKARKKLERKLLDYDSARHKLKKAKKSKEKKKDEKLAKAEEELRK 181


>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897).  This is
           a bacterial family of uncharacterized proteins.
          Length = 55

 Score = 26.8 bits (60), Expect = 6.8
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 950 KIGKKKKKKKKKKKKKKKKKKKKKKE 975
           K  +  K+KKKK K+  ++  K   +
Sbjct: 30  KFPQFDKQKKKKAKRNAEQDAKNSAD 55


>gnl|CDD|233309 TIGR01200, GLPGLI, GLPGLI family protein.  This protein family was
           first noted as a paralogous set in Porphyromonas
           gingivalis, but it is more widely distributed among the
           Bacteroidetes. The protein family is now renamed GLPGLI
           after its best-conserved motif.
          Length = 227

 Score = 29.6 bits (67), Expect = 6.8
 Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 1/35 (2%)

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
           A KI +          K KK KK  +++ +K+K+K
Sbjct: 192 AIKI-ELNPTPIIIIPKPKKGKKISREKFKKIKEK 225


>gnl|CDD|233048 TIGR00605, rad4, DNA repair protein rad4.  All proteins in this
           family for which functions are known are involved in
           targeting nucleotide excision repair to specific regions
           of the genome.This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 713

 Score = 30.2 bits (68), Expect = 6.9
 Identities = 14/84 (16%), Positives = 31/84 (36%), Gaps = 4/84 (4%)

Query: 897 KKRTRRKKKKGAEKIRK--KKKKKKEAEK--MRKKKGAEKMKKKKKDAEKMKKKGAEKIG 952
             ++ +      EK+R       +K  E      +   + + +K K   +  +KG  K  
Sbjct: 172 PLKSAKLSNLIPEKVRLLLHPSVRKSEELPSRSLRGLRKPLVEKLKKCMETWQKGLRKTT 231

Query: 953 KKKKKKKKKKKKKKKKKKKKKKEA 976
           K   K     +  + K ++ +K +
Sbjct: 232 KGLLKLLNGGRYSRSKWEEIEKSS 255


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 30.0 bits (67), Expect = 6.9
 Identities = 15/38 (39%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMK 934
           +KR RRK+K   +  RK +K++ E  +    KGAE+M+
Sbjct: 388 QKRQRRKEKFWRQHCRKARKEQAE-RRPIPGKGAERMR 424


>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
          Length = 977

 Score = 30.3 bits (68), Expect = 7.0
 Identities = 15/32 (46%), Positives = 19/32 (59%)

Query: 954 KKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K + +  K KK KKKK  KKKEA K +   E+
Sbjct: 115 KNRVESWKDKKNKKKKSAKKKEAHKAQIPPEQ 146


>gnl|CDD|204272 pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4).  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) is a widespread family of prokaryotic direct
           repeats with spacers of unique sequence between
           consecutive repeats. This protein family, typified by
           YPO2462 of Yersinia pestis, is a CRISPR-associated (Cas)
           family strictly associated with the Ypest subtype of
           CRISPR/Cas locus. It is designated Csy4, for CRISPR/Cas
           Subtype Ypest protein 4.
          Length = 182

 Score = 29.1 bits (66), Expect = 7.3
 Identities = 14/58 (24%), Positives = 22/58 (37%), Gaps = 6/58 (10%)

Query: 872 HVAVSDVN---DDVRREAVTALGFILFRKKRTRRKKKKG---AEKIRKKKKKKKEAEK 923
           +  +SDV    DDV    V+ +       +  RR  K+     E+ R +  K  E   
Sbjct: 77  YFHISDVLPVPDDVEYVTVSRVQAKSSPARLRRRLMKRAGLTEEEARARIPKSLEKRL 134


>gnl|CDD|100102 cd00387, Ribosomal_L7_L12, Ribosomal protein L7/L12. Ribosomal
           protein L7/L12 refers to the large ribosomal subunit
           proteins L7 and L12, which are identical except that L7
           is acetylated at the N terminus. It is a component of
           the L7/L12 stalk, which is located at the surface of the
           ribosome. The stalk base consists of a portion of the
           23S rRNA and ribosomal proteins L11 and L10. An extended
           C-terminal helix of L10 provides the binding site for
           L7/L12. L7/L12 consists of two domains joined by a
           flexible hinge, with the helical N-terminal domain (NTD)
           forming pairs of homodimers that bind to the extended
           helix of L10. It is the only multimeric ribosomal
           component, with either four or six copies per ribosome
           that occur as two or three dimers bound to the L10
           helix. L7/L12 is the only ribosomal protein that does
           not interact directly with rRNA, but instead has
           indirect interactions through L10. The globular
           C-terminal domains of L7/L12 are highly mobile. They are
           exposed to the cytoplasm and contain binding sites for
           other molecules. Initiation factors, elongation factors,
           and release factors are known to interact with the
           L7/L12 stalk during their GTP-dependent cycles. The
           binding site for the factors EF-Tu and EF-G comprises
           L7/L12, L10, L11, the L11-binding region of 23S rRNA,
           and the sarcin-ricin loop of 23S rRNA. Removal of L7/L12
           has minimal effect on factor binding and it has been
           proposed that L7/L12 induces the catalytically active
           conformation of EF-Tu and EF-G, thereby stimulating the
           GTPase activity of both factors. In eukaryotes, the
           proteins that perform the equivalent function to L7/L12
           are called P1 and P2, which do not share sequence
           similarity with L7/L12. However, a bacterial L7/L12
           homolog is found in some eukaryotes, in mitochondria and
           chloroplasts. In archaea, the protein equivalent to
           L7/L12 is called aL12 or L12p, but it is closer in
           sequence to P1 and P2 than to L7/L12.
          Length = 127

 Score = 28.7 bits (65), Expect = 7.4
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 968 KKKKKKKEAEKMKKKTEEA 986
           K+   K+EAE++KKK EEA
Sbjct: 103 KEGVSKEEAEEIKKKLEEA 121


>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
          Length = 390

 Score = 29.9 bits (68), Expect = 7.5
 Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 7/57 (12%)

Query: 938 KDAEKMKKKGAEKIGKKKKKKKKKKKKKKKK--KK-----KKKKEAEKMKKKTEEAE 987
           KD E ++K+  E   K+KKKKKKK++K +    KK     ++ K         +E E
Sbjct: 154 KDLEFVEKRLDELTKKRKKKKKKKEEKVELDVLKKVLEWLEEGKPVRDGDWTDKEIE 210


>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; Very large protein
           containing McrA/HNH-nuclease related domain and a
           RuvC-like nuclease domain; signature gene for type II.
          Length = 799

 Score = 30.1 bits (68), Expect = 7.5
 Identities = 13/58 (22%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 931 EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
           E+  + K  +++ K++  +     KK K   K+K+K    K+++  + ++ +  +AEK
Sbjct: 302 EEQGETKILSKEEKQELLD--LLFKKNKLTYKQKRKLLGLKEEEIFKGLRYEGLKAEK 357


>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201).  This is
           a family of coiled-coil proteins from eukaryotes. The
           function is not known.
          Length = 177

 Score = 29.1 bits (66), Expect = 7.5
 Identities = 22/88 (25%), Positives = 45/88 (51%), Gaps = 8/88 (9%)

Query: 901 RRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKK 960
            R KK+  EK  +  K +KE  K++ ++  +K++K+ K   K++++G             
Sbjct: 94  ERLKKELKEKEERLAKLRKELYKVKLER--DKLRKQNK---KLRQQGGL---LHVPALLY 145

Query: 961 KKKKKKKKKKKKKKEAEKMKKKTEEAEK 988
              K  +K ++K+K  EK+++K +  E 
Sbjct: 146 DYDKTVEKVEEKRKSVEKLRRKHKILEM 173


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 29.7 bits (67), Expect = 7.6
 Identities = 9/59 (15%), Positives = 28/59 (47%)

Query: 924 MRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKK 982
             +       +   ++         ++  K++ K K++K++  + +K+ K+  E++K+K
Sbjct: 331 ATRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELMEELKEK 389


>gnl|CDD|227187 COG4850, COG4850, Uncharacterized conserved protein [Function
           unknown].
          Length = 373

 Score = 29.8 bits (67), Expect = 7.6
 Identities = 14/39 (35%), Positives = 19/39 (48%), Gaps = 2/39 (5%)

Query: 482 VSDVNDDVRREAVTALGFILFRTLFYVAPACTRSPWPGV 520
           +SD++D V+   VT       R+L     A TR   PGV
Sbjct: 165 ISDIDDTVKVTGVTEGPRKAGRSLL--LHALTRQVIPGV 201


>gnl|CDD|204658 pfam11477, PM0188, Sialyltransferase PMO188.  PMO188 is a
           sialyltransferase from P.multocida. It transfers sialic
           acid from cytidine 5'-monophosphonuraminic acid to an
           acceptor sugar. It has important catalytic residues such
           as Asp141, His311, Glu338, Ser355 and Ser356.
          Length = 381

 Score = 29.8 bits (67), Expect = 7.6
 Identities = 18/54 (33%), Positives = 20/54 (37%), Gaps = 2/54 (3%)

Query: 294 EALAHMQSYLPRE-TGPTSGYYEGGGLYALGLIHANHGAVINDYLLGQVKDATN 346
           E  A  Q  L  E   P S +Y G         H N G  IND +L    D  N
Sbjct: 258 EYYAQQQINLLNEAQNPHSPHYIGNNYDLFFKGHPN-GGAINDIILLHFPDMIN 310



 Score = 29.8 bits (67), Expect = 7.6
 Identities = 18/54 (33%), Positives = 20/54 (37%), Gaps = 2/54 (3%)

Query: 656 EALAHMQSYLPRE-TGPTSGYYEGGGLYALGLIHANHGAVINDYLLGQVKDATN 708
           E  A  Q  L  E   P S +Y G         H N G  IND +L    D  N
Sbjct: 258 EYYAQQQINLLNEAQNPHSPHYIGNNYDLFFKGHPN-GGAINDIILLHFPDMIN 310


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 30.2 bits (68), Expect = 7.6
 Identities = 8/53 (15%), Positives = 11/53 (20%), Gaps = 1/53 (1%)

Query: 134 APIPDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPETKTLES 186
           A                       D + P    G+ S P+S  P         
Sbjct: 443 ADGDAPVPAKANARASADSRCDERDAQ-PPADSGSASAPASDAPPDAAFEPAP 494


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 29.1 bits (66), Expect = 7.6
 Identities = 11/49 (22%), Positives = 26/49 (53%), Gaps = 3/49 (6%)

Query: 909 EKIRKKKKKKKEAEK--MRKKKGAEKMKKK-KKDAEKMKKKGAEKIGKK 954
           E++ + +++ +E +   +R     E  +K+ +++ E+ KK   EK  K 
Sbjct: 18  EELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAIEKFAKD 66


>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
           unknown].
          Length = 272

 Score = 29.7 bits (66), Expect = 7.7
 Identities = 13/88 (14%), Positives = 34/88 (38%), Gaps = 5/88 (5%)

Query: 902 RKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKK 961
              K    +   +    +  EK   ++  +  K        + +   +++        ++
Sbjct: 118 DPAKAENVEKVPESDAIEALEKQLTQQ--KSEKHNSSAINFIDELN-KRLWSDPFVSSQR 174

Query: 962 KKKKKKKKKK--KKKEAEKMKKKTEEAE 987
            +K+ +++KK  KK+EA+ +  K   A 
Sbjct: 175 LRKQFRERKKIEKKQEAKDLSLKNRAAL 202


>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
          Length = 150

 Score = 28.7 bits (65), Expect = 7.8
 Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 10/72 (13%)

Query: 874 AVSDVNDDVRREAVTAL---GFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGA 930
           A+ DV + + RE +  L   G I       + K KKG  + R +++ +K  +  R+  G+
Sbjct: 28  ALEDVAEAITREDIRELIKEGVI-------KAKPKKGISRGRARERHEKRKKGRRRGPGS 80

Query: 931 EKMKKKKKDAEK 942
            K KK  +  +K
Sbjct: 81  RKGKKGARTPKK 92


>gnl|CDD|152325 pfam11889, DUF3409, Domain of unknown function (DUF3409).  This
           domain is functionally uncharacterized. This domain is
           found in viruses. This presumed domain is about 60 amino
           acids in length. This domain is found associated with
           pfam00271, pfam05550, pfam05578.
          Length = 56

 Score = 27.1 bits (60), Expect = 7.8
 Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 14/58 (24%)

Query: 928 KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKK-----------KKKKKKKKKKK 974
           +GA+K K+KK D     KKGA KI  K+ +K  K K           K + KKK K +
Sbjct: 2   EGAKKKKQKKPD---RVKKGAMKITPKESEKDSKTKPPDATIVVEGVKYQVKKKGKVR 56


>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
           Willebrand factor type A (vWA) domain [General function
           prediction only].
          Length = 437

 Score = 29.7 bits (67), Expect = 7.9
 Identities = 14/97 (14%), Positives = 44/97 (45%), Gaps = 5/97 (5%)

Query: 897 KKRTRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKK 956
           ++  +  +++G+E   + +++   +EK  +   + +M+   ++ E++       +   + 
Sbjct: 113 EELLQDLQREGSEDFLEGEREGLLSEKQEEISLSGEMEGILEEVEELISGRLWDMSAGEL 172

Query: 957 KKKKKKKKKK-----KKKKKKKKEAEKMKKKTEEAEK 988
           KK   +   K      K  + +K  E++ ++  E E+
Sbjct: 173 KKGDLELLGKYADFLLKFPELRKILEQLGREDLEIEE 209


>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
           SecD/SecF; Reviewed.
          Length = 755

 Score = 29.8 bits (68), Expect = 8.0
 Identities = 12/23 (52%), Positives = 18/23 (78%)

Query: 953 KKKKKKKKKKKKKKKKKKKKKKE 975
           +K++ KKKKK+KK KK + +K E
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKIE 755


>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
           secretion / Intracellular trafficking and secretion /
           Posttranslational modification, protein turnover,
           chaperones].
          Length = 148

 Score = 28.9 bits (65), Expect = 8.3
 Identities = 16/105 (15%), Positives = 42/105 (40%), Gaps = 18/105 (17%)

Query: 898 KRTRRKKKKGAEKIRKKKKKKKEAE-KMRKKKGA-----------------EKMKKKKKD 939
            + R +K+   E+++     + E E  + +K  +                 E    +++ 
Sbjct: 26  SKIRSEKENAEEQLKMLSGYRNEYEQNLNEKLKSGVSAAQWQNYQQFISQLEVAIDQQQS 85

Query: 940 AEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
                +K  E+  +  ++K+ + K  +K K+++K E    + + E
Sbjct: 86  QLSKLRKQVEQKREIWQEKQIELKALEKLKERQKTEFLLEENRRE 130


>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2).  Surfeit locus
           protein 2 is part of a group of at least six sequence
           unrelated genes (Surf-1 to Surf-6). The six Surfeit
           genes have been classified as housekeeping genes, being
           expressed in all tissue types tested and not containing
           a TATA box in their promoter region. The exact function
           of SURF2 is unknown.
          Length = 244

 Score = 29.5 bits (66), Expect = 8.5
 Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 2/44 (4%)

Query: 939 DAEKMKKKGAEKIGKKKKKKKK--KKKKKKKKKKKKKKEAEKMK 980
           D ++M+ +  E   K+ KK+     KK KK  KKK       +K
Sbjct: 201 DEDEMEVESPELQQKRSKKQSGSLTKKFKKNHKKKGPFRFSSVK 244


>gnl|CDD|152940 pfam12506, DUF3713, Protein of unknown function (DUF3713).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 92 and 1225 amino
           acids in length. There is a single completely conserved
           residue S that may be functionally important.
          Length = 116

 Score = 28.2 bits (63), Expect = 8.5
 Identities = 16/53 (30%), Positives = 23/53 (43%)

Query: 932 KMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTE 984
           K K  +++      K   K+  K     K K KK K + K K+  EK+K  T 
Sbjct: 12  KNKVGQEETNTSSDKSTNKLKTKDADVDKLKDKKTKVETKAKELVEKLKVTTA 64


>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein.  This family
           is found in Trypanosome species, and appears to be one
           of two invariant surface glycoproteins, ISG65 and ISG75.
           that are found in the mammalian stage of the parasitic
           protozoan. the sequence suggests the two families are
           polypeptides with N-terminal signal sequences,
           hydrophilic extracellular domains, single trans-membrane
           alpha-helices and short cytoplasmic domains. they are
           both expressed in the bloodstream form but not in the
           midgut stage. Both polypeptides are distributed over the
           entire surface of the parasite.
          Length = 286

 Score = 29.4 bits (66), Expect = 8.7
 Identities = 21/98 (21%), Positives = 35/98 (35%), Gaps = 5/98 (5%)

Query: 868 KRLLHVAVSDVNDDVRREAVTALGFILFRKKRTRRKKKKGAEKIRKKKKKKKEAEKMRKK 927
           K L     ++  D + ++       I    K    + K   EK+  +   K      +K 
Sbjct: 46  KDLADKVGNERADYLVKKTEGFAANI----KLALDRVKYYLEKLESRSDSKLSDSDKKKI 101

Query: 928 KGA-EKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKK 964
           K   EK K K  +A    KK   +  K  ++ K+   K
Sbjct: 102 KEICEKAKDKIDEALPEAKKAGREAEKLAEEVKEAAAK 139


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 29.9 bits (68), Expect = 8.8
 Identities = 18/79 (22%), Positives = 33/79 (41%), Gaps = 8/79 (10%)

Query: 915 KKKKKEAEKMRKKKG----AEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK 970
                 AE      G    A ++ + +K+ EK++K+  ++I KK   +    K  ++  +
Sbjct: 794 TAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEI-DRIEKKLSNEGFVAKAPEEVVE 852

Query: 971 K---KKKEAEKMKKKTEEA 986
           K   K  E +    K EE 
Sbjct: 853 KEKEKLAEYQVKLAKLEER 871


>gnl|CDD|224192 COG1273, COG1273, Ku-homolog [Replication, recombination, and
           repair].
          Length = 278

 Score = 29.6 bits (67), Expect = 8.8
 Identities = 17/89 (19%), Positives = 31/89 (34%), Gaps = 27/89 (30%)

Query: 908 AEKIRKKKKKKKEAEKMR-------------KKKGAEKMKKKKKDA----------EKMK 944
           A+++  KK    + ++               K +G + +   KK+           + +K
Sbjct: 194 AKQLIDKKTGTFDPDEYEDRYQEALMELIEAKLEGRKTVAAPKKEPPEPSNVIDLMDALK 253

Query: 945 KKGAEKIGKKKKKKKKKKKKKKKKKKKKK 973
                 +  +KK  KKK   K     KKK
Sbjct: 254 AS----LEARKKAGKKKPAAKPAAAAKKK 278


>gnl|CDD|234671 PRK00157, rplL, 50S ribosomal protein L7/L12; Reviewed.
          Length = 123

 Score = 28.2 bits (64), Expect = 8.9
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 968 KKKKKKKEAEKMKKKTEEA 986
           K+   K+EAE++KKK EEA
Sbjct: 98  KEGVSKEEAEEIKKKLEEA 116


>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein.  This model
           describes nucleoid occlusion protein, a close homolog to
           ParB chromosome partitioning proteins including Spo0J in
           Bacillus subtilis. Its gene often is located near the
           gene for the Spo0J ortholog. This protein bind a
           specific DNA sequence and blocks cytokinesis from
           happening until chromosome segregation is complete.
          Length = 255

 Score = 29.4 bits (67), Expect = 8.9
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 946 KGAEKIGKKKKKKKKKKKKKKKKKKK 971
           K  E++ KK  +K +K KKKKK++K 
Sbjct: 185 KQTEELIKKLLEKPEKPKKKKKRRKG 210


>gnl|CDD|216599 pfam01603, B56, Protein phosphatase 2A regulatory B subunit (B56
           family).  Protein phosphatase 2A (PP2A) is a major
           intracellular protein phosphatase that regulates
           multiple aspects of cell growth and metabolism. The
           ability of this widely distributed heterotrimeric enzyme
           to act on a diverse array of substrates is largely
           controlled by the nature of its regulatory B subunit.
           There are multiple families of B subunits (See also
           pfam01240), this family is called the B56 family.
          Length = 405

 Score = 29.6 bits (67), Expect = 9.1
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 949 EKIGKKKKKKKKKKKKKKKKKKKKKKEAEK 978
           ++ G K K+K+ K+KKK+KK+KK      +
Sbjct: 373 DECGNKYKEKEAKEKKKEKKRKKTWARIAE 402


>gnl|CDD|153367 cd07683, F-BAR_srGAP1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Protein 1.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs. srGAP1, also called Rho
           GTPase-Activating Protein 13 (ARHGAP13), is a Cdc42- and
           RhoA-specific GAP and is expressed later in the
           development of CNS (central nervous system) tissues. It
           is an important downstream signaling molecule of Robo1.
           srGAP1 contains an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 253

 Score = 29.3 bits (65), Expect = 9.2
 Identities = 17/64 (26%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 915 KKKKKEAEKMRKK---KGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKK 971
           + K KEAEK  +K   +  + +   + +    ++   +KI K K+K++ K  + K K  K
Sbjct: 154 ESKLKEAEKQEEKQIGRSGDPVFHIRLEDRHQRRSSVKKIEKMKEKRQAKYSENKLKSIK 213

Query: 972 KKKE 975
            + E
Sbjct: 214 ARNE 217


>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
          Length = 159

 Score = 28.9 bits (65), Expect = 9.3
 Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 17/83 (20%)

Query: 908 AEKIRKKKKKKKEAEKMRKK-----KGAEKMKKK-----KKDAEKM-------KKKGAEK 950
             KI K  + +K+A +++ K     K A++  KK     K  AE +         + A+ 
Sbjct: 42  DNKIEKADEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAENVYEEIVKEAHEEADL 101

Query: 951 IGKKKKKKKKKKKKKKKKKKKKK 973
           I ++ K + +++K+K + + K +
Sbjct: 102 IIERAKLEAQREKEKAEYEIKNQ 124


>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
           splicing MRS1.  Members of this family adopt a secondary
           structure consisting of two beta sheets and one alpha
           helix, arranged as a beta-alpha-beta motif. Each beta
           sheet has five strands, arranged in a 32145 order, with
           the second strand being antiparallel to the rest.
           Mitochondrial resolvase Ydc2 is capable of resolving
           Holliday junctions and cleaves DNA after 5'-CT-3' and
           5'-TT-3' sequences. This family also contains the
           mitochondrial RNA-splicing protein MRS1 which is
           involved in the excision of group I introns.
          Length = 254

 Score = 29.3 bits (66), Expect = 9.3
 Identities = 9/28 (32%), Positives = 13/28 (46%)

Query: 948 AEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
                  +KKK + KKK  K  KK + +
Sbjct: 147 ERTEILAEKKKPRSKKKSSKNSKKLRID 174


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 137 to 196
           amino acids in length.
          Length = 149

 Score = 28.4 bits (64), Expect = 9.3
 Identities = 13/42 (30%), Positives = 19/42 (45%)

Query: 934 KKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           +K+     K  +K   K  +K    K K K+KK K K+   E
Sbjct: 5   RKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASE 46


>gnl|CDD|223436 COG0359, RplI, Ribosomal protein L9 [Translation, ribosomal
           structure and biogenesis].
          Length = 148

 Score = 28.7 bits (65), Expect = 9.3
 Identities = 11/42 (26%), Positives = 19/42 (45%)

Query: 944 KKKGAEKIGKKKKKKKKKKKKKKKKKKKKKKEAEKMKKKTEE 985
           K           K  + ++ K +KK  ++  EAE +K+K E 
Sbjct: 33  KGLAVPATKGNLKLLEARRAKLEKKAAEELAEAEALKEKLEG 74


>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
          Length = 111

 Score = 28.0 bits (62), Expect = 9.5
 Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 4/79 (5%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKK-- 970
           K++   KE   ++ ++  E  + K K  EK KK   E +        + +K+ +KK    
Sbjct: 18  KEESIDKEIANIKNEQENEIKEAKSKYEEKAKKTEEESLNMYNAALMEARKEAEKKAVEI 77

Query: 971 --KKKKEAEKMKKKTEEAE 987
             K K+EA  +K K  E E
Sbjct: 78  INKAKQEASLIKLKISEDE 96


>gnl|CDD|219226 pfam06911, Senescence, Senescence-associated protein.  This family
           contains a number of plant senescence-associated
           proteins of approximately 450 residues in length. In
           Hemerocallis, petals have a genetically based program
           that leads to senescence and cell death approximately 24
           hours after the flower opens, and it is believed that
           senescence proteins produced around that time have a
           role in this program. This family extends to the higher
           vertebrates where the full-length protein is often a
           Spartin, associated with mitochondrial membranes and
           transportation along microtubules.
          Length = 181

 Score = 28.8 bits (65), Expect = 9.6
 Identities = 17/78 (21%), Positives = 29/78 (37%), Gaps = 9/78 (11%)

Query: 905 KKGAEKIRKKKKKKKEAEKMRK--KKGAEKMKKKKKDAEKMKK-------KGAEKIGKKK 955
           + G E +  K    ++  ++    K+   ++KK  K   K+ K       K A  +G   
Sbjct: 25  QSGGEYLISKISPTEKPVEVSPSTKERLRRVKKLSKATAKVSKSVLNGVGKVAGNVGSSL 84

Query: 956 KKKKKKKKKKKKKKKKKK 973
                 KKK K  K +  
Sbjct: 85  ANSVAGKKKGKDGKGEPN 102


>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin, subunit
            D2 [Chromatin structure and dynamics / Cell division and
            chromosome partitioning].
          Length = 1128

 Score = 29.9 bits (67), Expect = 9.7
 Identities = 30/185 (16%), Positives = 62/185 (33%), Gaps = 34/185 (18%)

Query: 364  LLGSKSESAIQDMVAYAKETQHEKILRGLAVGISFIMYGQ---LENADDLVRDLC----E 416
             +G  SE    D   + KE +              ++YG+   L N   +V + C     
Sbjct: 861  RIGGTSEDDFVDAFFFIKEKE--------------LLYGEKSVLSNFKPVVEEGCSNSSR 906

Query: 417  EKDAILRRSGMYTIAMAYCVSYLYFDLFSDLTYRI--------VKNRSGMYTIAMAYCGS 468
              D  L+ +   ++    C+S+ +      L            ++  + +       C +
Sbjct: 907  FSDEELQVAAYLSLYKLMCLSFEFCSEHLPLLITSMEKHPIPRIRANAVVGLGDFLVCFN 966

Query: 469  GSNVATKRLLHVAVSDVNDDVRREAVTALGFILFRTLFYVAPACTRSPWPGVISLLDDPM 528
             +       L+  + D + DVRR  +  + F++      V     +     +  LL D  
Sbjct: 967  TTADEHTHYLYRRLGDEDADVRRTCLMTIHFLILAGQLKV-----KGQLGKMALLLTDED 1021

Query: 529  IELKE 533
             E+ +
Sbjct: 1022 AEISD 1026


>gnl|CDD|234874 PRK00955, PRK00955, hypothetical protein; Provisional.
          Length = 620

 Score = 29.5 bits (67), Expect = 9.7
 Identities = 17/62 (27%), Positives = 24/62 (38%)

Query: 913 KKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKKKKKKKKKKKKKKK 972
           +K +      +  KK G E +           KK  +K  KK           KK+KKK+
Sbjct: 559 RKPENYDLVYEALKKAGREDLIGNGPKCLIPPKKPKKKSNKKNNNNNYSSNNSKKRKKKR 618

Query: 973 KK 974
           KK
Sbjct: 619 KK 620


>gnl|CDD|235602 PRK05776, PRK05776, DNA topoisomerase I; Provisional.
          Length = 670

 Score = 29.6 bits (67), Expect = 9.7
 Identities = 16/63 (25%), Positives = 34/63 (53%), Gaps = 1/63 (1%)

Query: 900 TRRKKKKGAEKIRKKKKKKKEAEKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKKKKKKK 959
           TR  ++K  E IR  K  ++E  +  K+   + +++ KK+ +++ ++ A+ +G  K   K
Sbjct: 540 TRDFEEK-LEMIRTGKATREEVIEEAKETLNKLLEEFKKNKDEIGEELAKALGLIKPVGK 598

Query: 960 KKK 962
            K 
Sbjct: 599 CKI 601


>gnl|CDD|225058 COG2147, RPL19A, Ribosomal protein L19E [Translation, ribosomal
           structure and biogenesis].
          Length = 150

 Score = 28.4 bits (64), Expect = 9.8
 Identities = 20/93 (21%), Positives = 41/93 (44%), Gaps = 14/93 (15%)

Query: 874 AVSDVNDDVRREAVTAL---GFIL---------FRKKRTRRKKKKGAEKIRKKKKKKKEA 921
            + ++   + RE + AL   G I           R ++   +KKKG  +    +K  K A
Sbjct: 28  EIEEIASAITREDIRALIKDGVIKAKPKKGISRGRARKRHAQKKKGRRRGPGSRKGTKGA 87

Query: 922 EKMRKKKGAEKMKKKKKDAEKMKKKGAEKIGKK 954
               K++  ++++  +++  K++  G  KI K 
Sbjct: 88  RMPSKERWIKRIRALRRELRKLRDDG--KIDKH 118


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 29.5 bits (66), Expect = 9.8
 Identities = 19/72 (26%), Positives = 33/72 (45%), Gaps = 5/72 (6%)

Query: 921 AEKMRKKKGAEKMKKKKKDAEKMK-----KKGAEKIGKKKKKKKKKKKKKKKKKKKKKKE 975
           A  +R   G E++ +  KD  K++     KK  EK+  ++++    +     KKK+   E
Sbjct: 331 AAAVRLLNGNEQIIQLYKDLVKLQRHAGIKKAMEKLAAQQEEDAGNQGGGDCKKKQGASE 390

Query: 976 AEKMKKKTEEAE 987
             K   K +E E
Sbjct: 391 KSKEGGKGKETE 402


>gnl|CDD|177553 PHA03185, PHA03185, UL14 tegument protein; Provisional.
          Length = 214

 Score = 29.2 bits (65), Expect = 10.0
 Identities = 14/44 (31%), Positives = 19/44 (43%), Gaps = 1/44 (2%)

Query: 137 PDATHVVVKVPVEGAPTPATPDTEAPTPAPGTESGPSSTTPGPE 180
           P +  V   +P  GAP P TP    PT  P +ES     +   +
Sbjct: 172 PTSGPVAGGLPGGGAPAPPTPPRPGPTD-PASESDCGRDSRDGD 214


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.132    0.376 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 50,055,462
Number of extensions: 5135822
Number of successful extensions: 32922
Number of sequences better than 10.0: 1
Number of HSP's gapped: 20860
Number of HSP's successfully gapped: 2050
Length of query: 988
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 882
Effective length of database: 6,236,078
Effective search space: 5500220796
Effective search space used: 5500220796
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.5 bits)