RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5823
         (444 letters)



>gnl|CDD|192483 pfam10203, Pet191_N, Cytochrome c oxidase assembly protein PET191. 
           Pet191_N is the conserved N-terminal of a family of
           conserved proteins found from nematodes to humans. It
           carries six highly conserved cysteine residues. Pet191
           is required for the assembly of active cytochrome c
           oxidase but does not form part of the final assembled
           complex.
          Length = 68

 Score = 88.5 bits (220), Expect = 6e-22
 Identities = 36/67 (53%), Positives = 47/67 (70%), Gaps = 3/67 (4%)

Query: 377 DKSACAGSRADLKLCLLETDCVQVDRKTPRQCLREGLAP---QCEALKNTFFECKRSLLD 433
             S+C G R DL  CLLE+DCV+ +++TPR+CL+E L     +C AL+  FFECKR +LD
Sbjct: 1   MGSSCKGLREDLAECLLESDCVKKEKRTPRECLKEPLKELPEECIALRKAFFECKRGMLD 60

Query: 434 NRTRFRG 440
            R RFRG
Sbjct: 61  MRKRFRG 67


>gnl|CDD|214584 smart00249, PHD, PHD zinc finger.  The plant homeodomain (PHD)
           finger is a C4HC3 zinc-finger-like motif found in
           nuclear proteins thought to be involved in epigenetics
           and chromatin-mediated transcriptional regulation. The
           PHD finger binds two zinc ions using the so-called
           'cross-brace' motif and is thus structurally related to
           the RING finger and the FYVE finger. It is not yet known
           if PHD fingers have a common molecular function. Several
           reports suggest that it can function as a
           protein-protein interacton domain and it was recently
           demonstrated that the PHD finger of p300 can cooperate
           with the adjacent BROMO domain in nucleosome binding in
           vitro. Other reports suggesting that the PHD finger is a
           ubiquitin ligase have been refuted as these domains were
           RING fingers misidentified as PHD fingers.
          Length = 47

 Score = 54.1 bits (130), Expect = 6e-10
 Identities = 23/48 (47%), Positives = 28/48 (58%), Gaps = 3/48 (6%)

Query: 292 ICPACGVQDDGSLPMIGCDGCDAWYHWVCVG--LVAEPETSDWFCPKC 337
            C  CG  DDG   ++ CDGCD WYH  C+G  L+ E     W+CPKC
Sbjct: 1   YCSVCGKPDDGGE-LLQCDGCDRWYHQTCLGPPLLEEEPDGKWYCPKC 47


>gnl|CDD|201356 pfam00628, PHD, PHD-finger.  PHD folds into an interleaved type of
           Zn-finger chelating 2 Zn ions in a similar manner to
           that of the RING and FYVE domains. Several PHD fingers
           have been identified as binding modules of methylated
           histone H3.
          Length = 51

 Score = 53.6 bits (129), Expect = 1e-09
 Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 4/50 (8%)

Query: 292 ICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPET---SDWFCPKCS 338
            C  CG  DD    ++ CDGCD W+H  C+G   EPE     +W+CP+C 
Sbjct: 1   YCAVCGKVDDDGE-LLLCDGCDRWFHLACLGPPLEPEEIPEGEWYCPECK 49


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 60.0 bits (146), Expect = 1e-09
 Identities = 33/177 (18%), Positives = 63/177 (35%)

Query: 57  TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
           T      +++   K K    +  S    T  + KE  + K         V   +     +
Sbjct: 6   TKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKD 65

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
                + D   KK K   K    +   KKK K + D  K  EK      D +    K  +
Sbjct: 66  TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDID 125

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
              + + +    +D +   + I  ++ D + ++   D D   + E+ K+ K+ ++ S
Sbjct: 126 VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182



 Score = 47.7 bits (114), Expect = 9e-06
 Identities = 25/139 (17%), Positives = 51/139 (36%), Gaps = 5/139 (3%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
              K+    ++  KKK K+   K K K    K + KE  + KK        ++ ++    
Sbjct: 4   ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKK-----TPEQIDQVLIF 58

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 241
                   D     +  +K+ K + K   +K   KK  K++L   KK +++++ +K+   
Sbjct: 59  LSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDL 118

Query: 242 KLTFKFGTDMEEKTKRESS 260
                     +     +  
Sbjct: 119 NYVKDIDVLNQADDDDDDD 137



 Score = 44.2 bits (105), Expect = 9e-05
 Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 1/136 (0%)

Query: 103 TPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 162
           T            E   ++KK  +  K K    K+E ++  + K K  + +I        
Sbjct: 2   TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPE-QIDQVLIFLS 60

Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
             V    +  + +  +KK +   K    K    +K++ E   S+K +K    DK      
Sbjct: 61  GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNY 120

Query: 223 LKKKKKEKERSSHEKE 238
           +K      +    + +
Sbjct: 121 VKDIDVLNQADDDDDD 136



 Score = 44.2 bits (105), Expect = 1e-04
 Identities = 23/159 (14%), Positives = 55/159 (34%)

Query: 60  PSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLG 119
             +  D+V        K+   + E   PK+K  +  K              ++ SS+   
Sbjct: 48  TPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE 107

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           +                    +     D  D D + + + D +   + + E    ++ + 
Sbjct: 108 KKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDD 167

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
           ++E++ + KE ++   +     ++D  +  +   KD KL
Sbjct: 168 EDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206



 Score = 29.2 bits (66), Expect = 4.9
 Identities = 20/129 (15%), Positives = 43/129 (33%), Gaps = 1/129 (0%)

Query: 64  KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTP-MVPNIVKVTSSEDLGEIK 122
           K         +P +K    E  S K+ E     D +  +      +++     +D  +  
Sbjct: 80  KTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDD 139

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
            D            ++ ++     D  D+D+ K + K+ EK  D +     +++ E   +
Sbjct: 140 DDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQ 199

Query: 183 KEFKMKEDK 191
                K   
Sbjct: 200 ARKDAKLTA 208


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 57.1 bits (137), Expect = 1e-08
 Identities = 38/143 (26%), Positives = 62/143 (43%), Gaps = 4/143 (2%)

Query: 120  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
            E KK    KK  +  KKK     KK ++ K   +    E +      +  E+  +  ++K
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372

Query: 180  KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
            KEE + K    K+K  EK + +E   + E+  K  D+  K    KKK  E ++ + EK+ 
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432

Query: 240  IPKLTFKFGTDMEEKTKRESSPK 262
              +   K     EE  K + + K
Sbjct: 1433 ADEAKKK----AEEAKKADEAKK 1451



 Score = 55.9 bits (134), Expect = 3e-08
 Identities = 44/172 (25%), Positives = 84/172 (48%), Gaps = 3/172 (1%)

Query: 70   KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
            + + +  E  +++EK    EK+  + K              K   ++   E  K K+ + 
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410

Query: 130  DKKTHKKKEKEKIKKK---KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
             K    KK+ ++ KKK   K K D+ K K +E  K  +  K+ E+ KK E+ KK+ +E K
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470

Query: 187  MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
              ++ +K  E+ +K ++  +K +++K K  + KK    KKK ++ + + E +
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522



 Score = 55.1 bits (132), Expect = 5e-08
 Identities = 32/129 (24%), Positives = 63/129 (48%), Gaps = 1/129 (0%)

Query: 111  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
            K   ++   +  K K+ +  K     K + +    + +  ++K +  EK KE+   K   
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382

Query: 171  -KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
             K K EEK+K +E + K +EDK+K  E  +      + ++  K  ++K K ++ KKK +E
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEE 1442

Query: 230  KERSSHEKE 238
             +++   K+
Sbjct: 1443 AKKADEAKK 1451



 Score = 55.1 bits (132), Expect = 5e-08
 Identities = 47/175 (26%), Positives = 95/175 (54%), Gaps = 7/175 (4%)

Query: 88   KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK-KKEKEKIKKKK 146
            K+ E +K+K  EL         V+    ++  E KK + +KK ++ +K K  +E  K ++
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672

Query: 147  DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
            DKK  ++ K  E+D++K     +   K+ E+ KK E+  K + +++K  E+++K E++++
Sbjct: 1673 DKKKAEEAKKAEEDEKK---AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729

Query: 207  ---KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
               +E K + ++ K K E+ KK ++EK++ +H K+   K   +   + E   + E
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEE 1784



 Score = 53.2 bits (127), Expect = 2e-07
 Identities = 48/180 (26%), Positives = 82/180 (45%), Gaps = 6/180 (3%)

Query: 62   KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
            K K + + K   + K+K    +K +   K  ++    E           +    E     
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE---AK 1377

Query: 122  KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD---KEKEKGKKEEKE 178
            KK  + KK  +  KK ++ K K ++DKK  D++K     K+K  +   K +EK K +E +
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437

Query: 179  KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            KK E+  K  E K+K  E  + EE   + E+  K  + K K E+ KK  + K+++   K+
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK 1497



 Score = 52.8 bits (126), Expect = 3e-07
 Identities = 38/143 (26%), Positives = 67/143 (46%), Gaps = 4/143 (2%)

Query: 120  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
            E KK    KK  +  KK ++ K K ++ KK  D  K K ++ +K  +  K + +    E 
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEA 1359

Query: 180  KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
            +  +E     +K+K   K + +    + E+K K  + K K E+ KKK  E ++++  K+ 
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419

Query: 240  IPKLTFKFGTDMEEKTKRESSPK 262
              +   K     EEK K + + K
Sbjct: 1420 ADEAKKK----AEEKKKADEAKK 1438



 Score = 51.3 bits (122), Expect = 9e-07
 Identities = 46/176 (26%), Positives = 93/176 (52%), Gaps = 3/176 (1%)

Query: 64   KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 123
            K K     K    + M+ ++    K+ E +++++V            +     +  +IK 
Sbjct: 1563 KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA 1622

Query: 124  DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
            ++ +KK ++  KK E+ K K+ ++KK  +++K  E+  E K+   +E  K EE +KK E+
Sbjct: 1623 EE-LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE--ENKIKAAEEAKKAEEDKKKAEE 1679

Query: 184  EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
              K +ED++K  E ++KE ++++K ++ K K+ + KK+  + KK E+E     +E 
Sbjct: 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEA 1735



 Score = 47.4 bits (112), Expect = 1e-05
 Identities = 44/183 (24%), Positives = 89/183 (48%), Gaps = 6/183 (3%)

Query: 61   SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
            ++ K + + K K   K+K  + +K + ++K+  + K            + K  +++   +
Sbjct: 1362 AEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD 1421

Query: 121  IKKDKSIKKDKKTHKKKEKEKIKK----KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
              K K+ +K K    KK+ E+ KK    KK  ++  K +  +K  E+    ++ K K EE
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481

Query: 177  KEKKEEKEFKMKEDKEKPVEKIRKEE--KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
             +K +E + K +E K+K  E  +  E  K +++ KK+++  K  + +K ++ KK  E   
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541

Query: 235  HEK 237
             E+
Sbjct: 1542 AEE 1544



 Score = 47.1 bits (111), Expect = 2e-05
 Identities = 52/223 (23%), Positives = 105/223 (47%), Gaps = 7/223 (3%)

Query: 76   KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
            +E     E+    E+E  KV+ ++           ++  +E+  +IK  +  KK ++  K
Sbjct: 1616 EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675

Query: 136  K-----KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
            K     K +E  +KK  +  K + +  +K +E K  + +EK K EE +K EE+     E+
Sbjct: 1676 KAEEAKKAEED-EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734

Query: 191  KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK-EKERSSHEKEVIPKLTFKFGT 249
             +K  E+ +K+ ++++K+++ K K   LKKE+ KK ++  KE+ +  +E + +   K   
Sbjct: 1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRM 1794

Query: 250  DMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQVWI 292
            ++++K K        I          +  + E +    K+V  
Sbjct: 1795 EVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVAD 1837



 Score = 44.7 bits (105), Expect = 1e-04
 Identities = 43/140 (30%), Positives = 66/140 (47%), Gaps = 4/140 (2%)

Query: 125  KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
            K+ +  K    KK +E  K  + KK ++K K  E  K +++ K +EK K EE +K EE +
Sbjct: 1517 KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576

Query: 185  FKMKEDKE--KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 242
                   E  K  E+ R EE     E++ K K ++ KK +  K K E+ +    +E   K
Sbjct: 1577 NMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK--AEEEKKK 1634

Query: 243  LTFKFGTDMEEKTKRESSPK 262
            +      + EEK K E   K
Sbjct: 1635 VEQLKKKEAEEKKKAEELKK 1654



 Score = 44.4 bits (104), Expect = 1e-04
 Identities = 48/182 (26%), Positives = 79/182 (43%), Gaps = 7/182 (3%)

Query: 62   KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
            K  ++    D+   K +   K+    K+   +K K  E           K   ++   E 
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA--KKKAEEKKKADEAKKKAEE 1442

Query: 122  KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK--EKDKEKKVDKEKEKGKKEEKEK 179
             K     K K    KK +E  KK ++ K  D+ K K  E  K  +  K+ E+ KK+  E 
Sbjct: 1443 AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502

Query: 180  KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK---KKKEKERSSHE 236
            K+  E K K D+ K  E+ +K ++  + E+  K  + K  +EK K    KK E+ + + E
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562

Query: 237  KE 238
            K+
Sbjct: 1563 KK 1564



 Score = 44.4 bits (104), Expect = 1e-04
 Identities = 48/180 (26%), Positives = 79/180 (43%), Gaps = 10/180 (5%)

Query: 62   KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
            K K +   K     K+  ++K+K    +K+  + K  +           K   ++   E 
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKAD-EAKKKAEEAKKADEAKKKAEE 1455

Query: 122  KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD----KEKEKGKKEEK 177
             K     K K    KK  E  KK ++ K  D+ K K ++ +KK D      + K K +E 
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515

Query: 178  EKKEEK----EFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKEKER 232
            +K EE     E K  E+ +K  E  + EEK    E KK+++  K  +K+K ++ KK +E 
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575



 Score = 44.0 bits (103), Expect = 2e-04
 Identities = 46/159 (28%), Positives = 80/159 (50%), Gaps = 9/159 (5%)

Query: 120  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK--KVDKEKEKGKKEEK 177
            E KK    KK  +  KK ++ K K ++ KK  D+ K   + K+K  +  K +E  K +E 
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527

Query: 178  EKKEEK----EFKMKEDKEKPVEKIRKEE-KDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
            +K EE     E K  E+K+K  E  + EE K +E++KK+++  K  + + +  +K E+ +
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587

Query: 233  SSHEK--EVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
             + E   E + KL  +      E+ K+    KI  + +K
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626



 Score = 42.1 bits (98), Expect = 6e-04
 Identities = 35/167 (20%), Positives = 75/167 (44%), Gaps = 2/167 (1%)

Query: 115  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            +E+    ++ +  ++ +  H  + +  IK ++ +K  +  K +EK K  +  K +EK K 
Sbjct: 1245 AEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKA 1304

Query: 175  EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
            +E +KK E+  K  E K+K  E  +K +   +K +++K   +  K E   +   ++  ++
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA--EAAADEAEAA 1362

Query: 235  HEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADE 281
             EK    +   +      +  K+++  K      K  +      ADE
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409



 Score = 40.9 bits (95), Expect = 0.001
 Identities = 33/125 (26%), Positives = 70/125 (56%), Gaps = 2/125 (1%)

Query: 115  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            +++  + ++ K   + KK  +KK+ +++KK ++ K  ++ K  E+ K+ + DK     K 
Sbjct: 1524 ADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKA 1583

Query: 175  EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
            EE +K EE   +      +  +K++ EE  ++K +++K K ++LKK + +KKK E+ +  
Sbjct: 1584 EEAKKAEEARIEEVMKLYEEEKKMKAEE--AKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641

Query: 235  HEKEV 239
              +E 
Sbjct: 1642 EAEEK 1646



 Score = 32.0 bits (72), Expect = 0.79
 Identities = 41/205 (20%), Positives = 86/205 (41%), Gaps = 10/205 (4%)

Query: 61   SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
               +D  + +       K    +KT   + E ++  +             +   +ED  +
Sbjct: 1082 DAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARK 1141

Query: 121  IKKDKSIKKDKKTHKKKEKEKIKK-KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
             ++ +  +  K+    ++ E  +K ++ +K +D  K +   K ++V K +E  K E+  K
Sbjct: 1142 AEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARK 1201

Query: 180  KEEKEFKMKEDKEKPVEKIRK--EEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
             E      K ++E+  E+ RK  + K +E  KK+++     KK+  + KK E+ER++ E 
Sbjct: 1202 AEAAR---KAEEERKAEEARKAEDAKKAEAVKKAEE----AKKDAEEAKKAEEERNNEEI 1254

Query: 238  EVIPKLTFKFGTDMEEKTKRESSPK 262
                +         +   K E + K
Sbjct: 1255 RKFEEARMAHFARRQAAIKAEEARK 1279



 Score = 31.6 bits (71), Expect = 1.0
 Identities = 33/131 (25%), Positives = 71/131 (54%), Gaps = 12/131 (9%)

Query: 120  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
            E++K +  +K +   K +E+ K ++ +  +D  K +  +K +E K D E+ K  +EE+  
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN 1251

Query: 180  KEEKEFK--------MKEDKEKPVEKIRKEE-KDSEKEKKS---KDKDKKLKKEKLKKKK 227
            +E ++F+         ++   K  E  + +E K +E++KK+   K  ++K K ++ KKK 
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKA 1311

Query: 228  KEKERSSHEKE 238
            +E +++   K+
Sbjct: 1312 EEAKKADEAKK 1322



 Score = 31.3 bits (70), Expect = 1.4
 Identities = 31/166 (18%), Positives = 79/166 (47%), Gaps = 4/166 (2%)

Query: 72   KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 131
            K + +++  + E    + +E  K ++++           ++  +E+  +IK +++ K+ +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740

Query: 132  KTHKKKEKEKI----KKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
            +  KK E+ K     KKK     K++ K  E+ +++K    +E+  +E+++++ E + K+
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKI 1800

Query: 188  KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
            K+  +     I   ++ +     SK+ +    KE    K  + E +
Sbjct: 1801 KDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEA 1846



 Score = 28.6 bits (63), Expect = 8.1
 Identities = 31/134 (23%), Positives = 70/134 (52%), Gaps = 10/134 (7%)

Query: 115  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            +ED  + +  +  ++ +K  + ++ E  +K +  +  ++ +  E+ ++ +  K+ E  KK
Sbjct: 1172 AEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKK 1231

Query: 175  EEKEKKEEKEFKMKEDKEKPVEKIRKEE---------KDSEKEKKSKDKDKKLKKEKLKK 225
             E+ KK+ +E K K ++E+  E+IRK E         + +  + +   K  +LKK + KK
Sbjct: 1232 AEEAKKDAEEAK-KAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK 1290

Query: 226  KKKEKERSSHEKEV 239
            K  E +++  +K+ 
Sbjct: 1291 KADEAKKAEEKKKA 1304


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 53.0 bits (128), Expect = 2e-07
 Identities = 27/86 (31%), Positives = 48/86 (55%), Gaps = 9/86 (10%)

Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
           T  KK  +KIKK  +K +K         K ++  KEK+K     K+K+EE+E + ++ +E
Sbjct: 403 TGSKKATKKIKKIVEKAEK---------KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453

Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKL 218
           +  E+  + E++ E+E++ K K   L
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKKKQATL 479



 Score = 52.6 bits (127), Expect = 2e-07
 Identities = 23/81 (28%), Positives = 50/81 (61%)

Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
            +++ + +   +K  +K     ++  KK E+EKKE+K+      K++  E+  KE+K+ E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 207 KEKKSKDKDKKLKKEKLKKKK 227
           KE++ ++ +++ ++E+ KKKK
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKK 475



 Score = 52.6 bits (127), Expect = 2e-07
 Identities = 28/81 (34%), Positives = 52/81 (64%), Gaps = 1/81 (1%)

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
              EI+     KK  K   KK  EK +KK++++ K+K K     K+K+ ++E+EK KKEE
Sbjct: 395 TEEEIEFLTGSKKATKK-IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453

Query: 177 KEKKEEKEFKMKEDKEKPVEK 197
           ++++EE+E + ++++E+  +K
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKK 474



 Score = 51.5 bits (124), Expect = 5e-07
 Identities = 25/83 (30%), Positives = 53/83 (63%), Gaps = 2/83 (2%)

Query: 150 DKDKIKN--KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
            +++I+     K   KK+ K  EK +K+ +E+K+EK+ K    K+K  E+  ++EK  E+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 208 EKKSKDKDKKLKKEKLKKKKKEK 230
           +++ +++ ++ K+E+ +KKKK+ 
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 50.7 bits (122), Expect = 9e-07
 Identities = 24/72 (33%), Positives = 45/72 (62%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           KK     K      +K++E+ KK+K KK     K +E+++E+K  KE+EK ++EE+ ++E
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465

Query: 182 EKEFKMKEDKEK 193
           ++E + K+ K+ 
Sbjct: 466 KEEEEEKKKKQA 477



 Score = 45.7 bits (109), Expect = 3e-05
 Identities = 22/80 (27%), Positives = 51/80 (63%), Gaps = 3/80 (3%)

Query: 160 DKEKKV---DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
           ++E +     K+  K  K+  EK E+K  + K++K+K     +K+E++ E+EK+ K+++K
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455

Query: 217 KLKKEKLKKKKKEKERSSHE 236
           + ++E+ +++K+E+E    +
Sbjct: 456 EEEEEEAEEEKEEEEEKKKK 475



 Score = 44.9 bits (107), Expect = 6e-05
 Identities = 19/66 (28%), Positives = 40/66 (60%)

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
            +K  +K  K+ E  EK  E+ +KE+K      K K+++++ +KEK +++K+E+E  + E
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464

Query: 237 KEVIPK 242
           ++   +
Sbjct: 465 EKEEEE 470



 Score = 42.6 bits (101), Expect = 3e-04
 Identities = 17/70 (24%), Positives = 51/70 (72%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
           +++ + +I +    K++++  +KK+K    KKK+++++++ + KE++KE++ ++ +E+ +
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467

Query: 174 KEEKEKKEEK 183
           +EE++KK++ 
Sbjct: 468 EEEEKKKKQA 477



 Score = 41.8 bits (99), Expect = 6e-04
 Identities = 21/64 (32%), Positives = 44/64 (68%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
           IKK     + K+  +KKEK+K      KK++++ + KEK +E+K ++E+E  +++E+E++
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471

Query: 181 EEKE 184
           ++K+
Sbjct: 472 KKKK 475



 Score = 38.4 bits (90), Expect = 0.007
 Identities = 23/82 (28%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
            E+E +     +K  +KI+K  + +EK+++ + K+KK K    KKK++E+E    +KE  
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE-E 453

Query: 241 PKLTFKFGTDMEEKTKRESSPK 262
            K   +   + E++ + E   K
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKKK 475



 Score = 34.9 bits (81), Expect = 0.074
 Identities = 14/79 (17%), Positives = 40/79 (50%)

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
            ++E +     +   K ++KI ++ +   +E+K + K K    +K +++++E++    E+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 238 EVIPKLTFKFGTDMEEKTK 256
           +   +   +   + EE+ K
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473



 Score = 31.8 bits (73), Expect = 0.75
 Identities = 17/108 (15%), Positives = 40/108 (37%), Gaps = 11/108 (10%)

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKK----SKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 242
           + +   +   ++    + +E+E +    SK   KK+KK   K +KK +E    +K+    
Sbjct: 377 IFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFA 436

Query: 243 LTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV 290
                    +++ + E   +   +  +          +E +    KQ 
Sbjct: 437 -------GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 29.9 bits (68), Expect = 3.3
 Identities = 12/84 (14%), Positives = 37/84 (44%), Gaps = 7/84 (8%)

Query: 84  KTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK 143
                +K    V+  E           +    +      K K  +++++  +KKE+EK +
Sbjct: 405 SKKATKKIKKIVEKAEKKREE------EKKEKKKKAFAGKKKEEEEEEEK-EKKEEEKEE 457

Query: 144 KKKDKKDKDKIKNKEKDKEKKVDK 167
           ++++ +++ + + ++K K+  +  
Sbjct: 458 EEEEAEEEKEEEEEKKKKQATLFD 481


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 52.6 bits (126), Expect = 3e-07
 Identities = 32/126 (25%), Positives = 65/126 (51%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
            VK                 K+ K    KE+EK K++  ++ K K +  +++ + +  KE
Sbjct: 76  AVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKE 135

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           + K K+  KEK++EKE K++E +++  EK R+  +   + KK   K    KK++  +++K
Sbjct: 136 EAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195

Query: 229 EKERSS 234
           +++ + 
Sbjct: 196 QRQAAR 201



 Score = 51.4 bits (123), Expect = 5e-07
 Identities = 37/149 (24%), Positives = 73/149 (48%)

Query: 89  EKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK 148
           E + S  + V+             T      + +  K  +K+K+  K+++K+K +K K++
Sbjct: 68  ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127

Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
               K K + K+K    +KEKEK KK E+ +  E+E K +  + K   K   ++K   K+
Sbjct: 128 PKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKK 187

Query: 209 KKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           K+  +++K+ +  +   K K +E   +E+
Sbjct: 188 KEPPEEEKQRQAAREAVKGKPEEPDVNEE 216



 Score = 48.0 bits (114), Expect = 8e-06
 Identities = 51/223 (22%), Positives = 95/223 (42%), Gaps = 16/223 (7%)

Query: 61  SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
            K   K         KE  +   K   KEKE  K +  +    P           ++  +
Sbjct: 81  EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKP-----------KEEPK 129

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK-KVDKEKEKGKKEEKEK 179
            +K K   K+K+  K+KEKEK KK ++ +D+++ K +E+ + K +  K  +K    +K++
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE 189

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKKKKEKERSSHEKE 238
             E+E + +  +E    K  + + + E+EK+  D KD++     +++ +  +      + 
Sbjct: 190 PPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRS 249

Query: 239 VIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS---PSPPPVAA 278
                       M     RESS +   +P  S   PS  P +A
Sbjct: 250 SSSLKKPDPSPSMASPETRESSKRTETRPRTSLRPPSARPASA 292



 Score = 46.0 bits (109), Expect = 2e-05
 Identities = 49/230 (21%), Positives = 96/230 (41%), Gaps = 21/230 (9%)

Query: 52  VHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVK 111
           V   G+  P+            S KE+   KE+   ++K+                   K
Sbjct: 80  VEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKK-------------------K 120

Query: 112 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK-KVDKEKE 170
               ++  + +K K   K+K+  K+KEKEK KK ++ +D+++ K +E+ + K +  K  +
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKKKKE 229
           K    +K++  E+E + +  +E    K  + + + E+EK+  D KD++     +++ +  
Sbjct: 181 KKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESR 240

Query: 230 KERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAA 279
           +      +             M     RESS +   +P  S  PP    A
Sbjct: 241 QSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRTSLRPPSARPA 290



 Score = 43.3 bits (102), Expect = 2e-04
 Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 3/143 (2%)

Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
           +K   K         K+ + +  KE+EK K++ KE+K++K+ K KE+ +    K   +EK
Sbjct: 81  EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140

Query: 204 DSEKEK---KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 260
              KEK   K K  ++   +E+ KK+++ + +S  +K    K   K     EE+ +R+++
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200

Query: 261 PKIVIKPVKSPSPPPVAAADEYD 283
            + V    + P        +E D
Sbjct: 201 REAVKGKPEEPDVNEEREKEEDD 223



 Score = 41.8 bits (98), Expect = 6e-04
 Identities = 30/129 (23%), Positives = 60/129 (46%), Gaps = 2/129 (1%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           K   D+   + +    K    K K  KE K +  ++E K KE  ++  +K +++ K+  K
Sbjct: 70  KLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129

Query: 208 EKKSKD--KDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 265
           ++K K+  K+K+  KEK K+K+K+ E     +E   +   +  +  ++  K++   K   
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE 189

Query: 266 KPVKSPSPP 274
            P +     
Sbjct: 190 PPEEEKQRQ 198



 Score = 32.6 bits (74), Expect = 0.51
 Identities = 29/167 (17%), Positives = 63/167 (37%), Gaps = 12/167 (7%)

Query: 48  PQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVP 107
            +    P   P   K K++   K     KEK   K+   P+++E  K ++          
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV-------- 169

Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
                  S      KK    KK +   ++K+++  ++    K ++   N+E++KE+   K
Sbjct: 170 ----RAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGK 225

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
           ++E      +E +  +  ++       ++K       +  E +   K
Sbjct: 226 DRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSK 272


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 50.9 bits (123), Expect = 8e-07
 Identities = 40/160 (25%), Positives = 88/160 (55%), Gaps = 19/160 (11%)

Query: 109 IVKVTSSEDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV-D 166
           + K  +   + E +++ K I ++ K    KE E IKK+   + K++I     + EK++ +
Sbjct: 24  VRKKIAEAKIKEAEEEAKRILEEAK----KEAEAIKKEALLEAKEEIHKLRNEFEKELRE 79

Query: 167 KEKEKGKKEEK-EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
           +  E  K E++  +KEE   +  E  EK  E++ K+EK+ E++++  +K ++  +E +++
Sbjct: 80  RRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139

Query: 226 KKKEKER----SSHE-KEVIPKLTFKFGTDMEEKTKRESS 260
           + +E ER    ++ E KE++ +        +EE+ + E++
Sbjct: 140 QLQELERISGLTAEEAKEILLE-------KVEEEARHEAA 172



 Score = 46.3 bits (111), Expect = 2e-05
 Identities = 42/151 (27%), Positives = 84/151 (55%), Gaps = 26/151 (17%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKK--KDKKDK-DKIKNKEK---DKEKKVDKEKEKGKKE 175
           K+ ++IKK+     K+E  K++ +  K+ +++ ++++  EK    KE+ +D++ E  +K 
Sbjct: 49  KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108

Query: 176 EKE---KKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK--KK 228
           E+E   K++E E K +  E KE+ +E++ +E+   E E+ S      L  E+ K+   +K
Sbjct: 109 EEELEKKEKELEQKQQELEKKEEELEELIEEQLQ-ELERIS-----GLTAEEAKEILLEK 162

Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRES 259
            +E + HE  V+ K       ++EE+ K E+
Sbjct: 163 VEEEARHEAAVLIK-------EIEEEAKEEA 186


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 50.0 bits (120), Expect = 2e-06
 Identities = 42/219 (19%), Positives = 75/219 (34%), Gaps = 17/219 (7%)

Query: 59   PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKD----------VELGVTPM--V 106
              SK K K S   K   K+K   K+K+S  + + + V             +L   P    
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKK 1217

Query: 107  PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK--- 163
             N       +D  +  K K     +   KK    K  +  D+   D +  + K K     
Sbjct: 1218 SNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKR 1277

Query: 164  --KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
               V        K    +           K+K  +++       +K+KKS+ K  + KK 
Sbjct: 1278 VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKS 1337

Query: 222  KLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 260
            K + K+    +SS       K      ++ ++ ++ + S
Sbjct: 1338 KTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDS 1376



 Score = 42.3 bits (100), Expect = 5e-04
 Identities = 35/170 (20%), Positives = 58/170 (34%), Gaps = 10/170 (5%)

Query: 116  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
             +  EI K++ +K   K    K ++   KKK+KK     K    DK KK        + +
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKK----KKSSADKSKKASVVGNSKRVD 1202

Query: 176  EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
              EK++  +    +           +E+   K KKS  K  K KK    K  ++ +  S 
Sbjct: 1203 SDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSS 1262

Query: 236  EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTG 285
            +         K     +  +  + SP     P K P       +      
Sbjct: 1263 DD---LSKEGKPKNAPKRVSAVQYSPP---PPSKRPDGESNGGSKPSSPT 1306



 Score = 38.9 bits (91), Expect = 0.005
 Identities = 33/155 (21%), Positives = 48/155 (30%), Gaps = 5/155 (3%)

Query: 121  IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
              K K+ K  K   K K+KEK KKK       K       K    D EK K   +   KK
Sbjct: 1161 KTKGKASKLRKP--KLKKKEKKKKKSSADKSKKASVVGNSKRVDSD-EKRKLDDKPDNKK 1217

Query: 181  EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
                   +ED E+   K +K      K KK+         ++       KE     K   
Sbjct: 1218 SNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKP--KNAP 1275

Query: 241  PKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPP 275
             +++    +      + +       KP        
Sbjct: 1276 KRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKV 1310



 Score = 36.6 bits (85), Expect = 0.028
 Identities = 27/159 (16%), Positives = 49/159 (30%), Gaps = 5/159 (3%)

Query: 57   TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
             P  S +K   S K+ +S   + + +  +    KE       +      V       S  
Sbjct: 1234 KPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPK--RVSAVQYSPPPPSKR 1291

Query: 117  DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
              GE           K   KK  E       KK K + K   K   KK     ++    +
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARK---KKSKTRVKQASASQ 1348

Query: 177  KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 215
              +   +  K K D     +   + +   +++ +  + D
Sbjct: 1349 SSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387



 Score = 36.6 bits (85), Expect = 0.034
 Identities = 32/119 (26%), Positives = 59/119 (49%), Gaps = 5/119 (4%)

Query: 144  KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
            K+K +K   +++ KEK+ EK  +   +    E+ +K EE    ++E +E   ++I KE++
Sbjct: 1101 KEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEE---ALEEQEEVEEKEIAKEQR 1157

Query: 204  DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 262
               K K      K  K +  KK+KK+K+ S+ + +    +      D +EK K +  P 
Sbjct: 1158 LKSKTKGK--ASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPD 1214



 Score = 33.9 bits (78), Expect = 0.23
 Identities = 36/186 (19%), Positives = 78/186 (41%), Gaps = 20/186 (10%)

Query: 108  NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
             I      + + E+KK   ++      KK EK   ++++  ++ D+  +++ ++E     
Sbjct: 1027 VITNAKKKDLVKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAV 1086

Query: 168  E-------------KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
                          KEK +K   E  E+KE ++++ K    + +  E+ D  +E   + +
Sbjct: 1087 SYDYLLSMPIWSLTKEKVEKLNAEL-EKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQE 1145

Query: 215  DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPP 274
            + + K+   +++ K K +    K   PKL        +EK K++SS     K     +  
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLK------KKEKKKKKSSADKSKKASVVGNSK 1199

Query: 275  PVAAAD 280
             V + +
Sbjct: 1200 RVDSDE 1205


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 47.7 bits (113), Expect = 1e-05
 Identities = 35/132 (26%), Positives = 73/132 (55%), Gaps = 2/132 (1%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
           + K          +  +  K+ K   ++ +    ++++ K +  K++ ++ D E+K+ + 
Sbjct: 261 LEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 320

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK--DKDKKLKKEKLKKK 226
           +++ KK EKE K+EKE   + +KE    +I++E ++ E+E+  K  +K ++L++E L KK
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKK 380

Query: 227 KKEKERSSHEKE 238
           K E ER S   +
Sbjct: 381 KLESERLSSAAK 392



 Score = 47.7 bits (113), Expect = 1e-05
 Identities = 33/145 (22%), Positives = 77/145 (53%), Gaps = 3/145 (2%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           E+   +  D     +++    +E  + ++++ +  K +++ KE++   +V KE ++ +KE
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELE-KEEEILAQVLKENKEEEKE 281

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           +K ++EE +   KE++E   E ++ E  + D E++ K  +K+ K  +++LKK+K+E E  
Sbjct: 282 KKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEEL 341

Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRE 258
             E + +         + E+  K +
Sbjct: 342 EKELKELEIKREAEEEEEEQLEKLQ 366



 Score = 45.3 bits (107), Expect = 5e-05
 Identities = 24/145 (16%), Positives = 74/145 (51%), Gaps = 3/145 (2%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
              +  + + +K  +   +  K+ ++E+ +KK  +++   +  +E++ + ++ K + +  
Sbjct: 252 EEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKV 311

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
            +E++ KE ++   K +KE   +K ++E ++ EKE K  +  ++ ++E+ ++ +K +E+ 
Sbjct: 312 DDEEKLKESEKELKKLEKEL--KKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKL 369

Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRE 258
              +E +     K  ++      + 
Sbjct: 370 EQLEEEL-LAKKKLESERLSSAAKL 393



 Score = 45.0 bits (106), Expect = 7e-05
 Identities = 36/127 (28%), Positives = 70/127 (55%), Gaps = 3/127 (2%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           + L +  K+K  +++K   KK+E+E+ K +   K+K+  + +EK ++ KV++EKE+  K 
Sbjct: 740 KLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKA 799

Query: 176 EKEKKEEKEFKMK---EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           ++E+    E ++K   E  E+    I +EEK  E+E +    + K +++  K  ++E ER
Sbjct: 800 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELER 859

Query: 233 SSHEKEV 239
              E   
Sbjct: 860 LEEEITK 866



 Score = 43.8 bits (103), Expect = 1e-04
 Identities = 33/166 (19%), Positives = 81/166 (48%), Gaps = 3/166 (1%)

Query: 76  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE---DLGEIKKDKSIKKDKK 132
           +E++ S ++   KE+E       E         + +        +  E+K +    + +K
Sbjct: 251 QEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 310

Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
              +++ ++ +K+  K +K+  K KE+ +E + + ++ + K+E +E++EE+  K++E  E
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLE 370

Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           +  E++  ++K   +   S  K K+ + E   +++KE +      E
Sbjct: 371 QLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSE 416



 Score = 43.8 bits (103), Expect = 2e-04
 Identities = 29/150 (19%), Positives = 67/150 (44%), Gaps = 1/150 (0%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
             KK++  K  ++T    E   I  ++ K  + K+K + K   +    +++   +EE   
Sbjct: 170 RKKKERLKKLIEETENLAELI-IDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLL 228

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
             +     +E  +   E +R E+++ E  K+  +K++++  + LK+ K+E++    ++E 
Sbjct: 229 YLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEE 288

Query: 240 IPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
           +  L  +      E  K E       + +K
Sbjct: 289 LKLLAKEEEELKSELLKLERRKVDDEEKLK 318



 Score = 41.9 bits (98), Expect = 6e-04
 Identities = 30/131 (22%), Positives = 74/131 (56%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
            ++  +  +L + +  +  ++ KK  ++ ++E  K K +K++    K +E   +   + +
Sbjct: 681 ELQEKAESELAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELK 740

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
             + K +EKE++EEK    KE++E+   ++  +EK+  +E++  +K K  ++++ K K +
Sbjct: 741 LLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQ 800

Query: 229 EKERSSHEKEV 239
           E+E  + E+E+
Sbjct: 801 EEELRALEEEL 811



 Score = 41.5 bits (97), Expect = 9e-04
 Identities = 28/126 (22%), Positives = 57/126 (45%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
               L E  K        K   + E+E +      K  ++  +  ++  +   +E E  K
Sbjct: 199 QELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSK 258

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           +E ++++E     +KE+KE+  EK  +EE+     K+ ++   +L K + +K   E++  
Sbjct: 259 QELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLK 318

Query: 234 SHEKEV 239
             EKE+
Sbjct: 319 ESEKEL 324



 Score = 41.1 bits (96), Expect = 0.001
 Identities = 29/127 (22%), Positives = 68/127 (53%), Gaps = 8/127 (6%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
             +K+ + +++K    K E+EK +K K ++++ +   +E  +E ++ +E++   ++E++ 
Sbjct: 772 LKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKI 831

Query: 180 KEEK--------EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           KEE+        + + K +K    E  R EE+ +++E   +   K+ + E+ K K + + 
Sbjct: 832 KEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELES 891

Query: 232 RSSHEKE 238
           +   EKE
Sbjct: 892 KEEKEKE 898



 Score = 40.7 bits (95), Expect = 0.002
 Identities = 22/134 (16%), Positives = 56/134 (41%), Gaps = 1/134 (0%)

Query: 120  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
            +   ++   + ++   ++    +K + + ++    +  EK+KE+   +E+E+  K     
Sbjct: 910  DNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLA 969

Query: 180  KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
            KEE          +  EK  +  KD  K+++ +++ K+L +E +++  +  +    E  V
Sbjct: 970  KEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKE-FLELFV 1028

Query: 240  IPKLTFKFGTDMEE 253
                         E
Sbjct: 1029 SINRGLNKVFFYLE 1042



 Score = 39.6 bits (92), Expect = 0.003
 Identities = 27/143 (18%), Positives = 68/143 (47%), Gaps = 4/143 (2%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-DKEKKVDKEKEKGKK---EE 176
            ++ + ++++    +  ++  +K+++ ++ K K + + K +KEK+  KE E+  +     
Sbjct: 854 EEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDNLL 913

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
           +EK+ E E ++ E+    ++   + E+   +E   K+K++  K+E+ ++ K+        
Sbjct: 914 EEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEEL 973

Query: 237 KEVIPKLTFKFGTDMEEKTKRES 259
             V      +F    E   K E 
Sbjct: 974 GNVNLMAIAEFEEKEERYNKDEL 996



 Score = 39.2 bits (91), Expect = 0.005
 Identities = 26/140 (18%), Positives = 63/140 (45%), Gaps = 2/140 (1%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           E++K+    + K+  +++E+E+++K ++K ++ + +   K K +          KE  E+
Sbjct: 340 ELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKE--EE 397

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
            E K  + KE K       ++E+   E++K+     ++L++    K+ K  E     ++ 
Sbjct: 398 LELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQ 457

Query: 240 IPKLTFKFGTDMEEKTKRES 259
             KL        + +   + 
Sbjct: 458 ALKLLKDKLELKKSEDLLKE 477



 Score = 36.5 bits (84), Expect = 0.033
 Identities = 20/121 (16%), Positives = 58/121 (47%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
           +EK KKK+  K   +      +    +++ K +  K +++ K+  E+   ++K +  E+ 
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226

Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
                  +  ++  D  ++L +++ ++ +  K+    E+E++ ++  +   + +EK  +E
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQE 286

Query: 259 S 259
            
Sbjct: 287 E 287



 Score = 35.7 bits (82), Expect = 0.049
 Identities = 32/131 (24%), Positives = 70/131 (53%), Gaps = 14/131 (10%)

Query: 116  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
             +  ++K +   K++K+  +KKE E+  +K +  ++ + + +E+  E+ +   K + + E
Sbjct: 880  LEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPE 939

Query: 176  E------KEKKEEKEFKMKEDKEKPVEKIRKEEKDS--------EKEKKSKDKDKKLKKE 221
            E       EK++E++ K +E++      + KEE  +         +EK+ +    +LKKE
Sbjct: 940  ELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKE 999

Query: 222  KLKKKKKEKER 232
            +L+++KKE  R
Sbjct: 1000 RLEEEKKELLR 1010



 Score = 29.6 bits (66), Expect = 4.4
 Identities = 21/106 (19%), Positives = 37/106 (34%), Gaps = 4/106 (3%)

Query: 165 VDKEKEKGKKEEKEKKEEKEFKMKED----KEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
             +EK K K+  K+  EE E   +      + K  E   KE+     E     +  +L++
Sbjct: 165 GSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEE 224

Query: 221 EKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 266
           E L      K        +   L  +       K + E   +I+ +
Sbjct: 225 ENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270



 Score = 29.2 bits (65), Expect = 5.9
 Identities = 21/141 (14%), Positives = 66/141 (46%)

Query: 70   KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
            +++    E  S +EK   ++KE  +    +  +      I +  + E +  +K +   ++
Sbjct: 881  EEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEE 940

Query: 130  DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
                   +++++   K+++++++K     K++   V+       +E++E+  + E K + 
Sbjct: 941  LLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKER 1000

Query: 190  DKEKPVEKIRKEEKDSEKEKK 210
             +E+  E +R+  +++ +  K
Sbjct: 1001 LEEEKKELLREIIEETCQRFK 1021


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 47.2 bits (113), Expect = 1e-05
 Identities = 26/161 (16%), Positives = 64/161 (39%), Gaps = 6/161 (3%)

Query: 113 TSSEDLGEIKKDKSIKKDKKTHK----KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
            SS++L E  K+  I  + K+H     ++E  K + K++ +++ K + +E    +  ++ 
Sbjct: 15  VSSKELLEKLKELGI--EVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEA 72

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           K +       ++  +     E   +P E       ++   +    K  K KK    KKKK
Sbjct: 73  KAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKK 132

Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
            K +++   +       +       + +++   +   + + 
Sbjct: 133 PKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIP 173



 Score = 40.2 bits (95), Expect = 0.002
 Identities = 25/140 (17%), Positives = 53/140 (37%), Gaps = 5/140 (3%)

Query: 62  KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
           +   K   K +   + K  ++E  + + +E +K +                  +      
Sbjct: 41  EEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAA----- 95

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
            +    +  +       + K KK   KK   K K K+  ++     ++ KG K  + ++ 
Sbjct: 96  ARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRG 155

Query: 182 EKEFKMKEDKEKPVEKIRKE 201
            +  + K+ K+KP EKI +E
Sbjct: 156 RRRRRKKKKKQKPTEKIPRE 175



 Score = 38.7 bits (91), Expect = 0.007
 Identities = 21/131 (16%), Positives = 52/131 (39%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           K  + E+     ++ +  + ++  K +       ++  +     +   +  E +  +  E
Sbjct: 49  KREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAE 108

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
              +  K KK  K+ K  + K+K  ++         K  K + + +  ++ + KKKK++ 
Sbjct: 109 AAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKP 168

Query: 231 ERSSHEKEVIP 241
                 + VIP
Sbjct: 169 TEKIPREVVIP 179



 Score = 36.0 bits (84), Expect = 0.046
 Identities = 17/109 (15%), Positives = 36/109 (33%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
            K    +  KE     K+  +K K    E        +E+E  K+E K + EE+     E
Sbjct: 2   SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAE 61

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           +      +   + + +      +  +     E   +  +++     E  
Sbjct: 62  EAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAA 110



 Score = 35.6 bits (83), Expect = 0.050
 Identities = 18/121 (14%), Positives = 41/121 (33%), Gaps = 10/121 (8%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKD-----KKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
               + +  K      KE ++K K+     K     ++ +E  KE     E ++  +EE 
Sbjct: 2   SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKE-----EAKREAEEEA 56

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           + + E+    + ++E   E       +   E  +  +      E    +  E      + 
Sbjct: 57  KAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKA 116

Query: 238 E 238
           +
Sbjct: 117 K 117



 Score = 31.0 bits (71), Expect = 1.4
 Identities = 21/132 (15%), Positives = 46/132 (34%), Gaps = 3/132 (2%)

Query: 152 DKIKNKEKDKEKKVDKEK--EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
            K++  E  KE  V  ++  EK K+   E K       +E+  K   K R+ E++++ E 
Sbjct: 2   SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAK-REAEEEAKAEA 60

Query: 210 KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
           +     +  ++ K +         + E     +   +   D   +    ++ +   K   
Sbjct: 61  EEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAA 120

Query: 270 SPSPPPVAAADE 281
                P     +
Sbjct: 121 KKKKGPKPKKKK 132



 Score = 30.6 bits (70), Expect = 2.2
 Identities = 19/151 (12%), Positives = 48/151 (31%), Gaps = 4/151 (2%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
           K ++ +   +      +  EK KE  ++ +      EE+E ++E+  +  E++ K     
Sbjct: 3   KVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKA---- 58

Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
             EE  + + ++    +        +  +      +  +    +              ++
Sbjct: 59  EAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKK 118

Query: 259 SSPKIVIKPVKSPSPPPVAAADEYDTGDSKQ 289
           ++ K      K   P   AA         K 
Sbjct: 119 AAKKKKGPKPKKKKPKRKAARGGKRGKGGKG 149


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 45.9 bits (109), Expect = 3e-05
 Identities = 28/120 (23%), Positives = 49/120 (40%), Gaps = 1/120 (0%)

Query: 74  SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 133
           S     + K K  P    YS V  +   +           ++    + KK+    + KK 
Sbjct: 10  SFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKK 69

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
            +KK+K+K +KK+ K +  + K   K  +K    +K+  K +  E  +    K+ E  EK
Sbjct: 70  SEKKKKKKKEKKEPKSEG-ETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEK 128



 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 32/124 (25%), Positives = 46/124 (37%), Gaps = 4/124 (3%)

Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
           T    L  I    S              + + K       K K ++K+ E K   EK+K 
Sbjct: 16  TQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKK 75

Query: 173 KK-EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           KK E+KE K E E K+     K     +K +K   K K ++D D    K     +K    
Sbjct: 76  KKKEKKEPKSEGETKLGFKTPKKS---KKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132

Query: 232 RSSH 235
             +H
Sbjct: 133 IGAH 136



 Score = 37.8 bits (88), Expect = 0.010
 Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 2/94 (2%)

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           ++   +             KE      E+  K    S K+ K +DK+ + KK+  KKKKK
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76

Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 262
           +KE+   + E   KL FK  T  + K  ++  PK
Sbjct: 77  KKEKKEPKSEGETKLGFK--TPKKSKKTKKKPPK 108



 Score = 37.4 bits (87), Expect = 0.011
 Identities = 26/118 (22%), Positives = 50/118 (42%), Gaps = 9/118 (7%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
               + +K K          +    K+      + +NK      K DK+++K  + +K+ 
Sbjct: 11  FFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKS 70

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           +++K       K+K  +K  K E +++   K+  K KK KK+  K K  E   ++  K
Sbjct: 71  EKKK-------KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNK 121



 Score = 37.4 bits (87), Expect = 0.014
 Identities = 28/126 (22%), Positives = 48/126 (38%), Gaps = 10/126 (7%)

Query: 140 EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR 199
           +K K +        +    K+      +E+ K       KK++KE K  E K+K  +K  
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTST-KKDKKEDKNNESKKKSEKK-- 73

Query: 200 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEE-KTKRE 258
                 +K+KK K + K   + KL  K  +K + + +K   PK          +     E
Sbjct: 74  ------KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAE 127

Query: 259 SSPKIV 264
            S   +
Sbjct: 128 KSNVYI 133



 Score = 35.1 bits (81), Expect = 0.065
 Identities = 25/113 (22%), Positives = 47/113 (41%), Gaps = 4/113 (3%)

Query: 154 IKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
            +  +      +        KE      E+E K+     K   K +KE+K++E +KKS+ 
Sbjct: 16  TQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTK---KDKKEDKNNESKKKSEK 72

Query: 214 KDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 266
           K KK KKEK + K + + +   +     K T K     +     +++   + +
Sbjct: 73  K-KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAE 124



 Score = 31.6 bits (72), Expect = 0.83
 Identities = 18/109 (16%), Positives = 39/109 (35%), Gaps = 8/109 (7%)

Query: 66  KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 125
            V  K+  S   +  +K  T+  +K+  + K+ E        +  K    +   + KK+ 
Sbjct: 32  LVLSKEILSTFSEEENKVATTSTKKDKKEDKNNE--------SKKKSEKKKKKKKEKKEP 83

Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
             + + K   K  K+  K KK        ++ +    K  +  ++    
Sbjct: 84  KSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132



 Score = 30.5 bits (69), Expect = 1.7
 Identities = 18/98 (18%), Positives = 37/98 (37%), Gaps = 3/98 (3%)

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           ++ K +     +       K +     EE   E +  +    K  K++K  + KK+ E+ 
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEE---ENKVATTSTKKDKKEDKNNESKKKSEKK 73

Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSP 271
             +K+   +   +  T +  KT ++S       P   P
Sbjct: 74  KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 46.2 bits (110), Expect = 3e-05
 Identities = 26/122 (21%), Positives = 62/122 (50%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
           GEI   +   ++ +   ++ +E++++ K+K +  K + +E++   +  ++     +E KE
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE 361

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           + EEK   + E+ E+  E +R+E  + E E      + +  K +++  ++  ER S   E
Sbjct: 362 ELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421

Query: 239 VI 240
            +
Sbjct: 422 DL 423



 Score = 42.4 bits (100), Expect = 5e-04
 Identities = 24/129 (18%), Positives = 65/129 (50%), Gaps = 7/129 (5%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           + + ++   K    +KE E+++++  + +++  + +E+ +E + + E+ K + EE  ++ 
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283

Query: 182 EK-------EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
           E+         +  E+ E  +  +R+  ++ E E +  ++  +  KEK++  K+E E   
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERE 343

Query: 235 HEKEVIPKL 243
              E + +L
Sbjct: 344 TLLEELEQL 352



 Score = 39.7 bits (93), Expect = 0.003
 Identities = 22/123 (17%), Positives = 66/123 (53%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
             L E++++    +++    ++  E+++++ +  ++   K KE+ +E +  ++  + + E
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
           E E++ E+  +  +  E+ +E + +  +  E+E +  +++ +  +EKL + ++E E    
Sbjct: 797 ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK 856

Query: 236 EKE 238
           E E
Sbjct: 857 ELE 859



 Score = 39.3 bits (92), Expect = 0.004
 Identities = 20/131 (15%), Positives = 67/131 (51%), Gaps = 9/131 (6%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-------EKDKEKKVDKE 168
           E+L  ++++  +++ ++  ++ EKE  + K + ++  +   +        K++ ++++ E
Sbjct: 246 EELSRLEEE--LEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGE 303

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
               ++  +E + E E   +  +E   +    +E+  E+E   ++ ++ L + +  K++ 
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363

Query: 229 EKERSSHEKEV 239
           E++ S+  +E+
Sbjct: 364 EEKLSALLEEL 374



 Score = 38.2 bits (89), Expect = 0.009
 Identities = 28/113 (24%), Positives = 61/113 (53%), Gaps = 8/113 (7%)

Query: 131 KKTHKKKEK--EKIKKKKDKKDK--DKIKNKEKDKEK-KVDKEKEKGKKEEKEKKEEKEF 185
            K  ++KE+   K+++ ++  ++  D ++  EK  EK +   EK +  +E K +  E E 
Sbjct: 168 SKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL 227

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            +   K   ++++RKE ++ E+E    +++ +  +E+L++ +KE E    E E
Sbjct: 228 ALLLAK---LKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELE 277



 Score = 37.8 bits (88), Expect = 0.014
 Identities = 21/126 (16%), Positives = 73/126 (57%), Gaps = 1/126 (0%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            E+L  +++  +  K++    +++++ ++++ ++ +++  + + +    + + E  + ++
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRR 823

Query: 175 EEKEKK-EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           E  E++ EE E +++E +EK  E   + E+  ++ ++ K++ ++L+ EK + + + KE  
Sbjct: 824 ERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELE 883

Query: 234 SHEKEV 239
             ++E+
Sbjct: 884 EEKEEL 889



 Score = 36.2 bits (84), Expect = 0.036
 Identities = 21/127 (16%), Positives = 66/127 (51%), Gaps = 4/127 (3%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
             + L E+++     + +    K+E   ++++  ++ + +++  E++ E+  ++ +E   
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEEL-EQLQSRLEELEEELEELEEELEEL-- 756

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLKKEKLKKKKKEKER 232
           +E  E+ EE+   ++E   K  E+I + E+  +  ++  ++ +++L++ + +    E+E 
Sbjct: 757 QERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALEREL 816

Query: 233 SSHEKEV 239
            S E+  
Sbjct: 817 ESLEQRR 823



 Score = 36.2 bits (84), Expect = 0.037
 Identities = 21/118 (17%), Positives = 68/118 (57%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            E+L E ++     + +    ++ +E+++++ ++ +++  + +EK  E + + E+ + + 
Sbjct: 799 EEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKEL 858

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           EE +++ E+    KE+ E  ++++ +E+++ E+E +  + +    KE+++K ++  E 
Sbjct: 859 EELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916



 Score = 36.2 bits (84), Expect = 0.037
 Identities = 25/124 (20%), Positives = 69/124 (55%), Gaps = 7/124 (5%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-DKIKNKEKDKEKKVDKEKEKGKK 174
             L   +++    +D     +K+ EK++++ +K ++  ++K + ++ E  +   K K  +
Sbjct: 179 RKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELR 238

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKE-----KLKKKKK 228
           +E E+ EE+  +++E+ E+  E++ + EK+  E + + ++  ++L++      +LK++ +
Sbjct: 239 KELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIE 298

Query: 229 EKER 232
           E E 
Sbjct: 299 ELEG 302



 Score = 35.8 bits (83), Expect = 0.045
 Identities = 23/124 (18%), Positives = 65/124 (52%), Gaps = 6/124 (4%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
           L E+++     K +    ++E E+++     ++ ++   + +++ +++ +  E+  +EE 
Sbjct: 711 LEELERQLEELKRELAALEEELEQLQS--RLEELEEELEELEEELEELQERLEE-LEEEL 767

Query: 178 EKKEEKEFKMKEDKEKP---VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
           E  EE   K+KE+ E+     + +++E ++ E+E +  ++     + +L+  ++ +ER  
Sbjct: 768 ESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLE 827

Query: 235 HEKE 238
            E E
Sbjct: 828 QEIE 831



 Score = 35.8 bits (83), Expect = 0.048
 Identities = 21/127 (16%), Positives = 61/127 (48%), Gaps = 4/127 (3%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKVDKEKEKGK 173
           E+L E +      +      ++ KE++++K     ++ +++    +++  +++ E  + +
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIR 396

Query: 174 KEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
            E +E K E E   +  E   + +E +++E K+ E E +    + +   E+L++ +++ E
Sbjct: 397 NELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLE 456

Query: 232 RSSHEKE 238
                 +
Sbjct: 457 ELRDRLK 463



 Score = 34.7 bits (80), Expect = 0.12
 Identities = 25/130 (19%), Positives = 64/130 (49%), Gaps = 11/130 (8%)

Query: 118 LGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
           L E++K  + +++  +  K +  +++K +  + +   +  K K+  K++++ +E+  + E
Sbjct: 195 LEELEKQLEKLERQAE--KAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLE 252

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD--------KDKKLKKEKLKKKKK 228
           +E +E +E   + +KE    K   EE   E E+  ++        ++ + +   L+++ +
Sbjct: 253 EELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLE 312

Query: 229 EKERSSHEKE 238
           E E    E E
Sbjct: 313 ELENELEELE 322



 Score = 33.1 bits (76), Expect = 0.36
 Identities = 20/121 (16%), Positives = 64/121 (52%), Gaps = 1/121 (0%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           E+K  K+  +  +   ++ + ++++ + + ++ K +    ++E +  + + +  +EE E+
Sbjct: 689 ELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEE 748

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKERSSHEKE 238
            EE+  +++E  E+  E++   E+   K K+  ++ + K +  + + ++ E+E    E+ 
Sbjct: 749 LEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERR 808

Query: 239 V 239
           +
Sbjct: 809 L 809



 Score = 32.8 bits (75), Expect = 0.46
 Identities = 27/121 (22%), Positives = 66/121 (54%), Gaps = 1/121 (0%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
            E++  +  ++  +   ++ +E+I++ ++K D+ + + +E +KE +  KE+ +  + EKE
Sbjct: 814 RELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE 873

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            + E E K  E++++ +E+  +E +    E K + +  + + E+L+ K +  E    E E
Sbjct: 874 -ELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELE 932

Query: 239 V 239
            
Sbjct: 933 E 933



 Score = 32.4 bits (74), Expect = 0.63
 Identities = 16/120 (13%), Positives = 69/120 (57%), Gaps = 4/120 (3%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
            E+   +   +  ++  ++ +E++++ +++   ++++ + ++ E++++  +E   K ++E
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEE--LEELQERLEELEEELESLEEALAKLKEE 780

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            +E +E +    +E  +E++ +E +++E+   + +++ +  +++ ++ ++E E    E E
Sbjct: 781 IEELEEKRQALQEE--LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIE 838



 Score = 31.6 bits (72), Expect = 0.94
 Identities = 19/64 (29%), Positives = 37/64 (57%), Gaps = 3/64 (4%)

Query: 172 GKKEEKEKKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
           G  + KE+KEE E K++  E+  + +E +  EE + + EK  +  +K  + ++LK + +E
Sbjct: 166 GVSKYKERKEEAERKLERTEENLERLEDLL-EELEKQLEKLERQAEKAERYQELKAELRE 224

Query: 230 KERS 233
            E +
Sbjct: 225 LELA 228



 Score = 31.6 bits (72), Expect = 1.2
 Identities = 15/118 (12%), Positives = 51/118 (43%), Gaps = 7/118 (5%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKE----KEKIKK---KKDKKDKDKIKNKEKDKEKKVDK 167
             +L E+K++    +++     +     KE++K+   + ++   +  +  E+ +E +   
Sbjct: 396 RNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
           E+ + + +E E++  +  +  +  EK +  +       E E+++    + + +     
Sbjct: 456 EELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESG 513



 Score = 30.5 bits (69), Expect = 2.6
 Identities = 22/94 (23%), Positives = 59/94 (62%), Gaps = 3/94 (3%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           E++ E+++     +++    +KE E++K++ ++ + +K   + +D+ K++++EKE+ ++E
Sbjct: 835 EEIEELEEKLDELEEELEELEKELEELKEELEELEAEK--EELEDELKELEEEKEELEEE 892

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
            +E + E   ++KE+ EK  E++ + E   E+ +
Sbjct: 893 LRELESELA-ELKEEIEKLRERLEELEAKLERLE 925



 Score = 29.7 bits (67), Expect = 3.6
 Identities = 24/122 (19%), Positives = 68/122 (55%), Gaps = 11/122 (9%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           E L E++++    + +    K+E E+++ +K++ + +  + +E+  ++++++E  + + E
Sbjct: 842 EKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEE--KEELEEELRELESE 899

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRK--------EEKDSEKEKKSKDKDKKLKKEKLKKKK 227
             E KEE E K++E  E+   K+ +        EE+  E+ + + + + + + E+L+++ 
Sbjct: 900 LAELKEEIE-KLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEI 958

Query: 228 KE 229
           + 
Sbjct: 959 EA 960



 Score = 29.7 bits (67), Expect = 3.7
 Identities = 18/129 (13%), Positives = 62/129 (48%), Gaps = 26/129 (20%)

Query: 129  KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-----DKEKKVDKEKEKGKKEEKEKKEEK 183
            K+ +  K++ +E++++ + +  + K + ++      + E K+++ + +  + E+E +EE 
Sbjct: 880  KELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEY 939

Query: 184  EFKMKEDKEKPVEKI---------------------RKEEKDSEKEKKSKDKDKKLKKEK 222
            E  ++ + E+ +E++                      +  ++ + +++  ++ K+   E 
Sbjct: 940  EDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEV 999

Query: 223  LKKKKKEKE 231
            +++  KEK 
Sbjct: 1000 IEELDKEKR 1008


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 45.6 bits (108), Expect = 4e-05
 Identities = 25/97 (25%), Positives = 47/97 (48%), Gaps = 12/97 (12%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
            K  + +K +      K +E +K+   K            K  +V  E +   +E + K 
Sbjct: 350 TKLYEEVKSNTDLSGDKRQELLKEYNKKLQD------YTKKLGEVKDETDA-SEEAEAKA 402

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
           +E++ K +E+++K     +KE+ D +KEK+ KD+ KK
Sbjct: 403 KEEKLKQEENEKK-----QKEQADEDKEKRQKDERKK 434



 Score = 44.0 bits (104), Expect = 1e-04
 Identities = 20/86 (23%), Positives = 48/86 (55%), Gaps = 2/86 (2%)

Query: 142 IKKKKDKKDKDKIKNKEKDKE-KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
            K  ++ K    +   ++ +  K+ +K+ +   K+  E K+E +   +E + K  E+  K
Sbjct: 350 TKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETD-ASEEAEAKAKEEKLK 408

Query: 201 EEKDSEKEKKSKDKDKKLKKEKLKKK 226
           +E++ +K+K+  D+DK+ +++  +KK
Sbjct: 409 QEENEKKQKEQADEDKEKRQKDERKK 434



 Score = 41.7 bits (98), Expect = 5e-04
 Identities = 29/102 (28%), Positives = 52/102 (50%), Gaps = 4/102 (3%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
           KE   I +  D  D   +   +  +E K + +    K++E  K+  K+ +    K   V 
Sbjct: 330 KEAINIARNLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEV- 388

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
              K+E D+ +E ++K K++KLK+E+ +KK+KE+     EK 
Sbjct: 389 ---KDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKR 427



 Score = 37.1 bits (86), Expect = 0.016
 Identities = 21/100 (21%), Positives = 52/100 (52%), Gaps = 4/100 (4%)

Query: 94  KVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK 153
            + D +L +  +     +V S+ DL   K+ + +K+  K  +   K+  + K +    ++
Sbjct: 338 NLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEE 397

Query: 154 IKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
            + K K+++ K ++ ++K K++  E KE++    ++D+ K
Sbjct: 398 AEAKAKEEKLKQEENEKKQKEQADEDKEKR----QKDERK 433



 Score = 33.6 bits (77), Expect = 0.19
 Identities = 16/59 (27%), Positives = 34/59 (57%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
           + L +  K     KD+    ++ + K K++K K+++++ K KE+  E K  ++K++ KK
Sbjct: 376 KKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 45.4 bits (108), Expect = 4e-05
 Identities = 25/93 (26%), Positives = 52/93 (55%), Gaps = 2/93 (2%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           +KS++K ++  K  + +  + KK     + I + ++  + K +++ EK   E KEKK  K
Sbjct: 276 EKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKK--K 333

Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
           E K +E K+K +E++ +  +  E +   K+++K
Sbjct: 334 EKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366



 Score = 40.4 bits (95), Expect = 0.001
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 6/83 (7%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
           K +EK K  K   ++ K      E   + + K+K   E+  EK+  E K+ +KEKK ++K
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340

Query: 215 DKKL------KKEKLKKKKKEKE 231
            KK       + EKL+ +  +KE
Sbjct: 341 KKKQIERLEERIEKLEVQATDKE 363



 Score = 38.1 bits (89), Expect = 0.008
 Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 2/74 (2%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
           K  EK  EK  +K K    + ++ KK    F+M  D ++ ++   K E+D+EK      +
Sbjct: 273 KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKS--KFERDNEKLDAEVKE 330

Query: 215 DKKLKKEKLKKKKK 228
            KK KK++ KKKK+
Sbjct: 331 KKKEKKKEEKKKKQ 344



 Score = 35.4 bits (82), Expect = 0.058
 Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 5/79 (6%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK- 174
           + L    K               K K+ K K ++D +K+  + K+K+K+  KE++K K+ 
Sbjct: 287 KALKYQLKRLKKMILLFEMISDLKRKL-KSKFERDNEKLDAEVKEKKKEKKKEEKKKKQI 345

Query: 175 ---EEKEKKEEKEFKMKED 190
              EE+ +K E +   KE+
Sbjct: 346 ERLEERIEKLEVQATDKEE 364



 Score = 33.5 bits (77), Expect = 0.20
 Identities = 17/60 (28%), Positives = 34/60 (56%), Gaps = 3/60 (5%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK---EEKEKKEEKE 184
              +K   K E++  K   + K+K K K KE+ K+K++++ +E+ +K   +  +K+E K 
Sbjct: 308 DLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENKT 367



 Score = 33.1 bits (76), Expect = 0.27
 Identities = 20/73 (27%), Positives = 36/73 (49%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
            +K      K      +   +  +K K K ++D  K   + KEKK +K+KE+ KK++ E+
Sbjct: 288 ALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347

Query: 180 KEEKEFKMKEDKE 192
            EE+  K++    
Sbjct: 348 LEERIEKLEVQAT 360



 Score = 33.1 bits (76), Expect = 0.27
 Identities = 13/63 (20%), Positives = 34/63 (53%)

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
           +EK +  ++++K  K+  +      +   + + K +  ++KL  E  +KKK++K+    +
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342

Query: 237 KEV 239
           K++
Sbjct: 343 KQI 345



 Score = 30.8 bits (70), Expect = 1.7
 Identities = 19/65 (29%), Positives = 39/65 (60%), Gaps = 7/65 (10%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
            K K   K ++ ++K + E  +KKK+KK       KE+ K+K++++ +E+ +K E +  +
Sbjct: 309 LKRKLKSKFERDNEKLDAEVKEKKKEKK-------KEEKKKKQIERLEERIEKLEVQATD 361

Query: 182 EKEFK 186
           ++E K
Sbjct: 362 KEENK 366


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 45.2 bits (107), Expect = 4e-05
 Identities = 23/109 (21%), Positives = 55/109 (50%), Gaps = 1/109 (0%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
             K  ++ KK  ++ E+ + ++  ++  + +++ +   ++     E+   + EEK+K+ E
Sbjct: 63  AKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAE 122

Query: 183 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
            E K K+  E   +   + EK +++E K + +++   K   + KKK  E
Sbjct: 123 -EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170



 Score = 44.8 bits (106), Expect = 6e-05
 Identities = 25/104 (24%), Positives = 55/104 (52%), Gaps = 1/104 (0%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK-EFK 186
           +  ++  +K+ +++   +K  K  ++   + ++K+K+ ++ K K   E K K E + E K
Sbjct: 84  RAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKK 143

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            KE+ +K  E+  K +  +E +KK+ +  KK + E   K + + 
Sbjct: 144 AKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKA 187



 Score = 42.5 bits (100), Expect = 3e-04
 Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 1/111 (0%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
           K K  ++ K     + K K + + +KK K++ K + +++ K     + K K  E +KK E
Sbjct: 117 KQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE 176

Query: 183 KEFKMKED-KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
            E K K + K K   +  K + ++ K K + +   K + E       E ER
Sbjct: 177 AEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAER 227



 Score = 42.1 bits (99), Expect = 3e-04
 Identities = 27/109 (24%), Positives = 51/109 (46%), Gaps = 1/109 (0%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKVDKEKEKGKKEEKEK 179
            K+ +     +K  K+ E+   + ++ +K  ++ K K   + + K + E EK  KEE +K
Sbjct: 91  QKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           + E+E K K   E   +    ++K   + K   +   K K E+ K K +
Sbjct: 151 QAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAE 199



 Score = 33.3 bits (76), Expect = 0.21
 Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 2/74 (2%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
            K+++ K+ ++  K K   + KKK    KK  +     + + + K   E+ K K E  + 
Sbjct: 144 AKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKA 203

Query: 180 KEEKEFKMKEDKEK 193
           K   E   K + E 
Sbjct: 204 KAAAEAAAKAEAEA 217



 Score = 31.3 bits (71), Expect = 0.94
 Identities = 16/96 (16%), Positives = 49/96 (51%), Gaps = 3/96 (3%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           ++ K     KE++++KK++++ E+  ++++  ++ ++ ++++         + E+   + 
Sbjct: 56  QQQKKPAAKKEQERQKKLEQQAEE-AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQA 114

Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
           E+K K  ++   K K   + K K  +  EK+   + 
Sbjct: 115 EEKQKQAEEA--KAKQAAEAKAKAEAEAEKKAKEEA 148



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 16/94 (17%), Positives = 44/94 (46%), Gaps = 7/94 (7%)

Query: 152 DKIKNKEKDKEKKVDK------EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 205
           ++I+ ++K   KK  +      ++ +  ++++  ++ ++ ++++         + E+   
Sbjct: 53  NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112

Query: 206 EKEKKSK-DKDKKLKKEKLKKKKKEKERSSHEKE 238
           + E+K K  ++ K K+    K K E E     KE
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKE 146


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 45.7 bits (108), Expect = 4e-05
 Identities = 27/99 (27%), Positives = 57/99 (57%), Gaps = 5/99 (5%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK-KEEKEFKMKEDKEKPVEKIRKEE--KD 204
            + +  I N ++ +EK  +K+ +K + E KEK  ++KE  + ++ + PV++   +E  K 
Sbjct: 5   NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64

Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIP 241
           +++ KKS  ++ K   E LK K++ ++   +E  ++ IP
Sbjct: 65  ADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIP 103



 Score = 40.3 bits (94), Expect = 0.002
 Identities = 33/146 (22%), Positives = 67/146 (45%), Gaps = 21/146 (14%)

Query: 62  KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
           K  +K   K++   KEK+  K++    +   S VK+                SS++   +
Sbjct: 20  KDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKE---------------QSSQE--NL 62

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD----KEKKVDKEKEKGKKEEK 177
           K    +KK  K   K+  E +K K++ + + + +  +K     + K+   +K +  K E+
Sbjct: 63  KIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQ 122

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEK 203
            KK+ K F++ E K+ P+ +   E++
Sbjct: 123 AKKQTKLFRLFEPKQLPIYRANGEKE 148



 Score = 36.5 bits (84), Expect = 0.035
 Identities = 21/100 (21%), Positives = 45/100 (45%), Gaps = 6/100 (6%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
            + +      ++ + K  + +  K++++ KEK  +KK +   +      KE+  ++  K 
Sbjct: 5   NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLK- 63

Query: 188 KEDKEKPVEKIRKEEKDSEKE--KKSKDKDKKLKKEKLKK 225
                  V+K  KEE     E  K  ++  K+++ E L+K
Sbjct: 64  ---IADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQK 100



 Score = 33.4 bits (76), Expect = 0.26
 Identities = 28/112 (25%), Positives = 53/112 (47%), Gaps = 5/112 (4%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKK-----DKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
            ++K  +K  + ++ + KEK+  KK     D  D    +   ++  K  D+ K+  K+E 
Sbjct: 17  MQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEES 76

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           K+  E  + K +  KE   E ++K     E ++    K + +K E+ KK+ K
Sbjct: 77  KQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQAKKQTK 128



 Score = 31.1 bits (70), Expect = 1.5
 Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 6/65 (9%)

Query: 201 EEKDSEKEK---KSKDKDKKLKKEKLKKKKK---EKERSSHEKEVIPKLTFKFGTDMEEK 254
              D  +EK   K   K++   KEK+  KK+        S  KE   +   K   ++++ 
Sbjct: 12  NNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKS 71

Query: 255 TKRES 259
           TK ES
Sbjct: 72  TKEES 76



 Score = 29.1 bits (65), Expect = 5.9
 Identities = 28/108 (25%), Positives = 48/108 (44%), Gaps = 8/108 (7%)

Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKVDKEKEK 171
              +D  + +K++   K+K   KK+E          K++   +N +   E KK  KE+ K
Sbjct: 18  QEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEESK 77

Query: 172 ------GKKEEKEKKEEKEFKMKEDKE-KPVEKIRKEEKDSEKEKKSK 212
                   KEE +K+ + E   K     +P E I K+ +D + E+  K
Sbjct: 78  QLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQAKK 125


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 45.6 bits (109), Expect = 5e-05
 Identities = 32/111 (28%), Positives = 60/111 (54%), Gaps = 12/111 (10%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK----KEEKEFKMKED 190
           K +E E + K+ +K  ++  + KEK +E++ DK  E+ +KE ++     K+E +  +KE 
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEE-DKLLEEAEKEAQQAIKEAKKEADEIIKEL 593

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKKKKEKERSSHEKEV 239
           ++      +K    S K  +  +  K+L K  EK +KKKK+++    E +V
Sbjct: 594 RQL-----QKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKV 639



 Score = 33.3 bits (77), Expect = 0.27
 Identities = 30/128 (23%), Positives = 51/128 (39%), Gaps = 21/128 (16%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG------- 172
           EI K+    +       K  E I+ +K     ++ K K+K K+K+  +E + G       
Sbjct: 588 EIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLS 647

Query: 173 ---KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
              K E     ++KE  ++      + K++    D EK +K        KK+K KK K  
Sbjct: 648 LGQKGEVLSIPDDKEAIVQAG----IMKMKVPLSDLEKIQKP-------KKKKKKKPKTV 696

Query: 230 KERSSHEK 237
           K +     
Sbjct: 697 KPKPRTVS 704



 Score = 30.2 bits (69), Expect = 2.5
 Identities = 37/154 (24%), Positives = 65/154 (42%), Gaps = 39/154 (25%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKN--KEKDKEKKVDKEKEKGK------------- 173
           ++KK   ++E++K+ ++ +K+ +  IK   KE D+  K  ++ +KG              
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEAR 613

Query: 174 ---KEEKEKKEEKEFKMKEDKEKPVE----KIRKEEKDSEKEKKSKDKD----------- 215
               +  EKKE+K+ K KE +E+       K     +  E      DK+           
Sbjct: 614 KRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIMKMK 673

Query: 216 ------KKLKKEKLKKKKKEKERSSHEKEVIPKL 243
                 +K++K K KKKKK K      + V  +L
Sbjct: 674 VPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLEL 707


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 45.2 bits (107), Expect = 5e-05
 Identities = 24/103 (23%), Positives = 41/103 (39%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
            +KK H+K +K+  K  K    KD  K K K K+K   K       +     +    +  
Sbjct: 48  INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFA 107

Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
                 + +  K + +  K+K + +K    KK+K+   K E  
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELI 150



 Score = 35.2 bits (81), Expect = 0.073
 Identities = 28/125 (22%), Positives = 47/125 (37%), Gaps = 19/125 (15%)

Query: 135 KKKEKEKIKKKKDKK------DKDKIKNKEKDKEKK--------VDKEKEKGKKEEKEKK 180
            K    KI+ + D K      D +K  +++ DK+ K         D +K K K+++K KK
Sbjct: 26  LKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKK 85

Query: 181 -----EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
                ++ +                      K K   +K KK         KK+K+  S 
Sbjct: 86  KLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSS 145

Query: 236 EKEVI 240
           + E+I
Sbjct: 146 KDELI 150



 Score = 34.8 bits (80), Expect = 0.094
 Identities = 26/102 (25%), Positives = 37/102 (36%), Gaps = 1/102 (0%)

Query: 71  DKTSPKEKMSSKEKTSPKEKEYSKVK-DVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
           DK + K   +   K S K K   K K   +L +     N     ++          S+ +
Sbjct: 57  DKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMR 116

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
             K   +K K+KI   K    K K     KD+  K D  K K
Sbjct: 117 KPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPK 158



 Score = 34.0 bits (78), Expect = 0.16
 Identities = 29/159 (18%), Positives = 49/159 (30%), Gaps = 24/159 (15%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK-------EKGKKEEKEKKE 181
             + T   K    IK  K    K ++++  KD    +D  K       +K KK  K    
Sbjct: 10  SSENTIDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDL 69

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL------KKKKKEKERSSH 235
           +   K K  ++K ++K    + D +    SK+  K+             K K EK +   
Sbjct: 70  KDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKI 129

Query: 236 EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPP 274
                             K K++             + P
Sbjct: 130 TVNKSTN-----------KKKKKVLSSKDELIKYDNNKP 157



 Score = 32.5 bits (74), Expect = 0.48
 Identities = 31/159 (19%), Positives = 51/159 (32%), Gaps = 12/159 (7%)

Query: 70  KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSI-- 127
              +S K ++ S  K S    + +K    +L          K   ++DL + KK K    
Sbjct: 26  LKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNK-----KFNKTDDLKDSKKTKLKQK 80

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           KK KK     +            K           +K   + EK KK+    K       
Sbjct: 81  KKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNK-----ST 135

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
            + K+K +    +  K    + KS      L  ++L   
Sbjct: 136 NKKKKKVLSSKDELIKYDNNKPKSISIHSPLTIQELSTL 174


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 44.8 bits (106), Expect = 7e-05
 Identities = 35/128 (27%), Positives = 55/128 (42%), Gaps = 5/128 (3%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKK--KKDKKDKDKIKNKEKDKEKKVDKEK--EKGK 173
           L   K + SIK +KKT    +    KK    ++ +K+ +      K K    +K   +  
Sbjct: 368 LENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNN 427

Query: 174 KEEKEKKEEK-EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           KE+  KKEE  E +  +  EK  +     +     ++K  +KD +L K  L+K  K K  
Sbjct: 428 KEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNMLQKFNKFKNE 487

Query: 233 SSHEKEVI 240
            S E   I
Sbjct: 488 ESAENSFI 495



 Score = 34.4 bits (79), Expect = 0.11
 Identities = 43/216 (19%), Positives = 74/216 (34%), Gaps = 35/216 (16%)

Query: 48  PQPLVHPPGTPPPSKMKDKVSPKDKTS------PKEKMSSKEKTSPKEKEYSKVKDVELG 101
           P+           +K  + V   +K S       K K+   +K   +  +   +K  EL 
Sbjct: 379 PEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNKEQDIKKEELL 438

Query: 102 VTPMVPNIVKVTSS---EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE 158
               V    K+  +    D  E+ K K   KD +      K  ++K    K+++  +N  
Sbjct: 439 EKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIEL----SKNMLQKFNKFKNEESAENSF 494

Query: 159 KDKEKKVDK-----------------EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 201
            D  K+ D                    E   K  +  +  K    K +K K V  I K+
Sbjct: 495 IDIIKQADLVLWVSPNALILGVEDLGLIELINKVNESFEFIKTIFKKFNKNKLVIAISKK 554

Query: 202 EK----DSEKEKKSKDKDKKLKKEKLKKK-KKEKER 232
           +     + + +K   +    +  + LK   KKEK R
Sbjct: 555 QATKITNIKNQKLKTNIINDVDIDDLKMLIKKEKTR 590


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 43.2 bits (102), Expect = 9e-05
 Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 7/84 (8%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
             D++  KK + ++  + ++ K+++K     +DKE K D EKE  + EE+ ++E      
Sbjct: 45  AADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE------ 98

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKS 211
            E+     EK   EEK     +K 
Sbjct: 99  DEESSDENEKE-TEEKTESNVEKE 121



 Score = 35.5 bits (82), Expect = 0.030
 Identities = 18/97 (18%), Positives = 46/97 (47%), Gaps = 13/97 (13%)

Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
              D+   D+ + K+ D ++  + E+ K +++E    E+KE K   +KE           
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKED---------- 88

Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
              +E + +++++  +     +K+ E++  S+ ++ I
Sbjct: 89  ---EESEEENEEEDEESSDENEKETEEKTESNVEKEI 122



 Score = 34.7 bits (80), Expect = 0.055
 Identities = 17/85 (20%), Positives = 43/85 (50%)

Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
            S +   + ++ K     +    ++ KE+ K+  + +DK+   + EK+ E+  ++ +E+ 
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEK 197
           ++   E ++E E K + + EK +  
Sbjct: 100 EESSDENEKETEEKTESNVEKEITN 124



 Score = 33.9 bits (78), Expect = 0.091
 Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
                   +++ K     +  + E+  ++EKE    E+KE K + E + +E +E+   + 
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEE--NEE 97

Query: 199 RKEEKDSEKEKKSKDKD 215
             EE   E EK++++K 
Sbjct: 98  EDEESSDENEKETEEKT 114



 Score = 32.8 bits (75), Expect = 0.23
 Identities = 25/94 (26%), Positives = 50/94 (53%), Gaps = 9/94 (9%)

Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
              D+     +E  K      + ++  + E+ K+EEKE    EDKE   +K   E++D E
Sbjct: 39  SPSDQAAADEQEAKK-----SDDQETAEIEEVKEEEKEAANSEDKE---DKGDAEKEDEE 90

Query: 207 KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
            E++++ ++ +   ++ +K+ +EK  S+ EKE+ 
Sbjct: 91  SEEENE-EEDEESSDENEKETEEKTESNVEKEIT 123



 Score = 32.0 bits (73), Expect = 0.39
 Identities = 20/84 (23%), Positives = 40/84 (47%), Gaps = 5/84 (5%)

Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
               D  E  + + +K+++K     E ++ K   +K+D++    +E+++E+  +   E  
Sbjct: 51  AKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEES---EEENEEEDEESSDENE 107

Query: 173 KKEEKEKKEEKEFKMKEDKE-KPV 195
           K+ E EK E    K   +   KPV
Sbjct: 108 KETE-EKTESNVEKEITNPSWKPV 130



 Score = 30.9 bits (70), Expect = 0.92
 Identities = 15/72 (20%), Positives = 38/72 (52%), Gaps = 1/72 (1%)

Query: 172 GKKEEKEKKEEKEFKMKEDKE-KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
               ++   +E+E K  +D+E   +E++++EEK++   +  +DK    K+++  +++ E+
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97

Query: 231 ERSSHEKEVIPK 242
           E      E   +
Sbjct: 98  EDEESSDENEKE 109



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 10/66 (15%), Positives = 32/66 (48%)

Query: 166 DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
               ++   +E+E K+  + +  E +E   E+      + +++K   +K+ +  +E+ ++
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97

Query: 226 KKKEKE 231
           + +E  
Sbjct: 98  EDEESS 103


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 44.3 bits (105), Expect = 1e-04
 Identities = 28/111 (25%), Positives = 47/111 (42%), Gaps = 12/111 (10%)

Query: 138  EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
              E + KK ++K K  I + EK           K  + E   +EEKE   + + E   EK
Sbjct: 1494 GNENVNKKINQKKKGFIPSNEK-----------KSIEIENRNQEEKEPAGQGELESDKEK 1542

Query: 198  IRKEEKDSEKEKKSKDKD-KKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 247
                E     ++K+ ++D  +   +K K KK+ K  +  E +   K   +F
Sbjct: 1543 KGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLRF 1593



 Score = 35.1 bits (81), Expect = 0.099
 Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 16/77 (20%)

Query: 122  KKDKSIKKDKKTH------KKKEKEKIKKKKDKKDKDKIK-------NKEKDKEK---KV 165
            KK   I  ++K         ++EKE   + + + DK+K         N+EK+ E+   + 
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564

Query: 166  DKEKEKGKKEEKEKKEE 182
            D +K K KK+ K   E 
Sbjct: 1565 DIKKRKNKKQYKSNTEA 1581



 Score = 34.3 bits (79), Expect = 0.15
 Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 4/82 (4%)

Query: 125  KSIKKDKK----THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
            K I + KK    +++KK  E   + +++K+       E DKEKK + E     +E+  ++
Sbjct: 1500 KKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEE 1559

Query: 181  EEKEFKMKEDKEKPVEKIRKEE 202
            +  E  +K+ K K   K   E 
Sbjct: 1560 DYAESDIKKRKNKKQYKSNTEA 1581



 Score = 31.6 bits (72), Expect = 1.0
 Identities = 16/43 (37%), Positives = 25/43 (58%)

Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
             K  + K  D ++ K K K+KK  K++E+ K+EEK + E  E
Sbjct: 729 NGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 43.8 bits (104), Expect = 1e-04
 Identities = 31/151 (20%), Positives = 79/151 (52%), Gaps = 14/151 (9%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK-- 177
           ++   + + K      KKE E +KK+   + K+++     + E+++ + + + ++ E+  
Sbjct: 26  KLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRL 85

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS--- 234
            ++EE   +  E  +K  E + K+EK+   ++K+ D+ ++  +E + ++++E ER S   
Sbjct: 86  LQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLT 145

Query: 235 --HEKEVIPKLTFKFGTDMEEKTKRESSPKI 263
               KE++ +       ++EE+ + E++  I
Sbjct: 146 QEEAKEILLE-------EVEEEARHEAAKLI 169



 Score = 36.4 bits (85), Expect = 0.024
 Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 5/93 (5%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
           + K    E+   +  E+ KKE +  K+E   + KE+  K   ++ +E K+   E +  ++
Sbjct: 24  EKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLER 83

Query: 215 DKKLKKEKLKKK-----KKEKERSSHEKEVIPK 242
               ++E L +K     KKE+     EKE+  K
Sbjct: 84  RLLQREETLDRKMESLDKKEENLEKKEKELSNK 116



 Score = 36.4 bits (85), Expect = 0.025
 Identities = 31/104 (29%), Positives = 52/104 (50%), Gaps = 7/104 (6%)

Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE-EKEKKEEKEFKMKEDKEKPVEKIRK 200
           ++K+  +K     +   K   ++  KE E  KKE   E KEE   K++ + E+ +++ R 
Sbjct: 18  LRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEV-HKLRAELERELKERRN 76

Query: 201 EEKDSEKEKKSKDKDKKLKKEKLKKK-----KKEKERSSHEKEV 239
           E +  E+    +++    K E L KK     KKEKE S+ EK +
Sbjct: 77  ELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNL 120



 Score = 29.9 bits (68), Expect = 3.3
 Identities = 29/111 (26%), Positives = 62/111 (55%), Gaps = 16/111 (14%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           ++   E+  + K +   KK++   +KKEKE   K+K+      +  KE++ E+ + +++E
Sbjct: 84  RLLQREETLDRKMESLDKKEENL-EKKEKELSNKEKN------LDEKEEELEELIAEQRE 136

Query: 171 K-----GKKEEKEKK---EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
           +     G  +E+ K+   EE E + + +  K +++I +E K+ E +KK+K+
Sbjct: 137 ELERISGLTQEEAKEILLEEVEEEARHEAAKLIKEIEEEAKE-EADKKAKE 186


>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031).  This
           protein is expressed in Plasmodium; its function is
           unknown. It may be the product of gene family pyst-b.
          Length = 228

 Score = 42.3 bits (100), Expect = 2e-04
 Identities = 36/104 (34%), Positives = 49/104 (47%), Gaps = 12/104 (11%)

Query: 130 DKKTHK-----KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           DKKT K     +KE E++KK+ D K     +N E   +   DK   K K E     E ++
Sbjct: 107 DKKTKKLINELRKELEEVKKELDNK-----RNGELAIQPIQDKRIIK-KDENSSVSEHED 160

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           FK  E++E  +E    E   S   KK K   +KLKK + K  K 
Sbjct: 161 FKQLENEENFLETEYNEITSSNNYKKLKIN-RKLKKAEKKLIKS 203


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 43.0 bits (101), Expect = 2e-04
 Identities = 23/111 (20%), Positives = 52/111 (46%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           K  ++  + +E EK  + + K+ +++ +    +++KK +  K K   E  + K   E K 
Sbjct: 112 KAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKK 171

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           K ++     +  K + ++   KK  + + K   EK K + + K ++  + E
Sbjct: 172 KAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222



 Score = 38.8 bits (90), Expect = 0.004
 Identities = 30/133 (22%), Positives = 59/133 (44%), Gaps = 5/133 (3%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK-----EKGKKEEKEKKEEKEFKMKE 189
           K+ EKE++K ++ +K  ++ + + + ++K+ +++      E+ KK E  K +      K 
Sbjct: 104 KQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKL 163

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 249
                 +K  +E   + +E K+K +    KK+   + K   E++  E E   K   K   
Sbjct: 164 KAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEA 223

Query: 250 DMEEKTKRESSPK 262
             EEK   E    
Sbjct: 224 AAEEKAAAEKKKA 236



 Score = 38.0 bits (88), Expect = 0.008
 Identities = 28/116 (24%), Positives = 41/116 (35%), Gaps = 1/116 (0%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           K   +  K     E  K+K   + K K +   K  ++ K   +     KK E E K   E
Sbjct: 146 KKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAE 205

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLKKEKLKKKKKEKERSSHEKEV 239
               E + K   + + E    EK    K K   K K +K     K  ER +    +
Sbjct: 206 KAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAAL 261



 Score = 33.8 bits (77), Expect = 0.16
 Identities = 31/154 (20%), Positives = 68/154 (44%), Gaps = 1/154 (0%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
             I+  + + KK E+++ KK++   ++ K   +  ++E+    EKE+ K +E++K+ E+ 
Sbjct: 65  GRIQSQQSSAKKGEQQRKKKEEQVAEELK-PKQAAEQERLKQLEKERLKAQEQQKQAEEA 123

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
            K  + ++K  E+  ++    +K+K    K K   +    K   E ++ + E     +  
Sbjct: 124 EKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEA 183

Query: 245 FKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAA 278
                    K K E+  K   +  K+ +     A
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKA 217



 Score = 33.8 bits (77), Expect = 0.16
 Identities = 22/94 (23%), Positives = 38/94 (40%), Gaps = 2/94 (2%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           KK +   K  +  K K +    KKK + +  K   ++   E +   + EK  +   E+K 
Sbjct: 171 KKAEEAAKAAEEAKAKAEAAAAKKKAEAEA-KAAAEKAKAEAEAKAKAEKKAEAAAEEKA 229

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 215
             E K    K K  +K     K +E++  +   D
Sbjct: 230 AAEKKKAAAKAK-ADKAAAAAKAAERKAAAAALD 262


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 43.5 bits (103), Expect = 2e-04
 Identities = 35/108 (32%), Positives = 57/108 (52%), Gaps = 6/108 (5%)

Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK-EKEKGKKEEKEKKEEKEF 185
            K D K   ++EKE+ +  K++K   K+K +E+ K+K+++K EK K    E  K++E ++
Sbjct: 545 WKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKY 604

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKS----KDKDKKLKKEKLKKKKKE 229
              ++   P      EE  S+KE+K      DK  KL +E L K  K 
Sbjct: 605 SAFDETGLPTHDADGEEI-SKKERKKLSKEYDKQAKLHEEYLAKGGKS 651



 Score = 36.9 bits (86), Expect = 0.019
 Identities = 28/105 (26%), Positives = 52/105 (49%), Gaps = 16/105 (15%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           K +   + K  DK++  + K +++  +++    K K ++E+K+K+ EK  K K     P 
Sbjct: 538 KPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIP---PA 594

Query: 196 EKIRKEE-------------KDSEKEKKSKDKDKKLKKEKLKKKK 227
           E  +++E              D++ E+ SK + KKL KE  K+ K
Sbjct: 595 EFFKRQEDKYSAFDETGLPTHDADGEEISKKERKKLSKEYDKQAK 639



 Score = 31.2 bits (71), Expect = 1.5
 Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 3/65 (4%)

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
           +K+ + +E   EK  KE     KE+K   K KK +++K K+ +K ++      E   +  
Sbjct: 545 WKLDDKEELQREKEEKEA---LKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQE 601

Query: 245 FKFGT 249
            K+  
Sbjct: 602 DKYSA 606


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 41.6 bits (98), Expect = 2e-04
 Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 2/85 (2%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
           K KK    +E EK+KK+ ++K K   K K+K  +KK DK+K+K   ++ +K E+K+ K  
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQK--WKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118

Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKD 213
           EDK + + K   E   +  E K + 
Sbjct: 119 EDKLEDLTKSYSETLSTLSELKPRK 143



 Score = 41.2 bits (97), Expect = 3e-04
 Identities = 27/71 (38%), Positives = 44/71 (61%)

Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
           K K+K++ +E EK KKE +EK++ K  K K  K+K  +K +K++K  +K +K  +K+ + 
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120

Query: 219 KKEKLKKKKKE 229
           K E L K   E
Sbjct: 121 KLEDLTKSYSE 131



 Score = 35.8 bits (83), Expect = 0.020
 Identities = 24/68 (35%), Positives = 36/68 (52%)

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
           K KK+E  ++ EK  K  E+K+K   K +K +K  +K+K  KD  K  K EK  +K+ E 
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120

Query: 231 ERSSHEKE 238
           +     K 
Sbjct: 121 KLEDLTKS 128



 Score = 35.4 bits (82), Expect = 0.025
 Identities = 19/61 (31%), Positives = 33/61 (54%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           +K K   K KK+ KKK+K+K KK   K DK + K++++ ++K  D  K   +      + 
Sbjct: 80  EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSEL 139

Query: 182 E 182
           +
Sbjct: 140 K 140



 Score = 34.3 bits (79), Expect = 0.057
 Identities = 26/93 (27%), Positives = 50/93 (53%), Gaps = 4/93 (4%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
           +  E  KKKK+  ++ +   KE ++++K   +K+K KK++ + K++K+ K  +  EK  E
Sbjct: 56  EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115

Query: 197 KIRKEEKD----SEKEKKSKDKDKKLKKEKLKK 225
           K  +++ +    S  E  S   + K +K  L K
Sbjct: 116 KEAEDKLEDLTKSYSETLSTLSELKPRKYALHK 148


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 43.1 bits (101), Expect = 3e-04
 Identities = 28/120 (23%), Positives = 63/120 (52%), Gaps = 7/120 (5%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           E  K     K++   K+ + +K ++K D    +  K +++ ++K+ + +      +    
Sbjct: 210 EDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSP 269

Query: 180 KEEKEFKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           KE+K  ++ E++++ +EK + E +K+ E+  K+KD     K   LK++ K  E+ + +KE
Sbjct: 270 KEDK--QVAENQKREIEKAQIEIKKNDEEALKAKDH----KAFDLKQESKASEKEAEDKE 323



 Score = 42.3 bits (99), Expect = 4e-04
 Identities = 31/127 (24%), Positives = 61/127 (48%)

Query: 112 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
           V    D+ ++K+ +S +  K+  + KE+   K+    K + K    + + +K+ D+ ++K
Sbjct: 194 VNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQK 253

Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
            ++ +   K       KEDK+    + R+ EK   + KK+ ++  K K  K    K+E +
Sbjct: 254 QQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESK 313

Query: 232 RSSHEKE 238
            S  E E
Sbjct: 314 ASEKEAE 320



 Score = 35.0 bits (80), Expect = 0.086
 Identities = 23/124 (18%), Positives = 61/124 (49%), Gaps = 4/124 (3%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           K   ++D  + ++D+  +K ++     +       K+  DK   +N++++ EK    E +
Sbjct: 235 KADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKE--DKQVAENQKREIEKA-QIEIK 291

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD-KKLKKEKLKKKKKE 229
           K  +E  + K+ K F +K++ +   ++   +E +++K+++   +D +K K +   +    
Sbjct: 292 KNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSL 351

Query: 230 KERS 233
            E +
Sbjct: 352 NEDA 355



 Score = 28.4 bits (63), Expect = 8.5
 Identities = 22/96 (22%), Positives = 40/96 (41%), Gaps = 9/96 (9%)

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 245
            ++ED EK V   R      E+E +   K  +  KE+L KK+ + +++  + +       
Sbjct: 185 ALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKAD------- 237

Query: 246 KFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADE 281
            F  D  +K + E   K   +    P P   ++  E
Sbjct: 238 -FAQDNADKQRDEVRQKQQ-EAKNLPKPADTSSPKE 271


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 42.7 bits (101), Expect = 4e-04
 Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 4/86 (4%)

Query: 163 KKVDKEKEKGKKEEKEKKE-EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           KK+ +  E  ++EE+   E E   + K  K++      ++     +EK+  DK + L K 
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283

Query: 222 KLKKKKKEKERSSHEKEVIPKLTFKF 247
           ++ K+KK++E    EK   P  T  F
Sbjct: 284 EILKEKKDEELFWFEK---PWPTLLF 306



 Score = 31.9 bits (73), Expect = 0.69
 Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 2/76 (2%)

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
           F  K+ KE    + R+EE D E E  S+ K    K+E+    +++    S EKE   K  
Sbjct: 221 FFTKKLKETSETEEREEETDVEIETTSETK--GTKQEQEGSTEEDPSLFSEEKEDPDKTE 278

Query: 245 FKFGTDMEEKTKRESS 260
                ++ ++ K E  
Sbjct: 279 DLDKLEILKEKKDEEL 294



 Score = 30.4 bits (69), Expect = 2.3
 Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 177 KEKKEEKEFKMKED-KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
            EK     F + E  K K + +   E+K   K     ++  K +++K K K++EKER 
Sbjct: 623 IEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680



 Score = 28.8 bits (65), Expect = 7.8
 Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 1/72 (1%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           KK K   + ++  ++ + E     + K  K + +   ++      +EKE   K E +  +
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTE-DLDK 282

Query: 182 EKEFKMKEDKEK 193
            +  K K+D+E 
Sbjct: 283 LEILKEKKDEEL 294


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 41.9 bits (98), Expect = 4e-04
 Identities = 30/110 (27%), Positives = 47/110 (42%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           K  KK   +++K   KK    K+  K K   K    K  K+  K KK +  K   K+  +
Sbjct: 6   KTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTV 65

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           K +K + V+K    +K  +KE  S +  +   K      K  K+  + EK
Sbjct: 66  KFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115



 Score = 36.9 bits (85), Expect = 0.015
 Identities = 21/98 (21%), Positives = 36/98 (36%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
               KK  K+K  ++K    K    +KE  K KK  K       ++  K ++ +      
Sbjct: 1   MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           K+  V+  + E    E   K   K + +  E  +   K
Sbjct: 61  KKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNK 98



 Score = 34.5 bits (79), Expect = 0.076
 Identities = 26/101 (25%), Positives = 39/101 (38%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           K  ++E+     K  +  K+    KK  K    K   K  K K     K   KKV  + E
Sbjct: 9   KKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFE 68

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
           K +  +KE   +K  K +    +  E   K  K++ K  K 
Sbjct: 69  KTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKK 109



 Score = 30.3 bits (68), Expect = 1.7
 Identities = 22/113 (19%), Positives = 42/113 (37%), Gaps = 3/113 (2%)

Query: 57  TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
                K  ++  P  K +   K ++K K + K       K  +        ++   T   
Sbjct: 6   KTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAK--KAAKVKKTKSVKTTTKKV 63

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK 169
            + + +K +S+KK+    K  +KE +  +  +      KN  K  +K    EK
Sbjct: 64  TV-KFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115



 Score = 30.3 bits (68), Expect = 1.9
 Identities = 29/97 (29%), Positives = 42/97 (43%), Gaps = 2/97 (2%)

Query: 153 KIKNKEKDKEKKVDKEKEKGKKE-EKEKKEEKEFKMKE-DKEKPVEKIRKEEKDSEKEKK 210
           K   K+  +EKK   +K    KE  K KK  K    K   K   V+K +  +  ++K   
Sbjct: 6   KTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTV 65

Query: 211 SKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 247
             +K + +KKE + KK  +KE  S E        FK 
Sbjct: 66  KFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKN 102


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 42.3 bits (100), Expect = 4e-04
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 10/129 (7%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
           LGE   +    + +    KKE +K+++K ++ +  +I+ KE++ EK  +K  +K  K + 
Sbjct: 86  LGEENIEI---EAQIEELKKELKKLEEKIEQLEA-EIEKKEEELEKAKNKFLDKAWK-KL 140

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
            KK +      E  +    K   +EK   E +    +    L  E+L  K K K   S  
Sbjct: 141 AKKYDSNL--SEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEEL--KAKIKTLFSSN 196

Query: 237 KEVIPKLTF 245
           K  +  LT 
Sbjct: 197 KPELALLTL 205



 Score = 34.6 bits (80), Expect = 0.11
 Identities = 31/153 (20%), Positives = 68/153 (44%), Gaps = 18/153 (11%)

Query: 114 SSEDLGEIKKDKSIKKDK-KTHKKKEKEKIKK-------KKDKKDKDKIKNKEKDKEKKV 165
              D+ E+K      ++  + + +K +EK+K        +      + I +      + +
Sbjct: 318 FKLDVEELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELI 377

Query: 166 DKEKEKGKKEEKEKKEEKEF-------KMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKK 217
            +  EK    +KEK + K+        ++KED +   ++ +  EK  +  EK+ K  + +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437

Query: 218 LKKEKLKKKKKEKERSSHEK--EVIPKLTFKFG 248
           +K  + + K+ EK+ ++ E   + I KL   +G
Sbjct: 438 IKALEKEIKELEKQLTNIEPTADEINKLLKAYG 470



 Score = 34.6 bits (80), Expect = 0.11
 Identities = 27/130 (20%), Positives = 61/130 (46%), Gaps = 10/130 (7%)

Query: 145 KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
           +++ + + +I+  +K+ +K  +++ E+ + E ++K+EE E    +  +K  +K+ K+   
Sbjct: 88  EENIEIEAQIEELKKELKKL-EEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDS 146

Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKR-ESSPKI 263
           +  E       KK  KEKL K+ K    ++        L+ +   +++ K K   SS K 
Sbjct: 147 NLSEALKGLNYKKNFKEKLLKELKSVILNA-----SSLLSLE---ELKAKIKTLFSSNKP 198

Query: 264 VIKPVKSPSP 273
            +  +     
Sbjct: 199 ELALLTLSVI 208


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 42.0 bits (99), Expect = 5e-04
 Identities = 18/117 (15%), Positives = 51/117 (43%), Gaps = 8/117 (6%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD-----KEKKVDKEKEKGKKEEK 177
            D+ I+ +++  K    ++ +     ++ + I  ++K+     K +  + ++ + + +  
Sbjct: 808 IDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNI 867

Query: 178 EKKEEKEFKMKEDKEKPVEKI---RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
            + + +  + K D +K  + I       K      + K  +K  KK+K   +K ++E
Sbjct: 868 NENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEE 924



 Score = 37.7 bits (88), Expect = 0.011
 Identities = 24/112 (21%), Positives = 52/112 (46%), Gaps = 4/112 (3%)

Query: 82  KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 141
           + K    ++E   ++  EL +       +K     +L EI+     +++ +   + + E 
Sbjct: 818 RLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQ---EAEEEIQNINENKNEF 874

Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
           ++ K D K  +K+    KD   K+    ++ K++EK  K++K+   K D+E 
Sbjct: 875 VEFKNDPKKLNKLII-AKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEEA 925


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 41.9 bits (98), Expect = 6e-04
 Identities = 39/128 (30%), Positives = 55/128 (42%), Gaps = 5/128 (3%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK-----VDKE 168
             ED  E K  K  KK  K  KK + +K  K+    D D+  + + D E +      D  
Sbjct: 232 GGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSS 291

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
                 EE+E K   E   K + E+  +    EE+ +E+E     K KKLKK K KK   
Sbjct: 292 ASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351

Query: 229 EKERSSHE 236
           +K+ S   
Sbjct: 352 DKDDSDSG 359



 Score = 33.8 bits (77), Expect = 0.20
 Identities = 18/87 (20%), Positives = 38/87 (43%)

Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 205
           ++ K   +I  K + ++ +  +E E+ K EE+    +K  K+K+ K K     + +    
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359

Query: 206 EKEKKSKDKDKKLKKEKLKKKKKEKER 232
           +    S    +        KK+KE ++
Sbjct: 360 DDSDDSDIDGEDSVSLVTAKKQKEPKK 386



 Score = 30.3 bits (68), Expect = 2.6
 Identities = 35/143 (24%), Positives = 60/143 (41%), Gaps = 5/143 (3%)

Query: 104 PMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK 163
           P +P   ++   ED  E +++K+ ++   + K K+ +K+K KK+  DKD   + +   + 
Sbjct: 306 PEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDS 365

Query: 164 KVDKEKEKG---KKEEKEKKEEKEFKMKEDKEKPVEKIR--KEEKDSEKEKKSKDKDKKL 218
            +D E        K++KE K+E+               R   E KD  K K + +  K  
Sbjct: 366 DIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSP 425

Query: 219 KKEKLKKKKKEKERSSHEKEVIP 241
                KK K E    S   +  P
Sbjct: 426 ASVPAKKLKTENAPKSSSGKSTP 448



 Score = 28.4 bits (63), Expect = 8.4
 Identities = 27/131 (20%), Positives = 53/131 (40%), Gaps = 11/131 (8%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE----KGKK 174
           G+  ++   +K      K  K K  +  D+ D D+     +D +++  K+K+    K KK
Sbjct: 194 GDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKK 253

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEK-------DSEKEKKSKDKDKKLKKEKLKKKK 227
           +  + K+ K     +  E   +    E +        S      ++++ KL  E   K +
Sbjct: 254 KLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPE 313

Query: 228 KEKERSSHEKE 238
            E++  S E E
Sbjct: 314 IEQDEDSEESE 324


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 40.4 bits (95), Expect = 9e-04
 Identities = 27/115 (23%), Positives = 49/115 (42%), Gaps = 3/115 (2%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
           + +   ++E+   KKK+ K    K   K+K K+     +  K      +KK E+      
Sbjct: 73  EGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPT 132

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
             + P    RK  + S  + K    ++  ++E  +KK + K R   EK+   +LT
Sbjct: 133 LLDSP---RRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELT 184



 Score = 31.6 bits (72), Expect = 0.57
 Identities = 31/127 (24%), Positives = 51/127 (40%), Gaps = 18/127 (14%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE--- 176
           E+++++ +KK K+   K  KE  KKKK K        K      K   E+          
Sbjct: 77  ELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDS 136

Query: 177 ----------KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
                      + KE    ++KE + +  +   K  K     +K K K+K+L +E+   +
Sbjct: 137 PRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARK-----RKEKKKEKELTQEERLAE 191

Query: 227 KKEKERS 233
            KE ER 
Sbjct: 192 AKETERI 198


>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
           finger [Chromatin structure and dynamics].
          Length = 271

 Score = 40.7 bits (95), Expect = 0.001
 Identities = 43/244 (17%), Positives = 79/244 (32%), Gaps = 38/244 (15%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK----- 183
            D K     +  +      KK  D      ++ E  + KE  +   +    ++EK     
Sbjct: 31  IDAKVCDIIKNLRQMISILKKIIDLDSQTYEEVEDGLLKEIRELLLKAIYIQKEKSDLAD 90

Query: 184 --EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS-------S 234
             E  ++  ++   ++I K   +    +     D   + E+       +  S       S
Sbjct: 91  RAEKLLRRHRKLLDDRIAKRPHEKVAARIENCHDAVSRLERNSYSSAARRSSGEHRSAAS 150

Query: 235 HEKEVIPKLTFKFG------TDMEEKTKRESSPKIVIKPV----------KSPSPPPVAA 278
            +     KL  +           E  +KRE S  +    V           +        
Sbjct: 151 SQGSRHTKLKKRKNIHNLKRRSPELSSKREVSFTLESPSVPDTATRVKEGNNGGSTKSRG 210

Query: 279 ADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDA---WYHWVCVGLVAEPETSDWFCP 335
               D  + ++++    C  Q      M+ CD  +    W+H  CVGL   P+   W+CP
Sbjct: 211 VSSEDNSEGEELY----CFCQQVSYGQMVACDNANCKREWFHLECVGLKEPPKGK-WYCP 265

Query: 336 KCSK 339
           +C K
Sbjct: 266 ECKK 269


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 39.8 bits (93), Expect = 0.001
 Identities = 27/159 (16%), Positives = 65/159 (40%)

Query: 73  TSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKK 132
                ++   +     + +    KD     T +            L  I++ + + K K 
Sbjct: 44  YGKTSRLKKSKGRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKG 103

Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
              KK   +I +  + ++      K   +   +     + + EE + K +++ +  E+ E
Sbjct: 104 KKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELE 163

Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           K +E++  E+   EK  + K+K K+L++ K + ++  +E
Sbjct: 164 KELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEE 202



 Score = 36.0 bits (83), Expect = 0.018
 Identities = 32/141 (22%), Positives = 61/141 (43%), Gaps = 15/141 (10%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN---KEKDKEKKVDKEKE 170
            S+  G +K D  I+KD+   K   +   +  K K     I+      K K KKV K   
Sbjct: 52  KSKGRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSIL 111

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK----------- 219
           +  + ++ ++   E    + +  P+     +E+  E + K K+K ++L+           
Sbjct: 112 EIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELED 171

Query: 220 -KEKLKKKKKEKERSSHEKEV 239
            K+ L+K  +EKE+    +E+
Sbjct: 172 EKDLLEKLLEEKEKKKELEEL 192


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 40.8 bits (96), Expect = 0.001
 Identities = 12/86 (13%), Positives = 39/86 (45%)

Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
            +++    +  K++ K  E  K  E E+++ +   +KE+ +K   +I + E+  ++ +  
Sbjct: 2   KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61

Query: 212 KDKDKKLKKEKLKKKKKEKERSSHEK 237
                +    + ++++ + E +    
Sbjct: 62  PAASGEGGGGEEEEEEAKAEAAEFRA 87



 Score = 36.2 bits (84), Expect = 0.027
 Identities = 16/92 (17%), Positives = 39/92 (42%), Gaps = 3/92 (3%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           E++K    +K  +  ++K +    K++  K   +I   E+  ++   K    G+    E+
Sbjct: 14  ELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEE 73

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
           +EE+      +       +R  +    +E+K+
Sbjct: 74  EEEEAKAEAAEFRA---YLRGGDDALAEERKA 102



 Score = 35.4 bits (82), Expect = 0.058
 Identities = 17/96 (17%), Positives = 35/96 (36%), Gaps = 4/96 (4%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK--KEEKEFKMKEDKEKPVE 196
           +E + +    K+  K+   EK  E + +K +    KEE +K   E    +   D+ +   
Sbjct: 5   REALAELA--KELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKP 62

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
               E    E+E++    +    +  L+        
Sbjct: 63  AASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAE 98



 Score = 28.9 bits (65), Expect = 5.7
 Identities = 14/58 (24%), Positives = 27/58 (46%), Gaps = 6/58 (10%)

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK-EKLKKKKKEKER 232
            KE +E      KE     + K+ ++EK +E E++  + D   ++ +KL  +    E 
Sbjct: 1   LKELREALAELAKE-----LRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEE 53


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 40.2 bits (94), Expect = 0.001
 Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 3/65 (4%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
           +     KDK   +   K   + K  +K K  IK +  +   +V K   + KKEEK+K ++
Sbjct: 19  EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAE---RVKKLHSQEKKEEKKKPKK 75

Query: 183 KEFKM 187
           K+  +
Sbjct: 76  KKVPL 80



 Score = 36.7 bits (85), Expect = 0.011
 Identities = 19/78 (24%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 163 KKVDKEKE-KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           K+V +  E +  K  K+K   + +     ++K +EK +   K    E+  K   ++ K+E
Sbjct: 10  KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69

Query: 222 KLKKKKKEKERSSHEKEV 239
           K K KKK+     +  ++
Sbjct: 70  KKKPKKKKVPLQVNPAQL 87


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 40.9 bits (96), Expect = 0.001
 Identities = 25/113 (22%), Positives = 41/113 (36%), Gaps = 4/113 (3%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDK---DKIKNKEKDKEKKVDKEKEKGKKEEKE 178
           K+    +K  +    K     K K + + K      K    + +KK + E  K    E +
Sbjct: 125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAK 184

Query: 179 KKEEKEFKMKEDKE-KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
           KK E E   K   E K   +   ++K + + KK    + K    K   + K  
Sbjct: 185 KKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAA 237



 Score = 37.9 bits (88), Expect = 0.008
 Identities = 20/97 (20%), Positives = 44/97 (45%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           K+ EKE++  ++ KK  ++   +   K+K+ ++   K     K K E +  +     +K 
Sbjct: 104 KQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKA 163

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
             + +K+ +    +K + +  KK + E   K   E +
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK 200



 Score = 37.5 bits (87), Expect = 0.011
 Identities = 27/125 (21%), Positives = 62/125 (49%), Gaps = 5/125 (4%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
           + KS K+ ++  KKKE+++ ++ + K+  ++ + K+ +KE+   +E++K  +E  ++   
Sbjct: 70  QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL 129

Query: 183 KEFKMKEDKEKPVEKIRKEEKDSEKE-----KKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           K+ + +E   K     + + +   K      KK+  + KK  + +  KK   + +   E 
Sbjct: 130 KQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEA 189

Query: 238 EVIPK 242
           E   K
Sbjct: 190 EAAAK 194



 Score = 37.1 bits (86), Expect = 0.014
 Identities = 19/97 (19%), Positives = 39/97 (40%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           +   +E+ K+ ++   +  +K K+ ++         K K E + K+     K    + K 
Sbjct: 110 RLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
             +    +K + + KK  + +   K     KKK E E
Sbjct: 170 KAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAE 206



 Score = 36.7 bits (85), Expect = 0.022
 Identities = 28/109 (25%), Positives = 41/109 (37%), Gaps = 4/109 (3%)

Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-DKEKKVDKEKEKGKKEEKEKKEEKEF 185
            KK  +   K+   K K+ ++   K     K K + E K      K    E +KK E E 
Sbjct: 116 QKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175

Query: 186 KMK---EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
             K   E K+K   +   +     K+K   +  KK   E  KK   E +
Sbjct: 176 AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAK 224



 Score = 35.6 bits (82), Expect = 0.044
 Identities = 21/95 (22%), Positives = 39/95 (41%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           KK     KKK + +  KK   + K K + +   K     K+K + + ++K   E K+   
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
            E K    +   + +  +EK   +K  +K    + 
Sbjct: 221 AEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255



 Score = 33.6 bits (77), Expect = 0.20
 Identities = 26/120 (21%), Positives = 44/120 (36%), Gaps = 3/120 (2%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
              K K+  + K+     +K   + KK  + +   K   + K+K   +   K   E K K
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK-K 201

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
           K E E K K   E   +K    E  +   K + +     +K    K  ++   +    EV
Sbjct: 202 KAEAEAKKKAAAE--AKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259



 Score = 31.3 bits (71), Expect = 1.1
 Identities = 17/108 (15%), Positives = 55/108 (50%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
           +   K  E++  K++++  +E ++K+  ++ ++K+ +++ +    ++++  E  K++  K
Sbjct: 71  QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130

Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 262
            K+ ++   K     K ++  E +       K   + ++K + E++ K
Sbjct: 131 QKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKK 178



 Score = 30.9 bits (70), Expect = 1.2
 Identities = 19/92 (20%), Positives = 35/92 (38%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           KK ++    K   + K+K + +       + K K + + K+K   + K+K   E K    
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAA 228

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
           +   + K   EK       E+  + K     D
Sbjct: 229 KAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 39.9 bits (93), Expect = 0.001
 Identities = 34/167 (20%), Positives = 73/167 (43%), Gaps = 9/167 (5%)

Query: 76  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
           KEK +S+    P+E E    ++ +L V      + +        E+K+      +K    
Sbjct: 68  KEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEE--------EVKEQLQSLLEKIVVS 119

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           K+E++   K+  + D+DK    E   +++   E  K  +E ++    +E   ++ K+   
Sbjct: 120 KQEEDGPGKEP-QLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMK 178

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 242
           EK  ++E +  KE   K +  K + + + ++  ++E +  E     K
Sbjct: 179 EKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIK 225



 Score = 39.1 bits (91), Expect = 0.002
 Identities = 27/124 (21%), Positives = 59/124 (47%), Gaps = 3/124 (2%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
           KEK   +     ++   + E++ +  V K K+K ++E KE+ +    K+   K++     
Sbjct: 68  KEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPG 127

Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
           ++ + D +K   ++D D + +  +  K  +E E S H +E   +   ++  DM+EK   +
Sbjct: 128 KEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASE---QYKQDMKEKASEQ 184

Query: 259 SSPK 262
            +  
Sbjct: 185 ENED 188



 Score = 31.1 bits (70), Expect = 0.92
 Identities = 31/162 (19%), Positives = 68/162 (41%), Gaps = 8/162 (4%)

Query: 104 PMVPNIVKVTSSEDLGEIKKDKSIKKDK---KTHKKKEKEKIKKKKDKKDKDKIKNKEKD 160
           P VP       +E+ G++   K+ +K +   K   +   EKI   K ++D    + +  +
Sbjct: 75  PTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDE 134

Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK- 219
            +  + ++ +  ++  +  K  +E    ED     E   ++ K   KEK S+ +++  K 
Sbjct: 135 DKFLLAEDSDDRQETLEAGKVHEE---TEDSYHVEETASEQYKQDMKEKASEQENEDSKE 191

Query: 220 -KEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 260
             EK ++ K E +  + E         +    ++E+  +E  
Sbjct: 192 PVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAKEPV 233



 Score = 29.9 bits (67), Expect = 2.6
 Identities = 35/176 (19%), Positives = 77/176 (43%), Gaps = 12/176 (6%)

Query: 63  MKDKVSPKDKTSPKEK--MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
           +K+K + +    P+E    + +E      K   KV++        +   + V+  E+ G 
Sbjct: 67  LKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGP 126

Query: 121 IKK-----DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
            K+     DK +  +    +++  E  K  ++ +D   ++    ++ K+  KEK      
Sbjct: 127 GKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEK----AS 182

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           E+E ++ KE   K ++ K        E+D ++E    +  K +K+E  K+  +E++
Sbjct: 183 EQENEDSKEPVEKAERTKAETD-DVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQQ 237


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 40.8 bits (96), Expect = 0.001
 Identities = 34/155 (21%), Positives = 68/155 (43%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           ++E E+  KK   + K   +N EK+ E K  K++ K + +EK++ +E+E    E++ K  
Sbjct: 409 EEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVE 468

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKT 255
           +   K  K SEK +K +++++  ++    K      +S+ +++   K + K      + +
Sbjct: 469 KVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKIS 528

Query: 256 KRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV 290
           K     K   K  KS         +E         
Sbjct: 529 KAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDD 563



 Score = 40.0 bits (94), Expect = 0.003
 Identities = 26/118 (22%), Positives = 57/118 (48%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           K+ +S K  K+   + +++K   ++++ + ++    EK   K + + ++  K+EE+E+ +
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELD 491

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
           E+   +K          +++ K     K  K  +K  K     KKKK+KE+S    + 
Sbjct: 492 EENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDD 549



 Score = 35.4 bits (82), Expect = 0.067
 Identities = 24/128 (18%), Positives = 68/128 (53%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           K+   ED  E     S+K  ++   +K++E   + ++ + + + + +  ++E +   +K 
Sbjct: 360 KLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKN 419

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            G+++   +  EKE + K+ K++   + +++++  E+E+   +++ K++K   K  K+ +
Sbjct: 420 VGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSE 479

Query: 231 ERSSHEKE 238
           +    E+E
Sbjct: 480 KAQKEEEE 487



 Score = 34.6 bits (80), Expect = 0.12
 Identities = 23/122 (18%), Positives = 61/122 (50%), Gaps = 1/122 (0%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKK-KKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
           E+   +   K +++ +   K++   +I++ +++ + +++   +E ++  K +  + K   
Sbjct: 368 EENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
           E  EK+ E +   KE+K +  EK   +E++  ++++    +K   K   + +K +KE   
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487

Query: 235 HE 236
            E
Sbjct: 488 EE 489



 Score = 34.3 bits (79), Expect = 0.15
 Identities = 31/133 (23%), Positives = 62/133 (46%), Gaps = 2/133 (1%)

Query: 106 VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV 165
           +        +  L  +K  +  +  KK     E E+++++ + +++   +  E+  +K V
Sbjct: 361 LKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNV 420

Query: 166 DKEKEKGKKEEKEKKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
            + K   +  EKE + +K  K    E KEK      +E +D E+ K  K  +K LK+ + 
Sbjct: 421 GRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEK 480

Query: 224 KKKKKEKERSSHE 236
            +K++E+E    E
Sbjct: 481 AQKEEEEEELDEE 493



 Score = 33.1 bits (76), Expect = 0.36
 Identities = 39/172 (22%), Positives = 73/172 (42%), Gaps = 17/172 (9%)

Query: 77  EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK 136
           EK+++K     ++ +  + ++      P +     V  S    + KK  S K DK  +K 
Sbjct: 468 EKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKI 527

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE------D 190
            +     KKK KK+K    + +   E+   K     +++E +++    FK K+       
Sbjct: 528 SKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKDLIKEAFA 587

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKL-----------KKEKLKKKKKEKE 231
            +  V +  KE+K+  +E+  K+ D  L           KK K K+K+K + 
Sbjct: 588 GDDVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRF 639



 Score = 32.3 bits (74), Expect = 0.60
 Identities = 30/147 (20%), Positives = 61/147 (41%), Gaps = 12/147 (8%)

Query: 99  ELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE 158
           E    P   N+ +     + GE + +    K +  ++ KEK++  ++++ +D+++ K ++
Sbjct: 410 EENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEK 469

Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE---------- 208
              +     EK + ++EE+E  EE  +           K +  +K S  +          
Sbjct: 470 VANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISK 529

Query: 209 --KKSKDKDKKLKKEKLKKKKKEKERS 233
              K K K KK K   L     ++E S
Sbjct: 530 AAVKVKKKKKKEKSIDLDDDLIDEEDS 556



 Score = 32.0 bits (73), Expect = 0.68
 Identities = 25/131 (19%), Positives = 57/131 (43%), Gaps = 6/131 (4%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE----KEKGKKEEK 177
            K+K    +++  + +E+ K++K  +K  K   K +++++E+++D+E    K      + 
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKS 506

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKD--SEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
            KK++ + K     +K   KI K       +K+K+          ++    K + +    
Sbjct: 507 AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEED 566

Query: 236 EKEVIPKLTFK 246
           E +      FK
Sbjct: 567 EDDEELPFLFK 577



 Score = 29.3 bits (66), Expect = 5.5
 Identities = 31/168 (18%), Positives = 66/168 (39%), Gaps = 11/168 (6%)

Query: 64  KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 123
           + K  P++     E    K++   + KE  +  + E         + KV +      +K+
Sbjct: 422 RRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVAN----KLLKR 477

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
            +  +K+++  +  E+    K      K       K ++ K     +  K   K  K   
Sbjct: 478 SEKAQKEEEEEELDEENPWLKTTSSVGKSA-----KKQDSKKKSSSKLDKAANKISKAAV 532

Query: 184 EFKMKEDKEKP--VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
           + K K+ KEK   ++    +E+DS K     ++D+  ++     K+K+
Sbjct: 533 KVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKD 580



 Score = 28.9 bits (65), Expect = 6.2
 Identities = 32/154 (20%), Positives = 71/154 (46%), Gaps = 13/154 (8%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI------KNKEKDKEKKVDK 167
           S E+  +  +D    ++    +KK   K+K+ +D ++   +      +  E  K+++ D 
Sbjct: 334 SEEEDEDDDEDDDDGENPWMLRKKL-GKLKEGEDDEENSGLLSMKFMQRAEARKKEENDA 392

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK--K 225
           E E+ ++E + ++E  E    E+ E+P +K     K   +  + + + KKLKKE     K
Sbjct: 393 EIEELRRELEGEEESDE----EENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448

Query: 226 KKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 259
           +KKE +     ++       K    + +++++  
Sbjct: 449 EKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQ 482


>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins, some contain Zn-finger domain; signature
           gene for I-A subtype; also known as TM1802 family.
          Length = 579

 Score = 40.4 bits (95), Expect = 0.002
 Identities = 25/108 (23%), Positives = 45/108 (41%), Gaps = 18/108 (16%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE----KDKEKKVDKEKEKGKKEEK 177
             +K     K T  +   +KIKK  +K        KE    K+ +  ++K  EK  K+ +
Sbjct: 82  PTNKITYLLKTT--EPLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQ 139

Query: 178 EKKE------------EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
           + K             +K  +  ED E+  +K+ K++   +K+ K K 
Sbjct: 140 DLKPGENVILTIKIDGKKLLEYDEDYEEFRDKVLKKKVFKDKKAKEKG 187



 Score = 29.6 bits (67), Expect = 3.3
 Identities = 30/153 (19%), Positives = 61/153 (39%), Gaps = 9/153 (5%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI-KNKEKDKEKKVDK 167
           ++K+    +  EI      + DKKT +K      K +     +D +  NK     K  + 
Sbjct: 39  VLKIDFDTEENEINIKGEEEFDKKTFRKY---LYKGQAGNASQDYLPTNKITYLLKTTEP 95

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS--EKEKKSKDKDKKLKKEKLKK 225
             +K KK  ++     E K +  K K ++ + ++  +   +  +  K  +  +   K+  
Sbjct: 96  LNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQDLKPGENVILTIKIDG 155

Query: 226 KKKEKERSSHEK---EVIPKLTFKFGTDMEEKT 255
           KK  +    +E+   +V+ K  FK     E+  
Sbjct: 156 KKLLEYDEDYEEFRDKVLKKKVFKDKKAKEKGI 188


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 30/115 (26%), Positives = 51/115 (44%), Gaps = 18/115 (15%)

Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD----KEKKVDKEKEKGKKEEKEKKEE 182
           I++  K+ +  +    ++ K + DK K  ++       K KK+   K    ++  E KE 
Sbjct: 333 IEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEA 392

Query: 183 -------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
                  K    K + +K +E+IR+E  + E   KSK K +K      KK+  EK
Sbjct: 393 IAYYESAKTALEKAEGKKAIEEIREELIE-EGLLKSKKKKRK------KKEWFEK 440



 Score = 36.2 bits (84), Expect = 0.035
 Identities = 27/114 (23%), Positives = 51/114 (44%), Gaps = 17/114 (14%)

Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEK-GKKEEKEKKEEKEFKMKEDKEKP----------- 194
           +K ++DKIK    + EKK++KE +K   K EK++ E +E +   ++ +            
Sbjct: 273 EKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQL 332

Query: 195 VEKIRKE-----EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
           +E+  K         +E+ K   DK K   +   +  KK K+    +  +  +L
Sbjct: 333 IEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQL 386



 Score = 30.4 bits (69), Expect = 2.4
 Identities = 21/126 (16%), Positives = 52/126 (41%), Gaps = 15/126 (11%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG--------- 172
           +K +  K  +   + ++K + + KK +   +K +++ ++ EK  ++ ++KG         
Sbjct: 273 EKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQL 332

Query: 173 -KKEEKE-----KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
            ++  K          +E K++ DK K   +  +      K+ K    +   +  +LK+ 
Sbjct: 333 IEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEA 392

Query: 227 KKEKER 232
               E 
Sbjct: 393 IAYYES 398



 Score = 28.5 bits (64), Expect = 7.5
 Identities = 12/63 (19%), Positives = 28/63 (44%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
               ++ +     K+        K   +  +  K  E+ +E+ +++   K KK++++KKE
Sbjct: 377 GAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKE 436

Query: 182 EKE 184
             E
Sbjct: 437 WFE 439



 Score = 28.5 bits (64), Expect = 9.3
 Identities = 20/87 (22%), Positives = 43/87 (49%), Gaps = 2/87 (2%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
            +I+ DKS K   +  ++  K+  K K  K + D+  ++ K+     +  K   +K E +
Sbjct: 351 IKIELDKS-KTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGK 409

Query: 179 KK-EEKEFKMKEDKEKPVEKIRKEEKD 204
           K  EE   ++ E+     +K ++++K+
Sbjct: 410 KAIEEIREELIEEGLLKSKKKKRKKKE 436


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 40.2 bits (94), Expect = 0.002
 Identities = 28/127 (22%), Positives = 70/127 (55%), Gaps = 2/127 (1%)

Query: 106 VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV 165
           +P+ +   +    GE K++ ++  +K +  +KE E+  +  +K  K++ K K++ +++  
Sbjct: 494 IPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEME 553

Query: 166 DKEKEKGKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
           + ++ +  K+ + +KE +E    +K++ E  + ++++++    KE KS +   KLK+ K 
Sbjct: 554 ELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQ 613

Query: 224 KKKKKEK 230
           K  +K  
Sbjct: 614 KIPQKPT 620



 Score = 39.0 bits (91), Expect = 0.004
 Identities = 35/117 (29%), Positives = 59/117 (50%), Gaps = 15/117 (12%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE---KKEEKEFKMKEDKEKP 194
           E+ K    + K++ + +  K    EK+++++ E  +K  KE    K+E E +M+E KE  
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKE-- 557

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKE------KLKKKKKEKERSSHEKEVIPKLTF 245
               R+  K  E EK++++  K LKKE      +LK+KK  K +     E + KL  
Sbjct: 558 ----RERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKE 610


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 40.4 bits (94), Expect = 0.002
 Identities = 28/139 (20%), Positives = 62/139 (44%), Gaps = 16/139 (11%)

Query: 116  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
            E++ +I +D S   ++K   +  +E + + + K ++    N E D   K D  K    K+
Sbjct: 3890 EEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKD 3949

Query: 176  EKEKKEEKEFK------------MKEDKEKPVEKIRK----EEKDSEKEKKSKDKDKKLK 219
             +EK++E+E              ++E+  +P  +       E+   ++++    KD  L+
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009

Query: 220  KEKLKKKKKEKERSSHEKE 238
               ++   + KE +  EK+
Sbjct: 4010 DMDMEAADENKEEADAEKD 4028


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 40.0 bits (94), Expect = 0.002
 Identities = 38/158 (24%), Positives = 79/158 (50%), Gaps = 13/158 (8%)

Query: 76  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
           +EK+   ++   K +EY K+ +                  ++L EI+K  S  +++    
Sbjct: 279 EEKVKELKELKEKAEEYIKLSEF------------YEEYLDELREIEKRLSRLEEEINGI 326

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           ++  +++++K+++ ++ K K KE +K  +  +E+ +  +E K KKEE E   K       
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP 386

Query: 196 EKIRKEEKDSEKEKKS-KDKDKKLKKEKLKKKKKEKER 232
           EK+ KE ++ EK K+  +++  K+     + KK+ KE 
Sbjct: 387 EKLEKELEELEKAKEEIEEEISKITARIGELKKEIKEL 424



 Score = 39.3 bits (92), Expect = 0.004
 Identities = 27/129 (20%), Positives = 68/129 (52%), Gaps = 6/129 (4%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKD-----KKDKDKIKNKEKDKEKKVDKEKEKGKK 174
           E+K  +   + ++   KK +E++ K  +     +K  ++++ + ++ EKK  +E+ +  +
Sbjct: 606 ELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELR 665

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
           EE  +   +   ++ + E+  ++  + +K  EK K+  ++ +K  K++L+K +K  ER  
Sbjct: 666 EEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREK-AKKELEKLEKALERVE 724

Query: 235 HEKEVIPKL 243
             +E + K 
Sbjct: 725 ELREKVKKY 733



 Score = 38.5 bits (90), Expect = 0.006
 Identities = 37/150 (24%), Positives = 77/150 (51%), Gaps = 3/150 (2%)

Query: 116 EDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKVDKEKEKGK 173
           E + E+++  + +KK+ +  ++K KE  + K+  ++  K+    E+  ++  + EK   +
Sbjct: 259 EKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSR 318

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
            EE+    E+  K  E+KE+ +E+++K+ K+ EK  +  ++  +L  E+ K KK+E ER 
Sbjct: 319 LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHEL-YEEAKAKKEELERL 377

Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRESSPKI 263
                 +     +   +  EK K E   +I
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEI 407



 Score = 38.1 bits (89), Expect = 0.009
 Identities = 22/116 (18%), Positives = 63/116 (54%), Gaps = 4/116 (3%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           ++L E+ ++ +    +    ++E EK++K+  + ++     KE+ +E + + E  +G K 
Sbjct: 200 KELEEVLREINEISSELPELREELEKLEKEVKELEE----LKEEIEELEKELESLEGSKR 255

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           + E+K  +  +  E+ +K +E++ ++ K+ ++ K+  ++  KL +   +   + +E
Sbjct: 256 KLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELRE 311



 Score = 38.1 bits (89), Expect = 0.010
 Identities = 31/125 (24%), Positives = 72/125 (57%), Gaps = 6/125 (4%)

Query: 113 TSSEDLGEIKK--DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
            + ++LGE+ K   + I++ +K  K+ E  +   K+ +K+ +++  +  +   ++ + +E
Sbjct: 162 NAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELRE 221

Query: 171 KGKKEEKEKKEEKEFKMK-EDKEKPVEKIRKEEKDSE---KEKKSKDKDKKLKKEKLKKK 226
           + +K EKE KE +E K + E+ EK +E +   ++  E   +E + + ++ K + E+L++K
Sbjct: 222 ELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281

Query: 227 KKEKE 231
            KE +
Sbjct: 282 VKELK 286



 Score = 37.7 bits (88), Expect = 0.014
 Identities = 38/137 (27%), Positives = 77/137 (56%), Gaps = 6/137 (4%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           K I+K+ K  ++KE+ K++K+  + +K   K  E  K K++ ++ ++ +++ K+   E+ 
Sbjct: 462 KRIEKELKEIEEKER-KLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEEL 520

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL----KKKKKEKERSSHEKEVI 240
            K  E+ EK  EK+ K + + +  KK  +K ++LKK+      K  + E+E +   KE +
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE-L 579

Query: 241 PKLTFKFGTDMEEKTKR 257
            +L F+   ++EE+ K 
Sbjct: 580 EELGFESVEELEERLKE 596



 Score = 37.4 bits (87), Expect = 0.015
 Identities = 27/118 (22%), Positives = 65/118 (55%), Gaps = 3/118 (2%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDK--DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
           KD +   ++E++++KK +++ DK  +++   EK  E+   + +E  KK  +E+ EE   +
Sbjct: 608 KDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREE 667

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
             E   + +  +R E ++ EK ++   K  +  KE+L++++K K+     ++ + ++ 
Sbjct: 668 YLE-LSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVE 724



 Score = 36.6 bits (85), Expect = 0.026
 Identities = 37/138 (26%), Positives = 70/138 (50%), Gaps = 11/138 (7%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           E L +   ++  KK ++  K KEK    K + K  K +++  E+ K+K  + EK+  + E
Sbjct: 510 EKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELE 569

Query: 176 EKEKK-----EEKEFKMKEDKEKPVEKIRK------EEKDSEKEKKSKDKDKKLKKEKLK 224
           E+  +     EE  F+  E+ E+ ++++        E KD+EKE + ++K+ K  +E+L 
Sbjct: 570 EELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELD 629

Query: 225 KKKKEKERSSHEKEVIPK 242
           K  +E   +    E + K
Sbjct: 630 KAFEELAETEKRLEELRK 647



 Score = 36.2 bits (84), Expect = 0.035
 Identities = 43/165 (26%), Positives = 82/165 (49%), Gaps = 7/165 (4%)

Query: 70  KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
           K K    E++  +      EK   +++++E     +   I K+T    +GE+KK+    K
Sbjct: 368 KAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKIT--ARIGELKKEIKELK 425

Query: 130 DKKTHKKKEKEKI---KKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
                 KK K K     ++  ++ + ++  +   + K++  EKE  + EEKE+K  KE +
Sbjct: 426 KAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRI--EKELKEIEEKERKLRKELR 483

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
             E   K   ++ K ++ +E+ K+ ++K KK   E+L+KK +E E
Sbjct: 484 ELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYE 528



 Score = 35.4 bits (82), Expect = 0.058
 Identities = 29/122 (23%), Positives = 61/122 (50%), Gaps = 5/122 (4%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
            +  ++IK++ ++  +  IK  E  +E   +KEKE  +   +  +   E     +  + +
Sbjct: 168 GEVIKEIKRRIERL-EKFIKRTENIEELIKEKEKELEEVLREINEISSEL---PELREEL 223

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKT 255
           EK+ KE K+ E+ K+  ++ +K  +     K+K +E+    +E I +L  +   ++EEK 
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEI-EELEEKV 282

Query: 256 KR 257
           K 
Sbjct: 283 KE 284



 Score = 34.7 bits (80), Expect = 0.12
 Identities = 31/135 (22%), Positives = 65/135 (48%), Gaps = 9/135 (6%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIK---------KKKDKKDKDKIKNKEKDKEKKVDKEK 169
            EI++ +   +  +  K+K +EKI+         KK+ ++ ++K+K  ++ KEK  +  K
Sbjct: 238 EEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIK 297

Query: 170 EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
                EE   +  +  K     E+ +  I +  K+ E++++  ++ KK  KE  K+ ++ 
Sbjct: 298 LSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL 357

Query: 230 KERSSHEKEVIPKLT 244
           +ER    +E   K  
Sbjct: 358 EERHELYEEAKAKKE 372



 Score = 33.5 bits (77), Expect = 0.25
 Identities = 32/143 (22%), Positives = 77/143 (53%), Gaps = 22/143 (15%)

Query: 116 EDLGEIKKD-KSIKKDKKTHK--KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE-- 170
           E L ++K + KS+KK+ +  +  KK+  +++KK D+ +++  +  ++ +E   +  +E  
Sbjct: 532 EKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELE 591

Query: 171 ----------------KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKD 213
                           K  ++E E++E++  K++E+ +K  E++ + EK  E+  K+ ++
Sbjct: 592 ERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEE 651

Query: 214 KDKKLKKEKLKKKKKEKERSSHE 236
            +KK  +E+ ++ ++E    S E
Sbjct: 652 LEKKYSEEEYEELREEYLELSRE 674



 Score = 33.1 bits (76), Expect = 0.32
 Identities = 30/152 (19%), Positives = 70/152 (46%), Gaps = 2/152 (1%)

Query: 121 IKKDKSIKKDKKTHKK--KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
           +K+ K +K+  + + K  +  E+   +  + +K   + +E+    +   ++ + K+E  E
Sbjct: 282 VKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE 341

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           + ++K  ++++  E+  E+    E+   K+++ +   K+L     +K +KE E     KE
Sbjct: 342 ELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKE 401

Query: 239 VIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 270
            I +   K    + E  K     K  I+ +K 
Sbjct: 402 EIEEEISKITARIGELKKEIKELKKAIEELKK 433


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 40.2 bits (95), Expect = 0.002
 Identities = 24/124 (19%), Positives = 46/124 (37%)

Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
              K  S      +KK K   +  K   K   E  +    KK   K+  + +     + K
Sbjct: 101 EKAKKNSPLFKSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILK 160

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
              +    ++E ++ +  K +ED++   E +   +K   ++KK        +      KK
Sbjct: 161 IDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKK 220

Query: 228 KEKE 231
            +KE
Sbjct: 221 IDKE 224



 Score = 36.0 bits (84), Expect = 0.048
 Identities = 29/142 (20%), Positives = 62/142 (43%), Gaps = 10/142 (7%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           KS+ K +K   +  K  IK   ++ ++D +K K   K  K  ++   G  +   + ++ +
Sbjct: 111 KSLLKKQKVEVEGNKLIIKVN-NEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSK 169

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
            ++++          + +K+ E EK +K+  + +KK + +KKK+ K     E  V     
Sbjct: 170 EELEK---------FEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKK 220

Query: 245 FKFGTDMEEKTKRESSPKIVIK 266
                    K   E   ++V++
Sbjct: 221 IDKEEITPMKEINEEERRVVVE 242



 Score = 32.9 bits (76), Expect = 0.45
 Identities = 20/100 (20%), Positives = 48/100 (48%), Gaps = 1/100 (1%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
           I+KV +  +   +KK K + K  K ++K     +K   +  D  +   K + ++++ D++
Sbjct: 127 IIKVNNEIERDHLKK-KHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEK 185

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
             K   E  +K E ++ K  ++ +     ++  +K  ++E
Sbjct: 186 LAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEE 225


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 39.9 bits (93), Expect = 0.003
 Identities = 28/99 (28%), Positives = 51/99 (51%)

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
           EK+   +EE E+K++KE K KE + K ++  +KE K   + +++ D     KK + K +K
Sbjct: 9   EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRK 68

Query: 228 KEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 266
           ++ E  + E  + P   F     +  +  ++ SP  V K
Sbjct: 69  RDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYSPSAVEK 107



 Score = 34.5 bits (79), Expect = 0.14
 Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 1/67 (1%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           +K KK  K KEKE +KK K  + + K K + +      +  K+  KK  K   E++  + 
Sbjct: 20  RKKKKEEKAKEKE-LKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENPED 78

Query: 188 KEDKEKP 194
             D + P
Sbjct: 79  FIDPDTP 85



 Score = 34.1 bits (78), Expect = 0.18
 Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 1/97 (1%)

Query: 138 EKEKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
           E E  KK   +++ + K K +EK KEK++ K K   K+ + + + ++        +K  +
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           K RK + + E  +   D D    ++K    +  K+ S
Sbjct: 65  KSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYS 101



 Score = 33.7 bits (77), Expect = 0.23
 Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 170 EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
           E   +++   +EE E K K++ EK  EK  K+ K ++KE K+K + ++        KK E
Sbjct: 5   ESEAEKKILTEEELERKKKKE-EKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSE 63

Query: 230 KERSSHEKEV 239
           K+    + E 
Sbjct: 64  KKSRKRDVED 73



 Score = 30.6 bits (69), Expect = 2.1
 Identities = 21/107 (19%), Positives = 45/107 (42%), Gaps = 13/107 (12%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
                + L E + ++  KK++K  +K+ K             K+K  +K+ + K+  ++ 
Sbjct: 6   SEAEKKILTEEELERKKKKEEKAKEKELK-------------KLKAAQKEAKAKLQAQQA 52

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
                  +K E+K  K   + E P + I  +    +K++ S    K+
Sbjct: 53  SDGTNVPKKSEKKSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQ 99



 Score = 30.3 bits (68), Expect = 2.4
 Identities = 19/69 (27%), Positives = 40/69 (57%), Gaps = 3/69 (4%)

Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK---FGTDMEEKT 255
           R E +  +K    ++ ++K KKE+  K+K+ K+  + +KE   KL  +    GT++ +K+
Sbjct: 3   RTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKS 62

Query: 256 KRESSPKIV 264
           +++S  + V
Sbjct: 63  EKKSRKRDV 71



 Score = 28.7 bits (64), Expect = 7.8
 Identities = 20/86 (23%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 127  IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
            ++     + + E EK++ K D+  K + K    +K+      KEK     +E+   K  K
Sbjct: 987  LQAQGAVNAEAELEKLRNKMDEIQKQQEK---LEKKMNASGYKEKVPANIQEEDARKLTK 1043

Query: 187  MKEDKEKPVEKIRKEEKDSEKEKKSK 212
            + ++    +E   KE K  E E  + 
Sbjct: 1044 LLQE----LEFFEKESKRLEAETSNS 1065


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 39.2 bits (92), Expect = 0.003
 Identities = 16/52 (30%), Positives = 24/52 (46%)

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           +E   K K   EK   K  K+      EKK K+K+K   K++ +  K   +R
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 38.8 bits (91), Expect = 0.004
 Identities = 18/54 (33%), Positives = 29/54 (53%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
           K  S KK  K  KK   ++ +KK+ +K+K K+K + +D +    + K  G  EE
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 37.6 bits (88), Expect = 0.011
 Identities = 14/42 (33%), Positives = 19/42 (45%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK 163
           K  K +   +   K+KEKEK K KK  +D   I  + K    
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 37.6 bits (88), Expect = 0.011
 Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 1/68 (1%)

Query: 118 LGEIKKDKSIKKDKKTHKK-KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
           + E  K + I + +   K   EK+  K  K    K   K +++ ++ KV K     K   
Sbjct: 364 IEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIG 423

Query: 177 KEKKEEKE 184
           K +K    
Sbjct: 424 KRRKPSGT 431



 Score = 36.8 bits (86), Expect = 0.021
 Identities = 16/58 (27%), Positives = 23/58 (39%)

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
            E + K +    K      K V   R E+K+ EKEK    K  +  K   K++K    
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 36.5 bits (85), Expect = 0.022
 Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 1/48 (2%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK 169
               S K   K  +KKEKEK K K  K+ +D  KN  K ++     E+
Sbjct: 388 TGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDT-KNIGKRRKPSGTSEE 434



 Score = 34.9 bits (81), Expect = 0.083
 Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
            +KKT K  +K   K+ + KK+K+K K K K + +     K  GK+ +     E+
Sbjct: 384 SEKKTGKPSKKVLAKRAE-KKEKEKEKPKVKKRHR---DTKNIGKRRKPSGTSEE 434



 Score = 34.1 bits (79), Expect = 0.13
 Identities = 15/51 (29%), Positives = 25/51 (49%)

Query: 164 KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
           K   EK+ GK  +K   +  E K KE ++  V+K  ++ K+  K +K    
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 34.1 bits (79), Expect = 0.14
 Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 155 KNKE-KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE--KPVEKIRKEEKDSEK 207
           K K   +K+     +K   K+ EK++KE+++ K+K+     K + K RK    SE+
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 33.4 bits (77), Expect = 0.22
 Identities = 13/50 (26%), Positives = 25/50 (50%), Gaps = 3/50 (6%)

Query: 191 KEKPVEKIR---KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           K + ++++R   K   + +  K SK    K  ++K K+K+K K +  H  
Sbjct: 369 KARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418



 Score = 32.6 bits (75), Expect = 0.41
 Identities = 11/49 (22%), Positives = 19/49 (38%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           KK     K     + ++KEK K+K   K + +       + K     +E
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 39.2 bits (92), Expect = 0.003
 Identities = 25/91 (27%), Positives = 48/91 (52%), Gaps = 16/91 (17%)

Query: 140 EKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEE-KEKKEEKEFKMKEDKEKPVEK 197
           +K+   K   +   K+    +++E+K+ K  E+ ++EE +EKKEEK+ + +E        
Sbjct: 246 DKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEERE-------- 297

Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
             K  K S +E++     K  +KE+ K+ +K
Sbjct: 298 -AKLAKLSPEEQR-----KLEEKERKKQARK 322



 Score = 37.2 bits (87), Expect = 0.011
 Identities = 20/80 (25%), Positives = 43/80 (53%), Gaps = 4/80 (5%)

Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK 191
           K    K   ++ +K DK  +++ +   K  E++  +E ++ K+E+K+++ E     K  K
Sbjct: 247 KLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREA----KLAK 302

Query: 192 EKPVEKIRKEEKDSEKEKKS 211
             P E+ + EEK+ +K+ + 
Sbjct: 303 LSPEEQRKLEEKERKKQARK 322



 Score = 36.8 bits (86), Expect = 0.015
 Identities = 26/83 (31%), Positives = 49/83 (59%), Gaps = 8/83 (9%)

Query: 152 DKIKNKE--KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
           DK+ N +   +  +KVDK +E+  +EEK  K  +E + +E +EK  E+ +KEE    + K
Sbjct: 246 DKLANLKLSPEVLRKVDKTREE--EEEKILKAAEEERQEEAQEK-KEEKKKEE---REAK 299

Query: 210 KSKDKDKKLKKEKLKKKKKEKER 232
            +K   ++ +K + K++KK+  +
Sbjct: 300 LAKLSPEEQRKLEEKERKKQARK 322



 Score = 36.1 bits (84), Expect = 0.028
 Identities = 16/59 (27%), Positives = 38/59 (64%), Gaps = 5/59 (8%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
           ++  +K  K  +++ +E+ ++KK++K K+    + + K  K+  E+++ K EEKE+K++
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKE----EREAKLAKLSPEEQR-KLEEKERKKQ 319



 Score = 32.6 bits (75), Expect = 0.36
 Identities = 18/69 (26%), Positives = 42/69 (60%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
             + ++K  KT +++E++ +K  ++++ ++  + KE+ K+++ + +  K   EE+ K EE
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313

Query: 183 KEFKMKEDK 191
           KE K +  K
Sbjct: 314 KERKKQARK 322



 Score = 29.1 bits (66), Expect = 4.9
 Identities = 9/49 (18%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKK-----LKKEKLKKKKKEKERSS 234
             + + K+ K  ++ E++     ++++      KKE+ KK+++E + + 
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 39.3 bits (91), Expect = 0.004
 Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 5/139 (3%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK----EEKEKKEEKEF 185
           +KK+ K K+KEK +K+K+ +DKDK K  E  K   +  +          E  E       
Sbjct: 196 EKKSKKPKKKEKKEKEKE-RDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTV 254

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 245
                  +P E    E ++++K  K K K ++ +KE+ KKKKK      H  +   +   
Sbjct: 255 SGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPV 314

Query: 246 KFGTDMEEKTKRESSPKIV 264
           + G   EE     SS K++
Sbjct: 315 QNGAVEEEPLPPMSSYKLL 333



 Score = 32.3 bits (73), Expect = 0.57
 Identities = 27/124 (21%), Positives = 51/124 (41%), Gaps = 1/124 (0%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE-KGKKEEKEKK 180
           K+ K  +K++   KKKE E  K      D          +  +        G   + E  
Sbjct: 205 KEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPD 264

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
           E K+ + +E K+ P  K +K+ K+ E++KK K              ++  +  + E+E +
Sbjct: 265 EPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEEPL 324

Query: 241 PKLT 244
           P ++
Sbjct: 325 PPMS 328


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 39.2 bits (91), Expect = 0.004
 Identities = 38/194 (19%), Positives = 86/194 (44%), Gaps = 18/194 (9%)

Query: 79  MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 138
           +  ++   P E+E       +LG    V  I    ++         + I++  K  +K +
Sbjct: 245 LKLEDNYEPSEQEL------KLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIK 298

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK--EEKEKKEEKEFKMKEDKEKPVE 196
             + K +  K D +K +N     ++K  +   K +K   E E KEE+   ++ + ++  +
Sbjct: 299 TLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHK 358

Query: 197 KIRK--------EEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKERSSHEK-EVIPKLTFK 246
           ++RK        E  + E+EK +++ DK  ++ +KL K  K ++  +    + + K   +
Sbjct: 359 QLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQ 418

Query: 247 FGTDMEEKTKRESS 260
           + + ++  T+  S 
Sbjct: 419 YDSLIQNITRSRSQ 432


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 39.3 bits (92), Expect = 0.004
 Identities = 26/164 (15%), Positives = 66/164 (40%), Gaps = 5/164 (3%)

Query: 122 KKDKSIKKDKKTHKKKEK--EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           +    + K ++   + E    +I++K ++   +K   +++ +E +  +   K + +  EK
Sbjct: 795 EIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEK 854

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
           + E     KE+ E+ +E++    +D E       K++   + +L++ +++ E    + E 
Sbjct: 855 EIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914

Query: 240 IPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 283
             K   +    +E   +  S    +  P       P       D
Sbjct: 915 KRKRLSELKAKLEALEEELSE---IEDPKGEDEEIPEEELSLED 955



 Score = 37.7 bits (88), Expect = 0.013
 Identities = 28/134 (20%), Positives = 62/134 (46%), Gaps = 11/134 (8%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
           +S  +GEI+K+    + ++   +K KE++++ ++     + +  E  K +  + E    +
Sbjct: 714 ASRKIGEIEKEIEQLEQEE---EKLKERLEELEEDLSSLEQE-IENVKSELKELEARIEE 769

Query: 174 KEEKEKKEEKEFKMKEDK--EKPVEKIRKEEKDSEKEKKSKDK-----DKKLKKEKLKKK 226
            EE   K E+     E +     + +I+ E    E+E    +      ++KL +  L+K+
Sbjct: 770 LEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKE 829

Query: 227 KKEKERSSHEKEVI 240
             EKE    +++ I
Sbjct: 830 YLEKEIQELQEQRI 843



 Score = 36.2 bits (84), Expect = 0.035
 Identities = 20/147 (13%), Positives = 67/147 (45%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
              ++ E +K +    ++    K+E E ++ + ++ DK+  + +++ K+ +   EK K +
Sbjct: 341 LEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
             E +++ ++  +  +   + +  +       E +    +++K+ K  ++KK++ + E+ 
Sbjct: 401 INELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQL 460

Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRESS 260
           + +     +  +    + +   K  S 
Sbjct: 461 AADLSKYEQELYDLKEEYDRVEKELSK 487



 Score = 35.4 bits (82), Expect = 0.072
 Identities = 29/125 (23%), Positives = 61/125 (48%), Gaps = 4/125 (3%)

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
           +L E+ K+ +  +D+    K  +EK++K K + ++ K +     +E +   E+       
Sbjct: 372 ELEEVDKEFAETRDEL---KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAA 428

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLKKEKLKKKKKEKERSSH 235
               E K  +++E+KE    +I+K+E   E+      K +++L   K +  + EKE S  
Sbjct: 429 IAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKL 488

Query: 236 EKEVI 240
           ++E+ 
Sbjct: 489 QRELA 493



 Score = 33.9 bits (78), Expect = 0.22
 Identities = 19/103 (18%), Positives = 54/103 (52%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           + E   +++   +K+++    +E+  + + + +K   + EE E++ E+E K ++   +  
Sbjct: 300 EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEY 359

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            ++++E +D   E +  DK+    +++LK  +++ E+   E  
Sbjct: 360 AELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402



 Score = 32.7 bits (75), Expect = 0.40
 Identities = 23/141 (16%), Positives = 62/141 (43%), Gaps = 15/141 (10%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKK--------------KEKEKIKKKKDKKDKDKIKNKEKD 160
            E L  +K++ S  + +    +              ++  +I+K+ ++ ++++ K KE+ 
Sbjct: 680 RERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERL 739

Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-KEKKSKDKDKKLK 219
           +E + D    + + E  + + ++     E+ E+ + K+ +   D E +   S+  + + +
Sbjct: 740 EELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAE 799

Query: 220 KEKLKKKKKEKERSSHEKEVI 240
             KL+++    E    E E  
Sbjct: 800 LSKLEEEVSRIEARLREIEQK 820



 Score = 31.6 bits (72), Expect = 1.1
 Identities = 28/113 (24%), Positives = 51/113 (45%), Gaps = 12/113 (10%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF-K 186
           KK+K   + +E E+  ++ D    +K +  E+ + ++   E+ +   +EK + E  E  K
Sbjct: 171 KKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLK 230

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
            KE  E+  E I ++    E+E            EKL ++  E E+   E E 
Sbjct: 231 EKEALERQKEAIERQLASLEEE-----------LEKLTEEISELEKRLEEIEQ 272



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 10/89 (11%)

Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKK 225
           KEK   + EE E+  E+   + ++K + +E++R+E + +E+ +    + ++ +  E LK+
Sbjct: 172 KEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKE 231

Query: 226 KKK--------EKERSSHEKEVIPKLTFK 246
           K+         E++ +S E+E + KLT +
Sbjct: 232 KEALERQKEAIERQLASLEEE-LEKLTEE 259



 Score = 30.4 bits (69), Expect = 2.4
 Identities = 30/177 (16%), Positives = 71/177 (40%), Gaps = 26/177 (14%)

Query: 77  EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS--EDLGEIKKDKSIKKDKKTH 134
           E   S  +    + E SK+++        V  I        + L  +  +K   + +   
Sbjct: 785 EARLSHSRIPEIQAELSKLEEE-------VSRIEARLREIEQKLNRLTLEKEYLEKEIQE 837

Query: 135 KKKEKEKIKKKKD---------KKDKDKIKNKEKDKEKKV-DKEKEKGKKEEKEKKEEKE 184
            ++++  +K++              K++++ + ++ E  + D E   G  +++  + E +
Sbjct: 838 LQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQ 897

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 241
            +  E K        +E +   ++K+ +  + K K E L+++  E E    E E IP
Sbjct: 898 LRELERK-------IEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIP 947



 Score = 30.4 bits (69), Expect = 2.7
 Identities = 14/102 (13%), Positives = 48/102 (47%), Gaps = 4/102 (3%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKE----KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
            ++ E+K++    +++     +E       I   + K ++ + + ++K  E K  + K +
Sbjct: 399 REINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE 458

Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
               +  K E++ + +KE+ ++  +++ K +++  + +    
Sbjct: 459 QLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQAR 500



 Score = 29.3 bits (66), Expect = 5.7
 Identities = 25/109 (22%), Positives = 57/109 (52%), Gaps = 5/109 (4%)

Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
            I +K+ + ++ + + ++ ++ + + KEK + +  E  K++E   + KE  E+ +  + +
Sbjct: 192 IIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEE 251

Query: 201 E-----EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
           E     E+ SE EK+ ++ ++ L++   K K   +E     KE I +L 
Sbjct: 252 ELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELE 300


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 37.8 bits (88), Expect = 0.005
 Identities = 25/80 (31%), Positives = 49/80 (61%), Gaps = 5/80 (6%)

Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 201
            K+ KD K ++  ++  K+ E   +KE++KGK+E+ E +E ++ ++ ED E+      K 
Sbjct: 2   EKECKDAKHENMEEDCCKENE---NKEEDKGKEEDLEFEEIEKEEIIEDSEE--SNEVKI 56

Query: 202 EKDSEKEKKSKDKDKKLKKE 221
           E+  ++  K K+++KKL+ E
Sbjct: 57  EELKDENNKLKEENKKLENE 76



 Score = 36.3 bits (84), Expect = 0.015
 Identities = 21/73 (28%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
           EK+ +    +  E+   +E E KEE      ++++   E+I KEE   + E+ ++ K ++
Sbjct: 2   EKECKDAKHENMEEDCCKENENKEED---KGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58

Query: 218 LKKEKLKKKKKEK 230
           LK E  K K++ K
Sbjct: 59  LKDENNKLKEENK 71


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 38.8 bits (91), Expect = 0.006
 Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 4/83 (4%)

Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK--EKKEEKEFKMKEDKEKPVEKIRKEEKD 204
           DK    + + +E +       E++K +K+++  EKK EKE   K   +K  E   K+ K 
Sbjct: 389 DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKG 448

Query: 205 SEKEKKSKDKDKKLKKEKLKKKK 227
            + E K  D D     EKL + +
Sbjct: 449 PDGETKKVDPD--PLGEKLARTE 469



 Score = 37.6 bits (88), Expect = 0.014
 Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           +E+E         ++ K++ K++  EKK +KE+ + K   K+K E    K K    + 
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAE-KAAAKKKAEAAAKKAKGPDGET 453



 Score = 36.1 bits (84), Expect = 0.034
 Identities = 16/54 (29%), Positives = 26/54 (48%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
           + +L   ++ K  KK +K  KK EKE+ +K   KK  +    K K  + +  K 
Sbjct: 403 NGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456



 Score = 35.7 bits (83), Expect = 0.052
 Identities = 16/53 (30%), Positives = 26/53 (49%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           KK  KK+ K + K +K++ +K   K K +   KK      + KK + +   EK
Sbjct: 412 KKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEK 464



 Score = 35.3 bits (82), Expect = 0.063
 Identities = 18/59 (30%), Positives = 28/59 (47%)

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
             E +E++ E       +++K  +K RK EK +EKE+  K   KK  +   KK K    
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451



 Score = 31.8 bits (73), Expect = 0.70
 Identities = 12/51 (23%), Positives = 26/51 (50%)

Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
              +KK ++K +K + K +K++ +     K+ +   +K KG   E +K + 
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458



 Score = 30.7 bits (70), Expect = 1.7
 Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
             E++K++KK+ +K + K + +E +K     K +   KK +    E K
Sbjct: 408 PAERKKLRKKQ-RKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454



 Score = 29.9 bits (68), Expect = 3.1
 Identities = 12/49 (24%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           E++E     +   E+   ++K+ K + KK +KE+ +K   +K+  +  K
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAE-KKAEKEEAEKAAAKKKAEAAAK 444



 Score = 29.5 bits (67), Expect = 4.1
 Identities = 16/72 (22%), Positives = 28/72 (38%), Gaps = 3/72 (4%)

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
                  E   E+         + KK +   KK +K + K +K+E E+++ +K+      
Sbjct: 388 HDKPLLAEGEEEEGENGNLSPAERKKLR---KKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444

Query: 245 FKFGTDMEEKTK 256
              G D E K  
Sbjct: 445 KAKGPDGETKKV 456


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 39.0 bits (91), Expect = 0.006
 Identities = 26/130 (20%), Positives = 74/130 (56%), Gaps = 3/130 (2%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            E L ++++     + +     +EK ++ K  +++ K +++ + ++ EK+++K  E+ K+
Sbjct: 328 EERLEKLEEKLEKLESELEELAEEKNELAKLLEERLK-ELEERLEELEKELEKALERLKQ 386

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
            E+  +E KE   +      +E+I++E ++ EKE +  +++ +  +E++KK +++  +  
Sbjct: 387 LEEAIQELKEELAELSAA--LEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLE 444

Query: 235 HEKEVIPKLT 244
            ++ +I +L 
Sbjct: 445 SKELMIAELA 454



 Score = 37.4 bits (87), Expect = 0.014
 Identities = 24/147 (16%), Positives = 70/147 (47%), Gaps = 8/147 (5%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
           L E+ K+   K ++   +  E  +  +   +  ++++K  +K +E + ++E+E+ ++E +
Sbjct: 176 LKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIE 235

Query: 178 EKKE-----EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
             +E     E+E +  E+ +  + +I   E ++ K ++ + ++ +     L++ +++ ER
Sbjct: 236 ALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELE---RLLEELEEKIER 292

Query: 233 SSHEKEVIPKLTFKFGTDMEEKTKRES 259
               +  I +L  +         + E 
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEE 319



 Score = 37.4 bits (87), Expect = 0.017
 Identities = 26/163 (15%), Positives = 78/163 (47%), Gaps = 16/163 (9%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
             ++L E+ + +  + +++  ++KE+ +++++ ++ +K+  + +E+  E    +E  K +
Sbjct: 471 HEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEE 530

Query: 174 KEEKEKKEEK---EFKMKEDK-------------EKPVEKIRKEEKDSEKEKKSKDKDKK 217
            EEK +K E    E +  ++K             E  ++++++  ++    +  K++ ++
Sbjct: 531 LEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEE 590

Query: 218 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 260
           L++   + KKK KE      ++   L     ++ E + +    
Sbjct: 591 LRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEE 633



 Score = 36.3 bits (84), Expect = 0.032
 Identities = 28/166 (16%), Positives = 78/166 (46%), Gaps = 21/166 (12%)

Query: 122 KKDKSIKKDKKTHKKKEKE------KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           ++ + +++  K  KKK KE      ++++     +  + +N+ ++ E++++ E EK   +
Sbjct: 586 EELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQ 645

Query: 176 E--KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK---------KEKLK 224
              +E  +    +++E  E+   +IR+E +  E E++ ++K ++L+         +E+L+
Sbjct: 646 AELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELE 705

Query: 225 KKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 270
           +  K+        E +         ++EE  K     +  ++ ++ 
Sbjct: 706 ELLKKLGEIEQLIEELESRK----AELEELKKELEKLEKALELLEE 747



 Score = 35.9 bits (83), Expect = 0.044
 Identities = 22/103 (21%), Positives = 54/103 (52%), Gaps = 3/103 (2%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
            + H+K+  E  + + ++ +++  + KE+ + ++  +E EK  +E +E+  E     +  
Sbjct: 468 PEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEAL 527

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           KE   E   K EK     ++ ++  +KL+ ++LK++ ++ E  
Sbjct: 528 KE---ELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDR 567



 Score = 35.9 bits (83), Expect = 0.046
 Identities = 22/127 (17%), Positives = 65/127 (51%), Gaps = 4/127 (3%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK- 174
            +L E++++    +++    + ++  I +     +K  +  +E  +E + +  +    + 
Sbjct: 424 RELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELEL 483

Query: 175 ---EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
              EE+  +E++E +++E+ E+  +++R+ E++  +  + ++  K+  +EKL+K +   E
Sbjct: 484 EELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLE 543

Query: 232 RSSHEKE 238
                KE
Sbjct: 544 ELEELKE 550



 Score = 35.1 bits (81), Expect = 0.072
 Identities = 18/115 (15%), Positives = 57/115 (49%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            E L +++      ++ K   + ++ K + ++ +    ++K   ++      +++E  + 
Sbjct: 532 EEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEEL 591

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
            E+ K+ +K+ K  E++   +E++ +  + SE E + ++ +++L+ E  K   + 
Sbjct: 592 RERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQA 646



 Score = 34.7 bits (80), Expect = 0.12
 Identities = 28/151 (18%), Positives = 75/151 (49%), Gaps = 2/151 (1%)

Query: 89  EKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD- 147
           ++   ++K   L +  +    +K+   E     +  + +++  +  ++ E+E  + +++ 
Sbjct: 248 KERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEEL 307

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           +  +  ++  E+  EK    E+   K EEK +K E E +   +++  + K+  EE+  E 
Sbjct: 308 EGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKL-LEERLKEL 366

Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           E++ ++ +K+L+K   + K+ E+     ++E
Sbjct: 367 EERLEELEKELEKALERLKQLEEAIQELKEE 397



 Score = 33.2 bits (76), Expect = 0.30
 Identities = 22/102 (21%), Positives = 56/102 (54%), Gaps = 3/102 (2%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
            +K+  E++K +  + +  +++  +  +E+  + E+   + EEK ++ E+  +  E+ E+
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305

Query: 194 PVEKIRKEEKDSE---KEKKSKDKDKKLKKEKLKKKKKEKER 232
            +E +R   ++ E   ++ KS ++  +  +EKL+K + E E 
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEE 347



 Score = 32.0 bits (73), Expect = 0.79
 Identities = 25/107 (23%), Positives = 57/107 (53%), Gaps = 1/107 (0%)

Query: 135 KKKEKEKIKKKKDKKDK-DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
           K KE+++I  +    +K +K+    K+  K+   + E+ + +  E  E+ E  ++  +E+
Sbjct: 152 KPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEE 211

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
             E  + EE   E+E++  +++ +  +E+L + ++EKER    K  +
Sbjct: 212 LKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARL 258



 Score = 31.7 bits (72), Expect = 0.98
 Identities = 24/105 (22%), Positives = 54/105 (51%), Gaps = 3/105 (2%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE-KEKGKKEEKEKKEEKEFKMKEDKEKP 194
           +  KE IK+ K K ++ + +  E  ++ +   E  E+  KE K+ +E +E + +E+ E+ 
Sbjct: 174 ELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQE 233

Query: 195 VEKIRKEEKDSEKEKKSKDKDKK--LKKEKLKKKKKEKERSSHEK 237
           +E + +   + E+EK+  ++ K   L+ E L+ +  +       +
Sbjct: 234 IEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRE 278



 Score = 31.7 bits (72), Expect = 1.0
 Identities = 25/124 (20%), Positives = 69/124 (55%), Gaps = 3/124 (2%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           + +++   + EK  ++ + ++  +  ++  E+  E+   + + + ++ E E++ E++ + 
Sbjct: 630 EAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEE 689

Query: 188 KEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLK--KEKLKKKKKEKERSSHEKEVIPKLT 244
            E  E+ +E++R+E E+  +K  + +   ++L+  K +L++ KKE E+     E++ +L 
Sbjct: 690 LEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEELR 749

Query: 245 FKFG 248
            K G
Sbjct: 750 EKLG 753



 Score = 30.5 bits (69), Expect = 2.0
 Identities = 22/130 (16%), Positives = 62/130 (47%), Gaps = 9/130 (6%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK-------- 171
             +  K   ++K    +   E++++ K+K    ++K + +  E ++ + KE         
Sbjct: 523 LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLR 582

Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKI-RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            +KEE E+  E+  ++K+  ++  E++ + EE     E    + + +  +E+L+ + ++ 
Sbjct: 583 TRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKL 642

Query: 231 ERSSHEKEVI 240
              +  +E++
Sbjct: 643 NLQAELEELL 652



 Score = 30.5 bits (69), Expect = 2.2
 Identities = 15/99 (15%), Positives = 49/99 (49%), Gaps = 3/99 (3%)

Query: 145 KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
           +  K+   + K K ++ E ++ +  E  +   +  +EE +   +  K + +++ ++EE+ 
Sbjct: 174 ELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELK---ELKKLEEIQEEQEEEEL 230

Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
            ++ +  +++  +L++EK + ++ +      E   +  L
Sbjct: 231 EQEIEALEERLAELEEEKERLEELKARLLEIESLELEAL 269


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 38.7 bits (90), Expect = 0.006
 Identities = 24/105 (22%), Positives = 43/105 (40%), Gaps = 5/105 (4%)

Query: 122 KKDKSIKKDKKTHKKK--EKEKIKKKKDKKDKDKIKN--KEKDKEKKVDKEKEKGKKEEK 177
           K  K+ KK  K H K   ++ K   +  K    K +N   E   E   +   +  +K  K
Sbjct: 3   KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSK 62

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
              +      +E+++   E I K   +S+K   ++   +K +KE 
Sbjct: 63  LYSDVDSSSSEEEEDG-SESISKLNVNSKKISLNQVSTQKWRKEL 106


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 37.4 bits (87), Expect = 0.006
 Identities = 22/81 (27%), Positives = 53/81 (65%)

Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
           KK   K + K++ K+  ++++  +E+E+ ++++ E+K E E K +E+ E+  EK ++EE+
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60

Query: 204 DSEKEKKSKDKDKKLKKEKLK 224
             E+E++++ + ++ +K K  
Sbjct: 61  RKEREEQARKEQEEYEKLKSS 81



 Score = 35.8 bits (83), Expect = 0.021
 Identities = 21/81 (25%), Positives = 53/81 (65%), Gaps = 1/81 (1%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
           K   KK+ K + K+ + ++ ++  + + ++++K  +K + + K+EE+ ++E ++ K +E+
Sbjct: 2   KIGAKKRAKLEEKQAR-RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60

Query: 191 KEKPVEKIRKEEKDSEKEKKS 211
           +++  E+ RKE+++ EK K S
Sbjct: 61  RKEREEQARKEQEEYEKLKSS 81



 Score = 32.4 bits (74), Expect = 0.27
 Identities = 23/70 (32%), Positives = 46/70 (65%), Gaps = 1/70 (1%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
            KK  K +EK  + ++ + EE+E + ++  E+  E  RKEE++ E E++ K K+++ +KE
Sbjct: 5   AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELE-EEREKKKEEEERKE 63

Query: 222 KLKKKKKEKE 231
           + ++ +KE+E
Sbjct: 64  REEQARKEQE 73



 Score = 31.2 bits (71), Expect = 0.68
 Identities = 19/70 (27%), Positives = 48/70 (68%), Gaps = 2/70 (2%)

Query: 168 EKEKGKKEEKE-KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
            K++ K EEK+ +++++E + +E +E+   + ++E +  E+E+  ++++KK K+E+ +K+
Sbjct: 5   AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKK-KEEEERKE 63

Query: 227 KKEKERSSHE 236
           ++E+ R   E
Sbjct: 64  REEQARKEQE 73


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 38.2 bits (89), Expect = 0.007
 Identities = 31/138 (22%), Positives = 65/138 (47%), Gaps = 16/138 (11%)

Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKT----------HKKKEKEKIKKKKDKKDKD---KI 154
           +I++V  S  L  I ++    ++K+            +       +   ++K+ +   + 
Sbjct: 298 DIIQV--SRQLLNILREIKFAEEKRLVCKLGSASIATRFSSPNVQENDVEEKNDNSNVQQ 355

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE-KIRKEEKDSEKEKKSKD 213
           K K+K++ K  ++++E  + E++ K+  +E K K D    V   + + E+D +K K S D
Sbjct: 356 KEKKKEESKAKEEKQEDIEFEKRFKELMEELKEKGDLSIFVALSLSEVEEDGDKVKISFD 415

Query: 214 KDKKLKKEKLKKKKKEKE 231
             K +  E +K+K  E E
Sbjct: 416 SSKAMHYELMKEKLPELE 433


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 38.3 bits (89), Expect = 0.007
 Identities = 27/131 (20%), Positives = 59/131 (45%), Gaps = 1/131 (0%)

Query: 103 TPMVPNIVKVTSSEDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 161
             ++   VKV S++   +  K+  + K + KT  K    K    K          K+  +
Sbjct: 169 KSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKE 228

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           +K+  +  E   KEE E++  K   + ED+      + ++E + E +   +  D + + E
Sbjct: 229 KKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETE 288

Query: 222 KLKKKKKEKER 232
           + +K+K+++ +
Sbjct: 289 EKEKEKRKRLK 299



 Score = 33.3 bits (76), Expect = 0.29
 Identities = 34/177 (19%), Positives = 76/177 (42%), Gaps = 11/177 (6%)

Query: 54  PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 113
                  S  ++     +  S   K  +KEK   KE   S VK+     +     ++   
Sbjct: 199 KTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRD-VILED 257

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
            S +   + +D+   + K + ++ + E+  ++K+K+ + ++K   +D++          +
Sbjct: 258 ESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDED----------E 307

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            EE E   E   + +E +E     + K+E++ E+   S D  ++  + ++ KKK  K
Sbjct: 308 DEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFK 364


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 38.5 bits (90), Expect = 0.007
 Identities = 25/112 (22%), Positives = 45/112 (40%), Gaps = 21/112 (18%)

Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDSEKE 208
           K K  N  K+++         G K  KE+++  +   ++++    P+E         +KE
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLE--------YDKE 263

Query: 209 KKSKDKDKKLKKEKL-----------KKKKKEKERSSHEKEVIPKLTFKFGT 249
           K  K KD K K +KL           K K    +  + + + I  + +  GT
Sbjct: 264 KLEKIKDLKEKAKKLGISIILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGT 315



 Score = 35.5 bits (82), Expect = 0.061
 Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 5/71 (7%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKVDKEKEKGKKEEKEKKEE 182
           D  IK  +    K+EK       +  +K+  +  K  D   +++       + +KEK E 
Sbjct: 208 DTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLE- 266

Query: 183 KEFKMKEDKEK 193
              K+K+ KEK
Sbjct: 267 ---KIKDLKEK 274



 Score = 33.9 bits (78), Expect = 0.16
 Identities = 19/93 (20%), Positives = 37/93 (39%), Gaps = 13/93 (13%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK--------KVDKEKEKGKKEE 176
           KS + +    +K     +    +K +K++ K  +   E         + DKEK +  K+ 
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271

Query: 177 KEKKEEKEFKMKE-----DKEKPVEKIRKEEKD 204
           KEK ++    +         +    KI+ E+ D
Sbjct: 272 KEKAKKLGISIILFDDMTKNKTTNYKIQNEDPD 304



 Score = 33.9 bits (78), Expect = 0.21
 Identities = 23/100 (23%), Positives = 40/100 (40%), Gaps = 10/100 (10%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK--KVDKEKEKGKK--- 174
           E K + S   +      KE++K     ++ +   +   E DKEK  K+   KEK KK   
Sbjct: 221 EEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGI 280

Query: 175 -----EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
                ++  K +   +K++ +    +  I      S K K
Sbjct: 281 SIILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGTSGKPK 320


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 38.2 bits (89), Expect = 0.008
 Identities = 27/202 (13%), Positives = 65/202 (32%), Gaps = 31/202 (15%)

Query: 72  KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 131
           K   ++   SK          ++    +      + +I          EI  D  I +  
Sbjct: 764 KARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDY-ILELL 822

Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KMKE 189
           +T   K          +  K+ I+   K+  + + ++K          K  +     + +
Sbjct: 823 QTFNDKNGA------YESLKELIERIIKEWIEDLRQKK----------KLIERLIEAINQ 866

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK------------KEKERSSHEK 237
            + K ++   K E+     KK ++  +  ++E L +++            K  E  +   
Sbjct: 867 YRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELV 926

Query: 238 EVIPKLTFKFGTDMEEKTKRES 259
           E + K       + ++   +E 
Sbjct: 927 EKLAKEKSLREKNKDDWKAKEE 948



 Score = 34.3 bits (79), Expect = 0.15
 Identities = 27/120 (22%), Positives = 45/120 (37%), Gaps = 11/120 (9%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK- 173
            E +  I K+      +K    +   +   +   K  D  +  E+       +E+ K   
Sbjct: 836 KELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFA 895

Query: 174 KEEKEKKEEKEFKM---------KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
           +EE   +EE  F            E+  + VEK+ KE+   EK K      K+  + KLK
Sbjct: 896 EEEGLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDDWKA-KEEVEAKLK 954



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 25/145 (17%), Positives = 51/145 (35%), Gaps = 9/145 (6%)

Query: 82  KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 141
            E    +E    + ++++      +   +     +    ++    +        K   + 
Sbjct: 453 VELELIEESIKEEAEELDE-RIEEITEDILEKIKKKTKNLEFLAMLAVRLIRAAKDIYDH 511

Query: 142 IKKKKDKKDKDKI-----KNKEKDKEKKVDKEKEKGKKEEKE---KKEEKEFKMKEDKEK 193
            KK++    K  +     K   +  E ++    +   KE  E   K    EF+   DK++
Sbjct: 512 FKKEEVFDLKAMVVASSRKVAVELYEAEIAARLDWHSKESLEGAIKDYNTEFETDFDKKQ 571

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKL 218
              K++ E+KD  K  K KD    L
Sbjct: 572 SHAKLKDEKKDLIKRFKLKDDPLDL 596


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 38.2 bits (89), Expect = 0.008
 Identities = 22/62 (35%), Positives = 40/62 (64%), Gaps = 2/62 (3%)

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           EK+ EKE   +E++E+  EK  KEE++   +K+ +  +++ K+EK KK KK KE ++  +
Sbjct: 28  EKEVEKEVPDEEEEEEKEEK--KEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85

Query: 238 EV 239
            +
Sbjct: 86  LL 87



 Score = 34.0 bits (78), Expect = 0.15
 Identities = 26/71 (36%), Positives = 42/71 (59%), Gaps = 8/71 (11%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
           KE +KE   D+E+E+ K+E+KE++E+   K +E  E       +EEK+ +K+K  K K+ 
Sbjct: 29  KEVEKEVP-DEEEEEEKEEKKEEEEKTTDKEEEVDE-------EEEKEEKKKKTKKVKET 80

Query: 217 KLKKEKLKKKK 227
             + E L K K
Sbjct: 81  TTEWELLNKTK 91



 Score = 34.0 bits (78), Expect = 0.17
 Identities = 27/67 (40%), Positives = 41/67 (61%), Gaps = 8/67 (11%)

Query: 165 VDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
           V+KE EK   +E+E++E++E K +E+K    E    EE D E+EK+ K   KK K +K+K
Sbjct: 27  VEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKE----EEVDEEEEKEEK---KK-KTKKVK 78

Query: 225 KKKKEKE 231
           +   E E
Sbjct: 79  ETTTEWE 85



 Score = 32.4 bits (74), Expect = 0.47
 Identities = 21/52 (40%), Positives = 40/52 (76%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
           EKE  K+  D++++++ + K++++EK  DKE+E  ++EEKE+K++K  K+KE
Sbjct: 28  EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 38.2 bits (89), Expect = 0.009
 Identities = 25/77 (32%), Positives = 35/77 (45%), Gaps = 4/77 (5%)

Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
            ++K KK D  K+ GK+ +  KK   E       EK  +     +KD+ K KK   K +K
Sbjct: 738 AREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPK-AAKKDARKAKKPSAKTQK 796

Query: 218 LK---KEKLKKKKKEKE 231
           +    K K   KKK  E
Sbjct: 797 IAAATKAKRAAKKKVAE 813



 Score = 33.2 bits (76), Expect = 0.33
 Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           K  +EK KK    K   K +   K    + D      KK + +  ++   K K+   K  
Sbjct: 736 KTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKT- 794

Query: 196 EKIRKEEKDSEKEKKSKDKD 215
           +KI    K +++  K K  +
Sbjct: 795 QKIAAATK-AKRAAKKKVAE 813



 Score = 32.0 bits (73), Expect = 0.73
 Identities = 21/102 (20%), Positives = 43/102 (42%), Gaps = 12/102 (11%)

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD--KIKNKEKDKEKKVDKEKEKGKK 174
           D   I  +++ +   KT ++K K+    KK  K +   K  N E D   + +K+ +    
Sbjct: 721 DFSLISSERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAA 780

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
           ++  +K           +KP  K +K    ++ ++ +K K  
Sbjct: 781 KKDARK----------AKKPSAKTQKIAAATKAKRAAKKKVA 812


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 37.5 bits (87), Expect = 0.011
 Identities = 17/100 (17%), Positives = 44/100 (44%), Gaps = 1/100 (1%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
            + K+    +K + K  K +I  K +  E+++++E  + +     +   +   +K  K+ 
Sbjct: 330 AEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKG 389

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
            ++    E+  SE E +SK    +++     +  +E  + 
Sbjct: 390 LIDASPNEDTPSENE-ESKGSPPQVEATTTAEPNREPSQE 428



 Score = 32.5 bits (74), Expect = 0.46
 Identities = 16/99 (16%), Positives = 38/99 (38%), Gaps = 5/99 (5%)

Query: 120 EIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD--KEKEKGKKEE 176
           EIKK   + K + K  KK+  +K +  + + +++  + +     +        +  KK  
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGL 390

Query: 177 KEKKE--EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
            +     +   + +E K  P +       +  +E   +D
Sbjct: 391 IDASPNEDTPSENEESKGSPPQVEATTTAEPNREPSQED 429



 Score = 31.0 bits (70), Expect = 1.3
 Identities = 22/142 (15%), Positives = 55/142 (38%), Gaps = 4/142 (2%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
           +  +++K+KEK       +           +  ++  E  + KK    +K E + + KE 
Sbjct: 291 RAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEI 350

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 250
            +K     R+  +++ +E+ +     + ++  +K KKK    +S  ++   +     G+ 
Sbjct: 351 AQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGSP 410

Query: 251 MEEKTKRESSPKIVIKPVKSPS 272
            + +    + P        S  
Sbjct: 411 PQVEATTTAEP----NREPSQE 428


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 37.7 bits (88), Expect = 0.011
 Identities = 34/200 (17%), Positives = 88/200 (44%), Gaps = 28/200 (14%)

Query: 54  PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 113
              T   ++     S ++     E++  +       KE +    V+  +  +  +++   
Sbjct: 99  AKATSVAAEATTPKSIQELVEALEELLEEL-----LKETASDPVVQ-ELVSIFNDLIDS- 151

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKE--KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
             ED  +   +  I   K+   +  K+  ++K +++++ +  +K K ++   K+++E   
Sbjct: 152 IKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLA 211

Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKIRK---EEKDSEKEKKSKDKDKKLKKE------- 221
             + ++   E+   +++ + E+  E++RK   E+   E E++++  ++KLK E       
Sbjct: 212 RLESKEAALEK---QLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIE 268

Query: 222 ------KLKKKKKEKERSSH 235
                 K  K+K E+ER+  
Sbjct: 269 LQREFNKEIKEKVEEERNGR 288



 Score = 36.9 bits (86), Expect = 0.019
 Identities = 41/195 (21%), Positives = 82/195 (42%), Gaps = 13/195 (6%)

Query: 53  HPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKV 112
            P    P   +   V+   K    E    K        E  +V + E   T +       
Sbjct: 54  DPTLRDPQISVVSSVTKLSKGPKDEPKQVKIPRQSGLSE--EVAEAEAKATSVAAEATTP 111

Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEKDKEKKVDKEKEK 171
            S ++L E  ++   +  K+T      +++     D  D  K  N + D E  +   KE+
Sbjct: 112 KSIQELVEALEELLEELLKETASDPVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEE 171

Query: 172 -----GKKEEKEKKEEKEFK--MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
                 K  E + +EE+E +  +KE +E+ + K+ +E     + K++  + K+L+ E   
Sbjct: 172 LDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALE-KQLRLEF-- 228

Query: 225 KKKKEKERSSHEKEV 239
           +++KE+ R  +E+++
Sbjct: 229 EREKEELRKKYEEKL 243


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 37.2 bits (87), Expect = 0.017
 Identities = 33/149 (22%), Positives = 59/149 (39%), Gaps = 20/149 (13%)

Query: 89  EKEYSKVKDV---------ELGVTPMVP-NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 138
           EKE+  +K++         E+ ++P  P  +V V S E    ++ ++   K       +E
Sbjct: 721 EKEFELLKEIISAIRNLRAEMNLSPSAPLKVVLVGSEELEDRLEANEDDIK--GLANLEE 778

Query: 139 KEKIKKKKDK-----KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK-- 191
            E +    ++                     +D   E  + E++ +K EKE    E K  
Sbjct: 779 LEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKLS 838

Query: 192 -EKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
            E  V K  +E  + EKEK ++ + K  K
Sbjct: 839 NEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867



 Score = 31.8 bits (73), Expect = 0.85
 Identities = 28/108 (25%), Positives = 51/108 (47%), Gaps = 9/108 (8%)

Query: 90  KEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK---KEKEKIKKKK 146
           K  + ++++E+         V+VT+     E+    +   D         KE EK++K+ 
Sbjct: 771 KGLANLEELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEI 830

Query: 147 DKKDKDKIKNK---EKDKEKKVDKEKEK--GKKEEKEKKEEKEFKMKE 189
           D+ +K K+ N+    K  E+ V+KEKEK    + +  K EE+   +K 
Sbjct: 831 DRIEK-KLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKLEERLAVLKA 877



 Score = 29.1 bits (66), Expect = 5.4
 Identities = 18/96 (18%), Positives = 38/96 (39%), Gaps = 2/96 (2%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
           I+     E   E+       +             +  + +K+ +K++ +    EKK+  E
Sbjct: 781 ILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKLSNE 840

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
               K  E+  ++EKE K+ E + K + K+ +    
Sbjct: 841 GFVAKAPEEVVEKEKE-KLAEYQVK-LAKLEERLAV 874


>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain.  This family of domains
           contain a P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 320

 Score = 36.6 bits (85), Expect = 0.020
 Identities = 20/122 (16%), Positives = 40/122 (32%), Gaps = 10/122 (8%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           K+   +++   KK    K           +K   K    +     N EK+ + K+     
Sbjct: 130 KIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLEKNIDDKLS---- 185

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
               E+    + ++  + E       K    E + +  K  K  D  L K ++ K   + 
Sbjct: 186 NIFNEKLSNLKNEKISINEK-----IKSILNEINPDS-KFEKKLDLNLIKLEIVKSNSDN 239

Query: 231 ER 232
           + 
Sbjct: 240 DS 241



 Score = 30.1 bits (68), Expect = 2.3
 Identities = 18/103 (17%), Positives = 41/103 (39%), Gaps = 1/103 (0%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           +  +KK     +K ++ K  KKKD      I   +   +K + K  +     +   ++  
Sbjct: 121 EYILKKILDKIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLEKNI 180

Query: 184 EFKMKEDKEKPVEKIRKEE-KDSEKEKKSKDKDKKLKKEKLKK 225
           + K+     + +  ++ E+   +EK K   ++     K + K 
Sbjct: 181 DDKLSNIFNEKLSNLKNEKISINEKIKSILNEINPDSKFEKKL 223


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 36.2 bits (84), Expect = 0.023
 Identities = 19/80 (23%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 118 LGEIKKDKSIKKDK-KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
           L   +++  + K + +  + K+K+K KKKK KK+     + E      V  E   G  + 
Sbjct: 162 LNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQV 221

Query: 177 KEKKEEKEFKMKEDKEKPVE 196
            E K+ K+ +      +  E
Sbjct: 222 GEAKKLKKKRSIAPDNEKSE 241



 Score = 28.1 bits (63), Expect = 8.3
 Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 5/91 (5%)

Query: 149 KDKDKIK---NKEKDKEKKV--DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
            ++D +     +E+ +  K   ++E+ K KK++K+KK +K        E  V      E 
Sbjct: 155 SEEDVVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTEL 214

Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
            S   +  + K  K K+      +K +   S
Sbjct: 215 SSGAGQVGEAKKLKKKRSIAPDNEKSEVYKS 245


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 36.3 bits (84), Expect = 0.026
 Identities = 27/126 (21%), Positives = 54/126 (42%), Gaps = 8/126 (6%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE----KKVDKEKEKGKKEE 176
           I + K  K       K+ +  ++ +  K +  ++ N EKDKE    +K   E  K  +++
Sbjct: 74  INQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQ 133

Query: 177 KEKKEEKEFKMKEDKEK----PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
            +   E   K +E+ +       E  +K E+  E E++  ++        L+ K+ E++R
Sbjct: 134 LQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQR 193

Query: 233 SSHEKE 238
            S    
Sbjct: 194 ESKWAI 199



 Score = 34.8 bits (80), Expect = 0.091
 Identities = 28/149 (18%), Positives = 72/149 (48%), Gaps = 7/149 (4%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE--KEKG 172
            ED   I       + +   K+ +    KKK+  + ++++ N++K+++   +++  + + 
Sbjct: 34  QEDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFEL 93

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
             +++  K E    +  +K+K +E + KE  E   E +K+ ++  + ++K++   K +E+
Sbjct: 94  ALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEER 153

Query: 231 ERSSHEKEVIPKLTF---KFGTDMEEKTK 256
            +  +EK++   L     KF   + E   
Sbjct: 154 LKFENEKKLEESLELEREKFEEQLHEANL 182



 Score = 33.3 bits (76), Expect = 0.24
 Identities = 25/129 (19%), Positives = 56/129 (43%), Gaps = 10/129 (7%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKK-----DKDKIKNKEKDKEKKVDKEKEKGKK 174
           E  K   +    +  K +  E +  +KDK+      +    +KE  K+ +   E  + K+
Sbjct: 86  EQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKR 145

Query: 175 EEKEKKE----EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK-KE 229
           E  + +E    E E K++E  E   EK  ++  ++  + + K+ +++ + +    KK K 
Sbjct: 146 ENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLKR 205

Query: 230 KERSSHEKE 238
           +     ++ 
Sbjct: 206 RAELGSQQV 214



 Score = 30.6 bits (69), Expect = 1.8
 Identities = 21/92 (22%), Positives = 44/92 (47%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           E L +   + S +  K+     E  + K++ +K ++      EK  E+ ++ E+EK +++
Sbjct: 117 ELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQ 176

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
             E   + EFK  E++ +    I K+ K   +
Sbjct: 177 LHEANLDLEFKENEEQRESKWAILKKLKRRAE 208


>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 36.8 bits (85), Expect = 0.027
 Identities = 14/66 (21%), Positives = 37/66 (56%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           K+ + +I+ K   ++ ++D   EKG+   ++ K+    K++ ++ + +EKI K +   ++
Sbjct: 362 KEAQAEIEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKIDKAQAALKQ 421

Query: 208 EKKSKD 213
            + + D
Sbjct: 422 AQSAFD 427


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 35.1 bits (81), Expect = 0.027
 Identities = 15/55 (27%), Positives = 26/55 (47%)

Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
           E+ E+      ++E   E + +SK+K K+   +   +K+K K      K   PKL
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKL 153



 Score = 31.3 bits (71), Expect = 0.48
 Identities = 19/86 (22%), Positives = 32/86 (37%), Gaps = 16/86 (18%)

Query: 102 VTPMV----------PNIVKV------TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKK 145
           V PMV          P + K+      +  E+    ++     + +   + K + K KKK
Sbjct: 67  VGPMVQQYGPLVRNLPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKK 126

Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEK 171
           ++       K K K + KK    K K
Sbjct: 127 REVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 31.3 bits (71), Expect = 0.51
 Identities = 18/61 (29%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
              +++ + E+E   + E+E   E + + KE K++ V K  K EK+  K +  K K  K 
Sbjct: 93  SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKP-KTEKEKPKTEPKKPKPSKP 151

Query: 219 K 219
           K
Sbjct: 152 K 152



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 15/58 (25%), Positives = 30/58 (51%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
            ++++E + +   E  +++  E K E + K K +  KP  +  K + + +K K SK K
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 28.2 bits (63), Expect = 5.8
 Identities = 15/77 (19%), Positives = 29/77 (37%), Gaps = 11/77 (14%)

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 256
           KI +E   S+ E++  +++   + E+    + + E    +K  +PK              
Sbjct: 86  KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK-----------PKT 134

Query: 257 RESSPKIVIKPVKSPSP 273
            +  PK   K  K   P
Sbjct: 135 EKEKPKTEPKKPKPSKP 151


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 36.2 bits (84), Expect = 0.028
 Identities = 40/143 (27%), Positives = 60/143 (41%), Gaps = 5/143 (3%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           KK K+  KKK  + I+K K KK   K   K   K+    K  +  K EE    + K+ K 
Sbjct: 141 KKIKEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKN 200

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK-----EKERSSHEKEVIPK 242
           K+   K +    K+++  +K  K K  +K  +K   K K K     EK +     ++I K
Sbjct: 201 KQKTLKNILYNLKDKELEKKGYKKKQIEKFFQKIYEKYKNKPHFIIEKNKYKDLNKIIGK 260

Query: 243 LTFKFGTDMEEKTKRESSPKIVI 265
               F    +   K   + KI I
Sbjct: 261 FKKSFKKKKKNSKKNYENIKINI 283



 Score = 34.7 bits (80), Expect = 0.081
 Identities = 36/143 (25%), Positives = 59/143 (41%), Gaps = 29/143 (20%)

Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK---------------------V 165
           +   KK   +   +  K+KK ++ K ++    K K  K                      
Sbjct: 85  LNYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIK 144

Query: 166 DKEKEKGKKEEKEKKEEKEFK--------MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
           ++ K+K  K  ++ K +K FK         K+ K K ++ I+ EE    K KK K+K K 
Sbjct: 145 EESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKT 204

Query: 218 LKKEKLKKKKKEKERSSHEKEVI 240
           LK      K KE E+  ++K+ I
Sbjct: 205 LKNILYNLKDKELEKKGYKKKQI 227


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 36.5 bits (84), Expect = 0.033
 Identities = 30/135 (22%), Positives = 52/135 (38%), Gaps = 12/135 (8%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
           K   + K+  K KEKG  ++ E K E      E  E+   K R+E+        +  K K
Sbjct: 792 KAAARAKQ--KPKEKGPNDK-EIKIESPSVETE-GERCTIKQREEKGIDAPAILNVKKKK 847

Query: 217 KLKKEKLKKKKKEKERS---SHEKEVIPKLTFKFGTDMEEKTKRESSPKIV-----IKPV 268
             K + + +   ++ R      +   +P   FK G  +    + +    ++     +K V
Sbjct: 848 PYKVDNMTRILPQQSRYISFIKDDRFVPVRKFKGGVVVLRDREPKEPVALIETVRQMKDV 907

Query: 269 KSPSPPPVAAADEYD 283
            +P P P    D  D
Sbjct: 908 NAPLPTPFKVDDNVD 922



 Score = 36.1 bits (83), Expect = 0.046
 Identities = 19/112 (16%), Positives = 40/112 (35%), Gaps = 5/112 (4%)

Query: 71  DKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKD 130
             +    K    E    +E EY +     +       ++ KV ++     IK     K+ 
Sbjct: 746 RGSQAIPKFCFNEDLEEEEFEYPR-----MYEEASGKSVRKVNTAVLSTTIKAAARAKQK 800

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
            K     +KE   +    + + +    ++ +EK +D       K++K  K +
Sbjct: 801 PKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852



 Score = 34.9 bits (80), Expect = 0.082
 Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 5/63 (7%)

Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE-EKEFKMKEDK 191
           T K   + K +K K+K   DK    E      V+ E E+   +++E+K  +    +   K
Sbjct: 790 TIKAAARAK-QKPKEKGPNDKEIKIES---PSVETEGERCTIKQREEKGIDAPAILNVKK 845

Query: 192 EKP 194
           +KP
Sbjct: 846 KKP 848


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 36.2 bits (84), Expect = 0.034
 Identities = 44/202 (21%), Positives = 71/202 (35%), Gaps = 19/202 (9%)

Query: 72  KTSPKEKMSSKEKTSPKEKEYSKVKDVELGV--TPMVPNIVKVTSSEDLGEIKKDKS--- 126
            +  K           +  +   V  V  G   T  V N+ +    ED+    KDK    
Sbjct: 370 NSHAKGASYGDNYEGDESDKIKVVAYVGGGKVETDSVWNLFREKFEEDI-ANYKDKEFYN 428

Query: 127 -------IKKDKKTHKKKEKEK-IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
                  IK   +   K+  EK + K K+K     I      +EK V K  +K KK    
Sbjct: 429 CTEGGARIKGTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDF 488

Query: 179 KKE-EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS-HE 236
            KE E +       ++ +EK+  + +  EK     D  K+L  E      +E  + + H 
Sbjct: 489 IKEFETDAL---KLQEILEKVDSKSEKLEKISAKIDNIKELFDESKMSILQEILQPALHH 545

Query: 237 KEVIPKLTFKFGTDMEEKTKRE 258
            E+     +K     +E    +
Sbjct: 546 IELNIARIYKLNIKDKEDFLNK 567



 Score = 28.9 bits (65), Expect = 6.2
 Identities = 29/146 (19%), Positives = 48/146 (32%), Gaps = 16/146 (10%)

Query: 98  VELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE---KEKIKKKKDKKDKDKI 154
            EL V     NI+ +   +DL       S  K        E    +KIK         K+
Sbjct: 345 YELAVYLGFKNIIFI--GQDLAYTNDGNSHAKGASYGDNYEGDESDKIKVVAYV-GGGKV 401

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEK----------KEEKEFKMKEDKEKPVEKIRKEEKD 204
           +        +   E++    ++KE           K   E   KE  EK ++K + +   
Sbjct: 402 ETDSVWNLFREKFEEDIANYKDKEFYNCTEGGARIKGTIELPFKEVCEKLLDKDKNKPFI 461

Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEK 230
                 +  ++K + K   K KK   
Sbjct: 462 KLILLSNNSQEKAVLKLLQKIKKNND 487


>gnl|CDD|222406 pfam13831, PHD_2, PHD-finger.  PHD folds into an interleaved type
           of Zn-finger chelating 2 Zn ions in a similar manner to
           that of the RING and FYVE domains. Several PHD fingers
           have been identified as binding modules of methylated
           histone H3.
          Length = 33

 Score = 31.8 bits (73), Expect = 0.035
 Identities = 11/33 (33%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query: 305 PMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKC 337
           P++ C  C    H  C G V E     W C +C
Sbjct: 2   PLLVCSKCSVAVHASCYG-VPELSDGPWLCRRC 33


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 35.8 bits (83), Expect = 0.036
 Identities = 21/97 (21%), Positives = 42/97 (43%), Gaps = 3/97 (3%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP--- 194
           E  K + ++  K KD      KD  +K  +E++  KK E+      E K +E K+     
Sbjct: 60  ELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEI 119

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
            E +R E+    + ++   +   L K++++   +   
Sbjct: 120 AEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNL 156



 Score = 29.6 bits (67), Expect = 3.7
 Identities = 24/119 (20%), Positives = 50/119 (42%), Gaps = 5/119 (4%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
            V+VT    +    +    ++  K  K  E + +K   +KK +++   K+ ++      E
Sbjct: 48  AVEVTKLAAVDSELRKFQTREAAKV-KDAEAQLLKDPAEKKREERKAAKKIERSTPSLIE 106

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKP--VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
           ++   +E K+  EE    M+++K     +E+I  E     K++        L  E L+ 
Sbjct: 107 RKT--QEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNLINELLRA 163


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 36.2 bits (84), Expect = 0.036
 Identities = 24/123 (19%), Positives = 61/123 (49%), Gaps = 7/123 (5%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKD------KEKKVDKEKEKGKKEEKEKKEEKE 184
           K+    KE E+I ++  K+ ++K++    +      +E+   +++ + ++EE +++EE+ 
Sbjct: 27  KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86

Query: 185 FKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
            + +E  +   EK+   E +  E+EK    ++ +L++ + +   +    +    E   KL
Sbjct: 87  VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKL 146

Query: 244 TFK 246
             K
Sbjct: 147 LLK 149


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 35.8 bits (83), Expect = 0.037
 Identities = 16/89 (17%), Positives = 41/89 (46%), Gaps = 2/89 (2%)

Query: 170 EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
           E G+ E K++K++   K K  K+K  +  +  + D     +   ++++ + ++  KKKK+
Sbjct: 4   ENGENEVKQQKQQN--KQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61

Query: 230 KERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
           K++   +K +       +   +   +   
Sbjct: 62  KKKKKKKKNLGEAYDLAYDLPVVWSSAAF 90



 Score = 35.8 bits (83), Expect = 0.040
 Identities = 18/86 (20%), Positives = 40/86 (46%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           + + K  K++ K+K  KKK+KK K  + + +    + + + +E   K+  +KK++K+ K 
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKD 213
           K+        +  +          +D
Sbjct: 67  KKKNLGEAYDLAYDLPVVWSSAAFQD 92



 Score = 34.7 bits (80), Expect = 0.095
 Identities = 29/161 (18%), Positives = 62/161 (38%), Gaps = 16/161 (9%)

Query: 71  DKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE-IKKDKSIKK 129
           +    ++K  +K+K + K+ + SK    +                  L E I +++  + 
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDD--------------DAFLAELISENQEAEN 52

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
            +   KKK+K+K KKKK+  +   +             +     ++      E+E+K  +
Sbjct: 53  KQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWP-EQEWKQTQ 111

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
               PV K  K+ +    E +    +   +    +K++ EK
Sbjct: 112 PPTIPVSKQFKDGEYPVGEIQEYPGENSSRTSSEEKRELEK 152



 Score = 34.3 bits (79), Expect = 0.12
 Identities = 17/88 (19%), Positives = 39/88 (44%), Gaps = 3/88 (3%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK---IKNKEKDKEKKVDKEKEKGKKEEKE 178
           +  ++  K +K   K++  K K KK KKD D       +   + ++ + ++   KK++K+
Sbjct: 4   ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
           KK++K+   +         +       +
Sbjct: 64  KKKKKKNLGEAYDLAYDLPVVWSSAAFQ 91



 Score = 33.1 bits (76), Expect = 0.28
 Identities = 17/89 (19%), Positives = 40/89 (44%), Gaps = 3/89 (3%)

Query: 150 DKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE---EKEFKMKEDKEKPVEKIRKEEKDSE 206
           + +  + K+++K+K   K+ +K KK+  +      E   + +E + K   K +K++K  +
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66

Query: 207 KEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
           K+K   +         +       + +SH
Sbjct: 67  KKKNLGEAYDLAYDLPVVWSSAAFQDNSH 95



 Score = 31.6 bits (72), Expect = 0.79
 Identities = 17/94 (18%), Positives = 39/94 (41%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
                 +  +K+ K+ +K+K  +K  KK  K   D      +   +  + E ++  K++K
Sbjct: 1   AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
           +KK++K+ K   +       +      +  +  S
Sbjct: 61  KKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNS 94


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 34.3 bits (79), Expect = 0.039
 Identities = 24/68 (35%), Positives = 41/68 (60%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           KK K  +K  K   K++K+K KKKK KK K   K +EK+  K  ++  ++ ++ E++K+E
Sbjct: 75  KKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQE 134

Query: 182 EKEFKMKE 189
           E    M++
Sbjct: 135 EPVEIMEK 142



 Score = 30.4 bits (69), Expect = 0.76
 Identities = 28/74 (37%), Positives = 45/74 (60%), Gaps = 1/74 (1%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           + +++K  K ++KT KK+ K + KK+K KK K K K   K +EK+  K  E+   EE+E 
Sbjct: 70  QKREEKKRKDEEKTAKKRAKRQKKKQK-KKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128

Query: 180 KEEKEFKMKEDKEK 193
           +E+K+ +  E  EK
Sbjct: 129 EEDKQEEPVEIMEK 142



 Score = 30.4 bits (69), Expect = 0.79
 Identities = 25/80 (31%), Positives = 51/80 (63%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
            +D+  ++ ++  K+K++EK  KK+ K+ K K K K+K K KK +K++EK   +  E+  
Sbjct: 63  TEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESS 122

Query: 182 EKEFKMKEDKEKPVEKIRKE 201
           ++E + +EDK++   +I ++
Sbjct: 123 DEEEEGEEDKQEEPVEIMEK 142



 Score = 29.6 bits (67), Expect = 1.6
 Identities = 21/64 (32%), Positives = 38/64 (59%)

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
           E  ++K +KE + +E ++K  EK RK+E+ + K++  + K K+ KK+K K KK  K+   
Sbjct: 53  ELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEK 112

Query: 235 HEKE 238
              +
Sbjct: 113 EGSK 116



 Score = 29.2 bits (66), Expect = 2.3
 Identities = 25/83 (30%), Positives = 48/83 (57%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           +K KK  + +E ++ +++K +KD++K   K   ++KK  K+K+K K ++  KKEEKE   
Sbjct: 57  EKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116

Query: 188 KEDKEKPVEKIRKEEKDSEKEKK 210
             ++    E+  +E+K  E  + 
Sbjct: 117 SSEESSDEEEEGEEDKQEEPVEI 139



 Score = 27.7 bits (62), Expect = 7.2
 Identities = 23/83 (27%), Positives = 47/83 (56%)

Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
           EK K++  D+E ++ ++E+K K EEK  K +  ++K  +K +K++K  +  KK + +  K
Sbjct: 57  EKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116

Query: 218 LKKEKLKKKKKEKERSSHEKEVI 240
             +E   ++++ +E    E   I
Sbjct: 117 SSEESSDEEEEGEEDKQEEPVEI 139


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
          associated protein 2 has a highly conserved sequence
          throughout evolution including a conserved polyproline
          region and several SH2/SH3 binding sites. It occurs as
          a single copy gene with a four-exon organisation and is
          located on chromosome 12. It encodes a ubiquitously
          expressed protein and binds to DAZ and DAZL1 through
          DAZ repeats.
          Length = 136

 Score = 34.1 bits (78), Expect = 0.042
 Identities = 19/60 (31%), Positives = 26/60 (43%), Gaps = 10/60 (16%)

Query: 11 VVRQSAPMFPFQFSPFP--SVAGP-------GLIPRLPPMLQHPLIPQPLVHPPGTPPPS 61
          VV   A M P   +P+P  S+  P       G     PPM  +P+   P V+PPG+    
Sbjct: 17 VVPPQAQM-PQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPPGSTVLV 75


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 35.5 bits (82), Expect = 0.047
 Identities = 26/123 (21%), Positives = 37/123 (30%), Gaps = 17/123 (13%)

Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKD------KIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           S+  +K   KK +K    K   + D+D      +++  EK K  K  KE  +       K
Sbjct: 7   SLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPK 66

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
                   K  K         EEK     K  K  +         K    +     E E 
Sbjct: 67  --------KSSKPTESSAASSEEK---PAKPRKSAESTRSSHPKSKAPSTESEEEEEPEE 115

Query: 240 IPK 242
            P 
Sbjct: 116 TPD 118



 Score = 29.4 bits (66), Expect = 3.8
 Identities = 14/77 (18%), Positives = 24/77 (31%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           K K  +K K+          K   K  E      +EK  K  K  +  +    K      
Sbjct: 46  KAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPST 105

Query: 195 VEKIRKEEKDSEKEKKS 211
             +  +E +++     S
Sbjct: 106 ESEEEEEPEETPDPIAS 122



 Score = 29.4 bits (66), Expect = 4.2
 Identities = 16/82 (19%), Positives = 30/82 (36%), Gaps = 3/82 (3%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKK--EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
           E L E+++ +  K  KK  +       +  KK  K  +    + E+   K     +    
Sbjct: 36  EFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRS 95

Query: 174 KEEKEKKEE-KEFKMKEDKEKP 194
              K K    +  + +E +E P
Sbjct: 96  SHPKSKAPSTESEEEEEPEETP 117


>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
           type) [DNA replication, recombination, and repair].
          Length = 1444

 Score = 35.8 bits (83), Expect = 0.047
 Identities = 27/141 (19%), Positives = 58/141 (41%), Gaps = 9/141 (6%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           KS+    K   K     I++  +  + D  KNK  + +KK+   +  G  +   +  E  
Sbjct: 113 KSLLNKLKLKVKGNNILIEQVLNNPEFDHFKNKSPELQKKL---QSFGFPQLLIEF-EVN 168

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE-RSSHEKEVIPKL 243
              +E + +  E+   EE   +  +++ + +KKLK E  K +K +        +++    
Sbjct: 169 DISEEQEFEKFEEAINEE-VEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGRKIKSTE 227

Query: 244 TFKFGTDMEEKTKRESSPKIV 264
             K    + +  + E+  K+ 
Sbjct: 228 EIK---PLIKINEEETRVKVE 245


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 33.8 bits (78), Expect = 0.050
 Identities = 20/73 (27%), Positives = 38/73 (52%), Gaps = 1/73 (1%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE-EKEKK 180
           ++     K+      KEK K +KK+++K  +  K  +   E+K + E EK +KE E+ + 
Sbjct: 4   EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63

Query: 181 EEKEFKMKEDKEK 193
           E    ++K + +K
Sbjct: 64  ELARRELKAEAKK 76



 Score = 29.9 bits (68), Expect = 1.2
 Identities = 21/86 (24%), Positives = 42/86 (48%), Gaps = 3/86 (3%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
           E E+ K   DK+    I  ++   EKK +++K + +K  K   EEK    + + EK  ++
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKA---EYELEKLEKE 57

Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKL 223
           + + E +  + +   +  K L ++ L
Sbjct: 58  LEELEAELARRELKAEAKKMLSEKGL 83



 Score = 28.4 bits (64), Expect = 3.4
 Identities = 18/78 (23%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
           + + ++   +K+VDK   K K + ++K+EEK+ + ++  +   E+  + E + + EK+ +
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELE-KLEKELE 59

Query: 213 DKDKKLKKEKLKKKKKEK 230
           + + +L + +LK + K+ 
Sbjct: 60  ELEAELARRELKAEAKKM 77


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 35.8 bits (83), Expect = 0.053
 Identities = 20/129 (15%), Positives = 61/129 (47%), Gaps = 3/129 (2%)

Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
              E L E++ + S  +++    +KE   +  +  + ++ K   +E+    +   E+ + 
Sbjct: 264 ELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEA 323

Query: 173 KKEEKEKKEE---KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
           + EE E K +   +E    E+K + +++  +  +   +E +++ ++ + + E+L+++ + 
Sbjct: 324 QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLET 383

Query: 230 KERSSHEKE 238
                 + E
Sbjct: 384 LRSKVAQLE 392



 Score = 34.6 bits (80), Expect = 0.13
 Identities = 25/131 (19%), Positives = 67/131 (51%), Gaps = 7/131 (5%)

Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKI-------KKKKDKKDKDKIKNKEKDKEKKVDK 167
           + ++  +++ K I +++  + +++ E++       + K D+  ++  + +EK +E K + 
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
           E  + + EE E + E+     E+ E+ +E +R +    E +  S + + +  + +L++ +
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413

Query: 228 KEKERSSHEKE 238
             +ER   E E
Sbjct: 414 DRRERLQQEIE 424



 Score = 30.4 bits (69), Expect = 2.6
 Identities = 22/125 (17%), Positives = 66/125 (52%), Gaps = 11/125 (8%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEK-IKKKKDKKDKDKIKNKEKDKEKKVDK-EKEKGKKE 175
           L E+++   +K  ++  +K E+ K +K +  + +   +  + ++  +++++ ++E  + E
Sbjct: 195 LNELER--QLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAE 252

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK--KEKLKKKKKEKERS 233
           E+ ++   E +  E+K   +E++R E   SE E++ ++  K+L     ++ + +++K+  
Sbjct: 253 EELEELTAELQELEEK---LEELRLE--VSELEEEIEELQKELYALANEISRLEQQKQIL 307

Query: 234 SHEKE 238
                
Sbjct: 308 RERLA 312



 Score = 28.5 bits (64), Expect = 8.9
 Identities = 17/109 (15%), Positives = 57/109 (52%), Gaps = 4/109 (3%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           ++ +  + + +K ++ K+ +K + ++ E  +   + +  +EE E+ +E+  + +E+ E+ 
Sbjct: 200 RQLKSLERQAEKAERYKE-LKAELRELELALLVLRLEELREELEELQEELKEAEEELEEL 258

Query: 195 VEKIRKEEKDSE---KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
             ++++ E+  E    E    +++ +  +++L     E  R   +K+++
Sbjct: 259 TAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQIL 307


>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 320

 Score = 35.1 bits (80), Expect = 0.055
 Identities = 21/110 (19%), Positives = 43/110 (39%), Gaps = 3/110 (2%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
                DK  K+          E+ K+   +K   +    + D    + + + KG  EE+E
Sbjct: 93  EIAVLDKLAKELLL---GLSDEQGKQLIAEKPAFQDVEGKFDARAYLKRLRIKGLTEEQE 149

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
            +  ++    + + K V +++      + E K+K+    LKK   + K  
Sbjct: 150 AEALRDKLQNKQQGKKVTEVQARHILVKAEAKAKEALALLKKGVREAKAD 199


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 35.5 bits (82), Expect = 0.056
 Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 6/81 (7%)

Query: 121 IKKDKSIKK--DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
           +KK  S  K    +  K++ K+  K+KKD    DK K  + D++        + K  + E
Sbjct: 183 VKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLE 242

Query: 179 KKEEKE----FKMKEDKEKPV 195
            K E +    + +K+D +K V
Sbjct: 243 SKLEAQSKELWSLKDDLKKHV 263



 Score = 32.8 bits (75), Expect = 0.43
 Identities = 22/80 (27%), Positives = 41/80 (51%), Gaps = 4/80 (5%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           KK   + K ++ ++ K+  K+  K KK+     ++K  K K D++        ++K S+ 
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASG--DDKSKKAKTDRDVSTSTAASQKKSSDL 241

Query: 208 EKKSKDKDKKL--KKEKLKK 225
           E K + + K+L   K+ LKK
Sbjct: 242 ESKLEAQSKELWSLKDDLKK 261


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 35.3 bits (82), Expect = 0.060
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 180 KEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           KEE   +++E KEK P    +K E+   +++K K K KK  K++ KK +K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 33.0 bits (76), Expect = 0.33
 Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 186 KMKEDKEKPVEKIR----KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           ++KE+  K +E+I+    K  K   +EKK + + KK K++K  KK+K+K R 
Sbjct: 363 ELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.9 bits (73), Expect = 0.61
 Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 4/45 (8%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 164
           EIK+    K  K   KK+E++K +K+K KK + K   K K K +K
Sbjct: 374 EIKE----KYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.9 bits (73), Expect = 0.75
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
            K  E+IK+K  K  K K + K+  K KK  K K+KGKK +K+ ++
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.5 bits (72), Expect = 0.88
 Identities = 20/65 (30%), Positives = 38/65 (58%), Gaps = 2/65 (3%)

Query: 110 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK 169
           V   S   +G+  K++  K+ ++  +K  K   KK+++KK + + K K++ K+ K  K K
Sbjct: 352 VDYFSGRYIGDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGK--KRK 409

Query: 170 EKGKK 174
           +KG+K
Sbjct: 410 KKGRK 414



 Score = 30.3 bits (69), Expect = 2.1
 Identities = 16/50 (32%), Positives = 33/50 (66%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           K++  ++I++ K+K  K   K +E+ K +K  K+K++ KK +K KK+ ++
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 28.8 bits (65), Expect = 6.6
 Identities = 16/49 (32%), Positives = 32/49 (65%)

Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           ++  K++++ KEK  K  K+K+EEK+ + ++ K+K  +K +K +K   K
Sbjct: 366 EELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 35.0 bits (81), Expect = 0.061
 Identities = 22/91 (24%), Positives = 44/91 (48%), Gaps = 7/91 (7%)

Query: 139 KEKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
            EK+   K+  +   K  ++ +  E   ++ + K K  EK +++EK  +  E  +  +E+
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNE---EERELKKKFTEKIREKEKRLEELE--QNLIEE 343

Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
            ++     E E + K +D + + EKLK  K 
Sbjct: 344 RKELNSKLE-EIQKKLEDLEKRLEKLKSNKS 373



 Score = 34.6 bits (80), Expect = 0.092
 Identities = 23/93 (24%), Positives = 40/93 (43%), Gaps = 13/93 (13%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
              K+      K+ H+ +  E           ++ + K+K  EK  +KEK   + E+   
Sbjct: 293 SGLKNSGEPSLKEIHEARLNE-----------EERELKKKFTEKIREKEKRLEELEQNLI 341

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
           +E KE  +    E+  +K+   EK  EK K +K
Sbjct: 342 EERKE--LNSKLEEIQKKLEDLEKRLEKLKSNK 372



 Score = 31.1 bits (71), Expect = 1.2
 Identities = 17/68 (25%), Positives = 34/68 (50%)

Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
            EK  G K   E   ++  + + ++E+   K +  EK  EKEK+ ++ ++ L +E+ +  
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348

Query: 227 KKEKERSS 234
            K +E   
Sbjct: 349 SKLEEIQK 356



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 18/70 (25%), Positives = 37/70 (52%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           +   +  + ++++  KKK  EKI++K+ + ++ +    E+ KE     E+ + K E+ EK
Sbjct: 304 KEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEK 363

Query: 180 KEEKEFKMKE 189
           + EK    K 
Sbjct: 364 RLEKLKSNKS 373



 Score = 30.0 bits (68), Expect = 2.7
 Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 5/85 (5%)

Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK-EDKEKPVEKIR-KEEKDSEKEKKSKDKD 215
           E  + +K+   K  G+   KE  E +  + + E K+K  EKIR KE++  E E+   ++ 
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEER 344

Query: 216 KKLK---KEKLKKKKKEKERSSHEK 237
           K+L    +E  KK +  ++R    K
Sbjct: 345 KELNSKLEEIQKKLEDLEKRLEKLK 369


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 35.5 bits (82), Expect = 0.062
 Identities = 20/107 (18%), Positives = 42/107 (39%), Gaps = 3/107 (2%)

Query: 127 IKKDKKTHKKKEKEKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE-KE 184
           I  D + H+   +++ +    + D K ++ N   + E  +   +   + EE   K +  E
Sbjct: 835 IIADHQGHQAIIEQRTQDSGIRNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSE 894

Query: 185 FKMKEDKE-KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            +     E         EEK +++     D  ++L K   + + K K
Sbjct: 895 LQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941



 Score = 33.6 bits (77), Expect = 0.28
 Identities = 15/104 (14%), Positives = 37/104 (35%), Gaps = 6/104 (5%)

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
            + +  +D  I+ D K H+        +      ++ I+ +E   + +  + +   K E 
Sbjct: 845 IIEQRTQDSGIRNDVK-HQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEA 903

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
             +  +   +    +  P       +   E  K++K+   K K 
Sbjct: 904 LSQNNKYNEEKSAQERMP-----GADSPEELMKRAKEYQDKHKG 942


>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
           [Translation, ribosomal structure and biogenesis].
          Length = 335

 Score = 34.9 bits (81), Expect = 0.062
 Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 4/90 (4%)

Query: 135 KKKEKEKIKKKKDKKDK-DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
            KK    I+   D     +++K K   K+  +    +K  K    + E KE     ++ K
Sbjct: 10  AKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLE-ERKEVGALINELK 68

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
             +++     +   E ++    ++L  EK+
Sbjct: 69  --KEVEDAITELTPELEAAGLWERLAFEKI 96



 Score = 29.9 bits (68), Expect = 2.8
 Identities = 17/87 (19%), Positives = 30/87 (34%), Gaps = 2/87 (2%)

Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
            KK     +   +  K  E+   K K  GKK       +K  K+   +E+        E 
Sbjct: 10  AKKGIAAIELASDLLKALEEL--KVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINEL 67

Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEK 230
             E E    +   +L+   L ++   +
Sbjct: 68  KKEVEDAITELTPELEAAGLWERLAFE 94


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 33.7 bits (78), Expect = 0.067
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV 165
           DKK  K  EK + K +K K   +K K K+++ + ++
Sbjct: 64  DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPRL 99



 Score = 28.3 bits (64), Expect = 5.1
 Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            +KE K+ +K +K K EK K K ++K+    E +
Sbjct: 64  DKKELKAWEKAEK-KAEKAKAKAEKKKAKKEEPK 96



 Score = 28.3 bits (64), Expect = 5.7
 Identities = 11/33 (33%), Positives = 16/33 (48%)

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
           +KE    E+ EKK EK     E K+   E+ + 
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 27.5 bits (62), Expect = 9.5
 Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 11/75 (14%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEE-KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
           K K K  E ++    E    E K+ KE  +   + + + K      K EK   K +K+K 
Sbjct: 34  KKKGKKGELEITDLNE----EYKDLKESLEAALLDKKELK---AWEKAEK---KAEKAKA 83

Query: 214 KDKKLKKEKLKKKKK 228
           K +K K +K + K +
Sbjct: 84  KAEKKKAKKEEPKPR 98


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 34.3 bits (79), Expect = 0.070
 Identities = 23/80 (28%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 101 GVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 160
           G         ++ S  +  E +    ++K+ +  ++++KEK KKK+ KK+K     ++KD
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKK----EKKD 172

Query: 161 KEKKVDKEKEKGKKEEKEKK 180
           K++K+ + K   KK++K+KK
Sbjct: 173 KKEKMVEPKGSKKKKKKKKK 192



 Score = 34.3 bits (79), Expect = 0.071
 Identities = 23/63 (36%), Positives = 36/63 (57%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
           + +  EK+   + EK  + E+E+K+EK+ K +  KEK  +K +KE+    K  K K K K
Sbjct: 131 ESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190

Query: 217 KLK 219
           K K
Sbjct: 191 KKK 193



 Score = 33.5 bits (77), Expect = 0.12
 Identities = 21/73 (28%), Positives = 42/73 (57%)

Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
             D        + +  ++E   K EKE +++E+++K  +K ++ +K+ +++K  K+K  +
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179

Query: 218 LKKEKLKKKKKEK 230
            K  K KKKKK+K
Sbjct: 180 PKGSKKKKKKKKK 192



 Score = 31.6 bits (72), Expect = 0.45
 Identities = 21/71 (29%), Positives = 39/71 (54%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
            D    +     +  EKE   K + + + ++ + KEK K+K+V KEK++ K ++++  E 
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180

Query: 183 KEFKMKEDKEK 193
           K  K K+ K+K
Sbjct: 181 KGSKKKKKKKK 191



 Score = 30.1 bits (68), Expect = 1.5
 Identities = 21/75 (28%), Positives = 40/75 (53%), Gaps = 1/75 (1%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
            + D    +     +  +K    K +++ + ++++K K+K K KE  KEKK  K+K++  
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEK-KEKKKKKEVKKEKKEKKDKKEKM 177

Query: 174 KEEKEKKEEKEFKMK 188
            E K  K++K+ K K
Sbjct: 178 VEPKGSKKKKKKKKK 192



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 21/59 (35%), Positives = 39/59 (66%)

Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
           +  E   +++K+  +++++K  KKK+KE  K+KK+KKDK +   + K  +KK  K+K+K
Sbjct: 135 SEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193



 Score = 29.3 bits (66), Expect = 3.2
 Identities = 21/66 (31%), Positives = 39/66 (59%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
            +   + +  +K+      KE + E++  KEK+K K+ +KEKKE+K+ K K  + K  +K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185

Query: 198 IRKEEK 203
            +K++K
Sbjct: 186 KKKKKK 191


>gnl|CDD|215591 PLN03124, PLN03124, poly [ADP-ribose] polymerase; Provisional.
          Length = 643

 Score = 35.2 bits (81), Expect = 0.072
 Identities = 19/111 (17%), Positives = 49/111 (44%), Gaps = 2/111 (1%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE--KDKEKKVDKEKEKGKKEEKEKKE 181
           D +I +D KT  K   +    +K ++++    + E    K   +D+ K    +E +E   
Sbjct: 33  DDAIAEDAKTASKSGTKSSAGRKKRRERQDDGDDEPVSPKRIAIDEVKGMTVRELREAAS 92

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           E+       K+  +E++    +   K   +    +  K++   ++++++E+
Sbjct: 93  ERGLATTGRKKDLLERLCAALESDVKVGSANGTGEDEKEKGGDEEREKEEK 143



 Score = 31.0 bits (70), Expect = 1.4
 Identities = 27/153 (17%), Positives = 56/153 (36%), Gaps = 7/153 (4%)

Query: 79  MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 138
           M++K K        +K      G+   +   +    +ED     K  +        +KK 
Sbjct: 1   MANKLKVDELRAALAKRGLDTTGLKAALVRRLDDAIAEDAKTASKSGT---KSSAGRKKR 57

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE---KKEEKEFKMKEDKEKPV 195
           +E+     D+    K    ++ K   V + +E   +       +K++   ++    E  V
Sbjct: 58  RERQDDGDDEPVSPKRIAIDEVKGMTVRELREAASERGLATTGRKKDLLERLCAALESDV 117

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
            K+       E EK+    +++ K+EK+    K
Sbjct: 118 -KVGSANGTGEDEKEKGGDEEREKEEKIVTATK 149


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 33.9 bits (78), Expect = 0.072
 Identities = 17/81 (20%), Positives = 37/81 (45%)

Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
           +KD E   D  +E+ + +++    E++ K+   K K   +  + +     E  +++   K
Sbjct: 33  KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVK 92

Query: 218 LKKEKLKKKKKEKERSSHEKE 238
             K++L   + E+  SS E  
Sbjct: 93  ATKKQLFSSEYEQTSSSSEST 113



 Score = 32.0 bits (73), Expect = 0.25
 Identities = 16/94 (17%), Positives = 39/94 (41%), Gaps = 1/94 (1%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV-DKEKEKGKKEEKE 178
             KKD  I  D    + +  +++   +++K     K+KEK +++++ ++   +   E   
Sbjct: 31  YEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
            K  K+     + E+         ++  K+  S 
Sbjct: 91  VKATKKQLFSSEYEQTSSSSESTSEEETKKTSSI 124


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 35.4 bits (82), Expect = 0.074
 Identities = 29/133 (21%), Positives = 53/133 (39%), Gaps = 10/133 (7%)

Query: 108  NIVKVTSSEDLGE-IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD 166
            + +K+ + E   E I+  KS+  D       E  + K +   + K  + N +     +  
Sbjct: 895  SSLKLVNLELESEIIELKKSLSSDLI-----ENLEFKTELIARLKKLLNNIDL----EEG 945

Query: 167  KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
               E  K  E  K  E E K+KE  E+  + ++K      +  K+  + K  KKE  +  
Sbjct: 946  PSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELS 1005

Query: 227  KKEKERSSHEKEV 239
            K+        K++
Sbjct: 1006 KQYGALQESTKQL 1018



 Score = 30.8 bits (70), Expect = 1.6
 Identities = 23/185 (12%), Positives = 59/185 (31%), Gaps = 19/185 (10%)

Query: 74   SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIK-KDKSIKKDKK 132
                     E    K +  +++K +   +       ++     +L ++   +  +K+  +
Sbjct: 912  KKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSE 971

Query: 133  THK---KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
             ++   KK    +++      + K   KE                ++    +E     K+
Sbjct: 972  EYEDLLKKSTILVREGNKANSELKNFKKEL-----------AELSKQYGALQEST---KQ 1017

Query: 190  DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 249
             KE PVE     +  S+       +   LK  +  K     E +  +         +  +
Sbjct: 1018 LKELPVEVAEL-QSASKIISSESTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENS 1076

Query: 250  DMEEK 254
             +++K
Sbjct: 1077 LLDDK 1081



 Score = 30.0 bits (68), Expect = 2.9
 Identities = 28/142 (19%), Positives = 50/142 (35%), Gaps = 15/142 (10%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           KK+    +  +  +  E++  + K D K    +K    + E ++    E  K    +  E
Sbjct: 865 KKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEI---IELKKSLSSDLIE 921

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 241
             EFK +        K      D E E  S +  K  +  KL + + + + +S E E + 
Sbjct: 922 NLEFKTELIAR---LKKLLNNIDLE-EGPSIEYVKLPELNKLHEVESKLKETSEEYEDLL 977

Query: 242 KLTFKFGTDMEEKTKRESSPKI 263
           K              RE +   
Sbjct: 978 K--------KSTILVREGNKAN 991


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 34.1 bits (79), Expect = 0.079
 Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 3/58 (5%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKN---KEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           KD +   KKE   I K  +K D   I      EK+K+K + KE++K  KEEK+K EE 
Sbjct: 66  KDFRKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII.  Eukaryotic DNA topoisomerase
           II, GyrB, ParE.
          Length = 594

 Score = 34.8 bits (81), Expect = 0.079
 Identities = 23/106 (21%), Positives = 38/106 (35%), Gaps = 20/106 (18%)

Query: 139 KEKIKKKKDKKDKD----------------KIKNKEKD---KEKKVDKEKEKGKKEEKEK 179
            E  KKKK  K+K+                KI   + +   KEK    E   G ++   +
Sbjct: 261 NEYAKKKKKLKEKNIKGEDVREGLTAFISVKIPEPQFEGQTKEKLGTSEVRFGVEKIVSE 320

Query: 180 KEEKEFKM-KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
                 +    +  K VEK+    K     KK+++  +K K   + 
Sbjct: 321 CLLSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSSIS 366


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 35.1 bits (80), Expect = 0.082
 Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 2/95 (2%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK--MKEDKEKPV 195
           ++E   +K+ + +   IK  E  +  K   + +  K++  E++EEK+ K  M  +K+K +
Sbjct: 497 DEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKL 556

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            K  K     ++E+    K KK +  K KK   +K
Sbjct: 557 YKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591



 Score = 28.9 bits (64), Expect = 6.1
 Identities = 19/92 (20%), Positives = 49/92 (53%), Gaps = 2/92 (2%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE--KDSEKEKKSKDK 214
            E+D   + +  + +    E++  +++E + +++ E   + I+  E  +  +   KSK+K
Sbjct: 472 SEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNK 531

Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK 246
            +K+ +E+ +KK K    S+ +K++  K+ + 
Sbjct: 532 KRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYS 563


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 34.0 bits (78), Expect = 0.094
 Identities = 15/78 (19%), Positives = 36/78 (46%), Gaps = 2/78 (2%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD--KIKNKEKDKEKKVD 166
           IV +     + E+ ++ + K  K    + +   +K +  K       ++  +++  K +D
Sbjct: 115 IVVMRVMSIVEEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLD 174

Query: 167 KEKEKGKKEEKEKKEEKE 184
           K  E  ++ +KE  ++KE
Sbjct: 175 KYNELREQVQKESSKKKE 192



 Score = 32.5 bits (74), Expect = 0.27
 Identities = 15/61 (24%), Positives = 29/61 (47%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
           +E  K   K DK + D    K + ++ ++  E  +  +EE  K  +K  +++E  +K   
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESS 188

Query: 197 K 197
           K
Sbjct: 189 K 189



 Score = 28.6 bits (64), Expect = 4.8
 Identities = 17/64 (26%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
           EE  KK  K  KM+ D      ++RK +   E  +K++++  KL  +K  + +++ ++ S
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKL-LDKYNELREQVQKES 187

Query: 235 HEKE 238
            +K+
Sbjct: 188 SKKK 191


>gnl|CDD|116547 pfam07937, DUF1686, Protein of unknown function (DUF1686).  The
           members of this family are all hypothetical proteins of
           unknown function expressed by the eukaryotic parasite
           Encephalitozoon cuniculi GB-M1. The region in question
           is approximately 250 amino acids long.
          Length = 182

 Score = 33.6 bits (77), Expect = 0.096
 Identities = 18/73 (24%), Positives = 21/73 (28%), Gaps = 21/73 (28%)

Query: 295 ACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF---CPKCSKVDEGSRKEKKKR 351
           AC       L M G  G       VCVGLV        F       S V           
Sbjct: 108 AC-AAGSVKLFMPGSYG---RSSLVCVGLVV-------FAALLLLVSYVKGA-------G 149

Query: 352 GRKKKTHQDIIMS 364
           G K  +   ++ S
Sbjct: 150 GEKVSSKGIVVGS 162


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 33.6 bits (77), Expect = 0.11
 Identities = 20/52 (38%), Positives = 21/52 (40%), Gaps = 4/52 (7%)

Query: 9   QAVVRQSAPM-FPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPP 59
           Q VV Q   M  P Q S  P+       P LP   Q P  PQPL  P    P
Sbjct: 65  QPVVPQQPLMPVPGQHSMTPT---QHHQPNLPQPAQQPFQPQPLQPPQPQQP 113



 Score = 28.6 bits (64), Expect = 4.5
 Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 5/50 (10%)

Query: 11 VVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPP 60
          V +Q  P    Q    P    P L  + P + Q PL+P P      TP  
Sbjct: 42 VSQQHPPTHTLQ----PHHHIPVLPAQQPVVPQQPLMPVP-GQHSMTPTQ 86



 Score = 28.2 bits (63), Expect = 6.5
 Identities = 16/55 (29%), Positives = 20/55 (36%), Gaps = 8/55 (14%)

Query: 11  VVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHP----PGTPPPS 61
           V+    P+ P Q    P +  PG     P     P +PQP   P    P  PP  
Sbjct: 60  VLPAQQPVVPQQ----PLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQP 110



 Score = 27.4 bits (61), Expect = 9.6
 Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 39  PPMLQHPLIPQPLVHPPGTPPP 60
           PP  Q P+ PQP VHP    PP
Sbjct: 107 PPQPQQPMQPQPPVHPIPPLPP 128


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 34.3 bits (79), Expect = 0.11
 Identities = 33/116 (28%), Positives = 57/116 (49%), Gaps = 3/116 (2%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
            SE L E   D   ++       +  E   +  +K  + K K K +  ++K  KE E+  
Sbjct: 237 MSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREA 296

Query: 174 KEEKE--KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
           KEEK+  KK  +  ++KE  ++ V +  K     ++++K + + KKLK+ KL K K
Sbjct: 297 KEEKQLKKKLAQLARLKEIAKE-VAQKEKARARKKEQRKERGEKKKLKRRKLGKHK 351



 Score = 34.3 bits (79), Expect = 0.11
 Identities = 22/94 (23%), Positives = 44/94 (46%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
            +++ +   +KK +K   +  +  E  +    + + +  ++ + E   E      E   K
Sbjct: 212 KRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINK 271

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
           PV   RK +    KEK+ K+ +++ K+EK  KKK
Sbjct: 272 PVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305



 Score = 31.6 bits (72), Expect = 0.92
 Identities = 21/130 (16%), Positives = 50/130 (38%), Gaps = 17/130 (13%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKK---------DKKDKDKIKNKEKDKEKKVDKEKE 170
           E    K++K    +     K     +            +D  ++  +E +KE K +K+++
Sbjct: 155 EQVAKKTVKTAPISLAANGKPVPAVEVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQ 214

Query: 171 KGKKEEKEKKEE--------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
           + ++ E++K E+         E      +E   +   + + +S  E    + +   K  +
Sbjct: 215 ELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVR 274

Query: 223 LKKKKKEKER 232
            K+K K +  
Sbjct: 275 PKRKTKAQRN 284



 Score = 30.8 bits (70), Expect = 1.3
 Identities = 20/105 (19%), Positives = 43/105 (40%), Gaps = 7/105 (6%)

Query: 136 KKEKEKIKK-------KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
           +KE +  KK       ++ K +K   +    D+  +   E+     EE+   E      +
Sbjct: 204 EKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFE 263

Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
            + E   + +R + K   +  K K + +  ++ K +K+ K+K   
Sbjct: 264 SEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308



 Score = 29.7 bits (67), Expect = 3.6
 Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 1/91 (1%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
           E      + + +  +K    ++  + + +KEK + + E +  KEEK+ K K  +   +++
Sbjct: 256 ESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREA-KEEKQLKKKLAQLARLKE 314

Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           I KE    EK +  K + +K + EK K K++
Sbjct: 315 IAKEVAQKEKARARKKEQRKERGEKKKLKRR 345


>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
           [Function unknown].
          Length = 827

 Score = 34.7 bits (80), Expect = 0.12
 Identities = 31/128 (24%), Positives = 45/128 (35%), Gaps = 18/128 (14%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
            +  K+  K  KK EK     +   K   K   K   K  K+       +K+E E   E 
Sbjct: 251 QELYKERDKILKKLEKA--LNELLNKLLKKSHLKTNKKSGKLT----PSRKKEFEILPEY 304

Query: 184 EFKMKEDKEKPVEKIRKEE-----KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
                 DK++P  +I+K       K  +       K  KL  E    +K   E +  +  
Sbjct: 305 ----VPDKKRPKHRIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIENARKSLYENTPTKSG 360

Query: 239 VIPKLTFK 246
            I   TFK
Sbjct: 361 FI---TFK 365


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 34.3 bits (79), Expect = 0.12
 Identities = 35/197 (17%), Positives = 88/197 (44%), Gaps = 21/197 (10%)

Query: 76  KEKMSSKEKTSPKEKEYSKVKDVELGV---TPMVPNIVKVT----SSEDLGEIKKDKSIK 128
           K K+   E+  P+   +  V+ ++  V    P+   + KV       E  G  ++++   
Sbjct: 358 KPKLEKVERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREI 417

Query: 129 KDKKTHKKKEKEKIKKKK--DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
              +   KK +E +++ +  + + K +++  +++ EK   + +   ++   + ++++E +
Sbjct: 418 TVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIR 477

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK 246
            ++       +I + EK+ E++KK  ++ ++   E  K +K E        +V+ KLT  
Sbjct: 478 ARD------RRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLT-- 529

Query: 247 FGTDMEEKTKRESSPKI 263
               +E   + E    I
Sbjct: 530 ----LEAIEEAEEEYGI 542


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 34.3 bits (79), Expect = 0.13
 Identities = 14/74 (18%), Positives = 29/74 (39%), Gaps = 5/74 (6%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK-----EEKEFKMKE 189
                ++  K++ KK++DKIK       K++ KE+      +   K     ++    M  
Sbjct: 528 GTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNILKLRLYYQQNGKCMYT 587

Query: 190 DKEKPVEKIRKEEK 203
            KE  ++ +     
Sbjct: 588 GKEIDIDDLFDLSY 601



 Score = 30.5 bits (69), Expect = 2.1
 Identities = 17/85 (20%), Positives = 34/85 (40%), Gaps = 1/85 (1%)

Query: 90  KEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKK 149
            +  KV +  +        IV   + E+ G     ++ K+  K ++ K KE       + 
Sbjct: 499 LQARKVVNELVKKYGPPDKIVIEMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEI 558

Query: 150 DKDKI-KNKEKDKEKKVDKEKEKGK 173
            K++  +N  K+  K     ++ GK
Sbjct: 559 LKEEPTENSSKNILKLRLYYQQNGK 583



 Score = 28.9 bits (65), Expect = 6.0
 Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 9/103 (8%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE-----KGKKEEKEKKEEKEFKMKE 189
           KKK+    K +K     +    K    E   + EK      K   + ++   +K+     
Sbjct: 325 KKKKLTYKKLRKLLGLSEDAIFKGLRYEGLDNAEKAFNISLKTYHKLRKALGDKDLLDNP 384

Query: 190 DKEKPVEKIRKE----EKDSEKEKKSKDKDKKLKKEKLKKKKK 228
              K +++I K     +     +K+ +     L +E++KK  +
Sbjct: 385 KNPKDLDEIVKILTLYKDREMIKKRLELYKDVLNEEQVKKLVR 427


>gnl|CDD|235944 PRK07135, dnaE, DNA polymerase III DnaE; Validated.
          Length = 973

 Score = 34.3 bits (79), Expect = 0.13
 Identities = 35/146 (23%), Positives = 56/146 (38%), Gaps = 35/146 (23%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE--------KGKKEEKEKKEEKEFK 186
           K KE+ ++K      +  KI+N    +E+K+   ++        K    ++E    K F 
Sbjct: 123 KNKEQLELKNYYINSNDPKIENAVYVQERKLLFAEDNEYLKILNKIGNNKEENSNFKFFD 182

Query: 187 MKEDKEKPVEKIRKE--------------------EKDSEKEKKSKDKDKKL-------K 219
            ++  E   EKI K                     + D+    +S    KK+       K
Sbjct: 183 FEKWFEDIDEKILKRTNYLVENINIEFPKKEFNLPDFDNNLGLESDLFLKKILKESVINK 242

Query: 220 KEKLKKKKKEKERSSHEKEVIPKLTF 245
           K +LK     KER ++E  VI KL F
Sbjct: 243 KAELKYYPNVKERINYEYSVIKKLKF 268


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 34.2 bits (79), Expect = 0.14
 Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 15/93 (16%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
               +  +  EK++ + + K   +   E+ EK  +   K+KE         R+++ DSE+
Sbjct: 456 LPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKE---------RQDQPDSEE 506

Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
           +         L   KL       E++S E EV 
Sbjct: 507 D------LATLPTLKLGDVPDPIEKTSLETEVS 533


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 33.5 bits (77), Expect = 0.15
 Identities = 25/142 (17%), Positives = 60/142 (42%), Gaps = 14/142 (9%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKVD------- 166
           + L + K +           + E E ++ +    + +  +I+ + K  E+K+        
Sbjct: 31  KALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERE 90

Query: 167 -----KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
                 E +  K+     ++E    M+E ++   E    +E+    EK   + + +L++E
Sbjct: 91  LRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEE 150

Query: 222 KLKKKKKEKERSSHEKEVIPKL 243
             + +++ +E SS  +E+  KL
Sbjct: 151 VAEIREEGQELSSKREELKEKL 172



 Score = 32.3 bits (74), Expect = 0.35
 Identities = 19/122 (15%), Positives = 51/122 (41%), Gaps = 9/122 (7%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD--KDKIKNKEKDKEKKVDKEKEKGKKEEK 177
           E +  +  K  KK   K E E + K  +  +   + ++N+    E ++ + +E+ K+ E+
Sbjct: 23  EPRIKEIRKALKK--AKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEE 80

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           +    K+ +        ++  ++     E E       + +++ +  +K+ E  +   E+
Sbjct: 81  KLSAVKDERELRALNIEIQIAKERINSLEDEL-----AELMEEIEKLEKEIEDLKERLER 135

Query: 238 EV 239
             
Sbjct: 136 LE 137



 Score = 32.3 bits (74), Expect = 0.40
 Identities = 17/119 (14%), Positives = 51/119 (42%), Gaps = 13/119 (10%)

Query: 127 IKKDKKTHKKK-----------EKEKIKKKKDKKD--KDKIKNKEKDKEKKVDKEKEKGK 173
           ++KD+   + K           E E + K  +  +   + ++N+    E ++ + +E+ K
Sbjct: 17  LEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIK 76

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           + E++    K+ +        ++  ++     E E     ++ +  +++++  K+  ER
Sbjct: 77  RAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLER 135



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 25/115 (21%), Positives = 51/115 (44%), Gaps = 2/115 (1%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           KS+   +K   +K++ + + K+ +K   K K   E   +     E E    E +  + E 
Sbjct: 7   KSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLES 66

Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           E +   ++ K  E+     KD E+E ++ + + ++ KE++   + E      E E
Sbjct: 67  EIQEIRERIKRAEEKLSAVKD-ERELRALNIEIQIAKERINSLEDELAELMEEIE 120


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 34.0 bits (78), Expect = 0.15
 Identities = 33/118 (27%), Positives = 62/118 (52%), Gaps = 7/118 (5%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE--KEKGKKEEKEKKEEKEF 185
           K+  K   K++ EK +++K++ D DKI  +    E K+     K    KEE+ K   K  
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424

Query: 186 KMKED-KEKPVEKIRKE--EKDSEKEKKSKDKDKKLKK--EKLKKKKKEKERSSHEKE 238
           K +ED  +  ++ I+K     + E+ KK K+K+K+  +  EK+ +  K  +  +++K+
Sbjct: 425 KSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITYKKK 482



 Score = 32.5 bits (74), Expect = 0.48
 Identities = 28/120 (23%), Positives = 53/120 (44%), Gaps = 2/120 (1%)

Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK-EKKEEKEFKMKEDKEKPVEKIR 199
           K   K++ K++ + K +EK++        +  K E K      K  ++KE++ K   K+ 
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424

Query: 200 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 259
           K E+D+   +    K K     + ++ KKEKE+     E + K+   F        K++ 
Sbjct: 425 KSEEDTLDLEIKSIK-KYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITYKKKK 483



 Score = 30.2 bits (68), Expect = 2.3
 Identities = 28/101 (27%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKVDKEKEKGKK---E 175
           E+  DK I +  K   K     +K  + K+++ KI  K EK +E  +D E +  KK    
Sbjct: 385 ELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPF 444

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
            + ++ +KE + ++++ + VEK+ +  K S+     K K K
Sbjct: 445 IEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITYKKKKGK 485


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 34.1 bits (78), Expect = 0.16
 Identities = 28/152 (18%), Positives = 56/152 (36%), Gaps = 9/152 (5%)

Query: 137  KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
            KE  K ++ ++K  +   + +    +K       +  K+   KK  K+    E  E+   
Sbjct: 1170 KEDAKAEEAREKLQRAAARGESGAAKKV----SRQAPKKPAPKKTTKKASESETTEETYG 1225

Query: 197  KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE-KEVIPKLTFK----FGTDM 251
                E ++  +  K K +    KK     K+KE+E    + K+ +              M
Sbjct: 1226 SSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKM 1285

Query: 252  EEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 283
            EE  K   + +   +     S   ++ +D+ D
Sbjct: 1286 EETVKAVPARRAAARKKPLASVSVISDSDDDD 1317



 Score = 30.2 bits (68), Expect = 2.8
 Identities = 40/242 (16%), Positives = 85/242 (35%), Gaps = 19/242 (7%)

Query: 51   LVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVE-LGVTPMVPNI 109
            L    GT    K+++  + +DK +   ++   +KT+PK      +KD++ L       + 
Sbjct: 1116 LSMAIGTLTLEKVQELCADRDKLN--IEVEDLKKTTPKSLW---LKDLDALEKELDKLDK 1170

Query: 110  VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK---------D 160
                + E   ++++  +  +     K   +   K    K  K   +++           +
Sbjct: 1171 EDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAME 1230

Query: 161  KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLK 219
             E   +  K KG+   K+K      + +E+ E    K R    + +    +S   ++ +K
Sbjct: 1231 TENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVK 1290

Query: 220  KEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKP---VKSPSPPPV 276
                ++    K+  +    +           +E         K   KP    K  + PP 
Sbjct: 1291 AVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPA 1350

Query: 277  AA 278
            AA
Sbjct: 1351 AA 1352


>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
           subunit A'' fusion protein; Provisional.
          Length = 509

 Score = 34.0 bits (78), Expect = 0.16
 Identities = 24/153 (15%), Positives = 64/153 (41%), Gaps = 8/153 (5%)

Query: 61  SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
            ++   ++    T  K++ +     S     YS    +     P++  ++++ S E +  
Sbjct: 27  FEVPKNITDGYITLKKKEFTYHVIISKIAP-YSNSNGIIKKKKPVLKTVLEIESEEKIEA 85

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKVDKEKEKGKK----- 174
           I   +  +   +   +       +    ++K+  + N  +D  K +D  K+ G +     
Sbjct: 86  IDLMEFKRLFGRILDENMSFSTGELLTAEEKEYYEENSNEDVLKVIDDVKKLGFRLPPSV 145

Query: 175 EEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSE 206
            E+  K  K+ ++ +D+ E+ + +IR+E + + 
Sbjct: 146 IEEIAKAMKKKELSDDEYEEILRRIREEYERAR 178



 Score = 28.6 bits (64), Expect = 7.1
 Identities = 32/136 (23%), Positives = 49/136 (36%), Gaps = 20/136 (14%)

Query: 125 KSIKKDKKTHKKKE---KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE----- 176
           K+I     T KKKE      I K     + + I  K+K   K V + + + K E      
Sbjct: 31  KNITDGYITLKKKEFTYHVIISKIAPYSNSNGIIKKKKPVLKTVLEIESEEKIEAIDLME 90

Query: 177 ------KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK------EKLK 224
                 +   E   F   E      ++  +E  + +  K   D  K   +      E++ 
Sbjct: 91  FKRLFGRILDENMSFSTGELLTAEEKEYYEENSNEDVLKVIDDVKKLGFRLPPSVIEEIA 150

Query: 225 KKKKEKERSSHEKEVI 240
           K  K+KE S  E E I
Sbjct: 151 KAMKKKELSDDEYEEI 166


>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD.  This
           family consists of several putative bacterial stage IV
           sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
           is known to be essential for efficient sporulation
           although its exact function is unknown.
          Length = 383

 Score = 33.9 bits (78), Expect = 0.16
 Identities = 42/191 (21%), Positives = 75/191 (39%), Gaps = 24/191 (12%)

Query: 60  PSKMKDKVSPKDKTSPKEKMSSK---EKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
           P +  +   P++  + K+ + ++    K +   K    VK  ++ V+  +          
Sbjct: 180 PPEELENGEPRNIVAKKDGIITRMYVTKGTAVVKVGDVVKKGDILVSGQIGKEGNEYEVH 239

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
             GE+  +   +   +   K   E +  +K  K    I  K K   K    E EK  KEE
Sbjct: 240 AKGEVLAETWYESTVEVPLKTNFEVLTGEKITKHYLSIGGK-KLPLKFRKNEFEKYDKEE 298

Query: 177 KEKK-----------------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD---KDK 216
            EK                  E ++  +K  KE+ VEK +K  K   +++  KD   KD+
Sbjct: 299 DEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVEKGKKLAKKKLEKEIDKDAKIKDE 358

Query: 217 KLKKEKLKKKK 227
           K+  E+++  K
Sbjct: 359 KVLHERVENGK 369



 Score = 28.9 bits (65), Expect = 6.5
 Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 11/91 (12%)

Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIK------NKEKDKEKKVDKE--KEKGKKEEKE 178
           +K  K   +K +KE+ +K    K K  I        + +DK  K+ KE   EKGKK  K+
Sbjct: 284 LKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVEKGKKLAKK 343

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
           K E++   + +D +   EK+  E  ++ K +
Sbjct: 344 KLEKE---IDKDAKIKDEKVLHERVENGKVR 371


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 34.1 bits (79), Expect = 0.17
 Identities = 22/106 (20%), Positives = 49/106 (46%), Gaps = 22/106 (20%)

Query: 144 KKKDKKDKDKIKNKE---KDKEKKVDKEKEKGKKEEKEKKEEKEFK-------------M 187
           K   K+ K  I        D+ +++ K+ E+   E+K++KE  E K             +
Sbjct: 485 KGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSL 544

Query: 188 KEDKEKP----VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
           KE+ +K      +K+ +  +  ++E + +DK++   + K ++ +K 
Sbjct: 545 KEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEI--EAKTEELQKV 588



 Score = 33.7 bits (78), Expect = 0.19
 Identities = 31/122 (25%), Positives = 61/122 (50%), Gaps = 13/122 (10%)

Query: 109 IVKVTSSEDLGEIKKDK-SIKKDKKTHKKKEKEKIK-----KKKDKKDKDKIKNKEKDKE 162
           I+ V++ +D G  K+ K +I         + +  +K       +DKK K++I+ K + +E
Sbjct: 477 ILTVSA-KDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEE 535

Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
                  EK  KEE +K  E +   K+  E+ +E +++E +  +KE+  + K ++L+K  
Sbjct: 536 --YVYSLEKSLKEEGDKLPEAD---KKKVEEAIEWLKEELEGEDKEEI-EAKTEELQKVV 589

Query: 223 LK 224
             
Sbjct: 590 QP 591



 Score = 33.0 bits (76), Expect = 0.31
 Identities = 24/104 (23%), Positives = 54/104 (51%), Gaps = 3/104 (2%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE-KEKKEEKEFKM 187
           KDK T  K++K  I         D+I+   KD E+   ++K++ ++ E K + EE  + +
Sbjct: 483 KDKGT-GKEQKITITASSGL-SDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSL 540

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           ++  ++  +K+ + +K   +E     K++   ++K + + K +E
Sbjct: 541 EKSLKEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAKTEE 584



 Score = 31.8 bits (73), Expect = 0.85
 Identities = 23/104 (22%), Positives = 55/104 (52%), Gaps = 15/104 (14%)

Query: 112 VTSSEDLGEIKKDKSIKKDKKTH----KKKEKEKIK----------KKKDKKDKDKIKNK 157
           +T+S  L + + ++ +K  ++      K+KE+ + K          +K  K++ DK+   
Sbjct: 495 ITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEA 554

Query: 158 EKDK-EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
           +K K E+ ++  KE+ + E+KE+ E K  ++++  +   E++ +
Sbjct: 555 DKKKVEEAIEWLKEELEGEDKEEIEAKTEELQKVVQPIGERMYQ 598


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 32.7 bits (75), Expect = 0.17
 Identities = 23/88 (26%), Positives = 52/88 (59%), Gaps = 13/88 (14%)

Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKM----KEDKEKPVEKIRKEE-KDSEKEKKSKDK 214
           D E+K ++++ + +++E+++++EKE+      KE++EK +E++ K + K   +    +D 
Sbjct: 13  DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDY 72

Query: 215 DKKLKKE--------KLKKKKKEKERSS 234
           D++LK++        +  +KKKEK    
Sbjct: 73  DEELKEQERWDDPMAQFLRKKKEKTDKK 100



 Score = 31.1 bits (71), Expect = 0.54
 Identities = 22/105 (20%), Positives = 56/105 (53%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
           + +DKS +      K++EKE+ K++K++K++ + +  +   +K+  +++ +  ++ K K 
Sbjct: 3   VYRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
             +    ++  E+  E+ R ++  ++  +K K+K  K  K   K 
Sbjct: 63  LARYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKPLYKG 107



 Score = 31.1 bits (71), Expect = 0.56
 Identities = 22/93 (23%), Positives = 52/93 (55%), Gaps = 3/93 (3%)

Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
           +I   ++K+++ + + +EK+++++ +KE  KG  +++E+++  E   K   +        
Sbjct: 10  RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADD 69

Query: 201 EEKDSEKEKKSKDKD---KKLKKEKLKKKKKEK 230
           E+ D E +++ +  D   + L+K+K K  KK K
Sbjct: 70  EDYDEELKEQERWDDPMAQFLRKKKEKTDKKGK 102


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 32.6 bits (75), Expect = 0.17
 Identities = 26/103 (25%), Positives = 54/103 (52%), Gaps = 5/103 (4%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKE---KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK 191
            ++ ++ +KK KD ++K+++K      K + K +  +  + +  ++ KK+EKE   +  K
Sbjct: 65  IEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124

Query: 192 EKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKKKKEKER 232
              ++K   ++   +K+     K K+L K  EK +KK   KE+
Sbjct: 125 PYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEK 167


>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1.  MAT1
           is an assembly/targeting factor for cyclin-dependent
           kinase-activating kinase (CAK), which interacts with the
           transcription factor TFIIH. The domain found to the
           N-terminal side of this domain is a C3HC4 RING finger.
          Length = 200

 Score = 33.2 bits (76), Expect = 0.18
 Identities = 24/107 (22%), Positives = 60/107 (56%), Gaps = 5/107 (4%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
           ++    ++K+ K+ I + K +  +++ + ++  +E+K  KE EK    +KE++E+K  K 
Sbjct: 67  EEKVDQYEKENKDSIMRNKRRLTREQEELEQALEEEKEMKE-EKRLHLQKEEQEQKMAK- 124

Query: 188 KEDKEKPVEKIRKEEKDSEK---EKKSKDKDKKLKKEKLKKKKKEKE 231
           ++DK++ ++++      +     + K + K  + + EKL++KK+   
Sbjct: 125 EKDKQEIIDELETSNLPANVIIAQHKKQSKQLESQVEKLERKKRVTF 171


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 33.1 bits (75), Expect = 0.18
 Identities = 23/91 (25%), Positives = 55/91 (60%), Gaps = 6/91 (6%)

Query: 125 KSIKKDKKTHKKK---EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           +S+KKD    KK     +E+I+K + ++++ + + + + +E+K+D E+E  +++E+E++E
Sbjct: 108 RSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKID-EREDQEEQEREREE 166

Query: 182 EKEFKMKEDKEKPV--EKIRKEEKDSEKEKK 210
           +   +  +D E  +  +   + E D +K +K
Sbjct: 167 QTIEEQSDDSEHEIIEQDESETESDDDKTEK 197



 Score = 33.1 bits (75), Expect = 0.18
 Identities = 22/83 (26%), Positives = 49/83 (59%), Gaps = 10/83 (12%)

Query: 138 EKEKIKKKKD----KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
           +K+ IKKKK     ++  +K + + ++ E++++ E+ + K +E+E +EE+E      +E+
Sbjct: 111 KKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQE------RER 164

Query: 194 PVEKIRKEEKDSEKEKKSKDKDK 216
             + I ++  DSE E   +D+ +
Sbjct: 165 EEQTIEEQSDDSEHEIIEQDESE 187



 Score = 31.2 bits (70), Expect = 0.81
 Identities = 19/88 (21%), Positives = 50/88 (56%)

Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 210
           +D++++ +KD  KK      + ++ EK ++E +E + + + E+  EKI + E   E+E++
Sbjct: 104 RDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERE 163

Query: 211 SKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            +++  + + +  + +  E++ S  E +
Sbjct: 164 REEQTIEEQSDDSEHEIIEQDESETESD 191


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 33.5 bits (77), Expect = 0.18
 Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 8/80 (10%)

Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 210
           K+ +K+    + K +DK  ++ KK+++E +E+ E      +        K+ K  E+ KK
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAE--------KKRKSKEEIKK 260

Query: 211 SKDKDKKLKKEKLKKKKKEK 230
            K K+ K  K   K   K  
Sbjct: 261 KKPKESKGVKALKKVVAKGM 280



 Score = 32.3 bits (74), Expect = 0.47
 Identities = 18/70 (25%), Positives = 35/70 (50%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
             +   E+ K   I + K   K  ++ K KK++ ++D +  +++ + K K  ++ K+K  
Sbjct: 204 PPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263

Query: 174 KEEKEKKEEK 183
           KE K  K  K
Sbjct: 264 KESKGVKALK 273



 Score = 31.9 bits (73), Expect = 0.57
 Identities = 19/70 (27%), Positives = 35/70 (50%)

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           K+  K     +   + +  ++  +K R+ E+D E  +   +K +K K+E  KKK KE + 
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268

Query: 233 SSHEKEVIPK 242
               K+V+ K
Sbjct: 269 VKALKKVVAK 278



 Score = 29.6 bits (67), Expect = 3.3
 Identities = 16/71 (22%), Positives = 30/71 (42%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
             + K+        + K   K  K+ K K +  E+D E    + ++K K +E+ KK++ +
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264

Query: 185 FKMKEDKEKPV 195
                   K V
Sbjct: 265 ESKGVKALKKV 275



 Score = 28.8 bits (65), Expect = 5.0
 Identities = 21/77 (27%), Positives = 36/77 (46%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
               K+  K+    E + + K   +  K K + +E  +  +   EK++  KEE +KK+ K
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264

Query: 184 EFKMKEDKEKPVEKIRK 200
           E K  +  +K V K  K
Sbjct: 265 ESKGVKALKKVVAKGMK 281


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 33.7 bits (78), Expect = 0.19
 Identities = 22/105 (20%), Positives = 62/105 (59%), Gaps = 1/105 (0%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
            K   + ++++ K+ +++ + +   +E+  ++EK + ++E +E+ +E+E   +  +    
Sbjct: 52  LKALAEEEERERKRKEERREGRAVLQEQIEEREKRR-QEEYEERLQEREQMDEIIERIQE 110

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
           E   + ++  EK+KK +++  +  +E++++K++EKER   E+  I
Sbjct: 111 EDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKI 155



 Score = 33.0 bits (76), Expect = 0.33
 Identities = 18/102 (17%), Positives = 66/102 (64%), Gaps = 2/102 (1%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
           ++  K++++E  ++ ++++  D+I  + +++++   + +EK +K++K ++E  EF  +  
Sbjct: 81  EEREKRRQEEYEERLQEREQMDEIIERIQEEDEA--EAQEKREKQKKLREEIDEFNEERI 138

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           + K  EK R+ E++ +  +  ++K ++ ++ + +++++++E+
Sbjct: 139 ERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEK 180



 Score = 29.9 bits (68), Expect = 2.5
 Identities = 25/114 (21%), Positives = 77/114 (67%), Gaps = 3/114 (2%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           E + +   K++K+   ++E ++  +++ ++ +++ + + +++ K ++ ++EK ++EE+ +
Sbjct: 111 EDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEERE 170

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKER 232
            E +E   KE+KE+ V ++R +++++E E++  D+ +  L +E+ ++K+++KE+
Sbjct: 171 AERRE--RKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEK 222


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 33.7 bits (78), Expect = 0.19
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 164
           K +K++ K     +KE     K +K K+K + K    + K
Sbjct: 50  KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 31.3 bits (72), Expect = 1.0
 Identities = 16/57 (28%), Positives = 31/57 (54%), Gaps = 7/57 (12%)

Query: 111 KVTS-SEDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV 165
           +VT+ +E   E+K++ K+   DKK  K   K +  KKK+K+   + K  +   + ++
Sbjct: 41  EVTNLNEQYKEMKEELKAALLDKKELKAWHKAQ--KKKEKQ---EAKAAKAKSKPRL 92



 Score = 30.6 bits (70), Expect = 1.4
 Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 200 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
           K     +KE K+  K +K KKEK + K  + +    
Sbjct: 57  KAALLDKKELKAWHKAQK-KKEKQEAKAAKAKSKPR 91



 Score = 28.6 bits (65), Expect = 7.2
 Identities = 11/42 (26%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
           KE++K     K + K  +K +   KK +K++ K  K + + +
Sbjct: 53  KEELKAALLDKKELKAWHKAQ---KKKEKQEAKAAKAKSKPR 91



 Score = 28.3 bits (64), Expect = 8.1
 Identities = 16/66 (24%), Positives = 29/66 (43%), Gaps = 10/66 (15%)

Query: 153 KIKNKEKDKEKKV----DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
             + K +  E +V    ++ KE  ++ +    ++KE K     +K      K+EK   K 
Sbjct: 30  AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQK------KKEKQEAKA 83

Query: 209 KKSKDK 214
            K+K K
Sbjct: 84  AKAKSK 89


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 32.2 bits (74), Expect = 0.19
 Identities = 19/92 (20%), Positives = 49/92 (53%), Gaps = 10/92 (10%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK---------EKEKG 172
           K+ +   K ++   +K +++++K K+K  KD     E  +EKK  +          K++ 
Sbjct: 21  KQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQK 80

Query: 173 KKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEK 203
            +++ +K++++E +   DK  K ++++ K++ 
Sbjct: 81  LQQDLQKRQQEELQKILDKINKAIKEVAKKKG 112



 Score = 28.7 bits (65), Expect = 3.1
 Identities = 13/69 (18%), Positives = 41/69 (59%), Gaps = 4/69 (5%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKD---KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
           K  +++++K+  K+  + ++  EK+    ++K+ K+     +  +EKKE++  K  ++ +
Sbjct: 17  KAAQKQLEKEFKKRQAE-LEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQ 75

Query: 193 KPVEKIRKE 201
           +  +K++++
Sbjct: 76  RKQQKLQQD 84


>gnl|CDD|227470 COG5141, COG5141, PHD zinc finger-containing protein [General
           function prediction only].
          Length = 669

 Score = 33.8 bits (77), Expect = 0.20
 Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 9/64 (14%)

Query: 275 PVAAADEYDTGDSKQVWICPAC-GVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF 333
           P+  +DE+D        IC  C    ++ S  ++ CDGC+   H  C G+   PE   W 
Sbjct: 185 PIEPSDEFDD-------ICTKCTSTHNENSNAIVFCDGCEICVHQSCYGIQFLPE-GFWL 236

Query: 334 CPKC 337
           C KC
Sbjct: 237 CRKC 240


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 32.8 bits (75), Expect = 0.20
 Identities = 21/92 (22%), Positives = 43/92 (46%), Gaps = 4/92 (4%)

Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
               K+    + + KK  KE +K +KE K K+ + +    + K + +    K E + +KE
Sbjct: 40  PQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQ---DDGKMEALSDRAKAEAEIKKE 96

Query: 209 KKSKDKDKK-LKKEKLKKKKKEKERSSHEKEV 239
           K     +KK  + EK   +++ +E     +++
Sbjct: 97  KLVNAFNKKQQEYEKDLNRREAEEEQKLLEKI 128


>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1.  RNA
           polymerases catalyze the DNA dependent polymerisation of
           RNA. Prokaryotes contain a single RNA polymerase
           compared to three in eukaryotes (not including
           mitochondrial. and chloroplast polymerases). This
           domain, domain 1, represents the clamp domain, which a
           mobile domain involved in positioning the DNA,
           maintenance of the transcription bubble and positioning
           of the nascent RNA strand.
          Length = 330

 Score = 33.4 bits (77), Expect = 0.21
 Identities = 23/101 (22%), Positives = 37/101 (36%), Gaps = 8/101 (7%)

Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
              K    K+      K+  K         KK++   +K     K  ++    K  E   
Sbjct: 107 ESVKYFFLKVVIDPKGKNSKKRL-------KKINNLCKKKSICSKCGEDNGGLKAFEGCG 159

Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           K   KI K+  ++ K      + ++L KEKL+K   EK   
Sbjct: 160 KYQPKISKDGAEAIKALLKNIEIEEL-KEKLRKLNPEKVLQ 199


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 33.0 bits (76), Expect = 0.22
 Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 3/58 (5%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKN---KEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           KD +    KE++ I KK  K D   I     +EK+K+K + KE++K  KEEKEK EE 
Sbjct: 65  KDFRKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 32.5 bits (74), Expect = 0.22
 Identities = 19/65 (29%), Positives = 35/65 (53%)

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
            +  KK+ K K ++   +    +E P +    +  + + +KK  + DK+ KK+K +KKKK
Sbjct: 106 IQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKK 165

Query: 229 EKERS 233
           +K  S
Sbjct: 166 KKRHS 170



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE--KEKKEEKEFKMKEDKE 192
           KKK K K KK + +    +    + +  K  +K+ +K K E+  + KK++KE K K+ + 
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169

Query: 193 KP 194
            P
Sbjct: 170 SP 171


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 32.6 bits (75), Expect = 0.22
 Identities = 22/67 (32%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           + K + E+E  K EE++++ EK     E K + +EK R+EE+   +EK+  D+   LKK+
Sbjct: 120 QGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK-REEEERQIEEKRHADEIAFLKKQ 178

Query: 222 KLKKKKK 228
             + K +
Sbjct: 179 NQQLKSQ 185



 Score = 28.7 bits (65), Expect = 5.0
 Identities = 10/66 (15%), Positives = 38/66 (57%), Gaps = 4/66 (6%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           + E+ K + +++ K  ++ K + + +  +++ + E  +K E+E+++ +E +  ++    +
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE----I 172

Query: 196 EKIRKE 201
             ++K+
Sbjct: 173 AFLKKQ 178


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 33.2 bits (76), Expect = 0.24
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 18/117 (15%)

Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
             K KI+  +K+ +  + K KEK  +E++E +E  EF+ KE++E+    ++KEE   E++
Sbjct: 117 NTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFE-KEEEEQRRLLLQKEE---EEQ 172

Query: 209 KKSKDKDKKLKKEKLK-------------KKKKEKERSSHEKEVIPKL-TFKFGTDM 251
           + +K K+K+   ++L+             KK   K     EK    K  TF  G  M
Sbjct: 173 QMNKRKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPNTFSTGIKM 229


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 33.4 bits (77), Expect = 0.24
 Identities = 17/67 (25%), Positives = 32/67 (47%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
           K+     +++ EK++K   K ++   KN    K+ K+ + +++    EK   E KE   K
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAK 300

Query: 189 EDKEKPV 195
              E P+
Sbjct: 301 YGNEVPL 307


>gnl|CDD|224291 COG1372, COG1372, Intein/homing endonuclease [DNA replication,
           recombination, and repair].
          Length = 420

 Score = 33.4 bits (76), Expect = 0.24
 Identities = 20/117 (17%), Positives = 48/117 (41%)

Query: 88  KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD 147
            EK+  + + V +    ++  ++++  +  LG+  K     +  K  KK    +I +K  
Sbjct: 281 SEKKGKRERVVTISGANLLLELLRIVYAYKLGKAAKIAEALEKLKEKKKNINRRILRKIA 340

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
            ++           + K+  E  + +K  KE +  K+    ++    +E+I +   D
Sbjct: 341 IREVLLKLEGRLLSKLKLLLEILRKEKLLKEDENLKKLSKGDEYFDEIEEIEEIGYD 397


>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase.  The valyl-tRNA synthetase
           (ValS) is a class I amino acyl-tRNA ligase and is
           particularly closely related to the isoleucyl tRNA
           synthetase [Protein synthesis, tRNA aminoacylation].
          Length = 861

 Score = 33.5 bits (77), Expect = 0.25
 Identities = 23/120 (19%), Positives = 50/120 (41%), Gaps = 13/120 (10%)

Query: 59  PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVEL--GVTPMVPNIVKVTSSE 116
           PP+     +      +  E     +  +   K      +VE+      +   +V++    
Sbjct: 746 PPNAPLKVLL---IYTEAETAERLKLNAVDIKGAINFSEVEVVIEKPEVTEAVVELVPGF 802

Query: 117 DL-----GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
           ++     G I K K + + +K   K++KE I+ +   +++  +K   K   + ++KEKEK
Sbjct: 803 EIIIPVKGLINKAKELARLQKQLDKEKKEVIRIEGKLENEGFVKKAPK---EVIEKEKEK 859


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 32.8 bits (75), Expect = 0.25
 Identities = 21/115 (18%), Positives = 54/115 (46%), Gaps = 12/115 (10%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           + E E+ ++    K ++K+  ++ DK   +D   E  + +++E + +++   ++ +    
Sbjct: 60  RAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDA 119

Query: 196 EKIRKEEKDSEKEKKSKDKD------------KKLKKEKLKKKKKEKERSSHEKE 238
           +           +  S D D            +K+KKE+ ++K++E+E  + E+E
Sbjct: 120 DDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEE 174


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 33.5 bits (76), Expect = 0.26
 Identities = 40/132 (30%), Positives = 64/132 (48%), Gaps = 12/132 (9%)

Query: 115  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            ++++ +IK D      K  H  K  E+IKKK +    D+IK +  D E   DK       
Sbjct: 1110 ADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENY-IDEIKAQINDLEDVADKAISNDDP 1168

Query: 175  EEKEKKEE-------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD----KKLKKEKL 223
            EE EKK E       K+  + ++ +K + +I + EKD    ++ K  +    K L K  L
Sbjct: 1169 EEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSYGKNLGKLFL 1228

Query: 224  KKKKKEKERSSH 235
            +K  +EK++S H
Sbjct: 1229 EKIDEEKKKSEH 1240



 Score = 29.6 bits (66), Expect = 4.0
 Identities = 47/170 (27%), Positives = 76/170 (44%), Gaps = 24/170 (14%)

Query: 122  KKDKSIKKDKKTHKKKE-KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
            K +K IKK K     +E K KI+   D KD D+   K K+ +  +  E+       K   
Sbjct: 1387 KSEKLIKKIKDDINLEECKSKIESTLDDKDIDECIKKIKELKNHILSEESNIDTYFKNAD 1446

Query: 181  EEKE--------FKMKEDKEKPVEKIRKEEKDSEKE------KKSKDKDKKLKKEKLKKK 226
            E  E         +M ++K + + KI+K+   ++ +      K+  DK K  K E  K  
Sbjct: 1447 ENNENVLLLFKNIEMADNKSQHILKIKKDNATNDHDFNINELKEHIDKSKGCKDEADKNA 1506

Query: 227  K---KEKERSSHEKEVIPKLTFKFGT----DMEEKTKRESSPKIVIKPVK 269
            K   K KE     K+ + +L  K+      +   KTK++S  +I+IK +K
Sbjct: 1507 KAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDS--EIIIKEIK 1554



 Score = 29.6 bits (66), Expect = 4.7
 Identities = 37/150 (24%), Positives = 62/150 (41%), Gaps = 2/150 (1%)

Query: 88  KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD 147
           K KEY K   ++      + N +K    + L ++ K  + + + K     E E+  +  +
Sbjct: 815 KSKEYIKTISIKEDEIFKIINEMKFMKDDFLNKVDKFINFENNCKEKIDSEHEQFAELTN 874

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           K   +   +K  D EKK +  K      E  K  E+E++     +K  E I+  E   E 
Sbjct: 875 KIKAEISDDKLNDYEKKFNDSKS--LINEINKSIEEEYQNINTLKKVDEYIKICENTKES 932

Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
            +K  +K   LK+   K     KE +  EK
Sbjct: 933 IEKFHNKQNILKEILNKNIDTIKESNLIEK 962


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 33.5 bits (76), Expect = 0.27
 Identities = 26/129 (20%), Positives = 48/129 (37%), Gaps = 19/129 (14%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           K   S    + + D++    KK   +  K   KK   +  +      E+  E  V+++  
Sbjct: 50  KKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISE----PLEEGSELVVNEDAA 105

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
             K+ +K  +  +               RK    S   ++ K + K  K+ K+KK  ++ 
Sbjct: 106 LDKESKKTPRRTR---------------RKAAAASSDVEEEKTEKKVRKRRKVKKMDEDV 150

Query: 231 ERSSHEKEV 239
           E    E EV
Sbjct: 151 EDQGSESEV 159



 Score = 28.5 bits (63), Expect = 7.6
 Identities = 28/150 (18%), Positives = 57/150 (38%), Gaps = 17/150 (11%)

Query: 57  TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
                +  D+     K  P   +    K +  E      +  EL           V + +
Sbjct: 55  ALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSEL-----------VVNED 103

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
              +    +S K  ++T ++K        +++K + K++ + K K K  +  +++G + E
Sbjct: 104 AALD---KESKKTPRRT-RRKAAAASSDVEEEKTEKKVRKRRKVK-KMDEDVEDQGSESE 158

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
               EE EF    + E   E++  E+ D E
Sbjct: 159 VSDVEESEFVTSLENES-EEELDLEKDDGE 187


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 32.7 bits (75), Expect = 0.28
 Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 2/55 (3%)

Query: 131 KKTHKKKEKEKIKK--KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           KK   K+EK  IK   K D          +K+++K + KE++K  K EKEK EE+
Sbjct: 68  KKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 33.4 bits (76), Expect = 0.29
 Identities = 29/105 (27%), Positives = 45/105 (42%), Gaps = 15/105 (14%)

Query: 143 KKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 202
             + DKK +   +  +   EK+   +K+K KKEE+E+  E++  +K D E  VE+I    
Sbjct: 2   VDESDKKQQTIDEQSQISPEKQTPNKKDK-KKEEEEQLSEEDAMLKGDLELLVERI---- 56

Query: 203 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 247
                    +D D  L+   L    KE  +SS           KF
Sbjct: 57  ---------QDPDIDLQNNSL-NMLKEVIKSSTSSMTAVPKPLKF 91


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 33.0 bits (76), Expect = 0.31
 Identities = 26/81 (32%), Positives = 47/81 (58%), Gaps = 6/81 (7%)

Query: 160 DKEKKVDKEKEKGKK--EEKEKKEEKEFKMKEDKEKPVEK---IRKEEKDSEKEKKSKDK 214
            KE ++ K + K  K  EEK +K ++E+ ++E + K ++K   I K++KD  ++ K K +
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLRE-QLKAIKKELGIEKDDKDELEKLKEKLE 257

Query: 215 DKKLKKEKLKKKKKEKERSSH 235
           + KL +E  K  +KE  + S 
Sbjct: 258 ELKLPEEVKKVIEKELTKLSL 278



 Score = 30.0 bits (68), Expect = 2.8
 Identities = 18/51 (35%), Positives = 29/51 (56%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD 166
           E L  IKK+  I+KD K   +K KEK+++ K  ++  K+  KE  K   ++
Sbjct: 230 EQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTKLSLLE 280


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 32.2 bits (73), Expect = 0.31
 Identities = 12/33 (36%), Positives = 14/33 (42%)

Query: 27  PSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPP 59
            S   PG+ P   P +     P P   PPG PP
Sbjct: 147 SSYGAPGVPPPYNPPMLMMKPPPPSPGPPGAPP 179


>gnl|CDD|147348 pfam05124, S_layer_C, S-layer like family, C-terminal region. 
          Length = 223

 Score = 32.4 bits (74), Expect = 0.31
 Identities = 26/101 (25%), Positives = 36/101 (35%), Gaps = 8/101 (7%)

Query: 92  YSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK 151
           Y  VK +ELG   +          +  G+++       ++KT     K       D KD 
Sbjct: 15  YKDVKTLELGEEYIPDWEAYAVLDDGKGKLEYKDDEVLEQKTVGLALKYVGDYLDDIKDG 74

Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
           DKIK     K    D        E+   K    F+M E KE
Sbjct: 75  DKIKIANYAKLLFDD--------EDSSDKLNVYFEMDEYKE 107


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 32.8 bits (75), Expect = 0.31
 Identities = 21/64 (32%), Positives = 39/64 (60%), Gaps = 3/64 (4%)

Query: 124 DKSIKKDKK-THKKKEKEKIKKKKDKKDK--DKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
           DK   +D K +  +KEK+ I+ + +  D   ++I++K  + +K++D+ K + KK +KE  
Sbjct: 31  DKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIA 90

Query: 181 EEKE 184
           E KE
Sbjct: 91  ELKE 94



 Score = 29.3 bits (66), Expect = 4.0
 Identities = 14/101 (13%), Positives = 38/101 (37%), Gaps = 5/101 (4%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           L + K+DK   ++K+   + + E +   + + +   + + +++ +K   +     K    
Sbjct: 150 LEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASA 209

Query: 176 EKEK---KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
             EK   +E+K        E   ++   +    E+      
Sbjct: 210 LGEKAALEEQKALAEAAAAEAAKQEAAAKAAAQEQAALQAA 250



 Score = 28.5 bits (64), Expect = 6.4
 Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 2/90 (2%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           K+DK  ++ K+   E K   E     + E E +       K +K   +  +  +E  +  
Sbjct: 154 KEDKKSLEEKQAALEDK--LETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALG 211

Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           EK + ++ K L +    +  K++  +    
Sbjct: 212 EKAALEEQKALAEAAAAEAAKQEAAAKAAA 241


>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
          Length = 312

 Score = 32.7 bits (75), Expect = 0.32
 Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL--KKEKLKKKKK 228
            GKK   ++   K +  K  K+K       E  +SEK +K K K   L  KK+KL+K  +
Sbjct: 103 FGKKFFGKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNE 162

Query: 229 EK 230
           EK
Sbjct: 163 EK 164


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 33.0 bits (75), Expect = 0.33
 Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 3/88 (3%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           ++  +     K +  DK+K KNKEKDK  K D++ +K  K   EK + +         + 
Sbjct: 205 RRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQN---SHFDVRI 261

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
           +    KE K  EK   S  +   L K K
Sbjct: 262 LRSYTKENKKDEKNVVSGIRSSVLLKRK 289



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 2/66 (3%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE--KKVDKE 168
           + TS+       K+K   K+K  + KK+++  K+ K   +K K++N   D    +   KE
Sbjct: 209 RNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKE 268

Query: 169 KEKGKK 174
            +K +K
Sbjct: 269 NKKDEK 274


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 33.2 bits (76), Expect = 0.33
 Identities = 38/168 (22%), Positives = 65/168 (38%), Gaps = 9/168 (5%)

Query: 76  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
            E + + E     EK    ++         +PN   +   +   E  K K IK   +  K
Sbjct: 157 GEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQK 216

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE-----EKEFKMKED 190
            KE+    KK+ K DK KIK+ + D +    + K    + +K  +      E ++ +K  
Sbjct: 217 IKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKAPQNDLKRLENKYAIKSG 276

Query: 191 KEKPVEKI---RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
             K         +  K  +K  K  D+ + L   K KK+ KE+   + 
Sbjct: 277 DLKNFAVDLFGPEIRKYLQKFLKYYDQAEPLLN-KSKKEPKEEAVETL 323



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 35/152 (23%), Positives = 70/152 (46%), Gaps = 7/152 (4%)

Query: 91  EYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD 150
           E   ++ +  G        V  T  E         SIK+ K +  KK   ++   +    
Sbjct: 99  EELAIEGLAFGTERSTSGAVPETKDET--PASAPSSIKEQKSSELKKVDSQLPDPRALLK 156

Query: 151 KDKIKNKEKDKE-KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE- 208
            + +K  E  +E +K  K  ++  K+ K+    K+  ++E K++ +E I+K++  +  E 
Sbjct: 157 GEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQ-DLEEYKKR-LEAIKKKDIKNPLEL 214

Query: 209 KKSKDKDKKLKKE-KLKKKKKEKERSSHEKEV 239
           +K K++  KLKKE K  K+K +  ++  + + 
Sbjct: 215 QKIKEEFDKLKKEGKADKQKIKSAKNDLQNDK 246


>gnl|CDD|204587 pfam11078, Optomotor-blind, Optomotor-blind protein N-terminal
          region.  This family is conserved in Drosophila spp.
          Optomotor-blind is one of the essential toolkit
          proteins for coordinating development in diverse animal
          taxa, and in Drosophila it plays a key role in
          establishing the abdominal pigmentation pattern, in
          development of the central nervous system and leg and
          wing imaginal disc-formation of Drosophila
          melanogaster. This is the N-terminal region of the
          protein and does not include the T-box-containing
          transcription factor that plays a part in DNA-binding.
          Length = 74

 Score = 30.4 bits (68), Expect = 0.33
 Identities = 14/36 (38%), Positives = 16/36 (44%)

Query: 22 QFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGT 57
          Q  P P +    LIP+ PP   HP    PL H   T
Sbjct: 14 QAGPHPGLYPGTLIPKFPPAHPHPHPHHPLGHAYTT 49


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 33.1 bits (76), Expect = 0.35
 Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 5/72 (6%)

Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
           + E+K+ K + K KK     K+    K      K    + +E  D E E K  ++     
Sbjct: 35  EVERKLRKLESKIKKLGIPLKDTG-GKPDVPPSKEFLDLEEEILDLEAEIKEVEE----N 89

Query: 220 KEKLKKKKKEKE 231
            E L+K+  E E
Sbjct: 90  LESLEKEINELE 101



 Score = 32.4 bits (74), Expect = 0.51
 Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 3/82 (3%)

Query: 138 EKEKIKKKKDKK-DKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
           E E+  +K + K  K  I  K+   +  V   KE    EE+    E E K  E+  + +E
Sbjct: 35  EVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLE 94

Query: 197 KIR--KEEKDSEKEKKSKDKDK 216
           K     EE  +  +++    D+
Sbjct: 95  KEINELEEWLNVLDEEKSFLDE 116



 Score = 30.0 bits (68), Expect = 3.0
 Identities = 17/73 (23%), Positives = 31/73 (42%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           KK    +K    K D    K     +E+ +D E E  + EE  +  EKE    E+    +
Sbjct: 48  KKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVL 107

Query: 196 EKIRKEEKDSEKE 208
           ++ +    ++ +E
Sbjct: 108 DEEKSFLDENLEE 120


>gnl|CDD|130658 TIGR01597, PYST-B, Plasmodium yoelii subtelomeric family PYST-B.
           This model represents a paralogous family of Plasmodium
           yoelii genes preferentially located in the subtelomeric
           regions of the chromosomes. There are no obvious
           homologs to these genes in any other organism.
          Length = 255

 Score = 32.6 bits (74), Expect = 0.35
 Identities = 29/92 (31%), Positives = 42/92 (45%), Gaps = 2/92 (2%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
           DKKT K   K + + ++ KK+ D   N E   +   DK   K K E     E ++FK  E
Sbjct: 108 DKKTKKLINKLQKELEELKKELDNEMNDELTIQPIHDKIIIK-KDENNSVSEHEDFKQLE 166

Query: 190 DKEKPVEKIRKE-EKDSEKEKKSKDKDKKLKK 220
           +++       +E +  S    K K K KKL K
Sbjct: 167 NEKNSSVSEHEEFDIASSDNLKIKRKLKKLVK 198


>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
           (Spore_III_AF).  This family represents the stage III
           sporulation protein AF (Spore_III_AF) of the bacterial
           endospore formation program, which exists in some but
           not all members of the Firmicutes (formerly called
           low-GC Gram-positives). The C-terminal region of these
           proteins is poorly conserved.
          Length = 185

 Score = 31.9 bits (73), Expect = 0.36
 Identities = 24/125 (19%), Positives = 56/125 (44%), Gaps = 7/125 (5%)

Query: 107 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK---KKKDKKD-KDKIKNKEKDKE 162
           P++     SE    I+ +  I+  KK  +  ++  I     K+ +K  + K+K +   K 
Sbjct: 47  PDLEFYILSESSLIIENE--IESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKV 104

Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKKE 221
           K V+ E ++  +      +E    +KE+ ++  +        D++  K  +++++  + E
Sbjct: 105 KDVEVEIDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESEEAE 164

Query: 222 KLKKK 226
           K+K  
Sbjct: 165 KIKNF 169



 Score = 31.9 bits (73), Expect = 0.43
 Identities = 23/107 (21%), Positives = 51/107 (47%), Gaps = 12/107 (11%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDK-----EKKVDKE-KEKGKKEEKEKKEEKEFKMKED 190
            E   I + + +  K +I+  ++        K+++K+ ++K K+E   K ++ E ++ ED
Sbjct: 55  SESSLIIENEIESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKVKDVEVEIDED 114

Query: 191 KEKPVEKIR------KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
            E     I+      KEE   +++ K +      +  K K++++E E
Sbjct: 115 LESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESE 161



 Score = 28.0 bits (63), Expect = 7.1
 Identities = 22/116 (18%), Positives = 50/116 (43%), Gaps = 20/116 (17%)

Query: 148 KKDKDKIKNKEKDKEKKVDKE-----------------KEKGKKEEKEKKEEKEFKMKED 190
            +    I+N+ + K+K++                    ++K K+E   K ++ E ++ ED
Sbjct: 55  SESSLIIENEIESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKVKDVEVEIDED 114

Query: 191 KEKP---VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
            E     ++++    K+  KEK+    +  +   +  K K+E+E S   +++   L
Sbjct: 115 LESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESEEAEKIKNFL 170


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 32.4 bits (74), Expect = 0.38
 Identities = 23/138 (16%), Positives = 52/138 (37%), Gaps = 2/138 (1%)

Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
           ++KE+ ++EE+++   K      + E+   +   E+ + ++E        K       K 
Sbjct: 180 EKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKP 239

Query: 227 KKEKERSSHEKEVIPKLT-FKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTG 285
               ++S+ ++   P  +  K  +    K +   S  +V K   +P     + +    T 
Sbjct: 240 TSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKK-AAPESTSQSPSSAEPTS 298

Query: 286 DSKQVWICPACGVQDDGS 303
           +S Q     +     D S
Sbjct: 299 ESPQTAGNSSLSSLGDYS 316


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 33.0 bits (76), Expect = 0.38
 Identities = 20/94 (21%), Positives = 41/94 (43%), Gaps = 13/94 (13%)

Query: 173 KKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           +K E+ ++E   ++ ++ +  K  E IR+ +  + K KK+K + K+L  EKL+ +  E+ 
Sbjct: 241 QKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRL--EKLEARLAEER 298

Query: 232 RSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 265
                K                    +   K+V+
Sbjct: 299 PVEEGK----------PLAFRFPPPGKRLGKLVL 322



 Score = 32.6 bits (75), Expect = 0.48
 Identities = 18/106 (16%), Positives = 35/106 (33%), Gaps = 20/106 (18%)

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
           E+K ++  +E    E ++K   ++ KE++   + K +  K KK K    + K+ EK  + 
Sbjct: 240 EQKAERLRQEAAAYEKQQK---ELAKEQEWIRRGKAAASKAKKAKS---RIKRLEKLEAR 293

Query: 235 HEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 280
                            E   +          P        V   +
Sbjct: 294 LA--------------EERPVEEGKPLAFRFPPPGKRLGKLVLEFE 325


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 32.1 bits (73), Expect = 0.39
 Identities = 28/107 (26%), Positives = 51/107 (47%), Gaps = 4/107 (3%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
           D K  +K EKE     + K D +  K   K KEK+  + ++  K +E  +K+++E +  +
Sbjct: 136 DVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKR--RVEDSQKHKEDRRKKQEEKRRND 193

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
           + ++P          S     +KD+  K K++K    ++  E  SHE
Sbjct: 194 EDKRPGGGGGSSGGQS--GLSTKDEPPKEKRQKHHDPERRLEPQSHE 238



 Score = 29.1 bits (65), Expect = 4.3
 Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 1/77 (1%)

Query: 163 KKVDKEKEKGKKEEKEK-KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           KK D E  K   ++KEK + E   K KED+ K  E+ R+ ++D            +    
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLS 212

Query: 222 KLKKKKKEKERSSHEKE 238
              +  KEK +  H+ E
Sbjct: 213 TKDEPPKEKRQKHHDPE 229


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 32.9 bits (75), Expect = 0.39
 Identities = 16/98 (16%), Positives = 45/98 (45%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
             +   K+ +    K+ D+K   + + +E+ +++  D E+E    +E  +++        
Sbjct: 376 SSQRELKQTEAAYCKRLDEKRLFEDEAEEEARQRLADDEEEVRAGDEAREEKIAANSQVI 435

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
           DKE+      + +   +K++  ++K    +++K +  K
Sbjct: 436 DKEEVCNLYDRRDTAWQKQRFLREKQTAFERQKTEHTK 473


>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 535

 Score = 32.7 bits (75), Expect = 0.39
 Identities = 20/88 (22%), Positives = 40/88 (45%), Gaps = 5/88 (5%)

Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
           K  K+  +K   K  E   +  KEFK   +K+K   +   E  + E +K  ++  +    
Sbjct: 446 KGAKIKIQKAL-KSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQET--- 501

Query: 221 EKLKKKKKEKERSSHEKEVIPKLTFKFG 248
            +LK+  + +E   + +++I +    FG
Sbjct: 502 -QLKEATELQEFMINNEDLIEEAKELFG 528



 Score = 30.0 bits (68), Expect = 2.7
 Identities = 16/94 (17%), Positives = 40/94 (42%), Gaps = 8/94 (8%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
           +  KIK +K  K         ++  + + + K   +K++ + +   E   +E K+   E 
Sbjct: 446 KGAKIKIQKALK-------SAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEV 498

Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
              + K++ + ++    ++ L +E  K+    K 
Sbjct: 499 QETQLKEATELQEFMINNEDLIEE-AKELFGIKS 531



 Score = 29.6 bits (67), Expect = 4.1
 Identities = 14/83 (16%), Positives = 35/83 (42%), Gaps = 6/83 (7%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE- 181
           K   IK  K    K  +  ++  K+ K  ++ K  + +   ++ +E+ K   EE ++ + 
Sbjct: 446 KGAKIKIQK--ALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQL 503

Query: 182 --EKEFKMKEDKEKP-VEKIRKE 201
               E +      +  +E+ ++ 
Sbjct: 504 KEATELQEFMINNEDLIEEAKEL 526


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 31.5 bits (72), Expect = 0.41
 Identities = 17/59 (28%), Positives = 37/59 (62%)

Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           KKV K     KKE++E++EE+    + D+E+ ++++ ++E    K +K ++ ++K K+ 
Sbjct: 88  KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146



 Score = 28.8 bits (65), Expect = 3.0
 Identities = 14/65 (21%), Positives = 38/65 (58%), Gaps = 4/65 (6%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
           ++K   + K +K  +++E+ ++++  +++  D++  KE  K K+    +++ + E K+K+
Sbjct: 90  VRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKR----EKRRENERKQKE 145

Query: 181 EEKEF 185
             KE 
Sbjct: 146 ILKEQ 150


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 32.7 bits (74), Expect = 0.41
 Identities = 37/185 (20%), Positives = 67/185 (36%), Gaps = 36/185 (19%)

Query: 30  AGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSP-- 87
           +G    P   P  QHP        P   PPPS      +   + S   +       S   
Sbjct: 441 SGLHSGPPQSPFAQHPFT--SGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGG 498

Query: 88  -------------KEKEYSKVKDVELGVTPM-----VPNIVKVTS--SEDLGEIKK-DKS 126
                        KE+   + ++ E    P       P +V   S  S+     K  D+ 
Sbjct: 499 CAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEPTVVNTPSHASQSARFYKHLDRG 558

Query: 127 IKKDKKTH-----------KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
                +T             KK +E ++K K + ++   + +E++KEK+ ++E+E+ ++ 
Sbjct: 559 YNSCARTDLYFTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREA 618

Query: 176 EKEKK 180
           E+  K
Sbjct: 619 ERAAK 623



 Score = 28.9 bits (64), Expect = 6.4
 Identities = 15/55 (27%), Positives = 37/55 (67%), Gaps = 4/55 (7%)

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK----ERSSHE 236
           K+ + +E+ VEK ++E +   +E++ ++K+K+ ++E+ ++++ E+      SSHE
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHE 630


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 32.6 bits (74), Expect = 0.41
 Identities = 22/77 (28%), Positives = 41/77 (53%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
           ++K  KK+E+E   +   +    + +  E  K+ K D E E  K E   K +E+E +M E
Sbjct: 501 EEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLE 560

Query: 190 DKEKPVEKIRKEEKDSE 206
            + + + K ++ EK++E
Sbjct: 561 KEAQELRKYQESEKETE 577



 Score = 31.8 bits (72), Expect = 0.73
 Identities = 36/128 (28%), Positives = 68/128 (53%), Gaps = 11/128 (8%)

Query: 117 DLGEIKKDKSIKKDK----KTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           DLG++KK+  + + K     + K+K+K+ ++  +K+ K + D   N EK        E++
Sbjct: 449 DLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLA-----EEK 503

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
           K KKEE+E       +    +E+  E +++ ++D E E K  + D KLK+E+ +  +KE 
Sbjct: 504 KRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEA 563

Query: 231 ERSSHEKE 238
           +     +E
Sbjct: 564 QELRKYQE 571


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 32.3 bits (74), Expect = 0.42
 Identities = 26/105 (24%), Positives = 52/105 (49%), Gaps = 5/105 (4%)

Query: 140 EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG-----KKEEKEKKEEKEFKMKEDKEKP 194
           E+   + D+    K K  E ++ K    E E+      +KEE++  E +E   +E  ++ 
Sbjct: 189 EEAILQTDQALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQL 248

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
           +EK+  E +    E++   + K  ++E+L K+  + E  S +KE+
Sbjct: 249 IEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQKEI 293



 Score = 29.6 bits (67), Expect = 3.4
 Identities = 21/92 (22%), Positives = 51/92 (55%), Gaps = 5/92 (5%)

Query: 122 KKDKSIKKDKKTHKKKEKE-KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
            K+K+I+ ++   +  E E ++ ++K K+++  ++ +E+  ++ V +  EK + E ++  
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260

Query: 181 EEKEF----KMKEDKEKPVEKIRKEEKDSEKE 208
            E+E     K++E +E   E  + E +  +KE
Sbjct: 261 AEQERMLEHKLQEQEELLKEGFKTEAESLQKE 292


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 31.7 bits (72), Expect = 0.44
 Identities = 23/102 (22%), Positives = 56/102 (54%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           ++ K  ++K    K  D+ + ++++ ++ V  ++E+ K E +E ++EKE    E ++   
Sbjct: 21  EEAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAE 80

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           ++ ++E    E++++  +  K+ +    K+KK  KE +  EK
Sbjct: 81  QQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKPFKEMNLEEK 122



 Score = 29.4 bits (66), Expect = 2.9
 Identities = 24/92 (26%), Positives = 51/92 (55%), Gaps = 2/92 (2%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE--KKVDKEKEKGKKEEKEK 179
           +   +++K     K  EKE+ K+  D+  K K + ++ + E  +K  +  E  +++  E+
Sbjct: 22  EAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQ 81

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
           +E++E   +E+KE+  E ++++E  S K KK 
Sbjct: 82  QEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKP 113


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 31.8 bits (72), Expect = 0.45
 Identities = 19/59 (32%), Positives = 23/59 (38%), Gaps = 6/59 (10%)

Query: 17  PMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHP----PGTP-PPSKMKDKVSPK 70
           PM P      P V   G  P L P  QHPL P    +P    P    PP + +    P+
Sbjct: 82  PMMP-VPGHHPMVPMTGQQPHLQPPAQHPLQPTYGQNPQPQQPTHTQPPVQPQQPADPQ 139


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 32.3 bits (74), Expect = 0.46
 Identities = 17/61 (27%), Positives = 38/61 (62%), Gaps = 6/61 (9%)

Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE------FKMKEDKEKPVEK 197
           K+K  K  +K++ K +++E++   E E+ ++EE E++++++       K+ E KEK +E 
Sbjct: 6   KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65

Query: 198 I 198
           +
Sbjct: 66  L 66


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 31.4 bits (72), Expect = 0.46
 Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 1/62 (1%)

Query: 142 IKKKKDKKDKDKI-KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
           I   K  K  D   K K+K K+KK + E  + KK++    EE++   K      +  I+K
Sbjct: 68  ISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAIKK 127

Query: 201 EE 202
             
Sbjct: 128 VP 129



 Score = 28.3 bits (64), Expect = 4.6
 Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 2/58 (3%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
           KV  S        D   K+ KK  KKK++ +  ++K K  K  +  + K+ +K VD  
Sbjct: 65  KVDISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKK--KYVVSEERKEDQKAVDAA 120


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 31.1 bits (71), Expect = 0.47
 Identities = 20/88 (22%), Positives = 43/88 (48%), Gaps = 3/88 (3%)

Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK--MKEDKEKPVEKI 198
           KI  ++ +K  + IK  E+  ++      E  ++  + + E  E     K++ +K  E+I
Sbjct: 26  KILDERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEI 85

Query: 199 RKE-EKDSEKEKKSKDKDKKLKKEKLKK 225
             E +KD+E+  +S   + + +KE+   
Sbjct: 86  LAEAQKDAERLLESARAEIEQEKEQALA 113



 Score = 27.6 bits (62), Expect = 6.6
 Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 3/90 (3%)

Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
           KI ++ K+K     KE E+  K+      E E ++ + + +  E I   +K+++K K+  
Sbjct: 26  KILDERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEI 85

Query: 213 DKDKKLKKEKLKKKKK---EKERSSHEKEV 239
             + +   E+L +  +   E+E+     E+
Sbjct: 86  LAEAQKDAERLLESARAEIEQEKEQALAEL 115


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 32.6 bits (74), Expect = 0.49
 Identities = 36/172 (20%), Positives = 48/172 (27%), Gaps = 10/172 (5%)

Query: 51  LVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYS-KVKDVELGVTPMVPNI 109
           L  P        M    S K     K    SK K +  EK+    V D            
Sbjct: 404 LKDPTIIAGKKLMNKLTSEKINNPVKVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELR 463

Query: 110 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD--------- 160
            KV  S+            K KK       + IK KK  K K   K   +D         
Sbjct: 464 KKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKANLQDVGECSSPPN 523

Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
            ++K DK+        K  +   E +      K +E      K     K + 
Sbjct: 524 NKEKNDKQTSTSSSVLKSDRSSIEVRNPNANVKKLEDTTYNAKFPTVSKNNA 575


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 32.5 bits (74), Expect = 0.49
 Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 5/99 (5%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
            +KI  ++D    D+++   K+K   V +   + ++EE+E++EE+  + +   E+PV   
Sbjct: 371 YDKIATEEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGF 430

Query: 199 RKEEKDSEKEKKSKD-----KDKKLKKEKLKKKKKEKER 232
              E                K+ K+  EK+  K+ EKE+
Sbjct: 431 EVPEMPMPAAGGGGGIKIVLKNAKITAEKVIIKRAEKEK 469


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 32.2 bits (73), Expect = 0.50
 Identities = 18/109 (16%), Positives = 43/109 (39%), Gaps = 15/109 (13%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           EI   +++  +    KK+E+  + + ++ + +D   + E     ++ +EKE+ + EE + 
Sbjct: 61  EISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGEL---VELQEEKEQLENEELQY 117

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
             E                 +     E   +S +   +    +L K +K
Sbjct: 118 LREYN------------LFDRNNLQLEDNLQSLELQYEYSLNQLDKLRK 154



 Score = 31.8 bits (72), Expect = 0.62
 Identities = 22/110 (20%), Positives = 49/110 (44%), Gaps = 5/110 (4%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE-KEFKMKE 189
               +  EKE+    K    K + +N E    + +D E ++ KKEE+   +E +E + ++
Sbjct: 35  DSELRDAEKER-DTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKED 93

Query: 190 DK-EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           D  +  + ++++E++  E E+    ++           + E    S E +
Sbjct: 94  DDLDGELVELQEEKEQLENEELQYLRE--YNLFDRNNLQLEDNLQSLELQ 141


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 31.5 bits (72), Expect = 0.50
 Identities = 10/52 (19%), Positives = 31/52 (59%)

Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
           ++     +  +K + +EK+ +K+  + ++ G K   +K E+++ ++ ED+++
Sbjct: 33  LQWLLIDRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKE 84



 Score = 31.5 bits (72), Expect = 0.56
 Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 8/62 (12%)

Query: 165 VDKEKEKGKKEEKE-KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
           +D++ EK +K EKE +K  +E   K   +   +K  K +++        +  K++  + +
Sbjct: 38  IDRKMEKYQKREKEIQKRAREL-RKNGDKLSPKKFEKRQEE------LMEDQKEMMMDMM 90

Query: 224 KK 225
           K 
Sbjct: 91  KP 92


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 32.3 bits (73), Expect = 0.51
 Identities = 27/148 (18%), Positives = 60/148 (40%), Gaps = 3/148 (2%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
            +ED  E K ++    ++K   ++   + K  K   D    K  E ++  +  + +E+  
Sbjct: 155 ETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAA 214

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
             +++  E+      ED +   +    E+++ ++E   + +D    KEK KKK    +  
Sbjct: 215 LGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAE 274

Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRESSP 261
             + E   +   K   + + K + E   
Sbjct: 275 LEDDEPDKEAVKK---EADSKPEEEDEE 299



 Score = 31.5 bits (71), Expect = 1.0
 Identities = 31/137 (22%), Positives = 63/137 (45%), Gaps = 6/137 (4%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDK-----KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK 163
             K +  +   E K DKS   DK     + ++  E  + +++    D+D  ++  +D  +
Sbjct: 171 EEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAALGDEDDFEDYFQDDSE 230

Query: 164 KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
               +++ G  E++E  EE      ED   P EK +K++   + E +  + DK+  K++ 
Sbjct: 231 DGKDDEDFGSGEDEEDDEEGNI-EYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEA 289

Query: 224 KKKKKEKERSSHEKEVI 240
             K +E++    E+E  
Sbjct: 290 DSKPEEEDEEDDEQEDD 306



 Score = 30.7 bits (69), Expect = 1.7
 Identities = 25/124 (20%), Positives = 58/124 (46%), Gaps = 4/124 (3%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKK--DKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
            +D+   ++     +   +  +K K K   D  ++++ E DKE    +   K ++E++E 
Sbjct: 241 GEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEED 300

Query: 180 KEEKEFK-MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            E+++ +  +E  E  ++K++ +E   E       K +    EK + K K++     ++ 
Sbjct: 301 DEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPK-ELSSFEKRQAKLKQQIEQLEKEN 359

Query: 239 VIPK 242
           + PK
Sbjct: 360 LAPK 363



 Score = 30.7 bits (69), Expect = 1.8
 Identities = 24/116 (20%), Positives = 60/116 (51%), Gaps = 2/116 (1%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
             ++ G I+ +      +K  KK   +  + + D+ DK+ +K +   K ++ D+E ++ +
Sbjct: 245 EDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQE 304

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
            ++ E++  +    K   ++PV +    + +S KE  S +K +   K+++++ +KE
Sbjct: 305 DDQDEEEPPEAAMDKVKLDEPVLE--GVDLESPKELSSFEKRQAKLKQQIEQLEKE 358


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 30.1 bits (68), Expect = 0.52
 Identities = 20/66 (30%), Positives = 36/66 (54%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
           +EK  +K DK+D D I +  K ++KK  +  +  + +  +K ++K+ K  E  E P E  
Sbjct: 2   EEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61

Query: 199 RKEEKD 204
           R+  +D
Sbjct: 62  RRRTED 67


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 32.5 bits (75), Expect = 0.52
 Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 2/93 (2%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
           +E I    D   ++   +   + E+  D E E+ +++E +          E  EK +EK 
Sbjct: 167 RELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAA--DESELPEKVLEKF 224

Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           +   K  +K +K+++K  + +  + KK  K +E
Sbjct: 225 KALAKQYKKLRKAQEKKVEGRLAQHKKYAKLRE 257


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 32.4 bits (75), Expect = 0.54
 Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 4/71 (5%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDK---EKPVEKIRKEEKDSEKEKKSKDKDKKL 218
           E  +D E E  + E++  K EKE +  E K   E  V K  +E  + E+EK ++ ++K  
Sbjct: 804 EGLIDVEAELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLA 863

Query: 219 K-KEKLKKKKK 228
           K KE+L + K 
Sbjct: 864 KLKERLARLKA 874


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 31.4 bits (71), Expect = 0.54
 Identities = 23/103 (22%), Positives = 57/103 (55%), Gaps = 1/103 (0%)

Query: 110 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKVDKE 168
           VK+TS +    IK+++ +K +K+   ++E+E+ +++ ++ +  + + +  +D+E++ + E
Sbjct: 26  VKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDE 85

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
           ++    ++ EKK   +       +     I K  K +EK KK+
Sbjct: 86  EDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKT 128



 Score = 29.4 bits (66), Expect = 2.1
 Identities = 26/103 (25%), Positives = 50/103 (48%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
            +KE I K+ +    +K ++ E+++E+  ++ +E    E++E+  E E + +ED+E  V+
Sbjct: 31  WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVD 90

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
               E+K+      S   D          KK EK + + E  V
Sbjct: 91  LKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKTAEDIV 133



 Score = 28.3 bits (63), Expect = 5.8
 Identities = 27/100 (27%), Positives = 52/100 (52%), Gaps = 2/100 (2%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
            KK E      K    DK+ I  + +D   K +K+++  ++EE++++E +E +  ED+E+
Sbjct: 16  EKKDENLLEHVKITSWDKEDIIKENEDV--KDEKQEDDEEEEEEDEEEIEEPEDIEDEEE 73

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
            VE   +EE+D E     KD +KK   +     + +  ++
Sbjct: 74  IVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQN 113


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 32.4 bits (74), Expect = 0.56
 Identities = 23/73 (31%), Positives = 30/73 (41%), Gaps = 11/73 (15%)

Query: 169 KEKGKKEEK-------EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           KE+G  EE        +K  E   KM+E  E+  E    EE D    K   + +    K+
Sbjct: 818 KEEGTYEELSNNGPLFQKLMENAGKMEEYVEENGE----EEDDQTSSKPVANGNANNLKK 873

Query: 222 KLKKKKKEKERSS 234
               KKK KE  S
Sbjct: 874 DSSSKKKSKEGKS 886



 Score = 32.4 bits (74), Expect = 0.62
 Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 4/62 (6%)

Query: 170 EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
           E   K E+  +E  E +  +   KPV         +  +K S  K K  + + +  K++E
Sbjct: 838 ENAGKMEEYVEENGEEEDDQTSSKPVA----NGNANNLKKDSSSKKKSKEGKSVLIKQEE 893

Query: 230 KE 231
           +E
Sbjct: 894 RE 895


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 32.4 bits (73), Expect = 0.58
 Identities = 37/176 (21%), Positives = 81/176 (46%), Gaps = 6/176 (3%)

Query: 76  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
             K+S ++  S  E +  K+   +     +   +V ++   + GE++   +      +++
Sbjct: 365 ASKLSDEDDDSVMESKMQKLFSEKEIDFGLNSELVDMSDDGENGEMEDTFTSHLPA-SNE 423

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
            +  +K++   +K D+   + +E  KE+++ K K+    + K+KKE    K K+ K   +
Sbjct: 424 SESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHA-I 482

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLK----KEKLKKKKKEKERSSHEKEVIPKLTFKF 247
           E+    +++ E  K   + D++L     K  LK +K +K R   +K    +  F F
Sbjct: 483 ERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKLKKKASNLEEGFVF 538


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 32.2 bits (73), Expect = 0.58
 Identities = 24/118 (20%), Positives = 59/118 (50%), Gaps = 7/118 (5%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
           + + L  +  +++   D     K  +E++K+ +   +  K   ++  +  ++++E    +
Sbjct: 211 AEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALR 270

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK--DKKLKKEKLKKKKKE 229
           +E ++ +      MKED E   E++   +   E+ +K ++K  D +L+KEKL+ + K 
Sbjct: 271 EENRKLRS-----MKEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENELKS 323


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 32.0 bits (73), Expect = 0.58
 Identities = 27/106 (25%), Positives = 38/106 (35%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
           +E EK  KK+ K  K     +E  K KK      + K +E   K ++  K +E    P  
Sbjct: 95  RELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPES 154

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 242
                   S        K K+    + KK  K    +S   E  PK
Sbjct: 155 SPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPK 200



 Score = 30.5 bits (69), Expect = 1.7
 Identities = 22/116 (18%), Positives = 38/116 (32%), Gaps = 3/116 (2%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK-- 177
           E +  K  K  K     +E +K KK  D   + K K     ++++   ++     E    
Sbjct: 98  EKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPS 157

Query: 178 -EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
                 +       K K     R ++        S   +K  K +   KK  +K R
Sbjct: 158 STPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMR 213


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 31.6 bits (72), Expect = 0.61
 Identities = 17/74 (22%), Positives = 41/74 (55%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           ++  ++K     ++D++  ++K K   K K  K  K KI+ KEK K +K +K   + +++
Sbjct: 28  DEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEED 87

Query: 176 EKEKKEEKEFKMKE 189
             E +  ++ ++++
Sbjct: 88  TPEDELAEKLRLRK 101



 Score = 29.6 bits (67), Expect = 3.1
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 166 DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKL 223
           D+E+++ K+EEK K   K    K  K K  EK + + +  EK  +  ++D    +  EKL
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97

Query: 224 KKKKKEKE 231
           + +K ++E
Sbjct: 98  RLRKLQEE 105



 Score = 28.1 bits (63), Expect = 9.8
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
           +++E   ++  K +  ++ + K   K K  +KEK K++K+EK     E
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.9 bits (70), Expect = 0.61
 Identities = 16/65 (24%), Positives = 35/65 (53%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           E + ++++   + EEKE  + K+ K ++ ++K  EK+ K++     +  +K   KK   +
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153

Query: 222 KLKKK 226
           K + K
Sbjct: 154 KEEGK 158



 Score = 29.7 bits (67), Expect = 2.0
 Identities = 18/65 (27%), Positives = 36/65 (55%)

Query: 166 DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
           + EKE+ +  ++ +++E     KE KEK  +K+ ++    +  K +K+  KK  K+   K
Sbjct: 94  ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153

Query: 226 KKKEK 230
           K++ K
Sbjct: 154 KEEGK 158



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 20/67 (29%), Positives = 38/67 (56%), Gaps = 1/67 (1%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           E+  + +K+ S + ++K   K +KEK K+KK+KK  +K+  K+  K  K   +K   K  
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEK-KEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 176 EKEKKEE 182
            K+++ +
Sbjct: 152 TKKEEGK 158



 Score = 28.2 bits (63), Expect = 5.2
 Identities = 19/63 (30%), Positives = 33/63 (52%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           +K++KE  K+ ++K+     K K++ KEKKV ++  K K  +  K   K+   K   +K 
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155

Query: 195 VEK 197
             K
Sbjct: 156 EGK 158


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 31.9 bits (73), Expect = 0.61
 Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 6/63 (9%)

Query: 138 EKEKIK-----KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
           E+E  +      K+ + +K+  + ++K  EKK  K  +KGK EE   +E  E + +E  E
Sbjct: 67  EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEP-E 125

Query: 193 KPV 195
            P+
Sbjct: 126 PPL 128



 Score = 30.7 bits (70), Expect = 1.7
 Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 8/56 (14%)

Query: 193 KPVEKI--------RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
           KP+E+I         +E K+ E EK+  + +KK  ++K KK  K+ +      E +
Sbjct: 61  KPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEEL 116


>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
          Length = 650

 Score = 32.1 bits (74), Expect = 0.64
 Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 3/56 (5%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
            E KK K   +      K +K ++   K+++D  +I  + K K+ KV K K+K +K
Sbjct: 195 AEFKKGKKKFEASFYGYKGKKIEL---KNEEDVKEILAELKKKDFKVSKVKKKERK 247


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 30.7 bits (70), Expect = 0.67
 Identities = 19/69 (27%), Positives = 42/69 (60%), Gaps = 1/69 (1%)

Query: 164 KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
            + K   K +KE   +KE+K+ + +  K++ +    + E+  E+ +K K + +K +++K 
Sbjct: 63  DLKKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEK-RTRKNREKKF 121

Query: 224 KKKKKEKER 232
           K+++KEKE+
Sbjct: 122 KRRQKEKEK 130



 Score = 29.2 bits (66), Expect = 1.8
 Identities = 20/58 (34%), Positives = 32/58 (55%)

Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
           +    +EKEK KK+K    K +I  + + +  K   EKEK  ++ +EKK ++  K KE
Sbjct: 72  EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 31.8 bits (72), Expect = 0.68
 Identities = 25/108 (23%), Positives = 42/108 (38%), Gaps = 3/108 (2%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
            KK  K    +   KK   K K   K   K K  K     +K  K   +  ++ +   K+
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
             +K  + ++K  K   K  K+  K  K   +  K K K K+++  + 
Sbjct: 259 ALKKAAKAVKKAAK---KAAKAAAKAAKGAAKATKGKAKAKKKAGKKA 303



 Score = 29.5 bits (66), Expect = 3.6
 Identities = 27/93 (29%), Positives = 33/93 (35%), Gaps = 1/93 (1%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           K  K+  K  K  KK  K K  KK  K  K   K   K   K      +  K + K KK+
Sbjct: 240 KAAKTAAKAAKKAKKTAK-KALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKK 298

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
             +      K K   K  K     +K KK   K
Sbjct: 299 AGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331



 Score = 28.7 bits (64), Expect = 6.9
 Identities = 31/117 (26%), Positives = 42/117 (35%), Gaps = 2/117 (1%)

Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK-GKKEEK 177
            + K        KK  K     K   K   K   K K   K   KK  K  +K  KK  K
Sbjct: 217 AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAK 276

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK-KSKDKDKKLKKEKLKKKKKEKERS 233
              +  +   K  K K   K +  +K +   K K+  K  K   +  K KK  K+R+
Sbjct: 277 AAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKRA 333


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 31.1 bits (71), Expect = 0.70
 Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 2/37 (5%)

Query: 146 KDKKDKDKI--KNKEKDKEKKVDKEKEKGKKEEKEKK 180
           KD K  ++   +  E     K ++  E  KKE K+KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.0 bits (68), Expect = 1.7
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
              K +E+   EE ++    K ++  +  KKE  KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 32.2 bits (73), Expect = 0.71
 Identities = 19/71 (26%), Positives = 26/71 (36%), Gaps = 5/71 (7%)

Query: 10   AVVRQSAPMFPFQFSPFPSVAGPGL-----IPRLPPMLQHPLIPQPLVHPPGTPPPSKMK 64
            A  RQ++P  P   +P    AGP        P  PP    P  P P   P   PP    +
Sbjct: 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785

Query: 65   DKVSPKDKTSP 75
              V+   ++  
Sbjct: 2786 PAVASLSESRE 2796



 Score = 29.9 bits (67), Expect = 4.1
 Identities = 20/103 (19%), Positives = 27/103 (26%), Gaps = 10/103 (9%)

Query: 11   VVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPK 70
            V R + P        F         P  P     P        PP   PP        P 
Sbjct: 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP--------PP 2934

Query: 71   DKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 113
                P+  ++     +   +    V    LG   +VP  V V 
Sbjct: 2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGA--LVPGRVAVP 2975



 Score = 28.8 bits (64), Expect = 7.8
 Identities = 14/69 (20%), Positives = 19/69 (27%), Gaps = 4/69 (5%)

Query: 9    QAVVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVS 68
                + +AP  P    P   +A P +          P  P+    P   PPP        
Sbjct: 2870 SPAAKPAAPARP----PVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925

Query: 69   PKDKTSPKE 77
            P     P  
Sbjct: 2926 PPQPQPPPP 2934


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 31.8 bits (72), Expect = 0.73
 Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 10/100 (10%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK------KEEK 183
           D+K      KEK  + K++   DK K+ + ++E++V ++ E+ KK E E+       E +
Sbjct: 114 DRKDENLSSKEKTLESKEQSLTDKSKHID-EREEQV-EKLEEQKKAELERVAALSQAEAR 171

Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
           E  + E + K   +I    +++E+E   KD+  K+ K+ L
Sbjct: 172 EIILAETENKLTHEIATRIREAERE--VKDRSDKMAKDLL 209



 Score = 31.0 bits (70), Expect = 1.4
 Identities = 30/107 (28%), Positives = 55/107 (51%), Gaps = 4/107 (3%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK--EK 193
           K E++    KK  K + K   KE   E K +  K + + E++ K E +E K  E +  E+
Sbjct: 50  KAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTER 109

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKL--KKEKLKKKKKEKERSSHEKE 238
                RK+E  S KEK  + K++ L  K + + +++++ E+   +K+
Sbjct: 110 ATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKK 156


>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
           repair].
          Length = 480

 Score = 32.0 bits (73), Expect = 0.73
 Identities = 29/129 (22%), Positives = 55/129 (42%), Gaps = 9/129 (6%)

Query: 110 VKVTSSEDLGEIKKDKSIKKDKKT------HKKKEKEKIKKKKDKKDKDKIKNKE-KDKE 162
           V  T +E    +  D  +K  K          K     + +        K++ KE +   
Sbjct: 228 VPATIAEAKELLHADLYLKTLKSDERGSPYGTKVVYGGVGQLWVFLLSYKLQKKEEQTLR 287

Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
            +++KE EK +K  ++ K  +EF  ++D     EKI K+   S +  +   K KK ++E 
Sbjct: 288 TRIEKELEKAEKSLEKLK-GREFNCEKDARIAAEKILKDYS-SVEFLEVDFKSKKKREEA 345

Query: 223 LKKKKKEKE 231
            + + ++ E
Sbjct: 346 KRGRPRKDE 354


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 31.3 bits (71), Expect = 0.73
 Identities = 20/84 (23%), Positives = 39/84 (46%), Gaps = 2/84 (2%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
            K ++ K++K+     +K+K+ K K    K  ++ ++K  E + +   EK   E      
Sbjct: 101 AKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAA 160

Query: 182 EKEFKMKED--KEKPVEKIRKEEK 203
           E+    +E+  +E P E+   EE 
Sbjct: 161 EEAAAAEEEEAEEAPAEEAPAEES 184



 Score = 29.4 bits (66), Expect = 2.9
 Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 2/76 (2%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK--GKKEEKEKKEEKEFKMKEDKEK 193
           K+ K   KK K  K K   K    + EKKV++ + +   +K+  E       +    +E+
Sbjct: 110 KEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169

Query: 194 PVEKIRKEEKDSEKEK 209
             E+   EE  +E+  
Sbjct: 170 EAEEAPAEEAPAEESA 185


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 31.9 bits (73), Expect = 0.74
 Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 20/111 (18%)

Query: 129 KDKKTHKKKEKEKIK-KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE----- 182
           KDK T K++    I         +++I+   K+ E   + E++K +KEE E +       
Sbjct: 480 KDKGTGKEQ---SITITASSGLSEEEIERMVKEAEA--NAEEDKKRKEEIEARNNADSLA 534

Query: 183 -------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKK 225
                  KE   K   E+  EKI K   + ++  K +D ++ K K E+L++
Sbjct: 535 YQAEKTLKEAGDKLPAEE-KEKIEKAVAELKEALKGEDVEEIKAKTEELQQ 584


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 31.8 bits (72), Expect = 0.75
 Identities = 27/158 (17%), Positives = 51/158 (32%), Gaps = 2/158 (1%)

Query: 43  QHPLIPQPLVHPPGTPPPSK-MKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELG 101
           Q P+  +    P  TP            K   +P +  +         K        EL 
Sbjct: 269 QTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPAMYSSDYQAPKPEPIYTWEELL 328

Query: 102 VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 161
                 ++  ++S  D      D    + +K  ++          + +D       + +K
Sbjct: 329 RERFPSDLFAISSLPDSDSEASDSGPTRKRK-RRRVPPLPEYSSDEDEDDSDEDEVDYEK 387

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR 199
           E+K  +E++K     K  +  K   + E  EK V+  R
Sbjct: 388 ERKRRREEDKNFLRLKALELSKYAGVNERMEKIVQVTR 425


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 32.1 bits (73), Expect = 0.77
 Identities = 34/133 (25%), Positives = 59/133 (44%), Gaps = 12/133 (9%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
           IK+   +++DK   + +  +++KK  DK  KD I+    + E   D  +E      K KK
Sbjct: 28  IKELGLLEQDK--ARAESYKELKKIIDKYHKDFIEETLSNVELSFDLLEEYFDLYNKLKK 85

Query: 181 EEKEFKMKEDKEKPVEKIRKE-----EKDSEKEKKSKDKDKKLKKEKLKK--KKKEKERS 233
            +K+   K+D  K  + +RK+     + +  +E       KKL K  L+   K   K+  
Sbjct: 86  FKKD---KKDLAKEQDTLRKQIVKSLQDNLPEEIFKNIFSKKLLKGVLEDFIKNTHKDSE 142

Query: 234 SHEKEVIPKLTFK 246
             +  +I    FK
Sbjct: 143 DEQNILIDIKEFK 155



 Score = 28.3 bits (63), Expect = 9.9
 Identities = 22/124 (17%), Positives = 50/124 (40%), Gaps = 11/124 (8%)

Query: 62  KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
            +K ++ P+   S KEK   K K   K+++Y  ++ +   +   +         E   + 
Sbjct: 380 YIKKQIEPRKGKSKKEKYEEKVKKVVKKQDYLSIQFINDLLEKYI--------KEFDSKD 431

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           +  ++I+           E  +   +  +   IK   +  E+ + KE++  K++E+  K 
Sbjct: 432 RLFENIRDYFAEISLMRNEDFQTNDNIAN---IKTAYQTFERILQKERKDLKQDEEVVKL 488

Query: 182 EKEF 185
            K  
Sbjct: 489 IKSL 492


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 31.0 bits (71), Expect = 0.78
 Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           KK K  KK+K+  +K+ ++      D+  +   + K +  E+  D+E  + ++ E E+K 
Sbjct: 15  KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAER--DRELNRQRQAEAEQKA 72

Query: 182 EK 183
             
Sbjct: 73  IV 74



 Score = 27.6 bits (62), Expect = 9.7
 Identities = 14/63 (22%), Positives = 34/63 (53%), Gaps = 4/63 (6%)

Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
           K K KK  KEK K +K+ ++  ++ + ++K+      E+ + E+ + ++E   + + +  
Sbjct: 14  KKKAKKAKKEKRKQRKQARKGADDGDDELKQA----AEEAKAEKAERDRELNRQRQAEAE 69

Query: 219 KKE 221
           +K 
Sbjct: 70  QKA 72


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.9 bits (72), Expect = 0.80
 Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 3/88 (3%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
             +E ++  + + E  +    E E++ E E K + + E  +   RK E++ E E ++K+ 
Sbjct: 645 TGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEA 704

Query: 215 DKK---LKKEKLKKKKKEKERSSHEKEV 239
           D K     +E   + + E E +  E E+
Sbjct: 705 DHKGETEAEEVEHEGETEAEGTEDEGEI 732



 Score = 31.5 bits (71), Expect = 1.2
 Identities = 19/85 (22%), Positives = 42/85 (49%), Gaps = 2/85 (2%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           K            D  +K +    +   + + + K+D+K  D       D     ++E+E
Sbjct: 812 KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDG--GGGSDGGDSEEEEEE 869

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPV 195
           + ++EE+E++EE+E + +E+ E+P+
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPL 894



 Score = 29.2 bits (65), Expect = 6.2
 Identities = 32/164 (19%), Positives = 71/164 (43%), Gaps = 13/164 (7%)

Query: 77  EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIK--KDKSIKKDKKTH 134
           E++  + +   + K   + +              +    ED GEI+  +D  +K D+   
Sbjct: 739 EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
            K E E   +  +K +       E   E + D  + K +  E+E   E + + K+D++  
Sbjct: 799 GKVEHEGETEAGEKDEH------EGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGV 852

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
                 +  DSE+E++ ++++++ ++E+     +E+E    E E
Sbjct: 853 DGGGGSDGGDSEEEEEEEEEEEEEEEEE-----EEEEEEEEENE 891



 Score = 29.2 bits (65), Expect = 6.5
 Identities = 22/95 (23%), Positives = 53/95 (55%), Gaps = 7/95 (7%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD-KIKNKEKDKEKKVDKEKEK--G 172
           E  GE +  +  + + ++  + +  ++K +  +++ + + + + K  EK VD       G
Sbjct: 802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGG 861

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
             EE+E++EE+E    E++E+  E+  +EE+++E+
Sbjct: 862 DSEEEEEEEEEE----EEEEEEEEEEEEEEEENEE 892


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
           protein involved in ribosome biogenesis.
          Length = 173

 Score = 30.9 bits (70), Expect = 0.83
 Identities = 38/146 (26%), Positives = 66/146 (45%), Gaps = 21/146 (14%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV---------------- 165
           KK K+   + K ++K+ K K +KKK     + I  K  DK+K +                
Sbjct: 3   KKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGLAADPNTA 62

Query: 166 -DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
              +     K   E K+ +    ++DK  P++    E  DS++E++  +  K+L++E  +
Sbjct: 63  ARTDLLIQDKPIPELKKGEARIERDDKGNPLD----EPLDSKEEEEKPEVVKQLEEEASE 118

Query: 225 KKKKEKERSSHEKEVIPKLTFKFGTD 250
             KK +  S  E+E I +L  K G D
Sbjct: 119 PAKKPRRLSKREEEWIERLVEKHGDD 144


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 31.8 bits (72), Expect = 0.85
 Identities = 34/151 (22%), Positives = 56/151 (37%), Gaps = 20/151 (13%)

Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
            K+   D EK + KKE   KK EK  K+K +    V K ++EEK+ +             
Sbjct: 110 GKKTNDDLEKIESKKESINKKNEK--KIKNEASLQV-KTQREEKEEKDFIAD-------C 159

Query: 220 KEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS----------PKIVIKPVK 269
              L KK +E+ +   E     K   K   + E      S            +IV +  +
Sbjct: 160 WRNLYKKNEERFKEYLENFKRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRHNQ 219

Query: 270 SPSPPPVAAADEYDTGDSKQVWICPACGVQD 300
           +P         ++D+ ++  +W     G  D
Sbjct: 220 TPLALLECNLTDFDSIENHSIWEQKIVGSGD 250



 Score = 29.1 bits (65), Expect = 6.5
 Identities = 21/94 (22%), Positives = 45/94 (47%), Gaps = 4/94 (4%)

Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF 185
            I+  K++  KK ++KI  K +   + K + +EK+++  +        K+ +E    KE+
Sbjct: 119 KIESKKESINKKNEKKI--KNEASLQVKTQREEKEEKDFIADCWRNLYKKNEE--RFKEY 174

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
                ++K   KI KE ++++         +K+K
Sbjct: 175 LENFKRKKFKRKILKEFENAKINASEIVGLEKVK 208


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 30.1 bits (68), Expect = 0.93
 Identities = 14/74 (18%), Positives = 44/74 (59%)

Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
            ++E     +++++ E E K++E +++  E++RKE+++  +E++ K  + +  ++K++ +
Sbjct: 24  SQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDE 83

Query: 227 KKEKERSSHEKEVI 240
           K ++    H   + 
Sbjct: 84  KLQETWHEHNLALA 97



 Score = 27.4 bits (61), Expect = 7.8
 Identities = 18/84 (21%), Positives = 44/84 (52%)

Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 205
           K  +++ ++ +KEK + +   K +E+ K+E +E ++EK    +E + K +E  + E+K  
Sbjct: 22  KFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKME 81

Query: 206 EKEKKSKDKDKKLKKEKLKKKKKE 229
           +++ +    +  L      + K +
Sbjct: 82  DEKLQETWHEHNLALANFIRTKTK 105


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 30.7 bits (70), Expect = 0.95
 Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 3/84 (3%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
           K  K++I   +K  E+  +K   +  +E K   E  + K +E+ E+  +K++  EKD + 
Sbjct: 30  KSLKEEIAEMKK-NEEHNEKLMAEIAQENKRLVEPLK-KAEEEVEELRKKLKDYEKDKQS 87

Query: 208 EKKSKDKDKKLKKEKLKKKKKEKE 231
            K  K + K+L+KE LK  K E E
Sbjct: 88  LKNLKARLKELEKE-LKNLKWESE 110



 Score = 28.3 bits (64), Expect = 6.7
 Identities = 21/108 (19%), Positives = 56/108 (51%), Gaps = 12/108 (11%)

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           KS+K++    KK E+       ++K   +I  + K   + + K +E+ ++  K+ K+ ++
Sbjct: 30  KSLKEEIAEMKKNEE------HNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEK 83

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
            K        ++ ++   K+ EKE K+   + ++ +++ +K ++E++ 
Sbjct: 84  DKQS------LKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDE 125


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 30.3 bits (69), Expect = 0.98
 Identities = 21/119 (17%), Positives = 54/119 (45%), Gaps = 11/119 (9%)

Query: 115 SEDLGEIKKDKS-----IKKDKKTHK--KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
           ++DL E ++ K      + + ++  +  +++  +I ++  K+ +   +  + + E+++++
Sbjct: 43  ADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELER 102

Query: 168 EKEKGKKE-EKEKKEEKE---FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
            KE  + E E EK+   E    ++ E      EK+  ++ D   +K   D       E 
Sbjct: 103 IKEAAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKDLIDAFIAELGEN 161



 Score = 28.0 bits (63), Expect = 6.2
 Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 2/86 (2%)

Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
           K  ++ + K      E E+ K+E +    E E +++E +E+   +I ++ K  E E+ ++
Sbjct: 33  KALDERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQ-ASEIIEQAKK-EAEQIAE 90

Query: 213 DKDKKLKKEKLKKKKKEKERSSHEKE 238
           +   + ++E  + K+  +     EKE
Sbjct: 91  EIKAEAEEELERIKEAAEAEIEAEKE 116


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 30.9 bits (70), Expect = 0.99
 Identities = 22/101 (21%), Positives = 56/101 (55%), Gaps = 2/101 (1%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK--EKKEEKEFKMKEDKEK 193
           ++++++ + ++  ++ +  KN  K  +   D   +K K+E+K    KE KE  + E +E 
Sbjct: 109 REQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERLVAEAREH 168

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
               +   +   ++  + K+K++K K ++ K+++KE++R +
Sbjct: 169 FGYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEKRMA 209


>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
          Length = 212

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 6/90 (6%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM-KEDKE 192
             KK K+          K+++      + K       +    ++  K EK   + KE++ 
Sbjct: 127 KGKKPKKGFAGIPADTSKEEVVALPVKQNKN-----SEVIPFQRTPKREKARVITKEERA 181

Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
               +  ++ K + K    K+K K+ K++K
Sbjct: 182 FNAYRTLRQAKLNAKFVGKKEKKKQAKEKK 211


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
           E K+ K+++EF  +E KE      +K     + E K K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKK 195



 Score = 28.9 bits (65), Expect = 3.6
 Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 2/39 (5%)

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
           K+ K+K  E+   EE+       +K      K E  KKK
Sbjct: 160 KDAKQK--EEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 31.0 bits (70), Expect = 1.1
 Identities = 23/103 (22%), Positives = 46/103 (44%), Gaps = 3/103 (2%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           + K D E+ K K EE +K++E+  K  E+ E   E++++  K  E E    ++  K    
Sbjct: 132 DLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPG 191

Query: 222 KLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 264
           ++   KK  +      E+  +      +D+ ++T   +   I 
Sbjct: 192 EVYDLKKRWDELEPGVELPEEELI---SDLVKETLNLAPKDIE 231


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 30.6 bits (70), Expect = 1.1
 Identities = 25/96 (26%), Positives = 53/96 (55%), Gaps = 11/96 (11%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           KKE E +KK+        ++ KE+  + + + E+E  ++  + +++EK    KE+     
Sbjct: 44  KKEAEALKKEA------LLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLD-- 95

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
              RK+E   +KE+  ++K+K+L   + + ++KE+E
Sbjct: 96  ---RKDESLEKKEESLEEKEKELAARQQQLEEKEEE 128



 Score = 28.7 bits (65), Expect = 5.3
 Identities = 26/92 (28%), Positives = 47/92 (51%), Gaps = 9/92 (9%)

Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
           KI + E++ ++ +    E+ KKE +  K+E   + KE+  K   +  +E K+   E + +
Sbjct: 28  KISSAEEEAKQII----EEAKKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQ 83

Query: 213 DKDKKLKKEKLKKK-----KKEKERSSHEKEV 239
           +K    K+E L +K     KKE+     EKE+
Sbjct: 84  EKRLLQKEETLDRKDESLEKKEESLEEKEKEL 115


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 31.4 bits (71), Expect = 1.1
 Identities = 28/101 (27%), Positives = 49/101 (48%), Gaps = 11/101 (10%)

Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
           +EK+  +E EK  KEE E++ + E + + ++EK   +  + +  +E EK         ++
Sbjct: 252 EEKR--RELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEK---------RR 300

Query: 221 EKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSP 261
           EKL+   K+  RS+     I    FK G  ++    R S P
Sbjct: 301 EKLQNLLKKASRSADNVWYIEPSEFKAGDTVKLYYNRSSGP 341


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 31.1 bits (70), Expect = 1.1
 Identities = 40/185 (21%), Positives = 72/185 (38%), Gaps = 24/185 (12%)

Query: 61  SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
           SK++++   K+K  PK    + E+   +    +K K   L                    
Sbjct: 35  SKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAAL-------------------- 74

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI--KNKEKDKEKKVDKEKEKGKKEEKE 178
             K K    ++ T ++K K K K     K K     K K +  E+  ++EK   K +   
Sbjct: 75  -AKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAA 133

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKERSSHEK 237
             + K   + + K +  E++ +EE++++KEK         K K     K+K  E     +
Sbjct: 134 AAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTE 193

Query: 238 EVIPK 242
           EV  +
Sbjct: 194 EVTEE 198


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 31.6 bits (71), Expect = 1.1
 Identities = 53/231 (22%), Positives = 77/231 (33%), Gaps = 14/231 (6%)

Query: 54  PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 113
           PP  P  S +  K     +    E   SKE   P  KE  K  + + G     P   K  
Sbjct: 510 PPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEP--KEGGKPGETKEGEVGKKPGPAKEH 567

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
               +  + K     KD K H K  +E  K K+ +  +   + K          E     
Sbjct: 568 KPSKIPTLSKKPEFPKDPK-HPKDPEEPKKPKRPRSAQRPTRPKSPKL-----PELLDIP 621

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK--E 231
           K  K  +  K  K     ++P    R E     K  K     K     K K+K  +   +
Sbjct: 622 KSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLD 681

Query: 232 RSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP-PPVAAADE 281
            ++  KE     T     +  E   +E+ P+    P  +P P PP    DE
Sbjct: 682 AAAKSKETK---TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDE 729



 Score = 29.3 bits (65), Expect = 6.0
 Identities = 41/243 (16%), Positives = 77/243 (31%), Gaps = 6/243 (2%)

Query: 56  GTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS 115
           G PP +    +    +    KE    KE   P E +  +V           P+ +   S 
Sbjct: 518 GLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSK 577

Query: 116 -----EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
                +D    K  +  KK K+    +   + K  K  +  D  K+ ++ +  K  K   
Sbjct: 578 KPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
             ++    ++ E    +K  K     K   + K  EK            KE       ++
Sbjct: 638 PPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDE 697

Query: 231 ERSSHEKEVIPKLT-FKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQ 289
              S  KE +P+     F T      K     +   +P+  P        + +   + ++
Sbjct: 698 SFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEER 757

Query: 290 VWI 292
            + 
Sbjct: 758 TFF 760


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 30.9 bits (70), Expect = 1.1
 Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 4/69 (5%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK----VDKEKEKGKKEEKEKKEEK 183
           K  KK    K K K+    ++++    K    +K+ K     D ++E  K EE+E++EE+
Sbjct: 125 KAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEE 184

Query: 184 EFKMKEDKE 192
           E +  +D +
Sbjct: 185 EDEDFDDDD 193



 Score = 29.3 bits (66), Expect = 3.6
 Identities = 18/78 (23%), Positives = 33/78 (42%), Gaps = 1/78 (1%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
               K  +K  + K K K        +E   EK    EK+  + E ++  EE E   +E+
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTE-EEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179

Query: 191 KEKPVEKIRKEEKDSEKE 208
           +E+  E    ++ D + +
Sbjct: 180 EEEEEEDEDFDDDDDDDD 197


>gnl|CDD|179204 PRK01005, PRK01005, V-type ATP synthase subunit E; Provisional.
          Length = 207

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 27/112 (24%), Positives = 51/112 (45%), Gaps = 7/112 (6%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
             ++K+K+  D   ++ +K  E++    V   KE+ K+   E +EE E  ++  +E   +
Sbjct: 5   SAQDKLKQICDALREETLKPAEEEAGAIVHNAKEQAKRIIAEAQEEAEKIIRSAEETADQ 64

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK-----ERSSHEKEVIPKL 243
           K+++ E  S   +  K   + LK+    K  +E      E    + EV  KL
Sbjct: 65  KLKQGE--SALVQAGKRSLESLKQAVENKIFRESLGEWLEHVLTDPEVSAKL 114


>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
          Length = 158

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 16/83 (19%), Positives = 31/83 (37%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
            +       E+   K   +++    +   +  KK  ++       N+E   E K+ +E  
Sbjct: 76  SLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESS 135

Query: 171 KGKKEEKEKKEEKEFKMKEDKEK 193
             +  ++ K   K    K DKEK
Sbjct: 136 SDESPKEVKLATKNKTKKHDKEK 158



 Score = 29.3 bits (66), Expect = 2.6
 Identities = 24/102 (23%), Positives = 49/102 (48%), Gaps = 4/102 (3%)

Query: 125 KSIKKDKKTHKKK---EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE-EKEKK 180
           + IK++   +KK+     E +KKK   ++ D +K   +++ K +    +  KK  E++  
Sbjct: 57  EEIKEEALKYKKEFESAVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTI 116

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
                +   ++ K  ++   +E   E +  +K+K KK  KEK
Sbjct: 117 PNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDKEK 158


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 31.3 bits (71), Expect = 1.1
 Identities = 16/49 (32%), Positives = 18/49 (36%), Gaps = 1/49 (2%)

Query: 17  PMFPFQFSPFPSVAGPGLIPRLPPM-LQHPLIPQPLVHPPGTPPPSKMK 64
           P  P Q  P      PGL  ++PP   Q P   Q    P   PPP    
Sbjct: 234 PQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQP 282



 Score = 28.6 bits (64), Expect = 9.3
 Identities = 9/55 (16%), Positives = 12/55 (21%), Gaps = 3/55 (5%)

Query: 9   QAVVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKM 63
            A  +Q  P  P  +   P           P          P   P     P + 
Sbjct: 189 AAFPQQGPPEQPPGYPQPP---QGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQP 240


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 31.4 bits (72), Expect = 1.1
 Identities = 19/73 (26%), Positives = 41/73 (56%)

Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
           ++K+KK K+++KN +    KK+ +  E+ K EE+EK E+         +   E+  +E++
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549

Query: 204 DSEKEKKSKDKDK 216
           + +KE +  + + 
Sbjct: 550 EKDKETEEDEPEG 562



 Score = 29.1 bits (66), Expect = 5.7
 Identities = 23/97 (23%), Positives = 42/97 (43%), Gaps = 21/97 (21%)

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           KEK  KEE          +K  K    +K+++ +++ + E++ KD+   L    +K K K
Sbjct: 492 KEKKMKEE----------LKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDF-VKSKGK 540

Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 265
           E+E    ++E             E +      PKI++
Sbjct: 541 EEEEEEEKEEKDK----------ETEEDEPEGPKIIV 567



 Score = 29.1 bits (66), Expect = 6.2
 Identities = 14/73 (19%), Positives = 39/73 (53%), Gaps = 1/73 (1%)

Query: 155 KNKEKDKEKKVDK-EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
           + KEK  ++++   +    KK ++  +E+K  + ++D++  ++   K +   E+E++ K+
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549

Query: 214 KDKKLKKEKLKKK 226
           +  K  +E   + 
Sbjct: 550 EKDKETEEDEPEG 562


>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
           2-oxoacid:ferredoxin oxidoreductases, beta subunit
           [Energy production and conversion].
          Length = 294

 Score = 31.1 bits (71), Expect = 1.2
 Identities = 12/77 (15%), Positives = 33/77 (42%), Gaps = 3/77 (3%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEK-KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           ++ K  ++       + +    + ++EK K          EE  + EE+   + +   + 
Sbjct: 218 EEAKLAVETGYWPLYRYE--PGKAEEEKGKDIPIGIFYPVEEYLEYEERFKHLTKSNPEL 275

Query: 195 VEKIRKEEKDSEKEKKS 211
           ++K++K+  +  +E K 
Sbjct: 276 IQKLQKDVDERWEELKR 292


>gnl|CDD|216187 pfam00913, Trypan_glycop, Trypanosome variant surface glycoprotein
           (A-type).  The trypanosome parasite expresses these
           proteins to evade the immune response. This family
           includes a variety of surface proteins such as
           Trypanosoma brucei VSGs such as expression site
           associated gene (ESAG) 6 and 7.
          Length = 370

 Score = 31.3 bits (71), Expect = 1.2
 Identities = 17/73 (23%), Positives = 32/73 (43%), Gaps = 2/73 (2%)

Query: 111 KVTSSEDLGEIKK--DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
             T+ +   E+K+   + +   K T  +   EKIKK     D +KIK   +  + +   +
Sbjct: 279 DTTTLKASTELKEALKRLLLNKKDTDDEDAAEKIKKLFGDDDAEKIKKLWEKIDNEQIPK 338

Query: 169 KEKGKKEEKEKKE 181
           K  G  ++ +  E
Sbjct: 339 KIAGTTKKTKLGE 351


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 31.2 bits (70), Expect = 1.2
 Identities = 35/167 (20%), Positives = 71/167 (42%), Gaps = 17/167 (10%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHK---KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
           K+   E L E K   + K +K   K    + K  I        ++ + NK   + +K   
Sbjct: 139 KMNRREKLRERKALNAAKFEKNIEKELMLRLKSGIYGDTPLNVREHLWNKAATEREKRQD 198

Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI----------RKEEKDSEKEKKSKDKDKK 217
           EKE+  +EE+E   E E    + +++  +K             E  +SE+E+ S+ +  +
Sbjct: 199 EKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSESESDE 258

Query: 218 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 264
            + E  K K ++++    +K   P +  ++    E++ + E  P + 
Sbjct: 259 DEDEDNKGKIRKRKTDDAKKSRKPHIHIEY----EQERENEKIPAVQ 301


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 31.2 bits (71), Expect = 1.2
 Identities = 16/103 (15%), Positives = 38/103 (36%), Gaps = 7/103 (6%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK-EKPVE 196
           EK +  +K+ +++ ++++    + E         G K    + + +   +       P E
Sbjct: 511 EKLEEVRKELEEELEEVEKLLDEVEL--LTGANSGGKTSLLELKAQIVVLAHMGLPVPAE 568

Query: 197 KIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           + + E   +  E K  K K   L     +   K+K+     + 
Sbjct: 569 EAKVE---AVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEA 608



 Score = 30.8 bits (70), Expect = 1.5
 Identities = 24/115 (20%), Positives = 50/115 (43%), Gaps = 10/115 (8%)

Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE----KKEEKEFKMK--EDK 191
           E+ K +  ++K+  +++  K ++  K++++E E+ +K   E           K    E K
Sbjct: 493 EEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELK 552

Query: 192 EKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEK--LKKKKKEKERSSHEKEVIPK 242
            + V            E + ++ D+ K LKK+K  L     E      + +V+P+
Sbjct: 553 AQIVVLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPE 607


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 30.3 bits (69), Expect = 1.2
 Identities = 28/143 (19%), Positives = 59/143 (41%), Gaps = 28/143 (19%)

Query: 89  EKEYSKVKDVELGVTPM-VPNIVKVTSSEDLGEIKKDK--------SIKKDKKTHKKKEK 139
           EK   KVK    G++ M V  +++    + L  ++ +K        S         K   
Sbjct: 18  EKLAPKVK----GISSMSVKEVLQSLVDDGL--VRTEKIGTSNYYWSFPSQALNKLKTRL 71

Query: 140 EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR 199
           EK+KK+ ++  +                E +   ++ K+ +EE E + +  +E    +  
Sbjct: 72  EKLKKELEELKQRI-------------AELQAQIEKLKKGREETEERTELLEELKQLEKE 118

Query: 200 KEEKDSEKEKKSKDKDKKLKKEK 222
            ++  +E EK  K+  ++++K K
Sbjct: 119 LKKLKAELEKYEKNDPERIEKLK 141


>gnl|CDD|112704 pfam03904, DUF334, Domain of unknown function (DUF334).
           Staphylococcus aureus plasmid proteins with no
           characterized function.
          Length = 229

 Score = 30.8 bits (69), Expect = 1.2
 Identities = 28/110 (25%), Positives = 54/110 (49%), Gaps = 5/110 (4%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
           I  D+  KK+++    +   K+     +K +  ++N+E  + KK +K   K   E KE +
Sbjct: 6   IMSDEDEKKNEENMNMQMNSKLTGTHSQKIQKSLENEELQELKKQNKLIIKYIAEIKENQ 65

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKE 229
           + +E ++K  K     ++++  +D  +K KK  +    L +EKLK    E
Sbjct: 66  DIREKELKAIK----SELKEATEDFQDKGKKIHNDFVDLLQEKLKHVDAE 111


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 29.7 bits (67), Expect = 1.2
 Identities = 26/102 (25%), Positives = 50/102 (49%), Gaps = 10/102 (9%)

Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK--EKKEEKEFKMKE 189
                KE++K  K  ++K+  K K      E    +++E+   EE+  +K+ EKE     
Sbjct: 26  SIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEFA 85

Query: 190 DKEKPV--------EKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
           DKEK V        E+ RK E++ ++ ++ ++++   K + L
Sbjct: 86  DKEKFVTSAYKKQLEENRKLEEEEKEREELEEENDVTKGKDL 127


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 2/79 (2%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
            +  D   E  +K+   + E ++F          +    +++  EK K   + +++LK  
Sbjct: 37  AEAEDAAAELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEVNERLK-- 94

Query: 222 KLKKKKKEKERSSHEKEVI 240
            L+KKK E +    E+E I
Sbjct: 95  ALEKKKAELKDLEEEREGI 113


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 30.4 bits (69), Expect = 1.3
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKVDKEKEKGKK 174
           KK + +KK +   K+ E  + KKK +  +++K K  K+  K+KK+ K    G K
Sbjct: 151 KKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 19/67 (28%), Positives = 40/67 (59%), Gaps = 10/67 (14%)

Query: 175 EEKEKKEEKEFKMKEDK-----EKPVEKIRK-----EEKDSEKEKKSKDKDKKLKKEKLK 224
           + KE+ ++K +K  ++K      K VE+++K     +E ++ ++KK  +  ++ KK+K K
Sbjct: 128 DVKEEGDDKFYKYSDEKLLAWLLKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKK 187

Query: 225 KKKKEKE 231
           K  K+K+
Sbjct: 188 KSAKKKK 194


>gnl|CDD|162081 TIGR00874, talAB, transaldolase.  This family includes the majority
           of known and predicted transaldolase sequences,
           including E. coli TalA and TalB. It excluded two other
           families. The first includes E. coli transaldolase-like
           protein TalC. The second family includes the putative
           transaldolases of Helicobacter pylori and Mycobacterium
           tuberculosis [Energy metabolism, Pentose phosphate
           pathway].
          Length = 317

 Score = 30.9 bits (70), Expect = 1.3
 Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 5/61 (8%)

Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK--MKEDK---EKPVEKIRKEEKDSE 206
           D++K      E+K+D E  K   ++    +E EF+    ED    EK  E IRK   D E
Sbjct: 248 DELKESTGPVERKLDPESAKKVDKQPIILDESEFRFLHNEDAMATEKLAEGIRKFAADQE 307

Query: 207 K 207
           K
Sbjct: 308 K 308


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 29.9 bits (67), Expect = 1.3
 Identities = 22/101 (21%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR-------KEEKD 204
            K    ++   ++ D+E ++  +E +E+++ K  KM+E++E   + IR       KEE +
Sbjct: 22  GKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDKYGIKKKEEDE 81

Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 245
            E +  +++++ +L ++K   ++   E    ++E   K  F
Sbjct: 82  EEPQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEEEKSGF 122


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 9/70 (12%), Positives = 31/70 (44%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
           K K + ++ +++ +K +   +++K  +  +  + K    ++  + +K   +     D  +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166

Query: 217 KLKKEKLKKK 226
               + +K K
Sbjct: 167 LTVGQAVKVK 176


>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
           recombination, and repair].
          Length = 1041

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 17/88 (19%), Positives = 27/88 (30%), Gaps = 12/88 (13%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK---------EDKEKPVEKIRKEEKDS 205
           KN     E      KEK  + +    EE+  K            D  K  ++   + K  
Sbjct: 5   KNFRVPSEYGFVSTKEKRVERKAATAEEESLKNIDISDLFERSVDMNKDPKEATDQVKAV 64

Query: 206 EKEKKSKDKDKKLKKEKLKKKKKEKERS 233
               K  + +      +L  +   KE S
Sbjct: 65  ---DKHSEIESLNVGMRLDVRLSSKELS 89


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 30.4 bits (69), Expect = 1.4
 Identities = 15/50 (30%), Positives = 18/50 (36%)

Query: 39  PPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPK 88
           P  +  P  P      P  PP  K  +K  PK K   K +     K  PK
Sbjct: 87  PATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPK 136


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 30/129 (23%), Positives = 56/129 (43%), Gaps = 13/129 (10%)

Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEK----- 203
           K  IK   KD    +++  +  K   K   E K++  K+ +D  K +EKI   EK     
Sbjct: 204 KVTIKTDAKDWRSHLEQMHQLKKAISKSLPETKQYLKKLSQDLSKALEKISSREKHLNNQ 263

Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEK---ERSSHEKEVIPKLTFKFGTDMEEKTKRES- 259
             +  ++ ++  + L + + K  +  +   E +    E+  +L  +   +MEE+    S 
Sbjct: 264 LEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEISEELE-QVKQEMEERGASMSD 322

Query: 260 -SPKIVIKP 267
            SP + IK 
Sbjct: 323 GSPLVKIKQ 331


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 30.9 bits (71), Expect = 1.4
 Identities = 12/46 (26%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
             +K  K +       K     K  +K   K K++ V  EK  GK 
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK-DGKA 858


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 15/52 (28%), Positives = 31/52 (59%), Gaps = 1/52 (1%)

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
            ++E+ EK +EK++++       K   ++DKK +  + ++KKKEK+  S  +
Sbjct: 341 SLREELEKRIEKLKEKPPKPPT-KAKPERDKKERPGRYRRKKKEKKAKSERR 391



 Score = 28.9 bits (65), Expect = 5.9
 Identities = 11/60 (18%), Positives = 32/60 (53%), Gaps = 1/60 (1%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
             + +   ++++ +K+ + K+K K      K   E++K ++  + ++++KE K K ++  
Sbjct: 334 SGEPDGISLREELEKRIE-KLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392



 Score = 28.9 bits (65), Expect = 6.3
 Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
            + D I  +E+  EK+++K KEK  K   + K E+    KE   +   K ++++  SE+ 
Sbjct: 335 GEPDGISLREE-LEKRIEKLKEKPPKPPTKAKPER--DKKERPGRYRRKKKEKKAKSERR 391

Query: 209 KKSK 212
               
Sbjct: 392 GLQN 395



 Score = 28.5 bits (64), Expect = 6.8
 Identities = 13/53 (24%), Positives = 26/53 (49%), Gaps = 1/53 (1%)

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKDKDKKLKKEKLKKKKK 228
           +E+ E++  K+KE   KP  K + E    E+  +  + K +K  K + +  + 
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 28.5 bits (64), Expect = 7.0
 Identities = 11/48 (22%), Positives = 24/48 (50%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           K+ +K  +K  K   K K ++  K++     + K++K  K + +G + 
Sbjct: 348 KRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 30.7 bits (69), Expect = 1.5
 Identities = 13/95 (13%), Positives = 32/95 (33%), Gaps = 7/95 (7%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
            +   T ++   + I    ++        K  + E      K   +K +  +K+    + 
Sbjct: 181 GEAPITTRRIGVDGISLSLEET-------KASEVEHLSASLKTATEKVDALEKDLHAAQA 233

Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
           K D  + + K   + K +E  K     +     ++
Sbjct: 234 KLDSGQALTKEELDAKKAELSKALAALEAANAADE 268


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 31.1 bits (71), Expect = 1.5
 Identities = 21/67 (31%), Positives = 36/67 (53%)

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
            + E +K EE +   +E  +  +EK   E +  EKEKK  D+  KLK+ ++ + +KE E 
Sbjct: 393 VRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEA 452

Query: 233 SSHEKEV 239
              E ++
Sbjct: 453 EVDEDDI 459


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 30.8 bits (69), Expect = 1.5
 Identities = 23/120 (19%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN---KEKDKEKKVDKEKEKG 172
           E L E   D   ++D+K + + +  + + K   +D  ++K      K ++ ++   KE+ 
Sbjct: 180 EPLDEDYPDDMEERDRKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQE 239

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           K+ +K +K E+E   +      ++   + +  +E E ++      +KK+  +  KK  + 
Sbjct: 240 KEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKM 299


>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
           [Transcription].
          Length = 507

 Score = 30.8 bits (69), Expect = 1.6
 Identities = 26/166 (15%), Positives = 59/166 (35%), Gaps = 6/166 (3%)

Query: 59  PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDL 118
           P       ++  D      K           K+ +K+         +  N    TS+   
Sbjct: 135 PKKIRLGSITENDMNLKTFKRHRVLGKPSSAKKPAKISPPTAMTDSLDRNFSSETSTSRE 194

Query: 119 GEIKKDKSIKKDKKTHKK-KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
            +  ++  I K K   KK ++ E  K   D+++    +  + +   ++ +  EK K  +K
Sbjct: 195 ADENENYVISKVKDIPKKVRDGESAKYFIDEENFTMAELCKPNFPIQISENFEKSKMAKK 254

Query: 178 EKKEE----KEFKMK-EDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
            K E+    K  +     +   + K   +  +  ++++S+ K    
Sbjct: 255 AKLEKRRHVKFLEGSNTHEMDQLLKHFLDNSNFRQDRRSRKKKASA 300


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 27/81 (33%), Positives = 48/81 (59%), Gaps = 5/81 (6%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKD 215
             + ++  +  EK+K K  +K++K  KE K ++++EK ++K+ +  E   E  KK K K 
Sbjct: 159 ISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKK 218

Query: 216 KKLKKEK----LKKKKKEKER 232
           KK+ K+K    + K KKE++R
Sbjct: 219 KKIVKDKDGKVVYKWKKERKR 239


>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein.  This family consists of several
           eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
           expression of functional F1-ATPase requires two proteins
           encoded by the ATP11 and ATP12 genes. Atp11p is a
           molecular chaperone of the mitochondrial matrix that
           participates in the biogenesis pathway to form F1, the
           catalytic unit of the ATP synthase.
          Length = 250

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 28/113 (24%), Positives = 48/113 (42%), Gaps = 15/113 (13%)

Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE------EKEFKMKEDKEKPVEKIRKEE 202
           K + K+  K K+   +  +  +K  K++ EKKE            +  K K ++  +  +
Sbjct: 3   KYRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLKPPKPAD 62

Query: 203 KDSEKEKKSKDKDKKLKKEKLKKK-KKEKE---RSSHEKE-----VIPKLTFK 246
           K    +K  K     L  EK+K    +E E   R+ H K+     VIP  T++
Sbjct: 63  KKKPFDKPFKPLSSILDVEKIKDLSAEEIEFIWRAYHAKKDSLCAVIPAETYE 115


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 30.0 bits (68), Expect = 1.7
 Identities = 40/130 (30%), Positives = 71/130 (54%), Gaps = 3/130 (2%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
           + +DL   KK K  K D K   KK + + KK ++  +    + +EK+K  K   + E  K
Sbjct: 77  AKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVK 136

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK---LKKEKLKKKKKEK 230
            ++ EK  +K  K KE ++K  +K  KE K+  ++KK++ + K+   LKK K  KK K+K
Sbjct: 137 VKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKK 196

Query: 231 ERSSHEKEVI 240
           +++  +  ++
Sbjct: 197 KKAKKKGRIL 206


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 17/90 (18%), Positives = 33/90 (36%), Gaps = 7/90 (7%)

Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
                  I    +I   K+   KK+ EK KK+K  K  D++    ++    V  E     
Sbjct: 50  EWITADLINLGAAISIPKR---KKQTEKGKKEKKPKVSDRMDLDNEN----VQLEMLYSI 102

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
             E+E ++ +    K        +++   +
Sbjct: 103 SNEREIRQLRFGGSKVQNPHEGARVKNINE 132


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 16/53 (30%), Positives = 27/53 (50%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           KK H        K KK KK K K+ +K+   +K   K+++   K +K ++ E+
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294



 Score = 28.5 bits (64), Expect = 6.5
 Identities = 28/105 (26%), Positives = 51/105 (48%), Gaps = 22/105 (20%)

Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKN-------KEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           K  K+++ +K KK+  K  K  +         K+     + +  K K KK++K+K   K+
Sbjct: 210 KQLKEEDPDKYKKQFSKYIKAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKK 269

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKK 228
           +K K              K + K++K++ K KK + +E+L+KK K
Sbjct: 270 YKTK--------------KLTGKQRKARVKAKKAQRRERLQKKIK 300


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 25/107 (23%), Positives = 58/107 (54%), Gaps = 3/107 (2%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKE---KKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
           K+++++I  K  +  K+  + KEK  E        +  + + E  EKK++      E++ 
Sbjct: 75  KEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEER 134

Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
           + V+KI++  K+ E  KK+ ++++KLK+ K +  + +K+     +++
Sbjct: 135 ELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKI 181



 Score = 29.6 bits (67), Expect = 3.1
 Identities = 21/106 (19%), Positives = 59/106 (55%), Gaps = 4/106 (3%)

Query: 130 DKKTHKKKEKEKIKKKKDK--KDKDKIKNKEKDKEKKVDKEKEKGK--KEEKEKKEEKEF 185
            K+   K+E E++K+K+D+  K+  ++  K  +   KV + +EK +  +EE+++  E+  
Sbjct: 13  LKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEVQ 72

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
           ++KE +++   K+++  K+  + K+ +++     +     +++ + 
Sbjct: 73  ELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIER 118


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 30.6 bits (69), Expect = 1.8
 Identities = 21/87 (24%), Positives = 36/87 (41%), Gaps = 2/87 (2%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
           K+   ++        +KG K++K K    +    E      E   K+ K  + +KK +DK
Sbjct: 402 KSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGK--KNQKKGRDK 459

Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIP 241
             K+  +     KKE  +S  +   IP
Sbjct: 460 SSKVPSDSKAGGKKESVKSQEDNNNIP 486



 Score = 29.4 bits (66), Expect = 4.7
 Identities = 28/114 (24%), Positives = 42/114 (36%), Gaps = 10/114 (8%)

Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
            SS  L    +      +         +  KKKK K    K    E   + + D  K KG
Sbjct: 391 ASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPK-KG 449

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
           KK +K K  +K  K+  D +   +K        E  K  +D +    +E + KK
Sbjct: 450 KKNQK-KGRDKSSKVPSDSKAGGKK--------ESVKSQEDNNNIPPEEWVMKK 494


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 30.6 bits (69), Expect = 1.9
 Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 13/140 (9%)

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKK--DKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
           DL +IK   +  KDK    ++ K + K  K  D   K             +D E  + + 
Sbjct: 530 DLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIETNRSRS 589

Query: 175 EEKEKK--------EEKEFKMKEDKEKPVEKIRK--EEKDSEKEKKSKDKDKKLKKEKLK 224
            E +K+        +E E    +DK    + IR+   E ++   K ++ ++ K+  EKL+
Sbjct: 590 NEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLR 649

Query: 225 KKKKE-KERSSHEKEVIPKL 243
            K    K++ +    +IP L
Sbjct: 650 GKIDNYKKQIAEIDSIIPDL 669



 Score = 30.3 bits (68), Expect = 2.5
 Identities = 35/164 (21%), Positives = 60/164 (36%), Gaps = 14/164 (8%)

Query: 62  KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKV-----TSSE 116
           +M   +S  D    K+ +    + +  E+ Y K+KDV   +   + NI  +     +S+ 
Sbjct: 139 EMDSLIS-GDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNL 197

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
           +L  IKK   I  D+K+H    KE       ++   +  N   D         E    E+
Sbjct: 198 ELENIKK--QIADDEKSHSITLKEI------ERLSIEYNNAMDDYNNLKSALNELSSLED 249

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
            + + E E K  E       +     K+ E+       D   K 
Sbjct: 250 MKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKN 293


>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
          Length = 499

 Score = 30.4 bits (68), Expect = 1.9
 Identities = 14/53 (26%), Positives = 21/53 (39%)

Query: 25  PFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKE 77
             P +A    +P  P  +Q   +  P V PP   PP   + +V P    +  E
Sbjct: 412 LLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQPPHVQQPRVLPSTDGASNE 464


>gnl|CDD|152940 pfam12506, DUF3713, Protein of unknown function (DUF3713).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 92 and 1225 amino
           acids in length. There is a single completely conserved
           residue S that may be functionally important.
          Length = 116

 Score = 29.0 bits (65), Expect = 1.9
 Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 2/53 (3%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
           I K+K  +++  T   K   K+K K    D DK+K+K+   E K  +  EK K
Sbjct: 10  ITKNKVGQEETNTSSDKSTNKLKTKDA--DVDKLKDKKTKVETKAKELVEKLK 60


>gnl|CDD|184294 PRK13743, PRK13743, conjugal transfer protein TrbF; Provisional.
          Length = 141

 Score = 29.4 bits (66), Expect = 1.9
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
            K ++   MKE K     KIR+ ++D + E++  D 
Sbjct: 4   YKYQRPLSMKEKKNDAELKIRQTKRDYDYERRVSDI 39


>gnl|CDD|222959 PHA02999, PHA02999, Hypothetical protein; Provisional.
          Length = 382

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 19/72 (26%), Positives = 31/72 (43%), Gaps = 6/72 (8%)

Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
           D + N  K +    +   +  KK+ K KK+ K+ +   D       IR    D E  KK 
Sbjct: 82  DSVVNLTKKQISSEENTMKMDKKKYKLKKKIKKVREINDT------IRFAHYDYETIKKG 135

Query: 212 KDKDKKLKKEKL 223
             K KK+ ++ +
Sbjct: 136 YIKFKKMIRKDI 147


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 30.4 bits (68), Expect = 2.0
 Identities = 25/123 (20%), Positives = 57/123 (46%), Gaps = 1/123 (0%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           E+  +    +    +  THK K  E    +   +       KE +K K+  +E     +E
Sbjct: 159 EEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEE 218

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
            K+K+EE+   ++E++++  ++   + K  E+E+K + K++  ++     +K++K     
Sbjct: 219 LKKKREERRKVLEEEEQRRKQE-EADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDG 277

Query: 236 EKE 238
             E
Sbjct: 278 LSE 280


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 30.5 bits (68), Expect = 2.0
 Identities = 18/70 (25%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 158  EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
            E  ++ ++     K + +++E+KE  E   +  KE+ + K  KE +   ++    D  K+
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRA-KEEEIGKKEKEREQRIRKTI-HDNYKE 1067

Query: 218  LKKEKLKKKK 227
            + K++LKKK+
Sbjct: 1068 MAKKRLKKKR 1077



 Score = 29.7 bits (66), Expect = 4.0
 Identities = 35/173 (20%), Positives = 72/173 (41%), Gaps = 7/173 (4%)

Query: 66  KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 125
           K+   +  SP+E +   +  S K  E   V   E      V  +   + S +  ++ + +
Sbjct: 503 KIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSKVANESISSNHEKLMESE 562

Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF 185
             +  KK         + + K +  KD   +  + +E+ + ++ EKG  E+ E +E    
Sbjct: 563 FEELKKKW------SSLAQLKSRFQKDATLDSIEGEEE-LIQDDEKGNFEDLEDEENSSD 615

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
              E+          EE   E + +++ ++   KKE+L+   + +ER   EK+
Sbjct: 616 NEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKK 668



 Score = 28.9 bits (64), Expect = 6.8
 Identities = 14/54 (25%), Positives = 29/54 (53%)

Query: 178  EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
            E +E+ E K +  KE+  ++  KE  +S +  K ++  KK K+ + + +K   +
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 30.5 bits (69), Expect = 2.1
 Identities = 33/186 (17%), Positives = 64/186 (34%), Gaps = 23/186 (12%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
              S   +      +S K+DK  ++K              K    NK+ +  K     + 
Sbjct: 150 NKNSDSSIKNDTDTQSSKQDKADNQKAP-------SSNNTKPSTSNKQPNSPKPTQPNQS 202

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
             +    +   +K         K  + +     DS  ++ S+D  KK +K+   + KK+K
Sbjct: 203 NSQPASDDTANQK------SSSKDNQSMSDSALDSILDQYSEDA-KKTQKDYASQSKKDK 255

Query: 231 ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS-------PSPPPVAAADEYD 283
             +S+ K   P+L  +     + K  +     +     +S       PS      +  ++
Sbjct: 256 TETSNTKN--PQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDSGSFN 313

Query: 284 TGDSKQ 289
             DSK 
Sbjct: 314 VVDSKD 319


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 30.4 bits (69), Expect = 2.1
 Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 8/90 (8%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
            K      +KK K   + IK   K KE      K++    ++++   K            
Sbjct: 336 LKIANDWARKKVKTAYEAIKENRKRKELA--LAKKQSSTTKRKENLPKWS------NPEY 387

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
           +K   +E+ ++ E    ++ K+L+  K K+
Sbjct: 388 KKTTSQEEQAKLEFIKLEELKRLENGKQKR 417



 Score = 28.5 bits (64), Expect = 8.2
 Identities = 14/61 (22%), Positives = 31/61 (50%), Gaps = 1/61 (1%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
             +K+ ++ K K++   K KE + K  + + K     +E+ K + +  E+ K  +  K+K
Sbjct: 358 RKRKELALAK-KQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQK 416

Query: 180 K 180
           +
Sbjct: 417 R 417


>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family.  This protein
           is plasmid encoded and found to be essential for plasmid
           replication.
          Length = 279

 Score = 30.0 bits (68), Expect = 2.1
 Identities = 24/102 (23%), Positives = 44/102 (43%), Gaps = 13/102 (12%)

Query: 132 KTHKKKEKEKIKKKKDKKDKDKI-------KNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
            + +K++ E   KK  KK    +       + KE   E+      E+    +K ++  K 
Sbjct: 169 NSARKQQLEWENKKLKKKGLIPLTLDEARRRAKEFHIERAFSYRTERKAFGKKRRRARKL 228

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
            K+ E        IRK+  ++  ++ SK +  K+  + LKKK
Sbjct: 229 AKLDEK------DIRKKILNALVKEYSKGELTKMGLKGLKKK 264


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 30.1 bits (68), Expect = 2.2
 Identities = 12/70 (17%), Positives = 26/70 (37%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
            + E E  +++     + +   +    + K   + EK  K+ K K + K     + K +P
Sbjct: 56  PQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQP 115

Query: 195 VEKIRKEEKD 204
             K    +  
Sbjct: 116 KPKKPPSKTA 125



 Score = 29.8 bits (67), Expect = 2.3
 Identities = 11/63 (17%), Positives = 22/63 (34%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           +++ K   + +   +       ++   EKK  K K K K + K K + K     +     
Sbjct: 64  EEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSK 123

Query: 195 VEK 197
              
Sbjct: 124 TAA 126



 Score = 29.8 bits (67), Expect = 2.6
 Identities = 16/79 (20%), Positives = 28/79 (35%), Gaps = 3/79 (3%)

Query: 156 NKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 215
             + + E   ++ K   + E   +    + K K   EK  +K + + K   K K    K 
Sbjct: 55  EPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP---KV 111

Query: 216 KKLKKEKLKKKKKEKERSS 234
           K   K K    K   +  +
Sbjct: 112 KPQPKPKKPPSKTAAKAPA 130


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
           Provisional.
          Length = 2722

 Score = 30.6 bits (69), Expect = 2.2
 Identities = 28/133 (21%), Positives = 55/133 (41%), Gaps = 6/133 (4%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK-EKKVDKEKEK-GK 173
           + + ++K   SI K        EK+ +  K+ K+  D    ++ D+  + ++  KEK   
Sbjct: 457 KSINQLKTLISIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNI 516

Query: 174 KEEKEKKEEKEF----KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
                K  E  +     +K + E  +E I+   +  E   K +   + +K +   K K  
Sbjct: 517 VNNNFKNIEDYYITIEGLKNEIEGLIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYI 576

Query: 230 KERSSHEKEVIPK 242
           +E   H K++I  
Sbjct: 577 EENVDHIKDIISL 589



 Score = 29.8 bits (67), Expect = 3.4
 Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 16/96 (16%)

Query: 140  EKIKKKKDKKDKDKIK---NKEKDKEKKVDKEKE-----KGKKEE------KEKKEEKEF 185
              IKK K+ K K++IK    K +   KK+D+ K      K K  E      KEK ++ E 
Sbjct: 1065 IDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEH 1124

Query: 186  KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
              K  K+K +EKI K+ + + KE ++ + +     E
Sbjct: 1125 YNK--KKKSLEKIYKQMEKTLKELENMNLEDITLNE 1158



 Score = 29.4 bits (66), Expect = 4.6
 Identities = 41/167 (24%), Positives = 67/167 (40%), Gaps = 18/167 (10%)

Query: 114  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD-----KDKIKNKEKDKEKKVDKE 168
             S  L E+ K      D+   +  EKE+ ++ KD ++     K +  N  K  + K D  
Sbjct: 1963 VSNKLSELNKITC--NDESYDEILEKEEYEELKDLRNSFNQEKAETLNNLKLNKIKEDFN 2020

Query: 169  KEKGKKEEKEKK------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
              K   +E EK        E   K+ E+K+  ++ I    +D EKE +S +       E 
Sbjct: 2021 SYKNLLDELEKSVKTLKASENIKKIVENKKTSIDAINTNIEDIEKEIESIN---PSLDEL 2077

Query: 223  LKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
            LKK  K +   S    +I  +  K   D +     E   +I +  +K
Sbjct: 2078 LKKGHKIE--ISRYTSIIDNVQTKISNDSKNINDIEKKAQIYLAYIK 2122


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 30.4 bits (69), Expect = 2.3
 Identities = 15/77 (19%), Positives = 36/77 (46%), Gaps = 1/77 (1%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
            KS+++ K++  K+    I ++  +K  ++ +   ++ +++V KE  K  K+  E     
Sbjct: 449 LKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCEN-QEEVKKEVPKKVKKIPEVDTYL 507

Query: 184 EFKMKEDKEKPVEKIRK 200
             +     E+  EK+  
Sbjct: 508 LLEELGINEETYEKLEA 524


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 13/57 (22%), Positives = 15/57 (26%), Gaps = 13/57 (22%)

Query: 20  PFQFSPFPSVAGPGLIP-------------RLPPMLQHPLIPQPLVHPPGTPPPSKM 63
             +  PF  V GP L+                     HP    P    PG   P  M
Sbjct: 85  SSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGAGQPPGM 141


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 30.4 bits (69), Expect = 2.3
 Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 25/111 (22%)

Query: 144 KKKDKKD--KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK-----MKEDKEKPVE 196
           +KK KK+  K+ I    K K  K +++K K + E+  ++ + +FK     ++  K  P  
Sbjct: 171 RKKSKKEVMKEVI---AKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKP 227

Query: 197 KIRKEEKDSE---------KEKKSKDKDK------KLKKEKLKKKKKEKER 232
            +  EEKD E          +++++  D+        K+E  + KK E ER
Sbjct: 228 PMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERLKKLEAER 278



 Score = 28.4 bits (64), Expect = 9.0
 Identities = 29/126 (23%), Positives = 63/126 (50%), Gaps = 21/126 (16%)

Query: 131 KKTHKKKEKEKIKKKKDKK-DKDKIKNKEKDKEKKVDKE-----------KEKGKKEEKE 178
           KK+ K+  KE I K K  K ++ K K +++D  +++D +           K   K     
Sbjct: 172 KKSKKEVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTP 231

Query: 179 KKEEKEFKMK------EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
           ++++ E+  +      + + +P ++ + EE   E  K+  ++ KKL+ E+L++ + E+E 
Sbjct: 232 EEKDDEYDQRVRELTFDRRAQPTDRTKTEE---ELAKEEAERLKKLEAERLRRMRGEEED 288

Query: 233 SSHEKE 238
              E++
Sbjct: 289 DEEEED 294


>gnl|CDD|227923 COG5636, COG5636, Uncharacterized conserved protein, contains
           Zn-ribbon-like motif [Function unknown].
          Length = 284

 Score = 30.1 bits (67), Expect = 2.3
 Identities = 33/170 (19%), Positives = 74/170 (43%), Gaps = 5/170 (2%)

Query: 58  PPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSED 117
            P  K K  +    K    +K SS     PK+ ++S+ + +        P   K+T+   
Sbjct: 70  IPLKKKKTNLQSGSKLPTFKKASSSTSNLPKKADHSR-QPIVKETDSFKPPSFKMTTEPK 128

Query: 118 LGEIKKDKSIKKDK--KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           +  +  D    K++  K  + K +  ++K  D +   + +  ++D E K  +  E    +
Sbjct: 129 VYRVVDDLMEDKEELRKLLRSKAQYMMRKDTDPRTIMEDELLDEDSEDKAIQRSECPPSK 188

Query: 176 EKEKKEEKEFK--MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
            K+K+  K+    +KE++E  + + R+++     E +  + D  +  +K+
Sbjct: 189 TKKKRACKDCTCGLKEEEENEIVRTRQDKVVKFTEDELTEIDFTIDGKKV 238


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 5/73 (6%)

Query: 183 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKERSSHEKEVIP 241
           K+F   +  E   E      +  + +KK + + KK K  E+LK  +K+ ++   EK    
Sbjct: 66  KDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEK---- 121

Query: 242 KLTFKFGTDMEEK 254
           +L+    T + E+
Sbjct: 122 QLSSSLQTKLTEE 134


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 30.2 bits (68), Expect = 2.4
 Identities = 21/73 (28%), Positives = 41/73 (56%), Gaps = 3/73 (4%)

Query: 152 DKIKNKEKDKEKKVDK-EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE-- 208
             ++  EK   K+  + E+E  K+ EK  KEE+    +   E+ ++++ ++E++  KE  
Sbjct: 368 KDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELS 427

Query: 209 KKSKDKDKKLKKE 221
           K+ K+  +KLK E
Sbjct: 428 KEEKELLEKLKME 440



 Score = 29.4 bits (66), Expect = 3.9
 Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 4/70 (5%)

Query: 175 EEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           EE EK   KE +  E++ EK VEK+ KEE+   K    ++  K+L++++       KE S
Sbjct: 371 EEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEE---NFLKELS 427

Query: 234 SHEKEVIPKL 243
             EKE++ KL
Sbjct: 428 KEEKELLEKL 437


>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
           lipoprotein.  This model represents a family of six
           predicted lipoproteins from a region of about 20
           tandemly arranged genes in the Treponema denticola
           genome. Two other neighboring genes share the
           lipoprotein signal peptide region but do not show more
           extensive homology. The function of this locus is
           unknown.
          Length = 412

 Score = 30.3 bits (68), Expect = 2.4
 Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 3/64 (4%)

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           KE  +   +E + EK  KM+E   K  E + K E ++  +   + K+   K   +  K +
Sbjct: 23  KEDKESGLREMRVEKFSKMEE---KENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGE 79

Query: 229 EKER 232
             + 
Sbjct: 80  SDKF 83


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 29.6 bits (67), Expect = 2.4
 Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 1/72 (1%)

Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
            +KK+K +++      K+   +KK +K+K K   +  E    KE++  E +   ++++ K
Sbjct: 3   LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFV-KEYRKAEREIIRLKRLAK 61

Query: 201 EEKDSEKEKKSK 212
           +        + K
Sbjct: 62  KPGKFYVPAEHK 73



 Score = 29.3 bits (66), Expect = 4.0
 Identities = 20/88 (22%), Positives = 35/88 (39%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
           K +  +++      K+ +  KK   KK K   K  E   ++    E+E  + +   KK  
Sbjct: 5   KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPG 64

Query: 183 KEFKMKEDKEKPVEKIRKEEKDSEKEKK 210
           K +   E K   V +I+       K +K
Sbjct: 65  KFYVPAEHKLVFVIRIKGINGIPPKPRK 92


>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; Very large protein
           containing McrA/HNH-nuclease related domain and a
           RuvC-like nuclease domain; signature gene for type II.
          Length = 799

 Score = 30.5 bits (69), Expect = 2.5
 Identities = 30/132 (22%), Positives = 56/132 (42%), Gaps = 25/132 (18%)

Query: 118 LGEIKKDKSIKKDKKTH--KKKEKEKI----KKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
           L E+   + I++  +T    K+EK+++     KK     K K K     +E+     + +
Sbjct: 292 LNELNNLRIIEEQGETKILSKEEKQELLDLLFKKNKLTYKQKRKLLGLKEEEIFKGLRYE 351

Query: 172 GKKEEKEKK-------------EEKEFKMKEDKEKPVEKIRK-----EEKDSEKEKKSKD 213
           G K EK                 ++  K  E  EK +++I K     ++++  ++     
Sbjct: 352 GLKAEKNFNISLKTYHDLRKALGKEFLKDLELNEKILDEIVKILTLYKDREMIEKILELY 411

Query: 214 KDKKLKKEKLKK 225
           KD  L +E+LKK
Sbjct: 412 KD-LLNEEQLKK 422


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 30.2 bits (68), Expect = 2.5
 Identities = 27/107 (25%), Positives = 52/107 (48%), Gaps = 6/107 (5%)

Query: 126 SIKKDKKTHKKKEKE--KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE---KK 180
           S + +KK   K  +   KI KK D K+  +  N + + E K ++       E+K    K 
Sbjct: 31  SKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKN 90

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKK 226
           +E+  K K++ +KP +K + ++ ++    +  + K      +KLKK+
Sbjct: 91  KEEIEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDKLKKE 137



 Score = 29.0 bits (65), Expect = 5.1
 Identities = 22/103 (21%), Positives = 41/103 (39%), Gaps = 3/103 (2%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI---KNKEKDKEKKVDKEKEKGKKEEKE 178
           +K   IK ++ T K  +K   K+  +  +       K++E       +K+ +  K +E+ 
Sbjct: 35  EKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEI 94

Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           +K + E K  + K +  +            K       KLKKE
Sbjct: 95  EKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDKLKKE 137


>gnl|CDD|224162 COG1241, MCM2, Predicted ATPase involved in replication control,
           Cdc46/Mcm family [DNA replication, recombination, and
           repair].
          Length = 682

 Score = 30.4 bits (69), Expect = 2.5
 Identities = 15/52 (28%), Positives = 27/52 (51%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
           K K K+ K +K  D IK   +  E  V++  E+ +   +++ EE   K+K+ 
Sbjct: 617 KSKSKRDKIEKVLDIIKELVERSEDPVEEIIEEAEGISEKEVEEALEKLKKK 668


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 30.4 bits (68), Expect = 2.5
 Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 2/106 (1%)

Query: 127 IKKDKKTH--KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           +K+D+  +   ++ KE IKK  +    D     EK  EK+V  E E+  K+  E  EE +
Sbjct: 175 LKEDQMEYLEPRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDEEDTKKADEDGEEPK 234

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            +  ++ ++  +K  K+ K+  KE + ++K K L     K   KE+
Sbjct: 235 VEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKPLWTRDPKDVTKEE 280


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 33/183 (18%), Positives = 70/183 (38%), Gaps = 24/183 (13%)

Query: 77  EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK 136
            +  S+E T      Y  +KD E     +    +  T+S         +S+K+ K   + 
Sbjct: 26  TEDCSEESTDSWSDMYEGLKDSESSSNSVPSLSLSSTASSLSDSSTYSRSLKEVKLERQA 85

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE---DKEK 193
           +E  +       K   + K  +K  E+K  +E+EK ++E + ++   + K +E    K +
Sbjct: 86  QEAYENWLSA--KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQ 143

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKK-------------------EKLKKKKKEKERSS 234
              K R  +   E  + +        K                   +KLK++++++E   
Sbjct: 144 QAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEER 203

Query: 235 HEK 237
            ++
Sbjct: 204 RKQ 206


>gnl|CDD|130844 TIGR01784, T_den_put_tspse, conserved hypothetical protein
           (putative transposase or invertase).  Several lines of
           evidence suggest that members of this family (loaded as
           a fragment mode model to find part-length matches) are
           associated with transposition, inversion, or
           recombination. Members are found in small numbers of
           genomes, but in large copy numbers in many of those
           species, including over 30 full length and fragmentary
           members in Treponema denticola. The strongest
           similarities are usually within rather than between
           species. PSI-BLAST shows similarity to proteins
           designated as possible transposases, DNA invertases
           (resolvases), and recombinases. In the oral pathogenic
           spirochete Treponema denticola, full-length members are
           often found near transporters or other membrane
           proteins. This family includes members of the putative
           transposase family pfam04754.
          Length = 270

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 20/111 (18%), Positives = 41/111 (36%), Gaps = 4/111 (3%)

Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDS 205
            K K  I  +  D   +     +    +       +    +  E++ K +    KE +  
Sbjct: 150 PKLKAIIDERLLDVLDRWLLLLKGKDNQSVNLIFLQIPFIQKAEEEIKTLLLSSKELELY 209

Query: 206 EKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 256
           E+ +K    D    +E+ K++  E+     + E   KL  K G  +E+  +
Sbjct: 210 EQREKYM-TDAISAEEEGKEEGIEEGILEAKLETAKKL-LKNGLTLEQVAE 258


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 30.0 bits (67), Expect = 2.6
 Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 1/87 (1%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           DK  K+  K H   +  K  ++ D+++ D   +K K  +   D+E E   +   E  ++ 
Sbjct: 215 DKPTKEPVKKHSDVKDPKEDEELDEEEHDSAMDKVK-LDLFADEEDEPNAEGVGEASDKN 273

Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKK 210
               ++ + +  E+I + EK+    K+
Sbjct: 274 LSSFEKQQIEMDEQIEELEKELVAPKE 300


>gnl|CDD|217318 pfam03011, PFEMP, PFEMP DBL domain.  PfEMP1 (Plasmodium falciparum
           erythrocyte membrane protein) has been identified as the
           rosetting ligand of the malaria parasite P. falciparum.
           Rosetting is the adhesion of infected erythrocytes with
           uninfected erythrocytes in the vasculature of the
           infected organ, and is associated with severe malaria.
           PfEMP1 interacts with Complement Receptor One on
           uninfected erythrocytes to form rosettes.
          Length = 150

 Score = 29.2 bits (66), Expect = 2.6
 Identities = 23/106 (21%), Positives = 44/106 (41%), Gaps = 3/106 (2%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE--EKEFKMKEDKEK 193
            K ++K+K   +       K   K K + V+K  EK KKE ++ KE   K++K+ + +++
Sbjct: 14  IKWRKKLKHCINNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKEHFNKQYKIDDGRDE 73

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK-KKEKERSSHEKE 238
                   E    +    K      + +K+K+  K     +S    
Sbjct: 74  YNVLESFLEDLLFQIDIKKAYGDAKELKKIKELLKCNGCNNSANSA 119


>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus.  This family contains the N
           terminal region of several LUC7 protein homologues and
           only contains eukaryotic proteins. LUC7 has been shown
           to be a U1 snRNA associated protein with a role in
           splice site recognition. The family also contains human
           and mouse LUC7 like (LUC7L) proteins and human cisplatin
           resistance-associated overexpressed protein (CROP).
          Length = 252

 Score = 29.7 bits (67), Expect = 2.6
 Identities = 25/86 (29%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE---KEKKSKDKDK 216
           D ++K+DK K++ +  ++E+ +       E+  +  E+I K   ++E   +E K  +  K
Sbjct: 94  DCDRKIDKAKQRLELTQEEQTKIAADSKAEELAELDEEIGKLLAEAEALGEEGKVDEAMK 153

Query: 217 KLKK-EKLKKKKKEKERSSHEKEVIP 241
            +K+ E+LK KKKE E S   +   P
Sbjct: 154 LMKEVEELKAKKKELEDSDEVRNAAP 179


>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein.  This is a family
           of proteins which regulate checkpoint kinases. In
           Schizosaccharomyces pombe this protein is called Rad26
           and in Saccharomyces cerevisiae it is called LCD1.
          Length = 648

 Score = 30.2 bits (68), Expect = 2.7
 Identities = 24/105 (22%), Positives = 49/105 (46%), Gaps = 2/105 (1%)

Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK-KEEKEKKE-EKEFKMKEDKEKPVE 196
           ++K+   + +K +++ K K +  E K   ++E  K K+E +  E E++F + E +     
Sbjct: 3   RDKLDMLQQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSAN 62

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 241
            +R E        K+ D      + +    K++K   S  KE++P
Sbjct: 63  DLRTELSPPSNLLKTSDASHIADESQPNSIKQKKREISPVKELVP 107


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 30.3 bits (68), Expect = 2.7
 Identities = 26/161 (16%), Positives = 65/161 (40%), Gaps = 4/161 (2%)

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDK---IKNKEKDKEKKVDKEKEKGKKEEKEKK 180
           ++ I K        E+E+ +K+++++ +       +  K      +      ++ E+E+ 
Sbjct: 385 EEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEES 444

Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE-KLKKKKKEKERSSHEKEV 239
            E+E + +E++E+  ++  +EE + E+E++  + D   ++E +   +         E   
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAE 504

Query: 240 IPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 280
                    + M E  +   S      P + P  P    A+
Sbjct: 505 RRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAE 545


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 30.2 bits (68), Expect = 2.7
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 211 SKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
           +K+K+KK+    L KK KEKE  S  + +
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCI 359


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 26/137 (18%), Positives = 57/137 (41%), Gaps = 5/137 (3%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED----- 190
           ++E E  K +  ++ + KI+   + KEK+ + EK+      K +   K    +E+     
Sbjct: 18  EEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNEARLKVLNAREELLDSV 77

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 250
            E+  E++    +D ++ K       K    KL + K        ++E++     +   +
Sbjct: 78  FEEAKERLANLSEDKDEYKDLLKDLIKQALVKLGEPKVIVRSREEDEELVKSALDEAKEE 137

Query: 251 MEEKTKRESSPKIVIKP 267
            +EKT +++   I    
Sbjct: 138 YKEKTGKDTVETIGDNI 154


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 29.9 bits (68), Expect = 2.8
 Identities = 24/135 (17%), Positives = 59/135 (43%), Gaps = 21/135 (15%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
              + KE++K+ +++  +  ++ + +  +K + K  +K + +   +K E+  K+ E KEK
Sbjct: 328 DFPELKEELKELEEELKE--LEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEK 385

Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH---EKEVIPKLTFKFGTD 250
             E++ + E+               + ++LK++ +           K + P +    G  
Sbjct: 386 LSEELEELEE---------------ELKELKEELESLYSEGKISVNKTIYPGVKIHIGNK 430

Query: 251 MEEKTKRESSPKIVI 265
           +  + KRE  P   +
Sbjct: 431 V-LRIKREIGPCTFV 444


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 29.9 bits (66), Expect = 2.8
 Identities = 34/175 (19%), Positives = 79/175 (45%), Gaps = 6/175 (3%)

Query: 81  SKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKE 140
           SK+     +  + K          ++         +   E+ + K   ++ +    K   
Sbjct: 90  SKQAEHENQINWFKSAIGLTAAAALLLAACFADDQDKKIELAQAKKEAENARDRANKSGI 149

Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
           ++++++ K +++K K    +KE       +   ++EK+K E+++ K +++K+K      K
Sbjct: 150 ELEQEEQKTEQEKQKT---EKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANK 206

Query: 201 E--EKDSEKEKKSKDKDKKLKKEK-LKKKKKEKERSSHEKEVIPKLTFKFGTDME 252
              E + EK+K   +K   +K++K   K+ ++  + +H +  I KL  K G  +E
Sbjct: 207 NAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQENHNQFFIKKLGIKAGIAIE 261


>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 424

 Score = 29.9 bits (68), Expect = 2.8
 Identities = 15/88 (17%), Positives = 36/88 (40%), Gaps = 5/88 (5%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV-----DKEKEKGKKEE 176
            +        +  K+K  + I   +   +K   +   +D   +      + +KE  K  E
Sbjct: 54  DETPKKNPVLREEKRKPAKSILSLQALLEKRPSRTNSQDALLQAENALPEWKKELEKPSE 113

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
            +++E K     +  +K ++++R E K+
Sbjct: 114 PKEEEPKAAAESKVVQKELDELRDELKE 141


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
           to archaeal Holliday junction resolvase [Nucleotide
           transport and metabolism].
          Length = 175

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 14/51 (27%), Positives = 34/51 (66%)

Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 201
           + K+++K ++ E+ + K + +    E + ++E+E+K+KE  EK +E+ R++
Sbjct: 28  QGKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARED 78


>gnl|CDD|191973 pfam08232, Striatin, Striatin family.  Striatin is an intracellular
           protein which has a caveolin-binding motif, a
           coiled-coil structure, a calmodulin-binding site, and a
           WD (pfam00400) repeat domain. It acts as a scaffold
           protein and is involved in signalling pathways.
          Length = 133

 Score = 28.9 bits (65), Expect = 2.8
 Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 10/77 (12%)

Query: 119 GEIKKDKSIKKDKKTH-------KKKEKEKIKKKK---DKKDKDKIKNKEKDKEKKVDKE 168
           GE +  +++K+D K          K+E+ K+KK K   D   K   + +E++ E+  D+E
Sbjct: 39  GERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQKPSKEEEEEELEEDDDEE 98

Query: 169 KEKGKKEEKEKKEEKEF 185
               K +E E K+ + +
Sbjct: 99  SNSEKVDESELKKSRLY 115


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 30.2 bits (69), Expect = 2.9
 Identities = 24/79 (30%), Positives = 40/79 (50%), Gaps = 15/79 (18%)

Query: 158 EKDKEKKVDKEKEKGKKE----EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
           EK   KKV ++ EK ++E    E+ K  +KE    +D +  VE++R+           K 
Sbjct: 211 EKRIRKKVKEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELRE-----------KI 259

Query: 214 KDKKLKKEKLKKKKKEKER 232
           +  KL KE  +K +KE ++
Sbjct: 260 EKLKLPKEAKEKAEKELKK 278


>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
           The proteins in this family are poorly characterized,
           but an investigation has indicated that the immediate
           early protein is required the down-regulation of MHC
           class I expression in dendritic cells. Human herpesvirus
           6 immediate early protein is also referred to as U90.
          Length = 993

 Score = 30.0 bits (67), Expect = 2.9
 Identities = 25/128 (19%), Positives = 47/128 (36%), Gaps = 5/128 (3%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           +++  I    S  +    H     E     K  K   ++   +   ++   K+    +K 
Sbjct: 600 DEIDYIASVDSGNRTNNIHMNNTNENTPFSKSGKSPPEVTPSKTFYKRDKKKDISTNRKV 659

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKD-----SEKEKKSKDKDKKLKKEKLKKKKKEK 230
           +K   + K    K DK K ++             S + +  +D    +KK +LKKK K +
Sbjct: 660 KKRTAKRKTVGYKTDKSKKIKSDSLPTDTNVIVISSESEDEEDGFNIIKKSQLKKKIKSE 719

Query: 231 ERSSHEKE 238
            +S    E
Sbjct: 720 LKSESSSE 727


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
           represents a subfamily of RNA splicing factors including
           the Pad-1 protein (N. crassa), CAPER (M. musculus) and
           CC1.3 (H.sapiens). These proteins are characterized by
           an N-terminal arginine-rich, low complexity domain
           followed by three (or in the case of 4 H. sapiens
           paralogs, two) RNA recognition domains (rrm: pfam00706).
           These splicing factors are closely related to the U2AF
           splicing factor family (TIGR01642). A homologous gene
           from Plasmodium falciparum was identified in the course
           of the analysis of that genome at TIGR and was included
           in the seed.
          Length = 457

 Score = 29.9 bits (67), Expect = 2.9
 Identities = 6/82 (7%), Positives = 30/82 (36%), Gaps = 4/82 (4%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
           ++ + + DK   + + +    + +D+  + + ++  + +  +  + +   +         
Sbjct: 11  RNDTRRSDKGRERSRRRS---RSRDRSRRRRDRDYYRGRRGR-SRSRSPNRYYRPRGDRS 66

Query: 183 KEFKMKEDKEKPVEKIRKEEKD 204
                +       E + + E+D
Sbjct: 67  YRRDDRRSGRNTKEPLTEAERD 88


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
           CCCH-type Zn-finger protein [General function prediction
           only].
          Length = 299

 Score = 29.7 bits (66), Expect = 2.9
 Identities = 26/139 (18%), Positives = 58/139 (41%), Gaps = 1/139 (0%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKD 215
             K   KK  +  +K  ++ +++ E+K F +K       V+ I K+ +    +++ + K+
Sbjct: 2   PPKKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKE 61

Query: 216 KKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPP 275
           K   +EK ++ +K+  R+  + + +    F   T  +    + +  K   +  + P    
Sbjct: 62  KMRMEEKRREPEKQVIRAGVDPKTVVCALFLNKTCAKGDACKFAHGKEEARKTEKPDLYS 121

Query: 276 VAAADEYDTGDSKQVWICP 294
                E D    K+ WI  
Sbjct: 122 DVRDKEEDVPLGKRPWINT 140


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 30.0 bits (68), Expect = 2.9
 Identities = 13/73 (17%), Positives = 31/73 (42%), Gaps = 1/73 (1%)

Query: 166 DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
           + + E    +     EE +    E+ ++  E+  K E D + +    D D   ++  L++
Sbjct: 109 ELQDEPRYDDAYRDLEEDD-DDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLER 167

Query: 226 KKKEKERSSHEKE 238
           +++ +E      E
Sbjct: 168 RRRRREWEEKRAE 180


>gnl|CDD|201540 pfam00992, Troponin, Troponin.  Troponin (Tn) contains three
           subunits, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). this Pfam contains members of
           the TnT subunit. Troponin is a complex of three
           proteins, Ca2+ binding (TnC), inhibitory (TnI), and
           tropomyosin binding (TnT). The troponin complex
           regulates Ca++ induced muscle contraction. This family
           includes troponin T and troponin I. Troponin I binds to
           actin and troponin T binds to tropomyosin.
          Length = 131

 Score = 28.7 bits (65), Expect = 2.9
 Identities = 25/98 (25%), Positives = 44/98 (44%), Gaps = 11/98 (11%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
           +   +KKE+EK K   ++    ++     + ++   K   +  + ++E+          D
Sbjct: 17  EFEQRKKEEEKEKYLAERCPPLRLSLSRAELQELCKKLHARIDRLDEERY---------D 67

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
            E+ V K  KE +D   +KK  D   K KK  LKK +K
Sbjct: 68  IEEKVAKKDKEIED--LKKKVNDLRGKFKKPTLKKVRK 103


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 29.6 bits (67), Expect = 3.0
 Identities = 25/96 (26%), Positives = 43/96 (44%), Gaps = 4/96 (4%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
            KK  E+ +K+++ K   K   K+   EK  ++ KEK    EK KK +K+ K   D+   
Sbjct: 130 AKKASEEARKQRELK---KF-GKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGDL 185

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            + +  +      +KK +       K   K+K K+ 
Sbjct: 186 FDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDA 221


>gnl|CDD|183377 PRK11910, PRK11910, amidase; Provisional.
          Length = 615

 Score = 30.0 bits (67), Expect = 3.0
 Identities = 21/104 (20%), Positives = 42/104 (40%), Gaps = 5/104 (4%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
           + T+SE+   + K    KK     K+K K+        K+K+    ++    K++  E+ 
Sbjct: 29  QTTTSENKSAVSK----KKKPTVKKEKPKQSSNNLTLGKNKENFHLEKGFGNKQLQVERI 84

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPV-EKIRKEEKDSEKEKKSKD 213
             +  +   K   E K+K        + I+  +    K +K +D
Sbjct: 85  IDRIFQSSLKNRTEIKVKPKNNPQKKQNIKPVKPIPSKPEKPED 128


>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV.  Bacterial DNA
           topoisomerase IV, GyrA, ParC.
          Length = 444

 Score = 29.8 bits (68), Expect = 3.1
 Identities = 20/87 (22%), Positives = 32/87 (36%), Gaps = 20/87 (22%)

Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK---------- 216
           KE   GK E +     +   +       +  + +  KD  KE K K  ++          
Sbjct: 343 KEYLLGKLEAERLHILEGLFIALSIIDEIIVLIRSSKDLAKEAKEKLMERFELSEIQADA 402

Query: 217 ----------KLKKEKLKKKKKEKERS 233
                     KL+ EKL+K+ KE E+ 
Sbjct: 403 ILDMRLRRLTKLEVEKLEKELKELEKE 429


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 29.6 bits (66), Expect = 3.1
 Identities = 29/125 (23%), Positives = 58/125 (46%), Gaps = 1/125 (0%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           +   E   +K  ++D +     EK ++ +     D D+  ++    +  VD E+E+  +E
Sbjct: 125 KSDVEEPSEKVDEEDVEREILAEKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEE 184

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
              ++EE++ ++  D      +   EE++ E E  S D +  +  E  +  KK+ E    
Sbjct: 185 SDGREEEEDEEVGSDSYGEGNRELNEEEEEEAE-GSDDGEDVVDYEGERIDKKQGEEEEM 243

Query: 236 EKEVI 240
           E+EVI
Sbjct: 244 EEEVI 248


>gnl|CDD|173720 cd05631, STKc_GRK4, Catalytic domain of the Protein
           Serine/Threonine Kinase, G protein-coupled Receptor
           Kinase 4.  Serine/Threonine Kinases (STKs), G
           protein-coupled Receptor Kinase (GRK) subfamily, GRK4
           isoform, catalytic (c) domain. STKs catalyze the
           transfer of the gamma-phosphoryl group from ATP to
           serine/threonine residues on protein substrates. The GRK
           subfamily is part of a larger superfamily that includes
           the catalytic domains of other protein STKs, protein
           tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. GRKs phosphorylate and regulate G
           protein-coupled receptors (GPCRs), the largest
           superfamily of cell surface receptors which regulate
           some part of nearly all physiological functions.
           Phosphorylated GPCRs bind to arrestins, which prevents
           further G protein signaling despite the presence of
           activating ligand. There are seven types of GRKs, named
           GRK1 to GRK7. GRK4 has a limited tissue distribution. It
           is mainly found in the testis, but is also present in
           the cerebellum and kidney. It is expressed as multiple
           splice variants with different domain architectures. It
           is post-translationally palmitoylated and localized in
           the membrane. GRK4 polymorphisms are associated with
           hypertension and salt sensitivity, as they cause
           hyperphosphorylation, desensitization, and
           internalization of the dopamine 1 (D1) receptor while
           increasing the expression of the angiotensin II type 1
           receptor. GRK4 plays a crucial role in the D1 receptor
           regulation of sodium excretion and blood pressure.
          Length = 285

 Score = 29.6 bits (66), Expect = 3.2
 Identities = 14/46 (30%), Positives = 30/46 (65%)

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
           K++E+ K+EE + ++KED+E+  EK  ++ K   +   +K+  ++L
Sbjct: 203 KRKERVKREEVDRRVKEDQEEYSEKFSEDAKSICRMLLTKNPKERL 248


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 29.8 bits (67), Expect = 3.2
 Identities = 19/84 (22%), Positives = 36/84 (42%)

Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
            + E+E + E +FK  E+       I++++ ++E + + K   K L++    K  K K  
Sbjct: 356 SELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNS 415

Query: 233 SSHEKEVIPKLTFKFGTDMEEKTK 256
           S   K            ++ EKT 
Sbjct: 416 SFINKTENILTNSPLKDELLEKTT 439



 Score = 29.0 bits (65), Expect = 5.9
 Identities = 25/107 (23%), Positives = 39/107 (36%), Gaps = 1/107 (0%)

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
                E+     K K I+++   +   +++KI+ + D + K   KN E+  E K +K K 
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKN 414

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
                + E          E  EK  E I   E   E E      D  
Sbjct: 415 SSFINKTENILTNSPLKDELLEKTTEIINI-ENPQEFEFGQIGNDII 460


>gnl|CDD|234655 PRK00133, metG, methionyl-tRNA synthetase; Reviewed.
          Length = 673

 Score = 29.7 bits (68), Expect = 3.2
 Identities = 7/37 (18%), Positives = 13/37 (35%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
           +K+++   E  K+    K           +E   E I
Sbjct: 532 DKQIEALIEASKEAAAAKAAAAAAAAPLAEEPIAETI 568


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 29.0 bits (65), Expect = 3.3
 Identities = 23/67 (34%), Positives = 33/67 (49%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
           KKT KKK+    KKK  KK K K K KE+  E   ++  +  + EE +KK + E    ++
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQN 107

Query: 191 KEKPVEK 197
               V  
Sbjct: 108 TAAEVNH 114


>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
           All proteins in this family for whcih functions are
           known are G-T or G-U mismatch glycosylases that function
           in base excision repair. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University). Used 2pf model [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 328

 Score = 29.6 bits (66), Expect = 3.3
 Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 3/89 (3%)

Query: 107 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD 166
           PN+  V   E+   +      ++  +   K  K K +  +  +  +  K  +  K  K  
Sbjct: 29  PNMALVEEQEETSGVPAKAPTQEPSEEAPKFRKRKPRSNEPYRPVEPKKPSDSKKSGKST 88

Query: 167 KEKEKGKK---EEKEKKEEKEFKMKEDKE 192
           K KEK +K   + K K++   F    + E
Sbjct: 89  KSKEKQEKITDKFKVKRKVDRFNGVSEAE 117


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 28.8 bits (65), Expect = 3.3
 Identities = 15/71 (21%), Positives = 45/71 (63%), Gaps = 3/71 (4%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE--KKEEKEFKMKEDK-E 192
           +KE++K++K+     ++  K K+++ ++K  + ++K +  ++E  +K+++  +   DK +
Sbjct: 59  QKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKID 118

Query: 193 KPVEKIRKEEK 203
           K ++++ KE+ 
Sbjct: 119 KAIKEVAKEKG 129


>gnl|CDD|192488 pfam10228, DUF2228, Uncharacterized conserved protein (DUF2228).
           This is a family of conserved proteins of approximately
           700 residues found from worms to humans.
          Length = 253

 Score = 29.3 bits (66), Expect = 3.4
 Identities = 10/48 (20%), Positives = 24/48 (50%)

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
            K+ K    K+K +  ++K ++   +  K      + K + +K++KK+
Sbjct: 93  TKKSKEIPFKKKKISALKKIQELLTEWAKELGLSLEQKTKVMKQRKKK 140


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 29.7 bits (67), Expect = 3.5
 Identities = 20/95 (21%), Positives = 42/95 (44%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
            K K+  KKKK KK + K  ++++ + +            +  ++E    +    K K  
Sbjct: 273 VKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEE 332

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
           ++     +D +  + S  K ++L ++K KK + E 
Sbjct: 333 QEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPED 367


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 29.0 bits (65), Expect = 3.5
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
            K       KKKK K  K+K+K K+ + +KK  + 
Sbjct: 82  RKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREG 116


>gnl|CDD|188644 cd00957, Transaldolase_TalAB, Transaldolases including both TalA
           and TalB.  Transaldolases including both TalA and TalB.
           The enzyme catalyses the reversible transfer of a
           dyhydroxyacetone moiety, derived from
           fructose-6-phosphate to erythrose-4-phosphate yielding
           sedoheptulose-7-phosphate and
           glyceraldehyde-3-phosphate. The catalytic mechanism is
           similar to other class I aldolases. The enzyme is found
           in the non-oxidative branch of the pentose phosphate
           pathway and forms a dimer in solution.
          Length = 313

 Score = 29.5 bits (67), Expect = 3.5
 Identities = 18/61 (29%), Positives = 27/61 (44%), Gaps = 5/61 (8%)

Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK--MKEDK---EKPVEKIRKEEKDSE 206
           +++KN     E+K+D    K         +E  F+  + ED    EK  E IR   KD+ 
Sbjct: 248 EELKNSTAKVERKLDPAASKALDIHPNFLDESAFRWALNEDAMAVEKLSEGIRGFAKDAV 307

Query: 207 K 207
           K
Sbjct: 308 K 308


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 29.6 bits (66), Expect = 3.6
 Identities = 22/112 (19%), Positives = 60/112 (53%), Gaps = 7/112 (6%)

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKE---KDKEKKVDKEKEKGKKEEKEKKE----EKEFK 186
              +++ +  ++K +  + KI+N +   + KE+++ +EKE+ +  + +       EK  K
Sbjct: 381 EDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEK 440

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
              +KE+ +E+++++    E+ ++ + +  K + E LK++ +  +    E+E
Sbjct: 441 ALAEKERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLSERE 492


>gnl|CDD|220261 pfam09484, Cas_TM1802, CRISPR-associated protein TM1802
           (cas_TM1802).  Clusters of short DNA repeats with
           non-homologous spacers, which are found at regular
           intervals in the genomes of phylogenetically distinct
           prokaryotic species, comprise a family with recognisable
           features. This family is known as CRISPR (short for
           Clustered, Regularly Interspaced Short Palindromic
           Repeats). A number of protein families appear only in
           association with these repeats and are designated Cas
           (CRISPR-Associated) proteins. This minor cas protein is
           found in at least five prokaryotic genomes:
           Methanosarcina mazei, Sulfurihydrogenibium azorense,
           Thermotoga maritima, Carboxydothermus hydrogenoformans,
           and Dictyoglomus thermophilum, the first of which is
           archaeal while the rest are bacterial.
          Length = 584

 Score = 29.6 bits (67), Expect = 3.7
 Identities = 23/98 (23%), Positives = 39/98 (39%), Gaps = 17/98 (17%)

Query: 136 KKEKEKIKKKKDKKDKD-KIKNKE----KDKEKKVDKEKEKGKKEEKEKKEEK------- 183
           +   +K+K    K  KD    NK     KD +K ++K  EK  K+  + K+ +       
Sbjct: 94  ETLNKKLKSIFKKFYKDKGEINKYRYFLKDIKKVLEKNFEKIIKDLIDLKKNEGVLYTIY 153

Query: 184 -----EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
                  K   D E+   K+  + +     KKSK++  
Sbjct: 154 FLKNDGKKYLSDYEEYRNKVLYKIEQRYFSKKSKEEGV 191


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 28.8 bits (65), Expect = 3.8
 Identities = 20/86 (23%), Positives = 46/86 (53%), Gaps = 8/86 (9%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR--KEEKDSEKEK-KS 211
            +KE+  E+   +E E+ +K E+E+ EE+E + + + E+  ++I   + + +  K+K   
Sbjct: 2   SDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLR 61

Query: 212 KDKD-----KKLKKEKLKKKKKEKER 232
              +     K+ ++E+ + KK   E+
Sbjct: 62  AQAEFENLRKRTEREREEAKKYAIEK 87


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 29.6 bits (66), Expect = 3.8
 Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 9/75 (12%)

Query: 156 NKEKDKE-----KKVDKEKEKG--KKEEKEKKEEKEFKMKEDKEKPVEKIRKE--EKDSE 206
           + E D E       V  E+E+G  K+EE+++  E+  +++E +    E    E  EKD E
Sbjct: 83  DNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDE 142

Query: 207 KEKKSKDKDKKLKKE 221
           KE  ++  + +L  E
Sbjct: 143 KESDAEGDENELAGE 157


>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
           selection and in elongation by RNA polymerase II
           [Transcription].
          Length = 521

 Score = 29.6 bits (66), Expect = 3.9
 Identities = 26/135 (19%), Positives = 63/135 (46%), Gaps = 3/135 (2%)

Query: 130 DKKTHKKKEKEKIK---KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
           ++K+ ++K K   +   K KD+ D+ KI    + + + +  E+E+   +  E++E     
Sbjct: 57  NEKSEEEKFKNPYRLEGKFKDEADRAKIMAMTEIERESILFEREEEISKLMERRELAIRM 116

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK 246
            ++ +       R+  +       +++K KKL + K  ++++E+  S    E+     +K
Sbjct: 117 EQQHRSSGCTDTRRSTRYEPLTSAAEEKKKKLLELKKTREREERLYSERHIELQRFKDYK 176

Query: 247 FGTDMEEKTKRESSP 261
              + E+  + E +P
Sbjct: 177 ELEESEQGLQEEYTP 191


>gnl|CDD|225768 COG3227, LasB, Zinc metalloprotease (elastase) [Amino acid
           transport and metabolism].
          Length = 507

 Score = 29.7 bits (67), Expect = 3.9
 Identities = 20/130 (15%), Positives = 37/130 (28%), Gaps = 6/130 (4%)

Query: 57  TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEK-EYSKVKDVELGVT-----PMVPNIV 110
                 +K  +S K   + KE +   E  +     +         G T      +V  + 
Sbjct: 35  MKSQIFIKGTLSKKSAPNEKEILQFLENVNADNNLKAISTDTDPNGFTHVRYQQVVNGVP 94

Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
              S   +   K       +   +KKK+      K   +    I  K+   E+  +  K 
Sbjct: 95  VKGSEVIVHLDKNGVVKAVNGFVNKKKDLPTNTPKISAEQAIAIARKQFGYERLTESPKA 154

Query: 171 KGKKEEKEKK 180
           +     K   
Sbjct: 155 ELMVRVKNGS 164


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 9/117 (7%)

Query: 128 KKDKKTHKKKEKEKIK-----KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
           K D +  KK   + +      +K  KK+K+ I+  E+ + K   K  EK + ++ +KK +
Sbjct: 61  KDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEE-EFKSAKKYIEKIEDKKLKKKAK 119

Query: 183 KEFKMKEDKEKPVEKIRKEEK---DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
           +  K+ +++ K  +K+ K  K   + EKE     KDK L  ++L +K K   +S  +
Sbjct: 120 QLVKVMKERYKSYDKLYKAYKKALNLEKELYEYLKDKDLTLKELDEKIKAVNQSYEK 176



 Score = 28.9 bits (65), Expect = 4.6
 Identities = 24/104 (23%), Positives = 54/104 (51%), Gaps = 11/104 (10%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIK-----KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
           +++K  ++++K  ++ EK++ +      +  K D + IK         VDK      +E+
Sbjct: 30  EQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDK------REK 83

Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
             KKE++  +  E++ K  +K  ++ +D + +KK+K   K +K+
Sbjct: 84  LLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKE 127



 Score = 28.5 bits (64), Expect = 6.4
 Identities = 26/105 (24%), Positives = 62/105 (59%), Gaps = 3/105 (2%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE--KKEEKEF 185
           K ++  +++ EK   ++K  ++++ K++  EK +++  D+  + GK + +   K  ++  
Sbjct: 16  KPEESIYEQLEKAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQAL 75

Query: 186 KMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKE 229
              + +EK ++K ++  + SE+E K +K   +K++ +KLKKK K+
Sbjct: 76  ANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQ 120


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 29.5 bits (66), Expect = 4.0
 Identities = 11/122 (9%), Positives = 48/122 (39%), Gaps = 18/122 (14%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF-----KMKEDK 191
           +++E  ++++  + +D+ ++ E+ + +  D+ + + +     ++  +E      + + D 
Sbjct: 1   RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS 60

Query: 192 EKPV-----------EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
             P            ++ R+  +        + + +   +    + +K+ ++ S      
Sbjct: 61  RSPRSLRYSSVRRSRDRPRRRSRSVRSI--EQHRRRLRDRSPSNQWRKDDKKRSLWDIKP 118

Query: 241 PK 242
           P 
Sbjct: 119 PG 120



 Score = 28.3 bits (63), Expect = 9.0
 Identities = 11/104 (10%), Positives = 46/104 (44%), Gaps = 13/104 (12%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           + +E ++ ++K   +D+D+   + + + +   + +++        +  +E   +ED  +P
Sbjct: 1   RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDR-------HRRSRERSYRED-SRP 52

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
            ++ R +            +   +++ + + +++ +   S E+ 
Sbjct: 53  RDRRRYDS-----RSPRSLRYSSVRRSRDRPRRRSRSVRSIEQH 91


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 29.5 bits (67), Expect = 4.1
 Identities = 20/115 (17%), Positives = 45/115 (39%), Gaps = 7/115 (6%)

Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
           I  VT    L  +++ + + K+  +  K   E + K +       +  + K+ EK++++ 
Sbjct: 698 IEAVTGEAALEYLQEQEDLLKEIASLLKVPPELLPKVER------LLEELKELEKELERL 751

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
           K+K     +        K +E     V     +  D ++ ++  D  KK     +
Sbjct: 752 KKK-LAAAELADLLSNAKAEEIGGVKVLAKEVDGADMKELREIADDLKKKLGSAV 805


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 29.4 bits (66), Expect = 4.1
 Identities = 16/65 (24%), Positives = 31/65 (47%)

Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
           K  ++    + D  +E  K +EK   E   +    D++   E   + +++ EK++K K K
Sbjct: 304 KPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKK 363

Query: 215 DKKLK 219
            +K K
Sbjct: 364 PRKRK 368


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 28.1 bits (63), Expect = 4.3
 Identities = 19/81 (23%), Positives = 37/81 (45%), Gaps = 6/81 (7%)

Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 245
           K+   K K     R ++    K+K   +K +K  K +L+ +K++       K V+P+   
Sbjct: 13  KLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPED-- 70

Query: 246 KFGTDMEEKTKRESSPKIVIK 266
                 +E+  R+ + K V+K
Sbjct: 71  ----LEKERRLRKVAQKGVVK 87



 Score = 27.7 bits (62), Expect = 6.9
 Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 13/109 (11%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK---------E 181
            K    K K+ I  +  K  K K K K +  EKK  ++    K++  EK           
Sbjct: 12  SKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDL 71

Query: 182 EKEFKMKEDKEKPVEK----IRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
           EKE ++++  +K V K    +R  +K +EK  K   K  ++K+E+L K+
Sbjct: 72  EKERRLRKVAQKGVVKLFNAVRAAQKKTEKAVKEAGKKARVKREELLKE 120


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 29.7 bits (67), Expect = 4.3
 Identities = 29/119 (24%), Positives = 57/119 (47%), Gaps = 12/119 (10%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK--EFK-- 186
            +T  K   +++KK+ D++      +    KE K   E+ +   E    +  +  E++  
Sbjct: 744 ARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAF 803

Query: 187 MKED--------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
           M+E         +E+P   I+  E +S  E+  ++  + +K  KL++KK E+ER + EK
Sbjct: 804 MQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEK 862


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 28.9 bits (65), Expect = 4.5
 Identities = 12/42 (28%), Positives = 29/42 (69%), Gaps = 3/42 (7%)

Query: 165 VDKEK-EKGKKEEKE-KKEEKEFKMKEDKEKPVEKIRKEEKD 204
           +D+EK ++ +K  KE +KE +E +   D +K ++K+++++ +
Sbjct: 69  IDQEKMKELQKMMKEFQKEFREAQESGDMKK-LKKLQEKQME 109


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 27.7 bits (62), Expect = 4.5
 Identities = 13/31 (41%), Positives = 23/31 (74%)

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
           +K +EK+ + +EDKE+ +E+I+KEE+  E  
Sbjct: 50  DKVKEKKEEREEDKEELIERIKKEEETFEDV 80


>gnl|CDD|233309 TIGR01200, GLPGLI, GLPGLI family protein.  This protein family was
           first noted as a paralogous set in Porphyromonas
           gingivalis, but it is more widely distributed among the
           Bacteroidetes. The protein family is now renamed GLPGLI
           after its best-conserved motif.
          Length = 227

 Score = 28.9 bits (65), Expect = 4.5
 Identities = 21/88 (23%), Positives = 32/88 (36%), Gaps = 7/88 (7%)

Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK--- 177
           I K+ S       +K    + + KK   +  D   +  K KE        K  K+ K   
Sbjct: 52  IGKNISKFYSLTFNKN---DSLFKKVKNEGIDANASSFKAKEIGSSYIIYKNYKKGKVTF 108

Query: 178 -EKKEEKEFKMKEDKEKPVEKIRKEEKD 204
            +K   K+F   +   K   K+ KE K 
Sbjct: 109 YDKIFGKKFLYTDTLPKINWKLTKETKT 136


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 29.6 bits (67), Expect = 4.5
 Identities = 8/38 (21%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 129 KDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKV 165
           ++KK +KK  +++ +  +     K K      +K++ V
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHV 121


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 28.5 bits (64), Expect = 4.6
 Identities = 27/81 (33%), Positives = 38/81 (46%)

Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 210
           K++ K K K  E K   +K+K +          E    ED E   E  R E +   KEK 
Sbjct: 13  KNEPKWKSKRSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKL 72

Query: 211 SKDKDKKLKKEKLKKKKKEKE 231
            +DK K  K+E  K+K+KE+ 
Sbjct: 73  RRDKLKAKKEEAEKEKEKEER 93


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 29.3 bits (67), Expect = 4.8
 Identities = 32/119 (26%), Positives = 48/119 (40%), Gaps = 36/119 (30%)

Query: 129 KDKKTHKKKE----------KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
           KDK T K++            E+I++      KD   N E+DK+          +KE  E
Sbjct: 482 KDKGTGKEQSITITASSGLSDEEIERMV----KDAEANAEEDKK----------RKELVE 527

Query: 179 KKE---------EKEFKMKEDK--EKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKK 225
            +          EK  K   DK      EKI    K+ ++  K +DK+  K K E+L +
Sbjct: 528 ARNQADSLIYQTEKTLKELGDKVPADEKEKIEAAIKELKEALKGEDKEAIKAKTEELTQ 586


>gnl|CDD|225405 COG2849, COG2849, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 230

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 21/116 (18%), Positives = 41/116 (35%), Gaps = 3/116 (2%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
              ++  K+   D  T        I++K +  +K + K K  D++ K+  + +    EE 
Sbjct: 2   GDIVETSKNGVDDGVTITYYLNGAIEEKANYVNKLEGKEKLYDEDGKLLSKLDFSAGEEV 61

Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           + KEE          K      K+   +   +K  +  +       K  K + E  
Sbjct: 62  KSKEE---YYDNGTLKGEYLEGKKTPFTGIVEKYYENGELEAAIYYKNGKLDGETK 114


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 29.1 bits (66), Expect = 4.9
 Identities = 33/182 (18%), Positives = 71/182 (39%), Gaps = 32/182 (17%)

Query: 75  PKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTH 134
           P++K+   +  S  E     +   +  V  +V  +V    S+++ E  K    ++ +   
Sbjct: 153 PEDKLEELKLESDVENVE-YISTDKGYVYVVV--VVLKELSDEVEEELKKLGFERLELEE 209

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           +    E I++ K++              ++++KE+E   +E KE  ++   ++    E  
Sbjct: 210 EGTPSELIREIKEEL-------------EEIEKERESLLEELKELAKKYLEELLALYEY- 255

Query: 195 VEKIRKEEKDSEKEKKS-----------KDKDKKLKKEKLKKKKK----EKERSSHEKEV 239
           +E   +  +   K  K+           +D+ KKLK+   K        E      E+E 
Sbjct: 256 LEIELERAEALSKFLKTDKTFAIEGWVPEDRVKKLKELIDKATGGSAYVEFVEPDEEEEE 315

Query: 240 IP 241
           +P
Sbjct: 316 VP 317



 Score = 29.1 bits (66), Expect = 6.2
 Identities = 35/210 (16%), Positives = 83/210 (39%), Gaps = 35/210 (16%)

Query: 74  SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 133
             KE+  S E+        +K+ +           + K+ S      + K   ++++KK 
Sbjct: 35  DLKEE-LSNERLRKLRSLLTKLSEA----------LDKLRS-----YLPKLNPLREEKKK 78

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK-----MK 188
              K  E++ K  ++ + +KI+ + K+ E+++ + + + K+ E+E +  + +      + 
Sbjct: 79  VSVKSLEELIKDVEE-ELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLS 137

Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK--------EKERSSHEKEVI 240
                    +       +K ++ K +      E +   K          KE S   +E +
Sbjct: 138 LLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEEL 197

Query: 241 PKLTFKFGTDMEEKTKRESSPKIVIKPVKS 270
            KL F+       + + E +P  +I+ +K 
Sbjct: 198 KKLGFE-----RLELEEEGTPSELIREIKE 222



 Score = 28.7 bits (65), Expect = 7.1
 Identities = 23/103 (22%), Positives = 50/103 (48%), Gaps = 22/103 (21%)

Query: 142 IKKKKDKKDKDKIKNKEKDKEK---KVDK-----EKEKGKKEEKEKKEEKEFKMKEDKEK 193
           I+  K++   ++++       K    +DK      K    +EEK+K   K          
Sbjct: 33  IEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSL-------- 84

Query: 194 PVEKIRKE-EKDSEK-EKKSKDKDKKLK--KEKLKKKKKEKER 232
             E++ K+ E++ EK EK+ K+ ++++   + ++K+ ++E ER
Sbjct: 85  --EELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIER 125


>gnl|CDD|234961 PRK01581, speE, spermidine synthase; Validated.
          Length = 374

 Score = 29.2 bits (65), Expect = 4.9
 Identities = 22/80 (27%), Positives = 40/80 (50%), Gaps = 11/80 (13%)

Query: 134 HKKKEKEKIKK----KKDKK---DKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
           H+K+ K KI +    KKDK+   D +K + +++  E K DK+ ++ K++   +  E    
Sbjct: 4   HRKQSKIKIYRITSYKKDKRSELDSNKFELEQQLVENKQDKQGKQYKQDRGIQYAET--- 60

Query: 187 MKEDKEKPVEKIRKEEKDSE 206
            K+D +   E +     DS 
Sbjct: 61  -KQDNQVQSENVVIVPTDSH 79


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 29.4 bits (66), Expect = 4.9
 Identities = 36/149 (24%), Positives = 65/149 (43%), Gaps = 9/149 (6%)

Query: 90  KEYSKVKDVELGVTPMVPNIVKVTSSED-LGEIKKDKSIKKDKKTHKKKE----KEKIKK 144
           K Y K K V   V      I  +   ED L E+  DK  KK    +  K     K+K+KK
Sbjct: 60  KAYQKAKRVSENVKTPRFYIKTLVMLEDFLNELWADKEAKKKMSKNNAKALNTLKQKVKK 119

Query: 145 --KKDKKDKDKIK-NKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE-KIRK 200
             K+ + D  + + + E + E++ + E +     + E ++E      E+     E  + +
Sbjct: 120 NNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR 179

Query: 201 EEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
            ++D E+++ +    K + +E    KK E
Sbjct: 180 VKEDDEEDEDADLSKKDVLEEPKMFKKPE 208


>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 199 to 238 amino acids in length. This domain is
           found associated with pfam06512, pfam00520. This domain
           has a conserved ADD sequence motif.
          Length = 222

 Score = 28.6 bits (64), Expect = 5.1
 Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 15/71 (21%)

Query: 164 KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
           +  KE+     + K+++                 I  EE DSEKE KS+  D + +   L
Sbjct: 9   ESAKERRNRNDKNKKEEHS---------------IGSEEGDSEKEPKSESADGRKRCRFL 53

Query: 224 KKKKKEKERSS 234
            +K ++  +S 
Sbjct: 54  LRKTRKVSQSL 64


>gnl|CDD|241381 cd13227, PH-like_Vps36_GLUE, Vps36 GLUE (GRAM-like
           ubiquitin-binding in Eap45) Pleckstrin homology-like
           domain.  ESCRT complexes form the main machinery driving
           protein sorting from endosomes to lysosomes. Yeast/human
           ESCRT-I consists of Vps23/Tsg101, Vps28/Vps28, and
           Vps37/Vps37 homolog. Yeast/human ESCRT-II is composed of
           Vps25/EAP20, Vps22/EAP30, and Vps36/EAP45. Yeast
           ESCRT-III consists Vps2, Vps20, Vps24, and Snf7
           subunits. In contrast, there are three human paralogs of
           Snf7 (hSnf7-1/CHMP4A, hSnf7-2/CHMP4B, and
           hSnf7-3/CHMP4C) and two paralogs of Vps2 (CHMP2A and
           CHMP2B). Yeast ESCRT-I links directly to ESCRT-II,
           through a tight interaction of Vps28 (ESCRT-I) with the
           yeast-specific zinc-finger insertion within the GLUE
           domain of Vps36. The Vps36 subunit (ESCRT-II) binds
           ubiquitin using one of its two NZF zinc fingers in its
           N-terminal region. Human Vps36, EAP45, also binds
           ubiquitin despite having no NZF domain. Instead,
           mammalian ESCRT-II interacts with Ub through the Eap45
           GLUE domain itself. The yeast Vps36 GLUE has a complete
           PH domain, wherease Eap45 GLUE only has a PH-like fold
           since it lacks the secondary structure element
           corresponding to the 4 strand. ESCRT-II also interacts
           with ESCRT-III via a Vps25(EAP20)/Vps20(CHMP6)
           interaction. Structure 2CAY is missing this insertion
           that contains 2 NZF zinc fingers. It is a split PH
           domain, with a noncanonical lipid binding pocket that
           binds PI(3)P. The interactions of ESCRT-II GLUE domain
           with membranes, ESCRT-I, and ubiquitin are critical for
           ubiquitinated cargo progression from early to late
           endosomes. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 126

 Score = 28.0 bits (63), Expect = 5.2
 Identities = 10/43 (23%), Positives = 13/43 (30%), Gaps = 2/43 (4%)

Query: 259 SSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV--WICPACGVQ 299
            SPKI++      +           TG       W CP C   
Sbjct: 81  RSPKIILFLKDEINSKSGLGQSSNKTGSQNVTSQWTCPICMFT 123


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 28.9 bits (65), Expect = 5.2
 Identities = 23/168 (13%), Positives = 59/168 (35%), Gaps = 7/168 (4%)

Query: 65  DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKD 124
           DK+   ++ + ++K   K++T     +  +  D E        +       E        
Sbjct: 57  DKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEPAETAGSL 116

Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           + I+     ++   +E     +  K  +     EK+  ++V++ + +     +     + 
Sbjct: 117 RPIRSSAAAYRDINRELGSFYEYPKTDE-----EKELLREVEELESRL--ATEPSPAPEL 169

Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
            +     EK  E   K     ++      + KK KK  ++  +  +E+
Sbjct: 170 EEQLALMEKSYELAAKYMPRGQERLPVAPESKKGKKPSVQPVRAVEEK 217



 Score = 28.5 bits (64), Expect = 6.8
 Identities = 14/72 (19%), Positives = 32/72 (44%), Gaps = 2/72 (2%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           E+   +E+ KG   +  + +++   +  DK K  E+   E+K  +K++  +D     +K 
Sbjct: 31  EEDAAEEETKGFNADIPQPKDEG--IIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKS 88

Query: 222 KLKKKKKEKERS 233
             + +      S
Sbjct: 89  DKESQDYSSSDS 100


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 29.3 bits (64), Expect = 5.3
 Identities = 32/113 (28%), Positives = 64/113 (56%), Gaps = 5/113 (4%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           ++ + ++K KK+ KKK +++I K  + + K   K   + + ++ + E +K   +E+E K+
Sbjct: 62  EQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQ 121

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
            K  + +  KE+      K+E D +K +  K K+K +K EK +K +K K+ S+
Sbjct: 122 WKAEQERIQKEQ-----EKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKAST 169


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 28.4 bits (63), Expect = 5.3
 Identities = 15/37 (40%), Positives = 16/37 (43%)

Query: 25  PFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPS 61
           P P   GPG+ P  PP    P IP P       PPP 
Sbjct: 112 PPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPP 148


>gnl|CDD|216295 pfam01093, Clusterin, Clusterin. 
          Length = 434

 Score = 28.9 bits (65), Expect = 5.3
 Identities = 16/67 (23%), Positives = 33/67 (49%), Gaps = 5/67 (7%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
           EK VD+E +      K+ K      + E  E+  + +    + ++KEK+   K     +E
Sbjct: 11  EKYVDEEVKNALIGVKQMKT-----LMERTEEEHKNLMSTLEKTKKEKEEALKLANEVEE 65

Query: 222 KLKKKKK 228
           KL+++++
Sbjct: 66  KLEEEEE 72


>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor.  The ribosome recycling
           factor (RRF / ribosome release factor) dissociates the
           ribosome from the mRNA after termination of translation,
           and is essential bacterial growth. Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.
          Length = 165

 Score = 28.2 bits (64), Expect = 5.4
 Identities = 16/45 (35%), Positives = 28/45 (62%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 160
           + L +++KDK I +D+    +KE +K+  K  KK  + +K KEK+
Sbjct: 119 DKLKKLEKDKEISEDEVKRAEKEIQKLTDKYIKKIDELLKKKEKE 163


>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD.  YqfD is part of
           the sigma-E regulon in the sporulation program of
           endospore-forming Gram-positive bacteria. Mutation
           results in a sporulation defect in Bacillus subtilis.
           Members are found in all currently known
           endospore-forming bacteria, including the genera
           Bacillus, Symbiobacterium, Carboxydothermus,
           Clostridium, and Thermoanaerobacter [Cellular processes,
           Sporulation and germination].
          Length = 382

 Score = 28.8 bits (65), Expect = 5.5
 Identities = 43/186 (23%), Positives = 78/186 (41%), Gaps = 9/186 (4%)

Query: 60  PSKMKDKVSPKDKTSPKEKMSSK---EKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
           P  +  K  P++  + K+ +  +       P  K+   VK  +L ++ ++    K  +  
Sbjct: 177 PKPVLKKAEPRNIVAKKDGVIKRVYVTSGEPVVKKGDVVKKGDLLISGILGKEGKTYTVH 236

Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
             GE+      +K  +   K ++E    KK  +    I  K K   KK  KE  K   EE
Sbjct: 237 AEGEVFARTWYEKTVEVPLKTQREVRTGKKIVRYYLNIGGK-KIYLKKPPKEFAKYDVEE 295

Query: 177 KEKK---EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           K KK         ++  KE   E   K EK +++E  + +K K+  +++LKK+     + 
Sbjct: 296 KTKKFGLWRWSLPIEIVKETYYEVKEKVEKVTKEE--AIEKAKEKAEKELKKELDPNAKI 353

Query: 234 SHEKEV 239
             +K +
Sbjct: 354 VSDKIL 359


>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 205

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 14/79 (17%), Positives = 29/79 (36%), Gaps = 3/79 (3%)

Query: 122 KKDKSIKKDKKTHKKKEKEK-IKKKKDKKDKDKIKNKEKDKEKKVD--KEKEKGKKEEKE 178
           +  +S+ + K    K      ++       K+      +  E+++    + ++ K     
Sbjct: 20  QLGQSLAELKLRLDKDALLAGLEDALALVQKEAALEALQRAEQRLQQAAKAKRAKDAAAN 79

Query: 179 KKEEKEFKMKEDKEKPVEK 197
             E K F  K  KEK V+ 
Sbjct: 80  AAEGKAFLEKNAKEKGVKT 98


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 28.6 bits (64), Expect = 5.6
 Identities = 17/65 (26%), Positives = 28/65 (43%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           EI   K  KK + T  +K     K KK  K+K   + K+ + +    +E E      +E 
Sbjct: 44  EIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEES 103

Query: 180 KEEKE 184
            + +E
Sbjct: 104 GDSQE 108


>gnl|CDD|187859 cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Protein of this family often fused to HTH domain; Some
           proteins could have an additional fusion with
           RecB-family nuclease domain; Core domain appears to have
           a Rossmann-like fold; loosely associated with CRISPR/Cas
           systems; also known as DxTHG family.
          Length = 400

 Score = 28.9 bits (65), Expect = 5.7
 Identities = 17/99 (17%), Positives = 38/99 (38%), Gaps = 18/99 (18%)

Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI------------- 154
           + ++  +   + EI   K    +K     K+ ++  KK   +  +KI             
Sbjct: 184 HEIESETIAPVTEISYLKEKFINKYKDNNKDLKRYFKKALNEGLEKIIKFASALYNGLPL 243

Query: 155 -----KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
                K   ++  + ++ + E  +  E  K +E+E K+K
Sbjct: 244 YLLYLKPSIEELREPLEFKLEIIEIIESNKNQEEEIKVK 282


>gnl|CDD|144261 pfam00603, Flu_PA, Influenza RNA-dependent RNA polymerase subunit
           PA. 
          Length = 669

 Score = 29.1 bits (65), Expect = 5.7
 Identities = 19/105 (18%), Positives = 40/105 (38%), Gaps = 12/105 (11%)

Query: 54  PPGTPPPSKMK---DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVT---PMVP 107
           P       +M+   DK +P++      +MS + K   K   +++++ + L      P  P
Sbjct: 188 PLPFSTYEQMRDYCDKFTPREIEPNVSRMSGRVKLPIKHYRWTQMRPIPLLKEPEGPPSP 247

Query: 108 NIVKVTSSEDL------GEIKKDKSIKKDKKTHKKKEKEKIKKKK 146
              K+  ++ L          ++  I  D     + E E I+  +
Sbjct: 248 YRAKLLMADALYLTICDPGTDQEGIILIDSTFFSQSEPECIRPSE 292


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 29.2 bits (66), Expect = 5.7
 Identities = 16/79 (20%), Positives = 48/79 (60%)

Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
           D E++V++ +E+ ++ E   + E   +  E++ + +E++  E +++ +EK+ + ++ + +
Sbjct: 486 DLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRER 545

Query: 220 KEKLKKKKKEKERSSHEKE 238
             +L+ + +EK  ++ E E
Sbjct: 546 AAELEAEAEEKREAAAEAE 564


>gnl|CDD|237512 PRK13800, PRK13800, putative oxidoreductase/HEAT repeat-containing
           protein; Provisional.
          Length = 897

 Score = 29.0 bits (65), Expect = 5.8
 Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 8/57 (14%)

Query: 27  PSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPS---KMK----DKVSPKDKTSPK 76
             V  PG +P       H LI +PL HP G P P    K++    D V+P  KT+ K
Sbjct: 410 AEVPAPGELPADQLAEAHELIYRPLRHPDGPPQPQVEYKLRRFVNDYVAPP-KTAAK 465


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 28.4 bits (64), Expect = 5.8
 Identities = 14/62 (22%), Positives = 35/62 (56%), Gaps = 10/62 (16%)

Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
           +  E+ ++EE+EK++E+E           E++ +  K   ++ + K K K+++KE+ ++ 
Sbjct: 117 RFLEQLEREEEEKRDEEE----------RERLLRAAKSRSEQSRLKQKAKEMQKEEDEEM 166

Query: 227 KK 228
           + 
Sbjct: 167 RH 168


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 29.1 bits (66), Expect = 6.0
 Identities = 38/170 (22%), Positives = 63/170 (37%), Gaps = 38/170 (22%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG----- 172
           L ++K        ++  KKKEK+K KKKK K  + KI  K   +  + D E  K      
Sbjct: 385 LKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKI-YKVLKEATRQDLELLKNVWGEI 443

Query: 173 ----KKEEKEK---------------------KEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
               K + K                       + E  F+     ++  + I         
Sbjct: 444 LESLKAQRKSLRALLVNSEPVAASEDTVVLAFEYEIHFEKAMLDKELNDTIENILSKLLG 503

Query: 208 EK-----KSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIPKLTFKFGTD 250
           +        +D+ +K+++E L+K K E+E    E    +IP+    FG D
Sbjct: 504 KSVEIVAVPEDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEAKELFGED 553


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 28.5 bits (64), Expect = 6.0
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK 191
            K+K  EK +EK+    ++K ++    KK EKE + ++ K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191



 Score = 28.1 bits (63), Expect = 9.3
 Identities = 18/48 (37%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
            K +E EK+EEK+ +  E K +     +K EK+ EK +++K  + KLK
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEK-RQAKYSEAKLK 198


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 28.9 bits (64), Expect = 6.1
 Identities = 29/131 (22%), Positives = 50/131 (38%), Gaps = 2/131 (1%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE--KIRKEEKDSEKEKKSKDK 214
           K  +  KK  KE E  + EE+E++EE+E    E+ E+          E D        D 
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDN 180

Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPP 274
            ++ ++E + + + E    +      P   F+  T  +E       P   +   +     
Sbjct: 181 GEEGEEESVTEAEAEGTTVAGPTTTSPNGGFQPTTPPQEVYGTTDPPFGKVTTPEYQGEY 240

Query: 275 PVAAADEYDTG 285
               A+EYD G
Sbjct: 241 EQTGANEYDGG 251


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 28.6 bits (64), Expect = 6.1
 Identities = 28/152 (18%), Positives = 77/152 (50%), Gaps = 15/152 (9%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEK--DKEKKVDKEKEKGK--KEEKEKKEEKEFKMKEDK 191
           ++E+++++++ ++ ++D  + +++  + E+   + +EK K  +EE +  E+K  +++E+ 
Sbjct: 4   EREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEEN 63

Query: 192 EKPVEKIRKEEKDSEK-EKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 250
            +  E+    E++ E+ E +  +   ++ K + +++KKE E    ++E+          +
Sbjct: 64  RRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELR---------E 114

Query: 251 MEEKTKRESSPKIVIKPVKSPSPPPVAAADEY 282
            +E  +R     +        +PP VAA    
Sbjct: 115 AQEAHERARQELLEAAA-APTAPPHVAAPVNG 145


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 28.3 bits (63), Expect = 6.1
 Identities = 18/61 (29%), Positives = 34/61 (55%), Gaps = 9/61 (14%)

Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKVDKEKEKGKKEEKEKKEEKE 184
           S +KD K  ++ +           D   IKN  E++KEKK +  KE+ K+ +K+++ ++E
Sbjct: 68  SFEKDHKFIRRCKLSDA-------DFSLIKNHLEEEKEKKKNLNKEE-KEAKKKERAKRE 119

Query: 185 F 185
           +
Sbjct: 120 Y 120


>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
           mechanisms].
          Length = 733

 Score = 29.0 bits (65), Expect = 6.1
 Identities = 18/123 (14%), Positives = 41/123 (33%), Gaps = 5/123 (4%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK-----E 170
           ++ G+I      ++        EK + K K  ++   +I N +     +VD++       
Sbjct: 538 KENGKIYVYNDEERGPYLKYSYEKLEKKLKSLEELPLEIINGDSLLSDEVDEKINQDILR 597

Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
              +   E  E         K  P       E         + +D  L++++   ++ E 
Sbjct: 598 IQDRILSEAIELARLLDLLHKSSPDVLNSIFEYLKFLTDAEEIRDVPLEEKEELDERIES 657

Query: 231 ERS 233
             +
Sbjct: 658 LYN 660


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 29.1 bits (65), Expect = 6.2
 Identities = 31/187 (16%), Positives = 53/187 (28%), Gaps = 17/187 (9%)

Query: 106 VPNIVKVTSSEDLGEIKKDKSIKKDK---KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 162
            P+ V   +S  L +  K  +I  +       K       +K  D  D D+     +D E
Sbjct: 43  GPSAVDERTSGVLRDEGKHANILYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDE 102

Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
                        + +   E   + ++   +          D   +    D       + 
Sbjct: 103 DDAGNATSGESSTDDDSLLELPDRDEDADTQ-------ANNDQTNDFDQDDSSNSQTDQG 155

Query: 223 LKKKKKEKERSSH--EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 280
           LK+            EK+   + TFKF     +  +  +         KS S  P    D
Sbjct: 156 LKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKD-----GGKSKSSDPGPLND 210

Query: 281 EYDTGDS 287
               GD 
Sbjct: 211 SDGQGDD 217


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 28.9 bits (64), Expect = 6.2
 Identities = 32/145 (22%), Positives = 72/145 (49%), Gaps = 17/145 (11%)

Query: 106 VPNIVKVTSSEDLGEI--------KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-N 156
           V NI  +TS   +  +        K D+   +D  TH          + DK   ++ +  
Sbjct: 382 VTNITSITSVTSVASVASVASVPSKDDRKYPQDGATHCHAVNGHYGGRVDKDHAERARIE 441

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
           KE    K ++ +  + K+ E+ ++EE+E   + ++E+ +E+I +E  + E+ ++ + +  
Sbjct: 442 KENAHRKALEMKILEKKRIERLEREERE---RLERER-MERIERERLERERLERERLERD 497

Query: 217 KLKKEKLKKKKKEK----ERSSHEK 237
           +L++++L + ++E+    ER   EK
Sbjct: 498 RLERDRLDRLERERVDRLERDRLEK 522


>gnl|CDD|233469 TIGR01564, S_layer_MJ, S-layer protein, MJ0822 family.  This model
           represents one of several families of proteins
           associated with the formation of prokaryotic S-layers.
           Members of this family are found in archaeal species,
           including Pyrococcus horikoshii (split into two tandem
           reading frames), Methanococcus jannaschii, and related
           species. Some local similarity can be found to other
           S-layer protein families [Cell envelope, Surface
           structures].
          Length = 571

 Score = 28.8 bits (64), Expect = 6.4
 Identities = 32/188 (17%), Positives = 58/188 (30%), Gaps = 18/188 (9%)

Query: 70  KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
           KD     EK  S  K       Y       +G+  ++     V       ++K  K +K 
Sbjct: 310 KDGKVVAEKSDSDTKDGALVHVY---FGGGIGIQ-VLDAWENVGGDYGYVQLKIYKDLKT 365

Query: 130 DKKTHK----------KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
            +   +            +   + K     D+D +++KE         +     K  KE 
Sbjct: 366 LELGEEYMPDWKAEFAIDDNNTVIKWDALVDEDPLEDKEIGIFLTYTGDYSDTVKSGKEV 425

Query: 180 KEEKEFKMKEDKEKPVEKIR---KEEKDSEKEKKSKDKDKKLKKE-KLKKKKKEKERSSH 235
              +   +K D E   +K+      ++  +      DK + L  +  L+ K  E      
Sbjct: 426 GIIEYAVLKLDDEDKEDKLYIDFSMDEYKDATLTKGDKVEDLNGDINLEFKPAEAYIPVK 485

Query: 236 EKEVIPKL 243
               I  L
Sbjct: 486 LTAPIAVL 493


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 28.8 bits (64), Expect = 6.5
 Identities = 32/190 (16%), Positives = 76/190 (40%), Gaps = 26/190 (13%)

Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
           K+ ++   +E+E +++  +  DKD+I+    +  K      E+  +EE+ ++E+ E ++ 
Sbjct: 349 KEGQSQWAQERETLRQSAEA-DKDRIQKLSAELLKL-----EEWLQEERSQREKLEVELG 402

Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKERSSHEKEVIPKLTFKF 247
                        EKD  + + S+++ +  + +  L+  +KEKE+   EK+ +       
Sbjct: 403 I------------EKDCNRVQLSENRRELSELRSALRVLQKEKEQLQEEKQELLDYIRVL 450

Query: 248 GTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGD-------SKQVWICPACGVQD 300
              ++++   + +                A  D  D+ D       S ++        Q 
Sbjct: 451 ELRLDKEADEKWAEDAATCEDAKSYLHRAAYIDLLDSIDESPLDESSSELSPSALDEEQH 510

Query: 301 DGSLPMIGCD 310
           + S P    +
Sbjct: 511 NESTPKNPKE 520


>gnl|CDD|226475 COG3966, DltD, Protein involved in D-alanine esterification of
           lipoteichoic acid and wall teichoic acid (D-alanine
           transfer protein) [Cell envelope biogenesis, outer
           membrane].
          Length = 415

 Score = 29.0 bits (65), Expect = 6.5
 Identities = 25/114 (21%), Positives = 46/114 (40%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           + K D ++K+  K   KK+      K   K   +I  ++        K       ++  K
Sbjct: 174 QFKSDVTLKELLKKLAKKKPLTTADKSGVKLNLQILERKDLLFSATSKSGPLKHIKKALK 233

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
               +F  KE ++  VE  +K   ++    K+K   KK+K +  K K  +K  +
Sbjct: 234 DLPDQFSYKELEKLAVEIGKKSTTNNPFGIKNKYYIKKIKPKYKKLKDSQKNFT 287


>gnl|CDD|224396 COG1479, COG1479, Uncharacterized conserved protein [Function
           unknown].
          Length = 409

 Score = 28.8 bits (64), Expect = 6.5
 Identities = 28/88 (31%), Positives = 40/88 (45%), Gaps = 5/88 (5%)

Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK--IRKEEK 203
           K+    D +KNK     K   +EK++ K   ++KKE +E KM + K     K  +R    
Sbjct: 212 KELSKIDLLKNKLLMLIK---EEKKQSKDLNQDKKELEEIKMNKGKITERFKDFLRDYLT 268

Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
                   K + KKLKK     K+K KE
Sbjct: 269 YKISSNIKKRELKKLKKIYEDNKEKLKE 296


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 28.8 bits (64), Expect = 6.6
 Identities = 25/89 (28%), Positives = 45/89 (50%), Gaps = 4/89 (4%)

Query: 145 KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK-MKEDKEKPVEKIRKEEK 203
                ++++    E+D E+  +KE  K   E+ E++EE+E K MK+  +   +  +   K
Sbjct: 136 PDLVLEEEEEVEMEED-EEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVK 194

Query: 204 DSEKEKKSKDK--DKKLKKEKLKKKKKEK 230
           D+ K   S      KKL+K + K+K K+ 
Sbjct: 195 DNGKSFWSAASVFSKKLQKWRQKQKLKKP 223


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 27.9 bits (62), Expect = 6.6
 Identities = 32/107 (29%), Positives = 60/107 (56%), Gaps = 2/107 (1%)

Query: 123 KDKSIKKDKKTHKKKEKEKIKK-KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           K K I+++     + +KE +K  +K+     + ++ EK  +   +  K + KK+  EKKE
Sbjct: 20  KVKEIQRNLTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKE 79

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
             + + +E +EK + K +KE +  E  KK K K+++ +++KL KK K
Sbjct: 80  IAKQRKREQREKELAKRQKELEKIELSKK-KQKERERRRKKLTKKTK 125


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 28.5 bits (64), Expect = 6.6
 Identities = 18/68 (26%), Positives = 40/68 (58%), Gaps = 4/68 (5%)

Query: 178 EKKEEKEFK-MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
           E  EE+E   +++ + +  E+ R  E    + ++ ++ +++ ++EK ++KK++KER   E
Sbjct: 147 EVLEEEELAELRQQQRQ-FEQRRNAEL--AETQRLEEAERRRREEKERRKKQDKERKQRE 203

Query: 237 KEVIPKLT 244
           KE   K+ 
Sbjct: 204 KETAEKIA 211


>gnl|CDD|240406 PTZ00411, PTZ00411, transaldolase-like protein; Provisional.
          Length = 333

 Score = 28.6 bits (64), Expect = 6.8
 Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 5/51 (9%)

Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFK--MKED---KEKPVEKIRKEEKDSEK 207
           E+K+D EK     E+  +  EKEF+  + ED    EK  E IR   KD EK
Sbjct: 271 ERKLDPEKLTEDTEKLPELTEKEFRWELNEDAMATEKLAEGIRNFAKDLEK 321


>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 147 to 176 amino acids in length. This domain is
           found associated with pfam00226.
          Length = 144

 Score = 27.7 bits (62), Expect = 6.8
 Identities = 13/47 (27%), Positives = 27/47 (57%)

Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
            +  K+K+ ++++EK K E  +KK+E E  +    +    K+ +EE+
Sbjct: 3   NKAQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREEE 49


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 28.8 bits (65), Expect = 7.0
 Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 11/116 (9%)

Query: 138 EKEKIKKK-KDKKDKDKIKNKEKDKEKK-VDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           E+EK  ++ + KK  +    K+KD     + + K K     +    +   +         
Sbjct: 463 EREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAA 522

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKE---------KLKKKKKEKERSSHEKEVIPK 242
            + RK +  + + +K        KK          K KK  ++   +  E+EV PK
Sbjct: 523 REARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPK 578


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 27.7 bits (62), Expect = 7.0
 Identities = 22/76 (28%), Positives = 51/76 (67%), Gaps = 4/76 (5%)

Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KM 187
             +    KE  KI+KK+ K++ +K+   ++ ++K++  EKE  ++E+K +K+E+EF  K 
Sbjct: 62  KTRLKMFKESLKIEKKELKQEVEKLPRFQEQEKKRMKAEKE--EQEQKHQKQEREFLAKQ 119

Query: 188 KEDKEKPVEKIRKEEK 203
           +E+ E+ +++++ E++
Sbjct: 120 EENLEEALQQLQNEKR 135


>gnl|CDD|214378 CHL00160, rpl9, ribosomal protein L9; Provisional.
          Length = 153

 Score = 27.8 bits (62), Expect = 7.1
 Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER---SSHEKEVIPKL 243
           +K+ K        K ++  EK  K K   ++++K  +KKK  E  +   S  EKE+   +
Sbjct: 49  LKQQKMYQKILDLKLKEAKEKCLKVKQLLEEIQKFSVKKKVGENNQIFGSVTEKEISQII 108

Query: 244 TFKFGTDMEEK 254
             K   D+E++
Sbjct: 109 KNKTNIDLEKQ 119


>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
           motility and secretion].
          Length = 214

 Score = 28.3 bits (63), Expect = 7.3
 Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 6/80 (7%)

Query: 173 KKEEKEKKEEKEFKMK----EDKEKPVEKIRKEEKDSEKEKKS--KDKDKKLKKEKLKKK 226
           KK   E+K EK  K+     E+ ++ +E++  + K S+ E      ++ ++ K+  L +K
Sbjct: 1   KKPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEK 60

Query: 227 KKEKERSSHEKEVIPKLTFK 246
               E    EKE +     K
Sbjct: 61  VDSIEEEISEKEKVMSEKLK 80


>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
           This family consists of a series of primate specific
           nuclear pore complex interacting protein (NPIP)
           sequences. The function of this family is unknown but is
           well conserved from African apes to humans.
          Length = 350

 Score = 28.6 bits (63), Expect = 7.3
 Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 1/99 (1%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
           E K    I K K T K    +KI  K+ K  K+ ++     + +  +KE++  + EE  K
Sbjct: 114 ETKVRAKICKMKVTTKINHHDKINGKR-KTAKEHLRKLSMKECEHAEKERQVSEAEENGK 172

Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
            + KE    ++  +  + +R+  +D  K K      K L
Sbjct: 173 LDMKEIHTYKEMFQRAQALRRRAEDYHKCKIPPSARKPL 211


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 28.4 bits (64), Expect = 7.3
 Identities = 16/47 (34%), Positives = 24/47 (51%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
            KE E+++K+  + D       +K KEKK  K   K  +EE +K  E
Sbjct: 30  DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIE 76


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
           production and conversion].
          Length = 108

 Score = 27.0 bits (60), Expect = 7.4
 Identities = 16/87 (18%), Positives = 43/87 (49%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
           K+ E    ++ ++ K++ +   K+  ++  +  E+ ++E ++  +E   + +E+ E+  E
Sbjct: 13  KKAEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAE 72

Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKL 223
           +I  E +       SK  + K+ +  L
Sbjct: 73  EILAEAEKEASAILSKAAEGKVVEAAL 99


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 28.2 bits (64), Expect = 7.5
 Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 16/63 (25%)

Query: 192 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK---------EKERSSHEKEVIPK 242
           EK +E++       EK+ KS DK+ K + E L+K K+           E +  E +++  
Sbjct: 137 EKRLERL-------EKKAKSGDKEAKEELELLEKIKEHLEEGKPARTLELTDEEIKILKS 189

Query: 243 LTF 245
           L  
Sbjct: 190 LQL 192


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 28.5 bits (64), Expect = 7.5
 Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 6/54 (11%)

Query: 136 KKEKEKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
           K  KE  K K  K+D  + I   ++ K++  +KE      E +E K   + K+K
Sbjct: 58  KLNKEVAKLKIAKEDATELIAETKELKKEITEKE-----AEVQEAKAALDAKLK 106


>gnl|CDD|237796 PRK14712, PRK14712, conjugal transfer nickase/helicase TraI;
            Provisional.
          Length = 1623

 Score = 28.7 bits (63), Expect = 7.6
 Identities = 14/102 (13%), Positives = 57/102 (55%), Gaps = 6/102 (5%)

Query: 135  KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK------GKKEEKEKKEEKEFKMK 188
            +++ +E I+++ +++  + ++   ++K    D + E+      G++ ++    E+E  + 
Sbjct: 1517 QRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVREIAGQERDRAAITEREAALP 1576

Query: 189  EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
            E   +  +++R+  ++  +E   +++ ++++++ ++  +KEK
Sbjct: 1577 ESVLREPQRVREAVREVARENLLQERLQQMERDMVRDLQKEK 1618


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 28.3 bits (63), Expect = 7.6
 Identities = 29/139 (20%), Positives = 69/139 (49%), Gaps = 11/139 (7%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
           ED    ++++  +++++T  + + E   K    + KD+I  +E+++E   +K  E+  K 
Sbjct: 10  EDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKR 69

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE-----------KLK 224
           + E+++ +  K+ E++ K   +++K     E      D D + ++E           ++K
Sbjct: 70  KAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYEAWKLRELKRIK 129

Query: 225 KKKKEKERSSHEKEVIPKL 243
           + ++E+E    EK  I K+
Sbjct: 130 RDREEREEMEREKAEIEKM 148


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 28.2 bits (63), Expect = 7.7
 Identities = 21/66 (31%), Positives = 35/66 (53%), Gaps = 7/66 (10%)

Query: 170 EKGKKEEKEKKEEKEFKMKED------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
           E  +K+EKEKKE+ + + K+        E+ ++K  KE    + E    + D+  KK+K 
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYV-EYDRAGKKKKA 217

Query: 224 KKKKKE 229
           K K +E
Sbjct: 218 KSKYEE 223


>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897).  This is
           a bacterial family of uncharacterized proteins.
          Length = 55

 Score = 26.1 bits (58), Expect = 7.7
 Identities = 9/32 (28%), Positives = 14/32 (43%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD 166
           K   K K  +   +K K   +N E+D +   D
Sbjct: 24  KYSAKMKFPQFDKQKKKKAKRNAEQDAKNSAD 55


>gnl|CDD|237554 PRK13909, PRK13909, putative recombination protein RecB;
           Provisional.
          Length = 910

 Score = 28.8 bits (65), Expect = 7.9
 Identities = 26/105 (24%), Positives = 43/105 (40%), Gaps = 30/105 (28%)

Query: 130 DKKTHKKKEKEKIKKKKD-------------------KKDKDKIKNKEKDKEKKVD-KEK 169
           DK   +  EKEK  K ++                   KKD+      E      +D K  
Sbjct: 663 DKDYARALEKEKALKYEEEINVLYVAFTRAKNSLIVVKKDESSGSMFEI-----LDLKPL 717

Query: 170 EKGKKEEKEKKEE--KEFKMKEDKEKPV---EKIRKEEKDSEKEK 209
           E+G+ E KE K    KE  +   K KP    E++++ E++ +++ 
Sbjct: 718 ERGEIEIKEPKISPKKESLITSVKLKPHGYQEQVKEIEEEPKEDN 762



 Score = 28.4 bits (64), Expect = 10.0
 Identities = 14/63 (22%), Positives = 32/63 (50%), Gaps = 4/63 (6%)

Query: 127 IKKDKKTHKKKEK---EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
           +KKD+ +    E    + +++ + +  + KI  K++     V K K  G +E+ ++ EE+
Sbjct: 699 VKKDESSGSMFEILDLKPLERGEIEIKEPKISPKKESLITSV-KLKPHGYQEQVKEIEEE 757

Query: 184 EFK 186
             +
Sbjct: 758 PKE 760


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 28.5 bits (64), Expect = 8.0
 Identities = 17/68 (25%), Positives = 35/68 (51%), Gaps = 1/68 (1%)

Query: 145 KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
             ++K       + + K  ++ K+  K K ++K+K EE + ++KE KE+ + ++    K 
Sbjct: 36  DDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEE-LTELSAALKA 94

Query: 205 SEKEKKSK 212
            E E + K
Sbjct: 95  LEAELQDK 102


>gnl|CDD|153319 cd07635, BAR_GRAF2, The Bin/Amphiphysin/Rvs (BAR) domain of GTPase
           Regulator Associated with Focal adhesion 2.  BAR domains
           are dimerization, lipid binding and curvature sensing
           modules found in many different proteins with diverse
           functions. GTPase Regulator Associated with Focal
           adhesion kinase 2 (GRAF2), also called Rho GTPase
           activating protein 10 (ARHGAP10) or PS-GAP, is a GAP
           with activity towards Cdc42 and RhoA which regulates
           caspase-activated p21-activated protein kinase-2
           (PAK-2p34). GRAF2 interacts with PAK-2p34, leading to
           its stabilization and decrease of cell death. It is
           highly expressed in skeletal muscle and also interacts
           with PKNbeta, which is a target of Rho. GRAF2 contains
           an N-terminal BAR domain, followed by a Pleckstrin
           homology (PH) domain, a Rho GAP domain, and a C-terminal
           SH3 domain. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
           The BAR domain of the related protein GRAF directly
           interacts with its Rho GAP domain and inhibits its
           activity. Autoinhibited GRAF is capable of binding
           membranes and tubulating liposomes, showing that the
           membrane-tubulation and GAP-inhibitory functions of the
           BAR domain can occur simultaneously.
          Length = 207

 Score = 28.0 bits (62), Expect = 8.0
 Identities = 27/94 (28%), Positives = 46/94 (48%), Gaps = 8/94 (8%)

Query: 154 IKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE---KPVEKIRKEEKDSEKEKK 210
           I + E D E+ +D   ++     K  +E++E       E   KP+E+ RKE+  + KE+K
Sbjct: 54  IGDAETDDERCIDASLQEFSNFLKNLEEQREIMALNVTETLIKPLERFRKEQLGAVKEEK 113

Query: 211 SK-----DKDKKLKKEKLKKKKKEKERSSHEKEV 239
            K     +K+  L ++ L    K+KE    E +V
Sbjct: 114 KKFDKETEKNYSLLEKHLNLSAKKKEPQLQEADV 147


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 28.5 bits (64), Expect = 8.1
 Identities = 15/104 (14%), Positives = 43/104 (41%), Gaps = 1/104 (0%)

Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
           +++  + I+    + + D++  K ++   K  K ++K +        E+  + ++++E+ 
Sbjct: 321 RRRVNDVIRPLVREHNNDQLNVKLRNPSTKESKMRDKRRARLDPIDFEEVDEDEDEEEEQ 380

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
                +EE+  + E++     +    E           S  +KE
Sbjct: 381 RSDEHEEEEGEDSEEEGSQSREDGSSES-SSDVGSDSESKADKE 423


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 28.8 bits (65), Expect = 8.1
 Identities = 18/107 (16%), Positives = 34/107 (31%), Gaps = 9/107 (8%)

Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
           ++ K    D+K  +        K     +     N + +  +K  KEKE   + E  K  
Sbjct: 725 RRGKITVTDEKCPECGLPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGV 784

Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
                      K  EK++    ++  +  + D D    K       +
Sbjct: 785 GA---------KTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 28.3 bits (63), Expect = 8.4
 Identities = 39/172 (22%), Positives = 72/172 (41%), Gaps = 13/172 (7%)

Query: 64  KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 123
              VS  +++SP  K        PKEK+ S      L       +  +    ED+   + 
Sbjct: 247 SRSVSDSEESSPPSKK-------PKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQV 299

Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
            K   K K    ++ ++ I +KK       +K KE++KE+K  + ++   +  + K+E  
Sbjct: 300 VKKPVKRKNRRGQRARQAIWEKKYGSGAKHVK-KEREKEQKEREGRQSEWEARQAKREGG 358

Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE-----KLKKKKKEK 230
           + K     E    + +++    ++ +K K K   + K      + KKK KEK
Sbjct: 359 DAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEK 410


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 28.7 bits (65), Expect = 8.5
 Identities = 29/122 (23%), Positives = 48/122 (39%), Gaps = 27/122 (22%)

Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD--------KIKNKEKDKEKKVD----- 166
           ++     I     + KKKEKE+  ++  + D D          KN ++  +KK D     
Sbjct: 149 KVGCGVKILYYHSSLKKKEKEEFLERLKEGDFDILVTTSQFLSKNFDELPKKKFDFVFVD 208

Query: 167 ---------KEKEK-----GKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
                    K  +K     G  EE  +K  +  ++K   E+  EKIR+ E+   + K  K
Sbjct: 209 DVDAVLKSSKNIDKLLYLLGFSEEDIEKAMELIRLKRKYEEIYEKIRELEEKIAELKDKK 268

Query: 213 DK 214
             
Sbjct: 269 GV 270


>gnl|CDD|222579 pfam14179, YppG, YppG-like protein.  The YppG-like protein family
          includes the B. subtilis YppG protein, which is
          functionally uncharacterized. This family of proteins
          is found in bacteria. Proteins in this family are
          typically between 115 and 181 amino acids in length.
          There are two completely conserved residues (F and G)
          that may be functionally important.
          Length = 110

 Score = 27.0 bits (60), Expect = 8.8
 Identities = 15/73 (20%), Positives = 23/73 (31%), Gaps = 6/73 (8%)

Query: 13 RQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDK 72
           Q+    P+Q  P+     P      PP  Q      P   P   P  S    +  P   
Sbjct: 12 PQNQQQQPYQQQPYHQQMPPPPYS--PPQQQQGHFMPPQPQP--YPKQSPQ--QQQPPQF 65

Query: 73 TSPKEKMSSKEKT 85
          +S   +  + + T
Sbjct: 66 SSFLSQFKNSDGT 78


>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 350 to 473 amino acids in length. This domain is
           found associated with pfam07923.
          Length = 402

 Score = 28.4 bits (64), Expect = 8.9
 Identities = 21/78 (26%), Positives = 36/78 (46%)

Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
            +K + K   K+  EE+E  MK ++    E +  +  D  +  K    +      + + K
Sbjct: 12  SKKVRMKLATKQLWEEREKFMKYERGWDTESLVGDPLDISESVKELKLEMYTSLAEDQPK 71

Query: 227 KKEKERSSHEKEVIPKLT 244
           K+E ER S + E IP+L 
Sbjct: 72  KEEIERLSTDSEEIPRLD 89


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 27.9 bits (62), Expect = 8.9
 Identities = 21/103 (20%), Positives = 44/103 (42%), Gaps = 1/103 (0%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
           K+  ++ K+   K+     +  +++KE+  +K+           + E + + K +   P 
Sbjct: 110 KELTDEEKEPYYKEANSDRERYQREKEE-YNKKLPNKAPIGPFIENEPKIRPKVEGPSPD 168

Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
           + + +E K   K     D+ KK K     KK KE+  S + + 
Sbjct: 169 KALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPEA 211


>gnl|CDD|237972 PRK15483, PRK15483, type III restriction-modification system StyLTI
           enzyme res; Provisional.
          Length = 986

 Score = 28.5 bits (64), Expect = 8.9
 Identities = 23/111 (20%), Positives = 48/111 (43%), Gaps = 11/111 (9%)

Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK---EKDKEKKVDKEKE-KGK 173
           +GEI  D  ++ +++   ++  +KI  +K K D D  + +   + D +K +D+    K  
Sbjct: 577 VGEINSDSKVQLNEEKLTEEMIQKIVTEKQKVDPDFTELRLLEDLDDKKIIDRSNNFKED 636

Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
             +   +   E      ++  V       +D++   K   K +K   E+LK
Sbjct: 637 GFDWLLEAYPELFQTGLRKGKV-------RDNKPASKLTVKLRKENYEELK 680


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 28.1 bits (63), Expect = 9.0
 Identities = 26/101 (25%), Positives = 52/101 (51%), Gaps = 3/101 (2%)

Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK---MKEDKEKPVEKIRKEE 202
           K K+++ K K  E+  E   D ++E+  + E+++  + E +   ++ DKE        EE
Sbjct: 77  KWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEE 136

Query: 203 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
           +  E  KK+K+   +   E+ +++  E+E +  EKE   +L
Sbjct: 137 EKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177


>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
           transcription initiation/nucleotide excision repair
           factor TFIIH, subunit TFB3 [Cell division and chromosome
           partitioning / Transcription / DNA replication,
           recombination, and repair].
          Length = 314

 Score = 28.1 bits (62), Expect = 9.1
 Identities = 27/117 (23%), Positives = 51/117 (43%), Gaps = 11/117 (9%)

Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD--KEKEKGKKEEKEKKEEKEF 185
           +  +       +EK+KK  ++ ++D I N  +    +V   K+K   +K  K KK + E 
Sbjct: 119 ELLELIDVSLTEEKVKKY-EEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLER 177

Query: 186 KMKEDKEKPVEKIRKE--------EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
           +++E++    E  +KE         +D  K  K      K   E+L +   E  R++
Sbjct: 178 QIEEEERMNKEWTKKEIGNRLGTASEDGNKTIKIGIISDKFDPEELPRIVVEPTRNN 234


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 28.2 bits (63), Expect = 9.1
 Identities = 21/90 (23%), Positives = 31/90 (34%), Gaps = 7/90 (7%)

Query: 14  QSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKD-K 72
            ++ +     S   S      I          + P P   PP  PPP  ++   S  D  
Sbjct: 151 TTSDLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPPPPPPP--PPPPSLQQSTSAIDLI 208

Query: 73  TSPKEKMSSKEKT----SPKEKEYSKVKDV 98
              K + S+  KT     PK  E+  + DV
Sbjct: 209 KERKGQRSAAGKTLVLSKPKSPEFPNMLDV 238


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 28.3 bits (63), Expect = 9.2
 Identities = 25/175 (14%), Positives = 64/175 (36%), Gaps = 29/175 (16%)

Query: 74  SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 133
           S  +++ +  +T  +     +  D+ L   PM    ++    E   +    +        
Sbjct: 611 SNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESESDGSFIEVDSVSSTL 670

Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK-EDKE 192
             +   +     + +++                  + K    E E ++E E  +  E +E
Sbjct: 671 ELQVPSKSQPTDESEEN-----------------AENKVASIEGEHRKEIEDLLFDESEE 713

Query: 193 KPVEKIRKEEKD-----------SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
             +  + +EEKD           S +E ++ + +   ++  LK +K++++R + E
Sbjct: 714 DNIVGMIEEEKDADDFKNEWQDISLEELEALEANLLAEQNSLKAQKQQQKRIAAE 768


>gnl|CDD|148701 pfam07246, Phlebovirus_NSM, Phlebovirus nonstructural protein NS-M.
            This family consists of several Phlebovirus
           nonstructural NS-M proteins which represent the
           N-terminal region of the M polyprotein precursor. The
           function of this family is unknown.
          Length = 264

 Score = 27.9 bits (61), Expect = 9.3
 Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 1/67 (1%)

Query: 164 KVDKEKEKGKKEEKEKKEEKEFK-MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
           +VD EK K   EEK+   +++ K M+ED    +     ++K +E +K   D  +  +   
Sbjct: 140 RVDYEKLKKNAEEKDATIQRKTKEMEEDSRNQIAHHEIQQKKNEIQKLRNDLKRGQEHRD 199

Query: 223 LKKKKKE 229
            K + +E
Sbjct: 200 AKLRVEE 206


>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score = 28.2 bits (63), Expect = 9.3
 Identities = 16/79 (20%), Positives = 32/79 (40%)

Query: 140 EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR 199
           EK    +    K ++     +   ++ +E  +   E      + E + +E++EKP EK  
Sbjct: 203 EKYPPSRGYNLKSELVEYSSEIVNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKST 262

Query: 200 KEEKDSEKEKKSKDKDKKL 218
           +      KE     + K+L
Sbjct: 263 ESTFQLSKETSIAKESKEL 281


>gnl|CDD|222859 PHA02535, P, terminase ATPase subunit; Provisional.
          Length = 581

 Score = 28.5 bits (64), Expect = 9.5
 Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 2/68 (2%)

Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
           EKE K   ++K  +   +  E++ +  + S    ++        + K  K+K  K   S 
Sbjct: 66  EKENKTGGDYKEIDLLIRQHERLARVRRYSGTGNEADLNPNVANRNKGPKRKPVKNDIS- 124

Query: 236 EKEVIPKL 243
             E   KL
Sbjct: 125 -DEQTEKL 131


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 27.9 bits (62), Expect = 9.6
 Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 5/79 (6%)

Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
            E E+   + + +    +K+ E D E K     EK ++E +E K +K    + + E   E
Sbjct: 213 GENEENDDELEAEVISYLKDGENDNEVK-----EKIRREYREWKGDKANTNETEIEDESE 267

Query: 197 KIRKEEKDSEKEKKSKDKD 215
              +EE   E+E + K ++
Sbjct: 268 DEYEEEAGEEQENEDKGEE 286


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 27.3 bits (61), Expect = 9.6
 Identities = 13/59 (22%), Positives = 27/59 (45%)

Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
           +E + K  K    +      VE   +E+ ++ KE     + ++L+K + + KK +K   
Sbjct: 68  DEDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEP 126


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 28.4 bits (64), Expect = 9.7
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 165 VDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
           V KE+E+ +KE+K KKE KE +    + K +E+
Sbjct: 389 VYKEREEKEKEKKAKKEGKEERRIHFQNKSIEE 421


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 27.2 bits (61), Expect = 9.7
 Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 19/100 (19%)

Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
           K T   K   K +KK + K   +   KE  K     K ++  K E +  K+EK       
Sbjct: 50  KFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEK------- 102

Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
                       K   KEKK ++K++K + ++ KKK K +
Sbjct: 103 ------------KKRSKEKKEEEKERKRQLKQQKKKAKHR 130


>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive. 
            Pullulan is an unusual, industrially important
            polysaccharide in which short alpha-1,4 chains
            (maltotriose) are connected in alpha-1,6 linkages.
            Enzymes that cleave alpha-1,6 linkages in pullulan and
            release maltotriose are called pullulanases although
            pullulan itself may not be the natural substrate. In
            contrast, a glycogen debranching enzyme such GlgX,
            homologous to this family, can release glucose at
            alpha,1-6 linkages from glycogen first subjected to limit
            degradation by phosphorylase. Characterized members of
            this family include a surface-located pullulanase from
            Streptococcus pneumoniae (PMID:11083842) and an
            extracellular bifunctional amylase/pullulanase with
            C-terminal pullulanase activity (PMID:8798645).
          Length = 1111

 Score = 28.3 bits (63), Expect = 9.7
 Identities = 26/81 (32%), Positives = 31/81 (38%), Gaps = 8/81 (9%)

Query: 29   VAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSK------ 82
            V G     + PP  +H   PQ    P   P  SK KDKV PKD   P     S       
Sbjct: 1004 VGGIEAPEKTPPPPEHE--PQAPKPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGET 1061

Query: 83   EKTSPKEKEYSKVKDVELGVT 103
             K + ++KE    K   L  T
Sbjct: 1062 PKGNEEKKEEQPDKGANLPNT 1082


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 28.4 bits (63), Expect = 9.7
 Identities = 34/151 (22%), Positives = 57/151 (37%), Gaps = 17/151 (11%)

Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKE--KIKKKKDKKDKDKIKN----KEKDK-EKKVDKE 168
           EDLG+    K I   KK+   +E++  +++ +      ++  N    K KD     V+  
Sbjct: 36  EDLGK----KVIYDLKKSSSDEEEQDYELRPRVSSSWNNESYNRLPIKTKDNVVADVNNG 91

Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
           +E   + E E     E       EK      ++       K+  D +K+       K  +
Sbjct: 92  EEFLSESESE--ASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICTKIIE 149

Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRES 259
           E E +    E +    F   T M EK K+ S
Sbjct: 150 EPEENLGMMEEV----FSMITSMAEKAKKVS 176


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 26.7 bits (59), Expect = 9.8
 Identities = 10/24 (41%), Positives = 21/24 (87%)

Query: 161 KEKKVDKEKEKGKKEEKEKKEEKE 184
            E+K ++++E+ KKEE++++EE+E
Sbjct: 74  AEEKAEEKEEEKKKEEEKEEEEEE 97


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 28.3 bits (63), Expect = 9.8
 Identities = 17/52 (32%), Positives = 29/52 (55%)

Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
           K KKKK  K+K K KN+E  K+    +  E+G +    +K  +  ++K +K+
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 28.2 bits (63), Expect = 9.9
 Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 3/60 (5%)

Query: 24  SPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKE 83
              P  A        P +   P    P V+PP T P   ++     ++K  P  K+SS  
Sbjct: 411 PSAPQSATQPA-GTPPTVSVDPPAAVP-VNPPSTAPQ-AVRPAQFKEEKKIPVSKVSSLG 467


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 28.4 bits (64), Expect = 10.0
 Identities = 22/103 (21%), Positives = 59/103 (57%), Gaps = 3/103 (2%)

Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE-EKEFKMKEDKEKP 194
           + E+E +KK+KD+  K+++++ EK+  +  ++  +  ++ + EK   +   ++KE+ E+ 
Sbjct: 418 EIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQV 477

Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
             ++ + E++ +  K ++ +  KL   +L+K+ +  E    E+
Sbjct: 478 RLELEQAEREGDLAKAAELQYGKL--PELEKRLQAAEAKLGEE 518


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 27.2 bits (61), Expect = 10.0
 Identities = 18/97 (18%), Positives = 42/97 (43%), Gaps = 10/97 (10%)

Query: 101 GVTPMVPNIVK-----VTSSEDLGEIKKDKSIKKDKKTHKKKE---KEKIKKKKDKKDKD 152
            ++P +P + +     +    +   + +D  I   K+    K     E  +K + ++D+ 
Sbjct: 32  DISPFLPELRQKHKARIAKDPEFQYLNED--IAWLKEERDDKTVSLNEAERKAEREEDEK 89

Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
           K   +E ++ K + +E  K   +  + KEE +  + E
Sbjct: 90  KRLKRENERRKALGEEPLKSLDDLPKDKEEPDAYLDE 126


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.129    0.374 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,937,580
Number of extensions: 2186480
Number of successful extensions: 21410
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11087
Number of HSP's successfully gapped: 2657
Length of query: 444
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 344
Effective length of database: 6,502,202
Effective search space: 2236757488
Effective search space used: 2236757488
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 61 (27.4 bits)