RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy128
         (446 letters)



>gnl|CDD|192483 pfam10203, Pet191_N, Cytochrome c oxidase assembly protein PET191. 
           Pet191_N is the conserved N-terminal of a family of
           conserved proteins found from nematodes to humans. It
           carries six highly conserved cysteine residues. Pet191
           is required for the assembly of active cytochrome c
           oxidase but does not form part of the final assembled
           complex.
          Length = 68

 Score = 78.8 bits (195), Expect = 2e-18
 Identities = 32/63 (50%), Positives = 44/63 (69%), Gaps = 3/63 (4%)

Query: 363 DKSACAGSRADLKLCLLETDCVQVDRKTPRQCLREGLAP---QCEALKNTFFECKRSLLD 419
             S+C G R DL  CLLE+DCV+ +++TPR+CL+E L     +C AL+  FFECKR +LD
Sbjct: 1   MGSSCKGLREDLAECLLESDCVKKEKRTPRECLKEPLKELPEECIALRKAFFECKRGMLD 60

Query: 420 TRQ 422
            R+
Sbjct: 61  MRK 63


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 63.2 bits (153), Expect = 2e-10
 Identities = 42/122 (34%), Positives = 66/122 (54%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
            E KK    KK  +  KKK  E  K    KK  D+ K K ++K+K ++ K K EE +K +E
Sbjct: 1389 EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADE 1448

Query: 163  KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
             +KK E+  K +E K+K  E  + +E   + E+  K  + K K E+ KKK  E ++++  
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEA 1508

Query: 223  KE 224
            K+
Sbjct: 1509 KK 1510



 Score = 60.9 bits (147), Expect = 9e-10
 Identities = 35/127 (27%), Positives = 66/127 (51%), Gaps = 5/127 (3%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK----NKEKDKEKKEKEKGKKEE-K 157
            E  K K+    KK  + K+  +  K + +   D+ +      E  ++KKE+ K K +  K
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384

Query: 158  EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            +K EEK+K +E + K +EDK+K  E  +      + ++  K  ++K K ++ KKK +E +
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444

Query: 218  RSSHEKE 224
            ++   K+
Sbjct: 1445 KADEAKK 1451



 Score = 59.0 bits (142), Expect = 4e-09
 Identities = 36/122 (29%), Positives = 64/122 (52%), Gaps = 2/122 (1%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKK--DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
            E KK     K +      E E  ++K    +K K++ K K    +KK +EK K +E +KK
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399

Query: 161  EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
             E++KK+  E K     +K  ++ +K+ ++ +K  ++K K ++ KK    KKK E+ + +
Sbjct: 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKA 1459

Query: 221  HE 222
             E
Sbjct: 1460 EE 1461



 Score = 57.1 bits (137), Expect = 1e-08
 Identities = 53/186 (28%), Positives = 98/186 (52%), Gaps = 8/186 (4%)

Query: 71   KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK-KKEKEKIKKKK 129
            K+ E +K+K  EL         V+    ++  E KK + +KK ++ +K K  +E  K ++
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672

Query: 130  DKKDKDKIKNKEKDKEK-------KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
            DKK  ++ K  E+D++K       + +E  K EE +KKE +EKK+ +E K  E++ K   
Sbjct: 1673 DKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKA 1732

Query: 183  KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 242
            +  K+E + +K+K  + K  + +K+K+   KKE+E+ + E     +   +   D E++ +
Sbjct: 1733 EEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792

Query: 243  RESSPK 248
            R    K
Sbjct: 1793 RMEVDK 1798



 Score = 56.7 bits (136), Expect = 2e-08
 Identities = 48/181 (26%), Positives = 87/181 (48%), Gaps = 1/181 (0%)

Query: 44   SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
            +   +    K + + K+K  +K+K    +K+  + K  +        +  K    +    
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAA 1415

Query: 104  IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
             KK     K K   KKK  E  KK ++ K  D+ K K ++ +K E+ K K EE +K +E 
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475

Query: 164  EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
            +KK E+  K  E K+K  E+ +K+  +++K  ++K K  + KK +  KK  E +++   K
Sbjct: 1476 KKKAEEAKKADEAKKK-AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAK 1534

Query: 224  E 224
            +
Sbjct: 1535 K 1535



 Score = 55.5 bits (133), Expect = 4e-08
 Identities = 44/185 (23%), Positives = 94/185 (50%), Gaps = 4/185 (2%)

Query: 44   SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
            ++ K + + K K   K+K  + +K + ++K+  + K            + K  +++   +
Sbjct: 1362 AEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD 1421

Query: 104  IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
              K K+ +K K    KK+ E+ KK  + K K +   K ++ +KK +E  K +E +KK E+
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481

Query: 164  EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK----KKKEKERS 219
             KK ++  K  E+ +K  ++ +K  +  +K  ++K  ++  K ++ KK    KK ++ + 
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541

Query: 220  SHEKE 224
            + EK+
Sbjct: 1542 AEEKK 1546



 Score = 55.5 bits (133), Expect = 4e-08
 Identities = 35/115 (30%), Positives = 64/115 (55%), Gaps = 4/115 (3%)

Query: 114  KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
            KK  +KK+ ++ KK ++KK  D+ K K ++ +K ++ K K EE +KK +  KK+ +E K 
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343

Query: 174  KEDKEKPVEKIRKEEKDSEKEKKSKDKDK----KLKKEKLKKKKKEKERSSHEKE 224
              +  K   +   +E ++ +EK    + K    K K +  KKK +EK+++   K+
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398



 Score = 54.0 bits (129), Expect = 1e-07
 Identities = 45/208 (21%), Positives = 93/208 (44%), Gaps = 8/208 (3%)

Query: 49   KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 108
            K     K + + K + +E+ + + +++ + +              +   +++L + ++ K
Sbjct: 1231 KAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK 1290

Query: 109  SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
               + KK  +KK+ ++ KKK ++  K     K+ ++ KK+ +  KK+ +E K+  E  + 
Sbjct: 1291 KADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350

Query: 169  KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK-----LKKEKLKKKKKEKERSSHE- 222
            +     ++ E   EK    EK  E+ KK  D  KK      K ++ KKK +E ++ + E 
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410

Query: 223  --KEVIPKLTFKFGTDMEEKTKRESSPK 248
                   K   +     EEK K + + K
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKK 1438



 Score = 52.5 bits (125), Expect = 4e-07
 Identities = 44/181 (24%), Positives = 79/181 (43%), Gaps = 2/181 (1%)

Query: 45   KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 104
            K  ++    D+   K +   K+    K+   +K K  E           K   ++   E 
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA--KKKAEEKKKADEAKKKAEE 1442

Query: 105  KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
             K     K K    KK +E  KK ++ K  D+ K K ++ +K ++ K K EE +KK ++ 
Sbjct: 1443 AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502

Query: 165  KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
            KK  +  K  ++ +K  E  + +E    +E K  D+ KK +++K   + K+ E     +E
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562

Query: 225  V 225
             
Sbjct: 1563 K 1563



 Score = 51.7 bits (123), Expect = 7e-07
 Identities = 44/184 (23%), Positives = 89/184 (48%), Gaps = 8/184 (4%)

Query: 49   KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 108
            K     K    +K    +K    +K   K K  EL     +    +   +E+  + ++DK
Sbjct: 1517 KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576

Query: 109  SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
            ++   K    KK +E   ++  K  +++ K K ++ +K E+ K K EE +K EE++KK E
Sbjct: 1577 NMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVE 1636

Query: 169  KEFKMKEDKEKPVEKIRKEEKDSE--------KEKKSKDKDKKLKKEKLKKKKKEKERSS 220
            +  K + +++K  E+++K E++++        K ++ K K ++ KK +  +KK  +    
Sbjct: 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696

Query: 221  HEKE 224
              +E
Sbjct: 1697 EAEE 1700



 Score = 49.8 bits (118), Expect = 3e-06
 Identities = 50/191 (26%), Positives = 94/191 (49%), Gaps = 1/191 (0%)

Query: 59   KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 118
            +EK  ++E    +E +   ++  E         I +V    +  +  K +  KK ++   
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620

Query: 119  KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK-EDK 177
            K E+ K  +++ KK +   K + ++K+K E+ K  +EE + K  +E K+ +E K K E+ 
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEA 1680

Query: 178  EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDM 237
            +K  E  +K  +  +KE +   K ++LKK++ ++KKK +E    E+E   K         
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740

Query: 238  EEKTKRESSPK 248
            E+K K E + K
Sbjct: 1741 EDKKKAEEAKK 1751



 Score = 49.4 bits (117), Expect = 4e-06
 Identities = 40/121 (33%), Positives = 65/121 (53%), Gaps = 5/121 (4%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
            E KK +  KK  +  KK ++ K K ++ KK  +  K  E+ K+K ++ K   E K+K +E
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADE 1514

Query: 163  KEKKEEK----EFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKEKE 217
             +K EE     E K  E+ +K  E  + EEK    E KK+++  K  +K+K ++ KK +E
Sbjct: 1515 AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574

Query: 218  R 218
             
Sbjct: 1575 D 1575



 Score = 48.6 bits (115), Expect = 7e-06
 Identities = 49/154 (31%), Positives = 78/154 (50%), Gaps = 7/154 (4%)

Query: 106  KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
            K K+ +  K    KK+ E+ KKK D+  K     K+ D+ KK +E  K +E +K EE +K
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535

Query: 166  KEE---KEFKMKEDKEKPVEKIRK-EEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
             +E    E K K D+ K  E+++K EEK   +E K  ++DK +   K ++ KK +E    
Sbjct: 1536 ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595

Query: 222  EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 255
            E   + KL  +      E+ K+    KI  + +K
Sbjct: 1596 E---VMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626



 Score = 45.9 bits (108), Expect = 4e-05
 Identities = 41/131 (31%), Positives = 78/131 (59%), Gaps = 6/131 (4%)

Query: 98   SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
            +++  + ++ K   + KK  + K+ ++ KK ++KK  D++K  E+   KK +EK K EE 
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE--LKKAEEKKKAEEA 1569

Query: 158  EKKEEKE----KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
            +K EE +    +K E+  K +E + + V K+ +EEK  + E+  K ++ K+K E+LKK +
Sbjct: 1570 KKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAE 1629

Query: 214  KEKERSSHEKE 224
            +EK++    K+
Sbjct: 1630 EEKKKVEQLKK 1640



 Score = 45.5 bits (107), Expect = 6e-05
 Identities = 38/109 (34%), Positives = 66/109 (60%), Gaps = 5/109 (4%)

Query: 105  KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            K+ +  KK ++  KK+ +EK K ++ KK +++ K K ++ +K+ +E  KK E+ KK+E+E
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755

Query: 165  KKEEKEFKMKEDKE-----KPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
            KK+    K +E+K+     K  E + +EE D E EK+  + DKK+K   
Sbjct: 1756 KKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIF 1804



 Score = 43.2 bits (101), Expect = 3e-04
 Identities = 39/144 (27%), Positives = 75/144 (52%), Gaps = 3/144 (2%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK--- 159
            + KK +  KK ++  KK  +   K+ ++ K  +++K KE +++KK +E  K EE+ K   
Sbjct: 1673 DKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKA 1732

Query: 160  KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
            +E K++ EE + K +E K+   EK +      E+EKK+++  K+ +    ++  +E E+ 
Sbjct: 1733 EEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792

Query: 220  SHEKEVIPKLTFKFGTDMEEKTKR 243
              E +   K  F    ++ E  K 
Sbjct: 1793 RMEVDKKIKDIFDNFANIIEGGKE 1816



 Score = 42.8 bits (100), Expect = 3e-04
 Identities = 43/182 (23%), Positives = 84/182 (46%), Gaps = 17/182 (9%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI------KNKEKDKEKKEKEKGKKEE 156
             +KK +  KKD +  KK E+E+  ++  K ++ ++      +   K +E ++ ++ KK E
Sbjct: 1228 AVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAE 1287

Query: 157  KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE--KDSEKEKKSKDKDKKLKKEKLK---- 210
            ++KK ++ KK E++ K  E K+K  E  + +E  K +E+ KK  D  KK  +E  K    
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEA 1347

Query: 211  -----KKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAA 265
                 +   ++  ++ EK    +   +      +  K+++  K      K  +      A
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407

Query: 266  DE 267
            DE
Sbjct: 1408 DE 1409



 Score = 38.6 bits (89), Expect = 0.007
 Identities = 41/159 (25%), Positives = 74/159 (46%), Gaps = 15/159 (9%)

Query: 99   EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKG----- 152
            E+  + ++ +  +  KK    K+ E+ KK  ++ K  ++ +N E+ ++ +E         
Sbjct: 1209 EEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268

Query: 153  ----KKEEKEKKEEKEKKEEK----EFKMKEDKEKPVE-KIRKEEKDSEKEKKSKDKDKK 203
                K EE  K +E +K EEK    E K  E+K+K  E K + EE     E K K ++ K
Sbjct: 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328

Query: 204  LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 242
             K +  KKK +E ++++   +   +         EEK +
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367



 Score = 37.4 bits (86), Expect = 0.018
 Identities = 32/132 (24%), Positives = 75/132 (56%), Gaps = 11/132 (8%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKE 161
            E++K +  +K +   K +E+ K ++ +  +D  K +  +K +E KK+ E+ KK E+E+  
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN 1251

Query: 162  EKEKKEE---------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
            E+ +K E         ++  +K ++ +  ++++K E+  + ++  K ++KK K ++ KKK
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKK-KADEAKKK 1310

Query: 213  KKEKERSSHEKE 224
             +E +++   K+
Sbjct: 1311 AEEAKKADEAKK 1322



 Score = 35.9 bits (82), Expect = 0.051
 Identities = 33/148 (22%), Positives = 74/148 (50%), Gaps = 2/148 (1%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
            E +K +  +K ++  K ++ ++++  +  +D  K +   K ++ K+ E  +K E+ +K E
Sbjct: 1132 EARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAE 1191

Query: 163  KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD--KDKKLKKEKLKKKKKEKERSS 220
            + +K E   K +  ++   E+  +E + +E  KK++   K ++ KK+  + KK E+ER++
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN 1251

Query: 221  HEKEVIPKLTFKFGTDMEEKTKRESSPK 248
             E     +         +   K E + K
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARK 1279



 Score = 35.5 bits (81), Expect = 0.063
 Identities = 41/198 (20%), Positives = 91/198 (45%), Gaps = 13/198 (6%)

Query: 55   KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 114
            K + +++  + E    + +E  K ++++           ++  +E+  +IK +++ K+ +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740

Query: 115  KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
            +  KK E+ K  +++ KK     K +EK  E+  KEK    E+E  EE EK+  +  K  
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKI 1800

Query: 175  ED-------------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
            +D             +   V    KE +DS  ++ +  K+ +L++    +K K  + + +
Sbjct: 1801 KDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNEN 1860

Query: 222  EKEVIPKLTFKFGTDMEE 239
             ++   +  F    D++E
Sbjct: 1861 GEDGNKEADFNKEKDLKE 1878



 Score = 33.2 bits (75), Expect = 0.36
 Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 4/113 (3%)

Query: 120  KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE- 178
            K+ +   K+ ++ D+   +   K +E K+ E GK EE  K EE +KK E   K +E ++ 
Sbjct: 1077 KDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKA 1136

Query: 179  ---KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
               +  E+ RK E     E   K +D +  +E  K +  +K  ++ + E + K
Sbjct: 1137 EDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRK 1189



 Score = 33.2 bits (75), Expect = 0.39
 Identities = 36/132 (27%), Positives = 69/132 (52%), Gaps = 10/132 (7%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
            + KK ++ +K ++  K +E  K +  +  +   K + + K +E ++ E  KK E  KK E
Sbjct: 1174 DAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAE 1233

Query: 163  KEKKEEKEFKMKEDKEKPVEKIRKEE---------KDSEKEKKSKDKDKKLKKEKLKKKK 213
            + KK+ +E K K ++E+  E+IRK E         + +  + +   K  +LKK + KKK 
Sbjct: 1234 EAKKDAEEAK-KAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA 1292

Query: 214  KEKERSSHEKEV 225
             E +++  +K+ 
Sbjct: 1293 DEAKKAEEKKKA 1304


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 60.3 bits (147), Expect = 7e-10
 Identities = 24/76 (31%), Positives = 46/76 (60%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
             KK   KIK   +  EKK +E+ K+++K+    K+K+EE+E + ++ +E+  E+  + E
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463

Query: 189 KDSEKEKKSKDKDKKL 204
           ++ E+E++ K K   L
Sbjct: 464 EEKEEEEEKKKKQATL 479



 Score = 59.2 bits (144), Expect = 2e-09
 Identities = 21/79 (26%), Positives = 53/79 (67%)

Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
            +++ + +   +K  +K +K   K E+K ++E+KEKK++     K+++E+  EK +KEE+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 190 DSEKEKKSKDKDKKLKKEK 208
             E+E++++++ ++ +++K
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473



 Score = 58.8 bits (143), Expect = 3e-09
 Identities = 27/81 (33%), Positives = 52/81 (64%)

Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
            +++I+     K+  +K K   E+ EKK E+EKKE+K+      K++  E+  KE+K+ E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 193 KEKKSKDKDKKLKKEKLKKKK 213
           KE++ ++ +++ ++E+ KKKK
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKK 475



 Score = 58.0 bits (141), Expect = 4e-09
 Identities = 33/85 (38%), Positives = 55/85 (64%), Gaps = 10/85 (11%)

Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           T  KK  +KIKK  +K +K      ++++EKKEK+K     K+K+EE+E+++EK    KE
Sbjct: 403 TGSKKATKKIKKIVEKAEK------KREEEKKEKKKKAFAGKKKEEEEEEEKEK----KE 452

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDK 200
           ++++  E+  +EEK+ E+EKK K  
Sbjct: 453 EEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 53.4 bits (129), Expect = 1e-07
 Identities = 27/66 (40%), Positives = 48/66 (72%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           K  K IKK  +  +KK +E+ K+KK K    K K +E+++EK++KE+ K+EE+E+ EE++
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466

Query: 165 KKEEKE 170
           ++EE++
Sbjct: 467 EEEEEK 472



 Score = 52.6 bits (127), Expect = 2e-07
 Identities = 29/81 (35%), Positives = 53/81 (65%), Gaps = 1/81 (1%)

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
              EI+     KK  K   KK  EK +KK++++ K+K K     K+K+E+E+ +KE+KE+
Sbjct: 395 TEEEIEFLTGSKKATKK-IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453

Query: 160 KEEKEKKEEKEFKMKEDKEKP 180
           ++E+E++E +E K +E+++K 
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKK 474



 Score = 51.5 bits (124), Expect = 5e-07
 Identities = 24/83 (28%), Positives = 54/83 (65%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
           + E++  +   K   K  K   +K ++K +++K +K K     K+K++E++E+++ K+EE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 157 KEKKEEKEKKEEKEFKMKEDKEK 179
           KE++EE+ ++E++E + K+ K+ 
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 46.1 bits (110), Expect = 3e-05
 Identities = 22/75 (29%), Positives = 44/75 (58%)

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           +EE E     +K  +K  K+ E  EK  E+ +KE+K      K K+++++ +KEK +++K
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455

Query: 214 KEKERSSHEKEVIPK 228
           +E+E  + E++   +
Sbjct: 456 EEEEEEAEEEKEEEE 470



 Score = 44.9 bits (107), Expect = 7e-05
 Identities = 23/80 (28%), Positives = 52/80 (65%)

Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
           ++E +     KK  K+ K+  EK E+K  + K++K+K     +K+E++ E+EK+ K+++K
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455

Query: 203 KLKKEKLKKKKKEKERSSHE 222
           + ++E+ +++K+E+E    +
Sbjct: 456 EEEEEEAEEEKEEEEEKKKK 475



 Score = 38.4 bits (90), Expect = 0.007
 Identities = 23/82 (28%), Positives = 44/82 (53%), Gaps = 1/82 (1%)

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
            E+E +     +K  +KI+K  + +EK+++ + K+KK K    KKK++E+E    +KE  
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE-E 453

Query: 227 PKLTFKFGTDMEEKTKRESSPK 248
            K   +   + E++ + E   K
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKKK 475



 Score = 34.9 bits (81), Expect = 0.076
 Identities = 14/79 (17%), Positives = 40/79 (50%)

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
            ++E +     +   K ++KI ++ +   +E+K + K K    +K +++++E++    E+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454

Query: 224 EVIPKLTFKFGTDMEEKTK 242
           +   +   +   + EE+ K
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473



 Score = 31.8 bits (73), Expect = 0.79
 Identities = 17/108 (15%), Positives = 40/108 (37%), Gaps = 11/108 (10%)

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKK----SKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
           + +   +   ++    + +E+E +    SK   KK+KK   K +KK +E    +K+    
Sbjct: 377 IFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFA 436

Query: 229 LTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV 276
                    +++ + E   +   +  +          +E +    KQ 
Sbjct: 437 -------GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477



 Score = 31.0 bits (71), Expect = 1.4
 Identities = 20/107 (18%), Positives = 42/107 (39%), Gaps = 35/107 (32%)

Query: 47  KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 106
             K + K      EK   K +   KEK+       +                        
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKK------------------------ 439

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
                      K++E+E+ K+KK+++ +++ +  E++KE++E++K K
Sbjct: 440 -----------KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 60.0 bits (146), Expect = 1e-09
 Identities = 29/187 (15%), Positives = 61/187 (32%)

Query: 40  TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 99
           T      +++   K K    +  S    T  + KE  + K         V   +     +
Sbjct: 6   TKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKD 65

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
                + D   KK K   K    +   KKK K + D  K  EK     + +     +   
Sbjct: 66  TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDID 125

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
              +   ++ +    +  +  ++    +E D E +      D+  +K++ K+ +K  +  
Sbjct: 126 VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDD 185

Query: 220 SHEKEVI 226
               +  
Sbjct: 186 DFVWDED 192



 Score = 51.1 bits (123), Expect = 7e-07
 Identities = 26/171 (15%), Positives = 55/171 (32%), Gaps = 3/171 (1%)

Query: 55  KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 114
           K   KE + SK+KT  +  +        +  T                           K
Sbjct: 35  KEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKK 94

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           K   + +  K  +KK+  DKD   N  KD +   +     ++ +  +  +   + +   +
Sbjct: 95  KLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDE 154

Query: 175 EDKEKPVEKIRKEEKDS---EKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           +D E   +    +E +     KE +    D     ++   +   + R   +
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAK 205



 Score = 48.5 bits (116), Expect = 5e-06
 Identities = 26/142 (18%), Positives = 52/142 (36%), Gaps = 8/142 (5%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
              K+    ++  KKK K+   K K K          K++ K+  E  KK  ++  +   
Sbjct: 4   ASTKAELAAEEEAKKKLKKLAAKSKSK------GFITKEEIKEALESKKKTPEQIDQVLI 57

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
                     +  E  + K  K+ K + K   +K   KK  K++L   KK +++++ +K+
Sbjct: 58  FLSGMVKDTDDATESDIPK--KKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKD 115

Query: 225 VIPKLTFKFGTDMEEKTKRESS 246
                        +     +  
Sbjct: 116 DDLNYVKDIDVLNQADDDDDDD 137



 Score = 44.2 bits (105), Expect = 1e-04
 Identities = 22/132 (16%), Positives = 44/132 (33%), Gaps = 1/132 (0%)

Query: 86  TPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 145
           T            E   ++KK  +  K K    K+E ++  + K K  + +I        
Sbjct: 2   TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPE-QIDQVLIFLS 60

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
              K+     E +  ++K K   K    K   +K ++      K +EK+      D    
Sbjct: 61  GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNY 120

Query: 206 KEKLKKKKKEKE 217
            + +    +  +
Sbjct: 121 VKDIDVLNQADD 132



 Score = 39.2 bits (92), Expect = 0.004
 Identities = 26/147 (17%), Positives = 52/147 (35%), Gaps = 5/147 (3%)

Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
             K +   +++ K K K    K K KG   ++E KE  E K++   ++ +        + 
Sbjct: 5   STKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF----LS 60

Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
              KD++   +S    KK K        K   +   + E+      +   +  +K    +
Sbjct: 61  GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE-KKNALDKDDDLN 119

Query: 246 SPKIVIKPVKSPSPPPVAAADEYDTGD 272
             K +    ++         D+ D  D
Sbjct: 120 YVKDIDVLNQADDDDDDDDDDDLDDDD 146



 Score = 38.8 bits (91), Expect = 0.005
 Identities = 23/146 (15%), Positives = 46/146 (31%), Gaps = 4/146 (2%)

Query: 47  KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 106
           KD     +   PK+K  +  K +  +    K    EL  +            +D     K
Sbjct: 64  KDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK-KAEKKNALDKDDDLNYVK 122

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDK---IKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           D  +          + +      D  D D        + D +  ++++ KKE KE ++  
Sbjct: 123 DIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEK 189
           +  +    +   +  +   K  K   
Sbjct: 183 DDDDFVWDEDDSEALRQARKDAKLTA 208


>gnl|CDD|214584 smart00249, PHD, PHD zinc finger.  The plant homeodomain (PHD)
           finger is a C4HC3 zinc-finger-like motif found in
           nuclear proteins thought to be involved in epigenetics
           and chromatin-mediated transcriptional regulation. The
           PHD finger binds two zinc ions using the so-called
           'cross-brace' motif and is thus structurally related to
           the RING finger and the FYVE finger. It is not yet known
           if PHD fingers have a common molecular function. Several
           reports suggest that it can function as a
           protein-protein interacton domain and it was recently
           demonstrated that the PHD finger of p300 can cooperate
           with the adjacent BROMO domain in nucleosome binding in
           vitro. Other reports suggesting that the PHD finger is a
           ubiquitin ligase have been refuted as these domains were
           RING fingers misidentified as PHD fingers.
          Length = 47

 Score = 53.4 bits (128), Expect = 1e-09
 Identities = 23/48 (47%), Positives = 28/48 (58%), Gaps = 3/48 (6%)

Query: 278 ICPACGVQDDGSLPMIGCDGCDAWYHWVCVG--LVAEPETSDWFCPKC 323
            C  CG  DDG   ++ CDGCD WYH  C+G  L+ E     W+CPKC
Sbjct: 1   YCSVCGKPDDGGE-LLQCDGCDRWYHQTCLGPPLLEEEPDGKWYCPKC 47


>gnl|CDD|201356 pfam00628, PHD, PHD-finger.  PHD folds into an interleaved type of
           Zn-finger chelating 2 Zn ions in a similar manner to
           that of the RING and FYVE domains. Several PHD fingers
           have been identified as binding modules of methylated
           histone H3.
          Length = 51

 Score = 52.9 bits (127), Expect = 2e-09
 Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 4/50 (8%)

Query: 278 ICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPET---SDWFCPKCS 324
            C  CG  DD    ++ CDGCD W+H  C+G   EPE     +W+CP+C 
Sbjct: 1   YCAVCGKVDDDGE-LLLCDGCDRWFHLACLGPPLEPEEIPEGEWYCPECK 49


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 54.5 bits (131), Expect = 5e-08
 Identities = 39/129 (30%), Positives = 68/129 (52%), Gaps = 3/129 (2%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
            VK                 K+ K    KE+EK K++  ++ K K   KEK KE+ +  K
Sbjct: 76  AVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKK---KEKPKEEPKDRK 132

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
            K+E KEK+  KEK++EKE K++E +++  EK R+  +   + KK   K    KK++  +
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPE 192

Query: 212 KKKEKERSS 220
           ++K+++ + 
Sbjct: 193 EEKQRQAAR 201



 Score = 49.1 bits (117), Expect = 3e-06
 Identities = 48/190 (25%), Positives = 88/190 (46%)

Query: 78  VKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI 137
           VK VE G +       K          K+++  K+  K  KKK+KEK K++   +   + 
Sbjct: 77  VKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEE 136

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
             +++  ++KEKEK KK E+ +  E+EKK E+       K+ P +K   ++K+  +E+K 
Sbjct: 137 AKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196

Query: 198 KDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSP 257
           +   ++  K K ++    +ER   E +   + T     + +E  +     +     +K P
Sbjct: 197 RQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKP 256

Query: 258 SPPPVAAADE 267
            P P  A+ E
Sbjct: 257 DPSPSMASPE 266



 Score = 47.2 bits (112), Expect = 1e-05
 Identities = 42/155 (27%), Positives = 77/155 (49%), Gaps = 3/155 (1%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
            K   ++  K+ +K        K K  ++ + E GK+EEKEK++ KE+K++K+ K KE+ 
Sbjct: 69  SKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEP 128

Query: 178 EKPVEKIRKEEKDSEKEK---KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFG 234
           +    K   +EK   KEK   K K  ++   +E+ KK+++ + +S  +K    K   K  
Sbjct: 129 KDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKK 188

Query: 235 TDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 269
              EE+ +R+++ + V    + P        +E D
Sbjct: 189 EPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDD 223



 Score = 46.8 bits (111), Expect = 2e-05
 Identities = 48/231 (20%), Positives = 95/231 (41%), Gaps = 20/231 (8%)

Query: 35  VHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVK 94
           V   G+  P+            S KE+   KE+   ++K+                   K
Sbjct: 80  VEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKK-------------------K 120

Query: 95  VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
               ++  + +K K   K+K+  K+KEKEK KK ++ +D+++ K +E+ + K   +K  K
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
           ++   K+++  +EEK+ +   +  K   +     ++ EKE+    KD++     +++ + 
Sbjct: 181 KKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEED-DGKDRETTTSPMEEDES 239

Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAA 265
            +      +             M     RESS +   +P  S  PP    A
Sbjct: 240 RQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRTSLRPPSARPA 290



 Score = 40.3 bits (94), Expect = 0.002
 Identities = 34/97 (35%), Positives = 55/97 (56%), Gaps = 2/97 (2%)

Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
           + K    +  K   K   K    K K  KE K  +  KE+++EKE ++KE+K+K  EK +
Sbjct: 68  ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPK-NESGKEEEKEKE-QVKEEKKKKKEKPK 125

Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           +E KD + ++++K+K    +KEK K+KK E+ R   E
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 52.1 bits (126), Expect = 3e-07
 Identities = 33/130 (25%), Positives = 73/130 (56%), Gaps = 11/130 (8%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE---KDKEKKEKEKGKKEEKEKKEEKE 164
           K I + K    ++E ++I ++  K+ +   K      K++  K + + +KE +E++ E +
Sbjct: 26  KKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQ 85

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK- 223
           K E++  + +E+ ++ +E + K E++ EK++K  ++    K+++L+KK++E E    E+ 
Sbjct: 86  KLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQ----KQQELEKKEEELEELIEEQL 141

Query: 224 ---EVIPKLT 230
              E I  LT
Sbjct: 142 QELERISGLT 151



 Score = 43.2 bits (103), Expect = 2e-04
 Identities = 41/151 (27%), Positives = 83/151 (54%), Gaps = 23/151 (15%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKK--KDKKDK-DKIKNKEK---DKEKKEKEKGKKEEKE 158
           K+ ++IKK+     K+E  K++ +  K+ +++ ++++  EK    KE+    K +  EK 
Sbjct: 49  KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108

Query: 159 KKEEKEKKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK--KK 214
           ++E ++K++E E K +  E KE+ +E++ +E+   E E+ S      L  E+ K+   +K
Sbjct: 109 EEELEKKEKELEQKQQELEKKEEELEELIEEQLQ-ELERIS-----GLTAEEAKEILLEK 162

Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
            +E + HE  V+ K       ++EE+ K E+
Sbjct: 163 VEEEARHEAAVLIK-------EIEEEAKEEA 186



 Score = 36.3 bits (85), Expect = 0.027
 Identities = 29/122 (23%), Positives = 68/122 (55%), Gaps = 19/122 (15%)

Query: 103 EIKKDKSIKKDKKTHKKK----EKEKIKKKKD--KKDKDKIKNKEKDKEKKEKE-KGKKE 155
           E +K+   ++++    +K    ++E + +K +  +K +++++ KEK+ E+K++E + K+E
Sbjct: 72  EFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEE 131

Query: 156 EKEKKEEKEKK-----------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
           E E+  E++ +           E KE  +++ +E+   +     K+ E+E K ++ DKK 
Sbjct: 132 ELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEIEEEAK-EEADKKA 190

Query: 205 KK 206
           K+
Sbjct: 191 KE 192


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 52.0 bits (125), Expect = 4e-07
 Identities = 20/133 (15%), Positives = 65/133 (48%), Gaps = 3/133 (2%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
               +L E++++    +++    K+E E+++ +     +++++  E + E+ E+   + +
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEI-SLLRERLEELENELEELEERLEELK 329

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK--KEKLKKKK 213
           EK +  ++E +E +    + ++     +  KEE + +     ++ ++  +  +E+L + +
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELE 389

Query: 214 KEKERSSHEKEVI 226
            E     +E E +
Sbjct: 390 AELAEIRNELEEL 402



 Score = 45.9 bits (109), Expect = 3e-05
 Identities = 24/129 (18%), Positives = 68/129 (52%), Gaps = 4/129 (3%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           + + ++   K    +KE E+++++  + +++  + +E+ +E +++ +  K E E+  E+ 
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283

Query: 165 KKEEKEF----KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
           ++ ++E     +  E+ E  +  +R+  ++ E E +  ++  +  KEK++  K+E E   
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERE 343

Query: 221 HEKEVIPKL 229
              E + +L
Sbjct: 344 TLLEELEQL 352



 Score = 44.3 bits (105), Expect = 1e-04
 Identities = 25/127 (19%), Positives = 68/127 (53%), Gaps = 5/127 (3%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E+L E++++    + +    K E E+++++ ++  ++ ++ KE+  E+ E E     E+ 
Sbjct: 253 EELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEE-IEELEGEISLLRERL 311

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           ++ E E +E +E      +E   +    +E+  E+E   ++ ++ L + +  K++ E++ 
Sbjct: 312 EELENELEELEE----RLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367

Query: 219 SSHEKEV 225
           S+  +E+
Sbjct: 368 SALLEEL 374



 Score = 44.3 bits (105), Expect = 1e-04
 Identities = 29/136 (21%), Positives = 67/136 (49%), Gaps = 6/136 (4%)

Query: 114 KKTHKKKEK--EKIKKKKDKKDK--DKIKNKEKDKEKKEKEKGKKEE-KEKKEEKEKKEE 168
            K  ++KE+   K+++ ++  ++  D ++  EK  EK E++  K E  +E K E  + E 
Sbjct: 168 SKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL 227

Query: 169 KEFKMK-EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
                K ++  K +E++ +E    E+E +   ++ +  ++++++ K E E    E E + 
Sbjct: 228 ALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQ 287

Query: 228 KLTFKFGTDMEEKTKR 243
           +   +   ++EE    
Sbjct: 288 EELLELKEEIEELEGE 303



 Score = 41.6 bits (98), Expect = 8e-04
 Identities = 18/128 (14%), Positives = 65/128 (50%), Gaps = 1/128 (0%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
            E+L E+K++   + + +    +E+ +  + + ++ +++++  ++  E  ++E  ++E  
Sbjct: 287 QEELLELKEEIE-ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            ++ E+   E +E K + +++        EE      ++  + + +L + + + ++ ++E
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKRE 405

Query: 218 RSSHEKEV 225
             S E+ +
Sbjct: 406 IESLEERL 413



 Score = 40.1 bits (94), Expect = 0.003
 Identities = 30/142 (21%), Positives = 69/142 (48%), Gaps = 20/142 (14%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
           GEI   +   ++ +   ++ +E++++ K+K +  K + +E++   +E E+   E +E KE
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE 361

Query: 162 EKEK---------KEEKE----------FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
           E E+         +E  E           ++ E + + +E++++E +  E+  +   +  
Sbjct: 362 ELEEKLSALLEELEELFEALREELAELEAELAEIRNE-LEELKREIESLEERLERLSERL 420

Query: 203 KLKKEKLKKKKKEKERSSHEKE 224
           +  KE+LK+ + E E    E E
Sbjct: 421 EDLKEELKELEAELEELQTELE 442



 Score = 40.1 bits (94), Expect = 0.003
 Identities = 21/128 (16%), Positives = 55/128 (42%), Gaps = 6/128 (4%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK- 157
           E+   + ++      +    K+E E+      ++ ++  +   ++  + E E  +   + 
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNEL 399

Query: 158 -EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
            E K E E  EE+     E   + +E +++E K+ E E +    + +   E+L++ +++ 
Sbjct: 400 EELKREIESLEER----LERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455

Query: 217 ERSSHEKE 224
           E      +
Sbjct: 456 EELRDRLK 463



 Score = 38.9 bits (91), Expect = 0.006
 Identities = 21/116 (18%), Positives = 62/116 (53%), Gaps = 2/116 (1%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-DKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           K++    +   E+++++ ++ ++   + K +    ++E E+ +   +E +EE E+ EE+ 
Sbjct: 694 KNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL 753

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKERSSHEKEV 225
            +++E  E+  E++   E+   K K+  ++ + K +  + + ++ E+E    E+ +
Sbjct: 754 EELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRL 809



 Score = 38.5 bits (90), Expect = 0.007
 Identities = 23/130 (17%), Positives = 65/130 (50%), Gaps = 7/130 (5%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKK-KKDKKDKDKIKNKE--KDKEKKEKEKGKK 154
            E+L  ++      + +    +++ EK ++ ++ K +  +++        ++  KE  + 
Sbjct: 185 EENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEEL 244

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
           EE+  + E+E +E +E   + +KE  +E+++ E +  E  ++ ++  ++L + K + ++ 
Sbjct: 245 EEELSRLEEELEELQEELEEAEKE--IEELKSELE--ELREELEELQEELLELKEEIEEL 300

Query: 215 EKERSSHEKE 224
           E E S   + 
Sbjct: 301 EGEISLLRER 310



 Score = 38.5 bits (90), Expect = 0.008
 Identities = 25/132 (18%), Positives = 68/132 (51%), Gaps = 4/132 (3%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIK-KKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
             + L E+++     + +    K+E   ++ + +  + + +   +E ++ ++E E+ ++ 
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQER 759

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKP---VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
            +E +EE E  EE   K+KE+ E+     + +++E ++ E+E +  ++     + +L+  
Sbjct: 760 LEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESL 819

Query: 213 KKEKERSSHEKE 224
           ++ +ER   E E
Sbjct: 820 EQRRERLEQEIE 831



 Score = 37.4 bits (87), Expect = 0.015
 Identities = 26/126 (20%), Positives = 69/126 (54%), Gaps = 3/126 (2%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
             L E++++    +++    ++  E+++++ +  ++   K KE+ +E +EK +  +EE E
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           + EE+ ++ E+     +  E+ +E + +  +  E+E +  +++ +  +EKL + ++E E 
Sbjct: 797 ELEEELEEAERR---LDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853

Query: 219 SSHEKE 224
              E E
Sbjct: 854 LEKELE 859



 Score = 37.4 bits (87), Expect = 0.016
 Identities = 27/144 (18%), Positives = 62/144 (43%), Gaps = 18/144 (12%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK----------DKEKKE 148
             L   +++    +D     +K+ EK++++ +K ++ +    E             ++  
Sbjct: 179 RKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELR 238

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD--------K 200
           KE  + EE+  + E+E +E +E   + +KE    K   EE   E E+  ++        +
Sbjct: 239 KELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIE 298

Query: 201 DKKLKKEKLKKKKKEKERSSHEKE 224
           + + +   L+++ +E E    E E
Sbjct: 299 ELEGEISLLRERLEELENELEELE 322



 Score = 37.0 bits (86), Expect = 0.023
 Identities = 17/118 (14%), Positives = 57/118 (48%), Gaps = 4/118 (3%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKE----KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
             +L E+K++    +++     +     KE++K+ + + ++ + + +E ++E +E E+  
Sbjct: 396 RNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           +E +++ +E E++  +  +  +  EK +  +       E E+++    + + +     
Sbjct: 456 EELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESG 513



 Score = 35.5 bits (82), Expect = 0.065
 Identities = 23/129 (17%), Positives = 67/129 (51%), Gaps = 2/129 (1%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK-DKKDKDKIKNKEKDKEKKEKEKGKKEE 156
            E+L E++      +++    K E   ++    + + + +   ++ ++ K+E    ++E 
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEEL 732

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           ++ +   E+ EE+  +++E+ E+  E++ + E++ E  +++  K K+  +E  +K++  +
Sbjct: 733 EQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQ 792

Query: 217 E-RSSHEKE 224
           E     E+E
Sbjct: 793 EELEELEEE 801



 Score = 35.1 bits (81), Expect = 0.088
 Identities = 27/118 (22%), Positives = 67/118 (56%), Gaps = 1/118 (0%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           +S+++ ++  +++ +E  ++ ++ ++K     +E ++ +KE E+ K+E +E + EKE + 
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE-EL 875

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
           E E K  E++++ +E+  +E +    E K + +  + + E+L+ K +  E    E E 
Sbjct: 876 EDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEE 933



 Score = 35.1 bits (81), Expect = 0.097
 Identities = 18/117 (15%), Positives = 66/117 (56%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
            E+++ +      +   +  +++ ++ + + ++ + + +E +++  E E+  +E +++ E
Sbjct: 800 EELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELE 859

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           E +++ E+    KE+ E  ++++ +E+++ E+E +  + +    KE+++K ++  E 
Sbjct: 860 ELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916



 Score = 35.1 bits (81), Expect = 0.099
 Identities = 28/120 (23%), Positives = 71/120 (59%), Gaps = 4/120 (3%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE--KGKKEE 156
           E L E++++    + +    K+E E+++ +K++  +D++K  E++KE+ E+E  + + E 
Sbjct: 842 EKLDELEEELEELEKELEELKEELEELEAEKEEL-EDELKELEEEKEELEEELRELESEL 900

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRK-EEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
            E KEE EK  E+  +++   E+   ++ + EE+  E+ + + + + + + E+L+++ + 
Sbjct: 901 AELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEA 960



 Score = 33.9 bits (78), Expect = 0.22
 Identities = 28/146 (19%), Positives = 80/146 (54%), Gaps = 20/146 (13%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-------------DKIKNKEKDK 144
            E+L  +++  +  K++    +++++ ++++ ++ ++              ++++ E+ +
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRR 823

Query: 145 EKKEKEKGKKEEK------EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
           E+ E+E  + EE+      +  E +E+ EE E +++E KE+ +E++  E+++ E E K  
Sbjct: 824 ERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEE-LEELEAEKEELEDELKEL 882

Query: 199 DKDKKLKKEKLKKKKKEKERSSHEKE 224
           +++K+  +E+L++ + E      E E
Sbjct: 883 EEEKEELEEELRELESELAELKEEIE 908



 Score = 32.0 bits (73), Expect = 0.71
 Identities = 20/68 (29%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           G  + KE+KEE E+K E+     E+  + +E +  EE + + EK  +  +K  + ++LK 
Sbjct: 166 GVSKYKERKEEAERKLER----TEENLERLEDLL-EELEKQLEKLERQAEKAERYQELKA 220

Query: 212 KKKEKERS 219
           + +E E +
Sbjct: 221 ELRELELA 228



 Score = 32.0 bits (73), Expect = 0.81
 Identities = 23/97 (23%), Positives = 56/97 (57%), Gaps = 4/97 (4%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           EI++ +   ++ +    + +E++++ + + ++ K + +E + EK+E E   KE +E+KEE
Sbjct: 829 EIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEE 888

Query: 163 KEKKEEKEF----KMKEDKEKPVEKIRKEEKDSEKEK 195
            E++  +      ++KE+ EK  E++ + E   E+ +
Sbjct: 889 LEEELRELESELAELKEEIEKLRERLEELEAKLERLE 925


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 51.1 bits (122), Expect = 8e-07
 Identities = 33/146 (22%), Positives = 78/146 (53%), Gaps = 2/146 (1%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E+   +  D     +++    +E  + ++++ +  K +++ KE++   +  ++ K+EEKE
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELE-KEEEILAQVLKENKEEEKE 281

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           KK ++E+ +    + +E K + ++  R++  D EK K+S+ K+ K  +++LKK+K+E E 
Sbjct: 282 KKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESE-KELKKLEKELKKEKEEIEE 340

Query: 219 SSHEKEVIPKLTFKFGTDMEEKTKRE 244
              E + +         + E+  K +
Sbjct: 341 LEKELKELEIKREAEEEEEEQLEKLQ 366



 Score = 48.8 bits (116), Expect = 4e-06
 Identities = 32/130 (24%), Positives = 72/130 (55%), Gaps = 2/130 (1%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
               E K +K   +++K  K K +E+  +  +++ K++ +  E+++   E+E+  KEE+ 
Sbjct: 777 LAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEEL 836

Query: 159 KKEEKEKKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           ++   E KEE++ +    E+ E+  E+I KEE   E   K ++ +++  K++L+ K++++
Sbjct: 837 EELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKE 896

Query: 217 ERSSHEKEVI 226
           +    E E  
Sbjct: 897 KEEKKELEEE 906



 Score = 47.3 bits (112), Expect = 1e-05
 Identities = 22/135 (16%), Positives = 67/135 (49%), Gaps = 1/135 (0%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
            +  ++K+   ++++   +    K   ++    ++  + + +E +  K+E EK ++   +
Sbjct: 211 LEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD-KKLKKEKLKKKKKEKE 217
             +E +++E+++   +E+ +   ++  + + +  K ++ K  D +KLK+ + + KK EKE
Sbjct: 271 VLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKE 330

Query: 218 RSSHEKEVIPKLTFK 232
               ++E+       
Sbjct: 331 LKKEKEEIEELEKEL 345



 Score = 46.9 bits (111), Expect = 2e-05
 Identities = 38/166 (22%), Positives = 81/166 (48%)

Query: 59  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 118
           +E++ S ++   KE+E       E         + +        E ++ KS     +  K
Sbjct: 251 QEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 310

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
             ++EK+K+ + +  K + + K++ +E +E EK  KE + K+E +E++EE+  K++E  E
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLE 370

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +  E++  ++K   +   S  K K+ + E   +++KE +      E
Sbjct: 371 QLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSE 416



 Score = 46.5 bits (110), Expect = 3e-05
 Identities = 24/126 (19%), Positives = 60/126 (47%), Gaps = 4/126 (3%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK----IKNKEKDKEKKEKEKGKKEEKE 158
            I   + +K  +   K++ K+ ++  + K+  +     +   +  K  +E+    +E   
Sbjct: 189 LIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLR 248

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
            ++E+ +  ++E + +E+    V K  KEE+  +K ++ + K    ++E+LK +  + ER
Sbjct: 249 DEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLER 308

Query: 219 SSHEKE 224
              + E
Sbjct: 309 RKVDDE 314



 Score = 45.3 bits (107), Expect = 5e-05
 Identities = 28/138 (20%), Positives = 72/138 (52%), Gaps = 4/138 (2%)

Query: 91  NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
              K+   E+  E  + K  +++++  +   KEK   ++++K +     +EK+++ K +E
Sbjct: 742 LEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQE 801

Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD----SEKEKKSKDKDKKLKK 206
           +  +  +E+ +E+ +  E+E  + E +EK  E+  +E        +K +K  +++ +  +
Sbjct: 802 EELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLE 861

Query: 207 EKLKKKKKEKERSSHEKE 224
           E++ K++  +E    E+E
Sbjct: 862 EEITKEELLQELLLKEEE 879



 Score = 44.2 bits (104), Expect = 1e-04
 Identities = 35/179 (19%), Positives = 83/179 (46%), Gaps = 3/179 (1%)

Query: 71  KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK---K 127
           K +   + ++++     +   + K+   ++     K +  +       K  ++KIK   +
Sbjct: 692 KNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEE 751

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
           +++K    K + +E+  E   KEK   EE+EK E+ + +EEKE K+K  +E+      + 
Sbjct: 752 EEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEEL 811

Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
           ++++E  ++ +   ++ +K K ++ ++       E+++      +     EE TK E  
Sbjct: 812 KEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELL 870



 Score = 43.4 bits (102), Expect = 2e-04
 Identities = 30/128 (23%), Positives = 58/128 (45%), Gaps = 8/128 (6%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
             KK++  K  ++T    E   I  ++ K  + K+K + K   +  + K +K E E++  
Sbjct: 170 RKKKERLKKLIEETENLAELI-IDLEELKLQELKLKEQAKKALEYYQLK-EKLELEEENL 227

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK------EKKSKDKDKKLKKEKLKKKKKEK 216
                 K  + + D  + + +  +EE +S K      E+      K+ K+E+ +KK +E+
Sbjct: 228 LYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEE 287

Query: 217 ERSSHEKE 224
           E     KE
Sbjct: 288 ELKLLAKE 295



 Score = 41.9 bits (98), Expect = 7e-04
 Identities = 31/150 (20%), Positives = 75/150 (50%), Gaps = 4/150 (2%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKI-KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           E   E K +K  +++ +  +++  ++   ++   K+++  + K KD+ + ++EK K+E+K
Sbjct: 842 ELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKK 901

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK---EKLKKKKK 214
           E +EE +K    E K  E +E+  E+     K   + ++   ++   K+   +  +++++
Sbjct: 902 ELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEE 961

Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRE 244
             +R    KE +  +      + EEK +R 
Sbjct: 962 RNKRLLLAKEELGNVNLMAIAEFEEKEERY 991



 Score = 40.7 bits (95), Expect = 0.002
 Identities = 32/152 (21%), Positives = 72/152 (47%), Gaps = 11/152 (7%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKE-------KEKGKKEE 156
           K +K +KK+K+  ++ EKE  + +  +  ++++ +  EK +EK E        +K  + E
Sbjct: 326 KLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESE 385

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK---LKKKK 213
           +     K K+EE E K +E+KE  +     E+++   +++ K++ K +++ +     K+ 
Sbjct: 386 RLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQG 445

Query: 214 KEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
           K  E     ++   KL        + +   + 
Sbjct: 446 KLTEEKEELEKQALKLLKDKLELKKSEDLLKE 477



 Score = 40.3 bits (94), Expect = 0.002
 Identities = 31/152 (20%), Positives = 71/152 (46%), Gaps = 14/152 (9%)

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
            K+K+KE++KK  ++ +       + ++ K ++ K K++ K+  E  + K EK    +E+
Sbjct: 168 EKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLK-EKLELEEEN 226

Query: 177 -------------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
                         +   E +R E+++ E  K+  +K++++  + LK+ K+E++    ++
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQE 286

Query: 224 EVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 255
           E +  L  +      E  K E       + +K
Sbjct: 287 EELKLLAKEEEELKSELLKLERRKVDDEEKLK 318



 Score = 40.0 bits (93), Expect = 0.003
 Identities = 29/164 (17%), Positives = 68/164 (41%), Gaps = 24/164 (14%)

Query: 99   EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK--------- 149
             +  ++K +   K++K+  +KKE E+  +K +  ++ + + +E+  E+            
Sbjct: 880  LEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPE 939

Query: 150  --------EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP------VEKIRKEEKDSEKEK 195
                    EK K+E+ +++EE+  K     K +             EK  +  KD  K++
Sbjct: 940  ELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKE 999

Query: 196  KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEE 239
            + +++ K+L +E +++  +  +    E  V             E
Sbjct: 1000 RLEEEKKELLREIIEETCQRFKE-FLELFVSINRGLNKVFFYLE 1042



 Score = 38.4 bits (89), Expect = 0.009
 Identities = 30/120 (25%), Positives = 57/120 (47%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
            E ++  S  K K+   + + E+ K+ K   +  + +     +EKKE+ K  +E +E  E
Sbjct: 382 LESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLE 441

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
            K+ K  +E +  E +   + K + E K SE   K     K L++ +L   +++ E +S 
Sbjct: 442 TKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLLLRQKLEEASQ 501



 Score = 31.1 bits (70), Expect = 1.4
 Identities = 22/104 (21%), Positives = 41/104 (39%)

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           +EK KK+E+ KK  +E +   E  +  ++ K  E   KE+     E     +  +L++E 
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226

Query: 209 LKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
           L      K        +   L  +       K + E   +I+ +
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270



 Score = 30.7 bits (69), Expect = 2.1
 Identities = 16/115 (13%), Positives = 42/115 (36%)

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
           + K+++ K+  ++ E   +   + +E K  ++K  ++        + K+  + ++     
Sbjct: 170 RKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLY 229

Query: 202 KKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
               K   ++    +E    E+E I     +   + E   +     K   K  K 
Sbjct: 230 LDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKL 284


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 49.3 bits (118), Expect = 9e-07
 Identities = 21/77 (27%), Positives = 42/77 (54%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
             D++  KK + ++  + ++ K+++K     +DKE K   + + EE E++ E+E +E  +
Sbjct: 45  AADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104

Query: 171 FKMKEDKEKPVEKIRKE 187
              KE +EK    + KE
Sbjct: 105 ENEKETEEKTESNVEKE 121



 Score = 38.9 bits (91), Expect = 0.002
 Identities = 22/93 (23%), Positives = 47/93 (50%), Gaps = 1/93 (1%)

Query: 91  NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
            +   +S  D     + ++ K D +   + E+ K ++K+    +DK    + +KE +E E
Sbjct: 33  QLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92

Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
           +  +EE E+  ++ +KE +E K + + EK +  
Sbjct: 93  EENEEEDEESSDENEKETEE-KTESNVEKEITN 124



 Score = 38.6 bits (90), Expect = 0.003
 Identities = 22/76 (28%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE-KIR 185
              D+   D+ + K+ D ++  + +  KEE+++    E KE+K    KED+E   E +  
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98

Query: 186 KEEKDSEKEKKSKDKD 201
            EE   E EK++++K 
Sbjct: 99  DEESSDENEKETEEKT 114



 Score = 38.6 bits (90), Expect = 0.003
 Identities = 21/85 (24%), Positives = 40/85 (47%), Gaps = 2/85 (2%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
                   ++++ KK  D++  +  + KE++KE    E  + +   +KE++E +EE E +
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE-E 97

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKS 197
             E+     EK   EEK     +K 
Sbjct: 98  EDEESSDENEKE-TEEKTESNVEKE 121



 Score = 38.6 bits (90), Expect = 0.003
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           K D     + +  K++EKE    +  +   D  K  E+ +E+ E+E  +  ++ +KE +E
Sbjct: 53  KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEE 112

Query: 165 KKEEKEFKMKEDKE-KPV 181
           K E    K   +   KPV
Sbjct: 113 KTESNVEKEITNPSWKPV 130



 Score = 38.2 bits (89), Expect = 0.005
 Identities = 18/86 (20%), Positives = 44/86 (51%), Gaps = 1/86 (1%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKK 154
            S +   + ++ K     +    ++ KE+ K+    +  +DK   +++D+E +E+ + + 
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKP 180
           EE   + EKE +E+ E  ++++   P
Sbjct: 100 EESSDENEKETEEKTESNVEKEITNP 125



 Score = 37.0 bits (86), Expect = 0.010
 Identities = 21/88 (23%), Positives = 49/88 (55%), Gaps = 3/88 (3%)

Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
           +        E+E  K +++E  E +E KEE++     + ++  +K   E++D E E++++
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKE--DKGDAEKEDEESEEENE 96

Query: 199 DKDKKLKKEKLKKKKKEKERSSHEKEVI 226
            ++ +   ++ +K+ +EK  S+ EKE+ 
Sbjct: 97  -EEDEESSDENEKETEEKTESNVEKEIT 123



 Score = 35.5 bits (82), Expect = 0.035
 Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 7/78 (8%)

Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
              D+     +E  K   ++    +E KE+++E    E+KE K   +KE        EE 
Sbjct: 39  SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKED-------EES 91

Query: 190 DSEKEKKSKDKDKKLKKE 207
           + E E++ ++   + +KE
Sbjct: 92  EEENEEEDEESSDENEKE 109



 Score = 31.2 bits (71), Expect = 0.84
 Identities = 14/78 (17%), Positives = 37/78 (47%), Gaps = 8/78 (10%)

Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
              +   +++E K+  +++  +++E KE        EEK++   +  +DK    K+++  
Sbjct: 40  PSDQAAADEQEAKKSDDQETAEIEEVKE--------EEKEAANSEDKEDKGDAEKEDEES 91

Query: 211 KKKKEKERSSHEKEVIPK 228
           +++ E+E      E   +
Sbjct: 92  EEENEEEDEESSDENEKE 109



 Score = 31.2 bits (71), Expect = 0.86
 Identities = 10/66 (15%), Positives = 32/66 (48%)

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
               ++   +E+E K+  + +  E +E   E+      + +++K   +K+ +  +E+ ++
Sbjct: 38  SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97

Query: 212 KKKEKE 217
           + +E  
Sbjct: 98  EDEESS 103


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 49.8 bits (119), Expect = 1e-06
 Identities = 36/141 (25%), Positives = 73/141 (51%), Gaps = 5/141 (3%)

Query: 108 KSIKKDKKTHKKKEKEKIKK---KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
             I++ KK   KKE+E+ KK   + ++ +K +   + + KE +++   +K  K+ ++  +
Sbjct: 53  NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112

Query: 165 KKEEKEFKMKEDKEKPV-EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           + EEK+ + +E K K   E   K E ++EK+ K + K K+ ++E   K   E ++ + E 
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAK-KQAEEEAKAKAAAEAKKKAAEA 171

Query: 224 EVIPKLTFKFGTDMEEKTKRE 244
           +   +   K   + + K K E
Sbjct: 172 KKKAEAEAKAKAEAKAKAKAE 192



 Score = 46.8 bits (111), Expect = 1e-05
 Identities = 31/111 (27%), Positives = 51/111 (45%), Gaps = 4/111 (3%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK---EEKEKKEE 168
           K K+  + K K+  + K   + + + K KE+ K++ E+E   K   E K    E +KK E
Sbjct: 117 KQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE 176

Query: 169 KEFKMKED-KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
            E K K + K K   +  K + ++ K K + +   K + E       E ER
Sbjct: 177 AEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAER 227



 Score = 46.0 bits (109), Expect = 2e-05
 Identities = 27/111 (24%), Positives = 51/111 (45%), Gaps = 2/111 (1%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
            K+ +     +K    K+ E+  K+ ++K K   + K K   + + +   + EK+ KEE 
Sbjct: 91  QKELEQRAAAEK--AAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEA 148

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
           +K+ E+E K K   E   +    ++K   + K   +   K K E+ K K +
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAE 199



 Score = 44.4 bits (105), Expect = 7e-05
 Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 9/122 (7%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK-- 160
             KK +   ++ +  +  E+ + K+ + +    +   K+ ++  K+ E+ +K+ +E K  
Sbjct: 69  RQKKLEQQAEEAEKQRAAEQARQKELEQRA-AAEKAAKQAEQAAKQAEEKQKQAEEAKAK 127

Query: 161 ---EEKEKKEEK-EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
              E K K E + E K KE+ +K  E+  K +  +E +KK+ +  KK + E   K K E 
Sbjct: 128 QAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAE--AKAKAEA 185

Query: 217 ER 218
           + 
Sbjct: 186 KA 187



 Score = 34.4 bits (79), Expect = 0.11
 Identities = 21/77 (27%), Positives = 34/77 (44%), Gaps = 5/77 (6%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE--KEKKEE 162
            K+++ K+ ++  K K   + KKK  +    K K + + K K E +   K E  K K E 
Sbjct: 144 AKEEAKKQAEEEAKAKAAAEAKKKAAEA---KKKAEAEAKAKAEAKAKAKAEEAKAKAEA 200

Query: 163 KEKKEEKEFKMKEDKEK 179
            + K   E   K + E 
Sbjct: 201 AKAKAAAEAAAKAEAEA 217



 Score = 29.4 bits (66), Expect = 3.5
 Identities = 21/67 (31%), Positives = 28/67 (41%), Gaps = 2/67 (2%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE-EKEKKEEK 169
           KK     KKK + + K K + K K K + + K K +  K K   E   K E E       
Sbjct: 165 KKKAAEAKKKAEAEAKAKAEAKAKAKAE-EAKAKAEAAKAKAAAEAAAKAEAEAAAAAAA 223

Query: 170 EFKMKED 176
           E + K D
Sbjct: 224 EAERKAD 230


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 49.4 bits (118), Expect = 2e-06
 Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 10/89 (11%)

Query: 120 KEKEKIKKKKDKKDKDKIK-----NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           K  E++K   D     + +     NK+     K+  + K E    +E + K +E++ K +
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           E+++K     +KE+ D +KEK+ KD+ KK
Sbjct: 411 ENEKK-----QKEQADEDKEKRQKDERKK 434



 Score = 42.1 bits (99), Expect = 4e-04
 Identities = 29/103 (28%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 77  KVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK 136
            + D +L +  +     +V S+ DL   K+ + +K+  K  +   K K+ + KD+ D   
Sbjct: 338 NLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTK-KLGEVKDETDAS- 395

Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
               E+ + K ++EK K+EE E K++KE+ +E + K ++D+ K
Sbjct: 396 ----EEAEAKAKEEKLKQEENE-KKQKEQADEDKEKRQKDERK 433



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 21/92 (22%), Positives = 42/92 (45%), Gaps = 8/92 (8%)

Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
            K  ++ K    +   ++ +  KE  K  K +   K+  E K+E       D  +  E  
Sbjct: 350 TKLYEEVKSNTDLSGDKRQELLKEYNK--KLQDYTKKLGEVKDET------DASEEAEAK 401

Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
            KEEK  ++E + K K++  + ++ ++K + K
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433



 Score = 40.2 bits (94), Expect = 0.002
 Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 7/81 (8%)

Query: 147 KEKEKGKKEEK---EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           K  E+ K       +K++E  K+  K+ +    K   V    K+E D+ +E ++K K++K
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEV----KDETDASEEAEAKAKEEK 406

Query: 204 LKKEKLKKKKKEKERSSHEKE 224
           LK+E+ +KK+KE+     EK 
Sbjct: 407 LKQEENEKKQKEQADEDKEKR 427



 Score = 35.5 bits (82), Expect = 0.050
 Identities = 16/59 (27%), Positives = 33/59 (55%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           + L +  K     KD+    ++ + K K++K K+++++ K KE+  E KEK +  + +K
Sbjct: 376 KKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434


>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
          Length = 746

 Score = 49.1 bits (118), Expect = 3e-06
 Identities = 27/161 (16%), Positives = 65/161 (40%), Gaps = 8/161 (4%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTH------KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
            SS++L E  K+  I  + K+H      ++  KE+ K++ +++ K + +     + ++E 
Sbjct: 15  VSSKELLEKLKELGI--EVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEA 72

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
           +         +E  E     E   +  +++         +  + +K +K K     K+K 
Sbjct: 73  KAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKK 132

Query: 210 KKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 250
            K+K  +     +     +         ++K K++ + KI 
Sbjct: 133 PKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIP 173



 Score = 41.8 bits (99), Expect = 6e-04
 Identities = 23/125 (18%), Positives = 47/125 (37%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           E K +       +  ++ + E       ++  +     E      E E  +  E   +  
Sbjct: 55  EAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRP 114

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           K KK  K+ K  + K+K  ++         K  K + + +  ++ + KKKK++       
Sbjct: 115 KAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPR 174

Query: 223 KEVIP 227
           + VIP
Sbjct: 175 EVVIP 179



 Score = 41.8 bits (99), Expect = 7e-04
 Identities = 16/119 (13%), Positives = 41/119 (34%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
               + +  K      KE ++K K+   + K  +   ++E+  KE+ K+E +E+ + + +
Sbjct: 2   SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAE 61

Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +       +E K +       EE                 +     +   +   + +  
Sbjct: 62  EAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAA 120



 Score = 41.8 bits (99), Expect = 7e-04
 Identities = 15/124 (12%), Positives = 47/124 (37%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           E  K ++ +      +++ K +       ++  +     +   +  +++  +  +     
Sbjct: 54  EEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARR 113

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
            + K+  + K     +K   K +        +     + ++ ++ + KKKKK+K      
Sbjct: 114 PKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIP 173

Query: 223 KEVI 226
           +EV+
Sbjct: 174 REVV 177


>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 605

 Score = 48.3 bits (115), Expect = 5e-06
 Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 2/128 (1%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKK--KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           L   K + SIK +KKT    +    KK    ++ +K+ +      K K    K K  +  
Sbjct: 368 LENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNN 427

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           K+++ +K+E  E +  +  EK  +     +     ++K  +KD +L K  L+K  K K  
Sbjct: 428 KEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNMLQKFNKFKNE 487

Query: 219 SSHEKEVI 226
            S E   I
Sbjct: 488 ESAENSFI 495



 Score = 29.8 bits (67), Expect = 3.6
 Identities = 22/88 (25%), Positives = 36/88 (40%)

Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
            I   KK   +  KE+  KK++  +K+ +K  EK  +  E     +  K+K   K+ +  
Sbjct: 415 KIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELS 474

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
           K    K +K K  E       D  K+  
Sbjct: 475 KNMLQKFNKFKNEESAENSFIDIIKQAD 502


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 48.5 bits (116), Expect = 6e-06
 Identities = 42/184 (22%), Positives = 84/184 (45%), Gaps = 2/184 (1%)

Query: 91   NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK-KKKDKKDKDKIKNKEKDKEKKEK 149
              +K T+ +D+     DK  +  ++  + +EKE  K ++   K K K     K K KK++
Sbjct: 1119 EKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKE 1178

Query: 150  EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
            +K KK   +K ++       +    ++K K  +K   ++ +S    +  D+++K K +K 
Sbjct: 1179 KKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKS 1238

Query: 210  KKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 269
              K+ + ++++  K       F    D+ ++ K +++PK V     SP PP      E +
Sbjct: 1239 SVKRLKSKKNNSSKSSEDNDEF-SSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESN 1297

Query: 270  TGDS 273
             G  
Sbjct: 1298 GGSK 1301



 Score = 47.7 bits (114), Expect = 1e-05
 Identities = 41/219 (18%), Positives = 77/219 (35%), Gaps = 14/219 (6%)

Query: 42   PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKD----------VELGVTPM--V 89
              SK K K S   K   K+K   K+K+S  + + + V             +L   P    
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKK 1217

Query: 90   PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
             N       +D  +  K K     +   KK    K  +  D+   D +  + K K   ++
Sbjct: 1218 SNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKR 1277

Query: 150  EKGKKEEKE--KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
                +       K    +           K+K  +++       +K+KKS+ K  + KK 
Sbjct: 1278 VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKS 1337

Query: 208  KLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
            K + K+    +SS       K      ++ ++ ++ + S
Sbjct: 1338 KTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDS 1376



 Score = 40.8 bits (96), Expect = 0.002
 Identities = 27/155 (17%), Positives = 54/155 (34%), Gaps = 2/155 (1%)

Query: 40   TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 99
             P  S +K   S K+ +S   + + +  +    KE       +      V       S  
Sbjct: 1234 KPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPK--RVSAVQYSPPPPSKR 1291

Query: 100  DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
              GE           K   KK  E       KK K + K   K K K   ++    +  +
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSR 1351

Query: 160  KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
               + +K++ +   ++D +  V+    E+ + +++
Sbjct: 1352 LLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386



 Score = 39.6 bits (93), Expect = 0.003
 Identities = 27/177 (15%), Positives = 57/177 (32%), Gaps = 3/177 (1%)

Query: 99   EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
             +  EI K++ +K   K    K ++   KKK+K  K K  + +K K+       K+ + +
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEK--KKKKSSADKSKKASVVGNSKRVDSD 1204

Query: 159  KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
            +K + + K + +       ++  ++ +K +      K+ K K     K      +     
Sbjct: 1205 EKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFS-SD 1263

Query: 219  SSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQ 275
               ++        +              P         PS P      +   G    
Sbjct: 1264 DLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAA 1320



 Score = 39.6 bits (93), Expect = 0.003
 Identities = 43/204 (21%), Positives = 76/204 (37%), Gaps = 29/204 (14%)

Query: 92   IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD-------- 143
            ++     +DL +  K     + K   KKK ++   ++++  ++D   + E D        
Sbjct: 1027 VITNAKKKDLVKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAV 1086

Query: 144  --------------KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI----R 185
                          KEK EK   + E+KEK+ EK K    +    ED +K  E +     
Sbjct: 1087 SYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEE 1146

Query: 186  KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
             EEK+  KE++ K K K    +  K K K+KE+   +                   + +S
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVV---GNSKRVDS 1203

Query: 246  SPKIVIKPVKSPSPPPVAAADEYD 269
              K  +           + +D+ D
Sbjct: 1204 DEKRKLDDKPDNKKSNSSGSDQED 1227



 Score = 36.9 bits (86), Expect = 0.022
 Identities = 31/162 (19%), Positives = 58/162 (35%), Gaps = 11/162 (6%)

Query: 38   PGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTS 97
            P      ++K K +   K+S      S +  S + K  +  K V        P   +   
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDG 1294

Query: 98   SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK--EKEKGKKE 155
              + G      + KK KK  +       KKKK  + K   K K K + K+    +  +  
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAAL-KKKKKSEKKTARKKKSKTRVKQASASQSSRLL 1353

Query: 156  EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
             + +K++ +   E +   + D          E++D E ++  
Sbjct: 1354 RRPRKKKSDSSSEDDDDSEVDDS--------EDEDDEDDEDD 1387



 Score = 36.9 bits (86), Expect = 0.022
 Identities = 25/167 (14%), Positives = 52/167 (31%), Gaps = 5/167 (2%)

Query: 39   GTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS 98
            G+      + K  PK  +  + K      +   E       D  L       N  K  S+
Sbjct: 1222 GSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDD-LSKEGKPKNAPKRVSA 1280

Query: 99   EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD----KIKNKEKDKEKKEKEKGKK 154
                     K    +     K      KK K + +      K K K + K  ++K+   +
Sbjct: 1281 VQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTR 1340

Query: 155  EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
             ++    +  +   +  K K D     +   + +   +++ +  + D
Sbjct: 1341 VKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387



 Score = 29.2 bits (66), Expect = 5.5
 Identities = 41/196 (20%), Positives = 74/196 (37%), Gaps = 31/196 (15%)

Query: 101  LGEIKKDKSIKKDKK---THKKKEKEKIKKKKDK---KDKDKIKNKEKDKEKKEKEKGKK 154
            LG+++++ +   +K     H    +  I   K K   K+  K+         K+K +   
Sbjct: 1001 LGKLERELARLSNKVRFIKHVINGELVITNAKKKDLVKELKKLGYVRFKDIIKKKSEKIT 1060

Query: 155  EEKEKKEEKEKKEEKEFKMKED------------------KEKPVEKIRKE--EKDSEKE 194
             E+E+  E++ + + E   +E                   KEK VEK+  E  +K+ E E
Sbjct: 1061 AEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEK-VEKLNAELEKKEKELE 1119

Query: 195  KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPV 254
            K      K +  E L K ++  E     +E       +     + KTK ++S     K  
Sbjct: 1120 KLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRL----KSKTKGKASKLRKPKLK 1175

Query: 255  KSPSPPPVAAADEYDT 270
            K       ++AD+   
Sbjct: 1176 KKEKKKKKSSADKSKK 1191


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 47.9 bits (115), Expect = 8e-06
 Identities = 32/112 (28%), Positives = 60/112 (53%), Gaps = 10/112 (8%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           K +E E + K+ +K  ++  + KEK +E+++K   + E++ ++  KE K+E +  +KE +
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELR 594

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKL-----KKEKLKKKKKEKERSSHEKE 224
           +      +K    S K  +  +  K+L     KKEK KKK+KEK+      +
Sbjct: 595 QL-----QKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGD 641



 Score = 46.0 bits (110), Expect = 3e-05
 Identities = 38/132 (28%), Positives = 66/132 (50%), Gaps = 21/132 (15%)

Query: 115 KTHKKKEKEK----IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE-------- 162
           K    ++KEK    I   ++ + + + K +E +   KE EK K+E +EKKE+        
Sbjct: 508 KKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKL 567

Query: 163 KEKKEEK-EFKMKEDKEKPVEKIR------KEEKDSEKEKKSKDKDKKLKK--EKLKKKK 213
            E+ E++ +  +KE K++  E I+      K    S K  +  +  K+L K  EK +KKK
Sbjct: 568 LEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKK 627

Query: 214 KEKERSSHEKEV 225
           K+++    E +V
Sbjct: 628 KKQKEKQEELKV 639



 Score = 37.5 bits (88), Expect = 0.013
 Identities = 33/151 (21%), Positives = 70/151 (46%), Gaps = 9/151 (5%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           ++KK   ++E++K+ ++ +K+ +  IK  +K+ ++  KE  + ++      K  +  +  
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEAR 613

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK---KEKLKKKKKEKERSSHEKEVIPK 228
           K      +  EK +K++K+ ++E K  D+ K L    K ++     +KE     +  I K
Sbjct: 614 KRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAI--VQAGIMK 671

Query: 229 LTFKFGTDMEEKTKRESSPKIVIKPVKSPSP 259
           +      D+E   K +   K   K  K+  P
Sbjct: 672 MKVPLS-DLE---KIQKPKKKKKKKPKTVKP 698



 Score = 35.6 bits (83), Expect = 0.058
 Identities = 38/161 (23%), Positives = 68/161 (42%), Gaps = 43/161 (26%)

Query: 99  EDLGEIKKDKSIKKDKKTHK------------KKEKEK-IKKKKDKKDKDKIKNKEKD-K 144
           E+L E K+    ++DK   +            KKE ++ IK+ +  +       K  +  
Sbjct: 551 EELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELI 610

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFK------------------MKEDKEKPVE---- 182
           E +++     E+KEKK++K+K++++E K                  + +DKE  V+    
Sbjct: 611 EARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIM 670

Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           K++    D EK +K        KK+K KK K  K +     
Sbjct: 671 KMKVPLSDLEKIQKP-------KKKKKKKPKTVKPKPRTVS 704



 Score = 33.6 bits (78), Expect = 0.22
 Identities = 35/155 (22%), Positives = 67/155 (43%), Gaps = 29/155 (18%)

Query: 99  EDLGEIKKDKSIKKDKKTHK-----KKEKEK-IKKKKDKKDKDKIKNKEKD-KEKKEKEK 151
           E L E +     + +K+  +     KKE ++ IK+ +  +       K  +  E +++  
Sbjct: 558 EKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLN 617

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD---------- 201
              E+KEKK++K+K++++E K+ ++      K     +  E      DK+          
Sbjct: 618 KANEKKEKKKKKQKEKQEELKVGDEV-----KYLSLGQKGEVLSIPDDKEAIVQAGIMKM 672

Query: 202 -------KKLKKEKLKKKKKEKERSSHEKEVIPKL 229
                  +K++K K KKKKK K      + V  +L
Sbjct: 673 KVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLEL 707


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 47.3 bits (112), Expect = 1e-05
 Identities = 34/136 (25%), Positives = 71/136 (52%), Gaps = 12/136 (8%)

Query: 95  VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
           V    D+ ++K+ +S +  K+  + KE+   K+    K + K    + + +K+  E  +K
Sbjct: 194 VNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQK 253

Query: 155 EEKEKKEEK-----EKKEEKEFKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLKKEK 208
           +++ K   K       KE+K  ++ E++++ +EK + E +K+ E+  K+KD     K   
Sbjct: 254 QQEAKNLPKPADTSSPKEDK--QVAENQKREIEKAQIEIKKNDEEALKAKDH----KAFD 307

Query: 209 LKKKKKEKERSSHEKE 224
           LK++ K  E+ + +KE
Sbjct: 308 LKQESKASEKEAEDKE 323



 Score = 36.1 bits (83), Expect = 0.033
 Identities = 24/127 (18%), Positives = 62/127 (48%), Gaps = 7/127 (5%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
           K   ++D  + ++D+  +K ++     +       K+  DK   +N++++ EK + E  K
Sbjct: 235 KADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKE--DKQVAENQKREIEKAQIEIKK 292

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD-KKLKKEKLKKK 212
            +E+  K     K+ K F +K++ +   ++   +E +++K+++   +D +K K +   + 
Sbjct: 293 NDEEALK----AKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQP 348

Query: 213 KKEKERS 219
               E +
Sbjct: 349 TSLNEDA 355



 Score = 28.8 bits (64), Expect = 7.3
 Identities = 22/96 (22%), Positives = 40/96 (41%), Gaps = 9/96 (9%)

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
            ++ED EK V   R      E+E +   K  +  KE+L KK+ + +++  + +       
Sbjct: 185 ALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKAD------- 237

Query: 232 KFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADE 267
            F  D  +K + E   K   +    P P   ++  E
Sbjct: 238 -FAQDNADKQRDEVRQKQQ-EAKNLPKPADTSSPKE 271


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 47.4 bits (113), Expect = 1e-05
 Identities = 23/147 (15%), Positives = 62/147 (42%), Gaps = 14/147 (9%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHK--------------KKEKEKIKKKKDKKDKDKI 137
           IV + ++ +  +I     I  D   +               + E+E++K  + ++++  +
Sbjct: 772 IVSLFNNPNAEKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLM 831

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           + +E +   +E++  K+E K +  E ++ EE+   + E+K + VE     +K ++     
Sbjct: 832 RPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAK 891

Query: 198 KDKDKKLKKEKLKKKKKEKERSSHEKE 224
               K +      K+ ++  +   +  
Sbjct: 892 DVLIKLVISSDEIKQDEKTTKKKKKDL 918



 Score = 43.1 bits (102), Expect = 3e-04
 Identities = 21/117 (17%), Positives = 52/117 (44%), Gaps = 5/117 (4%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK--DKEKKEKEKGKKEEKEKKEEK 163
            D+ I+ +++  K    ++ +     ++ + I  ++K   +E K +    +E +E+ +  
Sbjct: 808 IDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNI 867

Query: 164 EKKEEKEFKMKEDKEKPVEKI---RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            + + +  + K D +K  + I       K      + K  +K  KK+K   +K ++E
Sbjct: 868 NENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEE 924



 Score = 37.3 bits (87), Expect = 0.015
 Identities = 20/118 (16%), Positives = 50/118 (42%), Gaps = 3/118 (2%)

Query: 65  KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 124
            ++    E+E  K   ++     M P  +++ + E     ++ K    + +  +++ +  
Sbjct: 808 IDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNI 867

Query: 125 IKKKK---DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
            + K    + K+  K  NK    +    +     ++ K++EK  K++K+   K D+E 
Sbjct: 868 NENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEEA 925



 Score = 34.6 bits (80), Expect = 0.11
 Identities = 19/101 (18%), Positives = 49/101 (48%), Gaps = 2/101 (1%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKI--KKKKDKKDKDKIKNKEKDKEKKEKEK 151
           ++   E L   ++ + I +++K  K++ K ++   ++ +++ ++  +NK +  E K   K
Sbjct: 823 RIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPK 882

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
              +    K+   K      ++K+D++   +K +  EK  E
Sbjct: 883 KLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDE 923



 Score = 32.7 bits (75), Expect = 0.41
 Identities = 24/130 (18%), Positives = 57/130 (43%), Gaps = 14/130 (10%)

Query: 133 DKDKIKNKEKDKEKKE-KEKGKKEEKEK-------KEEKEKKEEKEFKMKEDKEKPVEKI 184
             D I N + +   +E  ++  + E+E+       +EE   + E+   + E+++   ++I
Sbjct: 791 LSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEI 850

Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIPKLTFKFGTDMEEKTK 242
           + E  + ++ ++      + K E ++ K   K+ +     K+V+ KL        +E  +
Sbjct: 851 KLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLV----ISSDEIKQ 906

Query: 243 RESSPKIVIK 252
            E + K   K
Sbjct: 907 DEKTTKKKKK 916


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 46.6 bits (111), Expect = 1e-05
 Identities = 22/118 (18%), Positives = 43/118 (36%), Gaps = 1/118 (0%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
           T    L  I    S              + + K       K K ++K+ E K+K + KK+
Sbjct: 16  TQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKK 75

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           +K++K+E + + E +   K  K+    K +  +     E      +K  +  +     
Sbjct: 76  KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP-NEDVDNAFNKIAELAEKSNVY 132



 Score = 42.0 bits (99), Expect = 4e-04
 Identities = 36/137 (26%), Positives = 55/137 (40%), Gaps = 15/137 (10%)

Query: 57  SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 116
           S     + K K  P    YS V  +   +           ++      KKDK   K+ ++
Sbjct: 10  SFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTS---TKKDKKEDKNNES 66

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
            KKK ++K KKKK+KK+            K E E     +  KK +K KK+  + K  ED
Sbjct: 67  -KKKSEKKKKKKKEKKE-----------PKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114

Query: 177 KEKPVEKIRKEEKDSEK 193
            +    KI +  + S  
Sbjct: 115 VDNAFNKIAELAEKSNV 131



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 27/108 (25%), Positives = 49/108 (45%), Gaps = 1/108 (0%)

Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           T +K + + I            +      E++ K      +K+KKE+K  + +K+   K+
Sbjct: 15  TTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKK-SEKK 73

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
            K+K  +K  K E +++   K+  K KK KK+  K K  E   ++  K
Sbjct: 74  KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNK 121



 Score = 39.7 bits (93), Expect = 0.003
 Identities = 25/116 (21%), Positives = 42/116 (36%), Gaps = 1/116 (0%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
           K K +             KE      E+E K       K+ KE    + +K+ +  +K+K
Sbjct: 18  KSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKK 77

Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEE-KTKRESSPKIV 250
           K K + K   + KL  K  +K + + +K   PK          +     E S   +
Sbjct: 78  KEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVYI 133



 Score = 38.2 bits (89), Expect = 0.008
 Identities = 27/94 (28%), Positives = 43/94 (45%), Gaps = 2/94 (2%)

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
           ++ + +             KE      E+  K    S K+ K +DK+ + KK+  KKKKK
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76

Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
           +KE+   + E   KL FK  T  + K  ++  PK
Sbjct: 77  KKEKKEPKSEGETKLGFK--TPKKSKKTKKKPPK 108



 Score = 33.2 bits (76), Expect = 0.28
 Identities = 23/110 (20%), Positives = 43/110 (39%), Gaps = 4/110 (3%)

Query: 144 KEKKEKEKGKKEEKE-KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
           K K +             +E      +E            K +KE+K++E +KKS+ K K
Sbjct: 18  KSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTK--KDKKEDKNNESKKKSEKK-K 74

Query: 203 KLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
           K KKEK + K + + +   +     K T K     +     +++   + +
Sbjct: 75  KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAE 124



 Score = 30.5 bits (69), Expect = 1.8
 Identities = 18/98 (18%), Positives = 37/98 (37%), Gaps = 3/98 (3%)

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           ++ K +     +       K +     EE   E +  +    K  K++K  + KK+ E+ 
Sbjct: 17  QKSKLQPISYIYSNVLVLSKEILSTFSEE---ENKVATTSTKKDKKEDKNNESKKKSEKK 73

Query: 220 SHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSP 257
             +K+   +   +  T +  KT ++S       P   P
Sbjct: 74  KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 46.5 bits (111), Expect = 1e-05
 Identities = 25/83 (30%), Positives = 45/83 (54%), Gaps = 7/83 (8%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           K    K   ++ +K DK  +++ +   K  E++ +E    E +EKKEEK+K+E +    K
Sbjct: 247 KLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQE----EAQEKKEEKKKEEREA---K 299

Query: 175 EDKEKPVEKIRKEEKDSEKEKKS 197
             K  P E+ + EEK+ +K+ + 
Sbjct: 300 LAKLSPEEQRKLEEKERKKQARK 322



 Score = 38.8 bits (91), Expect = 0.004
 Identities = 25/83 (30%), Positives = 47/83 (56%), Gaps = 9/83 (10%)

Query: 135 DKIKN---KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
           DK+ N     +   K +K + ++EEK  K  +E+++E+  + KE+K+K  E+  K  K S
Sbjct: 246 DKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKK-EEREAKLAKLS 304

Query: 192 EKEKKSKDKDKKLKKEKLKKKKK 214
            +E++     K  +KE+ K+ +K
Sbjct: 305 PEEQR-----KLEEKERKKQARK 322



 Score = 36.8 bits (86), Expect = 0.014
 Identities = 20/66 (30%), Positives = 44/66 (66%), Gaps = 3/66 (4%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKK-DKDKIKNKEKDKEKKEKEKGK--KEEKEKKEE 162
             + ++K  KT +++E++ +K  ++++ ++ + K +EK KE++E +  K   EE+ K EE
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313

Query: 163 KEKKEE 168
           KE+K++
Sbjct: 314 KERKKQ 319



 Score = 30.7 bits (70), Expect = 1.4
 Identities = 15/62 (24%), Positives = 36/62 (58%), Gaps = 1/62 (1%)

Query: 93  VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
           V  T  E+  +I K    ++ ++  +KKE++K ++++ K  K   + +++  E+KE++K 
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPE-EQRKLEEKERKKQ 319

Query: 153 KK 154
            +
Sbjct: 320 AR 321



 Score = 28.8 bits (65), Expect = 6.9
 Identities = 14/72 (19%), Positives = 40/72 (55%), Gaps = 3/72 (4%)

Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 236
             + + K+ K  ++ E++     ++++ ++ + KK++K+KE     +  + KL+ +    
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEE---REAKLAKLSPEEQRK 310

Query: 237 MEEKTKRESSPK 248
           +EEK +++ + K
Sbjct: 311 LEEKERKKQARK 322


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 46.7 bits (111), Expect = 2e-05
 Identities = 24/106 (22%), Positives = 45/106 (42%), Gaps = 3/106 (2%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
            +KK H+K +K+  K  K    KD  K K K K+K +K+    ++ +        +    
Sbjct: 48  INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYD---NFFDSKNNSK 104

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           +        + +  K + +  K+K + +K    KK+K+   K E  
Sbjct: 105 QFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELI 150



 Score = 36.3 bits (84), Expect = 0.037
 Identities = 25/109 (22%), Positives = 39/109 (35%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
               K  KK K   KKK K+K+    D  +    KN  K           ++ K K E+ 
Sbjct: 66  TDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKL 125

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           +KK        + K+K +    +  K    + KS      L  ++L   
Sbjct: 126 KKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHSPLTIQELSTL 174



 Score = 34.8 bits (80), Expect = 0.11
 Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 21/159 (13%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK----EKKEKEKGKKEEKEKKEEKEKKE 167
             + T   K    IK  K    K ++++  KD     +  +K   K ++K KK  K    
Sbjct: 10  SSENTIDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDL 69

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL------KKKKKEKERSSH 221
           +   K K  ++K ++K    + D +    SK+  K+             K K EK +   
Sbjct: 70  KDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKI 129

Query: 222 EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPP 260
                             K K++             + P
Sbjct: 130 TVNKSTN-----------KKKKKVLSSKDELIKYDNNKP 157


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 46.1 bits (109), Expect = 3e-05
 Identities = 32/133 (24%), Positives = 63/133 (47%), Gaps = 2/133 (1%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--EKEKKEEKEKKEEKEFKMKE 175
           K+ EKE++K ++ +K  ++ + + + ++K+++E+ +K   E++KK E  K +      K 
Sbjct: 104 KQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKL 163

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 235
                 +K  +E   + +E K+K +    KK+   + K   E++  E E   K   K   
Sbjct: 164 KAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEA 223

Query: 236 DMEEKTKRESSPK 248
             EEK   E    
Sbjct: 224 AAEEKAAAEKKKA 236



 Score = 45.7 bits (108), Expect = 3e-05
 Identities = 26/117 (22%), Positives = 49/117 (41%), Gaps = 3/117 (2%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           + KK    K K   +  K K   +  K  E+  +  E+ K K E    K++ E + +   
Sbjct: 145 QKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAA 204

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK---KLKKEKLKKKKKEKERSSHEKEV 225
           +  + + +   K  K+ + + +EK + +K K   K K +K     K  ER +    +
Sbjct: 205 EKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAAL 261



 Score = 37.6 bits (87), Expect = 0.010
 Identities = 29/121 (23%), Positives = 57/121 (47%), Gaps = 5/121 (4%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK-KEEKEKKEEK 163
           K+ +  +K  +  +K+++E+   +K   ++ K     K K   E  K K   E +KK E+
Sbjct: 118 KQAEEAEKQAQLEQKQQEEQ--ARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEE 175

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
             K  +E K K   E    K + E +     +K+K + +   K + K +   +E+++ EK
Sbjct: 176 AAKAAEEAKAK--AEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEK 233

Query: 224 E 224
           +
Sbjct: 234 K 234



 Score = 37.6 bits (87), Expect = 0.012
 Identities = 28/117 (23%), Positives = 65/117 (55%), Gaps = 2/117 (1%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK-KEEKEKKEEKEKK 166
             I+  + + KK E+++ KK++   ++ K K   + +  K+ EK + K ++++K+ +E +
Sbjct: 65  GRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAE 124

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEK-KSKDKDKKLKKEKLKKKKKEKERSSHE 222
           ++ + + K+ +E+  +   +++K +E  K K+  +  KLK     KKK E+   + E
Sbjct: 125 KQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAE 181



 Score = 33.4 bits (76), Expect = 0.22
 Identities = 24/97 (24%), Positives = 40/97 (41%), Gaps = 5/97 (5%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           KK +   K  +  K K +    KKK + +  K   ++   E + K K    EK+ +   E
Sbjct: 171 KKAEEAAKAAEEAKAKAEAAAAKKKAEAEA-KAAAEKAKAEAEAKAKA---EKKAEAAAE 226

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
           +K   E K    K K  +K     K +E++  +   D
Sbjct: 227 EKAAAEKKKAAAKAK-ADKAAAAAKAAERKAAAAALD 262



 Score = 33.4 bits (76), Expect = 0.23
 Identities = 21/99 (21%), Positives = 57/99 (57%), Gaps = 4/99 (4%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
           +I++++   +K E+++ KKEE+  +E K K+  ++ ++K+ +++ ++   ++++  E EK
Sbjct: 66  RIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEK 125

Query: 196 KSKDKDKKLK----KEKLKKKKKEKERSSHEKEVIPKLT 230
           +++ + K+ +    K   ++KKK +   +       KL 
Sbjct: 126 QAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLK 164



 Score = 30.3 bits (68), Expect = 2.1
 Identities = 21/114 (18%), Positives = 44/114 (38%), Gaps = 4/114 (3%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
           K  +     +   + +  K     KKK +E  K  ++ K K +    +K  E + K   +
Sbjct: 146 KKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAE 205

Query: 154 KEEKEKKE----EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           K + E +     EK+ +   E K   +K+K   K + ++  +  +   +     
Sbjct: 206 KAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAA 259


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 46.1 bits (109), Expect = 3e-05
 Identities = 26/99 (26%), Positives = 55/99 (55%), Gaps = 5/99 (5%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEFKMKEDKEKPVEKIRKEE--KD 190
             +  N   +   +EK+  K+++K + E KEK  ++KE  + ++ + PV++   +E  K 
Sbjct: 5   NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIP 227
           +++ KKS  ++ K   E LK K++ ++   +E  ++ IP
Sbjct: 65  ADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIP 103



 Score = 44.9 bits (106), Expect = 7e-05
 Identities = 41/147 (27%), Positives = 72/147 (48%), Gaps = 10/147 (6%)

Query: 44  SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
               D++  KD     EK   K +   KEK   K ++V   VT  V + VK  SS++   
Sbjct: 11  INNNDRMQEKDD----EKQDQKNRMELKEKVLDKKEEV---VTDNVDSPVKEQSSQE--N 61

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-KEKGKKEEKEKKEE 162
           +K    +KK  K   K+  E +K K++ + + + +  +K     E KE   K+ ++ K E
Sbjct: 62  LKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPE 121

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEK 189
           + KK+ K F++ E K+ P+ +   E++
Sbjct: 122 QAKKQTKLFRLFEPKQLPIYRANGEKE 148



 Score = 41.1 bits (96), Expect = 0.001
 Identities = 22/97 (22%), Positives = 48/97 (49%), Gaps = 13/97 (13%)

Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
                ++      ++ ++++ EK+++K + E KE K+ + KE+ V     +  DS  +++
Sbjct: 2   AYRNRREANINNNDRMQEKDDEKQDQKNRMELKE-KVLDKKEEVVT----DNVDSPVKEQ 56

Query: 197 SKDKDKKLKKEKLKKKKKE--------KERSSHEKEV 225
           S  ++ K+  E  K  K+E        K +  H+KE+
Sbjct: 57  SSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQKEI 93



 Score = 38.4 bits (89), Expect = 0.008
 Identities = 29/111 (26%), Positives = 61/111 (54%), Gaps = 3/111 (2%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            ++K  +K  + ++ + KEK+  KK++   D + +  K++  +E  K   E K+  +E+ 
Sbjct: 17  MQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEES 76

Query: 165 KKEEKEFKMKEDKEKPVE-KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
           K+  +  K KE+ +K ++ +I ++   S + K+S    KKL+  K ++ KK
Sbjct: 77  KQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKES--ILKKLEDIKPEQAKK 125



 Score = 36.8 bits (85), Expect = 0.026
 Identities = 24/100 (24%), Positives = 45/100 (45%), Gaps = 7/100 (7%)

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-KEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           ++ + +  I      ++KD    K+  K + E KEK   +++E   +      KE   +E
Sbjct: 3   YRNRREANINNNDRMQEKDD--EKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQE 60

Query: 176 D--KEKPVEKIRKEEKDSEKE--KKSKDKDKKLKKEKLKK 211
           +      V+K  KEE     E  K  ++  K+++ E L+K
Sbjct: 61  NLKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQK 100



 Score = 36.1 bits (83), Expect = 0.044
 Identities = 19/90 (21%), Positives = 48/90 (53%), Gaps = 3/90 (3%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           +++ +I       ++K+ EK  +K   + K+K+ +K+++      +   KE+  ++  K 
Sbjct: 6   RREANINN-NDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
             E K+   KE+ ++ +E + K +++ +KE
Sbjct: 65  ADEVKKST-KEESKQLLE-VLKTKEEHQKE 92



 Score = 31.1 bits (70), Expect = 1.3
 Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 6/65 (9%)

Query: 187 EEKDSEKEK---KSKDKDKKLKKEKLKKKKK---EKERSSHEKEVIPKLTFKFGTDMEEK 240
              D  +EK   K   K++   KEK+  KK+        S  KE   +   K   ++++ 
Sbjct: 12  NNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKS 71

Query: 241 TKRES 245
           TK ES
Sbjct: 72  TKEES 76


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 45.4 bits (108), Expect = 4e-05
 Identities = 27/94 (28%), Positives = 49/94 (52%), Gaps = 6/94 (6%)

Query: 115 KTHKK---KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK---EEKEKKEEKEKKEE 168
           KTH+K   K +EKIK  K +  + K      +     K K K     + EK + + K+++
Sbjct: 273 KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKK 332

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
           KE K +E K+K +E++ +  +  E +   K+++K
Sbjct: 333 KEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366



 Score = 39.6 bits (93), Expect = 0.002
 Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 138 KNKEKDKEK-KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
           K+ EK +EK K  +   K  K+     E   + + K+K   E+  EK+  E K+ +KEKK
Sbjct: 277 KSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKK 336

Query: 197 SKDKDKKL------KKEKLKKKKKEKE 217
            ++K KK       + EKL+ +  +KE
Sbjct: 337 KEEKKKKQIERLEERIEKLEVQATDKE 363



 Score = 38.5 bits (90), Expect = 0.006
 Identities = 19/78 (24%), Positives = 38/78 (48%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           + L    K               K K+K K ++ ++      ++ K++K+KE+ KK++ E
Sbjct: 287 KALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIE 346

Query: 159 KKEEKEKKEEKEFKMKED 176
           + EE+ +K E +   KE+
Sbjct: 347 RLEERIEKLEVQATDKEE 364



 Score = 36.6 bits (85), Expect = 0.021
 Identities = 24/75 (32%), Positives = 39/75 (52%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           IK  K   K  K      +     K+  K K +  N++ D E KEK+K KK+E++KK++ 
Sbjct: 286 IKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQI 345

Query: 164 EKKEEKEFKMKEDKE 178
           E+ EE+  K++    
Sbjct: 346 ERLEERIEKLEVQAT 360



 Score = 33.5 bits (77), Expect = 0.24
 Identities = 17/60 (28%), Positives = 32/60 (53%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
              +K   K E++  K   + K+K K K KE+ K+K+ +   ++ EK + +  +K+E K 
Sbjct: 308 DLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENKT 367



 Score = 33.1 bits (76), Expect = 0.28
 Identities = 13/63 (20%), Positives = 34/63 (53%)

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           +EK +  ++++K  K+  +      +   + + K +  ++KL  E  +KKK++K+    +
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342

Query: 223 KEV 225
           K++
Sbjct: 343 KQI 345



 Score = 31.9 bits (73), Expect = 0.73
 Identities = 28/102 (27%), Positives = 51/102 (50%), Gaps = 6/102 (5%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
           EK + K +  + + ++ KK    F+M  D ++ ++   K E+D+EK      + KK    
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKS--KFERDNEKLDAEVKEKKK---- 333

Query: 208 KLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKI 249
           + KK++K+K++    +E I KL  +     E KT    + KI
Sbjct: 334 EKKKEEKKKKQIERLEERIEKLEVQATDKEENKTVALGTSKI 375



 Score = 28.9 bits (65), Expect = 5.5
 Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 1/68 (1%)

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
            L E+  D   K   K  +  EK   + K+ KK+K K + K+K + ++ +E+ +K E + 
Sbjct: 301 LLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKK-EEKKKKQIERLEERIEKLEVQA 359

Query: 160 KEEKEKKE 167
            +++E K 
Sbjct: 360 TDKEENKT 367


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 45.8 bits (109), Expect = 4e-05
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 5/88 (5%)

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
           KK KE  + EE+E  EE + + E   + K  K++      ++     +EK+  DK + L 
Sbjct: 224 KKLKETSETEERE--EETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281

Query: 206 KEKLKKKKKEKERSSHEKEVIPKLTFKF 233
           K ++ K+KK++E    EK   P  T  F
Sbjct: 282 KLEILKEKKDEELFWFEK---PWPTLLF 306



 Score = 39.6 bits (93), Expect = 0.003
 Identities = 16/76 (21%), Positives = 37/76 (48%), Gaps = 4/76 (5%)

Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           T K KE  + ++++++ D +     E    K+E+E   +E+     E+++  +K     E
Sbjct: 223 TKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKT----E 278

Query: 176 DKEKPVEKIRKEEKDS 191
           D +K      K++++ 
Sbjct: 279 DLDKLEILKEKKDEEL 294



 Score = 31.9 bits (73), Expect = 0.76
 Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 2/76 (2%)

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
           F  K+ KE    + R+EE D E E  S+ K    K+E+    +++    S EKE   K  
Sbjct: 221 FFTKKLKETSETEEREEETDVEIETTSETK--GTKQEQEGSTEEDPSLFSEEKEDPDKTE 278

Query: 231 FKFGTDMEEKTKRESS 246
                ++ ++ K E  
Sbjct: 279 DLDKLEILKEKKDEEL 294



 Score = 31.5 bits (72), Expect = 0.95
 Identities = 11/72 (15%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           K +K+  +T +++E+  ++ +   + K   K +++   +++     +E+++  + ++  +
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETK-GTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDK 282

Query: 168 EKEFKMKEDKEK 179
            +  K K+D+E 
Sbjct: 283 LEILKEKKDEEL 294



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 17/77 (22%), Positives = 36/77 (46%), Gaps = 1/77 (1%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEEKEKKEEK 163
           KK K   + ++  ++ + E     + K  K + + + E+D     +EK   ++ E  ++ 
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283

Query: 164 EKKEEKEFKMKEDKEKP 180
           E  +EK+ +     EKP
Sbjct: 284 EILKEKKDEELFWFEKP 300



 Score = 30.4 bits (69), Expect = 2.5
 Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 163 KEKKEEKEFKMKED-KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
            EK     F + E  K K + +   E+K   K     ++  K +++K K K++EKER 
Sbjct: 623 IEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680



 Score = 28.8 bits (65), Expect = 7.3
 Identities = 16/51 (31%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
           S D+ E+ K K I   +   +KK++ KI    ++K K + K ++  +E+KE
Sbjct: 630 SFDISELMKIKQI--FRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678



 Score = 28.4 bits (64), Expect = 9.0
 Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 2/54 (3%)

Query: 120 KEKEKIKKKKDKKDKDKIKN--KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           K K+  +   +KK + KI +  +EK K++++KEK K+EEKE+ E  E  +    
Sbjct: 638 KIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERIEIAETWDTIPL 691


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 45.2 bits (107), Expect = 6e-05
 Identities = 36/117 (30%), Positives = 61/117 (52%), Gaps = 12/117 (10%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           E+ K    + K++ + +  K    EK+ ++K +  EK  KE+++ K+E E +M+E KE  
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKE-- 557

Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKE------KLKKKKKEKERSSHEKEVIPKLTF 231
               R+  K  E EK++++  K LKKE      +LK+KK  K +     E + KL  
Sbjct: 558 ----RERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKE 610



 Score = 41.3 bits (97), Expect = 0.001
 Identities = 35/126 (27%), Positives = 62/126 (49%), Gaps = 16/126 (12%)

Query: 107 DKSIKKDKKTHKKKEKEK----IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE------- 155
              I +  KT   + KE+    I+K    + + + KN+  +K  KE+EK KKE       
Sbjct: 495 PHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEE 554

Query: 156 EKEKKEEKEKKEEKEF-----KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
            KE++  K+ + EKE       +K++ E  + ++++++    KE KS +   KLK+ K K
Sbjct: 555 LKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQK 614

Query: 211 KKKKEK 216
             +K  
Sbjct: 615 IPQKPT 620



 Score = 39.0 bits (91), Expect = 0.005
 Identities = 34/113 (30%), Positives = 56/113 (49%), Gaps = 16/113 (14%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKK-----KDKKDKDKIKNKEKDKEKKEKEKGK 153
           E L  ++K+   K +      KE+EK+KK+     ++ K++++ K  E +KE +E  K  
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKAL 577

Query: 154 KEEKEK--KEEKEKKEEKEF---------KMKEDKEKPVEKIRKEEKDSEKEK 195
           K+E E   +E KEKK  K           K+KE K+K  +K    + D   +K
Sbjct: 578 KKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDK 630


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 45.5 bits (108), Expect = 6e-05
 Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 10/98 (10%)

Query: 121  EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM-KEDKEK 179
              E + KK ++K K  I + EK        K  + E   +EEKE   + E +  KE K  
Sbjct: 1494 GNENVNKKINQKKKGFIPSNEK--------KSIEIENRNQEEKEPAGQGELESDKEKKGN 1545

Query: 180  PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
                +  +EK+ E++    D  KK K +K  K   E E
Sbjct: 1546 LESVLSNQEKNIEEDYAESDI-KKRKNKKQYKSNTEAE 1582



 Score = 45.1 bits (107), Expect = 8e-05
 Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 4/100 (4%)

Query: 138  KNKEKDKEKKEKEKGKKEEKEKKE---EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
             N+  +K+  +K+KG     EKK    E   +EEKE   + + E   EK    E     +
Sbjct: 1494 GNENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQ 1553

Query: 195  KKSKDKD-KKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
            +K+ ++D  +   +K K KK+ K  +  E +   K   +F
Sbjct: 1554 EKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLRF 1593



 Score = 38.2 bits (89), Expect = 0.009
 Identities = 21/77 (27%), Positives = 34/77 (44%), Gaps = 13/77 (16%)

Query: 105  KKDKSIKKDKKTH------KKKEKEKIKKKKDKKDKDKIK-------NKEKDKEKKEKEK 151
            KK   I  ++K         ++EKE   + + + DK+K         N+EK+ E+   E 
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564

Query: 152  GKKEEKEKKEEKEKKEE 168
              K+ K KK+ K   E 
Sbjct: 1565 DIKKRKNKKQYKSNTEA 1581



 Score = 33.5 bits (77), Expect = 0.27
 Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 3/46 (6%)

Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
             K  + K  D ++ K K K+KKEK   KKEE+ K+EEK + E  E
Sbjct: 729 NGKDAEFKISDSVEEKTKKKKKKEK---KKEEEYKREEKARIEIAE 771



 Score = 32.0 bits (73), Expect = 0.91
 Identities = 14/53 (26%), Positives = 27/53 (50%)

Query: 98   SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
                GE++ DK  K + ++    +++ I++   + D  K KNK++ K   E E
Sbjct: 1530 PAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAE 1582



 Score = 30.1 bits (68), Expect = 3.4
 Identities = 15/42 (35%), Positives = 24/42 (57%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            K  + K     ++   K+K KEKK++E+ K+EEK + E  E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771



 Score = 29.3 bits (66), Expect = 5.2
 Identities = 15/37 (40%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
            KD E K  +   +E+ +KK++KEKK+E+E+K +E  
Sbjct: 730 GKDAEFKISDS-VEEKTKKKKKKEKKKEEEYKREEKA 765


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 44.7 bits (106), Expect = 9e-05
 Identities = 35/117 (29%), Positives = 69/117 (58%), Gaps = 11/117 (9%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK-EEKEKKEEKE 170
           K++    +KE E ++  K +K ++KI+  E+  E+ +KE  + EEK K+ +E ++K E+ 
Sbjct: 237 KEEIEELEKELESLEGSK-RKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEY 295

Query: 171 FKMKEDKEKPVEKIRKEEKDSEK---------EKKSKDKDKKLKKEKLKKKKKEKER 218
            K+ E  E+ ++++R+ EK   +         E+  + ++K+ + E+LKKK KE E+
Sbjct: 296 IKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEK 352



 Score = 42.7 bits (101), Expect = 4e-04
 Identities = 29/137 (21%), Positives = 75/137 (54%), Gaps = 3/137 (2%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           ++L E+ ++ +    +    ++E EK++K+  + ++ K + +E +KE +  E  K++ +E
Sbjct: 200 KELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEE 259

Query: 159 KKEEKEK---KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           K  E E+   + +KE +  E+K K +++++++ ++  K  +  ++     +E  K+  + 
Sbjct: 260 KIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRL 319

Query: 216 KERSSHEKEVIPKLTFK 232
           +E  +  +E I +L  K
Sbjct: 320 EEEINGIEERIKELEEK 336



 Score = 40.0 bits (94), Expect = 0.002
 Identities = 38/142 (26%), Positives = 78/142 (54%), Gaps = 13/142 (9%)

Query: 96  TSSEDLGEIKK--DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK---- 149
            + ++LGE+ K   + I++ +K  K+ E  +   K+ +K+ +++  +  +   +      
Sbjct: 162 NAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELRE 221

Query: 150 -----EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK-EEKDSEKEKKSKDKDKK 203
                EK  KE +E KEE E+ E++   ++  K K  EKIR+ EE+  E +K+ ++ ++K
Sbjct: 222 ELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281

Query: 204 LKK-EKLKKKKKEKERSSHEKE 224
           +K+ ++LK+K +E  + S   E
Sbjct: 282 VKELKELKEKAEEYIKLSEFYE 303



 Score = 39.3 bits (92), Expect = 0.004
 Identities = 27/98 (27%), Positives = 54/98 (55%), Gaps = 1/98 (1%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
           +E+IK+ ++K+++ +   K+  + +K  E+ ++  +  +E K KKEE E   K       
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP 386

Query: 182 EKIRKEEKDSEKEKKS-KDKDKKLKKEKLKKKKKEKER 218
           EK+ KE ++ EK K+  +++  K+     + KK+ KE 
Sbjct: 387 EKLEKELEELEKAKEEIEEEISKITARIGELKKEIKEL 424



 Score = 39.3 bits (92), Expect = 0.004
 Identities = 40/171 (23%), Positives = 86/171 (50%), Gaps = 15/171 (8%)

Query: 65  KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 124
            E+   K +EY K+K+  + +   + ++ K    E L E+KK  +  + K    ++E  +
Sbjct: 517 LEELEKKAEEYEKLKEKLIKLKGEIKSLKK--ELEKLEELKKKLAELEKKLDELEEELAE 574

Query: 125 IKKKKDKK-------DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE----FKM 173
           + K+ ++         ++++K  E    +  + K  ++E E++E++ KK E+E    F+ 
Sbjct: 575 LLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEE 634

Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
             + EK +E++RKE +  E EKK  +++ +  +E+  +  +E      E E
Sbjct: 635 LAETEKRLEELRKELE--ELEKKYSEEEYEELREEYLELSRELAGLRAELE 683



 Score = 38.5 bits (90), Expect = 0.007
 Identities = 33/151 (21%), Positives = 75/151 (49%), Gaps = 2/151 (1%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E + E+++     K +    +++ +++K+ K+K ++    ++  ++   E  + +K    
Sbjct: 259 EKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKR-LS 317

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           + EE+    E+  K  E+KE+ +E+++K+ K+ EK  +  ++  +L  E+ K KK+E ER
Sbjct: 318 RLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHEL-YEEAKAKKEELER 376

Query: 219 SSHEKEVIPKLTFKFGTDMEEKTKRESSPKI 249
                  +     +   +  EK K E   +I
Sbjct: 377 LKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407



 Score = 38.1 bits (89), Expect = 0.009
 Identities = 39/137 (28%), Positives = 79/137 (57%), Gaps = 3/137 (2%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           K I+K+ K  ++KE+ K++K+  + +K   K  E  K K+  E+ K+ E++ K+   ++ 
Sbjct: 462 KRIEKELKEIEEKER-KLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEEL 520

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
           EK+ +  E  ++ + K++ E K  +KE +K ++  KKL + + K  + E+E +   KE +
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE-L 579

Query: 227 PKLTFKFGTDMEEKTKR 243
            +L F+   ++EE+ K 
Sbjct: 580 EELGFESVEELEERLKE 596



 Score = 38.1 bits (89), Expect = 0.010
 Identities = 34/157 (21%), Positives = 79/157 (50%), Gaps = 2/157 (1%)

Query: 59  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTS--SEDLGEIKKDKSIKKDKKT 116
           +EK+   ++   K +EY K+ +        +  I K  S   E++  I++     ++K+ 
Sbjct: 279 EEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEE 338

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
             ++ K+K+K+ + + ++ + +++  ++ K +KE+ ++ +K       +K EKE +  E 
Sbjct: 339 RLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEK 398

Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
            ++ +E+   +      E K + K+ K   E+LKK K
Sbjct: 399 AKEEIEEEISKITARIGELKKEIKELKKAIEELKKAK 435



 Score = 36.2 bits (84), Expect = 0.039
 Identities = 25/115 (21%), Positives = 66/115 (57%), Gaps = 4/115 (3%)

Query: 107 DKSIKKDKKTHKKKE--KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           DK+ ++  +T K+ E  ++++++ + K  +++ +   ++  +  +E      + ++ EK 
Sbjct: 629 DKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKR 688

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           ++E K  K  E  ++ +E+  K +K+ EK +K+ ++ ++L+++  K K   KER+
Sbjct: 689 REEIK--KTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKERA 741



 Score = 35.4 bits (82), Expect = 0.065
 Identities = 39/160 (24%), Positives = 76/160 (47%), Gaps = 6/160 (3%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD-KDKIKNKEKDKEK-----KEKEKGKKEE 156
           EI++ +   K+ K  K+K +E IK  +  ++  D+++  EK   +        E+  KE 
Sbjct: 274 EIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKEL 333

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           +EK+E  E+ ++K  ++++  E+  E+    E+   K+++ +   K+L     +K +KE 
Sbjct: 334 EEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKEL 393

Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
           E     KE I +   K    + E  K     K  I+ +K 
Sbjct: 394 EELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKK 433



 Score = 34.7 bits (80), Expect = 0.12
 Identities = 42/166 (25%), Positives = 81/166 (48%), Gaps = 26/166 (15%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK----EKDKEKKEKEKGKK 154
           + L E+++   + ++ K  KK+E E++KK+      +K++ +    EK KE+ E+E  K 
Sbjct: 352 KRLEELEERHELYEEAKA-KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKI 410

Query: 155 EEKEKKEEKEKKEEKEFKMK-----------------EDKEKPVEKIRKEEKDSEKEKKS 197
             +  + +KE KE K+   +                 E +++ +E+   E K  EKE K 
Sbjct: 411 TARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKE 470

Query: 198 -KDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 242
            ++K++KL+KE    ++ EK      + +  K   +   ++EEK K
Sbjct: 471 IEEKERKLRKE---LRELEKVLKKESELIKLKELAEQLKELEEKLK 513



 Score = 32.3 bits (74), Expect = 0.67
 Identities = 30/114 (26%), Positives = 58/114 (50%), Gaps = 11/114 (9%)

Query: 140 KEKDKEKKEKEKG-KKEEKEKKEEKEKKEEKEFKMKE---------DKEKPVEKIRKEEK 189
           KE  +  +  EK  K+ E  ++  KEK++E E  ++E         +  + +EK+ KE K
Sbjct: 172 KEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231

Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKR 243
           + E+ K+  ++ +K  +     K+K +E+    +E I +L  +   ++EEK K 
Sbjct: 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEI-EELEEKVKE 284



 Score = 29.3 bits (66), Expect = 5.0
 Identities = 42/173 (24%), Positives = 81/173 (46%), Gaps = 14/173 (8%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEK--EKIKKKKDKKD--KDKIKNKEKDKEKKEKEKGK 153
            + L  +  +K  K+ ++  K KE+  E+I K   +    K +IK  +K  E+ +K KGK
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGK 437

Query: 154 ----------KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
                     +  KE  EE   + ++  K  ++ E+   K+RKE ++ EK  K + +  K
Sbjct: 438 CPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK 497

Query: 204 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
           LK+   + K+ E++   +  E + K   ++    E+  K +   K + K ++ 
Sbjct: 498 LKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEK 550


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 44.0 bits (104), Expect = 1e-04
 Identities = 23/113 (20%), Positives = 53/113 (46%), Gaps = 1/113 (0%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           +K     ++++  K++++ +  ++K   +++++K  EK++   +++K + EE  K+   +
Sbjct: 71  QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           +K+ +E   K        K   E K +    K    + K K E    KK   E
Sbjct: 131 QKQAEEAAAKA-AAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 25/103 (24%), Positives = 45/103 (43%), Gaps = 3/103 (2%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE---KEKKEEKEKKEEKEFKMK 174
           K+ EKE++  ++ KK  ++   +   K+K+ +E   K     K K E + K+     K  
Sbjct: 104 KQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKA 163

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
             + K   +    +K + + KK  + +   K     KKK E E
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAE 206



 Score = 40.2 bits (94), Expect = 0.001
 Identities = 29/124 (23%), Positives = 48/124 (38%), Gaps = 11/124 (8%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD------KEKKEKEKGKKEE 156
           E KK       +   K+K+ E+   K     K K + + K       K   E +K  + E
Sbjct: 115 EQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAE 174

Query: 157 KEKKEEKEKKEEKE----FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
             KK   E K++ E     K   + +K  E   K++  +E +KK+   + K    K   +
Sbjct: 175 AAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKA-AAEAKAAAAKAAAE 233

Query: 213 KKEK 216
            K  
Sbjct: 234 AKAA 237



 Score = 37.5 bits (87), Expect = 0.013
 Identities = 24/131 (18%), Positives = 46/131 (35%), Gaps = 11/131 (8%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-----------DKIKNKEKDKEKKEKEKGK 153
           K+    +K  +    K     K K + + K           +  K  E +  KK   + K
Sbjct: 125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAK 184

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           K+ + +   K   E K+    E K+K   + +K+     K   +K   +     +     
Sbjct: 185 KKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAA 244

Query: 214 KEKERSSHEKE 224
           K  E+++  K 
Sbjct: 245 KAAEKAAAAKA 255



 Score = 36.7 bits (85), Expect = 0.020
 Identities = 21/100 (21%), Positives = 47/100 (47%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           +++  K  +++ KK + +   + + K+  E+E+ K+ EKE+   +E+K++ E   K+   
Sbjct: 70  QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL 129

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           K  +      K +   K   + + K      KK   E ++
Sbjct: 130 KQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169



 Score = 36.3 bits (84), Expect = 0.030
 Identities = 19/113 (16%), Positives = 62/113 (54%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
           + + ++K  ++ E+++ KKE+++ +E ++K+  ++ ++K+ +++ +    ++++  E  K
Sbjct: 66  RQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125

Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
           ++  K K+ ++   K     K ++  E +       K   + ++K + E++ K
Sbjct: 126 QAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKK 178



 Score = 35.2 bits (81), Expect = 0.069
 Identities = 23/113 (20%), Positives = 44/113 (38%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
              K K+  + K+     +K   + KK  + +   K   + K+K E E   K   E K++
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK 202

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
            E + +K+   +  K+   E      K + + K + +K    K  +     K 
Sbjct: 203 AEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255



 Score = 33.6 bits (77), Expect = 0.18
 Identities = 23/92 (25%), Positives = 35/92 (38%), Gaps = 3/92 (3%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK---EEKEKKE 167
           KK +    KK   + KKK + +   K   + K K + E +K    E +KK   E K    
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAA 228

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
           +   + K   EK       E+  + K     D
Sbjct: 229 KAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260



 Score = 32.9 bits (75), Expect = 0.36
 Identities = 23/120 (19%), Positives = 57/120 (47%), Gaps = 11/120 (9%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE------EKEFKM 173
           K  E+ +KKK+++  ++++ K+  ++++ K+  K+    ++++K+ +E       K+ + 
Sbjct: 75  KRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQA 134

Query: 174 KEDKEKPVEKIRKEEKDSEKE-----KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
           +E   K     + + +   K      KK+  + KK  + +  KK   + +   E E   K
Sbjct: 135 EEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 43.9 bits (104), Expect = 1e-04
 Identities = 35/147 (23%), Positives = 69/147 (46%), Gaps = 11/147 (7%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
           S E+  E    K++ + K   +  EKE   KK  K++K++ K K++  E++E E  ++ +
Sbjct: 407 SDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAK 466

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVE-----------KIRKEEKDSEKEKKSKDKDKKLK 205
            EK   K  K  ++ + +E++E+  E               +++DS+K+  SK      K
Sbjct: 467 VEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANK 526

Query: 206 KEKLKKKKKEKERSSHEKEVIPKLTFK 232
             K   K K+K++     ++   L  +
Sbjct: 527 ISKAAVKVKKKKKKEKSIDLDDDLIDE 553



 Score = 41.2 bits (97), Expect = 0.001
 Identities = 38/154 (24%), Positives = 78/154 (50%), Gaps = 4/154 (2%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKGKKEEK 157
            ++ E++++   +++    + +E  K    + K   ++  K  E  K KKE +   KE+K
Sbjct: 392 AEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKK 451

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPV---EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
           E  EE+E ++E+E K+++   K +   EK +KEE++ E ++++          K  KK+ 
Sbjct: 452 ESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQD 511

Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
            K++SS + +       K    +++K K+E S  
Sbjct: 512 SKKKSSSKLDKAANKISKAAVKVKKKKKKEKSID 545



 Score = 38.5 bits (90), Expect = 0.006
 Identities = 31/156 (19%), Positives = 67/156 (42%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           ++E+ ++   K    +    E  +++ E +K KKE K + +EK++ +E+E    E++ K 
Sbjct: 408 DEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKV 467

Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEK 240
            +   K  K SEK +K +++++  ++    K      +S+ +++   K + K      + 
Sbjct: 468 EKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKI 527

Query: 241 TKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV 276
           +K     K   K  KS         +E         
Sbjct: 528 SKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDD 563



 Score = 36.2 bits (84), Expect = 0.032
 Identities = 38/154 (24%), Positives = 75/154 (48%), Gaps = 7/154 (4%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE--KDKEKKEKEKGKK 154
           S E+  +  +D    ++    +KK   K+K+ +D ++   + + +  +  E ++KE+   
Sbjct: 334 SEEEDEDDDEDDDDGENPWMLRKKL-GKLKEGEDDEENSGLLSMKFMQRAEARKKEENDA 392

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
           E +E + E E +EE +   +E+ E+P +K     K   +  + + + KKLKKE  K + K
Sbjct: 393 EIEELRRELEGEEESD---EEENEEPSKKNVGRRKFGPENGEKEAESKKLKKEN-KNEFK 448

Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
           EK+ S  E+E+  +   K      +  KR    +
Sbjct: 449 EKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQ 482



 Score = 35.0 bits (81), Expect = 0.092
 Identities = 23/126 (18%), Positives = 69/126 (54%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E+   +   K +++ +   K++   +I++ + + + ++  ++E+++E  +K  G+++   
Sbjct: 368 EENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           +  EKE + +K  K  +++ K  ++  +EE+  ++E+   +K      ++ +K +KE+E 
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487

Query: 219 SSHEKE 224
              ++E
Sbjct: 488 EELDEE 493



 Score = 33.5 bits (77), Expect = 0.25
 Identities = 30/159 (18%), Positives = 55/159 (34%), Gaps = 4/159 (2%)

Query: 47  KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 106
           + K  P++     E    K++   + KE  +  + E         + KV +       K 
Sbjct: 422 RRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKA 481

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK- 165
            K  ++++   +    +          K   K K   K  K   K  K   + K++K+K 
Sbjct: 482 QKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKE 541

Query: 166 ---KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
                + +   +ED  K      ++E D E     K KD
Sbjct: 542 KSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKD 580



 Score = 32.7 bits (75), Expect = 0.45
 Identities = 30/132 (22%), Positives = 65/132 (49%), Gaps = 5/132 (3%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
           K+   ED  E     S+K  ++   +K++E     + ++ + +++ +E+  E++ +E  K
Sbjct: 360 KLKEGEDDEENSGLLSMKFMQRAEARKKEEN--DAEIEELRRELEGEEESDEEENEEPSK 417

Query: 154 KEEKEKK---EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
           K    +K   E  EK+ E +   KE+K +  EK   +E++  ++++    +K   K   +
Sbjct: 418 KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKR 477

Query: 211 KKKKEKERSSHE 222
            +K +KE    E
Sbjct: 478 SEKAQKEEEEEE 489



 Score = 31.6 bits (72), Expect = 0.88
 Identities = 22/136 (16%), Positives = 55/136 (40%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
            +++  + KK+   +++ +  ++ + EK+  K  K+ +   K +E+++  +E    K   
Sbjct: 442 ENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTS 501

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
              K  K++  +K+   K DK          +   +K+K+          ++    K + 
Sbjct: 502 SVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDV 561

Query: 217 ERSSHEKEVIPKLTFK 232
           +    E +      FK
Sbjct: 562 DDEEDEDDEELPFLFK 577



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 38/188 (20%), Positives = 69/188 (36%), Gaps = 27/188 (14%)

Query: 45  KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 104
                +   +K   +E+    ++ +P  K  S V            +  K+  + +    
Sbjct: 470 VANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISK 529

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--------- 155
              K  KK KK       + +  ++D    D    +++D E+      +K+         
Sbjct: 530 AAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKDLIKEAFAGD 589

Query: 156 ------EKEKKEEKEKKEEKEFKM------------KEDKEKPVEKIRKEEKDSEKEKKS 197
                 EKEKKE  E+++ KE  +             + ++K  ++ R+     E  KK 
Sbjct: 590 DVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGVKKE 649

Query: 198 KDKDKKLK 205
           K KDKKLK
Sbjct: 650 KRKDKKLK 657


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 43.1 bits (101), Expect = 3e-04
 Identities = 29/139 (20%), Positives = 63/139 (45%), Gaps = 13/139 (9%)

Query: 99   EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK-----EKGK 153
            E++ +I +D S   ++K   +  +E + + + K ++    N E D   KE      E   
Sbjct: 3890 EEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKD 3949

Query: 154  KEEKEKKEEKE----KKEEKEFKMKEDKEKPVEKIRK----EEKDSEKEKKSKDKDKKLK 205
            ++EKE +EE        +E +  ++E+  +P  +       E+   ++++    KD  L+
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009

Query: 206  KEKLKKKKKEKERSSHEKE 224
               ++   + KE +  EK+
Sbjct: 4010 DMDMEAADENKEEADAEKD 4028



 Score = 33.4 bits (76), Expect = 0.28
 Identities = 21/111 (18%), Positives = 55/111 (49%), Gaps = 4/111 (3%)

Query: 100  DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK--EKKEKEKGKKEEK 157
            D+  + KD  + +++ +  ++E + + ++ +   +D + N   +K  ++  +E   + E+
Sbjct: 3863 DMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPED-LSNSLNEKLWDEPNEEDLLETEQ 3921

Query: 158  EKKEEKEKKEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
            +  E+     E +   KED  K   +K R+E++D E+       D +++ +
Sbjct: 3922 KSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPD 3972



 Score = 31.1 bits (70), Expect = 1.5
 Identities = 35/186 (18%), Positives = 77/186 (41%), Gaps = 10/186 (5%)

Query: 44   SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDL-- 101
            SK  D  + +DK   +EK   +E +     +     D++   +   P    +   EDL  
Sbjct: 3937 SKEDDNKALEDKDR-QEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKL 3995

Query: 102  ----GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK---DKEKKEKEKGKK 154
                G++ KD  ++        + KE+   +KD+  +D+   +E    D++ ++ +    
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDL 4055

Query: 155  EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
             E ++K  ++  EE   + +E  E  V+   + E+    E ++ D   K+  +      +
Sbjct: 4056 AEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAE 4115

Query: 215  EKERSS 220
              E ++
Sbjct: 4116 ADEENT 4121


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 42.7 bits (101), Expect = 3e-04
 Identities = 32/132 (24%), Positives = 58/132 (43%), Gaps = 6/132 (4%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-K 160
           GEIK   ++ ++    + + +E  K+ K  ++K +    E +K+++E EK K +  +K  
Sbjct: 78  GEIKGVFTLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAW 137

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           ++  KK +      E  +    K   +EK   E +    +    L  E+L  K K K   
Sbjct: 138 KKLAKKYDSNL--SEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEEL--KAKIKTLF 193

Query: 220 SHEKEVIPKLTF 231
           S  K  +  LT 
Sbjct: 194 SSNKPELALLTL 205



 Score = 42.3 bits (100), Expect = 4e-04
 Identities = 26/123 (21%), Positives = 52/123 (42%), Gaps = 4/123 (3%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           +N E + + +E +K  K+ +EK E+ E + EK+ +  E  +        ++   + +   
Sbjct: 89  ENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNL 148

Query: 198 KDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKR-ESSPKIVIKPVKS 256
            +  K L  +K  K+K  KE  S        L+ +   +++ K K   SS K  +  +  
Sbjct: 149 SEALKGLNYKKNFKEKLLKELKSVILNASSLLSLE---ELKAKIKTLFSSNKPELALLTL 205

Query: 257 PSP 259
              
Sbjct: 206 SVI 208



 Score = 32.3 bits (74), Expect = 0.65
 Identities = 28/140 (20%), Positives = 50/140 (35%), Gaps = 5/140 (3%)

Query: 91  NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKEK 149
           NI      E+L +  K    K ++   + ++KE+  +K   K  DK   K  K  +    
Sbjct: 90  NIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLS 149

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
           E  K    +K  +++  +E +               +E K   K   S +K +       
Sbjct: 150 EALKGLNYKKNFKEKLLKELK---SVILNASSLLSLEELKAKIKTLFSSNKPELALLTLS 206

Query: 210 KKKKKEKERSSH-EKEVIPK 228
                E E++   EK +I  
Sbjct: 207 VIDFDEIEQAEILEKSIIGS 226



 Score = 31.9 bits (73), Expect = 0.70
 Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 4/94 (4%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDK 202
           +E  EK    K+EK K ++K        ++KED +   ++ +  EK  +  EK+ K  + 
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVA-ELKEDIDAYQKEKKGLEKAINSLEKEIKQLEA 436

Query: 203 KLKKEKLKKKKKEKERSSHEK--EVIPKLTFKFG 234
           ++K  + + K+ EK+ ++ E   + I KL   +G
Sbjct: 437 EIKALEKEIKELEKQLTNIEPTADEINKLLKAYG 470



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 5/80 (6%)

Query: 123 EKIKK-----KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           E I++        KK+K+K K K       E ++     +++K+  EK      K  +  
Sbjct: 375 ELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQL 434

Query: 178 EKPVEKIRKEEKDSEKEKKS 197
           E  ++ + KE K+ EK+  +
Sbjct: 435 EAEIKALEKEIKELEKQLTN 454



 Score = 29.6 bits (67), Expect = 4.3
 Identities = 10/37 (27%), Positives = 19/37 (51%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           +  KK  ++ I   + +  + + + K  +KE KE EK
Sbjct: 414 QKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEK 450


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 41.2 bits (97), Expect = 4e-04
 Identities = 26/84 (30%), Positives = 55/84 (65%), Gaps = 3/84 (3%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
           KK   K + K++ K+  ++++E E+ ++EE++K EEK + E KE   +E+ E+  EK ++
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKE---EEELEEEREKKKE 57

Query: 187 EEKDSEKEKKSKDKDKKLKKEKLK 210
           EE+  E+E++++ + ++ +K K  
Sbjct: 58  EEERKEREEQARKEQEEYEKLKSS 81



 Score = 38.1 bits (89), Expect = 0.003
 Identities = 24/81 (29%), Positives = 52/81 (64%), Gaps = 8/81 (9%)

Query: 118 KKKEKEKIKKKKDKKDKD-KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
           + K +EK  +++ ++ ++ + + ++K +EK+E E+ ++EE E++ EK+K+EE+  + +E 
Sbjct: 8   RAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67

Query: 177 KEKPVEKIRKEEKDSEKEKKS 197
                   RKE+++ EK K S
Sbjct: 68  A-------RKEQEEYEKLKSS 81



 Score = 30.1 bits (68), Expect = 1.5
 Identities = 16/70 (22%), Positives = 44/70 (62%), Gaps = 6/70 (8%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE----KKEEK 169
           KK  +K+E E+  K++++ ++++ K KE+++ K+ +E+ +KE++E ++ K     ++E  
Sbjct: 32  KKLEEKREGER--KEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGT 89

Query: 170 EFKMKEDKEK 179
           +    +++  
Sbjct: 90  DKLSADEESN 99



 Score = 29.7 bits (67), Expect = 2.2
 Identities = 17/75 (22%), Positives = 48/75 (64%), Gaps = 1/75 (1%)

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-KEKKSKDKDKKLKKEKLK 210
            K   K++ + +EK+  ++ +  E++E+   K  +E+++ E KE++  +++++ KKE+ +
Sbjct: 1   KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60

Query: 211 KKKKEKERSSHEKEV 225
           +K++E++    ++E 
Sbjct: 61  RKEREEQARKEQEEY 75


>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
           type) [DNA replication, recombination, and repair].
          Length = 1444

 Score = 42.7 bits (101), Expect = 4e-04
 Identities = 26/116 (22%), Positives = 48/116 (41%), Gaps = 6/116 (5%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           KS+    K   K     I++  +  + D  KNK  + +KK +  G  +   + E  +  E
Sbjct: 113 KSLLNKLKLKVKGNNILIEQVLNNPEFDHFKNKSPELQKKLQSFGFPQLLIEFEVNDISE 172

Query: 168 EKEFKMK------EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           E+EF+        E ++   E +  E+K   +  K +        +K +K K  +E
Sbjct: 173 EQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGRKIKSTEE 228


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 39.9 bits (94), Expect = 4e-04
 Identities = 23/92 (25%), Positives = 53/92 (57%), Gaps = 7/92 (7%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE------KE 158
           K+ +   K ++   +K +++++K K+K  KD     E  +EKKEKE  KK +      ++
Sbjct: 21  KQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQK 80

Query: 159 KKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEK 189
            +++ +K++++E +   DK  K ++++ K++ 
Sbjct: 81  LQQDLQKRQQEELQKILDKINKAIKEVAKKKG 112



 Score = 29.5 bits (67), Expect = 1.8
 Identities = 14/79 (17%), Positives = 48/79 (60%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
           K   K  EK+ +K++ E  K E++ +K +++ +++     +  +EK  ++++K+ ++ ++
Sbjct: 17  KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQR 76

Query: 194 EKKSKDKDKKLKKEKLKKK 212
           +++   +D + ++++  +K
Sbjct: 77  KQQKLQQDLQKRQQEELQK 95



 Score = 29.1 bits (66), Expect = 2.7
 Identities = 19/77 (24%), Positives = 48/77 (62%), Gaps = 8/77 (10%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKD----KEKKEKEKGK--KEEKEKKEEKEKKEEKEF- 171
           K  +++++K+  K+  + ++  EK+    KEK +K+     +  +EKKE++ +K+ +EF 
Sbjct: 17  KAAQKQLEKEFKKRQAE-LEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQ 75

Query: 172 KMKEDKEKPVEKIRKEE 188
           + ++  ++ ++K ++EE
Sbjct: 76  RKQQKLQQDLQKRQQEE 92



 Score = 27.9 bits (63), Expect = 5.6
 Identities = 18/73 (24%), Positives = 43/73 (58%), Gaps = 6/73 (8%)

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
           K  +++ +KE K+++ E EK E++  K+KE  +K           SE  ++ K+K+ + K
Sbjct: 17  KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQK------DAATLSEAAREKKEKELQKK 70

Query: 206 KEKLKKKKKEKER 218
            ++ ++K+++ ++
Sbjct: 71  VQEFQRKQQKLQQ 83



 Score = 27.5 bits (62), Expect = 7.5
 Identities = 17/72 (23%), Positives = 40/72 (55%), Gaps = 7/72 (9%)

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK-LKKK 212
           K  +++ E++ KK + E +  E      ++++K ++  +K+  +  +  + KKEK L+KK
Sbjct: 17  KAAQKQLEKEFKKRQAELEKLE------KELQKLKEKLQKDAATLSEAAREKKEKELQKK 70

Query: 213 KKEKERSSHEKE 224
            +E +R   + +
Sbjct: 71  VQEFQRKQQKLQ 82


>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031).  This
           protein is expressed in Plasmodium; its function is
           unknown. It may be the product of gene family pyst-b.
          Length = 228

 Score = 41.2 bits (97), Expect = 5e-04
 Identities = 32/107 (29%), Positives = 48/107 (44%), Gaps = 15/107 (14%)

Query: 113 DKKTHK-----KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           DKKT K     +KE E++KK+ D K   ++  +    ++  K         K E     E
Sbjct: 107 DKKTKKLINELRKELEEVKKELDNKRNGELAIQPIQDKRIIK---------KDENSSVSE 157

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
            ++FK  E++E  +E    E   S   KK K   +KLKK + K  K 
Sbjct: 158 HEDFKQLENEENFLETEYNEITSSNNYKKLKIN-RKLKKAEKKLIKS 203



 Score = 28.1 bits (63), Expect = 7.8
 Identities = 21/83 (25%), Positives = 38/83 (45%), Gaps = 9/83 (10%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK-------KDKDKIKNKEKDKEKKEKE--K 151
           L E+KK+   K++ +   +  ++K   KKD+       +D  +++N+E   E +  E   
Sbjct: 121 LEEVKKELDNKRNGELAIQPIQDKRIIKKDENSSVSEHEDFKQLENEENFLETEYNEITS 180

Query: 152 GKKEEKEKKEEKEKKEEKEFKMK 174
               +K K   K KK EK+    
Sbjct: 181 SNNYKKLKINRKLKKAEKKLIKS 203


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 41.0 bits (96), Expect = 5e-04
 Identities = 29/124 (23%), Positives = 62/124 (50%), Gaps = 9/124 (7%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE------ 156
             + +K   K +   + +E  K K KK KK   +I   ++ +E  ++     +E      
Sbjct: 79  TFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLL 138

Query: 157 ---KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
              +E+ EE + K +++ +  E+ EK +E++  E+   EK  + K+K K+L++ K + ++
Sbjct: 139 YLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEE 198

Query: 214 KEKE 217
             +E
Sbjct: 199 LLEE 202



 Score = 39.5 bits (92), Expect = 0.001
 Identities = 30/141 (21%), Positives = 62/141 (43%), Gaps = 12/141 (8%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
            S+  G +K D  I+KD+   K   +   +  K K     I+   +  +KK K+  K   
Sbjct: 52  KSKGRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSIL 111

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK----------- 205
           +  + ++ ++   E    + +  P+     +E+  E + K K+K ++L+           
Sbjct: 112 EIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELED 171

Query: 206 -KEKLKKKKKEKERSSHEKEV 225
            K+ L+K  +EKE+    +E+
Sbjct: 172 EKDLLEKLLEEKEKKKELEEL 192


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 41.8 bits (99), Expect = 5e-04
 Identities = 38/157 (24%), Positives = 78/157 (49%), Gaps = 33/157 (21%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK---GKKE--------EKEKKEEKE--K 165
           K+  EK     ++  K  I+  +K+ E  +KE     K+E        E+E KE +   +
Sbjct: 20  KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQ 79

Query: 166 KEEKEFKMKEDK-EKPVEKIRKEEKDSEK-EKKSKDKDKKLKK------EKLKKKKKEKE 217
           + E+    +E+  ++ +E + K+E++ EK EK+  +K+K L +      E + ++++E E
Sbjct: 80  RLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELE 139

Query: 218 RSS-----HEKEVIPKLTFKFGTDMEEKTKRESSPKI 249
           R S       KE++ +       ++EE+ + E++  I
Sbjct: 140 RISGLTQEEAKEILLE-------EVEEEARHEAAKLI 169



 Score = 41.8 bits (99), Expect = 6e-04
 Identities = 40/157 (25%), Positives = 87/157 (55%), Gaps = 24/157 (15%)

Query: 103 EIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG-----KKEE 156
           E KK+ +++KK+     K+E  K++ + +++ K++ +N+ +  E++  ++      K E 
Sbjct: 40  EAKKEAETLKKEALLEAKEEVHKLRAELERELKER-RNELQRLERRLLQREETLDRKMES 98

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK---LKKEKLKKK- 212
            +KKEE  +K+EKE     +KEK +++  KEE+  E   + +++ ++   L +E+ K+  
Sbjct: 99  LDKKEENLEKKEKEL---SNKEKNLDE--KEEELEELIAEQREELERISGLTQEEAKEIL 153

Query: 213 -KKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
            ++ +E + HE   + K       ++EE+ K E+  K
Sbjct: 154 LEEVEEEARHEAAKLIK-------EIEEEAKEEADKK 183



 Score = 28.4 bits (64), Expect = 8.2
 Identities = 28/108 (25%), Positives = 62/108 (57%), Gaps = 7/108 (6%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK-- 151
           ++   E+  + K +   KK++   +KKEKE   K+K+  +K++   +   ++++E E+  
Sbjct: 84  RLLQREETLDRKMESLDKKEENL-EKKEKELSNKEKNLDEKEEELEELIAEQREELERIS 142

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
           G  +E+ K+   E+ EE   + + +  K +++I +E K+ E +KK+K+
Sbjct: 143 GLTQEEAKEILLEEVEE---EARHEAAKLIKEIEEEAKE-EADKKAKE 186


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 40.9 bits (96), Expect = 5e-04
 Identities = 20/78 (25%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
            +K+    K++  +++  ++ + K+E+K K+E+ E +E ++ ++ ED E+      K E+
Sbjct: 1   MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEE--SNEVKIEE 58

Query: 190 DSEKEKKSKDKDKKLKKE 207
             ++  K K+++KKL+ E
Sbjct: 59  LKDENNKLKEENKKLENE 76


>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 40.0 bits (94), Expect = 7e-04
 Identities = 26/103 (25%), Positives = 55/103 (53%), Gaps = 2/103 (1%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
            ++ ++ +KK KD ++K+++K   +  + + K    K+ + +  ++ KK+EKE   +  K
Sbjct: 65  IEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKKKKEKER 218
              ++K   ++   +K+     K K+L K  EK +KK   KE+
Sbjct: 125 PYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEK 167


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 41.6 bits (98), Expect = 7e-04
 Identities = 29/115 (25%), Positives = 52/115 (45%), Gaps = 15/115 (13%)

Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKEEKEKKEE 168
           I++  K+ +  +    ++ K + DK K  ++   +  KK K+    +    ++  E KE 
Sbjct: 333 IEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEA 392

Query: 169 -------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
                  K    K + +K +E+IR+E  + E   KSK K +K      KK+  EK
Sbjct: 393 IAYYESAKTALEKAEGKKAIEEIREELIE-EGLLKSKKKKRK------KKEWFEK 440



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 12/62 (19%), Positives = 32/62 (51%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
            + +  S  K+   + +  K  ++K + KK  ++I+ +  ++   + +K K+++KE  E+
Sbjct: 381 NLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440

Query: 163 KE 164
             
Sbjct: 441 FR 442



 Score = 30.4 bits (69), Expect = 2.3
 Identities = 12/66 (18%), Positives = 28/66 (42%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
               ++ +     K+        K   +  +  K  E+ +E+  +E   K +K+K+++KE
Sbjct: 377 GAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKE 436

Query: 165 KKEEKE 170
             E+  
Sbjct: 437 WFEKFR 442



 Score = 28.9 bits (65), Expect = 6.7
 Identities = 14/65 (21%), Positives = 26/65 (40%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
           +  +L     +           K   EK + KK  ++  +   +E   + K+K++ KKE 
Sbjct: 378 AKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEW 437

Query: 157 KEKKE 161
            EK  
Sbjct: 438 FEKFR 442


>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
           region.  This family includes the N-terminal regions of
           the junctin, junctate and aspartyl beta-hydroxylase
           proteins. Junctate is an integral ER/SR membrane calcium
           binding protein, which comes from an alternatively
           spliced form of the same gene that generates aspartyl
           beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
           catalyzes the post-translational hydroxylation of
           aspartic acid or asparagine residues contained within
           epidermal growth factor (EGF) domains of proteins.
          Length = 240

 Score = 40.7 bits (95), Expect = 8e-04
 Identities = 33/170 (19%), Positives = 72/170 (42%), Gaps = 12/170 (7%)

Query: 59  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 118
           KEK +S+    P+E E    ++ +L V      + +        E+K+      +K    
Sbjct: 68  KEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEE--------EVKEQLQSLLEKIVVS 119

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           K+E++   K+    +   +  ++ D  ++  E GK  E    E ++    +E   ++ K+
Sbjct: 120 KQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHE----ETEDSYHVEETASEQYKQ 175

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
              EK  ++E +  KE   K +  K + + + ++  ++E +  E     K
Sbjct: 176 DMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIK 225



 Score = 37.2 bits (86), Expect = 0.010
 Identities = 23/125 (18%), Positives = 54/125 (43%), Gaps = 7/125 (5%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
              +     ++ +   +E+ +    K K K EE+ K++ +   E+        K++    
Sbjct: 71  STSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVS----KQEEDGP 126

Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKR 243
            ++ + D +K   ++D D + +  +  K  +E E S H +E   +   ++  DM+EK   
Sbjct: 127 GKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASE---QYKQDMKEKASE 183

Query: 244 ESSPK 248
           + +  
Sbjct: 184 QENED 188



 Score = 30.7 bits (69), Expect = 1.2
 Identities = 32/175 (18%), Positives = 77/175 (44%), Gaps = 4/175 (2%)

Query: 25  LQHPLIPQPLVHPPGTPPPSKMKDKVSP-KDKTSPKEKMSSKEKTSPKEKEYSKVKDVEL 83
           L+     +P V P    P ++ + +++  K K   +E++  + ++  ++   SK ++   
Sbjct: 67  LKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGP 126

Query: 84  GVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKK--THKKKEKEKIKKKKDKKDKDKIKNKE 141
           G  P +    K   +ED  + ++     K  +        +E   ++  +  K+K   +E
Sbjct: 127 GKEPQLD-EDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQE 185

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
            +  K+  EK ++ + E  +  E+  ++E    ED +   E++ KE  + ++E  
Sbjct: 186 NEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQQEVP 240


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 41.5 bits (98), Expect = 8e-04
 Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 4/61 (6%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           +E+E         ++ K++ K++  EKK +    KEE EK   K+K E    K K    +
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAEKKAE----KEEAEKAAAKKKAEAAAKKAKGPDGE 452

Query: 180 P 180
            
Sbjct: 453 T 453



 Score = 39.5 bits (93), Expect = 0.003
 Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 10/87 (11%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
             + ++++ +  N    + KK ++K +K EK+ ++E+ +K           +K  E   K
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAA--------AKKKAEAAAK 444

Query: 187 EEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           + K  + E K  D D     EKL + +
Sbjct: 445 KAKGPDGETKKVDPD--PLGEKLARTE 469



 Score = 36.4 bits (85), Expect = 0.029
 Identities = 18/62 (29%), Positives = 29/62 (46%)

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           +    E +E++ E       +++K  +K RK EK +EKE+  K   KK  +   KK K  
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449

Query: 216 KE 217
             
Sbjct: 450 DG 451



 Score = 33.8 bits (78), Expect = 0.17
 Identities = 19/66 (28%), Positives = 30/66 (45%), Gaps = 4/66 (6%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           + +L   ++ K  KK +K  KK EKE+ +K   KK  +        K K    + KK + 
Sbjct: 403 NGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK----KAKGPDGETKKVDP 458

Query: 158 EKKEEK 163
           +   EK
Sbjct: 459 DPLGEK 464



 Score = 33.0 bits (76), Expect = 0.38
 Identities = 14/51 (27%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 119 KKEKEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
             E++K++KK +  + K + +  EK   KK+ E   K+ K    E +K + 
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458



 Score = 30.3 bits (69), Expect = 2.2
 Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 12/59 (20%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
             ++K  +KK     ++K +KK        EK++ +K   K K E   KK +    E K
Sbjct: 408 PAERKKLRKK-----QRKAEKK-------AEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454



 Score = 29.9 bits (68), Expect = 2.8
 Identities = 14/65 (21%), Positives = 28/65 (43%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
               GE ++ ++        KK  K++ K +K  + ++  K   K K +   +K K  + 
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451

Query: 158 EKKEE 162
           E K+ 
Sbjct: 452 ETKKV 456



 Score = 29.9 bits (68), Expect = 3.0
 Identities = 12/49 (24%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           E++E     +   E+   ++K+ K + KK +KE+ +K   +K+  +  K
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAE-KKAEKEEAEKAAAKKKAEAAAK 444



 Score = 29.5 bits (67), Expect = 4.0
 Identities = 16/72 (22%), Positives = 28/72 (38%), Gaps = 3/72 (4%)

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
                  E   E+         + KK +   KK +K + K +K+E E+++ +K+      
Sbjct: 388 HDKPLLAEGEEEEGENGNLSPAERKKLR---KKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444

Query: 231 FKFGTDMEEKTK 242
              G D E K  
Sbjct: 445 KAKGPDGETKKV 456


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 41.1 bits (96), Expect = 9e-04
 Identities = 41/195 (21%), Positives = 88/195 (45%), Gaps = 17/195 (8%)

Query: 62  MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 121
           +  ++   P E+E       +LG    V  I    ++         + I++  K  +K +
Sbjct: 245 LKLEDNYEPSEQEL------KLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIK 298

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
             + K +  K D +K +N     ++K +E   K EK  K E E KEE+   ++ + ++  
Sbjct: 299 TLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEK-LKSEIELKEEEIKALQSNIDELH 357

Query: 182 EKIRK--------EEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKERSSHEK-EVIPKLTF 231
           +++RK        E  + E+EK +++ DK  ++ +KL K  K ++  +    + + K   
Sbjct: 358 KQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLR 417

Query: 232 KFGTDMEEKTKRESS 246
           ++ + ++  T+  S 
Sbjct: 418 QYDSLIQNITRSRSQ 432


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 39.7 bits (93), Expect = 0.001
 Identities = 28/94 (29%), Positives = 52/94 (55%), Gaps = 3/94 (3%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK---KEEKEFKMKEDK 177
            +    KKK K+  ++I+  +K+ E+K+K K KK++ +KK++K+K    ++K+ K ++  
Sbjct: 55  AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           EK  E   ++   S  E  S   + K +K  L K
Sbjct: 115 EKEAEDKLEDLTKSYSETLSTLSELKPRKYALHK 148



 Score = 39.7 bits (93), Expect = 0.001
 Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 5/88 (5%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           K KK    +E EK+KK+ ++K K K       K+K +K+K K ++K+  ++ +K E+K+ 
Sbjct: 61  KKKKKELAEEIEKVKKEYEEKQKWK-----WKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
           K  EDK + + K   E   +  E K + 
Sbjct: 116 KEAEDKLEDLTKSYSETLSTLSELKPRK 143



 Score = 38.1 bits (89), Expect = 0.004
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 9/86 (10%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
               K+K +  +E EK KKE     EEK+K + K+ K K+ K+K      K++KD +K+ 
Sbjct: 58  TEAKKKKKELAEEIEKVKKE----YEEKQKWKWKKKKSKKKKDKD-----KDKKDDKKDD 108

Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSH 221
           KS+ KD+K  ++KL+   K    +  
Sbjct: 109 KSEKKDEKEAEDKLEDLTKSYSETLS 134



 Score = 36.6 bits (85), Expect = 0.011
 Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
           D E  E +K KKE  E+ E+ +K+ E++ K K  K+K      K++KD +K+KK   KD 
Sbjct: 54  DAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKK-----SKKKKDKDKDKKDDKKDD 108

Query: 203 KLKKEKLKKKKKEKERSSHEKEVIP 227
           K +K+  K+ + + E  +       
Sbjct: 109 KSEKKDEKEAEDKLEDLTKSYSETL 133



 Score = 35.8 bits (83), Expect = 0.019
 Identities = 22/62 (35%), Positives = 30/62 (48%), Gaps = 3/62 (4%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
            K KK   KK+K+K K KKD K  DK    EK  EK+ ++K +   K   E      E +
Sbjct: 84  WKWKKKKSKKKKDKDKDKKDDKKDDK---SEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140

Query: 171 FK 172
            +
Sbjct: 141 PR 142


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 41.1 bits (97), Expect = 0.001
 Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 1/70 (1%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           + E  K + I + +   K    EK   K  KK   K   K++ +++K K K +  + +  
Sbjct: 364 IEEPLKARVIDELRPKTKA-PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422

Query: 161 EEKEKKEEKE 170
            ++ K     
Sbjct: 423 GKRRKPSGTS 432



 Score = 40.3 bits (95), Expect = 0.002
 Identities = 17/52 (32%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
            +KKT K  +K   K+ + KK+K+K K K K + +  K  GK+ +     E+
Sbjct: 384 SEKKTGKPSKKVLAKRAE-KKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 39.5 bits (93), Expect = 0.003
 Identities = 16/52 (30%), Positives = 24/52 (46%)

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           +E   K K   EK   K  K+      EKK K+K+K   K++ +  K   +R
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425



 Score = 38.4 bits (90), Expect = 0.006
 Identities = 15/45 (33%), Positives = 21/45 (46%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
           K  K +   +   K+KEKEK K KK  +D   I  + K     E+
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 37.6 bits (88), Expect = 0.012
 Identities = 17/78 (21%), Positives = 29/78 (37%), Gaps = 24/78 (30%)

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
           K K   EK+ GK  +K   +  EKKE                        ++++K K K 
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKE------------------------KEKEKPKVKK 414

Query: 202 KKLKKEKLKKKKKEKERS 219
           +    + + K++K    S
Sbjct: 415 RHRDTKNIGKRRKPSGTS 432



 Score = 36.8 bits (86), Expect = 0.019
 Identities = 15/51 (29%), Positives = 26/51 (50%), Gaps = 1/51 (1%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
           +K      KK   K  +KK +K+K+K K K++ ++ K   K +K     +E
Sbjct: 385 EKKTGKPSKKVLAKRAEKK-EKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434



 Score = 36.8 bits (86), Expect = 0.021
 Identities = 16/58 (27%), Positives = 23/58 (39%)

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            E + K +    K      K V   R E+K+ EKEK    K  +  K   K++K    
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431



 Score = 33.4 bits (77), Expect = 0.21
 Identities = 13/50 (26%), Positives = 25/50 (50%), Gaps = 3/50 (6%)

Query: 177 KEKPVEKIR---KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           K + ++++R   K   + +  K SK    K  ++K K+K+K K +  H  
Sbjct: 369 KARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418



 Score = 33.0 bits (76), Expect = 0.27
 Identities = 15/56 (26%), Positives = 28/56 (50%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
           K K   ++K  K   K   K  ++++++KE+ + K +    K + K RK    SE+
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434


>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62. 
          Length = 217

 Score = 39.8 bits (93), Expect = 0.001
 Identities = 17/72 (23%), Positives = 34/72 (47%)

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
            E ++ K  K+K   + +     ++K +EK +   K    E+  K   ++ K+EK K KK
Sbjct: 16  LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKK 75

Query: 214 KEKERSSHEKEV 225
           K+     +  ++
Sbjct: 76  KKVPLQVNPAQL 87



 Score = 39.4 bits (92), Expect = 0.002
 Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 4/74 (5%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK----EKEKGKKEEKEKKE 161
           +     KDK   +   K   + K  +K K  IK +  ++ KK    EK++ KK+ K+KK 
Sbjct: 19  EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKV 78

Query: 162 EKEKKEEKEFKMKE 175
             +    + F   E
Sbjct: 79  PLQVNPAQLFVDDE 92



 Score = 38.2 bits (89), Expect = 0.004
 Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 9/82 (10%)

Query: 114 KKTHKKKEKEKIKKKKDKK-----DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
           K+  +  E EK K  KDK      +K   ++K  +K K   +    E  +K   +EKKEE
Sbjct: 10  KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69

Query: 169 KEFKMKEDKEKPVEKIRKEEKD 190
           K    K+ K+K V       + 
Sbjct: 70  K----KKPKKKKVPLQVNPAQL 87



 Score = 37.5 bits (87), Expect = 0.008
 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK-PVEKIRKEEKDSE 192
           K  ++  E +K K  K+KG  E   K   ++K  EK   + + +    V+K+  +EK  E
Sbjct: 10  KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69

Query: 193 KEKKSKDK 200
           K+K  K K
Sbjct: 70  KKKPKKKK 77



 Score = 30.5 bits (69), Expect = 1.3
 Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 3/51 (5%)

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           +  E ++    K  K++ + E   K   +DK ++K KL  K +  ER    
Sbjct: 14  RALESEKY---KANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKL 61


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 41.0 bits (96), Expect = 0.001
 Identities = 25/141 (17%), Positives = 49/141 (34%), Gaps = 14/141 (9%)

Query: 113 DKKTHKKKEKEKIKKKKDKKD-----KDKIKNKEKDKEKKEKEKGKKEEK-EKKEEKEKK 166
            +K  +KK      ++   +      +    + E  + KK +   K E K  KKE  +K+
Sbjct: 295 REKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKR 354

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
              E ++  ++E   E+     +   +    K K K L  +    +    E    +    
Sbjct: 355 RAAEREI--NREARQERAAAMARARARRAAVKAKKKGL-IDASPNEDTPSENEESKGSP- 410

Query: 227 PKLTFKFGTDMEEKTKRESSP 247
           P++          +  RE S 
Sbjct: 411 PQVE----ATTTAEPNREPSQ 427



 Score = 35.2 bits (81), Expect = 0.069
 Identities = 17/99 (17%), Positives = 40/99 (40%), Gaps = 2/99 (2%)

Query: 103 EIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKEKEKGKKEEKEKK 160
           EIKK   + K + K  KK+  +K +  + + +++  +         + +    K +K+  
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGL 390

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
            +    E+   + +E K  P +       +  +E   +D
Sbjct: 391 IDASPNEDTPSENEESKGSPPQVEATTTAEPNREPSQED 429



 Score = 34.4 bits (79), Expect = 0.12
 Identities = 14/99 (14%), Positives = 34/99 (34%), Gaps = 5/99 (5%)

Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           IKK +   K + K    +KK+   K +   +E ++E +++          +  +   + K
Sbjct: 332 IKKTRTAEKNEAKA---RKKEIAQKRRAAEREINREARQERAAAMA--RARARRAAVKAK 386

Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           +  + +           EE      +       +  +E 
Sbjct: 387 KKGLIDASPNEDTPSENEESKGSPPQVEATTTAEPNREP 425



 Score = 32.5 bits (74), Expect = 0.44
 Identities = 23/138 (16%), Positives = 49/138 (35%), Gaps = 23/138 (16%)

Query: 140 KEKDKEKKEKEKGKKEEK-------------------EKKEEKEKKEEKEFKMKEDKEKP 180
           +EK KEKK      +  +                    KK    +K E + + KE  +K 
Sbjct: 295 REKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKR 354

Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEK 240
               R+  +++ +E+ +     + ++  +K KKK    +S  ++   +     G+  + +
Sbjct: 355 RAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGSPPQVE 414

Query: 241 TKRESSPKIVIKPVKSPS 258
               + P        S  
Sbjct: 415 ATTTAEP----NREPSQE 428


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 41.4 bits (98), Expect = 0.001
 Identities = 30/120 (25%), Positives = 59/120 (49%), Gaps = 2/120 (1%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           KS+ K +K   +  K  IK   ++ ++D +K K   K  K+ EK      +   E +  +
Sbjct: 111 KSLLKKQKVEVEGNKLIIKVN-NEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSK 169

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
           E+  K +  KE+  EK+ KE  ++ K+ +++ K K+ K    K+   +  +   ++E+ P
Sbjct: 170 EELEKFEAQKEEEDEKLAKEALEAMKKLEAE-KKKQSKNFDPKEGPVQIGKKIDKEEITP 228



 Score = 38.7 bits (91), Expect = 0.007
 Identities = 30/132 (22%), Positives = 59/132 (44%), Gaps = 1/132 (0%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE-EKEKKEEKEFKMKEDKEKP 180
           K  +KK+K + + +K+  K  ++ +++  K K   K  K+ EK      +   + D  K 
Sbjct: 111 KSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKE 170

Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEK 240
             +  + +K+ E EK +K+  + +KK + +KKK+ K     E  V              K
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEITPMK 230

Query: 241 TKRESSPKIVIK 252
              E   ++V++
Sbjct: 231 EINEEERRVVVE 242



 Score = 35.2 bits (82), Expect = 0.072
 Identities = 26/128 (20%), Positives = 58/128 (45%), Gaps = 9/128 (7%)

Query: 91  NIVKVTSSEDLGEIKKDKSIKKDK----KTHKKKEKEKIKKKKDKKDKDKIKNK--EKDK 144
              K  S      +KK K   +      K + + E++ +KKK   K   + +       K
Sbjct: 101 EKAKKNSPLFKSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILK 160

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
              E +  K+E ++ + +KE+++E   K+ ++  + ++K+  E+K   K    K+   ++
Sbjct: 161 IDFEIDDSKEELEKFEAQKEEEDE---KLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQI 217

Query: 205 KKEKLKKK 212
            K+  K++
Sbjct: 218 GKKIDKEE 225



 Score = 30.2 bits (69), Expect = 3.0
 Identities = 21/101 (20%), Positives = 47/101 (46%), Gaps = 7/101 (6%)

Query: 90  PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
           P ++K       G +K D  I   K+  +K E +K ++ +    +     K+ + EKK++
Sbjct: 145 PKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQ 204

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
            K    ++   +  +K +++E         P+++I +EE+ 
Sbjct: 205 SKNFDPKEGPVQIGKKIDKEEI-------TPMKEINEEERR 238


>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
           finger [Chromatin structure and dynamics].
          Length = 271

 Score = 40.3 bits (94), Expect = 0.001
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 4/38 (10%)

Query: 291 PMIGCDGCDA---WYHWVCVGLVAEPETSDWFCPKCSK 325
            M+ CD  +    W+H  CVGL   P+   W+CP+C K
Sbjct: 233 QMVACDNANCKREWFHLECVGLKEPPKGK-WYCPECKK 269


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 40.5 bits (95), Expect = 0.001
 Identities = 26/77 (33%), Positives = 44/77 (57%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
            NK    ++K K +  KE++ K+ E+E KEEK+ K K  +   +++I KE    EK +  
Sbjct: 269 INKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARAR 328

Query: 198 KDKDKKLKKEKLKKKKK 214
           K + +K + EK K K++
Sbjct: 329 KKEQRKERGEKKKLKRR 345



 Score = 34.3 bits (79), Expect = 0.12
 Identities = 34/124 (27%), Positives = 61/124 (49%), Gaps = 7/124 (5%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKEKEKGKKE 155
            SE L E   D   ++       +  E   +  +K  + K K K +++KEK+ KE  ++ 
Sbjct: 237 MSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREA 296

Query: 156 EKEKKEEKEKKEEKEFKMKEDKE----KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           ++EK  + +KK  +  ++KE  +    K   + RK+E+  E+ +K K K +KL K K  +
Sbjct: 297 KEEK--QLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLGKHKYPE 354

Query: 212 KKKE 215
              E
Sbjct: 355 PPLE 358



 Score = 33.5 bits (77), Expect = 0.23
 Identities = 25/139 (17%), Positives = 56/139 (40%), Gaps = 17/139 (12%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKK---------DKKDKDKIKNKEKDKEKKEKEKGK 153
           E    K++K    +     K     +            +D  ++  +E +KE K ++K +
Sbjct: 155 EQVAKKTVKTAPISLAANGKPVPAVEVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQ 214

Query: 154 KEEKEKKEEKEKKEEKEFKMKE-----DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           + E+ ++++ EK   +  ++ E      +E   +   + + +S  E    + +   K  +
Sbjct: 215 ELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVR 274

Query: 209 LKKKKKE---KERSSHEKE 224
            K+K K    KE+   E E
Sbjct: 275 PKRKTKAQRNKEKRRKELE 293



 Score = 33.1 bits (76), Expect = 0.31
 Identities = 19/102 (18%), Positives = 45/102 (44%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
            K EK++ + ++ ++ K +    E  +  +  E   +E  +  EE+   E      + + 
Sbjct: 207 VKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEY 266

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           E   + +R + K   +  K K + +  ++ K +K+ K+K   
Sbjct: 267 EPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 40.9 bits (96), Expect = 0.001
 Identities = 25/147 (17%), Positives = 71/147 (48%), Gaps = 5/147 (3%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE--KKEKEKGKKEEKE 158
           L E+ K+   K ++   +  E  +  +   +  ++++K  +K +E  ++++E+  ++E E
Sbjct: 176 LKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIE 235

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
             EE+  + E+E +  E+ +  + +I   E ++ K ++ + ++ +     L++ +++ ER
Sbjct: 236 ALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELE---RLLEELEEKIER 292

Query: 219 SSHEKEVIPKLTFKFGTDMEEKTKRES 245
               +  I +L  +         + E 
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEE 319



 Score = 40.9 bits (96), Expect = 0.001
 Identities = 19/112 (16%), Positives = 59/112 (52%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           +++  +  E E+  K++ ++  +K++N  ++ E+ +++   ++ KE+  + E + ++  +
Sbjct: 514 EEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKE 573

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           + E+      +  + E+  E+ K+ K K K+L++   + ++  +     E E
Sbjct: 574 LLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAE 625



 Score = 40.5 bits (95), Expect = 0.002
 Identities = 24/127 (18%), Positives = 69/127 (54%), Gaps = 1/127 (0%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK-EKGKKEEK 157
            +L E++++    +++    + ++  I +     +K  +  +E  +E +++  +  + E 
Sbjct: 424 RELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELEL 483

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           E+ EE+  +E++E +++E+ E+  +++R+ E++  +  + ++  K+  +EKL+K +   E
Sbjct: 484 EELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLE 543

Query: 218 RSSHEKE 224
                KE
Sbjct: 544 ELEELKE 550



 Score = 40.5 bits (95), Expect = 0.002
 Identities = 30/140 (21%), Positives = 72/140 (51%), Gaps = 14/140 (10%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--- 155
           E+L  ++      ++     K  +E+++K ++K +K + + +E  +EK E  K  +E   
Sbjct: 305 EELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLK 364

Query: 156 ---------EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
                    EKE ++  E+ ++ E  ++E KE+  E     E+  E+ ++ + + ++L++
Sbjct: 365 ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELER 424

Query: 207 --EKLKKKKKEKERSSHEKE 224
             E+L+++ K+ E   ++ E
Sbjct: 425 ELEELEEEIKKLEEQINQLE 444



 Score = 39.7 bits (93), Expect = 0.003
 Identities = 27/113 (23%), Positives = 64/113 (56%), Gaps = 2/113 (1%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           + +E  + K +  K  ++++K  E+  E+ EKE  K  E+ K+ E+  +E KE   +   
Sbjct: 344 ELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSA 403

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
              +E+I++E ++ EKE +  +++ +  +E++KK +++  +   ++ +I +L 
Sbjct: 404 A--LEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELA 454



 Score = 39.7 bits (93), Expect = 0.003
 Identities = 25/127 (19%), Positives = 66/127 (51%), Gaps = 7/127 (5%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK-----EK 159
           + ++ I++ ++  ++ E+ + + +  +   ++++   +  +  E+   K EEK      +
Sbjct: 285 ELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESE 344

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
            EE  +++ +  K+ E++ K +E  R EE + E EK  +   K+L++   + K++  E S
Sbjct: 345 LEELAEEKNELAKLLEERLKELE-ERLEELEKELEKALERL-KQLEEAIQELKEELAELS 402

Query: 220 SHEKEVI 226
           +  +E+ 
Sbjct: 403 AALEEIQ 409



 Score = 39.0 bits (91), Expect = 0.005
 Identities = 22/110 (20%), Positives = 59/110 (53%), Gaps = 1/110 (0%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
             I+  +    K  +E++++ +   ++ + K +  ++ ++E E+ ++E +  +   E+ E
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE 318

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           E   K+K  +E+ +EK+ ++ +  E E +   ++K    + L+++ KE E
Sbjct: 319 ELLEKLKSLEER-LEKLEEKLEKLESELEELAEEKNELAKLLEERLKELE 367



 Score = 38.6 bits (90), Expect = 0.007
 Identities = 24/110 (21%), Positives = 59/110 (53%), Gaps = 4/110 (3%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEK-EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           E+L E K + +   +++  + +E+ E+++K+ +K  +   + +E  +E KE+        
Sbjct: 346 EELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAAL 405

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
           E+ +E+ ++ EKE    E+ E+ +E++ +E K  E++    +  + +  E
Sbjct: 406 EEIQEELEELEKE---LEELERELEELEEEIKKLEEQINQLESKELMIAE 452



 Score = 37.4 bits (87), Expect = 0.015
 Identities = 25/119 (21%), Positives = 66/119 (55%), Gaps = 2/119 (1%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK--DKIKNKEKDKEKKEKEKGKKEE 156
           ++  E K +K     ++  + KEK ++++ K++  +  D+++  ++  E+    + +KEE
Sbjct: 528 KEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEE 587

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
            E+  E+ K+ +K+ K  E++   +E++ +  + SE E + ++ +++L+ E  K   + 
Sbjct: 588 LEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQA 646



 Score = 37.0 bits (86), Expect = 0.022
 Identities = 27/162 (16%), Positives = 74/162 (45%), Gaps = 26/162 (16%)

Query: 111 KKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEKDKEKK------------EKEKGKKEEK 157
           +K+     + E E+++++   +K++ +++ + ++ EK+            E E+  KEE 
Sbjct: 472 EKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEEL 531

Query: 158 EKKEEKEKKEEKEFKMKEDK-------------EKPVEKIRKEEKDSEKEKKSKDKDKKL 204
           E+K EK +   +E +  ++K             E  ++++++  ++    +  K++ ++L
Sbjct: 532 EEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEEL 591

Query: 205 KKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
           ++   + KKK KE      ++   L     ++ E + +    
Sbjct: 592 RERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEE 633



 Score = 37.0 bits (86), Expect = 0.023
 Identities = 25/142 (17%), Positives = 71/142 (50%), Gaps = 6/142 (4%)

Query: 91  NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
            + ++   ++  E++++    +++    ++EKE++++ K +  + +    E  K ++E+ 
Sbjct: 217 KLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEEL 276

Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE------KEKKSKDKDKKL 204
           +  +   E+ EEK ++ E+  +  E+ E+ +E +R   ++ E      K  + + +  + 
Sbjct: 277 RELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEE 336

Query: 205 KKEKLKKKKKEKERSSHEKEVI 226
           K EKL+ + +E     +E   +
Sbjct: 337 KLEKLESELEELAEEKNELAKL 358



 Score = 36.3 bits (84), Expect = 0.037
 Identities = 17/107 (15%), Positives = 58/107 (54%), Gaps = 6/107 (5%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF--K 172
           +   ++ ++++ +  + +    ++  E++  ++++E   +EE E+ E++ ++ E+E    
Sbjct: 465 QELPEEHEKELLELYELE----LEELEEELSREKEEAELREEIEELEKELRELEEELIEL 520

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           ++ ++    E   K EK     ++ ++  +KL+ ++LK++ ++ E  
Sbjct: 521 LELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDR 567



 Score = 35.5 bits (82), Expect = 0.065
 Identities = 25/132 (18%), Positives = 69/132 (52%), Gaps = 1/132 (0%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E L    + + + +      +++ E+++ +  ++ + +I+N+E+ +EK E+ +  +EE E
Sbjct: 640 EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQ-RIENEEQLEEKLEELEQLEEELE 698

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           +  E+ ++  K+    E   + +E  + E ++ +KE +  +K  +L +E  +K  K   R
Sbjct: 699 QLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEELREKLGKAGLR 758

Query: 219 SSHEKEVIPKLT 230
           +   + ++ ++ 
Sbjct: 759 ADILRNLLAQIE 770



 Score = 35.1 bits (81), Expect = 0.074
 Identities = 21/121 (17%), Positives = 57/121 (47%), Gaps = 12/121 (9%)

Query: 118 KKKEKEKI---------KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
           K KE+++I          +K  +  K+ IK  +   E+ E +  +  E  +   +  +EE
Sbjct: 152 KPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEE 211

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
            +   +  K + +++ ++EE+  ++ +  +++  +L++EK + ++ +      E   +  
Sbjct: 212 LK---ELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEA 268

Query: 229 L 229
           L
Sbjct: 269 L 269



 Score = 35.1 bits (81), Expect = 0.093
 Identities = 23/114 (20%), Positives = 60/114 (52%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           +  + +++ +K  ++ E+E  + +++ K  ++  N+ + KE    E     EK     +E
Sbjct: 407 EIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQE 466

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
             EE E ++ E  E  +E++ +E    ++E + +++ ++L+KE  + +++  E 
Sbjct: 467 LPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIEL 520



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 26/127 (20%), Positives = 72/127 (56%), Gaps = 6/127 (4%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           + +++   + EK  ++ + ++  +  ++  E+  E+ E E  ++ ++ + EE+ +++ +E
Sbjct: 630 EAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEE 689

Query: 171 FKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLK--KEKLKKKKKEKERSSHEKEVIP 227
            +  E++   +E++R+E E+  +K  + +   ++L+  K +L++ KKE E+     E++ 
Sbjct: 690 LEQLEEE---LEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLE 746

Query: 228 KLTFKFG 234
           +L  K G
Sbjct: 747 ELREKLG 753



 Score = 34.4 bits (79), Expect = 0.13
 Identities = 25/130 (19%), Positives = 61/130 (46%), Gaps = 10/130 (7%)

Query: 104 IKKDKSIKK---DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE---- 156
           +K     +K   D+    +K ++  +  K+   + K K +E + +  E  +  ++     
Sbjct: 149 LKSKPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEAL 208

Query: 157 -KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK--LKKEKLKKKK 213
            +E KE K+ +E +E + +E+ E+ +E + +   + E+EK+  ++ K   L+ E L+ + 
Sbjct: 209 EEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEA 268

Query: 214 KEKERSSHEK 223
            +       +
Sbjct: 269 LKIREEELRE 278



 Score = 34.0 bits (78), Expect = 0.17
 Identities = 31/174 (17%), Positives = 75/174 (43%), Gaps = 24/174 (13%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-KE 161
                   ++ ++  ++ ++ K K K+ ++   +++   +  E  E E   +E +E+ + 
Sbjct: 578 LRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELES 637

Query: 162 EKEK------KEEKEFKMKEDKEKPVEK----IRKEEKDSEKEKKSKDKDKKLK------ 205
           E EK       EE      E+ E+ VE+    IR+E +  E E++ ++K ++L+      
Sbjct: 638 ELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEEL 697

Query: 206 ---KEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
              +E+L++  K+        E +         ++EE  K     +  ++ ++ 
Sbjct: 698 EQLREELEELLKKLGEIEQLIEELESRK----AELEELKKELEKLEKALELLEE 747



 Score = 32.4 bits (74), Expect = 0.49
 Identities = 26/98 (26%), Positives = 58/98 (59%), Gaps = 4/98 (4%)

Query: 132 KDKDKIKNKEKDKEKKEKE-KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK-EEK 189
            +K+ ++  E + E+ E+E   +KEE E +EE E+ E++  +++E+  + +E     +E+
Sbjct: 471 HEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEE 530

Query: 190 DSEKEKKSKDKDKKLK--KEKLKKKKKEKERSSHEKEV 225
             EK +K ++  ++L+  KEKL+ ++ ++E    E  +
Sbjct: 531 LEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRL 568



 Score = 32.4 bits (74), Expect = 0.63
 Identities = 18/125 (14%), Positives = 55/125 (44%), Gaps = 13/125 (10%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
             +  K   ++K    +   E++++ K+K    ++K + +  E + +E  +  E+ +   
Sbjct: 523 LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLR 582

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKD----SEKEKKSKDKDKKLKKEKLKKKKKEKER 218
             K+E +E +         E++++ +K      E+  + ++  + L+  + + + +E E 
Sbjct: 583 TRKEELEELR---------ERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEE 633

Query: 219 SSHEK 223
               +
Sbjct: 634 ELESE 638



 Score = 32.0 bits (73), Expect = 0.77
 Identities = 20/80 (25%), Positives = 38/80 (47%)

Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
           K K K +KE  ++    EK E+    +KE  ++   KI + E    +  +  +   +  +
Sbjct: 150 KSKPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALE 209

Query: 207 EKLKKKKKEKERSSHEKEVI 226
           E+LK+ KK +E    ++E  
Sbjct: 210 EELKELKKLEEIQEEQEEEE 229


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 40.6 bits (95), Expect = 0.001
 Identities = 25/126 (19%), Positives = 58/126 (46%), Gaps = 5/126 (3%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           I + K  K       K+ +  ++ +  K +  ++ N EKDKE +  EK   E  ++ +++
Sbjct: 74  INQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQ 133

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK-----EKLKKKKKEKER 218
            +   +  + K +  K  E+++ E +   +E    +++K  ++       L+ K+ E++R
Sbjct: 134 LQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQR 193

Query: 219 SSHEKE 224
            S    
Sbjct: 194 ESKWAI 199



 Score = 39.4 bits (92), Expect = 0.003
 Identities = 27/135 (20%), Positives = 67/135 (49%), Gaps = 5/135 (3%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           +K+ ++K+ + +  KKK+    ++    +K ++K    +  K+ +   +++  K E    
Sbjct: 48  EKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALEL 107

Query: 173 MKEDKEKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
           +  +K+K +E + KE  E   E +K+ ++  + ++K++   K +E+ +  +EK++   L 
Sbjct: 108 LNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLE 167

Query: 231 F---KFGTDMEEKTK 242
               KF   + E   
Sbjct: 168 LEREKFEEQLHEANL 182



 Score = 34.8 bits (80), Expect = 0.075
 Identities = 24/109 (22%), Positives = 50/109 (45%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
            K +     + +  K+ E  + +  +  K+  K      +  +K++E  K EE+ K E +
Sbjct: 100 AKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENE 159

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           +K EE     +E  E+ + +   + +  E E++ + K   LKK K + +
Sbjct: 160 KKLEESLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLKRRAE 208



 Score = 29.0 bits (65), Expect = 5.4
 Identities = 27/115 (23%), Positives = 57/115 (49%), Gaps = 1/115 (0%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEE 156
           S++L +  ++ +   +KK    K +E++K + +KK ++ ++  +EK +E+  +     E 
Sbjct: 127 SKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEF 186

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           KE +E++E K     K+K   E   ++++ E  +   E   + K      EK  K
Sbjct: 187 KENEEQRESKWAILKKLKRRAELGSQQVQGEALELPNESFIRSKFPSDINEKADK 241


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 40.7 bits (95), Expect = 0.001
 Identities = 31/145 (21%), Positives = 64/145 (44%), Gaps = 12/145 (8%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           I  +   ++    + DK  +   +   KK+K+K+ K K K D DK   +  D +  E + 
Sbjct: 215 IKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDS 274

Query: 152 GKKEEKEKKE---------EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD- 201
              +++ ++E           + +E ++    E   KP  +  ++ ++SE+EK  ++   
Sbjct: 275 DDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGL 334

Query: 202 --KKLKKEKLKKKKKEKERSSHEKE 224
             K  K +KLK KK   ++   +  
Sbjct: 335 SKKGKKLKKLKGKKNGLDKDDSDSG 359



 Score = 38.8 bits (90), Expect = 0.005
 Identities = 32/136 (23%), Positives = 55/136 (40%), Gaps = 7/136 (5%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-------K 149
             ED  E K  K  KK  K  KK + +K  K+    D D+  + + D E +E        
Sbjct: 232 GGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSS 291

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
             G   E+ + +   +   K    +++  +  E+ + EE+    +K  K K  K KK  L
Sbjct: 292 ASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351

Query: 210 KKKKKEKERSSHEKEV 225
            K   +    S + ++
Sbjct: 352 DKDDSDSGDDSDDSDI 367



 Score = 36.1 bits (83), Expect = 0.036
 Identities = 36/150 (24%), Positives = 59/150 (39%), Gaps = 25/150 (16%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE- 156
            +DL     +   + D     K  ++  ++K  KK K   KNK+K  + K+ ++G  ++ 
Sbjct: 210 GKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDA 269

Query: 157 ------------------------KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
                                       EE+E K   E   K + E+  +    EE+ +E
Sbjct: 270 DEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNE 329

Query: 193 KEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           +E     K KKLKK K KK   +K+ S   
Sbjct: 330 EEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359



 Score = 35.3 bits (81), Expect = 0.057
 Identities = 24/108 (22%), Positives = 45/108 (41%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           D        +E+  K   +        +++D E+ E+EK ++E    K+ K+ K+ K  K
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
              DK+        ++ D + E        K +KE  K++  +   SS
Sbjct: 349 NGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSS 396



 Score = 31.9 bits (72), Expect = 0.87
 Identities = 27/127 (21%), Positives = 52/127 (40%), Gaps = 13/127 (10%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           ++ +       + K  K KD +  D+    E DK  ++ ++ K ++K+KK  K KK+  +
Sbjct: 198 EETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDD 257

Query: 171 FKMK------EDKEKPVEKIRKEEK-------DSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            K        +  E   +    E +        S      ++++ KL  E   K + E++
Sbjct: 258 DKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQD 317

Query: 218 RSSHEKE 224
             S E E
Sbjct: 318 EDSEESE 324



 Score = 29.2 bits (65), Expect = 5.6
 Identities = 19/83 (22%), Positives = 36/83 (43%)

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
           K+      ++ ++ E EK      K+ K   ++ D E   ++  K  +D ++EK  K K 
Sbjct: 187 KNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKK 246

Query: 202 KKLKKEKLKKKKKEKERSSHEKE 224
           K  K +K     K+ +R   +  
Sbjct: 247 KLAKNKKKLDDDKKGKRGGDDDA 269


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 40.9 bits (96), Expect = 0.002
 Identities = 21/115 (18%), Positives = 49/115 (42%), Gaps = 4/115 (3%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           +    +S+K+  +   K+  E +++KK   ++  + I      K    ++  +     KK
Sbjct: 828 KNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKK 887

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           EE+ K+  +E  + E++              +  +   +  +KL KEK  ++K +
Sbjct: 888 EEEFKQFAEEEGLNEEELA--FYDDLALNGGKLPENGTELVEKLAKEKSLREKNK 940



 Score = 35.8 bits (83), Expect = 0.049
 Identities = 25/145 (17%), Positives = 50/145 (34%), Gaps = 6/145 (4%)

Query: 65  KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 124
            E    +E    + ++++      +   +     +    ++    +        K   + 
Sbjct: 453 VELELIEESIKEEAEELDE-RIEEITEDILEKIKKKTKNLEFLAMLAVRLIRAAKDIYDH 511

Query: 125 IKKKKDKKDKDKI-----KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
            KK++    K  +     K   +  E +   +     KE  E   K    EF+   DK++
Sbjct: 512 FKKEEVFDLKAMVVASSRKVAVELYEAEIAARLDWHSKESLEGAIKDYNTEFETDFDKKQ 571

Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKL 204
              K++ E+KD  K  K KD    L
Sbjct: 572 SHAKLKDEKKDLIKRFKLKDDPLDL 596



 Score = 34.7 bits (80), Expect = 0.13
 Identities = 30/200 (15%), Positives = 63/200 (31%), Gaps = 24/200 (12%)

Query: 55  KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 114
           K   ++   SK          ++    +      + +I          EI  D  I +  
Sbjct: 764 KARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDY-ILELL 822

Query: 115 KTHKKKEK---------EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           +T   K           E+I K+  +  + K K  E+  E   + + KK +     EK +
Sbjct: 823 QTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLD---TAEKLE 879

Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
           +              + K  +E K   +E+   +++     +      K  E  +   E 
Sbjct: 880 ELYI-----------LAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEK 928

Query: 226 IPKLTFKFGTDMEEKTKRES 245
           + K       + ++   +E 
Sbjct: 929 LAKEKSLREKNKDDWKAKEE 948



 Score = 30.8 bits (70), Expect = 1.8
 Identities = 19/113 (16%), Positives = 39/113 (34%), Gaps = 12/113 (10%)

Query: 90  PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
             I  +   + L E   +   +   K     EK +      KK+++  +  E++   +E+
Sbjct: 845 EWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEE 904

Query: 150 EKGKK-------EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
                       +  E   E  +K  KE  ++E       K   + K+  + K
Sbjct: 905 LAFYDDLALNGGKLPENGTELVEKLAKEKSLREKN-----KDDWKAKEEVEAK 952



 Score = 29.3 bits (66), Expect = 5.1
 Identities = 28/158 (17%), Positives = 57/158 (36%), Gaps = 31/158 (19%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK----------------NKE 141
           SED+G ++    + +D     + + E+  K   K     +K                 + 
Sbjct: 697 SEDIGYLEDAGELLRDFAPELEIKVERGNKILAKLFGRFLKAFNALKKLYEFAYTLEEEI 756

Query: 142 KDKEKKEKEKGKKEEKEK--KEEKEKKEEKEFKMKEDKE-------KPVEKIRKEEKDSE 192
           +  E   K + +K+ + K         + +E K KE K          +E I K+E + +
Sbjct: 757 QYLEDLLKARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINID 816

Query: 193 ------KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
                 +    K+   +  KE +++  KE      +K+
Sbjct: 817 YILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKK 854


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 40.0 bits (94), Expect = 0.002
 Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 3/91 (3%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           E  + +K    K+  +   K+  +    + ++E K+K  EK +++EK  +  E  +  +E
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELE--QNLIE 342

Query: 183 KIR-KEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           + +    K  E +KK +D +K+L+K K  K 
Sbjct: 343 ERKELNSKLEEIQKKLEDLEKRLEKLKSNKS 373



 Score = 39.6 bits (93), Expect = 0.003
 Identities = 20/84 (23%), Positives = 41/84 (48%), Gaps = 7/84 (8%)

Query: 122 KEKIKKKKDKKD-KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF------KMK 174
            EK+   K+  +   K  ++ +  E++ + K K  EK +++EK  +E ++       ++ 
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSK 198
              E+  +K+   EK  EK K +K
Sbjct: 349 SKLEEIQKKLEDLEKRLEKLKSNK 372



 Score = 32.7 bits (75), Expect = 0.38
 Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
              K  G+   KE  E +  +EE+E K K  ++   ++ R EE +    ++ K+ + KL 
Sbjct: 293 SGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKL- 351

Query: 206 KEKLKKKKKEKERSSHEK 223
           +E  KK +  ++R    K
Sbjct: 352 EEIQKKLEDLEKRLEKLK 369



 Score = 30.0 bits (68), Expect = 2.4
 Identities = 17/70 (24%), Positives = 38/70 (54%), Gaps = 3/70 (4%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK--DKIKNKEKDKEKK-EKEKGKKEEKEK 159
           +   +  + ++++  KKK  EKI++K+ + ++    +  + K+   K E+ + K E+ EK
Sbjct: 304 KEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEK 363

Query: 160 KEEKEKKEEK 169
           + EK K  + 
Sbjct: 364 RLEKLKSNKS 373


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 40.2 bits (94), Expect = 0.002
 Identities = 29/133 (21%), Positives = 64/133 (48%), Gaps = 2/133 (1%)

Query: 86  TPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 145
             ++   VKV S++   +  K+ + +K +     K      K+   K  + + +  K K 
Sbjct: 169 KSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASL--KRNPPKKSNIMSSFFKKKT 226

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
           K++KEK +  E   KEE E++  K   + ED+      + ++E + E +   +  D + +
Sbjct: 227 KEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEE 286

Query: 206 KEKLKKKKKEKER 218
            E+ +K+K+++ +
Sbjct: 287 TEEKEKEKRKRLK 299



 Score = 38.3 bits (89), Expect = 0.007
 Identities = 40/198 (20%), Positives = 84/198 (42%), Gaps = 12/198 (6%)

Query: 31  PQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEK---EYSKVKDVELGVTP 87
           P   +  P     S  K + + K+ T+ K +  +  K +  ++   + S +         
Sbjct: 167 PPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKT 226

Query: 88  MVPNIVKVTSSEDLGEIKKDKSIKKDK---------KTHKKKEKEKIKKKKDKKDKDKIK 138
                 K  S   + E  +++S K+D              + E E   K   ++   + +
Sbjct: 227 KEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEE 286

Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
            +EK+KEK+++ K   E++++ EE E   E   + +E +E     + K+E++ E+   S 
Sbjct: 287 TEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSP 346

Query: 199 DKDKKLKKEKLKKKKKEK 216
           D  ++  + ++ KKK  K
Sbjct: 347 DGGRRRGRRRVMKKKTFK 364


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 40.7 bits (95), Expect = 0.002
 Identities = 27/103 (26%), Positives = 52/103 (50%)

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
           E   +++   +EE E+K++KE K KE + K ++  +KE K   + +++ D     KK + 
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64

Query: 210 KKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
           K +K++ E  + E  + P   F     +  +  ++ SP  V K
Sbjct: 65  KSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYSPSAVEK 107



 Score = 36.4 bits (84), Expect = 0.037
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           G + E E +++   +EE E K K++ EK  EK  K+ K ++KE K+K + ++        
Sbjct: 1   GSRTESEAEKKILTEEELERKKKKE-EKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVP 59

Query: 212 KKKEKERSSHEKEV 225
           KK EK+    + E 
Sbjct: 60  KKSEKKSRKRDVED 73



 Score = 35.6 bits (82), Expect = 0.052
 Identities = 21/97 (21%), Positives = 47/97 (48%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           E   +KK   +++  + K+K+++ KEKE  K +  +K+ + + + ++        +K  +
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64

Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           K RK + + E  +   D D    ++K    +  K+ S
Sbjct: 65  KSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYS 101



 Score = 35.3 bits (81), Expect = 0.078
 Identities = 21/77 (27%), Positives = 42/77 (54%), Gaps = 6/77 (7%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE---EKEKKEEK 163
           +K I  +++  +KK+KE+  K+K+ K   K+K  +K+ + K + +   +     +K E+K
Sbjct: 9   EKKILTEEELERKKKKEEKAKEKELK---KLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65

Query: 164 EKKEEKEFKMKEDKEKP 180
            +K + E +  ED   P
Sbjct: 66  SRKRDVEDENPEDFIDP 82



 Score = 31.0 bits (70), Expect = 1.4
 Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 2/76 (2%)

Query: 110  IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK--E 167
            ++     + + E EK++ K D+  K + K ++K      KEK     +E+   K  K  +
Sbjct: 987  LQAQGAVNAEAELEKLRNKMDEIQKQQEKLEKKMNASGYKEKVPANIQEEDARKLTKLLQ 1046

Query: 168  EKEFKMKEDKEKPVEK 183
            E EF  KE K    E 
Sbjct: 1047 ELEFFEKESKRLEAET 1062



 Score = 31.0 bits (70), Expect = 1.5
 Identities = 22/87 (25%), Positives = 40/87 (45%), Gaps = 7/87 (8%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
                + L E + ++  KK++K    KEKE +KK K  + + K K + +          K
Sbjct: 6   SEAEKKILTEEELERKKKKEEKA---KEKE-LKKLKAAQKEAKAKLQAQQASDGTNVPKK 61

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKP 180
            E+K +K + E +  ++F    D + P
Sbjct: 62  SEKKSRKRDVEDENPEDF---IDPDTP 85



 Score = 30.3 bits (68), Expect = 2.5
 Identities = 19/69 (27%), Positives = 40/69 (57%), Gaps = 3/69 (4%)

Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK---FGTDMEEKT 241
           R E +  +K    ++ ++K KKE+  K+K+ K+  + +KE   KL  +    GT++ +K+
Sbjct: 3   RTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKS 62

Query: 242 KRESSPKIV 250
           +++S  + V
Sbjct: 63  EKKSRKRDV 71


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 40.3 bits (95), Expect = 0.002
 Identities = 30/112 (26%), Positives = 55/112 (49%), Gaps = 19/112 (16%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           KDK T  K++K  I       D       E ++  K+ E+   E+K++KE  E K E E 
Sbjct: 483 KDKGT-GKEQKITITASSGLSDD------EIERMVKDAEEYAAEDKKRKERIEAKNEAEE 535

Query: 172 -------KMKEDKEKP----VEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKK 211
                   +KE+ +K      +K+ +  +  ++E + +DK++ + K E+L+K
Sbjct: 536 YVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAKTEELQK 587


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 40.0 bits (94), Expect = 0.002
 Identities = 40/223 (17%), Positives = 93/223 (41%), Gaps = 30/223 (13%)

Query: 37  PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 96
                     +  +S +   +  +  S   + +  +     V+ +E     ++  ++K T
Sbjct: 77  DEPKQVKIPRQSGLSEEVAEAEAKATSVAAEATTPKSIQELVEALE----ELLEELLKET 132

Query: 97  SSE-----------DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 145
           +S+           DL +  K+ ++K D ++     KE++ +   K  +      E+++E
Sbjct: 133 ASDPVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAE---LKAEEEEE 189

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
            +   K K+EE   K E+E     E K    +++      + E + EKE+  K  ++KL+
Sbjct: 190 LERALKEKREELLSKLEEELLARLESKEAALEKQ-----LRLEFEREKEELRKKYEEKLR 244

Query: 206 KEKLKKKKKEKERSSHEKEVIP----KLTFKFGTDMEEKTKRE 244
           +E    +++ +      K  +     +L  +F  +++EK + E
Sbjct: 245 QE---LERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEE 284


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 27/165 (16%), Positives = 64/165 (38%), Gaps = 4/165 (2%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           +    + K ++   + E    + ++        K    +KE +E ++ + + KE+ +  E
Sbjct: 795 EIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYL-EKEIQELQEQRIDLKEQIKSIE 853

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           K+ E     KE+ E+ +E++    +D E       K++   + +L++ +++ E    + E
Sbjct: 854 KEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIE 913

Query: 225 VIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 269
              K   +    +E   +  S    +  P       P       D
Sbjct: 914 KKRKRLSELKAKLEALEEELSE---IEDPKGEDEEIPEEELSLED 955



 Score = 38.5 bits (90), Expect = 0.007
 Identities = 28/128 (21%), Positives = 66/128 (51%), Gaps = 7/128 (5%)

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
           +L E+ K+ +  +D+    +++ EK+K++ ++        +E D+ ++E ++  +E  + 
Sbjct: 372 ELEEVDKEFAETRDELKDYREKLEKLKREINE------LKRELDRLQEELQRLSEELADL 425

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLKKEKLKKKKKEKER 218
                  E K  +++E+KE    +I+K+E   E+      K +++L   K +  + EKE 
Sbjct: 426 NAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKEL 485

Query: 219 SSHEKEVI 226
           S  ++E+ 
Sbjct: 486 SKLQRELA 493



 Score = 36.6 bits (85), Expect = 0.027
 Identities = 26/141 (18%), Positives = 67/141 (47%), Gaps = 19/141 (13%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD------------------KI 137
              E L E+++D S  + +  + K E ++++ + ++ ++D                  +I
Sbjct: 734 KLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRI 793

Query: 138 KNKEKDKEKKEKEKGKKEEK-EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
              + +  K E+E  + E +  + E+K  +   E +  E + + +++ R + K+  K  +
Sbjct: 794 PEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIE 853

Query: 197 SKDKDKKLKKEKLKKKKKEKE 217
            + ++   KKE+L+++ +E E
Sbjct: 854 KEIENLNGKKEELEEELEELE 874



 Score = 36.6 bits (85), Expect = 0.028
 Identities = 30/137 (21%), Positives = 63/137 (45%), Gaps = 14/137 (10%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
           +S  +GEI+K+    + ++   +K KE++++ ++     + + +    E KE E      
Sbjct: 714 ASRKIGEIEKEIEQLEQEE---EKLKERLEELEEDLSSLEQEIENVKSELKELEA----R 766

Query: 157 KEKKEEKEKKEEKEFKMKEDK--EKPVEKIRKEEKDSEKEKKSKDK-----DKKLKKEKL 209
            E+ EE   K E+     E +     + +I+ E    E+E    +      ++KL +  L
Sbjct: 767 IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTL 826

Query: 210 KKKKKEKERSSHEKEVI 226
           +K+  EKE    +++ I
Sbjct: 827 EKEYLEKEIQELQEQRI 843



 Score = 35.8 bits (83), Expect = 0.051
 Identities = 26/144 (18%), Positives = 64/144 (44%), Gaps = 18/144 (12%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKK--------------KEKEKIKKKKDKKDKDKIKNKEKD 143
            E L  +K++ S  + +    +              ++  +I+K+ ++ ++++ K KE+ 
Sbjct: 680 RERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERL 739

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-KEKKSKDKDK 202
           +E +E     ++E E  + + K+ E      E+ E+ + K+ +   D E +   S+  + 
Sbjct: 740 EELEEDLSSLEQEIENVKSELKELEAR---IEELEEDLHKLEEALNDLEARLSHSRIPEI 796

Query: 203 KLKKEKLKKKKKEKERSSHEKEVI 226
           + +  KL+++    E    E E  
Sbjct: 797 QAELSKLEEEVSRIEARLREIEQK 820



 Score = 35.8 bits (83), Expect = 0.058
 Identities = 24/137 (17%), Positives = 56/137 (40%), Gaps = 14/137 (10%)

Query: 97  SSEDLGEIKKDKS-----IKKDKKTHKKKE---KEKIKKKKDKKDKDKIKNKEKDKEKKE 148
              ++ E +K +        + K+  +      +E  K+  + +D+ K   ++ +K K+E
Sbjct: 341 LEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
             + K+E    +EE ++  E+   +       +  I  +  + E+EK+ K  + K ++ K
Sbjct: 401 INELKRELDRLQEELQRLSEELADLNAA----IAGIEAKINELEEEKEDKALEIKKQEWK 456

Query: 209 LK--KKKKEKERSSHEK 223
           L+       K       
Sbjct: 457 LEQLAADLSKYEQELYD 473



 Score = 35.0 bits (81), Expect = 0.093
 Identities = 22/96 (22%), Positives = 47/96 (48%), Gaps = 5/96 (5%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEE-----KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           +  I  KE++ E  E+   K E        + EE E++ E+E K ++   +   ++++E 
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366

Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +D   E +  DK+    +++LK  +++ E+   E  
Sbjct: 367 EDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 29/115 (25%), Positives = 50/115 (43%), Gaps = 13/115 (11%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE-KEKKEEKEKKEEKEF 171
           D+K  K  E+ +  ++  ++    I  K +  E+  +E+ K E  +   +EK + E  E 
Sbjct: 169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYEL 228

Query: 172 -KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
            K KE  E+  E I ++    E+E            EKL ++  E E+   E E 
Sbjct: 229 LKEKEALERQKEAIERQLASLEEE-----------LEKLTEEISELEKRLEEIEQ 272



 Score = 33.9 bits (78), Expect = 0.22
 Identities = 30/127 (23%), Positives = 68/127 (53%), Gaps = 1/127 (0%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           I  +   E L   +K   ++ +  + ++   EK ++ +D +++      E DK   E E+
Sbjct: 281 IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEE 340

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
            ++E +E+++ ++K  E+  ++KE+ E    ++ + +K+   E + + KD + K EKLK+
Sbjct: 341 LEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEF-AETRDELKDYREKLEKLKR 399

Query: 212 KKKEKER 218
           +  E +R
Sbjct: 400 EINELKR 406



 Score = 32.7 bits (75), Expect = 0.48
 Identities = 25/114 (21%), Positives = 62/114 (54%), Gaps = 5/114 (4%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           K + +++ ++  +K E+ +   K+ ++ +     KEK+  +++KE  +++    +EE EK
Sbjct: 196 KRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEK 255

Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKER 218
             E+      + EK +E+I +  ++  K+ K   ++++L+ KEK+ + + E   
Sbjct: 256 LTEE----ISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIAS 305



 Score = 32.7 bits (75), Expect = 0.51
 Identities = 29/133 (21%), Positives = 69/133 (51%), Gaps = 10/133 (7%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-DKIKNKEKDKEKKEKEKGKKEEK 157
           E+L E++++          K+++ E+++++++K ++   +  ++++ E  E  K K+  +
Sbjct: 177 EELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALE 236

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            +KE  E++           E+ +EK+   E+ SE EK+ ++ ++ L++   K K   +E
Sbjct: 237 RQKEAIERQ-------LASLEEELEKL--TEEISELEKRLEEIEQLLEELNKKIKDLGEE 287

Query: 218 RSSHEKEVIPKLT 230
                KE I +L 
Sbjct: 288 EQLRVKEKIGELE 300



 Score = 31.6 bits (72), Expect = 1.0
 Identities = 24/105 (22%), Positives = 49/105 (46%), Gaps = 2/105 (1%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
               SE+L ++    +  + K    ++EKE  K  + KK + K++    D  K E+E   
Sbjct: 415 LQRLSEELADLNAAIAGIEAKINELEEEKED-KALEIKKQEWKLEQLAADLSKYEQELYD 473

Query: 154 KEEKEKKEEKEKKE-EKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
            +E+  + EKE  + ++E    E + +  E+  +  +  E+  K+
Sbjct: 474 LKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKA 518



 Score = 30.4 bits (69), Expect = 2.3
 Identities = 34/177 (19%), Positives = 75/177 (42%), Gaps = 23/177 (12%)

Query: 60  EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS--EDLGEIKKDKSIKKDKKTH 117
           E   S  +    + E SK+++        V  I        + L  +  +K   + +   
Sbjct: 785 EARLSHSRIPEIQAELSKLEEE-------VSRIEARLREIEQKLNRLTLEKEYLEKEIQE 837

Query: 118 KKKE----KEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKEEKE 170
            +++    KE+IK  + + +    K +E +   +E +     +  E    + K++++E E
Sbjct: 838 LQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAL--RDLESRLGDLKKERDELE 895

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
            +++E + K      +E +   ++K+ +  + K K E L+++  E E    E E IP
Sbjct: 896 AQLRELERK-----IEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIP 947



 Score = 30.0 bits (68), Expect = 3.0
 Identities = 28/93 (30%), Positives = 54/93 (58%), Gaps = 5/93 (5%)

Query: 126 KKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
           KK KD  ++++++ KEK  E + E    ++   EK+ E E  EE+  K++ + +K + +I
Sbjct: 279 KKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEI 338

Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            + E++ E+E+K +DK      E+  + K+E E
Sbjct: 339 EELEREIEEERKRRDK----LTEEYAELKEELE 367



 Score = 29.3 bits (66), Expect = 6.0
 Identities = 25/97 (25%), Positives = 49/97 (50%), Gaps = 16/97 (16%)

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
            KKEK   + EE E+  E+      E      K + +E++R+E + +E+ +    + ++ 
Sbjct: 170 RKKEKALEELEEVEENIERLDLIIDE------KRQQLERLRREREKAERYQALLKEKREY 223

Query: 205 K-KEKLKKKKK--------EKERSSHEKEVIPKLTFK 232
           +  E LK+K+         E++ +S E+E + KLT +
Sbjct: 224 EGYELLKEKEALERQKEAIERQLASLEEE-LEKLTEE 259


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 39.2 bits (91), Expect = 0.003
 Identities = 33/111 (29%), Positives = 47/111 (42%), Gaps = 1/111 (0%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKKEEKEFK 172
           KKT KKK  E+ K    K    K   K K   K    K  K+  K KK +  K   K+  
Sbjct: 5   KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVT 64

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           +K +K + V+K    +K  +KE  S +  +   K      K  K+  + EK
Sbjct: 65  VKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115



 Score = 34.2 bits (78), Expect = 0.11
 Identities = 26/101 (25%), Positives = 38/101 (37%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
           K  ++E+     K  +  K+    KK  K    K   K  K K     K   KK   K +
Sbjct: 9   KKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFE 68

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
           K E  KKE   KK  K+  +  +  +   K+ K      K+
Sbjct: 69  KTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKK 109



 Score = 30.3 bits (68), Expect = 1.9
 Identities = 19/103 (18%), Positives = 38/103 (36%), Gaps = 3/103 (2%)

Query: 40  TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 99
                K  ++  P  K +   K ++K K + K       K  +        ++   T   
Sbjct: 6   KTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAK--KAAKVKKTKSVKTTTKKV 63

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK 142
            + + +K +S+KK+    K  +KE +  +  +      KN  K
Sbjct: 64  TV-KFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSK 105



 Score = 29.1 bits (65), Expect = 4.4
 Identities = 21/85 (24%), Positives = 35/85 (41%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           K  K+ K       KK  +  K K  K    K+  K +  E  +KE   K+  +K+    
Sbjct: 31  KTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSA 90

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEK 189
           +  E   K+ ++  K  +K+   EK
Sbjct: 91  EVFEASNKLFKNTSKLPKKLFASEK 115


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 39.5 bits (92), Expect = 0.003
 Identities = 27/98 (27%), Positives = 52/98 (53%), Gaps = 3/98 (3%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           D+K      KEK  + K++   DK K+  E++++ ++ E+ KK E E+     + E +E 
Sbjct: 114 DRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKAELERVAALSQAEAREI 173

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
            + E + K   +I    +++E+E   KD+  K+ K+ L
Sbjct: 174 ILAETENKLTHEIATRIREAERE--VKDRSDKMAKDLL 209



 Score = 33.3 bits (76), Expect = 0.26
 Identities = 30/108 (27%), Positives = 59/108 (54%), Gaps = 3/108 (2%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           K E++    KK  K + K   KE   E KE+ +  +EE E++ + E++E K+ + +   E
Sbjct: 50  KAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRL-TE 108

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKL--KKEKLKKKKKEKERSSHEKE 224
           +     RK+E  S KEK  + K++ L  K + + +++++ E+   +K+
Sbjct: 109 RATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKK 156


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 39.1 bits (92), Expect = 0.003
 Identities = 19/100 (19%), Positives = 65/100 (65%), Gaps = 1/100 (1%)

Query: 120 KEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           +E+EK ++++ + + +++ +  E  +  +E+++ + +EK +K++K ++E  EF  +  + 
Sbjct: 81  EEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIER 140

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           K  EK R+ E++ +  +  ++K ++ ++ + +++++++E+
Sbjct: 141 KEEEKEREREEELKILEYQREKAEREEEREAERRERKEEK 180



 Score = 36.8 bits (86), Expect = 0.020
 Identities = 35/155 (22%), Positives = 82/155 (52%), Gaps = 27/155 (17%)

Query: 99  EDLGEIKKDKSIKKDKKTHK-----KKEKEKIKKKKDKKDK----------DKIKNKEKD 143
           E+  E+++     +  K +K      +EK++IK ++ ++++           K   +E++
Sbjct: 1   ENSEELRELNEKLRAAKVNKERDAQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEE 60

Query: 144 KEKKEKEKGKK---------EEKEK---KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
           +E+K KE+ ++         EE+EK   +E +E+ +E+E   +  +    E   + ++  
Sbjct: 61  RERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKR 120

Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
           EK+KK +++  +  +E++++K++EKER   E+  I
Sbjct: 121 EKQKKLREEIDEFNEERIERKEEEKEREREEELKI 155



 Score = 34.5 bits (80), Expect = 0.096
 Identities = 22/119 (18%), Positives = 74/119 (62%), Gaps = 8/119 (6%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE--------KEK 165
           ++  K++++E  ++ ++++  D+I  + +++++ E ++ ++++K+ +EE         E+
Sbjct: 81  EEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIER 140

Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           KEE++ + +E++ K +E  R++ +  E+ +  + + K+ K+ ++ + + ++E +  E+E
Sbjct: 141 KEEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDERE 199



 Score = 32.6 bits (75), Expect = 0.35
 Identities = 31/121 (25%), Positives = 80/121 (66%), Gaps = 9/121 (7%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKD--------KDKIKNKEKDKEKKEKEKGKKEEKE 158
           D+ I++ ++  + + +EK +K+K  ++        + + K +EK++E++E+ K  + ++E
Sbjct: 102 DEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQRE 161

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKE 217
           K E +E++E +  + KE+KE+ V ++R +++++E E++  D+ +  L +E+ ++K+++KE
Sbjct: 162 KAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKE 221

Query: 218 R 218
           +
Sbjct: 222 K 222



 Score = 32.2 bits (74), Expect = 0.53
 Identities = 28/129 (21%), Positives = 75/129 (58%), Gaps = 8/129 (6%)

Query: 108 KSIKKDKKTHKKKEKEKIK---KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
              K+++K  +++E+ KI    ++K ++++++   + + KE+KE+E  +   ++++ E E
Sbjct: 138 IERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDE 197

Query: 165 KKEEKEFKMKEDKEKPVEKIR-KEEKDSEKEKKSKDK----DKKLKKEKLKKKKKEKERS 219
           ++E  E +    +E+   K R KE++++EK ++ K +     ++  +EK ++ ++E+   
Sbjct: 198 REELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEE 257

Query: 220 SHEKEVIPK 228
             E+E + +
Sbjct: 258 EAERERMLE 266



 Score = 29.9 bits (68), Expect = 2.7
 Identities = 29/136 (21%), Positives = 80/136 (58%), Gaps = 17/136 (12%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK------------DKKDKDKIKNKEKDKEK 146
           E+L E++ D   ++ ++  ++KEKE+ +K++              ++K++   +E+ +E+
Sbjct: 199 EELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEE 258

Query: 147 KEKE----KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
            E+E    K  ++E+ ++E  EK+  K  + + + E+ +E+ ++E + +E+E++ ++ ++
Sbjct: 259 AERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEE-KEERRAAEREEELEEGER 317

Query: 203 KLKKEKLKKKKKEKER 218
             ++E  ++ + E+ER
Sbjct: 318 LREEEAERQARIEEER 333



 Score = 28.7 bits (65), Expect = 7.2
 Identities = 16/94 (17%), Positives = 56/94 (59%), Gaps = 4/94 (4%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           K+E+ + ++ +++ +++++  K+ + E+ E+E     EK + +  E + E E +++E +E
Sbjct: 246 KEERLQEERAEEEAERERMLEKQAEDEELEQEN---AEKRRMKRLEHRRELEQQIEEKEE 302

Query: 179 KPVEKIRKEEKDSEKEKKS-KDKDKKLKKEKLKK 211
           +   +  +E ++ E+ ++   ++  ++++E+ + 
Sbjct: 303 RRAAEREEELEEGERLREEEAERQARIEEERQRL 336


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 39.3 bits (92), Expect = 0.003
 Identities = 26/123 (21%), Positives = 64/123 (52%), Gaps = 4/123 (3%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIK---NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           K+    KE E+I ++  K+ ++K++    + K+   +E+ + ++E + ++EE +++EE+ 
Sbjct: 27  KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86

Query: 171 FKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
            + +E  +   EK+   E +  E+EK    ++ +L++ + +   +    +    E   KL
Sbjct: 87  VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKL 146

Query: 230 TFK 232
             K
Sbjct: 147 LLK 149



 Score = 33.1 bits (76), Expect = 0.32
 Identities = 24/124 (19%), Positives = 61/124 (49%), Gaps = 10/124 (8%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           L E  + +   + ++   ++E+E++ +K+++ D      + +  +  E +  ++E+    
Sbjct: 62  LRERNQQRQEARREREELQREEERLVQKEEQLDA-----RAEKLDNLENQLEEREKALSA 116

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
            E E +E +  K  +++   V  +  E+    ++   K  D +L++EK ++ KK +E + 
Sbjct: 117 RELELEELE--KQLDNELYRVAGLTPEQ---ARKLLLKLLDAELEEEKAQRVKKIEEEAD 171

Query: 221 HEKE 224
            E E
Sbjct: 172 LEAE 175



 Score = 28.5 bits (64), Expect = 7.5
 Identities = 19/93 (20%), Positives = 46/93 (49%), Gaps = 8/93 (8%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF-- 171
           K+       EK+   +++ ++ +     ++ E +E EK    E  +      ++ ++   
Sbjct: 89  KEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLL 148

Query: 172 -----KMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
                +++E+K + V+KI +EE D E E+K+++
Sbjct: 149 KLLDAELEEEKAQRVKKI-EEEADLEAERKAQN 180


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 39.3 bits (92), Expect = 0.003
 Identities = 15/87 (17%), Positives = 39/87 (44%)

Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
           +K   +   +  KE  K  E EK  E E+++ +   +KE+ +K   +I + E+  ++ + 
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60

Query: 197 SKDKDKKLKKEKLKKKKKEKERSSHEK 223
                 +    + ++++ + E +    
Sbjct: 61  KPAASGEGGGGEEEEEEAKAEAAEFRA 87



 Score = 36.9 bits (86), Expect = 0.015
 Identities = 16/97 (16%), Positives = 37/97 (38%), Gaps = 3/97 (3%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
           +E + +    K+  K+   EK  E  E+EK + +  +++ +K   E    +   D+ +  
Sbjct: 5   REALAELA--KELRKLTEDEKLAE-AEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61

Query: 182 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
                E    E+E++    +    +  L+        
Sbjct: 62  PAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAE 98



 Score = 36.6 bits (85), Expect = 0.025
 Identities = 17/95 (17%), Positives = 40/95 (42%), Gaps = 6/95 (6%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           E++K    +K  +  ++K +    K++  K   +I   E+  ++ E +     E    EE
Sbjct: 14  ELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEE 73

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           +E++ + E             +R  +    +E+K+
Sbjct: 74  EEEEAKAEAAEFR------AYLRGGDDALAEERKA 102



 Score = 34.3 bits (79), Expect = 0.11
 Identities = 17/100 (17%), Positives = 41/100 (41%), Gaps = 4/100 (4%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE--- 168
           K+ +    +  ++++K  + +   + + +EK +    KE+  K + E    +E  +E   
Sbjct: 2   KELREALAELAKELRKLTEDEKLAEAE-EEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           K     E      E+   + + +E     +  D  L +E+
Sbjct: 61  KPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEER 100



 Score = 28.9 bits (65), Expect = 5.6
 Identities = 16/85 (18%), Positives = 30/85 (35%)

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           ++   E  +E  K  E E   + ++EK      KEE D    +  + ++   + E     
Sbjct: 5   REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64

Query: 213 KKEKERSSHEKEVIPKLTFKFGTDM 237
             E      E+E       +F   +
Sbjct: 65  SGEGGGGEEEEEEAKAEAAEFRAYL 89


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 39.3 bits (91), Expect = 0.004
 Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 26/160 (16%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           + H+  E  K  +K D    +K   K K KEKKEKEK +  +K+KK+E E  +     + 
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKER--DKDKKKEVEGFKSLLLALD 233

Query: 175 ED------------------------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
           +                           +P E    E ++++K  K K K ++ +KE+ K
Sbjct: 234 DSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKK 293

Query: 211 KKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 250
           KKKK      H  +   +   + G   EE     SS K++
Sbjct: 294 KKKKHHHHRCHHSDGGAEQPVQNGAVEEEPLPPMSSYKLL 333



 Score = 32.3 bits (73), Expect = 0.54
 Identities = 29/141 (20%), Positives = 59/141 (41%), Gaps = 2/141 (1%)

Query: 43  PSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLG 102
               +    P  +   K+    ++K   KE++  K K+VE   + ++       S+  + 
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVA 243

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           E  +         T    E ++ K  + ++ K   K K K++ KEK+EK+K KK    + 
Sbjct: 244 EADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRC 303

Query: 161 EEKEKKEEKEFKMKEDKEKPV 181
              +   E+  +    +E+P+
Sbjct: 304 HHSDGGAEQPVQNGAVEEEPL 324



 Score = 28.5 bits (63), Expect = 8.7
 Identities = 34/129 (26%), Positives = 54/129 (41%), Gaps = 27/129 (20%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE----------KEKKEEKEFK 172
           E+ ++ + + +KDK + K+++KEK+ + +      E  E+           E+  E    
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALP 144

Query: 173 MKEDKEKPVEKIR-------KEEKDSEK----------EKKSKDKDKKLKKEKLKKKKKE 215
             ED + P +  R       K   DSEK            KS +K      EK  KK K+
Sbjct: 145 SDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKK 204

Query: 216 KERSSHEKE 224
           KE+   EKE
Sbjct: 205 KEKKEKEKE 213



 Score = 28.1 bits (62), Expect = 9.9
 Identities = 27/131 (20%), Positives = 47/131 (35%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           KD  D  +  + + DK   + EK   ++    E  +  E+ +    E K K  +K  K+E
Sbjct: 150 KDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKE 209

Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
           K+ E++K  K + +  K   L         +S  +     L         +    E    
Sbjct: 210 KEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDA 269

Query: 249 IVIKPVKSPSP 259
              +  KSP  
Sbjct: 270 EAEETKKSPKH 280


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 37.3 bits (87), Expect = 0.004
 Identities = 33/96 (34%), Positives = 56/96 (58%), Gaps = 9/96 (9%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE-KDSE 192
           +DK       +EK+E+++ +KEEKE+KEEKEK+  K    KE++EK +E++ K + K   
Sbjct: 5   RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLA 64

Query: 193 KEKKSKDKDKKLKKE--------KLKKKKKEKERSS 220
           +    +D D++LK++        +  +KKKEK    
Sbjct: 65  RYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKK 100



 Score = 36.1 bits (84), Expect = 0.011
 Identities = 25/97 (25%), Positives = 54/97 (55%), Gaps = 5/97 (5%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           K+ +  + E+ +++K+++ ++K + +EK+KE  +    K+E +++ EE EK + K     
Sbjct: 7   KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARY 66

Query: 175 EDKEKPVEKIRKEEKDSE-----KEKKSKDKDKKLKK 206
            D E   E+++++E+  +       KK +  DKK K 
Sbjct: 67  ADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKP 103



 Score = 33.4 bits (77), Expect = 0.077
 Identities = 24/102 (23%), Positives = 59/102 (57%), Gaps = 7/102 (6%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           + +DKS +      K++EKE+ K++K++K++ + +  +   +K+E+EK  +E ++ K + 
Sbjct: 3   VYRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62

Query: 164 EKKEE------KEFKMKEDKEKPVEK-IRKEEKDSEKEKKSK 198
             +        +E K +E  + P+ + +RK+++ ++K+ K  
Sbjct: 63  LARYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKPL 104



 Score = 28.4 bits (64), Expect = 4.0
 Identities = 24/98 (24%), Positives = 54/98 (55%), Gaps = 3/98 (3%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
           ++K  +  D ++K + K +EK+++++++EK K+  K   +++E+++  E   K   +   
Sbjct: 5   RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLA 64

Query: 182 EKIRKEEKDSEKEKKSKDKD---KKLKKEKLKKKKKEK 216
                E+ D E +++ +  D   + L+K+K K  KK K
Sbjct: 65  RYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGK 102


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
           unknown].
          Length = 523

 Score = 39.1 bits (91), Expect = 0.004
 Identities = 27/105 (25%), Positives = 44/105 (41%), Gaps = 2/105 (1%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           K  K+ KK  K H K   ++ K+    K     KN+    E K  E G  +  + KE+  
Sbjct: 3   KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENK-MESGTNDNNKNKEKLS 61

Query: 165 KKEEKEFKMKEDKEKP-VEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           K          ++E+   E I K   +S+K   ++   +K +KE 
Sbjct: 62  KLYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQVSTQKWRKEL 106



 Score = 29.5 bits (66), Expect = 4.4
 Identities = 22/97 (22%), Positives = 33/97 (34%), Gaps = 3/97 (3%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
           K  K   K  K+ +KN   D+ K+     K      K      E K      D  K  EK
Sbjct: 3   KASKATKKFTKNHLKNT-IDRRKQLARSKKVY--GTKNRNSHTENKMESGTNDNNKNKEK 59

Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
           + K   D +     +++D      KL    K+   + 
Sbjct: 60  LSKLYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQ 96


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 39.3 bits (92), Expect = 0.004
 Identities = 25/131 (19%), Positives = 68/131 (51%), Gaps = 4/131 (3%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           + ++  +++ K I +++  + +++ E+++ + ++ +    +  E+  E +EK +  KEE 
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353

Query: 158 EKKEEKEKKEEKEFKM----KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           E  E + ++ E E +      E+ E+ +E +R +    E +  S + + +  + +L++ +
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413

Query: 214 KEKERSSHEKE 224
             +ER   E E
Sbjct: 414 DRRERLQQEIE 424



 Score = 37.0 bits (86), Expect = 0.020
 Identities = 26/140 (18%), Positives = 67/140 (47%), Gaps = 9/140 (6%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK--IKNKEKDKEKKEK---- 149
              E L E++ + S  +++    +KE   +  +  + ++ K  ++ +  + E++ +    
Sbjct: 264 ELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEA 323

Query: 150 --EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI-RKEEKDSEKEKKSKDKDKKLKK 206
             E+ + +  E  EE  + EEK  ++KE+ E    ++   E +  E E + ++ +++L+ 
Sbjct: 324 QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLET 383

Query: 207 EKLKKKKKEKERSSHEKEVI 226
            + K  + E + +S   E+ 
Sbjct: 384 LRSKVAQLELQIASLNNEIE 403



 Score = 32.3 bits (74), Expect = 0.57
 Identities = 21/111 (18%), Positives = 57/111 (51%), Gaps = 9/111 (8%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           +++ EK ++ K+ K +  ++  E        +E  +  +E +E+ +E E++ E+     +
Sbjct: 206 ERQAEKAERYKELKAE--LRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQ 263

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLK--KEKLKKKKKEKERSSHEKE 224
           + E+ +E++R E   SE E++ ++  K+L     ++ + +++K+       
Sbjct: 264 ELEEKLEELRLE--VSELEEEIEELQKELYALANEISRLEQQKQILRERLA 312



 Score = 29.6 bits (67), Expect = 4.5
 Identities = 21/124 (16%), Positives = 60/124 (48%), Gaps = 6/124 (4%)

Query: 118 KKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
            ++  E ++++ ++  +D      E ++ ++  E+ + E +    E+   EE    ++ +
Sbjct: 836 TERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895

Query: 177 KEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 235
            E+  E++R+ E   SE  ++ ++  +KL + +L+ +  E         +  +L+ ++  
Sbjct: 896 LEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRID----NLQERLSEEYSL 951

Query: 236 DMEE 239
            +EE
Sbjct: 952 TLEE 955



 Score = 29.3 bits (66), Expect = 5.6
 Identities = 20/140 (14%), Positives = 58/140 (41%), Gaps = 21/140 (15%)

Query: 105 KKDKSIKKDKKTHKK----KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           ++ K +K + +  +        E+++++ ++  ++  + +E   E +E     +E +EK 
Sbjct: 213 ERYKELKAELRELELALLVLRLEELREELEELQEELKEAEE---ELEELTAELQELEEKL 269

Query: 161 EEKE-KKEEKEFKM-------------KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
           EE   +  E E ++                 E+  + +R+   + E++ +  +   +  +
Sbjct: 270 EELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELE 329

Query: 207 EKLKKKKKEKERSSHEKEVI 226
            KL +  +E      + E +
Sbjct: 330 SKLDELAEELAELEEKLEEL 349


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 39.3 bits (92), Expect = 0.004
 Identities = 27/111 (24%), Positives = 41/111 (36%), Gaps = 13/111 (11%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           K  +  I  +EK+        G K  KE     E+K        ED      +  KE+  
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKE-----EQKGNDLSNELEDISLGPLEYDKEK-- 264

Query: 191 SEKEKKSKDKDKKLKK------EKLKKKKKEKERSSHEKEVIPKLTFKFGT 235
            EK K  K+K KKL        +  K K    +  + + + I  + +  GT
Sbjct: 265 LEKIKDLKEKAKKLGISIILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGT 315



 Score = 36.6 bits (85), Expect = 0.030
 Identities = 23/100 (23%), Positives = 39/100 (39%), Gaps = 7/100 (7%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           E K + S   +      KE++K     ++ +   +   E DKEK EK K  KE+ +K   
Sbjct: 221 EEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGI 280

Query: 163 K-------EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
                    K +   +K++ +    +  I      S K K
Sbjct: 281 SIILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGTSGKPK 320



 Score = 33.9 bits (78), Expect = 0.21
 Identities = 17/93 (18%), Positives = 38/93 (40%), Gaps = 10/93 (10%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-KEKGKKEEKEKKE--EKE 164
           KS + +    +K     +    +K +K++ K  +   E ++      + +KEK E  +  
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271

Query: 165 KKEEKEFKMK-------EDKEKPVEKIRKEEKD 190
           K++ K+  +           +    KI+ E+ D
Sbjct: 272 KEKAKKLGISIILFDDMTKNKTTNYKIQNEDPD 304



 Score = 32.8 bits (75), Expect = 0.41
 Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 5/74 (6%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK- 165
           D  IK  +    K+EK       +  +K+  + ++ +    E E    +      E +K 
Sbjct: 208 DTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELE----DISLGPLEYDKE 263

Query: 166 KEEKEFKMKEDKEK 179
           K EK   +KE  +K
Sbjct: 264 KLEKIKDLKEKAKK 277


>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1.  MAT1
           is an assembly/targeting factor for cyclin-dependent
           kinase-activating kinase (CAK), which interacts with the
           transcription factor TFIIH. The domain found to the
           N-terminal side of this domain is a C3HC4 RING finger.
          Length = 200

 Score = 37.8 bits (88), Expect = 0.004
 Identities = 26/105 (24%), Positives = 58/105 (55%), Gaps = 5/105 (4%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM--KE 175
           ++K  +  K+ KD   ++K +   + +E ++  + +KE KE+K    +KEE+E KM  ++
Sbjct: 67  EEKVDQYEKENKDSIMRNKRRLTREQEELEQALEEEKEMKEEKRLHLQKEEQEQKMAKEK 126

Query: 176 DKEKPVEKIRKEEKDSEK---EKKSKDKDKKLKKEKLKKKKKEKE 217
           DK++ ++++      +     + K + K  + + EKL++KK+   
Sbjct: 127 DKQEIIDELETSNLPANVIIAQHKKQSKQLESQVEKLERKKRVTF 171


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 38.9 bits (91), Expect = 0.005
 Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 20/94 (21%)

Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK-------PVEKIRKEE---- 188
           K  DKE+ ++EK +KE  ++++   K +++E K K++ EK       P E  +++E    
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYS 605

Query: 189 ---------KDSEKEKKSKDKDKKLKKEKLKKKK 213
                     D++ E+ SK + KKL KE  K+ K
Sbjct: 606 AFDETGLPTHDADGEEISKKERKKLSKEYDKQAK 639



 Score = 30.8 bits (70), Expect = 1.5
 Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 3/65 (4%)

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
           +K+ + +E   EK  KE     KE+K   K KK +++K K+ +K ++      E   +  
Sbjct: 545 WKLDDKEELQREKEEKEA---LKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQE 601

Query: 231 FKFGT 235
            K+  
Sbjct: 602 DKYSA 606



 Score = 30.8 bits (70), Expect = 1.8
 Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 13/121 (10%)

Query: 95  VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
           V   +D  E++++K  K+  K  K+  K K +++K K        KE +K +K K    +
Sbjct: 544 VWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKK--------KELEKLEKAKIPPAE 595

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
             K ++++    +E      +   + + K  KE K   KE    DK  KL +E L K  K
Sbjct: 596 FFKRQEDKYSAFDETGLPTHDADGEEISK--KERKKLSKE---YDKQAKLHEEYLAKGGK 650

Query: 215 E 215
            
Sbjct: 651 S 651


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 38.1 bits (89), Expect = 0.005
 Identities = 29/118 (24%), Positives = 50/118 (42%), Gaps = 6/118 (5%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           + +   ++E+   KKK+ K    K   K+K   KK+     K  K      +KK E+   
Sbjct: 73  EGEKELQREERLKKKKRVKTKAYKEPTKKK---KKKDPTAAKSPKAAAPRPKKKSERISW 129

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
                + P    RK  + S  + K    ++  ++E  +KK + K R   EK+   +LT
Sbjct: 130 APTLLDSP---RRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELT 184



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 17/93 (18%), Positives = 30/93 (32%), Gaps = 2/93 (2%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           +K     +  KK  KK        K       K   +            +K    +    
Sbjct: 90  VKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKS--SRSSTV 147

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
           + KE    ++KE + +  +   K  K  EK+K+
Sbjct: 148 QNKEATHERLKEREIRRKKIQAKARKRKEKKKE 180


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 37.7 bits (87), Expect = 0.005
 Identities = 22/77 (28%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK-EEKEFKMKEDKEKPVEKIR 185
           KK D K K  +  +++  EK  +E+ + EE+ + E +E+K +E+E + ++++E+  + I 
Sbjct: 111 KKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170

Query: 186 KEEKDSEKEKKSKDKDK 202
           ++  DSE E   +D+ +
Sbjct: 171 EQSDDSEHEIIEQDESE 187



 Score = 37.0 bits (85), Expect = 0.011
 Identities = 24/93 (25%), Positives = 56/93 (60%), Gaps = 3/93 (3%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           +S+KKD    KKK+   I++++ +K + + +  E+  E + +E+ K +E+E +EE+E++ 
Sbjct: 108 RSVKKDDI--KKKKSLIIRQEQIEKARQEREELEERMEWERREE-KIDEREDQEEQERER 164

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
           E++   ++  +   E I ++E ++E +    +K
Sbjct: 165 EEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197



 Score = 34.3 bits (78), Expect = 0.070
 Identities = 20/81 (24%), Positives = 53/81 (65%), Gaps = 4/81 (4%)

Query: 103 EIKKDKS--IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK--EEKE 158
           +IKK KS  I++++    ++E+E+++++ + + +++  ++ +D+E++E+E+ ++  EE+ 
Sbjct: 114 DIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQS 173

Query: 159 KKEEKEKKEEKEFKMKEDKEK 179
              E E  E+ E + + D +K
Sbjct: 174 DDSEHEIIEQDESETESDDDK 194


>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
           Members of this family are associated with type VII
           secretion of WXG100 family targets in the Firmicutes,
           but not in the Actinobacteria. This highly divergent
           protein family consists largely of a central region of
           highly polar low-complexity sequence containing
           occasional LF motifs in weak repeats about 17 residues
           in length, flanked by hydrophobic N- and C-terminal
           regions [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 150

 Score = 37.0 bits (86), Expect = 0.006
 Identities = 16/96 (16%), Positives = 40/96 (41%), Gaps = 2/96 (2%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
             KKD  I  D    + +  +++   +++K     K+KEK ++++ K +   E   +   
Sbjct: 31  YEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
            +  +++ F    + E+         ++  K+  S 
Sbjct: 91  VKATKKQLF--SSEYEQTSSSSESTSEEETKKTSSI 124



 Score = 36.2 bits (84), Expect = 0.009
 Identities = 21/97 (21%), Positives = 41/97 (42%), Gaps = 2/97 (2%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
           + D K + K    EK   +   +  ++E +  KE    +E+K  K+   K K   +  + 
Sbjct: 19  ENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQK--KITFQKHKEKPEQEEL 76

Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +     E  +++   K  K++L   + E+  SS E  
Sbjct: 77  KNQLFSENATENNTVKATKKQLFSSEYEQTSSSSEST 113


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 38.2 bits (89), Expect = 0.006
 Identities = 38/187 (20%), Positives = 85/187 (45%), Gaps = 22/187 (11%)

Query: 60  EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK 119
            +  S+E T      Y  +KD E     +    +  T+S         +S+K+ K   + 
Sbjct: 26  TEDCSEESTDSWSDMYEGLKDSESSSNSVPSLSLSSTASSLSDSSTYSRSLKEVKLERQA 85

Query: 120 KE--------KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-------KEEKE 164
           +E        K+  ++KK +K  ++ + +E++KE++E E  ++  KEK       K ++ 
Sbjct: 86  QEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQA 145

Query: 165 KKEEKEFKMKEDKEK-------PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            K+      KE  E          +  R   ++  K++  + + KKLK+++ K++++ ++
Sbjct: 146 AKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRK 205

Query: 218 RSSHEKE 224
           +   ++E
Sbjct: 206 QRKKQQE 212



 Score = 32.8 bits (75), Expect = 0.32
 Identities = 16/107 (14%), Positives = 60/107 (56%), Gaps = 1/107 (0%)

Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
            +++K+  + + ++++ K+K ++   + K ++  K++  K K +  E          + +
Sbjct: 114 QEREKEREEAELRQRLAKEKYEEWCRQ-KAQQAAKQRTPKHKKEAAESASSSLSGSAKPE 172

Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
               +E+ +K +++   ++   +++K+ +++ K+ KK++ ++++K+K
Sbjct: 173 RNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219



 Score = 32.4 bits (74), Expect = 0.38
 Identities = 21/109 (19%), Positives = 56/109 (51%), Gaps = 3/109 (2%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE---EKEKKEE 168
           K ++  +++E E  ++   +K ++  + K +   K+   K KKE  E          K E
Sbjct: 113 KQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPE 172

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           +    +E K++  E   K+ K  +++++ + + ++ K+++ +++K++ E
Sbjct: 173 RNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAE 221



 Score = 31.6 bits (72), Expect = 0.77
 Identities = 20/82 (24%), Positives = 45/82 (54%), Gaps = 2/82 (2%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK--EEKEK 159
                  S+    K  +   +E+ KK+  + +  K+K +++ +E++ +++ KK  EE+E+
Sbjct: 157 AAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEER 216

Query: 160 KEEKEKKEEKEFKMKEDKEKPV 181
           K++ E+  +K  K  + + KPV
Sbjct: 217 KQKAEEAWQKWMKNVKQRPKPV 238



 Score = 29.7 bits (67), Expect = 3.2
 Identities = 16/77 (20%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK--EKDKEKKEKEKGKKEEKEKK 160
             ++    KK+             K +    +++ K +  E + +K ++++ K+EE+ +K
Sbjct: 146 AKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRK 205

Query: 161 EEKEKKEEKEFKMKEDK 177
           + K+++EE+E K K ++
Sbjct: 206 QRKKQQEEEERKQKAEE 222



 Score = 28.9 bits (65), Expect = 5.5
 Identities = 22/115 (19%), Positives = 61/115 (53%), Gaps = 13/115 (11%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK---EKGKKEEKEKKE 161
           K+ +  +  ++  K+K +E  ++K  +  K +    +K+  +          K E+   +
Sbjct: 118 KEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQ 177

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           E+ KK  +E+++K        K++++++  ++E++ + + KK ++E+ +K+K E+
Sbjct: 178 EEAKKRLQEWELK--------KLKQQQQ--KREEERRKQRKKQQEEEERKQKAEE 222


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 38.5 bits (90), Expect = 0.006
 Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 2/78 (2%)

Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
              +++ K++K++ K++  +KK +K KK+  +        + + + ++ E     +KK K
Sbjct: 4   ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDD--DAFLAELISENQEAENKQNNKKKKK 61

Query: 199 DKDKKLKKEKLKKKKKEK 216
            K KK KK   +      
Sbjct: 62  KKKKKKKKNLGEAYDLAY 79



 Score = 37.8 bits (88), Expect = 0.011
 Identities = 21/91 (23%), Positives = 42/91 (46%), Gaps = 3/91 (3%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK---IKNKEKDKEKKEKEKGKKEEKEKKE 161
           +  ++  K +K   K++  K K KK KKD D       +   + ++ + K   ++K+KK+
Sbjct: 4   ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
           +K+KK+          + PV       +D+ 
Sbjct: 64  KKKKKKNLGEAYDLAYDLPVVWSSAAFQDNS 94



 Score = 36.6 bits (85), Expect = 0.022
 Identities = 15/92 (16%), Positives = 40/92 (43%), Gaps = 2/92 (2%)

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
              E  + E K++K++   K K  K+K  +  +  + D     +   ++++ + ++  KK
Sbjct: 1   AMNENGENEVKQQKQQN--KQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKK 58

Query: 213 KKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 244
           KK+K++   +K +       +   +   +   
Sbjct: 59  KKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAF 90



 Score = 35.8 bits (83), Expect = 0.047
 Identities = 20/89 (22%), Positives = 40/89 (44%), Gaps = 3/89 (3%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           + + K  K++ K+K  KKK+KK K  +   + D     +   + +E E K+  +KK++K+
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDV---DDDDAFLAELISENQEAENKQNNKKKKKKK 63

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
            K K+        +  +          +D
Sbjct: 64  KKKKKKNLGEAYDLAYDLPVVWSSAAFQD 92



 Score = 32.8 bits (75), Expect = 0.36
 Identities = 19/89 (21%), Positives = 45/89 (50%), Gaps = 1/89 (1%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDK 202
             +  + + K+++++ K++  KK+ K+ K   D  +  + ++  E +++E ++ +K K K
Sbjct: 2   MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61

Query: 203 KLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
           K KK+K K   +  + +     V     F
Sbjct: 62  KKKKKKKKNLGEAYDLAYDLPVVWSSAAF 90



 Score = 32.4 bits (74), Expect = 0.57
 Identities = 17/85 (20%), Positives = 39/85 (45%)

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
           ++ K++++  K+K  K K+K   K+ D +     +   E +E + ++  K++K+ K K+ 
Sbjct: 8   NEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKK 67

Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKD 201
           K+   E              +  +D
Sbjct: 68  KKNLGEAYDLAYDLPVVWSSAAFQD 92



 Score = 30.5 bits (69), Expect = 1.9
 Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 19/164 (11%)

Query: 54  DKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE-IKKDKSIKK 112
           +    ++K  +K+K + K+ + SK    +                  L E I +++  + 
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDD--------------DAFLAELISENQEAEN 52

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
            +   KKK+K+K KKKK+  +   +                 ++     +     E+E+K
Sbjct: 53  KQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAF----QDNSHIRKLGNWPEQEWK 108

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
             +    PV K  K+ +    E +    +   +    +K++ EK
Sbjct: 109 QTQPPTIPVSKQFKDGEYPVGEIQEYPGENSSRTSSEEKRELEK 152


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 38.2 bits (89), Expect = 0.008
 Identities = 22/62 (35%), Positives = 40/62 (64%), Gaps = 2/62 (3%)

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           EK+ EKE   +E++E+  EK  KEE++   +K+ +  +++ K+EK KK KK KE ++  +
Sbjct: 28  EKEVEKEVPDEEEEEEKEEK--KEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85

Query: 224 EV 225
            +
Sbjct: 86  LL 87



 Score = 36.7 bits (85), Expect = 0.023
 Identities = 25/65 (38%), Positives = 39/65 (60%), Gaps = 8/65 (12%)

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           K+ EKE  +E+E++E++E K +E+K    E    EE D E+EK+ K K    K +K+K+ 
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKE----EEVDEEEEKEEKKK----KTKKVKET 80

Query: 213 KKEKE 217
             E E
Sbjct: 81  TTEWE 85



 Score = 35.5 bits (82), Expect = 0.049
 Identities = 28/67 (41%), Positives = 44/67 (65%), Gaps = 4/67 (5%)

Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
           KE EK   +E+E++E++EKKEE+E     DKE+ V++  +EEK+ +K+K  K K+   + 
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEE--KTTDKEEEVDE--EEEKEEKKKKTKKVKETTTEW 84

Query: 207 EKLKKKK 213
           E L K K
Sbjct: 85  ELLNKTK 91



 Score = 34.0 bits (78), Expect = 0.16
 Identities = 18/65 (27%), Positives = 44/65 (67%), Gaps = 1/65 (1%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE-D 176
           K+ EKE   ++++++ ++K + +EK  +K+E+   ++E++EKK++ +K +E   + +  +
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88

Query: 177 KEKPV 181
           K KP+
Sbjct: 89  KTKPI 93



 Score = 32.8 bits (75), Expect = 0.36
 Identities = 21/63 (33%), Positives = 40/63 (63%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
           K   KE   E++E+EK +K+E+E+K   +++E  E + KE+K+K  +K+++   + E   
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88

Query: 196 KSK 198
           K+K
Sbjct: 89  KTK 91



 Score = 29.0 bits (65), Expect = 5.7
 Identities = 20/84 (23%), Positives = 41/84 (48%)

Query: 88  MVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 147
            V   V+    ++  E +K++  ++++KT  K+E+   +++K++K K   K KE   E +
Sbjct: 26  WVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEF 171
              K K       ++  K+E   F
Sbjct: 86  LLNKTKPIWTRNPKDVTKEEYAAF 109


>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD.  This
           family consists of several putative bacterial stage IV
           sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
           is known to be essential for efficient sporulation
           although its exact function is unknown.
          Length = 383

 Score = 38.1 bits (89), Expect = 0.008
 Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 6/109 (5%)

Query: 111 KKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KE 167
             +  T +K  K  +    KK      K + ++ DKE+ EK    K +      +E   E
Sbjct: 261 NFEVLTGEKITKHYLSIGGKKLPLKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYE 320

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD---KDKKLKKEKLKKKK 213
            ++  +K  KE+ VEK +K  K   +++  KD   KD+K+  E+++  K
Sbjct: 321 VQDKVVKLTKEEAVEKGKKLAKKKLEKEIDKDAKIKDEKVLHERVENGK 369



 Score = 29.3 bits (66), Expect = 4.7
 Identities = 25/88 (28%), Positives = 45/88 (51%), Gaps = 2/88 (2%)

Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKE-KGKKEEKEKKEEKEKKE 167
           +K  K   +K +KE+ +K    K K  I   +E   E ++K  K  KEE  +K +K  K+
Sbjct: 284 LKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVEKGKKLAKK 343

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
           + E ++ +D +   EK+  E  ++ K +
Sbjct: 344 KLEKEIDKDAKIKDEKVLHERVENGKVR 371


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 36.2 bits (84), Expect = 0.010
 Identities = 24/74 (32%), Positives = 45/74 (60%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            +D+  ++ ++  K+K++EK  KK+ K+ K K K K+K K KK  +K +KE  +  EE  
Sbjct: 63  TEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESS 122

Query: 165 KKEEKEFKMKEDKE 178
            +EE+  + K+++ 
Sbjct: 123 DEEEEGEEDKQEEP 136



 Score = 34.3 bits (79), Expect = 0.039
 Identities = 22/73 (30%), Positives = 46/73 (63%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           + +++K  K ++KT KK+ K + KK+K KK K   K  +K++++  K   +  ++E++ E
Sbjct: 70  QKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGE 129

Query: 163 KEKKEEKEFKMKE 175
           ++K+EE    M++
Sbjct: 130 EDKQEEPVEIMEK 142



 Score = 33.9 bits (78), Expect = 0.067
 Identities = 25/84 (29%), Positives = 50/84 (59%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           D+K  K+ E E+ ++K+++K +   +   K + K++K+K KK++K+K ++  KKEEKE  
Sbjct: 56  DEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGS 115

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKK 196
              ++    E+  +E+K  E  + 
Sbjct: 116 KSSEESSDEEEEGEEDKQEEPVEI 139



 Score = 31.6 bits (72), Expect = 0.32
 Identities = 27/81 (33%), Positives = 48/81 (59%), Gaps = 3/81 (3%)

Query: 147 KEKEKGKKEEKEKKEEKEKKE---EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           +E E+ +  +++ K+E E +E   ++E K ++D+EK  +K  K +K  +K+KK K   K 
Sbjct: 47  REYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKG 106

Query: 204 LKKEKLKKKKKEKERSSHEKE 224
            KKE+ +  K  +E S  E+E
Sbjct: 107 NKKEEKEGSKSSEESSDEEEE 127



 Score = 28.5 bits (64), Expect = 3.7
 Identities = 22/87 (25%), Positives = 52/87 (59%)

Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
           D+K K + +++E  ++++EK++  +E+  KK  K +K++++ K K+  +K  +K  KE  
Sbjct: 56  DEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGS 115

Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKEK 216
            S +E   ++++ +  K++   +  EK
Sbjct: 116 KSSEESSDEEEEGEEDKQEEPVEIMEK 142



 Score = 27.3 bits (61), Expect = 8.9
 Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 8/97 (8%)

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           E E+ E  ++K +KE + +E ++K  EK RK+E+ + K      K  K +K+K KKKKK+
Sbjct: 48  EYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAK------KRAKRQKKKQKKKKKK 101

Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
           K +     +   K   K   +  ++ +     K    
Sbjct: 102 KAKK--GNKKEEKEGSKSSEESSDEEEEGEEDKQEEP 136


>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 320

 Score = 37.4 bits (86), Expect = 0.010
 Identities = 19/115 (16%), Positives = 46/115 (40%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
                DK  K+       ++ +++  +K      + K   +   K+ + KG  EE+E + 
Sbjct: 93  EIAVLDKLAKELLLGLSDEQGKQLIAEKPAFQDVEGKFDARAYLKRLRIKGLTEEQEAEA 152

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
            ++K + K+   K  + +    + K E  +++      K  +  K    +  K++
Sbjct: 153 LRDKLQNKQQGKKVTEVQARHILVKAEAKAKEALALLKKGVREAKADFAELAKKQ 207


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 37.6 bits (87), Expect = 0.013
 Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 6/64 (9%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE--KEKKEEK 169
           K     K+K KEK    K+ K    I++   + E +     ++EEK          K++K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIK----IESPSVETEGERCTIKQREEKGIDAPAILNVKKKK 847

Query: 170 EFKM 173
            +K+
Sbjct: 848 PYKV 851



 Score = 36.5 bits (84), Expect = 0.032
 Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 6/65 (9%)

Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           T K   + K +K K+K   DK    E    + E E+   +++E+K            +  
Sbjct: 790 TIKAAARAK-QKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDA-----PAILNV 843

Query: 176 DKEKP 180
            K+KP
Sbjct: 844 KKKKP 848



 Score = 34.5 bits (79), Expect = 0.13
 Identities = 29/143 (20%), Positives = 53/143 (37%), Gaps = 19/143 (13%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK--PVEKIRKEEKDS 191
           K   + K+K KEK   +K  K E    E   + E    K +E+K    P     K++K  
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVE--TEGERCTIKQREEKGIDAPAILNVKKKKPY 849

Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV- 250
           + +  ++   ++ +     K  +           +P   FK G  +    + +    ++ 
Sbjct: 850 KVDNMTRILPQQSRYISFIKDDR----------FVPVRKFKGGVVVLRDREPKEPVALIE 899

Query: 251 ----IKPVKSPSPPPVAAADEYD 269
               +K V +P P P    D  D
Sbjct: 900 TVRQMKDVNAPLPTPFKVDDNVD 922



 Score = 29.9 bits (67), Expect = 3.1
 Identities = 27/127 (21%), Positives = 44/127 (34%), Gaps = 7/127 (5%)

Query: 27  HPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVT 86
            PLI    V     P             K    E +  +E   P+  E +  K V    T
Sbjct: 727 FPLIH--FVSLSFLPTTVIGIRGSQAIPKFCFNEDLEEEEFEYPRMYEEASGKSVRKVNT 784

Query: 87  PMVPNIVKVTSSEDLGEIKK---DKSIKKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEK 142
            ++   +K  +       +K   DK IK +  +  + E E+   K +++K  D       
Sbjct: 785 AVLSTTIKAAARAKQKPKEKGPNDKEIKIESPS-VETEGERCTIKQREEKGIDAPAILNV 843

Query: 143 DKEKKEK 149
            K+K  K
Sbjct: 844 KKKKPYK 850


>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34.  This family represents
           herpes virus protein U79 and cytomegalovirus early
           phosphoprotein P34 (UL112).
          Length = 238

 Score = 36.8 bits (85), Expect = 0.013
 Identities = 18/80 (22%), Positives = 37/80 (46%), Gaps = 1/80 (1%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
            K++ K+KEK +++  +  K+  + K +EK +  ++K  G        +     +++  K
Sbjct: 160 RKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPK 219

Query: 173 MKEDKEKPVEKIRKEEKDSE 192
            K  K    E+ R E +  E
Sbjct: 220 EKRQKHHDPER-RLEPQSHE 238



 Score = 34.5 bits (79), Expect = 0.087
 Identities = 22/113 (19%), Positives = 54/113 (47%), Gaps = 5/113 (4%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
              +LG++K  +  +K+ +   +K+ +   +K+  K K+K + ++  K K+++ K ++E+
Sbjct: 130 EIRNLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEK 189

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
           +   E+K           +          K+E   EK +K  D +++L+ +  
Sbjct: 190 RRNDEDKRPGGGGGSSGGQSGLS-----TKDEPPKEKRQKHHDPERRLEPQSH 237



 Score = 34.1 bits (78), Expect = 0.099
 Identities = 28/110 (25%), Positives = 53/110 (48%), Gaps = 7/110 (6%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           D K  +K EKE     + K D +      + +  K+KEK + E+ +K +E  +K+++E +
Sbjct: 136 DVKNAEKFEKECRALSRKKSDDEH-----RKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
             ++ ++P          S     +KD+  K K++K    ++  E  SHE
Sbjct: 191 RNDEDKRPGGGGGSSGGQS--GLSTKDEPPKEKRQKHHDPERRLEPQSHE 238



 Score = 28.3 bits (63), Expect = 7.5
 Identities = 21/105 (20%), Positives = 41/105 (39%), Gaps = 20/105 (19%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           KE   + +KK   +  K   K+K+K + E  +  KE++ KK+E++++ +++         
Sbjct: 145 KECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDED--------- 195

Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
                  +                 K E      KEK +  H+ E
Sbjct: 196 -------KRPGGGGGSSGGQSGLSTKDEP----PKEKRQKHHDPE 229


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 36.9 bits (86), Expect = 0.015
 Identities = 26/105 (24%), Positives = 56/105 (53%), Gaps = 2/105 (1%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--EKEKKEEKEKKEEKEFKMKEDKEKP 180
           E+   + D+    K K  E ++ K E  + ++E   +++KEE++  E +E   +E  ++ 
Sbjct: 189 EEAILQTDQALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQL 248

Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
           +EK+  E +    E++   + K  ++E+L K+  + E  S +KE+
Sbjct: 249 IEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQKEI 293



 Score = 35.3 bits (82), Expect = 0.041
 Identities = 21/90 (23%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           K  E E+ K +  + +++ ++ K+K++E+  + + +  ++  K+  EK E +  K+  ++
Sbjct: 204 KAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQ 263

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
           E+ +E   +E+++  KE   K + + L+KE
Sbjct: 264 ERMLEHKLQEQEELLKE-GFKTEAESLQKE 292



 Score = 29.6 bits (67), Expect = 2.8
 Identities = 22/94 (23%), Positives = 52/94 (55%), Gaps = 6/94 (6%)

Query: 105 KKDKSIKKDKKTHKKKEKE-KIKKKKDKKDKDKIKNKEK---DKEKKEKEKGKKEEKEKK 160
            K+K+I+ ++   +  E E ++ ++K K+++  ++ +E+   +  K+  EK + E ++  
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
            E+E+  E   K++E +E   E  + E +  +KE
Sbjct: 261 AEQERMLEH--KLQEQEELLKEGFKTEAESLQKE 292


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 37.1 bits (86), Expect = 0.015
 Identities = 18/109 (16%), Positives = 36/109 (33%), Gaps = 9/109 (8%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKI---------KNKEKDKEKKEKEKGKKEEKEKKEEK 163
           D+K   KK K+    K   +D + I           K K  +K ++       +  K+  
Sbjct: 10  DEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSS 69

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           +  E      +E   KP +          K K    + ++ ++ +    
Sbjct: 70  KPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPD 118



 Score = 36.3 bits (84), Expect = 0.032
 Identities = 27/120 (22%), Positives = 41/120 (34%), Gaps = 8/120 (6%)

Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
           S+  +K   KK +K    K   + D+D ++  + + E+ EK K  K+ KE       +  
Sbjct: 7   SLPDEKAPTKKPKKGDASKDSTEDDEDILEFLD-ELEQSEKAKPPKKPKEASRPGTPRNP 65

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
           K    K  K         EEK     K  K  +         K    +     E E  P 
Sbjct: 66  K----KSSKPTESSAASSEEK---PAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPD 118



 Score = 36.3 bits (84), Expect = 0.032
 Identities = 17/84 (20%), Positives = 30/84 (35%), Gaps = 2/84 (2%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKK--EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
           E L E+++ +  K  KK  +       +  KK  K  +    + E+   K  K       
Sbjct: 36  EFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRS 95

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKP 180
              K +    E +E +  E+   P
Sbjct: 96  SHPKSKAPSTESEEEEEPEETPDP 119


>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
           fragment found in eukaryotic DNA topoisomerase (topo) IB
           proteins similar to the monomeric yeast and human topo I
           and heterodimeric topo I from Leishmania donvanni. Topo
           I enzymes are divided into:  topo type IA (bacterial)
           and type IB (eukaryotic). Topo I relaxes superhelical
           tension in duplex DNA by creating a single-strand nick,
           the broken strand can then rotate around the unbroken
           strand to remove DNA supercoils and, the nick is
           religated, liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit re-ligation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topo I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I play putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 36.5 bits (85), Expect = 0.016
 Identities = 23/58 (39%), Positives = 34/58 (58%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           KD +    KE++ I KK  K D   I    +++++K+K   K+E+K  KEEKEK EE 
Sbjct: 65  KDFRKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122



 Score = 28.0 bits (63), Expect = 8.0
 Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 126 KKKKDKKDKDKIKNKEK-----DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           +K   K++K  IK   K       +  E+EK KK+   K+E+K  KEEKE   + 
Sbjct: 68  RKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122


>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase.  The valyl-tRNA synthetase
           (ValS) is a class I amino acyl-tRNA ligase and is
           particularly closely related to the isoleucyl tRNA
           synthetase [Protein synthesis, tRNA aminoacylation].
          Length = 861

 Score = 37.3 bits (87), Expect = 0.017
 Identities = 23/117 (19%), Positives = 49/117 (41%), Gaps = 10/117 (8%)

Query: 42  PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVEL--GVTPMVPNIVKVTSSE 99
           PP+     +      +  E     +  +   K      +VE+      +   +V++    
Sbjct: 746 PPNAPLKVLL---IYTEAETAERLKLNAVDIKGAINFSEVEVVIEKPEVTEAVVELVPGF 802

Query: 100 DL-----GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           ++     G I K K + + +K   K++KE I+ +   +++  +K   K+  +KEKEK
Sbjct: 803 EIIIPVKGLINKAKELARLQKQLDKEKKEVIRIEGKLENEGFVKKAPKEVIEKEKEK 859


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 35.8 bits (83), Expect = 0.017
 Identities = 19/89 (21%), Positives = 42/89 (47%), Gaps = 1/89 (1%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
              +K     E E  K++++ +K +KE K ++     + K + +    K E + +KEK  
Sbjct: 40  PQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLV 99

Query: 198 KDKDKK-LKKEKLKKKKKEKERSSHEKEV 225
              +KK  + EK   +++ +E     +++
Sbjct: 100 NAFNKKQQEYEKDLNRREAEEEQKLLEKI 128



 Score = 30.4 bits (69), Expect = 1.2
 Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 4/87 (4%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           S DL    K    K+ K+  K +++ K K+ K + D       ++ K + E +K K    
Sbjct: 46  SADLESEFK----KRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNA 101

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKI 184
             K+++E +++   +  E+++K +EKI
Sbjct: 102 FNKKQQEYEKDLNRREAEEEQKLLEKI 128



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 23/94 (24%), Positives = 44/94 (46%), Gaps = 8/94 (8%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
            KK    ++ +  K+ K +++  +K+ + KE +  + + K +      K E E K    K
Sbjct: 42  AKKVSADLESEFKKRQK-ELQKMQKELKAKEAKL-QDDGKMEALSDRAKAEAEIK----K 95

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           EK V    K  K  E EK    ++ + +++ L+K
Sbjct: 96  EKLVNAFNK--KQQEYEKDLNRREAEEEQKLLEK 127


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 34.9 bits (81), Expect = 0.018
 Identities = 18/70 (25%), Positives = 36/70 (51%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           ++     K+      KEK K +KK+++K  +  K  +   E+K + + +K EKE +E + 
Sbjct: 4   EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63

Query: 165 KKEEKEFKMK 174
           +   +E K +
Sbjct: 64  ELARRELKAE 73



 Score = 29.1 bits (66), Expect = 1.7
 Identities = 16/76 (21%), Positives = 42/76 (55%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           + ++EK    KE  +   +++ K EK+ + K+ + + + K+  EEK   + +K + + ++
Sbjct: 1   EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60

Query: 204 LKKEKLKKKKKEKERS 219
           L+ E  +++ K + + 
Sbjct: 61  LEAELARRELKAEAKK 76


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 35.8 bits (83), Expect = 0.018
 Identities = 27/69 (39%), Positives = 40/69 (57%), Gaps = 2/69 (2%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           +   E E  +KE   K E++ + EE+E K K+ K++   K  K+EK  +KEK  + K  K
Sbjct: 127 ELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEV--KKEKKEKKDKKEKMVEPKGSK 184

Query: 204 LKKEKLKKK 212
            KK+K KKK
Sbjct: 185 KKKKKKKKK 193



 Score = 34.3 bits (79), Expect = 0.061
 Identities = 21/77 (27%), Positives = 44/77 (57%)

Query: 84  GVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 143
           G         ++ S  +  E +    ++K+ +  ++++KEK KKK+ KK+K + K+K++ 
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEK 176

Query: 144 KEKKEKEKGKKEEKEKK 160
             + +  K KK++K+KK
Sbjct: 177 MVEPKGSKKKKKKKKKK 193



 Score = 33.5 bits (77), Expect = 0.12
 Identities = 23/76 (30%), Positives = 43/76 (56%), Gaps = 1/76 (1%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
            + D    +     +  +K    K +++ + ++++K K+K K KE  KEKKEK+  K++ 
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEK-KEKKKKKEVKKEKKEKKDKKEKM 177

Query: 157 KEKKEEKEKKEEKEFK 172
            E K  K+KK++K+ K
Sbjct: 178 VEPKGSKKKKKKKKKK 193



 Score = 33.2 bits (76), Expect = 0.17
 Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 7/68 (10%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           +  EK+   K +++ + EE+EKKE+K+KKE K       KEK  +K +KE+    K  K 
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVK-------KEKKEKKDKKEKMVEPKGSKK 185

Query: 198 KDKDKKLK 205
           K K KK K
Sbjct: 186 KKKKKKKK 193



 Score = 32.8 bits (75), Expect = 0.22
 Identities = 19/64 (29%), Positives = 39/64 (60%)

Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
             + +  EK    K +K+ +  + ++K+K+KK++ K +K+EK+ K+EK  + +   K K+
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188

Query: 176 DKEK 179
            K+K
Sbjct: 189 KKKK 192



 Score = 32.4 bits (74), Expect = 0.28
 Identities = 20/67 (29%), Positives = 41/67 (61%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
            +   + +  +K+      KE + E++EK++ KK+++ KKE+KEKK++KE  ++    K 
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185

Query: 181 VEKIRKE 187
            +K +K+
Sbjct: 186 KKKKKKK 192


>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A.  This protein is encoded
           by an open reading frame in plasmid borne DNA repeats of
           Borrelia species. This protein is known as ORF-A. The
           function of this putative protein is unknown.
          Length = 285

 Score = 36.6 bits (85), Expect = 0.019
 Identities = 37/143 (25%), Positives = 59/143 (41%), Gaps = 26/143 (18%)

Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK---------------------E 148
           +   KK   +   +  K+KK ++ K ++    K K  K                     +
Sbjct: 85  LNYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIK 144

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKM-----KEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           +E  KK  K  ++ K KK  K+        K+ K K ++ I+ EE    K KK K+K K 
Sbjct: 145 EESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKT 204

Query: 204 LKKEKLKKKKKEKERSSHEKEVI 226
           LK      K KE E+  ++K+ I
Sbjct: 205 LKNILYNLKDKELEKKGYKKKQI 227



 Score = 28.9 bits (65), Expect = 5.9
 Identities = 28/114 (24%), Positives = 49/114 (42%), Gaps = 1/114 (0%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
            K     K +   +K     K  K++ KK   K   K K K+  +K   K   K+  + K
Sbjct: 121 NKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIK 180

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK-LKKEKLKKKKKEK 216
             K  K  +    K K ++  +K  K+     K K+ +KK  KK++++K  ++ 
Sbjct: 181 LLKIIKIEENFLIKNKKLKNKQKTLKNILYNLKDKELEKKGYKKKQIEKFFQKI 234



 Score = 28.5 bits (64), Expect = 7.7
 Identities = 28/138 (20%), Positives = 55/138 (39%), Gaps = 9/138 (6%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKI---KKKKDKKDKDKIKNKEKDKEKKEKEKG 152
           T    L +++K+  +  +   H  K        K    KK+  +I NK   ++K E+ K 
Sbjct: 51  TLQNYLYKLEKEFKVTTNYYKHLGKNNGTEIYYKLNYQKKECYEIINKYFKEKKIERFKK 110

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           +     KK+  +    ++ +          K     K  ++E K K+     K +  K  
Sbjct: 111 RVNAYFKKKSNKNGSVEKNECIN------NKNNVINKKIKEESKKKNSKNIEKLKIKKYF 164

Query: 213 KKEKERSSHEKEVIPKLT 230
           KK   +++ +K++  KL 
Sbjct: 165 KKCNFKTNLKKDIKIKLL 182


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 37.0 bits (86), Expect = 0.019
 Identities = 38/198 (19%), Positives = 89/198 (44%), Gaps = 20/198 (10%)

Query: 59  KEKMSSKEKTSPKEKEYSKVKDVELGV---TPMVPNIVKVT----SSEDLGEIKKDKSIK 111
           K K+   E+  P+   +  V+ ++  V    P+   + KV       E  G  ++++   
Sbjct: 358 KPKLEKVERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREI 417

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
              +   KK +E +++ +++  + K + +E  +E  EK + + E   ++   + ++++E 
Sbjct: 418 TVYEKRIKKLEETVERLEEENSELKRELEELKREI-EKLESELERFRREVRDKVRKDREI 476

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
           + ++       +I + EK+ E++KK  ++ ++   E  K +K E        +V+ KLT 
Sbjct: 477 RARD------RRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLT- 529

Query: 232 KFGTDMEEKTKRESSPKI 249
                +E   + E    I
Sbjct: 530 -----LEAIEEAEEEYGI 542


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 36.5 bits (85), Expect = 0.020
 Identities = 19/70 (27%), Positives = 36/70 (51%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
             +   E+ K   I + K   K  ++ K KK++ ++D +  +++ + K K ++E  KK+ 
Sbjct: 204 PPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263

Query: 157 KEKKEEKEKK 166
           KE K  K  K
Sbjct: 264 KESKGVKALK 273



 Score = 35.4 bits (82), Expect = 0.042
 Identities = 18/72 (25%), Positives = 32/72 (44%)

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
           ++  K     E K   +  ++ ++K+ + +ED E    +  K+ K  E+ KK K K+ K 
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268

Query: 205 KKEKLKKKKKEK 216
            K   K   K  
Sbjct: 269 VKALKKVVAKGM 280



 Score = 35.0 bits (81), Expect = 0.054
 Identities = 26/83 (31%), Positives = 37/83 (44%), Gaps = 6/83 (7%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
               K+  K     + K  DK  KE +K K+E +E  E  E + EK+ K KE      E 
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKE------EI 258

Query: 184 IRKEEKDSEKEKKSKDKDKKLKK 206
            +K+ K+S+  K  K    K  K
Sbjct: 259 KKKKPKESKGVKALKKVVAKGMK 281



 Score = 32.7 bits (75), Expect = 0.30
 Identities = 19/73 (26%), Positives = 36/73 (49%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
           K+  K     + K       + K K +E +++ +  E   +++++ KEE +KK+ KE K 
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268

Query: 174 KEDKEKPVEKIRK 186
            +  +K V K  K
Sbjct: 269 VKALKKVVAKGMK 281



 Score = 32.7 bits (75), Expect = 0.31
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 2/64 (3%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
               DK +K+ KK  KK+E E+  +  + + + K K+KE+ K+KK KE    +  +K   
Sbjct: 220 FKPLDKYLKESKK--KKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVA 277

Query: 163 KEKK 166
           K  K
Sbjct: 278 KGMK 281



 Score = 31.9 bits (73), Expect = 0.54
 Identities = 19/74 (25%), Positives = 36/74 (48%)

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
            +  K+  K     +   + +  ++  +K R+ E+D E  +   +K +K K+E  KKK K
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264

Query: 215 EKERSSHEKEVIPK 228
           E +     K+V+ K
Sbjct: 265 ESKGVKALKKVVAK 278



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 2/79 (2%)

Query: 91  NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
           +++      DL +      +  + K   K  KE  KKK++ ++  +      +K++K KE
Sbjct: 197 DLLCSYLPPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKE 256

Query: 151 KGKKEEKEKKEEKEKKEEK 169
           + KK  K+ KE K  K  K
Sbjct: 257 EIKK--KKPKESKGVKALK 273



 Score = 29.6 bits (67), Expect = 2.8
 Identities = 14/54 (25%), Positives = 26/54 (48%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
               KK +  ++D +  + + ++K K K++ K K   ++K     KK   KG K
Sbjct: 228 KESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMK 281



 Score = 29.2 bits (66), Expect = 4.0
 Identities = 26/107 (24%), Positives = 42/107 (39%), Gaps = 9/107 (8%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKD-------KEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           K KE+  ++   +    D I N  +              +  K+  K     + K  +K 
Sbjct: 167 KSKEESLVRSALELDIPDDILNLLRLRYACDLLCSYLPPDLYKELLKSLLIPEFKPLDKY 226

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEK 216
            K  + K++  E+   E  +S  EKK K K++ K KK K  K  K  
Sbjct: 227 LKESKKKKRETEED-VEAAESRAEKKRKSKEEIKKKKPKESKGVKAL 272


>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0).  Part of the CF(0)
           (base unit) of the ATP synthase. The base unit is
           thought to translocate protons through membrane (inner
           membrane in mitochondria, thylakoid membrane in plants,
           cytoplasmic membrane in bacteria). The B subunits are
           thought to interact with the stalk of the CF(1)
           subunits. This domain should not be confused with the ab
           CF(1) proteins (in the head of the ATP synthase) which
           are found in pfam00006.
          Length = 132

 Score = 34.9 bits (81), Expect = 0.020
 Identities = 20/89 (22%), Positives = 43/89 (48%), Gaps = 2/89 (2%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
           KI  ++ +K  + IK  E+  ++      + E++  +   E  E      K++ +K  E+
Sbjct: 26  KILDERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIIN-NAKKEAQKLKEE 84

Query: 184 IRKE-EKDSEKEKKSKDKDKKLKKEKLKK 211
           I  E +KD+E+  +S   + + +KE+   
Sbjct: 85  ILAEAQKDAERLLESARAEIEQEKEQALA 113


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 37.1 bits (86), Expect = 0.021
 Identities = 30/155 (19%), Positives = 58/155 (37%), Gaps = 5/155 (3%)

Query: 120  KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
            KE +K+ K+  K ++ + K +      +     K   +  K+   KK  K+    E  E+
Sbjct: 1163 KELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222

Query: 180  PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE-KEVIPKLTFK----FG 234
                   E ++  +  K K +    KK     K+KE+E    + K+ +            
Sbjct: 1223 TYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQS 1282

Query: 235  TDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 269
              MEE  K   + +   +     S   ++ +D+ D
Sbjct: 1283 AKMEETVKAVPARRAAARKKPLASVSVISDSDDDD 1317



 Score = 30.2 bits (68), Expect = 3.1
 Identities = 44/250 (17%), Positives = 93/250 (37%), Gaps = 32/250 (12%)

Query: 34   LVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVE-LGVTPMVPNI 92
            L    GT    K+++  + +DK +   ++   +KT+PK      +KD++ L       + 
Sbjct: 1116 LSMAIGTLTLEKVQELCADRDKLN--IEVEDLKKTTPKSLW---LKDLDALEKELDKLDK 1170

Query: 93   VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE---- 148
                + E   ++++    + +    KK  ++  KK   KK   K    E  +E       
Sbjct: 1171 EDAKAEEAREKLQRAA-ARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAM 1229

Query: 149  ---------KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD--SEKEKKS 197
                     K KG+   K+K     K++E+E ++ + K++             ++ E+  
Sbjct: 1230 ETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETV 1289

Query: 198  KDKDKKLKKEKLKKKKKEKERS---SHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPV 254
            K    +    + K        S     + +   +++      + E+ K++   K      
Sbjct: 1290 KAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVS------LAERLKKKGGRKPAAANK 1343

Query: 255  KSPSPPPVAA 264
            K  + PP AA
Sbjct: 1344 K-AAKPPAAA 1352


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 37.0 bits (86), Expect = 0.021
 Identities = 24/167 (14%), Positives = 58/167 (34%), Gaps = 6/167 (3%)

Query: 57   SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIK-KDKSIKKDKK 115
                     E    K +  +++K +   +       ++     +L ++   +  +K+  +
Sbjct: 912  KKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSE 971

Query: 116  THK---KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
             ++   KK    +++      + K   KE  +  K+    ++  K+ KE   +  E +  
Sbjct: 972  EYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSA 1031

Query: 173  MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
             K    +  E      K  +K K     +    + + K  K  +E S
Sbjct: 1032 SKIISSESTEL--SILKPLQKLKGLLLLENNQLQARYKALKLRRENS 1076



 Score = 33.5 bits (77), Expect = 0.30
 Identities = 31/128 (24%), Positives = 49/128 (38%), Gaps = 9/128 (7%)

Query: 107  DKSIKKDKKTHKKKEKEKIKKKKDKKD----KDKIKNKEKDKEKKEKEKGKKE-----EK 157
             KSI   K  + + E E I+ KK          + K +   + KK       E     E 
Sbjct: 891  VKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEY 950

Query: 158  EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
             K  E  K  E E K+KE  E+  + ++K      +  K+  + K  KKE  +  K+   
Sbjct: 951  VKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGA 1010

Query: 218  RSSHEKEV 225
                 K++
Sbjct: 1011 LQESTKQL 1018



 Score = 31.2 bits (71), Expect = 1.3
 Identities = 26/158 (16%), Positives = 50/158 (31%), Gaps = 13/158 (8%)

Query: 105  KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            KK+    +  +  +  E++  + K D K    +K    + E +  E  K    +  E  E
Sbjct: 865  KKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLE 924

Query: 165  KKEE---------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
             K E             ++E       K+ +  K  E E K K+  ++ +   LKK    
Sbjct: 925  FKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDL-LKKSTIL 983

Query: 216  KERSSHEKEVI---PKLTFKFGTDMEEKTKRESSPKIV 250
                +     +    K   +         +     K +
Sbjct: 984  VREGNKANSELKNFKKELAELSKQYGALQESTKQLKEL 1021


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 35.6 bits (83), Expect = 0.023
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 5/77 (6%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           K  + ++ K E E+  K+ +E+KEE EK+  +     E  EK     R+EE+   +EK+ 
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK-----REEEERQIEEKRH 168

Query: 198 KDKDKKLKKEKLKKKKK 214
            D+   LKK+  + K +
Sbjct: 169 ADEIAFLKKQNQQLKSQ 185



 Score = 29.5 bits (67), Expect = 2.9
 Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 7/57 (12%)

Query: 134 KDKIKNKEKDK---EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
           + +IK  E++K   EK+  E   K E  +K E+E+++ +E +  ++    +  ++K+
Sbjct: 126 EQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE----IAFLKKQ 178


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 36.6 bits (85), Expect = 0.024
 Identities = 35/169 (20%), Positives = 66/169 (39%), Gaps = 34/169 (20%)

Query: 64  SKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK---- 119
           + +  S    E   +     GV       ++VT   D   I    +  KD  T K+    
Sbjct: 424 AADNKSLGRFELDGIPPAPRGV-----PQIEVTFDIDANGILNVTA--KDLGTGKEQSIT 476

Query: 120 -KEKEKIKKKK-DKKDKDKIKNKEKDKEKKEKEKGKKE---------------EKEKKEE 162
            K    +  ++ ++  +D   N   DK+ +E  + + E                K  +EE
Sbjct: 477 IKASSGLSDEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEE 536

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           KEK EE    ++E  E   E+I+       K ++ ++  +KL ++K ++
Sbjct: 537 KEKIEEAITDLEEALEGEKEEIKA------KIEELQEVTQKLAEKKYQQ 579


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 35.1 bits (81), Expect = 0.024
 Identities = 15/55 (27%), Positives = 26/55 (47%)

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
           E+ E+      ++E   E + +SK+K K+   +   +K+K K      K   PKL
Sbjct: 99  EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKL 153



 Score = 35.1 bits (81), Expect = 0.026
 Identities = 19/62 (30%), Positives = 32/62 (51%)

Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
               D E++E E+   +E E+++  E K E + K K +  KP  +  K + + +K K SK
Sbjct: 91  LSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150

Query: 199 DK 200
            K
Sbjct: 151 PK 152



 Score = 32.8 bits (75), Expect = 0.16
 Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 1/59 (1%)

Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
            ++E+  +EE   + E+E   E + + KE K++ V K  K EK+  K +  K K  K K
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKP-KTEKEKPKTEPKKPKPSKPK 152



 Score = 32.8 bits (75), Expect = 0.17
 Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 15/91 (16%)

Query: 85  VTPMV----------PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK 134
           V PMV          P + K+     L     ++   +++ T + ++++  + K + K+K
Sbjct: 67  VGPMVQQYGPLVRNLPAMWKIFRE--LSSSDDEEEETEEESTDETEQEDPPETKTESKEK 124

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
              K +E  K K EKEK K E K+ K  K K
Sbjct: 125 ---KKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 30.9 bits (70), Expect = 0.77
 Identities = 14/59 (23%), Positives = 30/59 (50%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
               D+ +  E++   + +++   E K + +EK+K+E  + K +++K K   K  K  K
Sbjct: 92  SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150



 Score = 28.2 bits (63), Expect = 5.2
 Identities = 15/77 (19%), Positives = 29/77 (37%), Gaps = 11/77 (14%)

Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 242
           KI +E   S+ E++  +++   + E+    + + E    +K  +PK              
Sbjct: 86  KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK-----------PKT 134

Query: 243 RESSPKIVIKPVKSPSP 259
            +  PK   K  K   P
Sbjct: 135 EKEKPKTEPKKPKPSKP 151


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 35.6 bits (83), Expect = 0.026
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           KD +   KKE   I K  +K D   I    + +++K+K   K+E+K  KEEK+K EE 
Sbjct: 66  KDFRKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 35.2 bits (81), Expect = 0.029
 Identities = 27/81 (33%), Positives = 46/81 (56%)

Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
           KK ++  +  KI + +K+   KE E  K E++E  EE+E+++E+E +  ED E   E + 
Sbjct: 17  KKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVE 76

Query: 186 KEEKDSEKEKKSKDKDKKLKK 206
            EE++ E E+ + D     KK
Sbjct: 77  DEEEEEEDEEDNVDLKDIEKK 97



 Score = 34.8 bits (80), Expect = 0.041
 Identities = 24/89 (26%), Positives = 48/89 (53%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           KE E +K +K + D+++ +  E++ E+ E  + ++E  E +EE+E+ EE    +K+ ++K
Sbjct: 38  KENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97

Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
            +  I    +D   +       KK +K K
Sbjct: 98  NINDIFNSTQDDNAQNLISKNYKKNEKSK 126



 Score = 33.7 bits (77), Expect = 0.076
 Identities = 26/98 (26%), Positives = 52/98 (53%), Gaps = 1/98 (1%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
            +KE I K+ +    +K ++ E+++E+ E+E  + E+ E +EE  + EE+E + +ED   
Sbjct: 31  WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVD 90

Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            ++ I K+  +        D  + L  +  KK +K K+
Sbjct: 91  -LKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKK 127



 Score = 32.5 bits (74), Expect = 0.24
 Identities = 23/98 (23%), Positives = 51/98 (52%), Gaps = 6/98 (6%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
            DK     +    K EK++  ++++++D+++I+  E  ++++E  + ++EE+E +E+   
Sbjct: 31  WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVD 90

Query: 166 KEEKEFKMKED------KEKPVEKIRKEEKDSEKEKKS 197
            ++ E K   D       +     I K  K +EK KK+
Sbjct: 91  LKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKT 128



 Score = 31.4 bits (71), Expect = 0.51
 Identities = 31/104 (29%), Positives = 57/104 (54%), Gaps = 5/104 (4%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKE-----KGKKEEKEKKEEKEKKEEKEFKMKE 175
           EK+  ++KKD+   + +K    DKE   KE       K+E+ E++EE++++E +E +  E
Sbjct: 10  EKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIE 69

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           D+E+ VE   +EE+D E     KD +KK   +     + +  ++
Sbjct: 70  DEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQN 113


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 36.4 bits (84), Expect = 0.030
 Identities = 29/105 (27%), Positives = 46/105 (43%), Gaps = 15/105 (14%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
            D+ DK +    E+ +   EK+   K++K+K+EE++  EE +  +K D E  VE+I    
Sbjct: 2   VDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEE-DAMLKGDLELLVERI---- 56

Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
                    +D D  L+   L    KE  +SS           KF
Sbjct: 57  ---------QDPDIDLQNNSL-NMLKEVIKSSTSSMTAVPKPLKF 91


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 36.2 bits (84), Expect = 0.031
 Identities = 41/162 (25%), Positives = 75/162 (46%), Gaps = 13/162 (8%)

Query: 55  KTSPKEKMSSKEKTSPKEKEYSKVKDVEL-GVTPMVPNIVKVTSSEDLGEIKKDKSIKKD 113
            TS     +  E  +       + K  EL  V   +P+   +   EDL  ++  + I+K 
Sbjct: 113 STSGAVPETKDETPASAPSSIKEQKSSELKKVDSQLPDPRALLKGEDLKTVETAEEIEKS 172

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
            K  ++K K   K+KKD  +K  ++      +K+ +   KK+ K   E ++ KEE + K+
Sbjct: 173 LKAMQQKWK---KRKKDLPNKQDLEE----YKKRLEAIKKKDIKNPLELQKIKEEFD-KL 224

Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKD----KKLKKEKLKK 211
           K++ +   +KI+  + D + +KK    D    KK  +  LK+
Sbjct: 225 KKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKAPQNDLKR 266



 Score = 32.8 bits (75), Expect = 0.39
 Identities = 32/138 (23%), Positives = 63/138 (45%), Gaps = 10/138 (7%)

Query: 89  VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
           VP     T +     IK+ KS +  K   +  +   + K +D K  +  +  EK    K 
Sbjct: 118 VPETKDETPASAPSSIKEQKSSELKKVDSQLPDPRALLKGEDLKTVETAEEIEKSL--KA 175

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE- 207
            ++  K+ K+    K+  EE + +++  K+K +       K+  + +K K++  KLKKE 
Sbjct: 176 MQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDI-------KNPLELQKIKEEFDKLKKEG 228

Query: 208 KLKKKKKEKERSSHEKEV 225
           K  K+K +  ++  + + 
Sbjct: 229 KADKQKIKSAKNDLQNDK 246


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 36.4 bits (85), Expect = 0.032
 Identities = 35/195 (17%), Positives = 77/195 (39%), Gaps = 52/195 (26%)

Query: 63  SSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEK 122
             K     +EK+   VK +E         ++K    E+L +I+K+    +++ +  + E 
Sbjct: 66  LPKLNPLREEKKKVSVKSLE--------ELIKD-VEEELEKIEKEIKELEEEISELENEI 116

Query: 123 EKIKKKKD----------------------------KKDKDKIKNKEKDKEKKEKEKGKK 154
           ++++++ +                             +DK +    E D E  E     K
Sbjct: 117 KELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDK 176

Query: 155 E--------EKEKKEEKEKK-EEKEFKMKE--DKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
                     KE  +E E++ ++  F+  E  ++  P E IR+     ++E +  +K+++
Sbjct: 177 GYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIRE----IKEELEEIEKERE 232

Query: 204 LKKEKLKKKKKEKER 218
              E+LK+  K+   
Sbjct: 233 SLLEELKELAKKYLE 247



 Score = 31.4 bits (72), Expect = 1.1
 Identities = 23/132 (17%), Positives = 58/132 (43%), Gaps = 10/132 (7%)

Query: 58  PKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTH 117
           P++K+   +  S  E     +   +  V  +V  +V    S+++ E  K    ++ +   
Sbjct: 153 PEDKLEELKLESDVENVE-YISTDKGYVYVVV--VVLKELSDEVEEELKKLGFERLELEE 209

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           +    E I++ K++        +E +KE++   +  KE  +K  E+     +  +++ ++
Sbjct: 210 EGTPSELIREIKEEL-------EEIEKERESLLEELKELAKKYLEELLALYEYLEIELER 262

Query: 178 EKPVEKIRKEEK 189
            + + K  K +K
Sbjct: 263 AEALSKFLKTDK 274



 Score = 29.9 bits (68), Expect = 3.5
 Identities = 33/183 (18%), Positives = 69/183 (37%), Gaps = 33/183 (18%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
              L ++ +     +         +E+ KK   K  ++ IK+ E++ EK EKE  + EE+
Sbjct: 49  RSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEE 108

Query: 158 ------EKKEEKEKKEE----KEFKMKEDKEKP-------VEKIRKEEKDSEKEKKSKDK 200
                 E KE +++ E       F +              V  + +++ +  K +   + 
Sbjct: 109 ISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVEN 168

Query: 201 DKKLKKEKLK-------KKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKP 253
            + +  +K          K+   E      E + KL F+       + + E +P  +I+ 
Sbjct: 169 VEYISTDKGYVYVVVVVLKELSDEVE----EELKKLGFE-----RLELEEEGTPSELIRE 219

Query: 254 VKS 256
           +K 
Sbjct: 220 IKE 222


>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 486

 Score = 36.3 bits (84), Expect = 0.032
 Identities = 29/120 (24%), Positives = 55/120 (45%), Gaps = 3/120 (2%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK--EKKEEKEFKMKEDKEKPVEKIR 185
           K+  K + K + ++K++EK+E +  K   +  K E        K  ++KE++ K   K+ 
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424

Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
           K E+D+   +    K K     + ++ KKEKE+     E + K+   F        K++ 
Sbjct: 425 KSEEDTLDLEIKSIK-KYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITYKKKK 483



 Score = 35.6 bits (82), Expect = 0.046
 Identities = 33/127 (25%), Positives = 63/127 (49%), Gaps = 5/127 (3%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK- 159
           + E+ K+ S ++ K+  +KKE+EK +   DK     IKN+ K      K    KEE+ K 
Sbjct: 360 ISELLKEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKI 419

Query: 160 --KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK--DKKLKKEKLKKKKKE 215
             K EK +++  + ++K  K+       +E K  ++++K + +  +K L+  K  K    
Sbjct: 420 TIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITY 479

Query: 216 KERSSHE 222
           K++   +
Sbjct: 480 KKKKGKD 486


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 35.9 bits (83), Expect = 0.036
 Identities = 24/138 (17%), Positives = 53/138 (38%), Gaps = 2/138 (1%)

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           +K+E+E++EE+++   K      + E+   +   E+ + ++E        K       K 
Sbjct: 180 EKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKP 239

Query: 213 KKEKERSSHEKEVIPKLT-FKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTG 271
               ++S+ ++   P  +  K  +    K +   S  +V K   +P     + +    T 
Sbjct: 240 TSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKK-AAPESTSQSPSSAEPTS 298

Query: 272 DSKQVWICPACGVQDDGS 289
           +S Q     +     D S
Sbjct: 299 ESPQTAGNSSLSSLGDYS 316


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 35.8 bits (83), Expect = 0.040
 Identities = 20/100 (20%), Positives = 43/100 (43%), Gaps = 6/100 (6%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK---GKKEEKEKKEEKEKKEEKEFKMKEDK 177
           E  K + ++  K KD      KD  +K++E+    KK E+      E+K ++     E+ 
Sbjct: 60  ELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEI 119

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
               E +R E+    + ++   +   L K++++   +   
Sbjct: 120 ---AEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNL 156



 Score = 29.6 bits (67), Expect = 3.3
 Identities = 23/141 (16%), Positives = 59/141 (41%), Gaps = 3/141 (2%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK-EKKEKE 150
            V+VT    +    +    ++  K  K  E + +K   +KK +++   K+ ++      E
Sbjct: 48  AVEVTKLAAVDSELRKFQTREAAKV-KDAEAQLLKDPAEKKREERKAAKKIERSTPSLIE 106

Query: 151 KGKKEEKEKKEE-KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
           +  +E K+  EE  E   +++  ++E +E P E +   +++ +   +    ++ L+    
Sbjct: 107 RKTQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNLINELLRAGLE 166

Query: 210 KKKKKEKERSSHEKEVIPKLT 230
            +    K+ S  +     +  
Sbjct: 167 LEILDMKDESYEDLRYFSEKL 187


>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
           region.  Members of this family have very varied
           localisations within the eukaryotic cell. pinin is known
           to localise at the desmosomes and is implicated in
           anchoring intermediate filaments to the desmosomal
           plaque. SDK2/3 is a dynamically localised nuclear
           protein thought to be involved in modulation of
           alternative pre-mRNA splicing. memA is a tumour marker
           preferentially expressed in human melanoma cell lines. A
           common feature of the members of this family is that
           they may all participate in regulating protein-protein
           interactions.
          Length = 131

 Score = 34.0 bits (78), Expect = 0.040
 Identities = 16/77 (20%), Positives = 46/77 (59%)

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
           +K  +EE     +++++ E E K++E +++  E++RKE+++  +E++ K  + +  ++K+
Sbjct: 21  QKFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80

Query: 210 KKKKKEKERSSHEKEVI 226
           + +K ++    H   + 
Sbjct: 81  EDEKLQETWHEHNLALA 97



 Score = 32.1 bits (73), Expect = 0.21
 Identities = 21/85 (24%), Positives = 45/85 (52%), Gaps = 1/85 (1%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           K  +++ +   K+K + E E+ K EE+EK+E +E ++EK    +E + K +E  + E+K 
Sbjct: 22  KFSQEESRLTSKEKRRAEIEQ-KLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKE 215
            +++ +    +  L      + K +
Sbjct: 81  EDEKLQETWHEHNLALANFIRTKTK 105


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 34.8 bits (80), Expect = 0.040
 Identities = 25/89 (28%), Positives = 45/89 (50%), Gaps = 7/89 (7%)

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK-------IRKEEKDSEKE 194
           K  ++KE+EK   +E  K +E+E+K E E   KE +    E+        ++E    E++
Sbjct: 34  KKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEK 93

Query: 195 KKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           ++  +  K+ +    K+KK  KE +  EK
Sbjct: 94  EEEAEDVKQQEVFSFKRKKPFKEMNLEEK 122



 Score = 33.3 bits (76), Expect = 0.13
 Identities = 28/93 (30%), Positives = 55/93 (59%), Gaps = 1/93 (1%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           +   +++K     K  EKE+ K+  D+  K K + ++ + E+ EKEK +  E E+++  E
Sbjct: 22  EAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEK-EAGEPEREDIAE 80

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           ++E++E   +E+KE+  E ++++E  S K KK 
Sbjct: 81  QQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKP 113


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 34.4 bits (80), Expect = 0.045
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           DK ++K  EK ++K EK K K E+K+ K+E+ K 
Sbjct: 64  DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 33.7 bits (78), Expect = 0.074
 Identities = 13/33 (39%), Positives = 21/33 (63%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           +KE    E+ EKK EK K + ++ K K+++ KP
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97



 Score = 32.5 bits (75), Expect = 0.18
 Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 2/35 (5%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 147
           DKK  K  EK   +KK +K      K K K +E K
Sbjct: 64  DKKELKAWEKA--EKKAEKAKAKAEKKKAKKEEPK 96



 Score = 29.4 bits (67), Expect = 1.9
 Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 4/66 (6%)

Query: 125 IKKKKDKKDKDKIKN-KEKDKEKKEKEKGK---KEEKEKKEEKEKKEEKEFKMKEDKEKP 180
            +KKK KK + +I +  E+ K+ KE  +     K+E +  E+ EKK EK     E K+  
Sbjct: 32  AQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91

Query: 181 VEKIRK 186
            E+ + 
Sbjct: 92  KEEPKP 97



 Score = 28.6 bits (65), Expect = 3.3
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKI 137
           K +KK  K K K + KK K ++ K ++
Sbjct: 73  KAEKKAEKAKAKAEKKKAKKEEPKPRL 99



 Score = 28.3 bits (64), Expect = 5.3
 Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 1/34 (2%)

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
            +KE K+ +K +K K EK K K ++K+    E +
Sbjct: 64  DKKELKAWEKAEK-KAEKAKAKAEKKKAKKEEPK 96



 Score = 28.3 bits (64), Expect = 5.4
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK 140
            K +K  +K  KK EK K K +K K  K++ K +
Sbjct: 65  KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98



 Score = 27.5 bits (62), Expect = 9.7
 Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 11/71 (15%)

Query: 149 KEKGKKEEKE----KKEEKEKKEE-KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           K+KGKK E E     +E K+ KE  +   + + + K      K EK   K +K+K K +K
Sbjct: 34  KKKGKKGELEITDLNEEYKDLKESLEAALLDKKELK---AWEKAEK---KAEKAKAKAEK 87

Query: 204 LKKEKLKKKKK 214
            K +K + K +
Sbjct: 88  KKAKKEEPKPR 98


>gnl|CDD|235944 PRK07135, dnaE, DNA polymerase III DnaE; Validated.
          Length = 973

 Score = 35.8 bits (83), Expect = 0.046
 Identities = 39/179 (21%), Positives = 67/179 (37%), Gaps = 24/179 (13%)

Query: 75  YSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK------KTHKKKEKEKIKKK 128
             K K+ E+ +  +  + + +      G   K+K   + K         K +    ++++
Sbjct: 92  SKKSKNKEIELNDLDSDNIIIIDHPKNGFYAKNKEQLELKNYYINSNDPKIENAVYVQER 151

Query: 129 K--DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK--------------EFK 172
           K    +D + +K   K    KE+    K   + ++  E  +EK              EF 
Sbjct: 152 KLLFAEDNEYLKILNKIGNNKEENSNFKF-FDFEKWFEDIDEKILKRTNYLVENINIEFP 210

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
            KE      +     E D   +K  K+     KK +LK     KER ++E  VI KL F
Sbjct: 211 KKEFNLPDFDNNLGLESDLFLKKILKESVIN-KKAELKYYPNVKERINYEYSVIKKLKF 268


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 35.0 bits (81), Expect = 0.049
 Identities = 20/78 (25%), Positives = 46/78 (58%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           ++  ++K     ++D++  ++K K   K K  K  K KI+ KEK K +KE++  ++ E++
Sbjct: 28  DEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEED 87

Query: 159 KKEEKEKKEEKEFKMKED 176
             E++  ++ +  K++E+
Sbjct: 88  TPEDELAEKLRLRKLQEE 105



 Score = 34.6 bits (80), Expect = 0.076
 Identities = 24/67 (35%), Positives = 37/67 (55%), Gaps = 2/67 (2%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
           E +++++EK K   + K KK  K K EEKE   +E +EK +     EE   E E   K +
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLR--ELEEDTPEDELAEKLR 98

Query: 201 DKKLKKE 207
            +KL++E
Sbjct: 99  LRKLQEE 105



 Score = 33.1 bits (76), Expect = 0.22
 Identities = 19/67 (28%), Positives = 41/67 (61%), Gaps = 2/67 (2%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK--IR 185
           + ++K+++K K   K K KK  +   +E+++ K EKE+K  +E +    +++  EK  +R
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100

Query: 186 KEEKDSE 192
           K +++S+
Sbjct: 101 KLQEESD 107



 Score = 30.8 bits (70), Expect = 1.1
 Identities = 19/70 (27%), Positives = 40/70 (57%), Gaps = 2/70 (2%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           KD  D+++ + KE++K K   +   K+  + K E+++K ++E + K  +E   E+   E+
Sbjct: 34  KDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL--EEDTPED 91

Query: 189 KDSEKEKKSK 198
           + +EK +  K
Sbjct: 92  ELAEKLRLRK 101



 Score = 30.4 bits (69), Expect = 1.4
 Identities = 21/65 (32%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKK 212
           E++EK+EEK K   K    K  K K  EK + + +  EK  +  ++D    +  EKL+ +
Sbjct: 41  EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100

Query: 213 KKEKE 217
           K ++E
Sbjct: 101 KLQEE 105



 Score = 28.1 bits (63), Expect = 8.9
 Identities = 13/48 (27%), Positives = 26/48 (54%)

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           +++E   ++  K +  ++ + K   K K  +KEK K++K+EK     E
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85


>gnl|CDD|222406 pfam13831, PHD_2, PHD-finger.  PHD folds into an interleaved type
           of Zn-finger chelating 2 Zn ions in a similar manner to
           that of the RING and FYVE domains. Several PHD fingers
           have been identified as binding modules of methylated
           histone H3.
          Length = 33

 Score = 31.5 bits (72), Expect = 0.054
 Identities = 11/33 (33%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query: 291 PMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKC 323
           P++ C  C    H  C G V E     W C +C
Sbjct: 2   PLLVCSKCSVAVHASCYG-VPELSDGPWLCRRC 33


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 35.5 bits (82), Expect = 0.056
 Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 9/84 (10%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKK-EEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
           KK   + K ++ ++ K+E +KG K +K+   ++K KK + +  +             ++K
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVST------STAASQKK 237

Query: 190 DSEKEKKSKDKDKKL--KKEKLKK 211
            S+ E K + + K+L   K+ LKK
Sbjct: 238 SSDLESKLEAQSKELWSLKDDLKK 261



 Score = 34.4 bits (79), Expect = 0.14
 Identities = 22/81 (27%), Positives = 40/81 (49%), Gaps = 3/81 (3%)

Query: 104 IKKDKSIKK--DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
           +KK  S  K    +  K++ K+  K+KKD    DK K  + D++        +++    E
Sbjct: 183 VKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLE 242

Query: 162 EKEKKEEKE-FKMKEDKEKPV 181
            K + + KE + +K+D +K V
Sbjct: 243 SKLEAQSKELWSLKDDLKKHV 263



 Score = 30.9 bits (70), Expect = 1.6
 Identities = 19/70 (27%), Positives = 35/70 (50%), Gaps = 2/70 (2%)

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
           KK   + K+E+ E+++++ KK  K  K     +K  +K + +   S     S+ K   L+
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKS-KKAKTDRDVSTSTAASQKKSSDLE 242

Query: 206 KEKLKKKKKE 215
             KL+ + KE
Sbjct: 243 S-KLEAQSKE 251


>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 535

 Score = 35.4 bits (82), Expect = 0.057
 Identities = 16/83 (19%), Positives = 37/83 (44%), Gaps = 3/83 (3%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           K   IK  K    K  +  ++  K+ K  ++ K  + +   +  E+  K++ E+ +E + 
Sbjct: 446 KGAKIKIQK--ALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQL 503

Query: 166 KEEKEFKMKEDKEKP-VEKIRKE 187
           KE  E +      +  +E+ ++ 
Sbjct: 504 KEATELQEFMINNEDLIEEAKEL 526



 Score = 35.0 bits (81), Expect = 0.074
 Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 4/87 (4%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
           +  K K ++  K  E   +  KEFK   +K+K   +   E  + E +K  ++  +     
Sbjct: 446 KGAKIKIQKALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQET---- 501

Query: 208 KLKKKKKEKERSSHEKEVIPKLTFKFG 234
           +LK+  + +E   + +++I +    FG
Sbjct: 502 QLKEATELQEFMINNEDLIEEAKELFG 528



 Score = 30.0 bits (68), Expect = 3.2
 Identities = 21/97 (21%), Positives = 42/97 (43%), Gaps = 11/97 (11%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           +  KIK +K  K         ++  +  KE     EK+K + +   E  E + K+D ++ 
Sbjct: 446 KGAKIKIQKALK-------SAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKD-DEE 497

Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           V++ + +E    +E    ++D     E+ K+    K 
Sbjct: 498 VQETQLKEATELQEFMINNEDLI---EEAKELFGIKS 531


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 35.4 bits (81), Expect = 0.058
 Identities = 30/133 (22%), Positives = 59/133 (44%), Gaps = 12/133 (9%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE---------KDKEK 146
            +S    E  +  S +      +    E +  ++D  D ++++ ++         K  E 
Sbjct: 459 RASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSET 518

Query: 147 KEKEKG-KKEEKEKKEEKEKKEEKEFK--MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
            E +K   K + +K++  E++EEK+ K  M  +K+K + K  K     ++E+    K KK
Sbjct: 519 SEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKK 578

Query: 204 LKKEKLKKKKKEK 216
            +  K KK   +K
Sbjct: 579 KQIAKQKKLDSKK 591


>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
           N-terminal.  SRP is a complex of six distinct
           polypeptides and a 7S RNA that is essential for
           transferring nascent polypeptide chains that are
           destined for export from the cell to the translocation
           apparatus of the endoplasmic reticulum (ER) membrane.
           SRP binds hydrophobic signal sequences as they emerge
           from the ribosome, and arrests translation.
          Length = 272

 Score = 35.1 bits (81), Expect = 0.059
 Identities = 23/116 (19%), Positives = 39/116 (33%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           E +  K  K  K     +E +K KK  D   + K K     +++++K +      E    
Sbjct: 98  EKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPS 157

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
                 +       K K     R ++        S   +K  K +   KK  +K R
Sbjct: 158 STPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMR 213


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 35.5 bits (82), Expect = 0.060
 Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 5/150 (3%)

Query: 73  KEYSKVKDVELGVTPMVPNIVKVTSSED-LGEIKKDKSIKKDKKTHKKKE----KEKIKK 127
           K Y K K V   V      I  +   ED L E+  DK  KK    +  K     K+K+KK
Sbjct: 60  KAYQKAKRVSENVKTPRFYIKTLVMLEDFLNELWADKEAKKKMSKNNAKALNTLKQKVKK 119

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
              + + D  + +E  + + E+E+  +++ +   + E ++E      E+     E     
Sbjct: 120 NNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR 179

Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            K+ ++E +  D  KK   E+ K  KK +E
Sbjct: 180 VKEDDEEDEDADLSKKDVLEEPKMFKKPEE 209


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 35.2 bits (81), Expect = 0.061
 Identities = 28/104 (26%), Positives = 51/104 (49%), Gaps = 2/104 (1%)

Query: 111 KKDKKTHKKKEKE-KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
            + K   K  E   KI KK D K+  +  N + + E K++E       EKK +  K +E+
Sbjct: 34  PEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEE 93

Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKK 212
             K K++ +KP +K + ++ ++    +  + K      +KLKK+
Sbjct: 94  IEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDKLKKE 137


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 35.3 bits (82), Expect = 0.062
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 166 KEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
           KEE   +++E KEK P    +K E+   +++K K K KK  K++ KK +K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 34.2 bits (79), Expect = 0.12
 Identities = 16/63 (25%), Positives = 37/63 (58%)

Query: 93  VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
           V   S   +G+  K++  K+ ++  +K  K   KK+++KK + + K K++ K+ K+++K 
Sbjct: 352 VDYFSGRYIGDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411

Query: 153 KKE 155
            ++
Sbjct: 412 GRK 414



 Score = 33.0 bits (76), Expect = 0.33
 Identities = 16/50 (32%), Positives = 34/50 (68%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           K++  ++I++ K+K  K   K +E+ K +K K+K K+++K KK +K+ ++
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 33.0 bits (76), Expect = 0.35
 Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 172 KMKEDKEKPVEKIR----KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           ++KE+  K +E+I+    K  K   +EKK + + KK K++K  KK+K+K R 
Sbjct: 363 ELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414



 Score = 31.5 bits (72), Expect = 0.81
 Identities = 15/48 (31%), Positives = 35/48 (72%), Gaps = 1/48 (2%)

Query: 123 EKIKKKKDKKDKD-KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           +++K++ +K+ ++ K K  +  K+K+E++K +K +K+KK +K+ K+ K
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409



 Score = 28.4 bits (64), Expect = 8.4
 Identities = 17/50 (34%), Positives = 36/50 (72%), Gaps = 1/50 (2%)

Query: 122 KEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           KE++ K+ ++ K+K     K+K +EKK +++ KK++++KK +K KK+ ++
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 35.2 bits (82), Expect = 0.068
 Identities = 21/87 (24%), Positives = 39/87 (44%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
               D    ++      E E+   +E E++EE E  +       E  EK +EK +   K 
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQ 230

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            +K +K+++K  + +  + KK  K +E
Sbjct: 231 YKKLRKAQEKKVEGRLAQHKKYAKLRE 257



 Score = 28.2 bits (64), Expect = 9.8
 Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 5/59 (8%)

Query: 121 EKEKIKKKKDKKDKDKI----KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           E +      + +  +K+    K   K  +K  K + KK E    + K+  + +E K+KE
Sbjct: 204 ENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKLRE-KLKE 261


>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
           (Spore_III_AF).  This family represents the stage III
           sporulation protein AF (Spore_III_AF) of the bacterial
           endospore formation program, which exists in some but
           not all members of the Firmicutes (formerly called
           low-GC Gram-positives). The C-terminal region of these
           proteins is poorly conserved.
          Length = 185

 Score = 34.2 bits (79), Expect = 0.068
 Identities = 23/107 (21%), Positives = 51/107 (47%), Gaps = 9/107 (8%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
            E   I + + +  K +I+  ++    +E  ++ + + E+K K+E   K ++ E ++ ED
Sbjct: 55  SESSLIIENEIESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKVKDVEVEIDED 114

Query: 177 KEKPVEKIR------KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            E     I+      KEE   +++ K +      +  K K++++E E
Sbjct: 115 LESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESE 161



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 21/104 (20%), Positives = 48/104 (46%), Gaps = 10/104 (9%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKD--KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE- 175
           + +K++I+  +     ++   + +   ++K ++E G K +  + E  E  E   F +KE 
Sbjct: 66  ESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKVKDVEVEIDEDLESNNFDIKEV 125

Query: 176 -------DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
                   KEK   K+     D++  K  +++++  + EK+K  
Sbjct: 126 NVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESEEAEKIKNF 169


>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 193

 Score = 34.2 bits (79), Expect = 0.070
 Identities = 21/83 (25%), Positives = 48/83 (57%), Gaps = 2/83 (2%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           + K ++ + +E E+ +K E+E+ EE+E +EE E + ++ +   +E   +E KD     ++
Sbjct: 5   EQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQA 64

Query: 198 K--DKDKKLKKEKLKKKKKEKER 218
           +  +  K+ ++E+ + KK   E+
Sbjct: 65  EFENLRKRTEREREEAKKYAIEK 87



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 18/83 (21%), Positives = 48/83 (57%), Gaps = 9/83 (10%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK----KEEKEKKEEKEFKMK 174
           K++K +    ++ ++ +K + +E ++E+ E+E   +EE+++    + + E+ ++K  + +
Sbjct: 4   KEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQ 63

Query: 175 EDKEKPVEKIRKE-EKDSEKEKK 196
            +     E +RK  E++ E+ KK
Sbjct: 64  AE----FENLRKRTEREREEAKK 82



 Score = 28.8 bits (65), Expect = 3.7
 Identities = 14/83 (16%), Positives = 48/83 (57%), Gaps = 2/83 (2%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
             DK  K ++   ++ E+ +  ++++ ++++  +  E ++E++E  + + + +E  ++K 
Sbjct: 1   MSDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEE-LKDKY 59

Query: 165 KKEEKEFK-MKEDKEKPVEKIRK 186
            + + EF+ +++  E+  E+ +K
Sbjct: 60  LRAQAEFENLRKRTEREREEAKK 82


>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein.  This family consists of several
           eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
           expression of functional F1-ATPase requires two proteins
           encoded by the ATP11 and ATP12 genes. Atp11p is a
           molecular chaperone of the mitochondrial matrix that
           participates in the biogenesis pathway to form F1, the
           catalytic unit of the ATP synthase.
          Length = 250

 Score = 34.6 bits (80), Expect = 0.073
 Identities = 30/113 (26%), Positives = 49/113 (43%), Gaps = 12/113 (10%)

Query: 132 KDKDKIKNKEKDKEKK---EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           K + K+  K K+   +     +K  K++ EKKE   KK       +  K K ++  +  +
Sbjct: 3   KYRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLKPPKPAD 62

Query: 189 KDSEKEKKSKDKDKKLKKEKLKKK-KKEKE---RSSHEKE-----VIPKLTFK 232
           K    +K  K     L  EK+K    +E E   R+ H K+     VIP  T++
Sbjct: 63  KKKPFDKPFKPLSSILDVEKIKDLSAEEIEFIWRAYHAKKDSLCAVIPAETYE 115


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 34.9 bits (80), Expect = 0.075
 Identities = 15/94 (15%), Positives = 44/94 (46%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           EI   +++  +    KK+E+  + + ++ + +D   + E  + ++EKE+ + EE +   E
Sbjct: 61  EISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLRE 120

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
               +    +++++ +    +        +K +K
Sbjct: 121 YNLFDRNNLQLEDNLQSLELQYEYSLNQLDKLRK 154



 Score = 30.6 bits (69), Expect = 1.5
 Identities = 17/97 (17%), Positives = 48/97 (49%), Gaps = 3/97 (3%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           +KE++  K+   K +   ++    +    E ++ KKEE+   +E E+ E+++  +  +  
Sbjct: 42  EKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELV 101

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKK 214
           +  E+  KE+ ++E+ +  ++ +       +L+   +
Sbjct: 102 ELQEE--KEQLENEELQYLREYNLFDRNNLQLEDNLQ 136



 Score = 29.1 bits (65), Expect = 4.7
 Identities = 19/86 (22%), Positives = 40/86 (46%)

Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
           + E    +KE++  K+   + + +  +    E    E  E   E+ R  ++  E EK+  
Sbjct: 35  DSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDD 94

Query: 199 DKDKKLKKEKLKKKKKEKERSSHEKE 224
           D D +L + + +K++ E E   + +E
Sbjct: 95  DLDGELVELQEEKEQLENEELQYLRE 120


>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
          Length = 813

 Score = 35.1 bits (81), Expect = 0.076
 Identities = 23/84 (27%), Positives = 37/84 (44%), Gaps = 6/84 (7%)

Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK- 198
             ++  K  +EK KK +  KK  K ++  K+   + D     EK  K +   +  +K+K 
Sbjct: 730 APRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK 789

Query: 199 --DKDKKLK---KEKLKKKKKEKE 217
              K +K+    K K   KKK  E
Sbjct: 790 PSAKTQKIAAATKAKRAAKKKVAE 813



 Score = 32.0 bits (73), Expect = 0.72
 Identities = 11/44 (25%), Positives = 18/44 (40%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           + EK+   K   K  +   K   K ++     K K+  K+K  E
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813



 Score = 30.9 bits (70), Expect = 1.5
 Identities = 20/87 (22%), Positives = 33/87 (37%), Gaps = 8/87 (9%)

Query: 133 DKDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
           D+ KI       E+  +  GK   EK KK +  KK  K       + +  +K+  E   +
Sbjct: 716 DERKIDFSLISSERAPRNVGKTAREKAKKGDAGKKGGK-------RRQVGKKVNFEPDSA 768

Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKER 218
            + +K        K  +  KK   K +
Sbjct: 769 FRGEKKAKPKAAKKDARKAKKPSAKTQ 795



 Score = 30.5 bits (69), Expect = 2.4
 Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 9/102 (8%)

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
           D   I  +++ +   KT ++K K+    KK  K +   K       K   E       EK
Sbjct: 721 DFSLISSERAPRNVGKTAREKAKKGDAGKKGGKRRQVGK-------KVNFEPDSAFRGEK 773

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
           K + +  ++   K K+   K  +KI    K +++  K K  +
Sbjct: 774 KAKPKAAKKDARKAKKPSAKT-QKIAAATK-AKRAAKKKVAE 813


>gnl|CDD|153365 cd07681, F-BAR_PACSIN3, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons 3 (PACSIN3).  F-BAR
           domains are dimerization modules that bind and bend
           membranes and are found in proteins involved in membrane
           dynamics and actin reorganization. Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins,
           also called Synaptic dynamin-associated proteins
           (Syndapins), act as regulators of cytoskeletal and
           membrane dynamics. Vetebrates harbor three isoforms with
           distinct expression patterns and specific functions.
           PACSIN 3 or Syndapin III is expressed ubiquitously and
           regulates glucose uptake in adipocytes through its role
           in GLUT1 trafficking. It also modulates the subcellular
           localization and stimulus-specific function of the
           cation channel TRPV4. PACSIN 3 contains an N-terminal
           F-BAR domain and a C-terminal SH3 domain. F-BAR domains
           form banana-shaped dimers with a positively-charged
           concave surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 258

 Score = 34.5 bits (79), Expect = 0.076
 Identities = 24/96 (25%), Positives = 49/96 (51%), Gaps = 3/96 (3%)

Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE---KEFKMKEDKEKPVEKIRKEE 188
           +D +K++  +K+   K+   G +E KE +E   K ++   K+ K  E  +K     RK+E
Sbjct: 90  EDSEKVRAWQKEAFHKQMIGGFRESKEAEEGFRKAQKPWVKKLKEVESSKKGYHAARKDE 149

Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           + ++  +     D  + +E+L+K +   E+ + E E
Sbjct: 150 RTAQTRETHAKADSTVSQEQLRKLQDRVEKCTQEAE 185


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 34.3 bits (79), Expect = 0.081
 Identities = 18/71 (25%), Positives = 37/71 (52%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
             +  +KK  KK    K K+K     +++    EK    ++K K  + E   +E+++ +E
Sbjct: 118 SVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE 177

Query: 168 EKEFKMKEDKE 178
           E+E + +ED++
Sbjct: 178 EEEEEEEEDED 188



 Score = 32.4 bits (74), Expect = 0.32
 Identities = 18/70 (25%), Positives = 37/70 (52%), Gaps = 2/70 (2%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK--EKGKKEEKEKKEEKEKKEE 168
           K  KK    K K K+    ++++    K    +K+ KE   E   +E+++ +EE+E++EE
Sbjct: 125 KAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEE 184

Query: 169 KEFKMKEDKE 178
           ++    +D +
Sbjct: 185 EDEDFDDDDD 194



 Score = 29.0 bits (65), Expect = 3.9
 Identities = 20/67 (29%), Positives = 32/67 (47%)

Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
             KK  K     K K K     E+E+   E+    E+K K+ E E   +ED++   E+  
Sbjct: 122 INKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEE 181

Query: 186 KEEKDSE 192
           +EE+D +
Sbjct: 182 EEEEDED 188



 Score = 28.6 bits (64), Expect = 5.1
 Identities = 26/99 (26%), Positives = 50/99 (50%), Gaps = 4/99 (4%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           DK  KK K     KE     +   K+    +   +K  +K    K K++     EE+E  
Sbjct: 90  DKYQKKRKIGPSIKEHPFDLELFPKELYSVMGINKKAGKKLALSKFKRKVGLFTEEEEDI 149

Query: 167 EEK----EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
           +EK    E K+KE + + V++  +++++ E+E++ +D+D
Sbjct: 150 DEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDED 188


>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
          Length = 158

 Score = 33.5 bits (77), Expect = 0.082
 Identities = 25/104 (24%), Positives = 49/104 (47%), Gaps = 5/104 (4%)

Query: 108 KSIKKDKKTHKKK---EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           + IK++   +KK+     E +KKK   ++ D +K   +++ K  ++  +  +K  +E+  
Sbjct: 57  EEIKEEALKYKKEFESAVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTI 116

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
                E    E K    ++   +E   E +  +K+K KK  KEK
Sbjct: 117 PNHLNEEVSNETKLT--QESSSDESPKEVKLATKNKTKKHDKEK 158



 Score = 32.0 bits (73), Expect = 0.33
 Identities = 17/82 (20%), Positives = 36/82 (43%), Gaps = 2/82 (2%)

Query: 91  NIVKVTSSEDLGEIKKD--KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
           ++ K    E+L ++K      IK  +   +  +K   +        +++ N+ K  ++  
Sbjct: 76  SLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESS 135

Query: 149 KEKGKKEEKEKKEEKEKKEEKE 170
            ++  KE K   + K KK +KE
Sbjct: 136 SDESPKEVKLATKNKTKKHDKE 157


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 33.7 bits (78), Expect = 0.083
 Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 2/64 (3%)

Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
           I   K  K  D    ++K K+KK+KE    EEK+KK      EE++   K      +  I
Sbjct: 68  ISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKK--YVVSEERKEDQKAVDAALLAAI 125

Query: 185 RKEE 188
           +K  
Sbjct: 126 KKVP 129



 Score = 29.8 bits (68), Expect = 1.7
 Identities = 16/48 (33%), Positives = 27/48 (56%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           I   K  K +    K+K+K+K KKK+ +  ++K K     +E+KE +K
Sbjct: 68  ISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQK 115


>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3.  This family of
           proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). SH3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of SH3, AAPs are retained in the ER.
          Length = 196

 Score = 34.2 bits (79), Expect = 0.083
 Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           KD K  ++   +E +     KE+ +  E EKKE K+KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEE-ESAEGEKKESKKKK 196



 Score = 33.5 bits (77), Expect = 0.11
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           K+ K  E+   EE E     +++E  E + KE K+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195



 Score = 31.9 bits (73), Expect = 0.37
 Identities = 11/37 (29%), Positives = 19/37 (51%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           K  K  ++   ++ +     KE++  E  KKE K+KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 30.0 bits (68), Expect = 1.7
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
              K +E+   EE ++    K ++  +  KKE  KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 15/39 (38%), Positives = 19/39 (48%), Gaps = 2/39 (5%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           KD    +E   E+ E     KEE+    E EKKE K+ K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEES--AEGEKKESKKKK 196


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 35.1 bits (80), Expect = 0.091
 Identities = 25/98 (25%), Positives = 58/98 (59%), Gaps = 7/98 (7%)

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS-EKEKKSKDKDKKLKKEKLKK 211
           K   +  + EKE    K  +MK  ++K +E++ +EE++  E+E+  + + ++L++E+L++
Sbjct: 432 KDHAERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLER 491

Query: 212 KKKEKERSSHEKEVIPKLTFKFGTDME----EKTKRES 245
           ++ E++R   E++ + +L  +    +E    EK +R S
Sbjct: 492 ERLERDRL--ERDRLDRLERERVDRLERDRLEKARRNS 527



 Score = 29.7 bits (66), Expect = 4.5
 Identities = 31/150 (20%), Positives = 74/150 (49%), Gaps = 23/150 (15%)

Query: 89  VPNIVKVTSSEDLGEI--------KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK 140
           V NI  +TS   +  +        K D+   +D  TH          + DK   ++ + +
Sbjct: 382 VTNITSITSVTSVASVASVASVPSKDDRKYPQDGATHCHAVNGHYGGRVDKDHAERARIE 441

Query: 141 EKDKEKK-------EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
           +++  +K       EK++ ++ E+E++E  E++     +M+  + + +E+ R E +  E+
Sbjct: 442 KENAHRKALEMKILEKKRIERLEREERERLERE-----RMERIERERLERERLERERLER 496

Query: 194 EKKSKDKDKKLKKE---KLKKKKKEKERSS 220
           ++  +D+  +L++E   +L++ + EK R +
Sbjct: 497 DRLERDRLDRLERERVDRLERDRLEKARRN 526


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 34.5 bits (80), Expect = 0.100
 Identities = 21/95 (22%), Positives = 42/95 (44%), Gaps = 13/95 (13%)

Query: 158 EKKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           E+K E+ ++E   ++ ++ +  K  E IR+ +  + K KK+K + K+L  EKL+ +  E+
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRL--EKLEARLAEE 297

Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 251
                 K                    +   K+V+
Sbjct: 298 RPVEEGK----------PLAFRFPPPGKRLGKLVL 322


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 35.0 bits (81), Expect = 0.10
 Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 4/79 (5%)

Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK---EKKSKDKDKKLKKE 207
               E +EK E++E++  ++   +   E  +EKI K+ K  ++   +  S++    L   
Sbjct: 456 LPSPELEEKLEKEERELLQKRSSELTDED-LEKIIKDSKKLKERQDQPDSEEDLATLPTL 514

Query: 208 KLKKKKKEKERSSHEKEVI 226
           KL       E++S E EV 
Sbjct: 515 KLGDVPDPIEKTSLETEVS 533


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 34.6 bits (80), Expect = 0.10
 Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 14/84 (16%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEE------------KEFKMKEDKEKPVEKIRKEE 188
           E ++  KE E   +E+K++KEE E +              KE   K   E+  EKI K  
Sbjct: 502 EIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEE-KEKIEKAV 560

Query: 189 KDSEKEKKSKDKDK-KLKKEKLKK 211
            + ++  K +D ++ K K E+L++
Sbjct: 561 AELKEALKGEDVEEIKAKTEELQQ 584


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 34.3 bits (79), Expect = 0.11
 Identities = 29/128 (22%), Positives = 66/128 (51%), Gaps = 2/128 (1%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
            +E L E+K +    K K     ++ +++  +  +  ++ IK  E+  E +++     EE
Sbjct: 156 ENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEE 215

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKKKK 214
             +  +K  +  +EF+  +++ + +EK  K  +  EK  K ++K ++LK+  E++ +K K
Sbjct: 216 FVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFK 275

Query: 215 EKERSSHE 222
             E+ + E
Sbjct: 276 RGEKLTTE 283



 Score = 32.3 bits (74), Expect = 0.43
 Identities = 22/106 (20%), Positives = 59/106 (55%), Gaps = 1/106 (0%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE-KEKKEEKEKKEEKEF 171
            K+   K+E E++K+K+D+  K+  +  EK  E   K +  +E+ +E +EE+++  E+  
Sbjct: 13  LKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEVQ 72

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           ++KE +++   K+++  K+  + K+ +++     +     +++ + 
Sbjct: 73  ELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIER 118



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 19/95 (20%), Positives = 46/95 (48%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
              E+  E++K+     ++     K+ +++ ++      +  + ++K K  + KEK  K 
Sbjct: 197 KLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKR 256

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
            ++++E KE+ EE   K K  ++   E++   +K 
Sbjct: 257 REKREELKERAEEIYEKFKRGEKLTTEELLLLQKA 291



 Score = 28.5 bits (64), Expect = 6.2
 Identities = 25/147 (17%), Positives = 69/147 (46%), Gaps = 24/147 (16%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-----DKIKNKEKD-------------- 143
           E  +    + +     ++ +EK ++ ++++D+      ++K K  +              
Sbjct: 35  EASELAEKRDELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRE 94

Query: 144 -KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK----EDKEKPVEKIRKEEKDSEKEKKSK 198
            KEK+ +        +  E + ++ EK+ +      E++ + V+KI++  K+ E  KK+ 
Sbjct: 95  LKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKAL 154

Query: 199 DKDKKLKKEKLKKKKKEKERSSHEKEV 225
           ++++KLK+ K +  + +K+     +++
Sbjct: 155 EENEKLKELKAEIDELKKKAREIHEKI 181


>gnl|CDD|153363 cd07679, F-BAR_PACSIN2, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons 2 (PACSIN2).  F-BAR
           domains are dimerization modules that bind and bend
           membranes and are found in proteins involved in membrane
           dynamics and actin reorganization. Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins,
           also called Synaptic dynamin-associated proteins
           (Syndapins), act as regulators of cytoskeletal and
           membrane dynamics. Vetebrates harbor three isoforms with
           distinct expression patterns and specific functions.
           PACSIN 2 or Syndapin II is expressed ubiquitously and is
           involved in the regulation of tubulin polymerization. It
           associates with Golgi membranes and forms a complex with
           dynamin II which is crucial in promoting vesicle
           formation from the trans-Golgi network. PACSIN 2
           contains an N-terminal F-BAR domain and a C-terminal SH3
           domain. F-BAR domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 258

 Score = 33.9 bits (77), Expect = 0.11
 Identities = 28/94 (29%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKE---EKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           +D +KIKN +K+   K+   G KE KE ++   + +K   K+ K  E  +K      KEE
Sbjct: 90  EDFEKIKNWQKEAFHKQMMGGFKETKEAEDGFRKAQKPWAKKLKEVEAAKKAYHTACKEE 149

Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           K +   + +   D  L  E+LKK + + E+   +
Sbjct: 150 KLATSREANSKADPALNPEQLKKLQDKVEKCKQD 183


>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
           repair].
          Length = 480

 Score = 34.3 bits (79), Expect = 0.12
 Identities = 20/120 (16%), Positives = 42/120 (35%)

Query: 93  VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
           V  T +E    +  D  +K  K   +                       K ++K+E+   
Sbjct: 228 VPATIAEAKELLHADLYLKTLKSDERGSPYGTKVVYGGVGQLWVFLLSYKLQKKEEQTLR 287

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
            + EKE ++ ++  E+ + +    ++       K  KD    +  +   K  KK +  K+
Sbjct: 288 TRIEKELEKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLEVDFKSKKKREEAKR 347


>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 33.6 bits (77), Expect = 0.12
 Identities = 16/79 (20%), Positives = 37/79 (46%), Gaps = 1/79 (1%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           IV +     + E+ ++ + K  K    + +   +K +  K     ++  +K++E+  K  
Sbjct: 115 IVVMRVMSIVEEMLEENAKKGGKIDKMEADSTDLKARLRKAQIL-LEGLQKNQEELFKLL 173

Query: 152 GKKEEKEKKEEKEKKEEKE 170
            K  E  ++ +KE  ++KE
Sbjct: 174 DKYNELREQVQKESSKKKE 192



 Score = 29.4 bits (66), Expect = 2.9
 Identities = 16/62 (25%), Positives = 28/62 (45%), Gaps = 2/62 (3%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG--KKEEKEKKEEKEKKEEKEFKMKEDK 177
           +E  K   K DK + D    K + ++ +   +G  K +E+  K   +  E +E   KE  
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESS 188

Query: 178 EK 179
           +K
Sbjct: 189 KK 190



 Score = 29.0 bits (65), Expect = 4.3
 Identities = 18/68 (26%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           +E  EE  KK  K  KM+ D      ++RK +   E  +K++++  KL  +K  + +++ 
Sbjct: 125 EEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKL-LDKYNELREQV 183

Query: 217 ERSSHEKE 224
           ++ S +K+
Sbjct: 184 QKESSKKK 191



 Score = 27.8 bits (62), Expect = 9.7
 Identities = 10/68 (14%), Positives = 30/68 (44%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           E++ ++  KK     K +    + K + +  +   E  ++ +++  K      +  + V+
Sbjct: 125 EEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQ 184

Query: 183 KIRKEEKD 190
           K   ++K+
Sbjct: 185 KESSKKKE 192


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 34.3 bits (79), Expect = 0.12
 Identities = 24/102 (23%), Positives = 45/102 (44%), Gaps = 1/102 (0%)

Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDK-EKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
            +   E  K KK K KK K K + K+ D+ E + + +G         +  ++E    +  
Sbjct: 266 YYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDS 325

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
             K K  ++     +D +  + S  K ++L ++K KK + E 
Sbjct: 326 PKKRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPED 367



 Score = 33.6 bits (77), Expect = 0.23
 Identities = 28/148 (18%), Positives = 58/148 (39%), Gaps = 5/148 (3%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG-----KKEEK 157
           E+ + +  KK+ +  KKK             +  +   +++ E K+K+            
Sbjct: 172 ELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGS 231

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
              E ++K++E + K+K +     +   +       +     K KK KK+K KKKK+ K+
Sbjct: 232 TDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKD 291

Query: 218 RSSHEKEVIPKLTFKFGTDMEEKTKRES 245
               E E   +      +   +  + E+
Sbjct: 292 LDEDELEPEAEGLGSSDSGSRKDVEEEN 319



 Score = 31.7 bits (72), Expect = 0.84
 Identities = 35/157 (22%), Positives = 62/157 (39%), Gaps = 16/157 (10%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNK--------EKDKEKKEK--------EKGKKE 155
            +K+  KK  + K KK     D D   NK        E+ + KK+K          G  +
Sbjct: 174 VEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTD 233

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           ++ +K+ +E K++ +       +   E    +  D  +  K K   KK KK+K ++K  +
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLD 293

Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
           ++    E E +         D+EE+  R        K
Sbjct: 294 EDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRK 330



 Score = 29.7 bits (67), Expect = 3.7
 Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 3/94 (3%)

Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP--VEKIRKEEKD 190
              K  + ++ ++    E  KK ++E  E K K+EE   K+ + +EK     K+   +  
Sbjct: 23  PGAKPGSTKESRDAAAYENWKKRQEEA-EAKRKREELREKIAKAREKRERNSKLGGIKTL 81

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
            E +    D    LKK K ++KKKE ER      
Sbjct: 82  GEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLL 115



 Score = 29.0 bits (65), Expect = 6.1
 Identities = 21/98 (21%), Positives = 46/98 (46%), Gaps = 11/98 (11%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           +  KK+++E   K+K ++ ++KI    + +E+  K  G K   E  ++            
Sbjct: 40  ENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDD-----------D 88

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           +D +  ++K +K +K  E E+K      + +KE+  + 
Sbjct: 89  DDTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEY 126



 Score = 28.6 bits (64), Expect = 9.1
 Identities = 24/118 (20%), Positives = 52/118 (44%), Gaps = 2/118 (1%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           S++L  +    S   + +  +++ K+K+K      D D  +    D     +    K+ K
Sbjct: 220 SDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPK 279

Query: 158 EKKEEKEKKEEK--EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           +KK++K+K+ +   E +++ + E          KD E+E    +   K +KE+ +   
Sbjct: 280 KKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDD 337


>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
           All proteins in this family for whcih functions are
           known are G-T or G-U mismatch glycosylases that function
           in base excision repair. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University). Used 2pf model [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 328

 Score = 34.3 bits (78), Expect = 0.12
 Identities = 17/89 (19%), Positives = 35/89 (39%)

Query: 90  PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
           PN+  V   E+   +      ++  +   K  K K +  +  +  +  K  +  K  K  
Sbjct: 29  PNMALVEEQEETSGVPAKAPTQEPSEEAPKFRKRKPRSNEPYRPVEPKKPSDSKKSGKST 88

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           +  +K+EK   + K K++   F    + E
Sbjct: 89  KSKEKQEKITDKFKVKRKVDRFNGVSEAE 117


>gnl|CDD|116547 pfam07937, DUF1686, Protein of unknown function (DUF1686).  The
           members of this family are all hypothetical proteins of
           unknown function expressed by the eukaryotic parasite
           Encephalitozoon cuniculi GB-M1. The region in question
           is approximately 250 amino acids long.
          Length = 182

 Score = 33.6 bits (77), Expect = 0.12
 Identities = 18/73 (24%), Positives = 21/73 (28%), Gaps = 21/73 (28%)

Query: 281 ACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF---CPKCSKVDEGSRKEKKKR 337
           AC       L M G  G       VCVGLV        F       S V           
Sbjct: 108 AC-AAGSVKLFMPGSYG---RSSLVCVGLVV-------FAALLLLVSYVKGA-------G 149

Query: 338 GRKKKTHQDIIMS 350
           G K  +   ++ S
Sbjct: 150 GEKVSSKGIVVGS 162


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 33.0 bits (76), Expect = 0.13
 Identities = 21/119 (17%), Positives = 48/119 (40%), Gaps = 8/119 (6%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK--KDKDKIKNKEKDKEKKEKEKGKKE 155
           ++DL E ++ K   +      ++E E+ +++  +  +   K   +  ++ K E E+  + 
Sbjct: 43  ADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELER 102

Query: 156 EKEKKEEKEKKEEKEFK------MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
            KE  E + + E++         + E      EK+  ++ D   +K   D       E 
Sbjct: 103 IKEAAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKDLIDAFIAELGEN 161


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 34.3 bits (79), Expect = 0.14
 Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 5/82 (6%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
           K  N+ +  ++ E++  K E K KK     K+    K      K    + +E  D E E 
Sbjct: 25  KFVNEVRRCDEVERKLRKLESKIKKLGIPLKDTG-GKPDVPPSKEFLDLEEEILDLEAEI 83

Query: 196 KSKDKDKKLKKEKLKKKKKEKE 217
           K  ++      E L+K+  E E
Sbjct: 84  KEVEE----NLESLEKEINELE 101



 Score = 28.9 bits (65), Expect = 7.4
 Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 6/85 (7%)

Query: 121 EKEKIKKKKDKK-DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           E E+  +K + K  K  I  K       + +    +E    EE+    E E K  E+  +
Sbjct: 35  EVERKLRKLESKIKKLGIPLK---DTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLE 91

Query: 180 PVEKIR--KEEKDSEKEKKSKDKDK 202
            +EK     EE  +  +++    D+
Sbjct: 92  SLEKEINELEEWLNVLDEEKSFLDE 116


>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins, some contain Zn-finger domain; signature
           gene for I-A subtype; also known as TM1802 family.
          Length = 579

 Score = 34.3 bits (79), Expect = 0.15
 Identities = 30/109 (27%), Positives = 48/109 (44%), Gaps = 8/109 (7%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
             +K T+  K  E    KK KK  +K     + K++  K K  K+  EK  EK  K+ ++
Sbjct: 82  PTNKITYLLKTTE-PLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQD 140

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKS----KDKDKKLKKEKLKKKKKE 215
            K     E  +  I+ + K   +  +     +DK  K K  K KK K++
Sbjct: 141 LK---PGENVILTIKIDGKKLLEYDEDYEEFRDKVLKKKVFKDKKAKEK 186


>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
           unknown].
          Length = 622

 Score = 34.0 bits (77), Expect = 0.16
 Identities = 28/108 (25%), Positives = 51/108 (47%), Gaps = 3/108 (2%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
           + S+D  E   +K  +K ++  + +++ ++KK KD  D D    K+K +   +K K  K 
Sbjct: 424 SESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDL---KDKKESINKKNKKGKH 480

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
             E+    +++ E      ED E+      K    +EK KK++   KK
Sbjct: 481 AIERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKLKKK 528



 Score = 32.4 bits (73), Expect = 0.53
 Identities = 25/116 (21%), Positives = 49/116 (42%), Gaps = 5/116 (4%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE-----EKEKKEEKEKKEEKEFKMKEDK 177
           E     K +   +K+  K +++++  KE+  K+     + + K++KE   +K  K K   
Sbjct: 423 ESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAI 482

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
           E+      + E     ++  +  D    K  LK +K +K R   +K    +  F F
Sbjct: 483 ERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKLKKKASNLEEGFVF 538


>gnl|CDD|227872 COG5585, COG5585, NAD+--asparagine ADP-ribosyltransferase [Signal
           transduction mechanisms].
          Length = 417

 Score = 34.1 bits (78), Expect = 0.16
 Identities = 30/191 (15%), Positives = 57/191 (29%), Gaps = 5/191 (2%)

Query: 67  KTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK 126
           +      + + VKD +       PNI      +DL    KD     + +  ++ +   + 
Sbjct: 220 RNGFGGVKENFVKDPKSLDQN--PNIHDDLGKKDLKNYTKDPRAD-ENEKLRQTQDRDLG 276

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKE-KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
           KK  K     +KN E     KE   + KK E     +++     +      ++    +  
Sbjct: 277 KKLRKTGMPALKNGEGQIRPKEVRIRKKKLEDAIGVDEKGGVGVDAVPFVSEDLKPGEGS 336

Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
                  +   S    K+L  E  K+     +        I          +        
Sbjct: 337 INVHRLYR-SSSSQDIKELYIENFKETNLAGKLKDAYMNEISLEKDAIYEKIGHVYGLLE 395

Query: 246 SPKIVIKPVKS 256
              +  +P KS
Sbjct: 396 IEDVASEPRKS 406



 Score = 29.4 bits (66), Expect = 4.5
 Identities = 22/133 (16%), Positives = 37/133 (27%), Gaps = 2/133 (1%)

Query: 38  PGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTS 97
             T  P+    +   + K     K   ++     EK    V  V            + + 
Sbjct: 280 RKTGMPALKNGEGQIRPKEVRIRKKKLEDAIGVDEKGGVGVDAVPFVSED--LKPGEGSI 337

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           +             K+      KE     K KD    +    K+   EK     G  E +
Sbjct: 338 NVHRLYRSSSSQDIKELYIENFKETNLAGKLKDAYMNEISLEKDAIYEKIGHVYGLLEIE 397

Query: 158 EKKEEKEKKEEKE 170
           +   E  K +E +
Sbjct: 398 DVASEPRKSDEVQ 410


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 33.5 bits (77), Expect = 0.16
 Identities = 21/58 (36%), Positives = 33/58 (56%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           KD K    KE++ I K   K D  ++    K +++++K   K+E+K  K EKEK EE+
Sbjct: 65  KDFKKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122



 Score = 28.8 bits (65), Expect = 4.9
 Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 114 KKTHKKKEKEKIKK--KKDKKDKDKIKNKEKDKEK---KEKEKGKKEEKEKKEEKEK 165
           KK   K+EK  IK   K D          +K+++K   KE++K  K EKEK EE+  
Sbjct: 68  KKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124


>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2.  It is vital for
           effective cell-replication that replication is not
           stalled at any point by, for instance, damaged bases.
           Rtf2 stabilizes the replication fork stalled at the
           site-specific replication barrier RTS1 by preventing
           replication restart until completion of DNA synthesis by
           a converging replication fork initiated at a flanking
           origin. The RTS1 element terminates replication forks
           that are moving in the cen2-distal direction while
           allowing forks moving in the cen2-proximal direction to
           pass through the region. Rtf2 contains a C2HC2 motif
           related to the C3HC4 RING-finger motif, and would appear
           to fold up, creating a RING finger-like structure but
           forming only one functional Zn2+ ion-binding site.
          Length = 254

 Score = 33.5 bits (77), Expect = 0.17
 Identities = 13/69 (18%), Positives = 24/69 (34%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
           K   +++  +K KKKK KK K         +         +      +  E K+ K+ + 
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRS 232

Query: 174 KEDKEKPVE 182
                +  E
Sbjct: 233 IAPDNEKSE 241



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 21/89 (23%), Positives = 40/89 (44%), Gaps = 2/89 (2%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK--KEEKEFKMKEDKEK 179
           +E+++  K + ++++ K K+K K+KK K+        +         E      +  + K
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225

Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
            ++K R    D+EK +  K      KKEK
Sbjct: 226 KLKKKRSIAPDNEKSEVYKSLFTSHKKEK 254



 Score = 28.9 bits (65), Expect = 4.7
 Identities = 19/89 (21%), Positives = 36/89 (40%)

Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
           +D   +   E++ E  +    ++  K+KK++K+KK +K        E  V      E  S
Sbjct: 157 EDVVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSS 216

Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
              +  + K  K K+      +K +   S
Sbjct: 217 GAGQVGEAKKLKKKRSIAPDNEKSEVYKS 245


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 33.2 bits (76), Expect = 0.17
 Identities = 24/117 (20%), Positives = 53/117 (45%), Gaps = 2/117 (1%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           L ++ ++   ++ +K  +++ +E    K   K    I +    K K+E++   +  KE+K
Sbjct: 101 LNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQK--ARAAKERK 158

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           E    +  + F    D   P  +   ++K+ E++KK K+  ++ K+EK         
Sbjct: 159 ERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEKRMAALVAAA 215


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.9 bits (77), Expect = 0.18
 Identities = 39/179 (21%), Positives = 69/179 (38%), Gaps = 29/179 (16%)

Query: 13  AGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSP-- 70
           +G    P   P  QHP        P   PPPS      +   + S   +       S   
Sbjct: 441 SGLHSGPPQSPFAQHPFT--SGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGG 498

Query: 71  -------------KEKEYSKVKDVELGVTPM-----VPNIVKVTS--SEDLGEIKK-DKS 109
                        KE+   + ++ E    P       P +V   S  S+     K  D+ 
Sbjct: 499 CAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEPTVVNTPSHASQSARFYKHLDRG 558

Query: 110 IKKDKKTH---KKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
                +T          K+ KK+++  +K K + ++K +E++E+EK K++E+E++ E+E
Sbjct: 559 YNSCARTDLYFTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617



 Score = 29.7 bits (66), Expect = 4.4
 Identities = 14/35 (40%), Positives = 26/35 (74%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           K + +  +K K+E E+  +EE+E+++EKEK+ E+E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERERE 613



 Score = 29.3 bits (65), Expect = 6.0
 Identities = 15/55 (27%), Positives = 37/55 (67%), Gaps = 4/55 (7%)

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK----ERSSHE 222
           K+ + +E+ VEK ++E +   +E++ ++K+K+ ++E+ ++++ E+      SSHE
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHE 630



 Score = 28.9 bits (64), Expect = 7.8
 Identities = 14/42 (33%), Positives = 29/42 (69%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           K +E+  EK ++E  +K  +E++ EKEK++E+E + + + E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620


>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
           glycoprotein C-terminal domain.  The trypanosome
           parasite expresses these proteins to evade the immune
           response.
          Length = 98

 Score = 31.6 bits (72), Expect = 0.18
 Identities = 24/75 (32%), Positives = 34/75 (45%)

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
            K K K + K+K  K DK +   K K KE K K+ G    +    E   ++ K  K K+D
Sbjct: 3   KKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62

Query: 177 KEKPVEKIRKEEKDS 191
            +K  +      KDS
Sbjct: 63  CKKGCKWEGNTCKDS 77



 Score = 27.8 bits (62), Expect = 4.3
 Identities = 17/66 (25%), Positives = 29/66 (43%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            K +  +K  K  KK++  K K K+ K  K+     +    +   EK K ++ +K  +K 
Sbjct: 7   TKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKG 66

Query: 165 KKEEKE 170
            K E  
Sbjct: 67  CKWEGN 72


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 33.0 bits (76), Expect = 0.19
 Identities = 18/62 (29%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           K  KK KK  +K+ K+  K   D  D+ K   +E   EK E+++     ++++ E E+K 
Sbjct: 15  KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDREL--NRQRQAEAEQKA 72

Query: 168 EK 169
             
Sbjct: 73  IV 74



 Score = 29.9 bits (68), Expect = 1.7
 Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 8/65 (12%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-----KEEKEKKE 167
           DKK  KK +KEK   +K +K   K  +   D+ K+  E+ K E+ E+     ++ + + E
Sbjct: 13  DKKKAKKAKKEK---RKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAE 69

Query: 168 EKEFK 172
           +K   
Sbjct: 70  QKAIV 74


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 33.7 bits (78), Expect = 0.19
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 147
           K +K++ K     +KE     K +K K+K + K    + K
Sbjct: 50  KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 32.9 bits (76), Expect = 0.30
 Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 5/53 (9%)

Query: 94  KVTS-SEDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 144
           +VT+ +E   E+K++ K+   DKK  K   K +   KK +K + K    +   
Sbjct: 41  EVTNLNEQYKEMKEELKAALLDKKELKAWHKAQ---KKKEKQEAKAAKAKSKP 90



 Score = 32.1 bits (74), Expect = 0.61
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           KE++K     K + K  +K + K++K++ K  K + + +
Sbjct: 53  KEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91



 Score = 31.0 bits (71), Expect = 1.4
 Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
           K     +KE K+  K +K KKEK + K  + +    
Sbjct: 57  KAALLDKKELKAWHKAQK-KKEKQEAKAAKAKSKPR 91



 Score = 30.2 bits (69), Expect = 2.0
 Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 10/60 (16%)

Query: 125 IKKKKDKKDKDKIKN-KEKDKEKKE---------KEKGKKEEKEKKEEKEKKEEKEFKMK 174
            ++KK +K + ++ N  E+ KE KE         KE     + +KK+EK++ +  + K K
Sbjct: 30  AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89



 Score = 29.8 bits (68), Expect = 3.0
 Identities = 15/67 (22%), Positives = 30/67 (44%), Gaps = 7/67 (10%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
            ++ K  K +++    +++ KE ++  K     K+E  K   K  K K++K++      K
Sbjct: 30  AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKEL-KAWHKAQK-KKEKQE-----AK 82

Query: 187 EEKDSEK 193
             K   K
Sbjct: 83  AAKAKSK 89



 Score = 29.4 bits (67), Expect = 4.1
 Identities = 18/66 (27%), Positives = 31/66 (46%), Gaps = 15/66 (22%)

Query: 144 KEKKEKEKGK-------KEEKEKKE--EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
            ++K+ +KG+       ++ KE KE  +    ++KE K     +K      K+EK   K 
Sbjct: 30  AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQK------KKEKQEAKA 83

Query: 195 KKSKDK 200
            K+K K
Sbjct: 84  AKAKSK 89


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 33.1 bits (76), Expect = 0.19
 Identities = 22/142 (15%), Positives = 58/142 (40%), Gaps = 11/142 (7%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK-----------DKKDKDKIKNKEKDKEKK 147
           + L + K +           + E E ++ +            ++  + + K      E++
Sbjct: 31  KALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERE 90

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
            +    + +  K+     ++E    M+E ++   E    +E+    EK   + + +L++E
Sbjct: 91  LRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEE 150

Query: 208 KLKKKKKEKERSSHEKEVIPKL 229
             + +++ +E SS  +E+  KL
Sbjct: 151 VAEIREEGQELSSKREELKEKL 172



 Score = 31.6 bits (72), Expect = 0.65
 Identities = 21/83 (25%), Positives = 36/83 (43%)

Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
           D EK   E   KE ++  ++ + + E   K  E  E  +E +  +    E E +   +  
Sbjct: 16  DLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERI 75

Query: 203 KLKKEKLKKKKKEKERSSHEKEV 225
           K  +EKL   K E+E  +   E+
Sbjct: 76  KRAEEKLSAVKDERELRALNIEI 98



 Score = 30.8 bits (70), Expect = 1.0
 Identities = 23/121 (19%), Positives = 55/121 (45%), Gaps = 5/121 (4%)

Query: 110 IKKDKKTHKKKEKEK-IKKKKDKKD--KDKIKNKEKDKEKKEKEKGKKEEKEKK-EEKEK 165
           ++KD+   + KE  K +KK K + +     ++  E + E  E +  + E + ++  E+ K
Sbjct: 17  LEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIK 76

Query: 166 K-EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           + EEK   +K+++E     I  +           +  + +++ +  +K+ E  +   E+ 
Sbjct: 77  RAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERL 136

Query: 225 V 225
            
Sbjct: 137 E 137



 Score = 29.6 bits (67), Expect = 2.5
 Identities = 20/93 (21%), Positives = 49/93 (52%), Gaps = 2/93 (2%)

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           ++   K+  +  +D+     +E EK++K+ +  K+  ++++    + E + +E+  +  +
Sbjct: 97  EIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIRE 156

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           E +E   K+EE + K+  +     E+IRK +K 
Sbjct: 157 EGQELSSKREELKEKLDPELLSEYERIRKNKKG 189


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 33.6 bits (77), Expect = 0.19
 Identities = 28/105 (26%), Positives = 56/105 (53%), Gaps = 4/105 (3%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
            T KK E  + K+ KD   K+K K+  + +E +E  + +KEE+E++    +KEE+E +M 
Sbjct: 117 NTKKKIETYQ-KENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMN 175

Query: 175 E--DKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEK 216
           +  +K+  ++++       +E   + K    KL+ +  K K ++ 
Sbjct: 176 KRKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKP 220


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 32.6 bits (75), Expect = 0.20
 Identities = 15/71 (21%), Positives = 47/71 (66%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           +KE++K++K+     ++  K K+++ ++K++E  +K++  ++E ++K++E    + +  +
Sbjct: 59  QKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKID 118

Query: 179 KPVEKIRKEEK 189
           K ++++ KE+ 
Sbjct: 119 KAIKEVAKEKG 129



 Score = 29.5 bits (67), Expect = 2.1
 Identities = 14/77 (18%), Positives = 50/77 (64%), Gaps = 8/77 (10%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK------EEKEKKEEKEKKEEKEF- 171
           K  +++++K+  KK + +++ KEK+ +K+E++  K+      E ++ K+++ +++++E  
Sbjct: 34  KAAQKQLEKEF-KKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQ 92

Query: 172 KMKEDKEKPVEKIRKEE 188
           + ++  ++ +++ ++E 
Sbjct: 93  QKQQAAQQELQQKQQEL 109



 Score = 28.0 bits (63), Expect = 6.4
 Identities = 19/73 (26%), Positives = 46/73 (63%), Gaps = 6/73 (8%)

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
           K  +++ +KE K+ + E +KKE KE + +E K +     ++    SE+ +K+K ++ + K
Sbjct: 34  KAAQKQLEKEFKKLQAELQKKE-KELQKEEQKLQ-----KQAATLSEEARKAKQQELQQK 87

Query: 206 KEKLKKKKKEKER 218
           +++L++K++  ++
Sbjct: 88  QQELQQKQQAAQQ 100


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 32.9 bits (75), Expect = 0.21
 Identities = 21/73 (28%), Positives = 37/73 (50%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           + +       K K K K  + +D   ++   + E  K  ++K K KK E +K+ +K+KKE
Sbjct: 102 RLMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKE 161

Query: 168 EKEFKMKEDKEKP 180
           +K+ K +   E P
Sbjct: 162 KKKKKKRHSPEHP 174



 Score = 32.5 bits (74), Expect = 0.27
 Identities = 20/72 (27%), Positives = 38/72 (52%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
           E+ +    +  KK+ K K ++   +    +E P +    +  + + +KK  + DK+ KK+
Sbjct: 99  EQYRLMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKK 158

Query: 208 KLKKKKKEKERS 219
           K +KKKK+K  S
Sbjct: 159 KKEKKKKKKRHS 170


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 33.8 bits (77), Expect = 0.21
 Identities = 20/78 (25%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-DKEKKEKEKG 152
           K            D  +K +    +   + + + K+D+K  D     +  D E++E+E+ 
Sbjct: 812 KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEE 871

Query: 153 KKEEKEKKEEKEKKEEKE 170
           ++EE+E++EE+E++EE+E
Sbjct: 872 EEEEEEEEEEEEEEEEEE 889



 Score = 33.0 bits (75), Expect = 0.34
 Identities = 26/134 (19%), Positives = 65/134 (48%), Gaps = 5/134 (3%)

Query: 53  KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 112
           + +T  + +  ++ K    E E    +D E+         V+     + GE  + +   +
Sbjct: 761 RKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSE 820

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG-----KKEEKEKKEEKEKKE 167
            +    + + E  +++ + +++ + K  EK  +      G     ++EE+E++EE+E++E
Sbjct: 821 TQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEE 880

Query: 168 EKEFKMKEDKEKPV 181
           E+E + +E+ E+P+
Sbjct: 881 EEEEEEEEENEEPL 894



 Score = 32.3 bits (73), Expect = 0.63
 Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 3/101 (2%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
           K D  + +    +  ++ ++  E   +  +E   E E++ E E K + + E  +   RK 
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKG 691

Query: 188 EKDSEKEKKSKDKDKK---LKKEKLKKKKKEKERSSHEKEV 225
           E++ E E ++K+ D K     +E   + + E E +  E E+
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEI 732



 Score = 30.7 bits (69), Expect = 2.1
 Identities = 24/134 (17%), Positives = 61/134 (45%), Gaps = 5/134 (3%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           + KV +  DL +    ++    ++T ++ E+    + +      +    E ++E + + K
Sbjct: 621 VAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEGE----NGEESGGEAEQEGETETK 676

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           G + E E +   E+K E+E + + + ++   K   E ++ E E +++ +  + + E    
Sbjct: 677 G-ENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETG 735

Query: 212 KKKEKERSSHEKEV 225
           ++ E+     E E 
Sbjct: 736 EEGEEVEDEGEGEA 749



 Score = 30.3 bits (68), Expect = 2.4
 Identities = 28/154 (18%), Positives = 71/154 (46%), Gaps = 10/154 (6%)

Query: 60  EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIK--KDKSIKKDKKTH 117
           E++  + +   + K   + +              +    ED GEI+  +D  +K D+   
Sbjct: 739 EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK-------- 169
            K E E   +  +K + +     + D  + + E G++E   + + + K++EK        
Sbjct: 799 GKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS 858

Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           +    E++E+  E+  +EE++ E+E++ ++++++
Sbjct: 859 DGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892


>gnl|CDD|227470 COG5141, COG5141, PHD zinc finger-containing protein [General
           function prediction only].
          Length = 669

 Score = 33.8 bits (77), Expect = 0.21
 Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 9/64 (14%)

Query: 261 PVAAADEYDTGDSKQVWICPAC-GVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF 319
           P+  +DE+D        IC  C    ++ S  ++ CDGC+   H  C G+   PE   W 
Sbjct: 185 PIEPSDEFDD-------ICTKCTSTHNENSNAIVFCDGCEICVHQSCYGIQFLPE-GFWL 236

Query: 320 CPKC 323
           C KC
Sbjct: 237 CRKC 240


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 32.3 bits (74), Expect = 0.22
 Identities = 18/62 (29%), Positives = 40/62 (64%), Gaps = 1/62 (1%)

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           K  G  ++++++EE+E+ E +E   +E  ++ +EK   + K  EK ++++ K K++ KE+
Sbjct: 92  KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKR-EKRRENERKQKEILKEQ 150

Query: 209 LK 210
           +K
Sbjct: 151 MK 152



 Score = 31.9 bits (73), Expect = 0.29
 Identities = 16/62 (25%), Positives = 39/62 (62%), Gaps = 1/62 (1%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           ++K   + K +K  +++E+ ++++  +++  D++  KE  K K+EK + + E K+K+  K
Sbjct: 90  VRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRR-ENERKQKEILK 148

Query: 164 EK 165
           E+
Sbjct: 149 EQ 150


>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 594

 Score = 33.5 bits (77), Expect = 0.22
 Identities = 27/129 (20%), Positives = 47/129 (36%), Gaps = 11/129 (8%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE-EKEKKEEKEFKMKED 176
            +K  +K K K   K      N ++    K  +K KK     KE E +  + +E      
Sbjct: 448 CEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEI----- 502

Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS-HEKEVIPKLTFKFGT 235
               +EK+  + +  EK     D  K+L  E      +E  + + H  E+     +K   
Sbjct: 503 ----LEKVDSKSEKLEKISAKIDNIKELFDESKMSILQEILQPALHHIELNIARIYKLNI 558

Query: 236 DMEEKTKRE 244
             +E    +
Sbjct: 559 KDKEDFLNK 567



 Score = 29.3 bits (66), Expect = 5.0
 Identities = 32/146 (21%), Positives = 51/146 (34%), Gaps = 13/146 (8%)

Query: 81  VELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE---KEKIKKKKDKKDKDKI 137
            EL V     NI+ +   +DL       S  K        E    +KIK         K+
Sbjct: 345 YELAVYLGFKNIIFI--GQDLAYTNDGNSHAKGASYGDNYEGDESDKIKVVAYV-GGGKV 401

Query: 138 KNKEK---DKEKKEKEKGKKEEKEKKEEKEK----KEEKEFKMKEDKEKPVEKIRKEEKD 190
           +        +EK E++    ++KE     E     K   E   KE  EK ++K + +   
Sbjct: 402 ETDSVWNLFREKFEEDIANYKDKEFYNCTEGGARIKGTIELPFKEVCEKLLDKDKNKPFI 461

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEK 216
                 +  ++K + K   K KK   
Sbjct: 462 KLILLSNNSQEKAVLKLLQKIKKNND 487


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 33.4 bits (77), Expect = 0.23
 Identities = 19/61 (31%), Positives = 41/61 (67%), Gaps = 3/61 (4%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE---KEFKMKEDKEKPVEK 183
           K+K  K  +K++ K +++E++E  + ++EE+E+ EE++K++     + K+ E KEK +E 
Sbjct: 6   KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65

Query: 184 I 184
           +
Sbjct: 66  L 66



 Score = 30.3 bits (69), Expect = 2.2
 Identities = 18/60 (30%), Positives = 35/60 (58%)

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
           DK+K K     +K +EK ++EE+E+  E E++EE+E + ++ ++       K  +  EK+
Sbjct: 3   DKLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKD 62



 Score = 28.4 bits (64), Expect = 9.0
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 8/69 (11%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
           +K KEK  K   K EEK ++EE+E+  E E + +E+ E        EEK  +     K K
Sbjct: 3   DKLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDE--------EEKKEKPGFFDKAK 54

Query: 201 DKKLKKEKL 209
             ++K++ +
Sbjct: 55  ITEIKEKDI 63


>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
           (DUF874).  This family consists of several hypothetical
           proteins specific to Helicobacter pylori. The function
           of this family is unknown.
          Length = 417

 Score = 33.3 bits (75), Expect = 0.23
 Identities = 35/138 (25%), Positives = 72/138 (52%), Gaps = 11/138 (7%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIK--------NKEKDKEKKEKEKGKKEEKEKKEEK 163
           +DKK    + K++ +  +D+ +K  I+         +EK K +KE  +    + + ++EK
Sbjct: 124 QDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEK 183

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEK-LKKKKKEKERSS 220
           +K E+++ K +++K+K      K   E + EK+K   +K   +K++K   K+ ++  + +
Sbjct: 184 QKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQEN 243

Query: 221 HEKEVIPKLTFKFGTDME 238
           H +  I KL  K G  +E
Sbjct: 244 HNQFFIKKLGIKAGIAIE 261


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 33.3 bits (76), Expect = 0.23
 Identities = 31/114 (27%), Positives = 42/114 (36%), Gaps = 1/114 (0%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
            KK  K    +   KK   K K   K   K K  K     KK  K   +  +K ++   K
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258

Query: 173 MKEDKEKPVEKI-RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
             +   K V+K  +K  K + K  K   K  K K +  KK  K+    S  K  
Sbjct: 259 ALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKAT 312



 Score = 31.8 bits (72), Expect = 0.61
 Identities = 28/120 (23%), Positives = 45/120 (37%), Gaps = 1/120 (0%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            K  S K      K  +K+  K     K   K   K   K KK  +K  K+  +  ++  
Sbjct: 212 AKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAA 271

Query: 165 KKEEK-EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           KK  K   K  +   K  +   K +K + K+  +  K K   K   +  K +K +   +K
Sbjct: 272 KKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 33.7 bits (77), Expect = 0.23
 Identities = 35/165 (21%), Positives = 49/165 (29%), Gaps = 1/165 (0%)

Query: 34  LVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYS-KVKDVELGVTPMVPNI 92
           L  P        M    S K     K    SK K +  EK+    V D            
Sbjct: 404 LKDPTIIAGKKLMNKLTSEKINNPVKVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELR 463

Query: 93  VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
            KV  S+            K KK       + IK KK  K K   K   +D  +      
Sbjct: 464 KKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKANLQDVGECSSPPN 523

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
            KE+ +K+        K  +   +   P   ++K E  +   K  
Sbjct: 524 NKEKNDKQTSTSSSVLKSDRSSIEVRNPNANVKKLEDTTYNAKFP 568



 Score = 32.6 bits (74), Expect = 0.52
 Identities = 38/208 (18%), Positives = 65/208 (31%), Gaps = 16/208 (7%)

Query: 23  PMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVE 82
            + + P  P  +   P    P+ +  K      TS K     K     K  +Y   K  +
Sbjct: 387 NLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSEKINNPVK---VVKVSKYKGNKSEK 443

Query: 83  LGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK 142
                ++  I     S++L          + K    K+ K K  K    K K ++ N   
Sbjct: 444 KRDINVLDTIFASPVSKEL----------RKKVGKSKQTKLKNFKPVPNKSKKQLANNNS 493

Query: 143 DKEKKEKEKGKKEEKEKKEEK--EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD- 199
              K +K    K   +   +   E       K K DK+        +   S  E ++ + 
Sbjct: 494 QNIKSKKVVKAKTNNKANLQDVGECSSPPNNKEKNDKQTSTSSSVLKSDRSSIEVRNPNA 553

Query: 200 KDKKLKKEKLKKKKKEKERSSHEKEVIP 227
             KKL+      K     +++    V  
Sbjct: 554 NVKKLEDTTYNAKFPTVSKNNAYTLVDI 581



 Score = 29.9 bits (67), Expect = 3.1
 Identities = 34/155 (21%), Positives = 59/155 (38%), Gaps = 8/155 (5%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
            S E+L E  K           K       KK  +K   +KI N  K   K  K KG K 
Sbjct: 383 QSEENLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSEKINNPVK-VVKVSKYKGNKS 441

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL----KKEKLKK 211
           EK++        +  F     KE   +  + ++   +  K   +K KK       + +K 
Sbjct: 442 EKKRDI---NVLDTIFASPVSKELRKKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKS 498

Query: 212 KKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
           KK  K +++++  +          + +EK  +++S
Sbjct: 499 KKVVKAKTNNKANLQDVGECSSPPNNKEKNDKQTS 533


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 33.4 bits (77), Expect = 0.23
 Identities = 15/106 (14%), Positives = 46/106 (43%), Gaps = 2/106 (1%)

Query: 121 EKEKIKKKKDKKDK-DKIKNK-EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
            K +I+  + +K K ++ K + E  + + E+EK  +E + KK  + +  + +  +     
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALA 493

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +   K     +    +  ++  +  +   +  +K + + R + ++ 
Sbjct: 494 RVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQA 539


>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 472

 Score = 33.6 bits (77), Expect = 0.24
 Identities = 31/115 (26%), Positives = 51/115 (44%), Gaps = 12/115 (10%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           I    S    ++   +++ +    ++ +K K++ K KE+ +E  E EK  KE  E     
Sbjct: 330 IATRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELME----- 384

Query: 164 EKKEEKEFKMKEDKEKPVE-KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
                 E K K D    V   + + E+D +K K S D  K +  E +K+K  E E
Sbjct: 385 ------ELKEKGDLSIFVALSLSEVEEDGDKVKISFDSSKAMHYELMKEKLPELE 433


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 33.4 bits (76), Expect = 0.25
 Identities = 34/114 (29%), Positives = 58/114 (50%), Gaps = 9/114 (7%)

Query: 118 KKKEKEKIKKK------KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
            KKE + ++ K        +KDK  +++ EK + K E +     EK+  EEK++K+E+E 
Sbjct: 453 LKKENDMLQTKLNSMVSAKQKDKQSMQSMEK-RLKSEADSRVNAEKQLAEEKKRKKEEEE 511

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
                  +     R+E  +S K+ K +D + ++KK +   K KE+E    EKE 
Sbjct: 512 TAARAAAQAAAS-REECAESLKQAK-QDLEMEIKKLEHDLKLKEEECRMLEKEA 563



 Score = 31.8 bits (72), Expect = 0.78
 Identities = 34/125 (27%), Positives = 66/125 (52%), Gaps = 2/125 (1%)

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
           DLG++KK+  + + K       K+K K+     +K ++K++   +   EK+   +E+K K
Sbjct: 449 DLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEK-RLKSEADSRVNAEKQL-AEEKKRK 506

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           KEE+E       +    +E+  E +++ ++D E E K  + D KLK+E+ +  +KE +  
Sbjct: 507 KEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQEL 566

Query: 220 SHEKE 224
              +E
Sbjct: 567 RKYQE 571


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 33.4 bits (76), Expect = 0.25
 Identities = 21/107 (19%), Positives = 49/107 (45%), Gaps = 2/107 (1%)

Query: 114 KKTHKKKEKEKIKKKK--DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
            +  +  E+  I+ +   D K+KDK K+   D E ++ E  K+  K++ + K ++E++E 
Sbjct: 241 GEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEED 300

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
             +ED +   E         + ++   +       ++L   +K + +
Sbjct: 301 DEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAK 347



 Score = 33.0 bits (75), Expect = 0.35
 Identities = 25/120 (20%), Positives = 56/120 (46%), Gaps = 3/120 (2%)

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
             ++ G I+ +      +K  KK   +  + + D+ DK+ +K +   K ++E E+  ++E
Sbjct: 245 EDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQE 304

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK---KSKDKDKKLKKEKLKKKK 213
            ++ EE+  +   +    ++       +   ++ S  EK   K K + ++L+KE L  K 
Sbjct: 305 DDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKENLAPKS 364



 Score = 31.5 bits (71), Expect = 1.1
 Identities = 30/152 (19%), Positives = 69/152 (45%), Gaps = 6/152 (3%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK--EKGKKE 155
           S+D  E  ++K +++++   +K+  E+  ++K K DK  + +K    ++  +  E  + E
Sbjct: 152 SDDETEDDEEKKMEEEEAGEEKESVEQATREK-KFDKSGVDDKFFKLDEMNEFLEATEAE 210

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           E+    +++  E+      ED +   +    E+++ ++E   + +D    KEK KKK   
Sbjct: 211 EEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAG 270

Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTKRESSP 247
            +    + E   +   K   + + K + E   
Sbjct: 271 DDAELEDDEPDKEAVKK---EADSKPEEEDEE 299


>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2).  DAZ
          associated protein 2 has a highly conserved sequence
          throughout evolution including a conserved polyproline
          region and several SH2/SH3 binding sites. It occurs as
          a single copy gene with a four-exon organisation and is
          located on chromosome 12. It encodes a ubiquitously
          expressed protein and binds to DAZ and DAZL1 through
          DAZ repeats.
          Length = 136

 Score = 32.1 bits (73), Expect = 0.25
 Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 9/51 (17%)

Query: 3  PFQFSPFP--SVAGP-------GLIPRLPPMLQHPLIPQPLVHPPGTPPPS 44
          P   +P+P  S+  P       G     PPM  +P+   P V+PPG+    
Sbjct: 25 PQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPPGSTVLV 75


>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
          Length = 627

 Score = 33.5 bits (78), Expect = 0.25
 Identities = 30/113 (26%), Positives = 49/113 (43%), Gaps = 21/113 (18%)

Query: 112 KDKKTHKKKEKEKIKKKKDKK-DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE--- 167
           KDK T K+   + I          ++I     ++  K+ E   +E+K++KE  E +    
Sbjct: 482 KDKGTGKE---QSITITASSGLSDEEI-----ERMVKDAEANAEEDKKRKELVEARNQAD 533

Query: 168 ------EKEFKMKEDK--EKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKK 211
                 EK  K   DK      EKI    K+ ++  K +DK+  K K E+L +
Sbjct: 534 SLIYQTEKTLKELGDKVPADEKEKIEAAIKELKEALKGEDKEAIKAKTEELTQ 586


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 33.6 bits (77), Expect = 0.25
 Identities = 21/96 (21%), Positives = 46/96 (47%), Gaps = 9/96 (9%)

Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK-------EDKEKPVEKIRKEEK 189
               EK+ ++  K +G+K E     ++ K+ E E +         +D  K +++  +E  
Sbjct: 132 RDELEKEADELWKPRGRKPEINVALKELKELEAEIREVQLKTRTWKDLVKALDEAEEELA 191

Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKE-KERSSHEKE 224
           +  KE +  +K+ K + E+L++      ER + E++
Sbjct: 192 NLRKELRQLEKE-KQRLERLRRLLPLLAERKALEQQ 226


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 32.8 bits (75), Expect = 0.26
 Identities = 10/79 (12%), Positives = 32/79 (40%)

Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
              ++ + + +   ++     + +   +    + KE+ + +++ +    + K KP  K +
Sbjct: 51  APTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPK 110

Query: 186 KEEKDSEKEKKSKDKDKKL 204
            + +   K+  SK   K  
Sbjct: 111 VKPQPKPKKPPSKTAAKAP 129



 Score = 31.7 bits (72), Expect = 0.60
 Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 1/81 (1%)

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEK 208
            +   EE + + E  +++ K     E   +P     KE+   EK+ KK K K K   K K
Sbjct: 49  LEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPK 108

Query: 209 LKKKKKEKERSSHEKEVIPKL 229
            K K + K +    K      
Sbjct: 109 PKVKPQPKPKKPPSKTAAKAP 129



 Score = 30.5 bits (69), Expect = 1.3
 Identities = 13/66 (19%), Positives = 25/66 (37%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           +  E++     + +   +    K K+K K EK+  K + K K + K K + K     +  
Sbjct: 61  EPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKP 120

Query: 178 EKPVEK 183
                 
Sbjct: 121 PSKTAA 126



 Score = 29.8 bits (67), Expect = 2.5
 Identities = 19/77 (24%), Positives = 29/77 (37%), Gaps = 5/77 (6%)

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKE--DKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           + E E  +++ K   E +   E    K KE    EK  +K + + K   K K    K K 
Sbjct: 57  QPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP---KVKP 113

Query: 204 LKKEKLKKKKKEKERSS 220
             K K    K   +  +
Sbjct: 114 QPKPKKPPSKTAAKAPA 130



 Score = 28.6 bits (64), Expect = 5.3
 Identities = 15/78 (19%), Positives = 28/78 (35%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           E  +++     +     +      K+K K +K   K K K K K + +   K + + K+ 
Sbjct: 61  EPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKP 120

Query: 163 KEKKEEKEFKMKEDKEKP 180
             K   K         +P
Sbjct: 121 PSKTAAKAPAAPNQPARP 138



 Score = 28.6 bits (64), Expect = 5.7
 Identities = 19/79 (24%), Positives = 28/79 (35%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
            K   E     +    K K K K + K KK K K K + K K + K + + K+   K   
Sbjct: 67  PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126

Query: 178 EKPVEKIRKEEKDSEKEKK 196
           + P    +     S     
Sbjct: 127 KAPAAPNQPARPPSAASAS 145


>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1.  RNA
           polymerases catalyze the DNA dependent polymerisation of
           RNA. Prokaryotes contain a single RNA polymerase
           compared to three in eukaryotes (not including
           mitochondrial. and chloroplast polymerases). This
           domain, domain 1, represents the clamp domain, which a
           mobile domain involved in positioning the DNA,
           maintenance of the transcription bubble and positioning
           of the nascent RNA strand.
          Length = 330

 Score = 33.0 bits (76), Expect = 0.27
 Identities = 27/114 (23%), Positives = 40/114 (35%), Gaps = 7/114 (6%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
             +  ++S+K          K K  KK+      KI N  K K    K        +  E
Sbjct: 101 SSLLLNESVKYFFLKVVIDPKGKNSKKR----LKKINNLCKKKSICSKCGEDNGGLKAFE 156

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
              K + K  K     E     ++  E +  KEK  K   +K+  +  KK  KE
Sbjct: 157 GCGKYQPKISKDG--AEAIKALLKNIEIEELKEKLRKLNPEKV-LQIFKKISKE 207


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 33.3 bits (76), Expect = 0.30
 Identities = 31/114 (27%), Positives = 53/114 (46%), Gaps = 14/114 (12%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-----KEKG----KKEEKEKKEEKEKKEEK 169
           K+ KE++K    +   DKI  +E      E     KEKG    ++  +E++EE+E++EE+
Sbjct: 356 KELKERVKDAIPEDLYDKIATEEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEE 415

Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-----KDKKLKKEKLKKKKKEKER 218
             + +   E+PV      E                K+ K+  EK+  K+ EKE+
Sbjct: 416 AAEAEAPMEEPVPGFEVPEMPMPAAGGGGGIKIVLKNAKITAEKVIIKRAEKEK 469


>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 725

 Score = 33.3 bits (76), Expect = 0.30
 Identities = 27/144 (18%), Positives = 47/144 (32%), Gaps = 4/144 (2%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNK----EKDKEKKEKEKGKKEEKEKKEEKEKK 166
             + +   + E +  K +++  D   IK K    E D E K   K  +E  E K+ K K 
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKN 414

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
                K +        K    EK +E       ++ +  +        E  +    + +I
Sbjct: 415 SSFINKTENILTNSPLKDELLEKTTEIINIENPQEFEFGQIGNDIISTEIAQLDENQNLI 474

Query: 227 PKLTFKFGTDMEEKTKRESSPKIV 250
               F    +       E+  KI 
Sbjct: 475 DTGEFDLENNFSNSFNPENGNKID 498



 Score = 31.3 bits (71), Expect = 1.0
 Identities = 19/86 (22%), Positives = 36/86 (41%)

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
              + E+E + E +FK  E+       I++++ ++E + + K   K L++    K  K K
Sbjct: 354 ALSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNK 413

Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTK 242
             S   K            ++ EKT 
Sbjct: 414 NSSFINKTENILTNSPLKDELLEKTT 439


>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain.  This family of domains
           contain a P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 320

 Score = 32.8 bits (75), Expect = 0.30
 Identities = 22/112 (19%), Positives = 37/112 (33%), Gaps = 12/112 (10%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK------ 147
           K+   +++   KK    K           +K   K    +     N EK+ + K      
Sbjct: 130 KIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLEKNIDDKLSNIFN 189

Query: 148 ------EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
                 + EK    EK K    E   + +F+ K D      +I K   D++ 
Sbjct: 190 EKLSNLKNEKISINEKIKSILNEINPDSKFEKKLDLNLIKLEIVKSNSDNDS 241



 Score = 30.8 bits (70), Expect = 1.5
 Identities = 17/106 (16%), Positives = 31/106 (29%), Gaps = 1/106 (0%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE- 161
            ++       DK     + K   KK   K       +   DK   +        ++  E 
Sbjct: 119 NLEYILKKILDKIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLEK 178

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
             + K    F  K    K  +    E+  S   + + D   + K +
Sbjct: 179 NIDDKLSNIFNEKLSNLKNEKISINEKIKSILNEINPDSKFEKKLD 224


>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
           terminal.  This domain family is found in eukaryotes,
           and is typically between 177 and 207 amino acids in
           length. The family is found in association with
           pfam00778, pfam02377, pfam00610, pfam00595. The segment
           polarity gene dishevelled (dsh) is required for pattern
           formation of the embryonic segments. It is involved in
           the determination of body organisation through the
           Wingless pathway (analogous to the Wnt-1 pathway).
          Length = 202

 Score = 32.2 bits (73), Expect = 0.31
 Identities = 12/33 (36%), Positives = 14/33 (42%)

Query: 10  PSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPP 42
            S   PG+ P   P +     P P   PPG PP
Sbjct: 147 SSYGAPGVPPPYNPPMLMMKPPPPSPGPPGAPP 179


>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
           ribosomal structure and biogenesis / Posttranslational
           modification, protein turnover, chaperones].
          Length = 379

 Score = 33.1 bits (75), Expect = 0.31
 Identities = 26/123 (21%), Positives = 59/123 (47%), Gaps = 3/123 (2%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E L E   D   ++D+K + + +  + + K   +D  ++K   +  +K++      +E+E
Sbjct: 180 EPLDEDYPDDMEERDRKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQE 239

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK---DKDKKLKKEKLKKKKKE 215
           K+ +K +K E+E   +      ++   + +  +E E ++       KK  KE +KK  K 
Sbjct: 240 KEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKM 299

Query: 216 KER 218
           +++
Sbjct: 300 EKK 302



 Score = 30.4 bits (68), Expect = 1.9
 Identities = 23/109 (21%), Positives = 46/109 (42%), Gaps = 3/109 (2%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE--KEKKEE 162
            K++  +   K       +++ +   K+D      KE++KE K+  K ++E   + K   
Sbjct: 201 AKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALA 260

Query: 163 KEKKEEK-EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
             K + + + K + + E        ++K  E  KK+   +KK  K   K
Sbjct: 261 ALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKKAIKNAAK 309



 Score = 30.0 bits (67), Expect = 2.8
 Identities = 20/88 (22%), Positives = 40/88 (45%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
             + K+D +    KE+EK  KK  K +++     +     K K + K + + + E     
Sbjct: 223 QIAKKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASA 282

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
              + K KE  +K ++  +K  K++ K+
Sbjct: 283 TAVKKKAKEVMKKALKMEKKAIKNAAKD 310


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
            biogenesis [Translation, ribosomal structure and
            biogenesis].
          Length = 1077

 Score = 33.2 bits (75), Expect = 0.31
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 8/101 (7%)

Query: 103  EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
            EI+    + K       +  E +    + ++K +IK    D+  KE+ K ++E++  +  
Sbjct: 984  EIESKLPLDKRSIAVVSRRIE-LPVPPECREKHEIK----DRIVKERIKDQEEKERMESL 1038

Query: 163  KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
            +  KEE+  K ++++E   ++IRK   D+ KE   K   KK
Sbjct: 1039 QRAKEEEIGKKEKERE---QRIRKTIHDNYKEMAKKRLKKK 1076



 Score = 32.8 bits (74), Expect = 0.48
 Identities = 24/97 (24%), Positives = 48/97 (49%), Gaps = 12/97 (12%)

Query: 118  KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK-EEKEKKEEKEKKEEKEFKMKED 176
             K+    + ++ +     + + K + K++  KE+ K  EEKE+ E  ++ +E+E   KE 
Sbjct: 992  DKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKE- 1050

Query: 177  KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
                      +E++    K   D  K++ K++LKKK+
Sbjct: 1051 ----------KEREQRIRKTIHDNYKEMAKKRLKKKR 1077



 Score = 30.9 bits (69), Expect = 1.9
 Identities = 36/184 (19%), Positives = 75/184 (40%), Gaps = 8/184 (4%)

Query: 49  KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 108
           K+   +  SP+E +   +  S K  E   V   E      V  +   + S +  ++ + +
Sbjct: 503 KIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSKVANESISSNHEKLMESE 562

Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
             +  KK           K + +KD      + +++  ++ EKG  E+ E +E     E 
Sbjct: 563 FEELKKKW----SSLAQLKSRFQKDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEM 618

Query: 169 KEFK----MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +E +      E++E   E   + E++    KK + +     +E+   +KK+ +  + EK 
Sbjct: 619 EESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVDWYTEEKR 678

Query: 225 VIPK 228
            I +
Sbjct: 679 KIEE 682



 Score = 29.3 bits (65), Expect = 5.1
 Identities = 19/68 (27%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 152  GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKKEKLK 210
               E +EK E K++  ++  K +E+KE+     R KEE+  +KEK+ + + +K   +  K
Sbjct: 1007 VPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYK 1066

Query: 211  KKKKEKER 218
            +  K++ +
Sbjct: 1067 EMAKKRLK 1074


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 32.2 bits (74), Expect = 0.32
 Identities = 22/100 (22%), Positives = 53/100 (53%), Gaps = 7/100 (7%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           K+E  ++KK ++  +K   +  +++K   E  K  +EE E+  +K K  EK+ +      
Sbjct: 33  KEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSL---- 88

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
              + ++   K+ EKE K+   + ++ +++ +K ++E++ 
Sbjct: 89  ---KNLKARLKELEKELKNLKWESEVLEQRFEKVERERDE 125



 Score = 28.7 bits (65), Expect = 4.6
 Identities = 27/93 (29%), Positives = 48/93 (51%), Gaps = 12/93 (12%)

Query: 105 KKDKSIKKDKKTHKKKEK---EKIKKKKDKKD--KDKIKNKEKDKE--KKEKEKGKKEEK 157
           K ++  +K      ++ K   E +KK +++ +  + K+K+ EKDK+  K  K + K+ EK
Sbjct: 41  KNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEK 100

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           E K  K      E ++ E + + VE+ R E  D
Sbjct: 101 ELKNLK-----WESEVLEQRFEKVERERDELYD 128


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 32.7 bits (75), Expect = 0.32
 Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKKEEKEFKMKEDK 177
            KE E+++K+  + D       +K KEKK  +   K+ E+E K+  E  E+ + ++ E+K
Sbjct: 30  DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEK 89

Query: 178 EKPVEKIRKEEKDSE 192
                    +E +S 
Sbjct: 90  SSWFPSKDPDEINSA 104



 Score = 28.8 bits (65), Expect = 6.3
 Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 6/61 (9%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK------EKDKEKKEKEKGKKEEKEKK 160
           DK I++ +K  K+ +       K KK+K ++K+       E  K  +  EK KK   E+K
Sbjct: 30  DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEK 89

Query: 161 E 161
            
Sbjct: 90  S 90


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 33.3 bits (76), Expect = 0.34
 Identities = 27/99 (27%), Positives = 48/99 (48%), Gaps = 9/99 (9%)

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           +EK ++ EK  KEE E++ + E + + ++EK   +  + +  +E EK         ++EK
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEK---------RREK 302

Query: 209 LKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSP 247
           L+   K+  RS+     I    FK G  ++    R S P
Sbjct: 303 LQNLLKKASRSADNVWYIEPSEFKAGDTVKLYYNRSSGP 341



 Score = 29.5 bits (66), Expect = 5.0
 Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)

Query: 134 KDKIKNKEK-DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
           ++K +  EK  KE+ E+E+  +E++ ++EEK   E    + K + EK  EK++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQ 304



 Score = 29.1 bits (65), Expect = 7.1
 Identities = 20/114 (17%), Positives = 39/114 (34%), Gaps = 1/114 (0%)

Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
           S K+ ++ +  KE +              K     +E +   +   E   ++ E E + E
Sbjct: 23  SQKRIQQNNGDKE-DSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIE 81

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
            E    +  +       +              D+KLK E+   +K+E E  + E
Sbjct: 82  VEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKREIEELAEE 135


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 33.0 bits (76), Expect = 0.34
 Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 6/100 (6%)

Query: 73  KEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK---KEKEKIKKKK 129
           K  + ++++E+         V+VT+     E+    +   D         KE EK++K+ 
Sbjct: 771 KGLANLEELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEI 830

Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           D+ +K K+ N E    K  +E   ++EKEK  E + K  K
Sbjct: 831 DRIEK-KLSN-EGFVAKAPEEV-VEKEKEKLAEYQVKLAK 867



 Score = 31.1 bits (71), Expect = 1.3
 Identities = 16/70 (22%), Positives = 32/70 (45%), Gaps = 2/70 (2%)

Query: 102 GEIKKDKSIKKDKKTHKK--KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
           G I     + + +K  +K  KE ++I+KK   +       +E  +++KEK    + +  K
Sbjct: 808 GLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867

Query: 160 KEEKEKKEEK 169
            EE+    + 
Sbjct: 868 LEERLAVLKA 877



 Score = 30.3 bits (69), Expect = 2.5
 Identities = 35/150 (23%), Positives = 61/150 (40%), Gaps = 19/150 (12%)

Query: 72  EKEYSKVKDV---------ELGVTPMVP-NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 121
           EKE+  +K++         E+ ++P  P  +V V S E    ++ ++   K       +E
Sbjct: 721 EKEFELLKEIISAIRNLRAEMNLSPSAPLKVVLVGSEELEDRLEANEDDIK--GLANLEE 778

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG---KKEEKEKKEEKEKKEEKEFKMKEDK- 177
            E I     ++   ++       E      G      E  + E++ +K EKE    E K 
Sbjct: 779 LE-ILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKL 837

Query: 178 --EKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
             E  V K  +E  + EKEK ++ + K  K
Sbjct: 838 SNEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867


>gnl|CDD|204587 pfam11078, Optomotor-blind, Optomotor-blind protein N-terminal
          region.  This family is conserved in Drosophila spp.
          Optomotor-blind is one of the essential toolkit
          proteins for coordinating development in diverse animal
          taxa, and in Drosophila it plays a key role in
          establishing the abdominal pigmentation pattern, in
          development of the central nervous system and leg and
          wing imaginal disc-formation of Drosophila
          melanogaster. This is the N-terminal region of the
          protein and does not include the T-box-containing
          transcription factor that plays a part in DNA-binding.
          Length = 74

 Score = 30.4 bits (68), Expect = 0.34
 Identities = 14/36 (38%), Positives = 16/36 (44%)

Query: 5  QFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGT 40
          Q  P P +    LIP+ PP   HP    PL H   T
Sbjct: 14 QAGPHPGLYPGTLIPKFPPAHPHPHPHHPLGHAYTT 49


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 32.3 bits (74), Expect = 0.34
 Identities = 35/116 (30%), Positives = 62/116 (53%), Gaps = 12/116 (10%)

Query: 119 KKEKEKIKKKKDK-----KDKDKIKNKEKDKEKKEKEKGKKEEK--EKKEEKEKKEEKEF 171
           K + E IKK  D+       ++K+  KEK+  +K +E+ K  +K  EK E+K+ K++ + 
Sbjct: 61  KDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQ 120

Query: 172 KMK--EDKEKPVEKIRKEEK---DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
            +K  +++ K  +K+ K  K   + EKE     KDK L  ++L +K K   +S  +
Sbjct: 121 LVKVMKERYKSYDKLYKAYKKALNLEKELYEYLKDKDLTLKELDEKIKAVNQSYEK 176



 Score = 29.6 bits (67), Expect = 2.4
 Identities = 30/105 (28%), Positives = 54/105 (51%), Gaps = 1/105 (0%)

Query: 112 KDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
             ++   KKEKE I+K +++ K   K   K +DK+ K+K K   +  +++ +   K  K 
Sbjct: 79  DKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYKSYDKLYKA 138

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           +K   + EK + +  K++  + KE   K K      EK++K KK+
Sbjct: 139 YKKALNLEKELYEYLKDKDLTLKELDEKIKAVNQSYEKIQKAKKK 183


>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII.  Eukaryotic DNA topoisomerase
           II, GyrB, ParE.
          Length = 594

 Score = 32.9 bits (76), Expect = 0.35
 Identities = 25/107 (23%), Positives = 40/107 (37%), Gaps = 19/107 (17%)

Query: 122 KEKIKKKKDKKDKD----------------KIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
            E  KKKK  K+K+                KI   + + + KEK  G  E +   E+   
Sbjct: 261 NEYAKKKKKLKEKNIKGEDVREGLTAFISVKIPEPQFEGQTKEK-LGTSEVRFGVEKIVS 319

Query: 166 KEEKEFKM--KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
           +    F      +  K VEK+    K     KK+++  +K K   + 
Sbjct: 320 ECLLSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSSIS 366


>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
          Length = 430

 Score = 33.0 bits (75), Expect = 0.35
 Identities = 28/122 (22%), Positives = 49/122 (40%), Gaps = 4/122 (3%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           +K + T +  E+EK K K       K K     K+K+E  +   EE++   + +     +
Sbjct: 77  QKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAK 136

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK----KKKKEKERSSHEKEVI 226
            K     ++  E   +  ++ E+  K K K K     K K     K+K  E     +EV 
Sbjct: 137 AKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVT 196

Query: 227 PK 228
            +
Sbjct: 197 EE 198



 Score = 29.9 bits (67), Expect = 2.6
 Identities = 36/173 (20%), Positives = 69/173 (39%), Gaps = 1/173 (0%)

Query: 44  SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
           SK++++   K+K  PK    + E+   +    +K K   L          +VT  E    
Sbjct: 35  SKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALA-KQKREGTEEVTEEEKAKA 93

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
             K  +  K K     K+K +  ++  +++K   K K     K +     K+++E  EE 
Sbjct: 94  KAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEV 153

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
            ++EE+  K K   +       K    ++++     +  +   E+ K K K K
Sbjct: 154 TEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAK 206


>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein.  This
           is the P3 protein section of the Potyviridae
           polyproteins. The function is not known except that the
           protein is essential to viral survival.
          Length = 445

 Score = 32.9 bits (76), Expect = 0.35
 Identities = 21/81 (25%), Positives = 40/81 (49%), Gaps = 5/81 (6%)

Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
              +K K +  ++K++K++ +I  K     K E +   +EE  +  E++  E  E   + 
Sbjct: 270 NEHRKLKREAAEEKEEKEEKEI-RKLYLYSKLEGKLPTEEEFLEYIEEKNPELLETAEEL 328

Query: 176 DKEKPVE---KIRKEEKDSEK 193
            +E+ VE   K   +EK+ EK
Sbjct: 329 IEEEEVEHQAK-TADEKNLEK 348


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.1 bits (76), Expect = 0.37
 Identities = 19/92 (20%), Positives = 54/92 (58%)

Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
           D+++++  E + E  E+E  + EE+ ++ E   + E   +  E++ + +E++  E +++ 
Sbjct: 473 DRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETI 532

Query: 193 KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +EK+ + ++ + +  +L+ + +EK  ++ E E
Sbjct: 533 EEKRERAEELRERAAELEAEAEEKREAAAEAE 564



 Score = 28.5 bits (64), Expect = 10.0
 Identities = 23/89 (25%), Positives = 52/89 (58%), Gaps = 7/89 (7%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK--EKPVEKIRKEEKDSEK---EK 195
           E+D+E+ E+ + + E+ E  EE E+ EE+  + ++    E  +E++ +  +D E+   E+
Sbjct: 471 EEDRERVEELEAELEDLE--EEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAER 528

Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +   ++K+ + E+L+++  E E  + EK 
Sbjct: 529 RETIEEKRERAEELRERAAELEAEAEEKR 557


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 32.8 bits (75), Expect = 0.38
 Identities = 18/93 (19%), Positives = 41/93 (44%), Gaps = 1/93 (1%)

Query: 120 KEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           K   K+ KK   K  +     K K   +  KE  K++E    +++    +++  + +   
Sbjct: 325 KNDSKLNKKYILKIANDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSN 384

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
              +K   +E+ ++ E    ++ K+L+  K K+
Sbjct: 385 PEYKKTTSQEEQAKLEFIKLEELKRLENGKQKR 417



 Score = 31.2 bits (71), Expect = 1.1
 Identities = 16/65 (24%), Positives = 29/65 (44%), Gaps = 1/65 (1%)

Query: 103 EIKKDKSIKKDKKTHKKK-EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
           E  K+   +K+    KK+    K K+   K    + K     +E+ + E  K EE ++ E
Sbjct: 352 EAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLE 411

Query: 162 EKEKK 166
             ++K
Sbjct: 412 NGKQK 416



 Score = 29.7 bits (67), Expect = 3.2
 Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 11/81 (13%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           K   E IK+ + +K+    K +    ++KE        + KK   ++++ K         
Sbjct: 348 KTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAK--------- 398

Query: 179 KPVEKIRKEEKDSEKEKKSKD 199
             +E I+ EE    +  K K 
Sbjct: 399 --LEFIKLEELKRLENGKQKR 417



 Score = 28.9 bits (65), Expect = 5.4
 Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
           KI N    K+ K   +   +E  K++E    +++    K  +  P     + +K + +E+
Sbjct: 337 KIANDWARKKVKTAYE-AIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEE 395

Query: 196 KSKDKDKKLKKEKLKKKKKEK 216
           ++K +  KL++ K  +  K+K
Sbjct: 396 QAKLEFIKLEELKRLENGKQK 416


>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
          Length = 1622

 Score = 33.2 bits (76), Expect = 0.38
 Identities = 23/73 (31%), Positives = 30/73 (41%), Gaps = 5/73 (6%)

Query: 149 KEKGKKEEKEKKEEKEKK-EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
           KE+G  EE        +K  E   KM+E  E+  E    EE D    K   + +    K+
Sbjct: 818 KEEGTYEELSNNGPLFQKLMENAGKMEEYVEENGE----EEDDQTSSKPVANGNANNLKK 873

Query: 208 KLKKKKKEKERSS 220
               KKK KE  S
Sbjct: 874 DSSSKKKSKEGKS 886



 Score = 32.0 bits (73), Expect = 0.86
 Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 4/62 (6%)

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           E   K E+  +E  E +  +   KPV         +  +K S  K K  + + +  K++E
Sbjct: 838 ENAGKMEEYVEENGEEEDDQTSSKPVA----NGNANNLKKDSSSKKKSKEGKSVLIKQEE 893

Query: 216 KE 217
           +E
Sbjct: 894 RE 895


>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Multidomain protein with permuted HD nuclease domain,
           palm domain and Zn-ribbon; MTH326-like has inactivated
           polymerase catalytic domain; alr1562 and slr7011 -
           predicted only on the basis of size, presence of HD
           domain, and location with RAMPs in one operon; signature
           gene for type III; also known as Crm2 family.
          Length = 475

 Score = 32.8 bits (75), Expect = 0.39
 Identities = 26/172 (15%), Positives = 62/172 (36%), Gaps = 9/172 (5%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKD--KEKKEKEKGKKEEKEKKEEKEKKEEKE-FKMKE 175
           ++E E++K++ +++  + I NK K+    +  +    K +  ++     ++  E  +  E
Sbjct: 90  EEEAEELKEEIEEETWESILNKVKEYLDIENVEPFLLKSDALERISDRIEDGNEDREEAE 149

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 235
           + E   + +R            K     +     +K+  EK +    K+    L  K G 
Sbjct: 150 ELEAARKNVRDFAPVPWIPPAPKIYIYIVSIFN-EKELSEKYKEFLLKKRNGLLLDKLGG 208

Query: 236 DMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQVW-----ICPAC 282
              ++ +           ++  +      A +    D K +      +CP C
Sbjct: 209 RARKEYRSVCGLLAAAWKLEKETEQYSLFAKDDLEEDLKLLLEAGERLCPLC 260


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 32.8 bits (75), Expect = 0.39
 Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 10/95 (10%)

Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE---FKMKE--DKEKPVEKIRKE 187
           DK ++   + D+   E EK K E++  +E  E K   E   + MK     EK   K+   
Sbjct: 508 DKGRLSKADIDRMVNEAEKYKAEDEANRERVEAKNGLENYCYSMKNTLQDEKVKGKLSDS 567

Query: 188 EKDSEKEKKSK-----DKDKKLKKEKLKKKKKEKE 217
           +K + ++   +     +K++  +KE+ + K+KE E
Sbjct: 568 DKATIEKAIDEALEWLEKNQLAEKEEFEHKQKEVE 602


>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
           [General function prediction only].
          Length = 303

 Score = 32.3 bits (73), Expect = 0.39
 Identities = 26/112 (23%), Positives = 52/112 (46%), Gaps = 7/112 (6%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF-------K 172
             +E +  K   + + +   KE+  E++E+   + E      EKEK ++K+        +
Sbjct: 180 NVREHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQ 239

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
             E  E   E+  + E D ++++ +K K +K K +  KK +K      +E+E
Sbjct: 240 SMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQE 291


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 32.7 bits (74), Expect = 0.39
 Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 5/91 (5%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGK--KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
                ++++    E+D+E  EKE GK   E+ E++EE+E K  K+F   E + K  +   
Sbjct: 136 PDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTK--KPSV 193

Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           K+   S     S    KKL+K + K+K K+ 
Sbjct: 194 KDNGKSFWSAASV-FSKKLQKWRQKQKLKKP 223


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 32.7 bits (74), Expect = 0.39
 Identities = 24/140 (17%), Positives = 59/140 (42%), Gaps = 7/140 (5%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK------IKNKEKDKEKK 147
           K   S    + + D++    KK   +  K   KK   +  +        + N++   +K+
Sbjct: 50  KKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKE 109

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
            K+  ++  ++        EE++ + K  K + V+K+ ++ +D   E +  D ++     
Sbjct: 110 SKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVT 169

Query: 208 KLKKKKKEK-ERSSHEKEVI 226
            L+ + +E+ +    + E I
Sbjct: 170 SLENESEEELDLEKDDGEDI 189



 Score = 30.0 bits (67), Expect = 3.1
 Identities = 24/149 (16%), Positives = 56/149 (37%), Gaps = 15/149 (10%)

Query: 40  TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 99
                +  D+     K  P   +    K +  E      +  EL           V + +
Sbjct: 55  ALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSEL-----------VVNED 103

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
              +    +S K  ++T ++K        +++K + K++ + K K+  E  + +  E E 
Sbjct: 104 AALD---KESKKTPRRT-RRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEV 159

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
            + +E +     + + ++E  +EK   E+
Sbjct: 160 SDVEESEFVTSLENESEEELDLEKDDGED 188


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 31.6 bits (72), Expect = 0.40
 Identities = 21/62 (33%), Positives = 40/62 (64%), Gaps = 3/62 (4%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           +KK KEK+++ K K  K++    EK+KEK+E+      E EK+  + +K++ E K+ +++
Sbjct: 65  RKKAKEKLRRDKLKAKKEEA---EKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEE 121

Query: 178 EK 179
           +K
Sbjct: 122 KK 123



 Score = 30.4 bits (69), Expect = 0.85
 Identities = 26/100 (26%), Positives = 50/100 (50%), Gaps = 2/100 (2%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
           KK  KKK+K          +++     ++   + ++ +G+K+ KEK    + K +KE   
Sbjct: 26  KKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAE 85

Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           KE +++  E+  K   ++EKE+   +K K   K   ++KK
Sbjct: 86  KEKEKE--ERFMKALAEAEKERAELEKKKAEAKLMKEEKK 123


>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; Very large protein
           containing McrA/HNH-nuclease related domain and a
           RuvC-like nuclease domain; signature gene for type II.
          Length = 799

 Score = 32.8 bits (75), Expect = 0.41
 Identities = 26/100 (26%), Positives = 45/100 (45%), Gaps = 7/100 (7%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF-KMKED 176
           KK +    +K+K    K++   K    E  + EK      +   +  K   KEF K  E 
Sbjct: 324 KKNKLTYKQKRKLLGLKEEEIFKGLRYEGLKAEKNFNISLKTYHDLRKALGKEFLKDLEL 383

Query: 177 KEKPVEKIRK-----EEKDSEKEKKSKDKDKKLKKEKLKK 211
            EK +++I K     ++++  ++     KD  L +E+LKK
Sbjct: 384 NEKILDEIVKILTLYKDREMIEKILELYKD-LLNEEQLKK 422


>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Proteins.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Vetebrates contain two
           Proline-Serine-Threonine Phosphatase-Interacting
           Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
           mainly expressed in hematopoietic cells and are involved
           in the regulation of cell adhesion and motility.
           Mutations in PSTPIPs have been shown to cause
           autoinflammatory disorders. PSTPIP1 contains an
           N-terminal F-BAR domain, PEST motifs, and a C-terminal
           SH3 domain, while PSTPIP2 contains only the N-terminal
           F-BAR domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 239

 Score = 32.1 bits (73), Expect = 0.41
 Identities = 26/84 (30%), Positives = 50/84 (59%), Gaps = 1/84 (1%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
           E+ +EK+++E+ K E+  K+ +K KKE  + K  + K+   +K R+++K  +  +KS   
Sbjct: 91  EEFREKQKEERKKTEDIMKRSQKNKKELYK-KTMKAKKSYEQKCREKDKAEQAYEKSSSG 149

Query: 201 DKKLKKEKLKKKKKEKERSSHEKE 224
            +  + EKLKKK  + + S+ E +
Sbjct: 150 AQPKEAEKLKKKAAQCKTSAEEAD 173



 Score = 29.8 bits (67), Expect = 2.4
 Identities = 20/74 (27%), Positives = 45/74 (60%), Gaps = 1/74 (1%)

Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLK 205
           +E EK ++  +++KEE++K E+   + +++K++  +K  K +K  E++ + KDK ++  +
Sbjct: 85  EEAEKLEEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKDKAEQAYE 144

Query: 206 KEKLKKKKKEKERS 219
           K     + KE E+ 
Sbjct: 145 KSSSGAQPKEAEKL 158


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 32.6 bits (75), Expect = 0.41
 Identities = 13/82 (15%), Positives = 35/82 (42%), Gaps = 3/82 (3%)

Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
           D    E+E   +   +      ++++ +    E+ ++  E+  K E D + +    D D 
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDD---DEESDEEDEESSKSEDDEDDDDDDDDDDI 158

Query: 203 KLKKEKLKKKKKEKERSSHEKE 224
             ++  L+++++ +E      E
Sbjct: 159 ATRERSLERRRRRREWEEKRAE 180


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 31.8 bits (73), Expect = 0.41
 Identities = 34/153 (22%), Positives = 66/153 (43%), Gaps = 36/153 (23%)

Query: 72  EKEYSKVKDVELGVTPM-VPNIVKVTSSEDLGEIKKDK--------SIKKDKKTHKKKEK 122
           EK   KVK    G++ M V  +++    + L  ++ +K        S         K   
Sbjct: 18  EKLAPKVK----GISSMSVKEVLQSLVDDGL--VRTEKIGTSNYYWSFPSQALNKLKTRL 71

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           EK+KK+ +          E  +   E +   ++ K+ +EE E++ E             E
Sbjct: 72  EKLKKELE----------ELKQRIAELQAQIEKLKKGREETEERTELL-----------E 110

Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           ++++ EK+ +K K   +K +K   E+++K K+E
Sbjct: 111 ELKQLEKELKKLKAELEKYEKNDPERIEKLKEE 143


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 31.7 bits (72), Expect = 0.42
 Identities = 19/85 (22%), Positives = 40/85 (47%), Gaps = 1/85 (1%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            K ++ K++K+     +K+K+ K K    K  ++ ++K  E + +   +K+  E      
Sbjct: 101 AKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAA 160

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEK 189
            +E    + +E +E P E+   EE 
Sbjct: 161 -EEAAAAEEEEAEEAPAEEAPAEES 184



 Score = 31.7 bits (72), Expect = 0.44
 Identities = 20/76 (26%), Positives = 28/76 (36%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           K+ K   KK K  K K   K    + EKK  E   +   EKK  +      E     ++E
Sbjct: 110 KEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169

Query: 179 KPVEKIRKEEKDSEKE 194
           +  E   +E    E  
Sbjct: 170 EAEEAPAEEAPAEESA 185



 Score = 29.0 bits (65), Expect = 4.2
 Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 1/68 (1%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
             E +   + KKDK +  KK  +K   + +KK  +       +K K  +      E+   
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEK-KAAEAAAVAAEEAAA 165

Query: 161 EEKEKKEE 168
            E+E+ EE
Sbjct: 166 AEEEEAEE 173



 Score = 28.2 bits (63), Expect = 7.5
 Identities = 15/55 (27%), Positives = 26/55 (47%)

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
           ++KE K   K  K  K KK  K+   E E K+ E + + V + +  E  +   ++
Sbjct: 108 EEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEE 162


>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
          Length = 424

 Score = 32.6 bits (75), Expect = 0.42
 Identities = 15/88 (17%), Positives = 36/88 (40%), Gaps = 2/88 (2%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            +        +  K+K  + I   +   +K   +   +D   + +    + +KE ++  E
Sbjct: 54  DETPKKNPVLREEKRKPAKSILSLQALLEKRPSRTNSQDALLQAENALPEWKKELEKPSE 113

Query: 165 KK--EEKEFKMKEDKEKPVEKIRKEEKD 190
            K  E K     +  +K ++++R E K+
Sbjct: 114 PKEEEPKAAAESKVVQKELDELRDELKE 141


>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
          Length = 822

 Score = 32.7 bits (75), Expect = 0.44
 Identities = 12/99 (12%), Positives = 37/99 (37%), Gaps = 1/99 (1%)

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK-EEKEKKEEKEKKEEKEFKMKE 175
            ++ +     +K  +     ++ K    ++       K    EK++++++ + +   +K 
Sbjct: 724 PRRGKITVTDEKCPECGLPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKG 783

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
              K  EK++    ++  +  + D D    K       +
Sbjct: 784 VGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 32.8 bits (75), Expect = 0.47
 Identities = 15/96 (15%), Positives = 33/96 (34%), Gaps = 11/96 (11%)

Query: 122 KEKIKKKKDKKDKD-KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           +  +K + D    + +    +     + +E   K +  + +   K E      K      
Sbjct: 856 RNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNK------ 909

Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
                 EEK +++     D  ++L K   + + K K
Sbjct: 910 ----YNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 32.4 bits (74), Expect = 0.48
 Identities = 22/161 (13%), Positives = 59/161 (36%)

Query: 48  DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKD 107
           DK+   ++ + ++K   K++T     +  +  D E        +       E        
Sbjct: 57  DKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEPAETAGSL 116

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           + I+     ++   +E     +  K  ++ +   + +E + +   +     + EE+    
Sbjct: 117 RPIRSSAAAYRDINRELGSFYEYPKTDEEKELLREVEELESRLATEPSPAPELEEQLALM 176

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           EK +++        ++      +S+K KK   +  +  +EK
Sbjct: 177 EKSYELAAKYMPRGQERLPVAPESKKGKKPSVQPVRAVEEK 217



 Score = 29.7 bits (67), Expect = 2.9
 Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 6/71 (8%)

Query: 155 EEKEKKEEKEKK------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           EE++  EE+ K       + K+  +  DK K  E+   E+K  +K++  +D     +K  
Sbjct: 30  EEEDAAEEETKGFNADIPQPKDEGIIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKSD 89

Query: 209 LKKKKKEKERS 219
            + +      S
Sbjct: 90  KESQDYSSSDS 100


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 32.4 bits (74), Expect = 0.49
 Identities = 15/114 (13%), Positives = 35/114 (30%), Gaps = 13/114 (11%)

Query: 99  EDLGEIKKDK-SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
             L ++   +  I  ++        E+    + ++ K     +E+ K   +       ++
Sbjct: 168 ATLKQLAAVRAEIAAEQAELTTLLSEQ----RAQQAKLAQLLEERKKTLAQLNSELSADQ 223

Query: 158 EKKEEKEKKEEK--------EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           +K EE    E +        E    + +E          +    E K   +  K
Sbjct: 224 KKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEAKRTGETYK 277



 Score = 29.3 bits (66), Expect = 4.0
 Identities = 17/89 (19%), Positives = 40/89 (44%), Gaps = 6/89 (6%)

Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
              D  ++K  +K+    EK+  +++++  K EK+ K  +       +   +E    E  
Sbjct: 33  AAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLET------EIASLEAQLIETA 86

Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           D  K+ + +  D   +   L+ +++E+ R
Sbjct: 87  DDLKKLRKQIADLNARLNALEVQEREQRR 115


>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
           [Transcription].
          Length = 507

 Score = 32.4 bits (73), Expect = 0.49
 Identities = 26/166 (15%), Positives = 59/166 (35%), Gaps = 3/166 (1%)

Query: 42  PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDL 101
           P       ++  D      K           K+ +K+         +  N    TS+   
Sbjct: 135 PKKIRLGSITENDMNLKTFKRHRVLGKPSSAKKPAKISPPTAMTDSLDRNFSSETSTSRE 194

Query: 102 GEIKKDKSIKKDKKTHKK-KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
            +  ++  I K K   KK ++ E  K   D+++    +  + +   +  E  +K +  KK
Sbjct: 195 ADENENYVISKVKDIPKKVRDGESAKYFIDEENFTMAELCKPNFPIQISENFEKSKMAKK 254

Query: 161 EEKEKKEE-KEFKMK-EDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
            + EK+   K  +     +   + K   +  +  ++++S+ K    
Sbjct: 255 AKLEKRRHVKFLEGSNTHEMDQLLKHFLDNSNFRQDRRSRKKKASA 300


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 32.3 bits (73), Expect = 0.51
 Identities = 29/164 (17%), Positives = 68/164 (41%), Gaps = 18/164 (10%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-KEKKSKD 199
           E DK++ +K   +  + E+  ++E+ + ++ +++   EK   +++  E   E  E +S  
Sbjct: 367 EADKDRIQKLSAELLKLEEWLQEERSQREKLEVELGIEKDCNRVQLSENRRELSELRSAL 426

Query: 200 KDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP 259
           +  + +KE+L+++K+E        + I  L  +   + +EK   +++             
Sbjct: 427 RVLQKEKEQLQEEKQELL------DYIRVLELRLDKEADEKWAEDAA----TCEDAKSYL 476

Query: 260 PPVAAADEYDTGD-------SKQVWICPACGVQDDGSLPMIGCD 296
              A  D  D+ D       S ++        Q + S P    +
Sbjct: 477 HRAAYIDLLDSIDESPLDESSSELSPSALDEEQHNESTPKNPKE 520


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 32.7 bits (75), Expect = 0.51
 Identities = 26/73 (35%), Positives = 45/73 (61%), Gaps = 8/73 (10%)

Query: 119 KKEKE--KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE----KEKKEEKEKKEEK--E 170
           KKE E  K++ K  KK ++K++  +++   +E+ K  K+E    K+ K+E EK +EK  E
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEE 258

Query: 171 FKMKEDKEKPVEK 183
            K+ E+ +K +EK
Sbjct: 259 LKLPEEVKKVIEK 271



 Score = 32.3 bits (74), Expect = 0.58
 Identities = 25/81 (30%), Positives = 48/81 (59%), Gaps = 4/81 (4%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK---IRKEEKDSEKEKKSKDK 200
           K++ E  K + +  +K EEK +K ++E+ ++E + K ++K   I K++KD  ++ K K +
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLRE-QLKAIKKELGIEKDDKDELEKLKEKLE 257

Query: 201 DKKLKKEKLKKKKKEKERSSH 221
           + KL +E  K  +KE  + S 
Sbjct: 258 ELKLPEEVKKVIEKELTKLSL 278



 Score = 29.6 bits (67), Expect = 3.7
 Identities = 18/46 (39%), Positives = 27/46 (58%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 144
           E L  IKK+  I+KD K   +K KEK+++ K  ++  K+  KE  K
Sbjct: 230 EQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTK 275


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 32.0 bits (73), Expect = 0.52
 Identities = 28/127 (22%), Positives = 63/127 (49%), Gaps = 5/127 (3%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E+     +++  +K+     +   + I+ + ++      K  E+++E +EK    KE  E
Sbjct: 93  ENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREK---LKELIE 149

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           + E +E+  EK  K K+ +++  +   K E+  E+ ++ ++K K+ K+  LK+  +    
Sbjct: 150 QYELREQHFEKLLKTKDLEQQLNDA--KLEQAQEQAEQEQEKHKREKEILLKEAAQVSTL 207

Query: 219 SSHEKEV 225
              EKE+
Sbjct: 208 KETEKEL 214


>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
           lipoprotein.  This model represents a family of six
           predicted lipoproteins from a region of about 20
           tandemly arranged genes in the Treponema denticola
           genome. Two other neighboring genes share the
           lipoprotein signal peptide region but do not show more
           extensive homology. The function of this locus is
           unknown.
          Length = 412

 Score = 32.2 bits (73), Expect = 0.52
 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 4/66 (6%)

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           KKE+KE     E + EK  KM+E   K  E + K E ++  +   + K+   K   +  K
Sbjct: 22  KKEDKESGLR-EMRVEKFSKMEE---KENEFLSKPEYNTHVKSPEQIKELSNKISGVSGK 77

Query: 213 KKEKER 218
            +  + 
Sbjct: 78  GESDKF 83



 Score = 29.9 bits (67), Expect = 3.0
 Identities = 17/72 (23%), Positives = 33/72 (45%)

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
           K++KE G +E + +K  K +++E EF  K +    V+   + ++ S K      K +  K
Sbjct: 23  KEDKESGLREMRVEKFSKMEEKENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGESDK 82

Query: 206 KEKLKKKKKEKE 217
                  K + +
Sbjct: 83  FVGTPLVKDDYD 94


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 32.1 bits (73), Expect = 0.54
 Identities = 22/76 (28%), Positives = 47/76 (61%), Gaps = 1/76 (1%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           E++K +S+++ K+ +++ +K+ I   K  K   K+  K + + KK++ +  K E  KK++
Sbjct: 13  ELRKAQSLEELKQKYEEAQKQ-IADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKK 71

Query: 163 KEKKEEKEFKMKEDKE 178
           K KKE+ + ++K  K+
Sbjct: 72  KFKKEKVDVRVKVVKK 87



 Score = 28.7 bits (64), Expect = 6.5
 Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 4/67 (5%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
           E+ K K EE +K+    K  ++ +K+ E +E  ++K + E+  +E  KK     KK KKE
Sbjct: 21  EELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKK----KKFKKE 76

Query: 208 KLKKKKK 214
           K+  + K
Sbjct: 77  KVDVRVK 83


>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
           [DNA replication, recombination, and repair].
          Length = 663

 Score = 32.2 bits (74), Expect = 0.55
 Identities = 24/93 (25%), Positives = 46/93 (49%), Gaps = 10/93 (10%)

Query: 121 EKEKIKKKKDKKD-KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           E+  I  +  KK  +D +  + ++ E K K + K  +  KKE ++  ++ E +MKE    
Sbjct: 580 EEHGITPQTIKKKIRDILDGEYEEDEYKAKIEKKASKMSKKELEKLIKKLEKEMKE---- 635

Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
                  +  + E+  + +D+ K+LK+E L K 
Sbjct: 636 -----AAKNLEFEEAARLRDEIKELKEELLGKS 663


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 32.4 bits (74), Expect = 0.55
 Identities = 18/114 (15%), Positives = 44/114 (38%), Gaps = 10/114 (8%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           K+ E E     K  +   +    E+   K + E    E K++ E + ++ + + K   ++
Sbjct: 641 KRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQ 700

Query: 178 EKPVEKIRKEEKDSEK-EKKSK------DKDKKLKKEKLKKKKKEKERSSHEKE 224
           ++   +  K++    + E+ +K      + D +L          E  R+  +  
Sbjct: 701 QQAFLEALKDDFRELRTERLAKWQVVEGELDNQL---AQLSAAIEAARTQAKAR 751



 Score = 30.4 bits (69), Expect = 2.5
 Identities = 31/120 (25%), Positives = 59/120 (49%), Gaps = 11/120 (9%)

Query: 114 KKTHKKKEKEKIKKKKDK--KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
            +T  K   +++KK+ D+     D   N  K+ +++ +E     E+      E +E + F
Sbjct: 744 ARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAF 803

Query: 172 KMKED--------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
            M+E         +E+P   I+  E +S  E+  ++  + +K  KL++KK E+ER + EK
Sbjct: 804 -MQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEK 862



 Score = 29.3 bits (66), Expect = 4.8
 Identities = 27/138 (19%), Positives = 59/138 (42%), Gaps = 16/138 (11%)

Query: 102 GEIKKDKS----IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
            ++  D+S    ++  K   +  + E+++      D D++ +   + E+ E        K
Sbjct: 320 AKLAADRSELELLEDQKGAFEDADIEQLQA-----DLDQLPSIRSELEEVEARLDALTGK 374

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIR------KEEKDSEKEKKSKDKDKKLKKEKLKK 211
            +  +  K E  + K+KE  E+ +EK        +EEKD +K    +D      + + + 
Sbjct: 375 HQDVQ-RKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQL 433

Query: 212 KKKEKERSSHEKEVIPKL 229
           +  + E +  E E+  +L
Sbjct: 434 EAGKLEFNEEEYELELRL 451


>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
          Length = 985

 Score = 32.5 bits (74), Expect = 0.57
 Identities = 30/123 (24%), Positives = 55/123 (44%), Gaps = 12/123 (9%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD---------KDKIKNKEKDKEKKEKEKG 152
            EI  +  IK+     +++  + I K++ K            D IK+   D  + E  + 
Sbjct: 563 KEINDESFIKQGFAKDEEELSKLILKERGKVLPETLEKLLFGDLIKDDYLDLNENEFAEK 622

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKK--EKL 209
           + E  + K+E  +  EK   + +D+    ++IR  +E+  EK K  K+  K+ K   EK+
Sbjct: 623 EPELDKLKDELYEAFEKIKDLIKDRLIGADRIRNAKERKKEKIKIRKENFKEFKALWEKI 682

Query: 210 KKK 212
             K
Sbjct: 683 NHK 685


>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
          Length = 472

 Score = 32.3 bits (74), Expect = 0.57
 Identities = 12/27 (44%), Positives = 19/27 (70%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           EK+KEKK K++GK+E +   + K  +E
Sbjct: 395 EKEKEKKAKKEGKEERRIHFQNKSIEE 421


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 32.4 bits (75), Expect = 0.59
 Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 10/65 (15%)

Query: 118 KKKEKEKIKKKKDK--KDKDKIKNK-------EKDKEKK-EKEKGKKEEKEKKEEKEKKE 167
            + E  +++K+  K  K+ ++++ K        K  E+  EKE+ K  E E+K  K K+ 
Sbjct: 809 VEAELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLAKLKER 868

Query: 168 EKEFK 172
               K
Sbjct: 869 LARLK 873


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 32.2 bits (74), Expect = 0.61
 Identities = 26/105 (24%), Positives = 61/105 (58%), Gaps = 4/105 (3%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           + E+E +KK+KD+  K+++++ EK+  + E+E    EE+ K E+   +  ++  +KE+ E
Sbjct: 418 EIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQ--IKEEIE 475

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           +   ++ + E++ +  K ++ +  KL   +L+K+ +  E    E+
Sbjct: 476 QVRLELEQAEREGDLAKAAELQYGKL--PELEKRLQAAEAKLGEE 518


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 32.1 bits (73), Expect = 0.64
 Identities = 28/117 (23%), Positives = 43/117 (36%), Gaps = 12/117 (10%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNK---EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
           K   ++I+K+ D               +K     E         K  K+KK     K   
Sbjct: 374 KGIYDQIEKEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKG----KSTS 429

Query: 176 DKEKPVEKIRKEEKDSEKE-----KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
            K    E I  +E+D+ K+     KK +DK  K+  +     KKE  +S  +   IP
Sbjct: 430 TKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDNNNIP 486



 Score = 29.4 bits (66), Expect = 4.4
 Identities = 22/105 (20%), Positives = 36/105 (34%), Gaps = 14/105 (13%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK-------------KDKDKIKNKEK 142
            SS  L    +      +         +  KKKK K              D++    K K
Sbjct: 391 ASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGK 450

Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
             +KK ++K  K   + K    KKE  + +   +   P E + K+
Sbjct: 451 KNQKKGRDKSSKVPSDSK-AGGKKESVKSQEDNNNIPPEEWVMKK 494



 Score = 28.6 bits (64), Expect = 7.3
 Identities = 26/100 (26%), Positives = 42/100 (42%), Gaps = 4/100 (4%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEK-EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
           +S       K  K  KK K T  K    E I   ++   K   KN++K ++K  K     
Sbjct: 409 SSPAASNSDKGSKK-KKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDS 467

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
           +   KKE  + +E+      E  E  ++KI +   D E++
Sbjct: 468 KAGGKKESVKSQEDNNNIPPE--EWVMKKILEWVPDLEED 505


>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
           subunit A'' fusion protein; Provisional.
          Length = 509

 Score = 32.1 bits (73), Expect = 0.65
 Identities = 22/153 (14%), Positives = 61/153 (39%), Gaps = 5/153 (3%)

Query: 44  SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
            ++   ++    T  K++ +     S     YS    +     P++  ++++ S E +  
Sbjct: 27  FEVPKNITDGYITLKKKEFTYHVIISKIAP-YSNSNGIIKKKKPVLKTVLEIESEEKIEA 85

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEEK--EKK 160
           I   +  +   +   +       +    ++K+  + N  +D  K   +  K   +     
Sbjct: 86  IDLMEFKRLFGRILDENMSFSTGELLTAEEKEYYEENSNEDVLKVIDDVKKLGFRLPPSV 145

Query: 161 EEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSE 192
            E+  K  K+ ++ +D+ E+ + +IR+E + + 
Sbjct: 146 IEEIAKAMKKKELSDDEYEEILRRIREEYERAR 178


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 31.9 bits (73), Expect = 0.65
 Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 3/91 (3%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
           KE +K +    D D  K    D E+K+    + EE + K  +  K+    K K  K+  +
Sbjct: 15  KESLKARGLSVDIDLEKLIALDDERKKLLS-EIEELQAKRNELSKQIG--KAKGQKKDKI 71

Query: 182 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           E+I+KE K+ ++E        K  + +L+ K
Sbjct: 72  EEIKKELKELKEELTELSAALKALEAELQDK 102



 Score = 31.6 bits (72), Expect = 0.77
 Identities = 24/91 (26%), Positives = 41/91 (45%), Gaps = 11/91 (12%)

Query: 118 KKKEKEKIKKKKDKKDKDKI-----KNKEKDKE-----KKEKEKGKKEEKEKKEEKEKKE 167
             KE  K +      D +K+     + K+   E      K  E  K+  K K ++K+K E
Sbjct: 13  LVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIE 72

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
           E + ++KE KE+ + ++    K  E E + K
Sbjct: 73  EIKKELKELKEE-LTELSAALKALEAELQDK 102



 Score = 28.5 bits (64), Expect = 7.8
 Identities = 27/122 (22%), Positives = 54/122 (44%), Gaps = 5/122 (4%)

Query: 103 EIKKDKSIKKDKKTHKK-KEKEKIKKKKDKKDKDKIKNKEKDKEKKEK-EKGKKEEKEKK 160
           +I  +K I  D +  K   E E+++ K+++  K      +   +KK+K E+ KKE KE K
Sbjct: 26  DIDLEKLIALDDERKKLLSEIEELQAKRNELSKQI---GKAKGQKKDKIEEIKKELKELK 82

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
           EE  +       ++ + +  +  I     +S    K ++ + ++K+         K +  
Sbjct: 83  EELTELSAALKALEAELQDKLLSIPNIPHESVPVGKDEEDNLEVKRWGTPPVFDFKPKPH 142

Query: 221 HE 222
            E
Sbjct: 143 WE 144


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 30.1 bits (68), Expect = 0.65
 Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
            +K     K K T  +K  EK  +++  K ++K    EK+ E++ + +  KE +  KEEK
Sbjct: 16  TEKKAFRPKSKLTSWEKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEK 75

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEK 189
           E+ E+   KM     K VE++++ EK
Sbjct: 76  ERYEKMAAKM---HAKKVERLKRREK 98


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 32.3 bits (74), Expect = 0.66
 Identities = 19/97 (19%), Positives = 32/97 (32%), Gaps = 6/97 (6%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKK----EEKEFKMKEDKEKPVEKIRKEE 188
           K +   KE ++E +E EK   E E         K    E K   +            + +
Sbjct: 512 KLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAK 571

Query: 189 KDSEKE-KKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
            ++  E K  K K   L     +   K+K+     + 
Sbjct: 572 VEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEA 608



 Score = 30.4 bits (69), Expect = 2.1
 Identities = 30/139 (21%), Positives = 53/139 (38%), Gaps = 15/139 (10%)

Query: 141 EKDKEKKEKEKGKKEE-KEKKEEKEKKEEKEFKMKEDKEKPVEKIR---KEEKDSEKEKK 196
           E+ K +  +EK   EE  EK EE  K+ E+E +  E     VE +       K S  E K
Sbjct: 493 EEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELK 552

Query: 197 SKDKDKKLKK-----EKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 251
           ++             E+ K +  ++ +   +K+ I      F + +++K  +        
Sbjct: 553 AQIVVLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAG-AFESTLKDKKNKVLPEAKKR 611

Query: 252 KP-----VKSPSPPPVAAA 265
           K      V+  +  P A  
Sbjct: 612 KLKLGDEVEVITGEPGAVV 630



 Score = 28.8 bits (65), Expect = 6.8
 Identities = 19/105 (18%), Positives = 37/105 (35%), Gaps = 16/105 (15%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEK--DKEKKEKEKGKKEEKEKKEEKEKKEEKEF-KMKEDK 177
           EK +  +K+ +++ ++++      +        GK    E K +             E+ 
Sbjct: 511 EKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEA 570

Query: 178 EKPVEKIRKEEKDSE------------KEKKSKDKDKKLKKEKLK 210
           +       K  K  +            K+KK+K   +  KK KLK
Sbjct: 571 KVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEA-KKRKLK 614



 Score = 28.5 bits (64), Expect = 8.5
 Identities = 32/140 (22%), Positives = 55/140 (39%), Gaps = 6/140 (4%)

Query: 64  SKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKE 123
           S  KTS  E +   V    +G+ P+     KV + +++  +KK K I  D    +   K+
Sbjct: 542 SGGKTSLLELKAQIVVLAHMGL-PVPAEEAKVEAVDEVKFLKKKKGIL-DAGAFESTLKD 599

Query: 124 KIKKKKDKKDKDKIKNKEKDK-EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           K  K   +  K K+  K  D+ E    E G   +      +   +    K+       +E
Sbjct: 600 KKNKVLPEAKKRKL--KLGDEVEVITGEPGAVVKIIAGILEALVQSGILKVIV-SHLDLE 656

Query: 183 KIRKEEKDSEKEKKSKDKDK 202
            I++E K   K  ++   D 
Sbjct: 657 IIKEEPKVKVKGIEAPGLDS 676


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 32.3 bits (73), Expect = 0.66
 Identities = 29/132 (21%), Positives = 64/132 (48%), Gaps = 8/132 (6%)

Query: 101  LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK---EKDKEKKEKEKGKKEEK 157
            L    +    +K++    K+   K  + K    K+K+KN     KD E K ++ GK +  
Sbjct: 914  LETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQD-GKDDYL 972

Query: 158  EKKEEKEKKEEKEFKMKEDKEKPVEK---IRKEEKDSEKEKKSKDKDK-KLKKEKLKKKK 213
            ++KE +      + +  E  ++ + +   + +++ D++K ++   +D   L+K + + K+
Sbjct: 973  KQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKE 1032

Query: 214  KEKERSSHEKEV 225
             E+E   H KE+
Sbjct: 1033 VEEELKQHLKEM 1044



 Score = 31.6 bits (71), Expect = 1.2
 Identities = 31/147 (21%), Positives = 66/147 (44%), Gaps = 1/147 (0%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK-EKKEKEKGKKEEKEKKE 161
           E +         +  + + K   +   D + K+++K ++ D+   ++K  G+  E +K+ 
Sbjct: 389 ERQIKNFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEI 448

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
            ++K+EE +F +KE ++      R  E D E  K  ++  K  K    +  KKE +   +
Sbjct: 449 LEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQN 508

Query: 222 EKEVIPKLTFKFGTDMEEKTKRESSPK 248
           EK  + +   K   +ME+     ++  
Sbjct: 509 EKADLDRKLRKLDQEMEQLNHHTTTRT 535



 Score = 30.0 bits (67), Expect = 2.8
 Identities = 25/113 (22%), Positives = 52/113 (46%), Gaps = 6/113 (5%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           IK  +++++ ++T  +K +E   + K  K     + KEK  E +++   K+ + E   E 
Sbjct: 185 IKALETLRQVRQTQGQKVQEHQMELKYLK-----QYKEKACEIRDQITSKEAQLESSREI 239

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
            K  E E    +++ K +E    +    + E K+    +K + EK   + + K
Sbjct: 240 VKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKS-RKKQMEKDNSELELK 291



 Score = 30.0 bits (67), Expect = 2.9
 Identities = 25/108 (23%), Positives = 51/108 (47%), Gaps = 3/108 (2%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
            + E + +K +  + + +  K  + D E K  +  KK+ ++   E E K EK F+  +++
Sbjct: 243 YENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQ 302

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
              +    +     EKE++  D  ++L  EKL K+++   +   E  V
Sbjct: 303 LNDLYHNHQRTV-REKERELVDCQREL--EKLNKERRLLNQEKTELLV 347


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 30.5 bits (69), Expect = 0.66
 Identities = 29/101 (28%), Positives = 51/101 (50%), Gaps = 9/101 (8%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE-EKEFKMKED 176
            K  KE+ K+ K  + K+  K K      +  E+ K+E +  +E K +KE EKE     D
Sbjct: 27  IKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEFAD 86

Query: 177 KEKPV--------EKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
           KEK V        E+ RK E++ ++ ++ ++++   K + L
Sbjct: 87  KEKFVTSAYKKQLEENRKLEEEEKEREELEEENDVTKGKDL 127


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
           represents a subfamily of RNA splicing factors including
           the Pad-1 protein (N. crassa), CAPER (M. musculus) and
           CC1.3 (H.sapiens). These proteins are characterized by
           an N-terminal arginine-rich, low complexity domain
           followed by three (or in the case of 4 H. sapiens
           paralogs, two) RNA recognition domains (rrm: pfam00706).
           These splicing factors are closely related to the U2AF
           splicing factor family (TIGR01642). A homologous gene
           from Plasmodium falciparum was identified in the course
           of the analysis of that genome at TIGR and was included
           in the seed.
          Length = 457

 Score = 32.2 bits (73), Expect = 0.66
 Identities = 6/85 (7%), Positives = 30/85 (35%), Gaps = 7/85 (8%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           ++ + + DK   + + +    + +D+  + + ++  + +  + + +       +      
Sbjct: 11  RNDTRRSDKGRERSRRRS---RSRDRSRRRRDRDYYRGRRGRSRSRSPN----RYYRPRG 63

Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKD 190
                   +       E + + E+D
Sbjct: 64  DRSYRRDDRRSGRNTKEPLTEAERD 88



 Score = 28.7 bits (64), Expect = 6.5
 Identities = 7/82 (8%), Positives = 32/82 (39%), Gaps = 2/82 (2%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           + +++E+ + + D +  DK + + + +  + +++ ++          +   +  +     
Sbjct: 1   RYRDRERGRLRNDTRRSDKGRERSRRRS-RSRDRSRRRRDRDYYRGRRGRSRS-RSPNRY 58

Query: 178 EKPVEKIRKEEKDSEKEKKSKD 199
            +P         D    + +K+
Sbjct: 59  YRPRGDRSYRRDDRRSGRNTKE 80


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 32.2 bits (73), Expect = 0.67
 Identities = 26/162 (16%), Positives = 69/162 (42%), Gaps = 3/162 (1%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           ++ I K        E+E+ ++K+ ++++    ++  D  K     G+      +E +E++
Sbjct: 385 EEVISKYAMKQDDTEEEE-RRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEE 443

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE-- 224
             +E + +E++E+  E+  +EE+  ++E++ + +     +E+++   +        +E  
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDA 503

Query: 225 VIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 266
                     + M E  +   S      P + P  P    A+
Sbjct: 504 ERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAE 545


>gnl|CDD|234000 TIGR02760, TraI_TIGR, conjugative transfer relaxase protein TraI.
            This protein is a component of the relaxosome complex. In
            the process of conjugative plasmid transfer the
            realaxosome binds to the plasmid at the oriT (origin of
            transfer) site. The relaxase protein TraI mediates the
            single-strand nicking and ATP-dependent unwinding
            (relaxation, helicase activity) of the plasmid molecule.
            These two activities reside in separate domains of the
            protein.
          Length = 1960

 Score = 32.2 bits (73), Expect = 0.67
 Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 4/104 (3%)

Query: 72   EKEYSKVKDVELGVTPMVPN-IVKVTSSEDLGEIKKDKSIKKDKKTHKKKE---KEKIKK 127
            EK  +K K   +    + P+    +   E   ++ +  S  +DK     K     E + K
Sbjct: 1840 EKIINKFKQDNISARIVFPDDWNDIGEEELQKQLMRAISSIEDKDIEIPKAIADFESVLK 1899

Query: 128  KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
              +      I N   ++EK + E+    E + ++  E+   KEF
Sbjct: 1900 MPNSDIISIIDNARTEREKSDLEQIVANEHKSQQALERDLNKEF 1943


>gnl|CDD|220265 pfam09491, RE_AlwI, AlwI restriction endonuclease.  This family
           includes the AlwI (recognises GGATC), Bsp6I (recognises
           GC^NGC), BstNBI (recognises GASTC), PleI(recognises
           GAGTC) and MlyI (recognises GAGTC) restriction
           endonucleases.
          Length = 429

 Score = 32.0 bits (73), Expect = 0.72
 Identities = 23/129 (17%), Positives = 40/129 (31%), Gaps = 11/129 (8%)

Query: 117 HKKKEKEKIKKKKDKKDKD------KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
            KK+E ++I K   +                 D      +     + E+ +      +++
Sbjct: 178 EKKEEIDEILKNTARYPIYTRLEDYFEYMSNIDGPLLPTDDLANTKIEQLKIIANNIKEQ 237

Query: 171 FKMKEDKE----KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
           F + E       K V    K+E +  KEKK  D +     E  +K   E          I
Sbjct: 238 FNIAELLSILNSKYVVDELKDELNELKEKKFFDDEYLDLLEVFEKILNEDLYEVTPSADI 297

Query: 227 P-KLTFKFG 234
           P    +   
Sbjct: 298 PAYFEWLTW 306


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 32.0 bits (72), Expect = 0.72
 Identities = 22/139 (15%), Positives = 56/139 (40%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
           G+  + + + K  +     EK +++  K ++ K K K    ++  ++  +   EE+E+  
Sbjct: 200 GKRSRAQVVDKLLQLGLVSEKRELRGGKKRRKKLKPKQPNGEESGEDDFQEDPEEEEQLP 259

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
           E + +E ++       E       +  +   K++K+      L+   ++     +E    
Sbjct: 260 ESKPEETEKRVSAFQVEGSTLISAENLRQQLKQEKTSWPLLWLQSCLIRAADDREEDECD 319

Query: 222 EKEVIPKLTFKFGTDMEEK 240
           +   +  LT +    ME +
Sbjct: 320 QAVPLVPLTEENEEAMENE 338


>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
          Length = 650

 Score = 32.1 bits (74), Expect = 0.73
 Identities = 17/54 (31%), Positives = 28/54 (51%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           D +  K K+K +      K  K ++KN+E  KE   + K K  +  K ++KE+K
Sbjct: 194 DAEFKKGKKKFEASFYGYKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKERK 247



 Score = 29.4 bits (67), Expect = 4.3
 Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 4/52 (7%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKD-KKDKDKIKNKE---KDKEKKEK 149
            E KK K   +      K +K ++K ++D K+   ++K K+      +KKE+
Sbjct: 195 AEFKKGKKKFEASFYGYKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKER 246


>gnl|CDD|234537 TIGR04313, aro_clust_Mycop, aromatic cluster surface protein.
           Members of this family are absolutely restricted to the
           Mollicutes (Mycoplasma and Ureaplasma). All have a
           signal peptide, usually of the lipoprotein type,
           suggesting surface expression. Most members have lengths
           of about 280 residues but some members have a nearly
           full-length duplication. The mostly nearly invariant
           residue, a Trp,is part of a strongly conserved 9-residue
           motif, [ND]-W-[LY]-[WF]-X-[LF]-X-N-[LI], where X usually
           is hydrophobic. Because the hydrophobic six-residue core
           of this motif almost always contains three to four
           aromatic residues, we name this family aromatic cluster
           surface protein. Multiple paralogs may occur in a given
           Mycoplasma, usually clustered on the genome.
          Length = 297

 Score = 31.6 bits (72), Expect = 0.73
 Identities = 19/78 (24%), Positives = 28/78 (35%)

Query: 95  VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
           + S     E    K IK      K K+ +K K   + K  + + N     ++ EK K  K
Sbjct: 21  LISCSSQSENDLKKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIK 80

Query: 155 EEKEKKEEKEKKEEKEFK 172
            +K    E   K  K   
Sbjct: 81  SQKNINSELYLKLLKLSL 98



 Score = 28.5 bits (64), Expect = 7.0
 Identities = 15/84 (17%), Positives = 28/84 (33%), Gaps = 4/84 (4%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
            + + D  K+   + D +K +K+K  K     K            + ++ E    K  K 
Sbjct: 26  SQSENDLKKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLN----LVFKNDEDEKNKYIKS 81

Query: 188 EKDSEKEKKSKDKDKKLKKEKLKK 211
           +K+   E   K     L+      
Sbjct: 82  QKNINSELYLKLLKLSLEYLNNIN 105


>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein.  This is a family
           of proteins which regulate checkpoint kinases. In
           Schizosaccharomyces pombe this protein is called Rad26
           and in Saccharomyces cerevisiae it is called LCD1.
          Length = 648

 Score = 32.1 bits (73), Expect = 0.74
 Identities = 13/73 (17%), Positives = 38/73 (52%), Gaps = 1/73 (1%)

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK-LKKEKLKKKKK 214
           +++K+EE+ K++ +  ++KE  ++ ++K+++E +  E E+K    +++ L    L+ +  
Sbjct: 10  QQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSANDLRTELS 69

Query: 215 EKERSSHEKEVIP 227
                    +   
Sbjct: 70  PPSNLLKTSDASH 82



 Score = 29.0 bits (65), Expect = 6.9
 Identities = 24/105 (22%), Positives = 48/105 (45%), Gaps = 9/105 (8%)

Query: 132 KDKDKIKNKEKDKEKKE--------KEKGKKEEKEKKEEKEKKE-EKEFKMKEDKEKPVE 182
           +DK  +  ++K +E+ +        KEK  +E ++ K+E +  E E++F + E +     
Sbjct: 3   RDKLDMLQQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSAN 62

Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
            +R E        K+ D      + +    K++K   S  KE++P
Sbjct: 63  DLRTELSPPSNLLKTSDASHIADESQPNSIKQKKREISPVKELVP 107


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 31.8 bits (72), Expect = 0.74
 Identities = 16/72 (22%), Positives = 33/72 (45%), Gaps = 10/72 (13%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK---------EEKEKKEEKEKKEEK 169
            +E+  +      +D+D+    E D+E+K+    +K         + K  K  +E++   
Sbjct: 541 TEERIDLDADDWTEDEDE-NEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAAL 599

Query: 170 EFKMKEDKEKPV 181
           + KM +  EKP+
Sbjct: 600 KMKMTDTSEKPL 611


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 31.6 bits (72), Expect = 0.74
 Identities = 14/68 (20%), Positives = 32/68 (47%), Gaps = 3/68 (4%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE---KEFKMKEDKEKPVEKIRKEEKD 190
           K K + +   +  K+ +  KK  K+K++   K+ E   KE++  E +   ++++ K+   
Sbjct: 6   KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGK 65

Query: 191 SEKEKKSK 198
                + K
Sbjct: 66  FYVPAEHK 73



 Score = 29.6 bits (67), Expect = 2.5
 Identities = 16/68 (23%), Positives = 30/68 (44%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           K K  +E   +  K+ K +K+  ++K     +  E  V++ RK E++  + K+   K  K
Sbjct: 6   KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGK 65

Query: 204 LKKEKLKK 211
                  K
Sbjct: 66  FYVPAEHK 73


>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809).  This
           family consists of several proteins of unknown function
           Raphanus sativus (Radish) and Brassica napus (Rape).
          Length = 138

 Score = 30.5 bits (68), Expect = 0.75
 Identities = 21/48 (43%), Positives = 32/48 (66%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           +++ KK     K +I+ KE+ KE K + +GK+E+KE K E E KEEK+
Sbjct: 83  DEVTKKVSPIIKGEIEGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEKK 130


>gnl|CDD|216187 pfam00913, Trypan_glycop, Trypanosome variant surface glycoprotein
           (A-type).  The trypanosome parasite expresses these
           proteins to evade the immune response. This family
           includes a variety of surface proteins such as
           Trypanosoma brucei VSGs such as expression site
           associated gene (ESAG) 6 and 7.
          Length = 370

 Score = 31.7 bits (72), Expect = 0.75
 Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 4/73 (5%)

Query: 94  KVTSSEDLGEIKK--DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN--KEKDKEKKEK 149
             T+ +   E+K+   + +   K T  +   EKIKK     D +KIK   ++ D E+  K
Sbjct: 279 DTTTLKASTELKEALKRLLLNKKDTDDEDAAEKIKKLFGDDDAEKIKKLWEKIDNEQIPK 338

Query: 150 EKGKKEEKEKKEE 162
           +     +K K  E
Sbjct: 339 KIAGTTKKTKLGE 351


>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
           production and conversion].
          Length = 194

 Score = 31.2 bits (71), Expect = 0.76
 Identities = 23/110 (20%), Positives = 55/110 (50%), Gaps = 19/110 (17%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK---- 174
           +K  +KI ++ +++  ++I  + +++ +K KE+ K+E +E  EE  +K EKE + +    
Sbjct: 5   EKLIKKILREAEEE-AEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKEAERERQRI 63

Query: 175 -------------EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
                        E KE+ +E + +  ++ +    + D + +  +E L +
Sbjct: 64  ISSALLEARRKLLEAKEEILESVFEAVEE-KLRNIASDPEYESLQELLIE 112


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 31.5 bits (72), Expect = 0.77
 Identities = 25/101 (24%), Positives = 51/101 (50%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           K K+++ K K  E+  E  + +  ++E + +++E    E +   ++ DKE        EE
Sbjct: 77  KWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEE 136

Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
           +  E  KK+K+   +   E+ +++  E+E +  EKE   +L
Sbjct: 137 EKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177


>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
          Length = 312

 Score = 31.6 bits (72), Expect = 0.78
 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL--KKEKLKKKKKEK 216
            KE + ++ +  K  K+K       E  +SEK +K K K   L  KK+KL+K  +EK
Sbjct: 109 GKESRIKR-YDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNEEK 164


>gnl|CDD|215591 PLN03124, PLN03124, poly [ADP-ribose] polymerase; Provisional.
          Length = 643

 Score = 31.7 bits (72), Expect = 0.79
 Identities = 26/153 (16%), Positives = 55/153 (35%), Gaps = 4/153 (2%)

Query: 62  MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 121
           M++K K        +K      G+   +   +    +ED     K  +        +KK 
Sbjct: 1   MANKLKVDELRAALAKRGLDTTGLKAALVRRLDDAIAEDAKTASKSGT---KSSAGRKKR 57

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
           +E+     D+    K    ++ K    +E  +   +       +K++   ++    E  V
Sbjct: 58  RERQDDGDDEPVSPKRIAIDEVKGMTVRELREAASERGLATTGRKKDLLERLCAALESDV 117

Query: 182 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
            K+       E EK+    +++ K+EK+    K
Sbjct: 118 -KVGSANGTGEDEKEKGGDEEREKEEKIVTATK 149


>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
           role in the biomineralisation of teeth.  They seem to
           regulate formation of crystallites during the secretory
           stage of tooth enamel development and are thought to
           play a major role in the structural organisation and
           mineralisation of developing enamel. The extracellular
           matrix of the developing enamel comprises two major
           classes of protein: the hydrophobic amelogenins and the
           acidic enamelins. Circular dichroism studies of porcine
           amelogenin have shown that the protein consists of 3
           discrete folding units: the N-terminal region appears to
           contain beta-strand structures, while the C-terminal
           region displays characteristics of a random coil
           conformation. Subsequent studies on the bovine protein
           have indicated the amelogenin structure to contain a
           repetitive beta-turn segment and a "beta-spiral" between
           Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
           rich region. The beta-spiral offers a probable site for
           interactions with Ca2+ ions. Muatations in the human
           amelogenin gene (AMGX) cause X-linked hypoplastic
           amelogenesis imperfecta, a disease characterised by
           defective enamel. A 9bp deletion in exon 2 of AMGX
           results in the loss of codons for Ile5, Leu6, Phe7 and
           Ala8, and replacement by a new threonine codon,
           disrupting the 16-residue (Met1-Ala16) amelogenin signal
           peptide.
          Length = 165

 Score = 30.9 bits (70), Expect = 0.79
 Identities = 11/24 (45%), Positives = 11/24 (45%)

Query: 19  PRLPPMLQHPLIPQPLVHPPGTPP 42
           P LP   Q P  PQPL  P    P
Sbjct: 90  PNLPQPAQQPFQPQPLQPPQPQQP 113



 Score = 27.8 bits (62), Expect = 8.6
 Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 22  PPMLQHPLIPQPLVHPPGTPPP 43
           PP  Q P+ PQP VHP    PP
Sbjct: 107 PPQPQQPMQPQPPVHPIPPLPP 128


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 32.1 bits (73), Expect = 0.80
 Identities = 16/56 (28%), Positives = 36/56 (64%)

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           D +K  ++ KE+K++EK ++ + E+ +E+ ++EEK F  +  ++  +E  +K  K+
Sbjct: 274 DWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKE 329


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
           model represents a group of paralogous families in
           plasmodium species alternately annotated as reticulocyte
           binding protein, 235-kDa family protein and rhoptry
           protein. Rhoptry protein is localized on the cell
           surface and is extremely large (although apparently
           lacking in repeat structure) and is important for the
           process of invasion of the RBCs by the parasite. These
           proteins are found in P. falciparum, P. vivax and P.
           yoelii.
          Length = 2757

 Score = 31.9 bits (72), Expect = 0.82
 Identities = 37/153 (24%), Positives = 63/153 (41%), Gaps = 5/153 (3%)

Query: 71  KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD 130
           K KEY K   ++      + N +K    + L ++ K  + + + K     E E+  +  +
Sbjct: 815 KSKEYIKTISIKEDEIFKIINEMKFMKDDFLNKVDKFINFENNCKEKIDSEHEQFAELTN 874

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           K     IK +  D +  + EK   + K    E  K  E+E++     +K  E I+  E  
Sbjct: 875 K-----IKAEISDDKLNDYEKKFNDSKSLINEINKSIEEEYQNINTLKKVDEYIKICENT 929

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
            E  +K  +K   LK+   K     KE +  EK
Sbjct: 930 KESIEKFHNKQNILKEILNKNIDTIKESNLIEK 962



 Score = 31.9 bits (72), Expect = 0.93
 Identities = 35/131 (26%), Positives = 65/131 (49%), Gaps = 7/131 (5%)

Query: 98   SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK---KDKDKIKNKEKDKEKKEKEKGKK 154
            ++++ +IK D      K  H  K  E+IKKK +    + K +I + E   +K       +
Sbjct: 1110 ADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPE 1169

Query: 155  EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD----KKLKKEKLK 210
            E ++K E    K +K+  + ++ +K + +I + EKD    ++ K  +    K L K  L+
Sbjct: 1170 EIEKKIENIVTKIDKKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSYGKNLGKLFLE 1229

Query: 211  KKKKEKERSSH 221
            K  +EK++S H
Sbjct: 1230 KIDEEKKKSEH 1240



 Score = 31.6 bits (71), Expect = 0.98
 Identities = 33/110 (30%), Positives = 56/110 (50%), Gaps = 6/110 (5%)

Query: 100  DLGEIK--KDKSIK-KDK--KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
            ++ E+K   DKS   KD+  K  K  EK K   ++ KKD  ++ NK      K K    K
Sbjct: 1484 NINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTK 1543

Query: 155  EEKEKKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDKK 203
            ++ E   ++ K   K+F ++ +K E+ +++I+KE+   E +    DK  K
Sbjct: 1544 KDSEIIIKEIKDAHKKFILEAEKSEQKIKEIKKEKFRIEDDAAKNDKSNK 1593



 Score = 30.4 bits (68), Expect = 2.8
 Identities = 35/126 (27%), Positives = 59/126 (46%), Gaps = 3/126 (2%)

Query: 120  KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM-KEDKE 178
            K +  +K KKD    D   N  + KE  +K KG K+E +K  +  +K ++ F+  K+D  
Sbjct: 1465 KSQHILKIKKDNATNDHDFNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVT 1524

Query: 179  KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK--KEKERSSHEKEVIPKLTFKFGTD 236
            + + K       ++  K  KD +  +K+ K   KK   E E+S  + + I K  F+   D
Sbjct: 1525 ELLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKFILEAEKSEQKIKEIKKEKFRIEDD 1584

Query: 237  MEEKTK 242
              +  K
Sbjct: 1585 AAKNDK 1590



 Score = 29.2 bits (65), Expect = 5.1
 Identities = 32/115 (27%), Positives = 50/115 (43%), Gaps = 18/115 (15%)

Query: 107  DKSIKKDKKTHKKKEKEKIKKKKDKKDK---------DKIKNKEKDKEKKEKEKG----- 152
            DK+I  D     +K+ E I  K DKK           ++I   EKDK   E+ KG     
Sbjct: 1160 DKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSY 1219

Query: 153  ----KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
                 K   EK +E++KK E   K  E   + +++I+++  + E E   +   K 
Sbjct: 1220 GKNLGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKA 1274


>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
           [Function unknown].
          Length = 827

 Score = 32.0 bits (73), Expect = 0.83
 Identities = 31/131 (23%), Positives = 43/131 (32%), Gaps = 21/131 (16%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
            +  K+  K  KK EK     +   K   K   K   K  K     KKE +   E     
Sbjct: 251 QELYKERDKILKKLEKA--LNELLNKLLKKSHLKTNKKSGKLTPSRKKEFEILPEY---- 304

Query: 167 EEKEFKMKEDKEKPVEKIRKEE-----KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
                    DK++P  +I+K       K  +       K  KL  E    +K   E +  
Sbjct: 305 -------VPDKKRPKHRIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIENARKSLYENTPT 357

Query: 222 EKEVIPKLTFK 232
           +   I   TFK
Sbjct: 358 KSGFI---TFK 365


>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
           motility and secretion].
          Length = 214

 Score = 31.0 bits (70), Expect = 0.83
 Identities = 14/74 (18%), Positives = 33/74 (44%), Gaps = 3/74 (4%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK---EKKEEKEKKEEKEFKMKE 175
           K++ E++  +  K D + ++   +  E+ ++    ++     E+  EKEK   ++ K   
Sbjct: 24  KEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKEKVMSEKLKEPA 83

Query: 176 DKEKPVEKIRKEEK 189
                 E+  K+ K
Sbjct: 84  QMSSTSEEEEKKAK 97



 Score = 29.4 bits (66), Expect = 3.2
 Identities = 24/82 (29%), Positives = 39/82 (47%), Gaps = 4/82 (4%)

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS--KDKDKKLKKEKLK 210
           KK   E+K EK +K    F ++E KEK  E++  + K S+ E      ++ ++ K+  L 
Sbjct: 1   KKPYLEEKIEKAEKVTA-FALEELKEKI-EELPIQAKKSDDELVEELPERYEQTKENSLI 58

Query: 211 KKKKEKERSSHEKEVIPKLTFK 232
           +K    E    EKE +     K
Sbjct: 59  EKVDSIEEEISEKEKVMSEKLK 80


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
            Provisional.
          Length = 2722

 Score = 32.1 bits (73), Expect = 0.86
 Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 7/112 (6%)

Query: 107  DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
            DK IK+  K  ++K  + I   +  K K    +   D +K +  K K   +E K  +EK 
Sbjct: 1030 DKLIKEKGKEIEEKVDQYISLLEKMKTKLSSFHFNIDIKKYKNPKIK---EEIKLLEEKV 1086

Query: 167  EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
            E    K+ E+K K +E   K    S +   + DK+K  + E   KKKK  E+
Sbjct: 1087 EALLKKIDENKNKLIEIKNK----SHEHVVNADKEKNKQTEHYNKKKKSLEK 1134



 Score = 31.0 bits (70), Expect = 1.6
 Identities = 41/167 (24%), Positives = 67/167 (40%), Gaps = 15/167 (8%)

Query: 97   SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE--------KDKEKKE 148
             S  L E+ K      D+   +  EKE+ ++ KD ++    +  E        K KE   
Sbjct: 1963 VSNKLSELNKITC--NDESYDEILEKEEYEELKDLRNSFNQEKAETLNNLKLNKIKEDFN 2020

Query: 149  KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
              K   +E EK  +  K  E   K+ E+K+  ++ I    +D EKE +S +       E 
Sbjct: 2021 SYKNLLDELEKSVKTLKASENIKKIVENKKTSIDAINTNIEDIEKEIESIN---PSLDEL 2077

Query: 209  LKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 255
            LKK  K +   S    +I  +  K   D +     E   +I +  +K
Sbjct: 2078 LKKGHKIE--ISRYTSIIDNVQTKISNDSKNINDIEKKAQIYLAYIK 2122


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 31.8 bits (73), Expect = 0.86
 Identities = 21/79 (26%), Positives = 47/79 (59%), Gaps = 4/79 (5%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
           ++K+KK K+++KN +    KK +E  ++++ E++E+ E+    +F     K K  E+  +
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDF----VKSKGKEEEEE 545

Query: 187 EEKDSEKEKKSKDKDKKLK 205
           EEK+ + ++  +D+ +  K
Sbjct: 546 EEKEEKDKETEEDEPEGPK 564



 Score = 30.2 bits (69), Expect = 2.7
 Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 15/87 (17%)

Query: 169 KEFKMKED----KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           KE KMKE+    K    +K+++ +++ + E++ KD+   L    +K K KE+E    ++E
Sbjct: 492 KEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDF-VKSKGKEEEEEEEKEE 550

Query: 225 VIPKLTFKFGTDMEEKTKRESSPKIVI 251
                        E +      PKI++
Sbjct: 551 KDK----------ETEEDEPEGPKIIV 567



 Score = 28.7 bits (65), Expect = 8.1
 Identities = 21/78 (26%), Positives = 43/78 (55%), Gaps = 3/78 (3%)

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           KEK  KEE +  +    K+ +E    ++++K  E+ + E+   +   KSK K+++ ++EK
Sbjct: 492 KEKKMKEELKNLKGILNKKLQE---LDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEK 548

Query: 209 LKKKKKEKERSSHEKEVI 226
            +K K+ +E      ++I
Sbjct: 549 EEKDKETEEDEPEGPKII 566


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.5 bits (69), Expect = 0.88
 Identities = 21/67 (31%), Positives = 38/67 (56%), Gaps = 1/67 (1%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E+  + +K+ S + ++K   K +KEK K+KK+KK  +K+  K+  K  K   K   ++  
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEK-KEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151

Query: 159 KKEEKEK 165
            K+E+ K
Sbjct: 152 TKKEEGK 158



 Score = 30.1 bits (68), Expect = 1.3
 Identities = 17/72 (23%), Positives = 39/72 (54%), Gaps = 5/72 (6%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
            ++ EK++KE  K+ E+++  + +K+     K ++ ++K  EK+ K++     +  +K  
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKE-----KKEKKEKKVAEKLAKKKSTKTTKNTTKKA 146

Query: 201 DKKLKKEKLKKK 212
            KK   +K + K
Sbjct: 147 TKKTTTKKEEGK 158



 Score = 30.1 bits (68), Expect = 1.4
 Identities = 16/63 (25%), Positives = 36/63 (57%)

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           ++E+++  ++ +++E     KE KEK  +K+ ++    +  K +K+  KK  K+   KK+
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155

Query: 214 KEK 216
           + K
Sbjct: 156 EGK 158



 Score = 30.1 bits (68), Expect = 1.5
 Identities = 15/63 (23%), Positives = 33/63 (52%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           + ++ +   + +EK+  K +KEK +K+EK+  E+  KK+  +      K+   +   K+E
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155

Query: 189 KDS 191
           +  
Sbjct: 156 EGK 158



 Score = 29.7 bits (67), Expect = 1.7
 Identities = 18/66 (27%), Positives = 38/66 (57%)

Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
           ++ + + KE  KE +EKE  K ++++K+++++K  EK  K K  K       +  +K + 
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152

Query: 193 KEKKSK 198
           K+++ K
Sbjct: 153 KKEEGK 158



 Score = 29.0 bits (65), Expect = 3.1
 Identities = 18/59 (30%), Positives = 31/59 (52%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           K+  K  ++KE  K KK+K +K + K+  K   K+  +  K   ++  KK   +K+E K
Sbjct: 100 KEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEGK 158



 Score = 29.0 bits (65), Expect = 3.2
 Identities = 21/62 (33%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
            E+ +KE+KE  +E E K    K K  +K +KE+K +EK  K K         K   KK 
Sbjct: 92  PEESEKEQKEVSKETEEKEAI-KAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKT 150

Query: 215 EK 216
             
Sbjct: 151 TT 152



 Score = 28.2 bits (63), Expect = 6.1
 Identities = 18/61 (29%), Positives = 31/61 (50%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           ++KE  K  ++K+     K K ++K+K+  EK   KK  K  K   +K  +K    KE+ 
Sbjct: 98  EQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157

Query: 178 E 178
           +
Sbjct: 158 K 158



 Score = 28.2 bits (63), Expect = 6.4
 Identities = 22/63 (34%), Positives = 36/63 (57%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           EKE+ +  K+ ++K+ IK K++ KEKKEK+  +K  K+K  +  K   K+   K   +K 
Sbjct: 96  EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155

Query: 181 VEK 183
             K
Sbjct: 156 EGK 158


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 31.6 bits (71), Expect = 0.90
 Identities = 48/175 (27%), Positives = 69/175 (39%), Gaps = 14/175 (8%)

Query: 93  VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKK----KDKKDKDKIKNKEKDKEKKE 148
            +++  +   EIKK   IKK KK     E+E   K     +  +          DKE +E
Sbjct: 474 TRISKIQFTQEIKK--LIKKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEE 531

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE----KEKKSKDKDKKL 204
            E    + KE  E KE  +  E K  E  +KP     KE K S+     +K    KD K 
Sbjct: 532 GEHE--DSKESDEPKEGGKPGETKEGEVGKKPGPA--KEHKPSKIPTLSKKPEFPKDPKH 587

Query: 205 KKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP 259
            K+  + KK ++ RS+               D+ +  KR  SPK   +P     P
Sbjct: 588 PKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRP 642



 Score = 31.2 bits (70), Expect = 1.2
 Identities = 54/237 (22%), Positives = 81/237 (34%), Gaps = 23/237 (9%)

Query: 37  PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKE-KEYSKVKDVELGVTPMVPNIVKV 95
           PP  P  S +  K     +    E   SKE   PKE  +  + K+ E+G  P        
Sbjct: 510 PPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKP-------- 561

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
                 G  K+ K  K    + K +  +  K  KD ++  K K     +     +  K  
Sbjct: 562 ------GPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLP 615

Query: 156 EKEK--KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
           E     K  K  +  K  K     ++P    R E     K  K     K     K K+K 
Sbjct: 616 ELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKF 675

Query: 214 KEK--ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP-PPVAAADE 267
            +   + ++  KE     T     +  E   +E+ P+    P  +P P PP    DE
Sbjct: 676 YDDYLDAAAKSKETK---TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDE 729


>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
          Length = 648

 Score = 31.7 bits (73), Expect = 0.90
 Identities = 24/67 (35%), Positives = 37/67 (55%), Gaps = 9/67 (13%)

Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE-KDSEKEKKSK 198
           +E    K++ E    +E E+KE+K +K E      +D +K VE +R  E  ++EK +KS 
Sbjct: 514 EEIAYIKEQMEGSAPKEPEEKEKKPEKPEITI---DDFDK-VE-LRVAEVLEAEKVEKS- 567

Query: 199 DKDKKLK 205
             DK LK
Sbjct: 568 --DKLLK 572


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 31.5 bits (72), Expect = 0.91
 Identities = 18/119 (15%), Positives = 49/119 (41%), Gaps = 2/119 (1%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKEK-EKGKKEE 156
             +  IK     K      +     ++  + +  D  +++  +++ +    K  + +K+ 
Sbjct: 171 NLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQL 230

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           +E ++E ++       +   K + +E+I + EK  E+ +    K+    K K+   +K 
Sbjct: 231 EELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRGWSAKEISKLKAKVSLLQKL 289


>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
          Length = 111

 Score = 29.9 bits (67), Expect = 0.92
 Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 2/71 (2%)

Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
           K+K++  +KE    K E+E   KE K K EEK KK E+E          + + RKE +  
Sbjct: 16  KEKEESIDKEIANIKNEQENEIKEAKSKYEEKAKKTEEE--SLNMYNAALMEARKEAEKK 73

Query: 192 EKEKKSKDKDK 202
             E  +K K +
Sbjct: 74  AVEIINKAKQE 84



 Score = 28.4 bits (63), Expect = 3.3
 Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 1/70 (1%)

Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           I K+K+    KE   IK +++ + K+  K+K ++K KK +E+          E  K+ EK
Sbjct: 14  IIKEKEESIDKEIANIKNEQENEIKE-AKSKYEEKAKKTEEESLNMYNAALMEARKEAEK 72

Query: 170 EFKMKEDKEK 179
           +     +K K
Sbjct: 73  KAVEIINKAK 82


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 31.2 bits (71), Expect = 0.92
 Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
           E  +K+EKEKKE+ + ++++    K   E+ ++K  KE    + E    + D+  KK+K 
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYV-EYDRAGKKKKA 217

Query: 210 KKKKKE 215
           K K +E
Sbjct: 218 KSKYEE 223


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 30.7 bits (70), Expect = 0.96
 Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 8/60 (13%)

Query: 153 KKEEKEKKEEKE-KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           +K EK +K EKE +K  +E   K   +   +K  K +++        +  K++  + +K 
Sbjct: 40  RKMEKYQKREKEIQKRAREL-RKNGDKLSPKKFEKRQEE------LMEDQKEMMMDMMKP 92


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 31.3 bits (71), Expect = 0.97
 Identities = 41/182 (22%), Positives = 71/182 (39%), Gaps = 11/182 (6%)

Query: 42  PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDL 101
             S   D    + ++    + SS     PKEK+ S      L       +  +    ED+
Sbjct: 235 SESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDI 294

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE--- 158
              +  K   K K    ++ ++ I +KK       +K KE++KE+KE+E  + E +    
Sbjct: 295 DPDQVVKKPVKRKNRRGQRARQAIWEKKYGSGAKHVK-KEREKEQKEREGRQSEWEARQA 353

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD-------KKLKKEKLKK 211
           K+E  + K  +  +    + +      K  +  + +K S DK        KK  KEK   
Sbjct: 354 KREGGDAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKAN 413

Query: 212 KK 213
            K
Sbjct: 414 AK 415



 Score = 31.3 bits (71), Expect = 1.1
 Identities = 30/134 (22%), Positives = 55/134 (41%), Gaps = 19/134 (14%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK-----EEKEKKEEKEKKEEK 169
           +T  KK K K K KK KK + K  + + D+E+ E E   K     E+    EE+E  + +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202

Query: 170 EFKMKE-------DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK-------EKLKKKKKE 215
           ++   +       D+E+  E       +   E +S + D ++ +       E+     K+
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKK 262

Query: 216 KERSSHEKEVIPKL 229
            +        +P L
Sbjct: 263 PKEKKTSSTFLPSL 276


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 31.2 bits (71), Expect = 0.97
 Identities = 24/130 (18%), Positives = 58/130 (44%), Gaps = 15/130 (11%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           +  + + ++ + +  ++  K K + K   +D DK  N +   E  E +  + E K K+ E
Sbjct: 52  DELRKRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIE 111

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD--------KKLKKEKLKKKKK 214
           ++ +         D +        +  D + +    + +        +K+KKE+ ++K++
Sbjct: 112 EDARNS-------DADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKER 164

Query: 215 EKERSSHEKE 224
           E+E  + E+E
Sbjct: 165 EEEEKAAEEE 174


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 29.8 bits (67), Expect = 0.98
 Identities = 13/24 (54%), Positives = 20/24 (83%)

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEE 168
             +EK + K+EEK+K+EEKE++EE
Sbjct: 73  AAEEKAEEKEEEKKKEEEKEEEEE 96



 Score = 29.4 bits (66), Expect = 1.2
 Identities = 11/26 (42%), Positives = 21/26 (80%)

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKE 170
              E++  +KEE++KKEE++++EE+E
Sbjct: 72  AAAEEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 27.8 bits (62), Expect = 4.6
 Identities = 13/23 (56%), Positives = 19/23 (82%)

Query: 140 KEKDKEKKEKEKGKKEEKEKKEE 162
            E+  E+KE+EK K+EEKE++EE
Sbjct: 74  AEEKAEEKEEEKKKEEEKEEEEE 96


>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1.  This
           family consists of several acidic phosphoprotein
           precursor PCEMA1 sequences which appear to be found
           exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
           that is associated with the membrane of the infected
           erythrocyte throughout the entire intraerythrocytic
           cycle. The exact function of this family is unclear.
          Length = 286

 Score = 31.0 bits (70), Expect = 1.0
 Identities = 31/139 (22%), Positives = 53/139 (38%), Gaps = 9/139 (6%)

Query: 37  PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 96
           PP    P K +  V     T   E  +S      +  E       EL          +VT
Sbjct: 157 PPVPDMPKKQELPVEETLST-IHEDDASTLHEDDELDEEVTSYLNELDD--------EVT 207

Query: 97  SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
           S  + GE +++    + +     K+ E   + K+K  ++  + K       E E   + E
Sbjct: 208 SYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESE 267

Query: 157 KEKKEEKEKKEEKEFKMKE 175
            E +EE  +++E E K +E
Sbjct: 268 DEYEEEAGEEQENEDKGEE 286


>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin.  Amelogenins play a role in
           biomineralisation. They seem to regulate the formation
           of crystallites during the secretory stage of tooth
           enamel development. thought to play a major role in the
           structural organisation and mineralisation of developing
           enamel. They are found in the extracellular matrix.
           Mutations in X-chromosomal amelogenin can cause
           Amelogenesis imperfecta.
          Length = 174

 Score = 30.7 bits (69), Expect = 1.0
 Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 5/51 (9%)

Query: 8   PFPSVAGPGLIPRLPPMLQHPLIPQPLVHP----PGTP-PPSKMKDKVSPK 53
             P V   G  P L P  QHPL P    +P    P    PP + +    P+
Sbjct: 89  HHPMVPMTGQQPHLQPPAQHPLQPTYGQNPQPQQPTHTQPPVQPQQPADPQ 139


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 31.2 bits (71), Expect = 1.0
 Identities = 14/101 (13%), Positives = 38/101 (37%), Gaps = 2/101 (1%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           L + K+DK   ++K+   + + E +   + + +   + + +++ +K         KE   
Sbjct: 150 LEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASA 209

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
             E+   +E+K        E   ++   +    E+      
Sbjct: 210 LGEKAALEEQKALAEAAAAEAAKQEAAAKAAAQEQAALQAA 250



 Score = 28.9 bits (65), Expect = 5.1
 Identities = 17/96 (17%), Positives = 43/96 (44%), Gaps = 5/96 (5%)

Query: 133 DKDKIKNKEKDKEK-KEKEKGKKEEKEK----KEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
           DK  ++ +++DK+  +EK+   +++ E     + E E +       K +K   +  +  +
Sbjct: 146 DKKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAK 205

Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
           E  +  EK + ++ K L +    +  K++  +    
Sbjct: 206 EASALGEKAALEEQKALAEAAAAEAAKQEAAAKAAA 241


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
           E K+ K+++EF  +E KE      +K     + E K K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKK 195



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           +KD K K++    E+ KE       K    +K E K+KK
Sbjct: 159 RKDAKQKEEFA-AEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 1/39 (2%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           E+   K+++E   +E KE      KK     K  E K+K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQK-VETKKK 195



 Score = 29.7 bits (67), Expect = 2.4
 Identities = 10/34 (29%), Positives = 16/34 (47%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           K+K++   ++ K       KK     K E K+KK
Sbjct: 163 KQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196



 Score = 28.9 bits (65), Expect = 3.8
 Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 2/39 (5%)

Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           K+ K+K  E+   EE+       +K      K E  KKK
Sbjct: 160 KDAKQK--EEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 215

 Score = 30.7 bits (69), Expect = 1.1
 Identities = 14/70 (20%), Positives = 36/70 (51%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           +++ +K    T KK+ K +   KK + D  +      ++ K E+ +  K+  E++++   
Sbjct: 46  QEQLLKAGLVTSKKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAAL 105

Query: 166 KEEKEFKMKE 175
            +E + ++K+
Sbjct: 106 AKEYKAQVKQ 115


>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
          Length = 213

 Score = 30.7 bits (70), Expect = 1.1
 Identities = 14/73 (19%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
           K K + ++ +++ KK E   ++EK  + E++ K    K    ++  + +K   +     D
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPK---PKAPRKKRKPRAQKPEPQHTPVSD 163

Query: 200 KDKKLKKEKLKKK 212
             +    + +K K
Sbjct: 164 ISELTVGQAVKVK 176



 Score = 29.9 bits (68), Expect = 2.0
 Identities = 8/49 (16%), Positives = 26/49 (53%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           K K++ ++ ++   K +   + ++   +E+  K +  +K+ K + ++ E
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155



 Score = 29.5 bits (67), Expect = 2.5
 Identities = 9/60 (15%), Positives = 30/60 (50%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           K K + ++ + +  K +   +++   ++E++   K  ++K++ + +K E +     D  +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166



 Score = 29.1 bits (66), Expect = 3.8
 Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 1/70 (1%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           ++ K K + +  E+  +K+E    K++   ++ + + K  ++ K K   +KP  +     
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRK-KRKPRAQKPEPQHTPVS 162

Query: 189 KDSEKEKKSK 198
             SE      
Sbjct: 163 DISELTVGQA 172



 Score = 28.7 bits (65), Expect = 5.3
 Identities = 11/64 (17%), Positives = 28/64 (43%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           ++ ++  KK+E    K+K  ++++       + K K   +K + +     +  E    + 
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172

Query: 171 FKMK 174
            K+K
Sbjct: 173 VKVK 176



 Score = 28.7 bits (65), Expect = 5.6
 Identities = 10/71 (14%), Positives = 27/71 (38%), Gaps = 5/71 (7%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           +E K K + ++      E++ KK E   + ++   +  +   K  +   K +  K + + 
Sbjct: 104 EEAKAKVQAQR-----AEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158

Query: 204 LKKEKLKKKKK 214
                + +   
Sbjct: 159 TPVSDISELTV 169



 Score = 28.0 bits (63), Expect = 8.3
 Identities = 15/70 (21%), Positives = 34/70 (48%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
           K K + +  E++ K++    EKEK   +E+K + +   K+ K +  +   +    S+  +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166

Query: 196 KSKDKDKKLK 205
            +  +  K+K
Sbjct: 167 LTVGQAVKVK 176


>gnl|CDD|112704 pfam03904, DUF334, Domain of unknown function (DUF334).
           Staphylococcus aureus plasmid proteins with no
           characterized function.
          Length = 229

 Score = 30.8 bits (69), Expect = 1.1
 Identities = 26/112 (23%), Positives = 53/112 (47%), Gaps = 6/112 (5%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           I  D+  KK+++    +   K+     +K +  ++N+E  + KK+ +   K   E KE +
Sbjct: 6   IMSDEDEKKNEENMNMQMNSKLTGTHSQKIQKSLENEELQELKKQNKLIIKYIAEIKENQ 65

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           + +E++   +K + ++  E  +      +K KK  +    L +EKLK    E
Sbjct: 66  DIREKELKAIKSELKEATEDFQ------DKGKKIHNDFVDLLQEKLKHVDAE 111


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 30.7 bits (70), Expect = 1.1
 Identities = 15/75 (20%), Positives = 30/75 (40%), Gaps = 6/75 (8%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
            K    KE+E+   E   ++  +   E+K     +++        +EE D   E+  + +
Sbjct: 15  PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELD------EEEDDRFLEEYRRKR 68

Query: 201 DKKLKKEKLKKKKKE 215
             ++K    K K  E
Sbjct: 69  LAEMKALAEKSKFGE 83



 Score = 28.4 bits (64), Expect = 6.4
 Identities = 22/87 (25%), Positives = 36/87 (41%), Gaps = 14/87 (16%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF----------KMKED 176
           K    K++++   +   +E  E    KK   E  EE +++E+  F          +MK  
Sbjct: 16  KPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKAL 75

Query: 177 KEK----PVEKIRKEEKDSEKEKKSKD 199
            EK     V +I K +   E  + SKD
Sbjct: 76  AEKSKFGEVYEISKPDYVREVTEASKD 102


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 31.4 bits (71), Expect = 1.1
 Identities = 29/123 (23%), Positives = 58/123 (47%), Gaps = 10/123 (8%)

Query: 93  VKVTSSEDLGEIKKDKSIKK----DKKTHKKKEK-EKIKKKKDKKDKDKIKNKE-----K 142
           ++V + +   E   +  +K      KKT+   EK E  K+  +KK++ KIKN+       
Sbjct: 86  IEVYNKQFKHEQLWNSQVKGILTLGKKTNDDLEKIESKKESINKKNEKKIKNEASLQVKT 145

Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
            +E+KE++    +      +K ++  KE+     ++K   KI KE ++++         +
Sbjct: 146 QREEKEEKDFIADCWRNLYKKNEERFKEYLENFKRKKFKRKILKEFENAKINASEIVGLE 205

Query: 203 KLK 205
           K+K
Sbjct: 206 KVK 208



 Score = 31.1 bits (70), Expect = 1.5
 Identities = 25/140 (17%), Positives = 53/140 (37%), Gaps = 10/140 (7%)

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           K+  ++ EK E K+  + +  EK ++     +  +++E+K +          L KK +E+
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170

Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTKRESS----------PKIVIKPVKSPSPPPVAAAD 266
            +   E     K   K   + E      S            +IV +  ++P         
Sbjct: 171 FKEYLENFKRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRHNQTPLALLECNLT 230

Query: 267 EYDTGDSKQVWICPACGVQD 286
           ++D+ ++  +W     G  D
Sbjct: 231 DFDSIENHSIWEQKIVGSGD 250


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 31.4 bits (71), Expect = 1.2
 Identities = 24/112 (21%), Positives = 55/112 (49%), Gaps = 7/112 (6%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK-----ED 176
              +++ +  + K K+   E +  +KE E  + + + +++     E  E ++      E 
Sbjct: 55  MTLLQRAELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEV 114

Query: 177 KEKPVEKIRK--EEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
           + K +E++ K  E + +E E+++K    KL  E LK + +++++    KE I
Sbjct: 115 RLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESI 166



 Score = 28.4 bits (63), Expect = 9.1
 Identities = 31/122 (25%), Positives = 57/122 (46%), Gaps = 9/122 (7%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD---KIKNKEKDKE--KKEKEKGKKEEK 157
           E  + +  + +KK      ++        K K    ++K  E+D E  K  KE+  +  +
Sbjct: 202 EECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPE 261

Query: 158 EKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDSEKEKKSKDK--DKKLKKEKLKKKK 213
            ++E    +EE      MKED E   E++   +   E+ +K ++K  D +L+KEKL+ + 
Sbjct: 262 LERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENEL 321

Query: 214 KE 215
           K 
Sbjct: 322 KS 323


>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 137 to 196
           amino acids in length.
          Length = 149

 Score = 30.0 bits (68), Expect = 1.2
 Identities = 13/46 (28%), Positives = 19/46 (41%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
               +     E ++ +KK  K  +     K KDK+KK K K    E
Sbjct: 1   MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASE 46


>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
           to archaeal Holliday junction resolvase [Nucleotide
           transport and metabolism].
          Length = 175

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 18/53 (33%), Positives = 35/53 (66%), Gaps = 3/53 (5%)

Query: 137 IKNKEKDKEKKEKEKGKKEEKEK--KEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
           ++ K + K ++ +E  +K E+E+   E + +KEE E+K+KE  EK +E+ R++
Sbjct: 27  LQGKVESKARELEETLQKAERERLVNEAQARKEE-EWKLKEWIEKKIEEARED 78


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 11/43 (25%), Positives = 24/43 (55%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
           K+K+E+E+  +  +E   E  ++   + K KE+++   E+ R 
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRH 163



 Score = 29.7 bits (67), Expect = 2.8
 Identities = 8/40 (20%), Positives = 24/40 (60%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           K K+  +++ + + +   +  ++ ++K K K+++K E E+
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160



 Score = 29.3 bits (66), Expect = 3.3
 Identities = 9/40 (22%), Positives = 26/40 (65%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
           K K+++++   + ++   ++ E+ + ++K K+E+K + EE
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160


>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807).  This is
           a family of conserved fungal proteins of unknown
           function.
          Length = 169

 Score = 30.1 bits (68), Expect = 1.2
 Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 5/70 (7%)

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           E+E KEE E   E+E ++    +       +   D ++ ++     K  +K    K++K 
Sbjct: 68  ERELKEEAEA--EEEGEVDASPD-AGAVAGESSADRKEAEQQGAAQK--RKSCRDKERKS 122

Query: 216 KERSSHEKEV 225
            +      + 
Sbjct: 123 AKDPRGGTQD 132



 Score = 27.8 bits (62), Expect = 9.1
 Identities = 13/82 (15%), Positives = 35/82 (42%), Gaps = 3/82 (3%)

Query: 121 EKEKIKKKKD-KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           E   + ++++ K++ +  +  E D          +   ++KE +++   ++ K   DKE+
Sbjct: 61  EIHALLRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKER 120

Query: 180 --PVEKIRKEEKDSEKEKKSKD 199
               +     +   +K + S D
Sbjct: 121 KSAKDPRGGTQDVVDKSQASLD 142


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 29.6 bits (67), Expect = 1.2
 Identities = 28/109 (25%), Positives = 51/109 (46%), Gaps = 22/109 (20%)

Query: 126 KKKKDKKDKDKI--KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE-------- 175
            K    K KD I  ++K+  K KK+ +  K E+K K++ + +K +   K +         
Sbjct: 12  SKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDL 71

Query: 176 DKEKPVEKI------------RKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           +KE+ + K+            R  +K +EK  K   K  ++K+E+L K+
Sbjct: 72  EKERRLRKVAQKGVVKLFNAVRAAQKKTEKAVKEAGKKARVKREELLKE 120



 Score = 28.1 bits (63), Expect = 4.4
 Identities = 19/81 (23%), Positives = 37/81 (45%), Gaps = 6/81 (7%)

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
           K+   K K     R ++    K+K   +K +K  K +L+ +K++       K V+P+   
Sbjct: 13  KLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPED-- 70

Query: 232 KFGTDMEEKTKRESSPKIVIK 252
                 +E+  R+ + K V+K
Sbjct: 71  ----LEKERRLRKVAQKGVVK 87


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 31.2 bits (70), Expect = 1.2
 Identities = 32/134 (23%), Positives = 64/134 (47%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           +D KT  +KEK K  +     DK  ++NKE  +E  +     K+ +E     +K+EE+  
Sbjct: 474 EDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERML 533

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
           K  E+ E+    +R E +   +E   K  + K K +K ++  +  E    +KE   K+  
Sbjct: 534 KQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILE 593

Query: 232 KFGTDMEEKTKRES 245
               +++++ + ++
Sbjct: 594 NKCNNLKKQIENKN 607


>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
           II/NMEMI.  CRISPR loci appear to be mobile elements with
           a wide host range. This model represents a protein found
           only in CRISPR-containing species, near other
           CRISPR-associated proteins (cas), as part of the NMENI
           subtype of CRISPR/Cas locus. The species range so far
           for this protein is animal pathogens and commensals only
           [Mobile and extrachromosomal element functions, Other].
          Length = 805

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 11/55 (20%), Positives = 22/55 (40%)

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
           E  ++E+     ++  KE  +    + KE      ++  K+   E  SK+  K  
Sbjct: 521 EMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNILKLR 575



 Score = 30.9 bits (70), Expect = 1.6
 Identities = 12/76 (15%), Positives = 29/76 (38%), Gaps = 6/76 (7%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK----KEEKEFKM 173
                ++  K++ KK++DKI  KE      ++   ++  +   +   K     ++    M
Sbjct: 528 GTNFGKRNSKERYKKNEDKI--KEFASALGKEILKEEPTENSSKNILKLRLYYQQNGKCM 585

Query: 174 KEDKEKPVEKIRKEEK 189
              KE  ++ +     
Sbjct: 586 YTGKEIDIDDLFDLSY 601



 Score = 30.1 bits (68), Expect = 2.6
 Identities = 15/76 (19%), Positives = 30/76 (39%), Gaps = 1/76 (1%)

Query: 73  KEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKK 132
            +  KV +  +        IV   + E+ G     ++ K+  K ++ K KE       + 
Sbjct: 499 LQARKVVNELVKKYGPPDKIVIEMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEI 558

Query: 133 DKDKI-KNKEKDKEKK 147
            K++  +N  K+  K 
Sbjct: 559 LKEEPTENSSKNILKL 574



 Score = 28.9 bits (65), Expect = 6.3
 Identities = 22/108 (20%), Positives = 42/108 (38%), Gaps = 1/108 (0%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           D   KK K T+KK  K     +       + +  +   EK      K   K +K   +K 
Sbjct: 321 DLLFKKKKLTYKKLRKLLGLSEDAIFKGLRYEGLDNA-EKAFNISLKTYHKLRKALGDKD 379

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
                K  +D ++ V+ +   +     +K+ +     L +E++KK  +
Sbjct: 380 LLDNPKNPKDLDEIVKILTLYKDREMIKKRLELYKDVLNEEQVKKLVR 427



 Score = 28.5 bits (64), Expect = 7.7
 Identities = 11/53 (20%), Positives = 22/53 (41%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
           E  +E++    GK+  KE+ ++ E K ++       +    E      K+  K
Sbjct: 521 EMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNILK 573


>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
          Length = 819

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 13/86 (15%), Positives = 38/86 (44%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
           L +  ++++I K  +  K+   ++      ++ ++K+  + +   + ++E  K+  K+ K
Sbjct: 439 LKKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVK 498

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRK 186
           +  E       +     E+  EK+  
Sbjct: 499 KIPEVDTYLLLEELGINEETYEKLEA 524


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 15/50 (30%), Positives = 18/50 (36%)

Query: 22  PPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPK 71
           P  +  P  P      P  PP  K  +K  PK K   K +     K  PK
Sbjct: 87  PATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPK 136


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 29.9 bits (68), Expect = 1.3
 Identities = 18/62 (29%), Positives = 40/62 (64%), Gaps = 1/62 (1%)

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
           K +KE   +KE+K+ + +  K++ +    + E+  E+ +K K + +K +++K K+++KEK
Sbjct: 70  KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEK-RTRKNREKKFKRRQKEK 128

Query: 217 ER 218
           E+
Sbjct: 129 EK 130



 Score = 28.8 bits (65), Expect = 3.2
 Identities = 15/52 (28%), Positives = 33/52 (63%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
           K+K  K+ +   K++   ++++++ K+  +K K   K++EKK K + K++EK
Sbjct: 79  KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130



 Score = 28.4 bits (64), Expect = 3.6
 Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 3/61 (4%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           +    +EKEK KK+K    K +I  + + +  KE+    ++EK  ++ +EKK ++  K K
Sbjct: 72  EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERA---EKEKRTRKNREKKFKRRQKEK 128

Query: 175 E 175
           E
Sbjct: 129 E 129



 Score = 27.2 bits (61), Expect = 8.2
 Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTH-KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
           KV       + KK K  +  KK     + + +  K++ +K+K   KN+EK  ++++KEK 
Sbjct: 70  KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129

Query: 153 K 153
           K
Sbjct: 130 K 130


>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 205

 Score = 30.5 bits (69), Expect = 1.3
 Identities = 15/80 (18%), Positives = 30/80 (37%), Gaps = 2/80 (2%)

Query: 105 KKDKSIKKDKKTHKKKEKEK-IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           +  +S+ + K    K      ++       K+      +  E++ ++   K ++ K    
Sbjct: 20  QLGQSLAELKLRLDKDALLAGLEDALALVQKEAALEALQRAEQRLQQA-AKAKRAKDAAA 78

Query: 164 EKKEEKEFKMKEDKEKPVEK 183
              E K F  K  KEK V+ 
Sbjct: 79  NAAEGKAFLEKNAKEKGVKT 98


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 30.6 bits (70), Expect = 1.3
 Identities = 29/87 (33%), Positives = 49/87 (56%), Gaps = 10/87 (11%)

Query: 139 NKEKDKEKKEKEKGKKEE--KEKKE-EKEKKE-EKEFKMKE----DKEKPVEKIRKEEKD 190
            KE +  KKE     KEE  K + E E+E KE   E + +E     KE+ ++  RK+E  
Sbjct: 44  KKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLD--RKDESL 101

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            +KE+  ++K+K+L   + + ++KE+E
Sbjct: 102 EKKEESLEEKEKELAARQQQLEEKEEE 128


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 30.6 bits (69), Expect = 1.3
 Identities = 37/144 (25%), Positives = 63/144 (43%), Gaps = 16/144 (11%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
           +I+ K D+ +K       KD+ K E+E  + EEK  KE  ++ EE    + E+K + VE+
Sbjct: 5   EIRNKTDELEKL------KDEIKAEEEALESEEKHLKEYDKEMEE----LLEEKMQHVEE 54

Query: 184 IRKEE---KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF---KFGTDM 237
           +R+      D E E K    + + +  K+ +   E          + KL     +     
Sbjct: 55  LRQIHADINDMETEIKQSKSELERRMGKIARMHGEYNPLKESINEMRKLELGLEELPQLH 114

Query: 238 EEKTKRESSPKIVIKPVKSPSPPP 261
           EE+ ++ S     + PV  P PP 
Sbjct: 115 EEERQKISPATSPVPPVPLPDPPA 138


>gnl|CDD|223494 COG0417, PolB, DNA polymerase elongation subunit (family B) [DNA
           replication, recombination, and repair].
          Length = 792

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 20/128 (15%), Positives = 48/128 (37%), Gaps = 8/128 (6%)

Query: 89  VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
            P +VK    E L  +   + +++    + +   EK++  +   +K  I  +   +   E
Sbjct: 638 WPELVKEFQREVLEVLLSGRDVEEALAKYVRDVIEKLRNGEVPIEKLVISKRLT-RPLSE 696

Query: 149 KEKGK------KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
            +  K         +++    +  +   + + + K K VE+    E   E+     D + 
Sbjct: 697 YKANKPHVVVAARLRKRGINVKPGDRIPYVIVKGKGKLVERAEPPELVDEENSP-IDYEY 755

Query: 203 KLKKEKLK 210
            + K+ L 
Sbjct: 756 YITKQLLP 763


>gnl|CDD|191973 pfam08232, Striatin, Striatin family.  Striatin is an intracellular
           protein which has a caveolin-binding motif, a
           coiled-coil structure, a calmodulin-binding site, and a
           WD (pfam00400) repeat domain. It acts as a scaffold
           protein and is involved in signalling pathways.
          Length = 133

 Score = 29.6 bits (67), Expect = 1.4
 Identities = 21/77 (27%), Positives = 43/77 (55%), Gaps = 7/77 (9%)

Query: 102 GEIKKDKSIKKDKKTH-------KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
           GE +  +++K+D K          K+E+ K+KK K   D  +  +KE+++E+ E++  ++
Sbjct: 39  GERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQKPSKEEEEEELEEDDDEE 98

Query: 155 EEKEKKEEKEKKEEKEF 171
              EK +E E K+ + +
Sbjct: 99  SNSEKVDESELKKSRLY 115


>gnl|CDD|153332 cd07648, F-BAR_FCHO, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of FCH domain Only proteins.
            F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization. Proteins in
           this group have been named FCH domain Only (FCHO)
           proteins. Vertebrates have two members, FCHO1 and FCHO2.
           These proteins contain an F-BAR domain and a C-terminal
           domain of unknown function named SAFF which is also
           present in endophilin interacting protein 1. F-BAR
           domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 261

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 14/50 (28%), Positives = 24/50 (48%)

Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
           K K+    +  + E+  +E    +E EK E K KK + E+K   +K   +
Sbjct: 127 KAKEAYHARCLELERLRRENASPKEIEKAEAKLKKAQDEYKALVEKYNNI 176


>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 145

 Score = 29.7 bits (67), Expect = 1.4
 Identities = 5/37 (13%), Positives = 15/37 (40%), Gaps = 3/37 (8%)

Query: 130 DKKDKDKIKN---KEKDKEKKEKEKGKKEEKEKKEEK 163
            K D     +      ++   E  + + +++E+  +K
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKK 140


>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
           homolog [Translation, ribosomal structure and
           biogenesis].
          Length = 395

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 11/61 (18%), Positives = 26/61 (42%)

Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
            + D I  +E+ +++ EK K K  +   K + E+ +++       K+K  +   +     
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394

Query: 192 E 192
            
Sbjct: 395 N 395



 Score = 30.8 bits (70), Expect = 1.5
 Identities = 15/52 (28%), Positives = 31/52 (59%), Gaps = 1/52 (1%)

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
            ++E+ EK +EK++++       K   ++DKK +  + ++KKKEK+  S  +
Sbjct: 341 SLREELEKRIEKLKEKPPKPPT-KAKPERDKKERPGRYRRKKKEKKAKSERR 391



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 1/50 (2%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           +K+  K KEK      K K ++DK +   +   +K+KEK  K E+   + 
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGR-YRRKKKEKKAKSERRGLQN 395



 Score = 29.3 bits (66), Expect = 4.2
 Identities = 14/46 (30%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
           EK+K+K  K        ++K KE+  + + KK+EK+ K E+   + 
Sbjct: 351 EKLKEKPPKPPTKAKPERDK-KERPGRYRRKKKEKKAKSERRGLQN 395



 Score = 28.5 bits (64), Expect = 7.0
 Identities = 13/53 (24%), Positives = 26/53 (49%), Gaps = 1/53 (1%)

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKDKDKKLKKEKLKKKKK 214
           +E+ E++  K+KE   KP  K + E    E+  +  + K +K  K + +  + 
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395


>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
           CCCH-type Zn-finger protein [General function prediction
           only].
          Length = 299

 Score = 30.8 bits (69), Expect = 1.4
 Identities = 25/137 (18%), Positives = 60/137 (43%), Gaps = 1/137 (0%)

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKDKK 203
           +K  K++ +  +K  ++ +++ E+K F +K       V+ I K+ +    +++ + K+K 
Sbjct: 4   KKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEKM 63

Query: 204 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVA 263
             +EK ++ +K+  R+  + + +    F   T  +    + +  K   +  + P      
Sbjct: 64  RMEEKRREPEKQVIRAGVDPKTVVCALFLNKTCAKGDACKFAHGKEEARKTEKPDLYSDV 123

Query: 264 AADEYDTGDSKQVWICP 280
              E D    K+ WI  
Sbjct: 124 RDKEEDVPLGKRPWINT 140


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 31.1 bits (71), Expect = 1.5
 Identities = 21/67 (31%), Positives = 36/67 (53%)

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
            + E +K EE +   +E  +  +EK   E +  EKEKK  D+  KLK+ ++ + +KE E 
Sbjct: 393 VRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEA 452

Query: 219 SSHEKEV 225
              E ++
Sbjct: 453 EVDEDDI 459


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 30.5 bits (69), Expect = 1.5
 Identities = 17/114 (14%), Positives = 65/114 (57%), Gaps = 9/114 (7%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK-------EKKEEKEKK 166
           K    + ++ + K ++ + + ++ +   +++ ++E+++ + +++       ++ E++ ++
Sbjct: 71  KIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQ 130

Query: 167 EEKEFKMKEDKEKPVEKIRK--EEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
            E+  KM+E+     E +R+  EE+  E  +++ +++ +L++E ++ K + + R
Sbjct: 131 NEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEAR 184



 Score = 29.3 bits (66), Expect = 3.9
 Identities = 24/115 (20%), Positives = 55/115 (47%), Gaps = 8/115 (6%)

Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
           K ++K +  E + + KE E  + +    K E+ + E +E +    ++   E+ R + +D 
Sbjct: 58  KMQEKTRQAELEAKIKEYEAQQAQ---AKLERARVEAEERRKTLQEQTQQEQQRAQYQDE 114

Query: 192 EKEKKSKDK--DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 244
              K+ + +   ++ + E+L K ++E   S   +E + + T +   +M  +T  E
Sbjct: 115 LARKRYQKELEQQRRQNEELLKMQEE---SVLRQEAMRRATEEEILEMRRETIEE 166


>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2).  Surfeit locus
           protein 2 is part of a group of at least six sequence
           unrelated genes (Surf-1 to Surf-6). The six Surfeit
           genes have been classified as housekeeping genes, being
           expressed in all tissue types tested and not containing
           a TATA box in their promoter region. The exact function
           of SURF2 is unknown.
          Length = 244

 Score = 30.7 bits (69), Expect = 1.5
 Identities = 22/116 (18%), Positives = 46/116 (39%), Gaps = 10/116 (8%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE--------KGKKEEKEKKEE 162
           +K ++   +++  +   + +K   D  +    D++  + E              K   +E
Sbjct: 129 QKQRRREDQEDGVRQPGRTEKSGSDFWEPPSSDEDDSDSEDSMSDLYPPELFTLKNPGKE 188

Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           +   E+ +F+  ++ E  VE    ++K S+K+  S    KK KK   KK       
Sbjct: 189 QNGDEDDDFETDDEDEMEVESPELQQKRSKKQ--SGSLTKKFKKNHKKKGPFRFSS 242


>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
           Provisional.
          Length = 719

 Score = 30.9 bits (70), Expect = 1.5
 Identities = 24/103 (23%), Positives = 45/103 (43%), Gaps = 17/103 (16%)

Query: 82  ELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE 141
           E G  P +P +V + S+          S KK++K  +           D+ +     +  
Sbjct: 395 ERGAPPRIPVMVNMASA--------SVSSKKNQKPQESS------PSLDQTNAASRNSVM 440

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
            D++  + E+ +  E E++ E  K E K+  M+E+   P +KI
Sbjct: 441 MDEDSDDDEEFQIPESEQEPETTKNETKDTAMEEE---PQDKI 480


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 30.6 bits (69), Expect = 1.5
 Identities = 17/72 (23%), Positives = 35/72 (48%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           +DK    ++  EK          D  K  +   +K + E+   ++ +  EEKE KE++E 
Sbjct: 212 QDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEE 271

Query: 172 KMKEDKEKPVEK 183
           +  E++E+  ++
Sbjct: 272 EETEEEEEEEDE 283



 Score = 29.0 bits (65), Expect = 4.5
 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 4/71 (5%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDK----IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           K+K   +  EKE       + D  K       K  D+E  +++    EEKE KE++E++E
Sbjct: 214 KNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEE 273

Query: 168 EKEFKMKEDKE 178
            +E + +ED++
Sbjct: 274 TEEEEEEEDED 284


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 30.5 bits (69), Expect = 1.5
 Identities = 26/152 (17%), Positives = 75/152 (49%), Gaps = 11/152 (7%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           +++++E  ++ +  ++  +   KE ++ ++   + +++ K+++EE +  E+K  +++E+ 
Sbjct: 4   EREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEEN 63

Query: 178 EKPVEKIRKEEKDSEK-EKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 236
            +  E+    E++ E+ E +  +   ++ K + +++KKE E    ++E+          +
Sbjct: 64  RRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELR---------E 114

Query: 237 MEEKTKRESSPKIVIKPVKSPSPPPVAAADEY 268
            +E  +R     +        +PP VAA    
Sbjct: 115 AQEAHERARQELLEAAA-APTAPPHVAAPVNG 145


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 29.5 bits (66), Expect = 1.6
 Identities = 21/99 (21%), Positives = 53/99 (53%), Gaps = 7/99 (7%)

Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR-------KEEKDSE 192
           +++  E   +E+ ++ ++  +E +E+++ K  KM+E++E   + IR       KEE + E
Sbjct: 24  EDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDKYGIKKKEEDEEE 83

Query: 193 KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
            +  +++++ +L ++K   ++   E    ++E   K  F
Sbjct: 84  PQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEEEKSGF 122


>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
           transcription initiation/nucleotide excision repair
           factor TFIIH, subunit TFB3 [Cell division and chromosome
           partitioning / Transcription / DNA replication,
           recombination, and repair].
          Length = 314

 Score = 30.8 bits (69), Expect = 1.6
 Identities = 25/116 (21%), Positives = 52/116 (44%), Gaps = 6/116 (5%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK----KEKEKGKKEEKEKKEEKEKK 166
           +  +       +EK+KK ++      + N E+   +    K+K   +K  K KK + E++
Sbjct: 119 ELLELIDVSLTEEKVKKYEEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLERQ 178

Query: 167 EEKEFKMKED--KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
            E+E +M ++  K++   ++    +D  K  K      K   E+L +   E  R++
Sbjct: 179 IEEEERMNKEWTKKEIGNRLGTASEDGNKTIKIGIISDKFDPEELPRIVVEPTRNN 234


>gnl|CDD|129262 TIGR00158, L9, ribosomal protein L9.  Ribosomal protein L9 appears
           to be universal in, but restricted to, eubacteria and
           chloroplast [Protein synthesis, Ribosomal proteins:
           synthesis and modification].
          Length = 148

 Score = 29.7 bits (67), Expect = 1.6
 Identities = 13/61 (21%), Positives = 22/61 (36%), Gaps = 1/61 (1%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE-DKEKPVEKIRKEEKDSEKEKKSKDKDK 202
           K+ +EK    K    + +E  +        K  D+ K    I  ++     +    D DK
Sbjct: 52  KKLEEKLAANKAAAARLKEVLELGTLTISKKVGDEGKLFGSITTKQIADALKAAGLDLDK 111

Query: 203 K 203
           K
Sbjct: 112 K 112


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 31.1 bits (71), Expect = 1.6
 Identities = 31/127 (24%), Positives = 57/127 (44%), Gaps = 32/127 (25%)

Query: 127 KKKDKKD-------KDKIKNKEKDKEKKEKEK------------------GKKEEKEKKE 161
           +KK KK+       K K    E+ K K+E E                    K   K    
Sbjct: 171 RKKSKKEVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMT 230

Query: 162 EKEKKEEKEFKMKE---DKE-KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
            +EK +E + +++E   D+  +P ++ + EE   E  K+  ++ KKL+ E+L++ + E+E
Sbjct: 231 PEEKDDEYDQRVRELTFDRRAQPTDRTKTEE---ELAKEEAERLKKLEAERLRRMRGEEE 287

Query: 218 RSSHEKE 224
               E++
Sbjct: 288 DDEEEED 294



 Score = 30.7 bits (70), Expect = 1.8
 Identities = 22/92 (23%), Positives = 46/92 (50%), Gaps = 8/92 (8%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKD-KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           K +    +T K   K  +  ++   + D +++    D+  +  ++ K EE+  KEE E+ 
Sbjct: 212 KDLMSLLRTVKPPPKPPMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERL 271

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
           ++ E        + + ++R EE+D E+E+ SK
Sbjct: 272 KKLE-------AERLRRMRGEEEDDEEEEDSK 296



 Score = 30.4 bits (69), Expect = 2.3
 Identities = 25/121 (20%), Positives = 60/121 (49%), Gaps = 11/121 (9%)

Query: 118 KKKEKEKIK-----------KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           KK E E+++           ++  K+  D + ++ +  +      G+ EE E++EE    
Sbjct: 272 KKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVD 331

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
           +E E    +D E+  E +   +++ ++E +  D +   ++E+ +K+KK+K+ +   +  +
Sbjct: 332 DEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391

Query: 227 P 227
           P
Sbjct: 392 P 392



 Score = 28.4 bits (64), Expect = 10.0
 Identities = 25/101 (24%), Positives = 42/101 (41%), Gaps = 7/101 (6%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
            D  D D + +   D        G+ +E++++E+ E+K+ K+  MKE   K   K  K E
Sbjct: 135 DDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVIAK--SKFYKAE 192

Query: 189 KDSEKEKKSK-----DKDKKLKKEKLKKKKKEKERSSHEKE 224
           +   KE+        D D K     L+  K   +     +E
Sbjct: 193 RQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTPEE 233


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 28.9 bits (65), Expect = 1.6
 Identities = 21/59 (35%), Positives = 33/59 (55%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           K+K KK    K K K K+K+K K+E    K+EE++   E + KE +E +  E  E+  +
Sbjct: 11  KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEED 69



 Score = 28.5 bits (64), Expect = 2.0
 Identities = 21/74 (28%), Positives = 38/74 (51%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           K K I   KK  KKK+K K K++   + +++ K+  +   K+ +E    E+ E++E+   
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMN 72

Query: 166 KEEKEFKMKEDKEK 179
             E E   +E + K
Sbjct: 73  LTEAERAFEEAQRK 86


>gnl|CDD|237554 PRK13909, PRK13909, putative recombination protein RecB;
           Provisional.
          Length = 910

 Score = 31.1 bits (71), Expect = 1.6
 Identities = 24/102 (23%), Positives = 41/102 (40%), Gaps = 21/102 (20%)

Query: 113 DKKTHKKKEKEKIKKKKD-------------------KKDKDKIKNKEKDKEKKEKEKGK 153
           DK   +  EKEK  K ++                   KKD+      E   + K  E+G+
Sbjct: 663 DKDYARALEKEKALKYEEEINVLYVAFTRAKNSLIVVKKDESSGSMFEI-LDLKPLERGE 721

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
            E KE K    KKE     +K       E++++ E++ +++ 
Sbjct: 722 IEIKEPKISP-KKESLITSVKLKPHGYQEQVKEIEEEPKEDN 762


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 30.7 bits (70), Expect = 1.6
 Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 8/56 (14%)

Query: 179 KPVEKI--------RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
           KP+E+I         +E K+ E EK+  + +KK  ++K KK  K+ +      E +
Sbjct: 61  KPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEEL 116


>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
           [Translation, ribosomal structure and biogenesis].
          Length = 335

 Score = 30.7 bits (70), Expect = 1.7
 Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 14/87 (16%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIK-----KKKDKKD-KDKIKNKEKDKEKKE--- 148
              LG + K      +  +   K  E++K     KK    D   K+      +E+KE   
Sbjct: 3   MIALGRLAKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEVGA 62

Query: 149 -----KEKGKKEEKEKKEEKEKKEEKE 170
                K++ +    E   E E     E
Sbjct: 63  LINELKKEVEDAITELTPELEAAGLWE 89



 Score = 29.1 bits (66), Expect = 5.4
 Identities = 17/92 (18%), Positives = 30/92 (32%), Gaps = 9/92 (9%)

Query: 127 KKKDKKDKDKIKNKEKDKEK-KEKEKGKKEE-KEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
            KK     +   +  K  E+ K K  GKK    +  ++  K    E       E+     
Sbjct: 10  AKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLE-------ERKEVGA 62

Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
              E   E E    +   +L+   L ++   +
Sbjct: 63  LINELKKEVEDAITELTPELEAAGLWERLAFE 94


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 30.6 bits (69), Expect = 1.7
 Identities = 19/141 (13%), Positives = 60/141 (42%)

Query: 85  VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 144
           V P   N        +L  + +     ++ +   ++E+EK K+++  K   K +      
Sbjct: 128 VPPYTRNDFDRREMPELPGVTRSDYDVRELRQKIREEREKEKEEQPMKPAFKPERWMGGP 187

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
           +  E E    EE      +  K++++ + +   E+ + ++ +    + +E++   + ++ 
Sbjct: 188 DSDEDENPLDEEAPDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQE 247

Query: 205 KKEKLKKKKKEKERSSHEKEV 225
            +E +++   +++    + E 
Sbjct: 248 LEENVERDSDDEDEHGEDSED 268


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 29.7 bits (67), Expect = 1.7
 Identities = 20/94 (21%), Positives = 44/94 (46%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
            K+   +     H  K  +K+ K   +++         +   + +++ KKE +E   + E
Sbjct: 48  IKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSE 107

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
              E+  K ++  +K  +++RK+  + EKE+K K
Sbjct: 108 GLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 23/99 (23%), Positives = 47/99 (47%), Gaps = 2/99 (2%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDK--EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           +  E+IK  K +   +   NK  DK  +   +E+   E     E   + +++  K  E+ 
Sbjct: 43  ERMEEIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEA 102

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
               E + +E K    + K + K+ + K ++L+K++K+K
Sbjct: 103 NGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141


>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16.  Nop16 is a
           protein involved in ribosome biogenesis.
          Length = 173

 Score = 29.7 bits (67), Expect = 1.7
 Identities = 34/145 (23%), Positives = 65/145 (44%), Gaps = 16/145 (11%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK-------------KEKEK 151
           KK K+   + K ++K+ K K +KKK     + I  K  DK+K                  
Sbjct: 3   KKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGLAADPNTA 62

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
            + +   + +   + ++ E +++ D +        E  DS++E++  +  K+L++E  + 
Sbjct: 63  ARTDLLIQDKPIPELKKGEARIERDDK---GNPLDEPLDSKEEEEKPEVVKQLEEEASEP 119

Query: 212 KKKEKERSSHEKEVIPKLTFKFGTD 236
            KK +  S  E+E I +L  K G D
Sbjct: 120 AKKPRRLSKREEEWIERLVEKHGDD 144


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 26/123 (21%), Positives = 51/123 (41%), Gaps = 9/123 (7%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           IV V   E+       +      +   ++E      + +K      K K+K  +KK K  
Sbjct: 125 IVFVDDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLY 184

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
            + +E++++E+K KK E+  +++ +  K  +          K+KK           K KK
Sbjct: 185 KELKERKEREKKLKKVEQRLELQRELMKKGKG---------KKKKIVKDKDGKVVYKWKK 235

Query: 212 KKK 214
           ++K
Sbjct: 236 ERK 238


>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
          Length = 499

 Score = 30.8 bits (69), Expect = 1.8
 Identities = 14/53 (26%), Positives = 21/53 (39%)

Query: 8   PFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKE 60
             P +A    +P  P  +Q   +  P V PP   PP   + +V P    +  E
Sbjct: 412 LLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQPPHVQQPRVLPSTDGASNE 464


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 30.1 bits (68), Expect = 1.8
 Identities = 17/72 (23%), Positives = 33/72 (45%), Gaps = 2/72 (2%)

Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
            E  +K+   + E ++F          +    +++  EK K   + +++LK   L+KKK 
Sbjct: 44  AELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEVNERLK--ALEKKKA 101

Query: 215 EKERSSHEKEVI 226
           E +    E+E I
Sbjct: 102 ELKDLEEEREGI 113


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 17/57 (29%), Positives = 29/57 (50%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           KK H        K KK KK K K+ +K+   +K   ++ K   K KK ++ ++ +K+
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKK 298


>gnl|CDD|218108 pfam04487, CITED, CITED.  CITED, CBP/p300-interacting
           transactivator with ED-rich tail, are characterized by a
           conserved 32-amino acid sequence at the C-terminus.
           CITED proteins do not bind DNA directly and are thought
           to function as transcriptional co-activators.
          Length = 206

 Score = 29.9 bits (67), Expect = 1.8
 Identities = 13/57 (22%), Positives = 15/57 (26%), Gaps = 13/57 (22%)

Query: 3   PFQFSPFPSVAGPGLIP-------------RLPPMLQHPLIPQPLVHPPGTPPPSKM 46
             +  PF  V GP L+                     HP    P    PG   P  M
Sbjct: 85  SSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGAGQPPGM 141


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 28.8 bits (65), Expect = 1.8
 Identities = 10/21 (47%), Positives = 16/21 (76%)

Query: 150 EKGKKEEKEKKEEKEKKEEKE 170
               +E+KE++EE+E+KEE E
Sbjct: 74  AAAAEEKKEEEEEEEEKEESE 94



 Score = 28.4 bits (64), Expect = 2.6
 Identities = 7/23 (30%), Positives = 17/23 (73%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKE 170
                +K+E+E++EE++++ E+E
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEE 96



 Score = 27.6 bits (62), Expect = 5.0
 Identities = 7/23 (30%), Positives = 17/23 (73%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKE 170
                +++++E++EE+EK+E +E
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEE 95



 Score = 27.2 bits (61), Expect = 6.4
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 145 EKKEKEKGKKEEKEKKEEKE 164
              E++K ++EE+E+KEE E
Sbjct: 75  AAAEEKKEEEEEEEEKEESE 94



 Score = 27.2 bits (61), Expect = 7.4
 Identities = 9/22 (40%), Positives = 16/22 (72%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEE 162
               E+K++E+ ++EEKE+ EE
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEE 95


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 30.7 bits (70), Expect = 1.9
 Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 9/106 (8%)

Query: 77  KVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKK------KKD 130
           +VK  ELG        ++V    D  E+K++    +++    ++E EKIKK      KK 
Sbjct: 307 EVKAKELGSPAGTKTEIEVGV--DFPELKEELKELEEELKELEEELEKIKKLLKKLPKKA 364

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEFKMKE 175
           +      K ++ +K  + KEK  +E +E +EE ++ KEE E    E
Sbjct: 365 RGQLPPEKREQLEKLLETKEKLSEELEELEEELKELKEELESLYSE 410



 Score = 30.3 bits (69), Expect = 2.2
 Identities = 24/131 (18%), Positives = 59/131 (45%), Gaps = 20/131 (15%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
           ++K++  + +++  + +E+ ++ K+  K   ++   +   EK+E+ E K+ E KEK  E+
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLE-KLLETKEKLSEE 389

Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH---EKEVIPKLTFKFGTDMEEK 240
           + + E+               + ++LK++ +           K + P +    G  +  +
Sbjct: 390 LEELEE---------------ELKELKEELESLYSEGKISVNKTIYPGVKIHIGNKV-LR 433

Query: 241 TKRESSPKIVI 251
            KRE  P   +
Sbjct: 434 IKREIGPCTFV 444


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 30.3 bits (68), Expect = 1.9
 Identities = 27/105 (25%), Positives = 58/105 (55%), Gaps = 6/105 (5%)

Query: 131 KKDKDKIKNKEKDKEKKE--KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE- 187
           KKD+  I+ +E++  K++  +E+ K++ +E+K E  K  E+E K + + +K    +    
Sbjct: 44  KKDRITIQEREREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANI 103

Query: 188 ---EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
              + D E E++  +  K  + +++K+ ++E+E    EK  I K+
Sbjct: 104 DDVDTDDENEEEEYEAWKLRELKRIKRDREEREEMEREKAEIEKM 148


>gnl|CDD|147348 pfam05124, S_layer_C, S-layer like family, C-terminal region. 
          Length = 223

 Score = 30.1 bits (68), Expect = 1.9
 Identities = 25/104 (24%), Positives = 35/104 (33%), Gaps = 11/104 (10%)

Query: 75  YSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK 134
           Y  VK +ELG   +          +  G+++       ++KT     K       D KD 
Sbjct: 15  YKDVKTLELGEEYIPDWEAYAVLDDGKGKLEYKDDEVLEQKTVGLALKYVGDYLDDIKDG 74

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           DKIK     K                E+   K    F+M E KE
Sbjct: 75  DKIKIANYAKLLF-----------DDEDSSDKLNVYFEMDEYKE 107


>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
           recombination, and repair].
          Length = 1041

 Score = 30.8 bits (70), Expect = 1.9
 Identities = 14/85 (16%), Positives = 26/85 (30%), Gaps = 12/85 (14%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK---------EDKEKPVEKIRKEEKDSEKE 194
           +   E      +EK  + +    EE+  K            D  K  ++   + K     
Sbjct: 8   RVPSEYGFVSTKEKRVERKAATAEEESLKNIDISDLFERSVDMNKDPKEATDQVKAV--- 64

Query: 195 KKSKDKDKKLKKEKLKKKKKEKERS 219
            K  + +      +L  +   KE S
Sbjct: 65  DKHSEIESLNVGMRLDVRLSSKELS 89


>gnl|CDD|184294 PRK13743, PRK13743, conjugal transfer protein TrbF; Provisional.
          Length = 141

 Score = 29.4 bits (66), Expect = 2.0
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
            K ++   MKE K     KIR+ ++D + E++  D 
Sbjct: 4   YKYQRPLSMKEKKNDAELKIRQTKRDYDYERRVSDI 39


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 29.7 bits (67), Expect = 2.0
 Identities = 19/66 (28%), Positives = 36/66 (54%)

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
           KD + K+ E+ K  EKE  E++  ++E + ++ E + K +E  R+     EK++    + 
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161

Query: 202 KKLKKE 207
            +LK E
Sbjct: 162 ARLKSE 167



 Score = 28.6 bits (64), Expect = 5.8
 Identities = 24/83 (28%), Positives = 44/83 (53%), Gaps = 3/83 (3%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           KD +T K +E++ ++K+  +K+   I  +E D E  E E+   E + +    ++K+E E 
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKE---ISRQELDSEIAELERKILELQRQAALLKEKKEAED 158

Query: 172 KMKEDKEKPVEKIRKEEKDSEKE 194
           K     +    KI++E  D+E E
Sbjct: 159 KEIARLKSEASKIKQELVDAELE 181


>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe.  JAK1 is a
           tyrosine kinase protein essential in signaling type I
           and type II cytokines. It interacts with the gamma chain
           of type I cytokine receptors to elicit signals from the
           IL-2 receptor family, the IL-4 receptor family, the
           gp130 receptor family, ciliary neurotrophic factor
           receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
           Leptin-R). It also is involved in transducing a signal
           by type I (IFN-alpha/beta) and type II (IFN-gamma)
           interferons, and members of the IL-10 family via type II
           cytokine receptors. JAK (also called Just Another
           Kinase) is a family of intracellular, non-receptor
           tyrosine kinases that transduce cytokine-mediated
           signals via the JAK-STAT pathway. The JAK family in
           mammals consists of 4 members: JAK1, JAK2, JAK3 and
           TYK2. JAKs are composed of seven JAK homology (JH)
           domains (JH1-JH7) . The C-terminal JH1 domain is the
           main catalytic domain, followed by JH2, which is often
           referred to as a pseudokinase domain, followed by
           JH3-JH4 which is homologous to the SH2 domain, and
           lastly JH5-JH7 which is a FERM domain.  Named after
           Janus, the two-faced Roman god of doorways, JAKs possess
           two near-identical phosphate-transferring domains; one
           which displays the kinase activity (JH1), while the
           other negatively regulates the kinase activity of the
           first (JH2). The FERM domain has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
           C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs) , the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 198

 Score = 29.8 bits (67), Expect = 2.0
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKE 148
           EK KK K KK+K K K  E  K+ +E
Sbjct: 90  EKKKKGKSKKNKLKGKKDEDKKKARE 115



 Score = 29.0 bits (65), Expect = 3.7
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
            K       KKKK K  K+K+K K+ + +KK +E
Sbjct: 82  RKPATTAVEKKKKGKSKKNKLKGKKDEDKKKARE 115


>gnl|CDD|221805 pfam12848, ABC_tran_2, ABC transporter.  This domain is related to
           pfam00005.
          Length = 85

 Score = 28.3 bits (64), Expect = 2.0
 Identities = 19/63 (30%), Positives = 35/63 (55%), Gaps = 11/63 (17%)

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEF-----------KMKEDKEKPVEKIRKEEKDSEK 193
           E+KE+   ++E+  +K++KE K+ +EF           K  + + K +EK+ + EK  EK
Sbjct: 20  EQKEERLEQQEKAYEKQQKEIKKLEEFIDRFRAKASKAKQAQSRIKALEKMERIEKPVEK 79

Query: 194 EKK 196
           + K
Sbjct: 80  KPK 82


>gnl|CDD|220261 pfam09484, Cas_TM1802, CRISPR-associated protein TM1802
           (cas_TM1802).  Clusters of short DNA repeats with
           non-homologous spacers, which are found at regular
           intervals in the genomes of phylogenetically distinct
           prokaryotic species, comprise a family with recognisable
           features. This family is known as CRISPR (short for
           Clustered, Regularly Interspaced Short Palindromic
           Repeats). A number of protein families appear only in
           association with these repeats and are designated Cas
           (CRISPR-Associated) proteins. This minor cas protein is
           found in at least five prokaryotic genomes:
           Methanosarcina mazei, Sulfurihydrogenibium azorense,
           Thermotoga maritima, Carboxydothermus hydrogenoformans,
           and Dictyoglomus thermophilum, the first of which is
           archaeal while the rest are bacterial.
          Length = 584

 Score = 30.4 bits (69), Expect = 2.1
 Identities = 24/151 (15%), Positives = 53/151 (35%), Gaps = 22/151 (14%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKK----------------KEKEKIKKKKDKKDKD 135
           ++K+    +  EI+     + DKKT +K                    KI    ++    
Sbjct: 39  VLKIDFDTEENEIEIVIGEEFDKKTARKYLYKGQAGNGNSSQWYSPTNKITYDPEETLNK 98

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK---EEKDSE 192
           K+K+  K   K + E  K     K  +K  ++  E  +K+  +    +          ++
Sbjct: 99  KLKSIFKKFYKDKGEINKYRYFLKDIKKVLEKNFEKIIKDLIDLKKNEGVLYTIYFLKND 158

Query: 193 KEKKSKDKD---KKLKKEKLKKKKKEKERSS 220
            +K   D +    K+  +  ++   +K +  
Sbjct: 159 GKKYLSDYEEYRNKVLYKIEQRYFSKKSKEE 189


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 30.5 bits (69), Expect = 2.1
 Identities = 15/46 (32%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 3   PFQFSPFPSVAGPGLIPRLPPM-LQHPLIPQPLVHPPGTPPPSKMK 47
           P Q  P      PGL  ++PP   Q P   Q    P   PPP    
Sbjct: 237 PQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQP 282


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 30.2 bits (69), Expect = 2.1
 Identities = 9/35 (25%), Positives = 21/35 (60%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
            +++K     + +    + ++EE+E++EE+E  EE
Sbjct: 285 DEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319



 Score = 29.8 bits (68), Expect = 2.6
 Identities = 12/39 (30%), Positives = 23/39 (58%)

Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
            DKD +  + K+    + +    EE+E++EE+E++EE  
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 30.1 bits (68), Expect = 2.2
 Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 5/73 (6%)

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKERSSHEKEVIP 227
           K+F   +  E   E      +  + +KK + + KK K  E+LK  +K+ ++   EK    
Sbjct: 66  KDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEK---- 121

Query: 228 KLTFKFGTDMEEK 240
           +L+    T + E+
Sbjct: 122 QLSSSLQTKLTEE 134



 Score = 29.7 bits (67), Expect = 2.7
 Identities = 14/60 (23%), Positives = 27/60 (45%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           + EKE      + +   K +  E  K+K+ +E    ++K  +  KEK+     + K  +E
Sbjct: 75  EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLTEE 134



 Score = 28.6 bits (64), Expect = 5.7
 Identities = 9/41 (21%), Positives = 26/41 (63%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           K + +K+   +  + ++ K+KE+ + ++K++ EE +  ++K
Sbjct: 73  KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113


>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
          Length = 159

 Score = 29.2 bits (66), Expect = 2.2
 Identities = 23/94 (24%), Positives = 48/94 (51%), Gaps = 3/94 (3%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           +KIK   D +  +     EK  E ++K +  K + E++ +  K+E K  K+ E+ +   E
Sbjct: 28  DKIKAVIDSRQSEIDNKIEKADEDQKKARELKLKNERELKNAKEEGK--KIVEEYKSKAE 85

Query: 183 KIRKE-EKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
            + +E  K++ +E     +  KL+ ++ K+K + 
Sbjct: 86  NVYEEIVKEAHEEADLIIERAKLEAQREKEKAEY 119


>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein.  This is a family of
           121-amino acid secretory proteins. Laps functions in the
           regulation of neuronal cell adhesion and/or movement and
           synapse attachment. Laps binds to the ApC/EBP (Aplysia
           CCAAT/enhancer binding protein) promoter and activates
           the transcription of ApC/EBP mRNA.
          Length = 124

 Score = 29.0 bits (65), Expect = 2.3
 Identities = 29/104 (27%), Positives = 53/104 (50%), Gaps = 2/104 (1%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           K  K   KE  + KK     D     +  K+    +  ++ +++KK+E  + EE + KM+
Sbjct: 15  KRKKNAPKELARLKKALGLIDGNVLMKDVKDIATVKPAEEVKEKKKKEGTESEEDDEKME 74

Query: 175 EDKEKPV--EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
            D +  V  +K  +++        ++ + KKLKK++ KKKKK+K
Sbjct: 75  VDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKK 118


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 30.3 bits (69), Expect = 2.3
 Identities = 18/119 (15%), Positives = 44/119 (36%), Gaps = 14/119 (11%)

Query: 97  SSEDLGEIKK-DKSIKKDKKTH------KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
            + D+ ++ + D+  +K  +        + +  ++I +   + + D  +   + KE KEK
Sbjct: 24  DALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK 83

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK-------IRKEEKDSEKEKKSKDKD 201
            K  +   ++ E +             +  PV K       +R+  +        + KD
Sbjct: 84  LKELEAALDELEAELDTLLLTIPNIPHESVPVGKDEDDNVEVRRWGEPPVFVFDFEPKD 142


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 30.4 bits (67), Expect = 2.4
 Identities = 32/109 (29%), Positives = 63/109 (57%), Gaps = 4/109 (3%)

Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK----EEKEF 171
           T + ++ EK+KK + KK + +I    + ++K   +K  ++E  + E + KK    EE+  
Sbjct: 61  TEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHK 120

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
           + K ++E+  ++  K+E D +K +  K K+K +K EK +K +K K+ S+
Sbjct: 121 QWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKAST 169


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 30.3 bits (69), Expect = 2.4
 Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 3/70 (4%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           K+     +++ EK++K   K ++   KN    K+ K  E  ++       EK   E KE 
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASL---EKRIDEAKEL 297

Query: 172 KMKEDKEKPV 181
             K   E P+
Sbjct: 298 IAKYGNEVPL 307



 Score = 28.4 bits (64), Expect = 8.5
 Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 2/56 (3%)

Query: 111 KKDKKTHKKKEK--EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           +K +K  K   K  EK++K  + K K+K+   E+     EK   + +E   K   E
Sbjct: 249 EKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAKYGNE 304


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 30.4 bits (68), Expect = 2.4
 Identities = 34/150 (22%), Positives = 61/150 (40%), Gaps = 3/150 (2%)

Query: 32  QPLVHPPGTPPPSKMKDKVSPKD-KTSPKEKMSSKEKTSPKEKEYSKVKDVELG-VTPMV 89
                 PG P  ++ +   S  D K+    K  SK  +   E   S     + G  +  V
Sbjct: 126 PNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGKASASV 185

Query: 90  PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
             IV                +++  +T+      K   +K +++ D +KN + DK  +E+
Sbjct: 186 AGIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEIDILKN-DGDKVAEEE 244

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           E    EE E+ EE +K+  ++   + D+EK
Sbjct: 245 EFELDEEHEEAEEDKKEALEKIGAEGDEEK 274


>gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination,
           and repair / Cell division and chromosome partitioning /
           Translation].
          Length = 953

 Score = 30.3 bits (68), Expect = 2.5
 Identities = 30/147 (20%), Positives = 49/147 (33%), Gaps = 8/147 (5%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE-- 161
            ++ K I   K T   + +EK       ++        KD + K  E   K         
Sbjct: 412 YEQKKLIGSIKPTLMDQLQEKKSPDLPDEEFIDTLALPKDLDMKNHELFLKRFANDLGLS 471

Query: 162 -EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-----KEKKSKDKDKKLKKEKLKKKKKE 215
             K  K +  + ++ D         +EE +SE     K           ++E+ + +K  
Sbjct: 472 ISKAIKSKGNYSLEMDIASDSPDEDEEEFESEVDSIRKIPDKYVNIIVEEEEENETEKTV 531

Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTK 242
             R    KE        F TD EEK +
Sbjct: 532 NLRFPGWKERYYTSKLHFTTDSEEKIR 558


>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV.  Bacterial DNA
           topoisomerase IV, GyrA, ParC.
          Length = 444

 Score = 30.2 bits (69), Expect = 2.5
 Identities = 18/90 (20%), Positives = 32/90 (35%), Gaps = 20/90 (22%)

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK------- 202
            + K+    K E +     +   +       +  + +  KD  KE K K  ++       
Sbjct: 340 TRRKEYLLGKLEAERLHILEGLFIALSIIDEIIVLIRSSKDLAKEAKEKLMERFELSEIQ 399

Query: 203 -------------KLKKEKLKKKKKEKERS 219
                        KL+ EKL+K+ KE E+ 
Sbjct: 400 ADAILDMRLRRLTKLEVEKLEKELKELEKE 429


>gnl|CDD|153283 cd07599, BAR_Rvs167p, The Bin/Amphiphysin/Rvs (BAR) domain of
           Saccharomyces cerevisiae Reduced viability upon
           starvation protein 167 and similar proteins.  BAR
           domains are dimerization, lipid binding and curvature
           sensing modules found in many different proteins with
           diverse functions. This subfamily is composed of fungal
           proteins with similarity to Saccharomyces cerevisiae
           Reduced viability upon starvation protein 167 (Rvs167p)
           and Schizosaccharomyces pombe Hob1 (homolog of Bin1). S.
           cerevisiae Rvs167p plays a role in regulation of the
           actin cytoskeleton, endocytosis, and sporulation. It
           forms a heterodimer with another BAR domain protein
           Rvs161p. Rvs161p and Rvs167p share common functions but
           are not interchangeable. Their BAR domains cannot be
           replaced with each other and the overexpression of one
           cannot suppress the mutant phenotypes of the other.
           Rvs167p also interacts with the GTPase activating
           protein (GAP) Gyp5p, which is involved in ER to Golgi
           vesicle trafficking. BAR domains form dimers that bind
           to membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 216

 Score = 29.5 bits (67), Expect = 2.5
 Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 13/81 (16%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKK-DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           K IKK +KT        IKK+  KK D DK++NK     +K+KE   K+EK    +  K 
Sbjct: 107 KYIKKIRKT--------IKKRDHKKLDYDKLQNKLNKLLQKKKELSLKDEK----QLAKL 154

Query: 167 EEKEFKMKEDKEKPVEKIRKE 187
           E K  + KE+ E   E ++ E
Sbjct: 155 ERKLEEAKEEYEALNELLKSE 175


>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
          Length = 701

 Score = 30.4 bits (68), Expect = 2.5
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           EK   K+  D+ ++D  K  E  +E K +E  + +E +KK+ K+ KE  +    ++K KP
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKP 267

Query: 181 VEKIRKEEKDSEKEK 195
           +    ++ KD  KE+
Sbjct: 268 LWT--RDPKDVTKEE 280


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 30.0 bits (68), Expect = 2.5
 Identities = 31/137 (22%), Positives = 63/137 (45%), Gaps = 19/137 (13%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD---------KKDKDKIKNKEKDKEKKEK 149
           E+L  +K+D  +   +       K K++ +KD         K+ +D++++ +  +  + K
Sbjct: 151 ENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAK 210

Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKE----------DKEKPVEKIRKEEKDSEKEKKSKD 199
           EK KK  +E   + +K EE E +++E           K +   +I + EK  E+ +    
Sbjct: 211 EKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTF 270

Query: 200 KDKKLKKEKLKKKKKEK 216
           K+ +  KE+LK  +   
Sbjct: 271 KEIEKLKEQLKLLQSLT 287


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 16/65 (24%), Positives = 25/65 (38%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
             K     K   K KEK   ++KK + D DK +  E D +  E+    +E        + 
Sbjct: 59  PRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDI 118

Query: 166 KEEKE 170
             E +
Sbjct: 119 DNEDD 123



 Score = 29.3 bits (66), Expect = 3.0
 Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 3/92 (3%)

Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE--FKMKEDKEKPVEK 183
           + KK +    +     K  +KK+KEK  +EEK+ + + +K EE E      E+     E 
Sbjct: 50  QPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQES 109

Query: 184 IRK-EEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
                  D + E    D D KL  E + K  K
Sbjct: 110 ASANSLSDIDNEDDMDDSDLKLATENIIKDLK 141


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 30.1 bits (68), Expect = 2.5
 Identities = 31/196 (15%), Positives = 71/196 (36%), Gaps = 14/196 (7%)

Query: 44  SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLG- 102
            K  + +    +       S+ +      K      D     T         T++ ++  
Sbjct: 196 QKAAEYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPP 255

Query: 103 -EIKKDKSIKKDKKTHKKKEKEKIKKKK---DKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
            E+  ++   K+++   +  +E+I + +   ++   +      K K+  E+    ++E E
Sbjct: 256 AEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENE 315

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKI---------RKEEKDSEKEKKSKDKDKKLKKEKL 209
           + EE+ K +++  ++  D E  V K+         R  E   + E        + +  K 
Sbjct: 316 ELEEEYKIKKRTVELLPDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKE 375

Query: 210 KKKKKEKERSSHEKEV 225
           K + KE E      E+
Sbjct: 376 KNRNKEDETQRQLDEI 391


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 28.0 bits (63), Expect = 2.5
 Identities = 11/36 (30%), Positives = 22/36 (61%)

Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
           +K KEKKE+ +  KEE  ++ +KE++  ++  +   
Sbjct: 50  DKVKEKKEEREEDKEELIERIKKEEETFEDVDLGSA 85



 Score = 27.3 bits (61), Expect = 4.7
 Identities = 13/31 (41%), Positives = 23/31 (74%)

Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
           +K +EK+ + +EDKE+ +E+I+KEE+  E  
Sbjct: 50  DKVKEKKEEREEDKEELIERIKKEEETFEDV 80


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 28.5 bits (64), Expect = 2.5
 Identities = 8/23 (34%), Positives = 17/23 (73%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKE 170
                ++EE+E++EE+E++E +E
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEE 94



 Score = 28.5 bits (64), Expect = 2.8
 Identities = 8/23 (34%), Positives = 18/23 (78%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKE 170
              + ++EE+E++EE+E++ E+E
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEE 95



 Score = 27.7 bits (62), Expect = 5.0
 Identities = 9/22 (40%), Positives = 17/22 (77%)

Query: 147 KEKEKGKKEEKEKKEEKEKKEE 168
              E+ ++EE+E++EE+E+ EE
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEE 94



 Score = 27.3 bits (61), Expect = 7.1
 Identities = 9/24 (37%), Positives = 18/24 (75%)

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEE 168
               +E+ ++EE+E++EE+E +EE
Sbjct: 72  AAAAEEEEEEEEEEEEEEEESEEE 95


>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 350 to 473 amino acids in length. This domain is
           found associated with pfam07923.
          Length = 402

 Score = 30.0 bits (68), Expect = 2.6
 Identities = 21/78 (26%), Positives = 36/78 (46%)

Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
            K+ + K   K+  EE+E  MK ++    E +  +  D  +  K    +      + + K
Sbjct: 12  SKKVRMKLATKQLWEEREKFMKYERGWDTESLVGDPLDISESVKELKLEMYTSLAEDQPK 71

Query: 213 KKEKERSSHEKEVIPKLT 230
           K+E ER S + E IP+L 
Sbjct: 72  KEEIERLSTDSEEIPRLD 89


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 29.6 bits (67), Expect = 2.6
 Identities = 34/96 (35%), Positives = 58/96 (60%), Gaps = 2/96 (2%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK--DKEKKEKEKGKKEEKEKKEEKEKKEE 168
             + K  + +EKEK  K   K +  K+K+ EK   K  K KEK KK+ K++ +E+++K E
Sbjct: 111 LDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVE 170

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
           K+   ++ K +   K RK++K ++K+KK+K K + L
Sbjct: 171 KKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 30.1 bits (68), Expect = 2.6
 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 1/48 (2%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           K  KE  K K  K+D  ++   E  + KKE  + + E +E K   + K
Sbjct: 58  KLNKEVAKLKIAKEDATELI-AETKELKKEITEKEAEVQEAKAALDAK 104



 Score = 29.7 bits (67), Expect = 3.4
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 7/74 (9%)

Query: 107 DKSIKKDKKTHKKK-EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           D+ I  DK+  +++ E + ++K+ +K       NKE  K K  KE   +   E KE K++
Sbjct: 33  DEVIALDKEWRQRQFELDSLRKEFNKL------NKEVAKLKIAKEDATELIAETKELKKE 86

Query: 166 KEEKEFKMKEDKEK 179
             EKE +++E K  
Sbjct: 87  ITEKEAEVQEAKAA 100


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.9 bits (67), Expect = 2.6
 Identities = 24/114 (21%), Positives = 52/114 (45%), Gaps = 3/114 (2%)

Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
           +K  ++  E  + ++  +E KEK EE +K++E+  K  E+ E   E++++  K  E E  
Sbjct: 121 LKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENS 180

Query: 197 SKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 250
             ++  K    ++   KK  +      E+  +      +D+ ++T   +   I 
Sbjct: 181 RLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELI---SDLVKETLNLAPKDIE 231


>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
          Length = 1119

 Score = 30.2 bits (68), Expect = 2.7
 Identities = 25/93 (26%), Positives = 42/93 (45%), Gaps = 2/93 (2%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
            K +K   K  K+++K++     K+++   EK  EK +KE+   ++ E  +        +
Sbjct: 577 SKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKKEREADDDPEHLDSFPILSLSD 636

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
              KE +E P  K+ K   DS KE    D D  
Sbjct: 637 LN-KETEEIPT-KLYKLSSDSLKENMDLDSDGG 667



 Score = 29.8 bits (67), Expect = 3.8
 Identities = 25/126 (19%), Positives = 52/126 (41%), Gaps = 25/126 (19%)

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
            +    EK+  K       K+E +++     KE+ + +M++  EK      K+E++++ +
Sbjct: 574 VESSKYEKEFNKLV-----KDELKERLSHLTKEQVD-EMEKAYEK-----FKKEREADDD 622

Query: 195 KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKT----KRESSPKIV 250
            +  D    L    L K          E E IP   +K  +D  ++         S  ++
Sbjct: 623 PEHLDSFPILSLSDLNK----------ETEEIPTKLYKLSSDSLKENMDLDSDGGSVTVL 672

Query: 251 IKPVKS 256
           + P++S
Sbjct: 673 VHPIES 678


>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB. Topo IB proteins include the
           monomeric yeast and human topo I and heterodimeric topo
           I from Leishmania donvanni. Topo I enzymes are divided
           into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  In addition to differences in
           structure and some biochemical properties,
           Trypanosomatid parasite topos I differ from human topo I
           in their sensitivity to CPTs and other classical topo I
           inhibitors. Trypanosomatid topos I have putative roles
           in organizing the kinetoplast DNA network unique to
           these parasites.  This family may represent more than
           one structural domain.
          Length = 217

 Score = 29.5 bits (66), Expect = 2.7
 Identities = 17/59 (28%), Positives = 33/59 (55%), Gaps = 7/59 (11%)

Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           S +KD K  ++ +           D   IKN  +++++K+K   K+E++ KK+E+ K+E
Sbjct: 68  SFEKDHKFIRRCKLSDA-------DFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119


>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
          Length = 374

 Score = 30.0 bits (67), Expect = 2.7
 Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 8/76 (10%)

Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE-------EKDS 191
           + E D E +      K E+E+  +KE++E+++ +  E+ E+ VE + +E       EKD 
Sbjct: 83  DNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEE-VEVVEEEYDDDEDSEKDD 141

Query: 192 EKEKKSKDKDKKLKKE 207
           EKE  ++  + +L  E
Sbjct: 142 EKESDAEGDENELAGE 157


>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein.  PACS-1 is a
           cytosolic sorting protein that directs the localisation
           of membrane proteins in the trans-Golgi network
           (TGN)/endosomal system. PACS-1 connects the clathrin
           adaptor AP-1 to acidic cluster sorting motifs contained
           in the cytoplasmic domain of cargo proteins such as
           furin, the cation-independent mannose-6-phosphate
           receptor and in viral proteins such as human
           immunodeficiency virus type 1 Nef.
          Length = 413

 Score = 30.2 bits (68), Expect = 2.7
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query: 197 SKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
           +K+K+KK+    L KK KEKE  S  + +
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCI 359


>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
           eukaryotic RNase H complex which cleaves RNA-DNA
           hybrids.  Ribonuclease H2B is one of the three proteins
           of eukaryotic RNase H2 complex that is required for
           nucleic acid binding and hydrolysis. RNase H is
           classified into two families, type I (prokaryotic RNase
           HI, eukaryotic RNase H1 and viral RNase H) and type II
           (prokaryotic RNase HII and HIII, and eukaryotic RNase
           H2/HII). RNase H endonucleolytically hydrolyzes an RNA
           strand when it is annealed to a complementary DNA strand
           in the presence of divalent cations, in DNA replication
           and repair. The enzyme can be found in bacteria,
           archaea, and eukaryotes. Most prokaryotic and eukaryotic
           genomes contain multiple RNase H genes. Despite a lack
           of evidence for homology from sequence comparisons, type
           I and type II RNase H share a common fold and similar
           steric configurations of the four acidic active-site
           residues, suggesting identical or very similar catalytic
           mechanisms. Eukaryotic RNase HII is active during
           replication and is believed to play a role in removal of
           Okazaki fragment primers and single ribonucleotides in
           DNA-DNA duplexes. Eukaryotic RNase HII is functional
           when it forms a complex with RNase H2B and RNase H2C
           proteins. It is speculated that the two accessory
           subunits are required for correct folding of the
           catalytic subunit of RNase HII. Mutations in the three
           subunits of human RNase HII cause neurological disorder.
          Length = 211

 Score = 29.6 bits (67), Expect = 2.7
 Identities = 19/67 (28%), Positives = 40/67 (59%), Gaps = 10/67 (14%)

Query: 161 EEKEKKEEKEFKMKEDK-----EKPVEKIRK-----EEKDSEKEKKSKDKDKKLKKEKLK 210
           + KE+ ++K +K  ++K      K VE+++K     +E ++ ++KK  +  ++ KK+K K
Sbjct: 128 DVKEEGDDKFYKYSDEKLLAWLLKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKK 187

Query: 211 KKKKEKE 217
           K  K+K+
Sbjct: 188 KSAKKKK 194



 Score = 28.8 bits (65), Expect = 4.2
 Identities = 19/54 (35%), Positives = 33/54 (61%), Gaps = 1/54 (1%)

Query: 114 KKTHK-KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
           KK  + KK++  IK+ +  +DK K  N E++K+KK+K+  KK++ +K      K
Sbjct: 151 KKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 29.1 bits (66), Expect = 2.8
 Identities = 8/52 (15%), Positives = 22/52 (42%), Gaps = 6/52 (11%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
               + +++ K  +        KEK + + E ++ + + KE + E +    +
Sbjct: 65  AAAPRAELQAKIAR------YKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110


>gnl|CDD|153364 cd07680, F-BAR_PACSIN1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons 1 (PACSIN1).  F-BAR
           domains are dimerization modules that bind and bend
           membranes and are found in proteins involved in membrane
           dynamics and actin reorganization. Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins,
           also called Synaptic dynamin-associated proteins
           (Syndapins), act as regulators of cytoskeletal and
           membrane dynamics. Vetebrates harbor three isoforms with
           distinct expression patterns and specific functions.
           PACSIN 1 or Syndapin I is expressed specifically in the
           brain and is localized in neurites and synaptic boutons.
           It binds the brain-specific proteins dynamin I,
           synaptojanin, synapsin I, and neural Wiskott-Aldrich
           syndrome protein (nWASP), and functions as a link
           between the cytoskeletal machinery and synaptic vesicle
           endocytosis. PACSIN 1 interacts with huntingtin and may
           be implicated in the neuropathology of Huntington's
           disease. It contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
          Length = 258

 Score = 29.6 bits (66), Expect = 2.8
 Identities = 25/99 (25%), Positives = 50/99 (50%), Gaps = 3/99 (3%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE---EKEKKEEKEFKMKEDKEK 179
           +++K     +D +K+KN +KD   K+   G KE KE ++   + +K   K+ K  E  +K
Sbjct: 81  QEVKNNLLNEDLEKVKNWQKDAYHKQIMGGFKETKEAEDGFRKAQKPWAKKMKELEAAKK 140

Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
                 KEEK +   + +   ++ +  E+ KK + + ++
Sbjct: 141 AYHLACKEEKLAMTREANSKAEQSVTPEQQKKLQDKVDK 179


>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103).  This
           family consists of several repeats of around 30 residues
           in length which are found specifically in
           mature-parasite-infected erythrocyte surface antigen
           proteins from Plasmodium falciparum. This family often
           found in conjunction with pfam00226.
          Length = 215

 Score = 29.4 bits (65), Expect = 2.8
 Identities = 36/146 (24%), Positives = 78/146 (53%)

Query: 78  VKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI 137
           +++V+  +   V + +K   +E   ++K  + I ++ K   KK+ E   K+ D +  DK+
Sbjct: 70  IEEVKEEIKKQVEDGIKENDTEGNDKVKGPEIITEEVKEEIKKQVEDGIKENDTEGNDKV 129

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           K  E   E+ ++E  K+ E+  KE   + ++K    +   E+  E+I+K+ ++  KE  +
Sbjct: 130 KGPEIITEEVKEEIKKQVEEGIKENDTEGKDKLIGPEIITEEVKEEIKKQVEEGIKENDT 189

Query: 198 KDKDKKLKKEKLKKKKKEKERSSHEK 223
           ++KDK + +E + ++ K++     EK
Sbjct: 190 ENKDKVIGQEIITEEVKKEIEKQEEK 215



 Score = 28.7 bits (63), Expect = 4.8
 Identities = 44/178 (24%), Positives = 84/178 (47%)

Query: 48  DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKD 107
           DK   K+K   K+      + + KE +    K V+      V   +K   +E   ++K  
Sbjct: 7   DKEDVKEKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGNDKVKGP 66

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           + I ++ K   KK+ E   K+ D +  DK+K  E   E+ ++E  K+ E   KE   +  
Sbjct: 67  EIIIEEVKEEIKKQVEDGIKENDTEGNDKVKGPEIITEEVKEEIKKQVEDGIKENDTEGN 126

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
           +K    +   E+  E+I+K+ ++  KE  ++ KDK +  E + ++ KE+ +   E+ +
Sbjct: 127 DKVKGPEIITEEVKEEIKKQVEEGIKENDTEGKDKLIGPEIITEEVKEEIKKQVEEGI 184


>gnl|CDD|227376 COG5043, MRS6, Vacuolar protein sorting-associated protein
           [Intracellular trafficking and secretion].
          Length = 2552

 Score = 30.2 bits (68), Expect = 2.8
 Identities = 23/95 (24%), Positives = 39/95 (41%), Gaps = 5/95 (5%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
            +K++K  K + K   KEK  E   K           E++     K FK + +  +   K
Sbjct: 305 VMKQQKFLKYRPKSTPKEKPLEWF-KYIILVVLDSIHEKRYHWTWKYFKERCEDRRAYMK 363

Query: 184 I-RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
           I RK+  + +    S D  K+L+K +L     +  
Sbjct: 364 IIRKKYLNEQL---SADLSKELEKRELTYDLIKYR 395


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 30.3 bits (68), Expect = 2.8
 Identities = 21/71 (29%), Positives = 41/71 (57%), Gaps = 1/71 (1%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           +   KD +   K  +EKI   +D+ +K++  ++E    K  + + K E  E++E  +K +
Sbjct: 663 ELRAKDLEAEHKSLQEKISSLEDELEKERQNHQE-LIAKCRELEEKIERAEQEENMQKLD 721

Query: 168 EKEFKMKEDKE 178
           E+E K+K++KE
Sbjct: 722 EEEQKIKQEKE 732


>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
           [Transcription].
          Length = 182

 Score = 29.5 bits (66), Expect = 2.8
 Identities = 17/86 (19%), Positives = 33/86 (38%), Gaps = 3/86 (3%)

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKE---KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
           +K+K    D  + E EK +KE  +     E  +K + + F M ++  +   K    + + 
Sbjct: 97  EKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVIKGEP 156

Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKE 217
            +E   K +        L  + K   
Sbjct: 157 LEEVIDKLEYPVESPADLLHEIKIHL 182


>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
           repeat [General function prediction only].
          Length = 561

 Score = 29.8 bits (67), Expect = 2.9
 Identities = 13/73 (17%), Positives = 32/73 (43%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
           +++      KK+   +  + + + K +  +KK +  +   +     ++ E  +  K + E
Sbjct: 489 SANRSSAPRKKELVEQWPEYSDEDKIRSLLKKLRAIEALKERMRSGEELEVIQVNKIETE 548

Query: 156 EKEKKEEKEKKEE 168
           E+   E KE  E+
Sbjct: 549 EEVLSELKELVED 561


>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
          Length = 444

 Score = 30.1 bits (68), Expect = 2.9
 Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 5/109 (4%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           IV     E      +   +  D +   ++E+EK  +K  +      K     KE KE   
Sbjct: 224 IVNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETSIAKESKELPD 283

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
           G +E +   EE+ K  E  FK     E P+  + +  K  E +++   +
Sbjct: 284 G-EEIEFGNEERFKAPEILFK----PELPISGLEEAGKIDESKQELVAE 327



 Score = 28.2 bits (63), Expect = 9.7
 Identities = 25/122 (20%), Positives = 48/122 (39%), Gaps = 14/122 (11%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK-- 179
           KE++ +  D+     +  +E+ +E++EK   K  E   +  KE    KE K   D E+  
Sbjct: 229 KEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETSIAKESKELPDGEEIE 288

Query: 180 ---------PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV-IPKL 229
                    P    + E   S  E+    K  + K+E + +  +    +       +P+L
Sbjct: 289 FGNEERFKAPEILFKPELPISGLEEAG--KIDESKQELVAENYEISPTNLGNDIAGLPEL 346

Query: 230 TF 231
            +
Sbjct: 347 VY 348


>gnl|CDD|205150 pfam12925, APP_E2, E2 domain of amyloid precursor protein.  The E2
           domain is the largest of the conserved domains of the
           amyloid precursor protein. The structure of E2 consists
           of two coiled-coil sub-structures connected through a
           continuous helix, and bears an unexpected resemblance to
           the spectrin family of protein structures.E 2 can
           reversibly dimerise in solution, and the dimerisation
           occurs along the longest dimension of the molecule in an
           antiparallel orientation, which enables the N-terminal
           substructure of one monomer to pack against the
           C-terminal substructure of a second monomer. The high
           degree of conservation of residues at the putative dimer
           interface suggests that the E2 dimer observed in the
           crystal could be physiologically relevant. Heparin
           sulfate proteoglycans, the putative ligands for the
           precursor present in extracellular matrix, bind to E2 at
           a conserved and positively charged site near the dimer
           interface.
          Length = 193

 Score = 29.3 bits (66), Expect = 2.9
 Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 2/55 (3%)

Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           D E +  ++ KK  +EK  E+  +  KE++  E + K + K   + K ++  +K 
Sbjct: 22  DNEHERYKEAKKRLEEKHRERMTQVMKEWEEAESQYKNLPK--ADPKAAQLMRKE 74


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 28.8 bits (65), Expect = 3.0
 Identities = 19/78 (24%), Positives = 47/78 (60%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           K+ + + K  +++ ++KE+E+  +E K  +EE++++ EK+ K +++  K +E+   +E+D
Sbjct: 29  KRKQQRRKKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEED 88

Query: 191 SEKEKKSKDKDKKLKKEK 208
           +E E     +D + +   
Sbjct: 89  AETEDTEDVEDDEWEGFP 106


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 29.8 bits (67), Expect = 3.0
 Identities = 15/127 (11%), Positives = 52/127 (40%), Gaps = 4/127 (3%)

Query: 118 KKKEKEKIKKKKD-KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
           K+ E    +  +D K  + +++ +   K  ++  + K    E + E    + +  +  E 
Sbjct: 232 KETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAETRREAEQ 291

Query: 177 -KEKPVEKIRKEEKDSEKEKKSKD--KDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
            +    + I++E+  +E+E +     + ++++   ++++K+ +         I     + 
Sbjct: 292 AEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQA 351

Query: 234 GTDMEEK 240
             + +  
Sbjct: 352 QEEAKAA 358


>gnl|CDD|224291 COG1372, COG1372, Intein/homing endonuclease [DNA replication,
           recombination, and repair].
          Length = 420

 Score = 30.0 bits (67), Expect = 3.0
 Identities = 17/91 (18%), Positives = 39/91 (42%), Gaps = 2/91 (2%)

Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
              K  K+ K  +   K KEK+K   ++   K    ++  K + +   + +   +  +++
Sbjct: 307 YAYKLGKAAKIAEALEKLKEKKKNINRRILRKIAIREVLLKLEGRLLSKLKLLLEILRKE 366

Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           K  KE +  K+    ++    +E+I +   D
Sbjct: 367 KLLKEDENLKKLSKGDEYFDEIEEIEEIGYD 397


>gnl|CDD|233469 TIGR01564, S_layer_MJ, S-layer protein, MJ0822 family.  This model
           represents one of several families of proteins
           associated with the formation of prokaryotic S-layers.
           Members of this family are found in archaeal species,
           including Pyrococcus horikoshii (split into two tandem
           reading frames), Methanococcus jannaschii, and related
           species. Some local similarity can be found to other
           S-layer protein families [Cell envelope, Surface
           structures].
          Length = 571

 Score = 29.9 bits (67), Expect = 3.0
 Identities = 32/146 (21%), Positives = 46/146 (31%), Gaps = 20/146 (13%)

Query: 53  KDKTSPKEKMSSKEKTSPKEKEYS------KVKDVELGV--------TPMVPNIVKVTSS 98
           KD     EK  S  K       Y       +V D    V          +  ++  +   
Sbjct: 310 KDGKVVAEKSDSDTKDGALVHVYFGGGIGIQVLDAWENVGGDYGYVQLKIYKDLKTLELG 369

Query: 99  EDLGEIKKDKSIKKDKKTHKKKE---KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
           E+     K +    D  T  K +    E   + K+            D  K  KE G  E
Sbjct: 370 EEYMPDWKAEFAIDDNNTVIKWDALVDEDPLEDKEIGIFLTYTGDYSDTVKSGKEVGIIE 429

Query: 156 EKEKKEEKEKKEEK---EFKMKEDKE 178
               K + E KE+K   +F M E K+
Sbjct: 430 YAVLKLDDEDKEDKLYIDFSMDEYKD 455


>gnl|CDD|153339 cd07655, F-BAR_PACSIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins.
           F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization. Protein
           kinase C and Casein kinase Substrate in Neurons (PACSIN)
           proteins, also called Synaptic dynamin-associated
           proteins (Syndapins), act as regulators of cytoskeletal
           and membrane dynamics. They bind both dynamin and
           Wiskott-Aldrich syndrome protein (WASP), and may provide
           direct links between the actin cytoskeletal machinery
           through WASP and dynamin-dependent endocytosis.
           Vetebrates harbor three isoforms with distinct
           expression patterns and specific functions. PACSINs
           contain an N-terminal F-BAR domain and a C-terminal SH3
           domain. F-BAR domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 258

 Score = 29.6 bits (67), Expect = 3.0
 Identities = 28/113 (24%), Positives = 46/113 (40%), Gaps = 18/113 (15%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE---KEFKMKEDKEKP 180
            I+ K      +++K  +K+   K    G KE KE ++   K ++   K  K  E  +K 
Sbjct: 82  SIRDKLLNDVVEEVKTWQKENYHKSMMGGFKETKEAEDGFAKAQKPWAKLLKKVEKAKKA 141

Query: 181 VEKIRKEEKD---------------SEKEKKSKDKDKKLKKEKLKKKKKEKER 218
                K EK                 ++ KK +DK +K K+E  K K K ++ 
Sbjct: 142 YHAACKAEKSAQKQENNAKSDTSLSPDQVKKLQDKVEKCKQEVSKTKDKYEKA 194


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 28.7 bits (65), Expect = 3.1
 Identities = 29/104 (27%), Positives = 61/104 (58%), Gaps = 10/104 (9%)

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK--MK 174
            K++E E+++   D K       +E+ + ++E  K ++EE EKKEE+ ++   +F   +K
Sbjct: 4   EKRREMEEVQLALDAK-------REEFERREELLKQREEELEKKEEELQESLIKFDKFLK 56

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           E++ K   +  K+ ++ +K +K K+++ K  K +L++ K E E+
Sbjct: 57  ENEAK-RRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEK 99



 Score = 28.3 bits (64), Expect = 4.1
 Identities = 24/82 (29%), Positives = 52/82 (63%), Gaps = 3/82 (3%)

Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEK 195
           ++ + + +E +     K+EE E++EE  K+ E+E + KE++ ++ + K  K  K++E ++
Sbjct: 3   LEKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKR 62

Query: 196 KSKDKDKKLKKEKLKKKKKEKE 217
             +  +KK ++EK  +K+KE+E
Sbjct: 63  --RRAEKKAEEEKKLRKEKEEE 82


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 30.0 bits (67), Expect = 3.1
 Identities = 18/98 (18%), Positives = 42/98 (42%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           E+   I  +    K  K   KK  +    K+D++  ++  +   DK K +    +++E  
Sbjct: 203 EEEEAIYYEDFFDKPTKEPVKKHSDVKDPKEDEELDEEEHDSAMDKVKLDLFADEEDEPN 262

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
            +   E  ++     ++ + +  E+I + EK+    K+
Sbjct: 263 AEGVGEASDKNLSSFEKQQIEMDEQIEELEKELVAPKE 300


>gnl|CDD|173720 cd05631, STKc_GRK4, Catalytic domain of the Protein
           Serine/Threonine Kinase, G protein-coupled Receptor
           Kinase 4.  Serine/Threonine Kinases (STKs), G
           protein-coupled Receptor Kinase (GRK) subfamily, GRK4
           isoform, catalytic (c) domain. STKs catalyze the
           transfer of the gamma-phosphoryl group from ATP to
           serine/threonine residues on protein substrates. The GRK
           subfamily is part of a larger superfamily that includes
           the catalytic domains of other protein STKs, protein
           tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. GRKs phosphorylate and regulate G
           protein-coupled receptors (GPCRs), the largest
           superfamily of cell surface receptors which regulate
           some part of nearly all physiological functions.
           Phosphorylated GPCRs bind to arrestins, which prevents
           further G protein signaling despite the presence of
           activating ligand. There are seven types of GRKs, named
           GRK1 to GRK7. GRK4 has a limited tissue distribution. It
           is mainly found in the testis, but is also present in
           the cerebellum and kidney. It is expressed as multiple
           splice variants with different domain architectures. It
           is post-translationally palmitoylated and localized in
           the membrane. GRK4 polymorphisms are associated with
           hypertension and salt sensitivity, as they cause
           hyperphosphorylation, desensitization, and
           internalization of the dopamine 1 (D1) receptor while
           increasing the expression of the angiotensin II type 1
           receptor. GRK4 plays a crucial role in the D1 receptor
           regulation of sodium excretion and blood pressure.
          Length = 285

 Score = 29.6 bits (66), Expect = 3.2
 Identities = 14/46 (30%), Positives = 30/46 (65%)

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
           K++E+ K+EE + ++KED+E+  EK  ++ K   +   +K+  ++L
Sbjct: 203 KRKERVKREEVDRRVKEDQEEYSEKFSEDAKSICRMLLTKNPKERL 248


>gnl|CDD|215412 PLN02769, PLN02769, Probable galacturonosyltransferase.
          Length = 629

 Score = 30.0 bits (68), Expect = 3.2
 Identities = 25/104 (24%), Positives = 33/104 (31%), Gaps = 6/104 (5%)

Query: 74  EYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKI---KKKKD 130
           E+S V       T    N V     E L E     S   D     + + +K+    K   
Sbjct: 61  EFSHVGSARENGTKKTQNQVSEGVDEILKESGLTSSKPSDIVISSRSKLKKVFPDPKLNP 120

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKK---EEKEKKEEKEF 171
              K           K +     KE K +K   +E EK  E EF
Sbjct: 121 LPVKPHSVPVPSSDTKNKSTAIDKENKGQKADEDENEKSCELEF 164


>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
           PREFRAN.  This family is the long protein of a novel
           CRISPR subtype, PREFRAN, which is most common in
           Prevotella and Francisella, although widely distributed.
           The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
           helicase Cas3 and endonuclease Cas3-HD.
          Length = 1287

 Score = 30.2 bits (68), Expect = 3.2
 Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 17/137 (12%)

Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI----KNKEKDKEKKEK--EKGKKEEK 157
           IK+   +++DK   + +  +++KK  DK  KD I     N E   +  E+  +   K +K
Sbjct: 28  IKELGLLEQDK--ARAESYKELKKIIDKYHKDFIEETLSNVELSFDLLEEYFDLYNKLKK 85

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK--KKKE 215
            KK++K+  +E++  +++   K ++       +  +E       KKL K  L+   K   
Sbjct: 86  FKKDKKDLAKEQD-TLRKQIVKSLQD------NLPEEIFKNIFSKKLLKGVLEDFIKNTH 138

Query: 216 KERSSHEKEVIPKLTFK 232
           K+    +  +I    FK
Sbjct: 139 KDSEDEQNILIDIKEFK 155



 Score = 29.0 bits (65), Expect = 5.7
 Identities = 27/127 (21%), Positives = 51/127 (40%), Gaps = 20/127 (15%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKK------------------DKDKIKNK 140
           ++LG +++DK+  +  K  KK   +  K   ++                   +K K   K
Sbjct: 29  KELGLLEQDKARAESYKELKKIIDKYHKDFIEETLSNVELSFDLLEEYFDLYNKLKKFKK 88

Query: 141 EKDKEKKEKEKGKKE--EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
           +K    KE++  +K+  +  +    E+  +  F  K  K    + I+   KDSE E+   
Sbjct: 89  DKKDLAKEQDTLRKQIVKSLQDNLPEEIFKNIFSKKLLKGVLEDFIKNTHKDSEDEQNIL 148

Query: 199 DKDKKLK 205
              K+ K
Sbjct: 149 IDIKEFK 155


>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
           (DUF3340).  This presumed domain is found at the
           C-terminus of tail specific proteases. Its function is
           unknown. This family is found in bacteria and
           eukaryotes. This presumed domain is typically between 88
           to 187 amino acids in length.
          Length = 139

 Score = 28.7 bits (65), Expect = 3.2
 Identities = 17/77 (22%), Positives = 36/77 (46%), Gaps = 12/77 (15%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
             L E + DK++  ++   +K E+E+ +KK+            K + ++ K  G++  K 
Sbjct: 62  AWLKEERDDKTVSLNEAE-RKAEREEDEKKRL-----------KRENERRKALGEEPLKS 109

Query: 159 KKEEKEKKEEKEFKMKE 175
             +  + KEE +  + E
Sbjct: 110 LDDLPKDKEEPDAYLDE 126


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 29.9 bits (68), Expect = 3.3
 Identities = 17/95 (17%), Positives = 41/95 (43%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           I  VT    L  +++ + + K+  +  K   E + K +   ++ K   KE ++ KK+   
Sbjct: 698 IEAVTGEAALEYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAA 757

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
            +  +     + E+    +   KE     ++++R+
Sbjct: 758 AELADLLSNAKAEEIGGVKVLAKEVDGADMKELRE 792


>gnl|CDD|192488 pfam10228, DUF2228, Uncharacterized conserved protein (DUF2228).
           This is a family of conserved proteins of approximately
           700 residues found from worms to humans.
          Length = 253

 Score = 29.3 bits (66), Expect = 3.3
 Identities = 10/48 (20%), Positives = 24/48 (50%)

Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
            K+ K    K+K +  ++K ++   +  K      + K + +K++KK+
Sbjct: 93  TKKSKEIPFKKKKISALKKIQELLTEWAKELGLSLEQKTKVMKQRKKK 140


>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain.  This alpha
           helical domain is found in a set of bacterial plasmid
           replication proteins. The domain is found to the
           C-terminus of the primase/polymerase domain. Mutants of
           this domain are defective in template binding,
           dinucleotide formation and conformation change prior to
           DNA extension.
          Length = 135

 Score = 28.6 bits (64), Expect = 3.3
 Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
           GKKEE++ +E+ EK +E+  K    K K VE IR  E+  +K KKS ++ K+ KK  L  
Sbjct: 2   GKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIR--EEVCKKIKKSLEELKEKKKAILNT 59

Query: 212 KKK 214
            K 
Sbjct: 60  AKG 62


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 29.6 bits (66), Expect = 3.3
 Identities = 33/148 (22%), Positives = 70/148 (47%), Gaps = 2/148 (1%)

Query: 65  KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 124
           KE+  P+ +E  K K   L          K+  +E+       +  + +     +K K+K
Sbjct: 149 KEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQK 208

Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV--E 182
            ++   + ++ K K +E+ K  +E+E+ +K+E+  ++ +E++E++  K + ++ +    E
Sbjct: 209 QQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAE 268

Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
           K +K  +D   E K   K    K   LK
Sbjct: 269 KRQKVPEDGLSEDKKPFKCFTPKGSSLK 296



 Score = 29.2 bits (65), Expect = 4.7
 Identities = 32/153 (20%), Positives = 75/153 (49%), Gaps = 16/153 (10%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           + +T +++EKE+ ++++++ ++ +   K + K      +  ++E+++ E +E+++ K   
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGS 166

Query: 173 MKEDK---------------EKPVEKIRKEEKDSEKEK-KSKDKDKKLKKEKLKKKKKEK 216
           ++E+                 +   +  + E   E EK K K ++  L+ E+LKKK++E+
Sbjct: 167 LEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREER 226

Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKI 249
            +   E+E   K         EE+ KR    +I
Sbjct: 227 RKVLEEEEQRRKQEEADRKSREEEEKRRLKEEI 259


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 29.5 bits (66), Expect = 3.4
 Identities = 12/85 (14%), Positives = 29/85 (34%), Gaps = 2/85 (2%)

Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEK--GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
                +   D I    ++ +  E E      +   +K +  +K+    + K D  + + K
Sbjct: 184 PITTRRIGVDGISLSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTK 243

Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEK 208
              + K +E  K     +     ++
Sbjct: 244 EELDAKKAELSKALAALEAANAADE 268


>gnl|CDD|225102 COG2191, COG2191, Formylmethanofuran dehydrogenase subunit E
           [Energy production and conversion].
          Length = 206

 Score = 29.3 bits (66), Expect = 3.4
 Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 4/79 (5%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKD----KEKKEKEKGKKEEKEKKEEKEKKEEK 169
           K+   K  +  +K++   +D +K +   K         E E  K+  +E +      +E+
Sbjct: 93  KRGSGKAVRVYVKREILDEDPEKTELFPKVRAGYLGPSEIETRKEHWEELEWILLMPDEE 152

Query: 170 EFKMKEDKEKPVEKIRKEE 188
            F++KE   +P EK +K  
Sbjct: 153 LFEVKEVDVEPPEKAKKFG 171


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 29.8 bits (67), Expect = 3.4
 Identities = 14/39 (35%), Positives = 26/39 (66%)

Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
           ++ +EK ++E++E+K+  + K E+E K KE+  K  E I
Sbjct: 6   RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAI 44



 Score = 28.3 bits (63), Expect = 9.6
 Identities = 12/67 (17%), Positives = 41/67 (61%), Gaps = 3/67 (4%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           ++ ++K++ ++++++++ K K ++E K K+E  +++E  E      + + ++ I  + K 
Sbjct: 6   RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIE---AAQRSRRLDAIEAQIKA 62

Query: 191 SEKEKKS 197
            ++ ++S
Sbjct: 63  DQQMQES 69


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 29.1 bits (65), Expect = 3.4
 Identities = 26/110 (23%), Positives = 48/110 (43%), Gaps = 3/110 (2%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK---EKKEEKEKKEEKEFKMK 174
            K   EK K+  D++ +   K    D+E+ ++EK +  +K   +       + E + + K
Sbjct: 102 GKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEPKIRPK 161

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
            +   P + + +E K   K     D+ KK K     KK KE+  S + + 
Sbjct: 162 VEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPEA 211


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score = 29.5 bits (67), Expect = 3.5
 Identities = 14/80 (17%), Positives = 34/80 (42%), Gaps = 1/80 (1%)

Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
              +  +EK  E+E+++ ++ K    +E   EKI +   + E+ + + +    L    L 
Sbjct: 2   SPDETLEEKLNEEEREKLEKKKSSLSEED-KEKIIERGLELEELQSTPEDLSCLPTLTLS 60

Query: 211 KKKKEKERSSHEKEVIPKLT 230
              +  +    E+E +  + 
Sbjct: 61  DIPRSGDEYPLEEEKVGDVP 80


>gnl|CDD|224396 COG1479, COG1479, Uncharacterized conserved protein [Function
           unknown].
          Length = 409

 Score = 29.5 bits (66), Expect = 3.6
 Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 4/89 (4%)

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
           K+    D +KNK     K+EK++ K   ++KKE +E K  K    +  K    + +R   
Sbjct: 212 KELSKIDLLKNKLLMLIKEEKKQSKDLNQDKKELEEIKMNKGKITERFK----DFLRDYL 267

Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
                    K + KKLKK     K+K KE
Sbjct: 268 TYKISSNIKKRELKKLKKIYEDNKEKLKE 296


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 29.2 bits (66), Expect = 3.6
 Identities = 12/53 (22%), Positives = 30/53 (56%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
           K+ +  E+ ++EE+EK++E+E++          ++  +++  KE +  E E+ 
Sbjct: 114 KQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEM 166



 Score = 28.8 bits (65), Expect = 4.8
 Identities = 10/50 (20%), Positives = 30/50 (60%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
           + + +E+ K+ ++++++     K + ++ + K+K K+ +KE+ EE   + 
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHRA 170



 Score = 28.1 bits (63), Expect = 7.3
 Identities = 15/56 (26%), Positives = 33/56 (58%), Gaps = 2/56 (3%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
           + +++  E+  E++E+EK  +EE+E+     K   ++ ++K+ K K ++K   EE 
Sbjct: 113 RKQLRFLEQ-LEREEEEKRDEEERERLLRAAKSRSEQSRLKQ-KAKEMQKEEDEEM 166


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 27.8 bits (62), Expect = 3.6
 Identities = 19/67 (28%), Positives = 37/67 (55%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
           K +KK +K DK  I +   + +K++K+K +  +  + +  +K ++K+ K  E  E P E 
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60

Query: 184 IRKEEKD 190
            R+  +D
Sbjct: 61  KRRRTED 67


>gnl|CDD|224427 COG1510, COG1510, Predicted transcriptional regulators
           [Transcription].
          Length = 177

 Score = 28.9 bits (65), Expect = 3.7
 Identities = 17/77 (22%), Positives = 31/77 (40%)

Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           K  + I   K+  KK  +E  +   D+   ++IK  +   E+  K      E   +  + 
Sbjct: 101 KWKREIDPTKEALKKLLEELNEDLDDRDLTERIKEIKSKLERLLKWSEDYYELLTRLLEF 160

Query: 165 KKEEKEFKMKEDKEKPV 181
            + E+ FK  E  +K  
Sbjct: 161 LESEEVFKYLEKPDKIS 177


>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
           2-oxoacid:ferredoxin oxidoreductases, beta subunit
           [Energy production and conversion].
          Length = 294

 Score = 29.2 bits (66), Expect = 3.7
 Identities = 16/82 (19%), Positives = 35/82 (42%), Gaps = 10/82 (12%)

Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKKE-EKEFK-MKE 175
           ++ K  ++       + +         K E+EKGK          +E  E E+ FK + +
Sbjct: 218 EEAKLAVETGYWPLYRYE-------PGKAEEEKGKDIPIGIFYPVEEYLEYEERFKHLTK 270

Query: 176 DKEKPVEKIRKEEKDSEKEKKS 197
              + ++K++K+  +  +E K 
Sbjct: 271 SNPELIQKLQKDVDERWEELKR 292


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 29.6 bits (67), Expect = 3.7
 Identities = 28/130 (21%), Positives = 57/130 (43%), Gaps = 8/130 (6%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD--KDKIKNKEKD-KEKKEKEKGKKE 155
           EDL +I     + K  K   ++  ++I+    K D  + +IK   K+ +E+++K      
Sbjct: 157 EDLLDISVLSEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIA 216

Query: 156 EKEKKEEKEKKEEKEFKMKED--KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
            K+ K ++  +E K  K + +   ++ +  +   E  S    K      K+K    K ++
Sbjct: 217 RKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKS---KIEQ 273

Query: 214 KEKERSSHEK 223
            +K    +EK
Sbjct: 274 FQKVIKMYEK 283


>gnl|CDD|220178 pfam09321, DUF1978, Domain of unknown function (DUF1978).  Members
           of this family are found in various hypothetical
           proteins produced by the bacterium Chlamydia pneumoniae.
           Their exact function has not, as yet, been identified.
          Length = 241

 Score = 29.4 bits (66), Expect = 3.7
 Identities = 18/114 (15%), Positives = 54/114 (47%)

Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
            +S K++KK  +  ++E  ++ +D  DK+  + +E+ +  +       E +++ E++E  
Sbjct: 86  KQSSKENKKVLQDAQREFERQSQDFYDKEIEEVEERLEALQALYPEVSEREDETEKQETV 145

Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
                 ++   +    +  +  ++ E   K +D  ++  +E+  ++K+ +E   
Sbjct: 146 SSYARDLETALDHIEVEYVQCVREQEDYWKEEDVKEQELREEGGERKRLEEIQE 199


>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
           N-terminus.  This is an N-terminal domain of DNA
           polymerase III polC subunit A that is found only in
           Firmicutes. DNA polymerase polC-type III enzyme
           functions as the 'replicase' in low G + C Gram-positive
           bacteria. Purine asymmetry is a characteristic of
           organisms with a heterodimeric DNA polymerase III
           alpha-subunit constituted by polC which probably plays a
           direct role in the maintenance of strand-biased gene
           distribution; since, among prokaryotic genomes, the
           distribution of genes on the leading and lagging strands
           of the replication fork is known to be biased. The
           domain is associated with DNA_pol3_alpha pfam07733.
          Length = 180

 Score = 28.8 bits (65), Expect = 3.8
 Identities = 16/84 (19%), Positives = 43/84 (51%), Gaps = 4/84 (4%)

Query: 91  NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE----KEKIKKKKDKKDKDKIKNKEKDKEK 146
            ++    +E      K K + K  + + K      K  ++  +D+  +++I+  E+ KE+
Sbjct: 95  KLIIKVPNEIEANFLKKKKLDKLLEEYIKFGFGKLKIDVEVDEDESSEEEIEEFEEQKEE 154

Query: 147 KEKEKGKKEEKEKKEEKEKKEEKE 170
           +E +  ++  +  K+++ +K++KE
Sbjct: 155 EEAKLAEEALEALKKKEAEKKKKE 178


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 29.9 bits (67), Expect = 3.8
 Identities = 28/177 (15%), Positives = 66/177 (37%), Gaps = 30/177 (16%)

Query: 57  SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 116
           S  +++ +  +T  +     +  D+ L   PM    ++    E   +         +  +
Sbjct: 611 SNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESESDGSFI-----EVDS 665

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
                + ++  K    D         + E+  + K    E E ++E E          E 
Sbjct: 666 VSSTLELQVPSKSQPTD---------ESEENAENKVASIEGEHRKEIEDLL-----FDES 711

Query: 177 KEKPVEKIRKEEKD-----------SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           +E  +  + +EEKD           S +E ++ + +   ++  LK +K++++R + E
Sbjct: 712 EEDNIVGMIEEEKDADDFKNEWQDISLEELEALEANLLAEQNSLKAQKQQQKRIAAE 768


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
           production and conversion].
          Length = 108

 Score = 28.2 bits (63), Expect = 4.0
 Identities = 18/71 (25%), Positives = 37/71 (52%)

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
           +E  K E    +E +E KEE E  +KE +E+  E I + E+++EK  +   ++ + + E+
Sbjct: 10  REIKKAEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEE 69

Query: 209 LKKKKKEKERS 219
             ++   +   
Sbjct: 70  EAEEILAEAEK 80


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 29.8 bits (67), Expect = 4.0
 Identities = 19/103 (18%), Positives = 47/103 (45%), Gaps = 7/103 (6%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK--E 170
             +   K+ +    K+ D+K     +  E + E++ +++   +E+E +   E +EEK   
Sbjct: 376 SSQRELKQTEAAYCKRLDEK-----RLFEDEAEEEARQRLADDEEEVRAGDEAREEKIAA 430

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
                DKE+      + +   +K++  ++K    +++K +  K
Sbjct: 431 NSQVIDKEEVCNLYDRRDTAWQKQRFLREKQTAFERQKTEHTK 473



 Score = 28.3 bits (63), Expect = 9.5
 Identities = 20/100 (20%), Positives = 40/100 (40%), Gaps = 6/100 (6%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           E + K+ D   K   +N + ++  ++     K+E+ + +E EK+     K+ E +    E
Sbjct: 164 EVLNKEADSLYKPSGRNPQINQLLEKL----KQERNEIDEAEKEYATYHKLLESRRA--E 217

Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
             R  E  SE               +L  + +E ++   E
Sbjct: 218 HARLAELRSELRADRDHIRALRDAVELWPRLQEWKQLEQE 257


>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6.  The VP6 protein a
           minor protein in the core of the virion is probably the
           viral helicase.
          Length = 322

 Score = 29.3 bits (65), Expect = 4.0
 Identities = 15/41 (36%), Positives = 26/41 (63%)

Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           +    K+K+ KD+D  + +  D E ++K+KG KEE +K+ E
Sbjct: 29  ESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETE 69


>gnl|CDD|239500 cd03406, Band_7_3, A subgroup of the band 7 domain of flotillin
           (reggie) like proteins. This subgroup contains proteins
           similar to stomatin, prohibitin, flotillin, HlfK/C and
           podicin.  Many of these band 7 domain-containing
           proteins are lipid raft-associated.  Individual proteins
           of this band 7 domain family may cluster to form
           membrane microdomains which may in turn recruit
           multiprotein complexes.  Microdomains formed from
           flotillin proteins may in addition be dynamic units with
           their own regulatory functions.  Flotillins have been
           implicated in signal transduction, vesicle trafficking,
           cytoskeleton rearrangement and are known to interact
           with a variety of proteins.  Stomatin interacts with and
           regulates members of the degenerin/epithelia Na+ channel
           family in mechanosensory cells of Caenorhabditis elegans
           and vertebrate neurons and participates in trafficking
           of Glut1 glucose transporters. Prohibitin may act as a
           chaperone for the stabilization of mitochondrial
           proteins.  Prokaryotic HflK/C plays a role in the
           decision between lysogenic and lytic cycle growth during
           lambda phage infection. Flotillins have been implicated
           in the progression of prion disease, in the pathogenesis
           of neurodegenerative diseases such as Parkinson's and
           Alzheimer's disease and, in cancer invasion and
           metastasis. Mutations in the podicin gene give rise to
           autosomal recessive steroid resistant nephritic
           syndrome.
          Length = 280

 Score = 29.4 bits (66), Expect = 4.2
 Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 4/61 (6%)

Query: 142 KDKEKKEKEKGK---KEEKEKKEEKEKKEEKEFKMKE-DKEKPVEKIRKEEKDSEKEKKS 197
           ++ E  E EK K     +K+K  EKE + E++  + E +K   V KI   +K  EKE + 
Sbjct: 163 RNYELMEAEKTKLLIAIQKQKVVEKEAETERKKAVIEAEKVAQVAKILFGQKVMEKETEK 222

Query: 198 K 198
           +
Sbjct: 223 R 223


>gnl|CDD|225352 COG2747, FlgM, Negative regulator of flagellin synthesis
           (anti-sigma28 factor) [Transcription / Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 93

 Score = 27.7 bits (62), Expect = 4.3
 Identities = 15/76 (19%), Positives = 28/76 (36%), Gaps = 2/76 (2%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           KT           KK      ++    +  +  +       E++ +E K+  E  E+K+ 
Sbjct: 19  KTKDAAVTTTGAAKKTDSSNVEVSLSAQQAKLMQPGSSDIREEKVEELKQAIENGEYKV- 77

Query: 175 EDKEKPVEKIRKEEKD 190
            D EK  +K+    K 
Sbjct: 78  -DTEKIADKLLDFAKQ 92


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
           Genome duplication is precisely regulated by
           cyclin-dependent kinases CDKs, which bring about the
           onset of S phase by activating replication origins and
           then prevent relicensing of origins until mitosis is
           completed. The optimum sequence motif for CDK
           phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
           to have at least 11 potential phosphorylation sites.
           Drc1 is required for DNA synthesis and S-M replication
           checkpoint control. Drc1 associates with Cdc2 and is
           phosphorylated at the onset of S phase when Cdc2 is
           activated. Thus Cdc2 promotes DNA replication by
           phosphorylating Drc1 and regulating its association with
           Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
           substrates required for DNA replication.
          Length = 397

 Score = 29.4 bits (66), Expect = 4.3
 Identities = 20/90 (22%), Positives = 41/90 (45%), Gaps = 1/90 (1%)

Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
           ++ KKK +K +   + K +    K  ++    E +  EE  K +EK           +++
Sbjct: 282 RVFKKKGQK-RTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDE 340

Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
             ++E D E +++ + K K  KK + +K  
Sbjct: 341 DDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 29.2 bits (65), Expect = 4.3
 Identities = 28/128 (21%), Positives = 55/128 (42%), Gaps = 4/128 (3%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
           +   E   +K  ++D +     EK ++ +     D D+    + D+    +     EE+E
Sbjct: 125 KSDVEEPSEKVDEEDVEREILAEKPRVTRFNIVWDNDE----DNDEAPPAQPDVDNEEEE 180

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
           + EE + +EE+E +               E++ E+ + S D +  +  E  +  KK+ E 
Sbjct: 181 RLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEE 240

Query: 219 SSHEKEVI 226
              E+EVI
Sbjct: 241 EEMEEEVI 248


>gnl|CDD|224107 COG1186, PrfB, Protein chain release factor B [Translation,
           ribosomal structure and biogenesis].
          Length = 239

 Score = 29.1 bits (66), Expect = 4.3
 Identities = 18/88 (20%), Positives = 35/88 (39%), Gaps = 4/88 (4%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           E+ +   K   +   K K     +EK+ +EK  +E   KK      + + + +  D  +P
Sbjct: 150 ERSQHLNKALARKMLKGKLYILAQEKRSQEK-NRERALKKLIGWGNQIRSYVL--DPYQP 206

Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
            + +R    +   + K  D DK    + 
Sbjct: 207 TKDLR-TGVERRNKSKVLDGDKDGFIKA 233


>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
           (Rrp7p) and similar proteins.  This CD corresponds to
           the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
           gene from Saccharomyces cerevisiae. It is an essential
           yeast protein involved in pre-rRNA processing and
           ribosome assembly. Rrp7p contains an N-terminal RNA
           recognition motif (RRM), also termed RBD (RNA binding
           domain) or RNP (ribonucleoprotein domain), and a
           C-terminal RRP7 domain.
          Length = 128

 Score = 28.0 bits (63), Expect = 4.3
 Identities = 23/60 (38%), Positives = 40/60 (66%), Gaps = 8/60 (13%)

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKE-----KKEEKEKKEEKEFKMK--EDKEKPVEKIRKE 187
           +  K  E++K++KEK+K KK+E E     +  EK+K+E+ +   K  EDKE+ VEK++++
Sbjct: 69  EAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKKFEEDKER-VEKMKEK 127


>gnl|CDD|150392 pfam09710, Trep_dent_lipo, Treponema clustered lipoprotein
           (Trep_dent_lipo).  This entry represents a family of six
           predicted lipoproteins from a region of about 20
           tandemly arranged genes in the Treponema denticola
           genome. Two other neighboring genes share the
           lipoprotein signal peptide region but do not show more
           extensive homology. The function of this locus is
           unknown.
          Length = 394

 Score = 29.5 bits (66), Expect = 4.3
 Identities = 23/128 (17%), Positives = 62/128 (48%), Gaps = 4/128 (3%)

Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
           S  K+ K  ++   +     K +  +++  +K +  E  +  +  KE +EKK  +E  ++
Sbjct: 17  SCSKEVKEQREMRIKVESSMKIEPKENEFLSKPEYDEHVKTPEQIKELEEKKAYEESLKQ 76

Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK---KLKKEKLKKKKKEKERSSHEKEV 225
            +F++ +  +  + +  K   +   +  ++ K+    K+KK+      K+K  + + K++
Sbjct: 77  LQFEL-DKYDLVLIQAYKTPTNIGIDNLAQQKNLEFIKVKKDGEFIYCKQKAPTQNGKDL 135

Query: 226 IPKLTFKF 233
             ++T+K+
Sbjct: 136 DYQVTYKW 143


>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
           module that binds membranes and detects membrane
           curvature.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions including organelle
           biogenesis, membrane trafficking or remodeling, and cell
           division and migration. Mutations in BAR containing
           proteins have been linked to diseases and their
           inactivation in cells leads to altered membrane
           dynamics. A BAR domain with an additional N-terminal
           amphipathic helix (an N-BAR) can drive membrane
           curvature. These N-BAR domains are found in amphiphysins
           and endophilins, among others. BAR domains are also
           frequently found alongside domains that determine lipid
           specificity, such as the Pleckstrin Homology (PH) and
           Phox Homology (PX) domains which are present in beta
           centaurins (ACAPs and ASAPs) and sorting nexins,
           respectively. A FES-CIP4 Homology (FCH) domain together
           with a coiled coil region is called the F-BAR domain and
           is present in Pombe/Cdc15 homology (PCH) family
           proteins, which include Fes/Fes tyrosine kinases, PACSIN
           or syndapin, CIP4-like proteins, and srGAPs, among
           others. The Inverse (I)-BAR or IRSp53/MIM homology
           Domain (IMD) is found in multi-domain proteins, such as
           IRSp53 and MIM, that act as scaffolding proteins and
           transducers of a variety of signaling pathways that link
           membrane dynamics and the underlying actin cytoskeleton.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The I-BAR domain
           induces membrane protrusions in the opposite direction
           compared to classical BAR and F-BAR domains, which
           produce membrane invaginations. BAR domains that also
           serve as protein interaction domains include those of
           arfaptin and OPHN1-like proteins, among others, which
           bind to Rac and Rho GAP domains, respectively.
          Length = 194

 Score = 28.9 bits (65), Expect = 4.4
 Identities = 22/117 (18%), Positives = 62/117 (52%), Gaps = 6/117 (5%)

Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIK---NKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           +++K  K  K  E E+ + + ++K ++K+     +   K+ KE +K +K+  + + + + 
Sbjct: 55  ALEKFGKIQK--ELEEFRDQLEQKLENKVIEPLKEYLKKDLKEIKKRRKKLDKARLDYDA 112

Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
             EK  K+++ K+    K+ + E++ ++ K+  ++ ++   E L K +++++     
Sbjct: 113 AREKLKKLRKKKKDS-SKLAEAEEELQEAKEKYEELREELIEDLNKLEEKRKELFLS 168


>gnl|CDD|225405 COG2849, COG2849, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 230

 Score = 28.9 bits (65), Expect = 4.5
 Identities = 25/123 (20%), Positives = 43/123 (34%), Gaps = 4/123 (3%)

Query: 94  KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
            VT +  L    ++K+   +K   K+K  ++  K   K D       E+ K K+E     
Sbjct: 15  GVTITYYLNGAIEEKANYVNKLEGKEKLYDEDGKLLSKLD---FSAGEEVKSKEEYYDNG 71

Query: 154 KEEKEKKEEKEKKEEKEFKM-KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
             + E  E K+       +   E+ E       K  K   + K+  +  K   +   K  
Sbjct: 72  TLKGEYLEGKKTPFTGIVEKYYENGELEAAIYYKNGKLDGETKEFYENGKLKSEYNYKNG 131

Query: 213 KKE 215
           K  
Sbjct: 132 KLH 134



 Score = 28.1 bits (63), Expect = 7.8
 Identities = 23/119 (19%), Positives = 37/119 (31%), Gaps = 6/119 (5%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
              ++  K+   D  T        I++K          NK + KEK   E GK   K   
Sbjct: 2   GDIVETSKNGVDDGVTITYYLNGAIEEKA------NYVNKLEGKEKLYDEDGKLLSKLDF 55

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
              E+ + KE        K      K+   +   +K  +  +       K  K + E  
Sbjct: 56  SAGEEVKSKEEYYDNGTLKGEYLEGKKTPFTGIVEKYYENGELEAAIYYKNGKLDGETK 114


>gnl|CDD|218754 pfam05795, Plasmodium_Vir, Plasmodium vivax Vir protein.  This
           family consists of several Vir proteins specific to
           Plasmodium vivax. The vir genes are present at about
           600-1,000 copies per haploid genome and encode proteins
           that are immunovariant in natural infections, indicating
           that they may have a functional role in establishing
           chronic infection through antigenic variation.
          Length = 348

 Score = 29.1 bits (65), Expect = 4.5
 Identities = 30/142 (21%), Positives = 52/142 (36%), Gaps = 10/142 (7%)

Query: 91  NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKI---------KKKKDKKDKDKIKNKE 141
           +I    + ++  +  K K          + +K K          K KK  K K+K  NK 
Sbjct: 96  DIWNKINKKNSKKKNKCKPDFNYIDILDEWKKRKDLYDYFENYDKIKKKIKSKNKNCNKY 155

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK-EEKDSEKEKKSKDK 200
               K  K   KK +++     +     E+   + K  P   + K   K+  K++KSK+ 
Sbjct: 156 CKYLKYIKSLYKKYKEKCCSSNDNNYCPEYFKCDKKYNPKNLLSKLSCKNESKKEKSKEG 215

Query: 201 DKKLKKEKLKKKKKEKERSSHE 222
                 EKL+  ++E       
Sbjct: 216 GAAEGDEKLELLEEESSELETS 237


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 29.5 bits (66), Expect = 4.6
 Identities = 15/65 (23%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
           D +K  E     ++  +G++E+++  EE +  +E E +  ED E   E   ++ ++ E+E
Sbjct: 630 DVLKADENKSRHQQLFEGEEEDEDDLEETDD-DEDECEAIEDSESESESDGEDGEEDEQE 688

Query: 195 KKSKD 199
             ++ 
Sbjct: 689 DDAEA 693


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 28.2 bits (63), Expect = 4.7
 Identities = 18/80 (22%), Positives = 31/80 (38%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
            ++KE EK + E +E +E+K K      +         E    EE + +    S      
Sbjct: 41  AQEKEHEKERAEREEAREKKAKISPNALRGGATAGHGEEDTDDEEDEEDFATPSAVPQWG 100

Query: 204 LKKEKLKKKKKEKERSSHEK 223
            K  K ++K   K   + E+
Sbjct: 101 KKARKRQRKVIRKLLEAEEQ 120


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 29.0 bits (65), Expect = 4.7
 Identities = 12/44 (27%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
           K  + + K+K +  E+  ++K +K+ +E +E+ +K  EK  + K
Sbjct: 11  KLAEKQAKSKLQQLERA-RDKQEKKAEEYQEQIKKAIEKAEEAK 53


>gnl|CDD|130658 TIGR01597, PYST-B, Plasmodium yoelii subtelomeric family PYST-B.
           This model represents a paralogous family of Plasmodium
           yoelii genes preferentially located in the subtelomeric
           regions of the chromosomes. There are no obvious
           homologs to these genes in any other organism.
          Length = 255

 Score = 29.1 bits (65), Expect = 4.8
 Identities = 30/109 (27%), Positives = 44/109 (40%), Gaps = 18/109 (16%)

Query: 116 THKKKEKEKIK----KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE------------- 158
           +H KK KE+         DKK K  I   +K+ E+ +KE   +   E             
Sbjct: 90  SHIKKHKERNTLPDLNNVDKKTKKLINKLQKELEELKKELDNEMNDELTIQPIHDKIIIK 149

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLKK 206
           K E     E ++FK  E+++       +E +  S    K K K KKL K
Sbjct: 150 KDENNSVSEHEDFKQLENEKNSSVSEHEEFDIASSDNLKIKRKLKKLVK 198


>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
           mechanisms].
          Length = 733

 Score = 29.3 bits (66), Expect = 4.8
 Identities = 25/157 (15%), Positives = 50/157 (31%), Gaps = 10/157 (6%)

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
           H KKE  KI    D       +     K   EK + KK +  ++   E          E 
Sbjct: 535 HGKKENGKIYVYND------EERGPYLKYSYEKLE-KKLKSLEELPLEIINGDSLLSDEV 587

Query: 177 KEKPVEKIRKEEKDSEKE---KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
            EK  + I + +     E                 +     E  +   + E I  +  + 
Sbjct: 588 DEKINQDILRIQDRILSEAIELARLLDLLHKSSPDVLNSIFEYLKFLTDAEEIRDVPLEE 647

Query: 234 GTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDT 270
             +++E+ +   + +I+    +  S  P    ++ + 
Sbjct: 648 KEELDERIESLYNLEILKYLEEELSEGPTRLGEKENE 684


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 26.6 bits (59), Expect = 4.8
 Identities = 17/38 (44%), Positives = 26/38 (68%)

Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
           D+   K  E DK+ +EKEK  + EKE+ E ++K+EE+E
Sbjct: 4   DERAEKRAEFDKKLEEKEKALEAEKEEAEARQKEEEEE 41


>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA.  FadA (Fusobacterium
           adhesin A) is an adhesin which forms two alpha helices.
          Length = 126

 Score = 28.0 bits (62), Expect = 4.9
 Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 7/82 (8%)

Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
            ++EK+EKE  +KE +E KE +  +EE   K+KE+ +    K        E +K  K  D
Sbjct: 47  FEEEKQEKETAEKEVQELKERQLGREELYAKLKEESKTRFHK-------DEYKKLLKRYD 99

Query: 202 KKLKKEKLKKKKKEKERSSHEK 223
           + L K + K   KE++ S  EK
Sbjct: 100 EYLNKLEQKITDKEQQISEFEK 121


>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1.  This region is found
           in some of the Diaphanous related formins (Drfs). It
           consists of low complexity repeats of around 12
           residues.
          Length = 160

 Score = 28.4 bits (63), Expect = 4.9
 Identities = 15/37 (40%), Positives = 16/37 (43%)

Query: 8   PFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPS 44
           P P   GPG+ P  PP    P IP P       PPP 
Sbjct: 112 PPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPP 148


>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
           TRaffic INducer (NOSTRIN).  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Nitric Oxide Synthase TRaffic
           INducer (NOSTRIN) is expressed in endothelial and
           epithelial cells and is involved in the regulation,
           trafficking and targeting of endothelial NOS (eNOS).
           NOSTRIN facilitates the endocytosis of eNOS by
           coordinating the functions of dynamin and the
           Wiskott-Aldrich syndrome protein (WASP). Increased
           expression of NOSTRIN may be correlated to preeclampsia.
           NOSTRIN contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
           The F-BAR domain of NOSTRIN is necessary and sufficient
           for its membrane association and is responsible for its
           subcellular localization.
          Length = 239

 Score = 28.9 bits (65), Expect = 4.9
 Identities = 27/75 (36%), Positives = 37/75 (49%), Gaps = 7/75 (9%)

Query: 105 KKDKSIKKDKKTHKK-KEKEK----IKKKKDKKDKDKIKNK-EKDKEKKEKEKGKKEEKE 158
            + + IK  KK H   +E EK    ++  K    K K+ NK +K  E ++KE  K E K 
Sbjct: 121 WRSEQIKVKKKLHGLARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKEDEKLEAKR 180

Query: 159 KK-EEKEKKEEKEFK 172
           KK EE   K E E+ 
Sbjct: 181 KKGEESRLKAENEYY 195


>gnl|CDD|205207 pfam13026, DUF3887, Protein of unknown function (DUF3887).  This
           family of lipoproteins is functionally uncharacterized.
           This family of proteins is found in bacteria. Proteins
           in this family are typically between 128 and 167 amino
           acids in length. The proteins in this family contain an
           N-terminal lipid attachment site.
          Length = 101

 Score = 27.6 bits (61), Expect = 5.0
 Identities = 19/88 (21%), Positives = 38/88 (43%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           K+ K+ ++   ++      EK  ++   K +EK  K     + KE   K ++K+  +E  
Sbjct: 1   KETKENVEQSAEEIVTMVNEKVYRDVHNKFDEKMNKALNWEEAKEKLGKTIDKVGFKEDF 60

Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKER 218
            + EK   +    ++      K KE+E 
Sbjct: 61  EKIEKSMLEDIDHIRTVIRVAKYKEQEH 88


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 29.1 bits (65), Expect = 5.0
 Identities = 10/122 (8%), Positives = 43/122 (35%), Gaps = 13/122 (10%)

Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
           + +E ++ ++K   +D+D+   + + + +       +  + ++    +      + + D 
Sbjct: 1   RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS 60

Query: 178 EKPV-----------EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
             P            ++ R+  +        + + +   +    + +K+ ++ S      
Sbjct: 61  RSPRSLRYSSVRRSRDRPRRRSRSVRSI--EQHRRRLRDRSPSNQWRKDDKKRSLWDIKP 118

Query: 227 PK 228
           P 
Sbjct: 119 PG 120


>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
           Members of this family of small ribosomal proteins
           possess one of three conserved blocks of sequence found
           in proteins that stimulate the dissociation of guanine
           nucleotides from G-proteins, leaving open the
           possibility that MRP-S27 might be a functional partner
           of GTP-binding ribosomal proteins.
          Length = 417

 Score = 29.0 bits (65), Expect = 5.1
 Identities = 30/152 (19%), Positives = 66/152 (43%), Gaps = 17/152 (11%)

Query: 80  DVELG--VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI 137
            V L      ++ +I++  SSE         S+KK ++   K E  ++ +    ++ +K 
Sbjct: 274 GVALSEEALDLLQSILEKLSSELDEA-----SLKKLQEAIDKSESSELDEDLLLEEVEKS 328

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKE-EKEFKMKEDKEKPVEKIRKE--EKDSEKE 194
           K K  +    +  +  +E + K +   K E      ++E        I KE  E   +++
Sbjct: 329 KQKF-EPLLAQYGESFQEWRSKLQALAKVESRSLLSVEERLPT----IEKEDLELYEQRQ 383

Query: 195 KKSKDKDKKL--KKEKLKKKKKEKERSSHEKE 224
           +    +++KL  +K+KL+++  E+ +  H   
Sbjct: 384 QLWFFENRKLWQRKKKLREQADEEYQQRHATR 415


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 28.1 bits (63), Expect = 5.1
 Identities = 31/111 (27%), Positives = 62/111 (55%), Gaps = 5/111 (4%)

Query: 117 HKKKEKEKIKKKKDKKDKD---KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
            +K+  EK+++++ ++ +    +I+ ++K + K  KE  K E+KE K+E EK     F+ 
Sbjct: 34  QQKRTIEKLEQRQTQELRRLPKRIRAEQKTRLKMFKESLKIEKKELKQEVEKL--PRFQE 91

Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           +E K    EK  +E+K  ++E++   K ++  +E L++ + EK     E E
Sbjct: 92  QEKKRMKAEKEEQEQKHQKQEREFLAKQEENLEEALQQLQNEKRHELAETE 142


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 29.1 bits (65), Expect = 5.1
 Identities = 25/147 (17%), Positives = 43/147 (29%), Gaps = 17/147 (11%)

Query: 26  QHPLIPQPLVHPPGTPPPSK-MKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELG 84
           Q P+  +    P  TP            K   +P +  +         K        EL 
Sbjct: 269 QTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPAMYSSDYQAPKPEPIYTWEELL 328

Query: 85  VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 144
                 ++  ++S  D            D +        K K+++     +       + 
Sbjct: 329 RERFPSDLFAISSLPD-----------SDSEASDSGPTRKRKRRRVPPLPEY---SSDED 374

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           E    E     EKE+K  +E  E+K F
Sbjct: 375 EDDSDEDEVDYEKERKRRRE--EDKNF 399


>gnl|CDD|232987 TIGR00467, lysS_arch, lysyl-tRNA synthetase, archaeal and
           spirochete.  This model represents the lysyl-tRNA
           synthetases that are class I amino-acyl tRNA
           synthetases. It includes archaeal and spirochete
           examples of the enzyme. All other known examples are
           class IIc amino-acyl tRNA synthetases and seem to form a
           separate orthologous set [Protein synthesis, tRNA
           aminoacylation].
          Length = 515

 Score = 29.1 bits (65), Expect = 5.2
 Identities = 25/136 (18%), Positives = 52/136 (38%), Gaps = 7/136 (5%)

Query: 60  EKMSSKEKTSPKEKEYSKVKDVELG---VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 116
           E+     K   +EK+ +  +  EL     +  +P  V       + +I ++  I+K  + 
Sbjct: 327 ERFYYGVKDKDEEKKRAFKRIYELSQPMPSERIPYQVPFRHLSVISQIFENNDIEKILE- 385

Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
             K+ +  +  +KDK    ++        K   E  K   + K +  E  EE     K+ 
Sbjct: 386 ILKRVQYTVDDQKDKLINKRLNCARNWIRKYAPEDFKFSLRSKFDNMEILEENS---KKA 442

Query: 177 KEKPVEKIRKEEKDSE 192
             +  E ++K  + + 
Sbjct: 443 INELAEFLKKNFEVAT 458


>gnl|CDD|233582 TIGR01812, sdhA_frdA_Gneg, succinate dehydrogenase or fumarate
           reductase, flavoprotein
           subunitGram-negative/mitochondrial subgroup.  This model
           represents the succinate dehydrogenase flavoprotein
           subunit as found in Gram-negative bacteria,
           mitochondria, and some Archaea. Mitochondrial forms
           interact with ubiquinone and are designated EC 1.3.5.1,
           but can be degraded to 1.3.99.1. Some isozymes in E.
           coli and other species run primarily in the opposite
           direction and are designated fumarate reductase [Energy
           metabolism, Aerobic, Energy metabolism, Anaerobic,
           Energy metabolism, TCA cycle].
          Length = 541

 Score = 29.2 bits (66), Expect = 5.2
 Identities = 14/77 (18%), Positives = 37/77 (48%)

Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
           E   K      + E++  K ++   +  ++ +  + V KIR+E  D+  +     + ++L
Sbjct: 375 EYAAKTGNPAADIEEEAVKAEEALIDLLVERNGGERVAKIREELGDTMDDNVGIFRTEEL 434

Query: 205 KKEKLKKKKKEKERSSH 221
            K+ + + ++ +ER  +
Sbjct: 435 LKKAVDEIEELRERYKN 451


>gnl|CDD|184535 PRK14144, PRK14144, heat shock protein GrpE; Provisional.
          Length = 199

 Score = 28.5 bits (63), Expect = 5.3
 Identities = 13/37 (35%), Positives = 24/37 (64%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
           K++KKD  K K++ K++ K E E  ++E  E+ + +E
Sbjct: 3   KQNKKDWKKFKDEHKEEHKVENEILEEETDEESQHQE 39


>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b.  This model
           describes the F1/F0 ATP synthase b subunit in bacteria
           only. Scoring just below the trusted cutoff are the
           N-terminal domains of Mycobacterial b/delta fusion
           proteins and a subunit from an archaeon, Methanosarcina
           barkeri, in which the ATP synthase homolog differs in
           architecture and is not experimentally confirmed. This
           model helps resolve b from the related b' subunit.
           Within the family is an example from a
           sodium-translocating rather than proton-translocating
           ATP synthase [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 147

 Score = 28.1 bits (63), Expect = 5.3
 Identities = 18/96 (18%), Positives = 45/96 (46%), Gaps = 6/96 (6%)

Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
           +  + KI +     E+ +KE    ++K +   KE K+E +  ++   ++       E  +
Sbjct: 25  ETRQKKIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEIIENANKR-----GSEILE 79

Query: 191 SEKEKKSKDKDKKLKKEKLK-KKKKEKERSSHEKEV 225
             K +  ++++K   + + + + +KE+ R    K+V
Sbjct: 80  EAKAEAREEREKIKAQARAEIEAEKEQAREELRKQV 115


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 29.2 bits (65), Expect = 5.3
 Identities = 14/97 (14%), Positives = 37/97 (38%), Gaps = 5/97 (5%)

Query: 88  MVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 147
           + P +    ++E    +            +  +E      + + +   +I +  KD+++K
Sbjct: 60  LRPRVSSSWNNESYNRLPIKTKDNVVADVNNGEEF---LSESESEASLEIDSDIKDEKQK 116

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDK--EKPVE 182
             E+ K   +   +++   E++       K  E+P E
Sbjct: 117 SLEEQKIAPEIPVKQQIDSEKERIASICTKIIEEPEE 153



 Score = 28.4 bits (63), Expect = 8.5
 Identities = 14/64 (21%), Positives = 33/64 (51%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
           E+  +K  + +K   K ++KD     K+  K E++  + + E ++ ++ +   ++EK   
Sbjct: 343 EEKARKYWEANKPVSKREKKDIFHLSKKLRKIEKERLRIQSEMRDAEDIEEAIEEEKNQS 402

Query: 183 KIRK 186
           +I K
Sbjct: 403 EILK 406


>gnl|CDD|234487 TIGR04157, glyco_rSAM_CFB, glycosyltransferase, GG-Bacteroidales
           peptide system.  Members of this protein family are
           predicted glycosyltransferases that occur in conserved
           gene neighborhoods in various members of the
           Bacteroidales. These neighborhoods feature a radical SAM
           enzyme predicted to act in peptide modification (family
           TIGR04148), peptides from family TIGR04149 with a
           characteristic GG cleavage motif, and several other
           proteins.
          Length = 406

 Score = 29.2 bits (66), Expect = 5.4
 Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 10/76 (13%)

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDK----EKPVEKIRKEEKDSEKEKKS------K 198
           KE+ K+ E EK  ++E + EKE     D+     +  + +  E+    KEK         
Sbjct: 144 KEETKRSELEKLIKEEFEREKEMFQLADRVICLSQYTQNLLYEDYGIPKEKIVVIYNGLT 203

Query: 199 DKDKKLKKEKLKKKKK 214
           D+   L + + KK +K
Sbjct: 204 DEAIYLSEVERKKLRK 219


>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
           unknown].
          Length = 272

 Score = 28.9 bits (64), Expect = 5.4
 Identities = 25/156 (16%), Positives = 55/156 (35%), Gaps = 11/156 (7%)

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDK-----DKIKNKEKDKEKKEKEKGKKEEKEKK 160
           K ++++K  ++   +  EK   ++  +       + I    K          ++  K+ +
Sbjct: 121 KAENVEKVPESDAIEALEKQLTQQKSEKHNSSAINFIDELNKRLWSDPFVSSQRLRKQFR 180

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
           E K+ ++++E K    K +    I      S+K+K     D +L K K  +K   +   S
Sbjct: 181 ERKKIEKKQEAKDLSLKNRAALDIDILPSSSDKDKALLLLDNELGKNKFIRKLDYRRTPS 240

Query: 221 HEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
                  +    F    E    ++        P + 
Sbjct: 241 S------RTFSTFAKFAETSFAKKDPFARKFVPSEK 270


>gnl|CDD|234961 PRK01581, speE, spermidine synthase; Validated.
          Length = 374

 Score = 29.2 bits (65), Expect = 5.4
 Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 11/67 (16%)

Query: 117 HKKKEKEKIKK----KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           H+K+ K KI +    KKDK+        E D  K E E+   E K+ K+ K+ K+++  +
Sbjct: 4   HRKQSKIKIYRITSYKKDKR-------SELDSNKFELEQQLVENKQDKQGKQYKQDRGIQ 56

Query: 173 MKEDKEK 179
             E K+ 
Sbjct: 57  YAETKQD 63


>gnl|CDD|241381 cd13227, PH-like_Vps36_GLUE, Vps36 GLUE (GRAM-like
           ubiquitin-binding in Eap45) Pleckstrin homology-like
           domain.  ESCRT complexes form the main machinery driving
           protein sorting from endosomes to lysosomes. Yeast/human
           ESCRT-I consists of Vps23/Tsg101, Vps28/Vps28, and
           Vps37/Vps37 homolog. Yeast/human ESCRT-II is composed of
           Vps25/EAP20, Vps22/EAP30, and Vps36/EAP45. Yeast
           ESCRT-III consists Vps2, Vps20, Vps24, and Snf7
           subunits. In contrast, there are three human paralogs of
           Snf7 (hSnf7-1/CHMP4A, hSnf7-2/CHMP4B, and
           hSnf7-3/CHMP4C) and two paralogs of Vps2 (CHMP2A and
           CHMP2B). Yeast ESCRT-I links directly to ESCRT-II,
           through a tight interaction of Vps28 (ESCRT-I) with the
           yeast-specific zinc-finger insertion within the GLUE
           domain of Vps36. The Vps36 subunit (ESCRT-II) binds
           ubiquitin using one of its two NZF zinc fingers in its
           N-terminal region. Human Vps36, EAP45, also binds
           ubiquitin despite having no NZF domain. Instead,
           mammalian ESCRT-II interacts with Ub through the Eap45
           GLUE domain itself. The yeast Vps36 GLUE has a complete
           PH domain, wherease Eap45 GLUE only has a PH-like fold
           since it lacks the secondary structure element
           corresponding to the 4 strand. ESCRT-II also interacts
           with ESCRT-III via a Vps25(EAP20)/Vps20(CHMP6)
           interaction. Structure 2CAY is missing this insertion
           that contains 2 NZF zinc fingers. It is a split PH
           domain, with a noncanonical lipid binding pocket that
           binds PI(3)P. The interactions of ESCRT-II GLUE domain
           with membranes, ESCRT-I, and ubiquitin are critical for
           ubiquitinated cargo progression from early to late
           endosomes. PH domains have diverse functions, but in
           general are involved in targeting proteins to the
           appropriate cellular location or in the interaction with
           a binding partner. They share little sequence
           conservation, but all have a common fold, which is
           electrostatically polarized. Less than 10% of PH domains
           bind phosphoinositide phosphates (PIPs) with high
           affinity and specificity. PH domains are distinguished
           from other PIP-binding domains by their specific
           high-affinity binding to PIPs with two vicinal phosphate
           groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
           which results in targeting some PH domain proteins to
           the plasma membrane. A few display strong specificity in
           lipid binding. Any specificity is usually determined by
           loop regions or insertions in the N-terminus of the
           domain, which are not conserved across all PH domains.
           PH domains are found in cellular signaling proteins such
           as serine/threonine kinase, tyrosine kinases, regulators
           of G-proteins, endocytotic GTPases, adaptors, as well as
           cytoskeletal associated molecules and in lipid
           associated enzymes.
          Length = 126

 Score = 27.7 bits (62), Expect = 5.5
 Identities = 10/43 (23%), Positives = 13/43 (30%), Gaps = 2/43 (4%)

Query: 245 SSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV--WICPACGVQ 285
            SPKI++      +           TG       W CP C   
Sbjct: 81  RSPKIILFLKDEINSKSGLGQSSNKTGSQNVTSQWTCPICMFT 123


>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor.  The ribosome recycling
           factor (RRF / ribosome release factor) dissociates the
           ribosome from the mRNA after termination of translation,
           and is essential bacterial growth. Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.
          Length = 165

 Score = 28.2 bits (64), Expect = 5.5
 Identities = 16/45 (35%), Positives = 28/45 (62%)

Query: 99  EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 143
           + L +++KDK I +D+    +KE +K+  K  KK  + +K KEK+
Sbjct: 119 DKLKKLEKDKEISEDEVKRAEKEIQKLTDKYIKKIDELLKKKEKE 163


>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
           modification].
          Length = 429

 Score = 29.2 bits (65), Expect = 5.5
 Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 5/93 (5%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK-----MK 174
             + K     + K +       +  E+K+K      + +  +  + K EKEFK      +
Sbjct: 12  NNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVTNSKGKTNKVVDAKLEKEFKDVLQRFQ 71

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
             +    ++I K+EK+  + K  K     ++ E
Sbjct: 72  VQENDTPKEITKDEKNMSRMKSRKAPTIHMEAE 104


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is
           a family of proteins conserved in yeasts. The function
           is not known. The Schizosaccharomyces pombe member is
           SPBC18E5.07 and the Saccharomyces cerevisiae member is
           AIM21.
          Length = 671

 Score = 29.1 bits (65), Expect = 5.5
 Identities = 36/207 (17%), Positives = 57/207 (27%), Gaps = 31/207 (14%)

Query: 87  PMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK 146
           P V    +   SE        KS +  K          + +    +D ++ +    + E 
Sbjct: 329 PAVSPSFEREKSEKSRHESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYEPLFPEDES 388

Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKED-----------------KEKPVEKIRKEEK 189
           +   K   EE  ++ E+EK       + ED                   +  E   +E  
Sbjct: 389 EIAVKPPTEESSRRPEEEKHRFPSEDVWEDSPSSLQDTATVSTPSNPPPRASETPEQETS 448

Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKEK--------------ERSSHEKEVIPKLTFKFGT 235
            S  E        +LK EK K + +                E              K  +
Sbjct: 449 RSSSEVSLDPHQSELKSEKKKARPEVSKQRFPSRDVWEDAPESQELVTTEETPEEVKSSS 508

Query: 236 DMEEKTKRESSPKIVIKPVKSPSPPPV 262
               K    S PK      +   PPPV
Sbjct: 509 PGVTKPAIPSRPKKGKPTSEKRKPPPV 535


>gnl|CDD|225349 COG2739, COG2739, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 105

 Score = 27.3 bits (61), Expect = 5.6
 Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 8/70 (11%)

Query: 101 LGEIKKDKSIKKD------KKTHKKKEK--EKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
           L EI ++ ++ +       K+T K  E   EK+K  +  K + ++ +K K+     +E  
Sbjct: 36  LSEIAEEFNVSRQAIYDNIKRTEKILEDYEEKLKLYEKYKIRKELYDKLKELITDPEELR 95

Query: 153 KKEEKEKKEE 162
           +  E     E
Sbjct: 96  EILEDLDNLE 105


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 28.9 bits (65), Expect = 5.6
 Identities = 18/68 (26%), Positives = 40/68 (58%), Gaps = 4/68 (5%)

Query: 164 EKKEEKEFK-MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
           E  EE+E   +++ + +  E+ R  E    + ++ ++ +++ ++EK ++KK++KER   E
Sbjct: 147 EVLEEEELAELRQQQRQ-FEQRRNAEL--AETQRLEEAERRRREEKERRKKQDKERKQRE 203

Query: 223 KEVIPKLT 230
           KE   K+ 
Sbjct: 204 KETAEKIA 211



 Score = 28.5 bits (64), Expect = 6.2
 Identities = 13/64 (20%), Positives = 43/64 (67%), Gaps = 3/64 (4%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
           E+E++ + + ++ + + +   ++ E  E ++ ++ E+ ++EEKE++++++ + K+ +++ 
Sbjct: 150 EEEELAELRQQQRQFEQR---RNAELAETQRLEEAERRRREEKERRKKQDKERKQREKET 206

Query: 181 VEKI 184
            EKI
Sbjct: 207 AEKI 210


>gnl|CDD|178320 PLN02718, PLN02718, Probable galacturonosyltransferase.
          Length = 603

 Score = 29.1 bits (65), Expect = 5.6
 Identities = 23/140 (16%), Positives = 50/140 (35%), Gaps = 21/140 (15%)

Query: 95  VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK------DKKDKDKIKNKEKDKEKKE 148
           V+ +         +S       HK KE++    +K      ++        +   K +  
Sbjct: 86  VSYNSSDKNNDSLESEVDGGNNHKPKEEQAQVSQKTTVSSSEEVQISARDIQLNHKTQFN 145

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKED---------------KEKPVEKIRKEEKDSEK 193
               K E+  + + +   +EK  ++++                  + V+++R   K+ E+
Sbjct: 146 PPTVKHEKNTRVQPRRATDEKVKEIRDKIIQAKAYLNLAPPGSNSQLVKELRLRTKELER 205

Query: 194 EKKSKDKDKKLKKEKLKKKK 213
                 KDK L K  L++ K
Sbjct: 206 AVGDATKDKDLSKSALQRMK 225


>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
          Length = 814

 Score = 29.2 bits (65), Expect = 5.6
 Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
           E ++GK EEKE+KEE  KK+       ED EKP   I + +  +E
Sbjct: 30  ESDEGKSEEKEEKEEV-KKDRDNIPEIEDGEKPTSGIEQHQYQTE 73


>gnl|CDD|233935 TIGR02577, cas_TM1794_Cmr2, CRISPR-associated protein Cas10/Cmr2,
           subtype III-B.  This model represent a Crm2 family of
           the CRISPR-associated RAMP module, a set of six genes
           recurring found together in prokaryotic genomes. This
           gene cluster is found only in species with CRISPR
           repeats, usually near the repeats themselves. Because
           most of the six (but not this family) contain RAMP
           domains, and because its appearance in a genome appears
           to depend on other CRISPR-associated Cas genes, the set
           is designated the CRISPR RAMP module. This protein,
           typified by TM1794 from Thermotoga maritima, is
           designated Crm2, for CRISPR RAMP Module protein 2
           [Mobile and extrachromosomal element functions, Other].
          Length = 483

 Score = 29.0 bits (65), Expect = 5.7
 Identities = 37/178 (20%), Positives = 57/178 (32%), Gaps = 25/178 (14%)

Query: 59  KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLG----EIKKDKSIKKDK 114
           +E +  K K    E     VKD+  G        V     E L      +KK+     DK
Sbjct: 104 QETIRIKIKDELAELLKKAVKDLIEG------PAVITDLEEGLKQKEEALKKELEKYADK 157

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK-------- 166
           +       EK +  KD K+ +         +    E+   +  EK   KE          
Sbjct: 158 RLSAPSLLEKEEVYKDVKNGNVGTGLNNVSKYLSFEESVLKLSEKFLRKELIRIYPKLRE 217

Query: 167 ------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
                   K F  + +K   +  I+              K K L+ + LK   +E+ER
Sbjct: 218 SLDAIGLTKRFSERCEKCLELPGIKCLLCGENLAIAG-VKRKLLELKDLKALLQEEER 274


>gnl|CDD|225768 COG3227, LasB, Zinc metalloprotease (elastase) [Amino acid
           transport and metabolism].
          Length = 507

 Score = 28.9 bits (65), Expect = 5.7
 Identities = 24/127 (18%), Positives = 39/127 (30%), Gaps = 11/127 (8%)

Query: 40  TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEK-EYSKVKDVELGVTPMVPNIVKVTSS 98
                 +K  +S K   + KE +   E  +     +         G T      V+    
Sbjct: 35  MKSQIFIKGTLSKKSAPNEKEILQFLENVNADNNLKAISTDTDPNGFT-----HVRYQQV 89

Query: 99  EDLGEIKKDKSIK--KDKKTHKKKEKEKIKKKKDKKD-KDKIKNKEK-DKEKKEKEKGKK 154
            + G   K   +    DK    K     + KKKD      KI  ++     +K+    + 
Sbjct: 90  VN-GVPVKGSEVIVHLDKNGVVKAVNGFVNKKKDLPTNTPKISAEQAIAIARKQFGYERL 148

Query: 155 EEKEKKE 161
            E  K E
Sbjct: 149 TESPKAE 155


>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
          Length = 1176

 Score = 29.1 bits (66), Expect = 5.7
 Identities = 15/51 (29%), Positives = 26/51 (50%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           K +KK +   +  K +     +++ +  +KE+ EK      E EK EEKE+
Sbjct: 827 KEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEE 877


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 28.2 bits (63), Expect = 5.8
 Identities = 26/105 (24%), Positives = 52/105 (49%), Gaps = 9/105 (8%)

Query: 118 KKKEKEKIKKKKDKKDK-----DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
           +K+E E+++K++   D+     DK+ ++ K++     +   K   E   +KE  E ++  
Sbjct: 44  RKQEIEELEKERIMNDRLLERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPT 103

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
             E  E   +++    KD   EK      ++L+K + K KK++KE
Sbjct: 104 YNEMVEDLFDQV----KDEVDEKNGAALIEELQKHRDKLKKEQKE 144


>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 199 to 238 amino acids in length. This domain is
           found associated with pfam06512, pfam00520. This domain
           has a conserved ADD sequence motif.
          Length = 222

 Score = 28.6 bits (64), Expect = 5.8
 Identities = 17/65 (26%), Positives = 28/65 (43%), Gaps = 9/65 (13%)

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           E  K+      + K+ +           I  EE DSEKE KS+  D + +   L +K ++
Sbjct: 9   ESAKERRNRNDKNKKEEHS---------IGSEEGDSEKEPKSESADGRKRCRFLLRKTRK 59

Query: 216 KERSS 220
             +S 
Sbjct: 60  VSQSL 64


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 29.1 bits (66), Expect = 5.9
 Identities = 38/169 (22%), Positives = 65/169 (38%), Gaps = 33/169 (19%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE-- 158
           L ++K        ++  KKKEK+K KKKK K  + KI    K+  +++ E  K    E  
Sbjct: 385 LKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDLELLKNVWGEIL 444

Query: 159 ---KKEEKEK---------------------KEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
              K + K                       + E  F+     ++  + I         +
Sbjct: 445 ESLKAQRKSLRALLVNSEPVAASEDTVVLAFEYEIHFEKAMLDKELNDTIENILSKLLGK 504

Query: 195 K-----KSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIPKLTFKFGTD 236
                   +D+ +K+++E L+K K E+E    E    +IP+    FG D
Sbjct: 505 SVEIVAVPEDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEAKELFGED 553



 Score = 28.3 bits (64), Expect = 9.3
 Identities = 19/83 (22%), Positives = 45/83 (54%), Gaps = 7/83 (8%)

Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
             +++  E++ ++ + +      ++K+++KEKK+ K+ K K     P  KI K  K++ +
Sbjct: 375 LQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYK----VPRGKIYKVLKEATR 430

Query: 194 E--KKSKDKDKKLKKEKLKKKKK 214
           +  +  K+   ++  E LK ++K
Sbjct: 431 QDLELLKNVWGEI-LESLKAQRK 452


>gnl|CDD|223173 COG0095, LplA, Lipoate-protein ligase A [Coenzyme metabolism].
          Length = 248

 Score = 28.5 bits (64), Expect = 5.9
 Identities = 14/75 (18%), Positives = 32/75 (42%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           +  KEK K K  +  ++     +E  +  ++E  E  +E   K      +E +   ++ +
Sbjct: 170 RVPKEKIKSKGIKSVRERVANLEELLKISVEEFLEALLEAFFKVLGVELEEYELTPEELE 229

Query: 204 LKKEKLKKKKKEKER 218
           L ++  ++K    E 
Sbjct: 230 LAEKLAEEKYSSWEW 244


>gnl|CDD|184538 PRK14149, PRK14149, heat shock protein GrpE; Provisional.
          Length = 191

 Score = 28.3 bits (63), Expect = 6.0
 Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 8/80 (10%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE----EKEKKEEKEFKMKEDKEKPVE 182
           K +  ++ D +  KE + E+K     +++ +EK+E    E E KE+ E K KE  EK + 
Sbjct: 2   KDEHNQEHDHLSQKEPESEEKACACKEQQGEEKQEASEKEGEIKEDFELKYKEMHEKYL- 60

Query: 183 KIRKEEKDSEKEKKSKDKDK 202
              +   D E  KK  ++DK
Sbjct: 61  ---RVHADFENVKKRLERDK 77


>gnl|CDD|144261 pfam00603, Flu_PA, Influenza RNA-dependent RNA polymerase subunit
           PA. 
          Length = 669

 Score = 29.1 bits (65), Expect = 6.0
 Identities = 19/105 (18%), Positives = 40/105 (38%), Gaps = 12/105 (11%)

Query: 37  PPGTPPPSKMK---DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVT---PMVP 90
           P       +M+   DK +P++      +MS + K   K   +++++ + L      P  P
Sbjct: 188 PLPFSTYEQMRDYCDKFTPREIEPNVSRMSGRVKLPIKHYRWTQMRPIPLLKEPEGPPSP 247

Query: 91  NIVKVTSSEDL------GEIKKDKSIKKDKKTHKKKEKEKIKKKK 129
              K+  ++ L          ++  I  D     + E E I+  +
Sbjct: 248 YRAKLLMADALYLTICDPGTDQEGIILIDSTFFSQSEPECIRPSE 292


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
           factor dissociates the posttermination complex, composed
           of the ribosome, deacylated tRNA, and mRNA, after
           termination of translation.  Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.  RRF is believed to bind the ribosome at the
           A-site in a manner that mimics tRNA, but the specific
           mechanisms remain unclear.  RRF is essential for
           bacterial growth.  It is not necessary for cell growth
           in archaea or eukaryotes, but is found in mitochondria
           or chloroplasts of some eukaryotic species.
          Length = 179

 Score = 28.4 bits (64), Expect = 6.1
 Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 114 KKTHKKKEKEKIKKKKDKKD-KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE--KKEEKE 170
           K   K  E+ K+  +  ++D  DKIK  EK+KE  E E  K EE  +K   E  KK ++ 
Sbjct: 111 KDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDEYIKKIDEL 170

Query: 171 FKMKEDK 177
            K KE +
Sbjct: 171 LKSKEKE 177


>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
          Length = 211

 Score = 28.3 bits (63), Expect = 6.1
 Identities = 12/56 (21%), Positives = 21/56 (37%), Gaps = 2/56 (3%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKN--KEKDKEKKEKEKGKKEEKEKKEEKEK 165
            ++K    ++    K K     + K+K   K +   K        EEK ++EE   
Sbjct: 35  PNRKKSAPEQVPTGKNKLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVEREELGN 90



 Score = 28.3 bits (63), Expect = 6.4
 Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 1/54 (1%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
              KK   E++   K+K    K ++K K   K  K        E  EEK ++EE
Sbjct: 35  PNRKKSAPEQVPTGKNKLLLTK-RSKLKGIPKPRKLHKHGFWAEIFEEKVEREE 87



 Score = 28.3 bits (63), Expect = 6.8
 Identities = 21/95 (22%), Positives = 35/95 (36%), Gaps = 14/95 (14%)

Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
              +KK   +     KNK     K+ K KG  + ++  +     E  E           E
Sbjct: 34  SPNRKKSAPEQVPTGKNK-LLLTKRSKLKGIPKPRKLHKHGFWAEIFE-----------E 81

Query: 183 KIRKEEKDSEKEKKSKD--KDKKLKKEKLKKKKKE 215
           K+ +EE  +  +   KD  K K+  +  LK    +
Sbjct: 82  KVEREELGNPCKDLDKDIIKGKRRPRLTLKNTNNQ 116


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 28.5 bits (64), Expect = 6.1
 Identities = 18/84 (21%), Positives = 43/84 (51%)

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
           ++I+ + +++ + EK +  +E ++K EE  +K+EK+ +M++       K     K     
Sbjct: 11  EEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNEARLKVLNAR 70

Query: 195 KKSKDKDKKLKKEKLKKKKKEKER 218
           ++  D   +  KE+L    ++K+ 
Sbjct: 71  EELLDSVFEEAKERLANLSEDKDE 94


>gnl|CDD|215936 pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain.  This
           family is involved in biosynthesis of guanosine
           nucleotide. Members of this family contain a TIM barrel
           structure. In the inosine monophosphate dehydrogenases 2
           CBS domains pfam00571 are inserted in the TIM barrel.
           This family is a member of the common phosphate binding
           site TIM barrel family.
          Length = 467

 Score = 28.8 bits (65), Expect = 6.1
 Identities = 25/136 (18%), Positives = 44/136 (32%), Gaps = 5/136 (3%)

Query: 74  EYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIK-----KDKKTHKKKEKEKIKKK 128
           E  KVK  E G       +   T+  +  E+K+   I      +D K        +  + 
Sbjct: 77  EVRKVKRFESGFITDPVTVSPDTTVAEALELKERYGISGVPVTEDGKLLGGLVGIRTSRD 136

Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
            D   K  +     +      E    EE  +   + K E+      + +   +   +  E
Sbjct: 137 IDFLSKVSVVMTMTEDLVTAPEGITLEEANEILHEHKIEKLPIVDDDGELVGLITRKDIE 196

Query: 189 KDSEKEKKSKDKDKKL 204
           K  +    SKD   +L
Sbjct: 197 KARDYPNASKDAQGRL 212


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 28.1 bits (62), Expect = 6.2
 Identities = 21/151 (13%), Positives = 88/151 (58%)

Query: 96  TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
           T +E+   +  +K  +  ++  +++++ + ++++D+ +++++K +  ++  + +E+ +++
Sbjct: 19  TDAEEATRLLAEKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQ 78

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
           E+E+  EKE+K +++ + +E +E+  ++  +++K+  + +  ++ ++   + +   ++ E
Sbjct: 79  EEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLEREKHFQQIE 138

Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
           +ER   +K +   +     +++  + K+E  
Sbjct: 139 QERLERKKRLEEIMKRTRKSEVSPQVKKEDP 169


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 28.2 bits (63), Expect = 6.2
 Identities = 24/71 (33%), Positives = 34/71 (47%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
           KKT KKK+    KKK  KK K K K KE+  E   +E    EE E+ ++K   E  + + 
Sbjct: 48  KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQN 107

Query: 174 KEDKEKPVEKI 184
              +    + I
Sbjct: 108 TAAEVNHEDVI 118



 Score = 28.2 bits (63), Expect = 7.4
 Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 5/76 (6%)

Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
           EI   K+ KK K T +KK+  K  KKK K      K +  +   +E    ++ E+  K+ 
Sbjct: 44  EIPSKKTSKKKKTTPRKKKTTKKTKKKKK-----EKEEVPELAAEELSDSEENEENDKKV 98

Query: 163 KEKKEEKEFKMKEDKE 178
             +  + +    E   
Sbjct: 99  DYELPKVQNTAAEVNH 114


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
           protein.  This domain family is found in eukaryotes, and
           is typically between 212 and 238 amino acids in length.
           The family is found in association with pfam01805. There
           are two completely conserved residues (W and H) that may
           be functionally important. PRP21 is required for
           assembly of the prespliceosome and it interacts with U2
           snRNP and/or pre-mRNA in the prespliceosome. This family
           also contains proteins similar to PRP21, such as the
           mammalian SF3a. SF3a also interacts with U2 snRNP from
           the prespliceosome, converting it to its active form.
          Length = 230

 Score = 28.6 bits (64), Expect = 6.2
 Identities = 19/84 (22%), Positives = 41/84 (48%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           +E       ++EE+ ++EE++  EEK+ +  +    P+++++       + K  KD    
Sbjct: 97  EEPDAAAAPEEEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGESRLKIRKDYVPS 156

Query: 204 LKKEKLKKKKKEKERSSHEKEVIP 227
             K + +K KK+  +     E+IP
Sbjct: 157 GNKPRAEKPKKKMIKCPITGELIP 180


>gnl|CDD|216016 pfam00602, Flu_PB1, Influenza RNA-dependent RNA polymerase subunit
           PB1.  Two GTP binding sites exist in this protein.
          Length = 740

 Score = 29.1 bits (65), Expect = 6.2
 Identities = 16/59 (27%), Positives = 31/59 (52%)

Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           +K ++K  K K +K +  +KE  K  K K+  K++++  +EE   +      M +D E+
Sbjct: 176 DKPEVKFTKHKNNKRRYDDKETGKLLKRKQPSKEKDRITREEYLIRALTLNTMTKDAER 234


>gnl|CDD|183377 PRK11910, PRK11910, amidase; Provisional.
          Length = 615

 Score = 28.8 bits (64), Expect = 6.5
 Identities = 20/101 (19%), Positives = 42/101 (41%), Gaps = 10/101 (9%)

Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG---------KKEEKE 158
           ++   + K+   K+K+   KK+  K         K+KE    EKG         +  ++ 
Sbjct: 29  QTTTSENKSAVSKKKKPTVKKEKPKQSSNNLTLGKNKENFHLEKGFGNKQLQVERIIDRI 88

Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
            +   + + E + K K + +K  + I+  +    K +K +D
Sbjct: 89  FQSSLKNRTEIKVKPKNNPQKK-QNIKPVKPIPSKPEKPED 128


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.  This
           family includes the HABP4 family of hyaluronan-binding
           proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
           HABP4 has been observed to bind hyaluronan (a
           glucosaminoglycan), but it is not known whether this is
           its primary role in vivo. It has also been observed to
           bind RNA, but with a lower affinity than that for
           hyaluronan. PAI-1 mRNA-binding protein specifically
           binds the mRNA of type-1 plasminogen activator inhibitor
           (PAI-1), and is thought to be involved in regulation of
           mRNA stability. However, in both cases, the sequence
           motifs predicted to be important for ligand binding are
           not conserved throughout the family, so it is not known
           whether members of this family share a common function.
          Length = 106

 Score = 27.4 bits (61), Expect = 6.6
 Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 5/61 (8%)

Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE---EKEFKMKEDKEKPVE--KIRKEEK 189
           +K + +EK   ++E  K    E+E+ EE+E KE   E+  K  ++K K +   K+RK ++
Sbjct: 46  EKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKALQEKRKALPALKVRKAKE 105

Query: 190 D 190
            
Sbjct: 106 G 106


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 29.1 bits (65), Expect = 6.7
 Identities = 31/187 (16%), Positives = 56/187 (29%), Gaps = 14/187 (7%)

Query: 89  VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
            P+ V   +S  L +  K  +I  +     +K+      ++   D D     +   E  E
Sbjct: 43  GPSAVDERTSGVLRDEGKHANILYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDE 102

Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
            + G     E   + +   E   + ++   +          D   +    D       + 
Sbjct: 103 DDAGNATSGESSTDDDSLLELPDRDEDADTQ-------ANNDQTNDFDQDDSSNSQTDQG 155

Query: 209 LKKKKKEKERSSH--EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 266
           LK+            EK+   + TFKF     +  +  +         KS S  P    D
Sbjct: 156 LKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKD-----GGKSKSSDPGPLND 210

Query: 267 EYDTGDS 273
               GD 
Sbjct: 211 SDGQGDD 217


>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
          Length = 193

 Score = 28.2 bits (62), Expect = 6.8
 Identities = 20/71 (28%), Positives = 45/71 (63%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
           KNK K+ ++++ ++  + EK KKE +E  E+K   + E+ E   E+  ++E+++E++K  
Sbjct: 7   KNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPK 66

Query: 198 KDKDKKLKKEK 208
           ++ D  +++ K
Sbjct: 67  EEIDYPIQENK 77


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 28.9 bits (65), Expect = 6.8
 Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 13/97 (13%)

Query: 138 KNKEKDKEKKEKEKGKKE--------EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK--E 187
           KN  +DKEK+EK   K +        EK+ KE K+K  E++   KE  E  ++K+R+  +
Sbjct: 516 KNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEK---KEKIENLIKKLRQALQ 572

Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
             + E  K   ++ +K   E  K+       +     
Sbjct: 573 NDNYESIKSLLEELQKALMEIGKEVYSSTSTTDPASN 609


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 28.5 bits (64), Expect = 6.9
 Identities = 12/89 (13%), Positives = 37/89 (41%), Gaps = 1/89 (1%)

Query: 92  IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           + +        + K    + K +   KK+   +I+ +  +  ++  + +E+ +E K    
Sbjct: 43  VEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRALA 102

Query: 152 GKKEE-KEKKEEKEKKEEKEFKMKEDKEK 179
            ++ +      + EK+   + +  +D+ K
Sbjct: 103 QRRSDLSSASYQLEKRRASQLEKLQDEIK 131


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 28.7 bits (64), Expect = 7.0
 Identities = 45/212 (21%), Positives = 79/212 (37%), Gaps = 16/212 (7%)

Query: 45  KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKV-----TSSE 99
           +M   +S  D    K+ +    + +  E+ Y K+KDV   +   + NI  +     +S+ 
Sbjct: 139 EMDSLIS-GDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNL 197

Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEK------IKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
           +L  IKK   I  D+K+H    KE            D  +  K    E    +  K + +
Sbjct: 198 ELENIKK--QIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYE 255

Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
            E K  + +   + EK    KE +E+ ++ I      +        K K   + K +   
Sbjct: 256 SEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILS 315

Query: 214 KEKERSSHEKEVIPKLTF--KFGTDMEEKTKR 243
                 +    +I KL+   K   D  +K  R
Sbjct: 316 NIDAEINKYHAIIKKLSVLQKDYNDYIKKKSR 347



 Score = 28.7 bits (64), Expect = 8.6
 Identities = 20/118 (16%), Positives = 45/118 (38%), Gaps = 2/118 (1%)

Query: 104 IKKDKSIKKDKKTH--KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
           IKK   ++KD   +  KK   + +  +  + +  ++      K  +  +K  +E  +  E
Sbjct: 328 IKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIE 387

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
                  +  K++E     ++K   E     ++  SK      +   L++   E  R+
Sbjct: 388 RMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRN 445


>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
          Length = 313

 Score = 28.6 bits (64), Expect = 7.0
 Identities = 23/97 (23%), Positives = 35/97 (36%), Gaps = 1/97 (1%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
           KE K K   K   K   +  +++   E    +  E   E IRKE K+ +   +       
Sbjct: 29  KEGKSKFGDKPLGKLDVDSDDEQPAPERGFAQAPEDDFEIIRKERKEPDFG-RENSFHDP 87

Query: 204 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEK 240
           L  + L   + E+E    E+E  P           EK
Sbjct: 88  LIDDPLFGGELEEEEDKFEQEEAPIPVQAQSQPQPEK 124


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 28.5 bits (63), Expect = 7.1
 Identities = 11/59 (18%), Positives = 31/59 (52%)

Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
            +    + K  K K++    + ++  ++ K EEK+ + ++++E+  E+ ++EE     +
Sbjct: 185 SLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243


>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
          Length = 568

 Score = 28.9 bits (65), Expect = 7.1
 Identities = 32/172 (18%), Positives = 57/172 (33%), Gaps = 16/172 (9%)

Query: 88  MVPNIVKVTSSEDLG----EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 143
            + N ++V SS D      EIK D S  +  +      +E+ +  +          +   
Sbjct: 335 SIKNKIQVNSSSDSSREIEEIKFDLSEDR-SEIEILVFREQSQLSQSSLSGIFAYMQSLK 393

Query: 144 KEKK---------EKEKGKKEEKEKKEEKEKKEEK--EFKMKEDKEKPVEKIRKEEKDSE 192
           KE +          K       + KK  KE+  +     +    +   +  +R+EE   +
Sbjct: 394 KETETLKSRIDLIMKSIISGTFERKKLLKEEGWKYKSTLQFLRIEIDRLLLLREEELSKK 453

Query: 193 KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 244
           K K  K    +     L     E+     E E   KL     T +  ++ R 
Sbjct: 454 KTKIHKLNKLRHDLSSLLSSIPEETSDRVESEKASKLRSSASTKLNLRSSRS 505


>gnl|CDD|214378 CHL00160, rpl9, ribosomal protein L9; Provisional.
          Length = 153

 Score = 27.8 bits (62), Expect = 7.6
 Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 3/71 (4%)

Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER---SSHEKEVIPKL 229
           +K+ K        K ++  EK  K K   ++++K  +KKK  E  +   S  EKE+   +
Sbjct: 49  LKQQKMYQKILDLKLKEAKEKCLKVKQLLEEIQKFSVKKKVGENNQIFGSVTEKEISQII 108

Query: 230 TFKFGTDMEEK 240
             K   D+E++
Sbjct: 109 KNKTNIDLEKQ 119


>gnl|CDD|129272 TIGR00168, infC, translation initiation factor IF-3.  infC uses
           abnormal initiation codons such as AUA, AUC, and CUG
           which render its expression particularly sensitive to
           excess of its gene product IF-3 thereby regulating its
           own expression [Protein synthesis, Translation factors].
          Length = 165

 Score = 27.7 bits (62), Expect = 7.7
 Identities = 27/93 (29%), Positives = 49/93 (52%), Gaps = 7/93 (7%)

Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK-EKLKKKKK 214
           E+EKK+++ KK +K  ++KE K +P   I + +       K K   + L+K +K+K   +
Sbjct: 62  EQEKKKKEAKKNQKIIQVKEVKMRP--TIDEHDLQF----KLKQAIRFLEKGDKVKFTVR 115

Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSP 247
            + R S   E+  K+  +F TD+ +  + E  P
Sbjct: 116 FRGRESTHIELGEKVLDRFYTDVADVAEVEKPP 148


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
           Provisional.
          Length = 1164

 Score = 28.8 bits (64), Expect = 7.7
 Identities = 36/161 (22%), Positives = 61/161 (37%), Gaps = 12/161 (7%)

Query: 51  SPKDKTSPKEKMSS----KEKTSPKEKEYS-KVKDVELGVTPMVPNIVKVTSSEDLGEIK 105
           S       KE  S+        SP+ KE S K    +  VT         T  + +    
Sbjct: 156 SKHSPKVIKENQSTHVNISSDNSPRNKEISNKQLKKQTNVT-------HTTCYDKMRRSP 208

Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
           ++ S  K+    K KEK K K K  KKD+D  K  +++ EK + +    + +  +   ++
Sbjct: 209 RNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKE 268

Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
            ++ E  +       V   RK +   +      +  KK K 
Sbjct: 269 NKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQKKAKT 309


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 28.2 bits (64), Expect = 7.8
 Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 16/63 (25%)

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK---------EKERSSHEKEVIPK 228
           EK +E++       EK+ KS DK+ K + E L+K K+           E +  E +++  
Sbjct: 137 EKRLERL-------EKKAKSGDKEAKEELELLEKIKEHLEEGKPARTLELTDEEIKILKS 189

Query: 229 LTF 231
           L  
Sbjct: 190 LQL 192


>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
          Length = 859

 Score = 28.6 bits (65), Expect = 7.8
 Identities = 10/49 (20%), Positives = 17/49 (34%), Gaps = 5/49 (10%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
             +K  K +       K     K  +K       +K K++    EK+ K
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVK-----FSRKTKQQYVASEKDGK 857


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 27.3 bits (61), Expect = 8.0
 Identities = 17/42 (40%), Positives = 27/42 (64%)

Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
           + +++K  K+  KEK K+KK KK K K K K+  K+  +K+K
Sbjct: 94  LVEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135


>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal.  This domain
           is specific to the N-terminal part of the prp1 splicing
           factor, which is involved in mRNA splicing (and possibly
           also poly(A)+ RNA nuclear export and cell cycle
           progression). This domain is specific to the N terminus
           of the RNA splicing factor encoded by prp1. It is
           involved in mRNA splicing and possibly also poly(A)and
           RNA nuclear export and cell cycle progression.
          Length = 131

 Score = 27.3 bits (61), Expect = 8.1
 Identities = 15/75 (20%), Positives = 34/75 (45%)

Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
                ++D+   + +D + +      K + E +E     E  + +M E ++K  E+  KE
Sbjct: 29  VDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKE 88

Query: 188 EKDSEKEKKSKDKDK 202
           E +  +E+  K + +
Sbjct: 89  EIEKYREENPKIQQQ 103


>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD.  YqfD is part of
           the sigma-E regulon in the sporulation program of
           endospore-forming Gram-positive bacteria. Mutation
           results in a sporulation defect in Bacillus subtilis.
           Members are found in all currently known
           endospore-forming bacteria, including the genera
           Bacillus, Symbiobacterium, Carboxydothermus,
           Clostridium, and Thermoanaerobacter [Cellular processes,
           Sporulation and germination].
          Length = 382

 Score = 28.5 bits (64), Expect = 8.1
 Identities = 22/95 (23%), Positives = 45/95 (47%), Gaps = 3/95 (3%)

Query: 119 KKEKEKIKKKKDKKDK-DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE--KEFKMKE 175
             +K  +KK   +  K D  +  +K    +     +  ++   E KEK E+  KE  +++
Sbjct: 275 GGKKIYLKKPPKEFAKYDVEEKTKKFGLWRWSLPIEIVKETYYEVKEKVEKVTKEEAIEK 334

Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
            KEK  ++++KE   + K    K   ++++  K+K
Sbjct: 335 AKEKAEKELKKELDPNAKIVSDKILSERVEGGKVK 369


>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 147 to 176 amino acids in length. This domain is
           found associated with pfam00226.
          Length = 144

 Score = 27.7 bits (62), Expect = 8.1
 Identities = 15/46 (32%), Positives = 27/46 (58%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
           K +K+KE+ ++ EK K E  +KK+E E  +    +    K+ +EE+
Sbjct: 4   KAQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREEE 49


>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein.  This family
           consists of several eukaryotic paraflagellar rod
           component proteins. The eukaryotic flagellum represents
           one of the most complex macromolecular structures found
           in any organism and contains more than 250 proteins. In
           addition to its locomotive role, the flagellum is
           probably involved in nutrient uptake since receptors for
           host low-density lipoproteins are localised on the
           flagellar membrane as well as on the flagellar pocket
           membrane.
          Length = 289

 Score = 28.1 bits (63), Expect = 8.3
 Identities = 17/103 (16%), Positives = 52/103 (50%), Gaps = 8/103 (7%)

Query: 123 EKIKKKKDKKD--KDKIKNK-EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF--KMKEDK 177
           + I    + K   K   +   E+ KEK+E+E     ++ K+   +++E ++F  +  E +
Sbjct: 2   KDISAVIEAKHRQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQ 61

Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
           +     I++ E++    +   ++ ++  + +++  ++E++R +
Sbjct: 62  QACWRAIQELEREL---RDLAEERREEVERRIEAVEREEQRRT 101


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score = 28.6 bits (64), Expect = 8.3
 Identities = 20/86 (23%), Positives = 33/86 (38%), Gaps = 7/86 (8%)

Query: 97  SSEDLGEIKKDK--SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
                  I      SI K KK  +K +KEK  K  D+ D D   N+    E       ++
Sbjct: 50  EWITADLINLGAAISIPKRKKQTEKGKKEKKPKVSDRMDLD---NENVQLEMLYSISNER 106

Query: 155 EEKEKKEEKEKKE--EKEFKMKEDKE 178
           E ++ +    K +   +  ++K   E
Sbjct: 107 EIRQLRFGGSKVQNPHEGARVKNINE 132


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 28.5 bits (64), Expect = 8.4
 Identities = 26/106 (24%), Positives = 49/106 (46%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
           K+   H K+E E++K+     + +    ++ +KE KE E    E  E  E +E    +  
Sbjct: 322 KENNEHLKEEIERVKESYRLAETELGSVRKFEKELKELESVLDEILENIEAQEVAYSELQ 381

Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
              E+ EK +  I  E++  ++   S  KD+   +E L++ K +  
Sbjct: 382 DNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKSKLH 427


>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
            NSA2 appears to be a protein required for the
           maturation of 27S pre-rRNA in yeast; it has been
           characterized in mammalian cells as a nucleolar protein
           that might play a role in the regulation of the cell
           cycle and in cell proliferation.
          Length = 257

 Score = 28.0 bits (63), Expect = 8.4
 Identities = 14/39 (35%), Positives = 22/39 (56%)

Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
           K K  +KK+ KEKI+ KK  K  ++   K+K  +K  + 
Sbjct: 43  KAKLYNKKRYKEKIQMKKTIKMHEERNVKQKVDDKVPEG 81


>gnl|CDD|234042 TIGR02870, spore_II_D, stage II sporulation protein D.  Stage II
           sporulation protein D (SpoIID) is a protein of the
           endospore formation program in a number of lineages in
           the Firmicutes (low-GC Gram-positive bacteria). It is
           expressed in the mother cell compartment, under control
           of Sigma-E. SpoIID, along with SpoIIM and SpoIIP, is one
           of three major proteins involved in engulfment of the
           forespore by the mother cell [Cellular processes,
           Sporulation and germination].
          Length = 338

 Score = 28.1 bits (63), Expect = 8.5
 Identities = 14/52 (26%), Positives = 24/52 (46%)

Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
             + +   E+ EK    +++ EK     K+EE   K+    E  +EK+  EE
Sbjct: 25  FGDPQNVTEENEKLAPVEKKPEKGVTNLKEEEVAIKVYRSNEDKIEKVPLEE 76


>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
           protein; Provisional.
          Length = 196

 Score = 28.0 bits (62), Expect = 8.6
 Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE-KIRKEEKDSEKE 194
           K K+E +   K     K ++   +  E ++K  K K +E  +  EE + EK+
Sbjct: 144 KAKEEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEGEKD 195


>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
            Provisional.
          Length = 1136

 Score = 28.6 bits (63), Expect = 8.8
 Identities = 36/146 (24%), Positives = 64/146 (43%), Gaps = 14/146 (9%)

Query: 94   KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK--------------IKN 139
            ++ S E L E K  K  +  KK     EK     KKDKK K K              + N
Sbjct: 866  QILSDEGLDEKKLKKRAESLKKLANAIEKYAGGGKKDKKAKKKDAKDLSGNIAHEINLIN 925

Query: 140  KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
            KE   + +   +  KE  E   E++ +E  E   +E+ E+ VE+  +E  +   E+  ++
Sbjct: 926  KELKNQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEE 985

Query: 200  KDKKLKKEKLKKKKKEKERSSHEKEV 225
              ++  +E +++  +E    + E+ V
Sbjct: 986  NVEENVEENVEENVEENIEENVEENV 1011


>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive. 
            Pullulan is an unusual, industrially important
            polysaccharide in which short alpha-1,4 chains
            (maltotriose) are connected in alpha-1,6 linkages.
            Enzymes that cleave alpha-1,6 linkages in pullulan and
            release maltotriose are called pullulanases although
            pullulan itself may not be the natural substrate. In
            contrast, a glycogen debranching enzyme such GlgX,
            homologous to this family, can release glucose at
            alpha,1-6 linkages from glycogen first subjected to limit
            degradation by phosphorylase. Characterized members of
            this family include a surface-located pullulanase from
            Streptococcus pneumoniae (PMID:11083842) and an
            extracellular bifunctional amylase/pullulanase with
            C-terminal pullulanase activity (PMID:8798645).
          Length = 1111

 Score = 28.7 bits (64), Expect = 8.8
 Identities = 26/81 (32%), Positives = 31/81 (38%), Gaps = 8/81 (9%)

Query: 12   VAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSK------ 65
            V G     + PP  +H   PQ    P   P  SK KDKV PKD   P     S       
Sbjct: 1004 VGGIEAPEKTPPPPEHE--PQAPKPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGET 1061

Query: 66   EKTSPKEKEYSKVKDVELGVT 86
             K + ++KE    K   L  T
Sbjct: 1062 PKGNEEKKEEQPDKGANLPNT 1082


>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564.  This
           family of proteins has no known function. However, one
           of the members is annotated as an EF-hand family
           protein.
          Length = 349

 Score = 28.2 bits (63), Expect = 8.9
 Identities = 20/105 (19%), Positives = 52/105 (49%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
             D++     E+     ++      K         + E+++ + +E+++K+E E++++K+
Sbjct: 244 YLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKK 303

Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
            +M    ++  E     +K  E+ K+   K +K +KE+ K+ ++E
Sbjct: 304 KEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEEKEREKEYEQE 348


>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
           structure and biogenesis].
          Length = 107

 Score = 26.8 bits (59), Expect = 9.0
 Identities = 17/50 (34%), Positives = 30/50 (60%)

Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
           K++   ++K EK K    + +KK  K+KK+E+  +     E+ ++KIRKE
Sbjct: 4   KSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRKE 53


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 28.6 bits (64), Expect = 9.0
 Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 3/60 (5%)

Query: 7   SPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKE 66
              P  A        P +   P    P V+PP T P   ++     ++K  P  K+SS  
Sbjct: 411 PSAPQSATQPA-GTPPTVSVDPPAAVP-VNPPSTAPQ-AVRPAQFKEEKKIPVSKVSSLG 467


>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator.  This protein is
           found in a wide range of eukaryotes. It is a nuclear
           protein and is suggested to be DNA binding. In plants,
           this family is essential for correct circadian clock
           functioning by acting as a light-quality regulator
           coordinating the activities of blue and red light
           signalling pathways during plant growth - inhibiting
           growth in red light but promoting growth in blue light.
          Length = 233

 Score = 28.1 bits (63), Expect = 9.1
 Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 3/92 (3%)

Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE-FKMKEDKEKPVEKIR 185
           KKK KK K K+   + D+E+ E E   +++  K+  +  +      K K  K   V+   
Sbjct: 1   KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSF 60

Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
               D  +E+K  +  ++L++E LKK++  KE
Sbjct: 61  L--PDKAREEKEAELREELREEFLKKQEAVKE 90


>gnl|CDD|237972 PRK15483, PRK15483, type III restriction-modification system StyLTI
           enzyme res; Provisional.
          Length = 986

 Score = 28.5 bits (64), Expect = 9.2
 Identities = 20/109 (18%), Positives = 44/109 (40%), Gaps = 9/109 (8%)

Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK---EKDKEKKEKEKGK-KEE 156
           +GEI  D  ++ +++   ++  +KI  +K K D D  + +   + D +K        KE+
Sbjct: 577 VGEINSDSKVQLNEEKLTEEMIQKIVTEKQKVDPDFTELRLLEDLDDKKIIDRSNNFKED 636

Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
                 +   E  +  +++ K         +       K  K+  ++LK
Sbjct: 637 GFDWLLEAYPELFQTGLRKGK-----VRDNKPASKLTVKLRKENYEELK 680


>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein.  The PcfK-like protein family
           includes the Enterococcus faecalis PcfK protein, which
           is functionally uncharacterized. This family of proteins
           is found in bacteria and viruses. Proteins in this
           family are typically between 137 and 257 amino acids in
           length. There are two completely conserved residues (D
           and L) that may be functionally important.
          Length = 136

 Score = 27.3 bits (61), Expect = 9.2
 Identities = 13/59 (22%), Positives = 27/59 (45%)

Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
           +E + K  K    +      VE   +E+ ++ KE     + ++L+K + + KK +K   
Sbjct: 68  DEDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEP 126


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 28.4 bits (64), Expect = 9.2
 Identities = 14/43 (32%), Positives = 19/43 (44%)

Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
           K   K   K K+KKKK K     +   E  +E K   K K ++
Sbjct: 351 KPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393



 Score = 28.1 bits (63), Expect = 9.9
 Identities = 12/43 (27%), Positives = 21/43 (48%)

Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
           K  KK++ K + +K KKK       K +  ++ K   +K+  K
Sbjct: 351 KPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393


>gnl|CDD|222859 PHA02535, P, terminase ATPase subunit; Provisional.
          Length = 581

 Score = 28.5 bits (64), Expect = 9.3
 Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 2/68 (2%)

Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
           EKE K   ++K  +   +  E++ +  + S    ++        + K  K+K  K   S 
Sbjct: 66  EKENKTGGDYKEIDLLIRQHERLARVRRYSGTGNEADLNPNVANRNKGPKRKPVKNDIS- 124

Query: 222 EKEVIPKL 229
             E   KL
Sbjct: 125 -DEQTEKL 131


>gnl|CDD|153355 cd07671, F-BAR_PSTPIP1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Protein 1.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Proline-Serine-Threonine
           Phosphatase-Interacting Protein 1 (PSTPIP1), also known
           as CD2 Binding Protein 1 (CD2BP1), is mainly expressed
           in hematopoietic cells. It is a binding partner of the
           cell surface receptor CD2 and PTP-PEST, a tyrosine
           phosphatase which functions in cell motility and Rac1
           regulation. It also plays a role in the activation of
           the Wiskott-Aldrich syndrome protein (WASP), which
           couples actin rearrangement and T cell activation.
           Mutations in the gene encoding PSTPIP1 cause the
           autoinflammatory disorder known as PAPA (pyogenic
           sterile arthritis, pyoderma gangrenosum, and acne)
           syndrome. PSTPIP1 contains an N-terminal F-BAR domain,
           PEST motifs, and a C-terminal SH3 domain. F-BAR domains
           form banana-shaped dimers with a positively-charged
           concave surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 242

 Score = 28.0 bits (62), Expect = 9.3
 Identities = 22/98 (22%), Positives = 43/98 (43%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
           +E++K ++KK +   ++++  +    KK  E  K  E+  +E  E ++  E        K
Sbjct: 94  RERQKEQRKKYEAVMERVQKSKVSLYKKTMESKKTYEQRCREADEAEQTFERSSSTGNPK 153

Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
             EK + + K         ++  K   E+L K + E E
Sbjct: 154 QSEKSQNKAKQCRDAATEAERVYKQNIEQLDKARTEWE 191


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 28.4 bits (63), Expect = 9.4
 Identities = 22/124 (17%), Positives = 55/124 (44%), Gaps = 1/124 (0%)

Query: 98  SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
            E+      +    +D+    +++   ++KK +   +   + + + KE+KE+ +  + + 
Sbjct: 370 QEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDT 429

Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEK 216
                 EK E+   + +   E+  E+  ++E+   E+ +  K + + LK+E    + K  
Sbjct: 430 NTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLS 489

Query: 217 ERSS 220
           ER  
Sbjct: 490 EREL 493


>gnl|CDD|218986 pfam06317, Arena_RNA_pol, Arenavirus RNA polymerase.  This family
           consists of several Arenavirus RNA polymerase proteins
           (EC:2.7.7.48).
          Length = 2206

 Score = 28.4 bits (64), Expect = 9.6
 Identities = 36/109 (33%), Positives = 50/109 (45%), Gaps = 6/109 (5%)

Query: 122 KEKIKKKKDKKDKD-KI-KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK-MKEDKE 178
           K K K K D +D D KI KN        +K K  +EE    EE ++ +   F  +KE  E
Sbjct: 854 KTKGKYKLDPEDYDYKIQKNLSSLVSGSKKSKSNREELSLYEELDEDQSDYFDEIKESVE 913

Query: 179 KPVEKIRKEEK---DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
           K + K+RK  K    + K  KS +  ++L   K   +    E S HE E
Sbjct: 914 KTLSKMRKSRKAKSGNLKNTKSINDLERLWAPKGVMRLIRSETSLHEVE 962


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 27.8 bits (62), Expect = 9.6
 Identities = 17/85 (20%), Positives = 45/85 (52%), Gaps = 3/85 (3%)

Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEK---DKEKKEKEKGKKEEKEKKEEKEKKEEK 169
           D++     EK+ IK K  + D  +  ++++   D++K+E+E  + +  + + + E +++ 
Sbjct: 88  DEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDD 147

Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKE 194
           E      ++  V++   +++D E E
Sbjct: 148 EIVEILIEDDEVDEDEDDDEDEEDE 172


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 28.1 bits (63), Expect = 9.7
 Identities = 12/44 (27%), Positives = 26/44 (59%)

Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
           K  E+  K+ +K+++ + ++ EK  E+    K  ++E EK++ K
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191


>gnl|CDD|138835 PRK12279, PRK12279, 50S ribosomal protein L22/unknown domain fusion
           protein; Provisional.
          Length = 311

 Score = 28.1 bits (62), Expect = 9.9
 Identities = 21/82 (25%), Positives = 41/82 (50%), Gaps = 4/82 (4%)

Query: 93  VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKI----KKKKDKKDKDKIKNKEKDKEKKE 148
           +++  S+D+ E +K+ +  K KK+ K    E       KK  K  K ++K  EKD+    
Sbjct: 104 LEIVLSDDVNEREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDP 163

Query: 149 KEKGKKEEKEKKEEKEKKEEKE 170
           +   ++++  K  EK   +++E
Sbjct: 164 ELLKREQQVLKVVEKTASQKEE 185


>gnl|CDD|224031 COG1106, COG1106, Predicted ATPases [General function prediction
           only].
          Length = 371

 Score = 28.3 bits (63), Expect = 9.9
 Identities = 22/102 (21%), Positives = 39/102 (38%), Gaps = 2/102 (1%)

Query: 120 KEKEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
           + +  I      K++ +     E+DKE +   K  +  K   + + K E   F  K++  
Sbjct: 60  EREINIWNGSYGKEESNLDIFFEQDKEVEIISKNTENNKFNFKLQIKFEINGFPSKDESR 119

Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
           K V KI   +K        K  ++   +  L   K  KE  +
Sbjct: 120 K-VNKIEITKKSKNINDFLKIFNEYFVETLLIIAKYPKEPPN 160


>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family.  This protein
           is plasmid encoded and found to be essential for plasmid
           replication.
          Length = 279

 Score = 28.1 bits (63), Expect = 10.0
 Identities = 20/98 (20%), Positives = 46/98 (46%), Gaps = 2/98 (2%)

Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
            + +K++ E   KK  KK    +     +  ++ KE   +     + E++   +K  + +
Sbjct: 169 NSARKQQLEWENKKLKKKGLIPLT--LDEARRRAKEFHIERAFSYRTERKAFGKKRRRAR 226

Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
           +  +   + IRK+  ++  ++ SK +  K+  + LKKK
Sbjct: 227 KLAKLDEKDIRKKILNALVKEYSKGELTKMGLKGLKKK 264


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.129    0.377 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,014,001
Number of extensions: 2187259
Number of successful extensions: 26481
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13147
Number of HSP's successfully gapped: 3082
Length of query: 446
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 346
Effective length of database: 6,502,202
Effective search space: 2249761892
Effective search space used: 2249761892
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 61 (27.1 bits)