RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy128
(446 letters)
>gnl|CDD|192483 pfam10203, Pet191_N, Cytochrome c oxidase assembly protein PET191.
Pet191_N is the conserved N-terminal of a family of
conserved proteins found from nematodes to humans. It
carries six highly conserved cysteine residues. Pet191
is required for the assembly of active cytochrome c
oxidase but does not form part of the final assembled
complex.
Length = 68
Score = 78.8 bits (195), Expect = 2e-18
Identities = 32/63 (50%), Positives = 44/63 (69%), Gaps = 3/63 (4%)
Query: 363 DKSACAGSRADLKLCLLETDCVQVDRKTPRQCLREGLAP---QCEALKNTFFECKRSLLD 419
S+C G R DL CLLE+DCV+ +++TPR+CL+E L +C AL+ FFECKR +LD
Sbjct: 1 MGSSCKGLREDLAECLLESDCVKKEKRTPRECLKEPLKELPEECIALRKAFFECKRGMLD 60
Query: 420 TRQ 422
R+
Sbjct: 61 MRK 63
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 63.2 bits (153), Expect = 2e-10
Identities = 42/122 (34%), Positives = 66/122 (54%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E KK KK + KKK E K KK D+ K K ++K+K ++ K K EE +K +E
Sbjct: 1389 EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADE 1448
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+KK E+ K +E K+K E + +E + E+ K + K K E+ KKK E ++++
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEA 1508
Query: 223 KE 224
K+
Sbjct: 1509 KK 1510
Score = 60.9 bits (147), Expect = 9e-10
Identities = 35/127 (27%), Positives = 66/127 (51%), Gaps = 5/127 (3%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK----NKEKDKEKKEKEKGKKEE-K 157
E K K+ KK + K+ + K + + D+ + E ++KKE+ K K + K
Sbjct: 1325 EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+K EEK+K +E + K +EDK+K E + + ++ K ++K K ++ KKK +E +
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444
Query: 218 RSSHEKE 224
++ K+
Sbjct: 1445 KADEAKK 1451
Score = 59.0 bits (142), Expect = 4e-09
Identities = 36/122 (29%), Positives = 64/122 (52%), Gaps = 2/122 (1%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKK--DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
E KK K + E E ++K +K K++ K K +KK +EK K +E +KK
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
E++KK+ E K +K ++ +K+ ++ +K ++K K ++ KK KKK E+ + +
Sbjct: 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKA 1459
Query: 221 HE 222
E
Sbjct: 1460 EE 1461
Score = 57.1 bits (137), Expect = 1e-08
Identities = 53/186 (28%), Positives = 98/186 (52%), Gaps = 8/186 (4%)
Query: 71 KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK-KKEKEKIKKKK 129
K+ E +K+K EL V+ ++ E KK + +KK ++ +K K +E K ++
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672
Query: 130 DKKDKDKIKNKEKDKEK-------KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
DKK ++ K E+D++K + +E K EE +KKE +EKK+ +E K E++ K
Sbjct: 1673 DKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKA 1732
Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 242
+ K+E + +K+K + K + +K+K+ KKE+E+ + E + + D E++ +
Sbjct: 1733 EEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792
Query: 243 RESSPK 248
R K
Sbjct: 1793 RMEVDK 1798
Score = 56.7 bits (136), Expect = 2e-08
Identities = 48/181 (26%), Positives = 87/181 (48%), Gaps = 1/181 (0%)
Query: 44 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
+ + K + + K+K +K+K +K+ + K + + K +
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAA 1415
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
KK K K KKK E KK ++ K D+ K K ++ +K E+ K K EE +K +E
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
+KK E+ K E K+K E+ +K+ +++K ++K K + KK + KK E +++ K
Sbjct: 1476 KKKAEEAKKADEAKKK-AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAK 1534
Query: 224 E 224
+
Sbjct: 1535 K 1535
Score = 55.5 bits (133), Expect = 4e-08
Identities = 44/185 (23%), Positives = 94/185 (50%), Gaps = 4/185 (2%)
Query: 44 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
++ K + + K K K+K + +K + ++K+ + K + K +++ +
Sbjct: 1362 AEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD 1421
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
K K+ +K K KK+ E+ KK + K K + K ++ +KK +E K +E +KK E+
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK----KKKEKERS 219
KK ++ K E+ +K ++ +K + +K ++K ++ K ++ KK KK ++ +
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
Query: 220 SHEKE 224
+ EK+
Sbjct: 1542 AEEKK 1546
Score = 55.5 bits (133), Expect = 4e-08
Identities = 35/115 (30%), Positives = 64/115 (55%), Gaps = 4/115 (3%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
KK +KK+ ++ KK ++KK D+ K K ++ +K ++ K K EE +KK + KK+ +E K
Sbjct: 1284 KKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343
Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDK----KLKKEKLKKKKKEKERSSHEKE 224
+ K + +E ++ +EK + K K K + KKK +EK+++ K+
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398
Score = 54.0 bits (129), Expect = 1e-07
Identities = 45/208 (21%), Positives = 93/208 (44%), Gaps = 8/208 (3%)
Query: 49 KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 108
K K + + K + +E+ + + +++ + + + +++L + ++ K
Sbjct: 1231 KAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK 1290
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
+ KK +KK+ ++ KKK ++ K K+ ++ KK+ + KK+ +E K+ E +
Sbjct: 1291 KADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK-----LKKEKLKKKKKEKERSSHE- 222
+ ++ E EK EK E+ KK D KK K ++ KKK +E ++ + E
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410
Query: 223 --KEVIPKLTFKFGTDMEEKTKRESSPK 248
K + EEK K + + K
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKK 1438
Score = 52.5 bits (125), Expect = 4e-07
Identities = 44/181 (24%), Positives = 79/181 (43%), Gaps = 2/181 (1%)
Query: 45 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 104
K ++ D+ K + K+ K+ +K K E K ++ E
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA--KKKAEEKKKADEAKKKAEE 1442
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K K K KK +E KK ++ K D+ K K ++ +K ++ K K EE +KK ++
Sbjct: 1443 AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
KK + K ++ +K E + +E +E K D+ KK +++K + K+ E +E
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562
Query: 225 V 225
Sbjct: 1563 K 1563
Score = 51.7 bits (123), Expect = 7e-07
Identities = 44/184 (23%), Positives = 89/184 (48%), Gaps = 8/184 (4%)
Query: 49 KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 108
K K +K +K +K K K EL + + +E+ + ++DK
Sbjct: 1517 KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
++ K KK +E ++ K +++ K K ++ +K E+ K K EE +K EE++KK E
Sbjct: 1577 NMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVE 1636
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSE--------KEKKSKDKDKKLKKEKLKKKKKEKERSS 220
+ K + +++K E+++K E++++ K ++ K K ++ KK + +KK +
Sbjct: 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
Query: 221 HEKE 224
+E
Sbjct: 1697 EAEE 1700
Score = 49.8 bits (118), Expect = 3e-06
Identities = 50/191 (26%), Positives = 94/191 (49%), Gaps = 1/191 (0%)
Query: 59 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 118
+EK ++E +E + ++ E I +V + + K + KK ++
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK-EDK 177
K E+ K +++ KK + K + ++K+K E+ K +EE + K +E K+ +E K K E+
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEA 1680
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDM 237
+K E +K + +KE + K ++LKK++ ++KKK +E E+E K
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740
Query: 238 EEKTKRESSPK 248
E+K K E + K
Sbjct: 1741 EDKKKAEEAKK 1751
Score = 49.4 bits (117), Expect = 4e-06
Identities = 40/121 (33%), Positives = 65/121 (53%), Gaps = 5/121 (4%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E KK + KK + KK ++ K K ++ KK + K E+ K+K ++ K E K+K +E
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADE 1514
Query: 163 KEKKEEK----EFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKEKE 217
+K EE E K E+ +K E + EEK E KK+++ K +K+K ++ KK +E
Sbjct: 1515 AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574
Query: 218 R 218
Sbjct: 1575 D 1575
Score = 48.6 bits (115), Expect = 7e-06
Identities = 49/154 (31%), Positives = 78/154 (50%), Gaps = 7/154 (4%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
K K+ + K KK+ E+ KKK D+ K K+ D+ KK +E K +E +K EE +K
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK 1535
Query: 166 KEE---KEFKMKEDKEKPVEKIRK-EEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
+E E K K D+ K E+++K EEK +E K ++DK + K ++ KK +E
Sbjct: 1536 ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
Query: 222 EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 255
E + KL + E+ K+ KI + +K
Sbjct: 1596 E---VMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626
Score = 45.9 bits (108), Expect = 4e-05
Identities = 41/131 (31%), Positives = 78/131 (59%), Gaps = 6/131 (4%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
+++ + ++ K + KK + K+ ++ KK ++KK D++K E+ KK +EK K EE
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE--LKKAEEKKKAEEA 1569
Query: 158 EKKEEKE----KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
+K EE + +K E+ K +E + + V K+ +EEK + E+ K ++ K+K E+LKK +
Sbjct: 1570 KKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAE 1629
Query: 214 KEKERSSHEKE 224
+EK++ K+
Sbjct: 1630 EEKKKVEQLKK 1640
Score = 45.5 bits (107), Expect = 6e-05
Identities = 38/109 (34%), Positives = 66/109 (60%), Gaps = 5/109 (4%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K+ + KK ++ KK+ +EK K ++ KK +++ K K ++ +K+ +E KK E+ KK+E+E
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
Query: 165 KKEEKEFKMKEDKE-----KPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
KK+ K +E+K+ K E + +EE D E EK+ + DKK+K
Sbjct: 1756 KKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIF 1804
Score = 43.2 bits (101), Expect = 3e-04
Identities = 39/144 (27%), Positives = 75/144 (52%), Gaps = 3/144 (2%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK--- 159
+ KK + KK ++ KK + K+ ++ K +++K KE +++KK +E K EE+ K
Sbjct: 1673 DKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKA 1732
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
+E K++ EE + K +E K+ EK + E+EKK+++ K+ + ++ +E E+
Sbjct: 1733 EEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792
Query: 220 SHEKEVIPKLTFKFGTDMEEKTKR 243
E + K F ++ E K
Sbjct: 1793 RMEVDKKIKDIFDNFANIIEGGKE 1816
Score = 42.8 bits (100), Expect = 3e-04
Identities = 43/182 (23%), Positives = 84/182 (46%), Gaps = 17/182 (9%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI------KNKEKDKEKKEKEKGKKEE 156
+KK + KKD + KK E+E+ ++ K ++ ++ + K +E ++ ++ KK E
Sbjct: 1228 AVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAE 1287
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE--KDSEKEKKSKDKDKKLKKEKLK---- 210
++KK ++ KK E++ K E K+K E + +E K +E+ KK D KK +E K
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEA 1347
Query: 211 -----KKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAA 265
+ ++ ++ EK + + + K+++ K K + A
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407
Query: 266 DE 267
DE
Sbjct: 1408 DE 1409
Score = 38.6 bits (89), Expect = 0.007
Identities = 41/159 (25%), Positives = 74/159 (46%), Gaps = 15/159 (9%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKG----- 152
E+ + ++ + + KK K+ E+ KK ++ K ++ +N E+ ++ +E
Sbjct: 1209 EEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
Query: 153 ----KKEEKEKKEEKEKKEEK----EFKMKEDKEKPVE-KIRKEEKDSEKEKKSKDKDKK 203
K EE K +E +K EEK E K E+K+K E K + EE E K K ++ K
Sbjct: 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328
Query: 204 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 242
K + KKK +E ++++ + + EEK +
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
Score = 37.4 bits (86), Expect = 0.018
Identities = 32/132 (24%), Positives = 75/132 (56%), Gaps = 11/132 (8%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKE 161
E++K + +K + K +E+ K ++ + +D K + +K +E KK+ E+ KK E+E+
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN 1251
Query: 162 EKEKKEE---------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
E+ +K E ++ +K ++ + ++++K E+ + ++ K ++KK K ++ KKK
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKK-KADEAKKK 1310
Query: 213 KKEKERSSHEKE 224
+E +++ K+
Sbjct: 1311 AEEAKKADEAKK 1322
Score = 35.9 bits (82), Expect = 0.051
Identities = 33/148 (22%), Positives = 74/148 (50%), Gaps = 2/148 (1%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E +K + +K ++ K ++ ++++ + +D K + K ++ K+ E +K E+ +K E
Sbjct: 1132 EARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAE 1191
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD--KDKKLKKEKLKKKKKEKERSS 220
+ +K E K + ++ E+ +E + +E KK++ K ++ KK+ + KK E+ER++
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN 1251
Query: 221 HEKEVIPKLTFKFGTDMEEKTKRESSPK 248
E + + K E + K
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARK 1279
Score = 35.5 bits (81), Expect = 0.063
Identities = 41/198 (20%), Positives = 91/198 (45%), Gaps = 13/198 (6%)
Query: 55 KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 114
K + +++ + E + +E K ++++ ++ +E+ +IK +++ K+ +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
+ KK E+ K +++ KK K +EK E+ KEK E+E EE EK+ + K
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKI 1800
Query: 175 ED-------------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
+D + V KE +DS ++ + K+ +L++ +K K + + +
Sbjct: 1801 KDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNEN 1860
Query: 222 EKEVIPKLTFKFGTDMEE 239
++ + F D++E
Sbjct: 1861 GEDGNKEADFNKEKDLKE 1878
Score = 33.2 bits (75), Expect = 0.36
Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 4/113 (3%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE- 178
K+ + K+ ++ D+ + K +E K+ E GK EE K EE +KK E K +E ++
Sbjct: 1077 KDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKA 1136
Query: 179 ---KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
+ E+ RK E E K +D + +E K + +K ++ + E + K
Sbjct: 1137 EDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRK 1189
Score = 33.2 bits (75), Expect = 0.39
Identities = 36/132 (27%), Positives = 69/132 (52%), Gaps = 10/132 (7%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
+ KK ++ +K ++ K +E K + + + K + + K +E ++ E KK E KK E
Sbjct: 1174 DAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAE 1233
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEE---------KDSEKEKKSKDKDKKLKKEKLKKKK 213
+ KK+ +E K K ++E+ E+IRK E + + + + K +LKK + KKK
Sbjct: 1234 EAKKDAEEAK-KAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA 1292
Query: 214 KEKERSSHEKEV 225
E +++ +K+
Sbjct: 1293 DEAKKAEEKKKA 1304
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 60.3 bits (147), Expect = 7e-10
Identities = 24/76 (31%), Positives = 46/76 (60%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
KK KIK + EKK +E+ K+++K+ K+K+EE+E + ++ +E+ E+ + E
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 189 KDSEKEKKSKDKDKKL 204
++ E+E++ K K L
Sbjct: 464 EEKEEEEEKKKKQATL 479
Score = 59.2 bits (144), Expect = 2e-09
Identities = 21/79 (26%), Positives = 53/79 (67%)
Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
+++ + + +K +K +K K E+K ++E+KEKK++ K+++E+ EK +KEE+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 190 DSEKEKKSKDKDKKLKKEK 208
E+E++++++ ++ +++K
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473
Score = 58.8 bits (143), Expect = 3e-09
Identities = 27/81 (33%), Positives = 52/81 (64%)
Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
+++I+ K+ +K K E+ EKK E+EKKE+K+ K++ E+ KE+K+ E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 193 KEKKSKDKDKKLKKEKLKKKK 213
KE++ ++ +++ ++E+ KKKK
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKK 475
Score = 58.0 bits (141), Expect = 4e-09
Identities = 33/85 (38%), Positives = 55/85 (64%), Gaps = 10/85 (11%)
Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
T KK +KIKK +K +K ++++EKKEK+K K+K+EE+E+++EK KE
Sbjct: 403 TGSKKATKKIKKIVEKAEK------KREEEKKEKKKKAFAGKKKEEEEEEEKEK----KE 452
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDK 200
++++ E+ +EEK+ E+EKK K
Sbjct: 453 EEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 53.4 bits (129), Expect = 1e-07
Identities = 27/66 (40%), Positives = 48/66 (72%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K K IKK + +KK +E+ K+KK K K K +E+++EK++KE+ K+EE+E+ EE++
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Query: 165 KKEEKE 170
++EE++
Sbjct: 467 EEEEEK 472
Score = 52.6 bits (127), Expect = 2e-07
Identities = 29/81 (35%), Positives = 53/81 (65%), Gaps = 1/81 (1%)
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
EI+ KK K KK EK +KK++++ K+K K K+K+E+E+ +KE+KE+
Sbjct: 395 TEEEIEFLTGSKKATKK-IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Query: 160 KEEKEKKEEKEFKMKEDKEKP 180
++E+E++E +E K +E+++K
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKK 474
Score = 51.5 bits (124), Expect = 5e-07
Identities = 24/83 (28%), Positives = 54/83 (65%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+ E++ + K K K +K ++K +++K +K K K+K++E++E+++ K+EE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 157 KEKKEEKEKKEEKEFKMKEDKEK 179
KE++EE+ ++E++E + K+ K+
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKKQA 477
Score = 46.1 bits (110), Expect = 3e-05
Identities = 22/75 (29%), Positives = 44/75 (58%)
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
+EE E +K +K K+ E EK E+ +KE+K K K+++++ +KEK +++K
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 214 KEKERSSHEKEVIPK 228
+E+E + E++ +
Sbjct: 456 EEEEEEAEEEKEEEE 470
Score = 44.9 bits (107), Expect = 7e-05
Identities = 23/80 (28%), Positives = 52/80 (65%)
Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
++E + KK K+ K+ EK E+K + K++K+K +K+E++ E+EK+ K+++K
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 203 KLKKEKLKKKKKEKERSSHE 222
+ ++E+ +++K+E+E +
Sbjct: 456 EEEEEEAEEEKEEEEEKKKK 475
Score = 38.4 bits (90), Expect = 0.007
Identities = 23/82 (28%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
E+E + +K +KI+K + +EK+++ + K+KK K KKK++E+E +KE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE-E 453
Query: 227 PKLTFKFGTDMEEKTKRESSPK 248
K + + E++ + E K
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKKK 475
Score = 34.9 bits (81), Expect = 0.076
Identities = 14/79 (17%), Positives = 40/79 (50%)
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
++E + + K ++KI ++ + +E+K + K K +K +++++E++ E+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 224 EVIPKLTFKFGTDMEEKTK 242
+ + + + EE+ K
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473
Score = 31.8 bits (73), Expect = 0.79
Identities = 17/108 (15%), Positives = 40/108 (37%), Gaps = 11/108 (10%)
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKK----SKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
+ + + ++ + +E+E + SK KK+KK K +KK +E +K+
Sbjct: 377 IFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFA 436
Query: 229 LTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV 276
+++ + E + + + +E + KQ
Sbjct: 437 -------GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 31.0 bits (71), Expect = 1.4
Identities = 20/107 (18%), Positives = 42/107 (39%), Gaps = 35/107 (32%)
Query: 47 KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 106
K + K EK K + KEK+ +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKK------------------------ 439
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
K++E+E+ K+KK+++ +++ + E++KE++E++K K
Sbjct: 440 -----------KEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 60.0 bits (146), Expect = 1e-09
Identities = 29/187 (15%), Positives = 61/187 (32%)
Query: 40 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 99
T +++ K K + S T + KE + K V + +
Sbjct: 6 TKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKD 65
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
+ D KK K K + KKK K + D K EK + + +
Sbjct: 66 TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDID 125
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
+ ++ + + + ++ +E D E + D+ +K++ K+ +K +
Sbjct: 126 VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDD 185
Query: 220 SHEKEVI 226
+
Sbjct: 186 DFVWDED 192
Score = 51.1 bits (123), Expect = 7e-07
Identities = 26/171 (15%), Positives = 55/171 (32%), Gaps = 3/171 (1%)
Query: 55 KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 114
K KE + SK+KT + + + T K
Sbjct: 35 KEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKK 94
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
K + + K +KK+ DKD N KD + + ++ + + + + + +
Sbjct: 95 KLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDE 154
Query: 175 EDKEKPVEKIRKEEKDS---EKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+D E + +E + KE + D ++ + + R +
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAK 205
Score = 48.5 bits (116), Expect = 5e-06
Identities = 26/142 (18%), Positives = 52/142 (36%), Gaps = 8/142 (5%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K+ ++ KKK K+ K K K K++ K+ E KK ++ +
Sbjct: 4 ASTKAELAAEEEAKKKLKKLAAKSKSK------GFITKEEIKEALESKKKTPEQIDQVLI 57
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ E + K K+ K + K +K KK K++L KK +++++ +K+
Sbjct: 58 FLSGMVKDTDDATESDIPK--KKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKD 115
Query: 225 VIPKLTFKFGTDMEEKTKRESS 246
+ +
Sbjct: 116 DDLNYVKDIDVLNQADDDDDDD 137
Score = 44.2 bits (105), Expect = 1e-04
Identities = 22/132 (16%), Positives = 44/132 (33%), Gaps = 1/132 (0%)
Query: 86 TPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 145
T E ++KK + K K K+E ++ + K K + +I
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPE-QIDQVLIFLS 60
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
K+ E + ++K K K K +K ++ K +EK+ D
Sbjct: 61 GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNY 120
Query: 206 KEKLKKKKKEKE 217
+ + + +
Sbjct: 121 VKDIDVLNQADD 132
Score = 39.2 bits (92), Expect = 0.004
Identities = 26/147 (17%), Positives = 52/147 (35%), Gaps = 5/147 (3%)
Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
K + +++ K K K K K KG ++E KE E K++ ++ + +
Sbjct: 5 STKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIF----LS 60
Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
KD++ +S KK K K + + E+ + + +K +
Sbjct: 61 GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE-KKNALDKDDDLN 119
Query: 246 SPKIVIKPVKSPSPPPVAAADEYDTGD 272
K + ++ D+ D D
Sbjct: 120 YVKDIDVLNQADDDDDDDDDDDLDDDD 146
Score = 38.8 bits (91), Expect = 0.005
Identities = 23/146 (15%), Positives = 46/146 (31%), Gaps = 4/146 (2%)
Query: 47 KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 106
KD + PK+K + K + + K EL + +D K
Sbjct: 64 KDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSK-KAEKKNALDKDDDLNYVK 122
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDK---IKNKEKDKEKKEKEKGKKEEKEKKEEK 163
D + + + D D D + D + ++++ KKE KE ++
Sbjct: 123 DIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEK 189
+ + + + + K K
Sbjct: 183 DDDDFVWDEDDSEALRQARKDAKLTA 208
>gnl|CDD|214584 smart00249, PHD, PHD zinc finger. The plant homeodomain (PHD)
finger is a C4HC3 zinc-finger-like motif found in
nuclear proteins thought to be involved in epigenetics
and chromatin-mediated transcriptional regulation. The
PHD finger binds two zinc ions using the so-called
'cross-brace' motif and is thus structurally related to
the RING finger and the FYVE finger. It is not yet known
if PHD fingers have a common molecular function. Several
reports suggest that it can function as a
protein-protein interacton domain and it was recently
demonstrated that the PHD finger of p300 can cooperate
with the adjacent BROMO domain in nucleosome binding in
vitro. Other reports suggesting that the PHD finger is a
ubiquitin ligase have been refuted as these domains were
RING fingers misidentified as PHD fingers.
Length = 47
Score = 53.4 bits (128), Expect = 1e-09
Identities = 23/48 (47%), Positives = 28/48 (58%), Gaps = 3/48 (6%)
Query: 278 ICPACGVQDDGSLPMIGCDGCDAWYHWVCVG--LVAEPETSDWFCPKC 323
C CG DDG ++ CDGCD WYH C+G L+ E W+CPKC
Sbjct: 1 YCSVCGKPDDGGE-LLQCDGCDRWYHQTCLGPPLLEEEPDGKWYCPKC 47
>gnl|CDD|201356 pfam00628, PHD, PHD-finger. PHD folds into an interleaved type of
Zn-finger chelating 2 Zn ions in a similar manner to
that of the RING and FYVE domains. Several PHD fingers
have been identified as binding modules of methylated
histone H3.
Length = 51
Score = 52.9 bits (127), Expect = 2e-09
Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 278 ICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPET---SDWFCPKCS 324
C CG DD ++ CDGCD W+H C+G EPE +W+CP+C
Sbjct: 1 YCAVCGKVDDDGE-LLLCDGCDRWFHLACLGPPLEPEEIPEGEWYCPECK 49
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 54.5 bits (131), Expect = 5e-08
Identities = 39/129 (30%), Positives = 68/129 (52%), Gaps = 3/129 (2%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
VK K+ K KE+EK K++ ++ K K KEK KE+ + K
Sbjct: 76 AVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKK---KEKPKEEPKDRK 132
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
K+E KEK+ KEK++EKE K++E +++ EK R+ + + KK K KK++ +
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPE 192
Query: 212 KKKEKERSS 220
++K+++ +
Sbjct: 193 EEKQRQAAR 201
Score = 49.1 bits (117), Expect = 3e-06
Identities = 48/190 (25%), Positives = 88/190 (46%)
Query: 78 VKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI 137
VK VE G + K K+++ K+ K KKK+KEK K++ + +
Sbjct: 77 VKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEE 136
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+++ ++KEKEK KK E+ + E+EKK E+ K+ P +K ++K+ +E+K
Sbjct: 137 AKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196
Query: 198 KDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSP 257
+ ++ K K ++ +ER E + + T + +E + + +K P
Sbjct: 197 RQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKP 256
Query: 258 SPPPVAAADE 267
P P A+ E
Sbjct: 257 DPSPSMASPE 266
Score = 47.2 bits (112), Expect = 1e-05
Identities = 42/155 (27%), Positives = 77/155 (49%), Gaps = 3/155 (1%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
K ++ K+ +K K K ++ + E GK+EEKEK++ KE+K++K+ K KE+
Sbjct: 69 SKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEP 128
Query: 178 EKPVEKIRKEEKDSEKEK---KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFG 234
+ K +EK KEK K K ++ +E+ KK+++ + +S +K K K
Sbjct: 129 KDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKK 188
Query: 235 TDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 269
EE+ +R+++ + V + P +E D
Sbjct: 189 EPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDD 223
Score = 46.8 bits (111), Expect = 2e-05
Identities = 48/231 (20%), Positives = 95/231 (41%), Gaps = 20/231 (8%)
Query: 35 VHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVK 94
V G+ P+ S KE+ KE+ ++K+ K
Sbjct: 80 VEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKK-------------------K 120
Query: 95 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
++ + +K K K+K+ K+KEKEK KK ++ +D+++ K +E+ + K +K K
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
++ K+++ +EEK+ + + K + ++ EKE+ KD++ +++ +
Sbjct: 181 KKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEED-DGKDRETTTSPMEEDES 239
Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAA 265
+ + M RESS + +P S PP A
Sbjct: 240 RQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRTSLRPPSARPA 290
Score = 40.3 bits (94), Expect = 0.002
Identities = 34/97 (35%), Positives = 55/97 (56%), Gaps = 2/97 (2%)
Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
+ K + K K K K K KE K + KE+++EKE ++KE+K+K EK +
Sbjct: 68 ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPK-NESGKEEEKEKE-QVKEEKKKKKEKPK 125
Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+E KD + ++++K+K +KEK K+KK E+ R E
Sbjct: 126 EEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREE 162
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 52.1 bits (126), Expect = 3e-07
Identities = 33/130 (25%), Positives = 73/130 (56%), Gaps = 11/130 (8%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE---KDKEKKEKEKGKKEEKEKKEEKE 164
K I + K ++E ++I ++ K+ + K K++ K + + +KE +E++ E +
Sbjct: 26 KKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQ 85
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK- 223
K E++ + +E+ ++ +E + K E++ EK++K ++ K+++L+KK++E E E+
Sbjct: 86 KLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQ----KQQELEKKEEELEELIEEQL 141
Query: 224 ---EVIPKLT 230
E I LT
Sbjct: 142 QELERISGLT 151
Score = 43.2 bits (103), Expect = 2e-04
Identities = 41/151 (27%), Positives = 83/151 (54%), Gaps = 23/151 (15%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKK--KDKKDK-DKIKNKEK---DKEKKEKEKGKKEEKE 158
K+ ++IKK+ K+E K++ + K+ +++ ++++ EK KE+ K + EK
Sbjct: 49 KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108
Query: 159 KKEEKEKKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK--KK 214
++E ++K++E E K + E KE+ +E++ +E+ E E+ S L E+ K+ +K
Sbjct: 109 EEELEKKEKELEQKQQELEKKEEELEELIEEQLQ-ELERIS-----GLTAEEAKEILLEK 162
Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
+E + HE V+ K ++EE+ K E+
Sbjct: 163 VEEEARHEAAVLIK-------EIEEEAKEEA 186
Score = 36.3 bits (85), Expect = 0.027
Identities = 29/122 (23%), Positives = 68/122 (55%), Gaps = 19/122 (15%)
Query: 103 EIKKDKSIKKDKKTHKKK----EKEKIKKKKD--KKDKDKIKNKEKDKEKKEKE-KGKKE 155
E +K+ ++++ +K ++E + +K + +K +++++ KEK+ E+K++E + K+E
Sbjct: 72 EFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEE 131
Query: 156 EKEKKEEKEKK-----------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
E E+ E++ + E KE +++ +E+ + K+ E+E K ++ DKK
Sbjct: 132 ELEELIEEQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEIEEEAK-EEADKKA 190
Query: 205 KK 206
K+
Sbjct: 191 KE 192
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 52.0 bits (125), Expect = 4e-07
Identities = 20/133 (15%), Positives = 65/133 (48%), Gaps = 3/133 (2%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
+L E++++ +++ K+E E+++ + +++++ E + E+ E+ + +
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEI-SLLRERLEELENELEELEERLEELK 329
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK--KEKLKKKK 213
EK + ++E +E + + ++ + KEE + + ++ ++ + +E+L + +
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELE 389
Query: 214 KEKERSSHEKEVI 226
E +E E +
Sbjct: 390 AELAEIRNELEEL 402
Score = 45.9 bits (109), Expect = 3e-05
Identities = 24/129 (18%), Positives = 68/129 (52%), Gaps = 4/129 (3%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+ + ++ K +KE E+++++ + +++ + +E+ +E +++ + K E E+ E+
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283
Query: 165 KKEEKEF----KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
++ ++E + E+ E + +R+ ++ E E + ++ + KEK++ K+E E
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERE 343
Query: 221 HEKEVIPKL 229
E + +L
Sbjct: 344 TLLEELEQL 352
Score = 44.3 bits (105), Expect = 1e-04
Identities = 25/127 (19%), Positives = 68/127 (53%), Gaps = 5/127 (3%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E+L E++++ + + K E E+++++ ++ ++ ++ KE+ E+ E E E+
Sbjct: 253 EELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEE-IEELEGEISLLRERL 311
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
++ E E +E +E +E + +E+ E+E ++ ++ L + + K++ E++
Sbjct: 312 EELENELEELEE----RLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367
Query: 219 SSHEKEV 225
S+ +E+
Sbjct: 368 SALLEEL 374
Score = 44.3 bits (105), Expect = 1e-04
Identities = 29/136 (21%), Positives = 67/136 (49%), Gaps = 6/136 (4%)
Query: 114 KKTHKKKEK--EKIKKKKDKKDK--DKIKNKEKDKEKKEKEKGKKEE-KEKKEEKEKKEE 168
K ++KE+ K+++ ++ ++ D ++ EK EK E++ K E +E K E + E
Sbjct: 168 SKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL 227
Query: 169 KEFKMK-EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
K ++ K +E++ +E E+E + ++ + ++++++ K E E E E +
Sbjct: 228 ALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQ 287
Query: 228 KLTFKFGTDMEEKTKR 243
+ + ++EE
Sbjct: 288 EELLELKEEIEELEGE 303
Score = 41.6 bits (98), Expect = 8e-04
Identities = 18/128 (14%), Positives = 65/128 (50%), Gaps = 1/128 (0%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
E+L E+K++ + + + +E+ + + + ++ +++++ ++ E ++E ++E
Sbjct: 287 QEELLELKEEIE-ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
++ E+ E +E K + +++ EE ++ + + +L + + + ++ ++E
Sbjct: 346 LEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKRE 405
Query: 218 RSSHEKEV 225
S E+ +
Sbjct: 406 IESLEERL 413
Score = 40.1 bits (94), Expect = 0.003
Identities = 30/142 (21%), Positives = 69/142 (48%), Gaps = 20/142 (14%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
GEI + ++ + ++ +E++++ K+K + K + +E++ +E E+ E +E KE
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE 361
Query: 162 EKEK---------KEEKE----------FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
E E+ +E E ++ E + + +E++++E + E+ + +
Sbjct: 362 ELEEKLSALLEELEELFEALREELAELEAELAEIRNE-LEELKREIESLEERLERLSERL 420
Query: 203 KLKKEKLKKKKKEKERSSHEKE 224
+ KE+LK+ + E E E E
Sbjct: 421 EDLKEELKELEAELEELQTELE 442
Score = 40.1 bits (94), Expect = 0.003
Identities = 21/128 (16%), Positives = 55/128 (42%), Gaps = 6/128 (4%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK- 157
E+ + ++ + K+E E+ ++ ++ + ++ + E E + +
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNEL 399
Query: 158 -EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
E K E E EE+ E + +E +++E K+ E E + + + E+L++ +++
Sbjct: 400 EELKREIESLEER----LERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455
Query: 217 ERSSHEKE 224
E +
Sbjct: 456 EELRDRLK 463
Score = 38.9 bits (91), Expect = 0.006
Identities = 21/116 (18%), Positives = 62/116 (53%), Gaps = 2/116 (1%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-DKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
K++ + E+++++ ++ ++ + K + ++E E+ + +E +EE E+ EE+
Sbjct: 694 KNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL 753
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKERSSHEKEV 225
+++E E+ E++ E+ K K+ ++ + K + + + ++ E+E E+ +
Sbjct: 754 EELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRL 809
Score = 38.5 bits (90), Expect = 0.007
Identities = 23/130 (17%), Positives = 65/130 (50%), Gaps = 7/130 (5%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKK-KKDKKDKDKIKNKE--KDKEKKEKEKGKK 154
E+L ++ + + +++ EK ++ ++ K + +++ ++ KE +
Sbjct: 185 EENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEEL 244
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
EE+ + E+E +E +E + +KE +E+++ E + E ++ ++ ++L + K + ++
Sbjct: 245 EEELSRLEEELEELQEELEEAEKE--IEELKSELE--ELREELEELQEELLELKEEIEEL 300
Query: 215 EKERSSHEKE 224
E E S +
Sbjct: 301 EGEISLLRER 310
Score = 38.5 bits (90), Expect = 0.008
Identities = 25/132 (18%), Positives = 68/132 (51%), Gaps = 4/132 (3%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIK-KKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
+ L E+++ + + K+E ++ + + + + + +E ++ ++E E+ ++
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQER 759
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKP---VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+E +EE E EE K+KE+ E+ + +++E ++ E+E + ++ + +L+
Sbjct: 760 LEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESL 819
Query: 213 KKEKERSSHEKE 224
++ +ER E E
Sbjct: 820 EQRRERLEQEIE 831
Score = 37.4 bits (87), Expect = 0.015
Identities = 26/126 (20%), Positives = 69/126 (54%), Gaps = 3/126 (2%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
L E++++ +++ ++ E+++++ + ++ K KE+ +E +EK + +EE E
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ EE+ ++ E+ + E+ +E + + + E+E + +++ + +EKL + ++E E
Sbjct: 797 ELEEELEEAERR---LDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853
Query: 219 SSHEKE 224
E E
Sbjct: 854 LEKELE 859
Score = 37.4 bits (87), Expect = 0.016
Identities = 27/144 (18%), Positives = 62/144 (43%), Gaps = 18/144 (12%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK----------DKEKKE 148
L +++ +D +K+ EK++++ +K ++ + E ++
Sbjct: 179 RKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELR 238
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD--------K 200
KE + EE+ + E+E +E +E + +KE K EE E E+ ++ +
Sbjct: 239 KELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIE 298
Query: 201 DKKLKKEKLKKKKKEKERSSHEKE 224
+ + + L+++ +E E E E
Sbjct: 299 ELEGEISLLRERLEELENELEELE 322
Score = 37.0 bits (86), Expect = 0.023
Identities = 17/118 (14%), Positives = 57/118 (48%), Gaps = 4/118 (3%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKE----KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
+L E+K++ +++ + KE++K+ + + ++ + + +E ++E +E E+
Sbjct: 396 RNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
+E +++ +E E++ + + + EK + + E E+++ + + +
Sbjct: 456 EELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESG 513
Score = 35.5 bits (82), Expect = 0.065
Identities = 23/129 (17%), Positives = 67/129 (51%), Gaps = 2/129 (1%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK-DKKDKDKIKNKEKDKEKKEKEKGKKEE 156
E+L E++ +++ K E ++ + + + + ++ ++ K+E ++E
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEEL 732
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
++ + E+ EE+ +++E+ E+ E++ + E++ E +++ K K+ +E +K++ +
Sbjct: 733 EQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQ 792
Query: 217 E-RSSHEKE 224
E E+E
Sbjct: 793 EELEELEEE 801
Score = 35.1 bits (81), Expect = 0.088
Identities = 27/118 (22%), Positives = 67/118 (56%), Gaps = 1/118 (0%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
+S+++ ++ +++ +E ++ ++ ++K +E ++ +KE E+ K+E +E + EKE +
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE-EL 875
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
E E K E++++ +E+ +E + E K + + + + E+L+ K + E E E
Sbjct: 876 EDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEE 933
Score = 35.1 bits (81), Expect = 0.097
Identities = 18/117 (15%), Positives = 66/117 (56%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
E+++ + + + +++ ++ + + ++ + + +E +++ E E+ +E +++ E
Sbjct: 800 EELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELE 859
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
E +++ E+ KE+ E ++++ +E+++ E+E + + + KE+++K ++ E
Sbjct: 860 ELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916
Score = 35.1 bits (81), Expect = 0.099
Identities = 28/120 (23%), Positives = 71/120 (59%), Gaps = 4/120 (3%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE--KGKKEE 156
E L E++++ + + K+E E+++ +K++ +D++K E++KE+ E+E + + E
Sbjct: 842 EKLDELEEELEELEKELEELKEELEELEAEKEEL-EDELKELEEEKEELEEELRELESEL 900
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRK-EEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E KEE EK E+ +++ E+ ++ + EE+ E+ + + + + + + E+L+++ +
Sbjct: 901 AELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEA 960
Score = 33.9 bits (78), Expect = 0.22
Identities = 28/146 (19%), Positives = 80/146 (54%), Gaps = 20/146 (13%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-------------DKIKNKEKDK 144
E+L +++ + K++ +++++ ++++ ++ ++ ++++ E+ +
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRR 823
Query: 145 EKKEKEKGKKEEK------EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
E+ E+E + EE+ + E +E+ EE E +++E KE+ +E++ E+++ E E K
Sbjct: 824 ERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEE-LEELEAEKEELEDELKEL 882
Query: 199 DKDKKLKKEKLKKKKKEKERSSHEKE 224
+++K+ +E+L++ + E E E
Sbjct: 883 EEEKEELEEELRELESELAELKEEIE 908
Score = 32.0 bits (73), Expect = 0.71
Identities = 20/68 (29%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
G + KE+KEE E+K E+ E+ + +E + EE + + EK + +K + ++LK
Sbjct: 166 GVSKYKERKEEAERKLER----TEENLERLEDLL-EELEKQLEKLERQAEKAERYQELKA 220
Query: 212 KKKEKERS 219
+ +E E +
Sbjct: 221 ELRELELA 228
Score = 32.0 bits (73), Expect = 0.81
Identities = 23/97 (23%), Positives = 56/97 (57%), Gaps = 4/97 (4%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
EI++ + ++ + + +E++++ + + ++ K + +E + EK+E E KE +E+KEE
Sbjct: 829 EIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEE 888
Query: 163 KEKKEEKEF----KMKEDKEKPVEKIRKEEKDSEKEK 195
E++ + ++KE+ EK E++ + E E+ +
Sbjct: 889 LEEELRELESELAELKEEIEKLRERLEELEAKLERLE 925
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 51.1 bits (122), Expect = 8e-07
Identities = 33/146 (22%), Positives = 78/146 (53%), Gaps = 2/146 (1%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E+ + D +++ +E + ++++ + K +++ KE++ + ++ K+EEKE
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELE-KEEEILAQVLKENKEEEKE 281
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
KK ++E+ + + +E K + ++ R++ D EK K+S+ K+ K +++LKK+K+E E
Sbjct: 282 KKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESE-KELKKLEKELKKEKEEIEE 340
Query: 219 SSHEKEVIPKLTFKFGTDMEEKTKRE 244
E + + + E+ K +
Sbjct: 341 LEKELKELEIKREAEEEEEEQLEKLQ 366
Score = 48.8 bits (116), Expect = 4e-06
Identities = 32/130 (24%), Positives = 72/130 (55%), Gaps = 2/130 (1%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E K +K +++K K K +E+ + +++ K++ + E+++ E+E+ KEE+
Sbjct: 777 LAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEEL 836
Query: 159 KKEEKEKKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
++ E KEE++ + E+ E+ E+I KEE E K ++ +++ K++L+ K++++
Sbjct: 837 EELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKE 896
Query: 217 ERSSHEKEVI 226
+ E E
Sbjct: 897 KEEKKELEEE 906
Score = 47.3 bits (112), Expect = 1e-05
Identities = 22/135 (16%), Positives = 67/135 (49%), Gaps = 1/135 (0%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
+ ++K+ ++++ + K ++ ++ + + +E + K+E EK ++ +
Sbjct: 211 LEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD-KKLKKEKLKKKKKEKE 217
+E +++E+++ +E+ + ++ + + + K ++ K D +KLK+ + + KK EKE
Sbjct: 271 VLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKE 330
Query: 218 RSSHEKEVIPKLTFK 232
++E+
Sbjct: 331 LKKEKEEIEELEKEL 345
Score = 46.9 bits (111), Expect = 2e-05
Identities = 38/166 (22%), Positives = 81/166 (48%)
Query: 59 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 118
+E++ S ++ KE+E E + + E ++ KS + K
Sbjct: 251 QEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 310
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
++EK+K+ + + K + + K++ +E +E EK KE + K+E +E++EE+ K++E E
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLE 370
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ E++ ++K + S K K+ + E +++KE + E
Sbjct: 371 QLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSE 416
Score = 46.5 bits (110), Expect = 3e-05
Identities = 24/126 (19%), Positives = 60/126 (47%), Gaps = 4/126 (3%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK----IKNKEKDKEKKEKEKGKKEEKE 158
I + +K + K++ K+ ++ + K+ + + + K +E+ +E
Sbjct: 189 LIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLR 248
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
++E+ + ++E + +E+ V K KEE+ +K ++ + K ++E+LK + + ER
Sbjct: 249 DEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLER 308
Query: 219 SSHEKE 224
+ E
Sbjct: 309 RKVDDE 314
Score = 45.3 bits (107), Expect = 5e-05
Identities = 28/138 (20%), Positives = 72/138 (52%), Gaps = 4/138 (2%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
K+ E+ E + K +++++ + KEK ++++K + +EK+++ K +E
Sbjct: 742 LEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQE 801
Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD----SEKEKKSKDKDKKLKK 206
+ + +E+ +E+ + E+E + E +EK E+ +E +K +K +++ + +
Sbjct: 802 EELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLE 861
Query: 207 EKLKKKKKEKERSSHEKE 224
E++ K++ +E E+E
Sbjct: 862 EEITKEELLQELLLKEEE 879
Score = 44.2 bits (104), Expect = 1e-04
Identities = 35/179 (19%), Positives = 83/179 (46%), Gaps = 3/179 (1%)
Query: 71 KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK---K 127
K + + ++++ + + K+ ++ K + + K ++KIK +
Sbjct: 692 KNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEE 751
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
+++K K + +E+ E KEK EE+EK E+ + +EEKE K+K +E+ +
Sbjct: 752 EEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEEL 811
Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
++++E ++ + ++ +K K ++ ++ E+++ + EE TK E
Sbjct: 812 KEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELL 870
Score = 43.4 bits (102), Expect = 2e-04
Identities = 30/128 (23%), Positives = 58/128 (45%), Gaps = 8/128 (6%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
KK++ K ++T E I ++ K + K+K + K + + K +K E E++
Sbjct: 170 RKKKERLKKLIEETENLAELI-IDLEELKLQELKLKEQAKKALEYYQLK-EKLELEEENL 227
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK------EKKSKDKDKKLKKEKLKKKKKEK 216
K + + D + + + +EE +S K E+ K+ K+E+ +KK +E+
Sbjct: 228 LYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEE 287
Query: 217 ERSSHEKE 224
E KE
Sbjct: 288 ELKLLAKE 295
Score = 41.9 bits (98), Expect = 7e-04
Identities = 31/150 (20%), Positives = 75/150 (50%), Gaps = 4/150 (2%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKI-KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
E E K +K +++ + +++ ++ ++ K+++ + K KD+ + ++EK K+E+K
Sbjct: 842 ELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKK 901
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK---EKLKKKKK 214
E +EE +K E K E +E+ E+ K + ++ ++ K+ + +++++
Sbjct: 902 ELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEE 961
Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRE 244
+R KE + + + EEK +R
Sbjct: 962 RNKRLLLAKEELGNVNLMAIAEFEEKEERY 991
Score = 40.7 bits (95), Expect = 0.002
Identities = 32/152 (21%), Positives = 72/152 (47%), Gaps = 11/152 (7%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKE-------KEKGKKEE 156
K +K +KK+K+ ++ EKE + + + ++++ + EK +EK E +K + E
Sbjct: 326 KLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESE 385
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK---LKKKK 213
+ K K+EE E K +E+KE + E+++ +++ K++ K +++ + K+
Sbjct: 386 RLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQG 445
Query: 214 KEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
K E ++ KL + + +
Sbjct: 446 KLTEEKEELEKQALKLLKDKLELKKSEDLLKE 477
Score = 40.3 bits (94), Expect = 0.002
Identities = 31/152 (20%), Positives = 71/152 (46%), Gaps = 14/152 (9%)
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
K+K+KE++KK ++ + + ++ K ++ K K++ K+ E + K EK +E+
Sbjct: 168 EKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLK-EKLELEEEN 226
Query: 177 -------------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
+ E +R E+++ E K+ +K++++ + LK+ K+E++ ++
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQE 286
Query: 224 EVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 255
E + L + E K E + +K
Sbjct: 287 EELKLLAKEEEELKSELLKLERRKVDDEEKLK 318
Score = 40.0 bits (93), Expect = 0.003
Identities = 29/164 (17%), Positives = 68/164 (41%), Gaps = 24/164 (14%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK--------- 149
+ ++K + K++K+ +KKE E+ +K + ++ + + +E+ E+
Sbjct: 880 LEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPE 939
Query: 150 --------EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP------VEKIRKEEKDSEKEK 195
EK K+E+ +++EE+ K K + EK + KD K++
Sbjct: 940 ELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKE 999
Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEE 239
+ +++ K+L +E +++ + + E V E
Sbjct: 1000 RLEEEKKELLREIIEETCQRFKE-FLELFVSINRGLNKVFFYLE 1042
Score = 38.4 bits (89), Expect = 0.009
Identities = 30/120 (25%), Positives = 57/120 (47%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
E ++ S K K+ + + E+ K+ K + + + +EKKE+ K +E +E E
Sbjct: 382 LESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLE 441
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
K+ K +E + E + + K + E K SE K K L++ +L +++ E +S
Sbjct: 442 TKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLLLRQKLEEASQ 501
Score = 31.1 bits (70), Expect = 1.4
Identities = 22/104 (21%), Positives = 41/104 (39%)
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+EK KK+E+ KK +E + E + ++ K E KE+ E + +L++E
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226
Query: 209 LKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
L K + L + K + E +I+ +
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270
Score = 30.7 bits (69), Expect = 2.1
Identities = 16/115 (13%), Positives = 42/115 (36%)
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
+ K+++ K+ ++ E + + +E K ++K ++ + K+ + ++
Sbjct: 170 RKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLY 229
Query: 202 KKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
K ++ +E E+E I + + E + K K K
Sbjct: 230 LDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKL 284
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 49.3 bits (118), Expect = 9e-07
Identities = 21/77 (27%), Positives = 42/77 (54%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
D++ KK + ++ + ++ K+++K +DKE K + + EE E++ E+E +E +
Sbjct: 45 AADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSD 104
Query: 171 FKMKEDKEKPVEKIRKE 187
KE +EK + KE
Sbjct: 105 ENEKETEEKTESNVEKE 121
Score = 38.9 bits (91), Expect = 0.002
Identities = 22/93 (23%), Positives = 47/93 (50%), Gaps = 1/93 (1%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
+ +S D + ++ K D + + E+ K ++K+ +DK + +KE +E E
Sbjct: 33 QLFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92
Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
+ +EE E+ ++ +KE +E K + + EK +
Sbjct: 93 EENEEEDEESSDENEKETEE-KTESNVEKEITN 124
Score = 38.6 bits (90), Expect = 0.003
Identities = 22/76 (28%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE-KIR 185
D+ D+ + K+ D ++ + + KEE+++ E KE+K KED+E E +
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98
Query: 186 KEEKDSEKEKKSKDKD 201
EE E EK++++K
Sbjct: 99 DEESSDENEKETEEKT 114
Score = 38.6 bits (90), Expect = 0.003
Identities = 21/85 (24%), Positives = 40/85 (47%), Gaps = 2/85 (2%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
++++ KK D++ + + KE++KE E + + +KE++E +EE E +
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE-E 97
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKS 197
E+ EK EEK +K
Sbjct: 98 EDEESSDENEKE-TEEKTESNVEKE 121
Score = 38.6 bits (90), Expect = 0.003
Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K D + + K++EKE + + D K E+ +E+ E+E + ++ +KE +E
Sbjct: 53 KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEE 112
Query: 165 KKEEKEFKMKEDKE-KPV 181
K E K + KPV
Sbjct: 113 KTESNVEKEITNPSWKPV 130
Score = 38.2 bits (89), Expect = 0.005
Identities = 18/86 (20%), Positives = 44/86 (51%), Gaps = 1/86 (1%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKK 154
S + + ++ K + ++ KE+ K+ + +DK +++D+E +E+ + +
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKP 180
EE + EKE +E+ E ++++ P
Sbjct: 100 EESSDENEKETEEKTESNVEKEITNP 125
Score = 37.0 bits (86), Expect = 0.010
Identities = 21/88 (23%), Positives = 49/88 (55%), Gaps = 3/88 (3%)
Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
+ E+E K +++E E +E KEE++ + ++ +K E++D E E++++
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKE--DKGDAEKEDEESEEENE 96
Query: 199 DKDKKLKKEKLKKKKKEKERSSHEKEVI 226
++ + ++ +K+ +EK S+ EKE+
Sbjct: 97 -EEDEESSDENEKETEEKTESNVEKEIT 123
Score = 35.5 bits (82), Expect = 0.035
Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 7/78 (8%)
Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
D+ +E K ++ +E KE+++E E+KE K +KE EE
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKED-------EES 91
Query: 190 DSEKEKKSKDKDKKLKKE 207
+ E E++ ++ + +KE
Sbjct: 92 EEENEEEDEESSDENEKE 109
Score = 31.2 bits (71), Expect = 0.84
Identities = 14/78 (17%), Positives = 37/78 (47%), Gaps = 8/78 (10%)
Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
+ +++E K+ +++ +++E KE EEK++ + +DK K+++
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKE--------EEKEAANSEDKEDKGDAEKEDEES 91
Query: 211 KKKKEKERSSHEKEVIPK 228
+++ E+E E +
Sbjct: 92 EEENEEEDEESSDENEKE 109
Score = 31.2 bits (71), Expect = 0.86
Identities = 10/66 (15%), Positives = 32/66 (48%)
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
++ +E+E K+ + + E +E E+ + +++K +K+ + +E+ ++
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 212 KKKEKE 217
+ +E
Sbjct: 98 EDEESS 103
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 49.8 bits (119), Expect = 1e-06
Identities = 36/141 (25%), Positives = 73/141 (51%), Gaps = 5/141 (3%)
Query: 108 KSIKKDKKTHKKKEKEKIKK---KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
I++ KK KKE+E+ KK + ++ +K + + + KE +++ +K K+ ++ +
Sbjct: 53 NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112
Query: 165 KKEEKEFKMKEDKEKPV-EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
+ EEK+ + +E K K E K E ++EK+ K + K K+ ++E K E ++ + E
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAK-KQAEEEAKAKAAAEAKKKAAEA 171
Query: 224 EVIPKLTFKFGTDMEEKTKRE 244
+ + K + + K K E
Sbjct: 172 KKKAEAEAKAKAEAKAKAKAE 192
Score = 46.8 bits (111), Expect = 1e-05
Identities = 31/111 (27%), Positives = 51/111 (45%), Gaps = 4/111 (3%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK---EEKEKKEE 168
K K+ + K K+ + K + + + K KE+ K++ E+E K E K E +KK E
Sbjct: 117 KQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE 176
Query: 169 KEFKMKED-KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
E K K + K K + K + ++ K K + + K + E E ER
Sbjct: 177 AEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAER 227
Score = 46.0 bits (109), Expect = 2e-05
Identities = 27/111 (24%), Positives = 51/111 (45%), Gaps = 2/111 (1%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
K+ + +K K+ E+ K+ ++K K + K K + + + + EK+ KEE
Sbjct: 91 QKELEQRAAAEK--AAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEA 148
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
+K+ E+E K K E + ++K + K + K K E+ K K +
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAE 199
Score = 44.4 bits (105), Expect = 7e-05
Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 9/122 (7%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK-- 160
KK + ++ + + E+ + K+ + + + K+ ++ K+ E+ +K+ +E K
Sbjct: 69 RQKKLEQQAEEAEKQRAAEQARQKELEQRA-AAEKAAKQAEQAAKQAEEKQKQAEEAKAK 127
Query: 161 ---EEKEKKEEK-EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
E K K E + E K KE+ +K E+ K + +E +KK+ + KK + E K K E
Sbjct: 128 QAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAE--AKAKAEA 185
Query: 217 ER 218
+
Sbjct: 186 KA 187
Score = 34.4 bits (79), Expect = 0.11
Identities = 21/77 (27%), Positives = 34/77 (44%), Gaps = 5/77 (6%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE--KEKKEE 162
K+++ K+ ++ K K + KKK + K K + + K K E + K E K K E
Sbjct: 144 AKEEAKKQAEEEAKAKAAAEAKKKAAEA---KKKAEAEAKAKAEAKAKAKAEEAKAKAEA 200
Query: 163 KEKKEEKEFKMKEDKEK 179
+ K E K + E
Sbjct: 201 AKAKAAAEAAAKAEAEA 217
Score = 29.4 bits (66), Expect = 3.5
Identities = 21/67 (31%), Positives = 28/67 (41%), Gaps = 2/67 (2%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE-EKEKKEEK 169
KK KKK + + K K + K K K + + K K + K K E K E E
Sbjct: 165 KKKAAEAKKKAEAEAKAKAEAKAKAKAE-EAKAKAEAAKAKAAAEAAAKAEAEAAAAAAA 223
Query: 170 EFKMKED 176
E + K D
Sbjct: 224 EAERKAD 230
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 49.4 bits (118), Expect = 2e-06
Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 10/89 (11%)
Query: 120 KEKEKIKKKKDKKDKDKIK-----NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
K E++K D + + NK+ K+ + K E +E + K +E++ K +
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQE 410
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
E+++K +KE+ D +KEK+ KD+ KK
Sbjct: 411 ENEKK-----QKEQADEDKEKRQKDERKK 434
Score = 42.1 bits (99), Expect = 4e-04
Identities = 29/103 (28%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 77 KVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK 136
+ D +L + + +V S+ DL K+ + +K+ K + K K+ + KD+ D
Sbjct: 338 NLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTK-KLGEVKDETDAS- 395
Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
E+ + K ++EK K+EE E K++KE+ +E + K ++D+ K
Sbjct: 396 ----EEAEAKAKEEKLKQEENE-KKQKEQADEDKEKRQKDERK 433
Score = 40.9 bits (96), Expect = 0.001
Identities = 21/92 (22%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
K ++ K + ++ + KE K K + K+ E K+E D + E
Sbjct: 350 TKLYEEVKSNTDLSGDKRQELLKEYNK--KLQDYTKKLGEVKDET------DASEEAEAK 401
Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
KEEK ++E + K K++ + ++ ++K + K
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
Score = 40.2 bits (94), Expect = 0.002
Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 7/81 (8%)
Query: 147 KEKEKGKKEEK---EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
K E+ K +K++E K+ K+ + K V K+E D+ +E ++K K++K
Sbjct: 351 KLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEV----KDETDASEEAEAKAKEEK 406
Query: 204 LKKEKLKKKKKEKERSSHEKE 224
LK+E+ +KK+KE+ EK
Sbjct: 407 LKQEENEKKQKEQADEDKEKR 427
Score = 35.5 bits (82), Expect = 0.050
Identities = 16/59 (27%), Positives = 33/59 (55%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
+ L + K KD+ ++ + K K++K K+++++ K KE+ E KEK + + +K
Sbjct: 376 KKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 49.1 bits (118), Expect = 3e-06
Identities = 27/161 (16%), Positives = 65/161 (40%), Gaps = 8/161 (4%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTH------KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
SS++L E K+ I + K+H ++ KE+ K++ +++ K + + + ++E
Sbjct: 15 VSSKELLEKLKELGI--EVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEA 72
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
+ +E E E + +++ + + +K +K K K+K
Sbjct: 73 KAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKK 132
Query: 210 KKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 250
K+K + + + ++K K++ + KI
Sbjct: 133 PKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIP 173
Score = 41.8 bits (99), Expect = 6e-04
Identities = 23/125 (18%), Positives = 47/125 (37%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E K + + ++ + E ++ + E E E + E +
Sbjct: 55 EAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRP 114
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
K KK K+ K + K+K ++ K K + + + ++ + KKKK++
Sbjct: 115 KAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIPR 174
Query: 223 KEVIP 227
+ VIP
Sbjct: 175 EVVIP 179
Score = 41.8 bits (99), Expect = 7e-04
Identities = 16/119 (13%), Positives = 41/119 (34%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
+ + K KE ++K K+ + K + ++E+ KE+ K+E +E+ + + +
Sbjct: 2 SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAE 61
Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ +E K + EE + + + + +
Sbjct: 62 EAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAA 120
Score = 41.8 bits (99), Expect = 7e-04
Identities = 15/124 (12%), Positives = 47/124 (37%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E K ++ + +++ K + ++ + + + +++ + +
Sbjct: 54 EEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARR 113
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+ K+ + K +K K + + + ++ ++ + KKKKK+K
Sbjct: 114 PKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIP 173
Query: 223 KEVI 226
+EV+
Sbjct: 174 REVV 177
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 48.3 bits (115), Expect = 5e-06
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 2/128 (1%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKK--KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
L K + SIK +KKT + KK ++ +K+ + K K K K +
Sbjct: 368 LENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNN 427
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
K+++ +K+E E + + EK + + ++K +KD +L K L+K K K
Sbjct: 428 KEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNMLQKFNKFKNE 487
Query: 219 SSHEKEVI 226
S E I
Sbjct: 488 ESAENSFI 495
Score = 29.8 bits (67), Expect = 3.6
Identities = 22/88 (25%), Positives = 36/88 (40%)
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
I KK + KE+ KK++ +K+ +K EK + E + K+K K+ +
Sbjct: 415 KIFYHKKKINQNNKEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELS 474
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
K K +K K E D K+
Sbjct: 475 KNMLQKFNKFKNEESAENSFIDIIKQAD 502
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 48.5 bits (116), Expect = 6e-06
Identities = 42/184 (22%), Positives = 84/184 (45%), Gaps = 2/184 (1%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK-KKKDKKDKDKIKNKEKDKEKKEK 149
+K T+ +D+ DK + ++ + +EKE K ++ K K K K K KK++
Sbjct: 1119 EKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKE 1178
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
+K KK +K ++ + ++K K +K ++ +S + D+++K K +K
Sbjct: 1179 KKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKS 1238
Query: 210 KKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 269
K+ + ++++ K F D+ ++ K +++PK V SP PP E +
Sbjct: 1239 SVKRLKSKKNNSSKSSEDNDEF-SSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESN 1297
Query: 270 TGDS 273
G
Sbjct: 1298 GGSK 1301
Score = 47.7 bits (114), Expect = 1e-05
Identities = 41/219 (18%), Positives = 77/219 (35%), Gaps = 14/219 (6%)
Query: 42 PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKD----------VELGVTPM--V 89
SK K K S K K+K K+K+S + + + V +L P
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKK 1217
Query: 90 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
N +D + K K + KK K + D+ D + + K K ++
Sbjct: 1218 SNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKR 1277
Query: 150 EKGKKEEKE--KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
+ K + K+K +++ +K+KKS+ K + KK
Sbjct: 1278 VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKS 1337
Query: 208 KLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
K + K+ +SS K ++ ++ ++ + S
Sbjct: 1338 KTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDS 1376
Score = 40.8 bits (96), Expect = 0.002
Identities = 27/155 (17%), Positives = 54/155 (34%), Gaps = 2/155 (1%)
Query: 40 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 99
P S +K S K+ +S + + + + KE + V S
Sbjct: 1234 KPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPK--RVSAVQYSPPPPSKR 1291
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
GE K KK E KK K + K K K K ++ + +
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSR 1351
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
+ +K++ + ++D + V+ E+ + +++
Sbjct: 1352 LLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386
Score = 39.6 bits (93), Expect = 0.003
Identities = 27/177 (15%), Positives = 57/177 (32%), Gaps = 3/177 (1%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
+ EI K++ +K K K ++ KKK+K K K + +K K+ K+ + +
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEK--KKKKSSADKSKKASVVGNSKRVDSD 1204
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+K + + K + + ++ ++ +K + K+ K K K +
Sbjct: 1205 EKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFS-SD 1263
Query: 219 SSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQ 275
++ + P PS P + G
Sbjct: 1264 DLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAA 1320
Score = 39.6 bits (93), Expect = 0.003
Identities = 43/204 (21%), Positives = 76/204 (37%), Gaps = 29/204 (14%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD-------- 143
++ +DL + K + K KKK ++ ++++ ++D + E D
Sbjct: 1027 VITNAKKKDLVKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAV 1086
Query: 144 --------------KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI----R 185
KEK EK + E+KEK+ EK K + ED +K E +
Sbjct: 1087 SYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEE 1146
Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
EEK+ KE++ K K K + K K K+KE+ + + +S
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVV---GNSKRVDS 1203
Query: 246 SPKIVIKPVKSPSPPPVAAADEYD 269
K + + +D+ D
Sbjct: 1204 DEKRKLDDKPDNKKSNSSGSDQED 1227
Score = 36.9 bits (86), Expect = 0.022
Identities = 31/162 (19%), Positives = 58/162 (35%), Gaps = 11/162 (6%)
Query: 38 PGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTS 97
P ++K K + K+S S + S + K + K V P +
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDG 1294
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK--EKEKGKKE 155
+ G + KK KK + KKKK + K K K K + K+ + +
Sbjct: 1295 ESNGGSKPSSPTKKKVKKRLEGSLAAL-KKKKKSEKKTARKKKSKTRVKQASASQSSRLL 1353
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+ +K++ + E + + D E++D E ++
Sbjct: 1354 RRPRKKKSDSSSEDDDDSEVDDS--------EDEDDEDDEDD 1387
Score = 36.9 bits (86), Expect = 0.022
Identities = 25/167 (14%), Positives = 52/167 (31%), Gaps = 5/167 (2%)
Query: 39 GTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS 98
G+ + K PK + + K + E D L N K S+
Sbjct: 1222 GSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDD-LSKEGKPKNAPKRVSA 1280
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD----KIKNKEKDKEKKEKEKGKK 154
K + K KK K + + K K K + K ++K+ +
Sbjct: 1281 VQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTR 1340
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
++ + + + K K D + + + +++ + + D
Sbjct: 1341 VKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387
Score = 29.2 bits (66), Expect = 5.5
Identities = 41/196 (20%), Positives = 74/196 (37%), Gaps = 31/196 (15%)
Query: 101 LGEIKKDKSIKKDKK---THKKKEKEKIKKKKDK---KDKDKIKNKEKDKEKKEKEKGKK 154
LG+++++ + +K H + I K K K+ K+ K+K +
Sbjct: 1001 LGKLERELARLSNKVRFIKHVINGELVITNAKKKDLVKELKKLGYVRFKDIIKKKSEKIT 1060
Query: 155 EEKEKKEEKEKKEEKEFKMKED------------------KEKPVEKIRKE--EKDSEKE 194
E+E+ E++ + + E +E KEK VEK+ E +K+ E E
Sbjct: 1061 AEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEK-VEKLNAELEKKEKELE 1119
Query: 195 KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPV 254
K K + E L K ++ E +E + + KTK ++S K
Sbjct: 1120 KLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRL----KSKTKGKASKLRKPKLK 1175
Query: 255 KSPSPPPVAAADEYDT 270
K ++AD+
Sbjct: 1176 KKEKKKKKSSADKSKK 1191
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 47.9 bits (115), Expect = 8e-06
Identities = 32/112 (28%), Positives = 60/112 (53%), Gaps = 10/112 (8%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
K +E E + K+ +K ++ + KEK +E+++K + E++ ++ KE K+E + +KE +
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELR 594
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKL-----KKEKLKKKKKEKERSSHEKE 224
+ +K S K + + K+L KKEK KKK+KEK+ +
Sbjct: 595 QL-----QKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGD 641
Score = 46.0 bits (110), Expect = 3e-05
Identities = 38/132 (28%), Positives = 66/132 (50%), Gaps = 21/132 (15%)
Query: 115 KTHKKKEKEK----IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE-------- 162
K ++KEK I ++ + + + K +E + KE EK K+E +EKKE+
Sbjct: 508 KKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKL 567
Query: 163 KEKKEEK-EFKMKEDKEKPVEKIR------KEEKDSEKEKKSKDKDKKLKK--EKLKKKK 213
E+ E++ + +KE K++ E I+ K S K + + K+L K EK +KKK
Sbjct: 568 LEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKK 627
Query: 214 KEKERSSHEKEV 225
K+++ E +V
Sbjct: 628 KKQKEKQEELKV 639
Score = 37.5 bits (88), Expect = 0.013
Identities = 33/151 (21%), Positives = 70/151 (46%), Gaps = 9/151 (5%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
++KK ++E++K+ ++ +K+ + IK +K+ ++ KE + ++ K + +
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEAR 613
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK---KEKLKKKKKEKERSSHEKEVIPK 228
K + EK +K++K+ ++E K D+ K L K ++ +KE + I K
Sbjct: 614 KRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAI--VQAGIMK 671
Query: 229 LTFKFGTDMEEKTKRESSPKIVIKPVKSPSP 259
+ D+E K + K K K+ P
Sbjct: 672 MKVPLS-DLE---KIQKPKKKKKKKPKTVKP 698
Score = 35.6 bits (83), Expect = 0.058
Identities = 38/161 (23%), Positives = 68/161 (42%), Gaps = 43/161 (26%)
Query: 99 EDLGEIKKDKSIKKDKKTHK------------KKEKEK-IKKKKDKKDKDKIKNKEKD-K 144
E+L E K+ ++DK + KKE ++ IK+ + + K +
Sbjct: 551 EELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELI 610
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFK------------------MKEDKEKPVE---- 182
E +++ E+KEKK++K+K++++E K + +DKE V+
Sbjct: 611 EARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIM 670
Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
K++ D EK +K KK+K KK K K +
Sbjct: 671 KMKVPLSDLEKIQKP-------KKKKKKKPKTVKPKPRTVS 704
Score = 33.6 bits (78), Expect = 0.22
Identities = 35/155 (22%), Positives = 67/155 (43%), Gaps = 29/155 (18%)
Query: 99 EDLGEIKKDKSIKKDKKTHK-----KKEKEK-IKKKKDKKDKDKIKNKEKD-KEKKEKEK 151
E L E + + +K+ + KKE ++ IK+ + + K + E +++
Sbjct: 558 EKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLN 617
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD---------- 201
E+KEKK++K+K++++E K+ ++ K + E DK+
Sbjct: 618 KANEKKEKKKKKQKEKQEELKVGDEV-----KYLSLGQKGEVLSIPDDKEAIVQAGIMKM 672
Query: 202 -------KKLKKEKLKKKKKEKERSSHEKEVIPKL 229
+K++K K KKKKK K + V +L
Sbjct: 673 KVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLEL 707
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 47.3 bits (112), Expect = 1e-05
Identities = 34/136 (25%), Positives = 71/136 (52%), Gaps = 12/136 (8%)
Query: 95 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
V D+ ++K+ +S + K+ + KE+ K+ K + K + + +K+ E +K
Sbjct: 194 VNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQK 253
Query: 155 EEKEKKEEK-----EKKEEKEFKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLKKEK 208
+++ K K KE+K ++ E++++ +EK + E +K+ E+ K+KD K
Sbjct: 254 QQEAKNLPKPADTSSPKEDK--QVAENQKREIEKAQIEIKKNDEEALKAKDH----KAFD 307
Query: 209 LKKKKKEKERSSHEKE 224
LK++ K E+ + +KE
Sbjct: 308 LKQESKASEKEAEDKE 323
Score = 36.1 bits (83), Expect = 0.033
Identities = 24/127 (18%), Positives = 62/127 (48%), Gaps = 7/127 (5%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
K ++D + ++D+ +K ++ + K+ DK +N++++ EK + E K
Sbjct: 235 KADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKE--DKQVAENQKREIEKAQIEIKK 292
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD-KKLKKEKLKKK 212
+E+ K K+ K F +K++ + ++ +E +++K+++ +D +K K + +
Sbjct: 293 NDEEALK----AKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQP 348
Query: 213 KKEKERS 219
E +
Sbjct: 349 TSLNEDA 355
Score = 28.8 bits (64), Expect = 7.3
Identities = 22/96 (22%), Positives = 40/96 (41%), Gaps = 9/96 (9%)
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
++ED EK V R E+E + K + KE+L KK+ + +++ + +
Sbjct: 185 ALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKAD------- 237
Query: 232 KFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADE 267
F D +K + E K + P P ++ E
Sbjct: 238 -FAQDNADKQRDEVRQKQQ-EAKNLPKPADTSSPKE 271
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 47.4 bits (113), Expect = 1e-05
Identities = 23/147 (15%), Positives = 62/147 (42%), Gaps = 14/147 (9%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHK--------------KKEKEKIKKKKDKKDKDKI 137
IV + ++ + +I I D + + E+E++K + ++++ +
Sbjct: 772 IVSLFNNPNAEKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLM 831
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+ +E + +E++ K+E K + E ++ EE+ + E+K + VE +K ++
Sbjct: 832 RPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAK 891
Query: 198 KDKDKKLKKEKLKKKKKEKERSSHEKE 224
K + K+ ++ + +
Sbjct: 892 DVLIKLVISSDEIKQDEKTTKKKKKDL 918
Score = 43.1 bits (102), Expect = 3e-04
Identities = 21/117 (17%), Positives = 52/117 (44%), Gaps = 5/117 (4%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK--DKEKKEKEKGKKEEKEKKEEK 163
D+ I+ +++ K ++ + ++ + I ++K +E K + +E +E+ +
Sbjct: 808 IDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNI 867
Query: 164 EKKEEKEFKMKEDKEKPVEKI---RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+ + + + K D +K + I K + K +K KK+K +K ++E
Sbjct: 868 NENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEE 924
Score = 37.3 bits (87), Expect = 0.015
Identities = 20/118 (16%), Positives = 50/118 (42%), Gaps = 3/118 (2%)
Query: 65 KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 124
++ E+E K ++ M P +++ + E ++ K + + +++ +
Sbjct: 808 IDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNI 867
Query: 125 IKKKK---DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
+ K + K+ K NK + + ++ K++EK K++K+ K D+E
Sbjct: 868 NENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEEA 925
Score = 34.6 bits (80), Expect = 0.11
Identities = 19/101 (18%), Positives = 49/101 (48%), Gaps = 2/101 (1%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKI--KKKKDKKDKDKIKNKEKDKEKKEKEK 151
++ E L ++ + I +++K K++ K ++ ++ +++ ++ +NK + E K K
Sbjct: 823 RIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPK 882
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
+ K+ K ++K+D++ +K + EK E
Sbjct: 883 KLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDE 923
Score = 32.7 bits (75), Expect = 0.41
Identities = 24/130 (18%), Positives = 57/130 (43%), Gaps = 14/130 (10%)
Query: 133 DKDKIKNKEKDKEKKE-KEKGKKEEKEK-------KEEKEKKEEKEFKMKEDKEKPVEKI 184
D I N + + +E ++ + E+E+ +EE + E+ + E+++ ++I
Sbjct: 791 LSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEI 850
Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIPKLTFKFGTDMEEKTK 242
+ E + ++ ++ + K E ++ K K+ + K+V+ KL +E +
Sbjct: 851 KLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLV----ISSDEIKQ 906
Query: 243 RESSPKIVIK 252
E + K K
Sbjct: 907 DEKTTKKKKK 916
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 46.6 bits (111), Expect = 1e-05
Identities = 22/118 (18%), Positives = 43/118 (36%), Gaps = 1/118 (0%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
T L I S + + K K K ++K+ E K+K + KK+
Sbjct: 16 TQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKK 75
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
+K++K+E + + E + K K+ K + + E +K + +
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP-NEDVDNAFNKIAELAEKSNVY 132
Score = 42.0 bits (99), Expect = 4e-04
Identities = 36/137 (26%), Positives = 55/137 (40%), Gaps = 15/137 (10%)
Query: 57 SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 116
S + K K P YS V + + ++ KKDK K+ ++
Sbjct: 10 SFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTS---TKKDKKEDKNNES 66
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
KKK ++K KKKK+KK+ K E E + KK +K KK+ + K ED
Sbjct: 67 -KKKSEKKKKKKKEKKE-----------PKSEGETKLGFKTPKKSKKTKKKPPKPKPNED 114
Query: 177 KEKPVEKIRKEEKDSEK 193
+ KI + + S
Sbjct: 115 VDNAFNKIAELAEKSNV 131
Score = 40.9 bits (96), Expect = 0.001
Identities = 27/108 (25%), Positives = 49/108 (45%), Gaps = 1/108 (0%)
Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
T +K + + I + E++ K +K+KKE+K + +K+ K+
Sbjct: 15 TTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKK-SEKK 73
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
K+K +K K E +++ K+ K KK KK+ K K E ++ K
Sbjct: 74 KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNK 121
Score = 39.7 bits (93), Expect = 0.003
Identities = 25/116 (21%), Positives = 42/116 (36%), Gaps = 1/116 (0%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
K K + KE E+E K K+ KE + +K+ + +K+K
Sbjct: 18 KSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKK 77
Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEE-KTKRESSPKIV 250
K K + K + KL K +K + + +K PK + E S +
Sbjct: 78 KEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVYI 133
Score = 38.2 bits (89), Expect = 0.008
Identities = 27/94 (28%), Positives = 43/94 (45%), Gaps = 2/94 (2%)
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
++ + + KE E+ K S K+ K +DK+ + KK+ KKKKK
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76
Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
+KE+ + E KL FK T + K ++ PK
Sbjct: 77 KKEKKEPKSEGETKLGFK--TPKKSKKTKKKPPK 108
Score = 33.2 bits (76), Expect = 0.28
Identities = 23/110 (20%), Positives = 43/110 (39%), Gaps = 4/110 (3%)
Query: 144 KEKKEKEKGKKEEKE-KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
K K + +E +E K +KE+K++E +KKS+ K K
Sbjct: 18 KSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTK--KDKKEDKNNESKKKSEKK-K 74
Query: 203 KLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
K KKEK + K + + + + K T K + +++ + +
Sbjct: 75 KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAE 124
Score = 30.5 bits (69), Expect = 1.8
Identities = 18/98 (18%), Positives = 37/98 (37%), Gaps = 3/98 (3%)
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
++ K + + K + EE E + + K K++K + KK+ E+
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEE---ENKVATTSTKKDKKEDKNNESKKKSEKK 73
Query: 220 SHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSP 257
+K+ + + T + KT ++S P P
Sbjct: 74 KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 46.5 bits (111), Expect = 1e-05
Identities = 25/83 (30%), Positives = 45/83 (54%), Gaps = 7/83 (8%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
K K ++ +K DK +++ + K E++ +E E +EKKEEK+K+E + K
Sbjct: 247 KLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQE----EAQEKKEEKKKEEREA---K 299
Query: 175 EDKEKPVEKIRKEEKDSEKEKKS 197
K P E+ + EEK+ +K+ +
Sbjct: 300 LAKLSPEEQRKLEEKERKKQARK 322
Score = 38.8 bits (91), Expect = 0.004
Identities = 25/83 (30%), Positives = 47/83 (56%), Gaps = 9/83 (10%)
Query: 135 DKIKN---KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
DK+ N + K +K + ++EEK K +E+++E+ + KE+K+K E+ K K S
Sbjct: 246 DKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKK-EEREAKLAKLS 304
Query: 192 EKEKKSKDKDKKLKKEKLKKKKK 214
+E++ K +KE+ K+ +K
Sbjct: 305 PEEQR-----KLEEKERKKQARK 322
Score = 36.8 bits (86), Expect = 0.014
Identities = 20/66 (30%), Positives = 44/66 (66%), Gaps = 3/66 (4%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKK-DKDKIKNKEKDKEKKEKEKGK--KEEKEKKEE 162
+ ++K KT +++E++ +K ++++ ++ + K +EK KE++E + K EE+ K EE
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313
Query: 163 KEKKEE 168
KE+K++
Sbjct: 314 KERKKQ 319
Score = 30.7 bits (70), Expect = 1.4
Identities = 15/62 (24%), Positives = 36/62 (58%), Gaps = 1/62 (1%)
Query: 93 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
V T E+ +I K ++ ++ +KKE++K ++++ K K + +++ E+KE++K
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPE-EQRKLEEKERKKQ 319
Query: 153 KK 154
+
Sbjct: 320 AR 321
Score = 28.8 bits (65), Expect = 6.9
Identities = 14/72 (19%), Positives = 40/72 (55%), Gaps = 3/72 (4%)
Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 236
+ + K+ K ++ E++ ++++ ++ + KK++K+KE + + KL+ +
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEE---REAKLAKLSPEEQRK 310
Query: 237 MEEKTKRESSPK 248
+EEK +++ + K
Sbjct: 311 LEEKERKKQARK 322
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 46.7 bits (111), Expect = 2e-05
Identities = 24/106 (22%), Positives = 45/106 (42%), Gaps = 3/106 (2%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+KK H+K +K+ K K KD K K K K+K +K+ ++ + +
Sbjct: 48 INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYD---NFFDSKNNSK 104
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+ + + K + + K+K + +K KK+K+ K E
Sbjct: 105 QFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELI 150
Score = 36.3 bits (84), Expect = 0.037
Identities = 25/109 (22%), Positives = 39/109 (35%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
K KK K KKK K+K+ D + KN K ++ K K E+
Sbjct: 66 TDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKL 125
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+KK + K+K + + K + KS L ++L
Sbjct: 126 KKKITVNKSTNKKKKKVLSSKDELIKYDNNKPKSISIHSPLTIQELSTL 174
Score = 34.8 bits (80), Expect = 0.11
Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 21/159 (13%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK----EKKEKEKGKKEEKEKKEEKEKKE 167
+ T K IK K K ++++ KD + +K K ++K KK K
Sbjct: 10 SSENTIDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDL 69
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL------KKKKKEKERSSH 221
+ K K ++K ++K + D + SK+ K+ K K EK +
Sbjct: 70 KDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKI 129
Query: 222 EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPP 260
K K++ + P
Sbjct: 130 TVNKSTN-----------KKKKKVLSSKDELIKYDNNKP 157
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 46.1 bits (109), Expect = 3e-05
Identities = 32/133 (24%), Positives = 63/133 (47%), Gaps = 2/133 (1%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--EKEKKEEKEKKEEKEFKMKE 175
K+ EKE++K ++ +K ++ + + + ++K+++E+ +K E++KK E K + K
Sbjct: 104 KQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKL 163
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 235
+K +E + +E K+K + KK+ + K E++ E E K K
Sbjct: 164 KAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEA 223
Query: 236 DMEEKTKRESSPK 248
EEK E
Sbjct: 224 AAEEKAAAEKKKA 236
Score = 45.7 bits (108), Expect = 3e-05
Identities = 26/117 (22%), Positives = 49/117 (41%), Gaps = 3/117 (2%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+ KK K K + K K + K E+ + E+ K K E K++ E + +
Sbjct: 145 QKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAA 204
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK---KLKKEKLKKKKKEKERSSHEKEV 225
+ + + + K K+ + + +EK + +K K K K +K K ER + +
Sbjct: 205 EKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAAL 261
Score = 37.6 bits (87), Expect = 0.010
Identities = 29/121 (23%), Positives = 57/121 (47%), Gaps = 5/121 (4%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK-KEEKEKKEEK 163
K+ + +K + +K+++E+ +K ++ K K K E K K E +KK E+
Sbjct: 118 KQAEEAEKQAQLEQKQQEEQ--ARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEE 175
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
K +E K K E K + E + +K+K + + K + K + +E+++ EK
Sbjct: 176 AAKAAEEAKAK--AEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEK 233
Query: 224 E 224
+
Sbjct: 234 K 234
Score = 37.6 bits (87), Expect = 0.012
Identities = 28/117 (23%), Positives = 65/117 (55%), Gaps = 2/117 (1%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK-KEEKEKKEEKEKK 166
I+ + + KK E+++ KK++ ++ K K + + K+ EK + K ++++K+ +E +
Sbjct: 65 GRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAE 124
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEK-KSKDKDKKLKKEKLKKKKKEKERSSHE 222
++ + + K+ +E+ + +++K +E K K+ + KLK KKK E+ + E
Sbjct: 125 KQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAE 181
Score = 33.4 bits (76), Expect = 0.22
Identities = 24/97 (24%), Positives = 40/97 (41%), Gaps = 5/97 (5%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
KK + K + K K + KKK + + K ++ E + K K EK+ + E
Sbjct: 171 KKAEEAAKAAEEAKAKAEAAAAKKKAEAEA-KAAAEKAKAEAEAKAKA---EKKAEAAAE 226
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
+K E K K K +K K +E++ + D
Sbjct: 227 EKAAAEKKKAAAKAK-ADKAAAAAKAAERKAAAAALD 262
Score = 33.4 bits (76), Expect = 0.23
Identities = 21/99 (21%), Positives = 57/99 (57%), Gaps = 4/99 (4%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
+I++++ +K E+++ KKEE+ +E K K+ ++ ++K+ +++ ++ ++++ E EK
Sbjct: 66 RIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEK 125
Query: 196 KSKDKDKKLK----KEKLKKKKKEKERSSHEKEVIPKLT 230
+++ + K+ + K ++KKK + + KL
Sbjct: 126 QAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLK 164
Score = 30.3 bits (68), Expect = 2.1
Identities = 21/114 (18%), Positives = 44/114 (38%), Gaps = 4/114 (3%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
K + + + + K KKK +E K ++ K K + +K E + K +
Sbjct: 146 KKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAE 205
Query: 154 KEEKEKKE----EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
K + E + EK+ + E K +K+K K + ++ + + +
Sbjct: 206 KAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAA 259
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 46.1 bits (109), Expect = 3e-05
Identities = 26/99 (26%), Positives = 55/99 (55%), Gaps = 5/99 (5%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEFKMKEDKEKPVEKIRKEE--KD 190
+ N + +EK+ K+++K + E KEK ++KE + ++ + PV++ +E K
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIP 227
+++ KKS ++ K E LK K++ ++ +E ++ IP
Sbjct: 65 ADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIP 103
Score = 44.9 bits (106), Expect = 7e-05
Identities = 41/147 (27%), Positives = 72/147 (48%), Gaps = 10/147 (6%)
Query: 44 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
D++ KD EK K + KEK K ++V VT V + VK SS++
Sbjct: 11 INNNDRMQEKDD----EKQDQKNRMELKEKVLDKKEEV---VTDNVDSPVKEQSSQE--N 61
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-KEKGKKEEKEKKEE 162
+K +KK K K+ E +K K++ + + + + +K E KE K+ ++ K E
Sbjct: 62 LKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPE 121
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEK 189
+ KK+ K F++ E K+ P+ + E++
Sbjct: 122 QAKKQTKLFRLFEPKQLPIYRANGEKE 148
Score = 41.1 bits (96), Expect = 0.001
Identities = 22/97 (22%), Positives = 48/97 (49%), Gaps = 13/97 (13%)
Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
++ ++ ++++ EK+++K + E KE K+ + KE+ V + DS +++
Sbjct: 2 AYRNRREANINNNDRMQEKDDEKQDQKNRMELKE-KVLDKKEEVVT----DNVDSPVKEQ 56
Query: 197 SKDKDKKLKKEKLKKKKKE--------KERSSHEKEV 225
S ++ K+ E K K+E K + H+KE+
Sbjct: 57 SSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQKEI 93
Score = 38.4 bits (89), Expect = 0.008
Identities = 29/111 (26%), Positives = 61/111 (54%), Gaps = 3/111 (2%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
++K +K + ++ + KEK+ KK++ D + + K++ +E K E K+ +E+
Sbjct: 17 MQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEES 76
Query: 165 KKEEKEFKMKEDKEKPVE-KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
K+ + K KE+ +K ++ +I ++ S + K+S KKL+ K ++ KK
Sbjct: 77 KQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKES--ILKKLEDIKPEQAKK 125
Score = 36.8 bits (85), Expect = 0.026
Identities = 24/100 (24%), Positives = 45/100 (45%), Gaps = 7/100 (7%)
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-KEKGKKEEKEKKEEKEKKEEKEFKMKE 175
++ + + I ++KD K+ K + E KEK +++E + KE +E
Sbjct: 3 YRNRREANINNNDRMQEKDD--EKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQE 60
Query: 176 D--KEKPVEKIRKEEKDSEKE--KKSKDKDKKLKKEKLKK 211
+ V+K KEE E K ++ K+++ E L+K
Sbjct: 61 NLKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQK 100
Score = 36.1 bits (83), Expect = 0.044
Identities = 19/90 (21%), Positives = 48/90 (53%), Gaps = 3/90 (3%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+++ +I ++K+ EK +K + K+K+ +K+++ + KE+ ++ K
Sbjct: 6 RREANINN-NDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
E K+ KE+ ++ +E + K +++ +KE
Sbjct: 65 ADEVKKST-KEESKQLLE-VLKTKEEHQKE 92
Score = 31.1 bits (70), Expect = 1.3
Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 6/65 (9%)
Query: 187 EEKDSEKEK---KSKDKDKKLKKEKLKKKKK---EKERSSHEKEVIPKLTFKFGTDMEEK 240
D +EK K K++ KEK+ KK+ S KE + K ++++
Sbjct: 12 NNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKS 71
Query: 241 TKRES 245
TK ES
Sbjct: 72 TKEES 76
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 45.4 bits (108), Expect = 4e-05
Identities = 27/94 (28%), Positives = 49/94 (52%), Gaps = 6/94 (6%)
Query: 115 KTHKK---KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK---EEKEKKEEKEKKEE 168
KTH+K K +EKIK K + + K + K K K + EK + + K+++
Sbjct: 273 KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKK 332
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
KE K +E K+K +E++ + + E + K+++K
Sbjct: 333 KEKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366
Score = 39.6 bits (93), Expect = 0.002
Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 138 KNKEKDKEK-KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
K+ EK +EK K + K K+ E + + K+K E+ EK+ E K+ +KEKK
Sbjct: 277 KSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKK 336
Query: 197 SKDKDKKL------KKEKLKKKKKEKE 217
++K KK + EKL+ + +KE
Sbjct: 337 KEEKKKKQIERLEERIEKLEVQATDKE 363
Score = 38.5 bits (90), Expect = 0.006
Identities = 19/78 (24%), Positives = 38/78 (48%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
+ L K K K+K K ++ ++ ++ K++K+KE+ KK++ E
Sbjct: 287 KALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIE 346
Query: 159 KKEEKEKKEEKEFKMKED 176
+ EE+ +K E + KE+
Sbjct: 347 RLEERIEKLEVQATDKEE 364
Score = 36.6 bits (85), Expect = 0.021
Identities = 24/75 (32%), Positives = 39/75 (52%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
IK K K K + K+ K K + N++ D E KEK+K KK+E++KK++
Sbjct: 286 IKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQI 345
Query: 164 EKKEEKEFKMKEDKE 178
E+ EE+ K++
Sbjct: 346 ERLEERIEKLEVQAT 360
Score = 33.5 bits (77), Expect = 0.24
Identities = 17/60 (28%), Positives = 32/60 (53%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
+K K E++ K + K+K K K KE+ K+K+ + ++ EK + + +K+E K
Sbjct: 308 DLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENKT 367
Score = 33.1 bits (76), Expect = 0.28
Identities = 13/63 (20%), Positives = 34/63 (53%)
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+EK + ++++K K+ + + + + K + ++KL E +KKK++K+ +
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342
Query: 223 KEV 225
K++
Sbjct: 343 KQI 345
Score = 31.9 bits (73), Expect = 0.73
Identities = 28/102 (27%), Positives = 51/102 (50%), Gaps = 6/102 (5%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
EK + K + + + ++ KK F+M D ++ ++ K E+D+EK + KK
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKS--KFERDNEKLDAEVKEKKK---- 333
Query: 208 KLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKI 249
+ KK++K+K++ +E I KL + E KT + KI
Sbjct: 334 EKKKEEKKKKQIERLEERIEKLEVQATDKEENKTVALGTSKI 375
Score = 28.9 bits (65), Expect = 5.5
Identities = 19/68 (27%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
L E+ D K K + EK + K+ KK+K K + K+K + ++ +E+ +K E +
Sbjct: 301 LLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKK-EEKKKKQIERLEERIEKLEVQA 359
Query: 160 KEEKEKKE 167
+++E K
Sbjct: 360 TDKEENKT 367
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 45.8 bits (109), Expect = 4e-05
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 5/88 (5%)
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
KK KE + EE+E EE + + E + K K++ ++ +EK+ DK + L
Sbjct: 224 KKLKETSETEERE--EETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281
Query: 206 KEKLKKKKKEKERSSHEKEVIPKLTFKF 233
K ++ K+KK++E EK P T F
Sbjct: 282 KLEILKEKKDEELFWFEK---PWPTLLF 306
Score = 39.6 bits (93), Expect = 0.003
Identities = 16/76 (21%), Positives = 37/76 (48%), Gaps = 4/76 (5%)
Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
T K KE + ++++++ D + E K+E+E +E+ E+++ +K E
Sbjct: 223 TKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKT----E 278
Query: 176 DKEKPVEKIRKEEKDS 191
D +K K++++
Sbjct: 279 DLDKLEILKEKKDEEL 294
Score = 31.9 bits (73), Expect = 0.76
Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
F K+ KE + R+EE D E E S+ K K+E+ +++ S EKE K
Sbjct: 221 FFTKKLKETSETEEREEETDVEIETTSETK--GTKQEQEGSTEEDPSLFSEEKEDPDKTE 278
Query: 231 FKFGTDMEEKTKRESS 246
++ ++ K E
Sbjct: 279 DLDKLEILKEKKDEEL 294
Score = 31.5 bits (72), Expect = 0.95
Identities = 11/72 (15%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
K +K+ +T +++E+ ++ + + K K +++ +++ +E+++ + ++ +
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETK-GTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDK 282
Query: 168 EKEFKMKEDKEK 179
+ K K+D+E
Sbjct: 283 LEILKEKKDEEL 294
Score = 30.4 bits (69), Expect = 2.1
Identities = 17/77 (22%), Positives = 36/77 (46%), Gaps = 1/77 (1%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEEKEKKEEK 163
KK K + ++ ++ + E + K K + + + E+D +EK ++ E ++
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 164 EKKEEKEFKMKEDKEKP 180
E +EK+ + EKP
Sbjct: 284 EILKEKKDEELFWFEKP 300
Score = 30.4 bits (69), Expect = 2.5
Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 163 KEKKEEKEFKMKED-KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
EK F + E K K + + E+K K ++ K +++K K K++EKER
Sbjct: 623 IEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680
Score = 28.8 bits (65), Expect = 7.3
Identities = 16/51 (31%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
S D+ E+ K K I + +KK++ KI ++K K + K ++ +E+KE
Sbjct: 630 SFDISELMKIKQI--FRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
Score = 28.4 bits (64), Expect = 9.0
Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 120 KEKEKIKKKKDKKDKDKIKN--KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
K K+ + +KK + KI + +EK K++++KEK K+EEKE+ E E +
Sbjct: 638 KIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERIEIAETWDTIPL 691
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 45.2 bits (107), Expect = 6e-05
Identities = 36/117 (30%), Positives = 61/117 (52%), Gaps = 12/117 (10%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
E+ K + K++ + + K EK+ ++K + EK KE+++ K+E E +M+E KE
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKE-- 557
Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKE------KLKKKKKEKERSSHEKEVIPKLTF 231
R+ K E EK++++ K LKKE +LK+KK K + E + KL
Sbjct: 558 ----RERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKE 610
Score = 41.3 bits (97), Expect = 0.001
Identities = 35/126 (27%), Positives = 62/126 (49%), Gaps = 16/126 (12%)
Query: 107 DKSIKKDKKTHKKKEKEK----IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE------- 155
I + KT + KE+ I+K + + + KN+ +K KE+EK KKE
Sbjct: 495 PHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEE 554
Query: 156 EKEKKEEKEKKEEKEF-----KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
KE++ K+ + EKE +K++ E + ++++++ KE KS + KLK+ K K
Sbjct: 555 LKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQK 614
Query: 211 KKKKEK 216
+K
Sbjct: 615 IPQKPT 620
Score = 39.0 bits (91), Expect = 0.005
Identities = 34/113 (30%), Positives = 56/113 (49%), Gaps = 16/113 (14%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKK-----KDKKDKDKIKNKEKDKEKKEKEKGK 153
E L ++K+ K + KE+EK+KK+ ++ K++++ K E +KE +E K
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKAL 577
Query: 154 KEEKEK--KEEKEKKEEKEF---------KMKEDKEKPVEKIRKEEKDSEKEK 195
K+E E +E KEKK K K+KE K+K +K + D +K
Sbjct: 578 KKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDK 630
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 45.5 bits (108), Expect = 6e-05
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 10/98 (10%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM-KEDKEK 179
E + KK ++K K I + EK K + E +EEKE + E + KE K
Sbjct: 1494 GNENVNKKINQKKKGFIPSNEK--------KSIEIENRNQEEKEPAGQGELESDKEKKGN 1545
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+ +EK+ E++ D KK K +K K E E
Sbjct: 1546 LESVLSNQEKNIEEDYAESDI-KKRKNKKQYKSNTEAE 1582
Score = 45.1 bits (107), Expect = 8e-05
Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 4/100 (4%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKE---EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
N+ +K+ +K+KG EKK E +EEKE + + E EK E +
Sbjct: 1494 GNENVNKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQ 1553
Query: 195 KKSKDKD-KKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
+K+ ++D + +K K KK+ K + E + K +F
Sbjct: 1554 EKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLRF 1593
Score = 38.2 bits (89), Expect = 0.009
Identities = 21/77 (27%), Positives = 34/77 (44%), Gaps = 13/77 (16%)
Query: 105 KKDKSIKKDKKTH------KKKEKEKIKKKKDKKDKDKIK-------NKEKDKEKKEKEK 151
KK I ++K ++EKE + + + DK+K N+EK+ E+ E
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564
Query: 152 GKKEEKEKKEEKEKKEE 168
K+ K KK+ K E
Sbjct: 1565 DIKKRKNKKQYKSNTEA 1581
Score = 33.5 bits (77), Expect = 0.27
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 3/46 (6%)
Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
K + K D ++ K K K+KKEK KKEE+ K+EEK + E E
Sbjct: 729 NGKDAEFKISDSVEEKTKKKKKKEK---KKEEEYKREEKARIEIAE 771
Score = 32.0 bits (73), Expect = 0.91
Identities = 14/53 (26%), Positives = 27/53 (50%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
GE++ DK K + ++ +++ I++ + D K KNK++ K E E
Sbjct: 1530 PAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAE 1582
Score = 30.1 bits (68), Expect = 3.4
Identities = 15/42 (35%), Positives = 24/42 (57%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K + K ++ K+K KEKK++E+ K+EEK + E E
Sbjct: 730 GKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
Score = 29.3 bits (66), Expect = 5.2
Identities = 15/37 (40%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
KD E K + +E+ +KK++KEKK+E+E+K +E
Sbjct: 730 GKDAEFKISDS-VEEKTKKKKKKEKKKEEEYKREEKA 765
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 44.7 bits (106), Expect = 9e-05
Identities = 35/117 (29%), Positives = 69/117 (58%), Gaps = 11/117 (9%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK-EEKEKKEEKE 170
K++ +KE E ++ K +K ++KI+ E+ E+ +KE + EEK K+ +E ++K E+
Sbjct: 237 KEEIEELEKELESLEGSK-RKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEY 295
Query: 171 FKMKEDKEKPVEKIRKEEKDSEK---------EKKSKDKDKKLKKEKLKKKKKEKER 218
K+ E E+ ++++R+ EK + E+ + ++K+ + E+LKKK KE E+
Sbjct: 296 IKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEK 352
Score = 42.7 bits (101), Expect = 4e-04
Identities = 29/137 (21%), Positives = 75/137 (54%), Gaps = 3/137 (2%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
++L E+ ++ + + ++E EK++K+ + ++ K + +E +KE + E K++ +E
Sbjct: 200 KELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEE 259
Query: 159 KKEEKEK---KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
K E E+ + +KE + E+K K +++++++ ++ K + ++ +E K+ +
Sbjct: 260 KIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRL 319
Query: 216 KERSSHEKEVIPKLTFK 232
+E + +E I +L K
Sbjct: 320 EEEINGIEERIKELEEK 336
Score = 40.0 bits (94), Expect = 0.002
Identities = 38/142 (26%), Positives = 78/142 (54%), Gaps = 13/142 (9%)
Query: 96 TSSEDLGEIKK--DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK---- 149
+ ++LGE+ K + I++ +K K+ E + K+ +K+ +++ + + +
Sbjct: 162 NAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELRE 221
Query: 150 -----EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK-EEKDSEKEKKSKDKDKK 203
EK KE +E KEE E+ E++ ++ K K EKIR+ EE+ E +K+ ++ ++K
Sbjct: 222 ELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281
Query: 204 LKK-EKLKKKKKEKERSSHEKE 224
+K+ ++LK+K +E + S E
Sbjct: 282 VKELKELKEKAEEYIKLSEFYE 303
Score = 39.3 bits (92), Expect = 0.004
Identities = 27/98 (27%), Positives = 54/98 (55%), Gaps = 1/98 (1%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
+E+IK+ ++K+++ + K+ + +K E+ ++ + +E K KKEE E K
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP 386
Query: 182 EKIRKEEKDSEKEKKS-KDKDKKLKKEKLKKKKKEKER 218
EK+ KE ++ EK K+ +++ K+ + KK+ KE
Sbjct: 387 EKLEKELEELEKAKEEIEEEISKITARIGELKKEIKEL 424
Score = 39.3 bits (92), Expect = 0.004
Identities = 40/171 (23%), Positives = 86/171 (50%), Gaps = 15/171 (8%)
Query: 65 KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 124
E+ K +EY K+K+ + + + ++ K E L E+KK + + K ++E +
Sbjct: 517 LEELEKKAEEYEKLKEKLIKLKGEIKSLKK--ELEKLEELKKKLAELEKKLDELEEELAE 574
Query: 125 IKKKKDKK-------DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE----FKM 173
+ K+ ++ ++++K E + + K ++E E++E++ KK E+E F+
Sbjct: 575 LLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEE 634
Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ EK +E++RKE + E EKK +++ + +E+ + +E E E
Sbjct: 635 LAETEKRLEELRKELE--ELEKKYSEEEYEELREEYLELSRELAGLRAELE 683
Score = 38.5 bits (90), Expect = 0.007
Identities = 33/151 (21%), Positives = 75/151 (49%), Gaps = 2/151 (1%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E + E+++ K + +++ +++K+ K+K ++ ++ ++ E + +K
Sbjct: 259 EKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKR-LS 317
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ EE+ E+ K E+KE+ +E+++K+ K+ EK + ++ +L E+ K KK+E ER
Sbjct: 318 RLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHEL-YEEAKAKKEELER 376
Query: 219 SSHEKEVIPKLTFKFGTDMEEKTKRESSPKI 249
+ + + EK K E +I
Sbjct: 377 LKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407
Score = 38.1 bits (89), Expect = 0.009
Identities = 39/137 (28%), Positives = 79/137 (57%), Gaps = 3/137 (2%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
K I+K+ K ++KE+ K++K+ + +K K E K K+ E+ K+ E++ K+ ++
Sbjct: 462 KRIEKELKEIEEKER-KLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEEL 520
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
EK+ + E ++ + K++ E K +KE +K ++ KKL + + K + E+E + KE +
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE-L 579
Query: 227 PKLTFKFGTDMEEKTKR 243
+L F+ ++EE+ K
Sbjct: 580 EELGFESVEELEERLKE 596
Score = 38.1 bits (89), Expect = 0.010
Identities = 34/157 (21%), Positives = 79/157 (50%), Gaps = 2/157 (1%)
Query: 59 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTS--SEDLGEIKKDKSIKKDKKT 116
+EK+ ++ K +EY K+ + + I K S E++ I++ ++K+
Sbjct: 279 EEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEE 338
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
++ K+K+K+ + + ++ + +++ ++ K +KE+ ++ +K +K EKE + E
Sbjct: 339 RLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEK 398
Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
++ +E+ + E K + K+ K E+LKK K
Sbjct: 399 AKEEIEEEISKITARIGELKKEIKELKKAIEELKKAK 435
Score = 36.2 bits (84), Expect = 0.039
Identities = 25/115 (21%), Positives = 66/115 (57%), Gaps = 4/115 (3%)
Query: 107 DKSIKKDKKTHKKKE--KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
DK+ ++ +T K+ E ++++++ + K +++ + ++ + +E + ++ EK
Sbjct: 629 DKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKR 688
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
++E K K E ++ +E+ K +K+ EK +K+ ++ ++L+++ K K KER+
Sbjct: 689 REEIK--KTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKERA 741
Score = 35.4 bits (82), Expect = 0.065
Identities = 39/160 (24%), Positives = 76/160 (47%), Gaps = 6/160 (3%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD-KDKIKNKEKDKEK-----KEKEKGKKEE 156
EI++ + K+ K K+K +E IK + ++ D+++ EK + E+ KE
Sbjct: 274 EIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKEL 333
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
+EK+E E+ ++K ++++ E+ E+ E+ K+++ + K+L +K +KE
Sbjct: 334 EEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKEL 393
Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
E KE I + K + E K K I+ +K
Sbjct: 394 EELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKK 433
Score = 34.7 bits (80), Expect = 0.12
Identities = 42/166 (25%), Positives = 81/166 (48%), Gaps = 26/166 (15%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK----EKDKEKKEKEKGKK 154
+ L E+++ + ++ K KK+E E++KK+ +K++ + EK KE+ E+E K
Sbjct: 352 KRLEELEERHELYEEAKA-KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKI 410
Query: 155 EEKEKKEEKEKKEEKEFKMK-----------------EDKEKPVEKIRKEEKDSEKEKKS 197
+ + +KE KE K+ + E +++ +E+ E K EKE K
Sbjct: 411 TARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKE 470
Query: 198 -KDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 242
++K++KL+KE ++ EK + + K + ++EEK K
Sbjct: 471 IEEKERKLRKE---LRELEKVLKKESELIKLKELAEQLKELEEKLK 513
Score = 32.3 bits (74), Expect = 0.67
Identities = 30/114 (26%), Positives = 58/114 (50%), Gaps = 11/114 (9%)
Query: 140 KEKDKEKKEKEKG-KKEEKEKKEEKEKKEEKEFKMKE---------DKEKPVEKIRKEEK 189
KE + + EK K+ E ++ KEK++E E ++E + + +EK+ KE K
Sbjct: 172 KEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231
Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKR 243
+ E+ K+ ++ +K + K+K +E+ +E I +L + ++EEK K
Sbjct: 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEI-EELEEKVKE 284
Score = 29.3 bits (66), Expect = 5.0
Identities = 42/173 (24%), Positives = 81/173 (46%), Gaps = 14/173 (8%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEK--EKIKKKKDKKD--KDKIKNKEKDKEKKEKEKGK 153
+ L + +K K+ ++ K KE+ E+I K + K +IK +K E+ +K KGK
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGK 437
Query: 154 ----------KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+ KE EE + ++ K ++ E+ K+RKE ++ EK K + + K
Sbjct: 438 CPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK 497
Query: 204 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
LK+ + K+ E++ + E + K ++ E+ K + K + K ++
Sbjct: 498 LKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEK 550
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 44.0 bits (104), Expect = 1e-04
Identities = 23/113 (20%), Positives = 53/113 (46%), Gaps = 1/113 (0%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+K ++++ K++++ + ++K +++++K EK++ +++K + EE K+ +
Sbjct: 71 QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+K+ +E K K E K + K + K K E KK E
Sbjct: 131 QKQAEEAAAKA-AAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182
Score = 40.9 bits (96), Expect = 0.001
Identities = 25/103 (24%), Positives = 45/103 (43%), Gaps = 3/103 (2%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE---KEKKEEKEKKEEKEFKMK 174
K+ EKE++ ++ KK ++ + K+K+ +E K K K E + K+ K
Sbjct: 104 KQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKA 163
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+ K + +K + + KK + + K KKK E E
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAE 206
Score = 40.2 bits (94), Expect = 0.001
Identities = 29/124 (23%), Positives = 48/124 (38%), Gaps = 11/124 (8%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD------KEKKEKEKGKKEE 156
E KK + K+K+ E+ K K K + + K K E +K + E
Sbjct: 115 EQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAE 174
Query: 157 KEKKEEKEKKEEKE----FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
KK E K++ E K + +K E K++ +E +KK+ + K K +
Sbjct: 175 AAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKA-AAEAKAAAAKAAAE 233
Query: 213 KKEK 216
K
Sbjct: 234 AKAA 237
Score = 37.5 bits (87), Expect = 0.013
Identities = 24/131 (18%), Positives = 46/131 (35%), Gaps = 11/131 (8%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-----------DKIKNKEKDKEKKEKEKGK 153
K+ +K + K K K + + K + K E + KK + K
Sbjct: 125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAK 184
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
K+ + + K E K+ E K+K + +K+ K +K + +
Sbjct: 185 KKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAA 244
Query: 214 KEKERSSHEKE 224
K E+++ K
Sbjct: 245 KAAEKAAAAKA 255
Score = 36.7 bits (85), Expect = 0.020
Identities = 21/100 (21%), Positives = 47/100 (47%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
+++ K +++ KK + + + + K+ E+E+ K+ EKE+ +E+K++ E K+
Sbjct: 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL 129
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
K + K + K + + K KK E ++
Sbjct: 130 KQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169
Score = 36.3 bits (84), Expect = 0.030
Identities = 19/113 (16%), Positives = 62/113 (54%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
+ + ++K ++ E+++ KKE+++ +E ++K+ ++ ++K+ +++ + ++++ E K
Sbjct: 66 RQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
++ K K+ ++ K K ++ E + K + ++K + E++ K
Sbjct: 126 QAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKK 178
Score = 35.2 bits (81), Expect = 0.069
Identities = 23/113 (20%), Positives = 44/113 (38%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
K K+ + K+ +K + KK + + K + K+K E E K E K++
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK 202
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E + +K+ + K+ E K + + K + +K K + K
Sbjct: 203 AEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
Score = 33.6 bits (77), Expect = 0.18
Identities = 23/92 (25%), Positives = 35/92 (38%), Gaps = 3/92 (3%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK---EEKEKKE 167
KK + KK + KKK + + K + K K + E +K E +KK E K
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAA 228
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
+ + K EK E+ + K D
Sbjct: 229 KAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260
Score = 32.9 bits (75), Expect = 0.36
Identities = 23/120 (19%), Positives = 57/120 (47%), Gaps = 11/120 (9%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE------EKEFKM 173
K E+ +KKK+++ ++++ K+ ++++ K+ K+ ++++K+ +E K+ +
Sbjct: 75 KRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQA 134
Query: 174 KEDKEKPVEKIRKEEKDSEKE-----KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
+E K + + + K KK+ + KK + + KK + + E E K
Sbjct: 135 EEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAK 194
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 43.9 bits (104), Expect = 1e-04
Identities = 35/147 (23%), Positives = 69/147 (46%), Gaps = 11/147 (7%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
S E+ E K++ + K + EKE KK K++K++ K K++ E++E E ++ +
Sbjct: 407 SDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAK 466
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVE-----------KIRKEEKDSEKEKKSKDKDKKLK 205
EK K K ++ + +E++E+ E +++DS+K+ SK K
Sbjct: 467 VEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANK 526
Query: 206 KEKLKKKKKEKERSSHEKEVIPKLTFK 232
K K K+K++ ++ L +
Sbjct: 527 ISKAAVKVKKKKKKEKSIDLDDDLIDE 553
Score = 41.2 bits (97), Expect = 0.001
Identities = 38/154 (24%), Positives = 78/154 (50%), Gaps = 4/154 (2%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKGKKEEK 157
++ E++++ +++ + +E K + K ++ K E K KKE + KE+K
Sbjct: 392 AEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKK 451
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPV---EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
E EE+E ++E+E K+++ K + EK +KEE++ E ++++ K KK+
Sbjct: 452 ESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQD 511
Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
K++SS + + K +++K K+E S
Sbjct: 512 SKKKSSSKLDKAANKISKAAVKVKKKKKKEKSID 545
Score = 38.5 bits (90), Expect = 0.006
Identities = 31/156 (19%), Positives = 67/156 (42%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
++E+ ++ K + E +++ E +K KKE K + +EK++ +E+E E++ K
Sbjct: 408 DEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKV 467
Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEK 240
+ K K SEK +K +++++ ++ K +S+ +++ K + K +
Sbjct: 468 EKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKI 527
Query: 241 TKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV 276
+K K K KS +E
Sbjct: 528 SKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDD 563
Score = 36.2 bits (84), Expect = 0.032
Identities = 38/154 (24%), Positives = 75/154 (48%), Gaps = 7/154 (4%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE--KDKEKKEKEKGKK 154
S E+ + +D ++ +KK K+K+ +D ++ + + + + E ++KE+
Sbjct: 334 SEEEDEDDDEDDDDGENPWMLRKKL-GKLKEGEDDEENSGLLSMKFMQRAEARKKEENDA 392
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
E +E + E E +EE + +E+ E+P +K K + + + + KKLKKE K + K
Sbjct: 393 EIEELRRELEGEEESD---EEENEEPSKKNVGRRKFGPENGEKEAESKKLKKEN-KNEFK 448
Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
EK+ S E+E+ + K + KR +
Sbjct: 449 EKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQ 482
Score = 35.0 bits (81), Expect = 0.092
Identities = 23/126 (18%), Positives = 69/126 (54%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E+ + K +++ + K++ +I++ + + + ++ ++E+++E +K G+++
Sbjct: 368 EENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ EKE + +K K +++ K ++ +EE+ ++E+ +K ++ +K +KE+E
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487
Query: 219 SSHEKE 224
++E
Sbjct: 488 EELDEE 493
Score = 33.5 bits (77), Expect = 0.25
Identities = 30/159 (18%), Positives = 55/159 (34%), Gaps = 4/159 (2%)
Query: 47 KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 106
+ K P++ E K++ + KE + + E + KV + K
Sbjct: 422 RRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKA 481
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK- 165
K ++++ + + K K K K K K K + K++K+K
Sbjct: 482 QKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKE 541
Query: 166 ---KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
+ + +ED K ++E D E K KD
Sbjct: 542 KSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKD 580
Score = 32.7 bits (75), Expect = 0.45
Identities = 30/132 (22%), Positives = 65/132 (49%), Gaps = 5/132 (3%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
K+ ED E S+K ++ +K++E + ++ + +++ +E+ E++ +E K
Sbjct: 360 KLKEGEDDEENSGLLSMKFMQRAEARKKEEN--DAEIEELRRELEGEEESDEEENEEPSK 417
Query: 154 KEEKEKK---EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
K +K E EK+ E + KE+K + EK +E++ ++++ +K K +
Sbjct: 418 KNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKR 477
Query: 211 KKKKEKERSSHE 222
+K +KE E
Sbjct: 478 SEKAQKEEEEEE 489
Score = 31.6 bits (72), Expect = 0.88
Identities = 22/136 (16%), Positives = 55/136 (40%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+++ + KK+ +++ + ++ + EK+ K K+ + K +E+++ +E K
Sbjct: 442 ENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTS 501
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
K K++ +K+ K DK + +K+K+ ++ K +
Sbjct: 502 SVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDV 561
Query: 217 ERSSHEKEVIPKLTFK 232
+ E + FK
Sbjct: 562 DDEEDEDDEELPFLFK 577
Score = 30.4 bits (69), Expect = 2.1
Identities = 38/188 (20%), Positives = 69/188 (36%), Gaps = 27/188 (14%)
Query: 45 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 104
+ +K +E+ ++ +P K S V + K+ + +
Sbjct: 470 VANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISK 529
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--------- 155
K KK KK + + ++D D +++D E+ +K+
Sbjct: 530 AAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKDLIKEAFAGD 589
Query: 156 ------EKEKKEEKEKKEEKEFKM------------KEDKEKPVEKIRKEEKDSEKEKKS 197
EKEKKE E+++ KE + + ++K ++ R+ E KK
Sbjct: 590 DVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGVKKE 649
Query: 198 KDKDKKLK 205
K KDKKLK
Sbjct: 650 KRKDKKLK 657
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 43.1 bits (101), Expect = 3e-04
Identities = 29/139 (20%), Positives = 63/139 (45%), Gaps = 13/139 (9%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK-----EKGK 153
E++ +I +D S ++K + +E + + + K ++ N E D KE E
Sbjct: 3890 EEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKD 3949
Query: 154 KEEKEKKEEKE----KKEEKEFKMKEDKEKPVEKIRK----EEKDSEKEKKSKDKDKKLK 205
++EKE +EE +E + ++E+ +P + E+ ++++ KD L+
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009
Query: 206 KEKLKKKKKEKERSSHEKE 224
++ + KE + EK+
Sbjct: 4010 DMDMEAADENKEEADAEKD 4028
Score = 33.4 bits (76), Expect = 0.28
Identities = 21/111 (18%), Positives = 55/111 (49%), Gaps = 4/111 (3%)
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK--EKKEKEKGKKEEK 157
D+ + KD + +++ + ++E + + ++ + +D + N +K ++ +E + E+
Sbjct: 3863 DMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPED-LSNSLNEKLWDEPNEEDLLETEQ 3921
Query: 158 EKKEEKEKKEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
+ E+ E + KED K +K R+E++D E+ D +++ +
Sbjct: 3922 KSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPD 3972
Score = 31.1 bits (70), Expect = 1.5
Identities = 35/186 (18%), Positives = 77/186 (41%), Gaps = 10/186 (5%)
Query: 44 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDL-- 101
SK D + +DK +EK +E + + D++ + P + EDL
Sbjct: 3937 SKEDDNKALEDKDR-QEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKL 3995
Query: 102 ----GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK---DKEKKEKEKGKK 154
G++ KD ++ + KE+ +KD+ +D+ +E D++ ++ +
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDL 4055
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
E ++K ++ EE + +E E V+ + E+ E ++ D K+ + +
Sbjct: 4056 AEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFASAE 4115
Query: 215 EKERSS 220
E ++
Sbjct: 4116 ADEENT 4121
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 42.7 bits (101), Expect = 3e-04
Identities = 32/132 (24%), Positives = 58/132 (43%), Gaps = 6/132 (4%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-K 160
GEIK ++ ++ + + +E K+ K ++K + E +K+++E EK K + +K
Sbjct: 78 GEIKGVFTLGEENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAW 137
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
++ KK + E + K +EK E + + L E+L K K K
Sbjct: 138 KKLAKKYDSNL--SEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEEL--KAKIKTLF 193
Query: 220 SHEKEVIPKLTF 231
S K + LT
Sbjct: 194 SSNKPELALLTL 205
Score = 42.3 bits (100), Expect = 4e-04
Identities = 26/123 (21%), Positives = 52/123 (42%), Gaps = 4/123 (3%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+N E + + +E +K K+ +EK E+ E + EK+ + E + ++ + +
Sbjct: 89 ENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNL 148
Query: 198 KDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKR-ESSPKIVIKPVKS 256
+ K L +K K+K KE S L+ + +++ K K SS K + +
Sbjct: 149 SEALKGLNYKKNFKEKLLKELKSVILNASSLLSLE---ELKAKIKTLFSSNKPELALLTL 205
Query: 257 PSP 259
Sbjct: 206 SVI 208
Score = 32.3 bits (74), Expect = 0.65
Identities = 28/140 (20%), Positives = 50/140 (35%), Gaps = 5/140 (3%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKEK 149
NI E+L + K K ++ + ++KE+ +K K DK K K +
Sbjct: 90 NIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLS 149
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
E K +K +++ +E + +E K K S +K +
Sbjct: 150 EALKGLNYKKNFKEKLLKELK---SVILNASSLLSLEELKAKIKTLFSSNKPELALLTLS 206
Query: 210 KKKKKEKERSSH-EKEVIPK 228
E E++ EK +I
Sbjct: 207 VIDFDEIEQAEILEKSIIGS 226
Score = 31.9 bits (73), Expect = 0.70
Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 4/94 (4%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDK 202
+E EK K+EK K ++K ++KED + ++ + EK + EK+ K +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVA-ELKEDIDAYQKEKKGLEKAINSLEKEIKQLEA 436
Query: 203 KLKKEKLKKKKKEKERSSHEK--EVIPKLTFKFG 234
++K + + K+ EK+ ++ E + I KL +G
Sbjct: 437 EIKALEKEIKELEKQLTNIEPTADEINKLLKAYG 470
Score = 30.4 bits (69), Expect = 2.1
Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
Query: 123 EKIKK-----KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
E I++ KK+K+K K K E ++ +++K+ EK K +
Sbjct: 375 ELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQL 434
Query: 178 EKPVEKIRKEEKDSEKEKKS 197
E ++ + KE K+ EK+ +
Sbjct: 435 EAEIKALEKEIKELEKQLTN 454
Score = 29.6 bits (67), Expect = 4.3
Identities = 10/37 (27%), Positives = 19/37 (51%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
+ KK ++ I + + + + + K +KE KE EK
Sbjct: 414 QKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEK 450
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 41.2 bits (97), Expect = 4e-04
Identities = 26/84 (30%), Positives = 55/84 (65%), Gaps = 3/84 (3%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
KK K + K++ K+ ++++E E+ ++EE++K EEK + E KE +E+ E+ EK ++
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKE---EEELEEEREKKKE 57
Query: 187 EEKDSEKEKKSKDKDKKLKKEKLK 210
EE+ E+E++++ + ++ +K K
Sbjct: 58 EEERKEREEQARKEQEEYEKLKSS 81
Score = 38.1 bits (89), Expect = 0.003
Identities = 24/81 (29%), Positives = 52/81 (64%), Gaps = 8/81 (9%)
Query: 118 KKKEKEKIKKKKDKKDKD-KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
+ K +EK +++ ++ ++ + + ++K +EK+E E+ ++EE E++ EK+K+EE+ + +E
Sbjct: 8 RAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67
Query: 177 KEKPVEKIRKEEKDSEKEKKS 197
RKE+++ EK K S
Sbjct: 68 A-------RKEQEEYEKLKSS 81
Score = 30.1 bits (68), Expect = 1.5
Identities = 16/70 (22%), Positives = 44/70 (62%), Gaps = 6/70 (8%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE----KKEEK 169
KK +K+E E+ K++++ ++++ K KE+++ K+ +E+ +KE++E ++ K ++E
Sbjct: 32 KKLEEKREGER--KEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGT 89
Query: 170 EFKMKEDKEK 179
+ +++
Sbjct: 90 DKLSADEESN 99
Score = 29.7 bits (67), Expect = 2.2
Identities = 17/75 (22%), Positives = 48/75 (64%), Gaps = 1/75 (1%)
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-KEKKSKDKDKKLKKEKLK 210
K K++ + +EK+ ++ + E++E+ K +E+++ E KE++ +++++ KKE+ +
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Query: 211 KKKKEKERSSHEKEV 225
+K++E++ ++E
Sbjct: 61 RKEREEQARKEQEEY 75
>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
type) [DNA replication, recombination, and repair].
Length = 1444
Score = 42.7 bits (101), Expect = 4e-04
Identities = 26/116 (22%), Positives = 48/116 (41%), Gaps = 6/116 (5%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
KS+ K K I++ + + D KNK + +KK + G + + E + E
Sbjct: 113 KSLLNKLKLKVKGNNILIEQVLNNPEFDHFKNKSPELQKKLQSFGFPQLLIEFEVNDISE 172
Query: 168 EKEFKMK------EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E+EF+ E ++ E + E+K + K + +K +K K +E
Sbjct: 173 EQEFEKFEEAINEEVEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGRKIKSTEE 228
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 39.9 bits (94), Expect = 4e-04
Identities = 23/92 (25%), Positives = 53/92 (57%), Gaps = 7/92 (7%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE------KE 158
K+ + K ++ +K +++++K K+K KD E +EKKEKE KK + ++
Sbjct: 21 KQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQK 80
Query: 159 KKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEK 189
+++ +K++++E + DK K ++++ K++
Sbjct: 81 LQQDLQKRQQEELQKILDKINKAIKEVAKKKG 112
Score = 29.5 bits (67), Expect = 1.8
Identities = 14/79 (17%), Positives = 48/79 (60%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
K K EK+ +K++ E K E++ +K +++ +++ + +EK ++++K+ ++ ++
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQR 76
Query: 194 EKKSKDKDKKLKKEKLKKK 212
+++ +D + ++++ +K
Sbjct: 77 KQQKLQQDLQKRQQEELQK 95
Score = 29.1 bits (66), Expect = 2.7
Identities = 19/77 (24%), Positives = 48/77 (62%), Gaps = 8/77 (10%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKD----KEKKEKEKGK--KEEKEKKEEKEKKEEKEF- 171
K +++++K+ K+ + ++ EK+ KEK +K+ + +EKKE++ +K+ +EF
Sbjct: 17 KAAQKQLEKEFKKRQAE-LEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQ 75
Query: 172 KMKEDKEKPVEKIRKEE 188
+ ++ ++ ++K ++EE
Sbjct: 76 RKQQKLQQDLQKRQQEE 92
Score = 27.9 bits (63), Expect = 5.6
Identities = 18/73 (24%), Positives = 43/73 (58%), Gaps = 6/73 (8%)
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
K +++ +KE K+++ E EK E++ K+KE +K SE ++ K+K+ + K
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQK------DAATLSEAAREKKEKELQKK 70
Query: 206 KEKLKKKKKEKER 218
++ ++K+++ ++
Sbjct: 71 VQEFQRKQQKLQQ 83
Score = 27.5 bits (62), Expect = 7.5
Identities = 17/72 (23%), Positives = 40/72 (55%), Gaps = 7/72 (9%)
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK-LKKK 212
K +++ E++ KK + E + E ++++K ++ +K+ + + + KKEK L+KK
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLE------KELQKLKEKLQKDAATLSEAAREKKEKELQKK 70
Query: 213 KKEKERSSHEKE 224
+E +R + +
Sbjct: 71 VQEFQRKQQKLQ 82
>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031). This
protein is expressed in Plasmodium; its function is
unknown. It may be the product of gene family pyst-b.
Length = 228
Score = 41.2 bits (97), Expect = 5e-04
Identities = 32/107 (29%), Positives = 48/107 (44%), Gaps = 15/107 (14%)
Query: 113 DKKTHK-----KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
DKKT K +KE E++KK+ D K ++ + ++ K K E E
Sbjct: 107 DKKTKKLINELRKELEEVKKELDNKRNGELAIQPIQDKRIIK---------KDENSSVSE 157
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
++FK E++E +E E S KK K +KLKK + K K
Sbjct: 158 HEDFKQLENEENFLETEYNEITSSNNYKKLKIN-RKLKKAEKKLIKS 203
Score = 28.1 bits (63), Expect = 7.8
Identities = 21/83 (25%), Positives = 38/83 (45%), Gaps = 9/83 (10%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK-------KDKDKIKNKEKDKEKKEKE--K 151
L E+KK+ K++ + + ++K KKD+ +D +++N+E E + E
Sbjct: 121 LEEVKKELDNKRNGELAIQPIQDKRIIKKDENSSVSEHEDFKQLENEENFLETEYNEITS 180
Query: 152 GKKEEKEKKEEKEKKEEKEFKMK 174
+K K K KK EK+
Sbjct: 181 SNNYKKLKINRKLKKAEKKLIKS 203
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 41.0 bits (96), Expect = 5e-04
Identities = 29/124 (23%), Positives = 62/124 (50%), Gaps = 9/124 (7%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE------ 156
+ +K K + + +E K K KK KK +I ++ +E ++ +E
Sbjct: 79 TFENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLL 138
Query: 157 ---KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
+E+ EE + K +++ + E+ EK +E++ E+ EK + K+K K+L++ K + ++
Sbjct: 139 YLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEE 198
Query: 214 KEKE 217
+E
Sbjct: 199 LLEE 202
Score = 39.5 bits (92), Expect = 0.001
Identities = 30/141 (21%), Positives = 62/141 (43%), Gaps = 12/141 (8%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
S+ G +K D I+KD+ K + + K K I+ + +KK K+ K
Sbjct: 52 KSKGRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSIL 111
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK----------- 205
+ + ++ ++ E + + P+ +E+ E + K K+K ++L+
Sbjct: 112 EIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELED 171
Query: 206 -KEKLKKKKKEKERSSHEKEV 225
K+ L+K +EKE+ +E+
Sbjct: 172 EKDLLEKLLEEKEKKKELEEL 192
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 41.8 bits (99), Expect = 5e-04
Identities = 38/157 (24%), Positives = 78/157 (49%), Gaps = 33/157 (21%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK---GKKE--------EKEKKEEKE--K 165
K+ EK ++ K I+ +K+ E +KE K+E E+E KE + +
Sbjct: 20 KRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQ 79
Query: 166 KEEKEFKMKEDK-EKPVEKIRKEEKDSEK-EKKSKDKDKKLKK------EKLKKKKKEKE 217
+ E+ +E+ ++ +E + K+E++ EK EK+ +K+K L + E + ++++E E
Sbjct: 80 RLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELE 139
Query: 218 RSS-----HEKEVIPKLTFKFGTDMEEKTKRESSPKI 249
R S KE++ + ++EE+ + E++ I
Sbjct: 140 RISGLTQEEAKEILLE-------EVEEEARHEAAKLI 169
Score = 41.8 bits (99), Expect = 6e-04
Identities = 40/157 (25%), Positives = 87/157 (55%), Gaps = 24/157 (15%)
Query: 103 EIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG-----KKEE 156
E KK+ +++KK+ K+E K++ + +++ K++ +N+ + E++ ++ K E
Sbjct: 40 EAKKEAETLKKEALLEAKEEVHKLRAELERELKER-RNELQRLERRLLQREETLDRKMES 98
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK---LKKEKLKKK- 212
+KKEE +K+EKE +KEK +++ KEE+ E + +++ ++ L +E+ K+
Sbjct: 99 LDKKEENLEKKEKEL---SNKEKNLDE--KEEELEELIAEQREELERISGLTQEEAKEIL 153
Query: 213 -KKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
++ +E + HE + K ++EE+ K E+ K
Sbjct: 154 LEEVEEEARHEAAKLIK-------EIEEEAKEEADKK 183
Score = 28.4 bits (64), Expect = 8.2
Identities = 28/108 (25%), Positives = 62/108 (57%), Gaps = 7/108 (6%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK-- 151
++ E+ + K + KK++ +KKEKE K+K+ +K++ + ++++E E+
Sbjct: 84 RLLQREETLDRKMESLDKKEENL-EKKEKELSNKEKNLDEKEEELEELIAEQREELERIS 142
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
G +E+ K+ E+ EE + + + K +++I +E K+ E +KK+K+
Sbjct: 143 GLTQEEAKEILLEEVEE---EARHEAAKLIKEIEEEAKE-EADKKAKE 186
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 40.9 bits (96), Expect = 5e-04
Identities = 20/78 (25%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
+K+ K++ +++ ++ + K+E+K K+E+ E +E ++ ++ ED E+ K E+
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEE--SNEVKIEE 58
Query: 190 DSEKEKKSKDKDKKLKKE 207
++ K K+++KKL+ E
Sbjct: 59 LKDENNKLKEENKKLENE 76
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 40.0 bits (94), Expect = 7e-04
Identities = 26/103 (25%), Positives = 55/103 (53%), Gaps = 2/103 (1%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
++ ++ +KK KD ++K+++K + + + K K+ + + ++ KK+EKE + K
Sbjct: 65 IEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKKKKEKER 218
++K ++ +K+ K K+L K EK +KK KE+
Sbjct: 125 PYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEK 167
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 41.6 bits (98), Expect = 7e-04
Identities = 29/115 (25%), Positives = 52/115 (45%), Gaps = 15/115 (13%)
Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKEEKEKKEE 168
I++ K+ + + ++ K + DK K ++ + KK K+ + ++ E KE
Sbjct: 333 IEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEA 392
Query: 169 -------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
K K + +K +E+IR+E + E KSK K +K KK+ EK
Sbjct: 393 IAYYESAKTALEKAEGKKAIEEIREELIE-EGLLKSKKKKRK------KKEWFEK 440
Score = 31.2 bits (71), Expect = 1.3
Identities = 12/62 (19%), Positives = 32/62 (51%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
+ + S K+ + + K ++K + KK ++I+ + ++ + +K K+++KE E+
Sbjct: 381 NLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Query: 163 KE 164
Sbjct: 441 FR 442
Score = 30.4 bits (69), Expect = 2.3
Identities = 12/66 (18%), Positives = 28/66 (42%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
++ + K+ K + + K E+ +E+ +E K +K+K+++KE
Sbjct: 377 GAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKE 436
Query: 165 KKEEKE 170
E+
Sbjct: 437 WFEKFR 442
Score = 28.9 bits (65), Expect = 6.7
Identities = 14/65 (21%), Positives = 26/65 (40%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+ +L + K EK + KK ++ + +E + K+K++ KKE
Sbjct: 378 AKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEW 437
Query: 157 KEKKE 161
EK
Sbjct: 438 FEKFR 442
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 40.7 bits (95), Expect = 8e-04
Identities = 33/170 (19%), Positives = 72/170 (42%), Gaps = 12/170 (7%)
Query: 59 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 118
KEK +S+ P+E E ++ +L V + + E+K+ +K
Sbjct: 68 KEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEE--------EVKEQLQSLLEKIVVS 119
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
K+E++ K+ + + ++ D ++ E GK E E ++ +E ++ K+
Sbjct: 120 KQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHE----ETEDSYHVEETASEQYKQ 175
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
EK ++E + KE K + K + + + ++ ++E + E K
Sbjct: 176 DMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIK 225
Score = 37.2 bits (86), Expect = 0.010
Identities = 23/125 (18%), Positives = 54/125 (43%), Gaps = 7/125 (5%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
+ ++ + +E+ + K K K EE+ K++ + E+ K++
Sbjct: 71 STSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVS----KQEEDGP 126
Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKR 243
++ + D +K ++D D + + + K +E E S H +E + ++ DM+EK
Sbjct: 127 GKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASE---QYKQDMKEKASE 183
Query: 244 ESSPK 248
+ +
Sbjct: 184 QENED 188
Score = 30.7 bits (69), Expect = 1.2
Identities = 32/175 (18%), Positives = 77/175 (44%), Gaps = 4/175 (2%)
Query: 25 LQHPLIPQPLVHPPGTPPPSKMKDKVSP-KDKTSPKEKMSSKEKTSPKEKEYSKVKDVEL 83
L+ +P V P P ++ + +++ K K +E++ + ++ ++ SK ++
Sbjct: 67 LKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGP 126
Query: 84 GVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKK--THKKKEKEKIKKKKDKKDKDKIKNKE 141
G P + K +ED + ++ K + +E ++ + K+K +E
Sbjct: 127 GKEPQLD-EDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASEQE 185
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
+ K+ EK ++ + E + E+ ++E ED + E++ KE + ++E
Sbjct: 186 NEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQQEVP 240
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 41.5 bits (98), Expect = 8e-04
Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 4/61 (6%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
+E+E ++ K++ K++ EKK + KEE EK K+K E K K +
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAEKKAE----KEEAEKAAAKKKAEAAAKKAKGPDGE 452
Query: 180 P 180
Sbjct: 453 T 453
Score = 39.5 bits (93), Expect = 0.003
Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 10/87 (11%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
+ ++++ + N + KK ++K +K EK+ ++E+ +K +K E K
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAA--------AKKKAEAAAK 444
Query: 187 EEKDSEKEKKSKDKDKKLKKEKLKKKK 213
+ K + E K D D EKL + +
Sbjct: 445 KAKGPDGETKKVDPD--PLGEKLARTE 469
Score = 36.4 bits (85), Expect = 0.029
Identities = 18/62 (29%), Positives = 29/62 (46%)
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
+ E +E++ E +++K +K RK EK +EKE+ K KK + KK K
Sbjct: 390 KPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGP 449
Query: 216 KE 217
Sbjct: 450 DG 451
Score = 33.8 bits (78), Expect = 0.17
Identities = 19/66 (28%), Positives = 30/66 (45%), Gaps = 4/66 (6%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
+ +L ++ K KK +K KK EKE+ +K KK + K K + KK +
Sbjct: 403 NGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAK----KAKGPDGETKKVDP 458
Query: 158 EKKEEK 163
+ EK
Sbjct: 459 DPLGEK 464
Score = 33.0 bits (76), Expect = 0.38
Identities = 14/51 (27%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 119 KKEKEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
E++K++KK + + K + + EK KK+ E K+ K E +K +
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458
Score = 30.3 bits (69), Expect = 2.2
Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 12/59 (20%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
++K +KK ++K +KK EK++ +K K K E KK + E K
Sbjct: 408 PAERKKLRKK-----QRKAEKK-------AEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
Score = 29.9 bits (68), Expect = 2.8
Identities = 14/65 (21%), Positives = 28/65 (43%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
GE ++ ++ KK K++ K +K + ++ K K K + +K K +
Sbjct: 392 LLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451
Query: 158 EKKEE 162
E K+
Sbjct: 452 ETKKV 456
Score = 29.9 bits (68), Expect = 3.0
Identities = 12/49 (24%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
E++E + E+ ++K+ K + KK +KE+ +K +K+ + K
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAE-KKAEKEEAEKAAAKKKAEAAAK 444
Score = 29.5 bits (67), Expect = 4.0
Identities = 16/72 (22%), Positives = 28/72 (38%), Gaps = 3/72 (4%)
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
E E+ + KK + KK +K + K +K+E E+++ +K+
Sbjct: 388 HDKPLLAEGEEEEGENGNLSPAERKKLR---KKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444
Query: 231 FKFGTDMEEKTK 242
G D E K
Sbjct: 445 KAKGPDGETKKV 456
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 41.1 bits (96), Expect = 9e-04
Identities = 41/195 (21%), Positives = 88/195 (45%), Gaps = 17/195 (8%)
Query: 62 MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 121
+ ++ P E+E +LG V I ++ + I++ K +K +
Sbjct: 245 LKLEDNYEPSEQEL------KLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIK 298
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
+ K + K D +K +N ++K +E K EK K E E KEE+ ++ + ++
Sbjct: 299 TLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEK-LKSEIELKEEEIKALQSNIDELH 357
Query: 182 EKIRK--------EEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKERSSHEK-EVIPKLTF 231
+++RK E + E+EK +++ DK ++ +KL K K ++ + + + K
Sbjct: 358 KQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLR 417
Query: 232 KFGTDMEEKTKRESS 246
++ + ++ T+ S
Sbjct: 418 QYDSLIQNITRSRSQ 432
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 39.7 bits (93), Expect = 0.001
Identities = 28/94 (29%), Positives = 52/94 (55%), Gaps = 3/94 (3%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK---KEEKEFKMKEDK 177
+ KKK K+ ++I+ +K+ E+K+K K KK++ +KK++K+K ++K+ K ++
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
EK E ++ S E S + K +K L K
Sbjct: 115 EKEAEDKLEDLTKSYSETLSTLSELKPRKYALHK 148
Score = 39.7 bits (93), Expect = 0.001
Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
K KK +E EK+KK+ ++K K K K+K +K+K K ++K+ ++ +K E+K+
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWK-----WKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
K EDK + + K E + E K +
Sbjct: 116 KEAEDKLEDLTKSYSETLSTLSELKPRK 143
Score = 38.1 bits (89), Expect = 0.004
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 9/86 (10%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
K+K + +E EK KKE EEK+K + K+ K K+ K+K K++KD +K+
Sbjct: 58 TEAKKKKKELAEEIEKVKKE----YEEKQKWKWKKKKSKKKKDKD-----KDKKDDKKDD 108
Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSH 221
KS+ KD+K ++KL+ K +
Sbjct: 109 KSEKKDEKEAEDKLEDLTKSYSETLS 134
Score = 36.6 bits (85), Expect = 0.011
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
D E E +K KKE E+ E+ +K+ E++ K K K+K K++KD +K+KK KD
Sbjct: 54 DAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKK-----SKKKKDKDKDKKDDKKDD 108
Query: 203 KLKKEKLKKKKKEKERSSHEKEVIP 227
K +K+ K+ + + E +
Sbjct: 109 KSEKKDEKEAEDKLEDLTKSYSETL 133
Score = 35.8 bits (83), Expect = 0.019
Identities = 22/62 (35%), Positives = 30/62 (48%), Gaps = 3/62 (4%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
K KK KK+K+K K KKD K DK EK EK+ ++K + K E E +
Sbjct: 84 WKWKKKKSKKKKDKDKDKKDDKKDDK---SEKKDEKEAEDKLEDLTKSYSETLSTLSELK 140
Query: 171 FK 172
+
Sbjct: 141 PR 142
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 41.1 bits (97), Expect = 0.001
Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 1/70 (1%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
+ E K + I + + K EK K KK K K++ +++K K K + + +
Sbjct: 364 IEEPLKARVIDELRPKTKA-PSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNI 422
Query: 161 EEKEKKEEKE 170
++ K
Sbjct: 423 GKRRKPSGTS 432
Score = 40.3 bits (95), Expect = 0.002
Identities = 17/52 (32%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
+KKT K +K K+ + KK+K+K K K K + + K GK+ + E+
Sbjct: 384 SEKKTGKPSKKVLAKRAE-KKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 39.5 bits (93), Expect = 0.003
Identities = 16/52 (30%), Positives = 24/52 (46%)
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+E K K EK K K+ EKK K+K+K K++ + K +R
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
Score = 38.4 bits (90), Expect = 0.006
Identities = 15/45 (33%), Positives = 21/45 (46%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
K K + + K+KEKEK K KK +D I + K E+
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 37.6 bits (88), Expect = 0.012
Identities = 17/78 (21%), Positives = 29/78 (37%), Gaps = 24/78 (30%)
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
K K EK+ GK +K + EKKE ++++K K K
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKE------------------------KEKEKPKVKK 414
Query: 202 KKLKKEKLKKKKKEKERS 219
+ + + K++K S
Sbjct: 415 RHRDTKNIGKRRKPSGTS 432
Score = 36.8 bits (86), Expect = 0.019
Identities = 15/51 (29%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
+K KK K +KK +K+K+K K K++ ++ K K +K +E
Sbjct: 385 EKKTGKPSKKVLAKRAEKK-EKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 36.8 bits (86), Expect = 0.021
Identities = 16/58 (27%), Positives = 23/58 (39%)
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E + K + K K V R E+K+ EKEK K + K K++K
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 33.4 bits (77), Expect = 0.21
Identities = 13/50 (26%), Positives = 25/50 (50%), Gaps = 3/50 (6%)
Query: 177 KEKPVEKIR---KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
K + ++++R K + + K SK K ++K K+K+K K + H
Sbjct: 369 KARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418
Score = 33.0 bits (76), Expect = 0.27
Identities = 15/56 (26%), Positives = 28/56 (50%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
K K ++K K K K ++++++KE+ + K + K + K RK SE+
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 39.8 bits (93), Expect = 0.001
Identities = 17/72 (23%), Positives = 34/72 (47%)
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
E ++ K K+K + + ++K +EK + K E+ K ++ K+EK K KK
Sbjct: 16 LESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKK 75
Query: 214 KEKERSSHEKEV 225
K+ + ++
Sbjct: 76 KKVPLQVNPAQL 87
Score = 39.4 bits (92), Expect = 0.002
Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 4/74 (5%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK----EKEKGKKEEKEKKE 161
+ KDK + K + K +K K IK + ++ KK EK++ KK+ K+KK
Sbjct: 19 EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKV 78
Query: 162 EKEKKEEKEFKMKE 175
+ + F E
Sbjct: 79 PLQVNPAQLFVDDE 92
Score = 38.2 bits (89), Expect = 0.004
Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 9/82 (10%)
Query: 114 KKTHKKKEKEKIKKKKDKK-----DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
K+ + E EK K KDK +K ++K +K K + E +K +EKKEE
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69
Query: 169 KEFKMKEDKEKPVEKIRKEEKD 190
K K+ K+K V +
Sbjct: 70 K----KKPKKKKVPLQVNPAQL 87
Score = 37.5 bits (87), Expect = 0.008
Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK-PVEKIRKEEKDSE 192
K ++ E +K K K+KG E K ++K EK + + + V+K+ +EK E
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69
Query: 193 KEKKSKDK 200
K+K K K
Sbjct: 70 KKKPKKKK 77
Score = 30.5 bits (69), Expect = 1.3
Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 3/51 (5%)
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+ E ++ K K++ + E K +DK ++K KL K + ER
Sbjct: 14 RALESEKY---KANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKL 61
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 41.0 bits (96), Expect = 0.001
Identities = 25/141 (17%), Positives = 49/141 (34%), Gaps = 14/141 (9%)
Query: 113 DKKTHKKKEKEKIKKKKDKKD-----KDKIKNKEKDKEKKEKEKGKKEEK-EKKEEKEKK 166
+K +KK ++ + + + E + KK + K E K KKE +K+
Sbjct: 295 REKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKR 354
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
E ++ ++E E+ + + K K K L + + E +
Sbjct: 355 RAAEREI--NREARQERAAAMARARARRAAVKAKKKGL-IDASPNEDTPSENEESKGSP- 410
Query: 227 PKLTFKFGTDMEEKTKRESSP 247
P++ + RE S
Sbjct: 411 PQVE----ATTTAEPNREPSQ 427
Score = 35.2 bits (81), Expect = 0.069
Identities = 17/99 (17%), Positives = 40/99 (40%), Gaps = 2/99 (2%)
Query: 103 EIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKEKEKGKKEEKEKK 160
EIKK + K + K KK+ +K + + + +++ + + + K +K+
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGL 390
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
+ E+ + +E K P + + +E +D
Sbjct: 391 IDASPNEDTPSENEESKGSPPQVEATTTAEPNREPSQED 429
Score = 34.4 bits (79), Expect = 0.12
Identities = 14/99 (14%), Positives = 34/99 (34%), Gaps = 5/99 (5%)
Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
IKK + K + K +KK+ K + +E ++E +++ + + + K
Sbjct: 332 IKKTRTAEKNEAKA---RKKEIAQKRRAAEREINREARQERAAAMA--RARARRAAVKAK 386
Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+ + + EE + + +E
Sbjct: 387 KKGLIDASPNEDTPSENEESKGSPPQVEATTTAEPNREP 425
Score = 32.5 bits (74), Expect = 0.44
Identities = 23/138 (16%), Positives = 49/138 (35%), Gaps = 23/138 (16%)
Query: 140 KEKDKEKKEKEKGKKEEK-------------------EKKEEKEKKEEKEFKMKEDKEKP 180
+EK KEKK + + KK +K E + + KE +K
Sbjct: 295 REKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKR 354
Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEK 240
R+ +++ +E+ + + ++ +K KKK +S ++ + G+ + +
Sbjct: 355 RAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGSPPQVE 414
Query: 241 TKRESSPKIVIKPVKSPS 258
+ P S
Sbjct: 415 ATTTAEP----NREPSQE 428
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 41.4 bits (98), Expect = 0.001
Identities = 30/120 (25%), Positives = 59/120 (49%), Gaps = 2/120 (1%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
KS+ K +K + K IK ++ ++D +K K K K+ EK + E + +
Sbjct: 111 KSLLKKQKVEVEGNKLIIKVN-NEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSK 169
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
E+ K + KE+ EK+ KE ++ K+ +++ K K+ K K+ + + ++E+ P
Sbjct: 170 EELEKFEAQKEEEDEKLAKEALEAMKKLEAE-KKKQSKNFDPKEGPVQIGKKIDKEEITP 228
Score = 38.7 bits (91), Expect = 0.007
Identities = 30/132 (22%), Positives = 59/132 (44%), Gaps = 1/132 (0%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE-EKEKKEEKEFKMKEDKEKP 180
K +KK+K + + +K+ K ++ +++ K K K K+ EK + + D K
Sbjct: 111 KSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKE 170
Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEK 240
+ + +K+ E EK +K+ + +KK + +KKK+ K E V K
Sbjct: 171 ELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEEITPMK 230
Query: 241 TKRESSPKIVIK 252
E ++V++
Sbjct: 231 EINEEERRVVVE 242
Score = 35.2 bits (82), Expect = 0.072
Identities = 26/128 (20%), Positives = 58/128 (45%), Gaps = 9/128 (7%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDK----KTHKKKEKEKIKKKKDKKDKDKIKNK--EKDK 144
K S +KK K + K + + E++ +KKK K + + K
Sbjct: 101 EKAKKNSPLFKSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILK 160
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
E + K+E ++ + +KE+++E K+ ++ + ++K+ E+K K K+ ++
Sbjct: 161 IDFEIDDSKEELEKFEAQKEEEDE---KLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQI 217
Query: 205 KKEKLKKK 212
K+ K++
Sbjct: 218 GKKIDKEE 225
Score = 30.2 bits (69), Expect = 3.0
Identities = 21/101 (20%), Positives = 47/101 (46%), Gaps = 7/101 (6%)
Query: 90 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
P ++K G +K D I K+ +K E +K ++ + + K+ + EKK++
Sbjct: 145 PKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQ 204
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
K ++ + +K +++E P+++I +EE+
Sbjct: 205 SKNFDPKEGPVQIGKKIDKEEI-------TPMKEINEEERR 238
>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
finger [Chromatin structure and dynamics].
Length = 271
Score = 40.3 bits (94), Expect = 0.001
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 4/38 (10%)
Query: 291 PMIGCDGCDA---WYHWVCVGLVAEPETSDWFCPKCSK 325
M+ CD + W+H CVGL P+ W+CP+C K
Sbjct: 233 QMVACDNANCKREWFHLECVGLKEPPKGK-WYCPECKK 269
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 40.5 bits (95), Expect = 0.001
Identities = 26/77 (33%), Positives = 44/77 (57%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
NK ++K K + KE++ K+ E+E KEEK+ K K + +++I KE EK +
Sbjct: 269 INKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKARAR 328
Query: 198 KDKDKKLKKEKLKKKKK 214
K + +K + EK K K++
Sbjct: 329 KKEQRKERGEKKKLKRR 345
Score = 34.3 bits (79), Expect = 0.12
Identities = 34/124 (27%), Positives = 61/124 (49%), Gaps = 7/124 (5%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKEKEKGKKE 155
SE L E D ++ + E + +K + K K K +++KEK+ KE ++
Sbjct: 237 MSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREA 296
Query: 156 EKEKKEEKEKKEEKEFKMKEDKE----KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
++EK + +KK + ++KE + K + RK+E+ E+ +K K K +KL K K +
Sbjct: 297 KEEK--QLKKKLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLGKHKYPE 354
Query: 212 KKKE 215
E
Sbjct: 355 PPLE 358
Score = 33.5 bits (77), Expect = 0.23
Identities = 25/139 (17%), Positives = 56/139 (40%), Gaps = 17/139 (12%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKK---------DKKDKDKIKNKEKDKEKKEKEKGK 153
E K++K + K + +D ++ +E +KE K ++K +
Sbjct: 155 EQVAKKTVKTAPISLAANGKPVPAVEVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQ 214
Query: 154 KEEKEKKEEKEKKEEKEFKMKE-----DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+ E+ ++++ EK + ++ E +E + + + +S E + + K +
Sbjct: 215 ELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVR 274
Query: 209 LKKKKKE---KERSSHEKE 224
K+K K KE+ E E
Sbjct: 275 PKRKTKAQRNKEKRRKELE 293
Score = 33.1 bits (76), Expect = 0.31
Identities = 19/102 (18%), Positives = 45/102 (44%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
K EK++ + ++ ++ K + E + + E +E + EE+ E + +
Sbjct: 207 VKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEY 266
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
E + +R + K + K K + + ++ K +K+ K+K
Sbjct: 267 EPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 40.9 bits (96), Expect = 0.001
Identities = 25/147 (17%), Positives = 71/147 (48%), Gaps = 5/147 (3%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE--KKEKEKGKKEEKE 158
L E+ K+ K ++ + E + + + ++++K +K +E ++++E+ ++E E
Sbjct: 176 LKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIE 235
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
EE+ + E+E + E+ + + +I E ++ K ++ + ++ + L++ +++ ER
Sbjct: 236 ALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELE---RLLEELEEKIER 292
Query: 219 SSHEKEVIPKLTFKFGTDMEEKTKRES 245
+ I +L + + E
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEE 319
Score = 40.9 bits (96), Expect = 0.001
Identities = 19/112 (16%), Positives = 59/112 (52%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
+++ + E E+ K++ ++ +K++N ++ E+ +++ ++ KE+ + E + ++ +
Sbjct: 514 EEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKE 573
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ E+ + + E+ E+ K+ K K K+L++ + ++ + E E
Sbjct: 574 LLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAE 625
Score = 40.5 bits (95), Expect = 0.002
Identities = 24/127 (18%), Positives = 69/127 (54%), Gaps = 1/127 (0%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK-EKGKKEEK 157
+L E++++ +++ + ++ I + +K + +E +E +++ + + E
Sbjct: 424 RELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELEL 483
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E+ EE+ +E++E +++E+ E+ +++R+ E++ + + ++ K+ +EKL+K + E
Sbjct: 484 EELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLE 543
Query: 218 RSSHEKE 224
KE
Sbjct: 544 ELEELKE 550
Score = 40.5 bits (95), Expect = 0.002
Identities = 30/140 (21%), Positives = 72/140 (51%), Gaps = 14/140 (10%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--- 155
E+L ++ ++ K +E+++K ++K +K + + +E +EK E K +E
Sbjct: 305 EELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLK 364
Query: 156 ---------EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
EKE ++ E+ ++ E ++E KE+ E E+ E+ ++ + + ++L++
Sbjct: 365 ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELER 424
Query: 207 --EKLKKKKKEKERSSHEKE 224
E+L+++ K+ E ++ E
Sbjct: 425 ELEELEEEIKKLEEQINQLE 444
Score = 39.7 bits (93), Expect = 0.003
Identities = 27/113 (23%), Positives = 64/113 (56%), Gaps = 2/113 (1%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+ +E + K + K ++++K E+ E+ EKE K E+ K+ E+ +E KE +
Sbjct: 344 ELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSA 403
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
+E+I++E ++ EKE + +++ + +E++KK +++ + ++ +I +L
Sbjct: 404 A--LEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELA 454
Score = 39.7 bits (93), Expect = 0.003
Identities = 25/127 (19%), Positives = 66/127 (51%), Gaps = 7/127 (5%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK-----EK 159
+ ++ I++ ++ ++ E+ + + + + ++++ + + E+ K EEK +
Sbjct: 285 ELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESE 344
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
EE +++ + K+ E++ K +E R EE + E EK + K+L++ + K++ E S
Sbjct: 345 LEELAEEKNELAKLLEERLKELE-ERLEELEKELEKALERL-KQLEEAIQELKEELAELS 402
Query: 220 SHEKEVI 226
+ +E+
Sbjct: 403 AALEEIQ 409
Score = 39.0 bits (91), Expect = 0.005
Identities = 22/110 (20%), Positives = 59/110 (53%), Gaps = 1/110 (0%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
I+ + K +E++++ + ++ + K + ++ ++E E+ ++E + + E+ E
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE 318
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E K+K +E+ +EK+ ++ + E E + ++K + L+++ KE E
Sbjct: 319 ELLEKLKSLEER-LEKLEEKLEKLESELEELAEEKNELAKLLEERLKELE 367
Score = 38.6 bits (90), Expect = 0.007
Identities = 24/110 (21%), Positives = 59/110 (53%), Gaps = 4/110 (3%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEK-EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
E+L E K + + +++ + +E+ E+++K+ +K + + +E +E KE+
Sbjct: 346 EELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAAL 405
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
E+ +E+ ++ EKE E+ E+ +E++ +E K E++ + + + E
Sbjct: 406 EEIQEELEELEKE---LEELERELEELEEEIKKLEEQINQLESKELMIAE 452
Score = 37.4 bits (87), Expect = 0.015
Identities = 25/119 (21%), Positives = 66/119 (55%), Gaps = 2/119 (1%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK--DKIKNKEKDKEKKEKEKGKKEE 156
++ E K +K ++ + KEK ++++ K++ + D+++ ++ E+ + +KEE
Sbjct: 528 KEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEE 587
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E+ E+ K+ +K+ K E++ +E++ + + SE E + ++ +++L+ E K +
Sbjct: 588 LEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQA 646
Score = 37.0 bits (86), Expect = 0.022
Identities = 27/162 (16%), Positives = 74/162 (45%), Gaps = 26/162 (16%)
Query: 111 KKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEKDKEKK------------EKEKGKKEEK 157
+K+ + E E+++++ +K++ +++ + ++ EK+ E E+ KEE
Sbjct: 472 EKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEEL 531
Query: 158 EKKEEKEKKEEKEFKMKEDK-------------EKPVEKIRKEEKDSEKEKKSKDKDKKL 204
E+K EK + +E + ++K E ++++++ ++ + K++ ++L
Sbjct: 532 EEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEEL 591
Query: 205 KKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
++ + KKK KE ++ L ++ E + +
Sbjct: 592 RERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEE 633
Score = 37.0 bits (86), Expect = 0.023
Identities = 25/142 (17%), Positives = 71/142 (50%), Gaps = 6/142 (4%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
+ ++ ++ E++++ +++ ++EKE++++ K + + + E K ++E+
Sbjct: 217 KLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEEL 276
Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE------KEKKSKDKDKKL 204
+ + E+ EEK ++ E+ + E+ E+ +E +R ++ E K + + + +
Sbjct: 277 RELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEE 336
Query: 205 KKEKLKKKKKEKERSSHEKEVI 226
K EKL+ + +E +E +
Sbjct: 337 KLEKLESELEELAEEKNELAKL 358
Score = 36.3 bits (84), Expect = 0.037
Identities = 17/107 (15%), Positives = 58/107 (54%), Gaps = 6/107 (5%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF--K 172
+ ++ ++++ + + + ++ E++ ++++E +EE E+ E++ ++ E+E
Sbjct: 465 QELPEEHEKELLELYELE----LEELEEELSREKEEAELREEIEELEKELRELEEELIEL 520
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
++ ++ E K EK ++ ++ +KL+ ++LK++ ++ E
Sbjct: 521 LELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDR 567
Score = 35.5 bits (82), Expect = 0.065
Identities = 25/132 (18%), Positives = 69/132 (52%), Gaps = 1/132 (0%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E L + + + + +++ E+++ + ++ + +I+N+E+ +EK E+ + +EE E
Sbjct: 640 EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQ-RIENEEQLEEKLEELEQLEEELE 698
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ E+ ++ K+ E + +E + E ++ +KE + +K +L +E +K K R
Sbjct: 699 QLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEELREKLGKAGLR 758
Query: 219 SSHEKEVIPKLT 230
+ + ++ ++
Sbjct: 759 ADILRNLLAQIE 770
Score = 35.1 bits (81), Expect = 0.074
Identities = 21/121 (17%), Positives = 57/121 (47%), Gaps = 12/121 (9%)
Query: 118 KKKEKEKI---------KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
K KE+++I +K + K+ IK + E+ E + + E + + +EE
Sbjct: 152 KPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEE 211
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
+ + K + +++ ++EE+ ++ + +++ +L++EK + ++ + E +
Sbjct: 212 LK---ELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEA 268
Query: 229 L 229
L
Sbjct: 269 L 269
Score = 35.1 bits (81), Expect = 0.093
Identities = 23/114 (20%), Positives = 60/114 (52%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+ + +++ +K ++ E+E + +++ K ++ N+ + KE E EK +E
Sbjct: 407 EIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQE 466
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
EE E ++ E E +E++ +E ++E + +++ ++L+KE + +++ E
Sbjct: 467 LPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIEL 520
Score = 34.7 bits (80), Expect = 0.11
Identities = 26/127 (20%), Positives = 72/127 (56%), Gaps = 6/127 (4%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
+ +++ + EK ++ + ++ + ++ E+ E+ E E ++ ++ + EE+ +++ +E
Sbjct: 630 EAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEE 689
Query: 171 FKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLK--KEKLKKKKKEKERSSHEKEVIP 227
+ E++ +E++R+E E+ +K + + ++L+ K +L++ KKE E+ E++
Sbjct: 690 LEQLEEE---LEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLE 746
Query: 228 KLTFKFG 234
+L K G
Sbjct: 747 ELREKLG 753
Score = 34.4 bits (79), Expect = 0.13
Identities = 25/130 (19%), Positives = 61/130 (46%), Gaps = 10/130 (7%)
Query: 104 IKKDKSIKK---DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE---- 156
+K +K D+ +K ++ + K+ + K K +E + + E + ++
Sbjct: 149 LKSKPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEAL 208
Query: 157 -KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK--LKKEKLKKKK 213
+E KE K+ +E +E + +E+ E+ +E + + + E+EK+ ++ K L+ E L+ +
Sbjct: 209 EEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEA 268
Query: 214 KEKERSSHEK 223
+ +
Sbjct: 269 LKIREEELRE 278
Score = 34.0 bits (78), Expect = 0.17
Identities = 31/174 (17%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-KE 161
++ ++ ++ ++ K K K+ ++ +++ + E E E +E +E+ +
Sbjct: 578 LRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELES 637
Query: 162 EKEK------KEEKEFKMKEDKEKPVEK----IRKEEKDSEKEKKSKDKDKKLK------ 205
E EK EE E+ E+ VE+ IR+E + E E++ ++K ++L+
Sbjct: 638 ELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEEL 697
Query: 206 ---KEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
+E+L++ K+ E + ++EE K + ++ ++
Sbjct: 698 EQLREELEELLKKLGEIEQLIEELESRK----AELEELKKELEKLEKALELLEE 747
Score = 32.4 bits (74), Expect = 0.49
Identities = 26/98 (26%), Positives = 58/98 (59%), Gaps = 4/98 (4%)
Query: 132 KDKDKIKNKEKDKEKKEKE-KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK-EEK 189
+K+ ++ E + E+ E+E +KEE E +EE E+ E++ +++E+ + +E +E+
Sbjct: 471 HEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEE 530
Query: 190 DSEKEKKSKDKDKKLK--KEKLKKKKKEKERSSHEKEV 225
EK +K ++ ++L+ KEKL+ ++ ++E E +
Sbjct: 531 LEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRL 568
Score = 32.4 bits (74), Expect = 0.63
Identities = 18/125 (14%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
+ K ++K + E++++ K+K ++K + + E + +E + E+ +
Sbjct: 523 LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLR 582
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKD----SEKEKKSKDKDKKLKKEKLKKKKKEKER 218
K+E +E + E++++ +K E+ + ++ + L+ + + + +E E
Sbjct: 583 TRKEELEELR---------ERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEE 633
Query: 219 SSHEK 223
+
Sbjct: 634 ELESE 638
Score = 32.0 bits (73), Expect = 0.77
Identities = 20/80 (25%), Positives = 38/80 (47%)
Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
K K K +KE ++ EK E+ +KE ++ KI + E + + + + +
Sbjct: 150 KSKPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALE 209
Query: 207 EKLKKKKKEKERSSHEKEVI 226
E+LK+ KK +E ++E
Sbjct: 210 EELKELKKLEEIQEEQEEEE 229
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 40.6 bits (95), Expect = 0.001
Identities = 25/126 (19%), Positives = 58/126 (46%), Gaps = 5/126 (3%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
I + K K K+ + ++ + K + ++ N EKDKE + EK E ++ +++
Sbjct: 74 INQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQ 133
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK-----EKLKKKKKEKER 218
+ + + K + K E+++ E + +E +++K ++ L+ K+ E++R
Sbjct: 134 LQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQR 193
Query: 219 SSHEKE 224
S
Sbjct: 194 ESKWAI 199
Score = 39.4 bits (92), Expect = 0.003
Identities = 27/135 (20%), Positives = 67/135 (49%), Gaps = 5/135 (3%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
+K+ ++K+ + + KKK+ ++ +K ++K + K+ + +++ K E
Sbjct: 48 EKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLEALEL 107
Query: 173 MKEDKEKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
+ +K+K +E + KE E E +K+ ++ + ++K++ K +E+ + +EK++ L
Sbjct: 108 LNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLE 167
Query: 231 F---KFGTDMEEKTK 242
KF + E
Sbjct: 168 LEREKFEEQLHEANL 182
Score = 34.8 bits (80), Expect = 0.075
Identities = 24/109 (22%), Positives = 50/109 (45%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
K + + + K+ E + + + K+ K + +K++E K EE+ K E +
Sbjct: 100 AKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENE 159
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+K EE +E E+ + + + + E E++ + K LKK K + +
Sbjct: 160 KKLEESLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLKRRAE 208
Score = 29.0 bits (65), Expect = 5.4
Identities = 27/115 (23%), Positives = 57/115 (49%), Gaps = 1/115 (0%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEE 156
S++L + ++ + +KK K +E++K + +KK ++ ++ +EK +E+ + E
Sbjct: 127 SKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEF 186
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
KE +E++E K K+K E ++++ E + E + K EK K
Sbjct: 187 KENEEQRESKWAILKKLKRRAELGSQQVQGEALELPNESFIRSKFPSDINEKADK 241
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 40.7 bits (95), Expect = 0.001
Identities = 31/145 (21%), Positives = 64/145 (44%), Gaps = 12/145 (8%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
I + ++ + DK + + KK+K+K+ K K K D DK + D + E +
Sbjct: 215 IKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDS 274
Query: 152 GKKEEKEKKE---------EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD- 201
+++ ++E + +E ++ E KP + ++ ++SE+EK ++
Sbjct: 275 DDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGL 334
Query: 202 --KKLKKEKLKKKKKEKERSSHEKE 224
K K +KLK KK ++ +
Sbjct: 335 SKKGKKLKKLKGKKNGLDKDDSDSG 359
Score = 38.8 bits (90), Expect = 0.005
Identities = 32/136 (23%), Positives = 55/136 (40%), Gaps = 7/136 (5%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-------K 149
ED E K K KK K KK + +K K+ D D+ + + D E +E
Sbjct: 232 GGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSS 291
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
G E+ + + + K +++ + E+ + EE+ +K K K K KK L
Sbjct: 292 ASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351
Query: 210 KKKKKEKERSSHEKEV 225
K + S + ++
Sbjct: 352 DKDDSDSGDDSDDSDI 367
Score = 36.1 bits (83), Expect = 0.036
Identities = 36/150 (24%), Positives = 59/150 (39%), Gaps = 25/150 (16%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE- 156
+DL + + D K ++ ++K KK K KNK+K + K+ ++G ++
Sbjct: 210 GKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDA 269
Query: 157 ------------------------KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
EE+E K E K + E+ + EE+ +E
Sbjct: 270 DEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNE 329
Query: 193 KEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+E K KKLKK K KK +K+ S
Sbjct: 330 EEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359
Score = 35.3 bits (81), Expect = 0.057
Identities = 24/108 (22%), Positives = 45/108 (41%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
D +E+ K + +++D E+ E+EK ++E K+ K+ K+ K K
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKK 348
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
DK+ ++ D + E K +KE K++ + SS
Sbjct: 349 NGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSS 396
Score = 31.9 bits (72), Expect = 0.87
Identities = 27/127 (21%), Positives = 52/127 (40%), Gaps = 13/127 (10%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
++ + + K K KD + D+ E DK ++ ++ K ++K+KK K KK+ +
Sbjct: 198 EETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDD 257
Query: 171 FKMK------EDKEKPVEKIRKEEK-------DSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
K + E + E + S ++++ KL E K + E++
Sbjct: 258 DKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQD 317
Query: 218 RSSHEKE 224
S E E
Sbjct: 318 EDSEESE 324
Score = 29.2 bits (65), Expect = 5.6
Identities = 19/83 (22%), Positives = 36/83 (43%)
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
K+ ++ ++ E EK K+ K ++ D E ++ K +D ++EK K K
Sbjct: 187 KNGPAAFGDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKK 246
Query: 202 KKLKKEKLKKKKKEKERSSHEKE 224
K K +K K+ +R +
Sbjct: 247 KLAKNKKKLDDDKKGKRGGDDDA 269
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 40.9 bits (96), Expect = 0.002
Identities = 21/115 (18%), Positives = 49/115 (42%), Gaps = 4/115 (3%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
+ +S+K+ + K+ E +++KK ++ + I K ++ + KK
Sbjct: 828 KNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKK 887
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
EE+ K+ +E + E++ + + + +KL KEK ++K +
Sbjct: 888 EEEFKQFAEEEGLNEEELA--FYDDLALNGGKLPENGTELVEKLAKEKSLREKNK 940
Score = 35.8 bits (83), Expect = 0.049
Identities = 25/145 (17%), Positives = 50/145 (34%), Gaps = 6/145 (4%)
Query: 65 KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 124
E +E + ++++ + + + ++ + K +
Sbjct: 453 VELELIEESIKEEAEELDE-RIEEITEDILEKIKKKTKNLEFLAMLAVRLIRAAKDIYDH 511
Query: 125 IKKKKDKKDKDKI-----KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
KK++ K + K + E + + KE E K EF+ DK++
Sbjct: 512 FKKEEVFDLKAMVVASSRKVAVELYEAEIAARLDWHSKESLEGAIKDYNTEFETDFDKKQ 571
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKL 204
K++ E+KD K K KD L
Sbjct: 572 SHAKLKDEKKDLIKRFKLKDDPLDL 596
Score = 34.7 bits (80), Expect = 0.13
Identities = 30/200 (15%), Positives = 63/200 (31%), Gaps = 24/200 (12%)
Query: 55 KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 114
K ++ SK ++ + + +I EI D I +
Sbjct: 764 KARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDY-ILELL 822
Query: 115 KTHKKKEK---------EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
+T K E+I K+ + + K K E+ E + + KK + EK +
Sbjct: 823 QTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLD---TAEKLE 879
Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
+ + K +E K +E+ +++ + K E + E
Sbjct: 880 ELYI-----------LAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEK 928
Query: 226 IPKLTFKFGTDMEEKTKRES 245
+ K + ++ +E
Sbjct: 929 LAKEKSLREKNKDDWKAKEE 948
Score = 30.8 bits (70), Expect = 1.8
Identities = 19/113 (16%), Positives = 39/113 (34%), Gaps = 12/113 (10%)
Query: 90 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
I + + L E + + K EK + KK+++ + E++ +E+
Sbjct: 845 EWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEE 904
Query: 150 EKGKK-------EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
+ E E +K KE ++E K + K+ + K
Sbjct: 905 LAFYDDLALNGGKLPENGTELVEKLAKEKSLREKN-----KDDWKAKEEVEAK 952
Score = 29.3 bits (66), Expect = 5.1
Identities = 28/158 (17%), Positives = 57/158 (36%), Gaps = 31/158 (19%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK----------------NKE 141
SED+G ++ + +D + + E+ K K +K +
Sbjct: 697 SEDIGYLEDAGELLRDFAPELEIKVERGNKILAKLFGRFLKAFNALKKLYEFAYTLEEEI 756
Query: 142 KDKEKKEKEKGKKEEKEK--KEEKEKKEEKEFKMKEDKE-------KPVEKIRKEEKDSE 192
+ E K + +K+ + K + +E K KE K +E I K+E + +
Sbjct: 757 QYLEDLLKARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINID 816
Query: 193 ------KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ K+ + KE +++ KE +K+
Sbjct: 817 YILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKK 854
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 40.0 bits (94), Expect = 0.002
Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 3/91 (3%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
E + +K K+ + K+ + + ++E K+K EK +++EK + E + +E
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELE--QNLIE 342
Query: 183 KIR-KEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+ + K E +KK +D +K+L+K K K
Sbjct: 343 ERKELNSKLEEIQKKLEDLEKRLEKLKSNKS 373
Score = 39.6 bits (93), Expect = 0.003
Identities = 20/84 (23%), Positives = 41/84 (48%), Gaps = 7/84 (8%)
Query: 122 KEKIKKKKDKKD-KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF------KMK 174
EK+ K+ + K ++ + E++ + K K EK +++EK +E ++ ++
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSK 198
E+ +K+ EK EK K +K
Sbjct: 349 SKLEEIQKKLEDLEKRLEKLKSNK 372
Score = 32.7 bits (75), Expect = 0.38
Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
K G+ KE E + +EE+E K K ++ ++ R EE + ++ K+ + KL
Sbjct: 293 SGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKL- 351
Query: 206 KEKLKKKKKEKERSSHEK 223
+E KK + ++R K
Sbjct: 352 EEIQKKLEDLEKRLEKLK 369
Score = 30.0 bits (68), Expect = 2.4
Identities = 17/70 (24%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK--DKIKNKEKDKEKK-EKEKGKKEEKEK 159
+ + + ++++ KKK EKI++K+ + ++ + + K+ K E+ + K E+ EK
Sbjct: 304 KEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEK 363
Query: 160 KEEKEKKEEK 169
+ EK K +
Sbjct: 364 RLEKLKSNKS 373
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 40.2 bits (94), Expect = 0.002
Identities = 29/133 (21%), Positives = 64/133 (48%), Gaps = 2/133 (1%)
Query: 86 TPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 145
++ VKV S++ + K+ + +K + K K+ K + + + K K
Sbjct: 169 KSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASL--KRNPPKKSNIMSSFFKKKT 226
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
K++KEK + E KEE E++ K + ED+ + ++E + E + + D + +
Sbjct: 227 KEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEE 286
Query: 206 KEKLKKKKKEKER 218
E+ +K+K+++ +
Sbjct: 287 TEEKEKEKRKRLK 299
Score = 38.3 bits (89), Expect = 0.007
Identities = 40/198 (20%), Positives = 84/198 (42%), Gaps = 12/198 (6%)
Query: 31 PQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEK---EYSKVKDVELGVTP 87
P + P S K + + K+ T+ K + + K + ++ + S +
Sbjct: 167 PPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKT 226
Query: 88 MVPNIVKVTSSEDLGEIKKDKSIKKDK---------KTHKKKEKEKIKKKKDKKDKDKIK 138
K S + E +++S K+D + E E K ++ + +
Sbjct: 227 KEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEE 286
Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
+EK+KEK+++ K E++++ EE E E + +E +E + K+E++ E+ S
Sbjct: 287 TEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSP 346
Query: 199 DKDKKLKKEKLKKKKKEK 216
D ++ + ++ KKK K
Sbjct: 347 DGGRRRGRRRVMKKKTFK 364
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 40.7 bits (95), Expect = 0.002
Identities = 27/103 (26%), Positives = 52/103 (50%)
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
E +++ +EE E+K++KE K KE + K ++ +KE K + +++ D KK +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64
Query: 210 KKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
K +K++ E + E + P F + + ++ SP V K
Sbjct: 65 KSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYSPSAVEK 107
Score = 36.4 bits (84), Expect = 0.037
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
G + E E +++ +EE E K K++ EK EK K+ K ++KE K+K + ++
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKE-EKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVP 59
Query: 212 KKKEKERSSHEKEV 225
KK EK+ + E
Sbjct: 60 KKSEKKSRKRDVED 73
Score = 35.6 bits (82), Expect = 0.052
Identities = 21/97 (21%), Positives = 47/97 (48%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
E +KK +++ + K+K+++ KEKE K + +K+ + + + ++ +K +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64
Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
K RK + + E + D D ++K + K+ S
Sbjct: 65 KSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYS 101
Score = 35.3 bits (81), Expect = 0.078
Identities = 21/77 (27%), Positives = 42/77 (54%), Gaps = 6/77 (7%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE---EKEKKEEK 163
+K I +++ +KK+KE+ K+K+ K K+K +K+ + K + + + +K E+K
Sbjct: 9 EKKILTEEELERKKKKEEKAKEKELK---KLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 164 EKKEEKEFKMKEDKEKP 180
+K + E + ED P
Sbjct: 66 SRKRDVEDENPEDFIDP 82
Score = 31.0 bits (70), Expect = 1.4
Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK--E 167
++ + + E EK++ K D+ K + K ++K KEK +E+ K K +
Sbjct: 987 LQAQGAVNAEAELEKLRNKMDEIQKQQEKLEKKMNASGYKEKVPANIQEEDARKLTKLLQ 1046
Query: 168 EKEFKMKEDKEKPVEK 183
E EF KE K E
Sbjct: 1047 ELEFFEKESKRLEAET 1062
Score = 31.0 bits (70), Expect = 1.5
Identities = 22/87 (25%), Positives = 40/87 (45%), Gaps = 7/87 (8%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
+ L E + ++ KK++K KEKE +KK K + + K K + + K
Sbjct: 6 SEAEKKILTEEELERKKKKEEKA---KEKE-LKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKP 180
E+K +K + E + ++F D + P
Sbjct: 62 SEKKSRKRDVEDENPEDF---IDPDTP 85
Score = 30.3 bits (68), Expect = 2.5
Identities = 19/69 (27%), Positives = 40/69 (57%), Gaps = 3/69 (4%)
Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK---FGTDMEEKT 241
R E + +K ++ ++K KKE+ K+K+ K+ + +KE KL + GT++ +K+
Sbjct: 3 RTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKS 62
Query: 242 KRESSPKIV 250
+++S + V
Sbjct: 63 EKKSRKRDV 71
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 40.3 bits (95), Expect = 0.002
Identities = 30/112 (26%), Positives = 55/112 (49%), Gaps = 19/112 (16%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
KDK T K++K I D E ++ K+ E+ E+K++KE E K E E
Sbjct: 483 KDKGT-GKEQKITITASSGLSDD------EIERMVKDAEEYAAEDKKRKERIEAKNEAEE 535
Query: 172 -------KMKEDKEKP----VEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKK 211
+KE+ +K +K+ + + ++E + +DK++ + K E+L+K
Sbjct: 536 YVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAKTEELQK 587
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 40.0 bits (94), Expect = 0.002
Identities = 40/223 (17%), Positives = 93/223 (41%), Gaps = 30/223 (13%)
Query: 37 PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 96
+ +S + + + S + + + V+ +E ++ ++K T
Sbjct: 77 DEPKQVKIPRQSGLSEEVAEAEAKATSVAAEATTPKSIQELVEALE----ELLEELLKET 132
Query: 97 SSE-----------DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 145
+S+ DL + K+ ++K D ++ KE++ + K + E+++E
Sbjct: 133 ASDPVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAE---LKAEEEEE 189
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
+ K K+EE K E+E E K +++ + E + EKE+ K ++KL+
Sbjct: 190 LERALKEKREELLSKLEEELLARLESKEAALEKQ-----LRLEFEREKEELRKKYEEKLR 244
Query: 206 KEKLKKKKKEKERSSHEKEVIP----KLTFKFGTDMEEKTKRE 244
+E +++ + K + +L +F +++EK + E
Sbjct: 245 QE---LERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEE 284
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 40.1 bits (94), Expect = 0.002
Identities = 27/165 (16%), Positives = 64/165 (38%), Gaps = 4/165 (2%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+ + K ++ + E + ++ K +KE +E ++ + + KE+ + E
Sbjct: 795 EIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYL-EKEIQELQEQRIDLKEQIKSIE 853
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
K+ E KE+ E+ +E++ +D E K++ + +L++ +++ E + E
Sbjct: 854 KEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIE 913
Query: 225 VIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 269
K + +E + S + P P D
Sbjct: 914 KKRKRLSELKAKLEALEEELSE---IEDPKGEDEEIPEEELSLED 955
Score = 38.5 bits (90), Expect = 0.007
Identities = 28/128 (21%), Positives = 66/128 (51%), Gaps = 7/128 (5%)
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
+L E+ K+ + +D+ +++ EK+K++ ++ +E D+ ++E ++ +E +
Sbjct: 372 ELEEVDKEFAETRDELKDYREKLEKLKREINE------LKRELDRLQEELQRLSEELADL 425
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLKKEKLKKKKKEKER 218
E K +++E+KE +I+K+E E+ K +++L K + + EKE
Sbjct: 426 NAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKEL 485
Query: 219 SSHEKEVI 226
S ++E+
Sbjct: 486 SKLQRELA 493
Score = 36.6 bits (85), Expect = 0.027
Identities = 26/141 (18%), Positives = 67/141 (47%), Gaps = 19/141 (13%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD------------------KI 137
E L E+++D S + + + K E ++++ + ++ ++D +I
Sbjct: 734 KLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRI 793
Query: 138 KNKEKDKEKKEKEKGKKEEK-EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
+ + K E+E + E + + E+K + E + E + + +++ R + K+ K +
Sbjct: 794 PEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIE 853
Query: 197 SKDKDKKLKKEKLKKKKKEKE 217
+ ++ KKE+L+++ +E E
Sbjct: 854 KEIENLNGKKEELEEELEELE 874
Score = 36.6 bits (85), Expect = 0.028
Identities = 30/137 (21%), Positives = 63/137 (45%), Gaps = 14/137 (10%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+S +GEI+K+ + ++ +K KE++++ ++ + + + E KE E
Sbjct: 714 ASRKIGEIEKEIEQLEQEE---EKLKERLEELEEDLSSLEQEIENVKSELKELEA----R 766
Query: 157 KEKKEEKEKKEEKEFKMKEDK--EKPVEKIRKEEKDSEKEKKSKDK-----DKKLKKEKL 209
E+ EE K E+ E + + +I+ E E+E + ++KL + L
Sbjct: 767 IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTL 826
Query: 210 KKKKKEKERSSHEKEVI 226
+K+ EKE +++ I
Sbjct: 827 EKEYLEKEIQELQEQRI 843
Score = 35.8 bits (83), Expect = 0.051
Identities = 26/144 (18%), Positives = 64/144 (44%), Gaps = 18/144 (12%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKK--------------KEKEKIKKKKDKKDKDKIKNKEKD 143
E L +K++ S + + + ++ +I+K+ ++ ++++ K KE+
Sbjct: 680 RERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERL 739
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-KEKKSKDKDK 202
+E +E ++E E + + K+ E E+ E+ + K+ + D E + S+ +
Sbjct: 740 EELEEDLSSLEQEIENVKSELKELEAR---IEELEEDLHKLEEALNDLEARLSHSRIPEI 796
Query: 203 KLKKEKLKKKKKEKERSSHEKEVI 226
+ + KL+++ E E E
Sbjct: 797 QAELSKLEEEVSRIEARLREIEQK 820
Score = 35.8 bits (83), Expect = 0.058
Identities = 24/137 (17%), Positives = 56/137 (40%), Gaps = 14/137 (10%)
Query: 97 SSEDLGEIKKDKS-----IKKDKKTHKKKE---KEKIKKKKDKKDKDKIKNKEKDKEKKE 148
++ E +K + + K+ + +E K+ + +D+ K ++ +K K+E
Sbjct: 341 LEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+ K+E +EE ++ E+ + + I + + E+EK+ K + K ++ K
Sbjct: 401 INELKRELDRLQEELQRLSEELADLNAA----IAGIEAKINELEEEKEDKALEIKKQEWK 456
Query: 209 LK--KKKKEKERSSHEK 223
L+ K
Sbjct: 457 LEQLAADLSKYEQELYD 473
Score = 35.0 bits (81), Expect = 0.093
Identities = 22/96 (22%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEE-----KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
+ I KE++ E E+ K E + EE E++ E+E K ++ + ++++E
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+D E + DK+ +++LK +++ E+ E
Sbjct: 367 EDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402
Score = 34.7 bits (80), Expect = 0.11
Identities = 29/115 (25%), Positives = 50/115 (43%), Gaps = 13/115 (11%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE-KEKKEEKEKKEEKEF 171
D+K K E+ + ++ ++ I K + E+ +E+ K E + +EK + E E
Sbjct: 169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYEL 228
Query: 172 -KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
K KE E+ E I ++ E+E EKL ++ E E+ E E
Sbjct: 229 LKEKEALERQKEAIERQLASLEEE-----------LEKLTEEISELEKRLEEIEQ 272
Score = 33.9 bits (78), Expect = 0.22
Identities = 30/127 (23%), Positives = 68/127 (53%), Gaps = 1/127 (0%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
I + E L +K ++ + + ++ EK ++ +D +++ E DK E E+
Sbjct: 281 IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEE 340
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
++E +E+++ ++K E+ ++KE+ E ++ + +K+ E + + KD + K EKLK+
Sbjct: 341 LEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEF-AETRDELKDYREKLEKLKR 399
Query: 212 KKKEKER 218
+ E +R
Sbjct: 400 EINELKR 406
Score = 32.7 bits (75), Expect = 0.48
Identities = 25/114 (21%), Positives = 62/114 (54%), Gaps = 5/114 (4%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
K + +++ ++ +K E+ + K+ ++ + KEK+ +++KE +++ +EE EK
Sbjct: 196 KRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEK 255
Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKER 218
E+ + EK +E+I + ++ K+ K ++++L+ KEK+ + + E
Sbjct: 256 LTEE----ISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIAS 305
Score = 32.7 bits (75), Expect = 0.51
Identities = 29/133 (21%), Positives = 69/133 (51%), Gaps = 10/133 (7%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-DKIKNKEKDKEKKEKEKGKKEEK 157
E+L E++++ K+++ E+++++++K ++ + ++++ E E K K+ +
Sbjct: 177 EELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALE 236
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+KE E++ E+ +EK+ E+ SE EK+ ++ ++ L++ K K +E
Sbjct: 237 RQKEAIERQ-------LASLEEELEKL--TEEISELEKRLEEIEQLLEELNKKIKDLGEE 287
Query: 218 RSSHEKEVIPKLT 230
KE I +L
Sbjct: 288 EQLRVKEKIGELE 300
Score = 31.6 bits (72), Expect = 1.0
Identities = 24/105 (22%), Positives = 49/105 (46%), Gaps = 2/105 (1%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
SE+L ++ + + K ++EKE K + KK + K++ D K E+E
Sbjct: 415 LQRLSEELADLNAAIAGIEAKINELEEEKED-KALEIKKQEWKLEQLAADLSKYEQELYD 473
Query: 154 KEEKEKKEEKEKKE-EKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+E+ + EKE + ++E E + + E+ + + E+ K+
Sbjct: 474 LKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKA 518
Score = 30.4 bits (69), Expect = 2.3
Identities = 34/177 (19%), Positives = 75/177 (42%), Gaps = 23/177 (12%)
Query: 60 EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS--EDLGEIKKDKSIKKDKKTH 117
E S + + E SK+++ V I + L + +K + +
Sbjct: 785 EARLSHSRIPEIQAELSKLEEE-------VSRIEARLREIEQKLNRLTLEKEYLEKEIQE 837
Query: 118 KKKE----KEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKEEKE 170
+++ KE+IK + + + K +E + +E + + E + K++++E E
Sbjct: 838 LQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAL--RDLESRLGDLKKERDELE 895
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
+++E + K +E + ++K+ + + K K E L+++ E E E E IP
Sbjct: 896 AQLRELERK-----IEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIP 947
Score = 30.0 bits (68), Expect = 3.0
Identities = 28/93 (30%), Positives = 54/93 (58%), Gaps = 5/93 (5%)
Query: 126 KKKKDKKDKDKIKNKEKDKE-KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
KK KD ++++++ KEK E + E ++ EK+ E E EE+ K++ + +K + +I
Sbjct: 279 KKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEI 338
Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+ E++ E+E+K +DK E+ + K+E E
Sbjct: 339 EELEREIEEERKRRDK----LTEEYAELKEELE 367
Score = 29.3 bits (66), Expect = 6.0
Identities = 25/97 (25%), Positives = 49/97 (50%), Gaps = 16/97 (16%)
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
KKEK + EE E+ E+ E K + +E++R+E + +E+ + + ++
Sbjct: 170 RKKEKALEELEEVEENIERLDLIIDE------KRQQLERLRREREKAERYQALLKEKREY 223
Query: 205 K-KEKLKKKKK--------EKERSSHEKEVIPKLTFK 232
+ E LK+K+ E++ +S E+E + KLT +
Sbjct: 224 EGYELLKEKEALERQKEAIERQLASLEEE-LEKLTEE 259
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 39.2 bits (91), Expect = 0.003
Identities = 33/111 (29%), Positives = 47/111 (42%), Gaps = 1/111 (0%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKKEEKEFK 172
KKT KKK E+ K K K K K K K K+ K KK + K K+
Sbjct: 5 KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVT 64
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
+K +K + V+K +K +KE S + + K K K+ + EK
Sbjct: 65 VKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115
Score = 34.2 bits (78), Expect = 0.11
Identities = 26/101 (25%), Positives = 38/101 (37%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
K ++E+ K + K+ KK K K K K K K KK K +
Sbjct: 9 KKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFE 68
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
K E KKE KK K+ + + + K+ K K+
Sbjct: 69 KTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKK 109
Score = 30.3 bits (68), Expect = 1.9
Identities = 19/103 (18%), Positives = 38/103 (36%), Gaps = 3/103 (2%)
Query: 40 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 99
K ++ P K + K ++K K + K K + ++ T
Sbjct: 6 KTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAK--KAAKVKKTKSVKTTTKKV 63
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK 142
+ + +K +S+KK+ K +KE + + + KN K
Sbjct: 64 TV-KFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSK 105
Score = 29.1 bits (65), Expect = 4.4
Identities = 21/85 (24%), Positives = 35/85 (41%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K K+ K KK + K K K K+ K + E +KE K+ +K+
Sbjct: 31 KTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSA 90
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEK 189
+ E K+ ++ K +K+ EK
Sbjct: 91 EVFEASNKLFKNTSKLPKKLFASEK 115
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 39.5 bits (92), Expect = 0.003
Identities = 27/98 (27%), Positives = 52/98 (53%), Gaps = 3/98 (3%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
D+K KEK + K++ DK K+ E++++ ++ E+ KK E E+ + E +E
Sbjct: 114 DRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKAELERVAALSQAEAREI 173
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
+ E + K +I +++E+E KD+ K+ K+ L
Sbjct: 174 ILAETENKLTHEIATRIREAERE--VKDRSDKMAKDLL 209
Score = 33.3 bits (76), Expect = 0.26
Identities = 30/108 (27%), Positives = 59/108 (54%), Gaps = 3/108 (2%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
K E++ KK K + K KE E KE+ + +EE E++ + E++E K+ + + E
Sbjct: 50 KAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRL-TE 108
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKL--KKEKLKKKKKEKERSSHEKE 224
+ RK+E S KEK + K++ L K + + +++++ E+ +K+
Sbjct: 109 RATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKK 156
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 39.1 bits (92), Expect = 0.003
Identities = 19/100 (19%), Positives = 65/100 (65%), Gaps = 1/100 (1%)
Query: 120 KEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
+E+EK ++++ + + +++ + E + +E+++ + +EK +K++K ++E EF + +
Sbjct: 81 EEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIER 140
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
K EK R+ E++ + + ++K ++ ++ + +++++++E+
Sbjct: 141 KEEEKEREREEELKILEYQREKAEREEEREAERRERKEEK 180
Score = 36.8 bits (86), Expect = 0.020
Identities = 35/155 (22%), Positives = 82/155 (52%), Gaps = 27/155 (17%)
Query: 99 EDLGEIKKDKSIKKDKKTHK-----KKEKEKIKKKKDKKDK----------DKIKNKEKD 143
E+ E+++ + K +K +EK++IK ++ ++++ K +E++
Sbjct: 1 ENSEELRELNEKLRAAKVNKERDAQIEEKKRIKAEEKEEERRIDEMMEEERLKALAEEEE 60
Query: 144 KEKKEKEKGKK---------EEKEK---KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
+E+K KE+ ++ EE+EK +E +E+ +E+E + + E + ++
Sbjct: 61 RERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKR 120
Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
EK+KK +++ + +E++++K++EKER E+ I
Sbjct: 121 EKQKKLREEIDEFNEERIERKEEEKEREREEELKI 155
Score = 34.5 bits (80), Expect = 0.096
Identities = 22/119 (18%), Positives = 74/119 (62%), Gaps = 8/119 (6%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE--------KEK 165
++ K++++E ++ ++++ D+I + +++++ E ++ ++++K+ +EE E+
Sbjct: 81 EEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIER 140
Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
KEE++ + +E++ K +E R++ + E+ + + + K+ K+ ++ + + ++E + E+E
Sbjct: 141 KEEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDERE 199
Score = 32.6 bits (75), Expect = 0.35
Identities = 31/121 (25%), Positives = 80/121 (66%), Gaps = 9/121 (7%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKD--------KDKIKNKEKDKEKKEKEKGKKEEKE 158
D+ I++ ++ + + +EK +K+K ++ + + K +EK++E++E+ K + ++E
Sbjct: 102 DEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQRE 161
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKE 217
K E +E++E + + KE+KE+ V ++R +++++E E++ D+ + L +E+ ++K+++KE
Sbjct: 162 KAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKE 221
Query: 218 R 218
+
Sbjct: 222 K 222
Score = 32.2 bits (74), Expect = 0.53
Identities = 28/129 (21%), Positives = 75/129 (58%), Gaps = 8/129 (6%)
Query: 108 KSIKKDKKTHKKKEKEKIK---KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K+++K +++E+ KI ++K ++++++ + + KE+KE+E + ++++ E E
Sbjct: 138 IERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDE 197
Query: 165 KKEEKEFKMKEDKEKPVEKIR-KEEKDSEKEKKSKDK----DKKLKKEKLKKKKKEKERS 219
++E E + +E+ K R KE++++EK ++ K + ++ +EK ++ ++E+
Sbjct: 198 REELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEE 257
Query: 220 SHEKEVIPK 228
E+E + +
Sbjct: 258 EAERERMLE 266
Score = 29.9 bits (68), Expect = 2.7
Identities = 29/136 (21%), Positives = 80/136 (58%), Gaps = 17/136 (12%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK------------DKKDKDKIKNKEKDKEK 146
E+L E++ D ++ ++ ++KEKE+ +K++ ++K++ +E+ +E+
Sbjct: 199 EELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEE 258
Query: 147 KEKE----KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
E+E K ++E+ ++E EK+ K + + + E+ +E+ ++E + +E+E++ ++ ++
Sbjct: 259 AERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEE-KEERRAAEREEELEEGER 317
Query: 203 KLKKEKLKKKKKEKER 218
++E ++ + E+ER
Sbjct: 318 LREEEAERQARIEEER 333
Score = 28.7 bits (65), Expect = 7.2
Identities = 16/94 (17%), Positives = 56/94 (59%), Gaps = 4/94 (4%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
K+E+ + ++ +++ +++++ K+ + E+ E+E EK + + E + E E +++E +E
Sbjct: 246 KEERLQEERAEEEAERERMLEKQAEDEELEQEN---AEKRRMKRLEHRRELEQQIEEKEE 302
Query: 179 KPVEKIRKEEKDSEKEKKS-KDKDKKLKKEKLKK 211
+ + +E ++ E+ ++ ++ ++++E+ +
Sbjct: 303 RRAAEREEELEEGERLREEEAERQARIEEERQRL 336
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 39.3 bits (92), Expect = 0.003
Identities = 26/123 (21%), Positives = 64/123 (52%), Gaps = 4/123 (3%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIK---NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
K+ KE E+I ++ K+ ++K++ + K+ +E+ + ++E + ++EE +++EE+
Sbjct: 27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86
Query: 171 FKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
+ +E + EK+ E + E+EK ++ +L++ + + + + E KL
Sbjct: 87 VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKL 146
Query: 230 TFK 232
K
Sbjct: 147 LLK 149
Score = 33.1 bits (76), Expect = 0.32
Identities = 24/124 (19%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
L E + + + ++ ++E+E++ +K+++ D + + + E + ++E+
Sbjct: 62 LRERNQQRQEARREREELQREEERLVQKEEQLDA-----RAEKLDNLENQLEEREKALSA 116
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
E E +E + K +++ V + E+ ++ K D +L++EK ++ KK +E +
Sbjct: 117 RELELEELE--KQLDNELYRVAGLTPEQ---ARKLLLKLLDAELEEEKAQRVKKIEEEAD 171
Query: 221 HEKE 224
E E
Sbjct: 172 LEAE 175
Score = 28.5 bits (64), Expect = 7.5
Identities = 19/93 (20%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF-- 171
K+ EK+ +++ ++ + ++ E +E EK E + ++ ++
Sbjct: 89 KEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLL 148
Query: 172 -----KMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
+++E+K + V+KI +EE D E E+K+++
Sbjct: 149 KLLDAELEEEKAQRVKKI-EEEADLEAERKAQN 180
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 39.3 bits (92), Expect = 0.003
Identities = 15/87 (17%), Positives = 39/87 (44%)
Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
+K + + KE K E EK E E+++ + +KE+ +K +I + E+ ++ +
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60
Query: 197 SKDKDKKLKKEKLKKKKKEKERSSHEK 223
+ + ++++ + E +
Sbjct: 61 KPAASGEGGGGEEEEEEAKAEAAEFRA 87
Score = 36.9 bits (86), Expect = 0.015
Identities = 16/97 (16%), Positives = 37/97 (38%), Gaps = 3/97 (3%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
+E + + K+ K+ EK E E+EK + + +++ +K E + D+ +
Sbjct: 5 REALAELA--KELRKLTEDEKLAE-AEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 182 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
E E+E++ + + L+
Sbjct: 62 PAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAE 98
Score = 36.6 bits (85), Expect = 0.025
Identities = 17/95 (17%), Positives = 40/95 (42%), Gaps = 6/95 (6%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E++K +K + ++K + K++ K +I E+ ++ E + E EE
Sbjct: 14 ELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEE 73
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+E++ + E +R + +E+K+
Sbjct: 74 EEEEAKAEAAEFR------AYLRGGDDALAEERKA 102
Score = 34.3 bits (79), Expect = 0.11
Identities = 17/100 (17%), Positives = 41/100 (41%), Gaps = 4/100 (4%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE--- 168
K+ + + ++++K + + + + +EK + KE+ K + E +E +E
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAE-EEKAEYDALKEEIDKLDAEIDRLEELLDELEA 60
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
K E E+ + + +E + D L +E+
Sbjct: 61 KPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEER 100
Score = 28.9 bits (65), Expect = 5.6
Identities = 16/85 (18%), Positives = 30/85 (35%)
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
++ E +E K E E + ++EK KEE D + + ++ + E
Sbjct: 5 REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64
Query: 213 KKEKERSSHEKEVIPKLTFKFGTDM 237
E E+E +F +
Sbjct: 65 SGEGGGGEEEEEEAKAEAAEFRAYL 89
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 39.3 bits (91), Expect = 0.004
Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 26/160 (16%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
+ H+ E K +K D +K K K KEKKEKEK + +K+KK+E E + +
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKER--DKDKKKEVEGFKSLLLALD 233
Query: 175 ED------------------------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
+ +P E E ++++K K K K ++ +KE+ K
Sbjct: 234 DSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKK 293
Query: 211 KKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 250
KKKK H + + + G EE SS K++
Sbjct: 294 KKKKHHHHRCHHSDGGAEQPVQNGAVEEEPLPPMSSYKLL 333
Score = 32.3 bits (73), Expect = 0.54
Identities = 29/141 (20%), Positives = 59/141 (41%), Gaps = 2/141 (1%)
Query: 43 PSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLG 102
+ P + K+ ++K KE++ K K+VE + ++ S+ +
Sbjct: 184 SKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVA 243
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
E + T E ++ K + ++ K K K K++ KEK+EK+K KK +
Sbjct: 244 EADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRC 303
Query: 161 EEKEKKEEKEFKMKEDKEKPV 181
+ E+ + +E+P+
Sbjct: 304 HHSDGGAEQPVQNGAVEEEPL 324
Score = 28.5 bits (63), Expect = 8.7
Identities = 34/129 (26%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE----------KEKKEEKEFK 172
E+ ++ + + +KDK + K+++KEK+ + + E E+ E+ E
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALP 144
Query: 173 MKEDKEKPVEKIR-------KEEKDSEK----------EKKSKDKDKKLKKEKLKKKKKE 215
ED + P + R K DSEK KS +K EK KK K+
Sbjct: 145 SDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKK 204
Query: 216 KERSSHEKE 224
KE+ EKE
Sbjct: 205 KEKKEKEKE 213
Score = 28.1 bits (62), Expect = 9.9
Identities = 27/131 (20%), Positives = 47/131 (35%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
KD D + + + DK + EK ++ E + E+ + E K K +K K+E
Sbjct: 150 KDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKE 209
Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 248
K+ E++K K + + K L +S + L + E
Sbjct: 210 KEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDA 269
Query: 249 IVIKPVKSPSP 259
+ KSP
Sbjct: 270 EAEETKKSPKH 280
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 37.3 bits (87), Expect = 0.004
Identities = 33/96 (34%), Positives = 56/96 (58%), Gaps = 9/96 (9%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE-KDSE 192
+DK +EK+E+++ +KEEKE+KEEKEK+ K KE++EK +E++ K + K
Sbjct: 5 RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLA 64
Query: 193 KEKKSKDKDKKLKKE--------KLKKKKKEKERSS 220
+ +D D++LK++ + +KKKEK
Sbjct: 65 RYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKK 100
Score = 36.1 bits (84), Expect = 0.011
Identities = 25/97 (25%), Positives = 54/97 (55%), Gaps = 5/97 (5%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
K+ + + E+ +++K+++ ++K + +EK+KE + K+E +++ EE EK + K
Sbjct: 7 KSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARY 66
Query: 175 EDKEKPVEKIRKEEKDSE-----KEKKSKDKDKKLKK 206
D E E+++++E+ + KK + DKK K
Sbjct: 67 ADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKP 103
Score = 33.4 bits (77), Expect = 0.077
Identities = 24/102 (23%), Positives = 59/102 (57%), Gaps = 7/102 (6%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
+ +DKS + K++EKE+ K++K++K++ + + + +K+E+EK +E ++ K +
Sbjct: 3 VYRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62
Query: 164 EKKEE------KEFKMKEDKEKPVEK-IRKEEKDSEKEKKSK 198
+ +E K +E + P+ + +RK+++ ++K+ K
Sbjct: 63 LARYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKPL 104
Score = 28.4 bits (64), Expect = 4.0
Identities = 24/98 (24%), Positives = 54/98 (55%), Gaps = 3/98 (3%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
++K + D ++K + K +EK+++++++EK K+ K +++E+++ E K +
Sbjct: 5 RDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLA 64
Query: 182 EKIRKEEKDSEKEKKSKDKD---KKLKKEKLKKKKKEK 216
E+ D E +++ + D + L+K+K K KK K
Sbjct: 65 RYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGK 102
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 39.1 bits (91), Expect = 0.004
Identities = 27/105 (25%), Positives = 44/105 (41%), Gaps = 2/105 (1%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K K+ KK K H K ++ K+ K KN+ E K E G + + KE+
Sbjct: 3 KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENK-MESGTNDNNKNKEKLS 61
Query: 165 KKEEKEFKMKEDKEKP-VEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
K ++E+ E I K +S+K ++ +K +KE
Sbjct: 62 KLYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQVSTQKWRKEL 106
Score = 29.5 bits (66), Expect = 4.4
Identities = 22/97 (22%), Positives = 33/97 (34%), Gaps = 3/97 (3%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
K K K K+ +KN D+ K+ K K E K D K EK
Sbjct: 3 KASKATKKFTKNHLKNT-IDRRKQLARSKKVY--GTKNRNSHTENKMESGTNDNNKNKEK 59
Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
+ K D + +++D KL K+ +
Sbjct: 60 LSKLYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQ 96
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 39.3 bits (92), Expect = 0.004
Identities = 25/131 (19%), Positives = 68/131 (51%), Gaps = 4/131 (3%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
+ ++ +++ K I +++ + +++ E+++ + ++ + + E+ E +EK + KEE
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Query: 158 EKKEEKEKKEEKEFKM----KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
E E + ++ E E + E+ E+ +E +R + E + S + + + + +L++ +
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413
Query: 214 KEKERSSHEKE 224
+ER E E
Sbjct: 414 DRRERLQQEIE 424
Score = 37.0 bits (86), Expect = 0.020
Identities = 26/140 (18%), Positives = 67/140 (47%), Gaps = 9/140 (6%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK--IKNKEKDKEKKEK---- 149
E L E++ + S +++ +KE + + + ++ K ++ + + E++ +
Sbjct: 264 ELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEA 323
Query: 150 --EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI-RKEEKDSEKEKKSKDKDKKLKK 206
E+ + + E EE + EEK ++KE+ E ++ E + E E + ++ +++L+
Sbjct: 324 QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLET 383
Query: 207 EKLKKKKKEKERSSHEKEVI 226
+ K + E + +S E+
Sbjct: 384 LRSKVAQLELQIASLNNEIE 403
Score = 32.3 bits (74), Expect = 0.57
Identities = 21/111 (18%), Positives = 57/111 (51%), Gaps = 9/111 (8%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
+++ EK ++ K+ K + ++ E +E + +E +E+ +E E++ E+ +
Sbjct: 206 ERQAEKAERYKELKAE--LRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQ 263
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLK--KEKLKKKKKEKERSSHEKE 224
+ E+ +E++R E SE E++ ++ K+L ++ + +++K+
Sbjct: 264 ELEEKLEELRLE--VSELEEEIEELQKELYALANEISRLEQQKQILRERLA 312
Score = 29.6 bits (67), Expect = 4.5
Identities = 21/124 (16%), Positives = 60/124 (48%), Gaps = 6/124 (4%)
Query: 118 KKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
++ E ++++ ++ +D E ++ ++ E+ + E + E+ EE ++ +
Sbjct: 836 TERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895
Query: 177 KEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 235
E+ E++R+ E SE ++ ++ +KL + +L+ + E + +L+ ++
Sbjct: 896 LEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRID----NLQERLSEEYSL 951
Query: 236 DMEE 239
+EE
Sbjct: 952 TLEE 955
Score = 29.3 bits (66), Expect = 5.6
Identities = 20/140 (14%), Positives = 58/140 (41%), Gaps = 21/140 (15%)
Query: 105 KKDKSIKKDKKTHKK----KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
++ K +K + + + E+++++ ++ ++ + +E E +E +E +EK
Sbjct: 213 ERYKELKAELRELELALLVLRLEELREELEELQEELKEAEE---ELEELTAELQELEEKL 269
Query: 161 EEKE-KKEEKEFKM-------------KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
EE + E E ++ E+ + +R+ + E++ + + + +
Sbjct: 270 EELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELE 329
Query: 207 EKLKKKKKEKERSSHEKEVI 226
KL + +E + E +
Sbjct: 330 SKLDELAEELAELEEKLEEL 349
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 39.3 bits (92), Expect = 0.004
Identities = 27/111 (24%), Positives = 41/111 (36%), Gaps = 13/111 (11%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
K + I +EK+ G K KE E+K ED + KE+
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKE-----EQKGNDLSNELEDISLGPLEYDKEK-- 264
Query: 191 SEKEKKSKDKDKKLKK------EKLKKKKKEKERSSHEKEVIPKLTFKFGT 235
EK K K+K KKL + K K + + + + I + + GT
Sbjct: 265 LEKIKDLKEKAKKLGISIILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGT 315
Score = 36.6 bits (85), Expect = 0.030
Identities = 23/100 (23%), Positives = 39/100 (39%), Gaps = 7/100 (7%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E K + S + KE++K ++ + + E DKEK EK K KE+ +K
Sbjct: 221 EEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGI 280
Query: 163 K-------EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
K + +K++ + + I S K K
Sbjct: 281 SIILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGTSGKPK 320
Score = 33.9 bits (78), Expect = 0.21
Identities = 17/93 (18%), Positives = 38/93 (40%), Gaps = 10/93 (10%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-KEKGKKEEKEKKE--EKE 164
KS + + +K + +K +K++ K + E ++ + +KEK E +
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271
Query: 165 KKEEKEFKMK-------EDKEKPVEKIRKEEKD 190
K++ K+ + + KI+ E+ D
Sbjct: 272 KEKAKKLGISIILFDDMTKNKTTNYKIQNEDPD 304
Score = 32.8 bits (75), Expect = 0.41
Identities = 17/74 (22%), Positives = 30/74 (40%), Gaps = 5/74 (6%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK- 165
D IK + K+EK + +K+ + ++ + E E + E +K
Sbjct: 208 DTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELE----DISLGPLEYDKE 263
Query: 166 KEEKEFKMKEDKEK 179
K EK +KE +K
Sbjct: 264 KLEKIKDLKEKAKK 277
>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1. MAT1
is an assembly/targeting factor for cyclin-dependent
kinase-activating kinase (CAK), which interacts with the
transcription factor TFIIH. The domain found to the
N-terminal side of this domain is a C3HC4 RING finger.
Length = 200
Score = 37.8 bits (88), Expect = 0.004
Identities = 26/105 (24%), Positives = 58/105 (55%), Gaps = 5/105 (4%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM--KE 175
++K + K+ KD ++K + + +E ++ + +KE KE+K +KEE+E KM ++
Sbjct: 67 EEKVDQYEKENKDSIMRNKRRLTREQEELEQALEEEKEMKEEKRLHLQKEEQEQKMAKEK 126
Query: 176 DKEKPVEKIRKEEKDSEK---EKKSKDKDKKLKKEKLKKKKKEKE 217
DK++ ++++ + + K + K + + EKL++KK+
Sbjct: 127 DKQEIIDELETSNLPANVIIAQHKKQSKQLESQVEKLERKKRVTF 171
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 38.9 bits (91), Expect = 0.005
Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 20/94 (21%)
Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK-------PVEKIRKEE---- 188
K DKE+ ++EK +KE ++++ K +++E K K++ EK P E +++E
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYS 605
Query: 189 ---------KDSEKEKKSKDKDKKLKKEKLKKKK 213
D++ E+ SK + KKL KE K+ K
Sbjct: 606 AFDETGLPTHDADGEEISKKERKKLSKEYDKQAK 639
Score = 30.8 bits (70), Expect = 1.5
Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
+K+ + +E EK KE KE+K K KK +++K K+ +K ++ E +
Sbjct: 545 WKLDDKEELQREKEEKEA---LKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQE 601
Query: 231 FKFGT 235
K+
Sbjct: 602 DKYSA 606
Score = 30.8 bits (70), Expect = 1.8
Identities = 32/121 (26%), Positives = 54/121 (44%), Gaps = 13/121 (10%)
Query: 95 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
V +D E++++K K+ K K+ K K +++K K KE +K +K K +
Sbjct: 544 VWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKK--------KELEKLEKAKIPPAE 595
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
K ++++ +E + + + K KE K KE DK KL +E L K K
Sbjct: 596 FFKRQEDKYSAFDETGLPTHDADGEEISK--KERKKLSKE---YDKQAKLHEEYLAKGGK 650
Query: 215 E 215
Sbjct: 651 S 651
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 38.1 bits (89), Expect = 0.005
Identities = 29/118 (24%), Positives = 50/118 (42%), Gaps = 6/118 (5%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
+ + ++E+ KKK+ K K K+K KK+ K K +KK E+
Sbjct: 73 EGEKELQREERLKKKKRVKTKAYKEPTKKK---KKKDPTAAKSPKAAAPRPKKKSERISW 129
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 230
+ P RK + S + K ++ ++E +KK + K R EK+ +LT
Sbjct: 130 APTLLDSP---RRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELT 184
Score = 29.3 bits (66), Expect = 3.8
Identities = 17/93 (18%), Positives = 30/93 (32%), Gaps = 2/93 (2%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
+K + KK KK K K + +K +
Sbjct: 90 VKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKS--SRSSTV 147
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
+ KE ++KE + + + K K EK+K+
Sbjct: 148 QNKEATHERLKEREIRRKKIQAKARKRKEKKKE 180
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 37.7 bits (87), Expect = 0.005
Identities = 22/77 (28%), Positives = 47/77 (61%), Gaps = 1/77 (1%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK-EEKEFKMKEDKEKPVEKIR 185
KK D K K + +++ EK +E+ + EE+ + E +E+K +E+E + ++++E+ + I
Sbjct: 111 KKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170
Query: 186 KEEKDSEKEKKSKDKDK 202
++ DSE E +D+ +
Sbjct: 171 EQSDDSEHEIIEQDESE 187
Score = 37.0 bits (85), Expect = 0.011
Identities = 24/93 (25%), Positives = 56/93 (60%), Gaps = 3/93 (3%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
+S+KKD KKK+ I++++ +K + + + E+ E + +E+ K +E+E +EE+E++
Sbjct: 108 RSVKKDDI--KKKKSLIIRQEQIEKARQEREELEERMEWERREE-KIDEREDQEEQERER 164
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
E++ ++ + E I ++E ++E + +K
Sbjct: 165 EEQTIEEQSDDSEHEIIEQDESETESDDDKTEK 197
Score = 34.3 bits (78), Expect = 0.070
Identities = 20/81 (24%), Positives = 53/81 (65%), Gaps = 4/81 (4%)
Query: 103 EIKKDKS--IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK--EEKE 158
+IKK KS I++++ ++E+E+++++ + + +++ ++ +D+E++E+E+ ++ EE+
Sbjct: 114 DIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQS 173
Query: 159 KKEEKEKKEEKEFKMKEDKEK 179
E E E+ E + + D +K
Sbjct: 174 DDSEHEIIEQDESETESDDDK 194
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 37.0 bits (86), Expect = 0.006
Identities = 16/96 (16%), Positives = 40/96 (41%), Gaps = 2/96 (2%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
KKD I D + + +++ +++K K+KEK ++++ K + E +
Sbjct: 31 YEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
+ +++ F + E+ ++ K+ S
Sbjct: 91 VKATKKQLF--SSEYEQTSSSSESTSEEETKKTSSI 124
Score = 36.2 bits (84), Expect = 0.009
Identities = 21/97 (21%), Positives = 41/97 (42%), Gaps = 2/97 (2%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
+ D K + K EK + + ++E + KE +E+K K+ K K + +
Sbjct: 19 ENDGKLQVKPNRYEKKDIEINTDYLQEETELDKELFTPEEQK--KITFQKHKEKPEQEEL 76
Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ E +++ K K++L + E+ SS E
Sbjct: 77 KNQLFSENATENNTVKATKKQLFSSEYEQTSSSSEST 113
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 38.2 bits (89), Expect = 0.006
Identities = 38/187 (20%), Positives = 85/187 (45%), Gaps = 22/187 (11%)
Query: 60 EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK 119
+ S+E T Y +KD E + + T+S +S+K+ K +
Sbjct: 26 TEDCSEESTDSWSDMYEGLKDSESSSNSVPSLSLSSTASSLSDSSTYSRSLKEVKLERQA 85
Query: 120 KE--------KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-------KEEKE 164
+E K+ ++KK +K ++ + +E++KE++E E ++ KEK K ++
Sbjct: 86 QEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQA 145
Query: 165 KKEEKEFKMKEDKEK-------PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
K+ KE E + R ++ K++ + + KKLK+++ K++++ ++
Sbjct: 146 AKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRK 205
Query: 218 RSSHEKE 224
+ ++E
Sbjct: 206 QRKKQQE 212
Score = 32.8 bits (75), Expect = 0.32
Identities = 16/107 (14%), Positives = 60/107 (56%), Gaps = 1/107 (0%)
Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
+++K+ + + ++++ K+K ++ + K ++ K++ K K + E + +
Sbjct: 114 QEREKEREEAELRQRLAKEKYEEWCRQ-KAQQAAKQRTPKHKKEAAESASSSLSGSAKPE 172
Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
+E+ +K +++ ++ +++K+ +++ K+ KK++ ++++K+K
Sbjct: 173 RNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQK 219
Score = 32.4 bits (74), Expect = 0.38
Identities = 21/109 (19%), Positives = 56/109 (51%), Gaps = 3/109 (2%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE---EKEKKEE 168
K ++ +++E E ++ +K ++ + K + K+ K KKE E K E
Sbjct: 113 KQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPE 172
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+ +E K++ E K+ K +++++ + + ++ K+++ +++K++ E
Sbjct: 173 RNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERKQKAE 221
Score = 31.6 bits (72), Expect = 0.77
Identities = 20/82 (24%), Positives = 45/82 (54%), Gaps = 2/82 (2%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK--EEKEK 159
S+ K + +E+ KK+ + + K+K +++ +E++ +++ KK EE+E+
Sbjct: 157 AAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEER 216
Query: 160 KEEKEKKEEKEFKMKEDKEKPV 181
K++ E+ +K K + + KPV
Sbjct: 217 KQKAEEAWQKWMKNVKQRPKPV 238
Score = 29.7 bits (67), Expect = 3.2
Identities = 16/77 (20%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK--EKDKEKKEKEKGKKEEKEKK 160
++ KK+ K + +++ K + E + +K ++++ K+EE+ +K
Sbjct: 146 AKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRK 205
Query: 161 EEKEKKEEKEFKMKEDK 177
+ K+++EE+E K K ++
Sbjct: 206 QRKKQQEEEERKQKAEE 222
Score = 28.9 bits (65), Expect = 5.5
Identities = 22/115 (19%), Positives = 61/115 (53%), Gaps = 13/115 (11%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK---EKGKKEEKEKKE 161
K+ + + ++ K+K +E ++K + K + +K+ + K E+ +
Sbjct: 118 KEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQ 177
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
E+ KK +E+++K K++++++ ++E++ + + KK ++E+ +K+K E+
Sbjct: 178 EEAKKRLQEWELK--------KLKQQQQ--KREEERRKQRKKQQEEEERKQKAEE 222
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 38.5 bits (90), Expect = 0.006
Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 2/78 (2%)
Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
+++ K++K++ K++ +KK +K KK+ + + + + ++ E +KK K
Sbjct: 4 ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDD--DAFLAELISENQEAENKQNNKKKKK 61
Query: 199 DKDKKLKKEKLKKKKKEK 216
K KK KK +
Sbjct: 62 KKKKKKKKNLGEAYDLAY 79
Score = 37.8 bits (88), Expect = 0.011
Identities = 21/91 (23%), Positives = 42/91 (46%), Gaps = 3/91 (3%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK---IKNKEKDKEKKEKEKGKKEEKEKKE 161
+ ++ K +K K++ K K KK KKD D + + ++ + K ++K+KK+
Sbjct: 4 ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
+K+KK+ + PV +D+
Sbjct: 64 KKKKKKNLGEAYDLAYDLPVVWSSAAFQDNS 94
Score = 36.6 bits (85), Expect = 0.022
Identities = 15/92 (16%), Positives = 40/92 (43%), Gaps = 2/92 (2%)
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
E + E K++K++ K K K+K + + + D + ++++ + ++ KK
Sbjct: 1 AMNENGENEVKQQKQQN--KQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKK 58
Query: 213 KKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 244
KK+K++ +K + + + +
Sbjct: 59 KKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAF 90
Score = 35.8 bits (83), Expect = 0.047
Identities = 20/89 (22%), Positives = 40/89 (44%), Gaps = 3/89 (3%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
+ + K K++ K+K KKK+KK K + + D + + +E E K+ +KK++K+
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDV---DDDDAFLAELISENQEAENKQNNKKKKKKK 63
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
K K+ + + +D
Sbjct: 64 KKKKKKNLGEAYDLAYDLPVVWSSAAFQD 92
Score = 32.8 bits (75), Expect = 0.36
Identities = 19/89 (21%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDK 202
+ + + K+++++ K++ KK+ K+ K D + + ++ E +++E ++ +K K K
Sbjct: 2 MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61
Query: 203 KLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
K KK+K K + + + V F
Sbjct: 62 KKKKKKKKNLGEAYDLAYDLPVVWSSAAF 90
Score = 32.4 bits (74), Expect = 0.57
Identities = 17/85 (20%), Positives = 39/85 (45%)
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
++ K++++ K+K K K+K K+ D + + E +E + ++ K++K+ K K+
Sbjct: 8 NEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKK 67
Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKD 201
K+ E + +D
Sbjct: 68 KKNLGEAYDLAYDLPVVWSSAAFQD 92
Score = 30.5 bits (69), Expect = 1.9
Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 19/164 (11%)
Query: 54 DKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE-IKKDKSIKK 112
+ ++K +K+K + K+ + SK + L E I +++ +
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDD--------------DAFLAELISENQEAEN 52
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
+ KKK+K+K KKKK+ + + ++ + E+E+K
Sbjct: 53 KQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAF----QDNSHIRKLGNWPEQEWK 108
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
+ PV K K+ + E + + + +K++ EK
Sbjct: 109 QTQPPTIPVSKQFKDGEYPVGEIQEYPGENSSRTSSEEKRELEK 152
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 38.2 bits (89), Expect = 0.008
Identities = 22/62 (35%), Positives = 40/62 (64%), Gaps = 2/62 (3%)
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
EK+ EKE +E++E+ EK KEE++ +K+ + +++ K+EK KK KK KE ++ +
Sbjct: 28 EKEVEKEVPDEEEEEEKEEK--KEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
Query: 224 EV 225
+
Sbjct: 86 LL 87
Score = 36.7 bits (85), Expect = 0.023
Identities = 25/65 (38%), Positives = 39/65 (60%), Gaps = 8/65 (12%)
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
K+ EKE +E+E++E++E K +E+K E EE D E+EK+ K K K +K+K+
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKE----EEVDEEEEKEEKKK----KTKKVKET 80
Query: 213 KKEKE 217
E E
Sbjct: 81 TTEWE 85
Score = 35.5 bits (82), Expect = 0.049
Identities = 28/67 (41%), Positives = 44/67 (65%), Gaps = 4/67 (5%)
Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
KE EK +E+E++E++EKKEE+E DKE+ V++ +EEK+ +K+K K K+ +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEE--KTTDKEEEVDE--EEEKEEKKKKTKKVKETTTEW 84
Query: 207 EKLKKKK 213
E L K K
Sbjct: 85 ELLNKTK 91
Score = 34.0 bits (78), Expect = 0.16
Identities = 18/65 (27%), Positives = 44/65 (67%), Gaps = 1/65 (1%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE-D 176
K+ EKE ++++++ ++K + +EK +K+E+ ++E++EKK++ +K +E + + +
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 177 KEKPV 181
K KP+
Sbjct: 89 KTKPI 93
Score = 32.8 bits (75), Expect = 0.36
Identities = 21/63 (33%), Positives = 40/63 (63%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
K KE E++E+EK +K+E+E+K +++E E + KE+K+K +K+++ + E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLN 88
Query: 196 KSK 198
K+K
Sbjct: 89 KTK 91
Score = 29.0 bits (65), Expect = 5.7
Identities = 20/84 (23%), Positives = 41/84 (48%)
Query: 88 MVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 147
V V+ ++ E +K++ ++++KT K+E+ +++K++K K K KE E +
Sbjct: 26 WVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEF 171
K K ++ K+E F
Sbjct: 86 LLNKTKPIWTRNPKDVTKEEYAAF 109
>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD. This
family consists of several putative bacterial stage IV
sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
is known to be essential for efficient sporulation
although its exact function is unknown.
Length = 383
Score = 38.1 bits (89), Expect = 0.008
Identities = 29/109 (26%), Positives = 51/109 (46%), Gaps = 6/109 (5%)
Query: 111 KKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KE 167
+ T +K K + KK K + ++ DKE+ EK K + +E E
Sbjct: 261 NFEVLTGEKITKHYLSIGGKKLPLKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYE 320
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD---KDKKLKKEKLKKKK 213
++ +K KE+ VEK +K K +++ KD KD+K+ E+++ K
Sbjct: 321 VQDKVVKLTKEEAVEKGKKLAKKKLEKEIDKDAKIKDEKVLHERVENGK 369
Score = 29.3 bits (66), Expect = 4.7
Identities = 25/88 (28%), Positives = 45/88 (51%), Gaps = 2/88 (2%)
Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKE-KGKKEEKEKKEEKEKKE 167
+K K +K +KE+ +K K K I +E E ++K K KEE +K +K K+
Sbjct: 284 LKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVEKGKKLAKK 343
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
+ E ++ +D + EK+ E ++ K +
Sbjct: 344 KLEKEIDKDAKIKDEKVLHERVENGKVR 371
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 36.2 bits (84), Expect = 0.010
Identities = 24/74 (32%), Positives = 45/74 (60%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+D+ ++ ++ K+K++EK KK+ K+ K K K K+K K KK +K +KE + EE
Sbjct: 63 TEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESS 122
Query: 165 KKEEKEFKMKEDKE 178
+EE+ + K+++
Sbjct: 123 DEEEEGEEDKQEEP 136
Score = 34.3 bits (79), Expect = 0.039
Identities = 22/73 (30%), Positives = 46/73 (63%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
+ +++K K ++KT KK+ K + KK+K KK K K +K++++ K + ++E++ E
Sbjct: 70 QKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGE 129
Query: 163 KEKKEEKEFKMKE 175
++K+EE M++
Sbjct: 130 EDKQEEPVEIMEK 142
Score = 33.9 bits (78), Expect = 0.067
Identities = 25/84 (29%), Positives = 50/84 (59%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
D+K K+ E E+ ++K+++K + + K + K++K+K KK++K+K ++ KKEEKE
Sbjct: 56 DEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGS 115
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKK 196
++ E+ +E+K E +
Sbjct: 116 KSSEESSDEEEEGEEDKQEEPVEI 139
Score = 31.6 bits (72), Expect = 0.32
Identities = 27/81 (33%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 147 KEKEKGKKEEKEKKEEKEKKE---EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+E E+ + +++ K+E E +E ++E K ++D+EK +K K +K +K+KK K K
Sbjct: 47 REYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKG 106
Query: 204 LKKEKLKKKKKEKERSSHEKE 224
KKE+ + K +E S E+E
Sbjct: 107 NKKEEKEGSKSSEESSDEEEE 127
Score = 28.5 bits (64), Expect = 3.7
Identities = 22/87 (25%), Positives = 52/87 (59%)
Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
D+K K + +++E ++++EK++ +E+ KK K +K++++ K K+ +K +K KE
Sbjct: 56 DEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGS 115
Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKEK 216
S +E ++++ + K++ + EK
Sbjct: 116 KSSEESSDEEEEGEEDKQEEPVEIMEK 142
Score = 27.3 bits (61), Expect = 8.9
Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 8/97 (8%)
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E E+ E ++K +KE + +E ++K EK RK+E+ + K K K +K+K KKKKK+
Sbjct: 48 EYERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAK------KRAKRQKKKQKKKKKK 101
Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
K + + K K + ++ + K
Sbjct: 102 KAKK--GNKKEEKEGSKSSEESSDEEEEGEEDKQEEP 136
>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
[Posttranslational modification, protein turnover,
chaperones].
Length = 320
Score = 37.4 bits (86), Expect = 0.010
Identities = 19/115 (16%), Positives = 46/115 (40%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
DK K+ ++ +++ +K + K + K+ + KG EE+E +
Sbjct: 93 EIAVLDKLAKELLLGLSDEQGKQLIAEKPAFQDVEGKFDARAYLKRLRIKGLTEEQEAEA 152
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
++K + K+ K + + + K E +++ K + K + K++
Sbjct: 153 LRDKLQNKQQGKKVTEVQARHILVKAEAKAKEALALLKKGVREAKADFAELAKKQ 207
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 37.6 bits (87), Expect = 0.013
Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 6/64 (9%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE--KEKKEEK 169
K K+K KEK K+ K I++ + E + ++EEK K++K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIK----IESPSVETEGERCTIKQREEKGIDAPAILNVKKKK 847
Query: 170 EFKM 173
+K+
Sbjct: 848 PYKV 851
Score = 36.5 bits (84), Expect = 0.032
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 6/65 (9%)
Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
T K + K +K K+K DK E + E E+ +++E+K +
Sbjct: 790 TIKAAARAK-QKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDA-----PAILNV 843
Query: 176 DKEKP 180
K+KP
Sbjct: 844 KKKKP 848
Score = 34.5 bits (79), Expect = 0.13
Identities = 29/143 (20%), Positives = 53/143 (37%), Gaps = 19/143 (13%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK--PVEKIRKEEKDS 191
K + K+K KEK +K K E E + E K +E+K P K++K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVE--TEGERCTIKQREEKGIDAPAILNVKKKKPY 849
Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV- 250
+ + ++ ++ + K + +P FK G + + + ++
Sbjct: 850 KVDNMTRILPQQSRYISFIKDDR----------FVPVRKFKGGVVVLRDREPKEPVALIE 899
Query: 251 ----IKPVKSPSPPPVAAADEYD 269
+K V +P P P D D
Sbjct: 900 TVRQMKDVNAPLPTPFKVDDNVD 922
Score = 29.9 bits (67), Expect = 3.1
Identities = 27/127 (21%), Positives = 44/127 (34%), Gaps = 7/127 (5%)
Query: 27 HPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVT 86
PLI V P K E + +E P+ E + K V T
Sbjct: 727 FPLIH--FVSLSFLPTTVIGIRGSQAIPKFCFNEDLEEEEFEYPRMYEEASGKSVRKVNT 784
Query: 87 PMVPNIVKVTSSEDLGEIKK---DKSIKKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEK 142
++ +K + +K DK IK + + + E E+ K +++K D
Sbjct: 785 AVLSTTIKAAARAKQKPKEKGPNDKEIKIESPS-VETEGERCTIKQREEKGIDAPAILNV 843
Query: 143 DKEKKEK 149
K+K K
Sbjct: 844 KKKKPYK 850
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 36.8 bits (85), Expect = 0.013
Identities = 18/80 (22%), Positives = 37/80 (46%), Gaps = 1/80 (1%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
K++ K+KEK +++ + K+ + K +EK + ++K G + +++ K
Sbjct: 160 RKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDEPPK 219
Query: 173 MKEDKEKPVEKIRKEEKDSE 192
K K E+ R E + E
Sbjct: 220 EKRQKHHDPER-RLEPQSHE 238
Score = 34.5 bits (79), Expect = 0.087
Identities = 22/113 (19%), Positives = 54/113 (47%), Gaps = 5/113 (4%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+LG++K + +K+ + +K+ + +K+ K K+K + ++ K K+++ K ++E+
Sbjct: 130 EIRNLGDVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEK 189
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
+ E+K + K+E EK +K D +++L+ +
Sbjct: 190 RRNDEDKRPGGGGGSSGGQSGLS-----TKDEPPKEKRQKHHDPERRLEPQSH 237
Score = 34.1 bits (78), Expect = 0.099
Identities = 28/110 (25%), Positives = 53/110 (48%), Gaps = 7/110 (6%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
D K +K EKE + K D + + + K+KEK + E+ +K +E +K+++E +
Sbjct: 136 DVKNAEKFEKECRALSRKKSDDEH-----RKRSGKQKEKRRVEDSQKHKEDRRKKQEEKR 190
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
++ ++P S +KD+ K K++K ++ E SHE
Sbjct: 191 RNDEDKRPGGGGGSSGGQS--GLSTKDEPPKEKRQKHHDPERRLEPQSHE 238
Score = 28.3 bits (63), Expect = 7.5
Identities = 21/105 (20%), Positives = 41/105 (39%), Gaps = 20/105 (19%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
KE + +KK + K K+K+K + E + KE++ KK+E++++ +++
Sbjct: 145 KECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDED--------- 195
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ K E KEK + H+ E
Sbjct: 196 -------KRPGGGGGSSGGQSGLSTKDEP----PKEKRQKHHDPE 229
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 36.9 bits (86), Expect = 0.015
Identities = 26/105 (24%), Positives = 56/105 (53%), Gaps = 2/105 (1%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE--EKEKKEEKEKKEEKEFKMKEDKEKP 180
E+ + D+ K K E ++ K E + ++E +++KEE++ E +E +E ++
Sbjct: 189 EEAILQTDQALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQL 248
Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
+EK+ E + E++ + K ++E+L K+ + E S +KE+
Sbjct: 249 IEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQKEI 293
Score = 35.3 bits (82), Expect = 0.041
Identities = 21/90 (23%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
K E E+ K + + +++ ++ K+K++E+ + + + ++ K+ EK E + K+ ++
Sbjct: 204 KAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQ 263
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
E+ +E +E+++ KE K + + L+KE
Sbjct: 264 ERMLEHKLQEQEELLKE-GFKTEAESLQKE 292
Score = 29.6 bits (67), Expect = 2.8
Identities = 22/94 (23%), Positives = 52/94 (55%), Gaps = 6/94 (6%)
Query: 105 KKDKSIKKDKKTHKKKEKE-KIKKKKDKKDKDKIKNKEK---DKEKKEKEKGKKEEKEKK 160
K+K+I+ ++ + E E ++ ++K K+++ ++ +E+ + K+ EK + E ++
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
E+E+ E K++E +E E + E + +KE
Sbjct: 261 AEQERMLEH--KLQEQEELLKEGFKTEAESLQKE 292
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 37.1 bits (86), Expect = 0.015
Identities = 18/109 (16%), Positives = 36/109 (33%), Gaps = 9/109 (8%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKI---------KNKEKDKEKKEKEKGKKEEKEKKEEK 163
D+K KK K+ K +D + I K K +K ++ + K+
Sbjct: 10 DEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSS 69
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+ E +E KP + K K + ++ ++ +
Sbjct: 70 KPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPD 118
Score = 36.3 bits (84), Expect = 0.032
Identities = 27/120 (22%), Positives = 41/120 (34%), Gaps = 8/120 (6%)
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
S+ +K KK +K K + D+D ++ + + E+ EK K K+ KE +
Sbjct: 7 SLPDEKAPTKKPKKGDASKDSTEDDEDILEFLD-ELEQSEKAKPPKKPKEASRPGTPRNP 65
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 228
K K K EEK K K + K + E E P
Sbjct: 66 K----KSSKPTESSAASSEEK---PAKPRKSAESTRSSHPKSKAPSTESEEEEEPEETPD 118
Score = 36.3 bits (84), Expect = 0.032
Identities = 17/84 (20%), Positives = 30/84 (35%), Gaps = 2/84 (2%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKK--EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
E L E+++ + K KK + + KK K + + E+ K K
Sbjct: 36 EFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRS 95
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKP 180
K + E +E + E+ P
Sbjct: 96 SHPKSKAPSTESEEEEEPEETPDP 119
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 36.5 bits (85), Expect = 0.016
Identities = 23/58 (39%), Positives = 34/58 (58%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
KD + KE++ I KK K D I +++++K+K K+E+K KEEKEK EE
Sbjct: 65 KDFRKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
Score = 28.0 bits (63), Expect = 8.0
Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 126 KKKKDKKDKDKIKNKEK-----DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
+K K++K IK K + E+EK KK+ K+E+K KEEKE +
Sbjct: 68 RKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase. The valyl-tRNA synthetase
(ValS) is a class I amino acyl-tRNA ligase and is
particularly closely related to the isoleucyl tRNA
synthetase [Protein synthesis, tRNA aminoacylation].
Length = 861
Score = 37.3 bits (87), Expect = 0.017
Identities = 23/117 (19%), Positives = 49/117 (41%), Gaps = 10/117 (8%)
Query: 42 PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVEL--GVTPMVPNIVKVTSSE 99
PP+ + + E + + K +VE+ + +V++
Sbjct: 746 PPNAPLKVLL---IYTEAETAERLKLNAVDIKGAINFSEVEVVIEKPEVTEAVVELVPGF 802
Query: 100 DL-----GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
++ G I K K + + +K K++KE I+ + +++ +K K+ +KEKEK
Sbjct: 803 EIIIPVKGLINKAKELARLQKQLDKEKKEVIRIEGKLENEGFVKKAPKEVIEKEKEK 859
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 35.8 bits (83), Expect = 0.017
Identities = 19/89 (21%), Positives = 42/89 (47%), Gaps = 1/89 (1%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+K E E K++++ +K +KE K ++ + K + + K E + +KEK
Sbjct: 40 PQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLV 99
Query: 198 KDKDKK-LKKEKLKKKKKEKERSSHEKEV 225
+KK + EK +++ +E +++
Sbjct: 100 NAFNKKQQEYEKDLNRREAEEEQKLLEKI 128
Score = 30.4 bits (69), Expect = 1.2
Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 4/87 (4%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
S DL K K+ K+ K +++ K K+ K + D ++ K + E +K K
Sbjct: 46 SADLESEFK----KRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNA 101
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKI 184
K+++E +++ + E+++K +EKI
Sbjct: 102 FNKKQQEYEKDLNRREAEEEQKLLEKI 128
Score = 29.3 bits (66), Expect = 2.3
Identities = 23/94 (24%), Positives = 44/94 (46%), Gaps = 8/94 (8%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
KK ++ + K+ K +++ +K+ + KE + + + K + K E E K K
Sbjct: 42 AKKVSADLESEFKKRQK-ELQKMQKELKAKEAKL-QDDGKMEALSDRAKAEAEIK----K 95
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
EK V K K E EK ++ + +++ L+K
Sbjct: 96 EKLVNAFNK--KQQEYEKDLNRREAEEEQKLLEK 127
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 34.9 bits (81), Expect = 0.018
Identities = 18/70 (25%), Positives = 36/70 (51%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
++ K+ KEK K +KK+++K + K + E+K + + +K EKE +E +
Sbjct: 4 EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63
Query: 165 KKEEKEFKMK 174
+ +E K +
Sbjct: 64 ELARRELKAE 73
Score = 29.1 bits (66), Expect = 1.7
Identities = 16/76 (21%), Positives = 42/76 (55%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+ ++EK KE + +++ K EK+ + K+ + + + K+ EEK + +K + + ++
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60
Query: 204 LKKEKLKKKKKEKERS 219
L+ E +++ K + +
Sbjct: 61 LEAELARRELKAEAKK 76
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 35.8 bits (83), Expect = 0.018
Identities = 27/69 (39%), Positives = 40/69 (57%), Gaps = 2/69 (2%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+ E E +KE K E++ + EE+E K K+ K++ K K+EK +KEK + K K
Sbjct: 127 ELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEV--KKEKKEKKDKKEKMVEPKGSK 184
Query: 204 LKKEKLKKK 212
KK+K KKK
Sbjct: 185 KKKKKKKKK 193
Score = 34.3 bits (79), Expect = 0.061
Identities = 21/77 (27%), Positives = 44/77 (57%)
Query: 84 GVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 143
G ++ S + E + ++K+ + ++++KEK KKK+ KK+K + K+K++
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEK 176
Query: 144 KEKKEKEKGKKEEKEKK 160
+ + K KK++K+KK
Sbjct: 177 MVEPKGSKKKKKKKKKK 193
Score = 33.5 bits (77), Expect = 0.12
Identities = 23/76 (30%), Positives = 43/76 (56%), Gaps = 1/76 (1%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+ D + + +K K +++ + ++++K K+K K KE KEKKEK+ K++
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEK-KEKKKKKEVKKEKKEKKDKKEKM 177
Query: 157 KEKKEEKEKKEEKEFK 172
E K K+KK++K+ K
Sbjct: 178 VEPKGSKKKKKKKKKK 193
Score = 33.2 bits (76), Expect = 0.17
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 7/68 (10%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+ EK+ K +++ + EE+EKKE+K+KKE K KEK +K +KE+ K K
Sbjct: 133 ETSEKETTAKVEKEAEVEEEEKKEKKKKKEVK-------KEKKEKKDKKEKMVEPKGSKK 185
Query: 198 KDKDKKLK 205
K K KK K
Sbjct: 186 KKKKKKKK 193
Score = 32.8 bits (75), Expect = 0.22
Identities = 19/64 (29%), Positives = 39/64 (60%)
Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
+ + EK K +K+ + + ++K+K+KK++ K +K+EK+ K+EK + + K K+
Sbjct: 129 GSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKK 188
Query: 176 DKEK 179
K+K
Sbjct: 189 KKKK 192
Score = 32.4 bits (74), Expect = 0.28
Identities = 20/67 (29%), Positives = 41/67 (61%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
+ + + +K+ KE + E++EK++ KK+++ KKE+KEKK++KE ++ K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Query: 181 VEKIRKE 187
+K +K+
Sbjct: 186 KKKKKKK 192
>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A. This protein is encoded
by an open reading frame in plasmid borne DNA repeats of
Borrelia species. This protein is known as ORF-A. The
function of this putative protein is unknown.
Length = 285
Score = 36.6 bits (85), Expect = 0.019
Identities = 37/143 (25%), Positives = 59/143 (41%), Gaps = 26/143 (18%)
Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK---------------------E 148
+ KK + + K+KK ++ K ++ K K K +
Sbjct: 85 LNYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIK 144
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKM-----KEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+E KK K ++ K KK K+ K+ K K ++ I+ EE K KK K+K K
Sbjct: 145 EESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKT 204
Query: 204 LKKEKLKKKKKEKERSSHEKEVI 226
LK K KE E+ ++K+ I
Sbjct: 205 LKNILYNLKDKELEKKGYKKKQI 227
Score = 28.9 bits (65), Expect = 5.9
Identities = 28/114 (24%), Positives = 49/114 (42%), Gaps = 1/114 (0%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
K K + +K K K++ KK K K K K+ +K K K+ + K
Sbjct: 121 NKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIK 180
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK-LKKEKLKKKKKEK 216
K K + K K ++ +K K+ K K+ +KK KK++++K ++
Sbjct: 181 LLKIIKIEENFLIKNKKLKNKQKTLKNILYNLKDKELEKKGYKKKQIEKFFQKI 234
Score = 28.5 bits (64), Expect = 7.7
Identities = 28/138 (20%), Positives = 55/138 (39%), Gaps = 9/138 (6%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKI---KKKKDKKDKDKIKNKEKDKEKKEKEKG 152
T L +++K+ + + H K K KK+ +I NK ++K E+ K
Sbjct: 51 TLQNYLYKLEKEFKVTTNYYKHLGKNNGTEIYYKLNYQKKECYEIINKYFKEKKIERFKK 110
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+ KK+ + ++ + K K ++E K K+ K + K
Sbjct: 111 RVNAYFKKKSNKNGSVEKNECIN------NKNNVINKKIKEESKKKNSKNIEKLKIKKYF 164
Query: 213 KKEKERSSHEKEVIPKLT 230
KK +++ +K++ KL
Sbjct: 165 KKCNFKTNLKKDIKIKLL 182
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 37.0 bits (86), Expect = 0.019
Identities = 38/198 (19%), Positives = 89/198 (44%), Gaps = 20/198 (10%)
Query: 59 KEKMSSKEKTSPKEKEYSKVKDVELGV---TPMVPNIVKVT----SSEDLGEIKKDKSIK 111
K K+ E+ P+ + V+ ++ V P+ + KV E G ++++
Sbjct: 358 KPKLEKVERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREI 417
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+ KK +E +++ +++ + K + +E +E EK + + E ++ + ++++E
Sbjct: 418 TVYEKRIKKLEETVERLEEENSELKRELEELKREI-EKLESELERFRREVRDKVRKDREI 476
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
+ ++ +I + EK+ E++KK ++ ++ E K +K E +V+ KLT
Sbjct: 477 RARD------RRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLT- 529
Query: 232 KFGTDMEEKTKRESSPKI 249
+E + E I
Sbjct: 530 -----LEAIEEAEEEYGI 542
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 36.5 bits (85), Expect = 0.020
Identities = 19/70 (27%), Positives = 36/70 (51%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+ E+ K I + K K ++ K KK++ ++D + +++ + K K ++E KK+
Sbjct: 204 PPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263
Query: 157 KEKKEEKEKK 166
KE K K K
Sbjct: 264 KESKGVKALK 273
Score = 35.4 bits (82), Expect = 0.042
Identities = 18/72 (25%), Positives = 32/72 (44%)
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
++ K E K + ++ ++K+ + +ED E + K+ K E+ KK K K+ K
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268
Query: 205 KKEKLKKKKKEK 216
K K K
Sbjct: 269 VKALKKVVAKGM 280
Score = 35.0 bits (81), Expect = 0.054
Identities = 26/83 (31%), Positives = 37/83 (44%), Gaps = 6/83 (7%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
K+ K + K DK KE +K K+E +E E E + EK+ K KE E
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKE------EI 258
Query: 184 IRKEEKDSEKEKKSKDKDKKLKK 206
+K+ K+S+ K K K K
Sbjct: 259 KKKKPKESKGVKALKKVVAKGMK 281
Score = 32.7 bits (75), Expect = 0.30
Identities = 19/73 (26%), Positives = 36/73 (49%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
K+ K + K + K K +E +++ + E +++++ KEE +KK+ KE K
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268
Query: 174 KEDKEKPVEKIRK 186
+ +K V K K
Sbjct: 269 VKALKKVVAKGMK 281
Score = 32.7 bits (75), Expect = 0.31
Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 2/64 (3%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
DK +K+ KK KK+E E+ + + + + K K+KE+ K+KK KE + +K
Sbjct: 220 FKPLDKYLKESKK--KKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVA 277
Query: 163 KEKK 166
K K
Sbjct: 278 KGMK 281
Score = 31.9 bits (73), Expect = 0.54
Identities = 19/74 (25%), Positives = 36/74 (48%)
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
+ K+ K + + + ++ +K R+ E+D E + +K +K K+E KKK K
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264
Query: 215 EKERSSHEKEVIPK 228
E + K+V+ K
Sbjct: 265 ESKGVKALKKVVAK 278
Score = 30.4 bits (69), Expect = 1.6
Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 2/79 (2%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
+++ DL + + + K K KE KKK++ ++ + +K++K KE
Sbjct: 197 DLLCSYLPPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKE 256
Query: 151 KGKKEEKEKKEEKEKKEEK 169
+ KK K+ KE K K K
Sbjct: 257 EIKK--KKPKESKGVKALK 273
Score = 29.6 bits (67), Expect = 2.8
Identities = 14/54 (25%), Positives = 26/54 (48%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
KK + ++D + + + ++K K K++ K K ++K KK KG K
Sbjct: 228 KESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKKVVAKGMK 281
Score = 29.2 bits (66), Expect = 4.0
Identities = 26/107 (24%), Positives = 42/107 (39%), Gaps = 9/107 (8%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKD-------KEKKEKEKGKKEEKEKKEEKEKKEEKE 170
K KE+ ++ + D I N + + K+ K + K +K
Sbjct: 167 KSKEESLVRSALELDIPDDILNLLRLRYACDLLCSYLPPDLYKELLKSLLIPEFKPLDKY 226
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEK 216
K + K++ E+ E +S EKK K K++ K KK K K K
Sbjct: 227 LKESKKKKRETEED-VEAAESRAEKKRKSKEEIKKKKPKESKGVKAL 272
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is
thought to translocate protons through membrane (inner
membrane in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1)
subunits. This domain should not be confused with the ab
CF(1) proteins (in the head of the ATP synthase) which
are found in pfam00006.
Length = 132
Score = 34.9 bits (81), Expect = 0.020
Identities = 20/89 (22%), Positives = 43/89 (48%), Gaps = 2/89 (2%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
KI ++ +K + IK E+ ++ + E++ + E E K++ +K E+
Sbjct: 26 KILDERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIIN-NAKKEAQKLKEE 84
Query: 184 IRKE-EKDSEKEKKSKDKDKKLKKEKLKK 211
I E +KD+E+ +S + + +KE+
Sbjct: 85 ILAEAQKDAERLLESARAEIEQEKEQALA 113
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 37.1 bits (86), Expect = 0.021
Identities = 30/155 (19%), Positives = 58/155 (37%), Gaps = 5/155 (3%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
KE +K+ K+ K ++ + K + + K + K+ KK K+ E E+
Sbjct: 1163 KELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEE 1222
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE-KEVIPKLTFK----FG 234
E ++ + K K + KK K+KE+E + K+ +
Sbjct: 1223 TYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQS 1282
Query: 235 TDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 269
MEE K + + + S ++ +D+ D
Sbjct: 1283 AKMEETVKAVPARRAAARKKPLASVSVISDSDDDD 1317
Score = 30.2 bits (68), Expect = 3.1
Identities = 44/250 (17%), Positives = 93/250 (37%), Gaps = 32/250 (12%)
Query: 34 LVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVE-LGVTPMVPNI 92
L GT K+++ + +DK + ++ +KT+PK +KD++ L +
Sbjct: 1116 LSMAIGTLTLEKVQELCADRDKLN--IEVEDLKKTTPKSLW---LKDLDALEKELDKLDK 1170
Query: 93 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE---- 148
+ E ++++ + + KK ++ KK KK K E +E
Sbjct: 1171 EDAKAEEAREKLQRAA-ARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAM 1229
Query: 149 ---------KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD--SEKEKKS 197
K KG+ K+K K++E+E ++ + K++ ++ E+
Sbjct: 1230 ETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETV 1289
Query: 198 KDKDKKLKKEKLKKKKKEKERS---SHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPV 254
K + + K S + + +++ + E+ K++ K
Sbjct: 1290 KAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVS------LAERLKKKGGRKPAAANK 1343
Query: 255 KSPSPPPVAA 264
K + PP AA
Sbjct: 1344 K-AAKPPAAA 1352
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 37.0 bits (86), Expect = 0.021
Identities = 24/167 (14%), Positives = 58/167 (34%), Gaps = 6/167 (3%)
Query: 57 SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIK-KDKSIKKDKK 115
E K + +++K + + ++ +L ++ + +K+ +
Sbjct: 912 KKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSE 971
Query: 116 THK---KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
++ KK +++ + K KE + K+ ++ K+ KE + E +
Sbjct: 972 EYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSA 1031
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
K + E K +K K + + + K K +E S
Sbjct: 1032 SKIISSESTEL--SILKPLQKLKGLLLLENNQLQARYKALKLRRENS 1076
Score = 33.5 bits (77), Expect = 0.30
Identities = 31/128 (24%), Positives = 49/128 (38%), Gaps = 9/128 (7%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKD----KDKIKNKEKDKEKKEKEKGKKE-----EK 157
KSI K + + E E I+ KK + K + + KK E E
Sbjct: 891 VKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEY 950
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
K E K E E K+KE E+ + ++K + K+ + K KKE + K+
Sbjct: 951 VKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGA 1010
Query: 218 RSSHEKEV 225
K++
Sbjct: 1011 LQESTKQL 1018
Score = 31.2 bits (71), Expect = 1.3
Identities = 26/158 (16%), Positives = 50/158 (31%), Gaps = 13/158 (8%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
KK+ + + + E++ + K D K +K + E + E K + E E
Sbjct: 865 KKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLE 924
Query: 165 KKEE---------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
K E ++E K+ + K E E K K+ ++ + LKK
Sbjct: 925 FKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDL-LKKSTIL 983
Query: 216 KERSSHEKEVI---PKLTFKFGTDMEEKTKRESSPKIV 250
+ + K + + K +
Sbjct: 984 VREGNKANSELKNFKKELAELSKQYGALQESTKQLKEL 1021
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 35.6 bits (83), Expect = 0.023
Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 5/77 (6%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
K + ++ K E E+ K+ +E+KEE EK+ + E EK R+EE+ +EK+
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK-----REEEERQIEEKRH 168
Query: 198 KDKDKKLKKEKLKKKKK 214
D+ LKK+ + K +
Sbjct: 169 ADEIAFLKKQNQQLKSQ 185
Score = 29.5 bits (67), Expect = 2.9
Identities = 14/57 (24%), Positives = 32/57 (56%), Gaps = 7/57 (12%)
Query: 134 KDKIKNKEKDK---EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
+ +IK E++K EK+ E K E +K E+E+++ +E + ++ + ++K+
Sbjct: 126 EQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE----IAFLKKQ 178
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 36.6 bits (85), Expect = 0.024
Identities = 35/169 (20%), Positives = 66/169 (39%), Gaps = 34/169 (20%)
Query: 64 SKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK---- 119
+ + S E + GV ++VT D I + KD T K+
Sbjct: 424 AADNKSLGRFELDGIPPAPRGV-----PQIEVTFDIDANGILNVTA--KDLGTGKEQSIT 476
Query: 120 -KEKEKIKKKK-DKKDKDKIKNKEKDKEKKEKEKGKKE---------------EKEKKEE 162
K + ++ ++ +D N DK+ +E + + E K +EE
Sbjct: 477 IKASSGLSDEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEE 536
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
KEK EE ++E E E+I+ K ++ ++ +KL ++K ++
Sbjct: 537 KEKIEEAITDLEEALEGEKEEIKA------KIEELQEVTQKLAEKKYQQ 579
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 35.1 bits (81), Expect = 0.024
Identities = 15/55 (27%), Positives = 26/55 (47%)
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
E+ E+ ++E E + +SK+K K+ + +K+K K K PKL
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKL 153
Score = 35.1 bits (81), Expect = 0.026
Identities = 19/62 (30%), Positives = 32/62 (51%)
Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
D E++E E+ +E E+++ E K E + K K + KP + K + + +K K SK
Sbjct: 91 LSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Query: 199 DK 200
K
Sbjct: 151 PK 152
Score = 32.8 bits (75), Expect = 0.16
Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
++E+ +EE + E+E E + + KE K++ V K K EK+ K + K K K K
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKP-KTEKEKPKTEPKKPKPSKPK 152
Score = 32.8 bits (75), Expect = 0.17
Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 15/91 (16%)
Query: 85 VTPMV----------PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK 134
V PMV P + K+ L ++ +++ T + ++++ + K + K+K
Sbjct: 67 VGPMVQQYGPLVRNLPAMWKIFRE--LSSSDDEEEETEEESTDETEQEDPPETKTESKEK 124
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
K +E K K EKEK K E K+ K K K
Sbjct: 125 ---KKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 30.9 bits (70), Expect = 0.77
Identities = 14/59 (23%), Positives = 30/59 (50%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
D+ + E++ + +++ E K + +EK+K+E + K +++K K K K K
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Score = 28.2 bits (63), Expect = 5.2
Identities = 15/77 (19%), Positives = 29/77 (37%), Gaps = 11/77 (14%)
Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 242
KI +E S+ E++ +++ + E+ + + E +K +PK
Sbjct: 86 KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK-----------PKT 134
Query: 243 RESSPKIVIKPVKSPSP 259
+ PK K K P
Sbjct: 135 EKEKPKTEPKKPKPSKP 151
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 35.6 bits (83), Expect = 0.026
Identities = 22/58 (37%), Positives = 32/58 (55%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
KD + KKE I K +K D I + +++K+K K+E+K KEEK+K EE
Sbjct: 66 KDFRKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 35.2 bits (81), Expect = 0.029
Identities = 27/81 (33%), Positives = 46/81 (56%)
Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
KK ++ + KI + +K+ KE E K E++E EE+E+++E+E + ED E E +
Sbjct: 17 KKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVE 76
Query: 186 KEEKDSEKEKKSKDKDKKLKK 206
EE++ E E+ + D KK
Sbjct: 77 DEEEEEEDEEDNVDLKDIEKK 97
Score = 34.8 bits (80), Expect = 0.041
Identities = 24/89 (26%), Positives = 48/89 (53%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
KE E +K +K + D+++ + E++ E+ E + ++E E +EE+E+ EE +K+ ++K
Sbjct: 38 KENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+ I +D + KK +K K
Sbjct: 98 NINDIFNSTQDDNAQNLISKNYKKNEKSK 126
Score = 33.7 bits (77), Expect = 0.076
Identities = 26/98 (26%), Positives = 52/98 (53%), Gaps = 1/98 (1%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
+KE I K+ + +K ++ E+++E+ E+E + E+ E +EE + EE+E + +ED
Sbjct: 31 WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVD 90
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
++ I K+ + D + L + KK +K K+
Sbjct: 91 -LKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKK 127
Score = 32.5 bits (74), Expect = 0.24
Identities = 23/98 (23%), Positives = 51/98 (52%), Gaps = 6/98 (6%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
DK + K EK++ ++++++D+++I+ E ++++E + ++EE+E +E+
Sbjct: 31 WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVD 90
Query: 166 KEEKEFKMKED------KEKPVEKIRKEEKDSEKEKKS 197
++ E K D + I K K +EK KK+
Sbjct: 91 LKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKT 128
Score = 31.4 bits (71), Expect = 0.51
Identities = 31/104 (29%), Positives = 57/104 (54%), Gaps = 5/104 (4%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKE-----KGKKEEKEKKEEKEKKEEKEFKMKE 175
EK+ ++KKD+ + +K DKE KE K+E+ E++EE++++E +E + E
Sbjct: 10 EKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIE 69
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
D+E+ VE +EE+D E KD +KK + + + ++
Sbjct: 70 DEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQN 113
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 36.4 bits (84), Expect = 0.030
Identities = 29/105 (27%), Positives = 46/105 (43%), Gaps = 15/105 (14%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
D+ DK + E+ + EK+ K++K+K+EE++ EE + +K D E VE+I
Sbjct: 2 VDESDKKQQTIDEQSQISPEKQTPNKKDKKKEEEEQLSEE-DAMLKGDLELLVERI---- 56
Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
+D D L+ L KE +SS KF
Sbjct: 57 ---------QDPDIDLQNNSL-NMLKEVIKSSTSSMTAVPKPLKF 91
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 36.2 bits (84), Expect = 0.031
Identities = 41/162 (25%), Positives = 75/162 (46%), Gaps = 13/162 (8%)
Query: 55 KTSPKEKMSSKEKTSPKEKEYSKVKDVEL-GVTPMVPNIVKVTSSEDLGEIKKDKSIKKD 113
TS + E + + K EL V +P+ + EDL ++ + I+K
Sbjct: 113 STSGAVPETKDETPASAPSSIKEQKSSELKKVDSQLPDPRALLKGEDLKTVETAEEIEKS 172
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
K ++K K K+KKD +K ++ +K+ + KK+ K E ++ KEE + K+
Sbjct: 173 LKAMQQKWK---KRKKDLPNKQDLEE----YKKRLEAIKKKDIKNPLELQKIKEEFD-KL 224
Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKD----KKLKKEKLKK 211
K++ + +KI+ + D + +KK D KK + LK+
Sbjct: 225 KKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKAPQNDLKR 266
Score = 32.8 bits (75), Expect = 0.39
Identities = 32/138 (23%), Positives = 63/138 (45%), Gaps = 10/138 (7%)
Query: 89 VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
VP T + IK+ KS + K + + + K +D K + + EK K
Sbjct: 118 VPETKDETPASAPSSIKEQKSSELKKVDSQLPDPRALLKGEDLKTVETAEEIEKSL--KA 175
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE- 207
++ K+ K+ K+ EE + +++ K+K + K+ + +K K++ KLKKE
Sbjct: 176 MQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDI-------KNPLELQKIKEEFDKLKKEG 228
Query: 208 KLKKKKKEKERSSHEKEV 225
K K+K + ++ + +
Sbjct: 229 KADKQKIKSAKNDLQNDK 246
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 36.4 bits (85), Expect = 0.032
Identities = 35/195 (17%), Positives = 77/195 (39%), Gaps = 52/195 (26%)
Query: 63 SSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEK 122
K +EK+ VK +E ++K E+L +I+K+ +++ + + E
Sbjct: 66 LPKLNPLREEKKKVSVKSLE--------ELIKD-VEEELEKIEKEIKELEEEISELENEI 116
Query: 123 EKIKKKKD----------------------------KKDKDKIKNKEKDKEKKEKEKGKK 154
++++++ + +DK + E D E E K
Sbjct: 117 KELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDK 176
Query: 155 E--------EKEKKEEKEKK-EEKEFKMKE--DKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
KE +E E++ ++ F+ E ++ P E IR+ ++E + +K+++
Sbjct: 177 GYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIRE----IKEELEEIEKERE 232
Query: 204 LKKEKLKKKKKEKER 218
E+LK+ K+
Sbjct: 233 SLLEELKELAKKYLE 247
Score = 31.4 bits (72), Expect = 1.1
Identities = 23/132 (17%), Positives = 58/132 (43%), Gaps = 10/132 (7%)
Query: 58 PKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTH 117
P++K+ + S E + + V +V +V S+++ E K ++ +
Sbjct: 153 PEDKLEELKLESDVENVE-YISTDKGYVYVVV--VVLKELSDEVEEELKKLGFERLELEE 209
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+ E I++ K++ +E +KE++ + KE +K E+ + +++ ++
Sbjct: 210 EGTPSELIREIKEEL-------EEIEKERESLLEELKELAKKYLEELLALYEYLEIELER 262
Query: 178 EKPVEKIRKEEK 189
+ + K K +K
Sbjct: 263 AEALSKFLKTDK 274
Score = 29.9 bits (68), Expect = 3.5
Identities = 33/183 (18%), Positives = 69/183 (37%), Gaps = 33/183 (18%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
L ++ + + +E+ KK K ++ IK+ E++ EK EKE + EE+
Sbjct: 49 RSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEE 108
Query: 158 ------EKKEEKEKKEE----KEFKMKEDKEKP-------VEKIRKEEKDSEKEKKSKDK 200
E KE +++ E F + V + +++ + K + +
Sbjct: 109 ISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVEN 168
Query: 201 DKKLKKEKLK-------KKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKP 253
+ + +K K+ E E + KL F+ + + E +P +I+
Sbjct: 169 VEYISTDKGYVYVVVVVLKELSDEVE----EELKKLGFE-----RLELEEEGTPSELIRE 219
Query: 254 VKS 256
+K
Sbjct: 220 IKE 222
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 36.3 bits (84), Expect = 0.032
Identities = 29/120 (24%), Positives = 55/120 (45%), Gaps = 3/120 (2%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK--EKKEEKEFKMKEDKEKPVEKIR 185
K+ K + K + ++K++EK+E + K + K E K ++KE++ K K+
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424
Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
K E+D+ + K K + ++ KKEKE+ E + K+ F K++
Sbjct: 425 KSEEDTLDLEIKSIK-KYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITYKKKK 483
Score = 35.6 bits (82), Expect = 0.046
Identities = 33/127 (25%), Positives = 63/127 (49%), Gaps = 5/127 (3%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK- 159
+ E+ K+ S ++ K+ +KKE+EK + DK IKN+ K K KEE+ K
Sbjct: 360 ISELLKEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKI 419
Query: 160 --KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK--DKKLKKEKLKKKKKE 215
K EK +++ + ++K K+ +E K ++++K + + +K L+ K K
Sbjct: 420 TIKVEKSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITY 479
Query: 216 KERSSHE 222
K++ +
Sbjct: 480 KKKKGKD 486
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 35.9 bits (83), Expect = 0.036
Identities = 24/138 (17%), Positives = 53/138 (38%), Gaps = 2/138 (1%)
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+K+E+E++EE+++ K + E+ + E+ + ++E K K
Sbjct: 180 EKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKP 239
Query: 213 KKEKERSSHEKEVIPKLT-FKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTG 271
++S+ ++ P + K + K + S +V K +P + + T
Sbjct: 240 TSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKK-AAPESTSQSPSSAEPTS 298
Query: 272 DSKQVWICPACGVQDDGS 289
+S Q + D S
Sbjct: 299 ESPQTAGNSSLSSLGDYS 316
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 35.8 bits (83), Expect = 0.040
Identities = 20/100 (20%), Positives = 43/100 (43%), Gaps = 6/100 (6%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK---GKKEEKEKKEEKEKKEEKEFKMKEDK 177
E K + ++ K KD KD +K++E+ KK E+ E+K ++ E+
Sbjct: 60 ELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEI 119
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E +R E+ + ++ + L K++++ +
Sbjct: 120 ---AEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNL 156
Score = 29.6 bits (67), Expect = 3.3
Identities = 23/141 (16%), Positives = 59/141 (41%), Gaps = 3/141 (2%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK-EKKEKE 150
V+VT + + ++ K K E + +K +KK +++ K+ ++ E
Sbjct: 48 AVEVTKLAAVDSELRKFQTREAAKV-KDAEAQLLKDPAEKKREERKAAKKIERSTPSLIE 106
Query: 151 KGKKEEKEKKEE-KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
+ +E K+ EE E +++ ++E +E P E + +++ + + ++ L+
Sbjct: 107 RKTQEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNLINELLRAGLE 166
Query: 210 KKKKKEKERSSHEKEVIPKLT 230
+ K+ S + +
Sbjct: 167 LEILDMKDESYEDLRYFSEKL 187
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 34.0 bits (78), Expect = 0.040
Identities = 16/77 (20%), Positives = 46/77 (59%)
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
+K +EE +++++ E E K++E +++ E++RKE+++ +E++ K + + ++K+
Sbjct: 21 QKFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80
Query: 210 KKKKKEKERSSHEKEVI 226
+ +K ++ H +
Sbjct: 81 EDEKLQETWHEHNLALA 97
Score = 32.1 bits (73), Expect = 0.21
Identities = 21/85 (24%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
K +++ + K+K + E E+ K EE+EK+E +E ++EK +E + K +E + E+K
Sbjct: 22 KFSQEESRLTSKEKRRAEIEQ-KLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKE 215
+++ + + L + K +
Sbjct: 81 EDEKLQETWHEHNLALANFIRTKTK 105
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 34.8 bits (80), Expect = 0.040
Identities = 25/89 (28%), Positives = 45/89 (50%), Gaps = 7/89 (7%)
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK-------IRKEEKDSEKE 194
K ++KE+EK +E K +E+E+K E E KE + E+ ++E E++
Sbjct: 34 KKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEEEK 93
Query: 195 KKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
++ + K+ + K+KK KE + EK
Sbjct: 94 EEEAEDVKQQEVFSFKRKKPFKEMNLEEK 122
Score = 33.3 bits (76), Expect = 0.13
Identities = 28/93 (30%), Positives = 55/93 (59%), Gaps = 1/93 (1%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+ +++K K EKE+ K+ D+ K K + ++ + E+ EKEK + E E+++ E
Sbjct: 22 EAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEK-EAGEPEREDIAE 80
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
++E++E +E+KE+ E ++++E S K KK
Sbjct: 81 QQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKP 113
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 34.4 bits (80), Expect = 0.045
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
DK ++K EK ++K EK K K E+K+ K+E+ K
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 33.7 bits (78), Expect = 0.074
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
+KE E+ EKK EK K + ++ K K+++ KP
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 32.5 bits (75), Expect = 0.18
Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 2/35 (5%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 147
DKK K EK +KK +K K K K +E K
Sbjct: 64 DKKELKAWEKA--EKKAEKAKAKAEKKKAKKEEPK 96
Score = 29.4 bits (67), Expect = 1.9
Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 4/66 (6%)
Query: 125 IKKKKDKKDKDKIKN-KEKDKEKKEKEKGK---KEEKEKKEEKEKKEEKEFKMKEDKEKP 180
+KKK KK + +I + E+ K+ KE + K+E + E+ EKK EK E K+
Sbjct: 32 AQKKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91
Query: 181 VEKIRK 186
E+ +
Sbjct: 92 KEEPKP 97
Score = 28.6 bits (65), Expect = 3.3
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKI 137
K +KK K K K + KK K ++ K ++
Sbjct: 73 KAEKKAEKAKAKAEKKKAKKEEPKPRL 99
Score = 28.3 bits (64), Expect = 5.3
Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+KE K+ +K +K K EK K K ++K+ E +
Sbjct: 64 DKKELKAWEKAEK-KAEKAKAKAEKKKAKKEEPK 96
Score = 28.3 bits (64), Expect = 5.4
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK 140
K +K +K KK EK K K +K K K++ K +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPR 98
Score = 27.5 bits (62), Expect = 9.7
Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 11/71 (15%)
Query: 149 KEKGKKEEKE----KKEEKEKKEE-KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
K+KGKK E E +E K+ KE + + + + K K EK K +K+K K +K
Sbjct: 34 KKKGKKGELEITDLNEEYKDLKESLEAALLDKKELK---AWEKAEK---KAEKAKAKAEK 87
Query: 204 LKKEKLKKKKK 214
K +K + K +
Sbjct: 88 KKAKKEEPKPR 98
>gnl|CDD|235944 PRK07135, dnaE, DNA polymerase III DnaE; Validated.
Length = 973
Score = 35.8 bits (83), Expect = 0.046
Identities = 39/179 (21%), Positives = 67/179 (37%), Gaps = 24/179 (13%)
Query: 75 YSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK------KTHKKKEKEKIKKK 128
K K+ E+ + + + + + G K+K + K K + ++++
Sbjct: 92 SKKSKNKEIELNDLDSDNIIIIDHPKNGFYAKNKEQLELKNYYINSNDPKIENAVYVQER 151
Query: 129 K--DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK--------------EFK 172
K +D + +K K KE+ K + ++ E +EK EF
Sbjct: 152 KLLFAEDNEYLKILNKIGNNKEENSNFKF-FDFEKWFEDIDEKILKRTNYLVENINIEFP 210
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
KE + E D +K K+ KK +LK KER ++E VI KL F
Sbjct: 211 KKEFNLPDFDNNLGLESDLFLKKILKESVIN-KKAELKYYPNVKERINYEYSVIKKLKF 268
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 35.0 bits (81), Expect = 0.049
Identities = 20/78 (25%), Positives = 46/78 (58%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
++ ++K ++D++ ++K K K K K K KI+ KEK K +KE++ ++ E++
Sbjct: 28 DEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEED 87
Query: 159 KKEEKEKKEEKEFKMKED 176
E++ ++ + K++E+
Sbjct: 88 TPEDELAEKLRLRKLQEE 105
Score = 34.6 bits (80), Expect = 0.076
Identities = 24/67 (35%), Positives = 37/67 (55%), Gaps = 2/67 (2%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
E +++++EK K + K KK K K EEKE +E +EK + EE E E K +
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLR--ELEEDTPEDELAEKLR 98
Query: 201 DKKLKKE 207
+KL++E
Sbjct: 99 LRKLQEE 105
Score = 33.1 bits (76), Expect = 0.22
Identities = 19/67 (28%), Positives = 41/67 (61%), Gaps = 2/67 (2%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK--IR 185
+ ++K+++K K K K KK + +E+++ K EKE+K +E + +++ EK +R
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100
Query: 186 KEEKDSE 192
K +++S+
Sbjct: 101 KLQEESD 107
Score = 30.8 bits (70), Expect = 1.1
Identities = 19/70 (27%), Positives = 40/70 (57%), Gaps = 2/70 (2%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
KD D+++ + KE++K K + K+ + K E+++K ++E + K +E E+ E+
Sbjct: 34 KDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL--EEDTPED 91
Query: 189 KDSEKEKKSK 198
+ +EK + K
Sbjct: 92 ELAEKLRLRK 101
Score = 30.4 bits (69), Expect = 1.4
Identities = 21/65 (32%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKK 212
E++EK+EEK K K K K K EK + + + EK + ++D + EKL+ +
Sbjct: 41 EDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRLR 100
Query: 213 KKEKE 217
K ++E
Sbjct: 101 KLQEE 105
Score = 28.1 bits (63), Expect = 8.9
Identities = 13/48 (27%), Positives = 26/48 (54%)
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+++E ++ K + ++ + K K K +KEK K++K+EK E
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85
>gnl|CDD|222406 pfam13831, PHD_2, PHD-finger. PHD folds into an interleaved type
of Zn-finger chelating 2 Zn ions in a similar manner to
that of the RING and FYVE domains. Several PHD fingers
have been identified as binding modules of methylated
histone H3.
Length = 33
Score = 31.5 bits (72), Expect = 0.054
Identities = 11/33 (33%), Positives = 14/33 (42%), Gaps = 1/33 (3%)
Query: 291 PMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKC 323
P++ C C H C G V E W C +C
Sbjct: 2 PLLVCSKCSVAVHASCYG-VPELSDGPWLCRRC 33
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 35.5 bits (82), Expect = 0.056
Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 9/84 (10%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKK-EEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
KK + K ++ ++ K+E +KG K +K+ ++K KK + + + ++K
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVST------STAASQKK 237
Query: 190 DSEKEKKSKDKDKKL--KKEKLKK 211
S+ E K + + K+L K+ LKK
Sbjct: 238 SSDLESKLEAQSKELWSLKDDLKK 261
Score = 34.4 bits (79), Expect = 0.14
Identities = 22/81 (27%), Positives = 40/81 (49%), Gaps = 3/81 (3%)
Query: 104 IKKDKSIKK--DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
+KK S K + K++ K+ K+KKD DK K + D++ +++ E
Sbjct: 183 VKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLE 242
Query: 162 EKEKKEEKE-FKMKEDKEKPV 181
K + + KE + +K+D +K V
Sbjct: 243 SKLEAQSKELWSLKDDLKKHV 263
Score = 30.9 bits (70), Expect = 1.6
Identities = 19/70 (27%), Positives = 35/70 (50%), Gaps = 2/70 (2%)
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
KK + K+E+ E+++++ KK K K +K +K + + S S+ K L+
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKS-KKAKTDRDVSTSTAASQKKSSDLE 242
Query: 206 KEKLKKKKKE 215
KL+ + KE
Sbjct: 243 S-KLEAQSKE 251
>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
Validated.
Length = 535
Score = 35.4 bits (82), Expect = 0.057
Identities = 16/83 (19%), Positives = 37/83 (44%), Gaps = 3/83 (3%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
K IK K K + ++ K+ K ++ K + + + E+ K++ E+ +E +
Sbjct: 446 KGAKIKIQK--ALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQL 503
Query: 166 KEEKEFKMKEDKEKP-VEKIRKE 187
KE E + + +E+ ++
Sbjct: 504 KEATELQEFMINNEDLIEEAKEL 526
Score = 35.0 bits (81), Expect = 0.074
Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 4/87 (4%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
+ K K ++ K E + KEFK +K+K + E + E +K ++ +
Sbjct: 446 KGAKIKIQKALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQET---- 501
Query: 208 KLKKKKKEKERSSHEKEVIPKLTFKFG 234
+LK+ + +E + +++I + FG
Sbjct: 502 QLKEATELQEFMINNEDLIEEAKELFG 528
Score = 30.0 bits (68), Expect = 3.2
Identities = 21/97 (21%), Positives = 42/97 (43%), Gaps = 11/97 (11%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
+ KIK +K K ++ + KE EK+K + + E E + K+D ++
Sbjct: 446 KGAKIKIQKALK-------SAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKD-DEE 497
Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
V++ + +E +E ++D E+ K+ K
Sbjct: 498 VQETQLKEATELQEFMINNEDLI---EEAKELFGIKS 531
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 35.4 bits (81), Expect = 0.058
Identities = 30/133 (22%), Positives = 59/133 (44%), Gaps = 12/133 (9%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE---------KDKEK 146
+S E + S + + E + ++D D ++++ ++ K E
Sbjct: 459 RASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSET 518
Query: 147 KEKEKG-KKEEKEKKEEKEKKEEKEFK--MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
E +K K + +K++ E++EEK+ K M +K+K + K K ++E+ K KK
Sbjct: 519 SEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKK 578
Query: 204 LKKEKLKKKKKEK 216
+ K KK +K
Sbjct: 579 KQIAKQKKLDSKK 591
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 35.1 bits (81), Expect = 0.059
Identities = 23/116 (19%), Positives = 39/116 (33%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E + K K K +E +K KK D + K K +++++K + E
Sbjct: 98 EKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPS 157
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ K K R ++ S +K K + KK +K R
Sbjct: 158 STPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMR 213
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 35.5 bits (82), Expect = 0.060
Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 5/150 (3%)
Query: 73 KEYSKVKDVELGVTPMVPNIVKVTSSED-LGEIKKDKSIKKDKKTHKKKE----KEKIKK 127
K Y K K V V I + ED L E+ DK KK + K K+K+KK
Sbjct: 60 KAYQKAKRVSENVKTPRFYIKTLVMLEDFLNELWADKEAKKKMSKNNAKALNTLKQKVKK 119
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
+ + D + +E + + E+E+ +++ + + E ++E E+ E
Sbjct: 120 NNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR 179
Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
K+ ++E + D KK E+ K KK +E
Sbjct: 180 VKEDDEEDEDADLSKKDVLEEPKMFKKPEE 209
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 35.2 bits (81), Expect = 0.061
Identities = 28/104 (26%), Positives = 51/104 (49%), Gaps = 2/104 (1%)
Query: 111 KKDKKTHKKKEKE-KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
+ K K E KI KK D K+ + N + + E K++E EKK + K +E+
Sbjct: 34 PEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEE 93
Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKK 212
K K++ +KP +K + ++ ++ + + K +KLKK+
Sbjct: 94 IEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDKLKKE 137
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 35.3 bits (82), Expect = 0.062
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 166 KEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
KEE +++E KEK P +K E+ +++K K K KK K++ KK +K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 34.2 bits (79), Expect = 0.12
Identities = 16/63 (25%), Positives = 37/63 (58%)
Query: 93 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
V S +G+ K++ K+ ++ +K K KK+++KK + + K K++ K+ K+++K
Sbjct: 352 VDYFSGRYIGDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Query: 153 KKE 155
++
Sbjct: 412 GRK 414
Score = 33.0 bits (76), Expect = 0.33
Identities = 16/50 (32%), Positives = 34/50 (68%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
K++ ++I++ K+K K K +E+ K +K K+K K+++K KK +K+ ++
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 33.0 bits (76), Expect = 0.35
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 172 KMKEDKEKPVEKIR----KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
++KE+ K +E+I+ K K +EKK + + KK K++K KK+K+K R
Sbjct: 363 ELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.5 bits (72), Expect = 0.81
Identities = 15/48 (31%), Positives = 35/48 (72%), Gaps = 1/48 (2%)
Query: 123 EKIKKKKDKKDKD-KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
+++K++ +K+ ++ K K + K+K+E++K +K +K+KK +K+ K+ K
Sbjct: 362 DELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRK 409
Score = 28.4 bits (64), Expect = 8.4
Identities = 17/50 (34%), Positives = 36/50 (72%), Gaps = 1/50 (2%)
Query: 122 KEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
KE++ K+ ++ K+K K+K +EKK +++ KK++++KK +K KK+ ++
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 35.2 bits (82), Expect = 0.068
Identities = 21/87 (24%), Positives = 39/87 (44%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
D ++ E E+ +E E++EE E + E EK +EK + K
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQ 230
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+K +K+++K + + + KK K +E
Sbjct: 231 YKKLRKAQEKKVEGRLAQHKKYAKLRE 257
Score = 28.2 bits (64), Expect = 9.8
Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 5/59 (8%)
Query: 121 EKEKIKKKKDKKDKDKI----KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
E + + + +K+ K K +K K + KK E + K+ + +E K+KE
Sbjct: 204 ENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKLRE-KLKE 261
>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
(Spore_III_AF). This family represents the stage III
sporulation protein AF (Spore_III_AF) of the bacterial
endospore formation program, which exists in some but
not all members of the Firmicutes (formerly called
low-GC Gram-positives). The C-terminal region of these
proteins is poorly conserved.
Length = 185
Score = 34.2 bits (79), Expect = 0.068
Identities = 23/107 (21%), Positives = 51/107 (47%), Gaps = 9/107 (8%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKD---KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
E I + + + K +I+ ++ +E ++ + + E+K K+E K ++ E ++ ED
Sbjct: 55 SESSLIIENEIESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKVKDVEVEIDED 114
Query: 177 KEKPVEKIR------KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E I+ KEE +++ K + + K K++++E E
Sbjct: 115 LESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESE 161
Score = 29.6 bits (67), Expect = 2.6
Identities = 21/104 (20%), Positives = 48/104 (46%), Gaps = 10/104 (9%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKD--KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE- 175
+ +K++I+ + ++ + + ++K ++E G K + + E E E F +KE
Sbjct: 66 ESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKVKDVEVEIDEDLESNNFDIKEV 125
Query: 176 -------DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
KEK K+ D++ K +++++ + EK+K
Sbjct: 126 NVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESEEAEKIKNF 169
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 34.2 bits (79), Expect = 0.070
Identities = 21/83 (25%), Positives = 48/83 (57%), Gaps = 2/83 (2%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
+ K ++ + +E E+ +K E+E+ EE+E +EE E + ++ + +E +E KD ++
Sbjct: 5 EQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQA 64
Query: 198 K--DKDKKLKKEKLKKKKKEKER 218
+ + K+ ++E+ + KK E+
Sbjct: 65 EFENLRKRTEREREEAKKYAIEK 87
Score = 29.6 bits (67), Expect = 2.6
Identities = 18/83 (21%), Positives = 48/83 (57%), Gaps = 9/83 (10%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK----KEEKEKKEEKEFKMK 174
K++K + ++ ++ +K + +E ++E+ E+E +EE+++ + + E+ ++K + +
Sbjct: 4 KEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRAQ 63
Query: 175 EDKEKPVEKIRKE-EKDSEKEKK 196
+ E +RK E++ E+ KK
Sbjct: 64 AE----FENLRKRTEREREEAKK 82
Score = 28.8 bits (65), Expect = 3.7
Identities = 14/83 (16%), Positives = 48/83 (57%), Gaps = 2/83 (2%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
DK K ++ ++ E+ + ++++ ++++ + E ++E++E + + + +E ++K
Sbjct: 1 MSDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEE-LKDKY 59
Query: 165 KKEEKEFK-MKEDKEKPVEKIRK 186
+ + EF+ +++ E+ E+ +K
Sbjct: 60 LRAQAEFENLRKRTEREREEAKK 82
>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein. This family consists of several
eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
expression of functional F1-ATPase requires two proteins
encoded by the ATP11 and ATP12 genes. Atp11p is a
molecular chaperone of the mitochondrial matrix that
participates in the biogenesis pathway to form F1, the
catalytic unit of the ATP synthase.
Length = 250
Score = 34.6 bits (80), Expect = 0.073
Identities = 30/113 (26%), Positives = 49/113 (43%), Gaps = 12/113 (10%)
Query: 132 KDKDKIKNKEKDKEKK---EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
K + K+ K K+ + +K K++ EKKE KK + K K ++ + +
Sbjct: 3 KYRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLKPPKPAD 62
Query: 189 KDSEKEKKSKDKDKKLKKEKLKKK-KKEKE---RSSHEKE-----VIPKLTFK 232
K +K K L EK+K +E E R+ H K+ VIP T++
Sbjct: 63 KKKPFDKPFKPLSSILDVEKIKDLSAEEIEFIWRAYHAKKDSLCAVIPAETYE 115
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 34.9 bits (80), Expect = 0.075
Identities = 15/94 (15%), Positives = 44/94 (46%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
EI +++ + KK+E+ + + ++ + +D + E + ++EKE+ + EE + E
Sbjct: 61 EISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLRE 120
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
+ +++++ + + +K +K
Sbjct: 121 YNLFDRNNLQLEDNLQSLELQYEYSLNQLDKLRK 154
Score = 30.6 bits (69), Expect = 1.5
Identities = 17/97 (17%), Positives = 48/97 (49%), Gaps = 3/97 (3%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
+KE++ K+ K + ++ + E ++ KKEE+ +E E+ E+++ + +
Sbjct: 42 EKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELV 101
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKK 214
+ E+ KE+ ++E+ + ++ + +L+ +
Sbjct: 102 ELQEE--KEQLENEELQYLREYNLFDRNNLQLEDNLQ 136
Score = 29.1 bits (65), Expect = 4.7
Identities = 19/86 (22%), Positives = 40/86 (46%)
Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
+ E +KE++ K+ + + + + E E E E+ R ++ E EK+
Sbjct: 35 DSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDD 94
Query: 199 DKDKKLKKEKLKKKKKEKERSSHEKE 224
D D +L + + +K++ E E + +E
Sbjct: 95 DLDGELVELQEEKEQLENEELQYLRE 120
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 35.1 bits (81), Expect = 0.076
Identities = 23/84 (27%), Positives = 37/84 (44%), Gaps = 6/84 (7%)
Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK- 198
++ K +EK KK + KK K ++ K+ + D EK K + + +K+K
Sbjct: 730 APRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKK 789
Query: 199 --DKDKKLK---KEKLKKKKKEKE 217
K +K+ K K KKK E
Sbjct: 790 PSAKTQKIAAATKAKRAAKKKVAE 813
Score = 32.0 bits (73), Expect = 0.72
Identities = 11/44 (25%), Positives = 18/44 (40%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
+ EK+ K K + K K ++ K K+ K+K E
Sbjct: 770 RGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKVAE 813
Score = 30.9 bits (70), Expect = 1.5
Identities = 20/87 (22%), Positives = 33/87 (37%), Gaps = 8/87 (9%)
Query: 133 DKDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
D+ KI E+ + GK EK KK + KK K + + +K+ E +
Sbjct: 716 DERKIDFSLISSERAPRNVGKTAREKAKKGDAGKKGGK-------RRQVGKKVNFEPDSA 768
Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ +K K + KK K +
Sbjct: 769 FRGEKKAKPKAAKKDARKAKKPSAKTQ 795
Score = 30.5 bits (69), Expect = 2.4
Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 9/102 (8%)
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
D I +++ + KT ++K K+ KK K + K K E EK
Sbjct: 721 DFSLISSERAPRNVGKTAREKAKKGDAGKKGGKRRQVGK-------KVNFEPDSAFRGEK 773
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
K + + ++ K K+ K +KI K +++ K K +
Sbjct: 774 KAKPKAAKKDARKAKKPSAKT-QKIAAATK-AKRAAKKKVAE 813
>gnl|CDD|153365 cd07681, F-BAR_PACSIN3, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons 3 (PACSIN3). F-BAR
domains are dimerization modules that bind and bend
membranes and are found in proteins involved in membrane
dynamics and actin reorganization. Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins,
also called Synaptic dynamin-associated proteins
(Syndapins), act as regulators of cytoskeletal and
membrane dynamics. Vetebrates harbor three isoforms with
distinct expression patterns and specific functions.
PACSIN 3 or Syndapin III is expressed ubiquitously and
regulates glucose uptake in adipocytes through its role
in GLUT1 trafficking. It also modulates the subcellular
localization and stimulus-specific function of the
cation channel TRPV4. PACSIN 3 contains an N-terminal
F-BAR domain and a C-terminal SH3 domain. F-BAR domains
form banana-shaped dimers with a positively-charged
concave surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 258
Score = 34.5 bits (79), Expect = 0.076
Identities = 24/96 (25%), Positives = 49/96 (51%), Gaps = 3/96 (3%)
Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE---KEFKMKEDKEKPVEKIRKEE 188
+D +K++ +K+ K+ G +E KE +E K ++ K+ K E +K RK+E
Sbjct: 90 EDSEKVRAWQKEAFHKQMIGGFRESKEAEEGFRKAQKPWVKKLKEVESSKKGYHAARKDE 149
Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ ++ + D + +E+L+K + E+ + E E
Sbjct: 150 RTAQTRETHAKADSTVSQEQLRKLQDRVEKCTQEAE 185
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 34.3 bits (79), Expect = 0.081
Identities = 18/71 (25%), Positives = 37/71 (52%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
+ +KK KK K K+K +++ EK ++K K + E +E+++ +E
Sbjct: 118 SVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEE 177
Query: 168 EKEFKMKEDKE 178
E+E + +ED++
Sbjct: 178 EEEEEEEEDED 188
Score = 32.4 bits (74), Expect = 0.32
Identities = 18/70 (25%), Positives = 37/70 (52%), Gaps = 2/70 (2%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK--EKGKKEEKEKKEEKEKKEE 168
K KK K K K+ ++++ K +K+ KE E +E+++ +EE+E++EE
Sbjct: 125 KAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEE 184
Query: 169 KEFKMKEDKE 178
++ +D +
Sbjct: 185 EDEDFDDDDD 194
Score = 29.0 bits (65), Expect = 3.9
Identities = 20/67 (29%), Positives = 32/67 (47%)
Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
KK K K K K E+E+ E+ E+K K+ E E +ED++ E+
Sbjct: 122 INKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEE 181
Query: 186 KEEKDSE 192
+EE+D +
Sbjct: 182 EEEEDED 188
Score = 28.6 bits (64), Expect = 5.1
Identities = 26/99 (26%), Positives = 50/99 (50%), Gaps = 4/99 (4%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
DK KK K KE + K+ + +K +K K K++ EE+E
Sbjct: 90 DKYQKKRKIGPSIKEHPFDLELFPKELYSVMGINKKAGKKLALSKFKRKVGLFTEEEEDI 149
Query: 167 EEK----EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
+EK E K+KE + + V++ +++++ E+E++ +D+D
Sbjct: 150 DEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDED 188
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 33.5 bits (77), Expect = 0.082
Identities = 25/104 (24%), Positives = 49/104 (47%), Gaps = 5/104 (4%)
Query: 108 KSIKKDKKTHKKK---EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+ IK++ +KK+ E +KKK ++ D +K +++ K ++ + +K +E+
Sbjct: 57 EEIKEEALKYKKEFESAVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTI 116
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
E E K ++ +E E + +K+K KK KEK
Sbjct: 117 PNHLNEEVSNETKLT--QESSSDESPKEVKLATKNKTKKHDKEK 158
Score = 32.0 bits (73), Expect = 0.33
Identities = 17/82 (20%), Positives = 36/82 (43%), Gaps = 2/82 (2%)
Query: 91 NIVKVTSSEDLGEIKKD--KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
++ K E+L ++K IK + + +K + +++ N+ K ++
Sbjct: 76 SLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESS 135
Query: 149 KEKGKKEEKEKKEEKEKKEEKE 170
++ KE K + K KK +KE
Sbjct: 136 SDESPKEVKLATKNKTKKHDKE 157
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 33.7 bits (78), Expect = 0.083
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 2/64 (3%)
Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
I K K D ++K K+KK+KE EEK+KK EE++ K + I
Sbjct: 68 ISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKK--YVVSEERKEDQKAVDAALLAAI 125
Query: 185 RKEE 188
+K
Sbjct: 126 KKVP 129
Score = 29.8 bits (68), Expect = 1.7
Identities = 16/48 (33%), Positives = 27/48 (56%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
I K K + K+K+K+K KKK+ + ++K K +E+KE +K
Sbjct: 68 ISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQK 115
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 34.2 bits (79), Expect = 0.083
Identities = 14/38 (36%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
KD K ++ +E + KE+ + E EKKE K+KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEE-ESAEGEKKESKKKK 196
Score = 33.5 bits (77), Expect = 0.11
Identities = 12/36 (33%), Positives = 19/36 (52%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
K+ K E+ EE E +++E E + KE K+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
Score = 31.9 bits (73), Expect = 0.37
Identities = 11/37 (29%), Positives = 19/37 (51%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
K K ++ ++ + KE++ E KKE K+KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.0 bits (68), Expect = 1.7
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
K +E+ EE ++ K ++ + KKE KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 29.6 bits (67), Expect = 2.3
Identities = 15/39 (38%), Positives = 19/39 (48%), Gaps = 2/39 (5%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
KD +E E+ E KEE+ E EKKE K+ K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEES--AEGEKKESKKKK 196
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 35.1 bits (80), Expect = 0.091
Identities = 25/98 (25%), Positives = 58/98 (59%), Gaps = 7/98 (7%)
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS-EKEKKSKDKDKKLKKEKLKK 211
K + + EKE K +MK ++K +E++ +EE++ E+E+ + + ++L++E+L++
Sbjct: 432 KDHAERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLER 491
Query: 212 KKKEKERSSHEKEVIPKLTFKFGTDME----EKTKRES 245
++ E++R E++ + +L + +E EK +R S
Sbjct: 492 ERLERDRL--ERDRLDRLERERVDRLERDRLEKARRNS 527
Score = 29.7 bits (66), Expect = 4.5
Identities = 31/150 (20%), Positives = 74/150 (49%), Gaps = 23/150 (15%)
Query: 89 VPNIVKVTSSEDLGEI--------KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK 140
V NI +TS + + K D+ +D TH + DK ++ + +
Sbjct: 382 VTNITSITSVTSVASVASVASVPSKDDRKYPQDGATHCHAVNGHYGGRVDKDHAERARIE 441
Query: 141 EKDKEKK-------EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
+++ +K EK++ ++ E+E++E E++ +M+ + + +E+ R E + E+
Sbjct: 442 KENAHRKALEMKILEKKRIERLEREERERLERE-----RMERIERERLERERLERERLER 496
Query: 194 EKKSKDKDKKLKKE---KLKKKKKEKERSS 220
++ +D+ +L++E +L++ + EK R +
Sbjct: 497 DRLERDRLDRLERERVDRLERDRLEKARRN 526
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 34.5 bits (80), Expect = 0.100
Identities = 21/95 (22%), Positives = 42/95 (44%), Gaps = 13/95 (13%)
Query: 158 EKKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
E+K E+ ++E ++ ++ + K E IR+ + + K KK+K + K+L EKL+ + E+
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRL--EKLEARLAEE 297
Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 251
K + K+V+
Sbjct: 298 RPVEEGK----------PLAFRFPPPGKRLGKLVL 322
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 35.0 bits (81), Expect = 0.10
Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 4/79 (5%)
Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK---EKKSKDKDKKLKKE 207
E +EK E++E++ ++ + E +EKI K+ K ++ + S++ L
Sbjct: 456 LPSPELEEKLEKEERELLQKRSSELTDED-LEKIIKDSKKLKERQDQPDSEEDLATLPTL 514
Query: 208 KLKKKKKEKERSSHEKEVI 226
KL E++S E EV
Sbjct: 515 KLGDVPDPIEKTSLETEVS 533
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 34.6 bits (80), Expect = 0.10
Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 14/84 (16%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEE------------KEFKMKEDKEKPVEKIRKEE 188
E ++ KE E +E+K++KEE E + KE K E+ EKI K
Sbjct: 502 EIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKLPAEE-KEKIEKAV 560
Query: 189 KDSEKEKKSKDKDK-KLKKEKLKK 211
+ ++ K +D ++ K K E+L++
Sbjct: 561 AELKEALKGEDVEEIKAKTEELQQ 584
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 34.3 bits (79), Expect = 0.11
Identities = 29/128 (22%), Positives = 66/128 (51%), Gaps = 2/128 (1%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+E L E+K + K K ++ +++ + + ++ IK E+ E +++ EE
Sbjct: 156 ENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEE 215
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKKKK 214
+ +K + +EF+ +++ + +EK K + EK K ++K ++LK+ E++ +K K
Sbjct: 216 FVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKRREKREELKERAEEIYEKFK 275
Query: 215 EKERSSHE 222
E+ + E
Sbjct: 276 RGEKLTTE 283
Score = 32.3 bits (74), Expect = 0.43
Identities = 22/106 (20%), Positives = 59/106 (55%), Gaps = 1/106 (0%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE-KEKKEEKEKKEEKEF 171
K+ K+E E++K+K+D+ K+ + EK E K + +E+ +E +EE+++ E+
Sbjct: 13 LKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEVQ 72
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
++KE +++ K+++ K+ + K+ +++ + +++ +
Sbjct: 73 ELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIER 118
Score = 29.3 bits (66), Expect = 3.8
Identities = 19/95 (20%), Positives = 46/95 (48%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
E+ E++K+ ++ K+ +++ ++ + + ++K K + KEK K
Sbjct: 197 KLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKAAKR 256
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
++++E KE+ EE K K ++ E++ +K
Sbjct: 257 REKREELKERAEEIYEKFKRGEKLTTEELLLLQKA 291
Score = 28.5 bits (64), Expect = 6.2
Identities = 25/147 (17%), Positives = 69/147 (46%), Gaps = 24/147 (16%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-----DKIKNKEKD-------------- 143
E + + + ++ +EK ++ ++++D+ ++K K +
Sbjct: 35 EASELAEKRDELNAKVRELREKAQELREERDEINEEVQELKEKRDEINAKLQELRKEYRE 94
Query: 144 -KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK----EDKEKPVEKIRKEEKDSEKEKKSK 198
KEK+ + + E + ++ EK+ + E++ + V+KI++ K+ E KK+
Sbjct: 95 LKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKAL 154
Query: 199 DKDKKLKKEKLKKKKKEKERSSHEKEV 225
++++KLK+ K + + +K+ +++
Sbjct: 155 EENEKLKELKAEIDELKKKAREIHEKI 181
>gnl|CDD|153363 cd07679, F-BAR_PACSIN2, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons 2 (PACSIN2). F-BAR
domains are dimerization modules that bind and bend
membranes and are found in proteins involved in membrane
dynamics and actin reorganization. Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins,
also called Synaptic dynamin-associated proteins
(Syndapins), act as regulators of cytoskeletal and
membrane dynamics. Vetebrates harbor three isoforms with
distinct expression patterns and specific functions.
PACSIN 2 or Syndapin II is expressed ubiquitously and is
involved in the regulation of tubulin polymerization. It
associates with Golgi membranes and forms a complex with
dynamin II which is crucial in promoting vesicle
formation from the trans-Golgi network. PACSIN 2
contains an N-terminal F-BAR domain and a C-terminal SH3
domain. F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 258
Score = 33.9 bits (77), Expect = 0.11
Identities = 28/94 (29%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKE---EKEKKEEKEFKMKEDKEKPVEKIRKEE 188
+D +KIKN +K+ K+ G KE KE ++ + +K K+ K E +K KEE
Sbjct: 90 EDFEKIKNWQKEAFHKQMMGGFKETKEAEDGFRKAQKPWAKKLKEVEAAKKAYHTACKEE 149
Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
K + + + D L E+LKK + + E+ +
Sbjct: 150 KLATSREANSKADPALNPEQLKKLQDKVEKCKQD 183
>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
repair].
Length = 480
Score = 34.3 bits (79), Expect = 0.12
Identities = 20/120 (16%), Positives = 42/120 (35%)
Query: 93 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
V T +E + D +K K + K ++K+E+
Sbjct: 228 VPATIAEAKELLHADLYLKTLKSDERGSPYGTKVVYGGVGQLWVFLLSYKLQKKEEQTLR 287
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+ EKE ++ ++ E+ + + ++ K KD + + K KK + K+
Sbjct: 288 TRIEKELEKAEKSLEKLKGREFNCEKDARIAAEKILKDYSSVEFLEVDFKSKKKREEAKR 347
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 33.6 bits (77), Expect = 0.12
Identities = 16/79 (20%), Positives = 37/79 (46%), Gaps = 1/79 (1%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
IV + + E+ ++ + K K + + +K + K ++ +K++E+ K
Sbjct: 115 IVVMRVMSIVEEMLEENAKKGGKIDKMEADSTDLKARLRKAQIL-LEGLQKNQEELFKLL 173
Query: 152 GKKEEKEKKEEKEKKEEKE 170
K E ++ +KE ++KE
Sbjct: 174 DKYNELREQVQKESSKKKE 192
Score = 29.4 bits (66), Expect = 2.9
Identities = 16/62 (25%), Positives = 28/62 (45%), Gaps = 2/62 (3%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG--KKEEKEKKEEKEKKEEKEFKMKEDK 177
+E K K DK + D K + ++ + +G K +E+ K + E +E KE
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESS 188
Query: 178 EK 179
+K
Sbjct: 189 KK 190
Score = 29.0 bits (65), Expect = 4.3
Identities = 18/68 (26%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
+E EE KK K KM+ D ++RK + E +K++++ KL +K + +++
Sbjct: 125 EEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKL-LDKYNELREQV 183
Query: 217 ERSSHEKE 224
++ S +K+
Sbjct: 184 QKESSKKK 191
Score = 27.8 bits (62), Expect = 9.7
Identities = 10/68 (14%), Positives = 30/68 (44%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
E++ ++ KK K + + K + + + E ++ +++ K + + V+
Sbjct: 125 EEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQ 184
Query: 183 KIRKEEKD 190
K ++K+
Sbjct: 185 KESSKKKE 192
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 34.3 bits (79), Expect = 0.12
Identities = 24/102 (23%), Positives = 45/102 (44%), Gaps = 1/102 (0%)
Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDK-EKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
+ E K KK K KK K K + K+ D+ E + + +G + ++E +
Sbjct: 266 YYDVSEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDS 325
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
K K ++ +D + + S K ++L ++K KK + E
Sbjct: 326 PKKRKEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPED 367
Score = 33.6 bits (77), Expect = 0.23
Identities = 28/148 (18%), Positives = 58/148 (39%), Gaps = 5/148 (3%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG-----KKEEK 157
E+ + + KK+ + KKK + + +++ E K+K+
Sbjct: 172 ELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGS 231
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E ++K++E + K+K + + + + K KK KK+K KKKK+ K+
Sbjct: 232 TDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKD 291
Query: 218 RSSHEKEVIPKLTFKFGTDMEEKTKRES 245
E E + + + + E+
Sbjct: 292 LDEDELEPEAEGLGSSDSGSRKDVEEEN 319
Score = 31.7 bits (72), Expect = 0.84
Identities = 35/157 (22%), Positives = 62/157 (39%), Gaps = 16/157 (10%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNK--------EKDKEKKEK--------EKGKKE 155
+K+ KK + K KK D D NK E+ + KK+K G +
Sbjct: 174 VEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTD 233
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
++ +K+ +E K++ + + E + D + K K KK KK+K ++K +
Sbjct: 234 DEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLD 293
Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 252
++ E E + D+EE+ R K
Sbjct: 294 EDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRK 330
Score = 29.7 bits (67), Expect = 3.7
Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 3/94 (3%)
Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP--VEKIRKEEKD 190
K + ++ ++ E KK ++E E K K+EE K+ + +EK K+ +
Sbjct: 23 PGAKPGSTKESRDAAAYENWKKRQEEA-EAKRKREELREKIAKAREKRERNSKLGGIKTL 81
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
E + D LKK K ++KKKE ER
Sbjct: 82 GEDDDDDDDTKAWLKKSKKRQKKKEAERKKALLL 115
Score = 29.0 bits (65), Expect = 6.1
Identities = 21/98 (21%), Positives = 46/98 (46%), Gaps = 11/98 (11%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
+ KK+++E K+K ++ ++KI + +E+ K G K E ++
Sbjct: 40 ENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDD-----------D 88
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+D + ++K +K +K E E+K + +KE+ +
Sbjct: 89 DDTKAWLKKSKKRQKKKEAERKKALLLDEKEKERAAEY 126
Score = 28.6 bits (64), Expect = 9.1
Identities = 24/118 (20%), Positives = 52/118 (44%), Gaps = 2/118 (1%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
S++L + S + + +++ K+K+K D D + D + K+ K
Sbjct: 220 SDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPK 279
Query: 158 EKKEEKEKKEEK--EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
+KK++K+K+ + E +++ + E KD E+E + K +KE+ +
Sbjct: 280 KKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDDD 337
>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
All proteins in this family for whcih functions are
known are G-T or G-U mismatch glycosylases that function
in base excision repair. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University). Used 2pf model [DNA metabolism,
DNA replication, recombination, and repair].
Length = 328
Score = 34.3 bits (78), Expect = 0.12
Identities = 17/89 (19%), Positives = 35/89 (39%)
Query: 90 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
PN+ V E+ + ++ + K K K + + + + K + K K
Sbjct: 29 PNMALVEEQEETSGVPAKAPTQEPSEEAPKFRKRKPRSNEPYRPVEPKKPSDSKKSGKST 88
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
+ +K+EK + K K++ F + E
Sbjct: 89 KSKEKQEKITDKFKVKRKVDRFNGVSEAE 117
>gnl|CDD|116547 pfam07937, DUF1686, Protein of unknown function (DUF1686). The
members of this family are all hypothetical proteins of
unknown function expressed by the eukaryotic parasite
Encephalitozoon cuniculi GB-M1. The region in question
is approximately 250 amino acids long.
Length = 182
Score = 33.6 bits (77), Expect = 0.12
Identities = 18/73 (24%), Positives = 21/73 (28%), Gaps = 21/73 (28%)
Query: 281 ACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF---CPKCSKVDEGSRKEKKKR 337
AC L M G G VCVGLV F S V
Sbjct: 108 AC-AAGSVKLFMPGSYG---RSSLVCVGLVV-------FAALLLLVSYVKGA-------G 149
Query: 338 GRKKKTHQDIIMS 350
G K + ++ S
Sbjct: 150 GEKVSSKGIVVGS 162
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 33.0 bits (76), Expect = 0.13
Identities = 21/119 (17%), Positives = 48/119 (40%), Gaps = 8/119 (6%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK--KDKDKIKNKEKDKEKKEKEKGKKE 155
++DL E ++ K + ++E E+ +++ + + K + ++ K E E+ +
Sbjct: 43 ADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELER 102
Query: 156 EKEKKEEKEKKEEKEFK------MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
KE E + + E++ + E EK+ ++ D +K D E
Sbjct: 103 IKEAAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKDLIDAFIAELGEN 161
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 34.3 bits (79), Expect = 0.14
Identities = 21/82 (25%), Positives = 34/82 (41%), Gaps = 5/82 (6%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
K N+ + ++ E++ K E K KK K+ K K + +E D E E
Sbjct: 25 KFVNEVRRCDEVERKLRKLESKIKKLGIPLKDTG-GKPDVPPSKEFLDLEEEILDLEAEI 83
Query: 196 KSKDKDKKLKKEKLKKKKKEKE 217
K ++ E L+K+ E E
Sbjct: 84 KEVEE----NLESLEKEINELE 101
Score = 28.9 bits (65), Expect = 7.4
Identities = 19/85 (22%), Positives = 34/85 (40%), Gaps = 6/85 (7%)
Query: 121 EKEKIKKKKDKK-DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
E E+ +K + K K I K + + +E EE+ E E K E+ +
Sbjct: 35 EVERKLRKLESKIKKLGIPLK---DTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLE 91
Query: 180 PVEKIR--KEEKDSEKEKKSKDKDK 202
+EK EE + +++ D+
Sbjct: 92 SLEKEINELEEWLNVLDEEKSFLDE 116
>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Large proteins, some contain Zn-finger domain; signature
gene for I-A subtype; also known as TM1802 family.
Length = 579
Score = 34.3 bits (79), Expect = 0.15
Identities = 30/109 (27%), Positives = 48/109 (44%), Gaps = 8/109 (7%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
+K T+ K E KK KK +K + K++ K K K+ EK EK K+ ++
Sbjct: 82 PTNKITYLLKTTE-PLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQD 140
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKS----KDKDKKLKKEKLKKKKKE 215
K E + I+ + K + + +DK K K K KK K++
Sbjct: 141 LK---PGENVILTIKIDGKKLLEYDEDYEEFRDKVLKKKVFKDKKAKEK 186
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 34.0 bits (77), Expect = 0.16
Identities = 28/108 (25%), Positives = 51/108 (47%), Gaps = 3/108 (2%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
+ S+D E +K +K ++ + +++ ++KK KD D D K+K + +K K K
Sbjct: 424 SESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDL---KDKKESINKKNKKGKH 480
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
E+ +++ E ED E+ K +EK KK++ KK
Sbjct: 481 AIERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKLKKK 528
Score = 32.4 bits (73), Expect = 0.53
Identities = 25/116 (21%), Positives = 49/116 (42%), Gaps = 5/116 (4%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE-----EKEKKEEKEKKEEKEFKMKEDK 177
E K + +K+ K +++++ KE+ K+ + + K++KE +K K K
Sbjct: 423 ESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAI 482
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
E+ + E ++ + D K LK +K +K R +K + F F
Sbjct: 483 ERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKLKKKASNLEEGFVF 538
>gnl|CDD|227872 COG5585, COG5585, NAD+--asparagine ADP-ribosyltransferase [Signal
transduction mechanisms].
Length = 417
Score = 34.1 bits (78), Expect = 0.16
Identities = 30/191 (15%), Positives = 57/191 (29%), Gaps = 5/191 (2%)
Query: 67 KTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK 126
+ + + VKD + PNI +DL KD + + ++ + +
Sbjct: 220 RNGFGGVKENFVKDPKSLDQN--PNIHDDLGKKDLKNYTKDPRAD-ENEKLRQTQDRDLG 276
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKE-KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
KK K +KN E KE + KK E +++ + ++ +
Sbjct: 277 KKLRKTGMPALKNGEGQIRPKEVRIRKKKLEDAIGVDEKGGVGVDAVPFVSEDLKPGEGS 336
Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 245
+ S K+L E K+ + I +
Sbjct: 337 INVHRLYR-SSSSQDIKELYIENFKETNLAGKLKDAYMNEISLEKDAIYEKIGHVYGLLE 395
Query: 246 SPKIVIKPVKS 256
+ +P KS
Sbjct: 396 IEDVASEPRKS 406
Score = 29.4 bits (66), Expect = 4.5
Identities = 22/133 (16%), Positives = 37/133 (27%), Gaps = 2/133 (1%)
Query: 38 PGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTS 97
T P+ + + K K ++ EK V V + +
Sbjct: 280 RKTGMPALKNGEGQIRPKEVRIRKKKLEDAIGVDEKGGVGVDAVPFVSED--LKPGEGSI 337
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
+ K+ KE K KD + K+ EK G E +
Sbjct: 338 NVHRLYRSSSSQDIKELYIENFKETNLAGKLKDAYMNEISLEKDAIYEKIGHVYGLLEIE 397
Query: 158 EKKEEKEKKEEKE 170
+ E K +E +
Sbjct: 398 DVASEPRKSDEVQ 410
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 33.5 bits (77), Expect = 0.16
Identities = 21/58 (36%), Positives = 33/58 (56%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
KD K KE++ I K K D ++ K +++++K K+E+K K EKEK EE+
Sbjct: 65 KDFKKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
Score = 28.8 bits (65), Expect = 4.9
Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 5/57 (8%)
Query: 114 KKTHKKKEKEKIKK--KKDKKDKDKIKNKEKDKEK---KEKEKGKKEEKEKKEEKEK 165
KK K+EK IK K D +K+++K KE++K K EKEK EE+
Sbjct: 68 KKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEEYG 124
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 33.5 bits (77), Expect = 0.17
Identities = 13/69 (18%), Positives = 24/69 (34%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
K +++ +K KKKK KK K + + + E K+ K+ +
Sbjct: 173 KARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRS 232
Query: 174 KEDKEKPVE 182
+ E
Sbjct: 233 IAPDNEKSE 241
Score = 29.6 bits (67), Expect = 2.6
Identities = 21/89 (23%), Positives = 40/89 (44%), Gaps = 2/89 (2%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK--KEEKEFKMKEDKEK 179
+E+++ K + ++++ K K+K K+KK K+ + E + + K
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
++K R D+EK + K KKEK
Sbjct: 226 KLKKKRSIAPDNEKSEVYKSLFTSHKKEK 254
Score = 28.9 bits (65), Expect = 4.7
Identities = 19/89 (21%), Positives = 36/89 (40%)
Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
+D + E++ E + ++ K+KK++K+KK +K E V E S
Sbjct: 157 EDVVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSS 216
Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
+ + K K K+ +K + S
Sbjct: 217 GAGQVGEAKKLKKKRSIAPDNEKSEVYKS 245
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 33.2 bits (76), Expect = 0.17
Identities = 24/117 (20%), Positives = 53/117 (45%), Gaps = 2/117 (1%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
L ++ ++ ++ +K +++ +E K K I + K K+E++ + KE+K
Sbjct: 101 LNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQK--ARAAKERK 158
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E + + F D P + ++K+ E++KK K+ ++ K+EK
Sbjct: 159 ERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEKRMAALVAAA 215
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.9 bits (77), Expect = 0.18
Identities = 39/179 (21%), Positives = 69/179 (38%), Gaps = 29/179 (16%)
Query: 13 AGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSP-- 70
+G P P QHP P PPPS + + S + S
Sbjct: 441 SGLHSGPPQSPFAQHPFT--SGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGG 498
Query: 71 -------------KEKEYSKVKDVELGVTPM-----VPNIVKVTS--SEDLGEIKK-DKS 109
KE+ + ++ E P P +V S S+ K D+
Sbjct: 499 CAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEPTVVNTPSHASQSARFYKHLDRG 558
Query: 110 IKKDKKTH---KKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+T K+ KK+++ +K K + ++K +E++E+EK K++E+E++ E+E
Sbjct: 559 YNSCARTDLYFTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617
Score = 29.7 bits (66), Expect = 4.4
Identities = 14/35 (40%), Positives = 26/35 (74%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
K + + +K K+E E+ +EE+E+++EKEK+ E+E
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERERE 613
Score = 29.3 bits (65), Expect = 6.0
Identities = 15/55 (27%), Positives = 37/55 (67%), Gaps = 4/55 (7%)
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK----ERSSHE 222
K+ + +E+ VEK ++E + +E++ ++K+K+ ++E+ ++++ E+ SSHE
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHE 630
Score = 28.9 bits (64), Expect = 7.8
Identities = 14/42 (33%), Positives = 29/42 (69%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
K +E+ EK ++E +K +E++ EKEK++E+E + + + E+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAER 620
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 31.6 bits (72), Expect = 0.18
Identities = 24/75 (32%), Positives = 34/75 (45%)
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
K K K + K+K K DK + K K KE K K+ G + E ++ K K K+D
Sbjct: 3 KKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKD 62
Query: 177 KEKPVEKIRKEEKDS 191
+K + KDS
Sbjct: 63 CKKGCKWEGNTCKDS 77
Score = 27.8 bits (62), Expect = 4.3
Identities = 17/66 (25%), Positives = 29/66 (43%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K + +K K KK++ K K K+ K K+ + + EK K ++ +K +K
Sbjct: 7 TKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKG 66
Query: 165 KKEEKE 170
K E
Sbjct: 67 CKWEGN 72
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 33.0 bits (76), Expect = 0.19
Identities = 18/62 (29%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
K KK KK +K+ K+ K D D+ K +E EK E+++ ++++ E E+K
Sbjct: 15 KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDREL--NRQRQAEAEQKA 72
Query: 168 EK 169
Sbjct: 73 IV 74
Score = 29.9 bits (68), Expect = 1.7
Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 8/65 (12%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK-----KEEKEKKE 167
DKK KK +KEK +K +K K + D+ K+ E+ K E+ E+ ++ + + E
Sbjct: 13 DKKKAKKAKKEK---RKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAE 69
Query: 168 EKEFK 172
+K
Sbjct: 70 QKAIV 74
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 33.7 bits (78), Expect = 0.19
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 147
K +K++ K +KE K +K K+K + K + K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 32.9 bits (76), Expect = 0.30
Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 5/53 (9%)
Query: 94 KVTS-SEDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 144
+VT+ +E E+K++ K+ DKK K K + KK +K + K +
Sbjct: 41 EVTNLNEQYKEMKEELKAALLDKKELKAWHKAQ---KKKEKQEAKAAKAKSKP 90
Score = 32.1 bits (74), Expect = 0.61
Identities = 10/39 (25%), Positives = 22/39 (56%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
KE++K K + K +K + K++K++ K K + + +
Sbjct: 53 KEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 31.0 bits (71), Expect = 1.4
Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)
Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
K +KE K+ K +K KKEK + K + +
Sbjct: 57 KAALLDKKELKAWHKAQK-KKEKQEAKAAKAKSKPR 91
Score = 30.2 bits (69), Expect = 2.0
Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 10/60 (16%)
Query: 125 IKKKKDKKDKDKIKN-KEKDKEKKE---------KEKGKKEEKEKKEEKEKKEEKEFKMK 174
++KK +K + ++ N E+ KE KE KE + +KK+EK++ + + K K
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 29.8 bits (68), Expect = 3.0
Identities = 15/67 (22%), Positives = 30/67 (44%), Gaps = 7/67 (10%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
++ K K +++ +++ KE ++ K K+E K K K K++K++ K
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKEL-KAWHKAQK-KKEKQE-----AK 82
Query: 187 EEKDSEK 193
K K
Sbjct: 83 AAKAKSK 89
Score = 29.4 bits (67), Expect = 4.1
Identities = 18/66 (27%), Positives = 31/66 (46%), Gaps = 15/66 (22%)
Query: 144 KEKKEKEKGK-------KEEKEKKE--EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
++K+ +KG+ ++ KE KE + ++KE K +K K+EK K
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQK------KKEKQEAKA 83
Query: 195 KKSKDK 200
K+K K
Sbjct: 84 AKAKSK 89
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 33.1 bits (76), Expect = 0.19
Identities = 22/142 (15%), Positives = 58/142 (40%), Gaps = 11/142 (7%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK-----------DKKDKDKIKNKEKDKEKK 147
+ L + K + + E E ++ + ++ + + K E++
Sbjct: 31 KALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERE 90
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
+ + + K+ ++E M+E ++ E +E+ EK + + +L++E
Sbjct: 91 LRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEE 150
Query: 208 KLKKKKKEKERSSHEKEVIPKL 229
+ +++ +E SS +E+ KL
Sbjct: 151 VAEIREEGQELSSKREELKEKL 172
Score = 31.6 bits (72), Expect = 0.65
Identities = 21/83 (25%), Positives = 36/83 (43%)
Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
D EK E KE ++ ++ + + E K E E +E + + E E + +
Sbjct: 16 DLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERI 75
Query: 203 KLKKEKLKKKKKEKERSSHEKEV 225
K +EKL K E+E + E+
Sbjct: 76 KRAEEKLSAVKDERELRALNIEI 98
Score = 30.8 bits (70), Expect = 1.0
Identities = 23/121 (19%), Positives = 55/121 (45%), Gaps = 5/121 (4%)
Query: 110 IKKDKKTHKKKEKEK-IKKKKDKKD--KDKIKNKEKDKEKKEKEKGKKEEKEKK-EEKEK 165
++KD+ + KE K +KK K + + ++ E + E E + + E + ++ E+ K
Sbjct: 17 LEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIK 76
Query: 166 K-EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ EEK +K+++E I + + + +++ + +K+ E + E+
Sbjct: 77 RAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERL 136
Query: 225 V 225
Sbjct: 137 E 137
Score = 29.6 bits (67), Expect = 2.5
Identities = 20/93 (21%), Positives = 49/93 (52%), Gaps = 2/93 (2%)
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKEKEKGKKEEK 157
++ K+ + +D+ +E EK++K+ + K+ ++++ + E + +E+ + +
Sbjct: 97 EIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIRE 156
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
E +E K+EE + K+ + E+IRK +K
Sbjct: 157 EGQELSSKREELKEKLDPELLSEYERIRKNKKG 189
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 33.6 bits (77), Expect = 0.19
Identities = 28/105 (26%), Positives = 56/105 (53%), Gaps = 4/105 (3%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
T KK E + K+ KD K+K K+ + +E +E + +KEE+E++ +KEE+E +M
Sbjct: 117 NTKKKIETYQ-KENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMN 175
Query: 175 E--DKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEK 216
+ +K+ ++++ +E + K KL+ + K K ++
Sbjct: 176 KRKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKP 220
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 32.6 bits (75), Expect = 0.20
Identities = 15/71 (21%), Positives = 47/71 (66%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
+KE++K++K+ ++ K K+++ ++K++E +K++ ++E ++K++E + + +
Sbjct: 59 QKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKID 118
Query: 179 KPVEKIRKEEK 189
K ++++ KE+
Sbjct: 119 KAIKEVAKEKG 129
Score = 29.5 bits (67), Expect = 2.1
Identities = 14/77 (18%), Positives = 50/77 (64%), Gaps = 8/77 (10%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK------EEKEKKEEKEKKEEKEF- 171
K +++++K+ KK + +++ KEK+ +K+E++ K+ E ++ K+++ +++++E
Sbjct: 34 KAAQKQLEKEF-KKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQ 92
Query: 172 KMKEDKEKPVEKIRKEE 188
+ ++ ++ +++ ++E
Sbjct: 93 QKQQAAQQELQQKQQEL 109
Score = 28.0 bits (63), Expect = 6.4
Identities = 19/73 (26%), Positives = 46/73 (63%), Gaps = 6/73 (8%)
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
K +++ +KE K+ + E +KKE KE + +E K + ++ SE+ +K+K ++ + K
Sbjct: 34 KAAQKQLEKEFKKLQAELQKKE-KELQKEEQKLQ-----KQAATLSEEARKAKQQELQQK 87
Query: 206 KEKLKKKKKEKER 218
+++L++K++ ++
Sbjct: 88 QQELQQKQQAAQQ 100
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 32.9 bits (75), Expect = 0.21
Identities = 21/73 (28%), Positives = 37/73 (50%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
+ + K K K K + +D ++ + E K ++K K KK E +K+ +K+KKE
Sbjct: 102 RLMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKE 161
Query: 168 EKEFKMKEDKEKP 180
+K+ K + E P
Sbjct: 162 KKKKKKRHSPEHP 174
Score = 32.5 bits (74), Expect = 0.27
Identities = 20/72 (27%), Positives = 38/72 (52%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
E+ + + KK+ K K ++ + +E P + + + + +KK + DK+ KK+
Sbjct: 99 EQYRLMHIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKK 158
Query: 208 KLKKKKKEKERS 219
K +KKKK+K S
Sbjct: 159 KKEKKKKKKRHS 170
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 33.8 bits (77), Expect = 0.21
Identities = 20/78 (25%), Positives = 42/78 (53%), Gaps = 1/78 (1%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-DKEKKEKEKG 152
K D +K + + + + + K+D+K D + D E++E+E+
Sbjct: 812 KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEE 871
Query: 153 KKEEKEKKEEKEKKEEKE 170
++EE+E++EE+E++EE+E
Sbjct: 872 EEEEEEEEEEEEEEEEEE 889
Score = 33.0 bits (75), Expect = 0.34
Identities = 26/134 (19%), Positives = 65/134 (48%), Gaps = 5/134 (3%)
Query: 53 KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 112
+ +T + + ++ K E E +D E+ V+ + GE + + +
Sbjct: 761 RKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSE 820
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG-----KKEEKEKKEEKEKKE 167
+ + + E +++ + +++ + K EK + G ++EE+E++EE+E++E
Sbjct: 821 TQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEE 880
Query: 168 EKEFKMKEDKEKPV 181
E+E + +E+ E+P+
Sbjct: 881 EEEEEEEEENEEPL 894
Score = 32.3 bits (73), Expect = 0.63
Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 3/101 (2%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
K D + + + ++ ++ E + +E E E++ E E K + + E + RK
Sbjct: 632 KGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKG 691
Query: 188 EKDSEKEKKSKDKDKK---LKKEKLKKKKKEKERSSHEKEV 225
E++ E E ++K+ D K +E + + E E + E E+
Sbjct: 692 EQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEI 732
Score = 30.7 bits (69), Expect = 2.1
Identities = 24/134 (17%), Positives = 61/134 (45%), Gaps = 5/134 (3%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
+ KV + DL + ++ ++T ++ E+ + + + E ++E + + K
Sbjct: 621 VAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEGE----NGEESGGEAEQEGETETK 676
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
G + E E + E+K E+E + + + ++ K E ++ E E +++ + + + E
Sbjct: 677 G-ENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETG 735
Query: 212 KKKEKERSSHEKEV 225
++ E+ E E
Sbjct: 736 EEGEEVEDEGEGEA 749
Score = 30.3 bits (68), Expect = 2.4
Identities = 28/154 (18%), Positives = 71/154 (46%), Gaps = 10/154 (6%)
Query: 60 EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIK--KDKSIKKDKKTH 117
E++ + + + K + + + ED GEI+ +D +K D+
Sbjct: 739 EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK-------- 169
K E E + +K + + + D + + E G++E + + + K++EK
Sbjct: 799 GKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS 858
Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+ E++E+ E+ +EE++ E+E++ ++++++
Sbjct: 859 DGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
>gnl|CDD|227470 COG5141, COG5141, PHD zinc finger-containing protein [General
function prediction only].
Length = 669
Score = 33.8 bits (77), Expect = 0.21
Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 9/64 (14%)
Query: 261 PVAAADEYDTGDSKQVWICPAC-GVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF 319
P+ +DE+D IC C ++ S ++ CDGC+ H C G+ PE W
Sbjct: 185 PIEPSDEFDD-------ICTKCTSTHNENSNAIVFCDGCEICVHQSCYGIQFLPE-GFWL 236
Query: 320 CPKC 323
C KC
Sbjct: 237 CRKC 240
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 32.3 bits (74), Expect = 0.22
Identities = 18/62 (29%), Positives = 40/62 (64%), Gaps = 1/62 (1%)
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
K G ++++++EE+E+ E +E +E ++ +EK + K EK ++++ K K++ KE+
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKR-EKRRENERKQKEILKEQ 150
Query: 209 LK 210
+K
Sbjct: 151 MK 152
Score = 31.9 bits (73), Expect = 0.29
Identities = 16/62 (25%), Positives = 39/62 (62%), Gaps = 1/62 (1%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
++K + K +K +++E+ ++++ +++ D++ KE K K+EK + + E K+K+ K
Sbjct: 90 VRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRR-ENERKQKEILK 148
Query: 164 EK 165
E+
Sbjct: 149 EQ 150
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 33.5 bits (77), Expect = 0.22
Identities = 27/129 (20%), Positives = 47/129 (36%), Gaps = 11/129 (8%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE-EKEKKEEKEFKMKED 176
+K +K K K K N ++ K +K KK KE E + + +E
Sbjct: 448 CEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDFIKEFETDALKLQEI----- 502
Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS-HEKEVIPKLTFKFGT 235
+EK+ + + EK D K+L E +E + + H E+ +K
Sbjct: 503 ----LEKVDSKSEKLEKISAKIDNIKELFDESKMSILQEILQPALHHIELNIARIYKLNI 558
Query: 236 DMEEKTKRE 244
+E +
Sbjct: 559 KDKEDFLNK 567
Score = 29.3 bits (66), Expect = 5.0
Identities = 32/146 (21%), Positives = 51/146 (34%), Gaps = 13/146 (8%)
Query: 81 VELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE---KEKIKKKKDKKDKDKI 137
EL V NI+ + +DL S K E +KIK K+
Sbjct: 345 YELAVYLGFKNIIFI--GQDLAYTNDGNSHAKGASYGDNYEGDESDKIKVVAYV-GGGKV 401
Query: 138 KNKEK---DKEKKEKEKGKKEEKEKKEEKEK----KEEKEFKMKEDKEKPVEKIRKEEKD 190
+ +EK E++ ++KE E K E KE EK ++K + +
Sbjct: 402 ETDSVWNLFREKFEEDIANYKDKEFYNCTEGGARIKGTIELPFKEVCEKLLDKDKNKPFI 461
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEK 216
+ ++K + K K KK
Sbjct: 462 KLILLSNNSQEKAVLKLLQKIKKNND 487
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 33.4 bits (77), Expect = 0.23
Identities = 19/61 (31%), Positives = 41/61 (67%), Gaps = 3/61 (4%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE---KEFKMKEDKEKPVEK 183
K+K K +K++ K +++E++E + ++EE+E+ EE++K++ + K+ E KEK +E
Sbjct: 6 KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65
Query: 184 I 184
+
Sbjct: 66 L 66
Score = 30.3 bits (69), Expect = 2.2
Identities = 18/60 (30%), Positives = 35/60 (58%)
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
DK+K K +K +EK ++EE+E+ E E++EE+E + ++ ++ K + EK+
Sbjct: 3 DKLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKD 62
Score = 28.4 bits (64), Expect = 9.0
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 8/69 (11%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
+K KEK K K EEK ++EE+E+ E E + +E+ E EEK + K K
Sbjct: 3 DKLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDE--------EEKKEKPGFFDKAK 54
Query: 201 DKKLKKEKL 209
++K++ +
Sbjct: 55 ITEIKEKDI 63
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 33.3 bits (75), Expect = 0.23
Identities = 35/138 (25%), Positives = 72/138 (52%), Gaps = 11/138 (7%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIK--------NKEKDKEKKEKEKGKKEEKEKKEEK 163
+DKK + K++ + +D+ +K I+ +EK K +KE + + + ++EK
Sbjct: 124 QDKKIELAQAKKEAENARDRANKSGIELEQEEQKTEQEKQKTEKEGIELANSQIKAEQEK 183
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEK-LKKKKKEKERSS 220
+K E+++ K +++K+K K E + EK+K +K +K++K K+ ++ + +
Sbjct: 184 QKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQEN 243
Query: 221 HEKEVIPKLTFKFGTDME 238
H + I KL K G +E
Sbjct: 244 HNQFFIKKLGIKAGIAIE 261
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 33.3 bits (76), Expect = 0.23
Identities = 31/114 (27%), Positives = 42/114 (36%), Gaps = 1/114 (0%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
KK K + KK K K K K K K KK K + +K ++ K
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258
Query: 173 MKEDKEKPVEKI-RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
+ K V+K +K K + K K K K K + KK K+ S K
Sbjct: 259 ALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKAT 312
Score = 31.8 bits (72), Expect = 0.61
Identities = 28/120 (23%), Positives = 45/120 (37%), Gaps = 1/120 (0%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K S K K +K+ K K K K K KK +K K+ + ++
Sbjct: 212 AKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAA 271
Query: 165 KKEEK-EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
KK K K + K + K +K + K+ + K K K + K +K + +K
Sbjct: 272 KKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 33.7 bits (77), Expect = 0.23
Identities = 35/165 (21%), Positives = 49/165 (29%), Gaps = 1/165 (0%)
Query: 34 LVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYS-KVKDVELGVTPMVPNI 92
L P M S K K SK K + EK+ V D
Sbjct: 404 LKDPTIIAGKKLMNKLTSEKINNPVKVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELR 463
Query: 93 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
KV S+ K KK + IK KK K K K +D +
Sbjct: 464 KKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKANLQDVGECSSPPN 523
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
KE+ +K+ K + + P ++K E + K
Sbjct: 524 NKEKNDKQTSTSSSVLKSDRSSIEVRNPNANVKKLEDTTYNAKFP 568
Score = 32.6 bits (74), Expect = 0.52
Identities = 38/208 (18%), Positives = 65/208 (31%), Gaps = 16/208 (7%)
Query: 23 PMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVE 82
+ + P P + P P+ + K TS K K K +Y K +
Sbjct: 387 NLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSEKINNPVK---VVKVSKYKGNKSEK 443
Query: 83 LGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK 142
++ I S++L + K K+ K K K K K ++ N
Sbjct: 444 KRDINVLDTIFASPVSKEL----------RKKVGKSKQTKLKNFKPVPNKSKKQLANNNS 493
Query: 143 DKEKKEKEKGKKEEKEKKEEK--EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD- 199
K +K K + + E K K DK+ + S E ++ +
Sbjct: 494 QNIKSKKVVKAKTNNKANLQDVGECSSPPNNKEKNDKQTSTSSSVLKSDRSSIEVRNPNA 553
Query: 200 KDKKLKKEKLKKKKKEKERSSHEKEVIP 227
KKL+ K +++ V
Sbjct: 554 NVKKLEDTTYNAKFPTVSKNNAYTLVDI 581
Score = 29.9 bits (67), Expect = 3.1
Identities = 34/155 (21%), Positives = 59/155 (38%), Gaps = 8/155 (5%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
S E+L E K K KK +K +KI N K K K KG K
Sbjct: 383 QSEENLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSEKINNPVK-VVKVSKYKGNKS 441
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL----KKEKLKK 211
EK++ + F KE + + ++ + K +K KK + +K
Sbjct: 442 EKKRDI---NVLDTIFASPVSKELRKKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKS 498
Query: 212 KKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
KK K +++++ + + +EK +++S
Sbjct: 499 KKVVKAKTNNKANLQDVGECSSPPNNKEKNDKQTS 533
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 33.4 bits (77), Expect = 0.23
Identities = 15/106 (14%), Positives = 46/106 (43%), Gaps = 2/106 (1%)
Query: 121 EKEKIKKKKDKKDK-DKIKNK-EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
K +I+ + +K K ++ K + E + + E+EK +E + KK + + + + +
Sbjct: 434 AKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALA 493
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ K + + ++ + + + +K + + R + ++
Sbjct: 494 RVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQA 539
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 33.6 bits (77), Expect = 0.24
Identities = 31/115 (26%), Positives = 51/115 (44%), Gaps = 12/115 (10%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
I S ++ +++ + ++ +K K++ K KE+ +E E EK KE E
Sbjct: 330 IATRFSSPNVQENDVEEKNDNSNVQQKEKKKEESKAKEEKQEDIEFEKRFKELME----- 384
Query: 164 EKKEEKEFKMKEDKEKPVE-KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E K K D V + + E+D +K K S D K + E +K+K E E
Sbjct: 385 ------ELKEKGDLSIFVALSLSEVEEDGDKVKISFDSSKAMHYELMKEKLPELE 433
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 33.4 bits (76), Expect = 0.25
Identities = 34/114 (29%), Positives = 58/114 (50%), Gaps = 9/114 (7%)
Query: 118 KKKEKEKIKKK------KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
KKE + ++ K +KDK +++ EK + K E + EK+ EEK++K+E+E
Sbjct: 453 LKKENDMLQTKLNSMVSAKQKDKQSMQSMEK-RLKSEADSRVNAEKQLAEEKKRKKEEEE 511
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
+ R+E +S K+ K +D + ++KK + K KE+E EKE
Sbjct: 512 TAARAAAQAAAS-REECAESLKQAK-QDLEMEIKKLEHDLKLKEEECRMLEKEA 563
Score = 31.8 bits (72), Expect = 0.78
Identities = 34/125 (27%), Positives = 66/125 (52%), Gaps = 2/125 (1%)
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
DLG++KK+ + + K K+K K+ +K ++K++ + EK+ +E+K K
Sbjct: 449 DLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEK-RLKSEADSRVNAEKQL-AEEKKRK 506
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
KEE+E + +E+ E +++ ++D E E K + D KLK+E+ + +KE +
Sbjct: 507 KEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQEL 566
Query: 220 SHEKE 224
+E
Sbjct: 567 RKYQE 571
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 33.4 bits (76), Expect = 0.25
Identities = 21/107 (19%), Positives = 49/107 (45%), Gaps = 2/107 (1%)
Query: 114 KKTHKKKEKEKIKKKK--DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+ + E+ I+ + D K+KDK K+ D E ++ E K+ K++ + K ++E++E
Sbjct: 241 GEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEED 300
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ED + E + ++ + ++L +K + +
Sbjct: 301 DEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAK 347
Score = 33.0 bits (75), Expect = 0.35
Identities = 25/120 (20%), Positives = 56/120 (46%), Gaps = 3/120 (2%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
++ G I+ + +K KK + + + D+ DK+ +K + K ++E E+ ++E
Sbjct: 245 EDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQE 304
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK---KSKDKDKKLKKEKLKKKK 213
++ EE+ + + ++ + ++ S EK K K + ++L+KE L K
Sbjct: 305 DDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKENLAPKS 364
Score = 31.5 bits (71), Expect = 1.1
Identities = 30/152 (19%), Positives = 69/152 (45%), Gaps = 6/152 (3%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK--EKGKKE 155
S+D E ++K +++++ +K+ E+ ++K K DK + +K ++ + E + E
Sbjct: 152 SDDETEDDEEKKMEEEEAGEEKESVEQATREK-KFDKSGVDDKFFKLDEMNEFLEATEAE 210
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E+ +++ E+ ED + + E+++ ++E + +D KEK KKK
Sbjct: 211 EEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAG 270
Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTKRESSP 247
+ + E + K + + K + E
Sbjct: 271 DDAELEDDEPDKEAVKK---EADSKPEEEDEE 299
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as
a single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through
DAZ repeats.
Length = 136
Score = 32.1 bits (73), Expect = 0.25
Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 9/51 (17%)
Query: 3 PFQFSPFP--SVAGP-------GLIPRLPPMLQHPLIPQPLVHPPGTPPPS 44
P +P+P S+ P G PPM +P+ P V+PPG+
Sbjct: 25 PQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPPGSTVLV 75
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 33.5 bits (78), Expect = 0.25
Identities = 30/113 (26%), Positives = 49/113 (43%), Gaps = 21/113 (18%)
Query: 112 KDKKTHKKKEKEKIKKKKDKK-DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE--- 167
KDK T K+ + I ++I ++ K+ E +E+K++KE E +
Sbjct: 482 KDKGTGKE---QSITITASSGLSDEEI-----ERMVKDAEANAEEDKKRKELVEARNQAD 533
Query: 168 ------EKEFKMKEDK--EKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKK 211
EK K DK EKI K+ ++ K +DK+ K K E+L +
Sbjct: 534 SLIYQTEKTLKELGDKVPADEKEKIEAAIKELKEALKGEDKEAIKAKTEELTQ 586
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 33.6 bits (77), Expect = 0.25
Identities = 21/96 (21%), Positives = 46/96 (47%), Gaps = 9/96 (9%)
Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK-------EDKEKPVEKIRKEEK 189
EK+ ++ K +G+K E ++ K+ E E + +D K +++ +E
Sbjct: 132 RDELEKEADELWKPRGRKPEINVALKELKELEAEIREVQLKTRTWKDLVKALDEAEEELA 191
Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKE-KERSSHEKE 224
+ KE + +K+ K + E+L++ ER + E++
Sbjct: 192 NLRKELRQLEKE-KQRLERLRRLLPLLAERKALEQQ 226
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 32.8 bits (75), Expect = 0.26
Identities = 10/79 (12%), Positives = 32/79 (40%)
Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
++ + + + ++ + + + + KE+ + +++ + + K KP K +
Sbjct: 51 APTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPK 110
Query: 186 KEEKDSEKEKKSKDKDKKL 204
+ + K+ SK K
Sbjct: 111 VKPQPKPKKPPSKTAAKAP 129
Score = 31.7 bits (72), Expect = 0.60
Identities = 21/81 (25%), Positives = 32/81 (39%), Gaps = 1/81 (1%)
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEK 208
+ EE + + E +++ K E +P KE+ EK+ KK K K K K K
Sbjct: 49 LEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPK 108
Query: 209 LKKKKKEKERSSHEKEVIPKL 229
K K + K + K
Sbjct: 109 PKVKPQPKPKKPPSKTAAKAP 129
Score = 30.5 bits (69), Expect = 1.3
Identities = 13/66 (19%), Positives = 25/66 (37%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+ E++ + + + K K+K K EK+ K + K K + K K + K +
Sbjct: 61 EPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKP 120
Query: 178 EKPVEK 183
Sbjct: 121 PSKTAA 126
Score = 29.8 bits (67), Expect = 2.5
Identities = 19/77 (24%), Positives = 29/77 (37%), Gaps = 5/77 (6%)
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKE--DKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+ E E +++ K E + E K KE EK +K + + K K K K K
Sbjct: 57 QPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP---KVKP 113
Query: 204 LKKEKLKKKKKEKERSS 220
K K K + +
Sbjct: 114 QPKPKKPPSKTAAKAPA 130
Score = 28.6 bits (64), Expect = 5.3
Identities = 15/78 (19%), Positives = 28/78 (35%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E +++ + + K+K K +K K K K K K + + K + + K+
Sbjct: 61 EPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKP 120
Query: 163 KEKKEEKEFKMKEDKEKP 180
K K +P
Sbjct: 121 PSKTAAKAPAAPNQPARP 138
Score = 28.6 bits (64), Expect = 5.7
Identities = 19/79 (24%), Positives = 28/79 (35%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
K E + K K K K + K KK K K K + K K + K + + K+ K
Sbjct: 67 PKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAA 126
Query: 178 EKPVEKIRKEEKDSEKEKK 196
+ P + S
Sbjct: 127 KAPAAPNQPARPPSAASAS 145
>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1. RNA
polymerases catalyze the DNA dependent polymerisation of
RNA. Prokaryotes contain a single RNA polymerase
compared to three in eukaryotes (not including
mitochondrial. and chloroplast polymerases). This
domain, domain 1, represents the clamp domain, which a
mobile domain involved in positioning the DNA,
maintenance of the transcription bubble and positioning
of the nascent RNA strand.
Length = 330
Score = 33.0 bits (76), Expect = 0.27
Identities = 27/114 (23%), Positives = 40/114 (35%), Gaps = 7/114 (6%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
+ ++S+K K K KK+ KI N K K K + E
Sbjct: 101 SSLLLNESVKYFFLKVVIDPKGKNSKKR----LKKINNLCKKKSICSKCGEDNGGLKAFE 156
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
K + K K E ++ E + KEK K +K+ + KK KE
Sbjct: 157 GCGKYQPKISKDG--AEAIKALLKNIEIEELKEKLRKLNPEKV-LQIFKKISKE 207
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 33.3 bits (76), Expect = 0.30
Identities = 31/114 (27%), Positives = 53/114 (46%), Gaps = 14/114 (12%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKE-----KEKG----KKEEKEKKEEKEKKEEK 169
K+ KE++K + DKI +E E KEKG ++ +E++EE+E++EE+
Sbjct: 356 KELKERVKDAIPEDLYDKIATEEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEE 415
Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-----KDKKLKKEKLKKKKKEKER 218
+ + E+PV E K+ K+ EK+ K+ EKE+
Sbjct: 416 AAEAEAPMEEPVPGFEVPEMPMPAAGGGGGIKIVLKNAKITAEKVIIKRAEKEK 469
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 33.3 bits (76), Expect = 0.30
Identities = 27/144 (18%), Positives = 47/144 (32%), Gaps = 4/144 (2%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNK----EKDKEKKEKEKGKKEEKEKKEEKEKK 166
+ + + E + K +++ D IK K E D E K K +E E K+ K K
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKN 414
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
K + K EK +E ++ + + E + + +I
Sbjct: 415 SSFINKTENILTNSPLKDELLEKTTEIINIENPQEFEFGQIGNDIISTEIAQLDENQNLI 474
Query: 227 PKLTFKFGTDMEEKTKRESSPKIV 250
F + E+ KI
Sbjct: 475 DTGEFDLENNFSNSFNPENGNKID 498
Score = 31.3 bits (71), Expect = 1.0
Identities = 19/86 (22%), Positives = 36/86 (41%)
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
+ E+E + E +FK E+ I++++ ++E + + K K L++ K K K
Sbjct: 354 ALSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNK 413
Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTK 242
S K ++ EKT
Sbjct: 414 NSSFINKTENILTNSPLKDELLEKTT 439
>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the AAA
superfamily.
Length = 320
Score = 32.8 bits (75), Expect = 0.30
Identities = 22/112 (19%), Positives = 37/112 (33%), Gaps = 12/112 (10%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK------ 147
K+ +++ KK K +K K + N EK+ + K
Sbjct: 130 KIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLEKNIDDKLSNIFN 189
Query: 148 ------EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
+ EK EK K E + +F+ K D +I K D++
Sbjct: 190 EKLSNLKNEKISINEKIKSILNEINPDSKFEKKLDLNLIKLEIVKSNSDNDS 241
Score = 30.8 bits (70), Expect = 1.5
Identities = 17/106 (16%), Positives = 31/106 (29%), Gaps = 1/106 (0%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE- 161
++ DK + K KK K + DK + ++ E
Sbjct: 119 NLEYILKKILDKIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLEK 178
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
+ K F K K + E+ S + + D + K +
Sbjct: 179 NIDDKLSNIFNEKLSNLKNEKISINEKIKSILNEINPDSKFEKKLD 224
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 32.2 bits (73), Expect = 0.31
Identities = 12/33 (36%), Positives = 14/33 (42%)
Query: 10 PSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPP 42
S PG+ P P + P P PPG PP
Sbjct: 147 SSYGAPGVPPPYNPPMLMMKPPPPSPGPPGAPP 179
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 33.1 bits (75), Expect = 0.31
Identities = 26/123 (21%), Positives = 59/123 (47%), Gaps = 3/123 (2%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E L E D ++D+K + + + + + K +D ++K + +K++ +E+E
Sbjct: 180 EPLDEDYPDDMEERDRKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQE 239
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK---DKDKKLKKEKLKKKKKE 215
K+ +K +K E+E + ++ + + +E E ++ KK KE +KK K
Sbjct: 240 KEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKM 299
Query: 216 KER 218
+++
Sbjct: 300 EKK 302
Score = 30.4 bits (68), Expect = 1.9
Identities = 23/109 (21%), Positives = 46/109 (42%), Gaps = 3/109 (2%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE--KEKKEE 162
K++ + K +++ + K+D KE++KE K+ K ++E + K
Sbjct: 201 AKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALA 260
Query: 163 KEKKEEK-EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
K + + + K + + E ++K E KK+ +KK K K
Sbjct: 261 ALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKMEKKAIKNAAK 309
Score = 30.0 bits (67), Expect = 2.8
Identities = 20/88 (22%), Positives = 40/88 (45%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
+ K+D + KE+EK KK K +++ + K K + K + + + E
Sbjct: 223 QIAKKRDPRIKSFKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASA 282
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
+ K KE +K ++ +K K++ K+
Sbjct: 283 TAVKKKAKEVMKKALKMEKKAIKNAAKD 310
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 33.2 bits (75), Expect = 0.31
Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 8/101 (7%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
EI+ + K + E + + ++K +IK D+ KE+ K ++E++ +
Sbjct: 984 EIESKLPLDKRSIAVVSRRIE-LPVPPECREKHEIK----DRIVKERIKDQEEKERMESL 1038
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+ KEE+ K ++++E ++IRK D+ KE K KK
Sbjct: 1039 QRAKEEEIGKKEKERE---QRIRKTIHDNYKEMAKKRLKKK 1076
Score = 32.8 bits (74), Expect = 0.48
Identities = 24/97 (24%), Positives = 48/97 (49%), Gaps = 12/97 (12%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK-EEKEKKEEKEKKEEKEFKMKED 176
K+ + ++ + + + K + K++ KE+ K EEKE+ E ++ +E+E KE
Sbjct: 992 DKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKE- 1050
Query: 177 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
+E++ K D K++ K++LKKK+
Sbjct: 1051 ----------KEREQRIRKTIHDNYKEMAKKRLKKKR 1077
Score = 30.9 bits (69), Expect = 1.9
Identities = 36/184 (19%), Positives = 75/184 (40%), Gaps = 8/184 (4%)
Query: 49 KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 108
K+ + SP+E + + S K E V E V + + S + ++ + +
Sbjct: 503 KIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSKVANESISSNHEKLMESE 562
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
+ KK K + +KD + +++ ++ EKG E+ E +E E
Sbjct: 563 FEELKKKW----SSLAQLKSRFQKDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEM 618
Query: 169 KEFK----MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+E + E++E E + E++ KK + + +E+ +KK+ + + EK
Sbjct: 619 EESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKKDVDWYTEEKR 678
Query: 225 VIPK 228
I +
Sbjct: 679 KIEE 682
Score = 29.3 bits (65), Expect = 5.1
Identities = 19/68 (27%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKKEKLK 210
E +EK E K++ ++ K +E+KE+ R KEE+ +KEK+ + + +K + K
Sbjct: 1007 VPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYK 1066
Query: 211 KKKKEKER 218
+ K++ +
Sbjct: 1067 EMAKKRLK 1074
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 32.2 bits (74), Expect = 0.32
Identities = 22/100 (22%), Positives = 53/100 (53%), Gaps = 7/100 (7%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
K+E ++KK ++ +K + +++K E K +EE E+ +K K EK+ +
Sbjct: 33 KEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSL---- 88
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ ++ K+ EKE K+ + ++ +++ +K ++E++
Sbjct: 89 ---KNLKARLKELEKELKNLKWESEVLEQRFEKVERERDE 125
Score = 28.7 bits (65), Expect = 4.6
Identities = 27/93 (29%), Positives = 48/93 (51%), Gaps = 12/93 (12%)
Query: 105 KKDKSIKKDKKTHKKKEK---EKIKKKKDKKD--KDKIKNKEKDKE--KKEKEKGKKEEK 157
K ++ +K ++ K E +KK +++ + + K+K+ EKDK+ K K + K+ EK
Sbjct: 41 KNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEK 100
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
E K K E ++ E + + VE+ R E D
Sbjct: 101 ELKNLK-----WESEVLEQRFEKVERERDELYD 128
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 32.7 bits (75), Expect = 0.32
Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKKEEKEFKMKEDK 177
KE E+++K+ + D +K KEKK + K+ E+E K+ E E+ + ++ E+K
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEK 89
Query: 178 EKPVEKIRKEEKDSE 192
+E +S
Sbjct: 90 SSWFPSKDPDEINSA 104
Score = 28.8 bits (65), Expect = 6.3
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 6/61 (9%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK------EKDKEKKEKEKGKKEEKEKK 160
DK I++ +K K+ + K KK+K ++K+ E K + EK KK E+K
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIEHNEKTKKRLSEEK 89
Query: 161 E 161
Sbjct: 90 S 90
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 33.3 bits (76), Expect = 0.34
Identities = 27/99 (27%), Positives = 48/99 (48%), Gaps = 9/99 (9%)
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+EK ++ EK KEE E++ + E + + ++EK + + + +E EK ++EK
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEK---------RREK 302
Query: 209 LKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSP 247
L+ K+ RS+ I FK G ++ R S P
Sbjct: 303 LQNLLKKASRSADNVWYIEPSEFKAGDTVKLYYNRSSGP 341
Score = 29.5 bits (66), Expect = 5.0
Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 134 KDKIKNKEK-DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
++K + EK KE+ E+E+ +E++ ++EEK E + K + EK EK++
Sbjct: 252 EEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQ 304
Score = 29.1 bits (65), Expect = 7.1
Identities = 20/114 (17%), Positives = 39/114 (34%), Gaps = 1/114 (0%)
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
S K+ ++ + KE + K +E + + E ++ E E + E
Sbjct: 23 SQKRIQQNNGDKE-DSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIE 81
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
E + + + D+KLK E+ +K+E E + E
Sbjct: 82 VEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKREIEELAEE 135
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 33.0 bits (76), Expect = 0.34
Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 6/100 (6%)
Query: 73 KEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK---KEKEKIKKKK 129
K + ++++E+ V+VT+ E+ + D KE EK++K+
Sbjct: 771 KGLANLEELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEI 830
Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
D+ +K K+ N E K +E ++EKEK E + K K
Sbjct: 831 DRIEK-KLSN-EGFVAKAPEEV-VEKEKEKLAEYQVKLAK 867
Score = 31.1 bits (71), Expect = 1.3
Identities = 16/70 (22%), Positives = 32/70 (45%), Gaps = 2/70 (2%)
Query: 102 GEIKKDKSIKKDKKTHKK--KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
G I + + +K +K KE ++I+KK + +E +++KEK + + K
Sbjct: 808 GLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867
Query: 160 KEEKEKKEEK 169
EE+ +
Sbjct: 868 LEERLAVLKA 877
Score = 30.3 bits (69), Expect = 2.5
Identities = 35/150 (23%), Positives = 61/150 (40%), Gaps = 19/150 (12%)
Query: 72 EKEYSKVKDV---------ELGVTPMVP-NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 121
EKE+ +K++ E+ ++P P +V V S E ++ ++ K +E
Sbjct: 721 EKEFELLKEIISAIRNLRAEMNLSPSAPLKVVLVGSEELEDRLEANEDDIK--GLANLEE 778
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG---KKEEKEKKEEKEKKEEKEFKMKEDK- 177
E I ++ ++ E G E + E++ +K EKE E K
Sbjct: 779 LE-ILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKL 837
Query: 178 --EKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
E V K +E + EKEK ++ + K K
Sbjct: 838 SNEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867
>gnl|CDD|204587 pfam11078, Optomotor-blind, Optomotor-blind protein N-terminal
region. This family is conserved in Drosophila spp.
Optomotor-blind is one of the essential toolkit
proteins for coordinating development in diverse animal
taxa, and in Drosophila it plays a key role in
establishing the abdominal pigmentation pattern, in
development of the central nervous system and leg and
wing imaginal disc-formation of Drosophila
melanogaster. This is the N-terminal region of the
protein and does not include the T-box-containing
transcription factor that plays a part in DNA-binding.
Length = 74
Score = 30.4 bits (68), Expect = 0.34
Identities = 14/36 (38%), Positives = 16/36 (44%)
Query: 5 QFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGT 40
Q P P + LIP+ PP HP PL H T
Sbjct: 14 QAGPHPGLYPGTLIPKFPPAHPHPHPHHPLGHAYTT 49
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 32.3 bits (74), Expect = 0.34
Identities = 35/116 (30%), Positives = 62/116 (53%), Gaps = 12/116 (10%)
Query: 119 KKEKEKIKKKKDK-----KDKDKIKNKEKDKEKKEKEKGKKEEK--EKKEEKEKKEEKEF 171
K + E IKK D+ ++K+ KEK+ +K +E+ K +K EK E+K+ K++ +
Sbjct: 61 KDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQ 120
Query: 172 KMK--EDKEKPVEKIRKEEK---DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+K +++ K +K+ K K + EKE KDK L ++L +K K +S +
Sbjct: 121 LVKVMKERYKSYDKLYKAYKKALNLEKELYEYLKDKDLTLKELDEKIKAVNQSYEK 176
Score = 29.6 bits (67), Expect = 2.4
Identities = 30/105 (28%), Positives = 54/105 (51%), Gaps = 1/105 (0%)
Query: 112 KDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
++ KKEKE I+K +++ K K K +DK+ K+K K + +++ + K K
Sbjct: 79 DKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYKSYDKLYKA 138
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
+K + EK + + K++ + KE K K EK++K KK+
Sbjct: 139 YKKALNLEKELYEYLKDKDLTLKELDEKIKAVNQSYEKIQKAKKK 183
>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII. Eukaryotic DNA topoisomerase
II, GyrB, ParE.
Length = 594
Score = 32.9 bits (76), Expect = 0.35
Identities = 25/107 (23%), Positives = 40/107 (37%), Gaps = 19/107 (17%)
Query: 122 KEKIKKKKDKKDKD----------------KIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
E KKKK K+K+ KI + + + KEK G E + E+
Sbjct: 261 NEYAKKKKKLKEKNIKGEDVREGLTAFISVKIPEPQFEGQTKEK-LGTSEVRFGVEKIVS 319
Query: 166 KEEKEFKM--KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
+ F + K VEK+ K KK+++ +K K +
Sbjct: 320 ECLLSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSSIS 366
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 33.0 bits (75), Expect = 0.35
Identities = 28/122 (22%), Positives = 49/122 (40%), Gaps = 4/122 (3%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
+K + T + E+EK K K K K K+K+E + EE++ + + +
Sbjct: 77 QKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAK 136
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK----KKKKEKERSSHEKEVI 226
K ++ E + ++ E+ K K K K K K K+K E +EV
Sbjct: 137 AKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVT 196
Query: 227 PK 228
+
Sbjct: 197 EE 198
Score = 29.9 bits (67), Expect = 2.6
Identities = 36/173 (20%), Positives = 69/173 (39%), Gaps = 1/173 (0%)
Query: 44 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
SK++++ K+K PK + E+ + +K K L +VT E
Sbjct: 35 SKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALA-KQKREGTEEVTEEEKAKA 93
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
K + K K K+K + ++ +++K K K K + K+++E EE
Sbjct: 94 KAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEV 153
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
++EE+ K K + K ++++ + + E+ K K K K
Sbjct: 154 TEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAK 206
>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein. This
is the P3 protein section of the Potyviridae
polyproteins. The function is not known except that the
protein is essential to viral survival.
Length = 445
Score = 32.9 bits (76), Expect = 0.35
Identities = 21/81 (25%), Positives = 40/81 (49%), Gaps = 5/81 (6%)
Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
+K K + ++K++K++ +I K K E + +EE + E++ E E +
Sbjct: 270 NEHRKLKREAAEEKEEKEEKEI-RKLYLYSKLEGKLPTEEEFLEYIEEKNPELLETAEEL 328
Query: 176 DKEKPVE---KIRKEEKDSEK 193
+E+ VE K +EK+ EK
Sbjct: 329 IEEEEVEHQAK-TADEKNLEK 348
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 33.1 bits (76), Expect = 0.37
Identities = 19/92 (20%), Positives = 54/92 (58%)
Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
D+++++ E + E E+E + EE+ ++ E + E + E++ + +E++ E +++
Sbjct: 473 DRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETI 532
Query: 193 KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+EK+ + ++ + + +L+ + +EK ++ E E
Sbjct: 533 EEKRERAEELRERAAELEAEAEEKREAAAEAE 564
Score = 28.5 bits (64), Expect = 10.0
Identities = 23/89 (25%), Positives = 52/89 (58%), Gaps = 7/89 (7%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK--EKPVEKIRKEEKDSEK---EK 195
E+D+E+ E+ + + E+ E EE E+ EE+ + ++ E +E++ + +D E+ E+
Sbjct: 471 EEDRERVEELEAELEDLE--EEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAER 528
Query: 196 KSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ ++K+ + E+L+++ E E + EK
Sbjct: 529 RETIEEKRERAEELRERAAELEAEAEEKR 557
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 32.8 bits (75), Expect = 0.38
Identities = 18/93 (19%), Positives = 41/93 (44%), Gaps = 1/93 (1%)
Query: 120 KEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
K K+ KK K + K K + KE K++E +++ +++ + +
Sbjct: 325 KNDSKLNKKYILKIANDWARKKVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSN 384
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
+K +E+ ++ E ++ K+L+ K K+
Sbjct: 385 PEYKKTTSQEEQAKLEFIKLEELKRLENGKQKR 417
Score = 31.2 bits (71), Expect = 1.1
Identities = 16/65 (24%), Positives = 29/65 (44%), Gaps = 1/65 (1%)
Query: 103 EIKKDKSIKKDKKTHKKK-EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
E K+ +K+ KK+ K K+ K + K +E+ + E K EE ++ E
Sbjct: 352 EAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLE 411
Query: 162 EKEKK 166
++K
Sbjct: 412 NGKQK 416
Score = 29.7 bits (67), Expect = 3.2
Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 11/81 (13%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
K E IK+ + +K+ K + ++KE + KK ++++ K
Sbjct: 348 KTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAK--------- 398
Query: 179 KPVEKIRKEEKDSEKEKKSKD 199
+E I+ EE + K K
Sbjct: 399 --LEFIKLEELKRLENGKQKR 417
Score = 28.9 bits (65), Expect = 5.4
Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
KI N K+ K + +E K++E +++ K + P + +K + +E+
Sbjct: 337 KIANDWARKKVKTAYE-AIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEE 395
Query: 196 KSKDKDKKLKKEKLKKKKKEK 216
++K + KL++ K + K+K
Sbjct: 396 QAKLEFIKLEELKRLENGKQK 416
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 33.2 bits (76), Expect = 0.38
Identities = 23/73 (31%), Positives = 30/73 (41%), Gaps = 5/73 (6%)
Query: 149 KEKGKKEEKEKKEEKEKK-EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
KE+G EE +K E KM+E E+ E EE D K + + K+
Sbjct: 818 KEEGTYEELSNNGPLFQKLMENAGKMEEYVEENGE----EEDDQTSSKPVANGNANNLKK 873
Query: 208 KLKKKKKEKERSS 220
KKK KE S
Sbjct: 874 DSSSKKKSKEGKS 886
Score = 32.0 bits (73), Expect = 0.86
Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 4/62 (6%)
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E K E+ +E E + + KPV + +K S K K + + + K++E
Sbjct: 838 ENAGKMEEYVEENGEEEDDQTSSKPVA----NGNANNLKKDSSSKKKSKEGKSVLIKQEE 893
Query: 216 KE 217
+E
Sbjct: 894 RE 895
>gnl|CDD|187810 cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Multidomain protein with permuted HD nuclease domain,
palm domain and Zn-ribbon; MTH326-like has inactivated
polymerase catalytic domain; alr1562 and slr7011 -
predicted only on the basis of size, presence of HD
domain, and location with RAMPs in one operon; signature
gene for type III; also known as Crm2 family.
Length = 475
Score = 32.8 bits (75), Expect = 0.39
Identities = 26/172 (15%), Positives = 62/172 (36%), Gaps = 9/172 (5%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKD--KEKKEKEKGKKEEKEKKEEKEKKEEKE-FKMKE 175
++E E++K++ +++ + I NK K+ + + K + ++ ++ E + E
Sbjct: 90 EEEAEELKEEIEEETWESILNKVKEYLDIENVEPFLLKSDALERISDRIEDGNEDREEAE 149
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 235
+ E + +R K + +K+ EK + K+ L K G
Sbjct: 150 ELEAARKNVRDFAPVPWIPPAPKIYIYIVSIFN-EKELSEKYKEFLLKKRNGLLLDKLGG 208
Query: 236 DMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQVW-----ICPAC 282
++ + ++ + A + D K + +CP C
Sbjct: 209 RARKEYRSVCGLLAAAWKLEKETEQYSLFAKDDLEEDLKLLLEAGERLCPLC 260
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 32.8 bits (75), Expect = 0.39
Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 10/95 (10%)
Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE---FKMKE--DKEKPVEKIRKE 187
DK ++ + D+ E EK K E++ +E E K E + MK EK K+
Sbjct: 508 DKGRLSKADIDRMVNEAEKYKAEDEANRERVEAKNGLENYCYSMKNTLQDEKVKGKLSDS 567
Query: 188 EKDSEKEKKSK-----DKDKKLKKEKLKKKKKEKE 217
+K + ++ + +K++ +KE+ + K+KE E
Sbjct: 568 DKATIEKAIDEALEWLEKNQLAEKEEFEHKQKEVE 602
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 32.3 bits (73), Expect = 0.39
Identities = 26/112 (23%), Positives = 52/112 (46%), Gaps = 7/112 (6%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF-------K 172
+E + K + + + KE+ E++E+ + E EKEK ++K+ +
Sbjct: 180 NVREHLWNKAATEREKRQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQ 239
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
E E E+ + E D ++++ +K K +K K + KK +K +E+E
Sbjct: 240 SMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQE 291
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 32.7 bits (74), Expect = 0.39
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 5/91 (5%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGK--KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR 185
++++ E+D+E EKE GK E+ E++EE+E K K+F E + K +
Sbjct: 136 PDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTK--KPSV 193
Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
K+ S S KKL+K + K+K K+
Sbjct: 194 KDNGKSFWSAASV-FSKKLQKWRQKQKLKKP 223
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 32.7 bits (74), Expect = 0.39
Identities = 24/140 (17%), Positives = 59/140 (42%), Gaps = 7/140 (5%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK------IKNKEKDKEKK 147
K S + + D++ KK + K KK + + + N++ +K+
Sbjct: 50 KKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKE 109
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
K+ ++ ++ EE++ + K K + V+K+ ++ +D E + D ++
Sbjct: 110 SKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVT 169
Query: 208 KLKKKKKEK-ERSSHEKEVI 226
L+ + +E+ + + E I
Sbjct: 170 SLENESEEELDLEKDDGEDI 189
Score = 30.0 bits (67), Expect = 3.1
Identities = 24/149 (16%), Positives = 56/149 (37%), Gaps = 15/149 (10%)
Query: 40 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 99
+ D+ K P + K + E + EL V + +
Sbjct: 55 ALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSEL-----------VVNED 103
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
+ +S K ++T ++K +++K + K++ + K K+ E + + E E
Sbjct: 104 AALD---KESKKTPRRT-RRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEV 159
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
+ +E + + + ++E +EK E+
Sbjct: 160 SDVEESEFVTSLENESEEELDLEKDDGED 188
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 31.6 bits (72), Expect = 0.40
Identities = 21/62 (33%), Positives = 40/62 (64%), Gaps = 3/62 (4%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+KK KEK+++ K K K++ EK+KEK+E+ E EK+ + +K++ E K+ +++
Sbjct: 65 RKKAKEKLRRDKLKAKKEEA---EKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKEE 121
Query: 178 EK 179
+K
Sbjct: 122 KK 123
Score = 30.4 bits (69), Expect = 0.85
Identities = 26/100 (26%), Positives = 50/100 (50%), Gaps = 2/100 (2%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
KK KKK+K +++ ++ + ++ +G+K+ KEK + K +KE
Sbjct: 26 KKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAE 85
Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
KE +++ E+ K ++EKE+ +K K K ++KK
Sbjct: 86 KEKEKE--ERFMKALAEAEKERAELEKKKAEAKLMKEEKK 123
>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Very large protein
containing McrA/HNH-nuclease related domain and a
RuvC-like nuclease domain; signature gene for type II.
Length = 799
Score = 32.8 bits (75), Expect = 0.41
Identities = 26/100 (26%), Positives = 45/100 (45%), Gaps = 7/100 (7%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF-KMKED 176
KK + +K+K K++ K E + EK + + K KEF K E
Sbjct: 324 KKNKLTYKQKRKLLGLKEEEIFKGLRYEGLKAEKNFNISLKTYHDLRKALGKEFLKDLEL 383
Query: 177 KEKPVEKIRK-----EEKDSEKEKKSKDKDKKLKKEKLKK 211
EK +++I K ++++ ++ KD L +E+LKK
Sbjct: 384 NEKILDEIVKILTLYKDREMIEKILELYKD-LLNEEQLKK 422
>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Proteins. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Vetebrates contain two
Proline-Serine-Threonine Phosphatase-Interacting
Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
mainly expressed in hematopoietic cells and are involved
in the regulation of cell adhesion and motility.
Mutations in PSTPIPs have been shown to cause
autoinflammatory disorders. PSTPIP1 contains an
N-terminal F-BAR domain, PEST motifs, and a C-terminal
SH3 domain, while PSTPIP2 contains only the N-terminal
F-BAR domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 239
Score = 32.1 bits (73), Expect = 0.41
Identities = 26/84 (30%), Positives = 50/84 (59%), Gaps = 1/84 (1%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
E+ +EK+++E+ K E+ K+ +K KKE + K + K+ +K R+++K + +KS
Sbjct: 91 EEFREKQKEERKKTEDIMKRSQKNKKELYK-KTMKAKKSYEQKCREKDKAEQAYEKSSSG 149
Query: 201 DKKLKKEKLKKKKKEKERSSHEKE 224
+ + EKLKKK + + S+ E +
Sbjct: 150 AQPKEAEKLKKKAAQCKTSAEEAD 173
Score = 29.8 bits (67), Expect = 2.4
Identities = 20/74 (27%), Positives = 45/74 (60%), Gaps = 1/74 (1%)
Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLK 205
+E EK ++ +++KEE++K E+ + +++K++ +K K +K E++ + KDK ++ +
Sbjct: 85 EEAEKLEEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKDKAEQAYE 144
Query: 206 KEKLKKKKKEKERS 219
K + KE E+
Sbjct: 145 KSSSGAQPKEAEKL 158
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 32.6 bits (75), Expect = 0.41
Identities = 13/82 (15%), Positives = 35/82 (42%), Gaps = 3/82 (3%)
Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
D E+E + + ++++ + E+ ++ E+ K E D + + D D
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDD---DEESDEEDEESSKSEDDEDDDDDDDDDDI 158
Query: 203 KLKKEKLKKKKKEKERSSHEKE 224
++ L+++++ +E E
Sbjct: 159 ATRERSLERRRRRREWEEKRAE 180
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 31.8 bits (73), Expect = 0.41
Identities = 34/153 (22%), Positives = 66/153 (43%), Gaps = 36/153 (23%)
Query: 72 EKEYSKVKDVELGVTPM-VPNIVKVTSSEDLGEIKKDK--------SIKKDKKTHKKKEK 122
EK KVK G++ M V +++ + L ++ +K S K
Sbjct: 18 EKLAPKVK----GISSMSVKEVLQSLVDDGL--VRTEKIGTSNYYWSFPSQALNKLKTRL 71
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
EK+KK+ + E + E + ++ K+ +EE E++ E E
Sbjct: 72 EKLKKELE----------ELKQRIAELQAQIEKLKKGREETEERTELL-----------E 110
Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
++++ EK+ +K K +K +K E+++K K+E
Sbjct: 111 ELKQLEKELKKLKAELEKYEKNDPERIEKLKEE 143
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 31.7 bits (72), Expect = 0.42
Identities = 19/85 (22%), Positives = 40/85 (47%), Gaps = 1/85 (1%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K ++ K++K+ +K+K+ K K K ++ ++K E + + +K+ E
Sbjct: 101 AKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAA 160
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEK 189
+E + +E +E P E+ EE
Sbjct: 161 -EEAAAAEEEEAEEAPAEEAPAEES 184
Score = 31.7 bits (72), Expect = 0.44
Identities = 20/76 (26%), Positives = 28/76 (36%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
K+ K KK K K K K + EKK E + EKK + E ++E
Sbjct: 110 KEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169
Query: 179 KPVEKIRKEEKDSEKE 194
+ E +E E
Sbjct: 170 EAEEAPAEEAPAEESA 185
Score = 29.0 bits (65), Expect = 4.2
Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 1/68 (1%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
E + + KKDK + KK +K + +KK + +K K + E+
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEK-KAAEAAAVAAEEAAA 165
Query: 161 EEKEKKEE 168
E+E+ EE
Sbjct: 166 AEEEEAEE 173
Score = 28.2 bits (63), Expect = 7.5
Identities = 15/55 (27%), Positives = 26/55 (47%)
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
++KE K K K K KK K+ E E K+ E + + V + + E + ++
Sbjct: 108 EEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEE 162
>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 424
Score = 32.6 bits (75), Expect = 0.42
Identities = 15/88 (17%), Positives = 36/88 (40%), Gaps = 2/88 (2%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+ + K+K + I + +K + +D + + + +KE ++ E
Sbjct: 54 DETPKKNPVLREEKRKPAKSILSLQALLEKRPSRTNSQDALLQAENALPEWKKELEKPSE 113
Query: 165 KK--EEKEFKMKEDKEKPVEKIRKEEKD 190
K E K + +K ++++R E K+
Sbjct: 114 PKEEEPKAAAESKVVQKELDELRDELKE 141
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 32.7 bits (75), Expect = 0.44
Identities = 12/99 (12%), Positives = 37/99 (37%), Gaps = 1/99 (1%)
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK-EEKEKKEEKEKKEEKEFKMKE 175
++ + +K + ++ K ++ K EK++++++ + + +K
Sbjct: 724 PRRGKITVTDEKCPECGLPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKG 783
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
K EK++ ++ + + D D K +
Sbjct: 784 VGAKTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 32.8 bits (75), Expect = 0.47
Identities = 15/96 (15%), Positives = 33/96 (34%), Gaps = 11/96 (11%)
Query: 122 KEKIKKKKDKKDKD-KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
+ +K + D + + + + +E K + + + K E K
Sbjct: 856 RNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNK------ 909
Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
EEK +++ D ++L K + + K K
Sbjct: 910 ----YNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 32.4 bits (74), Expect = 0.48
Identities = 22/161 (13%), Positives = 59/161 (36%)
Query: 48 DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKD 107
DK+ ++ + ++K K++T + + D E + E
Sbjct: 57 DKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEPAETAGSL 116
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
+ I+ ++ +E + K ++ + + +E + + + + EE+
Sbjct: 117 RPIRSSAAAYRDINRELGSFYEYPKTDEEKELLREVEELESRLATEPSPAPELEEQLALM 176
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
EK +++ ++ +S+K KK + + +EK
Sbjct: 177 EKSYELAAKYMPRGQERLPVAPESKKGKKPSVQPVRAVEEK 217
Score = 29.7 bits (67), Expect = 2.9
Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 6/71 (8%)
Query: 155 EEKEKKEEKEKK------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
EE++ EE+ K + K+ + DK K E+ E+K +K++ +D +K
Sbjct: 30 EEEDAAEEETKGFNADIPQPKDEGIIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKSD 89
Query: 209 LKKKKKEKERS 219
+ + S
Sbjct: 90 KESQDYSSSDS 100
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.4 bits (74), Expect = 0.49
Identities = 15/114 (13%), Positives = 35/114 (30%), Gaps = 13/114 (11%)
Query: 99 EDLGEIKKDK-SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
L ++ + I ++ E+ + ++ K +E+ K + ++
Sbjct: 168 ATLKQLAAVRAEIAAEQAELTTLLSEQ----RAQQAKLAQLLEERKKTLAQLNSELSADQ 223
Query: 158 EKKEEKEKKEEK--------EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+K EE E + E + +E + E K + K
Sbjct: 224 KKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAEAKRTGETYK 277
Score = 29.3 bits (66), Expect = 4.0
Identities = 17/89 (19%), Positives = 40/89 (44%), Gaps = 6/89 (6%)
Query: 130 DKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
D ++K +K+ EK+ +++++ K EK+ K + + +E E
Sbjct: 33 AAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLET------EIASLEAQLIETA 86
Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
D K+ + + D + L+ +++E+ R
Sbjct: 87 DDLKKLRKQIADLNARLNALEVQEREQRR 115
>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
[Transcription].
Length = 507
Score = 32.4 bits (73), Expect = 0.49
Identities = 26/166 (15%), Positives = 59/166 (35%), Gaps = 3/166 (1%)
Query: 42 PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDL 101
P ++ D K K+ +K+ + N TS+
Sbjct: 135 PKKIRLGSITENDMNLKTFKRHRVLGKPSSAKKPAKISPPTAMTDSLDRNFSSETSTSRE 194
Query: 102 GEIKKDKSIKKDKKTHKK-KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
+ ++ I K K KK ++ E K D+++ + + + + E +K + KK
Sbjct: 195 ADENENYVISKVKDIPKKVRDGESAKYFIDEENFTMAELCKPNFPIQISENFEKSKMAKK 254
Query: 161 EEKEKKEE-KEFKMK-EDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
+ EK+ K + + + K + + ++++S+ K
Sbjct: 255 AKLEKRRHVKFLEGSNTHEMDQLLKHFLDNSNFRQDRRSRKKKASA 300
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 32.3 bits (73), Expect = 0.51
Identities = 29/164 (17%), Positives = 68/164 (41%), Gaps = 18/164 (10%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-KEKKSKD 199
E DK++ +K + + E+ ++E+ + ++ +++ EK +++ E E E +S
Sbjct: 367 EADKDRIQKLSAELLKLEEWLQEERSQREKLEVELGIEKDCNRVQLSENRRELSELRSAL 426
Query: 200 KDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP 259
+ + +KE+L+++K+E + I L + + +EK +++
Sbjct: 427 RVLQKEKEQLQEEKQELL------DYIRVLELRLDKEADEKWAEDAA----TCEDAKSYL 476
Query: 260 PPVAAADEYDTGD-------SKQVWICPACGVQDDGSLPMIGCD 296
A D D+ D S ++ Q + S P +
Sbjct: 477 HRAAYIDLLDSIDESPLDESSSELSPSALDEEQHNESTPKNPKE 520
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 32.7 bits (75), Expect = 0.51
Identities = 26/73 (35%), Positives = 45/73 (61%), Gaps = 8/73 (10%)
Query: 119 KKEKE--KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE----KEKKEEKEKKEEK--E 170
KKE E K++ K KK ++K++ +++ +E+ K K+E K+ K+E EK +EK E
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEE 258
Query: 171 FKMKEDKEKPVEK 183
K+ E+ +K +EK
Sbjct: 259 LKLPEEVKKVIEK 271
Score = 32.3 bits (74), Expect = 0.58
Identities = 25/81 (30%), Positives = 48/81 (59%), Gaps = 4/81 (4%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK---IRKEEKDSEKEKKSKDK 200
K++ E K + + +K EEK +K ++E+ ++E + K ++K I K++KD ++ K K +
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLRE-QLKAIKKELGIEKDDKDELEKLKEKLE 257
Query: 201 DKKLKKEKLKKKKKEKERSSH 221
+ KL +E K +KE + S
Sbjct: 258 ELKLPEEVKKVIEKELTKLSL 278
Score = 29.6 bits (67), Expect = 3.7
Identities = 18/46 (39%), Positives = 27/46 (58%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 144
E L IKK+ I+KD K +K KEK+++ K ++ K+ KE K
Sbjct: 230 EQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTK 275
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 32.0 bits (73), Expect = 0.52
Identities = 28/127 (22%), Positives = 63/127 (49%), Gaps = 5/127 (3%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E+ +++ +K+ + + I+ + ++ K E+++E +EK KE E
Sbjct: 93 ENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREK---LKELIE 149
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ E +E+ EK K K+ +++ + K E+ E+ ++ ++K K+ K+ LK+ +
Sbjct: 150 QYELREQHFEKLLKTKDLEQQLNDA--KLEQAQEQAEQEQEKHKREKEILLKEAAQVSTL 207
Query: 219 SSHEKEV 225
EKE+
Sbjct: 208 KETEKEL 214
>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
lipoprotein. This model represents a family of six
predicted lipoproteins from a region of about 20
tandemly arranged genes in the Treponema denticola
genome. Two other neighboring genes share the
lipoprotein signal peptide region but do not show more
extensive homology. The function of this locus is
unknown.
Length = 412
Score = 32.2 bits (73), Expect = 0.52
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 4/66 (6%)
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
KKE+KE E + EK KM+E K E + K E ++ + + K+ K + K
Sbjct: 22 KKEDKESGLR-EMRVEKFSKMEE---KENEFLSKPEYNTHVKSPEQIKELSNKISGVSGK 77
Query: 213 KKEKER 218
+ +
Sbjct: 78 GESDKF 83
Score = 29.9 bits (67), Expect = 3.0
Identities = 17/72 (23%), Positives = 33/72 (45%)
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
K++KE G +E + +K K +++E EF K + V+ + ++ S K K + K
Sbjct: 23 KEDKESGLREMRVEKFSKMEEKENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGESDK 82
Query: 206 KEKLKKKKKEKE 217
K + +
Sbjct: 83 FVGTPLVKDDYD 94
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 32.1 bits (73), Expect = 0.54
Identities = 22/76 (28%), Positives = 47/76 (61%), Gaps = 1/76 (1%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
E++K +S+++ K+ +++ +K+ I K K K+ K + + KK++ + K E KK++
Sbjct: 13 ELRKAQSLEELKQKYEEAQKQ-IADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKKK 71
Query: 163 KEKKEEKEFKMKEDKE 178
K KKE+ + ++K K+
Sbjct: 72 KFKKEKVDVRVKVVKK 87
Score = 28.7 bits (64), Expect = 6.5
Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 4/67 (5%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
E+ K K EE +K+ K ++ +K+ E +E ++K + E+ +E KK KK KKE
Sbjct: 21 EELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELSKKK----KKFKKE 76
Query: 208 KLKKKKK 214
K+ + K
Sbjct: 77 KVDVRVK 83
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 32.2 bits (74), Expect = 0.55
Identities = 24/93 (25%), Positives = 46/93 (49%), Gaps = 10/93 (10%)
Query: 121 EKEKIKKKKDKKD-KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
E+ I + KK +D + + ++ E K K + K + KKE ++ ++ E +MKE
Sbjct: 580 EEHGITPQTIKKKIRDILDGEYEEDEYKAKIEKKASKMSKKELEKLIKKLEKEMKE---- 635
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+ + E+ + +D+ K+LK+E L K
Sbjct: 636 -----AAKNLEFEEAARLRDEIKELKEELLGKS 663
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 32.4 bits (74), Expect = 0.55
Identities = 18/114 (15%), Positives = 44/114 (38%), Gaps = 10/114 (8%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
K+ E E K + + E+ K + E E K++ E + ++ + + K ++
Sbjct: 641 KRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQ 700
Query: 178 EKPVEKIRKEEKDSEK-EKKSK------DKDKKLKKEKLKKKKKEKERSSHEKE 224
++ + K++ + E+ +K + D +L E R+ +
Sbjct: 701 QQAFLEALKDDFRELRTERLAKWQVVEGELDNQL---AQLSAAIEAARTQAKAR 751
Score = 30.4 bits (69), Expect = 2.5
Identities = 31/120 (25%), Positives = 59/120 (49%), Gaps = 11/120 (9%)
Query: 114 KKTHKKKEKEKIKKKKDK--KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+T K +++KK+ D+ D N K+ +++ +E E+ E +E + F
Sbjct: 744 ARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAF 803
Query: 172 KMKED--------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
M+E +E+P I+ E +S E+ ++ + +K KL++KK E+ER + EK
Sbjct: 804 -MQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEK 862
Score = 29.3 bits (66), Expect = 4.8
Identities = 27/138 (19%), Positives = 59/138 (42%), Gaps = 16/138 (11%)
Query: 102 GEIKKDKS----IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
++ D+S ++ K + + E+++ D D++ + + E+ E K
Sbjct: 320 AKLAADRSELELLEDQKGAFEDADIEQLQA-----DLDQLPSIRSELEEVEARLDALTGK 374
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIR------KEEKDSEKEKKSKDKDKKLKKEKLKK 211
+ + K E + K+KE E+ +EK +EEKD +K +D + + +
Sbjct: 375 HQDVQ-RKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQL 433
Query: 212 KKKEKERSSHEKEVIPKL 229
+ + E + E E+ +L
Sbjct: 434 EAGKLEFNEEEYELELRL 451
>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
Length = 985
Score = 32.5 bits (74), Expect = 0.57
Identities = 30/123 (24%), Positives = 55/123 (44%), Gaps = 12/123 (9%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD---------KDKIKNKEKDKEKKEKEKG 152
EI + IK+ +++ + I K++ K D IK+ D + E +
Sbjct: 563 KEINDESFIKQGFAKDEEELSKLILKERGKVLPETLEKLLFGDLIKDDYLDLNENEFAEK 622
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKK--EKL 209
+ E + K+E + EK + +D+ ++IR +E+ EK K K+ K+ K EK+
Sbjct: 623 EPELDKLKDELYEAFEKIKDLIKDRLIGADRIRNAKERKKEKIKIRKENFKEFKALWEKI 682
Query: 210 KKK 212
K
Sbjct: 683 NHK 685
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 32.3 bits (74), Expect = 0.57
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKE 167
EK+KEKK K++GK+E + + K +E
Sbjct: 395 EKEKEKKAKKEGKEERRIHFQNKSIEE 421
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 32.4 bits (75), Expect = 0.59
Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 10/65 (15%)
Query: 118 KKKEKEKIKKKKDK--KDKDKIKNK-------EKDKEKK-EKEKGKKEEKEKKEEKEKKE 167
+ E +++K+ K K+ ++++ K K E+ EKE+ K E E+K K K+
Sbjct: 809 VEAELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLAKLKER 868
Query: 168 EKEFK 172
K
Sbjct: 869 LARLK 873
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 32.2 bits (74), Expect = 0.61
Identities = 26/105 (24%), Positives = 61/105 (58%), Gaps = 4/105 (3%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
+ E+E +KK+KD+ K+++++ EK+ + E+E EE+ K E+ + ++ +KE+ E
Sbjct: 418 EIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQ--IKEEIE 475
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
+ ++ + E++ + K ++ + KL +L+K+ + E E+
Sbjct: 476 QVRLELEQAEREGDLAKAAELQYGKL--PELEKRLQAAEAKLGEE 518
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 32.1 bits (73), Expect = 0.64
Identities = 28/117 (23%), Positives = 43/117 (36%), Gaps = 12/117 (10%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNK---EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE 175
K ++I+K+ D +K E K K+KK K
Sbjct: 374 KGIYDQIEKEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKG----KSTS 429
Query: 176 DKEKPVEKIRKEEKDSEKE-----KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
K E I +E+D+ K+ KK +DK K+ + KKE +S + IP
Sbjct: 430 TKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDNNNIP 486
Score = 29.4 bits (66), Expect = 4.4
Identities = 22/105 (20%), Positives = 36/105 (34%), Gaps = 14/105 (13%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK-------------KDKDKIKNKEK 142
SS L + + + KKKK K D++ K K
Sbjct: 391 ASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGK 450
Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
+KK ++K K + K KKE + + + P E + K+
Sbjct: 451 KNQKKGRDKSSKVPSDSK-AGGKKESVKSQEDNNNIPPEEWVMKK 494
Score = 28.6 bits (64), Expect = 7.3
Identities = 26/100 (26%), Positives = 42/100 (42%), Gaps = 4/100 (4%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEK-EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
+S K K KK K T K E I ++ K KN++K ++K K
Sbjct: 409 SSPAASNSDKGSKK-KKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDS 467
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
+ KKE + +E+ E E ++KI + D E++
Sbjct: 468 KAGGKKESVKSQEDNNNIPPE--EWVMKKILEWVPDLEED 505
>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
subunit A'' fusion protein; Provisional.
Length = 509
Score = 32.1 bits (73), Expect = 0.65
Identities = 22/153 (14%), Positives = 61/153 (39%), Gaps = 5/153 (3%)
Query: 44 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 103
++ ++ T K++ + S YS + P++ ++++ S E +
Sbjct: 27 FEVPKNITDGYITLKKKEFTYHVIISKIAP-YSNSNGIIKKKKPVLKTVLEIESEEKIEA 85
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKEKEKGKKEEK--EKK 160
I + + + + + ++K+ + N +D K + K +
Sbjct: 86 IDLMEFKRLFGRILDENMSFSTGELLTAEEKEYYEENSNEDVLKVIDDVKKLGFRLPPSV 145
Query: 161 EEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSE 192
E+ K K+ ++ +D+ E+ + +IR+E + +
Sbjct: 146 IEEIAKAMKKKELSDDEYEEILRRIREEYERAR 178
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 31.9 bits (73), Expect = 0.65
Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 3/91 (3%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
KE +K + D D K D E+K+ + EE + K + K+ K K K+ +
Sbjct: 15 KESLKARGLSVDIDLEKLIALDDERKKLLS-EIEELQAKRNELSKQIG--KAKGQKKDKI 71
Query: 182 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
E+I+KE K+ ++E K + +L+ K
Sbjct: 72 EEIKKELKELKEELTELSAALKALEAELQDK 102
Score = 31.6 bits (72), Expect = 0.77
Identities = 24/91 (26%), Positives = 41/91 (45%), Gaps = 11/91 (12%)
Query: 118 KKKEKEKIKKKKDKKDKDKI-----KNKEKDKE-----KKEKEKGKKEEKEKKEEKEKKE 167
KE K + D +K+ + K+ E K E K+ K K ++K+K E
Sbjct: 13 LVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIE 72
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
E + ++KE KE+ + ++ K E E + K
Sbjct: 73 EIKKELKELKEE-LTELSAALKALEAELQDK 102
Score = 28.5 bits (64), Expect = 7.8
Identities = 27/122 (22%), Positives = 54/122 (44%), Gaps = 5/122 (4%)
Query: 103 EIKKDKSIKKDKKTHKK-KEKEKIKKKKDKKDKDKIKNKEKDKEKKEK-EKGKKEEKEKK 160
+I +K I D + K E E+++ K+++ K + +KK+K E+ KKE KE K
Sbjct: 26 DIDLEKLIALDDERKKLLSEIEELQAKRNELSKQI---GKAKGQKKDKIEEIKKELKELK 82
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
EE + ++ + + + I +S K ++ + ++K+ K +
Sbjct: 83 EELTELSAALKALEAELQDKLLSIPNIPHESVPVGKDEEDNLEVKRWGTPPVFDFKPKPH 142
Query: 221 HE 222
E
Sbjct: 143 WE 144
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 30.1 bits (68), Expect = 0.65
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
+K K K T +K EK +++ K ++K EK+ E++ + + KE + KEEK
Sbjct: 16 TEKKAFRPKSKLTSWEKRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEK 75
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEK 189
E+ E+ KM K VE++++ EK
Sbjct: 76 ERYEKMAAKM---HAKKVERLKRREK 98
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 32.3 bits (74), Expect = 0.66
Identities = 19/97 (19%), Positives = 32/97 (32%), Gaps = 6/97 (6%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKK----EEKEFKMKEDKEKPVEKIRKEE 188
K + KE ++E +E EK E E K E K + + +
Sbjct: 512 KLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAK 571
Query: 189 KDSEKE-KKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
++ E K K K L + K+K+ +
Sbjct: 572 VEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEA 608
Score = 30.4 bits (69), Expect = 2.1
Identities = 30/139 (21%), Positives = 53/139 (38%), Gaps = 15/139 (10%)
Query: 141 EKDKEKKEKEKGKKEE-KEKKEEKEKKEEKEFKMKEDKEKPVEKIR---KEEKDSEKEKK 196
E+ K + +EK EE EK EE K+ E+E + E VE + K S E K
Sbjct: 493 EEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELK 552
Query: 197 SKDKDKKLKK-----EKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 251
++ E+ K + ++ + +K+ I F + +++K +
Sbjct: 553 AQIVVLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAG-AFESTLKDKKNKVLPEAKKR 611
Query: 252 KP-----VKSPSPPPVAAA 265
K V+ + P A
Sbjct: 612 KLKLGDEVEVITGEPGAVV 630
Score = 28.8 bits (65), Expect = 6.8
Identities = 19/105 (18%), Positives = 37/105 (35%), Gaps = 16/105 (15%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEK--DKEKKEKEKGKKEEKEKKEEKEKKEEKEF-KMKEDK 177
EK + +K+ +++ ++++ + GK E K + E+
Sbjct: 511 EKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEA 570
Query: 178 EKPVEKIRKEEKDSE------------KEKKSKDKDKKLKKEKLK 210
+ K K + K+KK+K + KK KLK
Sbjct: 571 KVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEA-KKRKLK 614
Score = 28.5 bits (64), Expect = 8.5
Identities = 32/140 (22%), Positives = 55/140 (39%), Gaps = 6/140 (4%)
Query: 64 SKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKE 123
S KTS E + V +G+ P+ KV + +++ +KK K I D + K+
Sbjct: 542 SGGKTSLLELKAQIVVLAHMGL-PVPAEEAKVEAVDEVKFLKKKKGIL-DAGAFESTLKD 599
Query: 124 KIKKKKDKKDKDKIKNKEKDK-EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
K K + K K+ K D+ E E G + + + K+ +E
Sbjct: 600 KKNKVLPEAKKRKL--KLGDEVEVITGEPGAVVKIIAGILEALVQSGILKVIV-SHLDLE 656
Query: 183 KIRKEEKDSEKEKKSKDKDK 202
I++E K K ++ D
Sbjct: 657 IIKEEPKVKVKGIEAPGLDS 676
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 32.3 bits (73), Expect = 0.66
Identities = 29/132 (21%), Positives = 64/132 (48%), Gaps = 8/132 (6%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK---EKDKEKKEKEKGKKEEK 157
L + +K++ K+ K + K K+K+KN KD E K ++ GK +
Sbjct: 914 LETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQD-GKDDYL 972
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEK---IRKEEKDSEKEKKSKDKDK-KLKKEKLKKKK 213
++KE + + + E ++ + + + +++ D++K ++ +D L+K + + K+
Sbjct: 973 KQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKE 1032
Query: 214 KEKERSSHEKEV 225
E+E H KE+
Sbjct: 1033 VEEELKQHLKEM 1044
Score = 31.6 bits (71), Expect = 1.2
Identities = 31/147 (21%), Positives = 66/147 (44%), Gaps = 1/147 (0%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK-EKKEKEKGKKEEKEKKE 161
E + + + + K + D + K+++K ++ D+ ++K G+ E +K+
Sbjct: 389 ERQIKNFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEI 448
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
++K+EE +F +KE ++ R E D E K ++ K K + KKE + +
Sbjct: 449 LEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQN 508
Query: 222 EKEVIPKLTFKFGTDMEEKTKRESSPK 248
EK + + K +ME+ ++
Sbjct: 509 EKADLDRKLRKLDQEMEQLNHHTTTRT 535
Score = 30.0 bits (67), Expect = 2.8
Identities = 25/113 (22%), Positives = 52/113 (46%), Gaps = 6/113 (5%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
IK +++++ ++T +K +E + K K + KEK E +++ K+ + E E
Sbjct: 185 IKALETLRQVRQTQGQKVQEHQMELKYLK-----QYKEKACEIRDQITSKEAQLESSREI 239
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
K E E +++ K +E + + E K+ +K + EK + + K
Sbjct: 240 VKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKS-RKKQMEKDNSELELK 291
Score = 30.0 bits (67), Expect = 2.9
Identities = 25/108 (23%), Positives = 51/108 (47%), Gaps = 3/108 (2%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+ E + +K + + + + K + D E K + KK+ ++ E E K EK F+ +++
Sbjct: 243 YENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQ 302
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
+ + EKE++ D ++L EKL K+++ + E V
Sbjct: 303 LNDLYHNHQRTV-REKERELVDCQREL--EKLNKERRLLNQEKTELLV 347
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 30.5 bits (69), Expect = 0.66
Identities = 29/101 (28%), Positives = 51/101 (50%), Gaps = 9/101 (8%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE-EKEFKMKED 176
K KE+ K+ K + K+ K K + E+ K+E + +E K +KE EKE D
Sbjct: 27 IKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEFAD 86
Query: 177 KEKPV--------EKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
KEK V E+ RK E++ ++ ++ ++++ K + L
Sbjct: 87 KEKFVTSAYKKQLEENRKLEEEEKEREELEEENDVTKGKDL 127
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 32.2 bits (73), Expect = 0.66
Identities = 6/85 (7%), Positives = 30/85 (35%), Gaps = 7/85 (8%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
++ + + DK + + + + +D+ + + ++ + + + + + +
Sbjct: 11 RNDTRRSDKGRERSRRRS---RSRDRSRRRRDRDYYRGRRGRSRSRSPN----RYYRPRG 63
Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKD 190
+ E + + E+D
Sbjct: 64 DRSYRRDDRRSGRNTKEPLTEAERD 88
Score = 28.7 bits (64), Expect = 6.5
Identities = 7/82 (8%), Positives = 32/82 (39%), Gaps = 2/82 (2%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+ +++E+ + + D + DK + + + + + +++ ++ + + +
Sbjct: 1 RYRDRERGRLRNDTRRSDKGRERSRRRS-RSRDRSRRRRDRDYYRGRRGRSRS-RSPNRY 58
Query: 178 EKPVEKIRKEEKDSEKEKKSKD 199
+P D + +K+
Sbjct: 59 YRPRGDRSYRRDDRRSGRNTKE 80
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 32.2 bits (73), Expect = 0.67
Identities = 26/162 (16%), Positives = 69/162 (42%), Gaps = 3/162 (1%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
++ I K E+E+ ++K+ ++++ ++ D K G+ +E +E++
Sbjct: 385 EEVISKYAMKQDDTEEEE-RRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEE 443
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE-- 224
+E + +E++E+ E+ +EE+ ++E++ + + +E+++ + +E
Sbjct: 444 SVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDA 503
Query: 225 VIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 266
+ M E + S P + P P A+
Sbjct: 504 ERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAE 545
>gnl|CDD|234000 TIGR02760, TraI_TIGR, conjugative transfer relaxase protein TraI.
This protein is a component of the relaxosome complex. In
the process of conjugative plasmid transfer the
realaxosome binds to the plasmid at the oriT (origin of
transfer) site. The relaxase protein TraI mediates the
single-strand nicking and ATP-dependent unwinding
(relaxation, helicase activity) of the plasmid molecule.
These two activities reside in separate domains of the
protein.
Length = 1960
Score = 32.2 bits (73), Expect = 0.67
Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 4/104 (3%)
Query: 72 EKEYSKVKDVELGVTPMVPN-IVKVTSSEDLGEIKKDKSIKKDKKTHKKKE---KEKIKK 127
EK +K K + + P+ + E ++ + S +DK K E + K
Sbjct: 1840 EKIINKFKQDNISARIVFPDDWNDIGEEELQKQLMRAISSIEDKDIEIPKAIADFESVLK 1899
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+ I N ++EK + E+ E + ++ E+ KEF
Sbjct: 1900 MPNSDIISIIDNARTEREKSDLEQIVANEHKSQQALERDLNKEF 1943
>gnl|CDD|220265 pfam09491, RE_AlwI, AlwI restriction endonuclease. This family
includes the AlwI (recognises GGATC), Bsp6I (recognises
GC^NGC), BstNBI (recognises GASTC), PleI(recognises
GAGTC) and MlyI (recognises GAGTC) restriction
endonucleases.
Length = 429
Score = 32.0 bits (73), Expect = 0.72
Identities = 23/129 (17%), Positives = 40/129 (31%), Gaps = 11/129 (8%)
Query: 117 HKKKEKEKIKKKKDKKDKD------KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
KK+E ++I K + D + + E+ + +++
Sbjct: 178 EKKEEIDEILKNTARYPIYTRLEDYFEYMSNIDGPLLPTDDLANTKIEQLKIIANNIKEQ 237
Query: 171 FKMKEDKE----KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
F + E K V K+E + KEKK D + E +K E I
Sbjct: 238 FNIAELLSILNSKYVVDELKDELNELKEKKFFDDEYLDLLEVFEKILNEDLYEVTPSADI 297
Query: 227 P-KLTFKFG 234
P +
Sbjct: 298 PAYFEWLTW 306
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 32.0 bits (72), Expect = 0.72
Identities = 22/139 (15%), Positives = 56/139 (40%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
G+ + + + K + EK +++ K ++ K K K ++ ++ + EE+E+
Sbjct: 200 GKRSRAQVVDKLLQLGLVSEKRELRGGKKRRKKLKPKQPNGEESGEDDFQEDPEEEEQLP 259
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
E + +E ++ E + + K++K+ L+ ++ +E
Sbjct: 260 ESKPEETEKRVSAFQVEGSTLISAENLRQQLKQEKTSWPLLWLQSCLIRAADDREEDECD 319
Query: 222 EKEVIPKLTFKFGTDMEEK 240
+ + LT + ME +
Sbjct: 320 QAVPLVPLTEENEEAMENE 338
>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
Length = 650
Score = 32.1 bits (74), Expect = 0.73
Identities = 17/54 (31%), Positives = 28/54 (51%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
D + K K+K + K K ++KN+E KE + K K + K ++KE+K
Sbjct: 194 DAEFKKGKKKFEASFYGYKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKERK 247
Score = 29.4 bits (67), Expect = 4.3
Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 4/52 (7%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKD-KKDKDKIKNKE---KDKEKKEK 149
E KK K + K +K ++K ++D K+ ++K K+ +KKE+
Sbjct: 195 AEFKKGKKKFEASFYGYKGKKIELKNEEDVKEILAELKKKDFKVSKVKKKER 246
>gnl|CDD|234537 TIGR04313, aro_clust_Mycop, aromatic cluster surface protein.
Members of this family are absolutely restricted to the
Mollicutes (Mycoplasma and Ureaplasma). All have a
signal peptide, usually of the lipoprotein type,
suggesting surface expression. Most members have lengths
of about 280 residues but some members have a nearly
full-length duplication. The mostly nearly invariant
residue, a Trp,is part of a strongly conserved 9-residue
motif, [ND]-W-[LY]-[WF]-X-[LF]-X-N-[LI], where X usually
is hydrophobic. Because the hydrophobic six-residue core
of this motif almost always contains three to four
aromatic residues, we name this family aromatic cluster
surface protein. Multiple paralogs may occur in a given
Mycoplasma, usually clustered on the genome.
Length = 297
Score = 31.6 bits (72), Expect = 0.73
Identities = 19/78 (24%), Positives = 28/78 (35%)
Query: 95 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
+ S E K IK K K+ +K K + K + + N ++ EK K K
Sbjct: 21 LISCSSQSENDLKKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLNLVFKNDEDEKNKYIK 80
Query: 155 EEKEKKEEKEKKEEKEFK 172
+K E K K
Sbjct: 81 SQKNINSELYLKLLKLSL 98
Score = 28.5 bits (64), Expect = 7.0
Identities = 15/84 (17%), Positives = 28/84 (33%), Gaps = 4/84 (4%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
+ + D K+ + D +K +K+K K K + ++ E K K
Sbjct: 26 SQSENDLKKLIKNKFDVKKDKKDKKWKVFLNNKYINNLLN----LVFKNDEDEKNKYIKS 81
Query: 188 EKDSEKEKKSKDKDKKLKKEKLKK 211
+K+ E K L+
Sbjct: 82 QKNINSELYLKLLKLSLEYLNNIN 105
>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein. This is a family
of proteins which regulate checkpoint kinases. In
Schizosaccharomyces pombe this protein is called Rad26
and in Saccharomyces cerevisiae it is called LCD1.
Length = 648
Score = 32.1 bits (73), Expect = 0.74
Identities = 13/73 (17%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK-LKKEKLKKKKK 214
+++K+EE+ K++ + ++KE ++ ++K+++E + E E+K +++ L L+ +
Sbjct: 10 QQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSANDLRTELS 69
Query: 215 EKERSSHEKEVIP 227
+
Sbjct: 70 PPSNLLKTSDASH 82
Score = 29.0 bits (65), Expect = 6.9
Identities = 24/105 (22%), Positives = 48/105 (45%), Gaps = 9/105 (8%)
Query: 132 KDKDKIKNKEKDKEKKE--------KEKGKKEEKEKKEEKEKKE-EKEFKMKEDKEKPVE 182
+DK + ++K +E+ + KEK +E ++ K+E + E E++F + E +
Sbjct: 3 RDKLDMLQQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSAN 62
Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 227
+R E K+ D + + K++K S KE++P
Sbjct: 63 DLRTELSPPSNLLKTSDASHIADESQPNSIKQKKREISPVKELVP 107
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 31.8 bits (72), Expect = 0.74
Identities = 16/72 (22%), Positives = 33/72 (45%), Gaps = 10/72 (13%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK---------EEKEKKEEKEKKEEK 169
+E+ + +D+D+ E D+E+K+ +K + K K +E++
Sbjct: 541 TEERIDLDADDWTEDEDE-NEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAAL 599
Query: 170 EFKMKEDKEKPV 181
+ KM + EKP+
Sbjct: 600 KMKMTDTSEKPL 611
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 31.6 bits (72), Expect = 0.74
Identities = 14/68 (20%), Positives = 32/68 (47%), Gaps = 3/68 (4%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE---KEFKMKEDKEKPVEKIRKEEKD 190
K K + + + K+ + KK K+K++ K+ E KE++ E + ++++ K+
Sbjct: 6 KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGK 65
Query: 191 SEKEKKSK 198
+ K
Sbjct: 66 FYVPAEHK 73
Score = 29.6 bits (67), Expect = 2.5
Identities = 16/68 (23%), Positives = 30/68 (44%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
K K +E + K+ K +K+ ++K + E V++ RK E++ + K+ K K
Sbjct: 6 KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGK 65
Query: 204 LKKEKLKK 211
K
Sbjct: 66 FYVPAEHK 73
>gnl|CDD|147685 pfam05663, DUF809, Protein of unknown function (DUF809). This
family consists of several proteins of unknown function
Raphanus sativus (Radish) and Brassica napus (Rape).
Length = 138
Score = 30.5 bits (68), Expect = 0.75
Identities = 21/48 (43%), Positives = 32/48 (66%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
+++ KK K +I+ KE+ KE K + +GK+E+KE K E E KEEK+
Sbjct: 83 DEVTKKVSPIIKGEIEGKEEKKEGKGEIEGKEEKKEGKGEIEGKEEKK 130
>gnl|CDD|216187 pfam00913, Trypan_glycop, Trypanosome variant surface glycoprotein
(A-type). The trypanosome parasite expresses these
proteins to evade the immune response. This family
includes a variety of surface proteins such as
Trypanosoma brucei VSGs such as expression site
associated gene (ESAG) 6 and 7.
Length = 370
Score = 31.7 bits (72), Expect = 0.75
Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 4/73 (5%)
Query: 94 KVTSSEDLGEIKK--DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN--KEKDKEKKEK 149
T+ + E+K+ + + K T + EKIKK D +KIK ++ D E+ K
Sbjct: 279 DTTTLKASTELKEALKRLLLNKKDTDDEDAAEKIKKLFGDDDAEKIKKLWEKIDNEQIPK 338
Query: 150 EKGKKEEKEKKEE 162
+ +K K E
Sbjct: 339 KIAGTTKKTKLGE 351
>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
production and conversion].
Length = 194
Score = 31.2 bits (71), Expect = 0.76
Identities = 23/110 (20%), Positives = 55/110 (50%), Gaps = 19/110 (17%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK---- 174
+K +KI ++ +++ ++I + +++ +K KE+ K+E +E EE +K EKE + +
Sbjct: 5 EKLIKKILREAEEE-AEEILEEAREEAEKIKEEAKREAEEAIEEILRKAEKEAERERQRI 63
Query: 175 -------------EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
E KE+ +E + + ++ + + D + + +E L +
Sbjct: 64 ISSALLEARRKLLEAKEEILESVFEAVEE-KLRNIASDPEYESLQELLIE 112
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.5 bits (72), Expect = 0.77
Identities = 25/101 (24%), Positives = 51/101 (50%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
K K+++ K K E+ E + + ++E + +++E E + ++ DKE EE
Sbjct: 77 KWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEE 136
Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
+ E KK+K+ + E+ +++ E+E + EKE +L
Sbjct: 137 EKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177
>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
Length = 312
Score = 31.6 bits (72), Expect = 0.78
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL--KKEKLKKKKKEK 216
KE + ++ + K K+K E +SEK +K K K L KK+KL+K +EK
Sbjct: 109 GKESRIKR-YDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNEEK 164
>gnl|CDD|215591 PLN03124, PLN03124, poly [ADP-ribose] polymerase; Provisional.
Length = 643
Score = 31.7 bits (72), Expect = 0.79
Identities = 26/153 (16%), Positives = 55/153 (35%), Gaps = 4/153 (2%)
Query: 62 MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 121
M++K K +K G+ + + +ED K + +KK
Sbjct: 1 MANKLKVDELRAALAKRGLDTTGLKAALVRRLDDAIAEDAKTASKSGT---KSSAGRKKR 57
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
+E+ D+ K ++ K +E + + +K++ ++ E V
Sbjct: 58 RERQDDGDDEPVSPKRIAIDEVKGMTVRELREAASERGLATTGRKKDLLERLCAALESDV 117
Query: 182 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
K+ E EK+ +++ K+EK+ K
Sbjct: 118 -KVGSANGTGEDEKEKGGDEEREKEEKIVTATK 149
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 30.9 bits (70), Expect = 0.79
Identities = 11/24 (45%), Positives = 11/24 (45%)
Query: 19 PRLPPMLQHPLIPQPLVHPPGTPP 42
P LP Q P PQPL P P
Sbjct: 90 PNLPQPAQQPFQPQPLQPPQPQQP 113
Score = 27.8 bits (62), Expect = 8.6
Identities = 12/22 (54%), Positives = 13/22 (59%)
Query: 22 PPMLQHPLIPQPLVHPPGTPPP 43
PP Q P+ PQP VHP PP
Sbjct: 107 PPQPQQPMQPQPPVHPIPPLPP 128
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 32.1 bits (73), Expect = 0.80
Identities = 16/56 (28%), Positives = 36/56 (64%)
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
D +K ++ KE+K++EK ++ + E+ +E+ ++EEK F + ++ +E +K K+
Sbjct: 274 DWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKE 329
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell
surface and is extremely large (although apparently
lacking in repeat structure) and is important for the
process of invasion of the RBCs by the parasite. These
proteins are found in P. falciparum, P. vivax and P.
yoelii.
Length = 2757
Score = 31.9 bits (72), Expect = 0.82
Identities = 37/153 (24%), Positives = 63/153 (41%), Gaps = 5/153 (3%)
Query: 71 KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD 130
K KEY K ++ + N +K + L ++ K + + + K E E+ + +
Sbjct: 815 KSKEYIKTISIKEDEIFKIINEMKFMKDDFLNKVDKFINFENNCKEKIDSEHEQFAELTN 874
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
K IK + D + + EK + K E K E+E++ +K E I+ E
Sbjct: 875 K-----IKAEISDDKLNDYEKKFNDSKSLINEINKSIEEEYQNINTLKKVDEYIKICENT 929
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
E +K +K LK+ K KE + EK
Sbjct: 930 KESIEKFHNKQNILKEILNKNIDTIKESNLIEK 962
Score = 31.9 bits (72), Expect = 0.93
Identities = 35/131 (26%), Positives = 65/131 (49%), Gaps = 7/131 (5%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK---KDKDKIKNKEKDKEKKEKEKGKK 154
++++ +IK D K H K E+IKKK + + K +I + E +K +
Sbjct: 1110 ADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPE 1169
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD----KKLKKEKLK 210
E ++K E K +K+ + ++ +K + +I + EKD ++ K + K L K L+
Sbjct: 1170 EIEKKIENIVTKIDKKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSYGKNLGKLFLE 1229
Query: 211 KKKKEKERSSH 221
K +EK++S H
Sbjct: 1230 KIDEEKKKSEH 1240
Score = 31.6 bits (71), Expect = 0.98
Identities = 33/110 (30%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 100 DLGEIK--KDKSIK-KDK--KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
++ E+K DKS KD+ K K EK K ++ KKD ++ NK K K K
Sbjct: 1484 NINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTK 1543
Query: 155 EEKEKKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDKK 203
++ E ++ K K+F ++ +K E+ +++I+KE+ E + DK K
Sbjct: 1544 KDSEIIIKEIKDAHKKFILEAEKSEQKIKEIKKEKFRIEDDAAKNDKSNK 1593
Score = 30.4 bits (68), Expect = 2.8
Identities = 35/126 (27%), Positives = 59/126 (46%), Gaps = 3/126 (2%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM-KEDKE 178
K + +K KKD D N + KE +K KG K+E +K + +K ++ F+ K+D
Sbjct: 1465 KSQHILKIKKDNATNDHDFNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVT 1524
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK--KEKERSSHEKEVIPKLTFKFGTD 236
+ + K ++ K KD + +K+ K KK E E+S + + I K F+ D
Sbjct: 1525 ELLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKFILEAEKSEQKIKEIKKEKFRIEDD 1584
Query: 237 MEEKTK 242
+ K
Sbjct: 1585 AAKNDK 1590
Score = 29.2 bits (65), Expect = 5.1
Identities = 32/115 (27%), Positives = 50/115 (43%), Gaps = 18/115 (15%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDK---------DKIKNKEKDKEKKEKEKG----- 152
DK+I D +K+ E I K DKK ++I EKDK E+ KG
Sbjct: 1160 DKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSY 1219
Query: 153 ----KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
K EK +E++KK E K E + +++I+++ + E E + K
Sbjct: 1220 GKNLGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKA 1274
>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
[Function unknown].
Length = 827
Score = 32.0 bits (73), Expect = 0.83
Identities = 31/131 (23%), Positives = 43/131 (32%), Gaps = 21/131 (16%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
+ K+ K KK EK + K K K K K KKE + E
Sbjct: 251 QELYKERDKILKKLEKA--LNELLNKLLKKSHLKTNKKSGKLTPSRKKEFEILPEY---- 304
Query: 167 EEKEFKMKEDKEKPVEKIRKEE-----KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
DK++P +I+K K + K KL E +K E +
Sbjct: 305 -------VPDKKRPKHRIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIENARKSLYENTPT 357
Query: 222 EKEVIPKLTFK 232
+ I TFK
Sbjct: 358 KSGFI---TFK 365
>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
motility and secretion].
Length = 214
Score = 31.0 bits (70), Expect = 0.83
Identities = 14/74 (18%), Positives = 33/74 (44%), Gaps = 3/74 (4%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK---EKKEEKEKKEEKEFKMKE 175
K++ E++ + K D + ++ + E+ ++ ++ E+ EKEK ++ K
Sbjct: 24 KEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEKVDSIEEEISEKEKVMSEKLKEPA 83
Query: 176 DKEKPVEKIRKEEK 189
E+ K+ K
Sbjct: 84 QMSSTSEEEEKKAK 97
Score = 29.4 bits (66), Expect = 3.2
Identities = 24/82 (29%), Positives = 39/82 (47%), Gaps = 4/82 (4%)
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS--KDKDKKLKKEKLK 210
KK E+K EK +K F ++E KEK E++ + K S+ E ++ ++ K+ L
Sbjct: 1 KKPYLEEKIEKAEKVTA-FALEELKEKI-EELPIQAKKSDDELVEELPERYEQTKENSLI 58
Query: 211 KKKKEKERSSHEKEVIPKLTFK 232
+K E EKE + K
Sbjct: 59 EKVDSIEEEISEKEKVMSEKLK 80
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 32.1 bits (73), Expect = 0.86
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 7/112 (6%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
DK IK+ K ++K + I + K K + D +K + K K +E K +EK
Sbjct: 1030 DKLIKEKGKEIEEKVDQYISLLEKMKTKLSSFHFNIDIKKYKNPKIK---EEIKLLEEKV 1086
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
E K+ E+K K +E K S + + DK+K + E KKKK E+
Sbjct: 1087 EALLKKIDENKNKLIEIKNK----SHEHVVNADKEKNKQTEHYNKKKKSLEK 1134
Score = 31.0 bits (70), Expect = 1.6
Identities = 41/167 (24%), Positives = 67/167 (40%), Gaps = 15/167 (8%)
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE--------KDKEKKE 148
S L E+ K D+ + EKE+ ++ KD ++ + E K KE
Sbjct: 1963 VSNKLSELNKITC--NDESYDEILEKEEYEELKDLRNSFNQEKAETLNNLKLNKIKEDFN 2020
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
K +E EK + K E K+ E+K+ ++ I +D EKE +S + E
Sbjct: 2021 SYKNLLDELEKSVKTLKASENIKKIVENKKTSIDAINTNIEDIEKEIESIN---PSLDEL 2077
Query: 209 LKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 255
LKK K + S +I + K D + E +I + +K
Sbjct: 2078 LKKGHKIE--ISRYTSIIDNVQTKISNDSKNINDIEKKAQIYLAYIK 2122
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 31.8 bits (73), Expect = 0.86
Identities = 21/79 (26%), Positives = 47/79 (59%), Gaps = 4/79 (5%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
++K+KK K+++KN + KK +E ++++ E++E+ E+ +F K K E+ +
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDF----VKSKGKEEEEE 545
Query: 187 EEKDSEKEKKSKDKDKKLK 205
EEK+ + ++ +D+ + K
Sbjct: 546 EEKEEKDKETEEDEPEGPK 564
Score = 30.2 bits (69), Expect = 2.7
Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 15/87 (17%)
Query: 169 KEFKMKED----KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
KE KMKE+ K +K+++ +++ + E++ KD+ L +K K KE+E ++E
Sbjct: 492 KEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDF-VKSKGKEEEEEEEKEE 550
Query: 225 VIPKLTFKFGTDMEEKTKRESSPKIVI 251
E + PKI++
Sbjct: 551 KDK----------ETEEDEPEGPKIIV 567
Score = 28.7 bits (65), Expect = 8.1
Identities = 21/78 (26%), Positives = 43/78 (55%), Gaps = 3/78 (3%)
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
KEK KEE + + K+ +E ++++K E+ + E+ + KSK K+++ ++EK
Sbjct: 492 KEKKMKEELKNLKGILNKKLQE---LDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEK 548
Query: 209 LKKKKKEKERSSHEKEVI 226
+K K+ +E ++I
Sbjct: 549 EEKDKETEEDEPEGPKII 566
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 30.5 bits (69), Expect = 0.88
Identities = 21/67 (31%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E+ + +K+ S + ++K K +KEK K+KK+KK +K+ K+ K K K ++
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEK-KEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151
Query: 159 KKEEKEK 165
K+E+ K
Sbjct: 152 TKKEEGK 158
Score = 30.1 bits (68), Expect = 1.3
Identities = 17/72 (23%), Positives = 39/72 (54%), Gaps = 5/72 (6%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
++ EK++KE K+ E+++ + +K+ K ++ ++K EK+ K++ + +K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKE-----KKEKKEKKVAEKLAKKKSTKTTKNTTKKA 146
Query: 201 DKKLKKEKLKKK 212
KK +K + K
Sbjct: 147 TKKTTTKKEEGK 158
Score = 30.1 bits (68), Expect = 1.4
Identities = 16/63 (25%), Positives = 36/63 (57%)
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
++E+++ ++ +++E KE KEK +K+ ++ + K +K+ KK K+ KK+
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155
Query: 214 KEK 216
+ K
Sbjct: 156 EGK 158
Score = 30.1 bits (68), Expect = 1.5
Identities = 15/63 (23%), Positives = 33/63 (52%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
+ ++ + + +EK+ K +KEK +K+EK+ E+ KK+ + K+ + K+E
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155
Query: 189 KDS 191
+
Sbjct: 156 EGK 158
Score = 29.7 bits (67), Expect = 1.7
Identities = 18/66 (27%), Positives = 38/66 (57%)
Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
++ + + KE KE +EKE K ++++K+++++K EK K K K + +K +
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152
Query: 193 KEKKSK 198
K+++ K
Sbjct: 153 KKEEGK 158
Score = 29.0 bits (65), Expect = 3.1
Identities = 18/59 (30%), Positives = 31/59 (52%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
K+ K ++KE K KK+K +K + K+ K K+ + K ++ KK +K+E K
Sbjct: 100 KEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEGK 158
Score = 29.0 bits (65), Expect = 3.2
Identities = 21/62 (33%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
E+ +KE+KE +E E K K K +K +KE+K +EK K K K KK
Sbjct: 92 PEESEKEQKEVSKETEEKEAI-KAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKT 150
Query: 215 EK 216
Sbjct: 151 TT 152
Score = 28.2 bits (63), Expect = 6.1
Identities = 18/61 (29%), Positives = 31/61 (50%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
++KE K ++K+ K K ++K+K+ EK KK K K +K +K KE+
Sbjct: 98 EQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKEEG 157
Query: 178 E 178
+
Sbjct: 158 K 158
Score = 28.2 bits (63), Expect = 6.4
Identities = 22/63 (34%), Positives = 36/63 (57%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
EKE+ + K+ ++K+ IK K++ KEKKEK+ +K K+K + K K+ K +K
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155
Query: 181 VEK 183
K
Sbjct: 156 EGK 158
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 31.6 bits (71), Expect = 0.90
Identities = 48/175 (27%), Positives = 69/175 (39%), Gaps = 14/175 (8%)
Query: 93 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKK----KDKKDKDKIKNKEKDKEKKE 148
+++ + EIKK IKK KK E+E K + + DKE +E
Sbjct: 474 TRISKIQFTQEIKK--LIKKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEE 531
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE----KEKKSKDKDKKL 204
E + KE E KE + E K E +KP KE K S+ +K KD K
Sbjct: 532 GEHE--DSKESDEPKEGGKPGETKEGEVGKKPGPA--KEHKPSKIPTLSKKPEFPKDPKH 587
Query: 205 KKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP 259
K+ + KK ++ RS+ D+ + KR SPK +P P
Sbjct: 588 PKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRP 642
Score = 31.2 bits (70), Expect = 1.2
Identities = 54/237 (22%), Positives = 81/237 (34%), Gaps = 23/237 (9%)
Query: 37 PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKE-KEYSKVKDVELGVTPMVPNIVKV 95
PP P S + K + E SKE PKE + + K+ E+G P
Sbjct: 510 PPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKP-------- 561
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
G K+ K K + K + + K KD ++ K K + + K
Sbjct: 562 ------GPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLP 615
Query: 156 EKEK--KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
E K K + K K ++P R E K K K K K+K
Sbjct: 616 ELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKF 675
Query: 214 KEK--ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP-PPVAAADE 267
+ + ++ KE T + E +E+ P+ P +P P PP DE
Sbjct: 676 YDDYLDAAAKSKETK---TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDE 729
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 31.7 bits (73), Expect = 0.90
Identities = 24/67 (35%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE-KDSEKEKKSK 198
+E K++ E +E E+KE+K +K E +D +K VE +R E ++EK +KS
Sbjct: 514 EEIAYIKEQMEGSAPKEPEEKEKKPEKPEITI---DDFDK-VE-LRVAEVLEAEKVEKS- 567
Query: 199 DKDKKLK 205
DK LK
Sbjct: 568 --DKLLK 572
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 31.5 bits (72), Expect = 0.91
Identities = 18/119 (15%), Positives = 49/119 (41%), Gaps = 2/119 (1%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKEK-EKGKKEE 156
+ IK K + ++ + + D +++ +++ + K + +K+
Sbjct: 171 NLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQL 230
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
+E ++E ++ + K + +E+I + EK E+ + K+ K K+ +K
Sbjct: 231 EELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRGWSAKEISKLKAKVSLLQKL 289
>gnl|CDD|135898 PRK06397, PRK06397, V-type ATP synthase subunit H; Validated.
Length = 111
Score = 29.9 bits (67), Expect = 0.92
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
K+K++ +KE K E+E KE K K EEK KK E+E + + RKE +
Sbjct: 16 KEKEESIDKEIANIKNEQENEIKEAKSKYEEKAKKTEEE--SLNMYNAALMEARKEAEKK 73
Query: 192 EKEKKSKDKDK 202
E +K K +
Sbjct: 74 AVEIINKAKQE 84
Score = 28.4 bits (63), Expect = 3.3
Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 1/70 (1%)
Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
I K+K+ KE IK +++ + K+ K+K ++K KK +E+ E K+ EK
Sbjct: 14 IIKEKEESIDKEIANIKNEQENEIKE-AKSKYEEKAKKTEEESLNMYNAALMEARKEAEK 72
Query: 170 EFKMKEDKEK 179
+ +K K
Sbjct: 73 KAVEIINKAK 82
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 31.2 bits (71), Expect = 0.92
Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 209
E +K+EKEKKE+ + ++++ K E+ ++K KE + E + D+ KK+K
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYV-EYDRAGKKKKA 217
Query: 210 KKKKKE 215
K K +E
Sbjct: 218 KSKYEE 223
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 30.7 bits (70), Expect = 0.96
Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 8/60 (13%)
Query: 153 KKEEKEKKEEKE-KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
+K EK +K EKE +K +E K + +K K +++ + K++ + +K
Sbjct: 40 RKMEKYQKREKEIQKRAREL-RKNGDKLSPKKFEKRQEE------LMEDQKEMMMDMMKP 92
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 31.3 bits (71), Expect = 0.97
Identities = 41/182 (22%), Positives = 71/182 (39%), Gaps = 11/182 (6%)
Query: 42 PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDL 101
S D + ++ + SS PKEK+ S L + + ED+
Sbjct: 235 SESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDI 294
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE--- 158
+ K K K ++ ++ I +KK +K KE++KE+KE+E + E +
Sbjct: 295 DPDQVVKKPVKRKNRRGQRARQAIWEKKYGSGAKHVK-KEREKEQKEREGRQSEWEARQA 353
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD-------KKLKKEKLKK 211
K+E + K + + + + K + + +K S DK KK KEK
Sbjct: 354 KREGGDAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEKKAN 413
Query: 212 KK 213
K
Sbjct: 414 AK 415
Score = 31.3 bits (71), Expect = 1.1
Identities = 30/134 (22%), Positives = 55/134 (41%), Gaps = 19/134 (14%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK-----EEKEKKEEKEKKEEK 169
+T KK K K K KK KK + K + + D+E+ E E K E+ EE+E + +
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202
Query: 170 EFKMKE-------DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK-------EKLKKKKKE 215
++ + D+E+ E + E +S + D ++ + E+ K+
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESSPPSKK 262
Query: 216 KERSSHEKEVIPKL 229
+ +P L
Sbjct: 263 PKEKKTSSTFLPSL 276
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 31.2 bits (71), Expect = 0.97
Identities = 24/130 (18%), Positives = 58/130 (44%), Gaps = 15/130 (11%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
+ + + ++ + + ++ K K + K +D DK N + E E + + E K K+ E
Sbjct: 52 DELRKRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIE 111
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD--------KKLKKEKLKKKKK 214
++ + D + + D + + + + +K+KKE+ ++K++
Sbjct: 112 EDARNS-------DADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKER 164
Query: 215 EKERSSHEKE 224
E+E + E+E
Sbjct: 165 EEEEKAAEEE 174
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 29.8 bits (67), Expect = 0.98
Identities = 13/24 (54%), Positives = 20/24 (83%)
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEE 168
+EK + K+EEK+K+EEKE++EE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEE 96
Score = 29.4 bits (66), Expect = 1.2
Identities = 11/26 (42%), Positives = 21/26 (80%)
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKE 170
E++ +KEE++KKEE++++EE+E
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 27.8 bits (62), Expect = 4.6
Identities = 13/23 (56%), Positives = 19/23 (82%)
Query: 140 KEKDKEKKEKEKGKKEEKEKKEE 162
E+ E+KE+EK K+EEKE++EE
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEE 96
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 31.0 bits (70), Expect = 1.0
Identities = 31/139 (22%), Positives = 53/139 (38%), Gaps = 9/139 (6%)
Query: 37 PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 96
PP P K + V T E +S + E EL +VT
Sbjct: 157 PPVPDMPKKQELPVEETLST-IHEDDASTLHEDDELDEEVTSYLNELDD--------EVT 207
Query: 97 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
S + GE +++ + + K+ E + K+K ++ + K E E + E
Sbjct: 208 SYFNDGENEENDDELEAEVISYLKDGENDNEVKEKIRREYREWKGDKANTNETEIEDESE 267
Query: 157 KEKKEEKEKKEEKEFKMKE 175
E +EE +++E E K +E
Sbjct: 268 DEYEEEAGEEQENEDKGEE 286
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 30.7 bits (69), Expect = 1.0
Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 5/51 (9%)
Query: 8 PFPSVAGPGLIPRLPPMLQHPLIPQPLVHP----PGTP-PPSKMKDKVSPK 53
P V G P L P QHPL P +P P PP + + P+
Sbjct: 89 HHPMVPMTGQQPHLQPPAQHPLQPTYGQNPQPQQPTHTQPPVQPQQPADPQ 139
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 31.2 bits (71), Expect = 1.0
Identities = 14/101 (13%), Positives = 38/101 (37%), Gaps = 2/101 (1%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
L + K+DK ++K+ + + E + + + + + + +++ +K KE
Sbjct: 150 LEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASA 209
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
E+ +E+K E ++ + E+
Sbjct: 210 LGEKAALEEQKALAEAAAAEAAKQEAAAKAAAQEQAALQAA 250
Score = 28.9 bits (65), Expect = 5.1
Identities = 17/96 (17%), Positives = 43/96 (44%), Gaps = 5/96 (5%)
Query: 133 DKDKIKNKEKDKEK-KEKEKGKKEEKEK----KEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
DK ++ +++DK+ +EK+ +++ E + E E + K +K + + +
Sbjct: 146 DKKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAK 205
Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
E + EK + ++ K L + + K++ +
Sbjct: 206 EASALGEKAALEEQKALAEAAAAEAAKQEAAAKAAA 241
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 30.8 bits (70), Expect = 1.0
Identities = 12/38 (31%), Positives = 19/38 (50%)
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
E K+ K+++EF +E KE +K + E K K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKK 195
Score = 29.7 bits (67), Expect = 2.1
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 1/39 (2%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
+KD K K++ E+ KE K +K E K+KK
Sbjct: 159 RKDAKQKEEFA-AEERKEALAAAAKKSATPQKVETKKKK 196
Score = 29.7 bits (67), Expect = 2.1
Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 1/39 (2%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
E+ K+++E +E KE KK K E K+K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQK-VETKKK 195
Score = 29.7 bits (67), Expect = 2.4
Identities = 10/34 (29%), Positives = 16/34 (47%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
K+K++ ++ K KK K E K+KK
Sbjct: 163 KQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.9 bits (65), Expect = 3.8
Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 2/39 (5%)
Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
K+ K+K E+ EE+ +K K E KKK
Sbjct: 160 KDAKQK--EEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 215
Score = 30.7 bits (69), Expect = 1.1
Identities = 14/70 (20%), Positives = 36/70 (51%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
+++ +K T KK+ K + KK + D + ++ K E+ + K+ E++++
Sbjct: 46 QEQLLKAGLVTSKKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAAL 105
Query: 166 KEEKEFKMKE 175
+E + ++K+
Sbjct: 106 AKEYKAQVKQ 115
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 30.7 bits (70), Expect = 1.1
Identities = 14/73 (19%), Positives = 33/73 (45%), Gaps = 3/73 (4%)
Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
K K + ++ +++ KK E ++EK + E++ K K ++ + +K + D
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPK---PKAPRKKRKPRAQKPEPQHTPVSD 163
Query: 200 KDKKLKKEKLKKK 212
+ + +K K
Sbjct: 164 ISELTVGQAVKVK 176
Score = 29.9 bits (68), Expect = 2.0
Identities = 8/49 (16%), Positives = 26/49 (53%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
K K++ ++ ++ K + + ++ +E+ K + +K+ K + ++ E
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPE 155
Score = 29.5 bits (67), Expect = 2.5
Identities = 9/60 (15%), Positives = 30/60 (50%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
K K + ++ + + K + +++ ++E++ K ++K++ + +K E + D +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
Score = 29.1 bits (66), Expect = 3.8
Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 1/70 (1%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
++ K K + + E+ +K+E K++ ++ + + K ++ K K +KP +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRK-KRKPRAQKPEPQHTPVS 162
Query: 189 KDSEKEKKSK 198
SE
Sbjct: 163 DISELTVGQA 172
Score = 28.7 bits (65), Expect = 5.3
Identities = 11/64 (17%), Positives = 28/64 (43%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
++ ++ KK+E K+K ++++ + K K +K + + + E +
Sbjct: 113 QRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQA 172
Query: 171 FKMK 174
K+K
Sbjct: 173 VKVK 176
Score = 28.7 bits (65), Expect = 5.6
Identities = 10/71 (14%), Positives = 27/71 (38%), Gaps = 5/71 (7%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+E K K + ++ E++ KK E + ++ + + K + K + K + +
Sbjct: 104 EEAKAKVQAQR-----AEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQH 158
Query: 204 LKKEKLKKKKK 214
+ +
Sbjct: 159 TPVSDISELTV 169
Score = 28.0 bits (63), Expect = 8.3
Identities = 15/70 (21%), Positives = 34/70 (48%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
K K + + E++ K++ EKEK +E+K + + K+ K + + + S+ +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
Query: 196 KSKDKDKKLK 205
+ + K+K
Sbjct: 167 LTVGQAVKVK 176
>gnl|CDD|112704 pfam03904, DUF334, Domain of unknown function (DUF334).
Staphylococcus aureus plasmid proteins with no
characterized function.
Length = 229
Score = 30.8 bits (69), Expect = 1.1
Identities = 26/112 (23%), Positives = 53/112 (47%), Gaps = 6/112 (5%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
I D+ KK+++ + K+ +K + ++N+E + KK+ + K E KE +
Sbjct: 6 IMSDEDEKKNEENMNMQMNSKLTGTHSQKIQKSLENEELQELKKQNKLIIKYIAEIKENQ 65
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
+ +E++ +K + ++ E + +K KK + L +EKLK E
Sbjct: 66 DIREKELKAIKSELKEATEDFQ------DKGKKIHNDFVDLLQEKLKHVDAE 111
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 30.7 bits (70), Expect = 1.1
Identities = 15/75 (20%), Positives = 30/75 (40%), Gaps = 6/75 (8%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
K KE+E+ E ++ + E+K +++ +EE D E+ + +
Sbjct: 15 PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELD------EEEDDRFLEEYRRKR 68
Query: 201 DKKLKKEKLKKKKKE 215
++K K K E
Sbjct: 69 LAEMKALAEKSKFGE 83
Score = 28.4 bits (64), Expect = 6.4
Identities = 22/87 (25%), Positives = 36/87 (41%), Gaps = 14/87 (16%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF----------KMKED 176
K K++++ + +E E KK E EE +++E+ F +MK
Sbjct: 16 KPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEMKAL 75
Query: 177 KEK----PVEKIRKEEKDSEKEKKSKD 199
EK V +I K + E + SKD
Sbjct: 76 AEKSKFGEVYEISKPDYVREVTEASKD 102
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 31.4 bits (71), Expect = 1.1
Identities = 29/123 (23%), Positives = 58/123 (47%), Gaps = 10/123 (8%)
Query: 93 VKVTSSEDLGEIKKDKSIKK----DKKTHKKKEK-EKIKKKKDKKDKDKIKNKE-----K 142
++V + + E + +K KKT+ EK E K+ +KK++ KIKN+
Sbjct: 86 IEVYNKQFKHEQLWNSQVKGILTLGKKTNDDLEKIESKKESINKKNEKKIKNEASLQVKT 145
Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+E+KE++ + +K ++ KE+ ++K KI KE ++++ +
Sbjct: 146 QREEKEEKDFIADCWRNLYKKNEERFKEYLENFKRKKFKRKILKEFENAKINASEIVGLE 205
Query: 203 KLK 205
K+K
Sbjct: 206 KVK 208
Score = 31.1 bits (70), Expect = 1.5
Identities = 25/140 (17%), Positives = 53/140 (37%), Gaps = 10/140 (7%)
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
K+ ++ EK E K+ + + EK ++ + +++E+K + L KK +E+
Sbjct: 111 KKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170
Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTKRESS----------PKIVIKPVKSPSPPPVAAAD 266
+ E K K + E S +IV + ++P
Sbjct: 171 FKEYLENFKRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRHNQTPLALLECNLT 230
Query: 267 EYDTGDSKQVWICPACGVQD 286
++D+ ++ +W G D
Sbjct: 231 DFDSIENHSIWEQKIVGSGD 250
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 31.4 bits (71), Expect = 1.2
Identities = 24/112 (21%), Positives = 55/112 (49%), Gaps = 7/112 (6%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK-----ED 176
+++ + + K K+ E + +KE E + + + +++ E E ++ E
Sbjct: 55 MTLLQRAELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEV 114
Query: 177 KEKPVEKIRK--EEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
+ K +E++ K E + +E E+++K KL E LK + +++++ KE I
Sbjct: 115 RLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESI 166
Score = 28.4 bits (63), Expect = 9.1
Identities = 31/122 (25%), Positives = 57/122 (46%), Gaps = 9/122 (7%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD---KIKNKEKDKE--KKEKEKGKKEEK 157
E + + + +KK ++ K K ++K E+D E K KE+ + +
Sbjct: 202 EECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPE 261
Query: 158 EKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDSEKEKKSKDK--DKKLKKEKLKKKK 213
++E +EE MKED E E++ + E+ +K ++K D +L+KEKL+ +
Sbjct: 262 LERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENEL 321
Query: 214 KE 215
K
Sbjct: 322 KS 323
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to 196
amino acids in length.
Length = 149
Score = 30.0 bits (68), Expect = 1.2
Identities = 13/46 (28%), Positives = 19/46 (41%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
+ E ++ +KK K + K KDK+KK K K E
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPKSKRAASE 46
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
to archaeal Holliday junction resolvase [Nucleotide
transport and metabolism].
Length = 175
Score = 30.5 bits (69), Expect = 1.2
Identities = 18/53 (33%), Positives = 35/53 (66%), Gaps = 3/53 (5%)
Query: 137 IKNKEKDKEKKEKEKGKKEEKEK--KEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
++ K + K ++ +E +K E+E+ E + +KEE E+K+KE EK +E+ R++
Sbjct: 27 LQGKVESKARELEETLQKAERERLVNEAQARKEE-EWKLKEWIEKKIEEARED 78
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 30.8 bits (70), Expect = 1.2
Identities = 11/43 (25%), Positives = 24/43 (55%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
K+K+E+E+ + +E E ++ + K KE+++ E+ R
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEETRH 163
Score = 29.7 bits (67), Expect = 2.8
Identities = 8/40 (20%), Positives = 24/40 (60%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
K K+ +++ + + + + ++ ++K K K+++K E E+
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160
Score = 29.3 bits (66), Expect = 3.3
Identities = 9/40 (22%), Positives = 26/40 (65%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
K K+++++ + ++ ++ E+ + ++K K+E+K + EE
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEE 160
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 30.1 bits (68), Expect = 1.2
Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 5/70 (7%)
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E+E KEE E E+E ++ + + D ++ ++ K +K K++K
Sbjct: 68 ERELKEEAEA--EEEGEVDASPD-AGAVAGESSADRKEAEQQGAAQK--RKSCRDKERKS 122
Query: 216 KERSSHEKEV 225
+ +
Sbjct: 123 AKDPRGGTQD 132
Score = 27.8 bits (62), Expect = 9.1
Identities = 13/82 (15%), Positives = 35/82 (42%), Gaps = 3/82 (3%)
Query: 121 EKEKIKKKKD-KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
E + ++++ K++ + + E D + ++KE +++ ++ K DKE+
Sbjct: 61 EIHALLRERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKER 120
Query: 180 --PVEKIRKEEKDSEKEKKSKD 199
+ + +K + S D
Sbjct: 121 KSAKDPRGGTQDVVDKSQASLD 142
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 29.6 bits (67), Expect = 1.2
Identities = 28/109 (25%), Positives = 51/109 (46%), Gaps = 22/109 (20%)
Query: 126 KKKKDKKDKDKI--KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE-------- 175
K K KD I ++K+ K KK+ + K E+K K++ + +K + K +
Sbjct: 12 SKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDL 71
Query: 176 DKEKPVEKI------------RKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+KE+ + K+ R +K +EK K K ++K+E+L K+
Sbjct: 72 EKERRLRKVAQKGVVKLFNAVRAAQKKTEKAVKEAGKKARVKREELLKE 120
Score = 28.1 bits (63), Expect = 4.4
Identities = 19/81 (23%), Positives = 37/81 (45%), Gaps = 6/81 (7%)
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
K+ K K R ++ K+K +K +K K +L+ +K++ K V+P+
Sbjct: 13 KLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPED-- 70
Query: 232 KFGTDMEEKTKRESSPKIVIK 252
+E+ R+ + K V+K
Sbjct: 71 ----LEKERRLRKVAQKGVVK 87
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 31.2 bits (70), Expect = 1.2
Identities = 32/134 (23%), Positives = 64/134 (47%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+D KT +KEK K + DK ++NKE +E + K+ +E +K+EE+
Sbjct: 474 EDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERML 533
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
K E+ E+ +R E + +E K + K K +K ++ + E +KE K+
Sbjct: 534 KQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILE 593
Query: 232 KFGTDMEEKTKRES 245
+++++ + ++
Sbjct: 594 NKCNNLKKQIENKN 607
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 31.2 bits (71), Expect = 1.3
Identities = 11/55 (20%), Positives = 22/55 (40%)
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
E ++E+ ++ KE + + KE ++ K+ E SK+ K
Sbjct: 521 EMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNILKLR 575
Score = 30.9 bits (70), Expect = 1.6
Identities = 12/76 (15%), Positives = 29/76 (38%), Gaps = 6/76 (7%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK----KEEKEFKM 173
++ K++ KK++DKI KE ++ ++ + + K ++ M
Sbjct: 528 GTNFGKRNSKERYKKNEDKI--KEFASALGKEILKEEPTENSSKNILKLRLYYQQNGKCM 585
Query: 174 KEDKEKPVEKIRKEEK 189
KE ++ +
Sbjct: 586 YTGKEIDIDDLFDLSY 601
Score = 30.1 bits (68), Expect = 2.6
Identities = 15/76 (19%), Positives = 30/76 (39%), Gaps = 1/76 (1%)
Query: 73 KEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKK 132
+ KV + + IV + E+ G ++ K+ K ++ K KE +
Sbjct: 499 LQARKVVNELVKKYGPPDKIVIEMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEI 558
Query: 133 DKDKI-KNKEKDKEKK 147
K++ +N K+ K
Sbjct: 559 LKEEPTENSSKNILKL 574
Score = 28.9 bits (65), Expect = 6.3
Identities = 22/108 (20%), Positives = 42/108 (38%), Gaps = 1/108 (0%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
D KK K T+KK K + + + + EK K K +K +K
Sbjct: 321 DLLFKKKKLTYKKLRKLLGLSEDAIFKGLRYEGLDNA-EKAFNISLKTYHKLRKALGDKD 379
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
K +D ++ V+ + + +K+ + L +E++KK +
Sbjct: 380 LLDNPKNPKDLDEIVKILTLYKDREMIKKRLELYKDVLNEEQVKKLVR 427
Score = 28.5 bits (64), Expect = 7.7
Identities = 11/53 (20%), Positives = 22/53 (41%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
E +E++ GK+ KE+ ++ E K ++ + E K+ K
Sbjct: 521 EMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNILK 573
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 31.2 bits (71), Expect = 1.3
Identities = 13/86 (15%), Positives = 38/86 (44%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
L + ++++I K + K+ ++ ++ ++K+ + + + ++E K+ K+ K
Sbjct: 439 LKKFLEEEAILKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCENQEEVKKEVPKKVK 498
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRK 186
+ E + E+ EK+
Sbjct: 499 KIPEVDTYLLLEELGINEETYEKLEA 524
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 30.7 bits (70), Expect = 1.3
Identities = 15/50 (30%), Positives = 18/50 (36%)
Query: 22 PPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPK 71
P + P P P PP K +K PK K K + K PK
Sbjct: 87 PATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPK 136
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 29.9 bits (68), Expect = 1.3
Identities = 18/62 (29%), Positives = 40/62 (64%), Gaps = 1/62 (1%)
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
K +KE +KE+K+ + + K++ + + E+ E+ +K K + +K +++K K+++KEK
Sbjct: 70 KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEK-RTRKNREKKFKRRQKEK 128
Query: 217 ER 218
E+
Sbjct: 129 EK 130
Score = 28.8 bits (65), Expect = 3.2
Identities = 15/52 (28%), Positives = 33/52 (63%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
K+K K+ + K++ ++++++ K+ +K K K++EKK K + K++EK
Sbjct: 79 KEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKEK 130
Score = 28.4 bits (64), Expect = 3.6
Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 3/61 (4%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
+ +EKEK KK+K K +I + + + KE+ ++EK ++ +EKK ++ K K
Sbjct: 72 EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERA---EKEKRTRKNREKKFKRRQKEK 128
Query: 175 E 175
E
Sbjct: 129 E 129
Score = 27.2 bits (61), Expect = 8.2
Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 1/61 (1%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTH-KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
KV + KK K + KK + + + K++ +K+K KN+EK ++++KEK
Sbjct: 70 KVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129
Query: 153 K 153
K
Sbjct: 130 K 130
>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
[Posttranslational modification, protein turnover,
chaperones].
Length = 205
Score = 30.5 bits (69), Expect = 1.3
Identities = 15/80 (18%), Positives = 30/80 (37%), Gaps = 2/80 (2%)
Query: 105 KKDKSIKKDKKTHKKKEKEK-IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
+ +S+ + K K ++ K+ + E++ ++ K ++ K
Sbjct: 20 QLGQSLAELKLRLDKDALLAGLEDALALVQKEAALEALQRAEQRLQQA-AKAKRAKDAAA 78
Query: 164 EKKEEKEFKMKEDKEKPVEK 183
E K F K KEK V+
Sbjct: 79 NAAEGKAFLEKNAKEKGVKT 98
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 30.6 bits (70), Expect = 1.3
Identities = 29/87 (33%), Positives = 49/87 (56%), Gaps = 10/87 (11%)
Query: 139 NKEKDKEKKEKEKGKKEE--KEKKE-EKEKKE-EKEFKMKE----DKEKPVEKIRKEEKD 190
KE + KKE KEE K + E E+E KE E + +E KE+ ++ RK+E
Sbjct: 44 KKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLD--RKDESL 101
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+KE+ ++K+K+L + + ++KE+E
Sbjct: 102 EKKEESLEEKEKELAARQQQLEEKEEE 128
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 30.6 bits (69), Expect = 1.3
Identities = 37/144 (25%), Positives = 63/144 (43%), Gaps = 16/144 (11%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
+I+ K D+ +K KD+ K E+E + EEK KE ++ EE + E+K + VE+
Sbjct: 5 EIRNKTDELEKL------KDEIKAEEEALESEEKHLKEYDKEMEE----LLEEKMQHVEE 54
Query: 184 IRKEE---KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF---KFGTDM 237
+R+ D E E K + + + K+ + E + KL +
Sbjct: 55 LRQIHADINDMETEIKQSKSELERRMGKIARMHGEYNPLKESINEMRKLELGLEELPQLH 114
Query: 238 EEKTKRESSPKIVIKPVKSPSPPP 261
EE+ ++ S + PV P PP
Sbjct: 115 EEERQKISPATSPVPPVPLPDPPA 138
>gnl|CDD|223494 COG0417, PolB, DNA polymerase elongation subunit (family B) [DNA
replication, recombination, and repair].
Length = 792
Score = 31.2 bits (71), Expect = 1.3
Identities = 20/128 (15%), Positives = 48/128 (37%), Gaps = 8/128 (6%)
Query: 89 VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
P +VK E L + + +++ + + EK++ + +K I + + E
Sbjct: 638 WPELVKEFQREVLEVLLSGRDVEEALAKYVRDVIEKLRNGEVPIEKLVISKRLT-RPLSE 696
Query: 149 KEKGK------KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 202
+ K +++ + + + + + K K VE+ E E+ D +
Sbjct: 697 YKANKPHVVVAARLRKRGINVKPGDRIPYVIVKGKGKLVERAEPPELVDEENSP-IDYEY 755
Query: 203 KLKKEKLK 210
+ K+ L
Sbjct: 756 YITKQLLP 763
>gnl|CDD|191973 pfam08232, Striatin, Striatin family. Striatin is an intracellular
protein which has a caveolin-binding motif, a
coiled-coil structure, a calmodulin-binding site, and a
WD (pfam00400) repeat domain. It acts as a scaffold
protein and is involved in signalling pathways.
Length = 133
Score = 29.6 bits (67), Expect = 1.4
Identities = 21/77 (27%), Positives = 43/77 (55%), Gaps = 7/77 (9%)
Query: 102 GEIKKDKSIKKDKKTH-------KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
GE + +++K+D K K+E+ K+KK K D + +KE+++E+ E++ ++
Sbjct: 39 GERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQKPSKEEEEEELEEDDDEE 98
Query: 155 EEKEKKEEKEKKEEKEF 171
EK +E E K+ + +
Sbjct: 99 SNSEKVDESELKKSRLY 115
>gnl|CDD|153332 cd07648, F-BAR_FCHO, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of FCH domain Only proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization. Proteins in
this group have been named FCH domain Only (FCHO)
proteins. Vertebrates have two members, FCHO1 and FCHO2.
These proteins contain an F-BAR domain and a C-terminal
domain of unknown function named SAFF which is also
present in endophilin interacting protein 1. F-BAR
domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 261
Score = 30.8 bits (70), Expect = 1.4
Identities = 14/50 (28%), Positives = 24/50 (48%)
Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV 181
K K+ + + E+ +E +E EK E K KK + E+K +K +
Sbjct: 127 KAKEAYHARCLELERLRRENASPKEIEKAEAKLKKAQDEYKALVEKYNNI 176
>gnl|CDD|237147 PRK12586, PRK12586, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 145
Score = 29.7 bits (67), Expect = 1.4
Identities = 5/37 (13%), Positives = 15/37 (40%), Gaps = 3/37 (8%)
Query: 130 DKKDKDKIKN---KEKDKEKKEKEKGKKEEKEKKEEK 163
K D + ++ E + + +++E+ +K
Sbjct: 104 RKNDAHTHASILLSSNEQNSTEALQLRAKKREEHRKK 140
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 30.8 bits (70), Expect = 1.4
Identities = 11/61 (18%), Positives = 26/61 (42%)
Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
+ D I +E+ +++ EK K K + K + E+ +++ K+K + +
Sbjct: 335 GEPDGISLREELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQ 394
Query: 192 E 192
Sbjct: 395 N 395
Score = 30.8 bits (70), Expect = 1.5
Identities = 15/52 (28%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 223
++E+ EK +EK++++ K ++DKK + + ++KKKEK+ S +
Sbjct: 341 SLREELEKRIEKLKEKPPKPPT-KAKPERDKKERPGRYRRKKKEKKAKSERR 391
Score = 30.4 bits (69), Expect = 1.8
Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
+K+ K KEK K K ++DK + + +K+KEK K E+ +
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGR-YRRKKKEKKAKSERRGLQN 395
Score = 29.3 bits (66), Expect = 4.2
Identities = 14/46 (30%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
EK+K+K K ++K KE+ + + KK+EK+ K E+ +
Sbjct: 351 EKLKEKPPKPPTKAKPERDK-KERPGRYRRKKKEKKAKSERRGLQN 395
Score = 28.5 bits (64), Expect = 7.0
Identities = 13/53 (24%), Positives = 26/53 (49%), Gaps = 1/53 (1%)
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKDKDKKLKKEKLKKKKK 214
+E+ E++ K+KE KP K + E E+ + + K +K K + + +
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function prediction
only].
Length = 299
Score = 30.8 bits (69), Expect = 1.4
Identities = 25/137 (18%), Positives = 60/137 (43%), Gaps = 1/137 (0%)
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKDKK 203
+K K++ + +K ++ +++ E+K F +K V+ I K+ + +++ + K+K
Sbjct: 4 KKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEKM 63
Query: 204 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVA 263
+EK ++ +K+ R+ + + + F T + + + K + + P
Sbjct: 64 RMEEKRREPEKQVIRAGVDPKTVVCALFLNKTCAKGDACKFAHGKEEARKTEKPDLYSDV 123
Query: 264 AADEYDTGDSKQVWICP 280
E D K+ WI
Sbjct: 124 RDKEEDVPLGKRPWINT 140
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 31.1 bits (71), Expect = 1.5
Identities = 21/67 (31%), Positives = 36/67 (53%)
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ E +K EE + +E + +EK E + EKEKK D+ KLK+ ++ + +KE E
Sbjct: 393 VRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEA 452
Query: 219 SSHEKEV 225
E ++
Sbjct: 453 EVDEDDI 459
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 30.5 bits (69), Expect = 1.5
Identities = 17/114 (14%), Positives = 65/114 (57%), Gaps = 9/114 (7%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK-------EKKEEKEKK 166
K + ++ + K ++ + + ++ + +++ ++E+++ + +++ ++ E++ ++
Sbjct: 71 KIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQ 130
Query: 167 EEKEFKMKEDKEKPVEKIRK--EEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
E+ KM+E+ E +R+ EE+ E +++ +++ +L++E ++ K + + R
Sbjct: 131 NEELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEAR 184
Score = 29.3 bits (66), Expect = 3.9
Identities = 24/115 (20%), Positives = 55/115 (47%), Gaps = 8/115 (6%)
Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
K ++K + E + + KE E + + K E+ + E +E + ++ E+ R + +D
Sbjct: 58 KMQEKTRQAELEAKIKEYEAQQAQ---AKLERARVEAEERRKTLQEQTQQEQQRAQYQDE 114
Query: 192 EKEKKSKDK--DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 244
K+ + + ++ + E+L K ++E S +E + + T + +M +T E
Sbjct: 115 LARKRYQKELEQQRRQNEELLKMQEE---SVLRQEAMRRATEEEILEMRRETIEE 166
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 30.7 bits (69), Expect = 1.5
Identities = 22/116 (18%), Positives = 46/116 (39%), Gaps = 10/116 (8%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE--------KGKKEEKEKKEE 162
+K ++ +++ + + +K D + D++ + E K +E
Sbjct: 129 QKQRRREDQEDGVRQPGRTEKSGSDFWEPPSSDEDDSDSEDSMSDLYPPELFTLKNPGKE 188
Query: 163 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ E+ +F+ ++ E VE ++K S+K+ S KK KK KK
Sbjct: 189 QNGDEDDDFETDDEDEMEVESPELQQKRSKKQ--SGSLTKKFKKNHKKKGPFRFSS 242
>gnl|CDD|215094 PLN00188, PLN00188, enhanced disease resistance protein (EDR2);
Provisional.
Length = 719
Score = 30.9 bits (70), Expect = 1.5
Identities = 24/103 (23%), Positives = 45/103 (43%), Gaps = 17/103 (16%)
Query: 82 ELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE 141
E G P +P +V + S+ S KK++K + D+ + +
Sbjct: 395 ERGAPPRIPVMVNMASA--------SVSSKKNQKPQESS------PSLDQTNAASRNSVM 440
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
D++ + E+ + E E++ E K E K+ M+E+ P +KI
Sbjct: 441 MDEDSDDDEEFQIPESEQEPETTKNETKDTAMEEE---PQDKI 480
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 30.6 bits (69), Expect = 1.5
Identities = 17/72 (23%), Positives = 35/72 (48%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
+DK ++ EK D K + +K + E+ ++ + EEKE KE++E
Sbjct: 212 QDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEE 271
Query: 172 KMKEDKEKPVEK 183
+ E++E+ ++
Sbjct: 272 EETEEEEEEEDE 283
Score = 29.0 bits (65), Expect = 4.5
Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 4/71 (5%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDK----IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
K+K + EKE + D K K D+E +++ EEKE KE++E++E
Sbjct: 214 KNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKELKEDEEEEE 273
Query: 168 EKEFKMKEDKE 178
+E + +ED++
Sbjct: 274 TEEEEEEEDED 284
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 30.5 bits (69), Expect = 1.5
Identities = 26/152 (17%), Positives = 75/152 (49%), Gaps = 11/152 (7%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+++++E ++ + ++ + KE ++ ++ + +++ K+++EE + E+K +++E+
Sbjct: 4 EREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEEN 63
Query: 178 EKPVEKIRKEEKDSEK-EKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 236
+ E+ E++ E+ E + + ++ K + +++KKE E ++E+ +
Sbjct: 64 RRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELR---------E 114
Query: 237 MEEKTKRESSPKIVIKPVKSPSPPPVAAADEY 268
+E +R + +PP VAA
Sbjct: 115 AQEAHERARQELLEAAA-APTAPPHVAAPVNG 145
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 29.5 bits (66), Expect = 1.6
Identities = 21/99 (21%), Positives = 53/99 (53%), Gaps = 7/99 (7%)
Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIR-------KEEKDSE 192
+++ E +E+ ++ ++ +E +E+++ K KM+E++E + IR KEE + E
Sbjct: 24 EDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDKYGIKKKEEDEEE 83
Query: 193 KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 231
+ +++++ +L ++K ++ E ++E K F
Sbjct: 84 PQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEEEKSGF 122
>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
transcription initiation/nucleotide excision repair
factor TFIIH, subunit TFB3 [Cell division and chromosome
partitioning / Transcription / DNA replication,
recombination, and repair].
Length = 314
Score = 30.8 bits (69), Expect = 1.6
Identities = 25/116 (21%), Positives = 52/116 (44%), Gaps = 6/116 (5%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK----KEKEKGKKEEKEKKEEKEKK 166
+ + +EK+KK ++ + N E+ + K+K +K K KK + E++
Sbjct: 119 ELLELIDVSLTEEKVKKYEEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLERQ 178
Query: 167 EEKEFKMKED--KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
E+E +M ++ K++ ++ +D K K K E+L + E R++
Sbjct: 179 IEEEERMNKEWTKKEIGNRLGTASEDGNKTIKIGIISDKFDPEELPRIVVEPTRNN 234
>gnl|CDD|129262 TIGR00158, L9, ribosomal protein L9. Ribosomal protein L9 appears
to be universal in, but restricted to, eubacteria and
chloroplast [Protein synthesis, Ribosomal proteins:
synthesis and modification].
Length = 148
Score = 29.7 bits (67), Expect = 1.6
Identities = 13/61 (21%), Positives = 22/61 (36%), Gaps = 1/61 (1%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKE-DKEKPVEKIRKEEKDSEKEKKSKDKDK 202
K+ +EK K + +E + K D+ K I ++ + D DK
Sbjct: 52 KKLEEKLAANKAAAARLKEVLELGTLTISKKVGDEGKLFGSITTKQIADALKAAGLDLDK 111
Query: 203 K 203
K
Sbjct: 112 K 112
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 31.1 bits (71), Expect = 1.6
Identities = 31/127 (24%), Positives = 57/127 (44%), Gaps = 32/127 (25%)
Query: 127 KKKDKKD-------KDKIKNKEKDKEKKEKEK------------------GKKEEKEKKE 161
+KK KK+ K K E+ K K+E E K K
Sbjct: 171 RKKSKKEVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMT 230
Query: 162 EKEKKEEKEFKMKE---DKE-KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
+EK +E + +++E D+ +P ++ + EE E K+ ++ KKL+ E+L++ + E+E
Sbjct: 231 PEEKDDEYDQRVRELTFDRRAQPTDRTKTEE---ELAKEEAERLKKLEAERLRRMRGEEE 287
Query: 218 RSSHEKE 224
E++
Sbjct: 288 DDEEEED 294
Score = 30.7 bits (70), Expect = 1.8
Identities = 22/92 (23%), Positives = 46/92 (50%), Gaps = 8/92 (8%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKD-KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
K + +T K K + ++ + D +++ D+ + ++ K EE+ KEE E+
Sbjct: 212 KDLMSLLRTVKPPPKPPMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERL 271
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
++ E + + ++R EE+D E+E+ SK
Sbjct: 272 KKLE-------AERLRRMRGEEEDDEEEEDSK 296
Score = 30.4 bits (69), Expect = 2.3
Identities = 25/121 (20%), Positives = 60/121 (49%), Gaps = 11/121 (9%)
Query: 118 KKKEKEKIK-----------KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
KK E E+++ ++ K+ D + ++ + + G+ EE E++EE
Sbjct: 272 KKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVD 331
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
+E E +D E+ E + +++ ++E + D + ++E+ +K+KK+K+ + + +
Sbjct: 332 DEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
Query: 227 P 227
P
Sbjct: 392 P 392
Score = 28.4 bits (64), Expect = 10.0
Identities = 25/101 (24%), Positives = 42/101 (41%), Gaps = 7/101 (6%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
D D D + + D G+ +E++++E+ E+K+ K+ MKE K K K E
Sbjct: 135 DDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVIAK--SKFYKAE 192
Query: 189 KDSEKEKKSK-----DKDKKLKKEKLKKKKKEKERSSHEKE 224
+ KE+ D D K L+ K + +E
Sbjct: 193 RQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTPEE 233
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 28.9 bits (65), Expect = 1.6
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
K+K KK K K K K+K+K K+E K+EE++ E + KE +E + E E+ +
Sbjct: 11 KLKGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEED 69
Score = 28.5 bits (64), Expect = 2.0
Identities = 21/74 (28%), Positives = 38/74 (51%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
K K I KK KKK+K K K++ + +++ K+ + K+ +E E+ E++E+
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMN 72
Query: 166 KEEKEFKMKEDKEK 179
E E +E + K
Sbjct: 73 LTEAERAFEEAQRK 86
>gnl|CDD|237554 PRK13909, PRK13909, putative recombination protein RecB;
Provisional.
Length = 910
Score = 31.1 bits (71), Expect = 1.6
Identities = 24/102 (23%), Positives = 41/102 (40%), Gaps = 21/102 (20%)
Query: 113 DKKTHKKKEKEKIKKKKD-------------------KKDKDKIKNKEKDKEKKEKEKGK 153
DK + EKEK K ++ KKD+ E + K E+G+
Sbjct: 663 DKDYARALEKEKALKYEEEINVLYVAFTRAKNSLIVVKKDESSGSMFEI-LDLKPLERGE 721
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 195
E KE K KKE +K E++++ E++ +++
Sbjct: 722 IEIKEPKISP-KKESLITSVKLKPHGYQEQVKEIEEEPKEDN 762
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 30.7 bits (70), Expect = 1.6
Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 8/56 (14%)
Query: 179 KPVEKI--------RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
KP+E+I +E K+ E EK+ + +KK ++K KK K+ + E +
Sbjct: 61 KPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEEL 116
>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
[Translation, ribosomal structure and biogenesis].
Length = 335
Score = 30.7 bits (70), Expect = 1.7
Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 14/87 (16%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIK-----KKKDKKD-KDKIKNKEKDKEKKE--- 148
LG + K + + K E++K KK D K+ +E+KE
Sbjct: 3 MIALGRLAKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLEERKEVGA 62
Query: 149 -----KEKGKKEEKEKKEEKEKKEEKE 170
K++ + E E E E
Sbjct: 63 LINELKKEVEDAITELTPELEAAGLWE 89
Score = 29.1 bits (66), Expect = 5.4
Identities = 17/92 (18%), Positives = 30/92 (32%), Gaps = 9/92 (9%)
Query: 127 KKKDKKDKDKIKNKEKDKEK-KEKEKGKKEE-KEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
KK + + K E+ K K GKK + ++ K E E+
Sbjct: 10 AKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLE-------ERKEVGA 62
Query: 185 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
E E E + +L+ L ++ +
Sbjct: 63 LINELKKEVEDAITELTPELEAAGLWERLAFE 94
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 30.6 bits (69), Expect = 1.7
Identities = 19/141 (13%), Positives = 60/141 (42%)
Query: 85 VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 144
V P N +L + + ++ + ++E+EK K+++ K K +
Sbjct: 128 VPPYTRNDFDRREMPELPGVTRSDYDVRELRQKIREEREKEKEEQPMKPAFKPERWMGGP 187
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
+ E E EE + K++++ + + E+ + ++ + + +E++ + ++
Sbjct: 188 DSDEDENPLDEEAPDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQE 247
Query: 205 KKEKLKKKKKEKERSSHEKEV 225
+E +++ +++ + E
Sbjct: 248 LEENVERDSDDEDEHGEDSED 268
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 29.7 bits (67), Expect = 1.7
Identities = 20/94 (21%), Positives = 44/94 (46%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K+ + H K +K+ K +++ + + +++ KKE +E + E
Sbjct: 48 IKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSE 107
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
E+ K ++ +K +++RK+ + EKE+K K
Sbjct: 108 GLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141
Score = 29.3 bits (66), Expect = 2.3
Identities = 23/99 (23%), Positives = 47/99 (47%), Gaps = 2/99 (2%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDK--EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+ E+IK K + + NK DK + +E+ E E + +++ K E+
Sbjct: 43 ERMEEIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEA 102
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
E + +E K + K + K+ + K ++L+K++K+K
Sbjct: 103 NGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 29.7 bits (67), Expect = 1.7
Identities = 34/145 (23%), Positives = 65/145 (44%), Gaps = 16/145 (11%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK-------------KEKEK 151
KK K+ + K ++K+ K K +KKK + I K DK+K
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGLAADPNTA 62
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
+ + + + + ++ E +++ D + E DS++E++ + K+L++E +
Sbjct: 63 ARTDLLIQDKPIPELKKGEARIERDDK---GNPLDEPLDSKEEEEKPEVVKQLEEEASEP 119
Query: 212 KKKEKERSSHEKEVIPKLTFKFGTD 236
KK + S E+E I +L K G D
Sbjct: 120 AKKPRRLSKREEEWIERLVEKHGDD 144
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 30.4 bits (69), Expect = 1.8
Identities = 26/123 (21%), Positives = 51/123 (41%), Gaps = 9/123 (7%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
IV V E+ + + ++E + +K K K+K +KK K
Sbjct: 125 IVFVDDEEEQKSFDPAEYFDTTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLY 184
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
+ +E++++E+K KK E+ +++ + K + K+KK K KK
Sbjct: 185 KELKERKEREKKLKKVEQRLELQRELMKKGKG---------KKKKIVKDKDGKVVYKWKK 235
Query: 212 KKK 214
++K
Sbjct: 236 ERK 238
>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
Length = 499
Score = 30.8 bits (69), Expect = 1.8
Identities = 14/53 (26%), Positives = 21/53 (39%)
Query: 8 PFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKE 60
P +A +P P +Q + P V PP PP + +V P + E
Sbjct: 412 LLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQPPHVQQPRVLPSTDGASNE 464
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 30.1 bits (68), Expect = 1.8
Identities = 17/72 (23%), Positives = 33/72 (45%), Gaps = 2/72 (2%)
Query: 155 EEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
E +K+ + E ++F + +++ EK K + +++LK L+KKK
Sbjct: 44 AELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEVNERLK--ALEKKKA 101
Query: 215 EKERSSHEKEVI 226
E + E+E I
Sbjct: 102 ELKDLEEEREGI 113
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 30.4 bits (69), Expect = 1.8
Identities = 17/57 (29%), Positives = 29/57 (50%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
KK H K KK KK K K+ +K+ +K ++ K K KK ++ ++ +K+
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKK 298
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 29.9 bits (67), Expect = 1.8
Identities = 13/57 (22%), Positives = 15/57 (26%), Gaps = 13/57 (22%)
Query: 3 PFQFSPFPSVAGPGLIP-------------RLPPMLQHPLIPQPLVHPPGTPPPSKM 46
+ PF V GP L+ HP P PG P M
Sbjct: 85 SSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGAGQPPGM 141
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 28.8 bits (65), Expect = 1.8
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 150 EKGKKEEKEKKEEKEKKEEKE 170
+E+KE++EE+E+KEE E
Sbjct: 74 AAAAEEKKEEEEEEEEKEESE 94
Score = 28.4 bits (64), Expect = 2.6
Identities = 7/23 (30%), Positives = 17/23 (73%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKE 170
+K+E+E++EE++++ E+E
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEE 96
Score = 27.6 bits (62), Expect = 5.0
Identities = 7/23 (30%), Positives = 17/23 (73%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKE 170
+++++E++EE+EK+E +E
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEE 95
Score = 27.2 bits (61), Expect = 6.4
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 145 EKKEKEKGKKEEKEKKEEKE 164
E++K ++EE+E+KEE E
Sbjct: 75 AAAEEKKEEEEEEEEKEESE 94
Score = 27.2 bits (61), Expect = 7.4
Identities = 9/22 (40%), Positives = 16/22 (72%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEE 162
E+K++E+ ++EEKE+ EE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEE 95
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 30.7 bits (70), Expect = 1.9
Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 9/106 (8%)
Query: 77 KVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKK------KKD 130
+VK ELG ++V D E+K++ +++ ++E EKIKK KK
Sbjct: 307 EVKAKELGSPAGTKTEIEVGV--DFPELKEELKELEEELKELEEELEKIKKLLKKLPKKA 364
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK-KEEKEFKMKE 175
+ K ++ +K + KEK +E +E +EE ++ KEE E E
Sbjct: 365 RGQLPPEKREQLEKLLETKEKLSEELEELEEELKELKEELESLYSE 410
Score = 30.3 bits (69), Expect = 2.2
Identities = 24/131 (18%), Positives = 59/131 (45%), Gaps = 20/131 (15%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
++K++ + +++ + +E+ ++ K+ K ++ + EK+E+ E K+ E KEK E+
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLE-KLLETKEKLSEE 389
Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH---EKEVIPKLTFKFGTDMEEK 240
+ + E+ + ++LK++ + K + P + G + +
Sbjct: 390 LEELEE---------------ELKELKEELESLYSEGKISVNKTIYPGVKIHIGNKV-LR 433
Query: 241 TKRESSPKIVI 251
KRE P +
Sbjct: 434 IKREIGPCTFV 444
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 30.3 bits (68), Expect = 1.9
Identities = 27/105 (25%), Positives = 58/105 (55%), Gaps = 6/105 (5%)
Query: 131 KKDKDKIKNKEKDKEKKE--KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE- 187
KKD+ I+ +E++ K++ +E+ K++ +E+K E K E+E K + + +K +
Sbjct: 44 KKDRITIQEREREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANI 103
Query: 188 ---EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 229
+ D E E++ + K + +++K+ ++E+E EK I K+
Sbjct: 104 DDVDTDDENEEEEYEAWKLRELKRIKRDREEREEMEREKAEIEKM 148
>gnl|CDD|147348 pfam05124, S_layer_C, S-layer like family, C-terminal region.
Length = 223
Score = 30.1 bits (68), Expect = 1.9
Identities = 25/104 (24%), Positives = 35/104 (33%), Gaps = 11/104 (10%)
Query: 75 YSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK 134
Y VK +ELG + + G+++ ++KT K D KD
Sbjct: 15 YKDVKTLELGEEYIPDWEAYAVLDDGKGKLEYKDDEVLEQKTVGLALKYVGDYLDDIKDG 74
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
DKIK K E+ K F+M E KE
Sbjct: 75 DKIKIANYAKLLF-----------DDEDSSDKLNVYFEMDEYKE 107
>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
recombination, and repair].
Length = 1041
Score = 30.8 bits (70), Expect = 1.9
Identities = 14/85 (16%), Positives = 26/85 (30%), Gaps = 12/85 (14%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK---------EDKEKPVEKIRKEEKDSEKE 194
+ E +EK + + EE+ K D K ++ + K
Sbjct: 8 RVPSEYGFVSTKEKRVERKAATAEEESLKNIDISDLFERSVDMNKDPKEATDQVKAV--- 64
Query: 195 KKSKDKDKKLKKEKLKKKKKEKERS 219
K + + +L + KE S
Sbjct: 65 DKHSEIESLNVGMRLDVRLSSKELS 89
>gnl|CDD|184294 PRK13743, PRK13743, conjugal transfer protein TrbF; Provisional.
Length = 141
Score = 29.4 bits (66), Expect = 2.0
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 165 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
K ++ MKE K KIR+ ++D + E++ D
Sbjct: 4 YKYQRPLSMKEKKNDAELKIRQTKRDYDYERRVSDI 39
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 29.7 bits (67), Expect = 2.0
Identities = 19/66 (28%), Positives = 36/66 (54%)
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
KD + K+ E+ K EKE E++ ++E + ++ E + K +E R+ EK++ +
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161
Query: 202 KKLKKE 207
+LK E
Sbjct: 162 ARLKSE 167
Score = 28.6 bits (64), Expect = 5.8
Identities = 24/83 (28%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
KD +T K +E++ ++K+ +K+ I +E D E E E+ E + + ++K+E E
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKE---ISRQELDSEIAELERKILELQRQAALLKEKKEAED 158
Query: 172 KMKEDKEKPVEKIRKEEKDSEKE 194
K + KI++E D+E E
Sbjct: 159 KEIARLKSEASKIKQELVDAELE 181
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 29.8 bits (67), Expect = 2.0
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKE 148
EK KK K KK+K K K E K+ +E
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKARE 115
Score = 29.0 bits (65), Expect = 3.7
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
K KKKK K K+K+K K+ + +KK +E
Sbjct: 82 RKPATTAVEKKKKGKSKKNKLKGKKDEDKKKARE 115
>gnl|CDD|221805 pfam12848, ABC_tran_2, ABC transporter. This domain is related to
pfam00005.
Length = 85
Score = 28.3 bits (64), Expect = 2.0
Identities = 19/63 (30%), Positives = 35/63 (55%), Gaps = 11/63 (17%)
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEF-----------KMKEDKEKPVEKIRKEEKDSEK 193
E+KE+ ++E+ +K++KE K+ +EF K + + K +EK+ + EK EK
Sbjct: 20 EQKEERLEQQEKAYEKQQKEIKKLEEFIDRFRAKASKAKQAQSRIKALEKMERIEKPVEK 79
Query: 194 EKK 196
+ K
Sbjct: 80 KPK 82
>gnl|CDD|220261 pfam09484, Cas_TM1802, CRISPR-associated protein TM1802
(cas_TM1802). Clusters of short DNA repeats with
non-homologous spacers, which are found at regular
intervals in the genomes of phylogenetically distinct
prokaryotic species, comprise a family with recognisable
features. This family is known as CRISPR (short for
Clustered, Regularly Interspaced Short Palindromic
Repeats). A number of protein families appear only in
association with these repeats and are designated Cas
(CRISPR-Associated) proteins. This minor cas protein is
found in at least five prokaryotic genomes:
Methanosarcina mazei, Sulfurihydrogenibium azorense,
Thermotoga maritima, Carboxydothermus hydrogenoformans,
and Dictyoglomus thermophilum, the first of which is
archaeal while the rest are bacterial.
Length = 584
Score = 30.4 bits (69), Expect = 2.1
Identities = 24/151 (15%), Positives = 53/151 (35%), Gaps = 22/151 (14%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKK----------------KEKEKIKKKKDKKDKD 135
++K+ + EI+ + DKKT +K KI ++
Sbjct: 39 VLKIDFDTEENEIEIVIGEEFDKKTARKYLYKGQAGNGNSSQWYSPTNKITYDPEETLNK 98
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK---EEKDSE 192
K+K+ K K + E K K +K ++ E +K+ + + ++
Sbjct: 99 KLKSIFKKFYKDKGEINKYRYFLKDIKKVLEKNFEKIIKDLIDLKKNEGVLYTIYFLKND 158
Query: 193 KEKKSKDKD---KKLKKEKLKKKKKEKERSS 220
+K D + K+ + ++ +K +
Sbjct: 159 GKKYLSDYEEYRNKVLYKIEQRYFSKKSKEE 189
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 30.5 bits (69), Expect = 2.1
Identities = 15/46 (32%), Positives = 17/46 (36%), Gaps = 1/46 (2%)
Query: 3 PFQFSPFPSVAGPGLIPRLPPM-LQHPLIPQPLVHPPGTPPPSKMK 47
P Q P PGL ++PP Q P Q P PPP
Sbjct: 237 PQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQP 282
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 30.2 bits (69), Expect = 2.1
Identities = 9/35 (25%), Positives = 21/35 (60%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
+++K + + + ++EE+E++EE+E EE
Sbjct: 285 DEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEE 319
Score = 29.8 bits (68), Expect = 2.6
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 132 KDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
DKD + + K+ + + EE+E++EE+E++EE
Sbjct: 279 ADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPS 317
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 30.1 bits (68), Expect = 2.2
Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 5/73 (6%)
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKERSSHEKEVIP 227
K+F + E E + + +KK + + KK K E+LK +K+ ++ EK
Sbjct: 66 KDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEK---- 121
Query: 228 KLTFKFGTDMEEK 240
+L+ T + E+
Sbjct: 122 QLSSSLQTKLTEE 134
Score = 29.7 bits (67), Expect = 2.7
Identities = 14/60 (23%), Positives = 27/60 (45%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
+ EKE + + K + E K+K+ +E ++K + KEK+ + K +E
Sbjct: 75 EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQTKLTEE 134
Score = 28.6 bits (64), Expect = 5.7
Identities = 9/41 (21%), Positives = 26/41 (63%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
K + +K+ + + ++ K+KE+ + ++K++ EE + ++K
Sbjct: 73 KPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKK 113
>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
Length = 159
Score = 29.2 bits (66), Expect = 2.2
Identities = 23/94 (24%), Positives = 48/94 (51%), Gaps = 3/94 (3%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
+KIK D + + EK E ++K + K + E++ + K+E K K+ E+ + E
Sbjct: 28 DKIKAVIDSRQSEIDNKIEKADEDQKKARELKLKNERELKNAKEEGK--KIVEEYKSKAE 85
Query: 183 KIRKE-EKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
+ +E K++ +E + KL+ ++ K+K +
Sbjct: 86 NVYEEIVKEAHEEADLIIERAKLEAQREKEKAEY 119
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 29.0 bits (65), Expect = 2.3
Identities = 29/104 (27%), Positives = 53/104 (50%), Gaps = 2/104 (1%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
K K KE + KK D + K+ + ++ +++KK+E + EE + KM+
Sbjct: 15 KRKKNAPKELARLKKALGLIDGNVLMKDVKDIATVKPAEEVKEKKKKEGTESEEDDEKME 74
Query: 175 EDKEKPV--EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 216
D + V +K +++ ++ + KKLKK++ KKKKK+K
Sbjct: 75 VDNKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKKKK 118
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 30.3 bits (69), Expect = 2.3
Identities = 18/119 (15%), Positives = 44/119 (36%), Gaps = 14/119 (11%)
Query: 97 SSEDLGEIKK-DKSIKKDKKTH------KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
+ D+ ++ + D+ +K + + + ++I + + + D + + KE KEK
Sbjct: 24 DALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK 83
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK-------IRKEEKDSEKEKKSKDKD 201
K + ++ E + + PV K +R+ + + KD
Sbjct: 84 LKELEAALDELEAELDTLLLTIPNIPHESVPVGKDEDDNVEVRRWGEPPVFVFDFEPKD 142
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 30.4 bits (67), Expect = 2.4
Identities = 32/109 (29%), Positives = 63/109 (57%), Gaps = 4/109 (3%)
Query: 116 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK----EEKEF 171
T + ++ EK+KK + KK + +I + ++K +K ++E + E + KK EE+
Sbjct: 61 TEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHK 120
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
+ K ++E+ ++ K+E D +K + K K+K +K EK +K +K K+ S+
Sbjct: 121 QWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKAST 169
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 30.3 bits (69), Expect = 2.4
Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 3/70 (4%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
K+ +++ EK++K K ++ KN K+ K E ++ EK E KE
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASL---EKRIDEAKEL 297
Query: 172 KMKEDKEKPV 181
K E P+
Sbjct: 298 IAKYGNEVPL 307
Score = 28.4 bits (64), Expect = 8.5
Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 2/56 (3%)
Query: 111 KKDKKTHKKKEK--EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
+K +K K K EK++K + K K+K+ E+ EK + +E K E
Sbjct: 249 EKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAKYGNE 304
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 30.4 bits (68), Expect = 2.4
Identities = 34/150 (22%), Positives = 61/150 (40%), Gaps = 3/150 (2%)
Query: 32 QPLVHPPGTPPPSKMKDKVSPKD-KTSPKEKMSSKEKTSPKEKEYSKVKDVELG-VTPMV 89
PG P ++ + S D K+ K SK + E S + G + V
Sbjct: 126 PNEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGKASASV 185
Query: 90 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEK 149
IV +++ +T+ K +K +++ D +KN + DK +E+
Sbjct: 186 AGIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEIDILKN-DGDKVAEEE 244
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
E EE E+ EE +K+ ++ + D+EK
Sbjct: 245 EFELDEEHEEAEEDKKEALEKIGAEGDEEK 274
>gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination,
and repair / Cell division and chromosome partitioning /
Translation].
Length = 953
Score = 30.3 bits (68), Expect = 2.5
Identities = 30/147 (20%), Positives = 49/147 (33%), Gaps = 8/147 (5%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE-- 161
++ K I K T + +EK ++ KD + K E K
Sbjct: 412 YEQKKLIGSIKPTLMDQLQEKKSPDLPDEEFIDTLALPKDLDMKNHELFLKRFANDLGLS 471
Query: 162 -EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-----KEKKSKDKDKKLKKEKLKKKKKE 215
K K + + ++ D +EE +SE K ++E+ + +K
Sbjct: 472 ISKAIKSKGNYSLEMDIASDSPDEDEEEFESEVDSIRKIPDKYVNIIVEEEEENETEKTV 531
Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTK 242
R KE F TD EEK +
Sbjct: 532 NLRFPGWKERYYTSKLHFTTDSEEKIR 558
>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV. Bacterial DNA
topoisomerase IV, GyrA, ParC.
Length = 444
Score = 30.2 bits (69), Expect = 2.5
Identities = 18/90 (20%), Positives = 32/90 (35%), Gaps = 20/90 (22%)
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK------- 202
+ K+ K E + + + + + + KD KE K K ++
Sbjct: 340 TRRKEYLLGKLEAERLHILEGLFIALSIIDEIIVLIRSSKDLAKEAKEKLMERFELSEIQ 399
Query: 203 -------------KLKKEKLKKKKKEKERS 219
KL+ EKL+K+ KE E+
Sbjct: 400 ADAILDMRLRRLTKLEVEKLEKELKELEKE 429
>gnl|CDD|153283 cd07599, BAR_Rvs167p, The Bin/Amphiphysin/Rvs (BAR) domain of
Saccharomyces cerevisiae Reduced viability upon
starvation protein 167 and similar proteins. BAR
domains are dimerization, lipid binding and curvature
sensing modules found in many different proteins with
diverse functions. This subfamily is composed of fungal
proteins with similarity to Saccharomyces cerevisiae
Reduced viability upon starvation protein 167 (Rvs167p)
and Schizosaccharomyces pombe Hob1 (homolog of Bin1). S.
cerevisiae Rvs167p plays a role in regulation of the
actin cytoskeleton, endocytosis, and sporulation. It
forms a heterodimer with another BAR domain protein
Rvs161p. Rvs161p and Rvs167p share common functions but
are not interchangeable. Their BAR domains cannot be
replaced with each other and the overexpression of one
cannot suppress the mutant phenotypes of the other.
Rvs167p also interacts with the GTPase activating
protein (GAP) Gyp5p, which is involved in ER to Golgi
vesicle trafficking. BAR domains form dimers that bind
to membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 216
Score = 29.5 bits (67), Expect = 2.5
Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 13/81 (16%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKK-DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
K IKK +KT IKK+ KK D DK++NK +K+KE K+EK + K
Sbjct: 107 KYIKKIRKT--------IKKRDHKKLDYDKLQNKLNKLLQKKKELSLKDEK----QLAKL 154
Query: 167 EEKEFKMKEDKEKPVEKIRKE 187
E K + KE+ E E ++ E
Sbjct: 155 ERKLEEAKEEYEALNELLKSE 175
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 30.4 bits (68), Expect = 2.5
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
EK K+ D+ ++D K E +E K +E + +E +KK+ K+ KE + ++K KP
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKP 267
Query: 181 VEKIRKEEKDSEKEK 195
+ ++ KD KE+
Sbjct: 268 LWT--RDPKDVTKEE 280
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 30.0 bits (68), Expect = 2.5
Identities = 31/137 (22%), Positives = 63/137 (45%), Gaps = 19/137 (13%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD---------KKDKDKIKNKEKDKEKKEK 149
E+L +K+D + + K K++ +KD K+ +D++++ + + + K
Sbjct: 151 ENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAK 210
Query: 150 EKGKKEEKEKKEEKEKKEEKEFKMKE----------DKEKPVEKIRKEEKDSEKEKKSKD 199
EK KK +E + +K EE E +++E K + +I + EK E+ +
Sbjct: 211 EKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTF 270
Query: 200 KDKKLKKEKLKKKKKEK 216
K+ + KE+LK +
Sbjct: 271 KEIEKLKEQLKLLQSLT 287
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 29.7 bits (67), Expect = 2.5
Identities = 16/65 (24%), Positives = 25/65 (38%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
K K K KEK ++KK + D DK + E D + E+ +E +
Sbjct: 59 PRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDI 118
Query: 166 KEEKE 170
E +
Sbjct: 119 DNEDD 123
Score = 29.3 bits (66), Expect = 3.0
Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 3/92 (3%)
Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE--FKMKEDKEKPVEK 183
+ KK + + K +KK+KEK +EEK+ + + +K EE E E+ E
Sbjct: 50 QPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQES 109
Query: 184 IRK-EEKDSEKEKKSKDKDKKLKKEKLKKKKK 214
D + E D D KL E + K K
Sbjct: 110 ASANSLSDIDNEDDMDDSDLKLATENIIKDLK 141
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 30.1 bits (68), Expect = 2.5
Identities = 31/196 (15%), Positives = 71/196 (36%), Gaps = 14/196 (7%)
Query: 44 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLG- 102
K + + + S+ + K D T T++ ++
Sbjct: 196 QKAAEYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPP 255
Query: 103 -EIKKDKSIKKDKKTHKKKEKEKIKKKK---DKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E+ ++ K+++ + +E+I + + ++ + K K+ E+ ++E E
Sbjct: 256 AEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENE 315
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKI---------RKEEKDSEKEKKSKDKDKKLKKEKL 209
+ EE+ K +++ ++ D E V K+ R E + E + + K
Sbjct: 316 ELEEEYKIKKRTVELLPDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKE 375
Query: 210 KKKKKEKERSSHEKEV 225
K + KE E E+
Sbjct: 376 KNRNKEDETQRQLDEI 391
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 28.0 bits (63), Expect = 2.5
Identities = 11/36 (30%), Positives = 22/36 (61%)
Query: 141 EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
+K KEKKE+ + KEE ++ +KE++ ++ +
Sbjct: 50 DKVKEKKEEREEDKEELIERIKKEEETFEDVDLGSA 85
Score = 27.3 bits (61), Expect = 4.7
Identities = 13/31 (41%), Positives = 23/31 (74%)
Query: 164 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
+K +EK+ + +EDKE+ +E+I+KEE+ E
Sbjct: 50 DKVKEKKEEREEDKEELIERIKKEEETFEDV 80
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 28.5 bits (64), Expect = 2.5
Identities = 8/23 (34%), Positives = 17/23 (73%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKE 170
++EE+E++EE+E++E +E
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEE 94
Score = 28.5 bits (64), Expect = 2.8
Identities = 8/23 (34%), Positives = 18/23 (78%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKE 170
+ ++EE+E++EE+E++ E+E
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEE 95
Score = 27.7 bits (62), Expect = 5.0
Identities = 9/22 (40%), Positives = 17/22 (77%)
Query: 147 KEKEKGKKEEKEKKEEKEKKEE 168
E+ ++EE+E++EE+E+ EE
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEE 94
Score = 27.3 bits (61), Expect = 7.1
Identities = 9/24 (37%), Positives = 18/24 (75%)
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEE 168
+E+ ++EE+E++EE+E +EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 350 to 473 amino acids in length. This domain is
found associated with pfam07923.
Length = 402
Score = 30.0 bits (68), Expect = 2.6
Identities = 21/78 (26%), Positives = 36/78 (46%)
Query: 153 KKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
K+ + K K+ EE+E MK ++ E + + D + K + + + K
Sbjct: 12 SKKVRMKLATKQLWEEREKFMKYERGWDTESLVGDPLDISESVKELKLEMYTSLAEDQPK 71
Query: 213 KKEKERSSHEKEVIPKLT 230
K+E ER S + E IP+L
Sbjct: 72 KEEIERLSTDSEEIPRLD 89
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 29.6 bits (67), Expect = 2.6
Identities = 34/96 (35%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK--DKEKKEKEKGKKEEKEKKEEKEKKEE 168
+ K + +EKEK K K + K+K+ EK K K KEK KK+ K++ +E+++K E
Sbjct: 111 LDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVE 170
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
K+ ++ K + K RK++K ++K+KK+K K + L
Sbjct: 171 KKKAERQKKREENLKKRKDDKKNKKKKKAKKKGRIL 206
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 30.1 bits (68), Expect = 2.6
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 1/48 (2%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
K KE K K K+D ++ E + KKE + + E +E K + K
Sbjct: 58 KLNKEVAKLKIAKEDATELI-AETKELKKEITEKEAEVQEAKAALDAK 104
Score = 29.7 bits (67), Expect = 3.4
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 7/74 (9%)
Query: 107 DKSIKKDKKTHKKK-EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
D+ I DK+ +++ E + ++K+ +K NKE K K KE + E KE K++
Sbjct: 33 DEVIALDKEWRQRQFELDSLRKEFNKL------NKEVAKLKIAKEDATELIAETKELKKE 86
Query: 166 KEEKEFKMKEDKEK 179
EKE +++E K
Sbjct: 87 ITEKEAEVQEAKAA 100
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.9 bits (67), Expect = 2.6
Identities = 24/114 (21%), Positives = 52/114 (45%), Gaps = 3/114 (2%)
Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
+K ++ E + ++ +E KEK EE +K++E+ K E+ E E++++ K E E
Sbjct: 121 LKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENS 180
Query: 197 SKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 250
++ K ++ KK + E+ + +D+ ++T + I
Sbjct: 181 RLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELI---SDLVKETLNLAPKDIE 231
>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
Length = 1119
Score = 30.2 bits (68), Expect = 2.7
Identities = 25/93 (26%), Positives = 42/93 (45%), Gaps = 2/93 (2%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
K +K K K+++K++ K+++ EK EK +KE+ ++ E + +
Sbjct: 577 SKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKKEREADDDPEHLDSFPILSLSD 636
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
KE +E P K+ K DS KE D D
Sbjct: 637 LN-KETEEIPT-KLYKLSSDSLKENMDLDSDGG 667
Score = 29.8 bits (67), Expect = 3.8
Identities = 25/126 (19%), Positives = 52/126 (41%), Gaps = 25/126 (19%)
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
+ EK+ K K+E +++ KE+ + +M++ EK K+E++++ +
Sbjct: 574 VESSKYEKEFNKLV-----KDELKERLSHLTKEQVD-EMEKAYEK-----FKKEREADDD 622
Query: 195 KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKT----KRESSPKIV 250
+ D L L K E E IP +K +D ++ S ++
Sbjct: 623 PEHLDSFPILSLSDLNK----------ETEEIPTKLYKLSSDSLKENMDLDSDGGSVTVL 672
Query: 251 IKPVKS 256
+ P++S
Sbjct: 673 VHPIES 678
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 29.5 bits (66), Expect = 2.7
Identities = 17/59 (28%), Positives = 33/59 (55%), Gaps = 7/59 (11%)
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
S +KD K ++ + D IKN +++++K+K K+E++ KK+E+ K+E
Sbjct: 68 SFEKDHKFIRRCKLSDA-------DFSLIKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 30.0 bits (67), Expect = 2.7
Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 8/76 (10%)
Query: 139 NKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE-------EKDS 191
+ E D E + K E+E+ +KE++E+++ + E+ E+ VE + +E EKD
Sbjct: 83 DNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEE-VEVVEEEYDDDEDSEKDD 141
Query: 192 EKEKKSKDKDKKLKKE 207
EKE ++ + +L E
Sbjct: 142 EKESDAEGDENELAGE 157
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 30.2 bits (68), Expect = 2.7
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 197 SKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
+K+K+KK+ L KK KEKE S + +
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCI 359
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 29.6 bits (67), Expect = 2.7
Identities = 19/67 (28%), Positives = 40/67 (59%), Gaps = 10/67 (14%)
Query: 161 EEKEKKEEKEFKMKEDK-----EKPVEKIRK-----EEKDSEKEKKSKDKDKKLKKEKLK 210
+ KE+ ++K +K ++K K VE+++K +E ++ ++KK + ++ KK+K K
Sbjct: 128 DVKEEGDDKFYKYSDEKLLAWLLKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKK 187
Query: 211 KKKKEKE 217
K K+K+
Sbjct: 188 KSAKKKK 194
Score = 28.8 bits (65), Expect = 4.2
Identities = 19/54 (35%), Positives = 33/54 (61%), Gaps = 1/54 (1%)
Query: 114 KKTHK-KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
KK + KK++ IK+ + +DK K N E++K+KK+K+ KK++ +K K
Sbjct: 151 KKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 29.1 bits (66), Expect = 2.8
Identities = 8/52 (15%), Positives = 22/52 (42%), Gaps = 6/52 (11%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK 169
+ +++ K + KEK + + E ++ + + KE + E + +
Sbjct: 65 AAAPRAELQAKIAR------YKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110
>gnl|CDD|153364 cd07680, F-BAR_PACSIN1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons 1 (PACSIN1). F-BAR
domains are dimerization modules that bind and bend
membranes and are found in proteins involved in membrane
dynamics and actin reorganization. Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins,
also called Synaptic dynamin-associated proteins
(Syndapins), act as regulators of cytoskeletal and
membrane dynamics. Vetebrates harbor three isoforms with
distinct expression patterns and specific functions.
PACSIN 1 or Syndapin I is expressed specifically in the
brain and is localized in neurites and synaptic boutons.
It binds the brain-specific proteins dynamin I,
synaptojanin, synapsin I, and neural Wiskott-Aldrich
syndrome protein (nWASP), and functions as a link
between the cytoskeletal machinery and synaptic vesicle
endocytosis. PACSIN 1 interacts with huntingtin and may
be implicated in the neuropathology of Huntington's
disease. It contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 258
Score = 29.6 bits (66), Expect = 2.8
Identities = 25/99 (25%), Positives = 50/99 (50%), Gaps = 3/99 (3%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE---EKEKKEEKEFKMKEDKEK 179
+++K +D +K+KN +KD K+ G KE KE ++ + +K K+ K E +K
Sbjct: 81 QEVKNNLLNEDLEKVKNWQKDAYHKQIMGGFKETKEAEDGFRKAQKPWAKKMKELEAAKK 140
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
KEEK + + + ++ + E+ KK + + ++
Sbjct: 141 AYHLACKEEKLAMTREANSKAEQSVTPEQQKKLQDKVDK 179
>gnl|CDD|115185 pfam06513, DUF1103, Repeat of unknown function (DUF1103). This
family consists of several repeats of around 30 residues
in length which are found specifically in
mature-parasite-infected erythrocyte surface antigen
proteins from Plasmodium falciparum. This family often
found in conjunction with pfam00226.
Length = 215
Score = 29.4 bits (65), Expect = 2.8
Identities = 36/146 (24%), Positives = 78/146 (53%)
Query: 78 VKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI 137
+++V+ + V + +K +E ++K + I ++ K KK+ E K+ D + DK+
Sbjct: 70 IEEVKEEIKKQVEDGIKENDTEGNDKVKGPEIITEEVKEEIKKQVEDGIKENDTEGNDKV 129
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
K E E+ ++E K+ E+ KE + ++K + E+ E+I+K+ ++ KE +
Sbjct: 130 KGPEIITEEVKEEIKKQVEEGIKENDTEGKDKLIGPEIITEEVKEEIKKQVEEGIKENDT 189
Query: 198 KDKDKKLKKEKLKKKKKEKERSSHEK 223
++KDK + +E + ++ K++ EK
Sbjct: 190 ENKDKVIGQEIITEEVKKEIEKQEEK 215
Score = 28.7 bits (63), Expect = 4.8
Identities = 44/178 (24%), Positives = 84/178 (47%)
Query: 48 DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKD 107
DK K+K K+ + + KE + K V+ V +K +E ++K
Sbjct: 7 DKEDVKEKNDGKKDKVIGSEKTQKEIKEKVEKRVKEKCKKKVKKGIKENDTEGNDKVKGP 66
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
+ I ++ K KK+ E K+ D + DK+K E E+ ++E K+ E KE +
Sbjct: 67 EIIIEEVKEEIKKQVEDGIKENDTEGNDKVKGPEIITEEVKEEIKKQVEDGIKENDTEGN 126
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 225
+K + E+ E+I+K+ ++ KE ++ KDK + E + ++ KE+ + E+ +
Sbjct: 127 DKVKGPEIITEEVKEEIKKQVEEGIKENDTEGKDKLIGPEIITEEVKEEIKKQVEEGI 184
>gnl|CDD|227376 COG5043, MRS6, Vacuolar protein sorting-associated protein
[Intracellular trafficking and secretion].
Length = 2552
Score = 30.2 bits (68), Expect = 2.8
Identities = 23/95 (24%), Positives = 39/95 (41%), Gaps = 5/95 (5%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
+K++K K + K KEK E K E++ K FK + + + K
Sbjct: 305 VMKQQKFLKYRPKSTPKEKPLEWF-KYIILVVLDSIHEKRYHWTWKYFKERCEDRRAYMK 363
Query: 184 I-RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
I RK+ + + S D K+L+K +L +
Sbjct: 364 IIRKKYLNEQL---SADLSKELEKRELTYDLIKYR 395
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 30.3 bits (68), Expect = 2.8
Identities = 21/71 (29%), Positives = 41/71 (57%), Gaps = 1/71 (1%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
+ KD + K +EKI +D+ +K++ ++E K + + K E E++E +K +
Sbjct: 663 ELRAKDLEAEHKSLQEKISSLEDELEKERQNHQE-LIAKCRELEEKIERAEQEENMQKLD 721
Query: 168 EKEFKMKEDKE 178
E+E K+K++KE
Sbjct: 722 EEEQKIKQEKE 732
>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
[Transcription].
Length = 182
Score = 29.5 bits (66), Expect = 2.8
Identities = 17/86 (19%), Positives = 33/86 (38%), Gaps = 3/86 (3%)
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKE---KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 191
+K+K D + E EK +KE + E +K + + F M ++ + K + +
Sbjct: 97 EKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDKELLEEKLKGEVIKGEP 156
Query: 192 EKEKKSKDKDKKLKKEKLKKKKKEKE 217
+E K + L + K
Sbjct: 157 LEEVIDKLEYPVESPADLLHEIKIHL 182
>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
repeat [General function prediction only].
Length = 561
Score = 29.8 bits (67), Expect = 2.9
Identities = 13/73 (17%), Positives = 32/73 (43%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
+++ KK+ + + + + K + +KK + + + ++ E + K + E
Sbjct: 489 SANRSSAPRKKELVEQWPEYSDEDKIRSLLKKLRAIEALKERMRSGEELEVIQVNKIETE 548
Query: 156 EKEKKEEKEKKEE 168
E+ E KE E+
Sbjct: 549 EEVLSELKELVED 561
>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
Length = 444
Score = 30.1 bits (68), Expect = 2.9
Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 5/109 (4%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
IV E + + D + ++E+EK +K + K KE KE
Sbjct: 224 IVNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETSIAKESKELPD 283
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 200
G +E + EE+ K E FK E P+ + + K E +++ +
Sbjct: 284 G-EEIEFGNEERFKAPEILFK----PELPISGLEEAGKIDESKQELVAE 327
Score = 28.2 bits (63), Expect = 9.7
Identities = 25/122 (20%), Positives = 48/122 (39%), Gaps = 14/122 (11%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK-- 179
KE++ + D+ + +E+ +E++EK K E + KE KE K D E+
Sbjct: 229 KEEVCETDDESAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETSIAKESKELPDGEEIE 288
Query: 180 ---------PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV-IPKL 229
P + E S E+ K + K+E + + + + +P+L
Sbjct: 289 FGNEERFKAPEILFKPELPISGLEEAG--KIDESKQELVAENYEISPTNLGNDIAGLPEL 346
Query: 230 TF 231
+
Sbjct: 347 VY 348
>gnl|CDD|205150 pfam12925, APP_E2, E2 domain of amyloid precursor protein. The E2
domain is the largest of the conserved domains of the
amyloid precursor protein. The structure of E2 consists
of two coiled-coil sub-structures connected through a
continuous helix, and bears an unexpected resemblance to
the spectrin family of protein structures.E 2 can
reversibly dimerise in solution, and the dimerisation
occurs along the longest dimension of the molecule in an
antiparallel orientation, which enables the N-terminal
substructure of one monomer to pack against the
C-terminal substructure of a second monomer. The high
degree of conservation of residues at the putative dimer
interface suggests that the E2 dimer observed in the
crystal could be physiologically relevant. Heparin
sulfate proteoglycans, the putative ligands for the
precursor present in extracellular matrix, bind to E2 at
a conserved and positively charged site near the dimer
interface.
Length = 193
Score = 29.3 bits (66), Expect = 2.9
Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 143 DKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
D E + ++ KK +EK E+ + KE++ E + K + K + K ++ +K
Sbjct: 22 DNEHERYKEAKKRLEEKHRERMTQVMKEWEEAESQYKNLPK--ADPKAAQLMRKE 74
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 28.8 bits (65), Expect = 3.0
Identities = 19/78 (24%), Positives = 47/78 (60%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
K+ + + K +++ ++KE+E+ +E K +EE++++ EK+ K +++ K +E+ +E+D
Sbjct: 29 KRKQQRRKKAQEEAKEKEREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEED 88
Query: 191 SEKEKKSKDKDKKLKKEK 208
+E E +D + +
Sbjct: 89 AETEDTEDVEDDEWEGFP 106
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 29.8 bits (67), Expect = 3.0
Identities = 15/127 (11%), Positives = 52/127 (40%), Gaps = 4/127 (3%)
Query: 118 KKKEKEKIKKKKD-KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
K+ E + +D K + +++ + K ++ + K E + E + + + E
Sbjct: 232 KETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWKAETRREAEQ 291
Query: 177 -KEKPVEKIRKEEKDSEKEKKSKD--KDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
+ + I++E+ +E+E + + ++++ ++++K+ + I +
Sbjct: 292 AEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQA 351
Query: 234 GTDMEEK 240
+ +
Sbjct: 352 QEEAKAA 358
>gnl|CDD|224291 COG1372, COG1372, Intein/homing endonuclease [DNA replication,
recombination, and repair].
Length = 420
Score = 30.0 bits (67), Expect = 3.0
Identities = 17/91 (18%), Positives = 39/91 (42%), Gaps = 2/91 (2%)
Query: 102 GEIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKEKEKGKKEEKEK 159
K K+ K + K KEK+K ++ K ++ K + + + + + +++
Sbjct: 307 YAYKLGKAAKIAEALEKLKEKKKNINRRILRKIAIREVLLKLEGRLLSKLKLLLEILRKE 366
Query: 160 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
K KE + K+ ++ +E+I + D
Sbjct: 367 KLLKEDENLKKLSKGDEYFDEIEEIEEIGYD 397
>gnl|CDD|233469 TIGR01564, S_layer_MJ, S-layer protein, MJ0822 family. This model
represents one of several families of proteins
associated with the formation of prokaryotic S-layers.
Members of this family are found in archaeal species,
including Pyrococcus horikoshii (split into two tandem
reading frames), Methanococcus jannaschii, and related
species. Some local similarity can be found to other
S-layer protein families [Cell envelope, Surface
structures].
Length = 571
Score = 29.9 bits (67), Expect = 3.0
Identities = 32/146 (21%), Positives = 46/146 (31%), Gaps = 20/146 (13%)
Query: 53 KDKTSPKEKMSSKEKTSPKEKEYS------KVKDVELGV--------TPMVPNIVKVTSS 98
KD EK S K Y +V D V + ++ +
Sbjct: 310 KDGKVVAEKSDSDTKDGALVHVYFGGGIGIQVLDAWENVGGDYGYVQLKIYKDLKTLELG 369
Query: 99 EDLGEIKKDKSIKKDKKTHKKKE---KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
E+ K + D T K + E + K+ D K KE G E
Sbjct: 370 EEYMPDWKAEFAIDDNNTVIKWDALVDEDPLEDKEIGIFLTYTGDYSDTVKSGKEVGIIE 429
Query: 156 EKEKKEEKEKKEEK---EFKMKEDKE 178
K + E KE+K +F M E K+
Sbjct: 430 YAVLKLDDEDKEDKLYIDFSMDEYKD 455
>gnl|CDD|153339 cd07655, F-BAR_PACSIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization. Protein
kinase C and Casein kinase Substrate in Neurons (PACSIN)
proteins, also called Synaptic dynamin-associated
proteins (Syndapins), act as regulators of cytoskeletal
and membrane dynamics. They bind both dynamin and
Wiskott-Aldrich syndrome protein (WASP), and may provide
direct links between the actin cytoskeletal machinery
through WASP and dynamin-dependent endocytosis.
Vetebrates harbor three isoforms with distinct
expression patterns and specific functions. PACSINs
contain an N-terminal F-BAR domain and a C-terminal SH3
domain. F-BAR domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 258
Score = 29.6 bits (67), Expect = 3.0
Identities = 28/113 (24%), Positives = 46/113 (40%), Gaps = 18/113 (15%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE---KEFKMKEDKEKP 180
I+ K +++K +K+ K G KE KE ++ K ++ K K E +K
Sbjct: 82 SIRDKLLNDVVEEVKTWQKENYHKSMMGGFKETKEAEDGFAKAQKPWAKLLKKVEKAKKA 141
Query: 181 VEKIRKEEKD---------------SEKEKKSKDKDKKLKKEKLKKKKKEKER 218
K EK ++ KK +DK +K K+E K K K ++
Sbjct: 142 YHAACKAEKSAQKQENNAKSDTSLSPDQVKKLQDKVEKCKQEVSKTKDKYEKA 194
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 28.7 bits (65), Expect = 3.1
Identities = 29/104 (27%), Positives = 61/104 (58%), Gaps = 10/104 (9%)
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK--MK 174
K++E E+++ D K +E+ + ++E K ++EE EKKEE+ ++ +F +K
Sbjct: 4 EKRREMEEVQLALDAK-------REEFERREELLKQREEELEKKEEELQESLIKFDKFLK 56
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
E++ K + K+ ++ +K +K K+++ K K +L++ K E E+
Sbjct: 57 ENEAK-RRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEK 99
Score = 28.3 bits (64), Expect = 4.1
Identities = 24/82 (29%), Positives = 52/82 (63%), Gaps = 3/82 (3%)
Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEK 195
++ + + +E + K+EE E++EE K+ E+E + KE++ ++ + K K K++E ++
Sbjct: 3 LEKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKR 62
Query: 196 KSKDKDKKLKKEKLKKKKKEKE 217
+ +KK ++EK +K+KE+E
Sbjct: 63 --RRAEKKAEEEKKLRKEKEEE 82
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 30.0 bits (67), Expect = 3.1
Identities = 18/98 (18%), Positives = 42/98 (42%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
E+ I + K K KK + K+D++ ++ + DK K + +++E
Sbjct: 203 EEEEAIYYEDFFDKPTKEPVKKHSDVKDPKEDEELDEEEHDSAMDKVKLDLFADEEDEPN 262
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
+ E ++ ++ + + E+I + EK+ K+
Sbjct: 263 AEGVGEASDKNLSSFEKQQIEMDEQIEELEKELVAPKE 300
>gnl|CDD|173720 cd05631, STKc_GRK4, Catalytic domain of the Protein
Serine/Threonine Kinase, G protein-coupled Receptor
Kinase 4. Serine/Threonine Kinases (STKs), G
protein-coupled Receptor Kinase (GRK) subfamily, GRK4
isoform, catalytic (c) domain. STKs catalyze the
transfer of the gamma-phosphoryl group from ATP to
serine/threonine residues on protein substrates. The GRK
subfamily is part of a larger superfamily that includes
the catalytic domains of other protein STKs, protein
tyrosine kinases, RIO kinases, aminoglycoside
phosphotransferase, choline kinase, and phosphoinositide
3-kinase. GRKs phosphorylate and regulate G
protein-coupled receptors (GPCRs), the largest
superfamily of cell surface receptors which regulate
some part of nearly all physiological functions.
Phosphorylated GPCRs bind to arrestins, which prevents
further G protein signaling despite the presence of
activating ligand. There are seven types of GRKs, named
GRK1 to GRK7. GRK4 has a limited tissue distribution. It
is mainly found in the testis, but is also present in
the cerebellum and kidney. It is expressed as multiple
splice variants with different domain architectures. It
is post-translationally palmitoylated and localized in
the membrane. GRK4 polymorphisms are associated with
hypertension and salt sensitivity, as they cause
hyperphosphorylation, desensitization, and
internalization of the dopamine 1 (D1) receptor while
increasing the expression of the angiotensin II type 1
receptor. GRK4 plays a crucial role in the D1 receptor
regulation of sodium excretion and blood pressure.
Length = 285
Score = 29.6 bits (66), Expect = 3.2
Identities = 14/46 (30%), Positives = 30/46 (65%)
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
K++E+ K+EE + ++KED+E+ EK ++ K + +K+ ++L
Sbjct: 203 KRKERVKREEVDRRVKEDQEEYSEKFSEDAKSICRMLLTKNPKERL 248
>gnl|CDD|215412 PLN02769, PLN02769, Probable galacturonosyltransferase.
Length = 629
Score = 30.0 bits (68), Expect = 3.2
Identities = 25/104 (24%), Positives = 33/104 (31%), Gaps = 6/104 (5%)
Query: 74 EYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKI---KKKKD 130
E+S V T N V E L E S D + + +K+ K
Sbjct: 61 EFSHVGSARENGTKKTQNQVSEGVDEILKESGLTSSKPSDIVISSRSKLKKVFPDPKLNP 120
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKK---EEKEKKEEKEF 171
K K + KE K +K +E EK E EF
Sbjct: 121 LPVKPHSVPVPSSDTKNKSTAIDKENKGQKADEDENEKSCELEF 164
>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
PREFRAN. This family is the long protein of a novel
CRISPR subtype, PREFRAN, which is most common in
Prevotella and Francisella, although widely distributed.
The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
helicase Cas3 and endonuclease Cas3-HD.
Length = 1287
Score = 30.2 bits (68), Expect = 3.2
Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 17/137 (12%)
Query: 104 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI----KNKEKDKEKKEK--EKGKKEEK 157
IK+ +++DK + + +++KK DK KD I N E + E+ + K +K
Sbjct: 28 IKELGLLEQDK--ARAESYKELKKIIDKYHKDFIEETLSNVELSFDLLEEYFDLYNKLKK 85
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK--KKKE 215
KK++K+ +E++ +++ K ++ + +E KKL K L+ K
Sbjct: 86 FKKDKKDLAKEQD-TLRKQIVKSLQD------NLPEEIFKNIFSKKLLKGVLEDFIKNTH 138
Query: 216 KERSSHEKEVIPKLTFK 232
K+ + +I FK
Sbjct: 139 KDSEDEQNILIDIKEFK 155
Score = 29.0 bits (65), Expect = 5.7
Identities = 27/127 (21%), Positives = 51/127 (40%), Gaps = 20/127 (15%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKK------------------DKDKIKNK 140
++LG +++DK+ + K KK + K ++ +K K K
Sbjct: 29 KELGLLEQDKARAESYKELKKIIDKYHKDFIEETLSNVELSFDLLEEYFDLYNKLKKFKK 88
Query: 141 EKDKEKKEKEKGKKE--EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 198
+K KE++ +K+ + + E+ + F K K + I+ KDSE E+
Sbjct: 89 DKKDLAKEQDTLRKQIVKSLQDNLPEEIFKNIFSKKLLKGVLEDFIKNTHKDSEDEQNIL 148
Query: 199 DKDKKLK 205
K+ K
Sbjct: 149 IDIKEFK 155
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 28.7 bits (65), Expect = 3.2
Identities = 17/77 (22%), Positives = 36/77 (46%), Gaps = 12/77 (15%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
L E + DK++ ++ +K E+E+ +KK+ K + ++ K G++ K
Sbjct: 62 AWLKEERDDKTVSLNEAE-RKAEREEDEKKRL-----------KRENERRKALGEEPLKS 109
Query: 159 KKEEKEKKEEKEFKMKE 175
+ + KEE + + E
Sbjct: 110 LDDLPKDKEEPDAYLDE 126
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 29.9 bits (68), Expect = 3.3
Identities = 17/95 (17%), Positives = 41/95 (43%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
I VT L +++ + + K+ + K E + K + ++ K KE ++ KK+
Sbjct: 698 IEAVTGEAALEYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAA 757
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK 186
+ + + E+ + KE ++++R+
Sbjct: 758 AELADLLSNAKAEEIGGVKVLAKEVDGADMKELRE 792
>gnl|CDD|192488 pfam10228, DUF2228, Uncharacterized conserved protein (DUF2228).
This is a family of conserved proteins of approximately
700 residues found from worms to humans.
Length = 253
Score = 29.3 bits (66), Expect = 3.3
Identities = 10/48 (20%), Positives = 24/48 (50%)
Query: 168 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
K+ K K+K + ++K ++ + K + K + +K++KK+
Sbjct: 93 TKKSKEIPFKKKKISALKKIQELLTEWAKELGLSLEQKTKVMKQRKKK 140
>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain. This alpha
helical domain is found in a set of bacterial plasmid
replication proteins. The domain is found to the
C-terminus of the primase/polymerase domain. Mutants of
this domain are defective in template binding,
dinucleotide formation and conformation change prior to
DNA extension.
Length = 135
Score = 28.6 bits (64), Expect = 3.3
Identities = 26/63 (41%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 152 GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 211
GKKEE++ +E+ EK +E+ K K K VE IR E+ +K KKS ++ K+ KK L
Sbjct: 2 GKKEEEDSEEDFEKLKEEMAKYDRFKGKTVEAIR--EEVCKKIKKSLEELKEKKKAILNT 59
Query: 212 KKK 214
K
Sbjct: 60 AKG 62
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 29.6 bits (66), Expect = 3.3
Identities = 33/148 (22%), Positives = 70/148 (47%), Gaps = 2/148 (1%)
Query: 65 KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 124
KE+ P+ +E K K L K+ +E+ + + + +K K+K
Sbjct: 149 KEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQK 208
Query: 125 IKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPV--E 182
++ + ++ K K +E+ K +E+E+ +K+E+ ++ +E++E++ K + ++ + E
Sbjct: 209 QQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAE 268
Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
K +K +D E K K K LK
Sbjct: 269 KRQKVPEDGLSEDKKPFKCFTPKGSSLK 296
Score = 29.2 bits (65), Expect = 4.7
Identities = 32/153 (20%), Positives = 75/153 (49%), Gaps = 16/153 (10%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
+ +T +++EKE+ ++++++ ++ + K + K + ++E+++ E +E+++ K
Sbjct: 107 ENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGS 166
Query: 173 MKEDK---------------EKPVEKIRKEEKDSEKEK-KSKDKDKKLKKEKLKKKKKEK 216
++E+ + + + E E EK K K ++ L+ E+LKKK++E+
Sbjct: 167 LEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREER 226
Query: 217 ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKI 249
+ E+E K EE+ KR +I
Sbjct: 227 RKVLEEEEQRRKQEEADRKSREEEEKRRLKEEI 259
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 29.5 bits (66), Expect = 3.4
Identities = 12/85 (14%), Positives = 29/85 (34%), Gaps = 2/85 (2%)
Query: 126 KKKKDKKDKDKIKNKEKDKEKKEKEK--GKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
+ D I ++ + E E + +K + +K+ + K D + + K
Sbjct: 184 PITTRRIGVDGISLSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTK 243
Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEK 208
+ K +E K + ++
Sbjct: 244 EELDAKKAELSKALAALEAANAADE 268
>gnl|CDD|225102 COG2191, COG2191, Formylmethanofuran dehydrogenase subunit E
[Energy production and conversion].
Length = 206
Score = 29.3 bits (66), Expect = 3.4
Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 4/79 (5%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKD----KEKKEKEKGKKEEKEKKEEKEKKEEK 169
K+ K + +K++ +D +K + K E E K+ +E + +E+
Sbjct: 93 KRGSGKAVRVYVKREILDEDPEKTELFPKVRAGYLGPSEIETRKEHWEELEWILLMPDEE 152
Query: 170 EFKMKEDKEKPVEKIRKEE 188
F++KE +P EK +K
Sbjct: 153 LFEVKEVDVEPPEKAKKFG 171
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 29.8 bits (67), Expect = 3.4
Identities = 14/39 (35%), Positives = 26/39 (66%)
Query: 146 KKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKI 184
++ +EK ++E++E+K+ + K E+E K KE+ K E I
Sbjct: 6 RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAI 44
Score = 28.3 bits (63), Expect = 9.6
Identities = 12/67 (17%), Positives = 41/67 (61%), Gaps = 3/67 (4%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
++ ++K++ ++++++++ K K ++E K K+E +++E E + + ++ I + K
Sbjct: 6 RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIE---AAQRSRRLDAIEAQIKA 62
Query: 191 SEKEKKS 197
++ ++S
Sbjct: 63 DQQMQES 69
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 29.1 bits (65), Expect = 3.4
Identities = 26/110 (23%), Positives = 48/110 (43%), Gaps = 3/110 (2%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK---EKKEEKEKKEEKEFKMK 174
K EK K+ D++ + K D+E+ ++EK + +K + + E + + K
Sbjct: 102 GKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYNKKLPNKAPIGPFIENEPKIRPK 161
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ P + + +E K K D+ KK K KK KE+ S + +
Sbjct: 162 VEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPEA 211
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 29.5 bits (67), Expect = 3.5
Identities = 14/80 (17%), Positives = 34/80 (42%), Gaps = 1/80 (1%)
Query: 151 KGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
+ +EK E+E+++ ++ K +E EKI + + E+ + + + L L
Sbjct: 2 SPDETLEEKLNEEEREKLEKKKSSLSEED-KEKIIERGLELEELQSTPEDLSCLPTLTLS 60
Query: 211 KKKKEKERSSHEKEVIPKLT 230
+ + E+E + +
Sbjct: 61 DIPRSGDEYPLEEEKVGDVP 80
>gnl|CDD|224396 COG1479, COG1479, Uncharacterized conserved protein [Function
unknown].
Length = 409
Score = 29.5 bits (66), Expect = 3.6
Identities = 27/89 (30%), Positives = 40/89 (44%), Gaps = 4/89 (4%)
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
K+ D +KNK K+EK++ K ++KKE +E K K + K + +R
Sbjct: 212 KELSKIDLLKNKLLMLIKEEKKQSKDLNQDKKELEEIKMNKGKITERFK----DFLRDYL 267
Query: 189 KDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
K + KKLKK K+K KE
Sbjct: 268 TYKISSNIKKRELKKLKKIYEDNKEKLKE 296
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 29.2 bits (66), Expect = 3.6
Identities = 12/53 (22%), Positives = 30/53 (56%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 196
K+ + E+ ++EE+EK++E+E++ ++ +++ KE + E E+
Sbjct: 114 KQLRFLEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEM 166
Score = 28.8 bits (65), Expect = 4.8
Identities = 10/50 (20%), Positives = 30/50 (60%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE 167
+ + +E+ K+ ++++++ K + ++ + K+K K+ +KE+ EE +
Sbjct: 121 QLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHRA 170
Score = 28.1 bits (63), Expect = 7.3
Identities = 15/56 (26%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
+ +++ E+ E++E+EK +EE+E+ K ++ ++K+ K K ++K EE
Sbjct: 113 RKQLRFLEQ-LEREEEEKRDEEERERLLRAAKSRSEQSRLKQ-KAKEMQKEEDEEM 166
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 27.8 bits (62), Expect = 3.6
Identities = 19/67 (28%), Positives = 37/67 (55%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
K +KK +K DK I + + +K++K+K + + + + +K ++K+ K E E P E
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60
Query: 184 IRKEEKD 190
R+ +D
Sbjct: 61 KRRRTED 67
>gnl|CDD|224427 COG1510, COG1510, Predicted transcriptional regulators
[Transcription].
Length = 177
Score = 28.9 bits (65), Expect = 3.7
Identities = 17/77 (22%), Positives = 31/77 (40%)
Query: 105 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K + I K+ KK +E + D+ ++IK + E+ K E + +
Sbjct: 101 KWKREIDPTKEALKKLLEELNEDLDDRDLTERIKEIKSKLERLLKWSEDYYELLTRLLEF 160
Query: 165 KKEEKEFKMKEDKEKPV 181
+ E+ FK E +K
Sbjct: 161 LESEEVFKYLEKPDKIS 177
>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
2-oxoacid:ferredoxin oxidoreductases, beta subunit
[Energy production and conversion].
Length = 294
Score = 29.2 bits (66), Expect = 3.7
Identities = 16/82 (19%), Positives = 35/82 (42%), Gaps = 10/82 (12%)
Query: 119 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE-EKEKKEEKEKKE-EKEFK-MKE 175
++ K ++ + + K E+EKGK +E E E+ FK + +
Sbjct: 218 EEAKLAVETGYWPLYRYE-------PGKAEEEKGKDIPIGIFYPVEEYLEYEERFKHLTK 270
Query: 176 DKEKPVEKIRKEEKDSEKEKKS 197
+ ++K++K+ + +E K
Sbjct: 271 SNPELIQKLQKDVDERWEELKR 292
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 29.6 bits (67), Expect = 3.7
Identities = 28/130 (21%), Positives = 57/130 (43%), Gaps = 8/130 (6%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD--KDKIKNKEKD-KEKKEKEKGKKE 155
EDL +I + K K ++ ++I+ K D + +IK K+ +E+++K
Sbjct: 157 EDLLDISVLSEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIA 216
Query: 156 EKEKKEEKEKKEEKEFKMKED--KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
K+ K ++ +E K K + + ++ + + E S K K+K K ++
Sbjct: 217 RKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKS---KIEQ 273
Query: 214 KEKERSSHEK 223
+K +EK
Sbjct: 274 FQKVIKMYEK 283
>gnl|CDD|220178 pfam09321, DUF1978, Domain of unknown function (DUF1978). Members
of this family are found in various hypothetical
proteins produced by the bacterium Chlamydia pneumoniae.
Their exact function has not, as yet, been identified.
Length = 241
Score = 29.4 bits (66), Expect = 3.7
Identities = 18/114 (15%), Positives = 54/114 (47%)
Query: 107 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK 166
+S K++KK + ++E ++ +D DK+ + +E+ + + E +++ E++E
Sbjct: 86 KQSSKENKKVLQDAQREFERQSQDFYDKEIEEVEERLEALQALYPEVSEREDETEKQETV 145
Query: 167 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
++ + + + ++ E K +D ++ +E+ ++K+ +E
Sbjct: 146 SSYARDLETALDHIEVEYVQCVREQEDYWKEEDVKEQELREEGGERKRLEEIQE 199
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 28.8 bits (65), Expect = 3.8
Identities = 16/84 (19%), Positives = 43/84 (51%), Gaps = 4/84 (4%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE----KEKIKKKKDKKDKDKIKNKEKDKEK 146
++ +E K K + K + + K K ++ +D+ +++I+ E+ KE+
Sbjct: 95 KLIIKVPNEIEANFLKKKKLDKLLEEYIKFGFGKLKIDVEVDEDESSEEEIEEFEEQKEE 154
Query: 147 KEKEKGKKEEKEKKEEKEKKEEKE 170
+E + ++ + K+++ +K++KE
Sbjct: 155 EEAKLAEEALEALKKKEAEKKKKE 178
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 29.9 bits (67), Expect = 3.8
Identities = 28/177 (15%), Positives = 66/177 (37%), Gaps = 30/177 (16%)
Query: 57 SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 116
S +++ + +T + + D+ L PM ++ E + + +
Sbjct: 611 SNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESESDGSFI-----EVDS 665
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
+ ++ K D + E+ + K E E ++E E E
Sbjct: 666 VSSTLELQVPSKSQPTD---------ESEENAENKVASIEGEHRKEIEDLL-----FDES 711
Query: 177 KEKPVEKIRKEEKD-----------SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
+E + + +EEKD S +E ++ + + ++ LK +K++++R + E
Sbjct: 712 EEDNIVGMIEEEKDADDFKNEWQDISLEELEALEANLLAEQNSLKAQKQQQKRIAAE 768
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 28.2 bits (63), Expect = 4.0
Identities = 18/71 (25%), Positives = 37/71 (52%)
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+E K E +E +E KEE E +KE +E+ E I + E+++EK + ++ + + E+
Sbjct: 10 REIKKAEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEE 69
Query: 209 LKKKKKEKERS 219
++ +
Sbjct: 70 EAEEILAEAEK 80
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 29.8 bits (67), Expect = 4.0
Identities = 19/103 (18%), Positives = 47/103 (45%), Gaps = 7/103 (6%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEK--E 170
+ K+ + K+ D+K + E + E++ +++ +E+E + E +EEK
Sbjct: 376 SSQRELKQTEAAYCKRLDEK-----RLFEDEAEEEARQRLADDEEEVRAGDEAREEKIAA 430
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
DKE+ + + +K++ ++K +++K + K
Sbjct: 431 NSQVIDKEEVCNLYDRRDTAWQKQRFLREKQTAFERQKTEHTK 473
Score = 28.3 bits (63), Expect = 9.5
Identities = 20/100 (20%), Positives = 40/100 (40%), Gaps = 6/100 (6%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
E + K+ D K +N + ++ ++ K+E+ + +E EK+ K+ E + E
Sbjct: 164 EVLNKEADSLYKPSGRNPQINQLLEKL----KQERNEIDEAEKEYATYHKLLESRRA--E 217
Query: 183 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
R E SE +L + +E ++ E
Sbjct: 218 HARLAELRSELRADRDHIRALRDAVELWPRLQEWKQLEQE 257
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 29.3 bits (65), Expect = 4.0
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 122 KEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
+ K+K+ KD+D + + D E ++K+KG KEE +K+ E
Sbjct: 29 ESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETE 69
>gnl|CDD|239500 cd03406, Band_7_3, A subgroup of the band 7 domain of flotillin
(reggie) like proteins. This subgroup contains proteins
similar to stomatin, prohibitin, flotillin, HlfK/C and
podicin. Many of these band 7 domain-containing
proteins are lipid raft-associated. Individual proteins
of this band 7 domain family may cluster to form
membrane microdomains which may in turn recruit
multiprotein complexes. Microdomains formed from
flotillin proteins may in addition be dynamic units with
their own regulatory functions. Flotillins have been
implicated in signal transduction, vesicle trafficking,
cytoskeleton rearrangement and are known to interact
with a variety of proteins. Stomatin interacts with and
regulates members of the degenerin/epithelia Na+ channel
family in mechanosensory cells of Caenorhabditis elegans
and vertebrate neurons and participates in trafficking
of Glut1 glucose transporters. Prohibitin may act as a
chaperone for the stabilization of mitochondrial
proteins. Prokaryotic HflK/C plays a role in the
decision between lysogenic and lytic cycle growth during
lambda phage infection. Flotillins have been implicated
in the progression of prion disease, in the pathogenesis
of neurodegenerative diseases such as Parkinson's and
Alzheimer's disease and, in cancer invasion and
metastasis. Mutations in the podicin gene give rise to
autosomal recessive steroid resistant nephritic
syndrome.
Length = 280
Score = 29.4 bits (66), Expect = 4.2
Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 4/61 (6%)
Query: 142 KDKEKKEKEKGK---KEEKEKKEEKEKKEEKEFKMKE-DKEKPVEKIRKEEKDSEKEKKS 197
++ E E EK K +K+K EKE + E++ + E +K V KI +K EKE +
Sbjct: 163 RNYELMEAEKTKLLIAIQKQKVVEKEAETERKKAVIEAEKVAQVAKILFGQKVMEKETEK 222
Query: 198 K 198
+
Sbjct: 223 R 223
>gnl|CDD|225352 COG2747, FlgM, Negative regulator of flagellin synthesis
(anti-sigma28 factor) [Transcription / Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 93
Score = 27.7 bits (62), Expect = 4.3
Identities = 15/76 (19%), Positives = 28/76 (36%), Gaps = 2/76 (2%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
KT KK ++ + + + E++ +E K+ E E+K+
Sbjct: 19 KTKDAAVTTTGAAKKTDSSNVEVSLSAQQAKLMQPGSSDIREEKVEELKQAIENGEYKV- 77
Query: 175 EDKEKPVEKIRKEEKD 190
D EK +K+ K
Sbjct: 78 -DTEKIADKLLDFAKQ 92
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 29.4 bits (66), Expect = 4.3
Identities = 20/90 (22%), Positives = 41/90 (45%), Gaps = 1/90 (1%)
Query: 124 KIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEK 183
++ KKK +K + + K + K ++ E + EE K +EK +++
Sbjct: 282 RVFKKKGQK-RTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDE 340
Query: 184 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
++E D E +++ + K K KK + +K
Sbjct: 341 DDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 29.2 bits (65), Expect = 4.3
Identities = 28/128 (21%), Positives = 55/128 (42%), Gaps = 4/128 (3%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE 158
+ E +K ++D + EK ++ + D D+ + D+ + EE+E
Sbjct: 125 KSDVEEPSEKVDEEDVEREILAEKPRVTRFNIVWDNDE----DNDEAPPAQPDVDNEEEE 180
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ EE + +EE+E + E++ E+ + S D + + E + KK+ E
Sbjct: 181 RLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEE 240
Query: 219 SSHEKEVI 226
E+EVI
Sbjct: 241 EEMEEEVI 248
>gnl|CDD|224107 COG1186, PrfB, Protein chain release factor B [Translation,
ribosomal structure and biogenesis].
Length = 239
Score = 29.1 bits (66), Expect = 4.3
Identities = 18/88 (20%), Positives = 35/88 (39%), Gaps = 4/88 (4%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
E+ + K + K K +EK+ +EK +E KK + + + + D +P
Sbjct: 150 ERSQHLNKALARKMLKGKLYILAQEKRSQEK-NRERALKKLIGWGNQIRSYVL--DPYQP 206
Query: 181 VEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+ +R + + K D DK +
Sbjct: 207 TKDLR-TGVERRNKSKVLDGDKDGFIKA 233
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 28.0 bits (63), Expect = 4.3
Identities = 23/60 (38%), Positives = 40/60 (66%), Gaps = 8/60 (13%)
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKE-----KKEEKEKKEEKEFKMK--EDKEKPVEKIRKE 187
+ K E++K++KEK+K KK+E E + EK+K+E+ + K EDKE+ VEK++++
Sbjct: 69 EAGKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKKEEQADLLKKFEEDKER-VEKMKEK 127
>gnl|CDD|150392 pfam09710, Trep_dent_lipo, Treponema clustered lipoprotein
(Trep_dent_lipo). This entry represents a family of six
predicted lipoproteins from a region of about 20
tandemly arranged genes in the Treponema denticola
genome. Two other neighboring genes share the
lipoprotein signal peptide region but do not show more
extensive homology. The function of this locus is
unknown.
Length = 394
Score = 29.5 bits (66), Expect = 4.3
Identities = 23/128 (17%), Positives = 62/128 (48%), Gaps = 4/128 (3%)
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
S K+ K ++ + K + +++ +K + E + + KE +EKK +E ++
Sbjct: 17 SCSKEVKEQREMRIKVESSMKIEPKENEFLSKPEYDEHVKTPEQIKELEEKKAYEESLKQ 76
Query: 169 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK---KLKKEKLKKKKKEKERSSHEKEV 225
+F++ + + + + K + + ++ K+ K+KK+ K+K + + K++
Sbjct: 77 LQFEL-DKYDLVLIQAYKTPTNIGIDNLAQQKNLEFIKVKKDGEFIYCKQKAPTQNGKDL 135
Query: 226 IPKLTFKF 233
++T+K+
Sbjct: 136 DYQVTYKW 143
>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
module that binds membranes and detects membrane
curvature. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions including organelle
biogenesis, membrane trafficking or remodeling, and cell
division and migration. Mutations in BAR containing
proteins have been linked to diseases and their
inactivation in cells leads to altered membrane
dynamics. A BAR domain with an additional N-terminal
amphipathic helix (an N-BAR) can drive membrane
curvature. These N-BAR domains are found in amphiphysins
and endophilins, among others. BAR domains are also
frequently found alongside domains that determine lipid
specificity, such as the Pleckstrin Homology (PH) and
Phox Homology (PX) domains which are present in beta
centaurins (ACAPs and ASAPs) and sorting nexins,
respectively. A FES-CIP4 Homology (FCH) domain together
with a coiled coil region is called the F-BAR domain and
is present in Pombe/Cdc15 homology (PCH) family
proteins, which include Fes/Fes tyrosine kinases, PACSIN
or syndapin, CIP4-like proteins, and srGAPs, among
others. The Inverse (I)-BAR or IRSp53/MIM homology
Domain (IMD) is found in multi-domain proteins, such as
IRSp53 and MIM, that act as scaffolding proteins and
transducers of a variety of signaling pathways that link
membrane dynamics and the underlying actin cytoskeleton.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The I-BAR domain
induces membrane protrusions in the opposite direction
compared to classical BAR and F-BAR domains, which
produce membrane invaginations. BAR domains that also
serve as protein interaction domains include those of
arfaptin and OPHN1-like proteins, among others, which
bind to Rac and Rho GAP domains, respectively.
Length = 194
Score = 28.9 bits (65), Expect = 4.4
Identities = 22/117 (18%), Positives = 62/117 (52%), Gaps = 6/117 (5%)
Query: 109 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIK---NKEKDKEKKEKEKGKKEEKEKKEEKEK 165
+++K K K E E+ + + ++K ++K+ + K+ KE +K +K+ + + + +
Sbjct: 55 ALEKFGKIQK--ELEEFRDQLEQKLENKVIEPLKEYLKKDLKEIKKRRKKLDKARLDYDA 112
Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
EK K+++ K+ K+ + E++ ++ K+ ++ ++ E L K +++++
Sbjct: 113 AREKLKKLRKKKKDS-SKLAEAEEELQEAKEKYEELREELIEDLNKLEEKRKELFLS 168
>gnl|CDD|225405 COG2849, COG2849, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 230
Score = 28.9 bits (65), Expect = 4.5
Identities = 25/123 (20%), Positives = 43/123 (34%), Gaps = 4/123 (3%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
VT + L ++K+ +K K+K ++ K K D E+ K K+E
Sbjct: 15 GVTITYYLNGAIEEKANYVNKLEGKEKLYDEDGKLLSKLD---FSAGEEVKSKEEYYDNG 71
Query: 154 KEEKEKKEEKEKKEEKEFKM-KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+ E E K+ + E+ E K K + K+ + K + K
Sbjct: 72 TLKGEYLEGKKTPFTGIVEKYYENGELEAAIYYKNGKLDGETKEFYENGKLKSEYNYKNG 131
Query: 213 KKE 215
K
Sbjct: 132 KLH 134
Score = 28.1 bits (63), Expect = 7.8
Identities = 23/119 (19%), Positives = 37/119 (31%), Gaps = 6/119 (5%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
++ K+ D T I++K NK + KEK E GK K
Sbjct: 2 GDIVETSKNGVDDGVTITYYLNGAIEEKA------NYVNKLEGKEKLYDEDGKLLSKLDF 55
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
E+ + KE K K+ + +K + + K K + E
Sbjct: 56 SAGEEVKSKEEYYDNGTLKGEYLEGKKTPFTGIVEKYYENGELEAAIYYKNGKLDGETK 114
>gnl|CDD|218754 pfam05795, Plasmodium_Vir, Plasmodium vivax Vir protein. This
family consists of several Vir proteins specific to
Plasmodium vivax. The vir genes are present at about
600-1,000 copies per haploid genome and encode proteins
that are immunovariant in natural infections, indicating
that they may have a functional role in establishing
chronic infection through antigenic variation.
Length = 348
Score = 29.1 bits (65), Expect = 4.5
Identities = 30/142 (21%), Positives = 52/142 (36%), Gaps = 10/142 (7%)
Query: 91 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKI---------KKKKDKKDKDKIKNKE 141
+I + ++ + K K + +K K K KK K K+K NK
Sbjct: 96 DIWNKINKKNSKKKNKCKPDFNYIDILDEWKKRKDLYDYFENYDKIKKKIKSKNKNCNKY 155
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK-EEKDSEKEKKSKDK 200
K K KK +++ + E+ + K P + K K+ K++KSK+
Sbjct: 156 CKYLKYIKSLYKKYKEKCCSSNDNNYCPEYFKCDKKYNPKNLLSKLSCKNESKKEKSKEG 215
Query: 201 DKKLKKEKLKKKKKEKERSSHE 222
EKL+ ++E
Sbjct: 216 GAAEGDEKLELLEEESSELETS 237
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 29.5 bits (66), Expect = 4.6
Identities = 15/65 (23%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
D +K E ++ +G++E+++ EE + +E E + ED E E ++ ++ E+E
Sbjct: 630 DVLKADENKSRHQQLFEGEEEDEDDLEETDD-DEDECEAIEDSESESESDGEDGEEDEQE 688
Query: 195 KKSKD 199
++
Sbjct: 689 DDAEA 693
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 28.2 bits (63), Expect = 4.7
Identities = 18/80 (22%), Positives = 31/80 (38%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
++KE EK + E +E +E+K K + E EE + + S
Sbjct: 41 AQEKEHEKERAEREEAREKKAKISPNALRGGATAGHGEEDTDDEEDEEDFATPSAVPQWG 100
Query: 204 LKKEKLKKKKKEKERSSHEK 223
K K ++K K + E+
Sbjct: 101 KKARKRQRKVIRKLLEAEEQ 120
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 29.0 bits (65), Expect = 4.7
Identities = 12/44 (27%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
K + + K+K + E+ ++K +K+ +E +E+ +K EK + K
Sbjct: 11 KLAEKQAKSKLQQLERA-RDKQEKKAEEYQEQIKKAIEKAEEAK 53
>gnl|CDD|130658 TIGR01597, PYST-B, Plasmodium yoelii subtelomeric family PYST-B.
This model represents a paralogous family of Plasmodium
yoelii genes preferentially located in the subtelomeric
regions of the chromosomes. There are no obvious
homologs to these genes in any other organism.
Length = 255
Score = 29.1 bits (65), Expect = 4.8
Identities = 30/109 (27%), Positives = 44/109 (40%), Gaps = 18/109 (16%)
Query: 116 THKKKEKEKIK----KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE------------- 158
+H KK KE+ DKK K I +K+ E+ +KE + E
Sbjct: 90 SHIKKHKERNTLPDLNNVDKKTKKLINKLQKELEELKKELDNEMNDELTIQPIHDKIIIK 149
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLKK 206
K E E ++FK E+++ +E + S K K K KKL K
Sbjct: 150 KDENNSVSEHEDFKQLENEKNSSVSEHEEFDIASSDNLKIKRKLKKLVK 198
>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
mechanisms].
Length = 733
Score = 29.3 bits (66), Expect = 4.8
Identities = 25/157 (15%), Positives = 50/157 (31%), Gaps = 10/157 (6%)
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
H KKE KI D + K EK + KK + ++ E E
Sbjct: 535 HGKKENGKIYVYND------EERGPYLKYSYEKLE-KKLKSLEELPLEIINGDSLLSDEV 587
Query: 177 KEKPVEKIRKEEKDSEKE---KKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 233
EK + I + + E + E + + E I + +
Sbjct: 588 DEKINQDILRIQDRILSEAIELARLLDLLHKSSPDVLNSIFEYLKFLTDAEEIRDVPLEE 647
Query: 234 GTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDT 270
+++E+ + + +I+ + S P ++ +
Sbjct: 648 KEELDERIESLYNLEILKYLEEELSEGPTRLGEKENE 684
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 26.6 bits (59), Expect = 4.8
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 133 DKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
D+ K E DK+ +EKEK + EKE+ E ++K+EE+E
Sbjct: 4 DERAEKRAEFDKKLEEKEKALEAEKEEAEARQKEEEEE 41
>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA. FadA (Fusobacterium
adhesin A) is an adhesin which forms two alpha helices.
Length = 126
Score = 28.0 bits (62), Expect = 4.9
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 7/82 (8%)
Query: 142 KDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 201
++EK+EKE +KE +E KE + +EE K+KE+ + K E +K K D
Sbjct: 47 FEEEKQEKETAEKEVQELKERQLGREELYAKLKEESKTRFHK-------DEYKKLLKRYD 99
Query: 202 KKLKKEKLKKKKKEKERSSHEK 223
+ L K + K KE++ S EK
Sbjct: 100 EYLNKLEQKITDKEQQISEFEK 121
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 28.4 bits (63), Expect = 4.9
Identities = 15/37 (40%), Positives = 16/37 (43%)
Query: 8 PFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPS 44
P P GPG+ P PP P IP P PPP
Sbjct: 112 PPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPP 148
>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
TRaffic INducer (NOSTRIN). F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Nitric Oxide Synthase TRaffic
INducer (NOSTRIN) is expressed in endothelial and
epithelial cells and is involved in the regulation,
trafficking and targeting of endothelial NOS (eNOS).
NOSTRIN facilitates the endocytosis of eNOS by
coordinating the functions of dynamin and the
Wiskott-Aldrich syndrome protein (WASP). Increased
expression of NOSTRIN may be correlated to preeclampsia.
NOSTRIN contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
The F-BAR domain of NOSTRIN is necessary and sufficient
for its membrane association and is responsible for its
subcellular localization.
Length = 239
Score = 28.9 bits (65), Expect = 4.9
Identities = 27/75 (36%), Positives = 37/75 (49%), Gaps = 7/75 (9%)
Query: 105 KKDKSIKKDKKTHKK-KEKEK----IKKKKDKKDKDKIKNK-EKDKEKKEKEKGKKEEKE 158
+ + IK KK H +E EK ++ K K K+ NK +K E ++KE K E K
Sbjct: 121 WRSEQIKVKKKLHGLARENEKLQDQVEDNKQSCTKQKMLNKLKKSAEVQDKEDEKLEAKR 180
Query: 159 KK-EEKEKKEEKEFK 172
KK EE K E E+
Sbjct: 181 KKGEESRLKAENEYY 195
>gnl|CDD|205207 pfam13026, DUF3887, Protein of unknown function (DUF3887). This
family of lipoproteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are typically between 128 and 167 amino
acids in length. The proteins in this family contain an
N-terminal lipid attachment site.
Length = 101
Score = 27.6 bits (61), Expect = 5.0
Identities = 19/88 (21%), Positives = 38/88 (43%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
K+ K+ ++ ++ EK ++ K +EK K + KE K ++K+ +E
Sbjct: 1 KETKENVEQSAEEIVTMVNEKVYRDVHNKFDEKMNKALNWEEAKEKLGKTIDKVGFKEDF 60
Query: 191 SEKEKKSKDKDKKLKKEKLKKKKKEKER 218
+ EK + ++ K KE+E
Sbjct: 61 EKIEKSMLEDIDHIRTVIRVAKYKEQEH 88
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 29.1 bits (65), Expect = 5.0
Identities = 10/122 (8%), Positives = 43/122 (35%), Gaps = 13/122 (10%)
Query: 118 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDK 177
+ +E ++ ++K +D+D+ + + + + + + ++ + + + D
Sbjct: 1 RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS 60
Query: 178 EKPV-----------EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 226
P ++ R+ + + + + + + +K+ ++ S
Sbjct: 61 RSPRSLRYSSVRRSRDRPRRRSRSVRSI--EQHRRRLRDRSPSNQWRKDDKKRSLWDIKP 118
Query: 227 PK 228
P
Sbjct: 119 PG 120
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 29.0 bits (65), Expect = 5.1
Identities = 30/152 (19%), Positives = 66/152 (43%), Gaps = 17/152 (11%)
Query: 80 DVELG--VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI 137
V L ++ +I++ SSE S+KK ++ K E ++ + ++ +K
Sbjct: 274 GVALSEEALDLLQSILEKLSSELDEA-----SLKKLQEAIDKSESSELDEDLLLEEVEKS 328
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKE-EKEFKMKEDKEKPVEKIRKE--EKDSEKE 194
K K + + + +E + K + K E ++E I KE E +++
Sbjct: 329 KQKF-EPLLAQYGESFQEWRSKLQALAKVESRSLLSVEERLPT----IEKEDLELYEQRQ 383
Query: 195 KKSKDKDKKL--KKEKLKKKKKEKERSSHEKE 224
+ +++KL +K+KL+++ E+ + H
Sbjct: 384 QLWFFENRKLWQRKKKLREQADEEYQQRHATR 415
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 28.1 bits (63), Expect = 5.1
Identities = 31/111 (27%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
Query: 117 HKKKEKEKIKKKKDKKDKD---KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
+K+ EK+++++ ++ + +I+ ++K + K KE K E+KE K+E EK F+
Sbjct: 34 QQKRTIEKLEQRQTQELRRLPKRIRAEQKTRLKMFKESLKIEKKELKQEVEKL--PRFQE 91
Query: 174 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+E K EK +E+K ++E++ K ++ +E L++ + EK E E
Sbjct: 92 QEKKRMKAEKEEQEQKHQKQEREFLAKQEENLEEALQQLQNEKRHELAETE 142
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 29.1 bits (65), Expect = 5.1
Identities = 25/147 (17%), Positives = 43/147 (29%), Gaps = 17/147 (11%)
Query: 26 QHPLIPQPLVHPPGTPPPSK-MKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELG 84
Q P+ + P TP K +P + + K EL
Sbjct: 269 QTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPAMYSSDYQAPKPEPIYTWEELL 328
Query: 85 VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 144
++ ++S D D + K K+++ + +
Sbjct: 329 RERFPSDLFAISSLPD-----------SDSEASDSGPTRKRKRRRVPPLPEY---SSDED 374
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEF 171
E E EKE+K +E E+K F
Sbjct: 375 EDDSDEDEVDYEKERKRRRE--EDKNF 399
>gnl|CDD|232987 TIGR00467, lysS_arch, lysyl-tRNA synthetase, archaeal and
spirochete. This model represents the lysyl-tRNA
synthetases that are class I amino-acyl tRNA
synthetases. It includes archaeal and spirochete
examples of the enzyme. All other known examples are
class IIc amino-acyl tRNA synthetases and seem to form a
separate orthologous set [Protein synthesis, tRNA
aminoacylation].
Length = 515
Score = 29.1 bits (65), Expect = 5.2
Identities = 25/136 (18%), Positives = 52/136 (38%), Gaps = 7/136 (5%)
Query: 60 EKMSSKEKTSPKEKEYSKVKDVELG---VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 116
E+ K +EK+ + + EL + +P V + +I ++ I+K +
Sbjct: 327 ERFYYGVKDKDEEKKRAFKRIYELSQPMPSERIPYQVPFRHLSVISQIFENNDIEKILE- 385
Query: 117 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKED 176
K+ + + +KDK ++ K E K + K + E EE K+
Sbjct: 386 ILKRVQYTVDDQKDKLINKRLNCARNWIRKYAPEDFKFSLRSKFDNMEILEENS---KKA 442
Query: 177 KEKPVEKIRKEEKDSE 192
+ E ++K + +
Sbjct: 443 INELAEFLKKNFEVAT 458
>gnl|CDD|233582 TIGR01812, sdhA_frdA_Gneg, succinate dehydrogenase or fumarate
reductase, flavoprotein
subunitGram-negative/mitochondrial subgroup. This model
represents the succinate dehydrogenase flavoprotein
subunit as found in Gram-negative bacteria,
mitochondria, and some Archaea. Mitochondrial forms
interact with ubiquinone and are designated EC 1.3.5.1,
but can be degraded to 1.3.99.1. Some isozymes in E.
coli and other species run primarily in the opposite
direction and are designated fumarate reductase [Energy
metabolism, Aerobic, Energy metabolism, Anaerobic,
Energy metabolism, TCA cycle].
Length = 541
Score = 29.2 bits (66), Expect = 5.2
Identities = 14/77 (18%), Positives = 37/77 (48%)
Query: 145 EKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 204
E K + E++ K ++ + ++ + + V KIR+E D+ + + ++L
Sbjct: 375 EYAAKTGNPAADIEEEAVKAEEALIDLLVERNGGERVAKIREELGDTMDDNVGIFRTEEL 434
Query: 205 KKEKLKKKKKEKERSSH 221
K+ + + ++ +ER +
Sbjct: 435 LKKAVDEIEELRERYKN 451
>gnl|CDD|184535 PRK14144, PRK14144, heat shock protein GrpE; Provisional.
Length = 199
Score = 28.5 bits (63), Expect = 5.3
Identities = 13/37 (35%), Positives = 24/37 (64%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE 164
K++KKD K K++ K++ K E E ++E E+ + +E
Sbjct: 3 KQNKKDWKKFKDEHKEEHKVENEILEEETDEESQHQE 39
>gnl|CDD|130214 TIGR01144, ATP_synt_b, ATP synthase, F0 subunit b. This model
describes the F1/F0 ATP synthase b subunit in bacteria
only. Scoring just below the trusted cutoff are the
N-terminal domains of Mycobacterial b/delta fusion
proteins and a subunit from an archaeon, Methanosarcina
barkeri, in which the ATP synthase homolog differs in
architecture and is not experimentally confirmed. This
model helps resolve b from the related b' subunit.
Within the family is an example from a
sodium-translocating rather than proton-translocating
ATP synthase [Energy metabolism, ATP-proton motive force
interconversion].
Length = 147
Score = 28.1 bits (63), Expect = 5.3
Identities = 18/96 (18%), Positives = 45/96 (46%), Gaps = 6/96 (6%)
Query: 131 KKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 190
+ + KI + E+ +KE ++K + KE K+E + ++ ++ E +
Sbjct: 25 ETRQKKIADGLASAERAKKEAALAQKKAQVILKEAKDEAQEIIENANKR-----GSEILE 79
Query: 191 SEKEKKSKDKDKKLKKEKLK-KKKKEKERSSHEKEV 225
K + ++++K + + + + +KE+ R K+V
Sbjct: 80 EAKAEAREEREKIKAQARAEIEAEKEQAREELRKQV 115
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 29.2 bits (65), Expect = 5.3
Identities = 14/97 (14%), Positives = 37/97 (38%), Gaps = 5/97 (5%)
Query: 88 MVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 147
+ P + ++E + + +E + + + +I + KD+++K
Sbjct: 60 LRPRVSSSWNNESYNRLPIKTKDNVVADVNNGEEF---LSESESEASLEIDSDIKDEKQK 116
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDK--EKPVE 182
E+ K + +++ E++ K E+P E
Sbjct: 117 SLEEQKIAPEIPVKQQIDSEKERIASICTKIIEEPEE 153
Score = 28.4 bits (63), Expect = 8.5
Identities = 14/64 (21%), Positives = 33/64 (51%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
E+ +K + +K K ++KD K+ K E++ + + E ++ ++ + ++EK
Sbjct: 343 EEKARKYWEANKPVSKREKKDIFHLSKKLRKIEKERLRIQSEMRDAEDIEEAIEEEKNQS 402
Query: 183 KIRK 186
+I K
Sbjct: 403 EILK 406
>gnl|CDD|234487 TIGR04157, glyco_rSAM_CFB, glycosyltransferase, GG-Bacteroidales
peptide system. Members of this protein family are
predicted glycosyltransferases that occur in conserved
gene neighborhoods in various members of the
Bacteroidales. These neighborhoods feature a radical SAM
enzyme predicted to act in peptide modification (family
TIGR04148), peptides from family TIGR04149 with a
characteristic GG cleavage motif, and several other
proteins.
Length = 406
Score = 29.2 bits (66), Expect = 5.4
Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 10/76 (13%)
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDK----EKPVEKIRKEEKDSEKEKKS------K 198
KE+ K+ E EK ++E + EKE D+ + + + E+ KEK
Sbjct: 144 KEETKRSELEKLIKEEFEREKEMFQLADRVICLSQYTQNLLYEDYGIPKEKIVVIYNGLT 203
Query: 199 DKDKKLKKEKLKKKKK 214
D+ L + + KK +K
Sbjct: 204 DEAIYLSEVERKKLRK 219
>gnl|CDD|227463 COG5134, COG5134, Uncharacterized conserved protein [Function
unknown].
Length = 272
Score = 28.9 bits (64), Expect = 5.4
Identities = 25/156 (16%), Positives = 55/156 (35%), Gaps = 11/156 (7%)
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDK-----DKIKNKEKDKEKKEKEKGKKEEKEKK 160
K ++++K ++ + EK ++ + + I K ++ K+ +
Sbjct: 121 KAENVEKVPESDAIEALEKQLTQQKSEKHNSSAINFIDELNKRLWSDPFVSSQRLRKQFR 180
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
E K+ ++++E K K + I S+K+K D +L K K +K + S
Sbjct: 181 ERKKIEKKQEAKDLSLKNRAALDIDILPSSSDKDKALLLLDNELGKNKFIRKLDYRRTPS 240
Query: 221 HEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 256
+ F E ++ P +
Sbjct: 241 S------RTFSTFAKFAETSFAKKDPFARKFVPSEK 270
>gnl|CDD|234961 PRK01581, speE, spermidine synthase; Validated.
Length = 374
Score = 29.2 bits (65), Expect = 5.4
Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 11/67 (16%)
Query: 117 HKKKEKEKIKK----KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
H+K+ K KI + KKDK+ E D K E E+ E K+ K+ K+ K+++ +
Sbjct: 4 HRKQSKIKIYRITSYKKDKR-------SELDSNKFELEQQLVENKQDKQGKQYKQDRGIQ 56
Query: 173 MKEDKEK 179
E K+
Sbjct: 57 YAETKQD 63
>gnl|CDD|241381 cd13227, PH-like_Vps36_GLUE, Vps36 GLUE (GRAM-like
ubiquitin-binding in Eap45) Pleckstrin homology-like
domain. ESCRT complexes form the main machinery driving
protein sorting from endosomes to lysosomes. Yeast/human
ESCRT-I consists of Vps23/Tsg101, Vps28/Vps28, and
Vps37/Vps37 homolog. Yeast/human ESCRT-II is composed of
Vps25/EAP20, Vps22/EAP30, and Vps36/EAP45. Yeast
ESCRT-III consists Vps2, Vps20, Vps24, and Snf7
subunits. In contrast, there are three human paralogs of
Snf7 (hSnf7-1/CHMP4A, hSnf7-2/CHMP4B, and
hSnf7-3/CHMP4C) and two paralogs of Vps2 (CHMP2A and
CHMP2B). Yeast ESCRT-I links directly to ESCRT-II,
through a tight interaction of Vps28 (ESCRT-I) with the
yeast-specific zinc-finger insertion within the GLUE
domain of Vps36. The Vps36 subunit (ESCRT-II) binds
ubiquitin using one of its two NZF zinc fingers in its
N-terminal region. Human Vps36, EAP45, also binds
ubiquitin despite having no NZF domain. Instead,
mammalian ESCRT-II interacts with Ub through the Eap45
GLUE domain itself. The yeast Vps36 GLUE has a complete
PH domain, wherease Eap45 GLUE only has a PH-like fold
since it lacks the secondary structure element
corresponding to the 4 strand. ESCRT-II also interacts
with ESCRT-III via a Vps25(EAP20)/Vps20(CHMP6)
interaction. Structure 2CAY is missing this insertion
that contains 2 NZF zinc fingers. It is a split PH
domain, with a noncanonical lipid binding pocket that
binds PI(3)P. The interactions of ESCRT-II GLUE domain
with membranes, ESCRT-I, and ubiquitin are critical for
ubiquitinated cargo progression from early to late
endosomes. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 126
Score = 27.7 bits (62), Expect = 5.5
Identities = 10/43 (23%), Positives = 13/43 (30%), Gaps = 2/43 (4%)
Query: 245 SSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV--WICPACGVQ 285
SPKI++ + TG W CP C
Sbjct: 81 RSPKIILFLKDEINSKSGLGQSSNKTGSQNVTSQWTCPICMFT 123
>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor. The ribosome recycling
factor (RRF / ribosome release factor) dissociates the
ribosome from the mRNA after termination of translation,
and is essential bacterial growth. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis.
Length = 165
Score = 28.2 bits (64), Expect = 5.5
Identities = 16/45 (35%), Positives = 28/45 (62%)
Query: 99 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 143
+ L +++KDK I +D+ +KE +K+ K KK + +K KEK+
Sbjct: 119 DKLKKLEKDKEISEDEVKRAEKEIQKLTDKYIKKIDELLKKKEKE 163
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 29.2 bits (65), Expect = 5.5
Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 5/93 (5%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK-----MK 174
+ K + K + + E+K+K + + + + K EKEFK +
Sbjct: 12 NNQNKNASVVNNKAEIAAMIDARRLEQKKKGGVTNSKGKTNKVVDAKLEKEFKDVLQRFQ 71
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 207
+ ++I K+EK+ + K K ++ E
Sbjct: 72 VQENDTPKEITKDEKNMSRMKSRKAPTIHMEAE 104
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 29.1 bits (65), Expect = 5.5
Identities = 36/207 (17%), Positives = 57/207 (27%), Gaps = 31/207 (14%)
Query: 87 PMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK 146
P V + SE KS + K + + +D ++ + + E
Sbjct: 329 PAVSPSFEREKSEKSRHESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYEPLFPEDES 388
Query: 147 KEKEKGKKEEKEKKEEKEKKEEKEFKMKED-----------------KEKPVEKIRKEEK 189
+ K EE ++ E+EK + ED + E +E
Sbjct: 389 EIAVKPPTEESSRRPEEEKHRFPSEDVWEDSPSSLQDTATVSTPSNPPPRASETPEQETS 448
Query: 190 DSEKEKKSKDKDKKLKKEKLKKKKKEK--------------ERSSHEKEVIPKLTFKFGT 235
S E +LK EK K + + E K +
Sbjct: 449 RSSSEVSLDPHQSELKSEKKKARPEVSKQRFPSRDVWEDAPESQELVTTEETPEEVKSSS 508
Query: 236 DMEEKTKRESSPKIVIKPVKSPSPPPV 262
K S PK + PPPV
Sbjct: 509 PGVTKPAIPSRPKKGKPTSEKRKPPPV 535
>gnl|CDD|225349 COG2739, COG2739, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 105
Score = 27.3 bits (61), Expect = 5.6
Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 8/70 (11%)
Query: 101 LGEIKKDKSIKKD------KKTHKKKEK--EKIKKKKDKKDKDKIKNKEKDKEKKEKEKG 152
L EI ++ ++ + K+T K E EK+K + K + ++ +K K+ +E
Sbjct: 36 LSEIAEEFNVSRQAIYDNIKRTEKILEDYEEKLKLYEKYKIRKELYDKLKELITDPEELR 95
Query: 153 KKEEKEKKEE 162
+ E E
Sbjct: 96 EILEDLDNLE 105
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 28.9 bits (65), Expect = 5.6
Identities = 18/68 (26%), Positives = 40/68 (58%), Gaps = 4/68 (5%)
Query: 164 EKKEEKEFK-MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 222
E EE+E +++ + + E+ R E + ++ ++ +++ ++EK ++KK++KER E
Sbjct: 147 EVLEEEELAELRQQQRQ-FEQRRNAEL--AETQRLEEAERRRREEKERRKKQDKERKQRE 203
Query: 223 KEVIPKLT 230
KE K+
Sbjct: 204 KETAEKIA 211
Score = 28.5 bits (64), Expect = 6.2
Identities = 13/64 (20%), Positives = 43/64 (67%), Gaps = 3/64 (4%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKP 180
E+E++ + + ++ + + + ++ E E ++ ++ E+ ++EEKE++++++ + K+ +++
Sbjct: 150 EEEELAELRQQQRQFEQR---RNAELAETQRLEEAERRRREEKERRKKQDKERKQREKET 206
Query: 181 VEKI 184
EKI
Sbjct: 207 AEKI 210
>gnl|CDD|178320 PLN02718, PLN02718, Probable galacturonosyltransferase.
Length = 603
Score = 29.1 bits (65), Expect = 5.6
Identities = 23/140 (16%), Positives = 50/140 (35%), Gaps = 21/140 (15%)
Query: 95 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKK------DKKDKDKIKNKEKDKEKKE 148
V+ + +S HK KE++ +K ++ + K +
Sbjct: 86 VSYNSSDKNNDSLESEVDGGNNHKPKEEQAQVSQKTTVSSSEEVQISARDIQLNHKTQFN 145
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKED---------------KEKPVEKIRKEEKDSEK 193
K E+ + + + +EK ++++ + V+++R K+ E+
Sbjct: 146 PPTVKHEKNTRVQPRRATDEKVKEIRDKIIQAKAYLNLAPPGSNSQLVKELRLRTKELER 205
Query: 194 EKKSKDKDKKLKKEKLKKKK 213
KDK L K L++ K
Sbjct: 206 AVGDATKDKDLSKSALQRMK 225
>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
Length = 814
Score = 29.2 bits (65), Expect = 5.6
Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 148 EKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 192
E ++GK EEKE+KEE KK+ ED EKP I + + +E
Sbjct: 30 ESDEGKSEEKEEKEEV-KKDRDNIPEIEDGEKPTSGIEQHQYQTE 73
>gnl|CDD|233935 TIGR02577, cas_TM1794_Cmr2, CRISPR-associated protein Cas10/Cmr2,
subtype III-B. This model represent a Crm2 family of
the CRISPR-associated RAMP module, a set of six genes
recurring found together in prokaryotic genomes. This
gene cluster is found only in species with CRISPR
repeats, usually near the repeats themselves. Because
most of the six (but not this family) contain RAMP
domains, and because its appearance in a genome appears
to depend on other CRISPR-associated Cas genes, the set
is designated the CRISPR RAMP module. This protein,
typified by TM1794 from Thermotoga maritima, is
designated Crm2, for CRISPR RAMP Module protein 2
[Mobile and extrachromosomal element functions, Other].
Length = 483
Score = 29.0 bits (65), Expect = 5.7
Identities = 37/178 (20%), Positives = 57/178 (32%), Gaps = 25/178 (14%)
Query: 59 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLG----EIKKDKSIKKDK 114
+E + K K E VKD+ G V E L +KK+ DK
Sbjct: 104 QETIRIKIKDELAELLKKAVKDLIEG------PAVITDLEEGLKQKEEALKKELEKYADK 157
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKK-------- 166
+ EK + KD K+ + + E+ + EK KE
Sbjct: 158 RLSAPSLLEKEEVYKDVKNGNVGTGLNNVSKYLSFEESVLKLSEKFLRKELIRIYPKLRE 217
Query: 167 ------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 218
K F + +K + I+ K K L+ + LK +E+ER
Sbjct: 218 SLDAIGLTKRFSERCEKCLELPGIKCLLCGENLAIAG-VKRKLLELKDLKALLQEEER 274
>gnl|CDD|225768 COG3227, LasB, Zinc metalloprotease (elastase) [Amino acid
transport and metabolism].
Length = 507
Score = 28.9 bits (65), Expect = 5.7
Identities = 24/127 (18%), Positives = 39/127 (30%), Gaps = 11/127 (8%)
Query: 40 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEK-EYSKVKDVELGVTPMVPNIVKVTSS 98
+K +S K + KE + E + + G T V+
Sbjct: 35 MKSQIFIKGTLSKKSAPNEKEILQFLENVNADNNLKAISTDTDPNGFT-----HVRYQQV 89
Query: 99 EDLGEIKKDKSIK--KDKKTHKKKEKEKIKKKKDKKD-KDKIKNKEK-DKEKKEKEKGKK 154
+ G K + DK K + KKKD KI ++ +K+ +
Sbjct: 90 VN-GVPVKGSEVIVHLDKNGVVKAVNGFVNKKKDLPTNTPKISAEQAIAIARKQFGYERL 148
Query: 155 EEKEKKE 161
E K E
Sbjct: 149 TESPKAE 155
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 29.1 bits (66), Expect = 5.7
Identities = 15/51 (29%), Positives = 26/51 (50%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
K +KK + + K + +++ + +KE+ EK E EK EEKE+
Sbjct: 827 KEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEE 877
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 28.2 bits (63), Expect = 5.8
Identities = 26/105 (24%), Positives = 52/105 (49%), Gaps = 9/105 (8%)
Query: 118 KKKEKEKIKKKKDKKDK-----DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK 172
+K+E E+++K++ D+ DK+ ++ K++ + K E +KE E ++
Sbjct: 44 RKQEIEELEKERIMNDRLLERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPT 103
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E E +++ KD EK ++L+K + K KK++KE
Sbjct: 104 YNEMVEDLFDQV----KDEVDEKNGAALIEELQKHRDKLKKEQKE 144
>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 199 to 238 amino acids in length. This domain is
found associated with pfam06512, pfam00520. This domain
has a conserved ADD sequence motif.
Length = 222
Score = 28.6 bits (64), Expect = 5.8
Identities = 17/65 (26%), Positives = 28/65 (43%), Gaps = 9/65 (13%)
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E K+ + K+ + I EE DSEKE KS+ D + + L +K ++
Sbjct: 9 ESAKERRNRNDKNKKEEHS---------IGSEEGDSEKEPKSESADGRKRCRFLLRKTRK 59
Query: 216 KERSS 220
+S
Sbjct: 60 VSQSL 64
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 29.1 bits (66), Expect = 5.9
Identities = 38/169 (22%), Positives = 65/169 (38%), Gaps = 33/169 (19%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKE-- 158
L ++K ++ KKKEK+K KKKK K + KI K+ +++ E K E
Sbjct: 385 LKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVLKEATRQDLELLKNVWGEIL 444
Query: 159 ---KKEEKEK---------------------KEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
K + K + E F+ ++ + I +
Sbjct: 445 ESLKAQRKSLRALLVNSEPVAASEDTVVLAFEYEIHFEKAMLDKELNDTIENILSKLLGK 504
Query: 195 K-----KSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIPKLTFKFGTD 236
+D+ +K+++E L+K K E+E E +IP+ FG D
Sbjct: 505 SVEIVAVPEDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEAKELFGED 553
Score = 28.3 bits (64), Expect = 9.3
Identities = 19/83 (22%), Positives = 45/83 (54%), Gaps = 7/83 (8%)
Query: 134 KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 193
+++ E++ ++ + + ++K+++KEKK+ K+ K K P KI K K++ +
Sbjct: 375 LQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYK----VPRGKIYKVLKEATR 430
Query: 194 E--KKSKDKDKKLKKEKLKKKKK 214
+ + K+ ++ E LK ++K
Sbjct: 431 QDLELLKNVWGEI-LESLKAQRK 452
>gnl|CDD|223173 COG0095, LplA, Lipoate-protein ligase A [Coenzyme metabolism].
Length = 248
Score = 28.5 bits (64), Expect = 5.9
Identities = 14/75 (18%), Positives = 32/75 (42%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+ KEK K K + ++ +E + ++E E +E K +E + ++ +
Sbjct: 170 RVPKEKIKSKGIKSVRERVANLEELLKISVEEFLEALLEAFFKVLGVELEEYELTPEELE 229
Query: 204 LKKEKLKKKKKEKER 218
L ++ ++K E
Sbjct: 230 LAEKLAEEKYSSWEW 244
>gnl|CDD|184538 PRK14149, PRK14149, heat shock protein GrpE; Provisional.
Length = 191
Score = 28.3 bits (63), Expect = 6.0
Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 8/80 (10%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE----EKEKKEEKEFKMKEDKEKPVE 182
K + ++ D + KE + E+K +++ +EK+E E E KE+ E K KE EK +
Sbjct: 2 KDEHNQEHDHLSQKEPESEEKACACKEQQGEEKQEASEKEGEIKEDFELKYKEMHEKYL- 60
Query: 183 KIRKEEKDSEKEKKSKDKDK 202
+ D E KK ++DK
Sbjct: 61 ---RVHADFENVKKRLERDK 77
>gnl|CDD|144261 pfam00603, Flu_PA, Influenza RNA-dependent RNA polymerase subunit
PA.
Length = 669
Score = 29.1 bits (65), Expect = 6.0
Identities = 19/105 (18%), Positives = 40/105 (38%), Gaps = 12/105 (11%)
Query: 37 PPGTPPPSKMK---DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVT---PMVP 90
P +M+ DK +P++ +MS + K K +++++ + L P P
Sbjct: 188 PLPFSTYEQMRDYCDKFTPREIEPNVSRMSGRVKLPIKHYRWTQMRPIPLLKEPEGPPSP 247
Query: 91 NIVKVTSSEDL------GEIKKDKSIKKDKKTHKKKEKEKIKKKK 129
K+ ++ L ++ I D + E E I+ +
Sbjct: 248 YRAKLLMADALYLTICDPGTDQEGIILIDSTFFSQSEPECIRPSE 292
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth
in archaea or eukaryotes, but is found in mitochondria
or chloroplasts of some eukaryotic species.
Length = 179
Score = 28.4 bits (64), Expect = 6.1
Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 114 KKTHKKKEKEKIKKKKDKKD-KDKIKNKEKDKEKKEKEKGKKEEKEKKEEKE--KKEEKE 170
K K E+ K+ + ++D DKIK EK+KE E E K EE +K E KK ++
Sbjct: 111 KDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDEYIKKIDEL 170
Query: 171 FKMKEDK 177
K KE +
Sbjct: 171 LKSKEKE 177
>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
Length = 211
Score = 28.3 bits (63), Expect = 6.1
Identities = 12/56 (21%), Positives = 21/56 (37%), Gaps = 2/56 (3%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKN--KEKDKEKKEKEKGKKEEKEKKEEKEK 165
++K ++ K K + K+K K + K EEK ++EE
Sbjct: 35 PNRKKSAPEQVPTGKNKLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVEREELGN 90
Score = 28.3 bits (63), Expect = 6.4
Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 1/54 (1%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE 168
KK E++ K+K K ++K K K K E EEK ++EE
Sbjct: 35 PNRKKSAPEQVPTGKNKLLLTK-RSKLKGIPKPRKLHKHGFWAEIFEEKVEREE 87
Score = 28.3 bits (63), Expect = 6.8
Identities = 21/95 (22%), Positives = 35/95 (36%), Gaps = 14/95 (14%)
Query: 123 EKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE 182
+KK + KNK K+ K KG + ++ + E E E
Sbjct: 34 SPNRKKSAPEQVPTGKNK-LLLTKRSKLKGIPKPRKLHKHGFWAEIFE-----------E 81
Query: 183 KIRKEEKDSEKEKKSKD--KDKKLKKEKLKKKKKE 215
K+ +EE + + KD K K+ + LK +
Sbjct: 82 KVEREELGNPCKDLDKDIIKGKRRPRLTLKNTNNQ 116
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 28.5 bits (64), Expect = 6.1
Identities = 18/84 (21%), Positives = 43/84 (51%)
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
++I+ + +++ + EK + +E ++K EE +K+EK+ +M++ K K
Sbjct: 11 EEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNEARLKVLNAR 70
Query: 195 KKSKDKDKKLKKEKLKKKKKEKER 218
++ D + KE+L ++K+
Sbjct: 71 EELLDSVFEEAKERLANLSEDKDE 94
>gnl|CDD|215936 pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain. This
family is involved in biosynthesis of guanosine
nucleotide. Members of this family contain a TIM barrel
structure. In the inosine monophosphate dehydrogenases 2
CBS domains pfam00571 are inserted in the TIM barrel.
This family is a member of the common phosphate binding
site TIM barrel family.
Length = 467
Score = 28.8 bits (65), Expect = 6.1
Identities = 25/136 (18%), Positives = 44/136 (32%), Gaps = 5/136 (3%)
Query: 74 EYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIK-----KDKKTHKKKEKEKIKKK 128
E KVK E G + T+ + E+K+ I +D K + +
Sbjct: 77 EVRKVKRFESGFITDPVTVSPDTTVAEALELKERYGISGVPVTEDGKLLGGLVGIRTSRD 136
Query: 129 KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
D K + + E EE + + K E+ + + + + E
Sbjct: 137 IDFLSKVSVVMTMTEDLVTAPEGITLEEANEILHEHKIEKLPIVDDDGELVGLITRKDIE 196
Query: 189 KDSEKEKKSKDKDKKL 204
K + SKD +L
Sbjct: 197 KARDYPNASKDAQGRL 212
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 28.1 bits (62), Expect = 6.2
Identities = 21/151 (13%), Positives = 88/151 (58%)
Query: 96 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKE 155
T +E+ + +K + ++ +++++ + ++++D+ +++++K + ++ + +E+ +++
Sbjct: 19 TDAEEATRLLAEKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQ 78
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
E+E+ EKE+K +++ + +E +E+ ++ +++K+ + + ++ ++ + + ++ E
Sbjct: 79 EEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAREEAERMRLEREKHFQQIE 138
Query: 216 KERSSHEKEVIPKLTFKFGTDMEEKTKRESS 246
+ER +K + + +++ + K+E
Sbjct: 139 QERLERKKRLEEIMKRTRKSEVSPQVKKEDP 169
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 28.2 bits (63), Expect = 6.2
Identities = 24/71 (33%), Positives = 34/71 (47%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKM 173
KKT KKK+ KKK KK K K K KE+ E +E EE E+ ++K E + +
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQN 107
Query: 174 KEDKEKPVEKI 184
+ + I
Sbjct: 108 TAAEVNHEDVI 118
Score = 28.2 bits (63), Expect = 7.4
Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 5/76 (6%)
Query: 103 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEE 162
EI K+ KK K T +KK+ K KKK K K + + +E ++ E+ K+
Sbjct: 44 EIPSKKTSKKKKTTPRKKKTTKKTKKKKK-----EKEEVPELAAEELSDSEENEENDKKV 98
Query: 163 KEKKEEKEFKMKEDKE 178
+ + + E
Sbjct: 99 DYELPKVQNTAAEVNH 114
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 28.6 bits (64), Expect = 6.2
Identities = 19/84 (22%), Positives = 41/84 (48%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
+E ++EE+ ++EE++ EEK+ + + P+++++ + K KD
Sbjct: 97 EEPDAAAAPEEEEEMEEEEEDIDEEKKEESAKKSTPPIKEMKIPAAGESRLKIRKDYVPS 156
Query: 204 LKKEKLKKKKKEKERSSHEKEVIP 227
K + +K KK+ + E+IP
Sbjct: 157 GNKPRAEKPKKKMIKCPITGELIP 180
>gnl|CDD|216016 pfam00602, Flu_PB1, Influenza RNA-dependent RNA polymerase subunit
PB1. Two GTP binding sites exist in this protein.
Length = 740
Score = 29.1 bits (65), Expect = 6.2
Identities = 16/59 (27%), Positives = 31/59 (52%)
Query: 121 EKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
+K ++K K K +K + +KE K K K+ K++++ +EE + M +D E+
Sbjct: 176 DKPEVKFTKHKNNKRRYDDKETGKLLKRKQPSKEKDRITREEYLIRALTLNTMTKDAER 234
>gnl|CDD|183377 PRK11910, PRK11910, amidase; Provisional.
Length = 615
Score = 28.8 bits (64), Expect = 6.5
Identities = 20/101 (19%), Positives = 42/101 (41%), Gaps = 10/101 (9%)
Query: 108 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKG---------KKEEKE 158
++ + K+ K+K+ KK+ K K+KE EKG + ++
Sbjct: 29 QTTTSENKSAVSKKKKPTVKKEKPKQSSNNLTLGKNKENFHLEKGFGNKQLQVERIIDRI 88
Query: 159 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
+ + + E + K K + +K + I+ + K +K +D
Sbjct: 89 FQSSLKNRTEIKVKPKNNPQKK-QNIKPVKPIPSKPEKPED 128
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 27.4 bits (61), Expect = 6.6
Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 135 DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKE---EKEFKMKEDKEKPVE--KIRKEEK 189
+K + +EK ++E K E+E+ EE+E KE E+ K ++K K + K+RK ++
Sbjct: 46 EKNEVEEKQAVEEEANKEGVVEEEEVEEEEDKEMTLEEYEKALQEKRKALPALKVRKAKE 105
Query: 190 D 190
Sbjct: 106 G 106
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 29.1 bits (65), Expect = 6.7
Identities = 31/187 (16%), Positives = 56/187 (29%), Gaps = 14/187 (7%)
Query: 89 VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKE 148
P+ V +S L + K +I + +K+ ++ D D + E E
Sbjct: 43 GPSAVDERTSGVLRDEGKHANILYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDE 102
Query: 149 KEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 208
+ G E + + E + ++ + D + D +
Sbjct: 103 DDAGNATSGESSTDDDSLLELPDRDEDADTQ-------ANNDQTNDFDQDDSSNSQTDQG 155
Query: 209 LKKKKKEKERSSH--EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 266
LK+ EK+ + TFKF + + + KS S P D
Sbjct: 156 LKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKD-----GGKSKSSDPGPLND 210
Query: 267 EYDTGDS 273
GD
Sbjct: 211 SDGQGDD 217
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 28.2 bits (62), Expect = 6.8
Identities = 20/71 (28%), Positives = 45/71 (63%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 197
KNK K+ ++++ ++ + EK KKE +E E+K + E+ E E+ ++E+++E++K
Sbjct: 7 KNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVEELEDQPEEPPEQEEENEEQKPK 66
Query: 198 KDKDKKLKKEK 208
++ D +++ K
Sbjct: 67 EEIDYPIQENK 77
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 28.9 bits (65), Expect = 6.8
Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 13/97 (13%)
Query: 138 KNKEKDKEKKEKEKGKKE--------EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRK--E 187
KN +DKEK+EK K + EK+ KE K+K E++ KE E ++K+R+ +
Sbjct: 516 KNAAEDKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEK---KEKIENLIKKLRQALQ 572
Query: 188 EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
+ E K ++ +K E K+ +
Sbjct: 573 NDNYESIKSLLEELQKALMEIGKEVYSSTSTTDPASN 609
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 28.5 bits (64), Expect = 6.9
Identities = 12/89 (13%), Positives = 37/89 (41%), Gaps = 1/89 (1%)
Query: 92 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
+ + + K + K + KK+ +I+ + + ++ + +E+ +E K
Sbjct: 43 VEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRALA 102
Query: 152 GKKEE-KEKKEEKEKKEEKEFKMKEDKEK 179
++ + + EK+ + + +D+ K
Sbjct: 103 QRRSDLSSASYQLEKRRASQLEKLQDEIK 131
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 28.7 bits (64), Expect = 7.0
Identities = 45/212 (21%), Positives = 79/212 (37%), Gaps = 16/212 (7%)
Query: 45 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKV-----TSSE 99
+M +S D K+ + + + E+ Y K+KDV + + NI + +S+
Sbjct: 139 EMDSLIS-GDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNL 197
Query: 100 DLGEIKKDKSIKKDKKTHKKKEKEK------IKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
+L IKK I D+K+H KE D + K E + K + +
Sbjct: 198 ELENIKK--QIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYE 255
Query: 154 KEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 213
E K + + + EK KE +E+ ++ I + K K + K +
Sbjct: 256 SEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILS 315
Query: 214 KEKERSSHEKEVIPKLTF--KFGTDMEEKTKR 243
+ +I KL+ K D +K R
Sbjct: 316 NIDAEINKYHAIIKKLSVLQKDYNDYIKKKSR 347
Score = 28.7 bits (64), Expect = 8.6
Identities = 20/118 (16%), Positives = 45/118 (38%), Gaps = 2/118 (1%)
Query: 104 IKKDKSIKKDKKTH--KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKE 161
IKK ++KD + KK + + + + + ++ K + +K +E + E
Sbjct: 328 IKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIE 387
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
+ K++E ++K E ++ SK + L++ E R+
Sbjct: 388 RMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRN 445
>gnl|CDD|235285 PRK04335, PRK04335, cell division protein ZipA; Provisional.
Length = 313
Score = 28.6 bits (64), Expect = 7.0
Identities = 23/97 (23%), Positives = 35/97 (36%), Gaps = 1/97 (1%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 203
KE K K K K + +++ E + E E IRKE K+ + +
Sbjct: 29 KEGKSKFGDKPLGKLDVDSDDEQPAPERGFAQAPEDDFEIIRKERKEPDFG-RENSFHDP 87
Query: 204 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEK 240
L + L + E+E E+E P EK
Sbjct: 88 LIDDPLFGGELEEEEDKFEQEEAPIPVQAQSQPQPEK 124
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 28.5 bits (63), Expect = 7.1
Identities = 11/59 (18%), Positives = 31/59 (52%)
Query: 136 KIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 194
+ + K K K++ + ++ ++ K EEK+ + ++++E+ E+ ++EE +
Sbjct: 185 SLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
>gnl|CDD|227392 COG5059, KIP1, Kinesin-like protein [Cytoskeleton].
Length = 568
Score = 28.9 bits (65), Expect = 7.1
Identities = 32/172 (18%), Positives = 57/172 (33%), Gaps = 16/172 (9%)
Query: 88 MVPNIVKVTSSEDLG----EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 143
+ N ++V SS D EIK D S + + +E+ + + +
Sbjct: 335 SIKNKIQVNSSSDSSREIEEIKFDLSEDR-SEIEILVFREQSQLSQSSLSGIFAYMQSLK 393
Query: 144 KEKK---------EKEKGKKEEKEKKEEKEKKEEK--EFKMKEDKEKPVEKIRKEEKDSE 192
KE + K + KK KE+ + + + + +R+EE +
Sbjct: 394 KETETLKSRIDLIMKSIISGTFERKKLLKEEGWKYKSTLQFLRIEIDRLLLLREEELSKK 453
Query: 193 KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 244
K K K + L E+ E E KL T + ++ R
Sbjct: 454 KTKIHKLNKLRHDLSSLLSSIPEETSDRVESEKASKLRSSASTKLNLRSSRS 505
>gnl|CDD|214378 CHL00160, rpl9, ribosomal protein L9; Provisional.
Length = 153
Score = 27.8 bits (62), Expect = 7.6
Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 3/71 (4%)
Query: 173 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER---SSHEKEVIPKL 229
+K+ K K ++ EK K K ++++K +KKK E + S EKE+ +
Sbjct: 49 LKQQKMYQKILDLKLKEAKEKCLKVKQLLEEIQKFSVKKKVGENNQIFGSVTEKEISQII 108
Query: 230 TFKFGTDMEEK 240
K D+E++
Sbjct: 109 KNKTNIDLEKQ 119
>gnl|CDD|129272 TIGR00168, infC, translation initiation factor IF-3. infC uses
abnormal initiation codons such as AUA, AUC, and CUG
which render its expression particularly sensitive to
excess of its gene product IF-3 thereby regulating its
own expression [Protein synthesis, Translation factors].
Length = 165
Score = 27.7 bits (62), Expect = 7.7
Identities = 27/93 (29%), Positives = 49/93 (52%), Gaps = 7/93 (7%)
Query: 156 EKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK-EKLKKKKK 214
E+EKK+++ KK +K ++KE K +P I + + K K + L+K +K+K +
Sbjct: 62 EQEKKKKEAKKNQKIIQVKEVKMRP--TIDEHDLQF----KLKQAIRFLEKGDKVKFTVR 115
Query: 215 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSP 247
+ R S E+ K+ +F TD+ + + E P
Sbjct: 116 FRGRESTHIELGEKVLDRFYTDVADVAEVEKPP 148
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 28.8 bits (64), Expect = 7.7
Identities = 36/161 (22%), Positives = 61/161 (37%), Gaps = 12/161 (7%)
Query: 51 SPKDKTSPKEKMSS----KEKTSPKEKEYS-KVKDVELGVTPMVPNIVKVTSSEDLGEIK 105
S KE S+ SP+ KE S K + VT T + +
Sbjct: 156 SKHSPKVIKENQSTHVNISSDNSPRNKEISNKQLKKQTNVT-------HTTCYDKMRRSP 208
Query: 106 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEK 165
++ S K+ K KEK K K K KKD+D K +++ EK + + + + + ++
Sbjct: 209 RNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKE 268
Query: 166 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 206
++ E + V RK + + + KK K
Sbjct: 269 NKKDEKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQKKAKT 309
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 28.2 bits (64), Expect = 7.8
Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 16/63 (25%)
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK---------EKERSSHEKEVIPK 228
EK +E++ EK+ KS DK+ K + E L+K K+ E + E +++
Sbjct: 137 EKRLERL-------EKKAKSGDKEAKEELELLEKIKEHLEEGKPARTLELTDEEIKILKS 189
Query: 229 LTF 231
L
Sbjct: 190 LQL 192
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 28.6 bits (65), Expect = 7.8
Identities = 10/49 (20%), Positives = 17/49 (34%), Gaps = 5/49 (10%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKK 160
+K K + K K +K +K K++ EK+ K
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVK-----FSRKTKQQYVASEKDGK 857
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 27.3 bits (61), Expect = 8.0
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 110 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEK 151
+ +++K K+ KEK K+KK KK K K K K+ K+ +K+K
Sbjct: 94 LVEEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 27.3 bits (61), Expect = 8.1
Identities = 15/75 (20%), Positives = 34/75 (45%)
Query: 128 KKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
++D+ + +D + + K + E +E E + +M E ++K E+ KE
Sbjct: 29 VDIDDEEDEDPKRYQDGDNEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKE 88
Query: 188 EKDSEKEKKSKDKDK 202
E + +E+ K + +
Sbjct: 89 EIEKYREENPKIQQQ 103
>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD. YqfD is part of
the sigma-E regulon in the sporulation program of
endospore-forming Gram-positive bacteria. Mutation
results in a sporulation defect in Bacillus subtilis.
Members are found in all currently known
endospore-forming bacteria, including the genera
Bacillus, Symbiobacterium, Carboxydothermus,
Clostridium, and Thermoanaerobacter [Cellular processes,
Sporulation and germination].
Length = 382
Score = 28.5 bits (64), Expect = 8.1
Identities = 22/95 (23%), Positives = 45/95 (47%), Gaps = 3/95 (3%)
Query: 119 KKEKEKIKKKKDKKDK-DKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEE--KEFKMKE 175
+K +KK + K D + +K + + ++ E KEK E+ KE +++
Sbjct: 275 GGKKIYLKKPPKEFAKYDVEEKTKKFGLWRWSLPIEIVKETYYEVKEKVEKVTKEEAIEK 334
Query: 176 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 210
KEK ++++KE + K K ++++ K+K
Sbjct: 335 AKEKAEKELKKELDPNAKIVSDKILSERVEGGKVK 369
>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 147 to 176 amino acids in length. This domain is
found associated with pfam00226.
Length = 144
Score = 27.7 bits (62), Expect = 8.1
Identities = 15/46 (32%), Positives = 27/46 (58%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 189
K +K+KE+ ++ EK K E +KK+E E + + K+ +EE+
Sbjct: 4 KAQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREEE 49
>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein. This family
consists of several eukaryotic paraflagellar rod
component proteins. The eukaryotic flagellum represents
one of the most complex macromolecular structures found
in any organism and contains more than 250 proteins. In
addition to its locomotive role, the flagellum is
probably involved in nutrient uptake since receptors for
host low-density lipoproteins are localised on the
flagellar membrane as well as on the flagellar pocket
membrane.
Length = 289
Score = 28.1 bits (63), Expect = 8.3
Identities = 17/103 (16%), Positives = 52/103 (50%), Gaps = 8/103 (7%)
Query: 123 EKIKKKKDKKD--KDKIKNK-EKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF--KMKEDK 177
+ I + K K + E+ KEK+E+E ++ K+ +++E ++F + E +
Sbjct: 2 KDISAVIEAKHRQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQ 61
Query: 178 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
+ I++ E++ + ++ ++ + +++ ++E++R +
Sbjct: 62 QACWRAIQELEREL---RDLAEERREEVERRIEAVEREEQRRT 101
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 28.6 bits (64), Expect = 8.3
Identities = 20/86 (23%), Positives = 33/86 (38%), Gaps = 7/86 (8%)
Query: 97 SSEDLGEIKKDK--SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKK 154
I SI K KK +K +KEK K D+ D D N+ E ++
Sbjct: 50 EWITADLINLGAAISIPKRKKQTEKGKKEKKPKVSDRMDLD---NENVQLEMLYSISNER 106
Query: 155 EEKEKKEEKEKKE--EKEFKMKEDKE 178
E ++ + K + + ++K E
Sbjct: 107 EIRQLRFGGSKVQNPHEGARVKNINE 132
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 28.5 bits (64), Expect = 8.4
Identities = 26/106 (24%), Positives = 49/106 (46%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEF 171
K+ H K+E E++K+ + + ++ +KE KE E E E E +E +
Sbjct: 322 KENNEHLKEEIERVKESYRLAETELGSVRKFEKELKELESVLDEILENIEAQEVAYSELQ 381
Query: 172 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
E+ EK + I E++ ++ S KD+ +E L++ K +
Sbjct: 382 DNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEARENLERLKSKLH 427
>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
NSA2 appears to be a protein required for the
maturation of 27S pre-rRNA in yeast; it has been
characterized in mammalian cells as a nucleolar protein
that might play a role in the regulation of the cell
cycle and in cell proliferation.
Length = 257
Score = 28.0 bits (63), Expect = 8.4
Identities = 14/39 (35%), Positives = 22/39 (56%)
Query: 112 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKE 150
K K +KK+ KEKI+ KK K ++ K+K +K +
Sbjct: 43 KAKLYNKKRYKEKIQMKKTIKMHEERNVKQKVDDKVPEG 81
>gnl|CDD|234042 TIGR02870, spore_II_D, stage II sporulation protein D. Stage II
sporulation protein D (SpoIID) is a protein of the
endospore formation program in a number of lineages in
the Firmicutes (low-GC Gram-positive bacteria). It is
expressed in the mother cell compartment, under control
of Sigma-E. SpoIID, along with SpoIIM and SpoIIP, is one
of three major proteins involved in engulfment of the
forespore by the mother cell [Cellular processes,
Sporulation and germination].
Length = 338
Score = 28.1 bits (63), Expect = 8.5
Identities = 14/52 (26%), Positives = 24/52 (46%)
Query: 137 IKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 188
+ + E+ EK +++ EK K+EE K+ E +EK+ EE
Sbjct: 25 FGDPQNVTEENEKLAPVEKKPEKGVTNLKEEEVAIKVYRSNEDKIEKVPLEE 76
>gnl|CDD|184732 PRK14539, PRK14539, 50S ribosomal protein L11/unknown domain fusion
protein; Provisional.
Length = 196
Score = 28.0 bits (62), Expect = 8.6
Identities = 14/52 (26%), Positives = 25/52 (48%), Gaps = 1/52 (1%)
Query: 144 KEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVE-KIRKEEKDSEKE 194
K K+E + K K ++ + E ++K K K +E + EE + EK+
Sbjct: 144 KAKEEAKAAAKAAALAKAKEASLKSAEAELKASKGKSIEVNVIGEEDEGEKD 195
>gnl|CDD|173534 PTZ00341, PTZ00341, Ring-infected erythrocyte surface antigen;
Provisional.
Length = 1136
Score = 28.6 bits (63), Expect = 8.8
Identities = 36/146 (24%), Positives = 64/146 (43%), Gaps = 14/146 (9%)
Query: 94 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK--------------IKN 139
++ S E L E K K + KK EK KKDKK K K + N
Sbjct: 866 QILSDEGLDEKKLKKRAESLKKLANAIEKYAGGGKKDKKAKKKDAKDLSGNIAHEINLIN 925
Query: 140 KEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 199
KE + + + KE E E++ +E E +E+ E+ VE+ +E + E+ ++
Sbjct: 926 KELKNQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENVEENVEENVEENVEENVEE 985
Query: 200 KDKKLKKEKLKKKKKEKERSSHEKEV 225
++ +E +++ +E + E+ V
Sbjct: 986 NVEENVEENVEENVEENIEENVEENV 1011
>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive.
Pullulan is an unusual, industrially important
polysaccharide in which short alpha-1,4 chains
(maltotriose) are connected in alpha-1,6 linkages.
Enzymes that cleave alpha-1,6 linkages in pullulan and
release maltotriose are called pullulanases although
pullulan itself may not be the natural substrate. In
contrast, a glycogen debranching enzyme such GlgX,
homologous to this family, can release glucose at
alpha,1-6 linkages from glycogen first subjected to limit
degradation by phosphorylase. Characterized members of
this family include a surface-located pullulanase from
Streptococcus pneumoniae (PMID:11083842) and an
extracellular bifunctional amylase/pullulanase with
C-terminal pullulanase activity (PMID:8798645).
Length = 1111
Score = 28.7 bits (64), Expect = 8.8
Identities = 26/81 (32%), Positives = 31/81 (38%), Gaps = 8/81 (9%)
Query: 12 VAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSK------ 65
V G + PP +H PQ P P SK KDKV PKD P S
Sbjct: 1004 VGGIEAPEKTPPPPEHE--PQAPKPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGET 1061
Query: 66 EKTSPKEKEYSKVKDVELGVT 86
K + ++KE K L T
Sbjct: 1062 PKGNEEKKEEQPDKGANLPNT 1082
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 28.2 bits (63), Expect = 8.9
Identities = 20/105 (19%), Positives = 52/105 (49%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE 170
D++ E+ ++ K + E+++ + +E+++K+E E++++K+
Sbjct: 244 YLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKK 303
Query: 171 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 215
+M ++ E +K E+ K+ K +K +KE+ K+ ++E
Sbjct: 304 KEMAPKVKQRFEANDPAQKLQEERKEQLAKLRKEEKEREKEYEQE 348
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 26.8 bits (59), Expect = 9.0
Identities = 17/50 (34%), Positives = 30/50 (60%)
Query: 138 KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 187
K++ ++K EK K + +KK K+KK+E+ + E+ ++KIRKE
Sbjct: 4 KSQLSKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRKE 53
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 28.6 bits (64), Expect = 9.0
Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 3/60 (5%)
Query: 7 SPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKE 66
P A P + P P V+PP T P ++ ++K P K+SS
Sbjct: 411 PSAPQSATQPA-GTPPTVSVDPPAAVP-VNPPSTAPQ-AVRPAQFKEEKKIPVSKVSSLG 467
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 28.1 bits (63), Expect = 9.1
Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 3/92 (3%)
Query: 127 KKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKE-FKMKEDKEKPVEKIR 185
KKK KK K K+ + D+E+ E E +++ K+ + + K K K V+
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSF 60
Query: 186 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
D +E+K + ++L++E LKK++ KE
Sbjct: 61 L--PDKAREEKEAELREELREEFLKKQEAVKE 90
>gnl|CDD|237972 PRK15483, PRK15483, type III restriction-modification system StyLTI
enzyme res; Provisional.
Length = 986
Score = 28.5 bits (64), Expect = 9.2
Identities = 20/109 (18%), Positives = 44/109 (40%), Gaps = 9/109 (8%)
Query: 101 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK---EKDKEKKEKEKGK-KEE 156
+GEI D ++ +++ ++ +KI +K K D D + + + D +K KE+
Sbjct: 577 VGEINSDSKVQLNEEKLTEEMIQKIVTEKQKVDPDFTELRLLEDLDDKKIIDRSNNFKED 636
Query: 157 KEKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 205
+ E + +++ K + K K+ ++LK
Sbjct: 637 GFDWLLEAYPELFQTGLRKGK-----VRDNKPASKLTVKLRKENYEELK 680
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 27.3 bits (61), Expect = 9.2
Identities = 13/59 (22%), Positives = 27/59 (45%)
Query: 161 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 219
+E + K K + VE +E+ ++ KE + ++L+K + + KK +K
Sbjct: 68 DEDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEP 126
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 28.4 bits (64), Expect = 9.2
Identities = 14/43 (32%), Positives = 19/43 (44%)
Query: 114 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEE 156
K K K K+KKKK K + E +E K K K ++
Sbjct: 351 KPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
Score = 28.1 bits (63), Expect = 9.9
Identities = 12/43 (27%), Positives = 21/43 (48%)
Query: 111 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGK 153
K KK++ K + +K KKK K + ++ K +K+ K
Sbjct: 351 KPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
>gnl|CDD|222859 PHA02535, P, terminase ATPase subunit; Provisional.
Length = 581
Score = 28.5 bits (64), Expect = 9.3
Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 2/68 (2%)
Query: 162 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 221
EKE K ++K + + E++ + + S ++ + K K+K K S
Sbjct: 66 EKENKTGGDYKEIDLLIRQHERLARVRRYSGTGNEADLNPNVANRNKGPKRKPVKNDIS- 124
Query: 222 EKEVIPKL 229
E KL
Sbjct: 125 -DEQTEKL 131
>gnl|CDD|153355 cd07671, F-BAR_PSTPIP1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Protein 1. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Proline-Serine-Threonine
Phosphatase-Interacting Protein 1 (PSTPIP1), also known
as CD2 Binding Protein 1 (CD2BP1), is mainly expressed
in hematopoietic cells. It is a binding partner of the
cell surface receptor CD2 and PTP-PEST, a tyrosine
phosphatase which functions in cell motility and Rac1
regulation. It also plays a role in the activation of
the Wiskott-Aldrich syndrome protein (WASP), which
couples actin rearrangement and T cell activation.
Mutations in the gene encoding PSTPIP1 cause the
autoinflammatory disorder known as PAPA (pyogenic
sterile arthritis, pyoderma gangrenosum, and acne)
syndrome. PSTPIP1 contains an N-terminal F-BAR domain,
PEST motifs, and a C-terminal SH3 domain. F-BAR domains
form banana-shaped dimers with a positively-charged
concave surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 242
Score = 28.0 bits (62), Expect = 9.3
Identities = 22/98 (22%), Positives = 43/98 (43%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKEK 179
+E++K ++KK + ++++ + KK E K E+ +E E ++ E K
Sbjct: 94 RERQKEQRKKYEAVMERVQKSKVSLYKKTMESKKTYEQRCREADEAEQTFERSSSTGNPK 153
Query: 180 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 217
EK + + K ++ K E+L K + E E
Sbjct: 154 QSEKSQNKAKQCRDAATEAERVYKQNIEQLDKARTEWE 191
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 28.4 bits (63), Expect = 9.4
Identities = 22/124 (17%), Positives = 55/124 (44%), Gaps = 1/124 (0%)
Query: 98 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEK 157
E+ + +D+ +++ ++KK + + + + + KE+KE+ + + +
Sbjct: 370 QEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDT 429
Query: 158 EKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEK 216
EK E+ + + E+ E+ ++E+ E+ + K + + LK+E + K
Sbjct: 430 NTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLS 489
Query: 217 ERSS 220
ER
Sbjct: 490 EREL 493
>gnl|CDD|218986 pfam06317, Arena_RNA_pol, Arenavirus RNA polymerase. This family
consists of several Arenavirus RNA polymerase proteins
(EC:2.7.7.48).
Length = 2206
Score = 28.4 bits (64), Expect = 9.6
Identities = 36/109 (33%), Positives = 50/109 (45%), Gaps = 6/109 (5%)
Query: 122 KEKIKKKKDKKDKD-KI-KNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFK-MKEDKE 178
K K K K D +D D KI KN +K K +EE EE ++ + F +KE E
Sbjct: 854 KTKGKYKLDPEDYDYKIQKNLSSLVSGSKKSKSNREELSLYEELDEDQSDYFDEIKESVE 913
Query: 179 KPVEKIRKEEK---DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 224
K + K+RK K + K KS + ++L K + E S HE E
Sbjct: 914 KTLSKMRKSRKAKSGNLKNTKSINDLERLWAPKGVMRLIRSETSLHEVE 962
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 27.8 bits (62), Expect = 9.6
Identities = 17/85 (20%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
Query: 113 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEK---DKEKKEKEKGKKEEKEKKEEKEKKEEK 169
D++ EK+ IK K + D + ++++ D++K+E+E + + + + + E +++
Sbjct: 88 DEEIQAMTEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDD 147
Query: 170 EFKMKEDKEKPVEKIRKEEKDSEKE 194
E ++ V++ +++D E E
Sbjct: 148 EIVEILIEDDEVDEDEDDDEDEEDE 172
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 28.1 bits (63), Expect = 9.7
Identities = 12/44 (27%), Positives = 26/44 (59%)
Query: 120 KEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEK 163
K E+ K+ +K+++ + ++ EK E+ K ++E EK++ K
Sbjct: 148 KSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
>gnl|CDD|138835 PRK12279, PRK12279, 50S ribosomal protein L22/unknown domain fusion
protein; Provisional.
Length = 311
Score = 28.1 bits (62), Expect = 9.9
Identities = 21/82 (25%), Positives = 41/82 (50%), Gaps = 4/82 (4%)
Query: 93 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKI----KKKKDKKDKDKIKNKEKDKEKKE 148
+++ S+D+ E +K+ + K KK+ K E KK K K ++K EKD+
Sbjct: 104 LEIVLSDDVNEREKELAAIKAKKSKKPLPVEPKAKVETKKVAKPSKVEVKPVEKDENVDP 163
Query: 149 KEKGKKEEKEKKEEKEKKEEKE 170
+ ++++ K EK +++E
Sbjct: 164 ELLKREQQVLKVVEKTASQKEE 185
>gnl|CDD|224031 COG1106, COG1106, Predicted ATPases [General function prediction
only].
Length = 371
Score = 28.3 bits (63), Expect = 9.9
Identities = 22/102 (21%), Positives = 39/102 (38%), Gaps = 2/102 (1%)
Query: 120 KEKEKIKKK-KDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMKEDKE 178
+ + I K++ + E+DKE + K + K + + K E F K++
Sbjct: 60 EREINIWNGSYGKEESNLDIFFEQDKEVEIISKNTENNKFNFKLQIKFEINGFPSKDESR 119
Query: 179 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 220
K V KI +K K ++ + L K KE +
Sbjct: 120 K-VNKIEITKKSKNINDFLKIFNEYFVETLLIIAKYPKEPPN 160
>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family. This protein
is plasmid encoded and found to be essential for plasmid
replication.
Length = 279
Score = 28.1 bits (63), Expect = 10.0
Identities = 20/98 (20%), Positives = 46/98 (46%), Gaps = 2/98 (2%)
Query: 115 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKEKEKGKKEEKEKKEEKEKKEEKEFKMK 174
+ +K++ E KK KK + + ++ KE + + E++ +K + +
Sbjct: 169 NSARKQQLEWENKKLKKKGLIPLT--LDEARRRAKEFHIERAFSYRTERKAFGKKRRRAR 226
Query: 175 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 212
+ + + IRK+ ++ ++ SK + K+ + LKKK
Sbjct: 227 KLAKLDEKDIRKKILNALVKEYSKGELTKMGLKGLKKK 264
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.129 0.377
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,014,001
Number of extensions: 2187259
Number of successful extensions: 26481
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13147
Number of HSP's successfully gapped: 3082
Length of query: 446
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 346
Effective length of database: 6,502,202
Effective search space: 2249761892
Effective search space used: 2249761892
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 61 (27.1 bits)