RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5823
(444 letters)
>gnl|CDD|192483 pfam10203, Pet191_N, Cytochrome c oxidase assembly protein PET191.
Pet191_N is the conserved N-terminal of a family of
conserved proteins found from nematodes to humans. It
carries six highly conserved cysteine residues. Pet191
is required for the assembly of active cytochrome c
oxidase but does not form part of the final assembled
complex.
Length = 68
Score = 88.5 bits (220), Expect = 6e-22
Identities = 36/67 (53%), Positives = 47/67 (70%), Gaps = 3/67 (4%)
Query: 377 DKSACAGSRADLKLCLLETDCVQVDRKTPRQCLREGLAP---QCEALKNTFFECKRSLLD 433
S+C G R DL CLLE+DCV+ +++TPR+CL+E L +C AL+ FFECKR +LD
Sbjct: 1 MGSSCKGLREDLAECLLESDCVKKEKRTPRECLKEPLKELPEECIALRKAFFECKRGMLD 60
Query: 434 NRTRFRG 440
R RFRG
Sbjct: 61 MRKRFRG 67
>gnl|CDD|214584 smart00249, PHD, PHD zinc finger. The plant homeodomain (PHD)
finger is a C4HC3 zinc-finger-like motif found in
nuclear proteins thought to be involved in epigenetics
and chromatin-mediated transcriptional regulation. The
PHD finger binds two zinc ions using the so-called
'cross-brace' motif and is thus structurally related to
the RING finger and the FYVE finger. It is not yet known
if PHD fingers have a common molecular function. Several
reports suggest that it can function as a
protein-protein interacton domain and it was recently
demonstrated that the PHD finger of p300 can cooperate
with the adjacent BROMO domain in nucleosome binding in
vitro. Other reports suggesting that the PHD finger is a
ubiquitin ligase have been refuted as these domains were
RING fingers misidentified as PHD fingers.
Length = 47
Score = 54.1 bits (130), Expect = 6e-10
Identities = 23/48 (47%), Positives = 28/48 (58%), Gaps = 3/48 (6%)
Query: 292 ICPACGVQDDGSLPMIGCDGCDAWYHWVCVG--LVAEPETSDWFCPKC 337
C CG DDG ++ CDGCD WYH C+G L+ E W+CPKC
Sbjct: 1 YCSVCGKPDDGGE-LLQCDGCDRWYHQTCLGPPLLEEEPDGKWYCPKC 47
>gnl|CDD|201356 pfam00628, PHD, PHD-finger. PHD folds into an interleaved type of
Zn-finger chelating 2 Zn ions in a similar manner to
that of the RING and FYVE domains. Several PHD fingers
have been identified as binding modules of methylated
histone H3.
Length = 51
Score = 53.6 bits (129), Expect = 1e-09
Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 292 ICPACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPET---SDWFCPKCS 338
C CG DD ++ CDGCD W+H C+G EPE +W+CP+C
Sbjct: 1 YCAVCGKVDDDGE-LLLCDGCDRWFHLACLGPPLEPEEIPEGEWYCPECK 49
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 60.0 bits (146), Expect = 1e-09
Identities = 33/177 (18%), Positives = 63/177 (35%)
Query: 57 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
T +++ K K + S T + KE + K V + +
Sbjct: 6 TKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKD 65
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
+ D KK K K + KKK K + D K EK D + K +
Sbjct: 66 TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDID 125
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+ + + +D + + I ++ D + ++ D D + E+ K+ K+ ++ S
Sbjct: 126 VLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182
Score = 47.7 bits (114), Expect = 9e-06
Identities = 25/139 (17%), Positives = 51/139 (36%), Gaps = 5/139 (3%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
K+ ++ KKK K+ K K K K + KE + KK ++ ++
Sbjct: 4 ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKK-----TPEQIDQVLIF 58
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 241
D + +K+ K + K +K KK K++L KK +++++ +K+
Sbjct: 59 LSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDL 118
Query: 242 KLTFKFGTDMEEKTKRESS 260
+ +
Sbjct: 119 NYVKDIDVLNQADDDDDDD 137
Score = 44.2 bits (105), Expect = 9e-05
Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 1/136 (0%)
Query: 103 TPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 162
T E ++KK + K K K+E ++ + K K + +I
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPE-QIDQVLIFLS 60
Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
V + + + +KK + K K +K++ E S+K +K DK
Sbjct: 61 GMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNY 120
Query: 223 LKKKKKEKERSSHEKE 238
+K + + +
Sbjct: 121 VKDIDVLNQADDDDDD 136
Score = 44.2 bits (105), Expect = 1e-04
Identities = 23/159 (14%), Positives = 55/159 (34%)
Query: 60 PSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLG 119
+ D+V K+ + E PK+K + K ++ SS+
Sbjct: 48 TPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAE 107
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+ + D D D + + + D + + + E ++ +
Sbjct: 108 KKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDD 167
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
++E++ + KE ++ + ++D + + KD KL
Sbjct: 168 EDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206
Score = 29.2 bits (66), Expect = 4.9
Identities = 20/129 (15%), Positives = 43/129 (33%), Gaps = 1/129 (0%)
Query: 64 KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTP-MVPNIVKVTSSEDLGEIK 122
K +P +K E S K+ E D + + +++ +D +
Sbjct: 80 KTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDD 139
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
D ++ ++ D D+D+ K + K+ EK D + +++ E +
Sbjct: 140 DDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQ 199
Query: 183 KEFKMKEDK 191
K
Sbjct: 200 ARKDAKLTA 208
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 57.1 bits (137), Expect = 1e-08
Identities = 38/143 (26%), Positives = 62/143 (43%), Gaps = 4/143 (2%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E KK KK + KKK KK ++ K + E + + E+ + ++K
Sbjct: 1313 EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKK 1372
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
KEE + K K+K EK + +E + E+ K D+ K KKK E ++ + EK+
Sbjct: 1373 KEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
Query: 240 IPKLTFKFGTDMEEKTKRESSPK 262
+ K EE K + + K
Sbjct: 1433 ADEAKKK----AEEAKKADEAKK 1451
Score = 55.9 bits (134), Expect = 3e-08
Identities = 44/172 (25%), Positives = 84/172 (48%), Gaps = 3/172 (1%)
Query: 70 KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
+ + + E +++EK EK+ + K K ++ E K K+ +
Sbjct: 1351 EAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADEL 1410
Query: 130 DKKTHKKKEKEKIKKK---KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
K KK+ ++ KKK K K D+ K K +E K + K+ E+ KK E+ KK+ +E K
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
++ +K E+ +K ++ +K +++K K + KK KKK ++ + + E +
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
Score = 55.1 bits (132), Expect = 5e-08
Identities = 32/129 (24%), Positives = 63/129 (48%), Gaps = 1/129 (0%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
K ++ + K K+ + K K + + + + ++K + EK KE+ K
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
Query: 171 -KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
K K EEK+K +E + K +EDK+K E + + ++ K ++K K ++ KKK +E
Sbjct: 1383 AKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEE 1442
Query: 230 KERSSHEKE 238
+++ K+
Sbjct: 1443 AKKADEAKK 1451
Score = 55.1 bits (132), Expect = 5e-08
Identities = 47/175 (26%), Positives = 95/175 (54%), Gaps = 7/175 (4%)
Query: 88 KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK-KKEKEKIKKKK 146
K+ E +K+K EL V+ ++ E KK + +KK ++ +K K +E K ++
Sbjct: 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE 1672
Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
DKK ++ K E+D++K + K+ E+ KK E+ K + +++K E+++K E++++
Sbjct: 1673 DKKKAEEAKKAEEDEKK---AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENK 1729
Query: 207 ---KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
+E K + ++ K K E+ KK ++EK++ +H K+ K + + E + E
Sbjct: 1730 IKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEE 1784
Score = 53.2 bits (127), Expect = 2e-07
Identities = 48/180 (26%), Positives = 82/180 (45%), Gaps = 6/180 (3%)
Query: 62 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
K K + + K + K+K +K + K ++ E + E
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE---AK 1377
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD---KEKEKGKKEEKE 178
KK + KK + KK ++ K K ++DKK D++K K+K + K +EK K +E +
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
KK E+ K E K+K E + EE + E+ K + K K E+ KK + K+++ K+
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK 1497
Score = 52.8 bits (126), Expect = 3e-07
Identities = 38/143 (26%), Positives = 67/143 (46%), Gaps = 4/143 (2%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E KK KK + KK ++ K K ++ KK D K K ++ +K + K + + E
Sbjct: 1300 EKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEA 1359
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
+ +E +K+K K + + + E+K K + K K E+ KKK E ++++ K+
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Query: 240 IPKLTFKFGTDMEEKTKRESSPK 262
+ K EEK K + + K
Sbjct: 1420 ADEAKKK----AEEKKKADEAKK 1438
Score = 51.3 bits (122), Expect = 9e-07
Identities = 46/176 (26%), Positives = 93/176 (52%), Gaps = 3/176 (1%)
Query: 64 KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 123
K K K + M+ ++ K+ E +++++V + + +IK
Sbjct: 1563 KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA 1622
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
++ +KK ++ KK E+ K K+ ++KK +++K E+ E K+ +E K EE +KK E+
Sbjct: 1623 EE-LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE--ENKIKAAEEAKKAEEDKKKAEE 1679
Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
K +ED++K E ++KE ++++K ++ K K+ + KK+ + KK E+E +E
Sbjct: 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEA 1735
Score = 47.4 bits (112), Expect = 1e-05
Identities = 44/183 (24%), Positives = 89/183 (48%), Gaps = 6/183 (3%)
Query: 61 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
++ K + + K K K+K + +K + ++K+ + K + K +++ +
Sbjct: 1362 AEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD 1421
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKK----KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
K K+ +K K KK+ E+ KK KK ++ K + +K E+ ++ K K EE
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEE--KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
+K +E + K +E K+K E + E K +++ KK+++ K + +K ++ KK E
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
Query: 235 HEK 237
E+
Sbjct: 1542 AEE 1544
Score = 47.1 bits (111), Expect = 2e-05
Identities = 52/223 (23%), Positives = 105/223 (47%), Gaps = 7/223 (3%)
Query: 76 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
+E E+ E+E KV+ ++ ++ +E+ +IK + KK ++ K
Sbjct: 1616 EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675
Query: 136 K-----KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
K K +E +KK + K + + +K +E K + +EK K EE +K EE+ E+
Sbjct: 1676 KAEEAKKAEED-EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK-EKERSSHEKEVIPKLTFKFGT 249
+K E+ +K+ ++++K+++ K K LKKE+ KK ++ KE+ + +E + + K
Sbjct: 1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRM 1794
Query: 250 DMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQVWI 292
++++K K I + + E + K+V
Sbjct: 1795 EVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVAD 1837
Score = 44.7 bits (105), Expect = 1e-04
Identities = 43/140 (30%), Positives = 66/140 (47%), Gaps = 4/140 (2%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
K+ + K KK +E K + KK ++K K E K +++ K +EK K EE +K EE +
Sbjct: 1517 KAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDK 1576
Query: 185 FKMKEDKE--KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 242
E K E+ R EE E++ K K ++ KK + K K E+ + +E K
Sbjct: 1577 NMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK--AEEEKKK 1634
Query: 243 LTFKFGTDMEEKTKRESSPK 262
+ + EEK K E K
Sbjct: 1635 VEQLKKKEAEEKKKAEELKK 1654
Score = 44.4 bits (104), Expect = 1e-04
Identities = 48/182 (26%), Positives = 79/182 (43%), Gaps = 7/182 (3%)
Query: 62 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
K ++ D+ K + K+ K+ +K K E K ++ E
Sbjct: 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA--KKKAEEKKKADEAKKKAEE 1442
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK--EKDKEKKVDKEKEKGKKEEKEK 179
K K K KK +E KK ++ K D+ K K E K + K+ E+ KK+ E
Sbjct: 1443 AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK---KKKEKERSSHE 236
K+ E K K D+ K E+ +K ++ + E+ K + K +EK K KK E+ + + E
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562
Query: 237 KE 238
K+
Sbjct: 1563 KK 1564
Score = 44.4 bits (104), Expect = 1e-04
Identities = 48/180 (26%), Positives = 79/180 (43%), Gaps = 10/180 (5%)
Query: 62 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
K K + K K+ ++K+K +K+ + K + K ++ E
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKAD-EAKKKAEEAKKADEAKKKAEE 1455
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD----KEKEKGKKEEK 177
K K K KK E KK ++ K D+ K K ++ +KK D + K K +E
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515
Query: 178 EKKEEK----EFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKEKER 232
+K EE E K E+ +K E + EEK E KK+++ K +K+K ++ KK +E
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
Score = 44.0 bits (103), Expect = 2e-04
Identities = 46/159 (28%), Positives = 80/159 (50%), Gaps = 9/159 (5%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK--KVDKEKEKGKKEEK 177
E KK KK + KK ++ K K ++ KK D+ K + K+K + K +E K +E
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
Query: 178 EKKEEK----EFKMKEDKEKPVEKIRKEE-KDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+K EE E K E+K+K E + EE K +E++KK+++ K + + + +K E+ +
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587
Query: 233 SSHEK--EVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
+ E E + KL + E+ K+ KI + +K
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELK 1626
Score = 42.1 bits (98), Expect = 6e-04
Identities = 35/167 (20%), Positives = 75/167 (44%), Gaps = 2/167 (1%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
+E+ ++ + ++ + H + + IK ++ +K + K +EK K + K +EK K
Sbjct: 1245 AEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKA 1304
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
+E +KK E+ K E K+K E +K + +K +++K + K E + ++ ++
Sbjct: 1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA--EAAADEAEAA 1362
Query: 235 HEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADE 281
EK + + + K+++ K K + ADE
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409
Score = 40.9 bits (95), Expect = 0.001
Identities = 33/125 (26%), Positives = 70/125 (56%), Gaps = 2/125 (1%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
+++ + ++ K + KK +KK+ +++KK ++ K ++ K E+ K+ + DK K
Sbjct: 1524 ADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKA 1583
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
EE +K EE + + +K++ EE ++K +++K K ++LKK + +KKK E+ +
Sbjct: 1584 EEAKKAEEARIEEVMKLYEEEKKMKAEE--AKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641
Query: 235 HEKEV 239
+E
Sbjct: 1642 EAEEK 1646
Score = 32.0 bits (72), Expect = 0.79
Identities = 41/205 (20%), Positives = 86/205 (41%), Gaps = 10/205 (4%)
Query: 61 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
+D + + K +KT + E ++ + + +ED +
Sbjct: 1082 DAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARK 1141
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKK-KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
++ + + K+ ++ E +K ++ +K +D K + K ++V K +E K E+ K
Sbjct: 1142 AEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARK 1201
Query: 180 KEEKEFKMKEDKEKPVEKIRK--EEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
E K ++E+ E+ RK + K +E KK+++ KK+ + KK E+ER++ E
Sbjct: 1202 AEAAR---KAEEERKAEEARKAEDAKKAEAVKKAEE----AKKDAEEAKKAEEERNNEEI 1254
Query: 238 EVIPKLTFKFGTDMEEKTKRESSPK 262
+ + K E + K
Sbjct: 1255 RKFEEARMAHFARRQAAIKAEEARK 1279
Score = 31.6 bits (71), Expect = 1.0
Identities = 33/131 (25%), Positives = 71/131 (54%), Gaps = 12/131 (9%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E++K + +K + K +E+ K ++ + +D K + +K +E K D E+ K +EE+
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN 1251
Query: 180 KEEKEFK--------MKEDKEKPVEKIRKEE-KDSEKEKKS---KDKDKKLKKEKLKKKK 227
+E ++F+ ++ K E + +E K +E++KK+ K ++K K ++ KKK
Sbjct: 1252 EEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKA 1311
Query: 228 KEKERSSHEKE 238
+E +++ K+
Sbjct: 1312 EEAKKADEAKK 1322
Score = 31.3 bits (70), Expect = 1.4
Identities = 31/166 (18%), Positives = 79/166 (47%), Gaps = 4/166 (2%)
Query: 72 KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 131
K + +++ + E + +E K ++++ ++ +E+ +IK +++ K+ +
Sbjct: 1681 KKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAE 1740
Query: 132 KTHKKKEKEKI----KKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
+ KK E+ K KKK K++ K E+ +++K +E+ +E+++++ E + K+
Sbjct: 1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKI 1800
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
K+ + I ++ + SK+ + KE K + E +
Sbjct: 1801 KDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEA 1846
Score = 28.6 bits (63), Expect = 8.1
Identities = 31/134 (23%), Positives = 70/134 (52%), Gaps = 10/134 (7%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
+ED + + + ++ +K + ++ E +K + + ++ + E+ ++ + K+ E KK
Sbjct: 1172 AEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKK 1231
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEE---------KDSEKEKKSKDKDKKLKKEKLKK 225
E+ KK+ +E K K ++E+ E+IRK E + + + + K +LKK + KK
Sbjct: 1232 AEEAKKDAEEAK-KAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK 1290
Query: 226 KKKEKERSSHEKEV 239
K E +++ +K+
Sbjct: 1291 KADEAKKAEEKKKA 1304
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 53.0 bits (128), Expect = 2e-07
Identities = 27/86 (31%), Positives = 48/86 (55%), Gaps = 9/86 (10%)
Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
T KK +KIKK +K +K K ++ KEK+K K+K+EE+E + ++ +E
Sbjct: 403 TGSKKATKKIKKIVEKAEK---------KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKL 218
+ E+ + E++ E+E++ K K L
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKKKQATL 479
Score = 52.6 bits (127), Expect = 2e-07
Identities = 23/81 (28%), Positives = 50/81 (61%)
Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
+++ + + +K +K ++ KK E+EKKE+K+ K++ E+ KE+K+ E
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 207 KEKKSKDKDKKLKKEKLKKKK 227
KE++ ++ +++ ++E+ KKKK
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKK 475
Score = 52.6 bits (127), Expect = 2e-07
Identities = 28/81 (34%), Positives = 52/81 (64%), Gaps = 1/81 (1%)
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
EI+ KK K KK EK +KK++++ K+K K K+K+ ++E+EK KKEE
Sbjct: 395 TEEEIEFLTGSKKATKK-IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Query: 177 KEKKEEKEFKMKEDKEKPVEK 197
++++EE+E + ++++E+ +K
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKK 474
Score = 51.5 bits (124), Expect = 5e-07
Identities = 25/83 (30%), Positives = 53/83 (63%), Gaps = 2/83 (2%)
Query: 150 DKDKIKN--KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
+++I+ K KK+ K EK +K+ +E+K+EK+ K K+K E+ ++EK E+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 208 EKKSKDKDKKLKKEKLKKKKKEK 230
+++ +++ ++ K+E+ +KKKK+
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKKQA 477
Score = 50.7 bits (122), Expect = 9e-07
Identities = 24/72 (33%), Positives = 45/72 (62%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
KK K +K++E+ KK+K KK K +E+++E+K KE+EK ++EE+ ++E
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Query: 182 EKEFKMKEDKEK 193
++E + K+ K+
Sbjct: 466 KEEEEEKKKKQA 477
Score = 45.7 bits (109), Expect = 3e-05
Identities = 22/80 (27%), Positives = 51/80 (63%), Gaps = 3/80 (3%)
Query: 160 DKEKKV---DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
++E + K+ K K+ EK E+K + K++K+K +K+E++ E+EK+ K+++K
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEK 455
Query: 217 KLKKEKLKKKKKEKERSSHE 236
+ ++E+ +++K+E+E +
Sbjct: 456 EEEEEEAEEEKEEEEEKKKK 475
Score = 44.9 bits (107), Expect = 6e-05
Identities = 19/66 (28%), Positives = 40/66 (60%)
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
+K +K K+ E EK E+ +KE+K K K+++++ +KEK +++K+E+E + E
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 237 KEVIPK 242
++ +
Sbjct: 465 EKEEEE 470
Score = 42.6 bits (101), Expect = 3e-04
Identities = 17/70 (24%), Positives = 51/70 (72%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+++ + +I + K++++ +KK+K KKK+++++++ + KE++KE++ ++ +E+ +
Sbjct: 408 ATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Query: 174 KEEKEKKEEK 183
+EE++KK++
Sbjct: 468 EEEEKKKKQA 477
Score = 41.8 bits (99), Expect = 6e-04
Identities = 21/64 (32%), Positives = 44/64 (68%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
IKK + K+ +KKEK+K KK++++ + KEK +E+K ++E+E +++E+E++
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Query: 181 EEKE 184
++K+
Sbjct: 472 KKKK 475
Score = 38.4 bits (90), Expect = 0.007
Identities = 23/82 (28%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
E+E + +K +KI+K + +EK+++ + K+KK K KKK++E+E +KE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE-E 453
Query: 241 PKLTFKFGTDMEEKTKRESSPK 262
K + + E++ + E K
Sbjct: 454 EKEEEEEEAEEEKEEEEEKKKK 475
Score = 34.9 bits (81), Expect = 0.074
Identities = 14/79 (17%), Positives = 40/79 (50%)
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
++E + + K ++KI ++ + +E+K + K K +K +++++E++ E+
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 238 EVIPKLTFKFGTDMEEKTK 256
+ + + + EE+ K
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473
Score = 31.8 bits (73), Expect = 0.75
Identities = 17/108 (15%), Positives = 40/108 (37%), Gaps = 11/108 (10%)
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKK----SKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 242
+ + + ++ + +E+E + SK KK+KK K +KK +E +K+
Sbjct: 377 IFKHNPELAARLAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFA 436
Query: 243 LTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV 290
+++ + E + + + +E + KQ
Sbjct: 437 -------GKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
Score = 29.9 bits (68), Expect = 3.3
Identities = 12/84 (14%), Positives = 37/84 (44%), Gaps = 7/84 (8%)
Query: 84 KTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK 143
+K V+ E + + K K +++++ +KKE+EK +
Sbjct: 405 SKKATKKIKKIVEKAEKKREE------EKKEKKKKAFAGKKKEEEEEEEK-EKKEEEKEE 457
Query: 144 KKKDKKDKDKIKNKEKDKEKKVDK 167
++++ +++ + + ++K K+ +
Sbjct: 458 EEEEAEEEKEEEEEKKKKQATLFD 481
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 52.6 bits (126), Expect = 3e-07
Identities = 32/126 (25%), Positives = 65/126 (51%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
VK K+ K KE+EK K++ ++ K K + +++ + + KE
Sbjct: 76 AVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKE 135
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
+ K K+ KEK++EKE K++E +++ EK R+ + + KK K KK++ +++K
Sbjct: 136 EAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195
Query: 229 EKERSS 234
+++ +
Sbjct: 196 QRQAAR 201
Score = 51.4 bits (123), Expect = 5e-07
Identities = 37/149 (24%), Positives = 73/149 (48%)
Query: 89 EKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDK 148
E + S + V+ T + + K +K+K+ K+++K+K +K K++
Sbjct: 68 ESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEE 127
Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
K K + K+K +KEKEK KK E+ + E+E K + + K K ++K K+
Sbjct: 128 PKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKK 187
Query: 209 KKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
K+ +++K+ + + K K +E +E+
Sbjct: 188 KEPPEEEKQRQAAREAVKGKPEEPDVNEE 216
Score = 48.0 bits (114), Expect = 8e-06
Identities = 51/223 (22%), Positives = 95/223 (42%), Gaps = 16/223 (7%)
Query: 61 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
K K KE + K KEKE K + + P ++ +
Sbjct: 81 EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKP-----------KEEPK 129
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK-KVDKEKEKGKKEEKEK 179
+K K K+K+ K+KEKEK KK ++ +D+++ K +E+ + K + K +K +K++
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE 189
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKKKKEKERSSHEKE 238
E+E + + +E K + + + E+EK+ D KD++ +++ + + +
Sbjct: 190 PPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRS 249
Query: 239 VIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS---PSPPPVAA 278
M RESS + +P S PS P +A
Sbjct: 250 SSSLKKPDPSPSMASPETRESSKRTETRPRTSLRPPSARPASA 292
Score = 46.0 bits (109), Expect = 2e-05
Identities = 49/230 (21%), Positives = 96/230 (41%), Gaps = 21/230 (9%)
Query: 52 VHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVK 111
V G+ P+ S KE+ KE+ ++K+ K
Sbjct: 80 VEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKK-------------------K 120
Query: 112 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK-KVDKEKE 170
++ + +K K K+K+ K+KEKEK KK ++ +D+++ K +E+ + K + K +
Sbjct: 121 KEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPK 180
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKKKKE 229
K +K++ E+E + + +E K + + + E+EK+ D KD++ +++ +
Sbjct: 181 KKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESR 240
Query: 230 KERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAA 279
+ + M RESS + +P S PP A
Sbjct: 241 QSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRTSLRPPSARPA 290
Score = 43.3 bits (102), Expect = 2e-04
Identities = 37/143 (25%), Positives = 69/143 (48%), Gaps = 3/143 (2%)
Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
+K K K+ + + KE+EK K++ KE+K++K+ K KE+ + K +EK
Sbjct: 81 EKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
Query: 204 DSEKEK---KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 260
KEK K K ++ +E+ KK+++ + +S +K K K EE+ +R+++
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200
Query: 261 PKIVIKPVKSPSPPPVAAADEYD 283
+ V + P +E D
Sbjct: 201 REAVKGKPEEPDVNEEREKEEDD 223
Score = 41.8 bits (98), Expect = 6e-04
Identities = 30/129 (23%), Positives = 60/129 (46%), Gaps = 2/129 (1%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
K D+ + + K K K KE K + ++E K KE ++ +K +++ K+ K
Sbjct: 70 KLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129
Query: 208 EKKSKD--KDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 265
++K K+ K+K+ KEK K+K+K+ E +E + + + ++ K++ K
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKE 189
Query: 266 KPVKSPSPP 274
P +
Sbjct: 190 PPEEEKQRQ 198
Score = 32.6 bits (74), Expect = 0.51
Identities = 29/167 (17%), Positives = 63/167 (37%), Gaps = 12/167 (7%)
Query: 48 PQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVP 107
+ P P K K++ K KEK K+ P+++E K ++
Sbjct: 118 KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV-------- 169
Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
S KK KK + ++K+++ ++ K ++ N+E++KE+ K
Sbjct: 170 ----RAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGK 225
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
++E +E + + ++ ++K + E + K
Sbjct: 226 DRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSK 272
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 50.9 bits (123), Expect = 8e-07
Identities = 40/160 (25%), Positives = 88/160 (55%), Gaps = 19/160 (11%)
Query: 109 IVKVTSSEDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV-D 166
+ K + + E +++ K I ++ K KE E IKK+ + K++I + EK++ +
Sbjct: 24 VRKKIAEAKIKEAEEEAKRILEEAK----KEAEAIKKEALLEAKEEIHKLRNEFEKELRE 79
Query: 167 KEKEKGKKEEK-EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
+ E K E++ +KEE + E EK E++ K+EK+ E++++ +K ++ +E +++
Sbjct: 80 RRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139
Query: 226 KKKEKER----SSHE-KEVIPKLTFKFGTDMEEKTKRESS 260
+ +E ER ++ E KE++ + +EE+ + E++
Sbjct: 140 QLQELERISGLTAEEAKEILLE-------KVEEEARHEAA 172
Score = 46.3 bits (111), Expect = 2e-05
Identities = 42/151 (27%), Positives = 84/151 (55%), Gaps = 26/151 (17%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKK--KDKKDK-DKIKNKEK---DKEKKVDKEKEKGKKE 175
K+ ++IKK+ K+E K++ + K+ +++ ++++ EK KE+ +D++ E +K
Sbjct: 49 KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108
Query: 176 EKE---KKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK--KK 228
E+E K++E E K + E KE+ +E++ +E+ E E+ S L E+ K+ +K
Sbjct: 109 EEELEKKEKELEQKQQELEKKEEELEELIEEQLQ-ELERIS-----GLTAEEAKEILLEK 162
Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRES 259
+E + HE V+ K ++EE+ K E+
Sbjct: 163 VEEEARHEAAVLIK-------EIEEEAKEEA 186
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 50.0 bits (120), Expect = 2e-06
Identities = 42/219 (19%), Positives = 75/219 (34%), Gaps = 17/219 (7%)
Query: 59 PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKD----------VELGVTPM--V 106
SK K K S K K+K K+K+S + + + V +L P
Sbjct: 1158 LKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKK 1217
Query: 107 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK--- 163
N +D + K K + KK K + D+ D + + K K
Sbjct: 1218 SNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKR 1277
Query: 164 --KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
V K + K+K +++ +K+KKS+ K + KK
Sbjct: 1278 VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKS 1337
Query: 222 KLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 260
K + K+ +SS K ++ ++ ++ + S
Sbjct: 1338 KTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDS 1376
Score = 42.3 bits (100), Expect = 5e-04
Identities = 35/170 (20%), Positives = 58/170 (34%), Gaps = 10/170 (5%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
+ EI K++ +K K K ++ KKK+KK K DK KK + +
Sbjct: 1147 VEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKK----KKSSADKSKKASVVGNSKRVD 1202
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
EK++ + + +E+ K KKS K K KK K ++ + S
Sbjct: 1203 SDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSS 1262
Query: 236 EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTG 285
+ K + + + SP P K P +
Sbjct: 1263 DD---LSKEGKPKNAPKRVSAVQYSPP---PPSKRPDGESNGGSKPSSPT 1306
Score = 38.9 bits (91), Expect = 0.005
Identities = 33/155 (21%), Positives = 48/155 (30%), Gaps = 5/155 (3%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
K K+ K K K K+KEK KKK K K D EK K + KK
Sbjct: 1161 KTKGKASKLRKP--KLKKKEKKKKKSSADKSKKASVVGNSKRVDSD-EKRKLDDKPDNKK 1217
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
+ED E+ K +K K KK+ ++ KE K
Sbjct: 1218 SNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKP--KNAP 1275
Query: 241 PKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPP 275
+++ + + + KP
Sbjct: 1276 KRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKV 1310
Score = 36.6 bits (85), Expect = 0.028
Identities = 27/159 (16%), Positives = 49/159 (30%), Gaps = 5/159 (3%)
Query: 57 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
P S +K S K+ +S + + + + KE + V S
Sbjct: 1234 KPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPK--RVSAVQYSPPPPSKR 1291
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
GE K KK E KK K + K K KK ++ +
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARK---KKSKTRVKQASASQ 1348
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 215
+ + K K D + + + +++ + + D
Sbjct: 1349 SSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387
Score = 36.6 bits (85), Expect = 0.034
Identities = 32/119 (26%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
K+K +K +++ KEK+ EK + + E+ +K EE ++E +E ++I KE++
Sbjct: 1101 KEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEE---ALEEQEEVEEKEIAKEQR 1157
Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 262
K K K K + KK+KK+K+ S+ + + + D +EK K + P
Sbjct: 1158 LKSKTKGK--ASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPD 1214
Score = 33.9 bits (78), Expect = 0.23
Identities = 36/186 (19%), Positives = 78/186 (41%), Gaps = 20/186 (10%)
Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
I + + E+KK ++ KK EK ++++ ++ D+ +++ ++E
Sbjct: 1027 VITNAKKKDLVKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAV 1086
Query: 168 E-------------KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
KEK +K E E+KE ++++ K + + E+ D +E + +
Sbjct: 1087 SYDYLLSMPIWSLTKEKVEKLNAEL-EKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQE 1145
Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPP 274
+ + K+ +++ K K + K PKL +EK K++SS K +
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLK------KKEKKKKKSSADKSKKASVVGNSK 1199
Query: 275 PVAAAD 280
V + +
Sbjct: 1200 RVDSDE 1205
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 47.7 bits (113), Expect = 1e-05
Identities = 35/132 (26%), Positives = 73/132 (55%), Gaps = 2/132 (1%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
+ K + + K+ K ++ + ++++ K + K++ ++ D E+K+ +
Sbjct: 261 LEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 320
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK--DKDKKLKKEKLKKK 226
+++ KK EKE K+EKE + +KE +I++E ++ E+E+ K +K ++L++E L KK
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKK 380
Query: 227 KKEKERSSHEKE 238
K E ER S +
Sbjct: 381 KLESERLSSAAK 392
Score = 47.7 bits (113), Expect = 1e-05
Identities = 33/145 (22%), Positives = 77/145 (53%), Gaps = 3/145 (2%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
E+ + D +++ +E + ++++ + K +++ KE++ +V KE ++ +KE
Sbjct: 223 EEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELE-KEEEILAQVLKENKEEEKE 281
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+K ++EE + KE++E E ++ E + D E++ K +K+ K +++LKK+K+E E
Sbjct: 282 KKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEEL 341
Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRE 258
E + + + E+ K +
Sbjct: 342 EKELKELEIKREAEEEEEEQLEKLQ 366
Score = 45.3 bits (107), Expect = 5e-05
Identities = 24/145 (16%), Positives = 74/145 (51%), Gaps = 3/145 (2%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+ + + +K + + K+ ++E+ +KK +++ + +E++ + ++ K + +
Sbjct: 252 EEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKV 311
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+E++ KE ++ K +KE +K ++E ++ EKE K + ++ ++E+ ++ +K +E+
Sbjct: 312 DDEEKLKESEKELKKLEKEL--KKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKL 369
Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRE 258
+E + K ++ +
Sbjct: 370 EQLEEEL-LAKKKLESERLSSAAKL 393
Score = 45.0 bits (106), Expect = 7e-05
Identities = 36/127 (28%), Positives = 70/127 (55%), Gaps = 3/127 (2%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
+ L + K+K +++K KK+E+E+ K + K+K+ + +EK ++ KV++EKE+ K
Sbjct: 740 KLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKA 799
Query: 176 EKEKKEEKEFKMK---EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
++E+ E ++K E E+ I +EEK E+E + + K +++ K ++E ER
Sbjct: 800 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELER 859
Query: 233 SSHEKEV 239
E
Sbjct: 860 LEEEITK 866
Score = 43.8 bits (103), Expect = 1e-04
Identities = 33/166 (19%), Positives = 81/166 (48%), Gaps = 3/166 (1%)
Query: 76 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE---DLGEIKKDKSIKKDKK 132
+E++ S ++ KE+E E + + + E+K + + +K
Sbjct: 251 QEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 310
Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
+++ ++ +K+ K +K+ K KE+ +E + + ++ + K+E +E++EE+ K++E E
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLE 370
Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ E++ ++K + S K K+ + E +++KE + E
Sbjct: 371 QLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSE 416
Score = 43.8 bits (103), Expect = 2e-04
Identities = 29/150 (19%), Positives = 67/150 (44%), Gaps = 1/150 (0%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
KK++ K ++T E I ++ K + K+K + K + +++ +EE
Sbjct: 170 RKKKERLKKLIEETENLAELI-IDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLL 228
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
+ +E + E +R E+++ E K+ +K++++ + LK+ K+E++ ++E
Sbjct: 229 YLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEE 288
Query: 240 IPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
+ L + E K E + +K
Sbjct: 289 LKLLAKEEEELKSELLKLERRKVDDEEKLK 318
Score = 41.9 bits (98), Expect = 6e-04
Identities = 30/131 (22%), Positives = 74/131 (56%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
++ + +L + + + ++ KK ++ ++E K K +K++ K +E + + +
Sbjct: 681 ELQEKAESELAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELK 740
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
+ K +EKE++EEK KE++E+ ++ +EK+ +E++ +K K ++++ K K +
Sbjct: 741 LLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQ 800
Query: 229 EKERSSHEKEV 239
E+E + E+E+
Sbjct: 801 EEELRALEEEL 811
Score = 41.5 bits (97), Expect = 9e-04
Identities = 28/126 (22%), Positives = 57/126 (45%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
L E K K + E+E + K ++ + ++ + +E E K
Sbjct: 199 QELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSK 258
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+E ++++E +KE+KE+ EK +EE+ K+ ++ +L K + +K E++
Sbjct: 259 QELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLK 318
Query: 234 SHEKEV 239
EKE+
Sbjct: 319 ESEKEL 324
Score = 41.1 bits (96), Expect = 0.001
Identities = 29/127 (22%), Positives = 68/127 (53%), Gaps = 8/127 (6%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+K+ + +++K K E+EK +K K ++++ + +E +E ++ +E++ ++E++
Sbjct: 772 LKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKI 831
Query: 180 KEEK--------EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
KEE+ + + K +K E R EE+ +++E + K+ + E+ K K + +
Sbjct: 832 KEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELES 891
Query: 232 RSSHEKE 238
+ EKE
Sbjct: 892 KEEKEKE 898
Score = 40.7 bits (95), Expect = 0.002
Identities = 22/134 (16%), Positives = 56/134 (41%), Gaps = 1/134 (0%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+ ++ + ++ ++ +K + + ++ + EK+KE+ +E+E+ K
Sbjct: 910 DNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLA 969
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
KEE + EK + KD K+++ +++ K+L +E +++ + + E V
Sbjct: 970 KEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEETCQRFKE-FLELFV 1028
Query: 240 IPKLTFKFGTDMEE 253
E
Sbjct: 1029 SINRGLNKVFFYLE 1042
Score = 39.6 bits (92), Expect = 0.003
Identities = 27/143 (18%), Positives = 68/143 (47%), Gaps = 4/143 (2%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-DKEKKVDKEKEKGKK---EE 176
++ + ++++ + ++ +K+++ ++ K K + + K +KEK+ KE E+ +
Sbjct: 854 EEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDNLL 913
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
+EK+ E E ++ E+ ++ + E+ +E K+K++ K+E+ ++ K+
Sbjct: 914 EEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEEL 973
Query: 237 KEVIPKLTFKFGTDMEEKTKRES 259
V +F E K E
Sbjct: 974 GNVNLMAIAEFEEKEERYNKDEL 996
Score = 39.2 bits (91), Expect = 0.005
Identities = 26/140 (18%), Positives = 63/140 (45%), Gaps = 2/140 (1%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E++K+ + K+ +++E+E+++K ++K ++ + + K K + KE E+
Sbjct: 340 ELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKE--EE 397
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
E K + KE K ++E+ E++K+ ++L++ K+ K E ++
Sbjct: 398 LELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQ 457
Query: 240 IPKLTFKFGTDMEEKTKRES 259
KL + + +
Sbjct: 458 ALKLLKDKLELKKSEDLLKE 477
Score = 36.5 bits (84), Expect = 0.033
Identities = 20/121 (16%), Positives = 58/121 (47%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
+EK KKK+ K + + +++ K + K +++ K+ E+ ++K + E+
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226
Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
+ ++ D ++L +++ ++ + K+ E+E++ ++ + + +EK +E
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQE 286
Query: 259 S 259
Sbjct: 287 E 287
Score = 35.7 bits (82), Expect = 0.049
Identities = 32/131 (24%), Positives = 70/131 (53%), Gaps = 14/131 (10%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
+ ++K + K++K+ +KKE E+ +K + ++ + + +E+ E+ + K + + E
Sbjct: 880 LEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPE 939
Query: 176 E------KEKKEEKEFKMKEDKEKPVEKIRKEEKDS--------EKEKKSKDKDKKLKKE 221
E EK++E++ K +E++ + KEE + +EK+ + +LKKE
Sbjct: 940 ELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKE 999
Query: 222 KLKKKKKEKER 232
+L+++KKE R
Sbjct: 1000 RLEEEKKELLR 1010
Score = 29.6 bits (66), Expect = 4.4
Identities = 21/106 (19%), Positives = 37/106 (34%), Gaps = 4/106 (3%)
Query: 165 VDKEKEKGKKEEKEKKEEKEFKMKED----KEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
+EK K K+ K+ EE E + + K E KE+ E + +L++
Sbjct: 165 GSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEE 224
Query: 221 EKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 266
E L K + L + K + E +I+ +
Sbjct: 225 ENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQ 270
Score = 29.2 bits (65), Expect = 5.9
Identities = 21/141 (14%), Positives = 66/141 (46%)
Query: 70 KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
+++ E S +EK ++KE + + + I + + E + +K + ++
Sbjct: 881 EEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEE 940
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
+++++ K+++++++K K++ V+ +E++E+ + E K +
Sbjct: 941 LLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKER 1000
Query: 190 DKEKPVEKIRKEEKDSEKEKK 210
+E+ E +R+ +++ + K
Sbjct: 1001 LEEEKKELLREIIEETCQRFK 1021
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 47.2 bits (113), Expect = 1e-05
Identities = 26/161 (16%), Positives = 64/161 (39%), Gaps = 6/161 (3%)
Query: 113 TSSEDLGEIKKDKSIKKDKKTHK----KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
SS++L E K+ I + K+H ++E K + K++ +++ K + +E + ++
Sbjct: 15 VSSKELLEKLKELGI--EVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEA 72
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
K + ++ + E +P E ++ + K K KK KKKK
Sbjct: 73 KAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKK 132
Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
K +++ + + + +++ + + +
Sbjct: 133 PKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKPTEKIP 173
Score = 40.2 bits (95), Expect = 0.002
Identities = 25/140 (17%), Positives = 53/140 (37%), Gaps = 5/140 (3%)
Query: 62 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
+ K K + + K ++E + + +E +K + +
Sbjct: 41 EEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAA----- 95
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
+ + + + K KK KK K K K+ ++ ++ KG K + ++
Sbjct: 96 ARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRG 155
Query: 182 EKEFKMKEDKEKPVEKIRKE 201
+ + K+ K+KP EKI +E
Sbjct: 156 RRRRRKKKKKQKPTEKIPRE 175
Score = 38.7 bits (91), Expect = 0.007
Identities = 21/131 (16%), Positives = 52/131 (39%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
K + E+ ++ + + ++ K + ++ + + + E + + E
Sbjct: 49 KREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAE 108
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
+ K KK K+ K + K+K ++ K K + + + ++ + KKKK++
Sbjct: 109 AAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGRRRRRKKKKKQKP 168
Query: 231 ERSSHEKEVIP 241
+ VIP
Sbjct: 169 TEKIPREVVIP 179
Score = 36.0 bits (84), Expect = 0.046
Identities = 17/109 (15%), Positives = 36/109 (33%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
K + KE K+ +K K E +E+E K+E K + EE+ E
Sbjct: 2 SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAE 61
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ + + + + + + E + +++ E
Sbjct: 62 EAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAA 110
Score = 35.6 bits (83), Expect = 0.050
Identities = 18/121 (14%), Positives = 41/121 (33%), Gaps = 10/121 (8%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKD-----KKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
+ + K KE ++K K+ K ++ +E KE E ++ +EE
Sbjct: 2 SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKE-----EAKREAEEEA 56
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
+ + E+ + ++E E + E + + E + E +
Sbjct: 57 KAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKA 116
Query: 238 E 238
+
Sbjct: 117 K 117
Score = 31.0 bits (71), Expect = 1.4
Identities = 21/132 (15%), Positives = 46/132 (34%), Gaps = 3/132 (2%)
Query: 152 DKIKNKEKDKEKKVDKEK--EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
K++ E KE V ++ EK K+ E K +E+ K K R+ E++++ E
Sbjct: 2 SKVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAK-REAEEEAKAEA 60
Query: 210 KSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
+ + ++ K + + E + + D + ++ + K
Sbjct: 61 EEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAA 120
Query: 270 SPSPPPVAAADE 281
P +
Sbjct: 121 KKKKGPKPKKKK 132
Score = 30.6 bits (70), Expect = 2.2
Identities = 19/151 (12%), Positives = 48/151 (31%), Gaps = 4/151 (2%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
K ++ + + + EK KE ++ + EE+E ++E+ + E++ K
Sbjct: 3 KVRVYELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKA---- 58
Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
EE + + ++ + + + + + + ++
Sbjct: 59 EAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKK 118
Query: 259 SSPKIVIKPVKSPSPPPVAAADEYDTGDSKQ 289
++ K K P AA K
Sbjct: 119 AAKKKKGPKPKKKKPKRKAARGGKRGKGGKG 149
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 45.9 bits (109), Expect = 3e-05
Identities = 28/120 (23%), Positives = 49/120 (40%), Gaps = 1/120 (0%)
Query: 74 SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 133
S + K K P YS V + + ++ + KK+ + KK
Sbjct: 10 SFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKK 69
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
+KK+K+K +KK+ K + + K K +K +K+ K + E + K+ E EK
Sbjct: 70 SEKKKKKKKEKKEPKSEG-ETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEK 128
Score = 43.2 bits (102), Expect = 2e-04
Identities = 32/124 (25%), Positives = 46/124 (37%), Gaps = 4/124 (3%)
Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
T L I S + + K K K ++K+ E K EK+K
Sbjct: 16 TQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKK 75
Query: 173 KK-EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
KK E+KE K E E K+ K +K +K K K ++D D K +K
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKS---KKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132
Query: 232 RSSH 235
+H
Sbjct: 133 IGAH 136
Score = 37.8 bits (88), Expect = 0.010
Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 2/94 (2%)
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
++ + KE E+ K S K+ K +DK+ + KK+ KKKKK
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76
Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 262
+KE+ + E KL FK T + K ++ PK
Sbjct: 77 KKEKKEPKSEGETKLGFK--TPKKSKKTKKKPPK 108
Score = 37.4 bits (87), Expect = 0.011
Identities = 26/118 (22%), Positives = 50/118 (42%), Gaps = 9/118 (7%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+ +K K + K+ + +NK K DK+++K + +K+
Sbjct: 11 FFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKS 70
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
+++K K+K +K K E +++ K+ K KK KK+ K K E ++ K
Sbjct: 71 EKKK-------KKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNK 121
Score = 37.4 bits (87), Expect = 0.014
Identities = 28/126 (22%), Positives = 48/126 (38%), Gaps = 10/126 (7%)
Query: 140 EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR 199
+K K + + K+ +E+ K KK++KE K E K+K +K
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTST-KKDKKEDKNNESKKKSEKK-- 73
Query: 200 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEE-KTKRE 258
+K+KK K + K + KL K +K + + +K PK + E
Sbjct: 74 ------KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAE 127
Query: 259 SSPKIV 264
S +
Sbjct: 128 KSNVYI 133
Score = 35.1 bits (81), Expect = 0.065
Identities = 25/113 (22%), Positives = 47/113 (41%), Gaps = 4/113 (3%)
Query: 154 IKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
+ + + KE E+E K+ K K +KE+K++E +KKS+
Sbjct: 16 TQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTK---KDKKEDKNNESKKKSEK 72
Query: 214 KDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 266
K KK KKEK + K + + + + K T K + +++ + +
Sbjct: 73 K-KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAE 124
Score = 31.6 bits (72), Expect = 0.83
Identities = 18/109 (16%), Positives = 39/109 (35%), Gaps = 8/109 (7%)
Query: 66 KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 125
V K+ S + +K T+ +K+ + K+ E + K + + KK+
Sbjct: 32 LVLSKEILSTFSEEENKVATTSTKKDKKEDKNNE--------SKKKSEKKKKKKKEKKEP 83
Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
+ + K K K+ K KK ++ + K + ++
Sbjct: 84 KSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132
Score = 30.5 bits (69), Expect = 1.7
Identities = 18/98 (18%), Positives = 37/98 (37%), Gaps = 3/98 (3%)
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
++ K + + K + EE E + + K K++K + KK+ E+
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEE---ENKVATTSTKKDKKEDKNNESKKKSEKK 73
Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSP 271
+K+ + + T + KT ++S P P
Sbjct: 74 KKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 46.2 bits (110), Expect = 3e-05
Identities = 26/122 (21%), Positives = 62/122 (50%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
GEI + ++ + ++ +E++++ K+K + K + +E++ + ++ +E KE
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE 361
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ EEK + E+ E+ E +R+E + E E + + K +++ ++ ER S E
Sbjct: 362 ELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421
Query: 239 VI 240
+
Sbjct: 422 DL 423
Score = 42.4 bits (100), Expect = 5e-04
Identities = 24/129 (18%), Positives = 65/129 (50%), Gaps = 7/129 (5%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
+ + ++ K +KE E+++++ + +++ + +E+ +E + + E+ K + EE ++
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283
Query: 182 EK-------EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
E+ + E+ E + +R+ ++ E E + ++ + KEK++ K+E E
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERE 343
Query: 235 HEKEVIPKL 243
E + +L
Sbjct: 344 TLLEELEQL 352
Score = 39.7 bits (93), Expect = 0.003
Identities = 22/123 (17%), Positives = 66/123 (53%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
L E++++ +++ ++ E+++++ + ++ K KE+ +E + ++ + + E
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
E E++ E+ + + E+ +E + + + E+E + +++ + +EKL + ++E E
Sbjct: 797 ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK 856
Query: 236 EKE 238
E E
Sbjct: 857 ELE 859
Score = 39.3 bits (92), Expect = 0.004
Identities = 20/131 (15%), Positives = 67/131 (51%), Gaps = 9/131 (6%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-------EKDKEKKVDKE 168
E+L ++++ +++ ++ ++ EKE + K + ++ + + K++ ++++ E
Sbjct: 246 EELSRLEEE--LEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGE 303
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
++ +E + E E + +E + +E+ E+E ++ ++ L + + K++
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363
Query: 229 EKERSSHEKEV 239
E++ S+ +E+
Sbjct: 364 EEKLSALLEEL 374
Score = 38.2 bits (89), Expect = 0.009
Identities = 28/113 (24%), Positives = 61/113 (53%), Gaps = 8/113 (7%)
Query: 131 KKTHKKKEK--EKIKKKKDKKDK--DKIKNKEKDKEK-KVDKEKEKGKKEEKEKKEEKEF 185
K ++KE+ K+++ ++ ++ D ++ EK EK + EK + +E K + E E
Sbjct: 168 SKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL 227
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ K ++++RKE ++ E+E +++ + +E+L++ +KE E E E
Sbjct: 228 ALLLAK---LKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELE 277
Score = 37.8 bits (88), Expect = 0.014
Identities = 21/126 (16%), Positives = 73/126 (57%), Gaps = 1/126 (0%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
E+L +++ + K++ +++++ ++++ ++ +++ + + + + + E + ++
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRR 823
Query: 175 EEKEKK-EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
E E++ EE E +++E +EK E + E+ ++ ++ K++ ++L+ EK + + + KE
Sbjct: 824 ERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELE 883
Query: 234 SHEKEV 239
++E+
Sbjct: 884 EEKEEL 889
Score = 36.2 bits (84), Expect = 0.036
Identities = 21/127 (16%), Positives = 66/127 (51%), Gaps = 4/127 (3%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+ L E+++ + + K+E ++++ ++ + +++ E++ E+ ++ +E
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEEL-EQLQSRLEELEEELEELEEELEEL-- 756
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLKKEKLKKKKKEKER 232
+E E+ EE+ ++E K E+I + E+ + ++ ++ +++L++ + + E+E
Sbjct: 757 QERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALEREL 816
Query: 233 SSHEKEV 239
S E+
Sbjct: 817 ESLEQRR 823
Score = 36.2 bits (84), Expect = 0.037
Identities = 21/118 (17%), Positives = 68/118 (57%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
E+L E ++ + + ++ +E+++++ ++ +++ + +EK E + + E+ + +
Sbjct: 799 EEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKEL 858
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
EE +++ E+ KE+ E ++++ +E+++ E+E + + + KE+++K ++ E
Sbjct: 859 EELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916
Score = 36.2 bits (84), Expect = 0.037
Identities = 25/124 (20%), Positives = 69/124 (55%), Gaps = 7/124 (5%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK-DKIKNKEKDKEKKVDKEKEKGKK 174
L +++ +D +K+ EK++++ +K ++ ++K + ++ E + K K +
Sbjct: 179 RKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELR 238
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKE-----KLKKKKK 228
+E E+ EE+ +++E+ E+ E++ + EK+ E + + ++ ++L++ +LK++ +
Sbjct: 239 KELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIE 298
Query: 229 EKER 232
E E
Sbjct: 299 ELEG 302
Score = 35.8 bits (83), Expect = 0.045
Identities = 23/124 (18%), Positives = 65/124 (52%), Gaps = 6/124 (4%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
L E+++ K + ++E E+++ ++ ++ + +++ +++ + E+ +EE
Sbjct: 711 LEELERQLEELKRELAALEEELEQLQS--RLEELEEELEELEEELEELQERLEE-LEEEL 767
Query: 178 EKKEEKEFKMKEDKEKP---VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
E EE K+KE+ E+ + +++E ++ E+E + ++ + +L+ ++ +ER
Sbjct: 768 ESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLE 827
Query: 235 HEKE 238
E E
Sbjct: 828 QEIE 831
Score = 35.8 bits (83), Expect = 0.048
Identities = 21/127 (16%), Positives = 61/127 (48%), Gaps = 4/127 (3%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKVDKEKEKGK 173
E+L E + + ++ KE++++K ++ +++ +++ +++ E + +
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIR 396
Query: 174 KEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E +E K E E + E + +E +++E K+ E E + + + E+L++ +++ E
Sbjct: 397 NELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLE 456
Query: 232 RSSHEKE 238
+
Sbjct: 457 ELRDRLK 463
Score = 34.7 bits (80), Expect = 0.12
Identities = 25/130 (19%), Positives = 64/130 (49%), Gaps = 11/130 (8%)
Query: 118 LGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
L E++K + +++ + K + +++K + + + + K K+ K++++ +E+ + E
Sbjct: 195 LEELEKQLEKLERQAE--KAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLE 252
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD--------KDKKLKKEKLKKKKK 228
+E +E +E + +KE K EE E E+ ++ ++ + + L+++ +
Sbjct: 253 EELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLE 312
Query: 229 EKERSSHEKE 238
E E E E
Sbjct: 313 ELENELEELE 322
Score = 33.1 bits (76), Expect = 0.36
Identities = 20/121 (16%), Positives = 64/121 (52%), Gaps = 1/121 (0%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E+K K+ + + ++ + ++++ + + ++ K + ++E + + + + +EE E+
Sbjct: 689 ELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEE 748
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKERSSHEKE 238
EE+ +++E E+ E++ E+ K K+ ++ + K + + + ++ E+E E+
Sbjct: 749 LEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERR 808
Query: 239 V 239
+
Sbjct: 809 L 809
Score = 32.8 bits (75), Expect = 0.46
Identities = 27/121 (22%), Positives = 66/121 (54%), Gaps = 1/121 (0%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
E++ + ++ + ++ +E+I++ ++K D+ + + +E +KE + KE+ + + EKE
Sbjct: 814 RELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE 873
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ E E K E++++ +E+ +E + E K + + + + E+L+ K + E E E
Sbjct: 874 -ELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELE 932
Query: 239 V 239
Sbjct: 933 E 933
Score = 32.4 bits (74), Expect = 0.63
Identities = 16/120 (13%), Positives = 69/120 (57%), Gaps = 4/120 (3%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
E+ + + ++ ++ +E++++ +++ ++++ + ++ E++++ +E K ++E
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEE--LEELQERLEELEEELESLEEALAKLKEE 780
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+E +E + +E +E++ +E +++E+ + +++ + +++ ++ ++E E E E
Sbjct: 781 IEELEEKRQALQEE--LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIE 838
Score = 31.6 bits (72), Expect = 0.94
Identities = 19/64 (29%), Positives = 37/64 (57%), Gaps = 3/64 (4%)
Query: 172 GKKEEKEKKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
G + KE+KEE E K++ E+ + +E + EE + + EK + +K + ++LK + +E
Sbjct: 166 GVSKYKERKEEAERKLERTEENLERLEDLL-EELEKQLEKLERQAEKAERYQELKAELRE 224
Query: 230 KERS 233
E +
Sbjct: 225 LELA 228
Score = 31.6 bits (72), Expect = 1.2
Identities = 15/118 (12%), Positives = 51/118 (43%), Gaps = 7/118 (5%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKE----KEKIKK---KKDKKDKDKIKNKEKDKEKKVDK 167
+L E+K++ +++ + KE++K+ + ++ + + E+ +E +
Sbjct: 396 RNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
E+ + + +E E++ + + + EK + + E E+++ + + +
Sbjct: 456 EELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESG 513
Score = 30.5 bits (69), Expect = 2.6
Identities = 22/94 (23%), Positives = 59/94 (62%), Gaps = 3/94 (3%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
E++ E+++ +++ +KE E++K++ ++ + +K + +D+ K++++EKE+ ++E
Sbjct: 835 EEIEELEEKLDELEEELEELEKELEELKEELEELEAEK--EELEDELKELEEEKEELEEE 892
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
+E + E ++KE+ EK E++ + E E+ +
Sbjct: 893 LRELESELA-ELKEEIEKLRERLEELEAKLERLE 925
Score = 29.7 bits (67), Expect = 3.6
Identities = 24/122 (19%), Positives = 68/122 (55%), Gaps = 11/122 (9%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
E L E++++ + + K+E E+++ +K++ + + + +E+ ++++++E + + E
Sbjct: 842 EKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEE--KEELEEELRELESE 899
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRK--------EEKDSEKEKKSKDKDKKLKKEKLKKKK 227
E KEE E K++E E+ K+ + EE+ E+ + + + + + + E+L+++
Sbjct: 900 LAELKEEIE-KLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEI 958
Query: 228 KE 229
+
Sbjct: 959 EA 960
Score = 29.7 bits (67), Expect = 3.7
Identities = 18/129 (13%), Positives = 62/129 (48%), Gaps = 26/129 (20%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-----DKEKKVDKEKEKGKKEEKEKKEEK 183
K+ + K++ +E++++ + + + K + ++ + E K+++ + + + E+E +EE
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEY 939
Query: 184 EFKMKEDKEKPVEKI---------------------RKEEKDSEKEKKSKDKDKKLKKEK 222
E ++ + E+ +E++ + ++ + +++ ++ K+ E
Sbjct: 940 EDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEV 999
Query: 223 LKKKKKEKE 231
+++ KEK
Sbjct: 1000 IEELDKEKR 1008
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 45.6 bits (108), Expect = 4e-05
Identities = 25/97 (25%), Positives = 47/97 (48%), Gaps = 12/97 (12%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
K + +K + K +E +K+ K K +V E + +E + K
Sbjct: 350 TKLYEEVKSNTDLSGDKRQELLKEYNKKLQD------YTKKLGEVKDETDA-SEEAEAKA 402
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
+E++ K +E+++K +KE+ D +KEK+ KD+ KK
Sbjct: 403 KEEKLKQEENEKK-----QKEQADEDKEKRQKDERKK 434
Score = 44.0 bits (104), Expect = 1e-04
Identities = 20/86 (23%), Positives = 48/86 (55%), Gaps = 2/86 (2%)
Query: 142 IKKKKDKKDKDKIKNKEKDKE-KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
K ++ K + ++ + K+ +K+ + K+ E K+E + +E + K E+ K
Sbjct: 350 TKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETD-ASEEAEAKAKEEKLK 408
Query: 201 EEKDSEKEKKSKDKDKKLKKEKLKKK 226
+E++ +K+K+ D+DK+ +++ +KK
Sbjct: 409 QEENEKKQKEQADEDKEKRQKDERKK 434
Score = 41.7 bits (98), Expect = 5e-04
Identities = 29/102 (28%), Positives = 52/102 (50%), Gaps = 4/102 (3%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
KE I + D D + + +E K + + K++E K+ K+ + K V
Sbjct: 330 KEAINIARNLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEV- 388
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
K+E D+ +E ++K K++KLK+E+ +KK+KE+ EK
Sbjct: 389 ---KDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKR 427
Score = 37.1 bits (86), Expect = 0.016
Identities = 21/100 (21%), Positives = 52/100 (52%), Gaps = 4/100 (4%)
Query: 94 KVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK 153
+ D +L + + +V S+ DL K+ + +K+ K + K+ + K + ++
Sbjct: 338 NLDDNDLTLLALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEE 397
Query: 154 IKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
+ K K+++ K ++ ++K K++ E KE++ ++D+ K
Sbjct: 398 AEAKAKEEKLKQEENEKKQKEQADEDKEKR----QKDERK 433
Score = 33.6 bits (77), Expect = 0.19
Identities = 16/59 (27%), Positives = 34/59 (57%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
+ L + K KD+ ++ + K K++K K+++++ K KE+ E K ++K++ KK
Sbjct: 376 KKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 45.4 bits (108), Expect = 4e-05
Identities = 25/93 (26%), Positives = 52/93 (55%), Gaps = 2/93 (2%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
+KS++K ++ K + + + KK + I + ++ + K +++ EK E KEKK K
Sbjct: 276 EKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKK--K 333
Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
E K +E K+K +E++ + + E + K+++K
Sbjct: 334 EKKKEEKKKKQIERLEERIEKLEVQATDKEENK 366
Score = 40.4 bits (95), Expect = 0.001
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 6/83 (7%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
K +EK K K ++ K E + + K+K E+ EK+ E K+ +KEKK ++K
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340
Query: 215 DKKL------KKEKLKKKKKEKE 231
KK + EKL+ + +KE
Sbjct: 341 KKKQIERLEERIEKLEVQATDKE 363
Score = 38.1 bits (89), Expect = 0.008
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
K EK EK +K K + ++ KK F+M D ++ ++ K E+D+EK +
Sbjct: 273 KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKS--KFERDNEKLDAEVKE 330
Query: 215 DKKLKKEKLKKKKK 228
KK KK++ KKKK+
Sbjct: 331 KKKEKKKEEKKKKQ 344
Score = 35.4 bits (82), Expect = 0.058
Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 5/79 (6%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK- 174
+ L K K K+ K K ++D +K+ + K+K+K+ KE++K K+
Sbjct: 287 KALKYQLKRLKKMILLFEMISDLKRKL-KSKFERDNEKLDAEVKEKKKEKKKEEKKKKQI 345
Query: 175 ---EEKEKKEEKEFKMKED 190
EE+ +K E + KE+
Sbjct: 346 ERLEERIEKLEVQATDKEE 364
Score = 33.5 bits (77), Expect = 0.20
Identities = 17/60 (28%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK---EEKEKKEEKE 184
+K K E++ K + K+K K K KE+ K+K++++ +E+ +K + +K+E K
Sbjct: 308 DLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENKT 367
Score = 33.1 bits (76), Expect = 0.27
Identities = 20/73 (27%), Positives = 36/73 (49%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+K K + + +K K K ++D K + KEKK +K+KE+ KK++ E+
Sbjct: 288 ALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIER 347
Query: 180 KEEKEFKMKEDKE 192
EE+ K++
Sbjct: 348 LEERIEKLEVQAT 360
Score = 33.1 bits (76), Expect = 0.27
Identities = 13/63 (20%), Positives = 34/63 (53%)
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
+EK + ++++K K+ + + + + K + ++KL E +KKK++K+ +
Sbjct: 283 QEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKK 342
Query: 237 KEV 239
K++
Sbjct: 343 KQI 345
Score = 30.8 bits (70), Expect = 1.7
Identities = 19/65 (29%), Positives = 39/65 (60%), Gaps = 7/65 (10%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
K K K ++ ++K + E +KKK+KK KE+ K+K++++ +E+ +K E + +
Sbjct: 309 LKRKLKSKFERDNEKLDAEVKEKKKEKK-------KEEKKKKQIERLEERIEKLEVQATD 361
Query: 182 EKEFK 186
++E K
Sbjct: 362 KEENK 366
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 45.2 bits (107), Expect = 4e-05
Identities = 23/109 (21%), Positives = 55/109 (50%), Gaps = 1/109 (0%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
K ++ KK ++ E+ + ++ ++ + +++ + ++ E+ + EEK+K+ E
Sbjct: 63 AKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAE 122
Query: 183 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E K K+ E + + EK +++E K + +++ K + KKK E
Sbjct: 123 -EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAE 170
Score = 44.8 bits (106), Expect = 6e-05
Identities = 25/104 (24%), Positives = 55/104 (52%), Gaps = 1/104 (0%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK-EFK 186
+ ++ +K+ +++ +K K ++ + ++K+K+ ++ K K E K K E + E K
Sbjct: 84 RAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKK 143
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
KE+ +K E+ K + +E +KK+ + KK + E K + +
Sbjct: 144 AKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKA 187
Score = 42.5 bits (100), Expect = 3e-04
Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 1/111 (0%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
K K ++ K + K K + + +KK K++ K + +++ K + K K E +KK E
Sbjct: 117 KQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAE 176
Query: 183 KEFKMKED-KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
E K K + K K + K + ++ K K + + K + E E ER
Sbjct: 177 AEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAER 227
Score = 42.1 bits (99), Expect = 3e-04
Identities = 27/109 (24%), Positives = 51/109 (46%), Gaps = 1/109 (0%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKVDKEKEKGKKEEKEK 179
K+ + +K K+ E+ + ++ +K ++ K K + + K + E EK KEE +K
Sbjct: 91 QKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKK 150
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
+ E+E K K E + ++K + K + K K E+ K K +
Sbjct: 151 QAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAE 199
Score = 33.3 bits (76), Expect = 0.21
Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 2/74 (2%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
K+++ K+ ++ K K + KKK KK + + + + K E+ K K E +
Sbjct: 144 AKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKA 203
Query: 180 KEEKEFKMKEDKEK 193
K E K + E
Sbjct: 204 KAAAEAAAKAEAEA 217
Score = 31.3 bits (71), Expect = 0.94
Identities = 16/96 (16%), Positives = 49/96 (51%), Gaps = 3/96 (3%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
++ K KE++++KK++++ E+ ++++ ++ ++ ++++ + E+ +
Sbjct: 56 QQQKKPAAKKEQERQKKLEQQAEE-AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQA 114
Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
E+K K ++ K K + K K + EK+ +
Sbjct: 115 EEKQKQAEEA--KAKQAAEAKAKAEAEAEKKAKEEA 148
Score = 31.0 bits (70), Expect = 1.3
Identities = 16/94 (17%), Positives = 44/94 (46%), Gaps = 7/94 (7%)
Query: 152 DKIKNKEKDKEKKVDK------EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 205
++I+ ++K KK + ++ + ++++ ++ ++ ++++ + E+
Sbjct: 53 NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112
Query: 206 EKEKKSK-DKDKKLKKEKLKKKKKEKERSSHEKE 238
+ E+K K ++ K K+ K K E E KE
Sbjct: 113 QAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKE 146
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 45.7 bits (108), Expect = 4e-05
Identities = 27/99 (27%), Positives = 57/99 (57%), Gaps = 5/99 (5%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK-KEEKEFKMKEDKEKPVEKIRKEE--KD 204
+ + I N ++ +EK +K+ +K + E KEK ++KE + ++ + PV++ +E K
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKI 64
Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIP 241
+++ KKS ++ K E LK K++ ++ +E ++ IP
Sbjct: 65 ADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIP 103
Score = 40.3 bits (94), Expect = 0.002
Identities = 33/146 (22%), Positives = 67/146 (45%), Gaps = 21/146 (14%)
Query: 62 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
K +K K++ KEK+ K++ + S VK+ SS++ +
Sbjct: 20 KDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKE---------------QSSQE--NL 62
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD----KEKKVDKEKEKGKKEEK 177
K +KK K K+ E +K K++ + + + + +K + K+ +K + K E+
Sbjct: 63 KIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQ 122
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEK 203
KK+ K F++ E K+ P+ + E++
Sbjct: 123 AKKQTKLFRLFEPKQLPIYRANGEKE 148
Score = 36.5 bits (84), Expect = 0.035
Identities = 21/100 (21%), Positives = 45/100 (45%), Gaps = 6/100 (6%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
+ + ++ + K + + K++++ KEK +KK + + KE+ ++ K
Sbjct: 5 NRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLK- 63
Query: 188 KEDKEKPVEKIRKEEKDSEKE--KKSKDKDKKLKKEKLKK 225
V+K KEE E K ++ K+++ E L+K
Sbjct: 64 ---IADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEILQK 100
Score = 33.4 bits (76), Expect = 0.26
Identities = 28/112 (25%), Positives = 53/112 (47%), Gaps = 5/112 (4%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKK-----DKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
++K +K + ++ + KEK+ KK D D + ++ K D+ K+ K+E
Sbjct: 17 MQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEES 76
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
K+ E + K + KE E ++K E ++ K + +K E+ KK+ K
Sbjct: 77 KQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQAKKQTK 128
Score = 31.1 bits (70), Expect = 1.5
Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 6/65 (9%)
Query: 201 EEKDSEKEK---KSKDKDKKLKKEKLKKKKK---EKERSSHEKEVIPKLTFKFGTDMEEK 254
D +EK K K++ KEK+ KK+ S KE + K ++++
Sbjct: 12 NNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKS 71
Query: 255 TKRES 259
TK ES
Sbjct: 72 TKEES 76
Score = 29.1 bits (65), Expect = 5.9
Identities = 28/108 (25%), Positives = 48/108 (44%), Gaps = 8/108 (7%)
Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE-KKVDKEKEK 171
+D + +K++ K+K KK+E K++ +N + E KK KE+ K
Sbjct: 18 QEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKKSTKEESK 77
Query: 172 ------GKKEEKEKKEEKEFKMKEDKE-KPVEKIRKEEKDSEKEKKSK 212
KEE +K+ + E K +P E I K+ +D + E+ K
Sbjct: 78 QLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLEDIKPEQAKK 125
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 45.6 bits (109), Expect = 5e-05
Identities = 32/111 (28%), Positives = 60/111 (54%), Gaps = 12/111 (10%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK----KEEKEFKMKED 190
K +E E + K+ +K ++ + KEK +E++ DK E+ +KE ++ K+E + +KE
Sbjct: 535 KAEEAEALLKEAEKLKEELEEKKEKLQEEE-DKLLEEAEKEAQQAIKEAKKEADEIIKEL 593
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKKKKEKERSSHEKEV 239
++ +K S K + + K+L K EK +KKKK+++ E +V
Sbjct: 594 RQL-----QKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKV 639
Score = 33.3 bits (77), Expect = 0.27
Identities = 30/128 (23%), Positives = 51/128 (39%), Gaps = 21/128 (16%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG------- 172
EI K+ + K E I+ +K ++ K K+K K+K+ +E + G
Sbjct: 588 EIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLS 647
Query: 173 ---KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
K E ++KE ++ + K++ D EK +K KK+K KK K
Sbjct: 648 LGQKGEVLSIPDDKEAIVQAG----IMKMKVPLSDLEKIQKP-------KKKKKKKPKTV 696
Query: 230 KERSSHEK 237
K +
Sbjct: 697 KPKPRTVS 704
Score = 30.2 bits (69), Expect = 2.5
Identities = 37/154 (24%), Positives = 65/154 (42%), Gaps = 39/154 (25%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKN--KEKDKEKKVDKEKEKGK------------- 173
++KK ++E++K+ ++ +K+ + IK KE D+ K ++ +KG
Sbjct: 554 EEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEAR 613
Query: 174 ---KEEKEKKEEKEFKMKEDKEKPVE----KIRKEEKDSEKEKKSKDKD----------- 215
+ EKKE+K+ K KE +E+ K + E DK+
Sbjct: 614 KRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIMKMK 673
Query: 216 ------KKLKKEKLKKKKKEKERSSHEKEVIPKL 243
+K++K K KKKKK K + V +L
Sbjct: 674 VPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLEL 707
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 45.2 bits (107), Expect = 5e-05
Identities = 24/103 (23%), Positives = 41/103 (39%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
+KK H+K +K+ K K KD K K K K+K K + + +
Sbjct: 48 INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFA 107
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
+ + K + + K+K + +K KK+K+ K E
Sbjct: 108 GPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELI 150
Score = 35.2 bits (81), Expect = 0.073
Identities = 28/125 (22%), Positives = 47/125 (37%), Gaps = 19/125 (15%)
Query: 135 KKKEKEKIKKKKDKK------DKDKIKNKEKDKEKK--------VDKEKEKGKKEEKEKK 180
K KI+ + D K D +K +++ DK+ K D +K K K+++K KK
Sbjct: 26 LKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKKKIKK 85
Query: 181 -----EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
++ + K K +K KK KK+K+ S
Sbjct: 86 KLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKKVLSS 145
Query: 236 EKEVI 240
+ E+I
Sbjct: 146 KDELI 150
Score = 34.8 bits (80), Expect = 0.094
Identities = 26/102 (25%), Positives = 37/102 (36%), Gaps = 1/102 (0%)
Query: 71 DKTSPKEKMSSKEKTSPKEKEYSKVK-DVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
DK + K + K S K K K K +L + N ++ S+ +
Sbjct: 57 DKKNKKFNKTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMR 116
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
K +K K+KI K K K KD+ K D K K
Sbjct: 117 KPKPKTEKLKKKITVNKSTNKKKKKVLSSKDELIKYDNNKPK 158
Score = 34.0 bits (78), Expect = 0.16
Identities = 29/159 (18%), Positives = 49/159 (30%), Gaps = 24/159 (15%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK-------EKGKKEEKEKKE 181
+ T K IK K K ++++ KD +D K +K KK K
Sbjct: 10 SSENTIDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTDDL 69
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL------KKKKKEKERSSH 235
+ K K ++K ++K + D + SK+ K+ K K EK +
Sbjct: 70 KDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKI 129
Query: 236 EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPP 274
K K++ + P
Sbjct: 130 TVNKSTN-----------KKKKKVLSSKDELIKYDNNKP 157
Score = 32.5 bits (74), Expect = 0.48
Identities = 31/159 (19%), Positives = 51/159 (32%), Gaps = 12/159 (7%)
Query: 70 KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSI-- 127
+S K ++ S K S + +K +L K ++DL + KK K
Sbjct: 26 LKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNK-----KFNKTDDLKDSKKTKLKQK 80
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
KK KK + K +K + EK KK+ K
Sbjct: 81 KKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNK-----ST 135
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
+ K+K + + K + KS L ++L
Sbjct: 136 NKKKKKVLSSKDELIKYDNNKPKSISIHSPLTIQELSTL 174
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 44.8 bits (106), Expect = 7e-05
Identities = 35/128 (27%), Positives = 55/128 (42%), Gaps = 5/128 (3%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKK--KKDKKDKDKIKNKEKDKEKKVDKEK--EKGK 173
L K + SIK +KKT + KK ++ +K+ + K K +K +
Sbjct: 368 LENKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNN 427
Query: 174 KEEKEKKEEK-EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
KE+ KKEE E + + EK + + ++K +KD +L K L+K K K
Sbjct: 428 KEQDIKKEELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNMLQKFNKFKNE 487
Query: 233 SSHEKEVI 240
S E I
Sbjct: 488 ESAENSFI 495
Score = 34.4 bits (79), Expect = 0.11
Identities = 43/216 (19%), Positives = 74/216 (34%), Gaps = 35/216 (16%)
Query: 48 PQPLVHPPGTPPPSKMKDKVSPKDKTS------PKEKMSSKEKTSPKEKEYSKVKDVELG 101
P+ +K + V +K S K K+ +K + + +K EL
Sbjct: 379 PEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNKEQDIKKEELL 438
Query: 102 VTPMVPNIVKVTSS---EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE 158
V K+ + D E+ K K KD + K ++K K+++ +N
Sbjct: 439 EKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIEL----SKNMLQKFNKFKNEESAENSF 494
Query: 159 KDKEKKVDK-----------------EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 201
D K+ D E K + + K K +K K V I K+
Sbjct: 495 IDIIKQADLVLWVSPNALILGVEDLGLIELINKVNESFEFIKTIFKKFNKNKLVIAISKK 554
Query: 202 EK----DSEKEKKSKDKDKKLKKEKLKKK-KKEKER 232
+ + + +K + + + LK KKEK R
Sbjct: 555 QATKITNIKNQKLKTNIINDVDIDDLKMLIKKEKTR 590
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 43.2 bits (102), Expect = 9e-05
Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 7/84 (8%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
D++ KK + ++ + ++ K+++K +DKE K D EKE + EE+ ++E
Sbjct: 45 AADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE------ 98
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKS 211
E+ EK EEK +K
Sbjct: 99 DEESSDENEKE-TEEKTESNVEKE 121
Score = 35.5 bits (82), Expect = 0.030
Identities = 18/97 (18%), Positives = 46/97 (47%), Gaps = 13/97 (13%)
Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
D+ D+ + K+ D ++ + E+ K +++E E+KE K +KE
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKED---------- 88
Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
+E + +++++ + +K+ E++ S+ ++ I
Sbjct: 89 ---EESEEENEEEDEESSDENEKETEEKTESNVEKEI 122
Score = 34.7 bits (80), Expect = 0.055
Identities = 17/85 (20%), Positives = 43/85 (50%)
Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
S + + ++ K + ++ KE+ K+ + +DK+ + EK+ E+ ++ +E+
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEED 99
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEK 197
++ E ++E E K + + EK +
Sbjct: 100 EESSDENEKETEEKTESNVEKEITN 124
Score = 33.9 bits (78), Expect = 0.091
Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
+++ K + + E+ ++EKE E+KE K + E + +E +E+ +
Sbjct: 40 PSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEE--NEE 97
Query: 199 RKEEKDSEKEKKSKDKD 215
EE E EK++++K
Sbjct: 98 EDEESSDENEKETEEKT 114
Score = 32.8 bits (75), Expect = 0.23
Identities = 25/94 (26%), Positives = 50/94 (53%), Gaps = 9/94 (9%)
Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
D+ +E K + ++ + E+ K+EEKE EDKE +K E++D E
Sbjct: 39 SPSDQAAADEQEAKK-----SDDQETAEIEEVKEEEKEAANSEDKE---DKGDAEKEDEE 90
Query: 207 KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
E++++ ++ + ++ +K+ +EK S+ EKE+
Sbjct: 91 SEEENE-EEDEESSDENEKETEEKTESNVEKEIT 123
Score = 32.0 bits (73), Expect = 0.39
Identities = 20/84 (23%), Positives = 40/84 (47%), Gaps = 5/84 (5%)
Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
D E + + +K+++K E ++ K +K+D++ +E+++E+ + E
Sbjct: 51 AKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEES---EEENEEEDEESSDENE 107
Query: 173 KKEEKEKKEEKEFKMKEDKE-KPV 195
K+ E EK E K + KPV
Sbjct: 108 KETE-EKTESNVEKEITNPSWKPV 130
Score = 30.9 bits (70), Expect = 0.92
Identities = 15/72 (20%), Positives = 38/72 (52%), Gaps = 1/72 (1%)
Query: 172 GKKEEKEKKEEKEFKMKEDKE-KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
++ +E+E K +D+E +E++++EEK++ + +DK K+++ +++ E+
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 231 ERSSHEKEVIPK 242
E E +
Sbjct: 98 EDEESSDENEKE 109
Score = 29.7 bits (67), Expect = 2.1
Identities = 10/66 (15%), Positives = 32/66 (48%)
Query: 166 DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
++ +E+E K+ + + E +E E+ + +++K +K+ + +E+ ++
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 226 KKKEKE 231
+ +E
Sbjct: 98 EDEESS 103
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 44.3 bits (105), Expect = 1e-04
Identities = 28/111 (25%), Positives = 47/111 (42%), Gaps = 12/111 (10%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
E + KK ++K K I + EK K + E +EEKE + + E EK
Sbjct: 1494 GNENVNKKINQKKKGFIPSNEK-----------KSIEIENRNQEEKEPAGQGELESDKEK 1542
Query: 198 IRKEEKDSEKEKKSKDKD-KKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 247
E ++K+ ++D + +K K KK+ K + E + K +F
Sbjct: 1543 KGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTEAELDFFLKRYLRF 1593
Score = 35.1 bits (81), Expect = 0.099
Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 16/77 (20%)
Query: 122 KKDKSIKKDKKTH------KKKEKEKIKKKKDKKDKDKIK-------NKEKDKEK---KV 165
KK I ++K ++EKE + + + DK+K N+EK+ E+ +
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564
Query: 166 DKEKEKGKKEEKEKKEE 182
D +K K KK+ K E
Sbjct: 1565 DIKKRKNKKQYKSNTEA 1581
Score = 34.3 bits (79), Expect = 0.15
Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 4/82 (4%)
Query: 125 KSIKKDKK----THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
K I + KK +++KK E + +++K+ E DKEKK + E +E+ ++
Sbjct: 1500 KKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEE 1559
Query: 181 EEKEFKMKEDKEKPVEKIRKEE 202
+ E +K+ K K K E
Sbjct: 1560 DYAESDIKKRKNKKQYKSNTEA 1581
Score = 31.6 bits (72), Expect = 1.0
Identities = 16/43 (37%), Positives = 25/43 (58%)
Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
K + K D ++ K K K+KK K++E+ K+EEK + E E
Sbjct: 729 NGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAE 771
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 43.8 bits (104), Expect = 1e-04
Identities = 31/151 (20%), Positives = 79/151 (52%), Gaps = 14/151 (9%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK-- 177
++ + + K KKE E +KK+ + K+++ + E+++ + + + ++ E+
Sbjct: 26 KLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRL 85
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS--- 234
++EE + E +K E + K+EK+ ++K+ D+ ++ +E + ++++E ER S
Sbjct: 86 LQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLT 145
Query: 235 --HEKEVIPKLTFKFGTDMEEKTKRESSPKI 263
KE++ + ++EE+ + E++ I
Sbjct: 146 QEEAKEILLE-------EVEEEARHEAAKLI 169
Score = 36.4 bits (85), Expect = 0.024
Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
+ K E+ + E+ KKE + K+E + KE+ K ++ +E K+ E + ++
Sbjct: 24 EKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLER 83
Query: 215 DKKLKKEKLKKK-----KKEKERSSHEKEVIPK 242
++E L +K KKE+ EKE+ K
Sbjct: 84 RLLQREETLDRKMESLDKKEENLEKKEKELSNK 116
Score = 36.4 bits (85), Expect = 0.025
Identities = 31/104 (29%), Positives = 52/104 (50%), Gaps = 7/104 (6%)
Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE-EKEKKEEKEFKMKEDKEKPVEKIRK 200
++K+ +K + K ++ KE E KKE E KEE K++ + E+ +++ R
Sbjct: 18 LRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEV-HKLRAELERELKERRN 76
Query: 201 EEKDSEKEKKSKDKDKKLKKEKLKKK-----KKEKERSSHEKEV 239
E + E+ +++ K E L KK KKEKE S+ EK +
Sbjct: 77 ELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNL 120
Score = 29.9 bits (68), Expect = 3.3
Identities = 29/111 (26%), Positives = 62/111 (55%), Gaps = 16/111 (14%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
++ E+ + K + KK++ +KKEKE K+K+ + KE++ E+ + +++E
Sbjct: 84 RLLQREETLDRKMESLDKKEENL-EKKEKELSNKEKN------LDEKEEELEELIAEQRE 136
Query: 171 K-----GKKEEKEKK---EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
+ G +E+ K+ EE E + + + K +++I +E K+ E +KK+K+
Sbjct: 137 ELERISGLTQEEAKEILLEEVEEEARHEAAKLIKEIEEEAKE-EADKKAKE 186
>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031). This
protein is expressed in Plasmodium; its function is
unknown. It may be the product of gene family pyst-b.
Length = 228
Score = 42.3 bits (100), Expect = 2e-04
Identities = 36/104 (34%), Positives = 49/104 (47%), Gaps = 12/104 (11%)
Query: 130 DKKTHK-----KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
DKKT K +KE E++KK+ D K +N E + DK K K E E ++
Sbjct: 107 DKKTKKLINELRKELEEVKKELDNK-----RNGELAIQPIQDKRIIK-KDENSSVSEHED 160
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
FK E++E +E E S KK K +KLKK + K K
Sbjct: 161 FKQLENEENFLETEYNEITSSNNYKKLKIN-RKLKKAEKKLIKS 203
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 43.0 bits (101), Expect = 2e-04
Identities = 23/111 (20%), Positives = 52/111 (46%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
K ++ + +E EK + + K+ +++ + +++KK + K K E + K E K
Sbjct: 112 KAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKK 171
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
K ++ + K + ++ KK + + K EK K + + K ++ + E
Sbjct: 172 KAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAE 222
Score = 38.8 bits (90), Expect = 0.004
Identities = 30/133 (22%), Positives = 59/133 (44%), Gaps = 5/133 (3%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK-----EKGKKEEKEKKEEKEFKMKE 189
K+ EKE++K ++ +K ++ + + + ++K+ +++ E+ KK E K + K
Sbjct: 104 KQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKL 163
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 249
+K +E + +E K+K + KK+ + K E++ E E K K
Sbjct: 164 KAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEA 223
Query: 250 DMEEKTKRESSPK 262
EEK E
Sbjct: 224 AAEEKAAAEKKKA 236
Score = 38.0 bits (88), Expect = 0.008
Identities = 28/116 (24%), Positives = 41/116 (35%), Gaps = 1/116 (0%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
K + K E K+K + K K + K ++ K + KK E E K E
Sbjct: 146 KKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAE 205
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLKKEKLKKKKKEKERSSHEKEV 239
E + K + + E EK K K K K +K K ER + +
Sbjct: 206 KAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAAL 261
Score = 33.8 bits (77), Expect = 0.16
Identities = 31/154 (20%), Positives = 68/154 (44%), Gaps = 1/154 (0%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
I+ + + KK E+++ KK++ ++ K + ++E+ EKE+ K +E++K+ E+
Sbjct: 65 GRIQSQQSSAKKGEQQRKKKEEQVAEELK-PKQAAEQERLKQLEKERLKAQEQQKQAEEA 123
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
K + ++K E+ ++ +K+K K K + K E ++ + E +
Sbjct: 124 EKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEA 183
Query: 245 FKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAA 278
K K E+ K + K+ + A
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKA 217
Score = 33.8 bits (77), Expect = 0.16
Identities = 22/94 (23%), Positives = 38/94 (40%), Gaps = 2/94 (2%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
KK + K + K K + KKK + + K ++ E + + EK + E+K
Sbjct: 171 KKAEEAAKAAEEAKAKAEAAAAKKKAEAEA-KAAAEKAKAEAEAKAKAEKKAEAAAEEKA 229
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 215
E K K K +K K +E++ + D
Sbjct: 230 AAEKKKAAAKAK-ADKAAAAAKAAERKAAAAALD 262
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 43.5 bits (103), Expect = 2e-04
Identities = 35/108 (32%), Positives = 57/108 (52%), Gaps = 6/108 (5%)
Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK-EKEKGKKEEKEKKEEKEF 185
K D K ++EKE+ + K++K K+K +E+ K+K+++K EK K E K++E ++
Sbjct: 545 WKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKY 604
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKS----KDKDKKLKKEKLKKKKKE 229
++ P EE S+KE+K DK KL +E L K K
Sbjct: 605 SAFDETGLPTHDADGEEI-SKKERKKLSKEYDKQAKLHEEYLAKGGKS 651
Score = 36.9 bits (86), Expect = 0.019
Identities = 28/105 (26%), Positives = 52/105 (49%), Gaps = 16/105 (15%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
K + + K DK++ + K +++ +++ K K ++E+K+K+ EK K K P
Sbjct: 538 KPDGPSVWKLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIP---PA 594
Query: 196 EKIRKEE-------------KDSEKEKKSKDKDKKLKKEKLKKKK 227
E +++E D++ E+ SK + KKL KE K+ K
Sbjct: 595 EFFKRQEDKYSAFDETGLPTHDADGEEISKKERKKLSKEYDKQAK 639
Score = 31.2 bits (71), Expect = 1.5
Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
+K+ + +E EK KE KE+K K KK +++K K+ +K ++ E +
Sbjct: 545 WKLDDKEELQREKEEKEA---LKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQE 601
Query: 245 FKFGT 249
K+
Sbjct: 602 DKYSA 606
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 41.6 bits (98), Expect = 2e-04
Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 2/85 (2%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
K KK +E EK+KK+ ++K K K K+K +KK DK+K+K ++ +K E+K+ K
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQK--WKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKD 213
EDK + + K E + E K +
Sbjct: 119 EDKLEDLTKSYSETLSTLSELKPRK 143
Score = 41.2 bits (97), Expect = 3e-04
Identities = 27/71 (38%), Positives = 44/71 (61%)
Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
K K+K++ +E EK KKE +EK++ K K K K+K +K +K++K +K +K +K+ +
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120
Query: 219 KKEKLKKKKKE 229
K E L K E
Sbjct: 121 KLEDLTKSYSE 131
Score = 35.8 bits (83), Expect = 0.020
Identities = 24/68 (35%), Positives = 36/68 (52%)
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
K KK+E ++ EK K E+K+K K +K +K +K+K KD K K EK +K+ E
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120
Query: 231 ERSSHEKE 238
+ K
Sbjct: 121 KLEDLTKS 128
Score = 35.4 bits (82), Expect = 0.025
Identities = 19/61 (31%), Positives = 33/61 (54%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
+K K K KK+ KKK+K+K KK K DK + K++++ ++K D K + +
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSEL 139
Query: 182 E 182
+
Sbjct: 140 K 140
Score = 34.3 bits (79), Expect = 0.057
Identities = 26/93 (27%), Positives = 50/93 (53%), Gaps = 4/93 (4%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
+ E KKKK+ ++ + KE ++++K +K+K KK++ + K++K+ K + EK E
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Query: 197 KIRKEEKD----SEKEKKSKDKDKKLKKEKLKK 225
K +++ + S E S + K +K L K
Sbjct: 116 KEAEDKLEDLTKSYSETLSTLSELKPRKYALHK 148
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 43.1 bits (101), Expect = 3e-04
Identities = 28/120 (23%), Positives = 63/120 (52%), Gaps = 7/120 (5%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E K K++ K+ + +K ++K D + K +++ ++K+ + + +
Sbjct: 210 EDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSP 269
Query: 180 KEEKEFKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
KE+K ++ E++++ +EK + E +K+ E+ K+KD K LK++ K E+ + +KE
Sbjct: 270 KEDK--QVAENQKREIEKAQIEIKKNDEEALKAKDH----KAFDLKQESKASEKEAEDKE 323
Score = 42.3 bits (99), Expect = 4e-04
Identities = 31/127 (24%), Positives = 61/127 (48%)
Query: 112 VTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
V D+ ++K+ +S + K+ + KE+ K+ K + K + + +K+ D+ ++K
Sbjct: 194 VNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQK 253
Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
++ + K KEDK+ + R+ EK + KK+ ++ K K K K+E +
Sbjct: 254 QQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESK 313
Query: 232 RSSHEKE 238
S E E
Sbjct: 314 ASEKEAE 320
Score = 35.0 bits (80), Expect = 0.086
Identities = 23/124 (18%), Positives = 61/124 (49%), Gaps = 4/124 (3%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
K ++D + ++D+ +K ++ + K+ DK +N++++ EK E +
Sbjct: 235 KADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKE--DKQVAENQKREIEKA-QIEIK 291
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD-KKLKKEKLKKKKKE 229
K +E + K+ K F +K++ + ++ +E +++K+++ +D +K K + +
Sbjct: 292 KNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSL 351
Query: 230 KERS 233
E +
Sbjct: 352 NEDA 355
Score = 28.4 bits (63), Expect = 8.5
Identities = 22/96 (22%), Positives = 40/96 (41%), Gaps = 9/96 (9%)
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 245
++ED EK V R E+E + K + KE+L KK+ + +++ + +
Sbjct: 185 ALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKAD------- 237
Query: 246 KFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADE 281
F D +K + E K + P P ++ E
Sbjct: 238 -FAQDNADKQRDEVRQKQQ-EAKNLPKPADTSSPKE 271
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 42.7 bits (101), Expect = 4e-04
Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 4/86 (4%)
Query: 163 KKVDKEKEKGKKEEKEKKE-EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
KK+ + E ++EE+ E E + K K++ ++ +EK+ DK + L K
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKL 283
Query: 222 KLKKKKKEKERSSHEKEVIPKLTFKF 247
++ K+KK++E EK P T F
Sbjct: 284 EILKEKKDEELFWFEK---PWPTLLF 306
Score = 31.9 bits (73), Expect = 0.69
Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 2/76 (2%)
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
F K+ KE + R+EE D E E S+ K K+E+ +++ S EKE K
Sbjct: 221 FFTKKLKETSETEEREEETDVEIETTSETK--GTKQEQEGSTEEDPSLFSEEKEDPDKTE 278
Query: 245 FKFGTDMEEKTKRESS 260
++ ++ K E
Sbjct: 279 DLDKLEILKEKKDEEL 294
Score = 30.4 bits (69), Expect = 2.3
Identities = 17/58 (29%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 177 KEKKEEKEFKMKED-KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
EK F + E K K + + E+K K ++ K +++K K K++EKER
Sbjct: 623 IEKTLFFSFDISELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680
Score = 28.8 bits (65), Expect = 7.8
Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 1/72 (1%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
KK K + ++ ++ + E + K K + + ++ +EKE K E + +
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTE-DLDK 282
Query: 182 EKEFKMKEDKEK 193
+ K K+D+E
Sbjct: 283 LEILKEKKDEEL 294
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 41.9 bits (98), Expect = 4e-04
Identities = 30/110 (27%), Positives = 47/110 (42%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
K KK +++K KK K+ K K K K K+ K KK + K K+ +
Sbjct: 6 KTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTV 65
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
K +K + V+K +K +KE S + + K K K+ + EK
Sbjct: 66 KFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115
Score = 36.9 bits (85), Expect = 0.015
Identities = 21/98 (21%), Positives = 36/98 (36%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
KK K+K ++K K +KE K KK K ++ K ++ +
Sbjct: 1 MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
K+ V+ + E E K K + + E + K
Sbjct: 61 KKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNK 98
Score = 34.5 bits (79), Expect = 0.076
Identities = 26/101 (25%), Positives = 39/101 (38%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
K ++E+ K + K+ KK K K K K K K KKV + E
Sbjct: 9 KKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFE 68
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
K + +KE +K K + + E K K++ K K
Sbjct: 69 KTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKK 109
Score = 30.3 bits (68), Expect = 1.7
Identities = 22/113 (19%), Positives = 42/113 (37%), Gaps = 3/113 (2%)
Query: 57 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
K ++ P K + K ++K K + K K + ++ T
Sbjct: 6 KTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAK--KAAKVKKTKSVKTTTKKV 63
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK 169
+ + +K +S+KK+ K +KE + + + KN K +K EK
Sbjct: 64 TV-KFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEK 115
Score = 30.3 bits (68), Expect = 1.9
Identities = 29/97 (29%), Positives = 42/97 (43%), Gaps = 2/97 (2%)
Query: 153 KIKNKEKDKEKKVDKEKEKGKKE-EKEKKEEKEFKMKE-DKEKPVEKIRKEEKDSEKEKK 210
K K+ +EKK +K KE K KK K K K V+K + + ++K
Sbjct: 6 KTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTV 65
Query: 211 SKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 247
+K + +KKE + KK +KE S E FK
Sbjct: 66 KFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKN 102
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 42.3 bits (100), Expect = 4e-04
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 10/129 (7%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
LGE + + + KKE +K+++K ++ + +I+ KE++ EK +K +K K +
Sbjct: 86 LGEENIEI---EAQIEELKKELKKLEEKIEQLEA-EIEKKEEELEKAKNKFLDKAWK-KL 140
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
KK + E + K +EK E + + L E+L K K K S
Sbjct: 141 AKKYDSNL--SEALKGLNYKKNFKEKLLKELKSVILNASSLLSLEEL--KAKIKTLFSSN 196
Query: 237 KEVIPKLTF 245
K + LT
Sbjct: 197 KPELALLTL 205
Score = 34.6 bits (80), Expect = 0.11
Identities = 31/153 (20%), Positives = 68/153 (44%), Gaps = 18/153 (11%)
Query: 114 SSEDLGEIKKDKSIKKDK-KTHKKKEKEKIKK-------KKDKKDKDKIKNKEKDKEKKV 165
D+ E+K ++ + + +K +EK+K + + I + + +
Sbjct: 318 FKLDVEELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELI 377
Query: 166 DKEKEKGKKEEKEKKEEKEF-------KMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKK 217
+ EK +KEK + K+ ++KED + ++ + EK + EK+ K + +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437
Query: 218 LKKEKLKKKKKEKERSSHEK--EVIPKLTFKFG 248
+K + + K+ EK+ ++ E + I KL +G
Sbjct: 438 IKALEKEIKELEKQLTNIEPTADEINKLLKAYG 470
Score = 34.6 bits (80), Expect = 0.11
Identities = 27/130 (20%), Positives = 61/130 (46%), Gaps = 10/130 (7%)
Query: 145 KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
+++ + + +I+ +K+ +K +++ E+ + E ++K+EE E + +K +K+ K+
Sbjct: 88 EENIEIEAQIEELKKELKKL-EEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDS 146
Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKR-ESSPKI 263
+ E KK KEKL K+ K ++ L+ + +++ K K SS K
Sbjct: 147 NLSEALKGLNYKKNFKEKLLKELKSVILNA-----SSLLSLE---ELKAKIKTLFSSNKP 198
Query: 264 VIKPVKSPSP 273
+ +
Sbjct: 199 ELALLTLSVI 208
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 42.0 bits (99), Expect = 5e-04
Identities = 18/117 (15%), Positives = 51/117 (43%), Gaps = 8/117 (6%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD-----KEKKVDKEKEKGKKEEK 177
D+ I+ +++ K ++ + ++ + I ++K+ K + + ++ + + +
Sbjct: 808 IDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNI 867
Query: 178 EKKEEKEFKMKEDKEKPVEKI---RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
+ + + + K D +K + I K + K +K KK+K +K ++E
Sbjct: 868 NENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEE 924
Score = 37.7 bits (88), Expect = 0.011
Identities = 24/112 (21%), Positives = 52/112 (46%), Gaps = 4/112 (3%)
Query: 82 KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 141
+ K ++E ++ EL + +K +L EI+ +++ + + + E
Sbjct: 818 RLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQ---EAEEEIQNINENKNEF 874
Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
++ K D K +K+ KD K+ ++ K++EK K++K+ K D+E
Sbjct: 875 VEFKNDPKKLNKLII-AKDVLIKLVISSDEIKQDEKTTKKKKKDLEKTDEEA 925
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 41.9 bits (98), Expect = 6e-04
Identities = 39/128 (30%), Positives = 55/128 (42%), Gaps = 5/128 (3%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK-----VDKE 168
ED E K K KK K KK + +K K+ D D+ + + D E + D
Sbjct: 232 GGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSS 291
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
EE+E K E K + E+ + EE+ +E+E K KKLKK K KK
Sbjct: 292 ASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351
Query: 229 EKERSSHE 236
+K+ S
Sbjct: 352 DKDDSDSG 359
Score = 33.8 bits (77), Expect = 0.20
Identities = 18/87 (20%), Positives = 38/87 (43%)
Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 205
++ K +I K + ++ + +E E+ K EE+ +K K+K+ K K + +
Sbjct: 300 REDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG 359
Query: 206 EKEKKSKDKDKKLKKEKLKKKKKEKER 232
+ S + KK+KE ++
Sbjct: 360 DDSDDSDIDGEDSVSLVTAKKQKEPKK 386
Score = 30.3 bits (68), Expect = 2.6
Identities = 35/143 (24%), Positives = 60/143 (41%), Gaps = 5/143 (3%)
Query: 104 PMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK 163
P +P ++ ED E +++K+ ++ + K K+ +K+K KK+ DKD + + +
Sbjct: 306 PEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDS 365
Query: 164 KVDKEKEKG---KKEEKEKKEEKEFKMKEDKEKPVEKIR--KEEKDSEKEKKSKDKDKKL 218
+D E K++KE K+E+ R E KD K K + + K
Sbjct: 366 DIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSP 425
Query: 219 KKEKLKKKKKEKERSSHEKEVIP 241
KK K E S + P
Sbjct: 426 ASVPAKKLKTENAPKSSSGKSTP 448
Score = 28.4 bits (63), Expect = 8.4
Identities = 27/131 (20%), Positives = 53/131 (40%), Gaps = 11/131 (8%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE----KGKK 174
G+ ++ +K K K K + D+ D D+ +D +++ K+K+ K KK
Sbjct: 194 GDEDEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKK 253
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEK-------DSEKEKKSKDKDKKLKKEKLKKKK 227
+ + K+ K + E + E + S ++++ KL E K +
Sbjct: 254 KLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPE 313
Query: 228 KEKERSSHEKE 238
E++ S E E
Sbjct: 314 IEQDEDSEESE 324
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 40.4 bits (95), Expect = 9e-04
Identities = 27/115 (23%), Positives = 49/115 (42%), Gaps = 3/115 (2%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
+ + ++E+ KKK+ K K K+K K+ + K +KK E+
Sbjct: 73 EGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPT 132
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
+ P RK + S + K ++ ++E +KK + K R EK+ +LT
Sbjct: 133 LLDSP---RRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELT 184
Score = 31.6 bits (72), Expect = 0.57
Identities = 31/127 (24%), Positives = 51/127 (40%), Gaps = 18/127 (14%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE--- 176
E+++++ +KK K+ K KE KKKK K K K E+
Sbjct: 77 ELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDS 136
Query: 177 ----------KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
+ KE ++KE + + + K K +K K K+K+L +E+ +
Sbjct: 137 PRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARK-----RKEKKKEKELTQEERLAE 191
Query: 227 KKEKERS 233
KE ER
Sbjct: 192 AKETERI 198
>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
finger [Chromatin structure and dynamics].
Length = 271
Score = 40.7 bits (95), Expect = 0.001
Identities = 43/244 (17%), Positives = 79/244 (32%), Gaps = 38/244 (15%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK----- 183
D K + + KK D ++ E + KE + + ++EK
Sbjct: 31 IDAKVCDIIKNLRQMISILKKIIDLDSQTYEEVEDGLLKEIRELLLKAIYIQKEKSDLAD 90
Query: 184 --EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS-------S 234
E ++ ++ ++I K + + D + E+ + S S
Sbjct: 91 RAEKLLRRHRKLLDDRIAKRPHEKVAARIENCHDAVSRLERNSYSSAARRSSGEHRSAAS 150
Query: 235 HEKEVIPKLTFKFG------TDMEEKTKRESSPKIVIKPV----------KSPSPPPVAA 278
+ KL + E +KRE S + V +
Sbjct: 151 SQGSRHTKLKKRKNIHNLKRRSPELSSKREVSFTLESPSVPDTATRVKEGNNGGSTKSRG 210
Query: 279 ADEYDTGDSKQVWICPACGVQDDGSLPMIGCDGCDA---WYHWVCVGLVAEPETSDWFCP 335
D + ++++ C Q M+ CD + W+H CVGL P+ W+CP
Sbjct: 211 VSSEDNSEGEELY----CFCQQVSYGQMVACDNANCKREWFHLECVGLKEPPKGK-WYCP 265
Query: 336 KCSK 339
+C K
Sbjct: 266 ECKK 269
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 39.8 bits (93), Expect = 0.001
Identities = 27/159 (16%), Positives = 65/159 (40%)
Query: 73 TSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKK 132
++ + + + KD T + L I++ + + K K
Sbjct: 44 YGKTSRLKKSKGRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKG 103
Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
KK +I + + ++ K + + + + EE + K +++ + E+ E
Sbjct: 104 KKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELE 163
Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
K +E++ E+ EK + K+K K+L++ K + ++ +E
Sbjct: 164 KELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEE 202
Score = 36.0 bits (83), Expect = 0.018
Identities = 32/141 (22%), Positives = 61/141 (43%), Gaps = 15/141 (10%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN---KEKDKEKKVDKEKE 170
S+ G +K D I+KD+ K + + K K I+ K K KKV K
Sbjct: 52 KSKGRGIVKGDIEIEKDEGKKKTYVEITFENNKGKLKLRLIEESRELTKKKGKKVKKSIL 111
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK----------- 219
+ + ++ ++ E + + P+ +E+ E + K K+K ++L+
Sbjct: 112 EIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELED 171
Query: 220 -KEKLKKKKKEKERSSHEKEV 239
K+ L+K +EKE+ +E+
Sbjct: 172 EKDLLEKLLEEKEKKKELEEL 192
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 40.8 bits (96), Expect = 0.001
Identities = 12/86 (13%), Positives = 39/86 (45%)
Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
+++ + K++ K E K E E+++ + +KE+ +K +I + E+ ++ +
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 212 KDKDKKLKKEKLKKKKKEKERSSHEK 237
+ + ++++ + E +
Sbjct: 62 PAASGEGGGGEEEEEEAKAEAAEFRA 87
Score = 36.2 bits (84), Expect = 0.027
Identities = 16/92 (17%), Positives = 39/92 (42%), Gaps = 3/92 (3%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E++K +K + ++K + K++ K +I E+ ++ K G+ E+
Sbjct: 14 ELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEE 73
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
+EE+ + +R + +E+K+
Sbjct: 74 EEEEAKAEAAEFRA---YLRGGDDALAEERKA 102
Score = 35.4 bits (82), Expect = 0.058
Identities = 17/96 (17%), Positives = 35/96 (36%), Gaps = 4/96 (4%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK--KEEKEFKMKEDKEKPVE 196
+E + + K+ K+ EK E + +K + KEE +K E + D+ +
Sbjct: 5 REALAELA--KELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKP 62
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
E E+E++ + + L+
Sbjct: 63 AASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAE 98
Score = 28.9 bits (65), Expect = 5.7
Identities = 14/58 (24%), Positives = 27/58 (46%), Gaps = 6/58 (10%)
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK-EKLKKKKKEKER 232
KE +E KE + K+ ++EK +E E++ + D ++ +KL + E
Sbjct: 1 LKELREALAELAKE-----LRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEE 53
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 40.2 bits (94), Expect = 0.001
Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
+ KDK + K + K +K K IK + + +V K + KKEEK+K ++
Sbjct: 19 EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAE---RVKKLHSQEKKEEKKKPKK 75
Query: 183 KEFKM 187
K+ +
Sbjct: 76 KKVPL 80
Score = 36.7 bits (85), Expect = 0.011
Identities = 19/78 (24%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 163 KKVDKEKE-KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
K+V + E + K K+K + + ++K +EK + K E+ K ++ K+E
Sbjct: 10 KRVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEE 69
Query: 222 KLKKKKKEKERSSHEKEV 239
K K KKK+ + ++
Sbjct: 70 KKKPKKKKVPLQVNPAQL 87
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 40.9 bits (96), Expect = 0.001
Identities = 25/113 (22%), Positives = 41/113 (36%), Gaps = 4/113 (3%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDK---DKIKNKEKDKEKKVDKEKEKGKKEEKE 178
K+ +K + K K K + + K K + +KK + E K E +
Sbjct: 125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAK 184
Query: 179 KKEEKEFKMKEDKE-KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
KK E E K E K + ++K + + KK + K K + K
Sbjct: 185 KKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAA 237
Score = 37.9 bits (88), Expect = 0.008
Identities = 20/97 (20%), Positives = 44/97 (45%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
K+ EKE++ ++ KK ++ + K+K+ ++ K K K E + + +K
Sbjct: 104 KQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKA 163
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
+ +K+ + +K + + KK + E K E +
Sbjct: 164 AAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK 200
Score = 37.5 bits (87), Expect = 0.011
Identities = 27/125 (21%), Positives = 62/125 (49%), Gaps = 5/125 (4%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
+ KS K+ ++ KKKE+++ ++ + K+ ++ + K+ +KE+ +E++K +E ++
Sbjct: 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL 129
Query: 183 KEFKMKEDKEKPVEKIRKEEKDSEKE-----KKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
K+ + +E K + + + K KK+ + KK + + KK + + E
Sbjct: 130 KQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEA 189
Query: 238 EVIPK 242
E K
Sbjct: 190 EAAAK 194
Score = 37.1 bits (86), Expect = 0.014
Identities = 19/97 (19%), Positives = 39/97 (40%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
+ +E+ K+ ++ + +K K+ ++ K K E + K+ K + K
Sbjct: 110 RLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
+ +K + + KK + + K KKK E E
Sbjct: 170 KAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAE 206
Score = 36.7 bits (85), Expect = 0.022
Identities = 28/109 (25%), Positives = 41/109 (37%), Gaps = 4/109 (3%)
Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK-DKEKKVDKEKEKGKKEEKEKKEEKEF 185
KK + K+ K K+ ++ K K K + E K K E +KK E E
Sbjct: 116 QKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175
Query: 186 KMK---EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
K E K+K + + K+K + KK E KK E +
Sbjct: 176 AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAK 224
Score = 35.6 bits (82), Expect = 0.044
Identities = 21/95 (22%), Positives = 39/95 (41%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
KK KKK + + KK + K K + + K K+K + + ++K E K+
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
E K + + + +EK +K +K +
Sbjct: 221 AEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
Score = 33.6 bits (77), Expect = 0.20
Identities = 26/120 (21%), Positives = 44/120 (36%), Gaps = 3/120 (2%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
K K+ + K+ +K + KK + + K + K+K + K E K K
Sbjct: 143 AAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAK-K 201
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
K E E K K E +K E + K + + +K K ++ + EV
Sbjct: 202 KAEAEAKKKAAAE--AKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
Score = 31.3 bits (71), Expect = 1.1
Identities = 17/108 (15%), Positives = 55/108 (50%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
+ K E++ K++++ +E ++K+ ++ ++K+ +++ + ++++ E K++ K
Sbjct: 71 QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130
Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPK 262
K+ ++ K K ++ E + K + ++K + E++ K
Sbjct: 131 QKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKK 178
Score = 30.9 bits (70), Expect = 1.2
Identities = 19/92 (20%), Positives = 35/92 (38%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
KK ++ K + K+K + + + K K + + K+K + K+K E K
Sbjct: 169 KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAA 228
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
+ + K EK E+ + K D
Sbjct: 229 KAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 39.9 bits (93), Expect = 0.001
Identities = 34/167 (20%), Positives = 73/167 (43%), Gaps = 9/167 (5%)
Query: 76 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
KEK +S+ P+E E ++ +L V + + E+K+ +K
Sbjct: 68 KEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEE--------EVKEQLQSLLEKIVVS 119
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
K+E++ K+ + D+DK E +++ E K +E ++ +E ++ K+
Sbjct: 120 KQEEDGPGKEP-QLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMK 178
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 242
EK ++E + KE K + K + + + ++ ++E + E K
Sbjct: 179 EKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIK 225
Score = 39.1 bits (91), Expect = 0.002
Identities = 27/124 (21%), Positives = 59/124 (47%), Gaps = 3/124 (2%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
KEK + ++ + E++ + V K K+K ++E KE+ + K+ K++
Sbjct: 68 KEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPG 127
Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
++ + D +K ++D D + + + K +E E S H +E + ++ DM+EK +
Sbjct: 128 KEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASE---QYKQDMKEKASEQ 184
Query: 259 SSPK 262
+
Sbjct: 185 ENED 188
Score = 31.1 bits (70), Expect = 0.92
Identities = 31/162 (19%), Positives = 68/162 (41%), Gaps = 8/162 (4%)
Query: 104 PMVPNIVKVTSSEDLGEIKKDKSIKKDK---KTHKKKEKEKIKKKKDKKDKDKIKNKEKD 160
P VP +E+ G++ K+ +K + K + EKI K ++D + + +
Sbjct: 75 PTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDE 134
Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK- 219
+ + ++ + ++ + K +E ED E ++ K KEK S+ +++ K
Sbjct: 135 DKFLLAEDSDDRQETLEAGKVHEE---TEDSYHVEETASEQYKQDMKEKASEQENEDSKE 191
Query: 220 -KEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 260
EK ++ K E + + E + ++E+ +E
Sbjct: 192 PVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAKEPV 233
Score = 29.9 bits (67), Expect = 2.6
Identities = 35/176 (19%), Positives = 77/176 (43%), Gaps = 12/176 (6%)
Query: 63 MKDKVSPKDKTSPKEK--MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
+K+K + + P+E + +E K KV++ + + V+ E+ G
Sbjct: 67 LKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGP 126
Query: 121 IKK-----DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
K+ DK + + +++ E K ++ +D ++ ++ K+ KEK
Sbjct: 127 GKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEK----AS 182
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E+E ++ KE K ++ K E+D ++E + K +K+E K+ +E++
Sbjct: 183 EQENEDSKEPVEKAERTKAETD-DVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQQ 237
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 40.8 bits (96), Expect = 0.001
Identities = 34/155 (21%), Positives = 68/155 (43%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
++E E+ KK + K +N EK+ E K K++ K + +EK++ +E+E E++ K
Sbjct: 409 EEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVE 468
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKT 255
+ K K SEK +K +++++ ++ K +S+ +++ K + K + +
Sbjct: 469 KVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKIS 528
Query: 256 KRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV 290
K K K KS +E
Sbjct: 529 KAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDD 563
Score = 40.0 bits (94), Expect = 0.003
Identities = 26/118 (22%), Positives = 57/118 (48%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
K+ +S K K+ + +++K ++++ + ++ EK K + + ++ K+EE+E+ +
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELD 491
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
E+ +K +++ K K K +K K KKKK+KE+S +
Sbjct: 492 EENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDD 549
Score = 35.4 bits (82), Expect = 0.067
Identities = 24/128 (18%), Positives = 68/128 (53%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
K+ ED E S+K ++ +K++E + ++ + + + + + ++E + +K
Sbjct: 360 KLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKN 419
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
G+++ + EKE + K+ K++ + +++++ E+E+ +++ K++K K K+ +
Sbjct: 420 VGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSE 479
Query: 231 ERSSHEKE 238
+ E+E
Sbjct: 480 KAQKEEEE 487
Score = 34.6 bits (80), Expect = 0.12
Identities = 23/122 (18%), Positives = 61/122 (50%), Gaps = 1/122 (0%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKK-KKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
E+ + K +++ + K++ +I++ +++ + +++ +E ++ K + + K
Sbjct: 368 EENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
E EK+ E + KE+K + EK +E++ ++++ +K K + +K +KE
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487
Query: 235 HE 236
E
Sbjct: 488 EE 489
Score = 34.3 bits (79), Expect = 0.15
Identities = 31/133 (23%), Positives = 62/133 (46%), Gaps = 2/133 (1%)
Query: 106 VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV 165
+ + L +K + + KK E E+++++ + +++ + E+ +K V
Sbjct: 361 LKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNV 420
Query: 166 DKEKEKGKKEEKEKKEEKEFKMK--EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
+ K + EKE + +K K E KEK +E +D E+ K K +K LK+ +
Sbjct: 421 GRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEK 480
Query: 224 KKKKKEKERSSHE 236
+K++E+E E
Sbjct: 481 AQKEEEEEELDEE 493
Score = 33.1 bits (76), Expect = 0.36
Identities = 39/172 (22%), Positives = 73/172 (42%), Gaps = 17/172 (9%)
Query: 77 EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK 136
EK+++K ++ + + ++ P + V S + KK S K DK +K
Sbjct: 468 EKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKI 527
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE------D 190
+ KKK KK+K + + E+ K +++E +++ FK K+
Sbjct: 528 SKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKDLIKEAFA 587
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKL-----------KKEKLKKKKKEKE 231
+ V + KE+K+ +E+ K+ D L KK K K+K+K +
Sbjct: 588 GDDVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRF 639
Score = 32.3 bits (74), Expect = 0.60
Identities = 30/147 (20%), Positives = 61/147 (41%), Gaps = 12/147 (8%)
Query: 99 ELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE 158
E P N+ + + GE + + K + ++ KEK++ ++++ +D+++ K ++
Sbjct: 410 EENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEK 469
Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE---------- 208
+ EK + ++EE+E EE + K + +K S +
Sbjct: 470 VANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISK 529
Query: 209 --KKSKDKDKKLKKEKLKKKKKEKERS 233
K K K KK K L ++E S
Sbjct: 530 AAVKVKKKKKKEKSIDLDDDLIDEEDS 556
Score = 32.0 bits (73), Expect = 0.68
Identities = 25/131 (19%), Positives = 57/131 (43%), Gaps = 6/131 (4%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE----KEKGKKEEK 177
K+K +++ + +E+ K++K +K K K +++++E+++D+E K +
Sbjct: 447 FKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKS 506
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKD--SEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
KK++ + K +K KI K +K+K+ ++ K + +
Sbjct: 507 AKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEED 566
Query: 236 EKEVIPKLTFK 246
E + FK
Sbjct: 567 EDDEELPFLFK 577
Score = 29.3 bits (66), Expect = 5.5
Identities = 31/168 (18%), Positives = 66/168 (39%), Gaps = 11/168 (6%)
Query: 64 KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 123
+ K P++ E K++ + KE + + E + KV + +K+
Sbjct: 422 RRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVAN----KLLKR 477
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
+ +K+++ + E+ K K K ++ K + K K K
Sbjct: 478 SEKAQKEEEEEELDEENPWLKTTSSVGKSA-----KKQDSKKKSSSKLDKAANKISKAAV 532
Query: 184 EFKMKEDKEKP--VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
+ K K+ KEK ++ +E+DS K ++D+ ++ K+K+
Sbjct: 533 KVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKD 580
Score = 28.9 bits (65), Expect = 6.2
Identities = 32/154 (20%), Positives = 71/154 (46%), Gaps = 13/154 (8%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI------KNKEKDKEKKVDK 167
S E+ + +D ++ +KK K+K+ +D ++ + + E K+++ D
Sbjct: 334 SEEEDEDDDEDDDDGENPWMLRKKL-GKLKEGEDDEENSGLLSMKFMQRAEARKKEENDA 392
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK--K 225
E E+ ++E + ++E E E+ E+P +K K + + + + KKLKKE K
Sbjct: 393 EIEELRRELEGEEESDE----EENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFK 448
Query: 226 KKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 259
+KKE + ++ K + +++++
Sbjct: 449 EKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQ 482
>gnl|CDD|187861 cd09730, Cas8a1_I-A, CRISPR/Cas system-associated protein Cas8a1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Large proteins, some contain Zn-finger domain; signature
gene for I-A subtype; also known as TM1802 family.
Length = 579
Score = 40.4 bits (95), Expect = 0.002
Identities = 25/108 (23%), Positives = 45/108 (41%), Gaps = 18/108 (16%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE----KDKEKKVDKEKEKGKKEEK 177
+K K T + +KIKK +K KE K+ + ++K EK K+ +
Sbjct: 82 PTNKITYLLKTT--EPLNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQ 139
Query: 178 EKKE------------EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
+ K +K + ED E+ +K+ K++ +K+ K K
Sbjct: 140 DLKPGENVILTIKIDGKKLLEYDEDYEEFRDKVLKKKVFKDKKAKEKG 187
Score = 29.6 bits (67), Expect = 3.3
Identities = 30/153 (19%), Positives = 61/153 (39%), Gaps = 9/153 (5%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI-KNKEKDKEKKVDK 167
++K+ + EI + DKKT +K K + +D + NK K +
Sbjct: 39 VLKIDFDTEENEINIKGEEEFDKKTFRKY---LYKGQAGNASQDYLPTNKITYLLKTTEP 95
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS--EKEKKSKDKDKKLKKEKLKK 225
+K KK ++ E K + K K ++ + ++ + + + K + + K+
Sbjct: 96 LNKKIKKWFEKSYLLLESKKEILKLKNIKDVLEKNIEKILKDIQDLKPGENVILTIKIDG 155
Query: 226 KKKEKERSSHEK---EVIPKLTFKFGTDMEEKT 255
KK + +E+ +V+ K FK E+
Sbjct: 156 KKLLEYDEDYEEFRDKVLKKKVFKDKKAKEKGI 188
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 40.1 bits (94), Expect = 0.002
Identities = 30/115 (26%), Positives = 51/115 (44%), Gaps = 18/115 (15%)
Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD----KEKKVDKEKEKGKKEEKEKKEE 182
I++ K+ + + ++ K + DK K ++ K KK+ K ++ E KE
Sbjct: 333 IEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEA 392
Query: 183 -------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
K K + +K +E+IR+E + E KSK K +K KK+ EK
Sbjct: 393 IAYYESAKTALEKAEGKKAIEEIREELIE-EGLLKSKKKKRK------KKEWFEK 440
Score = 36.2 bits (84), Expect = 0.035
Identities = 27/114 (23%), Positives = 51/114 (44%), Gaps = 17/114 (14%)
Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEK-GKKEEKEKKEEKEFKMKEDKEKP----------- 194
+K ++DKIK + EKK++KE +K K EK++ E +E + ++ +
Sbjct: 273 EKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQL 332
Query: 195 VEKIRKE-----EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
+E+ K +E+ K DK K + + KK K+ + + +L
Sbjct: 333 IEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQL 386
Score = 30.4 bits (69), Expect = 2.4
Identities = 21/126 (16%), Positives = 52/126 (41%), Gaps = 15/126 (11%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG--------- 172
+K + K + + ++K + + KK + +K +++ ++ EK ++ ++KG
Sbjct: 273 EKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQL 332
Query: 173 -KKEEKE-----KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
++ K +E K++ DK K + + K+ K + + +LK+
Sbjct: 333 IEEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEA 392
Query: 227 KKEKER 232
E
Sbjct: 393 IAYYES 398
Score = 28.5 bits (64), Expect = 7.5
Identities = 12/63 (19%), Positives = 28/63 (44%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
++ + K+ K + + K E+ +E+ +++ K KK++++KKE
Sbjct: 377 GAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKE 436
Query: 182 EKE 184
E
Sbjct: 437 WFE 439
Score = 28.5 bits (64), Expect = 9.3
Identities = 20/87 (22%), Positives = 43/87 (49%), Gaps = 2/87 (2%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
+I+ DKS K + ++ K+ K K K + D+ ++ K+ + K +K E +
Sbjct: 351 IKIELDKS-KTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGK 409
Query: 179 KK-EEKEFKMKEDKEKPVEKIRKEEKD 204
K EE ++ E+ +K ++++K+
Sbjct: 410 KAIEEIREELIEEGLLKSKKKKRKKKE 436
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 40.2 bits (94), Expect = 0.002
Identities = 28/127 (22%), Positives = 70/127 (55%), Gaps = 2/127 (1%)
Query: 106 VPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV 165
+P+ + + GE K++ ++ +K + +KE E+ + +K K++ K K++ +++
Sbjct: 494 IPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEME 553
Query: 166 DKEKEKGKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
+ ++ + K+ + +KE +E +K++ E + ++++++ KE KS + KLK+ K
Sbjct: 554 ELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQ 613
Query: 224 KKKKKEK 230
K +K
Sbjct: 614 KIPQKPT 620
Score = 39.0 bits (91), Expect = 0.004
Identities = 35/117 (29%), Positives = 59/117 (50%), Gaps = 15/117 (12%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE---KKEEKEFKMKEDKEKP 194
E+ K + K++ + + K EK+++++ E +K KE K+E E +M+E KE
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKE-- 557
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKE------KLKKKKKEKERSSHEKEVIPKLTF 245
R+ K E EK++++ K LKKE +LK+KK K + E + KL
Sbjct: 558 ----RERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKE 610
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 40.4 bits (94), Expect = 0.002
Identities = 28/139 (20%), Positives = 62/139 (44%), Gaps = 16/139 (11%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
E++ +I +D S ++K + +E + + + K ++ N E D K D K K+
Sbjct: 3890 EEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKD 3949
Query: 176 EKEKKEEKEFK------------MKEDKEKPVEKIRK----EEKDSEKEKKSKDKDKKLK 219
+EK++E+E ++E+ +P + E+ ++++ KD L+
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009
Query: 220 KEKLKKKKKEKERSSHEKE 238
++ + KE + EK+
Sbjct: 4010 DMDMEAADENKEEADAEKD 4028
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 40.0 bits (94), Expect = 0.002
Identities = 38/158 (24%), Positives = 79/158 (50%), Gaps = 13/158 (8%)
Query: 76 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
+EK+ ++ K +EY K+ + ++L EI+K S +++
Sbjct: 279 EEKVKELKELKEKAEEYIKLSEF------------YEEYLDELREIEKRLSRLEEEINGI 326
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
++ +++++K+++ ++ K K KE +K + +E+ + +E K KKEE E K
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP 386
Query: 196 EKIRKEEKDSEKEKKS-KDKDKKLKKEKLKKKKKEKER 232
EK+ KE ++ EK K+ +++ K+ + KK+ KE
Sbjct: 387 EKLEKELEELEKAKEEIEEEISKITARIGELKKEIKEL 424
Score = 39.3 bits (92), Expect = 0.004
Identities = 27/129 (20%), Positives = 68/129 (52%), Gaps = 6/129 (4%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKD-----KKDKDKIKNKEKDKEKKVDKEKEKGKK 174
E+K + + ++ KK +E++ K + +K ++++ + ++ EKK +E+ + +
Sbjct: 606 ELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELR 665
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
EE + + ++ + E+ ++ + +K EK K+ ++ +K K++L+K +K ER
Sbjct: 666 EEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREK-AKKELEKLEKALERVE 724
Query: 235 HEKEVIPKL 243
+E + K
Sbjct: 725 ELREKVKKY 733
Score = 38.5 bits (90), Expect = 0.006
Identities = 37/150 (24%), Positives = 77/150 (51%), Gaps = 3/150 (2%)
Query: 116 EDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKVDKEKEKGK 173
E + E+++ + +KK+ + ++K KE + K+ ++ K+ E+ ++ + EK +
Sbjct: 259 EKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSR 318
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
EE+ E+ K E+KE+ +E+++K+ K+ EK + ++ +L E+ K KK+E ER
Sbjct: 319 LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHEL-YEEAKAKKEELERL 377
Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRESSPKI 263
+ + + EK K E +I
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEI 407
Score = 38.1 bits (89), Expect = 0.009
Identities = 22/116 (18%), Positives = 63/116 (54%), Gaps = 4/116 (3%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
++L E+ ++ + + ++E EK++K+ + ++ KE+ +E + + E +G K
Sbjct: 200 KELEEVLREINEISSELPELREELEKLEKEVKELEE----LKEEIEELEKELESLEGSKR 255
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
+ E+K + + E+ +K +E++ ++ K+ ++ K+ ++ KL + + + +E
Sbjct: 256 KLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELRE 311
Score = 38.1 bits (89), Expect = 0.010
Identities = 31/125 (24%), Positives = 72/125 (57%), Gaps = 6/125 (4%)
Query: 113 TSSEDLGEIKK--DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
+ ++LGE+ K + I++ +K K+ E + K+ +K+ +++ + + ++ + +E
Sbjct: 162 NAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELRE 221
Query: 171 KGKKEEKEKKEEKEFKMK-EDKEKPVEKIRKEEKDSE---KEKKSKDKDKKLKKEKLKKK 226
+ +K EKE KE +E K + E+ EK +E + ++ E +E + + ++ K + E+L++K
Sbjct: 222 ELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281
Query: 227 KKEKE 231
KE +
Sbjct: 282 VKELK 286
Score = 37.7 bits (88), Expect = 0.014
Identities = 38/137 (27%), Positives = 77/137 (56%), Gaps = 6/137 (4%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
K I+K+ K ++KE+ K++K+ + +K K E K K++ ++ ++ +++ K+ E+
Sbjct: 462 KRIEKELKEIEEKER-KLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEEL 520
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL----KKKKKEKERSSHEKEVI 240
K E+ EK EK+ K + + + KK +K ++LKK+ K + E+E + KE +
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE-L 579
Query: 241 PKLTFKFGTDMEEKTKR 257
+L F+ ++EE+ K
Sbjct: 580 EELGFESVEELEERLKE 596
Score = 37.4 bits (87), Expect = 0.015
Identities = 27/118 (22%), Positives = 65/118 (55%), Gaps = 3/118 (2%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDK--DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
KD + ++E++++KK +++ DK +++ EK E+ + +E KK +E+ EE +
Sbjct: 608 KDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREE 667
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
E + + +R E ++ EK ++ K + KE+L++++K K+ ++ + ++
Sbjct: 668 YLE-LSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVE 724
Score = 36.6 bits (85), Expect = 0.026
Identities = 37/138 (26%), Positives = 70/138 (50%), Gaps = 11/138 (7%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
E L + ++ KK ++ K KEK K + K K +++ E+ K+K + EK+ + E
Sbjct: 510 EKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELE 569
Query: 176 EKEKK-----EEKEFKMKEDKEKPVEKIRK------EEKDSEKEKKSKDKDKKLKKEKLK 224
E+ + EE F+ E+ E+ ++++ E KD+EKE + ++K+ K +E+L
Sbjct: 570 EELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELD 629
Query: 225 KKKKEKERSSHEKEVIPK 242
K +E + E + K
Sbjct: 630 KAFEELAETEKRLEELRK 647
Score = 36.2 bits (84), Expect = 0.035
Identities = 43/165 (26%), Positives = 82/165 (49%), Gaps = 7/165 (4%)
Query: 70 KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
K K E++ + EK +++++E + I K+T +GE+KK+ K
Sbjct: 368 KAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKIT--ARIGELKKEIKELK 425
Query: 130 DKKTHKKKEKEKI---KKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
KK K K ++ ++ + ++ + + K++ EKE + EEKE+K KE +
Sbjct: 426 KAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRI--EKELKEIEEKERKLRKELR 483
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E K ++ K ++ +E+ K+ ++K KK E+L+KK +E E
Sbjct: 484 ELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYE 528
Score = 35.4 bits (82), Expect = 0.058
Identities = 29/122 (23%), Positives = 61/122 (50%), Gaps = 5/122 (4%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
+ ++IK++ ++ + IK E +E +KEKE + + + E + + +
Sbjct: 168 GEVIKEIKRRIERL-EKFIKRTENIEELIKEKEKELEEVLREINEISSEL---PELREEL 223
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKT 255
EK+ KE K+ E+ K+ ++ +K + K+K +E+ +E I +L + ++EEK
Sbjct: 224 EKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEI-EELEEKV 282
Query: 256 KR 257
K
Sbjct: 283 KE 284
Score = 34.7 bits (80), Expect = 0.12
Identities = 31/135 (22%), Positives = 65/135 (48%), Gaps = 9/135 (6%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIK---------KKKDKKDKDKIKNKEKDKEKKVDKEK 169
EI++ + + + K+K +EKI+ KK+ ++ ++K+K ++ KEK + K
Sbjct: 238 EEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIK 297
Query: 170 EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
EE + + K E+ + I + K+ E++++ ++ KK KE K+ ++
Sbjct: 298 LSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL 357
Query: 230 KERSSHEKEVIPKLT 244
+ER +E K
Sbjct: 358 EERHELYEEAKAKKE 372
Score = 33.5 bits (77), Expect = 0.25
Identities = 32/143 (22%), Positives = 77/143 (53%), Gaps = 22/143 (15%)
Query: 116 EDLGEIKKD-KSIKKDKKTHK--KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE-- 170
E L ++K + KS+KK+ + + KK+ +++KK D+ +++ + ++ +E + +E
Sbjct: 532 EKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELE 591
Query: 171 ----------------KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKD 213
K ++E E++E++ K++E+ +K E++ + EK E+ K+ ++
Sbjct: 592 ERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEE 651
Query: 214 KDKKLKKEKLKKKKKEKERSSHE 236
+KK +E+ ++ ++E S E
Sbjct: 652 LEKKYSEEEYEELREEYLELSRE 674
Score = 33.1 bits (76), Expect = 0.32
Identities = 30/152 (19%), Positives = 70/152 (46%), Gaps = 2/152 (1%)
Query: 121 IKKDKSIKKDKKTHKK--KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
+K+ K +K+ + + K + E+ + + +K + +E+ + ++ + K+E E
Sbjct: 282 VKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE 341
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ ++K ++++ E+ E+ E+ K+++ + K+L +K +KE E KE
Sbjct: 342 ELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKE 401
Query: 239 VIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 270
I + K + E K K I+ +K
Sbjct: 402 EIEEEISKITARIGELKKEIKELKKAIEELKK 433
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 40.2 bits (95), Expect = 0.002
Identities = 24/124 (19%), Positives = 46/124 (37%)
Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
K S +KK K + K K E + KK K+ + + + K
Sbjct: 101 EKAKKNSPLFKSLLKKQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILK 160
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
+ ++E ++ + K +ED++ E + +K ++KK + KK
Sbjct: 161 IDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKK 220
Query: 228 KEKE 231
+KE
Sbjct: 221 IDKE 224
Score = 36.0 bits (84), Expect = 0.048
Identities = 29/142 (20%), Positives = 62/142 (43%), Gaps = 10/142 (7%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
KS+ K +K + K IK ++ ++D +K K K K ++ G + + ++ +
Sbjct: 111 KSLLKKQKVEVEGNKLIIKVN-NEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSK 169
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
++++ + +K+ E EK +K+ + +KK + +KKK+ K E V
Sbjct: 170 EELEK---------FEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKK 220
Query: 245 FKFGTDMEEKTKRESSPKIVIK 266
K E ++V++
Sbjct: 221 IDKEEITPMKEINEEERRVVVE 242
Score = 32.9 bits (76), Expect = 0.45
Identities = 20/100 (20%), Positives = 48/100 (48%), Gaps = 1/100 (1%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
I+KV + + +KK K + K K ++K +K + D + K + ++++ D++
Sbjct: 127 IIKVNNEIERDHLKK-KHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDEK 185
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
K E +K E ++ K ++ + ++ +K ++E
Sbjct: 186 LAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEE 225
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 39.9 bits (93), Expect = 0.003
Identities = 28/99 (28%), Positives = 51/99 (51%)
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
EK+ +EE E+K++KE K KE + K ++ +KE K + +++ D KK + K +K
Sbjct: 9 EKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRK 68
Query: 228 KEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIK 266
++ E + E + P F + + ++ SP V K
Sbjct: 69 RDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYSPSAVEK 107
Score = 34.5 bits (79), Expect = 0.14
Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 1/67 (1%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
+K KK K KEKE +KK K + + K K + + + K+ KK K E++ +
Sbjct: 20 RKKKKEEKAKEKE-LKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSRKRDVEDENPED 78
Query: 188 KEDKEKP 194
D + P
Sbjct: 79 FIDPDTP 85
Score = 34.1 bits (78), Expect = 0.18
Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 1/97 (1%)
Query: 138 EKEKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
E E KK +++ + K K +EK KEK++ K K K+ + + + ++ +K +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
K RK + + E + D D ++K + K+ S
Sbjct: 65 KSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQYS 101
Score = 33.7 bits (77), Expect = 0.23
Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 170 EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
E +++ +EE E K K++ EK EK K+ K ++KE K+K + ++ KK E
Sbjct: 5 ESEAEKKILTEEELERKKKKE-EKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSE 63
Query: 230 KERSSHEKEV 239
K+ + E
Sbjct: 64 KKSRKRDVED 73
Score = 30.6 bits (69), Expect = 2.1
Identities = 21/107 (19%), Positives = 45/107 (42%), Gaps = 13/107 (12%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
+ L E + ++ KK++K +K+ K K+K +K+ + K+ ++
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELK-------------KLKAAQKEAKAKLQAQQA 52
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
+K E+K K + E P + I + +K++ S K+
Sbjct: 53 SDGTNVPKKSEKKSRKRDVEDENPEDFIDPDTPFGQKKRLSSQMAKQ 99
Score = 30.3 bits (68), Expect = 2.4
Identities = 19/69 (27%), Positives = 40/69 (57%), Gaps = 3/69 (4%)
Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK---FGTDMEEKT 255
R E + +K ++ ++K KKE+ K+K+ K+ + +KE KL + GT++ +K+
Sbjct: 3 RTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKS 62
Query: 256 KRESSPKIV 264
+++S + V
Sbjct: 63 EKKSRKRDV 71
Score = 28.7 bits (64), Expect = 7.8
Identities = 20/86 (23%), Positives = 37/86 (43%), Gaps = 7/86 (8%)
Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
++ + + E EK++ K D+ K + K +K+ KEK +E+ K K
Sbjct: 987 LQAQGAVNAEAELEKLRNKMDEIQKQQEK---LEKKMNASGYKEKVPANIQEEDARKLTK 1043
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSK 212
+ ++ +E KE K E E +
Sbjct: 1044 LLQE----LEFFEKESKRLEAETSNS 1065
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 39.2 bits (92), Expect = 0.003
Identities = 16/52 (30%), Positives = 24/52 (46%)
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+E K K EK K K+ EKK K+K+K K++ + K +R
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKR 425
Score = 38.8 bits (91), Expect = 0.004
Identities = 18/54 (33%), Positives = 29/54 (53%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
K S KK K KK ++ +KK+ +K+K K+K + +D + + K G EE
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 37.6 bits (88), Expect = 0.011
Identities = 14/42 (33%), Positives = 19/42 (45%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK 163
K K + + K+KEKEK K KK +D I + K
Sbjct: 390 KPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 37.6 bits (88), Expect = 0.011
Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 1/68 (1%)
Query: 118 LGEIKKDKSIKKDKKTHKK-KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
+ E K + I + + K EK+ K K K K +++ ++ KV K K
Sbjct: 364 IEEPLKARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIG 423
Query: 177 KEKKEEKE 184
K +K
Sbjct: 424 KRRKPSGT 431
Score = 36.8 bits (86), Expect = 0.021
Identities = 16/58 (27%), Positives = 23/58 (39%)
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E + K + K K V R E+K+ EKEK K + K K++K
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 36.5 bits (85), Expect = 0.022
Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 1/48 (2%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK 169
S K K +KKEKEK K K K+ +D KN K ++ E+
Sbjct: 388 TGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDT-KNIGKRRKPSGTSEE 434
Score = 34.9 bits (81), Expect = 0.083
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
+KKT K +K K+ + KK+K+K K K K + + K GK+ + E+
Sbjct: 384 SEKKTGKPSKKVLAKRAE-KKEKEKEKPKVKKRHR---DTKNIGKRRKPSGTSEE 434
Score = 34.1 bits (79), Expect = 0.13
Identities = 15/51 (29%), Positives = 25/51 (49%)
Query: 164 KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
K EK+ GK +K + E K KE ++ V+K ++ K+ K +K
Sbjct: 381 KAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGT 431
Score = 34.1 bits (79), Expect = 0.14
Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 155 KNKE-KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE--KPVEKIRKEEKDSEK 207
K K +K+ +K K+ EK++KE+++ K+K+ K + K RK SE+
Sbjct: 379 KTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 33.4 bits (77), Expect = 0.22
Identities = 13/50 (26%), Positives = 25/50 (50%), Gaps = 3/50 (6%)
Query: 191 KEKPVEKIR---KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
K + ++++R K + + K SK K ++K K+K+K K + H
Sbjct: 369 KARVIDELRPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418
Score = 32.6 bits (75), Expect = 0.41
Identities = 11/49 (22%), Positives = 19/49 (38%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
KK K + ++KEK K+K K + + + K +E
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 39.2 bits (92), Expect = 0.003
Identities = 25/91 (27%), Positives = 48/91 (52%), Gaps = 16/91 (17%)
Query: 140 EKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEE-KEKKEEKEFKMKEDKEKPVEK 197
+K+ K + K+ +++E+K+ K E+ ++EE +EKKEEK+ + +E
Sbjct: 246 DKLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEERE-------- 297
Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
K K S +E++ K +KE+ K+ +K
Sbjct: 298 -AKLAKLSPEEQR-----KLEEKERKKQARK 322
Score = 37.2 bits (87), Expect = 0.011
Identities = 20/80 (25%), Positives = 43/80 (53%), Gaps = 4/80 (5%)
Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK 191
K K ++ +K DK +++ + K E++ +E ++ K+E+K+++ E K K
Sbjct: 247 KLANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREA----KLAK 302
Query: 192 EKPVEKIRKEEKDSEKEKKS 211
P E+ + EEK+ +K+ +
Sbjct: 303 LSPEEQRKLEEKERKKQARK 322
Score = 36.8 bits (86), Expect = 0.015
Identities = 26/83 (31%), Positives = 49/83 (59%), Gaps = 8/83 (9%)
Query: 152 DKIKNKE--KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
DK+ N + + +KVDK +E+ +EEK K +E + +E +EK E+ +KEE + K
Sbjct: 246 DKLANLKLSPEVLRKVDKTREE--EEEKILKAAEEERQEEAQEK-KEEKKKEE---REAK 299
Query: 210 KSKDKDKKLKKEKLKKKKKEKER 232
+K ++ +K + K++KK+ +
Sbjct: 300 LAKLSPEEQRKLEEKERKKQARK 322
Score = 36.1 bits (84), Expect = 0.028
Identities = 16/59 (27%), Positives = 38/59 (64%), Gaps = 5/59 (8%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
++ +K K +++ +E+ ++KK++K K+ + + K K+ E+++ K EEKE+K++
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKE----EREAKLAKLSPEEQR-KLEEKERKKQ 319
Score = 32.6 bits (75), Expect = 0.36
Identities = 18/69 (26%), Positives = 42/69 (60%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
+ ++K KT +++E++ +K ++++ ++ + KE+ K+++ + + K EE+ K EE
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE 313
Query: 183 KEFKMKEDK 191
KE K + K
Sbjct: 314 KERKKQARK 322
Score = 29.1 bits (66), Expect = 4.9
Identities = 9/49 (18%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKK-----LKKEKLKKKKKEKERSS 234
+ + K+ K ++ E++ ++++ KKE+ KK+++E + +
Sbjct: 254 SPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 39.3 bits (91), Expect = 0.004
Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 5/139 (3%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK----EEKEKKEEKEF 185
+KK+ K K+KEK +K+K+ +DKDK K E K + + E E
Sbjct: 196 EKKSKKPKKKEKKEKEKE-RDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTV 254
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 245
+P E E ++++K K K K ++ +KE+ KKKKK H + +
Sbjct: 255 SGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPV 314
Query: 246 KFGTDMEEKTKRESSPKIV 264
+ G EE SS K++
Sbjct: 315 QNGAVEEEPLPPMSSYKLL 333
Score = 32.3 bits (73), Expect = 0.57
Identities = 27/124 (21%), Positives = 51/124 (41%), Gaps = 1/124 (0%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE-KGKKEEKEKK 180
K+ K +K++ KKKE E K D + + G + E
Sbjct: 205 KEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPD 264
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
E K+ + +E K+ P K +K+ K+ E++KK K ++ + + E+E +
Sbjct: 265 EPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEEPL 324
Query: 241 PKLT 244
P ++
Sbjct: 325 PPMS 328
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 39.2 bits (91), Expect = 0.004
Identities = 38/194 (19%), Positives = 86/194 (44%), Gaps = 18/194 (9%)
Query: 79 MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 138
+ ++ P E+E +LG V I ++ + I++ K +K +
Sbjct: 245 LKLEDNYEPSEQEL------KLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIK 298
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK--EEKEKKEEKEFKMKEDKEKPVE 196
+ K + K D +K +N ++K + K +K E E KEE+ ++ + ++ +
Sbjct: 299 TLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHK 358
Query: 197 KIRK--------EEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKERSSHEK-EVIPKLTFK 246
++RK E + E+EK +++ DK ++ +KL K K ++ + + + K +
Sbjct: 359 QLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQ 418
Query: 247 FGTDMEEKTKRESS 260
+ + ++ T+ S
Sbjct: 419 YDSLIQNITRSRSQ 432
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 39.3 bits (92), Expect = 0.004
Identities = 26/164 (15%), Positives = 66/164 (40%), Gaps = 5/164 (3%)
Query: 122 KKDKSIKKDKKTHKKKEK--EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+ + K ++ + E +I++K ++ +K +++ +E + + K + + EK
Sbjct: 795 EIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEK 854
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
+ E KE+ E+ +E++ +D E K++ + +L++ +++ E + E
Sbjct: 855 EIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914
Query: 240 IPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 283
K + +E + S + P P D
Sbjct: 915 KRKRLSELKAKLEALEEELSE---IEDPKGEDEEIPEEELSLED 955
Score = 37.7 bits (88), Expect = 0.013
Identities = 28/134 (20%), Positives = 62/134 (46%), Gaps = 11/134 (8%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+S +GEI+K+ + ++ +K KE++++ ++ + + E K + + E +
Sbjct: 714 ASRKIGEIEKEIEQLEQEE---EKLKERLEELEEDLSSLEQE-IENVKSELKELEARIEE 769
Query: 174 KEEKEKKEEKEFKMKEDK--EKPVEKIRKEEKDSEKEKKSKDK-----DKKLKKEKLKKK 226
EE K E+ E + + +I+ E E+E + ++KL + L+K+
Sbjct: 770 LEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKE 829
Query: 227 KKEKERSSHEKEVI 240
EKE +++ I
Sbjct: 830 YLEKEIQELQEQRI 843
Score = 36.2 bits (84), Expect = 0.035
Identities = 20/147 (13%), Positives = 67/147 (45%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
++ E +K + ++ K+E E ++ + ++ DK+ + +++ K+ + EK K +
Sbjct: 341 LEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
E +++ ++ + + + + + E + +++K+ K ++KK++ + E+
Sbjct: 401 INELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQL 460
Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRESS 260
+ + + + + + K S
Sbjct: 461 AADLSKYEQELYDLKEEYDRVEKELSK 487
Score = 35.4 bits (82), Expect = 0.072
Identities = 29/125 (23%), Positives = 61/125 (48%), Gaps = 4/125 (3%)
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
+L E+ K+ + +D+ K +EK++K K + ++ K + +E + E+
Sbjct: 372 ELEEVDKEFAETRDEL---KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAA 428
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK-DKKLKKEKLKKKKKEKERSSH 235
E K +++E+KE +I+K+E E+ K +++L K + + EKE S
Sbjct: 429 IAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKL 488
Query: 236 EKEVI 240
++E+
Sbjct: 489 QRELA 493
Score = 33.9 bits (78), Expect = 0.22
Identities = 19/103 (18%), Positives = 54/103 (52%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
+ E +++ +K+++ +E+ + + + +K + EE E++ E+E K ++ +
Sbjct: 300 EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEY 359
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
++++E +D E + DK+ +++LK +++ E+ E
Sbjct: 360 AELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402
Score = 32.7 bits (75), Expect = 0.40
Identities = 23/141 (16%), Positives = 62/141 (43%), Gaps = 15/141 (10%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKK--------------KEKEKIKKKKDKKDKDKIKNKEKD 160
E L +K++ S + + + ++ +I+K+ ++ ++++ K KE+
Sbjct: 680 RERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERL 739
Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE-KEKKSKDKDKKLK 219
+E + D + + E + + ++ E+ E+ + K+ + D E + S+ + + +
Sbjct: 740 EELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAE 799
Query: 220 KEKLKKKKKEKERSSHEKEVI 240
KL+++ E E E
Sbjct: 800 LSKLEEEVSRIEARLREIEQK 820
Score = 31.6 bits (72), Expect = 1.1
Identities = 28/113 (24%), Positives = 51/113 (45%), Gaps = 12/113 (10%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF-K 186
KK+K + +E E+ ++ D +K + E+ + ++ E+ + +EK + E E K
Sbjct: 171 KKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLK 230
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
KE E+ E I ++ E+E EKL ++ E E+ E E
Sbjct: 231 EKEALERQKEAIERQLASLEEE-----------LEKLTEEISELEKRLEEIEQ 272
Score = 30.4 bits (69), Expect = 2.1
Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 10/89 (11%)
Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKK 225
KEK + EE E+ E+ + ++K + +E++R+E + +E+ + + ++ + E LK+
Sbjct: 172 KEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKE 231
Query: 226 KKK--------EKERSSHEKEVIPKLTFK 246
K+ E++ +S E+E + KLT +
Sbjct: 232 KEALERQKEAIERQLASLEEE-LEKLTEE 259
Score = 30.4 bits (69), Expect = 2.4
Identities = 30/177 (16%), Positives = 71/177 (40%), Gaps = 26/177 (14%)
Query: 77 EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS--EDLGEIKKDKSIKKDKKTH 134
E S + + E SK+++ V I + L + +K + +
Sbjct: 785 EARLSHSRIPEIQAELSKLEEE-------VSRIEARLREIEQKLNRLTLEKEYLEKEIQE 837
Query: 135 KKKEKEKIKKKKD---------KKDKDKIKNKEKDKEKKV-DKEKEKGKKEEKEKKEEKE 184
++++ +K++ K++++ + ++ E + D E G +++ + E +
Sbjct: 838 LQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQ 897
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 241
+ E K +E + ++K+ + + K K E L+++ E E E E IP
Sbjct: 898 LRELERK-------IEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIP 947
Score = 30.4 bits (69), Expect = 2.7
Identities = 14/102 (13%), Positives = 48/102 (47%), Gaps = 4/102 (3%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKE----KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
++ E+K++ +++ +E I + K ++ + + ++K E K + K +
Sbjct: 399 REINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE 458
Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
+ K E++ + +KE+ ++ +++ K +++ + +
Sbjct: 459 QLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQAR 500
Score = 29.3 bits (66), Expect = 5.7
Identities = 25/109 (22%), Positives = 57/109 (52%), Gaps = 5/109 (4%)
Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
I +K+ + ++ + + ++ ++ + + KEK + + E K++E + KE E+ + + +
Sbjct: 192 IIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEE 251
Query: 201 E-----EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
E E+ SE EK+ ++ ++ L++ K K +E KE I +L
Sbjct: 252 ELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELE 300
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 37.8 bits (88), Expect = 0.005
Identities = 25/80 (31%), Positives = 49/80 (61%), Gaps = 5/80 (6%)
Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 201
K+ KD K ++ ++ K+ E +KE++KGK+E+ E +E ++ ++ ED E+ K
Sbjct: 2 EKECKDAKHENMEEDCCKENE---NKEEDKGKEEDLEFEEIEKEEIIEDSEE--SNEVKI 56
Query: 202 EKDSEKEKKSKDKDKKLKKE 221
E+ ++ K K+++KKL+ E
Sbjct: 57 EELKDENNKLKEENKKLENE 76
Score = 36.3 bits (84), Expect = 0.015
Identities = 21/73 (28%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
EK+ + + E+ +E E KEE ++++ E+I KEE + E+ ++ K ++
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEED---KGKEEDLEFEEIEKEEIIEDSEESNEVKIEE 58
Query: 218 LKKEKLKKKKKEK 230
LK E K K++ K
Sbjct: 59 LKDENNKLKEENK 71
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 38.8 bits (91), Expect = 0.006
Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 4/83 (4%)
Query: 147 DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK--EKKEEKEFKMKEDKEKPVEKIRKEEKD 204
DK + + +E + E++K +K+++ EKK EKE K +K E K+ K
Sbjct: 389 DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKG 448
Query: 205 SEKEKKSKDKDKKLKKEKLKKKK 227
+ E K D D EKL + +
Sbjct: 449 PDGETKKVDPD--PLGEKLARTE 469
Score = 37.6 bits (88), Expect = 0.014
Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
+E+E ++ K++ K++ EKK +KE+ + K K+K E K K +
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAE-KAAAKKKAEAAAKKAKGPDGET 453
Score = 36.1 bits (84), Expect = 0.034
Identities = 16/54 (29%), Positives = 26/54 (48%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
+ +L ++ K KK +K KK EKE+ +K KK + K K + + K
Sbjct: 403 NGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKV 456
Score = 35.7 bits (83), Expect = 0.052
Identities = 16/53 (30%), Positives = 26/53 (49%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
KK KK+ K + K +K++ +K K K + KK + KK + + EK
Sbjct: 412 KKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDPLGEK 464
Score = 35.3 bits (82), Expect = 0.063
Identities = 18/59 (30%), Positives = 28/59 (47%)
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E +E++ E +++K +K RK EK +EKE+ K KK + KK K
Sbjct: 393 LAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDG 451
Score = 31.8 bits (73), Expect = 0.70
Identities = 12/51 (23%), Positives = 26/51 (50%)
Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
+KK ++K +K + K +K++ + K+ + +K KG E +K +
Sbjct: 408 PAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDP 458
Score = 30.7 bits (70), Expect = 1.7
Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 1/48 (2%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
E++K++KK+ +K + K + +E +K K + KK + E K
Sbjct: 408 PAERKKLRKKQ-RKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETK 454
Score = 29.9 bits (68), Expect = 3.1
Identities = 12/49 (24%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
E++E + E+ ++K+ K + KK +KE+ +K +K+ + K
Sbjct: 397 EEEEGENGNLSPAERKKLRKKQRKAE-KKAEKEEAEKAAAKKKAEAAAK 444
Score = 29.5 bits (67), Expect = 4.1
Identities = 16/72 (22%), Positives = 28/72 (38%), Gaps = 3/72 (4%)
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLT 244
E E+ + KK + KK +K + K +K+E E+++ +K+
Sbjct: 388 HDKPLLAEGEEEEGENGNLSPAERKKLR---KKQRKAEKKAEKEEAEKAAAKKKAEAAAK 444
Query: 245 FKFGTDMEEKTK 256
G D E K
Sbjct: 445 KAKGPDGETKKV 456
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 39.0 bits (91), Expect = 0.006
Identities = 26/130 (20%), Positives = 74/130 (56%), Gaps = 3/130 (2%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
E L ++++ + + +EK ++ K +++ K +++ + ++ EK+++K E+ K+
Sbjct: 328 EERLEKLEEKLEKLESELEELAEEKNELAKLLEERLK-ELEERLEELEKELEKALERLKQ 386
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
E+ +E KE + +E+I++E ++ EKE + +++ + +E++KK +++ +
Sbjct: 387 LEEAIQELKEELAELSAA--LEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLE 444
Query: 235 HEKEVIPKLT 244
++ +I +L
Sbjct: 445 SKELMIAELA 454
Score = 37.4 bits (87), Expect = 0.014
Identities = 24/147 (16%), Positives = 70/147 (47%), Gaps = 8/147 (5%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
L E+ K+ K ++ + E + + + ++++K +K +E + ++E+E+ ++E +
Sbjct: 176 LKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIE 235
Query: 178 EKKE-----EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+E E+E + E+ + + +I E ++ K ++ + ++ + L++ +++ ER
Sbjct: 236 ALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELE---RLLEELEEKIER 292
Query: 233 SSHEKEVIPKLTFKFGTDMEEKTKRES 259
+ I +L + + E
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEE 319
Score = 37.4 bits (87), Expect = 0.017
Identities = 26/163 (15%), Positives = 78/163 (47%), Gaps = 16/163 (9%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
++L E+ + + + +++ ++KE+ +++++ ++ +K+ + +E+ E +E K +
Sbjct: 471 HEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEE 530
Query: 174 KEEKEKKEEK---EFKMKEDK-------------EKPVEKIRKEEKDSEKEKKSKDKDKK 217
EEK +K E E + ++K E ++++++ ++ + K++ ++
Sbjct: 531 LEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEE 590
Query: 218 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS 260
L++ + KKK KE ++ L ++ E + +
Sbjct: 591 LRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEE 633
Score = 36.3 bits (84), Expect = 0.032
Identities = 28/166 (16%), Positives = 78/166 (46%), Gaps = 21/166 (12%)
Query: 122 KKDKSIKKDKKTHKKKEKE------KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
++ + +++ K KKK KE ++++ + + +N+ ++ E++++ E EK +
Sbjct: 586 EELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQ 645
Query: 176 E--KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK---------KEKLK 224
+E + +++E E+ +IR+E + E E++ ++K ++L+ +E+L+
Sbjct: 646 AELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELE 705
Query: 225 KKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS 270
+ K+ E + ++EE K + ++ ++
Sbjct: 706 ELLKKLGEIEQLIEELESRK----AELEELKKELEKLEKALELLEE 747
Score = 35.9 bits (83), Expect = 0.044
Identities = 22/103 (21%), Positives = 54/103 (52%), Gaps = 3/103 (2%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
+ H+K+ E + + ++ +++ + KE+ + ++ +E EK +E +E+ E +
Sbjct: 468 PEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEAL 527
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
KE E K EK ++ ++ +KL+ ++LK++ ++ E
Sbjct: 528 KE---ELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDR 567
Score = 35.9 bits (83), Expect = 0.046
Identities = 22/127 (17%), Positives = 65/127 (51%), Gaps = 4/127 (3%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK- 174
+L E++++ +++ + ++ I + +K + +E +E + + + +
Sbjct: 424 RELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELEL 483
Query: 175 ---EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
EE+ +E++E +++E+ E+ +++R+ E++ + + ++ K+ +EKL+K + E
Sbjct: 484 EELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLE 543
Query: 232 RSSHEKE 238
KE
Sbjct: 544 ELEELKE 550
Score = 35.1 bits (81), Expect = 0.072
Identities = 18/115 (15%), Positives = 57/115 (49%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
E L +++ ++ K + ++ K + ++ + ++K ++ +++E +
Sbjct: 532 EEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEEL 591
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
E+ K+ +K+ K E++ +E++ + + SE E + ++ +++L+ E K +
Sbjct: 592 RERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQA 646
Score = 34.7 bits (80), Expect = 0.12
Identities = 28/151 (18%), Positives = 75/151 (49%), Gaps = 2/151 (1%)
Query: 89 EKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD- 147
++ ++K L + + +K+ E + + +++ + ++ E+E + +++
Sbjct: 248 KERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEEL 307
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
+ + ++ E+ EK E+ K EEK +K E E + +++ + K+ EE+ E
Sbjct: 308 EGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKL-LEERLKEL 366
Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
E++ ++ +K+L+K + K+ E+ ++E
Sbjct: 367 EERLEELEKELEKALERLKQLEEAIQELKEE 397
Score = 33.2 bits (76), Expect = 0.30
Identities = 22/102 (21%), Positives = 56/102 (54%), Gaps = 3/102 (2%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
+K+ E++K + + + +++ + +E+ + E+ + EEK ++ E+ + E+ E+
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305
Query: 194 PVEKIRKEEKDSE---KEKKSKDKDKKLKKEKLKKKKKEKER 232
+E +R ++ E ++ KS ++ + +EKL+K + E E
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEE 347
Score = 32.0 bits (73), Expect = 0.79
Identities = 25/107 (23%), Positives = 57/107 (53%), Gaps = 1/107 (0%)
Query: 135 KKKEKEKIKKKKDKKDK-DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
K KE+++I + +K +K+ K+ K+ + E+ + + E E+ E ++ +E+
Sbjct: 152 KPKERKEILDELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEE 211
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
E + EE E+E++ +++ + +E+L + ++EKER K +
Sbjct: 212 LKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARL 258
Score = 31.7 bits (72), Expect = 0.98
Identities = 24/105 (22%), Positives = 54/105 (51%), Gaps = 3/105 (2%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE-KEKGKKEEKEKKEEKEFKMKEDKEKP 194
+ KE IK+ K K ++ + + E ++ + E E+ KE K+ +E +E + +E+ E+
Sbjct: 174 ELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQE 233
Query: 195 VEKIRKEEKDSEKEKKSKDKDKK--LKKEKLKKKKKEKERSSHEK 237
+E + + + E+EK+ ++ K L+ E L+ + + +
Sbjct: 234 IEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRE 278
Score = 31.7 bits (72), Expect = 1.0
Identities = 25/124 (20%), Positives = 69/124 (55%), Gaps = 3/124 (2%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
+ +++ + EK ++ + ++ + ++ E+ E+ + + + ++ E E++ E++ +
Sbjct: 630 EAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEE 689
Query: 188 KEDKEKPVEKIRKE-EKDSEKEKKSKDKDKKLK--KEKLKKKKKEKERSSHEKEVIPKLT 244
E E+ +E++R+E E+ +K + + ++L+ K +L++ KKE E+ E++ +L
Sbjct: 690 LEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEELR 749
Query: 245 FKFG 248
K G
Sbjct: 750 EKLG 753
Score = 30.5 bits (69), Expect = 2.0
Identities = 22/130 (16%), Positives = 62/130 (47%), Gaps = 9/130 (6%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK-------- 171
+ K ++K + E++++ K+K ++K + + E ++ + KE
Sbjct: 523 LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLR 582
Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKI-RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
+KEE E+ E+ ++K+ ++ E++ + EE E + + + +E+L+ + ++
Sbjct: 583 TRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKL 642
Query: 231 ERSSHEKEVI 240
+ +E++
Sbjct: 643 NLQAELEELL 652
Score = 30.5 bits (69), Expect = 2.2
Identities = 15/99 (15%), Positives = 49/99 (49%), Gaps = 3/99 (3%)
Query: 145 KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
+ K+ + K K ++ E ++ + E + + +EE + + K + +++ ++EE+
Sbjct: 174 ELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELK---ELKKLEEIQEEQEEEEL 230
Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
++ + +++ +L++EK + ++ + E + L
Sbjct: 231 EQEIEALEERLAELEEEKERLEELKARLLEIESLELEAL 269
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 38.7 bits (90), Expect = 0.006
Identities = 24/105 (22%), Positives = 43/105 (40%), Gaps = 5/105 (4%)
Query: 122 KKDKSIKKDKKTHKKK--EKEKIKKKKDKKDKDKIKN--KEKDKEKKVDKEKEKGKKEEK 177
K K+ KK K H K ++ K + K K +N E E + + +K K
Sbjct: 3 KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSK 62
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
+ +E+++ E I K +S+K ++ +K +KE
Sbjct: 63 LYSDVDSSSSEEEEDG-SESISKLNVNSKKISLNQVSTQKWRKEL 106
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 37.4 bits (87), Expect = 0.006
Identities = 22/81 (27%), Positives = 53/81 (65%)
Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
KK K + K++ K+ ++++ +E+E+ ++++ E+K E E K +E+ E+ EK ++EE+
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Query: 204 DSEKEKKSKDKDKKLKKEKLK 224
E+E++++ + ++ +K K
Sbjct: 61 RKEREEQARKEQEEYEKLKSS 81
Score = 35.8 bits (83), Expect = 0.021
Identities = 21/81 (25%), Positives = 53/81 (65%), Gaps = 1/81 (1%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
K KK+ K + K+ + ++ ++ + + ++++K +K + + K+EE+ ++E ++ K +E+
Sbjct: 2 KIGAKKRAKLEEKQAR-RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Query: 191 KEKPVEKIRKEEKDSEKEKKS 211
+++ E+ RKE+++ EK K S
Sbjct: 61 RKEREEQARKEQEEYEKLKSS 81
Score = 32.4 bits (74), Expect = 0.27
Identities = 23/70 (32%), Positives = 46/70 (65%), Gaps = 1/70 (1%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
KK K +EK + ++ + EE+E + ++ E+ E RKEE++ E E++ K K+++ +KE
Sbjct: 5 AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELE-EEREKKKEEEERKE 63
Query: 222 KLKKKKKEKE 231
+ ++ +KE+E
Sbjct: 64 REEQARKEQE 73
Score = 31.2 bits (71), Expect = 0.68
Identities = 19/70 (27%), Positives = 48/70 (68%), Gaps = 2/70 (2%)
Query: 168 EKEKGKKEEKE-KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
K++ K EEK+ +++++E + +E +E+ + ++E + E+E+ ++++KK K+E+ +K+
Sbjct: 5 AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKK-KEEEERKE 63
Query: 227 KKEKERSSHE 236
++E+ R E
Sbjct: 64 REEQARKEQE 73
>gnl|CDD|237869 PRK14962, PRK14962, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 472
Score = 38.2 bits (89), Expect = 0.007
Identities = 31/138 (22%), Positives = 65/138 (47%), Gaps = 16/138 (11%)
Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKT----------HKKKEKEKIKKKKDKKDKD---KI 154
+I++V S L I ++ ++K+ + + ++K+ + +
Sbjct: 298 DIIQV--SRQLLNILREIKFAEEKRLVCKLGSASIATRFSSPNVQENDVEEKNDNSNVQQ 355
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE-KIRKEEKDSEKEKKSKD 213
K K+K++ K ++++E + E++ K+ +E K K D V + + E+D +K K S D
Sbjct: 356 KEKKKEESKAKEEKQEDIEFEKRFKELMEELKEKGDLSIFVALSLSEVEEDGDKVKISFD 415
Query: 214 KDKKLKKEKLKKKKKEKE 231
K + E +K+K E E
Sbjct: 416 SSKAMHYELMKEKLPELE 433
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 38.3 bits (89), Expect = 0.007
Identities = 27/131 (20%), Positives = 59/131 (45%), Gaps = 1/131 (0%)
Query: 103 TPMVPNIVKVTSSEDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 161
++ VKV S++ + K+ + K + KT K K K K+ +
Sbjct: 169 KSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKE 228
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
+K+ + E KEE E++ K + ED+ + ++E + E + + D + + E
Sbjct: 229 KKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETE 288
Query: 222 KLKKKKKEKER 232
+ +K+K+++ +
Sbjct: 289 EKEKEKRKRLK 299
Score = 33.3 bits (76), Expect = 0.29
Identities = 34/177 (19%), Positives = 76/177 (42%), Gaps = 11/177 (6%)
Query: 54 PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 113
S ++ + S K +KEK KE S VK+ + ++
Sbjct: 199 KTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRD-VILED 257
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
S + + +D+ + K + ++ + E+ ++K+K+ + ++K +D++ +
Sbjct: 258 ESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDED----------E 307
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
EE E E + +E +E + K+E++ E+ S D ++ + ++ KKK K
Sbjct: 308 DEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRVMKKKTFK 364
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 38.5 bits (90), Expect = 0.007
Identities = 25/112 (22%), Positives = 45/112 (40%), Gaps = 21/112 (18%)
Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDSEKE 208
K K N K+++ G K KE+++ + ++++ P+E +KE
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLE--------YDKE 263
Query: 209 KKSKDKDKKLKKEKL-----------KKKKKEKERSSHEKEVIPKLTFKFGT 249
K K KD K K +KL K K + + + + I + + GT
Sbjct: 264 KLEKIKDLKEKAKKLGISIILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGT 315
Score = 35.5 bits (82), Expect = 0.061
Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 5/71 (7%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKVDKEKEKGKKEEKEKKEE 182
D IK + K+EK + +K+ + K D +++ + +KEK E
Sbjct: 208 DTLIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLE- 266
Query: 183 KEFKMKEDKEK 193
K+K+ KEK
Sbjct: 267 ---KIKDLKEK 274
Score = 33.9 bits (78), Expect = 0.16
Identities = 19/93 (20%), Positives = 37/93 (39%), Gaps = 13/93 (13%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK--------KVDKEKEKGKKEE 176
KS + + +K + +K +K++ K + E + DKEK + K+
Sbjct: 212 KSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDL 271
Query: 177 KEKKEEKEFKMKE-----DKEKPVEKIRKEEKD 204
KEK ++ + + KI+ E+ D
Sbjct: 272 KEKAKKLGISIILFDDMTKNKTTNYKIQNEDPD 304
Score = 33.9 bits (78), Expect = 0.21
Identities = 23/100 (23%), Positives = 40/100 (40%), Gaps = 10/100 (10%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK--KVDKEKEKGKK--- 174
E K + S + KE++K ++ + + E DKEK K+ KEK KK
Sbjct: 221 EEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGI 280
Query: 175 -----EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
++ K + +K++ + + I S K K
Sbjct: 281 SIILFDDMTKNKTTNYKIQNEDPDFITSIVYTSGTSGKPK 320
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 38.2 bits (89), Expect = 0.008
Identities = 27/202 (13%), Positives = 65/202 (32%), Gaps = 31/202 (15%)
Query: 72 KTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDK 131
K ++ SK ++ + + +I EI D I +
Sbjct: 764 KARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDY-ILELL 822
Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KMKE 189
+T K + K+ I+ K+ + + ++K K + + +
Sbjct: 823 QTFNDKNGA------YESLKELIERIIKEWIEDLRQKK----------KLIERLIEAINQ 866
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK------------KEKERSSHEK 237
+ K ++ K E+ KK ++ + ++E L +++ K E +
Sbjct: 867 YRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELV 926
Query: 238 EVIPKLTFKFGTDMEEKTKRES 259
E + K + ++ +E
Sbjct: 927 EKLAKEKSLREKNKDDWKAKEE 948
Score = 34.3 bits (79), Expect = 0.15
Identities = 27/120 (22%), Positives = 45/120 (37%), Gaps = 11/120 (9%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK- 173
E + I K+ +K + + + K D + E+ +E+ K
Sbjct: 836 KELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFA 895
Query: 174 KEEKEKKEEKEFKM---------KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
+EE +EE F E+ + VEK+ KE+ EK K K+ + KLK
Sbjct: 896 EEEGLNEEELAFYDDLALNGGKLPENGTELVEKLAKEKSLREKNKDDWKA-KEEVEAKLK 954
Score = 30.8 bits (70), Expect = 1.6
Identities = 25/145 (17%), Positives = 51/145 (35%), Gaps = 9/145 (6%)
Query: 82 KEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEK 141
E +E + ++++ + + + ++ + K +
Sbjct: 453 VELELIEESIKEEAEELDE-RIEEITEDILEKIKKKTKNLEFLAMLAVRLIRAAKDIYDH 511
Query: 142 IKKKKDKKDKDKI-----KNKEKDKEKKVDKEKEKGKKEEKE---KKEEKEFKMKEDKEK 193
KK++ K + K + E ++ + KE E K EF+ DK++
Sbjct: 512 FKKEEVFDLKAMVVASSRKVAVELYEAEIAARLDWHSKESLEGAIKDYNTEFETDFDKKQ 571
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKL 218
K++ E+KD K K KD L
Sbjct: 572 SHAKLKDEKKDLIKRFKLKDDPLDL 596
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 38.2 bits (89), Expect = 0.008
Identities = 22/62 (35%), Positives = 40/62 (64%), Gaps = 2/62 (3%)
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
EK+ EKE +E++E+ EK KEE++ +K+ + +++ K+EK KK KK KE ++ +
Sbjct: 28 EKEVEKEVPDEEEEEEKEEK--KEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
Query: 238 EV 239
+
Sbjct: 86 LL 87
Score = 34.0 bits (78), Expect = 0.15
Identities = 26/71 (36%), Positives = 42/71 (59%), Gaps = 8/71 (11%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
KE +KE D+E+E+ K+E+KE++E+ K +E E +EEK+ +K+K K K+
Sbjct: 29 KEVEKEVP-DEEEEEEKEEKKEEEEKTTDKEEEVDE-------EEEKEEKKKKTKKVKET 80
Query: 217 KLKKEKLKKKK 227
+ E L K K
Sbjct: 81 TTEWELLNKTK 91
Score = 34.0 bits (78), Expect = 0.17
Identities = 27/67 (40%), Positives = 41/67 (61%), Gaps = 8/67 (11%)
Query: 165 VDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
V+KE EK +E+E++E++E K +E+K E EE D E+EK+ K KK K +K+K
Sbjct: 27 VEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKE----EEVDEEEEKEEK---KK-KTKKVK 78
Query: 225 KKKKEKE 231
+ E E
Sbjct: 79 ETTTEWE 85
Score = 32.4 bits (74), Expect = 0.47
Identities = 21/52 (40%), Positives = 40/52 (76%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
EKE K+ D++++++ + K++++EK DKE+E ++EEKE+K++K K+KE
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 38.2 bits (89), Expect = 0.009
Identities = 25/77 (32%), Positives = 35/77 (45%), Gaps = 4/77 (5%)
Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
++K KK D K+ GK+ + KK E EK + +KD+ K KK K +K
Sbjct: 738 AREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPK-AAKKDARKAKKPSAKTQK 796
Query: 218 LK---KEKLKKKKKEKE 231
+ K K KKK E
Sbjct: 797 IAAATKAKRAAKKKVAE 813
Score = 33.2 bits (76), Expect = 0.33
Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 2/80 (2%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
K +EK KK K K + K + D KK + + ++ K K+ K
Sbjct: 736 KTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKT- 794
Query: 196 EKIRKEEKDSEKEKKSKDKD 215
+KI K +++ K K +
Sbjct: 795 QKIAAATK-AKRAAKKKVAE 813
Score = 32.0 bits (73), Expect = 0.73
Identities = 21/102 (20%), Positives = 43/102 (42%), Gaps = 12/102 (11%)
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD--KIKNKEKDKEKKVDKEKEKGKK 174
D I +++ + KT ++K K+ KK K + K N E D + +K+ +
Sbjct: 721 DFSLISSERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAA 780
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
++ +K +KP K +K ++ ++ +K K
Sbjct: 781 KKDARK----------AKKPSAKTQKIAAATKAKRAAKKKVA 812
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 37.5 bits (87), Expect = 0.011
Identities = 17/100 (17%), Positives = 44/100 (44%), Gaps = 1/100 (1%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
+ K+ +K + K K +I K + E+++++E + + + + +K K+
Sbjct: 330 AEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKG 389
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
++ E+ SE E +SK +++ + +E +
Sbjct: 390 LIDASPNEDTPSENE-ESKGSPPQVEATTTAEPNREPSQE 428
Score = 32.5 bits (74), Expect = 0.46
Identities = 16/99 (16%), Positives = 38/99 (38%), Gaps = 5/99 (5%)
Query: 120 EIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD--KEKEKGKKEE 176
EIKK + K + K KK+ +K + + + +++ + + + + KK
Sbjct: 331 EIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGL 390
Query: 177 KEKKE--EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
+ + + +E K P + + +E +D
Sbjct: 391 IDASPNEDTPSENEESKGSPPQVEATTTAEPNREPSQED 429
Score = 31.0 bits (70), Expect = 1.3
Identities = 22/142 (15%), Positives = 55/142 (38%), Gaps = 4/142 (2%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
+ +++K+KEK + + ++ E + KK +K E + + KE
Sbjct: 291 RAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEI 350
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 250
+K R+ +++ +E+ + + ++ +K KKK +S ++ + G+
Sbjct: 351 AQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENEESKGSP 410
Query: 251 MEEKTKRESSPKIVIKPVKSPS 272
+ + + P S
Sbjct: 411 PQVEATTTAEP----NREPSQE 428
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 37.7 bits (88), Expect = 0.011
Identities = 34/200 (17%), Positives = 88/200 (44%), Gaps = 28/200 (14%)
Query: 54 PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 113
T ++ S ++ E++ + KE + V+ + + +++
Sbjct: 99 AKATSVAAEATTPKSIQELVEALEELLEEL-----LKETASDPVVQ-ELVSIFNDLIDS- 151
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKE--KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
ED + + I K+ + K+ ++K +++++ + +K K ++ K+++E
Sbjct: 152 IKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLA 211
Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKIRK---EEKDSEKEKKSKDKDKKLKKE------- 221
+ ++ E+ +++ + E+ E++RK E+ E E++++ ++KLK E
Sbjct: 212 RLESKEAALEK---QLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIE 268
Query: 222 ------KLKKKKKEKERSSH 235
K K+K E+ER+
Sbjct: 269 LQREFNKEIKEKVEEERNGR 288
Score = 36.9 bits (86), Expect = 0.019
Identities = 41/195 (21%), Positives = 82/195 (42%), Gaps = 13/195 (6%)
Query: 53 HPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKV 112
P P + V+ K E K E +V + E T +
Sbjct: 54 DPTLRDPQISVVSSVTKLSKGPKDEPKQVKIPRQSGLSE--EVAEAEAKATSVAAEATTP 111
Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKK-KDKKDKDKIKNKEKDKEKKVDKEKEK 171
S ++L E ++ + K+T +++ D D K N + D E + KE+
Sbjct: 112 KSIQELVEALEELLEELLKETASDPVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEE 171
Query: 172 -----GKKEEKEKKEEKEFK--MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
K E + +EE+E + +KE +E+ + K+ +E + K++ + K+L+ E
Sbjct: 172 LDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALE-KQLRLEF-- 228
Query: 225 KKKKEKERSSHEKEV 239
+++KE+ R +E+++
Sbjct: 229 EREKEELRKKYEEKL 243
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 37.2 bits (87), Expect = 0.017
Identities = 33/149 (22%), Positives = 59/149 (39%), Gaps = 20/149 (13%)
Query: 89 EKEYSKVKDV---------ELGVTPMVP-NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 138
EKE+ +K++ E+ ++P P +V V S E ++ ++ K +E
Sbjct: 721 EKEFELLKEIISAIRNLRAEMNLSPSAPLKVVLVGSEELEDRLEANEDDIK--GLANLEE 778
Query: 139 KEKIKKKKDK-----KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK-- 191
E + ++ +D E + E++ +K EKE E K
Sbjct: 779 LEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKLS 838
Query: 192 -EKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
E V K +E + EKEK ++ + K K
Sbjct: 839 NEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867
Score = 31.8 bits (73), Expect = 0.85
Identities = 28/108 (25%), Positives = 51/108 (47%), Gaps = 9/108 (8%)
Query: 90 KEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK---KEKEKIKKKK 146
K + ++++E+ V+VT+ E+ + D KE EK++K+
Sbjct: 771 KGLANLEELEILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEI 830
Query: 147 DKKDKDKIKNK---EKDKEKKVDKEKEK--GKKEEKEKKEEKEFKMKE 189
D+ +K K+ N+ K E+ V+KEKEK + + K EE+ +K
Sbjct: 831 DRIEK-KLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKLEERLAVLKA 877
Score = 29.1 bits (66), Expect = 5.4
Identities = 18/96 (18%), Positives = 38/96 (39%), Gaps = 2/96 (2%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
I+ E E+ + + + +K+ +K++ + EKK+ E
Sbjct: 781 ILSPDPEEPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKLSNE 840
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
K E+ ++EKE K+ E + K + K+ +
Sbjct: 841 GFVAKAPEEVVEKEKE-KLAEYQVK-LAKLEERLAV 874
>gnl|CDD|221957 pfam13175, AAA_15, AAA ATPase domain. This family of domains
contain a P-loop motif that is characteristic of the AAA
superfamily.
Length = 320
Score = 36.6 bits (85), Expect = 0.020
Identities = 20/122 (16%), Positives = 40/122 (32%), Gaps = 10/122 (8%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
K+ +++ KK K +K K + N EK+ + K+
Sbjct: 130 KIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLEKNIDDKLS---- 185
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
E+ + ++ + E K E + + K K D L K ++ K +
Sbjct: 186 NIFNEKLSNLKNEKISINEK-----IKSILNEINPDS-KFEKKLDLNLIKLEIVKSNSDN 239
Query: 231 ER 232
+
Sbjct: 240 DS 241
Score = 30.1 bits (68), Expect = 2.3
Identities = 18/103 (17%), Positives = 41/103 (39%), Gaps = 1/103 (0%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
+ +KK +K ++ K KKKD I + +K + K + + ++
Sbjct: 121 EYILKKILDKIEKFDEIKGAKKKDFFKSINIYEIDNFIDKTLSKILDDNIVFQDNLEKNI 180
Query: 184 EFKMKEDKEKPVEKIRKEE-KDSEKEKKSKDKDKKLKKEKLKK 225
+ K+ + + ++ E+ +EK K ++ K + K
Sbjct: 181 DDKLSNIFNEKLSNLKNEKISINEKIKSILNEINPDSKFEKKL 223
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 36.2 bits (84), Expect = 0.023
Identities = 19/80 (23%), Positives = 35/80 (43%), Gaps = 1/80 (1%)
Query: 118 LGEIKKDKSIKKDK-KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
L +++ + K + + + K+K+K KKKK KK+ + E V E G +
Sbjct: 162 LNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQV 221
Query: 177 KEKKEEKEFKMKEDKEKPVE 196
E K+ K+ + + E
Sbjct: 222 GEAKKLKKKRSIAPDNEKSE 241
Score = 28.1 bits (63), Expect = 8.3
Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 5/91 (5%)
Query: 149 KDKDKIK---NKEKDKEKKV--DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
++D + +E+ + K ++E+ K KK++K+KK +K E V E
Sbjct: 155 SEEDVVPLNPTEEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTEL 214
Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
S + + K K K+ +K + S
Sbjct: 215 SSGAGQVGEAKKLKKKRSIAPDNEKSEVYKS 245
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 36.3 bits (84), Expect = 0.026
Identities = 27/126 (21%), Positives = 54/126 (42%), Gaps = 8/126 (6%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE----KKVDKEKEKGKKEE 176
I + K K K+ + ++ + K + ++ N EKDKE +K E K +++
Sbjct: 74 INQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQ 133
Query: 177 KEKKEEKEFKMKEDKEK----PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+ E K +E+ + E +K E+ E E++ ++ L+ K+ E++R
Sbjct: 134 LQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQR 193
Query: 233 SSHEKE 238
S
Sbjct: 194 ESKWAI 199
Score = 34.8 bits (80), Expect = 0.091
Identities = 28/149 (18%), Positives = 72/149 (48%), Gaps = 7/149 (4%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE--KEKG 172
ED I + + K+ + KKK+ + ++++ N++K+++ +++ + +
Sbjct: 34 QEDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFEL 93
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
+++ K E + +K+K +E + KE E E +K+ ++ + ++K++ K +E+
Sbjct: 94 ALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEER 153
Query: 231 ERSSHEKEVIPKLTF---KFGTDMEEKTK 256
+ +EK++ L KF + E
Sbjct: 154 LKFENEKKLEESLELEREKFEEQLHEANL 182
Score = 33.3 bits (76), Expect = 0.24
Identities = 25/129 (19%), Positives = 56/129 (43%), Gaps = 10/129 (7%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKK-----DKDKIKNKEKDKEKKVDKEKEKGKK 174
E K + + K + E + +KDK+ + +KE K+ + E + K+
Sbjct: 86 EQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKR 145
Query: 175 EEKEKKE----EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK-KE 229
E + +E E E K++E E EK ++ ++ + + K+ +++ + + KK K
Sbjct: 146 ENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRESKWAILKKLKR 205
Query: 230 KERSSHEKE 238
+ ++
Sbjct: 206 RAELGSQQV 214
Score = 30.6 bits (69), Expect = 1.8
Identities = 21/92 (22%), Positives = 44/92 (47%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
E L + + S + K+ E + K++ +K ++ EK E+ ++ E+EK +++
Sbjct: 117 ELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQ 176
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
E + EFK E++ + I K+ K +
Sbjct: 177 LHEANLDLEFKENEEQRESKWAILKKLKRRAE 208
>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
Length = 673
Score = 36.8 bits (85), Expect = 0.027
Identities = 14/66 (21%), Positives = 37/66 (56%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
K+ + +I+ K ++ ++D EKG+ ++ K+ K++ ++ + +EKI K + ++
Sbjct: 362 KEAQAEIEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKIDKAQAALKQ 421
Query: 208 EKKSKD 213
+ + D
Sbjct: 422 AQSAFD 427
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 35.1 bits (81), Expect = 0.027
Identities = 15/55 (27%), Positives = 26/55 (47%)
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
E+ E+ ++E E + +SK+K K+ + +K+K K K PKL
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPKL 153
Score = 31.3 bits (71), Expect = 0.48
Identities = 19/86 (22%), Positives = 32/86 (37%), Gaps = 16/86 (18%)
Query: 102 VTPMV----------PNIVKV------TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKK 145
V PMV P + K+ + E+ ++ + + + K + K KKK
Sbjct: 67 VGPMVQQYGPLVRNLPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKK 126
Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEK 171
++ K K K + KK K K
Sbjct: 127 REVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 31.3 bits (71), Expect = 0.51
Identities = 18/61 (29%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
+++ + E+E + E+E E + + KE K++ V K K EK+ K + K K K
Sbjct: 93 SSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKP-KTEKEKPKTEPKKPKPSKP 151
Query: 219 K 219
K
Sbjct: 152 K 152
Score = 30.1 bits (68), Expect = 1.3
Identities = 15/58 (25%), Positives = 30/58 (51%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
++++E + + E +++ E K E + K K + KP + K + + +K K SK K
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152
Score = 28.2 bits (63), Expect = 5.8
Identities = 15/77 (19%), Positives = 29/77 (37%), Gaps = 11/77 (14%)
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 256
KI +E S+ E++ +++ + E+ + + E +K +PK
Sbjct: 86 KIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPK-----------PKT 134
Query: 257 RESSPKIVIKPVKSPSP 273
+ PK K K P
Sbjct: 135 EKEKPKTEPKKPKPSKP 151
>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A. This protein is encoded
by an open reading frame in plasmid borne DNA repeats of
Borrelia species. This protein is known as ORF-A. The
function of this putative protein is unknown.
Length = 285
Score = 36.2 bits (84), Expect = 0.028
Identities = 40/143 (27%), Positives = 60/143 (41%), Gaps = 5/143 (3%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
KK K+ KKK + I+K K KK K K K+ K + K EE + K+ K
Sbjct: 141 KKIKEESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKN 200
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK-----EKERSSHEKEVIPK 242
K+ K + K+++ +K K K +K +K K K K EK + ++I K
Sbjct: 201 KQKTLKNILYNLKDKELEKKGYKKKQIEKFFQKIYEKYKNKPHFIIEKNKYKDLNKIIGK 260
Query: 243 LTFKFGTDMEEKTKRESSPKIVI 265
F + K + KI I
Sbjct: 261 FKKSFKKKKKNSKKNYENIKINI 283
Score = 34.7 bits (80), Expect = 0.081
Identities = 36/143 (25%), Positives = 59/143 (41%), Gaps = 29/143 (20%)
Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK---------------------V 165
+ KK + + K+KK ++ K ++ K K K
Sbjct: 85 LNYQKKECYEIINKYFKEKKIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIK 144
Query: 166 DKEKEKGKKEEKEKKEEKEFK--------MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
++ K+K K ++ K +K FK K+ K K ++ I+ EE K KK K+K K
Sbjct: 145 EESKKKNSKNIEKLKIKKYFKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKT 204
Query: 218 LKKEKLKKKKKEKERSSHEKEVI 240
LK K KE E+ ++K+ I
Sbjct: 205 LKNILYNLKDKELEKKGYKKKQI 227
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 36.5 bits (84), Expect = 0.033
Identities = 30/135 (22%), Positives = 52/135 (38%), Gaps = 12/135 (8%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
K + K+ K KEKG ++ E K E E E+ K R+E+ + K K
Sbjct: 792 KAAARAKQ--KPKEKGPNDK-EIKIESPSVETE-GERCTIKQREEKGIDAPAILNVKKKK 847
Query: 217 KLKKEKLKKKKKEKERS---SHEKEVIPKLTFKFGTDMEEKTKRESSPKIV-----IKPV 268
K + + + ++ R + +P FK G + + + ++ +K V
Sbjct: 848 PYKVDNMTRILPQQSRYISFIKDDRFVPVRKFKGGVVVLRDREPKEPVALIETVRQMKDV 907
Query: 269 KSPSPPPVAAADEYD 283
+P P P D D
Sbjct: 908 NAPLPTPFKVDDNVD 922
Score = 36.1 bits (83), Expect = 0.046
Identities = 19/112 (16%), Positives = 40/112 (35%), Gaps = 5/112 (4%)
Query: 71 DKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKD 130
+ K E +E EY + + ++ KV ++ IK K+
Sbjct: 746 RGSQAIPKFCFNEDLEEEEFEYPR-----MYEEASGKSVRKVNTAVLSTTIKAAARAKQK 800
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
K +KE + + + + ++ +EK +D K++K K +
Sbjct: 801 PKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852
Score = 34.9 bits (80), Expect = 0.082
Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 5/63 (7%)
Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE-EKEFKMKEDK 191
T K + K +K K+K DK E V+ E E+ +++E+K + + K
Sbjct: 790 TIKAAARAK-QKPKEKGPNDKEIKIES---PSVETEGERCTIKQREEKGIDAPAILNVKK 845
Query: 192 EKP 194
+KP
Sbjct: 846 KKP 848
>gnl|CDD|225324 COG2604, COG2604, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 594
Score = 36.2 bits (84), Expect = 0.034
Identities = 44/202 (21%), Positives = 71/202 (35%), Gaps = 19/202 (9%)
Query: 72 KTSPKEKMSSKEKTSPKEKEYSKVKDVELGV--TPMVPNIVKVTSSEDLGEIKKDKS--- 126
+ K + + V V G T V N+ + ED+ KDK
Sbjct: 370 NSHAKGASYGDNYEGDESDKIKVVAYVGGGKVETDSVWNLFREKFEEDI-ANYKDKEFYN 428
Query: 127 -------IKKDKKTHKKKEKEK-IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
IK + K+ EK + K K+K I +EK V K +K KK
Sbjct: 429 CTEGGARIKGTIELPFKEVCEKLLDKDKNKPFIKLILLSNNSQEKAVLKLLQKIKKNNDF 488
Query: 179 KKE-EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS-HE 236
KE E + ++ +EK+ + + EK D K+L E +E + + H
Sbjct: 489 IKEFETDAL---KLQEILEKVDSKSEKLEKISAKIDNIKELFDESKMSILQEILQPALHH 545
Query: 237 KEVIPKLTFKFGTDMEEKTKRE 258
E+ +K +E +
Sbjct: 546 IELNIARIYKLNIKDKEDFLNK 567
Score = 28.9 bits (65), Expect = 6.2
Identities = 29/146 (19%), Positives = 48/146 (32%), Gaps = 16/146 (10%)
Query: 98 VELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE---KEKIKKKKDKKDKDKI 154
EL V NI+ + +DL S K E +KIK K+
Sbjct: 345 YELAVYLGFKNIIFI--GQDLAYTNDGNSHAKGASYGDNYEGDESDKIKVVAYV-GGGKV 401
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEK----------KEEKEFKMKEDKEKPVEKIRKEEKD 204
+ + E++ ++KE K E KE EK ++K + +
Sbjct: 402 ETDSVWNLFREKFEEDIANYKDKEFYNCTEGGARIKGTIELPFKEVCEKLLDKDKNKPFI 461
Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEK 230
+ ++K + K K KK
Sbjct: 462 KLILLSNNSQEKAVLKLLQKIKKNND 487
>gnl|CDD|222406 pfam13831, PHD_2, PHD-finger. PHD folds into an interleaved type
of Zn-finger chelating 2 Zn ions in a similar manner to
that of the RING and FYVE domains. Several PHD fingers
have been identified as binding modules of methylated
histone H3.
Length = 33
Score = 31.8 bits (73), Expect = 0.035
Identities = 11/33 (33%), Positives = 14/33 (42%), Gaps = 1/33 (3%)
Query: 305 PMIGCDGCDAWYHWVCVGLVAEPETSDWFCPKC 337
P++ C C H C G V E W C +C
Sbjct: 2 PLLVCSKCSVAVHASCYG-VPELSDGPWLCRRC 33
>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
secretion].
Length = 407
Score = 35.8 bits (83), Expect = 0.036
Identities = 21/97 (21%), Positives = 42/97 (43%), Gaps = 3/97 (3%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP--- 194
E K + ++ K KD KD +K +E++ KK E+ E K +E K+
Sbjct: 60 ELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSGEEI 119
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E +R E+ + ++ + L K++++ +
Sbjct: 120 AEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNL 156
Score = 29.6 bits (67), Expect = 3.7
Identities = 24/119 (20%), Positives = 50/119 (42%), Gaps = 5/119 (4%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
V+VT + + ++ K K E + +K +KK +++ K+ ++ E
Sbjct: 48 AVEVTKLAAVDSELRKFQTREAAKV-KDAEAQLLKDPAEKKREERKAAKKIERSTPSLIE 106
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKP--VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
++ +E K+ EE M+++K +E+I E K++ L E L+
Sbjct: 107 RKT--QEVKDSGEEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQSPTRLNLINELLRA 163
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 36.2 bits (84), Expect = 0.036
Identities = 24/123 (19%), Positives = 61/123 (49%), Gaps = 7/123 (5%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKD------KEKKVDKEKEKGKKEEKEKKEEKE 184
K+ KE E+I ++ K+ ++K++ + +E+ +++ + ++EE +++EE+
Sbjct: 27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86
Query: 185 FKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
+ +E + EK+ E + E+EK ++ +L++ + + + + E KL
Sbjct: 87 VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKL 146
Query: 244 TFK 246
K
Sbjct: 147 LLK 149
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 35.8 bits (83), Expect = 0.037
Identities = 16/89 (17%), Positives = 41/89 (46%), Gaps = 2/89 (2%)
Query: 170 EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
E G+ E K++K++ K K K+K + + + D + ++++ + ++ KKKK+
Sbjct: 4 ENGENEVKQQKQQN--KQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKK 61
Query: 230 KERSSHEKEVIPKLTFKFGTDMEEKTKRE 258
K++ +K + + + +
Sbjct: 62 KKKKKKKKNLGEAYDLAYDLPVVWSSAAF 90
Score = 35.8 bits (83), Expect = 0.040
Identities = 18/86 (20%), Positives = 40/86 (46%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
+ + K K++ K+K KKK+KK K + + + + + + +E K+ +KK++K+ K
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKD 213
K+ + + +D
Sbjct: 67 KKKNLGEAYDLAYDLPVVWSSAAFQD 92
Score = 34.7 bits (80), Expect = 0.095
Identities = 29/161 (18%), Positives = 62/161 (38%), Gaps = 16/161 (9%)
Query: 71 DKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE-IKKDKSIKK 129
+ ++K +K+K + K+ + SK + L E I +++ +
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDD--------------DAFLAELISENQEAEN 52
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
+ KKK+K+K KKKK+ + + + ++ E+E+K +
Sbjct: 53 KQNNKKKKKKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWP-EQEWKQTQ 111
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
PV K K+ + E + + + +K++ EK
Sbjct: 112 PPTIPVSKQFKDGEYPVGEIQEYPGENSSRTSSEEKRELEK 152
Score = 34.3 bits (79), Expect = 0.12
Identities = 17/88 (19%), Positives = 39/88 (44%), Gaps = 3/88 (3%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDK---IKNKEKDKEKKVDKEKEKGKKEEKE 178
+ ++ K +K K++ K K KK KKD D + + ++ + ++ KK++K+
Sbjct: 4 ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
KK++K+ + + +
Sbjct: 64 KKKKKKNLGEAYDLAYDLPVVWSSAAFQ 91
Score = 33.1 bits (76), Expect = 0.28
Identities = 17/89 (19%), Positives = 40/89 (44%), Gaps = 3/89 (3%)
Query: 150 DKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE---EKEFKMKEDKEKPVEKIRKEEKDSE 206
+ + + K+++K+K K+ +K KK+ + E + +E + K K +K++K +
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66
Query: 207 KEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
K+K + + + +SH
Sbjct: 67 KKKNLGEAYDLAYDLPVVWSSAAFQDNSH 95
Score = 31.6 bits (72), Expect = 0.79
Identities = 17/94 (18%), Positives = 39/94 (41%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
+ +K+ K+ +K+K +K KK K D + + + E ++ K++K
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
+KK++K+ K + + + + S
Sbjct: 61 KKKKKKKKKNLGEAYDLAYDLPVVWSSAAFQDNS 94
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 34.3 bits (79), Expect = 0.039
Identities = 24/68 (35%), Positives = 41/68 (60%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
KK K +K K K++K+K KKKK KK K K +EK+ K ++ ++ ++ E++K+E
Sbjct: 75 KKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQE 134
Query: 182 EKEFKMKE 189
E M++
Sbjct: 135 EPVEIMEK 142
Score = 30.4 bits (69), Expect = 0.76
Identities = 28/74 (37%), Positives = 45/74 (60%), Gaps = 1/74 (1%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+ +++K K ++KT KK+ K + KK+K KK K K K K +EK+ K E+ EE+E
Sbjct: 70 QKREEKKRKDEEKTAKKRAKRQKKKQK-KKKKKKAKKGNKKEEKEGSKSSEESSDEEEEG 128
Query: 180 KEEKEFKMKEDKEK 193
+E+K+ + E EK
Sbjct: 129 EEDKQEEPVEIMEK 142
Score = 30.4 bits (69), Expect = 0.79
Identities = 25/80 (31%), Positives = 51/80 (63%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
+D+ ++ ++ K+K++EK KK+ K+ K K K K+K K KK +K++EK + E+
Sbjct: 63 TEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESS 122
Query: 182 EKEFKMKEDKEKPVEKIRKE 201
++E + +EDK++ +I ++
Sbjct: 123 DEEEEGEEDKQEEPVEIMEK 142
Score = 29.6 bits (67), Expect = 1.6
Identities = 21/64 (32%), Positives = 38/64 (59%)
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
E ++K +KE + +E ++K EK RK+E+ + K++ + K K+ KK+K K KK K+
Sbjct: 53 ELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEK 112
Query: 235 HEKE 238
+
Sbjct: 113 EGSK 116
Score = 29.2 bits (66), Expect = 2.3
Identities = 25/83 (30%), Positives = 48/83 (57%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
+K KK + +E ++ +++K +KD++K K ++KK K+K+K K ++ KKEEKE
Sbjct: 57 EKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116
Query: 188 KEDKEKPVEKIRKEEKDSEKEKK 210
++ E+ +E+K E +
Sbjct: 117 SSEESSDEEEEGEEDKQEEPVEI 139
Score = 27.7 bits (62), Expect = 7.2
Identities = 23/83 (27%), Positives = 47/83 (56%)
Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
EK K++ D+E ++ ++E+K K EEK K + ++K +K +K++K + KK + + K
Sbjct: 57 EKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSK 116
Query: 218 LKKEKLKKKKKEKERSSHEKEVI 240
+E ++++ +E E I
Sbjct: 117 SSEESSDEEEEGEEDKQEEPVEI 139
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as
a single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through
DAZ repeats.
Length = 136
Score = 34.1 bits (78), Expect = 0.042
Identities = 19/60 (31%), Positives = 26/60 (43%), Gaps = 10/60 (16%)
Query: 11 VVRQSAPMFPFQFSPFP--SVAGP-------GLIPRLPPMLQHPLIPQPLVHPPGTPPPS 61
VV A M P +P+P S+ P G PPM +P+ P V+PPG+
Sbjct: 17 VVPPQAQM-PQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPPGSTVLV 75
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 35.5 bits (82), Expect = 0.047
Identities = 26/123 (21%), Positives = 37/123 (30%), Gaps = 17/123 (13%)
Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKD------KIKNKEKDKEKKVDKEKEKGKKEEKEK 179
S+ +K KK +K K + D+D +++ EK K K KE + K
Sbjct: 7 SLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPK 66
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
K K EEK K K + K + E E
Sbjct: 67 --------KSSKPTESSAASSEEK---PAKPRKSAESTRSSHPKSKAPSTESEEEEEPEE 115
Query: 240 IPK 242
P
Sbjct: 116 TPD 118
Score = 29.4 bits (66), Expect = 3.8
Identities = 14/77 (18%), Positives = 24/77 (31%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
K K +K K+ K K E +EK K K + + K
Sbjct: 46 KAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPST 105
Query: 195 VEKIRKEEKDSEKEKKS 211
+ +E +++ S
Sbjct: 106 ESEEEEEPEETPDPIAS 122
Score = 29.4 bits (66), Expect = 4.2
Identities = 16/82 (19%), Positives = 30/82 (36%), Gaps = 3/82 (3%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKK--EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
E L E+++ + K KK + + KK K + + E+ K +
Sbjct: 36 EFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRS 95
Query: 174 KEEKEKKEE-KEFKMKEDKEKP 194
K K + + +E +E P
Sbjct: 96 SHPKSKAPSTESEEEEEPEETP 117
>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
type) [DNA replication, recombination, and repair].
Length = 1444
Score = 35.8 bits (83), Expect = 0.047
Identities = 27/141 (19%), Positives = 58/141 (41%), Gaps = 9/141 (6%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
KS+ K K I++ + + D KNK + +KK+ + G + + E
Sbjct: 113 KSLLNKLKLKVKGNNILIEQVLNNPEFDHFKNKSPELQKKL---QSFGFPQLLIEF-EVN 168
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE-RSSHEKEVIPKL 243
+E + + E+ EE + +++ + +KKLK E K +K + +++
Sbjct: 169 DISEEQEFEKFEEAINEE-VEKAAQEALEAEKKLKAESPKVEKPKPLFDGQKGRKIKSTE 227
Query: 244 TFKFGTDMEEKTKRESSPKIV 264
K + + + E+ K+
Sbjct: 228 EIK---PLIKINEEETRVKVE 245
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 33.8 bits (78), Expect = 0.050
Identities = 20/73 (27%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE-EKEKK 180
++ K+ KEK K +KK+++K + K + E+K + E EK +KE E+ +
Sbjct: 4 EEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEA 63
Query: 181 EEKEFKMKEDKEK 193
E ++K + +K
Sbjct: 64 ELARRELKAEAKK 76
Score = 29.9 bits (68), Expect = 1.2
Identities = 21/86 (24%), Positives = 42/86 (48%), Gaps = 3/86 (3%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
E E+ K DK+ I ++ EKK +++K + +K K EEK + + EK ++
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKA---EYELEKLEKE 57
Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKL 223
+ + E + + + + K L ++ L
Sbjct: 58 LEELEAELARRELKAEAKKMLSEKGL 83
Score = 28.4 bits (64), Expect = 3.4
Identities = 18/78 (23%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
+ + ++ +K+VDK K K + ++K+EEK+ + ++ + E+ + E + + EK+ +
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELE-KLEKELE 59
Query: 213 DKDKKLKKEKLKKKKKEK 230
+ + +L + +LK + K+
Sbjct: 60 ELEAELARRELKAEAKKM 77
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.8 bits (83), Expect = 0.053
Identities = 20/129 (15%), Positives = 61/129 (47%), Gaps = 3/129 (2%)
Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
E L E++ + S +++ +KE + + + ++ K +E+ + E+ +
Sbjct: 264 ELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEA 323
Query: 173 KKEEKEKKEE---KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
+ EE E K + +E E+K + +++ + + +E +++ ++ + + E+L+++ +
Sbjct: 324 QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLET 383
Query: 230 KERSSHEKE 238
+ E
Sbjct: 384 LRSKVAQLE 392
Score = 34.6 bits (80), Expect = 0.13
Identities = 25/131 (19%), Positives = 67/131 (51%), Gaps = 7/131 (5%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKI-------KKKKDKKDKDKIKNKEKDKEKKVDK 167
+ ++ +++ K I +++ + +++ E++ + K D+ ++ + +EK +E K +
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
E + + EE E + E+ E+ E+ +E +R + E + S + + + + +L++ +
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413
Query: 228 KEKERSSHEKE 238
+ER E E
Sbjct: 414 DRRERLQQEIE 424
Score = 30.4 bits (69), Expect = 2.6
Identities = 22/125 (17%), Positives = 66/125 (52%), Gaps = 11/125 (8%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEK-IKKKKDKKDKDKIKNKEKDKEKKVDK-EKEKGKKE 175
L E+++ +K ++ +K E+ K +K + + + + + ++ +++++ ++E + E
Sbjct: 195 LNELER--QLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAE 252
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK--KEKLKKKKKEKERS 233
E+ ++ E + E+K +E++R E SE E++ ++ K+L ++ + +++K+
Sbjct: 253 EELEELTAELQELEEK---LEELRLE--VSELEEEIEELQKELYALANEISRLEQQKQIL 307
Query: 234 SHEKE 238
Sbjct: 308 RERLA 312
Score = 28.5 bits (64), Expect = 8.9
Identities = 17/109 (15%), Positives = 57/109 (52%), Gaps = 4/109 (3%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
++ + + + +K ++ K+ +K + ++ E + + + +EE E+ +E+ + +E+ E+
Sbjct: 200 RQLKSLERQAEKAERYKE-LKAELRELELALLVLRLEELREELEELQEELKEAEEELEEL 258
Query: 195 VEKIRKEEKDSE---KEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
++++ E+ E E +++ + +++L E R +K+++
Sbjct: 259 TAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQIL 307
>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
[Posttranslational modification, protein turnover,
chaperones].
Length = 320
Score = 35.1 bits (80), Expect = 0.055
Identities = 21/110 (19%), Positives = 43/110 (39%), Gaps = 3/110 (2%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
DK K+ E+ K+ +K + + D + + + KG EE+E
Sbjct: 93 EIAVLDKLAKELLL---GLSDEQGKQLIAEKPAFQDVEGKFDARAYLKRLRIKGLTEEQE 149
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
+ ++ + + K V +++ + E K+K+ LKK + K
Sbjct: 150 AEALRDKLQNKQQGKKVTEVQARHILVKAEAKAKEALALLKKGVREAKAD 199
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 35.5 bits (82), Expect = 0.056
Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 121 IKKDKSIKK--DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
+KK S K + K++ K+ K+KKD DK K + D++ + K + E
Sbjct: 183 VKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKTDRDVSTSTAASQKKSSDLE 242
Query: 179 KKEEKE----FKMKEDKEKPV 195
K E + + +K+D +K V
Sbjct: 243 SKLEAQSKELWSLKDDLKKHV 263
Score = 32.8 bits (75), Expect = 0.43
Identities = 22/80 (27%), Positives = 41/80 (51%), Gaps = 4/80 (5%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
KK + K ++ ++ K+ K+ K KK+ ++K K K D++ ++K S+
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASG--DDKSKKAKTDRDVSTSTAASQKKSSDL 241
Query: 208 EKKSKDKDKKL--KKEKLKK 225
E K + + K+L K+ LKK
Sbjct: 242 ESKLEAQSKELWSLKDDLKK 261
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 35.3 bits (82), Expect = 0.060
Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 180 KEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
KEE +++E KEK P +K E+ +++K K K KK K++ KK +K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 33.0 bits (76), Expect = 0.33
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 186 KMKEDKEKPVEKIR----KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
++KE+ K +E+I+ K K +EKK + + KK K++K KK+K+K R
Sbjct: 363 ELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.9 bits (73), Expect = 0.61
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 4/45 (8%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 164
EIK+ K K KK+E++K +K+K KK + K K K K +K
Sbjct: 374 EIKE----KYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.9 bits (73), Expect = 0.75
Identities = 19/46 (41%), Positives = 28/46 (60%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
K E+IK+K K K K + K+ K KK K K+KGKK +K+ ++
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 31.5 bits (72), Expect = 0.88
Identities = 20/65 (30%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 110 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK 169
V S +G+ K++ K+ ++ +K K KK+++KK + + K K++ K+ K K K
Sbjct: 352 VDYFSGRYIGDELKEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGK--KRK 409
Query: 170 EKGKK 174
+KG+K
Sbjct: 410 KKGRK 414
Score = 30.3 bits (69), Expect = 2.1
Identities = 16/50 (32%), Positives = 33/50 (66%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
K++ ++I++ K+K K K +E+ K +K K+K++ KK +K KK+ ++
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 28.8 bits (65), Expect = 6.6
Identities = 16/49 (32%), Positives = 32/49 (65%)
Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
++ K++++ KEK K K+K+EEK+ + ++ K+K +K +K +K K
Sbjct: 366 EELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 35.0 bits (81), Expect = 0.061
Identities = 22/91 (24%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 139 KEKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
EK+ K+ + K ++ + E ++ + K K EK +++EK + E + +E+
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNE---EERELKKKFTEKIREKEKRLEELE--QNLIEE 343
Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
++ E E + K +D + + EKLK K
Sbjct: 344 RKELNSKLE-EIQKKLEDLEKRLEKLKSNKS 373
Score = 34.6 bits (80), Expect = 0.092
Identities = 23/93 (24%), Positives = 40/93 (43%), Gaps = 13/93 (13%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
K+ K+ H+ + E ++ + K+K EK +KEK + E+
Sbjct: 293 SGLKNSGEPSLKEIHEARLNE-----------EERELKKKFTEKIREKEKRLEELEQNLI 341
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
+E KE + E+ +K+ EK EK K +K
Sbjct: 342 EERKE--LNSKLEEIQKKLEDLEKRLEKLKSNK 372
Score = 31.1 bits (71), Expect = 1.2
Identities = 17/68 (25%), Positives = 34/68 (50%)
Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
EK G K E ++ + + ++E+ K + EK EKEK+ ++ ++ L +E+ +
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348
Query: 227 KKEKERSS 234
K +E
Sbjct: 349 SKLEEIQK 356
Score = 30.8 bits (70), Expect = 1.4
Identities = 18/70 (25%), Positives = 37/70 (52%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+ + + ++++ KKK EKI++K+ + ++ + E+ KE E+ + K E+ EK
Sbjct: 304 KEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEK 363
Query: 180 KEEKEFKMKE 189
+ EK K
Sbjct: 364 RLEKLKSNKS 373
Score = 30.0 bits (68), Expect = 2.7
Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 5/85 (5%)
Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK-EDKEKPVEKIR-KEEKDSEKEKKSKDKD 215
E + +K+ K G+ KE E + + + E K+K EKIR KE++ E E+ ++
Sbjct: 285 ENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEER 344
Query: 216 KKLK---KEKLKKKKKEKERSSHEK 237
K+L +E KK + ++R K
Sbjct: 345 KELNSKLEEIQKKLEDLEKRLEKLK 369
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 35.5 bits (82), Expect = 0.062
Identities = 20/107 (18%), Positives = 42/107 (39%), Gaps = 3/107 (2%)
Query: 127 IKKDKKTHKKKEKEKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE-KE 184
I D + H+ +++ + + D K ++ N + E + + + EE K + E
Sbjct: 835 IIADHQGHQAIIEQRTQDSGIRNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSE 894
Query: 185 FKMKEDKE-KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
+ E EEK +++ D ++L K + + K K
Sbjct: 895 LQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941
Score = 33.6 bits (77), Expect = 0.28
Identities = 15/104 (14%), Positives = 37/104 (35%), Gaps = 6/104 (5%)
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
+ + +D I+ D K H+ + ++ I+ +E + + + + K E
Sbjct: 845 IIEQRTQDSGIRNDVK-HQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEA 903
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
+ + + + P + E K++K+ K K
Sbjct: 904 LSQNNKYNEEKSAQERMP-----GADSPEELMKRAKEYQDKHKG 942
>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
[Translation, ribosomal structure and biogenesis].
Length = 335
Score = 34.9 bits (81), Expect = 0.062
Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 4/90 (4%)
Query: 135 KKKEKEKIKKKKDKKDK-DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
KK I+ D +++K K K+ + +K K + E KE ++ K
Sbjct: 10 AKKGIAAIELASDLLKALEELKVKYLGKKGVLTDLLKKLGKLSPLE-ERKEVGALINELK 68
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
+++ + E ++ ++L EK+
Sbjct: 69 --KEVEDAITELTPELEAAGLWERLAFEKI 96
Score = 29.9 bits (68), Expect = 2.8
Identities = 17/87 (19%), Positives = 30/87 (34%), Gaps = 2/87 (2%)
Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
KK + + K E+ K K GKK +K K+ +E+ E
Sbjct: 10 AKKGIAAIELASDLLKALEEL--KVKYLGKKGVLTDLLKKLGKLSPLEERKEVGALINEL 67
Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEK 230
E E + +L+ L ++ +
Sbjct: 68 KKEVEDAITELTPELEAAGLWERLAFE 94
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 33.7 bits (78), Expect = 0.067
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV 165
DKK K EK + K +K K +K K K+++ + ++
Sbjct: 64 DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPRL 99
Score = 28.3 bits (64), Expect = 5.1
Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+KE K+ +K +K K EK K K ++K+ E +
Sbjct: 64 DKKELKAWEKAEK-KAEKAKAKAEKKKAKKEEPK 96
Score = 28.3 bits (64), Expect = 5.7
Identities = 11/33 (33%), Positives = 16/33 (48%)
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
+KE E+ EKK EK E K+ E+ +
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 27.5 bits (62), Expect = 9.5
Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 11/75 (14%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEE-KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
K K K E ++ E E K+ KE + + + + K K EK K +K+K
Sbjct: 34 KKKGKKGELEITDLNE----EYKDLKESLEAALLDKKELK---AWEKAEK---KAEKAKA 83
Query: 214 KDKKLKKEKLKKKKK 228
K +K K +K + K +
Sbjct: 84 KAEKKKAKKEEPKPR 98
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 34.3 bits (79), Expect = 0.070
Identities = 23/80 (28%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 101 GVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 160
G ++ S + E + ++K+ + ++++KEK KKK+ KK+K ++KD
Sbjct: 117 GYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKK----EKKD 172
Query: 161 KEKKVDKEKEKGKKEEKEKK 180
K++K+ + K KK++K+KK
Sbjct: 173 KKEKMVEPKGSKKKKKKKKK 192
Score = 34.3 bits (79), Expect = 0.071
Identities = 23/63 (36%), Positives = 36/63 (57%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
+ + EK+ + EK + E+E+K+EK+ K + KEK +K +KE+ K K K K K
Sbjct: 131 ESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190
Query: 217 KLK 219
K K
Sbjct: 191 KKK 193
Score = 33.5 bits (77), Expect = 0.12
Identities = 21/73 (28%), Positives = 42/73 (57%)
Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
D + + ++E K EKE +++E+++K +K ++ +K+ +++K K+K +
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVE 179
Query: 218 LKKEKLKKKKKEK 230
K K KKKKK+K
Sbjct: 180 PKGSKKKKKKKKK 192
Score = 31.6 bits (72), Expect = 0.45
Identities = 21/71 (29%), Positives = 39/71 (54%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
D + + EKE K + + + ++ + KEK K+K+V KEK++ K ++++ E
Sbjct: 121 PDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEP 180
Query: 183 KEFKMKEDKEK 193
K K K+ K+K
Sbjct: 181 KGSKKKKKKKK 191
Score = 30.1 bits (68), Expect = 1.5
Identities = 21/75 (28%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+ D + + +K K +++ + ++++K K+K K KE KEKK K+K++
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEK-KEKKKKKEVKKEKKEKKDKKEKM 177
Query: 174 KEEKEKKEEKEFKMK 188
E K K++K+ K K
Sbjct: 178 VEPKGSKKKKKKKKK 192
Score = 29.7 bits (67), Expect = 2.1
Identities = 21/59 (35%), Positives = 39/59 (66%)
Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
+ E +++K+ +++++K KKK+KE K+KK+KKDK + + K +KK K+K+K
Sbjct: 135 SEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 29.3 bits (66), Expect = 3.2
Identities = 21/66 (31%), Positives = 39/66 (59%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
+ + + +K+ KE + E++ KEK+K K+ +KEKKE+K+ K K + K +K
Sbjct: 126 SELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKK 185
Query: 198 IRKEEK 203
+K++K
Sbjct: 186 KKKKKK 191
>gnl|CDD|215591 PLN03124, PLN03124, poly [ADP-ribose] polymerase; Provisional.
Length = 643
Score = 35.2 bits (81), Expect = 0.072
Identities = 19/111 (17%), Positives = 49/111 (44%), Gaps = 2/111 (1%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKE--KDKEKKVDKEKEKGKKEEKEKKE 181
D +I +D KT K + +K ++++ + E K +D+ K +E +E
Sbjct: 33 DDAIAEDAKTASKSGTKSSAGRKKRRERQDDGDDEPVSPKRIAIDEVKGMTVRELREAAS 92
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
E+ K+ +E++ + K + + K++ ++++++E+
Sbjct: 93 ERGLATTGRKKDLLERLCAALESDVKVGSANGTGEDEKEKGGDEEREKEEK 143
Score = 31.0 bits (70), Expect = 1.4
Identities = 27/153 (17%), Positives = 56/153 (36%), Gaps = 7/153 (4%)
Query: 79 MSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKE 138
M++K K +K G+ + + +ED K + +KK
Sbjct: 1 MANKLKVDELRAALAKRGLDTTGLKAALVRRLDDAIAEDAKTASKSGT---KSSAGRKKR 57
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE---KKEEKEFKMKEDKEKPV 195
+E+ D+ K ++ K V + +E + +K++ ++ E V
Sbjct: 58 RERQDDGDDEPVSPKRIAIDEVKGMTVRELREAASERGLATTGRKKDLLERLCAALESDV 117
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
K+ E EK+ +++ K+EK+ K
Sbjct: 118 -KVGSANGTGEDEKEKGGDEEREKEEKIVTATK 149
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 33.9 bits (78), Expect = 0.072
Identities = 17/81 (20%), Positives = 37/81 (45%)
Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
+KD E D +E+ + +++ E++ K+ K K + + + E +++ K
Sbjct: 33 KKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVK 92
Query: 218 LKKEKLKKKKKEKERSSHEKE 238
K++L + E+ SS E
Sbjct: 93 ATKKQLFSSEYEQTSSSSEST 113
Score = 32.0 bits (73), Expect = 0.25
Identities = 16/94 (17%), Positives = 39/94 (41%), Gaps = 1/94 (1%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV-DKEKEKGKKEEKE 178
KKD I D + + +++ +++K K+KEK +++++ ++ + E
Sbjct: 31 YEKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNT 90
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
K K+ + E+ ++ K+ S
Sbjct: 91 VKATKKQLFSSEYEQTSSSSESTSEEETKKTSSI 124
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 35.4 bits (82), Expect = 0.074
Identities = 29/133 (21%), Positives = 53/133 (39%), Gaps = 10/133 (7%)
Query: 108 NIVKVTSSEDLGE-IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD 166
+ +K+ + E E I+ KS+ D E + K + + K + N + +
Sbjct: 895 SSLKLVNLELESEIIELKKSLSSDLI-----ENLEFKTELIARLKKLLNNIDL----EEG 945
Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
E K E K E E K+KE E+ + ++K + K+ + K KKE +
Sbjct: 946 PSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELS 1005
Query: 227 KKEKERSSHEKEV 239
K+ K++
Sbjct: 1006 KQYGALQESTKQL 1018
Score = 30.8 bits (70), Expect = 1.6
Identities = 23/185 (12%), Positives = 59/185 (31%), Gaps = 19/185 (10%)
Query: 74 SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIK-KDKSIKKDKK 132
E K + +++K + + ++ +L ++ + +K+ +
Sbjct: 912 KKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSE 971
Query: 133 THK---KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
++ KK +++ + K KE ++ +E K+
Sbjct: 972 EYEDLLKKSTILVREGNKANSELKNFKKEL-----------AELSKQYGALQEST---KQ 1017
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGT 249
KE PVE + S+ + LK + K E + + + +
Sbjct: 1018 LKELPVEVAEL-QSASKIISSESTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENS 1076
Query: 250 DMEEK 254
+++K
Sbjct: 1077 LLDDK 1081
Score = 30.0 bits (68), Expect = 2.9
Identities = 28/142 (19%), Positives = 50/142 (35%), Gaps = 15/142 (10%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
KK+ + + + E++ + K D K +K + E ++ E K + E
Sbjct: 865 KKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEI---IELKKSLSSDLIE 921
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 241
EFK + K D E E S + K + KL + + + + +S E E +
Sbjct: 922 NLEFKTELIAR---LKKLLNNIDLE-EGPSIEYVKLPELNKLHEVESKLKETSEEYEDLL 977
Query: 242 KLTFKFGTDMEEKTKRESSPKI 263
K RE +
Sbjct: 978 K--------KSTILVREGNKAN 991
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 34.1 bits (79), Expect = 0.079
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 3/58 (5%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKN---KEKDKEKKVDKEKEKGKKEEKEKKEEK 183
KD + KKE I K +K D I EK+K+K + KE++K KEEK+K EE
Sbjct: 66 KDFRKILKKEGGAIIKDLEKCDFTPIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|214659 smart00433, TOP2c, TopoisomeraseII. Eukaryotic DNA topoisomerase
II, GyrB, ParE.
Length = 594
Score = 34.8 bits (81), Expect = 0.079
Identities = 23/106 (21%), Positives = 38/106 (35%), Gaps = 20/106 (18%)
Query: 139 KEKIKKKKDKKDKD----------------KIKNKEKD---KEKKVDKEKEKGKKEEKEK 179
E KKKK K+K+ KI + + KEK E G ++ +
Sbjct: 261 NEYAKKKKKLKEKNIKGEDVREGLTAFISVKIPEPQFEGQTKEKLGTSEVRFGVEKIVSE 320
Query: 180 KEEKEFKM-KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
+ + K VEK+ K KK+++ +K K +
Sbjct: 321 CLLSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSSIS 366
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 35.1 bits (80), Expect = 0.082
Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 2/95 (2%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK--MKEDKEKPV 195
++E +K+ + + IK E + K + + K++ E++EEK+ K M +K+K +
Sbjct: 497 DEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKL 556
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
K K ++E+ K KK + K KK +K
Sbjct: 557 YKKMKYSNAKKEEQAENLKKKKKQIAKQKKLDSKK 591
Score = 28.9 bits (64), Expect = 6.1
Identities = 19/92 (20%), Positives = 49/92 (53%), Gaps = 2/92 (2%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE--KDSEKEKKSKDK 214
E+D + + + + E++ +++E + +++ E + I+ E + + KSK+K
Sbjct: 472 SEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNK 531
Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK 246
+K+ +E+ +KK K S+ +K++ K+ +
Sbjct: 532 KRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYS 563
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 34.0 bits (78), Expect = 0.094
Identities = 15/78 (19%), Positives = 36/78 (46%), Gaps = 2/78 (2%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD--KIKNKEKDKEKKVD 166
IV + + E+ ++ + K K + + +K + K ++ +++ K +D
Sbjct: 115 IVVMRVMSIVEEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLD 174
Query: 167 KEKEKGKKEEKEKKEEKE 184
K E ++ +KE ++KE
Sbjct: 175 KYNELREQVQKESSKKKE 192
Score = 32.5 bits (74), Expect = 0.27
Identities = 15/61 (24%), Positives = 29/61 (47%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
+E K K DK + D K + ++ ++ E + +EE K +K +++E +K
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQVQKESS 188
Query: 197 K 197
K
Sbjct: 189 K 189
Score = 28.6 bits (64), Expect = 4.8
Identities = 17/64 (26%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
EE KK K KM+ D ++RK + E +K++++ KL +K + +++ ++ S
Sbjct: 129 EENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQEELFKL-LDKYNELREQVQKES 187
Query: 235 HEKE 238
+K+
Sbjct: 188 SKKK 191
>gnl|CDD|116547 pfam07937, DUF1686, Protein of unknown function (DUF1686). The
members of this family are all hypothetical proteins of
unknown function expressed by the eukaryotic parasite
Encephalitozoon cuniculi GB-M1. The region in question
is approximately 250 amino acids long.
Length = 182
Score = 33.6 bits (77), Expect = 0.096
Identities = 18/73 (24%), Positives = 21/73 (28%), Gaps = 21/73 (28%)
Query: 295 ACGVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF---CPKCSKVDEGSRKEKKKR 351
AC L M G G VCVGLV F S V
Sbjct: 108 AC-AAGSVKLFMPGSYG---RSSLVCVGLVV-------FAALLLLVSYVKGA-------G 149
Query: 352 GRKKKTHQDIIMS 364
G K + ++ S
Sbjct: 150 GEKVSSKGIVVGS 162
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 33.6 bits (77), Expect = 0.11
Identities = 20/52 (38%), Positives = 21/52 (40%), Gaps = 4/52 (7%)
Query: 9 QAVVRQSAPM-FPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPP 59
Q VV Q M P Q S P+ P LP Q P PQPL P P
Sbjct: 65 QPVVPQQPLMPVPGQHSMTPT---QHHQPNLPQPAQQPFQPQPLQPPQPQQP 113
Score = 28.6 bits (64), Expect = 4.5
Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 5/50 (10%)
Query: 11 VVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPP 60
V +Q P Q P P L + P + Q PL+P P TP
Sbjct: 42 VSQQHPPTHTLQ----PHHHIPVLPAQQPVVPQQPLMPVP-GQHSMTPTQ 86
Score = 28.2 bits (63), Expect = 6.5
Identities = 16/55 (29%), Positives = 20/55 (36%), Gaps = 8/55 (14%)
Query: 11 VVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHP----PGTPPPS 61
V+ P+ P Q P + PG P P +PQP P P PP
Sbjct: 60 VLPAQQPVVPQQ----PLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQP 110
Score = 27.4 bits (61), Expect = 9.6
Identities = 12/22 (54%), Positives = 13/22 (59%)
Query: 39 PPMLQHPLIPQPLVHPPGTPPP 60
PP Q P+ PQP VHP PP
Sbjct: 107 PPQPQQPMQPQPPVHPIPPLPP 128
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 34.3 bits (79), Expect = 0.11
Identities = 33/116 (28%), Positives = 57/116 (49%), Gaps = 3/116 (2%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
SE L E D ++ + E + +K + K K K + ++K KE E+
Sbjct: 237 MSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREA 296
Query: 174 KEEKE--KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
KEEK+ KK + ++KE ++ V + K ++++K + + KKLK+ KL K K
Sbjct: 297 KEEKQLKKKLAQLARLKEIAKE-VAQKEKARARKKEQRKERGEKKKLKRRKLGKHK 351
Score = 34.3 bits (79), Expect = 0.11
Identities = 22/94 (23%), Positives = 44/94 (46%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
+++ + +KK +K + + E + + + + ++ + E E E K
Sbjct: 212 KRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINK 271
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
PV RK + KEK+ K+ +++ K+EK KKK
Sbjct: 272 PVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKK 305
Score = 31.6 bits (72), Expect = 0.92
Identities = 21/130 (16%), Positives = 50/130 (38%), Gaps = 17/130 (13%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKK---------DKKDKDKIKNKEKDKEKKVDKEKE 170
E K++K + K + +D ++ +E +KE K +K+++
Sbjct: 155 EQVAKKTVKTAPISLAANGKPVPAVEVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQ 214
Query: 171 KGKKEEKEKKEE--------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
+ ++ E++K E+ E +E + + + +S E + + K +
Sbjct: 215 ELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINKPVR 274
Query: 223 LKKKKKEKER 232
K+K K +
Sbjct: 275 PKRKTKAQRN 284
Score = 30.8 bits (70), Expect = 1.3
Identities = 20/105 (19%), Positives = 43/105 (40%), Gaps = 7/105 (6%)
Query: 136 KKEKEKIKK-------KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
+KE + KK ++ K +K + D+ + E+ EE+ E +
Sbjct: 204 EKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFE 263
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+ E + +R + K + K K + + ++ K +K+ K+K
Sbjct: 264 SEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308
Score = 29.7 bits (67), Expect = 3.6
Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
E + + + +K ++ + + +KEK + + E + KEEK+ K K + +++
Sbjct: 256 ESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREA-KEEKQLKKKLAQLARLKE 314
Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
I KE EK + K + +K + EK K K++
Sbjct: 315 IAKEVAQKEKARARKKEQRKERGEKKKLKRR 345
>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
[Function unknown].
Length = 827
Score = 34.7 bits (80), Expect = 0.12
Identities = 31/128 (24%), Positives = 45/128 (35%), Gaps = 18/128 (14%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
+ K+ K KK EK + K K K K K+ +K+E E E
Sbjct: 251 QELYKERDKILKKLEKA--LNELLNKLLKKSHLKTNKKSGKLT----PSRKKEFEILPEY 304
Query: 184 EFKMKEDKEKPVEKIRKEE-----KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
DK++P +I+K K + K KL E +K E + +
Sbjct: 305 ----VPDKKRPKHRIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIENARKSLYENTPTKSG 360
Query: 239 VIPKLTFK 246
I TFK
Sbjct: 361 FI---TFK 365
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 34.3 bits (79), Expect = 0.12
Identities = 35/197 (17%), Positives = 88/197 (44%), Gaps = 21/197 (10%)
Query: 76 KEKMSSKEKTSPKEKEYSKVKDVELGV---TPMVPNIVKVT----SSEDLGEIKKDKSIK 128
K K+ E+ P+ + V+ ++ V P+ + KV E G ++++
Sbjct: 358 KPKLEKVERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREI 417
Query: 129 KDKKTHKKKEKEKIKKKK--DKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
+ KK +E +++ + + + K +++ +++ EK + + ++ + ++++E +
Sbjct: 418 TVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIR 477
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK 246
++ +I + EK+ E++KK ++ ++ E K +K E +V+ KLT
Sbjct: 478 ARD------RRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLT-- 529
Query: 247 FGTDMEEKTKRESSPKI 263
+E + E I
Sbjct: 530 ----LEAIEEAEEEYGI 542
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 34.3 bits (79), Expect = 0.13
Identities = 14/74 (18%), Positives = 29/74 (39%), Gaps = 5/74 (6%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK-----EEKEFKMKE 189
++ K++ KK++DKIK K++ KE+ + K ++ M
Sbjct: 528 GTNFGKRNSKERYKKNEDKIKEFASALGKEILKEEPTENSSKNILKLRLYYQQNGKCMYT 587
Query: 190 DKEKPVEKIRKEEK 203
KE ++ +
Sbjct: 588 GKEIDIDDLFDLSY 601
Score = 30.5 bits (69), Expect = 2.1
Identities = 17/85 (20%), Positives = 34/85 (40%), Gaps = 1/85 (1%)
Query: 90 KEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKK 149
+ KV + + IV + E+ G ++ K+ K ++ K KE +
Sbjct: 499 LQARKVVNELVKKYGPPDKIVIEMAREEQGTNFGKRNSKERYKKNEDKIKEFASALGKEI 558
Query: 150 DKDKI-KNKEKDKEKKVDKEKEKGK 173
K++ +N K+ K ++ GK
Sbjct: 559 LKEEPTENSSKNILKLRLYYQQNGK 583
Score = 28.9 bits (65), Expect = 6.0
Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 9/103 (8%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE-----KGKKEEKEKKEEKEFKMKE 189
KKK+ K +K + K E + EK K + ++ +K+
Sbjct: 325 KKKKLTYKKLRKLLGLSEDAIFKGLRYEGLDNAEKAFNISLKTYHKLRKALGDKDLLDNP 384
Query: 190 DKEKPVEKIRKE----EKDSEKEKKSKDKDKKLKKEKLKKKKK 228
K +++I K + +K+ + L +E++KK +
Sbjct: 385 KNPKDLDEIVKILTLYKDREMIKKRLELYKDVLNEEQVKKLVR 427
>gnl|CDD|235944 PRK07135, dnaE, DNA polymerase III DnaE; Validated.
Length = 973
Score = 34.3 bits (79), Expect = 0.13
Identities = 35/146 (23%), Positives = 56/146 (38%), Gaps = 35/146 (23%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE--------KGKKEEKEKKEEKEFK 186
K KE+ ++K + KI+N +E+K+ ++ K ++E K F
Sbjct: 123 KNKEQLELKNYYINSNDPKIENAVYVQERKLLFAEDNEYLKILNKIGNNKEENSNFKFFD 182
Query: 187 MKEDKEKPVEKIRKE--------------------EKDSEKEKKSKDKDKKL-------K 219
++ E EKI K + D+ +S KK+ K
Sbjct: 183 FEKWFEDIDEKILKRTNYLVENINIEFPKKEFNLPDFDNNLGLESDLFLKKILKESVINK 242
Query: 220 KEKLKKKKKEKERSSHEKEVIPKLTF 245
K +LK KER ++E VI KL F
Sbjct: 243 KAELKYYPNVKERINYEYSVIKKLKF 268
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 34.2 bits (79), Expect = 0.14
Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 15/93 (16%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
+ + EK++ + + K + E+ EK + K+KE R+++ DSE+
Sbjct: 456 LPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKE---------RQDQPDSEE 506
Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
+ L KL E++S E EV
Sbjct: 507 D------LATLPTLKLGDVPDPIEKTSLETEVS 533
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 33.5 bits (77), Expect = 0.15
Identities = 25/142 (17%), Positives = 60/142 (42%), Gaps = 14/142 (9%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD--KKDKDKIKNKEKDKEKKVD------- 166
+ L + K + + E E ++ + + + +I+ + K E+K+
Sbjct: 31 KALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERE 90
Query: 167 -----KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
E + K+ ++E M+E ++ E +E+ EK + + +L++E
Sbjct: 91 LRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEE 150
Query: 222 KLKKKKKEKERSSHEKEVIPKL 243
+ +++ +E SS +E+ KL
Sbjct: 151 VAEIREEGQELSSKREELKEKL 172
Score = 32.3 bits (74), Expect = 0.35
Identities = 19/122 (15%), Positives = 51/122 (41%), Gaps = 9/122 (7%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD--KDKIKNKEKDKEKKVDKEKEKGKKEEK 177
E + + K KK K E E + K + + + ++N+ E ++ + +E+ K+ E+
Sbjct: 23 EPRIKEIRKALKK--AKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEE 80
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
+ K+ + ++ ++ E E + +++ + +K+ E + E+
Sbjct: 81 KLSAVKDERELRALNIEIQIAKERINSLEDEL-----AELMEEIEKLEKEIEDLKERLER 135
Query: 238 EV 239
Sbjct: 136 LE 137
Score = 32.3 bits (74), Expect = 0.40
Identities = 17/119 (14%), Positives = 51/119 (42%), Gaps = 13/119 (10%)
Query: 127 IKKDKKTHKKK-----------EKEKIKKKKDKKD--KDKIKNKEKDKEKKVDKEKEKGK 173
++KD+ + K E E + K + + + ++N+ E ++ + +E+ K
Sbjct: 17 LEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIK 76
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+ E++ K+ + ++ ++ E E ++ + +++++ K+ ER
Sbjct: 77 RAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLER 135
Score = 30.4 bits (69), Expect = 1.6
Identities = 25/115 (21%), Positives = 51/115 (44%), Gaps = 2/115 (1%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKVDKEKEKGKKEEKEKKEEK 183
KS+ +K +K++ + + K+ +K K K E + E E E + + E
Sbjct: 7 KSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLES 66
Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
E + ++ K E+ KD E+E ++ + + ++ KE++ + E E E
Sbjct: 67 EIQEIRERIKRAEEKLSAVKD-ERELRALNIEIQIAKERINSLEDELAELMEEIE 120
>gnl|CDD|237867 PRK14953, PRK14953, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 486
Score = 34.0 bits (78), Expect = 0.15
Identities = 33/118 (27%), Positives = 62/118 (52%), Gaps = 7/118 (5%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE--KEKGKKEEKEKKEEKEF 185
K+ K K++ EK +++K++ D DKI + E K+ K KEE+ K K
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424
Query: 186 KMKED-KEKPVEKIRKE--EKDSEKEKKSKDKDKKLKK--EKLKKKKKEKERSSHEKE 238
K +ED + ++ I+K + E+ KK K+K+K+ + EK+ + K + +++K+
Sbjct: 425 KSEEDTLDLEIKSIKKYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITYKKK 482
Score = 32.5 bits (74), Expect = 0.48
Identities = 28/120 (23%), Positives = 53/120 (44%), Gaps = 2/120 (1%)
Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK-EKKEEKEFKMKEDKEKPVEKIR 199
K K++ K++ + K +EK++ + K E K K ++KE++ K K+
Sbjct: 365 KEGSKQETKEQPEKKEEEKEELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVE 424
Query: 200 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRES 259
K E+D+ + K K + ++ KKEKE+ E + K+ F K++
Sbjct: 425 KSEEDTLDLEIKSIK-KYFPFIEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITYKKKK 483
Score = 30.2 bits (68), Expect = 2.3
Identities = 28/101 (27%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKVDKEKEKGKK---E 175
E+ DK I + K K +K + K+++ KI K EK +E +D E + KK
Sbjct: 385 ELDIDKIILQIIKNEGKIISAILKNAEIKEEEGKITIKVEKSEEDTLDLEIKSIKKYFPF 444
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
+ ++ +KE + ++++ + VEK+ + K S+ K K K
Sbjct: 445 IEFEEVKKEKEKEKERPEFVEKVLELFKGSKIITYKKKKGK 485
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 34.1 bits (78), Expect = 0.16
Identities = 28/152 (18%), Positives = 56/152 (36%), Gaps = 9/152 (5%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
KE K ++ ++K + + + +K + K+ KK K+ E E+
Sbjct: 1170 KEDAKAEEAREKLQRAAARGESGAAKKV----SRQAPKKPAPKKTTKKASESETTEETYG 1225
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE-KEVIPKLTFK----FGTDM 251
E ++ + K K + KK K+KE+E + K+ + M
Sbjct: 1226 SSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKM 1285
Query: 252 EEKTKRESSPKIVIKPVKSPSPPPVAAADEYD 283
EE K + + + S ++ +D+ D
Sbjct: 1286 EETVKAVPARRAAARKKPLASVSVISDSDDDD 1317
Score = 30.2 bits (68), Expect = 2.8
Identities = 40/242 (16%), Positives = 85/242 (35%), Gaps = 19/242 (7%)
Query: 51 LVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVE-LGVTPMVPNI 109
L GT K+++ + +DK + ++ +KT+PK +KD++ L +
Sbjct: 1116 LSMAIGTLTLEKVQELCADRDKLN--IEVEDLKKTTPKSLW---LKDLDALEKELDKLDK 1170
Query: 110 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEK---------D 160
+ E ++++ + + K + K K K +++ +
Sbjct: 1171 EDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAME 1230
Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLK 219
E + K KG+ K+K + +E+ E K R + + +S ++ +K
Sbjct: 1231 TENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVK 1290
Query: 220 KEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKP---VKSPSPPPV 276
++ K+ + + +E K KP K + PP
Sbjct: 1291 AVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPA 1350
Query: 277 AA 278
AA
Sbjct: 1351 AA 1352
>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
subunit A'' fusion protein; Provisional.
Length = 509
Score = 34.0 bits (78), Expect = 0.16
Identities = 24/153 (15%), Positives = 64/153 (41%), Gaps = 8/153 (5%)
Query: 61 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
++ ++ T K++ + S YS + P++ ++++ S E +
Sbjct: 27 FEVPKNITDGYITLKKKEFTYHVIISKIAP-YSNSNGIIKKKKPVLKTVLEIESEEKIEA 85
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-NKEKDKEKKVDKEKEKGKK----- 174
I + + + + + ++K+ + N +D K +D K+ G +
Sbjct: 86 IDLMEFKRLFGRILDENMSFSTGELLTAEEKEYYEENSNEDVLKVIDDVKKLGFRLPPSV 145
Query: 175 EEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSE 206
E+ K K+ ++ +D+ E+ + +IR+E + +
Sbjct: 146 IEEIAKAMKKKELSDDEYEEILRRIREEYERAR 178
Score = 28.6 bits (64), Expect = 7.1
Identities = 32/136 (23%), Positives = 49/136 (36%), Gaps = 20/136 (14%)
Query: 125 KSIKKDKKTHKKKE---KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE----- 176
K+I T KKKE I K + + I K+K K V + + + K E
Sbjct: 31 KNITDGYITLKKKEFTYHVIISKIAPYSNSNGIIKKKKPVLKTVLEIESEEKIEAIDLME 90
Query: 177 ------KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK------EKLK 224
+ E F E ++ +E + + K D K + E++
Sbjct: 91 FKRLFGRILDENMSFSTGELLTAEEKEYYEENSNEDVLKVIDDVKKLGFRLPPSVIEEIA 150
Query: 225 KKKKEKERSSHEKEVI 240
K K+KE S E E I
Sbjct: 151 KAMKKKELSDDEYEEI 166
>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD. This
family consists of several putative bacterial stage IV
sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
is known to be essential for efficient sporulation
although its exact function is unknown.
Length = 383
Score = 33.9 bits (78), Expect = 0.16
Identities = 42/191 (21%), Positives = 75/191 (39%), Gaps = 24/191 (12%)
Query: 60 PSKMKDKVSPKDKTSPKEKMSSK---EKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
P + + P++ + K+ + ++ K + K VK ++ V+ +
Sbjct: 180 PPEELENGEPRNIVAKKDGIITRMYVTKGTAVVKVGDVVKKGDILVSGQIGKEGNEYEVH 239
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
GE+ + + + K E + +K K I K K K E EK KEE
Sbjct: 240 AKGEVLAETWYESTVEVPLKTNFEVLTGEKITKHYLSIGGK-KLPLKFRKNEFEKYDKEE 298
Query: 177 KEKK-----------------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD---KDK 216
EK E ++ +K KE+ VEK +K K +++ KD KD+
Sbjct: 299 DEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVEKGKKLAKKKLEKEIDKDAKIKDE 358
Query: 217 KLKKEKLKKKK 227
K+ E+++ K
Sbjct: 359 KVLHERVENGK 369
Score = 28.9 bits (65), Expect = 6.5
Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 11/91 (12%)
Query: 127 IKKDKKTHKKKEKEKIKKKKDKKDKDKIK------NKEKDKEKKVDKE--KEKGKKEEKE 178
+K K +K +KE+ +K K K I + +DK K+ KE EKGKK K+
Sbjct: 284 LKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVEKGKKLAKK 343
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK 209
K E++ + +D + EK+ E ++ K +
Sbjct: 344 KLEKE---IDKDAKIKDEKVLHERVENGKVR 371
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 34.1 bits (79), Expect = 0.17
Identities = 22/106 (20%), Positives = 49/106 (46%), Gaps = 22/106 (20%)
Query: 144 KKKDKKDKDKIKNKE---KDKEKKVDKEKEKGKKEEKEKKEEKEFK-------------M 187
K K+ K I D+ +++ K+ E+ E+K++KE E K +
Sbjct: 485 KGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSL 544
Query: 188 KEDKEKP----VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
KE+ +K +K+ + + ++E + +DK++ + K ++ +K
Sbjct: 545 KEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEI--EAKTEELQKV 588
Score = 33.7 bits (78), Expect = 0.19
Identities = 31/122 (25%), Positives = 61/122 (50%), Gaps = 13/122 (10%)
Query: 109 IVKVTSSEDLGEIKKDK-SIKKDKKTHKKKEKEKIK-----KKKDKKDKDKIKNKEKDKE 162
I+ V++ +D G K+ K +I + + +K +DKK K++I+ K + +E
Sbjct: 477 ILTVSA-KDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEE 535
Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
EK KEE +K E + K+ E+ +E +++E + +KE+ + K ++L+K
Sbjct: 536 --YVYSLEKSLKEEGDKLPEAD---KKKVEEAIEWLKEELEGEDKEEI-EAKTEELQKVV 589
Query: 223 LK 224
Sbjct: 590 QP 591
Score = 33.0 bits (76), Expect = 0.31
Identities = 24/104 (23%), Positives = 54/104 (51%), Gaps = 3/104 (2%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE-KEKKEEKEFKM 187
KDK T K++K I D+I+ KD E+ ++K++ ++ E K + EE + +
Sbjct: 483 KDKGT-GKEQKITITASSGL-SDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSL 540
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
++ ++ +K+ + +K +E K++ ++K + + K +E
Sbjct: 541 EKSLKEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAKTEE 584
Score = 31.8 bits (73), Expect = 0.85
Identities = 23/104 (22%), Positives = 55/104 (52%), Gaps = 15/104 (14%)
Query: 112 VTSSEDLGEIKKDKSIKKDKKTH----KKKEKEKIK----------KKKDKKDKDKIKNK 157
+T+S L + + ++ +K ++ K+KE+ + K +K K++ DK+
Sbjct: 495 ITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEA 554
Query: 158 EKDK-EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
+K K E+ ++ KE+ + E+KE+ E K ++++ + E++ +
Sbjct: 555 DKKKVEEAIEWLKEELEGEDKEEIEAKTEELQKVVQPIGERMYQ 598
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 32.7 bits (75), Expect = 0.17
Identities = 23/88 (26%), Positives = 52/88 (59%), Gaps = 13/88 (14%)
Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKM----KEDKEKPVEKIRKEE-KDSEKEKKSKDK 214
D E+K ++++ + +++E+++++EKE+ KE++EK +E++ K + K + +D
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDY 72
Query: 215 DKKLKKE--------KLKKKKKEKERSS 234
D++LK++ + +KKKEK
Sbjct: 73 DEELKEQERWDDPMAQFLRKKKEKTDKK 100
Score = 31.1 bits (71), Expect = 0.54
Identities = 22/105 (20%), Positives = 56/105 (53%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
+ +DKS + K++EKE+ K++K++K++ + + + +K+ +++ + ++ K K
Sbjct: 3 VYRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKP 62
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
+ ++ E+ E+ R ++ ++ +K K+K K K K
Sbjct: 63 LARYADDEDYDEELKEQERWDDPMAQFLRKKKEKTDKKGKPLYKG 107
Score = 31.1 bits (71), Expect = 0.56
Identities = 22/93 (23%), Positives = 52/93 (55%), Gaps = 3/93 (3%)
Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
+I ++K+++ + + +EK+++++ +KE KG +++E+++ E K +
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADD 69
Query: 201 EEKDSEKEKKSKDKD---KKLKKEKLKKKKKEK 230
E+ D E +++ + D + L+K+K K KK K
Sbjct: 70 EDYDEELKEQERWDDPMAQFLRKKKEKTDKKGK 102
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 32.6 bits (75), Expect = 0.17
Identities = 26/103 (25%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKE---KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK 191
++ ++ +KK KD ++K+++K K + K + + + + ++ KK+EKE + K
Sbjct: 65 IEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKK 124
Query: 192 EKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKLKKKKKEKER 232
++K ++ +K+ K K+L K EK +KK KE+
Sbjct: 125 PYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEK 167
>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1. MAT1
is an assembly/targeting factor for cyclin-dependent
kinase-activating kinase (CAK), which interacts with the
transcription factor TFIIH. The domain found to the
N-terminal side of this domain is a C3HC4 RING finger.
Length = 200
Score = 33.2 bits (76), Expect = 0.18
Identities = 24/107 (22%), Positives = 60/107 (56%), Gaps = 5/107 (4%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
++ ++K+ K+ I + K + +++ + ++ +E+K KE EK +KE++E+K K
Sbjct: 67 EEKVDQYEKENKDSIMRNKRRLTREQEELEQALEEEKEMKE-EKRLHLQKEEQEQKMAK- 124
Query: 188 KEDKEKPVEKIRKEEKDSEK---EKKSKDKDKKLKKEKLKKKKKEKE 231
++DK++ ++++ + + K + K + + EKL++KK+
Sbjct: 125 EKDKQEIIDELETSNLPANVIIAQHKKQSKQLESQVEKLERKKRVTF 171
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 33.1 bits (75), Expect = 0.18
Identities = 23/91 (25%), Positives = 55/91 (60%), Gaps = 6/91 (6%)
Query: 125 KSIKKDKKTHKKK---EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
+S+KKD KK +E+I+K + ++++ + + + + +E+K+D E+E +++E+E++E
Sbjct: 108 RSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKID-EREDQEEQEREREE 166
Query: 182 EKEFKMKEDKEKPV--EKIRKEEKDSEKEKK 210
+ + +D E + + + E D +K +K
Sbjct: 167 QTIEEQSDDSEHEIIEQDESETESDDDKTEK 197
Score = 33.1 bits (75), Expect = 0.18
Identities = 22/83 (26%), Positives = 49/83 (59%), Gaps = 10/83 (12%)
Query: 138 EKEKIKKKKD----KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
+K+ IKKKK ++ +K + + ++ E++++ E+ + K +E+E +EE+E +E+
Sbjct: 111 KKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQE------RER 164
Query: 194 PVEKIRKEEKDSEKEKKSKDKDK 216
+ I ++ DSE E +D+ +
Sbjct: 165 EEQTIEEQSDDSEHEIIEQDESE 187
Score = 31.2 bits (70), Expect = 0.81
Identities = 19/88 (21%), Positives = 50/88 (56%)
Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 210
+D++++ +KD KK + ++ EK ++E +E + + + E+ EKI + E E+E++
Sbjct: 104 RDQLRSVKKDDIKKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQERE 163
Query: 211 SKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+++ + + + + + E++ S E +
Sbjct: 164 REEQTIEEQSDDSEHEIIEQDESETESD 191
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 33.5 bits (77), Expect = 0.18
Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 8/80 (10%)
Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 210
K+ +K+ + K +DK ++ KK+++E +E+ E + K+ K E+ KK
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAE--------KKRKSKEEIKK 260
Query: 211 SKDKDKKLKKEKLKKKKKEK 230
K K+ K K K K
Sbjct: 261 KKPKESKGVKALKKVVAKGM 280
Score = 32.3 bits (74), Expect = 0.47
Identities = 18/70 (25%), Positives = 35/70 (50%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+ E+ K I + K K ++ K KK++ ++D + +++ + K K ++ K+K
Sbjct: 204 PPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKP 263
Query: 174 KEEKEKKEEK 183
KE K K K
Sbjct: 264 KESKGVKALK 273
Score = 31.9 bits (73), Expect = 0.57
Identities = 19/70 (27%), Positives = 35/70 (50%)
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
K+ K + + + ++ +K R+ E+D E + +K +K K+E KKK KE +
Sbjct: 209 KELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKG 268
Query: 233 SSHEKEVIPK 242
K+V+ K
Sbjct: 269 VKALKKVVAK 278
Score = 29.6 bits (67), Expect = 3.3
Identities = 16/71 (22%), Positives = 30/71 (42%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
+ K+ + K K K+ K K + E+D E + ++K K +E+ KK++ +
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264
Query: 185 FKMKEDKEKPV 195
K V
Sbjct: 265 ESKGVKALKKV 275
Score = 28.8 bits (65), Expect = 5.0
Identities = 21/77 (27%), Positives = 36/77 (46%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
K+ K+ E + + K + K K + +E + + EK++ KEE +KK+ K
Sbjct: 205 PDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPK 264
Query: 184 EFKMKEDKEKPVEKIRK 200
E K + +K V K K
Sbjct: 265 ESKGVKALKKVVAKGMK 281
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 33.7 bits (78), Expect = 0.19
Identities = 22/105 (20%), Positives = 62/105 (59%), Gaps = 1/105 (0%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
K + ++++ K+ +++ + + +E+ ++EK + ++E +E+ +E+E + +
Sbjct: 52 LKALAEEEERERKRKEERREGRAVLQEQIEEREKRR-QEEYEERLQEREQMDEIIERIQE 110
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
E + ++ EK+KK +++ + +E++++K++EKER E+ I
Sbjct: 111 EDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKI 155
Score = 33.0 bits (76), Expect = 0.33
Identities = 18/102 (17%), Positives = 66/102 (64%), Gaps = 2/102 (1%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
++ K++++E ++ ++++ D+I + +++++ + +EK +K++K ++E EF +
Sbjct: 81 EEREKRRQEEYEERLQEREQMDEIIERIQEEDEA--EAQEKREKQKKLREEIDEFNEERI 138
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+ K EK R+ E++ + + ++K ++ ++ + +++++++E+
Sbjct: 139 ERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEK 180
Score = 29.9 bits (68), Expect = 2.5
Identities = 25/114 (21%), Positives = 77/114 (67%), Gaps = 3/114 (2%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E + + K++K+ ++E ++ +++ ++ +++ + + +++ K ++ ++EK ++EE+ +
Sbjct: 111 EDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEERE 170
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKKKKKEKER 232
E +E KE+KE+ V ++R +++++E E++ D+ + L +E+ ++K+++KE+
Sbjct: 171 AERRE--RKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEK 222
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 33.7 bits (78), Expect = 0.19
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK 164
K +K++ K +KE K +K K+K + K + K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQEAKAAKAKSK 89
Score = 31.3 bits (72), Expect = 1.0
Identities = 16/57 (28%), Positives = 31/57 (54%), Gaps = 7/57 (12%)
Query: 111 KVTS-SEDLGEIKKD-KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV 165
+VT+ +E E+K++ K+ DKK K K + KKK+K+ + K + + ++
Sbjct: 41 EVTNLNEQYKEMKEELKAALLDKKELKAWHKAQ--KKKEKQ---EAKAAKAKSKPRL 92
Score = 30.6 bits (70), Expect = 1.4
Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 1/36 (2%)
Query: 200 KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
K +KE K+ K +K KKEK + K + +
Sbjct: 57 KAALLDKKELKAWHKAQK-KKEKQEAKAAKAKSKPR 91
Score = 28.6 bits (65), Expect = 7.2
Identities = 11/42 (26%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
KE++K K + K +K + KK +K++ K K + + +
Sbjct: 53 KEELKAALLDKKELKAWHKAQ---KKKEKQEAKAAKAKSKPR 91
Score = 28.3 bits (64), Expect = 8.1
Identities = 16/66 (24%), Positives = 29/66 (43%), Gaps = 10/66 (15%)
Query: 153 KIKNKEKDKEKKV----DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
+ K + E +V ++ KE ++ + ++KE K +K K+EK K
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQK------KKEKQEAKA 83
Query: 209 KKSKDK 214
K+K K
Sbjct: 84 AKAKSK 89
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 32.2 bits (74), Expect = 0.19
Identities = 19/92 (20%), Positives = 49/92 (53%), Gaps = 10/92 (10%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK---------EKEKG 172
K+ + K ++ +K +++++K K+K KD E +EKK + K++
Sbjct: 21 KQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQK 80
Query: 173 KKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEK 203
+++ +K++++E + DK K ++++ K++
Sbjct: 81 LQQDLQKRQQEELQKILDKINKAIKEVAKKKG 112
Score = 28.7 bits (65), Expect = 3.1
Identities = 13/69 (18%), Positives = 41/69 (59%), Gaps = 4/69 (5%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKD---KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
K +++++K+ K+ + ++ EK+ ++K+ K+ + +EKKE++ K ++ +
Sbjct: 17 KAAQKQLEKEFKKRQAE-LEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQ 75
Query: 193 KPVEKIRKE 201
+ +K++++
Sbjct: 76 RKQQKLQQD 84
>gnl|CDD|227470 COG5141, COG5141, PHD zinc finger-containing protein [General
function prediction only].
Length = 669
Score = 33.8 bits (77), Expect = 0.20
Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 9/64 (14%)
Query: 275 PVAAADEYDTGDSKQVWICPAC-GVQDDGSLPMIGCDGCDAWYHWVCVGLVAEPETSDWF 333
P+ +DE+D IC C ++ S ++ CDGC+ H C G+ PE W
Sbjct: 185 PIEPSDEFDD-------ICTKCTSTHNENSNAIVFCDGCEICVHQSCYGIQFLPE-GFWL 236
Query: 334 CPKC 337
C KC
Sbjct: 237 CRKC 240
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 32.8 bits (75), Expect = 0.20
Identities = 21/92 (22%), Positives = 43/92 (46%), Gaps = 4/92 (4%)
Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
K+ + + KK KE +K +KE K K+ + + + K + + K E + +KE
Sbjct: 40 PQAKKVSADLESEFKKRQKELQKMQKELKAKEAKLQ---DDGKMEALSDRAKAEAEIKKE 96
Query: 209 KKSKDKDKK-LKKEKLKKKKKEKERSSHEKEV 239
K +KK + EK +++ +E +++
Sbjct: 97 KLVNAFNKKQQEYEKDLNRREAEEEQKLLEKI 128
>gnl|CDD|218370 pfam04997, RNA_pol_Rpb1_1, RNA polymerase Rpb1, domain 1. RNA
polymerases catalyze the DNA dependent polymerisation of
RNA. Prokaryotes contain a single RNA polymerase
compared to three in eukaryotes (not including
mitochondrial. and chloroplast polymerases). This
domain, domain 1, represents the clamp domain, which a
mobile domain involved in positioning the DNA,
maintenance of the transcription bubble and positioning
of the nascent RNA strand.
Length = 330
Score = 33.4 bits (77), Expect = 0.21
Identities = 23/101 (22%), Positives = 37/101 (36%), Gaps = 8/101 (7%)
Query: 133 THKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
K K+ K+ K KK++ +K K ++ K E
Sbjct: 107 ESVKYFFLKVVIDPKGKNSKKRL-------KKINNLCKKKSICSKCGEDNGGLKAFEGCG 159
Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
K KI K+ ++ K + ++L KEKL+K EK
Sbjct: 160 KYQPKISKDGAEAIKALLKNIEIEEL-KEKLRKLNPEKVLQ 199
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 33.0 bits (76), Expect = 0.22
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKN---KEKDKEKKVDKEKEKGKKEEKEKKEEK 183
KD + KE++ I KK K D I +EK+K+K + KE++K KEEKEK EE
Sbjct: 65 KDFRKILTKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEEP 122
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 32.5 bits (74), Expect = 0.22
Identities = 19/65 (29%), Positives = 35/65 (53%)
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
+ KK+ K K ++ + +E P + + + + +KK + DK+ KK+K +KKKK
Sbjct: 106 IQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKK 165
Query: 229 EKERS 233
+K S
Sbjct: 166 KKRHS 170
Score = 28.6 bits (64), Expect = 5.2
Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 2/62 (3%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE--KEKKEEKEFKMKEDKE 192
KKK K K KK + + + + + K +K+ +K K E+ + KK++KE K K+ +
Sbjct: 110 KKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRH 169
Query: 193 KP 194
P
Sbjct: 170 SP 171
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 32.6 bits (75), Expect = 0.22
Identities = 22/67 (32%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
+ K + E+E K EE++++ EK E K + +EK R+EE+ +EK+ D+ LKK+
Sbjct: 120 QGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK-REEEERQIEEKRHADEIAFLKKQ 178
Query: 222 KLKKKKK 228
+ K +
Sbjct: 179 NQQLKSQ 185
Score = 28.7 bits (65), Expect = 5.0
Identities = 10/66 (15%), Positives = 38/66 (57%), Gaps = 4/66 (6%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
+ E+ K + +++ K ++ K + + + +++ + E +K E+E+++ +E + ++ +
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE----I 172
Query: 196 EKIRKE 201
++K+
Sbjct: 173 AFLKKQ 178
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 33.2 bits (76), Expect = 0.24
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 18/117 (15%)
Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
K KI+ +K+ + + K KEK +E++E +E EF+ KE++E+ ++KEE E++
Sbjct: 117 NTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFE-KEEEEQRRLLLQKEE---EEQ 172
Query: 209 KKSKDKDKKLKKEKLK-------------KKKKEKERSSHEKEVIPKL-TFKFGTDM 251
+ +K K+K+ ++L+ KK K EK K TF G M
Sbjct: 173 QMNKRKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPNTFSTGIKM 229
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 33.4 bits (77), Expect = 0.24
Identities = 17/67 (25%), Positives = 32/67 (47%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
K+ +++ EK++K K ++ KN K+ K+ + +++ EK E KE K
Sbjct: 241 KEYLDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAK 300
Query: 189 EDKEKPV 195
E P+
Sbjct: 301 YGNEVPL 307
>gnl|CDD|224291 COG1372, COG1372, Intein/homing endonuclease [DNA replication,
recombination, and repair].
Length = 420
Score = 33.4 bits (76), Expect = 0.24
Identities = 20/117 (17%), Positives = 48/117 (41%)
Query: 88 KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD 147
EK+ + + V + ++ ++++ + LG+ K + K KK +I +K
Sbjct: 281 SEKKGKRERVVTISGANLLLELLRIVYAYKLGKAAKIAEALEKLKEKKKNINRRILRKIA 340
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
++ + K+ E + +K KE + K+ ++ +E+I + D
Sbjct: 341 IREVLLKLEGRLLSKLKLLLEILRKEKLLKEDENLKKLSKGDEYFDEIEEIEEIGYD 397
>gnl|CDD|232969 TIGR00422, valS, valyl-tRNA synthetase. The valyl-tRNA synthetase
(ValS) is a class I amino acyl-tRNA ligase and is
particularly closely related to the isoleucyl tRNA
synthetase [Protein synthesis, tRNA aminoacylation].
Length = 861
Score = 33.5 bits (77), Expect = 0.25
Identities = 23/120 (19%), Positives = 50/120 (41%), Gaps = 13/120 (10%)
Query: 59 PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVEL--GVTPMVPNIVKVTSSE 116
PP+ + + E + + K +VE+ + +V++
Sbjct: 746 PPNAPLKVLL---IYTEAETAERLKLNAVDIKGAINFSEVEVVIEKPEVTEAVVELVPGF 802
Query: 117 DL-----GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
++ G I K K + + +K K++KE I+ + +++ +K K + ++KEKEK
Sbjct: 803 EIIIPVKGLINKAKELARLQKQLDKEKKEVIRIEGKLENEGFVKKAPK---EVIEKEKEK 859
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 32.8 bits (75), Expect = 0.25
Identities = 21/115 (18%), Positives = 54/115 (46%), Gaps = 12/115 (10%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
+ E E+ ++ K ++K+ ++ DK +D E + +++E + +++ ++ +
Sbjct: 60 RAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDA 119
Query: 196 EKIRKEEKDSEKEKKSKDKD------------KKLKKEKLKKKKKEKERSSHEKE 238
+ + S D D +K+KKE+ ++K++E+E + E+E
Sbjct: 120 DDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEE 174
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 33.5 bits (76), Expect = 0.26
Identities = 40/132 (30%), Positives = 64/132 (48%), Gaps = 12/132 (9%)
Query: 115 SEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
++++ +IK D K H K E+IKKK + D+IK + D E DK
Sbjct: 1110 ADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENY-IDEIKAQINDLEDVADKAISNDDP 1168
Query: 175 EEKEKKEE-------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD----KKLKKEKL 223
EE EKK E K+ + ++ +K + +I + EKD ++ K + K L K L
Sbjct: 1169 EEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSYGKNLGKLFL 1228
Query: 224 KKKKKEKERSSH 235
+K +EK++S H
Sbjct: 1229 EKIDEEKKKSEH 1240
Score = 29.6 bits (66), Expect = 4.0
Identities = 47/170 (27%), Positives = 76/170 (44%), Gaps = 24/170 (14%)
Query: 122 KKDKSIKKDKKTHKKKE-KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
K +K IKK K +E K KI+ D KD D+ K K+ + + E+ K
Sbjct: 1387 KSEKLIKKIKDDINLEECKSKIESTLDDKDIDECIKKIKELKNHILSEESNIDTYFKNAD 1446
Query: 181 EEKE--------FKMKEDKEKPVEKIRKEEKDSEKE------KKSKDKDKKLKKEKLKKK 226
E E +M ++K + + KI+K+ ++ + K+ DK K K E K
Sbjct: 1447 ENNENVLLLFKNIEMADNKSQHILKIKKDNATNDHDFNINELKEHIDKSKGCKDEADKNA 1506
Query: 227 K---KEKERSSHEKEVIPKLTFKFGT----DMEEKTKRESSPKIVIKPVK 269
K K KE K+ + +L K+ + KTK++S +I+IK +K
Sbjct: 1507 KAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDS--EIIIKEIK 1554
Score = 29.6 bits (66), Expect = 4.7
Identities = 37/150 (24%), Positives = 62/150 (41%), Gaps = 2/150 (1%)
Query: 88 KEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKD 147
K KEY K ++ + N +K + L ++ K + + + K E E+ + +
Sbjct: 815 KSKEYIKTISIKEDEIFKIINEMKFMKDDFLNKVDKFINFENNCKEKIDSEHEQFAELTN 874
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
K + +K D EKK + K E K E+E++ +K E I+ E E
Sbjct: 875 KIKAEISDDKLNDYEKKFNDSKS--LINEINKSIEEEYQNINTLKKVDEYIKICENTKES 932
Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
+K +K LK+ K KE + EK
Sbjct: 933 IEKFHNKQNILKEILNKNIDTIKESNLIEK 962
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 33.5 bits (76), Expect = 0.27
Identities = 26/129 (20%), Positives = 48/129 (37%), Gaps = 19/129 (14%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
K S + + D++ KK + K KK + + E+ E V+++
Sbjct: 50 KKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISE----PLEEGSELVVNEDAA 105
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
K+ +K + + RK S ++ K + K K+ K+KK ++
Sbjct: 106 LDKESKKTPRRTR---------------RKAAAASSDVEEEKTEKKVRKRRKVKKMDEDV 150
Query: 231 ERSSHEKEV 239
E E EV
Sbjct: 151 EDQGSESEV 159
Score = 28.5 bits (63), Expect = 7.6
Identities = 28/150 (18%), Positives = 57/150 (38%), Gaps = 17/150 (11%)
Query: 57 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
+ D+ K P + K + E + EL V + +
Sbjct: 55 ALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSEL-----------VVNED 103
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
+ +S K ++T ++K +++K + K++ + K K K + +++G + E
Sbjct: 104 AALD---KESKKTPRRT-RRKAAAASSDVEEEKTEKKVRKRRKVK-KMDEDVEDQGSESE 158
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSE 206
EE EF + E E++ E+ D E
Sbjct: 159 VSDVEESEFVTSLENES-EEELDLEKDDGE 187
>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB proteins similar to the
monomeric yeast and human topo I. Topo I enzymes are
divided into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. This family may represent more than
one structural domain.
Length = 215
Score = 32.7 bits (75), Expect = 0.28
Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Query: 131 KKTHKKKEKEKIKK--KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
KK K+EK IK K D +K+++K + KE++K K EKEK EE+
Sbjct: 68 KKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKEEKKAIKAEKEKLEEE 122
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 33.4 bits (76), Expect = 0.29
Identities = 29/105 (27%), Positives = 45/105 (42%), Gaps = 15/105 (14%)
Query: 143 KKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEE 202
+ DKK + + + EK+ +K+K KKEE+E+ E++ +K D E VE+I
Sbjct: 2 VDESDKKQQTIDEQSQISPEKQTPNKKDK-KKEEEEQLSEEDAMLKGDLELLVERI---- 56
Query: 203 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKF 247
+D D L+ L KE +SS KF
Sbjct: 57 ---------QDPDIDLQNNSL-NMLKEVIKSSTSSMTAVPKPLKF 91
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 33.0 bits (76), Expect = 0.31
Identities = 26/81 (32%), Positives = 47/81 (58%), Gaps = 6/81 (7%)
Query: 160 DKEKKVDKEKEKGKK--EEKEKKEEKEFKMKEDKEKPVEK---IRKEEKDSEKEKKSKDK 214
KE ++ K + K K EEK +K ++E+ ++E + K ++K I K++KD ++ K K +
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLRE-QLKAIKKELGIEKDDKDELEKLKEKLE 257
Query: 215 DKKLKKEKLKKKKKEKERSSH 235
+ KL +E K +KE + S
Sbjct: 258 ELKLPEEVKKVIEKELTKLSL 278
Score = 30.0 bits (68), Expect = 2.8
Identities = 18/51 (35%), Positives = 29/51 (56%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD 166
E L IKK+ I+KD K +K KEK+++ K ++ K+ KE K ++
Sbjct: 230 EQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVKKVIEKELTKLSLLE 280
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 32.2 bits (73), Expect = 0.31
Identities = 12/33 (36%), Positives = 14/33 (42%)
Query: 27 PSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPP 59
S PG+ P P + P P PPG PP
Sbjct: 147 SSYGAPGVPPPYNPPMLMMKPPPPSPGPPGAPP 179
>gnl|CDD|147348 pfam05124, S_layer_C, S-layer like family, C-terminal region.
Length = 223
Score = 32.4 bits (74), Expect = 0.31
Identities = 26/101 (25%), Positives = 36/101 (35%), Gaps = 8/101 (7%)
Query: 92 YSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDK 151
Y VK +ELG + + G+++ ++KT K D KD
Sbjct: 15 YKDVKTLELGEEYIPDWEAYAVLDDGKGKLEYKDDEVLEQKTVGLALKYVGDYLDDIKDG 74
Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
DKIK K D E+ K F+M E KE
Sbjct: 75 DKIKIANYAKLLFDD--------EDSSDKLNVYFEMDEYKE 107
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 32.8 bits (75), Expect = 0.31
Identities = 21/64 (32%), Positives = 39/64 (60%), Gaps = 3/64 (4%)
Query: 124 DKSIKKDKK-THKKKEKEKIKKKKDKKDK--DKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
DK +D K + +KEK+ I+ + + D ++I++K + +K++D+ K + KK +KE
Sbjct: 31 DKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIA 90
Query: 181 EEKE 184
E KE
Sbjct: 91 ELKE 94
Score = 29.3 bits (66), Expect = 4.0
Identities = 14/101 (13%), Positives = 38/101 (37%), Gaps = 5/101 (4%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
L + K+DK ++K+ + + E + + + + + + +++ +K + K
Sbjct: 150 LEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASA 209
Query: 176 EKEK---KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
EK +E+K E ++ + E+
Sbjct: 210 LGEKAALEEQKALAEAAAAEAAKQEAAAKAAAQEQAALQAA 250
Score = 28.5 bits (64), Expect = 6.4
Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 2/90 (2%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
K+DK ++ K+ E K E + E E + K +K + + +E +
Sbjct: 154 KEDKKSLEEKQAALEDK--LETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALG 211
Query: 208 EKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
EK + ++ K L + + K++ +
Sbjct: 212 EKAALEEQKALAEAAAAEAAKQEAAAKAAA 241
>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
Length = 312
Score = 32.7 bits (75), Expect = 0.32
Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL--KKEKLKKKKK 228
GKK ++ K + K K+K E +SEK +K K K L KK+KL+K +
Sbjct: 103 FGKKFFGKESRIKRYDNKIFKQKEQYDFALENPNSEKIQKIKQKIILLEKKKKKLEKTNE 162
Query: 229 EK 230
EK
Sbjct: 163 EK 164
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 33.0 bits (75), Expect = 0.33
Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 3/88 (3%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
++ + K + DK+K KNKEKDK K D++ +K K EK + + +
Sbjct: 205 RRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQN---SHFDVRI 261
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
+ KE K EK S + L K K
Sbjct: 262 LRSYTKENKKDEKNVVSGIRSSVLLKRK 289
Score = 30.7 bits (69), Expect = 1.8
Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 2/66 (3%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE--KKVDKE 168
+ TS+ K+K K+K + KK+++ K+ K +K K++N D + KE
Sbjct: 209 RNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKE 268
Query: 169 KEKGKK 174
+K +K
Sbjct: 269 NKKDEK 274
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 33.2 bits (76), Expect = 0.33
Identities = 38/168 (22%), Positives = 65/168 (38%), Gaps = 9/168 (5%)
Query: 76 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
E + + E EK ++ +PN + + E K K IK + K
Sbjct: 157 GEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQK 216
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE-----EKEFKMKED 190
KE+ KK+ K DK KIK+ + D + + K + +K + E ++ +K
Sbjct: 217 IKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKAPQNDLKRLENKYAIKSG 276
Query: 191 KEKPVEKI---RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
K + K +K K D+ + L K KK+ KE+ +
Sbjct: 277 DLKNFAVDLFGPEIRKYLQKFLKYYDQAEPLLN-KSKKEPKEEAVETL 323
Score = 31.2 bits (71), Expect = 1.1
Identities = 35/152 (23%), Positives = 70/152 (46%), Gaps = 7/152 (4%)
Query: 91 EYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD 150
E ++ + G V T E SIK+ K + KK ++ +
Sbjct: 99 EELAIEGLAFGTERSTSGAVPETKDET--PASAPSSIKEQKSSELKKVDSQLPDPRALLK 156
Query: 151 KDKIKNKEKDKE-KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE- 208
+ +K E +E +K K ++ K+ K+ K+ ++E K++ +E I+K++ + E
Sbjct: 157 GEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQ-DLEEYKKR-LEAIKKKDIKNPLEL 214
Query: 209 KKSKDKDKKLKKE-KLKKKKKEKERSSHEKEV 239
+K K++ KLKKE K K+K + ++ + +
Sbjct: 215 QKIKEEFDKLKKEGKADKQKIKSAKNDLQNDK 246
>gnl|CDD|204587 pfam11078, Optomotor-blind, Optomotor-blind protein N-terminal
region. This family is conserved in Drosophila spp.
Optomotor-blind is one of the essential toolkit
proteins for coordinating development in diverse animal
taxa, and in Drosophila it plays a key role in
establishing the abdominal pigmentation pattern, in
development of the central nervous system and leg and
wing imaginal disc-formation of Drosophila
melanogaster. This is the N-terminal region of the
protein and does not include the T-box-containing
transcription factor that plays a part in DNA-binding.
Length = 74
Score = 30.4 bits (68), Expect = 0.33
Identities = 14/36 (38%), Positives = 16/36 (44%)
Query: 22 QFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGT 57
Q P P + LIP+ PP HP PL H T
Sbjct: 14 QAGPHPGLYPGTLIPKFPPAHPHPHPHHPLGHAYTT 49
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 33.1 bits (76), Expect = 0.35
Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 5/72 (6%)
Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
+ E+K+ K + K KK K+ K K + +E D E E K ++
Sbjct: 35 EVERKLRKLESKIKKLGIPLKDTG-GKPDVPPSKEFLDLEEEILDLEAEIKEVEE----N 89
Query: 220 KEKLKKKKKEKE 231
E L+K+ E E
Sbjct: 90 LESLEKEINELE 101
Score = 32.4 bits (74), Expect = 0.51
Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 3/82 (3%)
Query: 138 EKEKIKKKKDKK-DKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
E E+ +K + K K I K+ + V KE EE+ E E K E+ + +E
Sbjct: 35 EVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLE 94
Query: 197 KIR--KEEKDSEKEKKSKDKDK 216
K EE + +++ D+
Sbjct: 95 KEINELEEWLNVLDEEKSFLDE 116
Score = 30.0 bits (68), Expect = 3.0
Identities = 17/73 (23%), Positives = 31/73 (42%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
KK +K K D K +E+ +D E E + EE + EKE E+ +
Sbjct: 48 KKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVL 107
Query: 196 EKIRKEEKDSEKE 208
++ + ++ +E
Sbjct: 108 DEEKSFLDENLEE 120
>gnl|CDD|130658 TIGR01597, PYST-B, Plasmodium yoelii subtelomeric family PYST-B.
This model represents a paralogous family of Plasmodium
yoelii genes preferentially located in the subtelomeric
regions of the chromosomes. There are no obvious
homologs to these genes in any other organism.
Length = 255
Score = 32.6 bits (74), Expect = 0.35
Identities = 29/92 (31%), Positives = 42/92 (45%), Gaps = 2/92 (2%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
DKKT K K + + ++ KK+ D N E + DK K K E E ++FK E
Sbjct: 108 DKKTKKLINKLQKELEELKKELDNEMNDELTIQPIHDKIIIK-KDENNSVSEHEDFKQLE 166
Query: 190 DKEKPVEKIRKE-EKDSEKEKKSKDKDKKLKK 220
+++ +E + S K K K KKL K
Sbjct: 167 NEKNSSVSEHEEFDIASSDNLKIKRKLKKLVK 198
>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
(Spore_III_AF). This family represents the stage III
sporulation protein AF (Spore_III_AF) of the bacterial
endospore formation program, which exists in some but
not all members of the Firmicutes (formerly called
low-GC Gram-positives). The C-terminal region of these
proteins is poorly conserved.
Length = 185
Score = 31.9 bits (73), Expect = 0.36
Identities = 24/125 (19%), Positives = 56/125 (44%), Gaps = 7/125 (5%)
Query: 107 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIK---KKKDKKD-KDKIKNKEKDKE 162
P++ SE I+ + I+ KK + ++ I K+ +K + K+K + K
Sbjct: 47 PDLEFYILSESSLIIENE--IESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKV 104
Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR-KEEKDSEKEKKSKDKDKKLKKE 221
K V+ E ++ + +E +KE+ ++ + D++ K +++++ + E
Sbjct: 105 KDVEVEIDEDLESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESEEAE 164
Query: 222 KLKKK 226
K+K
Sbjct: 165 KIKNF 169
Score = 31.9 bits (73), Expect = 0.43
Identities = 23/107 (21%), Positives = 51/107 (47%), Gaps = 12/107 (11%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDK-----EKKVDKE-KEKGKKEEKEKKEEKEFKMKED 190
E I + + + K +I+ ++ K+++K+ ++K K+E K ++ E ++ ED
Sbjct: 55 SESSLIIENEIESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKVKDVEVEIDED 114
Query: 191 KEKPVEKIR------KEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E I+ KEE +++ K + + K K++++E E
Sbjct: 115 LESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESE 161
Score = 28.0 bits (63), Expect = 7.1
Identities = 22/116 (18%), Positives = 50/116 (43%), Gaps = 20/116 (17%)
Query: 148 KKDKDKIKNKEKDKEKKVDKE-----------------KEKGKKEEKEKKEEKEFKMKED 190
+ I+N+ + K+K++ ++K K+E K ++ E ++ ED
Sbjct: 55 SESSLIIENEIESKKKEIQASQRAYILEEYAKQLEKQVEKKLKEEYGVKVKDVEVEIDED 114
Query: 191 KEKP---VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
E ++++ K+ KEK+ + + + K K+E+E S +++ L
Sbjct: 115 LESNNFDIKEVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEESEEAEKIKNFL 170
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 32.4 bits (74), Expect = 0.38
Identities = 23/138 (16%), Positives = 52/138 (37%), Gaps = 2/138 (1%)
Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
++KE+ ++EE+++ K + E+ + E+ + ++E K K
Sbjct: 180 EKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNTPSPKSGSSSPAKP 239
Query: 227 KKEKERSSHEKEVIPKLT-FKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTG 285
++S+ ++ P + K + K + S +V K +P + + T
Sbjct: 240 TSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKK-AAPESTSQSPSSAEPTS 298
Query: 286 DSKQVWICPACGVQDDGS 303
+S Q + D S
Sbjct: 299 ESPQTAGNSSLSSLGDYS 316
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 33.0 bits (76), Expect = 0.38
Identities = 20/94 (21%), Positives = 41/94 (43%), Gaps = 13/94 (13%)
Query: 173 KKEEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
+K E+ ++E ++ ++ + K E IR+ + + K KK+K + K+L EKL+ + E+
Sbjct: 241 QKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRL--EKLEARLAEER 298
Query: 232 RSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 265
K + K+V+
Sbjct: 299 PVEEGK----------PLAFRFPPPGKRLGKLVL 322
Score = 32.6 bits (75), Expect = 0.48
Identities = 18/106 (16%), Positives = 35/106 (33%), Gaps = 20/106 (18%)
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
E+K ++ +E E ++K ++ KE++ + K + K KK K + K+ EK +
Sbjct: 240 EQKAERLRQEAAAYEKQQK---ELAKEQEWIRRGKAAASKAKKAKS---RIKRLEKLEAR 293
Query: 235 HEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 280
E + P V +
Sbjct: 294 LA--------------EERPVEEGKPLAFRFPPPGKRLGKLVLEFE 325
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 32.1 bits (73), Expect = 0.39
Identities = 28/107 (26%), Positives = 51/107 (47%), Gaps = 4/107 (3%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
D K +K EKE + K D + K K KEK+ + ++ K +E +K+++E + +
Sbjct: 136 DVKNAEKFEKECRALSRKKSDDEHRKRSGKQKEKR--RVEDSQKHKEDRRKKQEEKRRND 193
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
+ ++P S +KD+ K K++K ++ E SHE
Sbjct: 194 EDKRPGGGGGSSGGQS--GLSTKDEPPKEKRQKHHDPERRLEPQSHE 238
Score = 29.1 bits (65), Expect = 4.3
Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 1/77 (1%)
Query: 163 KKVDKEKEKGKKEEKEK-KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
KK D E K ++KEK + E K KED+ K E+ R+ ++D +
Sbjct: 153 KKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLS 212
Query: 222 KLKKKKKEKERSSHEKE 238
+ KEK + H+ E
Sbjct: 213 TKDEPPKEKRQKHHDPE 229
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 32.9 bits (75), Expect = 0.39
Identities = 16/98 (16%), Positives = 45/98 (45%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
+ K+ + K+ D+K + + +E+ +++ D E+E +E +++
Sbjct: 376 SSQRELKQTEAAYCKRLDEKRLFEDEAEEEARQRLADDEEEVRAGDEAREEKIAANSQVI 435
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKK 227
DKE+ + + +K++ ++K +++K + K
Sbjct: 436 DKEEVCNLYDRRDTAWQKQRFLREKQTAFERQKTEHTK 473
>gnl|CDD|236267 PRK08451, PRK08451, DNA polymerase III subunits gamma and tau;
Validated.
Length = 535
Score = 32.7 bits (75), Expect = 0.39
Identities = 20/88 (22%), Positives = 40/88 (45%), Gaps = 5/88 (5%)
Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
K K+ +K K E + KEFK +K+K + E + E +K ++ +
Sbjct: 446 KGAKIKIQKAL-KSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQET--- 501
Query: 221 EKLKKKKKEKERSSHEKEVIPKLTFKFG 248
+LK+ + +E + +++I + FG
Sbjct: 502 -QLKEATELQEFMINNEDLIEEAKELFG 528
Score = 30.0 bits (68), Expect = 2.7
Identities = 16/94 (17%), Positives = 40/94 (42%), Gaps = 8/94 (8%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
+ KIK +K K ++ + + + K +K++ + + E +E K+ E
Sbjct: 446 KGAKIKIQKALK-------SAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEV 498
Query: 198 IRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
+ K++ + ++ ++ L +E K+ K
Sbjct: 499 QETQLKEATELQEFMINNEDLIEE-AKELFGIKS 531
Score = 29.6 bits (67), Expect = 4.1
Identities = 14/83 (16%), Positives = 35/83 (42%), Gaps = 6/83 (7%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE- 181
K IK K K + ++ K+ K ++ K + + ++ +E+ K EE ++ +
Sbjct: 446 KGAKIKIQK--ALKSAENPLQSLKEFKPSNEKKKIDTESTAEMLEEEAKKDDEEVQETQL 503
Query: 182 --EKEFKMKEDKEKP-VEKIRKE 201
E + + +E+ ++
Sbjct: 504 KEATELQEFMINNEDLIEEAKEL 526
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 31.5 bits (72), Expect = 0.41
Identities = 17/59 (28%), Positives = 37/59 (62%)
Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
KKV K KKE++E++EE+ + D+E+ ++++ ++E K +K ++ ++K K+
Sbjct: 88 KKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEI 146
Score = 28.8 bits (65), Expect = 3.0
Identities = 14/65 (21%), Positives = 38/65 (58%), Gaps = 4/65 (6%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
++K + K +K +++E+ ++++ +++ D++ KE K K+ +++ + E K+K+
Sbjct: 90 VRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKR----EKRRENERKQKE 145
Query: 181 EEKEF 185
KE
Sbjct: 146 ILKEQ 150
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.7 bits (74), Expect = 0.41
Identities = 37/185 (20%), Positives = 67/185 (36%), Gaps = 36/185 (19%)
Query: 30 AGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSP-- 87
+G P P QHP P PPPS + + S + S
Sbjct: 441 SGLHSGPPQSPFAQHPFT--SGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGG 498
Query: 88 -------------KEKEYSKVKDVELGVTPM-----VPNIVKVTS--SEDLGEIKK-DKS 126
KE+ + ++ E P P +V S S+ K D+
Sbjct: 499 CAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEPTVVNTPSHASQSARFYKHLDRG 558
Query: 127 IKKDKKTH-----------KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
+T KK +E ++K K + ++ + +E++KEK+ ++E+E+ ++
Sbjct: 559 YNSCARTDLYFTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREA 618
Query: 176 EKEKK 180
E+ K
Sbjct: 619 ERAAK 623
Score = 28.9 bits (64), Expect = 6.4
Identities = 15/55 (27%), Positives = 37/55 (67%), Gaps = 4/55 (7%)
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK----ERSSHE 236
K+ + +E+ VEK ++E + +E++ ++K+K+ ++E+ ++++ E+ SSHE
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAKASSSSHE 630
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 32.6 bits (74), Expect = 0.41
Identities = 22/77 (28%), Positives = 41/77 (53%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
++K KK+E+E + + + + E K+ K D E E K E K +E+E +M E
Sbjct: 501 EEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLE 560
Query: 190 DKEKPVEKIRKEEKDSE 206
+ + + K ++ EK++E
Sbjct: 561 KEAQELRKYQESEKETE 577
Score = 31.8 bits (72), Expect = 0.73
Identities = 36/128 (28%), Positives = 68/128 (53%), Gaps = 11/128 (8%)
Query: 117 DLGEIKKDKSIKKDK----KTHKKKEKEKIK--KKKDKKDKDKIKNKEKDKEKKVDKEKE 170
DLG++KK+ + + K + K+K+K+ ++ +K+ K + D N EK E++
Sbjct: 449 DLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLA-----EEK 503
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
K KKEE+E + +E+ E +++ ++D E E K + D KLK+E+ + +KE
Sbjct: 504 KRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEA 563
Query: 231 ERSSHEKE 238
+ +E
Sbjct: 564 QELRKYQE 571
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 32.3 bits (74), Expect = 0.42
Identities = 26/105 (24%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 140 EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG-----KKEEKEKKEEKEFKMKEDKEKP 194
E+ + D+ K K E ++ K E E+ +KEE++ E +E +E ++
Sbjct: 189 EEAILQTDQALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQL 248
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
+EK+ E + E++ + K ++E+L K+ + E S +KE+
Sbjct: 249 IEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQKEI 293
Score = 29.6 bits (67), Expect = 3.4
Identities = 21/92 (22%), Positives = 51/92 (55%), Gaps = 5/92 (5%)
Query: 122 KKDKSIKKDKKTHKKKEKE-KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
K+K+I+ ++ + E E ++ ++K K+++ ++ +E+ ++ V + EK + E ++
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLL 260
Query: 181 EEKEF----KMKEDKEKPVEKIRKEEKDSEKE 208
E+E K++E +E E + E + +KE
Sbjct: 261 AEQERMLEHKLQEQEELLKEGFKTEAESLQKE 292
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 31.7 bits (72), Expect = 0.44
Identities = 23/102 (22%), Positives = 56/102 (54%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
++ K ++K K D+ + ++++ ++ V ++E+ K E +E ++EKE E ++
Sbjct: 21 EEAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAE 80
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
++ ++E E++++ + K+ + K+KK KE + EK
Sbjct: 81 QQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKPFKEMNLEEK 122
Score = 29.4 bits (66), Expect = 2.9
Identities = 24/92 (26%), Positives = 51/92 (55%), Gaps = 2/92 (2%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE--KKVDKEKEKGKKEEKEK 179
+ +++K K EKE+ K+ D+ K K + ++ + E +K + E +++ E+
Sbjct: 22 EAKANMQKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQ 81
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
+E++E +E+KE+ E ++++E S K KK
Sbjct: 82 QEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKP 113
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 31.8 bits (72), Expect = 0.45
Identities = 19/59 (32%), Positives = 23/59 (38%), Gaps = 6/59 (10%)
Query: 17 PMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHP----PGTP-PPSKMKDKVSPK 70
PM P P V G P L P QHPL P +P P PP + + P+
Sbjct: 82 PMMP-VPGHHPMVPMTGQQPHLQPPAQHPLQPTYGQNPQPQQPTHTQPPVQPQQPADPQ 139
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 32.3 bits (74), Expect = 0.46
Identities = 17/61 (27%), Positives = 38/61 (62%), Gaps = 6/61 (9%)
Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE------FKMKEDKEKPVEK 197
K+K K +K++ K +++E++ E E+ ++EE E++++++ K+ E KEK +E
Sbjct: 6 KEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65
Query: 198 I 198
+
Sbjct: 66 L 66
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 31.4 bits (72), Expect = 0.46
Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 1/62 (1%)
Query: 142 IKKKKDKKDKDKI-KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
I K K D K K+K K+KK + E + KK++ EE++ K + I+K
Sbjct: 68 ISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAALLAAIKK 127
Query: 201 EE 202
Sbjct: 128 VP 129
Score = 28.3 bits (64), Expect = 4.6
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 2/58 (3%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
KV S D K+ KK KKK++ + ++K K K + + K+ +K VD
Sbjct: 65 KVDISGVKVPKLNDAYFKRKKKKKKKKKEGEFFEEKKK--KYVVSEERKEDQKAVDAA 120
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is
thought to translocate protons through membrane (inner
membrane in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1)
subunits. This domain should not be confused with the ab
CF(1) proteins (in the head of the ATP synthase) which
are found in pfam00006.
Length = 132
Score = 31.1 bits (71), Expect = 0.47
Identities = 20/88 (22%), Positives = 43/88 (48%), Gaps = 3/88 (3%)
Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK--MKEDKEKPVEKI 198
KI ++ +K + IK E+ ++ E ++ + + E E K++ +K E+I
Sbjct: 26 KILDERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEI 85
Query: 199 RKE-EKDSEKEKKSKDKDKKLKKEKLKK 225
E +KD+E+ +S + + +KE+
Sbjct: 86 LAEAQKDAERLLESARAEIEQEKEQALA 113
Score = 27.6 bits (62), Expect = 6.6
Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 3/90 (3%)
Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
KI ++ K+K KE E+ K+ E E ++ + + + E I +K+++K K+
Sbjct: 26 KILDERKEKIANNIKEAEERLKQAAALLAEAEQQLAQARAEASEIINNAKKEAQKLKEEI 85
Query: 213 DKDKKLKKEKLKKKKK---EKERSSHEKEV 239
+ + E+L + + E+E+ E+
Sbjct: 86 LAEAQKDAERLLESARAEIEQEKEQALAEL 115
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 32.6 bits (74), Expect = 0.49
Identities = 36/172 (20%), Positives = 48/172 (27%), Gaps = 10/172 (5%)
Query: 51 LVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYS-KVKDVELGVTPMVPNI 109
L P M S K K SK K + EK+ V D
Sbjct: 404 LKDPTIIAGKKLMNKLTSEKINNPVKVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELR 463
Query: 110 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD--------- 160
KV S+ K KK + IK KK K K K +D
Sbjct: 464 KKVGKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKANLQDVGECSSPPN 523
Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
++K DK+ K + E + K +E K K +
Sbjct: 524 NKEKNDKQTSTSSSVLKSDRSSIEVRNPNANVKKLEDTTYNAKFPTVSKNNA 575
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 32.5 bits (74), Expect = 0.49
Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 5/99 (5%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
+KI ++D D+++ K+K V + + ++EE+E++EE+ + + E+PV
Sbjct: 371 YDKIATEEDATTIDELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGF 430
Query: 199 RKEEKDSEKEKKSKD-----KDKKLKKEKLKKKKKEKER 232
E K+ K+ EK+ K+ EKE+
Sbjct: 431 EVPEMPMPAAGGGGGIKIVLKNAKITAEKVIIKRAEKEK 469
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 32.2 bits (73), Expect = 0.50
Identities = 18/109 (16%), Positives = 43/109 (39%), Gaps = 15/109 (13%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
EI +++ + KK+E+ + + ++ + +D + E ++ +EKE+ + EE +
Sbjct: 61 EISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGEL---VELQEEKEQLENEELQY 117
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
E + E +S + + +L K +K
Sbjct: 118 LREYN------------LFDRNNLQLEDNLQSLELQYEYSLNQLDKLRK 154
Score = 31.8 bits (72), Expect = 0.62
Identities = 22/110 (20%), Positives = 49/110 (44%), Gaps = 5/110 (4%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE-KEFKMKE 189
+ EKE+ K K + +N E + +D E ++ KKEE+ +E +E + ++
Sbjct: 35 DSELRDAEKER-DTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKED 93
Query: 190 DK-EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
D + + ++++E++ E E+ ++ + E S E +
Sbjct: 94 DDLDGELVELQEEKEQLENEELQYLRE--YNLFDRNNLQLEDNLQSLELQ 141
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 31.5 bits (72), Expect = 0.50
Identities = 10/52 (19%), Positives = 31/52 (59%)
Query: 142 IKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
++ + +K + +EK+ +K+ + ++ G K +K E+++ ++ ED+++
Sbjct: 33 LQWLLIDRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQKE 84
Score = 31.5 bits (72), Expect = 0.56
Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 8/62 (12%)
Query: 165 VDKEKEKGKKEEKE-KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
+D++ EK +K EKE +K +E K + +K K +++ + K++ + +
Sbjct: 38 IDRKMEKYQKREKEIQKRAREL-RKNGDKLSPKKFEKRQEE------LMEDQKEMMMDMM 90
Query: 224 KK 225
K
Sbjct: 91 KP 92
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 32.3 bits (73), Expect = 0.51
Identities = 27/148 (18%), Positives = 60/148 (40%), Gaps = 3/148 (2%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+ED E K ++ ++K ++ + K K D K E ++ + + +E+
Sbjct: 155 ETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAA 214
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+++ E+ ED + + E+++ ++E + +D KEK KKK +
Sbjct: 215 LGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAE 274
Query: 234 SHEKEVIPKLTFKFGTDMEEKTKRESSP 261
+ E + K + + K + E
Sbjct: 275 LEDDEPDKEAVKK---EADSKPEEEDEE 299
Score = 31.5 bits (71), Expect = 1.0
Identities = 31/137 (22%), Positives = 63/137 (45%), Gaps = 6/137 (4%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDK-----KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEK 163
K + + E K DKS DK + ++ E + +++ D+D ++ +D +
Sbjct: 171 EEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLEATEAEEEAALGDEDDFEDYFQDDSE 230
Query: 164 KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
+++ G E++E EE ED P EK +K++ + E + + DK+ K++
Sbjct: 231 DGKDDEDFGSGEDEEDDEEGNI-EYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEA 289
Query: 224 KKKKKEKERSSHEKEVI 240
K +E++ E+E
Sbjct: 290 DSKPEEEDEEDDEQEDD 306
Score = 30.7 bits (69), Expect = 1.7
Identities = 25/124 (20%), Positives = 58/124 (46%), Gaps = 4/124 (3%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKK--DKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+D+ ++ + + +K K K D ++++ E DKE + K ++E++E
Sbjct: 241 GEDEEDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEED 300
Query: 180 KEEKEFK-MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
E+++ + +E E ++K++ +E E K + EK + K K++ ++
Sbjct: 301 DEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPK-ELSSFEKRQAKLKQQIEQLEKEN 359
Query: 239 VIPK 242
+ PK
Sbjct: 360 LAPK 363
Score = 30.7 bits (69), Expect = 1.8
Identities = 24/116 (20%), Positives = 60/116 (51%), Gaps = 2/116 (1%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
++ G I+ + +K KK + + + D+ DK+ +K + K ++ D+E ++ +
Sbjct: 245 EDDEEGNIEYEDFFDPKEKDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQE 304
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
++ E++ + K ++PV + + +S KE S +K + K+++++ +KE
Sbjct: 305 DDQDEEEPPEAAMDKVKLDEPVLE--GVDLESPKELSSFEKRQAKLKQQIEQLEKE 358
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 30.1 bits (68), Expect = 0.52
Identities = 20/66 (30%), Positives = 36/66 (54%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
+EK +K DK+D D I + K ++KK + + + + +K ++K+ K E E P E
Sbjct: 2 EEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEESK 61
Query: 199 RKEEKD 204
R+ +D
Sbjct: 62 RRRTED 67
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 32.5 bits (75), Expect = 0.52
Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 2/93 (2%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
+E I D ++ + + E+ D E E+ +++E + E EK +EK
Sbjct: 167 RELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAA--DESELPEKVLEKF 224
Query: 199 RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
+ K +K +K+++K + + + KK K +E
Sbjct: 225 KALAKQYKKLRKAQEKKVEGRLAQHKKYAKLRE 257
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 32.4 bits (75), Expect = 0.54
Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 4/71 (5%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDK---EKPVEKIRKEEKDSEKEKKSKDKDKKL 218
E +D E E + E++ K EKE + E K E V K +E + E+EK ++ ++K
Sbjct: 804 EGLIDVEAELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLA 863
Query: 219 K-KEKLKKKKK 228
K KE+L + K
Sbjct: 864 KLKERLARLKA 874
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 31.4 bits (71), Expect = 0.54
Identities = 23/103 (22%), Positives = 57/103 (55%), Gaps = 1/103 (0%)
Query: 110 VKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKVDKE 168
VK+TS + IK+++ +K +K+ ++E+E+ +++ ++ + + + + +D+E++ + E
Sbjct: 26 VKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDE 85
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
++ ++ EKK + + I K K +EK KK+
Sbjct: 86 EDNVDLKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKT 128
Score = 29.4 bits (66), Expect = 2.1
Identities = 26/103 (25%), Positives = 50/103 (48%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
+KE I K+ + +K ++ E+++E+ ++ +E E++E+ E E + +ED+E V+
Sbjct: 31 WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVD 90
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
E+K+ S D KK EK + + E V
Sbjct: 91 LKDIEKKNINDIFNSTQDDNAQNLISKNYKKNEKSKKTAEDIV 133
Score = 28.3 bits (63), Expect = 5.8
Identities = 27/100 (27%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
KK E K DK+ I + +D K +K+++ ++EE++++E +E + ED+E+
Sbjct: 16 EKKDENLLEHVKITSWDKEDIIKENEDV--KDEKQEDDEEEEEEDEEEIEEPEDIEDEEE 73
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
VE +EE+D E KD +KK + + + ++
Sbjct: 74 IVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDNAQN 113
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 32.4 bits (74), Expect = 0.56
Identities = 23/73 (31%), Positives = 30/73 (41%), Gaps = 11/73 (15%)
Query: 169 KEKGKKEEK-------EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
KE+G EE +K E KM+E E+ E EE D K + + K+
Sbjct: 818 KEEGTYEELSNNGPLFQKLMENAGKMEEYVEENGE----EEDDQTSSKPVANGNANNLKK 873
Query: 222 KLKKKKKEKERSS 234
KKK KE S
Sbjct: 874 DSSSKKKSKEGKS 886
Score = 32.4 bits (74), Expect = 0.62
Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 4/62 (6%)
Query: 170 EKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
E K E+ +E E + + KPV + +K S K K + + + K++E
Sbjct: 838 ENAGKMEEYVEENGEEEDDQTSSKPVA----NGNANNLKKDSSSKKKSKEGKSVLIKQEE 893
Query: 230 KE 231
+E
Sbjct: 894 RE 895
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 32.4 bits (73), Expect = 0.58
Identities = 37/176 (21%), Positives = 81/176 (46%), Gaps = 6/176 (3%)
Query: 76 KEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHK 135
K+S ++ S E + K+ + + +V ++ + GE++ + +++
Sbjct: 365 ASKLSDEDDDSVMESKMQKLFSEKEIDFGLNSELVDMSDDGENGEMEDTFTSHLPA-SNE 423
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
+ +K++ +K D+ + +E KE+++ K K+ + K+KKE K K+ K +
Sbjct: 424 SESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHA-I 482
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLK----KEKLKKKKKEKERSSHEKEVIPKLTFKF 247
E+ +++ E K + D++L K LK +K +K R +K + F F
Sbjct: 483 ERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKLKKKASNLEEGFVF 538
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 32.2 bits (73), Expect = 0.58
Identities = 24/118 (20%), Positives = 59/118 (50%), Gaps = 7/118 (5%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+ + L + +++ D K +E++K+ + + K ++ + ++++E +
Sbjct: 211 AEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALR 270
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK--DKKLKKEKLKKKKKE 229
+E ++ + MKED E E++ + E+ +K ++K D +L+KEKL+ + K
Sbjct: 271 EENRKLRS-----MKEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENELKS 323
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 32.0 bits (73), Expect = 0.58
Identities = 27/106 (25%), Positives = 38/106 (35%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
+E EK KK+ K K +E K KK + K +E K ++ K +E P
Sbjct: 95 RELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPES 154
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPK 242
S K K+ + KK K +S E PK
Sbjct: 155 SPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPK 200
Score = 30.5 bits (69), Expect = 1.7
Identities = 22/116 (18%), Positives = 38/116 (32%), Gaps = 3/116 (2%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK-- 177
E + K K K +E +K KK D + K K ++++ ++ E
Sbjct: 98 EKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPS 157
Query: 178 -EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+ K K R ++ S +K K + KK +K R
Sbjct: 158 STPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKKMR 213
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 31.6 bits (72), Expect = 0.61
Identities = 17/74 (22%), Positives = 41/74 (55%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
++ ++K ++D++ ++K K K K K K KI+ KEK K +K +K + +++
Sbjct: 28 DEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEED 87
Query: 176 EKEKKEEKEFKMKE 189
E + ++ ++++
Sbjct: 88 TPEDELAEKLRLRK 101
Score = 29.6 bits (67), Expect = 3.1
Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 166 DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK--EKL 223
D+E+++ K+EEK K K K K K EK + + + EK + ++D + EKL
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKL 97
Query: 224 KKKKKEKE 231
+ +K ++E
Sbjct: 98 RLRKLQEE 105
Score = 28.1 bits (63), Expect = 9.8
Identities = 13/48 (27%), Positives = 26/48 (54%)
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
+++E ++ K + ++ + K K K +KEK K++K+EK E
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 30.9 bits (70), Expect = 0.61
Identities = 16/65 (24%), Positives = 35/65 (53%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
E + ++++ + EEKE + K+ K ++ ++K EK+ K++ + +K KK +
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153
Query: 222 KLKKK 226
K + K
Sbjct: 154 KEEGK 158
Score = 29.7 bits (67), Expect = 2.0
Identities = 18/65 (27%), Positives = 36/65 (55%)
Query: 166 DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
+ EKE+ + ++ +++E KE KEK +K+ ++ + K +K+ KK K+ K
Sbjct: 94 ESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTK 153
Query: 226 KKKEK 230
K++ K
Sbjct: 154 KEEGK 158
Score = 29.3 bits (66), Expect = 2.5
Identities = 20/67 (29%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
E+ + +K+ S + ++K K +KEK K+KK+KK +K+ K+ K K +K K
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEK-KEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151
Query: 176 EKEKKEE 182
K+++ +
Sbjct: 152 TKKEEGK 158
Score = 28.2 bits (63), Expect = 5.2
Identities = 19/63 (30%), Positives = 33/63 (52%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
+K++KE K+ ++K+ K K++ KEKKV ++ K K + K K+ K +K
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTTKKE 155
Query: 195 VEK 197
K
Sbjct: 156 EGK 158
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 31.9 bits (73), Expect = 0.61
Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 6/63 (9%)
Query: 138 EKEKIK-----KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
E+E + K+ + +K+ + ++K EKK K +KGK EE +E E + +E E
Sbjct: 67 EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLELEAEEP-E 125
Query: 193 KPV 195
P+
Sbjct: 126 PPL 128
Score = 30.7 bits (70), Expect = 1.7
Identities = 16/56 (28%), Positives = 28/56 (50%), Gaps = 8/56 (14%)
Query: 193 KPVEKI--------RKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
KP+E+I +E K+ E EK+ + +KK ++K KK K+ + E +
Sbjct: 61 KPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEEL 116
>gnl|CDD|235516 PRK05582, PRK05582, DNA topoisomerase I; Validated.
Length = 650
Score = 32.1 bits (74), Expect = 0.64
Identities = 17/56 (30%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
E KK K + K +K ++ K+++D +I + K K+ KV K K+K +K
Sbjct: 195 AEFKKGKKKFEASFYGYKGKKIEL---KNEEDVKEILAELKKKDFKVSKVKKKERK 247
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 30.7 bits (70), Expect = 0.67
Identities = 19/69 (27%), Positives = 42/69 (60%), Gaps = 1/69 (1%)
Query: 164 KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
+ K K +KE +KE+K+ + + K++ + + E+ E+ +K K + +K +++K
Sbjct: 63 DLKKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEK-RTRKNREKKF 121
Query: 224 KKKKKEKER 232
K+++KEKE+
Sbjct: 122 KRRQKEKEK 130
Score = 29.2 bits (66), Expect = 1.8
Identities = 20/58 (34%), Positives = 32/58 (55%)
Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
+ +EKEK KK+K K +I + + + K EKEK ++ +EKK ++ K KE
Sbjct: 72 EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 31.8 bits (72), Expect = 0.68
Identities = 25/108 (23%), Positives = 42/108 (38%), Gaps = 3/108 (2%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
KK K + KK K K K K K K +K K + ++ + K+
Sbjct: 199 AKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKK 258
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
+K + ++K K K K+ K K + K K K K+++ +
Sbjct: 259 ALKKAAKAVKKAAK---KAAKAAAKAAKGAAKATKGKAKAKKKAGKKA 303
Score = 29.5 bits (66), Expect = 3.6
Identities = 27/93 (29%), Positives = 33/93 (35%), Gaps = 1/93 (1%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
K K+ K K KK K K KK K K K K K + K + K KK+
Sbjct: 240 KAAKTAAKAAKKAKKTAK-KALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKK 298
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
+ K K K K +K KK K
Sbjct: 299 AGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331
Score = 28.7 bits (64), Expect = 6.9
Identities = 31/117 (26%), Positives = 42/117 (35%), Gaps = 2/117 (1%)
Query: 119 GEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK-GKKEEK 177
+ K KK K K K K K K K KK K +K KK K
Sbjct: 217 AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAK 276
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEK-KSKDKDKKLKKEKLKKKKKEKERS 233
+ + K K K K + +K + K K+ K K + K KK K+R+
Sbjct: 277 AAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKRA 333
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 31.1 bits (71), Expect = 0.70
Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 2/37 (5%)
Query: 146 KDKKDKDKI--KNKEKDKEKKVDKEKEKGKKEEKEKK 180
KD K ++ + E K ++ E KKE K+KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.0 bits (68), Expect = 1.7
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 190 DKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
K +E+ EE ++ K ++ + KKE KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 32.2 bits (73), Expect = 0.71
Identities = 19/71 (26%), Positives = 26/71 (36%), Gaps = 5/71 (7%)
Query: 10 AVVRQSAPMFPFQFSPFPSVAGPGL-----IPRLPPMLQHPLIPQPLVHPPGTPPPSKMK 64
A RQ++P P +P AGP P PP P P P P PP +
Sbjct: 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785
Query: 65 DKVSPKDKTSP 75
V+ ++
Sbjct: 2786 PAVASLSESRE 2796
Score = 29.9 bits (67), Expect = 4.1
Identities = 20/103 (19%), Positives = 27/103 (26%), Gaps = 10/103 (9%)
Query: 11 VVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPK 70
V R + P F P P P PP PP P
Sbjct: 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP--------PP 2934
Query: 71 DKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 113
P+ ++ + + V LG +VP V V
Sbjct: 2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGA--LVPGRVAVP 2975
Score = 28.8 bits (64), Expect = 7.8
Identities = 14/69 (20%), Positives = 19/69 (27%), Gaps = 4/69 (5%)
Query: 9 QAVVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVS 68
+ +AP P P +A P + P P+ P PPP
Sbjct: 2870 SPAAKPAAPARP----PVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
Query: 69 PKDKTSPKE 77
P P
Sbjct: 2926 PPQPQPPPP 2934
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 31.8 bits (72), Expect = 0.73
Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 10/100 (10%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK------KEEK 183
D+K KEK + K++ DK K+ + ++E++V ++ E+ KK E E+ E +
Sbjct: 114 DRKDENLSSKEKTLESKEQSLTDKSKHID-EREEQV-EKLEEQKKAELERVAALSQAEAR 171
Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
E + E + K +I +++E+E KD+ K+ K+ L
Sbjct: 172 EIILAETENKLTHEIATRIREAERE--VKDRSDKMAKDLL 209
Score = 31.0 bits (70), Expect = 1.4
Identities = 30/107 (28%), Positives = 55/107 (51%), Gaps = 4/107 (3%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK--EK 193
K E++ KK K + K KE E K + K + + E++ K E +E K E + E+
Sbjct: 50 KAERDAEHIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTER 109
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKL--KKEKLKKKKKEKERSSHEKE 238
RK+E S KEK + K++ L K + + +++++ E+ +K+
Sbjct: 110 ATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKK 156
>gnl|CDD|227708 COG5421, COG5421, Transposase [DNA replication, recombination, and
repair].
Length = 480
Score = 32.0 bits (73), Expect = 0.73
Identities = 29/129 (22%), Positives = 55/129 (42%), Gaps = 9/129 (6%)
Query: 110 VKVTSSEDLGEIKKDKSIKKDKKT------HKKKEKEKIKKKKDKKDKDKIKNKE-KDKE 162
V T +E + D +K K K + + K++ KE +
Sbjct: 228 VPATIAEAKELLHADLYLKTLKSDERGSPYGTKVVYGGVGQLWVFLLSYKLQKKEEQTLR 287
Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
+++KE EK +K ++ K +EF ++D EKI K+ S + + K KK ++E
Sbjct: 288 TRIEKELEKAEKSLEKLK-GREFNCEKDARIAAEKILKDYS-SVEFLEVDFKSKKKREEA 345
Query: 223 LKKKKKEKE 231
+ + ++ E
Sbjct: 346 KRGRPRKDE 354
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 31.3 bits (71), Expect = 0.73
Identities = 20/84 (23%), Positives = 39/84 (46%), Gaps = 2/84 (2%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
K ++ K++K+ +K+K+ K K K ++ ++K E + + EK E
Sbjct: 101 AKFEAWKEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAA 160
Query: 182 EKEFKMKED--KEKPVEKIRKEEK 203
E+ +E+ +E P E+ EE
Sbjct: 161 EEAAAAEEEEAEEAPAEEAPAEES 184
Score = 29.4 bits (66), Expect = 2.9
Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 2/76 (2%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK--GKKEEKEKKEEKEFKMKEDKEK 193
K+ K KK K K K K + EKKV++ + + +K+ E + +E+
Sbjct: 110 KEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAAEEE 169
Query: 194 PVEKIRKEEKDSEKEK 209
E+ EE +E+
Sbjct: 170 EAEEAPAEEAPAEESA 185
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 31.9 bits (73), Expect = 0.74
Identities = 29/111 (26%), Positives = 51/111 (45%), Gaps = 20/111 (18%)
Query: 129 KDKKTHKKKEKEKIK-KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE----- 182
KDK T K++ I +++I+ K+ E + E++K +KEE E +
Sbjct: 480 KDKGTGKEQ---SITITASSGLSEEEIERMVKEAEA--NAEEDKKRKEEIEARNNADSLA 534
Query: 183 -------KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKK 225
KE K E+ EKI K + ++ K +D ++ K K E+L++
Sbjct: 535 YQAEKTLKEAGDKLPAEE-KEKIEKAVAELKEALKGEDVEEIKAKTEELQQ 584
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 31.8 bits (72), Expect = 0.75
Identities = 27/158 (17%), Positives = 51/158 (32%), Gaps = 2/158 (1%)
Query: 43 QHPLIPQPLVHPPGTPPPSK-MKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELG 101
Q P+ + P TP K +P + + K EL
Sbjct: 269 QTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPAMYSSDYQAPKPEPIYTWEELL 328
Query: 102 VTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK 161
++ ++S D D + +K ++ + +D + +K
Sbjct: 329 RERFPSDLFAISSLPDSDSEASDSGPTRKRK-RRRVPPLPEYSSDEDEDDSDEDEVDYEK 387
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR 199
E+K +E++K K + K + E EK V+ R
Sbjct: 388 ERKRRREEDKNFLRLKALELSKYAGVNERMEKIVQVTR 425
>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
PREFRAN. This family is the long protein of a novel
CRISPR subtype, PREFRAN, which is most common in
Prevotella and Francisella, although widely distributed.
The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
helicase Cas3 and endonuclease Cas3-HD.
Length = 1287
Score = 32.1 bits (73), Expect = 0.77
Identities = 34/133 (25%), Positives = 59/133 (44%), Gaps = 12/133 (9%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
IK+ +++DK + + +++KK DK KD I+ + E D +E K KK
Sbjct: 28 IKELGLLEQDK--ARAESYKELKKIIDKYHKDFIEETLSNVELSFDLLEEYFDLYNKLKK 85
Query: 181 EEKEFKMKEDKEKPVEKIRKE-----EKDSEKEKKSKDKDKKLKKEKLKK--KKKEKERS 233
+K+ K+D K + +RK+ + + +E KKL K L+ K K+
Sbjct: 86 FKKD---KKDLAKEQDTLRKQIVKSLQDNLPEEIFKNIFSKKLLKGVLEDFIKNTHKDSE 142
Query: 234 SHEKEVIPKLTFK 246
+ +I FK
Sbjct: 143 DEQNILIDIKEFK 155
Score = 28.3 bits (63), Expect = 9.9
Identities = 22/124 (17%), Positives = 50/124 (40%), Gaps = 11/124 (8%)
Query: 62 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEI 121
+K ++ P+ S KEK K K K+++Y ++ + + + E +
Sbjct: 380 YIKKQIEPRKGKSKKEKYEEKVKKVVKKQDYLSIQFINDLLEKYI--------KEFDSKD 431
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
+ ++I+ E + + + IK + E+ + KE++ K++E+ K
Sbjct: 432 RLFENIRDYFAEISLMRNEDFQTNDNIAN---IKTAYQTFERILQKERKDLKQDEEVVKL 488
Query: 182 EKEF 185
K
Sbjct: 489 IKSL 492
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 31.0 bits (71), Expect = 0.78
Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
KK K KK+K+ +K+ ++ D+ + + K + E+ D+E + ++ E E+K
Sbjct: 15 KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAER--DRELNRQRQAEAEQKA 72
Query: 182 EK 183
Sbjct: 73 IV 74
Score = 27.6 bits (62), Expect = 9.7
Identities = 14/63 (22%), Positives = 34/63 (53%), Gaps = 4/63 (6%)
Query: 159 KDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
K K KK KEK K +K+ ++ ++ + ++K+ E+ + E+ + ++E + + +
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGDDELKQA----AEEAKAEKAERDRELNRQRQAEAE 69
Query: 219 KKE 221
+K
Sbjct: 70 QKA 72
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.9 bits (72), Expect = 0.80
Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 3/88 (3%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
+E ++ + + E + E E++ E E K + + E + RK E++ E E ++K+
Sbjct: 645 TGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEA 704
Query: 215 DKK---LKKEKLKKKKKEKERSSHEKEV 239
D K +E + + E E + E E+
Sbjct: 705 DHKGETEAEEVEHEGETEAEGTEDEGEI 732
Score = 31.5 bits (71), Expect = 1.2
Identities = 19/85 (22%), Positives = 42/85 (49%), Gaps = 2/85 (2%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
K D +K + + + + + K+D+K D D ++E+E
Sbjct: 812 KDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDG--GGGSDGGDSEEEEEE 869
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPV 195
+ ++EE+E++EE+E + +E+ E+P+
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPL 894
Score = 29.2 bits (65), Expect = 6.2
Identities = 32/164 (19%), Positives = 71/164 (43%), Gaps = 13/164 (7%)
Query: 77 EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIK--KDKSIKKDKKTH 134
E++ + + + K + + + ED GEI+ +D +K D+
Sbjct: 739 EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
K E E + +K + E E + D + K + E+E E + + K+D++
Sbjct: 799 GKVEHEGETEAGEKDEH------EGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGV 852
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ DSE+E++ ++++++ ++E+ +E+E E E
Sbjct: 853 DGGGGSDGGDSEEEEEEEEEEEEEEEEE-----EEEEEEEEENE 891
Score = 29.2 bits (65), Expect = 6.5
Identities = 22/95 (23%), Positives = 53/95 (55%), Gaps = 7/95 (7%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD-KIKNKEKDKEKKVDKEKEK--G 172
E GE + + + + ++ + + ++K + +++ + + + + K EK VD G
Sbjct: 802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGG 861
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
EE+E++EE+E E++E+ E+ +EE+++E+
Sbjct: 862 DSEEEEEEEEEE----EEEEEEEEEEEEEEEENEE 892
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 30.9 bits (70), Expect = 0.83
Identities = 38/146 (26%), Positives = 66/146 (45%), Gaps = 21/146 (14%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV---------------- 165
KK K+ + K ++K+ K K +KKK + I K DK+K +
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGLAADPNTA 62
Query: 166 -DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
+ K E K+ + ++DK P++ E DS++E++ + K+L++E +
Sbjct: 63 ARTDLLIQDKPIPELKKGEARIERDDKGNPLD----EPLDSKEEEEKPEVVKQLEEEASE 118
Query: 225 KKKKEKERSSHEKEVIPKLTFKFGTD 250
KK + S E+E I +L K G D
Sbjct: 119 PAKKPRRLSKREEEWIERLVEKHGDD 144
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 31.8 bits (72), Expect = 0.85
Identities = 34/151 (22%), Positives = 56/151 (37%), Gaps = 20/151 (13%)
Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
K+ D EK + KKE KK EK K+K + V K ++EEK+ +
Sbjct: 110 GKKTNDDLEKIESKKESINKKNEK--KIKNEASLQV-KTQREEKEEKDFIAD-------C 159
Query: 220 KEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESS----------PKIVIKPVK 269
L KK +E+ + E K K + E S +IV + +
Sbjct: 160 WRNLYKKNEERFKEYLENFKRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRHNQ 219
Query: 270 SPSPPPVAAADEYDTGDSKQVWICPACGVQD 300
+P ++D+ ++ +W G D
Sbjct: 220 TPLALLECNLTDFDSIENHSIWEQKIVGSGD 250
Score = 29.1 bits (65), Expect = 6.5
Identities = 21/94 (22%), Positives = 45/94 (47%), Gaps = 4/94 (4%)
Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF 185
I+ K++ KK ++KI K + + K + +EK+++ + K+ +E KE+
Sbjct: 119 KIESKKESINKKNEKKI--KNEASLQVKTQREEKEEKDFIADCWRNLYKKNEE--RFKEY 174
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
++K KI KE ++++ +K+K
Sbjct: 175 LENFKRKKFKRKILKEFENAKINASEIVGLEKVK 208
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 30.1 bits (68), Expect = 0.93
Identities = 14/74 (18%), Positives = 44/74 (59%)
Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
++E +++++ E E K++E +++ E++RKE+++ +E++ K + + ++K++ +
Sbjct: 24 SQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKMEDE 83
Query: 227 KKEKERSSHEKEVI 240
K ++ H +
Sbjct: 84 KLQETWHEHNLALA 97
Score = 27.4 bits (61), Expect = 7.8
Identities = 18/84 (21%), Positives = 44/84 (52%)
Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDS 205
K +++ ++ +KEK + + K +E+ K+E +E ++EK +E + K +E + E+K
Sbjct: 22 KFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKME 81
Query: 206 EKEKKSKDKDKKLKKEKLKKKKKE 229
+++ + + L + K +
Sbjct: 82 DEKLQETWHEHNLALANFIRTKTK 105
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 30.7 bits (70), Expect = 0.95
Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 3/84 (3%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
K K++I +K E+ +K + +E K E + K +E+ E+ +K++ EKD +
Sbjct: 30 KSLKEEIAEMKK-NEEHNEKLMAEIAQENKRLVEPLK-KAEEEVEELRKKLKDYEKDKQS 87
Query: 208 EKKSKDKDKKLKKEKLKKKKKEKE 231
K K + K+L+KE LK K E E
Sbjct: 88 LKNLKARLKELEKE-LKNLKWESE 110
Score = 28.3 bits (64), Expect = 6.7
Identities = 21/108 (19%), Positives = 56/108 (51%), Gaps = 12/108 (11%)
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
KS+K++ KK E+ ++K +I + K + + K +E+ ++ K+ K+ ++
Sbjct: 30 KSLKEEIAEMKKNEE------HNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEK 83
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
K ++ ++ K+ EKE K+ + ++ +++ +K ++E++
Sbjct: 84 DKQS------LKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDE 125
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 30.3 bits (69), Expect = 0.98
Identities = 21/119 (17%), Positives = 54/119 (45%), Gaps = 11/119 (9%)
Query: 115 SEDLGEIKKDKS-----IKKDKKTHK--KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
++DL E ++ K + + ++ + +++ +I ++ K+ + + + + E+++++
Sbjct: 43 ADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELER 102
Query: 168 EKEKGKKE-EKEKKEEKE---FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
KE + E E EK+ E ++ E EK+ ++ D +K D E
Sbjct: 103 IKEAAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKDLIDAFIAELGEN 161
Score = 28.0 bits (63), Expect = 6.2
Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 2/86 (2%)
Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
K ++ + K E E+ K+E + E E +++E +E+ +I ++ K E E+ ++
Sbjct: 33 KALDERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQ-ASEIIEQAKK-EAEQIAE 90
Query: 213 DKDKKLKKEKLKKKKKEKERSSHEKE 238
+ + ++E + K+ + EKE
Sbjct: 91 EIKAEAEEELERIKEAAEAEIEAEKE 116
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 30.9 bits (70), Expect = 0.99
Identities = 22/101 (21%), Positives = 56/101 (55%), Gaps = 2/101 (1%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK--EKKEEKEFKMKEDKEK 193
++++++ + ++ ++ + KN K + D +K K+E+K KE KE + E +E
Sbjct: 109 REQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERLVAEAREH 168
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
+ + ++ + K+K++K K ++ K+++KE++R +
Sbjct: 169 FGYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEKRMA 209
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 30.8 bits (70), Expect = 1.0
Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 6/90 (6%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM-KEDKE 192
KK K+ K+++ + K + ++ K EK + KE++
Sbjct: 127 KGKKPKKGFAGIPADTSKEEVVALPVKQNKN-----SEVIPFQRTPKREKARVITKEERA 181
Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
+ ++ K + K K+K K+ K++K
Sbjct: 182 FNAYRTLRQAKLNAKFVGKKEKKKQAKEKK 211
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 30.8 bits (70), Expect = 1.0
Identities = 12/38 (31%), Positives = 19/38 (50%)
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
E K+ K+++EF +E KE +K + E K K
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKK 195
Score = 28.9 bits (65), Expect = 3.6
Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 2/39 (5%)
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
K+ K+K E+ EE+ +K K E KKK
Sbjct: 160 KDAKQK--EEFAAEERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 31.0 bits (70), Expect = 1.1
Identities = 23/103 (22%), Positives = 46/103 (44%), Gaps = 3/103 (2%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
+ K D E+ K K EE +K++E+ K E+ E E++++ K E E ++ K
Sbjct: 132 DLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPG 191
Query: 222 KLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 264
++ KK + E+ + +D+ ++T + I
Sbjct: 192 EVYDLKKRWDELEPGVELPEEELI---SDLVKETLNLAPKDIE 231
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 30.6 bits (70), Expect = 1.1
Identities = 25/96 (26%), Positives = 53/96 (55%), Gaps = 11/96 (11%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
KKE E +KK+ ++ KE+ + + + E+E ++ + +++EK KE+
Sbjct: 44 KKEAEALKKEA------LLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQKEETLD-- 95
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
RK+E +KE+ ++K+K+L + + ++KE+E
Sbjct: 96 ---RKDESLEKKEESLEEKEKELAARQQQLEEKEEE 128
Score = 28.7 bits (65), Expect = 5.3
Identities = 26/92 (28%), Positives = 47/92 (51%), Gaps = 9/92 (9%)
Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
KI + E++ ++ + E+ KKE + K+E + KE+ K + +E K+ E + +
Sbjct: 28 KISSAEEEAKQII----EEAKKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQ 83
Query: 213 DKDKKLKKEKLKKK-----KKEKERSSHEKEV 239
+K K+E L +K KKE+ EKE+
Sbjct: 84 EKRLLQKEETLDRKDESLEKKEESLEEKEKEL 115
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 31.4 bits (71), Expect = 1.1
Identities = 28/101 (27%), Positives = 49/101 (48%), Gaps = 11/101 (10%)
Query: 161 KEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
+EK+ +E EK KEE E++ + E + + ++EK + + + +E EK ++
Sbjct: 252 EEKR--RELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEK---------RR 300
Query: 221 EKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSP 261
EKL+ K+ RS+ I FK G ++ R S P
Sbjct: 301 EKLQNLLKKASRSADNVWYIEPSEFKAGDTVKLYYNRSSGP 341
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 31.1 bits (70), Expect = 1.1
Identities = 40/185 (21%), Positives = 72/185 (38%), Gaps = 24/185 (12%)
Query: 61 SKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGE 120
SK++++ K+K PK + E+ + +K K L
Sbjct: 35 SKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAAL-------------------- 74
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI--KNKEKDKEKKVDKEKEKGKKEEKE 178
K K ++ T ++K K K K K K K K + E+ ++EK K +
Sbjct: 75 -AKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAA 133
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKERSSHEK 237
+ K + + K + E++ +EE++++KEK K K K+K E +
Sbjct: 134 AAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTE 193
Query: 238 EVIPK 242
EV +
Sbjct: 194 EVTEE 198
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 31.6 bits (71), Expect = 1.1
Identities = 53/231 (22%), Positives = 77/231 (33%), Gaps = 14/231 (6%)
Query: 54 PPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVT 113
PP P S + K + E SKE P KE K + + G P K
Sbjct: 510 PPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEP--KEGGKPGETKEGEVGKKPGPAKEH 567
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+ + K KD K H K +E K K+ + + + K E
Sbjct: 568 KPSKIPTLSKKPEFPKDPK-HPKDPEEPKKPKRPRSAQRPTRPKSPKL-----PELLDIP 621
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK--E 231
K K + K K ++P R E K K K K K+K + +
Sbjct: 622 KSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLD 681
Query: 232 RSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSP-PPVAAADE 281
++ KE T + E +E+ P+ P +P P PP DE
Sbjct: 682 AAAKSKETK---TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDE 729
Score = 29.3 bits (65), Expect = 6.0
Identities = 41/243 (16%), Positives = 77/243 (31%), Gaps = 6/243 (2%)
Query: 56 GTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSS 115
G PP + + + KE KE P E + +V P+ + S
Sbjct: 518 GLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSK 577
Query: 116 -----EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
+D K + KK K+ + + K K + D K+ ++ + K K
Sbjct: 578 KPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
++ ++ E +K K K + K EK KE ++
Sbjct: 638 PPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDE 697
Query: 231 ERSSHEKEVIPKLT-FKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGDSKQ 289
S KE +P+ F T K + +P+ P + + + ++
Sbjct: 698 SFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEER 757
Query: 290 VWI 292
+
Sbjct: 758 TFF 760
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 30.9 bits (70), Expect = 1.1
Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 4/69 (5%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKK----VDKEKEKGKKEEKEKKEEK 183
K KK K K K+ ++++ K +K+ K D ++E K EE+E++EE+
Sbjct: 125 KAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEE 184
Query: 184 EFKMKEDKE 192
E + +D +
Sbjct: 185 EDEDFDDDD 193
Score = 29.3 bits (66), Expect = 3.6
Identities = 18/78 (23%), Positives = 33/78 (42%), Gaps = 1/78 (1%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
K +K + K K K +E EK EK+ + E ++ EE E +E+
Sbjct: 121 GINKKAGKKLALSKFKRKVGLFTE-EEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179
Query: 191 KEKPVEKIRKEEKDSEKE 208
+E+ E ++ D + +
Sbjct: 180 EEEEEEDEDFDDDDDDDD 197
>gnl|CDD|179204 PRK01005, PRK01005, V-type ATP synthase subunit E; Provisional.
Length = 207
Score = 30.5 bits (69), Expect = 1.1
Identities = 27/112 (24%), Positives = 51/112 (45%), Gaps = 7/112 (6%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
++K+K+ D ++ +K E++ V KE+ K+ E +EE E ++ +E +
Sbjct: 5 SAQDKLKQICDALREETLKPAEEEAGAIVHNAKEQAKRIIAEAQEEAEKIIRSAEETADQ 64
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK-----ERSSHEKEVIPKL 243
K+++ E S + K + LK+ K +E E + EV KL
Sbjct: 65 KLKQGE--SALVQAGKRSLESLKQAVENKIFRESLGEWLEHVLTDPEVSAKL 114
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 30.5 bits (69), Expect = 1.1
Identities = 16/83 (19%), Positives = 31/83 (37%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
+ E+ K +++ + + KK ++ N+E E K+ +E
Sbjct: 76 SLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHLNEEVSNETKLTQESS 135
Query: 171 KGKKEEKEKKEEKEFKMKEDKEK 193
+ ++ K K K DKEK
Sbjct: 136 SDESPKEVKLATKNKTKKHDKEK 158
Score = 29.3 bits (66), Expect = 2.6
Identities = 24/102 (23%), Positives = 49/102 (48%), Gaps = 4/102 (3%)
Query: 125 KSIKKDKKTHKKK---EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE-EKEKK 180
+ IK++ +KK+ E +KKK ++ D +K +++ K + + KK E++
Sbjct: 57 EEIKEEALKYKKEFESAVESLKKKLKFEELDDLKITAENEIKSIQDLLQDYKKSLEEDTI 116
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
+ ++ K ++ +E E + +K+K KK KEK
Sbjct: 117 PNHLNEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDKEK 158
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 31.3 bits (71), Expect = 1.1
Identities = 16/49 (32%), Positives = 18/49 (36%), Gaps = 1/49 (2%)
Query: 17 PMFPFQFSPFPSVAGPGLIPRLPPM-LQHPLIPQPLVHPPGTPPPSKMK 64
P P Q P PGL ++PP Q P Q P PPP
Sbjct: 234 PQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQP 282
Score = 28.6 bits (64), Expect = 9.3
Identities = 9/55 (16%), Positives = 12/55 (21%), Gaps = 3/55 (5%)
Query: 9 QAVVRQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKM 63
A +Q P P + P P P P P +
Sbjct: 189 AAFPQQGPPEQPPGYPQPP---QGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQP 240
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 31.4 bits (72), Expect = 1.1
Identities = 19/73 (26%), Positives = 41/73 (56%)
Query: 144 KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
++K+KK K+++KN + KK+ + E+ K EE+EK E+ + E+ +E++
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549
Query: 204 DSEKEKKSKDKDK 216
+ +KE + + +
Sbjct: 550 EKDKETEEDEPEG 562
Score = 29.1 bits (66), Expect = 5.7
Identities = 23/97 (23%), Positives = 42/97 (43%), Gaps = 21/97 (21%)
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
KEK KEE +K K +K+++ +++ + E++ KD+ L +K K K
Sbjct: 492 KEKKMKEE----------LKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDF-VKSKGK 540
Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVI 265
E+E ++E E + PKI++
Sbjct: 541 EEEEEEEKEEKDK----------ETEEDEPEGPKIIV 567
Score = 29.1 bits (66), Expect = 6.2
Identities = 14/73 (19%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 155 KNKEKDKEKKVDK-EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
+ KEK ++++ + KK ++ +E+K + ++D++ ++ K + E+E++ K+
Sbjct: 490 RRKEKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKE 549
Query: 214 KDKKLKKEKLKKK 226
+ K +E +
Sbjct: 550 EKDKETEEDEPEG 562
>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
2-oxoacid:ferredoxin oxidoreductases, beta subunit
[Energy production and conversion].
Length = 294
Score = 31.1 bits (71), Expect = 1.2
Identities = 12/77 (15%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEK-KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
++ K ++ + + + ++EK K EE + EE+ + + +
Sbjct: 218 EEAKLAVETGYWPLYRYE--PGKAEEEKGKDIPIGIFYPVEEYLEYEERFKHLTKSNPEL 275
Query: 195 VEKIRKEEKDSEKEKKS 211
++K++K+ + +E K
Sbjct: 276 IQKLQKDVDERWEELKR 292
>gnl|CDD|216187 pfam00913, Trypan_glycop, Trypanosome variant surface glycoprotein
(A-type). The trypanosome parasite expresses these
proteins to evade the immune response. This family
includes a variety of surface proteins such as
Trypanosoma brucei VSGs such as expression site
associated gene (ESAG) 6 and 7.
Length = 370
Score = 31.3 bits (71), Expect = 1.2
Identities = 17/73 (23%), Positives = 32/73 (43%), Gaps = 2/73 (2%)
Query: 111 KVTSSEDLGEIKK--DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
T+ + E+K+ + + K T + EKIKK D +KIK + + + +
Sbjct: 279 DTTTLKASTELKEALKRLLLNKKDTDDEDAAEKIKKLFGDDDAEKIKKLWEKIDNEQIPK 338
Query: 169 KEKGKKEEKEKKE 181
K G ++ + E
Sbjct: 339 KIAGTTKKTKLGE 351
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 31.2 bits (70), Expect = 1.2
Identities = 35/167 (20%), Positives = 71/167 (42%), Gaps = 17/167 (10%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHK---KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDK 167
K+ E L E K + K +K K + K I ++ + NK + +K
Sbjct: 139 KMNRREKLRERKALNAAKFEKNIEKELMLRLKSGIYGDTPLNVREHLWNKAATEREKRQD 198
Query: 168 EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI----------RKEEKDSEKEKKSKDKDKK 217
EKE+ +EE+E E E + +++ +K E +SE+E+ S+ + +
Sbjct: 199 EKERYVEEEEESDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSESESDE 258
Query: 218 LKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIV 264
+ E K K ++++ +K P + ++ E++ + E P +
Sbjct: 259 DEDEDNKGKIRKRKTDDAKKSRKPHIHIEY----EQERENEKIPAVQ 301
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 31.2 bits (71), Expect = 1.2
Identities = 16/103 (15%), Positives = 38/103 (36%), Gaps = 7/103 (6%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK-EKPVE 196
EK + +K+ +++ ++++ + E G K + + + + P E
Sbjct: 511 EKLEEVRKELEEELEEVEKLLDEVEL--LTGANSGGKTSLLELKAQIVVLAHMGLPVPAE 568
Query: 197 KIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ + E + E K K K L + K+K+ +
Sbjct: 569 EAKVE---AVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEA 608
Score = 30.8 bits (70), Expect = 1.5
Identities = 24/115 (20%), Positives = 50/115 (43%), Gaps = 10/115 (8%)
Query: 138 EKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE----KKEEKEFKMK--EDK 191
E+ K + ++K+ +++ K ++ K++++E E+ +K E K E K
Sbjct: 493 EEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELK 552
Query: 192 EKPVEKIRKE--EKDSEKEKKSKDKDKKLKKEK--LKKKKKEKERSSHEKEVIPK 242
+ V E + ++ D+ K LKK+K L E + +V+P+
Sbjct: 553 AQIVVLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPE 607
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 30.3 bits (69), Expect = 1.2
Identities = 28/143 (19%), Positives = 59/143 (41%), Gaps = 28/143 (19%)
Query: 89 EKEYSKVKDVELGVTPM-VPNIVKVTSSEDLGEIKKDK--------SIKKDKKTHKKKEK 139
EK KVK G++ M V +++ + L ++ +K S K
Sbjct: 18 EKLAPKVK----GISSMSVKEVLQSLVDDGL--VRTEKIGTSNYYWSFPSQALNKLKTRL 71
Query: 140 EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR 199
EK+KK+ ++ + E + ++ K+ +EE E + + +E +
Sbjct: 72 EKLKKELEELKQRI-------------AELQAQIEKLKKGREETEERTELLEELKQLEKE 118
Query: 200 KEEKDSEKEKKSKDKDKKLKKEK 222
++ +E EK K+ ++++K K
Sbjct: 119 LKKLKAELEKYEKNDPERIEKLK 141
>gnl|CDD|112704 pfam03904, DUF334, Domain of unknown function (DUF334).
Staphylococcus aureus plasmid proteins with no
characterized function.
Length = 229
Score = 30.8 bits (69), Expect = 1.2
Identities = 28/110 (25%), Positives = 54/110 (49%), Gaps = 5/110 (4%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK 180
I D+ KK+++ + K+ +K + ++N+E + KK +K K E KE +
Sbjct: 6 IMSDEDEKKNEENMNMQMNSKLTGTHSQKIQKSLENEELQELKKQNKLIIKYIAEIKENQ 65
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKD-SEKEKKSKDKDKKLKKEKLKKKKKE 229
+ +E ++K K ++++ +D +K KK + L +EKLK E
Sbjct: 66 DIREKELKAIK----SELKEATEDFQDKGKKIHNDFVDLLQEKLKHVDAE 111
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 29.7 bits (67), Expect = 1.2
Identities = 26/102 (25%), Positives = 50/102 (49%), Gaps = 10/102 (9%)
Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK--EKKEEKEFKMKE 189
KE++K K ++K+ K K E +++E+ EE+ +K+ EKE
Sbjct: 26 SIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEFA 85
Query: 190 DKEKPV--------EKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
DKEK V E+ RK E++ ++ ++ ++++ K + L
Sbjct: 86 DKEKFVTSAYKKQLEENRKLEEEEKEREELEEENDVTKGKDL 127
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 30.5 bits (69), Expect = 1.2
Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 2/79 (2%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
+ D E +K+ + E ++F + +++ EK K + +++LK
Sbjct: 37 AEAEDAAAELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEVNERLK-- 94
Query: 222 KLKKKKKEKERSSHEKEVI 240
L+KKK E + E+E I
Sbjct: 95 ALEKKKAELKDLEEEREGI 113
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 30.4 bits (69), Expect = 1.3
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN-KEKDKEKKVDKEKEKGKK 174
KK + +KK + K+ E + KKK + +++K K K+ K+KK+ K G K
Sbjct: 151 KKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVAAVGMK 204
Score = 29.6 bits (67), Expect = 2.6
Identities = 19/67 (28%), Positives = 40/67 (59%), Gaps = 10/67 (14%)
Query: 175 EEKEKKEEKEFKMKEDK-----EKPVEKIRK-----EEKDSEKEKKSKDKDKKLKKEKLK 224
+ KE+ ++K +K ++K K VE+++K +E ++ ++KK + ++ KK+K K
Sbjct: 128 DVKEEGDDKFYKYSDEKLLAWLLKKVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKKK 187
Query: 225 KKKKEKE 231
K K+K+
Sbjct: 188 KSAKKKK 194
>gnl|CDD|162081 TIGR00874, talAB, transaldolase. This family includes the majority
of known and predicted transaldolase sequences,
including E. coli TalA and TalB. It excluded two other
families. The first includes E. coli transaldolase-like
protein TalC. The second family includes the putative
transaldolases of Helicobacter pylori and Mycobacterium
tuberculosis [Energy metabolism, Pentose phosphate
pathway].
Length = 317
Score = 30.9 bits (70), Expect = 1.3
Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 5/61 (8%)
Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK--MKEDK---EKPVEKIRKEEKDSE 206
D++K E+K+D E K ++ +E EF+ ED EK E IRK D E
Sbjct: 248 DELKESTGPVERKLDPESAKKVDKQPIILDESEFRFLHNEDAMATEKLAEGIRKFAADQE 307
Query: 207 K 207
K
Sbjct: 308 K 308
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 29.9 bits (67), Expect = 1.3
Identities = 22/101 (21%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR-------KEEKD 204
K ++ ++ D+E ++ +E +E+++ K KM+E++E + IR KEE +
Sbjct: 22 GKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRDKYGIKKKEEDE 81
Query: 205 SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 245
E + +++++ +L ++K ++ E ++E K F
Sbjct: 82 EEPQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEEEKSGF 122
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 30.7 bits (70), Expect = 1.3
Identities = 9/70 (12%), Positives = 31/70 (44%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
K K + ++ +++ +K + +++K + + + K ++ + +K + D +
Sbjct: 107 KAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISE 166
Query: 217 KLKKEKLKKK 226
+ +K K
Sbjct: 167 LTVGQAVKVK 176
>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
recombination, and repair].
Length = 1041
Score = 31.2 bits (71), Expect = 1.3
Identities = 17/88 (19%), Positives = 27/88 (30%), Gaps = 12/88 (13%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK---------EDKEKPVEKIRKEEKDS 205
KN E KEK + + EE+ K D K ++ + K
Sbjct: 5 KNFRVPSEYGFVSTKEKRVERKAATAEEESLKNIDISDLFERSVDMNKDPKEATDQVKAV 64
Query: 206 EKEKKSKDKDKKLKKEKLKKKKKEKERS 233
K + + +L + KE S
Sbjct: 65 ---DKHSEIESLNVGMRLDVRLSSKELS 89
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 30.4 bits (69), Expect = 1.4
Identities = 15/50 (30%), Positives = 18/50 (36%)
Query: 39 PPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPK 88
P + P P P PP K +K PK K K + K PK
Sbjct: 87 PATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPK 136
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 30.8 bits (70), Expect = 1.4
Identities = 30/129 (23%), Positives = 56/129 (43%), Gaps = 13/129 (10%)
Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEK----- 203
K IK KD +++ + K K E K++ K+ +D K +EKI EK
Sbjct: 204 KVTIKTDAKDWRSHLEQMHQLKKAISKSLPETKQYLKKLSQDLSKALEKISSREKHLNNQ 263
Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEK---ERSSHEKEVIPKLTFKFGTDMEEKTKRES- 259
+ ++ ++ + L + + K + + E + E+ +L + +MEE+ S
Sbjct: 264 LEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNEISEELE-QVKQEMEERGASMSD 322
Query: 260 -SPKIVIKP 267
SP + IK
Sbjct: 323 GSPLVKIKQ 331
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 30.9 bits (71), Expect = 1.4
Identities = 12/46 (26%), Positives = 17/46 (36%), Gaps = 1/46 (2%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
+K K + K K +K K K++ V EK GK
Sbjct: 814 LAEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQYVASEK-DGKA 858
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 30.8 bits (70), Expect = 1.4
Identities = 15/52 (28%), Positives = 31/52 (59%), Gaps = 1/52 (1%)
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
++E+ EK +EK++++ K ++DKK + + ++KKKEK+ S +
Sbjct: 341 SLREELEKRIEKLKEKPPKPPT-KAKPERDKKERPGRYRRKKKEKKAKSERR 391
Score = 28.9 bits (65), Expect = 5.9
Identities = 11/60 (18%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
+ + ++++ +K+ + K+K K K E++K ++ + ++++KE K K ++
Sbjct: 334 SGEPDGISLREELEKRIE-KLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392
Score = 28.9 bits (65), Expect = 6.3
Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
+ D I +E+ EK+++K KEK K + K E+ KE + K ++++ SE+
Sbjct: 335 GEPDGISLREE-LEKRIEKLKEKPPKPPTKAKPER--DKKERPGRYRRKKKEKKAKSERR 391
Query: 209 KKSK 212
Sbjct: 392 GLQN 395
Score = 28.5 bits (64), Expect = 6.8
Identities = 13/53 (24%), Positives = 26/53 (49%), Gaps = 1/53 (1%)
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEK-EKKSKDKDKKLKKEKLKKKKK 228
+E+ E++ K+KE KP K + E E+ + + K +K K + + +
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
Score = 28.5 bits (64), Expect = 7.0
Identities = 11/48 (22%), Positives = 24/48 (50%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
K+ +K +K K K K ++ K++ + K++K K + +G +
Sbjct: 348 KRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRGLQN 395
>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 379
Score = 30.7 bits (69), Expect = 1.5
Identities = 13/95 (13%), Positives = 32/95 (33%), Gaps = 7/95 (7%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKM 187
+ T ++ + I ++ K + E K +K + +K+ +
Sbjct: 181 GEAPITTRRIGVDGISLSLEET-------KASEVEHLSASLKTATEKVDALEKDLHAAQA 233
Query: 188 KEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
K D + + K + K +E K + ++
Sbjct: 234 KLDSGQALTKEELDAKKAELSKALAALEAANAADE 268
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 31.1 bits (71), Expect = 1.5
Identities = 21/67 (31%), Positives = 36/67 (53%)
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+ E +K EE + +E + +EK E + EKEKK D+ KLK+ ++ + +KE E
Sbjct: 393 VRLEIDKPEELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEA 452
Query: 233 SSHEKEV 239
E ++
Sbjct: 453 EVDEDDI 459
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 30.8 bits (69), Expect = 1.5
Identities = 23/120 (19%), Positives = 55/120 (45%), Gaps = 3/120 (2%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKN---KEKDKEKKVDKEKEKG 172
E L E D ++D+K + + + + + K +D ++K K ++ ++ KE+
Sbjct: 180 EPLDEDYPDDMEERDRKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQE 239
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
K+ +K +K E+E + ++ + + +E E ++ +KK+ + KK +
Sbjct: 240 KEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKKALKM 299
>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
[Transcription].
Length = 507
Score = 30.8 bits (69), Expect = 1.6
Identities = 26/166 (15%), Positives = 59/166 (35%), Gaps = 6/166 (3%)
Query: 59 PPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDL 118
P ++ D K K+ +K+ + N TS+
Sbjct: 135 PKKIRLGSITENDMNLKTFKRHRVLGKPSSAKKPAKISPPTAMTDSLDRNFSSETSTSRE 194
Query: 119 GEIKKDKSIKKDKKTHKK-KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
+ ++ I K K KK ++ E K D+++ + + + ++ + EK K +K
Sbjct: 195 ADENENYVISKVKDIPKKVRDGESAKYFIDEENFTMAELCKPNFPIQISENFEKSKMAKK 254
Query: 178 EKKEE----KEFKMK-EDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
K E+ K + + + K + + ++++S+ K
Sbjct: 255 AKLEKRRHVKFLEGSNTHEMDQLLKHFLDNSNFRQDRRSRKKKASA 300
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 30.4 bits (69), Expect = 1.6
Identities = 27/81 (33%), Positives = 48/81 (59%), Gaps = 5/81 (6%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKE-EKDSEKEKKSKDKD 215
+ ++ + EK+K K +K++K KE K ++++EK ++K+ + E E KK K K
Sbjct: 159 ISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKK 218
Query: 216 KKLKKEK----LKKKKKEKER 232
KK+ K+K + K KKE++R
Sbjct: 219 KKIVKDKDGKVVYKWKKERKR 239
>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein. This family consists of several
eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
expression of functional F1-ATPase requires two proteins
encoded by the ATP11 and ATP12 genes. Atp11p is a
molecular chaperone of the mitochondrial matrix that
participates in the biogenesis pathway to form F1, the
catalytic unit of the ATP synthase.
Length = 250
Score = 30.4 bits (69), Expect = 1.7
Identities = 28/113 (24%), Positives = 48/113 (42%), Gaps = 15/113 (13%)
Query: 149 KDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE------EKEFKMKEDKEKPVEKIRKEE 202
K + K+ K K+ + + +K K++ EKKE + K K ++ + +
Sbjct: 3 KYRSKLLQKAKESGLEFIERLKKALKDKIEKKEFSAKKPPTGPSKQASKFKTLKPPKPAD 62
Query: 203 KDSEKEKKSKDKDKKLKKEKLKKK-KKEKE---RSSHEKE-----VIPKLTFK 246
K +K K L EK+K +E E R+ H K+ VIP T++
Sbjct: 63 KKKPFDKPFKPLSSILDVEKIKDLSAEEIEFIWRAYHAKKDSLCAVIPAETYE 115
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 30.0 bits (68), Expect = 1.7
Identities = 40/130 (30%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
+ +DL KK K K D K KK + + KK ++ + + +EK+K K + E K
Sbjct: 77 AKKDLKLKKKKKKKKTDYKQLLKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVK 136
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK---LKKEKLKKKKKEK 230
++ EK +K K KE ++K +K KE K+ ++KK++ + K+ LKK K KK K+K
Sbjct: 137 VKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKK 196
Query: 231 ERSSHEKEVI 240
+++ + ++
Sbjct: 197 KKAKKKGRIL 206
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 30.5 bits (69), Expect = 1.7
Identities = 17/90 (18%), Positives = 33/90 (36%), Gaps = 7/90 (7%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
I +I K+ KK+ EK KK+K K D++ ++ V E
Sbjct: 50 EWITADLINLGAAISIPKR---KKQTEKGKKEKKPKVSDRMDLDNEN----VQLEMLYSI 102
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
E+E ++ + K +++ +
Sbjct: 103 SNEREIRQLRFGGSKVQNPHEGARVKNINE 132
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 30.4 bits (69), Expect = 1.7
Identities = 16/53 (30%), Positives = 27/53 (50%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
KK H K KK KK K K+ +K+ +K K+++ K +K ++ E+
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRER 294
Score = 28.5 bits (64), Expect = 6.5
Identities = 28/105 (26%), Positives = 51/105 (48%), Gaps = 22/105 (20%)
Query: 132 KTHKKKEKEKIKKKKDKKDKDKIKN-------KEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
K K+++ +K KK+ K K + K+ + + K K KK++K+K K+
Sbjct: 210 KQLKEEDPDKYKKQFSKYIKAGVGPDSLEDMYKKAHAAIRANPSKVKKKKKKKKKVVHKK 269
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKK 228
+K K K + K++K++ K KK + +E+L+KK K
Sbjct: 270 YKTK--------------KLTGKQRKARVKAKKAQRRERLQKKIK 300
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 30.4 bits (69), Expect = 1.7
Identities = 25/107 (23%), Positives = 58/107 (54%), Gaps = 3/107 (2%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKE---KKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
K+++++I K + K+ + KEK E + + + E EKK++ E++
Sbjct: 75 KEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTPEEER 134
Query: 193 KPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
+ V+KI++ K+ E KK+ ++++KLK+ K + + +K+ +++
Sbjct: 135 ELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKI 181
Score = 29.6 bits (67), Expect = 3.1
Identities = 21/106 (19%), Positives = 59/106 (55%), Gaps = 4/106 (3%)
Query: 130 DKKTHKKKEKEKIKKKKDK--KDKDKIKNKEKDKEKKVDKEKEKGK--KEEKEKKEEKEF 185
K+ K+E E++K+K+D+ K+ ++ K + KV + +EK + +EE+++ E+
Sbjct: 13 LKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEEVQ 72
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
++KE +++ K+++ K+ + K+ +++ + +++ +
Sbjct: 73 ELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIER 118
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 30.6 bits (69), Expect = 1.8
Identities = 21/87 (24%), Positives = 36/87 (41%), Gaps = 2/87 (2%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
K+ ++ +KG K++K K + E E K+ K + +KK +DK
Sbjct: 402 KSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGK--KNQKKGRDK 459
Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIP 241
K+ + KKE +S + IP
Sbjct: 460 SSKVPSDSKAGGKKESVKSQEDNNNIP 486
Score = 29.4 bits (66), Expect = 4.7
Identities = 28/114 (24%), Positives = 42/114 (36%), Gaps = 10/114 (8%)
Query: 113 TSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG 172
SS L + + + KKKK K K E + + D K KG
Sbjct: 391 ASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPK-KG 449
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
KK +K K +K K+ D + +K E K +D + +E + KK
Sbjct: 450 KKNQK-KGRDKSSKVPSDSKAGGKK--------ESVKSQEDNNNIPPEEWVMKK 494
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 30.6 bits (69), Expect = 1.9
Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 13/140 (9%)
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKK--DKKDKDKIKNKEKDKEKKVDKEKEKGKK 174
DL +IK + KDK ++ K + K K D K +D E + +
Sbjct: 530 DLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIETNRSRS 589
Query: 175 EEKEKK--------EEKEFKMKEDKEKPVEKIRK--EEKDSEKEKKSKDKDKKLKKEKLK 224
E +K+ +E E +DK + IR+ E ++ K ++ ++ K+ EKL+
Sbjct: 590 NEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLR 649
Query: 225 KKKKE-KERSSHEKEVIPKL 243
K K++ + +IP L
Sbjct: 650 GKIDNYKKQIAEIDSIIPDL 669
Score = 30.3 bits (68), Expect = 2.5
Identities = 35/164 (21%), Positives = 60/164 (36%), Gaps = 14/164 (8%)
Query: 62 KMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKV-----TSSE 116
+M +S D K+ + + + E+ Y K+KDV + + NI + +S+
Sbjct: 139 EMDSLIS-GDPAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNL 197
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
+L IKK I D+K+H KE ++ + N D E E+
Sbjct: 198 ELENIKK--QIADDEKSHSITLKEI------ERLSIEYNNAMDDYNNLKSALNELSSLED 249
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
+ + E E K E + K+ E+ D K
Sbjct: 250 MKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKN 293
>gnl|CDD|165431 PHA03160, PHA03160, hypothetical protein; Provisional.
Length = 499
Score = 30.4 bits (68), Expect = 1.9
Identities = 14/53 (26%), Positives = 21/53 (39%)
Query: 25 PFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKE 77
P +A +P P +Q + P V PP PP + +V P + E
Sbjct: 412 LLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQPPHVQQPRVLPSTDGASNE 464
>gnl|CDD|152940 pfam12506, DUF3713, Protein of unknown function (DUF3713). This
family of proteins is found in bacteria. Proteins in
this family are typically between 92 and 1225 amino
acids in length. There is a single completely conserved
residue S that may be functionally important.
Length = 116
Score = 29.0 bits (65), Expect = 1.9
Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 2/53 (3%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK 173
I K+K +++ T K K+K K D DK+K+K+ E K + EK K
Sbjct: 10 ITKNKVGQEETNTSSDKSTNKLKTKDA--DVDKLKDKKTKVETKAKELVEKLK 60
>gnl|CDD|184294 PRK13743, PRK13743, conjugal transfer protein TrbF; Provisional.
Length = 141
Score = 29.4 bits (66), Expect = 1.9
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
K ++ MKE K KIR+ ++D + E++ D
Sbjct: 4 YKYQRPLSMKEKKNDAELKIRQTKRDYDYERRVSDI 39
>gnl|CDD|222959 PHA02999, PHA02999, Hypothetical protein; Provisional.
Length = 382
Score = 30.5 bits (69), Expect = 2.0
Identities = 19/72 (26%), Positives = 31/72 (43%), Gaps = 6/72 (8%)
Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKS 211
D + N K + + + KK+ K KK+ K+ + D IR D E KK
Sbjct: 82 DSVVNLTKKQISSEENTMKMDKKKYKLKKKIKKVREINDT------IRFAHYDYETIKKG 135
Query: 212 KDKDKKLKKEKL 223
K KK+ ++ +
Sbjct: 136 YIKFKKMIRKDI 147
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 30.4 bits (68), Expect = 2.0
Identities = 25/123 (20%), Positives = 57/123 (46%), Gaps = 1/123 (0%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
E+ + + + THK K E + + KE +K K+ +E +E
Sbjct: 159 EEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEE 218
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
K+K+EE+ ++E++++ ++ + K E+E+K + K++ ++ +K++K
Sbjct: 219 LKKKREERRKVLEEEEQRRKQE-EADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDG 277
Query: 236 EKE 238
E
Sbjct: 278 LSE 280
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 30.5 bits (68), Expect = 2.0
Identities = 18/70 (25%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 158 EKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
E ++ ++ K + +++E+KE E + KE+ + K KE + ++ D K+
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRA-KEEEIGKKEKEREQRIRKTI-HDNYKE 1067
Query: 218 LKKEKLKKKK 227
+ K++LKKK+
Sbjct: 1068 MAKKRLKKKR 1077
Score = 29.7 bits (66), Expect = 4.0
Identities = 35/173 (20%), Positives = 72/173 (41%), Gaps = 7/173 (4%)
Query: 66 KVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDK 125
K+ + SP+E + + S K E V E V + + S + ++ + +
Sbjct: 503 KIFYDESLSPEECIEEYKGESAKSSESDLVVQDEPEDFFDVSKVANESISSNHEKLMESE 562
Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF 185
+ KK + + K + KD + + +E+ + ++ EKG E+ E +E
Sbjct: 563 FEELKKKW------SSLAQLKSRFQKDATLDSIEGEEE-LIQDDEKGNFEDLEDEENSSD 615
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
E+ EE E + +++ ++ KKE+L+ + +ER EK+
Sbjct: 616 NEMEESRGSSVTAENEESADEVDYETEREENARKKEELRGNFELEERGDPEKK 668
Score = 28.9 bits (64), Expect = 6.8
Identities = 14/54 (25%), Positives = 29/54 (53%)
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
E +E+ E K + KE+ ++ KE +S + K ++ KK K+ + + +K +
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHD 1063
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 30.5 bits (69), Expect = 2.1
Identities = 33/186 (17%), Positives = 64/186 (34%), Gaps = 23/186 (12%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
S + +S K+DK ++K K NK+ + K +
Sbjct: 150 NKNSDSSIKNDTDTQSSKQDKADNQKAP-------SSNNTKPSTSNKQPNSPKPTQPNQS 202
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
+ + +K K + + DS ++ S+D KK +K+ + KK+K
Sbjct: 203 NSQPASDDTANQK------SSSKDNQSMSDSALDSILDQYSEDA-KKTQKDYASQSKKDK 255
Query: 231 ERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKS-------PSPPPVAAADEYD 283
+S+ K P+L + + K + + +S PS + ++
Sbjct: 256 TETSNTKN--PQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDSGSFN 313
Query: 284 TGDSKQ 289
DSK
Sbjct: 314 VVDSKD 319
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 30.4 bits (69), Expect = 2.1
Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 8/90 (8%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
K +KK K + IK K KE K++ ++++ K
Sbjct: 336 LKIANDWARKKVKTAYEAIKENRKRKELA--LAKKQSSTTKRKENLPKWS------NPEY 387
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
+K +E+ ++ E ++ K+L+ K K+
Sbjct: 388 KKTTSQEEQAKLEFIKLEELKRLENGKQKR 417
Score = 28.5 bits (64), Expect = 8.2
Identities = 14/61 (22%), Positives = 31/61 (50%), Gaps = 1/61 (1%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+K+ ++ K K++ K KE + K + + K +E+ K + + E+ K + K+K
Sbjct: 358 RKRKELALAK-KQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLENGKQK 416
Query: 180 K 180
+
Sbjct: 417 R 417
>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family. This protein
is plasmid encoded and found to be essential for plasmid
replication.
Length = 279
Score = 30.0 bits (68), Expect = 2.1
Identities = 24/102 (23%), Positives = 44/102 (43%), Gaps = 13/102 (12%)
Query: 132 KTHKKKEKEKIKKKKDKKDKDKI-------KNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
+ +K++ E KK KK + + KE E+ E+ +K ++ K
Sbjct: 169 NSARKQQLEWENKKLKKKGLIPLTLDEARRRAKEFHIERAFSYRTERKAFGKKRRRARKL 228
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
K+ E IRK+ ++ ++ SK + K+ + LKKK
Sbjct: 229 AKLDEK------DIRKKILNALVKEYSKGELTKMGLKGLKKK 264
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 30.1 bits (68), Expect = 2.2
Identities = 12/70 (17%), Positives = 26/70 (37%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
+ E E +++ + + + + K + EK K+ K K + K + K +P
Sbjct: 56 PQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQP 115
Query: 195 VEKIRKEEKD 204
K +
Sbjct: 116 KPKKPPSKTA 125
Score = 29.8 bits (67), Expect = 2.3
Identities = 11/63 (17%), Positives = 22/63 (34%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
+++ K + + + ++ EKK K K K K + K K + K +
Sbjct: 64 EEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSK 123
Query: 195 VEK 197
Sbjct: 124 TAA 126
Score = 29.8 bits (67), Expect = 2.6
Identities = 16/79 (20%), Positives = 28/79 (35%), Gaps = 3/79 (3%)
Query: 156 NKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKD 215
+ + E ++ K + E + + K K EK +K + + K K K K
Sbjct: 55 EPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKP---KV 111
Query: 216 KKLKKEKLKKKKKEKERSS 234
K K K K + +
Sbjct: 112 KPQPKPKKPPSKTAAKAPA 130
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 30.6 bits (69), Expect = 2.2
Identities = 28/133 (21%), Positives = 55/133 (41%), Gaps = 6/133 (4%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDK-EKKVDKEKEK-GK 173
+ + ++K SI K EK+ + K+ K+ D ++ D+ + ++ KEK
Sbjct: 457 KSINQLKTLISIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNI 516
Query: 174 KEEKEKKEEKEF----KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
K E + +K + E +E I+ + E K + + +K + K K
Sbjct: 517 VNNNFKNIEDYYITIEGLKNEIEGLIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYI 576
Query: 230 KERSSHEKEVIPK 242
+E H K++I
Sbjct: 577 EENVDHIKDIISL 589
Score = 29.8 bits (67), Expect = 3.4
Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 16/96 (16%)
Query: 140 EKIKKKKDKKDKDKIK---NKEKDKEKKVDKEKE-----KGKKEE------KEKKEEKEF 185
IKK K+ K K++IK K + KK+D+ K K K E KEK ++ E
Sbjct: 1065 IDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEH 1124
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
K K+K +EKI K+ + + KE ++ + + E
Sbjct: 1125 YNK--KKKSLEKIYKQMEKTLKELENMNLEDITLNE 1158
Score = 29.4 bits (66), Expect = 4.6
Identities = 41/167 (24%), Positives = 67/167 (40%), Gaps = 18/167 (10%)
Query: 114 SSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKD-----KDKIKNKEKDKEKKVDKE 168
S L E+ K D+ + EKE+ ++ KD ++ K + N K + K D
Sbjct: 1963 VSNKLSELNKITC--NDESYDEILEKEEYEELKDLRNSFNQEKAETLNNLKLNKIKEDFN 2020
Query: 169 KEKGKKEEKEKK------EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
K +E EK E K+ E+K+ ++ I +D EKE +S + E
Sbjct: 2021 SYKNLLDELEKSVKTLKASENIKKIVENKKTSIDAINTNIEDIEKEIESIN---PSLDEL 2077
Query: 223 LKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVK 269
LKK K + S +I + K D + E +I + +K
Sbjct: 2078 LKKGHKIE--ISRYTSIIDNVQTKISNDSKNINDIEKKAQIYLAYIK 2122
>gnl|CDD|236277 PRK08493, PRK08493, NADH dehydrogenase subunit G; Validated.
Length = 819
Score = 30.4 bits (69), Expect = 2.3
Identities = 15/77 (19%), Positives = 36/77 (46%), Gaps = 1/77 (1%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
KS+++ K++ K+ I ++ +K ++ + ++ +++V KE K K+ E
Sbjct: 449 LKSLEEFKQSIVKEAALSILEEIREKVLEQAEQGCEN-QEEVKKEVPKKVKKIPEVDTYL 507
Query: 184 EFKMKEDKEKPVEKIRK 200
+ E+ EK+
Sbjct: 508 LLEELGINEETYEKLEA 524
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 29.9 bits (67), Expect = 2.3
Identities = 13/57 (22%), Positives = 15/57 (26%), Gaps = 13/57 (22%)
Query: 20 PFQFSPFPSVAGPGLIP-------------RLPPMLQHPLIPQPLVHPPGTPPPSKM 63
+ PF V GP L+ HP P PG P M
Sbjct: 85 SSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAPAGHPGGGGPQQFRPGAGQPPGM 141
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 30.4 bits (69), Expect = 2.3
Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 25/111 (22%)
Query: 144 KKKDKKD--KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK-----MKEDKEKPVE 196
+KK KK+ K+ I K K K +++K K + E+ ++ + +FK ++ K P
Sbjct: 171 RKKSKKEVMKEVI---AKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKP 227
Query: 197 KIRKEEKDSE---------KEKKSKDKDK------KLKKEKLKKKKKEKER 232
+ EEKD E +++++ D+ K+E + KK E ER
Sbjct: 228 PMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEEAERLKKLEAER 278
Score = 28.4 bits (64), Expect = 9.0
Identities = 29/126 (23%), Positives = 63/126 (50%), Gaps = 21/126 (16%)
Query: 131 KKTHKKKEKEKIKKKKDKK-DKDKIKNKEKDKEKKVDKE-----------KEKGKKEEKE 178
KK+ K+ KE I K K K ++ K K +++D +++D + K K
Sbjct: 172 KKSKKEVMKEVIAKSKFYKAERQKAKEEDEDLREELDDDFKDLMSLLRTVKPPPKPPMTP 231
Query: 179 KKEEKEFKMK------EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
++++ E+ + + + +P ++ + EE E K+ ++ KKL+ E+L++ + E+E
Sbjct: 232 EEKDDEYDQRVRELTFDRRAQPTDRTKTEE---ELAKEEAERLKKLEAERLRRMRGEEED 288
Query: 233 SSHEKE 238
E++
Sbjct: 289 DEEEED 294
>gnl|CDD|227923 COG5636, COG5636, Uncharacterized conserved protein, contains
Zn-ribbon-like motif [Function unknown].
Length = 284
Score = 30.1 bits (67), Expect = 2.3
Identities = 33/170 (19%), Positives = 74/170 (43%), Gaps = 5/170 (2%)
Query: 58 PPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSED 117
P K K + K +K SS PK+ ++S+ + + P K+T+
Sbjct: 70 IPLKKKKTNLQSGSKLPTFKKASSSTSNLPKKADHSR-QPIVKETDSFKPPSFKMTTEPK 128
Query: 118 LGEIKKDKSIKKDK--KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
+ + D K++ K + K + ++K D + + + ++D E K + E +
Sbjct: 129 VYRVVDDLMEDKEELRKLLRSKAQYMMRKDTDPRTIMEDELLDEDSEDKAIQRSECPPSK 188
Query: 176 EKEKKEEKEFK--MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
K+K+ K+ +KE++E + + R+++ E + + D + +K+
Sbjct: 189 TKKKRACKDCTCGLKEEEENEIVRTRQDKVVKFTEDELTEIDFTIDGKKV 238
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 30.1 bits (68), Expect = 2.3
Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 5/73 (6%)
Query: 183 KEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKERSSHEKEVIP 241
K+F + E E + + +KK + + KK K E+LK +K+ ++ EK
Sbjct: 66 KDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEK---- 121
Query: 242 KLTFKFGTDMEEK 254
+L+ T + E+
Sbjct: 122 QLSSSLQTKLTEE 134
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 30.2 bits (68), Expect = 2.4
Identities = 21/73 (28%), Positives = 41/73 (56%), Gaps = 3/73 (4%)
Query: 152 DKIKNKEKDKEKKVDK-EKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE-- 208
++ EK K+ + E+E K+ EK KEE+ + E+ ++++ ++E++ KE
Sbjct: 368 KDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELS 427
Query: 209 KKSKDKDKKLKKE 221
K+ K+ +KLK E
Sbjct: 428 KEEKELLEKLKME 440
Score = 29.4 bits (66), Expect = 3.9
Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 4/70 (5%)
Query: 175 EEKEKKEEKEFKMKEDK-EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
EE EK KE + E++ EK VEK+ KEE+ K ++ K+L++++ KE S
Sbjct: 371 EEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEE---NFLKELS 427
Query: 234 SHEKEVIPKL 243
EKE++ KL
Sbjct: 428 KEEKELLEKL 437
>gnl|CDD|233573 TIGR01781, Trep_dent_lipo, Treponema denticola clustered
lipoprotein. This model represents a family of six
predicted lipoproteins from a region of about 20
tandemly arranged genes in the Treponema denticola
genome. Two other neighboring genes share the
lipoprotein signal peptide region but do not show more
extensive homology. The function of this locus is
unknown.
Length = 412
Score = 30.3 bits (68), Expect = 2.4
Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 3/64 (4%)
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
KE + +E + EK KM+E K E + K E ++ + + K+ K + K +
Sbjct: 23 KEDKESGLREMRVEKFSKMEE---KENEFLSKPEYNTHVKSPEQIKELSNKISGVSGKGE 79
Query: 229 EKER 232
+
Sbjct: 80 SDKF 83
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 29.6 bits (67), Expect = 2.4
Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 1/72 (1%)
Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
+KK+K +++ K+ +KK +K+K K + E KE++ E + ++++ K
Sbjct: 3 LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFV-KEYRKAEREIIRLKRLAK 61
Query: 201 EEKDSEKEKKSK 212
+ + K
Sbjct: 62 KPGKFYVPAEHK 73
Score = 29.3 bits (66), Expect = 4.0
Identities = 20/88 (22%), Positives = 35/88 (39%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
K + +++ K+ + KK KK K K E ++ E+E + + KK
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPG 64
Query: 183 KEFKMKEDKEKPVEKIRKEEKDSEKEKK 210
K + E K V +I+ K +K
Sbjct: 65 KFYVPAEHKLVFVIRIKGINGIPPKPRK 92
>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Very large protein
containing McrA/HNH-nuclease related domain and a
RuvC-like nuclease domain; signature gene for type II.
Length = 799
Score = 30.5 bits (69), Expect = 2.5
Identities = 30/132 (22%), Positives = 56/132 (42%), Gaps = 25/132 (18%)
Query: 118 LGEIKKDKSIKKDKKTH--KKKEKEKI----KKKKDKKDKDKIKNKEKDKEKKVDKEKEK 171
L E+ + I++ +T K+EK+++ KK K K K +E+ + +
Sbjct: 292 LNELNNLRIIEEQGETKILSKEEKQELLDLLFKKNKLTYKQKRKLLGLKEEEIFKGLRYE 351
Query: 172 GKKEEKEKK-------------EEKEFKMKEDKEKPVEKIRK-----EEKDSEKEKKSKD 213
G K EK ++ K E EK +++I K ++++ ++
Sbjct: 352 GLKAEKNFNISLKTYHDLRKALGKEFLKDLELNEKILDEIVKILTLYKDREMIEKILELY 411
Query: 214 KDKKLKKEKLKK 225
KD L +E+LKK
Sbjct: 412 KD-LLNEEQLKK 422
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 30.2 bits (68), Expect = 2.5
Identities = 27/107 (25%), Positives = 52/107 (48%), Gaps = 6/107 (5%)
Query: 126 SIKKDKKTHKKKEKE--KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE---KK 180
S + +KK K + KI KK D K+ + N + + E K ++ E+K K
Sbjct: 31 SKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKN 90
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD-KDKKLKKEKLKKK 226
+E+ K K++ +KP +K + ++ ++ + + K +KLKK+
Sbjct: 91 KEEIEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDKLKKE 137
Score = 29.0 bits (65), Expect = 5.1
Identities = 22/103 (21%), Positives = 41/103 (39%), Gaps = 3/103 (2%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI---KNKEKDKEKKVDKEKEKGKKEEKE 178
+K IK ++ T K +K K+ + + K++E +K+ + K +E+
Sbjct: 35 EKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKEEI 94
Query: 179 KKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
+K + E K + K + + K KLKKE
Sbjct: 95 EKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDKLKKE 137
>gnl|CDD|224162 COG1241, MCM2, Predicted ATPase involved in replication control,
Cdc46/Mcm family [DNA replication, recombination, and
repair].
Length = 682
Score = 30.4 bits (69), Expect = 2.5
Identities = 15/52 (28%), Positives = 27/52 (51%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
K K K+ K +K D IK + E V++ E+ + +++ EE K+K+
Sbjct: 617 KSKSKRDKIEKVLDIIKELVERSEDPVEEIIEEAEGISEKEVEEALEKLKKK 668
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 30.4 bits (68), Expect = 2.5
Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 2/106 (1%)
Query: 127 IKKDKKTH--KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
+K+D+ + ++ KE IKK + D EK EK+V E E+ K+ E EE +
Sbjct: 175 LKEDQMEYLEPRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDEEDTKKADEDGEEPK 234
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
+ ++ ++ +K K+ K+ KE + ++K K L K KE+
Sbjct: 235 VEEVKEGDEGKKKKTKKVKEVTKEYEVQNKHKPLWTRDPKDVTKEE 280
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 29.7 bits (67), Expect = 2.5
Identities = 33/183 (18%), Positives = 70/183 (38%), Gaps = 24/183 (13%)
Query: 77 EKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKK 136
+ S+E T Y +KD E + + T+S +S+K+ K +
Sbjct: 26 TEDCSEESTDSWSDMYEGLKDSESSSNSVPSLSLSSTASSLSDSSTYSRSLKEVKLERQA 85
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE---DKEK 193
+E + K + K +K E+K +E+EK ++E + ++ + K +E K +
Sbjct: 86 QEAYENWLSA--KQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQ 143
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKK-------------------EKLKKKKKEKERSS 234
K R + E + + K +KLK++++++E
Sbjct: 144 QAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEER 203
Query: 235 HEK 237
++
Sbjct: 204 RKQ 206
>gnl|CDD|130844 TIGR01784, T_den_put_tspse, conserved hypothetical protein
(putative transposase or invertase). Several lines of
evidence suggest that members of this family (loaded as
a fragment mode model to find part-length matches) are
associated with transposition, inversion, or
recombination. Members are found in small numbers of
genomes, but in large copy numbers in many of those
species, including over 30 full length and fragmentary
members in Treponema denticola. The strongest
similarities are usually within rather than between
species. PSI-BLAST shows similarity to proteins
designated as possible transposases, DNA invertases
(resolvases), and recombinases. In the oral pathogenic
spirochete Treponema denticola, full-length members are
often found near transporters or other membrane
proteins. This family includes members of the putative
transposase family pfam04754.
Length = 270
Score = 29.7 bits (67), Expect = 2.5
Identities = 20/111 (18%), Positives = 41/111 (36%), Gaps = 4/111 (3%)
Query: 148 KKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KMKEDKEKPVEKIRKEEKDS 205
K K I + D + + + + + E++ K + KE +
Sbjct: 150 PKLKAIIDERLLDVLDRWLLLLKGKDNQSVNLIFLQIPFIQKAEEEIKTLLLSSKELELY 209
Query: 206 EKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTK 256
E+ +K D +E+ K++ E+ + E KL K G +E+ +
Sbjct: 210 EQREKYM-TDAISAEEEGKEEGIEEGILEAKLETAKKL-LKNGLTLEQVAE 258
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 30.0 bits (67), Expect = 2.6
Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 1/87 (1%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
DK K+ K H + K ++ D+++ D +K K + D+E E + E ++
Sbjct: 215 DKPTKEPVKKHSDVKDPKEDEELDEEEHDSAMDKVK-LDLFADEEDEPNAEGVGEASDKN 273
Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKK 210
++ + + E+I + EK+ K+
Sbjct: 274 LSSFEKQQIEMDEQIEELEKELVAPKE 300
>gnl|CDD|217318 pfam03011, PFEMP, PFEMP DBL domain. PfEMP1 (Plasmodium falciparum
erythrocyte membrane protein) has been identified as the
rosetting ligand of the malaria parasite P. falciparum.
Rosetting is the adhesion of infected erythrocytes with
uninfected erythrocytes in the vasculature of the
infected organ, and is associated with severe malaria.
PfEMP1 interacts with Complement Receptor One on
uninfected erythrocytes to form rosettes.
Length = 150
Score = 29.2 bits (66), Expect = 2.6
Identities = 23/106 (21%), Positives = 44/106 (41%), Gaps = 3/106 (2%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE--EKEFKMKEDKEK 193
K ++K+K + K K K + V+K EK KKE ++ KE K++K+ + +++
Sbjct: 14 IKWRKKLKHCINNGKGKCCKCICKKKCECVEKWIEKKKKEWEKIKEHFNKQYKIDDGRDE 73
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK-KKEKERSSHEKE 238
E + K + +K+K+ K +S
Sbjct: 74 YNVLESFLEDLLFQIDIKKAYGDAKELKKIKELLKCNGCNNSANSA 119
>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus. This family contains the N
terminal region of several LUC7 protein homologues and
only contains eukaryotic proteins. LUC7 has been shown
to be a U1 snRNA associated protein with a role in
splice site recognition. The family also contains human
and mouse LUC7 like (LUC7L) proteins and human cisplatin
resistance-associated overexpressed protein (CROP).
Length = 252
Score = 29.7 bits (67), Expect = 2.6
Identities = 25/86 (29%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSE---KEKKSKDKDK 216
D ++K+DK K++ + ++E+ + E+ + E+I K ++E +E K + K
Sbjct: 94 DCDRKIDKAKQRLELTQEEQTKIAADSKAEELAELDEEIGKLLAEAEALGEEGKVDEAMK 153
Query: 217 KLKK-EKLKKKKKEKERSSHEKEVIP 241
+K+ E+LK KKKE E S + P
Sbjct: 154 LMKEVEELKAKKKELEDSDEVRNAAP 179
>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein. This is a family
of proteins which regulate checkpoint kinases. In
Schizosaccharomyces pombe this protein is called Rad26
and in Saccharomyces cerevisiae it is called LCD1.
Length = 648
Score = 30.2 bits (68), Expect = 2.7
Identities = 24/105 (22%), Positives = 49/105 (46%), Gaps = 2/105 (1%)
Query: 139 KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGK-KEEKEKKE-EKEFKMKEDKEKPVE 196
++K+ + +K +++ K K + E K ++E K K+E + E E++F + E +
Sbjct: 3 RDKLDMLQQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSAN 62
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIP 241
+R E K+ D + + K++K S KE++P
Sbjct: 63 DLRTELSPPSNLLKTSDASHIADESQPNSIKQKKREISPVKELVP 107
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 30.3 bits (68), Expect = 2.7
Identities = 26/161 (16%), Positives = 65/161 (40%), Gaps = 4/161 (2%)
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDK---IKNKEKDKEKKVDKEKEKGKKEEKEKK 180
++ I K E+E+ +K+++++ + + K + ++ E+E+
Sbjct: 385 EEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEES 444
Query: 181 EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE-KLKKKKKEKERSSHEKEV 239
E+E + +E++E+ ++ +EE + E+E++ + D ++E + + E
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAE 504
Query: 240 IPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 280
+ M E + S P + P P A+
Sbjct: 505 RRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAE 545
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 30.2 bits (68), Expect = 2.7
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 211 SKDKDKKLKKEKLKKKKKEKERSSHEKEV 239
+K+K+KK+ L KK KEKE S + +
Sbjct: 331 TKEKNKKVPVMFLGKKPKEKEVESKSQCI 359
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 29.2 bits (66), Expect = 2.7
Identities = 26/137 (18%), Positives = 57/137 (41%), Gaps = 5/137 (3%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED----- 190
++E E K + ++ + KI+ + KEK+ + EK+ K + K +E+
Sbjct: 18 EEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEKQIIISNAKNEARLKVLNAREELLDSV 77
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 250
E+ E++ +D ++ K K KL + K ++E++ + +
Sbjct: 78 FEEAKERLANLSEDKDEYKDLLKDLIKQALVKLGEPKVIVRSREEDEELVKSALDEAKEE 137
Query: 251 MEEKTKRESSPKIVIKP 267
+EKT +++ I
Sbjct: 138 YKEKTGKDTVETIGDNI 154
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 29.9 bits (68), Expect = 2.8
Identities = 24/135 (17%), Positives = 59/135 (43%), Gaps = 21/135 (15%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK 193
+ KE++K+ +++ + ++ + + +K + K +K + + +K E+ K+ E KEK
Sbjct: 328 DFPELKEELKELEEELKE--LEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEK 385
Query: 194 PVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH---EKEVIPKLTFKFGTD 250
E++ + E+ + ++LK++ + K + P + G
Sbjct: 386 LSEELEELEE---------------ELKELKEELESLYSEGKISVNKTIYPGVKIHIGNK 430
Query: 251 MEEKTKRESSPKIVI 265
+ + KRE P +
Sbjct: 431 V-LRIKREIGPCTFV 444
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 29.9 bits (66), Expect = 2.8
Identities = 34/175 (19%), Positives = 79/175 (45%), Gaps = 6/175 (3%)
Query: 81 SKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKE 140
SK+ + + K ++ + E+ + K ++ + K
Sbjct: 90 SKQAEHENQINWFKSAIGLTAAAALLLAACFADDQDKKIELAQAKKEAENARDRANKSGI 149
Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRK 200
++++++ K +++K K +KE + ++EK+K E+++ K +++K+K K
Sbjct: 150 ELEQEEQKTEQEKQKT---EKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANK 206
Query: 201 E--EKDSEKEKKSKDKDKKLKKEK-LKKKKKEKERSSHEKEVIPKLTFKFGTDME 252
E + EK+K +K +K++K K+ ++ + +H + I KL K G +E
Sbjct: 207 NAIELEQEKQKTENEKQDLIKEQKDFIKEAEQNCQENHNQFFIKKLGIKAGIAIE 261
>gnl|CDD|235570 PRK05703, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 424
Score = 29.9 bits (68), Expect = 2.8
Identities = 15/88 (17%), Positives = 36/88 (40%), Gaps = 5/88 (5%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKV-----DKEKEKGKKEE 176
+ + K+K + I + +K + +D + + +KE K E
Sbjct: 54 DETPKKNPVLREEKRKPAKSILSLQALLEKRPSRTNSQDALLQAENALPEWKKELEKPSE 113
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
+++E K + +K ++++R E K+
Sbjct: 114 PKEEEPKAAAESKVVQKELDELRDELKE 141
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly related
to archaeal Holliday junction resolvase [Nucleotide
transport and metabolism].
Length = 175
Score = 29.3 bits (66), Expect = 2.8
Identities = 14/51 (27%), Positives = 34/51 (66%)
Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKE 201
+ K+++K ++ E+ + K + + E + ++E+E+K+KE EK +E+ R++
Sbjct: 28 QGKVESKARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARED 78
>gnl|CDD|191973 pfam08232, Striatin, Striatin family. Striatin is an intracellular
protein which has a caveolin-binding motif, a
coiled-coil structure, a calmodulin-binding site, and a
WD (pfam00400) repeat domain. It acts as a scaffold
protein and is involved in signalling pathways.
Length = 133
Score = 28.9 bits (65), Expect = 2.8
Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 10/77 (12%)
Query: 119 GEIKKDKSIKKDKKTH-------KKKEKEKIKKKK---DKKDKDKIKNKEKDKEKKVDKE 168
GE + +++K+D K K+E+ K+KK K D K + +E++ E+ D+E
Sbjct: 39 GERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQKPSKEEEEEELEEDDDEE 98
Query: 169 KEKGKKEEKEKKEEKEF 185
K +E E K+ + +
Sbjct: 99 SNSEKVDESELKKSRLY 115
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 30.2 bits (69), Expect = 2.9
Identities = 24/79 (30%), Positives = 40/79 (50%), Gaps = 15/79 (18%)
Query: 158 EKDKEKKVDKEKEKGKKE----EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKD 213
EK KKV ++ EK ++E E+ K +KE +D + VE++R+ K
Sbjct: 211 EKRIRKKVKEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELRE-----------KI 259
Query: 214 KDKKLKKEKLKKKKKEKER 232
+ KL KE +K +KE ++
Sbjct: 260 EKLKLPKEAKEKAEKELKK 278
>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
The proteins in this family are poorly characterized,
but an investigation has indicated that the immediate
early protein is required the down-regulation of MHC
class I expression in dendritic cells. Human herpesvirus
6 immediate early protein is also referred to as U90.
Length = 993
Score = 30.0 bits (67), Expect = 2.9
Identities = 25/128 (19%), Positives = 47/128 (36%), Gaps = 5/128 (3%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
+++ I S + H E K K ++ + ++ K+ +K
Sbjct: 600 DEIDYIASVDSGNRTNNIHMNNTNENTPFSKSGKSPPEVTPSKTFYKRDKKKDISTNRKV 659
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKD-----SEKEKKSKDKDKKLKKEKLKKKKKEK 230
+K + K K DK K ++ S + + +D +KK +LKKK K +
Sbjct: 660 KKRTAKRKTVGYKTDKSKKIKSDSLPTDTNVIVISSESEDEEDGFNIIKKSQLKKKIKSE 719
Query: 231 ERSSHEKE 238
+S E
Sbjct: 720 LKSESSSE 727
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 29.9 bits (67), Expect = 2.9
Identities = 6/82 (7%), Positives = 30/82 (36%), Gaps = 4/82 (4%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
++ + + DK + + + + +D+ + + ++ + + + + + +
Sbjct: 11 RNDTRRSDKGRERSRRRS---RSRDRSRRRRDRDYYRGRRGR-SRSRSPNRYYRPRGDRS 66
Query: 183 KEFKMKEDKEKPVEKIRKEEKD 204
+ E + + E+D
Sbjct: 67 YRRDDRRSGRNTKEPLTEAERD 88
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function prediction
only].
Length = 299
Score = 29.7 bits (66), Expect = 2.9
Identities = 26/139 (18%), Positives = 58/139 (41%), Gaps = 1/139 (0%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEK-PVEKIRKEEKDSEKEKKSKDKD 215
K KK + +K ++ +++ E+K F +K V+ I K+ + +++ + K+
Sbjct: 2 PPKKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKE 61
Query: 216 KKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPP 275
K +EK ++ +K+ R+ + + + F T + + + K + + P
Sbjct: 62 KMRMEEKRREPEKQVIRAGVDPKTVVCALFLNKTCAKGDACKFAHGKEEARKTEKPDLYS 121
Query: 276 VAAADEYDTGDSKQVWICP 294
E D K+ WI
Sbjct: 122 DVRDKEEDVPLGKRPWINT 140
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 30.0 bits (68), Expect = 2.9
Identities = 13/73 (17%), Positives = 31/73 (42%), Gaps = 1/73 (1%)
Query: 166 DKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKK 225
+ + E + EE + E+ ++ E+ K E D + + D D ++ L++
Sbjct: 109 ELQDEPRYDDAYRDLEEDD-DDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLER 167
Query: 226 KKKEKERSSHEKE 238
+++ +E E
Sbjct: 168 RRRRREWEEKRAE 180
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 28.7 bits (65), Expect = 2.9
Identities = 25/98 (25%), Positives = 44/98 (44%), Gaps = 11/98 (11%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
+ +KKE+EK K ++ ++ + ++ K + + ++E+ D
Sbjct: 17 EFEQRKKEEEKEKYLAERCPPLRLSLSRAELQELCKKLHARIDRLDEERY---------D 67
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
E+ V K KE +D +KK D K KK LKK +K
Sbjct: 68 IEEKVAKKDKEIED--LKKKVNDLRGKFKKPTLKKVRK 103
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 29.6 bits (67), Expect = 3.0
Identities = 25/96 (26%), Positives = 43/96 (44%), Gaps = 4/96 (4%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
KK E+ +K+++ K K K+ EK ++ KEK EK KK +K+ K D+
Sbjct: 130 AKKASEEARKQRELK---KF-GKQVQVEKLQERAKEKKDMLEKIKKLKKKRKGGGDEGDL 185
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
+ + + +KK + K K+K K+
Sbjct: 186 FDFLVDDAAKGGSKKKGRKGGAARGKPNAKRKAKDA 221
>gnl|CDD|183377 PRK11910, PRK11910, amidase; Provisional.
Length = 615
Score = 30.0 bits (67), Expect = 3.0
Identities = 21/104 (20%), Positives = 42/104 (40%), Gaps = 5/104 (4%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
+ T+SE+ + K KK K+K K+ K+K+ ++ K++ E+
Sbjct: 29 QTTTSENKSAVSK----KKKPTVKKEKPKQSSNNLTLGKNKENFHLEKGFGNKQLQVERI 84
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPV-EKIRKEEKDSEKEKKSKD 213
+ + K E K+K + I+ + K +K +D
Sbjct: 85 IDRIFQSSLKNRTEIKVKPKNNPQKKQNIKPVKPIPSKPEKPED 128
>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV. Bacterial DNA
topoisomerase IV, GyrA, ParC.
Length = 444
Score = 29.8 bits (68), Expect = 3.1
Identities = 20/87 (22%), Positives = 32/87 (36%), Gaps = 20/87 (22%)
Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK---------- 216
KE GK E + + + + + + KD KE K K ++
Sbjct: 343 KEYLLGKLEAERLHILEGLFIALSIIDEIIVLIRSSKDLAKEAKEKLMERFELSEIQADA 402
Query: 217 ----------KLKKEKLKKKKKEKERS 233
KL+ EKL+K+ KE E+
Sbjct: 403 ILDMRLRRLTKLEVEKLEKELKELEKE 429
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 29.6 bits (66), Expect = 3.1
Identities = 29/125 (23%), Positives = 58/125 (46%), Gaps = 1/125 (0%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
+ E +K ++D + EK ++ + D D+ ++ + VD E+E+ +E
Sbjct: 125 KSDVEEPSEKVDEEDVEREILAEKPRVTRFNIVWDNDEDNDEAPPAQPDVDNEEEERLEE 184
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
++EE++ ++ D + EE++ E E S D + + E + KK+ E
Sbjct: 185 SDGREEEEDEEVGSDSYGEGNRELNEEEEEEAE-GSDDGEDVVDYEGERIDKKQGEEEEM 243
Query: 236 EKEVI 240
E+EVI
Sbjct: 244 EEEVI 248
>gnl|CDD|173720 cd05631, STKc_GRK4, Catalytic domain of the Protein
Serine/Threonine Kinase, G protein-coupled Receptor
Kinase 4. Serine/Threonine Kinases (STKs), G
protein-coupled Receptor Kinase (GRK) subfamily, GRK4
isoform, catalytic (c) domain. STKs catalyze the
transfer of the gamma-phosphoryl group from ATP to
serine/threonine residues on protein substrates. The GRK
subfamily is part of a larger superfamily that includes
the catalytic domains of other protein STKs, protein
tyrosine kinases, RIO kinases, aminoglycoside
phosphotransferase, choline kinase, and phosphoinositide
3-kinase. GRKs phosphorylate and regulate G
protein-coupled receptors (GPCRs), the largest
superfamily of cell surface receptors which regulate
some part of nearly all physiological functions.
Phosphorylated GPCRs bind to arrestins, which prevents
further G protein signaling despite the presence of
activating ligand. There are seven types of GRKs, named
GRK1 to GRK7. GRK4 has a limited tissue distribution. It
is mainly found in the testis, but is also present in
the cerebellum and kidney. It is expressed as multiple
splice variants with different domain architectures. It
is post-translationally palmitoylated and localized in
the membrane. GRK4 polymorphisms are associated with
hypertension and salt sensitivity, as they cause
hyperphosphorylation, desensitization, and
internalization of the dopamine 1 (D1) receptor while
increasing the expression of the angiotensin II type 1
receptor. GRK4 plays a crucial role in the D1 receptor
regulation of sodium excretion and blood pressure.
Length = 285
Score = 29.6 bits (66), Expect = 3.2
Identities = 14/46 (30%), Positives = 30/46 (65%)
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
K++E+ K+EE + ++KED+E+ EK ++ K + +K+ ++L
Sbjct: 203 KRKERVKREEVDRRVKEDQEEYSEKFSEDAKSICRMLLTKNPKERL 248
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 29.8 bits (67), Expect = 3.2
Identities = 19/84 (22%), Positives = 36/84 (42%)
Query: 173 KKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+ E+E + E +FK E+ I++++ ++E + + K K L++ K K K
Sbjct: 356 SELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNS 415
Query: 233 SSHEKEVIPKLTFKFGTDMEEKTK 256
S K ++ EKT
Sbjct: 416 SFINKTENILTNSPLKDELLEKTT 439
Score = 29.0 bits (65), Expect = 5.9
Identities = 25/107 (23%), Positives = 39/107 (36%), Gaps = 1/107 (0%)
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
E+ K K I+++ + +++KI+ + D + K KN E+ E K +K K
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKN 414
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKK 217
+ E E EK E I E E E D
Sbjct: 415 SSFINKTENILTNSPLKDELLEKTTEIINI-ENPQEFEFGQIGNDII 460
>gnl|CDD|234655 PRK00133, metG, methionyl-tRNA synthetase; Reviewed.
Length = 673
Score = 29.7 bits (68), Expect = 3.2
Identities = 7/37 (18%), Positives = 13/37 (35%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKI 198
+K+++ E K+ K +E E I
Sbjct: 532 DKQIEALIEASKEAAAAKAAAAAAAAPLAEEPIAETI 568
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 29.0 bits (65), Expect = 3.3
Identities = 23/67 (34%), Positives = 33/67 (49%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
KKT KKK+ KKK KK K K K KE+ E ++ + + EE +KK + E ++
Sbjct: 48 KKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQN 107
Query: 191 KEKPVEK 197
V
Sbjct: 108 TAAEVNH 114
>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
All proteins in this family for whcih functions are
known are G-T or G-U mismatch glycosylases that function
in base excision repair. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University). Used 2pf model [DNA metabolism,
DNA replication, recombination, and repair].
Length = 328
Score = 29.6 bits (66), Expect = 3.3
Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 3/89 (3%)
Query: 107 PNIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD 166
PN+ V E+ + ++ + K K K + + + + K + K K
Sbjct: 29 PNMALVEEQEETSGVPAKAPTQEPSEEAPKFRKRKPRSNEPYRPVEPKKPSDSKKSGKST 88
Query: 167 KEKEKGKK---EEKEKKEEKEFKMKEDKE 192
K KEK +K + K K++ F + E
Sbjct: 89 KSKEKQEKITDKFKVKRKVDRFNGVSEAE 117
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 28.8 bits (65), Expect = 3.3
Identities = 15/71 (21%), Positives = 45/71 (63%), Gaps = 3/71 (4%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE--KKEEKEFKMKEDK-E 192
+KE++K++K+ ++ K K+++ ++K + ++K + ++E +K+++ + DK +
Sbjct: 59 QKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKID 118
Query: 193 KPVEKIRKEEK 203
K ++++ KE+
Sbjct: 119 KAIKEVAKEKG 129
>gnl|CDD|192488 pfam10228, DUF2228, Uncharacterized conserved protein (DUF2228).
This is a family of conserved proteins of approximately
700 residues found from worms to humans.
Length = 253
Score = 29.3 bits (66), Expect = 3.4
Identities = 10/48 (20%), Positives = 24/48 (50%)
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
K+ K K+K + ++K ++ + K + K + +K++KK+
Sbjct: 93 TKKSKEIPFKKKKISALKKIQELLTEWAKELGLSLEQKTKVMKQRKKK 140
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 29.7 bits (67), Expect = 3.5
Identities = 20/95 (21%), Positives = 42/95 (44%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
K K+ KKKK KK + K ++++ + + + ++E + K K
Sbjct: 273 VKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEE 332
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
++ +D + + S K ++L ++K KK + E
Sbjct: 333 QEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRPED 367
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 29.0 bits (65), Expect = 3.5
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
K KKKK K K+K+K K+ + +KK +
Sbjct: 82 RKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAREG 116
>gnl|CDD|188644 cd00957, Transaldolase_TalAB, Transaldolases including both TalA
and TalB. Transaldolases including both TalA and TalB.
The enzyme catalyses the reversible transfer of a
dyhydroxyacetone moiety, derived from
fructose-6-phosphate to erythrose-4-phosphate yielding
sedoheptulose-7-phosphate and
glyceraldehyde-3-phosphate. The catalytic mechanism is
similar to other class I aldolases. The enzyme is found
in the non-oxidative branch of the pentose phosphate
pathway and forms a dimer in solution.
Length = 313
Score = 29.5 bits (67), Expect = 3.5
Identities = 18/61 (29%), Positives = 27/61 (44%), Gaps = 5/61 (8%)
Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK--MKEDK---EKPVEKIRKEEKDSE 206
+++KN E+K+D K +E F+ + ED EK E IR KD+
Sbjct: 248 EELKNSTAKVERKLDPAASKALDIHPNFLDESAFRWALNEDAMAVEKLSEGIRGFAKDAV 307
Query: 207 K 207
K
Sbjct: 308 K 308
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 29.6 bits (66), Expect = 3.6
Identities = 22/112 (19%), Positives = 60/112 (53%), Gaps = 7/112 (6%)
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKE---KDKEKKVDKEKEKGKKEEKEKKE----EKEFK 186
+++ + ++K + + KI+N + + KE+++ +EKE+ + + + EK K
Sbjct: 381 EDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEK 440
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+KE+ +E+++++ E+ ++ + + K + E LK++ + + E+E
Sbjct: 441 ALAEKERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKLSERE 492
>gnl|CDD|220261 pfam09484, Cas_TM1802, CRISPR-associated protein TM1802
(cas_TM1802). Clusters of short DNA repeats with
non-homologous spacers, which are found at regular
intervals in the genomes of phylogenetically distinct
prokaryotic species, comprise a family with recognisable
features. This family is known as CRISPR (short for
Clustered, Regularly Interspaced Short Palindromic
Repeats). A number of protein families appear only in
association with these repeats and are designated Cas
(CRISPR-Associated) proteins. This minor cas protein is
found in at least five prokaryotic genomes:
Methanosarcina mazei, Sulfurihydrogenibium azorense,
Thermotoga maritima, Carboxydothermus hydrogenoformans,
and Dictyoglomus thermophilum, the first of which is
archaeal while the rest are bacterial.
Length = 584
Score = 29.6 bits (67), Expect = 3.7
Identities = 23/98 (23%), Positives = 39/98 (39%), Gaps = 17/98 (17%)
Query: 136 KKEKEKIKKKKDKKDKD-KIKNKE----KDKEKKVDKEKEKGKKEEKEKKEEK------- 183
+ +K+K K KD NK KD +K ++K EK K+ + K+ +
Sbjct: 94 ETLNKKLKSIFKKFYKDKGEINKYRYFLKDIKKVLEKNFEKIIKDLIDLKKNEGVLYTIY 153
Query: 184 -----EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
K D E+ K+ + + KKSK++
Sbjct: 154 FLKNDGKKYLSDYEEYRNKVLYKIEQRYFSKKSKEEGV 191
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 28.8 bits (65), Expect = 3.8
Identities = 20/86 (23%), Positives = 46/86 (53%), Gaps = 8/86 (9%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR--KEEKDSEKEK-KS 211
+KE+ E+ +E E+ +K E+E+ EE+E + + + E+ ++I + + + K+K
Sbjct: 2 SDKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLR 61
Query: 212 KDKD-----KKLKKEKLKKKKKEKER 232
+ K+ ++E+ + KK E+
Sbjct: 62 AQAEFENLRKRTEREREEAKKYAIEK 87
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 29.6 bits (66), Expect = 3.8
Identities = 22/75 (29%), Positives = 38/75 (50%), Gaps = 9/75 (12%)
Query: 156 NKEKDKE-----KKVDKEKEKG--KKEEKEKKEEKEFKMKEDKEKPVEKIRKE--EKDSE 206
+ E D E V E+E+G K+EE+++ E+ +++E + E E EKD E
Sbjct: 83 DNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDEDSEKDDE 142
Query: 207 KEKKSKDKDKKLKKE 221
KE ++ + +L E
Sbjct: 143 KESDAEGDENELAGE 157
>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
selection and in elongation by RNA polymerase II
[Transcription].
Length = 521
Score = 29.6 bits (66), Expect = 3.9
Identities = 26/135 (19%), Positives = 63/135 (46%), Gaps = 3/135 (2%)
Query: 130 DKKTHKKKEKEKIK---KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
++K+ ++K K + K KD+ D+ KI + + + + E+E+ + E++E
Sbjct: 57 NEKSEEEKFKNPYRLEGKFKDEADRAKIMAMTEIERESILFEREEEISKLMERRELAIRM 116
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFK 246
++ + R+ + +++K KKL + K ++++E+ S E+ +K
Sbjct: 117 EQQHRSSGCTDTRRSTRYEPLTSAAEEKKKKLLELKKTREREERLYSERHIELQRFKDYK 176
Query: 247 FGTDMEEKTKRESSP 261
+ E+ + E +P
Sbjct: 177 ELEESEQGLQEEYTP 191
>gnl|CDD|225768 COG3227, LasB, Zinc metalloprotease (elastase) [Amino acid
transport and metabolism].
Length = 507
Score = 29.7 bits (67), Expect = 3.9
Identities = 20/130 (15%), Positives = 37/130 (28%), Gaps = 6/130 (4%)
Query: 57 TPPPSKMKDKVSPKDKTSPKEKMSSKEKTSPKEK-EYSKVKDVELGVT-----PMVPNIV 110
+K +S K + KE + E + + G T +V +
Sbjct: 35 MKSQIFIKGTLSKKSAPNEKEILQFLENVNADNNLKAISTDTDPNGFTHVRYQQVVNGVP 94
Query: 111 KVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKE 170
S + K + +KKK+ K + I K+ E+ + K
Sbjct: 95 VKGSEVIVHLDKNGVVKAVNGFVNKKKDLPTNTPKISAEQAIAIARKQFGYERLTESPKA 154
Query: 171 KGKKEEKEKK 180
+ K
Sbjct: 155 ELMVRVKNGS 164
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 28.9 bits (65), Expect = 4.0
Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 9/117 (7%)
Query: 128 KKDKKTHKKKEKEKIK-----KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
K D + KK + + +K KK+K+ I+ E+ + K K EK + ++ +KK +
Sbjct: 61 KDDNEAIKKLSDQALANVDKREKLLKKEKESIEKSEE-EFKSAKKYIEKIEDKKLKKKAK 119
Query: 183 KEFKMKEDKEKPVEKIRKEEK---DSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
+ K+ +++ K +K+ K K + EKE KDK L ++L +K K +S +
Sbjct: 120 QLVKVMKERYKSYDKLYKAYKKALNLEKELYEYLKDKDLTLKELDEKIKAVNQSYEK 176
Score = 28.9 bits (65), Expect = 4.6
Identities = 24/104 (23%), Positives = 54/104 (51%), Gaps = 11/104 (10%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIK-----KKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
+++K ++++K ++ EK++ + + K D + IK VDK +E+
Sbjct: 30 EQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDK------REK 83
Query: 177 KEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKK 220
KKE++ + E++ K +K ++ +D + +KK+K K +K+
Sbjct: 84 LLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKE 127
Score = 28.5 bits (64), Expect = 6.4
Identities = 26/105 (24%), Positives = 62/105 (59%), Gaps = 3/105 (2%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE--KKEEKEF 185
K ++ +++ EK ++K ++++ K++ EK +++ D+ + GK + + K ++
Sbjct: 16 KPEESIYEQLEKAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQAL 75
Query: 186 KMKEDKEKPVEKIRKEEKDSEKE-KKSKDKDKKLKKEKLKKKKKE 229
+ +EK ++K ++ + SE+E K +K +K++ +KLKKK K+
Sbjct: 76 ANVDKREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQ 120
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 29.5 bits (66), Expect = 4.0
Identities = 11/122 (9%), Positives = 48/122 (39%), Gaps = 18/122 (14%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF-----KMKEDK 191
+++E ++++ + +D+ ++ E+ + + D+ + + + ++ +E + + D
Sbjct: 1 RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS 60
Query: 192 EKPV-----------EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVI 240
P ++ R+ + + + + + + +K+ ++ S
Sbjct: 61 RSPRSLRYSSVRRSRDRPRRRSRSVRSI--EQHRRRLRDRSPSNQWRKDDKKRSLWDIKP 118
Query: 241 PK 242
P
Sbjct: 119 PG 120
Score = 28.3 bits (63), Expect = 9.0
Identities = 11/104 (10%), Positives = 46/104 (44%), Gaps = 13/104 (12%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
+ +E ++ ++K +D+D+ + + + + + +++ + +E +ED +P
Sbjct: 1 RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDR-------HRRSRERSYRED-SRP 52
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
++ R + + +++ + + +++ + S E+
Sbjct: 53 RDRRRYDS-----RSPRSLRYSSVRRSRDRPRRRSRSVRSIEQH 91
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 29.5 bits (67), Expect = 4.1
Identities = 20/115 (17%), Positives = 45/115 (39%), Gaps = 7/115 (6%)
Query: 109 IVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKE 168
I VT L +++ + + K+ + K E + K + + + K+ EK++++
Sbjct: 698 IEAVTGEAALEYLQEQEDLLKEIASLLKVPPELLPKVER------LLEELKELEKELERL 751
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
K+K + K +E V + D ++ ++ D KK +
Sbjct: 752 KKK-LAAAELADLLSNAKAEEIGGVKVLAKEVDGADMKELREIADDLKKKLGSAV 805
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 29.4 bits (66), Expect = 4.1
Identities = 16/65 (24%), Positives = 31/65 (47%)
Query: 155 KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDK 214
K ++ + D +E K +EK E + D++ E + +++ EK++K K K
Sbjct: 304 KPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKK 363
Query: 215 DKKLK 219
+K K
Sbjct: 364 PRKRK 368
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 28.1 bits (63), Expect = 4.3
Identities = 19/81 (23%), Positives = 37/81 (45%), Gaps = 6/81 (7%)
Query: 186 KMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTF 245
K+ K K R ++ K+K +K +K K +L+ +K++ K V+P+
Sbjct: 13 KLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPED-- 70
Query: 246 KFGTDMEEKTKRESSPKIVIK 266
+E+ R+ + K V+K
Sbjct: 71 ----LEKERRLRKVAQKGVVK 87
Score = 27.7 bits (62), Expect = 6.9
Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 13/109 (11%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKK---------E 181
K K K+ I + K K K K K + EKK ++ K++ EK
Sbjct: 12 SKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDL 71
Query: 182 EKEFKMKEDKEKPVEK----IRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
EKE ++++ +K V K +R +K +EK K K ++K+E+L K+
Sbjct: 72 EKERRLRKVAQKGVVKLFNAVRAAQKKTEKAVKEAGKKARVKREELLKE 120
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 29.7 bits (67), Expect = 4.3
Identities = 29/119 (24%), Positives = 57/119 (47%), Gaps = 12/119 (10%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK--EFK-- 186
+T K +++KK+ D++ + KE K E+ + E + + E++
Sbjct: 744 ARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAF 803
Query: 187 MKED--------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
M+E +E+P I+ E +S E+ ++ + +K KL++KK E+ER + EK
Sbjct: 804 MQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEK 862
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 28.9 bits (65), Expect = 4.5
Identities = 12/42 (28%), Positives = 29/42 (69%), Gaps = 3/42 (7%)
Query: 165 VDKEK-EKGKKEEKE-KKEEKEFKMKEDKEKPVEKIRKEEKD 204
+D+EK ++ +K KE +KE +E + D +K ++K+++++ +
Sbjct: 69 IDQEKMKELQKMMKEFQKEFREAQESGDMKK-LKKLQEKQME 109
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 27.7 bits (62), Expect = 4.5
Identities = 13/31 (41%), Positives = 23/31 (74%)
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKE 208
+K +EK+ + +EDKE+ +E+I+KEE+ E
Sbjct: 50 DKVKEKKEEREEDKEELIERIKKEEETFEDV 80
>gnl|CDD|233309 TIGR01200, GLPGLI, GLPGLI family protein. This protein family was
first noted as a paralogous set in Porphyromonas
gingivalis, but it is more widely distributed among the
Bacteroidetes. The protein family is now renamed GLPGLI
after its best-conserved motif.
Length = 227
Score = 28.9 bits (65), Expect = 4.5
Identities = 21/88 (23%), Positives = 32/88 (36%), Gaps = 7/88 (7%)
Query: 121 IKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK--- 177
I K+ S +K + + KK + D + K KE K K+ K
Sbjct: 52 IGKNISKFYSLTFNKN---DSLFKKVKNEGIDANASSFKAKEIGSSYIIYKNYKKGKVTF 108
Query: 178 -EKKEEKEFKMKEDKEKPVEKIRKEEKD 204
+K K+F + K K+ KE K
Sbjct: 109 YDKIFGKKFLYTDTLPKINWKLTKETKT 136
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 29.6 bits (67), Expect = 4.5
Identities = 8/38 (21%), Positives = 19/38 (50%), Gaps = 1/38 (2%)
Query: 129 KDKKTHKKKEKEKIKKKKDK-KDKDKIKNKEKDKEKKV 165
++KK +KK +++ + + K K +K++ V
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHV 121
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 28.5 bits (64), Expect = 4.6
Identities = 27/81 (33%), Positives = 38/81 (46%)
Query: 151 KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKK 210
K++ K K K E K +K+K + E ED E E R E + KEK
Sbjct: 13 KNEPKWKSKRSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKL 72
Query: 211 SKDKDKKLKKEKLKKKKKEKE 231
+DK K K+E K+K+KE+
Sbjct: 73 RRDKLKAKKEEAEKEKEKEER 93
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 29.3 bits (67), Expect = 4.8
Identities = 32/119 (26%), Positives = 48/119 (40%), Gaps = 36/119 (30%)
Query: 129 KDKKTHKKKE----------KEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKE 178
KDK T K++ E+I++ KD N E+DK+ +KE E
Sbjct: 482 KDKGTGKEQSITITASSGLSDEEIERMV----KDAEANAEEDKK----------RKELVE 527
Query: 179 KKE---------EKEFKMKEDK--EKPVEKIRKEEKDSEKEKKSKDKDK-KLKKEKLKK 225
+ EK K DK EKI K+ ++ K +DK+ K K E+L +
Sbjct: 528 ARNQADSLIYQTEKTLKELGDKVPADEKEKIEAAIKELKEALKGEDKEAIKAKTEELTQ 586
>gnl|CDD|225405 COG2849, COG2849, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 230
Score = 28.9 bits (65), Expect = 4.8
Identities = 21/116 (18%), Positives = 41/116 (35%), Gaps = 3/116 (2%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEK 177
++ K+ D T I++K + +K + K K D++ K+ + + EE
Sbjct: 2 GDIVETSKNGVDDGVTITYYLNGAIEEKANYVNKLEGKEKLYDEDGKLLSKLDFSAGEEV 61
Query: 178 EKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+ KEE K K+ + +K + + K K + E
Sbjct: 62 KSKEE---YYDNGTLKGEYLEGKKTPFTGIVEKYYENGELEAAIYYKNGKLDGETK 114
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.1 bits (66), Expect = 4.9
Identities = 33/182 (18%), Positives = 71/182 (39%), Gaps = 32/182 (17%)
Query: 75 PKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKTH 134
P++K+ + S E + + V +V +V S+++ E K ++ +
Sbjct: 153 PEDKLEELKLESDVENVE-YISTDKGYVYVVV--VVLKELSDEVEEELKKLGFERLELEE 209
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
+ E I++ K++ ++++KE+E +E KE ++ ++ E
Sbjct: 210 EGTPSELIREIKEEL-------------EEIEKERESLLEELKELAKKYLEELLALYEY- 255
Query: 195 VEKIRKEEKDSEKEKKS-----------KDKDKKLKKEKLKKKKK----EKERSSHEKEV 239
+E + + K K+ +D+ KKLK+ K E E+E
Sbjct: 256 LEIELERAEALSKFLKTDKTFAIEGWVPEDRVKKLKELIDKATGGSAYVEFVEPDEEEEE 315
Query: 240 IP 241
+P
Sbjct: 316 VP 317
Score = 29.1 bits (66), Expect = 6.2
Identities = 35/210 (16%), Positives = 83/210 (39%), Gaps = 35/210 (16%)
Query: 74 SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 133
KE+ S E+ +K+ + + K+ S + K ++++KK
Sbjct: 35 DLKEE-LSNERLRKLRSLLTKLSEA----------LDKLRS-----YLPKLNPLREEKKK 78
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK-----MK 188
K E++ K ++ + +KI+ + K+ E+++ + + + K+ E+E + + + +
Sbjct: 79 VSVKSLEELIKDVEE-ELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLS 137
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK--------EKERSSHEKEVI 240
+ +K ++ K + E + K KE S +E +
Sbjct: 138 LLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEEL 197
Query: 241 PKLTFKFGTDMEEKTKRESSPKIVIKPVKS 270
KL F+ + + E +P +I+ +K
Sbjct: 198 KKLGFE-----RLELEEEGTPSELIREIKE 222
Score = 28.7 bits (65), Expect = 7.1
Identities = 23/103 (22%), Positives = 50/103 (48%), Gaps = 22/103 (21%)
Query: 142 IKKKKDKKDKDKIKNKEKDKEK---KVDK-----EKEKGKKEEKEKKEEKEFKMKEDKEK 193
I+ K++ ++++ K +DK K +EEK+K K
Sbjct: 33 IEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSL-------- 84
Query: 194 PVEKIRKE-EKDSEK-EKKSKDKDKKLK--KEKLKKKKKEKER 232
E++ K+ E++ EK EK+ K+ ++++ + ++K+ ++E ER
Sbjct: 85 --EELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIER 125
>gnl|CDD|234961 PRK01581, speE, spermidine synthase; Validated.
Length = 374
Score = 29.2 bits (65), Expect = 4.9
Identities = 22/80 (27%), Positives = 40/80 (50%), Gaps = 11/80 (13%)
Query: 134 HKKKEKEKIKK----KKDKK---DKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK 186
H+K+ K KI + KKDK+ D +K + +++ E K DK+ ++ K++ + E
Sbjct: 4 HRKQSKIKIYRITSYKKDKRSELDSNKFELEQQLVENKQDKQGKQYKQDRGIQYAET--- 60
Query: 187 MKEDKEKPVEKIRKEEKDSE 206
K+D + E + DS
Sbjct: 61 -KQDNQVQSENVVIVPTDSH 79
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 29.4 bits (66), Expect = 4.9
Identities = 36/149 (24%), Positives = 65/149 (43%), Gaps = 9/149 (6%)
Query: 90 KEYSKVKDVELGVTPMVPNIVKVTSSED-LGEIKKDKSIKKDKKTHKKKE----KEKIKK 144
K Y K K V V I + ED L E+ DK KK + K K+K+KK
Sbjct: 60 KAYQKAKRVSENVKTPRFYIKTLVMLEDFLNELWADKEAKKKMSKNNAKALNTLKQKVKK 119
Query: 145 --KKDKKDKDKIK-NKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE-KIRK 200
K+ + D + + + E + E++ + E + + E ++E E+ E + +
Sbjct: 120 NNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDR 179
Query: 201 EEKDSEKEKKSKDKDKKLKKEKLKKKKKE 229
++D E+++ + K + +E KK E
Sbjct: 180 VKEDDEEDEDADLSKKDVLEEPKMFKKPE 208
>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 199 to 238 amino acids in length. This domain is
found associated with pfam06512, pfam00520. This domain
has a conserved ADD sequence motif.
Length = 222
Score = 28.6 bits (64), Expect = 5.1
Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 15/71 (21%)
Query: 164 KVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
+ KE+ + K+++ I EE DSEKE KS+ D + + L
Sbjct: 9 ESAKERRNRNDKNKKEEHS---------------IGSEEGDSEKEPKSESADGRKRCRFL 53
Query: 224 KKKKKEKERSS 234
+K ++ +S
Sbjct: 54 LRKTRKVSQSL 64
>gnl|CDD|241381 cd13227, PH-like_Vps36_GLUE, Vps36 GLUE (GRAM-like
ubiquitin-binding in Eap45) Pleckstrin homology-like
domain. ESCRT complexes form the main machinery driving
protein sorting from endosomes to lysosomes. Yeast/human
ESCRT-I consists of Vps23/Tsg101, Vps28/Vps28, and
Vps37/Vps37 homolog. Yeast/human ESCRT-II is composed of
Vps25/EAP20, Vps22/EAP30, and Vps36/EAP45. Yeast
ESCRT-III consists Vps2, Vps20, Vps24, and Snf7
subunits. In contrast, there are three human paralogs of
Snf7 (hSnf7-1/CHMP4A, hSnf7-2/CHMP4B, and
hSnf7-3/CHMP4C) and two paralogs of Vps2 (CHMP2A and
CHMP2B). Yeast ESCRT-I links directly to ESCRT-II,
through a tight interaction of Vps28 (ESCRT-I) with the
yeast-specific zinc-finger insertion within the GLUE
domain of Vps36. The Vps36 subunit (ESCRT-II) binds
ubiquitin using one of its two NZF zinc fingers in its
N-terminal region. Human Vps36, EAP45, also binds
ubiquitin despite having no NZF domain. Instead,
mammalian ESCRT-II interacts with Ub through the Eap45
GLUE domain itself. The yeast Vps36 GLUE has a complete
PH domain, wherease Eap45 GLUE only has a PH-like fold
since it lacks the secondary structure element
corresponding to the 4 strand. ESCRT-II also interacts
with ESCRT-III via a Vps25(EAP20)/Vps20(CHMP6)
interaction. Structure 2CAY is missing this insertion
that contains 2 NZF zinc fingers. It is a split PH
domain, with a noncanonical lipid binding pocket that
binds PI(3)P. The interactions of ESCRT-II GLUE domain
with membranes, ESCRT-I, and ubiquitin are critical for
ubiquitinated cargo progression from early to late
endosomes. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 126
Score = 28.0 bits (63), Expect = 5.2
Identities = 10/43 (23%), Positives = 13/43 (30%), Gaps = 2/43 (4%)
Query: 259 SSPKIVIKPVKSPSPPPVAAADEYDTGDSKQV--WICPACGVQ 299
SPKI++ + TG W CP C
Sbjct: 81 RSPKIILFLKDEINSKSGLGQSSNKTGSQNVTSQWTCPICMFT 123
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 28.9 bits (65), Expect = 5.2
Identities = 23/168 (13%), Positives = 59/168 (35%), Gaps = 7/168 (4%)
Query: 65 DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKD 124
DK+ ++ + ++K K++T + + D E + E
Sbjct: 57 DKMKAYEQEALEQKQKDKKRTLQDLADSFEKSDKESQDYSSSDSGFDEEPDEPAETAGSL 116
Query: 125 KSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKE 184
+ I+ ++ +E + K + EK+ ++V++ + + + +
Sbjct: 117 RPIRSSAAAYRDINRELGSFYEYPKTDE-----EKELLREVEELESRL--ATEPSPAPEL 169
Query: 185 FKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER 232
+ EK E K ++ + KK KK ++ + +E+
Sbjct: 170 EEQLALMEKSYELAAKYMPRGQERLPVAPESKKGKKPSVQPVRAVEEK 217
Score = 28.5 bits (64), Expect = 6.8
Identities = 14/72 (19%), Positives = 32/72 (44%), Gaps = 2/72 (2%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
E+ +E+ KG + + +++ + DK K E+ E+K +K++ +D +K
Sbjct: 31 EEDAAEEETKGFNADIPQPKDEG--IIGDKMKAYEQEALEQKQKDKKRTLQDLADSFEKS 88
Query: 222 KLKKKKKEKERS 233
+ + S
Sbjct: 89 DKESQDYSSSDS 100
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 29.3 bits (64), Expect = 5.3
Identities = 32/113 (28%), Positives = 64/113 (56%), Gaps = 5/113 (4%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
++ + ++K KK+ KKK +++I K + + K K + + ++ + E +K +E+E K+
Sbjct: 62 EQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQ 121
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
K + + KE+ K+E D +K + K K+K +K EK +K +K K+ S+
Sbjct: 122 WKAEQERIQKEQ-----EKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKAST 169
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 28.4 bits (63), Expect = 5.3
Identities = 15/37 (40%), Positives = 16/37 (43%)
Query: 25 PFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPS 61
P P GPG+ P PP P IP P PPP
Sbjct: 112 PPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPP 148
>gnl|CDD|216295 pfam01093, Clusterin, Clusterin.
Length = 434
Score = 28.9 bits (65), Expect = 5.3
Identities = 16/67 (23%), Positives = 33/67 (49%), Gaps = 5/67 (7%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE 221
EK VD+E + K+ K + E E+ + + + ++KEK+ K +E
Sbjct: 11 EKYVDEEVKNALIGVKQMKT-----LMERTEEEHKNLMSTLEKTKKEKEEALKLANEVEE 65
Query: 222 KLKKKKK 228
KL+++++
Sbjct: 66 KLEEEEE 72
>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor. The ribosome recycling
factor (RRF / ribosome release factor) dissociates the
ribosome from the mRNA after termination of translation,
and is essential bacterial growth. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis.
Length = 165
Score = 28.2 bits (64), Expect = 5.4
Identities = 16/45 (35%), Positives = 28/45 (62%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKD 160
+ L +++KDK I +D+ +KE +K+ K KK + +K KEK+
Sbjct: 119 DKLKKLEKDKEISEDEVKRAEKEIQKLTDKYIKKIDELLKKKEKE 163
>gnl|CDD|234045 TIGR02876, spore_yqfD, sporulation protein YqfD. YqfD is part of
the sigma-E regulon in the sporulation program of
endospore-forming Gram-positive bacteria. Mutation
results in a sporulation defect in Bacillus subtilis.
Members are found in all currently known
endospore-forming bacteria, including the genera
Bacillus, Symbiobacterium, Carboxydothermus,
Clostridium, and Thermoanaerobacter [Cellular processes,
Sporulation and germination].
Length = 382
Score = 28.8 bits (65), Expect = 5.5
Identities = 43/186 (23%), Positives = 78/186 (41%), Gaps = 9/186 (4%)
Query: 60 PSKMKDKVSPKDKTSPKEKMSSK---EKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSE 116
P + K P++ + K+ + + P K+ VK +L ++ ++ K +
Sbjct: 177 PKPVLKKAEPRNIVAKKDGVIKRVYVTSGEPVVKKGDVVKKGDLLISGILGKEGKTYTVH 236
Query: 117 DLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEE 176
GE+ +K + K ++E KK + I K K KK KE K EE
Sbjct: 237 AEGEVFARTWYEKTVEVPLKTQREVRTGKKIVRYYLNIGGK-KIYLKKPPKEFAKYDVEE 295
Query: 177 KEKK---EEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
K KK ++ KE E K EK +++E + +K K+ +++LKK+ +
Sbjct: 296 KTKKFGLWRWSLPIEIVKETYYEVKEKVEKVTKEE--AIEKAKEKAEKELKKELDPNAKI 353
Query: 234 SHEKEV 239
+K +
Sbjct: 354 VSDKIL 359
>gnl|CDD|223619 COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1
[Posttranslational modification, protein turnover,
chaperones].
Length = 205
Score = 28.5 bits (64), Expect = 5.5
Identities = 14/79 (17%), Positives = 29/79 (36%), Gaps = 3/79 (3%)
Query: 122 KKDKSIKKDKKTHKKKEKEK-IKKKKDKKDKDKIKNKEKDKEKKVD--KEKEKGKKEEKE 178
+ +S+ + K K ++ K+ + E+++ + ++ K
Sbjct: 20 QLGQSLAELKLRLDKDALLAGLEDALALVQKEAALEALQRAEQRLQQAAKAKRAKDAAAN 79
Query: 179 KKEEKEFKMKEDKEKPVEK 197
E K F K KEK V+
Sbjct: 80 AAEGKAFLEKNAKEKGVKT 98
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 28.6 bits (64), Expect = 5.6
Identities = 17/65 (26%), Positives = 28/65 (43%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
EI K KK + T +K K KK K+K + K+ + + +E E +E
Sbjct: 44 EIPSPKQPKKKRPTTPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEES 103
Query: 180 KEEKE 184
+ +E
Sbjct: 104 GDSQE 108
>gnl|CDD|187859 cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Protein of this family often fused to HTH domain; Some
proteins could have an additional fusion with
RecB-family nuclease domain; Core domain appears to have
a Rossmann-like fold; loosely associated with CRISPR/Cas
systems; also known as DxTHG family.
Length = 400
Score = 28.9 bits (65), Expect = 5.7
Identities = 17/99 (17%), Positives = 38/99 (38%), Gaps = 18/99 (18%)
Query: 108 NIVKVTSSEDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKI------------- 154
+ ++ + + EI K +K K+ ++ KK + +KI
Sbjct: 184 HEIESETIAPVTEISYLKEKFINKYKDNNKDLKRYFKKALNEGLEKIIKFASALYNGLPL 243
Query: 155 -----KNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
K ++ + ++ + E + E K +E+E K+K
Sbjct: 244 YLLYLKPSIEELREPLEFKLEIIEIIESNKNQEEEIKVK 282
>gnl|CDD|144261 pfam00603, Flu_PA, Influenza RNA-dependent RNA polymerase subunit
PA.
Length = 669
Score = 29.1 bits (65), Expect = 5.7
Identities = 19/105 (18%), Positives = 40/105 (38%), Gaps = 12/105 (11%)
Query: 54 PPGTPPPSKMK---DKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVT---PMVP 107
P +M+ DK +P++ +MS + K K +++++ + L P P
Sbjct: 188 PLPFSTYEQMRDYCDKFTPREIEPNVSRMSGRVKLPIKHYRWTQMRPIPLLKEPEGPPSP 247
Query: 108 NIVKVTSSEDL------GEIKKDKSIKKDKKTHKKKEKEKIKKKK 146
K+ ++ L ++ I D + E E I+ +
Sbjct: 248 YRAKLLMADALYLTICDPGTDQEGIILIDSTFFSQSEPECIRPSE 292
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 29.2 bits (66), Expect = 5.7
Identities = 16/79 (20%), Positives = 48/79 (60%)
Query: 160 DKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
D E++V++ +E+ ++ E + E + E++ + +E++ E +++ +EK+ + ++ + +
Sbjct: 486 DLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRER 545
Query: 220 KEKLKKKKKEKERSSHEKE 238
+L+ + +EK ++ E E
Sbjct: 546 AAELEAEAEEKREAAAEAE 564
>gnl|CDD|237512 PRK13800, PRK13800, putative oxidoreductase/HEAT repeat-containing
protein; Provisional.
Length = 897
Score = 29.0 bits (65), Expect = 5.8
Identities = 21/57 (36%), Positives = 27/57 (47%), Gaps = 8/57 (14%)
Query: 27 PSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPS---KMK----DKVSPKDKTSPK 76
V PG +P H LI +PL HP G P P K++ D V+P KT+ K
Sbjct: 410 AEVPAPGELPADQLAEAHELIYRPLRHPDGPPQPQVEYKLRRFVNDYVAPP-KTAAK 465
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 28.4 bits (64), Expect = 5.8
Identities = 14/62 (22%), Positives = 35/62 (56%), Gaps = 10/62 (16%)
Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
+ E+ ++EE+EK++E+E E++ + K ++ + K K K+++KE+ ++
Sbjct: 117 RFLEQLEREEEEKRDEEE----------RERLLRAAKSRSEQSRLKQKAKEMQKEEDEEM 166
Query: 227 KK 228
+
Sbjct: 167 RH 168
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 29.1 bits (66), Expect = 6.0
Identities = 38/170 (22%), Positives = 63/170 (37%), Gaps = 38/170 (22%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKG----- 172
L ++K ++ KKKEK+K KKKK K + KI K + + D E K
Sbjct: 385 LKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKI-YKVLKEATRQDLELLKNVWGEI 443
Query: 173 ----KKEEKEK---------------------KEEKEFKMKEDKEKPVEKIRKEEKDSEK 207
K + K + E F+ ++ + I
Sbjct: 444 LESLKAQRKSLRALLVNSEPVAASEDTVVLAFEYEIHFEKAMLDKELNDTIENILSKLLG 503
Query: 208 EK-----KSKDKDKKLKKEKLKKKKKEKERSSHE--KEVIPKLTFKFGTD 250
+ +D+ +K+++E L+K K E+E E +IP+ FG D
Sbjct: 504 KSVEIVAVPEDEWQKIREEFLQKHKNEEEEEEEEEELPLIPEAKELFGED 553
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 28.5 bits (64), Expect = 6.0
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 152 DKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDK 191
K+K EK +EK+ ++K ++ KK EKE + ++ K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
Score = 28.1 bits (63), Expect = 9.3
Identities = 18/48 (37%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 172 GKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK 219
K +E EK+EEK+ + E K + +K EK+ EK +++K + KLK
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEK-RQAKYSEAKLK 198
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 28.9 bits (64), Expect = 6.1
Identities = 29/131 (22%), Positives = 50/131 (38%), Gaps = 2/131 (1%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE--KIRKEEKDSEKEKKSKDK 214
K + KK KE E + EE+E++EE+E E+ E+ E D D
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDN 180
Query: 215 DKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPP 274
++ ++E + + + E + P F+ T +E P + +
Sbjct: 181 GEEGEEESVTEAEAEGTTVAGPTTTSPNGGFQPTTPPQEVYGTTDPPFGKVTTPEYQGEY 240
Query: 275 PVAAADEYDTG 285
A+EYD G
Sbjct: 241 EQTGANEYDGG 251
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 28.6 bits (64), Expect = 6.1
Identities = 28/152 (18%), Positives = 77/152 (50%), Gaps = 15/152 (9%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEK--DKEKKVDKEKEKGK--KEEKEKKEEKEFKMKEDK 191
++E+++++++ ++ ++D + +++ + E+ + +EK K +EE + E+K +++E+
Sbjct: 4 EREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEEN 63
Query: 192 EKPVEKIRKEEKDSEK-EKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKLTFKFGTD 250
+ E+ E++ E+ E + + ++ K + +++KKE E ++E+ +
Sbjct: 64 RRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELR---------E 114
Query: 251 MEEKTKRESSPKIVIKPVKSPSPPPVAAADEY 282
+E +R + +PP VAA
Sbjct: 115 AQEAHERARQELLEAAA-APTAPPHVAAPVNG 145
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 28.3 bits (63), Expect = 6.1
Identities = 18/61 (29%), Positives = 34/61 (55%), Gaps = 9/61 (14%)
Query: 126 SIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK-EKDKEKKVDKEKEKGKKEEKEKKEEKE 184
S +KD K ++ + D IKN E++KEKK + KE+ K+ +K+++ ++E
Sbjct: 68 SFEKDHKFIRRCKLSDA-------DFSLIKNHLEEEKEKKKNLNKEE-KEAKKKERAKRE 119
Query: 185 F 185
+
Sbjct: 120 Y 120
>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
mechanisms].
Length = 733
Score = 29.0 bits (65), Expect = 6.1
Identities = 18/123 (14%), Positives = 41/123 (33%), Gaps = 5/123 (4%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEK-----E 170
++ G+I ++ EK + K K ++ +I N + +VD++
Sbjct: 538 KENGKIYVYNDEERGPYLKYSYEKLEKKLKSLEELPLEIINGDSLLSDEVDEKINQDILR 597
Query: 171 KGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
+ E E K P E + +D L++++ ++ E
Sbjct: 598 IQDRILSEAIELARLLDLLHKSSPDVLNSIFEYLKFLTDAEEIRDVPLEEKEELDERIES 657
Query: 231 ERS 233
+
Sbjct: 658 LYN 660
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 29.1 bits (65), Expect = 6.2
Identities = 31/187 (16%), Positives = 53/187 (28%), Gaps = 17/187 (9%)
Query: 106 VPNIVKVTSSEDLGEIKKDKSIKKDK---KTHKKKEKEKIKKKKDKKDKDKIKNKEKDKE 162
P+ V +S L + K +I + K +K D D D+ +D E
Sbjct: 43 GPSAVDERTSGVLRDEGKHANILYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDE 102
Query: 163 KKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
+ + E + ++ + D + D +
Sbjct: 103 DDAGNATSGESSTDDDSLLELPDRDEDADTQ-------ANNDQTNDFDQDDSSNSQTDQG 155
Query: 223 LKKKKKEKERSSH--EKEVIPKLTFKFGTDMEEKTKRESSPKIVIKPVKSPSPPPVAAAD 280
LK+ EK+ + TFKF + + + KS S P D
Sbjct: 156 LKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKD-----GGKSKSSDPGPLND 210
Query: 281 EYDTGDS 287
GD
Sbjct: 211 SDGQGDD 217
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 28.9 bits (64), Expect = 6.2
Identities = 32/145 (22%), Positives = 72/145 (49%), Gaps = 17/145 (11%)
Query: 106 VPNIVKVTSSEDLGEI--------KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIK-N 156
V NI +TS + + K D+ +D TH + DK ++ +
Sbjct: 382 VTNITSITSVTSVASVASVASVPSKDDRKYPQDGATHCHAVNGHYGGRVDKDHAERARIE 441
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDK 216
KE K ++ + + K+ E+ ++EE+E + ++E+ +E+I +E + E+ ++ + +
Sbjct: 442 KENAHRKALEMKILEKKRIERLEREERE---RLERER-MERIERERLERERLERERLERD 497
Query: 217 KLKKEKLKKKKKEK----ERSSHEK 237
+L++++L + ++E+ ER EK
Sbjct: 498 RLERDRLDRLERERVDRLERDRLEK 522
>gnl|CDD|233469 TIGR01564, S_layer_MJ, S-layer protein, MJ0822 family. This model
represents one of several families of proteins
associated with the formation of prokaryotic S-layers.
Members of this family are found in archaeal species,
including Pyrococcus horikoshii (split into two tandem
reading frames), Methanococcus jannaschii, and related
species. Some local similarity can be found to other
S-layer protein families [Cell envelope, Surface
structures].
Length = 571
Score = 28.8 bits (64), Expect = 6.4
Identities = 32/188 (17%), Positives = 58/188 (30%), Gaps = 18/188 (9%)
Query: 70 KDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKK 129
KD EK S K Y +G+ ++ V ++K K +K
Sbjct: 310 KDGKVVAEKSDSDTKDGALVHVY---FGGGIGIQ-VLDAWENVGGDYGYVQLKIYKDLKT 365
Query: 130 DKKTHK----------KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+ + + + K D+D +++KE + K KE
Sbjct: 366 LELGEEYMPDWKAEFAIDDNNTVIKWDALVDEDPLEDKEIGIFLTYTGDYSDTVKSGKEV 425
Query: 180 KEEKEFKMKEDKEKPVEKIR---KEEKDSEKEKKSKDKDKKLKKE-KLKKKKKEKERSSH 235
+ +K D E +K+ ++ + DK + L + L+ K E
Sbjct: 426 GIIEYAVLKLDDEDKEDKLYIDFSMDEYKDATLTKGDKVEDLNGDINLEFKPAEAYIPVK 485
Query: 236 EKEVIPKL 243
I L
Sbjct: 486 LTAPIAVL 493
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 28.8 bits (64), Expect = 6.5
Identities = 32/190 (16%), Positives = 76/190 (40%), Gaps = 26/190 (13%)
Query: 129 KDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
K+ ++ +E+E +++ + DKD+I+ + K E+ +EE+ ++E+ E ++
Sbjct: 349 KEGQSQWAQERETLRQSAEA-DKDRIQKLSAELLKL-----EEWLQEERSQREKLEVELG 402
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLK-KEKLKKKKKEKERSSHEKEVIPKLTFKF 247
EKD + + S+++ + + + L+ +KEKE+ EK+ +
Sbjct: 403 I------------EKDCNRVQLSENRRELSELRSALRVLQKEKEQLQEEKQELLDYIRVL 450
Query: 248 GTDMEEKTKRESSPKIVIKPVKSPSPPPVAAADEYDTGD-------SKQVWICPACGVQD 300
++++ + + A D D+ D S ++ Q
Sbjct: 451 ELRLDKEADEKWAEDAATCEDAKSYLHRAAYIDLLDSIDESPLDESSSELSPSALDEEQH 510
Query: 301 DGSLPMIGCD 310
+ S P +
Sbjct: 511 NESTPKNPKE 520
>gnl|CDD|226475 COG3966, DltD, Protein involved in D-alanine esterification of
lipoteichoic acid and wall teichoic acid (D-alanine
transfer protein) [Cell envelope biogenesis, outer
membrane].
Length = 415
Score = 29.0 bits (65), Expect = 6.5
Identities = 25/114 (21%), Positives = 46/114 (40%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
+ K D ++K+ K KK+ K K +I ++ K ++ K
Sbjct: 174 QFKSDVTLKELLKKLAKKKPLTTADKSGVKLNLQILERKDLLFSATSKSGPLKHIKKALK 233
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+F KE ++ VE +K ++ K+K KK+K + K K +K +
Sbjct: 234 DLPDQFSYKELEKLAVEIGKKSTTNNPFGIKNKYYIKKIKPKYKKLKDSQKNFT 287
>gnl|CDD|224396 COG1479, COG1479, Uncharacterized conserved protein [Function
unknown].
Length = 409
Score = 28.8 bits (64), Expect = 6.5
Identities = 28/88 (31%), Positives = 40/88 (45%), Gaps = 5/88 (5%)
Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK--IRKEEK 203
K+ D +KNK K +EK++ K ++KKE +E KM + K K +R
Sbjct: 212 KELSKIDLLKNKLLMLIK---EEKKQSKDLNQDKKELEEIKMNKGKITERFKDFLRDYLT 268
Query: 204 DSEKEKKSKDKDKKLKKEKLKKKKKEKE 231
K + KKLKK K+K KE
Sbjct: 269 YKISSNIKKRELKKLKKIYEDNKEKLKE 296
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 28.8 bits (64), Expect = 6.6
Identities = 25/89 (28%), Positives = 45/89 (50%), Gaps = 4/89 (4%)
Query: 145 KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK-MKEDKEKPVEKIRKEEK 203
++++ E+D E+ +KE K E+ E++EE+E K MK+ + + + K
Sbjct: 136 PDLVLEEEEEVEMEED-EEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVK 194
Query: 204 DSEKEKKSKDK--DKKLKKEKLKKKKKEK 230
D+ K S KKL+K + K+K K+
Sbjct: 195 DNGKSFWSAASVFSKKLQKWRQKQKLKKP 223
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 27.9 bits (62), Expect = 6.6
Identities = 32/107 (29%), Positives = 60/107 (56%), Gaps = 2/107 (1%)
Query: 123 KDKSIKKDKKTHKKKEKEKIKK-KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
K K I+++ + +KE +K +K+ + ++ EK + + K + KK+ EKKE
Sbjct: 20 KVKEIQRNLTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKEDRKFEKKKKLDEKKE 79
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
+ + +E +EK + K +KE + E KK K K+++ +++KL KK K
Sbjct: 80 IAKQRKREQREKELAKRQKELEKIELSKK-KQKERERRRKKLTKKTK 125
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 28.5 bits (64), Expect = 6.6
Identities = 18/68 (26%), Positives = 40/68 (58%), Gaps = 4/68 (5%)
Query: 178 EKKEEKEFK-MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
E EE+E +++ + + E+ R E + ++ ++ +++ ++EK ++KK++KER E
Sbjct: 147 EVLEEEELAELRQQQRQ-FEQRRNAEL--AETQRLEEAERRRREEKERRKKQDKERKQRE 203
Query: 237 KEVIPKLT 244
KE K+
Sbjct: 204 KETAEKIA 211
>gnl|CDD|240406 PTZ00411, PTZ00411, transaldolase-like protein; Provisional.
Length = 333
Score = 28.6 bits (64), Expect = 6.8
Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 5/51 (9%)
Query: 162 EKKVDKEKEKGKKEEKEKKEEKEFK--MKED---KEKPVEKIRKEEKDSEK 207
E+K+D EK E+ + EKEF+ + ED EK E IR KD EK
Sbjct: 271 ERKLDPEKLTEDTEKLPELTEKEFRWELNEDAMATEKLAEGIRNFAKDLEK 321
>gnl|CDD|221283 pfam11875, DUF3395, Domain of unknown function (DUF3395). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 147 to 176 amino acids in length. This domain is
found associated with pfam00226.
Length = 144
Score = 27.7 bits (62), Expect = 6.8
Identities = 13/47 (27%), Positives = 27/47 (57%)
Query: 157 KEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEK 203
+ K+K+ ++++EK K E +KK+E E + + K+ +EE+
Sbjct: 3 NKAQKKKEQERQREKNKTEIAKKKQEAESAVSLMGDVVERKLTREEE 49
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 28.8 bits (65), Expect = 7.0
Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 11/116 (9%)
Query: 138 EKEKIKKK-KDKKDKDKIKNKEKDKEKK-VDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
E+EK ++ + KK + K+KD + + K K + + +
Sbjct: 463 EREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAA 522
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKE---------KLKKKKKEKERSSHEKEVIPK 242
+ RK + + + +K KK K KK ++ + E+EV PK
Sbjct: 523 REARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPK 578
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 27.7 bits (62), Expect = 7.0
Identities = 22/76 (28%), Positives = 51/76 (67%), Gaps = 4/76 (5%)
Query: 130 DKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEF--KM 187
+ KE KI+KK+ K++ +K+ ++ ++K++ EKE ++E+K +K+E+EF K
Sbjct: 62 KTRLKMFKESLKIEKKELKQEVEKLPRFQEQEKKRMKAEKE--EQEQKHQKQEREFLAKQ 119
Query: 188 KEDKEKPVEKIRKEEK 203
+E+ E+ +++++ E++
Sbjct: 120 EENLEEALQQLQNEKR 135
>gnl|CDD|214378 CHL00160, rpl9, ribosomal protein L9; Provisional.
Length = 153
Score = 27.8 bits (62), Expect = 7.1
Identities = 19/71 (26%), Positives = 34/71 (47%), Gaps = 3/71 (4%)
Query: 187 MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKER---SSHEKEVIPKL 243
+K+ K K ++ EK K K ++++K +KKK E + S EKE+ +
Sbjct: 49 LKQQKMYQKILDLKLKEAKEKCLKVKQLLEEIQKFSVKKKVGENNQIFGSVTEKEISQII 108
Query: 244 TFKFGTDMEEK 254
K D+E++
Sbjct: 109 KNKTNIDLEKQ 119
>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
motility and secretion].
Length = 214
Score = 28.3 bits (63), Expect = 7.3
Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 6/80 (7%)
Query: 173 KKEEKEKKEEKEFKMK----EDKEKPVEKIRKEEKDSEKEKKS--KDKDKKLKKEKLKKK 226
KK E+K EK K+ E+ ++ +E++ + K S+ E ++ ++ K+ L +K
Sbjct: 1 KKPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPERYEQTKENSLIEK 60
Query: 227 KKEKERSSHEKEVIPKLTFK 246
E EKE + K
Sbjct: 61 VDSIEEEISEKEKVMSEKLK 80
>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
This family consists of a series of primate specific
nuclear pore complex interacting protein (NPIP)
sequences. The function of this family is unknown but is
well conserved from African apes to humans.
Length = 350
Score = 28.6 bits (63), Expect = 7.3
Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 1/99 (1%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEK 179
E K I K K T K +KI K+ K K+ ++ + + +KE++ + EE K
Sbjct: 114 ETKVRAKICKMKVTTKINHHDKINGKR-KTAKEHLRKLSMKECEHAEKERQVSEAEENGK 172
Query: 180 KEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKL 218
+ KE ++ + + +R+ +D K K K L
Sbjct: 173 LDMKEIHTYKEMFQRAQALRRRAEDYHKCKIPPSARKPL 211
>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
protein 2. THO and TREX form a eukaryotic complex which
functions in messenger ribonucleoprotein metabolism and
plays a role in preventing the transcription-associated
genetic instability. Tho2, along with four other
subunits forms THO.
Length = 296
Score = 28.4 bits (64), Expect = 7.3
Identities = 16/47 (34%), Positives = 24/47 (51%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEE 182
KE E+++K+ + D +K KEKK K K +EE +K E
Sbjct: 30 DKEIERLEKQIKELDSSSSGIDKKKKEKKRLKSLIKKLEEELKKHIE 76
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 27.0 bits (60), Expect = 7.4
Identities = 16/87 (18%), Positives = 43/87 (49%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
K+ E ++ ++ K++ + K+ ++ + E+ ++E ++ +E + +E+ E+ E
Sbjct: 13 KKAEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEAREEAEEEAE 72
Query: 197 KIRKEEKDSEKEKKSKDKDKKLKKEKL 223
+I E + SK + K+ + L
Sbjct: 73 EILAEAEKEASAILSKAAEGKVVEAAL 99
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 28.2 bits (64), Expect = 7.5
Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 16/63 (25%)
Query: 192 EKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK---------EKERSSHEKEVIPK 242
EK +E++ EK+ KS DK+ K + E L+K K+ E + E +++
Sbjct: 137 EKRLERL-------EKKAKSGDKEAKEELELLEKIKEHLEEGKPARTLELTDEEIKILKS 189
Query: 243 LTF 245
L
Sbjct: 190 LQL 192
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 28.5 bits (64), Expect = 7.5
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 6/54 (11%)
Query: 136 KKEKEKIKKKKDKKD-KDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK 188
K KE K K K+D + I ++ K++ +KE E +E K + K+K
Sbjct: 58 KLNKEVAKLKIAKEDATELIAETKELKKEITEKE-----AEVQEAKAALDAKLK 106
>gnl|CDD|237796 PRK14712, PRK14712, conjugal transfer nickase/helicase TraI;
Provisional.
Length = 1623
Score = 28.7 bits (63), Expect = 7.6
Identities = 14/102 (13%), Positives = 57/102 (55%), Gaps = 6/102 (5%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEK------GKKEEKEKKEEKEFKMK 188
+++ +E I+++ +++ + ++ ++K D + E+ G++ ++ E+E +
Sbjct: 1517 QRQAEEAIRRETERRADEIVRKMAENKPDLPDGKTEQAVREIAGQERDRAAITEREAALP 1576
Query: 189 EDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
E + +++R+ ++ +E +++ ++++++ ++ +KEK
Sbjct: 1577 ESVLREPQRVREAVREVARENLLQERLQQMERDMVRDLQKEK 1618
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 28.3 bits (63), Expect = 7.6
Identities = 29/139 (20%), Positives = 69/139 (49%), Gaps = 11/139 (7%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKE 175
ED ++++ +++++T + + E K + KD+I +E+++E +K E+ K
Sbjct: 10 EDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKR 69
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE-----------KLK 224
+ E+++ + K+ E++ K +++K E D D + ++E ++K
Sbjct: 70 KAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYEAWKLRELKRIK 129
Query: 225 KKKKEKERSSHEKEVIPKL 243
+ ++E+E EK I K+
Sbjct: 130 RDREEREEMEREKAEIEKM 148
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 28.2 bits (63), Expect = 7.7
Identities = 21/66 (31%), Positives = 35/66 (53%), Gaps = 7/66 (10%)
Query: 170 EKGKKEEKEKKEEKEFKMKED------KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKL 223
E +K+EKEKKE+ + + K+ E+ ++K KE + E + D+ KK+K
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYV-EYDRAGKKKKA 217
Query: 224 KKKKKE 229
K K +E
Sbjct: 218 KSKYEE 223
>gnl|CDD|151885 pfam11446, DUF2897, Protein of unknown function (DUF2897). This is
a bacterial family of uncharacterized proteins.
Length = 55
Score = 26.1 bits (58), Expect = 7.7
Identities = 9/32 (28%), Positives = 14/32 (43%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD 166
K K K + +K K +N E+D + D
Sbjct: 24 KYSAKMKFPQFDKQKKKKAKRNAEQDAKNSAD 55
>gnl|CDD|237554 PRK13909, PRK13909, putative recombination protein RecB;
Provisional.
Length = 910
Score = 28.8 bits (65), Expect = 7.9
Identities = 26/105 (24%), Positives = 43/105 (40%), Gaps = 30/105 (28%)
Query: 130 DKKTHKKKEKEKIKKKKD-------------------KKDKDKIKNKEKDKEKKVD-KEK 169
DK + EKEK K ++ KKD+ E +D K
Sbjct: 663 DKDYARALEKEKALKYEEEINVLYVAFTRAKNSLIVVKKDESSGSMFEI-----LDLKPL 717
Query: 170 EKGKKEEKEKKEE--KEFKMKEDKEKPV---EKIRKEEKDSEKEK 209
E+G+ E KE K KE + K KP E++++ E++ +++
Sbjct: 718 ERGEIEIKEPKISPKKESLITSVKLKPHGYQEQVKEIEEEPKEDN 762
Score = 28.4 bits (64), Expect = 10.0
Identities = 14/63 (22%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Query: 127 IKKDKKTHKKKEK---EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
+KKD+ + E + +++ + + + KI K++ V K K G +E+ ++ EE+
Sbjct: 699 VKKDESSGSMFEILDLKPLERGEIEIKEPKISPKKESLITSV-KLKPHGYQEQVKEIEEE 757
Query: 184 EFK 186
+
Sbjct: 758 PKE 760
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 28.5 bits (64), Expect = 8.0
Identities = 17/68 (25%), Positives = 35/68 (51%), Gaps = 1/68 (1%)
Query: 145 KKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKD 204
++K + + K ++ K+ K K ++K+K EE + ++KE KE+ + ++ K
Sbjct: 36 DDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEE-LTELSAALKA 94
Query: 205 SEKEKKSK 212
E E + K
Sbjct: 95 LEAELQDK 102
>gnl|CDD|153319 cd07635, BAR_GRAF2, The Bin/Amphiphysin/Rvs (BAR) domain of GTPase
Regulator Associated with Focal adhesion 2. BAR domains
are dimerization, lipid binding and curvature sensing
modules found in many different proteins with diverse
functions. GTPase Regulator Associated with Focal
adhesion kinase 2 (GRAF2), also called Rho GTPase
activating protein 10 (ARHGAP10) or PS-GAP, is a GAP
with activity towards Cdc42 and RhoA which regulates
caspase-activated p21-activated protein kinase-2
(PAK-2p34). GRAF2 interacts with PAK-2p34, leading to
its stabilization and decrease of cell death. It is
highly expressed in skeletal muscle and also interacts
with PKNbeta, which is a target of Rho. GRAF2 contains
an N-terminal BAR domain, followed by a Pleckstrin
homology (PH) domain, a Rho GAP domain, and a C-terminal
SH3 domain. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
The BAR domain of the related protein GRAF directly
interacts with its Rho GAP domain and inhibits its
activity. Autoinhibited GRAF is capable of binding
membranes and tubulating liposomes, showing that the
membrane-tubulation and GAP-inhibitory functions of the
BAR domain can occur simultaneously.
Length = 207
Score = 28.0 bits (62), Expect = 8.0
Identities = 27/94 (28%), Positives = 46/94 (48%), Gaps = 8/94 (8%)
Query: 154 IKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE---KPVEKIRKEEKDSEKEKK 210
I + E D E+ +D ++ K +E++E E KP+E+ RKE+ + KE+K
Sbjct: 54 IGDAETDDERCIDASLQEFSNFLKNLEEQREIMALNVTETLIKPLERFRKEQLGAVKEEK 113
Query: 211 SK-----DKDKKLKKEKLKKKKKEKERSSHEKEV 239
K +K+ L ++ L K+KE E +V
Sbjct: 114 KKFDKETEKNYSLLEKHLNLSAKKKEPQLQEADV 147
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 28.5 bits (64), Expect = 8.1
Identities = 15/104 (14%), Positives = 43/104 (41%), Gaps = 1/104 (0%)
Query: 135 KKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKP 194
+++ + I+ + + D++ K ++ K K ++K + E+ + ++++E+
Sbjct: 321 RRRVNDVIRPLVREHNNDQLNVKLRNPSTKESKMRDKRRARLDPIDFEEVDEDEDEEEEQ 380
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+EE+ + E++ + E S +KE
Sbjct: 381 RSDEHEEEEGEDSEEEGSQSREDGSSES-SSDVGSDSESKADKE 423
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 28.8 bits (65), Expect = 8.1
Identities = 18/107 (16%), Positives = 34/107 (31%), Gaps = 9/107 (8%)
Query: 122 KKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE 181
++ K D+K + K + N + + +K KEKE + E K
Sbjct: 725 RRGKITVTDEKCPECGLPLLRVKGGFGDELGCCNNPKCNYTEKQKKEKESKSELEALKGV 784
Query: 182 EKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
K EK++ ++ + + D D K +
Sbjct: 785 GA---------KTAEKLKDAGVETVTDLTAADPDAVAAKVDGVSADR 822
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 28.3 bits (63), Expect = 8.4
Identities = 39/172 (22%), Positives = 72/172 (41%), Gaps = 13/172 (7%)
Query: 64 KDKVSPKDKTSPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKK 123
VS +++SP K PKEK+ S L + + ED+ +
Sbjct: 247 SRSVSDSEESSPPSKK-------PKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQV 299
Query: 124 DKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEK 183
K K K ++ ++ I +KK +K KE++KE+K + ++ + + K+E
Sbjct: 300 VKKPVKRKNRRGQRARQAIWEKKYGSGAKHVK-KEREKEQKEREGRQSEWEARQAKREGG 358
Query: 184 EFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKE-----KLKKKKKEK 230
+ K E + +++ ++ +K K K + K + KKK KEK
Sbjct: 359 DAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPSVDKPLHPSWEAKKKAKEK 410
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 28.7 bits (65), Expect = 8.5
Identities = 29/122 (23%), Positives = 48/122 (39%), Gaps = 27/122 (22%)
Query: 120 EIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKD--------KIKNKEKDKEKKVD----- 166
++ I + KKKEKE+ ++ + D D KN ++ +KK D
Sbjct: 149 KVGCGVKILYYHSSLKKKEKEEFLERLKEGDFDILVTTSQFLSKNFDELPKKKFDFVFVD 208
Query: 167 ---------KEKEK-----GKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSK 212
K +K G EE +K + ++K E+ EKIR+ E+ + K K
Sbjct: 209 DVDAVLKSSKNIDKLLYLLGFSEEDIEKAMELIRLKRKYEEIYEKIRELEEKIAELKDKK 268
Query: 213 DK 214
Sbjct: 269 GV 270
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins
is found in bacteria. Proteins in this family are
typically between 115 and 181 amino acids in length.
There are two completely conserved residues (F and G)
that may be functionally important.
Length = 110
Score = 27.0 bits (60), Expect = 8.8
Identities = 15/73 (20%), Positives = 23/73 (31%), Gaps = 6/73 (8%)
Query: 13 RQSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDK 72
Q+ P+Q P+ P PP Q P P P S + P
Sbjct: 12 PQNQQQQPYQQQPYHQQMPPPPYS--PPQQQQGHFMPPQPQP--YPKQSPQ--QQQPPQF 65
Query: 73 TSPKEKMSSKEKT 85
+S + + + T
Sbjct: 66 SSFLSQFKNSDGT 78
>gnl|CDD|221288 pfam11882, DUF3402, Domain of unknown function (DUF3402). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 350 to 473 amino acids in length. This domain is
found associated with pfam07923.
Length = 402
Score = 28.4 bits (64), Expect = 8.9
Identities = 21/78 (26%), Positives = 36/78 (46%)
Query: 167 KEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKK 226
+K + K K+ EE+E MK ++ E + + D + K + + + K
Sbjct: 12 SKKVRMKLATKQLWEEREKFMKYERGWDTESLVGDPLDISESVKELKLEMYTSLAEDQPK 71
Query: 227 KKEKERSSHEKEVIPKLT 244
K+E ER S + E IP+L
Sbjct: 72 KEEIERLSTDSEEIPRLD 89
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 27.9 bits (62), Expect = 8.9
Identities = 21/103 (20%), Positives = 44/103 (42%), Gaps = 1/103 (0%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPV 195
K+ ++ K+ K+ + +++KE+ +K+ + E + + K + P
Sbjct: 110 KELTDEEKEPYYKEANSDRERYQREKEE-YNKKLPNKAPIGPFIENEPKIRPKVEGPSPD 168
Query: 196 EKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKE 238
+ + +E K K D+ KK K KK KE+ S + +
Sbjct: 169 KALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPEA 211
>gnl|CDD|237972 PRK15483, PRK15483, type III restriction-modification system StyLTI
enzyme res; Provisional.
Length = 986
Score = 28.5 bits (64), Expect = 8.9
Identities = 23/111 (20%), Positives = 48/111 (43%), Gaps = 11/111 (9%)
Query: 118 LGEIKKDKSIKKDKKTHKKKEKEKIKKKKDKKDKDKIKNK---EKDKEKKVDKEKE-KGK 173
+GEI D ++ +++ ++ +KI +K K D D + + + D +K +D+ K
Sbjct: 577 VGEINSDSKVQLNEEKLTEEMIQKIVTEKQKVDPDFTELRLLEDLDDKKIIDRSNNFKED 636
Query: 174 KEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLK 224
+ + E ++ V +D++ K K +K E+LK
Sbjct: 637 GFDWLLEAYPELFQTGLRKGKV-------RDNKPASKLTVKLRKENYEELK 680
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 28.1 bits (63), Expect = 9.0
Identities = 26/101 (25%), Positives = 52/101 (51%), Gaps = 3/101 (2%)
Query: 146 KDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFK---MKEDKEKPVEKIRKEE 202
K K+++ K K E+ E D ++E+ + E+++ + E + ++ DKE EE
Sbjct: 77 KWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEE 136
Query: 203 KDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEKEVIPKL 243
+ E KK+K+ + E+ +++ E+E + EKE +L
Sbjct: 137 EKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177
>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
transcription initiation/nucleotide excision repair
factor TFIIH, subunit TFB3 [Cell division and chromosome
partitioning / Transcription / DNA replication,
recombination, and repair].
Length = 314
Score = 28.1 bits (62), Expect = 9.1
Identities = 27/117 (23%), Positives = 51/117 (43%), Gaps = 11/117 (9%)
Query: 128 KKDKKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVD--KEKEKGKKEEKEKKEEKEF 185
+ + +EK+KK ++ ++D I N + +V K+K +K K KK + E
Sbjct: 119 ELLELIDVSLTEEKVKKY-EEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLER 177
Query: 186 KMKEDKEKPVEKIRKE--------EKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSS 234
+++E++ E +KE +D K K K E+L + E R++
Sbjct: 178 QIEEEERMNKEWTKKEIGNRLGTASEDGNKTIKIGIISDKFDPEELPRIVVEPTRNN 234
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 28.2 bits (63), Expect = 9.1
Identities = 21/90 (23%), Positives = 31/90 (34%), Gaps = 7/90 (7%)
Query: 14 QSAPMFPFQFSPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKD-K 72
++ + S S I + P P PP PPP ++ S D
Sbjct: 151 TTSDLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPPPPPPP--PPPPSLQQSTSAIDLI 208
Query: 73 TSPKEKMSSKEKT----SPKEKEYSKVKDV 98
K + S+ KT PK E+ + DV
Sbjct: 209 KERKGQRSAAGKTLVLSKPKSPEFPNMLDV 238
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 28.3 bits (63), Expect = 9.2
Identities = 25/175 (14%), Positives = 64/175 (36%), Gaps = 29/175 (16%)
Query: 74 SPKEKMSSKEKTSPKEKEYSKVKDVELGVTPMVPNIVKVTSSEDLGEIKKDKSIKKDKKT 133
S +++ + +T + + D+ L PM ++ E + +
Sbjct: 611 SNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESESDGSFIEVDSVSSTL 670
Query: 134 HKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMK-EDKE 192
+ + + +++ + K E E ++E E + E +E
Sbjct: 671 ELQVPSKSQPTDESEEN-----------------AENKVASIEGEHRKEIEDLLFDESEE 713
Query: 193 KPVEKIRKEEKD-----------SEKEKKSKDKDKKLKKEKLKKKKKEKERSSHE 236
+ + +EEKD S +E ++ + + ++ LK +K++++R + E
Sbjct: 714 DNIVGMIEEEKDADDFKNEWQDISLEELEALEANLLAEQNSLKAQKQQQKRIAAE 768
>gnl|CDD|148701 pfam07246, Phlebovirus_NSM, Phlebovirus nonstructural protein NS-M.
This family consists of several Phlebovirus
nonstructural NS-M proteins which represent the
N-terminal region of the M polyprotein precursor. The
function of this family is unknown.
Length = 264
Score = 27.9 bits (61), Expect = 9.3
Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 1/67 (1%)
Query: 164 KVDKEKEKGKKEEKEKKEEKEFK-MKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEK 222
+VD EK K EEK+ +++ K M+ED + ++K +E +K D + +
Sbjct: 140 RVDYEKLKKNAEEKDATIQRKTKEMEEDSRNQIAHHEIQQKKNEIQKLRNDLKRGQEHRD 199
Query: 223 LKKKKKE 229
K + +E
Sbjct: 200 AKLRVEE 206
>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
Length = 444
Score = 28.2 bits (63), Expect = 9.3
Identities = 16/79 (20%), Positives = 32/79 (40%)
Query: 140 EKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEKIR 199
EK + K ++ + ++ +E + E + E + +E++EKP EK
Sbjct: 203 EKYPPSRGYNLKSELVEYSSEIVNEIKEEVCETDDESAYVSLDAEEEFEEEEEKPAEKST 262
Query: 200 KEEKDSEKEKKSKDKDKKL 218
+ KE + K+L
Sbjct: 263 ESTFQLSKETSIAKESKEL 281
>gnl|CDD|222859 PHA02535, P, terminase ATPase subunit; Provisional.
Length = 581
Score = 28.5 bits (64), Expect = 9.5
Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 2/68 (2%)
Query: 176 EKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSH 235
EKE K ++K + + E++ + + S ++ + K K+K K S
Sbjct: 66 EKENKTGGDYKEIDLLIRQHERLARVRRYSGTGNEADLNPNVANRNKGPKRKPVKNDIS- 124
Query: 236 EKEVIPKL 243
E KL
Sbjct: 125 -DEQTEKL 131
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 27.9 bits (62), Expect = 9.6
Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 5/79 (6%)
Query: 137 KEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVE 196
E E+ + + + +K+ E D E K EK ++E +E K +K + + E E
Sbjct: 213 GENEENDDELEAEVISYLKDGENDNEVK-----EKIRREYREWKGDKANTNETEIEDESE 267
Query: 197 KIRKEEKDSEKEKKSKDKD 215
+EE E+E + K ++
Sbjct: 268 DEYEEEAGEEQENEDKGEE 286
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 27.3 bits (61), Expect = 9.6
Identities = 13/59 (22%), Positives = 27/59 (45%)
Query: 175 EEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERS 233
+E + K K + VE +E+ ++ KE + ++L+K + + KK +K
Sbjct: 68 DEDDIKVGKPINCRVTVNHTVELTEEEKAEARKEALKAYQQEELRKIQKRSKKSKKAEP 126
>gnl|CDD|236278 PRK08506, PRK08506, replicative DNA helicase; Provisional.
Length = 472
Score = 28.4 bits (64), Expect = 9.7
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 165 VDKEKEKGKKEEKEKKEEKEFKMKEDKEKPVEK 197
V KE+E+ +KE+K KKE KE + + K +E+
Sbjct: 389 VYKEREEKEKEKKAKKEGKEERRIHFQNKSIEE 421
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 27.2 bits (61), Expect = 9.7
Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 19/100 (19%)
Query: 131 KKTHKKKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKED 190
K T K K +KK + K + KE K K ++ K E + K+EK
Sbjct: 50 KFTEAVKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEK------- 102
Query: 191 KEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEK 230
K KEKK ++K++K + ++ KKK K +
Sbjct: 103 ------------KKRSKEKKEEEKERKRQLKQQKKKAKHR 130
>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive.
Pullulan is an unusual, industrially important
polysaccharide in which short alpha-1,4 chains
(maltotriose) are connected in alpha-1,6 linkages.
Enzymes that cleave alpha-1,6 linkages in pullulan and
release maltotriose are called pullulanases although
pullulan itself may not be the natural substrate. In
contrast, a glycogen debranching enzyme such GlgX,
homologous to this family, can release glucose at
alpha,1-6 linkages from glycogen first subjected to limit
degradation by phosphorylase. Characterized members of
this family include a surface-located pullulanase from
Streptococcus pneumoniae (PMID:11083842) and an
extracellular bifunctional amylase/pullulanase with
C-terminal pullulanase activity (PMID:8798645).
Length = 1111
Score = 28.3 bits (63), Expect = 9.7
Identities = 26/81 (32%), Positives = 31/81 (38%), Gaps = 8/81 (9%)
Query: 29 VAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSK------ 82
V G + PP +H PQ P P SK KDKV PKD P S
Sbjct: 1004 VGGIEAPEKTPPPPEHE--PQAPKPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGET 1061
Query: 83 EKTSPKEKEYSKVKDVELGVT 103
K + ++KE K L T
Sbjct: 1062 PKGNEEKKEEQPDKGANLPNT 1082
>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
pre-ribosomes [Translation, ribosomal structure and
biogenesis / Intracellular trafficking and secretion].
Length = 657
Score = 28.4 bits (63), Expect = 9.7
Identities = 34/151 (22%), Positives = 57/151 (37%), Gaps = 17/151 (11%)
Query: 116 EDLGEIKKDKSIKKDKKTHKKKEKE--KIKKKKDKKDKDKIKN----KEKDK-EKKVDKE 168
EDLG+ K I KK+ +E++ +++ + ++ N K KD V+
Sbjct: 36 EDLGK----KVIYDLKKSSSDEEEQDYELRPRVSSSWNNESYNRLPIKTKDNVVADVNNG 91
Query: 169 KEKGKKEEKEKKEEKEFKMKEDKEKPVEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKK 228
+E + E E E EK ++ K+ D +K+ K +
Sbjct: 92 EEFLSESESE--ASLEIDSDIKDEKQKSLEEQKIAPEIPVKQQIDSEKERIASICTKIIE 149
Query: 229 EKERSSHEKEVIPKLTFKFGTDMEEKTKRES 259
E E + E + F T M EK K+ S
Sbjct: 150 EPEENLGMMEEV----FSMITSMAEKAKKVS 176
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 26.7 bits (59), Expect = 9.8
Identities = 10/24 (41%), Positives = 21/24 (87%)
Query: 161 KEKKVDKEKEKGKKEEKEKKEEKE 184
E+K ++++E+ KKEE++++EE+E
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 28.3 bits (63), Expect = 9.8
Identities = 17/52 (32%), Positives = 29/52 (55%)
Query: 141 KIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKEDKE 192
K KKKK K+K K KN+E K+ + E+G + +K + ++K +K+
Sbjct: 685 KKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 28.2 bits (63), Expect = 9.9
Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 3/60 (5%)
Query: 24 SPFPSVAGPGLIPRLPPMLQHPLIPQPLVHPPGTPPPSKMKDKVSPKDKTSPKEKMSSKE 83
P A P + P P V+PP T P ++ ++K P K+SS
Sbjct: 411 PSAPQSATQPA-GTPPTVSVDPPAAVP-VNPPSTAPQ-AVRPAQFKEEKKIPVSKVSSLG 467
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 28.4 bits (64), Expect = 10.0
Identities = 22/103 (21%), Positives = 59/103 (57%), Gaps = 3/103 (2%)
Query: 136 KKEKEKIKKKKDKKDKDKIKNKEKDKEKKVDKEKEKGKKEEKEKKE-EKEFKMKEDKEKP 194
+ E+E +KK+KD+ K+++++ EK+ + ++ + ++ + EK + ++KE+ E+
Sbjct: 418 EIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQV 477
Query: 195 VEKIRKEEKDSEKEKKSKDKDKKLKKEKLKKKKKEKERSSHEK 237
++ + E++ + K ++ + KL +L+K+ + E E+
Sbjct: 478 RLELEQAEREGDLAKAAELQYGKL--PELEKRLQAAEAKLGEE 518
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 27.2 bits (61), Expect = 10.0
Identities = 18/97 (18%), Positives = 42/97 (43%), Gaps = 10/97 (10%)
Query: 101 GVTPMVPNIVK-----VTSSEDLGEIKKDKSIKKDKKTHKKKE---KEKIKKKKDKKDKD 152
++P +P + + + + + +D I K+ K E +K + ++D+
Sbjct: 32 DISPFLPELRQKHKARIAKDPEFQYLNED--IAWLKEERDDKTVSLNEAERKAEREEDEK 89
Query: 153 KIKNKEKDKEKKVDKEKEKGKKEEKEKKEEKEFKMKE 189
K +E ++ K + +E K + + KEE + + E
Sbjct: 90 KRLKRENERRKALGEEPLKSLDDLPKDKEEPDAYLDE 126
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.129 0.374
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,937,580
Number of extensions: 2186480
Number of successful extensions: 21410
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11087
Number of HSP's successfully gapped: 2657
Length of query: 444
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 344
Effective length of database: 6,502,202
Effective search space: 2236757488
Effective search space used: 2236757488
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 61 (27.4 bits)