RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16785
(1642 letters)
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 50.5 bits (121), Expect = 9e-06
Identities = 78/392 (19%), Positives = 148/392 (37%), Gaps = 59/392 (15%)
Query: 851 VSDLKTRLKDVEKKL------LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE-LEHKT 903
VS K R ++ E+KL LE E++ E EKQ L +L Q A E K
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQ------LEKLERQAEKAERYQELKA 220
Query: 904 MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
E + ALL AKLKE+++E++ E+ ++++ E+++ ++ + EI L L
Sbjct: 221 --ELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEE 278
Query: 964 SLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLL 1023
E L ++ + L+ I L+++ + + + L
Sbjct: 279 LREELEELQEELLELKEEIE--------ELEGEISLLRE------------RLEELENEL 318
Query: 1024 VDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAETN 1083
+LE+++++ I+ K+ +LE ET + E + + L
Sbjct: 319 EELEERLEELKEKIEALKE--ELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE 376
Query: 1084 TNEESPADQDETNSTTTL---DKSHIQKEIANNLPKVWRILIELLNTHQEQYENN----- 1135
E + E + + +++EI L E L E+ E+
Sbjct: 377 LFEALREELAELEAELAEIRNELEELKREI--------ESLEERLERLSERLEDLKEELK 428
Query: 1136 DVNENFESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHLTNQLAKLKTLNNHLEIR 1195
++ E ++ + + +L+ L L +LA+L+ LE
Sbjct: 429 ELEAELEELQTELEEL------NEELEELEEQLEELRDRLKELERELAELQEELQRLEKE 482
Query: 1196 LNHSEQRLNTVTTELKKTWNVVNKLKLQHKQL 1227
L+ E RL+ + E + + V L+ L
Sbjct: 483 LSSLEARLDRLEAEQRASQGVRAVLEALESGL 514
Score = 47.4 bits (113), Expect = 8e-05
Identities = 54/255 (21%), Positives = 106/255 (41%), Gaps = 20/255 (7%)
Query: 34 VSDLKTRLKDVEKKL------LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE-LEHKT 86
VS K R ++ E+KL LE E++ E EKQ L +L Q A E K
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQ------LEKLERQAEKAERYQELKA 220
Query: 87 MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
E + ALL AKLKE+++E++ E+ ++++ E+++ ++ + EI L L
Sbjct: 221 --ELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEE 278
Query: 147 SLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLL 206
E L ++ + + I+ L++ + ++ + +
Sbjct: 279 LREELEELQEELLELKEEIEE--LEGEIS---LLRERLEELENELEELEERLEELKEKIE 333
Query: 207 VDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTKAVESNQTSTSKDSA 266
E+ ++ETL + ++ L +LE + + + +E + +A+ + A
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELA 393
Query: 267 NIQLDHAETNTNEES 281
I+ + E ES
Sbjct: 394 EIRNELEELKREIES 408
Score = 39.7 bits (93), Expect = 0.018
Identities = 28/118 (23%), Positives = 64/118 (54%), Gaps = 4/118 (3%)
Query: 37 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
LK +++ ++++L E E + E E+ + ++ +E E++ +A E + + E + A
Sbjct: 328 LKEKIEALKEELEER-ETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELA 386
Query: 97 LLEAKLKEIQEEI---QAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAEN 151
LEA+L EI+ E+ + ++E +E +++ E ++++ + E+ E L + L E
Sbjct: 387 ELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEEL 444
Score = 38.9 bits (91), Expect = 0.027
Identities = 23/150 (15%), Positives = 69/150 (46%), Gaps = 2/150 (1%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAEL--EHKTMIEK 90
++ +L+ L++ E++L E++ +++ +++ EL E+ E E + +E+
Sbjct: 794 ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853
Query: 91 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
+ L+ +L+E++ E + ++ KE +++ E+E+ ++ + E+ L E +
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRER 913
Query: 151 NHILNSQMTLVNNLFSNMLINPMINLDMLI 180
L +++ + + + +
Sbjct: 914 LEELEAKLERLEVELPELEEELEEEYEDTL 943
Score = 38.2 bits (89), Expect = 0.047
Identities = 23/147 (15%), Positives = 67/147 (45%), Gaps = 2/147 (1%)
Query: 853 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAEL--EHKTMIEKELA 910
+L+ L++ E++L E++ +++ +++ EL E+ E E + +E+
Sbjct: 797 ELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK 856
Query: 911 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 970
+ L+ +L+E++ E + ++ KE +++ E+E+ ++ + E+ L E +
Sbjct: 857 ELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916
Query: 971 LNSQMTLVNNLFSNMLINPMINLDMLI 997
L +++ + + + +
Sbjct: 917 LEAKLERLEVELPELEEELEEEYEDTL 943
Score = 37.4 bits (87), Expect = 0.087
Identities = 24/114 (21%), Positives = 58/114 (50%), Gaps = 9/114 (7%)
Query: 38 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNAL 97
K LK++E++L E +++ E+ + +LR L + + + E K L
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKREL 725
Query: 98 ---------LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
L+++L+E++EE++ E+ +E ++++ E+E+ ++ ++ + L E
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKE 779
Score = 37.4 bits (87), Expect = 0.087
Identities = 24/114 (21%), Positives = 58/114 (50%), Gaps = 9/114 (7%)
Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNAL 914
K LK++E++L E +++ E+ + +LR L + + + E K L
Sbjct: 666 KRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKREL 725
Query: 915 ---------LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
L+++L+E++EE++ E+ +E ++++ E+E+ ++ ++ + L E
Sbjct: 726 AALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKE 779
Score = 36.2 bits (84), Expect = 0.19
Identities = 32/135 (23%), Positives = 69/135 (51%), Gaps = 9/135 (6%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKI------EYEKQMTNYKKLRELYEQRAAAAELEHKT 86
++ LK ++++E+K EE++ E E+++ ++ E EQR E E +
Sbjct: 773 ALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEE 832
Query: 87 MIEK-ELAKNAL--LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 143
+ E+ E + L LE +L+E+++E++ E+ +E + + E+E ++ ++E L E
Sbjct: 833 LEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEE 892
Query: 144 LTSSLAENHILNSQM 158
L +E L ++
Sbjct: 893 LRELESELAELKEEI 907
Score = 34.7 bits (80), Expect = 0.61
Identities = 29/128 (22%), Positives = 62/128 (48%), Gaps = 2/128 (1%)
Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
L+ +L+ +L+++E++L E E+ E+ +L EL E+ E + E
Sbjct: 703 LLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEE 762
Query: 910 AKNAL--LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
+ L LE L +++EEI+ EK + ++++ E+E+ ++ + ++ L L S
Sbjct: 763 LEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQR 822
Query: 968 NHILNSQM 975
L ++
Sbjct: 823 RERLEQEI 830
Score = 33.9 bits (78), Expect = 0.82
Identities = 35/199 (17%), Positives = 86/199 (43%), Gaps = 15/199 (7%)
Query: 34 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELA 93
+ + L++ ++L E EE++ E E KL+E E+ + + + E E
Sbjct: 742 LEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEE 801
Query: 94 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 153
E +L ++ E+++ ++ + +++I E+E+ I+ +++++ L E L E
Sbjct: 802 LE-EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEE 860
Query: 154 LNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKI 213
L ++ + L +++ +L E+ + S L +L+++I
Sbjct: 861 LKEELEELEAEKEE-----------LEDELKELEEEKEEL--EEELRELESELAELKEEI 907
Query: 214 -DQETLAIQNDKQLKQLEN 231
+ + +L++LE
Sbjct: 908 EKLRERLEELEAKLERLEV 926
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 44.7 bits (106), Expect = 4e-04
Identities = 25/114 (21%), Positives = 66/114 (57%), Gaps = 2/114 (1%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEI-KIEYEKQMTNYKKLRELYEQRAAAAELEH-KTMIEK 90
+VS+L+ +++++K+L EI ++E +KQ+ + + A+LE ++ +++
Sbjct: 275 EVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
Query: 91 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
+ A LE KL+E++EE+++ + +E + ++ E+E ++ ++++ L +
Sbjct: 335 LAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388
Score = 44.3 bits (105), Expect = 7e-04
Identities = 25/113 (22%), Positives = 65/113 (57%), Gaps = 2/113 (1%)
Query: 851 VSDLKTRLKDVEKKLLESPEEI-KIEYEKQMTNYKKLRELYEQRAAAAELEH-KTMIEKE 908
VS+L+ +++++K+L EI ++E +KQ+ + + A+LE ++ +++
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335
Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
+ A LE KL+E++EE+++ + +E + ++ E+E ++ ++++ L +
Sbjct: 336 AEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388
Score = 37.0 bits (86), Expect = 0.12
Identities = 32/129 (24%), Positives = 59/129 (45%), Gaps = 20/129 (15%)
Query: 34 VSDLKTRLKDVEKKL------LESPEEIKIEYEKQMTN----------YKKLRELYEQRA 77
+S K R K+ E+KL L+ E+I E E+Q+ + YK+L+ +
Sbjct: 167 ISKYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELE 226
Query: 78 AAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQL----EKYKENKKQILEMEKNIQLK 133
A + + +EL + + +E EE+ A+L EK +E + ++ E+E+ I+
Sbjct: 227 LALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEEL 286
Query: 134 DDEINNLNE 142
E+ L
Sbjct: 287 QKELYALAN 295
Score = 36.6 bits (85), Expect = 0.15
Identities = 21/130 (16%), Positives = 59/130 (45%), Gaps = 2/130 (1%)
Query: 840 LATTRSVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE- 898
L + + L+ L+++ +++ +++ + +++ +L ++
Sbjct: 703 LRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE 762
Query: 899 -LEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 957
E + +E+ + A EA+++E++ +I+ E+ K ++ + E+ + L ++E NL
Sbjct: 763 IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANL 822
Query: 958 NENLTSSLAE 967
E L S
Sbjct: 823 RERLESLERR 832
Score = 35.8 bits (83), Expect = 0.27
Identities = 20/119 (16%), Positives = 57/119 (47%), Gaps = 2/119 (1%)
Query: 34 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE--LEHKTMIEKE 91
+ L+ L+++ +++ +++ + +++ +L ++ E + +E+
Sbjct: 714 LEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEA 773
Query: 92 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
+ A EA+++E++ +I+ E+ K ++ + E+ + L ++E NL E L S
Sbjct: 774 EEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERR 832
Score = 34.6 bits (80), Expect = 0.52
Identities = 22/140 (15%), Positives = 58/140 (41%), Gaps = 11/140 (7%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE----------L 82
++S L+ + + + ++L +++ E E Q+ + + + A E
Sbjct: 296 EISRLEQQKQILRERLANLERQLE-ELEAQLEELESKLDELAEELAELEEKLEELKEELE 354
Query: 83 EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
+ +E+ A+ LE++L+E++E+++ K + + QI + I+ + + L +
Sbjct: 355 SLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLED 414
Query: 143 NLTSSLAENHILNSQMTLVN 162
E L ++
Sbjct: 415 RRERLQQEIEELLKKLEEAE 434
Score = 34.6 bits (80), Expect = 0.57
Identities = 26/133 (19%), Positives = 59/133 (44%), Gaps = 12/133 (9%)
Query: 854 LKTRLKDVEKKLLESPEEI------KIEYEKQMT-NYKKLRELYEQRAAAAELEHKTMIE 906
L+ RL ++E++L E ++ E +++ +KL EL E+ + +E
Sbjct: 307 LRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEA-----ELE 361
Query: 907 KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLA 966
+ A+ LE++L+E++E+++ K + + QI + I+ + + L +
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421
Query: 967 ENHILNSQMTLVN 979
E L ++
Sbjct: 422 EIEELLKKLEEAE 434
Score = 34.3 bits (79), Expect = 0.68
Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 2/80 (2%)
Query: 67 KKLRELYEQRAAAAEL--EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQIL 124
++L EL E+ A E E +++ K L ++ E++EEI+ ++ +I
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298
Query: 125 EMEKNIQLKDDEINNLNENL 144
+E+ Q+ + + NL L
Sbjct: 299 RLEQQKQILRERLANLERQL 318
Score = 34.3 bits (79), Expect = 0.68
Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 2/80 (2%)
Query: 884 KKLRELYEQRAAAAEL--EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQIL 941
++L EL E+ A E E +++ K L ++ E++EEI+ ++ +I
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298
Query: 942 EMEKNIQLKDDEINNLNENL 961
+E+ Q+ + + NL L
Sbjct: 299 RLEQQKQILRERLANLERQL 318
Score = 33.1 bits (76), Expect = 1.7
Identities = 63/367 (17%), Positives = 134/367 (36%), Gaps = 54/367 (14%)
Query: 851 VSDLKTRLKDVEKKL------LESPEEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKT 903
+S K R K+ E+KL L+ E+I E E+Q+ + ++ E E+ + AEL
Sbjct: 167 ISKYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRE-- 224
Query: 904 MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
+E L L E + E+ +E ++++ E E +E+ L L
Sbjct: 225 -LELALLVLRLEELR------------EELEELQEELKEAE-------EELEELTAELQE 264
Query: 964 SLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLIN--NHEINDTVS 1021
+ L +++ L+ I+ +Q + + I+ +
Sbjct: 265 LEEKLEELRLEVS---------------ELEEEIEELQKELYALANEISRLEQQKQILRE 309
Query: 1022 LLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAE 1081
L +LE+++++ ++ + E +E ++ S + ++ + E
Sbjct: 310 RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEE 369
Query: 1082 TNTNEESPADQDETNST----TTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYENNDV 1137
+ E +Q ET + L + + EI ++ R+ QE E
Sbjct: 370 LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLK- 428
Query: 1138 NENFESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHLTNQLAKLKTLNNHLEIRLN 1197
+ E+ K ++ + + + Q L+ L L +L + + + E L
Sbjct: 429 -KLEEAELKELQAELEELEEELEELQE--ELERLEEALEELREELEEAEQALDAAERELA 485
Query: 1198 HSEQRLN 1204
+ RL+
Sbjct: 486 QLQARLD 492
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 43.4 bits (103), Expect = 8e-04
Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 4/113 (3%)
Query: 41 LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEA 100
K+V+ K L SP K E E ++ +L+E ++ + + + + + L +
Sbjct: 305 GKEVKAKELGSPAGTKTEIE-VGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKK 363
Query: 101 KLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 153
++ E + QLEK E K+++ E ++ ++E+ L E L S +E I
Sbjct: 364 ARGQLPPEKREQLEKLLETKEKLSEE---LEELEEELKELKEELESLYSEGKI 413
Score = 43.4 bits (103), Expect = 8e-04
Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 4/113 (3%)
Query: 858 LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEA 917
K+V+ K L SP K E E ++ +L+E ++ + + + + + L +
Sbjct: 305 GKEVKAKELGSPAGTKTEIE-VGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKK 363
Query: 918 KLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 970
++ E + QLEK E K+++ E ++ ++E+ L E L S +E I
Sbjct: 364 ARGQLPPEKREQLEKLLETKEKLSEE---LEELEEELKELKEELESLYSEGKI 413
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 42.3 bits (99), Expect = 0.003
Identities = 58/364 (15%), Positives = 116/364 (31%), Gaps = 23/364 (6%)
Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
S K + K+ KKL+E E + + +L EQ A E +
Sbjct: 164 AGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELE 223
Query: 910 AKNALLEAKLKEIQEEIQAQLEKY----KENKKQILEMEKNIQLKDDEINNLNENLTSSL 965
+N L LK +E I E +E + E+EK ++ + E
Sbjct: 224 EENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKK 283
Query: 966 AENHILNSQMTLVNNLFSNMLI---NPMINLDMLIKLIQDNYLLIIDLINN-HEINDTVS 1021
+ L L S +L + + + L + ++ L +L EI +
Sbjct: 284 LQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEK 343
Query: 1022 LLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAE 1081
L +LE K + E + ++L++ + E +S +
Sbjct: 344 ELKELEIKREAEEEEEEQLEKLQEKLE----------QLEEELLAKKKLESERLSSAAKL 393
Query: 1082 TNTNEESPADQDETNSTTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYENNDVNENF 1141
E +++E + L+ S ++++ K + I + + + E
Sbjct: 394 KE-EELELKNEEEKEAKLLLELSEQEEDLLKEEKKE-ELKIVEELEESLETKQGKLTEEK 451
Query: 1142 ESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHLTNQLAKLKTLNNHLEIRLNHSEQ 1201
E K + + ++ LK L K +L L+ + + + +
Sbjct: 452 EELEKQALKLLKDKLE---LKKSEDLLKETKLVKLLEQLELLLLRQKLEEASQKESKARE 508
Query: 1202 RLNT 1205
L
Sbjct: 509 GLAV 512
Score = 42.3 bits (99), Expect = 0.003
Identities = 38/287 (13%), Positives = 95/287 (33%), Gaps = 21/287 (7%)
Query: 34 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELA 93
S K + K+ KKL+E E + + +L EQ A E +
Sbjct: 165 GSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEE 224
Query: 94 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 153
+N L LK +E I E ++ +++I ++ ++ +++ + + + E +
Sbjct: 225 ENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKL 284
Query: 154 LNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKI 213
+ + L + + LL ++ + +L+K
Sbjct: 285 QEEE----------------LKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLE 328
Query: 214 DQETLAIQN----DKQLKQLENYETPMSSAMVEAVESNQTSTKAVESNQTSTSKDSANIQ 269
+ + +K+LK+LE + + + + E +S +
Sbjct: 329 KELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLS 388
Query: 270 LDHAETNTNEESPADQDETNSTTTLDKSHIQKEITRVISNSLISGLS 316
E +++E + L+ S ++++ + + +
Sbjct: 389 SAAKLKE-EELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVE 434
Score = 34.2 bits (78), Expect = 0.72
Identities = 21/135 (15%), Positives = 48/135 (35%), Gaps = 4/135 (2%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLREL----YEQRAAAAELEHKTMI 88
++ LK E KL E ++ Y+ + + L Y + + ++
Sbjct: 188 ELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELL 247
Query: 89 EKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSL 148
E + + +L++ +E + L++ KE +K+ E+ ++L E L L
Sbjct: 248 RDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLE 307
Query: 149 AENHILNSQMTLVNN 163
++
Sbjct: 308 RRKVDDEEKLKESEK 322
Score = 30.7 bits (69), Expect = 8.1
Identities = 23/128 (17%), Positives = 45/128 (35%), Gaps = 8/128 (6%)
Query: 25 SSTSMVVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELY--------EQR 76
++ +L+ LK++E K EE + + Q + EL
Sbjct: 329 KELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLS 388
Query: 77 AAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDE 136
+AA E + ++ E K A L +L E +E++ + +K + + LE +
Sbjct: 389 SAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLT 448
Query: 137 INNLNENL 144
Sbjct: 449 EEKEELEK 456
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 40.9 bits (96), Expect = 0.006
Identities = 67/447 (14%), Positives = 169/447 (37%), Gaps = 40/447 (8%)
Query: 853 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE--LEHKTMIEKELA 910
L+ ++E+ E E K+ E+ ++L EL ++ A E + + I++
Sbjct: 337 KLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKE 396
Query: 911 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS------- 963
+ A L A L+EIQEE++ ++ +E ++++ E+E+ I+ +++IN L
Sbjct: 397 ELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGA 456
Query: 964 ----SLAENHILNSQMTLVNNLFSNML--INPMINLDMLIKLIQDNYLLIIDLINNHEIN 1017
+ + + L+ L + ++ + +++ + + E
Sbjct: 457 GEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEE 516
Query: 1018 DTVSL--LVDLEKKIDQETLAIQNDKQLKQ--LENYETPMSSAMVEAVESNQTSTSKDSA 1073
L L+++++++ ++N + + E + + +E +
Sbjct: 517 LIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELL- 575
Query: 1074 NIQLDHAETNTNEESPADQDETNSTTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYE 1133
E + ++ E + KE+ L + L ELL + +
Sbjct: 576 -------EELRLLRTRKEELEELRERLKELKKKLKELEERLSQ----LEELLQSLELSEA 624
Query: 1134 NNDVNENFESCYKNIKTNTGKTHQVISVSQTFIR--LKHLILEKNHLTNQLAKLKTLNNH 1191
N++ E E ++ + + + + ++ L+ L + L ++ +
Sbjct: 625 ENEL-EEAEEELESEL----EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQR-IE 678
Query: 1192 LEIRLNHSEQRLNTVTTELKKTWNVVNKLKLQHKQLYTHEKILKYELKHKRNIINDLKID 1251
E +L + L + EL++ + +L + ++ + L+ + +L+
Sbjct: 679 NEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELE-K 737
Query: 1252 LEYVREKWDLVKEKNEQNELDYKVLRK 1278
LE E + ++EK + L +LR
Sbjct: 738 LEKALELLEELREKLGKAGLRADILRN 764
Score = 39.4 bits (92), Expect = 0.022
Identities = 28/113 (24%), Positives = 59/113 (52%), Gaps = 2/113 (1%)
Query: 36 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAE--LEHKTMIEKELA 93
L+ ++E+ E E K+ E+ ++L EL ++ A E + + I++
Sbjct: 337 KLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKE 396
Query: 94 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
+ A L A L+EIQEE++ ++ +E ++++ E+E+ I+ +++IN L
Sbjct: 397 ELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELM 449
Score = 34.7 bits (80), Expect = 0.53
Identities = 79/445 (17%), Positives = 160/445 (35%), Gaps = 36/445 (8%)
Query: 864 KLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQ 923
L+S + + E ++ +K +L E E+ + + E + L E
Sbjct: 147 AFLKSKPKERKEILDELFGLEKYEKLSEL---LKEVIKEAKAKIEELEGQLSELLEDIED 203
Query: 924 EEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN---LTSSLAENHILNSQMTLVNN 980
+ E + K + ++ E+ + + EI L E L L +++ + +
Sbjct: 204 LLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIES 263
Query: 981 LFSNMLINPMINLDMLIKLIQDNYLLIIDL-INNHEINDTVSLLVDLEKKIDQETLAIQN 1039
L L L L +L+++ I L EI + L L +++ ++
Sbjct: 264 LELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEK 323
Query: 1040 DKQL-KQLENYETPMSSAMVE----AVESNQTSTSKDS-ANIQLDHAETNTNEESPADQD 1093
K L ++LE E + E A E N+ + + + E E A +
Sbjct: 324 LKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALER 383
Query: 1094 ETNSTTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYENNDVNENFESCYKNIKT--- 1150
+ + +++E+A + I EL +E E E E K ++
Sbjct: 384 LKQLEEAIQE--LKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQIN 441
Query: 1151 ---NTGKTHQVISVSQ-------TFIRLKHLILEKNHLTNQLAKLKTLNNHLEIRLNHSE 1200
+ ++ + + +H +L +L+ E+ E
Sbjct: 442 QLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELE-----EELSREKEE 496
Query: 1201 QRLNTVTTELKKTWNVVNKLKLQHKQLYTHEKILKYELKHKRNIINDLKIDLEYVREKWD 1260
L EL+K + + ++ +L E+ LK EL+ K + +L +LE ++EK
Sbjct: 497 AELREEIEELEKELRELEEELIELLEL---EEALKEELEEKLEKLENLLEELEELKEKLQ 553
Query: 1261 LVKEKNEQNELDYKVLRKEFALRRL 1285
L + K E +L+ ++ + L L
Sbjct: 554 LQQLKEELRQLEDRLQELKELLEEL 578
Score = 32.8 bits (75), Expect = 2.1
Identities = 27/128 (21%), Positives = 65/128 (50%), Gaps = 16/128 (12%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKI---------EYEKQMTNYKKLRELYEQRAAAAELE 83
++ + RL+++E+++ E EE++ E +++ + ++ E E++ E E
Sbjct: 285 ELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESE 344
Query: 84 HKTMIEKELAKNALLEAKLKEIQE-------EIQAQLEKYKENKKQILEMEKNIQLKDDE 136
+ + E++ LLE +LKE++E E++ LE+ K+ ++ I E+++ +
Sbjct: 345 LEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAA 404
Query: 137 INNLNENL 144
+ + E L
Sbjct: 405 LEEIQEEL 412
Score = 32.0 bits (73), Expect = 3.1
Identities = 27/127 (21%), Positives = 64/127 (50%), Gaps = 16/127 (12%)
Query: 851 VSDLKTRLKDVEKKLLESPEEIKI---------EYEKQMTNYKKLRELYEQRAAAAELEH 901
+ + RL+++E+++ E EE++ E +++ + ++ E E++ E E
Sbjct: 286 LEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESEL 345
Query: 902 KTMIEKELAKNALLEAKLKEIQE-------EIQAQLEKYKENKKQILEMEKNIQLKDDEI 954
+ + E++ LLE +LKE++E E++ LE+ K+ ++ I E+++ + +
Sbjct: 346 EELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAAL 405
Query: 955 NNLNENL 961
+ E L
Sbjct: 406 EEIQEEL 412
Score = 32.0 bits (73), Expect = 3.6
Identities = 16/98 (16%), Positives = 49/98 (50%), Gaps = 7/98 (7%)
Query: 850 LVSDLKTRLKDVEKKL--LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEK 907
+ +L+ +++++E ++ E + + E+++ ++L E EQ E + K
Sbjct: 655 ALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEE-----LLK 709
Query: 908 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEK 945
+L + L +L+ + E++ ++ ++ +K + +E+
Sbjct: 710 KLGEIEQLIEELESRKAELEELKKELEKLEKALELLEE 747
Score = 32.0 bits (73), Expect = 3.6
Identities = 16/98 (16%), Positives = 49/98 (50%), Gaps = 7/98 (7%)
Query: 33 KVSDLKTRLKDVEKKL--LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEK 90
+ +L+ +++++E ++ E + + E+++ ++L E EQ E + K
Sbjct: 655 ALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEE-----LLK 709
Query: 91 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEK 128
+L + L +L+ + E++ ++ ++ +K + +E+
Sbjct: 710 KLGEIEQLIEELESRKAELEELKKELEKLEKALELLEE 747
Score = 30.9 bits (70), Expect = 7.5
Identities = 23/112 (20%), Positives = 52/112 (46%), Gaps = 13/112 (11%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
++ +L+ L+++E++L+E E + E+ +KL L E+ E +++EL
Sbjct: 502 EIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEEL 561
Query: 93 AKNALLEAKLK-------------EIQEEIQAQLEKYKENKKQILEMEKNIQ 131
+ +LK E EE++ +L++ K+ K++ E ++
Sbjct: 562 RQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE 613
Score = 30.5 bits (69), Expect = 8.8
Identities = 23/119 (19%), Positives = 53/119 (44%), Gaps = 13/119 (10%)
Query: 843 TRSVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHK 902
+ + + +L+ L+++E++L+E E + E+ +KL L E+ E
Sbjct: 495 EEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL 554
Query: 903 TMIEKELAKNALLEAKLK-------------EIQEEIQAQLEKYKENKKQILEMEKNIQ 948
+++EL + +LK E EE++ +L++ K+ K++ E ++
Sbjct: 555 QQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE 613
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 40.6 bits (96), Expect = 0.009
Identities = 22/111 (19%), Positives = 52/111 (46%), Gaps = 14/111 (12%)
Query: 37 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYK--------KLRELYEQRAAAAELEH---- 84
L+ + ++ E L E+ E++K E E++ + + + +Q A+ E
Sbjct: 532 LEQKAEEAEALLKEA-EKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590
Query: 85 KTMIEKELAKNALLEAK-LKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 134
K + + + A ++A L E ++ + EK ++ KK+ E ++ +++ D
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGD 641
Score = 40.6 bits (96), Expect = 0.009
Identities = 22/111 (19%), Positives = 52/111 (46%), Gaps = 14/111 (12%)
Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYK--------KLRELYEQRAAAAELEH---- 901
L+ + ++ E L E+ E++K E E++ + + + +Q A+ E
Sbjct: 532 LEQKAEEAEALLKEA-EKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590
Query: 902 KTMIEKELAKNALLEAK-LKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 951
K + + + A ++A L E ++ + EK ++ KK+ E ++ +++ D
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGD 641
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 38.4 bits (90), Expect = 0.029
Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 53 EEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEE 108
EE++ EYE+++ Y+ +E++E E + K I+K L +L E++ E
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKG-KDEEALAEELLELEAE 122
Score = 38.4 bits (90), Expect = 0.029
Identities = 16/56 (28%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 870 EEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEE 925
EE++ EYE+++ Y+ +E++E E + K I+K L +L E++ E
Sbjct: 68 EELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKG-KDEEALAEELLELEAE 122
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 38.7 bits (90), Expect = 0.032
Identities = 30/126 (23%), Positives = 41/126 (32%), Gaps = 8/126 (6%)
Query: 571 ILEARNARLKRLEDQ-------CKNLFHKMSLTSQRSDTLSERLEELHDYYGTSNERHSP 623
I E +K L C+N+ K + DTL ERLEE D Y S E P
Sbjct: 222 INELCKRLVKELRRCPSFASVYCRNIL-KHPVDGTSVDTLLERLEEDFDIYEDSLEYDDP 280
Query: 624 RRHREGSQDSDDTRLRSPENEPLSGVSANVPDSQENSQESDDTRRRSAETNESHSGVSGN 683
D DD +PL ++ + S + +D S +
Sbjct: 281 GLESTDEDDDDDGDSSLQTFKPLLDLTGSSLWSDDEESGDEDGDGSGFAPEPLIKTDSRS 340
Query: 684 VSDLQD 689
L D
Sbjct: 341 NDTLVD 346
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 38.5 bits (90), Expect = 0.033
Identities = 28/127 (22%), Positives = 58/127 (45%), Gaps = 15/127 (11%)
Query: 36 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYE------------QRAAAAELE 83
DL+ L++V+K+ E+ +E+K +Y +++ K RE+ E QR + +
Sbjct: 368 DLRAELEEVDKEFAETRDELK-DYREKLEKLK--REINELKRELDRLQEELQRLSEELAD 424
Query: 84 HKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 143
I AK LE + ++ EI+ Q K ++ + + E+ + +E + + +
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKE 484
Query: 144 LTSSLAE 150
L+ E
Sbjct: 485 LSKLQRE 491
Score = 38.5 bits (90), Expect = 0.033
Identities = 28/127 (22%), Positives = 58/127 (45%), Gaps = 15/127 (11%)
Query: 853 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYE------------QRAAAAELE 900
DL+ L++V+K+ E+ +E+K +Y +++ K RE+ E QR + +
Sbjct: 368 DLRAELEEVDKEFAETRDELK-DYREKLEKLK--REINELKRELDRLQEELQRLSEELAD 424
Query: 901 HKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 960
I AK LE + ++ EI+ Q K ++ + + E+ + +E + + +
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKE 484
Query: 961 LTSSLAE 967
L+ E
Sbjct: 485 LSKLQRE 491
Score = 38.1 bits (89), Expect = 0.050
Identities = 44/229 (19%), Positives = 98/229 (42%), Gaps = 25/229 (10%)
Query: 34 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEH--------- 84
VS ++ RL+++E+KL E + ++ ++ +L EQ + +
Sbjct: 807 VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL 866
Query: 85 KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
+ +E+ A LE++L ++++E E + Q+ E+E+ I+ + +I + L
Sbjct: 867 EEELEELEAALRDLESRLGDLKKER-------DELEAQLRELERKIEELEAQIEKKRKRL 919
Query: 145 TSSLAENHILNSQMTLVNNLFSNMLINPMINLDM--LIKLIQDNYLLIIDL--INNHEIN 200
+ A+ L +++ + + P L + + +Q I L +N I
Sbjct: 920 SELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQ 979
Query: 201 D---TVSLLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVES 246
+ + L +L++K + L + L+++E YE +EA E+
Sbjct: 980 EYEEVLKRLDELKEK--RAKLEEERKAILERIEEYEKKKREVFMEAFEA 1026
Score = 38.1 bits (89), Expect = 0.050
Identities = 44/229 (19%), Positives = 98/229 (42%), Gaps = 25/229 (10%)
Query: 851 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEH--------- 901
VS ++ RL+++E+KL E + ++ ++ +L EQ + +
Sbjct: 807 VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL 866
Query: 902 KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
+ +E+ A LE++L ++++E E + Q+ E+E+ I+ + +I + L
Sbjct: 867 EEELEELEAALRDLESRLGDLKKER-------DELEAQLRELERKIEELEAQIEKKRKRL 919
Query: 962 TSSLAENHILNSQMTLVNNLFSNMLINPMINLDM--LIKLIQDNYLLIIDL--INNHEIN 1017
+ A+ L +++ + + P L + + +Q I L +N I
Sbjct: 920 SELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQ 979
Query: 1018 D---TVSLLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVES 1063
+ + L +L++K + L + L+++E YE +EA E+
Sbjct: 980 EYEEVLKRLDELKEK--RAKLEEERKAILERIEEYEKKKREVFMEAFEA 1026
Score = 30.8 bits (70), Expect = 7.3
Identities = 24/112 (21%), Positives = 53/112 (47%), Gaps = 6/112 (5%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
++ L+ RL+ + K+ L S + E ++ + ++ E E +E+E
Sbjct: 675 ELQRLRERLEGL-KRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKE-IEQLEQEE 732
Query: 93 AKN----ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 140
K LE L +++EI+ + KE + +I E+E+++ ++ +N+L
Sbjct: 733 EKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDL 784
Score = 30.4 bits (69), Expect = 9.5
Identities = 26/131 (19%), Positives = 57/131 (43%), Gaps = 21/131 (16%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAA-AELEH--KTMIE 89
++SD ++ ++EK++ + +E + E+ + L L ++ +EL+ + E
Sbjct: 710 ELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEE 769
Query: 90 KELAKNALLEA-----------KLKEIQEEIQAQLEKYKENKKQILEMEKNI-------Q 131
E + L EA ++ EIQ E+ E+ + ++ E+E+ + +
Sbjct: 770 LEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKE 829
Query: 132 LKDDEINNLNE 142
+ EI L E
Sbjct: 830 YLEKEIQELQE 840
Score = 30.4 bits (69), Expect = 9.8
Identities = 33/152 (21%), Positives = 61/152 (40%), Gaps = 28/152 (18%)
Query: 836 ESNILATTRSVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLREL------ 889
E + L + I + +L L D +K+ E +EI+ +++ ++L EL
Sbjct: 689 ELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSS 748
Query: 890 YEQRAAAAELEHKTMI----EKELAKNALLEA-----------KLKEIQEEIQAQLEKYK 934
EQ + E K + E E + L EA ++ EIQ E+ E+
Sbjct: 749 LEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVS 808
Query: 935 ENKKQILEMEKNI-------QLKDDEINNLNE 959
+ ++ E+E+ + + + EI L E
Sbjct: 809 RIEARLREIEQKLNRLTLEKEYLEKEIQELQE 840
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 37.9 bits (88), Expect = 0.039
Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 37 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
K L +E++L+ +E K + +Q+ +L E A +KEL
Sbjct: 62 KKKELSQLEEQLINQKKEQKNLFNEQIKQ-FELALQDEIAKLEALELLNLEKDKELE--- 117
Query: 97 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 129
LLE +L E+ +E+Q QL+ E ++ E KN
Sbjct: 118 LLEKELDELSKELQKQLQNTAEIIEKKRENNKN 150
Score = 37.9 bits (88), Expect = 0.039
Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 913
K L +E++L+ +E K + +Q+ +L E A +KEL
Sbjct: 62 KKKELSQLEEQLINQKKEQKNLFNEQIKQ-FELALQDEIAKLEALELLNLEKDKELE--- 117
Query: 914 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 946
LLE +L E+ +E+Q QL+ E ++ E KN
Sbjct: 118 LLEKELDELSKELQKQLQNTAEIIEKKRENNKN 150
Score = 31.7 bits (72), Expect = 4.0
Identities = 27/177 (15%), Positives = 72/177 (40%), Gaps = 12/177 (6%)
Query: 55 IKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLE 114
K ++ + E +E+ A ++++ +KEL++ LE +L ++E +
Sbjct: 29 YKQIEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQ---LEEQLINQKKEQKNLFN 85
Query: 115 KYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVNNLFSNMLINPMI 174
+ + + L+ ++ +L+ E+ NL ++ L E L+ + N
Sbjct: 86 EQIKQFELALQ-DEIAKLEALELLNLEKDKELELLEKE-LDELSKELQKQLQNTAEI--- 140
Query: 175 NLDMLIKLIQDNYLLIIDLINNHEINDTVSLL-VDLEKKIDQETLAIQNDKQLKQLE 230
+ K + + N ++ +++ L E+++ + L ++ + +Q E
Sbjct: 141 ---IEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRE 194
Score = 31.7 bits (72), Expect = 4.0
Identities = 27/177 (15%), Positives = 72/177 (40%), Gaps = 12/177 (6%)
Query: 872 IKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLE 931
K ++ + E +E+ A ++++ +KEL++ LE +L ++E +
Sbjct: 29 YKQIEQEDQSRILNTLEEFEKEANEKRAQYRSAKKKELSQ---LEEQLINQKKEQKNLFN 85
Query: 932 KYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVNNLFSNMLINPMI 991
+ + + L+ ++ +L+ E+ NL ++ L E L+ + N
Sbjct: 86 EQIKQFELALQ-DEIAKLEALELLNLEKDKELELLEKE-LDELSKELQKQLQNTAEI--- 140
Query: 992 NLDMLIKLIQDNYLLIIDLINNHEINDTVSLL-VDLEKKIDQETLAIQNDKQLKQLE 1047
+ K + + N ++ +++ L E+++ + L ++ + +Q E
Sbjct: 141 ---IEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKENEEQRE 194
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 38.1 bits (89), Expect = 0.048
Identities = 74/417 (17%), Positives = 148/417 (35%), Gaps = 54/417 (12%)
Query: 870 EEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQA 928
I+ +K + KKLRE E + AE+ + AK K + I +
Sbjct: 816 LACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQSAQ 875
Query: 929 QLEKYKENKKQILEMEKNIQLKDD--EINNL---NENLTSSLAENHILNSQMTLVNNLFS 983
++E L + +LK D I++L N L S + E L +L S
Sbjct: 876 RVE---------LAERQLQELKIDVKSISSLKLVNLELESEIIE---------LKKSLSS 917
Query: 984 NMLINPMINLDMLIKL--IQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLAIQNDK 1041
+++ N +++ +L + +N L + ++ L ++E K+ + + ++
Sbjct: 918 DLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYED-- 975
Query: 1042 QLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAETNTNEESPADQDETNSTTTL 1101
LK+ + A E + L + E + +++ +
Sbjct: 976 LLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKII 1035
Query: 1102 DKSHIQKEIANNLPKVWRILIELLNTHQEQYEN------NDVNENFESCYKNIKTNTGKT 1155
+ I L K+ +L+ N Q +Y+ N + ++ + N KT
Sbjct: 1036 SSESTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQLESTENLLKT 1095
Query: 1156 HQVISVSQTFIRLKHLILEKNHLTNQLAKLKTLNNHLEI---------RLNHSEQRLNTV 1206
I+V + ++L+ N L +A++ LN EI L Q+L+ +
Sbjct: 1096 ---INVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEISKFLSQLVNTLEPVFQKLSVL 1152
Query: 1207 TTELKKTWNVVN--------KLKLQHKQLYTHEKILKYELKHKRNIINDLKIDLEYV 1255
EL + N ++ + + K + +NDLK +L +
Sbjct: 1153 QLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKSKLSSSEVNDLKNELIAL 1209
Score = 33.5 bits (77), Expect = 1.3
Identities = 37/189 (19%), Positives = 73/189 (38%), Gaps = 28/189 (14%)
Query: 53 EEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQA 111
I+ +K + KKLRE E + AE+ + AK K + I +
Sbjct: 816 LACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQSAQ 875
Query: 112 QLEKYKENKKQILEMEKNIQLKDD--EINNL---NENLTSSLAENHILNSQMTLVNNLFS 166
++E L + +LK D I++L N L S + E L +L S
Sbjct: 876 RVE---------LAERQLQELKIDVKSISSLKLVNLELESEIIE---------LKKSLSS 917
Query: 167 NMLINPMINLDMLIKL--IQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLAIQNDK 224
+++ N +++ +L + +N L + ++ L ++E K+ + + ++
Sbjct: 918 DLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYED-- 975
Query: 225 QLKQLENYE 233
LK+
Sbjct: 976 LLKKSTILV 984
Score = 32.0 bits (73), Expect = 4.2
Identities = 29/176 (16%), Positives = 66/176 (37%), Gaps = 21/176 (11%)
Query: 1119 RILIELLNTHQEQYENNDVNENFESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHL 1178
+ + ++ E ++ S N ++I + ++ LI
Sbjct: 872 QSAQRVELAERQLQELKIDVKSISSLKL---VNLELESEIIELKKSL--SSDLIENLEFK 926
Query: 1179 TNQLAKLKTLNNHLEI--------RLNHSEQRLNTVTTELKKTWNVVNKLKLQHKQLYTH 1230
T +A+LK L N++++ +L+ V ++LK+T +++ L
Sbjct: 927 TELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKET-------SEEYEDLLKK 979
Query: 1231 EKILKYELKHKRNIINDLKIDL-EYVREKWDLVKEKNEQNELDYKVLRKEFALRRL 1285
IL E + + + K +L E ++ L + + EL +V + A + +
Sbjct: 980 STILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKII 1035
Score = 31.6 bits (72), Expect = 5.2
Identities = 71/454 (15%), Positives = 134/454 (29%), Gaps = 74/454 (16%)
Query: 848 LPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEK 907
L ++L RLK + + +++ + +L +L+E
Sbjct: 923 LEFKTELIARLKKLLNNI-----DLEEGPSIEYVKLPELNKLHEV--------------- 962
Query: 908 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
E+KLKE EE + L+K ++ + ++ E+ L++ +
Sbjct: 963 --------ESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQES 1014
Query: 968 NHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLE 1027
L V L S + + L + +
Sbjct: 1015 TKQLKELPVEVAELQSAS------KIISSESTELSILKPLQKLKGLLLLENNQLQARYKA 1068
Query: 1028 KKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAET----N 1083
K+ +E + DKQL QLE+ E + + V+ +E + K + +Q A+
Sbjct: 1069 LKLRRENSLLD-DKQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNL 1127
Query: 1084 TNEES-----------PADQDETNSTTTLDKSHIQKEIANNLPKVWRILIEL-------- 1124
E S P Q + LD + + +
Sbjct: 1128 LQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSAL 1187
Query: 1125 --LNTHQEQYENNDVNENFESCYKNIKTNTGKTHQVISVSQTFIRLKHLILEKNHLTNQL 1182
+ E ND+ + + I + + + LK LI E T
Sbjct: 1188 YDEKSKLSSSEVNDLKNELIALFSKIFSGWPRGDK----------LKKLISEGWVPTEYS 1237
Query: 1183 AKLKTLNNHLEIRLNHSEQRLNTVTTELKKTWNVVNKLKLQHKQLYTHEKILKYELKH-K 1241
LK NN L + + N L + + N L + + L++
Sbjct: 1238 TSLKGFNN-LNKKFDTPASMSNEKLLSLLNS--IDNLLSSYKLEEEVLPATINSLLQYIN 1294
Query: 1242 RNIINDLKIDLEYVREKWDLVKEKNEQNELDYKV 1275
+ N L+ +R K N + D+
Sbjct: 1295 VGLFNALRTKASSLRWKSATEVNYNSEELDDWCR 1328
Score = 30.8 bits (70), Expect = 8.4
Identities = 41/228 (17%), Positives = 77/228 (33%), Gaps = 31/228 (13%)
Query: 23 LTSSTSMVVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAEL 82
L S S + + + KT L KKLL + +++ + +L +L+E
Sbjct: 911 LKKSLSSDLIENLEFKTELIARLKKLLNN-IDLEEGPSIEYVKLPELNKLHEV------- 962
Query: 83 EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
E+KLKE EE + L+K ++ + ++ E+ L++
Sbjct: 963 ----------------ESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSK 1006
Query: 143 NLTSSLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDT 202
+ L V L S + + L + +
Sbjct: 1007 QYGALQESTKQLKELPVEVAELQSAS------KIISSESTELSILKPLQKLKGLLLLENN 1060
Query: 203 VSLLVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTS 250
K+ +E + DKQL QLE+ E + + V+ +E +
Sbjct: 1061 QLQARYKALKLRRENSLLD-DKQLYQLESTENLLKTINVKDLEVTNRN 1107
>gnl|CDD|223778 COG0706, YidC, Preprotein translocase subunit YidC [Intracellular
trafficking and secretion].
Length = 314
Score = 36.7 bits (85), Expect = 0.097
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 100 AKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 140
AK++E+Q +I+ EKYK KQ + E K ++N L
Sbjct: 137 AKMQELQPKIKEIQEKYKGTDKQKQQQEMMKLYKKHKVNPL 177
Score = 36.7 bits (85), Expect = 0.097
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 917 AKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 957
AK++E+Q +I+ EKYK KQ + E K ++N L
Sbjct: 137 AKMQELQPKIKEIQEKYKGTDKQKQQQEMMKLYKKHKVNPL 177
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 35.8 bits (83), Expect = 0.11
Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 11/89 (12%)
Query: 54 EIKIEYEKQMTNYKKL-RELYEQRAAAAELEHKTMI-----EKELAKNAL-----LEAKL 102
+ + + + K+L R+L E +A A +LE K + ++LA+ AL LE
Sbjct: 42 KARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLA 101
Query: 103 KEIQEEIQAQLEKYKENKKQILEMEKNIQ 131
K ++ E+Q E+ ++ KKQ+ +E+ I
Sbjct: 102 KALEAELQQAEEQVEKLKKQLAALEQKIA 130
Score = 35.8 bits (83), Expect = 0.11
Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 11/89 (12%)
Query: 871 EIKIEYEKQMTNYKKL-RELYEQRAAAAELEHKTMI-----EKELAKNAL-----LEAKL 919
+ + + + K+L R+L E +A A +LE K + ++LA+ AL LE
Sbjct: 42 KARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLA 101
Query: 920 KEIQEEIQAQLEKYKENKKQILEMEKNIQ 948
K ++ E+Q E+ ++ KKQ+ +E+ I
Sbjct: 102 KALEAELQQAEEQVEKLKKQLAALEQKIA 130
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 35.8 bits (83), Expect = 0.17
Identities = 22/105 (20%), Positives = 50/105 (47%), Gaps = 3/105 (2%)
Query: 41 LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKTMIEKELAK--NAL 97
L+ +++ L E+ E +K +Y+ M + L + + R LE + K+L
Sbjct: 142 LEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDC 201
Query: 98 LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
+L +E+++ L++ K++ E+E+ +Q + +I +L
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTN 246
Score = 35.8 bits (83), Expect = 0.17
Identities = 22/105 (20%), Positives = 50/105 (47%), Gaps = 3/105 (2%)
Query: 858 LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQ-RAAAAELEHKTMIEKELAK--NAL 914
L+ +++ L E+ E +K +Y+ M + L + + R LE + K+L
Sbjct: 142 LEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDC 201
Query: 915 LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
+L +E+++ L++ K++ E+E+ +Q + +I +L
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTN 246
>gnl|CDD|181362 PRK08296, PRK08296, hypothetical protein; Provisional.
Length = 603
Score = 35.8 bits (83), Expect = 0.20
Identities = 23/72 (31%), Positives = 32/72 (44%), Gaps = 9/72 (12%)
Query: 444 DEKDIVKGNVVDYEATKQTRTESPSEEQKSKQLIRQDRITEE------TDESRPFESA-E 496
D+ +I G + DY + R E+ E +DRI EE DE + F+
Sbjct: 305 DDLEIPLGYIKDYIGRLR-RGETI-ERPTEALRAERDRIVEEYRDLLDGDERKAFDELLG 362
Query: 497 CSRTSQPEVENH 508
+RT P VENH
Sbjct: 363 LARTVYPYVENH 374
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 35.8 bits (83), Expect = 0.21
Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 18/103 (17%)
Query: 51 SPEEIK--IEYEKQMTNY-KKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQE 107
S EEI+ +E + KK REL E R A L + EK L + + + KE E
Sbjct: 484 SDEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSL--EKALKEIVKVSEEEKEKIE 541
Query: 108 EIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
E LE+ E +K +EI E L +
Sbjct: 542 EAITDLEEALEGEK-------------EEIKAKIEELQEVTQK 571
Score = 35.8 bits (83), Expect = 0.21
Identities = 29/103 (28%), Positives = 40/103 (38%), Gaps = 18/103 (17%)
Query: 868 SPEEIK--IEYEKQMTNY-KKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQE 924
S EEI+ +E + KK REL E R A L + EK L + + + KE E
Sbjct: 484 SDEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSL--EKALKEIVKVSEEEKEKIE 541
Query: 925 EIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
E LE+ E +K +EI E L +
Sbjct: 542 EAITDLEEALEGEK-------------EEIKAKIEELQEVTQK 571
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 35.9 bits (82), Expect = 0.24
Identities = 51/268 (19%), Positives = 104/268 (38%), Gaps = 22/268 (8%)
Query: 1209 ELKKTWNVVNKLKLQHKQLYTHEKILKYELK---HKRNIINDLKIDLEYVREKWDLVKEK 1265
E+ K + K+K + + KI ELK ++ + LK ++K + +K+
Sbjct: 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKA 1655
Query: 1266 NEQNELDYKVL--------RKEFALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESE 1317
E+N++ +K ++ + D + +E+ E E ++ + E
Sbjct: 1656 EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE-KKAAEALKKEAEEAKKAEELKKKEAEE 1714
Query: 1318 DDKSEVVETQSEHDVPHTNSVMSIEEDDQDEVEEL----ESETNVSAINNEQEVERTEVT 1373
K+E ++ E + E+D+ + EE E + ++ + E+E + E+
Sbjct: 1715 KKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774
Query: 1374 DNTSAVIPSRETIENEAVPVSVSPQ-EDIVDTITTSRTDHIESHQEINSSRET-----KE 1427
AVI E+E + V + +DI D E + IN S+E KE
Sbjct: 1775 KEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKE 1834
Query: 1428 IPGTSDANVNYSQQFLNNKITIKTIGGK 1455
+ + + + + F +K G+
Sbjct: 1835 VADSKNMQLEEADAFEKHKFNKNNENGE 1862
Score = 34.7 bits (79), Expect = 0.62
Identities = 59/309 (19%), Positives = 122/309 (39%), Gaps = 16/309 (5%)
Query: 857 RLKDVEKKLLESPEEIKIEYEKQMTNYKKL-RELYEQRAAAAELEHKTMIEKELAKNALL 915
+LK E + + EE+K E+ + ++ E + A E + EK+ A+
Sbjct: 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
Query: 916 EAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQM 975
EA+ + EE++ + + K+ +++ + E+ ++K +E E E +
Sbjct: 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756
Query: 976 TLVNNLFSNMLINPMINLDMLIKLIQ---DNYLLIIDLINNHEINDTVSLLVDLEKKIDQ 1032
+ +L +I+ D + + +I D ++ + +
Sbjct: 1757 KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKE 1816
Query: 1033 ETLAIQNDKQLKQLENYETPMSSAMV--EAVESNQTSTSKDSANIQLDHAETNTNEESPA 1090
L I + K+++ E S M EA + +K++ N + + E + N+E
Sbjct: 1817 GNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDL 1876
Query: 1091 DQD------ETNSTTTLDKSHIQKEIANN--LPKVWRILIELLNTHQEQYENNDVNENFE 1142
+D E + +DK I++EI NN K I+ + L+ +++Y D E E
Sbjct: 1877 KEDDEEEIEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLD--KDEYIKRDAEETRE 1934
Query: 1143 SCYKNIKTN 1151
K K +
Sbjct: 1935 EIIKISKKD 1943
>gnl|CDD|234212 TIGR03443, alpha_am_amid, L-aminoadipate-semialdehyde
dehydrogenase. Members of this protein family are
L-aminoadipate-semialdehyde dehydrogenase (EC 1.2.1.31),
product of the LYS2 gene. It is also called
alpha-aminoadipate reductase. In fungi, lysine is
synthesized via aminoadipate. Currently, all members of
this family are fungal.
Length = 1389
Score = 35.8 bits (83), Expect = 0.26
Identities = 52/187 (27%), Positives = 78/187 (41%), Gaps = 38/187 (20%)
Query: 737 NVEAESRADAVVQDR--------IDTRLKSSPGINECTSAVSAVSEESSGLRSVDAIRPV 788
NVE RAD V+ R IDT L P + E + V +E L V I P
Sbjct: 693 NVECCGRADDQVKIRGFRIELGEIDTHLSQHPLVRENVTLVRRDKDEEPTL--VSYIVPQ 750
Query: 789 LNHDEMT--RRNVEAESRADAVVQGQEKPRPLTNGV--CLKTCVPMHSTPPESNILATTR 844
DE+ + V+ E +D VV+G K R L + LK +P ++ P
Sbjct: 751 DKSDELEEFKSEVDDEESSDPVVKGLIKYRKLIKDIREYLKKKLPSYAIP---------- 800
Query: 845 SVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEK-QMTNYKKLRELYEQRAAAAELEHKT 903
+VI+PL KKL +P K++ + +L + + R+A+A E T
Sbjct: 801 TVIVPL------------KKLPLNPNG-KVDKPALPFPDTAQLAAVAKNRSASAADEEFT 847
Query: 904 MIEKELA 910
E+E+
Sbjct: 848 ETEREIR 854
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 35.8 bits (83), Expect = 0.26
Identities = 23/117 (19%), Positives = 52/117 (44%), Gaps = 7/117 (5%)
Query: 847 ILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIE 906
I L+ + + + + + E E+I +K+ KK L+ +++ +
Sbjct: 359 ITDLIESINDIIDAINELIREHNEKID-NLKKEKNKAKKKLWLHLVAELKEDIDA---YQ 414
Query: 907 KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
KE LE + +++EI+ + K +K+I E+EK + + + +N+ L +
Sbjct: 415 KEKKG---LEKAINSLEKEIKQLEAEIKALEKEIKELEKQLTNIEPTADEINKLLKA 468
Score = 35.0 bits (81), Expect = 0.35
Identities = 26/170 (15%), Positives = 74/170 (43%), Gaps = 14/170 (8%)
Query: 103 KEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVN 162
+E+ E+++ ++KY+ + ++ LE ++I + E + ++ A L +
Sbjct: 281 QELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNL 340
Query: 163 NLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLAIQN 222
L +P ++++ ++ +I+ IND + + +L ++ +++ ++
Sbjct: 341 QKLEEKLKDPSTSIEL------ESITDLIE-----SINDIIDAINELIREHNEKIDNLKK 389
Query: 223 DKQL--KQLENYETPMSSAMVEAVESNQT-STKAVESNQTSTSKDSANIQ 269
+K K+L + ++A + + KA+ S + + A I+
Sbjct: 390 EKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIK 439
Score = 34.6 bits (80), Expect = 0.57
Identities = 21/117 (17%), Positives = 51/117 (43%), Gaps = 7/117 (5%)
Query: 30 VVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIE 89
+ + + + + + + E E+I +K+ KK L+ +++ +
Sbjct: 359 ITDLIESINDIIDAINELIREHNEKID-NLKKEKNKAKKKLWLHLVAELKEDIDA---YQ 414
Query: 90 KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
KE LE + +++EI+ + K +K+I E+EK + + + +N+ L +
Sbjct: 415 KEKKG---LEKAINSLEKEIKQLEAEIKALEKEIKELEKQLTNIEPTADEINKLLKA 468
Score = 34.2 bits (79), Expect = 0.76
Identities = 29/215 (13%), Positives = 87/215 (40%), Gaps = 28/215 (13%)
Query: 920 KEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVN 979
+E+ E+++ ++KY+ + ++ LE ++I + E + ++ A L +
Sbjct: 281 QELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNL 340
Query: 980 NLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLAIQN 1039
L +P ++++ ++ +I+ IND + + +L ++ +++ ++
Sbjct: 341 QKLEEKLKDPSTSIEL------ESITDLIE-----SINDIIDAINELIREHNEKIDNLKK 389
Query: 1040 DKQL--KQLENYETPMSSAMVEAVESNQTSTSKDSANIQLDHAETNTNEESPADQDETNS 1097
+K K+L + V ++ + + K+ ++ N+ E+ +
Sbjct: 390 EKNKAKKKLWLHL-------VAELKEDIDAYQKEKKGLE---KAINSLEKEIKQLEAEIK 439
Query: 1098 TTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQY 1132
+ ++K++ N P + +N + Y
Sbjct: 440 ALEKEIKELEKQLTNIEP-----TADEINKLLKAY 469
Score = 33.4 bits (77), Expect = 1.1
Identities = 30/117 (25%), Positives = 52/117 (44%), Gaps = 8/117 (6%)
Query: 850 LVSDLKTRLKDVEKKLLESPEEI--KIEYEKQMTNYKKLRELYEQRAAAAELEHKTMI-- 905
L+ L+ L+ +KL E ++ IE E + + ++ + EH I
Sbjct: 328 LLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDA-INELIREHNEKIDN 386
Query: 906 ---EKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
EK AK L + E++E+I A ++ K +K I +EK I+ + EI L +
Sbjct: 387 LKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEK 443
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 35.5 bits (82), Expect = 0.27
Identities = 14/108 (12%), Positives = 42/108 (38%), Gaps = 9/108 (8%)
Query: 38 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHK-TMIEKELAKNA 96
KD + K L+ + + +++++ + +++ + +I K+
Sbjct: 765 ARLEKDYQSKYLDLIDNLND--------AREIKDKESKAIDLDDIDFELELIGKQEINID 816
Query: 97 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
+ L+ ++ A + ++ I E ++++ K I L E +
Sbjct: 817 YILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAI 864
Score = 35.5 bits (82), Expect = 0.27
Identities = 14/108 (12%), Positives = 42/108 (38%), Gaps = 9/108 (8%)
Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHK-TMIEKELAKNA 913
KD + K L+ + + +++++ + +++ + +I K+
Sbjct: 765 ARLEKDYQSKYLDLIDNLND--------AREIKDKESKAIDLDDIDFELELIGKQEINID 816
Query: 914 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
+ L+ ++ A + ++ I E ++++ K I L E +
Sbjct: 817 YILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAI 864
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 33.9 bits (78), Expect = 0.28
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%)
Query: 82 LEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN 141
L K +E++LA L +EEI+ + + +++ + + E+EKN + + E +
Sbjct: 26 LRRKRALERQLAAELKQLELL---EEEIRREEAELEKDLEYLQELEKNAKALEREREEES 82
Query: 142 ENLTSSLAENHI 153
+NL L
Sbjct: 83 KNLHPVLRLLES 94
Score = 33.9 bits (78), Expect = 0.28
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%)
Query: 899 LEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN 958
L K +E++LA L +EEI+ + + +++ + + E+EKN + + E +
Sbjct: 26 LRRKRALERQLAAELKQLELL---EEEIRREEAELEKDLEYLQELEKNAKALEREREEES 82
Query: 959 ENLTSSLAENHI 970
+NL L
Sbjct: 83 KNLHPVLRLLES 94
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 35.1 bits (82), Expect = 0.33
Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 3/92 (3%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYE--QRAAAAELEHKTMIEK 90
K +L +L+ +EK+ E E+ + E E++ +K E E ELE + +
Sbjct: 94 KEENLDRKLELLEKRE-EELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTA 152
Query: 91 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ 122
E AK LLE +E + E +++ +E K+
Sbjct: 153 EEAKEILLEKVEEEARHEAAVLIKEIEEEAKE 184
Score = 34.8 bits (81), Expect = 0.40
Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 12/119 (10%)
Query: 40 RLKDVEKKLLESPEE---IKIEYEKQMTNYKKLREL-----YEQRAAAAELEHKTMIEKE 91
R K E K+ E+ EE I E +K+ KK L + E E + E
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRER-RNE 83
Query: 92 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
L K LE +L + +E + +LE ++ ++++ + EK ++ K E+ E L + E
Sbjct: 84 LQK---LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139
Score = 34.8 bits (81), Expect = 0.40
Identities = 31/119 (26%), Positives = 53/119 (44%), Gaps = 12/119 (10%)
Query: 857 RLKDVEKKLLESPEE---IKIEYEKQMTNYKKLREL-----YEQRAAAAELEHKTMIEKE 908
R K E K+ E+ EE I E +K+ KK L + E E + E
Sbjct: 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRER-RNE 83
Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
L K LE +L + +E + +LE ++ ++++ + EK ++ K E+ E L + E
Sbjct: 84 LQK---LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139
Score = 34.4 bits (80), Expect = 0.55
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 3/88 (3%)
Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYE--QRAAAAELEHKTMIEKELAK 911
L +L+ +EK+ E E+ + E E++ +K E E ELE + + E AK
Sbjct: 98 LDRKLELLEKRE-EELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAK 156
Query: 912 NALLEAKLKEIQEEIQAQLEKYKENKKQ 939
LLE +E + E +++ +E K+
Sbjct: 157 EILLEKVEEEARHEAAVLIKEIEEEAKE 184
Score = 32.8 bits (76), Expect = 1.7
Identities = 18/110 (16%), Positives = 57/110 (51%), Gaps = 13/110 (11%)
Query: 858 LKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQ-----RAAAAELEHKTM-----IEK 907
L++ +K+ +E +E +++ KLR +E+ R +LE + + +++
Sbjct: 44 LEEAKKEAEAIKKEALLEAKEE---IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDR 100
Query: 908 ELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 957
+L E +L++ ++E++ + ++ ++ ++++ E+ + + + I+ L
Sbjct: 101 KLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGL 150
Score = 31.3 bits (72), Expect = 4.9
Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 9/79 (11%)
Query: 66 YKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILE 125
Y +++ E + AE E K ++E+ + + + I++E A LE +E K E
Sbjct: 22 YFVRKKIAEAKIKEAEEEAKRILEEA-------KKEAEAIKKE--ALLEAKEEIHKLRNE 72
Query: 126 MEKNIQLKDDEINNLNENL 144
EK ++ + +E+ L + L
Sbjct: 73 FEKELRERRNELQKLEKRL 91
Score = 31.3 bits (72), Expect = 4.9
Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 9/79 (11%)
Query: 883 YKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILE 942
Y +++ E + AE E K ++E+ + + + I++E A LE +E K E
Sbjct: 22 YFVRKKIAEAKIKEAEEEAKRILEEA-------KKEAEAIKKE--ALLEAKEEIHKLRNE 72
Query: 943 MEKNIQLKDDEINNLNENL 961
EK ++ + +E+ L + L
Sbjct: 73 FEKELRERRNELQKLEKRL 91
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 35.0 bits (81), Expect = 0.35
Identities = 22/106 (20%), Positives = 61/106 (57%), Gaps = 3/106 (2%)
Query: 37 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
L+ +++++E+++ E +EI+ E E+++ K+L+E E+ +E + EL +
Sbjct: 257 LEEKIRELEERIEELKKEIE-ELEEKVKELKELKEKAEEYIKLSEFYEE--YLDELREIE 313
Query: 97 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
++L+E I+ ++++ +E ++++ E++K ++ + + L E
Sbjct: 314 KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEE 359
Score = 35.0 bits (81), Expect = 0.35
Identities = 22/106 (20%), Positives = 61/106 (57%), Gaps = 3/106 (2%)
Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 913
L+ +++++E+++ E +EI+ E E+++ K+L+E E+ +E + EL +
Sbjct: 257 LEEKIRELEERIEELKKEIE-ELEEKVKELKELKEKAEEYIKLSEFYEE--YLDELREIE 313
Query: 914 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
++L+E I+ ++++ +E ++++ E++K ++ + + L E
Sbjct: 314 KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEE 359
Score = 34.3 bits (79), Expect = 0.78
Identities = 29/118 (24%), Positives = 58/118 (49%), Gaps = 16/118 (13%)
Query: 35 SDLKTRLKDVEKKLLESPEEIKI----------EYEKQMTNYKKLRELYEQRAAAAELEH 84
+++ +K+ EK+L E EI E EK K+L EL E+ +
Sbjct: 189 ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELE 248
Query: 85 KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 142
+E K LE K++E++E I+ ++ +E ++++ E+++ ++ K +E L+E
Sbjct: 249 S--LEGSKRK---LEEKIRELEERIEELKKEIEELEEKVKELKE-LKEKAEEYIKLSE 300
Score = 34.3 bits (79), Expect = 0.78
Identities = 29/118 (24%), Positives = 58/118 (49%), Gaps = 16/118 (13%)
Query: 852 SDLKTRLKDVEKKLLESPEEIKI----------EYEKQMTNYKKLRELYEQRAAAAELEH 901
+++ +K+ EK+L E EI E EK K+L EL E+ +
Sbjct: 189 ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELE 248
Query: 902 KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNE 959
+E K LE K++E++E I+ ++ +E ++++ E+++ ++ K +E L+E
Sbjct: 249 S--LEGSKRK---LEEKIRELEERIEELKKEIEELEEKVKELKE-LKEKAEEYIKLSE 300
Score = 33.5 bits (77), Expect = 1.0
Identities = 23/121 (19%), Positives = 54/121 (44%), Gaps = 21/121 (17%)
Query: 37 LKTRLKDVEKKLLESPE------------EIKIEYEKQMTNYKKLRELYEQRAAAAELEH 84
LK ++++E+K+ E E E EY ++ +K E+
Sbjct: 271 LKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEING----- 325
Query: 85 KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
IE+ + + E +L+E++++++ ++ +E +++ E+ K +E+ L + L
Sbjct: 326 ---IEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKK-EELERLKKRL 381
Query: 145 T 145
T
Sbjct: 382 T 382
Score = 33.5 bits (77), Expect = 1.0
Identities = 23/121 (19%), Positives = 54/121 (44%), Gaps = 21/121 (17%)
Query: 854 LKTRLKDVEKKLLESPE------------EIKIEYEKQMTNYKKLRELYEQRAAAAELEH 901
LK ++++E+K+ E E E EY ++ +K E+
Sbjct: 271 LKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEING----- 325
Query: 902 KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
IE+ + + E +L+E++++++ ++ +E +++ E+ K +E+ L + L
Sbjct: 326 ---IEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKK-EELERLKKRL 381
Query: 962 T 962
T
Sbjct: 382 T 382
Score = 32.7 bits (75), Expect = 2.0
Identities = 25/97 (25%), Positives = 51/97 (52%), Gaps = 5/97 (5%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
K+ +L+ L ++ K+L E E E E+++ K+L Y + + E + +E+E
Sbjct: 564 KLDELEEELAELLKELEELGFESVEELEERL---KELEPFYNEYLELKDAEKE--LEREE 618
Query: 93 AKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 129
+ LE +L + EE+ ++ +E +K++ E+EK
Sbjct: 619 KELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655
Score = 31.2 bits (71), Expect = 6.7
Identities = 24/94 (25%), Positives = 49/94 (52%), Gaps = 5/94 (5%)
Query: 853 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKN 912
+L+ L ++ K+L E E E E+++ K+L Y + + E + +E+E +
Sbjct: 567 ELEEELAELLKELEELGFESVEELEERL---KELEPFYNEYLELKDAEKE--LEREEKEL 621
Query: 913 ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 946
LE +L + EE+ ++ +E +K++ E+EK
Sbjct: 622 KKLEEELDKAFEELAETEKRLEELRKELEELEKK 655
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 35.1 bits (81), Expect = 0.40
Identities = 22/108 (20%), Positives = 54/108 (50%), Gaps = 15/108 (13%)
Query: 41 LKDVEKKLLESPEEIKIEYEK-QMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLE 99
L V+++ +E E E+ ++T Y+K + + +E+ +N+ L+
Sbjct: 394 LSKVKEEERPREKEGTEEEERREITVYEK--RIKKLEE---------TVERLEEENSELK 442
Query: 100 AKLKEIQEEI---QAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
+L+E++ EI +++LE+++ + + ++ I+ +D I L + L
Sbjct: 443 RELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKEL 490
Score = 35.1 bits (81), Expect = 0.40
Identities = 22/108 (20%), Positives = 54/108 (50%), Gaps = 15/108 (13%)
Query: 858 LKDVEKKLLESPEEIKIEYEK-QMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLE 916
L V+++ +E E E+ ++T Y+K + + +E+ +N+ L+
Sbjct: 394 LSKVKEEERPREKEGTEEEERREITVYEK--RIKKLEE---------TVERLEEENSELK 442
Query: 917 AKLKEIQEEI---QAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
+L+E++ EI +++LE+++ + + ++ I+ +D I L + L
Sbjct: 443 RELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKEL 490
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic phosphoprotein
1 (DMP1) gene has been mapped to human chromosome 4q21.
DMP1 is a bone and teeth specific protein initially
identified from mineralised dentin. DMP1 is primarily
localised in the nuclear compartment of undifferentiated
osteoblasts. In the nucleus, DMP1 acts as a
transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates mineralised
matrix formation extracellularly, at later stages of
osteoblast maturation. The DMP1 gene has been found to be
ectopically expressed in lung cancer although the reason
for this is unknown.
Length = 514
Score = 34.6 bits (79), Expect = 0.42
Identities = 37/165 (22%), Positives = 69/165 (41%), Gaps = 13/165 (7%)
Query: 1277 RKEFALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESEDDKSEVVETQSEHDVPHTN 1336
RK F R+ + GR + + + + D+ E + E +SE E E+ +
Sbjct: 266 RKFFRKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDS 325
Query: 1337 SVMSIEEDDQDEVEELESETNVSAINNEQEVERTEVTDNTSAVIPSRETIENEAVPVSVS 1396
++ QD E E ++ + N E + EV + P T +E S S
Sbjct: 326 ------QEVQDPSSESSQEADLPSQENSSESQE-EVVSESRGDNPDNTTSHSEDQEDSES 378
Query: 1397 PQEDIVDTITTSRTDHI------ESHQEINSSRETKEIPGTSDAN 1435
+ED +DT ++S + ES++ ++SS E+ E +++
Sbjct: 379 SEEDSLDTPSSSESQSTEEQADSESNESLSSSEESPESTEDENSS 423
>gnl|CDD|109486 pfam00430, ATP-synt_B, ATP synthase B/B' CF(0). Part of the CF(0)
(base unit) of the ATP synthase. The base unit is
thought to translocate protons through membrane (inner
membrane in mitochondria, thylakoid membrane in plants,
cytoplasmic membrane in bacteria). The B subunits are
thought to interact with the stalk of the CF(1)
subunits. This domain should not be confused with the ab
CF(1) proteins (in the head of the ATP synthase) which
are found in pfam00006.
Length = 132
Score = 32.6 bits (75), Expect = 0.50
Identities = 27/103 (26%), Positives = 41/103 (39%), Gaps = 11/103 (10%)
Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
PL L R + + + E+ EE + L EQ+ A A E +I
Sbjct: 23 PLGKILDERKEKIANNIKEA-EERLKQAAA-------LLAEAEQQLAQARAEASEIINN- 73
Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 951
AK + K +EI E Q E+ E+ + +E EK L +
Sbjct: 74 -AKKEAQKLK-EEILAEAQKDAERLLESARAEIEQEKEQALAE 114
Score = 31.5 bits (72), Expect = 1.2
Identities = 25/98 (25%), Positives = 39/98 (39%), Gaps = 11/98 (11%)
Query: 37 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
L R + + + E+ EE + L EQ+ A A E +I AK
Sbjct: 28 LDERKEKIANNIKEA-EERLKQAAA-------LLAEAEQQLAQARAEASEIINN--AKKE 77
Query: 97 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 134
+ K +EI E Q E+ E+ + +E EK L +
Sbjct: 78 AQKLK-EEILAEAQKDAERLLESARAEIEQEKEQALAE 114
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 34.5 bits (79), Expect = 0.50
Identities = 31/105 (29%), Positives = 58/105 (55%), Gaps = 6/105 (5%)
Query: 28 SMVVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAA-AAELEHKT 86
SMV K D K ++ +EK+L +S + ++ EKQ+ KK ++ E+ AA AA +
Sbjct: 466 SMVSAKQKD-KQSMQSMEKRL-KSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAAS 523
Query: 87 MIEK-ELAKNAL--LEAKLKEIQEEIQAQLEKYKENKKQILEMEK 128
E E K A LE ++K+++ +++ + E+ + +K+ E+ K
Sbjct: 524 REECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRK 568
Score = 34.5 bits (79), Expect = 0.57
Identities = 26/95 (27%), Positives = 53/95 (55%), Gaps = 5/95 (5%)
Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAA-AAELEHKTMIEK-ELAKN 912
K ++ +EK+L +S + ++ EKQ+ KK ++ E+ AA AA + E E K
Sbjct: 475 KQSMQSMEKRL-KSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQ 533
Query: 913 AL--LEAKLKEIQEEIQAQLEKYKENKKQILEMEK 945
A LE ++K+++ +++ + E+ + +K+ E+ K
Sbjct: 534 AKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRK 568
>gnl|CDD|216883 pfam02118, Srg, Srg family chemoreceptor.
Length = 275
Score = 33.7 bits (78), Expect = 0.61
Identities = 11/45 (24%), Positives = 23/45 (51%), Gaps = 7/45 (15%)
Query: 4 FTILSMIVFKNFILTIGSPLTSSTSMVVFKVSDLKTRLKDVEKKL 48
F ++ I+ + TI ++ + +K++ L R+K VE+ L
Sbjct: 172 FQLIYFILI--LVFTI-----ITSIVTSYKLAKLSKRIKSVERSL 209
>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS). This family
consists of several mammalian selenoprotein S (SelS)
sequences. SelS is a plasma membrane protein and is
present in a variety of tissues and cell types. The
function of this family is unknown.
Length = 190
Score = 32.9 bits (75), Expect = 0.90
Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 89 EKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 131
+K + + LEA +QE++ A+ K+KE +KQ LE EK Q
Sbjct: 76 DKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQ-LEEEKRRQ 117
Score = 32.9 bits (75), Expect = 0.90
Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 906 EKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 948
+K + + LEA +QE++ A+ K+KE +KQ LE EK Q
Sbjct: 76 DKVVRRQEALEAARLRMQEDLDAKAAKFKEKQKQ-LEEEKRRQ 117
>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
Length = 359
Score = 33.5 bits (78), Expect = 0.97
Identities = 26/102 (25%), Positives = 49/102 (48%), Gaps = 23/102 (22%)
Query: 28 SMVVFKVSDLKTRLKDVEKKLLESPEEIK---------IEY---EKQMTNYKKLRELYEQ 75
++ K+ L+ R +++E LL PE I EY E + Y++ ++ E
Sbjct: 2 PSMLDKLEALEERYEELEA-LLSDPEVISDQKRFRKLSKEYAELEPIVEAYREYKQAQED 60
Query: 76 RAAAAEL-------EHKTMIEKELAKNALLEAKLKEIQEEIQ 110
A E+ E + M ++EL + LE +L+E++EE++
Sbjct: 61 LEEAKEMLEEESDPEMREMAKEELKE---LEERLEELEEELK 99
Score = 31.6 bits (73), Expect = 4.0
Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 23/94 (24%)
Query: 853 DLKTRLKDVEKKLLESPEEIK---------IEY---EKQMTNYKKLRELYEQRAAAAEL- 899
L+ R +++E LL PE I EY E + Y++ ++ E A E+
Sbjct: 10 ALEERYEELEA-LLSDPEVISDQKRFRKLSKEYAELEPIVEAYREYKQAQEDLEEAKEML 68
Query: 900 ------EHKTMIEKELAKNALLEAKLKEIQEEIQ 927
E + M ++EL + LE +L+E++EE++
Sbjct: 69 EEESDPEMREMAKEELKE---LEERLEELEEELK 99
>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family. This family includes PspA
a protein that suppresses sigma54-dependent
transcription. The PspA protein, a negative regulator of
the Escherichia coli phage shock psp operon, is produced
when virulence factors are exported through secretins in
many Gram-negative pathogenic bacteria and its homologue
in plants, VIPP1, plays a critical role in thylakoid
biogenesis, essential for photosynthesis. Activation of
transcription by the enhancer-dependent bacterial
sigma(54) containing RNA polymerase occurs through ATP
hydrolysis-driven protein conformational changes enabled
by activator proteins that belong to the large AAA(+)
mechanochemical protein family. It has been shown that
PspA directly and specifically acts upon and binds to
the AAA(+) domain of the PspF transcription activator.
Length = 220
Score = 32.7 bits (75), Expect = 1.1
Identities = 29/124 (23%), Positives = 63/124 (50%), Gaps = 15/124 (12%)
Query: 41 LKDVEKKLLESPEEIKIEYEKQMTNYKKL-RELYEQRAAAAELEHK-----TMIEKELAK 94
++D++ +L ++ + + + + K+L R+L EQ+ A +LE+K T +ELA+
Sbjct: 32 IRDMQSELGKARQAL----AQVIARQKQLERKLEEQKEQAKKLENKARAALTKGNEELAR 87
Query: 95 NAL-----LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLA 149
AL LE + + ++ ++ Q ++ +KQ+ +E IQ + L L ++ A
Sbjct: 88 EALAEIATLEKQAEALETQLTQQRSAVEQLRKQLAALETKIQQLKAKKTALKARLKAAKA 147
Query: 150 ENHI 153
+ +
Sbjct: 148 QEAV 151
Score = 32.7 bits (75), Expect = 1.1
Identities = 29/124 (23%), Positives = 63/124 (50%), Gaps = 15/124 (12%)
Query: 858 LKDVEKKLLESPEEIKIEYEKQMTNYKKL-RELYEQRAAAAELEHK-----TMIEKELAK 911
++D++ +L ++ + + + + K+L R+L EQ+ A +LE+K T +ELA+
Sbjct: 32 IRDMQSELGKARQAL----AQVIARQKQLERKLEEQKEQAKKLENKARAALTKGNEELAR 87
Query: 912 NAL-----LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLA 966
AL LE + + ++ ++ Q ++ +KQ+ +E IQ + L L ++ A
Sbjct: 88 EALAEIATLEKQAEALETQLTQQRSAVEQLRKQLAALETKIQQLKAKKTALKARLKAAKA 147
Query: 967 ENHI 970
+ +
Sbjct: 148 QEAV 151
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 32.7 bits (75), Expect = 1.2
Identities = 26/99 (26%), Positives = 53/99 (53%), Gaps = 3/99 (3%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
+ L+ L+D+E ++ + EI+ E+ +KL + ++R A L + I KE
Sbjct: 46 ALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRA-LNIEIQIAKER 104
Query: 93 AKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 131
+ LE +L E+ EEI+ ++ ++ K+++ +EKN+
Sbjct: 105 INS--LEDELAELMEEIEKLEKEIEDLKERLERLEKNLA 141
Score = 32.3 bits (74), Expect = 1.8
Identities = 26/98 (26%), Positives = 53/98 (54%), Gaps = 3/98 (3%)
Query: 851 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELA 910
+ L+ L+D+E ++ + EI+ E+ +KL + ++R A L + I KE
Sbjct: 47 LEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRA-LNIEIQIAKERI 105
Query: 911 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 948
+ LE +L E+ EEI+ ++ ++ K+++ +EKN+
Sbjct: 106 NS--LEDELAELMEEIEKLEKEIEDLKERLERLEKNLA 141
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 33.5 bits (77), Expect = 1.2
Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 4/119 (3%)
Query: 34 VSDLKTRLKDVEKKL--LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 91
+ DL+ +++VE++L E E + E+ + L EL +R E E + E+
Sbjct: 484 LEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIE-EKRERAEEL 542
Query: 92 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 150
+ A LEA+ +E +E E+ +E ++++ E+ + + I +L + + LA
Sbjct: 543 RERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLE-RIRTLLAA 600
Score = 33.5 bits (77), Expect = 1.2
Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 4/119 (3%)
Query: 851 VSDLKTRLKDVEKKL--LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
+ DL+ +++VE++L E E + E+ + L EL +R E E + E+
Sbjct: 484 LEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIE-EKRERAEEL 542
Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAE 967
+ A LEA+ +E +E E+ +E ++++ E+ + + I +L + + LA
Sbjct: 543 RERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLE-RIRTLLAA 600
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 33.3 bits (77), Expect = 1.3
Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 20/102 (19%)
Query: 31 VFKVSDLKTRLKDV------EKKLLESPEEIKIEYEKQMTNYKKLRELY--EQ-RAAAAE 81
+ + D+K RL+ + E LL+ + I+ + ++QM K RE Y EQ +A E
Sbjct: 184 ILETLDVKERLEKLLDLLEKEIDLLQLEKRIRKKVKEQME--KSQREYYLREQLKAIQKE 241
Query: 82 LEHKTMIEKELAKNALLEAKL------KEIQEEIQAQLEKYK 117
L + E+ + L K+ KE +E+ + +L+K +
Sbjct: 242 LGEDDDDKDEVEE---LREKIEKLKLPKEAKEKAEKELKKLE 280
Score = 32.9 bits (76), Expect = 1.6
Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 20/97 (20%)
Query: 853 DLKTRLKDV------EKKLLESPEEIKIEYEKQMTNYKKLRELY--EQ-RAAAAELEHKT 903
D+K RL+ + E LL+ + I+ + ++QM K RE Y EQ +A EL
Sbjct: 189 DVKERLEKLLDLLEKEIDLLQLEKRIRKKVKEQME--KSQREYYLREQLKAIQKELGEDD 246
Query: 904 MIEKELAKNALLEAKL------KEIQEEIQAQLEKYK 934
+ E+ + L K+ KE +E+ + +L+K +
Sbjct: 247 DDKDEVEE---LREKIEKLKLPKEAKEKAEKELKKLE 280
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 33.3 bits (76), Expect = 1.4
Identities = 34/233 (14%), Positives = 64/233 (27%), Gaps = 15/233 (6%)
Query: 538 TLQREPITSVPTATSNTTLPSEPSVSRTRTPEEILEARNARLKRLEDQCKNLFHKMSLTS 597
R + S ++ + +V + E ++A + C H S +
Sbjct: 23 GSLRLGLRSNYLQSTVEDIEGPSAVDERTSGVLRDEGKHANILYNSILCNQKKHA-SFLN 81
Query: 598 QRSDTLSERLEELHDYYGTSNERHSPRRHREGSQDSDDTRLRSPENEPLSGVSANVPDSQ 657
QR + +E Y + E S D D L P+ + + Q
Sbjct: 82 QRKSLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSL-LELPDRDE--DADTQANNDQ 138
Query: 658 ENSQESDDTRRRSAETNESHSGVSGNVSDL--QDRIDT--RLKSSPEINECTSAVSAVSE 713
N + DD+ + S + L + + T K N+ A +
Sbjct: 139 TNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEA----AA 194
Query: 714 ESSGLRSVDAIRPVLNHDEMTRRNVEAESRADAV---VQDRIDTRLKSSPGIN 763
+ G P+ + D + D + PG+
Sbjct: 195 KDGGKSKSSDPGPLNDSDGQGDDGDPESAEEDKAASNTRAAYTKATSVFPGLY 247
>gnl|CDD|191114 pfam04859, DUF641, Plant protein of unknown function (DUF641).
Plant protein of unknown function.
Length = 132
Score = 31.6 bits (72), Expect = 1.4
Identities = 17/64 (26%), Positives = 28/64 (43%)
Query: 94 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 153
K + + EIQ Q K + + ++E ++ KD EI++L E L L N
Sbjct: 68 KRQKGAPQDAILAAEIQEQRSLLKTYEIMVKKLEAEVRAKDSEIDSLREKLEELLVANSK 127
Query: 154 LNSQ 157
L +
Sbjct: 128 LEKR 131
Score = 31.6 bits (72), Expect = 1.4
Identities = 17/64 (26%), Positives = 28/64 (43%)
Query: 911 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHI 970
K + + EIQ Q K + + ++E ++ KD EI++L E L L N
Sbjct: 68 KRQKGAPQDAILAAEIQEQRSLLKTYEIMVKKLEAEVRAKDSEIDSLREKLEELLVANSK 127
Query: 971 LNSQ 974
L +
Sbjct: 128 LEKR 131
>gnl|CDD|216872 pfam02096, 60KD_IMP, 60Kd inner membrane protein.
Length = 193
Score = 32.1 bits (74), Expect = 1.5
Identities = 13/30 (43%), Positives = 22/30 (73%), Gaps = 1/30 (3%)
Query: 100 AKLKEIQEEIQAQLEKYKENKKQI-LEMEK 128
AK++E+Q EI+ EKYK++ +++ EM K
Sbjct: 30 AKMQELQPEIKEIQEKYKDDPQKLQQEMMK 59
Score = 32.1 bits (74), Expect = 1.5
Identities = 13/30 (43%), Positives = 22/30 (73%), Gaps = 1/30 (3%)
Query: 917 AKLKEIQEEIQAQLEKYKENKKQI-LEMEK 945
AK++E+Q EI+ EKYK++ +++ EM K
Sbjct: 30 AKMQELQPEIKEIQEKYKDDPQKLQQEMMK 59
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 31.8 bits (73), Expect = 1.6
Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 88 IEKELAKNA--LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEIN 138
I++ LA+ A LLE + + E+QA++ +YK+ K + K ++ K E
Sbjct: 49 IKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAE 101
Score = 31.8 bits (73), Expect = 1.6
Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 905 IEKELAKNA--LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEIN 955
I++ LA+ A LLE + + E+QA++ +YK+ K + K ++ K E
Sbjct: 49 IKQHLAELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAE 101
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 31.8 bits (73), Expect = 1.7
Identities = 30/111 (27%), Positives = 58/111 (52%), Gaps = 22/111 (19%)
Query: 37 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 96
LKTRL+ ++K+L E + I E + Q+ KK RE E+R
Sbjct: 67 LKTRLEKLKKELEELKQRIA-ELQAQIEKLKKGREETEER------------------TE 107
Query: 97 LLEAK--LKEIQEEIQAQLEKYKENK-KQILEMEKNIQLKDDEINNLNENL 144
LLE L++ ++++A+LEKY++N ++I ++++ ++ + N +N+
Sbjct: 108 LLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAKEAANRWTDNI 158
Score = 31.8 bits (73), Expect = 1.7
Identities = 30/111 (27%), Positives = 58/111 (52%), Gaps = 22/111 (19%)
Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNA 913
LKTRL+ ++K+L E + I E + Q+ KK RE E+R
Sbjct: 67 LKTRLEKLKKELEELKQRIA-ELQAQIEKLKKGREETEER------------------TE 107
Query: 914 LLEAK--LKEIQEEIQAQLEKYKENK-KQILEMEKNIQLKDDEINNLNENL 961
LLE L++ ++++A+LEKY++N ++I ++++ ++ + N +N+
Sbjct: 108 LLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAKEAANRWTDNI 158
>gnl|CDD|232831 TIGR00115, tig, trigger factor. Trigger factor is a
ribosome-associated molecular chaperone and is the first
chaperone to interact with nascent polypeptide. Trigger
factor can bind at the same time as the signal
recognition particle (SRP), but is excluded by the SRP
receptor (FtsY). The central domain of trigger factor
has peptidyl-prolyl cis/trans isomerase activity. This
protein is found in a single copy in virtually every
bacterial genome [Protein fate, Protein folding and
stabilization].
Length = 408
Score = 32.5 bits (75), Expect = 1.8
Identities = 21/102 (20%), Positives = 49/102 (48%), Gaps = 16/102 (15%)
Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
+ L+ + D+E+ L ++ EE++ E+ ++ KL +I +E+
Sbjct: 307 ALQQLQQQGIDLEEYLKDTEEELREEFREEAERRVKLG----------------LILEEI 350
Query: 910 AKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 951
AK +E +E++ EI+ ++Y E+ +++ + K L +
Sbjct: 351 AKKEKIEVSEEEVEAEIEELAQQYGEDPEEVKKYYKKNGLLE 392
Score = 32.5 bits (75), Expect = 1.8
Identities = 21/100 (21%), Positives = 48/100 (48%), Gaps = 16/100 (16%)
Query: 35 SDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK 94
L+ + D+E+ L ++ EE++ E+ ++ KL +I +E+AK
Sbjct: 309 QQLQQQGIDLEEYLKDTEEELREEFREEAERRVKLG----------------LILEEIAK 352
Query: 95 NALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 134
+E +E++ EI+ ++Y E+ +++ + K L +
Sbjct: 353 KEKIEVSEEEVEAEIEELAQQYGEDPEEVKKYYKKNGLLE 392
>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
Length = 156
Score = 31.5 bits (72), Expect = 2.0
Identities = 19/104 (18%), Positives = 47/104 (45%), Gaps = 19/104 (18%)
Query: 47 KLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQ 106
++L E+ +YE++++ +K +L ++ E K ++E EL
Sbjct: 67 EILAKANELTKQYEQELSKARKEAQL---EITQSQKEAKEIVENEL-------------- 109
Query: 107 EEIQAQLEKYKENKKQILEMEKNIQLK--DDEINNLNENLTSSL 148
++ Q ++ + LE +K LK +++++ L++ + L
Sbjct: 110 KQAQKYIDSLLNEATKQLEAQKEKALKSLEEQVDTLSDQIEEKL 153
Score = 31.5 bits (72), Expect = 2.0
Identities = 19/104 (18%), Positives = 47/104 (45%), Gaps = 19/104 (18%)
Query: 864 KLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQ 923
++L E+ +YE++++ +K +L ++ E K ++E EL
Sbjct: 67 EILAKANELTKQYEQELSKARKEAQL---EITQSQKEAKEIVENEL-------------- 109
Query: 924 EEIQAQLEKYKENKKQILEMEKNIQLK--DDEINNLNENLTSSL 965
++ Q ++ + LE +K LK +++++ L++ + L
Sbjct: 110 KQAQKYIDSLLNEATKQLEAQKEKALKSLEEQVDTLSDQIEEKL 153
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 32.3 bits (74), Expect = 2.0
Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 14/90 (15%)
Query: 50 ESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEI 109
E+ EE ++ ++ +T +K E +A AAE A+ LL K KE ++ +
Sbjct: 186 EAVEEAILQTDQALTAKEKAIEAERAKAEAAE-----------AEQELLREKQKEEEQMM 234
Query: 110 QAQLEKYKENKKQI---LEMEKNIQLKDDE 136
+AQ Y+E+ KQ+ +E E+ L + E
Sbjct: 235 EAQERSYQEHVKQLIEKMEAEREKLLAEQE 264
Score = 32.3 bits (74), Expect = 2.0
Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 14/90 (15%)
Query: 867 ESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEI 926
E+ EE ++ ++ +T +K E +A AAE A+ LL K KE ++ +
Sbjct: 186 EAVEEAILQTDQALTAKEKAIEAERAKAEAAE-----------AEQELLREKQKEEEQMM 234
Query: 927 QAQLEKYKENKKQI---LEMEKNIQLKDDE 953
+AQ Y+E+ KQ+ +E E+ L + E
Sbjct: 235 EAQERSYQEHVKQLIEKMEAEREKLLAEQE 264
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 32.4 bits (74), Expect = 2.2
Identities = 23/111 (20%), Positives = 52/111 (46%), Gaps = 5/111 (4%)
Query: 34 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTM-IEKEL 92
+ D + LK + K +E+ EEI+ + +KK ++ + E + + I+K+
Sbjct: 148 LPDPRALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKD 207
Query: 93 AKNAL----LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINN 139
KN L ++ + ++++E +A +K K K + +K ++ E+
Sbjct: 208 IKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKK 258
Score = 32.4 bits (74), Expect = 2.2
Identities = 23/111 (20%), Positives = 52/111 (46%), Gaps = 5/111 (4%)
Query: 851 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTM-IEKEL 909
+ D + LK + K +E+ EEI+ + +KK ++ + E + + I+K+
Sbjct: 148 LPDPRALLKGEDLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKD 207
Query: 910 AKNAL----LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINN 956
KN L ++ + ++++E +A +K K K + +K ++ E+
Sbjct: 208 IKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKK 258
>gnl|CDD|218176 pfam04614, Pex19, Pex19 protein family.
Length = 244
Score = 31.9 bits (73), Expect = 2.2
Identities = 34/162 (20%), Positives = 61/162 (37%), Gaps = 25/162 (15%)
Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
L DL+ + ++ + E E K ++EK + EL E+ AAA++ K
Sbjct: 8 ELAKDLQKGMAELLGEDGEEDPETKAQFEKLLK------ELGEEEAAASKEGEK------ 55
Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAEN 968
+ A ++ I LE+ KE+ N E +++L + L +
Sbjct: 56 PSAAAGESGTDPSFKDVISETLERLKESGD-------NADAAASEDKKNSDDLLAQLLKQ 108
Query: 969 HILNSQMTLVNNLFSNMLINPMINL---DML---IKLIQDNY 1004
N + S +L M L ++L +K + D Y
Sbjct: 109 LGGNGDGGGGDGDMSKLLQGMMEQLSSKEVLYEPLKELNDKY 150
Score = 31.2 bits (71), Expect = 3.4
Identities = 33/158 (20%), Positives = 60/158 (37%), Gaps = 25/158 (15%)
Query: 36 DLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKN 95
DL+ + ++ + E E K ++EK + EL E+ AAA++ K +
Sbjct: 12 DLQKGMAELLGEDGEEDPETKAQFEKLLK------ELGEEEAAASKEGEK------PSAA 59
Query: 96 ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILN 155
A ++ I LE+ KE+ N E +++L + L + N
Sbjct: 60 AGESGTDPSFKDVISETLERLKESGD-------NADAAASEDKKNSDDLLAQLLKQLGGN 112
Query: 156 SQMTLVNNLFSNMLINPMINL---DML---IKLIQDNY 187
+ S +L M L ++L +K + D Y
Sbjct: 113 GDGGGGDGDMSKLLQGMMEQLSSKEVLYEPLKELNDKY 150
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 32.3 bits (74), Expect = 2.3
Identities = 13/67 (19%), Positives = 25/67 (37%), Gaps = 4/67 (5%)
Query: 67 KKLRELYEQRAAAAELEH----KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ 122
K+LRE + A E+E A+ L+ ++ ++ EI E E + +
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 123 ILEMEKN 129
+
Sbjct: 62 PAASGEG 68
Score = 32.3 bits (74), Expect = 2.3
Identities = 13/67 (19%), Positives = 25/67 (37%), Gaps = 4/67 (5%)
Query: 884 KKLRELYEQRAAAAELEH----KTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ 939
K+LRE + A E+E A+ L+ ++ ++ EI E E + +
Sbjct: 2 KELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAK 61
Query: 940 ILEMEKN 946
+
Sbjct: 62 PAASGEG 68
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 32.5 bits (74), Expect = 2.3
Identities = 43/220 (19%), Positives = 82/220 (37%), Gaps = 8/220 (3%)
Query: 102 LKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNS--QMT 159
L + I Y ++ K++L + I+ + + L +L + + E I N +
Sbjct: 242 LVSNENLINLLNLLYSKDIKEVLNILNQIKEQGIDPELLLISLINLVKEFIIFNKTKDNS 301
Query: 160 LVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLV-DLEKKIDQETL 218
L+ L I+ D K I+ + L+ DL + NDT+ +L+ L + E
Sbjct: 302 LLEYYSEEDLEKLKIDDDFAYKFIEILFDLLKDLKISDNPNDTLEILIIKLLALSELEEE 361
Query: 219 AIQNDKQLKQLENYETPMSSAMVEAVESNQTSTKAVESN-----QTSTSKDSANIQLDHA 273
K K EN + + N K+ N +T +K+ + ++
Sbjct: 362 DENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNKNSSFINKT 421
Query: 274 ETNTNEESPADQDETNSTTTLDKSHIQKEITRVISNSLIS 313
E D+ +T ++ + Q+ I N +IS
Sbjct: 422 ENILTNSPLKDELLEKTTEIINIENPQEFEFGQIGNDIIS 461
>gnl|CDD|234272 TIGR03592, yidC_oxa1_cterm, membrane protein insertase, YidC/Oxa1
family, C-terminal domain. This model describes
full-length from some species, and the C-terminal region
only from other species, of the YidC/Oxa1 family of
proteins. This domain appears to be univeral among
bacteria (although absent from Archaea). The
well-characterized YidC protein from Escherichia coli
and its close homologs contain a large N-terminal
periplasmic domain in addition to the region modeled
here [Protein fate, Protein and peptide secretion and
trafficking].
Length = 181
Score = 31.3 bits (72), Expect = 2.4
Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 1/28 (3%)
Query: 100 AKLKEIQEEIQAQLEKYKENK-KQILEM 126
K++E+Q +++ EKYK++ K EM
Sbjct: 29 RKMQELQPKLKEIQEKYKDDPQKLQQEM 56
Score = 31.3 bits (72), Expect = 2.4
Identities = 10/28 (35%), Positives = 18/28 (64%), Gaps = 1/28 (3%)
Query: 917 AKLKEIQEEIQAQLEKYKENK-KQILEM 943
K++E+Q +++ EKYK++ K EM
Sbjct: 29 RKMQELQPKLKEIQEKYKDDPQKLQQEM 56
>gnl|CDD|225283 COG2427, COG2427, Uncharacterized conserved protein [Function
unknown].
Length = 148
Score = 31.2 bits (71), Expect = 2.4
Identities = 26/102 (25%), Positives = 43/102 (42%), Gaps = 6/102 (5%)
Query: 96 ALLEAKLKEIQEEIQAQLEKYKENKKQILE-MEKNIQLKDDEINNLNENLTSSLAENHIL 154
+E KE EE+ LEK ENK I E M+ LK+ + L+ L +
Sbjct: 4 KTIEQLKKEKLEEL---LEKLAENKDAIKELMDLLKLLKESGL--LDALLGLRAKADIAK 58
Query: 155 NSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINN 196
+ L L N+L N +I L +L + + +++ +
Sbjct: 59 KLKDELAKELIENLLNNMLIMLGLLSLIDSERLSKLVENLIK 100
Score = 31.2 bits (71), Expect = 2.4
Identities = 26/102 (25%), Positives = 43/102 (42%), Gaps = 6/102 (5%)
Query: 913 ALLEAKLKEIQEEIQAQLEKYKENKKQILE-MEKNIQLKDDEINNLNENLTSSLAENHIL 971
+E KE EE+ LEK ENK I E M+ LK+ + L+ L +
Sbjct: 4 KTIEQLKKEKLEEL---LEKLAENKDAIKELMDLLKLLKESGL--LDALLGLRAKADIAK 58
Query: 972 NSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLIIDLINN 1013
+ L L N+L N +I L +L + + +++ +
Sbjct: 59 KLKDELAKELIENLLNNMLIMLGLLSLIDSERLSKLVENLIK 100
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 32.4 bits (74), Expect = 2.5
Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 10/87 (11%)
Query: 74 EQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQA---QLEKYKENKKQILEMEKNI 130
+ AA +I++ELA N L L + E + A + Q+L+ + I
Sbjct: 46 AEPAAEEAELQAELIQQELAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSSRTI 105
Query: 131 QLKDDEINNLNENLTSSLAENHILNSQ 157
E + L SL + IL Q
Sbjct: 106 ----REQIAV---LRGSLLLSRILLQQ 125
Score = 32.4 bits (74), Expect = 2.5
Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 10/87 (11%)
Query: 891 EQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQA---QLEKYKENKKQILEMEKNI 947
+ AA +I++ELA N L L + E + A + Q+L+ + I
Sbjct: 46 AEPAAEEAELQAELIQQELAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSSRTI 105
Query: 948 QLKDDEINNLNENLTSSLAENHILNSQ 974
E + L SL + IL Q
Sbjct: 106 ----REQIAV---LRGSLLLSRILLQQ 125
>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869). This
bacterial family of proteins has no known function.
Length = 131
Score = 30.6 bits (70), Expect = 2.6
Identities = 16/70 (22%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 68 KLRELYEQRAAAAELEHKTMIEKELA------KNALLEAKLK------EIQEEIQAQLEK 115
+ E +++ A E T +E+EL + A+ E + + +IQ++++ + K
Sbjct: 12 IVTERWKEELQAELQEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAK 71
Query: 116 YKENKKQILE 125
E K Q+L
Sbjct: 72 RLEQKNQLLF 81
Score = 30.6 bits (70), Expect = 2.6
Identities = 16/70 (22%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 885 KLRELYEQRAAAAELEHKTMIEKELA------KNALLEAKLK------EIQEEIQAQLEK 932
+ E +++ A E T +E+EL + A+ E + + +IQ++++ + K
Sbjct: 12 IVTERWKEELQAELQEQLTQLEQELQQLEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAK 71
Query: 933 YKENKKQILE 942
E K Q+L
Sbjct: 72 RLEQKNQLLF 81
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 32.1 bits (73), Expect = 2.7
Identities = 30/90 (33%), Positives = 38/90 (42%), Gaps = 12/90 (13%)
Query: 40 RLKDVEKKLLESPEEIK--IEYEKQMTNYKKLRELYEQRAAAAE-----LEHKTMIEKEL 92
RLK +EK+ L + E+ K E KQ +K E E A AA E +
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAE--EAAAKAAAAAKAKAEAEAKRAAAA 159
Query: 93 AKNALLEAKLKEIQEEIQAQLEKYKENKKQ 122
AK A EAK K E A + E KK+
Sbjct: 160 AKKAAAEAKKKAEAE---AAKKAAAEAKKK 186
Score = 32.1 bits (73), Expect = 2.7
Identities = 30/90 (33%), Positives = 38/90 (42%), Gaps = 12/90 (13%)
Query: 857 RLKDVEKKLLESPEEIK--IEYEKQMTNYKKLRELYEQRAAAAE-----LEHKTMIEKEL 909
RLK +EK+ L + E+ K E KQ +K E E A AA E +
Sbjct: 102 RLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAE--EAAAKAAAAAKAKAEAEAKRAAAA 159
Query: 910 AKNALLEAKLKEIQEEIQAQLEKYKENKKQ 939
AK A EAK K E A + E KK+
Sbjct: 160 AKKAAAEAKKKAEAE---AAKKAAAEAKKK 186
>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
homologs [Posttranslational modification, protein
turnover, chaperones].
Length = 291
Score = 31.6 bits (72), Expect = 2.9
Identities = 28/167 (16%), Positives = 57/167 (34%), Gaps = 19/167 (11%)
Query: 836 ESNILATTRSVILPLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAA 895
+ I A R ++ ++ DVE K ++ PEE++ EKQM +R
Sbjct: 139 RAEINAKIREILDEAADPWGIKVVDVEIKDIDPPEEVQAAMEKQMA---------AERDK 189
Query: 896 AAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEIN 955
AE+ + A EA+ I E +A+ E + ++ +
Sbjct: 190 RAEILEAEGEAQAAILRAEGEAEAAIILAEAEAEAEVIARAEADAAKIIAAALREAPAAP 249
Query: 956 NLN-ENLTSSLAENHILNSQMTLVNNLFSNMLINPMINLDMLIKLIQ 1001
+ L + + S +++ P ++ L++
Sbjct: 250 QALAQRY---------LEELLEIALAGNSKVVVVPNSAGGAILGLLE 287
Score = 30.5 bits (69), Expect = 7.9
Identities = 25/152 (16%), Positives = 51/152 (33%), Gaps = 19/152 (12%)
Query: 34 VSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELA 93
++ DVE K ++ PEE++ EKQM +R AE+ +
Sbjct: 154 ADPWGIKVVDVEIKDIDPPEEVQAAMEKQMA---------AERDKRAEILEAEGEAQAAI 204
Query: 94 KNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN-ENLTSSLAENH 152
A EA+ I E +A+ E + ++ + +
Sbjct: 205 LRAEGEAEAAIILAEAEAEAEVIARAEADAAKIIAAALREAPAAPQALAQRY-------- 256
Query: 153 ILNSQMTLVNNLFSNMLINPMINLDMLIKLIQ 184
L + + S +++ P ++ L++
Sbjct: 257 -LEELLEIALAGNSKVVVVPNSAGGAILGLLE 287
>gnl|CDD|237440 PRK13588, PRK13588, flagellin B; Provisional.
Length = 514
Score = 32.1 bits (73), Expect = 3.0
Identities = 22/85 (25%), Positives = 46/85 (54%), Gaps = 7/85 (8%)
Query: 1288 DGGRSTSES-GISTDTEDDNVEIVETNDESEDDKSEVVET------QSEHDVPHTNSVMS 1340
D RS S + G + +D + +V+T D++ D++ ++++T Q+ D S +
Sbjct: 52 DSLRSQSANLGQAIRNANDAIGMVQTADKAMDEQIKILDTIKTKAVQAAQDGQTLESRRA 111
Query: 1341 IEEDDQDEVEELESETNVSAINNEQ 1365
++ D Q +EEL++ N ++ N +Q
Sbjct: 112 LQSDIQRLLEELDNIANTTSFNGQQ 136
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 31.7 bits (72), Expect = 3.0
Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 7/51 (13%)
Query: 71 ELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKK 121
+L E++A + + + +K+ E K +E QE+I+ +EK +E KK
Sbjct: 11 KLAEKQAKSKLQQLERARDKQ-------EKKAEEYQEQIKKAIEKAEEAKK 54
Score = 31.7 bits (72), Expect = 3.0
Identities = 15/51 (29%), Positives = 28/51 (54%), Gaps = 7/51 (13%)
Query: 888 ELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKK 938
+L E++A + + + +K+ E K +E QE+I+ +EK +E KK
Sbjct: 11 KLAEKQAKSKLQQLERARDKQ-------EKKAEEYQEQIKKAIEKAEEAKK 54
Score = 30.9 bits (70), Expect = 6.1
Identities = 11/46 (23%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 86 TMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 131
+I K K A ++KL++++ Q +K +E ++QI + + +
Sbjct: 7 ELISKLAEKQA--KSKLQQLERARDKQEKKAEEYQEQIKKAIEKAE 50
Score = 30.9 bits (70), Expect = 6.1
Identities = 11/46 (23%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 903 TMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQ 948
+I K K A ++KL++++ Q +K +E ++QI + + +
Sbjct: 7 ELISKLAEKQA--KSKLQQLERARDKQEKKAEEYQEQIKKAIEKAE 50
>gnl|CDD|216500 pfam01432, Peptidase_M3, Peptidase family M3. This is the Thimet
oligopeptidase family, large family of mammalian and
bacterial oligopeptidases that cleave medium sized
peptides. The group also contains mitochondrial
intermediate peptidase which is encoded by nuclear DNA
but functions within the mitochondria to remove the
leader sequence.
Length = 450
Score = 32.0 bits (73), Expect = 3.2
Identities = 29/120 (24%), Positives = 53/120 (44%), Gaps = 11/120 (9%)
Query: 1171 LILEKNHLTNQLAKLKTLNNHLEIRLNHSEQRLNTVTTELKKTWNVVNKLKLQHKQLYTH 1230
L+ E L +LAKL ++ E L E ++ + + + + +L + + L
Sbjct: 34 LLEELLKLRAELAKLLGYPSYAEASL---EDKMAKIPETVY---DFLEELVNKLRPLLHR 87
Query: 1231 EKILKYELKHKRNIINDLKI-DLEYVREKWDLVKEKNEQNELDYKVLRKEFALRRLQYDG 1289
E L +LK K + +L+ D+ Y EK + + + LD + LR F L ++ G
Sbjct: 88 ELELLKKLKKKELGLEELQPWDVAYYSEK----QREELYDPLDQEELRPYFPLEQVLEKG 143
>gnl|CDD|233137 TIGR00811, sit, silicon transporter. Marine diatoms such as
Cylindrotheca fusiformis encode at least six silicon
transport protein homologues which exhibit similar size
and topology. One characterized member of the family
(Sit1) functions in the energy-dependent uptake of
either Silicic acid [Si(OH)4] or Silicate [Si(OH)3O-] by
a Na+ symport mechanism. The system is found in marine
diatoms which make their "glass houses" out of silicon
[Transport and binding proteins, Other].
Length = 545
Score = 31.9 bits (72), Expect = 3.3
Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 3/60 (5%)
Query: 49 LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEE 108
L + E + E+ + N ++LRE+ + E KT E LA KL +EE
Sbjct: 474 LSALETFSVRRERMLKNIRELREMINNAISDEE---KTTFEAALAIEVKALDKLNAEEEE 530
Score = 31.9 bits (72), Expect = 3.3
Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 3/60 (5%)
Query: 866 LESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEE 925
L + E + E+ + N ++LRE+ + E KT E LA KL +EE
Sbjct: 474 LSALETFSVRRERMLKNIRELREMINNAISDEE---KTTFEAALAIEVKALDKLNAEEEE 530
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 32.1 bits (72), Expect = 3.4
Identities = 30/203 (14%), Positives = 61/203 (30%), Gaps = 14/203 (6%)
Query: 483 TEETDESRPFESAECSRTSQPEVENHISNTDTIETAYSEMSEPIEELHTERLSTETLQRE 542
T T + P +++ SRT+ + + T + P + ++ + +
Sbjct: 475 TGTTSSTLPEDTSPTSRTTSA---TPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPI 531
Query: 543 PITSVPTATSNTTLPSEPSVSRTRTPEEILEARNARLKRLEDQCKNLFH-----KMSLTS 597
I TATS T + + + E N + L + S
Sbjct: 532 VIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGS 591
Query: 598 QRSDTLSERLEELHDYYGTSNERHSP---RRHREGSQDSDDTRLRSPENEPLSGVSANVP 654
+ H +++ +P H G ++ + P +S +S
Sbjct: 592 SPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPG 651
Query: 655 D---SQENSQESDDTRRRSAETN 674
SQ + + T R E +
Sbjct: 652 PGTTSQVSGPGNSSTSRYPGEVH 674
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 31.9 bits (73), Expect = 3.5
Identities = 31/133 (23%), Positives = 56/133 (42%), Gaps = 41/133 (30%)
Query: 31 VFKVSDLKTRLKDV------EKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEH 84
V + +++ RLK E +LL+ +I + E++M K RE Y
Sbjct: 181 VLETVNIEKRLKKALELLKKELELLKLQNKITKKVEEKME--KTQREYY----------- 227
Query: 85 KTMIEKELAKNALLEAKLKEIQEEI------QAQLEKYKENKKQILEMEKNIQLK-DDEI 137
L +LK I++E+ + +LEK KE K + L++ + ++ + E+
Sbjct: 228 -------------LREQLKAIKKELGIEKDDKDELEKLKE-KLEELKLPEEVKKVIEKEL 273
Query: 138 NNLNENLTSSLAE 150
L L S +E
Sbjct: 274 TKL-SLLEPSSSE 285
Score = 30.7 bits (70), Expect = 7.4
Identities = 30/128 (23%), Positives = 54/128 (42%), Gaps = 41/128 (32%)
Query: 853 DLKTRLKDV------EKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIE 906
+++ RLK E +LL+ +I + E++M K RE Y
Sbjct: 186 NIEKRLKKALELLKKELELLKLQNKITKKVEEKME--KTQREYY---------------- 227
Query: 907 KELAKNALLEAKLKEIQEEI------QAQLEKYKENKKQILEMEKNIQLK-DDEINNLNE 959
L +LK I++E+ + +LEK KE K + L++ + ++ + E+ L
Sbjct: 228 --------LREQLKAIKKELGIEKDDKDELEKLKE-KLEELKLPEEVKKVIEKELTKL-S 277
Query: 960 NLTSSLAE 967
L S +E
Sbjct: 278 LLEPSSSE 285
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 30.7 bits (70), Expect = 3.6
Identities = 15/87 (17%), Positives = 40/87 (45%), Gaps = 10/87 (11%)
Query: 69 LRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENK-----KQI 123
++++ + A + +EKE K L+A+L++ ++E+Q + +K ++ +
Sbjct: 23 VQKVLSESPAGKAAQ--KQLEKEFKK---LQAELQKKEKELQKEEQKLQKQAATLSEEAR 77
Query: 124 LEMEKNIQLKDDEINNLNENLTSSLAE 150
++ +Q K E+ + L +
Sbjct: 78 KAKQQELQQKQQELQQKQQAAQQELQQ 104
Score = 30.7 bits (70), Expect = 3.6
Identities = 15/87 (17%), Positives = 40/87 (45%), Gaps = 10/87 (11%)
Query: 886 LRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENK-----KQI 940
++++ + A + +EKE K L+A+L++ ++E+Q + +K ++ +
Sbjct: 23 VQKVLSESPAGKAAQ--KQLEKEFKK---LQAELQKKEKELQKEEQKLQKQAATLSEEAR 77
Query: 941 LEMEKNIQLKDDEINNLNENLTSSLAE 967
++ +Q K E+ + L +
Sbjct: 78 KAKQQELQQKQQELQQKQQAAQQELQQ 104
Score = 29.5 bits (67), Expect = 8.7
Identities = 18/85 (21%), Positives = 46/85 (54%), Gaps = 4/85 (4%)
Query: 42 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 101
K +K+L + ++++ E +K+ +K + +++AA E + ++EL + + +
Sbjct: 34 KAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQ---KQQE 90
Query: 102 LKEIQEEIQAQL-EKYKENKKQILE 125
L++ Q+ Q +L +K +E + I +
Sbjct: 91 LQQKQQAAQQELQQKQQELLQPIYD 115
Score = 29.5 bits (67), Expect = 8.7
Identities = 18/85 (21%), Positives = 46/85 (54%), Gaps = 4/85 (4%)
Query: 859 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 918
K +K+L + ++++ E +K+ +K + +++AA E + ++EL + + +
Sbjct: 34 KAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQ---KQQE 90
Query: 919 LKEIQEEIQAQL-EKYKENKKQILE 942
L++ Q+ Q +L +K +E + I +
Sbjct: 91 LQQKQQAAQQELQQKQQELLQPIYD 115
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is part
of complex B but is not, however, required for the core
subunits to stay associated. This protein is known as
Huntington-interacting protein-1 in humans.
Length = 355
Score = 31.6 bits (72), Expect = 3.7
Identities = 21/93 (22%), Positives = 35/93 (37%), Gaps = 10/93 (10%)
Query: 1277 RKEFALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESEDDKSEVVETQSEHDVPHTN 1336
+K F+ +R +Y E + D + +E VE E E+ + TQ
Sbjct: 116 KKGFSFKRPKY-PNEEDEEENVDEDDAEIILEEVEEEVEIEEV-DDDEGTQETKYKRGDT 173
Query: 1337 SVMSIEEDDQDEVEELESETNVSAINNEQEVER 1369
S+ +D E+ + A + EVER
Sbjct: 174 SLTPQAKDVL--------ESLIDAAEWKLEVER 198
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 31.8 bits (72), Expect = 3.8
Identities = 22/143 (15%), Positives = 58/143 (40%), Gaps = 16/143 (11%)
Query: 1291 RSTSESGISTDTEDDNVE---IVETNDESEDDKSEVVETQSEHDVPHTNSVMSIEEDDQ- 1346
+ + + + +D+ + +E ++ D K EVV + V +S +++ D+
Sbjct: 9 ANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEV 68
Query: 1347 DEVEELESETNVSAINNEQEVERTEVTDNTSAVIPSRETIENEAVPVSVSPQEDIVDTIT 1406
+ + ES+ + + ++E ++ + IP S P+E I+ +
Sbjct: 69 KKSTKEESKQLLEVLKTKEEHQKEIQYEILQKTIP------------SFEPKESILKKLE 116
Query: 1407 TSRTDHIESHQEINSSRETKEIP 1429
+ + + ++ E K++P
Sbjct: 117 DIKPEQAKKQTKLFRLFEPKQLP 139
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 30.3 bits (69), Expect = 4.3
Identities = 26/98 (26%), Positives = 42/98 (42%), Gaps = 11/98 (11%)
Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
P++ L R + L E+ E +K E + L YEQ A + +IE+
Sbjct: 30 PILKALDERQAKIADDLAEA-ERLKEEAQA-------LLAEYEQELEEAREQASEIIEQ- 80
Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKN 946
E +EI+ E + +LE+ KE + +E EK
Sbjct: 81 --AKKEAEQIAEEIKAEAEEELERIKEAAEAEIEAEKE 116
Score = 30.0 bits (68), Expect = 5.6
Identities = 18/63 (28%), Positives = 29/63 (46%), Gaps = 3/63 (4%)
Query: 67 KKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEM 126
+ L YEQ A + +IE+ E +EI+ E + +LE+ KE + +E
Sbjct: 57 QALLAEYEQELEEAREQASEIIEQ---AKKEAEQIAEEIKAEAEEELERIKEAAEAEIEA 113
Query: 127 EKN 129
EK
Sbjct: 114 EKE 116
Score = 30.0 bits (68), Expect = 6.1
Identities = 19/82 (23%), Positives = 40/82 (48%), Gaps = 5/82 (6%)
Query: 38 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK-NA 96
L + E+ E+ + + EYE+++ ++ RE + A+ E + + E+ A+
Sbjct: 43 ADDLAEAERLKEEA-QALLAEYEQEL---EEAREQASEIIEQAKKEAEQIAEEIKAEAEE 98
Query: 97 LLEAKLKEIQEEIQAQLEKYKE 118
LE + + EI+A+ E+ E
Sbjct: 99 ELERIKEAAEAEIEAEKERALE 120
Score = 30.0 bits (68), Expect = 6.1
Identities = 19/82 (23%), Positives = 40/82 (48%), Gaps = 5/82 (6%)
Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK-NA 913
L + E+ E+ + + EYE+++ ++ RE + A+ E + + E+ A+
Sbjct: 43 ADDLAEAERLKEEA-QALLAEYEQEL---EEAREQASEIIEQAKKEAEQIAEEIKAEAEE 98
Query: 914 LLEAKLKEIQEEIQAQLEKYKE 935
LE + + EI+A+ E+ E
Sbjct: 99 ELERIKEAAEAEIEAEKERALE 120
Score = 30.0 bits (68), Expect = 7.0
Identities = 13/60 (21%), Positives = 30/60 (50%), Gaps = 8/60 (13%)
Query: 66 YKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILE 125
+K + + ++R A I +LA+ L+ + + + E + +LE+ +E +I+E
Sbjct: 28 WKPILKALDERQA--------KIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIE 79
Score = 30.0 bits (68), Expect = 7.0
Identities = 13/60 (21%), Positives = 30/60 (50%), Gaps = 8/60 (13%)
Query: 883 YKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILE 942
+K + + ++R A I +LA+ L+ + + + E + +LE+ +E +I+E
Sbjct: 28 WKPILKALDERQA--------KIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIE 79
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 31.2 bits (71), Expect = 4.4
Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 9/91 (9%)
Query: 59 YEKQMTNYK-KLRELYEQRAAAAELEHKTMI---EKELAKNALLEAKLKEIQEEIQAQLE 114
+EK M + K++ L Q K MI E L++K + E++ A+++
Sbjct: 275 HEKSMEKLQEKIKALKYQL-----KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVK 329
Query: 115 KYKENKKQILEMEKNIQLKDDEINNLNENLT 145
+ K+ KK+ + +K I+ ++ I L T
Sbjct: 330 EKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360
Score = 31.2 bits (71), Expect = 4.4
Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 9/91 (9%)
Query: 876 YEKQMTNYK-KLRELYEQRAAAAELEHKTMI---EKELAKNALLEAKLKEIQEEIQAQLE 931
+EK M + K++ L Q K MI E L++K + E++ A+++
Sbjct: 275 HEKSMEKLQEKIKALKYQL-----KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVK 329
Query: 932 KYKENKKQILEMEKNIQLKDDEINNLNENLT 962
+ K+ KK+ + +K I+ ++ I L T
Sbjct: 330 EKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 31.2 bits (71), Expect = 4.5
Identities = 12/29 (41%), Positives = 21/29 (72%), Gaps = 1/29 (3%)
Query: 101 KLKEIQEEIQAQLEKYKENK-KQILEMEK 128
K++E+Q +I+A +KY ++ K LEM+K
Sbjct: 60 KMQELQPQIKALQKKYGNDRQKMALEMQK 88
Score = 31.2 bits (71), Expect = 4.5
Identities = 12/29 (41%), Positives = 21/29 (72%), Gaps = 1/29 (3%)
Query: 918 KLKEIQEEIQAQLEKYKENK-KQILEMEK 945
K++E+Q +I+A +KY ++ K LEM+K
Sbjct: 60 KMQELQPQIKALQKKYGNDRQKMALEMQK 88
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 30.6 bits (69), Expect = 4.6
Identities = 18/113 (15%), Positives = 46/113 (40%), Gaps = 1/113 (0%)
Query: 10 IVFKNFILTIGSPLTSSTSMVVFKVSDLKTRLKD-VEKKLLESPEEIKIEYEKQMTNYKK 68
I + + + + ++ +L+ + + ++ P + + E+ K
Sbjct: 92 IEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFK 151
Query: 69 LRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKK 121
+E E+ + + EK+L + L E + K+ EE++ +LE+ E +
Sbjct: 152 RKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204
>gnl|CDD|131798 TIGR02751, PEPCase_arch, phosphoenolpyruvate carboxylase, archaeal
type. This family is the archaeal-type
phosphoenolpyruvate carboxylase, although not every host
species is archaeal. These sequences bear little
resemblance to the bacterial/eukaryotic type. The members
from Sulfolobus solfataricus and Methanothermobacter
thermautotrophicus were verified experimentally, while
the activity is known to be present in a number of other
archaea [Energy metabolism, Other].
Length = 506
Score = 31.2 bits (71), Expect = 4.6
Identities = 25/105 (23%), Positives = 41/105 (39%), Gaps = 11/105 (10%)
Query: 1252 LEYVREK-WDLVKEKNEQNELDYKVLRK--EFALRRLQYDG-GRSTSESGISTDTEDDNV 1307
L + E +++KE Y LR FA R L + +S +TED V
Sbjct: 397 LSELDEDDLEVIKEL-------YPNLRSDLSFAARYLNPESADSLKDIVKLSEETEDMIV 449
Query: 1308 EIVETNDESEDDKSEVVETQSEHDVPHTNSVMSIEEDDQDEVEEL 1352
V+ +++ K +V T EH + + +E+ V L
Sbjct: 450 IDVKYDEDEPGIKYNLVYTLEEHLLYTGLIGKAFKEEVFQAVRSL 494
>gnl|CDD|218083 pfam04426, Bul1_C, Bul1 C terminus. This family contains the C
terminus of Saccharomyces cerevisiae Bul1. Bul1 binds
the ubiquitin ligase Rsp5, via an N terminal PPSY motif.
The complex containing Bul1 and Rsp5 is involved in
intracellular trafficking of the general amino acid
permease Gap1, degradation of Rog1 in cooperation with
Bul2 and GSK-3, and mitochondrial inheritance. Bul1 may
contain HEAT repeats.
Length = 226
Score = 30.8 bits (70), Expect = 4.7
Identities = 31/118 (26%), Positives = 59/118 (50%), Gaps = 9/118 (7%)
Query: 38 KTRLKDVEKKLLESPEEIKIEYEKQMT-NYKKLRELY---EQRAAAAELEHKTMIEKELA 93
K +LK+++K + ++IK EY+K+ N +KL ELY + EL+ I +LA
Sbjct: 74 KEKLKNIKKTFKDFLKKIK-EYKKKFNENKEKLNELYNLNRTISTNRELKFTDFISTQLA 132
Query: 94 KN----ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSS 147
+ A L+ + + + + Q+ K KK L+ E+++ + + ++ E L S
Sbjct: 133 NDVESLANLKVNVTNLNDIFKKQVVTLKWLKKGSLQYERDVNVNLEFNPDIKETLVPS 190
Score = 30.8 bits (70), Expect = 4.7
Identities = 31/118 (26%), Positives = 59/118 (50%), Gaps = 9/118 (7%)
Query: 855 KTRLKDVEKKLLESPEEIKIEYEKQMT-NYKKLRELY---EQRAAAAELEHKTMIEKELA 910
K +LK+++K + ++IK EY+K+ N +KL ELY + EL+ I +LA
Sbjct: 74 KEKLKNIKKTFKDFLKKIK-EYKKKFNENKEKLNELYNLNRTISTNRELKFTDFISTQLA 132
Query: 911 KN----ALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSS 964
+ A L+ + + + + Q+ K KK L+ E+++ + + ++ E L S
Sbjct: 133 NDVESLANLKVNVTNLNDIFKKQVVTLKWLKKGSLQYERDVNVNLEFNPDIKETLVPS 190
>gnl|CDD|220767 pfam10459, Peptidase_S46, Peptidase S46. Dipeptidyl-peptidase 7
(DPP-7) is the best characterized member of this family.
It is a serine peptidase that is located on the cell
surface and is predicted to have two N-terminal
transmembrane domains.
Length = 696
Score = 31.4 bits (72), Expect = 4.9
Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 10/109 (9%)
Query: 840 LATTRSVILPLVSD-LKTRLKDVEKKLLESPEEIKIEYE---KQMTNY-KKLRELYEQRA 894
+ R P + L R+ +E +L+ P EI+I+Y + NY K + E
Sbjct: 261 VEFRRDWAYPARIERLGERIALLEAAMLKDP-EIRIKYASKLAGLANYWKNSIGMLEGLK 319
Query: 895 AAAELEHKTMIEKELA----KNALLEAKLKEIQEEIQAQLEKYKENKKQ 939
L K E L K+ AK + +E+ A + +E ++
Sbjct: 320 DLDLLARKQAREAALRAWVKKDPARGAKYGDALDELAALYAERRELARR 368
>gnl|CDD|172503 PRK14002, PRK14002, potassium-transporting ATPase subunit C;
Provisional.
Length = 186
Score = 30.5 bits (69), Expect = 4.9
Identities = 23/74 (31%), Positives = 30/74 (40%), Gaps = 6/74 (8%)
Query: 746 AVVQDRIDTRLKSSPGINECTSAVSAVSEESSGLRSVDAIRPVLNHDEMTR----RNVEA 801
A VQ RIDT L P ++ V+ SGL I P + ++ R R +
Sbjct: 92 AEVQARIDTFLVHHPYLSRKDIPAEMVTASGSGLDP--NISPQAAYVQVKRVAKARGMSE 149
Query: 802 ESRADAVVQGQEKP 815
E V Q EKP
Sbjct: 150 EKVKQLVDQHVEKP 163
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 31.2 bits (72), Expect = 4.9
Identities = 27/103 (26%), Positives = 49/103 (47%), Gaps = 16/103 (15%)
Query: 46 KKLLESPEEIKIEYEKQMT-----NYKKLRELYEQRAAAAELEHKTMIEKELAK-NALLE 99
K + E+PE +K E + +L EL E+R +T +E+ A+ NAL +
Sbjct: 5 KLIRENPEAVK---EALAKRGFPLDVDELLELDEERREL-----QTELEELQAERNALSK 56
Query: 100 --AKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 140
+ K E+ +A + + KE K++I +E + + E+ L
Sbjct: 57 EIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELEEL 99
Score = 31.2 bits (72), Expect = 4.9
Identities = 27/103 (26%), Positives = 49/103 (47%), Gaps = 16/103 (15%)
Query: 863 KKLLESPEEIKIEYEKQMT-----NYKKLRELYEQRAAAAELEHKTMIEKELAK-NALLE 916
K + E+PE +K E + +L EL E+R +T +E+ A+ NAL +
Sbjct: 5 KLIRENPEAVK---EALAKRGFPLDVDELLELDEERREL-----QTELEELQAERNALSK 56
Query: 917 --AKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 957
+ K E+ +A + + KE K++I +E + + E+ L
Sbjct: 57 EIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELEEL 99
>gnl|CDD|233028 TIGR00571, dam, DNA adenine methylase (dam). All proteins in this
family for which functions are known are DNA-adenine
methyltransferases. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University). The DNA adenine methylase (dam) of
E. coli and related species is instrumental in
distinguishing the newly synthesized strand during DNA
replication for methylation-directed mismatch repair.
This family includes several phage methylases and a
number of different restriction enzyme chromosomal
site-specific modification systems [DNA metabolism, DNA
replication, recombination, and repair].
Length = 266
Score = 30.8 bits (70), Expect = 5.1
Identities = 19/58 (32%), Positives = 30/58 (51%), Gaps = 8/58 (13%)
Query: 1103 KSHIQKEIANNLPKVWRILIE--------LLNTHQEQYENNDVNENFESCYKNIKTNT 1152
K+ + EI +LPK + L+E N + ++Y ND+NE+ + YK IK N
Sbjct: 11 KTSLLPEIKKHLPKNFNCLVEPFVGGGAVFFNLNPKRYLLNDINEDLINLYKAIKNNV 68
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 31.2 bits (71), Expect = 5.1
Identities = 29/104 (27%), Positives = 43/104 (41%), Gaps = 16/104 (15%)
Query: 41 LKDVEKKLLESPEEIKIEYEKQMT----------NYKKLRE----LYEQRAAAAELEHKT 86
LK V EEIKIE +K T YKKL+ L Q + E
Sbjct: 337 LKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYY 396
Query: 87 MIEKELAKNALLEAKLKEIQEEI--QAQLEKYKENKKQILEMEK 128
K + A + ++EI+EE+ + L+ K+ +K+ EK
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 31.2 bits (71), Expect = 5.1
Identities = 29/104 (27%), Positives = 43/104 (41%), Gaps = 16/104 (15%)
Query: 858 LKDVEKKLLESPEEIKIEYEKQMT----------NYKKLRE----LYEQRAAAAELEHKT 903
LK V EEIKIE +K T YKKL+ L Q + E
Sbjct: 337 LKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYY 396
Query: 904 MIEKELAKNALLEAKLKEIQEEI--QAQLEKYKENKKQILEMEK 945
K + A + ++EI+EE+ + L+ K+ +K+ EK
Sbjct: 397 ESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 30.8 bits (70), Expect = 5.2
Identities = 20/171 (11%), Positives = 67/171 (39%), Gaps = 17/171 (9%)
Query: 98 LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQ 157
++ +++ + +I+ K E +K+I + + I+ EI L EN+ +L +
Sbjct: 50 IQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVE---RQELLKKR 106
Query: 158 M----------TLVNNLFSNMLINPMIN-LDMLIKLIQDNYLLIIDLINNHEINDTVSLL 206
+ ++ + ++ + +I+ + + ++ + ++ + + +
Sbjct: 107 ARAMQVNGTATSYIDVILNSKSFSDLISRVTAISVIVDADKKILEQQKEDKKSLEEKQ-- 164
Query: 207 VDLEKKIDQ-ETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTKAVES 256
LE K++ L + + QL L + + ++ + ++ +
Sbjct: 165 AALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAA 215
Score = 30.8 bits (70), Expect = 5.3
Identities = 21/171 (12%), Positives = 69/171 (40%), Gaps = 17/171 (9%)
Query: 915 LEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQ 974
++ +++ + +I+ K E +K+I + + I+ EI L EN+ +L +
Sbjct: 50 IQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVE---RQELLKKR 106
Query: 975 M----------TLVNNLFSNMLINPMIN-LDMLIKLIQDNYLLIIDLINNHEINDTVSLL 1023
+ ++ + ++ + +I+ + + ++ + ++ + + +
Sbjct: 107 ARAMQVNGTATSYIDVILNSKSFSDLISRVTAISVIVDADKKILEQQKEDKKSLEEKQ-- 164
Query: 1024 VDLEKKIDQ-ETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTSKDSA 1073
LE K++ L + + QL L + + ++ + ++ + +A
Sbjct: 165 AALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASALGEKAA 215
>gnl|CDD|223874 COG0803, LraI, ABC-type metal ion transport system, periplasmic
component/surface adhesin [Inorganic ion transport and
metabolism].
Length = 303
Score = 30.8 bits (70), Expect = 5.3
Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 3/41 (7%)
Query: 83 EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQI 123
E+K EK KL ++ EE +A+L K + +
Sbjct: 159 ENKETYEKNAEA---YLKKLNKLDEEAKAKLSKIPAQRDVV 196
Score = 30.8 bits (70), Expect = 5.3
Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 3/41 (7%)
Query: 900 EHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQI 940
E+K EK KL ++ EE +A+L K + +
Sbjct: 159 ENKETYEKNAEA---YLKKLNKLDEEAKAKLSKIPAQRDVV 196
>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
Members of this family function as part of the Mediator
(Med) complex, which links DNA-bound transcriptional
regulators and the general transcription machinery,
particularly the RNA polymerase II enzyme. They play a
role in basal transcription by mediating activation or
repression according to the specific complement of
transcriptional regulators bound to the promoter.
Length = 176
Score = 30.1 bits (68), Expect = 5.6
Identities = 12/50 (24%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 97 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
L + E++ + +LE+++EN+++ILE+ ++ D++I ++ + L
Sbjct: 6 DLISADDELKSAL-KELEEHQENQQRILELRAEVESLDEKIKDILKELKE 54
Score = 30.1 bits (68), Expect = 5.6
Identities = 12/50 (24%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 914 LLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
L + E++ + +LE+++EN+++ILE+ ++ D++I ++ + L
Sbjct: 6 DLISADDELKSAL-KELEEHQENQQRILELRAEVESLDEKIKDILKELKE 54
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 31.2 bits (72), Expect = 5.9
Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 4/54 (7%)
Query: 90 KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 143
KELA ALL+AK E+ E ++A LE+ KE +K++ +++ +L +L
Sbjct: 704 KELA--ALLKAKPSELPERVEALLEELKELEKELEQLKA--KLAAAAAGDLLAQ 753
Score = 31.2 bits (72), Expect = 5.9
Identities = 19/54 (35%), Positives = 32/54 (59%), Gaps = 4/54 (7%)
Query: 907 KELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNEN 960
KELA ALL+AK E+ E ++A LE+ KE +K++ +++ +L +L
Sbjct: 704 KELA--ALLKAKPSELPERVEALLEELKELEKELEQLKA--KLAAAAAGDLLAQ 753
>gnl|CDD|215960 pfam00509, Hemagglutinin, Haemagglutinin. Hemagglutinin from
influenza virus causes membrane fusion of the viral
membrane with the host membrane. Fusion occurs after the
host cell internalises the virus by endocytosis. The
drop of pH causes release of a hydrophobic fusion
peptide and a large conformational change leading to
membrane fusion.
Length = 550
Score = 31.2 bits (71), Expect = 6.1
Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 12/76 (15%)
Query: 87 MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ-----------ILEMEKNIQLKDD 135
+IEK + +E + E++ I L K ++ +LE + + L D
Sbjct: 383 LIEKMNTQFEAIEKEFSELERRIN-NLNKKVDDGITDIWSYNAELLVLLENQHTLDLHDS 441
Query: 136 EINNLNENLTSSLAEN 151
+NNL E + L EN
Sbjct: 442 NVNNLYEKVRRQLREN 457
Score = 31.2 bits (71), Expect = 6.1
Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 12/76 (15%)
Query: 904 MIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQ-----------ILEMEKNIQLKDD 952
+IEK + +E + E++ I L K ++ +LE + + L D
Sbjct: 383 LIEKMNTQFEAIEKEFSELERRIN-NLNKKVDDGITDIWSYNAELLVLLENQHTLDLHDS 441
Query: 953 EINNLNENLTSSLAEN 968
+NNL E + L EN
Sbjct: 442 NVNNLYEKVRRQLREN 457
>gnl|CDD|234942 PRK01318, PRK01318, membrane protein insertase; Provisional.
Length = 521
Score = 31.0 bits (71), Expect = 6.1
Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 1/28 (3%)
Query: 100 AKLKEIQEEIQAQLEKYKENK-KQILEM 126
AK+K +Q ++Q EKYK++ K EM
Sbjct: 350 AKMKVLQPKMQELKEKYKDDPQKMQQEM 377
Score = 31.0 bits (71), Expect = 6.1
Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 1/28 (3%)
Query: 917 AKLKEIQEEIQAQLEKYKENK-KQILEM 943
AK+K +Q ++Q EKYK++ K EM
Sbjct: 350 AKMKVLQPKMQELKEKYKDDPQKMQQEM 377
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 30.9 bits (70), Expect = 6.4
Identities = 18/83 (21%), Positives = 31/83 (37%), Gaps = 4/83 (4%)
Query: 233 ETPMSSAMVEAVESNQTSTKAVE--SNQTSTSKDSANIQLDHAETNTNEESPADQDETNS 290
+P + A +S +T SN TSKD+++ D A+ N D ++
Sbjct: 21 TSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQ--DNNDKKF 78
Query: 291 TTTLDKSHIQKEITRVISNSLIS 313
+T + I I +L
Sbjct: 79 STIDSSTSDSNNIIDFIYKNLPQ 101
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.9 bits (71), Expect = 6.4
Identities = 19/93 (20%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 42 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 101
+ E PE++ +++ YKKLR+ E++ +HK + + L E +
Sbjct: 207 DSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKL---REKLKE-E 262
Query: 102 LKEI---QEEIQAQLEKYKENKKQILEMEKNIQ 131
LK + ++I +E+ ++ K++ E+ +
Sbjct: 263 LKSLRLTSKQIDELVEQLRDINKRVRGQERELL 295
Score = 30.9 bits (71), Expect = 6.4
Identities = 19/93 (20%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 859 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 918
+ E PE++ +++ YKKLR+ E++ +HK + + L E +
Sbjct: 207 DSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKL---REKLKE-E 262
Query: 919 LKEI---QEEIQAQLEKYKENKKQILEMEKNIQ 948
LK + ++I +E+ ++ K++ E+ +
Sbjct: 263 LKSLRLTSKQIDELVEQLRDINKRVRGQERELL 295
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 29.5 bits (67), Expect = 6.7
Identities = 24/103 (23%), Positives = 50/103 (48%), Gaps = 12/103 (11%)
Query: 42 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 101
K +K+L + ++ + E EK +KL+E ++ AA T+ E K E +
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAA-------TLSEAAREK---KEKE 66
Query: 102 LKEIQEEIQAQLEKYKE--NKKQILEMEKNIQLKDDEINNLNE 142
L++ +E Q + +K ++ K+Q E++K + + I + +
Sbjct: 67 LQKKVQEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEVAK 109
Score = 29.5 bits (67), Expect = 6.7
Identities = 24/103 (23%), Positives = 50/103 (48%), Gaps = 12/103 (11%)
Query: 859 KDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAK 918
K +K+L + ++ + E EK +KL+E ++ AA T+ E K E +
Sbjct: 17 KAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAA-------TLSEAAREK---KEKE 66
Query: 919 LKEIQEEIQAQLEKYKE--NKKQILEMEKNIQLKDDEINNLNE 959
L++ +E Q + +K ++ K+Q E++K + + I + +
Sbjct: 67 LQKKVQEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEVAK 109
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 30.7 bits (69), Expect = 6.8
Identities = 18/94 (19%), Positives = 40/94 (42%), Gaps = 3/94 (3%)
Query: 35 SDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK 94
S K + +KK + EE+K KQ ++L++L ++R A E + + ++ A+
Sbjct: 72 SSAKKGEQQRKKKEEQVAEELK---PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQ 128
Query: 95 NALLEAKLKEIQEEIQAQLEKYKENKKQILEMEK 128
+ + + + + + + K E K
Sbjct: 129 LEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAK 162
Score = 30.7 bits (69), Expect = 6.8
Identities = 18/94 (19%), Positives = 40/94 (42%), Gaps = 3/94 (3%)
Query: 852 SDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK 911
S K + +KK + EE+K KQ ++L++L ++R A E + + ++ A+
Sbjct: 72 SSAKKGEQQRKKKEEQVAEELK---PKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQ 128
Query: 912 NALLEAKLKEIQEEIQAQLEKYKENKKQILEMEK 945
+ + + + + + + K E K
Sbjct: 129 LEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAK 162
>gnl|CDD|193569 cd08656, M28_like_6, M28 Zn-Peptidases. Peptidase family M28 (also
called aminopeptidase Y family), uncharacterized
subfamily. The M28 family contains aminopeptidases as
well as carboxypeptidases. They have co-catalytic zinc
ions; each zinc ion is tetrahedrally co-ordinated, with
three amino acid ligands plus activated water; one
aspartate residue binds both metal ions.
Length = 281
Score = 30.3 bits (69), Expect = 6.9
Identities = 10/34 (29%), Positives = 21/34 (61%), Gaps = 3/34 (8%)
Query: 1612 VISSYKPDYSK---LVSHWNCRPLAVKEEEEANE 1642
+I+S+ P+ K L +HW+ RP+A ++ + +
Sbjct: 62 IIASFNPEAKKRILLAAHWDTRPIADQDPDPKKK 95
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 30.9 bits (70), Expect = 7.0
Identities = 25/117 (21%), Positives = 59/117 (50%), Gaps = 9/117 (7%)
Query: 37 LKTRLKDVEKKLLESPEEIKIEYEKQ-MTNYKKLRE--------LYEQRAAAAELEHKTM 87
L+ +L+++E+ L+ E+ +E E+Q ++N +KLRE L + +
Sbjct: 194 LQFQLEELEEADLQPGEDEALEAEQQRLSNLEKLRELSQNALAALRGDVDVQEGSLLEGL 253
Query: 88 IEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 144
E +LA ++++ L+E+ E++ L + +E +++ ++ + +N + E L
Sbjct: 254 GEAQLALASVIDGSLRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERL 310
Score = 30.9 bits (70), Expect = 7.0
Identities = 25/117 (21%), Positives = 59/117 (50%), Gaps = 9/117 (7%)
Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQ-MTNYKKLRE--------LYEQRAAAAELEHKTM 904
L+ +L+++E+ L+ E+ +E E+Q ++N +KLRE L + +
Sbjct: 194 LQFQLEELEEADLQPGEDEALEAEQQRLSNLEKLRELSQNALAALRGDVDVQEGSLLEGL 253
Query: 905 IEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENL 961
E +LA ++++ L+E+ E++ L + +E +++ ++ + +N + E L
Sbjct: 254 GEAQLALASVIDGSLRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERL 310
>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Validated.
Length = 339
Score = 30.5 bits (70), Expect = 7.0
Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 16/91 (17%)
Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEY--EK-QMTNY-KKLREL-YEQRAAAAEL--EHK 902
LV + + LE+ E ++++Y +K ++T K L +L E+R A L E K
Sbjct: 8 LVEEALAAIAAA--SDLEALEALRVKYLGKKGELTELLKGLGKLPPEERKEAGALINELK 65
Query: 903 TMIEKELAKNALLEAKLKEI-QEEIQAQLEK 932
IE A LE + +E+ + A+L
Sbjct: 66 QAIE------AALEERKEELEAAALNARLAA 90
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 31.1 bits (71), Expect = 7.1
Identities = 26/113 (23%), Positives = 47/113 (41%), Gaps = 17/113 (15%)
Query: 32 FKVSDLKTRLKDVEKKLL----ESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTM 87
++ L++ L + + L P+ + EK+ + K L EL E E +
Sbjct: 43 ERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEEL----IKDVEEELEK- 97
Query: 88 IEKELAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNL 140
IEKE+ + LE ++ E++ EI+ E + + LE N L +
Sbjct: 98 IEKEIKE---LEEEISELENEIKE-----LEQEIERLEPWGNFDLDLSLLLGF 142
>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 129
Score = 29.5 bits (66), Expect = 7.2
Identities = 20/87 (22%), Positives = 29/87 (33%), Gaps = 14/87 (16%)
Query: 1277 RKEFALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESEDDK----------SEVVET 1326
R F E D E++ VE+V D +D K E V+
Sbjct: 38 RSYFEETTPDAAAEEEEEEVVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDL 97
Query: 1327 QSEHDVPHTNSVMSIEEDDQDEVEELE 1353
+ D + EEDD D+V +
Sbjct: 98 DDDDDDT----FLEDEEDDDDDVSGII 120
>gnl|CDD|132552 TIGR03513, GldL_gliding, gliding motility-associated protein GldL.
This protein family, GldL, is named for the member from
Flavobacterium johnsoniae, which is required for a type
of rapid gliding motility found in certain members of
the Bacteriodetes. However, members are found also in
several members of the Bacteriodetes that appear not to
be motile [Cellular processes, Chemotaxis and motility].
Length = 202
Score = 30.2 bits (68), Expect = 7.3
Identities = 28/131 (21%), Positives = 48/131 (36%), Gaps = 26/131 (19%)
Query: 65 NYKKLRELYEQRAAAAELEHKTMIE--KELAKNALLEAKL-------KEIQEEIQAQLEK 115
N +K Y + +A L + + KE K+A L L + + + +
Sbjct: 69 NKRKEAAAYAEEPSAVGLLSQKLDAMLKEAKKDATLMQSLGNGINNFEGAAKTLAPMTDS 128
Query: 116 YKENKKQILEMEK------------NIQLKD-----DEINNLNENLTSSLAENHILNSQM 158
Y + KK I +M QLK D N + N +S E + + +
Sbjct: 129 YAQQKKYIEQMSSLAANMEGLNTIYEAQLKGASSHADANNEIAINSSSLKEEMEKMAANL 188
Query: 159 TLVNNLFSNML 169
T +N ++ ML
Sbjct: 189 TSLNEVYGGML 199
Score = 30.2 bits (68), Expect = 7.3
Identities = 28/131 (21%), Positives = 48/131 (36%), Gaps = 26/131 (19%)
Query: 882 NYKKLRELYEQRAAAAELEHKTMIE--KELAKNALLEAKL-------KEIQEEIQAQLEK 932
N +K Y + +A L + + KE K+A L L + + + +
Sbjct: 69 NKRKEAAAYAEEPSAVGLLSQKLDAMLKEAKKDATLMQSLGNGINNFEGAAKTLAPMTDS 128
Query: 933 YKENKKQILEMEK------------NIQLKD-----DEINNLNENLTSSLAENHILNSQM 975
Y + KK I +M QLK D N + N +S E + + +
Sbjct: 129 YAQQKKYIEQMSSLAANMEGLNTIYEAQLKGASSHADANNEIAINSSSLKEEMEKMAANL 188
Query: 976 TLVNNLFSNML 986
T +N ++ ML
Sbjct: 189 TSLNEVYGGML 199
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 30.2 bits (68), Expect = 7.4
Identities = 20/94 (21%), Positives = 44/94 (46%), Gaps = 4/94 (4%)
Query: 48 LLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQE 107
+ E+ + +++T K R QR L +E+ + LE + E++
Sbjct: 68 AIAEMEQKLAKLREELTELHKKRGELAQRL----LLLNDELEQLRREIQQLEKTIAELRS 123
Query: 108 EIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN 141
EI + + ++ ++++ E EK+ + DE+ +LN
Sbjct: 124 EITSLETEIRDLREELQEKEKDNETLQDELISLN 157
Score = 30.2 bits (68), Expect = 7.4
Identities = 20/94 (21%), Positives = 44/94 (46%), Gaps = 4/94 (4%)
Query: 865 LLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQE 924
+ E+ + +++T K R QR L +E+ + LE + E++
Sbjct: 68 AIAEMEQKLAKLREELTELHKKRGELAQRL----LLLNDELEQLRREIQQLEKTIAELRS 123
Query: 925 EIQAQLEKYKENKKQILEMEKNIQLKDDEINNLN 958
EI + + ++ ++++ E EK+ + DE+ +LN
Sbjct: 124 EITSLETEIRDLREELQEKEKDNETLQDELISLN 157
>gnl|CDD|224202 COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and
metabolism].
Length = 533
Score = 30.7 bits (70), Expect = 7.4
Identities = 43/220 (19%), Positives = 81/220 (36%), Gaps = 54/220 (24%)
Query: 835 PESNILATTRSVILPLVSDLKTRLKDVEKKLLESPE--------EIKI--EYEKQMTNYK 884
P + +LA + ++P D + K +++ L+SP E+ + +QM +
Sbjct: 295 PFTGLLARFVTRLIPGDEDDEIEPKHLDETALDSPVVALANAAREVLRLGDSIEQML--E 352
Query: 885 KLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQL---------EKYKE 935
+L E E A KE+ K LE + + EEI+ L E+
Sbjct: 353 RLYEYIEGDAK---------KVKEIRK---LEDAVDRLYEEIKLYLARLSKEGLSEEESR 400
Query: 936 NKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILN---SQMTLVNNLFSNMLINPMIN 992
+I++ N++ D I L E +A + +
Sbjct: 401 RWAEIIDAAINLEHIGDIIERLLELADKKIANGRAFSEDGLEE----------------- 443
Query: 993 LDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQ 1032
LD L L +N L I ++ + + LV+ +K++ +
Sbjct: 444 LDALFALTLENLRLAISVLVTGD-LELARRLVERKKRVRR 482
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 31.0 bits (71), Expect = 7.5
Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 3/60 (5%)
Query: 90 KELAKNALLEAKLKEIQ---EEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 146
L K LLEA+ K +Q E+ A L+K K++ ++++ + ++ L +
Sbjct: 46 DALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEA 105
Score = 31.0 bits (71), Expect = 7.5
Identities = 14/60 (23%), Positives = 28/60 (46%), Gaps = 3/60 (5%)
Query: 907 KELAKNALLEAKLKEIQ---EEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTS 963
L K LLEA+ K +Q E+ A L+K K++ ++++ + ++ L +
Sbjct: 46 DALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEA 105
Score = 31.0 bits (71), Expect = 7.6
Identities = 24/116 (20%), Positives = 43/116 (37%), Gaps = 6/116 (5%)
Query: 206 LVDLEKKIDQETLAIQNDKQLKQLENYETPMSSAMVEAVESNQTSTKAVESNQTSTSKDS 265
L++++ Q ++ + +LE + E + + S + +ES T
Sbjct: 82 TEQLKQQLAQAPAKLRQAQ--AELEALKDDNDEETRETLST--LSLRQLESRLAQTLDQL 137
Query: 266 ANIQLDHAETNTNEESPADQDETNSTTTLDKSHIQKEITRVISNSLISG--LSPSL 319
N Q D AE N+ S Q E S ++I ++ + G L PS
Sbjct: 138 QNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKGGKVGGKALRPSQ 193
Score = 30.7 bits (70), Expect = 8.0
Identities = 39/264 (14%), Positives = 92/264 (34%), Gaps = 73/264 (27%)
Query: 33 KVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 92
K+ K + ++++L ++P +++ + + ++ K ++ + L
Sbjct: 74 KIDRQKEETEQLKQQLAQAPAKLR-QAQAELEALKD--------------DNDEETRETL 118
Query: 93 AKNAL--LEAKLKEIQE---EIQAQLEKY-----------KENKKQILEMEKNIQLKDDE 136
+ +L LE++L + + Q L +Y + + + + +Q +
Sbjct: 119 STLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQ----Q 174
Query: 137 INN-LNENLTSSLAENH----ILNSQMTLVNNLFSNMLINPMINLDMLIKLIQDNYLLII 191
I N L A +L ++ L+N D+ K ++ N L
Sbjct: 175 IRNLLKGGKVGGKALRPSQRVLLQAEQALLN-----------AQNDLQRKSLEGNTQL-Q 222
Query: 192 DLINNHEINDTVSLLVDLEKKIDQETLAIQ---NDKQLKQLENYETPMSSAMVEAVESNQ 248
DL+ + D ++ + ++ + +Q N K+L E + V+ Q
Sbjct: 223 DLLQ--KQRDYLTARIQR---LEHQLQLLQEAINSKRLTLSE-----------KTVQEAQ 266
Query: 249 TSTKAVESNQTSTSKD--SANIQL 270
+ +A N+QL
Sbjct: 267 SQDEAARIQANPLVAQELEINLQL 290
>gnl|CDD|214442 MTH00160, ND2, NADH dehydrogenase subunit 2; Provisional.
Length = 335
Score = 30.6 bits (70), Expect = 7.5
Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 144 LTSSLAENHIL-NSQMTLVNNLFSNMLINPMINLDMLIKL 182
L +LA +L + + ++N + N + + +IN +L+KL
Sbjct: 62 LIQALASIILLFSIILNMLNLMLFNNISSMLINSSLLLKL 101
Score = 30.6 bits (70), Expect = 7.5
Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 961 LTSSLAENHIL-NSQMTLVNNLFSNMLINPMINLDMLIKL 999
L +LA +L + + ++N + N + + +IN +L+KL
Sbjct: 62 LIQALASIILLFSIILNMLNLMLFNNISSMLINSSLLLKL 101
>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF. Sequences
in this family of proteins are members of the chain
length determinant family (pfam02706) which includes the
wzc protein from E.coli. This family of proteins are
homologous to the EpsF protein of the methanolan
biosynthesis operon of Methylobacillus species strain
12S. The distribution of this protein appears to be
restricted to a subset of exopolysaccharide operons
containing a syntenic grouping of genes including a
variant of the EpsH exosortase protein. Exosortase has
been proposed to be involved in the targetting and
processing of proteins containing the PEP-CTERM domain
to the exopolysaccharide layer.
Length = 444
Score = 30.5 bits (69), Expect = 7.6
Identities = 25/104 (24%), Positives = 42/104 (40%), Gaps = 22/104 (21%)
Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEH-KTMIEK 907
P++ +LKT + E KL E + YK A AE+ K+ +
Sbjct: 254 PIIQNLKTDIARAESKLAELSQR----LGPNHPQYK---------RAQAEINSLKSQLNA 300
Query: 908 ELAKNA--------LLEAKLKEIQEEIQAQLEKYKENKKQILEM 943
E+ K +L+ + E++E ++ Q K E +Q EM
Sbjct: 301 EIKKVTSSVGTNSRILKQREAELREALENQKAKVLELNRQRDEM 344
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 30.8 bits (70), Expect = 7.9
Identities = 22/95 (23%), Positives = 41/95 (43%), Gaps = 5/95 (5%)
Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
L++ K L + KKL E E + E E+ + K+ RE + L E L
Sbjct: 164 LIASAKEELDQLSKKLAELKAEEEEELERAL---KEKREELLSKLEEELLARLESKEAAL 220
Query: 910 AKN--ALLEAKLKEIQEEIQAQLEKYKENKKQILE 942
K E + +E++++ + +L + E + + E
Sbjct: 221 EKQLRLEFEREKEELRKKYEEKLRQELERQAEAHE 255
>gnl|CDD|218552 pfam05316, VAR1, Mitochondrial ribosomal protein (VAR1). This
family consists of the yeast mitochondrial ribosomal
proteins VAR1. Mitochondria possess their own ribosomes
responsible for the synthesis of a small number of
proteins encoded by the mitochondrial genome. In yeast
the two ribosomal RNAs and a single ribosomal protein,
VAR1, are products of mitochondrial genes, and the
remaining approximately 80 ribosomal proteins are
encoded in the nucleus. VAR1 along with 15S rRNA are
necessary for the formation of mature 37S subunits.
Length = 350
Score = 30.3 bits (68), Expect = 7.9
Identities = 27/110 (24%), Positives = 54/110 (49%), Gaps = 11/110 (10%)
Query: 119 NKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVNNLFS----------NM 168
K +L+M N ++ + +IN N N+ L E + +++ +NN+ + N
Sbjct: 5 LKNYLLKMNNN-RMNNMDINIKNNNMNKYLNEYNNKGNKLQHINNMNNWNNQLYNYNKNN 63
Query: 169 LINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETL 218
+IN IN ++ KL+ L +++INN+ IN+ + + K + T+
Sbjct: 64 VINTWINDKLVNKLLYKLMKLKVNIINNNIINNGQIKKIIINKPKFKHTI 113
Score = 30.3 bits (68), Expect = 7.9
Identities = 27/110 (24%), Positives = 54/110 (49%), Gaps = 11/110 (10%)
Query: 936 NKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTLVNNLFS----------NM 985
K +L+M N ++ + +IN N N+ L E + +++ +NN+ + N
Sbjct: 5 LKNYLLKMNNN-RMNNMDINIKNNNMNKYLNEYNNKGNKLQHINNMNNWNNQLYNYNKNN 63
Query: 986 LINPMINLDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETL 1035
+IN IN ++ KL+ L +++INN+ IN+ + + K + T+
Sbjct: 64 VINTWINDKLVNKLLYKLMKLKVNIINNNIINNGQIKKIIINKPKFKHTI 113
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.8 bits (70), Expect = 8.0
Identities = 30/176 (17%), Positives = 53/176 (30%), Gaps = 22/176 (12%)
Query: 441 ETSDEKDIVKGNVVDYEATKQTRTESPSEEQKSKQLIRQDR---ITEETDESRPFESAEC 497
E D + G D + + + E+ + +R I EE D + +
Sbjct: 363 EVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDL 422
Query: 498 SRTSQPEVENHISNTDTIETAYSEMSEPIEELHTERLSTETLQREPITSVPTATSNTTLP 557
+ E E +E EL RL + + TAT
Sbjct: 423 QALESQLRQQL-------EAGKLEFNEEEYEL-ELRLGRLKQRLDS----ATATPEELEQ 470
Query: 558 SEPSVSRTRTPEEILEARNARLKRLEDQCKNLFHKMSLTSQRSDTLSERLEELHDY 613
E + +E E A +++L+ + + L +R D E L+
Sbjct: 471 LEINDEALEKAQEEQEQAEANVEQLQSELRQL-------RKRRDEALEALQRAERR 519
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 30.6 bits (69), Expect = 8.3
Identities = 29/99 (29%), Positives = 50/99 (50%), Gaps = 11/99 (11%)
Query: 849 PLVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKE 908
PL+ L+ +++ EK +++ +I+ E EK++ +KL + E LE KE
Sbjct: 361 PLLERLEKDVEEGEKTIVKEARQIEEELEKEV---EKLGKEEESLFKRVALEEGL---KE 414
Query: 909 LAKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNI 947
L ++ E LKE+ +E + LEK K + E+EK
Sbjct: 415 LEQD--EENFLKELSKEEKELLEKL---KMEASEVEKLF 448
>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element.
Length = 370
Score = 30.4 bits (68), Expect = 8.3
Identities = 32/140 (22%), Positives = 53/140 (37%), Gaps = 13/140 (9%)
Query: 504 EVENHISNTDTIETAYSEMSEPIEELHTERLSTETLQREPITSVPTATSNTTLPSEPSVS 563
+VE HI NT+ E+ + EL E S + E + + +
Sbjct: 85 QVEAHIENTEKCLKELMELKKKARELKEECRSIKKEHMETTLDIENQKKRQGVVDTSFTN 144
Query: 564 RTRTPEEILEARNARLKRLEDQCKNLFHKMSLTSQRSDTLSERLEELHDYYGTSNERHSP 623
R + EE R+ ED + + +R L + ++E+ DY N R
Sbjct: 145 RIQEMEE-------RISGAEDSIEEIKRTGKFNEKRKKRLEQNIQEIQDYVRRPNLRII- 196
Query: 624 RRHREGSQDSDDTRLRSPEN 643
G +S+D +L+ PEN
Sbjct: 197 -----GVPESEDEQLKGPEN 211
>gnl|CDD|235474 PRK05451, PRK05451, dihydroorotase; Provisional.
Length = 345
Score = 30.5 bits (70), Expect = 8.5
Identities = 11/20 (55%), Positives = 15/20 (75%), Gaps = 1/20 (5%)
Query: 408 PLLRDH-KIKVVAEHISTED 426
PL R K+K+V EHI+T+D
Sbjct: 161 PLRRRFPKLKIVFEHITTKD 180
>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
This family contains EBNA-3A, -3B, and -3C which are
latent infection nuclear proteins important for
Epstein-Barr virus (EBV)-induced B-cell immortalisation
and the immune response to EBV infection.
Length = 254
Score = 30.1 bits (68), Expect = 8.5
Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 9/59 (15%)
Query: 1281 ALRRLQYDGGRSTSESGISTDTEDDNVEIVETNDESEDDKSEVVETQSEHDVPHTNSVM 1339
A R G++ S + T E D+ VET+ ESED +SE + ++P+ M
Sbjct: 192 AFLR-----GQALGLSALRTPGEPDDDATVETSSESEDSESE----SDDEELPYIVPRM 241
>gnl|CDD|216560 pfam01544, CorA, CorA-like Mg2+ transporter protein. The CorA
transport system is the primary Mg2+ influx system of
Salmonella typhimurium and Escherichia coli. CorA is
virtually ubiquitous in the Bacteria and Archaea. There
are also eukaryotic relatives of this protein. The
family includes the MRS2 protein from yeast that is
thought to be an RNA splicing protein. However its
membership of this family suggests that its effect on
splicing is due to altered magnesium levels in the cell.
Length = 291
Score = 30.4 bits (69), Expect = 8.8
Identities = 35/169 (20%), Positives = 66/169 (39%), Gaps = 23/169 (13%)
Query: 7 LSMIVFKNFILTIGSPLTSSTSMVVFKVSDLKTRLKDVEK--------------KLLESP 52
LS+ + N ++T+ + + ++ +L K ++LE
Sbjct: 71 LSIFLGDNRLITVRREPLPAVDEIRKRLENLGLGPKSPGDLLYLLLDSIVDRYFEILEKL 130
Query: 53 EEIKIEYEKQMTNYKK---LRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEI 109
EE E E ++ + LREL R + L +E+ N LL I EE
Sbjct: 131 EEELDELEDELEDSTTNELLRELLRLRRSLVRLRRSLAPLREVL-NRLLSDDGPLIDEEE 189
Query: 110 QAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQM 158
+ L + +++LEM + ++ + + +L + S L+ LN M
Sbjct: 190 KEYLRDLLDELERLLEMAEILR---ERLRSLQDAYLSLLSNR--LNRIM 233
>gnl|CDD|227731 COG5444, COG5444, Uncharacterized conserved protein [Function
unknown].
Length = 565
Score = 30.5 bits (69), Expect = 8.8
Identities = 25/139 (17%), Positives = 49/139 (35%), Gaps = 14/139 (10%)
Query: 23 LTSSTSMVVFKVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAEL 82
T +V + KD +KL + +E + ++L+ + Q L
Sbjct: 137 GKLDTEDFKDEVDSARKHAKDTLEKLYKLDQEGMTLMAAVESKMQELKAIIRQ------L 190
Query: 83 EHKTMIEKELAK--------NALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKD 134
E T+ K A E + + E+ Q E Y + K ++ + +++ D
Sbjct: 191 EEWTIKGGATKKGVPIHYVAKAFAEVTIHKKAAEVALQSETYLDIKTELAKERPDMRDLD 250
Query: 135 DEINNLNENLTSSLAENHI 153
+ + N+ LA I
Sbjct: 251 KPLESANKTGYEGLAGEDI 269
>gnl|CDD|200398 TIGR04147, GGGPS_Halobact, phosphoglycerol geranylgeranyltransferase,
putative. In most archaea, phosphoglycerol
geranylgeranyltransferase (EC 2.5.1.41), also known as
GGGP synthase and GGGPS, catalyzes the stereospecific
first step in the biosynthesis of their characteristic
membrane diether lipids. However, some groups of archaeal
GGGPS homologs are more closely related to certain
bacterial proteins than to each other. This family
represents the putative GGGPS family as found in the
Halobacteria.
Length = 229
Score = 29.8 bits (67), Expect = 8.9
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 1302 TEDDNVEIVETNDESEDDKSEVVETQSEHDVP 1333
T D +EI T D +E+ VV+ +E+DVP
Sbjct: 29 TGTDAIEIGGTLDMTEEKMQRVVDACAEYDVP 60
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 29.9 bits (68), Expect = 9.0
Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 17/93 (18%)
Query: 37 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKL----RELYEQ-RAAAAELEHKTMIEKE 91
LK RLK++EK+L + +K E E ++K+ ELY++ AA +++ KT
Sbjct: 91 LKARLKELEKEL----KNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKT----- 141
Query: 92 LAKNALLEAKLKEIQEEI---QAQLEKYKENKK 121
KN LLE KL+ + EE+ +AQL +
Sbjct: 142 GLKNLLLEQKLEALNEELEKKEAQLNEVLAAAN 174
Score = 29.9 bits (68), Expect = 9.0
Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 17/93 (18%)
Query: 854 LKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKL----RELYEQ-RAAAAELEHKTMIEKE 908
LK RLK++EK+L + +K E E ++K+ ELY++ AA +++ KT
Sbjct: 91 LKARLKELEKEL----KNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKT----- 141
Query: 909 LAKNALLEAKLKEIQEEI---QAQLEKYKENKK 938
KN LLE KL+ + EE+ +AQL +
Sbjct: 142 GLKNLLLEQKLEALNEELEKKEAQLNEVLAAAN 174
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 30.5 bits (68), Expect = 9.1
Identities = 12/58 (20%), Positives = 31/58 (53%)
Query: 236 MSSAMVEAVESNQTSTKAVESNQTSTSKDSANIQLDHAETNTNEESPADQDETNSTTT 293
M SA ++A++S S S+ +++ +S + +++ T++N + ++S +T
Sbjct: 90 MISANIQAIQSGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNST 147
>gnl|CDD|181395 PRK08341, PRK08341, amidophosphoribosyltransferase; Provisional.
Length = 442
Score = 30.6 bits (69), Expect = 9.2
Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 5/67 (7%)
Query: 54 EIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQL 113
+I I + +TN+ LR YE R ++ ++ ++ EL + L E +E +A
Sbjct: 95 KIAIAHNGTLTNFLPLRRKYESRG----VKFRSSVDTELIGISFLW-HYSETGDEFEAMR 149
Query: 114 EKYKENK 120
E + E K
Sbjct: 150 EVFNEVK 156
Score = 30.6 bits (69), Expect = 9.2
Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 5/67 (7%)
Query: 871 EIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQL 930
+I I + +TN+ LR YE R ++ ++ ++ EL + L E +E +A
Sbjct: 95 KIAIAHNGTLTNFLPLRRKYESRG----VKFRSSVDTELIGISFLW-HYSETGDEFEAMR 149
Query: 931 EKYKENK 937
E + E K
Sbjct: 150 EVFNEVK 156
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 30.6 bits (69), Expect = 9.6
Identities = 55/296 (18%), Positives = 116/296 (39%), Gaps = 26/296 (8%)
Query: 860 DVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKELAK--NALLEA 917
+ K+L+E KI+ + ++ + K+ +L + KE K L +
Sbjct: 893 NSNKQLVEHLLNNKIDLKNKLEQHMKIINTDNIIQKNEKLNLLNNLNKEKEKIEKQLSDT 952
Query: 918 KLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLTSSLAENHILNSQMTL 977
K+ ++ +I+ LE Y ++K+ I N ++++ + +E LN +
Sbjct: 953 KINNLKMQIEKTLEYYDKSKENI---NGNDGTHLEKLDKEKDEWEHFKSEIDKLNVNYNI 1009
Query: 978 VNNLFSNMLINPMIN-LDMLIKLIQDNYLLIIDLINNHEINDTVSLLVDLEKKIDQETLA 1036
+N +++ + ++++ KLI++ I + ++ + +SLL ++ K+ +
Sbjct: 1010 LNKKIDDLIKKQHDDIIELIDKLIKEKGKEIEEKVDQY-----ISLLEKMKTKL--SSFH 1062
Query: 1037 IQNDKQLKQLENYETPMS--SAMVEAVESNQTSTSKDSANIQLDHAE--TNTNEESPADQ 1092
D + + + + VEA+ I+ E N ++E Q
Sbjct: 1063 FNIDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEK-NKQ 1121
Query: 1093 DETNSTTTLDKSHIQKEIANNLPKVWRILIELLNTHQEQYENNDVNENFESCYKNI 1148
E + I K++ L EL N + E N+VNE E Y+ I
Sbjct: 1122 TEHYNKKKKSLEKIYKQMEKTLK-------ELENMNLEDITLNEVNE-IEIEYERI 1169
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 30.0 bits (68), Expect = 9.9
Identities = 28/113 (24%), Positives = 53/113 (46%), Gaps = 1/113 (0%)
Query: 850 LVSDLKTRLKDVEKKLLESPEEIKIEYEKQMTNYKKLRELYEQRAAAAELEHKTMIEKEL 909
L ++ LKD++ ++ E E+ +KL+EL EQ + K + K+L
Sbjct: 108 LTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREKLKELIEQYELREQHFEKLLKTKDL 167
Query: 910 AKNALLEAKLKEIQEEIQAQLEKYKENKKQILEMEKNIQLKDDEINNLNENLT 962
+ L +AKL++ QE+ + + EK+K K+ +L+ + + L L
Sbjct: 168 EQQ-LNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKETEKELRAQLN 219
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 30.6 bits (69), Expect = 10.0
Identities = 31/140 (22%), Positives = 56/140 (40%), Gaps = 13/140 (9%)
Query: 832 STPPESNILATTRSVILP--LVSDLKTRL---KDVEKKLLESPEEIKIEYEKQMTNYKKL 886
P ES + +P ++ KT K+ L+E ++ E E++ + +KL
Sbjct: 478 GIPGESYAFEIAQRYGIPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKL 537
Query: 887 R--------ELYEQRAAAAELEHKTMIEKELAKNALLEAKLKEIQEEIQAQLEKYKENKK 938
EL ++ E E +E E L+A KE++ I+ EK K
Sbjct: 538 LKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAK 597
Query: 939 QILEMEKNIQLKDDEINNLN 958
+I +E ++LK+ +
Sbjct: 598 EIKSIEDLVKLKETKQKIPQ 617
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.126 0.340
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 79,343,333
Number of extensions: 7862484
Number of successful extensions: 12221
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10886
Number of HSP's successfully gapped: 1148
Length of query: 1642
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1532
Effective length of database: 6,058,662
Effective search space: 9281870184
Effective search space used: 9281870184
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 66 (29.2 bits)