RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8739
(558 letters)
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 297 bits (763), Expect = 5e-91
Identities = 217/375 (57%), Positives = 278/375 (74%), Gaps = 1/375 (0%)
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
+LER K+ELE + RK+ E++ L++KLEDEQ LV + QK+IKEL+ARI ELEEE+EAER
Sbjct: 1 ELERQKRELENQLYRKESELSQLSSKLEDEQALVAQLQKKIKELEARIRELEEELEAERA 60
Query: 244 SRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHES 303
+RAKAEK RADL+RELEEL ERLEEAGGAT+AQIELNKKREAE++KLR+DLEEAN+QHE
Sbjct: 61 ARAKAEKARADLSRELEELSERLEEAGGATAAQIELNKKREAELAKLRKDLEEANLQHEE 120
Query: 304 TLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEK 363
LA LRKKH DA++E+ EQI+QL K K KAE E++ E ++ A DQ+A+ K EK
Sbjct: 121 ALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQIAKAKLNAEK 180
Query: 364 IAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKIS 423
AKQL+ QL+E+Q KLDE R LND + K +L ENSDL RQLEEAE+QVS LSK+K
Sbjct: 181 KAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQ 240
Query: 424 LTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANAD 483
L +QLE+ KR +EE RERA L + R LEHD+D++REQLEEE+E KA+L+RQLSKANA+
Sbjct: 241 LESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAE 300
Query: 484 AQLWRTKYESEGVARAEELEESKSE-ERRVDDLAAELDASQKECRNYSTELFRLKGAYEE 542
Q WR+K+ESEG RAEELEE K + +++ +L +A+ +C + RL+ E+
Sbjct: 301 IQQWRSKFESEGALRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELED 360
Query: 543 AQEQLEAVRRENKNL 557
Q +LE L
Sbjct: 361 LQIELERANAAASEL 375
Score = 94.0 bits (234), Expect = 4e-20
Identities = 136/520 (26%), Positives = 232/520 (44%), Gaps = 36/520 (6%)
Query: 1 AATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 60
EK AKQL+ QL+E+Q KLDE R LND + K +L ENSDL RQLEEAE+QVS LS
Sbjct: 176 LNAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLS 235
Query: 61 KIKISLTTQLEDTKRLADEES---SGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFS 117
K+K L +QLE+ KR +EES + +L + + +L+ + E S
Sbjct: 236 KLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLS 295
Query: 118 -ISLYSQDTVDRLQQEEDARNQLFQQ-KKKLEQENGGLKKDIEDLE---GDVEKQKRKIE 172
+ Q + + E R + ++ KKKL Q+ L++ E +EK K +++
Sbjct: 296 KANAEIQQWRSKFESEGALRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQ 355
Query: 173 GDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIE 232
+L+ Q + ELE+ + DK +A K+++ Q + Q++ + L +
Sbjct: 356 SELEDLQIELERANAAASELEKKQKNFDKILAEWKRKVDELQAELDTAQREARNLSTELF 415
Query: 233 ELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 292
L+ E+E + ++ +L E+ +L ++L E G + ++ EAE +L+
Sbjct: 416 RLKNELEELKDQVEALRRENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQA 475
Query: 293 DLEEA----------------NIQH------------ESTLANLRKKHNDAVSEMGEQID 324
LEEA + E N RK H A+ + ++
Sbjct: 476 ALEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLE 535
Query: 325 QLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNR 384
K K +A + + N A D + A +K K+ Q Q+ E+Q +++E R
Sbjct: 536 AEAKGKAEASRLKKKLEGDINELEIALDHANKANAEAQKNVKKYQQQVKELQTQVEEEQR 595
Query: 385 SLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERAT 444
+ D + L +LEE S + Q + + T+L + +E + ++
Sbjct: 596 AREDAREQLAVAERRATALEAELEELRSALEQAERARKQAETELAEASERVNELTAQNSS 655
Query: 445 LLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADA 484
L+ + R LE ++ ++ L+E + + KA ADA
Sbjct: 656 LIAQKRKLEGELAALQSDLDEAVNELKAAEERAKKAQADA 695
Score = 75.8 bits (187), Expect = 2e-14
Identities = 137/564 (24%), Positives = 235/564 (41%), Gaps = 42/564 (7%)
Query: 9 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTT 68
QLQ ++ E++ ++ E L AA+ K +DL R+LEE L+
Sbjct: 36 QLQKKIKELEARIRELEEELEAERAARAKAEKARADLSRELEE--------------LSE 81
Query: 69 QLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDR 128
+LE+ + EL K + DL+ N E +++ + D
Sbjct: 82 RLEEAGGATAAQI-----ELNKKREAELAKLRKDLEEANLQHEE----ALATLRKKHQDA 132
Query: 129 LQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDV----------EKQKRKIEGDLKLT 178
+ + + QL +QK K E+E L+ +++DL + EK+ +++E L
Sbjct: 133 INELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQIAKAKLNAEKKAKQLESQLSEL 192
Query: 179 QEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEV 238
Q + +L+R +L R E + LT +LE+ + V K +L++++EE + +
Sbjct: 193 QVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSL 252
Query: 239 EAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEAN 298
E E + RA + Q L +L+ L E+LEE A + K AE+ + R E
Sbjct: 253 EEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEG 312
Query: 299 IQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREK 358
L L+KK N +SE+ E + N E ++ +E + + ++
Sbjct: 313 ALRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAA 372
Query: 359 AATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 418
+ EK K L E + K+DE L+ + LS E L +LEE + QV L
Sbjct: 373 SELEKKQKNFDKILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVEALR 432
Query: 419 KIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLS 478
+ +L ++ D E GR L R LE + D ++ LEE + ++
Sbjct: 433 RENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALELEESKVL 492
Query: 479 KANADAQLWRTKYESEGVARAEELEES-KSEERRVDDLAAELDASQKECRNYSTELFRLK 537
+A + R++ E + EE E + K+ +R ++ L A L+A K E RLK
Sbjct: 493 RAQVELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAK----GKAEASRLK 548
Query: 538 GAYE----EAQEQLEAVRRENKNL 557
E E + L+ + N
Sbjct: 549 KKLEGDINELEIALDHANKANAEA 572
Score = 69.3 bits (170), Expect = 3e-12
Identities = 122/556 (21%), Positives = 244/556 (43%), Gaps = 41/556 (7%)
Query: 4 EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEA----ESQVSQL 59
EK L +L E+ +L+E + KK E + L + LEEA E ++ L
Sbjct: 66 EKARADLSRELEELSERLEEAGGATAAQIELNKKREAELAKLRKDLEEANLQHEEALATL 125
Query: 60 SKIKISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSIS 119
K +L + ++ + +E ++ E L + LD + N+ +
Sbjct: 126 RKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLA-QLDQIAKAKLNAEKKAKQ 184
Query: 120 LYSQDTVDRLQQEEDAR--NQLFQQKKKLEQENGGLKKDIEDLEGDVE---KQKRKIEGD 174
L SQ + +++ +E R N L QK +L+ EN L + +E+ E V K K ++E
Sbjct: 185 LESQLSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQ 244
Query: 175 LKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEEL 234
L+ + ++ + R + L+ +++ + ++ SL +LE+E + ++Q+ + A I++
Sbjct: 245 LEEAKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQW 304
Query: 235 EEEVEAERQSRAKA-EKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRD 293
+ E+E RA+ E+ + L +++ EL E E A + + + ++E+ L+ +
Sbjct: 305 RSKFESEGALRAEELEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIE 364
Query: 294 LEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQ 353
LE AN S L +K + ++E ++D+L A+ E E + ++
Sbjct: 365 LERAN-AAASELEKKQKNFDKILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEE 423
Query: 354 LAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQ 413
L + A + K LQ +++++ +L E R++++ + A+++L E +L LEEAE+
Sbjct: 424 LKDQVEALRRENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAA 483
Query: 414 VSQ-----------------------------LSKIKISLTTQLEDTKRLADEEGRERAT 444
+ + + +E + + E + +A
Sbjct: 484 LELEESKVLRAQVELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAE 543
Query: 445 LLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEE 504
+ LE DI+ + L+ + A+ Q+ + K + +T+ E E AR + E+
Sbjct: 544 ASRLKKKLEGDINELEIALDHANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQ 603
Query: 505 SKSEERRVDDLAAELD 520
ERR L AEL+
Sbjct: 604 LAVAERRATALEAELE 619
Score = 52.4 bits (126), Expect = 5e-07
Identities = 114/528 (21%), Positives = 218/528 (41%), Gaps = 52/528 (9%)
Query: 31 FDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESSGWVEELTK 90
+ KKKL+ + S+L E A ++ L K K L ++LED ++ E ++ EL K
Sbjct: 319 LEELKKKLNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDL-QIELERANAAASELEK 377
Query: 91 GSIEA---ITLKSLDLDVLNFTTENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKKKLE 147
+ +D L + + + +L ++ + + EE L Q + L
Sbjct: 378 KQKNFDKILAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEE-----LKDQVEALR 432
Query: 148 QENGGLKKDIEDLEG----------DVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQ 197
+EN L+ +I DL ++EK +R++E + Q A+ + E + E +
Sbjct: 433 RENKNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALELEESKVL 492
Query: 198 RKDKEIASLTAKLEDEQGLVGKNQK---QIKELQARIEELEEEVEAERQSRAKAEKQRAD 254
R E++ + + E E+ L K ++ K Q IE L+ +EAE + +A+A + +
Sbjct: 493 RAQVELSQI--RSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAEASRLKKK 550
Query: 255 LARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANL------ 308
L ++ EL L+ A A + + KK + ++ +L+ +EE E L
Sbjct: 551 LEGDINELEIALDHANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERR 610
Query: 309 RKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQL 368
+ E+ ++Q + + +AE E A N A L +K E L
Sbjct: 611 ATALEAELEELRSALEQAERARKQAETELAEASERVNELTAQNSSLIAQKRKLEGELAAL 670
Query: 369 QHQLNEVQGKL----DETNRSLND-----------------FDAAKKKLSIENSDLLRQL 407
Q L+E +L + ++ D + +K+L + +L +L
Sbjct: 671 QSDLDEAVNELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKELQVRL 730
Query: 408 EEAESQVSQLSKIKIS-LTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEE 466
+EAE+ + K I L ++ + + D E R A R +E + ++ Q+EE+
Sbjct: 731 DEAEAAALKGGKKMIQKLEARVRELEAELDGEQRRHAETQKNLRKMERRVKELQFQVEED 790
Query: 467 AEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDD 514
+ LQ + K A + ++ + E L + + +R ++D
Sbjct: 791 KKNLERLQDLVDKLQAKIKTYKRQLEEAEEVAQINLSKYRKAQRELED 838
Score = 42.7 bits (101), Expect = 5e-04
Identities = 64/315 (20%), Positives = 143/315 (45%), Gaps = 8/315 (2%)
Query: 4 EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIK 63
E K Q + +Q L+ + + KKKL + ++L L+ A ++ K
Sbjct: 517 ENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALDHANKANAEAQKNV 576
Query: 64 ISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQ 123
Q+++ + +EE + + ++ +L+ ++ + + ++
Sbjct: 577 KKYQQQVKELQTQVEEEQRAREDAREQLAVAERRATALEAELEELRSALEQAERARKQAE 636
Query: 124 ----DTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEG---DVEKQKRKIEGDLK 176
+ +R+ + + L QK+KLE E L+ D+++ E++ +K + D
Sbjct: 637 TELAEASERVNELTAQNSSLIAQKRKLEGELAALQSDLDEAVNELKAAEERAKKAQADAA 696
Query: 177 LTQEAVADLERNKKELEQTIQRKDKEIASLTAKLED-EQGLVGKNQKQIKELQARIEELE 235
E + + + + LE+ ++ + ++ L +L++ E + +K I++L+AR+ ELE
Sbjct: 697 RLAEELRQEQEHSQHLERLRKQLESQVKELQVRLDEAEAAALKGGKKMIQKLEARVRELE 756
Query: 236 EEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLE 295
E++ E++ A+ +K + R ++EL ++EE +L K +A++ +R LE
Sbjct: 757 AELDGEQRRHAETQKNLRKMERRVKELQFQVEEDKKNLERLQDLVDKLQAKIKTYKRQLE 816
Query: 296 EANIQHESTLANLRK 310
EA + L+ RK
Sbjct: 817 EAEEVAQINLSKYRK 831
Score = 40.8 bits (96), Expect = 0.002
Identities = 97/422 (22%), Positives = 186/422 (44%), Gaps = 40/422 (9%)
Query: 8 KQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQ--------- 58
K LQ +++++ +L E R++++ + A+++L E +L LEEAE+ +
Sbjct: 436 KNLQDEIHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAEAALELEESKVLRAQ 495
Query: 59 --LSKIKISLTTQL-------EDTKR---LADEESSGWVEELTKGSIEAITLKS-LDLDV 105
LS+I+ + +L E+T++ A E +E KG EA LK L+ D+
Sbjct: 496 VELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEGDI 555
Query: 106 LNFTTENSHSFSISLYSQDTVDRLQQE---------------EDARNQLFQQKKKLEQEN 150
H+ + +Q V + QQ+ EDAR QL +++
Sbjct: 556 NELEIALDHANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERRATALE 615
Query: 151 GGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKL 210
L++ LE E+ +++ E +L E V +L L ++ + E+A+L + L
Sbjct: 616 AELEELRSALE-QAERARKQAETELAEASERVNELTAQNSSLIAQKRKLEGELAALQSDL 674
Query: 211 EDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEA- 269
++ + +++ K+ QA L EE+ E++ E+ R L +++EL RL+EA
Sbjct: 675 DEAVNELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKELQVRLDEAE 734
Query: 270 GGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKL 329
A ++ +K EA + +L +L+ +H T NLRK V E+ Q+++ K
Sbjct: 735 AAALKGGKKMIQKLEARVRELEAELDGEQRRHAETQKNLRKMER-RVKELQFQVEEDKKN 793
Query: 330 KTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDF 389
+ + + + + ++ ++ Q +L + + + D RSLN
Sbjct: 794 LERLQDLVDKLQAKIKTYKRQLEEAEEVAQINLSKYRKAQRELEDAEERADTAERSLNKL 853
Query: 390 DA 391
A
Sbjct: 854 RA 855
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 72.1 bits (177), Expect = 5e-13
Identities = 79/337 (23%), Positives = 160/337 (47%), Gaps = 25/337 (7%)
Query: 123 QDTVDRLQQEEDARNQLFQQKKKLE------QENGGLKKDIEDLEGDV--------EKQK 168
+ T + L++ ED +L +Q +KLE + LK ++ +LE + K+
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKEL 241
Query: 169 RKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQ 228
++E +L +E + +L+ +E E+ I+ E+ L +LE+ Q + + +++I+EL+
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301
Query: 229 ARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMS 288
I L E +E + E + +L LEEL E++E A ++E + E+
Sbjct: 302 GEISLLRERLE-------ELENELEELEERLEELKEKIE----ALKEELEERETLLEELE 350
Query: 289 KLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTR 348
+L +LEEA + E L+ L ++ + + E++ +L + +E E +
Sbjct: 351 QLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLE 410
Query: 349 AAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLE 408
++L+ ++ K+L+ +L E+Q +L+E N L + + ++L +L R+L
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELA 470
Query: 409 EAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATL 445
E + ++ +L K SL +L+ + RA L
Sbjct: 471 ELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVL 507
Score = 67.4 bits (165), Expect = 1e-11
Identities = 76/347 (21%), Positives = 153/347 (44%), Gaps = 21/347 (6%)
Query: 221 QKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELN 280
++++ EL+A++E+LEEE+++ + E +L R+LEEL +LEE ++
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKR----ELAAL 728
Query: 281 KKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEH----- 335
++ ++ +LEE + E L L+++ + E+ + L KLK + E
Sbjct: 729 EEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKR 788
Query: 336 -----ERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFD 390
E E D L RE + E+ ++L+ ++ E++ +++E L++ +
Sbjct: 789 QALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848
Query: 391 AAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFR 450
++L E +L +LEE E++ +L L + E+ + E E A L +
Sbjct: 849 EELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIE 908
Query: 451 NLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEER 510
L ++ + +LE +L+ +L + D E+E E LEE
Sbjct: 909 KLRERLEELEAKLERLEVELPELEEELEEEYED------TLETELEREIERLEEEIEALG 962
Query: 511 RVDDLA-AELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKN 556
V+ A E + ++ ++ L+ A E+ E +E + +E +
Sbjct: 963 PVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRE 1009
Score = 62.0 bits (151), Expect = 6e-10
Identities = 67/346 (19%), Positives = 155/346 (44%), Gaps = 15/346 (4%)
Query: 127 DRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIE---GDLKLTQEAVA 183
+ L + E +L ++ K L+ E L+ +E+L +E+ +R++E +L +E +
Sbjct: 674 EELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELE 733
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
L+ +ELE+ ++ ++E+ L +LE+ + + ++ + +L+ IEELEE+ +A ++
Sbjct: 734 QLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQE 793
Query: 244 SRAKAEKQRADLARELEELGERL---EEAGGATSAQIELNKKREAEMSKLRRDLEEANIQ 300
+ E++ + R L+ L L E+ +IE ++ E+ + +LEE
Sbjct: 794 ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEE--- 850
Query: 301 HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAA 360
L ++ + + E+ + ++L + E E+ E + +L E
Sbjct: 851 -LEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEK 909
Query: 361 TEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKI 420
+ ++L+ +L ++ +L E L + + +E R++E E ++ L +
Sbjct: 910 LRERLEELEAKLERLEVELPELEEELEEEYEDTLETELE-----REIERLEEEIEALGPV 964
Query: 421 KISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEE 466
+ + E+ + +E +R L L I+ + ++ E
Sbjct: 965 NLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRER 1010
Score = 60.9 bits (148), Expect = 1e-09
Identities = 75/359 (20%), Positives = 157/359 (43%), Gaps = 8/359 (2%)
Query: 25 NRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESSGW 84
RS K+L E ++L QLE+ E ++ L SL LE+ +R +E
Sbjct: 659 KRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQL 718
Query: 85 VEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKK 144
E + + L+ L + E Q+ ++ L++E ++ + + K
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLK 778
Query: 145 KLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIA 204
+ +E ++ +++ ++E++ + E L + + LE+ ++ LEQ I+ ++EI
Sbjct: 779 EEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIE 838
Query: 205 SLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGE 264
L KL++ + + + +K+++EL+ +EELE E E + E+++ +L EL EL
Sbjct: 839 ELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELES 898
Query: 265 RLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTL-ANLRKKHNDAVSEMGEQI 323
L E E ++ EA++ +L +L E + E L + + + E+I
Sbjct: 899 ELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEI 958
Query: 324 DQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDET 382
+ L + +A E ++L ++ E+ ++L + E+ + E
Sbjct: 959 EALGPVNLRAIEEYEEVEER-------YEELKSQREDLEEAKEKLLEVIEELDKEKRER 1010
Score = 58.6 bits (142), Expect = 7e-09
Identities = 78/340 (22%), Positives = 153/340 (45%), Gaps = 14/340 (4%)
Query: 165 EKQKRKIEGDLKLTQEAVADLERNKKELEQTIQ--RKDKEIA----SLTAKLEDEQGLVG 218
+++K + E L+ T+E + LE +ELE+ ++ + E A L A+L + + +
Sbjct: 171 KERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALL 230
Query: 219 -----KNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGAT 273
+ +K+++EL+ + LEEE+E ++ +AEK+ +L ELEEL E LEE
Sbjct: 231 LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEEL 290
Query: 274 SAQIELNKKREAEMSKLRR---DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLK 330
E ++ E E+S LR +LE + E L L++K E+ E+ L +L+
Sbjct: 291 LELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELE 350
Query: 331 TKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFD 390
A A ++L A + +L+ +L E++ +L+E R + +
Sbjct: 351 QLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLE 410
Query: 391 AAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFR 450
++LS DL +L+E E+++ +L L +LE+ + +E L +
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELA 470
Query: 451 NLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTK 490
L+ ++ + ++L L+ + + +
Sbjct: 471 ELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEAL 510
Score = 53.9 bits (130), Expect = 2e-07
Identities = 68/339 (20%), Positives = 145/339 (42%), Gaps = 13/339 (3%)
Query: 224 IKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKR 283
+ + + R EE E ++E ++ + E +L ++LE+L + E+A Q + R
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKA---ERYQELKAELR 223
Query: 284 EAEMSKLRRDLEEANIQHESTLANLR------KKHNDAVSEMGEQIDQLNKLKTKAEHER 337
E E++ L L+E + E L ++ + + E ++I++L + E
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283
Query: 338 NAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLS 397
E + ++L E + + ++L+++L E++ +L+E + ++
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERE 343
Query: 398 IENSDLLRQLEEAESQVSQ----LSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLE 453
+L + L E E + LS + L E + E E A + + L+
Sbjct: 344 TLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELK 403
Query: 454 HDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVD 513
+I+++ E+LE +E DL+ +L + A+ + +T+ E E E+ + R+
Sbjct: 404 REIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLK 463
Query: 514 DLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRR 552
+L EL Q+E + EL L+ + + + A +
Sbjct: 464 ELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQG 502
Score = 53.9 bits (130), Expect = 2e-07
Identities = 58/270 (21%), Positives = 121/270 (44%), Gaps = 18/270 (6%)
Query: 4 EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIK 63
E A+ + +L + KL E + L + + +L E +L +LEEAE ++ +L
Sbjct: 217 ELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSEL 276
Query: 64 ISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQ 123
L +LE+ + + ++ L+ ++ + +
Sbjct: 277 EELREELEELQ--------------EELLELKEEIEELEGEISLLRERLEELENELEELE 322
Query: 124 DTVDRLQQEEDARNQLF----QQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQ 179
+ ++ L+++ +A + ++LEQ L++ E+LE + ++E + +
Sbjct: 323 ERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALR 382
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVE 239
E +A+LE E+ ++ +EI SL +LE + ++++KEL+A +EEL+ E+E
Sbjct: 383 EELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELE 442
Query: 240 AERQSRAKAEKQRADLARELEELGERLEEA 269
+ + E+Q +L L+EL L E
Sbjct: 443 ELNEELEELEEQLEELRDRLKELERELAEL 472
Score = 52.0 bits (125), Expect = 8e-07
Identities = 53/256 (20%), Positives = 123/256 (48%), Gaps = 14/256 (5%)
Query: 123 QDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEG---DLKLTQ 179
++ +++ + +L ++ ++L++E LK++IE+LEG++ + ++E +L+ +
Sbjct: 263 EEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELE 322
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQ----IKELQARIEELE 235
E + +L+ + L++ ++ ++ + L L + + + +++ ++EL+ E L
Sbjct: 323 ERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALR 382
Query: 236 EEVEAERQSRAKAEKQRADLARELEEL---GERLEEAGGATSAQIELNKKREAEMSKLRR 292
EE+ A+ + +L RE+E L ERL E ++ K+ EAE+ +L+
Sbjct: 383 EELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL---KELEAELEELQT 439
Query: 293 DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAAD 352
+LEE N + E L + D + E+ ++ +L + + E E ++ + A
Sbjct: 440 ELEELNEELEELEEQLEELR-DRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQR 498
Query: 353 QLAREKAATEKIAKQL 368
+A E + L
Sbjct: 499 ASQGVRAVLEALESGL 514
Score = 48.6 bits (116), Expect = 8e-06
Identities = 50/203 (24%), Positives = 93/203 (45%), Gaps = 7/203 (3%)
Query: 355 AREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQV 414
R A ++ K+L+ +L E++ +L++ L + L +L RQLEE E Q+
Sbjct: 659 KRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQL 718
Query: 415 SQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQ 474
+L + +L +LE + +E E L + L+ ++ + E+LE E A L+
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLK 778
Query: 475 RQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELF 534
++ + Q + E EE + ERR+D L EL++ ++ E+
Sbjct: 779 EEIEELEEKRQ-------ALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831
Query: 535 RLKGAYEEAQEQLEAVRRENKNL 557
L+ EE +E+L+ + E + L
Sbjct: 832 ELEEEIEELEEKLDELEEELEEL 854
Score = 45.5 bits (108), Expect = 6e-05
Identities = 59/261 (22%), Positives = 112/261 (42%), Gaps = 7/261 (2%)
Query: 304 TLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEK 363
L + + ++ E++ L E + ++L RE AA E+
Sbjct: 671 ELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEE 730
Query: 364 IAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIEN-------SDLLRQLEEAESQVSQ 416
+QLQ +L E++ +L+E L + ++L E + L ++EE E +
Sbjct: 731 ELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQA 790
Query: 417 LSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQ 476
L + L +LE+ +R D RE +L + LE +I+ + E++EE E +L+ +
Sbjct: 791 LQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEE 850
Query: 477 LSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFRL 536
L + + + + + E + E +E K E ++L EL + E E+ +L
Sbjct: 851 LEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKL 910
Query: 537 KGAYEEAQEQLEAVRRENKNL 557
+ EE + +LE + E L
Sbjct: 911 RERLEELEAKLERLEVELPEL 931
Score = 44.7 bits (106), Expect = 1e-04
Identities = 48/196 (24%), Positives = 87/196 (44%), Gaps = 3/196 (1%)
Query: 362 EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIK 421
E A+ + +L + KL E + L + + +L E +L +LEEAE ++ +L
Sbjct: 217 ELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSEL 276
Query: 422 ISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKAN 481
L +LE+ + E E L G+ L ++ + +LEE E +L+ ++
Sbjct: 277 EELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALK 336
Query: 482 ADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYE 541
+ + T E AE E + E ++ L EL+ + R EL L+
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALRE---ELAELEAELA 393
Query: 542 EAQEQLEAVRRENKNL 557
E + +LE ++RE ++L
Sbjct: 394 EIRNELEELKREIESL 409
Score = 43.5 bits (103), Expect = 3e-04
Identities = 55/226 (24%), Positives = 100/226 (44%), Gaps = 10/226 (4%)
Query: 332 KAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDA 391
+ E E + ++L E EK ++L+ +L E++ +L+E L +
Sbjct: 236 ELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKE 295
Query: 392 AKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRN 451
++L E S L +LEE E+++ +L + L ++E K EE ER TLL +
Sbjct: 296 EIEELEGEISLLRERLEELENELEELEERLEELKEKIEALK----EELEERETLLEELEQ 351
Query: 452 LEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERR 511
L +++ +E+LEE+ + +L +A R + AE E + +R
Sbjct: 352 LLAELEEAKEELEEKLSALLEELEELFEA------LREELAELEAELAEIRNELEELKRE 405
Query: 512 VDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNL 557
++ L L+ + + EL L+ EE Q +LE + E + L
Sbjct: 406 IESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEEL 451
Score = 42.0 bits (99), Expect = 0.001
Identities = 51/272 (18%), Positives = 119/272 (43%), Gaps = 5/272 (1%)
Query: 1 AATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 60
+ E+ ++LQ +L E + +++E L + ++L E +L ++EE E ++S L
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308
Query: 61 -KIKISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSIS 119
+++ E +RL + + + E + + L + +S
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368
Query: 120 LYSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQ 179
++ + + + +L + ++ E LK++IE LE +E+ ++E + +
Sbjct: 369 ALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELK 428
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVE 239
E A+LE + ELE+ ++E+ L +LE+ + + + ++++ ELQ ++ LE+E+
Sbjct: 429 ELEAELEELQTELEE----LNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELS 484
Query: 240 AERQSRAKAEKQRADLARELEELGERLEEAGG 271
+ + E ++ L G
Sbjct: 485 SLEARLDRLEAEQRASQGVRAVLEALESGLPG 516
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 70.5 bits (173), Expect = 1e-12
Identities = 64/301 (21%), Positives = 133/301 (44%), Gaps = 6/301 (1%)
Query: 39 SIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESSGWVEELTKGSIEAITL 98
E +L ++EE E ++++L K L +LE+ + ++ E + S L
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735
Query: 99 KSLDLDVLNFTTENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIE 158
L+ +V + +S + +++ E+ + ++ + E E L+ IE
Sbjct: 736 ARLEAEVEQLEERIAQ---LSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIE 792
Query: 159 DLEGDVEKQKRKI---EGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQG 215
L+ +++ + + +L L E A+L + LE+ I ++ + L ++E+
Sbjct: 793 QLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852
Query: 216 LVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSA 275
+ +I+EL+ IEELE E+EA RA E+ A L ELEEL E L E S
Sbjct: 853 DIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSE 912
Query: 276 QIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEH 335
++ ++++L LE ++ ++ L ++++ + E +++ + +A
Sbjct: 913 LRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARR 972
Query: 336 E 336
Sbjct: 973 R 973
Score = 65.1 bits (159), Expect = 7e-11
Identities = 55/279 (19%), Positives = 125/279 (44%), Gaps = 21/279 (7%)
Query: 126 VDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGD----------VEKQKRKIEGDL 175
V RL++ + +L ++ K+ E+E L ++++LE +E++ +++ +L
Sbjct: 231 VLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKEL 290
Query: 176 KLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELE 235
++ LE+ K+ L + + ++++ L A+LE+ + + + +++ EL+ ++EEL+
Sbjct: 291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELK 350
Query: 236 EEVEAERQSRAKAEKQRADLARELEELGERLEEAGGA---TSAQIELNKKREAEMSKLRR 292
EE+E+ + E + +L LEEL E+LE QI +
Sbjct: 351 EELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410
Query: 293 DLEEA-NIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAA 351
LE+ + L+K + E+ ++++L + + + E A
Sbjct: 411 RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERL-------EEAL 463
Query: 352 DQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFD 390
++L E E+ + +L ++Q +LD R + +
Sbjct: 464 EELREELEEAEQALDAAERELAQLQARLDSLERLQENLE 502
Score = 63.5 bits (155), Expect = 2e-10
Identities = 73/355 (20%), Positives = 142/355 (40%), Gaps = 54/355 (15%)
Query: 183 ADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAER 242
+LE +ELE+ I +K +A L +LE+ + + + +K+++EL +I L +++
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739
Query: 243 QSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHE 302
+ E++ A L++EL EL +EE E + EAE+ +L +E+ + +
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799
Query: 303 STLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATE 362
+ L + +L L +A N R + L R AATE
Sbjct: 800 ALREALDELRA-----------ELTLLNEEA-----------ANLRERLESLERRIAATE 837
Query: 363 KIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKI 422
+ + L+ Q+ E+ ++ + + + +EE ES++ L +
Sbjct: 838 RRLEDLEEQIEELSEDIESLAAEIEELEE--------------LIEELESELEALLNERA 883
Query: 423 SLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEE----EAEGKADLQRQLS 478
SL L + +E E L K L +++ +RE+L + + +
Sbjct: 884 SLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQE 943
Query: 479 KANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTEL 533
+ + + L LEE+++ E +++D E K N EL
Sbjct: 944 RLSEEYSL--------------TLEEAEALENKIEDDEEEARRRLKRLENKIKEL 984
Score = 60.5 bits (147), Expect = 2e-09
Identities = 63/272 (23%), Positives = 120/272 (44%), Gaps = 8/272 (2%)
Query: 282 KREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAH 341
+R+AE ++ ++L+ + E L LR + E+ E++++L + +AE E
Sbjct: 206 ERQAEKAERYKELKAELRELELALLVLRLE------ELREELEELQEELKEAEEELEELT 259
Query: 342 NETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENS 401
E ++L E + E+ ++LQ +L + ++ + L +
Sbjct: 260 AELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLE 319
Query: 402 DLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIRE 461
+L QLEE ES++ +L++ L +LE+ K + E L + LE ++ + E
Sbjct: 320 ELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEE 379
Query: 462 QLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDA 521
QLE A L+ Q++ N + + + E R +E + +++++ AEL
Sbjct: 380 QLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEE--AELKE 437
Query: 522 SQKECRNYSTELFRLKGAYEEAQEQLEAVRRE 553
Q E EL L+ E +E LE +R E
Sbjct: 438 LQAELEELEEELEELQEELERLEEALEELREE 469
Score = 60.5 bits (147), Expect = 2e-09
Identities = 72/352 (20%), Positives = 157/352 (44%), Gaps = 12/352 (3%)
Query: 144 KKLEQENGGLKKDIEDLE---GDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKD 200
++LE++ L++ I +LE ++ K+ ++E +L+ ++ + +L R L + + R +
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739
Query: 201 KEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELE 260
E+ L ++ + + + +I+EL+ R+EE EEE+ + E Q L EL+
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799
Query: 261 ELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMG 320
L E L+E + E + L R + + E L ++ ++ + +
Sbjct: 800 ALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLE-DLEEQIEELSEDIESLA 858
Query: 321 EQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLD 380
+I++L +L + E E A NE + A L E + ++L+ + +E++ +L+
Sbjct: 859 AEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELE 918
Query: 381 ETNRSLNDFDAAKKKLSIENSDLLRQL--------EEAESQVSQLSKIKISLTTQLEDTK 432
E L + + L + +L +L EEAE+ +++ + +L+ +
Sbjct: 919 ELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLE 978
Query: 433 RLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADA 484
E G + ++ L+ D + Q E+ E K L+ + + + +A
Sbjct: 979 NKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREA 1030
Score = 57.0 bits (138), Expect = 2e-08
Identities = 75/339 (22%), Positives = 160/339 (47%), Gaps = 28/339 (8%)
Query: 144 KKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEI 203
+KLE+ L + +ED+ ++E+Q + +E + + +L+ +ELE + E
Sbjct: 179 RKLERTRENLDR-LEDILNELERQLKSLERQAEKAERY-KELKAELRELELALLVLRLE- 235
Query: 204 ASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELG 263
L +LE+ Q + + +++++EL A ++ELEE++E R ++ E++ +L +EL L
Sbjct: 236 -ELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALA 294
Query: 264 ERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQI 323
+ Q ++ ++R A + + +LE + ES L L ++ ++E+ E++
Sbjct: 295 NEISRL----EQQKQILRERLANLERQLEELEAQLEELESKLDELAEE----LAELEEKL 346
Query: 324 DQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETN 383
++L + E E E + ++L + QL+ Q+ + +++
Sbjct: 347 EELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLE 406
Query: 384 RSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERA 443
L + +++L E +LL++LEEAE L++ + +E E
Sbjct: 407 ARLERLEDRRERLQQEIEELLKKLEEAE----------------LKELQAELEELEEELE 450
Query: 444 TLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANA 482
L + LE ++ +RE+LEE + +R+L++ A
Sbjct: 451 ELQEELERLEEALEELREELEEAEQALDAAERELAQLQA 489
Score = 55.8 bits (135), Expect = 5e-08
Identities = 70/306 (22%), Positives = 140/306 (45%), Gaps = 3/306 (0%)
Query: 124 DTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVA 183
T + L + ED N+L +Q K LE++ ++ E E + + L+ +E +
Sbjct: 183 RTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELE 242
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
+L+ KE E+ ++ E+ L KLE+ + V + +++I+ELQ + L E+ Q
Sbjct: 243 ELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQ 302
Query: 244 SRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHES 303
+ ++ A+L R+LEEL +LEE E + E ++ +L+ +LE + E
Sbjct: 303 QKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELE- 361
Query: 304 TLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEK 363
L ++ + E+ EQ++ L + E + + +NE A ++L + ++
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421
Query: 364 IAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKIS 423
++L +L E + L E L + + ++L E L LEE ++ + + +
Sbjct: 422 EIEELLKKLEEAE--LKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDA 479
Query: 424 LTTQLE 429
+L
Sbjct: 480 AERELA 485
Score = 51.2 bits (123), Expect = 1e-06
Identities = 59/279 (21%), Positives = 121/279 (43%), Gaps = 25/279 (8%)
Query: 279 LNKKRE-AEMSKLRRDLEEANIQHESTLANLRKKH---NDAVSEMGEQIDQLNKLKTKAE 334
L ++RE E+ + +LEE + E LA LRK+ + + ++ +++++L++ +
Sbjct: 673 LERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732
Query: 335 HERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKK 394
+ E QL++E E ++L+ +L E + +L E + + +A +
Sbjct: 733 KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIE 792
Query: 395 KLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEH 454
+L E L L+E ++++ L++ +L +LE +R R L + L
Sbjct: 793 QLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852
Query: 455 DIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDD 514
DI+++ ++EE E +L+ +L L E S E +
Sbjct: 853 DIESLAAEIEELEELIEELESEL---------------------EALLNERASLEEALAL 891
Query: 515 LAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRE 553
L +EL+ +E R ++ L+ EE +E+L +
Sbjct: 892 LRSELEELSEELRELESKRSELRRELEELREKLAQLELR 930
Score = 44.3 bits (105), Expect = 2e-04
Identities = 42/214 (19%), Positives = 92/214 (42%)
Query: 337 RNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKL 396
++ R ++L + E+ +L+ L E++ +L+E L ++L
Sbjct: 665 SAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEEL 724
Query: 397 SIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDI 456
S + S L + L E++V QL + L+ +L + + +E + E +I
Sbjct: 725 SRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEI 784
Query: 457 DNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLA 516
+ + Q+E+ E L+ L + A+ L + + + ERR++DL
Sbjct: 785 EELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLE 844
Query: 517 AELDASQKECRNYSTELFRLKGAYEEAQEQLEAV 550
+++ ++ + + E+ L+ EE + +LEA+
Sbjct: 845 EQIEELSEDIESLAAEIEELEELIEELESELEAL 878
Score = 44.3 bits (105), Expect = 2e-04
Identities = 37/210 (17%), Positives = 87/210 (41%)
Query: 349 AAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLE 408
+L + A K ++L+ +L +++ +L+E +R ++ +L E L ++
Sbjct: 691 EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIA 750
Query: 409 EAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAE 468
+ ++++L L +LE+ + E E L + L+ ++ +RE L+E
Sbjct: 751 QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRA 810
Query: 469 GKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKECRN 528
L + + + + + + E+ + ++ LAAE++ ++
Sbjct: 811 ELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870
Query: 529 YSTELFRLKGAYEEAQEQLEAVRRENKNLA 558
+EL L +E L +R E + L+
Sbjct: 871 LESELEALLNERASLEEALALLRSELEELS 900
Score = 43.9 bits (104), Expect = 2e-04
Identities = 50/248 (20%), Positives = 112/248 (45%), Gaps = 14/248 (5%)
Query: 310 KKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQ 369
++ + + E+ E+I +L K + E E R ++L+R+ +A K +L+
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739
Query: 370 HQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLE 429
++ +++ ++ + ++ L + +A ++L + +L EAE+++ +L L +L+
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799
Query: 430 DTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRT 489
+ DE E L + NL ++++ ++ DL+ Q+ + + D +
Sbjct: 800 ALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIE---- 855
Query: 490 KYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEA 549
+ A E+EE E +++L +EL+A E + L L+ EE E+L
Sbjct: 856 -------SLAAEIEE---LEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905
Query: 550 VRRENKNL 557
+ + L
Sbjct: 906 LESKRSEL 913
Score = 43.5 bits (103), Expect = 3e-04
Identities = 70/273 (25%), Positives = 129/273 (47%), Gaps = 26/273 (9%)
Query: 8 KQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLT 67
K+L+ +L E++ L + + +++L +L +L+EAE ++ +L+ L
Sbjct: 216 KELKAELRELELAL-----LVLRLEELREELE----ELQEELKEAEEELEELTAELQELE 266
Query: 68 TQLEDTKRLADEESSGWVEELTKGSIEAITLKS---LDLDVLNFTTENSHSFSISLYSQD 124
+LE RL E +EEL K S +L N L +Q
Sbjct: 267 EKLE-ELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQ- 324
Query: 125 TVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKI---EGDLKLTQEA 181
L++ E ++L ++ +LE++ LK+++E LE ++E+ + ++ E L+ +E
Sbjct: 325 ----LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQ 380
Query: 182 VADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQ-----ARIEELEE 236
+ L +LE I + EI L A+LE + + Q++I+EL A ++EL+
Sbjct: 381 LETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQA 440
Query: 237 EVEAERQSRAKAEKQRADLARELEELGERLEEA 269
E+E + + +++ L LEEL E LEEA
Sbjct: 441 ELEELEEELEELQEELERLEEALEELREELEEA 473
Score = 43.1 bits (102), Expect = 4e-04
Identities = 56/272 (20%), Positives = 119/272 (43%), Gaps = 14/272 (5%)
Query: 7 AKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISL 66
++L+ +L E++ ++ E + + L+ E S L +Q + +++ L + L
Sbjct: 262 LQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEEL 321
Query: 67 TTQLEDT--KRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISL---- 120
QLE+ K E +EE + E + +L+ L E S L
Sbjct: 322 EAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381
Query: 121 --YSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLT 178
Q N++ + + +LE+ ++ +++E E K+ E +LK
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIE---ELLKKLEEAELKEL 438
Query: 179 QEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARI---EELE 235
Q + +LE +EL++ ++R ++ + L +LE+ + + ++++ +LQAR+ E L+
Sbjct: 439 QAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQ 498
Query: 236 EEVEAERQSRAKAEKQRADLARELEELGERLE 267
E +E + K ++ L+ L L E +
Sbjct: 499 ENLEGFSEGVKALLKNQSGLSGILGVLSELIS 530
Score = 35.0 bits (81), Expect = 0.11
Identities = 53/287 (18%), Positives = 113/287 (39%), Gaps = 43/287 (14%)
Query: 4 EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIK 63
E ++L+ +L E + +L E + + +A ++L E L L+E ++++ L++
Sbjct: 760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA 819
Query: 64 ISLTTQLED-TKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYS 122
+L +LE +R+A E E + +
Sbjct: 820 ANLRERLESLERRIAATERRLEDLEEQIEELSED------------IESLAAEIE----- 862
Query: 123 QDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDV---EKQKRKIEGDLKLTQ 179
+ + +++ E L ++ LE+ L+ ++E+L ++ E ++ ++ +L+ +
Sbjct: 863 -ELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELR 921
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKN-----------QKQIKELQ 228
E +A LE + LE I L+ + ++++K L+
Sbjct: 922 EKLAQLELRLEGLEVRIDNL---QERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLE 978
Query: 229 ARIEEL-------EEEVEAERQSRAKAEKQRADLARELEELGERLEE 268
+I+EL EE E ++ Q+ DL E L E +EE
Sbjct: 979 NKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEE 1025
Score = 31.6 bits (72), Expect = 1.5
Identities = 32/155 (20%), Positives = 63/155 (40%), Gaps = 7/155 (4%)
Query: 403 LLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQ 462
L+ +LEE ++ +L + +LE+ E + L + LE +I+ ++++
Sbjct: 230 LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKE 289
Query: 463 LEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDAS 522
L A + L++Q + E E E ++D+LA EL
Sbjct: 290 LYALANEISRLEQQKQILRERLANLERQLEELEAQLEEL-------ESKLDELAEELAEL 342
Query: 523 QKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNL 557
+++ EL L+ EE + +LE + + L
Sbjct: 343 EEKLEELKEELESLEAELEELEAELEELESRLEEL 377
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 63.2 bits (154), Expect = 3e-10
Identities = 70/326 (21%), Positives = 152/326 (46%), Gaps = 18/326 (5%)
Query: 186 ERNKKELEQTIQRKDKEIAS--LTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
ER K E Q + ++ +E L + E + ++Q+ L+ +E+L EE+ +
Sbjct: 206 EREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEK 265
Query: 244 SRAKAEKQRADLARELEELGE----RLEEAGGATSAQIELNKKREAEMSKLRRDLEEANI 299
+ E+ +L +++++LGE R++E G A+I ++ AE + D EE
Sbjct: 266 RLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLA 325
Query: 300 QHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKA 359
+ E+ + L + E+ +I++ K + K E E + RA +++ +E A
Sbjct: 326 KLEAEIDKLLAEI----EELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFA 381
Query: 360 ATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSK 419
T K + +L +++ +++E R L+ ++LS E +DL + E+++++L +
Sbjct: 382 ETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEE 441
Query: 420 IKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSK 479
K +++ + ++ + + + +L+ + D + ++L + LQR+L++
Sbjct: 442 EKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSK-------LQRELAE 494
Query: 480 ANADAQLWRTKYESEGVARAEELEES 505
A A A+ G A E L+ S
Sbjct: 495 AEAQARASEE-RVRGGRAVEEVLKAS 519
Score = 61.2 bits (149), Expect = 9e-10
Identities = 66/325 (20%), Positives = 149/325 (45%), Gaps = 42/325 (12%)
Query: 127 DRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEG---DLKLTQEAVA 183
L++ E+ ++L Q+ ++ G ++K+IE LE + EK K ++E DL ++ +
Sbjct: 695 SELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIE 754
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLED------------EQGLVGKNQKQIKELQARI 231
+++ KELE I+ ++++ L L D Q + K ++++ ++AR+
Sbjct: 755 NVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARL 814
Query: 232 EELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLR 291
E+E+++ + EK+ +L + +L E+++ +IE ++ E+ +
Sbjct: 815 REIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIE----KEIENLNGKKEELEEEL 870
Query: 292 RDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAA 351
+LE A ES L +L+K+ ++ +++ E ++ +L+ + E +R A
Sbjct: 871 EELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALE 930
Query: 352 DQLAR---------EKAATEKIAKQLQHQLNEVQGKL--------------DETNRSLND 388
++L+ E E + +Q +L V+ ++ +E + L++
Sbjct: 931 EELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDE 990
Query: 389 FDAAKKKLSIENSDLLRQLEEAESQ 413
+ KL E +L ++EE E +
Sbjct: 991 LKEKRAKLEEERKAILERIEEYEKK 1015
Score = 58.2 bits (141), Expect = 8e-09
Identities = 53/248 (21%), Positives = 105/248 (42%), Gaps = 6/248 (2%)
Query: 123 QDTVDRLQQEEDARNQLFQQKKKL-EQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEA 181
+ RL++ E +L ++ K L E+E +K+ I +LE ++ +R I +
Sbjct: 261 SELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIA----EKERE 316
Query: 182 VADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAE 241
+ D E +LE I + EI L ++E+E+ K ++ EL+ +E+L E+E
Sbjct: 317 LEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEV 376
Query: 242 RQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQH 301
+ A+ + D +LE+L + E E ++ E++ L + +
Sbjct: 377 DKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKI 436
Query: 302 ESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAAT 361
+ L ++ + + +++QL +K E E E + +L RE A
Sbjct: 437 -NELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEA 495
Query: 362 EKIAKQLQ 369
E A+ +
Sbjct: 496 EAQARASE 503
Score = 53.9 bits (130), Expect = 2e-07
Identities = 70/332 (21%), Positives = 138/332 (41%), Gaps = 25/332 (7%)
Query: 223 QIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKK 282
+++ L+ R+E L+ E+ + + + E + +L++EL + ++ E +IE ++
Sbjct: 675 ELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEI----EKEIEQLEQ 730
Query: 283 REAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTK-AEHERNAAH 341
E ++ + +LEE E + N++ + + + + E + L+KL+ + E +H
Sbjct: 731 EEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790
Query: 342 NETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENS 401
+ +A +L E + E L E + LN K+ L E
Sbjct: 791 SRIPEIQAELSKLEEEVSRIEAR--------------LREIEQKLNRLTLEKEYLEKEIQ 836
Query: 402 DLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIRE 461
+L Q + + Q+ + K +L + E+ + +E L + +L+ + D +
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEA 896
Query: 462 QLEEEAEGKADLQRQLSKANADAQLWRTKYE------SEGVARAEELEESKSEERRVDDL 515
QL E +L+ Q+ K + K E SE E EE EE ++D+
Sbjct: 897 QLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDV 956
Query: 516 AAELDASQKECRNYSTELFRLKGAYEEAQEQL 547
AEL ++E R YEE ++L
Sbjct: 957 QAELQRVEEEIRALEPVNMLAIQEYEEVLKRL 988
Score = 45.1 bits (107), Expect = 9e-05
Identities = 43/231 (18%), Positives = 101/231 (43%), Gaps = 8/231 (3%)
Query: 329 LKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLND 388
LK K E+E E + + R+ A+ E+ ++L +++E++ +L+E + L +
Sbjct: 217 LKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEE 276
Query: 389 FDAAKKKLSIENSDLLR-QLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLG 447
+ K L E ++ ++ E E++++ L + + + +R ++ A L
Sbjct: 277 LNKKIKDLGEEEQLRVKEKIGELEAEIASLER-------SIAEKERELEDAEERLAKLEA 329
Query: 448 KFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKS 507
+ L +I+ + ++EEE + + L + ++ + + R + E AE +E K
Sbjct: 330 EIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKD 389
Query: 508 EERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNLA 558
+++ L E++ ++E EL RL + + + + L
Sbjct: 390 YREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELE 440
Score = 45.1 bits (107), Expect = 1e-04
Identities = 71/303 (23%), Positives = 124/303 (40%), Gaps = 39/303 (12%)
Query: 123 QDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAV 182
+ + + E L ++ +KL +E L+K +E++E +E+ +KI+ L +E
Sbjct: 233 EALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIK---DLGEEEQ 289
Query: 183 ADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAER 242
++ ELE I ++ IA +LED + + K + +I +L A IEELE E+E ER
Sbjct: 290 LRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER 349
Query: 243 QSRAK-------AEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLE 295
+ R K +++ DL ELEE+ + E E+ R LE
Sbjct: 350 KRRDKLTEEYAELKEELEDLRAELEEVDKEFAET--------------RDELKDYREKLE 395
Query: 296 EANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLA 355
+ + L D+L + + E + A ++L
Sbjct: 396 KLKREINELKREL---------------DRLQEELQRLSEELADLNAAIAGIEAKINELE 440
Query: 356 REKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVS 415
EK K+ + +L ++ L + + L D ++ E S L R+L EAE+Q
Sbjct: 441 EEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQAR 500
Query: 416 QLS 418
Sbjct: 501 ASE 503
Score = 44.3 bits (105), Expect = 2e-04
Identities = 68/352 (19%), Positives = 147/352 (41%), Gaps = 26/352 (7%)
Query: 203 IASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEEL 262
+A K E + + ++ I+ L I+E +++E R+ R KAE+ +A L + E
Sbjct: 165 VAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYE 224
Query: 263 GERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQ 322
G L + A Q E E +++ L +LE+ L + + E+ +
Sbjct: 225 GYELLKEKEALERQKE---AIERQLASLEEELEK--------LTEEISELEKRLEEIEQL 273
Query: 323 IDQLN-KLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDE 381
+++LN K+K E E+ + A L R A E+ + + +L +++ ++D+
Sbjct: 274 LEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDK 333
Query: 382 TNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRE 441
+ + + R++EE + +L++ L +LED + +E +E
Sbjct: 334 LLAEIEELE--------------REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKE 379
Query: 442 RATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEE 501
A + ++ ++ ++ ++ E LQ +L + + + E
Sbjct: 380 FAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINEL 439
Query: 502 LEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRE 553
EE + + + +L+ + Y EL+ LK Y+ +++L ++RE
Sbjct: 440 EEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRE 491
Score = 42.4 bits (100), Expect = 7e-04
Identities = 51/274 (18%), Positives = 115/274 (41%), Gaps = 26/274 (9%)
Query: 26 RSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESSGWV 85
+ + K+ + + + L +LE+ ++S+L K + LE+ + +
Sbjct: 230 KEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKK--------I 281
Query: 86 EELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKKK 145
++L + + K +L+ I+ + ++ ++ EDA +L K
Sbjct: 282 KDLGEEEQLRVKEKIGELEA-----------EIASLERSIAEKERELEDAEERL----AK 326
Query: 146 LEQENGGLKKDIEDLEGDVE---KQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKE 202
LE E L +IE+LE ++E K++ K+ + +E + DL +E+++ E
Sbjct: 327 LEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE 386
Query: 203 IASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEEL 262
+ KLE + + + ++++ LQ ++ L EE+ + A E + +L E E+
Sbjct: 387 LKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDK 446
Query: 263 GERLEEAGGATSAQIELNKKREAEMSKLRRDLEE 296
+++ K E E+ L+ + +
Sbjct: 447 ALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDR 480
Score = 41.2 bits (97), Expect = 0.002
Identities = 52/284 (18%), Positives = 112/284 (39%), Gaps = 24/284 (8%)
Query: 281 KKREAEMSKLRRDLEEANIQHESTLANLRKKHN------DAVSEMGEQIDQLNKLKTKAE 334
+ AE+ +LR LE + S + LR+ N +S+ +I ++ K + E
Sbjct: 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE 729
Query: 335 HERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKK 394
E L +E + K+L+ ++ E++ L + +LND +A
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLS 789
Query: 395 KLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEH 454
I +L + E +VS++ + +L + +E L + +L+
Sbjct: 790 HSRIPEIQ--AELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKE 847
Query: 455 DIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSE-ERRVD 513
I +I +++E K +L+ +L + A +LE + ++ D
Sbjct: 848 QIKSIEKEIENLNGKKEELEEELEELEA---------------ALRDLESRLGDLKKERD 892
Query: 514 DLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNL 557
+L A+L +++ ++ + + E + +LEA+ E +
Sbjct: 893 ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI 936
Score = 39.3 bits (92), Expect = 0.005
Identities = 44/255 (17%), Positives = 106/255 (41%), Gaps = 34/255 (13%)
Query: 1 AATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 60
+KI + + V+ K+ E + + + + E D +L + E+++ +L
Sbjct: 276 ELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLL 335
Query: 61 KIKISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISL 120
L ++E+ ++ D+ EE + E L++
Sbjct: 336 AEIEELEREIEEERKRRDK----LTEEYAELKEELEDLRA-------------------- 371
Query: 121 YSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEK---QKRKIEGDLKL 177
L++ + + + K ++ LK++I +L+ ++++ + +++ +L
Sbjct: 372 -------ELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELAD 424
Query: 178 TQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEE 237
A+A +E ELE+ + K EI KLE + K ++++ +L+ + +E+E
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKE 484
Query: 238 VEAERQSRAKAEKQR 252
+ ++ A+AE Q
Sbjct: 485 LSKLQRELAEAEAQA 499
Score = 35.4 bits (82), Expect = 0.081
Identities = 60/333 (18%), Positives = 128/333 (38%), Gaps = 54/333 (16%)
Query: 229 ARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLE----EAGGATSAQIELNKKRE 284
A + E + + E + + E+ L ++E ++LE E A Q L +KRE
Sbjct: 163 AGVAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKRE 222
Query: 285 AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNET 344
E +L ++ E Q E+ Q+ L + K E +
Sbjct: 223 YEGYELLKEKEALERQKEAIER---------------QLASLEEELEKLTEEISELEKRL 267
Query: 345 NNTRAAADQLARE-KAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDL 403
++L ++ K E+ +++ ++ E++ ++ RS+ ++
Sbjct: 268 EEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSI--------------AEK 313
Query: 404 LRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQL 463
R+LE+AE ++++L L ++E+ +R +EE + R L ++ L+ +++++R +L
Sbjct: 314 ERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEL 373
Query: 464 EEEAEGKADLQRQLSKANADAQLWRTK---YESEGVARAEELEESKSEERRVDDLAAELD 520
EE + R + Y + E+ E K E R+ + L
Sbjct: 374 EEVD--------------KEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLS 419
Query: 521 ASQKECRNYSTELFRLKGAYEEAQEQLEAVRRE 553
+ + ++ E +E+ E E
Sbjct: 420 EELADLNA---AIAGIEAKINELEEEKEDKALE 449
Score = 31.6 bits (72), Expect = 1.6
Identities = 47/211 (22%), Positives = 91/211 (43%), Gaps = 19/211 (9%)
Query: 367 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTT 426
+L+ +L ++ +L L + +LS E SD R++ E E ++ QL + + L
Sbjct: 678 RLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKE 737
Query: 427 QLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKA---NAD 483
+LE+ + +E + + + LE I+ + E L + E DL+ +LS +
Sbjct: 738 RLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQ 797
Query: 484 AQLWRTKYE-SEGVARAEELE---------------ESKSEERRVDDLAAELDASQKECR 527
A+L + + E S AR E+E E + + + DL ++ + +KE
Sbjct: 798 AELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIE 857
Query: 528 NYSTELFRLKGAYEEAQEQLEAVRRENKNLA 558
N + + L+ EE + L + +L
Sbjct: 858 NLNGKKEELEEELEELEAALRDLESRLGDLK 888
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 51.2 bits (123), Expect = 1e-06
Identities = 84/419 (20%), Positives = 174/419 (41%), Gaps = 16/419 (3%)
Query: 146 LEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIAS 205
LE E L ++IE E E+ + + + E + + LE I+ + IA
Sbjct: 211 LESELAELDEEIERYEEQREQARETRD-EADEVLEEHEERREELETLEAEIEDLRETIAE 269
Query: 206 LTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGER 265
+ E+ V +++++EL+ ++L E + E +R +L EEL +R
Sbjct: 270 TEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDR 329
Query: 266 LEE---AGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQ 322
LEE A A + + E ++ ++ + +L E + ES L R +AV + E+
Sbjct: 330 LEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAR----EAVEDRREE 385
Query: 323 IDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDET 382
I++L + + A + N ++L E+ + +L+ L + +++E
Sbjct: 386 IEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEA 445
Query: 383 NRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRER 442
L + +E S + +EE +V +L L ++E+ + E +
Sbjct: 446 EALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEE-RLERAEDL 504
Query: 443 ATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEEL 502
+ LE +++ E + E E + + + + A + E+E + E
Sbjct: 505 VEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAA----ELEAEAEEKREAA 560
Query: 503 EESKSEERRVDDLAAELDASQKECRNYSTELFRLK---GAYEEAQEQLEAVRRENKNLA 558
E++ E + AEL++ E + L R++ A +A++++E +R + + LA
Sbjct: 561 AEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALA 619
Score = 44.6 bits (106), Expect = 1e-04
Identities = 88/405 (21%), Positives = 162/405 (40%), Gaps = 19/405 (4%)
Query: 157 IEDLEGDVEKQKRKIEGDLKLTQEAVADLER---NKKELEQTIQRKDKEIASLTAKLEDE 213
E+LE E+ + ++E Q + E + +LE+ + +E A L ++LE+
Sbjct: 316 REELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEA 375
Query: 214 QGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGAT 273
+ V +++I+EL+ IEEL E AE +L E +EL ER E
Sbjct: 376 REAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAE----L 431
Query: 274 SAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKA 333
A + ++R E L EA E H + + E E++++L
Sbjct: 432 EATLRTARERVEE----AEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDL 487
Query: 334 EHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAK 393
E E + R + E+ + L+ + E + ++E +
Sbjct: 488 EEEVEEVEERLERAEDLVEAEDRIERLEER-REDLEELIAERRETIEEKRERAEELRERA 546
Query: 394 KKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLE 453
+L E + EAE + + + L ++L + K + R R TLL + E
Sbjct: 547 AELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIR-TLLAAIADAE 605
Query: 454 HDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVD 513
+I+ +RE+ E AE + + +L++ K E E +EE++ ++ R +
Sbjct: 606 DEIERLREKREALAELNDERRERLAEKR------ERKRELEAEFDEARIEEAREDKERAE 659
Query: 514 DLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNLA 558
+ +++ E R +L GA E E+LE +R + L
Sbjct: 660 EYLEQVEEKLDELREERDDLQAEIGAVENELEELEELRERREALE 704
Score = 30.4 bits (69), Expect = 2.8
Identities = 44/236 (18%), Positives = 95/236 (40%), Gaps = 13/236 (5%)
Query: 332 KAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDA 391
K E + + N + A D+ + A++ + + +EV + +E L +A
Sbjct: 199 KEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEA 258
Query: 392 AKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRN 451
+ L ++ R+ EE +V L + L + +D A + + + +
Sbjct: 259 EIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREE 318
Query: 452 LEHDIDNIREQLEE----------EAEGKADLQRQLSKANADAQLWRTKYESEGVARAEE 501
LE + +R++LEE EAE + L + + + + ESE E
Sbjct: 319 LEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREA 378
Query: 502 LEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNL 557
+E+ + E +++L E++ ++ + +L + EE +E+ + +R L
Sbjct: 379 VEDRREE---IEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAEL 431
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 47.1 bits (112), Expect = 2e-05
Identities = 83/433 (19%), Positives = 178/433 (41%), Gaps = 5/433 (1%)
Query: 127 DRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLE 186
+ L++E + + + + L +E L + ++ LE +EK + K+E +E +
Sbjct: 294 EELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN 353
Query: 187 RNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRA 246
K LE+ ++ ++ + L +LE + + ++ I+EL+ + EL +E ++
Sbjct: 354 ELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELE 413
Query: 247 KAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLA 306
+ EK+ +L RELEEL E +++ QI + +E +++L E+ + +
Sbjct: 414 ELEKELEELERELEELEEEIKKL----EEQINQLESKELMIAELAGAGEKCPVCGQELPE 469
Query: 307 NLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAK 366
K+ + E++++ + + R R ++L E + +
Sbjct: 470 EHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKE 529
Query: 367 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIEN-SDLLRQLEEAESQVSQLSKIKISLT 425
+L+ +L +++ L+E + K + D L++L+E ++ L K L
Sbjct: 530 ELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELE 589
Query: 426 TQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQ 485
E K L + L L+ + E EEAE + + + + A+ +
Sbjct: 590 ELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELE 649
Query: 486 LWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQE 545
E + EELE E + + +L+ +E EL +L+ EE +
Sbjct: 650 ELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLK 709
Query: 546 QLEAVRRENKNLA 558
+L + + + L
Sbjct: 710 KLGEIEQLIEELE 722
Score = 46.3 bits (110), Expect = 4e-05
Identities = 98/569 (17%), Positives = 228/569 (40%), Gaps = 26/569 (4%)
Query: 4 EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIK 63
++++ L+ + ++ +E + +++ E + LEE +++ + +
Sbjct: 192 GQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERL 251
Query: 64 ISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQ 123
L +L + + L E EEL + L+ + E +
Sbjct: 252 EELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLR 311
Query: 124 DTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVA 183
++ L++ + L ++ +KLE++ L+ ++E+L + + + +E LK +E +
Sbjct: 312 ALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLE 371
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
+LE+ ++ + +++ ++ I L +L + + + Q++++EL+ +EELE E+E +
Sbjct: 372 ELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEE 431
Query: 244 SRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHES 303
K E+Q L E + E GA ++ E K +L E ++
Sbjct: 432 EIKKLEEQINQL----ESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELE 487
Query: 304 TLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEK 363
+ K+ + E+ E +L +L+ + ++L E+
Sbjct: 488 EELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEE 547
Query: 364 IAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENS------DLLRQLEEAESQVSQL 417
+ ++LQ L +++ +L + L + ++L + + +L +L+E + ++ +L
Sbjct: 548 LKEKLQ--LQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKEL 605
Query: 418 SKIKISLTTQLE----DTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADL 473
+ L L+ EE E + NL+ +++ + + EE E K +
Sbjct: 606 EERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEE 665
Query: 474 QRQLSKANADAQLWRTKYESEGV----------ARAEELEESKSEERRVDDLAAELDASQ 523
+ + E + EELEE + ++ L EL++ +
Sbjct: 666 LEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRK 725
Query: 524 KECRNYSTELFRLKGAYEEAQEQLEAVRR 552
E EL +L+ A E +E E + +
Sbjct: 726 AELEELKKELEKLEKALELLEELREKLGK 754
Score = 36.7 bits (85), Expect = 0.036
Identities = 56/313 (17%), Positives = 132/313 (42%), Gaps = 18/313 (5%)
Query: 128 RLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLER 187
L++ E L + K+ + + L+ + +L D+E +E +LK ++ E
Sbjct: 165 GLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKL---EEI 221
Query: 188 NKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAK 247
+++ E+ ++++ + + A+LE+E+ + + + ++ E+++ E + E E + +
Sbjct: 222 QEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELER 281
Query: 248 AEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLAN 307
++ + LEEL +EE E + R L E + L +
Sbjct: 282 LLEELEEKIERLEELEREIEE---------------LEEELEGLRALLEELEELLEKLKS 326
Query: 308 LRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQ 367
L ++ ++ + +L +L + ++L +E + KQ
Sbjct: 327 LEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQ 386
Query: 368 LQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQ 427
L+ + E++ +L E + +L + ++L E +L R+LEE E ++ +L + L ++
Sbjct: 387 LEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLESK 446
Query: 428 LEDTKRLADEEGR 440
LA +
Sbjct: 447 ELMIAELAGAGEK 459
Score = 36.7 bits (85), Expect = 0.042
Identities = 63/340 (18%), Positives = 123/340 (36%), Gaps = 41/340 (12%)
Query: 220 NQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIEL 279
K+ KE+ + LE+ K + ++ +E + E LE IE
Sbjct: 152 KPKERKEILDELFGLEKY--------EKLSELLKEVIKEAKAKIEELEGQLSELLEDIED 203
Query: 280 NKKREAEMSKLRRDLEEAN-IQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERN 338
+ E K + LEE Q E L + + ++E+ E+ ++L +LK + +
Sbjct: 204 LLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIES 263
Query: 339 AAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSI 398
+L R E+ ++L+ E+ ++
Sbjct: 264 LELEALKIREEELRELERLLEELEEKIERLEELEREI---------------EELEEELE 308
Query: 399 ENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDN 458
LL +LEE ++ L + L +LE + +E E+ L +++
Sbjct: 309 GLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEE 368
Query: 459 IREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAE 518
E+LE+E E + +QL + A E EE ++++ E
Sbjct: 369 RLEELEKELEKALERLKQLEE-----------------AIQELKEELAELSAALEEIQEE 411
Query: 519 LDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNLA 558
L+ +KE EL L+ ++ +EQ+ + + +A
Sbjct: 412 LEELEKELEELERELEELEEEIKKLEEQINQLESKELMIA 451
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 46.6 bits (111), Expect = 3e-05
Identities = 101/467 (21%), Positives = 196/467 (41%), Gaps = 71/467 (15%)
Query: 153 LKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQT---IQRKDKEIASLTAK 209
L K+ E +V ++ +I +L +E + LE+ KELE+ I+ +KE+ SL
Sbjct: 194 LIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGS 253
Query: 210 LEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRA---------DLARELE 260
+ + + +++I+EL+ IEELEE+V+ ++ + KAE+ D RE+E
Sbjct: 254 KRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIE 313
Query: 261 ELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMG 320
+ RLEE +I+ +++E + +L++ L+E E L L ++H + E
Sbjct: 314 KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKEL----EKRLEELEERH-ELYEEAK 368
Query: 321 EQIDQLNKLKT--------KAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQL 372
+ ++L +LK K E E ++ +K K+L+ +
Sbjct: 369 AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAI 428
Query: 373 NEVQ---GKLDETNRSLNDFDAA--KKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQ 427
E++ GK R L + ++ + E + ++L+E E + +L K L
Sbjct: 429 EELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKV 488
Query: 428 LEDTKRL------------------------ADEEGRERATLLGKFRNLEHDIDNIREQL 463
L+ L +++ E L K L+ +I +++++L
Sbjct: 489 LKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKEL 548
Query: 464 EEEAE---GKADLQRQLSKANADAQLWRTKYESEGVARAEELEES--------------K 506
E+ E A+L+++L + + + E G EELEE K
Sbjct: 549 EKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELK 608
Query: 507 SEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRE 553
E+ ++ EL ++E EL + EE +++LE + ++
Sbjct: 609 DAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655
Score = 43.9 bits (104), Expect = 2e-04
Identities = 87/412 (21%), Positives = 188/412 (45%), Gaps = 28/412 (6%)
Query: 164 VEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQ 223
++++ ++E +K T+ ++ +KELE+ + R+ EI+S +L +E + K K+
Sbjct: 174 IKRRIERLEKFIKRTENIEELIKEKEKELEEVL-REINEISSELPELREELEKLEKEVKE 232
Query: 224 IKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKR 283
++EL+ IEELE+E+E+ S+ K E++ +L +EEL + +EE EL +K
Sbjct: 233 LEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKA 292
Query: 284 EA--EMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAH 341
E ++S+ + + + E L+ L ++ ++ + E+I +L + + + E +
Sbjct: 293 EEYIKLSEFYEEYLDELREIEKRLSRLEEE----INGIEERIKELEEKEERLEELK---- 344
Query: 342 NETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKL-----DETNRSLNDFDAAKKKL 396
+ ++L E+ + AK + +L ++ +L ++ + L + + AK+++
Sbjct: 345 KKLKELEKRLEEL-EERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEI 403
Query: 397 SIENSDLLRQLEEAESQVSQLSKIKISLTT---QLEDTKRLADEEGRER--ATLLGKFRN 451
E S + ++ E + ++ +L K L + R EE R+ + +
Sbjct: 404 EEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKR 463
Query: 452 LEHDIDNIREQLEE------EAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEES 505
+E ++ I E+ + E E + +L K A+ + E EELE+
Sbjct: 464 IEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKK 523
Query: 506 KSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNL 557
E ++ + +L K + +L LK E +++L+ + E L
Sbjct: 524 AEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAEL 575
Score = 38.5 bits (90), Expect = 0.009
Identities = 110/541 (20%), Positives = 225/541 (41%), Gaps = 42/541 (7%)
Query: 36 KKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESSGWVEELTKGSIEA 95
++S E +L +LE+ E +V +L ++K + ++ + L E S +EE + E
Sbjct: 210 NEISSELPELREELEKLEKEVKELEELKEEIEELEKELESL--EGSKRKLEEKIRELEER 267
Query: 96 ITLKSLDLDVLNFTTENSHSFSISLYSQDTVDRLQQE-EDARNQLFQQKKKLEQENGGLK 154
I +++ L + + +E D ++ ++ +LE+E G++
Sbjct: 268 IEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIE 327
Query: 155 KDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQ 214
+ I++LE E+ +++ LK ++ + +LE + E + K +E+ L +L
Sbjct: 328 ERIKELEEKEER-LEELKKKLKELEKRLEELEERHELYE-EAKAKKEELERLKKRLTGLT 385
Query: 215 GLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEEL----------GE 264
K +K+++EL+ EE+EEE+ + +K+ +L + +EEL G
Sbjct: 386 --PEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGR 443
Query: 265 RLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRK-----KHNDAVSEM 319
L E + EL ++ AE+ ++ ++L+E + LR+ K + ++
Sbjct: 444 ELTE-----EHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKL 498
Query: 320 GEQIDQLNKLKTKAE----HERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEV 375
E +QL +L+ K + E E + +L E + +K ++L E+
Sbjct: 499 KELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKL----EEL 554
Query: 376 QGKLDETNRSLNDFDAAKKKLSIENSDL-LRQLEEAESQVSQLSKIK---ISLTTQLEDT 431
+ KL E + L++ + +L E +L +EE E ++ +L + L ++
Sbjct: 555 KKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKEL 614
Query: 432 KRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKY 491
+R E + L F L + + E+L +E E + + +
Sbjct: 615 EREEKELKKLEEELDKAFEELA-ETEKRLEELRKELEELEKKYSEEEYEELREEY--LEL 671
Query: 492 ESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVR 551
E ELEE + + +L +E EL +L+ A E +E E V+
Sbjct: 672 SRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVK 731
Query: 552 R 552
+
Sbjct: 732 K 732
Score = 31.2 bits (71), Expect = 2.0
Identities = 87/430 (20%), Positives = 180/430 (41%), Gaps = 32/430 (7%)
Query: 21 LDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEE 80
+E L + + +L E + + +++E E + +L ++K L + + L
Sbjct: 302 YEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL---- 357
Query: 81 SSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDRLQQEEDARNQLF 140
E + + L+ L + T E L++ E A+ ++
Sbjct: 358 -EERHELYEEAKAKKEELERLKKRLTGLTPE------------KLEKELEELEKAKEEIE 404
Query: 141 QQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDL-----KLTQEAVADLERNKKELEQT 195
++ K+ G LKK+I++L+ +E+ K K +G +LT+E +L +
Sbjct: 405 EEISKITARIGELKKEIKELKKAIEELK-KAKGKCPVCGRELTEEHRKELLEEYTAELKR 463
Query: 196 IQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADL 255
I+++ KEI KL E + K K+ EL E E+ E E + + ++
Sbjct: 464 IEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKK 523
Query: 256 ARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDA 315
A E E+L E+L + G + ++ ++ E+ K +LE+ + E LA L K+ +
Sbjct: 524 AEEYEKLKEKLIKLKGEIKS-LKKELEKLEELKKKLAELEKKLDELEEELAELLKELEEL 582
Query: 316 VSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLN-- 373
E E++++ K +E + +L + + +K ++L
Sbjct: 583 GFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRL 642
Query: 374 -EVQGKLDETNRSLNDFDAAKK-----KLSIENSDLLRQLEEAESQVSQLSKIKISLTTQ 427
E++ +L+E + ++ + + +LS E + L +LEE E + ++ K L +
Sbjct: 643 EELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEE 702
Query: 428 LEDTKRLADE 437
LE+ ++ E
Sbjct: 703 LEEREKAKKE 712
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 46.7 bits (110), Expect = 4e-05
Identities = 76/430 (17%), Positives = 178/430 (41%), Gaps = 16/430 (3%)
Query: 130 QQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNK 189
+ EE + ++K + ++ KK E+ + + K+K E K + A A+ E
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAA 1356
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAE 249
E E ++ + + K +++ K +A+ + E++ +A+ +A A
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAA 1416
Query: 250 KQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLR 309
K++AD E ++ EE A A+ + + ++A+ + ++ EEA E+
Sbjct: 1417 KKKAD------EAKKKAEEKKKADEAKKKAEEAKKAD--EAKKKAEEAKKAEEAKKKAEE 1468
Query: 310 KKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQ 369
K D + E+ + ++ K KAE + A A +KA K A + +
Sbjct: 1469 AKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAK 1528
Query: 370 HQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQ-------VSQLSKIKI 422
+ + D KK ++ ++ ++ EEA+ + + + K
Sbjct: 1529 KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK 1588
Query: 423 SLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANA 482
+ ++E+ +L +EE + +A K + + +++ EEE + L+++ ++
Sbjct: 1589 AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA-EEEKKKVEQLKKKEAEEKK 1647
Query: 483 DAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEE 542
A+ + E + AEE ++++ ++++ ++ + +K E K A E
Sbjct: 1648 KAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEEL 1707
Query: 543 AQEQLEAVRR 552
+++ E ++
Sbjct: 1708 KKKEAEEKKK 1717
Score = 46.3 bits (109), Expect = 5e-05
Identities = 102/453 (22%), Positives = 190/453 (41%), Gaps = 34/453 (7%)
Query: 130 QQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGD---VEKQKRKIEGDLKLTQEAVADLE 186
++ E+A+ K + E + E E E+ K+K + K +E E
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395
Query: 187 RNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEEL----EEEVEAER 242
KK E + + + A+ K DE + +K+ E + + EE E + +AE
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455
Query: 243 QSRAKAEKQRADLARELEELGERLEEAGGATSA--QIELNKKREAEMSKLR--------- 291
+A+ K++A+ A++ +E ++ EEA A A + E KK+ E K
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515
Query: 292 RDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAA 351
+ EEA E+ A KK ++A + E+ + ++LK KAE + A + A
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEA--KKAEEKKKADELK-KAEELKKAEEKKKAEEAKKA 1572
Query: 352 DQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAE 411
++ + AK+ + E KL E + + +A K + + ++ L++ EE +
Sbjct: 1573 EEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK 1632
Query: 412 SQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKA 471
+V QL K + + E+ K+ +E + A K + + ++ E+E +
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
Query: 472 DLQRQLSKANADAQLWRTKYESEGVARAEEL-----------EESKSEERRVDDLAAELD 520
L+++ +A +L K E+E +AEEL EE+K E A E
Sbjct: 1693 ALKKEAEEAKKAEEL--KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750
Query: 521 ASQKECRNYSTELFRLKGAYEEAQEQLEAVRRE 553
++E + + + EE +++ EAV E
Sbjct: 1751 KDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEE 1783
Score = 41.3 bits (96), Expect = 0.002
Identities = 78/396 (19%), Positives = 166/396 (41%), Gaps = 21/396 (5%)
Query: 127 DRLQQEEDARNQLFQQKKKLEQENGG--LKKDIEDLEGDVE-----KQKRKIEGDLKLTQ 179
D L++ A+ + + KKK E++ KK E+ + E ++ +K E K +
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELE-EEV 238
EA E KK E + K+ A K DE + +K+ E + E + +E
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
Query: 239 EAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEAN 298
+ +++ E ++A+ ++ +EL + E + E KK E + + R EEA
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587
Query: 299 IQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREK 358
E+ + + K + + E+ + + K KAE + A + +QL +++
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA-----EEEKKKVEQLKKKE 1642
Query: 359 AATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 418
A +K A++L+ K +E N+ +A K + + ++ ++ EE E + ++
Sbjct: 1643 AEEKKKAEELK--------KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEAL 1694
Query: 419 KIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLS 478
K + + E+ K+ EE ++ L + + +++ EE+ + + ++
Sbjct: 1695 KKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEE 1754
Query: 479 KANADAQLWRTKYESEGVARAEELEESKSEERRVDD 514
+ A L + + + R E+ + E D+
Sbjct: 1755 EKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDE 1790
Score = 34.3 bits (78), Expect = 0.19
Identities = 43/175 (24%), Positives = 82/175 (46%), Gaps = 8/175 (4%)
Query: 127 DRLQQEEDARNQLFQQKKKLEQENGG---LKKDIEDLEGDVEKQKRKIEGDLKLTQEAVA 183
+ L++ E+ + ++ Q KKK +E LKK E+ + ++ +K E D K +EA
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
Query: 184 DLERNKKELEQTIQRKD--KEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAE 241
E KK E + + K+ L K +E+ + +K +E + + EE ++E E +
Sbjct: 1683 AEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742
Query: 242 RQSRAKA---EKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRD 293
++ +A E+++ +A +E ++ EE A IE E E ++ D
Sbjct: 1743 KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
Score = 34.0 bits (77), Expect = 0.26
Identities = 71/366 (19%), Positives = 156/366 (42%), Gaps = 35/366 (9%)
Query: 126 VDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADL 185
+ ++ E+ +N ++ ++ ++ + IE++ E++K+ + K +EA
Sbjct: 1566 AEEAKKAEEDKNMALRKAEEAKKAE---EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA 1622
Query: 186 ERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSR 245
E KK E+ + + + K + E+ + + +IK + + E++ +AE +
Sbjct: 1623 EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
Query: 246 AKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTL 305
A+ ++++A E L + EEA + E KK+EAE K +L++A +++
Sbjct: 1683 AEEDEKKAA-----EALKKEAEEA-----KKAEELKKKEAEEKKKAEELKKAEEENKIKA 1732
Query: 306 ANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIA 365
+K+ + + E K E E+ + A+++ +EK A I
Sbjct: 1733 EEAKKEAEEDKKKAEEA--------KKDEEEKKKIAHLKKEEEKKAEEIRKEKEAV--IE 1782
Query: 366 KQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLT 425
++L + + + ++D+ + + D A + E + ++ +E E S + ++ S
Sbjct: 1783 EELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEME--DSAIKEVADSKN 1840
Query: 426 TQLEDTKRL----------ADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQR 475
QLE+ E+G + A + E D + I E E E K D++R
Sbjct: 1841 MQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIEKIDKDDIER 1900
Query: 476 QLSKAN 481
++ N
Sbjct: 1901 EIPNNN 1906
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 43.6 bits (103), Expect = 6e-05
Identities = 30/117 (25%), Positives = 62/117 (52%), Gaps = 3/117 (2%)
Query: 174 DLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEE 233
L+ Q +++L++ EL++ ++ ++ IA L ++LED + + ++ +K L+ R+E
Sbjct: 73 KLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLES 132
Query: 234 LEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 290
LEE ++ + + R DL E+EEL E LE E ++ ++ + +L
Sbjct: 133 LEESIKE---LAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186
Score = 35.5 bits (82), Expect = 0.037
Identities = 28/133 (21%), Positives = 57/133 (42%), Gaps = 14/133 (10%)
Query: 197 QRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLA 256
+ +++ L +L + + + + Q+++++L+ RI ELE E+E ++ + L
Sbjct: 68 SVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLE 127
Query: 257 RELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAV 316
LE L E ++E E+ +LR+DL E + L L++
Sbjct: 128 ERLESLEESIKE--------------LAKELRELRQDLREEVEELREELERLQENLQRLQ 173
Query: 317 SEMGEQIDQLNKL 329
+ E L +L
Sbjct: 174 EAIQELQSLLEQL 186
Score = 30.5 bits (69), Expect = 1.5
Identities = 23/112 (20%), Positives = 47/112 (41%)
Query: 352 DQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAE 411
++L E + ++ +LQ +L +++ ++ E L D + L L +LE E
Sbjct: 75 EELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLE 134
Query: 412 SQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQL 463
+ +L+K L L + EE L + + ++ ++ EQL
Sbjct: 135 ESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186
Score = 30.1 bits (68), Expect = 1.9
Identities = 22/109 (20%), Positives = 45/109 (41%)
Query: 235 EEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDL 294
+++E + ++ ++Q ++L ELE+L ER+ E E + + L L
Sbjct: 71 PQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERL 130
Query: 295 EEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNE 343
E + LR+ D E+ E ++L +L+ + + A
Sbjct: 131 ESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQEL 179
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 44.0 bits (105), Expect = 1e-04
Identities = 41/160 (25%), Positives = 79/160 (49%), Gaps = 12/160 (7%)
Query: 132 EEDARNQLFQQKKKLEQENGGLKKDIEDLEG--DVEKQKRKIEGDLKLTQEAVADLERNK 189
EE+A+ L + KK+ E +KK+ LE ++ K + + E +L+ + + LE+
Sbjct: 37 EEEAKRILEEAKKEAEA----IKKEAL-LEAKEEIHKLRNEFEKELRERRNELQKLEKRL 91
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA--ERQSRAK 247
+ E+ + RK + + +LE ++ + + Q+++++ + +EEL EE ER S
Sbjct: 92 LQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLT 151
Query: 248 AEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 287
AE+ + L LE++ E +IE K EA+
Sbjct: 152 AEEAKEIL---LEKVEEEARHEAAVLIKEIEEEAKEEADK 188
Score = 39.4 bits (93), Expect = 0.005
Identities = 35/150 (23%), Positives = 66/150 (44%), Gaps = 20/150 (13%)
Query: 219 KNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIE 278
+ K+ L+A+ E + E E++ R + EL++L +RL + ++E
Sbjct: 52 EAIKKEALLEAKEEIHKLRNEFEKELRERR--------NELQKLEKRLLQKEENLDRKLE 103
Query: 279 LNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQL----------NK 328
L +KRE E+ K ++LE+ + E L + + + E+ E+I L K
Sbjct: 104 LLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL-ERISGLTAEEAKEILLEK 162
Query: 329 LKTKAEHERNA-AHNETNNTRAAADQLARE 357
++ +A HE + AD+ A+E
Sbjct: 163 VEEEARHEAAVLIKEIEEEAKEEADKKAKE 192
Score = 35.1 bits (82), Expect = 0.089
Identities = 33/121 (27%), Positives = 67/121 (55%), Gaps = 20/121 (16%)
Query: 128 RLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLER 187
RL Q+E+ ++ + +K E+E +K++E + ++EK++ ++E +L +E + +LER
Sbjct: 90 RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELE---ELIEEQLQELER 146
Query: 188 NKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIK-ELQARIEELEEEVEAERQSRA 246
I+ LTA+ E ++ L+ K +++ + E I+E+EEE + E +A
Sbjct: 147 ---------------ISGLTAE-EAKEILLEKVEEEARHEAAVLIKEIEEEAKEEADKKA 190
Query: 247 K 247
K
Sbjct: 191 K 191
Score = 34.8 bits (81), Expect = 0.11
Identities = 26/108 (24%), Positives = 52/108 (48%), Gaps = 3/108 (2%)
Query: 366 KQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLT 425
+LQ + K + +R L + +++L + +L ++ +E E + +L ++
Sbjct: 82 NELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQL 141
Query: 426 TQLEDTKRLADEEGRERATLLGKFRN-LEHDIDNIREQLEEEAEGKAD 472
+LE L EE +E LL K H+ + +++EEEA+ +AD
Sbjct: 142 QELERISGLTAEEAKEI--LLEKVEEEARHEAAVLIKEIEEEAKEEAD 187
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 43.2 bits (102), Expect = 3e-04
Identities = 44/211 (20%), Positives = 93/211 (44%), Gaps = 8/211 (3%)
Query: 163 DVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQK 222
+KQ ++I+ ++ ++ + + + + +LE+ ++ + EIASL A+L + + K +K
Sbjct: 35 ADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRK 94
Query: 223 QIKELQARIEELEEEVEAERQSRAK--AEKQRADLARELEELGERLEEAGGATSAQI--E 278
QI +L AR+ LE + +R+ A+ A QR+ L + A
Sbjct: 95 QIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGA 154
Query: 279 LNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERN 338
LN R + L+ L++ + +A + + +SE Q +L +L E ++
Sbjct: 155 LNPARAERIDALKATLKQLAAV-RAEIAAEQAELTTLLSEQRAQQAKLAQL--LEERKKT 211
Query: 339 AAHNETNNTRAAADQLAREKAATEKIAKQLQ 369
A + A +L +A ++ ++
Sbjct: 212 LAQLNS-ELSADQKKLEELRANESRLKNEIA 241
Score = 41.2 bits (97), Expect = 0.001
Identities = 48/263 (18%), Positives = 98/263 (37%), Gaps = 32/263 (12%)
Query: 220 NQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIEL 279
+ KQ+K++Q I LE+++ ++ RAK EKQ L E+ L +L E
Sbjct: 36 DDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIET---------- 85
Query: 280 NKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNA 339
++ KLR+ + + N + L L + EQ +L + A
Sbjct: 86 ----ADDLKKLRKQIADLN----ARLNALEVQE-------REQRRRLAEQ-------LAA 123
Query: 340 AHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIE 399
N A + + ++A ++D +L A + +++ E
Sbjct: 124 LQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAE 183
Query: 400 NSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNI 459
++L L E +Q ++L+++ L + ++ L L+++I +
Sbjct: 184 QAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASA 243
Query: 460 REQLEEEAEGKADLQRQLSKANA 482
+ E A + ++A A
Sbjct: 244 EAAAAKAREAAAAAEAAAARARA 266
Score = 40.9 bits (96), Expect = 0.002
Identities = 21/104 (20%), Positives = 40/104 (38%), Gaps = 1/104 (0%)
Query: 166 KQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIK 225
++ ++ LK A++ + EL + + + A L LE+ + + + ++
Sbjct: 161 ERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELS 220
Query: 226 ELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEA 269
Q ++EEL + A AE A ARE E
Sbjct: 221 ADQKKLEELRANESRLKNEIASAEAA-AAKAREAAAAAEAAAAR 263
Score = 37.8 bits (88), Expect = 0.012
Identities = 49/231 (21%), Positives = 90/231 (38%), Gaps = 15/231 (6%)
Query: 188 NKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAK 247
+ ++ +++ KEIA+L K+ ++Q K +KQ+K L+ I LE ++ K
Sbjct: 32 SAAADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKK 91
Query: 248 AEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLAN 307
KQ ADL L L + E + Q+ + S ++ +
Sbjct: 92 LRKQIADLNARLNALEVQEREQRRRLAEQL-----AALQRSGRNPPPALLVSPEDAQRSV 146
Query: 308 LRKKHNDAVS-EMGEQIDQLNK-LKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIA 365
+ A++ E+ID L LK A A + T ++Q A++
Sbjct: 147 RLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAK------ 200
Query: 366 KQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQ 416
L L E + L + N L+ ++L S L ++ AE+ ++
Sbjct: 201 --LAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAK 249
Score = 37.0 bits (86), Expect = 0.024
Identities = 46/255 (18%), Positives = 90/255 (35%), Gaps = 28/255 (10%)
Query: 1 AATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 60
AA +K KQ+Q ++ ++ K+ E + K L E + L QL E + +L
Sbjct: 34 AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93
Query: 61 KIKISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTEN-SHSFSIS 119
K +I+ + + + E + E L L + N + +S
Sbjct: 94 K-QIADLNARLNALEVQEREQRRRLAEQ--------------LAALQRSGRNPPPALLVS 138
Query: 120 LYSQDTVDRLQQEEDARNQ-LFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLT 178
RL A N ++ L+ L + +I +
Sbjct: 139 PEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAA-----------VRAEIAAEQAEL 187
Query: 179 QEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEV 238
+++ + +L Q ++ + K +A L ++L +Q + + + L+ I E
Sbjct: 188 TTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAA 247
Query: 239 EAERQSRAKAEKQRA 253
R++ A AE A
Sbjct: 248 AKAREAAAAAEAAAA 262
Score = 30.1 bits (68), Expect = 3.6
Identities = 40/222 (18%), Positives = 74/222 (33%), Gaps = 10/222 (4%)
Query: 339 AAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSI 398
A + + L ++ + +L+ QL ++ ++ L + KKL
Sbjct: 35 ADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRK 94
Query: 399 ENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEG------RERATLLGKFRNL 452
+ +DL +L E Q + + L QL +R E A +
Sbjct: 95 QIADLNARLNALEVQEREQRRR---LAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIY 151
Query: 453 EHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERR- 511
++ R + + + + A Q T SE A+ +L + E ++
Sbjct: 152 YGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKT 211
Query: 512 VDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRE 553
+ L +EL A QK+ RLK A+ R
Sbjct: 212 LAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREA 253
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 43.5 bits (103), Expect = 3e-04
Identities = 65/339 (19%), Positives = 127/339 (37%), Gaps = 54/339 (15%)
Query: 226 ELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREA 285
EL+ R+++ EE +++ + +AE+Q ELEE EA A K+
Sbjct: 604 ELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTAL-------KQARL 656
Query: 286 EMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQL-NKLKTKAEHERNAAHNET 344
++ +L+ + + + E +A +++ + ++ Q+ QL + + E ++
Sbjct: 657 DLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELR 716
Query: 345 NNTRAA--------ADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKL 396
A +QLA+ AA E Q + +L E++ +D L
Sbjct: 717 TERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELK----------KQYDRELASL 766
Query: 397 SIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKF---RNLE 453
++ + V +L + L T +E E RA + + +L
Sbjct: 767 DVD-----------PNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLR 815
Query: 454 HDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVD 513
+ N+ QL E +LQ++L++ D +L R K E E + ++
Sbjct: 816 EERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLE--------------QERKALE 861
Query: 514 DLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRR 552
+LD + R+ +L LK Q + R
Sbjct: 862 KQLDQLDELLRGLRDEMRQLAELKEPANANQAEGSISER 900
Score = 42.4 bits (100), Expect = 6e-04
Identities = 48/287 (16%), Positives = 99/287 (34%), Gaps = 24/287 (8%)
Query: 126 VDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADL 185
+ + AR L + + + + L+ I + + E Q R+++ LK E
Sbjct: 645 AEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAF 704
Query: 186 ERNKKE--LEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
K+ E +R K L + + +AR++EL+++ + E
Sbjct: 705 LEALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYDRELA 764
Query: 244 SRAKAEKQRADLARELEELGERLEEAGG-----------------ATSAQIELNKKREAE 286
S +L R++EEL +E + E +
Sbjct: 765 SLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQ 824
Query: 287 MSKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNKLKT--KAEHERNAAH 341
+ +L EE + RKK + +Q+DQL++L + E + A
Sbjct: 825 LRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDEMRQLAEL 884
Query: 342 NETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLND 388
E N A ++ E+ ++ + +++ L+ + + D
Sbjct: 885 KEPANANQAEGSISERLDQLEEFKRKRKRLSGDLKKFLERFDGVIKD 931
Score = 34.7 bits (80), Expect = 0.16
Identities = 42/229 (18%), Positives = 89/229 (38%), Gaps = 9/229 (3%)
Query: 194 QTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRA 253
+ IQ+ EI L E L + Q EL A E L EE E ++++ + +Q
Sbjct: 239 RAIQKVAPEIEKLQEDFEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLR 298
Query: 254 DLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANI----QHESTLANLR 309
L +L+E + L + A +A++ ++ + + E+A+I L ++R
Sbjct: 299 TLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIR 358
Query: 310 KKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQ 369
+ + + + + ++ K E + + R R A E+ +Q
Sbjct: 359 SELEEVEARLDALTGKHQDVQRKYERLKQKIKEQ--LERDLEKNNERLAAIREEKDRQKA 416
Query: 370 HQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 418
++Q + + L A K + + E +L +L + ++ +
Sbjct: 417 AIEEDLQALESQLRQQLE---AGKLEFNEEEYELELRLGRLKQRLDSAT 462
Score = 33.1 bits (76), Expect = 0.44
Identities = 85/470 (18%), Positives = 177/470 (37%), Gaps = 44/470 (9%)
Query: 12 HQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQV-SQLSKIKISLTTQL 70
+ E Q + + L +K E + L++ E Q+ L ++++ L+ Q
Sbjct: 479 EKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQA 538
Query: 71 EDTKRLADEESSGWVEELTKGSIEAITLKSLDLD-VLNFTTENSHSFSISLYSQ--DTVD 127
E+ GW E + K + L+ DLD L +++ + +SL Q D D
Sbjct: 539 GSLLHFLRNEAPGWEESIGKVISPEL-LERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPD 597
Query: 128 RLQQEEDARNQLFQQKKKL---EQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVAD 184
E + R +L Q ++ L + ++ + ++E+QKR EA
Sbjct: 598 YAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEA-------EARTA 650
Query: 185 LERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA-ERQ 243
L++ + +L++ + L + + + ++ + ++ +L+ +++ Q
Sbjct: 651 LKQARLDLQRLQNEQQSLKDKLELAIAERK----------QQAETQLRQLDAQLKQLLEQ 700
Query: 244 SRAKAEKQRADLARELEELGERLEEAGGATSAQI-ELNKKREAEMSKLRRDLEEANIQHE 302
+A E + D E + + G Q+ +L+ EA ++ + L+E Q++
Sbjct: 701 QQAFLEALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYD 760
Query: 303 STLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAA---HNETNNTRAAADQLAREKA 359
LA+ + V E+ QI++L + R T D L E+
Sbjct: 761 RELAS-LDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERP 819
Query: 360 A-------TEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAES 412
E A++LQ +L + + L A +K + +LLR L +
Sbjct: 820 NLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDEMR 879
Query: 413 QVSQLSKIKISLTTQLEDTKRLA------DEEGRERATLLGKFRNLEHDI 456
Q+++L + + + ++RL + R L + I
Sbjct: 880 QLAELKEPANANQAEGSISERLDQLEEFKRKRKRLSGDLKKFLERFDGVI 929
Score = 33.1 bits (76), Expect = 0.51
Identities = 65/411 (15%), Positives = 141/411 (34%), Gaps = 14/411 (3%)
Query: 156 DIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQG 215
+IE L+ D E+ L E VAD ER +E E+ + K++ L + +
Sbjct: 247 EIEKLQEDFEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKE 306
Query: 216 LVGKNQKQIKELQARIEELEEEVEA-ERQSRAKAEKQRADLARELEELGE---RLEEAGG 271
+ +++ A++ E+E E Q A + L +L++L LEE
Sbjct: 307 ARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSELEEVEA 366
Query: 272 ATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQ-LNKLK 330
A ++ + + +L++ ++E + + I++ L L+
Sbjct: 367 RLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALE 426
Query: 331 TKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFD 390
++ + A E N +L + + + E DE +L
Sbjct: 427 SQLRQQLEAGKLEFNE-EEYELELRLGRLKQRLDSATATPEELEQLEINDE---ALEKAQ 482
Query: 391 AAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGR----ERATLL 446
+++ L +L + + + + +L ++ DE + +LL
Sbjct: 483 EEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQAGSLL 542
Query: 447 GKFRNLEHDI-DNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEES 505
RN ++I + + E + DL QL + + L+ + + + +
Sbjct: 543 HFLRNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANE 602
Query: 506 KSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKN 556
R+ L ++ + + +L + EE + R K
Sbjct: 603 TELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQ 653
Score = 32.7 bits (75), Expect = 0.58
Identities = 96/471 (20%), Positives = 178/471 (37%), Gaps = 71/471 (15%)
Query: 134 DARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERN-KKEL 192
D + + + K+K++++ ++D+E + + + + +E + LE +++L
Sbjct: 377 DVQRKYERLKQKIKEQL---ERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQL 433
Query: 193 EQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQR 252
E ++E L +L + + + E ++E +E +E ++ + +AE
Sbjct: 434 EAGKLEFNEEEYELELRLGRLKQRL-DSATATPEELEQLEINDEALEKAQEEQEQAEANV 492
Query: 253 ADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQ----HESTLANL 308
L EL +L +R +EA +E ++ E + +LR+ L+E +Q S L L
Sbjct: 493 EQLQSELRQLRKRRDEA-------LEALQRAERRLLQLRQALDELELQLSPQAGSLLHFL 545
Query: 309 RK------------------KHNDAVSEMGEQIDQLN----KLKTKAEHERNAAHNETNN 346
R + D ++ E D L + + A NET
Sbjct: 546 RNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANET-E 604
Query: 347 TRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKK-------KLSIE 399
R Q + KQ + QL + +L+E R+ + A K +L E
Sbjct: 605 LRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNE 664
Query: 400 NSDLLRQLEEA--------ESQVSQLSKIKISLTTQLEDTKRLADEEGRERATL-LGKFR 450
L +LE A E+Q+ QL L Q + ++ RE T L K++
Sbjct: 665 QQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQ 724
Query: 451 NLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEER 510
+E ++DN QL E + A A+L K EL +
Sbjct: 725 VVEGELDNQLAQLSAAIEAA--------RTQAKARLKELK-----KQYDRELASLDVDPN 771
Query: 511 RVDDLA---AELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKNLA 558
V +L EL+ + + E+ + +E +++R E NLA
Sbjct: 772 TVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWLHRDSLREERPNLA 822
Score = 31.6 bits (72), Expect = 1.6
Identities = 63/318 (19%), Positives = 126/318 (39%), Gaps = 51/318 (16%)
Query: 9 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTT 68
+LQH E+ + + AK +L + L QL+EA +++Q
Sbjct: 264 RLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQ---------- 313
Query: 69 QLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDR 128
+L + +L++L + +++
Sbjct: 314 ELSAAN-------------------AKLAADRSELELLEDQKGA--------FEDADIEQ 346
Query: 129 LQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERN 188
LQ + D + + +++E L +D++ E+ K+KI+ L+ DLE+N
Sbjct: 347 LQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLE------RDLEKN 400
Query: 189 KKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKA 248
+ L + KD++ A++ L+ + ++L+A E EE E E + R
Sbjct: 401 NERLAAIREEKDRQKAAIEEDLQ------ALESQLRQQLEAGKLEFNEE-EYELELRLGR 453
Query: 249 EKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANL 308
KQR D A E E+LE A E ++ EA + +L+ +L + + + L L
Sbjct: 454 LKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEAL 513
Query: 309 RKKHNDAVSEMGEQIDQL 326
++ + ++ + +D+L
Sbjct: 514 QRA-ERRLLQLRQALDEL 530
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 43.0 bits (102), Expect = 3e-04
Identities = 41/161 (25%), Positives = 73/161 (45%), Gaps = 8/161 (4%)
Query: 130 QQEEDARNQLFQQKKKLEQENGGLKKDIEDLEG--DVEKQKRKIEGDLKLTQEAVADLER 187
++ A + ++ ++E LKK+ LE +V K + ++E +LK + + LER
Sbjct: 25 KKLGSAEELAKRIIEEAKKEAETLKKEAL-LEAKEEVHKLRAELERELKERRNELQRLER 83
Query: 188 NKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA--ERQSR 245
+ E+T+ RK + + LE ++ + +K + E + +EEL E ER S
Sbjct: 84 RLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISG 143
Query: 246 AKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 286
E+ + L LEE+ E +IE K EA+
Sbjct: 144 LTQEEAKEIL---LEEVEEEARHEAAKLIKEIEEEAKEEAD 181
Score = 31.4 bits (72), Expect = 1.2
Identities = 31/119 (26%), Positives = 57/119 (47%), Gaps = 11/119 (9%)
Query: 129 LQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERN 188
LQ+ E +L Q+++ L+++ L K E+LE K E +L ++ + + E
Sbjct: 78 LQRLE---RRLLQREETLDRKMESLDKKEENLE--------KKEKELSNKEKNLDEKEEE 126
Query: 189 KKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAK 247
+EL + + + I+ LT + E L ++ E I+E+EEE + E +AK
Sbjct: 127 LEELIAEQREELERISGLTQEEAKEILLEEVEEEARHEAAKLIKEIEEEAKEEADKKAK 185
Score = 29.9 bits (68), Expect = 4.6
Identities = 36/164 (21%), Positives = 74/164 (45%), Gaps = 24/164 (14%)
Query: 325 QLNKLKTKAEHERNAAHNETNNTR-----AAADQLAREKAATEKIAKQ-------LQHQL 372
+L + A+ A E + A +++ + +A E+ K+ L+ +L
Sbjct: 26 KLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRL 85
Query: 373 NEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTK 432
+ + LD SL+ + +K E S+ + L+E E ++ +L + Q E+ +
Sbjct: 86 LQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEEL------IAEQREELE 139
Query: 433 RLAD---EEGRERATLLGKFRN-LEHDIDNIREQLEEEAEGKAD 472
R++ EE +E LL + H+ + +++EEEA+ +AD
Sbjct: 140 RISGLTQEEAKEI--LLEEVEEEARHEAAKLIKEIEEEAKEEAD 181
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 43.1 bits (101), Expect = 4e-04
Identities = 59/315 (18%), Positives = 120/315 (38%), Gaps = 32/315 (10%)
Query: 107 NFTTENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEK 166
+ + ++ + RN L Q+ + +Q L++DI+ L +
Sbjct: 179 ALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQDIQTLT----Q 234
Query: 167 QKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKE 226
+K++ + L+ TQ+ A+LER K EL+Q +++ T + DE+ Q + ++
Sbjct: 235 KKQENDRVLEGTQDIEAELERMKGELKQRLKKM-------TIQRRDEETERIDLQLENEQ 287
Query: 227 LQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 286
L + L+E +E+ +Q LGE L G I + E
Sbjct: 288 LHEDLRTLQERLESSQQKAGL--------------LGEELASLGSLRDHTIAELHQSRLE 333
Query: 287 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNN 346
+++ L + N+ + + ++ D++ KL + E +
Sbjct: 334 SAQMSLQLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKLSAELLKLEEWLQEERSQ 393
Query: 347 TRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQ 406
+L EK + QL+E + +L E +L K++L E +LL
Sbjct: 394 REKLEVELGIEKDCN-------RVQLSENRRELSELRSALRVLQKEKEQLQEEKQELLDY 446
Query: 407 LEEAESQVSQLSKIK 421
+ E ++ + + K
Sbjct: 447 IRVLELRLDKEADEK 461
Score = 29.2 bits (65), Expect = 7.4
Identities = 33/179 (18%), Positives = 70/179 (39%), Gaps = 16/179 (8%)
Query: 375 VQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRL 434
+Q +L+ R ND L + SD+ ++E+ E+ + ++ L Q ++ +
Sbjct: 141 LQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSS 200
Query: 435 ADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESE 494
++ ER LL + + I + + ++ + K + R L
Sbjct: 201 SESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGT-------------- 246
Query: 495 GVARAEELEESKSE-ERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRR 552
ELE K E ++R+ + + + E + E +L QE+LE+ ++
Sbjct: 247 -QDIEAELERMKGELKQRLKKMTIQRRDEETERIDLQLENEQLHEDLRTLQERLESSQQ 304
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 42.3 bits (100), Expect = 6e-04
Identities = 39/273 (14%), Positives = 95/273 (34%), Gaps = 26/273 (9%)
Query: 224 IKELQ----ARIEELEEEVEAERQS--RAKAEKQRADLARELEELGERLEEAGGATSAQI 277
+ EL +E+LEE + ++ AE + L+ L E+ + L G
Sbjct: 22 LAELHDFGLVHLEDLEEGEKGLKELEKLKVAEVAQISLSSLLSEVLDYLRSVKGLEGRLF 81
Query: 278 ELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQL-NKLKTKAEHE 336
L + ++ + LEE E + + + ++ E++ +L +L+ +
Sbjct: 82 ILPE-EVEKLEAELKSLEEVIKPAEKFSSEVEE----LTRKLEERLSELDEELEDLEDLL 136
Query: 337 RNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDET------NRSLNDFD 390
+ L K ++ + +L + G +++ N +
Sbjct: 137 EELEPLAYLDF--DLSLLRGLKFLLVRLGLVRREKLEALVGVIEDEVALYGENVEASVVI 194
Query: 391 AAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFR 450
+ + L E ++ ++ + + + + + + E E +L +
Sbjct: 195 VVAHGAED-LDKVSKILNELGFELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELE 253
Query: 451 NL----EHDIDNIREQLEEEAEGKADLQRQLSK 479
L ++ +RE LE E D+ +L++
Sbjct: 254 ALAEKIAEELLAVREILEIEKA-LGDVLSKLAR 285
Score = 35.4 bits (82), Expect = 0.073
Identities = 37/224 (16%), Positives = 67/224 (29%), Gaps = 47/224 (20%)
Query: 369 QHQLNEVQGKLDETNRSLNDFDA-----AKKKLSIENSDL-------LRQLEEAESQVSQ 416
H V L++ + I S L LR ++ E ++
Sbjct: 25 LHDFGLVH--LEDLEEGEKGLKELEKLKVAEVAQISLSSLLSEVLDYLRSVKGLEGRLFI 82
Query: 417 LSKIKISLTTQLEDTKRL----------ADEEGRERATLLGKFRNLEHDIDNIREQLEEE 466
L + L +L+ + + +E R+ + L+ +++++ + LEE
Sbjct: 83 LPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEE---RLSELDEELEDLEDLLEEL 139
Query: 467 AEGK-----ADLQRQLSKANADAQLWRTKYESEGVARAEELEES------KSEERRVDDL 515
L R L L R + A +E+ E V +
Sbjct: 140 EPLAYLDFDLSLLRGLKFLLVRLGLVR---REKLEALVGVIEDEVALYGENVEASVVIVV 196
Query: 516 AAELDASQKECRNYSTELFRL------KGAYEEAQEQLEAVRRE 553
A + K + + F L G E +LE V E
Sbjct: 197 AHGAEDLDKVSKILNELGFELYEVPEFDGGPSELISELEEVIAE 240
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 41.6 bits (98), Expect = 0.001
Identities = 56/333 (16%), Positives = 132/333 (39%), Gaps = 46/333 (13%)
Query: 124 DTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEG---DLKLTQE 180
D +DRL D N L ++KK + + +IE LE ++++Q K E ++ +
Sbjct: 182 DLIDRLAG--DLTNVLRRRKKSELPSS--ILSEIEALEAELKEQSEKYEDLAQEIAHLRN 237
Query: 181 AVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLV----GKNQKQIKELQAR------ 230
+ + +R+ + LE+ + + ++ +LE + + N+ Q++EL A
Sbjct: 238 ELEEAQRSLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLL 297
Query: 231 ----IEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 286
++ + +++ E QS+ Q R+ EL E L + E+ +
Sbjct: 298 IPNLLDSTKAQLQKEEQSQQNQLTQEELEERDK-ELLESLPKLALPAEHVKEIAAELAEI 356
Query: 287 MSKLRRDLE-------EANIQHESTLANLRKKHNDAVS-------EMGEQIDQLNKLKTK 332
D E Q E + ++++ DA S E+ E++ +++K
Sbjct: 357 DKPATTDSEIPHRLSGSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDK---- 412
Query: 333 AEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDF--- 389
+ + + + E +E ++L QL ++ ++ ++L++
Sbjct: 413 -KISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQ 471
Query: 390 --DAAKKKLSIENSDLLRQLEEAESQVSQLSKI 420
+A + + +I +D ++ + + K+
Sbjct: 472 KINAFELERAITIADKAKKTLKEFREKLLERKL 504
Score = 39.7 bits (93), Expect = 0.004
Identities = 58/294 (19%), Positives = 113/294 (38%), Gaps = 52/294 (17%)
Query: 219 KNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIE 278
+I+ L+A ++E E+ E Q A + + R LE L ++ GG + E
Sbjct: 206 SILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLESLEKKFRSEGGDLFEERE 265
Query: 279 LNKKREAEMSKLRRDLEEANIQH--------------ESTLANLRK----KHNDAVSEMG 320
+++ E+ R+ + +ST A L+K + N E
Sbjct: 266 QLERQLKEIEAARKANRAQLRELAADPLPLLLIPNLLDSTKAQLQKEEQSQQNQLTQEEL 325
Query: 321 EQIDQ-----LNKLKTKAEH---------ERNAAHNETNNT--RAAADQLAREKAATEKI 364
E+ D+ L KL AEH E + + R + +L + + +++
Sbjct: 326 EERDKELLESLPKLALPAEHVKEIAAELAEIDKPATTDSEIPHRLSGSELTQLEVLIQQV 385
Query: 365 AK-------QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQL 417
+ QL +L E++ +L E ++ ++ + ++ + LL +L EA++++ +
Sbjct: 386 KRELQDAKSQLLKELRELEEELAEVDKKISTIPSEEQI-----AQLLEELGEAQNELFRS 440
Query: 418 SKIKISLTTQLEDTKRLADE---EGRERATLLGKFRNLEHDI---DNIREQLEE 465
L QLE K + E+ LE I D ++ L+E
Sbjct: 441 EAEIEELLRQLETLKEAIEALRKTLDEKTKQKINAFELERAITIADKAKKTLKE 494
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 41.1 bits (97), Expect = 0.001
Identities = 19/109 (17%), Positives = 46/109 (42%)
Query: 192 LEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQ 251
L + I KD + L +++ + L+ ++ ++LQ + L + A R++ +
Sbjct: 44 LSREISGKDSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQAL 103
Query: 252 RADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQ 300
A+LA R E ++ +++ + A++ L + + Q
Sbjct: 104 LAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQIAALRRQ 152
Score = 35.3 bits (82), Expect = 0.063
Identities = 23/112 (20%), Positives = 54/112 (48%)
Query: 186 ERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSR 245
+ ++L+ ++ +++ A+ Q L+ + + R EL +E+++E+Q
Sbjct: 73 RQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVS 132
Query: 246 AKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEA 297
A+A Q L +++ L +L A A + +++ +A+++ L R L A
Sbjct: 133 ARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKIADLGRRLNVA 184
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 41.2 bits (96), Expect = 0.001
Identities = 30/141 (21%), Positives = 55/141 (39%), Gaps = 8/141 (5%)
Query: 122 SQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEA 181
+QD RL+ + R QL Q + L+ L+ L+ V DLKL
Sbjct: 146 AQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVL--------DLKLRSAQ 197
Query: 182 VADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAE 241
+ +N Q + +E+A A + + + QI + +I E++
Sbjct: 198 IEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRER 257
Query: 242 RQSRAKAEKQRADLARELEEL 262
+ + E +A L +E+ +L
Sbjct: 258 ERQLQRLETAQARLEQEVAQL 278
Score = 33.5 bits (76), Expect = 0.34
Identities = 42/245 (17%), Positives = 90/245 (36%), Gaps = 35/245 (14%)
Query: 223 QIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAG------------ 270
Q+ +++ ++ L E+ + + AE +R EL++ + E
Sbjct: 75 QLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAK 134
Query: 271 -----GATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQ 325
+ Q + + R +++ RR LE +L +K+ + +++ Q+
Sbjct: 135 AQQELARLTKQAQDLQTRLKTLAEQRRQLEA----QAQSLQASQKQLQASATQLKSQVLD 190
Query: 326 LNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRS 385
L + E E N +A ++LAR AA ++ A+ +Q + ++ K +
Sbjct: 191 LKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAAR 250
Query: 386 LNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATL 445
+ RQL+ E+ ++L + L + RL + AT
Sbjct: 251 AEQIRERE-----------RQLQRLETAQARLEQEVAQLEAYYQAYVRLR---QQAAATQ 296
Query: 446 LGKFR 450
G+
Sbjct: 297 RGQVL 301
Score = 33.1 bits (75), Expect = 0.46
Identities = 28/203 (13%), Positives = 76/203 (37%), Gaps = 5/203 (2%)
Query: 286 EMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETN 345
E+ + + A + E+ + L+K + + +++ + KA+ E +
Sbjct: 89 ELGTAQGEKRAAETEREAARSELQKARQER-EAVRQELAAARQNLAKAQQELARLTKQAQ 147
Query: 346 NTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLR 405
+ + LA ++ E A+ LQ ++Q + + D ++ E +L
Sbjct: 148 DLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLAT 207
Query: 406 QLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEE 465
+ A+++ +L+ + T + + + + + I QL+
Sbjct: 208 RANAAQARTEELA----RRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQR 263
Query: 466 EAEGKADLQRQLSKANADAQLWR 488
+A L++++++ A Q +
Sbjct: 264 LETAQARLEQEVAQLEAYYQAYV 286
Score = 28.8 bits (64), Expect = 9.8
Identities = 34/205 (16%), Positives = 80/205 (39%), Gaps = 18/205 (8%)
Query: 129 LQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERN 188
+ E R + +K QE ++ + + K + +L + DL+
Sbjct: 97 KRAAETEREAARSELQKARQE----REAVRQELAAARQNLAKAQQELARLTKQAQDLQTR 152
Query: 189 KKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEE----------V 238
K L + ++ + + SL A + Q + + Q+ +L+ R ++E+E
Sbjct: 153 LKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAA 212
Query: 239 EAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEAN 298
+A + A+ A+ +++ ++ + +A+ E ++RE ++ + LE A
Sbjct: 213 QARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQL----QRLETAQ 268
Query: 299 IQHESTLANLRKKHNDAVSEMGEQI 323
+ E +A L + V +
Sbjct: 269 ARLEQEVAQLEAYYQAYVRLRQQAA 293
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 41.4 bits (98), Expect = 0.001
Identities = 61/314 (19%), Positives = 120/314 (38%), Gaps = 54/314 (17%)
Query: 115 SFSISLYSQDTVDRLQQEEDARNQL--FQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIE 172
S + + L E D + QL ++K LE E+ +++D+E
Sbjct: 21 CLSSAFARAASNGDLPTEADVQAQLDALNKQKLLEAEDKLVQQDLEQ------------- 67
Query: 173 GDLKLTQEAVADLERNKKE---LEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQA 229
T + ++R K+E L+Q + + ++ A+LE L N ++ +E +
Sbjct: 68 -----TLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEA---LKDDNDEETRETLS 119
Query: 230 R--IEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 287
+ +LE + A+ A+ +L L + E A A A N +R +
Sbjct: 120 TLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYA----NSQR---L 172
Query: 288 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNT 347
++R L+ + ++ + R + + Q D K E N
Sbjct: 173 QQIRNLLKGGKVGGKALRPSQRVLLQAEQALLNAQNDLQRKSL------------EGNTQ 220
Query: 348 RAAADQLAREKAATEKIAKQLQHQLNEVQG-----KLDETNRSLNDFDAAKKKLSIENSD 402
Q R+ T +I +L+HQL +Q +L + +++ + + + I+ +
Sbjct: 221 LQDLLQKQRDY-LTARIQ-RLEHQLQLLQEAINSKRLTLSEKTVQEAQSQDEAARIQANP 278
Query: 403 LLRQLEEAESQVSQ 416
L+ Q E Q+SQ
Sbjct: 279 LVAQELEINLQLSQ 292
Score = 34.1 bits (79), Expect = 0.22
Identities = 53/278 (19%), Positives = 106/278 (38%), Gaps = 65/278 (23%)
Query: 1 AATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 60
++ +QL+ QL + KL + L + + E L L + ES+++Q
Sbjct: 76 DRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETLSTL-SLRQLESRLAQ-- 132
Query: 61 KIKISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISL 120
QL++ +L + + + ++L++
Sbjct: 133 -----TLDQLQN-----------AQNDLAEYNSQLVSLQT-------------------- 156
Query: 121 YSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQE 180
Q E A+ L+ ++L+Q LK + Q+ ++ + L
Sbjct: 157 ----------QPERAQAALYANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQALLN- 205
Query: 181 AVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEE----LEE 236
A + +K LE Q +D L + + + + + Q++ LQ I L E
Sbjct: 206 --AQNDLQRKSLEGNTQLQD----LLQKQRDYLTARIQRLEHQLQLLQEAINSKRLTLSE 259
Query: 237 EVEAERQSRAKAEKQRAD--LARELE---ELGERLEEA 269
+ E QS+ +A + +A+ +A+ELE +L +RL +A
Sbjct: 260 KTVQEAQSQDEAARIQANPLVAQELEINLQLSQRLLKA 297
Score = 29.9 bits (68), Expect = 4.8
Identities = 28/142 (19%), Positives = 58/142 (40%), Gaps = 17/142 (11%)
Query: 408 EEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEA 467
+ ++Q+ L+K K LE +L ++ + LL K + + + +++QL +
Sbjct: 39 ADVQAQLDALNKQK-----LLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAP 93
Query: 468 EGKADLQRQLS--KANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKE 525
Q +L K + D + R + + + E R+ +L +Q +
Sbjct: 94 AKLRQAQAELEALKDDNDEET-RETLSTLSLRQLE---------SRLAQTLDQLQNAQND 143
Query: 526 CRNYSTELFRLKGAYEEAQEQL 547
Y+++L L+ E AQ L
Sbjct: 144 LAEYNSQLVSLQTQPERAQAAL 165
>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF. Sequences
in this family of proteins are members of the chain
length determinant family (pfam02706) which includes the
wzc protein from E.coli. This family of proteins are
homologous to the EpsF protein of the methanolan
biosynthesis operon of Methylobacillus species strain
12S. The distribution of this protein appears to be
restricted to a subset of exopolysaccharide operons
containing a syntenic grouping of genes including a
variant of the EpsH exosortase protein. Exosortase has
been proposed to be involved in the targetting and
processing of proteins containing the PEP-CTERM domain
to the exopolysaccharide layer.
Length = 444
Score = 40.9 bits (96), Expect = 0.001
Identities = 55/185 (29%), Positives = 80/185 (43%), Gaps = 25/185 (13%)
Query: 130 QQEEDARNQLFQQKKKL---EQENGGLKKDIEDLEGDVEKQK-RKIEGDLKLTQEAVADL 185
QQ R L + + KL +QE G + D E L DVE+ + ++ L Q V D
Sbjct: 178 QQIAALREDLARAQSKLSAYQQEKGIVSSD-ERL--DVERARLNELSAQLVAAQAQVMDA 234
Query: 186 ERNKKELEQT-----------IQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEEL 234
+ IQ +IA +KL + +G N Q K QA I L
Sbjct: 235 SSKEGGSSGKDALPEVIANPIIQNLKTDIARAESKLAELSQRLGPNHPQYKRAQAEINSL 294
Query: 235 EEEVEAE--RQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 292
+ ++ AE + + + R RE EL E LE +ELN++R+ EMS L+R
Sbjct: 295 KSQLNAEIKKVTSSVGTNSRILKQRE-AELREALENQKAKV---LELNRQRD-EMSVLQR 349
Query: 293 DLEEA 297
D+E A
Sbjct: 350 DVENA 354
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 41.5 bits (97), Expect = 0.001
Identities = 40/197 (20%), Positives = 80/197 (40%), Gaps = 11/197 (5%)
Query: 182 VADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEE---EV 238
V DL +++ I + D+++A L KL++ + +++ +AR EEL E ++
Sbjct: 837 VIDLAAETARVDKEIGKVDQDLAVLERKLQNPSFVQNAPPAVVEKDRARAEELREKRGKL 896
Query: 239 EAERQ--SRAKAEKQRADLARELEELGERLEEAGGATSAQIE--LNKKREAEMSKLRRDL 294
EA R S ++A R D E + E + + E ++ +
Sbjct: 897 EAHRAMLSGSEANSARRDTMEIQNEQKPTQDGPAAEAQPAQENTVVESAEKAVAAVSEAA 956
Query: 295 EEANIQHESTLANLR----KKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAA 350
++A S + + K +V + + K K + AA + +AA
Sbjct: 957 QQAATAVASGIEKVAEAVRKTVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAA 1016
Query: 351 ADQLAREKAATEKIAKQ 367
A + +KA +K+A++
Sbjct: 1017 AKKKVAKKAPAKKVARK 1033
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 40.9 bits (97), Expect = 0.002
Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 14/81 (17%)
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
D+E LE+ + + +KEI + KL +E G V K +++ VE ER+
Sbjct: 808 DVEAELARLEKELAKLEKEIERVEKKLSNE-GFVAKAPEEV-------------VEKERE 853
Query: 244 SRAKAEKQRADLARELEELGE 264
A+ E++ A L L L
Sbjct: 854 KLAEYEEKLAKLKERLARLKA 874
Score = 30.1 bits (69), Expect = 4.3
Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 3/66 (4%)
Query: 146 LEQENGGLKKDIEDLEGDVEKQKRKI--EGDLKLTQEAVADLERNK-KELEQTIQRKDKE 202
+E E L+K++ LE ++E+ ++K+ EG + E V + ER K E E+ + + +
Sbjct: 809 VEAELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLAKLKER 868
Query: 203 IASLTA 208
+A L A
Sbjct: 869 LARLKA 874
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 40.3 bits (95), Expect = 0.003
Identities = 25/113 (22%), Positives = 41/113 (36%), Gaps = 12/113 (10%)
Query: 196 IQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEK---QR 252
++ + IA+L E E + + + R+ EL E+ A A E Q
Sbjct: 421 LEDLRRRIAAL----ELELDALEREAALGADHDERLAELRAELAALEAELAALEARWQQE 476
Query: 253 ADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTL 305
+L + L LE A + A++++L L A Q E L
Sbjct: 477 KELVEAILALRAELEADADAPADDDAA---LRAQLAELEAALASA--QGEEPL 524
Score = 38.4 bits (90), Expect = 0.010
Identities = 25/108 (23%), Positives = 46/108 (42%), Gaps = 12/108 (11%)
Query: 157 IEDLEGDVEKQKRKIEGDLKLTQEAVADLERNK--KELEQTIQRKDKEIASLTAKLEDEQ 214
+EDL + + +++ L +EA + ++ EL + + E+A+L A+ + E+
Sbjct: 421 LEDLRRRIAALELELD---ALEREAALGADHDERLAELRAELAALEAELAALEARWQQEK 477
Query: 215 GLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEEL 262
LV I L+A +E +A A Q A+L L
Sbjct: 478 ELVE----AILALRAELEADA---DAPADDDAALRAQLAELEAALASA 518
Score = 29.9 bits (68), Expect = 4.9
Identities = 24/98 (24%), Positives = 40/98 (40%), Gaps = 1/98 (1%)
Query: 224 IKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKR 283
+++L+ RI LE E++A + A LA EL LE A A+ + K+
Sbjct: 421 LEDLRRRIAALELELDALEREAALGADHDERLA-ELRAELAALEAELAALEARWQQEKEL 479
Query: 284 EAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGE 321
+ LR +LE A LR + + + +
Sbjct: 480 VEAILALRAELEADADAPADDDAALRAQLAELEAALAS 517
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 40.0 bits (93), Expect = 0.004
Identities = 52/258 (20%), Positives = 117/258 (45%), Gaps = 4/258 (1%)
Query: 130 QQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNK 189
Q+E ++ Q ++++++ + K+ E + E++ + + + + + + LER K
Sbjct: 251 QEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 310
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAE 249
+ E+ ++ +KE+ L +L+ E+ + + +K++KEL+ + E EEE E + + K E
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLE 370
Query: 250 KQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLR 309
+ +L + + ERL A E + + E K + L E + Q E L +
Sbjct: 371 QLEEELLAKKKLESERLSSAA----KLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEK 426
Query: 310 KKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQ 369
K+ V E+ E ++ T+ + E + + + T+ + Q
Sbjct: 427 KEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQ 486
Query: 370 HQLNEVQGKLDETNRSLN 387
+L ++ KL+E ++ +
Sbjct: 487 LELLLLRQKLEEASQKES 504
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 39.8 bits (94), Expect = 0.004
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 11/119 (9%)
Query: 151 GGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKL 210
G K+ + +L +E+ +R++E + + + + E+ K+ELE+ ++ +E L +
Sbjct: 512 GEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA 571
Query: 211 EDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEA 269
E E Q+ IKE + +E+ +E+ ++ + K A EL E +RL +A
Sbjct: 572 EKEA------QQAIKEAKKEADEIIKELRQLQKGGYASVK-----AHELIEARKRLNKA 619
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 39.7 bits (93), Expect = 0.004
Identities = 49/298 (16%), Positives = 107/298 (35%), Gaps = 41/298 (13%)
Query: 122 SQDTVDRLQQEEDARNQ----LFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKL 177
SQ D +++ +Q +Q + D+ DL
Sbjct: 180 SQGLEDYRKRKRQRLDQKAAEYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRFTR 239
Query: 178 TQEAVADLERNK-------KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQAR 230
T++ E + ++ E+ +L ++++ + + + +IK L ++
Sbjct: 240 TEKVEQAATAAAEVPPAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASK 299
Query: 231 IEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 290
I+++ EE+ RQ + E++ + + EL E A + + +R + +L
Sbjct: 300 IKQVNEELTTVRQENEELEEEY-KIKKRTVELLPDAENNVAKLQALVVASSER---LLEL 355
Query: 291 RRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEH------ERNAAHNET 344
+ E +L++K+ + E Q+D++ KL+ K E + + +
Sbjct: 356 AQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQL 415
Query: 345 NNTRAAADQLAREKAATEKI--------------------AKQLQHQLNEVQGKLDET 382
+ A + A T +I + LQ ++N + GKLD T
Sbjct: 416 LDEYENAPKSVSRSAYTRRILEIIKNIKKQKEDIDKILSDTRSLQKEINNITGKLDRT 473
Score = 33.2 bits (76), Expect = 0.41
Identities = 50/269 (18%), Positives = 98/269 (36%), Gaps = 22/269 (8%)
Query: 172 EGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTA---KLEDEQGLVGKNQKQIKELQ 228
+L+ LE N + LE +RK + + A + + G G +
Sbjct: 163 SNSSRLSFAQEWRLEWNSQGLEDYRKRKRQRLDQKAAEYLRQAAQAGTTGSSSADDLADL 222
Query: 229 ARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMS 288
L+ + R +R + +Q A A E+ EE K+REAE+
Sbjct: 223 LSKFRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMDTEED---------RTKEREAELE 273
Query: 289 KLRRDLEEANIQ------HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHN 342
L+ ++E LA+ K+ N+ ++ + ++ ++L + + +
Sbjct: 274 ALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPD 333
Query: 343 ETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSD 402
NN L A+ + +L Q + L + RSL + + K+ + D
Sbjct: 334 AENNVAK----LQALVVASSERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLD 389
Query: 403 LLRQLEEAESQVSQLSKIKISLTTQLEDT 431
+++L ++ + K L QL D
Sbjct: 390 EIKKLRNKIEELESELQTKEQLYKQLLDE 418
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 39.3 bits (92), Expect = 0.006
Identities = 36/136 (26%), Positives = 62/136 (45%), Gaps = 7/136 (5%)
Query: 157 IEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLE-DEQG 215
I DL DV +RK +++ ++ER ++LE I++ + K +
Sbjct: 12 IRDLNEDVSAFQRKFVNEVR----RCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSK 67
Query: 216 LVGKNQKQIKELQARIEELEEEVEA-ERQSRAKAEKQRADLARELEELGERLEEAGGATS 274
+++I +L+A I+E+EE +E+ E++ E L E L E LEE ++
Sbjct: 68 EFLDLEEEILDLEAEIKEVEENLESLEKEINELEE-WLNVLDEEKSFLDENLEELSELSN 126
Query: 275 AQIELNKKREAEMSKL 290
I+ R AE KL
Sbjct: 127 LDIDFKYLRGAEGLKL 142
Score = 36.6 bits (85), Expect = 0.036
Identities = 53/267 (19%), Positives = 99/267 (37%), Gaps = 27/267 (10%)
Query: 85 VEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDR--LQQEEDARNQLFQQ 142
V+ L G + + ++ L+ DV F F + D V+R + E +
Sbjct: 1 VDAL--GELGLVQIRDLNEDVSAF----QRKFVNEVRRCDEVERKLRKLESKIKKLGIPL 54
Query: 143 KKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKE 202
K + + K+ DLE ++ +E ++K +E + LE+ ELE+ + D+E
Sbjct: 55 KDTGGKPDVPPSKEFLDLEEEILD----LEAEIKEVEENLESLEKEINELEEWLNVLDEE 110
Query: 203 IASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELE-- 260
+ L LE+ L + K L+ E L+ A +R K E +L R
Sbjct: 111 KSFLDENLEELSELSNLD-IDFKYLRGA-EGLKLGFVAGVINREKLEAFERELWRACRGY 168
Query: 261 ----ELGERLEEAGG------ATSAQIELNKKREAEMSKLRR-DLEEANIQHESTLANLR 309
E+ E LE+ + K+ + D+ E + ++ +
Sbjct: 169 IRQAEIEEPLEDPKKTVFIIFFVGKEDLDKVKKILDSFGFELYDVPETEGERSELISKVN 228
Query: 310 KKHNDAVSEMGEQIDQLNKLKTKAEHE 336
K+ + + + L K+ K E
Sbjct: 229 KRIEELQRVLEQTESHLEKVLVKIADE 255
Score = 34.7 bits (80), Expect = 0.16
Identities = 27/132 (20%), Positives = 44/132 (33%), Gaps = 36/132 (27%)
Query: 192 LEQTIQRKDKEIASLTAKLED-------EQGLVGKNQKQIKELQARIEELEEEVEAERQS 244
++ + + K + S +L D L+ K K+I+ELQ +E+ E +E
Sbjct: 192 GKEDLDKVKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESHLEKVLVK 251
Query: 245 RAKA----------EKQRADL------------------ARELEELGERLEEAGGAT-SA 275
A EK + A++LE+L LE A + S
Sbjct: 252 IADELLAWDEQVSKEKAVYETLNLFNYDTKTLIAEGWVPAKDLEKLKAALENATEGSGSV 311
Query: 276 QIELNKKREAEM 287
N E
Sbjct: 312 PSIENDIETNEE 323
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 38.6 bits (90), Expect = 0.008
Identities = 75/373 (20%), Positives = 134/373 (35%), Gaps = 42/373 (11%)
Query: 127 DRLQQEEDARNQLFQQKKKLEQ---ENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVA 183
D L DA QL + + +LE+ L+ ++ R+ L+ Q
Sbjct: 220 DELTDVADALEQLDEYRDELERLEALERALRNFLQRYRRYARTMLRRRATRLRSAQTQYD 279
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQA--------RIEELE 235
L R+ ++ +E L A+ E + + +++ LQ +E
Sbjct: 280 QLSRDLGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERAR 339
Query: 236 EEVEAERQSRAKAEKQRADLARELEELGERL-EEAGGATSAQIELNKKREAEMSKLRRDL 294
+ EA + + A A + + LEE RL EEAG A+ EL A +L R
Sbjct: 340 ADAEALQAAAADARQAIREAESRLEEERRRLDEEAGRLDDAEREL----RAAREQLARAA 395
Query: 295 EEANI-----------------QHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEH-E 336
E A + Q L R+ + +++ EQ+ L + A+ E
Sbjct: 396 ERAGLSPAHTAEPDAALAAQELQELGALDARRQDADRVIAQRSEQVALLRRRDDVADRAE 455
Query: 337 RNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKL 396
A RA D+L E A+ ++ +L +
Sbjct: 456 ATHAA-----ARARRDELDEEAEQAAARAELADEAVHREGARLAWVDAWQAQLRELTILA 510
Query: 397 SIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDI 456
+ L L+ ++ + + +++++ + + LADE RERA L LE +
Sbjct: 511 VDDQPGALADLDSWDALLQGEAPVRVAVYSA---VQPLADELTRERAALRLAEEVLEEER 567
Query: 457 DNIREQLEEEAEG 469
D +R + E +G
Sbjct: 568 DALRTERERLEQG 580
Score = 30.5 bits (69), Expect = 3.1
Identities = 21/121 (17%), Positives = 39/121 (32%), Gaps = 3/121 (2%)
Query: 224 IKELQARIEELEEEVEAERQSRAKAEKQRADLAR---ELEELGERLEEAGGATSAQIELN 280
++EL+ E+ ++ + A + D A E EE RL + A ++
Sbjct: 867 VRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAMVDEI 926
Query: 281 KKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAA 340
+ R AE + LA + A + E L++ +H
Sbjct: 927 RARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADATLDERAEARDHAIGQL 986
Query: 341 H 341
Sbjct: 987 R 987
Score = 30.2 bits (68), Expect = 4.0
Identities = 49/261 (18%), Positives = 90/261 (34%), Gaps = 19/261 (7%)
Query: 9 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDL--LRQLEEAESQVSQLSKIKISL 66
+L +L V +L E R L A ++ L+ E + R L A + ++ + S
Sbjct: 746 ELDARLAAVDDELAELARELRALGARQRALADELAGAPSDRSLRAAHRRAAEAERQAESA 805
Query: 67 TTQLEDTKRLADEESSGWVEELTKGSIEAITLK-SLDLDVLNFTTENSHSFSISLYSQDT 125
+L R A ++ W + + +A L D D L ++ L +
Sbjct: 806 ERELARAARKAAAAAAAWKQARRELERDAADLDLPTDPDALE---------AVGLALKRF 856
Query: 126 VDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKR---KIEGDLKLTQEAV 182
D L E A +L + ++ + E D + + + + +
Sbjct: 857 GDHLHTLEVAVRELRHAATRAAEQ----RARAARAESDAREAAEDAAEARAEAEEASLRL 912
Query: 183 ADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAER 242
LE + + I+ + E + A E + + +E + R EE E +A
Sbjct: 913 RTLEESVGAMVDEIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKRAEADATL 972
Query: 243 QSRAKAEKQRADLARELEELG 263
RA+A RE G
Sbjct: 973 DERAEARDHAIGQLREFALTG 993
Score = 29.8 bits (67), Expect = 5.1
Identities = 43/210 (20%), Positives = 77/210 (36%), Gaps = 15/210 (7%)
Query: 186 ERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSR 245
R EL+ + D E+ A+L E +G Q+ + + A + + A +
Sbjct: 741 LRRIAELDARLAAVDDEL----AELARELRALGARQRALADELAGAPS-DRSLRAAHRRA 795
Query: 246 AKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRD---LEEANI--- 299
A+AE+Q REL + A A Q +R+A L D LE +
Sbjct: 796 AEAERQAESAERELARAARKAAAAAAAW-KQARRELERDAADLDLPTDPDALEAVGLALK 854
Query: 300 ---QHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAR 356
H TL ++ A + EQ + + ++ A A A+ +L
Sbjct: 855 RFGDHLHTLEVAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRT 914
Query: 357 EKAATEKIAKQLQHQLNEVQGKLDETNRSL 386
+ + + +++ +L E + L R L
Sbjct: 915 LEESVGAMVDEIRARLAETRAALASGGREL 944
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 38.1 bits (89), Expect = 0.012
Identities = 44/227 (19%), Positives = 94/227 (41%), Gaps = 18/227 (7%)
Query: 219 KNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIE 278
++K+ + A I +L++ ++ E++ R E + + + E L ++ + A
Sbjct: 807 GSRKEYRSYLACIIKLQKTIKREKKLRETEEVEF---SLKAEVLIQKFGRSLKAKKRFSL 863
Query: 279 LNKKREAEMSKLRRDLEEANIQHE----STLANLRKKHNDAVSEMGEQIDQLNK---LKT 331
L K+ S R +L E +Q ++++L+ + + SE+ E L+
Sbjct: 864 LKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENL 923
Query: 332 KAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDA 391
+ + E A + N + + E ++ K L+EV+ KL ET+ D
Sbjct: 924 EFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNK-----LHEVESKLKETSEEYED--- 975
Query: 392 AKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEE 438
KK +I + + E ++ +L+++ E TK+L +
Sbjct: 976 LLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELP 1022
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 36.9 bits (86), Expect = 0.014
Identities = 22/94 (23%), Positives = 52/94 (55%), Gaps = 3/94 (3%)
Query: 221 QKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELN 280
Q+++ ELQ + EL+E++ +Q + +++ + L ELE L + L ++ IEL+
Sbjct: 72 QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELD 131
Query: 281 KKRE---AEMSKLRRDLEEANIQHESTLANLRKK 311
++ E+++L+++ E ++E N +++
Sbjct: 132 EENRELREELAELKQENEALEAENERLQENEQRR 165
Score = 35.0 bits (81), Expect = 0.049
Identities = 24/96 (25%), Positives = 51/96 (53%), Gaps = 5/96 (5%)
Query: 124 DTVDRLQQE----EDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQ 179
+ + LQQE ++ +L +Q +L+QEN LK+++ LE ++E+ ++++ +K
Sbjct: 66 ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELA-RIKQLS 124
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQG 215
+L+ +EL + + +E +L A+ E Q
Sbjct: 125 ANAIELDEENRELREELAELKQENEALEAENERLQE 160
Score = 34.2 bits (79), Expect = 0.089
Identities = 26/96 (27%), Positives = 47/96 (48%), Gaps = 3/96 (3%)
Query: 175 LKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQ---IKELQARI 231
L Q+ +A+L+ EL++ + +E L +L + + + QK+ IK+L A
Sbjct: 68 LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANA 127
Query: 232 EELEEEVEAERQSRAKAEKQRADLARELEELGERLE 267
EL+EE R+ A+ +++ L E E L E +
Sbjct: 128 IELDEENRELREELAELKQENEALEAENERLQENEQ 163
Score = 30.4 bits (69), Expect = 1.7
Identities = 23/119 (19%), Positives = 54/119 (45%), Gaps = 18/119 (15%)
Query: 129 LQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERN 188
L AR +L + +++L + ++++ +++ L Q+ +L++
Sbjct: 58 LSDTPSARERLPELQQELAE---------------LQEELAELQEQLAELQQENQELKQE 102
Query: 189 KKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQ---KQIKELQARIEELEEEVEAERQS 244
LE ++R KE+A + + L +N+ +++ EL+ E LE E E +++
Sbjct: 103 LSTLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQEN 161
Score = 30.0 bits (68), Expect = 2.1
Identities = 18/87 (20%), Positives = 41/87 (47%), Gaps = 1/87 (1%)
Query: 399 ENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDN 458
+ +L E + ++++L + L QL + ++ E +E +TL + L+ ++
Sbjct: 60 DTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELAR 119
Query: 459 IREQLEEEAEGKADLQRQLSKANADAQ 485
I+ QL A + R+L + A+ +
Sbjct: 120 IK-QLSANAIELDEENRELREELAELK 145
Score = 28.4 bits (64), Expect = 7.4
Identities = 14/86 (16%), Positives = 32/86 (37%), Gaps = 4/86 (4%)
Query: 332 KAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDA 391
+ + E E + +L +E ++ L+ +L +Q +L + +
Sbjct: 70 ELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIE 129
Query: 392 AKKKLSIENSDLLRQLEEAESQVSQL 417
+ EN +L +L E + + L
Sbjct: 130 LDE----ENRELREELAELKQENEAL 151
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 37.9 bits (88), Expect = 0.016
Identities = 43/171 (25%), Positives = 77/171 (45%), Gaps = 6/171 (3%)
Query: 204 ASLTAKLEDEQGLVGKNQKQIKELQ--ARIEELEEEVEAERQSRAKAEKQRADLARELEE 261
ASL ++E+ L+ Q+Q + L R++EL++E +Q+ +A +L
Sbjct: 223 ASLQVLTDEEKQLLTAQQQQQQSLNWLTRLDELQQEASRRQQALQQALAAEEKAQPQLAA 282
Query: 262 LGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGE 321
L L + E +++ A ++ R+ +EE N + +ST+A + + A + E
Sbjct: 283 L--SLAQPARQLRPHWERIQEQSAALAHTRQQIEEVNTRLQSTMALRARIRHHAAKQSAE 340
Query: 322 QIDQLNKLKT-KAEHER-NAAHNETNNTRAAADQLAREKAATEKIAKQLQH 370
Q L T AEH+R +NE RA Q ++ + +QL H
Sbjct: 341 LQAQQQSLNTWLAEHDRFRQWNNELAGWRAQFSQQTSDREQLRQWQQQLTH 391
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 37.6 bits (88), Expect = 0.016
Identities = 35/144 (24%), Positives = 56/144 (38%), Gaps = 21/144 (14%)
Query: 121 YSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQE 180
S++ DRL+ ED L ++ LE L D E+ +V E L L
Sbjct: 755 GSEELEDRLEANEDDIKGLANLEE-LEI----LSPDPEEPPVEVTAVVGGAELFLPLAGL 809
Query: 181 AVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA 240
E + LE+ +++ +KEI + KL +E + EE VE
Sbjct: 810 IDLAAELAR--LEKELEKLEKEIDRIEKKLSNEGFVAKAP--------------EEVVEK 853
Query: 241 ERQSRAKAEKQRADLARELEELGE 264
E++ A+ + + A L L L
Sbjct: 854 EKEKLAEYQVKLAKLEERLAVLKA 877
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 37.7 bits (88), Expect = 0.016
Identities = 42/243 (17%), Positives = 89/243 (36%), Gaps = 33/243 (13%)
Query: 303 STLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAAT- 361
S L+ + K + D + E+ +QI L+ + + + R + K
Sbjct: 163 SVLSEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKY 222
Query: 362 ---EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 418
+ AK ++ ++ E+ +L + D AA KL+ + + +S++ Q
Sbjct: 223 DELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKI-------KSKIEQFQ 275
Query: 419 KIKI---------SLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAE- 468
K+ + T Q+ + + + L L+ ID + E ++E E
Sbjct: 276 KVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQ 335
Query: 469 --GKADLQRQLSKANADAQLWRT----------KYESEGVARAEELEESKSEERRVDDLA 516
+L+ ++S + ++E V AEEL + + E ++
Sbjct: 336 SKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTK 395
Query: 517 AEL 519
+EL
Sbjct: 396 SEL 398
Score = 35.0 bits (81), Expect = 0.11
Identities = 42/205 (20%), Positives = 88/205 (42%), Gaps = 22/205 (10%)
Query: 220 NQKQIKELQARIEELEEEVEAER--QSRAKAEKQRADLARELEELGERLEEAGGATSAQI 277
+QI+ L +I+ ++++++ + K ++AR+ + E +EEA
Sbjct: 179 LNQQIQTLDMKIDHIQQQIKTYNKNIEEQRK-KNGENIARKQNKYDELVEEA-------- 229
Query: 278 ELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHER 337
K +AE+ +L +L + E A L K N A +++ +I+Q K +
Sbjct: 230 ---KTIKAEIEELTDELLNLVMDIEDPSAAL-NKLNTAAAKIKSKIEQFQK-----VIKM 280
Query: 338 NAAHNETNN-TRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKL 396
T+ ++ R +K+ K+LQH L ++ +DE +++F+ KKL
Sbjct: 281 YEKGGVCPTCTQQISEGPDRITKIKDKL-KELQHSLEKLDTAIDELEEIMDEFNEQSKKL 339
Query: 397 SIENSDLLRQLEEAESQVSQLSKIK 421
+ + + + V + K+K
Sbjct: 340 LELKNKISTNKQSLITLVDKAKKVK 364
Score = 32.3 bits (74), Expect = 0.70
Identities = 41/228 (17%), Positives = 85/228 (37%), Gaps = 45/228 (19%)
Query: 110 TENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKR 169
T N + + + + R Q + D +L ++ K ++ E L ++ +L D+E
Sbjct: 199 TYNKNIEEQRKKNGENIARKQNKYD---ELVEEAKTIKAEIEELTDELLNLVMDIEDP-- 253
Query: 170 KIEGDL-KLTQEAV---ADLERNKKELE------------QTIQRKDKEIASLTAKLEDE 213
L KL A + +E+ +K ++ Q I I + KL++
Sbjct: 254 --SAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKEL 311
Query: 214 QGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGAT 273
Q + K I EL+ ++E E+ + + + K + L +++
Sbjct: 312 QHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDK------------ 359
Query: 274 SAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGE 321
KK +A + + L+ + + LA L+ + + V E
Sbjct: 360 ------AKKVKAAIEE----LQAEFVDNAEELAKLQDELDKIVKTKSE 397
Score = 31.1 bits (71), Expect = 1.6
Identities = 25/120 (20%), Positives = 51/120 (42%), Gaps = 14/120 (11%)
Query: 154 KKDIEDLEGDVEKQKRK---IEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKL 210
+ I + + K K K ++ L+ A+ +LE E+ + K++ L K+
Sbjct: 291 TQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELE----EIMDEFNEQSKKLLELKNKI 346
Query: 211 EDEQGLVGKNQKQIKELQARIEELE-------EEVEAERQSRAKAEKQRADLARELEELG 263
+ + + K+++A IEEL+ EE+ + K K +++L +E G
Sbjct: 347 STNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKEKYHRG 406
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 37.7 bits (87), Expect = 0.017
Identities = 62/327 (18%), Positives = 136/327 (41%), Gaps = 16/327 (4%)
Query: 121 YSQDTVDRLQQEEDARNQLFQQ-KKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQ 179
Y ++ + ED Q+ K E+ + DI +L+ + KI+ +K++Q
Sbjct: 236 YFTESYKSFLKLEDNYEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQ 295
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQ-------KQIKELQARIE 232
+ L + L+ + + + ++ K ++ G + K + ++IK LQ+ I+
Sbjct: 296 -KIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNID 354
Query: 233 ELEEEVEAERQSRAKAEKQ---RADLARELEELGERLEEAGGATSAQIELNKKREAEMSK 289
EL +++ + S + E R L REL+++ + ++ + ++ + + K
Sbjct: 355 ELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEK 414
Query: 290 LRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRA 349
R + + + + ND S + I+QL + + E N+
Sbjct: 415 TLRQYDSLIQNITRSRSQIGHNVND--SSLKINIEQLFPKGSGINESIKKSILELNDEIQ 472
Query: 350 AADQLAREKAAT-EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLR-QL 407
+ K+ T E+ K L+H +NE+ L++ L++ ++ + EN L Q
Sbjct: 473 ERIKTEENKSITLEEDIKNLKHDINELTQILEKLELELSEANSKFELSKEENERELVAQR 532
Query: 408 EEAESQVSQLSKIKISLTTQLEDTKRL 434
E E +L+ + + T + D ++L
Sbjct: 533 IEIEKLEKELNDLNLLSKTSILDAEQL 559
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 37.2 bits (87), Expect = 0.022
Identities = 33/257 (12%), Positives = 92/257 (35%), Gaps = 44/257 (17%)
Query: 3 TEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKI 62
E++ K ++ +L +++ ++ E +++ + K+L E+ ++
Sbjct: 84 LEELIKDVEEELEKIEKEIKELEEEISELENEIKEL-----------EQEIERLEPWGNF 132
Query: 63 KISLTTQLEDTKRL--------ADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSH 114
+ L+ L K + D+ +E + T K +
Sbjct: 133 DLDLS-LLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDK--GYVYVVVVVLKEL 189
Query: 115 SFSISLYSQDTVDRLQQEEDARNQ-------LFQQKKKLEQENGGLKKDIEDLEGDVEKQ 167
S D V+ ++ + +++++E ++K+ E L ++++
Sbjct: 190 S--------DEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKEL 241
Query: 168 KRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKEL 227
+K + L ++E + E K + ++ + +++ V K ++ I +
Sbjct: 242 AKKYL-EELLALYEYLEIELERAEALSKF-LKTDKTFAIEGWVPEDR--VKKLKELIDKA 297
Query: 228 QAR---IEELEEEVEAE 241
+E +E + E E
Sbjct: 298 TGGSAYVEFVEPDEEEE 314
Score = 36.8 bits (86), Expect = 0.031
Identities = 42/226 (18%), Positives = 81/226 (35%), Gaps = 50/226 (22%)
Query: 145 KLEQENGGLKKDIEDLEGDVEKQK----RKIEGDLKLTQEAVADLERNKKELEQTIQRKD 200
KL L + ++ L + K K + +K +E + D+E +++E+ I+ +
Sbjct: 47 KLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELE 106
Query: 201 KEIASLTAK--------------------LEDEQGL------VGKNQKQIKELQARIEEL 234
+EI+ L + L G VG + E +
Sbjct: 107 EEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESD- 165
Query: 235 EEEVEAERQSRAK-------AEKQRADLARELEELG-ERLEEAGGATSAQIELNKKRE-- 284
E VE + ++ ++ EL++LG ERLE T +++ K E
Sbjct: 166 VENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELE 225
Query: 285 ---AEMSKLRRDLEEANIQHESTLANLR------KKHNDAVSEMGE 321
E L +L+E ++ L L + +A+S+ +
Sbjct: 226 EIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLK 271
Score = 31.4 bits (72), Expect = 1.4
Identities = 31/180 (17%), Positives = 64/180 (35%), Gaps = 35/180 (19%)
Query: 357 EKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKL-------------SIENSDL 403
+ E++ K ++ +L +++ ++ E +++ + K+L ++ S L
Sbjct: 80 SVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLL 139
Query: 404 L-------------RQLEEAESQVSQLSKIKISLTTQLEDT------KRLADEEGRERAT 444
L E S + ++ T + K L+DE E
Sbjct: 140 LGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKK 199
Query: 445 LLGKFRNLE-HDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELE 503
L F LE + E + E E +++++ + + KY E +A E LE
Sbjct: 200 L--GFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLE 257
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 37.0 bits (86), Expect = 0.023
Identities = 50/233 (21%), Positives = 81/233 (34%), Gaps = 45/233 (19%)
Query: 213 EQGLVGKNQ----------KQIKELQARIEELEEEVEAERQSR-AKAEKQRADLARELEE 261
E+ L K Q +QIK + ++E E ++A +Q Q D E+ E
Sbjct: 149 EETLGSKRQDSDSAQRFIDEQIKTYEKKLEAAENRLKAFKQENGGILPDQEGDYYSEISE 208
Query: 262 LGERLEEAG----------GATSAQI-----ELNKKREAEMSKLRRDLEEANIQHESTLA 306
E LE A A Q+ L S+L +E Q ++
Sbjct: 209 AQEELEAARLELNEAIAQRDALKRQLGGEEPVLLAGSSVANSELDGRIEALEKQLDALRL 268
Query: 307 NLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAK 366
KH D V +I QL + K + +N + +
Sbjct: 269 RYTDKHPD-VIATKREIAQLEEQKEEEGSAKNGGPE--------------RGEIANPVYQ 313
Query: 367 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSK 419
QLQ +L E + ++ SL A LLR + E E++++QL++
Sbjct: 314 QLQIELAEAEAEIA----SLEARVAELTARIERLESLLRTIPEVEAELTQLNR 362
Score = 30.4 bits (69), Expect = 3.1
Identities = 43/219 (19%), Positives = 84/219 (38%), Gaps = 31/219 (14%)
Query: 2 ATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKK-LSIENSDLLRQLEEAESQVSQLS 60
A I +Q++ + KL+ L F L + D ++ EA+ ++
Sbjct: 162 AQRFIDEQIK----TYEKKLEAAENRLKAFKQENGGILPDQEGDYYSEISEAQEELEAA- 216
Query: 61 KIKI--------SLTTQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTEN 112
++++ +L QL + + SS EL G IEA+ K LD L +T +
Sbjct: 217 RLELNEAIAQRDALKRQLGGEEPVLLAGSSVANSEL-DGRIEALE-KQLDALRLRYT--D 272
Query: 113 SHSFSISLYSQ-DTVDRLQQEEDARNQLFQQKKK--------LEQENGGLKKDIEDLEGD 163
H I+ + ++ ++EE + ++ + L+ E + +I LE
Sbjct: 273 KHPDVIATKREIAQLEEQKEEEGSAKNGGPERGEIANPVYQQLQIELAEAEAEIASLEAR 332
Query: 164 VEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKE 202
V + +IE L + + + EL Q + +
Sbjct: 333 VAELTARIERLESLLRT----IPEVEAELTQLNRDYEVN 367
Score = 29.2 bits (66), Expect = 6.5
Identities = 33/213 (15%), Positives = 76/213 (35%), Gaps = 22/213 (10%)
Query: 107 NFTTENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEK 166
F EN D + + ++ + + + LK+ + E +
Sbjct: 186 AFKQENGGILPDQ--EGDYYSEISEAQEELEAARLELNEAIAQRDALKRQLGGEEPVLLA 243
Query: 167 QK----RKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQK 222
+++G ++ ++ + L + + +EIA L + E+E +
Sbjct: 244 GSSVANSELDGRIEALEKQLDALRLRYTDKHPDVIATKREIAQLEEQKEEEGSAKNGGPE 303
Query: 223 Q-------IKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGE------RLEEA 269
+ ++LQ + E E E+ + A+ + L L + E +L
Sbjct: 304 RGEIANPVYQQLQIELAEAEAEIASLEARVAELTARIERLESLLRTIPEVEAELTQLNRD 363
Query: 270 GGATSAQIE-LNKKRE-AEMSKLRRDLEEANIQ 300
+ E L +RE AE+SK + ++++ +
Sbjct: 364 YEVNKSNYEQLLTRRESAEVSK-QMEVQDKAVS 395
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 37.0 bits (86), Expect = 0.025
Identities = 52/276 (18%), Positives = 104/276 (37%), Gaps = 22/276 (7%)
Query: 32 DAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESSGW------V 85
+ K+L + + + +L + L ++ L+ D +
Sbjct: 174 EEDFKELQLNSGADIVRLLARFLGLGGLLAEELLSRAGLDKKVPAKDLFEEEIKKVREAL 233
Query: 86 EELTKGSIEAITLKSLD-LDVLNFTTENSHSFSISLYSQDTVDR---LQQEEDARNQLFQ 141
EEL K LDV+ + + +R Q + +L +
Sbjct: 234 EELLNPLKPNYYYKDEKYLDVVPLKAYADLEKLFNEALDEKFERDKIKQLASELEKKLEK 293
Query: 142 QKKKLEQENGGLKKDIEDLEGDVEKQKRK---IEGDLKLTQEA--VADLERNKKELEQTI 196
+ KKLE + + ++E+LE E+ ++K + +L+L +E L E I
Sbjct: 294 ELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGNEEIKI 353
Query: 197 QRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLA 256
+ + S A+ ++ K K L ++ EL+E A +S A ++ A+
Sbjct: 354 ELDKSKTPSENAQRYFKKYKKLKGAK--VNLDRQLSELKEA-IAYYESAKTALEK-AEGK 409
Query: 257 RELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 292
+ +EE+ E L E G + + K+++ E + R
Sbjct: 410 KAIEEIREELIEEG---LLKSKKKKRKKKEWFEKFR 442
Score = 37.0 bits (86), Expect = 0.025
Identities = 55/279 (19%), Positives = 90/279 (32%), Gaps = 24/279 (8%)
Query: 206 LTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRA---KAEKQRADLARELEEL 262
ED + L + I L AR L + E SRA K + E++++
Sbjct: 170 YEQSEEDFKELQLNSGADIVRLLARFLGLGGLLAEELLSRAGLDKKVPAKDLFEEEIKKV 229
Query: 263 GERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQ 322
E LEE +D + ++ A+L K N+A+ E E+
Sbjct: 230 REALEELLNPLKPNYYY------------KDEKYLDVVPLKAYADLEKLFNEALDEKFER 277
Query: 323 IDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDET 382
K+K A E ++ E EK A++L+ + + L
Sbjct: 278 D----KIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLI 333
Query: 383 NRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRER 442
L A + E L + + + K L + R E
Sbjct: 334 EEGLKSVRLADFYGNEEIKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAI 393
Query: 443 ATLLGKFRNLE-----HDIDNIREQLEEEAEGKADLQRQ 476
A LE I+ IRE+L EE K+ +++
Sbjct: 394 AYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKR 432
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 37.0 bits (86), Expect = 0.027
Identities = 30/125 (24%), Positives = 59/125 (47%), Gaps = 14/125 (11%)
Query: 179 QEAVADL--ERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELE- 235
EA++ + E +E E T + + +EI +++ + V + +++ EL+ +EEL+
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKR 450
Query: 236 --EEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIEL-NKKREAEMSKLRR 292
E++E+E + + + + RE+ R+E + EL KK+ E L R
Sbjct: 451 EIEKLESELERFRREVRDKVRKDREIRARDRRIER------LEKELEEKKKRVEE--LER 502
Query: 293 DLEEA 297
L E
Sbjct: 503 KLAEL 507
Score = 35.5 bits (82), Expect = 0.082
Identities = 35/157 (22%), Positives = 70/157 (44%), Gaps = 35/157 (22%)
Query: 142 QKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDK 201
+ +L++E LK++IE LE ++E+ +R++ ++ +E I+ +D+
Sbjct: 437 ENSELKRELEELKREIEKLESELERFRREVRDKVRKDRE---------------IRARDR 481
Query: 202 EIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEE 261
I L +LE++ +K+++EL+ ++ EL + + E + K L E E
Sbjct: 482 RIERLEKELEEK-------KKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIE 534
Query: 262 LGERL-------------EEAGGATSAQIELNKKREA 285
E GGA +A+ ++KK A
Sbjct: 535 EAEEEYGIKEGDVILVEDPSGGGARTAEELIDKKPRA 571
Score = 30.4 bits (69), Expect = 2.8
Identities = 25/147 (17%), Positives = 58/147 (39%), Gaps = 3/147 (2%)
Query: 231 IEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 290
+ E +V+ E + R K + + RE+ +R+++ E N + + E+ +L
Sbjct: 390 LAEALSKVKEEERPREKEGTEEEE-RREITVYEKRIKKLEETVERLEEENSELKRELEEL 448
Query: 291 RRDLEEANIQHESTLANLRKKH--NDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTR 348
+R++E+ + E +R K + + +I++L K + + + R
Sbjct: 449 KREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELR 508
Query: 349 AAADQLAREKAATEKIAKQLQHQLNEV 375
K K+ ++L + E
Sbjct: 509 KMRKLELSGKGTPVKVVEKLTLEAIEE 535
Score = 28.9 bits (65), Expect = 8.2
Identities = 22/97 (22%), Positives = 51/97 (52%), Gaps = 11/97 (11%)
Query: 384 RSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERA 443
+ + + ++L ENS+L R+LEE + ++ +L +LE +R ++ R+
Sbjct: 422 KRIKKLEETVERLEEENSELKRELEELKREIEKLES-------ELERFRREVRDKVRKDR 474
Query: 444 TLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKA 480
+ R + I+ + ++LEE+ + +L+R+L++
Sbjct: 475 ----EIRARDRRIERLEKELEEKKKRVEELERKLAEL 507
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 36.9 bits (86), Expect = 0.030
Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 1/87 (1%)
Query: 153 LKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERN-KKELEQTIQRKDKEIASLTAKLE 211
+ I+ + + + KI+ K +A L + EL++ I KE L +
Sbjct: 366 INDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAIN 425
Query: 212 DEQGLVGKNQKQIKELQARIEELEEEV 238
+ + + + +IK L+ I+ELE+++
Sbjct: 426 SLEKEIKQLEAEIKALEKEIKELEKQL 452
Score = 35.0 bits (81), Expect = 0.12
Identities = 24/143 (16%), Positives = 55/143 (38%), Gaps = 14/143 (9%)
Query: 131 QEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLT---------QEA 181
E ++ ++L+ L++ +E +E++ + ++L +
Sbjct: 310 DTEKENSEFKLDVEELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDI 369
Query: 182 VADLERNKKELEQTIQRKDKEIASLTAKLE-----DEQGLVGKNQKQIKELQARIEELEE 236
+ + +E + I KE KL + + + QK+ K L+ I LE+
Sbjct: 370 IDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEK 429
Query: 237 EVEAERQSRAKAEKQRADLAREL 259
E++ EK+ +L ++L
Sbjct: 430 EIKQLEAEIKALEKEIKELEKQL 452
Score = 33.1 bits (76), Expect = 0.48
Identities = 36/203 (17%), Positives = 88/203 (43%), Gaps = 18/203 (8%)
Query: 8 KQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLT 67
++L QL E+ K + + + ENS+ +EE ++ + L +I
Sbjct: 281 QELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNL 340
Query: 68 TQLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVD 127
+LE+ + S+ E IE+I ++ L I +++ +
Sbjct: 341 QKLEEKL---KDPSTSIELESITDLIESINDIIDAINEL-----------IREHNEKIDN 386
Query: 128 RLQQEEDARNQLFQQ-KKKLEQENGGLKKDIEDLEGDV---EKQKRKIEGDLKLTQEAVA 183
+++ A+ +L+ +L+++ +K+ + LE + EK+ +++E ++K ++ +
Sbjct: 387 LKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIK 446
Query: 184 DLERNKKELEQTIQRKDKEIASL 206
+LE+ +E T +K + +
Sbjct: 447 ELEKQLTNIEPTADEINKLLKAY 469
Score = 32.3 bits (74), Expect = 0.78
Identities = 33/190 (17%), Positives = 74/190 (38%), Gaps = 12/190 (6%)
Query: 179 QEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEV 238
E +L +EL + ++ + D + + + ++EL+A +E LEE +
Sbjct: 277 DEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEIL 336
Query: 239 EAERQSRAKAEKQRADLARELEELGERLEEAGGATSA---QIELNKKR----EAEMSKLR 291
E Q + + + + + ELE + + +E A I + ++ + E +K +
Sbjct: 337 EKNLQ-KLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAK 395
Query: 292 RDLEEANIQHESTLANLRKKH----NDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNT 347
+ L + + +K A++ + ++I QL E E + N
Sbjct: 396 KKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQLTNI 455
Query: 348 RAAADQLARE 357
AD++ +
Sbjct: 456 EPTADEINKL 465
Score = 30.7 bits (70), Expect = 2.2
Identities = 23/90 (25%), Positives = 50/90 (55%), Gaps = 5/90 (5%)
Query: 154 KKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLE---RNKKELEQTIQRKDKEIASLTAKL 210
+ I++L+ + K K+K+ L L E D++ + KK LE+ I +KEI L A++
Sbjct: 381 NEKIDNLKKEKNKAKKKLW--LHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEI 438
Query: 211 EDEQGLVGKNQKQIKELQARIEELEEEVEA 240
+ + + + +KQ+ ++ +E+ + ++A
Sbjct: 439 KALEKEIKELEKQLTNIEPTADEINKLLKA 468
>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein. A number of members of this
family are annotated as being transmembrane proteins
induced by tumour necrosis factor alpha, but no
literature was found to support this.
Length = 330
Score = 36.3 bits (84), Expect = 0.032
Identities = 26/91 (28%), Positives = 49/91 (53%), Gaps = 3/91 (3%)
Query: 145 KLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIA 204
+ E+E L+K+ + L+ K+K+E KL ++ + + R +K L+Q I K
Sbjct: 1 ECEEEWKELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRLKQLIVSLKKLKK 60
Query: 205 SLTAKLEDEQGLVGKNQKQIKELQARIEELE 235
SLT ++ LV + ++QIKE + + ++E
Sbjct: 61 SLT---PEDSELVEQLEEQIKERKNQFFDME 88
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 36.4 bits (84), Expect = 0.035
Identities = 46/201 (22%), Positives = 78/201 (38%), Gaps = 6/201 (2%)
Query: 164 VEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQ-- 221
+ +++ + K QE LE+ +E E+ + L + E+ Q
Sbjct: 52 ANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAA 111
Query: 222 KQIKELQARIEELEEEVEAERQSR--AKAEKQRADLARELEELGERLEEAGGATSAQIEL 279
KQ +E Q + EE + + AE +++ A+AEK+ + A++ E + + A A E
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEA 171
Query: 280 NKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNA 339
KK EAE + +A + E A A +E + + AE ER A
Sbjct: 172 KKKAEAEAKA--KAEAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKA 229
Query: 340 AHNETNNTRAAADQLAREKAA 360
E + A EK
Sbjct: 230 DEAELGDIFGLASGSNAEKQG 250
>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390). This is a
family of long proteins currently only found in the rice
genome. They have no known function. However they may be
some kind of transposable element.
Length = 843
Score = 36.7 bits (84), Expect = 0.037
Identities = 73/259 (28%), Positives = 108/259 (41%), Gaps = 27/259 (10%)
Query: 274 SAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKA 333
SAQ+E + AE+ +EE E+ + RK H SE+ + L ++ +
Sbjct: 403 SAQMEALQAERAELDAAWARVEEGRRSVEAMVEAGRKAHRRHTSELEARKKDLAEIAKEV 462
Query: 334 EHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAK 393
E ER AA T A D L + + +L+ +L+ QG LD AA+
Sbjct: 463 EEEREAALIATTVLNEAQDDLRLQYGSRA---AELEKKLDAAQGVLDAA--------AAR 511
Query: 394 KKLSIENSDLLRQLEEA-ESQVSQLSKIKISLTTQLED-----TKRLADEEGRERATLLG 447
++ + EN RQ EEA E++ L + + L D R A E A
Sbjct: 512 EQRAAENEAASRQREEALEARAMALEERACAKERDLADREAAVAIREATLAAHEAACAEE 571
Query: 448 KFR-NLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESK 506
+F L D RE+ EEAE A + AD+ R + E R LE ++
Sbjct: 572 EFALRLREDALTERERALEEAEAAA-------QQLADSLFLREAAQEEQARR--NLEGAR 622
Query: 507 SEERRVDDLAAELDASQKE 525
+E +D AAEL+A KE
Sbjct: 623 AERAALDQRAAELEARAKE 641
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 36.6 bits (84), Expect = 0.043
Identities = 34/170 (20%), Positives = 82/170 (48%), Gaps = 10/170 (5%)
Query: 157 IEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGL 216
+E + +++ +RKI Q + DL+R +++ Q Q K E+ ++ +K+E + L
Sbjct: 794 MERFQMELKDVERKIAQQAAKLQGS--DLDRTVQQVNQEKQEKQHELDTVVSKIELNRKL 851
Query: 217 VGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQ 276
+ Q+QI+ L+++ EL ++E+ ++R +L EL ++ +
Sbjct: 852 IQDQQEQIQHLKSKTNEL----KSEKLQIGTNLQRRQQFEEQLVELSTEVQSL----IRE 903
Query: 277 IELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQL 326
I+ K++++ + ++ + S+ KK D V+++ E++ +
Sbjct: 904 IKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNI 953
Score = 33.9 bits (77), Expect = 0.28
Identities = 37/225 (16%), Positives = 90/225 (40%), Gaps = 5/225 (2%)
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAE 249
K E +++K+K + Q ++ +K+I EL+ +++++ +++ + + E
Sbjct: 712 KSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQE 771
Query: 250 KQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLR 309
+ E E L + Q+EL K E ++++ L+ +++ N
Sbjct: 772 TLLGTIMPEEESAKVCLTDVTIMERFQMEL-KDVERKIAQQAAKLQGSDLDRTVQQVNQE 830
Query: 310 KKHND-AVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQL 368
K+ + + +I+ KL + + ++TN ++ Q+ ++ +QL
Sbjct: 831 KQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQL 890
Query: 369 QHQLNEVQGKLDETNRSLND---FDAAKKKLSIENSDLLRQLEEA 410
EVQ + E + + +K E +L+ E +
Sbjct: 891 VELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETS 935
Score = 33.1 bits (75), Expect = 0.47
Identities = 75/440 (17%), Positives = 165/440 (37%), Gaps = 17/440 (3%)
Query: 130 QQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNK 189
+ E+ + L + K++E + K +++ ++K+ + + +L + +
Sbjct: 241 KSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTD 300
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAE 249
++L + + +L D Q + K K+ + L EL E + + +
Sbjct: 301 EQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQ 360
Query: 250 KQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLR 309
+ ++ L RLE G E K + R++ E A+L+
Sbjct: 361 EHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQ--LCADLQ 418
Query: 310 KKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQ 369
K + E D+ L E ++ + + +L + + ++++I + Q
Sbjct: 419 SKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQ 478
Query: 370 HQLN-EVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQL 428
E + E N K L E +DL R+L + + ++ QL+ + T
Sbjct: 479 ELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQME 538
Query: 429 EDTKRLADEEGR----------ERATLLGKFRN---LEHDIDNIREQLEEEAEGKADLQR 475
TK D++ + E +LLG F N LE + + +++ + + A L +
Sbjct: 539 MLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNK 598
Query: 476 QLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFR 535
+L+ + + ES+ + E+ + D ++L+ ++E S +
Sbjct: 599 ELASLEQNKNHINNELESK-EEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAM 657
Query: 536 LKGAYEEAQEQLEAVRRENK 555
L GA + + + EN+
Sbjct: 658 LAGATAVYSQFITQLTDENQ 677
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 36.2 bits (84), Expect = 0.044
Identities = 21/115 (18%), Positives = 39/115 (33%), Gaps = 9/115 (7%)
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAE 249
KEL + + KE+ LT + + + + L+ I++L+ E++ E
Sbjct: 2 KELREALAELAKELRKLTEDEKLA--EAEEEKAEYDALKEEIDKLDAEIDR-------LE 52
Query: 250 KQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHEST 304
+ +L + GE A+ E + R EE T
Sbjct: 53 ELLDELEAKPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERKALSTGT 107
Score = 28.5 bits (64), Expect = 9.3
Identities = 23/122 (18%), Positives = 41/122 (33%), Gaps = 12/122 (9%)
Query: 153 LKKDIEDLEGDVEKQKRKIEGDL-KLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLE 211
L++ + +L ++ K + + + L+ +L+ I R ++ + L AK
Sbjct: 4 LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPA 63
Query: 212 DEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGG 271
E EE EE + RA LA E + L + GG
Sbjct: 64 AS-----------GEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERKALSTGTDSDGG 112
Query: 272 AT 273
Sbjct: 113 YL 114
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 36.3 bits (84), Expect = 0.044
Identities = 31/218 (14%), Positives = 78/218 (35%), Gaps = 2/218 (0%)
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
+L +N KE Q + IA++ L + V + + + +E +
Sbjct: 740 ELLQNIKEKLQAADDLRQRIAAMERDLARFEEEVEALAEAVA--PEMLGTPADETARALK 797
Query: 244 SRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHES 303
R K + A A +L E E E+ +A ++ + R + + R +
Sbjct: 798 QRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAV 857
Query: 304 TLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEK 363
++ ++ ++ + + + + AA + + A ++LAR+ E+
Sbjct: 858 ERSDTYRELRKRIAALERTLVRAGGGLSLEALVAEAAALDPDELPARLEELARDIEELEE 917
Query: 364 IAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENS 401
+L ++ + +L + + ++ S+
Sbjct: 918 ELNELAQEVGAAKQELARMDGGSTAAELEAERESLLAQ 955
Score = 33.6 bits (77), Expect = 0.30
Identities = 64/369 (17%), Positives = 122/369 (33%), Gaps = 33/369 (8%)
Query: 196 IQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADL 255
I KE+ L A++ + Q K+L ++E EEE+ R+ + EK++ L
Sbjct: 152 INVALKELKELEAEIREVQLKTR----TWKDLVKALDEAEEELANLRKELRQLEKEKQRL 207
Query: 256 ARE---LEELGER--LEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRK 310
R L L ER LE+ A I+L + R +L A E L
Sbjct: 208 ERLRRLLPLLAERKALEQQLAALGEVIDLPPDAVERYEEARAELRAARRNLELLTERLEA 267
Query: 311 KHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQH 370
++D+++ + AA + L +++ + L
Sbjct: 268 LQ--------AELDEISLDEELLAQA------------AAIEALHQQRGEYRNAEQDLPD 307
Query: 371 QLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLED 430
+ E+ + L E + LR A+ V++L K K +L L+
Sbjct: 308 REGEIANAREAAAALLAQIGPD---ADEEAVESLRPSLAAKETVTELEKRKEALDQALKS 364
Query: 431 TKRLADEEGRERATLLGKFRNLEHDIDNIR-EQLEEEAEGKADLQRQLSKANADAQLWRT 489
+ +E RE + + L +A D+ L+ + + +
Sbjct: 365 ARDALEERERELKQVRAQLAALPTVTVPEPLRAALADALRAGDIDAALAALEQEVAVAKR 424
Query: 490 KYESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEA 549
+ + + A ++ A KE + + E +E LE
Sbjct: 425 ELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEEDLET 484
Query: 550 VRRENKNLA 558
+ + ++L
Sbjct: 485 LELQLRHLD 493
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome
partitioning].
Length = 1480
Score = 36.0 bits (83), Expect = 0.056
Identities = 36/161 (22%), Positives = 73/161 (45%), Gaps = 11/161 (6%)
Query: 92 SIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDR---LQQEEDARNQLFQQKKKLEQ 148
++EAI + D D+ + ++ + Y + +R L Q + R +L+ +++L
Sbjct: 245 TLEAIRVTQSDRDLFKHLISEATNYVAADYMRHANERRVHLDQALEFRRELYTSRQQLAA 304
Query: 149 ENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVAD---LERNKKELEQTIQRKDKEIAS 205
E +L EGDL+ +A +D L + ++ I+R ++
Sbjct: 305 EQYRHVDMSRELAELNGA-----EGDLEADYQAASDHLNLVQTALRQQEKIERYQADLEE 359
Query: 206 LTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRA 246
LT +LE++ +V + ++ +E +AR E E EV+ + A
Sbjct: 360 LTIRLEEQNEVVEEANERQEENEARAEAAELEVDELKSQLA 400
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 35.8 bits (83), Expect = 0.064
Identities = 35/147 (23%), Positives = 64/147 (43%), Gaps = 21/147 (14%)
Query: 127 DRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVA--- 183
R + Q K+LE E GL+ ++ L +E ++E KL ++ +
Sbjct: 137 SRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRL 196
Query: 184 ---DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEE-VE 239
+LER + E + E+ L A+LE ++QI ELQ +++E+ E
Sbjct: 197 ELLELERERAEAQ-------GELGRLEAELE-------VLKRQIDELQLERQQIEQTFRE 242
Query: 240 AERQSRAKAEKQRADLARELEELGERL 266
+ +A+ + A+L L + +RL
Sbjct: 243 EVLEELTEAQARLAELRERLNKARDRL 269
Score = 30.4 bits (69), Expect = 2.4
Identities = 31/140 (22%), Positives = 67/140 (47%), Gaps = 7/140 (5%)
Query: 200 DKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLAREL 259
+ + ++ + + QIK+L+A + L+ +++A RQ + + + + R+L
Sbjct: 129 KGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQ-QLEVISEELEARRKL 187
Query: 260 EELG-----ERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHND 314
+E G E LE AQ EL + EAE+ L+R ++E ++ + R++ +
Sbjct: 188 KEKGLVSRLELLELERERAEAQGELGRL-EAELEVLKRQIDELQLERQQIEQTFREEVLE 246
Query: 315 AVSEMGEQIDQLNKLKTKAE 334
++E ++ +L + KA
Sbjct: 247 ELTEAQARLAELRERLNKAR 266
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 35.6 bits (82), Expect = 0.074
Identities = 26/127 (20%), Positives = 54/127 (42%), Gaps = 12/127 (9%)
Query: 164 VEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQ 223
+E+ K + + L +KELEQ + L L++++ L + +++
Sbjct: 499 IEQAKTFYGEFKEEINVLIEKLSALEKELEQ-------KNEHLEKLLKEQEKLKKELEQE 551
Query: 224 IKELQARIEELEEEVEAE-RQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKK 282
++EL+ R + E+E E +++ +K+ + REL+E + IE K
Sbjct: 552 MEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKE----IKSIEDLVK 607
Query: 283 REAEMSK 289
+ K
Sbjct: 608 LKETKQK 614
Score = 30.2 bits (68), Expect = 3.8
Identities = 25/99 (25%), Positives = 39/99 (39%), Gaps = 4/99 (4%)
Query: 285 AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNET 344
++S L ++LE+ N E L K + EM E ++ K + E E A
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKAL 577
Query: 345 NNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETN 383
+ + +EK KI K + + E KL ET
Sbjct: 578 KKEVESIIRELKEK----KIHKAKEIKSIEDLVKLKETK 612
Score = 29.4 bits (66), Expect = 5.5
Identities = 26/111 (23%), Positives = 55/111 (49%), Gaps = 6/111 (5%)
Query: 151 GGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKL 210
G K++I L + ++++E + ++ + + E+ KKELEQ ++ + + +L
Sbjct: 507 GEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLEL 566
Query: 211 EDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEE 261
E E Q+ +K L+ +E + E++ ++ +AK K DL + E
Sbjct: 567 EKEA------QEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKET 611
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 35.0 bits (81), Expect = 0.079
Identities = 25/128 (19%), Positives = 62/128 (48%), Gaps = 15/128 (11%)
Query: 179 QEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGL--VGKNQKQIKELQARIEELEE 236
+ +A L +EL+Q ++ + + KL + V + ++++ +++ARI +L+E
Sbjct: 26 RLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKE 85
Query: 237 EVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEE 296
E+E +R+ + ++ A +L SA +L K+R +++ KL+ +++
Sbjct: 86 EIEQKRERIEELKRALAQRRSDLS-------------SASYQLEKRRASQLEKLQDEIKR 132
Query: 297 ANIQHEST 304
+ +
Sbjct: 133 TRSKLNAL 140
Score = 29.3 bits (66), Expect = 5.8
Identities = 27/123 (21%), Positives = 53/123 (43%), Gaps = 6/123 (4%)
Query: 221 QKQIKELQARIEELEEEVEAE-RQSRAKAEKQRADLAR-ELEELGERLEEAGGATSAQIE 278
+ + L EEL+++VE + + K ADL + E+ ERL + S E
Sbjct: 26 RLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKE 85
Query: 279 LNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQI----DQLNKLKTKAE 334
+++ + +L+R L + S L K+ + ++ ++I +LN L +
Sbjct: 86 EIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNALHSLLA 145
Query: 335 HER 337
+R
Sbjct: 146 EKR 148
>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
Length = 850
Score = 35.4 bits (82), Expect = 0.087
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 25/108 (23%)
Query: 461 EQLEEEAEGKADLQRQLSKANADAQLWRTKY----------ESEGVARAEELEESKSEER 510
E LE+EA G+ + Q+ KA A+ Q RTK ES G +RAE +++E R
Sbjct: 673 ELLEQEARGRLERQKMHDKAKAEEQ--RTKLLELQAESAAVESSGQSRAE--ALAEAEAR 728
Query: 511 RVDDLAAELDASQKECRNYS----TELFRLKG------AYEEAQEQLE 548
+ + AE++ ++ + EL +L+ YE+AQ +LE
Sbjct: 729 LI-EAEAEVEQAELRAKALRIEAEAELEKLRKRQELELEYEQAQNELE 775
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 34.8 bits (80), Expect = 0.10
Identities = 20/93 (21%), Positives = 45/93 (48%), Gaps = 4/93 (4%)
Query: 157 IEDLEGDVEKQKRKIEGDLKLTQEAV-ADLERNKKELEQTIQRK--DKEIASLTAKLE-D 212
+E LE DVE+ ++ I + + +E + ++E+ KE E +R ++ + L E
Sbjct: 363 LERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENF 422
Query: 213 EQGLVGKNQKQIKELQARIEELEEEVEAERQSR 245
+ L + ++ +++L+ E+E+ R
Sbjct: 423 LKELSKEEKELLEKLKMEASEVEKLFGRALPVR 455
Score = 31.7 bits (72), Expect = 0.96
Identities = 18/80 (22%), Positives = 39/80 (48%), Gaps = 3/80 (3%)
Query: 144 KKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLK--LTQEAVAD-LERNKKELEQTIQRKD 200
K +E+ + K+ +E ++EK+ K+ + + + A+ + L+ +++ E ++
Sbjct: 368 KDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKELS 427
Query: 201 KEIASLTAKLEDEQGLVGKN 220
KE L KL+ E V K
Sbjct: 428 KEEKELLEKLKMEASEVEKL 447
Score = 30.6 bits (69), Expect = 2.7
Identities = 24/94 (25%), Positives = 47/94 (50%), Gaps = 7/94 (7%)
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQK---QIKELQARIEELEE 236
+ LE++ +E E+TI ++ ++I +LE E +GK ++ + L+ ++ELE+
Sbjct: 361 PLLERLEKDVEEGEKTIVKEARQIE---EELEKEVEKLGKEEESLFKRVALEEGLKELEQ 417
Query: 237 EVEAERQSRAKAEK-QRADLARELEELGERLEEA 269
+ E + +K EK L E E+ + A
Sbjct: 418 DEENFLKELSKEEKELLEKLKMEASEVEKLFGRA 451
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 34.6 bits (80), Expect = 0.12
Identities = 21/107 (19%), Positives = 52/107 (48%), Gaps = 11/107 (10%)
Query: 133 EDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKEL 192
+ RN K+ L+ + D+E L ++ +++K+ +++ Q +RN EL
Sbjct: 5 KLLRNNPDLVKESLKARGLSVDIDLEKLI-ALDDERKKLLSEIEELQA-----KRN--EL 56
Query: 193 EQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVE 239
+ I + + +++ E + + ++++ EL A ++ LE E++
Sbjct: 57 SKQIGKAKGQKKDKIEEIKKE---LKELKEELTELSAALKALEAELQ 100
Score = 30.0 bits (68), Expect = 3.2
Identities = 17/76 (22%), Positives = 38/76 (50%), Gaps = 5/76 (6%)
Query: 191 ELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQAR----IEELEEEVEAERQSRA 246
+LE+ I D+ L+ ++E+ Q + KQI + + + IEE+++E++ ++
Sbjct: 28 DLEKLIALDDERKKLLS-EIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELT 86
Query: 247 KAEKQRADLARELEEL 262
+ L EL++
Sbjct: 87 ELSAALKALEAELQDK 102
>gnl|CDD|182106 PRK09841, PRK09841, cryptic autophosphorylating protein tyrosine
kinase Etk; Provisional.
Length = 726
Score = 34.5 bits (79), Expect = 0.16
Identities = 38/156 (24%), Positives = 75/156 (48%), Gaps = 14/156 (8%)
Query: 354 LAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKK--LSIENSDLLRQLEEAE 411
+AR+ A + + LQ QL EV+ +LD+ LN + + L++E +L Q+ +
Sbjct: 258 IARQAAQDSQSLEFLQRQLPEVRSELDQAEEKLNVYRQQRDSVDLNLEAKAVLEQIVNVD 317
Query: 412 SQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKA 471
+Q+++ LT + + +L ++ LL K + LE + R++L +
Sbjct: 318 NQLNE-------LTFREAEISQLYKKDHPTYRALLEKRQTLEQE----RKRLNKRVSAMP 366
Query: 472 DLQRQLSKANADAQLWRTKYESEGVARAEELEESKS 507
Q+++ + + D + R Y + + R +EL SKS
Sbjct: 367 STQQEVLRLSRDVEAGRAVYL-QLLNRQQELSISKS 401
Score = 29.9 bits (67), Expect = 4.0
Identities = 16/53 (30%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 10 LQHQLNEVQGKLDETNRSLNDFDAAKKK--LSIENSDLLRQLEEAESQVSQLS 60
LQ QL EV+ +LD+ LN + + L++E +L Q+ ++Q+++L+
Sbjct: 272 LQRQLPEVRSELDQAEEKLNVYRQQRDSVDLNLEAKAVLEQIVNVDNQLNELT 324
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 33.9 bits (78), Expect = 0.16
Identities = 37/142 (26%), Positives = 68/142 (47%), Gaps = 16/142 (11%)
Query: 144 KKLEQENGGLKKDIEDLE---GDVEKQKRKIEGDLKLTQEAVADLE---------RNKKE 191
KK + E L K +E LE D+E Q ++E +++ +E + E R +
Sbjct: 34 KKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRA 93
Query: 192 LEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQS----RAK 247
L IQ + I SL +L + + K +K+I++L+ R+E LE+ + A+
Sbjct: 94 LNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAE 153
Query: 248 AEKQRADLARELEELGERLEEA 269
++ +L+ + EEL E+L+
Sbjct: 154 IREEGQELSSKREELKEKLDPE 175
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 34.5 bits (80), Expect = 0.17
Identities = 17/100 (17%), Positives = 38/100 (38%), Gaps = 7/100 (7%)
Query: 225 KELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKRE 284
++ + + L++EV +Q Q ARE + E A L + E
Sbjct: 138 EDPENLLHALQQEVLTLKQ-------QLELQAREKAQSQALAEAQQQELVALEGLAAELE 190
Query: 285 AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQID 324
+ +L LE+ + T ++K + + ++++
Sbjct: 191 EKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKRLE 230
Score = 33.0 bits (76), Expect = 0.44
Identities = 21/110 (19%), Positives = 48/110 (43%), Gaps = 17/110 (15%)
Query: 177 LTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEE 236
L ++ K++LE + K + A A+ ++ L + EL+ + +ELE
Sbjct: 143 LLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVAL----EGLAAELEEKQQELEA 198
Query: 237 EVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 286
++E ++ A+ ++R ++ +E + ++EL+ E E
Sbjct: 199 QLEQLQEKAAETSQERK----------QKRKEITDQAAKRLELS---EEE 235
Score = 31.5 bits (72), Expect = 1.4
Identities = 18/78 (23%), Positives = 34/78 (43%), Gaps = 3/78 (3%)
Query: 223 QIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLE---EAGGATSAQIEL 279
Q++ + + EA++Q E A+L + +EL +LE E TS + +
Sbjct: 157 QLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQ 216
Query: 280 NKKREAEMSKLRRDLEEA 297
+K + + R +L E
Sbjct: 217 KRKEITDQAAKRLELSEE 234
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 34.3 bits (79), Expect = 0.17
Identities = 25/145 (17%), Positives = 58/145 (40%), Gaps = 12/145 (8%)
Query: 166 KQKRKIEGDLKLTQEAVADLERNKKELEQTIQR-----KDKEIASLTAKLEDEQGLVGKN 220
K++R + ++ QEA + E + + ++++ + E+ Q +
Sbjct: 27 KRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERL 86
Query: 221 QKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQI--- 277
++ ++L AR E+L+ + + +L ++L L G T Q
Sbjct: 87 VQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKL 146
Query: 278 ---ELNKKREAEMSKL-RRDLEEAN 298
L+ + E E ++ ++ EEA+
Sbjct: 147 LLKLLDAELEEEKAQRVKKIEEEAD 171
Score = 33.1 bits (76), Expect = 0.42
Identities = 24/96 (25%), Positives = 47/96 (48%), Gaps = 1/96 (1%)
Query: 461 EQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAELD 520
E++ +EA+ +A+ + + + A L R + + AR E EE + EE R+ +LD
Sbjct: 36 ERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARRER-EELQREEERLVQKEEQLD 94
Query: 521 ASQKECRNYSTELFRLKGAYEEAQEQLEAVRRENKN 556
A ++ N +L + A + +LE + ++ N
Sbjct: 95 ARAEKLDNLENQLEEREKALSARELELEELEKQLDN 130
>gnl|CDD|235580 PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase
subunit alpha; Validated.
Length = 319
Score = 34.0 bits (79), Expect = 0.18
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 10/48 (20%)
Query: 222 KQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEA 269
K I EL+A+IEEL RA AE DL+ E+E L ++LEE
Sbjct: 10 KPIAELEAKIEEL----------RAVAEDSDVDLSEEIERLEKKLEEL 47
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 34.3 bits (78), Expect = 0.20
Identities = 47/172 (27%), Positives = 78/172 (45%), Gaps = 21/172 (12%)
Query: 192 LEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQ 251
L Q + +++ L E+ + L + +Q L+A+ EELE AE+ RA+AE
Sbjct: 61 LSQQAELISRQLQELRRLEEEVRSLRETSLQQKMRLEAQAEELEALAVAEKAGRAEAEGL 120
Query: 252 RADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKK 311
RA LA EE+ + LEE G +++LEEA H+ L++L +
Sbjct: 121 RAALAGA-EEVRKNLEEGG--------------------QQELEEAQRLHQEQLSSLTQA 159
Query: 312 HNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEK 363
H +A+S + + + L K E R +A AD L + + T++
Sbjct: 160 HLEALSSLRSKAEGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQE 211
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 34.0 bits (78), Expect = 0.22
Identities = 14/56 (25%), Positives = 24/56 (42%)
Query: 220 NQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSA 275
Q E+Q +++++ E R+ KQR D R +E+LG+ A
Sbjct: 67 RQHATTEMQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKA 122
Score = 30.6 bits (69), Expect = 2.8
Identities = 13/51 (25%), Positives = 27/51 (52%)
Query: 221 QKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGG 271
QKQ +E++ ++ L ++ +++ K + A LA +++ LG A G
Sbjct: 82 QKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATG 132
Score = 29.4 bits (66), Expect = 5.0
Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 2/83 (2%)
Query: 328 KLKTKAEHERNAAHNETNNTRAAADQLAREKAATEK--IAKQLQHQLNEVQGKLDETNRS 385
+A E+ + T D R+ A TE A Q+Q Q E++ +LD N+
Sbjct: 39 SGNGEAVAEQEPVPDMTGVVDTTFDDKVRQHATTEMQVTAAQMQKQYEEIRRELDVLNKQ 98
Query: 386 LNDFDAAKKKLSIENSDLLRQLE 408
D +KL +N+ L Q++
Sbjct: 99 RGDDQRRIEKLGQDNAALAEQVK 121
>gnl|CDD|219412 pfam07439, DUF1515, Protein of unknown function (DUF1515). This
family consists of several hypothetical bacterial
proteins of around 130 residues in length. Members of
this family seem to be found exclusively in Rhizobium
species. The function of this family is unknown.
Length = 112
Score = 31.8 bits (72), Expect = 0.24
Identities = 17/54 (31%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 222 KQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSA 275
+Q+ LQA ++ L E++ R+S +++ RA + R ++EL ER+ G+ S
Sbjct: 8 QQLGTLQAEVKNLREDI---RRSEDRSDASRASMYRRMDELVERVTTLEGSVSK 58
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 33.6 bits (77), Expect = 0.27
Identities = 49/202 (24%), Positives = 81/202 (40%), Gaps = 6/202 (2%)
Query: 197 QRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLA 256
Q+K + A K +++Q QKQ E Q R+++LE+E A ++ + +AE+ A
Sbjct: 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAE-QERLKQLEKERLAAQEQKKQAEEAAKQAA 128
Query: 257 RELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAV 316
+ ++ E +A A A+ E KR A +K + + E+ + A
Sbjct: 129 LKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAE 188
Query: 317 SEMGEQIDQLNKLKTKAEHERNAA-----HNETNNTRAAADQLAREKAATEKIAKQLQHQ 371
+E + K K +AE ++ AA AAA A KAA EK A +
Sbjct: 189 AEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAE 248
Query: 372 LNEVQGKLDETNRSLNDFDAAK 393
E + D+ K
Sbjct: 249 KAAAAKAAAEVDDLFGGLDSGK 270
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 33.3 bits (76), Expect = 0.36
Identities = 35/133 (26%), Positives = 69/133 (51%), Gaps = 6/133 (4%)
Query: 123 QDTVD-RLQQEEDARNQLFQQKKKLEQENGGLKKD-IEDLEGDVEKQKRKIEGDLKLTQE 180
Q+ V+ R + E DA KK ++E+ LKK+ + + + + K + +IE + K ++
Sbjct: 42 QEAVNLRGKAERDAE----HIKKTAKRESKALKKELLLEAKEEARKYREEIEQEFKSERQ 97
Query: 181 AVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA 240
+ +E E ++ RKD+ ++S LE ++ + K I E + ++E+LEE+ +A
Sbjct: 98 ELKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQKKA 157
Query: 241 ERQSRAKAEKQRA 253
E + A + A
Sbjct: 158 ELERVAALSQAEA 170
>gnl|CDD|227605 COG5280, COG5280, Phage-related minor tail protein [Function
unknown].
Length = 634
Score = 33.3 bits (76), Expect = 0.38
Identities = 26/134 (19%), Positives = 51/134 (38%), Gaps = 12/134 (8%)
Query: 178 TQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKN-QKQIKELQARIEELEE 236
+++ + K L +R +E SL ++L+D L +N + L RI+ L
Sbjct: 6 IGGLTIEIDGDLKGLNGAFERVKRESNSLGSELKDAGELFKRNISASTEALAQRIQLLTG 65
Query: 237 EVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEE 296
+V ++ KQ+A + + GE EEA + E+ L
Sbjct: 66 QVTLTVKNLDDLAKQQAQVDAQ-FGSGEIGEEA----------ERLFTREIRVTEGGLTG 114
Query: 297 ANIQHESTLANLRK 310
+ + ++R+
Sbjct: 115 SLSELNEPTVSVRE 128
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 33.3 bits (76), Expect = 0.39
Identities = 71/357 (19%), Positives = 136/357 (38%), Gaps = 37/357 (10%)
Query: 226 ELQARIEELEEEVEAERQ--SRAKAEKQR-ADLARELEELGERLEEAGGATSAQIELNKK 282
RI EL+ E++ R S +AE +R + L++EL+E E LE G +I+ K
Sbjct: 31 YYLQRILELQAELKQLRAELSNVQAENERLSSLSQELKEENEMLELQRGRLRDEIKEYKF 90
Query: 283 REAEMSKLRRDLEEANIQHESTLANLR-------------KKHNDAVSEMGEQIDQLNKL 329
REA + + +LEE NI + ++ LR ++ + + Q+++ +L
Sbjct: 91 REARLLQDYSELEEENISLQKQVSVLRQSQVEFEGLKHEIRRLEEETELLNSQLEEAARL 150
Query: 330 KTKAEHERNAAHNETNNTRAAADQLAREKAATEKIA-----KQLQHQLNEVQGKLDETNR 384
K AE + A R + L +E + + L L+ ++ D
Sbjct: 151 KEIAEKQLEEALETLKTEREQKNALRKELSHHLNLEDFYSLSNLSISLDGLKFSEDPEAS 210
Query: 385 SLNDFDAAKKKLSIENSDLLRQLEEAE----------SQVSQLSKIKISLTTQLEDTKRL 434
+ + D + + L + S++ +S +++ K+
Sbjct: 211 TEPNNDGEDENGHLNGGPGLAKSLGTPRKGELFPPAPGVSDLFSELNLS---EIQKLKQQ 267
Query: 435 ADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLS---KANADAQLWRTKY 491
++ RE+A+LL + L+ ++ L E+ E L L+ A +
Sbjct: 268 LEQVEREKASLLANLQELQTQLEQSEGALSEQHEKVNRLTEHLNALRNLQASKEQQDGLD 327
Query: 492 ESEGVARAEELEESKSEERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLE 548
+ E+ + + + + L + + E EL LK Y E QE+ E
Sbjct: 328 SEKDRGSHEDGDYYEVDINGPEILECKYRVAVSEVGELKAELKALKARYNELQERYE 384
Score = 29.0 bits (65), Expect = 7.1
Identities = 40/190 (21%), Positives = 77/190 (40%), Gaps = 29/190 (15%)
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAE 249
++L+Q +++ ++E ASL A L++ Q + +++ + E ++ L E + A R +A E
Sbjct: 262 QKLKQQLEQVEREKASLLANLQELQTQLEQSEGALSEQHEKVNRLTEHLNALRNLQASKE 321
Query: 250 KQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLR 309
+Q E E G I + L
Sbjct: 322 QQ----DGLDSEKDRGSHEDGDYYEVDI-------------------------NGPEILE 352
Query: 310 KKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQ 369
K+ AVSE+GE +L LK + + +E + R LA + + EK + + Q
Sbjct: 353 CKYRVAVSEVGELKAELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQ 412
Query: 370 HQLNEVQGKL 379
+ +++ +L
Sbjct: 413 ERSGDLEKEL 422
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 32.9 bits (75), Expect = 0.45
Identities = 33/175 (18%), Positives = 65/175 (37%), Gaps = 26/175 (14%)
Query: 222 KQIKELQARIEELEEEVEAER--QSRAKAEKQRADLARE--LEELGERLEEAGGATS--- 274
++++E ++ +L+E++E R R K ++R + LEE RL E A S
Sbjct: 96 ERLQESYQKLTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGRLREELAALSREV 155
Query: 275 -AQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKA 333
Q L + A + R ++ + E A + + + L+TKA
Sbjct: 156 GKQRGLLSRGLATFKRDRILQQQWREEQEKYDAADKAR-------------AIYALQTKA 202
Query: 334 EHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLND 388
+ ++ + EK A+ + E + +++E LN
Sbjct: 203 DERNLETVLQSLSQADFQLAGVAEKELETVEAR-----IKEARYEIEELENKLNL 252
Score = 29.4 bits (66), Expect = 4.8
Identities = 27/136 (19%), Positives = 58/136 (42%), Gaps = 7/136 (5%)
Query: 125 TVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVAD 184
T + E + K+ LE+ G L++++ L +V KQ+ + L +
Sbjct: 119 TGRLKEGRE---RHFQKSKEALEETIGRLREELAALSREVGKQRGLLSRGLATFKRDRIL 175
Query: 185 LERNKKELE-QTIQRKDKEIASLTAKLEDE--QGLVGKNQKQIKELQARIEELEEEVEAE 241
++ ++E E K + I +L K ++ + ++ + +L E+ E VEA
Sbjct: 176 QQQWREEQEKYDAADKARAIYALQTKADERNLETVLQSLSQADFQLAGVAEKELETVEAR 235
Query: 242 -RQSRAKAEKQRADLA 256
+++R + E+ L
Sbjct: 236 IKEARYEIEELENKLN 251
>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
transferase subunit alpha; Provisional.
Length = 762
Score = 33.3 bits (76), Expect = 0.45
Identities = 76/329 (23%), Positives = 129/329 (39%), Gaps = 34/329 (10%)
Query: 154 KKDIEDLEGDVEKQKRKI----EGDLKLTQEAVAD-LERNKKELEQTIQRKDKEIASLTA 208
K + +LEG+VEK K +I E K ++ A+ + +E+ KKE++ + L
Sbjct: 428 KTPVRELEGEVEKLKEQILKAKESSSKPSELALNEMIEKLKKEIDLEYTEAVIAMG-LQE 486
Query: 209 KLEDEQGLVGK----NQKQIKELQARIEELEEEVEAERQSRAKAE---KQRADLARELEE 261
+LE+ + K +Q L +IE+L++E +R SRA K + D+ E
Sbjct: 487 RLENLREEFSKANSQDQLMHPVLMEKIEKLKDEFN-KRLSRAPNYLSLKYKLDMLNEFSR 545
Query: 262 LGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGE 321
E+ A + E+NKK + M D E + E+ A + S
Sbjct: 546 AKALSEKKSKAEKLKAEINKKFKEVM-----DRPEIKEKMEALKAEVAS------SGASS 594
Query: 322 QIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDE 381
+ + LK K E + E + L + Q +Q K++
Sbjct: 595 GDELDDDLKEKVEKMKKEIELELAGVLKSMG-LEVIGVTKKNKDTAEQTPPPNLQEKIES 653
Query: 382 TNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQ----LEDTKRLADE 437
N +N K + I +SDL ++E + +V++ SK + LE +
Sbjct: 654 LNEEIN----KKIERVIRSSDLKSKIELLKLEVAKASKTPDVTEKEKIEALEQQIKQKIA 709
Query: 438 EGRERATLLGKFRNLEHDIDNIREQLEEE 466
E + L KF LE ++ RE E
Sbjct: 710 EALNSSELKEKFEELEAELAAARETAAES 738
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 32.6 bits (75), Expect = 0.45
Identities = 31/149 (20%), Positives = 55/149 (36%), Gaps = 15/149 (10%)
Query: 126 VDRLQQEEDARNQ------LFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQ 179
+ +QQEE A Q + L E L+ +I++ + +V ++
Sbjct: 114 LLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQE------AQC 167
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVE 239
EA + K +++ + A+LE L + I +L+A+ LE +
Sbjct: 168 EAEGTGGTGVAGKGPVYKEKREKLDAAQARLET---LKARLDAAIAQLEAQKAALERNRQ 224
Query: 240 AERQSRAKAEKQRADLARELEELGERLEE 268
A + + L LE LGE E
Sbjct: 225 AAVAEKQARIEANDGLLARLEALGELTTE 253
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 32.6 bits (74), Expect = 0.49
Identities = 21/71 (29%), Positives = 39/71 (54%)
Query: 226 ELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREA 285
+L+ EEL+E++E ++ + + K+ +L E EE+ ERL+ S E+ KK
Sbjct: 132 DLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPG 191
Query: 286 EMSKLRRDLEE 296
E+ L++ +E
Sbjct: 192 EVYDLKKRWDE 202
>gnl|CDD|234705 PRK00260, cysS, cysteinyl-tRNA synthetase; Validated.
Length = 463
Score = 32.8 bits (76), Expect = 0.54
Identities = 34/96 (35%), Positives = 46/96 (47%), Gaps = 13/96 (13%)
Query: 178 TQEAVADLERNKKELEQTIQRKDKE----IASLTAKLEDEQGLVGKNQKQIKELQARIEE 233
T EA+A L +E+ + ++ KDKE +A+L L D GL + A E
Sbjct: 356 TPEALAVLFELAREINRALETKDKEAAAALAALLRALADVLGLEQDEEAL--FEAAADEL 413
Query: 234 LEEEVEA---ERQSRAKAEKQ--RADLAR-ELEELG 263
L+ E+EA ERQ A+ K AD R EL LG
Sbjct: 414 LDAEIEALIEERQ-EARKAKDFALADAIRDELAALG 448
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 32.6 bits (74), Expect = 0.61
Identities = 58/291 (19%), Positives = 112/291 (38%), Gaps = 23/291 (7%)
Query: 188 NKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAK 247
N K T + E SL +E Q + + +E ++ E+Q+ +K
Sbjct: 62 NSKLQSNTDENGQLENTSLRTVMELPQKST-SSDDDHNRASMQRDEAIAAIDNEQQTNSK 120
Query: 248 AEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEAN-IQHESTLA 306
+Q +D +LE+L ++ A + LN+ R + L + L E +Q + +
Sbjct: 121 DGEQLSDF--QLEDLVGMIQNAEKNI---LLLNQARLQALEDLEKILTEKEALQGKINIL 175
Query: 307 NLRKKHNDAVSEMGEQI--------DQLNKLKTKAEHERNAAHNETNNTRAAADQLAREK 358
+R DA ++ Q +QL KL+ + ++ D L E
Sbjct: 176 EMRLSETDARIKLAAQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKELDVLKEEN 235
Query: 359 AATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 418
+ + L+ +L EV ET + + + L +L + A+ VS+LS
Sbjct: 236 MLLKDDIQFLKAELIEVA----ETEERVFKLEKERSLLDASLRELESKFIVAQEDVSKLS 291
Query: 419 KIKISL----TTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEE 465
++ L+D A + + A +L + ++L +D + L+E
Sbjct: 292 PLQYDCWWEKVENLQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKE 342
>gnl|CDD|226055 COG3524, KpsE, Capsule polysaccharide export protein [Cell envelope
biogenesis, outer membrane].
Length = 372
Score = 32.5 bits (74), Expect = 0.61
Identities = 27/113 (23%), Positives = 50/113 (44%), Gaps = 14/113 (12%)
Query: 182 VADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAE 241
V D + + + + + E+ + A+L+ + ++ QI L+ARIE L +++ E
Sbjct: 211 VFDPKAQAEVQMSLVSKLEDELIVIQAQLDTVKSVMNPENPQIPGLKARIESLRKQLLQE 270
Query: 242 RQ-----------SRAKAEKQRADLARELEELGERLEEAGGA-TSAQIELNKK 282
+Q S AE QR L E + A + SA+IE +++
Sbjct: 271 KQAISAGGSSQSLSNQAAEFQRLYLENTFAE--KAYAAALTSLESARIEADRQ 321
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 32.2 bits (73), Expect = 0.65
Identities = 34/129 (26%), Positives = 51/129 (39%), Gaps = 11/129 (8%)
Query: 185 LERNKKELEQTIQRKD--KEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAER 242
LER EL + +D K+ S E ++ EL+ E L +E+E
Sbjct: 31 LERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELE--- 87
Query: 243 QSRAKAEKQRADLARELEELGERLEEA-GGATSAQIELNKKREAEMSKLRRDLEEANIQH 301
+ EK+ DL EL EL E E+ E N + L +L+ +Q+
Sbjct: 88 ----ELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQ-LEDNLQSLELQY 142
Query: 302 ESTLANLRK 310
E +L L K
Sbjct: 143 EYSLNQLDK 151
Score = 30.6 bits (69), Expect = 2.1
Identities = 29/136 (21%), Positives = 55/136 (40%), Gaps = 12/136 (8%)
Query: 302 ESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAAT 361
+ L L + DA E L+KL+++ N ++ D+L +E+
Sbjct: 28 DILLERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSE-----LDELKKEE--- 79
Query: 362 EKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIK 421
++L +L E++ + D+ + L + K++L E LR+ + QL
Sbjct: 80 ----ERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNL 135
Query: 422 ISLTTQLEDTKRLADE 437
SL Q E + D+
Sbjct: 136 QSLELQYEYSLNQLDK 151
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 32.3 bits (75), Expect = 0.66
Identities = 23/100 (23%), Positives = 43/100 (43%), Gaps = 22/100 (22%)
Query: 167 QKRKIEGDLKLTQEAVADLERNKKELEQTIQ-----RKD--KEIASLTAKLEDEQGLVGK 219
KR D+ + + +L+ ++EL+ ++ R KEI K ED + L
Sbjct: 19 AKRGFPLDV----DELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEAL--- 71
Query: 220 NQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLAREL 259
++KEL+ I+ LE E++ E + +L +
Sbjct: 72 -IAEVKELKEEIKALEAELDE-------LEAELEELLLRI 103
Score = 30.8 bits (71), Expect = 1.9
Identities = 20/79 (25%), Positives = 33/79 (41%), Gaps = 3/79 (3%)
Query: 187 RNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQAR---IEELEEEVEAERQ 243
R + D+E L +LE+ Q K+I + + + E L EV+ ++
Sbjct: 21 RGFPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKE 80
Query: 244 SRAKAEKQRADLARELEEL 262
E + +L ELEEL
Sbjct: 81 EIKALEAELDELEAELEEL 99
Score = 28.5 bits (65), Expect = 9.7
Identities = 20/83 (24%), Positives = 39/83 (46%), Gaps = 7/83 (8%)
Query: 306 ANLRKKHNDA-VSEMGEQIDQLNKLKTKAE---HERNAAHNETNNTRAAADQLAREKAAT 361
L K+ V E+ E ++ +L+T+ E ERNA E + + A
Sbjct: 16 EALAKRGFPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEV 75
Query: 362 EKIA---KQLQHQLNEVQGKLDE 381
+++ K L+ +L+E++ +L+E
Sbjct: 76 KELKEEIKALEAELDELEAELEE 98
>gnl|CDD|183173 PRK11519, PRK11519, tyrosine kinase; Provisional.
Length = 719
Score = 32.4 bits (74), Expect = 0.69
Identities = 36/149 (24%), Positives = 66/149 (44%), Gaps = 24/149 (16%)
Query: 174 DLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEE 233
D + ++ + + RN LEQ I+RK +E + A L +Q+ E+++R++
Sbjct: 238 DREQIRDILNSITRNY--LEQNIERKSEEASKSLAFL----------AQQLPEVRSRLDV 285
Query: 234 LEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL--- 290
E ++ A RQ +K DL E + + + + AQ+ +EAE+SKL
Sbjct: 286 AENKLNAFRQ-----DKDSVDLPLEAKAVLDSMV----NIDAQLNELTFKEAEISKLYTK 336
Query: 291 RRDLEEANIQHESTLANLRKKHNDAVSEM 319
++ L + + K N V+ M
Sbjct: 337 EHPAYRTLLEKRKALEDEKAKLNGRVTAM 365
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 32.2 bits (74), Expect = 0.71
Identities = 42/215 (19%), Positives = 85/215 (39%), Gaps = 19/215 (8%)
Query: 232 EELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLR 291
E +E + Q+ +A ++ DL + E +R + Q+E E E L+
Sbjct: 153 ELAQEAYQEAYQAWKQARRELEDLQEKERERAQRADLL----QFQLE-----ELEELNLQ 203
Query: 292 RDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQ------LNKLKTKAEHERNAAHNETN 345
+E + L+N +K +A+ E + L +A
Sbjct: 204 PGEDEELEEERKRLSNS-EKLAEAIQNALELLSGEDDTVSALSLLGRALEALEDLSEYDG 262
Query: 346 NTRAAADQLAREKAATEKIAKQLQHQLNEVQ---GKLDETNRSLNDFDAAKKKLSIENSD 402
A+ L E+ +++L+ L+E++ +L+E L + +K + D
Sbjct: 263 KLSELAELLEEALYELEEASEELRAYLDELEFDPNRLEEVEERLFALKSLARKYGVTIED 322
Query: 403 LLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADE 437
LL L++ + +++QL + SL ++ K+L E
Sbjct: 323 LLEYLDKIKEELAQLDNSEESLEALEKEVKKLKAE 357
>gnl|CDD|223047 PHA03332, PHA03332, membrane glycoprotein; Provisional.
Length = 1328
Score = 32.2 bits (73), Expect = 0.77
Identities = 17/93 (18%), Positives = 29/93 (31%), Gaps = 11/93 (11%)
Query: 181 AVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA 240
+ L + I + IA ++A L I+ + R+ +LE++V
Sbjct: 899 KIGGLNARVDKTSDVITKLGDTIAKISATL----------DNNIRAVNGRVSDLEDQVNL 948
Query: 241 E-RQSRAKAEKQRADLARELEELGERLEEAGGA 272
L ER+EE A
Sbjct: 949 RFLAVATNFNTLATQLKELGTTTNERIEEVMAA 981
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 32.2 bits (74), Expect = 0.81
Identities = 19/81 (23%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
Query: 136 RNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDL-KLTQEAVADLERNKKELEQ 194
+L ++ K+LE+E L++++E ++ ++K +K G L +E + L K++L +
Sbjct: 329 FPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSE 388
Query: 195 TIQRKDKEIASLTAKLEDEQG 215
++ ++E+ L +LE
Sbjct: 389 ELEELEEELKELKEELESLYS 409
Score = 30.7 bits (70), Expect = 2.4
Identities = 19/82 (23%), Positives = 40/82 (48%), Gaps = 12/82 (14%)
Query: 170 KIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQ-------- 221
++ D +E + +LE KELE+ +++ K + L K + + Q
Sbjct: 324 EVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETK 383
Query: 222 ----KQIKELQARIEELEEEVE 239
++++EL+ ++EL+EE+E
Sbjct: 384 EKLSEELEELEEELKELKEELE 405
>gnl|CDD|187736 cd02255, Peptidase_C12, Cysteine peptidase C12 contains ubiquitin
carboxyl-terminal hydrolase (UCH) families L1, L3, L5
and BAP1. The ubiquitin C-terminal hydrolase (UCH;
ubiquitinyl hydrolase; ubiquitin thiolesterase) family
of deubiquitinating enzymes (DUBs) consists of four
members to date: UCH-L1, UCH-L3, UCH-L5 (UCH37) and
BRCA1-associated protein-1 (BAP1), all containing a
conserved catalytic domain with cysteine peptidase
activity. UCH-L1 hydrolyzes carboxyl terminal esters
and amides of ubiquitin (Ub). Dysfunction of this
hydrolase activity can lead to an accumulation of
alpha-synuclein, which is linked to Parkinson's disease
(PD) and neurofibrillary tangles, linked to Alzheimer's
disease (AD). UCH-L1, in its dimeric form, has
additional enzymatic activity as a ubiquitin ligase.
UCH-L3 hydrolyzes isopeptide bonds at the C-terminal
glycine of either Ub or Nedd8, a ubiquitin-like protein.
UCH-L3 can also interact with Lys48-linked Ub dimers to
protect it from degradation while inhibiting its
hydrolase activity at the same time. UCH-L1 and UCH-L3
are the most closely related of the UCH members. UCH-L5
(UCH37) is involved in the deubiquitinating activity in
the 19S proteasome regulatory complex. It is also
associated with the human Ino80 chromatin-remodeling
complex (hINO80) in the nucleus. BAP1 binds to the
wild-type BRCA1 RING finger domain, localized in the
nucleus. It consists of the N-terminal UCH domain and
two predicted nuclear localization signals (NLSs), only
one of which is functional. The full-length human BRCA1
is a ubiquitin ligase. However, BAP1 does not appear to
function in the deubiquitination of autoubiquitinated
BRCA1. There is growing evidence that UCH enzymes and
human malignancies are closely correlated. Studies show
that UCH enzymes play a crucial role in some signaling
pathways and in cell-cycle regulation.
Length = 222
Score = 31.7 bits (71), Expect = 0.88
Identities = 9/81 (11%), Positives = 27/81 (33%), Gaps = 4/81 (4%)
Query: 334 EHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDF---- 389
E + + + ++ Q T + + + +++ + T + +
Sbjct: 65 EEKIKSQGQDVTSSVYFMKQTISNACGTIGLIHAIANNKDKMHFESGSTLKKFLEESVSM 124
Query: 390 DAAKKKLSIENSDLLRQLEEA 410
++ +EN D +R E
Sbjct: 125 SPEERARYLENYDAIRVTHET 145
Score = 28.2 bits (62), Expect = 9.1
Identities = 7/56 (12%), Positives = 20/56 (35%), Gaps = 4/56 (7%)
Query: 1 AATEKIAKQLQHQLNEVQGKLDETNRSLNDF----DAAKKKLSIENSDLLRQLEEA 52
T + + + +++ + T + + ++ +EN D +R E
Sbjct: 90 CGTIGLIHAIANNKDKMHFESGSTLKKFLEESVSMSPEERARYLENYDAIRVTHET 145
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 31.1 bits (71), Expect = 0.88
Identities = 18/63 (28%), Positives = 33/63 (52%)
Query: 222 KQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNK 281
+ + +L+ R+E+L++E+E +Q A+ + Q L + EE ER E + EL K
Sbjct: 62 QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKK 121
Query: 282 KRE 284
+
Sbjct: 122 LKA 124
Score = 29.5 bits (67), Expect = 3.0
Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 5/89 (5%)
Query: 403 LLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQ 462
L +LE+ + ++ +L + L Q+E K+ +E ER LL + + LE ++ ++ +
Sbjct: 67 LKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETE-ERTELLEELKQLEKELKKLKAE 125
Query: 463 LEE----EAEGKADLQRQLSKANADAQLW 487
LE+ + E L+ + A A W
Sbjct: 126 LEKYEKNDPERIEKLKEETKVAKEAANRW 154
Score = 29.5 bits (67), Expect = 3.6
Identities = 13/70 (18%), Positives = 37/70 (52%), Gaps = 6/70 (8%)
Query: 177 LTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQG------LVGKNQKQIKELQAR 230
+A+ L+ ++L++ ++ + IA L A++E + + +++K+L+
Sbjct: 59 FPSQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKE 118
Query: 231 IEELEEEVEA 240
+++L+ E+E
Sbjct: 119 LKKLKAELEK 128
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 32.0 bits (73), Expect = 0.91
Identities = 16/88 (18%), Positives = 41/88 (46%), Gaps = 9/88 (10%)
Query: 192 LEQTIQRKDKEIA----SLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAK 247
L + + R D+E+ + A+ + + + Q + +L ++ +L+ ++ + ++
Sbjct: 344 LNRKVDRLDQELVQRQQANDAQTAELRVKTEQAQASVHQLDSQFAQLDGKLADAQSAQQA 403
Query: 248 AEKQRADLAR-----ELEELGERLEEAG 270
E+Q DL+R + E+ + L A
Sbjct: 404 LEQQYQDLSRNRDDWMIAEVEQMLSSAS 431
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 32.2 bits (74), Expect = 0.92
Identities = 65/340 (19%), Positives = 123/340 (36%), Gaps = 34/340 (10%)
Query: 233 ELEEEVEAERQSRAKAEKQRADLARELEELGER---LEEAGGATSAQIELNKKREAEMSK 289
EL E+ R+ A + + ++AREL EL E LE+ A S + L + + K
Sbjct: 290 ELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQEK 349
Query: 290 LRR---DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNN 346
+ R DLEE + E N+ V E EQ ++ AE E + ++ +
Sbjct: 350 IERYQADLEELEERLEE--------QNEVVEEADEQQEENEARAEAAEEEVDELKSQLAD 401
Query: 347 TRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQ 406
+ A D + + +A + A Q L + + + ++ + ++ + + +
Sbjct: 402 YQQALD-VQQTRAIQYQQAVQA---LERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEE 457
Query: 407 LEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERA-----TLLGKFRNLEHDIDN--- 458
L E ++S + +++A E R A LL + R H +
Sbjct: 458 LLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQLQQ 517
Query: 459 IREQLEEEAEGKADLQRQLSKANADAQ-------LWRTKYESEGVARAEELEESKSEERR 511
+R +L E E + Q++ + A+ + E LE
Sbjct: 518 LRMRL-SELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSE 576
Query: 512 VDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVR 551
+ L ++ + L A+ AQ+ L +R
Sbjct: 577 ARERRMALRQQLEQLQARIQRLAARAPAWLAAQDALARLR 616
Score = 31.9 bits (73), Expect = 1.2
Identities = 66/339 (19%), Positives = 124/339 (36%), Gaps = 71/339 (20%)
Query: 172 EGDLKLTQEAVADLERNKKELEQTIQR------KDKEIASLTAKLEDEQGLVGKNQKQIK 225
E +L+ +LER + E Q+ + KE S +L L+
Sbjct: 836 EAELRQLNRRRVELERALADHESQEQQQRSQLEQAKEGLSALNRLLPRLNLLADET---- 891
Query: 226 ELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIE--LNKKR 283
L R+EE+ E+++ AE+ + + + G AQ+E ++ +
Sbjct: 892 -LADRVEEIREQLDE-------AEEAKRFVQQH------------GNALAQLEPIVSVLQ 931
Query: 284 E--AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAH 341
+ +L++D ++A + T + +++ A++E+ ++ + +N+
Sbjct: 932 SDPEQFEQLKQDYQQA----QQTQRDAKQQ-AFALTEVVQRRAHFSYEDAAEMLAKNSDL 986
Query: 342 NETNNTRAAADQLAREKAATEKIAKQLQH-QLNEVQGKLD-----------ETNRSLNDF 389
NE R + R +A + Q Q Q N+V L E + L D
Sbjct: 987 NEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDL 1046
Query: 390 -----DAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERAT 444
A+++ +L +L S+ +QL K +++ E
Sbjct: 1047 GVPADSGAEERARARRDELHARLSANRSRRNQLEK-QLTFCE-------------AEMDN 1092
Query: 445 LLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANAD 483
L K R LE D +REQ+ G + R L K N
Sbjct: 1093 LTKKLRKLERDYHEMREQVVNAKAGWCAVLR-LVKDNGV 1130
Score = 29.9 bits (68), Expect = 4.5
Identities = 80/387 (20%), Positives = 158/387 (40%), Gaps = 64/387 (16%)
Query: 149 ENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQE---------------AVADLERNKKELE 193
EN G++K +D+E + + + +E +++TQ AD R+ E
Sbjct: 224 ENSGVRKAFQDMEAALRENRMTLE-AIRVTQSDRDLFKHLITESTNYVAADYMRHANERR 282
Query: 194 QTIQR---KDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA--ERQSRAK- 247
++ +E+ + +L EQ + + +++ EL +LE++ +A + + +
Sbjct: 283 VHLEEALELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQT 342
Query: 248 AEKQRADLAR---ELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHEST 304
A +Q+ + R +LEEL ERLEE E ++ EA +++E Q
Sbjct: 343 ALRQQEKIERYQADLEELEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKSQ---- 398
Query: 305 LANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQL-AREKAATEK 363
LA+ ++ D Q Q + +A+ +N ++ A+E+ ATE
Sbjct: 399 LADYQQAL-DVQQTRAIQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATE- 456
Query: 364 IAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKL-----SIENSD-------LLRQLEEAE 411
+L ++ KL + + F+ A + + + S+ LLR+L E
Sbjct: 457 -------ELLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQR 509
Query: 412 SQVSQLSKIKISLTT-------QLEDTKRLADEEGRERATLLGK------FRNLEHDIDN 458
QL ++++ L+ Q + LA+ R L + LE +++
Sbjct: 510 HLAEQLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLES 569
Query: 459 IREQLEEEAEGKADLQRQLSKANADAQ 485
+ E + E E + L++QL + A Q
Sbjct: 570 LSESVSEARERRMALRQQLEQLQARIQ 596
Score = 29.5 bits (67), Expect = 5.7
Identities = 58/345 (16%), Positives = 114/345 (33%), Gaps = 47/345 (13%)
Query: 229 ARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMS 288
RIE+L E E + A L R + + + EAE+
Sbjct: 786 KRIEQLRAEREELAERYATLSFDVQKLQRLHQAF-----SRFIGSHLAVAFEADPEAELR 840
Query: 289 KLRRDLEEANIQHESTLANLRKKHNDAVSEMG---EQIDQLNKLKTKAEHERNAAHNETN 345
+L R E E LA+ + S++ E + LN+L + + +
Sbjct: 841 QLNRRRVEL----ERALADHESQEQQQRSQLEQAKEGLSALNRLLPRLNLLADETLAD-- 894
Query: 346 NTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLR 405
+QL + A ++ +Q + L +++ + F+ L +
Sbjct: 895 RVEEIREQLDEAEEA-KRFVQQHGNALAQLEPIVSVLQSDPEQFEQ-----------LKQ 942
Query: 406 QLEEAESQVSQLSKIKISLTTQL--------EDTKRLADEEGRERATLLGKFRNLEHDID 457
++A+ + +LT + ED + + L + E +
Sbjct: 943 DYQQAQQTQRDAKQQAFALTEVVQRRAHFSYEDAAEMLAKNSDLNEKLRQRLEQAEQERT 1002
Query: 458 NIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEE------SKSEER- 510
REQL + Q ++ A + +EL++ S +EER
Sbjct: 1003 RAREQLRQ----AQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSGAEERA 1058
Query: 511 --RVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEAVRRE 553
R D+L A L A++ +L + + ++L + R+
Sbjct: 1059 RARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERD 1103
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 31.9 bits (72), Expect = 0.93
Identities = 32/109 (29%), Positives = 47/109 (43%), Gaps = 21/109 (19%)
Query: 164 VEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQ 223
+E+ R + D++ ADL+R K E EQ Q A+ AK K
Sbjct: 283 IEELARGLRADIERVARENADLQRQKLEAEQAAQ------AAQEAK-----------AKA 325
Query: 224 IKELQARIEELEEEVEAERQ----SRAKAEKQRADLARELEELGERLEE 268
KE QAR +L+ E + Q +A +R +LA+ELE E+
Sbjct: 326 EKEAQAREAKLQAECARQTQLALEEKAALRAERDNLAKELEAKKREAEQ 374
>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
topoisomerase IV), A subunit [DNA replication,
recombination, and repair].
Length = 804
Score = 31.9 bits (73), Expect = 0.96
Identities = 40/163 (24%), Positives = 72/163 (44%), Gaps = 20/163 (12%)
Query: 367 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSI-ENSDLLRQLE---EAESQVSQLSKIKI 422
L+H+L V T R+ + + A+++L I E LL L E + + SK K
Sbjct: 353 FLEHRLEVV------TRRTEYELNKAEERLHILEG--LLIALLNIDEVIEIIRE-SKDKP 403
Query: 423 SLTTQLEDTKRLADEEGRERATL-LGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKAN 481
+L L++++ L L + LE + I ++L+E + ADL++ L+
Sbjct: 404 EAKEELMARFGLSEKQAEAILDLRLRRLTGLE--EEKIEKELKELEKEIADLEKILASEE 461
Query: 482 ADAQLWRTK----YESEGVARAEELEESKSEERRVDDLAAELD 520
+ + + + G R E+ E + +E +DL AE D
Sbjct: 462 RLLDIIKKELLEIKKKFGDERRTEIVEEEEDEIEDEDLIAEED 504
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 32.0 bits (73), Expect = 0.96
Identities = 21/107 (19%), Positives = 42/107 (39%), Gaps = 8/107 (7%)
Query: 137 NQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLE-RNKKELEQT 195
+ F++ K L++E LK I D + D K K + +K+ V ++ ++ K L+
Sbjct: 729 ERFFEEWKALQKELESLKLKIADQKIDELKDKAETINGVKVL---VEVVDAKDMKSLKTM 785
Query: 196 I----QRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEV 238
+ I L + + LV + + EL ++
Sbjct: 786 ADRLKSKLGSAIYVLASFANGKVKLVCGVSNDVLNKGIKAGELINQI 832
>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203).
This domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 120
Score = 30.4 bits (69), Expect = 1.0
Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 222 KQIKELQARIEELEEE-VEAERQSRAKAEKQRAD-LARELEELGERLEEAGG 271
++++E +A +EELE E ER A+ K+ L ++ +EE G
Sbjct: 20 EEVREAKAELEELEAALSEGERSLEARGLKEELRELEEQIRAAIAEIEELGI 71
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 31.9 bits (73), Expect = 1.1
Identities = 27/178 (15%), Positives = 75/178 (42%), Gaps = 9/178 (5%)
Query: 157 IEDLEGDVEKQKRKIEGDLKLTQEAV--ADLERNKKELEQTIQRKDKEIASLTAKLEDEQ 214
+E L ++EK+++++E T + + DL++ ++ LE+ + ++KEIA +
Sbjct: 1104 VEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTK 1163
Query: 215 GLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATS 274
G K +K + + + ++ + + +++ +R D + + + + ++
Sbjct: 1164 GKASKLRKPKLKKKEK-KKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSG 1222
Query: 275 AQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTK 332
+ E +++++ + K +S N K D + + + K K
Sbjct: 1223 SDQEDDEEQKTKPKKSSVKRL------KSKKNNSSKSSEDNDEFSSDDLSKEGKPKNA 1274
>gnl|CDD|191367 pfam05761, 5_nucleotid, 5' nucleotidase family. This family of
eukaryotic proteins includes 5' nucleotidase enzymes,
such as purine 5'-nucleotidase EC:3.1.3.5.
Length = 448
Score = 31.5 bits (72), Expect = 1.1
Identities = 19/68 (27%), Positives = 28/68 (41%), Gaps = 16/68 (23%)
Query: 231 IEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 290
+ ELE E+E + + E+ + L+ L ERL AE+S L
Sbjct: 323 VPELEREIEIWNTEQYRFEELQ-----RLDILLERLY----DHLDVH-------AELSTL 366
Query: 291 RRDLEEAN 298
R DL E +
Sbjct: 367 RPDLSELD 374
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 31.2 bits (71), Expect = 1.2
Identities = 25/126 (19%), Positives = 60/126 (47%)
Query: 152 GLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLE 211
GL++++E ++ D E + + + + L+ K+E+ Q D+ +LE
Sbjct: 151 GLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELE 210
Query: 212 DEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGG 271
+ + +I E + ++EEL++E++ + ++++L E+ E + EE G
Sbjct: 211 KARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRG 270
Query: 272 ATSAQI 277
++ +I
Sbjct: 271 WSAKEI 276
Score = 29.2 bits (66), Expect = 5.3
Identities = 23/101 (22%), Positives = 41/101 (40%), Gaps = 9/101 (8%)
Query: 143 KKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKL-TQEAVADLERNKKELEQTIQRKDK 201
K KL ++ LK++I L R++ +L L + + + L I K K
Sbjct: 177 KPKLRKKLQALKEEIASL--------RQLADELNLCDPLELEKARQELRSLSVKISEKRK 228
Query: 202 EIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAER 242
++ L +L++ + + EL I E E+ E R
Sbjct: 229 QLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECR 269
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 31.6 bits (72), Expect = 1.3
Identities = 50/225 (22%), Positives = 98/225 (43%), Gaps = 15/225 (6%)
Query: 220 NQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIEL 279
Q+Q+ + A E + Q+ KA +Q D ++ +EL +RL+ Q +L
Sbjct: 145 EQRQLLDTFAGANEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRLD------FLQFQL 198
Query: 280 NKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAV----SEMGEQIDQLNKLKTKAEH 335
+ EA++ + EA Q S L LR+ +A+ ++ Q L + +A+
Sbjct: 199 EELEEADLQPGEDEALEAEQQRLSNLEKLRELSQNALAALRGDVDVQEGSLLEGLGEAQL 258
Query: 336 ERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQG---KLDETNRSLNDFDAA 392
+ + + R A+Q+ E+ ++LQ+ L+E++ +L+E L
Sbjct: 259 ALASVIDGS--LRELAEQVGNALTEVEEATRELQNYLDELEFDPERLNEIEERLAQIKRL 316
Query: 393 KKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADE 437
K+K ++L E+ + ++ QL SL E+ +L +E
Sbjct: 317 KRKYGASVEEVLEYAEKIKEELDQLDDSDESLEALEEEVDKLEEE 361
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 31.4 bits (72), Expect = 1.4
Identities = 26/112 (23%), Positives = 50/112 (44%), Gaps = 12/112 (10%)
Query: 141 QQKKKLEQENGGLKKDIEDLEGDVEK-----QKRKIEGDLKLTQEAVA-DLERNKKELEQ 194
+QK + +G +IE + D E+ +KRK + K E LE++ KE
Sbjct: 490 EQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGD 549
Query: 195 TIQRKDKE-IASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSR 245
+ DK+ + L++E + +E++A+ EEL++ V+ +
Sbjct: 550 KLPEADKKKVEEAIEWLKEE-----LEGEDKEEIEAKTEELQKVVQPIGERM 596
>gnl|CDD|173881 cd08516, PBP2_NikA_DppA_OppA_like_11, The substrate-binding
component of an uncharacterized ABC-type
nickel/dipeptide/oligopeptide-like import system
contains the type 2 periplasmic binding fold. This CD
represents the substrate-binding domain of an
uncharacterized ATP-binding cassette (ABC) type
nickel/dipeptide/oligopeptide-like transporter. The
oligopeptide-binding protein OppA and the
dipeptide-binding protein DppA show significant sequence
similarity to NikA, the initial nickel receptor. The
DppA binds dipeptides and some tripeptides and is
involved in chemotaxis toward dipeptides, whereas the
OppA binds peptides of a wide range of lengths (2-35
amino acid residues) and plays a role in recycling of
cell wall peptides, which precludes any involvement in
chemotaxis. Most of other periplasmic binding proteins
are comprised of only two globular subdomains
corresponding to domains I and III of the
dipeptide/oligopeptide binding proteins. The structural
topology of these domains is most similar to that of the
type 2 periplasmic binding proteins (PBP2), which are
responsible for the uptake of a variety of substrates
such as phosphate, sulfate, polysaccharides,
lysine/arginine/ornithine, and histidine. The PBP2 bind
their ligand in the cleft between these domains in a
manner resembling a Venus flytrap. After binding their
specific ligand with high affinity, they can interact
with a cognate membrane transport complex comprised of
two integral membrane domains and two cytoplasmically
located ATPase domains. This interaction triggers the
ligand translocation across the cytoplasmic membrane
energized by ATP hydrolysis. Besides transport
proteins, the PBP2 superfamily includes the
ligand-binding domains from ionotropic glutamate
receptors, LysR-type transcriptional regulators, and
unorthodox sensor proteins involved in signal
transduction.
Length = 457
Score = 31.5 bits (72), Expect = 1.4
Identities = 23/102 (22%), Positives = 44/102 (43%), Gaps = 14/102 (13%)
Query: 48 QLEEAESQVSQLSKIKISLTTQLEDTKRLADEESSGWVEELTKGSIEA---ITLKSLDLD 104
++ A+ +QL+ I I++ +L E + W++++ KG +A T + D D
Sbjct: 331 HVDTAQVIQAQLAAIGINVEIEL--------VEWATWLDDVNKGDYDATIAGTSGNADPD 382
Query: 105 -VLNFTTENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKKK 145
+ N + + YS VD L R + + K+K
Sbjct: 383 GLYNRYFTSGGKLNFFNYSNPEVDELL--AQGRAETDEAKRK 422
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 31.6 bits (72), Expect = 1.4
Identities = 32/147 (21%), Positives = 58/147 (39%), Gaps = 27/147 (18%)
Query: 226 ELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREA 285
LQ +++ELE E + E R K K+ +E+ GE LEE + ++ ++ E
Sbjct: 88 GLQRKLKELEREQKEEEV-REKHNKKI------IEKFGEDLEEVYKFMKGEAKVEEEEEK 140
Query: 286 EMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNE-- 343
+M L + L+ + K+ N + + +L K K ER +
Sbjct: 141 QMEILEKALKSYL--------KIVKEEN-------KSLQRLAKALQKESEERTQDETKMI 185
Query: 344 ---TNNTRAAADQLAREKAATEKIAKQ 367
+ A + + EK ++ A Q
Sbjct: 186 EEYRDKIDALKNAIEVEKEGLQEEAIQ 212
>gnl|CDD|225224 COG2348, COG2348, Peptidoglycan interpeptide bridge formation
enzyme [Cell wall/membrane/envelope biogenesis].
Length = 418
Score = 31.2 bits (71), Expect = 1.5
Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 1/59 (1%)
Query: 181 AVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVE 239
A DL+ K+L Q + + EI + L+ E K Q ++ LQ ++E EE +
Sbjct: 240 AYLDLDEYLKKLNQELAKLAAEIERVQEALK-ESPKSEKAQNKLNRLQMQLEAFEERIA 297
>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
Length = 508
Score = 31.4 bits (71), Expect = 1.5
Identities = 36/128 (28%), Positives = 53/128 (41%), Gaps = 8/128 (6%)
Query: 240 AERQSRAKAEKQR-ADLARELEELGERLEEAGGATSAQIELNKKR-EAEMSKLRRDLEEA 297
AE +R + R AD A+ R A+ ++E K R +AE+ + + L A
Sbjct: 192 AETLAREAEDAARAADEAKTAAAAAAREAAPLKASLRKLERAKARADAELKRADKALAAA 251
Query: 298 NIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLARE 357
A R++ + E QL+ K AE +R AA T+ AA A +
Sbjct: 252 KTDEAKARAEERQQKAAQ--QAAEAATQLDTAKADAEAKRAAA----AATKEAAKAAAAK 305
Query: 358 KAATEKIA 365
KA T K A
Sbjct: 306 KAETAKAA 313
>gnl|CDD|220381 pfam09753, Use1, Membrane fusion protein Use1. This entry is of a
family of proteins all approximately 300 residues in
length. The proteins have a single C-terminal
trans-membrane domain and a SNARE [soluble NSF
(N-ethylmaleimide-sensitive fusion protein) attachment
protein receptor] domain of approximately 60 residues.
The SNARE domains are essential for membrane fusion and
are conserved from yeasts to humans. Use1 is one of the
three protein subunits that make up the SNARE complex
and it is specifically required for Golgi-endoplasmic
reticulum retrograde transport.
Length = 251
Score = 30.9 bits (70), Expect = 1.6
Identities = 41/187 (21%), Positives = 69/187 (36%), Gaps = 25/187 (13%)
Query: 50 EEAESQVSQLSKIKISLTTQLEDTKRLADEES----SGWVEELT--KGSIEAITLKSLDL 103
E+ E +L K +L LE+ K+ + S + + + KG +EA S
Sbjct: 20 EKREQNEWRLEKYVKALREMLEELKKSLSKPSVDVLTEYRRRVDFLKGLLEAEKPSS-PS 78
Query: 104 DVLNFTTENSHSFSISLYSQDT----VDRLQQEE----DARNQLFQQKKKLEQENGGLKK 155
+ + S + ++ + Q + + R +L L + +
Sbjct: 79 EKALANQKLSRGRAPTVDEPRSPASKEIHQQTKSRYTSELRKELLSGSTSLRRSTDAKDR 138
Query: 156 DIEDLEGDVEKQKRKIEGDLK---LTQEAVAD----LERNKKELEQTIQ---RKDKEIAS 205
D D E I+ L+ QE +A+ L RN KE Q ++D EI S
Sbjct: 139 DSSGPINDDEASAASIDQVLQYHQNLQEKLAEEMLLLARNLKEQSLAAQQIIKEDTEILS 198
Query: 206 LTAKLED 212
+AKL D
Sbjct: 199 SSAKLAD 205
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 31.2 bits (71), Expect = 1.8
Identities = 24/105 (22%), Positives = 41/105 (39%), Gaps = 20/105 (19%)
Query: 285 AEMSKLRRDLEEANIQHESTLA---NLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAH 341
AE+ +LR+ L + ++E+ A L+K+ + +QI Q A
Sbjct: 66 AEVKELRKRLAKLISENEALKAENERLQKREQS----IDQQIQQ-------------AVQ 108
Query: 342 NETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSL 386
+ET +QL E+ + + QLQ +L V L
Sbjct: 109 SETQELTKEIEQLKSERQQLQGLIDQLQRRLAGVLTGPSGGGSDL 153
Score = 29.2 bits (66), Expect = 6.6
Identities = 19/95 (20%), Positives = 41/95 (43%), Gaps = 14/95 (14%)
Query: 122 SQDT----VDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKL 177
DT V +++ +L + + L+ EN L+K + ++ +++ ++ +
Sbjct: 57 PADTLRTLVAEVKELRKRLAKLISENEALKAENERLQKREQSIDQQIQQ---AVQSE--- 110
Query: 178 TQEAVADLERNKKELEQTIQRKDKEIASLTAKLED 212
TQE ++E+ K E +Q I L +L
Sbjct: 111 TQELTKEIEQLKSERQQLQGL----IDQLQRRLAG 141
>gnl|CDD|151277 pfam10828, DUF2570, Protein of unknown function (DUF2570). This is
a family of proteins with unknown function.
Length = 110
Score = 29.1 bits (65), Expect = 2.0
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 185 LERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEV 238
L K QTI+++ K LT LE E+ V + QK E++ + EE EEV
Sbjct: 30 LRAENKAQAQTIEQQQKANQRLTDALEQERQAVEEQQKIANEIERKAEENAEEV 83
>gnl|CDD|152929 pfam12495, Vip3A_N, Vegetative insecticide protein 3A N terminal.
This family of proteins is found in bacteria. Proteins
in this family are typically between 170 and 789 amino
acids in length. The family is found in association with
pfam02018. Vip3A represents a novel class of proteins
insecticidal to lepidopteran insect larvae.
Length = 177
Score = 30.0 bits (67), Expect = 2.1
Identities = 26/101 (25%), Positives = 46/101 (45%), Gaps = 6/101 (5%)
Query: 369 QHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQV-----SQLSKIKIS 423
Q LN++ GKLD N SLND A+ L+ E S + ++ ++QV ++L I
Sbjct: 44 QQLLNDISGKLDGVNGSLNDL-IAQGNLNTELSKEILKIANEQNQVLNDVNNKLDAINTM 102
Query: 424 LTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLE 464
L L + + ++ L + L + I ++L+
Sbjct: 103 LRVYLPKITSMLSDVMKQNYALSLQIEYLSKQLQEISDKLD 143
Score = 28.1 bits (62), Expect = 8.5
Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 6/65 (9%)
Query: 11 QHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQV-----SQLSKIKIS 65
Q LN++ GKLD N SLND A+ L+ E S + ++ ++QV ++L I
Sbjct: 44 QQLLNDISGKLDGVNGSLNDL-IAQGNLNTELSKEILKIANEQNQVLNDVNNKLDAINTM 102
Query: 66 LTTQL 70
L L
Sbjct: 103 LRVYL 107
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 29.5 bits (67), Expect = 2.2
Identities = 18/92 (19%), Positives = 51/92 (55%), Gaps = 4/92 (4%)
Query: 178 TQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEE 237
+ A LE+ K+L+ +Q+K+KE+ KL+ + + +++ K Q +++ ++E
Sbjct: 33 GKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATL--SEEARKAKQQELQQKQQE 90
Query: 238 VEAERQSRAKAEKQRADLARELEELGERLEEA 269
++ ++Q A ++ + L+ + +++++A
Sbjct: 91 LQQKQQ--AAQQELQQKQQELLQPIYDKIDKA 120
>gnl|CDD|224202 COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and
metabolism].
Length = 533
Score = 30.7 bits (70), Expect = 2.4
Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 4/73 (5%)
Query: 407 LEEAESQVSQLSK-IKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEE 465
+ + E V +L + IK+ L + L++EE R A ++ NLEH D I E+L E
Sbjct: 368 IRKLEDAVDRLYEEIKLYLARLS--KEGLSEEESRRWAEIIDAAINLEHIGD-IIERLLE 424
Query: 466 EAEGKADLQRQLS 478
A+ K R S
Sbjct: 425 LADKKIANGRAFS 437
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 30.6 bits (69), Expect = 2.4
Identities = 53/280 (18%), Positives = 101/280 (36%), Gaps = 24/280 (8%)
Query: 9 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTT 68
QL +L + + ++DE + A KL +L E S++ +L
Sbjct: 185 QLLEKLKQERNEIDEAEKEY----ATYHKLLESRRAEHARLAELRSELRADRDHIRALRD 240
Query: 69 QLEDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDR 128
+E RL +E +ELT+ E T + L + + + R
Sbjct: 241 AVELWPRL--QEWKQLEQELTRRREELATFPRDGVLRLEKREAHLQKTEAEI--DALLVR 296
Query: 129 LQQEEDARNQLFQQKKKLEQENGGL---KKDIEDLEGDVEKQKRKIEGDLKLTQEAVADL 185
L + +D +QL K+ + Q L +I+ ++ + IE DL+ +EA +
Sbjct: 297 LAELKDLASQLIPAKEAVLQALVRLHQQLSEIKASAFELTETLAGIEADLRDKEEAAGNG 356
Query: 186 ERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSR 245
++ + S +L+ + A + L+E+ E ++
Sbjct: 357 FEAERVHDLRSLECMLRYQSSQRELK-------------QTEAAYCKRLDEKRLFEDEAE 403
Query: 246 AKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREA 285
+A ++ AD E+ E EE A S I+ +
Sbjct: 404 EEARQRLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNL 443
>gnl|CDD|223895 COG0825, AccA, Acetyl-CoA carboxylase alpha subunit [Lipid
metabolism].
Length = 317
Score = 30.3 bits (69), Expect = 2.7
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 10/49 (20%)
Query: 221 QKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEA 269
+K I EL+A+I+EL +A AE+ DL+ E+E L +RL E
Sbjct: 8 EKPIAELEAKIDEL----------KALAEENDVDLSDEIERLEKRLAEL 46
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 30.3 bits (69), Expect = 2.8
Identities = 17/76 (22%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 181 AVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA 240
A + + EL++ +++ +K++A L KLE K + ++ EL+ ++ LE+ ++
Sbjct: 236 AYLNFKEYLDELQEKLEKLEKDLAKLEEKLEKNP--NSKKKNKLAELEQQLASLEKRID- 292
Query: 241 ERQSRAKAEKQRADLA 256
E + LA
Sbjct: 293 EAKELIAKYGNEVPLA 308
Score = 28.8 bits (65), Expect = 8.3
Identities = 15/55 (27%), Positives = 32/55 (58%), Gaps = 5/55 (9%)
Query: 221 QKQIKELQARIEELEEEVEAERQSRAK-----AEKQRADLARELEELGERLEEAG 270
Q+++++L+ + +LEE++E S+ K E+Q A L + ++E E + + G
Sbjct: 248 QEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELIAKYG 302
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 30.4 bits (69), Expect = 2.9
Identities = 20/89 (22%), Positives = 43/89 (48%), Gaps = 1/89 (1%)
Query: 130 QQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNK 189
Q+ A +L + L++E + +++ L + K +++ LK+ +E +L
Sbjct: 22 QRRRFASVELVDEVIALDKEWRQRQFELDSLRKEFNKLNKEV-AKLKIAKEDATELIAET 80
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVG 218
KEL++ I K+ E+ A L+ + +G
Sbjct: 81 KELKKEITEKEAEVQEAKAALDAKLKTIG 109
>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
Length = 1112
Score = 30.4 bits (69), Expect = 2.9
Identities = 22/98 (22%), Positives = 50/98 (51%), Gaps = 11/98 (11%)
Query: 216 LVG-KNQKQIKELQ---ARIEELEEEVEA--ERQSRAKAEKQRADLARELEELGERLEEA 269
L G K+++ +KE+Q +I+ E+EA + RAK ++ + + + ++ ++EE
Sbjct: 8 LFGDKSERDLKEIQPIVEKIKAEYPELEALSNDELRAKTDEFKQYIKEAVADIDAKIEEL 67
Query: 270 GGATSAQIELNKKRE--AEMSKLRRDLEEANIQHESTL 305
++++++ + A++ KL +EA E L
Sbjct: 68 KAEAIESLDIDEREDIYAQIDKLE---KEAYEILEKVL 102
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 29.9 bits (68), Expect = 2.9
Identities = 18/59 (30%), Positives = 32/59 (54%)
Query: 178 TQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEE 236
+Q+ + +EL I++ KEI +L + Q LV Q++I LQ +IE++E+
Sbjct: 41 SQKKIDKWADQTQELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQIEQIEK 99
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 29.6 bits (67), Expect = 2.9
Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 1/58 (1%)
Query: 203 IASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRA-DLAREL 259
++ KL +Q + + QK +KE Q E +E + ++ + + ++ D REL
Sbjct: 60 YITILQKLLIDQEKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQREL 117
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 30.2 bits (69), Expect = 3.0
Identities = 69/294 (23%), Positives = 126/294 (42%), Gaps = 48/294 (16%)
Query: 180 EAVADLERNKKELEQTIQRK--------DKEIASLTAKLEDEQGLVGKNQKQIKELQARI 231
+ +LE +EL Q + ++EI L +L++ + Q +I L+
Sbjct: 25 QQNKELEAKIEELRQKKSAEPSRLYSLYEQEIRELRKQLDELTNERARLQLEIDNLRLAA 84
Query: 232 EELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLR 291
E+ E+ E E R AE L ++L+E AT A+++L K E+
Sbjct: 85 EDFREKYEDELNLRQSAENDIVGLRKDLDE----------ATLARVDLEMKIESLQ---- 130
Query: 292 RDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAA 351
EE LA L+K H + V E+ QI + + E +AA + T+A A
Sbjct: 131 ---EE--------LAFLKKNHEEEVRELQSQI------QGQVNVEMDAARK-LDLTKALA 172
Query: 352 DQLAREKAATEKIAKQLQHQLNE-VQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEA 410
+ +A E++A++ + + E + KL+E ++ A + E ++L RQ++
Sbjct: 173 EI----RAQYEELAEKNRQEAEEWYKSKLEELQQAAARNGDALRSAKEEITELRRQIQSL 228
Query: 411 ESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLE 464
E ++ L K K SL QL + + + E A LE ++ ++ ++
Sbjct: 229 EIELQSLKKQKASLERQLAELE---ERYELELADYQDTISELEEELQQLKAEMA 279
Score = 29.1 bits (66), Expect = 6.0
Identities = 49/225 (21%), Positives = 89/225 (39%), Gaps = 33/225 (14%)
Query: 342 NETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENS 401
E R D+L E+A +LQ +++ ++ ++ D ++ +
Sbjct: 54 QEIRELRKQLDELTNERA-------RLQLEIDNLRLAAEDFREKYEDELNLRQSAENDIV 106
Query: 402 DLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHD------ 455
L + L+EA L SL +L K+ +EE RE + + N+E D
Sbjct: 107 GLRKDLDEATLARVDLEMKIESLQEELAFLKKNHEEEVRELQSQIQGQVNVEMDAARKLD 166
Query: 456 ----IDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYE--SEGVAR-AEELEESKSE 508
+ IR Q EE AE ++ A+ +++K E + AR + L +K E
Sbjct: 167 LTKALAEIRAQYEELAE--------KNRQEAEEW-YKSKLEELQQAAARNGDALRSAKEE 217
Query: 509 ----ERRVDDLAAELDASQKECRNYSTELFRLKGAYEEAQEQLEA 549
R++ L EL + +K+ + +L L+ YE +
Sbjct: 218 ITELRRQIQSLEIELQSLKKQKASLERQLAELEERYELELADYQD 262
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 29.5 bits (67), Expect = 3.0
Identities = 19/72 (26%), Positives = 42/72 (58%), Gaps = 4/72 (5%)
Query: 131 QEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDL----KLTQEAVADLE 186
Q E +++L Q+ KKLE+E L+K + +LE +E +++ E + K + +A L+
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLK 176
Query: 187 RNKKELEQTIQR 198
+ ++L+ +++
Sbjct: 177 KQNQQLKSQLEQ 188
Score = 28.3 bits (64), Expect = 8.7
Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 17/69 (24%)
Query: 186 ERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSR 245
E+ K ELEQ I++ ++E L +K++ EL+A++E +E+ E ERQ
Sbjct: 119 EQGKSELEQEIKKLEEEKEEL--------------EKRVAELEAKLEAIEKREEEERQIE 164
Query: 246 AKAEKQRAD 254
EK+ AD
Sbjct: 165 ---EKRHAD 170
>gnl|CDD|224380 COG1463, Ttg2C, ABC-type transport system involved in resistance to
organic solvents, periplasmic component [Secondary
metabolites biosynthesis, transport, and catabolism].
Length = 359
Score = 30.1 bits (68), Expect = 3.3
Identities = 24/134 (17%), Positives = 37/134 (27%), Gaps = 4/134 (2%)
Query: 344 TNNTRAAADQLAREKAATEKIAKQLQH---QLNEVQGKLDETNRSLNDFDAAKKKLSIEN 400
+N D L + L L +LD +L AA
Sbjct: 182 LDNLAQFTDALNARDGDIGALIANLNQLLDSLAAASDQLDRLLDNLATLTAALAARRDAL 241
Query: 401 SDLLRQLEEAESQVSQL-SKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNI 459
D L L + V+ L ++ + +L L + + LA L L N+
Sbjct: 242 DDALAALSALAATVNDLLAENRPNLNQALANLRPLATLLVDYLPGLEQLLHGLPTYAANL 301
Query: 460 REQLEEEAEGKADL 473
G L
Sbjct: 302 VGGALGSLPGFFAL 315
>gnl|CDD|217077 pfam02514, CobN-Mg_chel, CobN/Magnesium Chelatase. This family
contains a domain common to the cobN protein and to
magnesium protoporphyrin chelatase. CobN is implicated
in the conversion of hydrogenobyrinic acid a,c-diamide
to cobyrinic acid. Magnesium protoporphyrin chelatase is
involved in chlorophyll biosynthesis.
Length = 1079
Score = 30.2 bits (69), Expect = 3.3
Identities = 25/130 (19%), Positives = 41/130 (31%), Gaps = 18/130 (13%)
Query: 441 ERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQL--WRTKYESEGVAR 498
RA L G+ LE ID RE + + L+ ++ + + L + E + R
Sbjct: 494 TRAGLYGELAELEELIDEYREADGLDPARREALREEILELAEELGLDRDLGEDFEEFLER 553
Query: 499 AE----ELEESK------------SEERRVDDLAAELDASQKECRNYSTELFRLKGAYEE 542
EL+ES+ E V+ L A L E
Sbjct: 554 LHDYLCELKESQIPDGLHVFGEPPEGEELVELLLAILRDPNGGLVPSLAEALAEDLGLRG 613
Query: 543 AQEQLEAVRR 552
+ +A+
Sbjct: 614 DRTAGDALEL 623
>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug
resistance protein A). [Transport and binding proteins,
Other].
Length = 334
Score = 29.8 bits (67), Expect = 3.5
Identities = 39/150 (26%), Positives = 63/150 (42%), Gaps = 16/150 (10%)
Query: 343 ETNNTRAAADQLAREKAATEKIAKQLQHQLNEV-QG----KLDETNRSLNDFDAAKKKLS 397
T++ A+QL + + + + V QG +LD TN L A+ L+
Sbjct: 32 STDDAYVKANQLQVSSQVSGSVIEVNVDDTDYVKQGDVLVRLDPTNAELA-LAKAEANLA 90
Query: 398 IENSDLLRQLEEAESQVSQLSK----IKISLTTQLEDTKRLADEEGRERATLLGKFRNLE 453
L+RQ ++ E V QL +KI L Q + A+ + R R L K
Sbjct: 91 ----ALVRQTKQLEITVQQLQAKVESLKIKL-EQAREKLLQAELDLRRRVPLFKKGLISR 145
Query: 454 HDIDNIRE-QLEEEAEGKADLQRQLSKANA 482
++D+ R+ L +A A +Q QL+ A
Sbjct: 146 EELDHARKALLSAKAALNAAIQEQLNANQA 175
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 30.4 bits (68), Expect = 3.6
Identities = 34/152 (22%), Positives = 62/152 (40%), Gaps = 4/152 (2%)
Query: 408 EEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEA 467
EE E+ V Q + + +++E K D RE TL N++ + L +
Sbjct: 298 EEQETLVRQFTVEQTQAKSKVESAKIETDR-AREMETLAHD--NVKQIVGAQHGILSAKR 354
Query: 468 EGKADLQRQLSKANADAQLWRTKYESEGVARAEE-LEESKSEERRVDDLAAELDASQKEC 526
EG D +R +S A A AEE LE + + VD+ A +A +
Sbjct: 355 EGAVDKRRTISTARAGLDALVKGLGGAAPESAEELLELNNAARLTVDEYPAAREALESAG 414
Query: 527 RNYSTELFRLKGAYEEAQEQLEAVRRENKNLA 558
+ + R ++ A ++L ++ + + N+
Sbjct: 415 QRNVEDRTRAVDEFKAADQELSSLSKGSSNIE 446
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 29.8 bits (68), Expect = 3.7
Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 12/59 (20%)
Query: 184 DLERNKKELEQTIQRKDKE------IASLTAKLEDEQGLVGKNQKQIKELQARIEELEE 236
DL++ KKEL + + + + LT K E E K +K+I+EL+ I ELE+
Sbjct: 361 DLKKAKKELIEELSEIQADYLLDMRLRRLT-KEEIE-----KLEKEIEELEKEIAELEK 413
>gnl|CDD|227318 COG4985, COG4985, ABC-type phosphate transport system, auxiliary
component [Inorganic ion transport and metabolism].
Length = 289
Score = 29.9 bits (67), Expect = 3.8
Identities = 23/100 (23%), Positives = 47/100 (47%), Gaps = 7/100 (7%)
Query: 128 RLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLER 187
RL + E R+Q+ K +EQ+ + +E L +KR+++ + +L E
Sbjct: 169 RLLEVETLRDQVD---KMVEQQVRVINSQLERLR----LEKRRLQLNGQLDDEFQQHYVA 221
Query: 188 NKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKEL 227
K EL++ + + E+ +L A+LE + + Q + +
Sbjct: 222 EKSELQKRLAQLQTELDALRAELERQFLYLVDMQGETVSV 261
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 30.2 bits (68), Expect = 3.8
Identities = 55/241 (22%), Positives = 105/241 (43%), Gaps = 20/241 (8%)
Query: 224 IKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKR 283
+KE Q +EL + + ++++ L +E E+L ERL + G++ +L K+
Sbjct: 181 LKEAQEEKDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKK 240
Query: 284 EAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNE 343
+ L+E N + E+ + R K + E+ E + ++L T E A +E
Sbjct: 241 YNLLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDEL-TSLAAESQALKDE 299
Query: 344 TNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDL 403
+ R ++D+ AK+L+ Q+ + KL++ LND K L N+
Sbjct: 300 IDVLRESSDK-----------AKKLEAQVETYKKKLED----LNDLRRQVKLLEERNAMY 344
Query: 404 LRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQL 463
++ + E ++ + + + QLE KR E + + K LE + + E+L
Sbjct: 345 MQNTVQLEEELKKAN----AARGQLETYKRQVQELHAKLSEESKKADKLEFEYKRLEEKL 400
Query: 464 E 464
E
Sbjct: 401 E 401
>gnl|CDD|189014 cd09607, M3B_PepF_2, Peptidase family M3B Oligopeptidase F (PepF).
Peptidase family M3B Oligopeptidase F (PepF;
Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
includes oligoendopeptidase F from Lactococcus lactis.
This enzyme hydrolyzes peptides containing between 7 and
17 amino acids with fairly broad specificity. The PepF
gene is duplicated in L. lactis on the plasmid that
bears it, while a shortened second copy is found in
Bacillus subtilis. Most bacterial PepFs are cytoplasmic
endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus
lactis has been identified in Bacillus subtilis as an
inhibitor of sporulation initiation when over expressed
from a multicopy plasmid.
Length = 581
Score = 30.2 bits (69), Expect = 3.8
Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 6/92 (6%)
Query: 221 QKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGG------ATS 274
Q+ +K+L+ I+EL+ ++A S A + L L+EL E L G +
Sbjct: 16 QEDLKKLKQLIDELKSLLDALLPSDEDAVETLEALITLLQELLELLRTLGAFVSCLLSAD 75
Query: 275 AQIELNKKREAEMSKLRRDLEEANIQHESTLA 306
E + +++++L LE+ + LA
Sbjct: 76 TTDEEAQALLSQLNQLSASLEQLLTLLDKKLA 107
>gnl|CDD|220767 pfam10459, Peptidase_S46, Peptidase S46. Dipeptidyl-peptidase 7
(DPP-7) is the best characterized member of this family.
It is a serine peptidase that is located on the cell
surface and is predicted to have two N-terminal
transmembrane domains.
Length = 696
Score = 29.9 bits (68), Expect = 3.9
Identities = 39/221 (17%), Positives = 64/221 (28%), Gaps = 54/221 (24%)
Query: 220 NQKQIKELQARIEELEEEV----EAERQSRAK------------------------AEKQ 251
+I+ L RI LE + E + +K A KQ
Sbjct: 269 YPARIERLGERIALLEAAMLKDPEIRIKYASKLAGLANYWKNSIGMLEGLKDLDLLARKQ 328
Query: 252 R--ADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLR 309
A L +++ R + G A ++ E+++ LEEA L +
Sbjct: 329 AREAALRAWVKKDPARGAKYGDALD-ELAALYAERRELARRYFYLEEAFRS--GELLS-- 383
Query: 310 KKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTR---AAADQLAREKAATEKIAK 366
+ +L K + K + ER + E + R A +K
Sbjct: 384 ---------AARTLVRLAKEREKPDAEREPGYQERDLPRLEQQLERIDKPYDAEVDK--A 432
Query: 367 QLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQL 407
L L E + L + A K L E+
Sbjct: 433 VLAAMLEEYRELLGA-----DRLPAVDKVLGKESKGDPEAY 468
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 29.6 bits (67), Expect = 4.0
Identities = 23/83 (27%), Positives = 49/83 (59%), Gaps = 8/83 (9%)
Query: 153 LKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLED 212
L++ I+ L+ +++ K+ I L E ++DL K++L+ +R ++++ + + +
Sbjct: 282 LQEKIKALKYQLKRLKKMI-----LLFEMISDL---KRKLKSKFERDNEKLDAEVKEKKK 333
Query: 213 EQGLVGKNQKQIKELQARIEELE 235
E+ K +KQI+ L+ RIE+LE
Sbjct: 334 EKKKEEKKKKQIERLEERIEKLE 356
>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K. CENP-K is one
of seven new CENP-A-nucleosome distal (CAD) centromere
components (the others being CENP-L, CENP-O, CENP-P,
CENP-Q, CENP-R and CENP-S) that are identified as
assembling on the CENP-A nucleosome associated complex,
NAC. The CENP-A NAC is essential, as disruption of the
complex causes errors of chromosome alignment and
segregation that preclude cell survival despite
continued centromere-derived mitotic checkpoint
signalling. CENP-K is centromere-associated through its
interaction with one or more components of the CENP-A
NAC.
Length = 273
Score = 29.5 bits (66), Expect = 4.1
Identities = 29/109 (26%), Positives = 49/109 (44%), Gaps = 6/109 (5%)
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
+L++ +++LE + + L LE EQ + + Q+ + L R EL+ +V +
Sbjct: 98 ELQKLRQDLEMVLSSVQSKNEKLKEDLEREQQWLDEQQQILDSLNERQSELKNQVVTFSE 157
Query: 244 SR------AKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 286
SR K + + + L LGE LEE AQ + KKR+
Sbjct: 158 SRIFQELKTKMLRIKEYKEKLLSALGEFLEEHFPLPEAQASIKKKRKGI 206
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 29.7 bits (67), Expect = 4.1
Identities = 30/124 (24%), Positives = 55/124 (44%), Gaps = 19/124 (15%)
Query: 194 QTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAE--------RQSR 245
TIQ ++ A + ++ D +G Q+ L+A++ EL +++ AE
Sbjct: 285 PTIQDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAELRQQIAAELRQILASLPNEL 344
Query: 246 AKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTL 305
A E+Q A L +EL +L RL + K + ++ +L R+ E A +E+ L
Sbjct: 345 ALLEQQEAALEKELAQLKGRLSKLP-----------KLQVQLRELEREAEAARSLYETLL 393
Query: 306 ANLR 309
+
Sbjct: 394 QRYQ 397
Score = 28.9 bits (65), Expect = 8.9
Identities = 39/214 (18%), Positives = 68/214 (31%), Gaps = 23/214 (10%)
Query: 351 ADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSI---------ENS 401
ADQL + A + + L +L E++ +L E + DF A + S
Sbjct: 183 ADQLEAQLEAFRRASDSLDERLEELRARLQEAEAQVEDFRAQHGLTDAARGQLLSEQQLS 242
Query: 402 DLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIRE 461
L QL+ A ++++Q SL L + A + + R +
Sbjct: 243 ALNTQLQSARARLAQAEARLASLLQLLPLGREAAALREVLESPTIQDLRQQYAQVRQQIA 302
Query: 462 QLEEEAEGKAD----LQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAA 517
L E K L+ QL++ + + L E++ L
Sbjct: 303 DLSTELGAKHPQLVALEAQLAELRQQIAAELRQILASLPNELALL------EQQEAALEK 356
Query: 518 ELDASQKECRNYSTELFRLKGAYEEAQEQLEAVR 551
EL + + E + + EA R
Sbjct: 357 ELAQLKGRLSKLPKLQVQ----LRELEREAEAAR 386
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 29.3 bits (66), Expect = 4.2
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 224 IKELQARIEELEEEVEAERQ 243
+K+ ARIEELE + + +
Sbjct: 57 LKQANARIEELEAQAQHPQS 76
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 29.9 bits (68), Expect = 4.3
Identities = 12/49 (24%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 221 QKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEA 269
++ Q + + +E + + + +KA+K ++ + R LE+L RL E
Sbjct: 250 AAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKR-LEKLEARLAEE 297
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 29.9 bits (68), Expect = 4.3
Identities = 16/85 (18%), Positives = 36/85 (42%), Gaps = 4/85 (4%)
Query: 137 NQLFQQKKKLEQENGGLK-KDIEDLEGDVEKQKRKIE-GDLKLTQEAVADLERN--KKEL 192
+L ++ K+LE+E LK K D+ + E G +K+ + V + ++
Sbjct: 735 ERLLEELKELEKELERLKKKLAAAELADLLSNAKAEEIGGVKVLAKEVDGADMKELREIA 794
Query: 193 EQTIQRKDKEIASLTAKLEDEQGLV 217
+ ++ + L + + + LV
Sbjct: 795 DDLKKKLGSAVIVLASVADGKVSLV 819
Score = 29.5 bits (67), Expect = 5.2
Identities = 31/127 (24%), Positives = 53/127 (41%), Gaps = 17/127 (13%)
Query: 191 ELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEK 250
E Q + KEIASL + V + +++KEL+ +E L+++ A
Sbjct: 708 EYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELEKELERLKKK---------LAAA 758
Query: 251 QRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRK 310
+ ADL + EE GG E++ A+M +LR ++ + S + L
Sbjct: 759 ELADLLSNAKA-----EEIGGVKVLAKEVD---GADMKELREIADDLKKKLGSAVIVLAS 810
Query: 311 KHNDAVS 317
+ VS
Sbjct: 811 VADGKVS 817
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 30.0 bits (67), Expect = 4.4
Identities = 30/219 (13%), Positives = 70/219 (31%), Gaps = 2/219 (0%)
Query: 127 DRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADL- 185
A + QE ++ E ++++R I +L E
Sbjct: 381 LAQDDLTQAELNYAAADQAANQEGALNAREDEAEVLSTQEERRDILKNLLADAEKRTARQ 440
Query: 186 -ERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQS 244
E NK + I + DK + + + + Q ++ + + A +Q
Sbjct: 441 EELNKALAKAKILQADKAAKAYQEDILQREAQSRGKTAAAERSQEQMTAALKALLAFQQQ 500
Query: 245 RAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHEST 304
A + + + L + E+ + + + + + +++ E Q
Sbjct: 501 IADLSGAKEKASDQKSLLWKAEEQYALLKEEAKQRQLQEQKALLEHKKETLEYTSQLAEL 560
Query: 305 LANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNE 343
L + + G Q ++ + L +A + AA N+
Sbjct: 561 LDQQADRFELSAQAAGSQKERGSDLYREALAQNAAALNK 599
>gnl|CDD|215286 PLN02523, PLN02523, galacturonosyltransferase.
Length = 559
Score = 29.8 bits (67), Expect = 4.5
Identities = 35/142 (24%), Positives = 61/142 (42%), Gaps = 24/142 (16%)
Query: 368 LQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQ 427
L+ +L++++ + D+ +N + A +KL ++NS LLR + + L K S
Sbjct: 76 LKTRLDQIRKQADDHRTLVNAYAAYARKLKLDNSKLLRLFADLSRNFTDLIS-KPSYRAL 134
Query: 428 LEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQLEEEAEGKADLQRQL---SKANADA 484
L DE D +R Q E+E + + + RQ+ SK + D
Sbjct: 135 LSSDGSAIDE-------------------DVLR-QFEKEVKERVKVARQMIAESKESFDN 174
Query: 485 QLWRTKYESEGVARAEELEESK 506
QL K + A E+L ++K
Sbjct: 175 QLKIQKLKDTIFAVNEQLTKAK 196
>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3. Aip3p/Bud6p is a
regulator of cell and cytoskeletal polarity in
Saccharomyces cerevisiae that was previously identified
as an actin-interacting protein. Actin-interacting
protein 3 (Aip3p) localizes at the cell cortex where
cytoskeleton assembly must be achieved to execute
polarized cell growth, and deletion of AIP3 causes gross
defects in cell and cytoskeletal polarity. Aip3p
localization is mediated by the secretory pathway,
mutations in early- or late-acting components of the
secretory apparatus lead to Aip3p mislocalization.
Length = 426
Score = 29.6 bits (67), Expect = 4.5
Identities = 32/168 (19%), Positives = 70/168 (41%), Gaps = 18/168 (10%)
Query: 185 LERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQS 244
L N +EL++ + D EI +L +L++ + + ++ I+ L+ R + + S
Sbjct: 69 LVLNVEELDEVKKHIDDEIDTLQNELDEVKQALESQREAIQRLKERQQNSAANIARPAAS 128
Query: 245 RAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHEST 304
+ + A + +LNK++ AE+ L+R+L H S
Sbjct: 129 PSPVLASSS--------------SAISLANNPDKLNKEQRAELKSLQRELAVLRQTHNSF 174
Query: 305 LANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAAD 352
+++ + ++ E+ID+ A N A+ E++ + + D
Sbjct: 175 FTEIKES----IKDILEKIDKFKSSSLSASGSSNRAYVESSKKKLSED 218
>gnl|CDD|188098 TIGR00957, MRP_assoc_pro, multi drug resistance-associated protein
(MRP). This model describes multi drug
resistance-associated protein (MRP) in eukaryotes. The
multidrug resistance-associated protein is an integral
membrane protein that causes multidrug resistance when
overexpressed in mammalian cells. It belongs to ABC
transporter superfamily. The protein topology and
function was experimentally demonstrated by epitope
tagging and immunofluorescence. Insertion of tags in the
critical regions associated with drug efflux, abrogated
its function. The C-terminal domain seem to highly
conserved [Transport and binding proteins, Other].
Length = 1522
Score = 29.9 bits (67), Expect = 4.6
Identities = 27/115 (23%), Positives = 46/115 (40%), Gaps = 12/115 (10%)
Query: 316 VSEMGEQIDQLNKLKTKAEHERNAA------HNETNNTRAA------ADQLAREKAATEK 363
+SEMG + L + AE R A H E + T A + T+
Sbjct: 839 ISEMGSYQELLQRDGAFAEFLRTYAPDEQQGHLEDSWTALVSGEGKEAKLIENGMLVTDV 898
Query: 364 IAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLS 418
+ KQLQ QL+ + +R +K + E + L + ++A++ +LS
Sbjct: 899 VGKQLQRQLSASSSDSGDQSRHHGSSAELQKAEAKEETWKLMEADKAQTGQVELS 953
>gnl|CDD|130083 TIGR01010, BexC_CtrB_KpsE, polysaccharide export inner-membrane
protein, BexC/CtrB/KpsE family. This family contains
gamma proteobacterial proteins involved in capsule
polysaccharide export [Transport and binding proteins,
Carbohydrates, organic alcohols, and acids].
Length = 362
Score = 29.7 bits (67), Expect = 4.7
Identities = 45/209 (21%), Positives = 74/209 (35%), Gaps = 30/209 (14%)
Query: 303 STLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATE 362
S + L DA E ++I+Q + E ER N RA D +A
Sbjct: 130 SGILTLNVTAFDA--EEAQKINQ----RLLKEGER---LINRLNERARKDTIAF------ 174
Query: 363 KIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKI 422
++++ E + +L+ T L + K D Q S +S L I
Sbjct: 175 -----AENEVKEAEQRLNATKAELLKYQIKNKVF-----DPKAQSSAQLSLISTLEGELI 224
Query: 423 SLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQL-----EEEAEGKADLQRQL 477
+ QL + + E+ + +L + ++L ID R QL + E AD QR +
Sbjct: 225 RVQAQLAQLRSITPEQNPQVPSLQARIKSLRKQIDEQRNQLSGGLGDSLNEQTADYQRLV 284
Query: 478 SKANADAQLWRTKYESEGVARAEELEESK 506
+ Q + S R E +
Sbjct: 285 LQNELAQQQLKAALTSLQQTRVEADRQQL 313
>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component. Nup88 can be divided
into two structural domains; the N-terminal two-thirds
of the protein has no obvious structural motifs but is
the region for binding to Nup98, one of the components
of the nuclear pore. the C-terminal end is a predicted
coiled-coil domain. Nup88 is overexpressed in tumour
cells.
Length = 717
Score = 29.8 bits (67), Expect = 4.7
Identities = 25/120 (20%), Positives = 51/120 (42%), Gaps = 16/120 (13%)
Query: 226 ELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREA 285
E Q R++ L+ + E + + ++R L+ E+L E+ EEA +E
Sbjct: 562 EFQRRVKLLQLQKEKQLEDIQDCREERKSLSERAEKLAEKFEEA----------KYNQEL 611
Query: 286 EMSKLRRDLEEANIQ------HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNA 339
+++ +R L+ AN Q E ++ ++ N + + I Q+ K K + +
Sbjct: 612 LVNRCKRLLQSANSQLPVLSDSERDMSKELQRINKQLQHLANGIKQVKKKKNYQRYHMAS 671
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 29.5 bits (67), Expect = 5.0
Identities = 27/103 (26%), Positives = 43/103 (41%), Gaps = 22/103 (21%)
Query: 190 KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAE 249
++LE++ QR + +A L E + L + + I +L I E R KAE
Sbjct: 78 EQLEESRQRLSRLVAKLEEMRERDLELNVQLKDNIAQLNQEIAE-----------REKAE 126
Query: 250 KQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 292
+ R + +L+ + EE QIEL + S L R
Sbjct: 127 EARQEAFEQLKNEIKEREE------TQIEL-----EQQSSLLR 158
>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
Length = 1213
Score = 29.9 bits (67), Expect = 5.2
Identities = 53/372 (14%), Positives = 117/372 (31%), Gaps = 48/372 (12%)
Query: 154 KKDIEDLEGDVEKQKR---KIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKL 210
K+I L+ ++ + +E KLT++ ++ + + L + ++++ K L ++
Sbjct: 21 VKNINVLKSSIKDSTQFWKMLEKQQKLTKDGLSASKGKYEGLSEAMEKQKKAYEDLKQEV 80
Query: 211 EDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAG 270
++ ++K +E A+ + E Q A
Sbjct: 81 KEVNRATQASKKAYQEYNAQYTQAE-------NKLRSLSGQ--------------FGVAS 119
Query: 271 GATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLK 330
Q + ++ + +S L + + A L ++ + ++ L +
Sbjct: 120 EQLMLQQKEIQRLQYAISTLNK--------SMAAQARLLEQTGNKFGTADAKVVGLRESF 171
Query: 331 TKAEHERNAAHNETNNTRAAADQLARE-KAATEKIAKQLQHQLNEVQGKLDETNRSLNDF 389
+ N T A + E + + +L E + ++ +++ L
Sbjct: 172 GRQTEALNKQLERTKKVADALTYVLDEAQQKLSQALSARLERLQESRTQMSQSSGQLGKR 231
Query: 390 DAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGK- 448
K + L L A + + T Q+ ++ + LGK
Sbjct: 232 LETDKA----GAGALGLLGAALAGSFAAIGAAVRRTAQMNGELMDKTKQVKGARDNLGKV 287
Query: 449 ---FRNLEHDIDNIREQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEES 505
+ I E + D R+LSKA A + +E + +
Sbjct: 288 TSQGEEMSDAIQETAEHI-------KDSGRELSKAAAIGAAAAMGVAAGKFPTGQEAKPT 340
Query: 506 KSEERRVDDLAA 517
E RV L A
Sbjct: 341 LKEMARVAALTA 352
>gnl|CDD|220640 pfam10234, Cluap1, Clusterin-associated protein-1. This protein is
conserved from worms to humans. The protein of 413 amino
acids contains a central coiled-coil domain, possibly
the region that binds to clusterin. Cluap1 expression is
highest in the nucleus and gradually increases during
late S to G2/M phases of the cell cycle and returns to
the basal level in the G0/G1 phases. In addition, it is
upregulated in colon cancer tissues compared to
corresponding non-cancerous mucosa. It thus plays a
crucial role in the life of the cell.
Length = 269
Score = 29.2 bits (66), Expect = 5.3
Identities = 30/115 (26%), Positives = 56/115 (48%), Gaps = 5/115 (4%)
Query: 176 KLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELE 235
++T+ A + KEL+ + + K ++ E E+ L ++ IK L AR+++L+
Sbjct: 130 EITERGAALYDLLGKELDLR-ESRQKALSRPLELAEVEKAL----KEAIKNLAARLQQLQ 184
Query: 236 EEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 290
E++ + A E + +ELE +RL+ A +E +K E E+ KL
Sbjct: 185 AELDNLKSDEANLEAKIERKKQELERTQKRLQALQSVRPAFMEEYEKLEEELQKL 239
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 29.5 bits (66), Expect = 5.3
Identities = 46/264 (17%), Positives = 96/264 (36%), Gaps = 14/264 (5%)
Query: 209 KLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEE 268
++E E + + +K+ + +E EE+ K E RADL ++ E ++
Sbjct: 487 EIEIEVKDIDEKIVDLKKRKEYLES--EEINKSINEYNKIESARADLEDIKIKINELKDK 544
Query: 269 AGGATSAQIELNKKREAEMSKLRRDLEEANIQ--------HESTLANLRKKHNDAVSEMG 320
+ + ++ R A + S ++K+ ND S +
Sbjct: 545 HDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLESRLQ 604
Query: 321 EQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLD 380
E K+ + NE NN +++ K EK+ ++ + ++ D
Sbjct: 605 EIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEI-D 663
Query: 381 ETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQL-SKIKISLTTQLEDTKRLADEEG 439
L + + + + L++A++ ++L S I+I T E + R+ D
Sbjct: 664 SIIPDLKEITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDI-- 721
Query: 440 RERATLLGKFRNLEHDIDNIREQL 463
E + K + D+ +RE
Sbjct: 722 NETLESMKKIKKAIGDLKRLREAF 745
Score = 29.5 bits (66), Expect = 6.7
Identities = 20/119 (16%), Positives = 50/119 (42%), Gaps = 3/119 (2%)
Query: 184 DLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQ 243
DLE +E+E I ++E+E + +I+E + IE+L +++ ++
Sbjct: 598 DLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKK 657
Query: 244 SRAKA---EKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANI 299
A+ ++ + ++ + L+++ A + E+ + LR + E +
Sbjct: 658 QIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRINELSD 716
>gnl|CDD|221201 pfam11744, ALMT, Aluminium activated malate transporter.
Length = 399
Score = 29.6 bits (67), Expect = 5.3
Identities = 29/113 (25%), Positives = 51/113 (45%), Gaps = 13/113 (11%)
Query: 23 ETNRSL-NDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTT---QLEDTKRLAD 78
+ L N F +++SIE + +LR+L E+ ++ +LS I+I L E+ + D
Sbjct: 285 QAPPELRNKFKEPCQRVSIEGAKVLRELAESIKKMRKLSPIEILLDNVHVAAEELQSSID 344
Query: 79 EESSGWVEELTKGSIEAITLKSLDL-DVLNFTTENSHSFSISLYSQDTVDRLQ 130
+S ++ K + KS +L + L+ T S I + V RL
Sbjct: 345 SKSYLFLNS-NKQLSKEEEKKSYELSEALSLAT--FASLLI-----EFVARLD 389
>gnl|CDD|224085 COG1163, DRG, Predicted GTPase [General function prediction only].
Length = 365
Score = 29.6 bits (67), Expect = 5.3
Identities = 19/67 (28%), Positives = 32/67 (47%), Gaps = 11/67 (16%)
Query: 195 TIQRKDKEIASLTAKLEDEQGLVGKN---QKQIKELQARIEELEEEVEAERQSRAKAEKQ 251
TI+ K K I E+E KN + I L+A++ EL EE+E +R+S++
Sbjct: 3 TIEEKIKAI-------EEEIARTPKNKATEHHIGLLKAKLAELREELE-KRKSKSGGGGS 54
Query: 252 RADLARE 258
+ +
Sbjct: 55 GFAVKKS 61
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 29.5 bits (67), Expect = 5.5
Identities = 32/133 (24%), Positives = 61/133 (45%), Gaps = 21/133 (15%)
Query: 180 EAVADLERNKKELEQTIQ--RKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEE 237
E + +L+R +LE + +K+K+ AS +LED + K++ EL+ +LEE+
Sbjct: 405 EELDELDRRIIQLEIEREALKKEKDEASK-ERLEDLE-------KELAELEEEYADLEEQ 456
Query: 238 VEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSK---LRRDL 294
+AE+ + + E+E++ LE+A + +L K E + K L + L
Sbjct: 457 WKAEKAAIQGI----QQIKEEIEQVRLELEQA----EREGDLAKAAELQYGKLPELEKRL 508
Query: 295 EEANIQHESTLAN 307
+ A +
Sbjct: 509 QAAEAKLGEETKP 521
Score = 29.2 bits (66), Expect = 7.9
Identities = 17/76 (22%), Positives = 38/76 (50%), Gaps = 6/76 (7%)
Query: 123 QDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDL----KLT 178
+ + L++E + ++ +K Q +K++IE + ++E+ +R EGDL +L
Sbjct: 440 EKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQVRLELEQAER--EGDLAKAAELQ 497
Query: 179 QEAVADLERNKKELEQ 194
+ +LE+ + E
Sbjct: 498 YGKLPELEKRLQAAEA 513
>gnl|CDD|183125 PRK11413, PRK11413, putative hydratase; Provisional.
Length = 751
Score = 29.6 bits (67), Expect = 5.6
Identities = 20/82 (24%), Positives = 43/82 (52%), Gaps = 8/82 (9%)
Query: 41 EN-SDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESSGWVEELTKGSIEAITLK 99
+N +D+LR++E ES+ K K+SL ++E+ + + G + + G+ E +
Sbjct: 316 QNLTDILREVEI-ESERVAHGKAKLSLLDKIENGRLKVQQ---GIIAGCSGGNYENVIAA 371
Query: 100 SLDLDVLNFTTENSHSFSISLY 121
+ + L + + +FS+S+Y
Sbjct: 372 A---NALRGQSCGNDTFSLSVY 390
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 29.5 bits (66), Expect = 5.7
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 8/65 (12%)
Query: 460 REQLEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELEESKSEERRVDDLAAEL 519
RE+LE E QR+ K A +L R + E A+ E E+ RR+D + A++
Sbjct: 9 REKLERE-------QRE-RKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIEAQI 60
Query: 520 DASQK 524
A Q+
Sbjct: 61 KADQQ 65
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 27.2 bits (61), Expect = 5.7
Identities = 10/33 (30%), Positives = 18/33 (54%)
Query: 219 KNQKQIKELQARIEELEEEVEAERQSRAKAEKQ 251
Q ++ +L+A EELE EV+ + E++
Sbjct: 28 ALQAELAKLKAENEELEAEVKDLKSDPDYIEER 60
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 29.4 bits (67), Expect = 5.8
Identities = 25/133 (18%), Positives = 56/133 (42%), Gaps = 16/133 (12%)
Query: 252 RADLARELEELGERLEEAGGATSAQIELNKKR-------------EAEMSKLRRDLEEAN 298
RA+L + L++L +RL A ++E ++R E +++ ++ L+
Sbjct: 262 RAELLQRLQQLQQRLAR---AMRRRLEQKRQRLDQLARRLKFQSPERLLAQQQQRLDRLQ 318
Query: 299 IQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREK 358
+ + L + + + +++ Q N + ++ E RA QL R++
Sbjct: 319 QRLQRALERRLRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRAMRRQLKRKR 378
Query: 359 AATEKIAKQLQHQ 371
E +A+QL+
Sbjct: 379 QRLEALAQQLEAL 391
>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
Length = 639
Score = 29.4 bits (66), Expect = 6.1
Identities = 41/204 (20%), Positives = 85/204 (41%), Gaps = 33/204 (16%)
Query: 105 VLNFTTENSHSFSISLYSQDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDV 164
VL+ T +++HS + + Q T Q + D + + Q + + E+ + V
Sbjct: 102 VLSATDDDTHSQTDDIIKQVTQKAGQDDSDQQEKNSQSQSASQAESLEHVQQSAQTSEKV 161
Query: 165 -EKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQ 223
EK+ + D + Q + D + Q +D+ I AK+ N
Sbjct: 162 DEKEPLLTKTDKQTDQTVMPDA--------RVRQLRDQLI---KAKVYLSLPATKANPHF 210
Query: 224 IKELQARIEELE-----------------EEVEAERQSRAKAEKQRADLARELEELGERL 266
+EL+ RI+E++ E+++A Q+ AK ++ + D + +++L L
Sbjct: 211 TRELRLRIKEVQRVLGDASKDSDLPKNANEKLKAMEQTLAKGKQMQDDCSIVVKKLRAML 270
Query: 267 EEAGGATSAQIELNKKREAEMSKL 290
A Q+ ++KK+ +++L
Sbjct: 271 HSA----EEQLRVHKKQTMFLTQL 290
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 29.2 bits (66), Expect = 6.2
Identities = 23/80 (28%), Positives = 42/80 (52%), Gaps = 4/80 (5%)
Query: 195 TIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRAD 254
+ + KEI E+E+ L K ++I+E + R+EELE+ + ER+ ++
Sbjct: 298 SGEPSLKEIHEARLN-EEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEI-- 354
Query: 255 LARELEELGERLEEAGGATS 274
++LE+L +RLE+ S
Sbjct: 355 -QKKLEDLEKRLEKLKSNKS 373
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 28.6 bits (64), Expect = 6.2
Identities = 21/88 (23%), Positives = 48/88 (54%), Gaps = 4/88 (4%)
Query: 223 QIKELQARIEE----LEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIE 278
+K+ Q + E LE+E+ + SR + + + A+L R++ EL + A+ +
Sbjct: 100 SLKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDK 159
Query: 279 LNKKREAEMSKLRRDLEEANIQHESTLA 306
+ ++E SK++++L +A ++ ++T A
Sbjct: 160 EIARLKSEASKIKQELVDAELEFQTTAA 187
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 29.1 bits (65), Expect = 6.6
Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 8/102 (7%)
Query: 128 RLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLER 187
+ ++EE A Q E+ LK+ +DLE +++K +E DLKL +E LE+
Sbjct: 506 KKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKK----LEHDLKLKEEECRMLEK 561
Query: 188 NKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQA 229
+EL + Q +KE L + L Q + KN L A
Sbjct: 562 EAQEL-RKYQESEKETEVLMSAL---QAMQDKNLMLENSLSA 599
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 29.3 bits (66), Expect = 6.7
Identities = 40/171 (23%), Positives = 80/171 (46%), Gaps = 8/171 (4%)
Query: 163 DVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQK 222
D Q + I+ D+ +++V ++ + L ++++++ I+ + KL + Q + + K
Sbjct: 44 DNRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNK 103
Query: 223 QIKELQARIEELEEEVEAERQSRAK----AEKQRADLARELEELGERLEEAGGATSAQIE 278
QI EL A I +LE++ A+ + A A +Q +L GE + +
Sbjct: 104 QIDELNASIAKLEQQQAAQERLLAAQLDAAFRQGEHTGLQLILSGEESQRGERILAYFGY 163
Query: 279 LNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKL 329
LN+ R+ +++L++ EE Q L +K + + + EQ Q KL
Sbjct: 164 LNQARQETIAELKQTREELAAQKAE----LEEKQSQQKTLLYEQQAQQQKL 210
>gnl|CDD|224120 COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA
replication, recombination, and repair].
Length = 654
Score = 29.4 bits (66), Expect = 6.8
Identities = 23/107 (21%), Positives = 50/107 (46%)
Query: 156 DIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQG 215
+E L +++ +E DLK ++ LE+ ++L + + E+ +L +L ++
Sbjct: 248 TLERLLKEIQALGETLEKDLKRLEDLADRLEKALEDLRELLIFDVDELGNLRERLREQLS 307
Query: 216 LVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEEL 262
+ K +A +E+L+ E S+ + ++ + L R EEL
Sbjct: 308 SEEAKEALGKLEEALLEKLKNLSELLGLSQNELDRPTSILERLKEEL 354
>gnl|CDD|237292 PRK13182, racA, polar chromosome segregation protein; Reviewed.
Length = 175
Score = 28.5 bits (64), Expect = 6.8
Identities = 23/102 (22%), Positives = 40/102 (39%), Gaps = 17/102 (16%)
Query: 178 TQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEE 237
T+E + LE K ++E+ +D + S E + N E+LE +
Sbjct: 39 TEEDLQLLEYVKSQIEEGQNMQDTQKPSSNDVEETQV-----NTIVQNISSVDFEQLEAQ 93
Query: 238 VEAERQSRAKAEKQRADLA------------RELEELGERLE 267
+ + + E+Q A RE+EE+ ERL+
Sbjct: 94 LNTITRRLDELERQLQQKADDVVSYQLLQHRREMEEMLERLQ 135
>gnl|CDD|213617 TIGR01417, PTS_I_fam, phosphoenolpyruvate-protein
phosphotransferase. This model recognizes a distinct
clade of phophoenolpyruvate (PEP)-dependent enzymes.
Most members are known or deduced to function as the
phosphoenolpyruvate-protein phosphotransferase (or
enzyme I) of PTS sugar transport systems. However, some
species with both a member of this family and a homolog
of the phosphocarrier protein HPr lack a IIC component
able to serve as a permease. An HPr homolog designated
NPr has been implicated in the regulation of nitrogen
assimilation, which demonstrates that not all
phosphotransferase system components are associated
directly with PTS transport.
Length = 565
Score = 29.0 bits (65), Expect = 7.1
Identities = 12/65 (18%), Positives = 28/65 (43%), Gaps = 4/65 (6%)
Query: 216 LVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARE-LEELGERLEEAGGATS 274
L+ K I + ++++E+ ++RAKA++ + + + G +E
Sbjct: 17 LLKKPDLVIDRKKISASQVDQEISRFLKARAKAKEDLETIKTKAGKTFG---QEKAAIFE 73
Query: 275 AQIEL 279
A I +
Sbjct: 74 AHILI 78
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 29.1 bits (66), Expect = 7.2
Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 13/63 (20%)
Query: 219 KNQKQIKELQARIEELEEEVEAERQSRAKAE-------------KQRADLARELEELGER 265
K Q+++++L +E+LE E+EA + A A+ AD +ELE ER
Sbjct: 565 KLQRELEQLPQLLEDLEAEIEALQAQVADADFFSQPHEQTQKVLADLADAEQELEVAFER 624
Query: 266 LEE 268
EE
Sbjct: 625 WEE 627
>gnl|CDD|234592 PRK00045, hemA, glutamyl-tRNA reductase; Reviewed.
Length = 423
Score = 29.0 bits (66), Expect = 7.3
Identities = 19/77 (24%), Positives = 32/77 (41%), Gaps = 6/77 (7%)
Query: 231 IEELEEEVEAERQSRAK-AEKQRADLARELEELGERLEEAGGATSAQIELNKK----REA 285
+++L+E VE R + AEK A + E+ E E L L ++ RE
Sbjct: 300 VDDLQEIVEENLAQRQEAAEKAEAIVEEEVAEFMEWLRSL-EVVPTIRALREQAEEIREE 358
Query: 286 EMSKLRRDLEEANIQHE 302
E+ + + L + E
Sbjct: 359 ELERALKKLGPGEDEEE 375
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 29.1 bits (65), Expect = 7.3
Identities = 13/63 (20%), Positives = 36/63 (57%)
Query: 123 QDTVDRLQQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAV 182
+ +++L +EE R + +++++ E+E ++ D + +VEK++ K++ LK +
Sbjct: 255 RRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLKKASRSA 314
Query: 183 ADL 185
++
Sbjct: 315 DNV 317
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 28.9 bits (65), Expect = 7.3
Identities = 17/70 (24%), Positives = 41/70 (58%)
Query: 178 TQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEE 237
+++L++ KK ++ I+ D +I + +K+++ Q + +++ +IK+LQ I EL+E
Sbjct: 36 QDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKEN 95
Query: 238 VEAERQSRAK 247
+ ++ K
Sbjct: 96 IVERQELLKK 105
>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein. This family
consists of proteins over 500 amino acids long in
Caenorhabditis elegans and several bacteria (Pseudomonas
aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
etc.). The function is unknown.
Length = 525
Score = 28.9 bits (65), Expect = 7.3
Identities = 27/115 (23%), Positives = 47/115 (40%), Gaps = 7/115 (6%)
Query: 132 EEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDL--KLTQEAVADLERNK 189
+L + +K++ + L+ D+ED GD K K D+ LT+ +RN+
Sbjct: 66 SRPDPERLAELRKQIRELEAELR-DLED-RGDALKALAKFLEDIREGLTEPIKDSAKRNE 123
Query: 190 ---KELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEAE 241
KE Q EI L + + + + + +KQ+ ELQ + L
Sbjct: 124 PDLKEWFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNALLTGKSQR 178
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 29.1 bits (66), Expect = 7.5
Identities = 31/174 (17%), Positives = 59/174 (33%), Gaps = 7/174 (4%)
Query: 229 ARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMS 288
A I +E+E + +++A+ E ++A L RE R ++A A +A+ +
Sbjct: 436 AEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARV 495
Query: 289 KLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHNETNNTR 348
K ++ I ++ S + + + E+ AA
Sbjct: 496 KAKKAAATQPIVIKAGAR-------PDNSAVIAAREARKAQARARQAEKQAAAAADPKKA 548
Query: 349 AAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSD 402
A A +AR KA EV K ++ A K +++
Sbjct: 549 AVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAE 602
Score = 28.8 bits (65), Expect = 8.5
Identities = 47/237 (19%), Positives = 83/237 (35%), Gaps = 40/237 (16%)
Query: 165 EKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQI 224
EK R+ + + A A ++K + + R + A+ T + + G N I
Sbjct: 465 EKAARE----ARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVI 520
Query: 225 KELQARIEELEEEVEAERQS---------------RAKAEKQRADLARELEELGERLEEA 269
+AR + ++ + RAKA+K A E E ++
Sbjct: 521 AAREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKK--AAQQAANAEAEEEVDPK 578
Query: 270 GGATSAQIELNKKREAEMS---------KLRRDLEEANIQHESTLANLRKKHNDAVSEMG 320
A +A I K ++A D ++A + A +K A +E
Sbjct: 579 KAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPE 638
Query: 321 EQIDQL----------NKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQ 367
E +D K + A+ + NA E + + AA A +A +K A+Q
Sbjct: 639 EPVDPRKAAVAAAIARAKARKAAQQQANAEPEEAEDPKKAAVAAAIARAKAKKAAQQ 695
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 29.3 bits (65), Expect = 7.8
Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 3/90 (3%)
Query: 274 SAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKA 333
S I+L +R AE + + + + D E+ Q Q + +
Sbjct: 622 SYGIKLPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQD---ELARQFAQTQQQRYGE 678
Query: 334 EHERNAAHNETNNTRAAADQLAREKAATEK 363
+++ + N + AA +LAR+ A T++
Sbjct: 679 QYQHDVPVNAEDADAAAEAELARQFAQTQQ 708
>gnl|CDD|215505 PLN02935, PLN02935, Bifunctional NADH kinase/NAD(+) kinase.
Length = 508
Score = 29.0 bits (65), Expect = 7.9
Identities = 18/41 (43%), Positives = 22/41 (53%), Gaps = 8/41 (19%)
Query: 463 LEEEAEGKADLQRQLSKANADAQLWRTKYESEGVARAEELE 503
L AEGKA A A+A W+ KYE E AR ++LE
Sbjct: 50 LRRVAEGKA-------LAQAEAAEWKRKYELE-RARNQQLE 82
>gnl|CDD|236584 PRK09602, PRK09602, translation-associated GTPase; Reviewed.
Length = 396
Score = 29.0 bits (66), Expect = 8.0
Identities = 36/143 (25%), Positives = 58/143 (40%), Gaps = 37/143 (25%)
Query: 153 LKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLED 212
L+K+ E + +K IE L E ++ L N++ +++ + R+ + K D
Sbjct: 149 LEKNWEKFSRKAQAEKFDIEEALA---EQLSGLGINEEHVKEAL-RELGLPEDPS-KWTD 203
Query: 213 EQGLVGKNQKQIKELQARIEELEEEVEAERQSRAK-----AEKQRADLARELEELGERLE 267
E + EL E+ + +K A K ADL EE ERL+
Sbjct: 204 ED----------------LLELAREL----RKISKPMVIAANK--ADLP-PAEENIERLK 240
Query: 268 EAGG----ATSAQIELNKKREAE 286
E TSA+ EL +R A+
Sbjct: 241 EEKYYIVVPTSAEAELALRRAAK 263
>gnl|CDD|147765 pfam05791, Bacillus_HBL, Bacillus haemolytic enterotoxin (HBL).
This family consists of several Bacillus haemolytic
enterotoxins (HblC, HblD, HblA, NheA, and NheB) which
can cause food poisoning in humans.
Length = 181
Score = 28.1 bits (63), Expect = 8.2
Identities = 16/82 (19%), Positives = 43/82 (52%), Gaps = 7/82 (8%)
Query: 152 GLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLE 211
LK+ + L+G+++ +++++ L+ +L+ K +L + + ++ LT+ L+
Sbjct: 104 TLKEGLTKLQGEIQTNQKEVDQLLE-------ELKDFKNKLAKDTRSFKTDVDQLTSILK 156
Query: 212 DEQGLVGKNQKQIKELQARIEE 233
+ + Q +I+ +Q I+E
Sbjct: 157 GQGAGIPALQNEIETIQGSIKE 178
>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
Length = 536
Score = 29.1 bits (65), Expect = 8.3
Identities = 29/175 (16%), Positives = 67/175 (38%), Gaps = 35/175 (20%)
Query: 104 DVLNFTTENSHSFSISLYSQD-TVDRLQ-----------QEEDARNQLFQQKKKLEQENG 151
D + TE Y++D ++D L+ Q+++ R + ++ +K Q+ G
Sbjct: 307 DEKDVMTEKVKDKERIDYTKDLSIDGLKGKKIGLLFSVDQQDENRKAVAEKIRKDLQDAG 366
Query: 152 GLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLE 211
+ D ++ ++ L E ++ + Q+K+ + SL
Sbjct: 367 AILTD------YIQLNNGGVDNLQTLEYEFKHNVN------DYFSQQKNVPVKSL----- 409
Query: 212 DEQGLVGKNQK----QIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEEL 262
+ ++ N+K +IK Q IE E+ + + + + + +EL+
Sbjct: 410 --EEIIAFNKKDSKRRIKYGQTLIEASEKSAITKDEFEKVVQTSQENAKKELDRY 462
>gnl|CDD|217457 pfam03255, ACCA, Acetyl co-enzyme A carboxylase carboxyltransferase
alpha subunit. Acetyl co-enzyme A carboxylase
carboxyltransferase is composed of an alpha and beta
subunit.
Length = 145
Score = 28.0 bits (63), Expect = 8.4
Identities = 14/48 (29%), Positives = 25/48 (52%), Gaps = 10/48 (20%)
Query: 222 KQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEA 269
K + EL+A+I+EL R A K D++ E+ L ++L++
Sbjct: 7 KPLAELEAKIDEL----------RKLARKNDVDVSDEIHRLEKKLDKL 44
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 28.8 bits (65), Expect = 8.5
Identities = 41/231 (17%), Positives = 92/231 (39%), Gaps = 17/231 (7%)
Query: 11 QHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQL 70
Q E++ + E + + ++ I N L ++ A V L + L
Sbjct: 69 QFSCKELKKYISEGRDLFKEIE---EETLINNPPLFKEYFSASPDVKLLMDKQFQLVKTF 125
Query: 71 EDTKRLADEESSGWVEELTKGSIEAITLKSLDLDVLNFTTENSHSFSISLYSQDTVDRLQ 130
+ A + W +L +G E + L+ ++ ++ DR
Sbjct: 126 --ARLEAKKMWYEWRMKLLEGLKEGLDEN-LEGLKEDYKLLMKELELLNSIKPKLRDRKD 182
Query: 131 QEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNKK 190
E+ QL Q + +LE D +L+ EK K K+ ++ + + + +LE +
Sbjct: 183 ALEEELRQLKQLEDELED------CDPTELDRAKEKLK-KLLQEIMIKVKKLEELEEELQ 235
Query: 191 ELEQTIQRKDKEIASLTAKLEDEQGLVGKNQ----KQIKELQARIEELEEE 237
ELE I+ + + L ++ + + + + + K+I++L+ +++ L+
Sbjct: 236 ELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFKEIEKLKEQLKLLQSL 286
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 28.7 bits (65), Expect = 8.6
Identities = 40/163 (24%), Positives = 80/163 (49%), Gaps = 16/163 (9%)
Query: 130 QQEEDARNQLFQQKKKLEQENGGLKKDIEDLEGDVEKQKRKIEGDLKLTQEAVADLERNK 189
Q+ D ++ + +KLE++ L+K++E+ E + ++ G+L LT A +++
Sbjct: 292 QKRSDLEKRVENELEKLEKKLEKLEKELEEAE---NAENYRLYGEL-LTANLYA-IKKGM 346
Query: 190 KELEQT--IQRKDKEIA---SLTAKLEDEQGLVGKNQKQ---IKELQARIEELEEEVEAE 241
KE+E ++ I + + E+ Q K +K ++ ++ +IEE +EE+E
Sbjct: 347 KEVELPNYYTGEEITIPLDPAKSPS-ENAQKYYKKYKKLKRAVEAVEEQIEETKEEIEYL 405
Query: 242 RQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKRE 284
A+ E A+ +LEE+ E L E G + + KK++
Sbjct: 406 ESVEAQLEN--AESLEDLEEIREELIEQGYLKEKKRKKKKKKK 446
>gnl|CDD|224493 COG1577, ERG12, Mevalonate kinase [Lipid metabolism].
Length = 307
Score = 28.4 bits (64), Expect = 8.8
Identities = 30/118 (25%), Positives = 45/118 (38%), Gaps = 20/118 (16%)
Query: 164 VEKQKRKIEGDLKLTQEAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQ 223
V + +E + ++ + + +E E +Q D E +L + L+ NQ
Sbjct: 188 VAGVAKLLEEEPEVIDPILDAIGELVQEAEAALQTGDFE------ELGE---LMNINQGL 238
Query: 224 IKELQARIEELEEEVEAERQSRAKAEKQR-----------ADLARELEELGERLEEAG 270
+K L EL+E VEA R A K A E L RLE+AG
Sbjct: 239 LKALGVSTPELDELVEAARSLGALGAKLTGAGGGGCIIALAKNEEIAETLSNRLEKAG 296
>gnl|CDD|130078 TIGR01005, eps_transp_fam, exopolysaccharide transport protein
family. The model describes the exopolysaccharide
transport protein family in bacteria. The transport
protein is part of a large genetic locus which is
associated with exopolysaccharide (EPS) biosynthesis.
Detailed molecular characterization and gene fusion
analysis revealed atleast seven gene products are
involved in the overall regulation, which among other
things, include exopolysaccharide biosynthesis, property
of conferring virulence and exopolysaccharide export
[Transport and binding proteins, Carbohydrates, organic
alcohols, and acids].
Length = 754
Score = 28.8 bits (64), Expect = 9.0
Identities = 37/205 (18%), Positives = 79/205 (38%), Gaps = 14/205 (6%)
Query: 283 REAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKAEHERNAAHN 342
A++SK RD EA + ++L +N ++ +Q+ +LN ++A R AA
Sbjct: 202 EIADLSKQSRD-AEAEVAAYRAQSDLLMGNNATLAT--QQLAELNTELSRARANRAAAEG 258
Query: 343 ETNNTRAAADQLAREKAATEKIAKQLQHQLNEVQGKLDETNRSLNDFDAAKKKLSIENSD 402
++ + A E ++ QL+ + + + + D LS
Sbjct: 259 TADSVKKALQNGGSLDVLPEVLSSQLKLEDLIQRLRERQAELRATIAD-----LSTTMLA 313
Query: 403 LLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEEGRERATLLGKFRNLEHDIDNIREQ 462
++ A+S ++ L S ++ + + + + R + L D++ ++
Sbjct: 314 NHPRVVAAKSSLADLDAQIRSELQKITKSLLMQADAAQARESQLVS------DVNQLKAA 367
Query: 463 LEEEAEGKADLQRQLSKANADAQLW 487
+ E + DL A A QL+
Sbjct: 368 SAQAGEQQVDLDALQRDAAAKRQLY 392
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 28.8 bits (64), Expect = 9.3
Identities = 24/116 (20%), Positives = 42/116 (36%), Gaps = 4/116 (3%)
Query: 180 EAVADLERNKKELEQTIQRKDKEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVE 239
E + + E + ++ E + A+ E E G+N+ + + R E E E E
Sbjct: 639 EHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGE 698
Query: 240 AERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLE 295
E AK + + E E E G +IE ++ E + + E
Sbjct: 699 IE----AKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAE 750
>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus. This family contains the N
terminal region of several LUC7 protein homologues and
only contains eukaryotic proteins. LUC7 has been shown
to be a U1 snRNA associated protein with a role in
splice site recognition. The family also contains human
and mouse LUC7 like (LUC7L) proteins and human cisplatin
resistance-associated overexpressed protein (CROP).
Length = 252
Score = 28.5 bits (64), Expect = 9.4
Identities = 31/111 (27%), Positives = 57/111 (51%), Gaps = 15/111 (13%)
Query: 153 LKKDIEDLEGDVEKQKRKIEGDLKLTQEAVA--DLERNKKELEQTIQRKDKEIASLTAKL 210
L++ + D + ++K K++ L+LTQE + +EL + D+EI L A+
Sbjct: 88 LERFVHDCDRKIDKAKQR----LELTQEEQTKIAADSKAEELAEL----DEEIGKLLAEA 139
Query: 211 ED--EQGLVGKNQKQIKELQ---ARIEELEEEVEAERQSRAKAEKQRADLA 256
E E+G V + K +KE++ A+ +ELE+ E + + A+ Q+ L
Sbjct: 140 EALGEEGKVDEAMKLMKEVEELKAKKKELEDSDEVRNAAPSSAQAQQQKLR 190
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 28.5 bits (64), Expect = 9.7
Identities = 12/80 (15%), Positives = 32/80 (40%)
Query: 213 EQGLVGKNQKQIKELQARIEELEEEVEAERQSRAKAEKQRADLARELEELGERLEEAGGA 272
++G KN+K K++ L E + +++ K ++ ++ ++ E A
Sbjct: 19 QKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKKKNLGEAYDLA 78
Query: 273 TSAQIELNKKREAEMSKLRR 292
+ + + S +R+
Sbjct: 79 YDLPVVWSSAAFQDNSHIRK 98
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 28.8 bits (65), Expect = 9.7
Identities = 27/154 (17%), Positives = 63/154 (40%), Gaps = 31/154 (20%)
Query: 318 EMGEQIDQLNKLKTKAEHERNAAHNETNNTRAAADQLAREKAATEKIAKQL----QHQLN 373
++ + + ++N L + AE A+N N K A + A L Q+ +
Sbjct: 615 QIQKPLTEVN-LNSFAEELDALANNLPNGKL---------KVALKNEALDLRSIQQNFVP 664
Query: 374 EVQGKLDETNRSLNDFDAAKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKR 433
++ L++ N ++ D + L D+L +++ A+ ++ +
Sbjct: 665 PMESLLEKLNSNVKTLDESSNDLQNAVKDVLAEVQAAQGFLN-------------NNASE 711
Query: 434 LADEEGRER-ATLLGKFRNLEHDIDNIREQLEEE 466
+ +EE ++ +LG + ID + E++ E+
Sbjct: 712 IVNEESKKFLDRILGYI---DQYIDWVIEEITEK 742
>gnl|CDD|234799 PRK00578, prfB, peptide chain release factor 2; Validated.
Length = 367
Score = 28.7 bits (65), Expect = 9.8
Identities = 33/121 (27%), Positives = 50/121 (41%), Gaps = 34/121 (28%)
Query: 201 KEIASLTAKLEDEQGLVGKNQKQIKELQARIEELEEEVEA--------------ERQSRA 246
+ + L KLE+ +G++ + L+ R+EELE E E + S
Sbjct: 7 ERLKDLDEKLENIRGVL-----DVDALKERLEELEAEAEDPDFWNDQERAQKVTKELSSL 61
Query: 247 KAE-------KQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANI 299
KA+ +QR D EL EL E EE T A+ EAE+ L + L +
Sbjct: 62 KAKLDTLEELRQRLDDLEELLELAE--EEDDEETLAEA------EAELKALEKKLAALEL 113
Query: 300 Q 300
+
Sbjct: 114 E 114
>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Validated.
Length = 339
Score = 28.5 bits (65), Expect = 9.9
Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 15/88 (17%)
Query: 176 KLTQEAVADLER--NKKELEQTIQR-----KDKEIASLTAKL----EDEQGLVGKNQKQI 224
+L +EA+A + + + LE R K E+ L L +E+ G ++
Sbjct: 7 ELVEEALAAIAAASDLEALEAL--RVKYLGKKGELTELLKGLGKLPPEERKEAGALINEL 64
Query: 225 K-ELQARIEELEEEVEAER-QSRAKAEK 250
K ++A +EE +EE+EA +R AE
Sbjct: 65 KQAIEAALEERKEELEAAALNARLAAET 92
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.304 0.122 0.305
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,558,281
Number of extensions: 2843443
Number of successful extensions: 11095
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6900
Number of HSP's successfully gapped: 1953
Length of query: 558
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 456
Effective length of database: 6,413,494
Effective search space: 2924553264
Effective search space used: 2924553264
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 62 (27.7 bits)