RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8072
(584 letters)
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 59.3 bits (144), Expect = 4e-09
Identities = 62/340 (18%), Positives = 135/340 (39%), Gaps = 34/340 (10%)
Query: 39 DIKVLKFQNKKLAQRIEQKNQVEHELRSRIEQLEKRQTQDDAVLNVVNRYWNQLNEDIRI 98
++ L+ + K L + + ELR ++E+LE++ + L + QL +
Sbjct: 682 QLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEE 741
Query: 99 LLQRFDAETADESESKSQAALRALDVCAPPAYGNTNVMCLAITDENEATTSFLMQLSTWD 158
L + E +E + Q L L +E A ++
Sbjct: 742 LEEE--LEELEEELEELQERL------------EELEEELESLEEALAKLKEEIEELEEK 787
Query: 159 KEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVR 218
++ L E+L ++ + + A +R ++ E+ L+ E+ L+E +
Sbjct: 788 RQALQEELEE-LEEELEEAERRLDALERELESLEQRRERLEQEIEE--------LEEEIE 838
Query: 219 EANIEIQTENRNLTEQNTK---LHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQY 275
E ++ L E + L + + + EL+D + E E EL ++ +L+
Sbjct: 839 ELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELES 898
Query: 276 ELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKD 335
EL++++ +KL L E K++ + E E+ + E+T+ +++++ L+++
Sbjct: 899 ELAELKEEIEKLRERLEELEAKLERLEVELPELE-EELEEEYEDTLETELEREIERLEEE 957
Query: 336 LEE-------AKELANNRLQELDKLHLQHREALKDVEKLK 368
+E A E + ++L Q + + EKL
Sbjct: 958 IEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLL 997
Score = 48.9 bits (117), Expect = 6e-06
Identities = 65/323 (20%), Positives = 137/323 (42%), Gaps = 42/323 (13%)
Query: 215 EAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQ 274
+ E E+Q E ++ +L ++ + ++ ELQ+ + + E EL +I L+
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308
Query: 275 YELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQK 334
L ++++ ++LE L E EK + ++++ + E + + Q +L++
Sbjct: 309 ERLEELENELEELEERLEELKEK---------IEALKEELEERETLLEELEQLL-AELEE 358
Query: 335 DLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISWLVWSVNPTSLTTQ 394
EE +E + L+EL++L RE L ++E + NE+ +
Sbjct: 359 AKEELEEKLSALLEELEELFEALREELAELEAELAEI----RNELE-------------E 401
Query: 395 LPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHLSASKNNQQRQIEM 454
L R+I L E + + + L+ L E + +T+ +E L + +
Sbjct: 402 LKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLE----- 456
Query: 455 METEELMVQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNLAANEQTGPINREMRHLITS 514
E K+L E+ +L++ L ++ KE E L E ++ +R ++ +
Sbjct: 457 ---ELRDRLKELERELAELQEELQRLEKELS----SLEARLDRLEAEQRASQGVRAVLEA 509
Query: 515 LQNHNKQIKG---EVNRYKRKYK 534
L++ + G E+ + K KY+
Sbjct: 510 LESGLPGVYGPVAELIKVKEKYE 532
Score = 44.7 bits (106), Expect = 1e-04
Identities = 66/319 (20%), Positives = 142/319 (44%), Gaps = 32/319 (10%)
Query: 197 ILKGELNNSNGDPIPSLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAV 256
I G N + + + E E++ + L E+ L + S+ + EL+ +
Sbjct: 652 ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711
Query: 257 NGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIK 316
E + EL+ ++ L+ EL ++QSR ++LE L E E+++ Q + + +
Sbjct: 712 EELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQ---------ERLEE 762
Query: 317 PEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFN 376
EE + ++ ++ L L++++EE +E +EL++L + EA + ++ L+ + E
Sbjct: 763 LEEELESL-EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELE---- 817
Query: 377 NEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQE 436
+ +L ++I +L E + + L+ L E + K + +E
Sbjct: 818 ----------SLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEE 867
Query: 437 FQSHLSASKNNQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNLA 496
++ ++ + E+ E EE +++L E+ +LE L+++++E E LR E A
Sbjct: 868 LEAEK------EELEDELKELEE--EKEELEEELRELESELAELKEEIEKLRERLEELEA 919
Query: 497 ANEQTGPINREMRHLITSL 515
E+ E+ +
Sbjct: 920 KLERLEVELPELEEELEEE 938
Score = 42.0 bits (99), Expect = 8e-04
Identities = 50/256 (19%), Positives = 109/256 (42%), Gaps = 19/256 (7%)
Query: 39 DIKVLKFQNKKLAQRIEQKNQVEHELRSRIEQLEKRQTQDDAVLNVVNRYWNQLNEDIRI 98
+++ L+ + ++ + IE+ ELR +E+L++ + + + + L E +
Sbjct: 254 ELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEE 313
Query: 99 LLQRFDAETADESESKSQAALRALDVCAPPAYGNTNVMCLAITDENEATTSFLMQLSTWD 158
L + E K + ++ + +E E + L +
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEE----------RETLLEELEQLLAELEEA---- 359
Query: 159 KEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITM---ILKGELNN--SNGDPIPSL 213
KEEL+EKL+ ++ + + + L +I LK E+ + + +
Sbjct: 360 KEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSER 419
Query: 214 DEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDL 273
E ++E E++ E L + +L+ + + ++ EL+D + E E AEL+ ++ L
Sbjct: 420 LEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRL 479
Query: 274 QYELSKVQSRNDKLEN 289
+ ELS +++R D+LE
Sbjct: 480 EKELSSLEARLDRLEA 495
Score = 41.6 bits (98), Expect = 0.001
Identities = 51/245 (20%), Positives = 116/245 (47%), Gaps = 17/245 (6%)
Query: 160 EELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVRE 219
EE E+L +R++ + + ++ + + L +R E++ L E++ E
Sbjct: 729 EEELEQLQSRLEELEEELEELEEELEELQERLEELEEEL----------------ESLEE 772
Query: 220 ANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSK 279
A +++ E L E+ L + + ++ E + ++ E E L + + L+ E+ +
Sbjct: 773 ALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEE 832
Query: 280 VQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEA 339
++ ++LE L E E+++ ++ E +E + ++ E+ K+L++ +++LEE
Sbjct: 833 LEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEE 892
Query: 340 KELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISWLVWSVNPTSLTTQLPRQI 399
+ L EL + + RE L+++E K++ E+ + +L T+L R+I
Sbjct: 893 LRELESELAELKEEIEKLRERLEELEA-KLERLEVELPELEEELEEEYEDTLETELEREI 951
Query: 400 RQLPE 404
+L E
Sbjct: 952 ERLEE 956
Score = 39.3 bits (92), Expect = 0.005
Identities = 59/337 (17%), Positives = 124/337 (36%), Gaps = 43/337 (12%)
Query: 48 KKLAQRIEQKNQVEHELRSRIEQL---------------EKRQTQDDAVLNVVNRY---W 89
++ + +E+ + EL ++E+L E R+ + +L +
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKEL 241
Query: 90 NQLNEDIRILLQRFDAETAD-ESESKSQAALRALDVCAPPAYGNTNVMCLAITDENEATT 148
+L E++ L + + + E K L++ L E
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSE---------------LEELREELEEL 286
Query: 149 SFLMQLSTWDKEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGD 208
+ + EEL+ +++ + + ++ + +RL + EKI + +
Sbjct: 287 QEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERET-- 344
Query: 209 PIPSLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRN 268
L E + + E+ E KL A + L++ + E E AE+RN
Sbjct: 345 ----LLEELEQLLAEL---EEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRN 397
Query: 269 QIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKK 328
++++L+ E+ ++ R ++L L + E++K + E E + + E ++
Sbjct: 398 ELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457
Query: 329 LDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVE 365
L D K+LE LQ L+K L +E
Sbjct: 458 LRDRLKELERELAELQEELQRLEKELSSLEARLDRLE 494
Score = 37.8 bits (88), Expect = 0.018
Identities = 57/316 (18%), Positives = 131/316 (41%), Gaps = 12/316 (3%)
Query: 237 KLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLEN-HLLEAT 295
K + K+ ++ + E EL Q++ L+ + K + + LE
Sbjct: 169 KYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELA 228
Query: 296 EKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHL 355
+ +++ E + ++ + E Q++L++ +K++EE K +EL++L
Sbjct: 229 LLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQE 288
Query: 356 QHREALKDVEKLKMDWEVTFNNEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKC 415
+ E +++E+L+ + + + L L +L ++ E++ E E +
Sbjct: 289 ELLELKEEIEELEGEISLL-RERLEELE--NELEELEERLEELKEKI-EALKEELEERET 344
Query: 416 LQSAYSVLYNENQLAKTQAQEFQSHLSASKNNQQRQI--EMMETEELMVQKK-----LRG 468
L L E + AK + +E S L + E+ E E + + + L+
Sbjct: 345 LLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKR 404
Query: 469 EMMQLEDALSQIRKEYEMLRIEFETNLAANEQTGPINREMRHLITSLQNHNKQIKGEVNR 528
E+ LE+ L ++ + E L+ E + A E+ E+ + L+ ++++ +
Sbjct: 405 EIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKE 464
Query: 529 YKRKYKETSAEIQKVR 544
+R+ E E+Q++
Sbjct: 465 LERELAELQEELQRLE 480
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 48.5 bits (116), Expect = 8e-06
Identities = 34/152 (22%), Positives = 69/152 (45%)
Query: 215 EAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQ 274
E ++E E + E LT + +L K + L++SEL++ + + E L N+I L+
Sbjct: 242 EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE 301
Query: 275 YELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQK 334
+ ++ R LE L E +++ + E +E ++ + + L+ +
Sbjct: 302 QQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELE 361
Query: 335 DLEEAKELANNRLQELDKLHLQHREALKDVEK 366
+LE E +RL+EL++ R + +E
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLEL 393
Score = 45.0 bits (107), Expect = 9e-05
Identities = 52/269 (19%), Positives = 117/269 (43%), Gaps = 27/269 (10%)
Query: 223 EIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQS 282
E++ E L E+ + + ++ ++ EL++ + E +EL +I++LQ EL + +
Sbjct: 236 ELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALAN 295
Query: 283 RNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKEL 342
+LE E++ + + + + ++ E+ + ++L +L++ LEE KE
Sbjct: 296 EISRLEQQKQILRERL---ANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEE 352
Query: 343 ANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISWLVWSVNPTSLTTQLPRQIRQL 402
+ EL++L + E +E+L+ E T ++++ QL QI L
Sbjct: 353 LESLEAELEELEAELEELESRLEELEEQLE-TLRSKVA-------------QLELQIASL 398
Query: 403 PESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHLSASKNNQQRQIEMMETEELMV 462
+ + L+ + QE + L + + ++++ E
Sbjct: 399 NNEIERLEARLERLEDR----------RERLQQEIEELLKKLEEAELKELQAELEELEEE 448
Query: 463 QKKLRGEMMQLEDALSQIRKEYEMLRIEF 491
++L+ E+ +LE+AL ++R+E E
Sbjct: 449 LEELQEELERLEEALEELREELEEAEQAL 477
Score = 42.0 bits (99), Expect = 0.001
Identities = 49/311 (15%), Positives = 115/311 (36%), Gaps = 23/311 (7%)
Query: 48 KKLAQRIEQKNQVEHELRSRIEQLEKRQTQDDAVLNVVNRYWNQLNEDI---RILLQRFD 104
++L ++IE+ + EL + +L K + + L + + +L+ I R L R +
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739
Query: 105 AETADESESKSQAALRALDVCAPPAYGNTNVMCLAITDENEATTSFLMQLSTWDKEELD- 163
AE E +Q + ++ + EA + ++ E
Sbjct: 740 AEVEQLEERIAQLSKELTEL--------EAEIEELEERLEEAEEELAEAEAEIEELEAQI 791
Query: 164 EKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVREANIE 223
E+L ++ + A+ ++ L + + L+ A +
Sbjct: 792 EQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRI---------AATERRLED 842
Query: 224 IQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSR 283
++ + L+E L A+ + I EL+ + E A L + L+ EL ++
Sbjct: 843 LEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902
Query: 284 NDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELA 343
+LE+ E +++ ++ + + ++ N+ ++ ++ LEEA+ L
Sbjct: 903 LRELESKRSELRRELEELREKLAQLELRLEGLEVRI--DNLQERLSEEYSLTLEEAEALE 960
Query: 344 NNRLQELDKLH 354
N + ++
Sbjct: 961 NKIEDDEEEAR 971
Score = 39.3 bits (92), Expect = 0.007
Identities = 32/207 (15%), Positives = 81/207 (39%), Gaps = 6/207 (2%)
Query: 160 EELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVRE 219
EL ++L + ++ ++ + ++ + + + + I L + + E
Sbjct: 701 AELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLE--ERIAQLSKELTE 758
Query: 220 ANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSK 279
EI+ L E +L +I EL+ + + E LR +D+L+ EL+
Sbjct: 759 LEAEIEELEERLEEAEEELAE----AEAEIEELEAQIEQLKEELKALREALDELRAELTL 814
Query: 280 VQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEA 339
+ L L ++ ++ + ++ + + + ++L++L ++LE
Sbjct: 815 LNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESE 874
Query: 340 KELANNRLQELDKLHLQHREALKDVEK 366
E N L++ R L+++ +
Sbjct: 875 LEALLNERASLEEALALLRSELEELSE 901
Score = 38.5 bits (90), Expect = 0.011
Identities = 51/327 (15%), Positives = 125/327 (38%), Gaps = 58/327 (17%)
Query: 47 NKKLAQRIEQKNQVEHELRSRIEQLEKRQTQDDAVLNVVNRYWNQLNEDIRILLQRFDAE 106
K + +E++ ++E EL +IE+LE++ + + L + + +L E++ L + +
Sbjct: 666 AKTNSSILERRREIE-ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEEL 724
Query: 107 TADESESKSQAALRALDVCAPPAYGNTNVMCLAITDENEATTSFLMQLSTWDKEELDEKL 166
+ S + A + E E + QLS E E
Sbjct: 725 SRQISALRKDLA--------------------RLEAEVEQLEERIAQLSKELTELEAEIE 764
Query: 167 ANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVREANIEIQT 226
++ +A + L + +I L+ + + E++
Sbjct: 765 ELE--------ERLEEAEEELAEAEAEIE----------------ELEAQIEQLKEELKA 800
Query: 227 ENRNLTEQN---TKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSR 283
L E T L+ + ++ ++ L+ + E +L QI++L ++ + +
Sbjct: 801 LREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAE 860
Query: 284 NDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELA 343
++LE + E +++ E+ + E + + +L++L ++L E +
Sbjct: 861 IEELEELIEELESELEALL-------NERASL---EEALALLRSELEELSEELRELESKR 910
Query: 344 NNRLQELDKLHLQHREALKDVEKLKMD 370
+ +EL++L + + +E L++
Sbjct: 911 SELRRELEELREKLAQLELRLEGLEVR 937
Score = 37.0 bits (86), Expect = 0.031
Identities = 49/300 (16%), Positives = 114/300 (38%), Gaps = 59/300 (19%)
Query: 248 KISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTE 307
KI EL++ + E AELR ++++L+ EL +++ ++L +
Sbjct: 685 KIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISAL------------- 731
Query: 308 DGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKL 367
KD+ + E + + ++ + L K+L E + + L++ + EA ++E+L
Sbjct: 732 ---RKDLARLEAEVEQLEER-IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEEL 787
Query: 368 KMDWEVTFNNEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNEN 427
+ QI QL E + L++ ++L NE
Sbjct: 788 E----------------------------AQIEQLKEELKALREALDELRAELTLL-NEE 818
Query: 428 QLAKTQAQEFQSHLSASKNNQQRQIEMMETEELMVQ---KKLRGEMMQLEDALSQIRKEY 484
+ E + +R+I E ++ ++L ++ L + ++ +
Sbjct: 819 AANLRERLE----------SLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELI 868
Query: 485 EMLRIEFETNLAANEQTGPINREMRHLITSLQNHNKQIKGEVNRYKRKYKETSAEIQKVR 544
E L E E L +R + L ++++ + + +R+ +E ++ ++
Sbjct: 869 EELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLE 928
Score = 36.6 bits (85), Expect = 0.041
Identities = 44/282 (15%), Positives = 104/282 (36%), Gaps = 38/282 (13%)
Query: 43 LKFQNKKLAQRIEQKNQVEH-----------------ELRSRIEQLEKRQTQDDAVLNVV 85
L+ ++L + E+ + E ELR + +LE+ + L +
Sbjct: 234 LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYAL 293
Query: 86 NRYWNQLNEDIRILLQRFDA--ETADESESKSQAALRALDVCAPPAYGNTNVMCLAITDE 143
++L + +IL +R +E E++ + LD A E
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELA------------EELAE 341
Query: 144 NEATTSFLMQLSTWDKEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELN 203
E L + + E L+ +L + +++ + ++L K+ + +
Sbjct: 342 LEEKLEELKE----ELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIAS 397
Query: 204 NSNGDPIPSLDEAVREANIEIQTENRNLTEQNTKLH-AKYHSISLKISELQDAVNGKETE 262
+N I L+ + + + + E KL A+ + ++ EL++ + + E
Sbjct: 398 LNN--EIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455
Query: 263 AAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQM 304
L +++L+ EL + + D E L + ++ +++
Sbjct: 456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERL 497
Score = 34.3 bits (79), Expect = 0.23
Identities = 61/385 (15%), Positives = 143/385 (37%), Gaps = 85/385 (22%)
Query: 3 KRGVDCEDASSSGQPPVKKVQFEPIRIGPISTLEEMDIKVLKFQNKKLAQRIEQKNQVEH 62
K + S+ + + +++ E ++ ++ L+ + ++L +R+E+ +
Sbjct: 719 KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELA 778
Query: 63 ELRSRIEQLEKRQTQDDAVLNVVNRYWNQLNEDIRILLQRFDAETADESESKSQAALRAL 122
E + IE+LE + Q L E + L +A L L
Sbjct: 779 EAEAEIEELEAQIEQL-------KEELKALREALDEL----------------RAELTLL 815
Query: 123 DVCAPPAYGNTNVMCLAITDENEATTSFLMQLSTWDKEELDEKLANRVQVSKRAVAKIIQ 182
EE L R++ +R +A +
Sbjct: 816 ------------------------------------NEEAAN-LRERLESLERRIAATER 838
Query: 183 AFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVREANIEIQTENRNLTEQNTKLHAKY 242
+ L ++ E+++ + E + I L+E + E E++ L + L
Sbjct: 839 RLEDLEEQIEELSEDI--ESLAAE---IEELEELIEELESELEA----LLNERASLEEAL 889
Query: 243 HSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQ 302
+ ++ EL + + E++ +ELR ++++L+ +L++++ R + LE + E++
Sbjct: 890 ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEY 949
Query: 303 QMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDL-----------EEAKELANNRLQELD 351
+ E+ + + ++ ++++L L+ + EE +EL R L
Sbjct: 950 SLTLEEAEALENKIEDDEEE--ARRRLKRLENKIKELGPVNLAAIEEYEEL-KERYDFLT 1006
Query: 352 KLHLQHREALKDVEKL--KMDWEVT 374
EA + +E+ ++D E
Sbjct: 1007 AQKEDLTEAKETLEEAIEEIDREAR 1031
Score = 32.7 bits (75), Expect = 0.71
Identities = 42/239 (17%), Positives = 100/239 (41%), Gaps = 37/239 (15%)
Query: 326 QKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISWLVWS 385
++KL+ +++L+ +++ N ++L L Q +A + E LK + E+ +
Sbjct: 178 ERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKE-LKAELR-----ELELALLV 231
Query: 386 VNPTSLTTQLPR---QIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQS--- 439
+ L +L ++++ E + T E + L+ L E + + +E Q
Sbjct: 232 LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291
Query: 440 --------------HLSASKNNQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYE 485
L N +RQ+E +E + ++ KL +L + L+++ ++ E
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLD----ELAEELAELEEKLE 347
Query: 486 MLRIEFETNLAANEQTGPINREMRHLITSLQNHNKQIKGEVNRYKRKYKETSAEIQKVR 544
L+ E E+ A E E+ + L++ ++++ ++ + K + +I +
Sbjct: 348 ELKEELESLEAELE-------ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN 399
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 45.1 bits (107), Expect = 1e-04
Identities = 41/234 (17%), Positives = 78/234 (33%), Gaps = 35/234 (14%)
Query: 158 DKEELDEKLANRVQVSKRAVAKIIQ--------AFDRLVQRNEKITMILKGELNNSNGDP 209
+ + + +++ V+ ++II+ + L + E I LK LNN + +
Sbjct: 886 ELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIAR-LKKLLNNIDLEE 944
Query: 210 IPSLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDA---VNGKETEAAEL 266
PS++ ++ L E + + ++ + E A + + E AEL
Sbjct: 945 GPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAEL 1004
Query: 267 RNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQ 326
Q LQ +++ ++ K+ TE K + K + +
Sbjct: 1005 SKQYGALQESTKQLKELPVEVAEL----QSASKIISSESTELSILKPLQKLKGLL----- 1055
Query: 327 KKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEIS 380
L NN+LQ K RE +K E T N +
Sbjct: 1056 --------------LLENNQLQARYKALKLRRENSLLDDKQLYQLESTENLLKT 1095
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 45.1 bits (107), Expect = 1e-04
Identities = 51/233 (21%), Positives = 102/233 (43%), Gaps = 18/233 (7%)
Query: 262 EAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETM 321
E LR +++ L+ ELS +QS ++EN L E ++++ + E KE + ++ EE
Sbjct: 675 ELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEK 734
Query: 322 SNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISW 381
++L++ LE+ E + L+EL+ + E L +E+ D E ++
Sbjct: 735 LKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSR-- 792
Query: 382 LVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHL 441
++ ++ +L E V + ++ + L E + + + QE Q
Sbjct: 793 ----------IPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQR 842
Query: 442 SASKNNQ---QRQIEMMET--EELMVQ-KKLRGEMMQLEDALSQIRKEYEMLR 488
K +++IE + EEL + ++L + LE L ++KE + L
Sbjct: 843 IDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895
Score = 40.4 bits (95), Expect = 0.003
Identities = 53/293 (18%), Positives = 123/293 (41%), Gaps = 23/293 (7%)
Query: 223 EIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGK-ETEAAELRNQIDDLQYELSKVQ 281
++ E LTE+ ++L + I + EL + E E ++ +I +L+ E++ ++
Sbjct: 248 SLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLE 307
Query: 282 SRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKE 341
+ E L +A E++ + + E + ++ E + KL + +L+E E
Sbjct: 308 RSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELE 367
Query: 342 LANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISWLVWSVNPTSLTTQLPRQIRQ 401
L+E+DK + R+ LKD + ++ L +N L R++ +
Sbjct: 368 DLRAELEEVDKEFAETRDELKDYRE-----------KLEKLKREINE------LKRELDR 410
Query: 402 LPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHLSASKNNQQRQIEMMETEELM 461
L E + + E L +A + + + + + ++ + + ++ + E
Sbjct: 411 LQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQE 470
Query: 462 VQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNLAANEQTGPINREMRHLITS 514
+ L+ E ++E LS++++E E E A+E+ R + ++ +
Sbjct: 471 L-YDLKEEYDRVEKELSKLQRELA----EAEAQARASEERVRGGRAVEEVLKA 518
Score = 38.5 bits (90), Expect = 0.010
Identities = 34/187 (18%), Positives = 79/187 (42%), Gaps = 16/187 (8%)
Query: 181 IQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVREANIEIQTENRNLTEQNTKLHA 240
+ + V R E ++ +LN + E + + E+Q + +L EQ +
Sbjct: 800 LSKLEEEVSRIEARLREIEQKLNRLTLE-----KEYLEKEIQELQEQRIDLKEQIKSIEK 854
Query: 241 KYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKL 300
+ +++ K EL E E EL + DL+ L ++ D+LE L E K+
Sbjct: 855 EIENLNGKKEEL-------EEELEELEAALRDLESRLGDLKKERDELEAQLRELERKI-- 905
Query: 301 CQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREA 360
+++ + K++ + + +++L +++ E +E+ L L+ + + +
Sbjct: 906 -EELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEEL-SLEDVQAELQRV 963
Query: 361 LKDVEKL 367
+++ L
Sbjct: 964 EEEIRAL 970
Score = 35.8 bits (83), Expect = 0.065
Identities = 56/267 (20%), Positives = 103/267 (38%), Gaps = 26/267 (9%)
Query: 32 ISTLEEMDIKVLKFQNKKLAQRIEQKNQVEHELRSRIEQL-EKRQTQDDAVLNVVNRYWN 90
+++LEE +++ L + +L +R+E+ Q+ EL +I+ L E+ Q + + +
Sbjct: 246 LASLEE-ELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIA 304
Query: 91 QLNEDIRILLQRFDAETADESESKSQAALRALDVCAPPAYGNTNVMCLAITDENEATTSF 150
L I + E A+E +K +A + L I +E +
Sbjct: 305 SLERSIAEKERE--LEDAEERLAKLEAEIDKLLAEIEELER-------EIEEERKRRDKL 355
Query: 151 LMQLSTWDKEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNG--- 207
+ + KEEL++ A +V K D L EK+ LK E+N
Sbjct: 356 TEEYAE-LKEELEDLRAELEEVDKEFAE----TRDELKDYREKLEK-LKREINELKRELD 409
Query: 208 ---DPIPSLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAA 264
+ + L E + + N I + E + K I + +L+
Sbjct: 410 RLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQ 469
Query: 265 E---LRNQIDDLQYELSKVQSRNDKLE 288
E L+ + D ++ ELSK+Q + E
Sbjct: 470 ELYDLKEEYDRVEKELSKLQRELAEAE 496
Score = 35.8 bits (83), Expect = 0.065
Identities = 39/235 (16%), Positives = 98/235 (41%), Gaps = 8/235 (3%)
Query: 137 CLAITDENEATTSFLMQLSTWDKEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITM 196
I E + L ++ + EK+ ++ ++ + I +R ++ E+
Sbjct: 267 LEEIEQLLEELNKKIKDLGEEEQLRVKEKIGE-LEAEIASLERSIAEKERELEDAEERLA 325
Query: 197 ILKGELNNSNGDPIPSLDEAVREANIE---IQTENRNLTEQNTKLHAKYHSISLKISELQ 253
L+ E++ + I L+ + E + E L E+ L A+ + + +E +
Sbjct: 326 KLEAEIDKLLAE-IEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETR 384
Query: 254 DAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKD 313
D + + +L+ +I++L+ EL ++Q +L L + + + E +EK+
Sbjct: 385 DELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKE 444
Query: 314 VIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLK 368
E + KL+ L DL + ++ + +E D++ + + +++ + +
Sbjct: 445 DKALE---IKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAE 496
Score = 35.4 bits (82), Expect = 0.085
Identities = 46/297 (15%), Positives = 110/297 (37%), Gaps = 38/297 (12%)
Query: 247 LKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHT 306
+ EL E + + Q+ L+ EL K+ +LE L E + +
Sbjct: 223 YEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLL-------- 274
Query: 307 EDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEK 366
+ K + E ++K+ +L+ ++ + + +EL+ + + +++K
Sbjct: 275 -EELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDK 333
Query: 367 LKMDWEVTFNNEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNE 426
L + E +L R+I + + T EY L+ L E
Sbjct: 334 LLAEIE---------------------ELEREIEEERKRRDKLTEEYAELKEELEDLRAE 372
Query: 427 NQLAKTQAQEFQSHLSASKNNQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYEM 486
+ + E + L + ++E + E+ +L+ E+ +L++ L ++ +E
Sbjct: 373 LEEVDKEFAETRDELKD----YREKLEKL-KREI---NELKRELDRLQEELQRLSEELAD 424
Query: 487 LRIEFETNLAANEQTGPINREMRHLITSLQNHNKQIKGEVNRYKRKYKETSAEIQKV 543
L A + + I + +Q+ ++++Y+++ + E +V
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRV 481
Score = 32.7 bits (75), Expect = 0.58
Identities = 34/207 (16%), Positives = 91/207 (43%), Gaps = 5/207 (2%)
Query: 159 KEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVR 218
E+L E+++ + + + + ++ E+ + +K ++ + I SL+ ++
Sbjct: 253 LEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAE-IASLERSIA 311
Query: 219 EANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELS 278
E E++ + E+ KL A+ + +I EL+ + + +L + +L+ EL
Sbjct: 312 EKERELE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELE 367
Query: 279 KVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEE 338
+++ ++++ E +++K ++ + +E + +K E ++L + DL
Sbjct: 368 DLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNA 427
Query: 339 AKELANNRLQELDKLHLQHREALKDVE 365
A ++ EL++ +K E
Sbjct: 428 AIAGIEAKINELEEEKEDKALEIKKQE 454
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 43.5 bits (103), Expect = 2e-04
Identities = 31/154 (20%), Positives = 58/154 (37%), Gaps = 11/154 (7%)
Query: 152 MQLSTWDKEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIP 211
MQL KE L+E L ++ + + K + + + + K LK E+ +
Sbjct: 142 MQLLEGLKEGLEENL-EGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIAS------- 193
Query: 212 SLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQID 271
L + E N+ E ++ L K ++ ELQ + L N+
Sbjct: 194 -LRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKS 252
Query: 272 DLQYELSKVQSRNDKLENHLLEATEKVKLCQQMH 305
+L E++ ++ + E A E KL ++
Sbjct: 253 ELLEEIA--EAEKIREECRGWSAKEISKLKAKVS 284
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 40.4 bits (95), Expect = 0.003
Identities = 32/152 (21%), Positives = 71/152 (46%), Gaps = 2/152 (1%)
Query: 223 EIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQS 282
E+ E + E+ K + +I I E + + E E+ +++ +L+ EL K++
Sbjct: 169 EVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEK 228
Query: 283 RNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKEL 342
+LE E E K + + K ++ I+ E +K++++L++ ++E KEL
Sbjct: 229 EVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKEL 288
Query: 343 AN--NRLQELDKLHLQHREALKDVEKLKMDWE 372
+L + + ++ + L+++EK E
Sbjct: 289 KEKAEEYIKLSEFYEEYLDELREIEKRLSRLE 320
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 39.6 bits (93), Expect = 0.004
Identities = 44/221 (19%), Positives = 80/221 (36%), Gaps = 31/221 (14%)
Query: 217 VREANIEIQTEN--RNLTEQNTKLHAKYHSISLK-----ISELQDAVNGKETEAAELRNQ 269
+RE N +IQT + + +Q K + K K I+ Q+ + EA ++ +
Sbjct: 176 IRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAE 235
Query: 270 IDDLQYELSKVQSRNDKLENHL-------LEATEKVKLCQQMHT--EDGKE----KDVIK 316
I++L EL + + L + K++ Q++ E G I
Sbjct: 236 IEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQIS 295
Query: 317 PEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFN 376
+ KL +LQ LE+ + EL+++ + E K + +LK T
Sbjct: 296 EGPDRITKIKDKLKELQHSLEKLDT----AIDELEEIMDEFNEQSKKLLELKNKIS-TNK 350
Query: 377 NEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQ 417
+ LV ++ I +L + E LQ
Sbjct: 351 QSLITLVDKAK------KVKAAIEELQAEFVDNAEELAKLQ 385
Score = 31.5 bits (72), Expect = 1.3
Identities = 20/107 (18%), Positives = 48/107 (44%), Gaps = 7/107 (6%)
Query: 264 AELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSN 323
++++++ +LQ+ L K+ + D+LE + E E+ K ++ + K +
Sbjct: 302 TKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLV---- 357
Query: 324 VSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMD 370
K ++ +EE + + +EL KL + + +K +L +
Sbjct: 358 ---DKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKE 401
Score = 28.8 bits (65), Expect = 9.7
Identities = 45/264 (17%), Positives = 95/264 (35%), Gaps = 42/264 (15%)
Query: 32 ISTLEEMDIKVLKFQNKKLAQRIEQKNQVEHELRSRIEQLEKRQTQDDAVLNVVNRYWNQ 91
IS L EMD K+ K + ++L Q+I+ ++ ++ +Q++ + +
Sbjct: 162 ISVLSEMD-KLNKDKIRELNQQIQT---LDMKIDHIQQQIKTYN-------KNIEEQRKK 210
Query: 92 LNEDIRILLQRFDAETADESESKSQAALR--ALDVCAPPAYGNTNVMCLAITDENEATTS 149
E+I ++D + K++ L + A+ N A
Sbjct: 211 NGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSA----ALNKLNTAAAK 266
Query: 150 FLMQLSTWDKEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDP 209
++ + K K+ + V +I + DR+ + +K+ +
Sbjct: 267 IKSKIEQFQKVI---KMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHS--------- 314
Query: 210 IPSLDEAVREAN---IEIQTENRNLTEQNTKLHAKYHSISL----------KISELQDAV 256
+ LD A+ E E +++ L E K+ S+ I ELQ
Sbjct: 315 LEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEF 374
Query: 257 NGKETEAAELRNQIDDLQYELSKV 280
E A+L++++D + S++
Sbjct: 375 VDNAEELAKLQDELDKIVKTKSEL 398
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 37.0 bits (86), Expect = 0.029
Identities = 77/510 (15%), Positives = 173/510 (33%), Gaps = 55/510 (10%)
Query: 45 FQNKKLAQRIEQKNQVEHELRSRIEQLEKRQTQDDAVLNVVNRYWNQLNEDIRILLQRFD 104
F +K + E +V E +++IE+LE + ++L EDI LL+ +
Sbjct: 164 FGLEKYEKLSELLKEVIKEAKAKIEELEGQ--------------LSELLEDIEDLLEALE 209
Query: 105 AETADESESKSQAALRALDVCAPPAYGNTNVMCLAITDENEATTSFLMQLSTWDKEELDE 164
E + + + + + E A + K L E
Sbjct: 210 EELKELKKLEEIQEEQEEEELEQEIEALE---------ERLAELEEEKERLEELKARLLE 260
Query: 165 KLANRVQVSKRAVAKIIQAFDRLVQRNEKITMI--LKGELNNSNGDPIPSLDEAVREANI 222
+ ++ K ++ + L + EKI + L+ E+ + E
Sbjct: 261 IESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALL--EELE 318
Query: 223 EIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETE----AAELRNQIDDLQYELS 278
E+ + ++L E+ KL K + ++ EL + N EL ++++L+ EL
Sbjct: 319 ELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELE 378
Query: 279 KVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEE 338
K R +LE + E E+ + +++ + E + ++ +L++ EE
Sbjct: 379 KALERLKQLEEAIQELKEE------LAELSAALEEIQEELEELEKELEELERELEELEEE 432
Query: 339 AKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISWLVWSVNPTSLTTQLPRQ 398
K+L + L + E + L +
Sbjct: 433 IKKL--EEQINQLESKELMIAELAGAGEKCPVCGQELPEE--------HEKELLELYELE 482
Query: 399 IRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHLSASKNNQQRQIEMMETE 458
+ +L E + + K + + + +E L + ++ E +E
Sbjct: 483 LEELEEEL----SREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKL 538
Query: 459 ELMVQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNLAANEQTGPINREMRHLITSLQNH 518
E ++++ + L + ++ E E + L R + + L+
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRL----LRTRKEELEELRER 594
Query: 519 NKQIKGEVNRYKRKYKETSAEIQKVRISER 548
K++K ++ + + + +Q + +SE
Sbjct: 595 LKELKKKLKELEERLSQLEELLQSLELSEA 624
Score = 32.4 bits (74), Expect = 0.86
Identities = 83/509 (16%), Positives = 190/509 (37%), Gaps = 50/509 (9%)
Query: 48 KKLAQRIEQKNQVEHELRSRIEQLEKRQTQDDAVLNVVNRYWNQLNEDIRILLQRFDAET 107
++ +R+E+ E+ S + K + ++ L + + E + L + +
Sbjct: 245 EEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELE 304
Query: 108 ADESESKSQAALRALDVCAPPAYGNTNVMCLAITDENEATTSFLMQLSTW--DKEELDEK 165
+ ++ + + + ++ E S L +L+ + +L E+
Sbjct: 305 EELEGLRALLEELEELL---EKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEE 361
Query: 166 LANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVREANIEIQ 225
++ + K ++ +++ E+ LK EL L A+ E E++
Sbjct: 362 RLKELEERLEELEKELEKALERLKQLEEAIQELKEEL--------AELSAALEEIQEELE 413
Query: 226 TENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQID-------DLQYELS 278
+ L E +L I K+ E + + KE AEL + +L E
Sbjct: 414 ELEKELEELERELEELEEEI-KKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHE 472
Query: 279 KVQSRNDKLE-NHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLE 337
K +LE L E + K ++ E + + ++ E + + L+++LE
Sbjct: 473 KELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELE 532
Query: 338 EAKELANNRLQELDKLHLQH---------REALKDVEKLKMDWEVTFNNEISWLVWSVNP 388
E E N L+EL++L + R+ +++LK E + +
Sbjct: 533 EKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE--LRLLRTRKEELE- 589
Query: 389 TSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKT--------QAQEFQSH 440
L +L +++++ + + ++ + L + + EN+L + + Q+
Sbjct: 590 -ELRERL-KELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAE 647
Query: 441 LSASKNNQQRQIE-MMETEELMVQKKL-----RGEMMQLEDALSQIRKEYEMLRIEFETN 494
L ++E +E E ++++L ++ + + L Q+ +E E LR E E
Sbjct: 648 LEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEEL 707
Query: 495 LAANEQTGPINREMRHLITSLQNHNKQIK 523
L + + E+ L+ K+++
Sbjct: 708 LKKLGEIEQLIEELESRKAELEELKKELE 736
Score = 30.1 bits (68), Expect = 4.6
Identities = 29/196 (14%), Positives = 77/196 (39%), Gaps = 11/196 (5%)
Query: 159 KEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVR 218
+ + +K ++ + +++Q+ + NE + E + L+E ++
Sbjct: 594 RLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQ 653
Query: 219 EANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELS 278
A E++ + L + + + + +L+ E +L +++ L+ EL
Sbjct: 654 AALEELEEKVEELEAEIRRELQRIENEEQLEEKLE--------ELEQLEEELEQLREELE 705
Query: 279 KVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKL---DDLQKD 335
++ + ++E + E + +++ E K + ++ E + K D L+
Sbjct: 706 ELLKKLGEIEQLIEELESRKAELEELKKELEKLEKALELLEELREKLGKAGLRADILRNL 765
Query: 336 LEEAKELANNRLQELD 351
L + + AN L +L
Sbjct: 766 LAQIEAEANEILSKLS 781
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 35.8 bits (83), Expect = 0.044
Identities = 23/150 (15%), Positives = 60/150 (40%), Gaps = 13/150 (8%)
Query: 223 EIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQS 282
++ E L + ++ ++ L A+ E E +L NQ+ L+ E+ +++
Sbjct: 14 KLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRE 73
Query: 283 RNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVS-------------QKKL 329
R + E L ++ +L ++ + E+ ++ + +++L
Sbjct: 74 RIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERL 133
Query: 330 DDLQKDLEEAKELANNRLQELDKLHLQHRE 359
+ L+K+L EA+ + E+ + +
Sbjct: 134 ERLEKNLAEAEARLEEEVAEIREEGQELSS 163
Score = 29.3 bits (66), Expect = 5.1
Identities = 34/146 (23%), Positives = 64/146 (43%), Gaps = 15/146 (10%)
Query: 247 LKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHT 306
L I +L + E E+R + + EL + N LE +E + Q+ +
Sbjct: 10 LAIQKLDLEKDRLEPRIKEIRKALKKAKAELEAL---NKALEALEIELEDLENQVSQLES 66
Query: 307 EDGKEKDVIKP-EETMSNVS-QKKLDDLQKDLEEAKELAN----------NRLQELDKLH 354
E + ++ IK EE +S V +++L L +++ AKE N +++L+K
Sbjct: 67 EIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEI 126
Query: 355 LQHREALKDVEKLKMDWEVTFNNEIS 380
+E L+ +EK + E E++
Sbjct: 127 EDLKERLERLEKNLAEAEARLEEEVA 152
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 35.6 bits (83), Expect = 0.057
Identities = 33/140 (23%), Positives = 58/140 (41%), Gaps = 20/140 (14%)
Query: 211 PSLDEAVREANIEIQTE---NRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELR 267
L +A+ E I Q E +N E +K + + DA+ + E ELR
Sbjct: 165 LDLTKALAE--IRAQYEELAEKNRQEAEEWYKSKLEELQQAAARNGDALRSAKEEITELR 222
Query: 268 NQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVS-- 325
QI L+ EL ++ + LE L E E+ E ++ ++T+S +
Sbjct: 223 RQIQSLEIELQSLKKQKASLERQLAE------------LEERYELELADYQDTISELEEE 270
Query: 326 -QKKLDDLQKDLEEAKELAN 344
Q+ ++ + L E +EL +
Sbjct: 271 LQQLKAEMARQLREYQELLD 290
Score = 30.2 bits (69), Expect = 2.6
Identities = 60/295 (20%), Positives = 109/295 (36%), Gaps = 74/295 (25%)
Query: 229 RNLTEQNTKLHAKYHSISLKISELQDAVNGK--------ETEAAELRNQIDDLQYELSKV 280
R L +QN +L AK I EL+ + + E E ELR Q+D+L E +++
Sbjct: 21 RFLEQQNKELEAK-------IEELRQKKSAEPSRLYSLYEQEIRELRKQLDELTNERARL 73
Query: 281 QSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIK-----PEETMSNVS-QKKLDDLQK 334
Q D L L + K +++ E D++ E T++ V + K++ LQ+
Sbjct: 74 QLEIDNLR--LAAEDFREKYEDELNLRQSAENDIVGLRKDLDEATLARVDLEMKIESLQE 131
Query: 335 DLEEAKELANNRLQELDKLHLQHR-------------EALKDV--------EKLKMDWEV 373
+L K+ ++EL +AL ++ EK + + E
Sbjct: 132 ELAFLKKNHEEEVRELQSQIQGQVNVEMDAARKLDLTKALAEIRAQYEELAEKNRQEAEE 191
Query: 374 TFNNEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQ 433
+ +++ L R L S E TE + Q
Sbjct: 192 WYKSKL---------EELQQAAARNGDAL-RSAKEEITEL-----------------RRQ 224
Query: 434 AQEFQSHLSASKN---NQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYE 485
Q + L + K + +RQ+ +E + + + +LE+ L Q++ E
Sbjct: 225 IQSLEIELQSLKKQKASLERQLAELEERYELELADYQDTISELEEELQQLKAEMA 279
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 35.8 bits (83), Expect = 0.062
Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 9/79 (11%)
Query: 216 AVREANIEIQT---ENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDD 272
V+E + EN L +N +L + SI +I Q AV E EL +I+
Sbjct: 67 EVKELRKRLAKLISENEALKAENERLQKREQSIDQQI---QQAVQS---ETQELTKEIEQ 120
Query: 273 LQYELSKVQSRNDKLENHL 291
L+ E ++Q D+L+ L
Sbjct: 121 LKSERQQLQGLIDQLQRRL 139
Score = 28.8 bits (65), Expect = 8.7
Identities = 11/44 (25%), Positives = 23/44 (52%)
Query: 505 NREMRHLITSLQNHNKQIKGEVNRYKRKYKETSAEIQKVRISER 548
+E+R + L + N+ +K E R +++ + +IQ+ SE
Sbjct: 68 VKELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSET 111
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 35.3 bits (82), Expect = 0.098
Identities = 37/195 (18%), Positives = 73/195 (37%), Gaps = 26/195 (13%)
Query: 321 MSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKD-VEKLKMDWEVTFNNEI 379
+ V K + KD EE K L + +L + + ++ L ++ +
Sbjct: 104 VVRVDNLKEEPFDKDDEEIKALTREIKETFREL-ISLSKLFREQPALLSALEDIDEPGRL 162
Query: 380 SWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQ---AQE 436
+ V + +L +++ E+V +E K L+ A +L E +L K Q ++
Sbjct: 163 ADFVAASLQLKEKDEL----QEVLETVNIE----KRLKKALELLKKELELLKLQNKITKK 214
Query: 437 FQSHLSASKNNQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNLA 496
+ + + Q E E+L KK G +D L ++++ + E
Sbjct: 215 VEEKME------KTQREYYLREQLKAIKKELGIEKDDKDELEKLKE-------KLEELKL 261
Query: 497 ANEQTGPINREMRHL 511
E I +E+ L
Sbjct: 262 PEEVKKVIEKELTKL 276
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 34.0 bits (78), Expect = 0.11
Identities = 26/110 (23%), Positives = 60/110 (54%), Gaps = 5/110 (4%)
Query: 260 ETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKE-KDVIKPE 318
+ E +EL+ Q+ +LQ EL ++ R +LE+ L + E ++L +++ + + + +
Sbjct: 78 QGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESI 137
Query: 319 ETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLK 368
+ + K+L +L++DL E E L+ L + + +EA+++++ L
Sbjct: 138 KEL----AKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLL 183
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 35.0 bits (80), Expect = 0.14
Identities = 49/307 (15%), Positives = 114/307 (37%), Gaps = 24/307 (7%)
Query: 215 EAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQ 274
+ +RE +++++ + K K+ +L+ + KE E L++ ID+L
Sbjct: 298 KTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELH 357
Query: 275 YELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQK 334
+L K ++ E E + + +++ + K +K + + K L+ +
Sbjct: 358 KQLRKQGISTEQFELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLR 417
Query: 335 DLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTF------NNEISWLVWSVNP 388
++ +Q + + Q + D LK++ E F N I + +N
Sbjct: 418 QY-------DSLIQNITRSRSQIGHNVNDSS-LKINIEQLFPKGSGINESIKKSILELND 469
Query: 389 TS---LTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHLSASK 445
+ T+ + I L E + + L L E A ++ + +
Sbjct: 470 EIQERIKTEENKSIT-LEEDIKNLKHDINELTQILEKLELELSEANSKFELSKEENEREL 528
Query: 446 NNQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNLAANEQTGPIN 505
Q+ +IE +E E + + ++ E + + + E + N + I+
Sbjct: 529 VAQRIEIEKLEKELNDLNLLSKTSILDAEQLVQSTEIKLD------ELKVDLNRKRYKIH 582
Query: 506 REMRHLI 512
+++ H+I
Sbjct: 583 KQVIHVI 589
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 34.6 bits (80), Expect = 0.14
Identities = 28/149 (18%), Positives = 55/149 (36%), Gaps = 30/149 (20%)
Query: 173 SKRAVAKIIQAFDRLVQRNEKITMILKGELNN--------SNGDPIPSLDEAVREANIEI 224
K V ++ + L + EK L+ +L + S D I S+++ + N I
Sbjct: 318 FKLDVEELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELI 377
Query: 225 QTENR---NLT-EQN------------------TKLHAKYHSISLKISELQDAVNGKETE 262
+ N NL E+N + + I+ L+ + E E
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437
Query: 263 AAELRNQIDDLQYELSKVQSRNDKLENHL 291
L +I +L+ +L+ ++ D++ L
Sbjct: 438 IKALEKEIKELEKQLTNIEPTADEINKLL 466
Score = 30.4 bits (69), Expect = 3.1
Identities = 23/116 (19%), Positives = 48/116 (41%), Gaps = 13/116 (11%)
Query: 251 ELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLE--ATEKVKLCQQMHTED 308
E++ + + E +L +I+ L+ E+ K + +K +N L+ + K +E
Sbjct: 92 EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEA 151
Query: 309 G---------KEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHL 355
KEK + + + ++ L L++ + K L ++ EL L L
Sbjct: 152 LKGLNYKKNFKEKLLKELKSV--ILNASSLLSLEELKAKIKTLFSSNKPELALLTL 205
Score = 29.2 bits (66), Expect = 7.2
Identities = 67/394 (17%), Positives = 133/394 (33%), Gaps = 46/394 (11%)
Query: 158 DKEELDEKLANRV--QVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDE 215
+ E+ K ++ +++K+ + + +A L + +LK EL + + L
Sbjct: 124 ELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEKLLK-ELKSVILNASSLLSL 182
Query: 216 AVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQY 275
+A I+ L N A + E++ A +++ I +L
Sbjct: 183 EELKAKIKT------LFSSNKPELALLTLSVIDFDEIEQAEILEKSIIGSSDVPISELIN 236
Query: 276 ELSKVQSRNDKLEN--HLLEATEKVKLCQQMHTEDGKEK--DVIKPEETMSNVSQKKLDD 331
L +D ++ E + CQQ TE+ K + E ++L++
Sbjct: 237 NLGN----SDWVKEGLEYHEEGDTCPFCQQTITEERKAELEAHFD--EEYQE-LIEQLEE 289
Query: 332 LQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISWLVWSVNPTSL 391
L E E A L+ + ++ E DVE+LK E + N L
Sbjct: 290 LIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEA-LEEIL-----EKNLQKL 343
Query: 392 TTQLPRQIRQL-PESVIVETTEYKCLQSAYSVLYNE-NQLAKTQAQEFQSHLSASKNNQQ 449
+L + ES+ + A + L E N+ +E +
Sbjct: 344 EEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLV 403
Query: 450 RQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNLAANEQTGPINREMR 509
+++ +K L + LE + Q+ E + + +E++
Sbjct: 404 AELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIK-----------------ALEKEIK 446
Query: 510 HLITSLQNHNKQIKGEVNRYKRKYKETSAEIQKV 543
L L N E+N+ + Y + E++K
Sbjct: 447 ELEKQLTNIEPTAD-EINKLLKAYGFGNFELEKE 479
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 34.3 bits (78), Expect = 0.19
Identities = 55/276 (19%), Positives = 104/276 (37%), Gaps = 37/276 (13%)
Query: 231 LTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENH 290
L + L S+ + + A+ + EA LR Q+ Q EL + ++L +
Sbjct: 167 LRSKAEGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQEELEAQVTLVEQLRKY 226
Query: 291 LLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQEL 350
+ E Q E G+ + K L + + L+ EL R+Q L
Sbjct: 227 VGEQVVAESRSQAWELERGELLQTV-----------KHLQEDRAALQATVELLQVRVQSL 275
Query: 351 -DKLHLQHREALKDVEKLKM--------------DW-EVTFNNEISWLVWSVNPTSLTTQ 394
L LQ E + + + + W E F + ++ T Q
Sbjct: 276 THILALQEEELTRKAQPVDILEPEFARKCRSLLSRWREKVFALMVQLKAQELSHADSTCQ 335
Query: 395 LPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHLSASKNNQQRQIEM 454
L +Q+ +L E V ++ E LQ + E ++ + ++ Q LS ++ ++R +
Sbjct: 336 LQKQVAELQEEVTSQSQEQAILQRSLQDKEAEVEVERMNSKVLQGELSRAQEARRRLQQQ 395
Query: 455 METEELMV----------QKKLRGEMMQLEDALSQI 480
+ + E + Q+ L M ++E AL+Q+
Sbjct: 396 VASAEEQLQLVVEAVHSSQQGLLATMAEVEVALAQL 431
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 33.7 bits (77), Expect = 0.19
Identities = 23/104 (22%), Positives = 45/104 (43%), Gaps = 8/104 (7%)
Query: 215 EAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQ 274
E ++E E+Q E L ++ +L A+Y E+Q+ + E E + L + L
Sbjct: 138 EELKEKLEELQKEKEELLKELEELEAEYE-------EVQERLKRLEVENSRLEEMLKKLP 190
Query: 275 YELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPE 318
E+ ++ R D+LE +E E+ + + I+ +
Sbjct: 191 GEVYDLKKRWDELEPG-VELPEEELISDLVKETLNLAPKDIEGQ 233
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 34.1 bits (79), Expect = 0.25
Identities = 40/171 (23%), Positives = 69/171 (40%), Gaps = 32/171 (18%)
Query: 153 QLSTWDKE--ELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPI 210
QL + E + +L + +RA A + QR ++I +LKG G +
Sbjct: 136 QLQNAQNDLAEYNSQLVSLQTQPERAQAALYAN----SQRLQQIRNLLKG--GKVGGKAL 189
Query: 211 -PSLDE--AVREANIEIQTE-NRNLTEQNTKLHAKYHS----ISLKIS-------ELQDA 255
PS +A + Q + R E NT+L ++ +I LQ+A
Sbjct: 190 RPSQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEA 249
Query: 256 VNGK-----ETEAAELRNQIDDLQYE----LSKVQSRNDKLENHLLEATEK 297
+N K E E ++Q + + + +++ N +L LL+ATEK
Sbjct: 250 INSKRLTLSEKTVQEAQSQDEAARIQANPLVAQELEINLQLSQRLLKATEK 300
>gnl|CDD|163611 pfam10046, BLOC1_2, Biogenesis of lysosome-related organelles
complex-1 subunit 2. Members of this family of proteins
play a role in cellular proliferation, as well as in the
biogenesis of specialized organelles of the
endosomal-lysosomal system.
Length = 99
Score = 31.5 bits (72), Expect = 0.26
Identities = 23/85 (27%), Positives = 40/85 (47%), Gaps = 4/85 (4%)
Query: 219 EANIEIQTENRNLTEQ-NTKLHAKYHSISLKISELQ---DAVNGKETEAAELRNQIDDLQ 274
EA +EI E+ L E+ N KY + L+ +A+N K E QID ++
Sbjct: 13 EAELEISVEDLKLLEKMNENTALKYKKMRQSAKGLRVEIEALNQKYEELKPYLQQIDAIE 72
Query: 275 YELSKVQSRNDKLENHLLEATEKVK 299
+++ ++ +L+ + E KVK
Sbjct: 73 QQVTTLEEAVYELDEYSKELESKVK 97
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 33.7 bits (77), Expect = 0.30
Identities = 22/119 (18%), Positives = 52/119 (43%), Gaps = 6/119 (5%)
Query: 220 ANIEIQTENRNLTEQNTKLHAKYHSISLKISE-LQDAVNGKETEAAELRNQIDDLQYELS 278
A+ E Q + ++ + Y+ + + E L+ +E EL+ ++ L+ +
Sbjct: 318 ASKEQQDGLDSEKDRGSHEDGDYYEVDINGPEILECKYRVAVSEVGELKAELKALKARYN 377
Query: 279 KVQSR----NDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPE-ETMSNVSQKKLDDL 332
++Q R D+L N + EK+ ++ ED + ++ E +++V+ + L
Sbjct: 378 ELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGDLEKELRKLTHVAGESQGSL 436
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 33.5 bits (77), Expect = 0.32
Identities = 63/358 (17%), Positives = 117/358 (32%), Gaps = 64/358 (17%)
Query: 219 EANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELS 278
++ R +E + + ++ ++ ++ E + E A LRN++++ Q L
Sbjct: 188 AGDLTNVLRRRKKSELPSSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLE 247
Query: 279 KVQ-----------SRNDKLENHLLEATEKVK-LCQQMHTEDGKEKDVIKPEETMSNVSQ 326
++ ++LE L E K Q+ P + N+
Sbjct: 248 SLEKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADP----LPLLLIPNLLD 303
Query: 327 KKLDDLQKD---------LEEAKELANNRLQELDKLHLQHREALKDVEK--LKMDWEVTF 375
LQK+ EE +E L+ L KL L E +K++ ++D T
Sbjct: 304 STKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALP-AEHVKEIAAELAEIDKPATT 362
Query: 376 NNEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQ 435
++EI + T L + + R+ LQ A S L E + + +
Sbjct: 363 DSEIPHRLSGSELTQLEVLIQQVKRE--------------LQDAKSQLLKELRELEEELA 408
Query: 436 EFQSHLSASKNNQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNL 495
E S + QI +L E+ + ++ L + E E L + ET
Sbjct: 409 EV--DKKISTIPSEEQIA-----------QLLEELGEAQNELFRSEAEIEELLRQLETLK 455
Query: 496 AANEQTGPINREMRHLITSLQNHNKQIK--GEVNRYKRKYKETSAEIQKVRISERSPW 551
A E R + K K+T E ++ + +
Sbjct: 456 EAIEAL-------RKTLDEKTKQKINAFELERAITIADKAKKTLKEFREKLLERKLQQ 506
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 33.1 bits (76), Expect = 0.33
Identities = 31/154 (20%), Positives = 62/154 (40%), Gaps = 11/154 (7%)
Query: 152 MQLSTWDKEELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIP 211
M+L KE LDE L ++ + + K ++ + + + L+ EL
Sbjct: 139 MKLLEGLKEGLDENL-EGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQ------- 190
Query: 212 SLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQID 271
L + E TE E+ KL + K+ EL++ + E++ +L N+
Sbjct: 191 -LKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKS 249
Query: 272 DLQYELSKVQSRNDKLENHLLEATEKVKLCQQMH 305
+L E+++ + + + E KL +Q+
Sbjct: 250 ELNTEIAEAEKK--LEQCRGFTFKEIEKLKEQLK 281
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 33.5 bits (77), Expect = 0.34
Identities = 29/118 (24%), Positives = 58/118 (49%), Gaps = 12/118 (10%)
Query: 260 ETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQ-----MHTEDGKEKDV 314
+ + AELR + + Q + ++ S+ +L+ L +A + +Q D
Sbjct: 363 DAQTAELRVKTEQAQASVHQLDSQFAQLDGKLADAQSAQQALEQQYQDLSRNRDDWMIAE 422
Query: 315 IKPEETMSNVSQK-KLD-DLQKDLEEAKELANNRLQELDKLH-LQHREAL-KDVEKLK 368
+ E+ +S+ SQ+ +L ++Q L A + A+ RL D + R+A+ +D+E+LK
Sbjct: 423 V--EQMLSSASQQLQLTGNVQLAL-IALQNADARLATSDSPQAVAVRKAIAQDIERLK 477
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 33.1 bits (76), Expect = 0.35
Identities = 20/123 (16%), Positives = 56/123 (45%), Gaps = 14/123 (11%)
Query: 263 AAELRNQIDDLQYELSKVQSRNDKLENHLLEATE-------KVKLCQQMHTEDGKEKDVI 315
A RN + +L+ +L+++ N++L+ + EA E K+ K++ +
Sbjct: 15 AHCARNSLYELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLN 74
Query: 316 KPEETMSN------VSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKD-VEKLK 368
+ +S ++++++L++ L + + ++ +L+K E L+D +++ +
Sbjct: 75 QIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTR 134
Query: 369 MDW 371
Sbjct: 135 SKL 137
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 33.5 bits (76), Expect = 0.38
Identities = 48/285 (16%), Positives = 113/285 (39%), Gaps = 24/285 (8%)
Query: 234 QNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLL- 292
Q + L ++ + E Q ++ ++ R I D Q ++ ++S+ ++L++ L
Sbjct: 816 QGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQ 875
Query: 293 -------------EATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEA 339
+ E Q + E K+ P ET Q++ ++L E +
Sbjct: 876 IGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETS 935
Query: 340 KELANNRLQ----ELDKLHLQHREALKDVEKLKMDWEVTFNNEISWLVWSVNPTSL-TTQ 394
+ A +++ ++ +H ++ ++ K D+ E++ + + +
Sbjct: 936 NKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEK 995
Query: 395 LPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHLSASKNNQQRQIEM 454
+ +R + + + + + + LQ ++ EN+L K +E + HL K Q Q+
Sbjct: 996 INEDMRLMRQDIDTQKIQERWLQDNLTLRKRENEL-KEVEEELKQHL---KEMGQMQVLQ 1051
Query: 455 METEELMVQKKLRGEMMQLEDALSQIRKEYEMLRIEFETNLAANE 499
M+ E +++ + AL + +K YE F+ L +
Sbjct: 1052 MKQEHQKLEENIDLIKRNHVLALGR-QKGYEKEIKHFKKELREPQ 1095
>gnl|CDD|240314 PTZ00208, PTZ00208, 65 kDa invariant surface glycoprotein;
Provisional.
Length = 436
Score = 33.4 bits (76), Expect = 0.38
Identities = 33/145 (22%), Positives = 60/145 (41%), Gaps = 19/145 (13%)
Query: 230 NLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQ--IDDLQYELSKVQSRNDKL 287
+ N KL + + K+ L D V + + + R Q +++EL ++ +KL
Sbjct: 26 KHVKANKKLTKEGAAALCKMKHLADKVAEERADKLKDRTQNFAGYIEFELDRLDYWLEKL 85
Query: 288 ENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRL 347
H+ DG K E + + +K D + K L EAK+
Sbjct: 86 NG-------------PKHSNDGYAKLSDSDTEKVKEIFEKAKDGIAKQLPEAKKAG---- 128
Query: 348 QELDKLHLQHREALKDVEKLKMDWE 372
+E +KLH + ++A K+ ++D E
Sbjct: 129 EEAEKLHQEVKKAAKNARGWRLDDE 153
>gnl|CDD|111923 pfam03082, MAGSP, Male accessory gland secretory protein. The
accessory gland of male insects is a genital tissue that
secretes many components of the ejaculatory fluid, some
of which affect the female's receptivity to courtship
and her rate of oviposition. This protein is expressed
exclusively in the male accessory glands of adult
Drosophila melanogaster. The proteins are transferred to
the female fly during copulation and are rapidly altered
in the female genital tract.
Length = 267
Score = 32.8 bits (74), Expect = 0.39
Identities = 23/82 (28%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 227 ENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDK 286
++R L EQN L + HSISL + ++T+ + R DL+ EL ++ + +
Sbjct: 130 KHRLLMEQNNNLFLRNHSISL-----MKEIEARKTDIIKARQLNIDLELELESLKRKLLE 184
Query: 287 LENHLLEATEKVKLCQQMHTED 308
L L A + K C++ ++D
Sbjct: 185 LNGQLQNARKSTKPCKKRPSKD 206
>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
transferase subunit alpha; Provisional.
Length = 762
Score = 32.9 bits (75), Expect = 0.53
Identities = 36/145 (24%), Positives = 62/145 (42%), Gaps = 20/145 (13%)
Query: 208 DPIPSLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELR 267
+ + +L E +AN + Q + L E+ KL +++ L A N L+
Sbjct: 486 ERLENLREEFSKANSQDQLMHPVLMEKIEKLKDEFN------KRLSRAPN-----YLSLK 534
Query: 268 NQIDDLQYELSKVQS---RNDKLENHLLEATEKVKLCQQMHTEDGKEK-DVIKPEETMSN 323
++D L E S+ ++ + K E E +K K + M + KEK + +K E S
Sbjct: 535 YKLDMLN-EFSRAKALSEKKSKAEKLKAEINKKFK--EVMDRPEIKEKMEALKAEVASSG 591
Query: 324 VSQKKLDDLQKDLEEAKELANNRLQ 348
S D+L DL+E E ++
Sbjct: 592 AS--SGDELDDDLKEKVEKMKKEIE 614
Score = 30.6 bits (69), Expect = 2.9
Identities = 28/95 (29%), Positives = 49/95 (51%), Gaps = 12/95 (12%)
Query: 215 EAVREANIEIQTENRNLTEQNTKLHAKYHS-------ISLKISELQDAVNGKETEAAE-- 265
EAV+ E++ E L EQ K AK S ++ I +L+ ++ + TEA
Sbjct: 425 EAVKTPVRELEGEVEKLKEQILK--AKESSSKPSELALNEMIEKLKKEIDLEYTEAVIAM 482
Query: 266 -LRNQIDDLQYELSKVQSRNDKLENHLLEATEKVK 299
L+ ++++L+ E SK S++ + L+E EK+K
Sbjct: 483 GLQERLENLREEFSKANSQDQLMHPVLMEKIEKLK 517
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 32.2 bits (73), Expect = 0.70
Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 5/124 (4%)
Query: 213 LDEAVREANIEIQTENR--NLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQI 270
LD +R+A E T + + E + Y ++ ++ EL+ E EL +
Sbjct: 34 LDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKED 93
Query: 271 DDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLD 330
DDL EL ++Q ++LEN L+ + L + + + ++ + S +LD
Sbjct: 94 DDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEY---SLNQLD 150
Query: 331 DLQK 334
L+K
Sbjct: 151 KLRK 154
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 32.0 bits (73), Expect = 0.86
Identities = 21/99 (21%), Positives = 44/99 (44%), Gaps = 5/99 (5%)
Query: 18 PVKKVQFEPIRIGPISTLEEMDIKVLKFQNKKLAQRIEQKNQVEHELRSRIEQLEKRQTQ 77
V+K + R + LE + K + + + E + Q+++EL R+ L T
Sbjct: 86 LVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELY-RVAGL----TP 140
Query: 78 DDAVLNVVNRYWNQLNEDIRILLQRFDAETADESESKSQ 116
+ A ++ +L E+ +++ + E E+E K+Q
Sbjct: 141 EQARKLLLKLLDAELEEEKAQRVKKIEEEADLEAERKAQ 179
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 32.2 bits (73), Expect = 0.88
Identities = 43/301 (14%), Positives = 86/301 (28%), Gaps = 2/301 (0%)
Query: 55 EQKNQVEHELRSRIEQLEKRQTQDDAVLNVVNRYWNQLNEDIRILLQRFDAETADESESK 114
E K Q + LE Q ++ L N + + + DE E
Sbjct: 195 ELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEI 254
Query: 115 SQAALRALDVCAPPAY-GNTNVMCLAITDENEATTSFLM-QLSTWDKEELDEKLANRVQV 172
+ A N E L + E L +
Sbjct: 255 ESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDE 314
Query: 173 SKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVREANIEIQTENRNLT 232
K ++ + EK + + + +E + + +
Sbjct: 315 EKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEE 374
Query: 233 EQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLL 292
E K + +S ++ + K E E + ++ + E ++ + +
Sbjct: 375 ELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVE 434
Query: 293 EATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDK 352
E E ++ Q TE+ +E + + + KK +DL K+ + K L L L +
Sbjct: 435 ELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLLLRQ 494
Query: 353 L 353
Sbjct: 495 K 495
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 32.1 bits (73), Expect = 0.99
Identities = 60/368 (16%), Positives = 137/368 (37%), Gaps = 41/368 (11%)
Query: 192 EKITMILKGELNNSNGDPIPSLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISE 251
EK +M K + +S+ + +DE ++ N I+ +N + + Y I
Sbjct: 479 EKDSMDSKEKKESSDSNYQEKVDELLQIIN-SIKEKNNIVNNNFKNIEDYYI----TIEG 533
Query: 252 LQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKE 311
L++ + G I+ L + +S + ++N + E V + + + + +
Sbjct: 534 LKNEIEGLIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHIKDIISLNDEI 593
Query: 312 KDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDW 371
++I+ E + N + + + + +E L + K LQ E L ++ D
Sbjct: 594 DNIIQQIEELINEALFNKEKFINEKNDLQEKVKYILNKFYKGDLQ--ELLDELSHFLDDH 651
Query: 372 EVTFNNEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAK 431
+ ++ S ++ L + E + + ++S N + + K
Sbjct: 652 KYLYHEAKS---------------KEDLQTLLNTSKNEYEKLEFMKSD-----NIDNIIK 691
Query: 432 TQAQEFQSHLSASKNNQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRKEYEMLRI-- 489
+E Q+ LS +N ++Q+ +E + + + L+ ++ + ++E E L +
Sbjct: 692 NLKKELQNLLSLKENIIKKQLNNIEQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYK 751
Query: 490 --------EFETNLAANEQTGP----INREMRHLITSLQNHNKQIKGEVNRYKRKYKETS 537
EF +L N++ P E ++ N +I ++N K K
Sbjct: 752 HQIINRKNEFILHLYENDKDLPDGKNTYEEFLQYKDTILNKENKISNDINILKENKKNNQ 811
Query: 538 AEIQKVRI 545
+ I
Sbjct: 812 DLLNSYNI 819
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 31.9 bits (72), Expect = 1.4
Identities = 38/143 (26%), Positives = 58/143 (40%), Gaps = 28/143 (19%)
Query: 260 ETEAAELRNQIDDLQYELSKVQSRNDKL-----ENHLLEATEKVKLCQQMHTE---DGKE 311
E E +L +++D+ +LS S N+KL E LLE +K + E KE
Sbjct: 3882 EEENQDLDEEVNDIPEDLS--NSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKE 3939
Query: 312 KDVIKPE----------ETMSN-VSQKKLDDLQKDLEEAKELANNRLQELD-----KLHL 355
D E E MS+ V D++Q D++E ++LD KL
Sbjct: 3940 DDNKALEDKDRQEKEDEEEMSDDVGID--DEIQPDIQENNSQPPPENEDLDLPEDLKLDE 3997
Query: 356 QHREALKDVEKLKMDWEVTFNNE 378
+ + KD + MD E N+
Sbjct: 3998 KEGDVSKDSDLEDMDMEAADENK 4020
>gnl|CDD|215104 PLN00207, PLN00207, polyribonucleotide nucleotidyltransferase;
Provisional.
Length = 891
Score = 31.4 bits (71), Expect = 1.5
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 6/57 (10%)
Query: 163 DEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILKGELNNSNGDPIPSLDEAVRE 219
+EKL V+V + AV I + + LV++ K M+ +L P P L + V+E
Sbjct: 273 EEKLLEAVEVGQDAVRAICKEIEVLVKKCGKPKMLDAIKL------PPPELYKHVKE 323
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 31.5 bits (72), Expect = 1.5
Identities = 30/166 (18%), Positives = 67/166 (40%), Gaps = 8/166 (4%)
Query: 213 LDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDD 272
+ E +E + ++ +L + + I+E ++ + K A ELR + +
Sbjct: 489 EEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAE 548
Query: 273 LQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDL 332
L+ E + + + E EA E+V E + + ++ T + D
Sbjct: 549 LEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRT----LLAAIADA 604
Query: 333 QKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNE 378
+ ++E +E + + L +L+ + RE L + + K + E F+
Sbjct: 605 EDEIERLRE----KREALAELNDERRERLAEKRERKRELEAEFDEA 646
Score = 31.5 bits (72), Expect = 1.6
Identities = 30/155 (19%), Positives = 74/155 (47%), Gaps = 4/155 (2%)
Query: 215 EAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQ 274
E RE ++ E +L E + + ++ ++ +L++ + E E +L +
Sbjct: 247 EERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDD 306
Query: 275 YELSKVQSRNDKLENH---LLEATEKVKLCQQMHTEDGK-EKDVIKPEETMSNVSQKKLD 330
+ V++R ++LE+ L + E+ ++ Q H E+ + ++ E + +++
Sbjct: 307 ADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAA 366
Query: 331 DLQKDLEEAKELANNRLQELDKLHLQHREALKDVE 365
+L+ +LEEA+E +R +E+++L + E +
Sbjct: 367 ELESELEEAREAVEDRREEIEELEEEIEELRERFG 401
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 31.3 bits (71), Expect = 1.7
Identities = 57/345 (16%), Positives = 108/345 (31%), Gaps = 40/345 (11%)
Query: 28 RIGPISTLEEM-DIKVLKFQNKKLAQRIEQKNQVEHELRSRIEQLEKRQTQDDAVLNVVN 86
+IGP + E + ++ + + + ++K ++ L+S + LE+R+ +
Sbjct: 325 QIGPDADEEAVESLRPSLAAKETVTELEKRKEALDQALKSARDALEEREREL-------- 376
Query: 87 RYWNQLNEDIRILLQRFDAETADESESKSQA---ALRALDVCAPPAYGNTNVMCLAITDE 143
Q+ + L T E A ALRA D+ A A E
Sbjct: 377 ---KQVRAQLAAL------PTVTVPEPLRAALADALRAGDIDAALAALEQE--VAVAKRE 425
Query: 144 NEATTSFLMQLSTWDKE--ELDEKLANRVQVSKRAVAKIIQAFDRLVQRNEKITMILK-- 199
S L +E L A +VQ + +I+QA + R ++ L+
Sbjct: 426 LAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAEEIMQAKRQKRDRLLELEEDLETL 485
Query: 200 --GELNNSNGDPIPSLDE--AVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDA 255
+ +P+ +E A R + + R + Y +L D
Sbjct: 486 ELQLRHLDAAGAVPTEEEVAAARARRDALWQDIRRGYALLEPTASAYEEAVESADQLADR 545
Query: 256 VNGKETEAAEL---RNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEK 312
+ + EL R Q + + L +++ + LE L E +
Sbjct: 546 LLREAQLVGELQSLRQQEEAARRRLEQLEKELEVLELALAALREAWQAQWAAAGLPLTPA 605
Query: 313 DVIKPEETMSNVS------QKKLDDLQKDLEEAKELANNRLQELD 351
++ + K +L L+ L L
Sbjct: 606 EMEDWLAERATAREQVRAYFKARAELDALLDRRARLRAALRAALK 650
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 31.0 bits (70), Expect = 1.7
Identities = 26/118 (22%), Positives = 53/118 (44%), Gaps = 10/118 (8%)
Query: 265 ELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNV 324
+R I++ Q E +++ +N KL N L A K K + E +S
Sbjct: 89 NIRESIEERQKESDQLKLQNQKLVNQLGHARGVFI----------KTKGRYQKLEELSRH 138
Query: 325 SQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNEISWL 382
+++ LQ L+ + N +++E +L+ + E L ++L +++ TF + + L
Sbjct: 139 LKEENQCLQIQLDALVQECNEKIEENQELNRELAETLAYQQELNDEYQATFVEQHNML 196
>gnl|CDD|181613 PRK09019, PRK09019, translation initiation factor Sui1; Validated.
Length = 108
Score = 29.2 bits (66), Expect = 1.9
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 216 AVREANIEIQTENRNLTEQ 234
AV++ IEIQ + R+L +
Sbjct: 76 AVKDGVIEIQGDKRDLLKS 94
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 30.7 bits (70), Expect = 1.9
Identities = 17/95 (17%), Positives = 44/95 (46%), Gaps = 3/95 (3%)
Query: 244 SISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQ 303
++ +I+EL D ++ + +L++ + +L+ LS ++ +L+ L E +
Sbjct: 57 RLNSQIAELADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEG 116
Query: 304 MHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEE 338
E +E D E+ +S + +++ L + +
Sbjct: 117 RAGELAQELD---SEKQVSARALAQVELLNQQIAA 148
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.9 bits (71), Expect = 2.2
Identities = 21/95 (22%), Positives = 46/95 (48%), Gaps = 15/95 (15%)
Query: 291 LLEATEKV-KLCQQMHTE--------DGKEKDVIKPEETMSNVSQ------KKLDDLQKD 335
LLEA E++ KL + E EK +++ EE + + ++L+ +K+
Sbjct: 59 LLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKE 118
Query: 336 LEEAKELANNRLQELDKLHLQHREALKDVEKLKMD 370
LE+ ++ + +EL++L + + L+ + L +
Sbjct: 119 LEQKQQELEKKEEELEELIEEQLQELERISGLTAE 153
>gnl|CDD|216301 pfam01105, EMP24_GP25L, emp24/gp25L/p24 family/GOLD. Members of
this family are implicated in bringing cargo forward
from the ER and binding to coat proteins by their
cytoplasmic domains. This domain corresponds closely to
the beta-strand rich GOLD domain described in. The GOLD
domain is always found combined with lipid- or
membrane-association domains.
Length = 178
Score = 29.9 bits (68), Expect = 2.5
Identities = 13/38 (34%), Positives = 20/38 (52%)
Query: 464 KKLRGEMMQLEDALSQIRKEYEMLRIEFETNLAANEQT 501
L E+ +LED L+ I++E + LR + NE T
Sbjct: 110 DPLEEELKKLEDQLNDIKREQKYLREREARHRETNEST 147
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 30.3 bits (69), Expect = 2.7
Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 6/113 (5%)
Query: 245 ISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQM 304
+ E Q A + AA R +I L E++ +Q+ D+ + + A ++ + C+
Sbjct: 114 LLEIQQEEQAAA--QAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQ-CEAE 170
Query: 305 ---HTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLH 354
T + V K + + +Q +L+ L+ L+ A + L++
Sbjct: 171 GTGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEAQKAALERNR 223
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 30.3 bits (69), Expect = 3.1
Identities = 28/135 (20%), Positives = 59/135 (43%), Gaps = 25/135 (18%)
Query: 213 LDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDD 272
++EA +EA + L E ++H + E E E RN++
Sbjct: 38 IEEAKKEAE---TLKKEALLEAKEEVHKLRAEL--------------ERELKERRNELQR 80
Query: 273 LQYELSKVQSRNDKLENHLLEATEKVK-LCQQMHTEDGKEKDVIKPEETMSNVSQKKLDD 331
L+ ++ R + L+ + +K + L ++ KEK++ + EE + + ++ ++
Sbjct: 81 LE---RRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREE 137
Query: 332 LQK----DLEEAKEL 342
L++ EEAKE+
Sbjct: 138 LERISGLTQEEAKEI 152
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 30.3 bits (68), Expect = 3.2
Identities = 38/131 (29%), Positives = 57/131 (43%), Gaps = 22/131 (16%)
Query: 252 LQDAVNGK----ETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTE 307
+QDA K E AEL ++ +LQ EL++ + ND L L E ++ M E
Sbjct: 72 IQDAATEKTKEWEKVKAELEKKLLELQKELARAAAENDALSRSLQEKSK-------MIME 124
Query: 308 DGKEKDVIKPE-----------ETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQ 356
+EK + E E ++ + ++ L K+LE E N LQ D Q
Sbjct: 125 LSEEKSRAESEIEDLKARLESAEKENSSLKYEVHVLSKELEIRNEERNMSLQSADAASKQ 184
Query: 357 HREALKDVEKL 367
H E +K + KL
Sbjct: 185 HLEGVKKIAKL 195
>gnl|CDD|213374 cd12085, DD_cGKI-alpha, Dimerization/Docking domain of Cyclic
GMP-dependent Protein Kinase I alpha. Cyclic
GMP-dependent Protein Kinase I (PKG1 or cGKI) is a
Serine/Threonine Kinase (STK), catalyzing the transfer
of the gamma-phosphoryl group from ATP to
serine/threonine residues on protein substrates. cGKI
exists as two splice variants, cGKI-alpha and cGKI-beta.
They contain an N-terminal regulatory domain containing
a dimerization/docking region and an autoinhibitory
pseudosubstrate region, two cGMP-binding domains, and a
C-terminal catalytic domain. Binding of cGMP to both
binding sites releases the inhibition of the catalytic
center by the pseudosubstrate region, allowing
autophosphorylation and activation of the kinase. cGKI
is a soluble protein expressed in all smooth muscles,
platelets, cerebellum, and kidney. It is involved in the
regulation of smooth muscle tone, smooth cell
proliferation, and platelet activation. The
dimerization/docking (D/D) domain is a
leucine/isoleucine zipper that mediates both
homodimerization and interaction with isotype-specific
G-kinase-anchoring proteins (GKAPs). The D/D domain of
the two variants (alpha and beta) differ, allowing for
their targeting to different subcellular compartments
and intracellular substrates. cGKI-alpha specifically
binds to myosin light chain phosphatase targeting
subunit (MYPT1) and the regulator of G-protein
signaling-2 (RGS-2). cGKI-alpha activates the
phosphatase activity of MYPT1, resulting in
vasorelaxation. It increases the activity of RGS-2
toward G proteins, with implications in the downstream
signaling for vasoconstrictive agents.
Length = 48
Score = 26.9 bits (60), Expect = 3.3
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 248 KISELQDAVNGKETEAAELRNQIDDLQ 274
+I EL+ + ++ E ELR+Q+D Q
Sbjct: 17 RIRELEQLLQQRDEEIQELRSQLDKFQ 43
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 30.3 bits (68), Expect = 3.5
Identities = 26/121 (21%), Positives = 59/121 (48%)
Query: 248 KISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTE 307
K + LQ+ + G + E +L + L+ ++S ++SR ++LE L +TEK + ++ H E
Sbjct: 137 KATILQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKE 196
Query: 308 DGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKL 367
+ + E + + +LE+ + + QE D++ ++ ++E++
Sbjct: 197 AQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERM 256
Query: 368 K 368
K
Sbjct: 257 K 257
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 30.3 bits (69), Expect = 3.5
Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 27/110 (24%)
Query: 263 AAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMS 322
AA +R +ID EL ++ R +LE E+ L KEKD
Sbjct: 393 AARIRMEIDSKPEELDELDRRIIQLE------IEREAL--------KKEKDE-------- 430
Query: 323 NVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWE 372
S+++L+DL+K+L E +E +L++ + A++ ++++K + E
Sbjct: 431 -ASKERLEDLEKELAELEE----EYADLEEQWKAEKAAIQGIQQIKEEIE 475
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 30.3 bits (69), Expect = 3.6
Identities = 9/49 (18%), Positives = 26/49 (53%)
Query: 245 ISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLE 293
+ + L++ + E E ++ +I +L+ E+S++++ +LE +
Sbjct: 77 KKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIER 125
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 29.6 bits (67), Expect = 3.7
Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 17/91 (18%)
Query: 215 EAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETE------------ 262
+ RE E+Q E L E+ +L + + + EL+ ++ E E
Sbjct: 62 PSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIK 121
Query: 263 -----AAELRNQIDDLQYELSKVQSRNDKLE 288
A EL + +L+ EL++++ N+ LE
Sbjct: 122 QLSANAIELDEENRELREELAELKQENEALE 152
Score = 29.2 bits (66), Expect = 4.9
Identities = 14/65 (21%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 227 ENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDK 286
+R L++ + + + +++ELQ+ + + + AEL+ + +L+ ELS +++ ++
Sbjct: 54 LSRYLSDTPS-ARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELER 112
Query: 287 LENHL 291
L+ L
Sbjct: 113 LQKEL 117
>gnl|CDD|217077 pfam02514, CobN-Mg_chel, CobN/Magnesium Chelatase. This family
contains a domain common to the cobN protein and to
magnesium protoporphyrin chelatase. CobN is implicated
in the conversion of hydrogenobyrinic acid a,c-diamide
to cobyrinic acid. Magnesium protoporphyrin chelatase is
involved in chlorophyll biosynthesis.
Length = 1079
Score = 30.2 bits (69), Expect = 3.9
Identities = 24/90 (26%), Positives = 30/90 (33%), Gaps = 23/90 (25%)
Query: 78 DDAVLNVVNR----YWNQLNEDIRILLQRFDAETADESESKSQAALRALDV--CAPPAYG 131
D+AV V N + + L E DE E++ A R V AP AYG
Sbjct: 813 DEAVRLVAALDEPDEMNYVRKHSLELEAELLEEGYDEEEARELATAR---VFGSAPGAYG 869
Query: 132 NTNVMCLAITDENEATTSFLMQLSTWDKEE 161
V E S S W+ E
Sbjct: 870 -AGV--------QELIES-----SAWEDES 885
>gnl|CDD|99863 cd06110, BSuCS-II_like, Bacillus subtilis (Bs) citrate synthase
(CS)-II_like. CS catalyzes the condensation of acetyl
coenzyme A (AcCoA) and oxalacetate (OAA) to form citrate
and coenzyme A (CoA), the first step in the citric acid
cycle (TCA or Krebs cycle). 2MCS catalyzes the
condensation of propionyl-coenzyme A (PrCoA) and OAA to
form 2-methylcitrate and CoA during propionate
metabolism. The overall CS reaction is thought to
proceed through three partial reactions: a) the
carbanion or equivalent is generated from AcCoA by base
abstraction of a proton, b) the nucleophilic attack of
this carbanion on OAA to generate citryl-CoA, and c) the
hydrolysis of citryl-CoA to produce citrate and CoA.
This group contains proteins similar to BsCS-II, the
major CS of the gram-positive bacterium Bacillus
subtilis. A mutation in the gene which encodes BsCS-II
(citZ gene) has been described which resulted in a
significant loss of CS activity, partial glutamate
auxotrophy, and a sporulation deficiency, all of which
are characteristic of strains defective in the Krebs
cycle. Streptococcus mutans CS, found in this group, may
participate in a pathway for the anaerobic biosynthesis
of glutamate. This group also contains functionally
uncharacterized CSs of various gram-negative bacteria.
Some of the gram-negative species represented in this
group have a second CS isozyme found in another group.
This group contains proteins which functions exclusively
as either a CS or a 2MCS, as well as those with relaxed
specificity which have dual functions as both a CS and a
2MCS.
Length = 356
Score = 29.6 bits (67), Expect = 4.5
Identities = 13/43 (30%), Positives = 21/43 (48%), Gaps = 5/43 (11%)
Query: 154 LSTWDKEELDEKLANRVQVSKRAVAK---IIQAFDRLVQRNEK 193
L+ +D E D ++ + R +AK I+ AF R+ RN
Sbjct: 98 LALYDPEADDMSREANLRKAIRLIAKMPTIVAAFHRI--RNGL 138
>gnl|CDD|233583 TIGR01813, flavo_cyto_c, flavocytochrome c. This model describes a
family of redox proteins related to the succinate
dehydrogenases and fumarate reductases of E. coli,
mitochondria, and other well-characterized systems. A
member of this family from Shewanella frigidimarina
NCIMB400 is characterized as a water-soluble periplasmic
protein with four heme groups, a non-covalently bound
FAD, and essentially unidirectional fumarate reductase
activity. At least seven distinct members of this family
are found in Shewanella oneidensis, a species able to
use a wide variety of pathways for respiraton [Energy
metabolism, Electron transport].
Length = 439
Score = 29.6 bits (67), Expect = 4.9
Identities = 19/107 (17%), Positives = 43/107 (40%), Gaps = 10/107 (9%)
Query: 441 LSASKNNQQRQIEMMETEELMVQKKLR-GEMMQLEDALSQIRKE----YEMLRIEFETNL 495
++A+ +QQ+ + + ++ EL ++ L+ G + D + + +E + L+ L
Sbjct: 46 MNAAGTDQQKALGIEDSPELFIKDTLKGGRGINDPDLVRILAEESADAVDWLQDGVGARL 105
Query: 496 AANEQTGPINREMRHLITSLQNHNKQIKGE-VNRYKRKYKETSAEIQ 541
Q G + H T E V + +K K+ + +
Sbjct: 106 DDLIQLGGHSVPRAHRPT----GGAASGAEIVQKLYKKAKKEGIDTR 148
>gnl|CDD|180802 PRK07035, PRK07035, short chain dehydrogenase; Provisional.
Length = 252
Score = 29.2 bits (66), Expect = 5.0
Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 1/37 (2%)
Query: 468 GEMMQLEDALSQIRKEYEMLRIEFETNLAANEQTGPI 504
GEM Q++ + IR+ + L I N AAN G I
Sbjct: 67 GEMEQIDALFAHIRERHGRLDI-LVNNAAANPYFGHI 102
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 29.7 bits (67), Expect = 5.3
Identities = 35/173 (20%), Positives = 64/173 (36%), Gaps = 26/173 (15%)
Query: 249 ISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTED 308
+ LQ+ ++ E+ E+ ++I L ++ +V + E E+ K+
Sbjct: 272 LEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKI-------- 323
Query: 309 GKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKD-VEKL 367
K++ V + +NV++ LQ + + E RL EL + HR L D L
Sbjct: 324 -KKRTVELLPDAENNVAK-----LQALVVASSE----RLLELAQQWEAHRTPLIDEYRSL 373
Query: 368 KMDWEVTFNNEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAY 420
K + L +L +I +L + + YK L Y
Sbjct: 374 KEKNRNKEDETQRQL-------DEIKKLRNKIEELESELQTKEQLYKQLLDEY 419
>gnl|CDD|239651 cd03679, MM_CoA_mutase_alpha_like, Coenzyme
B12-dependent-methylmalonyl coenzyme A (CoA) mutase
(MCM) family, Alpha subunit-like subfamily; contains
proteins similar to the alpha subunit of
Propionbacterium shermanni MCM, as well as human and E.
coli MCM. Members of this subfamily contain an
N-terminal MCM domain and a C-terminal coenzyme B12
binding domain. MCM catalyzes the isomerization of
methylmalonyl-CoA to succinyl-CoA. The reaction proceeds
via radical intermediates beginning with a
substrate-induced homolytic cleavage of the Co-C bond of
coenzyme B12 to produce cob(II)alamin and the
deoxyadenosyl radical. MCM plays an important role in
the conversion of propionyl-CoA to succinyl-CoA during
the degradation of propionate for the Krebs cycle. In
higher animals, MCM is involved in the breakdown of
odd-chain fatty acids, several amino acids, and
cholesterol. Methylobacterium extorquens MCM
participates in the glyoxylate regeneration pathway. In
M. extorquens, MCM forms a complex with MeaB; MeaB may
protect MCM from irreversible inactivation. In some
bacteria, MCM is involved in the reverse metabolic
reaction, the rearrangement of succinyl-CoA to
methylmalonyl-CoA. Examples include P. shermanni MCM
during propionic acid fermentation, E.coli MCM in a
pathway for the conversion of succinate to propionate
and Streptomyces MCM in polyketide biosynthesis.
Sinorhizobium meliloti strain SU47 MCM plays a role in
the polyhydroxyalkanoate degradation pathway. P.
shermanni and Streptomyces cinnamonensis MCMs are
alpha/beta heterodimers. It has been shown for P.
shermanni MCM that only the alpha subunit binds coenzyme
B12 and substrates. Human MCM is a homodimer with two
active sites. Mouse and E.coli MCMs are also homodimers.
In humans, impaired activity of MCM results in
methylmalonic aciduria, a disorder of propionic acid
metabolism.
Length = 536
Score = 29.6 bits (67), Expect = 5.5
Identities = 25/89 (28%), Positives = 38/89 (42%), Gaps = 17/89 (19%)
Query: 67 RIEQLE-KRQTQDDA---VLNVVNRYWNQLNEDIRIL-----------LQRFDAETADES 111
RIE+ +RQ + D+ V+ VN+Y E + +L + R A+
Sbjct: 429 RIEEAAARRQARIDSGREVIVGVNKYRLDHEEPLDVLKIDNTAVRAEQIARLKKLRAERD 488
Query: 112 ESKSQAALRALDVCAPPAYGNTNVMCLAI 140
QAAL AL A G N++ LA+
Sbjct: 489 PEAVQAALDALTEAA--ETGEGNLLALAV 515
>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein. This family
consists of proteins over 500 amino acids long in
Caenorhabditis elegans and several bacteria (Pseudomonas
aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
etc.). The function is unknown.
Length = 525
Score = 29.7 bits (67), Expect = 5.5
Identities = 22/118 (18%), Positives = 44/118 (37%), Gaps = 13/118 (11%)
Query: 261 TEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEET 320
AELR QI +L+ EL ++ R D L+ L + E + + + + + E
Sbjct: 71 ERLAELRKQIRELEAELRDLEDRGDALKA-LAKFLEDI----REGLTEPIKDSAKRNEPD 125
Query: 321 MSNVSQKKLDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDWEVTFNNE 378
+ K+ E +E ++ + RE K + +L+ + +
Sbjct: 126 L-----KEWFQAFDFNGSEIERLLTEDREAER---RIRELEKQLSELQNELNALLTGK 175
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 29.1 bits (66), Expect = 6.1
Identities = 11/38 (28%), Positives = 21/38 (55%)
Query: 40 IKVLKFQNKKLAQRIEQKNQVEHELRSRIEQLEKRQTQ 77
I+ L+ N +L + + + Q L+ +IEQ+EK +
Sbjct: 66 IENLRVYNDQLQRLVANQQQEIASLQQQIEQIEKTRQG 103
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 29.5 bits (66), Expect = 6.1
Identities = 57/291 (19%), Positives = 103/291 (35%), Gaps = 38/291 (13%)
Query: 209 PIPSLDEAVREANIEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRN 268
P + R + + R + Q K K +I+ K + E +L N
Sbjct: 20 RAPFYLPSRRRLAVSCRARRRGFSSQQKKKRGK--NIAPKQRSSNSKLQSNTDENGQLEN 77
Query: 269 QIDDLQYELSKVQSRNDKLEN----HLLEATEKVKLCQQMHTEDGKEKDVIKPEE--TMS 322
EL + + +D N EA + QQ +++DG++ + E+ M
Sbjct: 78 TSLRTVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMI 137
Query: 323 NVSQKK-----------LDDLQKDLEEAKELANNRLQELDKLHLQHREALKDVEKLKMDW 371
++K L+DL+K L E + L +++ L ++ E ++K+
Sbjct: 138 QNAEKNILLLNQARLQALEDLEKILTEKEALQ----GKINILEMRLSET---DARIKLAA 190
Query: 372 EVTFNNEISWLVWSVNPTSLTTQLPRQIRQLPESVIVETTEYKCLQSAYSVLYNENQLAK 431
+ + EI L QL + +L E L VL EN L K
Sbjct: 191 QEKIHVEI-----------LEEQLEKLRNELLIRGATEGLCVHSLSKELDVLKEENMLLK 239
Query: 432 TQAQEFQSHLSASKNNQQRQIEMMETEELMVQKKLRGEMMQLEDALSQIRK 482
Q ++ L ++R + +E E ++ LR + A + K
Sbjct: 240 DDIQFLKAELIEVAETEER-VFKLEKERSLLDASLRELESKFIVAQEDVSK 289
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 28.7 bits (65), Expect = 6.1
Identities = 18/99 (18%), Positives = 51/99 (51%), Gaps = 18/99 (18%)
Query: 254 DAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKD 313
A+N +T +L+ ++++L+ ++++Q++ +KL+ E E+ +L +++
Sbjct: 62 QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQL------ 115
Query: 314 VIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDK 352
+K+L L+ +LE+ ++ R+++L +
Sbjct: 116 ------------EKELKKLKAELEKYEKNDPERIEKLKE 142
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 29.4 bits (66), Expect = 6.5
Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
Query: 309 GKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKELANNRLQELDKLH---LQHREALKDVE 365
+ K + NV +KL L+K+LE+ E L+E +KL Q E LK+ E
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERE 559
Query: 366 K 366
+
Sbjct: 560 R 560
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 29.4 bits (66), Expect = 6.7
Identities = 61/275 (22%), Positives = 109/275 (39%), Gaps = 55/275 (20%)
Query: 249 ISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTED 308
+ E Q+ + EL Q+ LQ E + +Q N+KL+ +++ +
Sbjct: 181 LKEAQEEKDELAQRCHELDKQVLLLQEEKNSLQQENEKLQ-------------ERLAQLE 227
Query: 309 GKEKDVIKPEETMSNVSQKKLDDLQKD---LEEAKELANNRLQELDK--LHLQHR----- 358
G + N+ Q +L+ LQ++ LE A++ + +EL+K LQHR
Sbjct: 228 GSSLGPNQLGSKKYNLLQSQLEQLQEENFRLEAARDDYRIKCEELEKELAELQHRNDELT 287
Query: 359 ------EALKDVEKLKMDWEVTFNNEISWLVWSVNPTSLTTQ----LPRQIRQLPES--- 405
+ALKD ++D +++ L V + L RQ++ L E
Sbjct: 288 SLAAESQALKD----EIDVLRESSDKAKKLEAQVETYKKKLEDLNDLRRQVKLLEERNAM 343
Query: 406 VIVETTEYKCLQSAYSVLYNENQLAKTQAQEFQSHLSASKNNQQRQIEMMETEELMVQKK 465
+ T + + + + + K Q QE + LS E + + K
Sbjct: 344 YMQNTVQLEEELKKANAARGQLETYKRQVQELHAKLSE---------ESKKAD------K 388
Query: 466 LRGEMMQLEDALSQIRKEYEMLRIEFETNLAANEQ 500
L E +LE+ L ++KE E L E ++ NE+
Sbjct: 389 LEFEYKRLEEKLEALQKEKERLLAERDSLRETNEE 423
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 29.2 bits (65), Expect = 6.9
Identities = 32/131 (24%), Positives = 55/131 (41%), Gaps = 3/131 (2%)
Query: 223 EIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQS 282
E E +L ++ + A S+ ++ E + + E LRN ++ L+ EL + +
Sbjct: 236 EASAELESLQKE-ISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEKK 294
Query: 283 RNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQKDLEEAKEL 342
++L EA E + EK+ K E M + KL + EEA++
Sbjct: 295 ELEELREKEGEAEEAASSLEAELNRTKSEKES-KAREKMVEIP-LKLQQASSEAEEARKE 352
Query: 343 ANNRLQELDKL 353
A +EL KL
Sbjct: 353 AEAAREELRKL 363
>gnl|CDD|178266 PLN02660, PLN02660, pantoate--beta-alanine ligase.
Length = 284
Score = 28.9 bits (65), Expect = 7.1
Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 4/57 (7%)
Query: 215 EAVREAN-IEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQI 270
VREA+ + + + N L+ + + K SIS ++ ++ V ET+A EL+ Q+
Sbjct: 180 PIVREADGLAMSSRNVRLSAEERE---KALSISRSLARAEELVEEGETDADELKEQV 233
>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase. BRE1 is an E3
ubiquitin ligase that has been shown to act as a
transcriptional activator through direct activator
interactions.
Length = 96
Score = 27.3 bits (61), Expect = 7.4
Identities = 16/37 (43%), Positives = 24/37 (64%)
Query: 37 EMDIKVLKFQNKKLAQRIEQKNQVEHELRSRIEQLEK 73
+ + K L Q K ++ IEQ +E+ELR RI++LEK
Sbjct: 58 DAENKKLNTQLNKSSELIEQLKDLENELRRRIKELEK 94
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 29.2 bits (66), Expect = 7.4
Identities = 26/122 (21%), Positives = 41/122 (33%), Gaps = 21/122 (17%)
Query: 183 AFDRLVQRNEKITMILKGELNNSNGDP-------IPSLDEAVREANIEIQTENRNLTEQN 235
A D + + IT+ L++ + +LD+ RE E RN E
Sbjct: 464 AKDLGTGKEQSITIKASSGLSDEEIERMVEDAEANAALDKKFRE-----LVEARNEAESL 518
Query: 236 TKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYELSKVQSRNDKLENHLLEAT 295
K L++ V E E ++ I DL+ L + L E T
Sbjct: 519 IYSLEKA---------LKEIVKVSEEEKEKIEEAITDLEEALEGEKEEIKAKIEELQEVT 569
Query: 296 EK 297
+K
Sbjct: 570 QK 571
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 26.8 bits (60), Expect = 8.3
Identities = 10/24 (41%), Positives = 18/24 (75%)
Query: 265 ELRNQIDDLQYELSKVQSRNDKLE 288
+L +I LQ EL+K+++ N++LE
Sbjct: 21 QLNQEIAALQAELAKLKAENEELE 44
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 28.9 bits (65), Expect = 8.5
Identities = 24/91 (26%), Positives = 41/91 (45%), Gaps = 7/91 (7%)
Query: 464 KKLRGEMMQLEDALSQIRKEYEMLRIEFETNLAANEQTGPINREMRHL---ITSLQNHNK 520
++L E +L+ L ++++E E L E E +RE+R I L+ +
Sbjct: 432 ERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELE 491
Query: 521 QIKGEVNRYKRKYKETSAEIQKVRISERSPW 551
+ K V +RK AE++K+R E S
Sbjct: 492 EKKKRVEELERK----LAELRKMRKLELSGK 518
>gnl|CDD|234096 TIGR03031, cas_csx12, CRISPR-associated protein Cas9/Csx12, subtype
II-B/NMENI. Members of this family of CRISPR-associated
(cas) protein are found, so far, in CRISPR/cas loci in
Wolinella succinogenes DSM 1740, Legionella pneumophila
str. Paris, and Francisella tularensis, where the last
probably is an example of a degenerate CRISPR locus,
having neither repeats nor a functional Cas1. The
characteristic repeat length is 37 base pairs and period
is about 72. One region of this large protein shows
sequence similarity to pfam01844, HNH endonuclease
[Mobile and extrachromosomal element functions, Other].
Length = 802
Score = 29.2 bits (65), Expect = 8.6
Identities = 17/88 (19%), Positives = 37/88 (42%), Gaps = 11/88 (12%)
Query: 266 LRNQIDDLQYELSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVS 325
L + L YE++ + K H+ +A + + E++ + EE++ +
Sbjct: 676 LARMMQRLAYEIAMAKWEQIK---HIPDA-------KSLLIPIILEQNAFEFEESLKKIK 725
Query: 326 QKKLDDLQ-KDLEEAKELANNRLQELDK 352
DD + K LE+A E + ++ +
Sbjct: 726 GSSSDDRRKKTLEQAIEKQEIQWKDKFQ 753
>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1. This model
describes peptide chain release factor 1 (PrfA, RF-1),
and excludes the related peptide chain release factor 2
(PrfB, RF-2). RF-1 helps recognize and terminate
translation at UAA and UAG stop codons. The
mitochondrial release factors are prfA-like, although
not included above the trusted cutoff for this model.
RF-1 does not have a translational frameshift [Protein
synthesis, Translation factors].
Length = 360
Score = 28.5 bits (64), Expect = 9.3
Identities = 17/94 (18%), Positives = 40/94 (42%), Gaps = 16/94 (17%)
Query: 277 LSKVQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQK--KLDDLQK 334
L + R ++LE L + ++ ++ K + + K + + + Q+
Sbjct: 9 LESLLERYEELEALLSDP--------EVISDQDKLRKLSKEYSQLEEIVDCYREYQQAQE 60
Query: 335 DLEEAKELANNRLQELDKLHLQHREALKDVEKLK 368
D++EAKE+ E+ A +++E+L+
Sbjct: 61 DIKEAKEILEESDPEM------REMAKEELEELE 88
>gnl|CDD|185654 PTZ00475, PTZ00475, RESA-like protein; Provisional.
Length = 282
Score = 28.4 bits (63), Expect = 9.6
Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 2/114 (1%)
Query: 222 IEIQTENRNLTEQNTKLHAKYHSISLKISELQDAVNGKETEAAELRNQIDDLQYE--LSK 279
I I T EQN + +S L D +N + + ID+L YE L+
Sbjct: 158 ISIITNYYPFKEQNNPFIKQAQYVSSSNYVLDDIINNIDYSIDNIHRAIDNLYYEHILNL 217
Query: 280 VQSRNDKLENHLLEATEKVKLCQQMHTEDGKEKDVIKPEETMSNVSQKKLDDLQ 333
++ +++ +L K+ LC T + V++ E +N+ K+ LQ
Sbjct: 218 LEEEKNEILEEILRNILKIILCDVETTVRRSAQKVLQNAEGDTNLMLKRAKGLQ 271
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 28.9 bits (65), Expect = 9.7
Identities = 14/90 (15%), Positives = 36/90 (40%), Gaps = 17/90 (18%)
Query: 29 IGPISTLEEMDIKVLKFQNKKLAQRIEQKNQVEH-----------------ELRSRIEQL 71
+ + +E++ K+ K ++K I K+ +L + I+++
Sbjct: 27 VNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEV 86
Query: 72 EKRQTQDDAVLNVVNRYWNQLNEDIRILLQ 101
E+ + +N + + N L+E+ L +
Sbjct: 87 EENLESLEKEINELEEWLNVLDEEKSFLDE 116
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.127 0.347
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,476,463
Number of extensions: 2752733
Number of successful extensions: 4844
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4279
Number of HSP's successfully gapped: 517
Length of query: 584
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 482
Effective length of database: 6,413,494
Effective search space: 3091304108
Effective search space used: 3091304108
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 62 (27.7 bits)