RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy588
(903 letters)
>gnl|CDD|215951 pfam00498, FHA, FHA domain. The FHA (Forkhead-associated) domain
is a phosphopeptide binding motif.
Length = 67
Score = 71.9 bits (177), Expect = 1e-15
Identities = 34/79 (43%), Positives = 41/79 (51%), Gaps = 14/79 (17%)
Query: 33 VKIGRCIARAKVAPSNGI-FDCKVLSRNHALIWYYNG-KFYLQDTKSSNGTFVNNQRLSK 90
V IGR +P I D +SR HA I Y G +FYL+D S+NGTFVN QRL
Sbjct: 1 VTIGR-------SPDCDIVLDDPSVSRRHAEIRYDGGGRFYLEDLGSTNGTFVNGQRLGP 53
Query: 91 TNEESAPHELCSGDVVQFG 109
P L GDV++ G
Sbjct: 54 E-----PVRLRDGDVIRLG 67
>gnl|CDD|238017 cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic
and prokaryotic proteins. Putative nuclear signalling
domain. FHA domains may bind phosphothreonine,
phosphoserine and sometimes phosphotyrosine. In
eukaryotes, many FHA domain-containing proteins localize
to the nucleus, where they participate in establishing
or maintaining cell cycle checkpoints, DNA repair, or
transcriptional regulation. Members of the FHA family
include: Dun1, Rad53, Cds1, Mek1,
KAPP(kinase-associated protein phosphatase),and Ki-67 (a
human nuclear protein related to cell proliferation).
Length = 102
Score = 66.6 bits (163), Expect = 2e-13
Identities = 30/82 (36%), Positives = 38/82 (46%), Gaps = 10/82 (12%)
Query: 33 VKIGRCIARAKVAPSNGIFDCKVLSRNHALIWYYN-GKFYLQDTKSSNGTFVNNQRLSKT 91
IGR + + D +SR HA+I Y G L D S+NGTFVN QR+S
Sbjct: 24 YTIGRDSDNC-----DIVLDDPSVSRRHAVIRYDGDGGVVLIDLGSTNGTFVNGQRVSPG 78
Query: 92 NEESAPHELCSGDVVQFGVDVI 113
P L GDV++ G I
Sbjct: 79 E----PVRLRDGDVIRLGNTSI 96
>gnl|CDD|214578 smart00240, FHA, Forkhead associated domain. Found in eukaryotic
and prokaryotic proteins. Putative nuclear signalling
domain.
Length = 52
Score = 51.8 bits (125), Expect = 9e-09
Identities = 24/57 (42%), Positives = 31/57 (54%), Gaps = 6/57 (10%)
Query: 33 VKIGRCIARAKVAPSNGIFDCKVLSRNHALIWY-YNGKFYLQDTKSSNGTFVNNQRL 88
V IGR + D +SR HA+I Y G+FYL D S+NGTFVN +R+
Sbjct: 1 VTIGRSSEDCDI-----QLDGPSISRRHAVIVYDGGGRFYLIDLGSTNGTFVNGKRI 52
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 58.6 bits (142), Expect = 1e-08
Identities = 107/639 (16%), Positives = 256/639 (40%), Gaps = 66/639 (10%)
Query: 151 SMPLEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRV- 209
++ L +L +L ++E + + L+ +L +LQ + + +K + L + L +
Sbjct: 228 ALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQ 287
Query: 210 -EFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTE 268
E LE + LR+ ++ L + E +L+E+ +K+ A K E+ ++ E
Sbjct: 288 EELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLE 347
Query: 269 RKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLE 328
+ +EL ++ LEE+ + ++ + E E+ + ++ N +E +E
Sbjct: 348 ELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE 407
Query: 329 KLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNI-- 386
L+ + ++L+ + + +L+ + E++ ++ +L + +
Sbjct: 408 SLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELER 467
Query: 387 -LVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLRE---------QLHTA 436
L + +++ ++ + R + E+ S + +L
Sbjct: 468 ELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVYGPVAELIKV 527
Query: 437 QEEYT---------SLQEIVNRNEEQ----LAFLEQSK-----YLHLDKIGGLELQLEKL 478
+E+Y LQ +V NEE + FL+++K +L LD+I L
Sbjct: 528 KEKYETALEAALGNRLQAVVVENEEVAKKAIEFLKENKAGRATFLPLDRIKPLRSLKSDA 587
Query: 479 KQTYNSDCYENEKFGKAY-------NCDTLLIN----EEKTVAPNQKDVNIIDVSNQY-- 525
+ + F Y DTL+++ + + I+ +
Sbjct: 588 APGFLGLASDLIDFDPKYEPAVRFVLGDTLVVDDLEQARRLARKLRIKYRIVTLDGDLVE 647
Query: 526 ---------NSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINL 576
+++ ++ +E++E ++ ++ A++ K EL KN L
Sbjct: 648 PSGSITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKN----------EL 697
Query: 577 QEQFSLLEKSKETWVSLKLNSAESQS-SASLQSQLNDALDNIKKYKEEHSKMTMEIESLK 635
+ LLE+ + L+ E + A+L+ +L +++ +EE ++ E+E L+
Sbjct: 698 RSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQ 757
Query: 636 EKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEE-EITQQ 694
E+L+ + + L E KLK I++ + ++ E + +++ +++E+ ++ E +
Sbjct: 758 ERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELE 817
Query: 695 SLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQS 733
SL L++ I + E + E E++ ++
Sbjct: 818 SLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEK 856
Score = 57.0 bits (138), Expect = 4e-08
Identities = 68/401 (16%), Positives = 165/401 (41%), Gaps = 58/401 (14%)
Query: 269 RKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLE 328
R + + E L+E L +LE++ ++ + + + E +++L +E LE
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLE 719
Query: 329 KLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILV 388
+L+ E +++L+ + + ++L+ ++E++ ++ EL + L
Sbjct: 720 ELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELE-SLEEALAKLK 778
Query: 389 KYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVN 448
+ +++ + E +E++E+ ++ +L + E SL++
Sbjct: 779 EEIEELEEKRQALQE--------------ELEELEEELEEAERRLDALERELESLEQRRE 824
Query: 449 RNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKT 508
R E+++ LE+ +K+ LE +LE+L++ L E+
Sbjct: 825 RLEQEIEELEEEIEELEEKLDELEEELEELEKELEE----------------LKEELEEL 868
Query: 509 VAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVK 568
E + + E++E + +++ E+ + +EL + K +
Sbjct: 869 -------------------EAEKEELEDELKELEEEKEELEEELRELESELAELKE---E 906
Query: 569 NEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMT 628
EK L+E + LE+ + + E + +L+++L ++ +++ E +
Sbjct: 907 IEKLRERLEELEAKLERLEVE-LPELEEELEEEYEDTLETELEREIERLEEEIEALGPVN 965
Query: 629 M----EIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNV 665
+ E E ++E+ + +S+ DL E EKL VI++
Sbjct: 966 LRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKE 1006
Score = 50.1 bits (120), Expect = 5e-06
Identities = 52/326 (15%), Positives = 131/326 (40%), Gaps = 29/326 (8%)
Query: 417 NSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLE 476
S ++ K+L E+L + + L+E + + +L LE ++ LE QLE
Sbjct: 660 RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLE 719
Query: 477 KLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINK 536
+LK+ + E E+ Q + ++ + E+ + +
Sbjct: 720 ELKRELAALEEELEQL--------------------QSRLEELEEELEELEEEL-EELQE 758
Query: 537 EIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLN 596
+EE +++ + + K E+ + + ++ + L+E+ E+ + +
Sbjct: 759 RLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELES 818
Query: 597 SAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLK 656
Q L+ ++ + + I++ +E+ ++ E+E L+++L+ + + +L E E+L+
Sbjct: 819 --LEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELE 876
Query: 657 LVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTS 716
+K+ + + E + + + + ++ IE+ + E E +E
Sbjct: 877 DELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELE---- 932
Query: 717 NMENDKSNEDDNSSVQSTDEVKDTEE 742
E + +D + E++ EE
Sbjct: 933 --EELEEEYEDTLETELEREIERLEE 956
Score = 50.1 bits (120), Expect = 5e-06
Identities = 67/409 (16%), Positives = 161/409 (39%), Gaps = 81/409 (19%)
Query: 275 ENECSNLQELLDRNRDELEEI---IKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQ 331
E + +E L+R D LEE+ ++K + Q K + + QE L +L+
Sbjct: 178 ERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERY-----------QELKAELRELE 226
Query: 332 YETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYF 391
L A+ + +++++ ++ EL
Sbjct: 227 LALLLAKLK-----------------------ELRKELEELEEELS-------------- 249
Query: 392 DDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNE 451
++ + +E E+E IE+++ ++LRE+L QEE L+E + E
Sbjct: 250 ----RLEEELEELQEELEEAE----KEIEELKSELEELREELEELQEELLELKEEIEELE 301
Query: 452 EQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLL------INE 505
+++ L + +++ LE +LE+LK+ + E+ + L + E
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEAL---KEELEERETLLEELEQLLAELEE 358
Query: 506 EKTVAPNQKDVNIIDVSNQY-NSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKN 564
K + + ++ + ++ + E+ E +++++ EI L
Sbjct: 359 AKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSE 418
Query: 565 CLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEH 624
L ++ + L+ + L+ E ++ L+ QL + D +K+ + E
Sbjct: 419 RLEDLKEELKELEAELEELQTELEE---------LNEELEELEEQLEELRDRLKELEREL 469
Query: 625 SKMTMEIESLKEKLKFSQSKENDL---LEEYEKLKLVIKDYQNVLPSIV 670
+++ E++ L+++L +++ + L + ++ V++ ++ LP +
Sbjct: 470 AELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVY 518
Score = 47.8 bits (114), Expect = 3e-05
Identities = 46/281 (16%), Positives = 120/281 (42%), Gaps = 26/281 (9%)
Query: 420 EDIEKVTKDL-REQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKL 478
++ ++ L +L ++E L+E ++R EE+L L++ +I L+ +LE+L
Sbjct: 220 AELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEEL 279
Query: 479 KQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQY-NSEKDFVSINKE 537
++ E + + + + I + + E + + +
Sbjct: 280 REELEELQEELLELKEEIE---------------ELEGEISLLRERLEELENELEELEER 324
Query: 538 IEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSL---- 593
+EE KI+ + E+ + L + + L + E+ L+E+ S L + E
Sbjct: 325 LEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREE 384
Query: 594 --KLNSAESQSSA---SLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDL 648
+L + ++ L+ ++ + +++ E + E++ L+ +L+ Q++ +L
Sbjct: 385 LAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEEL 444
Query: 649 LEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEE 689
EE E+L+ +++ ++ L + + +++ Q E+ +
Sbjct: 445 NEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSS 485
Score = 47.0 bits (112), Expect = 4e-05
Identities = 64/318 (20%), Positives = 148/318 (46%), Gaps = 26/318 (8%)
Query: 154 LEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLE 213
EEL L ++ L+ +L +L+ + +LK+ + ++L SR+E LE
Sbjct: 687 EEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELA---ALEEELEQLQSRLEELE 743
Query: 214 SQLAAYSKSFTD-----DKLRDEIKLLHRDKSTYENQLKEV--MKKMLAEKV-EVMQKCQ 265
+L + + ++L +E++ L + + +++E+ ++ L E++ E+ ++ +
Sbjct: 744 EELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803
Query: 266 DTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSN 325
+ ER+L +E E +L++ +R E+EE+ ++ ++ K E E +++L +E
Sbjct: 804 EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKE 863
Query: 326 TLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNEL-GLETKVDQ 384
LE+L+ E + + +L + + ++L+ +KE++ ++ L LE K+++
Sbjct: 864 ELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLER 923
Query: 385 ------NILVKYFDDVHQILTQFDEYFLRFNESESSGL-----NSIEDIEKVTK---DLR 430
+ + ++ L E + E E L +IE+ E+V + +L+
Sbjct: 924 LEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELK 983
Query: 431 EQLHTAQEEYTSLQEIVN 448
Q +E L E++
Sbjct: 984 SQREDLEEAKEKLLEVIE 1001
Score = 47.0 bits (112), Expect = 5e-05
Identities = 63/405 (15%), Positives = 164/405 (40%), Gaps = 60/405 (14%)
Query: 226 DKLRDEIKLLHRDKSTYENQLKEVMK-KMLAEKVEVMQKCQDTERKLVNIENECSNLQEL 284
++L D ++ L + E Q ++ + + L ++ ++ KL + E L+E
Sbjct: 189 ERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELEL-ALLLAKLKELRKELEELEEE 247
Query: 285 LDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAF 344
L R +ELEE+ ++ + E E I++L + +E LE+LQ E L+ +++++
Sbjct: 248 LSRLEEELEELQEELE-------EAEKEIEELKSELEELREELEELQEELLELKEEIEEL 300
Query: 345 TLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEY 404
+ +L+ ++L+ ++E++ +K ++ + + L + + ++ E
Sbjct: 301 EGEISLLRERLEELENELEELEERLEELKEKI----EALKEELEERETLLEELEQLLAEL 356
Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLH 464
E E +E++E++ + LRE+L + E ++ + + ++ LE+
Sbjct: 357 EEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERL 416
Query: 465 LDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQ 524
+++ L+ +L++L+ EE + + +
Sbjct: 417 SERLEDLKEELKELEAEL----------------------EELQTELEELNEEL------ 448
Query: 525 YNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLE 584
+ +++EE ++ ++ E+ + EL + L E + L+ + +
Sbjct: 449 -------EELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQ 501
Query: 585 KSKETWVSLK------------LNSAESQSSASLQSQLNDALDNI 617
+ +L+ L + + +L++ L + L +
Sbjct: 502 GVRAVLEALESGLPGVYGPVAELIKVKEKYETALEAALGNRLQAV 546
Score = 40.5 bits (95), Expect = 0.005
Identities = 46/307 (14%), Positives = 117/307 (38%), Gaps = 27/307 (8%)
Query: 419 IEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSK-----YLHLD------- 466
+ ++ ++ +L +E L++++ E+QL LE+ Y L
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELE 226
Query: 467 ------KIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIID 520
K+ L +LE+L++ + E E+ + I E K+ ++
Sbjct: 227 LALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEEL 286
Query: 521 VSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQF 580
+++ + EI +++++ E+ + L + K + ++ + +
Sbjct: 287 QEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL 346
Query: 581 SLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDN-IKKYKEEHSKMTMEIESLKEKLK 639
LE+ + S L + L+ + +EE +++ E+ ++ +L+
Sbjct: 347 EELEQLL------AELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELE 400
Query: 640 FSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPL 699
+ + L E E+L ++D + L + + + + ++ + +EE ++ L L
Sbjct: 401 ELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE--LEEQLEEL 458
Query: 700 SSELDET 706
L E
Sbjct: 459 RDRLKEL 465
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 57.8 bits (140), Expect = 2e-08
Identities = 86/519 (16%), Positives = 205/519 (39%), Gaps = 48/519 (9%)
Query: 153 PLEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDR-LLSRVEF 211
L EL +L ++E ++R L+ ++ +L+ + L+ + + L+++ + L R+E
Sbjct: 275 ELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLE-ELEELLEKLKSLEERLEK 333
Query: 212 LESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKL 271
LE +L KL E++ L +K+ L+E +K+ E +L
Sbjct: 334 LEEKLE---------KLESELEELAEEKNELAKLLEERLKE--------------LEERL 370
Query: 272 VNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQ 331
+E E E L + + ++E+ ++ + +E + +++L +E LE+L+
Sbjct: 371 EELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELE 430
Query: 332 YETLQAQKQLDAFTLKDNLMH----DKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNIL 387
E + ++Q++ K+ ++ K + +E + LE + +
Sbjct: 431 EEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEE-E 489
Query: 388 VKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIV 447
+ + ++ + +E E E + +E E + ++L E+L + L+E+
Sbjct: 490 LSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELK 549
Query: 448 NRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEK 507
+ + Q L ++ +L++L + E+ + L + K
Sbjct: 550 EKLQ------LQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLK 603
Query: 508 TVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITK--CTNELVDAKNC 565
+ + +Q + +++ E +++ +E+ K EL +
Sbjct: 604 E---------LEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQA 654
Query: 566 LVKN-EKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEH 624
++ E+ V L+ + + E L+ E + QL + L+ + K E
Sbjct: 655 ALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEI 714
Query: 625 SKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQ 663
++ E+ES K +L+ + + L + E L+ + +
Sbjct: 715 EQLIEELESRKAELEELKKELEKLEKALELLEELREKLG 753
Score = 55.5 bits (134), Expect = 1e-07
Identities = 84/503 (16%), Positives = 198/503 (39%), Gaps = 19/503 (3%)
Query: 225 DDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQEL 284
+ +L + ++ + E +LKE+ K ++ + ++ + L E +E
Sbjct: 191 EGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKER 250
Query: 285 LDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAF 344
L+ + L EI + + +E +++LL +EK LE+L+ E + +++L+
Sbjct: 251 LEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGL 310
Query: 345 TLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEY 404
+ + L+ + EK+ +L E + + + + L + +E
Sbjct: 311 RALLEELEELLEKLK-SLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEER 369
Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLH 464
+ L ++ +E+ ++L+E+L +QE + E++L LE+
Sbjct: 370 LEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEEL 429
Query: 465 LDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQ 524
++I LE Q+ +L+ K C L E + ++ + ++ +
Sbjct: 430 EEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEE 489
Query: 525 YNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLE 584
+ EK+ + +EIEE +++ ++ E+ + K L + + + NL E+ L+
Sbjct: 490 LSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELK 549
Query: 585 KSKETWVSLKLNSAESQSSASLQSQLNDALDN---IKKYKEEHSKMTMEIESLKEKLKFS 641
+ + E + +L + L+ ++ KEE ++ ++ LK+KLK
Sbjct: 550 EKLQLQQ----LKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKEL 605
Query: 642 QSKENDL---------LEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIE--EE 690
+ + + L E +L+ ++ ++ L + + E+ E EE
Sbjct: 606 EERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEE 665
Query: 691 ITQQSLNPLSSELDETIHKNVES 713
+ + L +E +
Sbjct: 666 LEAEIRRELQRIENEEQLEEKLE 688
Score = 55.2 bits (133), Expect = 1e-07
Identities = 77/511 (15%), Positives = 200/511 (39%), Gaps = 38/511 (7%)
Query: 168 LQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTD-D 226
L+ E++ L + + +L++ + + + + + L +E LE +L + +
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE 318
Query: 227 KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLD 286
+L +++K L E +L+++ ++ E + + E +L +E L++ L+
Sbjct: 319 ELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELE 378
Query: 287 RNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTL 346
+ + L+++ + Q+ + E + ++++ +E LE+L+ E + ++++
Sbjct: 379 KALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEE 438
Query: 347 KDNLMHDKLKNYYM-----------NFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVH 395
+ N + K + +E + LE + + + +
Sbjct: 439 QINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEE-ELSREKEEA 497
Query: 396 QILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLA 455
++ + +E E E + +E E + ++L E+L + L+E+ + + Q
Sbjct: 498 ELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQ-- 555
Query: 456 FLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKD 515
L ++ +L++L + E+ + L + K
Sbjct: 556 ----QLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKE------- 604
Query: 516 VNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITK--CTNELVDAKNCLVKN-EKW 572
+ + +Q + +++ E +++ +E+ K EL + ++ E+
Sbjct: 605 --LEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEK 662
Query: 573 VINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIE 632
V L+ + + E L+ E + QL + L+ + K E IE
Sbjct: 663 VEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGE-------IE 715
Query: 633 SLKEKLKFSQSKENDLLEEYEKLKLVIKDYQ 663
L E+L+ +++ +L +E EKL+ ++ +
Sbjct: 716 QLIEELESRKAELEELKKELEKLEKALELLE 746
Score = 34.4 bits (79), Expect = 0.31
Identities = 45/271 (16%), Positives = 110/271 (40%), Gaps = 33/271 (12%)
Query: 419 IEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKL 478
+E EK+++ L+E + A+ + L+ ++ E + L ++ L ++ LE E+
Sbjct: 166 LEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQ 225
Query: 479 KQTYNSDCYE--NEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSE-KDFVSIN 535
++ E E+ + L + + + ++ E ++ +
Sbjct: 226 EEEELEQEIEALEERLAELEEEKERLEELKARLLEIES--LELEALKIREEELRELERLL 283
Query: 536 KEIEESSRKIDDINAEITKCTNELVDAKNCLVKNE--------------------KWVIN 575
+E+EE +++++ EI + EL + L + E + + +
Sbjct: 284 EELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLES 343
Query: 576 LQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLK 635
E+ + + + +L E + L+ +L AL+ +K+ +E I+ LK
Sbjct: 344 ELEELAEEKNELAKLLEERLKELEERLEE-LEKELEKALERLKQLEE-------AIQELK 395
Query: 636 EKLKFSQSKENDLLEEYEKLKLVIKDYQNVL 666
E+L + ++ EE E+L+ +++ + L
Sbjct: 396 EELAELSAALEEIQEELEELEKELEELEREL 426
Score = 33.6 bits (77), Expect = 0.67
Identities = 41/195 (21%), Positives = 79/195 (40%), Gaps = 23/195 (11%)
Query: 138 KEAKASPTSMLISSMPLEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWK 197
+E K + + EEL +L ++E ++ L+ +LS+L+
Sbjct: 569 QELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE--------------- 613
Query: 198 ALIQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEK 257
L+Q L LE +++ L + + L+E+ +K+ +
Sbjct: 614 ELLQSLELSEAENELEEAEEELE------SELEKLNLQAELEELLQAALEELEEKVEELE 667
Query: 258 VEVMQKCQDTER--KLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQ 315
E+ ++ Q E +L E L+E L++ R+ELEE++KK + +E ES +
Sbjct: 668 AEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAE 727
Query: 316 LLNSNQEKSNTLEKL 330
L +E + L
Sbjct: 728 LEELKKELEKLEKAL 742
>gnl|CDD|224630 COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms].
Length = 191
Score = 54.2 bits (130), Expect = 3e-08
Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 12/91 (13%)
Query: 28 NKNAAVKIGRCIARAKVAPSNGIFDCKVLSRNHALIWYYNGKFYLQDTKSSNGTFVNNQR 87
IGR ++ + D V+SR HA + + +L+D S+NGT+VN ++
Sbjct: 86 LGEPVTTIGR------DPDNDIVLDDDVVSRRHAELRREGNEVFLEDLGSTNGTYVNGEK 139
Query: 88 LSKTNEESAPHELCSGDVVQFGVDVIETNRN 118
+ L GDV++ G + E R
Sbjct: 140 V------RQRVLLQDGDVIRLGGTLAERLRI 164
>gnl|CDD|234178 TIGR03354, VI_FHA, type VI secretion system FHA domain protein.
Members of this protein family are FHA
(forkhead-associated) domain-containing proteins that
are part of type VI secretion loci in a considerable
number of bacteria, most of which are known pathogens.
Species include Pseudomonas aeruginosa PAO1, Aeromonas
hydrophila, Yersinia pestis, Burkholderia mallei, etc
[Protein fate, Protein and peptide secretion and
trafficking, Cellular processes, Pathogenesis].
Length = 396
Score = 50.8 bits (122), Expect = 2e-06
Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 7/58 (12%)
Query: 54 KVLSRNHALIWYYNGKFYLQDTKSSNGTFVNN--QRLSKTNEESAPHELCSGDVVQFG 109
+ +S HA I Y +G + L D S+NG F+N L + N P L GD ++ G
Sbjct: 43 RHVSGRHARIRYRDGAYLLTDL-STNGVFLNGSGSPLGRGN----PVRLEQGDRLRLG 95
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 50.1 bits (120), Expect = 6e-06
Identities = 45/217 (20%), Positives = 102/217 (47%), Gaps = 6/217 (2%)
Query: 528 EKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSK 587
E+ +I +++ +++ + EI++ L + + L + K + +L E+ L K K
Sbjct: 236 ERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEK 295
Query: 588 ETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKEND 647
+ ++ S E + + +L DA + + K + E K+ EIE L+ +++ + + +
Sbjct: 296 IGELEAEIASLERSIAEKER-ELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDK 354
Query: 648 LLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETI 707
L EEY +LK ++D + L + + T+D+ +D + +E+ ++ +N L ELD
Sbjct: 355 LTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEK--LKREINELKRELDRLQ 412
Query: 708 H--KNVESVTSNMEND-KSNEDDNSSVQSTDEVKDTE 741
+ + +++ E + ++ E K E
Sbjct: 413 EELQRLSEELADLNAAIAGIEAKINELEEEKEDKALE 449
Score = 43.1 bits (102), Expect = 8e-04
Identities = 66/394 (16%), Positives = 154/394 (39%), Gaps = 84/394 (21%)
Query: 281 LQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQ 340
L R R+ LE + ++ ++ + E+ +D+L + S + +++ E Q +++
Sbjct: 672 EPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731
Query: 341 LDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQ 400
+ ++L+ + +++++ N+K+EL +++ I
Sbjct: 732 EEKLK-------ERLEELEEDLSSLEQEIENVKSEL---KELEARI-------------- 767
Query: 401 FDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQS 460
+E ++ E + LN +E + ++ Q E + L+E V+R E +L +EQ
Sbjct: 768 -EELEEDLHKLEEA-LNDLEA-----RLSHSRIPEIQAELSKLEEEVSRIEARLREIEQK 820
Query: 461 KYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIID 520
L+++ + LEK Q ID
Sbjct: 821 ----LNRLTLEKEYLEKEIQE--------------------------------LQEQRID 844
Query: 521 VSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQF 580
+ Q S + KEIE + K +++ E+ + L D ++ L +K L+ Q
Sbjct: 845 LKEQIKSIE------KEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQL 898
Query: 581 SLLEKSKETWVS----LKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKE 636
LE+ E + + +E ++ + +++ K EE + + +E ++
Sbjct: 899 RELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQA 958
Query: 637 KLKFSQSKENDL-------LEEYEKLKLVIKDYQ 663
+L+ + + L ++EYE++ + + +
Sbjct: 959 ELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELK 992
Score = 42.7 bits (101), Expect = 9e-04
Identities = 56/327 (17%), Positives = 140/327 (42%), Gaps = 23/327 (7%)
Query: 226 DKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELL 285
D+ R +++ L R++ E + +K E E++++ + ER+ IE + ++L+E L
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEEL 253
Query: 286 DRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQ-EKSNTLEKLQYETLQAQKQLDAF 344
++ +E+ E+ K+ ++ +E I L Q + +L+ E ++ +
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313
Query: 345 TLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEY 404
+ ++L + ++ ++ E+ E + ++ L + + ++ +E
Sbjct: 314 ERELEDAEERLAKLEAEIDKLLAEIEELEREIE-EERKRRDKLTEEYAELK------EEL 366
Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLH 464
E E E ++ KD RE+L + E L+ ++R +E+L L +
Sbjct: 367 EDLRAELEEVDKEFAETRDE-LKDYREKLEKLKREINELKRELDRLQEELQRLSEELADL 425
Query: 465 LDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQ 524
I G+E ++ +L++ E +K +E + D++ +
Sbjct: 426 NAAIAGIEAKINELEEEKEDKALEIKK-------------QEWKLEQLAADLSKYE-QEL 471
Query: 525 YNSEKDFVSINKEIEESSRKIDDINAE 551
Y+ ++++ + KE+ + R++ + A+
Sbjct: 472 YDLKEEYDRVEKELSKLQRELAEAEAQ 498
Score = 41.2 bits (97), Expect = 0.003
Identities = 46/281 (16%), Positives = 107/281 (38%), Gaps = 30/281 (10%)
Query: 466 DKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIID----- 520
+++ GL+ +L L+ EN + L + + + +K++ ++
Sbjct: 681 ERLEGLKRELSSLQSELRR--IENRLDELS----QELSDASRKIGEIEKEIEQLEQEEEK 734
Query: 521 -VSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVK-----NEKWVI 574
E+D S+ +EIE ++ ++ A I + +L + L + +
Sbjct: 735 LKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIP 794
Query: 575 NLQEQFSLLEKSKETWVSL------KLNSAE------SQSSASLQSQLNDALDNIKKYKE 622
+Q + S LE+ + KLN + LQ Q D + IK ++
Sbjct: 795 EIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEK 854
Query: 623 EHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQD 682
E + + E L+E+L+ ++ DL LK + + L + + + + +
Sbjct: 855 EIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914
Query: 683 SEQSIEE-EITQQSLNPLSSELDETIHKNVESVTSNMENDK 722
+ + E + ++L SE+++ ++ E + +
Sbjct: 915 KRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLED 955
Score = 38.1 bits (89), Expect = 0.023
Identities = 56/279 (20%), Positives = 119/279 (42%), Gaps = 18/279 (6%)
Query: 418 SIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSK-YLHLDKIGGLELQLE 476
+IE ++ + + R+QL + E +E R + L + + Y L + LE Q E
Sbjct: 185 NIERLDLIIDEKRQQLERLRRE----REKAERYQALLKEKREYEGYELLKEKEALERQKE 240
Query: 477 KLKQTYNSDCYENEKFGKAYNCDTLLINE-EKTVAPNQKDVNIIDVSNQYNSEKDFVSIN 535
+++ S E EK T I+E EK + ++ + ++ + E++ + +
Sbjct: 241 AIERQLASLEEELEKL-------TEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVK 293
Query: 536 KEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKL 595
++I E +I + I + EL DA+ L K E + L + LE+ E +
Sbjct: 294 EKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRD 353
Query: 596 N-SAESQSSAS----LQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLE 650
+ E L+++L + + ++E ++E LK ++ + + + L E
Sbjct: 354 KLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQE 413
Query: 651 EYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEE 689
E ++L + D + I +N +++ +D I++
Sbjct: 414 ELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKK 452
Score = 37.4 bits (87), Expect = 0.037
Identities = 51/258 (19%), Positives = 113/258 (43%), Gaps = 32/258 (12%)
Query: 228 LRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDR 287
LR+ ++ L R+ S+ +++L+ + ++ E+ Q+ D RK+ IE E L++ ++
Sbjct: 679 LRERLEGLKRELSSLQSELRRIENRLD----ELSQELSDASRKIGEIEKEIEQLEQEEEK 734
Query: 288 NRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLK 347
++ LEE+ + + + +S + +L +E L KL ++ L+ L+
Sbjct: 735 LKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKL-------EEALND--LE 785
Query: 348 DNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLR 407
L H ++ I+ +++ ++ E+ + + ++ Q L +
Sbjct: 786 ARLSHSRIPE-------IQAELSKLEEEV--------SRIEARLREIEQKLNRLT-LEKE 829
Query: 408 FNESESSGL-NSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLD 466
+ E E L D+++ K + +++ + L+E + E L LE S+ L
Sbjct: 830 YLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLE-SRLGDLK 888
Query: 467 K-IGGLELQLEKLKQTYN 483
K LE QL +L++
Sbjct: 889 KERDELEAQLRELERKIE 906
Score = 35.8 bits (83), Expect = 0.11
Identities = 45/261 (17%), Positives = 114/261 (43%), Gaps = 27/261 (10%)
Query: 200 IQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVE 259
+ L S + +E++L D+L E+ R E +++++ ++ K
Sbjct: 688 RELSSLQSELRRIENRL---------DELSQELSDASRKIGEIEKEIEQLEQEEEKLKER 738
Query: 260 VMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVT--KNKEFESTIDQLL 317
+ + +D IEN S L+EL R +ELEE + K ++ + + + S I ++
Sbjct: 739 LEELEEDLSSLEQEIENVKSELKELEAR-IEELEEDLHKLEEALNDLEARLSHSRIPEIQ 797
Query: 318 NSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELG 377
+ + +++ + +++L+ TL+ + +++ ++E+ ++K ++
Sbjct: 798 AELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQE-------LQEQRIDLKEQIK 850
Query: 378 LETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQ 437
K +N+ K ++ + +E + ES + D++K +L QL +
Sbjct: 851 SIEKEIENLNGK----KEELEEELEELEAALRDLES----RLGDLKKERDELEAQLRELE 902
Query: 438 EEYTSLQEIVNRNEEQLAFLE 458
+ L+ + + ++L+ L+
Sbjct: 903 RKIEELEAQIEKKRKRLSELK 923
Score = 31.2 bits (71), Expect = 3.4
Identities = 60/315 (19%), Positives = 116/315 (36%), Gaps = 35/315 (11%)
Query: 166 EALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTD 225
E L+RE + L + + ++ L +++ L + E +E QLA+ +
Sbjct: 201 ERLRREREKAERYQALLKEKREYE-----GYELLKEKEALERQKEAIERQLASLEEELEK 255
Query: 226 -DKLRDEI-KLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTE---RKLVNIENECSN 280
+ E+ K L + E K++ E++ V +K + E L E
Sbjct: 256 LTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKER 315
Query: 281 LQELLDRNRDELEEIIKKYQDQVTKNK----EFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
E + +LE I K ++ + + E D+L E LE L+ E +
Sbjct: 316 ELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEE 375
Query: 337 AQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQ 396
K+ +D L + K EK+ NEL E Q L + +++
Sbjct: 376 VDKEFA--ETRDELKDYREK---------LEKLKREINELKRELDRLQEELQRLSEELAD 424
Query: 397 ILTQFDEYFLRFNESESSGLNSIEDIEKVT----------KDLREQLHTAQEEYTSLQEI 446
+ + NE E + +I+K ++L+ +EEY +++
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKE 484
Query: 447 VNRNEEQLAFLEQSK 461
+++ + +LA E
Sbjct: 485 LSKLQRELAEAEAQA 499
Score = 30.8 bits (70), Expect = 3.9
Identities = 62/375 (16%), Positives = 148/375 (39%), Gaps = 43/375 (11%)
Query: 269 RKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKN-KEFESTIDQLLNSNQEKSNTL 327
RK+++ E + + E + LEE+ ++V +N + + ID EK L
Sbjct: 156 RKIID---EIAGVAEFDRKKEKALEEL-----EEVEENIERLDLIID-------EKRQQL 200
Query: 328 EKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNIL 387
E+L+ E +A++ L + ++ + I+ +L + + +
Sbjct: 201 ERLRREREKAERYQA---LLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLT 257
Query: 388 VKYFD---DVHQILTQFDEYFLRFNESESSGLNSI-EDIEKVTKDLREQLHTAQEEYTSL 443
+ + + +I +E + + + E I ++ + L + E
Sbjct: 258 EEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAE-IASLERSIAEKERE 316
Query: 444 QEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLI 503
E EE+LA LE +I L ++E+L++ + +K + Y
Sbjct: 317 LE---DAEERLAKLEA-------EIDKLLAEIEELEREIEEERKRRDKLTEEY------A 360
Query: 504 NEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAK 563
++ + + ++ +D ++ +++E+ R+I+++ E+ + EL
Sbjct: 361 ELKEELEDLRAELEEVDKEFAETRDE-LKDYREKLEKLKREINELKRELDRLQEELQRLS 419
Query: 564 NCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEE 623
L + ++ + + LE+ KE +L++ E + L + L+ + KEE
Sbjct: 420 EELADLNAAIAGIEAKINELEEEKED-KALEIKKQEWKLE-QLAADLSKYEQELYDLKEE 477
Query: 624 HSKMTMEIESLKEKL 638
+ ++ E+ L+ +L
Sbjct: 478 YDRVEKELSKLQREL 492
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc gene
is often associated with scpB (TIGR00281) and scpA genes,
where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 46.2 bits (110), Expect = 7e-05
Identities = 63/350 (18%), Positives = 142/350 (40%), Gaps = 29/350 (8%)
Query: 307 KEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIK 366
+E E I++L E L +L+ E + +++L+ + + ++ + ++
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739
Query: 367 EKMTNIK---NELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLN-SIEDI 422
++ ++ +L E + + + + + + + E E+E L IE +
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAE-----AEAEIEELEAQIEQL 794
Query: 423 EKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTY 482
++ K LRE L + E T L E E+L LE+ ++ LE Q+E+L +
Sbjct: 795 KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854
Query: 483 NSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESS 542
S E + L E N++ ++ + +++E+ E
Sbjct: 855 ES--LAAEIEELEELIEEL--ESELEALLNERASLEEALALL---RSELEELSEELRELE 907
Query: 543 RKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFS-----LLEKSKETWVSLKLNS 597
K ++ E+ + +L + L E + NLQE+ S LE+++ ++ +
Sbjct: 908 SKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDE 967
Query: 598 AESQSS-ASLQSQLN-------DALDNIKKYKEEHSKMTMEIESLKEKLK 639
E++ L++++ A++ ++ KE + +T + E L E +
Sbjct: 968 EEARRRLKRLENKIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKE 1017
Score = 45.4 bits (108), Expect = 1e-04
Identities = 47/264 (17%), Positives = 104/264 (39%), Gaps = 19/264 (7%)
Query: 399 TQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLE 458
+ +E E E +E++ +L E++ Q+E +L ++R E+Q L
Sbjct: 253 EELEELTAELQELEEK----LEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILR 308
Query: 459 QSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNI 518
+ ++ LE QLE+L+ D E + L E A ++
Sbjct: 309 ERLANLERQLEELEAQLEELESK--LDELAEELAELEEKLEELKEELESLEAELEELEAE 366
Query: 519 IDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCL----VKNEKWVI 574
++ E + +++E K+ + +I NE+ + L + E+
Sbjct: 367 LEELESRLEELE-----EQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421
Query: 575 NLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESL 634
++E LE+++ + +L E + LQ +L + +++ +EE + +++
Sbjct: 422 EIEELLKKLEEAELKELQAELEELE-EELEELQEELERLEEALEELREELEEAEQALDAA 480
Query: 635 KEKLKFSQSKEN---DLLEEYEKL 655
+ +L Q++ + L E E
Sbjct: 481 ERELAQLQARLDSLERLQENLEGF 504
Score = 45.0 bits (107), Expect = 2e-04
Identities = 47/285 (16%), Positives = 117/285 (41%), Gaps = 33/285 (11%)
Query: 203 DRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQ 262
+ L ++E LE ++A L + L ++ E +L+++ E E+ +
Sbjct: 680 EELEEKIEELEEKIAE---------LEKALAELRKELEELEEELEQL----RKELEELSR 726
Query: 263 KCQDTERKLVNIENECSNLQELLDRNRDELEEI---IKKYQDQVTKNKEFESTIDQLLNS 319
+ + L +E E L+E + + EL E+ I++ ++++ + +E + + +
Sbjct: 727 QISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE 786
Query: 320 NQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLE 379
+ + +E+L+ E ++ LD +L ++E++ +++ +
Sbjct: 787 LEAQ---IEQLKEELKALREALDEL-------RAELTLLNEEAANLRERLESLERRIAAT 836
Query: 380 TKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKD---LREQLHTA 436
+ ++ L + +++ + + E E ++E + + L E L
Sbjct: 837 ERRLED-LEEQIEELSEDIE---SLAAEIEELEELIEELESELEALLNERASLEEALALL 892
Query: 437 QEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQT 481
+ E L E + E + + L + +K+ LEL+LE L+
Sbjct: 893 RSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVR 937
Score = 38.1 bits (89), Expect = 0.024
Identities = 45/259 (17%), Positives = 102/259 (39%), Gaps = 20/259 (7%)
Query: 227 KLRDEIKLLHRDKSTYE-NQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELL 285
+L+ E++ L +L+E ++++ E E ++ ++ +L L+E L
Sbjct: 217 ELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAEL-------QELEEKL 269
Query: 286 DRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFT 345
+ R E+ E+ ++ ++ + + I +L Q L L+ + + + QL+
Sbjct: 270 EELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEEL- 328
Query: 346 LKDNLMHDKLKNYYMNFHIIKEKMTNIKNEL-GLETKVDQNILVKYFDDVHQILTQFDEY 404
KL ++EK+ +K EL LE ++++ L +++ L + +E
Sbjct: 329 ------ESKLDELAEELAELEEKLEELKEELESLEAELEE--LEAELEELESRLEELEEQ 380
Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLH 464
+ I + + L +L ++ LQ+ E E
Sbjct: 381 LETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ--EIEELLKKLEEAELKEL 438
Query: 465 LDKIGGLELQLEKLKQTYN 483
++ LE +LE+L++
Sbjct: 439 QAELEELEEELEELQEELE 457
Score = 35.8 bits (83), Expect = 0.12
Identities = 106/616 (17%), Positives = 248/616 (40%), Gaps = 90/616 (14%)
Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLES 214
+ELY L I Q++ +L+ +L+ L+ L++ + + D L + LE
Sbjct: 288 KELYALANEISRLEQQKQILRERLANLE---RQLEELEAQLEELESKLDELAEELAELEE 344
Query: 215 QLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTER---KL 271
+L + L E++ L + E++L+E+ +++ + +V Q ++
Sbjct: 345 KLEELKEELES--LEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEI 402
Query: 272 VNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQ 331
+E L++ +R + E+EE++KK ++ + KE ++ +++L +E LE+L+
Sbjct: 403 ERLEARLERLEDRRERLQQEIEELLKKLEEA--ELKELQAELEELEEELEELQEELERLE 460
Query: 332 YETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYF 391
+ +++L+ + +L ++ N++ +
Sbjct: 461 EALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG---------------FS 505
Query: 392 DDVHQILTQFDEYFLRFNESESSG-LNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRN 450
+ V +L N+S SG L + ++ V + + A LQ +V N
Sbjct: 506 EGVKALLK---------NQSGLSGILGVLSELISVDEGYEAAIEAALGG--RLQAVVVEN 554
Query: 451 EEQ----LAFLEQSK-----YLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTL 501
+AFL+Q++ +L LD I G E+Q + N + + G A +
Sbjct: 555 LNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGF----LGVAKDLVKF 610
Query: 502 LINEEKTVAPNQKDVNIID-------VSNQYNSEKDFVSINKE----------------- 537
K ++ V ++D ++ + V+++ +
Sbjct: 611 DPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNS 670
Query: 538 -IEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEK---SKETWVSL 593
I E R+I+++ +I + ++ + + L + K + L+E+ L K +S
Sbjct: 671 SILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISA 730
Query: 594 KLNSAE---------SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSK 644
+ A L +L + I++ +E + E+ + +++ +++
Sbjct: 731 LRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQ 790
Query: 645 ENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELD 704
L EE + L+ + + + L + E N +++ + E+ I T++ L L ++
Sbjct: 791 IEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAA--TERRLEDLEEQI- 847
Query: 705 ETIHKNVESVTSNMEN 720
E + +++ES+ + +E
Sbjct: 848 EELSEDIESLAAEIEE 863
Score = 33.1 bits (76), Expect = 0.84
Identities = 48/287 (16%), Positives = 117/287 (40%), Gaps = 44/287 (15%)
Query: 427 KDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDC 486
++L E++ +E+ L++ + ++L LE+ +LE+L++
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELEELEE--------------ELEQLRKE----- 720
Query: 487 YENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKID 546
E + L E V E+ ++KE+ E +I+
Sbjct: 721 -LEELSRQISALRKDLARLEAEVE---------------QLEERIAQLSKELTELEAEIE 764
Query: 547 DINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLN---SAESQSS 603
++ + + EL +A+ + + E + L+E+ L ++ + +L + ++ +
Sbjct: 765 ELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD-----ELRAELTLLNEEA 819
Query: 604 ASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQ 663
A+L+ +L I + + +IE L E ++ ++ +L E E+L+ ++
Sbjct: 820 ANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALL 879
Query: 664 NVLPSIVESMNNTKDKPQDSEQSIEE-EITQQSLNPLSSELDETIHK 709
N S+ E++ + + ++ + + E E + L EL E + +
Sbjct: 880 NERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQ 926
Score = 31.2 bits (71), Expect = 3.0
Identities = 26/154 (16%), Positives = 65/154 (42%), Gaps = 24/154 (15%)
Query: 575 NLQEQFSLLEKSKETWVSLKLNSAESQSS--------ASLQSQLNDALDNIKKYKEEHSK 626
L+E L++++E L E + + L+ ++ + + E S+
Sbjct: 240 ELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISR 299
Query: 627 MTMEIESLKEKLKFSQ--------------SKENDLLEEYEKLKLVIKDYQNVLPSIVES 672
+ + + L+E+L + SK ++L EE +L+ +++ + L S+
Sbjct: 300 LEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAE 359
Query: 673 MNNTKDKPQDSEQSIEEEITQQSLNPLSSELDET 706
+ + + ++ E +EE ++ L L S++ +
Sbjct: 360 LEELEAELEELESRLEE--LEEQLETLRSKVAQL 391
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 45.9 bits (109), Expect = 8e-05
Identities = 44/200 (22%), Positives = 89/200 (44%), Gaps = 23/200 (11%)
Query: 156 ELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFL-ES 214
+L QE QR + L+ +L +L+ + + +AL E + LS +E L E
Sbjct: 176 QLKDRQQKEQELAQRLDFLQFQLEELEEADLQPGE-----DEALEAEQQRLSNLEKLREL 230
Query: 215 QLAAYSKSFTDDKLRDEIKLLHRDK-------STYENQLKEVMKKMLAEKVEVMQKCQDT 267
A + D + E LL S + L+E+ +++ EV + ++
Sbjct: 231 SQNALAALRGDVDV-QEGSLLEGLGEAQLALASVIDGSLRELAEQVGNALTEVEEATREL 289
Query: 268 ERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTK----NKEFESTIDQLLNSNQEK 323
+ L +E + L E+ +R +++ + +KY V + ++ + +DQL +S++
Sbjct: 290 QNYLDELEFDPERLNEIEER-LAQIKRLKRKYGASVEEVLEYAEKIKEELDQLDDSDES- 347
Query: 324 SNTLEKLQYETLQAQKQLDA 343
LE L+ E + +++LD
Sbjct: 348 ---LEALEEEVDKLEEELDK 364
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 44.1 bits (105), Expect = 3e-04
Identities = 44/196 (22%), Positives = 87/196 (44%), Gaps = 15/196 (7%)
Query: 156 ELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQ 215
EL L +E QR ++L+ +L +L+ + +L+ + L +E + LS E L
Sbjct: 172 ELEDLQEKERERAQRADLLQFQLEELEEL--NLQ---PGEDEELEEERKRLSNSEKLAEA 226
Query: 216 LA-AYSKSFTDDKLRDEIKLLHR------DKSTYENQLKEVMKKMLAEKVEVMQKCQDT- 267
+ A +D + LL R D S Y+ +L E+ ++L E + +++ +
Sbjct: 227 IQNALELLSGEDDTVSALSLLGRALEALEDLSEYDGKLSEL-AELLEEALYELEEASEEL 285
Query: 268 ERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTL 327
L +E + + L+E+ +R L+ + +KY + E+ I + L +L
Sbjct: 286 RAYLDELEFDPNRLEEVEER-LFALKSLARKYGVTIEDLLEYLDKIKEELAQLDNSEESL 344
Query: 328 EKLQYETLQAQKQLDA 343
E L+ E + + +L
Sbjct: 345 EALEKEVKKLKAELLE 360
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 42.7 bits (101), Expect = 3e-04
Identities = 43/204 (21%), Positives = 81/204 (39%), Gaps = 16/204 (7%)
Query: 536 KEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKL 595
I++ + D + I + L AK L K + L+ + LE
Sbjct: 10 LAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQV------SQ 63
Query: 596 NSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKL 655
+E Q + + L +K +E + +EI+ KE++ + + +L+EE EKL
Sbjct: 64 LESEIQEIRERIKRAEEKLSAVKDERE-LRALNIEIQIAKERINSLEDELAELMEEIEKL 122
Query: 656 KLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEI--TQQSLNPLSSELDETIHKNVES 713
+ I+D + E + + ++E +EEE+ ++ LSS+ +E K
Sbjct: 123 EKEIEDLK-------ERLERLEKNLAEAEARLEEEVAEIREEGQELSSKREELKEKLDPE 175
Query: 714 VTSNMENDKSNEDDNSSVQSTDEV 737
+ S E + N+ V V
Sbjct: 176 LLSEYERIRKNKKGVGVVPLEGRV 199
Score = 30.0 bits (68), Expect = 5.1
Identities = 18/145 (12%), Positives = 60/145 (41%), Gaps = 9/145 (6%)
Query: 268 ERKLVNIENECSNLQELLDRNRDELEEI---IKKYQDQVTKNKEFESTIDQLLNSNQEK- 323
E ++ I + L+ LE + ++ ++QV++ + I + + +EK
Sbjct: 23 EPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKL 82
Query: 324 -----SNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGL 378
L L E A++++++ + + ++++ +KE++ ++ L
Sbjct: 83 SAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAE 142
Query: 379 ETKVDQNILVKYFDDVHQILTQFDE 403
+ + + ++ ++ ++ +E
Sbjct: 143 AEARLEEEVAEIREEGQELSSKREE 167
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 40.4 bits (95), Expect = 0.004
Identities = 42/194 (21%), Positives = 73/194 (37%), Gaps = 35/194 (18%)
Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLES 214
EEL + EA+ + + LK +L + +++ A+ L R+E LE
Sbjct: 211 EELEAARLELNEAIAQRDALKRQLGGEEPVLLAGSSVAN---------SELDGRIEALEK 261
Query: 215 QLAAYSKSFTDD-----KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQ---- 265
QL A +TD + EI L ++ E K + V Q+ Q
Sbjct: 262 QLDALRLRYTDKHPDVIATKREIAQL-EEQKEEEGSAKNGGPERGEIANPVYQQLQIELA 320
Query: 266 DTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSN 325
+ E ++ ++E + L ++R L I E E+ + QL N++
Sbjct: 321 EAEAEIASLEARVAELTARIERLESLLRTI-----------PEVEAELTQL---NRDYEV 366
Query: 326 TLEKLQYETLQAQK 339
K YE L ++
Sbjct: 367 --NKSNYEQLLTRR 378
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 40.8 bits (95), Expect = 0.004
Identities = 100/487 (20%), Positives = 210/487 (43%), Gaps = 50/487 (10%)
Query: 226 DKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTER---------KLVNIEN 276
+K++D+IK L + + L+E+ KK E+ + D E IE
Sbjct: 1114 NKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPEEIEK 1173
Query: 277 ECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
+ N+ +D+ ++ +EI KK +++ + ++ +++++++ N L KL E +
Sbjct: 1174 KIENIVTKIDKKKNIYDEI-KKLLNEIAEIEKDKTSLEEVKGINLSYGKNLGKLFLEKID 1232
Query: 337 AQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQ-----NILVKYF 391
+K+ +K ++ Y + IKEK I+NE+G+E + NI
Sbjct: 1233 EEKKKSEHMIK------AMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEMETFNISHDDD 1286
Query: 392 DDVHQILTQFDEYFLRFNESE---SSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVN 448
D H I + DE E + DI + K+L++ L AQ+ + + +N
Sbjct: 1287 KDHHIISKKHDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLN 1346
Query: 449 RNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKT 508
E + ++ K+ ++ ++++K+ Y + EN K N L EK
Sbjct: 1347 ---------EIANIYNILKLNKIKKIIDEVKE-YTKEIEENNK-----NIKDELDKSEKL 1391
Query: 509 VAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVK 568
+ + D+N+ + ++ S D I++ I++ + I +E + +A +
Sbjct: 1392 IKKIKDDINLEECKSKIESTLDDKDIDECIKKIKELKNHILSEESNIDTYFKNAD----E 1447
Query: 569 NEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMT 628
N + V+ L + + + + + +K ++A + ++ ++L + +D K K+E K
Sbjct: 1448 NNENVLLLFKNIEMADNKSQHILKIKKDNATNDHDFNI-NELKEHIDKSKGCKDEADKNA 1506
Query: 629 MEIESLKEKLKFSQSKENDLLEEYEKLKL------VIKDYQNVLPSIVESMNNTKDKPQD 682
IE KE + + +LL +Y L + KD + ++ I ++ + +
Sbjct: 1507 KAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKFILEAEK 1566
Query: 683 SEQSIEE 689
SEQ I+E
Sbjct: 1567 SEQKIKE 1573
Score = 30.8 bits (69), Expect = 4.2
Identities = 36/200 (18%), Positives = 92/200 (46%), Gaps = 12/200 (6%)
Query: 259 EVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLN 318
++++ ++ E++L N N+ + ++ L++ + ++ EI Y DQ+ + + Q +
Sbjct: 755 KILEDFKNKEKELSNKINDYAKEKDELNKYKSKISEIKNHYNDQINIDNIKDEDAKQNYD 814
Query: 319 SNQEKSNTLEKLQYETLQAQKQL----DAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKN 374
++E T+ + E + ++ D F K + + N E+ + N
Sbjct: 815 KSKEYIKTISIKEDEIFKIINEMKFMKDDFLNKVDKFINFENNCKEKIDSEHEQFAELTN 874
Query: 375 ELGLETKVDQ-NILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIE---KVTKDLR 430
++ E D+ N K F+D ++ + ++ E E +N+++ ++ K+ ++ +
Sbjct: 875 KIKAEISDDKLNDYEKKFNDSKSLINEINKSI----EEEYQNINTLKKVDEYIKICENTK 930
Query: 431 EQLHTAQEEYTSLQEIVNRN 450
E + + L+EI+N+N
Sbjct: 931 ESIEKFHNKQNILKEILNKN 950
>gnl|CDD|178515 PLN02927, PLN02927, antheraxanthin epoxidase/zeaxanthin epoxidase.
Length = 668
Score = 40.1 bits (93), Expect = 0.005
Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 6/71 (8%)
Query: 56 LSRNHALIWYYNGKFYLQDTKSSNGTFV-NNQRLSKTNEESAPHELCSGDVVQFGVDV-- 112
+S+ HA + Y +G F+L D +S +GT+V +N+ + P S D+++FG D
Sbjct: 581 VSKMHARVIYKDGAFFLMDLRSEHGTYVTDNEGRRYRATPNFPARFRSSDIIEFGSDKKA 640
Query: 113 ---IETNRNTP 120
++ R TP
Sbjct: 641 AFRVKVIRKTP 651
>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
secretion system, contains a FHA domain [Intracellular
trafficking, secretion, and vesicular transport;
Signal transduction mechanisms].
Length = 430
Score = 39.8 bits (93), Expect = 0.005
Identities = 21/59 (35%), Positives = 27/59 (45%), Gaps = 4/59 (6%)
Query: 56 LSRNHALIWYYNGKFYLQDTKSSNGTFVNNQRLSKTNEESAPHELCSGDVVQFGVDVIE 114
+S+ H I Y +G F L DT S+ G V N E SA L GD + G +I
Sbjct: 47 VSKQHCTISYRDGGFCLTDT-SNGGLLV-NGSDLPLGEGSAR--LQQGDEILIGRYIIR 101
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 39.7 bits (93), Expect = 0.007
Identities = 111/560 (19%), Positives = 228/560 (40%), Gaps = 71/560 (12%)
Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLES 214
+EL ++ I E L+ +L KL+ V +L++ + + + + L LE
Sbjct: 200 KELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEE 259
Query: 215 QLAAYSKSFTDDKLR--------DEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQD 266
++ + + K E+K L Y +L E ++ L E E+ ++
Sbjct: 260 KIRELEERIEELKKEIEELEEKVKELKELKEKAEEYI-KLSEFYEEYLDELREIEKRLSR 318
Query: 267 TERKLVNIE---NECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLN----- 318
E ++ IE E +E L+ + +L+E+ K+ ++ +++ +E
Sbjct: 319 LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYE-EAKAKKEELERL 377
Query: 319 SNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELG- 377
+ T EKL+ E + +K + + + + ++ IKE I EL
Sbjct: 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELK---KEIKELKKAI-EELKK 433
Query: 378 -----------LETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKV- 425
L + + +L +Y ++ +I + E E E + ++EKV
Sbjct: 434 AKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKE----IEEKERKLRKELRELEKVL 489
Query: 426 --------TKDLREQLHTAQEEYTS--LQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQL 475
K+L EQL +E+ L+E+ + EE E+ L +I L+ +L
Sbjct: 490 KKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLK-GEIKSLKKEL 548
Query: 476 EKLKQTYNSDCYENEKFGKAYNCDTLLINE-EKTVAPNQKDVN--IIDVSNQYNSEKDFV 532
EKL++ +K + L+ E E+ + +++ + ++ YN +
Sbjct: 549 EKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELK 608
Query: 533 SINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFS----------L 582
KE+E +++ + E+ K EL + + L + K + L++++S
Sbjct: 609 DAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEY 668
Query: 583 LEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQ 642
LE S+E L AE + + ++ L+ +K+ EE K E+E L++ L+ +
Sbjct: 669 LELSRE----LAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVE 724
Query: 643 SKENDLLEEYEKLKLVIKDY 662
+L E+ +K K ++K+
Sbjct: 725 ----ELREKVKKYKALLKER 740
Score = 33.1 bits (76), Expect = 0.85
Identities = 75/430 (17%), Positives = 172/430 (40%), Gaps = 25/430 (5%)
Query: 244 NQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQV 303
+ E ++K + + + ++ E++L + E + + L R+ELE++ K+ ++
Sbjct: 175 KRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELE 234
Query: 304 TKNKEFESTIDQLLNSNQEKSNTLEKL-QYETLQAQKQLDAFTLKDNLMH-DKLKNYYMN 361
+E E +L + K EK+ + E + + + L++ + +LK
Sbjct: 235 ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEE 294
Query: 362 FHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIED 421
+ + E +EL K L + ++++ I + E E + L ++
Sbjct: 295 YIKLSEFYEEYLDELREIEKR----LSRLEEEINGIEERIKEL-----EEKEERLEELKK 345
Query: 422 IEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQT 481
K + E+L E Y + E ++ L +K+ +LEK K+
Sbjct: 346 KLKELEKRLEELEERHELYEEAKAKKEELER---LKKRLTGLTPEKLEKELEELEKAKEE 402
Query: 482 YNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEES 541
+ ++ + + +K + +K V + +E+ + +EE
Sbjct: 403 IEEE--ISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKEL---LEEY 457
Query: 542 SRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQ 601
+ ++ I E+ + + + L + EK V+ + + L++ E +L E +
Sbjct: 458 TAELKRIEKELKEIEEKERKLRKELRELEK-VLKKESELIKLKELAE-----QLKELEEK 511
Query: 602 SSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKD 661
+L + +K KE+ K+ EI+SLK++L+ + + L E +KL + ++
Sbjct: 512 LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEE 571
Query: 662 YQNVLPSIVE 671
+L + E
Sbjct: 572 LAELLKELEE 581
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 39.2 bits (91), Expect = 0.010
Identities = 78/448 (17%), Positives = 168/448 (37%), Gaps = 36/448 (8%)
Query: 227 KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVM---QKCQDTERKLVNIENECSNLQ- 282
K +EI L T E + K++ AE+ + KC ++L + E LQ
Sbjct: 249 KAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQT 308
Query: 283 --ELLDRNRDELEEIIKKYQDQVT----KNKEFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
E L ++ + + K ++ +T + ++ +D L + K NTL K
Sbjct: 309 ELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQA 368
Query: 337 AQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETK--VDQNILVKYFDDV 394
AQ++ + + M D+ + ++++K+ N++ + + ++ ++
Sbjct: 369 AQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTD 428
Query: 395 HQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQL 454
T ++ E E E ++ + +E+ T ++E+ L+E V + +L
Sbjct: 429 TNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQLKL 488
Query: 455 AFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQK 514
+ E+ L L K +L +LKQ + + E + L ++ A +
Sbjct: 489 S--ERELQLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPES 546
Query: 515 DVNIIDVSNQYNSEKDFVSINKEIEESSRKI--------DDINAEITKCTNELVDAK--- 563
V + + + + + DD E + EL A+
Sbjct: 547 ADRGSAVDAGTSRSRADSAGARNEVDRLLDRLEKAEQERDDTEMEAGRLAKELEKAQRHL 606
Query: 564 --------NCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALD 615
++ E+ L E+ LEKS+ +L+ ++ Q + + N
Sbjct: 607 TKQQEKTEATRIEFERKSAELLEEAERLEKSEAEEETLRQST---QIGHAQAAAHNHIEH 663
Query: 616 NIKKYKEEHSKMTMEIESLKEKLKFSQS 643
+++K + + ++ E E L +L+ SQ
Sbjct: 664 HVQKLESDLKQLRAEREQLVAQLEKSQQ 691
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 39.0 bits (91), Expect = 0.014
Identities = 88/523 (16%), Positives = 194/523 (37%), Gaps = 81/523 (15%)
Query: 200 IQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEI------KLLHRDKSTYENQLKEVMK-- 251
+E + ++++ + ++L+ +K +D+ DEI + L ++++ + E +
Sbjct: 1951 NKESAISNKIDNVSNKLSELNKITCNDESYDEILEKEEYEELKDLRNSFNQEKAETLNNL 2010
Query: 252 --KMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEF 309
+ E + D K V N++++++ + ++ I +D + +
Sbjct: 2011 KLNKIKEDFNSYKNLLDELEKSVKTLKASENIKKIVENKKTSIDAINTNIEDIEKEIESI 2070
Query: 310 ESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYM-NFHIIKEK 368
++D+LL + + + +Q + D+ + D ++ Y+ N + +K
Sbjct: 2071 NPSLDELLKKGHKIEISRYTSIIDNVQTKISNDSKNINDIEKKAQIYLAYIKNNYNSIKK 2130
Query: 369 MTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKD 428
+ NE E +V + ILT D+ +E + NS E IE + K
Sbjct: 2131 DISTLNEYFDEKQVS-----------NYILTNIDKANKLSSELSEAVTNSEEIIENIKK- 2178
Query: 429 LREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYE 488
EI+ NE ++ LE +KLK+ Y +
Sbjct: 2179 ----------------EIIEINEN-------------TEMNTLENTADKLKELYEN---- 2205
Query: 489 NEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDI 548
L ++ + K +N I + NS + + I+K ++
Sbjct: 2206 -------------LKKKKNIINNIYKKINFIKLQEIENSSEKYNDISKLFNNV---VETQ 2249
Query: 549 NAEITKCTNELVDAKNCLVKNEKWVINLQEQFSL--LEKSKETWVSLKLNSAESQSSASL 606
++ N++ + K+ + EK +IN+ F+L ++ E + +K N + L
Sbjct: 2250 KKKLLDNKNKINNIKDKINDKEKELINVDSSFTLESIKTFNEIYDDIKSNIGDLY---KL 2306
Query: 607 QSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVL 666
+ ND L +K Y E + + I +L L Q + + E + + +
Sbjct: 2307 EDTNNDELKKVKLYIENITHLLNRINTLINDLDNYQDENYGKDKNIELN----NENNSYI 2362
Query: 667 PSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHK 709
E +NN K++ ++I+ T + N + +
Sbjct: 2363 IKTKEKINNLKEEFSKLLKNIKRNNTLCNNNNIKDFISNIGKS 2405
Score = 34.4 bits (79), Expect = 0.39
Identities = 89/519 (17%), Positives = 200/519 (38%), Gaps = 58/519 (11%)
Query: 250 MKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEF 309
+KK EK+ ++K + + L++I +L + D E+ + K E
Sbjct: 444 IKKKYDEKINELKKSINQLKTLISIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDEL 503
Query: 310 ESTIDQLLNSNQEKSN----------TLEKLQYETLQAQKQLDAF--TLKDNLMHDKLKN 357
I+ + N +N T+E L+ E + + + +++ + +KLK
Sbjct: 504 LQIINSIKEKNNIVNNNFKNIEDYYITIEGLKNEIEGLIELIKYYLQSIETLIKDEKLKR 563
Query: 358 YYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLN 417
N IK K+ I+ + + I + D++ I+ Q +E + L
Sbjct: 564 SMKN--DIKNKIKYIEENV---DHIKDIISLN--DEIDNIIQQIEELI-------NEALF 609
Query: 418 SIEDIEKVTKDLREQLHTAQEEY--TSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQL 475
+ E DL+E++ ++ LQE+++ E FL+ KYL+ + +LQ
Sbjct: 610 NKEKFINEKNDLQEKVKYILNKFYKGDLQELLD---ELSHFLDDHKYLYHEAKSKEDLQ- 665
Query: 476 EKLKQTYNSDCYENEKFGKAYNCDTL-LINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSI 534
L T ++ + E + + + +E + K+ NII K +I
Sbjct: 666 -TLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKE-NIIK--------KQLNNI 715
Query: 535 NKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLK 594
++I S + ++ E + + L + +IN + +F L + +
Sbjct: 716 EQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDG 775
Query: 595 LNSAES---------QSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKE 645
N+ E + + +N +N K ++ + + I+ L+ + + +
Sbjct: 776 KNTYEEFLQYKDTILNKENKISNDINILKENKKNNQDLLNSYNILIQKLEAHTEKNDEEL 835
Query: 646 NDLLE--EYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSEL 703
LL+ E L +K+ + + ++N ++ ++I ++LN ++
Sbjct: 836 KQLLQKFPTEDENLNLKELEKEFNENNQIVDNIIKDIENMNKNIN---IIKTLN-IAINR 891
Query: 704 DETIHKNVESVTSNMENDKSNEDDNSSVQSTDEVKDTEE 742
+ + VE + +N + K+ + + + +TD + E
Sbjct: 892 SNSNKQLVEHLLNNKIDLKNKLEQHMKIINTDNIIQKNE 930
>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
Length = 780
Score = 38.6 bits (90), Expect = 0.015
Identities = 51/413 (12%), Positives = 124/413 (30%), Gaps = 23/413 (5%)
Query: 308 EFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKE 367
+ + D+ N N++ S+ L L + L+A T + D L +
Sbjct: 189 KLKDGTDEASNGNKKLSDLLNTLNNSSATFSDGLNALTSGLTTLTDGLNQLDSGLGTLAA 248
Query: 368 KMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTK 427
+ +K ++++ I F L + + G++++ +
Sbjct: 249 GIGELKQGAE---QLNEGIGE--FSSGLSELNSGVQDLAAGVPQLNQGISALAAGLSLPD 303
Query: 428 DLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQ-SKYLHLDKIGGLELQLEKLKQTYNSDC 486
L +Q + QE T + + + + Q S + + +
Sbjct: 304 SLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLSSLQSMLALSKSLDLTAEGATVDAL 363
Query: 487 YENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNS--EKDFVSINKEIEESSRK 544
+ L +++ ++ ID ++ E+ + K + +
Sbjct: 364 GAPDGVQ------WLDESQKTLATLSELLSTGIDGVSEGLDALEQASAQLAKSLAKLKTA 417
Query: 545 IDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSA 604
+ I A I + + L K + L Q + + L+ A +
Sbjct: 418 VAQIAASIAQLLPGASEVLKTL--KSKGLDKLLNQLNGALAKGSNALVQGLSDANDSFRS 475
Query: 605 SLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQN 664
+QL L+ + + S + + L + K ++L ++
Sbjct: 476 ITSAQLKAGLNTLADGSNDLSSLGPGLGQLADGSKLLADGLSEL-------NTGSAQLRD 528
Query: 665 VLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTSN 717
L + + + D QD+ + + E K+++ V +
Sbjct: 529 GLGELSDGLTELADSLQDAADQLSLANDSDKQASFIANPVELKEKDIDPVPNY 581
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 37.6 bits (87), Expect = 0.030
Identities = 97/519 (18%), Positives = 185/519 (35%), Gaps = 53/519 (10%)
Query: 168 LQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSF---- 223
+Q EN L K + + I+L++ A + +E +E L K
Sbjct: 71 IQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAENEA 130
Query: 224 ---------TDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNI 274
DKL E L +K + KE + ++ + E+ + Q+ + +L +
Sbjct: 131 AEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLL 190
Query: 275 ENECSNLQELLDRNRDELEEIIKKYQ----------DQVTKNKEFESTIDQLLNSNQ-EK 323
E+E L+E L+ + EL E KK Q D K K E + + + K
Sbjct: 191 ESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVK 250
Query: 324 SNTLEKLQYETLQAQKQ-------------LDAFTLKDNLMHDKLKNYYMN-----FHII 365
S + LQ L+ + D LK+ L + + +
Sbjct: 251 SMKEQLLQIPELERELAALREENRKLRSMKEDNELLKEELEDLQSRLERFEKMREKLADL 310
Query: 366 KEKMTNIKNELGLETKVDQNIL--VKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIE 423
+ + ++NEL + Q+I ++ DD+ + + L+ E S +S + +E
Sbjct: 311 ELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLE 370
Query: 424 KVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYN 483
+ L+ + A E L++ + + + L++ L + GL L +
Sbjct: 371 TTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKELT 430
Query: 484 SDCYENEKFGKAYNCDTLL--INEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEES 541
+ + + L+ + NQ DV Q + ++ EI+
Sbjct: 431 ETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKDRNN---TLETEIKLL 487
Query: 542 SRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQ 601
++ ++ + + E+ L+++ SLLE E L L +
Sbjct: 488 KEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLE---HLCLQGDYNA 544
Query: 602 SSAS-LQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLK 639
S L LN A + + K+ + E E LKE+L+
Sbjct: 545 SRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQ 583
Score = 34.9 bits (80), Expect = 0.21
Identities = 85/468 (18%), Positives = 177/468 (37%), Gaps = 71/468 (15%)
Query: 258 VEVMQKCQDTER-----KLVNIENECSNLQELLDRNRDELEEII----KKYQDQVTKNKE 308
E M Q E KL+ +ENE + R + ELE + Y+ ++ +N E
Sbjct: 52 EESMTLLQRAELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLE 111
Query: 309 FESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNL-MHDKLKNYYMNFHIIKE 367
E + L ++ N + + E + +LDA +LK D+LK + IK
Sbjct: 112 LEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKN 171
Query: 368 KMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESES--SGLNSIEDIEKV 425
++ ++ + +L +++ + Q +E E+E L S +
Sbjct: 172 DLSEMQCR-AQNADTELKLLESELEELRE---QLEECQKELAEAEKKLQSLTSEQASSAD 227
Query: 426 TKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYL---------HLDKIGGLELQLE 476
+ L + Y E+V +EQL + + + L + L+
Sbjct: 228 NSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALREENRKLRSMKEDNELLK 287
Query: 477 KLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNII-----------DVSNQY 525
+ + S EK + L + +EK + +++ D+S +
Sbjct: 288 EELEDLQSRLERFEKMREKLA--DLELEKEKLENELKSWKSLLQDIGLNLRTPDDLSRRI 345
Query: 526 ----NSEKDFVSINKEIEESSRK----IDDINAEITKCTNELVDAKNCLVKNEKWVINLQ 577
N E N I S+++ + + E K +E+++ K L + V LQ
Sbjct: 346 VVLQNEELQLKEKNGSISSSAKQLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQ 405
Query: 578 EQFSLLEKSKETWVSL--KLNSAESQSSASLQ-----------------------SQLND 612
+ +L+ K ++ ++ + +++S S Q +QL++
Sbjct: 406 RRLTLVTKERDGLRAILNSYDKELTETSVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSE 465
Query: 613 ALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIK 660
+++ + K+ ++ + EI+ LKE+L ++ + + E L+L I+
Sbjct: 466 LEEDVGQQKDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKIE 513
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 37.8 bits (88), Expect = 0.031
Identities = 102/606 (16%), Positives = 218/606 (35%), Gaps = 70/606 (11%)
Query: 165 QEALQRENMLKNKLSKLQHIVIDLKKAAD--------VSWKALIQEDRLLSRVEFLESQL 216
+ ++ + LK +L+ +I+LKK+ + + + +LL+ ++ E
Sbjct: 888 KIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPS 947
Query: 217 AAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIEN 276
Y K +KL + L YE+ LK+ ++ E + + ++ +++L +
Sbjct: 948 IEYVKLPELNKLHEVESKLKETSEEYEDLLKK-STILVREGNKANSELKNFKKELAELSK 1006
Query: 277 ECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
+ LQE + ++ E+ + +K EST +L Q+ L L+ LQ
Sbjct: 1007 QYGALQESTKQLKELPVEVAELQSA--SKIISSESTELSILKPLQKLKGLLL-LENNQLQ 1063
Query: 337 AQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELG-----------LETKVDQN 385
A+ ++N + D + Y + K N+K+ L+ V Q
Sbjct: 1064 AR-YKALKLRRENSLLDDKQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQM 1122
Query: 386 ILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQE 445
I + ++ + L+Q L + S L D +L S +
Sbjct: 1123 IKLNLLQEISKFLSQ-LVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKR 1181
Query: 446 IVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINE 505
+ S ++ L+ +L L S + K + + E
Sbjct: 1182 LYQSALYDEKSKLSSSEVND-----LKNELIALFSKIFSG-WPRGDKLKKLISEGWVPTE 1235
Query: 506 EKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNC 565
T ++N + S + +S+ I+ + + E+
Sbjct: 1236 YSTSLKGFNNLNKKFDTPASMSNEKLLSLLNSID-NLLSSYKLEEEVL----------PA 1284
Query: 566 LVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHS 625
+ + IN F+ L ++K + + K + + N + + E S
Sbjct: 1285 TINSLLQYIN-VGLFNAL-RTKASSLRWKSATEVNY---------NSEELDDWCREFEIS 1333
Query: 626 KMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQ 685
+ E+E L + +K Q ++DL KL ++ ++ P+ ++++ + + P D E
Sbjct: 1334 DVDEELEELIQAVKVLQLLKDDL----NKLDELLDACYSLNPAEIQNLKS-RYDPADKEN 1388
Query: 686 SIEEEI--------TQQSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQSTDEV 737
++ +EI +Q L DET ++ + S ++ S D +S+ +EV
Sbjct: 1389 NLPKEILKKIEALLIKQELQLSLEGKDET-EVHLSEIFSEEKSLIS--LDRNSIY-KEEV 1444
Query: 738 KDTEEG 743
+
Sbjct: 1445 LSSLSA 1450
Score = 30.4 bits (69), Expect = 5.8
Identities = 54/317 (17%), Positives = 108/317 (34%), Gaps = 48/317 (15%)
Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLES 214
+E CI + LQ+ + KL + + + LK LIQ+ R +
Sbjct: 810 KEYRSYLACIIK-LQKTIKREKKLRETEEVEFSLKAEV------LIQK---FGRSLKAKK 859
Query: 215 QLAAYSKSFTDDKLRDEIKLLHRDKSTYE------NQLKEVMKKMLAEKVEVMQKCQDTE 268
+ + K + ++L R + + LK V ++ +E +E+ +
Sbjct: 860 RFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDL 919
Query: 269 RKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNK--EFESTIDQLLNSNQEKSNT 326
+ + + E + L N D E +Y NK E ES + + +++E +
Sbjct: 920 IENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKE---TSEEYEDL 976
Query: 327 LEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNI 386
L+K + K + + K Y +++ K
Sbjct: 977 LKKSTILVREGNKANSELKNFKKELAELSKQ-YGALQESTKQL-----------KELPVE 1024
Query: 387 LVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEI 446
+ + I SES+ L+ ++ ++K+ L + + Q Y +L+
Sbjct: 1025 VAELQSASKII------------SSESTELSILKPLQKLKGLLLLENNQLQARYKALK-- 1070
Query: 447 VNRNEEQLAFLEQSKYL 463
R E L +Q L
Sbjct: 1071 -LRRENSLLDDKQLYQL 1086
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 36.6 bits (85), Expect = 0.065
Identities = 38/177 (21%), Positives = 77/177 (43%), Gaps = 21/177 (11%)
Query: 167 ALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTDD 226
A + K KL K++ + +L DV +R+ + V A SK ++
Sbjct: 350 AYKAYLAYKPKLEKVERKLPELGIWKDV--------ERIKALVIRGYPLAEALSKVKEEE 401
Query: 227 KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQ-KCQDTERKLVNIENECSNLQELL 285
+ R++ ++ K + K L E VE ++ + + +R+L ++ E L
Sbjct: 402 RPREKEGTEEEERREITVYEKRIKK--LEETVERLEEENSELKRELEELKREIEKL---- 455
Query: 286 DRNRDELEEIIKKYQDQVTKNKEFE---STIDQLLNSNQEKSNTLEKLQYETLQAQK 339
ELE ++ +D+V K++E I++L +EK +E+L+ + + +K
Sbjct: 456 ---ESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRK 509
>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
secretion / Intracellular trafficking and secretion /
Posttranslational modification, protein turnover,
chaperones].
Length = 148
Score = 34.6 bits (80), Expect = 0.074
Identities = 29/134 (21%), Positives = 53/134 (39%), Gaps = 15/134 (11%)
Query: 221 KSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSN 280
+++ E+ + +K E QLK + + E Q + K + N
Sbjct: 14 AKKEEEEAAIELSKIRSEKENAEEQLKM----LSGYRNEYEQNLNEK-LKSGVSAAQWQN 68
Query: 281 LQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQ 340
Q+ + + LE I + Q Q++K ++Q QEK L+ E L+ ++Q
Sbjct: 69 YQQFISQ----LEVAIDQQQSQLSK---LRKQVEQKREIWQEKQIELKA--LEKLK-ERQ 118
Query: 341 LDAFTLKDNLMHDK 354
F L++N K
Sbjct: 119 KTEFLLEENRREQK 132
>gnl|CDD|219890 pfam08537, NBP1, Fungal Nap binding protein NBP1. NBP1 is a
nuclear protein which has been shown in Saccharomyces
cerevisiae to be essential for the G2/M transition of
the cell cycle.
Length = 313
Score = 35.2 bits (81), Expect = 0.12
Identities = 27/123 (21%), Positives = 52/123 (42%), Gaps = 22/123 (17%)
Query: 617 IKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNT 676
I K +E+ +++ E++S ++KL+F + K N LLE + D N+ V+S +
Sbjct: 175 IDKLEEKLAELEQELQSTQKKLQFVKEK-NHLLES-------LLDDANIDDEYVKSRRDI 226
Query: 677 KDKPQDSEQSIEEEIT---QQSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQS 733
K+ D+ + +++ + +NPL S N + E +Q
Sbjct: 227 KNLQNDNLKPELRDLSPSPVRPVNPL-----------FTSSPIRKSNRQGRERPLQPLQK 275
Query: 734 TDE 736
D
Sbjct: 276 GDN 278
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 35.5 bits (82), Expect = 0.17
Identities = 83/483 (17%), Positives = 175/483 (36%), Gaps = 56/483 (11%)
Query: 166 EALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTD 225
EA++ + L ++L + +V +L+ A + ++L +E LE LAA +++
Sbjct: 534 EAVESADQLADRLLREAQLVGELQSLRQQEEAARRRLEQLEKELEVLELALAALREAWQA 593
Query: 226 -------DKLRDEIKLLHRDKSTYENQLKEVMKK--MLAEKVEVMQKCQDTERKL---VN 273
E++ +++T Q++ K L ++ + + R V
Sbjct: 594 QWAAAGLPLTPAEMEDWLAERATAREQVRAYFKARAELDALLDRRARLRAALRAALKAVA 653
Query: 274 IENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYE 333
I L ELL+ R LEE K Q + + + + LE+ +
Sbjct: 654 IVLPGEELAELLELARQLLEEAEK----QAARKASLDERL------RDAERA-LEEAEER 702
Query: 334 TLQAQKQLDAF--TLKDNLMHDKL--KNYYMNFHIIKEKMTNIKNELGLETKVDQNI--L 387
+AQ L+A+ D L+ L + E + NIK +L + Q I +
Sbjct: 703 HDEAQSALEAWQEQWYDALLEAGLGGRASPAGALDALELLQNIKEKLQAADDLRQRIAAM 762
Query: 388 VKYFDDVHQILTQF-DEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEI 446
+ + + + + + ++ + L+++L A++
Sbjct: 763 ERDLARFEEEVEALAEAVAPEMLGTPA---------DETARALKQRLKRARDT----AAA 809
Query: 447 VNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEE 506
+ E++ E+ + E +L L + E E DT +
Sbjct: 810 AEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIE-ELLAAVERSDTYRELRK 868
Query: 507 KTVAPNQKDVNIIDV-----SNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVD 561
+ A + V + D + +EE +R I+++ E+ + E+
Sbjct: 869 RIAALERTLVRAGGGLSLEALVAEAAALDPDELPARLEELARDIEELEEELNELAQEVGA 928
Query: 562 AKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYK 621
AK L + ++ + LE +E+ ++ + AE +L S+L I++Y+
Sbjct: 929 AKQELAR-----MDGGSTAAELEAERESLLAQLRDLAERYLELALASRL--LRKAIERYR 981
Query: 622 EEH 624
E+
Sbjct: 982 EQR 984
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 35.4 bits (82), Expect = 0.17
Identities = 43/294 (14%), Positives = 102/294 (34%), Gaps = 8/294 (2%)
Query: 538 IEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKE--TWVSLKL 595
++E + +I K +E + A+ E + ++ L + +S+ +
Sbjct: 1037 VKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPI 1096
Query: 596 NSAESQSSASLQSQLNDALDNIKKYKE-EHSKMTME-IESLKEKLKFSQSKENDLLEEYE 653
S + L ++L ++K K M +E ++ +E L+ + E + + +
Sbjct: 1097 WSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQ 1156
Query: 654 KLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVES 713
+LK K + L K K +++S + + S S E + K
Sbjct: 1157 RLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNK 1216
Query: 714 VTSNMENDKSNEDDNSSVQSTDEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGD 773
+++ +D+ ++++ + VK + + S SED ++ +
Sbjct: 1217 KSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSK--SSEDNDEFSSDDLSKEGKPKNA 1274
Query: 774 LSPIDPPESVIALINDDDTSDSSSEMTDSSTLIRIRKPKPVSVPTDASLVKPQS 827
+ + + D S + +K K + A+L K +
Sbjct: 1275 PKRVSAVQYSPPPPS--KRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKK 1326
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 35.0 bits (81), Expect = 0.18
Identities = 47/209 (22%), Positives = 88/209 (42%), Gaps = 29/209 (13%)
Query: 175 KNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKL 234
+++ Q+ +L + A + KA + + L + L + S + KL
Sbjct: 212 GENIARKQNKYDELVEEAK-TIKA--EIEELTDELLNLVMDIEDPSAALN--KLNTAAAK 266
Query: 235 LHRDKSTYENQLKEVMKKMLAEKVEV---MQKCQDTERKLVNIENECSNLQ---ELLDRN 288
+ KS E K + KM + Q+ + ++ I+++ LQ E LD
Sbjct: 267 I---KSKIEQFQKVI--KMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTA 321
Query: 289 RDELEEIIKKYQDQVTKNKEFESTIDQ-------LLNSNQEKSNTLEKLQYETLQAQKQL 341
DELEEI+ ++ +Q K E ++ I L++ ++ +E+LQ E + ++L
Sbjct: 322 IDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEEL 381
Query: 342 DAFTLKDNLMHDKLKNYYMNFHIIKEKMT 370
L+D L DK+ ++KEK
Sbjct: 382 A--KLQDEL--DKIVKTKSE--LVKEKYH 404
Score = 32.3 bits (74), Expect = 1.3
Identities = 35/245 (14%), Positives = 95/245 (38%), Gaps = 29/245 (11%)
Query: 250 MKKMLAEKV-EVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKE 308
M K+ +K+ E+ Q+ Q + K+ +I+ + + ++ R + E I + Q++ + E
Sbjct: 168 MDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVE 227
Query: 309 FESTIDQLLNSNQEK-----------SNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKN 357
TI + ++ S L KL + + +++ F K+
Sbjct: 228 EAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQ--------KVIK 279
Query: 358 YYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLN 417
Y + I + + K D + ++ ++ +E ++
Sbjct: 280 MYEKGGVCPTCTQQISEG--------PDRITKIKDKLKELQHSLEKLDTAIDE-LEEIMD 330
Query: 418 SIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEK 477
+ K +L+ ++ T ++ +L + + + + L+ + +++ L+ +L+K
Sbjct: 331 EFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDK 390
Query: 478 LKQTY 482
+ +T
Sbjct: 391 IVKTK 395
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 35.3 bits (82), Expect = 0.19
Identities = 35/199 (17%), Positives = 74/199 (37%), Gaps = 18/199 (9%)
Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNE------EQLAFLE 458
E +E IEK K+L E++ + E L++ + R E L+ L
Sbjct: 81 VKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLL 140
Query: 459 QSKYLHLDKIGGLEL-QLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVN 517
KY+ + +G + +LE+LK + + E K Y +++ + +V
Sbjct: 141 GFKYVSV-FVGTVPEDKLEELKLESDVENVEYISTDKGY--VYVVVV---VLKELSDEVE 194
Query: 518 II--DVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVIN 575
+ + ++ + ++ I E ++++I E EL + K + ++
Sbjct: 195 EELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAK---KYLEELLA 251
Query: 576 LQEQFSLLEKSKETWVSLK 594
L E + + E
Sbjct: 252 LYEYLEIELERAEALSKFL 270
Score = 33.7 bits (78), Expect = 0.52
Identities = 14/56 (25%), Positives = 26/56 (46%)
Query: 617 IKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVES 672
I++ KEE ++ E ESL E+LK K + L + + + L +++
Sbjct: 217 IREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALSKFLKT 272
Score = 30.7 bits (70), Expect = 4.0
Identities = 29/181 (16%), Positives = 75/181 (41%), Gaps = 27/181 (14%)
Query: 164 IQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSF 223
I+E + + L+N++ +L+ + L+ W + LL +++ + +
Sbjct: 102 IKELEEEISELENEIKELEQEIERLEP-----WGNFDLDLSLLLGFKYVSVFVGTVPEDK 156
Query: 224 TDDKLRDEIK----LLHRDKST----------YENQLKEVMKKMLAEKVEVMQKCQDTER 269
++ + + DK ++++E +KK+ E++E+ ++ +E
Sbjct: 157 LEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSEL 216
Query: 270 KLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEK 329
+ I+ E +++ + +EL+E+ KKY + + + L E++ L K
Sbjct: 217 -IREIKEELEEIEKERESLLEELKELAKKYLE-------ELLALYEYLEIELERAEALSK 268
Query: 330 L 330
Sbjct: 269 F 269
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 35.1 bits (81), Expect = 0.24
Identities = 47/253 (18%), Positives = 97/253 (38%), Gaps = 30/253 (11%)
Query: 245 QLKEVMKKMLAEKVEVMQKCQDTER-----KLVNIENECSNLQELLDRNRDELEEIIKKY 299
L+ + K +A ++E +Q+ + +L ++ E +E L ++E +E +
Sbjct: 237 ALRAIQK--VAPEIEKLQE-DFEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRL 293
Query: 300 QDQV-TKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNY 358
+ Q+ T + + D+L + L + E + Q AF ++ ++L+
Sbjct: 294 RQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAF-EDADI--EQLQAD 350
Query: 359 YMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNS 418
I+ ++ ++ L T Q++ KY +I Q E + N
Sbjct: 351 LDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQ--------LERDLEKNNE 402
Query: 419 IEDIEKVTKDLREQLHTAQEEYTSL-QEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEK 477
+ KD Q +E+ +L ++ + E + +Y LEL+L +
Sbjct: 403 RLAAIREEKD--RQKAAIEEDLQALESQLRQQLEAGKLEFNEEEY-------ELELRLGR 453
Query: 478 LKQTYNSDCYENE 490
LKQ +S E
Sbjct: 454 LKQRLDSATATPE 466
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 34.6 bits (80), Expect = 0.28
Identities = 44/260 (16%), Positives = 91/260 (35%), Gaps = 51/260 (19%)
Query: 414 SGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLEL 473
L S++ +E L E++ + E SL+E++ E+ + +E+ +++ L+
Sbjct: 68 DYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDE 127
Query: 474 QLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVS 533
+LE L + ++ D+S + V
Sbjct: 128 ELEDL-------------------------EDLLEELEPLAYLD-FDLSLLRGLKFLLVR 161
Query: 534 INKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSL 593
+ E + + + E V+A +V V + E L+K + L
Sbjct: 162 LGLVRREKLEALVGVIEDEVALYGENVEASVVIV-----VAHGAE---DLDKVSKILNEL 213
Query: 594 KLNSAESQSS----ASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKL-KFSQSKENDL 648
E + L S+L + + I+ E+ESL+ +L ++ +L
Sbjct: 214 GFELYEVPEFDGGPSELISELEEVIAEIQD----------ELESLRSELEALAEKIAEEL 263
Query: 649 LEEYEKLKL--VIKDYQNVL 666
L E L++ + D + L
Sbjct: 264 LAVREILEIEKALGDVLSKL 283
Score = 33.9 bits (78), Expect = 0.46
Identities = 29/173 (16%), Positives = 65/173 (37%), Gaps = 16/173 (9%)
Query: 164 IQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKS- 222
+ E L+ L +L L ++ DL + + + L R E LE+ +
Sbjct: 125 LDEELEDLEDLLEELEPLAYLDFDL---SLLRGLKFLLVRLGLVRREKLEALVGVIEDEV 181
Query: 223 ---FTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECS 279
+ + I + H + + + +++ ++ E EV + + +E +
Sbjct: 182 ALYGENVEASVVIVVAHGAEDLDK--VSKILNELGFELYEVPEFDGGPSELISELEEVIA 239
Query: 280 NLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQY 332
+Q+ L+ R ELE + +K + + ++L + + L KL
Sbjct: 240 EIQDELESLRSELEALAEKIAE-------ELLAVREILEIEKALGDVLSKLAR 285
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 34.4 bits (79), Expect = 0.29
Identities = 19/88 (21%), Positives = 44/88 (50%), Gaps = 7/88 (7%)
Query: 240 STYENQLK---EVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEII 296
+ QLK E +K+ EK E+ Q+C + +++++ ++ E ++LQ+ ++ ++ L ++
Sbjct: 168 GNLDQQLKKALEDLKEAQEEKDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLE 227
Query: 297 KKYQDQVT----KNKEFESTIDQLLNSN 320
K +S ++QL N
Sbjct: 228 GSSLGPNQLGSKKYNLLQSQLEQLQEEN 255
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 34.3 bits (79), Expect = 0.38
Identities = 54/289 (18%), Positives = 105/289 (36%), Gaps = 38/289 (13%)
Query: 206 LSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKK--MLAEKVEVMQK 263
L ++ L L + +L I + E +LKE +K LA+++ ++
Sbjct: 181 LDLIDRLAGDLTNVLRRRKKSELPSSI---LSEIEALEAELKEQSEKYEDLAQEIAHLRN 237
Query: 264 -CQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQ-----LL 317
++ +R L ++E + + L R++LE +K+ + N+ + LL
Sbjct: 238 ELEEAQRSLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRELAADPLPLLL 297
Query: 318 NSNQEKSNTLEKLQYE--TLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNE 375
N S T +LQ E + Q Q + +D + + L + +KE I E
Sbjct: 298 IPNLLDS-TKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALPAEHVKE----IAAE 352
Query: 376 LGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHT 435
L I D + I+ +++ +D + QL
Sbjct: 353 L-------AEIDKPATTDSEIPHRLSGSELTQLE-------VLIQQVKRELQDAKSQLLK 398
Query: 436 AQEEYTSLQEIVNRN------EEQLAFLEQSKYLHLDKIGGLELQLEKL 478
E V++ EEQ+A L + +++ E ++E+L
Sbjct: 399 ELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEEL 447
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 33.8 bits (77), Expect = 0.39
Identities = 17/87 (19%), Positives = 32/87 (36%), Gaps = 8/87 (9%)
Query: 405 FLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSL--------QEIVNRNEEQLAF 456
N + SG+ ++DI + LR +L TAQ E + E+ +E+ A
Sbjct: 62 LFLLNRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAV 121
Query: 457 LEQSKYLHLDKIGGLELQLEKLKQTYN 483
++ + + KQ +
Sbjct: 122 RQELAAARQNLAKAQQELARLTKQAQD 148
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 33.1 bits (76), Expect = 0.41
Identities = 17/84 (20%), Positives = 41/84 (48%), Gaps = 3/84 (3%)
Query: 584 EKSKETWVSLK-LNSAESQSS--ASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKF 640
K +E WV + L+ S LQ +L + + + + +E+ +++ E + LK++L
Sbjct: 46 PKGREGWVLSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELST 105
Query: 641 SQSKENDLLEEYEKLKLVIKDYQN 664
+++ L +E ++K + +
Sbjct: 106 LEAELERLQKELARIKQLSANAIE 129
>gnl|CDD|176245 cd08285, NADP_ADH, NADP(H)-dependent alcohol dehydrogenases. This
group is predominated by atypical alcohol
dehydrogenases; they exist as tetramers and exhibit
specificity for NADP(H) as a cofactor in the
interconversion of alcohols and aldehydes, or ketones.
Like other zinc-dependent alcohol dehydrogenases (ADH)
of the medium chain alcohol dehydrogenase/reductase
family (MDR), tetrameric ADHs have a catalytic zinc that
resides between the catalytic and NAD(H)binding domains;
however, they do not have and a structural zinc in a
lobe of the catalytic domain. The medium chain alcohol
dehydrogenase family (MDR) has a NAD(P)(H)-binding
domain in a Rossmann fold of a beta-alpha form. The
N-terminal region typically has an all-beta catalytic
domain. These proteins typically form dimers (typically
higher plants, mammals) or tetramers (yeast, bacteria),
and have 2 tightly bound zinc atoms per subunit.
Length = 351
Score = 33.4 bits (77), Expect = 0.47
Identities = 14/28 (50%), Positives = 17/28 (60%), Gaps = 2/28 (7%)
Query: 808 IRKPKPVSVPTDASLVKPQSTTGCQTSD 835
I KP PV P DA +V+P + C TSD
Sbjct: 15 IEKPIPVCGPNDA-IVRPTAVAPC-TSD 40
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 32.3 bits (74), Expect = 0.53
Identities = 28/120 (23%), Positives = 54/120 (45%), Gaps = 3/120 (2%)
Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVI 659
+ +L+ QL L ++ +EE + E+E E L+ + L E E+
Sbjct: 26 LRRKRALERQLAAELKQLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEE---S 82
Query: 660 KDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTSNME 719
K+ VL + + + QDS + E +L+P +L + ++K++ES+ N+E
Sbjct: 83 KNLHPVLRLLESEVLEENELLQDSLLELSERNFSPNLDPELLDLLKQLNKHLESLQGNLE 142
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 33.4 bits (76), Expect = 0.57
Identities = 37/147 (25%), Positives = 64/147 (43%), Gaps = 9/147 (6%)
Query: 592 SLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEE 651
SL+ + S AS+ +L +A N++K EE + +ESLK++L+ + KE + L E
Sbjct: 243 SLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQELEEEK-KELEELRE 301
Query: 652 YEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNV 711
E S+ +N TK + + + EI + L SSE +E K
Sbjct: 302 KEGEAEEA------ASSLEAELNRTKSEKESKAREKMVEIPLK-LQQASSEAEEA-RKEA 353
Query: 712 ESVTSNMENDKSNEDDNSSVQSTDEVK 738
E+ + K + + T E++
Sbjct: 354 EAAREELRKLKEEAEQTKAALETAELR 380
>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
family.
Length = 308
Score = 33.0 bits (76), Expect = 0.58
Identities = 41/162 (25%), Positives = 70/162 (43%), Gaps = 30/162 (18%)
Query: 153 PLEELYQLNTCIQ--EALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVE 210
PL +L ++N + E L L+ KL +L LK+ D+ +RLLSR+E
Sbjct: 27 PLLDLKEINERLDAVEELVENPELRQKLRQL------LKRIPDL--------ERLLSRIE 72
Query: 211 FLE---SQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQ-D 266
L S + L++ +LL L +V+ + L E +E++ + D
Sbjct: 73 RGRASPRDLLRLYDSL--EGLKEIRQLLESLDGPLLGLLLKVILEPLLELLELLLELLND 130
Query: 267 TERKLVNIENECSN--------LQELLDRNRDELEEIIKKYQ 300
+ VN + L+E L+ +ELEE++KK +
Sbjct: 131 DDPLEVNDGGLIKDGFDPELDELREKLEELEEELEELLKKER 172
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 33.4 bits (76), Expect = 0.68
Identities = 51/363 (14%), Positives = 127/363 (34%), Gaps = 32/363 (8%)
Query: 251 KKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFE 310
+K +++++ ++ ++++E +L ++ + LE + + +E
Sbjct: 170 RKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALE---YYQLKEKLELEEEN 226
Query: 311 STIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMT 370
L N+E+ + L++L + + + K +E +
Sbjct: 227 LLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEK-----------------EEEILA 269
Query: 371 NIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLR 430
+ E E K + + + E +E ++D EK+ +
Sbjct: 270 QVLKENKEEEKEKKLQEEE-------LKLLAKEEEELKSELLKLERRKVDDEEKLKESE- 321
Query: 431 EQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENE 490
++L ++E +E + E++L LE + ++ LE EKL+Q + +
Sbjct: 322 KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKK 381
Query: 491 KFGKAYNCDTLLINEEKTVAPN--QKDVNIIDVSNQYNSEKDFV--SINKEIEESSRKID 546
+ + L EE + ++ ++++S Q K +EE ++
Sbjct: 382 LESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLE 441
Query: 547 DINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASL 606
++T+ EL L+K++ + ++ + K L + AS
Sbjct: 442 TKQGKLTEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLLLRQKLEEASQ 501
Query: 607 QSQ 609
+
Sbjct: 502 KES 504
Score = 31.9 bits (72), Expect = 1.8
Identities = 35/209 (16%), Positives = 68/209 (32%), Gaps = 14/209 (6%)
Query: 535 NKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLK 594
+E+ ++ + + + + E++ K E+ LQE+ L +E + +
Sbjct: 243 LQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSE 302
Query: 595 LNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEK 654
L E + + + K KE + E KE + +E + EE +
Sbjct: 303 LLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQL 362
Query: 655 LKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESV 714
KL K Q E K K + S ++ ++ L + E E
Sbjct: 363 EKLQEKLEQL------EEELLAKKKLESERLSSAAKLKEEELELKNEEEKE--------A 408
Query: 715 TSNMENDKSNEDDNSSVQSTDEVKDTEEG 743
+E + ED + + E
Sbjct: 409 KLLLELSEQEEDLLKEEKKEELKIVEELE 437
Score = 31.9 bits (72), Expect = 2.1
Identities = 54/415 (13%), Positives = 136/415 (32%), Gaps = 35/415 (8%)
Query: 250 MKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEF 309
E ++++ + E L + L E + E + +Q + K
Sbjct: 628 GILKDTELTKLLESAKAKESGLRKGVSLEEGLAEKSELKASLSELTKELLAEQELQEKA- 686
Query: 310 ESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKM 369
ES + + +++ ++ + + + +L+ L + + + ++++K+
Sbjct: 687 ESELAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLEQKI 746
Query: 370 TNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDIEKVTKDL 429
+ E ++++ + + ++ + E E E ++ ++L
Sbjct: 747 KEKEEEE-EKSRLKKEEEEEEKSELSLKEKELAEE-EEKTEKLKVEEEKEEKLKAQEEEL 804
Query: 430 REQLHTAQ-EEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYE 488
R + E +E + +E+ E+ + L L+ +L+ ++ +
Sbjct: 805 RALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEI 864
Query: 489 NEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDI 548
++ EE+ KD + KE+EE S+K + +
Sbjct: 865 TKEELLQELLLKEEELEEQ----KLKDELESKEEKEKE-------EKKELEEESQKDNLL 913
Query: 549 NAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQS 608
+ + +E LL+ E L + E + +
Sbjct: 914 EEKENEIEE----------------RIAEEAIILLKYESEPEELLLEEADEKEKEEDNKE 957
Query: 609 QLNDALDNIKKYKEEHSKMT----MEIESLKEKLKFSQSKENDLLEEYEKLKLVI 659
+ + + KEE + E E +E+ + K+ L EE ++L I
Sbjct: 958 EEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREI 1012
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in
the lipid bilayer. The MscS proteins are heptamers of
three transmembrane subunits with seven converging M3
domains, and this MscS_porin is towards the N-terminal
of the molecules. The high concentration of negative
charges at the extracellular entrance of the pore helps
select the cations for efflux.
Length = 239
Score = 32.2 bits (74), Expect = 0.80
Identities = 26/144 (18%), Positives = 54/144 (37%), Gaps = 14/144 (9%)
Query: 209 VEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQ------ 262
++ LE L + DK + + + +L+E+ +++ A K
Sbjct: 20 IQDLEQALNFLDEI---DKSKQKADQYQKQIDDAPKELRELRQELEALKKTDAPVFPELA 76
Query: 263 --KCQDTERKLVNIENECSNLQELLDRNRDELEEI---IKKYQDQVTKNKEFESTIDQLL 317
E++L ++ LQE L + +L E+ ++ Q Q+++ + I L
Sbjct: 77 NLSLSQLEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQQQLSEARRRLQEIRNRL 136
Query: 318 NSNQEKSNTLEKLQYETLQAQKQL 341
+ L + Q LQA+
Sbjct: 137 QALSPGGTPLAQAQRTLLQAELAA 160
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 33.1 bits (77), Expect = 0.81
Identities = 18/75 (24%), Positives = 40/75 (53%), Gaps = 7/75 (9%)
Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVI 659
+ +A L+++ ++ + ++ EE ++ E+E LK KL + + DLL + +++ V
Sbjct: 704 KELAALLKAKPSELPERVEALLEELKELEKELEQLKAKL--AAAAAGDLLAQAKEVNGV- 760
Query: 660 KDYQNVLPSIVESMN 674
VL + VE ++
Sbjct: 761 ----KVLAAQVEGVD 771
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 32.7 bits (75), Expect = 0.83
Identities = 23/119 (19%), Positives = 59/119 (49%), Gaps = 6/119 (5%)
Query: 226 DKLRDEIKLLHRDKS---TYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQ 282
+ L + ++ L D L + K L ++ + +++ ++L + +C
Sbjct: 147 EGLDENLEGLKEDYKLLMKELELLNSIKPK-LRDRKDALEEELRQLKQLEDELEDCD--P 203
Query: 283 ELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQL 341
LDR +++L++++++ +V K +E E + +L + ++ +N +L E +A+K+L
Sbjct: 204 TELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKL 262
Score = 31.1 bits (71), Expect = 2.3
Identities = 32/138 (23%), Positives = 58/138 (42%), Gaps = 9/138 (6%)
Query: 512 NQKDVNIIDVSNQYNSEKDFVS-INKEIEESSRKIDDINAEITKC-TNELVDAKNCLVKN 569
+ ++ NS K + +EE R++ + E+ C EL AK L K
Sbjct: 157 KEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKL 216
Query: 570 EKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTM 629
+ ++ ++ LE+ + ++ + + +S+LN + +K E+ T
Sbjct: 217 LQEIMIKVKKLEELEEELQE------LESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTF 270
Query: 630 -EIESLKEKLKFSQSKEN 646
EIE LKE+LK QS
Sbjct: 271 KEIEKLKEQLKLLQSLTG 288
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 32.7 bits (75), Expect = 0.84
Identities = 17/77 (22%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 404 YFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYL 463
YF N++ +I ++++ ++L Q +E+ + + ++ + Q LEQ++
Sbjct: 160 YFGYLNQARQ---ETIAELKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNE 216
Query: 464 HLDKIGGLELQLEKLKQ 480
+ GLE L+K +Q
Sbjct: 217 RKKTLTGLESSLQKDQQ 233
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 32.8 bits (76), Expect = 0.84
Identities = 23/101 (22%), Positives = 47/101 (46%), Gaps = 10/101 (9%)
Query: 243 ENQLKEVMKKMLAE-KVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQD 301
+ + + + K+ L E K E+ + + E++L NE LQ+L R + EE + + +
Sbjct: 48 KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE---LQKLEKRLLQK-EENLDRKLE 103
Query: 302 QVTKNKEFESTIDQLLNSNQEKSNTLEKLQ--YETLQAQKQ 340
+ K E +++ ++K LEK + E L ++
Sbjct: 104 LLEKR---EEELEKKEKELEQKQQELEKKEEELEELIEEQL 141
Score = 32.1 bits (74), Expect = 1.5
Identities = 17/67 (25%), Positives = 35/67 (52%)
Query: 229 RDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRN 288
R+E++ L + E L ++ + + E+ +K ++ E+K +E + L+EL++
Sbjct: 81 RNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQ 140
Query: 289 RDELEEI 295
ELE I
Sbjct: 141 LQELERI 147
Score = 31.3 bits (72), Expect = 2.6
Identities = 21/112 (18%), Positives = 51/112 (45%), Gaps = 27/112 (24%)
Query: 201 QEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEV 260
E RLL + E L+ +L K +++L + K L + + E + +E+ ++++ E+++
Sbjct: 87 LEKRLLQKEENLDRKLELLEKR--EEELEKKEKELEQKQQELEKKEEEL-EELIEEQLQE 143
Query: 261 MQKCQDTERKLVNIENECSNL------QELLDRNRDELEE----IIKKYQDQ 302
++ S L + LL++ +E +IK+ +++
Sbjct: 144 LE--------------RISGLTAEEAKEILLEKVEEEARHEAAVLIKEIEEE 181
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 32.8 bits (75), Expect = 0.91
Identities = 34/221 (15%), Positives = 74/221 (33%), Gaps = 26/221 (11%)
Query: 242 YENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDR---------NRDEL 292
E+ LK ++K K + + R++ + E++ +L ++ N D +
Sbjct: 759 LEDLLKARLEKDYQSKYLDLIDNLNDAREIKDKESKAIDLDDIDFELELIGKQEINIDYI 818
Query: 293 EEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMH 352
E+++ + D+ + + I++++ E +KL ++A Q A
Sbjct: 819 LELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKK------L 872
Query: 353 DKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESE 412
D + + I K + E +++ L Y D L N +E
Sbjct: 873 DTAEKLEELY--ILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGK------LPENGTE 924
Query: 413 SSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQ 453
+ E + + EE + + R Q
Sbjct: 925 LVEKLAKEKSLREKNKDDWKAK---EEVEAKLKAFFRRFIQ 962
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 32.7 bits (74), Expect = 0.92
Identities = 32/118 (27%), Positives = 55/118 (46%), Gaps = 13/118 (11%)
Query: 664 NVLPSIVESMNNTKDKPQD-SE----QSIEEEITQQSLNPLSSELDETIHKNVESVTSNM 718
V S V S +++K P SE +S +E ++Q+S ++E ++ + S
Sbjct: 61 KVWKSAVSSSDDSKTVPTPVSEPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTD--SEE 118
Query: 719 ENDKSNEDDNSSVQSTDEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGDLSP 776
++D S E+DN S S KD + T V S + S T+ ++NT+ S+ L
Sbjct: 119 DDDDSEEEDNKSTSS----KDGKGSKKTQPGVSTS--SGSTTSGTDLNTKQSQTGLGA 170
>gnl|CDD|226655 COG4192, COG4192, Signal transduction histidine kinase regulating
phosphoglycerate transport system [Signal transduction
mechanisms].
Length = 673
Score = 32.6 bits (74), Expect = 1.0
Identities = 36/200 (18%), Positives = 84/200 (42%), Gaps = 11/200 (5%)
Query: 166 EALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTD 225
Q+ L+ L+ + ++ L + + + +QE R L + + ++ D
Sbjct: 82 TLNQQYQRLQQDLTAIHALLASLPQNEQAALQQGLQELRQLLQ----QINDLVSARIDVD 137
Query: 226 DKLR---DEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQ-DTERKLVNIENECSNL 281
+L ++I LH+D + L++ ++ ++ + + Q D ++++ L
Sbjct: 138 RRLALFAEQIDWLHQDFGMELSPLRQELQWQQETLLDGITQAQVDNGQRVLQYRLYQQEL 197
Query: 282 QELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQL 341
+E+ + R E +I + +D+V + ++ STI QL + + L +Q
Sbjct: 198 EEVYNVLRLE-GQIQQSLRDRVVETQKLNSTI-QLDQQHTAYLDLLANADALFIQV-LGQ 254
Query: 342 DAFTLKDNLMHDKLKNYYMN 361
A T+ + + D+L N
Sbjct: 255 TASTVTLHQLSDELDAIGHN 274
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.4 bits (74), Expect = 1.1
Identities = 34/188 (18%), Positives = 65/188 (34%), Gaps = 21/188 (11%)
Query: 155 EELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQ-EDRLLSRVEFLE 213
++ +L ++ L+ +L + + L+K L E + + L
Sbjct: 59 DQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLA 118
Query: 214 SQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVN 273
QLAA +S + L+ + + +L + + E + + T ++L
Sbjct: 119 EQLAALQRSGRNPPP---ALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAA 175
Query: 274 IENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYE 333
+ E + Q L E K + QLL +E+ TL +L E
Sbjct: 176 VRAEIAAEQAELTTLLSEQRAQQAK--------------LAQLL---EERKKTLAQLNSE 218
Query: 334 TLQAQKQL 341
QK+L
Sbjct: 219 LSADQKKL 226
Score = 32.0 bits (73), Expect = 1.6
Identities = 27/190 (14%), Positives = 58/190 (30%), Gaps = 31/190 (16%)
Query: 523 NQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSL 582
+K+ ++ K+I E + + ++ E+ + L++ + L++Q +
Sbjct: 39 QLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIAD 98
Query: 583 LEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQ 642
L L E + L QL + + +
Sbjct: 99 LNARLN-----ALEVQEREQRRRLAEQLAAL------------QRSGRNPPPALLVSPED 141
Query: 643 SKENDLLEEYEKLKLVIKDYQNVLPSI---VESMNNTKDKPQDSEQSIEEEITQQSLNPL 699
++ + L Y Y + P+ ++++ T + I E Q L L
Sbjct: 142 AQRSVRLAIY---------YGALNPARAERIDALKATLKQLAAVRAEIAAE--QAELTTL 190
Query: 700 SSELDETIHK 709
SE K
Sbjct: 191 LSEQRAQQAK 200
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 32.7 bits (74), Expect = 1.1
Identities = 43/193 (22%), Positives = 75/193 (38%), Gaps = 20/193 (10%)
Query: 622 EEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQ 681
E+ + E + L+E + L+E E +L D V V S N D +
Sbjct: 3825 EDITNTLNEDDDLEELANEEDTANQSDLDESEARELE-SDMNGVTKDSVVSENENSDS-E 3882
Query: 682 DSEQSIEEEITQ------QSLN------PLSSELDETIHKNVESVTSNMENDKSN-EDDN 728
+ Q ++EE+ SLN P +L ET K+ E +N E+D + EDDN
Sbjct: 3883 EENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDN 3942
Query: 729 SSVQSTDEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGDLSPIDPPESVIALIN 788
+++ D + +E + + E + + N + +D PE + L
Sbjct: 3943 KALEDKDRQEKEDEEEMSDDVGIDDE----IQPDIQENNSQPPPENEDLDLPED-LKLDE 3997
Query: 789 DDDTSDSSSEMTD 801
+ S++ D
Sbjct: 3998 KEGDVSKDSDLED 4010
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 32.7 bits (75), Expect = 1.1
Identities = 42/261 (16%), Positives = 95/261 (36%), Gaps = 35/261 (13%)
Query: 254 LAEKVEVMQ--------KCQDTERKLVNIENECSNLQELL-----------DRNRDELEE 294
L E V Q +C + ERKL +E++ L L + +LEE
Sbjct: 15 LNEDVSAFQRKFVNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEE 74
Query: 295 IIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQL-DAFTLKDNLMHD 353
I + ++ + +E ++++ +N +E N L++ + + ++L + L + +
Sbjct: 75 EILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDIDFKYL 134
Query: 354 KLKNYYMNFHII----KEKMTNIKNEL-----GLETKVDQNILVKY-FDDVHQILTQFDE 403
+ + +EK+ + EL G + + ++ V I E
Sbjct: 135 RGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKKTVFIIFFVGKE 194
Query: 404 YFLRFNESESSGLN-SIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKY 462
+ + + D+ + + E + ++E+ E+ + LE+
Sbjct: 195 DLDKV-KKILDSFGFELYDVPETEGERSELISKVN---KRIEELQRVLEQTESHLEKVLV 250
Query: 463 LHLDKIGGLELQLEKLKQTYN 483
D++ + Q+ K K Y
Sbjct: 251 KIADELLAWDEQVSKEKAVYE 271
Score = 30.4 bits (69), Expect = 4.8
Identities = 34/209 (16%), Positives = 80/209 (38%), Gaps = 38/209 (18%)
Query: 155 EELYQLNTCIQEALQRE--------NMLKNKLSKLQHIVIDLKKAA--DVSWKALIQEDR 204
E+ ++ E+L++E N+L + S L + +L + + D+ +K L +
Sbjct: 81 AEIKEVEE-NLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDIDFKYLRGAEG 139
Query: 205 L----------LSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTY---------ENQ 245
L ++E E +L + + E L K+ + ++
Sbjct: 140 LKLGFVAGVINREKLEAFERELWRACRGYIRQ-AEIEEPLEDPKKTVFIIFFVGKEDLDK 198
Query: 246 LKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTK 305
+K+++ E +V + + + + LQ +L++ LE+++ K D++
Sbjct: 199 VKKILDSFGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESHLEKVLVKIADEL-- 256
Query: 306 NKEFESTIDQLLNSNQEKSNTLEKLQYET 334
D+ ++ + TL Y+T
Sbjct: 257 -----LAWDEQVSKEKAVYETLNLFNYDT 280
>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
transferase subunit alpha; Provisional.
Length = 762
Score = 32.5 bits (74), Expect = 1.4
Identities = 50/226 (22%), Positives = 94/226 (41%), Gaps = 50/226 (22%)
Query: 174 LKNKLSKLQHIVI------------DLKKAADVSWKALIQEDRLLSRVEFLESQLA---A 218
LK KL L LK + +K ++ + ++E L++++A A
Sbjct: 533 LKYKLDMLNEFSRAKALSEKKSKAEKLKAEINKKFKEVMDRPEIKEKMEALKAEVASSGA 592
Query: 219 YSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENEC 278
S DD L+++++ + K E +L V+K M E + V +K +DT +
Sbjct: 593 SSGDELDDDLKEKVE---KMKKEIELELAGVLKSMGLEVIGVTKKNKDTAEQTPP----- 644
Query: 279 SNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQ 338
L+E I+ N+E I++++ S+ K + +E L+ E +A
Sbjct: 645 -----------PNLQEKIESL------NEEINKKIERVIRSSDLK-SKIELLKLEVAKAS 686
Query: 339 KQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQ 384
K D T K+ + + L+ IK+K+ N L+ K ++
Sbjct: 687 KTPDV-TEKEKI--EALEQQ------IKQKIAEALNSSELKEKFEE 723
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 31.3 bits (71), Expect = 1.4
Identities = 15/41 (36%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 419 IEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQ 459
E++EK + LR++L T +E+Y +L +I++R +LA +E
Sbjct: 120 NEELEKELEKLRQRLSTIEEDYQTLIDIMDR-ARKLAVVED 159
>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904). This
family consists of several bacterial and archaeal
hypothetical proteins of unknown function.
Length = 72
Score = 29.2 bits (66), Expect = 1.6
Identities = 19/71 (26%), Positives = 37/71 (52%), Gaps = 7/71 (9%)
Query: 606 LQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNV 665
L+S++ A++ I + MEIE LKE+ + + +L EE +KLK +Q+
Sbjct: 9 LESKIQQAVETIALLQ-------MEIEELKEENEQLSEENEELEEENKKLKEERNAWQSR 61
Query: 666 LPSIVESMNNT 676
L +++ ++
Sbjct: 62 LRALLGKLDEV 72
>gnl|CDD|185283 PRK15385, PRK15385, magnesium transport protein MgtC; Provisional.
Length = 225
Score = 31.4 bits (71), Expect = 1.7
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 297 KKYQDQVTKNKEFESTIDQ-LLNSNQEKSNTLEKLQYETLQAQKQLD 342
K+Y +VT NKE ES + Q LLN +E + L+ L Q Q +
Sbjct: 141 KRYILKVTCNKEDESAVRQWLLNIVKEAAICLQGLGSVPAQEQGYKE 187
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 32.0 bits (73), Expect = 1.8
Identities = 16/85 (18%), Positives = 37/85 (43%), Gaps = 5/85 (5%)
Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKE-NDLLEEYEKLKLV 658
+ + L+ N+ I+++ EE + E+ESLK K+ + E D E +K++
Sbjct: 711 KELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDKAETINGVKVL 770
Query: 659 IKD----YQNVLPSIVESMNNTKDK 679
++ L ++ + + +
Sbjct: 771 VEVVDAKDMKSLKTMADRLKSKLGS 795
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 31.8 bits (73), Expect = 2.0
Identities = 44/187 (23%), Positives = 78/187 (41%), Gaps = 50/187 (26%)
Query: 272 VNIENECSNLQELLDRNRDELEEI-IKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKL 330
++IE E +L+E +D+N LEE+ + + ++ +N+E E ID L
Sbjct: 248 LDIEKELQDLKEQIDQNLALLEELDLDEAEE---ENEEIEERIDTL-------------- 290
Query: 331 QYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETK-VDQNILVK 389
Y+ L+ + + F K+ DKL ++ E +L LE + Q+
Sbjct: 291 -YDILEKEVKAKKFVEKN---IDKLTDFL-------EHAREQNKQLLLELDRLQQS---- 335
Query: 390 YFDDVHQILTQFDEYFLRFNESESS-GLNS-IEDIEKVTKDLREQLHTAQEEYTSLQEIV 447
Y L +E E+ L +E++E L E++ + Y+ LQE +
Sbjct: 336 --------------YTLNEDELETVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEEL 381
Query: 448 NRNEEQL 454
E+QL
Sbjct: 382 EEIEKQL 388
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 31.5 bits (72), Expect = 2.2
Identities = 36/200 (18%), Positives = 72/200 (36%), Gaps = 27/200 (13%)
Query: 387 LVKYFDDVHQ----ILTQFDEYFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQEEYTS 442
L +FD+ +Q L + + + E L SI D EK + + + + +
Sbjct: 272 LEAHFDEEYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEA 331
Query: 443 LQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLL 502
L+EI+ +N ++L +K+ +E S E + L
Sbjct: 332 LEEILEKNLQKLE----------EKLKDPSTSIE-----LESITDLIESINDIIDAINEL 376
Query: 503 INEEKTVAPNQKDVNIIDVSNQYNS--EKDFV-SINKEIEESSRKIDDINAEITKCTNEL 559
I E N+K N+ N+ V + ++I+ ++ + I E+
Sbjct: 377 IREH-----NEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEI 431
Query: 560 VDAKNCLVKNEKWVINLQEQ 579
+ + EK + L++Q
Sbjct: 432 KQLEAEIKALEKEIKELEKQ 451
Score = 30.7 bits (70), Expect = 4.4
Identities = 29/201 (14%), Positives = 84/201 (41%), Gaps = 28/201 (13%)
Query: 474 QLEKLKQTYNSDC-YENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFV 532
QLE+L Y S E+ + + + V + + ++ + N +K
Sbjct: 286 QLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQK--- 342
Query: 533 SINKEIEE--SSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETW 590
+ +++++ +S +++ I I + + + ++ + + NL+++ + +K
Sbjct: 343 -LEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLH 401
Query: 591 VSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLE 650
+ + + ++I Y++E + I SL++++K +++ L +
Sbjct: 402 L------------------VAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEK 443
Query: 651 EYEKLKLVIKDYQNVLPSIVE 671
E ++L+ + N+ P+ E
Sbjct: 444 EIKELEKQLT---NIEPTADE 461
>gnl|CDD|234953 PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed.
Length = 476
Score = 31.6 bits (73), Expect = 2.3
Identities = 30/155 (19%), Positives = 55/155 (35%), Gaps = 49/155 (31%)
Query: 149 ISSMPLEELYQLNTCIQEALQRENMLKNKLSKLQHIVIDLKKAADVSWKALIQEDRLLSR 208
I + EEL +L + L++E + L+ +V L++E R
Sbjct: 325 IRELDPEELAEL--LLPWHLEQEGLDTEDGPYLEKVV------------PLLKE-----R 365
Query: 209 VEFLESQLAAYSKS-FTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDT 267
+ L +LA ++ F D DE + LKEV +++L +E
Sbjct: 366 AKTL-KELAELARFFFEDFPEYDEDAA--------KKHLKEVAREVLEALLE-------- 408
Query: 268 ERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQ 302
L+ L + + +E IK ++
Sbjct: 409 ------------KLEALEEWTAEAIEAAIKAVAEE 431
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 31.5 bits (72), Expect = 2.4
Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 3/68 (4%)
Query: 229 RDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLV-NIENECSNLQELLDR 287
D + + D +YE Q K+ K+++++K + +E E L+ +DR
Sbjct: 3 LDALDEILGDLESYEPQEYL--NKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDR 60
Query: 288 NRDELEEI 295
R+E+E +
Sbjct: 61 LREEIERL 68
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 31.3 bits (71), Expect = 2.5
Identities = 28/175 (16%), Positives = 67/175 (38%), Gaps = 22/175 (12%)
Query: 671 ESMNNTKDKPQDSEQSIEEEITQ-QSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNS 729
+ + + ++ + +++ + S S+ + + I+KN+ N K+ DDN
Sbjct: 58 KDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNY 117
Query: 730 SVQST--------DEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGDLSPIDPPE 781
S+ + ++ D E+ ++ + S + + +T+SS+ D +
Sbjct: 118 SLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKA------ 171
Query: 782 SVIALINDDDTSDSSSEMTDSSTLIRIRKPKPVSVPTDASLVKPQSTTGCQTSDE 836
D+ + SS+ S++ + PKP S T ++S +
Sbjct: 172 -------DNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSK 219
Score = 29.4 bits (66), Expect = 9.9
Identities = 32/245 (13%), Positives = 78/245 (31%), Gaps = 16/245 (6%)
Query: 512 NQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNE----LVDAKNCLV 567
N + +S + + + +IN +TK + L L
Sbjct: 68 TSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLF 127
Query: 568 KNEKWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKM 627
+ + ++ + + + ++ + + + QS D DN K ++K
Sbjct: 128 NLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDT---QSSKQDKADNQKAPSSNNTKP 184
Query: 628 TMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQN------VLPSIV-ESMNNTKDKP 680
+ + ++ N + N L SI+ + + K
Sbjct: 185 STSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQ 244
Query: 681 QDSEQSIE--EEITQQSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQSTDEVK 738
+D + + T + NP DE HK+ + + + ++SN S ++ +
Sbjct: 245 KDYASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLS 304
Query: 739 DTEEG 743
+ ++
Sbjct: 305 NNDDS 309
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 30.7 bits (70), Expect = 2.6
Identities = 24/141 (17%), Positives = 54/141 (38%), Gaps = 21/141 (14%)
Query: 228 LRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQ----------DTERKLVNIENE 277
L++ KL + K +KEV++ ++ + + +K KL +
Sbjct: 14 LKELEKLAPKVKGISSMSVKEVLQSLVDDGLVRTEKIGTSNYYWSFPSQALNKL---KTR 70
Query: 278 CSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQA 337
L++ L+ + + E+ + + +E +LL ++ L+KL+
Sbjct: 71 LEKLKKELEELKQRIAELQAQIEKLKKGREE-TEERTELLEELKQLEKELKKLK------ 123
Query: 338 QKQLDAFTLKDNLMHDKLKNY 358
+L+ + D +KLK
Sbjct: 124 -AELEKYEKNDPERIEKLKEE 143
>gnl|CDD|144876 pfam01442, Apolipoprotein, Apolipoprotein A1/A4/E domain. These
proteins contain several 22 residue repeats which form a
pair of alpha helices. This family includes:
Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein
E.
Length = 191
Score = 30.7 bits (70), Expect = 2.7
Identities = 28/106 (26%), Positives = 52/106 (49%), Gaps = 17/106 (16%)
Query: 609 QLNDALDNIKKYKEEHSKMTMEIESLKEKL-KFSQSKENDLLEEYEKLKLVI-KDYQNVL 666
L+D+LD + Y EE L+E+L ++Q L +E E L+ + KD ++V
Sbjct: 2 LLSDSLDELNSYAEE----------LQEQLGPYAQEFWARLSKETEALREELQKDLEDVR 51
Query: 667 PSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVE 712
+ ++ K Q++EE +Q L P + EL + ++++ E
Sbjct: 52 ARLQPYLDELK---AKVGQNLEE--LRQRLAPYAEELRKRLNRDAE 92
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 31.4 bits (71), Expect = 3.1
Identities = 59/278 (21%), Positives = 113/278 (40%), Gaps = 73/278 (26%)
Query: 147 MLISSMPLEELYQLNTCI--QEALQRE-NMLKNKLSKLQHIVIDLKKAADVSWKALIQED 203
+L++ L+ L L + +EALQ + N+L+ +LS+ +K AA I E+
Sbjct: 145 LLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDA---RIKLAAQEKIHVEILEE 201
Query: 204 RL-------LSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKEVMKKMLAE 256
+L L R + + SK L++E LL D + +L EV
Sbjct: 202 QLEKLRNELLIRGATEGLCVHSLSKELDV--LKEENMLLKDDIQFLKAELIEV------- 252
Query: 257 KVEVMQKCQDTERKLVNIENECS------------------------------------N 280
+TE ++ +E E S N
Sbjct: 253 --------AETEERVFKLEKERSLLDASLRELESKFIVAQEDVSKLSPLQYDCWWEKVEN 304
Query: 281 LQELLDRNRDELEE---IIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQA 337
LQ+LLDR +++E+ ++ + QD K + E+++ + N ++ S +E LQ +
Sbjct: 305 LQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKE-ANVSKFSSYKVELLQQKLKLL 363
Query: 338 QKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNE 375
+++L A D+ +H ++ Y + ++ ++ +K E
Sbjct: 364 EERLQA---SDHEIHSYIQLYQESIKEFQDTLSKLKEE 398
Score = 30.6 bits (69), Expect = 4.8
Identities = 59/328 (17%), Positives = 139/328 (42%), Gaps = 26/328 (7%)
Query: 363 HIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSGLNSIEDI 422
E + I NE +K + + +D+ ++ ++ L N++ L ++ED+
Sbjct: 102 MQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAEKNILLLNQAR---LQALEDL 158
Query: 423 EKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKIGGLELQLEKLKQTY 482
EK+ + +E L Q + L+ ++ + ++ Q K +H++ LE QLEKL+
Sbjct: 159 EKILTE-KEAL---QGKINILEMRLSETDARIKLAAQEK-IHVEI---LEEQLEKLRNEL 210
Query: 483 NSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSEKDFVSINKEIEESS 542
+ + + ++ EE + + ++ +E+ + KE
Sbjct: 211 LIRGATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAELIEVAETEERVFKLEKERSLLD 270
Query: 543 RKIDDINAEITKCTNELVDAKNCLVKNEKW---VINLQEQFSLLEKSKETWVSLKLNSAE 599
+ ++ ++ ++ +K ++ + W V NLQ+ LL+++ L +
Sbjct: 271 ASLRELESKFIVAQEDV--SKLSPLQYDCWWEKVENLQD---LLDRATNQVEKAALVLDQ 325
Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTM---EIESLKEKLKFSQSKENDLLEEYEKLK 656
+Q +L +L K K+ + +++ L+E+L+ S + + ++ Y++
Sbjct: 326 NQDLRDKVDKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQES- 384
Query: 657 LVIKDYQNVLPSIV-ESMNNTKDKPQDS 683
IK++Q+ L + ES + + P D
Sbjct: 385 --IKEFQDTLSKLKEESKKRSLEHPADD 410
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 29.2 bits (66), Expect = 3.3
Identities = 20/69 (28%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 208 RVEFLESQLAAYSKSFTDDKLRDE----IKLLHRDKSTYENQLKEVMKKMLAEKVEVMQ- 262
R+ LE L TD LR E ++ ++ + E +LKE K A+K+E Q
Sbjct: 44 RIAGLEKALKEVQAHCTDAGLRAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQR 103
Query: 263 KCQDTERKL 271
K + + +L
Sbjct: 104 KLAEAQAEL 112
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 31.0 bits (70), Expect = 3.4
Identities = 79/472 (16%), Positives = 170/472 (36%), Gaps = 49/472 (10%)
Query: 283 ELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLD 342
L+RN D+L+++I + +++ E + + + + K+++
Sbjct: 162 NSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIE 221
Query: 343 AFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFD 402
+++ N D N + + + ++KN E K ++ L + + +
Sbjct: 222 RLSIEYNNAMDDYNNLKSALNELSS-LEDMKNRYESEIKTAESDLSMELEKNNYYKELEE 280
Query: 403 EYFLRFNESESSGLNSIEDIEKVTKDL---REQLHTAQEEYTSLQEIVNRNEEQLAFLEQ 459
+ N+ N I D K D+ ++ L E I+ +
Sbjct: 281 RHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLSVLQKDYND 340
Query: 460 --SKYLHLDKIGGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVN 517
K D + L+LE + YNS E K K N + ++
Sbjct: 341 YIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIE------EYSK----NIERMS 390
Query: 518 IIDVSNQYNSEKDFVSINKEIEESSRKIDDINA-----------------EITKCTNELV 560
E D +I KE+ E + K+ DI++ E+++ L
Sbjct: 391 AFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLN 450
Query: 561 DAKNCLV--------KNEKWVINLQEQFSLLE-KSKETWVSLKLNSAESQSSASLQSQLN 611
C V K+ + + E+ S LE K +E + +K + + L
Sbjct: 451 GQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLE 510
Query: 612 DALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVE 671
+ I K E++K+ ES + L+ + K N+L ++++K + + Y+++ ++
Sbjct: 511 S--EEINKSINEYNKI----ESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLD 564
Query: 672 SMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTSNMENDKS 723
S + I+ E + N + +L++ + ++ + +DKS
Sbjct: 565 SKRTSWLNALAVISLIDIETNRSRSNEIKKQLND-LESRLQEIEIGFPDDKS 615
>gnl|CDD|227612 COG5293, COG5293, Predicted ATPase [General function prediction
only].
Length = 591
Score = 31.0 bits (70), Expect = 3.4
Identities = 51/335 (15%), Positives = 104/335 (31%), Gaps = 62/335 (18%)
Query: 180 KLQHIVIDLKKAADVSWKALIQEDRLLSRVEFLESQLAAYSKSFTDDKLRDEIKLLHRDK 239
+L H +I + D +++ L+ + +SK F L
Sbjct: 131 QLLHDLIGWRMKKDWTFRKLLGY--------LIRDSQENFSKVF----------ELSHSV 172
Query: 240 STYENQLKEV-MKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELEEIIK- 297
+ + +L E + + D ++L + + L++ + D LEEI
Sbjct: 173 LKHAKWKNYPFLLHLLGEDGKCAAEYYDKIQELESKKKLAELLRKTWIGSLDSLEEIETT 232
Query: 298 --KYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHDKL 355
+ QD+V K + +T D E + + +++ + M
Sbjct: 233 ELRKQDEVNKKQATLNTFDFHAQDYAETEELVNTVD-------ERIAELNNRRISMQSHW 285
Query: 356 KNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDEYFLRFNESESSG 415
K + + + + V + V + + E
Sbjct: 286 KRVKTSLKEQILF------CPDEIQVLYEEVGVLFPGQVKKDFEHVIAFNRAITEERHDY 339
Query: 416 LNSIEDIEKVTKDLRE-------------------QLHTAQEEYTSLQEIVNRNEEQLAF 456
L E+I ++ DL+E + E+Y +L E + +LA
Sbjct: 340 LQ--EEIAEIEGDLKEVNAELDDLGKRRAEGLAFLKNRGVFEKYQTLCEEIIALRGELAE 397
Query: 457 LEQ-----SKYLHL-DKIGGLELQLEKLKQTYNSD 485
LE K L IG L+ + L++ ++
Sbjct: 398 LEYRIEPLRKLHALDQYIGTLKHECLDLEERIYTE 432
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins
in this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This
family is related to many other flap endonuclease
families including the fen1 family. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1034
Score = 31.0 bits (70), Expect = 3.5
Identities = 18/79 (22%), Positives = 38/79 (48%), Gaps = 7/79 (8%)
Query: 265 QDTERKLVNIENECSNLQELLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKS 324
++ E K+ +IE E E L + E + I+ +++ K+ + ++ + + E+
Sbjct: 686 ENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEE----KDADDFKNEWQDISLEEL 741
Query: 325 NTLE---KLQYETLQAQKQ 340
LE + +L+AQKQ
Sbjct: 742 EALEANLLAEQNSLKAQKQ 760
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 29.1 bits (66), Expect = 3.8
Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 4/63 (6%)
Query: 609 QLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPS 668
+ A ++ K+ + EI+ LK +L+ +++ L E+ E+ + Y+ L S
Sbjct: 61 KRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLEEYQP----YEEFLES 116
Query: 669 IVE 671
+VE
Sbjct: 117 VVE 119
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 31.1 bits (70), Expect = 3.8
Identities = 32/150 (21%), Positives = 70/150 (46%), Gaps = 12/150 (8%)
Query: 536 KEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKE-TWVSLK 594
+E+E + ++++ E++KC ++L ++KN L + EK + L+ + ++S LK
Sbjct: 594 EELETLKSEKENLDGELSKCKDDLEESKNKLQETEKKLEELKSELDASQESNSLAETQLK 653
Query: 595 LNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEK 654
+S SL+ + D + EH + +I SL+++L+ + +L+ + +
Sbjct: 654 ---CMVESYESLELRAKDL-------EAEHKSLQEKISSLEDELEKERQNHQELIAKCRE 703
Query: 655 LKLVIKDYQNVLPSIVESMNNTKDKPQDSE 684
L+ I+ + K K Q+ E
Sbjct: 704 LEEKIERAEQEENMQKLDEEEQKIK-QEKE 732
Score = 30.7 bits (69), Expect = 3.9
Identities = 57/302 (18%), Positives = 106/302 (35%), Gaps = 34/302 (11%)
Query: 473 LQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIID-VSNQYNSEKDF 531
L++EKL + K D+ + + + + D + S
Sbjct: 386 LEMEKLAMLPPESSADGSKVSSPKCSDSSVALTGPVEHEDNLESKLSDRLPEVLQSVLKL 445
Query: 532 VSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKSKETWV 591
V I E S ++DI AE+ + ++E +++E+S+E
Sbjct: 446 VMEKHIISEISEILEDIEAELDSLERSSNGDDENEEVAMVGSL-VEESSAVIERSQELEG 504
Query: 592 SLKLNSAESQSSASLQSQLNDALDNIKKYKE--------------------EHSKMTMEI 631
++ S + + D LD ++ + S EI
Sbjct: 505 AVSRISEFTSVLEHEVTVCQDLLDGKADLEKFIQEFSLTLEWVVNQEKSLQDVSVEASEI 564
Query: 632 ESLKEKLKFSQSKENDL-----------LEEYEKLKLVIKDYQNVLPSIVESMNNTKDKP 680
+ LK S+ + N LEE E LK ++ L + + +K+K
Sbjct: 565 KKNFLGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKENLDGELSKCKDDLEESKNKL 624
Query: 681 QDSEQSIEEEITQQSLNPLSSELDET-IHKNVESVTSNMENDKSNEDDNSSVQSTDEVKD 739
Q++E+ +EE ++ + S+ L ET + VES S K E ++ S+Q +
Sbjct: 625 QETEKKLEELKSELDASQESNSLAETQLKCMVESYESLELRAKDLEAEHKSLQEKISSLE 684
Query: 740 TE 741
E
Sbjct: 685 DE 686
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 30.9 bits (70), Expect = 3.8
Identities = 48/271 (17%), Positives = 87/271 (32%), Gaps = 28/271 (10%)
Query: 409 NESESSGLNSIEDIEKVTKDLREQLHTAQEEYTSLQEIVNRNEEQLAFLEQSKYLHLDKI 468
SS +N E+I + E L E I N E + +I
Sbjct: 411 KNKNSSFINKTENILTNSPLKDELLEKTTEIIN----IENPQEFEFG-----------QI 455
Query: 469 GGLELQLEKLKQTYNSDCYENEKFGKAYNCDTLLINEEKTVAPNQKDVNIIDVSNQYNSE 528
G + E + N + + +F N N N+ D NI + +
Sbjct: 456 GNDIISTEIAQLDENQNLIDTGEFDLENN----FSNSFNPENGNKIDENINETFDTSTIS 511
Query: 529 KDFVSINKEIEESSRKIDD--INAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEKS 586
+ +S D IN+EI + N KN K ++ S+ E
Sbjct: 512 ANLSENKTNFAQSFNNKDTNLINSEIPIDLIKDTITINNSQKNVK-KNGNKDYLSVEEVI 570
Query: 587 KETWVSLKLNSAESQSSASLQSQLNDALDNIKK----YKEEHSKMTMEIESLKEKLKFSQ 642
+++K +S L N L + + H ++I + +
Sbjct: 571 NLIMLAIKFHSQNQVEYKKLVQNWNKNLPLFEYDVEFMEIAHFLKDLKILASSDNFILFS 630
Query: 643 SKENDLLEEYEKLKLVIKDYQNVLPSIVESM 673
SK +++ E +KL +Y+ + +E +
Sbjct: 631 SKRDEIDELI--IKLNKNNYKVNFQNFLEKI 659
Score = 29.8 bits (67), Expect = 8.0
Identities = 40/268 (14%), Positives = 91/268 (33%), Gaps = 29/268 (10%)
Query: 511 PNQKDVNIIDVSNQYNSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNE 570
KD+ I D N + I K + S + +D N EI E N +K +
Sbjct: 330 DLLKDLKISDNPNDTLE----ILIIKLLALSELEEEDEN-EIKFKKIEENSIDNLDIKEK 384
Query: 571 KWVINLQEQFSLLEKSKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTME 630
K + K+ E + +++ + ++ E +
Sbjct: 385 KIENENDIEGKSDTKNLEE----------GFETKDNKNKNSSFINKT----ENILTNSPL 430
Query: 631 IESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEE 690
+ L EK + EN E+ ++ +++ + + ++ ++ E +E
Sbjct: 431 KDELLEKTTEIINIENPQEFEFGQIG------NDIISTEIAQLDENQNLIDTGEFDLENN 484
Query: 691 ITQQSLNPLSSELDETIHKNVESVTSNMENDKSNEDDNSSVQSTDEVKDTEEGPHTPSLV 750
+ +++DE I++ + TS + + S + ++ + KDT +
Sbjct: 485 FSNSFNPENGNKIDENINETFD--TSTISANLSE--NKTNFAQSFNNKDTNLINSEIPID 540
Query: 751 DKSEDTISVTTLVNVNTESSEGDLSPID 778
+ + NV ++ LS +
Sbjct: 541 LIKDTITINNSQKNVKKNGNKDYLSVEE 568
>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
Length = 880
Score = 30.8 bits (71), Expect = 4.1
Identities = 28/111 (25%), Positives = 52/111 (46%), Gaps = 24/111 (21%)
Query: 242 YEN--QLKEVMKKMLAEKVEVMQKCQDTERKL------VNIENECSNLQELLDRNRDELE 293
YEN ++K K+ L E E RKL V +E + +L EL +R++L
Sbjct: 214 YENLDEIKGKKKEKLRENKEQAFLS----RKLATIKTDVPLEVDLEDL-ELQPPDREKLI 268
Query: 294 EIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAF 344
+ K+ EF+S + + ++ L++ YET+ +++L+A+
Sbjct: 269 ALFKEL--------EFKSLLRRA---AAAEAAPLDEEDYETILDEEELEAW 308
>gnl|CDD|203570 pfam07058, Myosin_HC-like, Myosin II heavy chain-like. This family
represents a conserved region within a number of myosin
II heavy chain-like proteins that seem to be specific to
Arabidopsis thaliana.
Length = 351
Score = 30.4 bits (68), Expect = 4.3
Identities = 34/162 (20%), Positives = 65/162 (40%), Gaps = 16/162 (9%)
Query: 611 NDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVIKDYQNVL--PS 668
ND + +K++ EE + E++ L++KL S + EK KL +K + L P+
Sbjct: 104 NDKVMPVKQWLEERRFLQGEMQQLRDKLAISDRAAKSEAQLKEKFKLRLKVLEEGLKGPN 163
Query: 669 IVESMNNTKDKPQDSEQSIEEEITQQSLNPL------------SSELDETIHKNVESVTS 716
T + + + + + + +P SS+L ++ + +V
Sbjct: 164 SSFVRPTTVGRSESNGPTRRQSLGGAETSPKFTSNGGLSKKRPSSQLRGSLTGRISTVLK 223
Query: 717 NMENDKSNEDDNSSVQSTDEVKDTEEGPHTPSLVDKSEDTIS 758
+ + + D + +S D K GP L DK E+ S
Sbjct: 224 HAKGTSISFDGGT--RSMDRSKILANGPSNFPLNDKHEEGTS 263
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 30.7 bits (69), Expect = 4.4
Identities = 25/143 (17%), Positives = 45/143 (31%), Gaps = 21/143 (14%)
Query: 680 PQDSEQSIEEEITQQSLNPLSSELDETIHKNVESVTSNME-NDKSNEDDNSSV------- 731
Q Q + + ++ E + K E+ + + + + +
Sbjct: 509 RQRWRQKLRNNKEMEDDIEMNDLGRERLQKEKEAHAATFDLSTTTTTQLGTENGRGGGLE 568
Query: 732 --QSTDEVKDTEEGPHTPSLVDKSEDTISVTTLVNVNTESSEGDLSPIDPPESVIALIND 789
T+ + +E L DK + V T +G S ID D
Sbjct: 569 ERSKTNIKERSENVNTIYGL-DKLARDTENRDVTYVPTSRYDGIESEID----------D 617
Query: 790 DDTSDSSSEMTDSSTLIRIRKPK 812
T ++ SE SS RI+K +
Sbjct: 618 VYTYENDSESIASSERRRIKKLR 640
>gnl|CDD|234204 TIGR03423, pbp2_mrdA, penicillin-binding protein 2. Members of
this protein family are penicillin-binding protein 2
(PBP-2), a protein whose gene (designated pbpA or mrdA)
generally is found next to the gene for RodA, a protein
required for rod (bacillus) shape in many bacteria.
PBP-2 acts as a transpeptidase for cell elongation
(hence, rod-shape) [Cell envelope, Biosynthesis and
degradation of murein sacculus and peptidoglycan].
Length = 592
Score = 30.6 bits (70), Expect = 5.1
Identities = 14/35 (40%), Positives = 16/35 (45%), Gaps = 3/35 (8%)
Query: 447 VNR-NEEQLAFLEQSKYLHLDKIG--GLELQLEKL 478
V NEE L LE + Y D IG G+E E
Sbjct: 161 VGEINEEDLERLEPANYAGGDYIGKSGIEKYYEDE 195
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 30.7 bits (70), Expect = 5.3
Identities = 43/218 (19%), Positives = 81/218 (37%), Gaps = 54/218 (24%)
Query: 270 KLVNIENECSNLQELLDR----NRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSN 325
+L + S L LL R + L + +++ ++Q+ + +E + + Q N
Sbjct: 866 QLEQAKEGLSALNRLLPRLNLLADETLADRVEEIREQLDEAEEAKRFVQQ-------HGN 918
Query: 326 TLEKL--QYETLQAQKQLDAFTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVD 383
L +L LQ+ D ++LK Y ++ + K + T+V
Sbjct: 919 ALAQLEPIVSVLQS----------DPEQFEQLKQDYQ---QAQQTQRDAKQQAFALTEVV 965
Query: 384 QNILVKYFDDVHQILTQFDEYFLRFNESESSGLNS-----IEDIEKVTKDLREQLHTAQE 438
Q ++D ++L +++S LN +E E+ REQL AQ
Sbjct: 966 QRRAHFSYEDAAEML------------AKNSDLNEKLRQRLEQAEQERTRAREQLRQAQA 1013
Query: 439 EYT-------SLQEIVNRNEEQLAFLEQSKYLHLDKIG 469
+ SL+ + + L L+Q L +G
Sbjct: 1014 QLAQYNQVLASLKSSYDAKRQMLQELKQE----LQDLG 1047
>gnl|CDD|220625 pfam10191, COG7, Golgi complex component 7 (COG7). COG7 is a
component of the conserved oligomeric Golgi complex
which is required for normal Golgi morphology and
localisation. Mutation in COG7 causes a congenital
disorder of glycosylation.
Length = 768
Score = 30.5 bits (69), Expect = 5.4
Identities = 29/130 (22%), Positives = 54/130 (41%), Gaps = 16/130 (12%)
Query: 583 LEKSKETWVSL--KLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKF 640
LEK SL KL Q + +L+ AL N+ + + + E LKE++
Sbjct: 29 LEKGDAHASSLVMKLQLYVQQVNYALEESSGQALQNMPRVLRDVEALRQEASFLKEQM-- 86
Query: 641 SQSKENDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLS 700
+ ++ +K + VL E ++ K + Q +++S++E LS
Sbjct: 87 -----ILVKQDIKKFEQDTGQSMQVL----ERLDTVKQRMQAAKESLQE---ADGWGTLS 134
Query: 701 SELDETIHKN 710
+EL++T
Sbjct: 135 AELEDTFETQ 144
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 29.9 bits (68), Expect = 6.0
Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 7/91 (7%)
Query: 170 REN--MLKNKLSKLQHIVIDLKK--AADVSWKALIQE-DRLLSRVEFLESQLAAYSKSFT 224
REN ++ KL K +D+ K D + L++E + L + L ++ K
Sbjct: 8 RENPDAVREKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGE 67
Query: 225 DD--KLRDEIKLLHRDKSTYENQLKEVMKKM 253
DD +L E+K L E L E+ ++
Sbjct: 68 DDAEELIAEVKELKEKLKELEAALDELEAEL 98
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 29.6 bits (67), Expect = 6.4
Identities = 40/215 (18%), Positives = 88/215 (40%), Gaps = 13/215 (6%)
Query: 227 KLRDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLD 286
+LR++ + L ++ +++E+ K+ E +Q+ + R+L NE + +
Sbjct: 52 ELREKAQELREERDEINEEVQEL-KEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIK 110
Query: 287 RNRDELEEIIKKYQDQVT---KNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDA 343
E+E + KK Q V + +E I + L E + + + + + ++D
Sbjct: 111 SLEREIERLEKKQQTSVLTPEEERELVQKIKE-LRKELEDAKKALEENEKLKELKAEIDE 169
Query: 344 FTLKDNLMHDKLKNYYMNFHIIKEKMTNIKNELGLETKVDQNILVKYFDDVHQILTQFDE 403
K +H+K++ E+M + E + L K D++H+ + +
Sbjct: 170 LKKKAREIHEKIQELANEAQEYHEEMIKLFEEA--------DELRKEADELHEEFVELSK 221
Query: 404 YFLRFNESESSGLNSIEDIEKVTKDLREQLHTAQE 438
+E + N + ++EK K LR + A+
Sbjct: 222 KIDELHEEFRNLQNELRELEKKIKALRAKEKAAKR 256
>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
subunit A'' fusion protein; Provisional.
Length = 509
Score = 29.8 bits (67), Expect = 6.9
Identities = 27/176 (15%), Positives = 59/176 (33%), Gaps = 10/176 (5%)
Query: 526 NSEKDFVSINKEIEESSRKIDDINAEITKCTNELVDAKNCLVKNEKWVINLQEQFSLLEK 585
N ++++ K+ I I N ++ K K V+ ++ + +
Sbjct: 32 NITDGYITLKKKEFTYHVIISKIAPYSNS--NGIIKKKK---PVLKTVLEIESEEKIEAI 86
Query: 586 SKETWVSLKLNSAESQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKE 645
+ L + S S L + + + + K + S
Sbjct: 87 DLMEFKRLFGRILDENMSFSTGELLTAEEKEYYEENSNEDVLKVIDDVKKLGFRLPPSVI 146
Query: 646 NDLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQD-----SEQSIEEEITQQSL 696
++ + +K +L +Y+ +L I E + P + + QSI E TQ ++
Sbjct: 147 EEIAKAMKKKELSDDEYEEILRRIREEYERARVDPYEAVGIVAAQSIGEPGTQMTM 202
>gnl|CDD|204415 pfam10212, TTKRSYEDQ, Predicted coiled-coil domain-containing
protein. This is the C-terminal 500 amino acids of a
family of proteins with a predicted coiled-coil domain
conserved from nematodes to humans. It carries a
characteristic TTKRSYEDQ sequence-motif. The function is
not known.
Length = 518
Score = 29.9 bits (67), Expect = 6.9
Identities = 27/114 (23%), Positives = 49/114 (42%), Gaps = 5/114 (4%)
Query: 600 SQSSASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDLLEEYEKLKLVI 659
++S L Q+ +L+ I K ++E +E + K KL+ + DL + K +L
Sbjct: 301 TESREGLAQQVQQSLEKISKLEQEKEHWMLEAQLAKIKLEKENQRIADLEKGTLKGQLAE 360
Query: 660 KDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDETIHKNVES 713
K ++N + + + S +S E SL + + DET + ES
Sbjct: 361 KLHENATLT-----QAHEQPEEFSSKSEREPTKSTSLIGMLTVTDETEAPDEES 409
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 28.4 bits (64), Expect = 7.4
Identities = 21/81 (25%), Positives = 38/81 (46%), Gaps = 7/81 (8%)
Query: 213 ESQLAAYSKSFTDDKLRDEI------KLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQD 266
L + K +D+ D I ++ R K E + KE ++K E ++ Q+ D
Sbjct: 47 NEGLFSDGKYDDEDEEADRIYESIDERMDERRKKRREQKEKEEIEKYREENPKIQQQFAD 106
Query: 267 TERKLVNI-ENECSNLQELLD 286
+R L + E+E +N+ E+ D
Sbjct: 107 LKRNLATVTEDEWANIPEVGD 127
>gnl|CDD|191851 pfam07798, DUF1640, Protein of unknown function (DUF1640). This
family consists of sequences derived from hypothetical
eukaryotic proteins. A region approximately 100 residues
in length is featured.
Length = 177
Score = 29.1 bits (66), Expect = 7.6
Identities = 26/113 (23%), Positives = 51/113 (45%), Gaps = 16/113 (14%)
Query: 226 DKLRDEIKLLHRDKSTYENQLKEVMKKM--LAEKVEVMQKCQDTERKLVNIENECSNLQE 283
D + D I+ + +D T E Q K ++ A+ +Q + +E + ENE
Sbjct: 30 DVINDSIENVSKDMVTKEQQEKTEYQQKVDFAKLKSELQSLEKSEFANLQAENE------ 83
Query: 284 LLDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQ 336
+ +++LE++ + +D++TK T L+ N EK E+ + L+
Sbjct: 84 ---KLKNDLEKLKNRLRDEITK-----VTAGVRLDLNLEKGRIREESAEQELK 128
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 29.6 bits (67), Expect = 7.6
Identities = 20/119 (16%), Positives = 51/119 (42%), Gaps = 18/119 (15%)
Query: 604 ASLQSQLNDALDNIKKYKEEHSKMTMEIESLKEKLKFSQSKEN----------------- 646
++L++QL L IK+ + E + + ++++L+++L+ +
Sbjct: 140 STLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELL 199
Query: 647 DLLEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSELDE 705
+L E + + + + L + ++ + + Q EQ+ EE+ + L + L E
Sbjct: 200 ELERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEV-LEELTEAQARLAE 257
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 28.7 bits (65), Expect = 7.8
Identities = 11/64 (17%), Positives = 29/64 (45%), Gaps = 1/64 (1%)
Query: 285 LDRNRDELEEIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQA-QKQLDA 343
L E+ +K++ ++ K+ EF+ + + +E+ + L +A +++ +
Sbjct: 30 YGDISPFLPELRQKHKARIAKDPEFQYLNEDIAWLKEERDDKTVSLNEAERKAEREEDEK 89
Query: 344 FTLK 347
LK
Sbjct: 90 KRLK 93
>gnl|CDD|206307 pfam14138, COX16, Cytochrome c oxidase assembly protein COX16.
This family represents homologues of COX16 which has
been shown to be involved in assembly of cytochrome
oxidase. Protein in this family are typically between
106 and 134 amino acids in length.
Length = 79
Score = 27.5 bits (62), Expect = 8.1
Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 630 EIESLKEKLKFSQSKEN-DLLEEYEKLK-LVIKDYQNV 665
+ +E LK + + DL EEY +L+ L I D++NV
Sbjct: 34 QQLEEEEALKLLKKRRKVDLEEEYYRLQGLDIDDWENV 71
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 29.5 bits (67), Expect = 8.2
Identities = 18/67 (26%), Positives = 35/67 (52%)
Query: 229 RDEIKLLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRN 288
R+E++ L R E L M+ + ++ + +K ++ K N++ + L+EL+
Sbjct: 75 RNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQ 134
Query: 289 RDELEEI 295
R+ELE I
Sbjct: 135 REELERI 141
>gnl|CDD|234038 TIGR02865, spore_II_E, stage II sporulation protein E. Stage II
sporulation protein E (SpoIIE) is a multiple membrane
spanning protein with two separable functions. It plays
a role in the switch to polar cell division during
sporulation. By means of it protein phosphatase
activity, located in the C-terminal region, it activates
sigma-F. All proteins that score above the trusted
cutoff to this model are found in endospore-forming
Gram-positive bacteria. Surprisingly, a sequence from
the Cyanobacterium-like (and presumably
non-spore-forming) photosynthesizer Heliobacillus
mobilis is homologous, and scores between the trusted
and noise cutoffs [Cellular processes, Sporulation and
germination].
Length = 764
Score = 29.7 bits (67), Expect = 8.4
Identities = 23/124 (18%), Positives = 46/124 (37%), Gaps = 23/124 (18%)
Query: 216 LAAYSKSFTDDKLRDEIKLLHRDKSTYENQLKE------VMKKM------------LAEK 257
S +F + +E + R S L E MK + E
Sbjct: 323 FRELSNTFVEALASNEKLTMKRKSSYLLENLAERVCQSCNMKHRCWKREFDYTYSAMEEL 382
Query: 258 VEVMQKCQDTERKLVN-IENECSNLQELLDRNRDEL--EEIIKKYQDQVTKNKEFESTID 314
+E +++ +D KL + E +C +EL++ D L I + ++ ++ + + +
Sbjct: 383 IENLEEKKDPNSKLPDEFERKCIKRKELINTTEDILNNYIINEMWRKRLEEGRRL--VAE 440
Query: 315 QLLN 318
QL
Sbjct: 441 QLKG 444
>gnl|CDD|222986 PHA03102, PHA03102, Small T antigen; Reviewed.
Length = 153
Score = 28.5 bits (64), Expect = 8.5
Identities = 14/80 (17%), Positives = 31/80 (38%), Gaps = 7/80 (8%)
Query: 226 DKLRDEIKLLHRD------KSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECS 279
K R+ +K L S E+ + ++V + C+D + L N+ + C
Sbjct: 54 KKFRESVKSLRDLDGEEDSSSEEEDVPSGYVGATFGDRVNALY-CKDWDTCLKNLSSLCK 112
Query: 280 NLQELLDRNRDELEEIIKKY 299
+ LL + +++ +
Sbjct: 113 CIFCLLRKQHRLKQKLRPEP 132
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 29.9 bits (68), Expect = 8.7
Identities = 24/115 (20%), Positives = 45/115 (39%), Gaps = 15/115 (13%)
Query: 594 KLNSAESQSS--ASLQSQLNDAL---DNIKKYKEEHSKMTMEIESLKEKLKFSQSKENDL 648
LN + + +Q L L D I + KE E E LK++L + +K
Sbjct: 47 ALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKE-------ETEQLKQQLAQAPAKLRQA 99
Query: 649 LEEYEKLKLVIKDYQNVLPSIVESMNNTKDKPQDSEQSIEEEITQQSLNPLSSEL 703
E E LK D + +++ + + + ++ + + Q L +S+L
Sbjct: 100 QAELEALK---DDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQL 151
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 29.7 bits (67), Expect = 9.1
Identities = 31/155 (20%), Positives = 60/155 (38%), Gaps = 28/155 (18%)
Query: 528 EKDFVSINKEIEESSRKIDDINAEITKC------TNELVDAKNCLVKNEKWVINLQEQFS 581
E + ++ ++I+E I+++ +EI NE + + + ++++
Sbjct: 269 EAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTV 328
Query: 582 LLEKSKETWVSLKLNS---AESQSSASLQSQLNDA----LDNIKKYKE-------EHSKM 627
L E V KL + A S+ L Q +D + KE E +
Sbjct: 329 ELLPDAENNV-AKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQ 387
Query: 628 TMEIESLKEKLKFSQSK-------ENDLLEEYEKL 655
EI+ L+ K++ +S+ LL+EYE
Sbjct: 388 LDEIKKLRNKIEELESELQTKEQLYKQLLDEYENA 422
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 29.5 bits (66), Expect = 9.9
Identities = 25/122 (20%), Positives = 48/122 (39%), Gaps = 2/122 (1%)
Query: 234 LLHRDKSTYENQLKEVMKKMLAEKVEVMQKCQDTERKLVNIENECSNLQELLDRNRDELE 293
LL R S + KE + + + ++ +++E L++ +R DELE
Sbjct: 30 LLERLDSELRDAEKE--RDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELE 87
Query: 294 EIIKKYQDQVTKNKEFESTIDQLLNSNQEKSNTLEKLQYETLQAQKQLDAFTLKDNLMHD 353
E+ K+ D + E + +QL N + LQ + L + L+ +
Sbjct: 88 ELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSLN 147
Query: 354 KL 355
+L
Sbjct: 148 QL 149
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.127 0.344
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 43,643,482
Number of extensions: 4293140
Number of successful extensions: 7107
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6165
Number of HSP's successfully gapped: 712
Length of query: 903
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 797
Effective length of database: 6,236,078
Effective search space: 4970154166
Effective search space used: 4970154166
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 64 (28.4 bits)