BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781204|ref|YP_003065617.1| hypothetical protein CLIBASIA_05555 [Candidatus Liberibacter asiaticus str. psy62] (707 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781204|ref|YP_003065617.1| hypothetical protein CLIBASIA_05555 [Candidatus Liberibacter asiaticus str. psy62] gi|254040881|gb|ACT57677.1| hypothetical protein CLIBASIA_05555 [Candidatus Liberibacter asiaticus str. psy62] gi|317120670|gb|ADV02493.1| hypothetical protein SC1_gp045 [Liberibacter phage SC1] gi|317120814|gb|ADV02635.1| hypothetical protein SC1_gp045 [Candidatus Liberibacter asiaticus] Length = 707 Score = 1154 bits (2986), Expect = 0.0, Method: Composition-based stats. Identities = 707/707 (100%), Positives = 707/707 (100%) Query: 1 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR 60 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR Sbjct: 1 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR 60 Query: 61 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ 120 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ Sbjct: 61 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ 120 Query: 121 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH 180 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH Sbjct: 121 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH 180 Query: 181 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD 240 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD Sbjct: 181 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD 240 Query: 241 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR 300 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR Sbjct: 241 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR 300 Query: 301 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA Sbjct: 301 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 Query: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL Sbjct: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 Query: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA Sbjct: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 Query: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN 540 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN Sbjct: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN 540 Query: 541 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV 600 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV Sbjct: 541 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV 600 Query: 601 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY 660 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY Sbjct: 601 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY 660 Query: 661 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP 707 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP Sbjct: 661 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP 707 >gi|268589385|ref|ZP_06123606.1| hypothetical protein PROVRETT_05517 [Providencia rettgeri DSM 1131] gi|291315412|gb|EFE55865.1| hypothetical protein PROVRETT_05517 [Providencia rettgeri DSM 1131] Length = 832 Score = 403 bits (1036), Expect = e-110, Method: Composition-based stats. Identities = 94/478 (19%), Positives = 183/478 (38%), Gaps = 64/478 (13%) Query: 236 TSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLR--- 292 T LSDV S + + ++ G W L ++ +L++ + N + + Sbjct: 377 TRLSDVDKSQYQPEIRK----AQGGGFPWWPMLTPVQQYQLIKQGEAAQNKQRQEYKSVL 432 Query: 293 ----KETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPD 348 K+ +A A R ++ N PS L+++ A+G G + Y Q D Sbjct: 433 DGKVKDAEALALRGESYSN-----PPS----LDEFKYAHGENDGIKEYEQFQKNMAMGAD 483 Query: 349 VARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAI 408 +A ++ +S L+ + + + + K + L ++ + D I Sbjct: 484 IATVQYLSPQAQTALLESKKPKGDYDAANNY-------KRYDTLARAIDTVNKVRASDPI 536 Query: 409 SWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKR 468 + I +Q++ L F + ++F +SL QR+ V I +Y + F+ EE L + K Sbjct: 537 QFSIDRQQVNQLDFSSIQNFTNSLTQRSSNVGDISKNYQTPLTVFSAQEELVLSQLMEKA 596 Query: 469 PASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVV----------------RLSS 512 PAS+ ++ + L ++ E ++ + + S V + SS Sbjct: 597 PASQKIEYLDAIRQGLKNN--ESYTAALRQVSKSDTSLAVAGVIMNKPTQLTVEYNKWSS 654 Query: 513 EFSDDAKNAAMVILSG---MKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + AA +I+ G K KD +T+ + D + D + ++ + + Sbjct: 655 DIVVTPQQAAQLIVQGSTARKSGKDFVLPKDTELREEFADFVGDVFAG----DIDGANSA 710 Query: 570 GNFNKDKEAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPRGMS 627 KD A G M +GN ++ A+ I G +M P GM Sbjct: 711 YEVAKDAYA------GMMAKNGNINGEYDKDIWQQAINIATGGIYDFNGRGKVMLPWGMD 764 Query: 628 KTDFEDR----LWYATKDTGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 F+ + L Y + G I ++G +Y+I + + K+G +II++ Sbjct: 765 GDTFKKQVKSELKYHEPEIGNISVDIIGMQSLGDSQYLIKRGSGYLLYKDGTPVIIDL 822 >gi|301028420|ref|ZP_07191666.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299878531|gb|EFI86742.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 843 Score = 344 bits (881), Expect = 4e-92, Method: Composition-based stats. Identities = 81/520 (15%), Positives = 173/520 (33%), Gaps = 68/520 (13%) Query: 212 ARDPRVFENFMRAF--YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLD 269 A +P ++ +A +++G P L+ +A S ++ + G+A +D L Sbjct: 343 ASNPDTLKSLCKAIIHHEQGSQPYSDQQLAAGVSAALGLS-QLPTKNKRYTGVAWFDALS 401 Query: 270 DTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK--PSELIPLEDYTQAY 327 ++ + +L + + R +++ R A G+ P E ++ AY Sbjct: 402 ESDQASVLRQTDALARQQQAEYRTMLESRVRDATAAYMRGVEFPNPPGE----GEFIAAY 457 Query: 328 GVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATK 387 GV +G Y + +A + + M T +++++ + AS + Sbjct: 458 GVREGNLRYTEFRNLQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGETGEGYASR---AE 513 Query: 388 YKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYN 447 + + + + + + + S ++ P+ + + + R + + Sbjct: 514 LFDRVSAAATKGITQRQNNPFSAAVEIGAYKPITSNNPDDITAEVANRFSSQESL-RALG 572 Query: 448 LTTSHFNKTEENQLRTQLV-KRPASESVDLIRGAYNTLS-------------DSDKEGVR 493 + N E L Q+ + +++ L++ L +S Sbjct: 573 INAPLLNSEEAAALAQQVRGTQNVDQTIKLLQSMGEKLPAPAIRQVASVIAPNSAATAYS 632 Query: 494 SSFAHIEDN---GLSAVVRLSSEFSDDAK----NAAMVILSG------MKHQKD-----T 535 + DN + S + A VILSG K K+ Sbjct: 633 ALLLGTPDNQYDNTRPTIPYSQFIGYKPTMNKYDVAKVILSGDQLLNPTKAMKNAGITPV 692 Query: 536 ETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTL 595 + K + + +++ P + + K A Y G Sbjct: 693 QLPSEDKMKDAFDEQVGNAFARNPQLR----QISYSLFKAAYAGIAYQSGDASTVRTSMP 748 Query: 596 NRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRL----WYATKDTG-------E 644 + V+DA++ G N ++ P GM K+ F+DR A KD G Sbjct: 749 DTDIVADAVKYATGGIYNGFNGGDVVMPFGMDKSTFKDRYTTSAKQALKDAGLNVNAVSN 808 Query: 645 YDPYTIKYMNVGSGKYMIIKNGNPKVD--KEGKTIIINVE 682 + P +N+G+ +Y ++ + K I++ VE Sbjct: 809 FTP-----VNIGNNRYRLVNGSGRWATDPRTNKAIVVRVE 843 >gi|30387394|ref|NP_848223.1| hypothetical protein epsilon15p15 [Enterobacteria phage epsilon15] gi|30266049|gb|AAO06078.1| 15 [Salmonella phage epsilon15] Length = 842 Score = 337 bits (864), Expect = 4e-90, Method: Composition-based stats. Identities = 76/476 (15%), Positives = 156/476 (32%), Gaps = 70/476 (14%) Query: 235 STSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKE 294 + LS + S + + D + A + R D R++ + R N +++++R Sbjct: 377 AIGLSQLPTSTKRYTGNAAFDAASPEAQATFLRQADQIRKQ--QQAEYRTN-IDSRVRDA 433 Query: 295 TQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKL 354 + A R +D P++ D+ AYGV +G Y + +A + + Sbjct: 434 SAAYMRGVD------FPNAPTQ----TDFLAAYGVREGNLRYTEFRNTQIAGQYIGSFRN 483 Query: 355 MSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKY 414 M T +++++ AS + + + + + + + + + ++ Sbjct: 484 MPTSSITAYVEQLRP-GTEETGEGYASR---AQLFDQVTAAATKVITQRQNNPFNAAVEI 539 Query: 415 KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLV-KRPASES 473 P+ + + + R + + + + E L Q+ + +++ Sbjct: 540 GAYKPIASNNQNDITSEVANRFSSQESL-RALGINAPILSSEEAAALSEQVRGTKDVNQT 598 Query: 474 VDLIRGAYNTLS-------------DSDKEGVRSSFAHIEDN---GLSAVVRLSSEFSDD 517 + L++ TL ++ + DN + S Sbjct: 599 ISLLQSMGKTLPAPAMRQVASAIAPNNAATAYSALLLGTPDNQYDNTKPSIAYSQFIGYK 658 Query: 518 AK----NAAMVILSG------MKHQKD-----TETRYNTDHKSNKFDSLYDSYINTPLTK 562 + + VIL+G K KD + K D + +S+ N P + Sbjct: 659 PTMNKYDVSKVILAGDQLLNPTKAMKDAGITPVQLPSEDKLKRAFDDQVGNSFANNPQAR 718 Query: 563 LEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMP 622 N K A Y G + N V A Q G N ++ Sbjct: 719 ----QLSYNLFKAAYAGIAYQSGDASMTKTDAANSDVVEKAAQYATGGVYKGFNGGDVVM 774 Query: 623 PRGMSKTDFEDRL----WYATKDTG-------EYDPYTIKYMNVGSGKYMIIKNGN 667 P GM K+ F+DR A KD G + P +N+G+ +Y ++ Sbjct: 775 PFGMDKSTFKDRYTASAQQALKDAGLNVNAVSNFTP-----VNIGNNQYRLVSGSG 825 >gi|85059171|ref|YP_454873.1| hypothetical protein SG1193 [Sodalis glossinidius str. 'morsitans'] gi|84779691|dbj|BAE74468.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 843 Score = 313 bits (801), Expect = 8e-83, Method: Composition-based stats. Identities = 83/452 (18%), Positives = 165/452 (36%), Gaps = 55/452 (12%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQA 326 +LD + RL + N + R E + + + A G P E +++T+A Sbjct: 398 QLDPVQLARLRSVAQGQLNQQQREYRVELEGELKDFTAAALRGKNY-PREFTA-DEFTRA 455 Query: 327 YGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMAT 386 YG + G + Y D++ ++ +S + L + P A A Sbjct: 456 YGYDAGQKAYANYLDTQQLGSDISTVQQLSPVGQQALLNAREP----VPGEGYAE---AA 508 Query: 387 KYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDY 446 K E L+K+ + QD + + + KQI PL T E+F L R ++ Y Sbjct: 509 KRHETLQKAVEYVNRARMQDPVQYAAEQKQIAPLDMQTPEAFRAGLSSRGSLAPELARHY 568 Query: 447 NLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSA 506 + F+++E +Q+ L PAS+S+ + L ++ + SA Sbjct: 569 GTPLAIFSQSEASQIGEMLRSAPASQSIAYLDALRQGL--GAGVQYSAALQQVSRYAPSA 626 Query: 507 VVR-----------------LSSEFSDDAKNAAMVILSGMKHQ--------------KDT 535 V LS++ +AA I+ G + + Sbjct: 627 AVAGAIMGKRGNVIGNEGGWLSADLMVTPDDAAKTIIEGANARAGVTTNVNGVENKTRGI 686 Query: 536 ETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTL 595 E +TD + D + ++ E + + KD A + G++ Sbjct: 687 EMPKDTDLRPEFVDKVGKAFAG----DAEGAAQAYDVAKDYYAGVMARKGNVSGE----Y 738 Query: 596 NRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATK-----DTGEYDPYTI 650 ++ + A+ + G + ++ P GMS++ F + A K D + P Sbjct: 739 DKDTWTQAINVATGGVYDYGGQGDVLLPWGMSESQFNKAVGVAWKKQVLDDGIKPPPGRY 798 Query: 651 KYMNVGSGKYMIIKNGNPKVDKEGKTIIINVE 682 + G +Y++ G + K+G+ ++++++ Sbjct: 799 GLQSHGDSQYLVKLGGGYLLGKDGRPVVLHID 830 >gi|320175034|gb|EFW50147.1| putative bacteriophage protein [Shigella dysenteriae CDC 74-1112] Length = 837 Score = 280 bits (715), Expect = 8e-73, Method: Composition-based stats. Identities = 78/471 (16%), Positives = 165/471 (35%), Gaps = 54/471 (11%) Query: 238 LSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQA 297 LS + + + + D + A + R D RR + + +++K+R T A Sbjct: 381 LSTIPTNTKRYTGNAAFDAASPEAQASFMRQADQLRR---QQQAEYKTIIDSKVRDATAA 437 Query: 298 QARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMST 357 R ++ P E +D+ AYGV +G Y + +A + + M T Sbjct: 438 YMRGVE------FPNPPGE----DDFIAAYGVREGNLRYTEFRNTQIAGQYIGSFRNMPT 487 Query: 358 FDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQI 417 +++++ + AS + + + Q +++ D + + + Q Sbjct: 488 SSITAYVEQLRPD-TGETGEGYASR---AALYDNVVSAANQVIKQRQSDPVQFSLASGQA 543 Query: 418 DPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLI 477 P+ +++F S+ RA + + Y + F+K E NQ+ P S+ + Sbjct: 544 KPIDMSNKDNFGQSVALRAAQISDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYL 603 Query: 478 RGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR----------------LSSEFSDDAKNA 521 S + S+ I N SA V + + S + A Sbjct: 604 DTIRQ--STGGGQVYMSALQQISANAPSAAVAGILMDKPGGILAETNWFNPDVSVSPETA 661 Query: 522 AMVILSGMKHQKDT------ETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A IL+G +K T + D + D + D++ + ++ KD Sbjct: 662 AQTILAGAAARKGTDDAKGIPMPKDADLRLEFSDMVKDAFAG----DAQGASMAYEIAKD 717 Query: 576 KEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRL 635 A + G + ++ A+ + G ++ P GMS F+ ++ Sbjct: 718 YYAGVMAKKGVISGE----IDTDIWKQAVNVATGGVHDYNGMGNVLLPWGMSAEQFDKQV 773 Query: 636 W--YATKDTG---EYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 + + G + P + G +Y++ + +G +++N+ Sbjct: 774 NQAWNEQVVGTGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVLNL 824 >gi|218700979|ref|YP_002408608.1| hypothetical protein ECIAI39_2669 [Escherichia coli IAI39] gi|218370965|emb|CAR18792.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] Length = 837 Score = 279 bits (714), Expect = 1e-72, Method: Composition-based stats. Identities = 74/450 (16%), Positives = 159/450 (35%), Gaps = 47/450 (10%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK--PSELI 318 G+A +D L ++ + +L + + + ++ R A GI P E Sbjct: 393 GIAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGIEFPNPPGE-- 450 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 +D+ AYGV +G Y + +A + + M T +++++ + + Sbjct: 451 --DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGDTGEG 507 Query: 379 LASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGF 438 A+ + + + Q +++ D + + + Q P+ +++F S+ RA Sbjct: 508 YAAR---ATLYDNVVSAANQVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQ 564 Query: 439 VKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAH 498 V + Y + F+K E NQ+ P S+ + S + S+ Sbjct: 565 VSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQ 622 Query: 499 IEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKDT------E 536 I N SA V + + S + AA IL+G +K T Sbjct: 623 ISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIP 682 Query: 537 TRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLN 596 + D + D + D++ + ++ KD A + G + ++ Sbjct: 683 MPKDADLRLEFSDMVKDAFAG----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----ID 734 Query: 597 RVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIK 651 A+ + G ++ P GMS F+ ++ + + G + P Sbjct: 735 NDVWKQAVNVATGGVHDYNGMGNVLLPWGMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYG 794 Query: 652 YMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 + G +Y++ + +G +++N+ Sbjct: 795 LQSYGDSQYLVKLGTGYLLKDDGSPVVLNL 824 >gi|300898438|ref|ZP_07116779.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357905|gb|EFJ73775.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 837 Score = 279 bits (713), Expect = 1e-72, Method: Composition-based stats. Identities = 74/450 (16%), Positives = 159/450 (35%), Gaps = 47/450 (10%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK--PSELI 318 G+A +D L ++ + +L + + + ++ R A G+ P E Sbjct: 393 GVAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGVEFPNPPGE-- 450 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 +D+ AYGV +G Y + +A + + M T +++++ + + Sbjct: 451 --DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGDTGEG 507 Query: 379 LASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGF 438 A+ + + + Q +++ D + + + Q P+ +++F S+ RA Sbjct: 508 YAAR---AALYDNVVSAANQVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQ 564 Query: 439 VKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAH 498 V + Y + F+K E NQ+ P S+ + S + S+ Sbjct: 565 VSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQ 622 Query: 499 IEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKDT------E 536 I N SA V + + S + AA IL+G +K T Sbjct: 623 ISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIP 682 Query: 537 TRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLN 596 + D + D + D++ + ++ KD A + G + ++ Sbjct: 683 MPKDADLRIEFSDMVKDAFAG----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----ID 734 Query: 597 RVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKD----TGEYDP-YTIK 651 A+ + G ++ P GMS F+ ++ A + TG P Sbjct: 735 NDIWKQAVNVATGGVHDYNGMGNVLLPWGMSAEQFDKQVNQAWNEQVVGTGIKTPLGQYG 794 Query: 652 YMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 + G +Y++ + +G +++++ Sbjct: 795 LQSYGDSQYLVKLGTGYLLKDDGSPVVLDL 824 >gi|117624701|ref|YP_853614.1| hypothetical protein APECO1_4052 [Escherichia coli APEC O1] gi|115513825|gb|ABJ01900.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 837 Score = 279 bits (713), Expect = 1e-72, Method: Composition-based stats. Identities = 74/450 (16%), Positives = 159/450 (35%), Gaps = 47/450 (10%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK--PSELI 318 G+A +D L ++ + +L + + + ++ R A GI P E Sbjct: 393 GVAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGIEFPNPPGE-- 450 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 +D+ AYGV +G Y + +A + + M T +++++ + + Sbjct: 451 --DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGDTGEG 507 Query: 379 LASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGF 438 A+ + + + Q +++ D + + + Q P+ +++F S+ RA Sbjct: 508 YAAR---ATLYDNVVSAANQVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQ 564 Query: 439 VKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAH 498 V + Y + F+K E NQ+ P S+ + S + S+ Sbjct: 565 VSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQ 622 Query: 499 IEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKDT------E 536 I N SA V + + S + AA IL+G +K T Sbjct: 623 ISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIP 682 Query: 537 TRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLN 596 + D + D + D++ + ++ KD A + G + ++ Sbjct: 683 MPKDADLRLEFSDMVKDAFAG----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----ID 734 Query: 597 RVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIK 651 A+ + G ++ P GMS F+ ++ + + G + P Sbjct: 735 NDVWKQAVNVATGGVHDYNGMGNVLLPWGMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYG 794 Query: 652 YMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 + G +Y++ + +G +++N+ Sbjct: 795 LQSYGDSQYLVKLGTGYLLKDDGSPVVLNL 824 >gi|324008549|gb|EGB77768.1| hypothetical protein HMPREF9532_01736 [Escherichia coli MS 57-2] Length = 837 Score = 279 bits (713), Expect = 1e-72, Method: Composition-based stats. Identities = 73/450 (16%), Positives = 158/450 (35%), Gaps = 47/450 (10%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK--PSELI 318 G+A +D L ++ + +L + + + ++ R A G+ P E Sbjct: 393 GVAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGVEFPNPPGE-- 450 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 +D+ AYGV +G Y + +A + + M T +++++ + Sbjct: 451 --DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGETGEG 507 Query: 379 LASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGF 438 A+ + + + Q +++ D + + + Q P+ +++F S+ RA Sbjct: 508 YAAR---AALYDNVVSAANQVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQ 564 Query: 439 VKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAH 498 V + Y + F+K E NQ+ P S+ + S + S+ Sbjct: 565 VSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQ 622 Query: 499 IEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKDT------E 536 I N SA V + + S + AA IL+G +K T Sbjct: 623 ISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIP 682 Query: 537 TRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLN 596 + D + D + D++ + ++ KD A + G + ++ Sbjct: 683 MPKDADLRLEFSDMVKDAFAG----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----ID 734 Query: 597 RVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIK 651 A+ + G ++ P GMS F+ ++ + + G + P Sbjct: 735 NDVWKQAVNVATGGVHDYNGMGNVLLPWGMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYG 794 Query: 652 YMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 + G +Y++ + +G +++N+ Sbjct: 795 LQSYGDSQYLVKLGTGYLLKDDGSPVVLNL 824 >gi|323948674|gb|EGB44579.1| hypothetical protein ERKG_04897 [Escherichia coli H252] Length = 837 Score = 278 bits (712), Expect = 1e-72, Method: Composition-based stats. Identities = 74/450 (16%), Positives = 159/450 (35%), Gaps = 47/450 (10%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK--PSELI 318 G+A +D L ++ + +L + + + ++ R A GI P E Sbjct: 393 GVAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGIEFPNPPGE-- 450 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 +D+ AYGV +G Y + +A + + M T +++++ + + Sbjct: 451 --DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGDTGEG 507 Query: 379 LASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGF 438 A+ + + + Q +++ D + + + Q P+ +++F S+ RA Sbjct: 508 YAAR---ATLYDNVVSAANQVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQ 564 Query: 439 VKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAH 498 V + Y + F+K E NQ+ P S+ + S + S+ Sbjct: 565 VSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQ 622 Query: 499 IEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKDT------E 536 I N SA V + + S + AA IL+G +K T Sbjct: 623 ISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIP 682 Query: 537 TRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLN 596 + D + D + D++ + ++ KD A + G + ++ Sbjct: 683 MPKDADLRLEFSDMVKDAFAG----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----ID 734 Query: 597 RVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIK 651 A+ + G ++ P GMS F+ ++ + + G + P Sbjct: 735 NDVWKQAVNVATGGVHDYNGMGNVLLPWGMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYG 794 Query: 652 YMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 + G +Y++ + +G +++N+ Sbjct: 795 LQSYGDSQYLVKLGTGYLLKDDGSPVVLNL 824 >gi|331648165|ref|ZP_08349255.1| hypothetical protein ECIG_04091 [Escherichia coli M605] gi|331043025|gb|EGI15165.1| hypothetical protein ECIG_04091 [Escherichia coli M605] Length = 837 Score = 278 bits (711), Expect = 2e-72, Method: Composition-based stats. Identities = 72/429 (16%), Positives = 152/429 (35%), Gaps = 55/429 (12%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E +D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGE----DDFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A+ + + + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGETGEGYAAR---AALYDNVVSAANQV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------- 509 Q+ P S+ + S + S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQISANAPSAAVAGILMDKPGGI 643 Query: 510 ------LSSEFSDDAKNAAMVILSGMKHQKDT------ETRYNTDHKSNKFDSLYDSYIN 557 + + S + AA IL+G +K T + D + D + D++ Sbjct: 644 LAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIPMPKDADLRLEFSDMVKDAFAG 703 Query: 558 TPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINE 617 + ++ KD A + G + ++ A+ + G Sbjct: 704 ----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----IDNDVWKQAVNVATGGVHDYNGM 755 Query: 618 SMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIKYMNVGSGKYMIIKNGNPKVDK 672 ++ P GMS F+ ++ + + G + P + G +Y++ + Sbjct: 756 GNVLLPWGMSAEQFDKQINQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKD 815 Query: 673 EGKTIIINV 681 +G +++N+ Sbjct: 816 DGSPVVLNL 824 >gi|323156122|gb|EFZ42281.1| prophage MuMc02, structural protein P5 [Escherichia coli EPECa14] Length = 837 Score = 276 bits (705), Expect = 1e-71, Method: Composition-based stats. Identities = 71/429 (16%), Positives = 151/429 (35%), Gaps = 55/429 (12%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGEA----DFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A+ + + + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGETGEGYAAR---AALYDNVVSAASQV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------- 509 Q+ P S+ + S + S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQISANAPSAAVAGILMDKPGGI 643 Query: 510 ------LSSEFSDDAKNAAMVILSGMKHQKDT------ETRYNTDHKSNKFDSLYDSYIN 557 + + S + AA IL+G +K T + D + D + D++ Sbjct: 644 LAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIPMPKDADLRLEFSDMVKDAFAG 703 Query: 558 TPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINE 617 + ++ KD A + G + ++ A+ + G Sbjct: 704 ----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----IDNDIWKQAVNVATGGVHDYNGM 755 Query: 618 SMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIKYMNVGSGKYMIIKNGNPKVDK 672 ++ P GMS F+ ++ + + G + P + G +Y++ + Sbjct: 756 GNVLLPWGMSAEQFDKQVNQAWNEQVVGTGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKD 815 Query: 673 EGKTIIINV 681 +G +++++ Sbjct: 816 DGSPVVLDL 824 >gi|298381707|ref|ZP_06991306.1| minor structural protein [Escherichia coli FVEC1302] gi|298279149|gb|EFI20663.1| minor structural protein [Escherichia coli FVEC1302] Length = 837 Score = 275 bits (704), Expect = 1e-71, Method: Composition-based stats. Identities = 72/429 (16%), Positives = 151/429 (35%), Gaps = 55/429 (12%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGEA----DFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A+ + + + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGETGEGYAAR---AALYDNVVSAASQV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRATQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------- 509 Q+ P S+ + S + S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQISANAPSAAVAGILMDKPGGI 643 Query: 510 ------LSSEFSDDAKNAAMVILSGMKHQKDT------ETRYNTDHKSNKFDSLYDSYIN 557 + + S + AA IL+G +K T + D + D + D++ Sbjct: 644 LAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIPMPKDADLRLEFSDMVKDAFAG 703 Query: 558 TPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINE 617 + ++ KD A + G + ++ A+ + G Sbjct: 704 ----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----IDNDVWKQAVNVATGGVHDYNGM 755 Query: 618 SMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIKYMNVGSGKYMIIKNGNPKVDK 672 ++ P GMS F+ ++ + + G + P + G +Y++ + Sbjct: 756 GNVLLPWGMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKD 815 Query: 673 EGKTIIINV 681 +G +++N+ Sbjct: 816 DGSPVVLNL 824 >gi|89152439|ref|YP_512272.1| hypothetical structural protein [Escherichia phage phiV10] gi|74055462|gb|AAZ95911.1| hypothetical structural protein [Escherichia phage phiV10] Length = 837 Score = 275 bits (704), Expect = 1e-71, Method: Composition-based stats. Identities = 71/429 (16%), Positives = 151/429 (35%), Gaps = 55/429 (12%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGEA----DFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A+ + + + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGETGEGYAAR---AALYDNVVSAASQV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------- 509 Q+ P S+ + S + S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQISANAPSAAVAGILMDKPGGI 643 Query: 510 ------LSSEFSDDAKNAAMVILSGMKHQKDT------ETRYNTDHKSNKFDSLYDSYIN 557 + + S + AA IL+G +K T + D + D + D++ Sbjct: 644 LAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIPMPKDADLRLEFSDMVKDAFAG 703 Query: 558 TPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINE 617 + ++ KD A + G + ++ A+ + G Sbjct: 704 ----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----IDNDIWKQAVNVATGGVHDYNGM 755 Query: 618 SMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIKYMNVGSGKYMIIKNGNPKVDK 672 ++ P GMS F+ ++ + + G + P + G +Y++ + Sbjct: 756 GNVLLPWGMSAEQFDKQVNQAWNEQVVGTGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKD 815 Query: 673 EGKTIIINV 681 +G +++++ Sbjct: 816 DGSPVVLDL 824 >gi|332344343|gb|AEE57677.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 837 Score = 275 bits (704), Expect = 1e-71, Method: Composition-based stats. Identities = 72/429 (16%), Positives = 151/429 (35%), Gaps = 55/429 (12%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGEA----DFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A+ + + + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGETGEGYAAR---AALYDNVVSAASQV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------- 509 Q+ P S+ + S + S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQISANAPSAAVAGILMDKPGGI 643 Query: 510 ------LSSEFSDDAKNAAMVILSGMKHQKDT------ETRYNTDHKSNKFDSLYDSYIN 557 + + S + AA IL+G +K T + D + D + D++ Sbjct: 644 LAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIPMPKDADLRFEFSDMVKDAFAG 703 Query: 558 TPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINE 617 + ++ KD A + G + ++ A+ + G Sbjct: 704 ----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----IDNDVWKQAVNVATGGVHDYNGM 755 Query: 618 SMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIKYMNVGSGKYMIIKNGNPKVDK 672 ++ P GMS F+ ++ + + G + P + G +Y++ + Sbjct: 756 GNVLLPWGMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKD 815 Query: 673 EGKTIIINV 681 +G +++N+ Sbjct: 816 DGSPVVLNL 824 >gi|327252173|gb|EGE63845.1| prophage MuMc02, structural protein P5 [Escherichia coli STEC_7v] Length = 837 Score = 274 bits (701), Expect = 3e-71, Method: Composition-based stats. Identities = 71/429 (16%), Positives = 151/429 (35%), Gaps = 55/429 (12%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGEA----DFIAAYGVREGKLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A+ + + + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-TGETGEGYAAR---AALYDNVVSAASQV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------- 509 Q+ P S+ + S + S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYLDTIRQ--STGGGQVYMSALQQISANAPSAAVAGILMDKPGGI 643 Query: 510 ------LSSEFSDDAKNAAMVILSGMKHQKDT------ETRYNTDHKSNKFDSLYDSYIN 557 + + S + AA IL+G +K T + D + D + D++ Sbjct: 644 LAEKNWFNPDVSVSPETAAQTILAGAAARKGTDDAKGIPMPKDADLRLEFSDMVKDAFAG 703 Query: 558 TPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINE 617 + ++ KD A + G + ++ A+ + G Sbjct: 704 ----DAQGASMAYEIAKDYYAGVMAKKGVVSGE----IDNDIWKQAVNVATGGVHDYNGM 755 Query: 618 SMLMPPRGMSKTDFEDRLW--YATKDTG---EYDPYTIKYMNVGSGKYMIIKNGNPKVDK 672 ++ P GMS F+ ++ + + G + P + G +Y++ + Sbjct: 756 GNVLLPWGMSAEQFDKQVNQAWNEQVVGTGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKD 815 Query: 673 EGKTIIINV 681 +G +++++ Sbjct: 816 DGSPVVLDL 824 >gi|309702801|emb|CBJ02132.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 836 Score = 269 bits (687), Expect = 1e-69, Method: Composition-based stats. Identities = 79/474 (16%), Positives = 165/474 (34%), Gaps = 54/474 (11%) Query: 235 STSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKE 294 + LS + S + + D + A + R D R++ + R N +++++R Sbjct: 377 AIGLSQLPTSTKRYTGNAAFDAASPEAQATFLRQADQIRKQ--QQAEYRTN-IDSRVRDA 433 Query: 295 TQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKL 354 + A R +D P++ D+ AYGV +G Y + +A + + Sbjct: 434 SAAYMRGVD------FPNAPTQ----NDFMAAYGVREGNLRYTEFRNTQIAGQYIGSFRN 483 Query: 355 MSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKY 414 M T + ++ + + A+ + + + + + + D I + + Sbjct: 484 MPTSSIQAAVENLKPD-TGETGEGYAAR---AQTYDAVVSAASTVLAQRKADPIQFSLSS 539 Query: 415 KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESV 474 Q P+ + +F+ ++ RA ++ Y + F+K E NQ+ P S+ Sbjct: 540 GQTKPIDMTNQNNFSQTIALRASQAAELAKSYGTPLTFFSKDEANQIGAFFRDAPVSQQS 599 Query: 475 DLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR----------------LSSEFSDDA 518 + S + S+ I N SA V + + S Sbjct: 600 AYLDTIRQ--STGGGQVYMSALQQISTNAPSAAVAGILMDKPGGVVAEKNWFNPDVSVSP 657 Query: 519 KNAAMVILSGMKH------QKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNF 572 ++AA ILSG K + D + D + D++ ++ Sbjct: 658 ESAAQTILSGAAARKGTGDVKGIPMPKDNDLRLEFSDMVKDAFAG----DAHGASMAYEI 713 Query: 573 NKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFE 632 KD A + G + ++ A+ + G ++ P GMS F+ Sbjct: 714 AKDYYAGVMAKKGVVSGE----IDSDTWKQAVNVATGGVHDYNGMGSVLLPWGMSAEQFD 769 Query: 633 DRLWYATK----DTG-EYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 ++ A K D G + P + G +Y++ + +G ++I++ Sbjct: 770 KQVDQAWKTQVTDAGIKAPPGQYGLQSYGDSQYLVKLGTGYLLKDDGTPVVIDL 823 >gi|215487810|ref|YP_002330241.1| hypothetical protein E2348C_2743 [Escherichia coli O127:H6 str. E2348/69] gi|215265882|emb|CAS10291.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 836 Score = 268 bits (685), Expect = 2e-69, Method: Composition-based stats. Identities = 79/474 (16%), Positives = 166/474 (35%), Gaps = 54/474 (11%) Query: 235 STSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKE 294 + LS + S + + D + A + R D R++ + R N +++++R Sbjct: 377 AIGLSQLPTSTKRYTGNAAFDAASPEAQATFLRQADQIRKQ--QQAEYRTN-IDSRVRDA 433 Query: 295 TQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKL 354 + A R +D P++ D+ AYGV +G Y + +A + + Sbjct: 434 SAAYMRGVD------FPNAPTQ----NDFLAAYGVREGNLRYTEFRNTQIAGQYIGSFRN 483 Query: 355 MSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKY 414 M T + ++ + + A+ + + + + + + D I + + Sbjct: 484 MPTSSIQAAVENLKPD-TGETGEGYAAR---AQTYDAVVSAASTVLAQRKADPIQFSLSS 539 Query: 415 KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESV 474 Q P+ + +F ++ RA + Y + F+K E NQ+ P S+ Sbjct: 540 GQTKPIDMTNQNNFGQTISLRASQAVDLAKSYGTPLTFFSKEEANQIGAFFRDAPVSQQT 599 Query: 475 DLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR----------------LSSEFSDDA 518 + S + S+ I N SA V + + S Sbjct: 600 AYLDTIRQ--STGGGQVYMSALQQISTNAPSAAVAGILMDKPGGVVAEKNWFNPDVSVSP 657 Query: 519 KNAAMVILSGMKHQKDT------ETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNF 572 + AA ILSG +K T + D + D + D++ + ++ Sbjct: 658 ETAAQTILSGAAARKGTDDAKGIPMPKDNDLRLEFSDMVKDAFAG----DAQGASMAYEI 713 Query: 573 NKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFE 632 KD A + G + ++ A+ + G ++ P GMS F+ Sbjct: 714 AKDYYAGVMAKKGVVSGE----VDSDTWKQAVNVATGGVHDYNGMGSVLLPWGMSAEQFD 769 Query: 633 DRL--WYATKDTG---EYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIINV 681 ++ + T+ TG + P + G +Y++ + +G ++I++ Sbjct: 770 KQVDQAWKTQVTGAGIKAPPGQYGLQSYGDSQYLVKLGTGYLLKDDGTPVVIDL 823 >gi|304398392|ref|ZP_07380266.1| conserved hypothetical protein [Pantoea sp. aB] gi|304354258|gb|EFM18631.1| conserved hypothetical protein [Pantoea sp. aB] Length = 835 Score = 252 bits (642), Expect = 2e-64, Method: Composition-based stats. Identities = 81/474 (17%), Positives = 153/474 (32%), Gaps = 60/474 (12%) Query: 259 AIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR-RIDAQLNHGIT-VKPSE 316 A+GL D + S + A + K Q +AR ++ AQL I+ K S Sbjct: 372 ALGLTALDSPKRYTGNVAFDAASPQAQASYLRQAKSLQGEARTQLKAQLTDVISDAKASY 431 Query: 317 LIPLE--------DYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKID 368 L +E AYG +G + + + + VA + + M + ++ + Sbjct: 432 LKGVEYPNPPSQAQLISAYGYREGNQRFADLENQRVAGQYIGSFRNMPSSSITNYVADLK 491 Query: 369 DEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESF 428 + S A A + +E + +Q + + + P+ Sbjct: 492 TQLGS--GEGFAGRADA---YDHVEAAAKQVISLRESNPYQAAMDMGAYKPIASTNPADI 546 Query: 429 ADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLV-KRPASESVDLIRGAYNTL--- 484 ++ R ++K + +K E + ++ ++S+ L++ L Sbjct: 547 TSEIKSRTAATDQLKS-LGINAPILSKEEAASISERVRGTTDVNQSISLLQSFGRDLQPQ 605 Query: 485 ----------SDSDKEGVRSSFAHIEDN---GLSAVVRLSSEFSDDAK----NAAMVILS 527 DS + +DN S + S S A IL Sbjct: 606 ALRSVAASIAPDSAATAYSALILGTDDNQYNNRSPSIPYSQFVSYKPTMNKYEVAKTILQ 665 Query: 528 G------MKHQKD-----TETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDK 576 G K QKD + + K D + +++ + P + + K Sbjct: 666 GDQLINPTKAQKDAGISAVKLPADDKLKQTFDDEIGNAFSHNPQAR----QMAWSIYKSA 721 Query: 577 EAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLW 636 A Y G ++ A+Q+ G N ++ P GM K+ F+DR Sbjct: 722 YAGLAYTSGDSDGVNTKAVDSDIAEKAIQMATGGVIKGFNGGDVVMPFGMDKSTFKDRYT 781 Query: 637 YATKD---TGEYDPYTIKY---MNVGSGKYMIIKNGNPKVD--KEGKTIIINVE 682 A + + +P +NVG +Y ++ K G I + V+ Sbjct: 782 AAAGEALKSAGLNPAAQSNFIPVNVGDSQYRLVTGSGRWATDPKTGAPITVRVQ 835 >gi|330007166|ref|ZP_08305908.1| hypothetical protein HMPREF9538_03597 [Klebsiella sp. MS 92-3] gi|328535513|gb|EGF61973.1| hypothetical protein HMPREF9538_03597 [Klebsiella sp. MS 92-3] Length = 838 Score = 241 bits (615), Expect = 3e-61, Method: Composition-based stats. Identities = 96/545 (17%), Positives = 196/545 (35%), Gaps = 78/545 (14%) Query: 190 LKQKLLSEAKEKLNT-NQIIG--AHARDPRVFENFMRAFYKKGH-PPKDSTSLSDVSDSA 245 L + LLS ++ +T ++I+ A A D E ++ A KK + P D ++SD+ ++ Sbjct: 288 LGKNLLSYQRQGYDTVSEIVNRWAPASDGNNTEAYIAALCKKLNVTPNDQLNMSDI-NTL 346 Query: 246 RERSLEVVED---------------VSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTK 290 R+ +++ VS A+GL + + + S + A + Sbjct: 347 RQLCAGIIQHENGKQPYSEDQLNTGVSAALGLTTLESPKRYSGNQAFDAASPQQQAAYLR 406 Query: 291 LRKETQAQAR-RIDAQLNHGI---TVKPSELIPLE------DYTQAYGVEQGTELYNLQQ 340 E + QAR + AQL + T + I + D+ A+G +GT+ +N + Sbjct: 407 QSMELRNQARTQFKAQLVDQVQDATAAYLKGIQFDNPPSQGDFINAFGYREGTQRFNDFE 466 Query: 341 FKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSM 400 VA + + M T ++++ + ++ + LA A + ++ + ++ + Sbjct: 467 NLRVAGQYIGSFRTMPTASIQQYVSDLKNQVGN--GEGLAGRAAA---FDHVQAAAQRII 521 Query: 401 QELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQ 460 + D + P+ T ++ A ++ R ++K +T +K E Sbjct: 522 SQRQSDPFQSAVDIGAYKPISNSTPDAIASEVKNRYAAQDQLK-AIGITPPLLSKQESQV 580 Query: 461 LRTQLV-KRPASESVDLIRGAYNTLS-------------DSDKEGVRSSFAHIEDN---G 503 L + ++++ L++G TL S + +DN Sbjct: 581 LTDAVRNSTDVNQAISLLQGLGRTLPPQALRSVASSIAPGSPGTAYAALLLGQQDNQYDN 640 Query: 504 LSAVVRLSSEFSDDAK----NAAMVILSG------MKHQKD-----TETRYNTDHKSNKF 548 S ++ S S + A +L+G K KD + K+ Sbjct: 641 RSGIIPYSQFVSYKPTLDKYDVAKTVLAGDQMLNPTKAMKDAGISAVSIPSDEKLKTYFD 700 Query: 549 DSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVL 608 + +++ + + + A Y G + + + A Q Sbjct: 701 KEVGNAFAYSAQAR----QVAWGNFRSAYAALAYQSGDASKTNTVSPDSDIAEKAAQYAT 756 Query: 609 GNTPVNINESMLMPPRGMSKTDFEDRLWYATKD------TGEYDPYTIKYMNVGSGKYMI 662 G +N S ++ P GM KT F DR A ++ K +NVG+ +Y + Sbjct: 757 GGVYKGLNGSDVVMPFGMDKTTFRDRYTAAGREAMTQAGLNPSSLENFKAVNVGNNQYRL 816 Query: 663 IKNGN 667 + Sbjct: 817 VNGSG 821 >gi|50282929|ref|YP_052985.1| hypothetical protein VP2p17 [Vibrio phage VP2] Length = 694 Score = 240 bits (611), Expect = 7e-61, Method: Composition-based stats. Identities = 88/459 (19%), Positives = 159/459 (34%), Gaps = 75/459 (16%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQA--------RRIDAQLNHGITV 312 G AG L++ +R ++ L +R N +N + Q + R IDA L G V Sbjct: 245 GYAG--ELNEPQRLATIDQLKARINLVNKRNEASYQGRLTLLGVEVDRTIDA-LESGEMV 301 Query: 313 KPSELIPLEDYTQAYGVEQG---------TELYNLQQFKSVAAPDVARIKLMSTFDAKKF 363 P L QA + G + + +PD+ + K + + Sbjct: 302 SPQHQARLFAQIQA-AKDAGLISSEDSSWVKRTAFFSEAATYSPDLVQFKSKPLQEQEAV 360 Query: 364 LQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQID--PLR 421 ++++ + A + +I K+H + LN D + +G + ID P Sbjct: 361 IREMR---------NSAVSGSDYHKLDIYTKNHENAKSMLNSDPLMYGAQTGIIDLQPFD 411 Query: 422 FDTEESFADSLRQRAGFVKKIKDDYNLTT--SHFNKTEENQLRTQLVKRPASESVDLIRG 479 F A SL +R V +K Y + + F K E + + P + + ++ Sbjct: 412 FADP---AASLVKRQNDVASMKSQYGDSMVPAFFTKQEAMEFSRVVQDAPVKDQMRMLSI 468 Query: 480 AYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRY 539 LS + VRS + NG + S + A IL G K+ + Sbjct: 469 INGQLSKDNANAVRS---QLMGNGGGVLAVASDALDSNQPYVAEKILMGRNALKEVPEAF 525 Query: 540 ---NTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNY--T 594 +TD + ++Y +P T + E+IK + G++ T Sbjct: 526 KSWDTDASPLVLSEIGNAYSASPAT----------MGQVMESIKAAYAAQTVNEGDFSGT 575 Query: 595 LNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKD----TGEYDPYT- 649 +N R+ ++ V G + +++ P R M + F+ L G Y Sbjct: 576 INSKRLKSVIKEVTGGLIKIGSSTIVSPDRNMDENKFDSWLRDVQPKYLDTLGGVKGYNG 635 Query: 650 ---------------IKYMNVGSGKYMIIKNGNPKVDKE 673 + +NVG KY++ V ++ Sbjct: 636 RAGVMALHTQVSNGDVTMVNVGKDKYLLRGENGFLVKRD 674 >gi|50282962|ref|YP_053018.1| hypothetical protein VP5_gp16 [Vibrio phage VP5] Length = 694 Score = 240 bits (611), Expect = 7e-61, Method: Composition-based stats. Identities = 88/459 (19%), Positives = 159/459 (34%), Gaps = 75/459 (16%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQA--------RRIDAQLNHGITV 312 G AG L++ +R ++ L +R N +N + Q + R IDA L G V Sbjct: 245 GYAG--ELNEPQRLATIDQLKARINLVNKRNEASYQGRLTLLGVEVDRTIDA-LESGEMV 301 Query: 313 KPSELIPLEDYTQAYGVEQG---------TELYNLQQFKSVAAPDVARIKLMSTFDAKKF 363 P L QA + G + + +PD+ + K + + Sbjct: 302 SPQHQARLFAQIQA-AKDAGLISSEDSSWVKRTAFFSEAATYSPDLVQFKSKPLQEQEAV 360 Query: 364 LQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQID--PLR 421 ++++ + A + +I K+H + LN D + +G + ID P Sbjct: 361 IREMR---------NSAVSGSDYHKLDIYTKNHENAKSMLNSDPLMYGAQTGIIDLQPFD 411 Query: 422 FDTEESFADSLRQRAGFVKKIKDDYNLTT--SHFNKTEENQLRTQLVKRPASESVDLIRG 479 F A SL +R V +K Y + + F K E + + P + + ++ Sbjct: 412 FADP---AASLVKRQNDVASMKSQYGDSMVPAFFTKQEAMEFSRVVQDAPVKDQMRMLSI 468 Query: 480 AYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRY 539 LS + VRS + NG + S + A IL G K+ + Sbjct: 469 INGQLSKDNANAVRS---QLMGNGGGVLAVASDALDSNQPYVAEKILMGRNALKEVPEAF 525 Query: 540 ---NTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNY--T 594 +TD + ++Y +P T + E+IK + G++ T Sbjct: 526 KSWDTDASPLVLSEIGNAYSASPAT----------MGQVMESIKAAYAAQTVNEGDFSGT 575 Query: 595 LNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKD----TGEYDPYT- 649 +N R+ ++ V G + +++ P R M + F+ L G Y Sbjct: 576 INSKRLKSVIKEVTGGLIKIGSSTIVSPDRNMDENKFDSWLRDVQPKYLDTLGGVKGYNG 635 Query: 650 ---------------IKYMNVGSGKYMIIKNGNPKVDKE 673 + +NVG KY++ V ++ Sbjct: 636 RAGVMALHTQVSNGDVTMVNVGKDKYLLRGENGFLVKRD 674 >gi|317120711|gb|ADV02533.1| hypothetical protein SC2_gp045 [Liberibacter phage SC2] gi|317120772|gb|ADV02593.1| hypothetical protein SC2_gp045 [Candidatus Liberibacter asiaticus] Length = 753 Score = 164 bits (416), Expect = 4e-38, Method: Composition-based stats. Identities = 136/723 (18%), Positives = 283/723 (39%), Gaps = 58/723 (8%) Query: 16 PNQDAVRDSVNPQAGLRDLGDALGKATQFL-EGIRRDNAFATANTRYTELSFKAVQDFHD 74 P ++ V+ +P GLRD+ + L ++T+ + E +RR +AF A++ + + + +A + F++ Sbjct: 16 PVKEHVQQRYSPDEGLRDIANKLNQSTKLIEETVRRQSAF-KADSEFLQTTVEAEKVFNE 74 Query: 75 FTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKG 134 L + + ++ + I+ Y + + + +VR+ Q + + K Sbjct: 75 KLQGLQGGNEEEVKKNLESILSNDIKPLYGKMHEKLDYPEVRRHILQQSNKQMARFQLKA 134 Query: 135 VDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKL 194 + ++G + E+ + + +L D S+ NY ++VQ +D I+ LP++ K Sbjct: 135 NEYKLGFQVQRTEEGIREIERSVSNSLLLDGSDSNYREKVQFASDAIDKLPLNPTTKDAR 194 Query: 195 LSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVE 254 +E Q +P+VF +F++ +G D ++++D+ D+ + V + Sbjct: 195 RRTTRENAALAQAQRYMVDNPQVFSSFIQKREGQGSKIDDKSTIADIVDNTPLDTFSVND 254 Query: 255 DVSKAIG-LAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK 313 V GW L K+ +L+ +++ + R + + I + L GIT K Sbjct: 255 TVVDNTNPFEGWKDLSTQKKIAILKEVANASTTGKQEERSRVGTKIKNIASFLERGITPK 314 Query: 314 PSELIPL--EDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEY 371 + L ++ +G +G L + Q AP + +I LM + L K+ + Sbjct: 315 NIQDADLSEDNLRHLFGNTRGQMLSSQLQTLKEFAPKINQIFLMPNDQVETELTKLKPDV 374 Query: 372 ISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADS 431 +L K K+ ++ +E + I W I + + D + D Sbjct: 375 -----QNLVGAEFKNKLYSDGVKAVYKNDKERKEHPIKWAITHGLTQEMP-DEPSKWGDF 428 Query: 432 LRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAY---NTLSDSD 488 R + +K Y +++ +K E +L L K + + V + L +S Sbjct: 429 FVHRREVSQTLKGFYGVSSPLISKEESRKLTQYLDKLESKDVVSHLEDYAERLGGLKNSV 488 Query: 489 KEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSG-----MKHQKDTE----TRY 539 S H + +++ D + A +L G ++ R+ Sbjct: 489 WIEAMQSLEHPVMGEVGLLLK-------DDPDVASDVLRGYKYRVRNRGENIRDITGERF 541 Query: 540 NTDHKSNKFDSLYDSYIN---------TPLTKLEQSTAGGNFNKDK-----EAIKLYLLG 585 + K + + + + +KL G DK EA L Sbjct: 542 DAWFKERFREKYGNVFASIGDYGDAEFDRASKLVGYHLAGELLSDKTYSFHEAPTQSLKE 601 Query: 586 SMKDSGNYTLNRVRVSDAMQIVLGNTPVNIN--ESMLMPPRGMSKTDFEDRLWYATKDT- 642 + ++ + + + V+GN PV + +S L+PPRGMS DF ++ + Sbjct: 602 WVLRQPASQYDKN-LQHSFEAVVGNIPVPMGDFKSPLIPPRGMSSEDFTNKFTAVAQQVF 660 Query: 643 -------GEYDPYTIKYMNVGSGKYMIIKNGNPKVD-KEGKTIIINVE--DVNRDERMES 692 EY I + GS +Y++ + K G ++I V + ++ ++ Sbjct: 661 EESGLSGKEYGYENIGPTSDGSAQYLVKFGREYIHNPKTGAPLVIEVSPYSIEHEQTIQK 720 Query: 693 TIR 695 T++ Sbjct: 721 TLK 723 >gi|315122890|ref|YP_004063379.1| hypothetical protein CKC_05730 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496292|gb|ADR52891.1| hypothetical protein CKC_05730 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 583 Score = 156 bits (393), Expect = 2e-35, Method: Composition-based stats. Identities = 138/554 (24%), Positives = 248/554 (44%), Gaps = 25/554 (4%) Query: 147 EDNLNMTVGLAAAHVLHDPSNENY-FQRVQ--SITDHINSLPIDLRLKQKLLSEAKEKLN 203 E ++ T A ++ P+NEN QR + + D P D K++ L + L+ Sbjct: 6 EKHIGDTAVALALNIRLSPTNENLELQRNRGYDVIDKSLLTPEDKVKKKQALDKT---LH 62 Query: 204 TNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVE-DVSKAIGL 262 +Q IG + +P + N Y KD S ++++A ++E+ E + G Sbjct: 63 KSQTIGLLSNEPDIVGNLSSLVYGSPVAVKDGLSPDQIAENADGGTIEIDEHKLDGKTGY 122 Query: 263 AGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHG-ITVKPSELIPLE 321 G D L + LL+ + + NA +K + + N G I+ E+ Sbjct: 123 TGIDSLSREDLKSLLDEHNRKTNAERQSGKKRVIETIKLRTTEANKGNISSDYDEVFSES 182 Query: 322 DYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEY-ISNPSLSLA 380 + ++ Y + + K AP + ++ M+ + +F+ ++ + + Sbjct: 183 NLSRYYQPADVESIITQAKLKKDIAPYIRVVETMTNEEYAEFVSTVNSRTVDYDLNDRFK 242 Query: 381 STMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVK 440 + + ++ S ++EL++D W + P A S+ + Sbjct: 243 AQAFLKELQDKRVAS----LKELSKDPHGWQRSRGLVPPNLSLEAGQLASSVLPIFDANE 298 Query: 441 KIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIR--GAYNTLSDSDKEGVRSSFAH 498 K + D+ + +E QL ++ A + V R ++ SD E ++S Sbjct: 299 KTEKDHGVIVKGMGTDKERQLSEKIKGERAEDFVSYFRDEMTKEGVTKSDIEKIKSVVDG 358 Query: 499 IEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINT 558 ++D S++ RL+ S +A+ +A+ ++SG+KH+ D E + + K N L+++ IN Sbjct: 359 MKDKVTSSICRLAMSDSAEARASAIPVISGVKHRGDIELKLESS-KGNGVKKLFNNLINK 417 Query: 559 PLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINES 618 + +L Q + N+ +D E IKLY++G+M +GNYTLN V DA++ V GNT +N S Sbjct: 418 EIGQLYQGSEDANYKQDAEVIKLYIMGNMHKTGNYTLNGEVVRDAVKAVFGNTAYAVNGS 477 Query: 619 MLMPPRGMSKTDFEDRLWYATKD-----TGEYD----PYTIKYMNVGSGKYMIIKNGNPK 669 +MPPRGMS +F +RL T D G+ P + Y + G GKY + G K Sbjct: 478 YVMPPRGMSHYEFGNRLHGLTSDKLVGLFGDKSKDRYPESYGYQSEGDGKYSLTVGGVYK 537 Query: 670 VDKEGKTIIINVED 683 DK+G I+IN+ D Sbjct: 538 KDKQGNPIVINIYD 551 >gi|315121928|ref|YP_004062417.1| hypothetical protein CKC_00890 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495330|gb|ADR51929.1| hypothetical protein CKC_00890 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 547 Score = 147 bits (370), Expect = 8e-33, Method: Composition-based stats. Identities = 134/547 (24%), Positives = 242/547 (44%), Gaps = 25/547 (4%) Query: 147 EDNLNMTVGLAAAHVLHDPSNENY-FQRVQ--SITDHINSLPIDLRLKQKLLSEAKEKLN 203 E ++ T A ++ P+NEN QR + + D P D K++ L + L+ Sbjct: 6 EKHIGDTAVALALNIRLSPTNENLELQRNRGYDVIDKSLLTPEDKVKKKQALDKT---LH 62 Query: 204 TNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVE-DVSKAIGL 262 +Q IG + +P + N Y KD S ++++A ++E+ E + G Sbjct: 63 KSQTIGLLSNEPDIVGNLSSLVYGSPVAVKDGLSPDQIAENADGGTIEIDEHKLDGKTGY 122 Query: 263 AGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHG-ITVKPSELIPLE 321 G D L + LL+ + + NA +K + + N G I+ E+ Sbjct: 123 TGIDSLSREDLKSLLDEHNRKTNAERQSGKKRVIETIKLRTTEANKGNISSDYDEVFSES 182 Query: 322 DYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEY-ISNPSLSLA 380 + ++ Y + + K AP + ++ M+ + +F+ ++ + + Sbjct: 183 NLSRYYQPADVESIITQAKLKKDIAPYIRVVETMTNEEYAEFVSTVNSRTVDYDLNDRFK 242 Query: 381 STMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVK 440 + + ++ S ++EL++D W + P A S+ + Sbjct: 243 AQAFLKELQDKRVAS----LKELSKDPHGWQRSRGLVPPNLSLEAGQLASSVLPIFDANE 298 Query: 441 KIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIR--GAYNTLSDSDKEGVRSSFAH 498 K + D+ + +E QL ++ A + V R ++ SD E ++S Sbjct: 299 KTEKDHGVIVKGMGTDKERQLSEKIKGERAEDFVSYFRDEMTKEGVTKSDIEKIKSVVDG 358 Query: 499 IEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINT 558 ++D S++ RL+ S +A+ +A+ ++SG+KH+ D E + + K N L+++ IN Sbjct: 359 MKDKVTSSICRLAMSDSAEARASAIPVISGVKHRGDIELKLESS-KGNGVKKLFNNLINK 417 Query: 559 PLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINES 618 + +L Q + N+ +D E IKLY++G+M +GNYTLN V DA++ V GNT +N S Sbjct: 418 EIGQLYQGSEDANYKQDAEVIKLYIMGNMHKTGNYTLNGEVVRDAVKAVFGNTAYAVNGS 477 Query: 619 MLMPPRGMSKTDFEDRLWYATKD-----TGEYD----PYTIKYMNVGSGKYMIIKNGNPK 669 +MPPRGMS +F +RL T D G+ P + Y + G GKY + G K Sbjct: 478 YVMPPRGMSHYEFGNRLHGLTSDKLVGLFGDKSKDRYPESYGYQSEGDGKYSLTVGGVYK 537 Query: 670 VDKEGKT 676 DK+G Sbjct: 538 KDKQGHP 544 >gi|293609611|ref|ZP_06691913.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828063|gb|EFF86426.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 683 Score = 96.4 bits (238), Expect = 1e-17, Method: Composition-based stats. Identities = 69/445 (15%), Positives = 151/445 (33%), Gaps = 51/445 (11%) Query: 233 KDSTSL-SDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKL 291 +D T + S ++ + + S+E +++V A GL + +D + ++ + S+ L Sbjct: 203 RDITDINSRITTAIAQNSVEGLQEV--ATGLKDYKFIDGSAVQKFQTEIQSKITTLQQ-- 258 Query: 292 RKETQAQARRIDAQ------LNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 R++ Q R +A+ + +T +P ++ D QA +GT QF Sbjct: 259 RQQVQENKRINEAEKVVNEFIQSTLTGRPLDMRYQNDVEQAV---KGTPSEAEYQFYKKQ 315 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + D R + ++T + + ++ S A+ + K + + ++ + Sbjct: 316 SADFMRFQGLTTDQQLAEINSRKAKMKNSSS---ANPVAENKILSTYQSIYDNKLKTAKE 372 Query: 406 DAISW----GIKYKQIDPLRFD-TEESFADSLRQRAGF-VKKIKDDYNLTTSHFNKTEEN 459 + GI+ +++PL FA ++ + V + D N T Sbjct: 373 NPTQALREKGIELPEVNPLTLKVNPSDFAKNIVTIGSYQVAQRDKDPNATIKPIPNEALP 432 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLS--DSDKEGVRSSFAHIEDNGLSA-----VVRLSS 512 + + ++ +DLI N + + + + NG +A R ++ Sbjct: 433 AAKQAWEEANVNQKLDLISSMINQTKGVKNGAKIWGEALGQL-GNGDAAYQMAGYARANN 491 Query: 513 EFSDDAKNAAMVILSGMKHQKDTET--RYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGG 570 SD + A I++G + K+ + + K + S E + Sbjct: 492 FRSDAGLDVATAIVAGKQALKNKQMIQPKDALLKEKFNQYVGQSVSG------ETANLNY 545 Query: 571 NFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTP-----------VNINESM 619 + A G + A+ + G I++ Sbjct: 546 AAFQSIYAYLTEARGQTHKDAD-EYKEDIGRTALGLATGGVYTQSGRFKDYTDRGISDWK 604 Query: 620 LMPPRGMSKTDFEDRLWYATKDTGE 644 + P GM FE ++ D + Sbjct: 605 VSKPYGMKDETFEAKIQKGYADISK 629 >gi|319793415|ref|YP_004155055.1| hypothetical protein Varpa_2746 [Variovorax paradoxus EPS] gi|315595878|gb|ADU36944.1| hypothetical protein Varpa_2746 [Variovorax paradoxus EPS] Length = 732 Score = 91.4 bits (225), Expect = 5e-16, Method: Composition-based stats. Identities = 64/403 (15%), Positives = 132/403 (32%), Gaps = 65/403 (16%) Query: 331 QGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKE 390 +GT D ++ M D ++++Q + E + +LA + K Sbjct: 316 EGTPYAEDFNRLVKQERDTQQVLRMPMADQQRYIQTREAE-LQQAGGTLADRANLQRIKA 374 Query: 391 ILEKSHRQSMQELNQDAISWGIKY--KQIDPLRFDT------EESFADSLRQRAGFVKKI 442 +E + ++L + + + + + PL + A RA + + Sbjct: 375 TVENNA----KQLADEPLVAAQRLFGRAVQPLDPNDLLQPGGAGRAAGIFADRAATLAAM 430 Query: 443 KDDYNLTTSH--FNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHI- 499 + Y E L + +++ L ++D D R+ I Sbjct: 431 QKQYGSQVGTKPLLPQEAGLLVKAIDSAGPADANQLFGTLRGAINDDD--TYRAVMQQIA 488 Query: 500 EDNGLSAV---------------------VRLSSEFSDDAKNAAMVILSGMKHQKDTE-- 536 D+ + A VR+SS + A +IL+ K QK + Sbjct: 489 PDSPVKARAGMLTAIDRPVTIQSNLIASDVRVSSAKVAETMLAGEIILNKTKGQKQVDGQ 548 Query: 537 -----TRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSG 591 D + DS+ + Y + P + A K LG + + Sbjct: 549 AHSLFVPARQDFSTAFVDSVGNLYRSRPGAQEGDLQAAF----AYYVGKSAELGRLASTP 604 Query: 592 NYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKD---------- 641 ++ V +A+ +G+ + P GM+ +D + +L + Sbjct: 605 KD-IDSTLVKEAITATIGDVVDVNGQGKAKAPIGMTGSDMQAKLRERFSEMVKEQDMPPS 663 Query: 642 -TGEYDPY-TIKYMNVGSGKYMIIKNGNPKVDKEGKTIIINVE 682 ++ Y I Y G+YM+ G P + K G ++++++ Sbjct: 664 AVASFEHYGVINYRR--DGQYMLTLGGAPVLGKGGAPLVVDLD 704 >gi|169795394|ref|YP_001713187.1| hypothetical protein ABAYE1265 [Acinetobacter baumannii AYE] gi|169148321|emb|CAM86186.1| hypothetical protein ABAYE1265 [Acinetobacter baumannii AYE] Length = 683 Score = 90.2 bits (222), Expect = 1e-15, Method: Composition-based stats. Identities = 61/433 (14%), Positives = 145/433 (33%), Gaps = 44/433 (10%) Query: 233 KDSTSL-SDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKL 291 +D T + S ++++ E S+E +++V A L + ++ ++ + S+ L + Sbjct: 203 RDITDIQSKITNAISENSVEGLQEV--ATSLKDYKFINGQAVQKFQTEIQSKITTLQQRQ 260 Query: 292 RKETQAQARRIDAQLN----HGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAP 347 + + + + LN + +T +P +L + +A +GT F + + Sbjct: 261 QVQENKRINEAEKVLNEYKQNVLTGRPMDLTYQTNVEKAV---KGTPSETEYNFYTKQSS 317 Query: 348 DVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDA 407 D R + +ST + K ++ S A + K + + ++ ++ Sbjct: 318 DFLRFQKLSTDQQLAEINKRKANMKNSSS---ADAVAENKILATYQSIYDNKLKTAKENP 374 Query: 408 ISW----GIKYKQIDPLRFD-TEESFADSLRQRAGF-VKKIKDDYNLTTSHFNKTEENQL 461 GI+ +++PL FA ++ + V + D N T Sbjct: 375 TQALREKGIELPEVNPLTLKVNPSDFAKNIVTIGSYQVAQRDKDPNATIKPIPNEALPAA 434 Query: 462 RTQLVKRPASESVDLIRGA---YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDA 518 + + + ++LI + D K + N + ++ + Sbjct: 435 KQAWEEATVDQKLNLISSMIAQTKGVKDGVKIWGAALGQLGGGNSNYVMAGVAKANGYRS 494 Query: 519 KN---AAMVILSGMKHQKDTET--RYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFN 573 A I+ G + K+ + D + + + T + Sbjct: 495 TEGRELANSIVIGTQLLKNKQLIMPKEDDMREAFNKYVGQTLTGT------NANNAYEVF 548 Query: 574 KDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTP-----------VNINESMLMP 622 K A + G S + N+ ++ A+ + G +++ + Sbjct: 549 KAVYADTMNERGFSHSSKDEAPNKKILNTALGMATGGVYTQPTSFRNYRGDKVSDWKVTK 608 Query: 623 PRGMSKTDFEDRL 635 P G++ FE +L Sbjct: 609 PYGITDDAFEAQL 621 >gi|332875214|ref|ZP_08443047.1| putative methyl-coenzyme M reductase, alpha subunit [Acinetobacter baumannii 6014059] gi|332736658|gb|EGJ67652.1| putative methyl-coenzyme M reductase, alpha subunit [Acinetobacter baumannii 6014059] Length = 680 Score = 86.0 bits (211), Expect = 2e-14, Method: Composition-based stats. Identities = 64/443 (14%), Positives = 149/443 (33%), Gaps = 47/443 (10%) Query: 233 KDSTSL-SDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKL 291 +D T + S ++ + + S+E +++V A GL + ++ ++ + S+ L+ + Sbjct: 200 RDITDINSRITTAIAQNSVEGLQEV--ATGLKDYKFINGQAVQKFQTEIQSKITTLHQRQ 257 Query: 292 RKETQAQARRIDAQLN----HGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAP 347 + + + + LN + +T +P +L + +A +GT F + + Sbjct: 258 QVQENKRINEAEKVLNEYKQNVLTGRPMDLTYQTNVEKAV---KGTPSETEYNFYTKQSS 314 Query: 348 DVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDA 407 D R + +ST + K ++ S A + K + + ++ ++ Sbjct: 315 DFLRFQKLSTDQQLAEINKRKANMKNSSS---ADAVAENKILATYQSIYDNKLKTAKENP 371 Query: 408 ISW----GIKYKQIDPLRFD-TEESFADSLRQRAGF-VKKIKDDYNLTTSHFNKTEENQL 461 GI+ +++PL +FA ++ + V + D N T Sbjct: 372 TQALREKGIELPEVNPLTLKVNPSAFAKNIVTIGSYQVAQRDKDPNATIKPIPNEALPAA 431 Query: 462 RTQLVKRPASESVDLIRGAYNTLS--DSDKEGVRSSFAHIEDNGLSA-----VVRLSSEF 514 + + + ++LI + + + + NG +A R ++ Sbjct: 432 KQAWEEATVDQKLNLISSMIAQTKGIKNGAKIWGEALGQL-GNGDAAYQMAGYARANNFR 490 Query: 515 SDDAKNAAMVILSGMKHQKDTET--RYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNF 572 SD + A I++G + K+ + + K + S E + Sbjct: 491 SDAGLDVATAIVAGKQALKNKQMIQPKDALLKEKFNKYVGQSVSG------ETANLNYAA 544 Query: 573 NKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTP-----------VNINESMLM 621 + A G + A+ + G I++ + Sbjct: 545 FQAIYAYLTEARGQTHKDAD-EYKEEIGRTALGLATGGVYTQSGRFKDYTDRGISDWKVS 603 Query: 622 PPRGMSKTDFEDRLWYATKDTGE 644 P GM+ FE ++ D + Sbjct: 604 KPYGMTDATFEAKIQKGYADISK 626 >gi|298485993|ref|ZP_07004067.1| Structural protein P5 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159470|gb|EFI00517.1| Structural protein P5 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 679 Score = 86.0 bits (211), Expect = 2e-14, Method: Composition-based stats. Identities = 51/397 (12%), Positives = 124/397 (31%), Gaps = 48/397 (12%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR---------RIDAQLNHGITVKPSEL 317 +LD KR +LL ++ R K E QA+ R ++D Q GI P+ Sbjct: 243 KLDPEKRTQLLNTITGR--IYQIKEHNERQAEIRENKAERALTQMDRQAATGIPPTPA-- 298 Query: 318 IPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSL 377 D + + GT + + +V + ++++ + + +N Sbjct: 299 ----DQQRWQSLVSGTSAAGEFKERINQMAEVQGLLRQPIEAQQQYVDQKRQQMAANGG- 353 Query: 378 SLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKY--KQIDPLRFDT------EESFA 429 +A L+ + +++ + D +S+ ++PL +++ Sbjct: 354 GVAEQANL----NRLQSAIDTNVKLMKTDPLSFSALRTGSDVEPLDLSQIGTTEGQQALG 409 Query: 430 DSLRQRAGFVKKIKDDYNLTTSH--FNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDS 487 + + R ++ Y S F E ++ + ++ + Sbjct: 410 EQIASRFDVTNAVRKKYGPEVSRVPFKAQELEAIKAGYEAADDKTKIGILAAVGGAAP-T 468 Query: 488 DKEGVRSSFAHIEDNGLSAVVRLSSEFSDDA---KNAAMVILSGMKHQKD--TETRYNTD 542 + + A + ++ + ++ A + A ++ G K KD T ++ Sbjct: 469 GPDAAAAIKAIAPEQPITLLAGMAQYRGLKAADGTDVAKTLMVGSKILKDKSVATPEDSV 528 Query: 543 HKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSD 602 +++ + ++ + K A G D ++ Sbjct: 529 MQASFESIVGNAMPGGTQQR----EHAYTGFKAIYAGLAEASGKRYDPAVKDVDGALAKK 584 Query: 603 AMQIVLGNTPVN------INESMLMPPRGMSKTDFED 633 A ++ G + ++ P GM F+ Sbjct: 585 AADMITGGVTTRGGGWFSGSSYKVIKPYGMDDDAFDK 621 >gi|332160977|ref|YP_004297554.1| Hypothetical structural protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665207|gb|ADZ41851.1| Hypothetical structural protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862133|emb|CBX72297.1| hypothetical protein YEW_AK02340 [Yersinia enterocolitica W22703] Length = 708 Score = 84.1 bits (206), Expect = 8e-14, Method: Composition-based stats. Identities = 68/459 (14%), Positives = 156/459 (33%), Gaps = 68/459 (14%) Query: 266 DRLDDTKRRRLLEHLSSRDNALNTKLR---KETQAQARRIDAQLNHGITVK-PSELIPLE 321 D+LD KR LL S L + + +A+A R ++ I++ P+ E Sbjct: 234 DKLDPEKRNELLNQSMSYQVTLENRAQVAAARREAKAGRAFSEFERQISMDLPTTA---E 290 Query: 322 DYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLAS 381 +GT + + + + A+ ++ ++ +N Sbjct: 291 QQVNLISATKGTAVEADVKDLLANQVKIREVISQGPLAAQNYVNELSANLTTNGGD---- 346 Query: 382 TMMATKYKEILEKSHRQSMQELNQDAISWGIKY--KQIDPLRF----------------- 422 M + ++L + ++S ++L +++ + + PL Sbjct: 347 -MKQWRNLQLLNTAVQKSNKQLMDSPLTFNQNRTGEAVQPLDMMAVSPTLSAVGSAIEGK 405 Query: 423 DTEESFADSLRQRAGFVKKIKDDYNLTTSH--FNKTEENQLRTQLVKRPASESVDLIRGA 480 D + FA+ + R+ + + + H E L + L +P E ++ Sbjct: 406 DISDRFAEQIANRSATLDALSQNTGFKVPHKILLPQEVAVLTSTLKDKPTGEQAAILAQL 465 Query: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTE---- 536 ++ D ++ I + L+ SD + + +L G + K + Sbjct: 466 RKSIPDDRD--YKAVMQQIYPE--APTAALAGSLSD--QKSTNKVLQGYEQLKANKGDMT 519 Query: 537 --TRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYT 594 D + +++ + Y +G+ G Y+ Sbjct: 520 NWNAAKKDFVTEFDSITGNAFAGRHEARQGAINIAF----------AYHVGNASQKGEYS 569 Query: 595 LN--RVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDR---LWYATKDTGEYDPYT 649 + +V DA++ +G + ++ + + P G+S +DF+ R L Y + D Sbjct: 570 TDISGSQVKDAIKNTVGES-TSLGGAQVFVPVGVSVSDFKYRYPALLYKALERQGMDVSR 628 Query: 650 IKYMNV-------GSGKYMIIKNGNPKVDKEGKTIIINV 681 ++ G Y + + + VD +G+ ++IN+ Sbjct: 629 ANQASLFNLLPAGPDGDYFVAQGRDALVDAKGRPLMINL 667 >gi|254251751|ref|ZP_04945069.1| hypothetical protein BDAG_00948 [Burkholderia dolosa AUO158] gi|124894360|gb|EAY68240.1| hypothetical protein BDAG_00948 [Burkholderia dolosa AUO158] Length = 698 Score = 79.1 bits (193), Expect = 2e-12, Method: Composition-based stats. Identities = 70/468 (14%), Positives = 133/468 (28%), Gaps = 77/468 (16%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHG--ITVKPSELIPLEDYT 324 +LD KR ++L + S +L ++ E QA+ R N + + P E Sbjct: 248 KLDAGKRNQVLASVVSNRLSLQNQMTSEQQAREREAVTAFNQATDLMTQGKRFSP-EYVQ 306 Query: 325 QAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMM 384 Q +GT L Q A A +S + + +Q + + + Sbjct: 307 QLTAATRGTALEQQTQGVIKQAAVGASFSTLSVPEMRAAVQANESK-QNQSGTDPLEAAA 365 Query: 385 ATKYKEILEKSHRQSMQELNQDAISWGIKYKQID---PLRFDTEESFADSLRQRAGFVKK 441 + K+IL + + D + ++ ID P+ A SL RA Sbjct: 366 IKQQKQILTATDEAYKR----DPWNAALERGAIDVVPPIDTSGITQLASSLAARAQLAPV 421 Query: 442 IKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDK------------ 489 ++ S E + + P + + ++ ++ + Sbjct: 422 VEHKAARRVSLLTPDEARNVLQTIDALPTNTKAQALALLGRSMGNAARINDLAEQWKDKS 481 Query: 490 -EGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKD----TETRYNTDHK 544 + A D ++ S A IL G + + T + Sbjct: 482 PAAALAMKAGAADPSGGPLMMQSG------MPVAQYILDGQDALANKLVKVDAAAATGLQ 535 Query: 545 SNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAM 604 + + D+ P +E + S + +G N V + Sbjct: 536 ATIAKRIGDAL--PPQ----------QLGDARETAYFAAVASARKNGRDVPNSTDVETGI 583 Query: 605 QIVLGNTPVNIN------ESMLMPPRGMSKTDFEDRLWYA-----TKDTGEYDPYTI--- 650 + G M P G S DF+ + A G ++ Sbjct: 584 NVATGGLAKTGGIDPRGDRYMAAKPWGWSDDDFDGGVKQASITNIENQPGGRPVDSVIAN 643 Query: 651 ----------------KYMNVG-SGKYMIIKNGNPKVDKEGKTIIINV 681 + VG G Y ++ P D G ++I++ Sbjct: 644 GTKIPIDQFMQQFASYRLQRVGIGGTYTVLTGARPVTDTTGAPLLIHL 691 >gi|221201508|ref|ZP_03574547.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207936|ref|ZP_03580942.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221172121|gb|EEE04562.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221178776|gb|EEE11184.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 706 Score = 75.6 bits (184), Expect = 3e-11, Method: Composition-based stats. Identities = 47/332 (14%), Positives = 94/332 (28%), Gaps = 53/332 (15%) Query: 391 ILEKSHRQSMQELNQDAISWGIKYKQ---IDPLRFDTEESFADSLRQRAGFVKKIKDDYN 447 LE+ H S+ D + + + + + SL RA ++ Sbjct: 359 QLEQIHTASVAAYKADPWNAALDRGVLQGVPQVDTSSVPGLVTSLASRAQAAGAVEQAAG 418 Query: 448 LTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG-------------VRS 494 S E + T + P + ++ + + Sbjct: 419 RRVSLLTPDEAQTVLTAVDALPIDTKAQALNQIGQAFGNAGRIADLAAQWKEKSPAMALA 478 Query: 495 SFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN----TDHKSNKFDS 550 A D G ++ S + IL+G + KD + + T ++ ++ Sbjct: 479 LKAGAADAGGRPLMTTSGA------PLSTFILAGAQALKDKTVKIDDVAGTGMRAAIANA 532 Query: 551 LYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGN 610 + DS P + A A G + S N + + G Sbjct: 533 IGDSL--PPEQAEDAKEAAYFIA----AGSAARGGRTQPSSTDVQNAISGATGGISTTGG 586 Query: 611 TPVNINESMLMPPRGMSKTDFEDRLWYAT-KDTGEYDPYTI------------------- 650 +N +++ P G + DF+ + T + T+ Sbjct: 587 LQINGKPNIVAMPYGWREEDFQSAVKSVTAANIENPGVDTVVANGHEIPVADFVKQFASY 646 Query: 651 KYMNVG-SGKYMIIKNGNPKVDKEGKTIIINV 681 + + VG G Y + DK G I +++ Sbjct: 647 RLVRVGVRGTYAVSTGSKFVTDKTGAPITVHL 678 >gi|221213944|ref|ZP_03586917.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166121|gb|EED98594.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 706 Score = 75.2 bits (183), Expect = 3e-11, Method: Composition-based stats. Identities = 46/334 (13%), Positives = 95/334 (28%), Gaps = 57/334 (17%) Query: 391 ILEKSHRQSMQELNQDAISWGIKYKQ---IDPLRFDTEESFADSLRQRAGFVKKIKDDYN 447 LE+ H S+ D + + + + + SL RA ++ Sbjct: 359 QLEQIHTASVAAYKADPWNAALDRGVLQGVPQVDTTSVPGLVTSLASRAQAAGAVEQAAG 418 Query: 448 LTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG-------------VRS 494 S E + + P + ++ + + Sbjct: 419 RRVSLLTPDEAQTVLNAVDALPIDTKAQALNQIGQAFGNAGRIADLAAQWKEKSPAMALA 478 Query: 495 SFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDS 554 A D G ++ S + IL+G + KD + + + ++ ++ Sbjct: 479 LKAGAADAGGRPLMTTSGA------PLSTFILAGAQALKDKTVKIDDVAGTGMRATIANA 532 Query: 555 YINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPV- 613 ++ T+ + KEA GS G + V +A+ G Sbjct: 533 IGDSLPTEQAED--------AKEAAYFIAAGSAARGGRTQPSSTDVQNAISGATGGISTT 584 Query: 614 -----NINESMLMPPRGMSKTDFEDRLWYAT-KDTGEYDPYTI----------------- 650 N +++ P G + DF+ + T + T+ Sbjct: 585 GGLQINGKPNIVAMPYGWREEDFQSAVKSVTAANIENPGVDTVVANGHEIPVADFVKQFA 644 Query: 651 --KYMNVG-SGKYMIIKNGNPKVDKEGKTIIINV 681 + + VG G Y + DK G I +++ Sbjct: 645 SYRLVRVGVRGTYAVKTGSKFVTDKSGTPITVHL 678 >gi|262371856|ref|ZP_06065135.1| predicted protein [Acinetobacter junii SH205] gi|262311881|gb|EEY92966.1| predicted protein [Acinetobacter junii SH205] Length = 663 Score = 73.3 bits (178), Expect = 2e-10, Method: Composition-based stats. Identities = 69/406 (16%), Positives = 148/406 (36%), Gaps = 45/406 (11%) Query: 317 LIPLEDYTQAYGVE-QGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNP 375 LIP ED ++ +GTE + S A +V + + + + +L K E + Sbjct: 276 LIPSEDVIKSRLARVKGTEKESEFVQYSGALVEVQQFMRLGPDEREAYLSKRRVEAQNTA 335 Query: 376 SLSLASTMMATKYKEILEKSHRQS-MQELNQDAISWGIKYKQ------IDPLRFDTEESF 428 + + +L K+H E N +++ IK Q + + E+ Sbjct: 336 QDN---PKDVSWKLNLLSKTHENMLNYEKNNSTLAYSIKTGQDLTVVPTNAILSGNPEAI 392 Query: 429 ADSLRQRAGFVKKIKDDYNLTTSH-----FNKTEENQLRTQLVKRPASESVDLIRGAYNT 483 A + +K I + L + F+ ++++L+ K + + L+ Y + Sbjct: 393 AA----LSKNIKSIHANNVLNGTVGSLNPFSTQQQSELKQFWEKAKPGDKLSLLTSLYKS 448 Query: 484 LSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDA-KNAAMVILSGMKHQKDTETRYNTD 542 S + R I + RLS+ ++ ++ A I++G +D + Sbjct: 449 -SAGNANASRDMINGIAGDSG--AYRLSASLNNRGLQDIAGQIVTG----QDLIEKGLVK 501 Query: 543 HKSNKFDSLYDSY-INTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSG----NYTLNR 597 + L ++Y + K A + + DS N T++ Sbjct: 502 VDESGLTKLTEAYLAGITSPGKPDFQIYLDSIKANYAYLVQKSEKVTDSKGNILNKTIDE 561 Query: 598 VRVSDAMQIVLGNTPV----NINESMLMPPRGMSKTDFEDRLW-YATKDTGEYD-PYTIK 651 + A + V G ++S+++ P + + F ++L + +++ Y Sbjct: 562 ELFNKAAKNVTGGKFTSGGFFGSKSVVLRPHTVGEKSFREQLESFNSRNARNYGGSDKDF 621 Query: 652 YMNVGSGKYMIIKNGNPKVD--KEGKTIIINVEDVNRDERMESTIR 695 ++++ + NP V K G I++ D R +R+ T+R Sbjct: 622 FLDLPLEQ----DPKNPYVYYFKNGTKYIMDATDKKRQKRLAFTVR 663 >gi|48697205|ref|YP_024935.1| hypothetical protein BcepC6B_gp15 [Burkholderia phage BcepC6B] gi|47779011|gb|AAT38374.1| gp15 [Burkholderia phage BcepC6B] Length = 706 Score = 71.4 bits (173), Expect = 5e-10, Method: Composition-based stats. Identities = 45/394 (11%), Positives = 113/394 (28%), Gaps = 57/394 (14%) Query: 333 TELYNLQQFKSVAAPD---VARIKLMSTFDAKKFLQKIDDEYISNP-SLSLASTMMATKY 388 +L + +VA + + F + Q+ + + + Sbjct: 295 NQLLTNTEGTTVAGAVRELIGTASQHAGFASASLPQQAATLQQYQTEAATRGTDPDQAAS 354 Query: 389 KEILEKSHRQSMQELNQDAISWGIKYKQ---IDPLRFDTEESFADSLRQRAGFVKKIKDD 445 + LE+ H S+ D + + + + + SL RA ++ Sbjct: 355 VKQLEQIHTASVAAYKADPWNAALDRGVLQGVPKVDTSSVPGLVSSLAARAQTAGAVEQA 414 Query: 446 YNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLS 505 S E + T + P + ++ + ++ + ++ Sbjct: 415 AGRRVSLLTPDEAEKTLTAVDSLPIDTKAQALNQIGQAFGNAGRIADLAAQWKEKSPAMA 474 Query: 506 AVVRL-SSEFSDDA------KNAAMVILSGMKHQKDTETRYN----TDHKSNKFDSLYDS 554 ++ +++ + IL+G + KD + + T ++ +++ D+ Sbjct: 475 LALKAGAADVGGKPLLTTSGAPLSTFILTGAQALKDKTVKIDEVAGTGMRATIANAIGDA 534 Query: 555 YINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVN 614 P + A GS G + V +A+ G Sbjct: 535 L--PPEQADDAKEAAYFIA----------AGSAARGGRTQPSSTDVQNAISGATGGISTT 582 Query: 615 INE------SMLMPPRGMSKTDFEDRLWYAT-KDTGEYDPYTI----------------- 650 + +++ P G + DF+ + T + ++ Sbjct: 583 GGQQINGKPNIVAMPYGWREDDFQSAVKQVTAANIENPGVDSVLANGHEIPVADFVKQFA 642 Query: 651 --KYMNVG-SGKYMIIKNGNPKVDKEGKTIIINV 681 + + VG G Y + DK G + +++ Sbjct: 643 SYRLVRVGIRGTYAVSTGSKFVTDKTGAPVTVHL 676 >gi|294648409|ref|ZP_06725908.1| hypothetical protein HMP0015_0117 [Acinetobacter haemolyticus ATCC 19194] gi|292825714|gb|EFF84418.1| hypothetical protein HMP0015_0117 [Acinetobacter haemolyticus ATCC 19194] Length = 675 Score = 65.2 bits (157), Expect = 4e-08, Method: Composition-based stats. Identities = 57/394 (14%), Positives = 123/394 (31%), Gaps = 43/394 (10%) Query: 289 TKLRKETQAQARRID---AQLNHGITVKPSELIPLEDYTQAYGVEQ--GTELYNLQQFKS 343 ++++K + +++ +D AQL PL + +E GT+ + Sbjct: 247 SRIQKGQEIRSKELDKEHAQLTKDFVADAFTGYPLSESLVNQTLEAVKGTKYEAEVREAI 306 Query: 344 VAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQEL 403 D + + S + ++ + ++ E + K ++ + S Q Sbjct: 307 GLNKDAQKFRDASPIEQERSIARLTAELENV---GQEDATGLKKKLDVFKSIAATSKQRA 363 Query: 404 NQDAIS----------WGIKYKQID--PLRFDTEESFADSLRQRAGFVKKIKDDYNLTTS 451 N D ++ + + +QI + F + + L ++ + + Sbjct: 364 NDDPVAQVQSQTGHKLYTVTPEQIGTGQIDFKKAQITTELLAEQKKANGDVG-----SLI 418 Query: 452 HFNKTEENQLR-TQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDN-----GLS 505 +NK+E + P + L + + + S E N G++ Sbjct: 419 QWNKSERTAFKDRYFEASPKQQKAMLTDLTKMAGKNKEAQKEYFSLIGGEKNAYDYMGIA 478 Query: 506 AVVRLSSEFSDDAKNAAMVILSGM---KHQKDTETRYNTDHKSNKFDSLYDSYINTPLTK 562 + +L + AA V L G + + + + ++ Sbjct: 479 KLNQLDVTLHNTNIRAAEVALEGKQILNAGQASVLGAEKEFHNAIQAEFGNAAAIGTNEH 538 Query: 563 LEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPV---NINESM 619 + A + + D GN +N+ A I G T N + Sbjct: 539 RAYQNLAYSIYLGL-AKRGENIVKRDDKGNPIINKEMAKQAFDIATGGTYKQKLGKNTNH 597 Query: 620 LMPPRGMSKTDFEDRLWYA-----TKDTGEYDPY 648 + P G ++T FED + K+TG P Sbjct: 598 IFMPYGFTQTSFEDHIQNHFRTQYRKETGFLPPD 631 >gi|212710807|ref|ZP_03318935.1| hypothetical protein PROVALCAL_01875 [Providencia alcalifaciens DSM 30120] gi|212686504|gb|EEB46032.1| hypothetical protein PROVALCAL_01875 [Providencia alcalifaciens DSM 30120] Length = 913 Score = 57.5 bits (137), Expect = 7e-06, Method: Composition-based stats. Identities = 63/415 (15%), Positives = 141/415 (33%), Gaps = 35/415 (8%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITV--KPSELI 318 G +D LD++ + + L+ D +L+ E A +G+ PSE Sbjct: 400 GSVAFDALDESDQAKYLKQAEQMDKQRQQQLQIELGTTISDSYAAWENGLDAPNAPSE-- 457 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 + A+G ++GT + + A ++ K M+ + L I + + + Sbjct: 458 --KTIISAFGYDKGTAMIADMREAKRYAGLISAAKDMTPQAQRSLLNSIMPDPSES---N 512 Query: 379 LASTMMATKYK-EILEKSHRQSMQELNQDAISWGIKYK-QIDPLRFDTEESFADSLRQRA 436 AS M + + ++ + + + + + + I+ +DP +++ Sbjct: 513 YASKMQRWEKFGKFVDSNIKAQNKAFSANRLQLSIQNNFPLDPNDKSNQDAADHYFANNI 572 Query: 437 GFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSF 496 I+D+ +L + + +Q+ + A+ S +S + Sbjct: 573 ESSFNIRDENSLNAVAEISSRTGIIPSQIKSKLATGSTSRNPELVIPMSK--------MY 624 Query: 497 AHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRY------NTDHKSNKFDS 550 I DN SA+ LS++ + SGM +K + Y N K+ + Sbjct: 625 GQIFDNNPSAINGLSTDAMAYYGKIYDLTRSGMDSEKAIDIAYNLTYEQNDRTKAIIAEQ 684 Query: 551 LYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGN 610 + D + K Q F + + ++ Y + + DA G Sbjct: 685 MRDKKYSKERDKAAQGNINSFFALGGFS-SPNVTKPGINNREYMRDYQTLYDANFASTGG 743 Query: 611 TPVN---INESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKYMI 662 P + ++ + G++ + +D + + G+G ++I Sbjct: 744 DPDLAKKMTDAQVKKTWGVTSINGKDEVMRYAPEA------VYGTNESGAGNWII 792 >gi|227355849|ref|ZP_03840242.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164168|gb|EEI49065.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 1027 Score = 49.0 bits (115), Expect = 0.003, Method: Composition-based stats. Identities = 43/288 (14%), Positives = 99/288 (34%), Gaps = 25/288 (8%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITV--KPSELI 318 G A +D LD+ + + L +L+++ + A G+ P+ Sbjct: 496 GSAAFDALDEADQAKYLRQAEQLRKQKQGELQQQLGTRVADSYAAWERGLEAPNAPTH-- 553 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 ++ A+G ++G+ + Q A ++ K MS + L +I + + Sbjct: 554 --DELISAFGYDKGSAMSADMQEAKRYAGFMSAAKEMSPQAQQALLSQIRPQTGEV---N 608 Query: 379 LASTMMATKYK-EILEKSHRQSMQELNQDAISWGIKYK-QIDPLRFDTEESFADSLRQRA 436 S + + + +E + ++ ++ + + I+ +DP + +++ D + Sbjct: 609 YESKIQRWEKFGKFVEGNIKEQDKQFAANRLQLSIQNNFPLDPNDKNNQQAADDYFEKHI 668 Query: 437 GFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGV---R 493 ++DD +L + +Q+ + D E V Sbjct: 669 QQSFNLRDDNSLNAVAELTARTGIIPSQVK-----------SVLNMGATSKDPEVVLPIA 717 Query: 494 SSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNT 541 + I DN ++ + S + +GM K ET + T Sbjct: 718 KMYGQIFDNNPASATDIPSSTMAYYSKVYSLSRAGMPDDKAVETAFKT 765 >gi|301046397|ref|ZP_07193557.1| conserved domain protein [Escherichia coli MS 185-1] gi|300301623|gb|EFJ58008.1| conserved domain protein [Escherichia coli MS 185-1] Length = 915 Score = 49.0 bits (115), Expect = 0.003, Method: Composition-based stats. Identities = 50/383 (13%), Positives = 117/383 (30%), Gaps = 32/383 (8%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPL 320 G A +D + + + L + N + + + + L+ G+ KP +L Sbjct: 390 GNAAFDAMSPQMQIQALRQANELRNQYRQQYADQLSTVVKDAYSALDEGL--KPEKLPSE 447 Query: 321 EDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLA 380 +D+ +A G G + Q + + K ++ + L+++ + + P + Sbjct: 448 DDFIRANGPRIGAMKWKDMQAQIQYGGVIGAAKDLTPEGRQDILERLRPQDPNAPGFAAN 507 Query: 381 STMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVK 440 + E +Q+ +S +DP + + + Q Sbjct: 508 QQRWEKMQAKFKEMDREWEIQQGRNRFVSSMQNNFPLDPNDKNNQAAVDRYFAQDIAPSF 567 Query: 441 KIKDDYNLTTSHFNKTEENQLRTQLVK--------RPASESVDLIRGAYNTLSDSDKEG- 491 I D ++ T T+ + TQ+ R + V + + ++ Sbjct: 568 SISDPQSINTLVTVTTKSGMIPTQVKTMLNSGATSRDPALVVPMAKFYGQLFDNNPAAAA 627 Query: 492 -------------VRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETR 538 S A + + + + S F + +LS K Sbjct: 628 TLDKSTMAFYGKVYDYSRAGVPE-DKAVDMAYSQVF--QQDDRMKQMLSTAMRDKKYVAA 684 Query: 539 YNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRV 598 T ++N + +T +S A +D + I + +G Sbjct: 685 RATAAQNNASSLTSFGSWSPDITDPGKSNAAYQ--RDYQTI---YDANFVQTGGDAEQAE 739 Query: 599 RVSDAMQIVLGNTPVNINESMLM 621 ++++AM ++ +M Sbjct: 740 KMTNAMIRTTWGVSTVNGKAEVM 762 >gi|294489804|gb|ADE88560.1| conserved hypothetical protein [Escherichia coli IHE3034] Length = 904 Score = 49.0 bits (115), Expect = 0.003, Method: Composition-based stats. Identities = 50/383 (13%), Positives = 117/383 (30%), Gaps = 32/383 (8%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPL 320 G A +D + + + L + N + + + + L+ G+ KP +L Sbjct: 379 GNAAFDAMSPQMQIQALRQANELRNQYRQQYADQLSTVVKDAYSALDEGL--KPEKLPSE 436 Query: 321 EDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLA 380 +D+ +A G G + Q + + K ++ + L+++ + + P + Sbjct: 437 DDFIRANGPRIGAMKWKDMQAQIQYGGVIGAAKDLTPEGRQDILERLRPQDPNAPGFAAN 496 Query: 381 STMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVK 440 + E +Q+ +S +DP + + + Q Sbjct: 497 QQRWEKMQAKFKEMDREWEIQQGRNRFVSSMQNNFPLDPNDKNNQAAVDRYFAQDIAPSF 556 Query: 441 KIKDDYNLTTSHFNKTEENQLRTQLVK--------RPASESVDLIRGAYNTLSDSDKEG- 491 I D ++ T T+ + TQ+ R + V + + ++ Sbjct: 557 SISDPQSINTLVTVTTKSGMIPTQVKTMLNSGATSRDPALVVPMAKFYGQLFDNNPAAAA 616 Query: 492 -------------VRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETR 538 S A + + + + S F + +LS K Sbjct: 617 TLDKSTMAFYGKVYDYSRAGVPE-DKAVDMAYSQVF--QQDDRMKQMLSTAMRDKKYVAA 673 Query: 539 YNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRV 598 T ++N + +T +S A +D + I + +G Sbjct: 674 RATAAQNNASSLTSFGSWSPDITDPGKSNAAYQ--RDYQTI---YDANFVQTGGDAEQAE 728 Query: 599 RVSDAMQIVLGNTPVNINESMLM 621 ++++AM ++ +M Sbjct: 729 KMTNAMIRTTWGVSTVNGKAEVM 751 >gi|282877204|ref|ZP_06286042.1| transglutaminase-like protein [Prevotella buccalis ATCC 35310] gi|281300696|gb|EFA93027.1| transglutaminase-like protein [Prevotella buccalis ATCC 35310] Length = 855 Score = 45.6 bits (106), Expect = 0.028, Method: Composition-based stats. Identities = 45/299 (15%), Positives = 95/299 (31%), Gaps = 39/299 (13%) Query: 435 RAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRS 494 R I+ Y + +E++L LVK + +I+ Y+ + ++ + S Sbjct: 398 RFAQEDSIRKAYESN--FYQAVDESELSQLLVKARGN--WRVIKNFYDKYPEKEERAL-S 452 Query: 495 SFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDS 554 + D L + S + +D A LS + + ++ + Sbjct: 453 LLKTLSDKDLRDI---SMDILEDHLLAKSNQLSPRVENEMIILPFKQQLQNAFDPATTKK 509 Query: 555 YINTPLTKLEQSTAGGNFNKDKEAIKLYLLG-------------------SMKDSGNYTL 595 + P +E N D EA+++ + S Sbjct: 510 FQQNPALLVEWIQKNIRINPDNEALRIAQTPIGVWKSRFTDARSRDIFFVDLARSIGIEA 569 Query: 596 NRVRVSDAMQIVLGNTPVNIN----------ESMLMPPRGMSKTDFEDRLWYATKDTGEY 645 + V+ +Q L + +++N + L+ + +D +Y + Sbjct: 570 RKDVVTSKVQYRLNDQWMDVNFETEKQKTAPKGKLVLSY-LPSKYLDDPKYYNHFSISKI 628 Query: 646 DPYTIKYMNVGSGKYMIIKNGNPK-VDKEGKTIIINVEDVNRDERMESTIRHYEHQIFN 703 + +N G+ + + K K+G ++ + RM S HQIFN Sbjct: 629 ENGIPVLLNFDEGQVDMGGGTSWKNTFKDGTSLDAGTYFLVTGTRMASGNVKESHQIFN 687 >gi|255007652|ref|ZP_05279778.1| hypothetical protein Bfra3_00854 [Bacteroides fragilis 3_1_12] gi|313145346|ref|ZP_07807539.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313134113|gb|EFR51473.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 1045 Score = 45.6 bits (106), Expect = 0.034, Method: Composition-based stats. Identities = 52/334 (15%), Positives = 94/334 (28%), Gaps = 29/334 (8%) Query: 307 NHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQK 366 N GI+ +L L+ YT A + ++N + + A P K+ + Sbjct: 258 NFGISKVVKKLDMLDGYTYALYRNEAARMFNEYENANEAIPYPGTSKVDPNTGESVY--- 314 Query: 367 IDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEE 426 + + S + E + D + I +D Sbjct: 315 -SPGPEDYRNGTYPSVNWQDEVFETAFSQEYNLSVNGSNDKGYYAISGNILDQ-----SG 368 Query: 427 SFADSLRQRAGFVKKIKD------DYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 +S +R F + + S N L S+ + Sbjct: 369 IIHNSGYKRYSFRANLARKVHDWIEIGTNMSFTNS-----LNKLAKTNSVSDGIIRGALF 423 Query: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN 540 Y + D E + N E + ++ ++ + + KD + R N Sbjct: 424 YPATAPLDDETNNAQLNWFSSNPYVYTRAAKDELTTNSFFSSSFV--EITPYKDLKVRQN 481 Query: 541 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLN-RVR 599 N + D Y N + + T G D A L L + + N + Sbjct: 482 VGFSYNINER--DVYYNRETVEGKDPTNGYASKADNWAKNLVLETMATYNKTFNKNHSLN 539 Query: 600 VSDAMQIVLGNTPVNINESMLMPPRGMSKT-DFE 632 V A G N++M+ T DF+ Sbjct: 540 VVAAFSYERG---DYGNKAMVATGFPQDLTEDFD 570 >gi|332800735|emb|CBY88575.1| hypothetical protein [Pantoea phage LIMEzero] Length = 893 Score = 45.2 bits (105), Expect = 0.038, Method: Composition-based stats. Identities = 64/403 (15%), Positives = 123/403 (30%), Gaps = 51/403 (12%) Query: 325 QAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMM 384 Q G ++ L Q + + + + M + +Q+ + + +N ++ Sbjct: 95 QTAGYSDQSKRMELAQQAAKISAKMPQYAAMDPEEFLGVVQQANQDLFANT-DGMSMRGR 153 Query: 385 ATKYKEILE--------KSHRQSMQELNQDAISWGIKYKQIDPLRFD-----TEESFADS 431 L ++ +NQ + + + + + +++A Sbjct: 154 RDLLDNQLTNTNTLIRTQAATHGAYLVNQRSQMYNAQGNTLASMVAQAKSTGDGDAYAAG 213 Query: 432 LRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLV--KRPASESVDLIRGAYNTLSDSDK 489 + Q +VK I D L K + + L R A ESV + G NTL Sbjct: 214 IGQTMQWVKSIMTDPQLPMDAKQKQVTSMMGLMLQQDSRTAVESV-VNSGLLNTLPPDQL 272 Query: 490 EGVRSSFA------HIEDN--------------GLSAVVRLSSEFSDDAKNAAMVILSGM 529 ++S+ ++DN V +S S + A +++G Sbjct: 273 ASIQSAIRESKNRTQLQDNQGLYDQFSQMRARQDTYGDVPVSDFNSLLTRMADARLVTGD 332 Query: 530 KHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKE---------AIK 580 + E K+ K L +S+ T + + A D A Sbjct: 333 GYMAAQEQYLKNYAKNAKAAQLGNSFAQGDQTGMLRLNATPQDAADAYIKTAMKQPGATS 392 Query: 581 LYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATK 640 + GS+ G T V A ++ + P S T DR+ A + Sbjct: 393 ASVAGSLLSIGMATGQPVAYQRAAGLIEPGLTNFGTVGQMNPDAAQSVTGMLDRITVAEQ 452 Query: 641 --DTGEYDPYTIKYMNVGSGKYMIIK---NGNPKVDKEGKTII 678 DT + K + ++ VD+ G+ I Sbjct: 453 HGDTTAFSKLLSGLTPENQEKLVFMREQIKSGKTVDQAGQDYI 495 >gi|153952470|ref|YP_001398457.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. doylei 269.97] gi|152939916|gb|ABS44657.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. doylei 269.97] Length = 859 Score = 44.8 bits (104), Expect = 0.046, Method: Composition-based stats. Identities = 62/359 (17%), Positives = 104/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E + + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISTKKKEIDLLKNEASLAKTRGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPNLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNTDNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A ++ + K+ E + K + L D PL K E KD Sbjct: 723 IASNVIMNLN-GKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|157372112|ref|YP_001480101.1| lytic transglycosylase catalytic [Serratia proteamaculans 568] gi|157323876|gb|ABV42973.1| Lytic transglycosylase catalytic [Serratia proteamaculans 568] Length = 898 Score = 44.4 bits (103), Expect = 0.059, Method: Composition-based stats. Identities = 41/330 (12%), Positives = 87/330 (26%), Gaps = 24/330 (7%) Query: 333 TELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEIL 392 + Y Q I + + + +S+P + + + Sbjct: 460 QQSYQRYQQTLAMQAAYQTIVQGDVQQGMAAVNAMKPD-VSDPDFTFKQ-----QRYAQV 513 Query: 393 EKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNL-TTS 451 + Q + D +W ++ + ++ S A ++ K+ + + Sbjct: 514 ASKYSQVQKAREDDPGTWLLQNSPVIKDAYNQFSQRQVSGEYFASRIRAEKERLGIRSKD 573 Query: 452 HFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLS 511 N + Q+ +SV ++ + S V + + LS V Sbjct: 574 ILPAEMVNSVLHQIDSSN-EQSVAAVQAIAQSYG-SQAPAVIAQLQGKSGSALSVV---- 627 Query: 512 SEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYI--NTPLTKLEQSTAG 569 S D NAA + K D + + S+ D + L Sbjct: 628 --MSVDNANAANALFQNRKLSTDELKKASGTQSSSVESEWSDQFADFGVSLASQRGGLKA 685 Query: 570 GNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKT 629 + ++ Y+ S G A + G+ L ++ Sbjct: 686 YSDFNEQGKRLAYIYASRGMDGASAA-----KMAFNQIAGDQYDFEGAWRLPKSLNLNVR 740 Query: 630 DFEDRLWYATKDTGEYDPYTIKYMNVGSGK 659 D D + D + VG + Sbjct: 741 DITDGTKNYLEKLKPEDIDPQGF--VGDSR 768 >gi|86149199|ref|ZP_01067431.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni CF93-6] gi|85840557|gb|EAQ57814.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni CF93-6] Length = 857 Score = 44.4 bits (103), Expect = 0.061, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 105/359 (29%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ + E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFQNEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASNAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|121613623|ref|YP_001000212.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 81-176] gi|167005168|ref|ZP_02270926.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 81-176] gi|87250327|gb|EAQ73285.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 81-176] Length = 857 Score = 44.4 bits (103), Expect = 0.061, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 104/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|205355368|ref|ZP_03222139.1| ATP-dependent CLp protease ATP-binding subunit [Campylobacter jejuni subsp. jejuni CG8421] gi|205346602|gb|EDZ33234.1| ATP-dependent CLp protease ATP-binding subunit [Campylobacter jejuni subsp. jejuni CG8421] gi|315057930|gb|ADT72259.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni S3] Length = 857 Score = 44.4 bits (103), Expect = 0.063, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 104/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|148925882|ref|ZP_01809569.1| ATP-dependent CLP protease ATP-binding subunit [Campylobacter jejuni subsp. jejuni CG8486] gi|145844868|gb|EDK21972.1| ATP-dependent CLP protease ATP-binding subunit [Campylobacter jejuni subsp. jejuni CG8486] Length = 857 Score = 44.4 bits (103), Expect = 0.065, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 104/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|86151601|ref|ZP_01069815.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 260.94] gi|86153521|ref|ZP_01071725.1| chaperone clpB [Campylobacter jejuni subsp. jejuni HB93-13] gi|85841230|gb|EAQ58478.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 260.94] gi|85843247|gb|EAQ60458.1| chaperone clpB [Campylobacter jejuni subsp. jejuni HB93-13] Length = 857 Score = 44.4 bits (103), Expect = 0.066, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 104/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|284925775|gb|ADC28127.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni IA3902] Length = 857 Score = 44.4 bits (103), Expect = 0.068, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 104/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|73663123|ref|YP_301904.1| argininosuccinate synthase [Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305] gi|119361330|sp|Q49WA0|ASSY_STAS1 RecName: Full=Argininosuccinate synthase; AltName: Full=Citrulline--aspartate ligase gi|72495638|dbj|BAE18959.1| argininosuccinate synthase [Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305] Length = 401 Score = 44.4 bits (103), Expect = 0.073, Method: Composition-based stats. Identities = 42/271 (15%), Positives = 85/271 (31%), Gaps = 43/271 (15%) Query: 321 EDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLA 380 E+++Q Y Y ++ L +KK ++ + Sbjct: 64 EEFSQDY------VSYAIKGNLMYEGTYPLVSALSRPLISKKLVEIASKTNSVGIAHGCT 117 Query: 381 STMMATKYKEILEKSHRQSMQELN---------QDAISWGIKYKQIDPLRFDTEESFADS 431 E+ K+ ++ ++ I + IK+ P++ D+ S + Sbjct: 118 GKGNDQVRFEVAIKALDPELKVFAPVREWAWSREEEIDYAIKHNIPVPIQHDSPYSIDQN 177 Query: 432 LRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDS-DKE 490 L R+ ++D Y + L +L P E+ +++ + S D Sbjct: 178 LWGRSNECGILEDPYATP-----PKDAYDLTNELEDAP-DEAEEIVLSFEKGIPVSLDGT 231 Query: 491 GVRSSFAHIEDNGL--------------SAVVRLSSEFSDDAKNAAMVILSGMKHQKDTE 536 + S +E N L V S E + AA VI++ K + Sbjct: 232 AYKLSELILELNKLAGKHGVGRIDHVENRLVGIKSREVYETP--AAEVIMNAHKALETIT 289 Query: 537 TRYNTDH-----KSNKFDSLYDSYINTPLTK 562 + H + + +Y+ +PLT Sbjct: 290 LTKDVAHFKPVIEKQFSEQIYNGLWFSPLTD 320 >gi|88597118|ref|ZP_01100354.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 84-25] gi|218562162|ref|YP_002343941.1| ATP-dependent Clp protease ATP-binding subunit [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|54035912|sp|Q9PI02|CLPB_CAMJE RecName: Full=Chaperone protein ClpB gi|88190807|gb|EAQ94780.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 84-25] gi|112359868|emb|CAL34656.1| ATP-dependent Clp protease ATP-binding subunit [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|315927008|gb|EFV06363.1| chaperone protein clpB [Campylobacter jejuni subsp. jejuni DFVF1099] gi|315929555|gb|EFV08745.1| chaperone protein clpB [Campylobacter jejuni subsp. jejuni 305] Length = 857 Score = 44.0 bits (102), Expect = 0.084, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 104/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASNAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|282881289|ref|ZP_06289973.1| hypothetical protein HMPREF9019_1865 [Prevotella timonensis CRIS 5C-B1] gi|281304834|gb|EFA96910.1| hypothetical protein HMPREF9019_1865 [Prevotella timonensis CRIS 5C-B1] Length = 457 Score = 44.0 bits (102), Expect = 0.093, Method: Composition-based stats. Identities = 48/383 (12%), Positives = 115/383 (30%), Gaps = 61/383 (15%) Query: 334 ELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILE 393 ++Y + + ++ + +ST+ L ++++ + Sbjct: 2 KIYKSMRNFRLLILVMSTVCALSTYSQ-----------DYKSKFGLYGNLLSSGRYLEMS 50 Query: 394 KSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQ---RAGFVKKIKDDYNLTT 450 K + + + K + + + + +A SL R + D L Sbjct: 51 KVYTDKDSI---NPFL-KLNNKIVAAIA-NNQNRYAISLINDNLRYNWQAYGFDALYLAD 105 Query: 451 SHFN--------KTEENQLRTQLVKRPASESVDLIR----GAYNTLSDSDKEGVRSSFAH 498 +F EE L S+ + A L ++ +++ Sbjct: 106 VYFTLCKAEGLYDQEEALLEWLKQDYYPSQMIYYNDKKYTAATEKLISDEENSLKAKVLI 165 Query: 499 IEDNGLSAVVRLSSEFSDDAKNAAMVILSG--MKHQKDTETRYNTDHKSNKFDSLY---- 552 + + + +FS + + L+G + TE RY D L Sbjct: 166 SPVHAIRKDESQAVKFSVNPIIRTKIKLNGETLDAIWTTELRYPLQITPQVADQLGIVCG 225 Query: 553 -DSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDA------MQ 605 DS P+ L+ G A+ ++ + L+R + + Sbjct: 226 KDSVAGCPVVYLDSINIGNLQFNHVPALVSVMVDPTAYIKEHKLSRKYAKELKAFYTRIN 285 Query: 606 IVLGNTPVNINESMLMPPRG----------MSKTDFEDRLWYATKDTGEY-------DPY 648 + P+ + + K DFE+ +++A D + + Sbjct: 286 VPQIGLPIIRLLEKIAIDWKHQIINFPTDTLPKADFEESMYFAYPDFPALYLPVSINNEF 345 Query: 649 TIKYMNVGSGKYMIIKNGNPKVD 671 + +N+G+ Y+++++ K Sbjct: 346 AVGQINLGNSDYIVVRDNFYKKY 368 >gi|315124037|ref|YP_004066041.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni ICDCCJ07001] gi|315017759|gb|ADT65852.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni ICDCCJ07001] Length = 857 Score = 44.0 bits (102), Expect = 0.096, Method: Composition-based stats. Identities = 66/384 (17%), Positives = 115/384 (29%), Gaps = 76/384 (19%) Query: 248 RSLEVVEDVSKAIGLAGWDRLDDTKRRRL------LEHLSSRDNALNTKLRKETQ----- 296 + +E +E ++A+ + D+ ++RL L +L + NALN++ E Sbjct: 417 KDIETLEVENEALKMEN----DEKNQKRLDEIVKELANLKEKQNALNSQFENEKSVFDGI 472 Query: 297 -AQARRID------------------AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYN 337 A+ + ID A+L +G + + + + E G L N Sbjct: 473 SAKKKEIDLLKNEASLAKARGEFQKAAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKN 532 Query: 338 LQQFKSVAA---PDVA-RIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILE 393 VA ++ M T + +KFL+ S K L Sbjct: 533 QVDEDLVAGILSKWTGISVQKMLTSEKQKFLEVEKHLKESVIGQD--------KALSALA 584 Query: 394 KSHRQSMQELNQD-----------AISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVK 440 ++ +++ LN D G K + FD E++ Sbjct: 585 RAIKRNKAGLNADNKPIGSFLFLGPTGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKH 644 Query: 441 KIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIE 500 + + E +L + ++P S V L D V + I Sbjct: 645 SVSRLLGAPPGYIGHEEGGELTEAVRRKPYS--VLLFDEVEKAHKD-----VFNVLLGIL 697 Query: 501 DNG-----LSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHK----SNKFDSL 551 D+G V + N A + + K+ E + K + L Sbjct: 698 DDGRATDSKGVTVDFKNTIIILTSNIASSAIMNL-SGKEQEDAVKNELKNFFKPEFLNRL 756 Query: 552 YDSYINTPLTKLEQSTAGGNFNKD 575 D PL K E KD Sbjct: 757 DDIITFNPLGKDEAYEIVKLLFKD 780 >gi|288959380|ref|YP_003449721.1| hypothetical protein AZL_025390 [Azospirillum sp. B510] gi|288911688|dbj|BAI73177.1| hypothetical protein AZL_025390 [Azospirillum sp. B510] Length = 715 Score = 44.0 bits (102), Expect = 0.098, Method: Composition-based stats. Identities = 26/241 (10%), Positives = 59/241 (24%), Gaps = 26/241 (10%) Query: 318 IPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSL 377 IP AYG E+G +++ + DV + ++ + L + Sbjct: 295 IPEARLVNAYGPERGRQMFAELEATRKLGQDVKTVAMVPPDVQDRLLASYAPDGAGYAGE 354 Query: 378 SLASTMMATKYKEILEKSHRQSMQELNQDAISW----GIKYKQIDPLRFDTEESFADSLR 433 + L ++ + + +D ++ K + + ++ Sbjct: 355 --------RERYGELARAVARDRKNR-EDPAAYVLSNSSKLRDLVAGARTDPTKAQAAVA 405 Query: 434 QRAGFVKKIKDDYNLTTS---HFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKE 490 + + + QL P D + G T + Sbjct: 406 LSLTLQADA----GIPEVDRRILPVSMADTAARQLATLPPEARADAVEGMAATYGEHWPR 461 Query: 491 GVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDS 550 R A G + + + A I G + + D Sbjct: 462 VFRELVAQKLPTGYEVLATVQHPTARKELAEAQAI--GRDDLRKAV----DKNAKLVDDG 515 Query: 551 L 551 + Sbjct: 516 V 516 >gi|157414793|ref|YP_001482049.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 81116] gi|157385757|gb|ABV52072.1| ATP-dependent CLP protease ATP-binding subunit [Campylobacter jejuni subsp. jejuni 81116] gi|307747431|gb|ADN90701.1| Chaperone protein clpB [Campylobacter jejuni subsp. jejuni M1] gi|315931705|gb|EFV10660.1| chaperone protein clpB [Campylobacter jejuni subsp. jejuni 327] Length = 857 Score = 43.2 bits (100), Expect = 0.17, Method: Composition-based stats. Identities = 63/359 (17%), Positives = 104/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDVVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|38234115|ref|NP_939882.1| putative chromosome partition protein [Corynebacterium diphtheriae NCTC 13129] gi|28375459|emb|CAD66593.1| SMC protein [Corynebacterium diphtheriae] gi|38200377|emb|CAE50065.1| Putative chromosome partition protein [Corynebacterium diphtheriae] Length = 1161 Score = 42.9 bits (99), Expect = 0.17, Method: Composition-based stats. Identities = 55/317 (17%), Positives = 110/317 (34%), Gaps = 30/317 (9%) Query: 168 ENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARD--PRVFENFMRAF 225 EN+ +RV++++ + +LR + EA+++LN + + D RV E+ Sbjct: 700 ENFSRRVEAVSHEAEARKQELRRHALRVQEAQDRLNQLRQLWEETSDRLSRVEED----- 754 Query: 226 YKKGHPPKDSTSLSDVSDSA--RERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSR 283 + STS D + +A RS+EV RL + EH+ R Sbjct: 755 ---NSEEEPSTSERDKAAAALSHMRSMEVEA------------RLSLRTAQDKAEHIRGR 799 Query: 284 DNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKS 343 + L + E QA+AR A + + ++ + A V+ + + Sbjct: 800 GDGLRRQAEYERQARARHDQAVARQRRKAEIASVVAQQCLVVAERVDDALARATQWKENA 859 Query: 344 VAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQEL 403 +A V + AK + + S A+ + + + LE++H + +++L Sbjct: 860 IAQRGVISAR---VSAAKDTVSALRTNANRLTEKSHAAELSLGQAQVRLEEAHEKIVEQL 916 Query: 404 NQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRT 463 I+ + P + L+ + + L F EE Sbjct: 917 G---IAISDLMRDYTPHEGFDIGAQRVRLKNAEKDLNALGKVNPLALEEFKALEERYEFL 973 Query: 464 QLVKRPASESVDLIRGA 480 ++ + G Sbjct: 974 STQLADVEQARRDLHGV 990 >gi|57505126|ref|ZP_00371071.1| ATP-dependent Clp protease, ATP-binding subunit ClpB [Campylobacter coli RM2228] gi|305433063|ref|ZP_07402219.1| chaperone protein ClpB [Campylobacter coli JV20] gi|57019101|gb|EAL55816.1| ATP-dependent Clp protease, ATP-binding subunit ClpB [Campylobacter coli RM2228] gi|304443764|gb|EFM36421.1| chaperone protein ClpB [Campylobacter coli JV20] Length = 857 Score = 42.9 bits (99), Expect = 0.17, Method: Composition-based stats. Identities = 64/359 (17%), Positives = 107/359 (29%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKET------QAQARRIDAQLNHGITVK------- 313 RLD+ + L +L + +ALN++ E A+ + ID+ N + K Sbjct: 440 RLDEIAKE--LANLKEKQSALNSQFENEKAVFDSISAKKKEIDSLKNEAVFAKNKGEFQK 497 Query: 314 PSEL----IPLEDYTQAYGVEQGTELYNL--QQFKSVAAPDVA---------RIKLMSTF 358 +EL IP + A E+ ++ V VA ++ M T Sbjct: 498 AAELEYGKIPECEKEVANLEEKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYVGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNILLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|3063758|emb|CAA73776.1| heat shock protein [Campylobacter jejuni] Length = 857 Score = 42.9 bits (99), Expect = 0.18, Method: Composition-based stats. Identities = 62/351 (17%), Positives = 103/351 (29%), Gaps = 52/351 (14%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLAS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEI---LEKSHRQSMQELNQDAISWGIKY- 414 + +KFL+ S A + +A K L ++ + L G Sbjct: 558 EKQKFLEVEKHLKESVIGQDKALSALARAIKRNKAGLNADNKPIGKLLFLGPTGVGKTQS 617 Query: 415 -KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASES 473 K + FD E++ + + E +L + ++P S Sbjct: 618 AKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAVRRKPYS-- 675 Query: 474 VDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKNAAMVILSG 528 V L D V + I D+G V + N A + Sbjct: 676 VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSNIASSAIMN 730 Query: 529 MKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 + K+ E + K + L D PL K E KD Sbjct: 731 L-SGKEQEDAVKNELKNFSKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|283954202|ref|ZP_06371727.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 414] gi|283794481|gb|EFC33225.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 414] Length = 857 Score = 42.9 bits (99), Expect = 0.19, Method: Composition-based stats. Identities = 64/358 (17%), Positives = 104/358 (29%), Gaps = 66/358 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E + + ID Sbjct: 440 RLDEITKE--LANLKEKQNALNSQFENEKSVFDGISTKKKEIDMLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEVLEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ I + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSISRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAM---VILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + LSG + + + K + L D PL K E KD Sbjct: 723 IASSAIINLSGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|315121929|ref|YP_004062418.1| hypothetical protein CKC_00895 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122891|ref|YP_004063380.1| hypothetical protein CKC_05735 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495331|gb|ADR51930.1| hypothetical protein CKC_00895 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496293|gb|ADR52892.1| hypothetical protein CKC_05735 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 129 Score = 42.9 bits (99), Expect = 0.21, Method: Composition-based stats. Identities = 27/76 (35%), Positives = 38/76 (50%) Query: 12 EAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQD 71 E + PNQDAVRD+V+P GL+ + L Q + R ++ A A T+ S D Sbjct: 12 EPLAPNQDAVRDAVDPLGGLKTVAQGLDSLNQLYQERRSEHIKAQALTQDALHSADDHAD 71 Query: 72 FHDFTNSLDTRDSLQA 87 F + SL T +QA Sbjct: 72 FARYMTSLPTGVDVQA 87 >gi|57238288|ref|YP_178631.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni RM1221] gi|57167092|gb|AAW35871.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni RM1221] Length = 857 Score = 42.9 bits (99), Expect = 0.21, Method: Composition-based stats. Identities = 62/359 (17%), Positives = 103/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E A+ + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDGISAKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+ V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDSRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDAVKNELKNFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|283955934|ref|ZP_06373424.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 1336] gi|283792594|gb|EFC31373.1| ATP-dependent chaperone protein ClpB [Campylobacter jejuni subsp. jejuni 1336] Length = 857 Score = 42.9 bits (99), Expect = 0.22, Method: Composition-based stats. Identities = 62/359 (17%), Positives = 103/359 (28%), Gaps = 68/359 (18%) Query: 267 RLDDTKRRRLLEHLSSRDNALNTKLRKETQ------AQARRID----------------- 303 RLD+ + L +L + NALN++ E + + ID Sbjct: 440 RLDEIAKE--LANLKEKQNALNSQFENEKSVFDDISTKKKEIDLLKNEASLAKARGEFQK 497 Query: 304 -AQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTF 358 A+L +G + + + + E G L N VA ++ M T Sbjct: 498 AAELEYGKIPSLEKEVEILEDKWKKMSENGVLLKNQVDEDLVAGILSKWTGISVQKMLTS 557 Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD-----------A 407 + +KFL+ S K L ++ +++ LN D Sbjct: 558 EKQKFLEVEKHLKESVIGQD--------KALSALARAIKRNKAGLNADNKPIGSFLFLGP 609 Query: 408 ISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 G K + FD E++ + + E +L + Sbjct: 610 TGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRLLGAPPGYIGHEEGGELTEAV 669 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKN 520 ++P S V L D V + I D+G V + N Sbjct: 670 RRKPYS--VLLFDEVEKAHKD-----VFNVLLGILDDGRATDSKGVTVDFKNTIIILTSN 722 Query: 521 AAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A + + K+ E + K + L D PL K E KD Sbjct: 723 IASSAIMNL-SGKEQEDAVKNELKIFFKPEFLNRLDDIITFNPLGKDEAYEIVKLLFKD 780 >gi|260807192|ref|XP_002598393.1| hypothetical protein BRAFLDRAFT_128123 [Branchiostoma floridae] gi|229283665|gb|EEN54405.1| hypothetical protein BRAFLDRAFT_128123 [Branchiostoma floridae] Length = 1869 Score = 42.5 bits (98), Expect = 0.28, Method: Composition-based stats. Identities = 77/424 (18%), Positives = 161/424 (37%), Gaps = 44/424 (10%) Query: 155 GLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHA-- 212 GLA A + H S+E R Q ++D+INS+ DL ++Q LSE + + Q+I A Sbjct: 591 GLARAQMDH--SSELEADRQQKMSDNINSVLDDL-VQQGALSEEESERIQKQLIDREAAL 647 Query: 213 ---RDPRVFENFMRAFYKKGHPPKDSTSLSDVSD------SARERSLEVVEDVSKAIGLA 263 D R +N + + + + + D A+ E ED I L Sbjct: 648 KASMDKR--KNLQASLMRGRMADRKRRRIQQLKDRQGLEREAKMHPTEDEEDSDSVIKLE 705 Query: 264 GWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDY 323 + + + RR E L + + + + + + +R +D + H ++K ++ L++ Sbjct: 706 AGEDVAELDRRHARE-LEALEAQIAAEEAEHLRDISRSLDEE--HTNSLKETQKDVLKNL 762 Query: 324 TQAYGVEQ--GTELYNLQQFKSVAAPDVARIK-LMSTFDAKKFLQKIDDEYISNPSLSLA 380 + +GV+ E+ + + ++ I+ T D + L I++ ++ A Sbjct: 763 SSKHGVDAVRQQEIMDRFRKDQENLEYMSNIRKERQTQDLQAKLAARKARKIAD-AMRQA 821 Query: 381 STMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFA-------DSLR 433 A ++ + EK +Q Q Q+A S G+ Q+ P E++ D L Sbjct: 822 EEDAAKRFIDEQEKQLQQVAQLQQQEA-SAGLVIPQLAPGDSLEEQAIKKEHQRAQDELV 880 Query: 434 QRAGFV-KKIKDDYNLTTSHFNKTE----ENQLRTQLVKRPASESVDLIRGAYNTLSDSD 488 R ++++ + E E + L ++ ++ +L A L++ + Sbjct: 881 ARHKDEREELEAKMDQEMRKKEAEERKKFEAEREKALREKKNKQAAEL--AARKDLTEEE 938 Query: 489 KEGVRSSFAH-IEDNGLSAVVRLS-SEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSN 546 + ++ +ED S + S K A ++ + + + Sbjct: 939 MAALLAAHEQELEDLNDRLDAERSRQQLSLRDKLLARK----RNRLQNKQRKQEAEMTKE 994 Query: 547 KFDS 550 + Sbjct: 995 LLEQ 998 >gi|294867047|gb|ADF46235.1| beta-agarase [Alteromonas sp. X3] Length = 529 Score = 41.3 bits (95), Expect = 0.51, Method: Composition-based stats. Identities = 37/284 (13%), Positives = 79/284 (27%), Gaps = 35/284 (12%) Query: 386 TKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDD 445 T+ L + + L D I + + + RA + + + Sbjct: 37 TQLNNALTSAQNNGLDTL--DTIKISGTINIYGQIDIKSSVKITGAQTNRASKLT--RRN 92 Query: 446 YNLTTSHFNKTEENQLRT--QLVKRPASESVDLIRGAYNTLSDSD-----KEGVRSSFAH 498 N +N L A+ + L+ A + S+S ++ Sbjct: 93 TGFYNPLINVQYKNVTVENITLEGIGANTNQQLLSLAGDDTSNSALINLPATAYATNPKG 152 Query: 499 IEDNGLSA--VVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYI 556 ++ + ++ + + + E + D + D+ + Sbjct: 153 FTATNVNFFDSAIGVASVGILPQDLNVSYNTFRNINRAVELLRD-------VDRVDDALL 205 Query: 557 NTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNIN 616 N K+ + G KL + N +N + A+ + GN Sbjct: 206 NNAQIKVNGNIIG------LYGGKL-------NISNNLINAENMRLAISLDGGNDGAGFI 252 Query: 617 ESMLMPPRGMSKTDFEDRLWYATKDTGE--YDPYTIKYMNVGSG 658 S + P + + D+ Y + GE + I Y G Sbjct: 253 GSGFLNPWSEKRQTYTDKPVYYSSVIGEAVINNNRIGYYQHTDG 296 >gi|22299310|ref|NP_682557.1| succinoglycan transport protein exoP-like protein [Thermosynechococcus elongatus BP-1] gi|22295493|dbj|BAC09319.1| tll1767 [Thermosynechococcus elongatus BP-1] Length = 740 Score = 41.3 bits (95), Expect = 0.59, Method: Composition-based stats. Identities = 45/200 (22%), Positives = 77/200 (38%), Gaps = 15/200 (7%) Query: 246 RERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQ 305 R+R E+V V KA+ A D ++ +L L D+ +N L + + Q R+ A Sbjct: 179 RDRQPEMVTAVLKAVAEAFVDYSQRDRQEQLNRTLKQLDSHINQHLTELEELQ-RQFQAG 237 Query: 306 LNHGITVKPSELIPL--EDYTQAYGVEQ-----GTELYNLQQFKS----VAAPDVARIKL 354 G +++PSE + E Y Q + + Q E Y Q + + Sbjct: 238 QTQGTSLQPSEQLRFLTEQYRQIHSLRQAARLKAVEAYGRFQGFQEQLKMTPAEALAAAN 297 Query: 355 MSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKY 414 +ST A + E + +LA T + LE+ R +++L A Sbjct: 298 LSTSPAYQQQLARIQELDQEIAKNLAIFREGTPILQALEEQRRVLLEQLQAIARQV---I 354 Query: 415 KQIDPLRFDTEESFADSLRQ 434 Q P+ T +F ++ Q Sbjct: 355 GQQYPVDNPTALAFQGTVSQ 374 >gi|294782075|ref|ZP_06747401.1| conserved hypothetical protein [Fusobacterium sp. 1_1_41FAA] gi|294480716|gb|EFG28491.1| conserved hypothetical protein [Fusobacterium sp. 1_1_41FAA] Length = 464 Score = 41.3 bits (95), Expect = 0.63, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 53/150 (35%), Gaps = 6/150 (4%) Query: 542 DHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVS 601 + L +SY E + ++ KD + K+ + ++ ++ N Sbjct: 215 KANRKRIKFLANSYKKNSQVVYEYPLSPNSYFKDDDGTKINYFDFLVNN-SFNYNSDIAK 273 Query: 602 DAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKYM 661 ++IV +N N + + +R+ TK+ + + + K + Sbjct: 274 SIVKIVPKLIELNNNGGIYDILDKLDGMIIPERIEVITKEL-NFKVDEMFIPTLEISKEL 332 Query: 662 IIKNGNPKVDKEGKTIIINVEDVNRDERME 691 + K + + K+ + N++E+ E Sbjct: 333 LNKEIDKFMLKDYS----RFNEYNKEEKRE 358 >gi|326670102|ref|XP_001922712.3| PREDICTED: cadherin EGF LAG seven-pass G-type receptor 3 [Danio rerio] Length = 3584 Score = 40.9 bits (94), Expect = 0.65, Method: Composition-based stats. Identities = 34/271 (12%), Positives = 78/271 (28%), Gaps = 32/271 (11%) Query: 354 LMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAI---SW 410 + F K+++ + ++ P + + K+ H + ++ S Sbjct: 860 NVPQFSQKRYVVAVREDVR--PHSEILRISATDQDKDGNAAVHYNIISGNSRGQFAIDSV 917 Query: 411 GIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNL----------TTSHFNKTEENQ 460 + + + PL F+TE + LR + + ++ + F T Sbjct: 918 TGEIQVVAPLDFETEREYTLRLRAQDNGRPPLSNNTGIVSVQVTDVNDNPPIFVSTPFQA 977 Query: 461 LRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIE-----DNGLSAVVRLSS--- 512 T L P S+ I+ D+ + R + + + V S Sbjct: 978 --TVLESAPIGHSILHIQAIDTDNGDNARLEYRLTGTGTDTPFVINAATGWVTVSSELDR 1035 Query: 513 ------EFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQS 566 F +A++ LS T N + + Y +N Sbjct: 1036 ESVEHYFFGVEARDYGSPPLSATASVTITVMDVNDN-RPEFLQKEYYVRLNEDAAVGTSV 1094 Query: 567 TAGGNFNKDKEAIKLYLLGSMKDSGNYTLNR 597 + ++D + Y + + ++ Sbjct: 1095 VSVTAVDRDVNSAVTYQITGGNTRNRFAIST 1125 >gi|159570762|emb|CAP19588.1| cadherin, EGF LAG seven-pass G-type receptor 3 [Danio rerio] gi|169145650|emb|CAQ14037.1| cadherin, EGF LAG seven-pass G-type receptor 3 [Danio rerio] gi|169158045|emb|CAQ13975.1| cadherin, EGF LAG seven-pass G-type receptor 3 [Danio rerio] Length = 2065 Score = 40.9 bits (94), Expect = 0.65, Method: Composition-based stats. Identities = 34/271 (12%), Positives = 78/271 (28%), Gaps = 32/271 (11%) Query: 354 LMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAI---SW 410 + F K+++ + ++ P + + K+ H + ++ S Sbjct: 256 NVPQFSQKRYVVAVREDVR--PHSEILRISATDQDKDGNAAVHYNIISGNSRGQFAIDSV 313 Query: 411 GIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNL----------TTSHFNKTEENQ 460 + + + PL F+TE + LR + + ++ + F T Sbjct: 314 TGEIQVVAPLDFETEREYTLRLRAQDNGRPPLSNNTGIVSVQVTDVNDNPPIFVSTPFQA 373 Query: 461 LRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIE-----DNGLSAVVRLSS--- 512 T L P S+ I+ D+ + R + + + V S Sbjct: 374 --TVLESAPIGHSILHIQAIDTDNGDNARLEYRLTGTGTDTPFVINAATGWVTVSSELDR 431 Query: 513 ------EFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQS 566 F +A++ LS T N + + Y +N Sbjct: 432 ESVEHYFFGVEARDYGSPPLSATASVTITVMDVNDN-RPEFLQKEYYVRLNEDAAVGTSV 490 Query: 567 TAGGNFNKDKEAIKLYLLGSMKDSGNYTLNR 597 + ++D + Y + + ++ Sbjct: 491 VSVTAVDRDVNSAVTYQITGGNTRNRFAIST 521 >gi|118385975|ref|XP_001026110.1| hypothetical protein TTHERM_00971900 [Tetrahymena thermophila] gi|89307877|gb|EAS05865.1| hypothetical protein TTHERM_00971900 [Tetrahymena thermophila SB210] Length = 2536 Score = 40.9 bits (94), Expect = 0.67, Method: Composition-based stats. Identities = 35/176 (19%), Positives = 72/176 (40%), Gaps = 10/176 (5%) Query: 390 EILEKSHRQSMQELNQDAISWGIKYKQI--DPLRFDTEESFADSLRQRAGFVKKIKDDYN 447 +I E + + E NQ+ I+ KQI ++ + + +IK Y Sbjct: 2251 DINESNATLNQVEKNQEDNEKQIELKQIFSKEIQASQNNHPSPLYKHTQDKQGQIKPIYG 2310 Query: 448 LTTSHFNKTEENQLRTQLVKRPASE----SVDLIRGA--YNTLSDSDKEGVRSSFAHIED 501 + + FN+TE QL+ Q K A + S + G TLSD + + ++++ + Sbjct: 2311 NSKTIFNQTESLQLQNQDEKSNADQYYVDSNKFLMGIQPKKTLSDINFQQNQNAYYK-SN 2369 Query: 502 NGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYIN 557 N + + E ++ L+ + + + T++ + + ++DS IN Sbjct: 2370 NKKQLYIEEADEIQYTQQDEKSENLNEQNEF-NLRSPHYTENSTQRNQYVFDSKIN 2424 >gi|57505852|ref|ZP_00371777.1| ATP-dependent Clp protease, ATP-binding subunit ClpB [Campylobacter upsaliensis RM3195] gi|57015882|gb|EAL52671.1| ATP-dependent Clp protease, ATP-binding subunit ClpB [Campylobacter upsaliensis RM3195] Length = 857 Score = 40.9 bits (94), Expect = 0.67, Method: Composition-based stats. Identities = 67/379 (17%), Positives = 115/379 (30%), Gaps = 66/379 (17%) Query: 248 RSLEVVEDVSKAIGL----AGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQA------ 297 + LE +E +A+ + A RL++ + L +L NALN + E Sbjct: 417 KELETLEVEKEALKMEQNEANEKRLEEIAKE--LANLKEEQNALNAQFENEKSVFDSISV 474 Query: 298 QARRID------------------AQLNHG-ITVKPSELIPLEDYTQAYGVEQGTELYNL 338 + + ID A+L +G I E++ LE+ Q G L N Sbjct: 475 KKKEIDTLKNEANLAKNKGDFQKAAELEYGQIPSLEKEVLSLEEKWQLLSQN-GVLLKNK 533 Query: 339 QQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQ 398 VA + + +K L+ +++ S + K L K+ ++ Sbjct: 534 VDEDLVAG----ILSKWTGISVQKMLKSEKQKFLEVEKHLKESVIGQDKALNALAKAIKR 589 Query: 399 SMQELNQD-----------AISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDD 445 + LN+D G K + FD E++ + Sbjct: 590 NKAGLNEDSKPIGSFLFLGPTGVGKTQSAKALAKFLFDDEKAMIRFDMSEFMEKHSVSRL 649 Query: 446 YNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDN--- 502 + E +L + ++P S V L D V + I D+ Sbjct: 650 LGAPPGYIGHEEGGELTEAVRRKPYS--VILFDEIEKAHKD-----VFNILLGILDDARA 702 Query: 503 --GLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHK----SNKFDSLYDSYI 556 V + N A + +K K+ E + K + L D Sbjct: 703 TDSKGVSVDFKNTIIILTSNIASSAIMNLK-GKEQEEAVKAELKNFFKPEFLNRLDDIIT 761 Query: 557 NTPLTKLEQSTAGGNFNKD 575 PL + E KD Sbjct: 762 FNPLGENEAYEIVKLLFKD 780 >gi|239979653|ref|ZP_04702177.1| penicillin-binding protein [Streptomyces albus J1074] Length = 734 Score = 40.9 bits (94), Expect = 0.75, Method: Composition-based stats. Identities = 49/279 (17%), Positives = 83/279 (29%), Gaps = 33/279 (11%) Query: 254 EDVSKAIGLAGWDRL--DDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHG-I 310 E+ ++ GW D K++R LE S D++LN KL +AR++DA++ G Sbjct: 298 EEAQTSVEAGGWTMTLNIDKKKQRELER--SVDSSLNDKL----DPKARKVDARVQTGAA 351 Query: 311 TVKPSELIPLEDYTQAYGVEQGTE--LYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKID 368 +V P YG E L N + A + L S + K Q Sbjct: 352 SVDPRN----GKVLALYGGRDYIEHQLSNATRRDYQVASTFKPLILASALENKATTQDGR 407 Query: 369 DEYISNPSLSLASTMMATKYKEILEKSHRQS-----------MQELNQDAISWGIKYKQI 417 + + + + +N G+ Sbjct: 408 PVTAGTIYDGTSERPLEGSPTPYAPPNEDDVDYGPVTVQKAMNKSINSVFAQMGVDVGMD 467 Query: 418 DPLRFD-----TEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKR-PAS 471 ++ E Q G + + + F + L Sbjct: 468 KVMKTAGDLGMDVEGLEAVPAQTLGSMGASPLEMAGVYATFANHGKLVTPAVLKSAENRD 527 Query: 472 ESVDLIRGAY-NTLSDSDKEGVRSSFAHIEDNGLSAVVR 509 +VDL + + S + V S + D+G A VR Sbjct: 528 RTVDLPKAVGEQVIKRSTADSVTSVLTGVVDDGTGAAVR 566 >gi|114566147|ref|YP_753301.1| signal transduction histidine kinase-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114337082|gb|ABI67930.1| multi-sensor hybrid histidine kinase [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 1442 Score = 40.9 bits (94), Expect = 0.82, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 53/166 (31%), Gaps = 5/166 (3%) Query: 363 FLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRF 422 F+ E + + M + I+ +M+ + + G+ P+ F Sbjct: 1259 FMDCQMPEMDGYETTARIRQMEGEERHTIIVAMTANAMEGDREKCLQAGMDDYISKPVDF 1318 Query: 423 DTEESFADSLRQRAG-FVKKIKDDYNLTTSHFNKTEENQLRT--QLVKRPASESVDLIRG 479 + + R V ++ S F +L + S ++ Sbjct: 1319 QLLFNIIEQYTARQQESVSEVAAMLEENLSSFIAETGLSKGEGRELYDKLWSTLPGVLEE 1378 Query: 480 AYNTLSDSDKEGVRSSFAHIEDNGLSAVVR--LSSEFSDDAKNAAM 523 N L++SD + +RS ++ + + ++ F + K A Sbjct: 1379 LQNALANSDFKKIRSLAHQLKGSSGTLRIQKLYELFFDLEQKAIAE 1424 >gi|291451518|ref|ZP_06590908.1| penicillin-binding protein [Streptomyces albus J1074] gi|291354467|gb|EFE81369.1| penicillin-binding protein [Streptomyces albus J1074] Length = 760 Score = 40.9 bits (94), Expect = 0.83, Method: Composition-based stats. Identities = 49/279 (17%), Positives = 83/279 (29%), Gaps = 33/279 (11%) Query: 254 EDVSKAIGLAGWDRL--DDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHG-I 310 E+ ++ GW D K++R LE S D++LN KL +AR++DA++ G Sbjct: 324 EEAQTSVEAGGWTMTLNIDKKKQRELER--SVDSSLNDKL----DPKARKVDARVQTGAA 377 Query: 311 TVKPSELIPLEDYTQAYGVEQGTE--LYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKID 368 +V P YG E L N + A + L S + K Q Sbjct: 378 SVDPRN----GKVLALYGGRDYIEHQLSNATRRDYQVASTFKPLILASALENKATTQDGR 433 Query: 369 DEYISNPSLSLASTMMATKYKEILEKSHRQS-----------MQELNQDAISWGIKYKQI 417 + + + + +N G+ Sbjct: 434 PVTAGTIYDGTSERPLEGSPTPYAPPNEDDVDYGPVTVQKAMNKSINSVFAQMGVDVGMD 493 Query: 418 DPLRFD-----TEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKR-PAS 471 ++ E Q G + + + F + L Sbjct: 494 KVMKTAGDLGMDVEGLEAVPAQTLGSMGASPLEMAGVYATFANHGKLVTPAVLKSAENRD 553 Query: 472 ESVDLIRGAY-NTLSDSDKEGVRSSFAHIEDNGLSAVVR 509 +VDL + + S + V S + D+G A VR Sbjct: 554 RTVDLPKAVGEQVIKRSTADSVTSVLTGVVDDGTGAAVR 592 >gi|222823717|ref|YP_002575291.1| protein disaggregating chaperone ClpB [Campylobacter lari RM2100] gi|222538939|gb|ACM64040.1| protein disaggregating chaperone ClpB [Campylobacter lari RM2100] Length = 857 Score = 40.5 bits (93), Expect = 0.87, Method: Composition-based stats. Identities = 82/465 (17%), Positives = 146/465 (31%), Gaps = 67/465 (14%) Query: 164 DPSNENYFQRVQSITDHINSLPIDLR-LKQKLLSEAKEKLNTNQIIGAHARDPRVFENFM 222 D + + FQ V +N LR +K+KL +N + ++ A R N Sbjct: 326 DAALQRRFQPVNVAEPSVNEALAMLRGIKEKLEIHHNVTINDSALVAAAKLSKRYIAN-- 383 Query: 223 RAFYKKGHPPKDSTS------LSDVSDSARE--RSLEVVEDVSKAIGL----AGWDRLDD 270 R K D + + +S R+ + +E +E ++A+ + A RL++ Sbjct: 384 RFLPDKAIDLIDEAAAELKMQIESEPNSLRKVRKQIESLEVENEALKMEENEANEKRLEE 443 Query: 271 TKRRRLLEHLSSRDNALNTKLRKET------QAQARRIDAQLNHGITVK----------- 313 K+ L +L LN K E A+ + ID N I K Sbjct: 444 IKKE--LANLKEEQIKLNAKFENEKAVFNGISAKKKEIDTLKNEAILAKNKGDFQKAAEI 501 Query: 314 --------PSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA---PDVA-RIKLMSTFDAK 361 E++ LE+ ++G L N VA ++ M T + + Sbjct: 502 EYGKILECEKEVLNLEE-KWKQMTQEGVLLKNQVDEDLVAGILSKWTGISVQKMLTSEKQ 560 Query: 362 KFLQKIDDEYISNPS--LSLASTMMATKYKEI-LEKSHRQSMQELNQDAISWGI--KYKQ 416 K+L + S +L++ A K + L ++ + L G K Sbjct: 561 KYLHIQEYLQESVVGQDEALSALARAIKRNKAGLNQATKPIGSFLFLGPTGVGKTESAKA 620 Query: 417 IDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDL 476 + FD E++ I + E +L + ++P S V L Sbjct: 621 LAKFLFDDEKAMVRFDMSEFMEKHSISRLLGAPPGYIGHEEGGELTEAVRRKPYS--VIL 678 Query: 477 IRGAYNTLSDSDKEGVRSSFAHIEDNG-----LSAVVRLSSEFSDDAKNAAMVILS---G 528 D V + I D+G V ++ N + G Sbjct: 679 FDEVEKAHKD-----VFNILLGILDDGRATDSKGVTVDFTNTIIILTSNIGANFIMELKG 733 Query: 529 MKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFN 573 + +K + + + + L D PL + E Sbjct: 734 EEREKAIKEALRSFFRPEFLNRLDDIITFNPLGESEAEKIVKLLF 778 >gi|301612111|ref|XP_002935566.1| PREDICTED: protein-glutamine gamma-glutamyltransferase 6-like [Xenopus (Silurana) tropicalis] Length = 703 Score = 40.5 bits (93), Expect = 0.97, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 62/200 (31%), Gaps = 20/200 (10%) Query: 327 YGVEQGTELYNLQQFKSVAAPDVAR--IKLMSTFDAKKFLQKIDDEYISNPSLSLASTMM 384 YG L ++ + S D Q E SL Sbjct: 398 YGRRNDGTFSILYTDTRKVGQYISTKAVGNFSRQDVTD--QYKFPEGSKEERDSLHKAQG 455 Query: 385 ATKYKEILEKSHRQSMQ-----ELNQDAISWGIKYKQIDPLRFDTE----ESFADS--LR 433 T + + + R+ ++ I K + I + E + F + L+ Sbjct: 456 LTARSDRVMLASRRLNATAARVVEDKPPIQEAPKPEIIGTFKLSGELQVGQDFTVTLNLK 515 Query: 434 QRAGFVKKIKDDYNLTTSHFNKTEENQL---RTQLVKRPASESVDLIRGAYNTLSD--SD 488 K++ + LT +N+T ++ + P E V I Y + Sbjct: 516 NPTSSKKQVNAKWTLTAIVYNRTPVKEILSDSQSVTLAPNEEKVIEITVLYPQYEKALTP 575 Query: 489 KEGVRSSFAHIEDNGLSAVV 508 +R+S +E+NG S +V Sbjct: 576 DNMIRASAVCVEENGGSLLV 595 >gi|269975273|gb|ACZ55497.1| phage conserved protein [Staphylococcus phage SA1] Length = 737 Score = 40.5 bits (93), Expect = 1.0, Method: Composition-based stats. Identities = 32/280 (11%), Positives = 84/280 (30%), Gaps = 19/280 (6%) Query: 349 VARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKY-KEILEKSHRQSMQELNQDA 407 +A + + +LQ + + S + ++ +E ++ +L A Sbjct: 218 IAENARLKSVKEAAWLQDRANASLRQLVGGFVSAYALLELSQKTIEAGVKRQSAQLASTA 277 Query: 408 ISWGIKYKQI----DPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRT 463 I +G +Q +++ D+++Q + F + + Sbjct: 278 I-FGADTQQARMFAASFAHQIGQNYTDTIKQYSNFAAGAQPTLGFQGTQEFYKNAAMFAR 336 Query: 464 QLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAM 523 + E + I A+ ++ K + D AV + + Sbjct: 337 I--RGATDEDLKGIMVAFQQMASKGKVQAEELRGQLGDRLAGAVQLFADAIGKTPQELDK 394 Query: 524 VILSGMKHQKDTETRYNTDHKSNKFDSLY--DSYINTPLTKLEQSTAGGN-----FNKDK 576 ++ G KD ++ + ++ T + Q+ A + + Sbjct: 395 LMQDGKLLAKDV-LPKVSERMAELVQQAGGMNAVSKQTATSMGQAKAMWDNTLVALFNNS 453 Query: 577 EAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNIN 616 L S+ + + ++A+ +V+GN + Sbjct: 454 SEGISQLSNSVAM---FLQGSMGTTEALGVVIGNLLKGAS 490 >gi|221487158|gb|EEE25404.1| conserved hypothetical protein [Toxoplasma gondii GT1] Length = 1141 Score = 40.5 bits (93), Expect = 1.0, Method: Composition-based stats. Identities = 77/464 (16%), Positives = 140/464 (30%), Gaps = 69/464 (14%) Query: 191 KQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDS--TSLSDVSDSARE- 247 KQ+LL +E L + P V + R KDS +++ + RE Sbjct: 271 KQELLKFKEEHLPLRNL-------PEVLDELGRRIQANASTQKDSQLAFQAELCQNQREY 323 Query: 248 ----RSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRID 303 LE +E+ S+A + L RLLE ++ L T+ +ET Sbjct: 324 RDLRDRLENMEEQSRAHIKTNHETL------RLLE-----EHKLATQQLRET-------- 364 Query: 304 AQLNHGITVKPSELIPLEDY---TQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 H ++ EL +++ + Q E+ Q A ++ R LMS Sbjct: 365 ----HEELLRAQELRLRDEFQLQLAEFSKNQSQEVSTQMQDLRKALDELRRETLMS---Q 417 Query: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 +K + + + + S +KE+ +Q QD+++ ++ Sbjct: 418 QKASEPERLDLMREAAEKRCSEGEQQLWKELEVIKKQQVELLRQQDSLAHQKSQQEKAER 477 Query: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVD----- 475 E + R + Y + + T + + P S Sbjct: 478 VEKKLEDIRSFVSDREKDA---QKHYLDLMAALSCTASSSPSNGHFRAPLYSSSQKEVAL 534 Query: 476 --------LIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILS 527 + + S E S + + NG S V S + A V S Sbjct: 535 HAALLLRCTFDAIHAAATASFSEESSCSDSQVPRNGTSVSVEASD------RQAVAVARS 588 Query: 528 GMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSM 587 + +D +D D N +T ++ N + G Sbjct: 589 AEEGAEDIPL-KRSDSTDTSAGQSTDRLANR-MTPASETFLAAGANAALGSRVDSPRGEA 646 Query: 588 KDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDF 631 D N A++ + S ++ P G ++ F Sbjct: 647 SD--NSAALGQSWKAAVRTSQAASSGVGGTSGVLSPGGELESAF 688 >gi|212710806|ref|ZP_03318934.1| hypothetical protein PROVALCAL_01874 [Providencia alcalifaciens DSM 30120] gi|212686503|gb|EEB46031.1| hypothetical protein PROVALCAL_01874 [Providencia alcalifaciens DSM 30120] Length = 1122 Score = 40.5 bits (93), Expect = 1.1, Method: Composition-based stats. Identities = 39/290 (13%), Positives = 77/290 (26%), Gaps = 25/290 (8%) Query: 342 KSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQ 401 V D+ IK + +D I + + + ++ ++ + Sbjct: 9 NDVMGADIGSIKQPEPQYGES-----NDVGILDAANPFSDKNEVSRLRDAAFRIDNSVGS 63 Query: 402 ELNQDAISWGIKYKQIDPLR----FDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTE 457 + + + +P E FAD+ + + Sbjct: 64 LIATMPFNQFEEQDGYNPFDDERTLSGYEDFADAFIHSKSPEETSAIKQRINRQI----- 118 Query: 458 ENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDD 517 Q R L + LI + D A G + S + Sbjct: 119 --QDRQLLQDAGGA---GLISSIAMGVIDPINVAAMMVPAGTIARGGDVLATASKFAVSN 173 Query: 518 AKN--AAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKD 575 A A+ + LS ++T T + L + + F D Sbjct: 174 ALGGVASELALSAT---QETRTLEESALNVTFDAMLGGLLGSAAQLVKNRGQIVSKFKND 230 Query: 576 KEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRG 625 + + ++ D+ + V + Q L P IN +M + P G Sbjct: 231 VLGEQQTIQQNIPDNTSVGAMEVPETTLEQETLKG-PSFINRTMNVSPVG 279 >gi|332533810|ref|ZP_08409666.1| putative phage protein [Pseudoalteromonas haloplanktis ANT/505] gi|332036741|gb|EGI73204.1| putative phage protein [Pseudoalteromonas haloplanktis ANT/505] Length = 1098 Score = 40.2 bits (92), Expect = 1.1, Method: Composition-based stats. Identities = 30/268 (11%), Positives = 82/268 (30%), Gaps = 46/268 (17%) Query: 312 VKPSELIPLEDYTQAYGVEQGTELYNLQQF--KSVAAPDVAR---IKLMSTFDAKKFLQK 366 V+P ++ ++ QAY + G + + + ++ +A I + ++ Sbjct: 684 VQPQTMV-MQSADQAYSRDYGQSVISKASAPAQQASSQYIAPNSPINYAAPK-VTAIVET 741 Query: 367 IDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQEL--NQDAISWGIKYKQIDPLRFDT 424 + + + A+ + + + M E DA + + P Sbjct: 742 EEPRTQALAGPAAAN--------DPAYSAQPKVMTEREAANDAFKFTSQRLPAVPTDSTA 793 Query: 425 EESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTL 484 + R +A + + ++ + + S + Sbjct: 794 PLNLQAQARSQAL----------INNAFPSEQNNAVNNSAVTSSAVSAT----------- 832 Query: 485 SDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKD-----TETRY 539 ++ V S+ ++N A ++S + NAA I + ++ T + Sbjct: 833 QNNAATAVNSATNATQNN---AATAINSATNTTQSNAATAIDNATNATQNNAATVTNSAT 889 Query: 540 NTDHKSNKFDSLYDSYINTPLTKLEQST 567 +T + S+ ++ N+ + Sbjct: 890 STKALAAVNSSVNNATQNSLAPAAIAAQ 917 >gi|195027149|ref|XP_001986446.1| GH21371 [Drosophila grimshawi] gi|193902446|gb|EDW01313.1| GH21371 [Drosophila grimshawi] Length = 1525 Score = 40.2 bits (92), Expect = 1.3, Method: Composition-based stats. Identities = 63/411 (15%), Positives = 116/411 (28%), Gaps = 70/411 (17%) Query: 305 QLNHGITVKPSELIPLE---DYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAK 361 Q+ + +T+K +++ E DYT V EL Q K I + + Sbjct: 371 QIGNEVTIKLGKMLDYEAITDYTLTMSVRNTFELATEHQIKIKVEDVNDNIPFFTEVKSG 430 Query: 362 KFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLR 421 L+ +M + ++ + D Y IDP Sbjct: 431 TILENEPPGT----------PVMQVRAFDMDGTPANNIITFELAD----NTDYFAIDPQT 476 Query: 422 FDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLV--------KRPASES 473 + +R + K+ N +S FN + N +P + Sbjct: 477 GNITALTTFDREERDFYNVKVSATDNSPSSLFNNGDHNHGYQVFRISIGDKNDHKPHFQQ 536 Query: 474 VDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQK 533 + + L + ED ++ + S E N G+K K Sbjct: 537 TTYLA---DKLLEDANTNYEVIEVKAEDEDNASQIVYSIESG----NVGDAFKIGLKTGK 589 Query: 534 -------DTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGS 586 D ET + K FD +YD Y T + K+E K + +I + Sbjct: 590 ITVNQRLDYETITEYELKVRAFDGIYDDYA-TVVVKIEDVNDNPPLFKKEYSITIQEGPV 648 Query: 587 MKDS------------GNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDR 634 + + T ++ ++ N L + + + Sbjct: 649 QSNECILIIEAYDPDIKDRTADQHITYSIVKEDHKRLLTIDNSGCLRLIQPLDRD----- 703 Query: 635 LWYATKDTGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIINVEDVN 685 G + + G +K+ K +II +ED+N Sbjct: 704 -----PPNGHKSWQVLISASDQDGVGTTLKSV--------KPVIITLEDIN 741 >gi|303231586|ref|ZP_07318311.1| ABC transporter, substrate-binding protein, QAT family [Veillonella atypica ACS-049-V-Sch6] gi|302513749|gb|EFL55766.1| ABC transporter, substrate-binding protein, QAT family [Veillonella atypica ACS-049-V-Sch6] Length = 511 Score = 40.2 bits (92), Expect = 1.3, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 69/223 (30%), Gaps = 29/223 (13%) Query: 434 QRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESV---DLIRGAYNTLSDSDKE 490 +R G V I + + + + + + +P +E + D+++ +D E Sbjct: 217 KRIGIVASIVIISGIGVFMWPSQQSDVI--HVATKPMTEQLILGDMLKLLIEQDTDLTVE 274 Query: 491 GVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDS 550 + + + S +F + + + +T + N FD Sbjct: 275 VTAGVGGGT--SNIQPAME-SGQFDIYPEY--------TGTGWNAVLKRDTQYSENLFDE 323 Query: 551 LYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGN 610 L +Y +T K FN + + + + N + Q++LG Sbjct: 324 LQRAYEDTYQFKWVGM---YGFNN---TYGMAVRKDIANKYNLKTYSDLARVSSQLILGG 377 Query: 611 TPVNINESM----LMPPRGMSKTDFEDR---LWYATKDTGEYD 646 L GM D +D L Y ++G D Sbjct: 378 EYDFFGRQDGYSGLQRVYGMDFKDTKDMDIGLKYQAIESGHVD 420 >gi|300871846|ref|YP_003786719.1| DNA-directed RNA polymerase omega subunit family protein-like protein [Brachyspira pilosicoli 95/1000] gi|300689547|gb|ADK32218.1| DNA-directed RNA polymerase, omega subunit family protein-like protein [Brachyspira pilosicoli 95/1000] Length = 5567 Score = 40.2 bits (92), Expect = 1.3, Method: Composition-based stats. Identities = 86/419 (20%), Positives = 162/419 (38%), Gaps = 81/419 (19%) Query: 33 DLGDALGKATQFLEGIRRDN--AFATANTRYTELS--FKAVQDFHD----FTNSLDTR-- 82 +L + L K + E + +N F+T T++S ++ D D FT+ L+ R Sbjct: 3834 ELSEELKKIEESFELFKSNNNDIFSTLEEYKTKISSIVDSLSDIEDNNLNFTSRLEKRIE 3893 Query: 83 ---DSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQI 139 DS GDKIKE I D F S D+ KF+ E +++I Sbjct: 3894 FFEDSWGDGDKIKELYSDFIAEKADSFKS-----DLEVKFRDSFEE---------FNLKI 3939 Query: 140 GATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEA- 198 ++ D++N + D N +R++SI++ IN+L DL K + EA Sbjct: 3940 DNVEKNFADSINSIIESL------DKENLTSNERLESISNTINNLKSDLENKIVSIEEAN 3993 Query: 199 -KEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVS 257 K KL + + +FEN M AF + + DV DS + ++ + V+ Sbjct: 3994 DKSKLFIDDKLS-------IFENKMEAFEDR---------IKDVEDSQHYLNNDITQYVN 4037 Query: 258 KAIG-LAGWDRLDDTKRRRLLEHLSSRD-----NALNT-------------------KLR 292 + L ++ DTK L LS+ N +++ +L+ Sbjct: 4038 RLDKVLNDYEEAIDTKINNLKNSLSNESGHVEHNLISSLELSNELAKDKKELLDSIHELK 4097 Query: 293 KETQAQARRIDAQLNHGITVKPSEL---IPLEDYTQAYGVEQGTELYNLQQ--FKSVAAP 347 K+ R DAQ+N + +E+ IP + V + + N + + Sbjct: 4098 KDVYETIERKDAQMNEEFSKFKTEIIESIPSMEDISDLFVSEKNNIVNEFEGFKNEILTS 4157 Query: 348 DVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD 406 + M + K ++++D+ + ++++ +K K+ + + + + Sbjct: 4158 NNKNFADMFEEEKNKLIEELDNFKSAIRVEYMSNSDKIDAFKNEYLKNFNEIIDKEKAN 4216 >gi|154301924|ref|XP_001551373.1| hypothetical protein BC1G_10199 [Botryotinia fuckeliana B05.10] gi|150855591|gb|EDN30783.1| hypothetical protein BC1G_10199 [Botryotinia fuckeliana B05.10] Length = 864 Score = 40.2 bits (92), Expect = 1.4, Method: Composition-based stats. Identities = 59/374 (15%), Positives = 129/374 (34%), Gaps = 52/374 (13%) Query: 227 KKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIG-------LAGWDRLDDTKRRRLLEH 279 K G P K + + ++D + E ++D++K I + G D DD + Sbjct: 126 KSGKPGKYARKIVLITDGNGFMNSEELDDIAKEINNNDIRLVVIGVD-FDDAEY-----G 179 Query: 280 LSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELI-----PLEDYTQAYGVEQGTE 334 D L ++ E + R + + GI +E I P+ T+ Y G Sbjct: 180 FKEEDKDL---VKSENEKLLRSFTEKCSDGIFGTTAEAIEALATPVVKVTREYNTYTGPL 236 Query: 335 LYNLQQF--KSVAAPDVARI---KLMSTFDAKKFLQKIDDEYISN--PSLSLASTMMATK 387 + + ++ + DVAR K AK ++ ++ DE I P+ +++ + Sbjct: 237 ILGDVKKYPETAMSIDVARYFRTKQAKPVSAKSYVDRVTDEPIDVDLPNTDDLTSVTQAR 296 Query: 388 YKEILEKSHRQSMQELNQDAISWGIKYKQ-IDPLRFDTEESFADSLRQRAGFVKKIKDDY 446 ++ + + +++ +DA+ G +Y I P+ E + + + +DY Sbjct: 297 TYKVNDTTAPGGKKDVEKDALERGYEYGSTIVPISQSDESITKLQTIKDFSIMGFVPNDY 356 Query: 447 NLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSA 506 + + ++ + + S + + A Sbjct: 357 ---------ERYLNMGESCIT------------IAQKTNEKARMALSSLIHALHELDSCA 395 Query: 507 VVRLSSEFSDDAKNA--AMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLE 564 V R+ + D A +I G++ D + D ++ +F L + ++ + Sbjct: 396 VARIVKKDGSDPLIVLLAPLIEPGLEGLIDVPLPFAEDVRAYRFAPLDKVFNSSGGIMEK 455 Query: 565 QSTAGGNFNKDKEA 578 N K + Sbjct: 456 HKNLPSNDLKAAMS 469 >gi|296505336|ref|YP_003667036.1| minor extracellular protease VpR [Bacillus thuringiensis BMB171] gi|296326388|gb|ADH09316.1| minor extracellular protease VpR [Bacillus thuringiensis BMB171] Length = 1407 Score = 39.8 bits (91), Expect = 1.5, Method: Composition-based stats. Identities = 61/385 (15%), Positives = 116/385 (30%), Gaps = 65/385 (16%) Query: 336 YNLQQFKSVAAPDVARIKLM--------STFDAKKFL----QKIDDEY-ISNPSLSLAST 382 Y Q S+A+P +A + + S FD K L + ++Y + Sbjct: 610 YERMQGTSMASPHIAGVAALILQEHPDYSPFDVKASLMNTANDLKEKYSVYEVGAGRVDA 669 Query: 383 MMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKI 442 A E K + +N + I + I +F + + Sbjct: 670 YNAV-RTETSFKVLDTTQTVVNDEVIEVPEETGSIAFGKF------------YQKDGEAL 716 Query: 443 KDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDN 502 + N+ ++ NK E+ + +T++ PAS +++ N + S E + + Sbjct: 717 EQKRNIKVTNHNKQEKKEFKTEISYTPASSTIN--DAVANGVKISAPETITLDAGKTGEI 774 Query: 503 GLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKS----------------- 545 V + I + ++ + ++ Sbjct: 775 EAKINVPAGVKQGRYEGYIH--ITNTKNKEETYQIPFSIRVSEPGIENAILSRKAISTDT 832 Query: 546 NKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQ 605 +KF+ +SY++ + KD + K LG + L Sbjct: 833 SKFNPYGESYVHGAFQLNSELETLDLIVKDSKTNKA--LGFIGTLNTSGLKTDVYYYLDS 890 Query: 606 IVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVG----SGKYM 661 G N P G K + + G+Y + Y G G Y+ Sbjct: 891 FFNGKVYPFTN--DPAKPIGDEKINLPE---------GDYTIEFVGYDKAGKARVKGDYV 939 Query: 662 IIKNGNPKVDKEG-KTIIINVEDVN 685 +I N P+V G K I + + N Sbjct: 940 VIDNTPPEVKLTGLKPGIYELNEEN 964 >gi|229158492|ref|ZP_04286553.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus ATCC 4342] gi|228625011|gb|EEK81777.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus ATCC 4342] Length = 1407 Score = 39.8 bits (91), Expect = 1.5, Method: Composition-based stats. Identities = 53/382 (13%), Positives = 115/382 (30%), Gaps = 59/382 (15%) Query: 336 YNLQQFKSVAAPDVARIKLM--------STFDAKKFLQKIDDEYISNPSLSLASTMMATK 387 Y Q S+A+P +A + + S FD K L D+ + + Sbjct: 610 YERMQGTSMASPHIAGVAALILQEHPEYSPFDVKASLMNTADDLK----EKYSVYEVGAG 665 Query: 388 YKEILEKSH--------RQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFV 439 + H + +N + I + I +F Sbjct: 666 RVDAYNAVHTETSFKVLDTTKTVVNDEVIEVPEETGSIAFGKF------------YQKDG 713 Query: 440 KKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHI 499 + ++ N+ + NK E+ + +T++ PAS S++ N + S E + Sbjct: 714 EALEQKRNIKVENHNKQEKKEFKTEISYTPASSSIN--DAIANGVKVSVPETITLDAGKS 771 Query: 500 EDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTP 559 ++ V + I + ++ + ++ + Sbjct: 772 DEIEAKVNVPAGVKQGRYEGYIH--ITNTKNKEETYQIPFSIRVSEPGIED-----AVLS 824 Query: 560 LTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESM 619 + T N ++ + L S ++ + + + + A+ + G + ++ Sbjct: 825 RKSISTDTTKFNPYRESYVHGAFQLNSELETLDLIVKDSKTNKALGFI-GTLNTSGLKTD 883 Query: 620 L-----------MPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVG----SGKYMIIK 664 + + P + G+Y + Y G G Y+II Sbjct: 884 VYYYLESFFNGKVYPFTNDPAKPIGDEKINLPE-GDYTIEFVGYDKAGKARVKGDYVIID 942 Query: 665 NGNPKVDKEG-KTIIINVEDVN 685 N P+V G K I + + N Sbjct: 943 NTPPEVKLTGLKPGIYELNEEN 964 >gi|229147445|ref|ZP_04275794.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus BDRD-ST24] gi|228636127|gb|EEK92608.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus BDRD-ST24] Length = 1407 Score = 39.8 bits (91), Expect = 1.6, Method: Composition-based stats. Identities = 61/385 (15%), Positives = 116/385 (30%), Gaps = 65/385 (16%) Query: 336 YNLQQFKSVAAPDVARIKLM--------STFDAKKFL----QKIDDEY-ISNPSLSLAST 382 Y Q S+A+P +A + + S FD K L + ++Y + Sbjct: 610 YERMQGTSMASPHIAGVAALILQEHPDYSPFDVKASLMNTANDLKEKYSVYEVGAGRVDA 669 Query: 383 MMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKI 442 A E K + +N + I + I +F + + Sbjct: 670 YNAV-RTETSFKVLDTTQTVVNDEVIEVPEETGSIAFGKF------------YQKDGEAL 716 Query: 443 KDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDN 502 + N+ ++ NK E+ + +T++ PAS +++ N + S E + + Sbjct: 717 EQKRNIKVTNHNKQEKKEFKTEISYTPASSTIN--DAVANGVKISAPETITLDAGKTGEI 774 Query: 503 GLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKS----------------- 545 V + I + ++ + ++ Sbjct: 775 EAKINVPAGVKQGRYEGYIH--ITNTKNKEETYQIPFSIRVSEPGIENAILSRKAISTDT 832 Query: 546 NKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQ 605 +KF+ +SY++ + KD + K LG + L Sbjct: 833 SKFNPYGESYVHGAFQLNSELETLDLIVKDSKTNKA--LGFIGTLNTSGLKTDVYYYLDS 890 Query: 606 IVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVG----SGKYM 661 G N P G K + + G+Y + Y G G Y+ Sbjct: 891 FFNGKVYPFTN--DPAKPIGDEKINLPE---------GDYTIEFVGYDKAGKARVKGDYV 939 Query: 662 IIKNGNPKVDKEG-KTIIINVEDVN 685 +I N P+V G K I + + N Sbjct: 940 VIDNTPPEVKLTGLKPGIYELNEEN 964 >gi|160901818|ref|YP_001567399.1| flagellar basal body rod protein [Petrotoga mobilis SJ95] gi|160359462|gb|ABX31076.1| flagellar basal body rod protein [Petrotoga mobilis SJ95] Length = 240 Score = 39.8 bits (91), Expect = 1.7, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 47/141 (33%), Gaps = 14/141 (9%) Query: 553 DSYINTPL-TKLEQSTAGGNFNKDKEA--IKLYLLGSMKDS--GNYTLNRVRVSDAMQIV 607 +S N N K I+ Y K GN + V + + + + Sbjct: 19 NSISNNLANLNTVGYKKDFNLFKSVYEKEIRSYQNAQGKGEPIGNI-YSSVVLDEVIPNL 77 Query: 608 LGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKYMIIKNGN 667 V + + F + + G Y ++ Y++ K+G+ Sbjct: 78 EQGPLVETSGK---LDFAIEGDGF-----FKIERNGNYFYTRNGEFSLSPENYLVTKSGD 129 Query: 668 PKVDKEGKTIIINVEDVNRDE 688 +D+E I++++++ DE Sbjct: 130 YVLDEENNPILVDLDNFFVDE 150 >gi|77736177|ref|NP_001029787.1| tumor protein D54 [Bos taurus] gi|74354962|gb|AAI03374.1| Tumor protein D52-like 2 [Bos taurus] gi|296481163|gb|DAA23278.1| tumor protein D52-like 2 [Bos taurus] Length = 186 Score = 39.8 bits (91), Expect = 1.8, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 54/165 (32%), Gaps = 8/165 (4%) Query: 453 FNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSS 512 N + L + P +V A L++++ E +R+ A +E+ +V L Sbjct: 10 LNSPNKGLLSDSMTDVPVDTAVAAQAPAVEGLTETEAEELRAELAKVEEE----IVTLRQ 65 Query: 513 EFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFD-SLYDSYINTPLTKLEQSTAGGN 571 + ++ + K T + + D + ++Y T T + Sbjct: 66 VLAAKERHCGEL---KRKLGLSTLEGLKQNLSRSWHDMQVSNAYKKTQETLSQAGQKTSA 122 Query: 572 FNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNIN 616 + LG M++S + RV V+G + Sbjct: 123 ALSTMGSAISRKLGDMRNSATFKSFEDRVGTIKSKVVGGRENGGD 167 >gi|296314927|ref|ZP_06864868.1| chaperone protein DnaK [Neisseria polysaccharea ATCC 43768] gi|296838116|gb|EFH22054.1| chaperone protein DnaK [Neisseria polysaccharea ATCC 43768] Length = 642 Score = 39.4 bits (90), Expect = 1.9, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 52/159 (32%), Gaps = 12/159 (7%) Query: 520 NAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAI 579 AA ++ K+ + + + D+++ +L + + Sbjct: 67 YAAKRLIGHKFEGKEVQRDIESMPFEIIKANNGDAWVKAQGKELSPPQISAEVLRKMKEA 126 Query: 580 KLYLLGSMKDSGNYTL-------NRVRVSDAMQIVLGNTPVNINE-SMLMPPRGMSKTDF 631 LG T+ R DA +I + INE + GM K D Sbjct: 127 AEAYLGEKVTEAVITVPAYFNDSQRQATKDAGRIAGLDVKRIINEPTAAALAFGMDKGDN 186 Query: 632 EDRLWYATKDT--GEYDPYTIKYMNV-GSGKYMIIKNGN 667 +DR A D G +D I+ N+ G ++ ++ Sbjct: 187 KDR-KVAVYDLGGGTFDISIIEIANLDGDKQFEVLATNG 224 >gi|30022936|ref|NP_834567.1| minor extracellular protease VpR precursor [Bacillus cereus ATCC 14579] gi|229130153|ref|ZP_04259114.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus BDRD-Cer4] gi|29898495|gb|AAP11768.1| Minor extracellular protease VPR precursor [Bacillus cereus ATCC 14579] gi|228653368|gb|EEL09245.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus BDRD-Cer4] Length = 1407 Score = 39.4 bits (90), Expect = 2.0, Method: Composition-based stats. Identities = 61/385 (15%), Positives = 116/385 (30%), Gaps = 65/385 (16%) Query: 336 YNLQQFKSVAAPDVARIKLM--------STFDAKKFL----QKIDDEY-ISNPSLSLAST 382 Y Q S+A+P +A + + S FD K L + ++Y + Sbjct: 610 YERMQGTSMASPHIAGVAALILQEHPDYSPFDVKASLMNTANDLKEKYSVYEVGAGRVDA 669 Query: 383 MMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKI 442 A E K + +N + I + I +F + + Sbjct: 670 YNAV-RTETSFKVLDTTQTVVNDEVIEVPEETGSIAFGKF------------YQKDGEAL 716 Query: 443 KDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDN 502 + N+ ++ NK E+ + +T++ PAS +++ N + S E + + Sbjct: 717 EQKRNIKVTNHNKQEKKEFKTEISYTPASSTIN--DAVANGVKISAPETITLDAGKTGEI 774 Query: 503 GLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKS----------------- 545 V + I + ++ + ++ Sbjct: 775 KAKINVPAGVKQGRYEGYIH--ITNTKNKEETYQIPFSIRVSEPGIENAILSRKAISTDT 832 Query: 546 NKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQ 605 +KF+ +SY++ + KD + K LG + L Sbjct: 833 SKFNPYGESYVHGAFQLNSELETLDLIVKDSKTNKA--LGFIGTLNTSGLKTDVYYYLDS 890 Query: 606 IVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVG----SGKYM 661 G N P G K + + G+Y + Y G G Y+ Sbjct: 891 FFNGKVYPFTN--DPAKPIGDEKINLPE---------GDYTIEFVGYDKAGKARVKGDYV 939 Query: 662 IIKNGNPKVDKEG-KTIIINVEDVN 685 +I N P+V G K I + + N Sbjct: 940 VIDNTPPEVKLTGLKPGIYELNEEN 964 >gi|319954804|ref|YP_004166071.1| type i restriction-modification system, m subunit [Cellulophaga algicola DSM 14237] gi|319423464|gb|ADV50573.1| type I restriction-modification system, M subunit [Cellulophaga algicola DSM 14237] Length = 905 Score = 39.4 bits (90), Expect = 2.2, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 63/241 (26%), Gaps = 31/241 (12%) Query: 405 QDAISWGIKYKQIDPLR-----FDTEE-SFADSLRQRAGFVKKIKDDYNLTTSHFNKTEE 458 D G + ID L F+ F+ + +A + D Y HF Sbjct: 98 NDPTKLGSDKEMIDTLDALIGIFENPALDFS---KNKADGDDILGDAYEFLMRHFATQSG 154 Query: 459 NQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVR------SSFAHIEDNGLSAVVRLSS 512 S ++ I G N ++SD S + S V Sbjct: 155 KSKGQFYTPSEVSRTMASIIGINNVDTNSDITVYDPTCGSGSLLLKVGTEAKSKVTLYGQ 214 Query: 513 EFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFD-------SLYDSYINTPLTKLEQ 565 E A + + + + NT K D +D + P ++ Sbjct: 215 EKDATTAGLARMNMILHDNPTAEIKQGNTLSKPLFEDPKLEANLKTFDFVVANPPFSDKR 274 Query: 566 STAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRG 625 + G DK +G+Y V + ++ P G Sbjct: 275 WSNGLTLPDDKYNRFADYGIPPSKNGDYAFLLHIVRS---------LKRNGKGAIILPHG 325 Query: 626 M 626 + Sbjct: 326 V 326 >gi|308063751|gb|ADO05638.1| vacuolating cytotoxin VacA-like protein [Helicobacter pylori Sat464] Length = 2391 Score = 39.4 bits (90), Expect = 2.2, Method: Composition-based stats. Identities = 37/326 (11%), Positives = 84/326 (25%), Gaps = 19/326 (5%) Query: 337 NLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSH 396 + A A + + D I++ + T K+ ++ Sbjct: 434 TQFLGDDLQAKAKATYQQDLSNSQTALNNATSDNKIASSDTGYTNNQNTTIKKD--AQNL 491 Query: 397 RQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKT 456 + Q++ QD + ++ L T + Q + FN Sbjct: 492 ENTSQQIAQDEQTLKGDLDKLQQLANATTGFNEQAFNQAQSTEQHDLQTLQNEEKTFNSE 551 Query: 457 EENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSD 516 +E L + + + +T ++ V ++ + + V + Sbjct: 552 QEG-LEQAIAQAKHTSPTPSPTPTKHTAQNTPPNKVSPPTQNLPTTDVWSGVYWLQNQTY 610 Query: 517 DAKNAAMVI--LSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNK 574 + + LSG Q + + L S+ Sbjct: 611 SKQGVYYIDPNLSGQSGQ------SGNTLSTYTANLLGRSFGVNIQNGTLIIGNNTESVN 664 Query: 575 DKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDR 634 D I + G +G + + +++ G + S D Sbjct: 665 DNGLIWIGHGGFGYITGTFNAANIYLTN--NFKTGEGVSGSDGGGANITFKASDNITMDG 722 Query: 635 LWYATKDTGEYDPYTIKYMNVGSGKY 660 L Y +T K + G+ ++ Sbjct: 723 LNYNDAET------VTKMIQTGASQH 742 >gi|229048585|ref|ZP_04194145.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus AH676] gi|229112338|ref|ZP_04241877.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus Rock1-15] gi|228671178|gb|EEL26483.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus Rock1-15] gi|228722787|gb|EEL74172.1| Peptidase Vpr; Serine peptidase; MEROPS family S08A [Bacillus cereus AH676] Length = 1407 Score = 39.4 bits (90), Expect = 2.4, Method: Composition-based stats. Identities = 61/385 (15%), Positives = 116/385 (30%), Gaps = 65/385 (16%) Query: 336 YNLQQFKSVAAPDVARIKLM--------STFDAKKFL----QKIDDEY-ISNPSLSLAST 382 Y Q S+A+P +A + + S FD K L + ++Y + Sbjct: 610 YERMQGTSMASPHIAGVAALILQEHPDYSPFDVKASLMNTANDLKEKYSVYEVGAGRVDA 669 Query: 383 MMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKI 442 A E K + +N + I + I +F + + Sbjct: 670 YNAV-RTETSFKVLDTTQTVVNDEVIEVPEETGSIAFGKF------------YQKDGEAL 716 Query: 443 KDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDN 502 + N+ ++ NK E+ + +T++ PAS +++ N + S E + + Sbjct: 717 EQKRNIKVTNHNKQEKKEFKTEISYTPASSTIN--DAVANGVKISAPETITLDAGKTGEI 774 Query: 503 GLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKS----------------- 545 V + I + ++ + ++ Sbjct: 775 EAKINVPAGVKQGRYEGYIH--ITNTKNKEETYQIPFSIRVSEPGIENAILSRKAISTDT 832 Query: 546 NKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQ 605 +KF+ +SY++ + KD + K LG + L Sbjct: 833 SKFNPYGESYVHGAFQLNSELETLDLIVKDSKTNKA--LGFIGTLNTSGLKTDLYYYLDS 890 Query: 606 IVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVG----SGKYM 661 G N P G K + + G+Y + Y G G Y+ Sbjct: 891 FFNGKVYPFTN--DPAKPIGDEKINLPE---------GDYTIEFVGYDKAGKARVKGDYV 939 Query: 662 IIKNGNPKVDKEG-KTIIINVEDVN 685 +I N P+V G K I + + N Sbjct: 940 VIDNTPPEVKLTGLKPGIYELNEEN 964 >gi|224499224|ref|ZP_03667573.1| hypothetical protein LmonF1_05837 [Listeria monocytogenes Finland 1988] Length = 1023 Score = 39.0 bits (89), Expect = 2.7, Method: Composition-based stats. Identities = 58/313 (18%), Positives = 111/313 (35%), Gaps = 35/313 (11%) Query: 254 EDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKE---TQAQARRIDAQLNHGI 310 E V+K + +AGWD+ + + L E L+ + T R+E Q QA + L+ G Sbjct: 456 ELVNKRVKMAGWDKQKQVEEQTLAELLTEKTEIETTIKREELKRQQEQAATLAIHLHDGD 515 Query: 311 ------TVKPSELIPLEDYTQAYGVE-QGTELYNLQQFKSVAAPDVARIK-LMSTFDAKK 362 +V EL + +E +L QQ A +++ + ++ + Sbjct: 516 ACPVCGSVSHPELAKFGESANLETLEVASAKLNEKQQAIHSAEKSISQFEWQLNEWAEIA 575 Query: 363 FLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRF 422 + E + ++ T E L+ + ++ I ++ +I+ + Sbjct: 576 DVNL--AEVEQTLAENIQQAKNLTAQIETLQA------KVAQKETIQATLETLKINQNKA 627 Query: 423 DTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYN 482 + E++ A V+ + L N E++ K + + + Sbjct: 628 EIEKN------NIAVQVENLHQQVQLAAGKLNYLEQSIPADLRDKAVFEHKKNELNTSIQ 681 Query: 483 TLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTD 542 T E V ++F E L+S A ILS + ++ + T Sbjct: 682 T-HIEQAERVDAAFRQAEKATTQLESTLAS--------AQKAILSAQEALQEQREIFKTA 732 Query: 543 HKSNKFDSLYDSY 555 K N F S YD+Y Sbjct: 733 MKQNDFSS-YDAY 744 >gi|321454600|gb|EFX65765.1| hypothetical protein DAPPUDRAFT_303533 [Daphnia pulex] Length = 931 Score = 39.0 bits (89), Expect = 2.9, Method: Composition-based stats. Identities = 67/299 (22%), Positives = 109/299 (36%), Gaps = 38/299 (12%) Query: 186 IDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSA 245 ++ +LKQ L E K T +I H + + FE +R YK+ SD S + Sbjct: 492 LERKLKQNLTGEKKL---TKEINARHLLERKNFETQVRREYKQTKERWKQELSSDESTTK 548 Query: 246 RER--SLEVVEDVSKAIGLAGWDRLDDTKRRRL-LEHLSSRDNAL---NTKLRKETQAQA 299 R+R SL+ +D KAI +RL +++ L LE R L + R+ + Sbjct: 549 RQRDASLQNYKDSLKAIEAREEERLTKSQKDYLELEVRRLRRAKLLSYHRLERELLCEEL 608 Query: 300 RRIDAQLN--HGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMST 357 R AQL H + ++ EL +Y Q V L Q + Sbjct: 609 NRRQAQLETAHAMLLRHHELTQELEYRQQRAVH---MLREEQLRRQ---------HQTEL 656 Query: 358 FDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQI 417 + ++ Q+ + E ++ L + K KEIL + Q I Sbjct: 657 SNQHEYTQRAERELRKKHAVELKQQPKSLKQKEILIR-----KQFRETVKIQTRQYKALK 711 Query: 418 DPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDL 476 + T Q+ +KK+KDD + + E + L K +S+ L Sbjct: 712 AQVLASTPRD------QQKSVIKKLKDDQRRKLALLGEQYEQSIAEMLQK----QSIRL 760 >gi|302505046|ref|XP_003014744.1| hypothetical protein ARB_07306 [Arthroderma benhamiae CBS 112371] gi|291178050|gb|EFE33841.1| hypothetical protein ARB_07306 [Arthroderma benhamiae CBS 112371] Length = 670 Score = 39.0 bits (89), Expect = 3.0, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 41/121 (33%), Gaps = 10/121 (8%) Query: 418 DPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLI 477 + F E+F + AG +K D F + K + D + Sbjct: 533 SQVDFTNPENFYHDVNSVAGLLKLFFRDL--PDPLFTNERYSAFIDAARKDDDIQRRDAL 590 Query: 478 RGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAM---VILSGMKHQKD 534 N+L D + +R+ H+ V SSE +A N A+ + L G ++ Sbjct: 591 HALINSLPDPNYATLRALILHLNH-----VQERSSENRMNAGNIAISFGLTLMGTNAGRN 645 Query: 535 T 535 Sbjct: 646 I 646 >gi|221506845|gb|EEE32462.1| conserved hypothetical protein [Toxoplasma gondii VEG] Length = 1141 Score = 39.0 bits (89), Expect = 3.1, Method: Composition-based stats. Identities = 76/468 (16%), Positives = 142/468 (30%), Gaps = 77/468 (16%) Query: 191 KQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDS--TSLSDVSDSARE- 247 KQ+LL +E+L + P V + R KDS +++ + RE Sbjct: 271 KQELLKFKEEQLPLRNL-------PEVLDELGRRIQANASTQKDSQLAFQAELCQNQREY 323 Query: 248 ----RSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRID 303 LE +E+ S+A + L RLLE ++ L T+ +ET Sbjct: 324 RDLRDRLENMEEQSRAHIKTNHETL------RLLE-----EHKLATQQLRET-------- 364 Query: 304 AQLNHGITVKPSELIPLEDY---TQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 H ++ EL +++ + Q E+ Q A ++ R LMS Sbjct: 365 ----HEELLRAQELRLRDEFQLQLAEFSKNQSQEVSTQMQDLRKALDELRRETLMS---Q 417 Query: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 +K + + + + S +KE+ +Q QD+++ ++ Sbjct: 418 QKASEPERLDLMREAAEKRCSEGEQQLWKELEVIKKQQLELLRQQDSLAHQKSQQEKAER 477 Query: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVD----- 475 E + R + Y + + T + + P S Sbjct: 478 VEKKLEDIRSFVSDREKDA---QKHYLDLMAALSCTASSSPSNGHFRAPLYSSSQKEVAL 534 Query: 476 --------LIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSS----EFSDDAKNAAM 523 + + S E S + + NG S V S + A+ A Sbjct: 535 HAALLLRCTFDAIHAAATASFSEESSCSDSQVPRNGTSVSVEASDRQTVAVARSAEEGAE 594 Query: 524 VILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYL 583 I DT +TD +N+ ++++ S + Sbjct: 595 DIPLKRSDSTDTSAGQSTDRLANRMTPASETFLAAGANAALGSRVDSPRGEA-------- 646 Query: 584 LGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDF 631 S N A++ + S ++ P G ++ F Sbjct: 647 ------SDNSAALGQSWKAAVRTSQAASSGVGGTSGVLSPGGELESAF 688 >gi|237831383|ref|XP_002364989.1| hypothetical protein, conserved [Toxoplasma gondii ME49] gi|211962653|gb|EEA97848.1| hypothetical protein, conserved [Toxoplasma gondii ME49] Length = 1141 Score = 39.0 bits (89), Expect = 3.1, Method: Composition-based stats. Identities = 76/468 (16%), Positives = 142/468 (30%), Gaps = 77/468 (16%) Query: 191 KQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDS--TSLSDVSDSARE- 247 KQ+LL +E+L + P V + R KDS +++ + RE Sbjct: 271 KQELLKFKEEQLPLRNL-------PEVLDELGRRIQANASTQKDSQLAFQAELCQNQREY 323 Query: 248 ----RSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRID 303 LE +E+ S+A + L RLLE ++ L T+ +ET Sbjct: 324 RDLRDRLENMEEQSRAHIKTNHETL------RLLE-----EHKLATQQLRET-------- 364 Query: 304 AQLNHGITVKPSELIPLEDY---TQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 H ++ EL +++ + Q E+ Q A ++ R LMS Sbjct: 365 ----HEELLRAQELRLRDEFQLQLAEFSKNQSQEVSTQMQDLRKALDELRRETLMS---Q 417 Query: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 +K + + + + S +KE+ +Q QD+++ ++ Sbjct: 418 QKASEPERLDLMREAAEKRCSEGEQQLWKELEVIKKQQLELLRQQDSLAHQKSQQEKAER 477 Query: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVD----- 475 E + R + Y + + T + + P S Sbjct: 478 VEKKLEDIRSFVSDREKDA---QKHYLDLMAALSCTASSSPSNGHFRAPLYSSSQKEVAL 534 Query: 476 --------LIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSS----EFSDDAKNAAM 523 + + S E S + + NG S V S + A+ A Sbjct: 535 HAALLLRCTFDAIHAAATASFSEESSCSDSQVPRNGTSVSVEASDRQTVAVARSAEEGAE 594 Query: 524 VILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYL 583 I DT +TD +N+ ++++ S + Sbjct: 595 DIPLKRSDSTDTSAGQSTDRLANRMTPASETFLAAGANAALGSRVDSPRGEA-------- 646 Query: 584 LGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDF 631 S N A++ + S ++ P G ++ F Sbjct: 647 ------SDNSAALGQSWKAAVRTSQAASSGVGGTSGVLSPGGELESAF 688 >gi|260805003|ref|XP_002597377.1| hypothetical protein BRAFLDRAFT_118210 [Branchiostoma floridae] gi|229282640|gb|EEN53389.1| hypothetical protein BRAFLDRAFT_118210 [Branchiostoma floridae] Length = 870 Score = 38.6 bits (88), Expect = 3.2, Method: Composition-based stats. Identities = 50/349 (14%), Positives = 122/349 (34%), Gaps = 25/349 (7%) Query: 359 DAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSM--QELNQD-AISWGIKYK 415 ++ L++ + E S LA + E + + + + + + N D + + Sbjct: 341 ALQEQLKQKEREKQSLLQERLALKTEVVEKLEKVAELNEKVLDLEVENNDMHVKLSASHM 400 Query: 416 QIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVD 475 Q++ ++ T SF R+R G ++ ++ E + +VK P + + + Sbjct: 401 QMERMKTMTPSSFG---RRRHGEIES---------GFDSQRESSAESMTVVKAPRTPTPE 448 Query: 476 LIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDT 535 + A T+ + + + S ++ + S V + FS D ++ +S + + Sbjct: 449 PLPPAGPTVEEETLKALNHSALELDKSYRSLKVIVKDGFS-DLQDLTKKAISALGQISEM 507 Query: 536 ETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTL 595 ++ + + + ++ Q N ++ +D Sbjct: 508 DSSSGNEVEELRVLYRKEAL---------QRKLLYNQLQELRGNIRVFCRCRRDDRVGGY 558 Query: 596 NRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNV 655 + + + + G + + + P + FED L Y+ + Y Sbjct: 559 MQFPNDEDIVVPTGGSKKTFSFDKVFSPASTQEQVFEDTLPIVQSCVDGYNVCILAYGQT 618 Query: 656 GSGKYMIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNE 704 GSGK + +TI + N E ++ T++ +++NE Sbjct: 619 GSGKTFTMMGPPDYPGVNIRTIKELLRICNDKETVDYTLKISMVEVYNE 667 >gi|302664420|ref|XP_003023840.1| hypothetical protein TRV_02037 [Trichophyton verrucosum HKI 0517] gi|291187858|gb|EFE43222.1| hypothetical protein TRV_02037 [Trichophyton verrucosum HKI 0517] Length = 669 Score = 38.6 bits (88), Expect = 3.4, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 41/121 (33%), Gaps = 10/121 (8%) Query: 418 DPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLI 477 + F E+F + AG +K D F + K + D + Sbjct: 532 SQVDFTNPENFYHDVNSVAGLLKLFFRDL--PDPLFTNERYSAFIDAARKDDDIQRRDAL 589 Query: 478 RGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAM---VILSGMKHQKD 534 N+L D + +R+ H+ V SSE +A N A+ + L G ++ Sbjct: 590 HALINSLPDPNYATLRALILHLNH-----VQERSSENRMNAGNIAISFGLTLMGTNAGRN 644 Query: 535 T 535 Sbjct: 645 I 645 >gi|149391787|emb|CAO00730.1| putative cellulosomal scaffoldin protein precursor [Ruminococcus flavefaciens] Length = 1503 Score = 38.6 bits (88), Expect = 3.7, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 46/143 (32%), Gaps = 9/143 (6%) Query: 474 VDLIRGAYNTLSDSDKEGVR--SSFAHIEDN-GLSAVVRLSSEFSDDAKNAAMVILSGMK 530 + L+ L+ GV+ ++ ++ D+ SAV+R + S + ++ VIL Sbjct: 1363 MQLLDKDGKPLTVPAYIGVKGDATLDNMVDSVDASAVLRYYASISTNDRDVYDVILQSTV 1422 Query: 531 HQKDTETRYN--TDHKSNKFDSLYDSYIN-TPLTKLEQSTAGGNFNKDKEAIKLYLLGSM 587 ET + + + D + + TK N D I Y Sbjct: 1423 AGLKVETPKDELDELAAFLADVNTNEWSADNWKTKKNGRRIDSN---DASKILAYYARRS 1479 Query: 588 KDSGNYTLNRVRVSDAMQIVLGN 610 + N ++ + Sbjct: 1480 SSDYDNVSNYDIWNEVLGTARFG 1502 >gi|322816009|gb|EFZ24470.1| hypothetical protein TCSYLVIO_9383 [Trypanosoma cruzi] Length = 1041 Score = 38.6 bits (88), Expect = 3.8, Method: Composition-based stats. Identities = 51/305 (16%), Positives = 103/305 (33%), Gaps = 40/305 (13%) Query: 245 ARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALN---------TKLRKET 295 A ER LE ++++ + AG +R K+RR R++ L + R++T Sbjct: 175 AEERRLEKLQEMKSKLVAAG-ERSFQAKQRRTQAEERLRNDTLERHENGARRVEEAREQT 233 Query: 296 QAQARR-----IDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKS----VAA 346 + +ARR + +LN+ + + L+ L+ Q + LQ+ Sbjct: 234 KERARRGHSRVEEVRLNYELRNQTKALL-LDRKMSEVEHNQEQKREELQRHAQERNEAML 292 Query: 347 PDVARIKLMSTFDAKKFL----------------QKIDDEYISNPSLSLASTMMATKYKE 390 R + +S ++ +K++ + + + + Sbjct: 293 AVAERRRNLSQERIERQQQREQQRRENLRRHEEQKKLEKDLKEQRAEEWEKKVQQRQRDA 352 Query: 391 ILEKSHR-QSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLT 449 E R + ++E Q ++ + +++ + + +E RQR K Sbjct: 353 CAEAEARGRKVEERIQQSVQLREEKREMLRQKLEKQEQKIKEARQRKEREADTKPTIKEL 412 Query: 450 TSHFNKTEENQLRTQLVK---RPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSA 506 + EE QL +L K A + + + S K+ RS I S Sbjct: 413 MPFISPQEEEQLAQKLAKVVAVTARQGKSYMEIYQKESTMSAKDLNRSKLRSIISRLGSN 472 Query: 507 VVRLS 511 S Sbjct: 473 TSSAS 477 >gi|326471562|gb|EGD95571.1| rho GTPase activator [Trichophyton tonsurans CBS 112818] Length = 651 Score = 38.6 bits (88), Expect = 3.9, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 41/121 (33%), Gaps = 10/121 (8%) Query: 418 DPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLI 477 + F E+F + AG +K D F + K + D + Sbjct: 514 SQVDFTNPENFYHDVNSVAGLLKLFFRDL--PDPLFTNERYSAFIDAARKDDDIQRRDAL 571 Query: 478 RGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAM---VILSGMKHQKD 534 N+L D + +R+ H+ V SSE +A N A+ + L G ++ Sbjct: 572 HALINSLPDPNYATLRALILHLNH-----VQERSSENRMNAGNIAISFGLTLMGTNAGRN 626 Query: 535 T 535 Sbjct: 627 I 627 >gi|253686481|ref|YP_003015671.1| cellulose synthase operon C domain protein [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753059|gb|ACT11135.1| cellulose synthase operon C domain protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 1164 Score = 38.6 bits (88), Expect = 3.9, Method: Composition-based stats. Identities = 49/245 (20%), Positives = 83/245 (33%), Gaps = 32/245 (13%) Query: 266 DRLDDTKRRRLLEH------LSSRDNAL---NTKLRKETQAQARRIDAQLNHGITVKPSE 316 +RL+ K L++ ++ DNAL N L ++ QAR++D Sbjct: 339 NRLNSGKWEGLIQSTQYWTTIAEGDNALKANNLSLARQKYQQARQMD-------NTNAYA 391 Query: 317 LIPLEDYTQAYGVEQGTELYNLQQFKSVAAPD------VARIKLMSTFDAKKFLQKIDDE 370 LI L D A + + Q D V + S A +L + Sbjct: 392 LIGLGDVAVASKNDAAAQPLYQQALHLEPGNDNALRGLVGIYQRQSPEKALAYLNSLPRS 451 Query: 371 YISNPSLSLAS------TMMATKYKEI--LEKSHRQSMQELNQDAISWGIKYKQIDPLRF 422 + +LA+ A + E ++ + Q QD + Y LR Sbjct: 452 QQNTMRETLAALQLNILKQQADQLSEQQQWTQAEEKYRQASQQDPNDIWLAYHYAQTLRQ 511 Query: 423 DTEESFADSLRQRAGFVK--KIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 + ADS+ QRA V + +Y + + + Q L P ++ D +R Sbjct: 512 LGQTQQADSVVQRATAVPPTSAEKNYVYSLYLSSTNRDEQALAHLNTLPTAQWSDDMREL 571 Query: 481 YNTLS 485 L+ Sbjct: 572 SQRLT 576 >gi|326481866|gb|EGE05876.1| rho GTPase activator [Trichophyton equinum CBS 127.97] Length = 651 Score = 38.6 bits (88), Expect = 4.0, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 41/121 (33%), Gaps = 10/121 (8%) Query: 418 DPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLI 477 + F E+F + AG +K D F + K + D + Sbjct: 514 SQVDFTNPENFYHDVNSVAGLLKLFFRDL--PDPLFTNERYSAFIDAARKDDDIQRRDAL 571 Query: 478 RGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAM---VILSGMKHQKD 534 N+L D + +R+ H+ V SSE +A N A+ + L G ++ Sbjct: 572 HALINSLPDPNYATLRALILHLNH-----VQERSSENRMNAGNIAISFGLTLMGTNAGRN 626 Query: 535 T 535 Sbjct: 627 I 627 >gi|327306031|ref|XP_003237707.1| rho GTPase activator [Trichophyton rubrum CBS 118892] gi|326460705|gb|EGD86158.1| rho GTPase activator [Trichophyton rubrum CBS 118892] Length = 651 Score = 38.6 bits (88), Expect = 4.0, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 41/121 (33%), Gaps = 10/121 (8%) Query: 418 DPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLI 477 + F E+F + AG +K D F + K + D + Sbjct: 514 SQVDFTNPENFYHDVNSVAGLLKLFFRDL--PDPLFTNERYSAFIDAARKDDDIQRRDAL 571 Query: 478 RGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAM---VILSGMKHQKD 534 N+L D + +R+ H+ V SSE +A N A+ + L G ++ Sbjct: 572 HALINSLPDPNYATLRALILHLNH-----VQERSSENRMNAGNIAISFGLTLMGTNAGRN 626 Query: 535 T 535 Sbjct: 627 I 627 >gi|331269450|ref|YP_004395942.1| DNA repair protein RecN [Clostridium botulinum BKT015925] gi|329126000|gb|AEB75945.1| DNA repair protein RecN [Clostridium botulinum BKT015925] Length = 564 Score = 38.6 bits (88), Expect = 4.2, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 51/135 (37%), Gaps = 11/135 (8%) Query: 571 NFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPV------NINESMLMPPR 624 NF +E + G +G + + DA+ VLG N + + Sbjct: 9 NFALIEELTINFEKGFNVLTGETGAGKSILIDAINYVLGGKFNKGLIRTGENRTFVEAIF 68 Query: 625 GMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGK---TIIINV 681 + T + L + D D I SGK ++ NG + + + +I+IN+ Sbjct: 69 DIENTTIANVLKHM--DISCEDLLIISRETFKSGKSVVKANGKSLLIADIRKISSILINI 126 Query: 682 EDVNRDERMESTIRH 696 + ++ + ++ +H Sbjct: 127 HGQHDNQELLNSAKH 141 >gi|83745917|ref|ZP_00942974.1| Peptidyl-prolyl cis-trans isomerase [Ralstonia solanacearum UW551] gi|207743054|ref|YP_002259446.1| peptidyl-prolyl cis-trans isomerase protein [Ralstonia solanacearum IPO1609] gi|83727607|gb|EAP74728.1| Peptidyl-prolyl cis-trans isomerase [Ralstonia solanacearum UW551] gi|206594451|emb|CAQ61378.1| peptidyl-prolyl cis-trans isomerase protein [Ralstonia solanacearum IPO1609] Length = 648 Score = 38.6 bits (88), Expect = 4.2, Method: Composition-based stats. Identities = 38/267 (14%), Positives = 83/267 (31%), Gaps = 39/267 (14%) Query: 266 DRLDDTKRRRLLEHLSSRDN--ALNTKLRKETQ-----AQARRIDAQLNHGITVKPSELI 318 D LD ++R++ + +SR N +TK+R+ + AQ R+ D + + +K Sbjct: 87 DVLDGIIQQRMMVNEASRTNLSVADTKVRETIEQIPAVAQLRKPDGKFDTDAYIK----- 141 Query: 319 PLEDYTQAYGV--EQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPS 376 A G+ EQ + + + + ++ D + Sbjct: 142 ----LLAAQGMTPEQFDARLRSELVLQQIPQSIVSSAFVPKSLVDRLIEARDQQREVQAL 197 Query: 377 L----SLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSL 432 L AS + + +++ + Q+ +Y + Sbjct: 198 LLKPADYASKVAVD---DKAIQAYYDAHQQEFAVPEQVKAEYVVFSGEDMMKQIPVTPE- 253 Query: 433 RQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGV 492 ++K+ Y+ + F E+ + L+K P + A + E V Sbjct: 254 --------QLKEYYDQNAARFKTEEQRRAAHILIKLPDNAK-----PADKDAAKKRAEEV 300 Query: 493 RSSFAHIEDNGLSAVVRLSSEFSDDAK 519 + N + S + A+ Sbjct: 301 LAEVRKTPGNFAELAKKYSEDPGSAAQ 327 >gi|312212433|emb|CBX92516.1| predicted protein [Leptosphaeria maculans] Length = 308 Score = 38.2 bits (87), Expect = 4.3, Method: Composition-based stats. Identities = 40/254 (15%), Positives = 75/254 (29%), Gaps = 42/254 (16%) Query: 355 MSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHR-------QSMQELNQDA 407 + T D + + + IS AS E LE + Q Sbjct: 20 LPTSDHSYYAEVAREHRISLNPPKRASHTSRKVMHEHLEPITTPKSTPYYTANQLFQIAP 79 Query: 408 ISWGIKYKQIDPLRFD----TEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRT 463 G+ + I P+ + T S A S +R Y + ++ + + Sbjct: 80 ---GMTLEDISPVSSEQSPLTPSSIATSNTRRQSSF-----HYGIPPLTRHQMPQPAISQ 131 Query: 464 QLVKRPASESV-------------DLIRGAYNTLSDSDKEGVRSSFAHIED-----NGLS 505 + PAS+ +R +++T S + S A + + + Sbjct: 132 DVQVSPASDHFQNYPYRMEPESHQRTMRTSFHTASSTAGFVYGSDTAGLPQTCFLSSNGA 191 Query: 506 AVVRLSSEFSDDAKNA---AMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTK 562 S D + I G T Y + F+ Y+SY N + + Sbjct: 192 P--EYSRPSHDLPPYIYPPSPHIQGGSNLCDKTARYYEYESSPRTFNHAYNSYYNVNVGQ 249 Query: 563 LEQSTAGGNFNKDK 576 S + +++K Sbjct: 250 WTSSASAYPLDENK 263 >gi|71401106|ref|XP_803264.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70866081|gb|EAN81818.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 1041 Score = 38.2 bits (87), Expect = 4.5, Method: Composition-based stats. Identities = 51/305 (16%), Positives = 103/305 (33%), Gaps = 40/305 (13%) Query: 245 ARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALN---------TKLRKET 295 A ER LE ++++ + AG +R K+RR R++ L + R++T Sbjct: 175 AEERRLEKLQEMKSKLVAAG-ERSFQAKQRRTQAEERLRNDTLERHENGARRVEEAREQT 233 Query: 296 QAQARR-----IDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQ----FKSVAA 346 + +ARR + +LN+ + + L+ L+ Q + LQ+ Sbjct: 234 KERARRGHSRVEEVRLNYELRNQTKALL-LDRKMSEVEHNQEQKREELQRHAQERNEAML 292 Query: 347 PDVARIKLMSTFDAKKFL----------------QKIDDEYISNPSLSLASTMMATKYKE 390 R + +S ++ +K++ + + + + Sbjct: 293 AVAERRRNLSQERIERQQQREQQRRENLRRHEEQKKLEKDLKEQRAEEWEKKVQQRQRDA 352 Query: 391 ILEKSHR-QSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLT 449 E R + ++E Q ++ + +++ + + +E RQR K Sbjct: 353 CAEAEARGRKVEERIQQSVHLREEKREMLRQKLEKQEQKIKEARQRKEREADTKPTIKEL 412 Query: 450 TSHFNKTEENQLRTQLVK---RPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSA 506 + EE QL +L K A + + + S K+ RS I S Sbjct: 413 MPFISSQEEEQLAQKLAKVVAVTARQGKSYMEIYQKESTMSAKDLNRSKLRSIISRLGSN 472 Query: 507 VVRLS 511 S Sbjct: 473 TSSAS 477 >gi|313205668|ref|YP_004044845.1| tetratricopeptide tpr_1 repeat-containing protein [Riemerella anatipestifer DSM 15868] gi|312444984|gb|ADQ81339.1| Tetratricopeptide TPR_1 repeat-containing protein [Riemerella anatipestifer DSM 15868] gi|315022600|gb|EFT35626.1| TPR-domain containing protein [Riemerella anatipestifer RA-YM] gi|325336890|gb|ADZ13164.1| Tetratricopeptide TPR-1 [Riemerella anatipestifer RA-GD] Length = 986 Score = 38.2 bits (87), Expect = 4.6, Method: Composition-based stats. Identities = 45/298 (15%), Positives = 90/298 (30%), Gaps = 29/298 (9%) Query: 320 LEDYTQAYGVEQGTELYNLQQFKSVA---APDVARIKLMSTFDAKKFLQKIDDEYISNPS 376 L D Y + ++ Y L Q A+I M A+ + D+ + Sbjct: 556 LNDAIAIYNNAETSDEYTLFQKAMALGFKGDTEAKISEMKKLVAQYPNSEYKDDALYEIG 615 Query: 377 LSLA---STMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLR 433 + A ++++Y + + K+ + N + +Q + Sbjct: 616 TAYAANDEFALSSEYFDKVVKTSTDTHLVANAEIYMVQNDIEQ-------------NQEA 662 Query: 434 QRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVR 493 + +K + Y T + +K L + + + L S+ + + Sbjct: 663 KAFSGIKSLAKKYQ-NTVYADKVLAVARSLYLKTGDTAGYQNFAKNLGVGLDKSEIDEIN 721 Query: 494 SSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTET-----RYNTDHKSNKF 548 S A L A + K A + S Q E + D K+ K Sbjct: 722 LSTAR----QLYAKKQYEKAIPYYEKYLAQNVSSNTAFQAQYELGESYYQSGDDAKALKS 777 Query: 549 DSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQI 606 + +Y N + +A +K+K L + DS N + + M+ Sbjct: 778 FNFVTAYPNDYQEEARIRSAQILLSKNKGEEAAEHLAKLVDSNNPKIKSFAQQELMKY 835 >gi|322378477|ref|ZP_08052928.1| Methyl-accepting chemotaxis protein [Helicobacter suis HS1] gi|321149079|gb|EFX43528.1| Methyl-accepting chemotaxis protein [Helicobacter suis HS1] Length = 432 Score = 38.2 bits (87), Expect = 4.6, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 64/215 (29%), Gaps = 12/215 (5%) Query: 333 TELYNLQQFKSVAAPDV-ARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEI 391 T+ Y + + K ++ +L+ + + A + Sbjct: 206 TQEYRDFWNRLKSGQFTRGTFKRLTKSGKTVYLEASYNPVFNRDGEIYKFIKFAWDATDR 265 Query: 392 LEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTS 451 EK+H ++ ++ A++++ + V + + ++ Sbjct: 266 EEKAHMVIDLIRENQQLTDHGNEMIEK--TTSNIQNVANTMKNNSNLVNTLSAQSDSIST 323 Query: 452 HFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLS 511 ++ +T L+ A+ I A V + + +V +S Sbjct: 324 ITQTIKDVADQTNLLALNAA-----IEAARAGEHGRGFAVVADEVRKLAERTGKSVTEIS 378 Query: 512 SEFSDDAKNAAMV---ILSGMK-HQKDTETRYNTD 542 + + + V I +G + QK + Y T Sbjct: 379 AIIGQIREITSQVVTDITAGTQESQKTVDLSYETK 413 >gi|260588935|ref|ZP_05854848.1| hypothetical membrane protein [Blautia hansenii DSM 20583] gi|260540714|gb|EEX21283.1| hypothetical membrane protein [Blautia hansenii DSM 20583] Length = 730 Score = 38.2 bits (87), Expect = 4.7, Method: Composition-based stats. Identities = 63/368 (17%), Positives = 121/368 (32%), Gaps = 50/368 (13%) Query: 269 DDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYG 328 DD + +L +L+ ++NA+ K+ + R+ A+L T+ L ++ + Sbjct: 128 DDVEHSDILYYLNEKENAIAPKVTDKGAGAVRKQIAEL-FTKTISQVGLQLMDSLSSMMS 186 Query: 329 VEQGTELYNLQQFK--------SVAAPDVARIKLMSTF------DAKKFLQKIDDEYISN 374 E E+ A+ + M+ FL++ +N Sbjct: 187 KEDSQEMIGRVHENITEIADSLLAASGTIKAFSNMTDSMGNTLESTSSFLKQTGSNADTN 246 Query: 375 PSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFAD---S 431 SL + K+ L + L + + +Q+D + A S Sbjct: 247 LSLLTDTGKSVKSLKDALSGTTDSVNDVLAAGSDVYQAVSEQVDETFSSLSQDAAAAGQS 306 Query: 432 LRQRAGFVKKIKDDY----------NLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAY 481 L AG ++ I D Y + + + QL T + K AS + Sbjct: 307 LNGLAGQIQTIIDRYAAFRDSLQGISDSLPEVFPHIKEQLGTVIGKLNAS--IAQQEAVR 364 Query: 482 NTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNT 541 + L + + + S + D G SE + AK +A I ++ +T Y Sbjct: 365 DKL-NEAADKISQS---VSDAGN-----YQSELKELAKQSAQQI-------QNVQTDYEK 408 Query: 542 DHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVS 601 + K N D L+ + N T + S ++ + I+ + D G + Sbjct: 409 NVKGN-LDQLFQTLSN---TNKDVSLILSKLDEGAQGIEDLSGSAASDLGKLKTALDTSA 464 Query: 602 DAMQIVLG 609 + G Sbjct: 465 SLLDEASG 472 >gi|322380318|ref|ZP_08054532.1| methyl-accepting chemotaxis protein [Helicobacter suis HS5] gi|321147253|gb|EFX41939.1| methyl-accepting chemotaxis protein [Helicobacter suis HS5] Length = 432 Score = 38.2 bits (87), Expect = 4.9, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 63/215 (29%), Gaps = 12/215 (5%) Query: 333 TELYNLQQFKSVAAPDV-ARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEI 391 T+ Y + + K ++ +L+ + A + Sbjct: 206 TQEYRDFWNRLKSGQFTRGTFKRLTKSGKTVYLEASYNPVFDRDGEIYKFIKFAWDATDR 265 Query: 392 LEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTS 451 EK+H ++ ++ A++++ + V + + ++ Sbjct: 266 EEKAHMVIDLIRENQQLTDHGNEMIEK--TTSNIQNVANTMKNNSNLVNTLSAQSDSIST 323 Query: 452 HFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLS 511 ++ +T L+ A+ I A V + + +V +S Sbjct: 324 ITQTIKDVADQTNLLALNAA-----IEAARAGEHGRGFAVVADEVRKLAERTGKSVTEIS 378 Query: 512 SEFSDDAKNAAMV---ILSGMK-HQKDTETRYNTD 542 + + + V I +G + QK + Y T Sbjct: 379 AIIGQIREITSQVVTDITAGTQESQKTVDLSYETK 413 >gi|71668012|ref|XP_820950.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70886314|gb|EAN99099.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 1172 Score = 37.9 bits (86), Expect = 5.6, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 33/89 (37%), Gaps = 4/89 (4%) Query: 389 KEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAG-FVKKIKDDYN 447 ++ +SH +++ ++D W + + D +ES A RQR + +K Y Sbjct: 982 FDLFIRSHEDVLEQRSEDDKEWDWEKSHEQEMDEDNDESNAAYFRQRFTRLIGWVKRHYG 1041 Query: 448 ---LTTSHFNKTEENQLRTQLVKRPASES 473 + + L + + A + Sbjct: 1042 KYLSNDAITSFGTSGTLSEFMRECSAPQR 1070 >gi|320011392|gb|ADW06242.1| band 7 protein [Streptomyces flavogriseus ATCC 33331] Length = 349 Score = 37.9 bits (86), Expect = 6.6, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 55/145 (37%), Gaps = 4/145 (2%) Query: 352 IKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEIL---EKSHRQSMQELNQDAI 408 K++ A ++ SL S + + ++L EK ++ ++ A+ Sbjct: 93 FKVVDPASAVIQVEDYRFAVSQMAQTSLRSIIGKSDLDDLLSDREKLNQGLELMIDSPAV 152 Query: 409 SWGIKYKQIDPLRFDTEESFADSLRQRA-GFVKKIKDDYNLTTSHFNKTEENQLRTQLVK 467 WG++ +++ E+ S+ ++A ++ N + Q Q+ Sbjct: 153 GWGVQIDRVEIKDVSLPETMKRSMARQAEADRERRARVINADAELQASKKLAQAAQQMST 212 Query: 468 RPASESVDLIRGAYNTLSDSDKEGV 492 +PA+ + L++ ++ + V Sbjct: 213 QPAALQLRLLQTVVAVAAEKNSTLV 237 >gi|319900345|ref|YP_004160073.1| Tetratricopeptide TPR_1 repeat-containing protein [Bacteroides helcogenes P 36-108] gi|319415376|gb|ADV42487.1| Tetratricopeptide TPR_1 repeat-containing protein [Bacteroides helcogenes P 36-108] Length = 617 Score = 37.9 bits (86), Expect = 6.8, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 60/205 (29%), Gaps = 14/205 (6%) Query: 372 ISNPSLSLASTMMATKYKEILEKSH--RQSMQELNQDAISWGIKYKQIDPLRFDTEESFA 429 P + A +A+ Y+E +K RQ + + K + E+ Sbjct: 271 EEEPENAQAMYSLASYYEETGQKELYERQLDTLMLNKKVPSDTKLNVMRQFIMQNEQEGK 330 Query: 430 DS---LRQRAGFVKKIKDDYNLTTSH--------FNKTEENQLRTQLVKRPASESVDLIR 478 DS + +++ DD L + NK LR L PA+ + ++ Sbjct: 331 DSTRVITLFGRIMEQDPDDAQLPMLYVQYLLAKNMNKEALPVLRQVLDIDPANTAARMML 390 Query: 479 GAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKN-AAMVILSGMKHQKDTET 537 + E +R A IE + VI K Sbjct: 391 IGDAAQKEDAPEIIRLCEAGIETSPDKLEFYYYLAMGYAQTERTDDVIAVCRKALGHVNA 450 Query: 538 RYNTDHKSNKFDSLYDSYINTPLTK 562 + S+ + + DSY K Sbjct: 451 DSKKEVVSDFYAFIGDSYHTKKQNK 475 >gi|118389852|ref|XP_001027971.1| hypothetical protein TTHERM_00497830 [Tetrahymena thermophila] gi|89309741|gb|EAS07729.1| hypothetical protein TTHERM_00497830 [Tetrahymena thermophila SB210] Length = 797 Score = 37.5 bits (85), Expect = 7.5, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 58/154 (37%), Gaps = 9/154 (5%) Query: 329 VEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKY 388 E + Y Q+F + +D + QK+ + + N + + ++++ Sbjct: 617 EENQKKQYEEQKFNQNLGQY-GNQNINKQYDQQDEFQKLLEIHRKNFNEN-RQNFISSEL 674 Query: 389 KEILEKSHRQSM--QELNQDAISWGIKYKQIDPLRFDTEESF-ADSLRQRAG--FVKKIK 443 + K M Q D + I+++Q+ L + + L+ + + + Sbjct: 675 DKQFSKGIDYRMLDQLTVDDLSNLLIQFQQLKMLESPELQGLNVNELQNQFNNLNIDPLS 734 Query: 444 DDYNLTTSHFNKTEENQLRT--QLVKRPASESVD 475 + L S+ N E +L Q++++ ++V Sbjct: 735 QNLGLDFSNLNPQEMMELAQFNQMLQQNILQTVQ 768 >gi|71031650|ref|XP_765467.1| hypothetical protein [Theileria parva strain Muguga] gi|68352423|gb|EAN33184.1| hypothetical protein TP02_0899 [Theileria parva] Length = 1406 Score = 37.5 bits (85), Expect = 8.1, Method: Composition-based stats. Identities = 45/384 (11%), Positives = 108/384 (28%), Gaps = 60/384 (15%) Query: 268 LDDTKRRRLLEHLSSRD----NALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDY 323 LDD K + + S++ A + ET +D+Q++ + K EL E Sbjct: 375 LDDFKHKPMFREEKSQEIDSHEAESDSNSDETLEDEESMDSQMDQFVDQKDEEL--FEST 432 Query: 324 TQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFL------QKIDDEYISNPSL 377 + + ++ + ++ ++ ++ + +KI+ E + Sbjct: 433 VRCESFKSSERKPEEFDLNNLTSDEL----VLDDQQVEELVKNVEIPEKIEPEADEKEAE 488 Query: 378 SLASTMMATKYKEILE---KSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQ 434 LA T + + + ++ L + K L+ F D Sbjct: 489 ELAETEDIENLYKDISYVMGTLKRINTYLESKPPKFVANSKTYQELQASLHRDFFDKYLT 548 Query: 435 RAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESV---DLIRGAYNTLSDSDKEG 491 K + EE QL ++ + + DL L Sbjct: 549 YGSDKLSKKK--------LLEQEEKQLAQ--EEKKVRDEILGKDLFSNMDENLYKRLSNI 598 Query: 492 VRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSL 551 + V+ + + + + + + Sbjct: 599 YLPRKNSTFSSN---VLPYDNSLV-------------KSSISSSVPPFPSYSTYSINTCI 642 Query: 552 YDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNT 611 + Y N P+T + N +D Y ++ S L++ Sbjct: 643 DNPYNNAPVTIDSHTPTSYNPGRDTYYTINYNPADLQKSSEL-LDKK-----------GI 690 Query: 612 PVNINESMLMPPRGMSKTDFEDRL 635 + + ++ +G+ + +++L Sbjct: 691 YSDSKRAEIVFTKGIPSDNLKNKL 714 >gi|210134803|ref|YP_002301242.1| flagellin A [Helicobacter pylori P12] gi|210132771|gb|ACJ07762.1| flagellin A [Helicobacter pylori P12] Length = 510 Score = 37.5 bits (85), Expect = 8.5, Method: Composition-based stats. Identities = 32/239 (13%), Positives = 76/239 (31%), Gaps = 21/239 (8%) Query: 384 MATKYKEILEKSHRQSMQELNQDAISWGIKY-----KQIDPLRFDTEESFADSLRQRAGF 438 MA + + + L Q+A+ ++ + + + ADSLR +A Sbjct: 1 MAFQVNTNINAMNAHVQSALTQNALKTSLERLSSGLRINKAADDASGMTVADSLRSQASS 60 Query: 439 VKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAH 498 + + + N Q+ + E + ++ + + ++G + Sbjct: 61 LGQAIANTNDGMGII----------QVADKAMDEQLKILDTVKVKATQAAQDGQTTESRK 110 Query: 499 IEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINT 558 + + +++ + +LSG K+ + ++ S+ S +T Sbjct: 111 AIQSDIVRLIQGLDNIGNTTTYNGQALLSGRFTNKEFQVGAYSN------QSIKASIGST 164 Query: 559 PLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINE 617 K+ Q + + TL V+VS + +G IN+ Sbjct: 165 TSDKIGQVRIATGALITASGDISLTFKQVDGVNDVTLESVKVSSSAGTGIGVLAEVINK 223 >gi|119510966|ref|ZP_01630088.1| hypothetical protein N9414_01295 [Nodularia spumigena CCY9414] gi|119464405|gb|EAW45320.1| hypothetical protein N9414_01295 [Nodularia spumigena CCY9414] Length = 692 Score = 37.1 bits (84), Expect = 9.6, Method: Composition-based stats. Identities = 47/352 (13%), Positives = 99/352 (28%), Gaps = 40/352 (11%) Query: 326 AYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISN-PSLSLASTMM 384 A+ +QG +N D+ + + D F Q DE N P + ++ Sbjct: 116 AWWEDQGRPKFNPF--------DIKTVGQYESVDPTSFHQSKFDEEAKNLPIETTEPILL 167 Query: 385 ATKYKEILEKSH---RQSMQELNQDAISWGIKYKQI-DPLRFDTEESFADSLRQRAGFVK 440 + + + R+ +SW + + P F + S R + Sbjct: 168 QPQQYSLETINILAFRKFNILNKIQPVSWASDVEALEKPRSFQIIQP--ASTRAYEILQR 225 Query: 441 KIKDDYNLTTSHFNKTEE--NQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAH 498 IK ++T E + +R+ + D ++ L D + S + H Sbjct: 226 AIKQTPQVSTPVVGDEEALKSAIRSLMNSSQVKP--DAVKVIKEALGDKQ---ITSEYWH 280 Query: 499 IEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTET------------RYNTD-HKS 545 +G A ++ + ++ ++ L M K + Sbjct: 281 TLFDGQGAKTAINQKI-YSSQMVRLITLRAMVIPKTIPEFLTWLNVKAGKNPDENETISL 339 Query: 546 NKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQ 605 N ++ + LT+ + D + L + + R + ++ Sbjct: 340 NFQSAVGKQFPKYQLTEGIKLLLPKLLQGDITPELFHWL--FIKGRAWDVCRSEFINDVR 397 Query: 606 IVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGS 657 L + N F+ W K G + Y + Sbjct: 398 YDLQLIYKHFNSGQQTHTFSFDALKFQKVTW--QKIIGSWRAINGGYYQLDD 447 >gi|294954476|ref|XP_002788187.1| hypothetical protein Pmar_PMAR006880 [Perkinsus marinus ATCC 50983] gi|239903402|gb|EER19983.1| hypothetical protein Pmar_PMAR006880 [Perkinsus marinus ATCC 50983] Length = 887 Score = 37.1 bits (84), Expect = 9.6, Method: Composition-based stats. Identities = 46/289 (15%), Positives = 100/289 (34%), Gaps = 22/289 (7%) Query: 247 ERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRI---- 302 E L +EDV+ A D+ + RR+ E + + ++R+E +AQ +RI Sbjct: 181 EEKLRALEDVAAAASAGN----DEDRHRRIEERMLRYKRECDNRVREEVEAQVQRIRSIE 236 Query: 303 --DAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 +A+L ++ + E+ + + Q+ + + + Sbjct: 237 LPNARLEAAAQLRAEQRTAREELEREAKEREARFKLVEQRTLERLRLKESELDRRAVSSR 296 Query: 361 KKFLQKIDDE--YISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQID 418 + L+ ++ E + + +LA+ + E + +RQ +QE + + +++ + Sbjct: 297 ETLLRDLEIERAGVREKARALAARENELAHWEKRIEGNRQDIQEREEKLRTLRQQFEDLK 356 Query: 419 PLRF--------DTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPA 470 E+ +L ++ V K++ + L T A Sbjct: 357 NKEVELATLKIKAEMETDRRALIEQQNEVDKLRACLETEEVRLGAVHDTHLSTGQQLLTA 416 Query: 471 SESVDLIRGAYNTLSDSDKEGVRS--SFAHIEDNGLSAVVRLSSEFSDD 517 D + L +S E R +N AV ++ E S Sbjct: 417 ERERDALAVQVEQLQESLAEARRQSRLLEATIENDRRAVEKMEHELSMT 465 >gi|168050999|ref|XP_001777944.1| predicted protein [Physcomitrella patens subsp. patens] gi|162670704|gb|EDQ57268.1| predicted protein [Physcomitrella patens subsp. patens] Length = 429 Score = 37.1 bits (84), Expect = 9.6, Method: Composition-based stats. Identities = 31/256 (12%), Positives = 73/256 (28%), Gaps = 27/256 (10%) Query: 382 TMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKK 441 T+ KY + + + Q A + + L+ T E S+ + Sbjct: 29 TLWPGKYHSPVTAALWHASSSFAQRARNVSMDSPAQITLQSKTPEENRTSVVYKFSSDTS 88 Query: 442 IKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIED 501 ++++Y + F ++ L A + G SD + + Sbjct: 89 LREEYRNPWNLF------RIGKLLEDLDA------LAGTIAVKHCSDSDHTTRPLLLVTA 136 Query: 502 NGLSAVVRLSSEFSDDAKNAAMVILSGMKHQK---DTETRYNTDHKSNKFDSLYDSYINT 558 + + D + A V G + + + F + + Sbjct: 137 SVDKISTKKRLRVDVDLELAGSVTWVGRSSMEIRMEVIQPKDDVALIANFTFVARDWRTQ 196 Query: 559 ---------PLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLN-RVRVSDAMQI-- 606 P T+ E+ +D KL + + + + R++ ++ Sbjct: 197 KAAPVNHLVPQTEEEKQLFAMGAARDAHRKKLRMGNQLSSTSPFAKEYNDRLTQILREGR 256 Query: 607 VLGNTPVNINESMLMP 622 VL P + + ++ Sbjct: 257 VLCEMPALGDRNSILM 272 >gi|145501983|ref|XP_001436971.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404117|emb|CAK69574.1| unnamed protein product [Paramecium tetraurelia] Length = 485 Score = 37.1 bits (84), Expect = 9.6, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 68/199 (34%), Gaps = 23/199 (11%) Query: 335 LYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEY--ISNPSLSLASTMMATKYKEIL 392 LYN + + A ++ ++L + K + + E + + ++ L Sbjct: 156 LYNNEINEKYAFSNLETVQLFQKLNELKQINRQRSEQPQEYVELEGQRFYHVLSDLQDYL 215 Query: 393 EKSHRQ---SMQELNQD---AISWGIKYKQIDPLRF-----DTEESFADS----LRQRAG 437 +K + + I ++ KQ++P +++ L RA Sbjct: 216 DKKVDKKTLIHEMRAFQAFYPIRRKLQVKQLNPGDSLCQGTESQTDITQKATYRLFDRAR 275 Query: 438 FVKKIK-DDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSD-----SDKEG 491 + +K + +N FN E QL + S+ ++ +R S + Sbjct: 276 TLANMKCEVFNTNNYVFNVKEVPNFGQQLRRNHFSQLIEELRENGLKTPREINEVSPENS 335 Query: 492 VRSSFAHIEDNGLSAVVRL 510 +++S +N + + + Sbjct: 336 IKASQVESSENDVKPIRKF 354 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.309 0.124 0.308 Lambda K H 0.267 0.0385 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 4,230,790,841 Number of Sequences: 14124377 Number of extensions: 130556008 Number of successful extensions: 316355 Number of sequences better than 10.0: 267 Number of HSP's better than 10.0 without gapping: 32 Number of HSP's successfully gapped in prelim test: 235 Number of HSP's that attempted gapping in prelim test: 316136 Number of HSP's gapped (non-prelim): 336 length of query: 707 length of database: 4,842,793,630 effective HSP length: 147 effective length of query: 560 effective length of database: 2,766,510,211 effective search space: 1549245718160 effective search space used: 1549245718160 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 85 (37.4 bits)