BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781204|ref|YP_003065617.1| hypothetical protein CLIBASIA_05555 [Candidatus Liberibacter asiaticus str. psy62] (707 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781204|ref|YP_003065617.1| hypothetical protein CLIBASIA_05555 [Candidatus Liberibacter asiaticus str. psy62] gi|254040881|gb|ACT57677.1| hypothetical protein CLIBASIA_05555 [Candidatus Liberibacter asiaticus str. psy62] gi|317120670|gb|ADV02493.1| hypothetical protein SC1_gp045 [Liberibacter phage SC1] gi|317120814|gb|ADV02635.1| hypothetical protein SC1_gp045 [Candidatus Liberibacter asiaticus] Length = 707 Score = 1462 bits (3786), Expect = 0.0, Method: Compositional matrix adjust. Identities = 707/707 (100%), Positives = 707/707 (100%) Query: 1 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR 60 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR Sbjct: 1 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR 60 Query: 61 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ 120 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ Sbjct: 61 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ 120 Query: 121 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH 180 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH Sbjct: 121 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH 180 Query: 181 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD 240 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD Sbjct: 181 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD 240 Query: 241 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR 300 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR Sbjct: 241 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR 300 Query: 301 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA Sbjct: 301 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 Query: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL Sbjct: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 Query: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA Sbjct: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 Query: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN 540 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN Sbjct: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN 540 Query: 541 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV 600 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV Sbjct: 541 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV 600 Query: 601 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY 660 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY Sbjct: 601 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY 660 Query: 661 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP 707 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP Sbjct: 661 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP 707 >gi|315122890|ref|YP_004063379.1| hypothetical protein CKC_05730 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496292|gb|ADR52891.1| hypothetical protein CKC_05730 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 583 Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 146/560 (26%), Positives = 255/560 (45%), Gaps = 37/560 (6%) Query: 147 EDNLNMTVGLAAAHVLHDPSNENY-FQRVQS--ITDHINSLPIDLRLKQKLLSEAKEKLN 203 E ++ T A ++ P+NEN QR + + D P D K++ L + L+ Sbjct: 6 EKHIGDTAVALALNIRLSPTNENLELQRNRGYDVIDKSLLTPEDKVKKKQALDKT---LH 62 Query: 204 TNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLA 263 +Q IG + +P + N Y KD S ++++A ++E+ D K G Sbjct: 63 KSQTIGLLSNEPDIVGNLSSLVYGSPVAVKDGLSPDQIAENADGGTIEI--DEHKLDGKT 120 Query: 264 GWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRI-------DAQLNHG-ITVKPS 315 G+ +D R E L S + N K E Q+ +R+ + N G I+ Sbjct: 121 GYTGIDSLSR----EDLKSLLDEHNRKTNAERQSGKKRVIETIKLRTTEANKGNISSDYD 176 Query: 316 ELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNP 375 E+ + ++ Y + + K AP + ++ M+ + +F+ ++ + Sbjct: 177 EVFSESNLSRYYQPADVESIITQAKLKKDIAPYIRVVETMTNEEYAEFVSTVNSRTVD-- 234 Query: 376 SLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEES-FADSLRQ 434 L A + + L+ S++EL++D W + + + P E A S+ Sbjct: 235 -YDLNDRFKAQAFLKELQDKRVASLKELSKDPHGWQ-RSRGLVPPNLSLEAGQLASSVLP 292 Query: 435 RAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNT--LSDSDKEGV 492 +K + D+ + +E QL ++ A + V R ++ SD E + Sbjct: 293 IFDANEKTEKDHGVIVKGMGTDKERQLSEKIKGERAEDFVSYFRDEMTKEGVTKSDIEKI 352 Query: 493 RSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLY 552 +S ++D S++ RL+ S +A+ +A+ ++SG+KH+ D E + + K N L+ Sbjct: 353 KSVVDGMKDKVTSSICRLAMSDSAEARASAIPVISGVKHRGDIELKLESS-KGNGVKKLF 411 Query: 553 DSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTP 612 ++ IN + +L Q + N+ +D E IKLY++G+M +GNYTLN V DA++ V GNT Sbjct: 412 NNLINKEIGQLYQGSEDANYKQDAEVIKLYIMGNMHKTGNYTLNGEVVRDAVKAVFGNTA 471 Query: 613 VNINESMLMPPRGMSKTDFEDRLWYATKD--TGEYD-------PYTIKYMNVGSGKYMII 663 +N S +MPPRGMS +F +RL T D G + P + Y + G GKY + Sbjct: 472 YAVNGSYVMPPRGMSHYEFGNRLHGLTSDKLVGLFGDKSKDRYPESYGYQSEGDGKYSLT 531 Query: 664 KNGNPKVDKEGKTIIINVED 683 G K DK+G I+IN+ D Sbjct: 532 VGGVYKKDKQGNPIVINIYD 551 >gi|315121928|ref|YP_004062417.1| hypothetical protein CKC_00890 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495330|gb|ADR51929.1| hypothetical protein CKC_00890 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 547 Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 141/550 (25%), Positives = 246/550 (44%), Gaps = 35/550 (6%) Query: 147 EDNLNMTVGLAAAHVLHDPSNENY-FQRVQS--ITDHINSLPIDLRLKQKLLSEAKEKLN 203 E ++ T A ++ P+NEN QR + + D P D K++ L + L+ Sbjct: 6 EKHIGDTAVALALNIRLSPTNENLELQRNRGYDVIDKSLLTPEDKVKKKQALDKT---LH 62 Query: 204 TNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLA 263 +Q IG + +P + N Y KD S ++++A ++E+ D K G Sbjct: 63 KSQTIGLLSNEPDIVGNLSSLVYGSPVAVKDGLSPDQIAENADGGTIEI--DEHKLDGKT 120 Query: 264 GWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRI-------DAQLNHG-ITVKPS 315 G+ +D R E L S + N K E Q+ +R+ + N G I+ Sbjct: 121 GYTGIDSLSR----EDLKSLLDEHNRKTNAERQSGKKRVIETIKLRTTEANKGNISSDYD 176 Query: 316 ELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNP 375 E+ + ++ Y + + K AP + ++ M+ + +F+ ++ + Sbjct: 177 EVFSESNLSRYYQPADVESIITQAKLKKDIAPYIRVVETMTNEEYAEFVSTVNSRTVD-- 234 Query: 376 SLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQR 435 L A + + L+ S++EL++D W + P A S+ Sbjct: 235 -YDLNDRFKAQAFLKELQDKRVASLKELSKDPHGWQRSRGLVPPNLSLEAGQLASSVLPI 293 Query: 436 AGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNT--LSDSDKEGVR 493 +K + D+ + +E QL ++ A + V R ++ SD E ++ Sbjct: 294 FDANEKTEKDHGVIVKGMGTDKERQLSEKIKGERAEDFVSYFRDEMTKEGVTKSDIEKIK 353 Query: 494 SSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYD 553 S ++D S++ RL+ S +A+ +A+ ++SG+KH+ D E + + K N L++ Sbjct: 354 SVVDGMKDKVTSSICRLAMSDSAEARASAIPVISGVKHRGDIELKLESS-KGNGVKKLFN 412 Query: 554 SYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPV 613 + IN + +L Q + N+ +D E IKLY++G+M +GNYTLN V DA++ V GNT Sbjct: 413 NLINKEIGQLYQGSEDANYKQDAEVIKLYIMGNMHKTGNYTLNGEVVRDAVKAVFGNTAY 472 Query: 614 NINESMLMPPRGMSKTDFEDRLWYATKD--TGEYD-------PYTIKYMNVGSGKYMIIK 664 +N S +MPPRGMS +F +RL T D G + P + Y + G GKY + Sbjct: 473 AVNGSYVMPPRGMSHYEFGNRLHGLTSDKLVGLFGDKSKDRYPESYGYQSEGDGKYSLTV 532 Query: 665 NGNPKVDKEG 674 G K DK+G Sbjct: 533 GGVYKKDKQG 542 >gi|317120711|gb|ADV02533.1| hypothetical protein SC2_gp045 [Liberibacter phage SC2] gi|317120772|gb|ADV02593.1| hypothetical protein SC2_gp045 [Candidatus Liberibacter asiaticus] Length = 753 Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 139/711 (19%), Positives = 284/711 (39%), Gaps = 64/711 (9%) Query: 16 PNQDAVRDSVNPQAGLRDLGDALGKATQFL-EGIRRDNAFATANTRYTELSFKAVQDFHD 74 P ++ V+ +P GLRD+ + L ++T+ + E +RR +AF A++ + + + +A + F++ Sbjct: 16 PVKEHVQQRYSPDEGLRDIANKLNQSTKLIEETVRRQSAFK-ADSEFLQTTVEAEKVFNE 74 Query: 75 FTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKG 134 L + + ++ + I+ Y + + + +VR+ Q + + K Sbjct: 75 KLQGLQGGNEEEVKKNLESILSNDIKPLYGKMHEKLDYPEVRRHILQQSNKQMARFQLKA 134 Query: 135 VDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKL 194 + ++G + E+ + + +L D S+ NY ++VQ +D I+ LP++ K Sbjct: 135 NEYKLGFQVQRTEEGIREIERSVSNSLLLDGSDSNYREKVQFASDAIDKLPLNPTTKDAR 194 Query: 195 LSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVE 254 +E Q +P+VF +F++ +G D ++++D+ D+ + V + Sbjct: 195 RRTTRENAALAQAQRYMVDNPQVFSSFIQKREGQGSKIDDKSTIADIVDNTPLDTFSVND 254 Query: 255 DVSKAIG-LAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK 313 V GW L K+ +L+ +++ + R + + I + L GIT K Sbjct: 255 TVVDNTNPFEGWKDLSTQKKIAILKEVANASTTGKQEERSRVGTKIKNIASFLERGITPK 314 Query: 314 PSELIPL--EDYTQAYGVEQG----TELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKI 367 + L ++ +G +G ++L L++F AP + +I LM + L K+ Sbjct: 315 NIQDADLSEDNLRHLFGNTRGQMLSSQLQTLKEF----APKINQIFLMPNDQVETELTKL 370 Query: 368 DDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEES 427 + + + + + K + + +E + I W I + + D Sbjct: 371 KPDVQNLVGAEFKNKLYSDGVKAVYKND-----KERKEHPIKWAITHGLTQEMP-DEPSK 424 Query: 428 FADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDS 487 + D R + +K Y +++ +K E +L L K + + V + L Sbjct: 425 WGDFFVHRREVSQTLKGFYGVSSPLISKEESRKLTQYLDKLESKDVVSHLEDYAERLGGL 484 Query: 488 DKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNK 547 + +E + V L DD + A +L G K++ D + Sbjct: 485 KNSVWIEAMQSLEHPVMGEVGLL---LKDDP-DVASDVLRGYKYRVRNRGENIRDITGER 540 Query: 548 FDSLYDSYINTPLTKLEQST---AGGNFNKDKEAIKLYLLGSMKDSGNYTLNRV------ 598 FD+ + + S F++ + + +L G + Y+ + Sbjct: 541 FDAWFKERFREKYGNVFASIGDYGDAEFDRASKLVGYHLAGELLSDKTYSFHEAPTQSLK 600 Query: 599 -------------RVSDAMQIVLGNTPVNINE--SMLMPPRGMSKTDFEDRLWYATKDTG 643 + + + V+GN PV + + S L+PPRGMS DF ++ + Sbjct: 601 EWVLRQPASQYDKNLQHSFEAVVGNIPVPMGDFKSPLIPPRGMSSEDFTNKFTAVAQQVF 660 Query: 644 EYDPYTIK---YMNV-----GSGKYMIIKNG-----NPKVDKEGKTIIINV 681 E + K Y N+ GS +Y+ +K G NPK G ++I V Sbjct: 661 EESGLSGKEYGYENIGPTSDGSAQYL-VKFGREYIHNPKT---GAPLVIEV 707 >gi|85059171|ref|YP_454873.1| hypothetical protein SG1193 [Sodalis glossinidius str. 'morsitans'] gi|84779691|dbj|BAE74468.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 843 Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 84/400 (21%), Positives = 156/400 (39%), Gaps = 57/400 (14%) Query: 321 EDYTQAYGVEQGTELY----NLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPS 376 +++T+AYG + G + Y + QQ S D++ ++ +S + L + P Sbjct: 450 DEFTRAYGYDAGQKAYANYLDTQQLGS----DISTVQQLSPVGQQALLNAREPV----PG 501 Query: 377 LSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRA 436 A A K E L+K+ + QD + + + KQI PL T E+F L R Sbjct: 502 EGYAE---AAKRHETLQKAVEYVNRARMQDPVQYAAEQKQIAPLDMQTPEAFRAGLSSRG 558 Query: 437 GFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSF 496 ++ Y + F+++E +Q+ L PAS+S+ + + L GV+ S Sbjct: 559 SLAPELARHYGTPLAIFSQSEASQIGEMLRSAPASQSIAYL----DALRQGLGAGVQYSA 614 Query: 497 A--HIEDNGLSAVVR-----------------LSSEFSDDAKNAAMVILSGMKHQKDTET 537 A + SA V LS++ +AA I+ G + T Sbjct: 615 ALQQVSRYAPSAAVAGAIMGKRGNVIGNEGGWLSADLMVTPDDAAKTIIEGANARAGVTT 674 Query: 538 RYN-TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEA-------IKLYLLGSMKD 589 N ++K+ + D+ + G F D E K Y G M Sbjct: 675 NVNGVENKTRGIEMPKDTDLRPEFV----DKVGKAFAGDAEGAAQAYDVAKDYYAGVMAR 730 Query: 590 SGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATK-----DT 642 GN + ++ + A+ + G + ++ P GMS++ F + A K D Sbjct: 731 KGNVSGEYDKDTWTQAINVATGGVYDYGGQGDVLLPWGMSESQFNKAVGVAWKKQVLDDG 790 Query: 643 GEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIINVE 682 + P + G +Y++ G + K+G+ ++++++ Sbjct: 791 IKPPPGRYGLQSHGDSQYLVKLGGGYLLGKDGRPVVLHID 830 >gi|218700979|ref|YP_002408608.1| hypothetical protein ECIAI39_2669 [Escherichia coli IAI39] gi|218370965|emb|CAR18792.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] Length = 837 Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust. Identities = 62/302 (20%), Positives = 118/302 (39%), Gaps = 44/302 (14%) Query: 261 GLAGWDRLDDTKRRRLLEH---LSSRDNA-----LNTKLRKETQAQARRIDAQLNHGITV 312 G+A +D L ++ + +L L+ + A L++++R T A R I+ Sbjct: 393 GIAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGIE------FPN 446 Query: 313 KPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYI 372 P E +D+ AYGV +G Y + +A + + M T +++++ + Sbjct: 447 PPGE----DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-- 500 Query: 373 SNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSL 432 AT Y ++ ++ Q +++ D + + + Q P+ +++F S+ Sbjct: 501 -TGDTGEGYAARATLYDNVVSAAN-QVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSV 558 Query: 433 RQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG- 491 RA V + Y + F+K E NQ+ P S+ + +T+ S G Sbjct: 559 ALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYL----DTIRQSTGGGQ 614 Query: 492 -VRSSFAHIEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKD 534 S+ I N SA V + + S + AA IL+G +K Sbjct: 615 VYMSALQQISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKG 674 Query: 535 TE 536 T+ Sbjct: 675 TD 676 >gi|323948674|gb|EGB44579.1| hypothetical protein ERKG_04897 [Escherichia coli H252] Length = 837 Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust. Identities = 62/302 (20%), Positives = 118/302 (39%), Gaps = 44/302 (14%) Query: 261 GLAGWDRLDDTKRRRLLEH---LSSRDNA-----LNTKLRKETQAQARRIDAQLNHGITV 312 G+A +D L ++ + +L L+ + A L++++R T A R I+ Sbjct: 393 GVAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGIE------FPN 446 Query: 313 KPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYI 372 P E +D+ AYGV +G Y + +A + + M T +++++ + Sbjct: 447 PPGE----DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-- 500 Query: 373 SNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSL 432 AT Y ++ ++ Q +++ D + + + Q P+ +++F S+ Sbjct: 501 -TGDTGEGYAARATLYDNVVSAAN-QVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSV 558 Query: 433 RQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG- 491 RA V + Y + F+K E NQ+ P S+ + +T+ S G Sbjct: 559 ALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYL----DTIRQSTGGGQ 614 Query: 492 -VRSSFAHIEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKD 534 S+ I N SA V + + S + AA IL+G +K Sbjct: 615 VYMSALQQISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKG 674 Query: 535 TE 536 T+ Sbjct: 675 TD 676 >gi|117624701|ref|YP_853614.1| hypothetical protein APECO1_4052 [Escherichia coli APEC O1] gi|115513825|gb|ABJ01900.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 837 Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust. Identities = 62/302 (20%), Positives = 118/302 (39%), Gaps = 44/302 (14%) Query: 261 GLAGWDRLDDTKRRRLLEH---LSSRDNA-----LNTKLRKETQAQARRIDAQLNHGITV 312 G+A +D L ++ + +L L+ + A L++++R T A R I+ Sbjct: 393 GVAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGIE------FPN 446 Query: 313 KPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYI 372 P E +D+ AYGV +G Y + +A + + M T +++++ + Sbjct: 447 PPGE----DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-- 500 Query: 373 SNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSL 432 AT Y ++ ++ Q +++ D + + + Q P+ +++F S+ Sbjct: 501 -TGDTGEGYAARATLYDNVVSAAN-QVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSV 558 Query: 433 RQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG- 491 RA V + Y + F+K E NQ+ P S+ + +T+ S G Sbjct: 559 ALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYL----DTIRQSTGGGQ 614 Query: 492 -VRSSFAHIEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKD 534 S+ I N SA V + + S + AA IL+G +K Sbjct: 615 VYMSALQQISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKG 674 Query: 535 TE 536 T+ Sbjct: 675 TD 676 >gi|324008549|gb|EGB77768.1| hypothetical protein HMPREF9532_01736 [Escherichia coli MS 57-2] Length = 837 Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust. Identities = 60/302 (19%), Positives = 117/302 (38%), Gaps = 44/302 (14%) Query: 261 GLAGWDRLDDTKRRRLLEH---LSSRDNA-----LNTKLRKETQAQARRIDAQLNHGITV 312 G+A +D L ++ + +L L+ + A L++++R T A R ++ Sbjct: 393 GVAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGVE------FPN 446 Query: 313 KPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYI 372 P E +D+ AYGV +G Y + +A + + M T +++++ + Sbjct: 447 PPGE----DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-- 500 Query: 373 SNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSL 432 A Y ++ ++ Q +++ D + + + Q P+ +++F S+ Sbjct: 501 -TGETGEGYAARAALYDNVVSAAN-QVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSV 558 Query: 433 RQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG- 491 RA V + Y + F+K E NQ+ P S+ + +T+ S G Sbjct: 559 ALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYL----DTIRQSTGGGQ 614 Query: 492 -VRSSFAHIEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKD 534 S+ I N SA V + + S + AA IL+G +K Sbjct: 615 VYMSALQQISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKG 674 Query: 535 TE 536 T+ Sbjct: 675 TD 676 >gi|300898438|ref|ZP_07116779.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357905|gb|EFJ73775.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 837 Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust. Identities = 60/302 (19%), Positives = 117/302 (38%), Gaps = 44/302 (14%) Query: 261 GLAGWDRLDDTKRRRLLEH---LSSRDNA-----LNTKLRKETQAQARRIDAQLNHGITV 312 G+A +D L ++ + +L L+ + A L++++R T A R ++ Sbjct: 393 GVAWFDALSESDQASVLRQTDALARQQQAEYKTMLDSRVRDATAAYMRGVE------FPN 446 Query: 313 KPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYI 372 P E +D+ AYGV +G Y + +A + + M T +++++ + Sbjct: 447 PPGE----DDFIAAYGVREGNLRYTEFKNTQIAGQYIGSFRNMPTSSITAYVEQLRPD-- 500 Query: 373 SNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSL 432 A Y ++ ++ Q +++ D + + + Q P+ +++F S+ Sbjct: 501 -TGDTGEGYAARAALYDNVVSAAN-QVIKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSV 558 Query: 433 RQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG- 491 RA V + Y + F+K E NQ+ P S+ + +T+ S G Sbjct: 559 ALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFFRDAPVSQQAAYL----DTIRQSTGGGQ 614 Query: 492 -VRSSFAHIEDNGLSAVVR----------------LSSEFSDDAKNAAMVILSGMKHQKD 534 S+ I N SA V + + S + AA IL+G +K Sbjct: 615 VYMSALQQISANAPSAAVAGILMDKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKG 674 Query: 535 TE 536 T+ Sbjct: 675 TD 676 >gi|330007166|ref|ZP_08305908.1| hypothetical protein HMPREF9538_03597 [Klebsiella sp. MS 92-3] gi|328535513|gb|EGF61973.1| hypothetical protein HMPREF9538_03597 [Klebsiella sp. MS 92-3] Length = 838 Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust. Identities = 120/568 (21%), Positives = 222/568 (39%), Gaps = 92/568 (16%) Query: 190 LKQKLLSEAKEKLNT-NQIIG--AHARDPRVFENFMRAFYKKGH-PPKDSTSLSDVSDSA 245 L + LLS ++ +T ++I+ A A D E ++ A KK + P D ++SD+ ++ Sbjct: 288 LGKNLLSYQRQGYDTVSEIVNRWAPASDGNNTEAYIAALCKKLNVTPNDQLNMSDI-NTL 346 Query: 246 RERSLEVVE---------------DVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTK 290 R+ +++ VS A+GL + + + S + A + Sbjct: 347 RQLCAGIIQHENGKQPYSEDQLNTGVSAALGLTTLESPKRYSGNQAFDAASPQQQAAYLR 406 Query: 291 LRKETQAQAR-RIDAQLN-----------HGITV--KPSELIPLEDYTQAYGVEQGTELY 336 E + QAR + AQL GI PS+ D+ A+G +GT+ + Sbjct: 407 QSMELRNQARTQFKAQLVDQVQDATAAYLKGIQFDNPPSQ----GDFINAFGYREGTQRF 462 Query: 337 NLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSH 396 N + VA + + M T ++++ + ++ + L+ A + + + Sbjct: 463 NDFENLRVAGQYIGSFRTMPTASIQQYVSDLKNQVGNGEGLA----GRAAAFDHVQAAAQ 518 Query: 397 R---QSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHF 453 R Q + Q A+ G YK P+ T ++ A ++ R ++K +T Sbjct: 519 RIISQRQSDPFQSAVDIG-AYK---PISNSTPDAIASEVKNRYAAQDQLKA-IGITPPLL 573 Query: 454 NKTEENQLRTQLVKRP--ASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLS 511 +K +E+Q+ T V+ ++++ L++G TL V SS A A + L Sbjct: 574 SK-QESQVLTDAVRNSTDVNQAISLLQGLGRTLPPQALRSVASSIAPGSPGTAYAALLLG 632 Query: 512 ---------------SEFSD-----DAKNAAMVILSG------MKHQKDTETRYNTDHKS 545 S+F D + A +L+G K KD + Sbjct: 633 QQDNQYDNRSGIIPYSQFVSYKPTLDKYDVAKTVLAGDQMLNPTKAMKDAGISAVSIPSD 692 Query: 546 NKFDSLYDSYINTPLTKLEQS--TAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDA 603 K + +D + Q+ A GNF A+ Y G + + + A Sbjct: 693 EKLKTYFDKEVGNAFAYSAQARQVAWGNFRSAYAALA-YQSGDASKTNTVSPDSDIAEKA 751 Query: 604 MQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDT---GEYDPYTI---KYMNVGS 657 Q G +N S ++ P GM KT F DR A ++ +P ++ K +NVG+ Sbjct: 752 AQYATGGVYKGLNGSDVVMPFGMDKTTFRDRYTAAGREAMTQAGLNPSSLENFKAVNVGN 811 Query: 658 GKYMIIKNGNPKVDKEGKT---IIINVE 682 +Y ++ NG+ + + +T + + VE Sbjct: 812 NQYRLV-NGSGRWATDPRTNQPVTVRVE 838 >gi|320175034|gb|EFW50147.1| putative bacteriophage protein [Shigella dysenteriae CDC 74-1112] Length = 837 Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust. Identities = 54/275 (19%), Positives = 105/275 (38%), Gaps = 50/275 (18%) Query: 287 LNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA 346 +++K+R T A R ++ P E +D+ AYGV +G Y + +A Sbjct: 427 IDSKVRDATAAYMRGVE------FPNPPGE----DDFIAAYGVREGNLRYTEFRNTQIAG 476 Query: 347 PDVARIKLMSTFDAKKFLQKI-------DDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + + M T +++++ + Y S +L Y ++ ++ Q Sbjct: 477 QYIGSFRNMPTSSITAYVEQLRPDTGETGEGYASRAAL----------YDNVVSAAN-QV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA + + Y + F+K E N Sbjct: 526 IKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQISDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEG--VRSSFAHIEDNGLSAVVR-------- 509 Q+ P S+ + +T+ S G S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYL----DTIRQSTGGGQVYMSALQQISANAPSAAVAGILMDKPG 641 Query: 510 --------LSSEFSDDAKNAAMVILSGMKHQKDTE 536 + + S + AA IL+G +K T+ Sbjct: 642 GILAETNWFNPDVSVSPETAAQTILAGAAARKGTD 676 >gi|331648165|ref|ZP_08349255.1| hypothetical protein ECIG_04091 [Escherichia coli M605] gi|331043025|gb|EGI15165.1| hypothetical protein ECIG_04091 [Escherichia coli M605] Length = 837 Score = 42.0 bits (97), Expect = 0.37, Method: Compositional matrix adjust. Identities = 55/275 (20%), Positives = 105/275 (38%), Gaps = 40/275 (14%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E +D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGE----DDFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A Y ++ ++ Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD---TGETGEGYAARAALYDNVVSAAN-QV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQSDPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEG--VRSSFAHIEDNGLSAVVR-------- 509 Q+ P S+ + +T+ S G S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYL----DTIRQSTGGGQVYMSALQQISANAPSAAVAGILMDKPG 641 Query: 510 --------LSSEFSDDAKNAAMVILSGMKHQKDTE 536 + + S + AA IL+G +K T+ Sbjct: 642 GILAEKNWFNPDVSVSPETAAQTILAGAAARKGTD 676 >gi|315121929|ref|YP_004062418.1| hypothetical protein CKC_00895 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122891|ref|YP_004063380.1| hypothetical protein CKC_05735 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495331|gb|ADR51930.1| hypothetical protein CKC_00895 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496293|gb|ADR52892.1| hypothetical protein CKC_05735 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 129 Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust. Identities = 28/87 (32%), Positives = 41/87 (47%) Query: 1 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR 60 M + + E + PNQDAVRD+V+P GL+ + L Q + R ++ A A T+ Sbjct: 1 MANLDLKQTASEPLAPNQDAVRDAVDPLGGLKTVAQGLDSLNQLYQERRSEHIKAQALTQ 60 Query: 61 YTELSFKAVQDFHDFTNSLDTRDSLQA 87 S DF + SL T +QA Sbjct: 61 DALHSADDHADFARYMTSLPTGVDVQA 87 >gi|332344343|gb|AEE57677.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 837 Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust. Identities = 55/275 (20%), Positives = 103/275 (37%), Gaps = 40/275 (14%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGE----ADFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A Y ++ + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD---TGETGEGYAARAALYDNVVSAAS-QV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEG--VRSSFAHIEDNGLSAVVR-------- 509 Q+ P S+ + +T+ S G S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYL----DTIRQSTGGGQVYMSALQQISANAPSAAVAGILMDKPG 641 Query: 510 --------LSSEFSDDAKNAAMVILSGMKHQKDTE 536 + + S + AA IL+G +K T+ Sbjct: 642 GILAEKNWFNPDVSVSPETAAQTILAGAAARKGTD 676 >gi|327252173|gb|EGE63845.1| prophage MuMc02, structural protein P5 [Escherichia coli STEC_7v] Length = 837 Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust. Identities = 58/279 (20%), Positives = 105/279 (37%), Gaps = 48/279 (17%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQG----TEL 335 +LR++ QA+ + ID+Q+ P E D+ AYGV +G TE Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGE----ADFIAAYGVREGKLRYTEF 469 Query: 336 YNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKS 395 N Q +A + + M T +++++ + A Y ++ + Sbjct: 470 RNTQ----IAGQYIGSFRNMPTSSITAYVEQLRPD---TGETGEGYAARAALYDNVVSAA 522 Query: 396 HRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNK 455 Q +++ D + + + Q P+ +++F S+ RA V + Y + F+K Sbjct: 523 S-QVIKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSK 581 Query: 456 TEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG--VRSSFAHIEDNGLSAVVR---- 509 E NQ+ P S+ + +T+ S G S+ I N SA V Sbjct: 582 DEANQIGVFFRDAPVSQQAAYL----DTIRQSTGGGQVYMSALQQISANAPSAAVAGILM 637 Query: 510 ------------LSSEFSDDAKNAAMVILSGMKHQKDTE 536 + + S + AA IL+G +K T+ Sbjct: 638 DKPGGILAEKNWFNPDVSVSPETAAQTILAGAAARKGTD 676 >gi|323156122|gb|EFZ42281.1| prophage MuMc02, structural protein P5 [Escherichia coli EPECa14] Length = 837 Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust. Identities = 55/275 (20%), Positives = 103/275 (37%), Gaps = 40/275 (14%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGE----ADFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A Y ++ + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD---TGETGEGYAARAALYDNVVSAAS-QV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEG--VRSSFAHIEDNGLSAVVR-------- 509 Q+ P S+ + +T+ S G S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYL----DTIRQSTGGGQVYMSALQQISANAPSAAVAGILMDKPG 641 Query: 510 --------LSSEFSDDAKNAAMVILSGMKHQKDTE 536 + + S + AA IL+G +K T+ Sbjct: 642 GILAEKNWFNPDVSVSPETAAQTILAGAAARKGTD 676 >gi|89152439|ref|YP_512272.1| hypothetical structural protein [Escherichia phage phiV10] gi|74055462|gb|AAZ95911.1| hypothetical structural protein [Escherichia phage phiV10] Length = 837 Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust. Identities = 55/275 (20%), Positives = 103/275 (37%), Gaps = 40/275 (14%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGE----ADFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A Y ++ + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD---TGETGEGYAARAALYDNVVSAAS-QV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEG--VRSSFAHIEDNGLSAVVR-------- 509 Q+ P S+ + +T+ S G S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYL----DTIRQSTGGGQVYMSALQQISANAPSAAVAGILMDKPG 641 Query: 510 --------LSSEFSDDAKNAAMVILSGMKHQKDTE 536 + + S + AA IL+G +K T+ Sbjct: 642 GILAEKNWFNPDVSVSPETAAQTILAGAAARKGTD 676 >gi|268589385|ref|ZP_06123606.1| hypothetical protein PROVRETT_05517 [Providencia rettgeri DSM 1131] gi|291315412|gb|EFE55865.1| hypothetical protein PROVRETT_05517 [Providencia rettgeri DSM 1131] Length = 832 Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust. Identities = 58/254 (22%), Positives = 107/254 (42%), Gaps = 37/254 (14%) Query: 236 TSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN--------AL 287 T LSDV S + + ++ G W L ++ +L++ + N L Sbjct: 377 TRLSDVDKSQYQPEIRK----AQGGGFPWWPMLTPVQQYQLIKQGEAAQNKQRQEYKSVL 432 Query: 288 NTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAP 347 + K+ K+ +A A R ++ N PS L+++ A+G G + Y Q Sbjct: 433 DGKV-KDAEALALRGESYSN-----PPS----LDEFKYAHGENDGIKEYEQFQKNMAMGA 482 Query: 348 DVARIKLMSTFDAKKFLQ--KIDDEYISNPSLSLASTMMATKYK--EILEKSHRQSMQEL 403 D+A ++ +S L+ K +Y + A YK + L ++ + Sbjct: 483 DIATVQYLSPQAQTALLESKKPKGDYDA-----------ANNYKRYDTLARAIDTVNKVR 531 Query: 404 NQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRT 463 D I + I +Q++ L F + ++F +SL QR+ V I +Y + F+ EE L Sbjct: 532 ASDPIQFSIDRQQVNQLDFSSIQNFTNSLTQRSSNVGDISKNYQTPLTVFSAQEELVLSQ 591 Query: 464 QLVKRPASESVDLI 477 + K PAS+ ++ + Sbjct: 592 LMEKAPASQKIEYL 605 >gi|50282929|ref|YP_052985.1| hypothetical protein VP2p17 [Vibrio phage VP2] Length = 694 Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust. Identities = 58/257 (22%), Positives = 101/257 (39%), Gaps = 29/257 (11%) Query: 390 EILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFAD---SLRQRAGFVKKIKDDY 446 +I K+H + LN D + +G + ID FD FAD SL +R V +K Y Sbjct: 378 DIYTKNHENAKSMLNSDPLMYGAQTGIIDLQPFD----FADPAASLVKRQNDVASMKSQY 433 Query: 447 --NLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGL 504 ++ + F K E + + P + + ++ LS + VRS + NG Sbjct: 434 GDSMVPAFFTKQEAMEFSRVVQDAPVKDQMRMLSIINGQLSKDNANAVRS---QLMGNGG 490 Query: 505 SAVVRLSSEFSDDAKNAAMVILSG---MKHQKDTETRYNTDHKSNKFDSLYDSYINTPLT 561 + S + A IL G +K + ++TD + ++Y +P T Sbjct: 491 GVLAVASDALDSNQPYVAEKILMGRNALKEVPEAFKSWDTDASPLVLSEIGNAYSASPAT 550 Query: 562 KLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNY--TLNRVRVSDAMQIVLGNTPVNINESM 619 + E+IK + G++ T+N R+ ++ V G + I S Sbjct: 551 ----------MGQVMESIKAAYAAQTVNEGDFSGTINSKRLKSVIKEVTGGL-IKIGSST 599 Query: 620 LMPP-RGMSKTDFEDRL 635 ++ P R M + F+ L Sbjct: 600 IVSPDRNMDENKFDSWL 616 >gi|50282962|ref|YP_053018.1| hypothetical protein VP5_gp16 [Vibrio phage VP5] Length = 694 Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust. Identities = 58/257 (22%), Positives = 101/257 (39%), Gaps = 29/257 (11%) Query: 390 EILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFAD---SLRQRAGFVKKIKDDY 446 +I K+H + LN D + +G + ID FD FAD SL +R V +K Y Sbjct: 378 DIYTKNHENAKSMLNSDPLMYGAQTGIIDLQPFD----FADPAASLVKRQNDVASMKSQY 433 Query: 447 --NLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGL 504 ++ + F K E + + P + + ++ LS + VRS + NG Sbjct: 434 GDSMVPAFFTKQEAMEFSRVVQDAPVKDQMRMLSIINGQLSKDNANAVRS---QLMGNGG 490 Query: 505 SAVVRLSSEFSDDAKNAAMVILSG---MKHQKDTETRYNTDHKSNKFDSLYDSYINTPLT 561 + S + A IL G +K + ++TD + ++Y +P T Sbjct: 491 GVLAVASDALDSNQPYVAEKILMGRNALKEVPEAFKSWDTDASPLVLSEIGNAYSASPAT 550 Query: 562 KLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNY--TLNRVRVSDAMQIVLGNTPVNINESM 619 + E+IK + G++ T+N R+ ++ V G + I S Sbjct: 551 ----------MGQVMESIKAAYAAQTVNEGDFSGTINSKRLKSVIKEVTGGL-IKIGSST 599 Query: 620 LMPP-RGMSKTDFEDRL 635 ++ P R M + F+ L Sbjct: 600 IVSPDRNMDENKFDSWL 616 >gi|298381707|ref|ZP_06991306.1| minor structural protein [Escherichia coli FVEC1302] gi|298279149|gb|EFI20663.1| minor structural protein [Escherichia coli FVEC1302] Length = 837 Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust. Identities = 55/275 (20%), Positives = 103/275 (37%), Gaps = 40/275 (14%) Query: 290 KLRKETQAQARR-IDAQLNHGITV---------KPSELIPLEDYTQAYGVEQGTELYNLQ 339 +LR++ QA+ + ID+Q+ P E D+ AYGV +G Y Sbjct: 414 QLRRQQQAEYKTMIDSQVRDATAAYMRGVEFPNPPGE----ADFIAAYGVREGNLRYTEF 469 Query: 340 QFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 + +A + + M T +++++ + A Y ++ + Q Sbjct: 470 RNTQIAGQYIGSFRNMPTSSITAYVEQLRPD---TGETGEGYAARAALYDNVVSAAS-QV 525 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 +++ D + + + Q P+ +++F S+ RA V + Y + F+K E N Sbjct: 526 IKQRQADPVQFSLAAGQAKPIDMSNKDNFGQSVALRATQVSDLAKSYGTPLTFFSKDEAN 585 Query: 460 QLRTQLVKRPASESVDLIRGAYNTLSDSDKEG--VRSSFAHIEDNGLSAVVR-------- 509 Q+ P S+ + +T+ S G S+ I N SA V Sbjct: 586 QIGVFFRDAPVSQQAAYL----DTIRQSTGGGQVYMSALQQISANAPSAAVAGILMDKPG 641 Query: 510 --------LSSEFSDDAKNAAMVILSGMKHQKDTE 536 + + S + AA IL+G +K T+ Sbjct: 642 GILAEKNWFNPDVSVSPETAAQTILAGAAARKGTD 676 >gi|269975302|gb|ACZ55526.1| primase [Staphylococcus phage SA1] Length = 326 Score = 38.1 bits (87), Expect = 5.6, Method: Compositional matrix adjust. Identities = 23/91 (25%), Positives = 45/91 (49%), Gaps = 22/91 (24%) Query: 177 ITDHINSLPIDLRLKQKLL--SEAKEKLNTNQIIGAHARDPRVFENFMRA---------- 224 T+H N+LP+D ++ L+ ++A+ L+ ++ G+ A R F+N RA Sbjct: 115 FTNHFNALPLDENQRRYLVVSTQAENHLDMERVYGSKAERSRFFKNVYRAIDNHVPALKK 174 Query: 225 ----------FYKKGHPPKDSTSLSDVSDSA 245 F KGH P+D+ + + ++D++ Sbjct: 175 WFLDWEISPEFDHKGHAPQDTEAFAIMADAS 205 >gi|116003915|ref|NP_001070317.1| CST complex subunit STN1 [Bos taurus] gi|122132203|sp|Q08DB2|STN1_BOVIN RecName: Full=CST complex subunit STN1; AltName: Full=Oligonucleotide/oligosaccharide-binding fold-containing protein 1; AltName: Full=Suppressor of cdc thirteen homolog gi|115305224|gb|AAI23848.1| Oligonucleotide/oligosaccharide-binding fold containing 1 [Bos taurus] gi|296472683|gb|DAA14798.1| oligonucleotide/oligosaccharide-binding fold containing 1 [Bos taurus] Length = 370 Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust. Identities = 51/197 (25%), Positives = 88/197 (44%), Gaps = 28/197 (14%) Query: 224 AFYKKGHPPKDSTSLSDVSDSARERSLEV---VEDVSKAIGLAGWDRLDDTKRRRLLEHL 280 F+ GHP K L V RE+ V+D + I W RL++TK Sbjct: 46 VFFYNGHPIKQVDILGTVI-GVREKDAFYSYGVDDSTGVINCICWKRLNNTKSSSATATP 104 Query: 281 SSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQ 340 S+R+ +L ++L+K + A+R A+L G ++I + + + + E+ E++ Sbjct: 105 SARELSLTSQLKKLQETIAQR--AKLEIG------DIIRVRGHIRMFRGER--EIHATTY 154 Query: 341 FK---SVAAPDVARIKLMSTFDAKKFLQKI------DDEYISNPS---LSLASTMMATKY 388 +K V +AR+ + K + Q +DE +SNP L + +++ K Sbjct: 155 YKVDDPVCNVQIARMLELPAIYRKVYDQPFHSPALKEDEALSNPGTLDLDSLTCLLSEKA 214 Query: 389 KEILEKSHRQSM--QEL 403 KE L ++ QS QEL Sbjct: 215 KEFLVENRVQSFYQQEL 231 >gi|328954399|ref|YP_004371733.1| Protein translocase subunit secA [Desulfobacca acetoxidans DSM 11109] gi|328454723|gb|AEB10552.1| Protein translocase subunit secA [Desulfobacca acetoxidans DSM 11109] Length = 840 Score = 37.4 bits (85), Expect = 9.1, Method: Compositional matrix adjust. Identities = 103/509 (20%), Positives = 203/509 (39%), Gaps = 89/509 (17%) Query: 32 RDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKI 91 R D L +A + EG+R +N T + + F+ Q T + DT +A + Sbjct: 329 RRYSDGLHQALEAKEGVRIENENQTLASITFQNYFRMYQKLAGMTGTADT----EAEEFK 384 Query: 92 KEY-VDGRIRSAYDRFLSSISHRDV-----RKKFQAQVEHDIRDYHTKGVDIQIGATQRA 145 K Y ++ + + R + + H D +KFQA VE +I+D H +G + +G T Sbjct: 385 KIYNLEVMVVPTHKRMI-RVDHPDAIYKSESEKFQAVVE-EIKDCHQQGQPVLVGTTSIE 442 Query: 146 QEDNLNMTVGLAAAHVLHDPSNENYFQR-----VQSITDHINSLPIDL--RLKQKLLSEA 198 + + L+ L A + H+ N + ++ Q+ + ++ ++ R +L Sbjct: 443 KSERLSRM--LKAQGIKHEVLNAKHHEKEAQIVAQAGQSGMVTIATNMAGRGTDIVLGTG 500 Query: 199 KEKLNTNQIIGAHARDPRVFENFMRAFY-KKGHPPKDSTSLSDVSDSARERSLEVVEDVS 257 IIG + R +N +R ++G P LS D R + ++++ Sbjct: 501 VVDRGGLHIIGTERHESRRIDNQLRGRSGRQGDPGSSRFYLSLEDDLLRIFGSDRIKNLM 560 Query: 258 KAIGLAGWDRLDDTKRRRLLE---------------HLSSRDNALNTKL------RKET- 295 +G+ ++ +E HL DN +N + R+E Sbjct: 561 GRLGMEDGQPIEHRMVSSAIERAQKRVEAHNFDIRKHLLEYDNVMNKQREVIYGKRREIL 620 Query: 296 -----QAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQG--------TELYNL--QQ 340 + + +++ A + G+ V+ ++ + + G+E+ EL L + Sbjct: 621 GGEDLEEEIQQMAADMVDGLLVQFTDPRTMPEDWDLKGLEEALWRQFGLHIELSRLSPEN 680 Query: 341 FKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSM 400 P++ ++++ F+ K+ Q I EY + + M+ T +K+ H +M Sbjct: 681 LDGEKLPELLHQQVLAAFEGKR--QAIGPEYFPSLQQQIMLQMVDTHWKD-----HLLAM 733 Query: 401 QELNQDAISWGIK-YKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEEN 459 L +D I G++ Y Q+DPLR +E Y++ + +EN Sbjct: 734 DHL-RDGI--GLRGYAQVDPLRAYQKEG------------------YDMFMEMMQRIQEN 772 Query: 460 QLRTQ-LVKRPASESVDLIRGAYNTLSDS 487 +RT L++ E + L++G +S S Sbjct: 773 TVRTIFLIRLRQPEEIALLQGRQTPMSYS 801 >gi|328883363|emb|CCA56602.1| hypothetical protein SVEN_3316 [Streptomyces venezuelae ATCC 10712] Length = 230 Score = 37.4 bits (85), Expect = 9.2, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 3/80 (3%) Query: 393 EKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSH 452 EK + +++ELN A+++ KQ+D L ++ FAD L+ A + KI+ + + Sbjct: 115 EKVQQDAIKELNATAVAYEGLKKQVDALDPTNQQKFADGLQGVADGLTKIE---KMDQNA 171 Query: 453 FNKTEENQLRTQLVKRPASE 472 +K EE +L + K+P + Sbjct: 172 LSKLEEGELGQAMAKQPGCQ 191 Searching..................................................done Results from round 2 >gi|254781204|ref|YP_003065617.1| hypothetical protein CLIBASIA_05555 [Candidatus Liberibacter asiaticus str. psy62] gi|254040881|gb|ACT57677.1| hypothetical protein CLIBASIA_05555 [Candidatus Liberibacter asiaticus str. psy62] gi|317120670|gb|ADV02493.1| hypothetical protein SC1_gp045 [Liberibacter phage SC1] gi|317120814|gb|ADV02635.1| hypothetical protein SC1_gp045 [Candidatus Liberibacter asiaticus] Length = 707 Score = 957 bits (2473), Expect = 0.0, Method: Composition-based stats. Identities = 707/707 (100%), Positives = 707/707 (100%) Query: 1 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR 60 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR Sbjct: 1 MIEITTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTR 60 Query: 61 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ 120 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ Sbjct: 61 YTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQ 120 Query: 121 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH 180 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH Sbjct: 121 AQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH 180 Query: 181 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD 240 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD Sbjct: 181 INSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSD 240 Query: 241 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR 300 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR Sbjct: 241 VSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQAR 300 Query: 301 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA Sbjct: 301 RIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDA 360 Query: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL Sbjct: 361 KKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPL 420 Query: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA Sbjct: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 Query: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN 540 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN Sbjct: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYN 540 Query: 541 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV 600 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV Sbjct: 541 TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV 600 Query: 601 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY 660 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY Sbjct: 601 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKYMNVGSGKY 660 Query: 661 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP 707 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP Sbjct: 661 MIIKNGNPKVDKEGKTIIINVEDVNRDERMESTIRHYEHQIFNEHAP 707 >gi|317120711|gb|ADV02533.1| hypothetical protein SC2_gp045 [Liberibacter phage SC2] gi|317120772|gb|ADV02593.1| hypothetical protein SC2_gp045 [Candidatus Liberibacter asiaticus] Length = 753 Score = 749 bits (1934), Expect = 0.0, Method: Composition-based stats. Identities = 134/719 (18%), Positives = 284/719 (39%), Gaps = 50/719 (6%) Query: 16 PNQDAVRDSVNPQAGLRDLGDALGKATQFL-EGIRRDNAFATANTRYTELSFKAVQDFHD 74 P ++ V+ +P GLRD+ + L ++T+ + E +RR +AF A++ + + + +A + F++ Sbjct: 16 PVKEHVQQRYSPDEGLRDIANKLNQSTKLIEETVRRQSAFK-ADSEFLQTTVEAEKVFNE 74 Query: 75 FTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKG 134 L + + ++ + I+ Y + + + +VR+ Q + + K Sbjct: 75 KLQGLQGGNEEEVKKNLESILSNDIKPLYGKMHEKLDYPEVRRHILQQSNKQMARFQLKA 134 Query: 135 VDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKL 194 + ++G + E+ + + +L D S+ NY ++VQ +D I+ LP++ K Sbjct: 135 NEYKLGFQVQRTEEGIREIERSVSNSLLLDGSDSNYREKVQFASDAIDKLPLNPTTKDAR 194 Query: 195 LSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVE 254 +E Q +P+VF +F++ +G D ++++D+ D+ + V + Sbjct: 195 RRTTRENAALAQAQRYMVDNPQVFSSFIQKREGQGSKIDDKSTIADIVDNTPLDTFSVND 254 Query: 255 DVSKAIG-LAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK 313 V GW L K+ +L+ +++ + R + + I + L GIT K Sbjct: 255 TVVDNTNPFEGWKDLSTQKKIAILKEVANASTTGKQEERSRVGTKIKNIASFLERGITPK 314 Query: 314 PSELIPL--EDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEY 371 + L ++ +G +G L + Q AP + +I LM + L K+ + Sbjct: 315 NIQDADLSEDNLRHLFGNTRGQMLSSQLQTLKEFAPKINQIFLMPNDQVETELTKLKPDV 374 Query: 372 ISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADS 431 + + + + K + + +E + I W I + + D + D Sbjct: 375 QNLVGAEFKNKLYSDGVKAVYKND-----KERKEHPIKWAITHGLTQEMP-DEPSKWGDF 428 Query: 432 LRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEG 491 R + +K Y +++ +K E +L L K + + V + L Sbjct: 429 FVHRREVSQTLKGFYGVSSPLISKEESRKLTQYLDKLESKDVVSHLEDYAERLGGLKNSV 488 Query: 492 VRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSL 551 + +E + V L DD + A +L G K++ D +FD+ Sbjct: 489 WIEAMQSLEHPVMGEVGLL---LKDDP-DVASDVLRGYKYRVRNRGENIRDITGERFDAW 544 Query: 552 YDSYINTPLTKLEQST---AGGNFNKDKEAIKLYLLGSMKDSGNYTLNRV---------- 598 + + S F++ + + +L G + Y+ + Sbjct: 545 FKERFREKYGNVFASIGDYGDAEFDRASKLVGYHLAGELLSDKTYSFHEAPTQSLKEWVL 604 Query: 599 ---------RVSDAMQIVLGNTPVNINE--SMLMPPRGMSKTDFEDRLWYATKDTGEYDP 647 + + + V+GN PV + + S L+PPRGMS DF ++ + E Sbjct: 605 RQPASQYDKNLQHSFEAVVGNIPVPMGDFKSPLIPPRGMSSEDFTNKFTAVAQQVFEESG 664 Query: 648 YT---IKYMNV-----GSGKYMIIKNGNPKVD-KEGKTIIINV--EDVNRDERMESTIR 695 + Y N+ GS +Y++ + K G ++I V + ++ ++ T++ Sbjct: 665 LSGKEYGYENIGPTSDGSAQYLVKFGREYIHNPKTGAPLVIEVSPYSIEHEQTIQKTLK 723 >gi|315122890|ref|YP_004063379.1| hypothetical protein CKC_05730 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496292|gb|ADR52891.1| hypothetical protein CKC_05730 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 583 Score = 565 bits (1456), Expect = e-159, Method: Composition-based stats. Identities = 146/561 (26%), Positives = 255/561 (45%), Gaps = 37/561 (6%) Query: 146 QEDNLNMTVGLAAAHVLHDPSNENY-FQRVQS--ITDHINSLPIDLRLKQKLLSEAKEKL 202 E ++ T A ++ P+NEN QR + + D P D K++ L + L Sbjct: 5 SEKHIGDTAVALALNIRLSPTNENLELQRNRGYDVIDKSLLTPEDKVKKKQALDKT---L 61 Query: 203 NTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGL 262 + +Q IG + +P + N Y KD S ++++A ++E+ D K G Sbjct: 62 HKSQTIGLLSNEPDIVGNLSSLVYGSPVAVKDGLSPDQIAENADGGTIEI--DEHKLDGK 119 Query: 263 AGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRI-------DAQLNHG-ITVKP 314 G+ +D R E L S + N K E Q+ +R+ + N G I+ Sbjct: 120 TGYTGIDSLSR----EDLKSLLDEHNRKTNAERQSGKKRVIETIKLRTTEANKGNISSDY 175 Query: 315 SELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISN 374 E+ + ++ Y + + K AP + ++ M+ + +F+ ++ + Sbjct: 176 DEVFSESNLSRYYQPADVESIITQAKLKKDIAPYIRVVETMTNEEYAEFVSTVNSRTVD- 234 Query: 375 PSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEES-FADSLR 433 L A + + L+ S++EL++D W + + + P E A S+ Sbjct: 235 --YDLNDRFKAQAFLKELQDKRVASLKELSKDPHGWQ-RSRGLVPPNLSLEAGQLASSVL 291 Query: 434 QRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNT--LSDSDKEG 491 +K + D+ + +E QL ++ A + V R ++ SD E Sbjct: 292 PIFDANEKTEKDHGVIVKGMGTDKERQLSEKIKGERAEDFVSYFRDEMTKEGVTKSDIEK 351 Query: 492 VRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSL 551 ++S ++D S++ RL+ S +A+ +A+ ++SG+KH+ D E + + K N L Sbjct: 352 IKSVVDGMKDKVTSSICRLAMSDSAEARASAIPVISGVKHRGDIELKLESS-KGNGVKKL 410 Query: 552 YDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNT 611 +++ IN + +L Q + N+ +D E IKLY++G+M +GNYTLN V DA++ V GNT Sbjct: 411 FNNLINKEIGQLYQGSEDANYKQDAEVIKLYIMGNMHKTGNYTLNGEVVRDAVKAVFGNT 470 Query: 612 PVNINESMLMPPRGMSKTDFEDRLWYATKD--TGEYD-------PYTIKYMNVGSGKYMI 662 +N S +MPPRGMS +F +RL T D G + P + Y + G GKY + Sbjct: 471 AYAVNGSYVMPPRGMSHYEFGNRLHGLTSDKLVGLFGDKSKDRYPESYGYQSEGDGKYSL 530 Query: 663 IKNGNPKVDKEGKTIIINVED 683 G K DK+G I+IN+ D Sbjct: 531 TVGGVYKKDKQGNPIVINIYD 551 >gi|315121928|ref|YP_004062417.1| hypothetical protein CKC_00890 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495330|gb|ADR51929.1| hypothetical protein CKC_00890 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 547 Score = 551 bits (1419), Expect = e-154, Method: Composition-based stats. Identities = 142/554 (25%), Positives = 249/554 (44%), Gaps = 37/554 (6%) Query: 146 QEDNLNMTVGLAAAHVLHDPSNENY-FQRVQS--ITDHINSLPIDLRLKQKLLSEAKEKL 202 E ++ T A ++ P+NEN QR + + D P D K++ L + L Sbjct: 5 SEKHIGDTAVALALNIRLSPTNENLELQRNRGYDVIDKSLLTPEDKVKKKQALDKT---L 61 Query: 203 NTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGL 262 + +Q IG + +P + N Y KD S ++++A ++E+ D K G Sbjct: 62 HKSQTIGLLSNEPDIVGNLSSLVYGSPVAVKDGLSPDQIAENADGGTIEI--DEHKLDGK 119 Query: 263 AGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRI-------DAQLNHG-ITVKP 314 G+ +D R E L S + N K E Q+ +R+ + N G I+ Sbjct: 120 TGYTGIDSLSR----EDLKSLLDEHNRKTNAERQSGKKRVIETIKLRTTEANKGNISSDY 175 Query: 315 SELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISN 374 E+ + ++ Y + + K AP + ++ M+ + +F+ ++ + Sbjct: 176 DEVFSESNLSRYYQPADVESIITQAKLKKDIAPYIRVVETMTNEEYAEFVSTVNSRTVD- 234 Query: 375 PSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEES-FADSLR 433 L A + + L+ S++EL++D W + + + P E A S+ Sbjct: 235 --YDLNDRFKAQAFLKELQDKRVASLKELSKDPHGWQ-RSRGLVPPNLSLEAGQLASSVL 291 Query: 434 QRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNT--LSDSDKEG 491 +K + D+ + +E QL ++ A + V R ++ SD E Sbjct: 292 PIFDANEKTEKDHGVIVKGMGTDKERQLSEKIKGERAEDFVSYFRDEMTKEGVTKSDIEK 351 Query: 492 VRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSL 551 ++S ++D S++ RL+ S +A+ +A+ ++SG+KH+ D E + + K N L Sbjct: 352 IKSVVDGMKDKVTSSICRLAMSDSAEARASAIPVISGVKHRGDIELKLESS-KGNGVKKL 410 Query: 552 YDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNT 611 +++ IN + +L Q + N+ +D E IKLY++G+M +GNYTLN V DA++ V GNT Sbjct: 411 FNNLINKEIGQLYQGSEDANYKQDAEVIKLYIMGNMHKTGNYTLNGEVVRDAVKAVFGNT 470 Query: 612 PVNINESMLMPPRGMSKTDFEDRLWYATKD--TGEYD-------PYTIKYMNVGSGKYMI 662 +N S +MPPRGMS +F +RL T D G + P + Y + G GKY + Sbjct: 471 AYAVNGSYVMPPRGMSHYEFGNRLHGLTSDKLVGLFGDKSKDRYPESYGYQSEGDGKYSL 530 Query: 663 IKNGNPKVDKEGKT 676 G K DK+G Sbjct: 531 TVGGVYKKDKQGHP 544 >gi|85059171|ref|YP_454873.1| hypothetical protein SG1193 [Sodalis glossinidius str. 'morsitans'] gi|84779691|dbj|BAE74468.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 843 Score = 375 bits (962), Expect = e-101, Method: Composition-based stats. Identities = 92/458 (20%), Positives = 171/458 (37%), Gaps = 59/458 (12%) Query: 263 AGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLED 322 +LD + RL + N + R E + + + A G ++ Sbjct: 394 TAVTQLDPVQLARLRSVAQGQLNQQQREYRVELEGELKDFTAAALRG--KNYPREFTADE 451 Query: 323 YTQAYGVEQGTELY----NLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 +T+AYG + G + Y + QQ S D++ ++ +S + L + P Sbjct: 452 FTRAYGYDAGQKAYANYLDTQQLGS----DISTVQQLSPVGQQALLNAREPV----PGEG 503 Query: 379 LASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGF 438 A A K E L+K+ + QD + + + KQI PL T E+F L R Sbjct: 504 YAE---AAKRHETLQKAVEYVNRARMQDPVQYAAEQKQIAPLDMQTPEAFRAGLSSRGSL 560 Query: 439 VKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSF-- 496 ++ Y + F+++E +Q+ L PAS+S+ + + L GV+ S Sbjct: 561 APELARHYGTPLAIFSQSEASQIGEMLRSAPASQSIAYL----DALRQGLGAGVQYSAAL 616 Query: 497 AHIEDNGLSAVVR-----------------LSSEFSDDAKNAAMVILSGMKHQKDTETRY 539 + SA V LS++ +AA I+ G + T Sbjct: 617 QQVSRYAPSAAVAGAIMGKRGNVIGNEGGWLSADLMVTPDDAAKTIIEGANARAGVTTNV 676 Query: 540 N-TDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEA-------IKLYLLGSMKDSG 591 N ++K+ + D+ + G F D E K Y G M G Sbjct: 677 NGVENKTRGIEMPKDTDLRPEFV----DKVGKAFAGDAEGAAQAYDVAKDYYAGVMARKG 732 Query: 592 NYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATK-----DTGE 644 N + ++ + A+ + G + ++ P GMS++ F + A K D + Sbjct: 733 NVSGEYDKDTWTQAINVATGGVYDYGGQGDVLLPWGMSESQFNKAVGVAWKKQVLDDGIK 792 Query: 645 YDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIINVE 682 P + G +Y++ G + K+G+ ++++++ Sbjct: 793 PPPGRYGLQSHGDSQYLVKLGGGYLLGKDGRPVVLHID 830 >gi|218700979|ref|YP_002408608.1| hypothetical protein ECIAI39_2669 [Escherichia coli IAI39] gi|218370965|emb|CAR18792.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] Length = 837 Score = 217 bits (551), Expect = 7e-54, Method: Composition-based stats. Identities = 87/542 (16%), Positives = 184/542 (33%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA ++ + G+A +D L ++ + +L + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSQLPTKNKRYTGIAWFDALSESDQASVLRQTDALAR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + ++ R A GI +D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMLDSRVRDATAAYMRGIEFPNPP--GEDDFIAAYGVREGNLRYTEFKNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDTGDT-GEGYAAR---ATLYDNVVSAANQVIKQRQS 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDVWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 N+ Sbjct: 823 NL 824 >gi|323948674|gb|EGB44579.1| hypothetical protein ERKG_04897 [Escherichia coli H252] Length = 837 Score = 216 bits (550), Expect = 9e-54, Method: Composition-based stats. Identities = 87/542 (16%), Positives = 184/542 (33%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA ++ + G+A +D L ++ + +L + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSQLPTKNKRYTGVAWFDALSESDQASVLRQTDALAR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + ++ R A GI +D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMLDSRVRDATAAYMRGIEFPNPP--GEDDFIAAYGVREGNLRYTEFKNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDTGDT-GEGYAAR---ATLYDNVVSAANQVIKQRQS 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDVWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 N+ Sbjct: 823 NL 824 >gi|117624701|ref|YP_853614.1| hypothetical protein APECO1_4052 [Escherichia coli APEC O1] gi|115513825|gb|ABJ01900.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 837 Score = 216 bits (550), Expect = 9e-54, Method: Composition-based stats. Identities = 87/542 (16%), Positives = 184/542 (33%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA ++ + G+A +D L ++ + +L + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSQLPTKNKRYTGVAWFDALSESDQASVLRQTDALAR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + ++ R A GI +D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMLDSRVRDATAAYMRGIEFPNPP--GEDDFIAAYGVREGNLRYTEFKNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDTGDT-GEGYAAR---ATLYDNVVSAANQVIKQRQS 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDVWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 N+ Sbjct: 823 NL 824 >gi|324008549|gb|EGB77768.1| hypothetical protein HMPREF9532_01736 [Escherichia coli MS 57-2] Length = 837 Score = 216 bits (549), Expect = 1e-53, Method: Composition-based stats. Identities = 86/542 (15%), Positives = 184/542 (33%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA ++ + G+A +D L ++ + +L + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSQLPTKNKRYTGVAWFDALSESDQASVLRQTDALAR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + ++ R A G+ +D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMLDSRVRDATAAYMRGVEFPNPP--GEDDFIAAYGVREGNLRYTEFKNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDT-GETGEGYAAR---AALYDNVVSAANQVIKQRQS 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDVWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 N+ Sbjct: 823 NL 824 >gi|331648165|ref|ZP_08349255.1| hypothetical protein ECIG_04091 [Escherichia coli M605] gi|331043025|gb|EGI15165.1| hypothetical protein ECIG_04091 [Escherichia coli M605] Length = 837 Score = 213 bits (543), Expect = 7e-53, Method: Composition-based stats. Identities = 85/542 (15%), Positives = 178/542 (32%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADDPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA + + + G A +D + + Sbjct: 360 ENGNQPYSDQQLTAGV--SAALGLSTIPTNTKRYTGNAAFDAASPEAQASFMRQADQLRR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + +Q R A G+ +D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMIDSQVRDATAAYMRGVEFPNPP--GEDDFIAAYGVREGNLRYTEFRNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDT-GETGEGYAAR---AALYDNVVSAANQVIKQRQS 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDVWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQINQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 N+ Sbjct: 823 NL 824 >gi|215487810|ref|YP_002330241.1| hypothetical protein E2348C_2743 [Escherichia coli O127:H6 str. E2348/69] gi|215265882|emb|CAS10291.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 836 Score = 213 bits (541), Expect = 1e-52, Method: Composition-based stats. Identities = 87/541 (16%), Positives = 173/541 (31%), Gaps = 46/541 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--Y 226 Q + ++ D IN P D + +L A +P + + + Sbjct: 299 RQGIDTVNDIINRWAPPSDNNNTDAYIQAVCAQLGVTPDQPLDASNPDTLKALCASIIQH 358 Query: 227 KKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNA 286 + G P L+ SA ++ + G A +D + L Sbjct: 359 ENGSQPYSDQQLATGV-SAAIGLSQLPTSTKRYTGNAAFDAASPEAQATFLRQADQIRKQ 417 Query: 287 LNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA 346 + R ++ R A G+ D+ AYGV +G Y + +A Sbjct: 418 QQAEYRTNIDSRVRDASAAYMRGV--DFPNAPTQNDFLAAYGVREGNLRYTEFRNTQIAG 475 Query: 347 PDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD 406 + + M T + ++ + + A+ A Y ++ + + + D Sbjct: 476 QYIGSFRNMPTSSIQAAVENLKPDT-GETGEGYAAR--AQTYDAVV-SAASTVLAQRKAD 531 Query: 407 AISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLV 466 I + + Q P+ + +F ++ RA + Y + F+K E NQ+ Sbjct: 532 PIQFSLSSGQTKPIDMTNQNNFGQTISLRASQAVDLAKSYGTPLTFFSKEEANQIGAFFR 591 Query: 467 KRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR----------------L 510 P S+ + + + S+ I N SA V Sbjct: 592 DAPVSQQTAYLDTIRQSTGGG--QVYMSALQQISTNAPSAAVAGILMDKPGGVVAEKNWF 649 Query: 511 SSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGG 570 + + S + AA ILSG +K T+ + D+ + + + + G Sbjct: 650 NPDVSVSPETAAQTILSGAAARKGTD-------DAKGIPMPKDNDLRLEFSDMVKDAFAG 702 Query: 571 NFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPRG 625 + E K Y G M G + ++ A+ + G ++ P G Sbjct: 703 DAQGASMAYEIAKDYYAGVMAKKGVVSGEVDSDTWKQAVNVATGGVHDYNGMGSVLLPWG 762 Query: 626 MSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIIN 680 MS F+ ++ A K + P + G +Y++ + +G ++I+ Sbjct: 763 MSAEQFDKQVDQAWKTQVTGAGIKAPPGQYGLQSYGDSQYLVKLGTGYLLKDDGTPVVID 822 Query: 681 V 681 + Sbjct: 823 L 823 >gi|320175034|gb|EFW50147.1| putative bacteriophage protein [Shigella dysenteriae CDC 74-1112] Length = 837 Score = 212 bits (540), Expect = 2e-52, Method: Composition-based stats. Identities = 84/542 (15%), Positives = 178/542 (32%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADDPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA + + + G A +D + + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSTIPTNTKRYTGNAAFDAASPEAQASFMRQADQLRR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + ++ R A G+ +D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTIIDSKVRDATAAYMRGVEFPNPP--GEDDFIAAYGVREGNLRYTEFRNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + AS + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDT-GETGEGYASR---AALYDNVVSAANQVIKQRQS 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA + + Y + F+K E NQ+ Sbjct: 532 DPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQISDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAETNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVISGEIDTDIWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGTGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 N+ Sbjct: 823 NL 824 >gi|300898438|ref|ZP_07116779.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357905|gb|EFJ73775.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 837 Score = 212 bits (539), Expect = 2e-52, Method: Composition-based stats. Identities = 84/542 (15%), Positives = 183/542 (33%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA ++ + G+A +D L ++ + +L + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSQLPTKNKRYTGVAWFDALSESDQASVLRQTDALAR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + ++ R A G+ +D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMLDSRVRDATAAYMRGVEFPNPP--GEDDFIAAYGVREGNLRYTEFKNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDTGDT-GEGYAAR---AALYDNVVSAANQVIKQRQS 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLASGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRIEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDIWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGTGIKTPLGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 ++ Sbjct: 823 DL 824 >gi|332344343|gb|AEE57677.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 837 Score = 212 bits (539), Expect = 2e-52, Method: Composition-based stats. Identities = 85/542 (15%), Positives = 177/542 (32%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA + + + G A +D + + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSTIPTNTKRYTGNAAFDAASPEAQASFMRQADQLRR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + +Q R A G+ D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMIDSQVRDATAAYMRGVEFPNPP--GEADFIAAYGVREGNLRYTEFRNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDT-GETGEGYAAR---AALYDNVVSAASQVIKQRQA 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRFEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDVWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 N+ Sbjct: 823 NL 824 >gi|298381707|ref|ZP_06991306.1| minor structural protein [Escherichia coli FVEC1302] gi|298279149|gb|EFI20663.1| minor structural protein [Escherichia coli FVEC1302] Length = 837 Score = 212 bits (538), Expect = 2e-52, Method: Composition-based stats. Identities = 85/542 (15%), Positives = 177/542 (32%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA + + + G A +D + + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSTIPTNTKRYTGNAAFDAASPEAQASFMRQADQLRR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + +Q R A G+ D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMIDSQVRDATAAYMRGVEFPNPP--GEADFIAAYGVREGNLRYTEFRNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDT-GETGEGYAAR---AALYDNVVSAASQVIKQRQA 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLAAGQAKPIDMSNKDNFGQSVALRATQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDVWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGSGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 N+ Sbjct: 823 NL 824 >gi|309702801|emb|CBJ02132.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 836 Score = 211 bits (536), Expect = 4e-52, Method: Composition-based stats. Identities = 87/541 (16%), Positives = 175/541 (32%), Gaps = 46/541 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--Y 226 Q + ++ D IN P D + +L A +P + + + Sbjct: 299 RQGIDTVNDIINRWAPPSDNNNTDTYIQAVCAQLGVTPDQPLDASNPDTLKALCASIIQH 358 Query: 227 KKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNA 286 + G P L+ SA ++ + G A +D + L Sbjct: 359 ENGSQPYSDQQLATGV-SAAIGLSQLPTSTKRYTGNAAFDAASPEAQATFLRQADQIRKQ 417 Query: 287 LNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA 346 + R ++ R A G+ D+ AYGV +G Y + +A Sbjct: 418 QQAEYRTNIDSRVRDASAAYMRGV--DFPNAPTQNDFMAAYGVREGNLRYTEFRNTQIAG 475 Query: 347 PDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD 406 + + M T + ++ + + A+ A Y ++ + + + D Sbjct: 476 QYIGSFRNMPTSSIQAAVENLKPDT-GETGEGYAAR--AQTYDAVV-SAASTVLAQRKAD 531 Query: 407 AISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLV 466 I + + Q P+ + +F+ ++ RA ++ Y + F+K E NQ+ Sbjct: 532 PIQFSLSSGQTKPIDMTNQNNFSQTIALRASQAAELAKSYGTPLTFFSKDEANQIGAFFR 591 Query: 467 KRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR----------------L 510 P S+ + + + S+ I N SA V Sbjct: 592 DAPVSQQSAYLDTIRQSTGGG--QVYMSALQQISTNAPSAAVAGILMDKPGGVVAEKNWF 649 Query: 511 SSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGG 570 + + S ++AA ILSG +K T D+ + + + + G Sbjct: 650 NPDVSVSPESAAQTILSGAAARKGT-------GDVKGIPMPKDNDLRLEFSDMVKDAFAG 702 Query: 571 NFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPRG 625 + + E K Y G M G + ++ A+ + G ++ P G Sbjct: 703 DAHGASMAYEIAKDYYAGVMAKKGVVSGEIDSDTWKQAVNVATGGVHDYNGMGSVLLPWG 762 Query: 626 MSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIIIN 680 MS F+ ++ A K + P + G +Y++ + +G ++I+ Sbjct: 763 MSAEQFDKQVDQAWKTQVTDAGIKAPPGQYGLQSYGDSQYLVKLGTGYLLKDDGTPVVID 822 Query: 681 V 681 + Sbjct: 823 L 823 >gi|323156122|gb|EFZ42281.1| prophage MuMc02, structural protein P5 [Escherichia coli EPECa14] Length = 837 Score = 210 bits (534), Expect = 7e-52, Method: Composition-based stats. Identities = 84/542 (15%), Positives = 177/542 (32%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPADKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA + + + G A +D + + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSTIPTNTKRYTGNAAFDAASPEAQASFMRQADQLRR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + +Q R A G+ D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMIDSQVRDATAAYMRGVEFPNPP--GEADFIAAYGVREGNLRYTEFRNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDT-GETGEGYAAR---AALYDNVVSAASQVIKQRQA 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDIWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGTGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 ++ Sbjct: 823 DL 824 >gi|327252173|gb|EGE63845.1| prophage MuMc02, structural protein P5 [Escherichia coli STEC_7v] Length = 837 Score = 210 bits (534), Expect = 8e-52, Method: Composition-based stats. Identities = 84/542 (15%), Positives = 177/542 (32%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPTDKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA + + + G A +D + + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSTIPTNTKRYTGNAAFDAASPEAQASFMRQADQLRR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + +Q R A G+ D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMIDSQVRDATAAYMRGVEFPNPP--GEADFIAAYGVREGKLRYTEFRNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDT-GETGEGYAAR---AALYDNVVSAASQVIKQRQA 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDIWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGTGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 ++ Sbjct: 823 DL 824 >gi|89152439|ref|YP_512272.1| hypothetical structural protein [Escherichia phage phiV10] gi|74055462|gb|AAZ95911.1| hypothetical structural protein [Escherichia phage phiV10] Length = 837 Score = 210 bits (533), Expect = 9e-52, Method: Composition-based stats. Identities = 84/542 (15%), Positives = 177/542 (32%), Gaps = 48/542 (8%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--- 225 Q + ++++ IN P D + E+L + A +P + A Sbjct: 300 RQGIDTVSEIINRWAPPADKNDTMSYIKAVCEQLGVSADEPLDASNPDTLKALCAAIIHH 359 Query: 226 YKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDN 285 P D + V SA + + + G A +D + + Sbjct: 360 ENGSQPYSDQQLTAGV--SAALGLSTIPTNTKRYTGNAAFDAASPEAQASFMRQADQLRR 417 Query: 286 ALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 + + +Q R A G+ D+ AYGV +G Y + +A Sbjct: 418 QQQAEYKTMIDSQVRDATAAYMRGVEFPNPP--GEADFIAAYGVREGNLRYTEFRNTQIA 475 Query: 346 APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQ 405 + + M T +++++ + A+ + + + Q +++ Sbjct: 476 GQYIGSFRNMPTSSITAYVEQLRPDT-GETGEGYAAR---AALYDNVVSAASQVIKQRQA 531 Query: 406 DAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQL 465 D + + + Q P+ +++F S+ RA V + Y + F+K E NQ+ Sbjct: 532 DPVQFSLAAGQAKPIDMSNKDNFGQSVALRAAQVSDLAKSYGTPLTFFSKDEANQIGVFF 591 Query: 466 VKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVR---------------- 509 P S+ + + + S+ I N SA V Sbjct: 592 RDAPVSQQAAYLDTIRQSTGGG--QVYMSALQQISANAPSAAVAGILMDKPGGILAEKNW 649 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAG 569 + + S + AA IL+G +K T+ + D+ + + + + Sbjct: 650 FNPDVSVSPETAAQTILAGAAARKGTD-------DAKGIPMPKDADLRLEFSDMVKDAFA 702 Query: 570 GNFNKDK---EAIKLYLLGSMKDSGNYT--LNRVRVSDAMQIVLGNTPVNINESMLMPPR 624 G+ E K Y G M G + ++ A+ + G ++ P Sbjct: 703 GDAQGASMAYEIAKDYYAGVMAKKGVVSGEIDNDIWKQAVNVATGGVHDYNGMGNVLLPW 762 Query: 625 GMSKTDFEDRLWYATKD-----TGEYDPYTIKYMNVGSGKYMIIKNGNPKVDKEGKTIII 679 GMS F+ ++ A + + P + G +Y++ + +G +++ Sbjct: 763 GMSAEQFDKQVNQAWNEQVVGTGIKTPPGQYGLQSYGDSQYLVKLGTGYLLKDDGSPVVL 822 Query: 680 NV 681 ++ Sbjct: 823 DL 824 >gi|268589385|ref|ZP_06123606.1| hypothetical protein PROVRETT_05517 [Providencia rettgeri DSM 1131] gi|291315412|gb|EFE55865.1| hypothetical protein PROVRETT_05517 [Providencia rettgeri DSM 1131] Length = 832 Score = 184 bits (466), Expect = 6e-44, Method: Composition-based stats. Identities = 81/445 (18%), Positives = 160/445 (35%), Gaps = 44/445 (9%) Query: 261 GLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPL 320 G W L ++ +L++ + N + + + + +A G + S L Sbjct: 398 GFPWWPMLTPVQQYQLIKQGEAAQNKQRQEYKSVLDGKVKDAEALALRGES--YSNPPSL 455 Query: 321 EDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLA 380 +++ A+G G + Y Q D+A ++ +S L+ + + + + Sbjct: 456 DEFKYAHGENDGIKEYEQFQKNMAMGADIATVQYLSPQAQTALLESKKPKGDYDAANNY- 514 Query: 381 STMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVK 440 K + L ++ + D I + I +Q++ L F + ++F +SL QR+ V Sbjct: 515 ------KRYDTLARAIDTVNKVRASDPIQFSIDRQQVNQLDFSSIQNFTNSLTQRSSNVG 568 Query: 441 KIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLI----------RGAYNTLSDSDKE 490 I +Y + F+ EE L + K PAS+ ++ + L K Sbjct: 569 DISKNYQTPLTVFSAQEELVLSQLMEKAPASQKIEYLDAIRQGLKNNESYTAALRQVSKS 628 Query: 491 GVRSSFAHIEDNGLSAVV----RLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSN 546 + A + N + + + SS+ + AA +I+ G +K Sbjct: 629 DTSLAVAGVIMNKPTQLTVEYNKWSSDIVVTPQQAAQLIVQGSTARK----------SGK 678 Query: 547 KFDSLYDSYINTPL----TKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYT--LNRVRV 600 F D+ + + G N E K G M +GN ++ Sbjct: 679 DFVLPKDTELREEFADFVGDVFAGDIDGA-NSAYEVAKDAYAGMMAKNGNINGEYDKDIW 737 Query: 601 SDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATK----DTGEYDPYTIKYMNVG 656 A+ I G +M P GM F+ ++ K + G I ++G Sbjct: 738 QQAINIATGGIYDFNGRGKVMLPWGMDGDTFKKQVKSELKYHEPEIGNISVDIIGMQSLG 797 Query: 657 SGKYMIIKNGNPKVDKEGKTIIINV 681 +Y+I + + K+G +II++ Sbjct: 798 DSQYLIKRGSGYLLYKDGTPVIIDL 822 >gi|330007166|ref|ZP_08305908.1| hypothetical protein HMPREF9538_03597 [Klebsiella sp. MS 92-3] gi|328535513|gb|EGF61973.1| hypothetical protein HMPREF9538_03597 [Klebsiella sp. MS 92-3] Length = 838 Score = 109 bits (273), Expect = 1e-21, Method: Composition-based stats. Identities = 80/470 (17%), Positives = 162/470 (34%), Gaps = 46/470 (9%) Query: 250 LEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHG 309 L +E + G +D ++ L N T+ + + Q + A G Sbjct: 378 LTTLESPKRYSGNQAFDAASPQQQAAYLRQSMELRNQARTQFKAQLVDQVQDATAAYLKG 437 Query: 310 ITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDD 369 I D+ A+G +GT+ +N + VA + + M T ++++ + + Sbjct: 438 IQFDNPP--SQGDFINAFGYREGTQRFNDFENLRVAGQYIGSFRTMPTASIQQYVSDLKN 495 Query: 370 EYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFA 429 + + LA A + ++ + ++ + + D + P+ T ++ A Sbjct: 496 QVGN--GEGLAGRAAA---FDHVQAAAQRIISQRQSDPFQSAVDIGAYKPISNSTPDAIA 550 Query: 430 DSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLV-KRPASESVDLIRGAYNTLSDSD 488 ++ R ++K +T +K E L + ++++ L++G TL Sbjct: 551 SEVKNRYAAQDQLKA-IGITPPLLSKQESQVLTDAVRNSTDVNQAISLLQGLGRTLPPQA 609 Query: 489 KEGVRSSFAHIEDNGLSAVVRLSSEF--------------------SDDAKNAAMVILSG 528 V SS A A + L + + D + A +L+G Sbjct: 610 LRSVASSIAPGSPGTAYAALLLGQQDNQYDNRSGIIPYSQFVSYKPTLDKYDVAKTVLAG 669 Query: 529 MKHQKDTETRYNTDHKSNKF--DSLYDSYINTPLTKLEQSTAGG------NFNKDKEAIK 580 + T+ + + D +Y + + +A NF + A Sbjct: 670 DQMLNPTKAMKDAGISAVSIPSDEKLKTYFDKEVGNAFAYSAQARQVAWGNF-RSAYAAL 728 Query: 581 LYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFEDRLWYATK 640 Y G + + + A Q G +N S ++ P GM KT F DR A + Sbjct: 729 AYQSGDASKTNTVSPDSDIAEKAAQYATGGVYKGLNGSDVVMPFGMDKTTFRDRYTAAGR 788 Query: 641 DTGEYDP------YTIKYMNVGSGKYMIIKN-GNPKVD-KEGKTIIINVE 682 + K +NVG+ +Y ++ G D + + + + VE Sbjct: 789 EAMTQAGLNPSSLENFKAVNVGNNQYRLVNGSGRWATDPRTNQPVTVRVE 838 >gi|301028420|ref|ZP_07191666.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299878531|gb|EFI86742.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 843 Score = 102 bits (253), Expect = 3e-19, Method: Composition-based stats. Identities = 82/539 (15%), Positives = 178/539 (33%), Gaps = 56/539 (10%) Query: 187 DLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--YKKGHPPKDSTSLSDVSDS 244 D + +L A +P ++ +A +++G P L+ S Sbjct: 318 DDNKTDAYIRAVCAQLGVTPDQPLDASNPDTLKSLCKAIIHHEQGSQPYSDQQLAAGV-S 376 Query: 245 ARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDA 304 A ++ + G+A +D L ++ + +L + + R +++ R A Sbjct: 377 AALGLSQLPTKNKRYTGVAWFDALSESDQASVLRQTDALARQQQAEYRTMLESRVRDATA 436 Query: 305 QLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFL 364 G+ ++ AYGV +G Y + +A + + M T ++ Sbjct: 437 AYMRGVEFPNPP--GEGEFIAAYGVREGNLRYTEFRNLQIAGQYIGSFRNMPTSSITAYV 494 Query: 365 QKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDT 424 +++ + AS + + + + + + + + S ++ P+ + Sbjct: 495 EQLRPDT-GETGEGYASR---AELFDRVSAAATKGITQRQNNPFSAAVEIGAYKPITSNN 550 Query: 425 EESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRP-ASESVDLIRGAYNT 483 + + R + + + N E L Q+ +++ L++ Sbjct: 551 PDDITAEVANRFSSQESL-RALGINAPLLNSEEAAALAQQVRGTQNVDQTIKLLQSMGEK 609 Query: 484 LSDSDKEGVRSSFAHIEDNGLSAVVRLSSEF--------------------SDDAKNAAM 523 L V S A + + L + + + + A Sbjct: 610 LPAPAIRQVASVIAPNSAATAYSALLLGTPDNQYDNTRPTIPYSQFIGYKPTMNKYDVAK 669 Query: 524 VILSG---------MKHQKDTETRYNTDHK-SNKFDSLYDSYI--NTPLTKLEQSTAGGN 571 VILSG MK+ T + ++ K + FD + N L ++ S Sbjct: 670 VILSGDQLLNPTKAMKNAGITPVQLPSEDKMKDAFDEQVGNAFARNPQLRQISYSLFKAA 729 Query: 572 FNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDF 631 + Y G + V+DA++ G N ++ P GM K+ F Sbjct: 730 YAGIA-----YQSGDASTVRTSMPDTDIVADAVKYATGGIYNGFNGGDVVMPFGMDKSTF 784 Query: 632 EDRLWYATKDTGEYDPYTIKY------MNVGSGKYMIIKN-GNPKVD-KEGKTIIINVE 682 +DR + K + + +N+G+ +Y ++ G D + K I++ VE Sbjct: 785 KDRYTTSAKQALKDAGLNVNAVSNFTPVNIGNNRYRLVNGSGRWATDPRTNKAIVVRVE 843 >gi|30387394|ref|NP_848223.1| hypothetical protein epsilon15p15 [Enterobacteria phage epsilon15] gi|30266049|gb|AAO06078.1| 15 [Salmonella phage epsilon15] Length = 842 Score = 96.1 bits (237), Expect = 2e-17, Method: Composition-based stats. Identities = 84/554 (15%), Positives = 177/554 (31%), Gaps = 52/554 (9%) Query: 171 FQRVQSITDHINSL--PIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF--Y 226 Q + ++ D IN P D + +L A +P + + + Sbjct: 299 RQGIDTVNDIINRWAPPSDNNNTDAYIQAVCAQLGVTPDQPLDASNPDTLKALCASIIQH 358 Query: 227 KKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNA 286 + G P L+ SA ++ + G A +D + L Sbjct: 359 ENGSQPYSDQQLATGV-SAAIGLSQLPTSTKRYTGNAAFDAASPEAQATFLRQADQIRKQ 417 Query: 287 LNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAA 346 + R ++ R A G+ D+ AYGV +G Y + +A Sbjct: 418 QQAEYRTNIDSRVRDASAAYMRGV--DFPNAPTQTDFLAAYGVREGNLRYTEFRNTQIAG 475 Query: 347 PDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQD 406 + + M T +++++ AS + + + + + + + + Sbjct: 476 QYIGSFRNMPTSSITAYVEQLRPGTEET-GEGYASR---AQLFDQVTAAATKVITQRQNN 531 Query: 407 AISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLV 466 + ++ P+ + + + R + + + + E L Q+ Sbjct: 532 PFNAAVEIGAYKPIASNNQNDITSEVANRFSSQESL-RALGINAPILSSEEAAALSEQVR 590 Query: 467 K-RPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVI 525 + ++++ L++ TL V S+ A +N +A L D+ + Sbjct: 591 GTKDVNQTISLLQSMGKTLPAPAMRQVASAIA--PNNAATAYSALLLGTPDNQYDNTKPS 648 Query: 526 LSGMKHQKDTETRYNTDHKS---------NKFDSLYDSYINT-------PLTKLEQSTAG 569 ++ + T D N ++ D+ I L + G Sbjct: 649 IAYSQFIGYKPTMNKYDVSKVILAGDQLLNPTKAMKDAGITPVQLPSEDKLKRAFDDQVG 708 Query: 570 GNFNKDKEA-------IKLYLLGSMKDSGNYTL------NRVRVSDAMQIVLGNTPVNIN 616 +F + +A K G SG+ ++ N V A Q G N Sbjct: 709 NSFANNPQARQLSYNLFKAAYAGIAYQSGDASMTKTDAANSDVVEKAAQYATGGVYKGFN 768 Query: 617 ESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYTIKY------MNVGSGKYMIIKN-GNPK 669 ++ P GM K+ F+DR + + + + +N+G+ +Y ++ G Sbjct: 769 GGDVVMPFGMDKSTFKDRYTASAQQALKDAGLNVNAVSNFTPVNIGNNQYRLVSGSGRWA 828 Query: 670 VD-KEGKTIIINVE 682 D K + I++ VE Sbjct: 829 TDPKTNEAIVVRVE 842 >gi|304398392|ref|ZP_07380266.1| conserved hypothetical protein [Pantoea sp. aB] gi|304354258|gb|EFM18631.1| conserved hypothetical protein [Pantoea sp. aB] Length = 835 Score = 93.4 bits (230), Expect = 1e-16, Method: Composition-based stats. Identities = 75/477 (15%), Positives = 142/477 (29%), Gaps = 60/477 (12%) Query: 250 LEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHG 309 L ++ + G +D + L S T+L+ + A G Sbjct: 375 LTALDSPKRYTGNVAFDAASPQAQASYLRQAKSLQGEARTQLKAQLTDVISDAKASYLKG 434 Query: 310 ITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDD 369 + + AYG +G + + + + VA + + M + ++ + Sbjct: 435 V--EYPNPPSQAQLISAYGYREGNQRFADLENQRVAGQYIGSFRNMPSSSITNYVADLK- 491 Query: 370 EYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFA 429 A A Y + E + +Q + + + P+ Sbjct: 492 -TQLGSGEGFAGR--ADAYDHV-EAAAKQVISLRESNPYQAAMDMGAYKPIASTNPADIT 547 Query: 430 DSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKR-PASESVDLIRGAYNTLSDSD 488 ++ R ++K + +K E + ++ ++S+ L++ D Sbjct: 548 SEIKSRTAATDQLKS-LGINAPILSKEEAASISERVRGTTDVNQSISLLQSFG---RDLQ 603 Query: 489 KEGVRSSFAHIE-DNGLSAVVRLSSEFSDDAKN----------------------AAMVI 525 + +RS A I D+ +A L D+ N A I Sbjct: 604 PQALRSVAASIAPDSAATAYSALILGTDDNQYNNRSPSIPYSQFVSYKPTMNKYEVAKTI 663 Query: 526 LSG------MKHQKDTETRYNTDHKSNKFDSLYDSYI------NTPLTKLEQSTAGGNFN 573 L G K QKD +K +D I N ++ S + Sbjct: 664 LQGDQLINPTKAQKDAGISAVKLPADDKLKQTFDDEIGNAFSHNPQARQMAWSIYKSAYA 723 Query: 574 KDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFED 633 Y G ++ A+Q+ G N ++ P GM K+ F+D Sbjct: 724 GLA-----YTSGDSDGVNTKAVDSDIAEKAIQMATGGVIKGFNGGDVVMPFGMDKSTFKD 778 Query: 634 RLWYATKDTGEYDPYTIKYM------NVGSGKYMIIKN-GNPKVD-KEGKTIIINVE 682 R A + + NVG +Y ++ G D K G I + V+ Sbjct: 779 RYTAAAGEALKSAGLNPAAQSNFIPVNVGDSQYRLVTGSGRWATDPKTGAPITVRVQ 835 Score = 46.8 bits (109), Expect = 0.011, Method: Composition-based stats. Identities = 111/688 (16%), Positives = 218/688 (31%), Gaps = 74/688 (10%) Query: 5 TTNKPLVEAVTPNQDAVRDSVNPQ---AGLRDLGDALGKATQFLEGIRRDNAFATANTRY 61 T+ + +++ P V NP AG ++G+ + A F + + N T + Sbjct: 6 TSQRQILDQGLPAPQ-VSVQTNPDTFGAGWGEVGNRI--AGIFAQEQHKANVAQTQDA-V 61 Query: 62 TELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQA 121 + A F++ + L T+ A + + + IR + + +R+ F Sbjct: 62 LQFQSFADDQFNNTDSGLYTKQGKNAVGQSEAVLSN-IRGKAEELSKQVP-EAMRRDFIQ 119 Query: 122 QVEHDIRDYHTKGVDIQIGATQRAQEDN----LNMTVGLAAAHVLHDPSNENYFQRVQSI 177 Q+ + Y + +IG R E+ L + AA +DP + + V+ Sbjct: 120 QINQAGQQYKRQASTYEIG-QVRQYEEGQFKALQESTVTAAQGQYNDP--QAFTSTVKQG 176 Query: 178 TDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTS 237 I+ + S AK +L GA + +N+M G P + Sbjct: 177 FTAIDQFADAHGWSDEERSNAKNQLKERSADGALSAAAN--QNYMDFIAANGEPGDYDGA 234 Query: 238 LSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKR------------RRLLEHLSSRDN 285 + + R L + G W+ + R L ++L S + Sbjct: 235 VRVSGSTGDARGLRNNNPGNIEAGQNNWEGQAGSDGRFAKFVTPEHGIRALGKNLLSYGD 294 Query: 286 AL--------NTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYN 337 N L ++VKP + + L D + G Sbjct: 295 KGFDTVNEIVNRWAPASDGNNTTAYVKALCEQLSVKPDDQLNLSDLNVLKKLSAG----- 349 Query: 338 LQQFKSVAAPDVARIKLMSTFDAKKFLQKIDD--EYISNPSLSLASTMMATKYK---EIL 392 + + ++ + P + +L + A L +D Y N + AS Y + L Sbjct: 350 IVKHENGSIPY-SDSQLDTGLRAALGLTALDSPKRYTGNVAFDAASPQAQASYLRQAKSL 408 Query: 393 EKSHRQSMQELNQDAISWGIKY--KQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTT 450 + R ++ D IS K ++ ++ + R G ++ D Sbjct: 409 QGEARTQLKAQLTDVISDAKASYLKGVEYPNPPSQAQLISAYGYREG-NQRFAD------ 461 Query: 451 SHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDK-EGVRSSFAHIEDNGLSAVVR 509 N+ Q P+S + + L + G ++ H+E +A + Sbjct: 462 -LENQRVAGQYIGSFRNMPSSSITNYVADLKTQLGSGEGFAGRADAYDHVE----AAAKQ 516 Query: 510 LSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSN--KFDSLYDSYINTPLTKLEQST 567 + S + AAM + + ++ KS D L IN P+ E++ Sbjct: 517 VISLRESNPYQAAMDMGAYKPIASTNPADITSEIKSRTAATDQLKSLGINAPILSKEEAA 576 Query: 568 AGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRVSDAMQIVLGNT-PVNINESMLMPPRGM 626 + + + + S+ + R A++ V + P + + G Sbjct: 577 SISERVRGTTDVNQSI--SLLQ----SFGRDLQPQALRSVAASIAPDSAATAYSALILGT 630 Query: 627 SKTDFEDRLWYA-TKDTGEYDPYTIKYM 653 + +R Y P KY Sbjct: 631 DDNQYNNRSPSIPYSQFVSYKPTMNKYE 658 >gi|319793415|ref|YP_004155055.1| hypothetical protein Varpa_2746 [Variovorax paradoxus EPS] gi|315595878|gb|ADU36944.1| hypothetical protein Varpa_2746 [Variovorax paradoxus EPS] Length = 732 Score = 83.7 bits (205), Expect = 9e-14, Method: Composition-based stats. Identities = 86/587 (14%), Positives = 182/587 (31%), Gaps = 60/587 (10%) Query: 142 TQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEK 201 L A V+ + D + + + L E Sbjct: 132 QYSRGAQRLRFQATAAVNQVIDTAQRSELKSTFTAGLDRLGKIALLP---DASLEEVLAN 188 Query: 202 LNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIG 261 + + G A P +K + + S V+ SL+ + + Sbjct: 189 ADKVALSGRAAGIPEADVAKQVQTFKDNTRFQRAQS-DLVAARRDMPSLDAFQQRLETGD 247 Query: 262 LAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSEL---- 317 L+G LD KR LL+ + L + + + Q + + +T ++ Sbjct: 248 LSG--TLDPDKRIALLKE----TDTLKFQAQNQQQHALDKREKLAERAVTNATRQIETGL 301 Query: 318 -IPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPS 376 + + +T +GT D ++ M D ++++Q + E Sbjct: 302 PLTADGWTGLRTTVEGTPYAEDFNRLVKQERDTQQVLRMPMADQQRYIQTREAELQQAGG 361 Query: 377 LSLASTMMATKYKEILEKSHRQSMQELNQDAIS-WGIKYKQIDPLRFDTEESFADS---L 432 +LA + K +E + +Q E A +G + +DP + Sbjct: 362 -TLADRANLQRIKATVENNAKQLADEPLVAAQRLFGRAVQPLDPNDLLQPGGAGRAAGIF 420 Query: 433 RQRAGFVKKIKDDYN--LTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKE 490 RA + ++ Y + T E L + +++ L ++D D Sbjct: 421 ADRAATLAAMQKQYGSQVGTKPLLPQEAGLLVKAIDSAGPADANQLFGTLRGAINDDDT- 479 Query: 491 GVRSSFAHI----------------EDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKD 534 R+ I + ++S+ + A +L+G Sbjct: 480 -YRAVMQQIAPDSPVKARAGMLTAIDRPVTIQSNLIASDVRVSSAKVAETMLAGEIILNK 538 Query: 535 TETRYNTDHKSNKFDSLYDSYINTPL----TKLEQSTAGGNFNKDKEAIKLYLLGSMKDS 590 T+ + D +++ +T L +S G D +A Y +G + Sbjct: 539 TKGQKQVDGQAHSLFVPARQDFSTAFVDSVGNLYRSRPGAQ-EGDLQAAFAYYVGKSAEL 597 Query: 591 GNYT-----LNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDFE----DRLWYATKD 641 G ++ V +A+ +G+ + P GM+ +D + +R K+ Sbjct: 598 GRLASTPKDIDSTLVKEAITATIGDVVDVNGQGKAKAPIGMTGSDMQAKLRERFSEMVKE 657 Query: 642 TGEYDPYTIKYMNVG------SGKYMIIKNGNPKVDKEGKTIIINVE 682 + + G G+YM+ G P + K G ++++++ Sbjct: 658 QDMPPSAVASFEHYGVINYRRDGQYMLTLGGAPVLGKGGAPLVVDLD 704 >gi|48697205|ref|YP_024935.1| hypothetical protein BcepC6B_gp15 [Burkholderia phage BcepC6B] gi|47779011|gb|AAT38374.1| gp15 [Burkholderia phage BcepC6B] Length = 706 Score = 77.2 bits (188), Expect = 8e-12, Method: Composition-based stats. Identities = 108/710 (15%), Positives = 208/710 (29%), Gaps = 118/710 (16%) Query: 26 NPQAGLRDLGDALGKATQFLEGIRRDN-----AFATANTRYTEL-SFKAVQDFH-----D 74 N A L+ +G +LG+ T L +R + A + R T L + + +D D Sbjct: 31 NTGAALQQVGTSLGQVTDVLAAQKRQDEQAAVARQIGDDRVTWLQNMQTAKDSAAPGAPD 90 Query: 75 FTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKG 134 FT +L + ++++ DG + Y LS + + Q E + Sbjct: 91 FTPNLIKGFDDYSQQQLQQMPDGNAKRFYTLQLSELRTSLAGQAVTWQAEQHRAN---NV 147 Query: 135 VDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKL 194 Q G N T A + DPS + Q I+S +D + K KL Sbjct: 148 NQYQQG----------NDTA---ARAIAMDPSLYGSTRASQLAL--IDSAALDPQTKAKL 192 Query: 195 LSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVE 254 K+ +T + + DP + + P Sbjct: 193 ADNFKDVASTAAGMRMVSADPAATLGVLTQKPDQPLP----------------------- 229 Query: 255 DVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKP 314 AG++ + D +L+ L + Q R Q + Sbjct: 230 --------AGYEWVGDLPPTKLI-----ALQGHARTLVAQQQNALDRAALQRENTAVDLH 276 Query: 315 SELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDE---- 370 ++ + L + + E +L + + A V + + + F + Sbjct: 277 NQALTLVNEGKQLSPEFTNQLLTNTE-GTTVAGAVREL-IGTASQHAGFASASLPQQAAT 334 Query: 371 ---YISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLR-FDTEE 426 Y + + A K+ LE+ H S+ D + + + + DT Sbjct: 335 LQQYQTEAATRGTDPDQAASVKQ-LEQIHTASVAAYKADPWNAALDRGVLQGVPKVDTSS 393 Query: 427 --SFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTL 484 SL RA ++ S E + T + P +D A N + Sbjct: 394 VPGLVSSLAARAQTAGAVEQAAGRRVSLLTPDEAEKTLTAVDSLP----IDTKAQALNQI 449 Query: 485 SDSDKEG--VRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDT---ETRY 539 + + A ++ + + L + A +L+ T Sbjct: 450 GQAFGNAGRIADLAAQWKEKSPAMALALKAG---AADVGGKPLLTTSGAPLSTFILTGAQ 506 Query: 540 NTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLL-GSMKDSGNYTLNRV 598 K+ K D + + + + D + ++ GS G + Sbjct: 507 ALKDKTVKIDEVAGTGMRATIANAIGDALPPEQADDAKEAAYFIAAGSAARGGRTQPSST 566 Query: 599 RVSDAMQIVLGNTPVNINE------SMLMPPRGMSKTDFEDRLWYATKDTGEYDP----- 647 V +A+ G + +++ P G + DF+ + T E Sbjct: 567 DVQNAISGATGGISTTGGQQINGKPNIVAMPYGWREDDFQSAVKQVTAANIENPGVDSVL 626 Query: 648 ---------------YTIKYMNVG-SGKYMIIKNGNPKVDKEGKTIIINV 681 + + + VG G Y + DK G + +++ Sbjct: 627 ANGHEIPVADFVKQFASYRLVRVGIRGTYAVSTGSKFVTDKTGAPVTVHL 676 >gi|221213944|ref|ZP_03586917.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166121|gb|EED98594.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 706 Score = 76.8 bits (187), Expect = 1e-11, Method: Composition-based stats. Identities = 98/655 (14%), Positives = 193/655 (29%), Gaps = 115/655 (17%) Query: 74 DFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTK 133 DFT +L A ++++ DG + Y ++ D+R Q + H Sbjct: 92 DFTPNLIKGFDDYAQQQLQQMADGPAKRFY-----TLQLNDLRTSLAGQAIAWQAEQH-- 144 Query: 134 GVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQK 193 + Q+ + AA + DP+ + Q I++ ID + K K Sbjct: 145 -RSYNVSQYQQGND--------AAARAIAMDPNLYGATRASQLAL--IDAAQIDPQTKAK 193 Query: 194 LLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVV 253 L+ K+ +T + + DP M + P Sbjct: 194 LVENFKDVASTAAGMRMVSNDPAAALGVMTQKPDQPLP---------------------- 231 Query: 254 EDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK 313 AG++ + D +L+ L + Q A R Q + Sbjct: 232 ---------AGYEWVGDLPPAKLV-----ALQGHARTLVAQRQNAAEREALQRENAAVDL 277 Query: 314 PSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYIS 373 ++ + L + + E +L Q + A V + M++ A + + + Sbjct: 278 HNQALTLVNEGKQLSPEFTNQLLTSTQ-GTSVADAVRELIGMASQRAGFASASLAQQAAT 336 Query: 374 NPSLSLASTMMAT-----KYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLR-FDTEE- 426 + T + LE+ H S+ D + + + + DT Sbjct: 337 LQQYQTEAATRGTDPDQAATVKQLEQIHTASVAAYKADPWNAALDRGVLQGVPQVDTTSV 396 Query: 427 -SFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLS 485 SL RA ++ S E + + P +D A N + Sbjct: 397 PGLVTSLASRAQAAGAVEQAAGRRVSLLTPDEAQTVLNAVDALP----IDTKAQALNQIG 452 Query: 486 D--SDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMV---------ILSGMKHQKD 534 + + A ++ + + L + +D M IL+G + KD Sbjct: 453 QAFGNAGRIADLAAQWKEKSPAMALALKAGAADAGGRPLMTTSGAPLSTFILAGAQALKD 512 Query: 535 TETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLL-GSMKDSGNY 593 K+ K D + + + + + +D + ++ GS G Sbjct: 513 ---------KTVKIDDVAGTGMRATIANAIGDSLPTEQAEDAKEAAYFIAAGSAARGGRT 563 Query: 594 TLNRVRVSDAMQIVLGNTPVNIN------ESMLMPPRGMSKTDFEDRLWYATKDTGEYDP 647 + V +A+ G +++ P G + DF+ + T E Sbjct: 564 QPSSTDVQNAISGATGGISTTGGLQINGKPNIVAMPYGWREEDFQSAVKSVTAANIENPG 623 Query: 648 --------------------YTIKYMNVG-SGKYMIIKNGNPKVDKEGKTIIINV 681 + + + VG G Y + DK G I +++ Sbjct: 624 VDTVVANGHEIPVADFVKQFASYRLVRVGVRGTYAVKTGSKFVTDKSGTPITVHL 678 >gi|221201508|ref|ZP_03574547.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207936|ref|ZP_03580942.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221172121|gb|EEE04562.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221178776|gb|EEE11184.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 706 Score = 76.4 bits (186), Expect = 2e-11, Method: Composition-based stats. Identities = 99/655 (15%), Positives = 194/655 (29%), Gaps = 115/655 (17%) Query: 74 DFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTK 133 DFT + A ++++ DG + Y ++ D+R Q + H Sbjct: 92 DFTPNFIKGFDDYAQQQLQQMADGPAKRFY-----TLQLNDLRTSLAGQAITWQAEQH-- 144 Query: 134 GVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQK 193 + Q+ + AA + DP+ + Q I++ ID + K K Sbjct: 145 -RSYNVSQYQQGND--------AAARAIAMDPNLYGATRASQLAL--IDAAQIDPQTKAK 193 Query: 194 LLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVV 253 L+ K+ +T + + DP M + P Sbjct: 194 LVENFKDVASTAAGMRMVSNDPAAALGVMTQKPDQPLP---------------------- 231 Query: 254 EDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK 313 AG++ + D +L+ A L + Q A R Q + Sbjct: 232 ---------AGYEWVGDLPPAKLI-----ALQAHARTLVAQQQNAAEREALQRENAAVDL 277 Query: 314 PSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYIS 373 ++ + L + + E +L Q + A V + M++ A + + + Sbjct: 278 HNQALTLVNEGKQLSPEFTNQLLTSTQ-GTSVAGAVRELIGMASQRAGFASASLAQQAAT 336 Query: 374 NPSLSLASTMMAT-----KYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLR-FDTEE- 426 + T + LE+ H S+ D + + + + DT Sbjct: 337 LQQYQAEAATRGTDPDQAATVKQLEQIHTASVAAYKADPWNAALDRGVLQGVPQVDTSSV 396 Query: 427 -SFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLS 485 SL RA ++ S E + T + P +D A N + Sbjct: 397 PGLVTSLASRAQAAGAVEQAAGRRVSLLTPDEAQTVLTAVDALP----IDTKAQALNQIG 452 Query: 486 D--SDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMV---------ILSGMKHQKD 534 + + A ++ + + L + +D M IL+G + KD Sbjct: 453 QAFGNAGRIADLAAQWKEKSPAMALALKAGAADAGGRPLMTTSGAPLSTFILAGAQALKD 512 Query: 535 TETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLL-GSMKDSGNY 593 K+ K D + + + + + +D + ++ GS G Sbjct: 513 ---------KTVKIDDVAGTGMRAAIANAIGDSLPPEQAEDAKEAAYFIAAGSAARGGRT 563 Query: 594 TLNRVRVSDAMQIVLGNTPVNIN------ESMLMPPRGMSKTDFEDRLWYATKDTGEYDP 647 + V +A+ G +++ P G + DF+ + T E Sbjct: 564 QPSSTDVQNAISGATGGISTTGGLQINGKPNIVAMPYGWREEDFQSAVKSVTAANIENPG 623 Query: 648 --------------------YTIKYMNVG-SGKYMIIKNGNPKVDKEGKTIIINV 681 + + + VG G Y + DK G I +++ Sbjct: 624 VDTVVANGHEIPVADFVKQFASYRLVRVGVRGTYAVSTGSKFVTDKTGAPITVHL 678 >gi|254251751|ref|ZP_04945069.1| hypothetical protein BDAG_00948 [Burkholderia dolosa AUO158] gi|124894360|gb|EAY68240.1| hypothetical protein BDAG_00948 [Burkholderia dolosa AUO158] Length = 698 Score = 70.6 bits (171), Expect = 9e-10, Method: Composition-based stats. Identities = 70/474 (14%), Positives = 145/474 (30%), Gaps = 47/474 (9%) Query: 247 ERSLEVVEDVSKAIGLAGW--DRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDA 304 + + G+ +LD KR ++L + S +L ++ E QA+ R Sbjct: 226 GDLAALTQLEKDLTSPDGYYAGKLDAGKRNQVLASVVSNRLSLQNQMTSEQQAREREAVT 285 Query: 305 QLNHGIT-VKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKF 363 N + + E Q +GT L Q A A +S + + Sbjct: 286 AFNQATDLMTQGKRFSPEYVQQLTAATRGTALEQQTQGVIKQAAVGASFSTLSVPEMRAA 345 Query: 364 LQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQID---PL 420 +Q + + + + A K+ ++ + + +D + ++ ID P+ Sbjct: 346 VQANESK-QNQSG---TDPLEAAAIKQQ-KQILTATDEAYKRDPWNAALERGAIDVVPPI 400 Query: 421 RFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGA 480 A SL RA ++ S E + + P + + Sbjct: 401 DTSGITQLASSLAARAQLAPVVEHKAARRVSLLTPDEARNVLQTIDALPTNTKAQALALL 460 Query: 481 YNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMK-HQKDTETRY 539 ++ ++ + +D +A + + D +++ SGM Q + + Sbjct: 461 GRSMGNA--ARINDLAEQWKDKSPAAALA-MKAGAADPSGGPLMMQSGMPVAQYILDGQD 517 Query: 540 NTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKEAIKLYLL-GSMKDSGNYTLNRV 598 +K K D+ + + + K D + S + +G N Sbjct: 518 ALANKLVKVDAAAATGLQATIAKRIGDALPPQQLGDARETAYFAAVASARKNGRDVPNST 577 Query: 599 RVSDAMQIVLGNTP-----VNINESMLMP-PRGMSKTDFEDRLWYATKDTGEYDPY---- 648 V + + G + + P G S DF+ + A+ E P Sbjct: 578 DVETGINVATGGLAKTGGIDPRGDRYMAAKPWGWSDDDFDGGVKQASITNIENQPGGRPV 637 Query: 649 --------------------TIKYMNVG-SGKYMIIKNGNPKVDKEGKTIIINV 681 + + VG G Y ++ P D G ++I++ Sbjct: 638 DSVIANGTKIPIDQFMQQFASYRLQRVGIGGTYTVLTGARPVTDTTGAPLLIHL 691 >gi|50282962|ref|YP_053018.1| hypothetical protein VP5_gp16 [Vibrio phage VP5] Length = 694 Score = 69.9 bits (169), Expect = 1e-09, Method: Composition-based stats. Identities = 106/662 (16%), Positives = 217/662 (32%), Gaps = 77/662 (11%) Query: 1 MIEITTNKPLVEAVTPNQDAVRDS-VNPQAGLRDLGDA----LGKATQFLEGIRRDNAFA 55 MI + T L TP+ + S V+ + + + + K Q +++ AF+ Sbjct: 1 MINLPTAAQLQTPATPSAQGLASSIVSNNQKVMGIANQAASEVNKVIQERNDEQQNAAFS 60 Query: 56 TANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDV 115 N F A ++ + D L + +D + + R Sbjct: 61 KVNN--VMHDFDAANAGKEYYTGDEIPDDLDFKKT-EIQIDAD-----GNEVEVVRERIP 112 Query: 116 RKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQ 175 + QA + I + R M + ++ Y +R Sbjct: 113 AYEIQAAMRDKYHRKAIAAAAADIESPSRRTLFMQQMEERANSQNIR-------YLERSA 165 Query: 176 SITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDS 235 + + Q LL+ E A + F +K + Sbjct: 166 AEQTESIRRQQNEDFTQSLLNRDYETAA------YIAHN---FNGSDTERQEKARNVRYM 216 Query: 236 TSLSDVSDSARERSLEVVEDVSKAIGLAGWDR-LDDTKRRRLLEHLSSRDNALNTKLRKE 294 +D+ E + ++ K + G+ L++ +R ++ L +R N +N + Sbjct: 217 MEADSYADAMSEDDVGRMQSALKFLSKDGYAGELNEPQRLATIDQLKARINLVNKRNEAS 276 Query: 295 TQAQARRIDAQLNHGITV-KPSELIPLEDYTQAYGVEQG-----------TELYNLQQFK 342 Q + + +++ I + E++ + + + Q + F Sbjct: 277 YQGRLTLLGVEVDRTIDALESGEMVSPQHQARLFAQIQAAKDAGLISSEDSSWVKRTAFF 336 Query: 343 SVAA---PDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 S AA PD+ + K + + ++++ + +S +I K+H + Sbjct: 337 SEAATYSPDLVQFKSKPLQEQEAVIREMRNSAVSGSDYH---------KLDIYTKNHENA 387 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTT--SHFNKTE 457 LN D + +G + ID FD + A SL +R V +K Y + + F K E Sbjct: 388 KSMLNSDPLMYGAQTGIIDLQPFDFADP-AASLVKRQNDVASMKSQYGDSMVPAFFTKQE 446 Query: 458 ENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDD 517 + + P + + ++ LS + VRS + NG + S + Sbjct: 447 AMEFSRVVQDAPVKDQMRMLSIINGQLSKDNANAVRS---QLMGNGGGVLAVASDALDSN 503 Query: 518 AKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINT----PLTKLEQSTAGGNFN 573 A IL G K+ F S D+ + + ++ Sbjct: 504 QPYVAEKILMGRNALKEVP---------EAFKSW-DTDASPLVLSEIGNAYSASP-ATMG 552 Query: 574 KDKEAIKLYLLGSMKDSGNY--TLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDF 631 + E+IK + G++ T+N R+ ++ V G + +++ P R M + F Sbjct: 553 QVMESIKAAYAAQTVNEGDFSGTINSKRLKSVIKEVTGGLIKIGSSTIVSPDRNMDENKF 612 Query: 632 ED 633 + Sbjct: 613 DS 614 >gi|50282929|ref|YP_052985.1| hypothetical protein VP2p17 [Vibrio phage VP2] Length = 694 Score = 69.5 bits (168), Expect = 2e-09, Method: Composition-based stats. Identities = 105/662 (15%), Positives = 217/662 (32%), Gaps = 77/662 (11%) Query: 1 MIEITTNKPLVEAVTPNQDAVRDSV-NPQAGLRDLGDA----LGKATQFLEGIRRDNAFA 55 MI + T L TP+ + S+ + + + + + K Q +++ AF+ Sbjct: 1 MINLPTAAQLQTPATPSAQGLASSIISNNQKVMGIANQAASEVNKVIQERNDEQQNAAFS 60 Query: 56 TANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDV 115 N F A ++ + D L + +D + + R Sbjct: 61 KVNN--VMHDFDAANAGKEYYTGDEIPDDLDFKKT-EIQIDAD-----GNEVEVVRERIP 112 Query: 116 RKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQ 175 + QA + I + R M + ++ Y +R Sbjct: 113 AYEIQAAMRDKYHRKAIAAAAADIESPSRRTLFMQQMEERANSQNIR-------YLERSA 165 Query: 176 SITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDS 235 + + Q LL+ E A + F +K + Sbjct: 166 AEQTESIRRQQNEDFTQSLLNRDYETAA------YIAHN---FNGSDTERQEKARNVRYM 216 Query: 236 TSLSDVSDSARERSLEVVEDVSKAIGLAGWDR-LDDTKRRRLLEHLSSRDNALNTKLRKE 294 +D+ E + ++ K + G+ L++ +R ++ L +R N +N + Sbjct: 217 MEADSYADAMSEDDVGRMQSALKFLSKDGYAGELNEPQRLATIDQLKARINLVNKRNEAS 276 Query: 295 TQAQARRIDAQLNHGITV-KPSELIPLEDYTQAYGVEQG-----------TELYNLQQFK 342 Q + + +++ I + E++ + + + Q + F Sbjct: 277 YQGRLTLLGVEVDRTIDALESGEMVSPQHQARLFAQIQAAKDAGLISSEDSSWVKRTAFF 336 Query: 343 SVAA---PDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQS 399 S AA PD+ + K + + ++++ + +S +I K+H + Sbjct: 337 SEAATYSPDLVQFKSKPLQEQEAVIREMRNSAVSGSDYH---------KLDIYTKNHENA 387 Query: 400 MQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTT--SHFNKTE 457 LN D + +G + ID FD + A SL +R V +K Y + + F K E Sbjct: 388 KSMLNSDPLMYGAQTGIIDLQPFDFADP-AASLVKRQNDVASMKSQYGDSMVPAFFTKQE 446 Query: 458 ENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDD 517 + + P + + ++ LS + VRS + NG + S + Sbjct: 447 AMEFSRVVQDAPVKDQMRMLSIINGQLSKDNANAVRS---QLMGNGGGVLAVASDALDSN 503 Query: 518 AKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINT----PLTKLEQSTAGGNFN 573 A IL G K+ F S D+ + + ++ Sbjct: 504 QPYVAEKILMGRNALKEVP---------EAFKSW-DTDASPLVLSEIGNAYSASP-ATMG 552 Query: 574 KDKEAIKLYLLGSMKDSGNY--TLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDF 631 + E+IK + G++ T+N R+ ++ V G + +++ P R M + F Sbjct: 553 QVMESIKAAYAAQTVNEGDFSGTINSKRLKSVIKEVTGGLIKIGSSTIVSPDRNMDENKF 612 Query: 632 ED 633 + Sbjct: 613 DS 614 >gi|212710807|ref|ZP_03318935.1| hypothetical protein PROVALCAL_01875 [Providencia alcalifaciens DSM 30120] gi|212686504|gb|EEB46032.1| hypothetical protein PROVALCAL_01875 [Providencia alcalifaciens DSM 30120] Length = 913 Score = 52.5 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 63/416 (15%), Positives = 144/416 (34%), Gaps = 31/416 (7%) Query: 258 KAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSEL 317 + G +D LD++ + + L+ D +L+ E A +G+ + Sbjct: 397 RLTGSVAFDALDESDQAKYLKQAEQMDKQRQQQLQIELGTTISDSYAAWENGLDAPNAP- 455 Query: 318 IPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSL 377 + A+G ++GT + + A ++ K M+ + L I + + Sbjct: 456 -SEKTIISAFGYDKGTAMIADMREAKRYAGLISAAKDMTPQAQRSLLNSIMPDPSES--- 511 Query: 378 SLASTMMA-TKYKEILEKSHRQSMQELNQDAISWGIKYKQ-IDPLRFDTEESFADSLRQR 435 + AS M K+ + ++ + + + + + + I+ +DP +++ Sbjct: 512 NYASKMQRWEKFGKFVDSNIKAQNKAFSANRLQLSIQNNFPLDPNDKSNQDAADHYFANN 571 Query: 436 AGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSS 495 I+D+ +L + + +Q+ + A+ S +S Sbjct: 572 IESSFNIRDENSLNAVAEISSRTGIIPSQIKSKLATGSTSRNPELVIPMSKMYG------ 625 Query: 496 FAHIEDNGLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSN-KFDSLYDS 554 I DN SA+ LS++ + SGM +K + YN ++ N + ++ Sbjct: 626 --QIFDNNPSAINGLSTDAMAYYGKIYDLTRSGMDSEKAIDIAYNLTYEQNDRTKAIIAE 683 Query: 555 YINTPLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSG----NYTLNRVRVSDAMQIVLGN 610 + E+ A A+ + ++ G Y + + DA G Sbjct: 684 QMRDKKYSKERDKAAQGNINSFFALGGFSSPNVTKPGINNREYMRDYQTLYDANFASTGG 743 Query: 611 TPV---NINESMLMPPRGMSKTDFEDRLWYATKDTGEYDPYT-IKYMNVGSGKYMI 662 P + ++ + G++ + +D + Y P G+G ++I Sbjct: 744 DPDLAKKMTDAQVKKTWGVTSINGKD-------EVMRYAPEAVYGTNESGAGNWII 792 >gi|319409425|emb|CBI83071.1| conserved membrane hypothetical protein [Bartonella schoenbuchensis R1] Length = 810 Score = 50.6 bits (119), Expect = 8e-04, Method: Composition-based stats. Identities = 61/306 (19%), Positives = 104/306 (33%), Gaps = 45/306 (14%) Query: 19 DAVRDSV-NPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAV-QDFHDFT 76 ++ + + QA LRD AL T E R A Y + +D ++ Sbjct: 498 ESAQQRLKQAQAALRD---ALRHGTSSEEIERLMANLRQAVNDYISALPEKELKDSNERR 554 Query: 77 NSLDTRDSLQAGDKIKEYV----DGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHT 132 +L Q D I+E ++ H RK QA+ E+D ++ Sbjct: 555 ATLSEDSLSQKLDSIEEMAKLGSSSAAEQLLAEIEQTLDHLQNRKSNQAKEENDHQETQM 614 Query: 133 KGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSN--ENYFQRVQSITDHINSLPIDLR- 189 K Q+G R Q++ LN T L +D ++ N ++ +++ D + L +L Sbjct: 615 KKKLDQLGDLMRRQQEILNDTHHLEKKQ-RYDETDVTSNKEKQSKALEDRLIELQSELSE 673 Query: 190 ----------LKQKLLSEAKEKLNTNQI-------IGAHARDPRVFENF----------M 222 + L EA++K+N + + E+ M Sbjct: 674 LEKGLSAQGLKQNDALKEAEKKMNLAKTALDNGDYETSMQNQSEALESLRQGAQSVLQEM 733 Query: 223 RAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSS 282 R KKG D+ D + G+ G + D + RR+LE + Sbjct: 734 REILKKGDSHNDAHGPQD-----PLGRPLSSQTGQGQEGVGGAQKNDQMRARRILEEIRK 788 Query: 283 RDNALN 288 R N Sbjct: 789 RLNNGR 794 >gi|301119773|ref|XP_002907614.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262106126|gb|EEY64178.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 1549 Score = 48.3 bits (113), Expect = 0.004, Method: Composition-based stats. Identities = 53/241 (21%), Positives = 92/241 (38%), Gaps = 23/241 (9%) Query: 82 RDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYH----TKGVDI 137 + + D+++ V+ R R+ +FLS R R + QV I H TK Sbjct: 432 GNEFKLLDELQAAVNSRSRTVDSQFLSGFMQRYERHHYIQQVSRFILSMHPELETKQPQA 491 Query: 138 QIGATQRAQEDNLNMTVGLAA---------AHVLHDPSNENYFQRVQSITDHINSLPIDL 188 AT + Q D+L + A + +D +Y VQ D L + Sbjct: 492 IQQATGQKQFDDLTDEEKVCERRRQFHFRVAAIRNDLKKADY--NVQWCKDRQIDLVALI 549 Query: 189 RLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARER 248 R + + E + +L ++ D VFEN ++ F+ K + + D Sbjct: 550 RSAESRILEKQAELERARMFKGPQMDSDVFENGLQRFFTKEL-------IVALEDEVEID 602 Query: 249 SLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNT-KLRKETQAQARRIDAQLN 307 L +++ S+ + + + R D +R LL+ L R AL + A++ R A L Sbjct: 603 QLYILDTKSEIVLVENYLRADIKRRDTLLKRLKDRQEALGVFESSPRNTAESTRTAATLA 662 Query: 308 H 308 Sbjct: 663 K 663 >gi|195433509|ref|XP_002064753.1| GK15035 [Drosophila willistoni] gi|194160838|gb|EDW75739.1| GK15035 [Drosophila willistoni] Length = 3390 Score = 48.3 bits (113), Expect = 0.004, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 72/227 (31%), Gaps = 31/227 (13%) Query: 10 LVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAV 69 ++E + VR+S LRDLG ++ + + FA + ++ Sbjct: 2056 ILEHIKNLSVHVRESH---EELRDLGQSIDQHL--------NQTFAEQGETAVQTIYQLH 2104 Query: 70 QDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRD 129 + + N LD +L + + + R+VR+ + + E Sbjct: 2105 KQLNQDLNQLDGSTAL-----LNSTL----------VHQAEQQREVRRHWLPKAEKHADH 2149 Query: 130 YHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENY--FQRVQSITDHINSLPID 187 + + + M A ++ ++ FQ + + + +L Sbjct: 2150 LLERSNEYARQFQPTRNAARIAMLASSAHTNISKAINDARLASFQAKERVHEAQRTL--Y 2207 Query: 188 LRLKQKLLSEAKEKLNTNQIIGAHARDPRVFEN-FMRAFYKKGHPPK 233 ++ A+ L+ +Q + A N + ++G + Sbjct: 2208 PTDGNSMIDRARHSLHRSQQLQQEAIKQMHKSNVLKQKVQRQGQQVE 2254 >gi|223982669|ref|ZP_03632901.1| hypothetical protein HOLDEFILI_00175 [Holdemania filiformis DSM 12042] gi|223965368|gb|EEF69648.1| hypothetical protein HOLDEFILI_00175 [Holdemania filiformis DSM 12042] Length = 1550 Score = 47.1 bits (110), Expect = 0.010, Method: Composition-based stats. Identities = 64/404 (15%), Positives = 121/404 (29%), Gaps = 34/404 (8%) Query: 11 VEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQ---FLE--GIRRDNAFATANTRYTELS 65 V + N++ Q L L D G A Q +E +++D A A + + Sbjct: 479 VSCIIKNRNHDGIVHWSQLTLVTLADEFGHARQAVGIIEDITLQKDAELAYAKEQQYRKA 538 Query: 66 FKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSIS----HRDVRKKFQA 121 A N D +GD +D R+ YD+F +++ H D R+ F Sbjct: 539 IAAGACLMIEINLCDNLVEKISGD-WSFLIDEATRTPYDQFQRNVARQQIHPDDRRAFLQ 597 Query: 122 QVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHI 181 + +I R M AH++ DP+ ++ +V I Sbjct: 598 TFSREALKAAYNQERREIPLQYRRLSQKRQMVWVETTAHIIQDPATQHL--KVLFFIRDI 655 Query: 182 NSLP--IDLRLKQKLLSEAKEKLNT----NQIIGAHARDPRVFENFMRAFYKKGHPPKD- 234 + + Q LN QI DP+ + + D Sbjct: 656 DKEKKHELILQYQSQRDSLTGLLNKGSTEAQIQSVLENDPQALHALLVLDIDEFKKINDT 715 Query: 235 -STSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRK 293 D S R + V + +G G D + E + +R Sbjct: 716 FGHLYGDQVLSRIARQIREVFRENDVLGRIGGDEFAILMKNCKTEQI----------VRD 765 Query: 294 ETQAQARRIDAQLNHGITVKPSELIPLED---YTQAYGVEQGTELYNLQQFKSVAAPDVA 350 + + +++ G++ +D + Q Y + +Q K+ + Sbjct: 766 KAEQLCQKLTTLFADGVSCSIGVSFFAKDGTYFYQLYQNADIALYQSKRQGKNQVQFYLP 825 Query: 351 RIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEK 394 +K S + ++ E + + + + E L Sbjct: 826 SMKQESWTVNSTEIDELRTEMPVIAAPEMEEHLTSQ-LLEELND 868 >gi|194761028|ref|XP_001962734.1| GF14282 [Drosophila ananassae] gi|190616431|gb|EDV31955.1| GF14282 [Drosophila ananassae] Length = 3399 Score = 47.1 bits (110), Expect = 0.011, Method: Composition-based stats. Identities = 49/349 (14%), Positives = 115/349 (32%), Gaps = 56/349 (16%) Query: 28 QAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTE-------LSFKAVQDFHDFTNSLD 80 Q +R+LG + ++ Q + + A + Y E L+ + ++ N LD Sbjct: 2069 QEHVRNLGLRVTESHQDIRDLNLQIA-KQLDPDYLERGEGMVRLTLQKQLVLNEHLNRLD 2127 Query: 81 TRD---SLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDI 137 + G K+++ R+VRK + + E R + + Sbjct: 2128 GSAVMLNATLGVKLEQ------------------QREVRKHWLPKAEKHARHLQDRSNEY 2169 Query: 138 QIGATQRAQEDNLNMTVGLAAAHVLHDPSNENY-----------FQRVQSITDH---INS 183 + M A +++ ++ QR +D I+ Sbjct: 2170 ARQFQPTRNGARIAMLASSAHSNITKAINDARVASILAKEGVYEAQRTLYPSDGGSMIDR 2229 Query: 184 LPIDLRLKQKLLSEAKEKLNTNQII--GAHARDPRVFENFMRAFYKKGHPPKDSTSLSDV 241 L ++L EA ++++ + ++ H ++ RV F S L + Sbjct: 2230 AKHSLYRSKQLQQEALKQMHKSNVLKDKLHRQEQRVEGIKATIFDAGRRTNVISGQLQGL 2289 Query: 242 SDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSR---------DNALNTKLR 292 ++++ R + D+++ G R D R + E + + D + + Sbjct: 2290 AENSARRQAKESLDLAERTGKQ--MRADLQMARTMQESIQNMRKSFAALEPDWEIKLGMA 2347 Query: 293 KETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQF 341 +E + + N ++ + ++ +A+ +L L+ Sbjct: 2348 QENISLTKTNLRLANVSLSYLELQAAKEQEVFEAWNNSMAQQLQQLRDL 2396 >gi|325000921|ref|ZP_08122033.1| von Willebrand factor type A [Pseudonocardia sp. P1] Length = 652 Score = 46.8 bits (109), Expect = 0.013, Method: Composition-based stats. Identities = 50/344 (14%), Positives = 115/344 (33%), Gaps = 27/344 (7%) Query: 8 KPLVEAVTP---NQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTEL 64 + ++E +P Q+ +R ++ + GL DL + + + L+ R + + Sbjct: 38 RDVMEGSSPRQALQELMRRGLDGRRGLDDLTRRVWEQRRDLQRDNRIDGTLQEVRELLQR 97 Query: 65 SFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVE 124 + A Q+ +S D R D + G +R S R+ + + + Sbjct: 98 ALDAEQESLRNEDSDDARFREMQLDALPSDTGGAVRELGSYDWRSADAREAYSEIRDLLG 157 Query: 125 HDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQR-VQSITDHINS 183 ++ D +G+ + + +N + + H E+ +R + + H + Sbjct: 158 REMLDQRFEGMKQALEGATPEDVERINEMLDDLNDLLEHHARGEDTTERFAEFMGKHGDH 217 Query: 184 LPIDLRLKQKLLSEAKEKLNTNQ-IIGAHARDPRV-FENFMRAFYKKGHPPKDSTSLSDV 241 P + + ++L+ + Q ++ + + + R + + + L Sbjct: 218 FPENPQNTEELVDALAARSAAAQRMMNSMSAEQRAELAELAQQAFGDPRLAQSLARLD-- 275 Query: 242 SDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARR 301 A+ + + ED S G + + + R LE L D + A+ Sbjct: 276 ---AQLQGMRSGEDWSGEGNFRGDNPMGMGEATRALEQLGQLDALAEQLAQSYPGARM-- 330 Query: 302 IDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVA 345 E I L+ T+ G E + L + + Sbjct: 331 --------------EDIDLDALTEMLGPEAAADARTLAELEREL 360 >gi|15887953|ref|NP_353634.1| transcriptional regulator [Agrobacterium tumefaciens str. C58] gi|15155555|gb|AAK86419.1| transcriptional regulator [Agrobacterium tumefaciens str. C58] Length = 471 Score = 45.2 bits (105), Expect = 0.035, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 68/217 (31%), Gaps = 32/217 (14%) Query: 135 VDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKL 194 I G R + L +T A + PS N +R Q L + K L Sbjct: 4 NKIFAGPRVRRIRNGLALTQTAMAEALAISPSYLNLIERNQRPLTVQLLLKLASVYKVDL 63 Query: 195 LSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEV-- 252 E + + DP + P++ ++D + +A L++ Sbjct: 64 DDLQGESAGSAGQLREVFADPLLVGEVPS--------PQELIEVADAAPNAASGVLKLYR 115 Query: 253 --------VEDVSKAIGLAGWDRLDDTKRRRLLE--HL------------SSRDNALNTK 290 + D+S + G + + R + E H+ ++ + Sbjct: 116 AYREQAQRLSDLSDLLAREGHETALSSTRLPIDEVRHVFENRPAYFHAIDAAAEELHKQL 175 Query: 291 LRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAY 327 + A R Q NHGI V+ + + + + Y Sbjct: 176 STGDDLASGLRAWLQKNHGIVVRTLPVHAMPNLRRRY 212 >gi|156554052|ref|XP_001599488.1| PREDICTED: similar to CG9170-PA [Nasonia vitripennis] Length = 1362 Score = 44.8 bits (104), Expect = 0.049, Method: Composition-based stats. Identities = 55/385 (14%), Positives = 147/385 (38%), Gaps = 39/385 (10%) Query: 46 EGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDR 105 E +++ + + + + F+ V++ + S + + ++ + ++D +R + + Sbjct: 509 EHMQQQESRSIDDEEFLPNKFENVKNIDLISKSDSEKTTPRSSESRSSFLD-DLRKSKEI 567 Query: 106 FLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDP 165 + ++ ++R K A+ E I + + ++ + ++ + P Sbjct: 568 MMENMKKSELRAKKIAEDEARIAENQSWSS--KMLRSLELSKEGIK------------SP 613 Query: 166 SNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAF 225 S+EN +++I +++ + K ++K + NQ+ +++ + E ++ Sbjct: 614 SSENLMTNLEAIKNNMIKENENEIRIFKQELDSKLEAVKNQLEKEFSKEKQSLEASIQKK 673 Query: 226 ---YKKGHPPKDSTSLSDVSDSARE----------RSLEVVEDVSKAIGLAGWDRLDDTK 272 +K K+ + + + E LEV + + L+ D + Sbjct: 674 LLELRKEMTSKEEHEIKKLIEDMGEVRADNLKKVRNELEVCYEKERQDILSNLKGELDQR 733 Query: 273 RRRLLE----HLSSRDNALNTKLRKETQAQA--RRIDAQLNHGITVKPSELIPL-EDYT- 324 +R LLE +S +N L + QA+ I Q N I EL ++ Sbjct: 734 KRELLELRSQEISKLENEHERGLDDDKQAKIVEMEITRQHNERIEDFRKELAKEFDNVRA 793 Query: 325 --QAYGVEQGTEL-YNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLAS 381 +A E+ T++ + ++ + D + ++ K+ L +I +Y + + Sbjct: 794 DLRAQQREKITKITEDHEKCLADILRDFRVDEGLARKVYKQRLDEIRTDYSKDVENEMKK 853 Query: 382 TMMATKYKEILEKSHRQSMQELNQD 406 + +E ++ + + L +D Sbjct: 854 HNDSAVRQESVDFEKIRCEKRLLED 878 >gi|256376324|ref|YP_003099984.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255920627|gb|ACU36138.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 648 Score = 44.5 bits (103), Expect = 0.061, Method: Composition-based stats. Identities = 54/344 (15%), Positives = 115/344 (33%), Gaps = 31/344 (9%) Query: 8 KPLVEAVTP---NQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTEL 64 + ++E +P ++ +R + GL DL L + L R + R + Sbjct: 31 RDIMEGASPESALRELLRRGLPGGRGLDDLTARLWQKRAALARRNRLDGTLQEVRRLLDE 90 Query: 65 SFKAVQ--DFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQ 122 + +A + F D + D R D + +R S R+ ++ Sbjct: 91 ALEAERAALFPDPDD--DARFREAQLDALPTGTAAAVRELSSYDWRSPRARESYERISQL 148 Query: 123 VEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQR-VQSITDHI 181 + ++ D +G+ +G +A + + +G A + S ++ +R + + H Sbjct: 149 LGRELMDQRFQGMKQAMGGVGQADVERIQEMLGDLNALLAAHASGQDVRERFAEFMRRHG 208 Query: 182 NSLPIDLRLKQKLLSEAKEKLNTNQ-IIGAHARDPRV-FENFMRAFYKKGHPPKDSTSLS 239 P D + +L+ + Q ++ + + R + + G S Sbjct: 209 EFFPEDPKDVDELVDVLAARSAAAQRVMNSMTEEQRAELSALSQQAFG-GSQVGSQLSAL 267 Query: 240 DVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQA 299 D + R L ED + G + L + + + L+ D Sbjct: 268 D----SLLRGLRPGEDWDSSARFRGDNPLGLGEAAQAMADLAELD--------------- 308 Query: 300 RRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKS 343 ++ QL+ E I L+ + G E G + L + + Sbjct: 309 -QLAEQLSQSYPGARLEDIDLDALVRQLGPEAGVDARALAELER 351 >gi|227355849|ref|ZP_03840242.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164168|gb|EEI49065.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 1027 Score = 44.1 bits (102), Expect = 0.079, Method: Composition-based stats. Identities = 58/423 (13%), Positives = 130/423 (30%), Gaps = 44/423 (10%) Query: 259 AIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELI 318 G A +D LD+ + + L +L+++ + A G+ + Sbjct: 494 LTGSAAFDALDEADQAKYLRQAEQLRKQKQGELQQQLGTRVADSYAAWERGLEAPNAP-- 551 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 ++ A+G ++G+ + Q A ++ K MS + L +I + S Sbjct: 552 THDELISAFGYDKGSAMSADMQEAKRYAGFMSAAKEMSPQAQQALLSQIRPQTGEVNYES 611 Query: 379 LASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQ-IDPLRFDTEESFADSLRQRAG 437 K+ + +E + ++ ++ + + I+ +DP + +++ D + Sbjct: 612 KIQRW--EKFGKFVEGNIKEQDKQFAANRLQLSIQNNFPLDPNDKNNQQAADDYFEKHIQ 669 Query: 438 FVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFA 497 ++DD +L + +Q+ + ++ + ++ A Sbjct: 670 QSFNLRDDNSLNAVAELTARTGIIPSQVKSVLNMGATSKDPEVVLPIAKMYGQIFDNNPA 729 Query: 498 HIEDNGLSAVVRLSSEFSDD----AKNAAMVILSGMKHQKDTETR-----------YNTD 542 D S + S +S + A+ ++D T+ Y D Sbjct: 730 SATDIPSSTMAYYSKVYSLSRAGMPDDKAVETAFKTTFEQDERTKQMIASQIRDKGYIKD 789 Query: 543 HKS------NKFDSLYDSYINTP----------LTKLEQSTAGGNFN---KDKEAIKLYL 583 N F Y + + + Q+ NF D E K Sbjct: 790 RDKAAQSNINDFYPWYKPFSSPSVSKPGTQNGAYLRDYQTLYDANFAETGGDAELAKKMT 849 Query: 584 LGSMKDS---GNYTLNRVRVSDAMQIVLGNTPVNINESMLMPPRGMSKTDF-EDRLWYAT 639 +K + N + + A + V G + + + K A+ Sbjct: 850 NAQIKRTWAVSNINGSEEVMRYAPEAVYG-INESGAGNWIAGQWEEEKKQLMSKSFGGAS 908 Query: 640 KDT 642 DT Sbjct: 909 SDT 911 >gi|254573256|ref|XP_002493737.1| Myosin-like protein associated with the nuclear envelope [Pichia pastoris GS115] gi|238033536|emb|CAY71558.1| Myosin-like protein associated with the nuclear envelope [Pichia pastoris GS115] gi|328354439|emb|CCA40836.1| Nucleoporin nup211 [Pichia pastoris CBS 7435] Length = 1688 Score = 43.7 bits (101), Expect = 0.10, Method: Composition-based stats. Identities = 43/334 (12%), Positives = 115/334 (34%), Gaps = 25/334 (7%) Query: 198 AKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVS 257 + NT+Q+I ++ + + S+++ E+ + Sbjct: 1175 TGDNDNTSQLISFLRDQKDHVSRELQQSQMEVTRLRQQLSIAEE---------EIEQSKI 1225 Query: 258 KAIGLAGWDRLDDT-----KRRRLLEHLSSRDNALNTKLRKETQAQARRIDA---QLNHG 309 +A L G+++L + + L + S NT LR E Q+Q RI A QL Sbjct: 1226 EAEKLEGYEKLSNELQGEITKSAELSKIVSELRQGNTDLRNENQSQVNRITALYHQLEQE 1285 Query: 310 ITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDD 369 I ++ + N +Q + + + + + ++++++ + Sbjct: 1286 KAKNEPFEIQIKQLKVEVETKGKEATMNQEQLEYWKTRSLNLLNEVGSIGNEEYVKEAES 1345 Query: 370 EYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFA 429 + + ++ ++ L+ + ++ +D Y+ + + ++ A Sbjct: 1346 LKEKLAEVEGKHSELSARFNR-LKDEAQSKLKRRKED-------YEMLRREHEELRKTLA 1397 Query: 430 DSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDK 489 D+ + + +K+ + S + E+ + + + + VD ++ + Sbjct: 1398 DTNAEVSSLTEKLNLSSKTSESLTQELEQLKSKQTSADLNSEKHVDELKKKQEEVERLLA 1457 Query: 490 EGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAAM 523 E + ++ S V + +AK A Sbjct: 1458 ENNKLKETQVQSPETSEEVEALKKELQEAKEAVK 1491 >gi|221102272|ref|XP_002170393.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 898 Score = 43.3 bits (100), Expect = 0.15, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 84/256 (32%), Gaps = 26/256 (10%) Query: 42 TQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRS 101 T + + D T Y +L K + + SL S + DK E + + Sbjct: 389 TDIEDNFKEDTKLLLCQTEYKQL--KDEFELYKKKQSL---SSHPSKDKNDELLQKIVE- 442 Query: 102 AYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVD-IQIGATQRAQEDNLNMTVGLAAAH 160 + I + + + + H + V +QI T + +ED L L H Sbjct: 443 ----YKKKIDYLESNNEQLVLKNNLEEKEHLRIVSTLQIQFT-KLEEDCLQKIEQLKVEH 497 Query: 161 VLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFEN 220 + + EN ++ + ++ L +L + E Q + P+ Sbjct: 498 LKNIELIENELKKQR--IRTLDLLAEKDIEIDRLREKQYENEYLLQ------QTPK---- 545 Query: 221 FMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHL 280 + +K K+ + + ++ ++ S + L R+D + R LE + Sbjct: 546 -SQEVFKFP-EFKEKSVIDEMLQNSPTMSPTKRATMVHYTELLEQQRIDLSSYRAQLEDV 603 Query: 281 SSRDNALNTKLRKETQ 296 + + + Q Sbjct: 604 KNALKDSERREDQLVQ 619 >gi|121602289|ref|YP_988395.1| hypothetical protein BARBAKC583_0057 [Bartonella bacilliformis KC583] gi|120614466|gb|ABM45067.1| conserved hypothetical protein TIGR02302 [Bartonella bacilliformis KC583] Length = 805 Score = 43.3 bits (100), Expect = 0.16, Method: Composition-based stats. Identities = 57/302 (18%), Positives = 110/302 (36%), Gaps = 33/302 (10%) Query: 5 TTNKPLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTEL 64 + + L +A T +DA+R + R + D L +A + IR A N+ Sbjct: 497 SAQQRLKQAQTALRDALRRGASGAEIERLMAD-LRQAMH--DYIRALAEKAPENSDSNNK 553 Query: 65 SFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVE 124 + + H +S++ + + ++ +D ++ H V+K + Q + Sbjct: 554 ANLSEDSLHKKLDSIEQMAQIGSSSSAEQLLD--------EIEQTLDHLRVQKSNRDQEQ 605 Query: 125 HDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVL----HDPSNENYFQRVQSITDH 180 + + K ++G R Q++ LN T L +E +R + Sbjct: 606 KNTQTTKMKKTIDKLGDVMRRQQEILNDTHRLETEKRRGETVTQEQSEALVERQAQLQSE 665 Query: 181 INSLPIDLR----LKQKLLSEAKEKLNTNQIIGAHA-------RDPRVFENFMR----AF 225 ++ L +L K +L EA+EK+N+ + H V E+ + Sbjct: 666 LSVLEKELSEQGFEKGGVLKEAEEKMNSAKAALGHGNHETSVKNQSDVLESLRQGAQNIL 725 Query: 226 YKKGHPPKDSTSLSDVSDSAR--ERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSR 283 K K S + + ++S R L D + G D + R++LE + R Sbjct: 726 QKMREALKQSEDMQNAAESQDPLGRPLSSDSDQGQN-GKTMSQEGDKAQARQILEEIRRR 784 Query: 284 DN 285 N Sbjct: 785 LN 786 >gi|197101129|ref|NP_001126386.1| FCH and double SH3 domains protein 1 [Pongo abelii] gi|55731282|emb|CAH92355.1| hypothetical protein [Pongo abelii] Length = 598 Score = 42.9 bits (99), Expect = 0.20, Method: Composition-based stats. Identities = 58/413 (14%), Positives = 120/413 (29%), Gaps = 47/413 (11%) Query: 22 RDSVNPQAGLRDLGDALGKATQFLEGIRRDNA--FATANTRYTELSFKAVQDFHDFTNSL 79 + + RDL G++ + E + R A + + Sbjct: 96 QTRLQASDRYRDLAGGTGRSAK--EQVLRKGTENLQRAQAEVLQSVRDLSR--SRKLYGQ 151 Query: 80 DTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQI 139 R A +K + RS + F S S + + K AQ + + + Sbjct: 152 RERVWALAQEKAADVQARLNRSDHGIFHSRTSLQKLSTKLSAQSAQYSQQLQAARNEYLL 211 Query: 140 GA-TQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEA 198 A D+ A L +E+ + S++ + + Sbjct: 212 NLVATNAHLDHYYQEELPALLKALVSELSEHLRDPLTSLSRTELEATEVILEHAHRGEQT 271 Query: 199 KEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSK 258 +++ Q + ++P VF PP+ + +V Sbjct: 272 TSQVSWEQDLKLFLQEPGVFS---------PTPPQ---------QFQPAGTDQVCVLEWG 313 Query: 259 AIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELI 318 A G+AG L+ K + L ++RD + ++ +E + R+ ++ +E Sbjct: 314 AEGVAGKSGLE--KEVQRLTSRAARDYKIQDEVEQERRLSEARLS---QRDLSPT-AEDA 367 Query: 319 PLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 L D+ + + E Q + P A + + L + E++ Sbjct: 368 ELSDFEECEETGELFEEPAPQALATRPLPCPAHVVFRYQAGREDELTITEGEWLEIIEEG 427 Query: 379 LASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADS 431 A ++ NQ + + ++ ES DS Sbjct: 428 DADEW----------------VKARNQHGEVGFVPERYLNFPDLSLPESSQDS 464 >gi|118385975|ref|XP_001026110.1| hypothetical protein TTHERM_00971900 [Tetrahymena thermophila] gi|89307877|gb|EAS05865.1| hypothetical protein TTHERM_00971900 [Tetrahymena thermophila SB210] Length = 2536 Score = 42.9 bits (99), Expect = 0.21, Method: Composition-based stats. Identities = 35/270 (12%), Positives = 85/270 (31%), Gaps = 34/270 (12%) Query: 27 PQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQ 86 P L+++ + ++ L + + N E + Q F + Sbjct: 2241 PNEYLKNVQGDINESNATLNQVEK-------NQEDNEKQIELKQIFSKEIQASQNNHPSP 2293 Query: 87 AGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRD-YHTKGVDIQIGATQRA 145 ++ G+I+ Y + + + + Q Q E D Y+ +G + Sbjct: 2294 LYKHTQDK-QGQIKPIYGNSKTIFNQTE-SLQLQNQDEKSNADQYYVDSNKFLMGIQPKK 2351 Query: 146 QEDNL----NMTVGLAAAH---VLHDPSNE-NYFQRVQSITDHINSLPIDLRLKQKLLSE 197 ++ N + + + + ++E Y Q+ + + +LR + Sbjct: 2352 TLSDINFQQNQNAYYKSNNKKQLYIEEADEIQYTQQDEKSENLNEQNEFNLRSPHYTENS 2411 Query: 198 AKEKLNT-NQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDV----------SDSAR 246 + + I + NF + +KK +D+ D+ +S + Sbjct: 2412 TQRNQYVFDSKINLLKDN-----NFSQQNHKKDQRQQDTIIEEDIYSLQHKIKDDQESQK 2466 Query: 247 ERSLEVVEDVSKAIGLAGWDRLDDTKRRRL 276 + + + K AG LD+ + + Sbjct: 2467 GQQISIQIIEEKKNNNAGKRDLDNQSNKII 2496 >gi|291405461|ref|XP_002718960.1| PREDICTED: myosin 18A [Oryctolagus cuniculus] Length = 2038 Score = 42.5 bits (98), Expect = 0.26, Method: Composition-based stats. Identities = 54/445 (12%), Positives = 134/445 (30%), Gaps = 33/445 (7%) Query: 69 VQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIR 128 DF + D L+ + K ++ R+ S ++ Q++ + Sbjct: 1380 EVDFTKKRLQQEFEDKLEVEQQSKRQLERRLGDLQADSDES-------QRALQQLKKKCQ 1432 Query: 129 DYHTKGVDIQI---GATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLP 185 + D ++ G R E + HD + QR + + L Sbjct: 1433 RLTGELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHDEAQREKLQREKLQREKDMLLA 1492 Query: 186 IDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSA 245 LKQ+L + + Q + + + + + KD SL+ V Sbjct: 1493 EAFSLKQQLEEKDMDIAGFTQKVVSLEAELQDISS---------QESKDEASLAKVKKQL 1543 Query: 246 RERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKE----------T 295 R+ +V + + AG ++ + + RL + + ++ Sbjct: 1544 RDLEAKVKDQEEELDEQAGTIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSC 1603 Query: 296 QAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLM 355 Q + ++++ QL K L + + + D+ R K + Sbjct: 1604 QKKLKQMEVQLEEEYEDKQKVLRDKRELESKLTALSDQVSRRDLESEKRLRKDLKRTKAL 1663 Query: 356 STFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYK 415 + L + + +++ R++M+ +D ++ Sbjct: 1664 LADAQVMLDHLKNSAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEIEDLH---LQID 1720 Query: 416 QIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVD 475 I + EE + R++ ++++D K + + ++ Sbjct: 1721 DIAKTKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDLAQMNDLQA 1780 Query: 476 LIRGAYNTLSDSDKEGVRSSFAHIE 500 + N +E +++ + +E Sbjct: 1781 QLE-EANKEKQELQEKLQALQSQVE 1804 >gi|89100713|ref|ZP_01173569.1| hypothetical protein B14911_02484 [Bacillus sp. NRRL B-14911] gi|89084588|gb|EAR63733.1| hypothetical protein B14911_02484 [Bacillus sp. NRRL B-14911] Length = 520 Score = 42.1 bits (97), Expect = 0.34, Method: Composition-based stats. Identities = 63/355 (17%), Positives = 125/355 (35%), Gaps = 58/355 (16%) Query: 34 LGDALGKATQFLEGIRRDN------AFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQA 87 + A A Q LE RR+ A A +L +A Q+ D N L +++ Sbjct: 33 IAGAKNAAEQILEDARREADASKKEALLEAKDEIHKLRTEAEQEVRDRRNELQKQENRLL 92 Query: 88 GDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQE 147 +E +D + S D+ S + RD + Q H + ++ ++ RAQ+ Sbjct: 93 QK--EENLDRKDESL-DKRESILEKRDDSLNKKQQ--------HIEEMESKVDEMVRAQQ 141 Query: 148 DNLNMTVGLA---AAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKL-- 202 L GL A ++ D + + I + R K++ +AKE L Sbjct: 142 SELERISGLTREEAKSIILDRVEQEVAHDI-----AIMVKETENRAKEEADKKAKEVLSL 196 Query: 203 --------NTNQIIGAHARDP-RVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVV 253 + + + P + + + ++ + D+ ++ Sbjct: 197 AIQRCAADHVAETTVSVVNLPNDEMKGRIIGREGRNIRTLETLTGIDL----------II 246 Query: 254 EDVSKAIGLAGWDRLDDTKRRRLLEHL-------SSRDNALNTKLRKETQAQARRIDAQL 306 +D +A+ L+G+D + R LE L +R + K R+E R I Q Sbjct: 247 DDTPEAVILSGFDPIRRETARLALEKLVQDGRIHPARIEEMVDKARREVDEHIREIGEQT 306 Query: 307 N-----HGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMS 356 HG+ +++ + +YG + Q + A ++ + ++ Sbjct: 307 TFEVGVHGLHPDLIKILGRLKFRTSYGQNVLKHSMEVAQLSGLLAAELGEDETLA 361 >gi|311112058|ref|YP_003983280.1| ATP-dependent RNA helicase DeaD [Rothia dentocariosa ATCC 17931] gi|310943552|gb|ADP39846.1| ATP-dependent RNA helicase DeaD [Rothia dentocariosa ATCC 17931] Length = 695 Score = 41.8 bits (96), Expect = 0.44, Method: Composition-based stats. Identities = 57/388 (14%), Positives = 116/388 (29%), Gaps = 44/388 (11%) Query: 107 LSSISHRDVRKKFQA---QVEHDIRDYHTKGVDIQIGATQRAQEDNLN--MTVGLAAAHV 161 L + + R+K A Q + + + + GAT E+ L+ TV ++ Sbjct: 3 LECVGESERRRKACARTPQQNEEKVTENIPAENAEQGATLGNTEETLSVEKTVEESSLET 62 Query: 162 LHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSE-AKEKLNTNQIIGA----HARDPR 216 P+NE D S+ + R + ++ + + G DPR Sbjct: 63 GEKPTNEK--------ADAAESMTAEPREDNAATKKNTEDNTHESHEEGTRFTDLGLDPR 114 Query: 217 VFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRL 276 V Y+K P ++ T +VV G L R Sbjct: 115 VLSALEEVGYEKPSPIQEQTIP------LLLDGKDVVGLAQTGTGKTAAFALPALSRMAE 168 Query: 277 LEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELY 336 L + N ++ + A R + Q+ + + + G G +L Sbjct: 169 LADI----NGVSRDTQVLVLAPTRELALQVAEAFSSYATHMEDFTVLPIYGGSPYGPQLA 224 Query: 337 NLQQFKSVA--------------APDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLAST 382 L++ V + D++ ++ + +A + L+ E + Sbjct: 225 GLRRGAQVVVGTPGRVIDHLEKGSLDLSNLQYLVLDEADEMLRMGFAEDVETILEGTPDA 284 Query: 383 MMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTE--ESFADSLRQRAGFVK 440 + + S R+ Q+ D +K K + V Sbjct: 285 KQVALFSATMPNSIRKIAQQYLNDPTEVRVKTKTTTGANIRQRYMQVMHSHKLDAMTRVL 344 Query: 441 KIKDDYNLTTSHFNKTEENQLRTQLVKR 468 ++++ + K E ++ +L R Sbjct: 345 EVENYDGIIVFVRTKKETEEVADKLKAR 372 >gi|310778825|ref|YP_003967158.1| phage tail tape measure protein, TP901 family [Ilyobacter polytropus DSM 2926] gi|309748148|gb|ADO82810.1| phage tail tape measure protein, TP901 family [Ilyobacter polytropus DSM 2926] Length = 793 Score = 41.4 bits (95), Expect = 0.48, Method: Composition-based stats. Identities = 83/581 (14%), Positives = 179/581 (30%), Gaps = 79/581 (13%) Query: 96 DGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRA----QEDNLN 151 ++ D L S K +A E + + + + + Q+ +N Sbjct: 26 SKEMKKLSDEMLKLKSTNRHISKLKASEEKLEKQFASGREEYK---RQQRELFGTSKKVN 82 Query: 152 MTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKE-KLNTNQIIGA 210 S N F++ Q +H+ + K+ L ++ K N+ Sbjct: 83 DLRRAIEKTKKPSKSMINEFKKAQKAENHLKETTEN--QKKALTDMMRQMKAAKNETKRY 140 Query: 211 HARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDD 270 + ++ M+ ++ K+ +L + + + A+G+A +DD Sbjct: 141 SQSQEELAKS-MKKVEERHKKLKEYENLKKSVSNNAGGTFARSAGQAVALGVAVKFAIDD 199 Query: 271 TKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVE 330 + + + K ++E + + I SE+ + + Sbjct: 200 EEAFADVRKTTGLAGEEAKKFQRELKKATKDIPKF--------NSEIYEIAAAA----GQ 247 Query: 331 QGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKE 390 G L + QF S A V+ M A + L ++E Sbjct: 248 AGINLQEIPQFTSDTA-KVSVAFDMEAGKAGETLAT---------------------WRE 285 Query: 391 ILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTT 450 + S + ++ D ++ ++ P + + L + A F + T Sbjct: 286 AFKMSQSEVIK--LADQMNLLGDSIKVAPAQVAEIATQVGGLGRMANFTEAQTSALGGTL 343 Query: 451 SHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRL 510 + T + K Y T++ D S+ + +L Sbjct: 344 IALGVKDAGTASTAIRKL------------YTTMASGDSAS--STMSAAFQKIGIDPGQL 389 Query: 511 SSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGG 570 + + D++ A M + G+ N K + L+ + + L +T Sbjct: 390 AEDLQKDSQGALMKVFQGLN-------DLNDSEKLSVTKQLFGEEAMSSMGMLINNT--K 440 Query: 571 NFNKDKEAIKL--YLLGSMKDSGNYTLNRVRVSDAMQIVLGNTPVNINESMLMPP-RGMS 627 ++ + + GS+ + N LN + + +PP RG + Sbjct: 441 FLKENLKLVGDATKYAGSVNNEYNNKLNTTASDIKLALKATTDMAAATTRFFLPPIRGAT 500 Query: 628 K--TDFEDRLWYATKDTGEYDPYTIKYMNVGSGKYMIIKNG 666 K +F + + T+D P K M G+ ++ +K G Sbjct: 501 KGMVNFSEGITKFTEDY----PKLAKAMAFGAAGFVGLKLG 537 >gi|118350032|ref|XP_001008297.1| hypothetical protein TTHERM_00013180 [Tetrahymena thermophila] gi|89290064|gb|EAR88052.1| hypothetical protein TTHERM_00013180 [Tetrahymena thermophila SB210] Length = 1326 Score = 41.4 bits (95), Expect = 0.54, Method: Composition-based stats. Identities = 52/352 (14%), Positives = 113/352 (32%), Gaps = 38/352 (10%) Query: 70 QDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRK----KFQAQVEH 125 Q+F +FT S+ A ++ + L + R +++ K Q + Sbjct: 829 QNFQNFTVSIQNNILRNAKRQLSSIDS------INNILEKKNFRRIKRISSAKLTFQNNN 882 Query: 126 DIRDYHTKGVDIQIGAT-----QRAQEDNLNMTVGLAAAHVLHDPSNENYFQRV---QSI 177 ++ +I N+ T + ++ D + +NY +++ Q I Sbjct: 883 KLK--QLSPNSCKINFQKCPSIVCLDNQNIKDT----SNNIKQDGTQQNYIEKITSKQKI 936 Query: 178 TDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYK-KGHPPKDST 236 D N D + K+++E + N Q ++ ++ + +G K Sbjct: 937 FDTQNR--RDFKSFDKMVNELSKYQNQTQKHNYTDKNFNQLNSYEQYCNAHQGCDEKQEE 994 Query: 237 SLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQ 296 ++S S + ++ + ++ + +++ N K +E Sbjct: 995 RKRNISIDQYMSSQKQNNQAKSSLFQKIHNSVEYLNGIDNIIQKKNQNKTENDKKSEENS 1054 Query: 297 AQARRIDAQLNHGITVKPSELIPLEDYTQA--YGVEQGTELYNLQQFKSVAAPDVARIKL 354 + + LN K + +A +EQ E YN Q K L Sbjct: 1055 KKLQAFPVYLNQ--ANKNENDFQQYKFQKADSIQIEQIQEFYNTNQIKQSKE---NSAFL 1109 Query: 355 MSTFDAKK----FLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQE 402 + FL ++ + S + S + +Y K Q+ Q+ Sbjct: 1110 SPNMSYQNTARPFLNNLEAKMQSKLNKSNDKFSQSQQYANNHTKQEEQNEQQ 1161 >gi|145481369|ref|XP_001426707.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124393783|emb|CAK59309.1| unnamed protein product [Paramecium tetraurelia] Length = 1493 Score = 41.4 bits (95), Expect = 0.61, Method: Composition-based stats. Identities = 21/110 (19%), Positives = 50/110 (45%), Gaps = 3/110 (2%) Query: 37 ALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVD 96 K+T ++ ++ D + + Q H+ +LD + ++ +K +E + Sbjct: 384 KQNKSTSLIQQMKFDICTLKCKFELKQQMTISSQMIHNKAFNLDIDNFIKVENKSEELSN 443 Query: 97 GRIRSAYDR--FLSSISHRDVRK-KFQAQVEHDIRDYHTKGVDIQIGATQ 143 IR ++ FLS ++ ++++ K + QV H I+ + +I++ Q Sbjct: 444 QLIRDCQNKIKFLSLVNQNEIKRSKLEKQVNHCIQKFEKIINEIEVLYKQ 493 >gi|73969294|ref|XP_538328.2| PREDICTED: similar to SMC1 structural maintenance of chromosomes 1-like 2 [Canis familiaris] Length = 1235 Score = 41.0 bits (94), Expect = 0.68, Method: Composition-based stats. Identities = 45/273 (16%), Positives = 90/273 (32%), Gaps = 38/273 (13%) Query: 45 LEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEY--VDGRIRSA 102 LE + R+ + + E + KA + + + TR Q ++K + + R Sbjct: 242 LEHVSRNLSVTKESLSRHENTVKAKK----KEHGMLTRQLQQTEKELKSLEALLNQKRPQ 297 Query: 103 YDRFLSSISHR----DVRKKFQAQVEHDIRDYH--TKGVDIQI--------GATQRAQED 148 Y + + SH DV KK E K ++ ++ ++ +E+ Sbjct: 298 YIKAKENTSHHLKKLDVAKKSIKDSEKQCSKQEDDIKALETELVDLDGAWRSFEKQVEEE 357 Query: 149 NL---NMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLL-----SEAKE 200 L A+ + E ++V +T + L + + ++ L + Sbjct: 358 ILHKGRDIELEASQLDRYKGLKEQVRKKVAIMTQQLEKLHWEQKADKERLAFERRRHGEV 417 Query: 201 KLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDST----SLSDVSDSARERSLEVVED- 255 + N QI + E K+ +L ++ + R EV E+ Sbjct: 418 QENLKQIKEQIEDHKKRIEKLEEYTKTCMDCLKEKKQQEETLVGEIENTKSRMSEVNEEL 477 Query: 256 -----VSKAIGLAGWDRLDDTKRRRLLEHLSSR 283 + G+ + KR +LEHL Sbjct: 478 NLIRSELQNAGIDSHEGTRQQKRAEVLEHLKRL 510 >gi|332025407|gb|EGI65574.1| hypothetical protein G5I_05964 [Acromyrmex echinatior] Length = 977 Score = 41.0 bits (94), Expect = 0.75, Method: Composition-based stats. Identities = 74/503 (14%), Positives = 156/503 (31%), Gaps = 64/503 (12%) Query: 9 PLVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSF-- 66 P E D + V+P+ L+D+ ++ K + +N F + +T S Sbjct: 468 PAFEDEPVVNDLIDLPVSPERNLKDIKNSDTKQSPASVDSAEENIFKEMESPFTNSSVVK 527 Query: 67 -------------------KAVQDFHDFTNSLD---------------TRDSLQAGDKIK 92 KA + D N LD T + Q+ KI Sbjct: 528 QNITSLLLDLRKLVKESNPKAKKLLDDLENILDINYKNNTELLVTCFNTSNKSQSSLKIS 587 Query: 93 EYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDI-RDYHTKGVDIQIGATQRAQEDNLN 151 + +RF S + ++ E+++ +Y K Q + QE + N Sbjct: 588 S-------ESIERFNKSSTVVRLKNTALNSTENEMQANYIEKQNIQQTTPVRNRQESSRN 640 Query: 152 MTVGLAAAHVLHDPSNENYFQRV-QSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGA 210 + + ++H S E+ +++ + I S P + + E N Sbjct: 641 VHSSTLSTKMVHRRSLESKSKKIDRRSISAIESSPKN--TQVCRRKNTSESENR---QKR 695 Query: 211 HARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDD 270 + DP N + SDS +E+S+ + + +K R D Sbjct: 696 FSSDPGFINNLSNKKL-MPEAYNSRKAEVAKSDSGKEKSIAISDVKNKLK-----KRSDV 749 Query: 271 TKRRRLLEHLSSRDNALNTKLRKETQAQARRIDA--QLNHGITVKPSELIPLEDYT-QAY 327 +R ++ + DN + QA + +I + + I + + T Y Sbjct: 750 INKRGPMKAVHPLDNIQKRRASFGRQAPSSQITTPPKSDKAIPSDNKIISSTPNSTDNYY 809 Query: 328 GVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATK 387 ++ ++ + + + + L+S + I L+ + K Sbjct: 810 MPKKPSKSKPIAS-STPDGQNSKSVPLISANKKRNLSCDISPVTTHVNMLNSDEQKNSPK 868 Query: 388 YKEILEKSHRQSMQELNQ-DAISWGI---KYKQIDPLRFDTEESFADSLRQRAGFVKKIK 443 L + + Q D I + ++ + + + SL + + Sbjct: 869 RMSKLPTPKKCTTPTRQQTDGIPRFLTPPRHYSFNTNNLQSPQRLNKSLISFQRYSPANE 928 Query: 444 DDYNLTTSHFNKTEENQLRTQLV 466 + T E N++ ++ Sbjct: 929 KNVGKTIQQSPLKETNRITPKVK 951 >gi|317126334|ref|YP_004100446.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315590422|gb|ADU49719.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 685 Score = 40.6 bits (93), Expect = 0.85, Method: Composition-based stats. Identities = 50/352 (14%), Positives = 113/352 (32%), Gaps = 40/352 (11%) Query: 10 LVEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATAN----------- 58 L EA+ + V +P+ +R+ G+A Q L+ + R A Sbjct: 22 LAEALEAIGEEVMSGRSPERAMREFLRRGGRAQQGLDDLARRVAERRREILQRHHLDGTM 81 Query: 59 ---TRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDV 115 + + A + D R + + + + D S R+ Sbjct: 82 QEVRELLDQAVLAERKQLARDLDDDARFAEMRIGNLPQSTAAAVSELSDYPWRSSDAREA 141 Query: 116 RKKFQAQVEHDIRDYHTKGVDIQI-GAT---QRAQEDNLNMTVGLAAAHVLHDPSNENYF 171 ++ + + ++ D G+ + GAT ++ + L+ L AH +P+++ + Sbjct: 142 YEQIKDLLGRELLDQRFAGLKQALEGATDEDRKRINEMLDDLNNLLEAHRSGEPTDDLFR 201 Query: 172 QRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHP 231 + H + P + + +L+ ++ Q + ++ P M + Sbjct: 202 D---FMDKHGDFFPENPQSIDELMDALAQRSAAAQRM-LNSMTPEQRAELMALSQQAFGS 257 Query: 232 PKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKL 291 P L + D+ R+L ED + G L ++ L+ D+ Sbjct: 258 PDLVNQLGRLDDN--LRALRPGEDWDGSERFEGDQGLGLGDGTGAVQDLAELDDLAEALA 315 Query: 292 RKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKS 343 + A+ + + L+ + G E + L + + Sbjct: 316 QSYAGARL----------------DDVDLDQLARQLGPEAAVDARTLAELER 351 >gi|228471145|ref|ZP_04055964.1| conserved hypothetical protein [Porphyromonas uenonis 60-3] gi|228307085|gb|EEK16159.1| conserved hypothetical protein [Porphyromonas uenonis 60-3] Length = 544 Score = 40.2 bits (92), Expect = 1.1, Method: Composition-based stats. Identities = 48/326 (14%), Positives = 98/326 (30%), Gaps = 15/326 (4%) Query: 31 LRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDK 90 LR+ + L KA Q E +++ A +L + + D T + +R + Sbjct: 61 LRESEEILEKAKQDAEILKQQKVIE-AKEYALQLKSELEERLADRTAQVQSR--ETTLQQ 117 Query: 91 IKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNL 150 +++ + R + D + +K Q E + + I Q E+ Sbjct: 118 LEQTLTARTKQLEDASTEL----ERERKQMTQQEQLVAQKQEELTQ-AIRTQQAKLEEIG 172 Query: 151 NMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLL------SEAKEKLNT 204 ++ A ++ E Q + I ++ + + L A E Sbjct: 173 GLSAAQAKEQLIESLRLEARDQAAAYTAEVIEEAKMNASQEARRLIVATIQRIATETAVE 232 Query: 205 NQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARER-SLEVVEDVSKAIGLA 263 N + H + V + + + +T + + D E L + V + I Sbjct: 233 NSVSVFHIDNDEVKGRIIGREGRNIRALEAATGVEIIVDDTPEAIVLSAFDPVRREIARL 292 Query: 264 GWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDY 323 +L R D + + R I HG+ + L+ Y Sbjct: 293 ALHQLVQDGRIHPARIEEVVDRVSKQIEEEIIETGKRTIIDLGIHGMHPELVRLVGKMKY 352 Query: 324 TQAYGVEQGTELYNLQQFKSVAAPDV 349 +YG + A ++ Sbjct: 353 RSSYGQNLLQHARETANLCATMASEL 378 >gi|332299880|ref|YP_004441801.1| 2,3 cyclic-nucleotide 2-phosphodiesterase [Porphyromonas asaccharolytica DSM 20707] gi|332176943|gb|AEE12633.1| 2,3 cyclic-nucleotide 2-phosphodiesterase [Porphyromonas asaccharolytica DSM 20707] Length = 544 Score = 40.2 bits (92), Expect = 1.1, Method: Composition-based stats. Identities = 48/326 (14%), Positives = 100/326 (30%), Gaps = 15/326 (4%) Query: 31 LRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDK 90 LR+ L KA Q E +++ A +L + + D T + +R + Sbjct: 61 LREADSILEKAQQDAEILKQQKMIE-AKEYALQLKGELEERLADRTAQVQSR--ETTLQQ 117 Query: 91 IKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNL 150 +++ + R + D S + H +K AQ E + + + + +E Sbjct: 118 LEQTLTARTKQLEDA-SSDLEH---ERKQMAQQEQLVAQKQEELNAAILSQQAKLEEIG- 172 Query: 151 NMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLL------SEAKEKLNT 204 ++ A ++ E Q + I ++ + + L A E Sbjct: 173 GLSATQAREQLIESLRLEARDQAAAYTAEVIEEAKMNASQEARRLIVATIQRIATETSVE 232 Query: 205 NQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARER-SLEVVEDVSKAIGLA 263 N + H + V + + + +T + + D E L + V + + Sbjct: 233 NSVSVFHIDNDEVKGRIIGREGRNIRALEAATGVEIIVDDTPEAIVLSAFDPVRREVARL 292 Query: 264 GWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDY 323 +L R D + + R I HG+ + L+ Y Sbjct: 293 ALHQLVQDGRIHPARIEEVVDRVSKQIEEEIIETGKRTIIDLGIHGMHPELVRLVGKMKY 352 Query: 324 TQAYGVEQGTELYNLQQFKSVAAPDV 349 +YG + A ++ Sbjct: 353 RSSYGQNLLQHARETANLCATMASEL 378 >gi|311256115|ref|XP_003126510.1| PREDICTED: activating transcription factor 7-interacting protein 1-like [Sus scrofa] Length = 1134 Score = 40.2 bits (92), Expect = 1.2, Method: Composition-based stats. Identities = 40/291 (13%), Positives = 95/291 (32%), Gaps = 18/291 (6%) Query: 137 IQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLS 196 ++I +++ QE+N + + +V+ D NEN + TD + + I L++ + Sbjct: 255 MEIDQSEKNQEENSSDLAETISENVMKDDKNENILEN----TDSMETDEIIPILEKLAPA 310 Query: 197 EAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDV 256 E + + + E M + D S+S + + V+ D Sbjct: 311 EDELTCFSKASLLPIDETNTDLEEKMEGSFGSPSK-------QDSSESLPKEAFLVLSDE 363 Query: 257 SKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSE 316 G + E + S + + + + R ++ + K S+ Sbjct: 364 EDVSGEKDESEVISQNETCSPEEVESTEKDSKAEEEERVIHEDER-PSEKSEFFRRKRSK 422 Query: 317 LIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVAR-IKLMSTFDAKKFLQKIDDEYISNP 375 +++ E Q K A D+ + ++ + + ++ L + Sbjct: 423 SEDMDNVQSKRRRYMEEEYEAEFQVKITAKGDINQKLQKVVQWLLEEKLCALQCAVFDKT 482 Query: 376 SLSLA---STMMATKYKEILEKSHRQSMQELNQD--AISWGIKYKQIDPLR 421 L + K + + + + L + A +K +Q +P Sbjct: 483 LAELKTRVEKIECNKRHKTVLTELQAKIARLTKRFGAAKEDLKKRQENPPN 533 >gi|169824265|ref|YP_001691876.1| ATP-dependent DNA helicase [Finegoldia magna ATCC 29328] gi|167831070|dbj|BAG07986.1| ATP-dependent DNA helicase [Finegoldia magna ATCC 29328] Length = 725 Score = 39.8 bits (91), Expect = 1.6, Method: Composition-based stats. Identities = 70/446 (15%), Positives = 148/446 (33%), Gaps = 28/446 (6%) Query: 39 GKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGR 98 G Q ++ +D F A E ++++ Q D N++ +SL+ + + Sbjct: 257 GANIQNIQNFEKD--FKNAKVILLEQNYRSTQPILDLANTVIKNNSLRKDKNLWTAKNEG 314 Query: 99 IRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNM------ 152 Y R S I D ++ Q + Y + + A R E+ LN Sbjct: 315 DMPIYRRMYSDIDEADQVVQWIEQERYRENPYEEMAILYRTNAQSRLFEERLNRLGIPNR 374 Query: 153 TVGLAAAHVLHDPSN-ENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAH 211 VG + + + +Y + V+++ D++ I K+ + +KL + Sbjct: 375 VVGGLKFYDRKEIKDCVSYLRIVENLNDNMALNRIINEPKRGIGKTTMDKLLQCSQNRSI 434 Query: 212 ARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDT 271 + + + + F K + S + + V E + + G+ Sbjct: 435 SMMQYIIDTDLPDF--SNATAKKLQEFYYLIKSFPKDQMNVAELMEYILDKTGY------ 486 Query: 272 KRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQ 331 R LE ++RD+ + E + + + E L T + Sbjct: 487 --RANLEKSTNRDDKTRLENIDEYVSSLYQFVQDNPDKNLREYLETSSL--MTDLDKTDD 542 Query: 332 GTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEI 391 T+ ++ + + + + + ++D++ L + A K I Sbjct: 543 NTKGVSMMTIHAAKGLEFDVVF-FTGLEEGTIPSRVDEDVEEERRLCYVAITRARKKLYI 601 Query: 392 LEKSHRQSMQELNQDAISWGIKYKQIDPL-RFDTEESFADSLRQRAGFVKKIKDDYNLTT 450 + QS + N+ + + I+ + +ES + R F K+ DDY Sbjct: 602 ---TSVQSRRRFNE--FQTKAESRFIEEMENKYKDESPNKEVSFRESFNTKVVDDYRTEA 656 Query: 451 SHFNKTEENQLRTQLVKRPASESVDL 476 S ++N + K A + V Sbjct: 657 SASITKKQNNVTKNNEKYRAGDKVSH 682 >gi|115698855|ref|XP_781651.2| PREDICTED: similar to myosin tail domain-containing protein [Strongylocentrotus purpuratus] gi|115976751|ref|XP_001183515.1| PREDICTED: similar to myosin tail domain-containing protein [Strongylocentrotus purpuratus] Length = 805 Score = 39.8 bits (91), Expect = 1.8, Method: Composition-based stats. Identities = 48/333 (14%), Positives = 106/333 (31%), Gaps = 20/333 (6%) Query: 272 KRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQ 331 +R E + + QAQ ++ AQL G K + L Q+ E+ Sbjct: 6 RRLNESERDKLELTSAFNSENSDLQAQIAKLRAQLEKGEADKQNVEYELAVARQSVSREK 65 Query: 332 GTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKI--DDEYISNPSLSLASTMMATKYK 389 QQ K + ++ +S + +Q + E ++ + K Sbjct: 66 IGASEKEQQLKGIITSQSEKLSSLSAQVQELQIQASQHEREMVNIQNKHHREMDEQEKRL 125 Query: 390 E-------ILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKI 442 +L+ + + Q ++ + +++ LR + + +Q Sbjct: 126 ASVIAERDVLQTAKDEIEQVFHEQESLYSDAQEKMTELRIERDTQTQTVRQQLHDLQGAA 185 Query: 443 KDDYNLTTSHF-NKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIED 501 + D T K E +++ ++ K ++ + D EG S + Sbjct: 186 ERDATTRTELVAAKEELKEMQERIEKERSAHTESKFNFEIVQCQKRDLEGALSVEKSARE 245 Query: 502 N------GLSAVVRLSSEFSDDAKNAAMVILSGMKHQKDTETRYNTDHKSNKFDSLYDSY 555 +S +RL++E+ + I K + + T D F+ L D Sbjct: 246 QAIYNMDAMSKQIRLTAEYETTKNALSAEI--DQKREVLSGTSQQIDLYQKNFNGLKDEL 303 Query: 556 INTPLTKLEQSTAGGNFNKDKEAI--KLYLLGS 586 ++ G ++ E + +++G Sbjct: 304 TRAKKRQIYLEETYGGCMRELELLLSNFHVMGE 336 >gi|313886984|ref|ZP_07820684.1| YmdA/YtgF family protein [Porphyromonas asaccharolytica PR426713P-I] gi|312923510|gb|EFR34319.1| YmdA/YtgF family protein [Porphyromonas asaccharolytica PR426713P-I] Length = 544 Score = 39.8 bits (91), Expect = 1.8, Method: Composition-based stats. Identities = 47/326 (14%), Positives = 100/326 (30%), Gaps = 15/326 (4%) Query: 31 LRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDK 90 LR+ L KA Q E +++ A +L + + D T + +R + Sbjct: 61 LREADSILEKAQQDAEILKQQKMIE-AKEYALQLKGELEERLADRTAQVQSR--ETTLQQ 117 Query: 91 IKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNL 150 +++ + R + D + S + +K AQ E + + + + +E Sbjct: 118 LEQTLTARTKQLED----ASSDLERERKQMAQQEQLVAQKQEELNAAILSQQAKLEEIG- 172 Query: 151 NMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLL------SEAKEKLNT 204 ++ A ++ E Q + I ++ + + L A E Sbjct: 173 GLSATQAREQLIESLRLEARDQAAAYTAEVIEEAKMNASQEARRLIVATIQRIATETSVE 232 Query: 205 NQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARER-SLEVVEDVSKAIGLA 263 N + H + V + + + +T + + D E L + V + + Sbjct: 233 NSVSVFHIDNDEVKGRIIGREGRNIRALEAATGVEIIVDDTPEAIVLSAFDPVRREVARL 292 Query: 264 GWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDY 323 +L R D + + R I HG+ + L+ Y Sbjct: 293 ALHQLVQDGRIHPARIEEVVDRVSKQIEEEIIETGKRTIIDLGIHGMHPELVRLVGKMKY 352 Query: 324 TQAYGVEQGTELYNLQQFKSVAAPDV 349 +YG + A ++ Sbjct: 353 RSSYGQNLLQHARETANLCATMASEL 378 >gi|85859460|ref|YP_461662.1| membrane bound protein mediates protein-protein interactions [Syntrophus aciditrophicus SB] gi|85722551|gb|ABC77494.1| membrane bound protein mediates protein-protein interactions [Syntrophus aciditrophicus SB] Length = 225 Score = 39.4 bits (90), Expect = 1.9, Method: Composition-based stats. Identities = 12/91 (13%), Positives = 34/91 (37%) Query: 33 DLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIK 92 D+ A+ +F+ +N F + +++ +D+ +L + ++AG Sbjct: 118 DIDSAIKSYEKFIANASDENEFKVLSYSSLAYCYESKKDYQKALKALQQAEKVEAGKYFG 177 Query: 93 EYVDGRIRSAYDRFLSSISHRDVRKKFQAQV 123 ++ + Y++ + +K Q Sbjct: 178 TFLYRDMGRLYEKMQNRSEALKFYRKALEQS 208 >gi|323303756|gb|EGA57542.1| Sir3p [Saccharomyces cerevisiae FostersB] Length = 978 Score = 39.4 bits (90), Expect = 1.9, Method: Composition-based stats. Identities = 57/363 (15%), Positives = 111/363 (30%), Gaps = 37/363 (10%) Query: 146 QEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDH---INSLPIDLRLKQKLLSEAKEKL 202 QE++++ + A+ L P++ + + +SI+ +PI+ ++ E Sbjct: 277 QEESVSPNISSASPSALTSPTDSSKILQKRSISKELIVSEEIPINSSEQESDYEPNNETS 336 Query: 203 NTNQIIGAHARDP-----RVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVS 257 + G+ ENF+ A D + + + E S E + V+ Sbjct: 337 VLSSKPGSKPEKTSTELVDGRENFVYA--NNPEVSDDGGLEEETDEVSSESSDEAIIPVN 394 Query: 258 KAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSEL 317 K G G L R+ ++ T+ E + + N I Sbjct: 395 KRRGAHG-SELSSKIRKIHIQETQEFSQNYTTETDNEMNGNGKPGIPRXNTKIHSMNENP 453 Query: 318 IPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSL 377 P + G + + L + K + R + L K+ DE S Sbjct: 454 TPEK------GNAKMIDFATLSKLKKKYQIILDRFAPDNQVTDSSQLNKLTDEQSSLDVA 507 Query: 378 SLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAG 437 L K + N D + EES +SL++R Sbjct: 508 GLEDKF----RKACSSSGRETILSNFNAD---------------INLEESIRESLQKREL 548 Query: 438 FVKKIKDDYNLTTSHFNKTEENQLRT-QLVKRPASESVDLIRGAYNTLSDSDKEGVRSSF 496 +++D + ++ +Q + + S L+ + L S F Sbjct: 549 LKSQVEDFTRIFLPIYDSLMSSQNKLFYITNADDSTKFQLVNDVMDELITSSARKELPIF 608 Query: 497 AHI 499 +I Sbjct: 609 DYI 611 >gi|309360073|emb|CAP31915.2| CBR-AKA-1 protein [Caenorhabditis briggsae AF16] Length = 1337 Score = 39.4 bits (90), Expect = 2.2, Method: Composition-based stats. Identities = 40/326 (12%), Positives = 92/326 (28%), Gaps = 35/326 (10%) Query: 109 SISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNE 168 S+S + + + + RD+ + D I + MT L + +DP+ + Sbjct: 241 SLSEPETQNNDTSTRKAGGRDFSDESSDESILFQYADKISEEVMTAALGTRKLHYDPAVQ 300 Query: 169 NYFQRVQSITDHINSLPIDLRLKQKLL--SEAKEKLNTNQIIGAHARDPRVFENFMRAFY 226 ++ SI D ++ L + + T + +V + Sbjct: 301 EFYHANDSIPDVFAMSFASTTSRKSELNNDTSNQLSGTENKQKLPSVVEQVEQAEESEKE 360 Query: 227 KKGHPPKDSTSLS-----DVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEH-- 279 + +D++ D DS+ + V + L + + +E Sbjct: 361 NEPDEQQDNSESDCSLKHDKEDSSPQN---VESETVHMTSLEQYPEVSLLTDVISVEEEI 417 Query: 280 ----------LSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGV 329 + S+D + I + N + K SE + + Sbjct: 418 AASDISGDCSMDSKDTEQPVVEQPSINDSEDNIATEANTPVVEKNSEEKVTSETVDSMPS 477 Query: 330 EQGTEL-YNLQQFKSVAAPDV----------ARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 + + + AP + ++ +S + +DE + P Sbjct: 478 DDDESAPETVVEAVGFKAPTTDEEENREDADSIVESVSNDSQDSLVGTTEDETENRPRFD 537 Query: 379 LASTMMATKYKEILEKSHRQSMQELN 404 S++ + Q +E Sbjct: 538 --SSIATVHVLHESDSDESQPRRERR 561 >gi|254503648|ref|ZP_05115799.1| Sel1 repeat family [Labrenzia alexandrii DFL-11] gi|222439719|gb|EEE46398.1| Sel1 repeat family [Labrenzia alexandrii DFL-11] Length = 1338 Score = 39.4 bits (90), Expect = 2.2, Method: Composition-based stats. Identities = 79/512 (15%), Positives = 159/512 (31%), Gaps = 47/512 (9%) Query: 26 NPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSL 85 P AG+ D+ AL + Q ++G+ D +A R S +A + D + D Sbjct: 86 APDAGIDDVMGALDRLDQQVQGLAGDEDYADEYERAPSRSHRAGRYAIDALDESDAYGDD 145 Query: 86 QAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRA 145 Y D Y R R+ A + Y G +I A ++ Sbjct: 146 DGYYNDDPYADAD----YGDEDDVYYQRAPRRSSIASRNASMHAYKDLGR--RIDALRKP 199 Query: 146 QED---NLNMTVGLA--AAHVLHDPSNENYFQRV---QSITDHINSLPIDLRLKQKLLSE 197 QE ++ +G A + +NE ++ + + D ++ L D + Q Sbjct: 200 QEQALAHVREELGSLRDALGGISRGTNETVNRQNTELRRLADMVDRLRNDKKNDQLAKEI 259 Query: 198 AKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPK--DSTSLSDVSDSARERSLEVVED 255 KE ++G V + H + D S + V D RS+ V + Sbjct: 260 RKEVAELKSLVG----RTNVEGALQTLEHGYAHILQRLDELSRASV-DPRVLRSVTVRLN 314 Query: 256 VSKAIGLAGWDRLDDTKRRRLLEH----LSSRDNA-LNTKLRKETQ---AQARRIDAQLN 307 + + L ++ +LE ++ R L K E + A+ R + + Sbjct: 315 EIE----DAFAALPRSEHMMVLEDRVVSIAERMEELLQRKDHAEIEPLRAELREVRGFVQ 370 Query: 308 HGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKI 367 + L+ D + + +L L + + ++ ++ + + ++ Sbjct: 371 Q---IDVKGLVESIDDRMKFVSNRLDDLETLAKEQRGLDNRLSAMEQRMPE--PETITRL 425 Query: 368 DDEYISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEES 427 MMA ++ H + D + + +Q P++ + Sbjct: 426 QGRLEDIVG------MMADDRSAPADQEHLAQLDHRLNDIVDRLERMEQAKPIQSGDPGA 479 Query: 428 FADSLRQRAGFVKKIKD--DYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLS 485 F D+L R + D + Q + ++ D + L Sbjct: 480 F-DALGTRLEAISTKIDAIEKRAARPVIAPVAAAQSASGADTEFFAQIQDRLNDLTTRLD 538 Query: 486 DSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDD 517 S + + R++ S D Sbjct: 539 QPKDTVTTSDLDKLRSEITAMRERVAQPASTD 570 >gi|300742137|ref|ZP_07072158.1| ATP-dependent RNA helicase DeaD [Rothia dentocariosa M567] gi|300381322|gb|EFJ77884.1| ATP-dependent RNA helicase DeaD [Rothia dentocariosa M567] Length = 719 Score = 39.4 bits (90), Expect = 2.4, Method: Composition-based stats. Identities = 59/400 (14%), Positives = 120/400 (30%), Gaps = 48/400 (12%) Query: 96 DGRIRSAYDRFLSSISHRDVRKKFQA---QVEHDIRDYHTKGVDIQIGATQRAQEDNLN- 151 D + R + L + + R+K A Q + + + + GAT E+ L+ Sbjct: 18 DNKARRMW---LECVGESERRRKACARTPQQNEEKVTENIPAENTEQGATLGNTEETLSV 74 Query: 152 -MTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEK----LNTNQ 206 TV ++ P+NE D S+ + R + E + + Sbjct: 75 EKTVEESSLETGEKPANEK--------ADAAESMTAEPREDNAATKKNTEDNTHESHEEE 126 Query: 207 IIGA--HARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAG 264 DPRV Y+K P ++ T + + +VV G Sbjct: 127 GTRFTDLGLDPRVLSALEEVGYEKPSPIQEQTIPPLLDGN------DVVGLAQTGTGKTA 180 Query: 265 WDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYT 324 L R L + N ++ + A R + Q+ + + + Sbjct: 181 AFALPALSRMAELADI----NGVSRDTQVLVLAPTRELALQVAEAFSSYATHMEDFTVLP 236 Query: 325 QAYGVEQGTELYNLQQFKSVA--------------APDVARIKLMSTFDAKKFLQKIDDE 370 G G +L L++ V + D++ ++ + +A + L+ E Sbjct: 237 IYGGSPYGPQLAGLRRGAQVVVGTPGRVIDHLEKGSLDLSNLQYLVLDEADEMLRMGFAE 296 Query: 371 YISNPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTE--ESF 428 + + + S R+ Q+ D +K K + Sbjct: 297 DVETILEGTPDAKQVALFSATMPNSIRKIAQQYLNDPTEVRVKTKTTTGANIRQRYMQVM 356 Query: 429 ADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKR 468 V ++++ + K E ++ +L R Sbjct: 357 HSHKLDAMTRVLEVENYDGIIVFVRTKKETEEVADKLKAR 396 >gi|315121929|ref|YP_004062418.1| hypothetical protein CKC_00895 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122891|ref|YP_004063380.1| hypothetical protein CKC_05735 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495331|gb|ADR51930.1| hypothetical protein CKC_00895 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496293|gb|ADR52892.1| hypothetical protein CKC_05735 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 129 Score = 39.1 bits (89), Expect = 2.8, Method: Composition-based stats. Identities = 27/76 (35%), Positives = 38/76 (50%) Query: 12 EAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQD 71 E + PNQDAVRD+V+P GL+ + L Q + R ++ A A T+ S D Sbjct: 12 EPLAPNQDAVRDAVDPLGGLKTVAQGLDSLNQLYQERRSEHIKAQALTQDALHSADDHAD 71 Query: 72 FHDFTNSLDTRDSLQA 87 F + SL T +QA Sbjct: 72 FARYMTSLPTGVDVQA 87 >gi|296005375|ref|XP_002809012.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum 3D7] gi|225631952|emb|CAX64293.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum 3D7] Length = 1069 Score = 39.1 bits (89), Expect = 2.8, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 63/182 (34%), Gaps = 24/182 (13%) Query: 84 SLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQ 143 + +K+Y +I+S D F I +++K + EH I + I + Sbjct: 158 TEYMNKNLKDY-SDKIKSLDDEFNDEIPFENIQKFNDEKSEHTIEMNRERDDIYDINNNK 216 Query: 144 RAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPI--DLRLKQKL-LSEAKE 200 ++N N+ + + NEN +D I ++ K++ + + Sbjct: 217 NDSKENNNINSRCITNNTKY---NENIIMNTSYHSDKNILESIYNNVVNKKERDNDKTYK 273 Query: 201 KLNTNQII----GAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDV 256 ++ N++I +P ++ + ++ E + + D+ Sbjct: 274 FIDKNKVINNVNDIENNNPDIY-------------VNSANCPDTKENNTNEDNHMINIDI 320 Query: 257 SK 258 +K Sbjct: 321 NK 322 >gi|323394609|gb|ADX61736.1| erythrocyte membrane protein 1 [Plasmodium falciparum] Length = 192 Score = 39.1 bits (89), Expect = 3.0, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 34/93 (36%), Gaps = 5/93 (5%) Query: 490 EGVRSSFAHIEDNGLSAVVRLSSEFSDDAKNAA-----MVILSGMKHQKDTETRYNTDHK 544 EGV S H + + +L +E + + + G + E + D+ Sbjct: 33 EGVLISADHGKHQLTNEGSQLCTELARSFADIGDIIRGKDLFRGYNQKDRNEKKQLQDNL 92 Query: 545 SNKFDSLYDSYINTPLTKLEQSTAGGNFNKDKE 577 + F ++++ +T Q NF K +E Sbjct: 93 KDIFKKIHNNLQDTEARDRYQDDEDKNFFKLRE 125 >gi|120436226|ref|YP_861912.1| phosphodiesterase [Gramella forsetii KT0803] gi|205831622|sp|A0M2K0|CNPD_GRAFK RecName: Full=2',3'-cyclic-nucleotide 2'-phosphodiesterase gi|117578376|emb|CAL66845.1| HD superfamily hydrolase [Gramella forsetii KT0803] Length = 520 Score = 39.1 bits (89), Expect = 3.0, Method: Composition-based stats. Identities = 45/348 (12%), Positives = 109/348 (31%), Gaps = 23/348 (6%) Query: 13 AVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDF 72 A+ + + S + ++ G L +A E I++D A ++ EL + + Sbjct: 19 AIAKMLEKKQASGTIASAKKEAGSILKEAKAEGESIKKDKILQ-AKEKFIELKSEHEKVI 77 Query: 73 HDFTNSLDTRDSLQAGDKIKEYVDGRI---RSAYDRFLSSISHRDVRKKFQAQVEHDIRD 129 ++ D+ + + +V + + ++ D R F + + DI Sbjct: 78 LSRDKKIN--DAEKRIKDKESHVSNELGKNKKLNKDLEEKVADYDHRLDFLEKKQEDIDK 135 Query: 130 YH-TKGVDIQIGATQRAQE------DNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHIN 182 H +K +++ + A++ ++L T A ++ D E Q I Sbjct: 136 LHNSKVQQLEVISGLSAEDAKAQLIESLKDTAKADAMSIIQDTVEEAKLTAQQEARKIII 195 Query: 183 SLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVS 242 + + E+ N + + V + + + +T + + Sbjct: 196 NT---------IQRIGTEEAIENCVSVFNLESDDVKGRIIGREGRNIRALEAATGVEIIV 246 Query: 243 DSARERSL-EVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARR 301 D E + + V + + +L R + R Sbjct: 247 DDTPEAIILSCFDSVRREVARLSLHKLVTDGRIHPARIEEVVKKTRKQIEEEIIDIGKRT 306 Query: 302 IDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDV 349 + HG+ + +++ Y +YG + + V A ++ Sbjct: 307 VIDLGIHGLQPELIKMVGRMKYRSSYGQNLLQHSREVAKLCGVMAAEL 354 >gi|268530936|ref|XP_002630594.1| C. briggsae CBR-AKA-1 protein [Caenorhabditis briggsae] Length = 1305 Score = 38.7 bits (88), Expect = 3.3, Method: Composition-based stats. Identities = 40/326 (12%), Positives = 92/326 (28%), Gaps = 35/326 (10%) Query: 109 SISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNE 168 S+S + + + + RD+ + D I + MT L + +DP+ + Sbjct: 225 SLSEPETQNNDTSTRKAGGRDFSDESSDESILFQYADKISEEVMTAALGTRKLHYDPAVQ 284 Query: 169 NYFQRVQSITDHINSLPIDLRLKQKLL--SEAKEKLNTNQIIGAHARDPRVFENFMRAFY 226 ++ SI D ++ L + + T + +V + Sbjct: 285 EFYHANDSIPDVFAMSFASTTSRKSELNNDTSNQLSGTENKQKLPSVVEQVEQAEESEKE 344 Query: 227 KKGHPPKDSTSLS-----DVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEH-- 279 + +D++ D DS+ + V + L + + +E Sbjct: 345 NEPDEQQDNSESDCSLKHDKEDSSPQN---VESETVHMTSLEQYPEVSLLTDVISVEEEI 401 Query: 280 ----------LSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGV 329 + S+D + I + N + K SE + + Sbjct: 402 AASDISGDCSMDSKDTEQPVVEQPSINDSEDNIATEANTPVVEKNSEEKVTSETVDSMPS 461 Query: 330 EQGTEL-YNLQQFKSVAAPDV----------ARIKLMSTFDAKKFLQKIDDEYISNPSLS 378 + + + AP + ++ +S + +DE + P Sbjct: 462 DDDESAPETVVEAVGFKAPTTDEEENREDADSIVESVSNDSQDSLVGTTEDETENRP--R 519 Query: 379 LASTMMATKYKEILEKSHRQSMQELN 404 S++ + Q +E Sbjct: 520 FDSSIATVHVLHESDSDESQPRRERR 545 >gi|47215513|emb|CAG01175.1| unnamed protein product [Tetraodon nigroviridis] Length = 1504 Score = 38.7 bits (88), Expect = 3.4, Method: Composition-based stats. Identities = 57/383 (14%), Positives = 131/383 (34%), Gaps = 33/383 (8%) Query: 126 DIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHIN--S 183 +I + ++ + G ED + L + + N+N RV+++ + I+ Sbjct: 404 EIARLQMELMEYKTGKRL-VGEDGMEGMNDLVHENSMLQTENDNLRVRVKAMQETIDAQR 462 Query: 184 LPIDLRLKQKLLSEAKEKLNTNQIIGAHARD-PRVFENFMRAFY-----KKGHPPKDSTS 237 + L + + N+ IG ++ + E + G + Sbjct: 463 ARLTQILSDQAYQAMAKAGEGNEEIGNMIQNYIKEIEELSVNREEAPDNESGAKQRRRGR 522 Query: 238 LSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDT---KRRRLLEHLSSRDNA--LNTKLR 292 + SD E ED G + D ++ + L++ + KL Sbjct: 523 GGEGSDHDEGEDAEAEEDEYDLAGDETSEDSDSEELDEKEDVQADLANITCEIAIKQKLI 582 Query: 293 KETQAQARRIDAQLNHGITVKPSE-LIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDVAR 351 E + RR+ H + + + L+ L++ + +E+ L+N+ +S + Sbjct: 583 DELENSQRRL-----HTLKQQYEQKLMMLQNKIRDTQLERDKVLHNIGSVESGMEEKAKK 637 Query: 352 IKLMSTFDAKKFLQKIDDEYISNPSL--SLASTMMATKYKEILEKSHRQSMQELNQDAIS 409 I+ + ++ L ++ E S A + E K + + E+ + ++ Sbjct: 638 IR----VEYERKLSSMNKELQKLQSAQREHARLLKNQSQYEKQLKKLQMDVTEMKKTKVT 693 Query: 410 WGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRP 469 + +R + + A R+ +K D + E + + +L+ R Sbjct: 694 ------LMRQMREQQDRNRAAECRRNREIA-SLKKDQRRAEYQLKQMEAQKRQQELMLRR 746 Query: 470 ASESVDLIRGAYNTLSDSDKEGV 492 +E V +R +S + V Sbjct: 747 KNEEVTALRRQVRPVSGTVSRKV 769 >gi|312211071|emb|CBX91157.1| similar to nuclear matrix protein [Leptosphaeria maculans] Length = 696 Score = 38.7 bits (88), Expect = 3.9, Method: Composition-based stats. Identities = 39/309 (12%), Positives = 89/309 (28%), Gaps = 44/309 (14%) Query: 316 ELIPLEDYT--QAYGVEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYIS 373 + PL D + G + + F + + + D K + E S Sbjct: 196 QSFPLGDKSSVNLRGNFHVENVTTFEDFLKES----NEVDDLMEVDHDKTVPTATGESAS 251 Query: 374 NPSLSLASTMMATKYKEILEKSHRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLR 433 + + T +T K L+ + W +++ +P R E++F Sbjct: 252 EHEVLSSETTKSTDAKSTLDFDALY--------PVFWSLQHSFSNPPRLFEEDNFKQF-- 301 Query: 434 QRAGFVKKIKDDYNLTTSHFNKTEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVR 493 R + + + E + +L + + L+ + Sbjct: 302 -RTSLDATLAKFKEVPKVIQSGDSERKKGQKL----------HLDDDQDALTSTFNPKYL 350 Query: 494 SSFAHIE----------DNGLSAVVRLSSEFSDDAKNAAMVILSGMKH--QKDTETRYNT 541 +S + + A++ + + + Q + R + Sbjct: 351 TSRDLFKLELSDLAFQRHILVQALILIDFLLTLTEDAKKQPFYQNAQKAMQYNFTLREDD 410 Query: 542 DHKSNKFDSLYDSYINT-PLTKLEQSTAGGNFNKDKEAIKLYLLGSMKDSGNYTLNRVRV 600 + S +Y+ P K ++DK ++ M++ +T +RV Sbjct: 411 TKWALGIKSAIANYLQEDPDGKFYYRMVDTVLSRDKNWVRW----KMENCQPFTRDRVAA 466 Query: 601 SDAMQIVLG 609 D +Q G Sbjct: 467 KDFLQAKSG 475 >gi|160886551|ref|ZP_02067554.1| hypothetical protein BACOVA_04562 [Bacteroides ovatus ATCC 8483] gi|156108436|gb|EDO10181.1| hypothetical protein BACOVA_04562 [Bacteroides ovatus ATCC 8483] Length = 1052 Score = 38.3 bits (87), Expect = 4.1, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 68/202 (33%), Gaps = 7/202 (3%) Query: 50 RDNAFATANT--RYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFL 107 R F A T Y E+S + V + +L + + + ++E ++ + R + D Sbjct: 733 RTEFFKKALTLNEYIEMSLEEVAQRIEIMKNLPKIEKEEQENLLEEIIEYQKRISVDAEY 792 Query: 108 SSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSN 167 +I H + K+ +E D+ D + G T E L + Sbjct: 793 KTIKHINSLKQKLLILERDLADNMVSCRNRMDGMTTADIEKGLAEVEQRIKNIMSLPKIR 852 Query: 168 ENYFQR-VQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFY 226 N ++ ++ I + + + D+ + + ++ G + + FY Sbjct: 853 TNELEKSLKKIIEDKDKIRTDIEYQTTTQIDKLKEKLLKLENGL----ADCIAPYSQTFY 908 Query: 227 KKGHPPKDSTSLSDVSDSARER 248 ++ + + + R Sbjct: 909 EQYAAVGHEEKIKSLKEKVDGR 930 >gi|46446033|ref|YP_007398.1| putative eucaryotic myosin heavy chain [Candidatus Protochlamydia amoebophila UWE25] gi|46399674|emb|CAF23123.1| putative eucaryotic myosin heavy chain [Candidatus Protochlamydia amoebophila UWE25] Length = 840 Score = 38.3 bits (87), Expect = 4.1, Method: Composition-based stats. Identities = 51/281 (18%), Positives = 104/281 (37%), Gaps = 32/281 (11%) Query: 18 QDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTN 77 Q+ + V+ +A RDLG + IR + + + + + +A+Q Sbjct: 156 QELNQKRVDLEAARRDLGSLTPNLNALQKDIRSLEEKKSQDEQALQQANQALQ------- 208 Query: 78 SLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVR-KKFQAQVEHDIRDYHTKGVD 136 +LQ +E G + LS I+ + KK QVE + +D +K Sbjct: 209 ------NLQREKASREAQAGDLELEKTTLLSKIAEIEASLKKALEQVEQERKDNASKENT 262 Query: 137 IQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLS 196 I + + E LN + + L + + Q Q++ + ++K Sbjct: 263 I-VALQGQLTE--LNKQLSTLNRNHLKNEDEQALQQANQALQN---------LQREKASR 310 Query: 197 EAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSLEVVEDV 256 EA+ + + ++ + + +A + KD+ S + + + E+ + + Sbjct: 311 EAQAEELELEKTTLLSKIAEIEASLKKALEQAEQERKDNASKENTIVDLQGQLTELNKQL 370 Query: 257 SKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQA 297 S + LD K E L ++ +L +L +E QA Sbjct: 371 STL-NRNHQEHLDQEK-----EDLKNQLESLKRQLNEERQA 405 >gi|115943067|ref|XP_001195605.1| PREDICTED: similar to dystonin [Strongylocentrotus purpuratus] Length = 3054 Score = 38.3 bits (87), Expect = 4.2, Method: Composition-based stats. Identities = 57/405 (14%), Positives = 130/405 (32%), Gaps = 55/405 (13%) Query: 11 VEAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQ 70 V V D +++ + + + +Q + + N F + + + KA Q Sbjct: 1188 VSDVQQKLDHIQNRYD-----NIFQKSEDQLSQLQQAVPLANEFNRLQSEFADWLKKAEQ 1242 Query: 71 DFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSS------ISHRDVRKKFQAQVE 124 D +F S +K+++ +D R R +S + + Q+ Sbjct: 1243 DLRNFDPSASVETQKAVQEKLQDEID-RHRPILQELNEKGTRLIELSPGQGAAEIRQQLN 1301 Query: 125 HDIRDY-----HTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITD 179 D + + TK ++ ++ A + + L +E Q+ + Sbjct: 1302 DDNQRFNELASRTKDIEYRLSAALDGAQK------VQQSMDGLLKWLDEADRQQQTQARE 1355 Query: 180 HINSLPI---DLRLKQKLLSE--------AKEKLNTNQIIGAHARDPRVFENFMRAFYKK 228 I+ P D K K + + ++ L + + DPR Sbjct: 1356 TISVEPDGVRDQLKKAKAIDKEITGKQSRVQDTLGAAEALIEQTSDPRQQARLESQANDL 1415 Query: 229 GHPPKDSTSLS----DVSDSARERSLEVVEDVSKAIGL--AGWDRLDDTKRRRLLEHLSS 282 D ++ S +V + A ++L+ E + L + K L Sbjct: 1416 KQRFDDLSARSADRVEVLEEALPQTLQFNESHEELTKWLDEVEGDLKNLKPPGLDAEEIR 1475 Query: 283 RDNALNTKLRKETQAQARRID---------AQLNHGITVKPSELIPLEDYTQAYG----- 328 R+ N +++ + + ++ A+L+ G + + +D + Sbjct: 1476 REVDNNRFMKQTVAEKQQHVNKLNRIAPELAKLSPGAGAQGVQAKADDDNRRYENVKEDV 1535 Query: 329 VEQGTELYNLQQFKSVAAPDVARIKLMSTFDAKKFLQKIDDEYIS 373 ++G ++++L Q S D+ A+K L + + Sbjct: 1536 NKRGEKMFDLLQRTSSFVEDLDSTLEQLNITAEK-LSRPEPISAD 1579 >gi|297463001|ref|XP_605047.5| PREDICTED: myosin, heavy polypeptide 15 [Bos taurus] Length = 2004 Score = 38.3 bits (87), Expect = 4.4, Method: Composition-based stats. Identities = 51/320 (15%), Positives = 114/320 (35%), Gaps = 29/320 (9%) Query: 46 EGIRRDNAFATANTRY---TELSFKAVQDFHDFTNSLDT---RDSLQAGDKIKEYVDGRI 99 + I+R A + + + +A+ + SL+ R L+ GD + + G+ Sbjct: 1372 DTIQRAEDLEDAKKKLALRLQEAAEAMGVANARNASLERARLRLQLELGDTLSDL--GQA 1429 Query: 100 RSAYDRFLSSISHRDV------RKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMT 153 RSA H D RK ++Q D + + IQ+ + + E+ Sbjct: 1430 RSAAAALNQKQQHFDKSLDDWRRKHEESQAMLDAAKKEARALSIQLLELRHSYEEGTMSQ 1489 Query: 154 VGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHAR 213 L + N +V+ N ++ KQ + + ++ + GA R Sbjct: 1490 EALRRENKHLKEEISNLTNQVR--EGKKNLSKMEKVKKQIEQEKNEVQMALEEAEGALER 1547 Query: 214 DPRVFENFMRAFYKKGHPPKDSTS-LSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTK 272 + + S + ++ R++ ++ + ++ R++ T+ Sbjct: 1548 NESKVLRLQLELSDTKAELERKLSEKDEEIENFRKKQQCAIDSLQSSLDSEARCRIEATR 1607 Query: 273 RRRLLE----------HLSSRDNALNTKLRKETQAQARRIDAQLNHG--ITVKPSELIPL 320 +R +E +SR + TK + Q Q + + Q++ + + E + + Sbjct: 1608 LKRSMEGDLTEMELQLSCASRQASEATKSLGQLQTQVKDLQVQVDDSTCMNSELKEQVAV 1667 Query: 321 EDYTQAYGVEQGTELYNLQQ 340 + A + EL +LQ+ Sbjct: 1668 AERRNALLQAEVEELRSLQE 1687 >gi|297470826|ref|XP_002684783.1| PREDICTED: KIAA1000 protein-like [Bos taurus] gi|296491471|gb|DAA33524.1| KIAA1000 protein-like [Bos taurus] Length = 2004 Score = 38.3 bits (87), Expect = 4.4, Method: Composition-based stats. Identities = 51/320 (15%), Positives = 114/320 (35%), Gaps = 29/320 (9%) Query: 46 EGIRRDNAFATANTRY---TELSFKAVQDFHDFTNSLDT---RDSLQAGDKIKEYVDGRI 99 + I+R A + + + +A+ + SL+ R L+ GD + + G+ Sbjct: 1372 DTIQRAEDLEDAKKKLALRLQEAAEAMGVANARNASLERARLRLQLELGDTLSDL--GQA 1429 Query: 100 RSAYDRFLSSISHRDV------RKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMT 153 RSA H D RK ++Q D + + IQ+ + + E+ Sbjct: 1430 RSAAAALNQKQQHFDKSLDDWRRKHEESQAMLDAAKKEARALSIQLLELRHSYEEGTMSQ 1489 Query: 154 VGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHAR 213 L + N +V+ N ++ KQ + + ++ + GA R Sbjct: 1490 EALRRENKHLKEEISNLTNQVR--EGKKNLSKMEKVKKQIEQEKNEVQMALEEAEGALER 1547 Query: 214 DPRVFENFMRAFYKKGHPPKDSTS-LSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTK 272 + + S + ++ R++ ++ + ++ R++ T+ Sbjct: 1548 NESKVLRLQLELSDTKAELERKLSEKDEEIENFRKKQQCAIDSLQSSLDSEARCRIEATR 1607 Query: 273 RRRLLE----------HLSSRDNALNTKLRKETQAQARRIDAQLNHG--ITVKPSELIPL 320 +R +E +SR + TK + Q Q + + Q++ + + E + + Sbjct: 1608 LKRSMEGDLTEMELQLSCASRQASEATKSLGQLQTQVKDLQVQVDDSTCMNSELKEQVAV 1667 Query: 321 EDYTQAYGVEQGTELYNLQQ 340 + A + EL +LQ+ Sbjct: 1668 AERRNALLQAEVEELRSLQE 1687 >gi|313159585|gb|EFR58948.1| YmdA/YtgF family protein [Alistipes sp. HGB5] Length = 550 Score = 38.3 bits (87), Expect = 4.5, Method: Composition-based stats. Identities = 38/300 (12%), Positives = 89/300 (29%), Gaps = 14/300 (4%) Query: 57 ANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVR 116 A ++ +L + + ++ + S Q +I+ + + R ++ + R Sbjct: 92 AKEKFIQLKSEYDRQVNERNQKI--AQSEQRAKQIENNLQNQQRDLENKLRE-----NDR 144 Query: 117 KKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQS 176 K Q Q + I ++ + VD + E ++ A ++ + E + Sbjct: 145 LKEQMQNQLQILEHKKEEVDQMMREQNVRLEQISGLSSEEAKNILIENMKAEAKTEAAGY 204 Query: 177 ITDHINSLPIDLRLKQKLL------SEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGH 230 I + I + + K + A E N + + V + + Sbjct: 205 INETIEEAKMTATKEAKRIIVASIQRVATETAIENAVTVFNIESDEVKGRIIGREGRNIR 264 Query: 231 PPKDSTSLSDVSDSARERSL-EVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNT 289 + +T + + D E + + V + I +L R Sbjct: 265 ALEAATGIEIIVDDTPEAIILSGFDPVRREIARLALHQLVTDGRIHPARIEEVVAKVQKQ 324 Query: 290 KLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDV 349 + + R HG+ + +I Y +YG + A ++ Sbjct: 325 IEEEIVEVGKRTTIDLGIHGLHPELIRMIGKMKYRSSYGQNLLQHARETANLAGIMAAEL 384 >gi|255100181|ref|ZP_05329158.1| putative integrase/recombinase [Clostridium difficile QCD-63q42] Length = 188 Score = 38.3 bits (87), Expect = 5.2, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 40/130 (30%), Gaps = 7/130 (5%) Query: 84 SLQAGDKIKEYVDGRIRSAYDRFLS-------SISHRDVRKKFQAQVEHDIRDYHTKGVD 136 G +I + + +I + L + +R V K + R Y+ K D Sbjct: 46 EANLGLRISDILRLKIGNFKGNMLEINEKKTGKLQYRPVNKNIIESINEHARKYNLKSND 105 Query: 137 IQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLS 196 + +A + L + + + S + Q + N + L ++ Sbjct: 106 YLVNIKTKAIQKQLRIICKYLNLYNISTHSFRKLYATTQFEKSNNNLELVKELLNHSSVA 165 Query: 197 EAKEKLNTNQ 206 + + Q Sbjct: 166 TTQRYIRVTQ 175 >gi|126698762|ref|YP_001087659.1| putative integrase/recombinase [Clostridium difficile 630] gi|115250199|emb|CAJ68020.1| putative integrase/recombinase [Clostridium difficile] Length = 190 Score = 37.9 bits (86), Expect = 5.5, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 40/130 (30%), Gaps = 7/130 (5%) Query: 84 SLQAGDKIKEYVDGRIRSAYDRFLS-------SISHRDVRKKFQAQVEHDIRDYHTKGVD 136 G +I + + +I + L + +R V K + R Y+ K D Sbjct: 48 EANLGLRISDILRLKIGNFKGNMLEINEKKTGKLQYRPVNKNIIESINEHARKYNLKSND 107 Query: 137 IQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLS 196 + +A + L + + + S + Q + N + L ++ Sbjct: 108 YLVNIKTKAIQKQLRIICKHLNLYNISTHSFRKLYATTQFEKSNNNLELVKELLNHSSVA 167 Query: 197 EAKEKLNTNQ 206 + + Q Sbjct: 168 TTQRYIRVTQ 177 >gi|288959380|ref|YP_003449721.1| hypothetical protein AZL_025390 [Azospirillum sp. B510] gi|288911688|dbj|BAI73177.1| hypothetical protein AZL_025390 [Azospirillum sp. B510] Length = 715 Score = 37.9 bits (86), Expect = 5.5, Method: Composition-based stats. Identities = 71/464 (15%), Positives = 139/464 (29%), Gaps = 57/464 (12%) Query: 70 QDFHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRD 129 Q F + S L+AG ++ + VD A ++ +D D Sbjct: 26 QRFTEPQQSAGGAALLRAGGQVGDAVD-----AVGNAFEKLNRQDEALTVARMRAEDRTT 80 Query: 130 YHTKGVDIQIGATQRA---QEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPI 186 + TK + + A A L AA + PS + D L Sbjct: 81 WSTKFNEAKEKADIGAPDFTSGFLKDFDAYAAERLAAAPSPLARRDVEMGLIDLRGRLAE 140 Query: 187 DLRLKQKLLSEAKEKLNTNQII----GAHARDPRVFENFMRAFYKK----GHPP-----K 233 + + AK + + QI G ARD R F + G PP + Sbjct: 141 EGLQFEAGARLAKRRGDVEQIRSLNGGLLARDHRDFATVLADTENSIRTLGLPPAVQAQE 200 Query: 234 DSTSLSDVSDSARERSLEVVEDVSKAIGLAG-WDRLDDTKRRRLLEHLSSRDNALNTKLR 292 + + +++A + +G WD R L ++ ++ Sbjct: 201 IEATRATFAETALMGLASRDPAAAMTAIKSGAWDGYTQPDRLVRLYDHANTEDRRRRNEA 260 Query: 293 KETQAQARRIDAQLNHGITVKPSEL------------IPLEDYTQAYGVEQGTELYNLQQ 340 + A+AR + + + + IP AYG E+G +++ + Sbjct: 261 RAAAAEARAEARSI---LEPQYRDAFAAAQDGAAFTPIPEARLVNAYGPERGRQMFAELE 317 Query: 341 FKSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSL-----SLASTMMATKYKEILEKS 395 DV + ++ + L + LA + + + Sbjct: 318 ATRKLGQDVKTVAMVPPDVQDRLLASYAPDGAGYAGERERYGELARAVARDRKNREDPAA 377 Query: 396 HRQSMQELNQDAISWGIKYKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNK 455 + S +D ++ + DP + + + +L+ AG + + ++ Sbjct: 378 YVLSNSSKLRDLVAGA----RTDPTKAQAAVALSLTLQADAGIPEVDRRILPVS---MAD 430 Query: 456 TEENQLRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHI 499 T QL T + A A ++ + E F + Sbjct: 431 TAARQLATLPPEARA--------DAVEGMAATYGEHWPRVFREL 466 >gi|254974701|ref|ZP_05271173.1| putative integrase/recombinase [Clostridium difficile QCD-66c26] gi|255092090|ref|ZP_05321568.1| putative integrase/recombinase [Clostridium difficile CIP 107932] gi|255306070|ref|ZP_05350242.1| putative integrase/recombinase [Clostridium difficile ATCC 43255] gi|255313827|ref|ZP_05355410.1| putative integrase/recombinase [Clostridium difficile QCD-76w55] gi|255516508|ref|ZP_05384184.1| putative integrase/recombinase [Clostridium difficile QCD-97b34] gi|255649608|ref|ZP_05396510.1| putative integrase/recombinase [Clostridium difficile QCD-37x79] gi|255655168|ref|ZP_05400577.1| putative integrase/recombinase [Clostridium difficile QCD-23m63] gi|260682772|ref|YP_003214057.1| putative integrase/recombinase [Clostridium difficile CD196] gi|260686370|ref|YP_003217503.1| putative integrase/recombinase [Clostridium difficile R20291] gi|296451156|ref|ZP_06892897.1| phage integrase family site-specific recombinase [Clostridium difficile NAP08] gi|296880492|ref|ZP_06904454.1| phage integrase family site-specific recombinase [Clostridium difficile NAP07] gi|306519731|ref|ZP_07406078.1| putative integrase/recombinase [Clostridium difficile QCD-32g58] gi|260208935|emb|CBA61947.1| putative integrase/recombinase [Clostridium difficile CD196] gi|260212386|emb|CBE03222.1| putative integrase/recombinase [Clostridium difficile R20291] gi|296259977|gb|EFH06831.1| phage integrase family site-specific recombinase [Clostridium difficile NAP08] gi|296428446|gb|EFH14331.1| phage integrase family site-specific recombinase [Clostridium difficile NAP07] Length = 188 Score = 37.9 bits (86), Expect = 5.5, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 40/130 (30%), Gaps = 7/130 (5%) Query: 84 SLQAGDKIKEYVDGRIRSAYDRFLS-------SISHRDVRKKFQAQVEHDIRDYHTKGVD 136 G +I + + +I + L + +R V K + R Y+ K D Sbjct: 46 EANLGLRISDILRLKIGNFKGNMLEINEKKTGKLQYRPVNKNIIESINEHARKYNLKSND 105 Query: 137 IQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKLLS 196 + +A + L + + + S + Q + N + L ++ Sbjct: 106 YLVNIKTKAIQKQLRIICKHLNLYNISTHSFRKLYATTQFEKSNNNLELVKELLNHSSVA 165 Query: 197 EAKEKLNTNQ 206 + + Q Sbjct: 166 TTQRYIRVTQ 175 >gi|332292808|ref|YP_004431417.1| metal dependent phosphohydrolase [Krokinobacter diaphorus 4H-3-7-5] gi|332170894|gb|AEE20149.1| metal dependent phosphohydrolase [Krokinobacter diaphorus 4H-3-7-5] Length = 521 Score = 37.9 bits (86), Expect = 5.9, Method: Composition-based stats. Identities = 43/330 (13%), Positives = 99/330 (30%), Gaps = 33/330 (10%) Query: 38 LGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVDG 97 L +AT E +++D A ++ EL + + ++ D+ + + V G Sbjct: 45 LKEATSDGESLKKDKILQ-AKEKFIELKAEHEKVILSRDKKMN--DAEKRARDKESQVSG 101 Query: 98 RIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDY--HTKGVDIQIGATQRAQEDNLNMTVG 155 + KK +E I+DY + VD + ++ E + Sbjct: 102 ELSR--------------NKKLNQSLEDKIKDYDKRLEYVDKKKAEAEKMHESTVQQLEV 147 Query: 156 LAAAH----------VLHDPSNENYFQRVQSITDHINSLPIDLRLK---QKLLSEAKEKL 202 ++ L D + + + VQ+ + K + E+ Sbjct: 148 ISGLSAEEAKSQLVTTLKDQAKADAMKHVQTAIEEAKMTAEQEAKKIIINTIQRIGTEEA 207 Query: 203 NTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSL-EVVEDVSKAIG 261 N + + V + + + +T + + D E + + V + + Sbjct: 208 VENCVSVFNLESDDVKGRIIGREGRNIRAIEAATGVEIIVDDTPEAIILSCFDSVRREVA 267 Query: 262 LAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLE 321 +L R ++ + R + HG+ + + + Sbjct: 268 RLSLHKLVTDGRIHPARIEEVVQKTTKQIEQEIVEVGKRTVIDLGIHGLHPELIKAVGRM 327 Query: 322 DYTQAYGVEQGTELYNLQQFKSVAAPDVAR 351 Y +YG + + SV A ++ Sbjct: 328 KYRSSYGQNLLQHSREVAKLCSVMAAELGV 357 >gi|229497008|ref|ZP_04390713.1| conserved hypothetical protein [Porphyromonas endodontalis ATCC 35406] gi|229316110|gb|EEN82038.1| conserved hypothetical protein [Porphyromonas endodontalis ATCC 35406] Length = 521 Score = 37.9 bits (86), Expect = 5.9, Method: Composition-based stats. Identities = 48/340 (14%), Positives = 102/340 (30%), Gaps = 44/340 (12%) Query: 37 ALGKATQFLEGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDSLQAGDKIKEYVD 96 L +A + +E +R+D A L + +++ L + + Sbjct: 33 KLAEAKREIERLRKD-----ARREADALEKEKMKELKQKFQGLKEELERETKQR------ 81 Query: 97 GRIRSAYDRFLSSISHRD-------------------VRKKFQAQVEHDIRDYHTKGVDI 137 SA+D + + R+ VR+ AQ D+ ++ + ++ Sbjct: 82 ---NSAFDSRHTKLKRREEDLNERQCELEKKNGELDAVREMLTAQ--QDVVEHRQEELN- 135 Query: 138 QIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHIN---SLPIDLRLK--- 191 ++ A QR Q +++ A L + + + I + + K Sbjct: 136 ELIAQQRNQLEHIGGLSATEAKEQLIETMRAEAQDEATAYVNEIVEEAKMTANKEAKRIV 195 Query: 192 -QKLLSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARER-S 249 Q + A E N + H + + + + + +T + + D E Sbjct: 196 VQSIQRVATETAIENSVSVFHIDNDEIKGRIIGREGRNIRALEAATGIEIIVDDTPEAIV 255 Query: 250 LEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHG 309 L + V + I +L R + + R + HG Sbjct: 256 LSGFDPVRREIARLALHQLVQDGRIHPARIEEVVAKVRKQVEEEIIETGKRTVIDLGVHG 315 Query: 310 ITVKPSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDV 349 I + LI Y +YG ++ A ++ Sbjct: 316 IHPELVRLIGKMKYRSSYGQNLLQHSRETANLCAIMASEL 355 >gi|67921510|ref|ZP_00515028.1| Protein of unknown function DUF814:Fibronectin-binding A, N-terminal [Crocosphaera watsonii WH 8501] gi|67856622|gb|EAM51863.1| Protein of unknown function DUF814:Fibronectin-binding A, N-terminal [Crocosphaera watsonii WH 8501] Length = 578 Score = 37.9 bits (86), Expect = 6.0, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 56/162 (34%), Gaps = 9/162 (5%) Query: 85 LQAGDKIKEYVDGRIRSAYDRFLSSISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQR 144 ++ I+ +D Y + ++ + ++ + Q +++ ++ + K + Q Q Sbjct: 266 IEPKINIQTLIDN----YYTKEINQEKFKQLKHQLQQKLKSCLKKLYNKANNFQQKMEQS 321 Query: 145 AQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRL----KQKLLSEAKE 200 ++ + L AH N TD +P++ + L + + Sbjct: 322 SEAEEYRQKGDLLMAHSYQWEPGMNSMTLKDFETDKPIKIPLNPEKNAIQNAQNLYKKHQ 381 Query: 201 KLNTNQ-IIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDV 241 KL + + ++ + N++ + S D+ Sbjct: 382 KLKRAKGAVEPLLKEVQEEINYLEQVEDSLQQIDNYNSPEDL 423 >gi|145553261|ref|XP_001462305.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124430144|emb|CAK94932.1| unnamed protein product [Paramecium tetraurelia] Length = 1564 Score = 37.5 bits (85), Expect = 7.8, Method: Composition-based stats. Identities = 72/566 (12%), Positives = 182/566 (32%), Gaps = 53/566 (9%) Query: 54 FATANTRYTELSFKAVQDFHDFTNSL---------DTRDSLQAGDKIKEYVDGRIRSAYD 104 F +T + ++D ++ L + + LQ + + V+ + S Sbjct: 468 FTEVSTNLKNIKGDQIKDKNELQEGLKKITNDVENNDKKQLQLMKNLDQKVNQGLNSQEF 527 Query: 105 RFLS-SISHRDVRKKFQAQVEHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLH 163 + I ++ K Q Q + + QI + + V ++ + Sbjct: 528 KLDQLRIELSEIIKNIQLQYNIQMHE--------QIQFAIETAKKDATQQVLESSQQI-- 577 Query: 164 DPSNENYFQRVQSITDHINSLPIDLRLKQKLLSEAKEKLNTNQIIGAHARDPRVFENFMR 223 N+ + ++ N+ L ++ K L +E+ ++ + + + Sbjct: 578 ---NDRITREIEMADQQTNAKLDSLSIQMKDLIRNREEYILQRLQSMIDEEKELRQRLQE 634 Query: 224 AFYKKGHPPKD-STSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSS 282 K G + L R E + + + I +++ D +R+ E L+S Sbjct: 635 LIEKYGDMVRKLKEDLDYEMQQLRRDQDEKLNEAIQKIERDYSEKVKDIERQLKQEILNS 694 Query: 283 RDNALNTKLRKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTEL-YNLQQF 341 Q + + + + + + +E+ ++Y ++ + ++L+ Sbjct: 695 ---------ENRMQTKLDVLSLEFSKTMEQMKLYIKKIENTLKSYVQQELEQTKHDLKMM 745 Query: 342 KSVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQ 401 DV ++++ + +K + + +I + M E ++ M Sbjct: 746 IKGIEDDVFKLRIWTIQQMEKMDSAMKEIHIQIQTEMYLERMYTMTTLENFQEHLINVMN 805 Query: 402 ELNQDAISWGIK-YKQIDPLRFDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTEENQ 460 + QD + W Q++ L + +++ DD N + Q Sbjct: 806 KEKQDRLQWQFSMENQLNQLDMSLNNK----IDSVQNNLQQQLDDVNAEFKQAVQKLNEQ 861 Query: 461 LRTQLVKRPASESVDLIRGAYNTLSDSDKEGVRSSFAHIEDNGLSAVVRLSSEFSDDAKN 520 + QL K + + + + + K+ + D + + ++F +D +N Sbjct: 862 INEQLGKLE-EDFNQKLNDVHEQVKQAAKDLEDKINQQM-DKMNEEINQQLTQFKNDVEN 919 Query: 521 AAMVILSGMKH-------QKDTETRYNTDHKSNKFDSLYDSYINTPLTKLEQSTAGGNFN 573 + +K + + + N + + T L +S +F Sbjct: 920 QFQELTEKIKQDLQDLFDEINQKVNNLRVDTQNGLTQVVK--LQTTLGNQLKSQIDESFA 977 Query: 574 KDKEAIKLYLLGSMKDSGNYTLNRVR 599 + K YL+ N L++ R Sbjct: 978 YFQS--KFYLM-EQIRQANEELDKER 1000 >gi|319954135|ref|YP_004165402.1| 2,3 cyclic-nucleotide 2-phophodiesterase [Cellulophaga algicola DSM 14237] gi|319422795|gb|ADV49904.1| 2,3 cyclic-nucleotide 2-phophodiesterase [Cellulophaga algicola DSM 14237] Length = 523 Score = 37.5 bits (85), Expect = 8.7, Method: Composition-based stats. Identities = 40/336 (11%), Positives = 106/336 (31%), Gaps = 30/336 (8%) Query: 32 RDLGDALGKATQFL-------EGIRRDNAFATANTRYTELSFKAVQDFHDFTNSLDTRDS 84 + + +A +A +FL E I+++ F A ++ EL + + + + ++ Sbjct: 32 KTIQNAKKEANRFLKDANVEGENIKKEKIFQ-AKEKFLELKAEHEKVINSKDKKI--GEA 88 Query: 85 LQAGDKIKEYVDGRI---RSAYDRFLSSISHRDVRKKFQAQVEHDIRDYH---TKGVDIQ 138 + + + + + D+ S I + + +F + + ++ H + +++ Sbjct: 89 EKRTRDKESQISSELAKNKKLSDQLESQIKDVEYKHEFFDKKQSELDKLHKNQVQQLEVI 148 Query: 139 IGATQRAQE----DNLNMTVGLAAAHVLHDPSNENYFQRVQSITDHINSLPIDLRLKQKL 194 G + + ++L T A + E Q I + + Sbjct: 149 SGLSAEEAKGQLLESLRETAKSDAMSYMQATMEEAKLTAQQEAKKIIINT---------I 199 Query: 195 LSEAKEKLNTNQIIGAHARDPRVFENFMRAFYKKGHPPKDSTSLSDVSDSARERSL-EVV 253 E+ N + + V + + + +T + V D E + Sbjct: 200 QRIGTEEAVENCVSVFNLESDDVKGRIIGREGRNIRALEAATGVEIVVDDTPEAIILSCF 259 Query: 254 EDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNALNTKLRKETQAQARRIDAQLNHGITVK 313 + V + + +L R ++ + R + HG+ + Sbjct: 260 DSVRREVARLSLHKLVTDGRIHPARIEEIVKKTEKQIEQEIVEIGKRTVIDLGIHGLHPE 319 Query: 314 PSELIPLEDYTQAYGVEQGTELYNLQQFKSVAAPDV 349 + + Y +YG + + V A ++ Sbjct: 320 LIKAVGRMKYRSSYGQNLLQHSREVAKLCGVMAAEL 355 >gi|123448346|ref|XP_001312904.1| viral A-type inclusion protein [Trichomonas vaginalis G3] gi|121894768|gb|EAX99974.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3] Length = 3748 Score = 37.1 bits (84), Expect = 9.9, Method: Composition-based stats. Identities = 69/522 (13%), Positives = 172/522 (32%), Gaps = 49/522 (9%) Query: 12 EAVTPNQDAVRDSVNPQAGLRDLGDALGKATQFLEGIRRDNAFATANTRYTELSFKAVQD 71 + + QD ++D L + + L K + L+ ++ +++ Sbjct: 857 QRIQTLQDNLQDRKKLNNELTEQNNKLQKELKDLQNELDQTELVNDDSESLNKKLDEIKE 916 Query: 72 FHDFTNSLDTRDSLQAGDKIKEYVDGRIRSAYDRFLSSISHRDV----RKKFQAQV---- 123 + S + ++ Q I+E ++F + ++ K QAQ+ Sbjct: 917 QINERKSQNENNTEQNEKLIEEI---------EKFAKELDEIEIIEDKSDKLQAQISELQ 967 Query: 124 ----EHDIRDYHTKGVDIQIGATQRAQEDNLNMTVGLAAAHVLHDPSNENYFQRVQSITD 179 E + T + + + + L+ + EN + ++ I D Sbjct: 968 KQIDEKQKNNEQTDKSNNDLEHELQITKQKLDSMSSVKNNSDYLKSEIENVNKEIEKIRD 1027 Query: 180 HINSLPIDLRLKQKLLSEAKEKLNTN------------QIIGAHARDPRVFENFMRAFYK 227 N L +L+ K K L E + + + +I A + + E Sbjct: 1028 TNNKLKQELQDKNKELEEMTDIADNSEELKEKIDSVNEEITKRVANNTTIDELIRHLHED 1087 Query: 228 KGHPPKDSTSLSDVSDSARERSLEVVEDVSKAIGLAGWDRLDDTKRRRLLEHLSSRDNAL 287 + S+ V D+ + E +++ + + + RL++ + + L Sbjct: 1088 LKNAEAKLQSIPHVDDNTDSLQKSLDEVLAQISQKQRENDELNDEISRLIQEKEEKTDEL 1147 Query: 288 NTKL-----RKETQAQARRIDAQLNHGITVKPSELIPLEDYTQAYGVEQGTELYNLQQFK 342 N R+E ++ + +Q+ K +E I E+ A +E + + + Sbjct: 1148 NNMETIPDKREEISSEIETVKSQIEE--KKKNNEKIAEENKKLAEELENLRQTLSKMETS 1205 Query: 343 SVAAPDVARIKLMSTFDAKKFLQKIDDEYISNPSLSLASTMMATKYKEILEKSHRQSMQE 402 ++ + + + + +++D+ + A + E +E Q ++ Sbjct: 1206 DQPLENIQKEIETTKQEISEKQKELDELKQELEQIKDEDQSKADEISEEIENIKTQIDEK 1265 Query: 403 LNQDAISWGIKYKQIDPLR-----FDTEESFADSLRQRAGFVKKIKDDYNLTTSHFNKTE 457 ++ ++ L E D + +++ + + + T K Sbjct: 1266 NKKNEEIAKNNEEKQSELDEKLKELQDLEEIKDETEEINQQIEETQKE--IETKKQQKEN 1323 Query: 458 ENQLRTQLVKRPAS-ESVDLIRGAYNTLSDSDKEGVRSSFAH 498 N+L +L K E ++ + L++ + E V+S Sbjct: 1324 NNKLNEELDKLKQDLEQIENVEDNVEKLTE-EIEKVKSDIDS 1364 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.303 0.119 0.268 Lambda K H 0.267 0.0363 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 9,132,350,496 Number of Sequences: 14124377 Number of extensions: 325527043 Number of successful extensions: 870156 Number of sequences better than 10.0: 441 Number of HSP's better than 10.0 without gapping: 37 Number of HSP's successfully gapped in prelim test: 590 Number of HSP's that attempted gapping in prelim test: 869781 Number of HSP's gapped (non-prelim): 851 length of query: 707 length of database: 4,842,793,630 effective HSP length: 147 effective length of query: 560 effective length of database: 2,766,510,211 effective search space: 1549245718160 effective search space used: 1549245718160 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.4 bits) S2: 85 (37.5 bits)