BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= gi|254781203|ref|YP_003065616.1| hypothetical protein CLIBASIA_05550 [Candidatus Liberibacter asiaticus str. psy62]
         (478 letters)

Database: nr
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done

Results from round 1

>gi|254781203|ref|YP_003065616.1| hypothetical protein CLIBASIA_05550 [Candidatus Liberibacter asiaticus str. psy62]
 gi|254040880|gb|ACT57676.1| hypothetical protein CLIBASIA_05550 [Candidatus Liberibacter asiaticus str. psy62]
 gi|317120669|gb|ADV02492.1| hypothetical protein SC1_gp035 [Liberibacter phage SC1]
 gi|317120813|gb|ADV02634.1| hypothetical protein SC1_gp035 [Candidatus Liberibacter asiaticus]
          Length = 478

 Score = 983 bits (2541), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/478 (100%), Positives = 478/478 (100%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ 60 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ Sbjct: 1 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ 60 Query: 61 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP 120 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP Sbjct: 61 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP 120 Query: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV Sbjct: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 Query: 181 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV 240 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV Sbjct: 181 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV 240 Query: 241 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH 300 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH Sbjct: 241 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH 300 Query: 301 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL 360 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL Sbjct: 301 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL 360 Query: 361 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP 420 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP Sbjct: 361 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP 420 Query: 421 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL 478 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL Sbjct: 421 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL 478 >gi|332160978|ref|YP_004297555.1| hypothetical protein YE105_C1356 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665208|gb|ADZ41852.1| Hypothetical phage protein [Yersinia enterocolitica subsp. 
palearctica 105.5R(r)] gi|330862134|emb|CBX72298.1| hypothetical protein YEW_AK02350 [Yersinia enterocolitica W22703] Length = 430 Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 69/277 (24%), Positives = 127/277 (45%), Gaps = 12/277 (4%) Query: 88 APYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDK 147 AP AG++L+ + ++R G + + PL GA+ A + ++ +G+D Sbjct: 101 APEATVTTTAGQILNGLGDVMSRAVGGTVAAGPLG-GAVLAGGTEAIFANDEGLRKGLDP 159 Query: 148 ETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGY 207 TA + + + L P A ++++ VA+GA N+ G V+RG +++ LE GY Sbjct: 160 LTAAGKGVLDGVSLGAGTLVPAAPFAKTLLSRVAAGAASNIAIGAVQRGTTAEWLEQRGY 219 Query: 208 PDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKS 267 DMAQ Y+++D +++ DG++GA FGG+ ++ D + +H + Sbjct: 220 KDMAQQYKVWDATAMLADGVLGAAFGGLA-----HIGAAATPDSVDAALTARNAQHFRED 274 Query: 268 SSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADN---TLEDPHFKPHLP 324 ++PG+ T + AH L D + RGE D + + D + +F Sbjct: 275 TAPGIPTDIPSNIAHQRALETATDQINRGE--PVDVANIDGVFDAHFIARDGSNFAEQPA 332 Query: 325 EPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELS 361 E P P + + Q P + AE P+ + R+++ Sbjct: 333 EIAPRPVAESEATFQ-PEKTTAETATPEADPILRDIN 368 >gi|30387395|ref|NP_848224.1| hypothetical protein epsilon15p16 [Enterobacteria phage epsilon15] gi|30266050|gb|AAO06079.1| 16 [Salmonella phage epsilon15] Length = 634 Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust. 
Identities = 68/233 (29%), Positives = 107/233 (45%), Gaps = 44/233 (18%) Query: 32 TGLGKEVINMPARSLDKLVAP----FR----------EETHD----QPNYYRG-SRTDPH 72 G K +I+ PA + D VAP FR ET+D Q RG + D Sbjct: 57 VGFSKRLISDPAFTAD--VAPTVNIFRVMFPDADKALNETYDTIGKQLQDARGYVKPDAG 114 Query: 73 SVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSH 132 S GT A ++ GL P I G PT GA A+ S Sbjct: 115 SQGTAAEVLYGLGQFVPAIGATIFGG------PT----------------VGAATAFSST 152 Query: 133 KAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGM 192 +S + +GVD+ TA LA ++++ + + + P A+ + ++ +ASG +N FG Sbjct: 153 YEQSYQDFKGKGVDETTARNLATQQSLFNAAGMALPAAVGT-TLTTRIASGVAINTGFGG 211 Query: 193 VERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSL 245 + R + LE+ GY +MA+ YR+FD ++++ D ++GA FGG H +N + Sbjct: 212 LNRYSVGETLEEKGYTEMAKQYRVFDGQAMLVDAVLGAAFGGAHHLAARNADV 264 >gi|301028421|ref|ZP_07191667.1| conserved domain protein [Escherichia coli MS 196-1] gi|299878532|gb|EFI86743.1| conserved domain protein [Escherichia coli MS 196-1] Length = 686 Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 64/219 (29%), Positives = 107/219 (48%), Gaps = 22/219 (10%) Query: 32 TGLGKEVINMPARSLDKLVAP----FREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSL 87 G K +I+ PA + D VAP FRE D + D + G L + + + Sbjct: 57 VGFSKRLISDPAFTAD--VAPTVNIFREMFPDADK----TLNDTYDT-IGKQLQDARSYV 109 Query: 88 APYIAGAALAGKLLS----FIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIE 143 P +A ++L+ F+P T + G PL GA A+ S +S + + Sbjct: 110 KPDAGSQGMAAEVLNELGKFVPAIGTTMFG-----GPLI-GAATAFSSTYEQSYQDFKGK 163 Query: 144 GVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLE 203 GVD+ TA LA ++++ + + P A+ + ++A +ASG +N FG + R LE Sbjct: 164 GVDEATARNLATQQSLFNAVGMALPAAVGT-TLATRIASGVAINTGFGGLNRYSVGATLE 222 Query: 204 DHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQN 242 + GY +MA+ YR+FD ++++ D ++G FGG+H N Sbjct: 223 EKGYTEMAKQYRVFDGQAMLVDAVLGGVFGGVHHLTTHN 261 >gi|304398391|ref|ZP_07380265.1| hypothetical protein PanABDRAFT_3526 [Pantoea sp. 
aB] gi|304354257|gb|EFM18630.1| hypothetical protein PanABDRAFT_3526 [Pantoea sp. aB] Length = 625 Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 57/218 (26%), Positives = 98/218 (44%), Gaps = 11/218 (5%) Query: 27 DIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNY--YRGSR------TDPHSVGTGA 78 D +W+ G G + A L E P Y RG D + Sbjct: 30 DPRWYAGSGSALFRGAAEGTIGLGQTLVETAKLSPTYSALRGDLPELDEIVDQNFSAVQK 89 Query: 79 HLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSI 138 L + S+ P +A ++L + T + P+A GA+ A+ S + Sbjct: 90 SLNDARNSVKPAPNSQGMAAEILEGLGT-FAPAIAATAVAGPVAGGAV-AFGSSYESTRQ 147 Query: 139 HHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWS 198 +GV+++TA LA +A + + P + + +A + SG +N FG V R Sbjct: 148 DFLAKGVNEDTAGTLALEQAGANALGMALPAGVGGR-LATRLLSGVGINTGFGAVNRFAL 206 Query: 199 SKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH 236 + LE++GY ++A+ YR++D ++L+ DG++GA FGG+H Sbjct: 207 GETLEENGYDELAKQYRVWDKQALLVDGVLGAAFGGVH 244 >gi|330007167|ref|ZP_08305909.1| hypothetical protein HMPREF9538_03598 [Klebsiella sp. MS 92-3] gi|328535514|gb|EGF61974.1| hypothetical protein HMPREF9538_03598 [Klebsiella sp. MS 92-3] Length = 632 Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 34/94 (36%), Positives = 58/94 (61%), Gaps = 1/94 (1%) Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVL 202 +GVD++TA +A ++ + + P A+ + +A + SG +N FG + R + L Sbjct: 163 KGVDEQTARTVAAEQSGFNAVGMGLPAAVGGR-LATRLLSGVGINAAFGGLNRFAVGETL 221 Query: 203 EDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH 236 ED+GY DMA+ YR+FD ++++ D ++GA FGG H Sbjct: 222 EDNGYADMAKQYRVFDGQAILIDSVLGAAFGGAH 255 >gi|317120710|gb|ADV02532.1| hypothetical protein SC2_gp040 [Liberibacter phage SC2] gi|317120771|gb|ADV02592.1| hypothetical protein SC2_gp040 [Candidatus Liberibacter asiaticus] Length = 408 Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust. 
Identities = 63/236 (26%), Positives = 109/236 (46%), Gaps = 27/236 (11%) Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIAS---QSIAKTVASGAVLNVPFGMVERGWSS 199 EGV ETA A++ T A G+++ +S+ +G NV FG+ ER Sbjct: 149 EGVAHETAKI----GALITTGTTFAGGSVSGVIGKSLVSKAVTGGATNVAFGLGERQSIG 204 Query: 200 KVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSK-----QVQNMSLRLVNDLKEG 254 L+ G+ D+AQHYR D T+ +IGA G +H K ++ + + +K Sbjct: 205 AYLDYKGHKDLAQHYREVDGIHTTTEFIIGAGLGALHGKGGKHPDIKPSDVDIAQVVKRD 264 Query: 255 ITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTL 314 I + + S+P + T+ + E H TL ++ + RGE + D + + + + + Sbjct: 265 IDD-------IYHSAPAIATTSRSAELHAQTLEQAIEKMRRGEEINVDPKSIDLMTKDMI 317 Query: 315 EDP--HFKPHLPEPEPLPQYKEHSDRQKPSEPLA-EHPHPKRKEV---ERELSEIE 364 P F P L + L Q ++ +Q+ S+P A + P +V ER L+++E Sbjct: 318 TKPEVEFSPEL--KKQLKQGEDFLAQQEVSKPKALKEQDPLSSQVPEYERRLTDLE 371 >gi|268589386|ref|ZP_06123607.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315413|gb|EFE55866.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 594 Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 68/266 (25%), Positives = 111/266 (41%), Gaps = 45/266 (16%) Query: 11 IRDNIKEWAQRPRVSPDIKW--------HTGLGKEVINMPARSL----DKLVAPFREETH 58 I + + Q P S D + +TGL +I P + L D +V+P E + Sbjct: 11 INQQLDDAMQSPENSGDADFFDGAFTSTYTGLYSGLIAKPEQVLWGIADTVVSPIAREVN 70 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLL-SFIPTPLTRLAGLALQ 117 +Q + S A + + SL P A AG+++ S L G A+ Sbjct: 71 EQFDINDTSEQFIQEQRKNAE--KQVRSLTPDRATTGTAGQVMFSLFDIGGEALTG-AMI 127 Query: 118 SAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP-------GA 170 PL L + ++ + +GVDK TA A E + +L P G Sbjct: 128 GGPLGGAMLVGGVQGFSDYE-KLRADGVDKNTAINKATGEGLFAGLGVLTPMTLGFKGGG 186 Query: 171 IASQSI-AKTVASGAVL--------------------NVPFGMVERGWSSKVLEDHGYPD 209 I ++SI A+ A G L N+ GM +RG++S++L++ GY Sbjct: 187 ILAESIGAQFTARGGTLSSLAGTAARATPDIVYASGSNIAMGMAQRGFASQILKERGYNQ 246 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGM 235 +A Y ++D +++ DG++G FGGM Sbjct: 247 LASQYDVYDKQAIAIDGVLGVAFGGM 272 >gi|315122889|ref|YP_004063378.1| hypothetical protein CKC_05725 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496291|gb|ADR52890.1| hypothetical protein CKC_05725 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 363 Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 45/186 (24%), Positives = 84/186 (45%), Gaps = 13/186 (6%) Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVL 202 EG D TA + ++ + L P + +AS V N+ ++R +L Sbjct: 133 EGQDSSTATKGGMKTGVISGAGALIPAGFGVSVVKSAIASAGV-NLGLSKLDRMGDYAIL 191 Query: 203 EDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLV--------NDLKEG 254 + +GY ++A+H D S+ TD ++G FGG+H+K + + +LV D+ G Sbjct: 192 KANGYDELAEHASEMDSISIATDIVLGMAFGGLHAKNARR-NKKLVGMKPTPSEGDIATG 250 Query: 255 ITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTL 314 L + + P T+ +++E H +A +LV GE D +KL+ + ++ Sbjct: 251 AKNELMTSRTLNDAIP---TTNESFETHMSAIAEAEHALVNGEKFGLDSQKLEALERGSI 307 Query: 315 EDPHFK 320 + P + Sbjct: 308 KKPDIE 313 >gi|315121927|ref|YP_004062416.1| hypothetical protein CKC_00885 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495329|gb|ADR51928.1| hypothetical protein CKC_00885 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 326 Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 45/182 (24%), Positives = 82/182 (45%), Gaps = 11/182 (6%) Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVL 202 EG D TA + ++ + L P + +AS V N+ ++R +L Sbjct: 96 EGQDSSTATKGGMKTGVISGAGALIPAGFGVSVVKSAIASAGV-NLGLSKLDRMGDYAIL 154 Query: 203 EDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV-QNMSLRLV------NDLKEGI 255 + +GY ++A+H D S+ TD ++G FGG+H+K +N L + D+ G Sbjct: 155 KANGYDELAEHASEMDSISIATDIVLGMAFGGLHAKNARRNKKLAGMKPTPSEGDIATGA 214 Query: 256 TERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLE 315 L + + P T+ +++E H +A +LV GE D +KL+ + +++ Sbjct: 215 KNELMTSRTLNDAVP---TTNESFETHMSAIAEAEHALVNGEKFGLDSQKLEALERGSIK 271 Query: 316 DP 317 P Sbjct: 272 KP 273 >gi|309702800|emb|CBJ02131.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 600 Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 47/158 (29%), Positives = 70/158 (44%), Gaps = 46/158 (29%) Query: 78 AHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESS 137 A LV+G+T AGA + IP L AG AL A ++ A L+ ES+ Sbjct: 163 AGLVQGVT------AGAG------TLIPMSLGLRAGGAL------AESVGAQLARTGESA 204 Query: 138 IHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGW 197 + + + T+ AP +A A N+ FGM +RG Sbjct: 205 VRN------------------VAATAVRAAP----------DIAYAAGTNIAFGMAQRGL 236 Query: 198 SSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 ++K L D GY +MA Y +FD +S+ D ++G FGG+ Sbjct: 237 TAKTLRDGGYNEMANQYDVFDRQSIAIDAVLGVAFGGV 274 >gi|215487809|ref|YP_002330240.1| hypothetical protein E2348C_2742 [Escherichia coli O127:H6 str. E2348/69] gi|215265881|emb|CAS10290.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 600 Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 47/158 (29%), Positives = 70/158 (44%), Gaps = 46/158 (29%) Query: 78 AHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESS 137 A LV+G+T AGA + IP L AG AL A ++ A L+ ES+ Sbjct: 163 AGLVQGVT------AGAG------TLIPMSLGLRAGGAL------AESVGAQLARTGESA 204 Query: 138 IHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGW 197 + + + T+ AP +A A N+ FGM +RG Sbjct: 205 VRN------------------VAATAVRAAP----------DIAYAAGTNIAFGMAQRGL 236 Query: 198 SSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 ++K L D GY +MA Y +FD +S+ D ++G FGG+ Sbjct: 237 TAKTLRDGGYNEMAAQYDVFDRQSIAIDAVLGVAFGGV 274 >gi|327252172|gb|EGE63844.1| hypothetical protein ECSTEC7V_3019 [Escherichia coli STEC_7v] Length = 600 Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|323948673|gb|EGB44578.1| hypothetical protein ERKG_04896 [Escherichia coli H252] Length = 600 Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|324008548|gb|EGB77767.1| hypothetical protein HMPREF9532_01735 [Escherichia coli MS 57-2] Length = 600 Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|89152440|ref|YP_512273.1| hypothetical protein PhiV10p19 [Escherichia phage phiV10] gi|74055463|gb|AAZ95912.1| hypothetical protein PhiV10p19 [Escherichia phage phiV10] Length = 600 Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|218700978|ref|YP_002408607.1| hypothetical protein ECIAI39_2668 [Escherichia coli IAI39] gi|218370964|emb|CAR18791.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] Length = 600 Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|300898439|ref|ZP_07116780.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357906|gb|EFJ73776.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 600 Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|332344342|gb|AEE57676.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 600 Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|298381706|ref|ZP_06991305.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279148|gb|EFI20662.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 600 Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|323156121|gb|EFZ42280.1| hypothetical protein ECEPECA14_1896 [Escherichia coli EPECa14] Length = 600 Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|117624700|ref|YP_853613.1| hypothetical protein APECO1_4053 [Escherichia coli APEC O1] gi|115513824|gb|ABJ01899.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 600 Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 22/56 (39%), Positives = 35/56 (62%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|298485994|ref|ZP_07004068.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159471|gb|EFI00518.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 448 Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 45/155 (29%), Positives = 73/155 (47%), Gaps = 4/155 (2%) Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVL 202 EG+D+ TA L E +V + + P A + + A NV GM RG ++ +L Sbjct: 161 EGIDENTATLLGLSEGVVTGAGAILPAAQFVKPVLGDAAIAIGANVGLGMAHRGTAAALL 220 Query: 203 EDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYK 262 + +GY A YR D ++ TD ++GA F G+ +M + + +TER + Sbjct: 221 DSNGYAAQAAQYRAMDGTAIATDAILGAAFFGIGRS---SMRRPTTDQVDAALTER-NAQ 276 Query: 263 HGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGE 297 H ++PGL + AH D L ++ + RGE Sbjct: 277 HADIDTAPGLPVDPRSAIAHQDALRAAIEQINRGE 311 >gi|331648164|ref|ZP_08349254.1| hypothetical protein ECIG_04090 [Escherichia coli M605] gi|331043024|gb|EGI15164.1| hypothetical protein ECIG_04090 [Escherichia coli M605] Length = 600 Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 21/56 (37%), Positives = 34/56 (60%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 +A A N+ FGM +R ++K L D GY +MA Y + D +++ D ++G FGG+ Sbjct: 219 IAYAAGTNIAFGMAQRVLTAKTLRDGGYSEMANQYDVLDRQAIAIDAVLGVAFGGV 274 >gi|85059172|ref|YP_454874.1| hypothetical protein SG1194 [Sodalis glossinidius str. 'morsitans'] gi|84779692|dbj|BAE74469.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 490 Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 32/116 (27%), Positives = 61/116 (52%), Gaps = 6/116 (5%) Query: 187 NVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNM 243 N+ GM +RG S++ L GY DMA+ Y + D ++L TD ++G FGG+ + + +++ Sbjct: 226 NIAMGMAQRGLSAETLRRGGYEDMARQYDVMDAQALATDAVLGVAFGGLGRFINSRGEDV 285 Query: 244 SLRLVN--DLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGE 297 +R V+ ++ +T V + +PG+ S + AH + + ++ GE Sbjct: 286 PVRRVSPEEIDAALTSSSHVNFEV-TVAPGVPVSVLSRNAHAQAMNKAMTDVLAGE 340 >gi|320175033|gb|EFW50146.1| 16 [Shigella dysenteriae CDC 74-1112] Length = 600 Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust. Identities = 18/46 (39%), Positives = 28/46 (60%) Query: 180 VASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITD 225 +A A N+ FGM +RG ++K L D GY +MA Y + D +++ D Sbjct: 219 IAYAAGTNIAFGMAQRGLTAKTLRDGGYSEMANQYDVLDRQAIAID 264 >gi|85059663|ref|YP_455365.1| hypothetical protein SG1685 [Sodalis glossinidius str. 'morsitans'] gi|84780183|dbj|BAE74960.1| hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 490 Score = 46.2 bits (108), Expect = 0.011, Method: Compositional matrix adjust. 
Identities = 30/116 (25%), Positives = 59/116 (50%), Gaps = 6/116 (5%) Query: 187 NVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNM 243 N+ GM +RG S++ L GY DMA+ Y + ++L TD ++G GG+ + + +++ Sbjct: 226 NIAMGMAQRGLSAETLRRGGYEDMARQYDVMASQALATDAVLGLAPGGLGRFINSRGEDV 285 Query: 244 SLRLVN--DLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGE 297 +R V+ ++ +T V + +PG+ S + AH + + ++ GE Sbjct: 286 PVRRVSPEEIDAALTSSSHVNFEV-TVAPGVPVSVLSCNAHAQAMNKAMAGVLAGE 340 >gi|319793416|ref|YP_004155056.1| phage-like protein [Variovorax paradoxus EPS] gi|315595879|gb|ADU36945.1| phage-like protein [Variovorax paradoxus EPS] Length = 937 Score = 45.4 bits (106), Expect = 0.018, Method: Compositional matrix adjust. Identities = 51/215 (23%), Positives = 89/215 (41%), Gaps = 25/215 (11%) Query: 143 EGVDKETADALAWREAIVHTSALLAPGAIASQSI-------AKTVASGAVLNVPFGMVER 195 +GV TA A+ + AP + Q+I A+ +A GA +V G+ ER Sbjct: 144 QGVAPGTATAVGAVSGAATYVGVKAPITLGQQAIGQGGRAMAQNLAYGATASVAGGVAER 203 Query: 196 GWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGG----MHSKQVQNMSLRLVNDL 251 G+S +L+ GY + A +D +L + +GA F G +H++ + + D Sbjct: 204 GFSRDLLKAAGYGEQAAPLEPYDKTALAAEATLGALFSGGAAALHAR--STVRGQAATDA 261 Query: 252 KEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIAD 311 +T H + ++PG T A AH L+ ++ ++R E + ++ Sbjct: 262 ALTVTT---VDHAQRGTAPGTPTDARAASAHASALSTAIEQVLRNEPANVGEQ------- 311 Query: 312 NTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLA 346 + D F +P PE + + H P P A Sbjct: 312 --MADTAFVRPVPSPEIRAELQAHVADLLPVGPAA 344 >gi|254251752|ref|ZP_04945070.1| Soluble lytic murein transglycosylase [Burkholderia dolosa AUO158] gi|124894361|gb|EAY68241.1| Soluble lytic murein transglycosylase [Burkholderia dolosa AUO158] Length = 764 Score = 40.0 bits (92), Expect = 0.94, Method: Compositional matrix adjust. 
Identities = 42/171 (24%), Positives = 67/171 (39%), Gaps = 25/171 (14%) Query: 68 RTDPHSVGTGAHLVEGLTS-LAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGAL 126 R DP + T +V+G S L + A L G + AG A+ A + G Sbjct: 117 RPDPQNTTTTDQIVQGAVSGLVQIVPAAVLGGPV-----------AGAAVGGASIGLG-- 163 Query: 127 YAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVL 186 S + EGVD T A+ E + + + P +IA+T+ AV Sbjct: 164 ---------RSEELKREGVDVGTRTAVGAVEGALGAAGAVLPAG--GSTIARTLGLVAVG 212 Query: 187 NVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS 237 + + +L++ GY +A D +L L+ FFGG+H+ Sbjct: 213 GPGMAIGQSTAEKAILKNAGYDHLADQIDPLDPTNLAASTLMAGFFGGLHA 263 >gi|222147647|ref|YP_002548604.1| Two-component sensor histidine kinase protein [Agrobacterium vitis S4] gi|221734635|gb|ACM35598.1| Two-component sensor histidine kinase protein [Agrobacterium vitis S4] Length = 445 Score = 39.7 bits (91), Expect = 1.0, Method: Compositional matrix adjust. Identities = 29/129 (22%), Positives = 62/129 (48%), Gaps = 14/129 (10%) Query: 140 HQIEGVDKETADALAW----REAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVER 195 H++ G +E LA R+ + + P ++++A+ + G +L V Sbjct: 77 HRVSGTVQEFPSGLALDMPPRQVSIVRTDQTPPQQQRARAVARRLPDGNILFV------- 129 Query: 196 GWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHS-KQVQNMSLRLVNDLKEG 254 GWS+ ++ M + +F + ++T GL A F G+++ ++V M +R+ + Sbjct: 130 GWSTA--DNEQAASMVERGLLFGLVPVLTFGLAAAVFFGLNAHRRVNEMQIRIAQIVAGD 187 Query: 255 ITERLPYKH 263 + +RLPY++ Sbjct: 188 LKQRLPYRN 196 >gi|296123820|ref|YP_003631598.1| hypothetical protein Plim_3586 [Planctomyces limnophilus DSM 3776] gi|296016160|gb|ADG69399.1| Tetratricopeptide TPR_2 repeat protein [Planctomyces limnophilus DSM 3776] Length = 1077 Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust. 
Identities = 45/125 (36%), Positives = 55/125 (44%), Gaps = 10/125 (8%) Query: 84 LTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIE 143 L SL + A AA+AGK LS P R ALQ A +A KAE+ I Q E Sbjct: 529 LFSLGDFEASAAVAGKYLSMFPQATQRRRAYALQGLAYAKAQQWA----KAEAVI-KQFE 583 Query: 144 GV---DKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSK 200 D A AL +A V +A P A+A K +A+G N PF GWS Sbjct: 584 AEFPGDPAVAAALM-DQAEVAEAAKQWPVALADFEKLKRLAAGTT-NEPFAWRGTGWSRF 641 Query: 201 VLEDH 205 L D+ Sbjct: 642 RLGDY 646 >gi|297203976|ref|ZP_06921373.1| O-antigen polymerase [Streptomyces sviceus ATCC 29083] gi|297148540|gb|EDY57206.2| O-antigen polymerase [Streptomyces sviceus ATCC 29083] Length = 479 Score = 37.7 bits (86), Expect = 4.4, Method: Compositional matrix adjust. Identities = 30/91 (32%), Positives = 43/91 (47%), Gaps = 7/91 (7%) Query: 74 VGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHK 133 VGT A+ V L L Y+A L G++L P P TR +++ L L AY++ Sbjct: 63 VGTPAN-VFALLGLLWYLA-TWLGGRIL---PAPGTRFVRVSM--CVLGTAVLMAYIADA 115 Query: 134 AESSIHHQIEGVDKETADALAWREAIVHTSA 164 S H ++ G D+ L W +V TSA Sbjct: 116 MRESSHQEVLGADRGLIGYLVWVSLVVLTSA 146 Searching..................................................done Results from round 2 >gi|254781203|ref|YP_003065616.1| hypothetical protein CLIBASIA_05550 [Candidatus Liberibacter asiaticus str. psy62] gi|254040880|gb|ACT57676.1| hypothetical protein CLIBASIA_05550 [Candidatus Liberibacter asiaticus str. psy62] gi|317120669|gb|ADV02492.1| hypothetical protein SC1_gp035 [Liberibacter phage SC1] gi|317120813|gb|ADV02634.1| hypothetical protein SC1_gp035 [Candidatus Liberibacter asiaticus] Length = 478 Score = 742 bits (1915), Expect = 0.0, Method: Composition-based stats. 
Identities = 478/478 (100%), Positives = 478/478 (100%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ 60 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ Sbjct: 1 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQ 60 Query: 61 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP 120 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP Sbjct: 61 PNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAP 120 Query: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV Sbjct: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 Query: 181 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV 240 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV Sbjct: 181 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQV 240 Query: 241 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH 300 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH Sbjct: 241 QNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPH 300 Query: 301 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL 360 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL Sbjct: 301 FDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVEREL 360 Query: 361 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP 420 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP Sbjct: 361 SEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP 420 Query: 421 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL 478 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL Sbjct: 421 HVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIKTKSLIKEAINCFLRTGGSL 478 >gi|332160978|ref|YP_004297555.1| hypothetical protein YE105_C1356 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665208|gb|ADZ41852.1| Hypothetical phage protein [Yersinia enterocolitica subsp. 
palearctica 105.5R(r)] gi|330862134|emb|CBX72298.1| hypothetical protein YEW_AK02350 [Yersinia enterocolitica W22703] Length = 430 Score = 282 bits (720), Expect = 1e-73, Method: Composition-based stats. Identities = 74/330 (22%), Positives = 141/330 (42%), Gaps = 13/330 (3%) Query: 39 INMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSL-APYIAGAALA 97 +N A + + V+ + G+ + G+ + AP A Sbjct: 51 LNKVAFAASQGVSTLLSPVAQAIDRATGTNANAFFDGSWTEGFRKTAEIQAPEATVTTTA 110 Query: 98 GKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWRE 157 G++L+ + ++R G + + PL GA+ A + ++ +G+D TA + Sbjct: 111 GQILNGLGDVMSRAVGGTVAAGPLG-GAVLAGGTEAIFANDEGLRKGLDPLTAAGKGVLD 169 Query: 158 AIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIF 217 + + L P A ++++ VA+GA N+ G V+RG +++ LE GY DMAQ Y+++ Sbjct: 170 GVSLGAGTLVPAAPFAKTLLSRVAAGAASNIAIGAVQRGTTAEWLEQRGYKDMAQQYKVW 229 Query: 218 DMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFD 277 D +++ DG++GA FGG+ + D + +H + ++PG+ T Sbjct: 230 DATAMLADGVLGAAFGGLAH-----IGAAATPDSVDAALTARNAQHFREDTAPGIPTDIP 284 Query: 278 AYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADN---TLEDPHFKPHLPEPEPLPQYKE 334 + AH L D + RGE D + + D + +F E P P + Sbjct: 285 SNIAHQRALETATDQINRGE--PVDVANIDGVFDAHFIARDGSNFAEQPAEIAPRPVAES 342 Query: 335 HSDRQKPSEPLAEHPHPKRKEVERELSEIE 364 + Q P + AE P+ + R+++ + Sbjct: 343 EATFQ-PEKTTAETATPEADPILRDINNAD 371 >gi|268589386|ref|ZP_06123607.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315413|gb|EFE55866.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 594 Score = 239 bits (610), Expect = 6e-61, Method: Composition-based stats. 
Identities = 76/369 (20%), Positives = 140/369 (37%), Gaps = 58/369 (15%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSPDIKW--------HTGLGKEVINMPARSL----DK 48 M + ++ I + + Q P S D + +TGL +I P + L D Sbjct: 1 MSYFGLNPTRINQQLDDAMQSPENSGDADFFDGAFTSTYTGLYSGLIAKPEQVLWGIADT 60 Query: 49 LVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPL 108 +V+P E ++Q + S A + + SL P A AG+++ + Sbjct: 61 VVSPIAREVNEQFDINDTSEQFIQEQRKNAE--KQVRSLTPDRATTGTAGQVMFSLFDIG 118 Query: 109 TRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP 168 A+ PL L + ++ + +GVDK TA A E + +L P Sbjct: 119 GEALTGAMIGGPLGGAMLVGGVQGFSDYE-KLRADGVDKNTAINKATGEGLFAGLGVLTP 177 Query: 169 -------GAIASQSI---------------------AKTVASGAVLNVPFGMVERGWSSK 200 G I ++SI + + N+ GM +RG++S+ Sbjct: 178 MTLGFKGGGILAESIGAQFTARGGTLSSLAGTAARATPDIVYASGSNIAMGMAQRGFASQ 237 Query: 201 VLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRL--VNDLKEGI 255 +L++ GY +A Y ++D +++ DG++G FGGM + + +N+ L + + Sbjct: 238 ILKERGYNQLASQYDVYDKQAIAIDGVLGVAFGGMGRYINSRGENVPLPEFDTPHVDAAL 297 Query: 256 TERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLE 315 T H PG+ + + + H + ++ L +G D + L+ Sbjct: 298 TANQQL-HLEADLPPGIPINAMSLDGHLAAMNKAMNDLSQGN--PVDIGSI-------LD 347 Query: 316 DPHFKPHLP 324 F H P Sbjct: 348 GAEFLVHRP 356 >gi|317120710|gb|ADV02532.1| hypothetical protein SC2_gp040 [Liberibacter phage SC2] gi|317120771|gb|ADV02592.1| hypothetical protein SC2_gp040 [Candidatus Liberibacter asiaticus] Length = 408 Score = 235 bits (599), Expect = 1e-59, Method: Composition-based stats. 
Identities = 85/378 (22%), Positives = 148/378 (39%), Gaps = 38/378 (10%) Query: 9 EDIRDNIKEWAQR------PRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPN 62 E + IK P PD + T + +V ++P+ + E D Sbjct: 10 EKLLQQIKHAMDAGFYRYDPPKKPDYGFWTNITNDVASIPSEFIKGT----AEGQVDVIT 65 Query: 63 YYRGSRTD--PHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLA--GLALQS 118 S PH+ T + A G LS T + A + L + Sbjct: 66 SISTSLGYYTPHNKITSKPWYNVAEDVGVMGGVAHGIGHFLSAFGTGFSLFAINPVTLPA 125 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIAS---QS 175 +P G A + + EGV ETA A ++ T A G+++ +S Sbjct: 126 SPFI-GLATASSASGTRRYKELRDEGVAHETAKIGA----LITTGTTFAGGSVSGVIGKS 180 Query: 176 IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM 235 + +G NV FG+ ER L+ G+ D+AQHYR D T+ +IGA G + Sbjct: 181 LVSKAVTGGATNVAFGLGERQSIGAYLDYKGHKDLAQHYREVDGIHTTTEFIIGAGLGAL 240 Query: 236 HSK-----QVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGV 290 H K ++ + + +K I + + S+P + T+ + E H TL + Sbjct: 241 HGKGGKHPDIKPSDVDIAQVVKRDIDD-------IYHSAPAIATTSRSAELHAQTLEQAI 293 Query: 291 DSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLA-EHP 349 + + RGE + D + + + + + P + + L Q ++ +Q+ S+P A + Sbjct: 294 EKMRRGEEINVDPKSIDLMTKDMITKPEVEFSPELKKQLKQGEDFLAQQEVSKPKALKEQ 353 Query: 350 HPKRKEV---ERELSEIE 364 P +V ER L+++E Sbjct: 354 DPLSSQVPEYERRLTDLE 371 >gi|301028421|ref|ZP_07191667.1| conserved domain protein [Escherichia coli MS 196-1] gi|299878532|gb|EFI86743.1| conserved domain protein [Escherichia coli MS 196-1] Length = 686 Score = 219 bits (558), Expect = 8e-55, Method: Composition-based stats. 
Identities = 57/211 (27%), Positives = 101/211 (47%), Gaps = 6/211 (2%) Query: 32 TGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYI 91 G K +I+ PA + D VAP + + D + G L + + + P Sbjct: 57 VGFSKRLISDPAFTAD--VAPTVNIFREMFPDADKTLNDTYDT-IGKQLQDARSYVKPDA 113 Query: 92 AGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETAD 151 +A ++L+ + G + PL GA A+ S +S + +GVD+ TA Sbjct: 114 GSQGMAAEVLNELG-KFVPAIGTTMFGGPLI-GAATAFSSTYEQSYQDFKGKGVDEATAR 171 Query: 152 ALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA 211 LA ++++ + + P A+ + ++A +ASG +N FG + R LE+ GY +MA Sbjct: 172 NLATQQSLFNAVGMALPAAVGT-TLATRIASGVAINTGFGGLNRYSVGATLEEKGYTEMA 230 Query: 212 QHYRIFDMESLITDGLIGAFFGGMHSKQVQN 242 + YR+FD ++++ D ++G FGG+H N Sbjct: 231 KQYRVFDGQAMLVDAVLGGVFGGVHHLTTHN 261 Score = 53.8 bits (127), Expect = 7e-05, Method: Composition-based stats. Identities = 30/147 (20%), Positives = 59/147 (40%), Gaps = 13/147 (8%) Query: 222 LITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEA 281 + + + FG ++++ + + L EG+ H SSP LHTS ++ + Sbjct: 473 MKLEAAVEKVFGIRARERIKPSDIDAAHILNEGL-------HYDIESSPVLHTSNESINS 525 Query: 282 HTDTLAHGVDSLVRGEYPHFD--QEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQ 339 H D + L G+ + L + + D ++ E + ++E+ R Sbjct: 526 HVDAMDEAYRQLNDGQPVNVGGMARGLDGPLRSDISDT-YQEQYHEIQ--KVFEENGVRY 582 Query: 340 KP-SEPLAEHPHPKRKEVERELSEIEG 365 + SEP++E P P+ + E G Sbjct: 583 ETSSEPISESPVPRAESAFSSAGEHRG 609 >gi|304398391|ref|ZP_07380265.1| hypothetical protein PanABDRAFT_3526 [Pantoea sp. aB] gi|304354257|gb|EFM18630.1| hypothetical protein PanABDRAFT_3526 [Pantoea sp. aB] Length = 625 Score = 212 bits (539), Expect = 1e-52, Method: Composition-based stats. 
Identities = 86/413 (20%), Positives = 156/413 (37%), Gaps = 38/413 (9%) Query: 16 KEWAQRPRVSPD---IKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNY--YRGSRT- 69 + A + PD +W+ G G + A L E P Y RG Sbjct: 16 DDQAASKQAQPDDYDPRWYAGSGSALFRGAAEGTIGLGQTLVETAKLSPTYSALRGDLPE 75 Query: 70 -----DPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAG 124 D + L + S+ P +A ++L + T + P+A G Sbjct: 76 LDEIVDQNFSAVQKSLNDARNSVKPAPNSQGMAAEILEGLGT-FAPAIAATAVAGPVAGG 134 Query: 125 ALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGA 184 A+ A+ S + +GV+++TA LA +A + + P + + +A + SG Sbjct: 135 AV-AFGSSYESTRQDFLAKGVNEDTAGTLALEQAGANALGMALPAGVGGR-LATRLLSGV 192 Query: 185 VLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMS 244 +N FG V R + LE++GY ++A+ YR++D ++L+ DG++GA FGG+H Sbjct: 193 GINTGFGAVNRFALGETLEENGYDELAKQYRVWDKQALLVDGVLGAAFGGVHHLTSPRAD 252 Query: 245 LRLVNDL-----KEGITERLPYKHGVKSSSPGL------------HTSFDAYEAHTDTLA 287 L + + +T+ + + ++D+ A LA Sbjct: 253 TPLADPAPVSAGESAVTDAPAALRADADPAQTVVAEDSPLPAGEPAVTYDSRIAEMQDLA 312 Query: 288 HGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAE 347 V + RG+ QE + + + + PL + + + Sbjct: 313 GQV--ISRGDRKALAQEVHDLQYQH--DQATTQLQQVKNTPLSGSGKALAQARAQRTAQV 368 Query: 348 HPHPKRKEVERELSEIEGAK-KESSARKFFDEGSPDHSPFKGERNQKLDPMRG 399 + R + +E + GA+ +SS F E D S + E+ + MRG Sbjct: 369 NELDMRIGLLKEQIDQRGARLADSSPGGRFYEARSDLS--RIEQGLIPESMRG 419 Score = 39.1 bits (89), Expect = 1.7, Method: Composition-based stats. 
Identities = 60/376 (15%), Positives = 112/376 (29%), Gaps = 42/376 (11%) Query: 37 EVINMPARSLDK-LVAPFREETHDQPNYYRGSRTDPHSVGTGAH-LVEGLTSLAPYIAGA 94 V + A +D L A F H DP V G + + +L A Sbjct: 223 RVWDKQALLVDGVLGAAFGGVHHLTSPRADTPLADPAPVSAGESAVTDAPAALRADADPA 282 Query: 95 ALAGKLLSFIPTPLTRLAGLALQS--APLAAGALYAYLSHKAESSIHHQIEGVDKETADA 152 S +P + + + LA + +H D+ T Sbjct: 283 QTVVAEDSPLPAGEPAVTYDSRIAEMQDLAGQVISRGDRKALAQEVHDLQYQHDQATTQL 342 Query: 153 LAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQ 212 + + S +Q+ A+ A L++ G+++ + + Sbjct: 343 QQVKNTPLSGSGKAL-----AQARAQRTAQVNELDMRIGLLKEQIDQRGARLADSSPGGR 397 Query: 213 HYRIFDMESLITDGLIGAFFGGM-HSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPG 271 Y S I GLI G+ Q++ + + + EG+ + SSP Sbjct: 398 FYEARSDLSRIEQGLIPESMRGLVPEAQIKPSDVDAAHVMNEGL-------YYDLESSPV 450 Query: 272 LHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQE--------KLQTIADNTLEDPHFK--- 320 +H+ ++ +H + L+ GE + + + IA + Sbjct: 451 VHSGNESLNSHVAAMDQASRQLLSGEPVNVSAQIRGLDGIARPDAIATGEAQRAELSAAY 510 Query: 321 ---------PHLPEPE--PLPQYKEHSDRQ--KPSEPLAEHPHPKRKE-VERELSEIEGA 366 P EP P+ + + + +PS P P E + ++ A Sbjct: 511 RENGIAETVPQNAEPSIPPVREGSAFAGGRSAEPSSPEQISTDPVTGESISSNSYDLMAA 570 Query: 367 KKESSARKFFDEGSPD 382 + S A PD Sbjct: 571 RDMSQANADIMIAHPD 586 >gi|30387395|ref|NP_848224.1| hypothetical protein epsilon15p16 [Enterobacteria phage epsilon15] gi|30266050|gb|AAO06079.1| 16 [Salmonella phage epsilon15] Length = 634 Score = 208 bits (530), Expect = 1e-51, Method: Composition-based stats. 
Identities = 59/248 (23%), Positives = 106/248 (42%), Gaps = 6/248 (2%) Query: 32 TGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYI 91 G K +I+ PA + D VAP + + + G L + + P Sbjct: 57 VGFSKRLISDPAFTAD--VAPTVNIFRVMFPDADKALNETYDT-IGKQLQDARGYVKPDA 113 Query: 92 AGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETAD 151 A ++L + G + P GA A+ S +S + +GVD+ TA Sbjct: 114 GSQGTAAEVLYGLG-QFVPAIGATIFGGP-TVGAATAFSSTYEQSYQDFKGKGVDETTAR 171 Query: 152 ALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA 211 LA ++++ + + + P A+ + ++ +ASG +N FG + R + LE+ GY +MA Sbjct: 172 NLATQQSLFNAAGMALPAAVGT-TLTTRIASGVAINTGFGGLNRYSVGETLEEKGYTEMA 230 Query: 212 QHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPG 271 + YR+FD ++++ D ++GA FGG H +N + D + I S P Sbjct: 231 KQYRVFDGQAMLVDAVLGAAFGGAHHLAARNADVPPPPDSEAPIPAAEVQSVPDNSPQPQ 290 Query: 272 LHTSFDAY 279 ++ Sbjct: 291 AESAPQPA 298 >gi|315122889|ref|YP_004063378.1| hypothetical protein CKC_05725 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496291|gb|ADR52890.1| hypothetical protein CKC_05725 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 363 Score = 195 bits (496), Expect = 1e-47, Method: Composition-based stats. 
Identities = 48/250 (19%), Positives = 97/250 (38%), Gaps = 18/250 (7%) Query: 85 TSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGA-------LYAYLSHKAESS 137 +L G++ + ++ A+ + L L+ + Sbjct: 68 NALTVDPEETGAIGQIGHSLLHSVSAFGIGAMAGGSIGGPLGALAGGFLSVALAEGRRAF 127 Query: 138 IHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGW 197 + + EG D TA + ++ + L P + +AS V N+ ++R Sbjct: 128 ENARDEGQDSSTATKGGMKTGVISGAGALIPAGFGVSVVKSAIASAGV-NLGLSKLDRMG 186 Query: 198 SSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQN-------MSLRLVND 250 +L+ +GY ++A+H D S+ TD ++G FGG+H+K + D Sbjct: 187 DYAILKANGYDELAEHASEMDSISIATDIVLGMAFGGLHAKNARRNKKLVGMKPTPSEGD 246 Query: 251 LKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIA 310 + G L + + P T+ +++E H +A +LV GE D +KL+ + Sbjct: 247 IATGAKNELMTSRTLNDAIP---TTNESFETHMSAIAEAEHALVNGEKFGLDSQKLEALE 303 Query: 311 DNTLEDPHFK 320 +++ P + Sbjct: 304 RGSIKKPDIE 313 >gi|315121927|ref|YP_004062416.1| hypothetical protein CKC_00885 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495329|gb|ADR51928.1| hypothetical protein CKC_00885 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 326 Score = 194 bits (493), Expect = 2e-47, Method: Composition-based stats. 
Identities = 48/250 (19%), Positives = 97/250 (38%), Gaps = 18/250 (7%) Query: 85 TSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGA-------LYAYLSHKAESS 137 +L G++ + ++ A+ + L L+ + Sbjct: 31 NALTVDPEETGAIGQIGHSLLHSVSAFGIGAMTGGSIGGPLGALAGGFLSVALAEGRRAF 90 Query: 138 IHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGW 197 + + EG D TA + ++ + L P + +AS V N+ ++R Sbjct: 91 ENARDEGQDSSTATKGGMKTGVISGAGALIPAGFGVSVVKSAIASAGV-NLGLSKLDRMG 149 Query: 198 SSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQN-------MSLRLVND 250 +L+ +GY ++A+H D S+ TD ++G FGG+H+K + D Sbjct: 150 DYAILKANGYDELAEHASEMDSISIATDIVLGMAFGGLHAKNARRNKKLAGMKPTPSEGD 209 Query: 251 LKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIA 310 + G L + + P T+ +++E H +A +LV GE D +KL+ + Sbjct: 210 IATGAKNELMTSRTLNDAVP---TTNESFETHMSAIAEAEHALVNGEKFGLDSQKLEALE 266 Query: 311 DNTLEDPHFK 320 +++ P + Sbjct: 267 RGSIKKPDIE 276 >gi|298381706|ref|ZP_06991305.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279148|gb|EFI20662.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 600 Score = 189 bits (480), Expect = 7e-46, Method: Composition-based stats. 
Identities = 65/371 (17%), Positives = 127/371 (34%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGSALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDVDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|218700978|ref|YP_002408607.1| hypothetical protein ECIAI39_2668 [Escherichia coli IAI39] gi|218370964|emb|CAR18791.1| conserved hypothetical protein from phage origin [Escherichia coli IAI39] Length = 600 Score = 189 bits (479), Expect = 9e-46, Method: Composition-based stats. 
Identities = 65/371 (17%), Positives = 127/371 (34%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSINDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDIDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGH 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|309702800|emb|CBJ02131.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 600 Score = 189 bits (479), Expect = 9e-46, Method: Composition-based stats. 
Identities = 74/403 (18%), Positives = 134/403 (33%), Gaps = 55/403 (13%) Query: 3 FNAVSDEDIRDNIKEWAQRP----------RVSPDIKWHTGLGKEVINMPAR----SLDK 48 NAV+ + E A P S +GL ++ P + +DK Sbjct: 6 LNAVNQ---NQQLDEAASNPAGFNTDVGFFDNSGTAA-VSGLYSGLVAKPDQLLWAGMDK 61 Query: 49 LVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPL 108 +V+P + ++ + S A + + L P A AG++L + Sbjct: 62 IVSPIAKFVNENTSINDTSAEYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLHGLFDMG 119 Query: 109 TRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP 168 + L S P A L +E +GVD TA + + + L P Sbjct: 120 GQAVVGTLLSGPAGGAAAVTALQGFSEFE-RLTAQGVDFRTAQEAGLVQGVTAGAGTLIP 178 Query: 169 GAIASQS-----------------------------IAKTVASGAVLNVPFGMVERGWSS 199 ++ ++ A +A A N+ FGM +RG ++ Sbjct: 179 MSLGLRAGGALAESVGAQLARTGESAVRNVAATAVRAAPDIAYAAGTNIAFGMAQRGLTA 238 Query: 200 KVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDL----KEGI 255 K L D GY +MA Y +FD +S+ D ++G FGG+ + + Sbjct: 239 KTLRDGGYNEMANQYDVFDRQSIAIDAVLGVAFGGVGRFLNARGESAAAPEFSPAEVDAA 298 Query: 256 TERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLE 315 H +PG+ + + +AH L ++ + +G + Sbjct: 299 LAANASHHAEIDVAPGVPVNVLSRDAHIQALQKAMNDVSQGRAVDVTSIAEPASFSDIPG 358 Query: 316 DPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVER 358 + + E L + +E S + E + +VE+ Sbjct: 359 RRNLISQAID-ETLYRTEEGSTQVAVDTRALEQQAAQALDVEQ 400 >gi|300898439|ref|ZP_07116780.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357906|gb|EFJ73776.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 600 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. 
Identities = 65/371 (17%), Positives = 127/371 (34%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPISLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDVDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|323948673|gb|EGB44578.1| hypothetical protein ERKG_04896 [Escherichia coli H252] Length = 600 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. 
Identities = 65/371 (17%), Positives = 127/371 (34%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSINDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDIDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|324008548|gb|EGB77767.1| hypothetical protein HMPREF9532_01735 [Escherichia coli MS 57-2] Length = 600 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. 
Identities = 65/371 (17%), Positives = 127/371 (34%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSINDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDIDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|332344342|gb|AEE57676.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 600 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. 
Identities = 66/371 (17%), Positives = 127/371 (34%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDVDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + G D + +E F L Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSEGR--PVDVASI-------VESASFSEILGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|117624700|ref|YP_853613.1| hypothetical protein APECO1_4053 [Escherichia coli APEC O1] gi|115513824|gb|ABJ01899.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 600 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. 
Identities = 67/371 (18%), Positives = 128/371 (34%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSINDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGVFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP-------GAI 171 P+ A L +E +GVD TA + I + L P G Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGALIPMSLWLRAGGA 188 Query: 172 ASQSIAK----------------------TVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 ++ +A +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDIDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|89152440|ref|YP_512273.1| hypothetical protein PhiV10p19 [Escherichia phage phiV10] gi|74055463|gb|AAZ95912.1| hypothetical protein PhiV10p19 [Escherichia phage phiV10] Length = 600 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. 
Identities = 66/371 (17%), Positives = 130/371 (35%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRP-RVSPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAALNPVGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQLVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A +AG++L + + Sbjct: 72 ENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGIAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDVDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ ++ + +H L + + +G D + +E F L Sbjct: 309 IDIAPGVPSNVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEILGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|323156121|gb|EFZ42280.1| hypothetical protein ECEPECA14_1896 [Escherichia coli EPECa14] Length = 600 Score = 187 bits (473), Expect = 5e-45, Method: Composition-based stats. 
Identities = 66/371 (17%), Positives = 126/371 (33%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPVGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVIGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + + S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGEATSTPNFSPVDVDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 SPG+ + + +H L + + +G D + +E F Sbjct: 309 IDISPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|331648164|ref|ZP_08349254.1| hypothetical protein ECIG_04090 [Escherichia coli M605] gi|331043024|gb|EGI15164.1| hypothetical protein ECIG_04090 [Escherichia coli M605] Length = 600 Score = 186 bits (471), Expect = 7e-45, Method: Composition-based stats. 
Identities = 64/371 (17%), Positives = 125/371 (33%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGSAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +R ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRVLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + + S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGEPTSAPNFSPVDIDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|85059172|ref|YP_454874.1| hypothetical protein SG1194 [Sodalis glossinidius str. 'morsitans'] gi|84779692|dbj|BAE74469.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 490 Score = 186 bits (471), Expect = 8e-45, Method: Composition-based stats. 
Identities = 73/396 (18%), Positives = 132/396 (33%), Gaps = 60/396 (15%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSP---DIKWHTGLGKEVINMPARS-----------L 46 M + S + ++ P + D + G G + L Sbjct: 1 MSYFGFSPTQQNKALAYASEHPIGTGTLQDAAFFDGAGTALFEGLWSGVRQADQVGWAAL 60 Query: 47 DKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPT 106 D +++P E + S A + + L P AG++L + Sbjct: 61 DTVMSPVAEAVSETFGVRDSSADFFKEQRKLAE--KSVRELTPDPGTTGTAGQVLYSLGQ 118 Query: 107 PLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALL 166 + +L P A A L ++ + +GVD TA A ++ Sbjct: 119 LGGQAIAGSLMGGPWGAAATVGTLQGFSDYE-KSRADGVDYGTAVDKALVTGGTAALGVV 177 Query: 167 AP------------GAIASQSIAKTVASG----------------AVLNVPFGMVERGWS 198 P +++ ASG A N+ GM +RG S Sbjct: 178 LPMSLGLRAGGAVAEGVSAALSVGRGASGALAGAVARAAPDLFYSAGTNIAMGMAQRGLS 237 Query: 199 SKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVN--DLKE 253 ++ L GY DMA+ Y + D ++L TD ++G FGG+ + + +++ +R V+ ++ Sbjct: 238 AETLRRGGYEDMARQYDVMDAQALATDAVLGVAFGGLGRFINSRGEDVPVRRVSPEEIDA 297 Query: 254 GITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNT 313 +T V + +PG+ S + AH + + ++ GE D L Sbjct: 298 ALTSSSHVNFEV-TVAPGVPVSVLSRNAHAQAMNKAMTDVLAGE--PVDVGAL------- 347 Query: 314 LEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHP 349 LE F +P Q + + A Sbjct: 348 LEGAEFLQKMPRVNLASQSVREALGLRGGATTAAEQ 383 >gi|215487809|ref|YP_002330240.1| hypothetical protein E2348C_2742 [Escherichia coli O127:H6 str. E2348/69] gi|215265881|emb|CAS10290.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 600 Score = 179 bits (453), Expect = 9e-43, Method: Composition-based stats. 
Identities = 73/403 (18%), Positives = 135/403 (33%), Gaps = 55/403 (13%) Query: 3 FNAVSDEDIRDNIKEWAQRP----------RVSPDIKWHTGLGKEVINMPAR----SLDK 48 NAV+ + E A P S +GL ++ P + +DK Sbjct: 6 LNAVNQ---NQQLDEAASNPAGFNTDVGFFDNSGTAA-VSGLYSGLVAKPDQLLWAGMDK 61 Query: 49 LVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPL 108 +V+P + ++ + S A + + L P A AG++L+ + Sbjct: 62 IVSPIAKFVNENTSINDTSAEYIGEQRKLAE--QQVKRLTPDAATTGTAGQVLNGLFDMG 119 Query: 109 TRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP 168 + L + P A L +E +GVD TA + + + L P Sbjct: 120 GQAVVGTLLAGPAGGAAAVTALQGFSEFE-KLTAQGVDFRTAQEAGLVQGVTAGAGTLIP 178 Query: 169 GAIASQS-----------------------------IAKTVASGAVLNVPFGMVERGWSS 199 ++ ++ A +A A N+ FGM +RG ++ Sbjct: 179 MSLGLRAGGALAESVGAQLARTGESAVRNVAATAVRAAPDIAYAAGTNIAFGMAQRGLTA 238 Query: 200 KVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDL----KEGI 255 K L D GY +MA Y +FD +S+ D ++G FGG+ + + Sbjct: 239 KTLRDGGYNEMAAQYDVFDRQSIAIDAVLGVAFGGVGRFLNARGESAATPEFSPAEVDAA 298 Query: 256 TERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLE 315 H +PG+ + + +AH L ++ + +G + Sbjct: 299 LAANASHHAEIDVAPGVPVNVLSRDAHIQALQKAMNDVSQGRAVDVASIAEPASFSDIPG 358 Query: 316 DPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVER 358 + + E L + +E S + E + +VE+ Sbjct: 359 RRNLISQAID-ETLYRSEEGSTQIAVDTRALEQQAAQALDVEQ 400 >gi|327252172|gb|EGE63844.1| hypothetical protein ECSTEC7V_3019 [Escherichia coli STEC_7v] Length = 600 Score = 177 bits (447), Expect = 4e-42, Method: Composition-based stats. 
Identities = 65/371 (17%), Positives = 126/371 (33%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVIGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQ---- 174 P A L +E +GVD TA + I + + P ++ + Sbjct: 130 GPAGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTMIPMSLGLRAGGA 188 Query: 175 -------------------------SIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 S +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVSATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + ++ S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVAFGGVGRFINSRGESTSAPNFSPVDIDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|330007167|ref|ZP_08305909.1| hypothetical protein HMPREF9538_03598 [Klebsiella sp. MS 92-3] gi|328535514|gb|EGF61974.1| hypothetical protein HMPREF9538_03598 [Klebsiella sp. MS 92-3] Length = 632 Score = 174 bits (440), Expect = 3e-41, Method: Composition-based stats. 
Identities = 72/383 (18%), Positives = 140/383 (36%), Gaps = 33/383 (8%) Query: 32 TGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYI 91 G K +I+ PA + + VAP + + + G L + P Sbjct: 57 VGFSKRLISDPAFTDN--VAPTINMFRVMFPDADKALNESYDD-LGKQLSSAREYIKPEA 113 Query: 92 AGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETAD 151 +A +++ + G ++ GA A S ++ +GVD++TA Sbjct: 114 GSQGVAAQVIHGLGQ-FAPAIGASVIGG-PVVGAAAAAGSTYEQAYQDALAKGVDEQTAR 171 Query: 152 ALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA 211 +A ++ + + P A+ + +A + SG +N FG + R + LED+GY DMA Sbjct: 172 TVAAEQSGFNAVGMGLPAAVGGR-LATRLLSGVGINAAFGGLNRFAVGETLEDNGYADMA 230 Query: 212 QHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPG 271 + YR+FD ++++ D ++GA FGG H + S+ D + + + Sbjct: 231 KQYRVFDGQAILIDSVLGAAFGGAHHFAARGNSVDARADSTPAVDDGTTAQEP------- 283 Query: 272 LHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTI---ADNTLEDPHFKPHLPEPEP 328 + E P + + D + + L E + Sbjct: 284 ----------------AATAEIQPQEQPPVSPAQESGVVPDTDASAPGATYDSRLAELQQ 327 Query: 329 LP-QYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFK 387 L Q DR+ ++ + + + E+ + + SS+R+ + Sbjct: 328 LAGQVLSRGDRKVLTDEIHRAEYEIARIGEQRQALRDQRVGNSSSRRIRNRELAALEQRV 387 Query: 388 GERNQKLDPMRGADFTDAPHAKF 410 E +++P R A P +F Sbjct: 388 QEIQSRIEPSRQALADSTPGGRF 410 >gi|85059663|ref|YP_455365.1| hypothetical protein SG1685 [Sodalis glossinidius str. 'morsitans'] gi|84780183|dbj|BAE74960.1| hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 490 Score = 169 bits (427), Expect = 1e-39, Method: Composition-based stats. 
Identities = 71/396 (17%), Positives = 128/396 (32%), Gaps = 60/396 (15%) Query: 1 MYFNAVSDEDIRDNIKEWAQRPRVSP---DIKWHTGLGKEVINMPARS-----------L 46 M + + S + A+ P + D + G G + L Sbjct: 1 MSYFSFSPTQQNKALAYAAEHPIGTGTLQDAAFFDGAGTALFKGLWSGVRQADQVGWAAL 60 Query: 47 DKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPT 106 D ++P + + S + A + L P + AG++L + Sbjct: 61 DTAISPVADAVSETFGVRDFSADFFKAQRKLAETR--VRELTPDLGTTGTAGQVLFSLGQ 118 Query: 107 PLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALL 166 + +L P +A A L + + +GVD TA A + Sbjct: 119 LGGQAIAGSLMGGPWSAAATVGTLQGFS-YYEKSRADGVDYGTAVDKALVTGGTAALGAV 177 Query: 167 AP------------GAIASQSIAKTVASG----------------AVLNVPFGMVERGWS 198 P +++ ASG A N+ GM +RG S Sbjct: 178 LPMSLGLRAGGAVAEGVSAALSVGRGASGALAGAVARAAPDLFYSAGTNIAMGMAQRGLS 237 Query: 199 SKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVN--DLKE 253 ++ L GY DMA+ Y + ++L TD ++G GG+ + + +++ +R V+ ++ Sbjct: 238 AETLRRGGYEDMARQYDVMASQALATDAVLGLAPGGLGRFINSRGEDVPVRRVSPEEIDA 297 Query: 254 GITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNT 313 +T V + +PG+ S + AH + + ++ GE D L Sbjct: 298 ALTSSSHVNFEV-TVAPGVPVSVLSCNAHAQAMNKAMAGVLAGE--PVDVGAL------- 347 Query: 314 LEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHP 349 LE F P Q + A Sbjct: 348 LEGAEFLQKTPRVNLASQSVREELGLRGEATTAAEQ 383 >gi|320175033|gb|EFW50146.1| 16 [Shigella dysenteriae CDC 74-1112] Length = 600 Score = 167 bits (423), Expect = 3e-39, Method: Composition-based stats. 
Identities = 65/371 (17%), Positives = 126/371 (33%), Gaps = 58/371 (15%) Query: 12 RDNIKEWAQRPRV-SPDIKWH--------TGLGKEVINMPAR----SLDKLVAPFREETH 58 + E A P + D+ + +GL ++ P + +DK+V+P + + Sbjct: 12 NQQLDEAASNPAGFNSDVGFFDNAVGAALSGLYSGLVAKPDQLLWAGMDKIVSPIAQFVN 71 Query: 59 DQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQS 118 + + S + A + + L P A AG++L + + Sbjct: 72 ENTSLNDTSVSYIAEQRKLAE--QQVKRLTPDAATTGTAGQVLYGLFDMGGQAVVGTTLG 129 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS--- 175 P+ A L +E +GVD TA + I + L P ++ ++ Sbjct: 130 GPVGGAAAVTSLQGFSEFE-RLTAQGVDFRTAQEAGLVQGITAGAGTLIPMSLGLRAGGA 188 Query: 176 --------------------------IAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 +A A N+ FGM +RG ++K L D GY + Sbjct: 189 LAEGVAAQLARTGESSVRRAAATAVRATPDIAYAAGTNIAFGMAQRGLTAKTLRDGGYSE 248 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMH---SKQVQNMSLRLVNDLKEGITERLPYKHGVK 266 MA Y + D +++ D ++G FGG+ + + + S + + H + Sbjct: 249 MANQYDVLDRQAIAIDAVLGVVFGGVGRFINSRGEPTSAPNFSPVDIDAALAANAAHHAE 308 Query: 267 -SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPE 325 +PG+ + + +H L + + +G D + +E F Sbjct: 309 IDIAPGVPINVLSRNSHIQALRKAMSDVSQGR--PVDVASI-------VESASFSEIPGR 359 Query: 326 PEPLPQYKEHS 336 L Q + Sbjct: 360 KSLLSQAVNEA 370 >gi|298485994|ref|ZP_07004068.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159471|gb|EFI00518.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 448 Score = 165 bits (418), Expect = 1e-38, Method: Composition-based stats. 
Identities = 83/411 (20%), Positives = 148/411 (36%), Gaps = 47/411 (11%) Query: 31 HTGLGKEVINMPARSLDKLVAPFREET----HDQ--PNYYRGSRTDPHSVGT-------- 76 + LGK ++ + + +Q +Y + + S Sbjct: 36 YDSLGKGLVRGAIEGGAAAESTYWNAILSGGPEQNIFDYTQSTTLSRESQQKIGDDLNTL 95 Query: 77 GAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAES 136 + L P A +AG+++ L R A+ + P A + + Sbjct: 96 REETASAVMDLRPDPAEVGIAGQIIGEAAAILPRAVIGAVAAGPAGAAIAAGAPAGYSRR 155 Query: 137 SIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERG 196 ++ EG+D+ TA L E +V + + P A + + A NV GM RG Sbjct: 156 AVS-MAEGIDENTATLLGLSEGVVTGAGAILPAAQFVKPVLGDAAIAIGANVGLGMAHRG 214 Query: 197 WSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGIT 256 ++ +L+ +GY A YR D ++ TD ++GA F G+ +M + + +T Sbjct: 215 TAAALLDSNGYAAQAAQYRAMDGTAIATDAILGAAFFGIGRS---SMRRPTTDQVDAALT 271 Query: 257 ERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLED 316 ER +H ++PGL + AH D L ++ + RGE + +Q+ Sbjct: 272 ER-NAQHADIDTAPGLPVDPRSAIAHQDALRAAIEQINRGEAVVL-PDNIQSAT-FLRTP 328 Query: 317 PHFKP-HLPEPEPLPQYKEHSD--RQKPSEPLAEHPHPKRKEVERELSEIEGA------- 366 P E L +E + + A P K+V EL+ + + Sbjct: 329 DDVAPIAPSRAEALIAAREELAPVLRNELQQDATAAIPNVKDVRTELANLSKSLDGLDES 388 Query: 367 ----------------KKESSARKFFDEGSPDHSPFKGERNQKLDPMRGAD 401 + ES+AR+ + + + E N+ LD R AD Sbjct: 389 FRARAKEFQQQGQSRKQAESAARQSIADERTQLTDRQTELNESLDGNRSAD 439 >gi|319793416|ref|YP_004155056.1| phage-like protein [Variovorax paradoxus EPS] gi|315595879|gb|ADU36945.1| phage-like protein [Variovorax paradoxus EPS] Length = 937 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. 
Identities = 60/298 (20%), Positives = 106/298 (35%), Gaps = 17/298 (5%) Query: 33 GLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIA 92 GL + + PA L P + + G+ D L L A Sbjct: 43 GLARGTVAKPALLLGDAATPLLRTSAQAVDKTLGTSLDAWLTDQQKRNTTALEQLRSDPA 102 Query: 93 GAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADA 152 AG+++ + + A+ P A L Y +GV TA A Sbjct: 103 TTGFAGQVVGGLFDLGS----SAILYTPEGAAVLEGYGRR-----QELIGQGVAPGTATA 153 Query: 153 LAWREAIVHTSALLAPG-------AIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDH 205 + + AP +++A+ +A GA +V G+ ERG+S +L+ Sbjct: 154 VGAVSGAATYVGVKAPITLGQQAIGQGGRAMAQNLAYGATASVAGGVAERGFSRDLLKAA 213 Query: 206 GYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGV 265 GY + A +D +L + +GA F G + ++R +T H Sbjct: 214 GYGEQAAPLEPYDKTALAAEATLGALFSGGAAALHARSTVRGQAATDAALT-VTTVDHAQ 272 Query: 266 KSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHL 323 + ++PG T A AH L+ ++ ++R E + ++ T + P + L Sbjct: 273 RGTAPGTPTDARAASAHASALSTAIEQVLRNEPANVGEQMADTAFVRPVPSPEIRAEL 330 >gi|169795395|ref|YP_001713188.1| phage-like protein [Acinetobacter baumannii AYE] gi|169148322|emb|CAM86187.1| hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 954 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. 
Identities = 57/355 (16%), Positives = 109/355 (30%), Gaps = 47/355 (13%) Query: 2 YFNAVSDEDIRDNIKEWAQRPRVSPDI--KWHTGLGKEVINMPARSL--------DKLVA 51 +++ +D++ E QR ++ + G+ I+ P R + D + A Sbjct: 3 WYDTFADDE--QKSVEELQRKGITGKPTVQKEVGIFDGAISSPFRGMAIGLNKVGDAISA 60 Query: 52 PFR----EETHDQPNYYRGSRTDPHSVGTGA------HLVEGLTSLAPYIAGAALAGKLL 101 P ++ + +P+ +LV G + + G + Sbjct: 61 PIDAVVDRVSYSLKDVSTNEFIEPYEEFKAKREKARDNLVYGTIADLEDKDNTGIVGNIG 120 Query: 102 SFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVH 161 + L R A S L A L + +GVD+ TA +A A+ Sbjct: 121 VGVGDYLWRGALGVATSGTLGAATLTGGSTG-NYVYTDLTRKGVDENTALKVAGVNAVGD 179 Query: 162 TSALLAPGAIASQ---SIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFD 218 P + + + A + S ++L+ +GY A+ Y + Sbjct: 180 AIGTALPISYGFKGSGGLVADAALSVGGATGLNTGMQYTSEQLLKSNGYDKQAKQYEVT- 238 Query: 219 MESLITDGLIGA-FFGGMHSKQVQNMSLRLVNDLKEGITE-----------------RLP 260 ES+ TD LI + FGG + L D+ I + Sbjct: 239 GESVATDLLINSLMFGGARYLGTRQNQLD--QDVDAEINQLNSDDFETRNDALNDALVRN 296 Query: 261 YKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLE 315 ++ P T H L + +++G+ NT++ Sbjct: 297 SFEFEDTTFPVRTTDPVQQNKHYQNLDAATEQILKGQPVSVPNAVQGEPRRNTID 351 >gi|332875213|ref|ZP_08443046.1| cation diffusion facilitator family transporter [Acinetobacter baumannii 6014059] gi|332736657|gb|EGJ67651.1| cation diffusion facilitator family transporter [Acinetobacter baumannii 6014059] Length = 957 Score = 95.4 bits (235), Expect = 2e-17, Method: Composition-based stats. 
Identities = 54/331 (16%), Positives = 95/331 (28%), Gaps = 40/331 (12%) Query: 7 SDEDIRDNIKEWAQRPRVSP-DIKWHTGLGKEVINMPARSLDKLV----APFR----EET 57 + +D + Q P P D G A L+K+ AP + Sbjct: 12 NQQDFEKLNSQGLQHPDTRPNDPGVFDGAISSPFRGMAIGLNKVGDAISAPIDAVVDRVS 71 Query: 58 HDQPNYYRGSRTDPHSVGTGA------HLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRL 111 + + +P+ +LV G + + G + + L R Sbjct: 72 YSLKDVSTNEFIEPYEEFKAKREKARDNLVYGTIADLEDKDNTGIVGNIGVGVGDYLWRG 131 Query: 112 AGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAI 171 A L A L + +GVD+ TA +A A+ P Sbjct: 132 ALGVATGGTLGAATLTGGSTG-NYVYTDLTRKGVDENTALKVAGVNAVGDAIGTALPIGY 190 Query: 172 ASQ---SIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLI 228 + + A + S ++L+ +GY A+ Y + ES+ TD LI Sbjct: 191 GFKGTGGLVADAALSVGGATGLNTGMQYASEQLLKSNGYDKQAKQYEVT-GESVATDLLI 249 Query: 229 GA-FFGGMHSKQVQNMSLRLVNDLKEGITE-----------------RLPYKHGVKSSSP 270 + FGG + L D+ I + ++ P Sbjct: 250 NSLMFGGARYLGSKQNQLD--QDVDAEINQLNSDDFETRNDALNDALVKNSFEFEDTTLP 307 Query: 271 GLHTSFDAYEAHTDTLAHGVDSLVRGEYPHF 301 T H L + +++G+ Sbjct: 308 VQTTDPVQQNKHYQNLDVATEQILKGQPVSV 338 >gi|294648410|ref|ZP_06725909.1| hypothetical protein HMP0015_0118 [Acinetobacter haemolyticus ATCC 19194] gi|292825715|gb|EFF84419.1| hypothetical protein HMP0015_0118 [Acinetobacter haemolyticus ATCC 19194] Length = 837 Score = 89.2 bits (219), Expect = 1e-15, Method: Composition-based stats. 
Identities = 58/318 (18%), Positives = 103/318 (32%), Gaps = 20/318 (6%) Query: 64 YRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAA 123 SR + + + P AG + S I ++ A A P Sbjct: 51 NAASRFVEGDEVADKRMQQVNEAFTPL--NQGTAGHIASGITEVVSAGAVGAPL-GPYGM 107 Query: 124 GALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGA-IASQSIAKTVAS 182 A + E + Q GVD++TAD + + + P + + +S+ A+ Sbjct: 108 AATVGLGTRAIEHTKLTQQLGVDQDTADTASNIYGATNAALAFLPVSNVFKKSLIADYAA 167 Query: 183 GAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIF--DMESLITDGLIGAFFGGMHS--K 238 V G L+ GY Y+ D ++ + IG+ F Sbjct: 168 LVVAPTAVGQGMTYAEGAYLDSKGYKKQGAMYKDMATDPNAIFMNMAIGSTFFAAGRYMN 227 Query: 239 QVQNMSLRLVNDLKEGITERLPYKHGVKS----SSPGLHTSFDAYEAHTDTLAHGVDSLV 294 N L K + S P + + D H L +D ++ Sbjct: 228 AKGNADLPEAEVHKAEADFNATVEQAQTDADVSSMPNIADTVDDLAQHEANLNQAIDQVM 287 Query: 295 RGEYPHFDQE---KLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHP 351 +GE + + KL+T+ D H + + + +P + + R S LA + + Sbjct: 288 KGEKVNISEATGGKLKTLDD---VKKHIQANQKKVQPTLEDLSNKVRSSISSRLAANKNN 344 Query: 352 KR-KEVERELSEIEGAKK 368 E + + I G KK Sbjct: 345 SSNDEATKPFTAI-GTKK 361 >gi|293609610|ref|ZP_06691912.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828062|gb|EFF86425.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 954 Score = 88.4 bits (217), Expect = 3e-15, Method: Composition-based stats. 
Identities = 53/341 (15%), Positives = 100/341 (29%), Gaps = 37/341 (10%) Query: 7 SDEDIRDNIKEWAQRPRVSP-DIKWHTGLGKEVINMPARSLDKLV----APFR----EET 57 + +D + + Q P + P + +G A L+K+ AP + Sbjct: 12 NQQDFEELNSKGLQHPDIRPNEPSAFSGAISSPFRGAAIGLNKVGDAISAPIDAVVDRVS 71 Query: 58 HDQPNYYRGSRTDPHSVGTGA------HLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRL 111 + + +P+ +LV G + G+ L R Sbjct: 72 YTLKDVSTNEFIEPYEEYKAKREKARDNLVYGAIDKLEDKENTGIVGRFGVGAGDYLWRG 131 Query: 112 AGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAI 171 A A L A L + +GVD+ TA +A A+ P + Sbjct: 132 ALGAATGGTLGAATLTGGSTG-NYIYTDLTRKGVDENTALQVAGINAVGDAIGTALPMSY 190 Query: 172 ASQ---SIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLI 228 + + A + S+++L+ G A+ + + ES+ TD + Sbjct: 191 GFRGTGGLVGDAALSVGGATALNTGVQYTSNQILKAAGNEKEAKQFEV-TGESVATDLAL 249 Query: 229 GA-FFGGMHSKQVQ------NMSLRLVNDLKEGITER---------LPYKHGVKSSSPGL 272 A FGG + ++ + + I R ++ P Sbjct: 250 NALLFGGARYLGSRQKQLDQDVDAEINQLNADDIETRNDQINDTLVRNSFEFEDTTLPVR 309 Query: 273 HTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNT 313 T H L D +++G+ +Q A Sbjct: 310 TTDPVQQNKHYQNLDAATDQILKGQTVSV-PNTVQGEARKA 349 >gi|254251752|ref|ZP_04945070.1| Soluble lytic murein transglycosylase [Burkholderia dolosa AUO158] gi|124894361|gb|EAY68241.1| Soluble lytic murein transglycosylase [Burkholderia dolosa AUO158] Length = 764 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. 
Identities = 44/245 (17%), Positives = 84/245 (34%), Gaps = 11/245 (4%) Query: 32 TGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYI 91 T +++ P SL + + + + G L L P Sbjct: 61 TAGASQMLTDPTESLLNPQVQEETDRRLGETFRKQREGTLFTSAAGQRLYSLSDMLRPDP 120 Query: 92 AGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETAD 151 +++ + L ++ A+ P+A A+ S + EGVD T Sbjct: 121 QNTTTTDQIVQGAVSGLVQIVPAAVLGGPVAGAAVGGASIGLGRSE-ELKREGVDVGTRT 179 Query: 152 ALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMA 211 A+ E + + + P +IA+T+ AV + + +L++ GY +A Sbjct: 180 AVGAVEGALGAAGAVLPA--GGSTIARTLGLVAVGGPGMAIGQSTAEKAILKNAGYDHLA 237 Query: 212 QHYRIFDMESLITDGLIGAFFGGMH-------SKQVQNMSLRL-VNDLKEGITERLPYKH 263 D +L L+ FFGG+H ++ +N + L + LPY Sbjct: 238 DQIDPLDPTNLAASTLMAGFFGGLHAGGLASAARTARNADPSTPLPSLDVAARKALPYNS 297 Query: 264 GVKSS 268 + + Sbjct: 298 PILDA 302 >gi|48697206|ref|YP_024936.1| SLT domain-containing tail structural protein [Burkholderia phage BcepC6B] gi|47779012|gb|AAT38375.1| gp16 [Burkholderia phage BcepC6B] Length = 763 Score = 69.9 bits (169), Expect = 9e-10, Method: Composition-based stats. 
Identities = 44/302 (14%), Positives = 100/302 (33%), Gaps = 8/302 (2%) Query: 73 SVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSH 132 GA + + P A + + + + L ++ A+ PLA A+ Sbjct: 101 ESPLGARAYDLSDTFKPDPTRATAIDQTVQGVVSGLAQIVPAAVLGGPLAGAAVGGASIG 160 Query: 133 KAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGM 192 + + + +GVD T A+ E + + + P +A ++ +T+ A + Sbjct: 161 MSRAE-DLKRQGVDVGTRTAVGAVEGALTAAGAVLP--VAGSTLPRTIGLVAAGGPGAAI 217 Query: 193 VERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLK 252 + +L + GY +A D +L L+ F G+H+ + + Sbjct: 218 AQATIEKAILRNAGYDHLADQINPLDPINLAAATLMAGTFAGVHTAATARTARQNAPAAT 277 Query: 253 EGITE-RLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSL-VRGEYPHFDQEKLQTIA 310 + + + + +P L + +L GE + Q + A Sbjct: 278 VPLQSLAIDARRALPYDAPQLDAYAAQAAQAAGVPPELMLALKNAGEKSNSGQVSPKGAA 337 Query: 311 DNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKES 370 + P +P ++ LA+ ++ +++ G K++ Sbjct: 338 GVSQMMPENLRKYGVTDP---TDPMQALDGMAKYLADTQKQYGGNLQAMIADYNGGPKQA 394 Query: 371 SA 372 +A Sbjct: 395 AA 396 >gi|221213943|ref|ZP_03586916.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD1] gi|221166120|gb|EED98593.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD1] Length = 749 Score = 67.2 bits (162), Expect = 6e-09, Method: Composition-based stats. 
Identities = 41/302 (13%), Positives = 98/302 (32%), Gaps = 8/302 (2%) Query: 73 SVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSH 132 G + + P + + + + LT++ A+ PL A+ Sbjct: 101 ESPLGTRAYDLSDTFKPDPTRTTAIDQTVQGVVSGLTQIVPAAVLGGPLTGAAVGGTSIG 160 Query: 133 KAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGM 192 + + + +GVD T A+ E + + + P +A ++ +TV A + Sbjct: 161 MSRAE-DLKRQGVDVGTRTAVGAVEGALTAAGAVLP--VAGSTLPRTVGLVAAGGPGAAI 217 Query: 193 VERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLK 252 + +L + Y +A D ++ L+ F G H+ + + Sbjct: 218 AQASIEKAILRNADYDHLADQIDPLDPVNIAASTLMAGVFAGAHTVATARTARQTATAPT 277 Query: 253 EGITE-RLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSL-VRGEYPHFDQEKLQTIA 310 + L + + ++P L + +L GE + Q + A Sbjct: 278 ASLQSLSLDARRALPYNAPELDAYAVQAAQAAGVPPELMLALKNAGEKSNSGQVSRKGAA 337 Query: 311 DNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKES 370 + P +P + ++ LA+ ++ +++ G +++ Sbjct: 338 GVSQMMPENLRKYGVTDPTDPVQ---ALDGMAKYLADTQKQYGGNLQAMIADYNGGPRQA 394 Query: 371 SA 372 +A Sbjct: 395 AA 396 >gi|262371857|ref|ZP_06065136.1| predicted protein [Acinetobacter junii SH205] gi|262311882|gb|EEY92967.1| predicted protein [Acinetobacter junii SH205] Length = 876 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. 
Identities = 40/293 (13%), Positives = 91/293 (31%), Gaps = 23/293 (7%) Query: 24 VSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYY-RGSRTDPHSVGTGAHLVE 82 D ++ + + A + VA E P+ RG + + + Sbjct: 12 NQDDPRFKPKSERGGFSDGALGIVSGVAMGTVEAATAPDALIRGDKKAAALRAQNLEIFK 71 Query: 83 GLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYL---SHKAESSIH 139 + G L+ T + A L + + AL + L Sbjct: 72 -----PDDLGGVGEFTYGLTKDFTRIGWNAVTTLGTGGVPGLALNSGLFGYQTFEAEKSD 126 Query: 140 HQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSS 199 +G D +TA + + ++ P ++S+ + L G+ Sbjct: 127 LLNKGADVKTARTGGAIKGLADAASFAIPTHGVAKSVVADAVATTALATGAGVAGDYLEG 186 Query: 200 KVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH--------SKQVQNMSLRLVNDL 251 L+ + +AQ+ +L L A GGM +++ ++ +++ Sbjct: 187 SFLKTNENKKVAQYGEALKENALSPSTL--AANGGMALLLNLWANKGRLRPEQIKDHSNV 244 Query: 252 KEGITERLPYKHGVKS---SSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHF 301 + + + ++ ++P T+ +H D L ++S + E Sbjct: 245 DT-MNDAAHIQANIEHAEGTNPFSPTNAKEANSHFDALDSAMESALNDELVSL 296 >gi|226953661|ref|ZP_03824125.1| possible phage-like protein [Acinetobacter sp. ATCC 27244] gi|226835533|gb|EEH67916.1| possible phage-like protein [Acinetobacter sp. ATCC 27244] Length = 876 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. 
Identities = 39/293 (13%), Positives = 87/293 (29%), Gaps = 23/293 (7%) Query: 24 VSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYY-RGSRTDPHSVGTGAHLVE 82 D ++ + + VA E P+ RG + + + Sbjct: 12 NQDDPRFKPKSERGGFSDGVLGTVSGVAMGTIEAATAPDALIRGDKKAAALRAQNLEIFK 71 Query: 83 GLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYL---SHKAESSIH 139 + G L+ T + A L + + AL + L Sbjct: 72 -----PDDLGGVGEFTYGLTKDFTRIGWNAVTTLGTGGVPGLALNSGLFGYQTFEAEKSD 126 Query: 140 HQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSS 199 +G D +TA + + P ++S+ + L G+ Sbjct: 127 LLNKGADIKTARTGGAIKGVTDALGFAIPTHGVAKSVVADAVATTALATGAGVAGDYLEG 186 Query: 200 KVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH--------SKQVQNMSLRLVNDL 251 LE++ +AQ+ + L A GGM +++ ++ +++ Sbjct: 187 SFLENNENKKVAQYGEALKENATSPSTL--AANGGMALLLNLWANKGRLRPEQIKDHSNV 244 Query: 252 KEGITERLPYKHGVKS---SSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHF 301 + + + ++ ++P T+ +H D L ++S + E Sbjct: 245 DT-MNDAAHIQANIEHAEGTNPFSPTNAKEANSHFDALDSAMESALNDELVSL 296 >gi|221201509|ref|ZP_03574548.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD2M] gi|221207935|ref|ZP_03580941.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD2] gi|221172120|gb|EEE04561.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD2] gi|221178777|gb|EEE11185.1| SLT domain-containing tail structural protein [Burkholderia multivorans CGD2M] Length = 749 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. 
Identities = 26/160 (16%), Positives = 57/160 (35%), Gaps = 3/160 (1%) Query: 73 SVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSH 132 G + + P + + + + + LT++ A+ PLA A+ Sbjct: 101 ESTLGTRAYDLADTFKPDPTRTTVIDQTVQGVMSGLTQIVPAAVLGGPLAGAAVGGTSIG 160 Query: 133 KAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGM 192 + + + +GVD T A+ E + + + P +A ++ +TV A + Sbjct: 161 MSRAE-DLKRQGVDVGTRTAVGAVEGALTAAGAVLP--VAGSTLPRTVGLVAAGGPGAAI 217 Query: 193 VERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFF 232 + +L + Y +A D ++ L+ F Sbjct: 218 AQASIEKAILRNADYDHLADQIDPLDPVNIAASTLMAGVF 257 >gi|317156431|ref|XP_001825741.2| 3-oxoacyl-[acyl-carrier-protein] synthase [Aspergillus oryzae RIB40] Length = 1625 Score = 50.3 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 64/272 (23%), Positives = 104/272 (38%), Gaps = 37/272 (13%) Query: 228 IGAFFGGMHSKQVQNMSLRLVNDLKEGI-------TERLPYKHGVKSSSPGLHTSFDAYE 280 IG+ GG+HS + L D+++ I T + SS+ + TS A Sbjct: 1132 IGSGLGGVHSLKKMFRDRYLDKDVQKDILQETFINTTAAWVNMLLISSAGPIRTSVGACA 1191 Query: 281 AHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQK 340 ++L G +++V G + E F P + K+ Q+ Sbjct: 1192 TSIESLETGFETIVTGRAKICLVGGYDDMTQALAE--EFANMKATTNPEEEAKKGRLPQE 1249 Query: 341 PSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHS--PFKGERNQKLD--- 395 S P AE + VE S+ G + +SAR D G P H + G + K Sbjct: 1250 MSRPAAES---RSGFVE---SQGSGVQVITSARLALDLGLPIHGIVAWVGTASDKTSRSV 1303 Query: 396 --PMRG--ADFTDAPHAKFDAT---------TFTESLPHVDEQTMHRFSELKERHPVEAR 442 P +G + + P+++F + L ++E L+E+ + Sbjct: 1304 PAPGQGILTNAREKPNSRFPSPLLDIRYRKRRLEARLKQINESVDLEVQMLEEQMTQDG- 1362 Query: 443 EVLEGLQEKLQGTK---EIKTKSLIKEAINCF 471 EV E LQE+LQ K E + + KEA+N F Sbjct: 1363 EVPEELQEELQNHKRFVEGEAERQRKEALNTF 1394 >gi|83774485|dbj|BAE64608.1| unnamed protein product [Aspergillus oryzae] Length = 1783 Score = 50.3 bits (118), Expect = 7e-04, Method: Composition-based stats. 
Identities = 64/272 (23%), Positives = 104/272 (38%), Gaps = 37/272 (13%) Query: 228 IGAFFGGMHSKQVQNMSLRLVNDLKEGI-------TERLPYKHGVKSSSPGLHTSFDAYE 280 IG+ GG+HS + L D+++ I T + SS+ + TS A Sbjct: 1132 IGSGLGGVHSLKKMFRDRYLDKDVQKDILQETFINTTAAWVNMLLISSAGPIRTSVGACA 1191 Query: 281 AHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQK 340 ++L G +++V G + E F P + K+ Q+ Sbjct: 1192 TSIESLETGFETIVTGRAKICLVGGYDDMTQALAE--EFANMKATTNPEEEAKKGRLPQE 1249 Query: 341 PSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHS--PFKGERNQKLD--- 395 S P AE + VE S+ G + +SAR D G P H + G + K Sbjct: 1250 MSRPAAES---RSGFVE---SQGSGVQVITSARLALDLGLPIHGIVAWVGTASDKTSRSV 1303 Query: 396 --PMRG--ADFTDAPHAKFDAT---------TFTESLPHVDEQTMHRFSELKERHPVEAR 442 P +G + + P+++F + L ++E L+E+ + Sbjct: 1304 PAPGQGILTNAREKPNSRFPSPLLDIRYRKRRLEARLKQINESVDLEVQMLEEQMTQDG- 1362 Query: 443 EVLEGLQEKLQGTK---EIKTKSLIKEAINCF 471 EV E LQE+LQ K E + + KEA+N F Sbjct: 1363 EVPEELQEELQNHKRFVEGEAERQRKEALNTF 1394 >gi|238492181|ref|XP_002377327.1| fatty acid synthase alpha subunit, putative [Aspergillus flavus NRRL3357] gi|220695821|gb|EED52163.1| fatty acid synthase alpha subunit, putative [Aspergillus flavus NRRL3357] Length = 1650 Score = 50.3 bits (118), Expect = 7e-04, Method: Composition-based stats. 
Identities = 64/272 (23%), Positives = 104/272 (38%), Gaps = 37/272 (13%) Query: 228 IGAFFGGMHSKQVQNMSLRLVNDLKEGI-------TERLPYKHGVKSSSPGLHTSFDAYE 280 IG+ GG+HS + L D+++ I T + SS+ + TS A Sbjct: 1136 IGSGLGGVHSLKKMFRDRYLDKDVQKDILQETFINTTAAWVNMLLISSAGPIRTSVGACA 1195 Query: 281 AHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQK 340 ++L G +++V G + E F P + K+ Q+ Sbjct: 1196 TSIESLETGFETIVTGRAKICLVGGYDDMTQAVAE--EFANMKATTNPEEEAKKGRLPQE 1253 Query: 341 PSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHS--PFKGERNQKLD--- 395 S P AE + VE S+ G + +SAR D G P H + G + K Sbjct: 1254 MSRPAAES---RSGFVE---SQGSGVQVITSARLALDLGLPIHGIVAWVGTASDKTSRSV 1307 Query: 396 --PMRG--ADFTDAPHAKFDAT---------TFTESLPHVDEQTMHRFSELKERHPVEAR 442 P +G + + P+++F + L ++E L+E+ + Sbjct: 1308 PAPGQGILTNAREKPNSRFPSPLLDIRYRKRRLEARLKQINESVDLEVQMLEEQMTQDG- 1366 Query: 443 EVLEGLQEKLQGTK---EIKTKSLIKEAINCF 471 EV E LQE+LQ K E + + KEA+N F Sbjct: 1367 EVPEELQEELQNHKRFVEGEAERQRKEALNTF 1398 >gi|315122596|ref|YP_004063085.1| hypothetical protein CKC_04240 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495998|gb|ADR52597.1| hypothetical protein CKC_04240 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 283 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. 
Identities = 30/185 (16%), Positives = 61/185 (32%), Gaps = 12/185 (6%) Query: 50 VAPFREETHDQP-NYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPL 108 + F HD P + DP G A + + G ++ I + Sbjct: 70 IEKFYRLFHDNPLKISDPLQYDPDQKKLG---------FWGSTAHSIVEGAVIYGIGNII 120 Query: 109 TRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAP 168 A L G L ++ ++S + + G+D+ T+ L + + P Sbjct: 121 GSSFSANPFVASL-VGLLTISATYGHQTSENMKHLGIDESTSQTLGLLSGGFYMLSFAIP 179 Query: 169 GAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLI 228 K + +GA + E+ ++ L GY + + + ++I D ++ Sbjct: 180 YIHRGDVSLKKIINGAGQQIATRTTEQLTTNGTLYFQGY-EKEEPTEGWSNYTVIVDVIL 238 Query: 229 GAFFG 233 G Sbjct: 239 TVGLG 243 >gi|291243144|ref|XP_002741464.1| PREDICTED: PHD finger protein 7-like [Saccoglossus kowalevskii] Length = 1231 Score = 46.0 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 43/186 (23%), Positives = 62/186 (33%), Gaps = 18/186 (9%) Query: 234 GMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSL 293 G+ S ++ + + K+G+ E P SSP ++ + S Sbjct: 743 GVESSPLRKLDVESSPLRKQGV-ESSPLSRLNDESSPLRKLDVESSPLRKQGVESSPLSR 801 Query: 294 VRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKR 353 + E KL +E + E PL + + S P L P R Sbjct: 802 LNDESSPL--RKLD------VESSPLRKQGVESSPLSRLNDESS---PLRKLDVESSPLR 850 Query: 354 KEVERELSEIEGAKKESSA-RKFFDEGSPDHSPFKGERNQKLD----PMRGADFTDAPHA 408 K+ R S + ESS RK DE SP KLD P+R D P Sbjct: 851 KQGVRS-SSLSRLNDESSPLRKLNDESSPLRKLDDESSLSKLDVESLPLRKLDVESLPFR 909 Query: 409 KFDATT 414 K D + Sbjct: 910 KLDVES 915 >gi|325962152|ref|YP_004240058.1| membrane protein [Arthrobacter phenanthrenivorans Sphe3] gi|323468239|gb|ADX71924.1| putative membrane protein [Arthrobacter phenanthrenivorans Sphe3] Length = 678 Score = 45.3 bits (105), Expect = 0.021, Method: Composition-based stats. 
Identities = 50/327 (15%), Positives = 96/327 (29%), Gaps = 37/327 (11%) Query: 56 ETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLA 115 T+D NY + D L + S G A +LL+ T +R+ A Sbjct: 138 TTNDANNYLLSTIVD--------KLTTAVHSRVATEVGEETANQLLTGFGTIHSRMVQAA 189 Query: 116 LQSAPLAAGAL------------YAYLS-HKAESSIHH--QIEGVDKETADALAWREAIV 160 + +A G A LS E +G ++ T A + Sbjct: 190 DGAGQVADGVARLRDGTATLREGTAGLSNGAGELYQGQVKLRDGANQLTDGAGQLSSGLS 249 Query: 161 HTSALLAPGAIASQSIAKTVASGAVLNVPFG-----------MVERGWSSKVLEDHGYPD 209 A +Q++A A A N ++G ++V + + Sbjct: 250 VLKDKTATLPTDTQTLANGAARVAAGNAQLNTKVQEAAAQLEAADQGLRARVADTNARLV 309 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMHSKQVQ-NMSLRLVNDLKEGITERLPYKHGVKSS 268 A ++++ D A + + + + + L +G + + Sbjct: 310 AAGVLTQEQADAILADFDATAGSSPVAAARTKIQADAAQIQQLADGAASVSTGAAQLAAG 369 Query: 269 SPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEP 328 +P L + + D L G +L GE D + +AD L Sbjct: 370 TPALRDAVSQASSGADQLHTGAAALATGEQSALDGAR--RLADGARTLDDGAAQLSAGAG 427 Query: 329 LPQYKEHSDRQKPSEPLAEHPHPKRKE 355 + + + + P+P + Sbjct: 428 TAADGSRTLADELGKGAGQVPNPDDSQ 454 >gi|74693947|sp|Q758T8|SWC3_ASHGO RecName: Full=SWR1-complex protein 3 Length = 688 Score = 44.9 bits (104), Expect = 0.033, Method: Composition-based stats. Identities = 30/117 (25%), Positives = 48/117 (41%), Gaps = 15/117 (12%) Query: 285 TLAHGVDSLVRGEYPHFDQEKLQTI---ADNTLEDPHFKPHLPEPEPLPQYKE------H 335 L + ++ G+ P D E+ + A N P++KP L + + +E Sbjct: 286 ALNKLMIAVANGQAPPADVERFKVFIERARNMEPPPNWKPRLSSRPVIKRTEEPTVEQQE 345 Query: 336 SDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQ 392 S Q PS PL P+ +V+ S G+ SS F E S S +GE ++ Sbjct: 346 SASQTPSTPLPRKASPESSQVDNLSSPPHGSDPNSS----FTEASMSDS--RGELSE 396 >gi|159127542|gb|EDP52657.1| conserved hypothetical protein [Aspergillus fumigatus A1163] Length = 587 Score = 44.5 bits (103), Expect = 0.036, Method: Composition-based stats. 
Identities = 45/238 (18%), Positives = 76/238 (31%), Gaps = 25/238 (10%) Query: 216 IFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSS-SPGLHT 274 + D +++D A ++ L +V+DL P K + S SP Sbjct: 207 VKDGRRILSDKTPNACL-----SPARSKHLDVVSDL-------SPVKRSLFESRSPKKLL 254 Query: 275 SFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKE 334 ++ T+ D R + ++++ + N + + + P P+Y + Sbjct: 255 PSPSFVGQKRTIDQVEDD-SRINKENVQIQRVEQVERN--HERNLQDQTITPATAPKYDQ 311 Query: 335 HSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKL 394 PS + + R L S D SP +P R Sbjct: 312 QQSDAMPSNDTQHTEPQQSNQQTRRL-------PLSDIVDLIDTPSPKETPKTNSRTIPE 364 Query: 395 DPMRGADFTDAPHAKFDATTFTESLPHV-DEQTMHRFSELKERHPVEAREVLEGLQEK 451 DP F A + ++ HV D Q R SEL+ R L L +K Sbjct: 365 DPQTRKLFIQE-KASLLRSRIRSAMRHVRDHQFDRRLSELEAHSRKFPRLSLPALSQK 421 >gi|254579100|ref|XP_002495536.1| ZYRO0B13662p [Zygosaccharomyces rouxii] gi|238938426|emb|CAR26603.1| ZYRO0B13662p [Zygosaccharomyces rouxii] Length = 314 Score = 44.5 bits (103), Expect = 0.041, Method: Composition-based stats. Identities = 35/157 (22%), Positives = 67/157 (42%), Gaps = 12/157 (7%) Query: 321 PHLPEPEPLPQYKEHSDRQ---KPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFD 377 EPE ++ + + +P +P AE H E ++ SE + A +E S D Sbjct: 127 EQPAEPEQSATEEQPAAEEKPAEPEQPAAEEKHEDASEKHQDASEPQPAPEEDSNESEQD 186 Query: 378 EGSPDHSPFKGERNQK---LDPMRGADFTDAPHAKFDATTFTESLPH-VDEQTMHRFSEL 433 E + ++P GE N L M + A F ++E+ P +D + +F + Sbjct: 187 EKATAYNPDTGEINWDCPCLGGMAHGPCGEEFKAAFSCFVYSEAEPKGID--CIEKFQNM 244 Query: 434 KE---RHPVEAREVLEGLQEKLQGTKEIKTKSLIKEA 467 +E +HP E L+ +E + + ++ + + +A Sbjct: 245 QECFRKHPEHYAEQLKDEEEAIAAQESVEAEVAVVDA 281 >gi|70999624|ref|XP_754529.1| conserved hypothetical protein [Aspergillus fumigatus Af293] gi|66852166|gb|EAL92491.1| conserved hypothetical protein [Aspergillus fumigatus Af293] Length = 587 Score = 44.5 bits (103), Expect = 0.042, Method: Composition-based stats. 
Identities = 45/238 (18%), Positives = 76/238 (31%), Gaps = 25/238 (10%) Query: 216 IFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSS-SPGLHT 274 + D +++D A ++ L +V+DL P K + S SP Sbjct: 207 VKDGRRILSDKTPNACL-----SPARSKHLDVVSDL-------SPVKRSLSESRSPKKLL 254 Query: 275 SFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKE 334 ++ T+ D R + ++++ + N + + + P P+Y + Sbjct: 255 PSPSFVGQKRTIDQVEDD-SRINKENVQIQRVEQVERN--HERNLQDQAITPATAPKYDQ 311 Query: 335 HSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKL 394 PS + + R L S D SP +P R Sbjct: 312 QQSDAMPSNDTQHTEPQQSNQQTRRL-------PLSDIVDLIDTPSPKETPKTNSRTIPE 364 Query: 395 DPMRGADFTDAPHAKFDATTFTESLPHV-DEQTMHRFSELKERHPVEAREVLEGLQEK 451 DP F A + ++ HV D Q R SEL+ R L L +K Sbjct: 365 DPQTRKLFIQE-KASLLRSRIRSAMRHVRDHQFDRRLSELEAHSRKFPRLSLPALSQK 421 >gi|302307784|ref|NP_984524.2| AEL336Wp [Ashbya gossypii ATCC 10895] gi|299789167|gb|AAS52348.2| AEL336Wp [Ashbya gossypii ATCC 10895] Length = 688 Score = 44.1 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 30/117 (25%), Positives = 48/117 (41%), Gaps = 15/117 (12%) Query: 285 TLAHGVDSLVRGEYPHFDQEKLQTI---ADNTLEDPHFKPHLPEPEPLPQYKE------H 335 L + ++ G+ P D E+ + A N P++KP L + + +E Sbjct: 286 ALNKLMIAVANGQAPPADVERFKVFIERARNMEPPPNWKPRLSSRPVIKRTEEPTVEQQE 345 Query: 336 SDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQ 392 S Q PS PL P+ +V+ S G+ SS F E S S +GE ++ Sbjct: 346 SASQTPSTPLPRKASPESSQVDNLSSPPHGSDPNSS----FTEASMSDS--RGELSE 396 >gi|194853302|ref|XP_001968138.1| GG24671 [Drosophila erecta] gi|190660005|gb|EDV57197.1| GG24671 [Drosophila erecta] Length = 5335 Score = 44.1 bits (102), Expect = 0.054, Method: Composition-based stats. 
Identities = 45/215 (20%), Positives = 74/215 (34%), Gaps = 17/215 (7%) Query: 246 RLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDS------LVRGEYP 299 ++ E + + +S + + TD + S + G+ Sbjct: 1593 DAGQEVGEEKSNPPLDESSQLEASSSTSAAEKERQISTDAANAAMSSKPNYVYINTGDED 1652 Query: 300 HFDQEKLQTIADNTLEDPHFKPHLPEPEPLP-QYKEHSDRQKPSEPLAEHPHPKRKEVER 358 + + + E KP PEP + K SD P + + E ++ Sbjct: 1653 SMVVQLVLAMRMGKRELIPDKPKEKAPEPKKDEEKSESDEATPDKLEGDEISKTEGEPKK 1712 Query: 359 ELSEIEGAKKESSARKF--FDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFT 416 +L++ EG + +SSA + DE PD S E N+ D M D K D Sbjct: 1713 DLTDTEGKQLDSSAMEVDSKDESEPDDSKKSDEDNKDKDKME----VDDEAEKSD----K 1764 Query: 417 ESLPHVDEQTMHRFSELKERHPVEAREVLEGLQEK 451 ES P +T+ K ++ VL G Q K Sbjct: 1765 ESKPEEQSETVKTEENSKAAEEDKSSTVLTGDQAK 1799 >gi|169627314|ref|YP_001700963.1| hypothetical protein MAB_0209 [Mycobacterium abscessus ATCC 19977] gi|169239281|emb|CAM60309.1| Hypothetical protein MAB_0209 [Mycobacterium abscessus] Length = 1144 Score = 43.7 bits (101), Expect = 0.061, Method: Composition-based stats. 
Identities = 69/410 (16%), Positives = 127/410 (30%), Gaps = 60/410 (14%) Query: 8 DEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGS 67 ++D+ D Q+P+ PD + + A+ L D N + + Sbjct: 555 EQDLSDQ-----QQPQ-GPD-------TQALAQDGAQLGQSLPGDIANTVSDSVNLGQSA 601 Query: 68 RTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGA-L 126 + + G+ A AGA+LA S P+ +A + S ++ A Sbjct: 602 GSAAQNFGSAAQ------------AGASLASSAQSGAVNPMDAVALVQGVSGGISDTADA 649 Query: 127 YAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVL 186 + A + ++ +G + ADA +A L + VA G V Sbjct: 650 VGSGASIASTWLNEAGQG-AQLAADANPQLKAEAEQVRQLTQAGSQVADLTGKVA-GGVS 707 Query: 187 NVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLR 246 V GMV ++ L G PD + + + G S Sbjct: 708 QVS-GMVN---TASSLGTSGMPDTSGATDALSGTATAVN---GPGDVPKPPTPPSVPSQS 760 Query: 247 LVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKL 306 + T + + S+P + A+ + + + L + L Sbjct: 761 PSSVQALDSTTTRAPQQPPQPSNPSTPKT-----ANQSSTSKPLSPL-EASTFPAPLQAL 814 Query: 307 QTIADNTLEDP------------HFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRK 354 T + DP P P Q + + + E+ + Sbjct: 815 NTAQAASTPDPNAGRLSSMPGVRDVSQPPLRPAPTLQPDQVDAFRAITRQNLENQNVPAD 874 Query: 355 EVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTD 404 ++E+ +++ A K++ +F PD P + Q LD G F D Sbjct: 875 QIEQRVND---AVKQAQTPRFM----PDPQPMRTPGAQPLDRPLGDKFND 917 >gi|301097660|ref|XP_002897924.1| abnormal spindle-like microcephaly-associated protein [Phytophthora infestans T30-4] gi|262106369|gb|EEY64421.1| abnormal spindle-like microcephaly-associated protein [Phytophthora infestans T30-4] Length = 2036 Score = 43.7 bits (101), Expect = 0.062, Method: Composition-based stats. 
Identities = 29/143 (20%), Positives = 52/143 (36%), Gaps = 11/143 (7%) Query: 257 ERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLED 316 E LP G + ++ T D+++ + + R + + TI Sbjct: 247 EPLPSDVGKEVAATLKFTVNDSFKLQCRATGFVMPRVARLAKFGKAKAPVDTIVVAPRAK 306 Query: 317 PHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFF 376 PH P +PE + + + A P+++ V R + G + S A +F Sbjct: 307 PHRLPRIPEESTRQGSRSTA----LAAQTAREGEPEQEPVVRPGPVVGGKRPSSVAIEF- 361 Query: 377 DEGSPDHSPFKGERNQKLDPMRG 399 SP P G + +K +P R Sbjct: 362 ---SP---PRNGPKRRKCEPPRA 378 >gi|193700114|ref|XP_001942665.1| PREDICTED: leucine-rich repeat-containing protein 4B-like [Acyrthosiphon pisum] Length = 669 Score = 43.7 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 74/256 (28%), Gaps = 18/256 (7%) Query: 134 AESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVP--FG 191 + + + + + A A + A V A+ TV S + V +G Sbjct: 414 SSTRSDGKPQHLVTTAASATSNGTAAVVVQVKQPVAGQATSPATTTVVSSMAVAVGGPYG 473 Query: 192 MVERGWSSKVLED---HGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLV 248 ++ G+ + G A+ S TD FG L Sbjct: 474 GLDAGFDQATATEPVVGGVRRPAK----LTELSFATDHYDSGGFGHGGVLDAGRTVLYRS 529 Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQT 308 + P +H +S PG S + H R T Sbjct: 530 QPSNPDLIVDAPEQHSPQSQPPGAAHS----QHHQQARRSASGEYRRTADDSLYSPGFWT 585 Query: 309 IADNTLED--PHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGA 366 +D D P + P P S R+ +A P PK + ++ Sbjct: 586 PSDAAATDRTPIIEKSPPLPAQ-SVAAVCSARETVM--VAAAPDPKAASLRVWKHGVQVM 642 Query: 367 KKESSARKFFDEGSPD 382 S+ ++ ++GSPD Sbjct: 643 PPLSALKRALNKGSPD 658 >gi|224088128|ref|XP_002308334.1| calcium dependent protein kinase 26 [Populus trichocarpa] gi|222854310|gb|EEE91857.1| calcium dependent protein kinase 26 [Populus trichocarpa] Length = 613 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. 
Identities = 18/67 (26%), Positives = 28/67 (41%) Query: 304 EKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEI 363 E + ++ N +++P F PE + KE Q PS P + E+ E+ E Sbjct: 40 ENVDGLSLNRVQEPPFHAQNKPPEQMKIAKEEIINQVPSPPKPKENATVASEIIMEVEES 99 Query: 364 EGAKKES 370 AK S Sbjct: 100 RPAKPAS 106 >gi|145595902|ref|YP_001160199.1| hypothetical protein Strop_3388 [Salinispora tropica CNB-440] gi|145305239|gb|ABP55821.1| hypothetical protein Strop_3388 [Salinispora tropica CNB-440] Length = 706 Score = 43.4 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 49/266 (18%), Positives = 79/266 (29%), Gaps = 44/266 (16%) Query: 51 APFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTR 110 APF T QP G TG E +T + + A A ++ + Sbjct: 113 APFYRATPAQP---LGVVRRRMIQSTG----ERVTGIEDDLLDPAAASPEMTTVGD--GA 163 Query: 111 LAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGA 170 L Q+ + A + + + +I GV T + T+ L A Sbjct: 164 LLAALSQATGRGMRDIVATIQREQDEAIRSPGSGV---TVVSGGPGTG--KTAVALHRAA 218 Query: 171 IASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGA 230 + A G +L V V + VL G E T +G Sbjct: 219 YLLYTDRSRYAGGGILVVGPSAVFVEYIGSVLPSLG-------------EETATLAALGG 265 Query: 231 FFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGV 290 F G+ + + + +K + R + + ++PG Sbjct: 266 LFPGVTATRTDPAEVAA---VKGSLRMRRVLERAARDTAPGAPDELR------------- 309 Query: 291 DSLVRGEYPHFDQEKLQTIADNTLED 316 L RGE D+ +L I D L Sbjct: 310 -LLYRGELLRVDRRELNAIRDRALRR 334 >gi|302528120|ref|ZP_07280462.1| predicted protein [Streptomyces sp. AA4] gi|302437015|gb|EFL08831.1| predicted protein [Streptomyces sp. AA4] Length = 641 Score = 43.4 bits (100), Expect = 0.089, Method: Composition-based stats. 
Identities = 55/324 (16%), Positives = 95/324 (29%), Gaps = 35/324 (10%) Query: 3 FNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPN 62 ++ VS++D RD +++ + + L P ++ L T+D N Sbjct: 99 WHQVSEKDARDGVRDDKYSFAIGIPHDFSKALLSSGNFEPQQATITL------TTNDANN 152 Query: 63 YYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLA 122 Y G+ + + + G+ A K L T +++ + LA Sbjct: 153 YLAGT--------IAKQVADQVRKTIAEKVGSEAADKFLVGFSTIYGKISEATDGAKQLA 204 Query: 123 AGAL-----------YAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAI 171 GA A SS+ + + TA + + + + +A G Sbjct: 205 DGAAKLQTGQHQLADGAGQLATGSSSLATGLGTLKSSTAQLPSQTQKLADGAGQVADGNQ 264 Query: 172 ASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAF 231 + AS + R + L D G D + D L Sbjct: 265 KVADASSLAASASSDLQGRLDSYRSQLNTQLHDAGLSD-----SQVNDILSRLDQLRSPV 319 Query: 232 FGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVD 291 Q N L L G + H + S+SP L + L G Sbjct: 320 NDANGKIQSANGDL---QKLASGARQVSDGAHQLASASPQLANGIAQASDGANQLRDGAA 376 Query: 292 SLVRGEYPHFDQEKLQTIADNTLE 315 L GE +AD + + Sbjct: 377 KLNDGEKTAV--TGTDQLADGSAK 398 >gi|194288752|ref|YP_002004659.1| replication/virulence associated protein; ATP-dependent protease clpa/b chaperone motif [Cupriavidus taiwanensis LMG 19424] gi|193222587|emb|CAQ68590.1| replication/virulence associated protein; putative ATP-dependent protease, clpA/B chaperone motif [Cupriavidus taiwanensis LMG 19424] Length = 912 Score = 43.4 bits (100), Expect = 0.094, Method: Composition-based stats. 
Identities = 75/431 (17%), Positives = 132/431 (30%), Gaps = 62/431 (14%) Query: 51 APFREETHDQPNYYRGSRTDP-----HSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIP 105 + D R R DP H + T ++ P + G A GK Sbjct: 189 SALGRYCRDLTEAARAGRLDPVIGREHEIRTMTDILLRRRQNNPLLTGEAGVGKTAVIEG 248 Query: 106 TPLTRLAGLA------LQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAI 159 L AG ++ L GAL A S K E +GV +E A + A Sbjct: 249 LALAVAAGEVPPSLKDVRLLSLDVGALLAGASMKGEFEARL--KGVLEEAAKSPAPVILF 306 Query: 160 VHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGY----PDMAQHYR 215 V L Q+ A+ + G++ ++ E + P + + ++ Sbjct: 307 VDEVHTLVGA--GGQAGTGDAANLLKPALARGVLRTIGATTWSEYKRHIEKDPALTRRFQ 364 Query: 216 IFD----MESLITDGLIGAF--FGGMHSKQVQNMSLRLV-------------NDLKEGIT 256 + E+ + G F H +++ ++R D + Sbjct: 365 VLQVMEPDEARAVAMVRGLVRTFEAHHGVLIRDEAVRAAVRLSHRFIPSRQLPDKAISLL 424 Query: 257 ERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYP--HFDQEKLQTIADNTL 314 + + G+ +P H A +L+ E D ++ +A Sbjct: 425 DTACARVGLSLHAPPAEVEHLR---HELAAADAESTLLARESGLGRPDAARI-GLARARR 480 Query: 315 EDPHFKPHLPE---PEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESS 371 E L + R+ A P +R L+E+EG + Sbjct: 481 EQLEADLALATARWERVRGLANDLVTRRHALVQSAPDASPLAAPAQRILAELEGQLHAAQ 540 Query: 372 ARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKF--DATTFTESLPHV-DEQTM- 427 A D E ++++ AD+T P + D T LP + E+ + Sbjct: 541 A---------DAPLVYTEVDERVVAAIVADWTGIPVGRMVADEVTTVMQLPQILGERVIG 591 Query: 428 --HRFSELKER 436 H S++ ER Sbjct: 592 QGHALSQIGER 602 >gi|116669229|ref|YP_830162.1| ABC-2 type transporter [Arthrobacter sp. FB24] gi|116609338|gb|ABK02062.1| ABC-2 type transporter [Arthrobacter sp. FB24] Length = 678 Score = 43.4 bits (100), Expect = 0.095, Method: Composition-based stats. 
Identities = 56/336 (16%), Positives = 110/336 (32%), Gaps = 48/336 (14%) Query: 6 VSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINM--PARSLDKLVAP----------F 53 V+D I ++ W Q + GK + P LV+P Sbjct: 77 VADSLIDGHVFNW-QSVDSAEQADQGVSSGKYAFALKIPKDFSANLVSPGSFDAANQAML 135 Query: 54 REETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAG 113 T+D NY + D + + + + + G A +LL+ T T++ Sbjct: 136 NVTTNDANNYLLSTIVDKLTTAVHSSVAKEV--------GEETANQLLTGFGTIHTQMVK 187 Query: 114 LALQSAPLAAG---------------ALYAYLSHKAESSIHHQIEGVDKETADALAWREA 158 A + L+ G + + + + + +G ++ A + Sbjct: 188 AADGAGQLSDGVSKLHDGTVTLHEGTSQLSSGAGELYNGQLKLRDGANQLNDGAAQLSDG 247 Query: 159 IVH----TSALLAPGAIASQSIAKTVASGAVLNVP-------FGMVERGWSSKVLEDHGY 207 + T++L A + A+ A A LN ++G ++V+E +G Sbjct: 248 LSQLQDKTASLPADSQKLADGAAQVAAGNATLNTKVQDVVGQLDAADQGLRNRVVESNGR 307 Query: 208 PDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQ-NMSLRLVNDLKEGITERLPYKHGVK 266 A +S++ D A G + + + + L +G + + Sbjct: 308 LMAAGIITQAQADSILKDFDAAAASGPVADAKAKIQSDAAQIQQLSDGSSAVSAGAARLA 367 Query: 267 SSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFD 302 +++P L + A D L G +L GE D Sbjct: 368 AATPALTGAIAQASAGADQLHTGTSALAAGEQSAVD 403 >gi|320038570|gb|EFW20505.1| serine/threonine-protein kinase prp4 [Coccidioides posadasii str. Silveira] Length = 580 Score = 43.0 bits (99), Expect = 0.11, Method: Composition-based stats. 
Identities = 32/183 (17%), Positives = 60/183 (32%), Gaps = 17/183 (9%) Query: 265 VKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLP 324 ++ +P A + + RG+ + LQT + + Sbjct: 23 MEDETPSEPVDEAALIEQRRRRREAIKAKYRGQATPLLVQALQTGNETGSTACDASEAVS 82 Query: 325 EP-------EPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFD 377 +P P + S Q P++ + S +E K+E+SA Sbjct: 83 KPDLSGRQGSPTNTLDDTSTAQSPTDLHVSRDEDLANTDLQSRSGLE--KEEASA----- 135 Query: 378 EGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLPHVDEQTMHRFSELKERH 437 D+ P R +K+ + D P + +D T T V E T +++K + Sbjct: 136 ---ADYDPTADMRQEKMKHDKRHFGEDMPASAYDETKVTRQEVLVPEPTAADPNQMKAKD 192 Query: 438 PVE 440 P + Sbjct: 193 PFD 195 >gi|332358977|gb|EGJ36798.1| ABC superfamily ATP binding cassette transporter, membrane protein [Streptococcus sanguinis SK49] Length = 907 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 70/199 (35%), Gaps = 21/199 (10%) Query: 192 MVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDL 251 + E+ S VL++ Y + + L +D +G G + + D Sbjct: 144 LTEKAGSRSVLKNKTYKIVG----FVNSAELWSDRNLGNATSGSGALSAYAVVSPKAFDT 199 Query: 252 KEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIAD 311 RL Y H ++ +P + + E H L ++ + + + TI Sbjct: 200 DVYSIARLRY-HDLEKLAPFSESYQERLEQHQTALDKSLEDNGAARFKRLEADAKSTIQK 258 Query: 312 NTLEDPHFKPHLPE-PEPLPQYKEHSDRQKPSEPLAEHPH--PKRK-------------E 355 + + L + + L Q + D+QK A+ P + + Sbjct: 259 GQDKIAQAESELTQGKKQLEQAESQLDQQKSQLAAAQSASILPPAQLSQSQQQIQEAEFQ 318 Query: 356 VERELSEIEGAKKESSARK 374 + ++ +E+ A+K+ SA K Sbjct: 319 LNQKKAELAQAEKDLSASK 337 >gi|7509604|pir||T26656 hypothetical protein Y38E10A.f - Caenorhabditis elegans Length = 1384 Score = 42.6 bits (98), Expect = 0.14, Method: Composition-based stats. 
Identities = 19/108 (17%), Positives = 44/108 (40%), Gaps = 12/108 (11%) Query: 286 LAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPL 345 + + S + E ++++ I ++ + E + +P+ E + P +P Sbjct: 950 IDEVLKSPQKSEKIPEKAQEIEEIEESPKKSEKAPEKPQEIQEIPKKSEKA----PEKPQ 1005 Query: 346 AEHPHPKRKEVERELSEIEGAKKESSARK--------FFDEGSPDHSP 385 PK+ E +E+ EI +++S ++ FF +P +P Sbjct: 1006 EIEKSPKKSEKRQEIQEIPQKSEKTSEKRPEIEELPTFFKSSAPAQTP 1053 >gi|126698700|ref|YP_001087597.1| putative DNA-repair protein [Clostridium difficile 630] gi|115250137|emb|CAJ67958.1| putative conjugative transposon protein Tn1549-like, CTn4-Orf11 [Clostridium difficile] Length = 646 Score = 42.6 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 61/167 (36%), Gaps = 8/167 (4%) Query: 234 GMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSS-PGLHTSFDAYEAHTDTLAHGVDS 292 H+++ ++ + T + + + P L T E D L + Sbjct: 45 AAHTRKANKKEVKKEQEATALRTSTSRLQFTDEERATPELETYIKKSEKAADQLDAAKAA 104 Query: 293 LVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPK 352 + + + + A F+ +P+P K+ + +P++ H K Sbjct: 105 IPKQKKL-VKERTFDEAAGKAKTRLRFEEQ---EKPIPGGKKGNPLSRPAQEAGIFVHNK 160 Query: 353 RKEVERELSEIEGA-KKESSARKFFDEGSPDHSPFKGERNQKLDPMR 398 VE++ S +EGA K E A + G+ +G R+ KL P R Sbjct: 161 IHSVEKDNSGVEGAHKSEELAERGAKYGARKLK--QGYRSHKLKPYR 205 >gi|325107016|ref|YP_004268084.1| hypothetical protein Plabr_0435 [Planctomyces brasiliensis DSM 5305] gi|324967284|gb|ADY58062.1| hypothetical protein Plabr_0435 [Planctomyces brasiliensis DSM 5305] Length = 407 Score = 42.6 bits (98), Expect = 0.15, Method: Composition-based stats. 
Identities = 29/165 (17%), Positives = 49/165 (29%), Gaps = 27/165 (16%) Query: 270 PGLHTSFDAYEAHTDTLAHGVDSLVRGEYP-------------HFDQEKLQTIADNTL-E 315 PG+ A + S E + + E A + Sbjct: 141 PGIPLDPHASKFQQQCDRAARKSNRWAEQIWHSVPGVPCNDKCNCEPESFAAAAPAIRGQ 200 Query: 316 DPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKF 375 P P LP P Q + E P P +++ +++ + E A+ F Sbjct: 201 SPEVGPELPPLTPAQQAEWEKALLGVLNLEEESPTPAAPVARKDVRDLKATEAE-LAQSF 259 Query: 376 FDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLP 420 +P SP ++Q + P + P + DA E P Sbjct: 260 ---PAPAPSP----QDQNVKP-----YQPPPQPRLDAPANLEEAP 292 >gi|108761607|ref|YP_635052.1| putative methyl-accepting chemotaxis protein [Myxococcus xanthus DK 1622] gi|108465487|gb|ABF90672.1| putative methyl-accepting chemotaxis protein [Myxococcus xanthus DK 1622] Length = 591 Score = 42.6 bits (98), Expect = 0.16, Method: Composition-based stats. Identities = 47/247 (19%), Positives = 88/247 (35%), Gaps = 34/247 (13%) Query: 209 DMAQHYRIFDMESLITDGLIGAFFGGMH--SKQVQNMSLRLVNDLKEGITERLP--YKHG 264 A H S++ A G + S L ++ D+ G+ E++ + Sbjct: 360 QQASHVTHSRATSILQVAERAAAVGKLGEESLAGTEKGLTVIRDIAAGLHEQMLDLEQRA 419 Query: 265 VKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEY---PHFDQEKLQTIADNTLEDPHFKP 321 + A ++H + +++ GE+ +++ +AD ++ + Sbjct: 420 REVGRVSEVVKSLADQSHMLAINAAIEATRAGEHGKGFGVVARQMRDLADQSVRATNQVR 479 Query: 322 HLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSA--RKFFDEG 379 L E + + + +AE P R ER L E+ G KES+A R+ + Sbjct: 480 GLLESMATATQHATAMSDQGAAGVAEALEPLRHSGER-LRELAGLSKESAAAVRQITEAV 538 Query: 380 SPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLPHVD---------EQTMHRF 430 S H+ G D A + D T T++L H+D + Sbjct: 539 SQQHA--------------GVDQLFAAVRELDELT-TDTLRHLDTTQQAASAVSHATGQV 583 Query: 431 SELKERH 437 S+L ER+ Sbjct: 584 SQLAERY 590 >gi|157126450|ref|XP_001654627.1| hypothetical protein AaeL_AAEL010525 [Aedes aegypti] gi|108873265|gb|EAT37490.1| hypothetical protein AaeL_AAEL010525 [Aedes aegypti] Length = 986 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. 
Identities = 26/141 (18%), Positives = 47/141 (33%), Gaps = 13/141 (9%) Query: 253 EGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADN 312 + + P P + E H D VDS ++E Sbjct: 361 DSADDMEPLDQTEDDDEPE-----ETAEGHEDVPEQEVDSEN---DTSMNEETTDITEPA 412 Query: 313 TLEDPHFKPHLPEPEPLPQYKEHS----DRQKPSEPLAEHPHPKRKEVERELSEIEGAKK 368 + + +P E P ++ + + +P+E P + E+E + +E Sbjct: 413 AMVETLLEPEPENDESEPTERDKAPDKDVQSEPAEEAEHTAEPTAEGQEQEQTMVEIPDD 472 Query: 369 ESS-ARKFFDEGSPDHSPFKG 388 SS + + F+E PD P G Sbjct: 473 NSSFSCEMFEEIGPDEEPANG 493 >gi|71998068|ref|NP_001022429.1| hypothetical protein Y38E10A.6 [Caenorhabditis elegans] gi|34556124|emb|CAE46683.1| C. elegans protein Y38E10A.6b, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 1345 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 44/108 (40%), Gaps = 12/108 (11%) Query: 286 LAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPL 345 + + S + E ++++ I ++ + E + +P+ E + P +P Sbjct: 911 IDEVLKSPQKSEKIPEKAQEIEEIEESPKKSEKAPEKPQEIQEIPKKSEKA----PEKPQ 966 Query: 346 AEHPHPKRKEVERELSEIEGAKKESSARK--------FFDEGSPDHSP 385 PK+ E +E+ EI +++S ++ FF +P +P Sbjct: 967 EIEKSPKKSEKRQEIQEIPQKSEKTSEKRPEIEELPTFFKSSAPAQTP 1014 >gi|56414508|ref|YP_151583.1| DNA transfer protein [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197363430|ref|YP_002143067.1| DNA transfer protein [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|56128765|gb|AAV78271.1| DNA transfer protein [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197094907|emb|CAR60444.1| DNA transfer protein [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] Length = 643 Score = 42.2 bits (97), Expect = 0.18, Method: Composition-based stats. 
Identities = 35/224 (15%), Positives = 64/224 (28%), Gaps = 14/224 (6%) Query: 66 GSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLA--- 122 GS D + G A + TS+ P G + GL LA Sbjct: 22 GSAIDEYFSGQSAQQEQQGTSMTPGSQPQQQGGFISDLGNAAAETGRGLLQAGVNLANIP 81 Query: 123 ---AGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKT 179 A A+ + + + + V L P + ++ Sbjct: 82 ASMADAVASAGAWAGQKLGIGDGTYQPAPRVTTQGLEQGFVLQQGALTPQTTEGKIFSEA 141 Query: 180 VASGAVLNV------PFGMVER--GWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAF 231 + + V + R +S++L ++ +A + + E+L TD G Sbjct: 142 LPYLTPVGVERIAAQAPSIAGRVAQGASRLLAENAVGSLAANSERDNPEALATDLGTGVA 201 Query: 232 FGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTS 275 GG +K + E ++ LHT+ Sbjct: 202 LGGAINKLGRAAGAAYRGIRGTIAPEAQQAIQFANAADVPLHTT 245 >gi|119947101|ref|YP_944781.1| DNA-directed RNA polymerase subunit alpha [Psychromonas ingrahamii 37] gi|158513126|sp|A1T0B7|RPOA2_PSYIN RecName: Full=DNA-directed RNA polymerase subunit alpha 2; Short=RNAP subunit alpha 2; AltName: Full=RNA polymerase subunit alpha 2; AltName: Full=Transcriptase subunit alpha 2 gi|119865705|gb|ABM05182.1| DNA-directed RNA polymerase, alpha subunit [Psychromonas ingrahamii 37] Length = 328 Score = 42.2 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 26/130 (20%), Positives = 48/130 (36%), Gaps = 2/130 (1%) Query: 236 HSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVR 295 H +S+R+ + G H + P DA + + +A+ V+S Sbjct: 132 HLTGNAEISMRIKIESGRGYVPASSRIHTEEDERPIGRLLVDATFSPVERIAYSVESARV 191 Query: 296 GEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKE 355 + D+ + D TL+ P Q D +K SEP+A+ P+ Sbjct: 192 EQRTDLDKLVIDMETDGTLD-PEEAIRRAATILAEQLDAFVDLRKVSEPVAKEEKPEFDP 250 Query: 356 V-ERELSEIE 364 + R + ++E Sbjct: 251 ILLRPVDDLE 260 >gi|71998064|ref|NP_001022428.1| hypothetical protein Y38E10A.6 [Caenorhabditis elegans] gi|34556123|emb|CAB60334.3| C. 
elegans protein Y38E10A.6a, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 1343 Score = 42.2 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 44/108 (40%), Gaps = 12/108 (11%) Query: 286 LAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPL 345 + + S + E ++++ I ++ + E + +P+ E + P +P Sbjct: 909 IDEVLKSPQKSEKIPEKAQEIEEIEESPKKSEKAPEKPQEIQEIPKKSEKA----PEKPQ 964 Query: 346 AEHPHPKRKEVERELSEIEGAKKESSARK--------FFDEGSPDHSP 385 PK+ E +E+ EI +++S ++ FF +P +P Sbjct: 965 EIEKSPKKSEKRQEIQEIPQKSEKTSEKRPEIEELPTFFKSSAPAQTP 1012 >gi|189197597|ref|XP_001935136.1| conserved hypothetical protein [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187981084|gb|EDU47710.1| conserved hypothetical protein [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 577 Score = 42.2 bits (97), Expect = 0.20, Method: Composition-based stats. Identities = 35/123 (28%), Positives = 45/123 (36%), Gaps = 24/123 (19%) Query: 281 AHTDTLAH------GVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKE 334 AH L H G V E FD ++ LED K + E LP + Sbjct: 3 AHARRLEHSLGSSWGEADYVSDEGGSFDSGS-DGASELDLEDSD-KEVVQERRALPTPRR 60 Query: 335 HSDRQKPSEPLAEH-PHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQK 393 H ++Q P P A+ P RK ER KK S ++ HS KG R Sbjct: 61 HRNQQSPPVPTAQTKSTPVRKPTER--------KKTSQSQHL-------HSDKKGPRQHP 105 Query: 394 LDP 396 +P Sbjct: 106 FEP 108 >gi|26553757|ref|NP_757691.1| ABC transporter ATP-binding protein [Mycoplasma penetrans HF-2] gi|26453764|dbj|BAC44095.1| ABC transporter ATP-binding protein [Mycoplasma penetrans HF-2] Length = 678 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. 
Identities = 36/165 (21%), Positives = 60/165 (36%), Gaps = 30/165 (18%) Query: 297 EYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEV 356 E F+ K ++ ++ P KP P P ++KP + A P+ KEV Sbjct: 128 EKYLFEPFKGPSLFKEEVKKPE-KPTKPAKVSKPV---EPVKEKPVKAKAADKKPEVKEV 183 Query: 357 ERELSEIEGAK------------------KESSARKFFDEGSPDHSPFKG-----ERNQK 393 + + K ++F D+ + PFKG E K Sbjct: 184 KPAKPKKLKETKEKKLKSKFLYIPYIKLVKGEEVKQFSDDNKFLYEPFKGPSLYDESANK 243 Query: 394 LDPMRGADFTDAPHAKFDATTFTESLPHVDEQTMHRFSELKERHP 438 ++P+ F D K D +E P D++ ++ E KE +P Sbjct: 244 VEPLPKEIFLDDSF-KDDEVPLSEEKPTKDKK--YKLEETKEDYP 285 >gi|158302482|ref|XP_322022.4| AGAP001140-PA [Anopheles gambiae str. PEST] gi|157012974|gb|EAA01427.5| AGAP001140-PA [Anopheles gambiae str. PEST] Length = 1702 Score = 41.8 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 74/217 (34%), Gaps = 26/217 (11%) Query: 238 KQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDT---LAHGVDSLV 294 + + + + E ++ER+ K P EA + D Sbjct: 14 SNAEPSAAQSEPVMSEAVSERMDVDDAGKEDVPAEAKEPLHEEASAVAERPVNSKPDDKD 73 Query: 295 RGEYPHFDQEKLQTIAD---NTLEDPHFKPHLPEP---------EPLPQYKEHSDRQKPS 342 GE +D + D + ++P + +P +P+ +EH +Q+ Sbjct: 74 DGESEKYDSAVEEMETDQPGDQDQNPTEESESKKPVAANTTSVDDPMKVDEEHEHQQQVV 133 Query: 343 EPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADF 402 +PL + + + E++ ++ ES + D D S D +R A Sbjct: 134 KPLDDVSNTSSHLQDIEVTNVD----ESDHTRGDDMPEGDRSIDPSTAEDPFDQLRHAST 189 Query: 403 TDAPHAKFDATTFTES-LPHVDEQTMHRFSELKERHP 438 D H D T + + VD+Q EL + HP Sbjct: 190 DDISHGDKDNQNETATEMEGVDKQ------ELPDEHP 220 >gi|194754609|ref|XP_001959587.1| GF11968 [Drosophila ananassae] gi|190620885|gb|EDV36409.1| GF11968 [Drosophila ananassae] Length = 972 Score = 41.8 bits (96), Expect = 0.25, Method: Composition-based stats. 
Identities = 54/230 (23%), Positives = 84/230 (36%), Gaps = 20/230 (8%) Query: 255 ITERLPYKHGVKSSSPGLHTSFDAYEAHTDT--LAHGVDSLVRGEYPHFDQEKLQTIADN 312 T+ H HT+ D EAH L + E H L+ Sbjct: 744 ATDLEEAHHPAPDLEEAHHTAPDLEEAHHPAPDLEEAHHTAPDLEEAHHTAPDLEEAHHP 803 Query: 313 TLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSA 372 + +P P+ E + + E + + P A HP P +EV ++E A Sbjct: 804 ATDLEEAQPPAPDLEEVTRLLEEAHPRAPDLEEAHHPAPDLEEVLHPAPDLE------EA 857 Query: 373 RKFFDEGSPDHSPFKGERNQKLDPMRGADFT--DAPHAKFDATTFTESLPHV-DEQTMHR 429 + +EG P +P E + + A T D A AT E+ P D + + R Sbjct: 858 TRLLEEGYP-RAPDLEEAHHTAPDLEEAHHTAPDLEEAHHPATDLEEAQPPAPDLEEVTR 916 Query: 430 FSELKERHP-----VEAREVLEGLQEKLQGTKEIKTKS-LIKEAINCFLR 473 L+E HP EA L+E L +++ + L++EA F R Sbjct: 917 L--LEEAHPRAPDLEEAHHPAPDLEEVLHPAPDLEEATRLLEEATKLFGR 964 >gi|17986031|ref|NP_523441.1| kismet, isoform A [Drosophila melanogaster] gi|7230509|gb|AAF43004.1|AF215703_1 KISMET-L long isoform [Drosophila melanogaster] gi|22945599|gb|AAF51527.3| kismet, isoform A [Drosophila melanogaster] Length = 5322 Score = 41.8 bits (96), Expect = 0.27, Method: Composition-based stats. 
Identities = 42/206 (20%), Positives = 73/206 (35%), Gaps = 17/206 (8%) Query: 259 LPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDS------LVRGEYPHFDQEKLQTIADN 312 L +++SS + + TD + S + G+ + + + Sbjct: 1597 LDESSQLEASSSTSAVAEKERQISTDAANAAMSSKPNYVYINTGDEDSMVVQLVLAMRMG 1656 Query: 313 TLEDPHFKPHLPEPEPLP-QYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESS 371 E KP PEP + K D +P + E +++L++ E K ESS Sbjct: 1657 KRELILDKPKEKAPEPKQDEEKSELDEATTDKPEGDEKFKTEGESKKDLTDSEETKLESS 1716 Query: 372 ARKF--FDEGSPDHSPFKGERNQKLDPMR------GADFTDAPHAKFDATTFTESLPHVD 423 A + +E PD S E N+ D M +D P + + E+ ++ Sbjct: 1717 AMEVDSKEESEPDDSKKSDEDNKDKDKMEVDDEVGKSDKESKPEEQSETVKTEENSKAIE 1776 Query: 424 EQTMHRFSELKERHPVEAREVLEGLQ 449 E + L H E VLE ++ Sbjct: 1777 EDKSS--TVLTADHAKEPETVLEKME 1800 >gi|326480010|gb|EGE04020.1| GRAM domain-containing protein YSP2 [Trichophyton equinum CBS 127.97] Length = 1254 Score = 41.4 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 41/274 (14%), Positives = 84/274 (30%), Gaps = 20/274 (7%) Query: 15 IKEWAQRPRVSPDI---KWHTGLGKEVINMPARSLDKLVAPFREETH----DQPNYYRGS 67 + ++ P+ ++ G +++ + L + + Q G Sbjct: 381 VSPASEDPKSQGKPSTSQFGAGFFSSMVSAAQNAATTLSSSLNPQAKGSKTSQEQNPEGD 440 Query: 68 RTDPHSVGTGAHLVEGLTSLAP-------YIAGAALAGKLLSFIPTP-LTRLAGLALQSA 119 D A G ++AP + S + L + AG + Sbjct: 441 TRDSGEQEKPAATPGGEENVAPQDGKKELAVNTLGTGDLDFSHLGLEHLEKAAGDDEGNK 500 Query: 120 PLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS---I 176 AG A + + ++E V A ++A+ V + ++ + Sbjct: 501 LDVAGRPRAKTAVSQRDELAARMEDVRAARAVSMAYGNTPVTPIVTVDGINADNRPANPL 560 Query: 177 AKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH 236 V A N P G +++ L+ +G + R + T+ IGA G Sbjct: 561 NTVVRDNAGENTPPGGSVHSETAESLKQNGSLRSRRARRDRGSSAATTNTTIGAPIGT-- 618 Query: 237 SKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSP 270 + +N S+ + +R H + S P Sbjct: 619 NLTARNTSVPRLTGFAVASKKRNRDFHSLFRSVP 652 >gi|326468510|gb|EGD92519.1| GRAM domain-containing protein [Trichophyton tonsurans CBS 112818] Length = 1254 Score = 41.4 bits (95), Expect = 0.30, 
Method: Composition-based stats. Identities = 41/274 (14%), Positives = 84/274 (30%), Gaps = 20/274 (7%) Query: 15 IKEWAQRPRVSPDI---KWHTGLGKEVINMPARSLDKLVAPFREETH----DQPNYYRGS 67 + ++ P+ ++ G +++ + L + + Q G Sbjct: 381 VSPASEDPKSQGKPSTSQFGAGFFSSMVSAAQNAATTLSSSLNPQAKGSKTSQEQNPEGD 440 Query: 68 RTDPHSVGTGAHLVEGLTSLAP-------YIAGAALAGKLLSFIPTP-LTRLAGLALQSA 119 D A G ++AP + S + L + AG + Sbjct: 441 TRDSGEQEKPAATPGGEENVAPQDGKKELAVNTLGTGDLDFSHLGLEHLEKAAGDDEGNK 500 Query: 120 PLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS---I 176 AG A + + ++E V A ++A+ V + ++ + Sbjct: 501 LDVAGRPRAKTAVSQRDELAARMEDVRAARAVSMAYGNTPVTPIVTVDGINADNRPANPL 560 Query: 177 AKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMH 236 V A N P G +++ L+ +G + R + T+ IGA G Sbjct: 561 NTVVRDNAGENTPPGGSVHSETAESLKQNGSLRSRRARRDRGSSAATTNTTIGAPIGT-- 618 Query: 237 SKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSP 270 + +N S+ + +R H + S P Sbjct: 619 NLTARNTSVPRLTGFAVASKKRNRDFHSLFRSVP 652 >gi|269216177|ref|ZP_06160031.1| ATP synthase F1, alpha subunit [Slackia exigua ATCC 700122] gi|269130436|gb|EEZ61514.1| ATP synthase F1, alpha subunit [Slackia exigua ATCC 700122] Length = 522 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. 
Identities = 47/293 (16%), Positives = 90/293 (30%), Gaps = 55/293 (18%) Query: 45 SLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFI 104 + E Q S D VGT + +G+ + A+AG+LL FI Sbjct: 3 VTEITAKSIDEALRKQLEDLETS-VDAREVGTVVQVGDGIARIDGLKG--AMAGELLEFI 59 Query: 105 PTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSA 164 + GLA GA+ + + G E + +V+ Sbjct: 60 GADGRTVYGLAQNLEEEEVGAVLMGDVTAIRENDQVRTTGRIMEVPSGKSLLGRVVNPLG 119 Query: 165 L------------------LAPGAIASQSIAKTVASGAV---LNVPFGMVERGWSSKVLE 203 + APG I + + + + +G V +P G +R Sbjct: 120 MPIDGKGPIKAEGMRPVEFKAPGVIHRKPVHEPMQTGIVAVDTMIPIGRGQR-------- 171 Query: 204 DHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNM---------SLRLVNDLKEG 254 ++ R ++ D +I +++ ++M V ++ E Sbjct: 172 -----ELIIGDRQTGKTAIAIDAII--------NQKGKDMICIYVAVGQKASTVANVMET 218 Query: 255 ITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEY-PHFDQEKL 306 + + ++ + S+ ++ Y A A G + GE E L Sbjct: 219 LEKHGAMEYTIIVSATASDSAPLQYIAPMAGAAMGEHFVYTGEDGKPAGPENL 271 >gi|240953833|ref|XP_002399696.1| condensin-2 complex subunit H2, putative [Ixodes scapularis] gi|215490611|gb|EEC00254.1| condensin-2 complex subunit H2, putative [Ixodes scapularis] Length = 704 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. 
Identities = 54/288 (18%), Positives = 82/288 (28%), Gaps = 43/288 (14%) Query: 160 VHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLED-HGYPDM---AQHYR 215 +A L G+ + V + M RG + + ED G A H R Sbjct: 73 FTEAAFLIQGSASVYGKKVEYLYSLVQKLASEMTHRGNTGETGEDAQGVAKTGAGASHRR 132 Query: 216 IFD-MESLITDGLIGA----FFGGMHSKQVQNMSLRL----------------VNDLKEG 254 D SL D +G GG + + L+ L Sbjct: 133 KADYGFSLQEDMGLGENLDDAVGGRTGRAGRKTRLKRQRRLPLQLVLFEEDKQTRLLDVK 192 Query: 255 ITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTL 314 + PG D H +L E D + Sbjct: 193 GDVVGHQSDYMLHELPGCSCGPDRCRCHRTSLETQRRLDDSDEVSMADPADFDRGCPSIS 252 Query: 315 EDPHFKPHLPEPEPLPQY---KEHSDRQ--------KPSEPLAEHPHPKRKEVERELSEI 363 +D H LPE + ++ + +P P P+ V R + E Sbjct: 253 DDSHLSGQLPELDISGLGALHEDVGAFEECRAKPDVEPESEPTPVPLPRNSSVGRRIKEE 312 Query: 364 EGAKKESS--ARKFFD---EGSPDHSPFKGERNQKLDPMRGADFTDAP 406 KKE KF+D + S + P K R + P++ +D D P Sbjct: 313 NRVKKEDIILVHKFYDPMEDTSALNKPIKIMRRARKRPLKESD--DVP 358 >gi|254884948|ref|ZP_05257658.1| predicted protein [Bacteroides sp. 4_3_47FAA] gi|254837741|gb|EET18050.1| predicted protein [Bacteroides sp. 4_3_47FAA] Length = 1419 Score = 41.4 bits (95), Expect = 0.30, Method: Composition-based stats. 
Identities = 47/315 (14%), Positives = 89/315 (28%), Gaps = 44/315 (13%) Query: 71 PHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYL 130 + + ++ T + IA A LS + L + +S G + A L Sbjct: 936 TNVDKSSKKAIDNTTGVTNAIAQLGEADVSLSSFGDSVGSLVDVLSESGSKIGGIIAAIL 995 Query: 131 SHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPF 190 + + + V I T + F Sbjct: 996 AILDQIGDQGLDKFVGNILETVSNAVGGIFDTVGSI-----------------------F 1032 Query: 191 GMVERGWSSKVLEDHGYPDMAQHY----RIFDMESLITDGLIGAFFGGMHSKQVQ---NM 243 G+ G + GY +M Y I+D I +G SK + N+ Sbjct: 1033 GIKGAGGIFHGADYSGYNEMVAQYDNLLDIWDELLDKKKAYINESYGAEASKAGEEALNI 1092 Query: 244 SLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVR-------- 295 + ++ K+ RL + S S G +Y+ + R Sbjct: 1093 AKNELDVQKKLAEARLSAGSSIGSHSQGYRMWKGSYKWEGQNWRDVAGEISRKYGVTFNE 1152 Query: 296 -GEYPHFDQEKLQTIADN-----TLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHP 349 + + E LQ+I +N ++ D F+ HL + ++ + Sbjct: 1153 MKDMINMSPEVLQSIRENYAGLWSVMDGEFRNHLENIIKYGETEKEILEAVKEQVTGISF 1212 Query: 350 HPKRKEVERELSEIE 364 +S++E Sbjct: 1213 DSFEDSYWEMISDLE 1227 >gi|118114669|ref|XP_423552.2| PREDICTED: similar to ALR, partial [Gallus gallus] Length = 1172 Score = 41.4 bits (95), Expect = 0.33, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 7/76 (9%) Query: 315 EDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARK 374 ++P PE PL ++H + Q P +P + P E+EL + + K+ + Sbjct: 273 DEPPVDEMPPEKPPLD--EQHLEEQPPEKPPLDEQPPIELPSEKELLDEQPPKELHPEKP 330 Query: 375 FFD-----EGSPDHSP 385 D E PD P Sbjct: 331 LLDELPLAEQPPDEPP 346 >gi|317509188|ref|ZP_07966812.1| hypothetical protein HMPREF9336_03184 [Segniliparus rugosus ATCC BAA-974] gi|316252545|gb|EFV11991.1| hypothetical protein HMPREF9336_03184 [Segniliparus rugosus ATCC BAA-974] Length = 1053 Score = 41.4 bits (95), Expect = 0.33, Method: Composition-based stats. 
Identities = 26/115 (22%), Positives = 38/115 (33%), Gaps = 5/115 (4%) Query: 248 VNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTL-AHGVDSLVRGEYPHFDQEKL 306 G ++PG +TS + A +SL G P +E Sbjct: 908 APQSAWGSATPGDPSPAAWDAAPGDYTSQREAKEKAQAYRAAAEESLRTGGAPAPQRETS 967 Query: 307 QT--IADNTLEDPHFKPHLPEPEPLPQYKEHSDRQ-KPS-EPLAEHPHPKRKEVE 357 QT E+ P E +P + RQ +P EP P P+R + E Sbjct: 968 QTSQAYRAAAEESLRAPAPSETQPASPAQSEPARQAEPQREPQEREPQPQRPDEE 1022 >gi|303317268|ref|XP_003068636.1| serine/threonine-protein kinase, putative [Coccidioides posadasii C735 delta SOWgp] gi|240108317|gb|EER26491.1| serine/threonine-protein kinase, putative [Coccidioides posadasii C735 delta SOWgp] Length = 795 Score = 41.4 bits (95), Expect = 0.35, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 60/183 (32%), Gaps = 17/183 (9%) Query: 265 VKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLP 324 ++ +P A + + RG+ + LQT + + Sbjct: 238 MEDETPSEPVDEAALIEQRRRRREAIKAKYRGQATPLLVQALQTGNETGSTACDASEAVS 297 Query: 325 EP-------EPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFD 377 +P P + S Q P++ + S +E K+E+SA Sbjct: 298 KPDLSGRQGSPTNTLDDTSTAQSPTDLHVSRDEDLANTDLQSRSGLE--KEEASA----- 350 Query: 378 EGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLPHVDEQTMHRFSELKERH 437 D+ P R +K+ + D P + +D T T V E T +++K + Sbjct: 351 ---ADYDPTADMRQEKMKHDKRHFGEDMPASAYDETKVTRQEVLVPEPTAADPNQMKAKD 407 Query: 438 PVE 440 P + Sbjct: 408 PFD 410 >gi|221330583|ref|NP_001137761.1| kismet, isoform C [Drosophila melanogaster] gi|220901895|gb|ACL82968.1| kismet, isoform C [Drosophila melanogaster] Length = 5517 Score = 41.4 bits (95), Expect = 0.36, Method: Composition-based stats. 
Identities = 42/206 (20%), Positives = 73/206 (35%), Gaps = 17/206 (8%) Query: 259 LPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDS------LVRGEYPHFDQEKLQTIADN 312 L +++SS + + TD + S + G+ + + + Sbjct: 1597 LDESSQLEASSSTSAVAEKERQISTDAANAAMSSKPNYVYINTGDEDSMVVQLVLAMRMG 1656 Query: 313 TLEDPHFKPHLPEPEPLP-QYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESS 371 E KP PEP + K D +P + E +++L++ E K ESS Sbjct: 1657 KRELILDKPKEKAPEPKQDEEKSELDEATTDKPEGDEKFKTEGESKKDLTDSEETKLESS 1716 Query: 372 ARKF--FDEGSPDHSPFKGERNQKLDPMR------GADFTDAPHAKFDATTFTESLPHVD 423 A + +E PD S E N+ D M +D P + + E+ ++ Sbjct: 1717 AMEVDSKEESEPDDSKKSDEDNKDKDKMEVDDEVGKSDKESKPEEQSETVKTEENSKAIE 1776 Query: 424 EQTMHRFSELKERHPVEAREVLEGLQ 449 E + L H E VLE ++ Sbjct: 1777 EDKSS--TVLTADHAKEPETVLEKME 1800 >gi|319785587|ref|YP_004145063.1| peptidoglycan-binding domain 1 protein [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171475|gb|ADV15013.1| Peptidoglycan-binding domain 1 protein [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 1345 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 40/140 (28%), Gaps = 5/140 (3%) Query: 36 KEVINMPARSLDKLVAPFREETHDQPNYYRGS---RTDPHSVGTGAHLVEGLTSLAPYIA 92 K PA+ V P Q + +TD + + P Sbjct: 897 KAFFADPAQVASNDVVPIVASQPMQTASLDAASQPKTDAQPAAVESAPDHAVRQAEPSAP 956 Query: 93 GAA--LAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETA 150 +A + + P+ G A + P+ A A + + ++ + Sbjct: 957 ATKDDMAAQSMIATPSGAEPSMGAAPIAEPVPAPVALATSADASPAASETTTASAAPAST 1016 Query: 151 DALAWREAIVHTSALLAPGA 170 +A A T+ + P A Sbjct: 1017 VPVAAEPADTDTATGIQPTA 1036 >gi|330468592|ref|YP_004406335.1| cytochrome c oxidase subunit i [Verrucosispora maris AB-18-032] gi|328811563|gb|AEB45735.1| cytochrome c oxidase, subunit i [Verrucosispora maris AB-18-032] Length = 668 Score = 41.0 bits (94), Expect = 0.41, Method: Composition-based stats. 
Identities = 29/122 (23%), Positives = 48/122 (39%), Gaps = 18/122 (14%) Query: 294 VRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPS--------EPL 345 +R E P FD + + ++D + P P+ ++E R+ PS E Sbjct: 547 IRSERPAFDAKYGELVSDLGRDLPQRTTKPPQGLRDELHREKHHRESPSAEGAHGAPEAT 606 Query: 346 AEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLD------PMRG 399 A HP P+ E+ + + ++ S F D P+ +P ER + D P G Sbjct: 607 AYHPAPQSGARPVEVPDPQNVRRPS----FDDTDEPEDNPLGAERRSETDDDRWRHPRSG 662 Query: 400 AD 401 D Sbjct: 663 GD 664 >gi|167838495|ref|ZP_02465354.1| hypothetical protein Bpse38_18442 [Burkholderia thailandensis MSMB43] Length = 283 Score = 41.0 bits (94), Expect = 0.44, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 56/188 (29%), Gaps = 24/188 (12%) Query: 228 IGAFFG-GMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTL 286 IG +G GM +N L ++ P ++ + P + H +L Sbjct: 13 IGLAYGPGMAEFAARNAHLVDYIEVPFEQLRFSPAVAELQQTIP--------FVLHCASL 64 Query: 287 AHGVDSLVRGEYPHFDQEKLQTIADNTLED--PHFKPHLPEPEPLPQYKEHSDRQKPSEP 344 + + D + I L+ P HL P +E +P+ Sbjct: 65 SIA-------GFVPPDASTVDAIERTALQTGTPWIGEHLAYISADPIGEELGGAGEPTSL 117 Query: 345 LAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTD 404 E R + + A + E SP + P G M ADF Sbjct: 118 SYTLCPQLSDETVRRVVDNLAALRPHFPVPLIVENSPQYFPIPGS------TMGMADFIR 171 Query: 405 APHAKFDA 412 A + DA Sbjct: 172 AIAQRCDA 179 >gi|260785738|ref|XP_002587917.1| hypothetical protein BRAFLDRAFT_87305 [Branchiostoma floridae] gi|229273072|gb|EEN43928.1| hypothetical protein BRAFLDRAFT_87305 [Branchiostoma floridae] Length = 503 Score = 41.0 bits (94), Expect = 0.45, Method: Composition-based stats. 
Identities = 22/86 (25%), Positives = 40/86 (46%), Gaps = 8/86 (9%) Query: 297 EYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPL-PQYKEHSDRQKPSEPLAEHPHPKRKE 355 E P F E + A+N P F+P PE +P P+++ + +P P + +P+ + Sbjct: 77 ENPEFQPENPELQAEN----PEFQPENPELQPENPEFQPENPELQPENPELQPENPENQP 132 Query: 356 VEREL---SEIEGAKKESSARKFFDE 378 EL + + G E + +FD+ Sbjct: 133 ETPELQPGASLLGTTAEGDVQGYFDD 158 >gi|296228399|ref|XP_002759787.1| PREDICTED: xin actin-binding repeat-containing protein 1-like [Callithrix jacchus] Length = 1822 Score = 41.0 bits (94), Expect = 0.47, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 42/130 (32%), Gaps = 12/130 (9%) Query: 269 SPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEP 328 +P S + L V +L + D + L+ + + + P P Sbjct: 1437 APESPASLQRNQNELQGLLTQVQALEKEAESSVDVQALRRLFEAVPQLEGAAPQAPTTRQ 1496 Query: 329 LPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGA-----------KKESSARKFFD 377 P+ + + E +++ L +IE A + E+SAR F Sbjct: 1497 KPEASVEQAFGELTRVSTEVAR-LKEQTLARLLDIEEAVHKALSSMSSLQPEASARGHFQ 1555 Query: 378 EGSPDHSPFK 387 DHS K Sbjct: 1556 GPPKDHSAHK 1565 >gi|255283111|ref|ZP_05347666.1| sortase B signal domain, QVPTGV class family [Bryantella formatexigens DSM 14469] gi|255266413|gb|EET59618.1| sortase B signal domain, QVPTGV class family [Bryantella formatexigens DSM 14469] Length = 1150 Score = 40.7 bits (93), Expect = 0.49, Method: Composition-based stats. 
Identities = 49/310 (15%), Positives = 98/310 (31%), Gaps = 41/310 (13%) Query: 73 SVGTGAHLVEGLT----SLAPYIAGAALAGKLLSF--------IPTPLTRLAGLALQSAP 120 VG +++G++ + P G GK+ + + +L +P Sbjct: 756 FVGIDDAVLDGVSLPISTDMPEYTGEVPEGKVFAGWTLNQGTEVYKAGEKLVLTTENGSP 815 Query: 121 LAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTV 180 + + + E+ + EG+ A + + + P + V Sbjct: 816 AFGKYAFVFEPYFEEA-QTLKAEGIVSFVGIDGAVLDGVSLPISTDMP-EYTGEVPEGKV 873 Query: 181 ASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFD-----MESLITDGLIGAFFGGM 235 +G LN + + G + ++G P ++ +F+ + L +G+ F G+ Sbjct: 874 FAGWTLNQGTEVYKAGEKLVLTTENGSPAFGKYAFVFEPYFEEAQILKAEGI--VSFVGI 931 Query: 236 HSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVR 295 + +SL + D+ E E + TL G + Sbjct: 932 DGAVLDGVSLPISTDMPEYTGE-----------------VPEGKVFAGWTLNQGTEVYKA 974 Query: 296 GEYPHFDQEKLQTIAD--NTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAE-HPHPK 352 GE E + +P+F+ P EP + + SEP +E P Sbjct: 975 GERLVLTTENGSPAFGKYAFVFEPYFEDQEPTSEPTSEPTSEPTSEPTSEPTSEPTSEPT 1034 Query: 353 RKEVERELSE 362 + SE Sbjct: 1035 SEPTSEPTSE 1044 >gi|237841197|ref|XP_002369896.1| hypothetical protein TGME49_120290 [Toxoplasma gondii ME49] gi|211967560|gb|EEB02756.1| hypothetical protein TGME49_120290 [Toxoplasma gondii ME49] gi|221483590|gb|EEE21902.1| conserved hypothetical protein [Toxoplasma gondii GT1] Length = 534 Score = 40.7 bits (93), Expect = 0.55, Method: Composition-based stats. 
Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 5/92 (5%) Query: 273 HTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQ---EKLQTIADNTLEDPHFKP-HLPEPEP 328 + AHT ++ + +RG P A+N ED +P E Sbjct: 42 PHDTFSRPAHTHSVLIATAAELRGNAPPVSPGTTRATDAAAENKAEDSSSEPGESTEVAQ 101 Query: 329 LPQYKEHSDRQKPSEPLAE-HPHPKRKEVERE 359 LP + S AE H + VE++ Sbjct: 102 LPAQEVSSGEPSAETTPAESHDTSESDPVEKD 133 >gi|38344657|emb|CAE02319.2| OSJNBb0112E13.1 [Oryza sativa Japonica Group] gi|38346564|emb|CAE03785.2| OSJNBa0063G07.9 [Oryza sativa Japonica Group] gi|116309495|emb|CAH66563.1| OSIGBa0113K06.9 [Oryza sativa Indica Group] gi|218194590|gb|EEC77017.1| hypothetical protein OsI_15361 [Oryza sativa Indica Group] Length = 303 Score = 40.7 bits (93), Expect = 0.58, Method: Composition-based stats. Identities = 21/100 (21%), Positives = 41/100 (41%), Gaps = 3/100 (3%) Query: 283 TDTLAHGVDSLVRGE---YPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQ 339 L G +SL DQ K D +E + L E + L + ++ ++Q Sbjct: 201 QKELEEGRESLRNIRLKLDMPPDQRKRLIRRDLRVEHQEIERQLQEMQQLERERQQLEQQ 260 Query: 340 KPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEG 379 + L +++E++R+L ++E ++ R F G Sbjct: 261 EIERQLPSRQQLEQQEIKRQLQDMERERQLHRWRNFVTGG 300 >gi|262198713|ref|YP_003269922.1| hypothetical protein Hoch_5546 [Haliangium ochraceum DSM 14365] gi|262082060|gb|ACY18029.1| hypothetical protein Hoch_5546 [Haliangium ochraceum DSM 14365] Length = 1503 Score = 40.3 bits (92), Expect = 0.64, Method: Composition-based stats. 
Identities = 50/322 (15%), Positives = 79/322 (24%), Gaps = 34/322 (10%) Query: 51 APFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTR 110 P E D G V L S AP G+L P Sbjct: 402 QPVAAEASDSEGGGAGGAGGAGGEAGAETAVPDLASAAPEAG----LGQLQGVRPDKQQT 457 Query: 111 LAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGA 170 G + + S A++ +G ETA + + + + A + Sbjct: 458 ALGGVRAA---IGTDVGESRSELAQNPPQQMSDGDAAETAASGEQAASEASSDSAAATES 514 Query: 171 IASQSIAKTVASGAVLNVPFGMVERGWS----SKVLEDHGYPDMAQHY--RIFDMESLIT 224 A+ A + G + + G + A +I D + Sbjct: 515 AAASPEGNAAAGAETADTIAGTEAEPEAPADEAASQTREGEAEQANDAATQILDDIASTI 574 Query: 225 DGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTD 284 L G+FFGG M+ + L + P + EA Sbjct: 575 SSLFGSFFGGAAENAANQMAKAEADGLASSLDNLSTKSDVAADPGPA-PELAVSTEAQAT 633 Query: 285 TLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEP 344 D+ L+ D P ++ PSE Sbjct: 634 AKQ--------------DRAALEQQVDGA------AQQTAAEVQRPMGEDSIATTVPSEQ 673 Query: 345 LAEHPHPKRKEVERELSEIEGA 366 L P E L ++ A Sbjct: 674 LRAAPIESAAASEIALPDVATA 695 >gi|170725919|ref|YP_001759945.1| phosphoribosylformylglycinamidine synthase [Shewanella woodyi ATCC 51908] gi|169811266|gb|ACA85850.1| phosphoribosylformylglycinamidine synthase [Shewanella woodyi ATCC 51908] Length = 1293 Score = 40.3 bits (92), Expect = 0.64, Method: Composition-based stats. 
Identities = 48/328 (14%), Positives = 94/328 (28%), Gaps = 28/328 (8%) Query: 56 ETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLA 115 E D+ RGS+ G ++ + P+ A ++++ + G A Sbjct: 313 EIRDEGATGRGSKPKAGLTGFSVSNLKIPGFVQPWEADYGKPERIVTALDIMTEGPLGGA 372 Query: 116 LQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQS 175 + AL Y + H V + + + + + Sbjct: 373 AFNNEFGRPALLGYFRTYEQEVSSHNGVEV-RGYHKPIMLAGGLGNIRGEHVQKGEITVG 431 Query: 176 IAKTVASGAVLNVPFGM--VERGWSSKVLEDHGYPDMAQHYRIFD-------------ME 220 V G +N+ G S + ED + + + + + Sbjct: 432 AKLIVLGGPAMNIGLGGGAASSMASGESSEDLDFASVQRENPEMERRCQEVIDRCWQMGD 491 Query: 221 SLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGIT-ERLPYKHGVKSSSPGLHTSFDAY 279 + GG+ N LVND G E SP ++ Sbjct: 492 RNPIQFIHDVGAGGL-----SNAFPELVNDGDRGGKFELRNVPSDEPGMSPLEIWCNESQ 546 Query: 280 EAHTDTLA----HGVDSLVRGEYPHFDQEKLQTIADN-TLEDPHFKPHLPEPEPLPQYKE 334 E + ++A ++ E F + T + +L D HF + PL Sbjct: 547 ERYVMSVAPENLEVFTAICERERAPFSVVGVATEERHLSLSDEHFNDKPIDL-PLEVLLG 605 Query: 335 HSDRQKPSEPLAEHPHPKRKEVERELSE 362 + + A+ P+ + + E+ E Sbjct: 606 KAPKMSRDVVTAKALSPELNQEKIEIKE 633 >gi|212545080|ref|XP_002152694.1| dihydrolipoamide succinyltransferase, putative [Penicillium marneffei ATCC 18224] gi|210065663|gb|EEA19757.1| dihydrolipoamide succinyltransferase, putative [Penicillium marneffei ATCC 18224] Length = 476 Score = 40.3 bits (92), Expect = 0.66, Method: Composition-based stats. 
Identities = 30/118 (25%), Positives = 49/118 (41%), Gaps = 11/118 (9%) Query: 320 KPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHP-KRKEVERELSEIEGAKKESSARKFFDE 378 P+ EP P+ KE PS+P + P K + V+ + E K + RK + Sbjct: 177 AAEKPKHEPAPEKKEEKTEASPSKPETKEAAPSKPEPVKEKQPE---RPKPTEPRKEAEP 233 Query: 379 GSPDHSPFKGERNQKLDPMR---GADFTDAPHAKFDATTFTESLPHVDEQTMHRFSEL 433 +P + + ER K++ MR + + TTF E VD ++ F +L Sbjct: 234 STPAQAGGREERRVKMNRMRLRIAERLKQSQNTAASLTTFNE----VDMSSLMEFRKL 287 >gi|308198333|ref|XP_001386996.2| DNA-directed RNA polymerase II largest subunit (RNA polymerase II subunit 1) (B220) [Scheffersomyces stipitis CBS 6054] gi|149388976|gb|EAZ62973.2| DNA-directed RNA polymerase II largest subunit (RNA polymerase II subunit 1) (B220) [Pichia stipitis CBS 6054] Length = 1739 Score = 40.3 bits (92), Expect = 0.75, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 78/255 (30%), Gaps = 11/255 (4%) Query: 19 AQRPRVSPDI--KWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTDPHSVGT 76 Q +PD ++ G + D + P + + N + + T Sbjct: 1287 MQHKVNTPDATGEFKQGKEWVLETDGVNLADVMAVPGVDSSRTYSNNFIEILSVLGIEAT 1346 Query: 77 GAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLS-HKAE 135 A L + + ++ + + + +R +A+ + A + E Sbjct: 1347 RAALFKEILNVLSFDGSYVNYRHMALLVDVMTSRGHLMAITRHGINRSDTGALMRCSFEE 1406 Query: 136 SSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVER 195 + G E D E ++ + + A NV Sbjct: 1407 TVEILLEAGASAELDDCRGISENVMLGQMAPLGTGAFDVMLDDKMLQTAPSNVAVAAGN- 1465 Query: 196 GWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGI 255 + +D G A YR +DME GA F +H+ QVQ++S L + + Sbjct: 1466 ---DEFADDGG----ATPYREYDMEDDKIQFEEGAGFSPIHTAQVQDVSGGLTSYGGQPT 1518 Query: 256 TERLPYKHGVKSSSP 270 + S+SP Sbjct: 1519 SPSATSPFSYGSTSP 1533 >gi|223993045|ref|XP_002286206.1| transketolase [Thalassiosira pseudonana CCMP1335] gi|220977521|gb|EED95847.1| transketolase [Thalassiosira pseudonana CCMP1335] Length = 719 Score = 40.3 bits (92), Expect = 0.78, Method: Composition-based stats. 
Identities = 41/279 (14%), Positives = 84/279 (30%), Gaps = 43/279 (15%) Query: 130 LSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLN-- 187 SH + +GV+ T + I + + + N Sbjct: 123 GSHTPGHPENFCTKGVEVCT---GPLGQGISNAVGMAIAAKHLGAIYNTADFPNIISNKT 179 Query: 188 ---VPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMS 244 G ++ G S + G+ + ++D + DG G F +K+ + Sbjct: 180 YVICGDGCLQEGISGEACSLAGHLGLGDLIVLYDDNHITIDGDTGLAFTEDVNKRYEAYG 239 Query: 245 --LRLVNDLKEGI--TERLPYKHGVKSSSP-----------GLHTSFDAYEAHTDTLAHG 289 ++ V D+ G+ + + + P G + +H L Sbjct: 240 WHVQTVGDVANGLEDLNKAIAEAKKVTDKPSLIKIRTEIGFGSPHKQGSASSHGAPLGDR 299 Query: 290 VDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHP 349 LV+ D Q+ + + ++K E + + + ++ HP Sbjct: 300 EIELVKSRLYGCDPS--QSFFVDEDVNAYYKQQAAEGDAARAAWD----AEFAKYKVAHP 353 Query: 350 HPKRKEVEREL-------------SEIEGAKKESSARKF 375 K E+ER + + G K ++ RKF Sbjct: 354 D-KASELERRFKHELPNDVFDDLPTFVYGKDKANATRKF 391 >gi|260804555|ref|XP_002597153.1| hypothetical protein BRAFLDRAFT_118100 [Branchiostoma floridae] gi|229282416|gb|EEN53165.1| hypothetical protein BRAFLDRAFT_118100 [Branchiostoma floridae] Length = 472 Score = 40.3 bits (92), Expect = 0.81, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 49/136 (36%), Gaps = 10/136 (7%) Query: 239 QVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEY 298 V+ ++ G TE V +S P + + + ++ D LA L + Sbjct: 29 DVKEERAPAEEEMVRGTTECS-----VVTSQPPM-SQPRSSKSSADVLAE----LRQDGL 78 Query: 299 PHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVER 358 + L+ P +P P P+ K Q+ E + + P R ++ + Sbjct: 79 LPLNTRGESVAFQVLLDKPASEPDAPPRRPVKLAKLEETLQERRERVKKEPAGSRTQLRQ 138 Query: 359 ELSEIEGAKKESSARK 374 +LS ++E A + Sbjct: 139 QLSNAANRREEMLAER 154 >gi|225562962|gb|EEH11241.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR] Length = 439 Score = 40.3 bits (92), Expect = 0.82, Method: Composition-based stats. 
Identities = 31/192 (16%), Positives = 60/192 (31%), Gaps = 19/192 (9%) Query: 185 VLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMS 244 +N + L+ D+ F E D +I G + +Q + Sbjct: 39 TVNQILSKGLHNTDGECLKYT--TDLMDKLEKFKSEHADDDTVIDDAAGQAYVEQFGLET 96 Query: 245 LRLVNDLKEGITERLPYKHGVKSSS-------------PGLHTSFDAYEAHTDTLAHGVD 291 + ++ L ++++ P T + H +A Sbjct: 97 FQRADNAVRANKASLQTADTFQAAATFLELCQIWGPIDPETATKIKFAKYHALRIAKA-- 154 Query: 292 SLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQ-KPSEPLAEHPH 350 L GE P+ ++ +N + P P+ PE + L S + K +P E Sbjct: 155 -LKAGEDPNLSNPSMEEEEENLRDGPTLDPNDPEVQALNGSPSQSVPEVKLRQPSVEDVP 213 Query: 351 PKRKEVERELSE 362 + ER L++ Sbjct: 214 DEFDSEERRLAQ 225 >gi|240279783|gb|EER43288.1| conserved hypothetical protein [Ajellomyces capsulatus H143] gi|325092915|gb|EGC46225.1| conserved hypothetical protein [Ajellomyces capsulatus H88] Length = 439 Score = 39.9 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 60/192 (31%), Gaps = 19/192 (9%) Query: 185 VLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMS 244 +N + L+ D+ F E D +I G + +Q + Sbjct: 39 TVNQILSKGLHNTDGECLKYT--TDLMDKLEKFKSEHADDDTVIDDAAGQAYVEQFGLET 96 Query: 245 LRLVNDLKEGITERLPYKHGVKSSS-------------PGLHTSFDAYEAHTDTLAHGVD 291 + ++ L ++++ P T + H +A Sbjct: 97 FQRADNAVRANKASLQTADTFQAAATFLELCQIWGPIDPETATKIKFAKYHALRIAKA-- 154 Query: 292 SLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQ-KPSEPLAEHPH 350 L GE P+ ++ +N + P P+ PE + L S + K +P E Sbjct: 155 -LKAGEDPNLSNPSMEEEEENLRDGPTLDPNDPEVQALNGSPSQSVPEVKLRQPSVEDVP 213 Query: 351 PKRKEVERELSE 362 + ER L++ Sbjct: 214 DEFDSEERRLAQ 225 >gi|123469906|ref|XP_001318162.1| viral A-type inclusion protein [Trichomonas vaginalis G3] gi|121900914|gb|EAY05939.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3] Length = 5296 Score = 39.9 bits (91), Expect = 0.90, Method: Composition-based stats. 
Identities = 27/137 (19%), Positives = 55/137 (40%), Gaps = 4/137 (2%) Query: 239 QVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEY 298 +NM L +K+ + + ++ + L + + L +D L R + Sbjct: 4190 DSKNMLLDSFGTIKDHLNDANNNNKKLQDENNKLRDDAQKATSKNNELQSIIDDLNR-KL 4248 Query: 299 PHFDQEKLQTIADNTLEDPHFKPHLPEPEPL--PQYKEHSDRQKPSEPLAEHPHPKRKEV 356 + D EK T + K E + + + +++ E LA+ ++K+V Sbjct: 4249 ANLDAEKKATEEKLKNTEDKLKQAEAEKKATEDKLRETENAKKETEEKLAKTEE-EKKQV 4307 Query: 357 ERELSEIEGAKKESSAR 373 E +L+ E AKKE+ + Sbjct: 4308 EDKLAATEAAKKETEDK 4324 >gi|126307916|ref|XP_001364954.1| PREDICTED: similar to amine oxidase, copper containing 3 [Monodelphis domestica] Length = 499 Score = 39.9 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 22/102 (21%), Positives = 38/102 (37%), Gaps = 3/102 (2%) Query: 264 GVKSSSPGLHTSF--DAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKP 321 G+ +S + + + E H A + L RG P +E L + P+ Sbjct: 80 GLVDASRAIPSDNCVYSVELHLPPKAVALAHLDRGGPPP-PREALALVFFGQQARPNVSE 138 Query: 322 HLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEI 363 L P P P Y ++ PL H P + E++++ Sbjct: 139 LLVGPLPNPSYLRDVTVERHGGPLPYHRRPLSTKEMEEMNKM 180 >gi|154280286|ref|XP_001540956.1| predicted protein [Ajellomyces capsulatus NAm1] gi|150412899|gb|EDN08286.1| predicted protein [Ajellomyces capsulatus NAm1] Length = 279 Score = 39.9 bits (91), Expect = 0.93, Method: Composition-based stats. 
Identities = 30/192 (15%), Positives = 59/192 (30%), Gaps = 19/192 (9%) Query: 185 VLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMS 244 +N + L+ D+ F E D +I G + +Q + Sbjct: 39 TVNQILSKGLHNTDGECLKYT--TDLMDKLEKFKSEHADDDTVIDDAAGQAYVEQFGLET 96 Query: 245 LRLVNDLKEGITERLPYKHGVKSSS-------------PGLHTSFDAYEAHTDTLAHGVD 291 + ++ L ++++ P + H +A Sbjct: 97 FQRADNAVRANKASLQTADTFQAAATFLELCQIWGSIDPETAAKIKFAKYHALRIAKA-- 154 Query: 292 SLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQ-KPSEPLAEHPH 350 L GE P+ ++ +N + P P+ PE + L S + K +P E Sbjct: 155 -LKAGEDPNLSNPSMEEEEENLRDGPTLDPNDPEVQALNGSPSQSVPEVKLRQPSVEDVP 213 Query: 351 PKRKEVERELSE 362 + ER L++ Sbjct: 214 DEFDNEERRLAQ 225 >gi|262195344|ref|YP_003266553.1| hypothetical protein Hoch_2115 [Haliangium ochraceum DSM 14365] gi|262078691|gb|ACY14660.1| hypothetical protein Hoch_2115 [Haliangium ochraceum DSM 14365] Length = 1637 Score = 39.9 bits (91), Expect = 0.97, Method: Composition-based stats. Identities = 50/322 (15%), Positives = 79/322 (24%), Gaps = 34/322 (10%) Query: 51 APFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTR 110 P E D G V L S AP G+L P Sbjct: 402 QPVAAEASDSEGGGAGGAGGAGGEAGAETAVPDLASAAPEAG----LGQLQGVRPDKQQT 457 Query: 111 LAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGA 170 G + + S A++ +G ETA + + + + A + Sbjct: 458 ALGGVRAA---IGTDVGESRSELAQNPPQQMSDGDAAETAASGEQAASEASSDSAAATES 514 Query: 171 IASQSIAKTVASGAVLNVPFGMVERGWS----SKVLEDHGYPDMAQHY--RIFDMESLIT 224 A+ A + G + + G + A +I D + Sbjct: 515 AAASPEGNAAAGAETADTIAGTEAEPEAPADEAASQTREGEAEQANDAATQILDDIASTI 574 Query: 225 DGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTD 284 L G+FFGG M+ + L + P + EA Sbjct: 575 SSLFGSFFGGAAENAANQMAKAEADGLASSLDNLSTKSDVAADPGPA-PELAVSTEAQAT 633 Query: 285 TLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEP 344 D+ L+ D P ++ PSE Sbjct: 634 AKQ--------------DRAALEQQVDGA------AQQTAAEVQRPMGEDSIATTVPSEQ 673 Query: 345 LAEHPHPKRKEVERELSEIEGA 366 L P E L ++ A Sbjct: 674 LRAAPIESAAASEIALPDVATA 695 
>gi|311110217|ref|ZP_07711614.1| putative membrane protein YdgH [Lactobacillus gasseri MV-22] gi|311065371|gb|EFQ45711.1| putative membrane protein YdgH [Lactobacillus gasseri MV-22] Length = 1155 Score = 39.9 bits (91), Expect = 0.99, Method: Composition-based stats. Identities = 47/327 (14%), Positives = 98/327 (29%), Gaps = 24/327 (7%) Query: 2 YFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQP 61 Y A+ + D K+ K + ++ + S K VA + + Sbjct: 621 YAQAMDSAKLNDQQKQAMSVALNQILGKVESASSQK--SQALTSSLKSVAGNIQAAGEAD 678 Query: 62 NYYRGSRTDPHSVGTG-AHLVEGLTSLAPYIAGAALAG-KLLSFIPTPLTRLA-GLALQS 118 S + + ++ + +L + A A K L T L +L+ GL Sbjct: 679 KKLGQSASSVGATLQNLQGMMSQVATLKQEVNTLANASNKALPGATTALNQLSSGLTQVQ 738 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAK 178 + +A GA A + + +++ + ++ + + + LA GA + + Sbjct: 739 SAVAQGAAGASRLNDGAARLNNGAGQLATGLQAGVSGSSQLANGAGQLANGAGQLNTGLQ 798 Query: 179 TVASGA-----VLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFG 233 SG ++ G + S + G +A + G G Sbjct: 799 AGLSGTNQLANGIDQLNGGAGQLASGAGQLNGGSGQLAN------GIGQLNGGASQLANG 852 Query: 234 GMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSL 293 ++ + G+ E Y G+ S+ G A H+ T +D+ Sbjct: 853 AGQIASNNPKITAGIDKVNSGLGEGQEYLTGLADSAAGKTFYIPAKMIHSSTFKPALDNY 912 Query: 294 VRGEY--------PHFDQEKLQTIADN 312 + + D + Sbjct: 913 MSSDLKSTKIIIILKLDPASTEGAKKA 939 >gi|116630191|ref|YP_815363.1| hypothetical protein LGAS_1630 [Lactobacillus gasseri ATCC 33323] gi|282852809|ref|ZP_06262150.1| MMPL family protein [Lactobacillus gasseri 224-1] gi|116095773|gb|ABJ60925.1| Predicted membrane protein [Lactobacillus gasseri ATCC 33323] gi|282555917|gb|EFB61538.1| MMPL family protein [Lactobacillus gasseri 224-1] Length = 1246 Score = 39.9 bits (91), Expect = 0.99, Method: Composition-based stats. 
Identities = 47/327 (14%), Positives = 98/327 (29%), Gaps = 24/327 (7%) Query: 2 YFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQP 61 Y A+ + D K+ K + ++ + S K VA + + Sbjct: 712 YAQAMDSAKLNDQQKQAMSVALNQILGKVESASSQK--SQALTSSLKSVAGNIQAAGEAD 769 Query: 62 NYYRGSRTDPHSVGTG-AHLVEGLTSLAPYIAGAALAG-KLLSFIPTPLTRLA-GLALQS 118 S + + ++ + +L + A A K L T L +L+ GL Sbjct: 770 KKLGQSASSVGATLQNLQGMMSQVATLKQEVNTLANASNKALPGATTALNQLSSGLTQVQ 829 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAK 178 + +A GA A + + +++ + ++ + + + LA GA + + Sbjct: 830 SAVAQGAAGASRLNDGAARLNNGAGQLATGLQAGVSGSSQLANGAGQLANGAGQLNTGLQ 889 Query: 179 TVASGA-----VLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFG 233 SG ++ G + S + G +A + G G Sbjct: 890 AGLSGTNQLANGIDQLNGGAGQLASGAGQLNGGSGQLAN------GIGQLNGGASQLANG 943 Query: 234 GMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSL 293 ++ + G+ E Y G+ S+ G A H+ T +D+ Sbjct: 944 AGQIASNNPKITAGIDKVNSGLGEGQEYLTGLADSAAGKTFYIPAKMIHSSTFKPALDNY 1003 Query: 294 VRGEY--------PHFDQEKLQTIADN 312 + + D + Sbjct: 1004 MSSDLKSTKIIIILKLDPASTEGAKKA 1030 >gi|9453839|dbj|BAB03273.1| myosin [Chara corallina] Length = 2182 Score = 39.9 bits (91), Expect = 1.1, Method: Composition-based stats. 
Identities = 30/149 (20%), Positives = 51/149 (34%), Gaps = 9/149 (6%) Query: 235 MHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLV 294 + V + + + P GV + PG + H + + SL Sbjct: 1638 IGRAAVTRIKPTPEPVITTSYPDEQPATPGV--TGPGTPSRPLGRSQHIRSESSDFTSLY 1695 Query: 295 RGEYPHF------DQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEH 348 E D EK + + D P P +PE +P+ Q K K + Sbjct: 1696 FREDSPVPEAKPVDHEKSKMMPDKLQYLPEDSP-VPEAKPVDQKKSKMMPDKLQYLPEDS 1754 Query: 349 PHPKRKEVERELSEIEGAKKESSARKFFD 377 P P+ K V+++ S++ K +S D Sbjct: 1755 PVPEAKPVDQKKSKMMPDKLQSDQEALLD 1783 >gi|170731993|ref|YP_001763940.1| outer membrane autotransporter [Burkholderia cenocepacia MC0-3] gi|169815235|gb|ACA89818.1| outer membrane autotransporter barrel domain protein [Burkholderia cenocepacia MC0-3] Length = 1763 Score = 39.5 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 34/283 (12%), Positives = 73/283 (25%), Gaps = 17/283 (6%) Query: 32 TGLGKEVINMPARSLDKLVA--PFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAP 89 G +++ A L A P + + + +E +++ Sbjct: 842 AGATAGIVDGQAHDLAGAAAGAPVATTLTNHAAVTSSTAGVTGFIAQNLGTLENRSTVL- 900 Query: 90 YIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKET 149 + GA G + + T GAL S ++ + + Sbjct: 901 -LTGAGSTGVVAGTLGTVNNAST----IRVSDGTGALVQGASATLANAGSIEADDGIAGV 955 Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 A + + + A + + SG + + G S + + G Sbjct: 956 RLTGAGASVALSGAGTVVANGSADGVLIDSTVSGGGIAAGPTSIAVGGSGSGIRNLG--- 1012 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKS-- 267 A + T G G + ++ ++ T+ L Sbjct: 1013 -ANATIALSGTQVATTG--NGAAGLASTGAGARIATDAATVVRTAGTDALGLSVSGADST 1069 Query: 268 -SSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTI 309 ++ G + AH + G +L+ G I Sbjct: 1070 LTANGTTVATTGANAHAIVMDGGATALLSGAKISASGAAADGI 1112 >gi|302657623|ref|XP_003020530.1| GRAM domain protein [Trichophyton verrucosum HKI 0517] gi|291184371|gb|EFE39912.1| GRAM domain protein [Trichophyton verrucosum HKI 0517] Length = 1254 Score = 39.5 bits (90), 
Expect = 1.2, Method: Composition-based stats. Identities = 42/262 (16%), Positives = 80/262 (30%), Gaps = 18/262 (6%) Query: 25 SPDIKWHTGLGKEVINMPARSLDKLVAPFREETH-----DQPNYYRGSRTDPHSVGTGAH 79 S ++ G +++ + L + + + N G D Sbjct: 394 SSTSQFGAGFFSSMVSAAQNAATTLSSSLNPQAKGSKTSQEQNNTEGDTRDSGEQEKSGA 453 Query: 80 LVEGLTSLAP-------YIAGAALAGKLLSFIPTP-LTRLAGLALQSAPLAAGALYAYLS 131 G ++AP + S + L + AG S AG A + Sbjct: 454 TPGGEENVAPQNGKKELAVNTLGTGDLDFSHLGLEHLEKAAGDGEGSKLDVAGRPRAKTA 513 Query: 132 HKAESSIHHQIEGVDKETADALAWREAIVH---TSALLAPGAIASQSIAKTVASGAVLNV 188 + ++E V A ++A+ V T + + + V A N Sbjct: 514 VSQRDELAARMEDVRAARAVSMAYGNTPVTPIVTVDSINTDNQPANPLNTVVRDNAGENT 573 Query: 189 PFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLV 248 P G +++ L+ +G + R + T+ IGA G + +N S+ + Sbjct: 574 PPGGSVHSETAESLKQNGSLKSRRARRDRGSSAATTNTTIGAPIGT--NLTARNTSVPRL 631 Query: 249 NDLKEGITERLPYKHGVKSSSP 270 +R H + S P Sbjct: 632 TGFAVASKKRNRDFHSLFRSVP 653 >gi|330930785|ref|XP_003303149.1| hypothetical protein PTT_15249 [Pyrenophora teres f. teres 0-1] gi|311321027|gb|EFQ88755.1| hypothetical protein PTT_15249 [Pyrenophora teres f. teres 0-1] Length = 1543 Score = 39.5 bits (90), Expect = 1.3, Method: Composition-based stats. 
Identities = 43/303 (14%), Positives = 91/303 (30%), Gaps = 36/303 (11%) Query: 103 FIPTPLTRLAGLALQSAPLAAGALYAYLSHKAES----------SIHHQIEGVDKETADA 152 + L ++ + P AL+A L ++S S ++ +G+ + Sbjct: 195 LFGASGSSLLNTSIATNPYGNDALFAGLQTPSQSPGPIATPLSNSQKNKSKGILPQHKLN 254 Query: 153 LAWREAIVHTSALLAPGAI---ASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 + ++ + L + S+A + L G + R V + + Sbjct: 255 PSASTRLITPQSKLGGYGFSYSGASSLAGSTGFNGSL-FANGHLSRSLGKSVSTSNLRNN 313 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMHSKQV---QNMSLRLVNDLKEGITERLPYKHGVK 266 I + T G FG K++ +N++ E Sbjct: 314 FTPDTSILSPGAFTT---TGRNFGNGSLKRLNINRNINGGRTPLFDE-------PSQKRV 363 Query: 267 SSSPGLHTSFDAYEAHT--------DTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPH 318 S +PG + + D D+ + GE P +++ ++DP Sbjct: 364 SFAPGEDVNGETNGETALVVRRDEDDASPRAADNQINGESPRPAMQQVNGTEIVRVDDPA 423 Query: 319 FKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDE 378 P + ++ + S P E K+ + + + + K F+ Sbjct: 424 LAPKSASSVNIQPGQDPKPGEYWSSPSFEQLKRMSKQELKSVPDFVVGRHNIGQIK-FNH 482 Query: 379 GSP 381 G P Sbjct: 483 GKP 485 >gi|145246284|ref|XP_001395391.1| hypothetical protein ANI_1_1632104 [Aspergillus niger CBS 513.88] gi|134080106|emb|CAK46087.1| unnamed protein product [Aspergillus niger] Length = 476 Score = 39.5 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 8/84 (9%) Query: 319 FKPHLPEPEPLPQYKEHSDRQKPSEP-LAEHPHPKRKEVERELSEIEGAKKESSARKFFD 377 F P P+ + + ++PS+P + P P+ +L++ E A + AR+ Sbjct: 326 FTPEPASPKTQLKSELEPKPKEPSKPATSPKPVPETTAHPEKLTQPEKATQPEKARQ--- 382 Query: 378 EGSPDHSPF-KGERNQKLDPMRGA 400 P+ PF K E + K +P GA Sbjct: 383 ---PEKVPFDKPEPSPKFNPRSGA 403 >gi|326430011|gb|EGD75581.1| SMC2 protein [Salpingoeca sp. ATCC 50818] Length = 1212 Score = 39.5 bits (90), Expect = 1.4, Method: Composition-based stats. 
Identities = 37/196 (18%), Positives = 71/196 (36%), Gaps = 25/196 (12%) Query: 252 KEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIAD 311 K + + + ++ + + + A E+ L D + E + +D Sbjct: 332 KAHLAQIEETQKSLEDKAGEIDAARAAVESGQTALQAAQDGVAESEKRCM-AASVGASSD 390 Query: 312 NTLEDPHFKPHLPEPE--------PLPQYK---EHSDRQ-KPSEPLAEHPHPKRKEVERE 359 T F + E + + Q + H+ + K +P A+ + K ++R+ Sbjct: 391 GT--SLTFAEQIKELQSVISTASTQMKQAEMTISHATSELKTKKPNAKKSESEYKRLQRD 448 Query: 360 LSEIE---GAKKESSARKFFDEGSPDHSPFKGERNQKLDP--MRGADFTDAPHAKFDATT 414 ++ +E A +E A+ FDEG E+ Q LD + D D A+ T Sbjct: 449 VNALETDLKAIEEHVAKLAFDEGEEAK---LHEQKQALDREYLAAKDQVDTLSARLSRLT 505 Query: 415 F--TESLPHVDEQTMH 428 F + P D +H Sbjct: 506 FEYKDPEPGFDRSQVH 521 >gi|283781747|ref|YP_003372502.1| protein-export membrane protein SecD [Pirellula staleyi DSM 6068] gi|283440200|gb|ADB18642.1| protein-export membrane protein SecD [Pirellula staleyi DSM 6068] Length = 1192 Score = 39.5 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 28/132 (21%), Positives = 41/132 (31%), Gaps = 18/132 (13%) Query: 264 GVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHL 323 VK ++P E GE P + E P P Sbjct: 709 EVKEATPP-PVDTRTPETTPPATEKP------GEVPAEVPAEKPAETPAA-EKPAEAPKA 760 Query: 324 PEP-EPLPQYKEHSDRQKP--------SEPLAEHPHPKRKEVERELSEIEGAKKESSA-R 373 EP P+ ++ + P +P E P + K E EG+ +E +A Sbjct: 761 EEPPAEAPKAEDKPAEEAPKTEEKPADEKPAEEKPAEEAKPAESTEPAAEGSCQEPAADD 820 Query: 374 KFFDEGSPDHSP 385 K DE P+ P Sbjct: 821 KPADEAKPEEKP 832 >gi|238852927|ref|ZP_04643328.1| membrane protein [Lactobacillus gasseri 202-4] gi|238834463|gb|EEQ26699.1| membrane protein [Lactobacillus gasseri 202-4] Length = 1045 Score = 39.5 bits (90), Expect = 1.4, Method: Composition-based stats. 
Identities = 47/327 (14%), Positives = 98/327 (29%), Gaps = 24/327 (7%) Query: 2 YFNAVSDEDIRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQP 61 Y A+ + D K+ K + ++ + S K VA + + Sbjct: 511 YAQAMDSAKLNDQQKQAMSVALNQILGKVESASSQK--SQALTSSLKSVAGNIQAAGEAD 568 Query: 62 NYYRGSRTDPHSVGTG-AHLVEGLTSLAPYIAGAALAG-KLLSFIPTPLTRLA-GLALQS 118 S + + ++ + +L + A A K L T L +L+ GL Sbjct: 569 KKLGQSASSVGATLQNLQGMMSQVATLKQEVNTLANASNKALPGATTALNQLSSGLTQVQ 628 Query: 119 APLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAK 178 + +A GA A + + +++ + ++ + + + LA GA + + Sbjct: 629 SAVAQGAAGASRLNDGAARLNNGAGQLATGLQAGVSGSSQLANGAGQLANGAGQLNTGLQ 688 Query: 179 TVASGA-----VLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFG 233 SG ++ G + S + G +A + G G Sbjct: 689 AGLSGTNQLANGIDQLNGGAGQLASGAGQLNGGSGQLAN------GIGQLNGGASQLANG 742 Query: 234 GMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSL 293 ++ + G+ E Y G+ S+ G A H+ T +D+ Sbjct: 743 AGQIASNNPKITAGIDKVNSGLGEGQEYLTGLADSAAGKTFYIPAKMIHSSTFKPALDNY 802 Query: 294 VRGEY--------PHFDQEKLQTIADN 312 + + D + Sbjct: 803 MSSDLKSTKIIIILKLDPASTEGAKKA 829 >gi|297727273|ref|NP_001176000.1| Os09g0572550 [Oryza sativa Japonica Group] gi|255679157|dbj|BAH94728.1| Os09g0572550 [Oryza sativa Japonica Group] Length = 354 Score = 39.5 bits (90), Expect = 1.4, Method: Composition-based stats. 
Identities = 51/297 (17%), Positives = 85/297 (28%), Gaps = 22/297 (7%) Query: 75 GTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKA 134 G G LVE L + G G+ L + G L L A + + Sbjct: 56 GDGEGLVEALGGVGAAEQGVGHVGEELLVL-----EARGGPLDEVLLIVRAGHVDGAAAG 110 Query: 135 ESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVE 194 + E VD LA + A+ A A + A V E Sbjct: 111 DDLEEDDAEAVDVGAGGELAGESVLGGAVAVGAHDAGGDVGLVADGADLGEAEVG----E 166 Query: 195 RGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGM--HSKQVQNMSLRLVNDLK 252 G V ED G ++A + D + ++ A G + S +K Sbjct: 167 AGLEGGVEEDVGGLEVA----VDDGGTSCVVQVLKAAGGALRDAHPSGPVQSRGARGQVK 222 Query: 253 EGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTL--AHGVDSLVRGEYP-HFDQEKLQTI 309 + + + H + + + H + + E P +Q + Sbjct: 223 QVVLQGA-AGHVLVHQDAVVAVGAVPQQRHQVWVLRQQAQHQHLHKELPVPLHPVPVQLL 281 Query: 310 ADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPS---EPLAEHPHPKRKEVERELSEI 363 P P P P + R +P+ LA P+R +L ++ Sbjct: 282 HRRLRHRPLDAQPPPVHRPEPSLPQQRLRPEPARRLRQLAVRERPRRHLPLADLQDL 338 >gi|224126103|ref|XP_002319756.1| predicted protein [Populus trichocarpa] gi|222858132|gb|EEE95679.1| predicted protein [Populus trichocarpa] Length = 425 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 48/121 (39%), Gaps = 14/121 (11%) Query: 286 LAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPL 345 + + + EYP+ DQ L ++ P L P PQ ++ +Q PS Sbjct: 1 MELAIMEVQNQEYPNTDQVVL------FIDQPD--SKLKMSSPPPQQEDSKLKQSPSPQQ 52 Query: 346 AEHPHPKRKEVE-RELSEIEGAKKESSARKFFDEGSPDHSPF--KGERNQKLDPMRGADF 402 + PK + + L + +K +S +F + P HS + E Q L+P A Sbjct: 53 PDIKDPKLTQARTKTLRRLNFSKPKS---RFTETNYPPHSKTFPESEEYQPLNPPESATS 109 Query: 403 T 403 T Sbjct: 110 T 110 >gi|255732828|ref|XP_002551337.1| histone deacetylase RPD3 [Candida tropicalis MYA-3404] gi|240131078|gb|EER30639.1| histone deacetylase RPD3 [Candida tropicalis MYA-3404] Length = 615 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. 
Identities = 25/108 (23%), Positives = 46/108 (42%), Gaps = 4/108 (3%) Query: 314 LEDPH-FKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSA 372 ++ P KP +PE + + KP E +E P+ + E E +E +ES Sbjct: 455 IDKPEEAKPEESKPEEAKPEEAKPEEAKPEEAKSEEAKPEESKHE-ETKPVEAKHEESKP 513 Query: 373 RKFF-DEGSPDHSPFKGERNQKLDPMRGADFTD-APHAKFDATTFTES 418 + +E P+ P E + ++ ++ AD + + K + T ES Sbjct: 514 EESKPEESKPEEQPAPVEEPKSIEEVKTADESKPSEETKPETITLEES 561 >gi|242814581|ref|XP_002486396.1| dihydrolipoamide succinyltransferase, putative [Talaromyces stipitatus ATCC 10500] gi|218714735|gb|EED14158.1| dihydrolipoamide succinyltransferase, putative [Talaromyces stipitatus ATCC 10500] Length = 459 Score = 39.1 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 64/199 (32%), Gaps = 18/199 (9%) Query: 248 VNDLKEGITERLPYKH--------GVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYP 299 V ++ E ITE + + T + + LV E Sbjct: 77 VPEMAESITEGTLKQFSKQVGDFVERDEEIATIETDKIDVAVNAPESGTIKELLVNEEDT 136 Query: 300 HFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRK--EVE 357 + + + + + + EP PQ E PS+P + P K V+ Sbjct: 137 VTVGQPIVKLEPGSGDGAAAAEKPKD-EPAPQKTEEKTETAPSKPETKEPAAPSKPEPVQ 195 Query: 358 RELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMR---GADFTDAPHAKFDATT 414 + SE K S + + P + ER K++ MR + + TT Sbjct: 196 EKKSEQPKPKPAESKKTEPEPSKPAQPGSREERRVKMNRMRLRIAERLKQSQNTAASLTT 255 Query: 415 FTESLPHVDEQTMHRFSEL 433 F E VD ++ F +L Sbjct: 256 FNE----VDMSSLMEFRKL 270 >gi|126665680|ref|ZP_01736661.1| Methyl-accepting chemotaxis protein (contains HAMP domain) [Marinobacter sp. ELB17] gi|126629614|gb|EBA00231.1| Methyl-accepting chemotaxis protein (contains HAMP domain) [Marinobacter sp. ELB17] Length = 564 Score = 39.1 bits (89), Expect = 1.6, Method: Composition-based stats. 
Identities = 30/211 (14%), Positives = 59/211 (27%), Gaps = 15/211 (7%) Query: 168 PGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGL 227 P VAS V +G + + KV GY +M + I DG Sbjct: 133 PNGTYLIRELVKVASDGGGYVSYGW-QNEATGKVAPKLGYAEMLPQWNIMIGTGFWVDG- 190 Query: 228 IGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLA 287 +G M SK + ++ + + + + + + +A Sbjct: 191 LGEQVAAMDSKVGDALDNAVIGSVTTSLIALAIIGLFALVVVRSIIRPLKSAASAMNDIA 250 Query: 288 HGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPH--------LPEPEPLPQYKEHSDRQ 339 G L R D + ++ + F L L + + Sbjct: 251 SGDGDLTR----RLDIDGKDELSQLAIAFNSFADQVHGLVEQVLSSTGTLNEASAELSQV 306 Query: 340 KPSEPLA-EHPHPKRKEVERELSEIEGAKKE 369 E + +V ++++ A +E Sbjct: 307 MEESTQGVERQKSESDQVATAMNQMTAAAQE 337 >gi|242814586|ref|XP_002486397.1| dihydrolipoamide succinyltransferase, putative [Talaromyces stipitatus ATCC 10500] gi|218714736|gb|EED14159.1| dihydrolipoamide succinyltransferase, putative [Talaromyces stipitatus ATCC 10500] Length = 427 Score = 38.7 bits (88), Expect = 2.2, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 64/199 (32%), Gaps = 18/199 (9%) Query: 248 VNDLKEGITERLPYKH--------GVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYP 299 V ++ E ITE + + T + + LV E Sbjct: 77 VPEMAESITEGTLKQFSKQVGDFVERDEEIATIETDKIDVAVNAPESGTIKELLVNEEDT 136 Query: 300 HFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRK--EVE 357 + + + + + + EP PQ E PS+P + P K V+ Sbjct: 137 VTVGQPIVKLEPGSGDGAAAAEKPKD-EPAPQKTEEKTETAPSKPETKEPAAPSKPEPVQ 195 Query: 358 RELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMR---GADFTDAPHAKFDATT 414 + SE K S + + P + ER K++ MR + + TT Sbjct: 196 EKKSEQPKPKPAESKKTEPEPSKPAQPGSREERRVKMNRMRLRIAERLKQSQNTAASLTT 255 Query: 415 FTESLPHVDEQTMHRFSEL 433 F E VD ++ F +L Sbjct: 256 FNE----VDMSSLMEFRKL 270 >gi|333026288|ref|ZP_08454352.1| putative oxidoreductase [Streptomyces sp. Tu6071] gi|332746140|gb|EGJ76581.1| putative oxidoreductase [Streptomyces sp. Tu6071] Length = 474 Score = 38.7 bits (88), Expect = 2.2, Method: Composition-based stats. 
Identities = 38/195 (19%), Positives = 59/195 (30%), Gaps = 16/195 (8%) Query: 136 SSIHHQIEGVDKET---ADALAWREAIVHTSALLAP--GAIASQSIAKTVASGAVLNVPF 190 + GV ET A R I +L P + + +A+G + Sbjct: 289 AEDRVAGYGVSVETCAEALTAIARPGIASVQIILNPFRMKPLDEVLPAALAAGVGIIARV 348 Query: 191 GMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVND 250 + S K ED + A +R F+ D G F G+ + Sbjct: 349 PLASGLLSGKYTEDTTFA--ANDHRTFNRHGEAFDQ--GETFAGVDFATGVAAAREFAAL 404 Query: 251 LKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIA 310 EG T + PG+ T A+ GE P D++KL I Sbjct: 405 APEGATPAQTALRWIVQQ-PGVTTVIPGARTPAQARANSA----AGELPPLDEQKLTAIR 459 Query: 311 DNTLEDPHFKPHLPE 325 + L P + + Sbjct: 460 E--LYTREIAPQVAD 472 >gi|257812101|gb|ACV69918.1| PHIST domain containing protein [Plasmodium berghei] Length = 1084 Score = 38.7 bits (88), Expect = 2.2, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 62/182 (34%), Gaps = 21/182 (11%) Query: 235 MHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLV 294 MH Q + V ++ + + + P E ++ L Sbjct: 265 MHFTQWVDYMKYAVQEVDQ--DHEQGTSKQLPDTEPLKPEQKYYIEPSKPEQEENIEPLK 322 Query: 295 RGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEP---------LPQYKEHSDRQKPS--- 342 + + D K + + P K ++ +P P+ +E+ D KP Sbjct: 323 PEQEENVDPLKPEQEENIKPLKPEQKENIKPLKPEQEENIKPLKPEQEENVDPLKPEQEE 382 Query: 343 -----EPLAEHP-HPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDP 396 +P E P + E E + ++ ++E + + E + P K E+ + +DP Sbjct: 383 NVDPLKPEQEENVDPLKPEQEENVDPLK-PEQEENIKPLKPEQKENIKPLKPEQEENVDP 441 Query: 397 MR 398 ++ Sbjct: 442 LK 443 >gi|56964861|ref|YP_176592.1| beta-N-acetylglucosaminidase [Bacillus clausii KSM-K16] gi|56911104|dbj|BAD65631.1| beta-N-acetylglucosaminidase [Bacillus clausii KSM-K16] Length = 1398 Score = 38.7 bits (88), Expect = 2.2, Method: Composition-based stats. 
Identities = 37/171 (21%), Positives = 55/171 (32%), Gaps = 13/171 (7%) Query: 256 TERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLE 315 ER +H + S T + EA+ T+ E E T D E Sbjct: 1061 KERDTDEHAEEPSIDENDTVPHSDEANDQTVEEDDHVADENEQA---SETTDTENDAENE 1117 Query: 316 DPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKF 375 + + P + S ++P EP P E E E ++ ++E Sbjct: 1118 ESNLPASEEAPSEENDSTDESSLEEPQEP---ETDPSTDEQEPE-TDASADEQEPETDAN 1173 Query: 376 FDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFTESLPHVDEQT 426 DE P+ E+ + D D K DA+T E P D T Sbjct: 1174 TDEQEPETDASTDEQEPETDAS-----ADEQEPKTDAST-DEQEPETDAST 1218 >gi|311245483|ref|XP_001925661.2| PREDICTED: ninein isoform 1 [Sus scrofa] Length = 2136 Score = 38.7 bits (88), Expect = 2.4, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 71/189 (37%), Gaps = 23/189 (12%) Query: 243 MSLRLVNDLKEGITERLPYKH------GVKSSSPGLHTSFDAYEAHTDTLA-HGVDSLVR 295 LR+ KE + + + H G K+ +P + T + L+ +D L+ Sbjct: 1790 SDLRMTQQEKEALKQEVMSLHKQLQNAGDKNWAPEVATHPSGFPNQQQRLSWDKLDQLMN 1849 Query: 296 GEYPHF--DQEKLQTIADNT----LEDPHFKPHLPEPEPLPQYKEHSDRQKPSEP----- 344 E + E+LQT+ NT + L LP++++H +P Sbjct: 1850 EEQQLLWQENERLQTVVQNTKAELIHSREKVRQLESNLLLPKHQKHLSSSGTMKPPEQEK 1909 Query: 345 -----LAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRG 399 E +R R++S++ ++E +EG E+ ++ +R Sbjct: 1910 LSLKRECEQVQKERSPTNRKVSQMNSLERELETIHLENEGLKKKQVKLDEQLMEMQHLRS 1969 Query: 400 ADFTDAPHA 408 F+ +P+A Sbjct: 1970 TMFSPSPNA 1978 >gi|311245485|ref|XP_003121856.1| PREDICTED: ninein isoform 2 [Sus scrofa] Length = 2049 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. 
Identities = 36/189 (19%), Positives = 71/189 (37%), Gaps = 23/189 (12%) Query: 243 MSLRLVNDLKEGITERLPYKH------GVKSSSPGLHTSFDAYEAHTDTLA-HGVDSLVR 295 LR+ KE + + + H G K+ +P + T + L+ +D L+ Sbjct: 1790 SDLRMTQQEKEALKQEVMSLHKQLQNAGDKNWAPEVATHPSGFPNQQQRLSWDKLDQLMN 1849 Query: 296 GEYPHF--DQEKLQTIADNT----LEDPHFKPHLPEPEPLPQYKEHSDRQKPSEP----- 344 E + E+LQT+ NT + L LP++++H +P Sbjct: 1850 EEQQLLWQENERLQTVVQNTKAELIHSREKVRQLESNLLLPKHQKHLSSSGTMKPPEQEK 1909 Query: 345 -----LAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRG 399 E +R R++S++ ++E +EG E+ ++ +R Sbjct: 1910 LSLKRECEQVQKERSPTNRKVSQMNSLERELETIHLENEGLKKKQVKLDEQLMEMQHLRS 1969 Query: 400 ADFTDAPHA 408 F+ +P+A Sbjct: 1970 TMFSPSPNA 1978 >gi|256827615|ref|YP_003151574.1| ATP synthase F1 subcomplex alpha subunit [Cryptobacterium curtum DSM 15641] gi|256583758|gb|ACU94892.1| ATP synthase F1 subcomplex alpha subunit [Cryptobacterium curtum DSM 15641] Length = 524 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 61/196 (31%), Gaps = 19/196 (9%) Query: 45 SLDKLVAPFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFI 104 + + Q + S + VGT + +G+ + A+AG+LL F Sbjct: 3 VTEITAQSIDDALRKQLDALDTS-VEAREVGTVIQVGDGIARI--DGLKDAMAGELLEFT 59 Query: 105 PTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSA 164 + + G+A GA+ + + + G E A +V+ Sbjct: 60 GSAGQIVYGMAQNLEEEEVGAVLLGDVTAIKENDQVKTTGRIVEIPVGPAMCGRVVNALG 119 Query: 165 LLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLED------------HGYPDMAQ 212 + G ++ A G++ R S+ ++ G ++ Sbjct: 120 MPIDGKGPIKTTATRPVEFKAP----GVISRQPVSEPMQTGILAVDSMIPIGRGQRELII 175 Query: 213 HYRIFDMESLITDGLI 228 R ++ D +I Sbjct: 176 GDRQTGKTAIAIDAII 191 >gi|38174850|emb|CAD89773.1| MelB protein [Melittangium lichenicola] Length = 1050 Score = 38.3 bits (87), Expect = 2.6, Method: Composition-based stats. 
Identities = 67/404 (16%), Positives = 130/404 (32%), Gaps = 59/404 (14%) Query: 15 IKEWAQRPRVSPDIKW---HTGLGKEV------------INMPARSLDKLVAPFREETHD 59 + ++AQ +PD + ++G G + + PA +D + TH Sbjct: 139 LSDYAQLELNAPDPRGINPYSGSGGVLSMAAGRIAATLGLEGPALVVDTSCSSSLVATHL 198 Query: 60 QPNYYRGSRTDPHSVGTGAHLV---------EGLTSLAPYIAGAALAGKLLSFIPTPLTR 110 R D VG GA+L+ L +L+P A A G ++ + Sbjct: 199 ACQSLRAGECDLALVG-GANLLLSPRMTVYFSKLKALSPDGACKAFDGAANGYVRSEGAG 257 Query: 111 LAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGA 170 + L S +AAG + + + + ++ TA A +E Sbjct: 258 VVVLKRLSDAIAAGDSIFAVVRGSAVNQDGR---TNRLTAPHQAAQE------------- 301 Query: 171 IASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGL-IG 229 + I + + G + G VE + +L D ++ + + + + IG Sbjct: 302 ---RVIERALGQGGIAPHEVGYVEAHGAGSLLADS--VEVKALAAVLGRQRAASAPVGIG 356 Query: 230 AFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHG 289 + + + L+ LP K SP + + E + H Sbjct: 357 SVKTNLGHLEGAAGIASLIKVALALRHRALPRSLHFKDPSPHIPWAELPVEVIS---EHR 413 Query: 290 VDSLVRGEYPHFDQEKLQTIADNTLEDPH---FKPHLPEPEPLPQYKEHSDRQKPSEPLA 346 S+ G Q ++ ++ L + PEP P +R + A Sbjct: 414 PWSVAAG------QRRIAGVSALGLSGTNAHVVLEEAPEPARRPVAPGAEERAELLVLSA 467 Query: 347 EHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGER 390 P + +R +++ + ES+ + + H G R Sbjct: 468 RTPRALSEAAQRLSAQLSSPEAESAGLRALSYSTTCHREHHGHR 511 >gi|271501409|ref|YP_003334434.1| outer membrane autotransporter barrel domain-containing protein [Dickeya dadantii Ech586] gi|270344964|gb|ACZ77729.1| outer membrane autotransporter barrel domain protein [Dickeya dadantii Ech586] Length = 1075 Score = 38.3 bits (87), Expect = 2.8, Method: Composition-based stats. 
Identities = 46/277 (16%), Positives = 82/277 (29%), Gaps = 23/277 (8%) Query: 16 KEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPN----YYRGSRTDP 71 ++W Q + G G +++ A++ +V E D N + G TD Sbjct: 188 EQWVQSGGSTTGTVISAG-GYQLVKNGAQASGTVVNTGAEGGPDAENSDGMFVSGIATDT 246 Query: 72 HSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLS 131 G +V S AG S + Q + AGAL + Sbjct: 247 LIHAGGRQIVAAGGSST---GTTIQAGGDQSVHGQAQSTTLDGGNQY--VHAGALATGTT 301 Query: 132 HKAESSIHHQIEGVDKETADALAWREAIVHTSALL-APGAIASQSIAKTVASGAVLNVPF 190 A Q G T + ++ + T A+ + +N Sbjct: 302 VNAGGWQVVQQSGTADATTVNRDGKLSVSAGGTASNVTLNAGGALVTSTAATVSGIN-SL 360 Query: 191 GMVERGWSSK-----VLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSL 245 G ++ +LE+ G D+ D ++ G++ GG+ V N Sbjct: 361 GGFNVDAATASATNVLLENGGRLDVLSGGSA-DTTTVSNGGVLAVATGGVAQHIVMNEGG 419 Query: 246 RLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAH 282 L+ D ++ ++ G A H Sbjct: 420 VLIADSGSTVSGTNTAGTFGIDAATG-----RASNLH 451 >gi|291543810|emb|CBL16919.1| prepilin-type N-terminal cleavage/methylation domain [Ruminococcus sp. 18P13] Length = 380 Score = 38.3 bits (87), Expect = 2.9, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 39/128 (30%), Gaps = 22/128 (17%) Query: 268 SSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLED---PHFKPHLP 324 S P +F L + P + E++ I ++ H Sbjct: 88 SKPAQPAAFTPVPLTQQQLDVLAQQRAQSGQPPYTPEQIAAIQKAYMDRQVAAHAASQPA 147 Query: 325 EPEPLPQYK----EHSDRQKPSEPLAE---------------HPHPKRKEVERELSEIEG 365 EP P PQ K E S P + E P P+RK +++E Sbjct: 148 EPAPQPQVKAPVLEESTYTPPVKEKHEPQVSAAAAASLLEEPAPEPERKVSRFNEADLEA 207 Query: 366 AKKESSAR 373 AK + R Sbjct: 208 AKANAQKR 215 >gi|258568268|ref|XP_002584878.1| predicted protein [Uncinocarpus reesii 1704] gi|237906324|gb|EEP80725.1| predicted protein [Uncinocarpus reesii 1704] Length = 280 Score = 38.3 bits (87), Expect = 3.0, Method: Composition-based stats. 
Identities = 26/132 (19%), Positives = 50/132 (37%), Gaps = 16/132 (12%) Query: 258 RLPYKHGVKSSSP-GLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLED 316 R+ + + + P G+ + ++ H + E P IA+ ++ Sbjct: 116 RVNFLPALDDAQPNGIASEDESTTLHASPVNAKF------ETPSMPTRAAPVIAEPPIQP 169 Query: 317 PHFKPHLPE----PEPLPQYKEHSDRQ--KPSEPLAEHPHPKRKEVERELSEIEG--AKK 368 P F+P E P P ++ + +PSEP + E ++ +++ A+ Sbjct: 170 PKFQPEPSESTILETPKPVAEDVKSERSTEPSEPKDDLKSQL-DEARVQIQQLKQQVAEN 228 Query: 369 ESSARKFFDEGS 380 E RK EGS Sbjct: 229 ELRRRKVATEGS 240 >gi|156543722|ref|XP_001605809.1| PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] Length = 1174 Score = 38.3 bits (87), Expect = 3.0, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 40/120 (33%), Gaps = 7/120 (5%) Query: 243 MSLRLVNDLKEGITERLPYKHGVKSSSPGL--HTSFDAYEAHT-DTLAHGVDSLVRGEYP 299 S RL+ D + + E ++ + S GL A+ D + + GEY Sbjct: 75 SSKRLLEDDELELFEPTETRYAARDQSAGLDEPIDALTASANQLDQIDGNQIEVKPGEYF 134 Query: 300 ----HFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKE 355 E L A + PH P P EP Q + R +PS + P Sbjct: 135 RNIVPTKSESLSGEARGAPQVPHRHPAGPSSEPREQAPRQARRDQPSARIFSEHAPTAAP 194 >gi|221214518|ref|ZP_03587489.1| outer membrane autotransporter barrel domain protein [Burkholderia multivorans CGD1] gi|221165775|gb|EED98250.1| outer membrane autotransporter barrel domain protein [Burkholderia multivorans CGD1] Length = 1748 Score = 38.3 bits (87), Expect = 3.1, Method: Composition-based stats. 
Identities = 32/246 (13%), Positives = 63/246 (25%), Gaps = 23/246 (9%) Query: 71 PHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYL 130 V ++ ++ + GA G + + T A L Sbjct: 879 TGFVAQQLGTLDNRNTVL--LTGAGSTGVVAGTLGTVNNTSTIRVANGTGARVEGASATL 936 Query: 131 SHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPF 190 ++ + GV T + + + + A + SG + Sbjct: 937 ANAGTIEADDGVAGV-HLTGTGASVA---LSGAGSVVANGSADGVLIDATVSGGGIAAGA 992 Query: 191 GMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVND 250 + G + K +++ G D ++T IG G R+ D Sbjct: 993 TSIAVGGAGKGIDNVG----------TDSTIVLTGTRIGTTGSGADGIHSTGAGARITTD 1042 Query: 251 LKEGITERLPYKHGVKSS-------SPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQ 303 + G+ S + G + AH + G +L+ G Sbjct: 1043 AATVVRTGGDGARGLFVSGAGSTLDATGTTVATAGAGAHAIVVDGGTTALLSGTKLSTTG 1102 Query: 304 EKLQTI 309 I Sbjct: 1103 IAADGI 1108 >gi|238025169|ref|YP_002909401.1| hypothetical protein bglu_2g18400 [Burkholderia glumae BGR1] gi|237879834|gb|ACR32166.1| Hypothetical protein bglu_2g18400 [Burkholderia glumae BGR1] Length = 575 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. 
Identities = 41/221 (18%), Positives = 74/221 (33%), Gaps = 33/221 (14%) Query: 235 MHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLV 294 M K + + E V +P TS H + +++ Sbjct: 34 MDGKVNAHRPAATAPNPGAVADESTGTSQTV---APTQRTSS-----HALSDLPSTSTVL 85 Query: 295 RGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEH-SDRQKPSEPL--AEHPHP 351 E + I ++ E LP P PQ+ + R+ P + P Sbjct: 86 AREEAP--PREAPVIRPSSRERSQTAAPLPTATPSPQHSSAPAPRRLPRATTHASSAQGP 143 Query: 352 KRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFD 411 ++ R ++ + A ESS+ +FD S G+ ++ L M FT+ Sbjct: 144 EQAPDPRTINAADEADTESSSSIYFDASSS-----FGDMDETLRDMDDVSFTN------- 191 Query: 412 ATTFTESLPHVDEQTMHRFSELKERHPVEAREVLEGLQEKL 452 F+ +L +D+ ++ PV+ L LQ+ L Sbjct: 192 ---FSNALDRLDDP-----ESAQQLGPVQQDANLSALQQAL 224 >gi|256786302|ref|ZP_05524733.1| 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase [Streptomyces lividans TK24] Length = 234 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 50/162 (30%), Gaps = 7/162 (4%) Query: 215 RIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHT 274 R ++ + S V ++K + + PG T Sbjct: 20 RALGGTPMLIHAVRAMAASRAVSLVVVVAPPDGAGEVKSLLDAHALPERTDFVVVPGGET 79 Query: 275 SFDAYEAHTDTL--AHGVDSLVRGEYPHFDQEKLQTIADNTLED-PHFKPHLPEPEPLPQ 331 ++ D L +G+ + P + + + D E P P +P + + Q Sbjct: 80 RQESVRLGLDALPPEYGIVLVHDAARPLVPVDTVDAVIDAVREGAPAVVPAVPLADTVKQ 139 Query: 332 YKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSAR 373 + + P EP P+R R + +G + + R Sbjct: 140 VEPAAA---PGEPEPVVATPERAR-LRAVQTPQGFDRATLVR 177 >gi|221110510|ref|XP_002167498.1| PREDICTED: similar to conserved hypothetical protein, partial [Hydra magnipapillata] Length = 2047 Score = 38.0 bits (86), Expect = 3.2, Method: Composition-based stats. 
Identities = 32/139 (23%), Positives = 45/139 (32%), Gaps = 16/139 (11%) Query: 282 HTDTLAHGVDSL-VRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYK------- 333 H L + G H D K + + P H P+P + + Sbjct: 1258 HLKPEKEAEKPLDLNGHPDHPDHLKPDKETEKPDKYPEPHEHPDHPKPDKETEKPNEYPG 1317 Query: 334 --EHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDE----GSPDHSPFK 387 E+ D S+P E P V +L + K E +K D PDHS Sbjct: 1318 HHEYLDHLDHSKPEKETEKPLDHMVHPDLPD--HLKPEKETKKPIDHLGFPDQPDHSKPD 1375 Query: 388 GERNQKLDPMRGADFTDAP 406 E + D M D +D P Sbjct: 1376 RETEKPYDHMGHPDLSDHP 1394 >gi|29830512|ref|NP_825146.1| 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase [Streptomyces avermitilis MA-4680] gi|33516907|sp|Q82GC8|ISPD_STRAW RecName: Full=2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; AltName: Full=4-diphosphocytidyl-2C-methyl-D-erythritol synthase; AltName: Full=MEP cytidylyltransferase; Short=MCT gi|29607624|dbj|BAC71681.1| putative 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase [Streptomyces avermitilis MA-4680] Length = 250 Score = 38.0 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 54/162 (33%), Gaps = 7/162 (4%) Query: 215 RIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHT 274 R + ++ + S V ++K + + PG + Sbjct: 36 RALNGTPMLIHAVRAMAASRAVSLVVVVAPPDGTAEVKSLLDAHALPERTDFVVVPGGES 95 Query: 275 SFDAYEAHTDTLAHGVDSLV--RGEYPHFDQEKLQTIADNTLED-PHFKPHLPEPEPLPQ 331 ++ + D L G+D ++ P + + + + + P P LP + + Q Sbjct: 96 RQESVKLGLDALPPGIDIVLVHDAARPLVPVDTVDAVIEAVRDGAPAVVPALPLADTVKQ 155 Query: 332 YKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSAR 373 + + P EP P+R R + +G +++ R Sbjct: 156 VEPAAV---PGEPEPVVATPERAR-LRAVQTPQGFDRDTLVR 193 >gi|123455460|ref|XP_001315474.1| hypothetical protein [Trichomonas vaginalis G3] gi|121898152|gb|EAY03251.1| hypothetical protein TVAG_299130 [Trichomonas vaginalis G3] Length = 450 Score = 38.0 bits (86), Expect = 3.4, Method: Composition-based stats. 
Identities = 17/84 (20%), Positives = 26/84 (30%), Gaps = 5/84 (5%) Query: 274 TSFDAYEAHTDTLAHGVDSLVRGE-YPHFDQEKLQTIADNTLE----DPHFKPHLPEPEP 328 T YE H V+ + D + P F+ + E Sbjct: 222 TPIATYELHMREFEGKTTQEVKIPIKFNNDSTSFYVTFTAQMRQPFAKPEFQSKIVEIHS 281 Query: 329 LPQYKEHSDRQKPSEPLAEHPHPK 352 +P +QKP+ P AE P + Sbjct: 282 IPNGTAVEQQQKPAAPQAESPKQE 305 >gi|78223058|ref|YP_384805.1| pentapeptide repeat-containing protein [Geobacter metallireducens GS-15] gi|78194313|gb|ABB32080.1| pentapeptide repeat protein [Geobacter metallireducens GS-15] Length = 551 Score = 38.0 bits (86), Expect = 3.7, Method: Composition-based stats. Identities = 38/286 (13%), Positives = 75/286 (26%), Gaps = 37/286 (12%) Query: 105 PTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSA 164 T L+ G + + G+ ++ ++ G + + Sbjct: 141 DTGLSSGTGGSTNTGSTDTGSTDTGSTNTGSTNTGSTNTGSTNTGSTNTGSTNTGSTNTG 200 Query: 165 LLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLIT 224 G+ + S + E K D G D R+ Sbjct: 201 STNTGSTNTGSTNTGSTDTGSTDTGSTSTENYPD-KSSSDTGPADKGSSGRVSS------ 253 Query: 225 DGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTD 284 G+ ++ + +G ++ G S +S D + Sbjct: 254 ---------GLAPSVTSSVDK---GSVDKGSVDKGSADKGSADKSSADKSSADTQGSRIP 301 Query: 285 TLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDR--QKPS 342 +A G L + ++P + +P P+ + D+ Q P+ Sbjct: 302 FVAQGTIILPDAPQSPLEPTSPDA------QEPALGDQPAQWQPSPEAADREDKEHQPPT 355 Query: 343 EPLAEHPHPKR----------KEVERELSEIEGAKKESSARKFFDE 378 P++ H P K V G + + R FDE Sbjct: 356 APISPHTVPTTSVQEQKYAVFKGVLERFKGYSGDRTPKALRALFDE 401 >gi|221054265|ref|XP_002261880.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] gi|193808340|emb|CAQ39044.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] Length = 703 Score = 37.6 bits (85), Expect = 4.1, Method: Composition-based stats. 
Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 4/65 (6%) Query: 297 EYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSE----PLAEHPHPK 352 E P +QE ++ I +EDP + E +P ++ Q+P E P E P + Sbjct: 332 EDPEKEQEPVEVIQMPAIEDPEKEQEPVEVIQMPAIEDPEKEQEPVEVIQMPAIEDPEKE 391 Query: 353 RKEVE 357 ++ VE Sbjct: 392 QEPVE 396 >gi|46136393|ref|XP_389888.1| hypothetical protein FG09712.1 [Gibberella zeae PH-1] Length = 479 Score = 37.6 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 62/205 (30%), Gaps = 18/205 (8%) Query: 11 IRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTD 70 I D I + + P PD + + K I P + + P P G+ D Sbjct: 138 ITDAIDDGSATPAGEPDDDFFSSWDKPAIKRPTPPVSRTGTPPVVGRTPSPFLNSGNGKD 197 Query: 71 PHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAG---LALQSAPLAAGALY 127 A + ++ A L P A ++ L A + Sbjct: 198 I---ARTASPLSRTSTGENKPASRITTSAALRKTPASTGPRKANVLGAKKTTKLGAKKVT 254 Query: 128 AYLSHKAESSIHHQIE-------GVDKETADALAWREAIVHTS----ALLAPGAIASQSI 176 A + E+ + E G D + + A + + + +AP ++ S Sbjct: 255 ADIIDFDEAERKAKEEADRIAKLGYDPDAEEDPATKNSGSAAAIISPTPVAPSRGSASSH 314 Query: 177 AKTVASGAVLNVPFGMVERGWSSKV 201 + + V + GM R +V Sbjct: 315 TRQKSDAEVERLGMGM-NRLGFGQV 338 >gi|207092432|ref|ZP_03240219.1| hypothetical protein HpylHP_05773 [Helicobacter pylori HPKX_438_AG0C1] Length = 870 Score = 37.6 bits (85), Expect = 4.3, Method: Composition-based stats. 
Identities = 35/234 (14%), Positives = 71/234 (30%), Gaps = 14/234 (5%) Query: 182 SGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQ 241 V+ + +++G + Y+ F SL D + Sbjct: 128 YAVVVEQAINKKNELALKTMYKNNGSYKNNEVYKEFSSTSLDADAKVCHRLSSYSGATEN 187 Query: 242 NMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHF 301 N L + + L + ++P +A+ + LA + E Sbjct: 188 NTPKPLTDQ-----EDLLKTSENLNETTPKPTNLSPLEQANAEKLAKLQREQEQSEQEFL 242 Query: 302 DQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELS 361 ++ + L+ Q K + P++ A+ P + + ERE+ Sbjct: 243 KAKEQENKRKEALKKKLEHERGNAGNIESQTKIEVGKDIPTKTQAQLPKSRVRLNEREIY 302 Query: 362 EIEGA--KKESSARKFFDEGSPDHSPFKGER----NQKLDPMR---GADFTDAP 406 +++ A K + F G+ + E+ Q DP + F D P Sbjct: 303 DLDYAIVKAKDLKPSFTTGGTQKRTDMNEEQIKSIAQNFDPKKIFGSGGFEDLP 356 >gi|270004992|gb|EFA01440.1| hypothetical protein TcasGA2_TC030701 [Tribolium castaneum] Length = 18024 Score = 37.6 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 22/100 (22%), Positives = 43/100 (43%), Gaps = 2/100 (2%) Query: 292 SLVRGEYPHFDQEKLQTIADN-TLEDPHFK-PHLPEPEPLPQYKEHSDRQKPSEPLAEHP 349 L E +Q +++ L P + P + +P+P+ +E S ++P +P + Sbjct: 6670 DLKPAEAVPEEQPEVRQWRRGKQLPKPEEEQPEIVSLKPIPRKQEVSQPEQPEQPEVKEQ 6729 Query: 350 HPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGE 389 K +++ R+ K+E + F E P+ P K E Sbjct: 6730 KIKTQKMIRKSKSHVLPKEEEEGTELFPEEKPELVPTKLE 6769 >gi|88501749|ref|NP_001034245.1| TRIO and F-actin-binding protein isoform 3 [Mus musculus] gi|90110076|sp|Q99KW3|TARA_MOUSE RecName: Full=TRIO and F-actin-binding protein; AltName: Full=Protein Tara; AltName: Full=Trio-associated repeat on actin gi|81176573|gb|ABB59556.1| TRIOBP isoform 3 [Mus musculus] gi|151358007|emb|CAO78087.1| TRIO and F-actin binding protein [Mus musculus] Length = 2014 Score = 37.6 bits (85), Expect = 4.9, Method: Composition-based stats. 
Identities = 36/173 (20%), Positives = 54/173 (31%), Gaps = 16/173 (9%) Query: 239 QVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEY 298 Q Q + ERL + KS +PG + D E + + L R Sbjct: 954 QAQGSNEGRTRSPGRAEVERLFGQERRKSEAPGAFQTRD--EGRSQRPSQAQSQLRRQSS 1011 Query: 299 PHFD------QEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPK 352 P K + P PH P+ P+ R P P + Sbjct: 1012 PAPSRQVTKPSAKQAEPTRQSRTGP---PHPKSPDKRPEGDRQLQRTSPPARTPARPPER 1068 Query: 353 RKEVERELSEIEGAKKES-----SARKFFDEGSPDHSPFKGERNQKLDPMRGA 400 + ++ER L ++S S + SP+ P K +QK P G Sbjct: 1069 KAQIERHLESGHTGPRQSLGGWQSQERLSGPQSPNRHPEKSWGSQKEGPSLGG 1121 >gi|189235987|ref|XP_971849.2| PREDICTED: similar to BMKETTIN [Tribolium castaneum] Length = 20466 Score = 37.6 bits (85), Expect = 4.9, Method: Composition-based stats. Identities = 22/100 (22%), Positives = 43/100 (43%), Gaps = 2/100 (2%) Query: 292 SLVRGEYPHFDQEKLQTIADN-TLEDPHFK-PHLPEPEPLPQYKEHSDRQKPSEPLAEHP 349 L E +Q +++ L P + P + +P+P+ +E S ++P +P + Sbjct: 6963 DLKPAEAVPEEQPEVRQWRRGKQLPKPEEEQPEIVSLKPIPRKQEVSQPEQPEQPEVKEQ 7022 Query: 350 HPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKGE 389 K +++ R+ K+E + F E P+ P K E Sbjct: 7023 KIKTQKMIRKSKSHVLPKEEEEGTELFPEEKPELVPTKLE 7062 >gi|167533281|ref|XP_001748320.1| hypothetical protein [Monosiga brevicollis MX1] gi|163773132|gb|EDQ86775.1| predicted protein [Monosiga brevicollis MX1] Length = 305 Score = 37.6 bits (85), Expect = 4.9, Method: Composition-based stats. 
Identities = 37/180 (20%), Positives = 69/180 (38%), Gaps = 33/180 (18%) Query: 297 EYPHFDQ--------EKLQTIADNTLEDPHF-KPHLPEPEPLPQYKEHSDRQKPSEPLAE 347 E +FD +K+ D+ + + P + + +E D KP++ Sbjct: 4 EDLNFDPTLKKKKKKKKILASFDDEVTESSTDAPEPVSAAAIDENEEEDDLPKPTKVAVV 63 Query: 348 HPHPKRKEVEREL------SEIEGAKKESSARKFFDE-GSPDHSPFKGERNQKLDPMRGA 400 H K V EL ++I+ + + A++ D PD F ++ +K + Sbjct: 64 HSEDLDKPVSEELVTMLISNDIDYSALKKRAKRSTDALEEPDSLDFSKKKKKKKKTAKAV 123 Query: 401 DFTDAPHAKFDATTFTESLPHVDEQTMHRFSEL--------KERHPVEAREVLEGLQEKL 452 D A + T + DE T+H +S L KE++P E++ G ++K Sbjct: 124 D-----QAPLEGDAVTSTADDGDEGTVHPYSVLLDRVFAIIKEKNP----ELISGEKKKF 174 >gi|182437805|ref|YP_001825524.1| 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase [Streptomyces griseus subsp. griseus NBRC 13350] gi|326778440|ref|ZP_08237705.1| 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase [Streptomyces cf. griseus XylebKG-1] gi|178466321|dbj|BAG20841.1| putative 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase [Streptomyces griseus subsp. griseus NBRC 13350] gi|326658773|gb|EGE43619.1| 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase [Streptomyces cf. griseus XylebKG-1] Length = 255 Score = 37.6 bits (85), Expect = 5.0, Method: Composition-based stats. 
Identities = 24/162 (14%), Positives = 51/162 (31%), Gaps = 7/162 (4%) Query: 215 RIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHT 274 R ++ + S V ++K + E + PG T Sbjct: 38 RALGGTPMLIHAIRAMAASRAVSLVVVVAPPDGAPEVKHLLDEHALPERTDYLVVPGGET 97 Query: 275 SFDAYEAHTDTLAHGVDSLV--RGEYPHFDQEKLQTIADNTLED-PHFKPHLPEPEPLPQ 331 ++ D L + +++ P + + +A + P P LP + + + Sbjct: 98 RQESVRLGLDALPEDISAVLVHDAARPLVPVDTVDAVASAVRDGAPAVVPALPLADTVKE 157 Query: 332 YKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGAKKESSAR 373 + +P LA P R R + +G +++ R Sbjct: 158 VEPAGTPGEPEPVLA---TPVRAR-LRAVQTPQGFDRDTLVR 195 >gi|25153045|ref|NP_500704.2| Sperm-Specific family, class Q family member (ssq-4) [Caenorhabditis elegans] gi|20451260|gb|AAB04604.3| Sperm-specific family, class q protein 4 [Caenorhabditis elegans] Length = 373 Score = 37.6 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 33/116 (28%), Gaps = 2/116 (1%) Query: 88 APYIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAE-SSIHHQIEGVD 146 AP + + + G A A+ S + ++I G Sbjct: 99 APAGGSSTMTAVGGAPRGASTMTAVGGAPVGGSSTMTAVGGAPSGASTMTAIGGAPRGAS 158 Query: 147 KETADALAWREAIVH-TSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKV 201 TA A T+ AP ++ + SGA G RG S+ Sbjct: 159 TMTAVGGAPMGGGSTMTAVGGAPSGASTMTAVGGAPSGASTMTAIGGAPRGASTMT 214 >gi|261252352|ref|ZP_05944925.1| AAA ATPase [Vibrio orientalis CIP 102891] gi|260935743|gb|EEX91732.1| AAA ATPase [Vibrio orientalis CIP 102891] Length = 1685 Score = 37.2 bits (84), Expect = 5.4, Method: Composition-based stats. 
Identities = 36/142 (25%), Positives = 58/142 (40%), Gaps = 14/142 (9%) Query: 286 LAHGVDSLVRGEYP-----HFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQK 340 L +D L+ E P D L + D+ LE+ + L ++E D +K Sbjct: 636 LEAELDDLIGAEQPEPIELGDDAGLLDEVVDSQLENAETAELGDDSTDL--FEELLDIEK 693 Query: 341 PSEPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDE---GSPDHSPFKGERNQKLDPM 397 S AE P+ + + E ++ + K+ S+ F D+ +P+ P E N LD Sbjct: 694 QSTEQAELETPQPEPISEEALDLADSDKDFSSEDFIDDMLSAAPEADPLLEEIN--LD-- 749 Query: 398 RGADFTDAPHAKFDATTFTESL 419 G D P A D + ES+ Sbjct: 750 EGDDVELEPTANLDIDSLEESI 771 >gi|254509681|ref|ZP_05121748.1| cell division protein FtsZ [Rhodobacteraceae bacterium KLH11] gi|221533392|gb|EEE36380.1| cell division protein FtsZ [Rhodobacteraceae bacterium KLH11] Length = 528 Score = 37.2 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 23/103 (22%), Positives = 39/103 (37%), Gaps = 7/103 (6%) Query: 283 TDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPS 342 TL ++ + E H D + D+ L P ++P + + EP P+ S PS Sbjct: 361 APTLFESIEDVELNEGWHEDSQPAAEQEDDGLPPPAYQPQVAQFEPQPEEPAES-YAAPS 419 Query: 343 EPLAEHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSP 385 P P P + ++ A ++ A + G P P Sbjct: 420 APTPGTPSPAA------MQRLQAAVQKVPASQRRMGGEPPREP 456 >gi|88501743|ref|NP_613045.3| TRIO and F-actin-binding protein isoform 5 [Mus musculus] gi|84798608|gb|ABB59557.2| TRIOBP isoform 5 [Mus musculus] gi|151358006|emb|CAO78086.1| TRIO and F-actin binding protein [Mus musculus] Length = 1968 Score = 37.2 bits (84), Expect = 5.6, Method: Composition-based stats. 
Identities = 36/173 (20%), Positives = 54/173 (31%), Gaps = 16/173 (9%) Query: 239 QVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEY 298 Q Q + ERL + KS +PG + D E + + L R Sbjct: 954 QAQGSNEGRTRSPGRAEVERLFGQERRKSEAPGAFQTRD--EGRSQRPSQAQSQLRRQSS 1011 Query: 299 PHFD------QEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPK 352 P K + P PH P+ P+ R P P + Sbjct: 1012 PAPSRQVTKPSAKQAEPTRQSRTGP---PHPKSPDKRPEGDRQLQRTSPPARTPARPPER 1068 Query: 353 RKEVERELSEIEGAKKES-----SARKFFDEGSPDHSPFKGERNQKLDPMRGA 400 + ++ER L ++S S + SP+ P K +QK P G Sbjct: 1069 KAQIERHLESGHTGPRQSLGGWQSQERLSGPQSPNRHPEKSWGSQKEGPSLGG 1121 >gi|158294679|ref|XP_315753.4| AGAP005739-PA [Anopheles gambiae str. PEST] gi|157015677|gb|EAA10955.4| AGAP005739-PA [Anopheles gambiae str. PEST] Length = 1799 Score = 37.2 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 47/120 (39%), Gaps = 12/120 (10%) Query: 298 YPHFDQEKLQTIADNTLEDPHFK-PHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEV 356 P KL+ + L+ P FK P + P + + S ++P P P ++ Sbjct: 1085 KPGSKLGKLKNMHMPKLQKPDFKRPEFTKKMPKLKAPDMSKFKRPEMPKFLTEKPDFSKM 1144 Query: 357 ERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFT 416 + + ++I+ A+ +S E SP + P + DAP K + T F+ Sbjct: 1145 KSDFAKIKLARSKS-----MKEASPSGA------TSAASPSDASMMGDAPTTKVNYTDFS 1193 >gi|294654840|ref|XP_002770038.1| DEHA2A13618p [Debaryomyces hansenii CBS767] gi|199429189|emb|CAR65414.1| DEHA2A13618p [Debaryomyces hansenii] Length = 1749 Score = 37.2 bits (84), Expect = 5.8, Method: Composition-based stats. 
Identities = 23/137 (16%), Positives = 40/137 (29%), Gaps = 6/137 (4%) Query: 132 HKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFG 191 E+ G E D E ++ + + A N+ Sbjct: 1408 SFEETVEILLEAGSAAELDDCRGISENVMLGQMAPLGTGAFDVMVDDKMLQTAPSNIAVT 1467 Query: 192 MVERGWSSKVLEDHGYPDMAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDL 251 + + +D G A YR ++M+ GA F +H+ V + S L + Sbjct: 1468 TAAGNETGEYADDGG----ATPYRDYEMQDDKIQFEEGAGFSPIHTAPVSDGSGALTS-- 1521 Query: 252 KEGITERLPYKHGVKSS 268 G T P + Sbjct: 1522 YGGATSPSPTSPFSYGA 1538 >gi|107021750|ref|YP_620077.1| Outer membrane autotransporter barrel [Burkholderia cenocepacia AU 1054] gi|116688696|ref|YP_834319.1| outer membrane autotransporter [Burkholderia cenocepacia HI2424] gi|105891939|gb|ABF75104.1| Outer membrane autotransporter barrel [Burkholderia cenocepacia AU 1054] gi|116646785|gb|ABK07426.1| outer membrane autotransporter barrel domain protein [Burkholderia cenocepacia HI2424] Length = 1762 Score = 37.2 bits (84), Expect = 6.5, Method: Composition-based stats. 
Identities = 33/283 (11%), Positives = 72/283 (25%), Gaps = 17/283 (6%) Query: 32 TGLGKEVINMPARSLDKLVA--PFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAP 89 G +++ A L A P + + + +E +++ Sbjct: 842 AGATAGIVDGQAHDLAGAAAGAPVATTLTNHAAVTSSTAGVTGFIAQNLGTLENRSTVL- 900 Query: 90 YIAGAALAGKLLSFIPTPLTRLAGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKET 149 + GA G + + T GAL S ++ + + Sbjct: 901 -LTGAGSTGVVAGTLGTVNNAST----IRVSNGTGALVQGASATLANAGSIEADDGVAGV 955 Query: 150 ADALAWREAIVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPD 209 A + + + A + + +G + + G S + + G Sbjct: 956 RLTGAGASVALSGAGTVIANGSADGVLIDSTVTGGGIAAGATSIAVGGSGSGIHNLG--- 1012 Query: 210 MAQHYRIFDMESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSS- 268 A + T G G + ++ ++ + L S Sbjct: 1013 -ANATIALSGTQVATTG--NGAAGLASTGAGARIATDAATVVRTAGADALGLSVSGADST 1069 Query: 269 --SPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTI 309 + G + AH + G +L+ G I Sbjct: 1070 LAANGTTVATTGANAHAIVMDGGATALLSGAKISASGAAADGI 1112 >gi|255547165|ref|XP_002514640.1| conserved hypothetical protein [Ricinus communis] gi|223546244|gb|EEF47746.1| conserved hypothetical protein [Ricinus communis] Length = 1094 Score = 37.2 bits (84), Expect = 6.7, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 4/56 (7%) Query: 312 NTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKR---KEVERELSEIE 364 + + P EP P Q + QKP++ A+ +P E+E+ L ++E Sbjct: 411 DLIMKP-ISRLPIEPAPWKQLEGSRASQKPAKLSAKTSNPFPTVYSEIEKRLKDLE 465 >gi|71834474|ref|NP_001025335.1| human immunodeficiency virus type I enhancer binding protein 2 [Danio rerio] gi|55251090|emb|CAH68883.1| novel protein similar to vertebrate human immunodeficiency virus type I enhancer binding protein 2 (HIVEP2) [Danio rerio] Length = 2298 Score = 37.2 bits (84), Expect = 6.9, Method: Composition-based stats. 
Identities = 29/130 (22%), Positives = 54/130 (41%), Gaps = 18/130 (13%) Query: 255 ITERLPYKHGVKSSSPGLHTSFDAYEAHTD-----------TLAHGVDSLVRGEYPHFDQ 303 ++E+ ++ SP H ++ E TL H LVR P+ Sbjct: 751 LSEQSDTENIDDVQSPDSHHRSESMEHQQQGDNEHGSFSSNTLYHMPHKLVR--QPNIQV 808 Query: 304 EKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVER--ELS 361 +++ + + P +P +P EP +E Q+ SE L++ P K ++ L+ Sbjct: 809 PEIRVTEEP--DKPEKEPEVPAKEPEKHVEEFQWPQR-SETLSQLPAEKLPPKKKRLRLA 865 Query: 362 EIEGAKKESS 371 ++E + ESS Sbjct: 866 DMEHSSGESS 875 >gi|195387950|ref|XP_002052655.1| GJ20515 [Drosophila virilis] gi|194149112|gb|EDW64810.1| GJ20515 [Drosophila virilis] Length = 424 Score = 36.8 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 37/177 (20%), Positives = 64/177 (36%), Gaps = 12/177 (6%) Query: 237 SKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 SK + +++ ++ E + + K E ++ V Sbjct: 80 SKATETKAMKPEPEMGEAADTKSLEQQDTKKKLEAEPELSRPKE--RKSMEKQEK--VAE 135 Query: 297 EYPHFDQEKLQTIADNTLEDPHFKPHLP---EPEPLPQYKEHSDRQKPSEPLAEHPHPKR 353 E P + + + I ++E P + EPE Q K D++ P+EP AE Sbjct: 136 EKPELEVDTSRKIE--SMEQPETETEPSPKTEPESARQAKAVEDQENPTEPPAEPEAIAS 193 Query: 354 KEVERELSEIEGAKKES--SARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHA 408 E + + +E E + S S K +E P E KL+P A +A H+ Sbjct: 194 MEQQADTNEAEPETETSKGSETKAMEETEIATEP-PAEPETKLEPSNAAQSAEATHS 249 >gi|302895381|ref|XP_003046571.1| hypothetical protein NECHADRAFT_66377 [Nectria haematococca mpVI 77-13-4] gi|256727498|gb|EEU40858.1| hypothetical protein NECHADRAFT_66377 [Nectria haematococca mpVI 77-13-4] Length = 479 Score = 36.8 bits (83), Expect = 7.2, Method: Composition-based stats. 
Identities = 32/201 (15%), Positives = 63/201 (31%), Gaps = 15/201 (7%) Query: 11 IRDNIKEWAQRPRVSPDIKWHTGLGKEVINMPARSLDKLVAPFREETHDQPNYYRGSRTD 70 I D + + + P PD + + K I P + + P P G+ D Sbjct: 138 ITDAVDDGSATPAGEPDDDFFSSWDKPAIKKPTPPVSRTATPPVMGRTPSPFLNAGNGKD 197 Query: 71 PHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAG--LALQSAPLAAGALYA 128 + + + P A + R A A ++ L A + + Sbjct: 198 I-ARASSPLARTASSESKPASRITTSAALRKTGGGIGGPRKANVLGAKKTTKLGAKKVTS 256 Query: 129 YLSHKAESSIHHQIE-------GVDKETADALAWREAIVHTSALLAPGAIAS-----QSI 176 E+ + E G D + + A + A +A+++P ++ S Sbjct: 257 DAIDFDEAERKAKEEADRIAKLGYDPDAEEDPATKAATGSAAAIISPTPVSPNKSSYSSH 316 Query: 177 AKTVASGAVLNVPFGMVERGW 197 + + V + GM G+ Sbjct: 317 TRQKSDAEVERLGMGMGRLGF 337 >gi|328865177|gb|EGG13563.1| hypothetical protein DFA_11324 [Dictyostelium fasciculatum] Length = 1253 Score = 36.8 bits (83), Expect = 7.4, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 33/62 (53%), Gaps = 2/62 (3%) Query: 317 PHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIE-GAKKESSARKF 375 P +P P Y + +Q+P+ P A+ P P +++V ++++++ GAK + + Sbjct: 586 PPKQPAPAPLSQRPVYPQQQQQQRPTAPSAQ-PKPSQQQVVKQVTDMMGGAKFDVNQSTI 644 Query: 376 FD 377 FD Sbjct: 645 FD 646 >gi|288918412|ref|ZP_06412764.1| hypothetical protein FrEUN1fDRAFT_2460 [Frankia sp. EUN1f] gi|288350175|gb|EFC84400.1| hypothetical protein FrEUN1fDRAFT_2460 [Frankia sp. EUN1f] Length = 535 Score = 36.8 bits (83), Expect = 7.5, Method: Composition-based stats. 
Identities = 32/142 (22%), Positives = 44/142 (30%), Gaps = 4/142 (2%) Query: 52 PFREETHDQPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRL 111 P R E + RG+R VGTGA A A + G L Sbjct: 240 PIRTENDLVLTFLRGARAGREPVGTGAGGTTA--DPARDAGSALVFGPGRQLGADQLAAA 297 Query: 112 AGLALQSAPLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAI 171 G Q+ A G L + EG D W E +V P A+ Sbjct: 298 FGPVAQTVRAAVGGLLHRRGDLRRRGDLRRREGGDALPGGEAPWPEVVVVGGLGFLPAAV 357 Query: 172 ASQSIAKTVASGAVLNVPFGMV 193 +++ VA ++P G Sbjct: 358 --EAVRTAVAEAWPADLPGGSG 377 >gi|188591999|ref|YP_001796597.1| branched-chain alpha-keto acid dehydrogenase subunit e2 [Cupriavidus taiwanensis LMG 19424] gi|170938373|emb|CAP63360.1| Dihydrolipoyllysine-residue acetyltransferase component of acetoin cleaving system [Cupriavidus taiwanensis LMG 19424] Length = 371 Score = 36.8 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 42/252 (16%), Positives = 75/252 (29%), Gaps = 25/252 (9%) Query: 60 QPNYYRGSRTDPHSVGTGAHLVEGLTSLAPYIAGAALAGKLLSFIPTPLTRLAGLALQSA 119 Q G R G GA V + + L+ T + QS+ Sbjct: 114 QFAEVDGIRVRYARKGNGAQTVLFIHGFGGDLDNWLFNLDPLADAYTVVALDLPGHGQSS 173 Query: 120 PLAAGALYAYLSHKAESSIHHQIEGVDKETADALAWREAIVHTSALLAPGAIASQSIAKT 179 P AG A ++ + G++ + + A+ AP + S ++ Sbjct: 174 PRLAGTTLAQMAGFVARFMD--EAGIEAAHVVGHSMGGGVAAQLAVDAPQRVLSVALVSP 231 Query: 180 VASGAVLNV----PFGMVER-----------GWSSKVLEDHGYPDMAQHYRIFDMESLIT 224 V G +N F + ++ D+ + Y+ D Sbjct: 232 VGFGEAVNSDYTDGFVKAQSRRELKPVVELLFADPGLVSRQMLDDLLR-YKRLDGVDEAL 290 Query: 225 DGLIGAFFGGMHSKQVQNMSLRLVNDLKE-----GITERLPYKHGVKSSSPGLHTSFDAY 279 L FGG +Q + RL + K G +R+ +++ PG + A Sbjct: 291 AALGQGLFGG--GRQSEQPGQRLADSGKRVLVVWGAQDRIIPAGHAEAAPPGANVKVFAD 348 Query: 280 EAHTDTLAHGVD 291 H + D Sbjct: 349 AGHMSQMEKAND 360 >gi|47210024|emb|CAF90899.1| unnamed protein product [Tetraodon nigroviridis] Length = 1552 Score = 36.8 bits (83), Expect = 7.7, Method: Composition-based stats. 
Identities = 40/207 (19%), Positives = 72/207 (34%), Gaps = 16/207 (7%) Query: 228 IGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLA 287 I A +G + + + N++ P + G + + G S D+ HT Sbjct: 1137 ISAAYGRGGEARREASGGKHPNEVSLSSLGA-PEEAGDEQTDEGQEFSSDSMSDHT---E 1192 Query: 288 HGVDSLVRGEYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLP--QYKEHSDRQKPSEPL 345 V+ R D + +A + P EP+ P Q ++ + ++ ++ Sbjct: 1193 SAVEPARRPAAETLDPTERLDLAMEAISLP---EQPAEPKEEPGAQTEDERNEEEMAQRK 1249 Query: 346 A---EHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFKG----ERNQKLDPMR 398 A E + +E+ R E ++ A SP +P G P R Sbjct: 1250 ALLLEKQQKRAEELRRRKQWHEQERENRLASSERRADSPSATPPAGTTSPSPTPPATPAR 1309 Query: 399 GADFTDAPHAKFDATTFTESLPHVDEQ 425 DFT + +A+ E L V +Q Sbjct: 1310 RGDFTRSEYARRQQLRIMEDLDKVLQQ 1336 >gi|307182327|gb|EFN69609.1| STE20-like serine/threonine-protein kinase [Camponotus floridanus] Length = 1661 Score = 36.8 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 76/223 (34%), Gaps = 38/223 (17%) Query: 239 QVQNMSLRLVNDLKEGITERLPY--KHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRG 296 + ++ L L + RL K K + L TS E+H + Sbjct: 329 RTSHLPLELDQITDDSAPTRLDAEIKITDKENIATLPTSLKKEESHKREINR-------- 380 Query: 297 EYPHFDQEKLQTIADNTLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEV 356 D ED + + L + + + Q PS + P P + Sbjct: 381 --------------DGEKEDKN--------KRLRKAESKENIQPPSAEKKQAPKPPNETS 418 Query: 357 ERELSEIEGAKKESSARKFFDEGSPDHSPFKGERNQKLDPMRGADFTDAPHAKFDATTFT 416 ER LS +G + D +G N D + + TD + D + Sbjct: 419 ERRLSRDKGPAPPPPPMR-QDSEEKKKKDVEGRENVSKDVEKIVNLTDKQKSAEDKSQIN 477 Query: 417 ESLPHVDEQTMHRFSELKERHPVEAREVLEGLQEKLQGTKEIK 459 + LP +E+ + + + + E+ +E +Q +L G+ ++K Sbjct: 478 K-LPQ-NEKMVDQVTNVAEQRNLETE---NQMQNELDGSGKVK 515 >gi|221061793|ref|XP_002262466.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium knowlesi strain H] gi|193811616|emb|CAQ42344.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium knowlesi strain H] Length = 920 Score = 36.8 bits (83), Expect = 7.8, Method: 
Composition-based stats. Identities = 34/161 (21%), Positives = 55/161 (34%), Gaps = 23/161 (14%) Query: 249 NDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVD---SLVRGEYPHFD--- 302 D ++ E H + + L+ + + ++ GVD L E H + Sbjct: 603 PDDRDQTEEPSQTNHIDQDVTTFLNVQAEGEDEWASAMSQGVDPSVQLEEKEESHMEKTE 662 Query: 303 QEKLQTIADNTLEDPHFKPHLP----------------EPEPLPQYKEHSDRQKPSEPLA 346 Q+ L E LP E EPLPQ ++ + EP A Sbjct: 663 QDNLALQESANEEGGAINDELPQEENEPMNGEADGGKAEDEPLPQQEDEGVAIEMVEP-A 721 Query: 347 EHPHPKRKEVERELSEIEGAKKESSARKFFDEGSPDHSPFK 387 + P + ++ EL E AK++ + E P P K Sbjct: 722 KEDIPTEEPIKEELPIDEPAKEDIPTEEPIKEELPIDEPAK 762 >gi|255279787|ref|ZP_05344342.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] gi|255269560|gb|EET62765.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] Length = 842 Score = 36.8 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 59/180 (32%), Gaps = 6/180 (3%) Query: 100 LLSFIPTPLTRLAGLALQSAPLAAG-ALYAYLSHKAESSIHHQIEGVDKETADALAWREA 158 L + A LA + LA G + A S + S G + ++ + Sbjct: 625 LYTAFGQVTEGAASLAEGAEALAEGNSALAEGSEELYSGTKSLASGAKQLSSGSKE---- 680 Query: 159 IVHTSALLAPGAIASQSIAKTVASGAVLNVPFGMVERGWSSKVLEDHGYPDMAQHYRIFD 218 + + A GA QS A ++A G + G + L+D + D Sbjct: 681 LASGAGSAAKGASQLQSGAGSLA-GGADALRQGAGSLYSGTITLQDGATQLYDGTVELSD 739 Query: 219 MESLITDGLIGAFFGGMHSKQVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDA 278 S + DG++ G K N + D+ E I E + + SF + Sbjct: 740 GVSELYDGVVELKDGTAELKDGTNEFVEKTQDIDETIDEEIDKAVDKIAGGDFEPVSFTS 799 >gi|157849706|gb|ABV89636.1| catalytic/coenzyme binding protein [Brassica rapa] Length = 624 Score = 36.8 bits (83), Expect = 7.9, Method: Composition-based stats. 
Identities = 27/108 (25%), Positives = 40/108 (37%), Gaps = 8/108 (7%) Query: 259 LPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEYPHFDQEKLQTIADNTLEDPH 318 PY P T + +DTLA GE T+A E+ Sbjct: 472 SPYASYENLKPPSSPTPKASGIQKSDTLAPVPTDSDTGES--------STVATTVTEEAE 523 Query: 319 FKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKEVERELSEIEGA 366 P +P+ PL Y ++D + P+ P PK+ E+SE+ G Sbjct: 524 APPAIPKMRPLSPYAAYADLKPPTSPTPASTGPKKTAPAEEISELPGG 571 >gi|74199130|dbj|BAE33111.1| unnamed protein product [Mus musculus] Length = 1330 Score = 36.4 bits (82), Expect = 9.3, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 57/170 (33%), Gaps = 10/170 (5%) Query: 239 QVQNMSLRLVNDLKEGITERLPYKHGVKSSSPGLHTSFDAYEAHTDTLAHGVDSLVRGEY 298 Q Q + ERL + KS +PG + D E + + L R Sbjct: 392 QAQGSNEGRTRSPGRAEVERLFGQERRKSEAPGAFQTRD--EGRSQRPSQAQSQLRRQSS 449 Query: 299 PHFDQEKLQTIADN---TLEDPHFKPHLPEPEPLPQYKEHSDRQKPSEPLAEHPHPKRKE 355 P ++ + A T + PH P+ P+ R P P ++ + Sbjct: 450 PAPSRQVTKPSAKQAEPTRQSRTGSPHPKSPDKRPEGDRQLQRTSPPARTPARPPERKAQ 509 Query: 356 VERELSEIEGAKKES-----SARKFFDEGSPDHSPFKGERNQKLDPMRGA 400 +ER L ++S S + SP+ P K +QK P G Sbjct: 510 IERHLESGHTGPRQSLGGWQSQERLSGPQSPNRHPEKSWGSQKEGPSLGG 559 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.302 0.115 0.277 Lambda K H 0.267 0.0355 0.140 Matrix: 
BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,190,407,485
Number of Sequences: 14124377
Number of extensions: 279338330
Number of successful extensions: 1005180
Number of sequences better than 10.0: 1438
Number of HSP's better than 10.0 without gapping: 139
Number of HSP's successfully gapped in prelim test: 1557
Number of HSP's that attempted gapping in prelim test: 998021
Number of HSP's gapped (non-prelim): 6785
length of query: 478
length of database: 4,842,793,630
effective HSP length: 143
effective length of query: 335
effective length of database: 2,823,007,719
effective search space: 945707585865
effective search space used: 945707585865
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.4 bits)
S2: 83 (36.8 bits)
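
The length and score figures in the statistics footer can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (effective lengths obtained by subtracting the effective HSP length, bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^(-S')). The function and constant names are hypothetical, and all numeric inputs are copied from the footer above. The per-hit Expect values in this report vary slightly for the same bit score, presumably because of composition-based adjustments, so the E-value computed here should be read as an approximation only.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 478
DB_LETTERS = 4_842_793_630
DB_SEQS = 14_124_377
EFF_HSP_LEN = 143
LAMBDA_UNGAPPED, K_UNGAPPED = 0.302, 0.115   # first "Lambda K H" row
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0355      # second "Lambda K H" row


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # Each database sequence is shortened by the effective HSP length.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def approx_expect(bits, search_space):
    """Approximate E-value from a bit score and a search space."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
    )
    print("effective length of query:   ", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)
    # S1 uses the ungapped parameters, S2 the gapped ones.
    print("S1 bits:", round(bit_score(42, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
    print("S2 bits:", round(bit_score(83, LAMBDA_GAPPED, K_GAPPED), 1))
    print("approx. E at S2:", approx_expect(
        bit_score(83, LAMBDA_GAPPED, K_GAPPED), space))
```

Under these assumptions the three length figures reproduce the footer exactly, and the S1 and S2 cutoffs reproduce the bit values printed next to them.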