BLASTP 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 006475
(643 letters)

Database: nr
23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done
>gi|224053020|ref|XP_002297667.1| predicted protein [Populus trichocarpa]
gi|222844925|gb|EEE82472.1| predicted protein [Populus trichocarpa]
Length = 646
Score = 1064 bits (2752), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 517/640 (80%), Positives = 557/640 (87%), Gaps = 31/640 (4%)
Query: 26 RPRLP-KFPFYPAYFTKSPSCP----------SIACHVSTTG-----------GGGAAQM 63
RP LP KFPFYP F KS CP S++ HVST+ ++
Sbjct: 16 RPFLPIKFPFYPPPFVKSQFCPLSPPAHLFKPSLSRHVSTSSFPSSRGRGSSVSMESSSP 75
Query: 64 ESSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDS 123
E + S+DSVT DLKNQ L G + KLK LEDLNWDHSFVR LPGDPR D+
Sbjct: 76 EPTVSLDSVTQDLKNQTL--------GPDDVSKAKLK-LEDLNWDHSFVRALPGDPRADT 126
Query: 124 IPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGA 183
IPR+V+HACYTKV PSAEVENP+LVAWS+SVAD +LDPKEFERPDFPL FSGA+PL GA
Sbjct: 127 IPRQVMHACYTKVLPSAEVENPELVAWSDSVADLFDLDPKEFERPDFPLLFSGASPLVGA 186
Query: 184 VPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVL 243
+PYAQCYGGHQFGMWAGQLGDGRAITLGE++N KSERWELQLKG+G+TPYSRFADGLAVL
Sbjct: 187 LPYAQCYGGHQFGMWAGQLGDGRAITLGEVVNSKSERWELQLKGSGRTPYSRFADGLAVL 246
Query: 244 RSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLR 303
RSSIREFLCSEAMH LGIPTTRAL LVTTGK+VTRDMFYDGN KEEPGAIVCRVA SFLR
Sbjct: 247 RSSIREFLCSEAMHCLGIPTTRALSLVTTGKYVTRDMFYDGNAKEEPGAIVCRVAPSFLR 306
Query: 304 FGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNK 363
FGSYQIHASRG+EDL+IVR LADYAIRHHF HIENMNKSESLSFSTGDEDHSVVDLTSNK
Sbjct: 307 FGSYQIHASRGKEDLEIVRALADYAIRHHFPHIENMNKSESLSFSTGDEDHSVVDLTSNK 366
Query: 364 YAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNT 423
YAAW VE+AERTAS++A WQGVGFTHGV+NTDNMSILGLTIDYGPFGFLDAFDPSFTPNT
Sbjct: 367 YAAWTVEIAERTASMIASWQGVGFTHGVMNTDNMSILGLTIDYGPFGFLDAFDPSFTPNT 426
Query: 424 TDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTK 483
TDLPGRRYCFANQPDIGLWNIAQF+ TL+ AKLI DKEA+Y MERYG KFMDEYQA+MT+
Sbjct: 427 TDLPGRRYCFANQPDIGLWNIAQFTATLSTAKLISDKEADYAMERYGNKFMDEYQAMMTR 486
Query: 484 KLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
KLGLPKYNKQ+ISKLLNNMAVDKVDYTNFFR LSNVKADP IPEDELLVPLKAVLLDIG+
Sbjct: 487 KLGLPKYNKQLISKLLNNMAVDKVDYTNFFRLLSNVKADPKIPEDELLVPLKAVLLDIGQ 546
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
ERKEAW+SWV SY+ EL +SGISDE+RKA MNSVNPKYVLRNYLCQ+AIDAAE GD+ EV
Sbjct: 547 ERKEAWMSWVQSYVHELAASGISDEQRKAQMNSVNPKYVLRNYLCQTAIDAAEQGDYTEV 606
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 607 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 646
>gi|297746392|emb|CBI16448.3| unnamed protein product [Vitis vinifera]
Length = 672
Score = 1061 bits (2743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 509/639 (79%), Positives = 557/639 (87%), Gaps = 19/639 (2%)
Query: 6 HFSTKPHLLFSSLSSSSSSLRPRLPK-FPFYPAYFTKSPSCPSIACHVSTTGGGGAAQME 64
HFS +FS S SL +L + F F P ++S PS + S + +
Sbjct: 52 HFSYSSCPIFSPFFRSHPSLSSKLSRSFHFRPGVSSESAFSPSRSMEASPSA-------D 104
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
++A+V+S+ L+NQRL +E + L LEDLNWDHSFV ELPGDPRTD I
Sbjct: 105 AAATVESLADGLRNQRLGSEN-----------RVLLRLEDLNWDHSFVHELPGDPRTDPI 153
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
PR+VLHACYTK+SPSAEVENPQLVAW ESVA+ L+LDPKEFERPDFPL FSGA+ L G +
Sbjct: 154 PRQVLHACYTKISPSAEVENPQLVAWLESVAELLDLDPKEFERPDFPLIFSGASLLVGGL 213
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
PYAQCYGGHQFGMWAGQLGDGRAITLGE+LN KSERWELQLKGAG+TPYSRFADGLAVLR
Sbjct: 214 PYAQCYGGHQFGMWAGQLGDGRAITLGELLNSKSERWELQLKGAGRTPYSRFADGLAVLR 273
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSEAMH LGIPTTRALCLVTTGK+VTRDMFYDGNPKEEPGAIVCRVAQSFLRF
Sbjct: 274 SSIREFLCSEAMHSLGIPTTRALCLVTTGKYVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 333
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
GSYQIHA+RG+EDL IVR LADY IRHHF HIENM +SE LSFSTG++D S+VDLTSNKY
Sbjct: 334 GSYQIHAARGKEDLGIVRALADYTIRHHFPHIENMTRSEGLSFSTGEQDESIVDLTSNKY 393
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW+VEVAERTASLVA WQGVGFTHGVLNTDNMS+LGLTIDYGPFGFLDAFDPS+TPNTT
Sbjct: 394 AAWSVEVAERTASLVASWQGVGFTHGVLNTDNMSVLGLTIDYGPFGFLDAFDPSYTPNTT 453
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
DLPGRRYCFANQPDIGLWNIAQF++TL +A+LI+DKEANY MERYGTKFMDEYQAIMT+K
Sbjct: 454 DLPGRRYCFANQPDIGLWNIAQFTSTLMSAELINDKEANYAMERYGTKFMDEYQAIMTRK 513
Query: 485 LGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
LGLPKYNKQ+ISKLLNNMAVDKVDYTNFFR LSN+KADP+IP+DELL PLKAVLLDIGKE
Sbjct: 514 LGLPKYNKQLISKLLNNMAVDKVDYTNFFRLLSNIKADPTIPQDELLTPLKAVLLDIGKE 573
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
RKE+WISWV SYIQEL +SGISDEERKA MNSVNPKYVLRNYLCQSAIDAAE GDFG VR
Sbjct: 574 RKESWISWVQSYIQELAASGISDEERKASMNSVNPKYVLRNYLCQSAIDAAEQGDFGVVR 633
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
R+LK+MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 634 RILKIMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 672
>gi|225435594|ref|XP_002285614.1| PREDICTED: UPF0061 protein AZOSEA38000-like [Vitis vinifera]
Length = 651
Score = 1059 bits (2739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 509/639 (79%), Positives = 557/639 (87%), Gaps = 19/639 (2%)
Query: 6 HFSTKPHLLFSSLSSSSSSLRPRLPK-FPFYPAYFTKSPSCPSIACHVSTTGGGGAAQME 64
HFS +FS S SL +L + F F P ++S PS + S + +
Sbjct: 31 HFSYSSCPIFSPFFRSHPSLSSKLSRSFHFRPGVSSESAFSPSRSMEASPSA-------D 83
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
++A+V+S+ L+NQRL +E + L LEDLNWDHSFV ELPGDPRTD I
Sbjct: 84 AAATVESLADGLRNQRLGSEN-----------RVLLRLEDLNWDHSFVHELPGDPRTDPI 132
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
PR+VLHACYTK+SPSAEVENPQLVAW ESVA+ L+LDPKEFERPDFPL FSGA+ L G +
Sbjct: 133 PRQVLHACYTKISPSAEVENPQLVAWLESVAELLDLDPKEFERPDFPLIFSGASLLVGGL 192
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
PYAQCYGGHQFGMWAGQLGDGRAITLGE+LN KSERWELQLKGAG+TPYSRFADGLAVLR
Sbjct: 193 PYAQCYGGHQFGMWAGQLGDGRAITLGELLNSKSERWELQLKGAGRTPYSRFADGLAVLR 252
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSEAMH LGIPTTRALCLVTTGK+VTRDMFYDGNPKEEPGAIVCRVAQSFLRF
Sbjct: 253 SSIREFLCSEAMHSLGIPTTRALCLVTTGKYVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 312
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
GSYQIHA+RG+EDL IVR LADY IRHHF HIENM +SE LSFSTG++D S+VDLTSNKY
Sbjct: 313 GSYQIHAARGKEDLGIVRALADYTIRHHFPHIENMTRSEGLSFSTGEQDESIVDLTSNKY 372
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW+VEVAERTASLVA WQGVGFTHGVLNTDNMS+LGLTIDYGPFGFLDAFDPS+TPNTT
Sbjct: 373 AAWSVEVAERTASLVASWQGVGFTHGVLNTDNMSVLGLTIDYGPFGFLDAFDPSYTPNTT 432
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
DLPGRRYCFANQPDIGLWNIAQF++TL +A+LI+DKEANY MERYGTKFMDEYQAIMT+K
Sbjct: 433 DLPGRRYCFANQPDIGLWNIAQFTSTLMSAELINDKEANYAMERYGTKFMDEYQAIMTRK 492
Query: 485 LGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
LGLPKYNKQ+ISKLLNNMAVDKVDYTNFFR LSN+KADP+IP+DELL PLKAVLLDIGKE
Sbjct: 493 LGLPKYNKQLISKLLNNMAVDKVDYTNFFRLLSNIKADPTIPQDELLTPLKAVLLDIGKE 552
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
RKE+WISWV SYIQEL +SGISDEERKA MNSVNPKYVLRNYLCQSAIDAAE GDFG VR
Sbjct: 553 RKESWISWVQSYIQELAASGISDEERKASMNSVNPKYVLRNYLCQSAIDAAEQGDFGVVR 612
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
R+LK+MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 613 RILKIMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 651
>gi|449462599|ref|XP_004149028.1| PREDICTED: UPF0061 protein AZOSEA38000-like [Cucumis sativus]
Length = 649
Score = 1043 bits (2696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 503/609 (82%), Positives = 547/609 (89%), Gaps = 2/609 (0%)
Query: 36 PAYFTKSPS-CPSIACHVSTTGGGGAAQMESSASVDSVTHDLKNQRLDTETETDGGDESK 94
PA FT PS P+ + H +A E SASVDSV LKNQ L+ + DGG
Sbjct: 42 PASFTSLPSPLPAHSRHGRRKLSMDSASPEVSASVDSVAEGLKNQSLNNDDRVDGGSSIN 101
Query: 95 MTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV 154
K K LEDLNWD+SFVRELPGDPRTD IPREVLHACY+KV PS EV++PQLVAWSESV
Sbjct: 102 HATK-KKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSKVLPSVEVQSPQLVAWSESV 160
Query: 155 ADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
AD L+LDP+EFERPDFPL FSGA+PL GA PYAQCYGGHQFGMWAGQLGDGRAITLGEIL
Sbjct: 161 ADLLDLDPQEFERPDFPLLFSGASPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 220
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGK 274
N +SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRALCL+TTG
Sbjct: 221 NSRSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGT 280
Query: 275 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFR 334
FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG++D IVR LADY IRHHF
Sbjct: 281 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDFKIVRALADYVIRHHFP 340
Query: 335 HIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
H+ENM+ S+S+SFSTG+ D SVVDLTSNKYAAW VEVAERTASL+A WQGVGFTHGVLNT
Sbjct: 341 HLENMSSSQSVSFSTGNTDSSVVDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT 400
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA 454
DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF++TL+AA
Sbjct: 401 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAA 460
Query: 455 KLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFR 514
+LI+DKEANY MERYG KFMD+YQAIMTKK+GLPKYNKQ+ISKLLNNMAVDKVDYTNFFR
Sbjct: 461 ELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNKQLISKLLNNMAVDKVDYTNFFR 520
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
+LSN+KADPSIPE+ELLVPLKAVLLDIGKERKEAW+SWV +Y++EL SGISDEERKA M
Sbjct: 521 SLSNLKADPSIPEEELLVPLKAVLLDIGKERKEAWVSWVKTYMEELAGSGISDEERKASM 580
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNPKY+LRNYLCQ+AIDAAE GDFGEVR+LLK+MERP+DEQPGMEKYARLPPAWAYRP
Sbjct: 581 DAVNPKYILRNYLCQTAIDAAEQGDFGEVRQLLKIMERPFDEQPGMEKYARLPPAWAYRP 640
Query: 635 GVCMLSCSS 643
GVCMLSCSS
Sbjct: 641 GVCMLSCSS 649
>gi|255544744|ref|XP_002513433.1| Selenoprotein O, putative [Ricinus communis]
gi|223547341|gb|EEF48836.1| Selenoprotein O, putative [Ricinus communis]
Length = 654
Score = 1034 bits (2673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 503/634 (79%), Positives = 554/634 (87%), Gaps = 21/634 (3%)
Query: 27 PRLPKFPFYPA-------YFTKSPSCPSIACHVSTTGGGGAAQM---------ESSASVD 70
PR K FYP+ ++++SP P + C V+T+ G+ M + + VD
Sbjct: 25 PRHFKSRFYPSSSFLSSHFYSRSPH-PYLVCGVNTSSSSGSVSMDSSGSPEAASTMSVVD 83
Query: 71 SVTHDLKNQRLDTETETDGGDESKMTKKLKA-LEDLNWDHSFVRELPGDPRTDSIPREVL 129
SVT+D KNQ L + + ++ T K+K+ L+DLNWDHSFVRELPGD RTD+IPR+VL
Sbjct: 84 SVTNDFKNQSLRDDDNNN---KNNTTSKVKSSLDDLNWDHSFVRELPGDSRTDTIPRQVL 140
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQC 189
HAC++KV PSAEVENPQLVAWSESVA L+LD KEFERPDF L FSGA+ L G++PYAQC
Sbjct: 141 HACFSKVFPSAEVENPQLVAWSESVAVLLDLDLKEFERPDFALKFSGASTLVGSLPYAQC 200
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
YGGHQFGMWAGQLGDGRAITLGEILN KSERWELQLKGAGKTPYSRFADGLAVLRSSIRE
Sbjct: 201 YGGHQFGMWAGQLGDGRAITLGEILNSKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 260
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
FLCSEAMH LGIPTTRALCLVTTGK+VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS+QI
Sbjct: 261 FLCSEAMHHLGIPTTRALCLVTTGKYVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSFQI 320
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
HASRG+ED IVR LADYAIRHHF HI+NM KSESLSFS G ED S+VDLTSNKYAAW V
Sbjct: 321 HASRGKEDFGIVRALADYAIRHHFPHIDNMTKSESLSFSMGAEDDSIVDLTSNKYAAWTV 380
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EVAERTASL+A WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS+TPNTTDLPGR
Sbjct: 381 EVAERTASLIASWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSYTPNTTDLPGR 440
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RYCFANQPDIGLWNIAQF+ TL+ A+LI+DKEANY MERYG KFMDEYQAIMT+KLGLPK
Sbjct: 441 RYCFANQPDIGLWNIAQFTATLSEAQLINDKEANYAMERYGNKFMDEYQAIMTRKLGLPK 500
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
YNKQ+ISKLLNNMAVDKVDYTNFFR LSN+KADP+IPE+ELLVPLKA LLDIGKERKEAW
Sbjct: 501 YNKQLISKLLNNMAVDKVDYTNFFRLLSNIKADPNIPEEELLVPLKAALLDIGKERKEAW 560
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
ISWV SY+QEL +S ISD+ERKA M++VNPKY+LRNYLCQ+AIDAAE GD GEVRRLLKL
Sbjct: 561 ISWVQSYVQELAASDISDDERKAQMDAVNPKYILRNYLCQTAIDAAEQGDMGEVRRLLKL 620
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
MERP+DEQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 621 MERPFDEQPGMEKYARLPPAWAYRPGVCMLSCSS 654
>gi|13430492|gb|AAK25868.1|AF360158_1 unknown protein [Arabidopsis thaliana]
Length = 585
Score = 997 bits (2577), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/579 (81%), Positives = 517/579 (89%), Gaps = 8/579 (1%)
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
+ +S DS+ DL+NQ L G + K K LED NWDHSFV+ELPGDPRTD I
Sbjct: 15 TDSSADSLAKDLQNQSL--------GAVDEGVKIKKKLEDFNWDHSFVKELPGDPRTDVI 66
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
REVLHACY+KVSPS EV++PQLVAWS SVA+ L+LDPKEFERPDFPL SGA PL GA+
Sbjct: 67 SREVLHACYSKVSPSVEVDDPQLVAWSVSVAELLDLDPKEFERPDFPLMLSGAKPLPGAM 126
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
YAQCYGGHQFGMWAGQLGDGRAITLGE+LN K ERWELQLKGAG+TPYSRFADGLAVLR
Sbjct: 127 SYAQCYGGHQFGMWAGQLGDGRAITLGEVLNSKGERWELQLKGAGRTPYSRFADGLAVLR 186
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSE MH LGIPTTRALCL+TTG+ VTRDMFYDGNPKEEPGAIVCRV+QSFLRF
Sbjct: 187 SSIREFLCSETMHCLGIPTTRALCLLTTGQNVTRDMFYDGNPKEEPGAIVCRVSQSFLRF 246
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
GSYQIHASRG+EDLDIVR LADYAI+HHF HIE+M++S+SLSF TGDED SVVDLTSNKY
Sbjct: 247 GSYQIHASRGKEDLDIVRKLADYAIKHHFPHIESMDRSDSLSFKTGDEDDSVVDLTSNKY 306
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW VE+AERTA+LVA+WQGVGFTHGVLNTDNMSILG TIDYGPFGFLDAFDPS+TPNTT
Sbjct: 307 AAWIVEIAERTATLVARWQGVGFTHGVLNTDNMSILGQTIDYGPFGFLDAFDPSYTPNTT 366
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
DLPGRRYCFANQPDIGLWNIAQFS TLA A+LI+ KEANY MERYG KFMDEYQAIM+KK
Sbjct: 367 DLPGRRYCFANQPDIGLWNIAQFSKTLAVAQLINQKEANYAMERYGDKFMDEYQAIMSKK 426
Query: 485 LGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
LGL KYNK++ISKLLNNM+VDKVDYTNFFR L+NVKA+P+ PE+ELL PLKAVLLDIGKE
Sbjct: 427 LGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLKAVLLDIGKE 486
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
RKEAWI W+ SYIQE+ S +SDEERKA M+SVNPKY+LRNYLCQSAIDAAE GDF EV
Sbjct: 487 RKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAAEQGDFSEVN 546
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L++LM+RPY+EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 547 NLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 585
>gi|356576911|ref|XP_003556573.1| PREDICTED: UPF0061 protein AZOSEA38000-like [Glycine max]
Length = 590
Score = 996 bits (2575), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 466/542 (85%), Positives = 503/542 (92%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
LEDL WDHSFVRELPGDPR DS PREVLHACYT+VSPS +V NPQLVA+S+ VAD L+LD
Sbjct: 49 LEDLKWDHSFVRELPGDPRRDSFPREVLHACYTQVSPSVQVHNPQLVAFSQPVADLLDLD 108
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
KEF+RPDFPLFFSGATPL GA+PYAQCYGGHQFGMWAGQLGDGRA+TLGEILN SERW
Sbjct: 109 HKEFQRPDFPLFFSGATPLVGALPYAQCYGGHQFGMWAGQLGDGRAMTLGEILNSNSERW 168
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAGKTPYSRFADGLAVLRSS+REFLCSEAMH LGIPTTRAL LVTTG VTRDMF
Sbjct: 169 ELQLKGAGKTPYSRFADGLAVLRSSVREFLCSEAMHHLGIPTTRALSLVTTGNLVTRDMF 228
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR EDL +VR LADYAIRHHF HI+NM+K
Sbjct: 229 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRSDEDLGLVRVLADYAIRHHFPHIQNMSK 288
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
S+SLSF TGDEDHSVVDLTSNKYAAW VE+AERTASL+A+WQGVGFTHGVLNTDNMSILG
Sbjct: 289 SDSLSFCTGDEDHSVVDLTSNKYAAWVVEIAERTASLIARWQGVGFTHGVLNTDNMSILG 348
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGPFGFLDAFDP FTPNTTDLPGRRYCFANQPDIGLWNIAQF+TTL AA LI++KE
Sbjct: 349 LTIDYGPFGFLDAFDPKFTPNTTDLPGRRYCFANQPDIGLWNIAQFTTTLQAAHLINEKE 408
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
ANY MERYGT+FMD+YQ MTKKLGLPKYNKQ+I+KLL+NMAVDKVDYTNFFR LSNVKA
Sbjct: 409 ANYAMERYGTRFMDDYQVTMTKKLGLPKYNKQMINKLLSNMAVDKVDYTNFFRTLSNVKA 468
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
D +IP+DELLVPLK+VLLDIGKERKEAW SW+ +YI E+ +SGI D+ERK M+SVNPKY
Sbjct: 469 DINIPDDELLVPLKSVLLDIGKERKEAWTSWLKAYIHEVSTSGIPDDERKISMDSVNPKY 528
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
+LRNYLCQ+AIDAAE+GDFGEVR LLKL+E PYDEQPGMEKYARLPPAWAYRPGVCMLSC
Sbjct: 529 ILRNYLCQTAIDAAEIGDFGEVRSLLKLVEHPYDEQPGMEKYARLPPAWAYRPGVCMLSC 588
Query: 642 SS 643
SS
Sbjct: 589 SS 590
>gi|51971098|dbj|BAD44241.1| unnamed protein product [Arabidopsis thaliana]
Length = 630
Score = 995 bits (2573), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/579 (81%), Positives = 517/579 (89%), Gaps = 8/579 (1%)
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
+ +S DS+ DL+NQ L G + K K LED NWDHSFV+ELPGDPRTD I
Sbjct: 60 TDSSADSLAKDLQNQSL--------GAVDEGVKIKKKLEDFNWDHSFVKELPGDPRTDVI 111
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
REVLHACY+KVSPS EV++PQLVAWS SVA+ L+LDPKEFERPDFPL SGA PL GA+
Sbjct: 112 SREVLHACYSKVSPSVEVDDPQLVAWSVSVAELLDLDPKEFERPDFPLMLSGAKPLPGAM 171
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
YAQCYGGHQFGMWAGQLGDGRAITLGE+LN K ERWELQLKGAG+TPYSRFADGLAVLR
Sbjct: 172 SYAQCYGGHQFGMWAGQLGDGRAITLGEVLNSKGERWELQLKGAGRTPYSRFADGLAVLR 231
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSE MH LGIPTTRALCL+TTG+ VTRDMFYDGNPKEEPGAIVCRV+QSFLRF
Sbjct: 232 SSIREFLCSETMHCLGIPTTRALCLLTTGQNVTRDMFYDGNPKEEPGAIVCRVSQSFLRF 291
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
GSYQIHASRG+EDLDIVR LADYAI+HHF HIE+M++S+SLSF TGDED SVVDLTSNKY
Sbjct: 292 GSYQIHASRGKEDLDIVRKLADYAIKHHFPHIESMDRSDSLSFKTGDEDDSVVDLTSNKY 351
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW VE+AERTA+LVA+WQGVGFTHGVLNTDNMSILG TIDYGPFGFLDAFDPS+TPNTT
Sbjct: 352 AAWIVEIAERTATLVARWQGVGFTHGVLNTDNMSILGQTIDYGPFGFLDAFDPSYTPNTT 411
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
DLPGRRYCFANQPDIGLWNIAQFS TLA A+LI+ KEANY MERYG KFMDEYQAIM+KK
Sbjct: 412 DLPGRRYCFANQPDIGLWNIAQFSKTLAVAQLINQKEANYAMERYGDKFMDEYQAIMSKK 471
Query: 485 LGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
LGL KYNK++ISKLLNNM+VDKVDYTNFFR L+NVKA+P+ PE+ELL PLKAVLLDIGKE
Sbjct: 472 LGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLKAVLLDIGKE 531
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
RKEAWI W+ SYIQE+ S +SDEERKA M+SVNPKY+LRNYLCQSAIDAAE GDF EV
Sbjct: 532 RKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAAEQGDFSEVN 591
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L++LM+RPY+EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 592 NLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 630
>gi|30684227|ref|NP_196807.2| uncharacterized protein [Arabidopsis thaliana]
gi|24030204|gb|AAN41282.1| unknown protein [Arabidopsis thaliana]
gi|332004460|gb|AED91843.1| uncharacterized protein [Arabidopsis thaliana]
Length = 633
Score = 995 bits (2572), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/579 (81%), Positives = 517/579 (89%), Gaps = 8/579 (1%)
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
+ +S DS+ DL+NQ L G + K K LED NWDHSFV+ELPGDPRTD I
Sbjct: 63 TDSSADSLAKDLQNQSL--------GAVDEGVKIKKKLEDFNWDHSFVKELPGDPRTDVI 114
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
REVLHACY+KVSPS EV++PQLVAWS SVA+ L+LDPKEFERPDFPL SGA PL GA+
Sbjct: 115 SREVLHACYSKVSPSVEVDDPQLVAWSVSVAELLDLDPKEFERPDFPLMLSGAKPLPGAM 174
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
YAQCYGGHQFGMWAGQLGDGRAITLGE+LN K ERWELQLKGAG+TPYSRFADGLAVLR
Sbjct: 175 SYAQCYGGHQFGMWAGQLGDGRAITLGEVLNSKGERWELQLKGAGRTPYSRFADGLAVLR 234
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSE MH LGIPTTRALCL+TTG+ VTRDMFYDGNPKEEPGAIVCRV+QSFLRF
Sbjct: 235 SSIREFLCSETMHCLGIPTTRALCLLTTGQNVTRDMFYDGNPKEEPGAIVCRVSQSFLRF 294
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
GSYQIHASRG+EDLDIVR LADYAI+HHF HIE+M++S+SLSF TGDED SVVDLTSNKY
Sbjct: 295 GSYQIHASRGKEDLDIVRKLADYAIKHHFPHIESMDRSDSLSFKTGDEDDSVVDLTSNKY 354
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW VE+AERTA+LVA+WQGVGFTHGVLNTDNMSILG TIDYGPFGFLDAFDPS+TPNTT
Sbjct: 355 AAWIVEIAERTATLVARWQGVGFTHGVLNTDNMSILGQTIDYGPFGFLDAFDPSYTPNTT 414
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
DLPGRRYCFANQPDIGLWNIAQFS TLA A+LI+ KEANY MERYG KFMDEYQAIM+KK
Sbjct: 415 DLPGRRYCFANQPDIGLWNIAQFSKTLAVAQLINQKEANYAMERYGDKFMDEYQAIMSKK 474
Query: 485 LGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
LGL KYNK++ISKLLNNM+VDKVDYTNFFR L+NVKA+P+ PE+ELL PLKAVLLDIGKE
Sbjct: 475 LGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLKAVLLDIGKE 534
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
RKEAWI W+ SYIQE+ S +SDEERKA M+SVNPKY+LRNYLCQSAIDAAE GDF EV
Sbjct: 535 RKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAAEQGDFSEVN 594
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L++LM+RPY+EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 595 NLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 633
>gi|51971224|dbj|BAD44304.1| unnamed protein product [Arabidopsis thaliana]
gi|51971665|dbj|BAD44497.1| unnamed protein product [Arabidopsis thaliana]
Length = 632
Score = 995 bits (2572), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/579 (81%), Positives = 517/579 (89%), Gaps = 8/579 (1%)
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
+ +S DS+ DL+NQ L G + K K LED NWDHSFV+ELPGDPRTD I
Sbjct: 62 TDSSADSLAKDLQNQSL--------GAVDEGVKIKKKLEDFNWDHSFVKELPGDPRTDVI 113
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
REVLHACY+KVSPS EV++PQLVAWS SVA+ L+LDPKEFERPDFPL SGA PL GA+
Sbjct: 114 SREVLHACYSKVSPSVEVDDPQLVAWSVSVAELLDLDPKEFERPDFPLMLSGAKPLPGAM 173
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
YAQCYGGHQFGMWAGQLGDGRAITLGE+LN K ERWELQLKGAG+TPYSRFADGLAVLR
Sbjct: 174 SYAQCYGGHQFGMWAGQLGDGRAITLGEVLNSKGERWELQLKGAGRTPYSRFADGLAVLR 233
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSE MH LGIPTTRALCL+TTG+ VTRDMFYDGNPKEEPGAIVCRV+QSFLRF
Sbjct: 234 SSIREFLCSETMHCLGIPTTRALCLLTTGQNVTRDMFYDGNPKEEPGAIVCRVSQSFLRF 293
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
GSYQIHASRG+EDLDIVR LADYAI+HHF HIE+M++S+SLSF TGDED SVVDLTSNKY
Sbjct: 294 GSYQIHASRGKEDLDIVRKLADYAIKHHFPHIESMDRSDSLSFKTGDEDDSVVDLTSNKY 353
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW VE+AERTA+LVA+WQGVGFTHGVLNTDNMSILG TIDYGPFGFLDAFDPS+TPNTT
Sbjct: 354 AAWIVEIAERTATLVARWQGVGFTHGVLNTDNMSILGQTIDYGPFGFLDAFDPSYTPNTT 413
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
DLPGRRYCFANQPDIGLWNIAQFS TLA A+LI+ KEANY MERYG KFMDEYQAIM+KK
Sbjct: 414 DLPGRRYCFANQPDIGLWNIAQFSKTLAVAQLINQKEANYAMERYGDKFMDEYQAIMSKK 473
Query: 485 LGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
LGL KYNK++ISKLLNNM+VDKVDYTNFFR L+NVKA+P+ PE+ELL PLKAVLLDIGKE
Sbjct: 474 LGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLKAVLLDIGKE 533
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
RKEAWI W+ SYIQE+ S +SDEERKA M+SVNPKY+LRNYLCQSAIDAAE GDF EV
Sbjct: 534 RKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAAEQGDFSEVN 593
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L++LM+RPY+EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 594 NLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 632
>gi|297807317|ref|XP_002871542.1| hypothetical protein ARALYDRAFT_350459 [Arabidopsis lyrata subsp.
lyrata]
gi|297317379|gb|EFH47801.1| hypothetical protein ARALYDRAFT_350459 [Arabidopsis lyrata subsp.
lyrata]
Length = 582
Score = 987 bits (2551), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/579 (80%), Positives = 516/579 (89%), Gaps = 11/579 (1%)
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
+ +S D++ DL+NQ L G + K K LED NWDHSFV+ELPGDPRTD I
Sbjct: 15 TDSSADTLGKDLQNQSL--------GAVDEGCKIKKKLEDFNWDHSFVKELPGDPRTDVI 66
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
REVLHACY+KVSPS EV++PQLVAWSESVA+ L+LDPKEFERPDFPL SGA PL GA+
Sbjct: 67 SREVLHACYSKVSPSVEVDDPQLVAWSESVAELLDLDPKEFERPDFPLMLSGAKPLPGAM 126
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
PYAQCYGGHQFGMWAGQLGDGRAITLGE+LN K ERWELQLKGAG+TPYSRFADGLAVLR
Sbjct: 127 PYAQCYGGHQFGMWAGQLGDGRAITLGEVLNSKGERWELQLKGAGRTPYSRFADGLAVLR 186
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSE MH LGIPTTRALCL+TTG+ VTRD+ GNPKEEPGAIVCRV+QSF+RF
Sbjct: 187 SSIREFLCSETMHCLGIPTTRALCLLTTGQDVTRDI---GNPKEEPGAIVCRVSQSFIRF 243
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
GSYQIHASRG+EDLDIVR LADYAIRHHF HIE+M++S+SLSF TGDED SVVDLTSNKY
Sbjct: 244 GSYQIHASRGKEDLDIVRKLADYAIRHHFPHIESMDQSDSLSFKTGDEDDSVVDLTSNKY 303
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW VE+AERTA+LVA+WQGVGFTHGVLNTDNMSILG TIDYGPFGFLDAFDPS+TPNTT
Sbjct: 304 AAWIVEIAERTATLVARWQGVGFTHGVLNTDNMSILGQTIDYGPFGFLDAFDPSYTPNTT 363
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
DLPGRRYCFANQPDIGLWNIAQFS TLA A+LI+ KEANY MERYG KFMDEYQAIM+KK
Sbjct: 364 DLPGRRYCFANQPDIGLWNIAQFSKTLAVAQLINQKEANYAMERYGDKFMDEYQAIMSKK 423
Query: 485 LGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
LGL KYNK++ISKLLNNM+VDKVDYTNFFR L+NVKA+P+ PE+ELL PLKAVLLDIGKE
Sbjct: 424 LGLSKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLKAVLLDIGKE 483
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
RKEAWI W+ SYIQE+ S +SDEERKA M+SVNPKY+LRNYLCQSAIDAAE GDF EV
Sbjct: 484 RKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAAEQGDFSEVN 543
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L++LM+RPY+EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 544 NLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 582
>gi|357124422|ref|XP_003563899.1| PREDICTED: UPF0061 protein AZOSEA38000-like [Brachypodium
distachyon]
Length = 631
Score = 976 bits (2522), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/557 (82%), Positives = 502/557 (90%)
Query: 87 TDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQ 146
T G E + + LE+L WD +FVRELPGDPR+D+IPR+VLHACYTKVSPSA V+NP+
Sbjct: 75 TSGSGEGAVRPPRRTLEELAWDETFVRELPGDPRSDNIPRQVLHACYTKVSPSAPVDNPK 134
Query: 147 LVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGR 206
LVAWSESVAD L+LD KEFERPDFP FFSGATPL G+VPYAQCYGGHQFG WAGQLGDGR
Sbjct: 135 LVAWSESVADLLDLDHKEFERPDFPQFFSGATPLVGSVPYAQCYGGHQFGSWAGQLGDGR 194
Query: 207 AITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRA 266
A+TLGE+LN + ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRA
Sbjct: 195 AVTLGEVLNSRGERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHGLGIPTTRA 254
Query: 267 LCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLAD 326
LCLV TGK V RDMFYDGN KEEPGAIVCRVA SFLRFGSYQIHA+RG+EDL+IVR L D
Sbjct: 255 LCLVETGKSVVRDMFYDGNSKEEPGAIVCRVAPSFLRFGSYQIHATRGKEDLEIVRHLVD 314
Query: 327 YAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVG 386
Y IRHH+ H+E++ KSE LSF D +DLTSNKYAAWAVEVAERTA L+A+WQGVG
Sbjct: 315 YTIRHHYPHLESIKKSEGLSFEAAIGDSPAIDLTSNKYAAWAVEVAERTAYLIARWQGVG 374
Query: 387 FTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQ 446
FTHGVLNTDNMS+LGLTIDYGPFGFLDAFDPSFTPNTTDLPG+RYCFANQPD+GLWNIAQ
Sbjct: 375 FTHGVLNTDNMSVLGLTIDYGPFGFLDAFDPSFTPNTTDLPGKRYCFANQPDVGLWNIAQ 434
Query: 447 FSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDK 506
F+ L++A LI+ EANYVMERYGTKFMDEYQ+IMT+KLGL KYNKQ+ISKLLNN+AVDK
Sbjct: 435 FTGPLSSAGLINKDEANYVMERYGTKFMDEYQSIMTRKLGLSKYNKQLISKLLNNLAVDK 494
Query: 507 VDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGIS 566
VDYTNFFR LSNVKADP IPE+ELLVP+KA LLDIGKERKEAWISWV +YI+EL++SGIS
Sbjct: 495 VDYTNFFRLLSNVKADPDIPENELLVPIKAALLDIGKERKEAWISWVQTYIEELVASGIS 554
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARL 626
DEERK MN VNPKYVLRNYLCQ+AIDAA+LGD+ EVRRLLK+MERPYDEQPGMEKYARL
Sbjct: 555 DEERKTSMNQVNPKYVLRNYLCQTAIDAADLGDYEEVRRLLKVMERPYDEQPGMEKYARL 614
Query: 627 PPAWAYRPGVCMLSCSS 643
PPAWAYRPGVCMLSCSS
Sbjct: 615 PPAWAYRPGVCMLSCSS 631
>gi|326516894|dbj|BAJ96439.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 622
Score = 974 bits (2518), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/557 (82%), Positives = 502/557 (90%), Gaps = 1/557 (0%)
Query: 87 TDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQ 146
T G E+ + +ALE+L+WD +FVRELPGDPR+D+IPR+VLHACYTKVSPSA VENP+
Sbjct: 67 TSGAGEAAARPR-RALEELSWDETFVRELPGDPRSDNIPRQVLHACYTKVSPSAPVENPK 125
Query: 147 LVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGR 206
LVAWS+S AD L+LD KEFERPDFP FFSG TPL G+VPYAQCYGGHQFG WAGQLGDGR
Sbjct: 126 LVAWSQSAADLLDLDHKEFERPDFPRFFSGETPLVGSVPYAQCYGGHQFGSWAGQLGDGR 185
Query: 207 AITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRA 266
AITLGE+LN + ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRA
Sbjct: 186 AITLGEVLNSRGERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHGLGIPTTRA 245
Query: 267 LCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLAD 326
LCLV TGK V RDMFYDGN KEEPGAIVCR+A SFLRFGSYQIHA+RG+EDL+IVR LAD
Sbjct: 246 LCLVETGKSVVRDMFYDGNAKEEPGAIVCRLAPSFLRFGSYQIHATRGKEDLEIVRRLAD 305
Query: 327 YAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVG 386
YAIRHH+ H+EN+ KSE LSF D +DLTSNKYAAWAVEVAERTA L+A+WQGVG
Sbjct: 306 YAIRHHYPHLENIKKSEGLSFEAAIGDSPAIDLTSNKYAAWAVEVAERTAYLIARWQGVG 365
Query: 387 FTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQ 446
FTHGVLNTDNMS+LGLTIDYGPFGFLDAFDPSFTPNTTDLPG+RYCFANQPD+GLWNIAQ
Sbjct: 366 FTHGVLNTDNMSVLGLTIDYGPFGFLDAFDPSFTPNTTDLPGKRYCFANQPDVGLWNIAQ 425
Query: 447 FSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDK 506
F+ L+AA LI EANYVMERYGTKFMDEYQ+IMTKKLGL KYNKQ+ISKLLNN+AVDK
Sbjct: 426 FTGPLSAADLISKDEANYVMERYGTKFMDEYQSIMTKKLGLSKYNKQLISKLLNNLAVDK 485
Query: 507 VDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGIS 566
VDYTNFFR LSNVKAD IPE ELLVP+KA LLDIGKERKEAWISWV +YI+EL++SG+S
Sbjct: 486 VDYTNFFRLLSNVKADRDIPETELLVPIKAALLDIGKERKEAWISWVQTYIEELVASGVS 545
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARL 626
DEERKA MN VNPKYVLRNYLCQ+AIDAA+LGD+ EVRRLLK+ME PYDEQPGMEKYARL
Sbjct: 546 DEERKAAMNRVNPKYVLRNYLCQTAIDAADLGDYEEVRRLLKVMEHPYDEQPGMEKYARL 605
Query: 627 PPAWAYRPGVCMLSCSS 643
PPAWAYRPGVCMLSCSS
Sbjct: 606 PPAWAYRPGVCMLSCSS 622
>gi|413953849|gb|AFW86498.1| hypothetical protein ZEAMMB73_905295 [Zea mays]
Length = 630
Score = 966 bits (2496), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/543 (82%), Positives = 494/543 (90%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
LE+L WDHSFVRELPGDPR+D+IPREVLHACY++VSPSA+V+NP+LVAWS+SVAD L+L
Sbjct: 88 VLEELPWDHSFVRELPGDPRSDTIPREVLHACYSRVSPSAKVDNPKLVAWSDSVADLLDL 147
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
D KEFERPDFP FFSGATPL G++PYAQCYGGHQFG+WAGQLGDGRAI LGE++N + ER
Sbjct: 148 DHKEFERPDFPQFFSGATPLVGSLPYAQCYGGHQFGVWAGQLGDGRAIALGEVVNSRGER 207
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
WELQLKG GKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRALCLV TGK V RDM
Sbjct: 208 WELQLKGCGKTPYSRFADGLAVLRSSIREFLCSEAMHGLGIPTTRALCLVETGKSVVRDM 267
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMN 340
FYDGN KEEPGAIVCRVA SFLRFGSYQIHASRG+ED++IVR LADY I HHF H+ENM
Sbjct: 268 FYDGNAKEEPGAIVCRVAPSFLRFGSYQIHASRGKEDIEIVRRLADYTIHHHFPHLENMK 327
Query: 341 KSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSIL 400
KSE LSF T D +DLTSNKYAAWAVEVAERTA L+A+WQGVGFTHGVLNTDNMS+L
Sbjct: 328 KSEGLSFETAIGDSPTIDLTSNKYAAWAVEVAERTAYLIARWQGVGFTHGVLNTDNMSVL 387
Query: 401 GLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK 460
GLTIDYGPFGFLDAFDPS+TPNTTDLPG+RYCFANQPD+GLWNIAQF+ L++A+LI
Sbjct: 388 GLTIDYGPFGFLDAFDPSYTPNTTDLPGKRYCFANQPDVGLWNIAQFTGPLSSAELISQD 447
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
EANYVMERYGTKFMDEYQ+IMTKKLGL KYNKQ+ISKLL+NMAVDKVDYTNFFR LSNV
Sbjct: 448 EANYVMERYGTKFMDEYQSIMTKKLGLTKYNKQLISKLLSNMAVDKVDYTNFFRLLSNVN 507
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
ADP IPE+ELLVPLKA LLDIGKERKEAWISWV +YI+EL+ SG+ DEERKA MNSVNPK
Sbjct: 508 ADPGIPENELLVPLKAALLDIGKERKEAWISWVQTYIEELVESGVPDEERKAAMNSVNPK 567
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLS 640
Y+LRNYLCQSAID AE GD+ EVRR+L++M PYDEQPGMEKYARLPPAWAYRPGVCMLS
Sbjct: 568 YILRNYLCQSAIDVAEQGDYEEVRRVLRVMHNPYDEQPGMEKYARLPPAWAYRPGVCMLS 627
Query: 641 CSS 643
CSS
Sbjct: 628 CSS 630
>gi|293335415|ref|NP_001169284.1| uncharacterized protein LOC100383148 precursor [Zea mays]
gi|224028397|gb|ACN33274.1| unknown [Zea mays]
Length = 630
Score = 964 bits (2491), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/542 (82%), Positives = 493/542 (90%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
LE+L WDHSFVRELPGDPR+D+IPREVLHACY++VSPSA+V+NP+LVAWS+SVAD L+L
Sbjct: 88 VLEELPWDHSFVRELPGDPRSDTIPREVLHACYSRVSPSAKVDNPKLVAWSDSVADLLDL 147
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
D KEFERPDFP FFSGATPL G++PYAQCYGGHQFG+WAGQLGDGRAI LGE++N + ER
Sbjct: 148 DHKEFERPDFPQFFSGATPLVGSLPYAQCYGGHQFGVWAGQLGDGRAIALGEVVNSRGER 207
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
WELQLKG GKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRALCLV TGK V RDM
Sbjct: 208 WELQLKGCGKTPYSRFADGLAVLRSSIREFLCSEAMHGLGIPTTRALCLVETGKSVVRDM 267
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMN 340
FYDGN KEEPGAIVCRVA SFLRFGSYQIHASRG+ED++IVR LADY I HHF H+ENM
Sbjct: 268 FYDGNAKEEPGAIVCRVAPSFLRFGSYQIHASRGKEDIEIVRRLADYTIHHHFPHLENMK 327
Query: 341 KSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSIL 400
KSE LSF T D +DLTSNKYAAWAVEVAERTA L+A+WQGVGFTHGVLNTDNMS+L
Sbjct: 328 KSEGLSFETAIGDSPTIDLTSNKYAAWAVEVAERTAYLIARWQGVGFTHGVLNTDNMSVL 387
Query: 401 GLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK 460
GLTIDYGPFGFLDAFDPS+TPNTTDLPG+RYCFANQPD+GLWNIAQF+ L++A+LI
Sbjct: 388 GLTIDYGPFGFLDAFDPSYTPNTTDLPGKRYCFANQPDVGLWNIAQFTGPLSSAELISQD 447
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
EANYVMERYGTKFMDEYQ+IMTKKLGL KYNKQ+ISKLL+NMAVDKVDYTNFFR LSNV
Sbjct: 448 EANYVMERYGTKFMDEYQSIMTKKLGLTKYNKQLISKLLSNMAVDKVDYTNFFRLLSNVN 507
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
ADP IPE+ELLVPLKA LLDIGKERKEAWISWV +YI+EL+ SG+ DEERKA MNSVNPK
Sbjct: 508 ADPGIPENELLVPLKAALLDIGKERKEAWISWVQTYIEELVESGVPDEERKAAMNSVNPK 567
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLS 640
Y+LRNYLCQSAID AE GD+ EVRR+L++M PYDEQPGMEKYARLPPAWAYRPGVCMLS
Sbjct: 568 YILRNYLCQSAIDVAEQGDYEEVRRVLRVMHNPYDEQPGMEKYARLPPAWAYRPGVCMLS 627
Query: 641 CS 642
CS
Sbjct: 628 CS 629
>gi|115467830|ref|NP_001057514.1| Os06g0320700 [Oryza sativa Japonica Group]
gi|54290901|dbj|BAD61584.1| putative selenoprotein O [Oryza sativa Japonica Group]
gi|113595554|dbj|BAF19428.1| Os06g0320700 [Oryza sativa Japonica Group]
Length = 626
Score = 963 bits (2490), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/548 (82%), Positives = 494/548 (90%)
Query: 96 TKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
++ + LE+L+WD SFVRELPGDPR+D+IPREVLHACYTKVSPSA V+NP+LVAWS+SVA
Sbjct: 79 SRPRRVLEELSWDDSFVRELPGDPRSDAIPREVLHACYTKVSPSAPVDNPKLVAWSQSVA 138
Query: 156 DSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILN 215
D L+LD KEFERPDFP FSGA PL G+ PYAQCYGGHQFG WAGQLGDGRAITLGE++N
Sbjct: 139 DILDLDHKEFERPDFPQLFSGANPLVGSSPYAQCYGGHQFGSWAGQLGDGRAITLGEVIN 198
Query: 216 LKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKF 275
+ ERWELQLKG GKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRALCLV TGK
Sbjct: 199 SRGERWELQLKGCGKTPYSRFADGLAVLRSSIREFLCSEAMHGLGIPTTRALCLVETGKS 258
Query: 276 VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRH 335
V RDMFYDGN KEEPGAIVCRVA SFLRFGSYQIHA+R +EDL+IVR LADY IRHH+ H
Sbjct: 259 VVRDMFYDGNSKEEPGAIVCRVAPSFLRFGSYQIHATRDKEDLEIVRHLADYTIRHHYPH 318
Query: 336 IENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTD 395
+EN+ KSE LSF D +DLTSNKYAAWAVEVAERTA L+A+WQGVGFTHGVLNTD
Sbjct: 319 LENIKKSEGLSFEAAIGDSPAIDLTSNKYAAWAVEVAERTAFLIARWQGVGFTHGVLNTD 378
Query: 396 NMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAK 455
NMS+LGLTIDYGPFGFLDAFDPS+TPNTTDLPG+RYCFANQPD+GLWNIAQF++ L AA+
Sbjct: 379 NMSVLGLTIDYGPFGFLDAFDPSYTPNTTDLPGKRYCFANQPDVGLWNIAQFTSPLTAAE 438
Query: 456 LIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRA 515
LI EANYVMERYGTKFMDEYQ+IMT+KLGLPKYNKQ+I KLLNN+AVDKVDYTNFFR
Sbjct: 439 LISKDEANYVMERYGTKFMDEYQSIMTRKLGLPKYNKQLIGKLLNNLAVDKVDYTNFFRL 498
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
LSNVKAD +IPE ELLVPLKA LLDIG ERKEAWISWV +YI+EL+SSG+ DEERKA MN
Sbjct: 499 LSNVKADHNIPEKELLVPLKAALLDIGPERKEAWISWVQTYIEELVSSGVPDEERKAAMN 558
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
SVNPKYVLRNYLCQ+AIDAAE GD+ EVRRLLK+ME PYDEQPGMEKYARLPPAWAYRPG
Sbjct: 559 SVNPKYVLRNYLCQTAIDAAEQGDYDEVRRLLKVMEHPYDEQPGMEKYARLPPAWAYRPG 618
Query: 636 VCMLSCSS 643
VCMLSCSS
Sbjct: 619 VCMLSCSS 626
>gi|222635478|gb|EEE65610.1| hypothetical protein OsJ_21157 [Oryza sativa Japonica Group]
Length = 568
Score = 962 bits (2488), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/548 (82%), Positives = 494/548 (90%)
Query: 96 TKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
++ + LE+L+WD SFVRELPGDPR+D+IPREVLHACYTKVSPSA V+NP+LVAWS+SVA
Sbjct: 21 SRPRRVLEELSWDDSFVRELPGDPRSDAIPREVLHACYTKVSPSAPVDNPKLVAWSQSVA 80
Query: 156 DSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILN 215
D L+LD KEFERPDFP FSGA PL G+ PYAQCYGGHQFG WAGQLGDGRAITLGE++N
Sbjct: 81 DILDLDHKEFERPDFPQLFSGANPLVGSSPYAQCYGGHQFGSWAGQLGDGRAITLGEVIN 140
Query: 216 LKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKF 275
+ ERWELQLKG GKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRALCLV TGK
Sbjct: 141 SRGERWELQLKGCGKTPYSRFADGLAVLRSSIREFLCSEAMHGLGIPTTRALCLVETGKS 200
Query: 276 VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRH 335
V RDMFYDGN KEEPGAIVCRVA SFLRFGSYQIHA+R +EDL+IVR LADY IRHH+ H
Sbjct: 201 VVRDMFYDGNSKEEPGAIVCRVAPSFLRFGSYQIHATRDKEDLEIVRHLADYTIRHHYPH 260
Query: 336 IENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTD 395
+EN+ KSE LSF D +DLTSNKYAAWAVEVAERTA L+A+WQGVGFTHGVLNTD
Sbjct: 261 LENIKKSEGLSFEAAIGDSPAIDLTSNKYAAWAVEVAERTAFLIARWQGVGFTHGVLNTD 320
Query: 396 NMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAK 455
NMS+LGLTIDYGPFGFLDAFDPS+TPNTTDLPG+RYCFANQPD+GLWNIAQF++ L AA+
Sbjct: 321 NMSVLGLTIDYGPFGFLDAFDPSYTPNTTDLPGKRYCFANQPDVGLWNIAQFTSPLTAAE 380
Query: 456 LIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRA 515
LI EANYVMERYGTKFMDEYQ+IMT+KLGLPKYNKQ+I KLLNN+AVDKVDYTNFFR
Sbjct: 381 LISKDEANYVMERYGTKFMDEYQSIMTRKLGLPKYNKQLIGKLLNNLAVDKVDYTNFFRL 440
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
LSNVKAD +IPE ELLVPLKA LLDIG ERKEAWISWV +YI+EL+SSG+ DEERKA MN
Sbjct: 441 LSNVKADHNIPEKELLVPLKAALLDIGPERKEAWISWVQTYIEELVSSGVPDEERKAAMN 500
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
SVNPKYVLRNYLCQ+AIDAAE GD+ EVRRLLK+ME PYDEQPGMEKYARLPPAWAYRPG
Sbjct: 501 SVNPKYVLRNYLCQTAIDAAEQGDYDEVRRLLKVMEHPYDEQPGMEKYARLPPAWAYRPG 560
Query: 636 VCMLSCSS 643
VCMLSCSS
Sbjct: 561 VCMLSCSS 568
>gi|125555125|gb|EAZ00731.1| hypothetical protein OsI_22756 [Oryza sativa Indica Group]
Length = 568
Score = 962 bits (2487), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/548 (82%), Positives = 494/548 (90%)
Query: 96 TKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
++ + LE+L+WD SFVRELPGDPR+D+IPREVLHACYTKVSPSA V+NP+LVAWS+SVA
Sbjct: 21 SRPRRVLEELSWDDSFVRELPGDPRSDAIPREVLHACYTKVSPSAPVDNPKLVAWSQSVA 80
Query: 156 DSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILN 215
D L+LD KEFERPDFP FSGA PL G+ PYAQCYGGHQFG WAGQLGDGRAITLGE++N
Sbjct: 81 DILDLDHKEFERPDFPQLFSGANPLVGSSPYAQCYGGHQFGSWAGQLGDGRAITLGEVIN 140
Query: 216 LKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKF 275
+ ERWELQLKG GKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRALCLV TGK
Sbjct: 141 SRGERWELQLKGCGKTPYSRFADGLAVLRSSIREFLCSEAMHGLGIPTTRALCLVETGKS 200
Query: 276 VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRH 335
V RD+FYDGN KEEPGAIVCRVA SFLRFGSYQIHA+R +EDL+IVR LADY IRHH+ H
Sbjct: 201 VVRDLFYDGNSKEEPGAIVCRVAPSFLRFGSYQIHATRDKEDLEIVRHLADYTIRHHYAH 260
Query: 336 IENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTD 395
+EN+ KSE LSF D +DLTSNKYAAWAVEVAERTA L+A+WQGVGFTHGVLNTD
Sbjct: 261 LENIKKSEGLSFEAAIGDSPAIDLTSNKYAAWAVEVAERTAFLIARWQGVGFTHGVLNTD 320
Query: 396 NMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAK 455
NMS+LGLTIDYGPFGFLDAFDPS+TPNTTDLPG+RYCFANQPD+GLWNIAQF++ L AA+
Sbjct: 321 NMSVLGLTIDYGPFGFLDAFDPSYTPNTTDLPGKRYCFANQPDVGLWNIAQFTSPLTAAE 380
Query: 456 LIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRA 515
LI EANYVMERYGTKFMDEYQ+IMT+KLGLPKYNKQ+I KLLNN+AVDKVDYTNFFR
Sbjct: 381 LISKDEANYVMERYGTKFMDEYQSIMTRKLGLPKYNKQLIGKLLNNLAVDKVDYTNFFRL 440
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
LSNVKAD +IPE ELLVPLKA LLDIG ERKEAWISWV +YI+EL+SSG+ DEERKA MN
Sbjct: 441 LSNVKADHNIPEKELLVPLKAALLDIGPERKEAWISWVQTYIEELVSSGVPDEERKAAMN 500
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
SVNPKYVLRNYLCQ+AIDAAE GD+ EVRRLLK+ME PYDEQPGMEKYARLPPAWAYRPG
Sbjct: 501 SVNPKYVLRNYLCQTAIDAAEQGDYDEVRRLLKVMEHPYDEQPGMEKYARLPPAWAYRPG 560
Query: 636 VCMLSCSS 643
VCMLSCSS
Sbjct: 561 VCMLSCSS 568
>gi|449502212|ref|XP_004161576.1| PREDICTED: UPF0061 protein AZOSEA38000-like [Cucumis sativus]
Length = 566
Score = 886 bits (2290), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/526 (81%), Positives = 468/526 (88%), Gaps = 2/526 (0%)
Query: 36 PAYFTKSPS-CPSIACHVSTTGGGGAAQMESSASVDSVTHDLKNQRLDTETETDGGDESK 94
PA FT PS P+ + H +A E SASVDSV LKNQ L+ + DGG
Sbjct: 42 PASFTSLPSPLPAHSRHGRRKLSMDSASPEVSASVDSVAEGLKNQSLNNDDRVDGGSSIN 101
Query: 95 MTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV 154
K K LEDLNWD+SFVRELPGDPRTD IPREVLHACY+KV PS EV++PQLVAWSESV
Sbjct: 102 HATK-KKLEDLNWDNSFVRELPGDPRTDIIPREVLHACYSKVLPSVEVQSPQLVAWSESV 160
Query: 155 ADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
AD L+LDP+EFERPDFPL FSGA+PL GA PYAQCYGGHQFGMWAGQLGDGRAITLGEIL
Sbjct: 161 ADLLDLDPQEFERPDFPLLFSGASPLVGASPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 220
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGK 274
N +SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRALCL+TTG
Sbjct: 221 NSRSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHSLGIPTTRALCLLTTGT 280
Query: 275 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFR 334
FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG++D IVR LADY IRHHF
Sbjct: 281 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGKDDFKIVRALADYVIRHHFP 340
Query: 335 HIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
H+ENM+ S+S+SFSTG+ D SVVDLTSNKYAAW VEVAERTASL+A WQGVGFTHGVLNT
Sbjct: 341 HLENMSSSQSVSFSTGNTDSSVVDLTSNKYAAWTVEVAERTASLIASWQGVGFTHGVLNT 400
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA 454
DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF++TL+AA
Sbjct: 401 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFASTLSAA 460
Query: 455 KLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFR 514
+LI+DKEANY MERYG KFMD+YQAIMTKK+GLPKYNKQ+ISKLLNNMAVDKVDYTNFFR
Sbjct: 461 ELINDKEANYAMERYGDKFMDDYQAIMTKKIGLPKYNKQLISKLLNNMAVDKVDYTNFFR 520
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL 560
+LSN+KADPSIPE+ELLVPLKAVLLDIGKERKEAW+SWV +Y++E+
Sbjct: 521 SLSNLKADPSIPEEELLVPLKAVLLDIGKERKEAWVSWVKTYMEEV 566
>gi|7630059|emb|CAB88267.1| putative protein [Arabidopsis thaliana]
Length = 554
Score = 882 bits (2278), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/579 (74%), Positives = 478/579 (82%), Gaps = 39/579 (6%)
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
+ +S DS+ DL+NQ L G + K K LED NWDHSFV+ELPGDPRTD I
Sbjct: 15 TDSSADSLAKDLQNQSL--------GAVDEGVKIKKKLEDFNWDHSFVKELPGDPRTDVI 66
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
REVLHACY+KVSPS EV++PQLVAWS SVA+ L+LDPKEFERPDFPL SGA PL GA+
Sbjct: 67 SREVLHACYSKVSPSVEVDDPQLVAWSVSVAELLDLDPKEFERPDFPLMLSGAKPLPGAM 126
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
YAQCYGGHQFGMWAGQLGDGRAITLGE+LN K ERWELQLKGAG+TPYSRFADGLAVLR
Sbjct: 127 SYAQCYGGHQFGMWAGQLGDGRAITLGEVLNSKGERWELQLKGAGRTPYSRFADGLAVLR 186
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSE MH LGIPTTRALCL+TT + NP AQSF F
Sbjct: 187 SSIREFLCSETMHCLGIPTTRALCLLTTVAIRRK------NP-----------AQSFAGF 229
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
S+ +A DYAI+HHF HIE+M++S+SLSF TGDED SVVDLTSNKY
Sbjct: 230 LSH-FYA-------------LDYAIKHHFPHIESMDRSDSLSFKTGDEDDSVVDLTSNKY 275
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW VE+AERTA+LVA+WQGVGFTHGVLNTDNMSILG TIDYGPFGFLDAFDPS+TPNTT
Sbjct: 276 AAWIVEIAERTATLVARWQGVGFTHGVLNTDNMSILGQTIDYGPFGFLDAFDPSYTPNTT 335
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
DLPGRRYCFANQPDIGLWNIAQFS TLA A+LI+ KEANY MERYG KFMDEYQAIM+KK
Sbjct: 336 DLPGRRYCFANQPDIGLWNIAQFSKTLAVAQLINQKEANYAMERYGDKFMDEYQAIMSKK 395
Query: 485 LGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
LGL KYNK++ISKLLNNM+VDKVDYTNFFR L+NVKA+P+ PE+ELL PLKAVLLDIGKE
Sbjct: 396 LGLTKYNKEVISKLLNNMSVDKVDYTNFFRLLANVKANPNTPENELLKPLKAVLLDIGKE 455
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
RKEAWI W+ SYIQE+ S +SDEERKA M+SVNPKY+LRNYLCQSAIDAAE GDF EV
Sbjct: 456 RKEAWIKWMRSYIQEVGGSEVSDEERKARMDSVNPKYILRNYLCQSAIDAAEQGDFSEVN 515
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L++LM+RPY+EQPGMEKYARLPPAWAYRPGVCMLSCSS
Sbjct: 516 NLIRLMKRPYEEQPGMEKYARLPPAWAYRPGVCMLSCSS 554
>gi|413953848|gb|AFW86497.1| hypothetical protein ZEAMMB73_905295 [Zea mays]
Length = 562
Score = 814 bits (2103), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/460 (82%), Positives = 419/460 (91%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
LE+L WDHSFVRELPGDPR+D+IPREVLHACY++VSPSA+V+NP+LVAWS+SVAD L+L
Sbjct: 88 VLEELPWDHSFVRELPGDPRSDTIPREVLHACYSRVSPSAKVDNPKLVAWSDSVADLLDL 147
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
D KEFERPDFP FFSGATPL G++PYAQCYGGHQFG+WAGQLGDGRAI LGE++N + ER
Sbjct: 148 DHKEFERPDFPQFFSGATPLVGSLPYAQCYGGHQFGVWAGQLGDGRAIALGEVVNSRGER 207
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
WELQLKG GKTPYSRFADGLAVLRSSIREFLCSEAMH LGIPTTRALCLV TGK V RDM
Sbjct: 208 WELQLKGCGKTPYSRFADGLAVLRSSIREFLCSEAMHGLGIPTTRALCLVETGKSVVRDM 267
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMN 340
FYDGN KEEPGAIVCRVA SFLRFGSYQIHASRG+ED++IVR LADY I HHF H+ENM
Sbjct: 268 FYDGNAKEEPGAIVCRVAPSFLRFGSYQIHASRGKEDIEIVRRLADYTIHHHFPHLENMK 327
Query: 341 KSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSIL 400
KSE LSF T D +DLTSNKYAAWAVEVAERTA L+A+WQGVGFTHGVLNTDNMS+L
Sbjct: 328 KSEGLSFETAIGDSPTIDLTSNKYAAWAVEVAERTAYLIARWQGVGFTHGVLNTDNMSVL 387
Query: 401 GLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK 460
GLTIDYGPFGFLDAFDPS+TPNTTDLPG+RYCFANQPD+GLWNIAQF+ L++A+LI
Sbjct: 388 GLTIDYGPFGFLDAFDPSYTPNTTDLPGKRYCFANQPDVGLWNIAQFTGPLSSAELISQD 447
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
EANYVMERYGTKFMDEYQ+IMTKKLGL KYNKQ+ISKLL+NMAVDKVDYTNFFR LSNV
Sbjct: 448 EANYVMERYGTKFMDEYQSIMTKKLGLTKYNKQLISKLLSNMAVDKVDYTNFFRLLSNVN 507
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL 560
ADP IPE+ELLVPLKA LLDIGKERKEAWISWV +YI+E+
Sbjct: 508 ADPGIPENELLVPLKAALLDIGKERKEAWISWVQTYIEEV 547
>gi|357445153|ref|XP_003592854.1| hypothetical protein MTR_1g116880 [Medicago truncatula]
gi|355481902|gb|AES63105.1| hypothetical protein MTR_1g116880 [Medicago truncatula]
Length = 792
Score = 810 bits (2093), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/469 (82%), Positives = 418/469 (89%), Gaps = 14/469 (2%)
Query: 65 SSASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSI 124
S+ +DSVT + KNQ L + KK + LEDLNWD+SFVR+LP DPRTD
Sbjct: 53 SAPLLDSVTQEFKNQSL-------------IQKKKRELEDLNWDNSFVRDLPSDPRTDPF 99
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
PREVLHACYTKVSPS V++PQLV WSESVA+ L+LD EF+RPDFPLFFSGA+P GA
Sbjct: 100 PREVLHACYTKVSPSVSVDDPQLVVWSESVAELLDLDNNEFQRPDFPLFFSGASPFVGAF 159
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
PYAQCYGGHQFGMWAGQLGDGRAITLGEILN S+RWELQLKGAGKTPYSRFADGLAVLR
Sbjct: 160 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNSNSQRWELQLKGAGKTPYSRFADGLAVLR 219
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+REFLCSEAMH LGIPTTRAL LVTTGK VTRDMFYDGNPKEE GAIVCRVAQSFLRF
Sbjct: 220 SSVREFLCSEAMHHLGIPTTRALSLVTTGKLVTRDMFYDGNPKEEQGAIVCRVAQSFLRF 279
Query: 305 GSYQIHASRG-QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNK 363
GSYQ+HASRG EDL+IVR LADYAI+HHF HIENM+KSESLSFSTGDEDHSVVDLTSNK
Sbjct: 280 GSYQLHASRGSNEDLEIVRVLADYAIKHHFPHIENMSKSESLSFSTGDEDHSVVDLTSNK 339
Query: 364 YAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNT 423
YAAWAVE+AERTAS++A+WQGVGFTHGV+NTDNMSILGLTIDYGPFGFLDAFDP FTPNT
Sbjct: 340 YAAWAVEIAERTASMIARWQGVGFTHGVMNTDNMSILGLTIDYGPFGFLDAFDPKFTPNT 399
Query: 424 TDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTK 483
TDLPGRRYCFANQPDIGLWN+AQF+TTL+AA LI+DKEANY +ERYGTKFMD+YQ IMTK
Sbjct: 400 TDLPGRRYCFANQPDIGLWNLAQFTTTLSAAHLINDKEANYALERYGTKFMDDYQDIMTK 459
Query: 484 KLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLV 532
KLGLPKYNKQ+I KLL NMAVDKVDYTNFFR LSN+KAD SIP+DELLV
Sbjct: 460 KLGLPKYNKQLIGKLLTNMAVDKVDYTNFFRTLSNIKADTSIPDDELLV 508
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 241/313 (76%), Positives = 276/313 (88%), Gaps = 7/313 (2%)
Query: 331 HHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHG 390
+ FR + N+ S+ +D +V + ++ WAVE+AERTAS++A+WQGVGFTHG
Sbjct: 487 NFFRTLSNIKADTSIP-----DDELLVSVVNS--GPWAVEIAERTASMIARWQGVGFTHG 539
Query: 391 VLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTT 450
V+NTDNMSILGLTIDYGPFGFLDAFDP FTPNTTDLPGRRYCFANQPDIGLWN+AQF+TT
Sbjct: 540 VMNTDNMSILGLTIDYGPFGFLDAFDPKFTPNTTDLPGRRYCFANQPDIGLWNLAQFTTT 599
Query: 451 LAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYT 510
L+AA LI+DKEANY +ERYGTKFMD+YQ IMTKKLGLPKYNKQ+I KLL NMAVDKVDYT
Sbjct: 600 LSAAHLINDKEANYALERYGTKFMDDYQDIMTKKLGLPKYNKQLIGKLLTNMAVDKVDYT 659
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEER 570
NFFR LSN+KAD SIP+DELLVPLK+VLLDIG+ERKEAW SW+ +YI EL +SGISD++R
Sbjct: 660 NFFRTLSNIKADTSIPDDELLVPLKSVLLDIGQERKEAWTSWLKTYIHELSTSGISDDQR 719
Query: 571 KALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAW 630
K MN VNPKY+LRNYLCQ+AIDAAE+GDFGEVRRLLKL+E P+DEQPGMEKYARLPPAW
Sbjct: 720 KTSMNMVNPKYILRNYLCQTAIDAAEIGDFGEVRRLLKLVEHPFDEQPGMEKYARLPPAW 779
Query: 631 AYRPGVCMLSCSS 643
AYRPGVCMLSCSS
Sbjct: 780 AYRPGVCMLSCSS 792
>gi|168047679|ref|XP_001776297.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672392|gb|EDQ58930.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 702
Score = 809 bits (2089), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/615 (63%), Positives = 468/615 (76%), Gaps = 27/615 (4%)
Query: 53 STTGGGGAAQMESSAS------VDSVTHDLKNQRLDTETETDGGDESKMTKK-------- 98
S G GAA + S ++T ++KN LD + +G K+ K
Sbjct: 91 SRRGKAGAALLRDFGSSRGRVLTAAMTDNMKNLNLDDDKSVNGDVAEKVDKSEEIGASGS 150
Query: 99 --LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVAD 156
K LEDL WDHSFVRELPGD R+D R+VLHACY+KV+PS V+NP+LV+WS VAD
Sbjct: 151 LGRKKLEDLIWDHSFVRELPGDKRSDGPTRQVLHACYSKVTPSVRVKNPELVSWSRHVAD 210
Query: 157 SLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNL 216
L+LD KEFERPDFPL F+GA+ L G + YAQCYGGHQFG+WAGQLGDGRAITLGEILN
Sbjct: 211 LLDLDYKEFERPDFPLLFTGASQLKGGLAYAQCYGGHQFGVWAGQLGDGRAITLGEILNS 270
Query: 217 KSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFV 276
K +RWELQLKGAGKTPYSR ADGLAVLRSS+RE+LCSEAM+ LG+PTTRAL LVTTG+ V
Sbjct: 271 KGQRWELQLKGAGKTPYSRTADGLAVLRSSVREYLCSEAMYHLGVPTTRALSLVTTGEGV 330
Query: 277 TRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHI 336
RDMFYDGN K EPGA+VCRV+ SF+RFGS+QIHA+R + DL IV+ LADY I HH+
Sbjct: 331 LRDMFYDGNVKMEPGAVVCRVSPSFIRFGSFQIHAARDKADLPIVKQLADYTIHHHYPDF 390
Query: 337 ENM-------NKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTH 389
E++ + SES G+ + +D + NKY+AW E+AERTA ++A+WQ VGFTH
Sbjct: 391 EDLPFERQGQDGSES---QKGENNAPQIDTSKNKYSAWFTEIAERTALMIAKWQAVGFTH 447
Query: 390 GVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFST 449
GV+NTDNMSILGLTIDYGPFGFLDAFDP +TPNTTDLPGRRY FANQPDIGLWN+ Q +
Sbjct: 448 GVMNTDNMSILGLTIDYGPFGFLDAFDPKYTPNTTDLPGRRYGFANQPDIGLWNVMQLAN 507
Query: 450 TLAAAKLIDDKEANYV-MERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVD 508
TL A+LI EA YV ++ Y KFM YQ M+ K+GL YNK ++SKLLNNMA DKVD
Sbjct: 508 TLYTAELITADEAQYVTIQIYADKFMFLYQQHMSNKIGLKTYNKDLLSKLLNNMAFDKVD 567
Query: 509 YTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDE 568
YTNFFR+ SN+KA P +D+L+ PLK LLD+ KER++ W+ W+ Y++ ++ G+S+
Sbjct: 568 YTNFFRSFSNLKATPETSDDDLIAPLKNALLDLSKERRKVWLDWLHQYVKNVVDEGVSEA 627
Query: 569 ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPP 628
+RKALMNSVNP+YVLRNY+ QSAID AE GDF EV LLKL+ERPYD+QPGMEKYARLPP
Sbjct: 628 DRKALMNSVNPRYVLRNYMLQSAIDMAEQGDFSEVENLLKLIERPYDDQPGMEKYARLPP 687
Query: 629 AWAYRPGVCMLSCSS 643
AWAYRPGVCMLSCSS
Sbjct: 688 AWAYRPGVCMLSCSS 702
>gi|302804871|ref|XP_002984187.1| hypothetical protein SELMODRAFT_180861 [Selaginella moellendorffii]
gi|300148036|gb|EFJ14697.1| hypothetical protein SELMODRAFT_180861 [Selaginella moellendorffii]
Length = 576
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/560 (68%), Positives = 443/560 (79%), Gaps = 14/560 (2%)
Query: 87 TDGGDESKMTKKLK--ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVEN 144
+DG D TK K LE+L WDHSFVRELP D + + R+V+ ACY++VSPSA+V++
Sbjct: 28 SDGEDRGVTTKNKKKNTLEELRWDHSFVRELPSDGTSPNFVRQVMKACYSRVSPSAKVKD 87
Query: 145 PQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGD 204
P+LVAWS+SVA+ LELDP EF+R DFPL FSG L G+ YAQCYGGHQFG+WAGQLGD
Sbjct: 88 PKLVAWSDSVAELLELDPAEFKREDFPLIFSGGKELQGSECYAQCYGGHQFGVWAGQLGD 147
Query: 205 GRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTT 264
GRAITLGE LN K+ERWELQLKGAGKTPYSR ADGLAVLRSS+REFLCSEAMH LGIPTT
Sbjct: 148 GRAITLGEALNSKNERWELQLKGAGKTPYSRMADGLAVLRSSVREFLCSEAMHHLGIPTT 207
Query: 265 RALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTL 324
RALCLVTTG V RDMFYDGN K EPGA+VCRVA SFLRFGSYQIHA+R ED +VR L
Sbjct: 208 RALCLVTTGDDVLRDMFYDGNAKMEPGAVVCRVAPSFLRFGSYQIHAAR--EDSKLVRLL 265
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
ADY +++HF ++ E L ++D + + NKYAAW V+VAE T+ LVA WQ
Sbjct: 266 ADYTLKYHF---PDLPDEEELEIKINEQDGQI---SKNKYAAWFVKVAESTSCLVAMWQA 319
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGFTHGVLNTDNMS+LGLTIDYGPFGFLDAFDP +TPNTTDLPGRRYCFANQPDIGLWNI
Sbjct: 320 VGFTHGVLNTDNMSVLGLTIDYGPFGFLDAFDPKYTPNTTDLPGRRYCFANQPDIGLWNI 379
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAV 504
QF TL AA L+ +E Y + RY FM YQ MTKKLGL +YNK + SKLL+N+A
Sbjct: 380 LQFGNTLMAAGLLTQEELQYGLNRYADTFMVHYQQNMTKKLGLKEYNKDLTSKLLSNLAF 439
Query: 505 DKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG 564
DKVDYTNFFRAL++V I ED LVPLK+VL DI KERK+ W+ W+ Y ++ + G
Sbjct: 440 DKVDYTNFFRALASVNLTEPITED-TLVPLKSVLPDISKERKKTWMDWLSLYREK--AEG 496
Query: 565 ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGME-KY 623
ISDE RKA MN VNPKYVLRNYLCQSAIDAAE GDF EVR+LL++M+RP+DEQP +E KY
Sbjct: 497 ISDESRKAAMNKVNPKYVLRNYLCQSAIDAAEAGDFSEVRQLLEVMKRPFDEQPEVEKKY 556
Query: 624 ARLPPAWAYRPGVCMLSCSS 643
ARLPP WAYRPGVCMLSCSS
Sbjct: 557 ARLPPTWAYRPGVCMLSCSS 576
>gi|302780998|ref|XP_002972273.1| hypothetical protein SELMODRAFT_148418 [Selaginella moellendorffii]
gi|300159740|gb|EFJ26359.1| hypothetical protein SELMODRAFT_148418 [Selaginella moellendorffii]
Length = 505
Score = 744 bits (1920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/516 (70%), Positives = 416/516 (80%), Gaps = 12/516 (2%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
+ ACY++VSPSA+V++P+LVAWS+SVA+ LELDP EF+R DFPL FSG L G+ YAQ
Sbjct: 1 MKACYSRVSPSAKVKDPKLVAWSDSVAELLELDPAEFKREDFPLIFSGGKELQGSECYAQ 60
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
CYGGHQFG+WAGQLGDGRAITLGE LN K+ERWELQLKGAGKTPYSR ADGLAVLRSS+R
Sbjct: 61 CYGGHQFGVWAGQLGDGRAITLGEALNSKNERWELQLKGAGKTPYSRMADGLAVLRSSVR 120
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EFLCSEAMH LGIPTTRALCLVTTG V RDMFYDGN K EPGA+VCRVA SFLRFGSYQ
Sbjct: 121 EFLCSEAMHHLGIPTTRALCLVTTGDDVLRDMFYDGNAKMEPGAVVCRVAPSFLRFGSYQ 180
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
IHA+R +D +VR LADY +++HF ++ E L ++D + + NKYAAW
Sbjct: 181 IHAAR--DDSKLVRLLADYTLKYHF---PDLPDEEELEIKINEQDGQI---SKNKYAAWF 232
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
V+VAE T+ LVA WQ VGFTHGVLNTDNMS+LGLTIDYGPFGFLDAFDP +TPNTTDLPG
Sbjct: 233 VKVAESTSCLVAMWQAVGFTHGVLNTDNMSVLGLTIDYGPFGFLDAFDPKYTPNTTDLPG 292
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RRYCFANQPDIGLWNI QF TL AA L+ +E Y + RY FM YQ MTKKLGL
Sbjct: 293 RRYCFANQPDIGLWNILQFGNTLMAAGLLTQEELQYGLNRYADTFMVHYQQNMTKKLGLK 352
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+YNK + SKLL+N+A DKVDYTNFFRAL++V I ED LVPLK+VL DI KERK+
Sbjct: 353 EYNKDLTSKLLSNLAFDKVDYTNFFRALASVNLTEPITEDT-LVPLKSVLPDISKERKKT 411
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
W+ W+ Y ++ + GISDE RKA MN VNPKYVLRNYLCQSAIDAAE GDF EVR+LL+
Sbjct: 412 WMDWLSLYREK--AEGISDESRKAAMNKVNPKYVLRNYLCQSAIDAAEAGDFSEVRQLLE 469
Query: 609 LMERPYDEQPGME-KYARLPPAWAYRPGVCMLSCSS 643
+M+RP+DEQP +E KYARLPP WAYRPGVCMLSCSS
Sbjct: 470 VMKRPFDEQPEVEKKYARLPPTWAYRPGVCMLSCSS 505
>gi|149175611|ref|ZP_01854231.1| hypothetical protein PM8797T_16308 [Planctomyces maris DSM 8797]
gi|148845596|gb|EDL59939.1| hypothetical protein PM8797T_16308 [Planctomyces maris DSM 8797]
Length = 537
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 283/558 (50%), Positives = 354/558 (63%), Gaps = 36/558 (6%)
Query: 97 KKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVAD 156
+ +K L DL +D+ F RE+P DP T++ R+V ACY++V+P+ V PQLV++S+ VAD
Sbjct: 5 QTIKNLHDLEFDNQFTREMPADPETENFRRQVSQACYSRVTPT-RVSQPQLVSYSKEVAD 63
Query: 157 SLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNL 216
L+L E +F F+G L G P+A CYGGHQFG WAGQLGDGRAI LGE+ N
Sbjct: 64 LLDLSTAAVESDEFAEVFAGNQVLEGMDPFAMCYGGHQFGNWAGQLGDGRAINLGEVRNQ 123
Query: 217 KSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFV 276
K E W LQLKGAG TPYSR ADGLAVLRSS+REFLCSEAM+ LG+PTTRAL LV TG+ V
Sbjct: 124 KGEHWTLQLKGAGPTPYSRTADGLAVLRSSVREFLCSEAMYHLGVPTTRALSLVLTGEQV 183
Query: 277 TRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHI 336
RDMFYDGNP+ EPGA+VCRVA SFLRFG+YQI ASRG+ ++ ++ L DY IR F +
Sbjct: 184 LRDMFYDGNPEHEPGAVVCRVAPSFLRFGNYQIFASRGE--IEPLQKLVDYTIRTDFPEL 241
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
G+ V Y W EV RTA ++ W VGF HGV+NTDN
Sbjct: 242 -------------GEPSREV-------YLRWFEEVCRRTADMIIHWMRVGFVHGVMNTDN 281
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILGLTIDYGP+G+L+ +DP++TPNTTD GRRY F NQP I LWN+ Q + A L
Sbjct: 282 MSILGLTIDYGPYGWLEDYDPNWTPNTTDAAGRRYRFGNQPQIALWNLVQLAN--AIFPL 339
Query: 457 IDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDYTN 511
I+D E ++ Y F +Q +M +KLG + +I +L + + + D T
Sbjct: 340 IEDAEPLQQSLDEYVDGFEQGFQQMMAEKLGFSSLQRDTDLPLIEELQQVLQLVETDMTI 399
Query: 512 FFRALSNVKAD--PSIPEDELLVPLKAVLLDIGK---ERKEAWISWVLSYIQELLSSGIS 566
FFR L+ +KA+ PS ELL PL + K + + + W+ Y++ L S
Sbjct: 400 FFRRLALLKAESQPSSDAAELLAPLMDAYYEPDKVTGDVRAKIVEWLERYLKRLREEQSS 459
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARL 626
D R+ MN VNPKYVLRNYL Q AID A GDF V LL+L+ RPYDEQP E+YA
Sbjct: 460 DTVRRERMNRVNPKYVLRNYLAQLAIDKAAEGDFSLVNELLELLRRPYDEQPEQEEYAGR 519
Query: 627 PPAWAY-RPGVCMLSCSS 643
P WA RPG MLSCSS
Sbjct: 520 RPEWARNRPGCSMLSCSS 537
>gi|384252239|gb|EIE25715.1| UPF0061-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 541
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 279/555 (50%), Positives = 360/555 (64%), Gaps = 28/555 (5%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
++++ + +F RELPGDP T + R+V A Y+ V+P+ P V +S VA + LD
Sbjct: 2 VQNIKLESTFTRELPGDPETKNQRRQVHDAFYSFVAPTPTNSEPMTVLYSGDVARLIGLD 61
Query: 162 PKEFERPDFPLFFSGATPLA-GAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
P E ER +F FSG PL G P+AQCYGGHQFGMWAGQLGDGRAI+LGE + +
Sbjct: 62 PAECERQEFAAIFSGNAPLPNGPRPWAQCYGGHQFGMWAGQLGDGRAISLGEAVGPDGKT 121
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
+ELQLKGAG TPYSR ADG AVLRSS+REF+ SEAM+ LGIPTTRAL LV TG V RDM
Sbjct: 122 YELQLKGAGATPYSRMADGRAVLRSSLREFVASEAMYALGIPTTRALSLVGTGAKVLRDM 181
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMN 340
FY+G+ K EPGA+VCRV+ SF+RFG++Q+ A RG + L ++ LADY IRHH+ H+E
Sbjct: 182 FYNGDAKFEPGAVVCRVSPSFVRFGTFQLPAMRGGDQLPLIAPLADYIIRHHYPHLEGAG 241
Query: 341 KSES--------LSFS-TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGV 391
S + LS S G ED +Y A+ EV RTA+L+A WQ VGF HGV
Sbjct: 242 FSRNGYSDRMKLLSLSGAGRED---------RYVAFLGEVVSRTANLLASWQSVGFVHGV 292
Query: 392 LNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL 451
NTDN SILG TIDYGP+GFL+ FDP+FTPNTTDL GRRY + QP IG WN AQ +
Sbjct: 293 GNTDNFSILGETIDYGPYGFLERFDPNFTPNTTDLDGRRYTYRAQPGIGHWNCAQLANAF 352
Query: 452 AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTN 511
A L+D ++A +++ Y M+ Y M +K+GL KY++++ L+ M DK D+TN
Sbjct: 353 MTAGLLDLEKAQPIVDSYADIMMEAYTGRMARKMGLTKYDRELAVGLVTLMYEDKADFTN 412
Query: 512 FFRALSNVK---ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDE 568
FRAL++V A SIP PL+ L D+ +ER+ AW W+ + G +
Sbjct: 413 TFRALASVSDGDAPGSIP-----APLEEALEDLSEERRSAWGKWLDGLRAAHRAEGRPEA 467
Query: 569 ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPP 628
R+A + VNP YV RN L Q AI AE GD+ E++ L+K++ERPY+EQPG E++ PP
Sbjct: 468 ARRADQDDVNPCYVPRNQLMQIAIARAEAGDYDELKALMKVLERPYEEQPGAERFKVTPP 527
Query: 629 AWAYRPGVCMLSCSS 643
R GV +LSCSS
Sbjct: 528 K-EIRMGVELLSCSS 541
>gi|254492380|ref|ZP_05105552.1| Uncharacterized ACR, YdiU/UPF0061 family [Methylophaga thiooxidans
DMS010]
gi|224462272|gb|EEF78549.1| Uncharacterized ACR, YdiU/UPF0061 family [Methylophaga thiooxydans
DMS010]
Length = 540
Score = 516 bits (1330), Expect = e-143, Method: Compositional matrix adjust.
Identities = 273/550 (49%), Positives = 352/550 (64%), Gaps = 36/550 (6%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
D ++D+ FVRELP DP TD+ R+VL AC++ V P +V PQLVA+S +A L+LD
Sbjct: 17 DFHFDNKFVRELPADPETDNHRRQVLGACFSYVKPR-QVSAPQLVAFSAEMATELDLDES 75
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ F F+G L G P+AQCYGGHQFG WAGQLGDGRAI LGE++N + +R+ L
Sbjct: 76 ICQSEQFAQVFAGNLLLDGMAPHAQCYGGHQFGNWAGQLGDGRAINLGEVINQQGKRFCL 135
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG+TPYSR ADGLAVLRSS+REFLCSEAM+ LGIPTTRAL +VTTG+ V RDMFYD
Sbjct: 136 QLKGAGETPYSRTADGLAVLRSSVREFLCSEAMYHLGIPTTRALSIVTTGENVMRDMFYD 195
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G P+ EPGA+VCRVA SFLR GS++I SRG D+D + L +Y I F H+ +K
Sbjct: 196 GRPEAEPGAVVCRVAPSFLRLGSFEIFTSRG--DIDTLTQLVNYTIETDFPHLGAPSKE- 252
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
Y AW E+ ERTA++V W VGF HGV NTDN S+LGLT
Sbjct: 253 -------------------TYLAWFREICERTATMVTDWMRVGFVHGVFNTDNTSVLGLT 293
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA- 462
IDYGP+G++D +DP++TPNTTD G+RY F QP I WN+ Q + A LIDD EA
Sbjct: 294 IDYGPYGWIDDYDPNWTPNTTDAVGKRYRFGAQPQIAQWNLLQMAN--AIYPLIDDAEAL 351
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNV 519
++ Y T + D++Q + KLGL ++ ++++ +L M + + D T F+R L+N+
Sbjct: 352 RNILNDYVTVYTDKWQQMRADKLGLAEFKADDEELHQQLNKVMQLSETDMTIFYRLLANI 411
Query: 520 KADPSIPEDE--LLVPLKAVLL---DIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
K I +D+ LL PL + + K+ +W+ SY+ + G+ D RK M
Sbjct: 412 KV-TDIDQDDGTLLQPLLPAFYAPESLSQSDKQDIAAWIRSYLTRVKEDGVDDRSRKTKM 470
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-R 633
N VNPKY+LRNYL Q AID +E GD V LL +M PYDEQP E+YA P WA +
Sbjct: 471 NRVNPKYILRNYLSQLAIDKSEQGDHSLVNELLDVMRHPYDEQPEYEQYAAKRPDWARNK 530
Query: 634 PGVCMLSCSS 643
PG MLSCSS
Sbjct: 531 PGCSMLSCSS 540
>gi|344943913|ref|ZP_08783199.1| UPF0061 protein ydiU [Methylobacter tundripaludum SV96]
gi|344259571|gb|EGW19844.1| UPF0061 protein ydiU [Methylobacter tundripaludum SV96]
Length = 538
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 278/555 (50%), Positives = 354/555 (63%), Gaps = 34/555 (6%)
Query: 98 KLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADS 157
K L+DL +D+ F+RELP DP T + R+V ACY++V P+ +V NP+LVA+S VA+
Sbjct: 9 KTSGLDDLIFDNRFIRELPADPETVNNRRQVFSACYSRVLPT-KVANPRLVAYSREVAEL 67
Query: 158 LELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLK 217
L+L + + DF F G + L G YA CYGGHQFG WAGQLGDGRAI LGEI+N K
Sbjct: 68 LDLTEEVCKSADFTQVFVGNSLLTGMDSYAICYGGHQFGNWAGQLGDGRAINLGEIINRK 127
Query: 218 SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
ER+ LQLKGAG TPYSR ADGLAVLRSS+REFLCSEAM+ LG+PTTRAL L+ TG+ V
Sbjct: 128 GERFTLQLKGAGSTPYSRNADGLAVLRSSVREFLCSEAMYHLGVPTTRALSLILTGEEVI 187
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDMFY G+PK EPGA+VCRVA SF RFGS+QI +RG+ +D++R L DY I F H+
Sbjct: 188 RDMFYSGDPKPEPGAVVCRVAPSFTRFGSFQIFTARGE--IDLLRKLVDYTIVTDFPHL- 244
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
G+ V Y W EV RTA ++ WQ VGF HGV+NTDNM
Sbjct: 245 ------------GEPSLDV-------YLQWFEEVCRRTAEMIVHWQRVGFVHGVMNTDNM 285
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLTIDYGP+G+L+ +DP++TPNTTD RRY F NQP I WN+ Q + A LI
Sbjct: 286 SILGLTIDYGPYGWLENYDPNWTPNTTDAADRRYRFGNQPQIAFWNLGQLAN--AIYPLI 343
Query: 458 DDKE-ANYVMERYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDYTNF 512
+ E + Y F +Q ++ KLGL P + ++ ++LL + + D T F
Sbjct: 344 EQVEPLQQAINAYKDTFERGWQTMVAGKLGLNAYDPSIDNELNTELLILLQSVETDMTIF 403
Query: 513 FRALSNVKADPSIPEDELLVPLKA---VLLDIGKERKEAWISWVLSYIQELLSSGISDEE 569
+R L+ + D + ++ L+ PL V + E K +W+ YI+ + +SGI+D E
Sbjct: 404 YRKLAILVMDVELGDEALMAPLMEAYYVPEQLTDEYKARLGNWLRLYIKRIQNSGIADAE 463
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
R MN+ NPKYVLRNYL Q AID AE GDF V LL+L+ PYDEQPG E++A P
Sbjct: 464 RIKTMNATNPKYVLRNYLAQLAIDKAEQGDFSMVNELLELLRHPYDEQPGKEEFALKRPD 523
Query: 630 WA-YRPGVCMLSCSS 643
WA R G MLSCSS
Sbjct: 524 WARQRAGCSMLSCSS 538
>gi|381153495|ref|ZP_09865364.1| hypothetical protein Metal_3699 [Methylomicrobium album BG8]
gi|380885467|gb|EIC31344.1| hypothetical protein Metal_3699 [Methylomicrobium album BG8]
Length = 537
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 285/567 (50%), Positives = 353/567 (62%), Gaps = 48/567 (8%)
Query: 94 KMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSES 153
++ +L +L+DL +D+ F+RELPGDP T + R+V ACY++V+P A+V PQ VA+S
Sbjct: 2 NLSPQLASLDDLVFDNRFIRELPGDPETANFRRQVADACYSRVNP-AKVAAPQWVAYSRE 60
Query: 154 VADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
VAD L+L + DF F+G G P+A CYGGHQFG WAGQLGDGRAI LGE+
Sbjct: 61 VADLLDLSRELCASEDFTQVFAGNRLARGMEPFAMCYGGHQFGFWAGQLGDGRAINLGEV 120
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
+N ERW LQLKGAG TPYSR ADGLAVLRSSIREFLCSEAMH LG+PTTRAL +V TG
Sbjct: 121 VNRHGERWVLQLKGAGPTPYSRNADGLAVLRSSIREFLCSEAMHHLGVPTTRALSVVLTG 180
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHF 333
+ V RDMFYDGNP+ EPGAIVCRV+ SF+RFG++QI A+RG+ +L +R DY IR F
Sbjct: 181 ERVIRDMFYDGNPRSEPGAIVCRVSPSFIRFGNFQILAARGETEL--LRRFVDYTIRVDF 238
Query: 334 RHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLN 393
H+ G+ +V YA W E+ +TA ++ WQ VGF HGV+N
Sbjct: 239 PHL-------------GEPSPAV-------YADWFQEICRKTAEMIVHWQRVGFVHGVMN 278
Query: 394 TDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAA 453
TDNMSILGLTIDYGP+G+LD +DP +TPNTTD RRY F QP I WN+ Q + L
Sbjct: 279 TDNMSILGLTIDYGPYGWLDNYDPHWTPNTTDAEQRRYRFGQQPQIAYWNLGQLANALFP 338
Query: 454 AKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDY 509
+ + M Y F E+Q +M KLG+ P ++ +I +LL + + D
Sbjct: 339 V-FGEAEPLQAGMSAYAETFDREWQRMMAGKLGIADDRPATDEDLIIELLVLLQKAETDM 397
Query: 510 TNFFRALSNVKADPSIP----------EDELLVP--LKAVLLDIGKERKEAWISWVLSYI 557
T FFR L+++ P ED P L A L ER++AW+ Y
Sbjct: 398 TLFFRRLASLDTGGDRPDWKTRIAARLEDCYYRPEQLSADYL----ERRDAWLG---RYH 450
Query: 558 QELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQ 617
+ L G+ D ER+ M +VNPKYVLRNYL Q AID AE GDF V LL L RPYDEQ
Sbjct: 451 ERLRQGGLPDVERRRRMYAVNPKYVLRNYLSQLAIDRAEQGDFSTVDELLDLCRRPYDEQ 510
Query: 618 PGMEKYARLPPAWAY-RPGVCMLSCSS 643
PG E YA P WA RPG MLSCSS
Sbjct: 511 PGKEHYAAKRPDWARSRPGCSMLSCSS 537
>gi|387128075|ref|YP_006296680.1| hypothetical protein Q7A_2225 [Methylophaga sp. JAM1]
gi|386275137|gb|AFI85035.1| hypothetical protein Q7A_2225 [Methylophaga sp. JAM1]
Length = 542
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 275/549 (50%), Positives = 350/549 (63%), Gaps = 35/549 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ FVRELP DP T+++ R+VL ACYT V+P+ V +P+LVA+S +A L + P +
Sbjct: 19 LQFDNRFVRELPADPDTENVRRQVLGACYTFVNPTP-VADPKLVAYSMDLATDLGIRPVD 77
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
E F F+G L G P+A CYGGHQFG WAGQLGDGRAI LGE+ ++ + LQ
Sbjct: 78 CESRQFANVFAGNEMLEGMQPHAMCYGGHQFGNWAGQLGDGRAINLGEVQDIHGQLQMLQ 137
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKG+G+TPYSR ADGLAVLRSS+REFLCSEAM LG+PTTRAL L+TTG+ V RDMFYDG
Sbjct: 138 LKGSGETPYSRSADGLAVLRSSVREFLCSEAMFHLGVPTTRALSLITTGEGVVRDMFYDG 197
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
P+ EPGAIVCRVA SFLR G+Y++ SRG D+D +R L DY IRHHF H+ +K
Sbjct: 198 RPQTEPGAIVCRVAPSFLRIGNYELFNSRG--DIDNLRLLIDYTIRHHFPHLGEPSKE-- 253
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
Y AW EV ERTA LV W VGF HGVLNTDN SILGLTI
Sbjct: 254 ------------------TYLAWFKEVCERTADLVVHWMRVGFVHGVLNTDNTSILGLTI 295
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-N 463
DYGP+G++D +DP +TPNTTD G+RY F +QP I WN+ Q A LI++ E
Sbjct: 296 DYGPYGWIDNYDPDWTPNTTDATGKRYRFGHQPQIAQWNLLQLGN--AIYPLINEVEPLQ 353
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSNV 519
++ Y + +++Q + KLGL +Y + ++ +L + + + D T F+R L++V
Sbjct: 354 QILTDYVELYTNKWQQMRADKLGLNEYQGDDDHELNQQLQKILLLAETDMTIFYRRLADV 413
Query: 520 KADPSIPEDE-LLVPLKAVLL---DIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
+ DE LL PL + KE K+ W+ Y Q + G SD++RKA MN
Sbjct: 414 SCEQKDLSDEALLEPLMEAYYAPDALSKEDKKDICDWLRQYQQRVQQDGTSDQDRKARMN 473
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
VNPKYVLRNYL Q AID A GD+ + LL++M RPYDEQP + YA P WA +P
Sbjct: 474 LVNPKYVLRNYLSQQAIDKAHEGDYSMIDELLEVMHRPYDEQPQYDHYAAKRPDWARDKP 533
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 534 GCSMLSCSS 542
>gi|335042435|ref|ZP_08535462.1| hypothetical protein MAMP_01925 [Methylophaga aminisulfidivorans
MP]
gi|333789049|gb|EGL54931.1| hypothetical protein MAMP_01925 [Methylophaga aminisulfidivorans
MP]
Length = 538
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 271/564 (48%), Positives = 355/564 (62%), Gaps = 41/564 (7%)
Query: 91 DESKMTKKLKALEDLNW--DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLV 148
+ES T L LNW D+ F++ LP D T + R+VL AC++ V+P + +P L+
Sbjct: 5 NESNTTNGL-----LNWQFDNQFIQRLPADAETGNFRRQVLGACFSYVTPR-KATSPTLM 58
Query: 149 AWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAI 208
A+S +++ L L+ ++ F F G L G P+AQCYGGHQFG WAGQLGDGRAI
Sbjct: 59 AYSAEMSEELGLNDEDCHSDLFKQVFVGNQQLEGMQPHAQCYGGHQFGNWAGQLGDGRAI 118
Query: 209 TLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALC 268
LGE++ +RW LQLKG+G+TPYSR ADGLAVLRSS+REFLCSEAM+ LG+PTTRAL
Sbjct: 119 NLGEVIGESGQRWSLQLKGSGETPYSRTADGLAVLRSSVREFLCSEAMYHLGVPTTRALS 178
Query: 269 LVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYA 328
L+TTG V RDMFYDG P+ EPGA+VCRVA SFLR GSY+I ++RG D + ++TL DY
Sbjct: 179 LITTGDDVIRDMFYDGRPQSEPGAVVCRVAPSFLRLGSYEIFSARG--DSETLKTLVDYT 236
Query: 329 IRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFT 388
I + H+ +K Y W E+ ERTA +V W VGF
Sbjct: 237 IDTFYPHLGAPSKQ--------------------SYLDWFREICERTADMVVDWMRVGFV 276
Query: 389 HGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFS 448
HGV NTDN S+LGLTIDYGP+G++D +DP++TPNTTD G+RY F QP I WN+ Q +
Sbjct: 277 HGVFNTDNTSVLGLTIDYGPYGWIDDYDPNWTPNTTDATGKRYRFGAQPQIAQWNLLQMA 336
Query: 449 TTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYN--KQIISKLLNN-MAV 504
A LIDD EA ++ Y T + D++Q + KLGL ++ + + + LN M +
Sbjct: 337 N--AIYPLIDDAEALRNILNDYVTVYTDKWQQMRADKLGLAEFKPADEALHQDLNRVMQL 394
Query: 505 DKVDYTNFFRALSNVKA-DPSIPEDELLVPLKAVLLD---IGKERKEAWISWVLSYIQEL 560
+ D T F+R L++V D + +DELL PL + + K+ +WV Y++ +
Sbjct: 395 TETDMTLFYRHLADVNVTDKNKTDDELLSPLMVAFYSPDALSQADKKDIANWVRDYLKRV 454
Query: 561 LSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGM 620
GISDE+RK MN+VNPKYVLRNYL Q AID AE GD + L+++M RPYDEQP
Sbjct: 455 EEEGISDEKRKTKMNAVNPKYVLRNYLSQLAIDKAEQGDPSLINELMEVMRRPYDEQPQY 514
Query: 621 EKYARLPPAWAY-RPGVCMLSCSS 643
E YA P WA +PG MLSCSS
Sbjct: 515 ESYAAKRPDWARNKPGCSMLSCSS 538
>gi|408419254|ref|YP_006760668.1| hypothetical protein TOL2_C18030 [Desulfobacula toluolica Tol2]
gi|405106467|emb|CCK79964.1| conserved uncharacterized protein, UPF0061 [Desulfobacula toluolica
Tol2]
Length = 535
Score = 500 bits (1287), Expect = e-138, Method: Compositional matrix adjust.
Identities = 271/558 (48%), Positives = 349/558 (62%), Gaps = 34/558 (6%)
Query: 95 MTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV 154
+ +K LE+L +D+ FVR LP DP TD+ R+V ACY++V+P V P LVA+S
Sbjct: 3 LERKANTLENLIFDNRFVRNLPCDPNTDNTRRQVTGACYSRVNPKPVVA-PGLVAFSSES 61
Query: 155 ADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
A ++L + + F F+G L G P+A CYGGHQFG WAGQLGDGRAI LGEI+
Sbjct: 62 AQLMDLTDEACQSELFTRVFTGNHLLPGMDPFAMCYGGHQFGNWAGQLGDGRAINLGEII 121
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGK 274
N ++ERW LQLKGAG TPYSR ADGLAVLRSSIREFLCSEAM LGIPTTRAL L TG+
Sbjct: 122 NQRNERWVLQLKGAGPTPYSRTADGLAVLRSSIREFLCSEAMFHLGIPTTRALSLTLTGE 181
Query: 275 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFR 334
V RDMFYDG+PK E GA+VCR+A SF+RFG++QI +RG+ L ++ L DY I F
Sbjct: 182 EVERDMFYDGHPKLEQGAVVCRMAPSFIRFGNFQILVARGENCL--LKRLVDYTIETDFP 239
Query: 335 HIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
H+ + + + Y W EV RT ++ W VGF HGV+NT
Sbjct: 240 HL--------------------ISTSQSVYERWFREVCMRTMDMIIHWMRVGFVHGVMNT 279
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA 454
DNMSILGLTIDYGP+G+L+ ++P +TPNTTDL GRRYCF NQP I LWN+AQ A
Sbjct: 280 DNMSILGLTIDYGPYGWLEDYNPGWTPNTTDLAGRRYCFGNQPQIALWNLAQLGN--AVF 337
Query: 455 KLIDDKEANYVMERYGTKFMDEYQ-AIMTKKLGL----PKYNKQIISKLLNNMAVDKVDY 509
++ E+ ++ + Q A+MT+KLG P + II +LL + + + DY
Sbjct: 338 PMVKRHESLQEALDEAQAYVQQGQLAMMTQKLGFQNFEPDMDIAIIKELLKILQLAETDY 397
Query: 510 TNFFRALSNVKADPSIPEDELLVPLKAVLLD---IGKERKEAWISWVLSYIQELLSSGIS 566
T F R LS++ + + + + L+ D I + +W+ Y + L + IS
Sbjct: 398 TIFLRGLSHLDTEDGLAKTLIPSFLENAFYDPDQITPDYIARLNAWLAVYQKRLGLNRIS 457
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARL 626
+ ++K M+ VNPKYVLRNYL Q+AID AE GDF VR LL +M +PYDEQPG E +A
Sbjct: 458 NADKKQQMDQVNPKYVLRNYLAQTAIDKAEDGDFSMVRELLDVMRKPYDEQPGREMFAAK 517
Query: 627 PPAWAY-RPGVCMLSCSS 643
P WA RPG MLSCSS
Sbjct: 518 RPEWARNRPGCSMLSCSS 535
>gi|159480380|ref|XP_001698262.1| hypothetical protein CHLREDRAFT_120727 [Chlamydomonas reinhardtii]
gi|158273760|gb|EDO99547.1| predicted protein [Chlamydomonas reinhardtii]
Length = 552
Score = 493 bits (1269), Expect = e-136, Method: Compositional matrix adjust.
Identities = 265/553 (47%), Positives = 340/553 (61%), Gaps = 14/553 (2%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
A + L W H+FV ELP DP T ++ R+V A +T V P+ P + +S VA L L
Sbjct: 4 APQSLPWAHTFVNELPADPNTTNVVRQVKGALFTPVQPTPPDGVPYTITYSAKVARLLGL 63
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS-- 218
DP E ERP+F L SGA PL GA P+A CYGGHQFG WAGQLGDGRAITLGE+ +
Sbjct: 64 DPTECERPEFALVMSGAAPLPGARPFAACYGGHQFGQWAGQLGDGRAITLGEVRRAGACG 123
Query: 219 ERWEL-QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
W+L + KG G T R ADG AVLRSS+REF+ SEAM LG+PTTRAL LV TG V
Sbjct: 124 GVWKLGKRKGKGPTHGVRRADGRAVLRSSLREFVASEAMAALGVPTTRALSLVGTGDKVL 183
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDMFY+GN K E GA+VCRVA SF+RFG++Q+ SRG ++ +V+ AD+ I+HH H+
Sbjct: 184 RDMFYNGNAKMEQGAVVCRVAPSFVRFGTFQLPVSRGAGEVGLVKMAADWVIKHHMPHLA 243
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + + G V+ + Y E RT LVAQWQ +GF HGVLNTDNM
Sbjct: 244 GEGEGTCVFRAAGPP----VNKSPEPYLGLLREACARTGRLVAQWQALGFVHGVLNTDNM 299
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLTIDYGP+GFLD FDP +TPN TD GRRY + NQP+ G +N+ L AA L+
Sbjct: 300 SILGLTIDYGPYGFLDVFDPDWTPNLTDASGRRYSYRNQPEAGQFNVVMLGNALLAADLL 359
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALS 517
+ A + Y Y +M KLGL +Y++ + +L+ M D D+TN FRALS
Sbjct: 360 GREAATEALVGYSEVLSTTYNQLMAAKLGLKEYDRTLAQELMKMMYTDDADFTNTFRALS 419
Query: 518 NVKADPSIPEDELLVPLK-AVLLDIGK----ERKEAWISWVLSYIQELLSSGISDEERKA 572
+ + + E +P + A L+ G+ ER AW W+ +Y + G D ER+A
Sbjct: 420 SEEGGGAAAEGPQQLPGRLAAALNRGQPLSEERAAAWRQWLQAYQARCVPDGTPDAERQA 479
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQP-GMEKY-ARLPPAW 630
PK++ R +L Q AI+AAE GD+ E+ L++++ERPYDEQP KY A PP
Sbjct: 480 AQRLACPKFIPRQHLLQWAIEAAEQGDYSELEALMEVLERPYDEQPEAPAKYSAPPPPDM 539
Query: 631 AYRPGVCMLSCSS 643
RPGVCMLSCSS
Sbjct: 540 EGRPGVCMLSCSS 552
>gi|237653304|ref|YP_002889618.1| hypothetical protein Tmz1t_2639 [Thauera sp. MZ1T]
gi|237624551|gb|ACR01241.1| protein of unknown function UPF0061 [Thauera sp. MZ1T]
Length = 524
Score = 493 bits (1269), Expect = e-136, Method: Compositional matrix adjust.
Identities = 270/551 (49%), Positives = 342/551 (62%), Gaps = 36/551 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L +D+ FVRELP DP ++ R V ACY++V P+ V P+L+AWS VA L L+
Sbjct: 1 MRALRFDNRFVRELPADPEAENHVRPVHGACYSRVMPTP-VRAPRLLAWSREVAHILGLE 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ +F F G L G PYA CYGGHQFG WAGQLGDGRAITLGE +N + ERW
Sbjct: 60 EADVRSAEFARVFGGNGLLPGMEPYAACYGGHQFGNWAGQLGDGRAITLGESINARGERW 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSRFADG AVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDM
Sbjct: 120 ELQLKGAGPTPYSRFADGRAVLRSSLREFLCSEAMHHLGVPTTRALSLVGTGETVVRDML 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDGNP+ EPGA+VCRVA SF+RFG+++I ASRG+E L + L D+ I F +
Sbjct: 180 YDGNPRPEPGAVVCRVAPSFIRFGNFEIFASRGEEAL--LERLIDFTIARDFPEL----- 232
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ D + + W EV RTA LVA W VGF HGV+NTDNMSILG
Sbjct: 233 -------AAEPD------AAARRIRWFDEVCRRTAVLVAHWMRVGFVHGVMNTDNMSILG 279
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D FDP +TPNTTD GRRY F NQP I WN+ Q + + + + +
Sbjct: 280 LTIDYGPYGWVDDFDPDWTPNTTDAGGRRYRFGNQPFIAHWNLWQLANAIYPV-VREVEP 338
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKY------NKQIISKLLNNMAVDKVDYTNFFRA 515
+ Y Y+ +M KLGL ++ + ++ +L +A +VD + FFR
Sbjct: 339 LERALAAYADVHDSSYRDMMRAKLGLAEWRGGEEGDDGLLERLHRLLAAGEVDMSLFFRR 398
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKAL 573
L++V DP+ P L PL D + + ++W+ ++ +L G R+A
Sbjct: 399 LADV--DPAAP---TLEPLAEAFYDPTRRAAVEAELLAWLRAHGARVLGDGRLAAARRAE 453
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY- 632
MN VNP YV RNYL Q AIDAAE GD E+ LL+++ RPYDEQPG E++A P WA
Sbjct: 454 MNRVNPLYVPRNYLAQQAIDAAEGGDMSELEALLEVLRRPYDEQPGRERFAARRPDWARD 513
Query: 633 RPGVCMLSCSS 643
RPG MLSCSS
Sbjct: 514 RPGCSMLSCSS 524
>gi|192361916|ref|YP_001983073.1| hypothetical protein CJA_2613 [Cellvibrio japonicus Ueda107]
gi|190688081|gb|ACE85759.1| conserved hypothetical protein [Cellvibrio japonicus Ueda107]
Length = 538
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 263/558 (47%), Positives = 356/558 (63%), Gaps = 35/558 (6%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
L++L L +D+ VRELP DP ++ R+V A Y++V+P+ V PQL+ ++ VAD L
Sbjct: 3 LRSLAHLRFDNRLVRELPADPVVENYRRQVTGAVYSRVTPTP-VSAPQLIMAAQDVADLL 61
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L +P+F F+G + L G P+A CYGGHQFG WAGQLGDGRAI LGE++N +
Sbjct: 62 DLGADILAQPEFTQVFAGNSLLPGMEPHACCYGGHQFGNWAGQLGDGRAINLGEVINQRG 121
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W LQLKGAG TPYSR ADGLAVLRSS+REFLCSEAMH LG+PTTRAL LVTTG+ V R
Sbjct: 122 EHWTLQLKGAGPTPYSRTADGLAVLRSSLREFLCSEAMHHLGVPTTRALSLVTTGELVRR 181
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
DMFYDGNP+ EPGAIVCRVA F RFG+++I ++RG D+D++R L D+ IR F +
Sbjct: 182 DMFYDGNPQWEPGAIVCRVAPGFTRFGNFEIFSARG--DIDLLRQLVDFTIRADFPALLE 239
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N + + Y W +V +RTA L+A W VGF HGV+NTDNMS
Sbjct: 240 GNTPD-----------------KHTYLRWYQDVCKRTAQLMAHWMRVGFVHGVMNTDNMS 282
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILGLTIDYGP+G+L+ +DP +TPNTTD GRRY + NQP + LWN+AQ + A LI+
Sbjct: 283 ILGLTIDYGPYGWLEGYDPDWTPNTTDAQGRRYRYGNQPRVALWNLAQLAN--AIYPLIN 340
Query: 459 DKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDYTNFF 513
+ E +E + ++ Q M KLGL ++ ++ ++ LL + ++D T F+
Sbjct: 341 EVEPLQAGLEYFRAQYEACSQQDMAAKLGLSQFRQETDQPLVESLLAVLQSTEMDMTIFY 400
Query: 514 RALSNVKA-DPSIPEDELLVP--LKAVLLDIGKERKEAWISWVLSYI----QELLSSGIS 566
R L+++ + D +E L+ L A + W++ Y Q+++ +G +
Sbjct: 401 RRLASIASVDLLDASNEYLLTHFLPACYQTPDATQVAMLRQWLMDYARRIQQDVVMNGWT 460
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARL 626
+ +R ALMN NPKYVLRNY+ Q AID A GD+ EV++LL L+ PYDEQP ++Y
Sbjct: 461 EVQRCALMNRTNPKYVLRNYMAQQAIDKATQGDYNEVQQLLTLLRNPYDEQPEFDRYFAK 520
Query: 627 PPAWA-YRPGVCMLSCSS 643
P WA ++ G MLSCSS
Sbjct: 521 RPEWARHKAGCSMLSCSS 538
>gi|449018261|dbj|BAM81663.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 671
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 298/653 (45%), Positives = 379/653 (58%), Gaps = 44/653 (6%)
Query: 11 PHLLFSSLSSSSSSLRP-----RLPKFPFYPAYFTKSPSCPSIACHVSTTGGGGAAQMES 65
PHL S + S ++ RP RLP+ + + S P A S TG G
Sbjct: 43 PHLGRSVFTPSRTTARPSEARERLPRSAL--PHLRSNYSLPETAMLGSGTGHG------- 93
Query: 66 SASVDSVTHDLKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIP 125
S D L T ++D ++L L++L F LP DP T +
Sbjct: 94 --SSDGKGAPLPATTTTTTHQSD--------ERLLTLDELVLSAGFASRLPADPETANYV 143
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADS-LELDPKEFERPDFPLFFSGATPLAGAV 184
R V A + V PS P L WS+ A + L+L+ + ER FSG L G+
Sbjct: 144 RVVRGAALSFVHPSPTWTEPVLAVWSDRCARACLDLEVRPSERDYAARVFSGLAMLPGSR 203
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
PYAQ YGGHQFG+WAGQLGDGR I LGE N E W LQLKGAGKTP++RFADG AVLR
Sbjct: 204 PYAQRYGGHQFGVWAGQLGDGRVIVLGEYQNRCGETWTLQLKGAGKTPFARFADGRAVLR 263
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+REFL SEA+H LGIPT+RAL LV TG V RDMFYDGNP+EEPGA+VCR+A S++RF
Sbjct: 264 SSVREFLASEALHALGIPTSRALSLVVTGDKVVRDMFYDGNPREEPGAVVCRLAPSWVRF 323
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNK- 363
G++++ + +L+++R LAD I HH+ + +S ++ D S + S
Sbjct: 324 GTFEL--ATDWNELELLRQLADDTIVHHYPALLAHERSHG-KRTSADSSRSARNEESQNP 380
Query: 364 --YAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTP 421
Y A ++VAERTA+LVA WQ VGF HGVLNTDNMSILG+TIDYGPFGFLDA+ P +TP
Sbjct: 381 MPYRALLLQVAERTAALVAGWQSVGFVHGVLNTDNMSILGITIDYGPFGFLDAYMPEYTP 440
Query: 422 NTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIM 481
NTTDLPGRRYC+A QP I LWN+ Q A L + V + Y TKF +E A +
Sbjct: 441 NTTDLPGRRYCYALQPTICLWNLLQL--VRAFEPLTGTNLSEEVSQTYETKFREEMSARL 498
Query: 482 TKKLGLPKYNKQ----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE-DELLVPLKA 536
KLG +N ++ L M D+ D+T +RALS ++ + + D L PL
Sbjct: 499 RAKLGFQTWNSDADNGLVRDLYELMRQDRADFTRTWRALSWLEPVACLQKSDASLEPLLR 558
Query: 537 VL---LDIGKERKEAWISWVLSYIQELLSSGISD-EERKALMNSVNPKYVLRNYLCQSAI 592
VL + +R EAW WV Y + L+ D R+ M + +PKY+LRNY+ Q AI
Sbjct: 559 VLPEPVRKNPDRLEAWRLWVQRYAERTLAEDNFDGTARRKQMQAASPKYILRNYMAQVAI 618
Query: 593 DAAE-LGDFGEVRRLLKLMERPYDEQPGMEK-YARLPPAWAYRPGVCMLSCSS 643
+ AE DF E+ RLLKL+E PY EQP ME Y R PP W+ R GVCM SCSS
Sbjct: 619 EKAENEQDFSEIERLLKLLEHPYAEQPEMEALYDREPPTWSQRLGVCMNSCSS 671
>gi|388258677|ref|ZP_10135852.1| hypothetical protein O59_003073 [Cellvibrio sp. BR]
gi|387937436|gb|EIK43992.1| hypothetical protein O59_003073 [Cellvibrio sp. BR]
Length = 525
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 266/545 (48%), Positives = 344/545 (63%), Gaps = 33/545 (6%)
Query: 112 VRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP 171
+ +LP DP T++ R+V+ A Y++V+P++ V NPQL+A + VA ++L F++ +F
Sbjct: 1 MHQLPADPETENFRRQVVGAIYSRVNPTS-VTNPQLLAGAAEVAALVDLPAAIFQQAEFA 59
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
F+G LAG P+A CYGGHQFG WAGQLGDGRAI LGE++N K E W LQLKGAG T
Sbjct: 60 QVFAGNQLLAGMEPHACCYGGHQFGNWAGQLGDGRAINLGEVINSKGEHWTLQLKGAGPT 119
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR ADGLAVLRSS+REFLCSEAM LG+PTTRAL LVTTG+ V RDMFYDGNP+ E G
Sbjct: 120 PYSRSADGLAVLRSSVREFLCSEAMFHLGVPTTRALSLVTTGEKVRRDMFYDGNPEFEQG 179
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
AIVCRVA SF RFG+++I ++RG D +++ LAD+ IR F H+ + +
Sbjct: 180 AIVCRVAPSFTRFGNFEILSARG--DNQLLKRLADFTIRTDFPHLLSAKNN--------- 228
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
D+ + Y W EV TA L+A W VGF HGV+NTDNMSILGLTIDYGP+G+
Sbjct: 229 ------DIGVDIYVQWFTEVCIATAQLIAHWMRVGFVHGVMNTDNMSILGLTIDYGPYGW 282
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYG 470
L+ +DP +TPNTTD GRRY F NQP I LWN+ Q + A LI+ E +E+Y
Sbjct: 283 LEGYDPDWTPNTTDAQGRRYRFGNQPRIALWNLTQLAN--AIYPLINAVEPLQIALEQYR 340
Query: 471 TKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKAD--PS 524
++ Q M KLGL P ++ + LL + ++D T F+R L+ D
Sbjct: 341 IEYERCAQRDMASKLGLYQFDPAQDETLTDNLLLALQSAEIDMTIFYRQLAQYSVDDIDQ 400
Query: 525 IPEDELLVPLK-AVLLDIGKERKEAWISWVLSYIQEL----LSSGISDEERKALMNSVNP 579
+ + + A + ++ K ISW+ +Y Q L + +SDE R+ALMN NP
Sbjct: 401 YSDQQWFDKVAFAYYQEPTRDAKSTMISWLRAYGQRLQQDAVLHNVSDEARRALMNRTNP 460
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCM 638
KYVLRNYL Q AID A LGD E+ RLL+L+ PY EQP E Y P WA ++ G M
Sbjct: 461 KYVLRNYLAQQAIDKATLGDASEIERLLQLLRNPYAEQPEFESYYAKRPEWARHKAGCSM 520
Query: 639 LSCSS 643
LSCSS
Sbjct: 521 LSCSS 525
>gi|320353978|ref|YP_004195317.1| hypothetical protein Despr_1878 [Desulfobulbus propionicus DSM
2032]
gi|320122480|gb|ADW18026.1| protein of unknown function UPF0061 [Desulfobulbus propionicus DSM
2032]
Length = 533
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 264/552 (47%), Positives = 344/552 (62%), Gaps = 37/552 (6%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
AL+ L +D+ F R LP DPR+D+ R+V ACY++V P +V P+LVA S A L+L
Sbjct: 10 ALDALTFDNRFTRALPADPRSDNSRRQVHQACYSRVRP-VQVREPRLVAVSREAAALLDL 68
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
+ F F+G + LAG P+A CYGGHQFG WA QLGDGRAI LGE++N + E
Sbjct: 69 TENDCRCERFLQVFAGNSLLAGMDPHALCYGGHQFGNWARQLGDGRAINLGEVVNRRGEH 128
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
W LQLKGAG TPYSR ADGLAVLRSS+REFLCSEAM LG+PTTRAL L+ TG+ V RDM
Sbjct: 129 WTLQLKGAGPTPYSRNADGLAVLRSSLREFLCSEAMFHLGVPTTRALSLILTGESVLRDM 188
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMN 340
FYDGNP EPGA++CR+A SFLRFG+Y++ A+RG+ L +R L D+ +R F H+
Sbjct: 189 FYDGNPALEPGAVICRLAPSFLRFGNYELLAARGETAL--LRQLVDFTLRTFFPHL---- 242
Query: 341 KSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSIL 400
GD + Y W E+ TA L+ W VGF HGV+NTDNMSIL
Sbjct: 243 ---------GDPGPAA-------YGRWFAEICRTTAELMVHWLRVGFVHGVMNTDNMSIL 286
Query: 401 GLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK 460
GLTIDYGP+G+L+ +DP++TPNTTD GRRYC+ QP I WN+AQ +T L + LI +
Sbjct: 287 GLTIDYGPYGWLEDYDPTWTPNTTDAMGRRYCYGRQPQIAHWNLAQLATAL--SPLIGET 344
Query: 461 EA-NYVMERYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
E + Y F +Q +M +KLGL P ++ ++ +LL + + D T FFR
Sbjct: 345 EPLEEALRDYAHHFEQGWQTMMARKLGLRAFEPHSDRPLVEELLRLLPEVETDMTLFFRR 404
Query: 516 LSNVKADPSIPEDELLVPLKAVLL---DIGKERKEAWISWVLSYIQELLSSGISDEERKA 572
L+ V PS ++ + PL+ + + ++ W+ Y Q L + D ER
Sbjct: 405 LAMV---PSGCAEDRVQPLRDAFYRPEQLTEPYRQRLHGWIERYRQRLQRDNLPDAERCR 461
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA- 631
MN+VNPKYVLRNYL Q AID G++ V +L+++ PYDEQPG E +A P WA
Sbjct: 462 RMNAVNPKYVLRNYLAQLAIDKIMEGEYSLVEEMLEVLRHPYDEQPGREWFAEKRPEWAR 521
Query: 632 YRPGVCMLSCSS 643
+RPG MLSCSS
Sbjct: 522 HRPGCSMLSCSS 533
>gi|119897865|ref|YP_933078.1| hypothetical protein azo1574 [Azoarcus sp. BH72]
gi|166231415|sp|A1K5T6.1|Y1574_AZOSB RecName: Full=UPF0061 protein azo1574
gi|119670278|emb|CAL94191.1| conserved hypothetical protein [Azoarcus sp. BH72]
Length = 519
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 276/549 (50%), Positives = 336/549 (61%), Gaps = 37/549 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L +D+ FVRELP DP T R+V A Y++V+P+ V P LVA S VA L D
Sbjct: 1 MRPLVFDNRFVRELPADPETGPHTRQVAGASYSRVNPT-PVAAPHLVAHSAEVAALLGWD 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ P+F F G L G PYA CYGGHQFG WAGQLGDGRAITLGE+LN + RW
Sbjct: 60 ESDIASPEFAEVFGGNRLLDGMEPYAACYGGHQFGNWAGQLGDGRAITLGEVLNGQGGRW 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMF
Sbjct: 120 ELQLKGAGPTPYSRRADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEKVVRDMF 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDGNP+ EPGAIVCRVA SF+RFG++++ A+RG DLD++ L D+ I F IE +
Sbjct: 180 YDGNPQAEPGAIVCRVAPSFIRFGNFELLAARG--DLDLLNRLIDFTIARDFPGIEGSAR 237
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+K A W V RTA++VA W VGF HGV+NTDNMSILG
Sbjct: 238 --------------------DKRARWFETVCARTATMVAHWMRVGFVHGVMNTDNMSILG 277
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D FDP +TPNTTD GRRY F +QP I WN+ Q + L A +
Sbjct: 278 LTIDYGPYGWVDNFDPGWTPNTTDAGGRRYRFGHQPRIANWNLLQLANALFPA-FGSTEA 336
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSN 518
+ Y + E +A+ KLGL ++ L M +VD T FFRAL+
Sbjct: 337 LQAGLNTYAEVYDRESRAMTAAKLGLAALADADLPMVDALHGWMKRAEVDMTLFFRALAE 396
Query: 519 V---KADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
V K DP++ D K + E E + W+ Y G+ ++R+A MN
Sbjct: 397 VDLLKPDPALFLDAFYDDAKRL------ETAEEFSGWLRLYADRCRQEGLDADQRRARMN 450
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
+ NP+YV+RNYL Q AIDAAE GD+G VR LL +M RPYDEQP YA+ P WA R
Sbjct: 451 AANPRYVMRNYLAQQAIDAAEQGDYGPVRSLLDVMRRPYDEQPERAAYAQRRPDWARERA 510
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 511 GCSMLSCSS 519
>gi|149920510|ref|ZP_01908978.1| hypothetical protein PPSIR1_34502 [Plesiocystis pacifica SIR-1]
gi|149818691|gb|EDM78136.1| hypothetical protein PPSIR1_34502 [Plesiocystis pacifica SIR-1]
Length = 557
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 271/575 (47%), Positives = 345/575 (60%), Gaps = 65/575 (11%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA------DSLE 159
+D+SFVRELPGDP D+ R+VL ACY++V P+ V P+L+ WS VA + L+
Sbjct: 11 GFDNSFVRELPGDPEADNFRRQVLGACYSRVEPTP-VSGPELLGWSREVAALLGLPEDLQ 69
Query: 160 LDPKE-----FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
DP+E R + SG+ AG PYA CYGGHQFG WA QLGDGRAITLGEIL
Sbjct: 70 EDPQEDPQAEATREELAAVLSGSRLWAGMEPYAACYGGHQFGNWADQLGDGRAITLGEIL 129
Query: 215 ---NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVT 271
+ + RWELQLKGAG TPYSR DG AVLRSSIREFLCSEAMH LG+PTTRAL LV
Sbjct: 130 RSNDGEDTRWELQLKGAGPTPYSRRGDGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVR 189
Query: 272 TGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRH 331
TG V RDMFYDGN + EPGA+VCRVA SF+RFG++++ A+R +D + +R LADY I
Sbjct: 190 TGDEVRRDMFYDGNAELEPGAVVCRVAPSFVRFGNFELFAAR--KDHETLRRLADYVIAE 247
Query: 332 HFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGV 391
HF +L + YAAW VAERTA ++ W VGF HGV
Sbjct: 248 HF-----------------------PELDAGDYAAWFGIVAERTAEMICHWMRVGFVHGV 284
Query: 392 LNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL 451
+NTDNMS+LGLTIDYGP+G+L+ +DP++TPNTTD GRRY F NQP I WN+ +F L
Sbjct: 285 MNTDNMSVLGLTIDYGPYGWLEDYDPNWTPNTTDAHGRRYRFGNQPRIAAWNLTRFGAAL 344
Query: 452 AAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISK------LLNNMAV 504
L+D+ E+ +E Y + + KLGL + S L N +
Sbjct: 345 --LPLVDEAESIQAGLEAYAERLSAGVLSTYADKLGLRSIDADEGSDQPGSGPWLANTCM 402
Query: 505 D---------KVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL------DIGKERKEAW 549
D + D T F R L+ V DP ++ +L PL+ ++ + +E
Sbjct: 403 DVLRGANTKVETDMTIFHRQLAEVPMDPEASDEAVLAPLRPAYYGEYDRRELPPKLRELT 462
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
+ W+ + S G+ +R+A+M+ NPKYVLRNYL Q AID AE GD + LL+L
Sbjct: 463 LRWLRGLQARVRSEGLDPNQRRAIMDGANPKYVLRNYLAQEAIDLAEAGDPSRIHELLEL 522
Query: 610 MERPYDEQPGMEKYARLPPAWA-YRPGVCMLSCSS 643
+ RPY +QPG E +A P WA +RPG MLSCSS
Sbjct: 523 LRRPYTDQPGKEHFAGKRPEWARHRPGCSMLSCSS 557
>gi|387131420|ref|YP_006294310.1| hypothetical protein Q7C_2498 [Methylophaga sp. JAM7]
gi|386272709|gb|AFJ03623.1| hypothetical protein Q7C_2498 [Methylophaga sp. JAM7]
Length = 546
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 259/550 (47%), Positives = 347/550 (63%), Gaps = 35/550 (6%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
+L +++ FVRELP DP +++ R+VL ACY+ V+P+ +V P L+A+S +A + L
Sbjct: 22 NLQFNNRFVRELPADPDMENVRRQVLGACYSFVNPT-QVRAPYLIAYSPEMATDIGLSAD 80
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ E F F+G LAG P+AQCYGGHQFG WAGQLGDGRAI LGE+ + L
Sbjct: 81 DCEDEWFTQVFAGNEQLAGMQPHAQCYGGHQFGNWAGQLGDGRAINLGEVPDQHGILQTL 140
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG+TPYSR ADGLAVLRSS+REFLCSEAM LGIPTTRAL L+ TG+ V RDMFYD
Sbjct: 141 QLKGAGETPYSRSADGLAVLRSSVREFLCSEAMFHLGIPTTRALSLIGTGEQVMRDMFYD 200
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G PK EPGA+VCRVA SFLR GSY+I ++R +D++ ++ L D+ I HHF H+
Sbjct: 201 GRPKSEPGAVVCRVAPSFLRIGSYEIFSAR--QDVENLKKLVDFTICHHFPHL------- 251
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
G+ +H Y W EV ER+A LV W VGF HGVLNTDN SILGLT
Sbjct: 252 ------GEPNHET-------YLRWFREVCERSAKLVVDWMRVGFVHGVLNTDNTSILGLT 298
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA- 462
IDYGP+G++D +DP +TPNTTD +RY F +Q I WN+ Q L LI++ E
Sbjct: 299 IDYGPYGWIDDYDPDWTPNTTDADLKRYRFGHQAQIMQWNLLQLGNALYP--LINESEPL 356
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSN 518
++ + + ++Q + KLGL +Y +K + +L + + + + D T F+R L+
Sbjct: 357 RQILNDFVDDYTQKWQQMRADKLGLKQYHEASDKALNQRLQHILLLTETDMTLFYRQLAE 416
Query: 519 VKADP-SIPEDELLVPLKAVLL---DIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
+ + +I + ELL ++ + + K ++W+ Y + G SD ERK M
Sbjct: 417 LPCESDTITDAELLSIIEVAWYAPKSVSQNDKTEIVAWLRQYQLRVREEGTSDAERKKAM 476
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YR 633
N +NPKYVLRNYL Q AI+ AE GDF E++ LL ++ PYDEQP ++YA P WA ++
Sbjct: 477 NLINPKYVLRNYLAQQAIERAEKGDFSEIKTLLNVLRHPYDEQPAYQEYANKRPEWARHK 536
Query: 634 PGVCMLSCSS 643
PG MLSCSS
Sbjct: 537 PGCSMLSCSS 546
>gi|56479237|ref|YP_160826.1| hypothetical protein ebA6654 [Aromatoleum aromaticum EbN1]
gi|81356286|sp|Q5NYD9.1|Y3800_AZOSE RecName: Full=UPF0061 protein AZOSEA38000
gi|56315280|emb|CAI09925.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1]
Length = 523
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 266/557 (47%), Positives = 340/557 (61%), Gaps = 49/557 (8%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+++L D+ FV ELPGDP R+V ACY++V P+ V P L+AWS VA L D
Sbjct: 1 MKNLVLDNRFVHELPGDPNPSPDVRQVHGACYSRVMPTP-VSAPHLIAWSPEVAALLGFD 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE-- 219
+ P+F F+G + G PYA CYGGHQFG WAGQLGDGRAITLGE + + +
Sbjct: 60 ESDVRSPEFAAVFAGNALMPGMEPYAACYGGHQFGNWAGQLGDGRAITLGEAVTTRGDGH 119
Query: 220 --RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
RWELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRALCLV TG+ V
Sbjct: 120 TGRWELQLKGAGPTPYSRHADGRAVLRSSIREFLCSEAMHHLGVPTTRALCLVGTGEKVV 179
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDMFYDG PK EPGA+VCRVA SF+RFG+++I SRG E L + L D+ I F +
Sbjct: 180 RDMFYDGRPKAEPGAVVCRVAPSFIRFGNFEIFTSRGDEAL--LTRLVDFTIARDFPEL- 236
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
G E + + A W +V ERTA ++AQW VGF HGV+NTDNM
Sbjct: 237 ------------GGE-------PATRRAEWFCKVCERTARMIAQWMRVGFVHGVMNTDNM 277
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL----AA 453
SILGLTIDYGP+G++D FDP +TPNTTD G+RY F NQP I WN+ Q + L A
Sbjct: 278 SILGLTIDYGPYGWIDNFDPGWTPNTTDAGGKRYRFGNQPHIAHWNLLQLANALYPVFGA 337
Query: 454 AKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYT 510
A+ + + ++ Y F +E + ++ KLG + + ++ L + +VD T
Sbjct: 338 AEPLHEG-----LDLYARVFDEENRRMLAAKLGFEAFGDEDATLVETLHALLTRAEVDMT 392
Query: 511 NFFRALSNVKAD-PSIPEDELLVPLKAVLLDIGKE--RKEAWISWVLSYIQELLSSGISD 567
FFR L+++ + PSI PL+ K + SW+ +Y +
Sbjct: 393 IFFRGLASLDLEAPSID------PLRDAFYSAEKAAVAEPEMNSWLAAYTKRTKQERTPG 446
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLP 627
++R+ MN+VNP++VLRNYL Q AIDAAE G++ V LL +M PYDEQPG E++A
Sbjct: 447 DQRRVRMNAVNPRFVLRNYLAQEAIDAAEQGEYALVSELLDVMRHPYDEQPGRERFAARR 506
Query: 628 PAWAY-RPGVCMLSCSS 643
P WA R G MLSCSS
Sbjct: 507 PDWARNRAGCSMLSCSS 523
>gi|224371590|ref|YP_002605754.1| hypothetical protein HRM2_45340 [Desulfobacterium autotrophicum
HRM2]
gi|223694307|gb|ACN17590.1| conserved hypothetical protein [Desulfobacterium autotrophicum
HRM2]
Length = 534
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 268/554 (48%), Positives = 339/554 (61%), Gaps = 32/554 (5%)
Query: 96 TKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
T LE L +D+SF+ LPGDP ++ R+V +A Y+ V P A V NP+L A S A
Sbjct: 7 TNGQNGLESLIFDNSFINHLPGDPEIENHRRQVRNASYSIVQP-ARVHNPRLGAASREAA 65
Query: 156 DSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILN 215
++L P+F FSG L VP+A CYGGHQFG WAGQLGDGRAI LGEI+N
Sbjct: 66 GLIDLSMDTVNSPEFLEIFSGNRLLPDMVPFATCYGGHQFGTWAGQLGDGRAINLGEIIN 125
Query: 216 LKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKF 275
+ +RW +QLKGAG TPYSR ADGLAVLRSS+REFLCSEAM LG+PTTRAL L+TTG+
Sbjct: 126 REGQRWAIQLKGAGPTPYSRSADGLAVLRSSVREFLCSEAMFHLGVPTTRALSLITTGEE 185
Query: 276 VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRH 335
V RDMFYDG+PK EPGAIV R+A SF RFGS+QIH+SR E+ D+++ L DY I+ F
Sbjct: 186 VLRDMFYDGHPKMEPGAIVTRLAPSFTRFGSFQIHSSR--EETDLLKKLVDYTIKTDFPE 243
Query: 336 IENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTD 395
+ G V Y W V T ++ W VGF HGV+NTD
Sbjct: 244 L-------------GTPSPRV-------YLEWFNTVCTTTVDMIVHWMRVGFVHGVMNTD 283
Query: 396 NMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAK 455
NMSILGLTIDYGP+G+L+ +DP++TPNTTD GRRY F QPDI LWN+ Q + A +
Sbjct: 284 NMSILGLTIDYGPYGWLENYDPNWTPNTTDAQGRRYSFGKQPDIALWNLTQLAK--AISP 341
Query: 456 LIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDYT 510
+I+D +A +E Y +F D Q +M KLGL P+ + +++ LL+ + + + D T
Sbjct: 342 IINDVDALAQSLEVYRNRFQDGSQNMMALKLGLTHFKPETDPALMAALLDLLQLVETDMT 401
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEER 570
FFR L+ V ++ E + K + + W Y Q L R
Sbjct: 402 LFFRQLAMVDPSKTVSPMEFSAAYYQP-EQLTKPYVDRFDDWFKRYGQRLTLDSSDPGTR 460
Query: 571 KALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAW 630
+ MN VNPKYVLRNYL Q AID AE GDF V LL++M PYD+QPG +++A P W
Sbjct: 461 QQRMNQVNPKYVLRNYLAQLAIDQAEQGDFSGVTELLQVMRHPYDDQPGNQRFAEKRPEW 520
Query: 631 AY-RPGVCMLSCSS 643
A RPG MLSCSS
Sbjct: 521 ARNRPGCSMLSCSS 534
>gi|444915353|ref|ZP_21235487.1| Selenoprotein O and cysteine-containing protein [Cystobacter fuscus
DSM 2262]
gi|444713582|gb|ELW54479.1| Selenoprotein O and cysteine-containing protein [Cystobacter fuscus
DSM 2262]
Length = 522
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 262/546 (47%), Positives = 336/546 (61%), Gaps = 32/546 (5%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L + F+ PGDP+TD PR+V A ++KV P+ V P+LVAWS VA L LD
Sbjct: 2 LQFTSRFIDSTPGDPQTDRQPRQVHGALWSKVQPTP-VSAPRLVAWSPEVAALLGLDEAT 60
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+ SG G VPYA YGGHQFG WAGQLGDGRAI+LGE+ + R+ELQ
Sbjct: 61 LRSEEAVRVLSGNGLWPGMVPYAANYGGHQFGQWAGQLGDGRAISLGELQGPEGTRYELQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR DG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMFYDG
Sbjct: 121 LKGAGPTPYSRRGDGRAVLRSSIREFLCSEAMHQLGVPTTRALSLVATGDAVIRDMFYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
NP+ EPGAIVCRV+ +FLRFG++++ ASRG D+ +++ LADY +++ + + +K
Sbjct: 181 NPEAEPGAIVCRVSPTFLRFGNFELCASRG--DVGLLKALADYTLKNFYPELGAPSK--- 235
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ YAA+ +EVA RTA L+A WQ VGF HGV+NTDNMSILGLTI
Sbjct: 236 -----------------DTYAAFFLEVARRTARLIAHWQAVGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-- 462
DYGP+G++D F+P +TPNTTD RRY F NQP IGLWN+ + +A L+D++EA
Sbjct: 279 DYGPYGWVDDFNPGWTPNTTDAQQRRYRFGNQPGIGLWNVERLG--IALLPLLDEEEALV 336
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKYNK----QIISKLLNNMAVDKVDYTNFFRALSN 518
+ Y F E + KLGL + +++ + +A + D T FFR LS
Sbjct: 337 EAGLLEYERVFQSELERRFAAKLGLSSLVQEGDLELVQGCFSWLAAQETDMTIFFRGLSR 396
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVN 578
V P P + V +A + E + W+ ++ + ++ E M++VN
Sbjct: 397 VVTAPEAPSEWPAVLREAFYGKVPDEHVARGLEWLAAWWRRTRREDVAPAELARRMDAVN 456
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVC 637
PKYVLRN+L Q AIDAA GD +V LL++M RP+DEQPG E YA P WA +PG
Sbjct: 457 PKYVLRNWLAQEAIDAAHAGDDSKVHTLLEVMRRPFDEQPGREAYAGRRPEWARSKPGCS 516
Query: 638 MLSCSS 643
LSCSS
Sbjct: 517 ALSCSS 522
>gi|91776140|ref|YP_545896.1| hypothetical protein Mfla_1788 [Methylobacillus flagellatus KT]
gi|121957836|sp|Q1H0D2.1|Y1788_METFK RecName: Full=UPF0061 protein Mfla_1788
gi|91710127|gb|ABE50055.1| protein of unknown function UPF0061 [Methylobacillus flagellatus
KT]
Length = 518
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 262/549 (47%), Positives = 344/549 (62%), Gaps = 42/549 (7%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ F+RELPGDP T + R+V AC+++V P++ V +P+L+A+S + ++LEL +E
Sbjct: 2 LTFDNRFLRELPGDPETSNQLRQVYGACWSRVMPTS-VSSPKLLAYSHEMLEALELSEEE 60
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P + +G + G PYA CYGGHQFG WAGQLGDGRAI+LGE++N + +RWELQ
Sbjct: 61 IRSPAWVDALAGNGLMPGMEPYAACYGGHQFGHWAGQLGDGRAISLGEVVNRQGQRWELQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSS+REFLCSEAMH LGIPTTRAL LV TG V RDMFYDG
Sbjct: 121 LKGAGVTPYSRMADGRAVLRSSVREFLCSEAMHHLGIPTTRALSLVQTGDVVIRDMFYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ E GAIVCRV+ SF+RFG+++I A R +D ++ L D+ I F + N + E
Sbjct: 181 HPQAEKGAIVCRVSPSFIRFGNFEIFAMR--DDKQTLQKLVDFTIDRDFPELRNYPEEER 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L A W + RTA L+AQW VGF HGV+NTDNMSILGLTI
Sbjct: 239 L-------------------AEWFAIICVRTARLIAQWMRVGFVHGVMNTDNMSILGLTI 279
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWN---IAQFSTTLAAAKLIDDKE 461
DYGP+G++D FDP +TPNTTD GRRYCF QPDI WN +AQ TL + I D+
Sbjct: 280 DYGPYGWVDNFDPGWTPNTTDAAGRRYCFGRQPDIARWNLERLAQALYTLKPEREIYDEG 339
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSN 518
+ Y + +E+ A++ K G + + +++++ M ++D T FFR L+
Sbjct: 340 ----LMLYDQAYNNEWGAVLAAKFGFSAWRDEYEPLLNEVFGLMTQAEIDMTEFFRKLAL 395
Query: 519 VKA---DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
V A D I + P L + K R W+ Y Q L+ G ER+ MN
Sbjct: 396 VDAAQPDLGILQSAAYSP---ALWETFKPRFSDWLG---QYAQATLADGRDPAERREAMN 449
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRP 634
VNP+YVLRNYL Q AID A+ GD + L+ ++ +PYDEQPG E++A L P WA ++
Sbjct: 450 RVNPRYVLRNYLAQQAIDLADTGDTSMIEALMDVLRKPYDEQPGKERFAALRPDWARHKA 509
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 510 GCSMLSCSS 518
>gi|380512322|ref|ZP_09855729.1| hypothetical protein XsacN4_13943 [Xanthomonas sacchari NCPPB 4393]
Length = 523
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 264/549 (48%), Positives = 332/549 (60%), Gaps = 33/549 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L +D+ FV ELPGDP T REVL A ++ V P+ V P+L+A+S VA L L
Sbjct: 1 MSSLRFDNRFVAELPGDPETGPRRREVLGALWSPVQPT-PVAAPRLLAYSPEVAALLGLS 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+E P F F+G G PYA YGGHQFG WAGQLGDGRAI+LGE L + RW
Sbjct: 60 EQEVRAPQFAAVFAGNARYPGMQPYAANYGGHQFGHWAGQLGDGRAISLGEALGVDGRRW 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMF
Sbjct: 120 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGETVVRDMF 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG+P+ EPGA+VCRVA SF+RFGS+++ A+RG D+ ++R LAD I F +
Sbjct: 180 YDGHPRAEPGAVVCRVAPSFVRFGSFELPAARG--DIALLRRLADLVIARDFPELPGTGG 237
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ AAW E+ RTA +VA W VGF HGV+NTDNMSILG
Sbjct: 238 ARD--------------------AAWFAEICARTARMVAHWMRVGFVHGVMNTDNMSILG 277
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + L A L DD
Sbjct: 278 LTIDYGPYGWVDDYDPEWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAL--APLFDDVA 335
Query: 462 ANY-VMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALS 517
+ +ER+ +++ + + KLGL + ++ +L+ + +VD T +FR LS
Sbjct: 336 PLHDGLERFRSEYAQAERDNIAAKLGLQQCGDDDVALMRDVLDLLQQGEVDMTLWFRGLS 395
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGK--ERKEAWISWVLSYIQELLSSGISDEERKALMN 575
+ P P L L D K + A+ +W+ Y Q L + R M
Sbjct: 396 ALPLQPWTPAQALAA-LADAFYDPAKLAAQAPAFEAWLARYAQRLQPDPLPAAARAEQMR 454
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
+ NP+YVLRNYL Q AID AE GD G + LL++M RPYDEQPG E +A P WA R
Sbjct: 455 AANPRYVLRNYLAQQAIDRAEQGDTGGIDELLEVMRRPYDEQPGREAFAAKRPDWARTRA 514
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 515 GCSMLSCSS 523
>gi|32476167|ref|NP_869161.1| hypothetical protein RB9953 [Rhodopirellula baltica SH 1]
gi|39932504|sp|Q7UKT5.1|Y9953_RHOBA RecName: Full=UPF0061 protein RB9953
gi|32446711|emb|CAD76547.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 540
Score = 474 bits (1221), Expect = e-131, Method: Compositional matrix adjust.
Identities = 257/559 (45%), Positives = 347/559 (62%), Gaps = 41/559 (7%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
DL +D+ F R+LP D + R+V A +++V P+ V P+ VA S+ VA+ + LDPK
Sbjct: 4 DLTFDNRFTRDLPADTEPRNFTRQVHQAGFSRVKPTP-VSAPKWVAGSKEVAELIGLDPK 62
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ +G G P+A CYGGHQFG WAGQLGDGRAI LGE++ + W L
Sbjct: 63 WLGSAELTEVLAGNALADGMDPFAMCYGGHQFGNWAGQLGDGRAINLGEVVTADEKHWTL 122
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG TPYSR ADGLAVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYD
Sbjct: 123 QLKGAGLTPYSRTADGLAVLRSSVREFLCSEAMHHLGVPTTRALSLVLTGEKVLRDMFYD 182
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+P+ E GAIVCRVA SF+RFG+++I ASR ED + ++TL ++ IR F H+ + +E
Sbjct: 183 GHPEHELGAIVCRVAPSFIRFGNFEIFASR--EDTETLQTLVEHTIRSEFSHLLSEPDAE 240
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+ + AA EV TA +V W VGF HGV+NTDNMSILGLT
Sbjct: 241 ---------------IGPDVIAAMFEEVCRTTAEMVVHWMRVGFVHGVMNTDNMSILGLT 285
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA- 462
IDYGP+G+L+ +DP +TPNTTD GRRY +A+QP I WN+ + L L+ + E
Sbjct: 286 IDYGPYGWLEDYDPDWTPNTTDAQGRRYRYAHQPQIAQWNLVALANAL--VPLVKEAEPL 343
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSN 518
+ Y +F + ++M KLGL KY + +++ LL + + + D T F+R L++
Sbjct: 344 QRGIAVYVEEFQKSWHSMMAGKLGLSKYESETDDELVDSLLTLLQLAETDMTIFYRRLAD 403
Query: 519 VKADPSIPEDELLVPLKAVLL----------DIGKERKEAWISWVLSYIQELLSSG---I 565
++ E + + L AVL ++ +E ++A + W+ SY +L+
Sbjct: 404 IEL--GTREQPVTLELAAVLRHLSEAHYVADEVTEEYQQALMDWMRSYQSRVLADDGFPA 461
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYAR 625
D +R+ MN+VNPKYVLRNYL Q AIDA + GD V LL+++ RPYD+QPG E++A
Sbjct: 462 EDSQRRQRMNAVNPKYVLRNYLAQLAIDACDKGDDSLVSELLEVLRRPYDDQPGKERFAE 521
Query: 626 LPPAWA-YRPGVCMLSCSS 643
P WA +RPG MLSCSS
Sbjct: 522 KRPEWARHRPGCSMLSCSS 540
>gi|237807458|ref|YP_002891898.1| hypothetical protein Tola_0683 [Tolumonas auensis DSM 9187]
gi|259647108|sp|C4LAV8.1|Y683_TOLAT RecName: Full=UPF0061 protein Tola_0683
gi|237499719|gb|ACQ92312.1| protein of unknown function UPF0061 [Tolumonas auensis DSM 9187]
Length = 519
Score = 474 bits (1221), Expect = e-131, Method: Compositional matrix adjust.
Identities = 261/544 (47%), Positives = 341/544 (62%), Gaps = 33/544 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L++D+ F+RELPGDP T + PR+V A ++ V+P A V PQL+A S VA L + E
Sbjct: 4 LHFDNRFIRELPGDPLTLNQPRQVHAAFWSAVTP-APVPQPQLIASSAEVAALLGISLAE 62
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
++P + SG L G P+A CYGGHQFG WAGQLGDGRAI+LGE+++ RWELQ
Sbjct: 63 LQQPAWVAALSGNGLLDGMSPFATCYGGHQFGNWAGQLGDGRAISLGELIH-NDLRWELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR DG AVLRSSIREFLCSEAM LG+PTTRAL LV TG+ + RDMFYDG
Sbjct: 122 LKGAGVTPYSRRGDGKAVLRSSIREFLCSEAMFHLGVPTTRALSLVLTGEQIWRDMFYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
NP++EPGAIVCRVA SF+RFG +Q+ A RG+ DL + L D+ I F H+
Sbjct: 182 NPQQEPGAIVCRVAPSFIRFGHFQLPAMRGESDL--LNQLIDFTIDRDFPHLS------- 232
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ + W EV TA L+ +W VGF HGV+NTDNMSILGLTI
Sbjct: 233 ------------AQPATVRRGVWFSEVCITTAKLMVEWTRVGFVHGVMNTDNMSILGLTI 280
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D FD ++TPNTTD G RYCF QP I WN+ + + L + D
Sbjct: 281 DYGPYGWVDNFDLNWTPNTTDAEGLRYCFGRQPAIARWNLERLAEALGTV-MTDHAILAQ 339
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+E + F E A++ KLG ++ + +++++L + + +VD T FFR L+ V
Sbjct: 340 GIEMFDETFAQEMAAMLAAKLGWQQWLPEDSELVNRLFDLLQQAEVDMTLFFRRLALV-- 397
Query: 522 DPSIPEDELLVPLKAVLL-DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
D S P +L V A D+ + + A+ W+ +Y Q +LS G+ ER A MN VNP
Sbjct: 398 DVSAP--DLTVLADAFYRDDLFCQHQPAFTQWLTNYSQRVLSEGVLPAERAARMNQVNPV 455
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCML 639
YVLRNYL Q IDAAE G++ + LL+++ +PY EQ G E YA+ P WA ++PG ML
Sbjct: 456 YVLRNYLAQQVIDAAEQGNYQPIAELLEVLRQPYTEQSGKEAYAQKRPDWARHKPGCSML 515
Query: 640 SCSS 643
SCSS
Sbjct: 516 SCSS 519
>gi|449133591|ref|ZP_21769141.1| protein belonging to Uncharacterized protein family UPF0061
[Rhodopirellula europaea 6C]
gi|448887756|gb|EMB18114.1| protein belonging to Uncharacterized protein family UPF0061
[Rhodopirellula europaea 6C]
Length = 542
Score = 473 bits (1217), Expect = e-130, Method: Compositional matrix adjust.
Identities = 257/559 (45%), Positives = 350/559 (62%), Gaps = 39/559 (6%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
DL +D+ F R+LP DP + + R+V A +++V P+ V P+ VA S+ VA+ + LD K
Sbjct: 4 DLTFDNRFTRDLPADPESRNFTRQVHQAGFSRVKPTP-VSAPKWVAGSKEVAELIGLDSK 62
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ +G G P+A CYGGHQFG WAGQLGDGRAI LGE++ + W L
Sbjct: 63 WLGSAELTEVLAGNALADGMDPFAMCYGGHQFGNWAGQLGDGRAINLGEVVTADEKHWTL 122
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG TPYSR ADGLAVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYD
Sbjct: 123 QLKGAGLTPYSRTADGLAVLRSSVREFLCSEAMHHLGVPTTRALSLVLTGEKVLRDMFYD 182
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+P+ E GA+VCRVA SF+RFG+++I ASR ED + ++TL ++ IR F H+
Sbjct: 183 GHPEHELGAVVCRVAPSFIRFGNFEIFASR--EDTETLQTLVEHTIRSEFPHL------- 233
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
LS + D ++ + AA EV TA +V W VGF HGV+NTDNMSILGLT
Sbjct: 234 -LSGAGPD-----AEVGPDVIAAMFEEVCRTTAEMVVHWMRVGFVHGVMNTDNMSILGLT 287
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA- 462
IDYGP+G+L+ +DP +TPNTTD GRRY +A+QP I WN+ + L L+ + E
Sbjct: 288 IDYGPYGWLEDYDPDWTPNTTDAQGRRYRYAHQPQIAQWNLVALANAL--VPLVKEAEPL 345
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSN 518
+ Y +F + ++M KLGL KY + +++ LL + + + D T F+R L++
Sbjct: 346 QRGIAVYVEEFQKSWHSMMAGKLGLSKYESETDDELVDSLLTLLQLAETDMTIFYRRLAD 405
Query: 519 VKADPSIPEDELLVPLKAVLL----------DIGKERKEAWISWVLSYIQELLSSG---I 565
++ E + + L AVL ++ +E ++A + W+ SY +L+
Sbjct: 406 IEL--GTQEQPVALELAAVLNHLSEAHYVADEVTEEYQQALMDWMRSYQSRVLADDGFPA 463
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYAR 625
+D +R+ MN+VNPKYVLRNYL Q AIDA + GD V LL ++ RPY++QPG E++A
Sbjct: 464 NDSQRRQRMNAVNPKYVLRNYLAQLAIDACDKGDDSMVSELLDVLRRPYEDQPGKERFAE 523
Query: 626 LPPAWA-YRPGVCMLSCSS 643
P WA +RPG MLSCSS
Sbjct: 524 KRPEWARHRPGCSMLSCSS 542
>gi|333986081|ref|YP_004515291.1| hypothetical protein [Methylomonas methanica MC09]
gi|333810122|gb|AEG02792.1| UPF0061 protein ydiU [Methylomonas methanica MC09]
Length = 531
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 253/553 (45%), Positives = 335/553 (60%), Gaps = 42/553 (7%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
L+ LN+D+ FV +LP DP D+ R+V +CY++V P V+ P+LVA+S+ +A L+L
Sbjct: 10 LDTLNFDNRFVHDLPCDPEPDNYRRQVYQSCYSQVRPKP-VKAPRLVAYSKEMAKLLDLP 68
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ F F+G L G PYA YGG QFG WAGQLGDGRAI LGE++N + +RW
Sbjct: 69 EAACQSQTFCQVFAGNQLLDGMEPYAMNYGGQQFGHWAGQLGDGRAINLGEVVNREGQRW 128
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
LQLKGAG TPYSR ADGLAVLRSSIREFLCSEAM+ LG+PTTRAL ++ TG+ V RDMF
Sbjct: 129 TLQLKGAGPTPYSRSADGLAVLRSSIREFLCSEAMYHLGVPTTRALSVILTGEQVVRDMF 188
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDGNP+ EPGA+VCRVA SF+RFG++Q+ SR +DL+ ++ L D+ I+ F H+ NK
Sbjct: 189 YDGNPQLEPGAVVCRVAPSFIRFGNFQLFTSR--DDLETLKQLVDFTIKTDFPHLGAPNK 246
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
Y W E+ TA ++ WQ VGF HGV+NTDNMSILG
Sbjct: 247 E--------------------VYLQWFAEICRTTADMIVHWQRVGFVHGVMNTDNMSILG 286
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G+L+ +DP +TPNTTD GRRY F NQP I WN+ Q + L ++ +
Sbjct: 287 LTIDYGPYGWLENYDPDWTPNTTDAQGRRYRFGNQPKIAYWNLVQLANALYPL-ILKAEP 345
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDYTNFFRALS 517
+ + + F +Q M KLGL P ++ + S+L + + D T F+R L+
Sbjct: 346 LQDALTVFTSTFEQNWQQTMATKLGLKAFDPGSDETLTSELATLLQAAEADMTLFYRGLA 405
Query: 518 NVKADPSIP------EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
++A+ ++ E PL L + + +W +Y L ER+
Sbjct: 406 AIEANDAVAVFQAHLEACSYEPLSPETLALAE-------AWFQTYQARLQGENRPQAERQ 458
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
MN+VNP YVLRNYL Q AID AE DF EV LL+++ PY EQ G +++A P WA
Sbjct: 459 RAMNAVNPLYVLRNYLAQQAIDLAEQDDFSEVWELLEVLRHPYTEQAGKQRFAEKRPDWA 518
Query: 632 -YRPGVCMLSCSS 643
R G MLSCSS
Sbjct: 519 KQRAGCSMLSCSS 531
>gi|386818326|ref|ZP_10105544.1| UPF0061 protein ydiU [Thiothrix nivea DSM 5205]
gi|386422902|gb|EIJ36737.1| UPF0061 protein ydiU [Thiothrix nivea DSM 5205]
Length = 519
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 257/546 (47%), Positives = 336/546 (61%), Gaps = 31/546 (5%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ LN+D+ FV ELPGD +IPR+V A +++V P+ V P+L+A S VA L
Sbjct: 1 MHPLNFDNRFVHELPGDTDGVNIPRQVYDAFWSEVKPTP-VSAPRLLAHSPEVAQLLGWQ 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ PDF F G L G PYA YGGHQFG WAGQLGDGRAI+LGE +N + +RW
Sbjct: 60 DADITDPDFEQVFGGNKLLPGMQPYAANYGGHQFGGWAGQLGDGRAISLGETVNAQGQRW 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSS+REFLCSEAMH LGIPTTRAL LV TG V RDMF
Sbjct: 120 ELQLKGAGPTPYSRRADGRAVLRSSVREFLCSEAMHHLGIPTTRALSLVMTGDGVVRDMF 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDGNP+ EPGAIVCRVA SF+RFG++++ SRG DL ++ L D+ I + ++
Sbjct: 180 YDGNPQVEPGAIVCRVAPSFIRFGNFELPNSRG--DLGLLEQLVDFTIARDYPELQ---- 233
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
GD T K + W +E+ RTA ++A W VGF HGV+NTDNMSILG
Sbjct: 234 --------GD--------TQEKRSQWFLEICRRTAVMMAHWMRVGFVHGVMNTDNMSILG 277
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G+L+ +DP +TPNTTD GRRY + QP IG WN+A+ L + D
Sbjct: 278 LTIDYGPYGWLEDYDPMWTPNTTDAQGRRYAYGQQPYIGHWNLARLRDALKPV-IGDASV 336
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSN 518
+ Y + + + ++ K G+ + + I+ M +VD T FFR L++
Sbjct: 337 LQAGSQLYADTYSETFGEMLAAKFGIRALSDEDAPWINSAFELMHKSEVDMTLFFRNLAS 396
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVN 578
+ D P E L+P D+ +E ++ W +W+ Y Q L + + +ER+ MN+ N
Sbjct: 397 L--DMREPRLEPLLP-AFYREDLLREHRQDWENWLQQYRQRLQADNLPTDERQRRMNTAN 453
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVC 637
P++VLRNYL Q AID A GD G + LL+++ RPYDEQ K+A P WA ++ G
Sbjct: 454 PRFVLRNYLAQQAIDKAAAGDNGMILELLEVLRRPYDEQAQYAKFAEKRPEWARHKAGCS 513
Query: 638 MLSCSS 643
MLSCSS
Sbjct: 514 MLSCSS 519
>gi|358636858|dbj|BAL24155.1| hypothetical protein AZKH_1842 [Azoarcus sp. KH32C]
Length = 484
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 253/509 (49%), Positives = 315/509 (61%), Gaps = 36/509 (7%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
V P+L+AWS +A +L D + P+F F G L G PYA CYGGHQFG WAGQ
Sbjct: 5 VREPRLIAWSPEMASALGFDEADVRSPEFAQVFGGNALLPGMEPYAACYGGHQFGNWAGQ 64
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGRAITLGE +N K ER+ELQLKGAGKTPYSR ADG AVLRSSIREFLCSEAMH LGI
Sbjct: 65 LGDGRAITLGEAVNAKGERYELQLKGAGKTPYSRTADGRAVLRSSIREFLCSEAMHHLGI 124
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
PTTRALC+V TG+ V RDMFYDG+P+ EPGA+VCRVA SF+RFG+++I ++RG E L +
Sbjct: 125 PTTRALCIVGTGEDVIRDMFYDGHPRAEPGAVVCRVAPSFIRFGNFEIFSARGDEQL--L 182
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
L D+ I F + T + W V ERTA L+A+
Sbjct: 183 AQLVDFTIARDFPELGGT--------------------TETRRTEWFHTVCERTARLMAE 222
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
W VGF HGV+NTDNMSILGLTIDYGP+G++D FDP +TPNTTD GRRY F NQP IG
Sbjct: 223 WMRVGFVHGVMNTDNMSILGLTIDYGPYGWIDNFDPDWTPNTTDASGRRYRFGNQPGIGQ 282
Query: 442 WNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKL 498
WN+ Q L A + ++RY + E + + KLGL +++ +++ L
Sbjct: 283 WNLWQLGNALYPA-FGSVEPLQEGLDRYAVVYARERERTLAGKLGLTMFHEGDSELVDTL 341
Query: 499 LNNMAVDKVDYTNFFRALSNVK-ADPSIPEDELLVPLKAVLLDIGKERKE--AWISWVLS 555
+A +VD T FFR L++V PSI P++ + +E A+ W+
Sbjct: 342 HTLLARAEVDMTIFFRGLADVDLQQPSIE------PVREAFYNEALLERESAAFADWLAR 395
Query: 556 YIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYD 615
Y L G+ E R+ MN+ NP YVLRNYL Q AIDAAE GD + LL +M RPY+
Sbjct: 396 YAARALQDGVPPELRRERMNAANPCYVLRNYLAQEAIDAAEQGDNALILELLDVMRRPYE 455
Query: 616 EQPGMEKYARLPPAWA-YRPGVCMLSCSS 643
+QPG E++A P WA R G MLSCSS
Sbjct: 456 DQPGRERFAAKRPDWARQRAGCSMLSCSS 484
>gi|417301033|ref|ZP_12088206.1| protein belonging to uncharacterized protein family UPF0061
[Rhodopirellula baltica WH47]
gi|327542687|gb|EGF29158.1| protein belonging to uncharacterized protein family UPF0061
[Rhodopirellula baltica WH47]
Length = 540
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 255/559 (45%), Positives = 346/559 (61%), Gaps = 41/559 (7%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
DL +D+ F R+LP D + R+V A +++V P+ V P+ VA S+ VA+ + LDPK
Sbjct: 4 DLTFDNRFTRDLPADTEPRNFTRQVHQAGFSRVKPTP-VSAPKWVAGSKEVAELIGLDPK 62
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ +G G P+A CYGGHQFG WAGQLGDGRAI LGE++ + W L
Sbjct: 63 WLGSAELTEVLAGNALADGMDPFAMCYGGHQFGNWAGQLGDGRAINLGEVVTSDEKHWTL 122
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG TPYSR ADGLAVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYD
Sbjct: 123 QLKGAGLTPYSRTADGLAVLRSSVREFLCSEAMHHLGVPTTRALSLVLTGEKVLRDMFYD 182
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+P+ E GA+VCRVA SF+RFG+++I ASR ED + ++TL ++ IR F H+ + +E
Sbjct: 183 GHPEHELGAVVCRVAPSFIRFGNFEIFASR--EDTETLQTLVEHTIRSEFSHLLSEPDAE 240
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+ + AA EV TA +V W VGF HGV+NTDNMSILGLT
Sbjct: 241 ---------------IGPDVVAAMFEEVCRTTAEMVVHWMRVGFVHGVMNTDNMSILGLT 285
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA- 462
IDYGP+G+L+ +DP +TPNTTD GRRY +A+QP I WN+ + L L+ + E
Sbjct: 286 IDYGPYGWLEDYDPDWTPNTTDAQGRRYRYAHQPQIAQWNLVALANAL--VPLVKEAEPL 343
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSN 518
+ Y +F + ++M KLGL KY + +++ LL + + + D T F+R L++
Sbjct: 344 QRGIAVYVEEFQKSWHSMMAGKLGLSKYESETDDELVDSLLTLLQLAETDMTIFYRRLAD 403
Query: 519 VKADPSIPEDELLVPLKAVLL----------DIGKERKEAWISWVLSYIQELLSSG---I 565
++ E + + L VL ++ +E ++A + W+ SY +L+
Sbjct: 404 IEL--GTREQPVTLELAVVLRHLSEAHYVADEVTEEYQQALMDWMRSYQSRVLADDGFPA 461
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYAR 625
D +R+ MN+VNPKYVLRNYL Q AIDA + GD V LL+++ RPYD+QPG E++A
Sbjct: 462 EDSQRRQRMNAVNPKYVLRNYLAQLAIDACDKGDDSLVSELLEVLRRPYDDQPGKERFAE 521
Query: 626 LPPAWA-YRPGVCMLSCSS 643
P WA +RPG MLSCSS
Sbjct: 522 KRPEWARHRPGCSMLSCSS 540
>gi|302841364|ref|XP_002952227.1| hypothetical protein VOLCADRAFT_62183 [Volvox carteri f.
nagariensis]
gi|300262492|gb|EFJ46698.1| hypothetical protein VOLCADRAFT_62183 [Volvox carteri f.
nagariensis]
Length = 604
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 266/575 (46%), Positives = 347/575 (60%), Gaps = 50/575 (8%)
Query: 103 EDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDP 162
++L WDH+FV+ELP DP + ++ R+V A ++ VSP+ P V +S VA + LDP
Sbjct: 46 KNLPWDHTFVKELPADPDSRNVVRQVEGALFSFVSPTPPSGVPYTVTYSRQVARLVGLDP 105
Query: 163 KEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILN-LKSERW 221
+ ER +FPL SGA PL G++PYA YGGHQFG WAGQLGDGRAITLGE++N + +RW
Sbjct: 106 TDCERAEFPLVMSGAAPLPGSLPYAAVYGGHQFGQWAGQLGDGRAITLGEVVNPVDGQRW 165
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAGKTPYSR ADG AVLRSS+REF+CSEAM LG+PTTRAL LV TG
Sbjct: 166 ELQLKGAGKTPYSRRADGRAVLRSSLREFVCSEAMAALGVPTTRALSLVGTGG------- 218
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
PGA+VCRVA SF+RFG++Q+ SRG ++ +V+ AD+ I++H H+ + +
Sbjct: 219 --------PGAVVCRVAPSFMRFGTFQLPVSRGLGEVGLVKMAADWVIKYHNPHLAS-DL 269
Query: 342 SESLSFST-------GDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
S L + T + Y EV RTA+LVA WQ +GF HGVLNT
Sbjct: 270 SVCLPYLTICPPLPPPPPPPPPPSDSPQPYLDLLREVTCRTATLVAAWQSLGFVHGVLNT 329
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA 454
DNMSILGLTIDYGPFGFLD FDP +TPN TD GRRY + NQP+ +N+ L AA
Sbjct: 330 DNMSILGLTIDYGPFGFLDKFDPDWTPNLTDAGGRRYSYRNQPEAVQFNLVMLGNALLAA 389
Query: 455 KLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFR 514
L+ + A V+ Y + Y A M KLGL +Y+ + +L+ M D D+TN FR
Sbjct: 390 DLVPREGAEEVLREYSKVLSESYNARMAAKLGLREYDMTLTHELMRLMYDDDADFTNTFR 449
Query: 515 ALSNVKADPSIPEDELL-------------------VPLKAVLLDIG-----KERKEAWI 550
AL ++ P + +P G +ER AW
Sbjct: 450 ALCSISCTEDEPPECASSDSGSESGSGLRPTGHHHDLPAALAAALNGGQPLSEERVAAWR 509
Query: 551 SWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLM 610
W+ +Y L + G+ + ER++ SVNPK++ R +L Q AI+AAE GD+ E+ LL+++
Sbjct: 510 QWLQAYRARLRAEGVPEAERQSAQRSVNPKFIPRQHLLQWAIEAAEGGDYSELETLLEVL 569
Query: 611 ERPYDEQPGM-EKYARLPP-AWAYRPGVCMLSCSS 643
ERPYD+QP KY+ LPP RPGVCMLSCSS
Sbjct: 570 ERPYDDQPDTAAKYSGLPPEEMVRRPGVCMLSCSS 604
>gi|440717735|ref|ZP_20898216.1| protein belonging to uncharacterized protein family UPF0061
[Rhodopirellula baltica SWK14]
gi|436437158|gb|ELP30822.1| protein belonging to uncharacterized protein family UPF0061
[Rhodopirellula baltica SWK14]
Length = 540
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 258/557 (46%), Positives = 346/557 (62%), Gaps = 37/557 (6%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
DL +D+ F R+LP D + R+V A +++V P+ V P+ VA S+ VA+ + LDPK
Sbjct: 4 DLTFDNRFTRDLPADTEPRNFTRQVHQAGFSRVKPTP-VSAPKWVAGSKEVAELIGLDPK 62
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ +G G P+A CYGGHQFG WAGQLGDGRAI LGE++ + W L
Sbjct: 63 WLGSAELTEVLAGNALADGMDPFAMCYGGHQFGNWAGQLGDGRAINLGEVVTADEKHWTL 122
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG TPYSR ADGLAVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYD
Sbjct: 123 QLKGAGLTPYSRTADGLAVLRSSVREFLCSEAMHHLGVPTTRALSLVLTGEKVLRDMFYD 182
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+P+ E GA+VCRVA SF+RFG+++I ASR ED + ++TL ++ IR F H+ + SE
Sbjct: 183 GHPEHELGAVVCRVAPSFIRFGNFEIFASR--EDTETLQTLVEHTIRSEFSHLLSPPDSE 240
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+ + AA EV TA +V W VGF HGV+NTDNMSILGLT
Sbjct: 241 ---------------IGPDVVAAMFEEVCRTTAEMVVHWMRVGFVHGVMNTDNMSILGLT 285
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA- 462
IDYGP+G+L+ +DP +TPNTTD GRRY +A+QP I WN+ + L L+ + E
Sbjct: 286 IDYGPYGWLEDYDPDWTPNTTDAQGRRYRYAHQPQIAQWNLVALANAL--VPLVKEAEPL 343
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSN 518
+ Y +F + ++M KLGL KY + +++ LL + + + D T F+R L++
Sbjct: 344 QRGIAIYVEEFQKSWHSMMAGKLGLSKYESETDDELVDSLLTLLQLAETDMTIFYRRLAD 403
Query: 519 V----KADPSIPE--DEL--LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG---ISD 567
+ + P E D L L V ++ +E ++A + W+ SY +L+ D
Sbjct: 404 IGLGTREQPVTLELADVLRHLSEAHYVADEVTEEYQQALMDWMRSYQSRVLADDGFPADD 463
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLP 627
+R+ MN+VNPKYVLRNYL Q AIDA + GD V LL+++ RPYD+QPG E++A
Sbjct: 464 SQRRQRMNAVNPKYVLRNYLAQLAIDACDKGDDSLVSELLEVLRRPYDDQPGKERFAEKR 523
Query: 628 PAWA-YRPGVCMLSCSS 643
P WA +RPG MLSCSS
Sbjct: 524 PEWARHRPGCSMLSCSS 540
>gi|262199258|ref|YP_003270467.1| hypothetical protein [Haliangium ochraceum DSM 14365]
gi|262082605|gb|ACY18574.1| protein of unknown function UPF0061 [Haliangium ochraceum DSM
14365]
Length = 548
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 265/556 (47%), Positives = 337/556 (60%), Gaps = 43/556 (7%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+SFVRELPGD + R V ACY+++ P+ V P+ VA++ VA L L
Sbjct: 19 LAFDNSFVRELPGDRVAGNHVRTVSGACYSRIDPT-PVRAPETVAYAPEVAALLGLPEAF 77
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F FSG+ L G P+A CYGGHQFG WAGQLGDGRAI+LGE++ +RWELQ
Sbjct: 78 CVSPAFAQVFSGSARLPGMAPWAACYGGHQFGHWAGQLGDGRAISLGELIA-DGQRWELQ 136
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFY G
Sbjct: 137 LKGAGLTPYSRTADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVRTGEDVVRDMFYSG 196
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGA+VCRVA SFLRFG+++I A+R D ++ L DYAIR HF + K+
Sbjct: 197 DPRPEPGAVVCRVAPSFLRFGNFEILAAR--RDAALLGRLLDYAIRTHFPALGTPCKA-- 252
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
Y AW EV RTA +VA W VGF HGV+NTDNMSILG TI
Sbjct: 253 ------------------VYVAWMTEVCRRTAVMVAHWMRVGFVHGVMNTDNMSILGQTI 294
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-N 463
DYGP+G++D DP++TPNTTD RRY F QP + LWN+ + + + ++DD A
Sbjct: 295 DYGPYGWIDNHDPNWTPNTTDAHRRRYRFGQQPQVALWNLVKLAQAIEL--VVDDTAALE 352
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVD-KVDYTNFFRALSNV 519
++ Y F D + KLGL +++ ++ L M D + D T F+R L+ +
Sbjct: 353 GALDSYQHSFEDAMHDTLAGKLGLREFDPSSDVLLVDALTGMLTDLEFDMTIFYRRLAAL 412
Query: 520 K-ADPSIPEDE---------LLVPLK-AVLLDIGKERKEAWISWVLSYIQELLSSGISDE 568
AD + P + LL + A + + ++ ++W+ Y + + G D
Sbjct: 413 PCADAAGPNGDSAGDSDSAALLAHFEDAQYRPLSEREQQRALAWLRDYRARVRADGTPDG 472
Query: 569 ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPP 628
ER A MN VNPKYVLRNY+ Q AI+ AE GD VR LL L+ RPYDEQP + +A P
Sbjct: 473 ERAAAMNRVNPKYVLRNYMAQQAIERAEAGDAALVRELLALLRRPYDEQPQHQTWAGKRP 532
Query: 629 AWAY-RPGVCMLSCSS 643
WA RPG MLSCSS
Sbjct: 533 EWARDRPGCSMLSCSS 548
>gi|389722450|ref|ZP_10189089.1| hypothetical protein UU5_04194 [Rhodanobacter sp. 115]
gi|388441886|gb|EIL98122.1| hypothetical protein UU5_04194 [Rhodanobacter sp. 115]
Length = 520
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 255/548 (46%), Positives = 340/548 (62%), Gaps = 34/548 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L++D++++RELPGDP T R+V A Y++V P+ V P+++A S +A +L
Sbjct: 1 MHTLHFDNAYLRELPGDPETGPRLRQVAGALYSRVEPT-PVAAPRVLAHSAEMASALGFS 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ F F G L G P+A YGGHQFG+WAGQLGDGRAI+LGE ++ ERW
Sbjct: 60 EADVASETFAQVFGGNALLDGMQPWAANYGGHQFGVWAGQLGDGRAISLGETISAAGERW 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LGIPTTRALCLV TG+ V RDMF
Sbjct: 120 ELQLKGAGATPYSRGADGRAVLRSSIREFLCSEAMHHLGIPTTRALCLVGTGEPVLRDMF 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG+ ++EPGAIVCR A SF+RFG +++ ASR D+ ++R+L ++ +R F H+ +
Sbjct: 180 YDGHVQDEPGAIVCRAAPSFIRFGHFELPASR--NDVPLLRSLVEFTLRRDFPHL--TGQ 235
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
ESL +A W EV RTA LVAQW VGF HGV+NTDNMSI G
Sbjct: 236 GESL------------------HADWFGEVCARTAQLVAQWMRVGFVHGVMNTDNMSITG 277
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LT+DYGP+G++D FDP +TPNTTD RRY + QPD+ WN+++ + LA D
Sbjct: 278 LTLDYGPYGWVDNFDPDWTPNTTDAQRRRYRYGQQPDVAWWNLSRLAGALAPL-FGDIAP 336
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSN 518
++RY + + +A M KLGL + + ++ L + +VD T +FRAL +
Sbjct: 337 LQAGLDRYAAVYAEADRANMADKLGLAECREDDVALMQSLHGLLRQAEVDMTLWFRALGD 396
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKALMNS 576
+ A+ + L L+ D K R + A+ W+ Y L ++ +R+ M +
Sbjct: 397 LDANAPM----LTSALRDAFYDEAKLRANEAAFGDWLQRYAARLADDPLTSGQRRNRMRA 452
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPG 635
NP+YVLRNYL Q AID A GD + LL++M PYD+QPG E YA+ P WA ++PG
Sbjct: 453 ANPRYVLRNYLAQQAIDRASQGDHAGISELLEVMRHPYDDQPGHEAYAQKRPDWARHKPG 512
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 513 CSMLSCSS 520
>gi|421614214|ref|ZP_16055279.1| protein belonging to uncharacterized protein family UPF0061
[Rhodopirellula baltica SH28]
gi|408495080|gb|EKJ99673.1| protein belonging to uncharacterized protein family UPF0061
[Rhodopirellula baltica SH28]
Length = 540
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 255/559 (45%), Positives = 345/559 (61%), Gaps = 41/559 (7%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
DL +D+ F R+LP D + R+V A +++V P+ V P+ VA S+ VA+ + LDPK
Sbjct: 4 DLTFDNRFTRDLPADTEPRNFTRQVHQAGFSRVKPTP-VSAPKWVAGSKEVAELIGLDPK 62
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ +G G P+A CYGGHQFG WAGQLGDGRAI L E++ + W L
Sbjct: 63 WLGSAELTEVLAGNALADGMDPFAMCYGGHQFGNWAGQLGDGRAINLAEVVTSGEKHWTL 122
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG TPYSR ADGLAVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYD
Sbjct: 123 QLKGAGLTPYSRTADGLAVLRSSVREFLCSEAMHHLGVPTTRALSLVLTGEKVLRDMFYD 182
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+P+ E GAIVCRVA SF+RFG+++I ASR ED + ++TL ++ IR F H+ + +E
Sbjct: 183 GHPEHELGAIVCRVAPSFIRFGNFEIFASR--EDTETLQTLVEHTIRSEFSHLLSEPDAE 240
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+ + AA EV TA +V W VGF HGV+NTDNMSILGLT
Sbjct: 241 ---------------IGPDVIAAMFEEVCRTTAEMVVHWMRVGFVHGVMNTDNMSILGLT 285
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA- 462
IDYGP+G+L+ +DP +TPNTTD GRRY +A+QP I WN+ + L L+ + E
Sbjct: 286 IDYGPYGWLEDYDPDWTPNTTDAQGRRYRYAHQPQIAQWNLVALANAL--VPLVKEAEPL 343
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSN 518
+ Y +F + ++M KLGL KY + +++ LL + + + D T F+R L++
Sbjct: 344 QRGIAVYVEEFQKSWHSMMAGKLGLSKYESETDDELVDSLLTLLQLAETDMTIFYRRLAD 403
Query: 519 VKADPSIPEDELLVPLKAVLL----------DIGKERKEAWISWVLSYIQELLSSG---I 565
++ E + + L VL ++ +E ++A + W+ SY +L+
Sbjct: 404 IEL--GTREQPVTLELAVVLRYLSETHYVADEVTEEYQQALMDWMRSYQSRVLADDGFPA 461
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYAR 625
D +R+ MN+VNPKYVLRNYL Q AIDA + GD V LL+++ RPYD+QPG E++A
Sbjct: 462 DDSQRRQRMNAVNPKYVLRNYLAQLAIDACDKGDDSLVSELLEVLRRPYDDQPGKERFAE 521
Query: 626 LPPAWA-YRPGVCMLSCSS 643
P WA +RPG MLSCSS
Sbjct: 522 KRPEWARHRPGCSMLSCSS 540
>gi|332667321|ref|YP_004450109.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332336135|gb|AEE53236.1| UPF0061 protein ydiU [Haliscomenobacter hydrossis DSM 1100]
Length = 526
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 259/554 (46%), Positives = 350/554 (63%), Gaps = 40/554 (7%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ LN +F +ELP DP + R+V AC++ V+P + NP LV S+ +A+++ L
Sbjct: 1 MNKLNIQDTFNQELPADPNLSNTRRQVRGACFSYVTPR-QPSNPVLVHASQEMAEAIGLA 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ + +F FSGAT L G PYA CYGGHQFG WAGQLGDGRAI L E+++ + +RW
Sbjct: 60 AGDTQSEEFLSIFSGATTLEGTSPYAMCYGGHQFGSWAGQLGDGRAINLTEVVH-EGQRW 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
LQLKGAG+TPYSR ADGLAVLRSSIRE LCSEAM+ LG+PTTR+L LV TG V RDM
Sbjct: 119 ALQLKGAGETPYSRTADGLAVLRSSIREHLCSEAMYHLGVPTTRSLSLVLTGDQVMRDML 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
Y+GN E GA+VCRVA SF+RFG++QI +R +++ +R+L DY IRH F HIE
Sbjct: 179 YNGNTAYEKGAVVCRVAPSFIRFGNFQIFTAR--DEVSTLRSLTDYTIRHFFPHIEPG-- 234
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
T YA + EV++RT LV +WQ VGF HGV+NTDN+SILG
Sbjct: 235 ------------------TPEAYAEFFKEVSQRTLDLVIEWQRVGFVHGVMNTDNLSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK- 460
LTIDYGP+G+L+ ++P +TPNTTD RRY + QP + LWN+ Q + L L+ D
Sbjct: 277 LTIDYGPYGWLEGYEPDWTPNTTDRSQRRYRYGQQPGVALWNLVQLANALMP--LVKDTV 334
Query: 461 --EANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
EA+ + + KF +Y A++ +KLGL + + ++ +L +A + D T FFR
Sbjct: 335 LLEAS--LADFQLKFPKKYLAMLRRKLGLATPDEGDAELAEELEKLLAYTETDMTIFFRN 392
Query: 516 LSNVKADPSIPEDELLVP-LKAVLL---DIGKERKEAWISWVLSYIQELLSSGISDEERK 571
LS V+ D +P ++ L+ + D+ ++ W W+ Y+Q L +DEER+
Sbjct: 393 LSKVEKDGGLPANKTFFEHLQTAMYQPEDLNAALQQKWEDWLDHYLQRLQLETANDEERR 452
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGM-EKYARLPPAW 630
+MN+ NPKYVLRNY+ Q AID A+LGDF V L +L+++PYDEQP M EK+ P W
Sbjct: 453 TVMNNANPKYVLRNYMAQLAIDQADLGDFKLVDELYQLLKKPYDEQPEMEEKWFVKRPEW 512
Query: 631 AY-RPGVCMLSCSS 643
A + G MLSCSS
Sbjct: 513 ARNKVGCSMLSCSS 526
>gi|253996672|ref|YP_003048736.1| hypothetical protein Mmol_1303 [Methylotenera mobilis JLW8]
gi|253983351|gb|ACT48209.1| protein of unknown function UPF0061 [Methylotenera mobilis JLW8]
Length = 528
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 256/548 (46%), Positives = 348/548 (63%), Gaps = 26/548 (4%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ LN+D+ F RELPGD TD+ R+V A ++ V P+ V+ P L+A+S VA+ L L
Sbjct: 1 MRTLNFDNRFYRELPGDAITDNYTRQVKDALWSSVMPTP-VKAPSLMAYSSDVAEMLGLS 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ PD G L G PYA CYGGHQFG WAGQLGDGRAI LGE+++ ++R+
Sbjct: 60 DADMHDPDMVNALGGNQLLPGMQPYATCYGGHQFGNWAGQLGDGRAIYLGELVH-NNQRF 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG+TPYSR ADG AVLRSS+REFLCSEAM++LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGETPYSRRADGRAVLRSSLREFLCSEAMYYLGVPTTRALSLVCTGDQVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDGNP+ E GAIVCRVA SF RFG +++ ASRG +L +++ + + I F + +
Sbjct: 179 YDGNPQMEQGAIVCRVAPSFTRFGHFELLASRG--NLALLKQMIGFTIDRDF---SDWLQ 233
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
++ + S + ++++ AW E+ ERTA ++A W VGF HGV+NTDNMSI+G
Sbjct: 234 QQNHTLSKDEPSTALIE-------AWFTEICERTARMIAHWMRVGFVHGVMNTDNMSIIG 286
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D FDP +TPNTTD GRRYCF Q DIG WN+ + + L+ L D
Sbjct: 287 LTIDYGPYGWVDNFDPGWTPNTTDAQGRRYCFGRQHDIGRWNLERLADALSTI-LPDAVG 345
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSN 518
N+ +++Y T + + K GL + + ++I++ M +VD T FF LS+
Sbjct: 346 LNHALDQYETVYTQSLIDALVGKFGLDTWQDDDGELINRCFELMTRAEVDMTLFFTHLSH 405
Query: 519 VK-ADPSIPEDELLVPLKAVLLDIGKERKEA-WISWVLSYIQELLSSGISDEERKALMNS 576
+ A P+I + ++ A + G E+ + +W+ Y + +L S S R+A M S
Sbjct: 406 INLASPNIADLKI-----AFYTEQGYTNFESDFNAWLAQYAKRILQSTESIAARQARMAS 460
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPG 635
NP+YVLRNYL Q AID AE GD + LLKL++ PY +Q GMEK+ P WA ++ G
Sbjct: 461 HNPRYVLRNYLAQEAIDLAEQGDSSMIETLLKLLKNPYTQQAGMEKFEDKRPDWARHKAG 520
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 521 CSMLSCSS 528
>gi|389775135|ref|ZP_10193185.1| hypothetical protein UU7_04657 [Rhodanobacter spathiphylli B39]
gi|388437468|gb|EIL94261.1| hypothetical protein UU7_04657 [Rhodanobacter spathiphylli B39]
Length = 519
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 257/546 (47%), Positives = 337/546 (61%), Gaps = 37/546 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L++D++FVR+LPGDP+ + R+V A Y++++P+ V P+L+A S +A +L E
Sbjct: 4 LHFDNAFVRDLPGDPQQGAGLRQVEGALYSRIAPT-PVAAPRLLAHSAEMAATLGFSEAE 62
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P+F F G L G PYA YGGHQFG WAGQLGDGRAI+LGE++N ERWELQ
Sbjct: 63 VAAPEFARLFGGNVLLDGMQPYAANYGGHQFGHWAGQLGDGRAISLGEVINAAGERWELQ 122
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG
Sbjct: 123 LKGAGLTPYSRGADGRAVLRSSVREFLCSEAMHHLGVPTTRALSLVGTGEPVLRDMFYDG 182
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
N EPGAIVCR A SFLRFG++++ ASRG D+ ++R L D+AIR F ++ + E+
Sbjct: 183 NAATEPGAIVCRAAPSFLRFGNFELPASRG--DIGLLRQLVDFAIRRDFPELQ--GQGEA 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L YA W +V ERTA+++A W VGF HGV+NTDNMSILGLTI
Sbjct: 239 L------------------YAEWFAQVCERTAAMIAHWMRVGFVHGVMNTDNMSILGLTI 280
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD RRY F QPD+ WN+++ + LA D
Sbjct: 281 DYGPYGWIDNYDPDWTPNTTDAQRRRYRFGQQPDVAWWNLSRLAGALAPL-FADVAPLQA 339
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAV----DKVDYTNFFRALSNVK 520
++RY +A + KLG + ++ L+ ++ V ++D T +FRAL+++
Sbjct: 340 GLDRYVAAHAAADRANIAAKLGFAECRDDDMA-LMQSLQVLLQQAEIDMTLWFRALADI- 397
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKE--AWISWVLSYIQELLSSGISDEERKALMNSVN 578
D P L P D K R+ A W+ Y L + R+ M N
Sbjct: 398 -DMRAPT---LAPFAEAFYDEAKRREAEPALDDWLRRYAARLADDPLPAGSRREQMRLAN 453
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVC 637
P+YVLRNYL Q AID AE GD + LL ++ PYD+QPG E +A+ P WA ++ G
Sbjct: 454 PRYVLRNYLAQQAIDRAEQGDLDGITELLDVLRHPYDDQPGREAFAQRRPDWARHKAGCS 513
Query: 638 MLSCSS 643
MLSCSS
Sbjct: 514 MLSCSS 519
>gi|285017898|ref|YP_003375609.1| hypothetical protein XALc_1107 [Xanthomonas albilineans GPE PC73]
gi|283473116|emb|CBA15622.1| hypothetical protein XALC_1107 [Xanthomonas albilineans GPE PC73]
Length = 523
Score = 466 bits (1200), Expect = e-128, Method: Compositional matrix adjust.
Identities = 261/546 (47%), Positives = 331/546 (60%), Gaps = 33/546 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ F ELPGDP T REVL A +++V+P++ V PQL+A+S VA L L +E
Sbjct: 4 LRFDNRFTAELPGDPETSPRRREVLGALWSQVAPTS-VPAPQLLAYSREVAAMLGLSEQE 62
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F F G AG PYA YGGHQFG WAGQLGDGRAI LGE L RWELQ
Sbjct: 63 VLAPHFAAVFGGNACDAGMRPYAANYGGHQFGHWAGQLGDGRAIALGEALGEDGRRWELQ 122
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR DG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG
Sbjct: 123 LKGAGPTPYSRGGDGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGETVVRDMFYDG 182
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGA+VCRVA SF+RFGS+++ A+RG D ++R LAD+ I F H++
Sbjct: 183 HPRPEPGAVVCRVAPSFVRFGSFELPAARG--DTLLLRRLADFVIARDFPHLQ------- 233
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
++G+ ++YA W ++ RTA +VA W VGF HGV+NTDNMSILGLT+
Sbjct: 234 ---ASGN----------DRYADWFADICVRTAHMVAHWMRVGFVHGVMNTDNMSILGLTL 280
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEAN 463
DYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + L L D+
Sbjct: 281 DYGPYGWIDNYDPDWTPNTTDAQGRRYRFGTQPQLAYWNLGRLAQAL--VPLFDEVAPLQ 338
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVK 520
+ R+ ++ + KLGL K + ++ LL + +VD T +FR LS
Sbjct: 339 DGLMRFSAEYAQAERDTTAAKLGLAKCEDEDLTLMRDLLALLQQAEVDMTLWFRGLSAQP 398
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWI--SWVLSYIQELLSSGISDEERKALMNSVN 578
P L L D + +A + SW+ Y Q L +S R A M + N
Sbjct: 399 VQAGTPAQALAA-LADAFYDPAQLAAQAAMFESWLQRYAQRLGRDPLSASVRAAKMRAAN 457
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVC 637
P+YVLRNYL Q AID AE GD + LL++M RPY++QPG E +A P WA R G
Sbjct: 458 PRYVLRNYLAQQAIDRAEQGDTAGIAELLEVMRRPYEDQPGREAFAARRPDWARTRAGCS 517
Query: 638 MLSCSS 643
MLSCSS
Sbjct: 518 MLSCSS 523
>gi|163755646|ref|ZP_02162765.1| hypothetical protein KAOT1_05777 [Kordia algicida OT-1]
gi|161324559|gb|EDP95889.1| hypothetical protein KAOT1_05777 [Kordia algicida OT-1]
Length = 520
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 252/546 (46%), Positives = 345/546 (63%), Gaps = 35/546 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
LN +F +ELP DP + PR+V ACY+ V+P + NP L+ ++ VA+ L+L+ ++
Sbjct: 3 LNIKDTFNKELPADPNITNTPRKVFEACYSFVTPR-KPSNPTLIHVADEVAEMLDLE-RD 60
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+ +F FSG T PYA CYGGHQFG WAGQLGDGRAI L EI + + + LQ
Sbjct: 61 TQSEEFLHTFSGKTVYPKTKPYAMCYGGHQFGHWAGQLGDGRAINLAEIRS-SGKPFALQ 119
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR DGLAVLRSSIRE LCSEAMH+LG+PTTR+L ++ TG V RDM YDG
Sbjct: 120 LKGAGETPYSRRGDGLAVLRSSIREHLCSEAMHYLGVPTTRSLSIMLTGDEVLRDMLYDG 179
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
N + E GA+VCRVA +F+RFG++QI A+R +D ++ L DY IRH +++I+
Sbjct: 180 NQEYEKGAVVCRVAPTFIRFGNFQIFAAR--KDHKNLKNLTDYTIRHFYKNIQ------- 230
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
S G E KY A+ +V+E + +V WQ VGF HGV+NTDNMSILGLTI
Sbjct: 231 ---SEGKE----------KYIAFFQKVSEASLEMVLHWQRVGFVHGVMNTDNMSILGLTI 277
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEAN 463
DYGP+G+L+ ++P++TPNTTD RY + NQP I LWN+ Q + L LI+D K
Sbjct: 278 DYGPYGWLEGYEPNWTPNTTDSREHRYAYGNQPGIVLWNLVQLANALYP--LIEDAKPLE 335
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFFRALSNVK 520
++E Y F +Y +M++KLGL + N +Q+I L N+ + + D T FFR L V+
Sbjct: 336 DILENYQKSFDLKYVQMMSQKLGLTEINTETEQLIEDLQQNLQLTETDMTIFFRELPRVQ 395
Query: 521 ADPSIPEDELLVPLKAVL--LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVN 578
+ P++ K+ L++ +AWI+W YI+ L DE RK M VN
Sbjct: 396 KK-NTPQEAFQKIHKSFYKPLELAGATTDAWITWFTKYIERLQVEVDRDETRKFKMYEVN 454
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVC 637
PK+VLRNY+ Q AI+AA+ GD+ + L ++++RPY+EQ EK+ P WA ++ G
Sbjct: 455 PKFVLRNYMAQLAINAADNGDYSVLNELYEVLKRPYNEQTEYEKWYAKRPEWARHKVGCS 514
Query: 638 MLSCSS 643
MLSCSS
Sbjct: 515 MLSCSS 520
>gi|226229228|ref|YP_002763334.1| hypothetical protein GAU_3822 [Gemmatimonas aurantiaca T-27]
gi|259647019|sp|C1AED7.1|Y3822_GEMAT RecName: Full=UPF0061 protein GAU_3822
gi|226092419|dbj|BAH40864.1| hypothetical protein GAU_3822 [Gemmatimonas aurantiaca T-27]
Length = 522
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 253/548 (46%), Positives = 330/548 (60%), Gaps = 32/548 (5%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
++ L +D+ FV ELPGDP + R+VL A ++ V P+ V PQL+A + VA L
Sbjct: 1 MQTLRFDNRFVDELPGDPDPRNQRRQVLGAAWSAVQPT-PVTAPQLLAVAPDVAAMLGFS 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
P++ P+F F G L G P+A CYGGHQFG WAGQLGDGRAI+LGE++ +RW
Sbjct: 60 PEQTASPEFAAVFGGNALLEGMRPWAACYGGHQFGQWAGQLGDGRAISLGELVTTAGDRW 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LVTTG V RD+
Sbjct: 120 ELQLKGAGPTPYSRTADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVTTGDPVVRDVL 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
Y+GNP EPGA+VCRVA SF+RFG+++I +R DL + L D+ I F HI+
Sbjct: 180 YNGNPAPEPGAVVCRVAPSFVRFGNFEIFTAR--HDLTTLAQLVDFTIARDFPHID---- 233
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
GD D + AAW EV ERTA L+ W VGF HGV+NTDNMSILG
Sbjct: 234 --------GDVD--------ARRAAWFREVCERTAHLMVHWMRVGFVHGVMNTDNMSILG 277
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G+LD FDP +TPNTTD GRRY +A QP + WN+ + + +A D
Sbjct: 278 LTIDYGPYGWLDNFDPQWTPNTTDAQGRRYRYAQQPAVAQWNLMRLADAIAPL-FRDVTP 336
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFFRALSN 518
++ YG F+ ++A+ K G + +I++ M +D+T FFRAL +
Sbjct: 337 LQAGLDHYGDVFLVAHEAMQAAKFGFVRQGPDEDALITEAFALMERVDIDFTRFFRALGD 396
Query: 519 VKADPSIPEDELLVPLKAVLLD--IGKERKEAWISWVLSYIQELLSSGISDEERKALMNS 576
A ++ + + L V D + +A +W+ + + R+ M++
Sbjct: 397 APA--ALGDASAVTVLGDVFYDATLRDTHADALTAWLRRWHVAVGRQRPDAATRRTAMHA 454
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPG 635
VNP +VLRNY+ Q AIDAA GD +VR LL+++ RPYDEQP P WA ++ G
Sbjct: 455 VNPWFVLRNYVAQQAIDAATAGDPSQVRLLLEVLRRPYDEQPEHAALVARRPEWARHKVG 514
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 515 CSMLSCSS 522
>gi|82702639|ref|YP_412205.1| hypothetical protein Nmul_A1510 [Nitrosospira multiformis ATCC
25196]
gi|121957807|sp|Q2Y8V8.1|Y1510_NITMU RecName: Full=UPF0061 protein Nmul_A1510
gi|82410704|gb|ABB74813.1| Protein of unknown function UPF0061 [Nitrosospira multiformis ATCC
25196]
Length = 565
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 263/578 (45%), Positives = 345/578 (59%), Gaps = 60/578 (10%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
L L D +D+ FVR+LPGDP T ++PR+V +A YT+VSP+ V +P+L+AW++ V + L
Sbjct: 15 LPDLFDARFDNRFVRQLPGDPETRNVPRQVRNAGYTQVSPTP-VRSPRLLAWADEVGEML 73
Query: 159 ELDPKEFERPDFPL-----FFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
+ RP P+ +G L PYA YGGHQFG WAGQLGDGRAITLGE+
Sbjct: 74 GI-----ARPASPVSPAVEVLAGNRILPSMQPYAARYGGHQFGHWAGQLGDGRAITLGEL 128
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
++ +R+ELQLKGAGKTPYSR ADG AVLRSS+REFLCSEAMH LG+PTTRAL LV TG
Sbjct: 129 ISPNDKRYELQLKGAGKTPYSRTADGRAVLRSSVREFLCSEAMHSLGVPTTRALSLVATG 188
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHF 333
+ V RDMFYDG+P EPGAIVCRV+ SFLRFG+++I A+ Q++ +++R LAD+ I HF
Sbjct: 189 EAVIRDMFYDGHPGAEPGAIVCRVSPSFLRFGNFEILAA--QKEPELLRQLADFVIGEHF 246
Query: 334 RHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLN 393
+ + ++ + YA W EV RT LVA W VGF HGV+N
Sbjct: 247 PELASSHRPPEV------------------YAKWFEEVCRRTGILVAHWMRVGFVHGVMN 288
Query: 394 TDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAA 453
TDNMSILGLTIDYGP+G+L+ FD +TPNTTD GRRYC+ NQP I WN+ + + L
Sbjct: 289 TDNMSILGLTIDYGPYGWLEGFDLHWTPNTTDAQGRRYCYGNQPKIAQWNLTRLAGALTP 348
Query: 454 AKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK----QIISKLLNNMAVDKVDY 509
+ DD + + +G F + + ++ KLGL ++S L + + D
Sbjct: 349 L-IEDDAALEHGLAVFGETFNNTWSGMLAAKLGLASLEHSDDDSLLSDLFETLQQVETDM 407
Query: 510 TNFFRALSNVKADP---------SIPE---------DELLVPL-KAVLLDIGKERKEAWI 550
T FFR L N+ +P PE D LV L + D + A +
Sbjct: 408 TLFFRCLMNIPLNPISGNRATTFPAPENLESVDQMNDHGLVELFRPAFYDAHQAFSHAHL 467
Query: 551 S----WVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
+ W+ YI + G + R M+ NPKYVLRNYL Q AI+A E GD + RL
Sbjct: 468 TRLAGWLRRYIARVRQEGEPEGLRYHRMSRANPKYVLRNYLAQQAIEALERGDDSVIIRL 527
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAY-RPGVCMLSCSS 643
+++++ PYDEQP E A P WA +PG LSCSS
Sbjct: 528 MEMLKHPYDEQPEHEDLAARRPEWARNKPGCSALSCSS 565
>gi|389810095|ref|ZP_10205677.1| hypothetical protein UUA_14891 [Rhodanobacter thiooxydans LCS2]
gi|388441083|gb|EIL97388.1| hypothetical protein UUA_14891 [Rhodanobacter thiooxydans LCS2]
Length = 519
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 257/547 (46%), Positives = 335/547 (61%), Gaps = 37/547 (6%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
DL +D+ FVRELPGDP + R+V A Y++V P+ V P+L+A+S +A +L
Sbjct: 3 DLRFDNVFVRELPGDPEQGARLRQVDGALYSRVDPT-PVAAPRLLAYSAEMATALGFSAA 61
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ P+F F G L G PYA YGGHQFG WAGQLGDGRAI+LGE++N ERWEL
Sbjct: 62 DLAAPEFAQVFGGNVLLDGMQPYAANYGGHQFGHWAGQLGDGRAISLGEVVNAAGERWEL 121
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG TPYSR ADG AVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYD
Sbjct: 122 QLKGAGLTPYSRGADGRAVLRSSVREFLCSEAMHHLGVPTTRALSLVGTGETVVRDMFYD 181
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+ E GAIVCR A SF+RFG++++ SRG D+ ++R L ++ IR F +E E
Sbjct: 182 GHAAPESGAIVCRAAPSFIRFGNFELPTSRG--DIALLRQLVEFTIRRDFPELE--GSGE 237
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+L YAAW +V ERTA+L+A W VGF HGV+NTDNMSILGLT
Sbjct: 238 TL------------------YAAWFRQVCERTATLLAHWMRVGFVHGVINTDNMSILGLT 279
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-A 462
IDYGP+G++D +DP +TPNTTD RRY + QP++ WN++ + L A L D E
Sbjct: 280 IDYGPYGWVDNYDPDWTPNTTDAQRRRYRYGQQPNVAWWNLSCLTGAL--APLFDGVELL 337
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNV 519
++ Y + +A + KLGL + ++ ++ L + + +VD T +FRAL++V
Sbjct: 338 EAGLQHYAATYAAADRANVAAKLGLAECREEDAALMQSLQSLLQQAEVDMTLWFRALADV 397
Query: 520 KADPSIPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKALMNSV 577
D P L P D K R + A+ W+ Y L + +R+ M
Sbjct: 398 --DVQAPT---LAPFGEAFYDEAKRRAAEPAFADWLARYAARLADDPLPPPQRRERMRLA 452
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGV 636
NP+YVLRNYL Q AID AE GD + LL ++ PYD+QPG E YA+ P WA ++ G
Sbjct: 453 NPRYVLRNYLAQQAIDRAEQGDMAGIHELLDVLRHPYDDQPGREAYAQKRPDWARHKAGC 512
Query: 637 CMLSCSS 643
LSCSS
Sbjct: 513 STLSCSS 519
>gi|345866609|ref|ZP_08818634.1| hypothetical protein BZARG_2149 [Bizionia argentinensis JUB59]
gi|344048953|gb|EGV44552.1| hypothetical protein BZARG_2149 [Bizionia argentinensis JUB59]
Length = 524
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 253/559 (45%), Positives = 343/559 (61%), Gaps = 45/559 (8%)
Query: 95 MTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV 154
MTK++K N F++ELP DP ++ R+VL AC++ V P + P+L+ S+ +
Sbjct: 1 MTKQIK----FNIKDRFIKELPADPILENSRRQVLKACFSYVEPK-KTAKPELLHVSDEM 55
Query: 155 ADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
+L L + F F+G T L PYA CYGGHQFG WAGQLGDGRAI L EI
Sbjct: 56 LTNLGLSEADSHSEHFLNVFTGNTVLENTKPYAMCYGGHQFGNWAGQLGDGRAINLFEIE 115
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGK 274
+ ++ W LQLKGAG+TPYSR DGLAVLRSS+RE+LCSEAM+ LG+PTTRAL + TG
Sbjct: 116 H-DNKSWVLQLKGAGETPYSRSGDGLAVLRSSVREYLCSEAMYHLGVPTTRALSIAITGD 174
Query: 275 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFR 334
V RDM YDGN E GA+V R++ SFLRFGSY+I +SR +D++ ++TL DY I+HHF
Sbjct: 175 NVLRDMLYDGNSAYEKGAVVSRISPSFLRFGSYEIFSSR--QDVESLKTLVDYTIKHHFS 232
Query: 335 HIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
+ +K + F EV++RT ++ WQ VGF HGV+NT
Sbjct: 233 RLGAPSKETYIQF--------------------FAEVSQRTLEMIIHWQRVGFVHGVMNT 272
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA 454
DNMSILGLTIDYGP+G+L+ F +TPNTTD+ +RY + NQP++GLWN+ Q + L
Sbjct: 273 DNMSILGLTIDYGPYGWLEDFSYGWTPNTTDIQHKRYRYGNQPNMGLWNLYQLANALYP- 331
Query: 455 KLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYT 510
LI+D E V+ +Y T F E +M KLGL + +K +I L +N+ + + D T
Sbjct: 332 -LIEDAEPLETVLNQYKTDFDVESLKMMRSKLGLENEDELDKLLIQDLEDNLQLSETDMT 390
Query: 511 NFFRALSNV-KADPS----IPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI 565
FFR LS K +PS I D VP +I + ++ W W Y + L + +
Sbjct: 391 IFFRNLSRFNKENPSEGLKIVADAFYVP-----TEISDKIRQEWNEWFQRYAKRLQNETL 445
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYAR 625
SD +R+ MN++NPKYVLRNY+ Q AID A+ GD+ + L +L+++PY EQP EK+
Sbjct: 446 SDADRRIQMNTINPKYVLRNYMSQLAIDDADKGDYRLIDELYQLLKQPYTEQPKYEKWFA 505
Query: 626 LPPAWA-YRPGVCMLSCSS 643
P WA ++ G MLSCSS
Sbjct: 506 KRPDWAKHKAGCSMLSCSS 524
>gi|319952468|ref|YP_004163735.1| hypothetical protein [Cellulophaga algicola DSM 14237]
gi|319421128|gb|ADV48237.1| UPF0061 protein ydiU [Cellulophaga algicola DSM 14237]
Length = 521
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 245/542 (45%), Positives = 342/542 (63%), Gaps = 36/542 (6%)
Query: 110 SFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPD 169
+F + LP DP ++ R++ AC++ V+P + P+L+ S+ +A L L + + +
Sbjct: 8 TFTKTLPQDPILENSRRQISGACFSFVTPKKTAQ-PELIHTSKEMASELGLSNEALKSEE 66
Query: 170 FPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAG 229
F L F+G + PYA CYGGHQFG WAGQLGDGRAI LGE+++ K++RW LQLKGAG
Sbjct: 67 FLLLFTGNKIGENSHPYAMCYGGHQFGNWAGQLGDGRAINLGELVH-KNKRWTLQLKGAG 125
Query: 230 KTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEE 289
+TPYSR ADGLAVLRSSIRE+LCSEAM+ LG+PTTRAL + TG V RD+ Y+GNP E
Sbjct: 126 ETPYSRTADGLAVLRSSIREYLCSEAMYHLGVPTTRALSIALTGDQVLRDVLYNGNPDYE 185
Query: 290 PGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLS-FS 348
GAIV RVA SFLRFG+Y+I +SR +D + TL DY I+ F I++ NK + F
Sbjct: 186 KGAIVTRVAPSFLRFGNYEIFSSR--QDYKTLTTLVDYTIKELFPEIKSTNKEGYIQLFK 243
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGP 408
T VA+RT +++ WQ VGF HGV+NTDNMSILGLTIDYGP
Sbjct: 244 T---------------------VAQRTLTMIIHWQRVGFVHGVMNTDNMSILGLTIDYGP 282
Query: 409 FGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVME 467
+G+L+ +D ++TPNTTD +RY + NQP+IGLWN+ Q + L LI+D E ++E
Sbjct: 283 YGWLEGYDDAWTPNTTDRQHKRYRYGNQPNIGLWNLYQLANALYP--LIEDAEPFEEILE 340
Query: 468 RYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
+Y + +Y +M K+GL + + +++S L N+ + + D T FFR LS + + S
Sbjct: 341 QYKNDYAVKYLEMMKAKIGLFTTEEDDAELLSTLEENLQIIETDMTLFFRNLSVITKNDS 400
Query: 525 IPE--DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
+ + ++ V ++ ++ ++ E W +W Y++ L I+D+ER MN NPKYV
Sbjct: 401 VVDAVSKIEVAFYSI-AELKEDTLEQWKAWFNLYVKRLQKESITDQERMLKMNGTNPKYV 459
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCMLSC 641
LRNY+ Q AID A+ D+ V L L+++PYDEQP EK+ P WA + G MLSC
Sbjct: 460 LRNYMAQMAIDKADEKDYSLVDELYTLLKKPYDEQPKFEKWFSKRPEWARNKVGCSMLSC 519
Query: 642 SS 643
SS
Sbjct: 520 SS 521
>gi|334130034|ref|ZP_08503837.1| hypothetical protein METUNv1_00851 [Methyloversatilis universalis
FAM5]
gi|333445070|gb|EGK73013.1| hypothetical protein METUNv1_00851 [Methyloversatilis universalis
FAM5]
Length = 530
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 256/561 (45%), Positives = 332/561 (59%), Gaps = 43/561 (7%)
Query: 95 MTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV 154
M+ + L+++ +D+ FVR LP DP T+ R+V A Y+ +P V +PQL+ WS+ +
Sbjct: 1 MSAASRRLDEIEFDNLFVRSLPADPSTEIRSRQVPGAAYS-FTPPTPVADPQLLGWSDDL 59
Query: 155 ADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
L L + R +G L G PYA YGGHQFG WAGQLGDGRAITLGE+
Sbjct: 60 GAQLGL-ARPARRDAAVEALAGNRILPGMQPYAARYGGHQFGNWAGQLGDGRAITLGEMF 118
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGK 274
+ +R ELQLKGAG TPYSR ADG AVLRSS+REFLCSEAM LGIPTTRAL LV TG
Sbjct: 119 DTHGQRQELQLKGAGPTPYSRRADGRAVLRSSVREFLCSEAMFHLGIPTTRALSLVATGD 178
Query: 275 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFR 334
V RDMFYDG P+ EPGAIVCRVA SF+RFG ++I S ++ ++ LAD+ + HH+
Sbjct: 179 TVVRDMFYDGRPENEPGAIVCRVAPSFVRFGHFEILTS--HDETALLGQLADWVMTHHYP 236
Query: 335 HIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
I YA W E+ RTA+L+ +W VGF HGV+NT
Sbjct: 237 GI-------------------------GSYADWFAEICRRTATLMVEWMRVGFVHGVMNT 271
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA 454
DNMSILGLTIDYGP+G+L+ D +TPNTTD GRRYC+ QP IG WN+ + + L A
Sbjct: 272 DNMSILGLTIDYGPYGWLEGVDMMWTPNTTDAQGRRYCYGRQPQIGYWNLTRLAAAL--A 329
Query: 455 KLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLP------KYNKQIISKLLNNMAVDKV 507
LIDD++A + +E Y F D + A++ KLGLP + + S+L + ++
Sbjct: 330 PLIDDRDAIDAALEGYEQTFSDGWTAMLANKLGLPMPAAGDDADADMRSRLFLLLQEEEC 389
Query: 508 DYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG----KERKEAWISWVLSYIQELLSS 563
D+T FFR L+ V + D + G + A + W+ + + +
Sbjct: 390 DFTIFFRQLAGVPLAAAAAGDAAALAPLHAAFYSGDGPSADHGRALLGWLQQWAARISAG 449
Query: 564 GISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKY 623
G D R A MN+ NPKYV+RN+L Q AID A GD G + RLLK+M RPYDEQP +
Sbjct: 450 GEPDAARIARMNATNPKYVVRNWLAQRAIDDATAGDTGMIERLLKMMRRPYDEQPEFDDL 509
Query: 624 ARLPPAWA-YRPGVCMLSCSS 643
A P WA ++PG LSCSS
Sbjct: 510 AGRRPEWARHKPGCSALSCSS 530
>gi|340616633|ref|YP_004735086.1| hypothetical protein zobellia_624 [Zobellia galactanivorans]
gi|339731430|emb|CAZ94695.1| UPF0061 family protein [Zobellia galactanivorans]
Length = 522
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 255/546 (46%), Positives = 332/546 (60%), Gaps = 33/546 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
N +F +ELP DP T++ R+V AC++ V+P P LV S +A+ L L ++
Sbjct: 3 FNIQDTFNKELPADPITENSRRQVERACFSYVTPK-HTARPSLVHVSPEMAEELGLSEED 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+F F+G T L G PYA CYGGHQFG WAGQLGDGRAI L E+ + + W LQ
Sbjct: 62 IRSEEFLKVFTGNTVLDGTAPYAMCYGGHQFGNWAGQLGDGRAINLMEVEH-NGKHWALQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADGLAVLRSSIRE+LCSEAM+ LG+PTTRAL L +G V RD+ Y+G
Sbjct: 121 LKGAGETPYSRTADGLAVLRSSIREYLCSEAMYHLGVPTTRALSLALSGDQVLRDVLYNG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
NP E GAIVCRVA SFLRFG+YQI A+R ED + TL +Y I+H F + +K+
Sbjct: 181 NPAYEKGAIVCRVAPSFLRFGNYQIFAAR--EDTATMGTLVNYTIKHFFPELGAPSKASY 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ F VA+ T ++ WQ VGF HGV+NTDN+SILGLTI
Sbjct: 239 VQFFQA--------------------VADATLEMLVHWQRVGFVHGVMNTDNLSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-AN 463
DYGP+G+L+ +D +TPNTTD +RY + NQP+IGLWN+ Q + A LI + E
Sbjct: 279 DYGPYGWLEGYDHGWTPNTTDRQHKRYRYGNQPNIGLWNLYQLAN--AIFPLIGEAEPLE 336
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLL---NNMAVDKVDYTNFFRALSNVK 520
V+E + TKF +Y+ +M K+GL K + L N+ + + D T FFR L+N K
Sbjct: 337 AVLEGFKTKFEQKYRDMMKSKIGLYKADDLDPHLLDDLEENLQLTETDMTLFFRNLANFK 396
Query: 521 ADPSIPEDELLVPLKAVLL--DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVN 578
+ + V +A + ++ E E W W +Y L +SD ERK MNSVN
Sbjct: 397 KQVTDSGAFMEVVGEAFYVPDEVSGEVLEKWKVWFATYQSRLGQEELSDTERKQKMNSVN 456
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVC 637
PKYVLRNY+ Q AIDAA+ GD+ + L L+++PYDEQP EK+ P WA + G
Sbjct: 457 PKYVLRNYMAQLAIDAADKGDYALIDELFVLLKKPYDEQPEQEKWFAKRPDWARNKVGCS 516
Query: 638 MLSCSS 643
MLSCSS
Sbjct: 517 MLSCSS 522
>gi|319787048|ref|YP_004146523.1| hypothetical protein Psesu_1445 [Pseudoxanthomonas suwonensis 11-1]
gi|317465560|gb|ADV27292.1| protein of unknown function UPF0061 [Pseudoxanthomonas suwonensis
11-1]
Length = 517
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 256/550 (46%), Positives = 327/550 (59%), Gaps = 46/550 (8%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
+ +D+SF+R+LPGDP REV A +++V P+ V +P+L+AWS A + L ++
Sbjct: 3 IEFDNSFLRDLPGDPEAGPRVREVF-AAWSRVDPT-PVADPRLLAWSPEAAALVGLGAED 60
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
PDF G L G P+A YGGHQFG WAGQLGDGRAI+LGE + RWELQ
Sbjct: 61 VADPDFARVCGGNALLEGMQPWAANYGGHQFGSWAGQLGDGRAISLGEAIAADGRRWELQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSRFADG AVLRSSIREFLCSEAMH LGIPTTRAL LV TG+ V RDMFYDG
Sbjct: 121 LKGAGRTPYSRFADGRAVLRSSIREFLCSEAMHHLGIPTTRALSLVGTGEEVVRDMFYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGA+VCR+A SFLRFGS+Q+ ASRG D ++R L D+ RHHF + + +
Sbjct: 181 HPRPEPGAVVCRMAPSFLRFGSWQLPASRG--DTALLRQLTDHVQRHHFPDLHGLGPA-- 236
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
GD A W +V ERTA +VA W VGF HGV+NTDNMSILGLTI
Sbjct: 237 -----GD-------------AEWFAQVCERTAEMVAGWMRVGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLA-----AAKLIDD 459
DYGP+G+L+ +DP +TPNTTD GRRY + QP + WN+ + + LA AA L
Sbjct: 279 DYGPYGWLEDYDPGWTPNTTDAQGRRYRYGTQPQVAYWNLTRLAQALAPLFGEAAPL--- 335
Query: 460 KEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRAL 516
EA ++R+ + + ++ KLGL + + L + + D T FFR L
Sbjct: 336 -EAG--LQRFLDAWARAEREMVAGKLGLARAGADDVALFEDLRTVLQAGQFDLTAFFRRL 392
Query: 517 SNVKADPSIPEDELLVPLKAVLLDIGKERKEAWI--SWVLSYIQELLSSGISDEERKALM 574
P + AV D W+ Y L ++ E+R+ M
Sbjct: 393 GE-----GDPAADDAGGFAAVSYDADAFASATAALSDWLARYAARLADDPLTAEQRRERM 447
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-R 633
NP+YV RN+L Q AID AE G+ + LL++M RPY++QPG + YA L P WA R
Sbjct: 448 RLANPRYVPRNWLAQEAIDQAEAGNLAPLSNLLEVMRRPYEDQPGRDHYAGLRPGWARDR 507
Query: 634 PGVCMLSCSS 643
G MLSCSS
Sbjct: 508 AGCSMLSCSS 517
>gi|440733290|ref|ZP_20913047.1| hypothetical protein A989_16868 [Xanthomonas translucens DAR61454]
gi|440363305|gb|ELQ00474.1| hypothetical protein A989_16868 [Xanthomonas translucens DAR61454]
Length = 517
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 259/545 (47%), Positives = 321/545 (58%), Gaps = 37/545 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L D+ F ELPGDP REVL A +++V+P+ V PQL+A S VA L +E
Sbjct: 4 LRLDNRFTAELPGDPERGPRLREVLGALWSEVAPT-PVAAPQLLAHSREVAAMLGFSEQE 62
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F F+G G PYA YGGHQFG WAGQLGDGRAI LGE L RWELQ
Sbjct: 63 VLAAQFAEVFAGNALYPGMRPYAANYGGHQFGHWAGQLGDGRAIALGEALGADGRRWELQ 122
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV +G+ V RDMFYDG
Sbjct: 123 LKGAGRTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVASGERVVRDMFYDG 182
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGA+VCRVA SF+RFGS+++ A+RG D ++R LAD+ I F + S
Sbjct: 183 HPRAEPGAVVCRVAPSFVRFGSFELPAARG--DTALLRQLADFVIDRDFPALRTCGAS-- 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+YA W EV RTA++VAQW VGF HGV+NTDNMSILGLTI
Sbjct: 239 ------------------RYADWFGEVCARTAAMVAQWMRVGFVHGVMNTDNMSILGLTI 280
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD GRRY F QP I WN+ + + L A D
Sbjct: 281 DYGPYGWIDDYDPDWTPNTTDAQGRRYRFGTQPQIAYWNLTRLAQAL-APLFADVAPLQA 339
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ R+ + + KLGL + ++ LL + +VD T +FR LS +
Sbjct: 340 GLARFRDTYAQAERDSAAAKLGLAECGAADLALLQDLLQLLQQGEVDMTLWFRGLSAAQ- 398
Query: 522 DPSIPEDELLVPLKAVLLDIGK--ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNP 579
+P +L L D K + A+ +W+ Y Q L + + R M + NP
Sbjct: 399 ---LP---MLADLADAFYDPAKLAAQAPAFEAWLARYAQRLQADPLPAAARVTKMRAANP 452
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCM 638
+YVLRNYL Q AID AE GD + LL ++ RPYDEQPG E +A P WA R G M
Sbjct: 453 RYVLRNYLAQQAIDRAEQGDADGIAELLDVLRRPYDEQPGREGFAARRPDWARERAGCSM 512
Query: 639 LSCSS 643
LSCSS
Sbjct: 513 LSCSS 517
>gi|357417150|ref|YP_004930170.1| hypothetical protein DSC_07390 [Pseudoxanthomonas spadix BD-a59]
gi|355334728|gb|AER56129.1| hypothetical protein DSC_07390 [Pseudoxanthomonas spadix BD-a59]
Length = 518
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 254/545 (46%), Positives = 325/545 (59%), Gaps = 35/545 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
LN+D+ +RELPGDP + R+V A +++V+P+A V P+++AWS VA L L +
Sbjct: 3 LNFDNRLLRELPGDPVSGPQVRQVRGALWSQVAPTA-VAAPRVLAWSAEVASLLGLSAGD 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F F G L G PYA YGGHQFG WAGQLGDGRAI LGE++ R ELQ
Sbjct: 62 IADPQFAQVFGGNALLPGMAPYATNYGGHQFGNWAGQLGDGRAICLGEVIAADGSRQELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSRFADG AVLRSSIREFLCSEAM LG+PTTRALCL+ TG+ V RDMFYDG
Sbjct: 122 LKGAGPTPYSRFADGRAVLRSSIREFLCSEAMAHLGVPTTRALCLIGTGEAVVRDMFYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+ EPGA+VCRVA S LRFG +++ ASRG+ L +R L D+ I F H++
Sbjct: 182 HAAPEPGAVVCRVAPSLLRFGHFELPASRGESAL--LRQLVDFTIARDFPHLDGPAGQA- 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ AAW EV RTA+L+A W VGF HGV+NTDN+SI GLTI
Sbjct: 239 ------------------RDAAWFAEVCTRTATLMAHWMRVGFVHGVMNTDNLSITGLTI 280
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D FD +TPNTTD GRRY F QP + WN+++ + LA D
Sbjct: 281 DYGPYGWIDDFDLDWTPNTTDASGRRYRFGWQPQVAFWNLSRLAGALAPL-FTDATPLED 339
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ Y + +A + KLGL + ++ +++ L + +VD T FFR L
Sbjct: 340 ALRGYAEAYAAAERATIAAKLGLAECGPADQALMADLHALLQQAEVDMTLFFRGLGE--- 396
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKE--AWISWVLSYIQELLSSGISDEERKALMNSVNP 579
P + L L+ D K A+ +W+ Y Q G S++ R+ M + NP
Sbjct: 397 --HPPGAQALQGLREAFYDDAKYHAHAGAFGAWLQRYAQRCAQEG-SEQARRTRMRAANP 453
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCM 638
+YVLRNYL Q AID A GD G V LL+++ PYD+QPG E +AR P WA +PG M
Sbjct: 454 RYVLRNYLAQQAIDRAHAGDLGGVHALLEVLRHPYDDQPGREAFARKRPDWARSKPGCSM 513
Query: 639 LSCSS 643
LSCSS
Sbjct: 514 LSCSS 518
>gi|433679773|ref|ZP_20511465.1| UPF0061 protein [Xanthomonas translucens pv. translucens DSM 18974]
gi|430815118|emb|CCP42077.1| UPF0061 protein [Xanthomonas translucens pv. translucens DSM 18974]
Length = 517
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 258/545 (47%), Positives = 321/545 (58%), Gaps = 37/545 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L D+ F ELPGDP REVL A +++V+P+ V PQL+A S VA L +E
Sbjct: 4 LRLDNRFTAELPGDPERGPRLREVLGALWSEVAPT-PVAAPQLLAHSREVAAMLGFSEQE 62
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F F+G G PYA YGGHQFG WAGQLGDGRAI LGE L RWELQ
Sbjct: 63 VLAAQFAEVFAGNALYPGMRPYAANYGGHQFGHWAGQLGDGRAIALGEALGADGRRWELQ 122
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV +G+ V RDMFYDG
Sbjct: 123 LKGAGRTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVASGERVVRDMFYDG 182
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGA+VCRVA SF+RFGS+++ A+RG D ++R LAD+ I F + S
Sbjct: 183 HPRAEPGAVVCRVAPSFVRFGSFELPAARG--DTALLRQLADFVIDRDFPRLRTCGAS-- 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+YA W EV RTA++VAQW VGF HGV+NTDNMSILGLTI
Sbjct: 239 ------------------RYADWFGEVCARTATMVAQWMRVGFVHGVMNTDNMSILGLTI 280
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD GRRY F QP I WN+ + + L A D
Sbjct: 281 DYGPYGWIDDYDPDWTPNTTDAQGRRYRFGTQPQIAYWNLTRLAQAL-APLFADVAPLQA 339
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ R+ + + KLGL + ++ LL+ + +VD T +FR LS +
Sbjct: 340 GLARFRDTYAQAERDSAAAKLGLAECGAADLALLQDLLHLLQQGEVDMTLWFRGLSAAQL 399
Query: 522 DPSIPE--DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNP 579
P + + D P K + A+ +W+ Y Q L + + R M + NP
Sbjct: 400 -PMLADLADAFYGPAKLA------AQAPAFEAWLARYAQRLQADPLPAAARVTKMRAANP 452
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCM 638
+YVLRNYL Q AID AE GD + LL ++ RPYDEQPG E +A P WA R G M
Sbjct: 453 RYVLRNYLAQQAIDRAEQGDADGIAELLDVLRRPYDEQPGREAFAARRPDWARERAGCSM 512
Query: 639 LSCSS 643
LSCSS
Sbjct: 513 LSCSS 517
>gi|88810326|ref|ZP_01125583.1| hypothetical protein NB231_14638 [Nitrococcus mobilis Nb-231]
gi|88791956|gb|EAR23066.1| hypothetical protein NB231_14638 [Nitrococcus mobilis Nb-231]
Length = 540
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 259/560 (46%), Positives = 334/560 (59%), Gaps = 45/560 (8%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
+LE L +D+ F RELP DP + + R V AC+++VSP P+L+A+S VA L+L
Sbjct: 9 SLERLVFDNRFTRELPADPHSHNQRRLVTGACFSRVSPQPATA-PRLIAFSREVAALLDL 67
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
+ F F+G L G P+A CYGGHQFG+WAGQLGDGRAI LGE++N ER
Sbjct: 68 SEADCRSEVFTQVFAGNRLLPGMDPHATCYGGHQFGVWAGQLGDGRAINLGEVVNAHGER 127
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
W LQLKGAG TPYSR ADG AVLRSS+REFLCSEAMH L +PTTRAL LV +GK V RDM
Sbjct: 128 WILQLKGAGPTPYSREADGFAVLRSSLREFLCSEAMHHLRVPTTRALSLVLSGKQVMRDM 187
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMN 340
FYDG P EPGAIVCRVA SF RFG ++I A+ ++ ++R L DY IR F H+
Sbjct: 188 FYDGRPALEPGAIVCRVAPSFTRFGHFEILAA--HQNTRLLRQLLDYTIRTDFPHLG--- 242
Query: 341 KSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSIL 400
+ + Y AW EV RT ++V W VGF HGV+NTDNMS+L
Sbjct: 243 -----------------EASQQTYIAWFEEVCRRTLTMVVHWMRVGFVHGVMNTDNMSVL 285
Query: 401 GLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL----AAAKL 456
G TIDYGP+G+L+ +DP +TPNTTD GRRY F QP + LWN+ Q + + +
Sbjct: 286 GQTIDYGPYGWLEGYDPDWTPNTTDAVGRRYRFEQQPQVALWNLTQLANAILPVVGQVEP 345
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDYTNF 512
+ ANY E YG ++ A+M KLGL P +K +I +LL + + + D T F
Sbjct: 346 LQQAIANYAKE-YGPAWL----AMMASKLGLSQVDPARDKPLIDELLEVLQLLETDLTLF 400
Query: 513 FRALSNVK----ADPSIPEDELLVPLKAVLL---DIGKERKEAWISWVLSYIQELLSSGI 565
+R L+ + + + LL PL + E + +W+ Y++ L +
Sbjct: 401 YRNLARLSPAGAGAHEVSDAALLEPLLPAYYAPEALTGEHRARTTAWLRRYLERLGAESA 460
Query: 566 SD-EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYA 624
D + R+ MN VNPKYVLRNYL Q AID E GD+ + LL+L+ PYDEQP E++A
Sbjct: 461 DDAKARRRRMNRVNPKYVLRNYLAQLAIDQCEQGDYALLHELLELLRHPYDEQPDKEQFA 520
Query: 625 RLPPAWA-YRPGVCMLSCSS 643
P WA R G MLSCSS
Sbjct: 521 AKRPEWARQRAGCSMLSCSS 540
>gi|365959182|ref|YP_004940749.1| hypothetical protein FCOL_00505 [Flavobacterium columnare ATCC
49512]
gi|365735863|gb|AEW84956.1| hypothetical protein FCOL_00505 [Flavobacterium columnare ATCC
49512]
Length = 523
Score = 453 bits (1166), Expect = e-124, Method: Compositional matrix adjust.
Identities = 241/543 (44%), Positives = 342/543 (62%), Gaps = 36/543 (6%)
Query: 109 HSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERP 168
+ F +ELP D ++ R+V + ++ V+P+ + P L+ + A+ L L + +
Sbjct: 9 NKFTKELPADSINENTVRKVFESAFSFVTPTPP-KKPHLIHANIGFANELGLSVSDVKSD 67
Query: 169 DFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGA 228
DF FFSG P++ CYGGHQFG+WAGQLGDGRAI L EI N ++++ LQLKGA
Sbjct: 68 DFLSFFSGKKIYPETNPFSMCYGGHQFGVWAGQLGDGRAINLFEIEN-NNKKYTLQLKGA 126
Query: 229 GKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKE 288
GKTPYSR ADGLAVLRSSIRE+LC+EAM+ LGIPTTR+L ++TTG V RD+ Y+GNP
Sbjct: 127 GKTPYSRNADGLAVLRSSIREYLCAEAMNSLGIPTTRSLSIITTGNDVLRDVLYNGNPAY 186
Query: 289 EPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
E GAIVCRVA SF+RFG++++ A+R DL ++ L D+ I+H+F I+ +
Sbjct: 187 EKGAIVCRVAPSFIRFGNFELFAARN--DLKNLQLLTDFTIKHYFPEIK----------T 234
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGP 408
TG E Y A+ VA+ T L+ WQ VGF HGV+NTDNMSI G+TIDYGP
Sbjct: 235 TGKE----------AYIAFFQTVAQLTRKLITNWQQVGFVHGVMNTDNMSIHGITIDYGP 284
Query: 409 FGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVME 467
+G+LD F+P++TPNTTD RY F NQP I LWN+ Q + L LI+ +E ++
Sbjct: 285 YGWLDDFNPNWTPNTTDAHQHRYAFGNQPQISLWNLYQLANALYP--LINQTEELEKILH 342
Query: 468 RYGTKFMDEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y ++ ++Y IM KKLGL + ++++I +L+N++ + + DYT FFR L NV + +
Sbjct: 343 EYEDEYENDYMNIMRKKLGLTQAHSTDRELIYQLINSLQLQETDYTIFFRLLGNVSKEKT 402
Query: 525 IPEDELLVPLKAVLLDI---GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
++ +++ +I E + W W +Y+ + +SDEERK MN VNPKY
Sbjct: 403 --KENAFETIQSSFYEIPNKNPEFEHLWSVWFQNYLNRINLEPLSDEERKEKMNLVNPKY 460
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCMLS 640
+LRNY+ Q AI+ AEL D+ + L +++++PY+EQP EK+ P WA + G LS
Sbjct: 461 ILRNYMAQLAIEKAELEDYTLLEELYQVIQKPYEEQPEYEKWFTKRPDWAKEKIGCSQLS 520
Query: 641 CSS 643
CSS
Sbjct: 521 CSS 523
>gi|407716880|ref|YP_006838160.1| hypothetical protein Q91_1623 [Cycloclasticus sp. P1]
gi|407257216|gb|AFT67657.1| Hypothetical protein Q91_1623 [Cycloclasticus sp. P1]
Length = 529
Score = 453 bits (1166), Expect = e-124, Method: Compositional matrix adjust.
Identities = 255/557 (45%), Positives = 344/557 (61%), Gaps = 43/557 (7%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ +L + + FV +LP D +++ PR+V AC++ VSP +++ P LV++S A L+LD
Sbjct: 1 MNNLTFSNKFVSQLPADNVSENYPRQVQGACFSWVSPK-QMKAPSLVSYSLEAAALLDLD 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ F FSG L G PYA CYGGHQFG WAGQLGDGRAI LGEI+N K ERW
Sbjct: 60 EDDCLSEQFLNTFSGNEQLDGMQPYATCYGGHQFGNWAGQLGDGRAINLGEIVNKKGERW 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
LQLKGAG TPYSR ADGLAVLRSSIREFLCSEAM LG+PTTRAL L +TG+ V RD+
Sbjct: 120 ALQLKGAGPTPYSRTADGLAVLRSSIREFLCSEAMFHLGVPTTRALSLASTGEHVMRDVM 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
Y+GNP EPGA+VCR+A SF RFG +Q +A Q++ ++++ DY + F H+ +
Sbjct: 180 YNGNPAPEPGAVVCRLAPSFTRFGHFQYYA---QQNTELLKQFVDYTLETDFPHLLEKDS 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
S Y W EV T +V +W VGF HGV+NTDNMSILG
Sbjct: 237 VPSKQI----------------YLKWFEEVCRLTCDMVIEWMRVGFVHGVMNTDNMSILG 280
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G+L+++DP++TPNTTD RY FA Q I WN+ Q + A LI++ E
Sbjct: 281 LTIDYGPYGWLESYDPNWTPNTTDATHHRYAFAQQAKIAHWNLYQLAN--AIYPLIEEAE 338
Query: 462 A-----NYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDYTNF 512
N ERYG +++ +M+KKLG + + +++ +LL+ ++ + D T F
Sbjct: 339 PLEKALNEYAERYGQQWL----LMMSKKLGFSQLEEETDSELVKQLLSFFSLHETDMTIF 394
Query: 513 FRALSNVKA---DPSIPED-ELLVPLKAVLLD-IGKERKEAWISWVLSYIQELLSSGISD 567
FR L++++ D ++ E L P A +D + + KEA W++ Y++ +
Sbjct: 395 FRRLADIQTTSDDFNVATAIEHLKP--AFYIDELELQAKEAITEWLVRYVKRCEQEPQNA 452
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLP 627
+R+ALMNSVNPKYVLRNYL Q AID +E GD V LL+++ PYDEQP E +
Sbjct: 453 VQRRALMNSVNPKYVLRNYLAQLAIDKSEKGDHSMVNELLEVLRHPYDEQPDKEHLNQKR 512
Query: 628 PAWA-YRPGVCMLSCSS 643
P WA ++ G MLSCSS
Sbjct: 513 PDWAKHKVGCSMLSCSS 529
>gi|386819270|ref|ZP_10106486.1| hypothetical protein JoomaDRAFT_1187 [Joostella marina DSM 19592]
gi|386424376|gb|EIJ38206.1| hypothetical protein JoomaDRAFT_1187 [Joostella marina DSM 19592]
Length = 523
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 247/544 (45%), Positives = 336/544 (61%), Gaps = 31/544 (5%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
LN +F +ELP DP ++ R+V A ++ V+P + P L+ S+++ +L + +E
Sbjct: 6 LNIQDTFNKELPADPILENSRRQVKEAFFSYVTPK-KTTAPALLHVSDAMLQALGISEEE 64
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+ F F+G L PYA CYGGHQFG WAGQLGDGRAI LGE+++ ++RW +Q
Sbjct: 65 KKSDAFLKIFTGNEVLDNTKPYAMCYGGHQFGNWAGQLGDGRAINLGEVVH-NNKRWAIQ 123
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADGLAVLRSSIRE+LCSEAM LG+PTTRAL L TG V RD+ Y+G
Sbjct: 124 LKGAGETPYSRSADGLAVLRSSIREYLCSEAMFHLGVPTTRALSLALTGDEVLRDVLYNG 183
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
NP E GA+VCRVA SF+RFG+++I A+RG D + ++ LADY I+H + ++
Sbjct: 184 NPAYEKGAVVCRVAPSFIRFGNFEIFAARG--DHESLKKLADYTIKHFYPYL-------- 233
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
V + Y + EVA RT V WQ VGF HGVLNTDNMSILGLTI
Sbjct: 234 ------------VTPSKEVYIQFFKEVATRTLETVLHWQRVGFVHGVLNTDNMSILGLTI 281
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-N 463
DYGP+G+L+ FD +TPNTTD +RY F NQP+IGLWN+ Q + A LID+ E
Sbjct: 282 DYGPYGWLEGFDFGWTPNTTDATNKRYRFGNQPNIGLWNLYQLAN--AIYPLIDEVEGLE 339
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVK 520
++ Y F ++ +M KLGL + ++ +I +L N+ + + D T FFR LS
Sbjct: 340 KILNDYKVDFEEKSLEMMRSKLGLEQKEEEDSRLILQLEENLELSETDMTIFFRNLSKFT 399
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
+ + +++ +I E E W +W Y L +SDE RK MN+VNPK
Sbjct: 400 KEKNGSGVDIVKEAFYSSEEIQGEILEKWNTWFTFYRNRLKKERLSDEARKEKMNNVNPK 459
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCML 639
YVLRNY+ Q AI++A+ G++ + L +L+++PYDEQP EK+ P WA ++ G ML
Sbjct: 460 YVLRNYMAQLAIESADKGNYSLIEELYQLLKKPYDEQPDNEKWFVKRPEWARHKVGCSML 519
Query: 640 SCSS 643
SCSS
Sbjct: 520 SCSS 523
>gi|307108874|gb|EFN57113.1| hypothetical protein CHLNCDRAFT_57451 [Chlorella variabilis]
Length = 1336
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 254/562 (45%), Positives = 333/562 (59%), Gaps = 58/562 (10%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
L++LEDL +D++F +LP D DS V A Y+ V+P+ P +A S +V +
Sbjct: 816 LRSLEDLQFDNTFTAQLPAD---DSE-INVSSALYSWVAPTPTGTEPTTIAASAAVGRLV 871
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
LDP E RP+F L FSG PL YAQCYGGHQFG WAGQLGDGRAI LG+ +N +
Sbjct: 872 GLDPAEALRPEFALIFSGNAPLPQTRSYAQCYGGHQFGHWAGQLGDGRAICLGQSVNGEG 931
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERWELQLKGAG+TPYSR ADG AVLRSSIRE+L SEAMH LG+PTTRAL LV TG V R
Sbjct: 932 ERWELQLKGAGRTPYSRMADGRAVLRSSIREYLASEAMHALGVPTTRALSLVATGDQVMR 991
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
DMFY+GN + EPGA+VCRV++SF+RFGS+Q+ +RG++++ +V LADY IRHH+ H++
Sbjct: 992 DMFYNGNARLEPGAVVCRVSKSFVRFGSFQLPVTRGKDEMGMVGLLADYVIRHHYPHLQG 1051
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
NKYAA+ EVA+RTA LVA+W VGF HGVLNTDNMS
Sbjct: 1052 G--------------------PGNKYAAFLAEVAQRTARLVAEWHRVGFVHGVLNTDNMS 1091
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG TIDYGP+GFL+ FDP FT P+IG WN+ Q + L A L+
Sbjct: 1092 ILGETIDYGPYGFLERFDPDFT----------------PEIGQWNLVQLARALVVAGLLS 1135
Query: 459 DK--------------EANYVMER-YGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMA 503
++ +A + E Y + KLGL Y++++ LL M
Sbjct: 1136 EEEAAPALAAYAETLTQAGLLREAGLAGPACRRYDEVQAAKLGLRAYDREVAGGLLRLMY 1195
Query: 504 VDKVDYTNFFRALSNVKADPSIPEDELLVP--LKAVLLDIGKERKEAWISWVLSYIQELL 561
D DYTN FR+LS V D + E +P L L + +ER AW WV Y L
Sbjct: 1196 EDAADYTNTFRSLSGVGLDAAGDEPASGLPPALACALGPLEEERYAAWRQWVQLYRARLA 1255
Query: 562 SSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGME 621
G++++ER A+ ++ NP V RN++ + I AE G++ + R + + +PY+ G++
Sbjct: 1256 QEGMAEQERAAIQDAANPAIVPRNHVMVTIIGEAEEGNYQPLHRYMAALLQPYNAS-GLD 1314
Query: 622 KYARLPPAWAYRPGVCMLSCSS 643
P R GV +LSCSS
Sbjct: 1315 PAWLEPAPQKCRLGVELLSCSS 1336
>gi|389797073|ref|ZP_10200117.1| hypothetical protein UUC_05136 [Rhodanobacter sp. 116-2]
gi|388447906|gb|EIM03900.1| hypothetical protein UUC_05136 [Rhodanobacter sp. 116-2]
Length = 519
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 248/547 (45%), Positives = 333/547 (60%), Gaps = 37/547 (6%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
DL +D++FVREL D + R+V A Y++V P+ V P+L+A S +A +L
Sbjct: 3 DLRFDNTFVRELASDAEQGARRRQVEGALYSRVEPTP-VAVPRLLAHSAEMAAALGFSAV 61
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ P F F G + G PYA YGGHQFG WAGQLGDGRAI+LGE++N ERWEL
Sbjct: 62 DVATPQFAQVFGGNALIEGMQPYAANYGGHQFGHWAGQLGDGRAISLGEVVNEAGERWEL 121
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG TPYSR ADG AVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYD
Sbjct: 122 QLKGAGLTPYSRGADGRAVLRSSVREFLCSEAMHHLGVPTTRALSLVGTGETVLRDMFYD 181
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+ EPGAIVCRVA SF+RFG++++ SRG D+ ++R L ++ +R F +E +
Sbjct: 182 GHAAPEPGAIVCRVAPSFIRFGNFELPTSRG--DVALLRQLVEFTLRRDFPELEGEGEV- 238
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+YAAW +V ERTA++VA W VGF HGV+NTDNMSILGLT
Sbjct: 239 -------------------RYAAWFRQVCERTATMVAHWMRVGFVHGVMNTDNMSILGLT 279
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEA 462
+DYGP+G++D +DP +TPNTTD RRY + QP++ WN++ + L A L D
Sbjct: 280 LDYGPYGWVDDYDPDWTPNTTDAQRRRYRYGQQPNVAWWNLSCLAGAL--APLFDGVGPL 337
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAV---DKVDYTNFFRALSNV 519
++ Y + +A + KLGL + ++ + + A+ ++D T +FRAL+++
Sbjct: 338 QAGLQHYAATYAAADRANVAAKLGLAECRDDDVALMQSLQALLQQAEIDMTLWFRALADL 397
Query: 520 KADPSIPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKALMNSV 577
D P L P + D K R + + W+ Y L ++ E+R+ M
Sbjct: 398 --DVQAPT---LAPFEGAFYDEAKRRAAEPELVDWLARYAARLADDPLAPEQRRERMRLA 452
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGV 636
NP+YVLRNYL Q AID AE GD + LL ++ PYD+QPG E +A+ P WA ++ G
Sbjct: 453 NPRYVLRNYLAQQAIDRAEQGDVAGIHELLDVLRHPYDDQPGREAFAQKRPDWARHKAGC 512
Query: 637 CMLSCSS 643
MLSCSS
Sbjct: 513 SMLSCSS 519
>gi|408369535|ref|ZP_11167316.1| hypothetical protein I215_01495 [Galbibacter sp. ck-I2-15]
gi|407745281|gb|EKF56847.1| hypothetical protein I215_01495 [Galbibacter sp. ck-I2-15]
Length = 526
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 245/544 (45%), Positives = 338/544 (62%), Gaps = 29/544 (5%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
+LN D+SF RELPGDP ++ R+V A Y+ V P + + P+L+ S+ ++D L L K
Sbjct: 8 NLNIDNSFTRELPGDPILENYIRQVQQASYSFVEPQ-KSKAPKLLHVSKDLSDQLGLSEK 66
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ + F +G PL+ + PYA YGGHQFG WAGQLGDGRAI +GE + +R+ L
Sbjct: 67 DIQGGQFLNIVTGNEPLSQSKPYAMNYGGHQFGNWAGQLGDGRAINIGEGIK-GDKRYVL 125
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAGKTPYSR DG AVLRSSIRE+LCSEAM LGIPTTRAL L TG V RD+ YD
Sbjct: 126 QLKGAGKTPYSRRGDGRAVLRSSIREYLCSEAMFHLGIPTTRALSLSLTGDKVLRDILYD 185
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
GNP+ E GAIV RVA SF+RFG++++++ RG D++ ++ L DY I++ + H+ +K+
Sbjct: 186 GNPEYELGAIVSRVAPSFIRFGNFELYSQRG--DIENLKRLTDYTIKYFYPHLGAPSKT- 242
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
Y A+ EV RT + WQ VGF HGVLNTDNMSILGLT
Sbjct: 243 -------------------TYIAFFKEVMRRTLDTIIHWQRVGFVHGVLNTDNMSILGLT 283
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEAN 463
IDYGP+G+L+ +D ++TPNTTDLP +RY FANQ ++GLWN+ Q + L +
Sbjct: 284 IDYGPYGWLEVYDHNWTPNTTDLPQKRYRFANQHNVGLWNLYQLANALYPLIEELEPIEE 343
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
++E Y + F +Y ++ KLGL K + +++S+L + + + D T F+R LS
Sbjct: 344 -ILESYESAFTTKYLKMLRSKLGLEKEHPDDVELLSELDQVLTLTETDMTLFYRKLSTFS 402
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
+ + ++ V ++ E K+ W +W + Y + L +DE+RK MN+ NPK
Sbjct: 403 KNKPKQGLDTIMDAFYVKEELNHEIKQKWNAWFVKYSERLKLEDAADEQRKIKMNNTNPK 462
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCML 639
YVLRNY+ Q AIDAAE GD+G + + +++ PY EQP EK+ P WA + G ML
Sbjct: 463 YVLRNYMAQLAIDAAEQGDYGLIDQFYIMLQNPYKEQPQFEKWFAKRPQWAADKVGCSML 522
Query: 640 SCSS 643
SCSS
Sbjct: 523 SCSS 526
>gi|424793540|ref|ZP_18219641.1| hypothetical protein XTG29_01982 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422796589|gb|EKU25073.1| hypothetical protein XTG29_01982 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 519
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 259/545 (47%), Positives = 324/545 (59%), Gaps = 37/545 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ F ELPGDP REVL A +++V+P+ V PQL+A S VA L +E
Sbjct: 6 LRFDNRFTAELPGDPERGPRLREVLGALWSEVAPT-PVAAPQLLAHSREVAAMLGFSEQE 64
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F F+G G PYA YGGHQFG WAGQLGDGRAI LGE L RWELQ
Sbjct: 65 VLAPQFAEVFAGNALYPGMRPYAANYGGHQFGHWAGQLGDGRAIALGEALGADGRRWELQ 124
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV +G+ V RDMFYDG
Sbjct: 125 LKGAGRTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVASGERVVRDMFYDG 184
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGA+VCRVA SF+RFGS+++ A+RG D ++R LAD I F ++
Sbjct: 185 HPRAEPGAVVCRVAPSFVRFGSFELPAARG--DTALLRQLADVVIDRDFPELQARG---- 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ +YA W EV RTA++VAQW VGF HGV+NTDNMSILGLTI
Sbjct: 239 ----------------ATRYADWFGEVCARTAAMVAQWMRVGFVHGVMNTDNMSILGLTI 282
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD GRRY F QP I WN+ + + LA D
Sbjct: 283 DYGPYGWIDDYDPDWTPNTTDAQGRRYRFGTQPQIAYWNLTRLAQALAPL-FADVAPLQD 341
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ R+ + + KLGL + + ++ LL + +VD T +FR LS +
Sbjct: 342 GLARFRQTYAQAERDSAAAKLGLAECGAADLALMQDLLQLLQQGEVDMTLWFRGLSAAQ- 400
Query: 522 DPSIPEDELLVPLKAVLLDIGK--ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNP 579
+P L L D K + A+ +W+ Y Q L + + R A M + NP
Sbjct: 401 ---LPT---LADLADAFYDPAKLAAQAPAFDAWLARYAQRLRGDPLPEAARAAKMRAANP 454
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCM 638
+YVLRNYL Q AI+ AE GD + LL ++ RPYDEQPG E +A P WA R G M
Sbjct: 455 RYVLRNYLAQQAIERAEQGDADGIAELLDVLRRPYDEQPGREAFAARRPDWARERAGCSM 514
Query: 639 LSCSS 643
LSCSS
Sbjct: 515 LSCSS 519
>gi|352090001|ref|ZP_08954238.1| protein of unknown function UPF0061 [Rhodanobacter sp. 2APBS1]
gi|351678537|gb|EHA61683.1| protein of unknown function UPF0061 [Rhodanobacter sp. 2APBS1]
Length = 519
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 249/549 (45%), Positives = 332/549 (60%), Gaps = 41/549 (7%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
DL +D++FVREL D + R+V A Y++V P+ V P+L+A S +A +L
Sbjct: 3 DLRFDNTFVRELASDAEQGARRRQVEGALYSRVEPTP-VAVPRLLAHSAEMAAALGFSAV 61
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ P F F G + G PYA YGGHQFG WAGQLGDGRAI+LGE++N ERWEL
Sbjct: 62 DVATPQFAQVFGGNALIEGMQPYAANYGGHQFGHWAGQLGDGRAISLGEVVNEAGERWEL 121
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG TPYSR ADG AVLRSS+REFLCSEAMH LG+PTTRAL LV TG+ V RDMFYD
Sbjct: 122 QLKGAGLTPYSRGADGRAVLRSSVREFLCSEAMHHLGVPTTRALSLVGTGETVLRDMFYD 181
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+ EPGAIVCR A SF+RFG++++ SRG D+ ++R L ++ +R F +E +
Sbjct: 182 GHAAPEPGAIVCRAAPSFIRFGNFELPTSRG--DVALLRQLVEFTLRRDFPELEGEGEV- 238
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+YAAW +V ERTA++VA W VGF HGV+NTDNMSILGLT
Sbjct: 239 -------------------RYAAWFRQVCERTATMVAHWMRVGFVHGVMNTDNMSILGLT 279
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK--- 460
+DYGP+G++D +DP +TPNTTD RRY + QP++ WN++ + L A L D
Sbjct: 280 LDYGPYGWVDDYDPDWTPNTTDAQRRRYRYGQQPNVAWWNLSCLAGAL--APLFDGVGPL 337
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALS 517
EA ++ Y + +A + KLGL + + ++ L + ++D T +FRAL+
Sbjct: 338 EAG--LQHYAATYAAADRANVAAKLGLAECRDDDAGLMQSLQALLQQAEIDMTLWFRALA 395
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKALMN 575
++ D P L P + D K R + + W+ Y L ++ E R+ M
Sbjct: 396 DL--DVQAPT---LAPFEGAFYDEAKRRAAEPELVDWLARYAARLADDPLAPERRRERMR 450
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRP 634
NP+YVLRNYL Q AID AE GD + LL ++ PYD+QPG E +A+ P WA ++
Sbjct: 451 LANPRYVLRNYLAQQAIDRAEQGDVAGIHELLDVLRHPYDDQPGREAFAQKRPDWARHKA 510
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 511 GCSMLSCSS 519
>gi|384428188|ref|YP_005637547.1| hypothetical protein XCR_2555 [Xanthomonas campestris pv. raphani
756C]
gi|341937290|gb|AEL07429.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 518
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 255/549 (46%), Positives = 326/549 (59%), Gaps = 44/549 (8%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ +LPGDP REVL A ++ V P+ V P L+A+S VA L L ++
Sbjct: 4 LQFDNRLRAQLPGDPEQGPRRREVL-AAWSAVRPT-PVAAPTLLAYSADVAQRLGLRAED 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+ELQ
Sbjct: 62 LASPQFAEVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGVDGGRYELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH+LG+PTTRAL LV TG V RDMFYDG
Sbjct: 122 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHYLGVPTTRALSLVGTGDAVVRDMFYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGAIVCRVA SF+RFG++++ A+RG D+D++R D+ + F + +
Sbjct: 182 HPRREPGAIVCRVAPSFIRFGNFELPAARG--DVDLLRQWVDFTLARDFPDLPGSGE--- 236
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
++ AAW +V ERTA +VA W VGF HGV+NTDNMSILGLTI
Sbjct: 237 -----------------DRIAAWFGQVCERTAVMVAHWMRVGFVHGVMNTDNMSILGLTI 279
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + L + L D +
Sbjct: 280 DYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAL--SPLFGDAAS-- 335
Query: 465 VMERYGTKFMDEYQAI----MTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALS 517
++ +F D Y A KLGL + Q+I L M ++D T FR L
Sbjct: 336 -LQAGLDQFRDTYLACDRRDTAAKLGLAECQDEDLQLIDDLRALMREAEMDMTLTFRGLV 394
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKE--AWISWVLSYIQELLSSGISDEERKALMN 575
++ P P+ + L+ D K + A +W+ Y L G SD R + M
Sbjct: 395 DLS--PQQPDASV---LREAFYDETKRAAQAPALDAWLQRYAARCLQDGASDAVRASRMR 449
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
+ NP+YVLRNYL Q AID AE GD V LL++M+ PYD+QPG E +A P WA R
Sbjct: 450 AANPRYVLRNYLAQQAIDQAEQGDLSGVHALLEVMQLPYDDQPGREAFAAKRPDWARDRA 509
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 510 GCSMLSCSS 518
>gi|305666303|ref|YP_003862590.1| hypothetical protein FB2170_08504 [Maribacter sp. HTCC2170]
gi|88708295|gb|EAR00532.1| hypothetical protein FB2170_08504 [Maribacter sp. HTCC2170]
Length = 521
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 246/547 (44%), Positives = 331/547 (60%), Gaps = 36/547 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
LN +F ELP DP ++ R+V AC++ V+P NP+L+ S + + L K+
Sbjct: 3 LNIKDTFNTELPADPILENSRRQVRGACFSLVTPR-RTSNPKLLHVSNDMLQKIGLTEKD 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+ F F+G L PYA CYGGHQFG WAGQLGDGRAI L E+ + SE W LQ
Sbjct: 62 VKNNSFLKVFTGNEVLPNTKPYAMCYGGHQFGNWAGQLGDGRAINLCEVEH-NSEHWALQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADGLAVLRSSIRE+LCSEAM LG+PTTRAL L TG V RD+ YDG
Sbjct: 121 LKGAGETPYSRTADGLAVLRSSIREYLCSEAMFHLGVPTTRALSLALTGDQVLRDVMYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
NP E GA+VCR + SF+RFG+++I A+R + + ++ L DY I H F H+ +K
Sbjct: 181 NPAYEKGAVVCRTSPSFIRFGNFEILAARNE--ISTLKKLTDYTIEHFFTHLGKPSKEVY 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L F EVA+ + +V +WQ VGF HGV+NTDNMSILGLTI
Sbjct: 239 LQFFK--------------------EVADSSLKMVIEWQRVGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-N 463
DYGP+G+L+ +DP +TPNTTD +RY F NQPDI LWN+ Q + L LI++ E +
Sbjct: 279 DYGPYGWLEGYDPDWTPNTTDRQFKRYRFDNQPDIVLWNLYQLANALYP--LIEETETLD 336
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVK 520
++ Y + F +YQ +M KLGL K +I +L + + + + D T FFR L N +
Sbjct: 337 LILTDYRSSFTKDYQNMMRSKLGLFKSKNDDSILIKELEDILQLSETDMTIFFRNLGNYE 396
Query: 521 ADPSIPEDELLVPLKAV--LLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKALMNSV 577
P++ + V A L D+ + ++ W W L Y L L ++ ERK M+S+
Sbjct: 397 VGK--PDEGIKVISDAFYKLSDVNESIRKKWDDWFLRYDNRLKLGVEVTQIERKEKMDSI 454
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGV 636
NPKYVLRNY+ Q AID A+ G++ + + L+++PY EQP +K+ P WA ++ G
Sbjct: 455 NPKYVLRNYMAQMAIDNADKGNYSLIEEIYTLLKKPYSEQPKYKKWFAKRPEWARHKVGC 514
Query: 637 CMLSCSS 643
MLSCSS
Sbjct: 515 SMLSCSS 521
>gi|376316686|emb|CCG00071.1| protein belonging to UPF0061 [uncultured Flavobacteriia bacterium]
Length = 523
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 245/551 (44%), Positives = 339/551 (61%), Gaps = 37/551 (6%)
Query: 100 KALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
K ++ L ++F +ELPGD T + R+V A Y+ P NP +V S+ + SL+
Sbjct: 3 KFVKSLTLHNTFTKELPGDENTSNSRRQVYKASYSYAEP-LNPSNPSMVIASKDLGKSLD 61
Query: 160 LDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE 219
LD E +F +G A + PYA CYGGHQFG WAGQLGDGRAI LGE+ N +
Sbjct: 62 LDDMASE--EFLHLMTGKKLAAKSTPYAMCYGGHQFGHWAGQLGDGRAINLGEV-NHDGK 118
Query: 220 RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRD 279
W LQLKGAG TPYSR ADG AVLRSS+REFLCSE+M +LG+ TTRAL L TG V RD
Sbjct: 119 SWVLQLKGAGPTPYSRGADGRAVLRSSVREFLCSESMFYLGVSTTRALSLALTGDKVLRD 178
Query: 280 MFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENM 339
+ YDGNP E GAIVCRV++SF+R G++++ ++R +DLD ++ LAD+ IRH + +++
Sbjct: 179 VLYDGNPIYEKGAIVCRVSESFIRIGNFELLSAR--KDLDSLKILADFTIRHFYPNLKGQ 236
Query: 340 NKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSI 399
K LSF VA RTAS++ WQ VGF HGV+NTDNMSI
Sbjct: 237 GKDLYLSFFRA--------------------VAARTASMIIDWQRVGFVHGVMNTDNMSI 276
Query: 400 LGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD 459
LG TIDYGP+G+L+ +D +TPNTTD RRY F NQ + LWN+ Q + L LI+D
Sbjct: 277 LGQTIDYGPYGWLENYDEEWTPNTTDQEHRRYRFGNQGSVALWNLTQLANALYP--LIED 334
Query: 460 KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY--NKQIISKLLNNMAVD-KVDYTNFFRA 515
A ++ Y T ++ +Y ++ K+GL K N + ++K L+++ + + D T F+R
Sbjct: 335 VPALEKSLDEYRTNYLKDYHKMLNTKIGLTKMKGNDEKLNKDLHDLMIHTQTDMTIFYRQ 394
Query: 516 LSNVKADPSIPEDELLVPLKAVLLD--IGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
LS + D P + L + A + + E KEAW++W++ Y L ++ER+A
Sbjct: 395 LSLFEVDK--PSEHLRLVKDACYIGDVVFNENKEAWLNWLVRYACRLTEESKQEDERRAN 452
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY- 632
MN VNPKYVLRNY+ Q AI+ A+ ++ + L +L++ PYDEQP M+K+ + P+WA
Sbjct: 453 MNGVNPKYVLRNYMAQLAIEDADKENYDLIHELHELLKNPYDEQPEMQKWFAMRPSWALN 512
Query: 633 RPGVCMLSCSS 643
+ G LSCSS
Sbjct: 513 KVGCSQLSCSS 523
>gi|188991289|ref|YP_001903299.1| hypothetical protein xccb100_1894 [Xanthomonas campestris pv.
campestris str. B100]
gi|226696168|sp|B0RS12.1|Y1894_XANCB RecName: Full=UPF0061 protein xcc-b100_1894
gi|167733049|emb|CAP51247.1| Conserved hypothetical protein [Xanthomonas campestris pv.
campestris]
Length = 518
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 254/549 (46%), Positives = 326/549 (59%), Gaps = 44/549 (8%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ +LPGDP REVL A ++ V P+ V P L+A+S VA L L ++
Sbjct: 4 LQFDNRLRAQLPGDPEQGPRRREVL-AAWSAVRPT-PVAAPTLLAYSADVAQRLGLRAED 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+ELQ
Sbjct: 62 LASPQFAEVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGVDGGRYELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH+LG+PTTRAL LV TG V RDMFYDG
Sbjct: 122 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHYLGVPTTRALSLVGTGDAVVRDMFYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGAIVCRVA SF+RFG++++ A+RG D+D++R D+ + F + +
Sbjct: 182 HPRREPGAIVCRVAPSFIRFGNFELPAARG--DVDLLRQWVDFTLARDFPDLPGSGE--- 236
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
++ AAW +V ERTA +VA W VGF HGV+NTDNMSILGLTI
Sbjct: 237 -----------------DRIAAWFGQVCERTAVMVAHWMRVGFVHGVMNTDNMSILGLTI 279
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + L + L D +
Sbjct: 280 DYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAL--SPLFGDAAS-- 335
Query: 465 VMERYGTKFMDEYQAI----MTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALS 517
++ +F D Y A KLGL + + +I L M ++D T FR L
Sbjct: 336 -LQAGLDQFRDTYLACDRRDTAAKLGLAECQDEDLHLIDDLRALMREAEMDMTLTFRGLV 394
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKE--AWISWVLSYIQELLSSGISDEERKALMN 575
++ P P+ + L+ D K + A +W+ Y L G SD R + M
Sbjct: 395 DLS--PQQPDASV---LREAFYDETKRAAQAPALGAWLQRYAARCLQDGASDAVRASRMR 449
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
+ NP+YVLRNYL Q AID AE GD V LL++M+RPYD+QP E +A P WA R
Sbjct: 450 AANPRYVLRNYLAQQAIDQAEQGDLSGVHALLEVMQRPYDDQPRRESFAAKRPDWARDRA 509
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 510 GCSMLSCSS 518
>gi|399032669|ref|ZP_10731992.1| hypothetical protein PMI10_03876 [Flavobacterium sp. CF136]
gi|398068958|gb|EJL60343.1| hypothetical protein PMI10_03876 [Flavobacterium sp. CF136]
Length = 523
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 243/555 (43%), Positives = 339/555 (61%), Gaps = 45/555 (8%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
++ L + F ELP D + R+V A ++ V+P+ + +P+L+ +ESVA+ + +
Sbjct: 1 MKHLKIHNRFTTELPADTNETNEVRQVSKALFSYVNPT-KPSDPKLIHAAESVAELVGIS 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E + +F FSG L G PYA CY GHQFG WAGQLGDGRAI L E+ + ++ +
Sbjct: 60 KDEIQSEEFLNVFSGKEILPGTRPYAMCYAGHQFGNWAGQLGDGRAINLTEVEHDDNQFF 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
LQLKGAGKTPYSR ADGLAVLRSSIRE LC+EAM++LGIPTTR+L L+ +G V RD+
Sbjct: 120 TLQLKGAGKTPYSRTADGLAVLRSSIREHLCAEAMYYLGIPTTRSLSLMLSGDQVLRDVL 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDGNP E GAIVCRVA SF+RFGS+++ +R + L ++ +Y I+H+F I+ K
Sbjct: 180 YDGNPAYEKGAIVCRVAPSFIRFGSFEMLTARNE--LKNLKQFVEYNIKHYFPEIKGEPK 237
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ L F VA++T ++ WQ VGF HGV+NTDNMSI G
Sbjct: 238 KQYLQFFKT--------------------VADKTREMILHWQRVGFVHGVMNTDNMSIHG 277
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
+TIDYGP+G+L+ +DP++TPNTTD RRY F NQP I WN+ Q + +L LI++ E
Sbjct: 278 ITIDYGPYGWLENYDPNWTPNTTDSQNRRYRFGNQPQIAQWNLYQLANSLYP--LINEAE 335
Query: 462 -ANYVMERYGTKFMDEYQAIMTKKLG---LPKYNKQIISKLLNNMAVDKVDYTNFFRALS 517
++E + F +Y+ ++ KLG + + ++I+ L +N+ + + D T F+R L+
Sbjct: 336 PLEKILESFIIDFNSDYKKMILSKLGSTTSTESDDELIAYLESNLQLSETDMTIFYRNLN 395
Query: 518 NVKADPSIP------EDELLVP--LKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE 569
+K S ED P +K +LD W+ W Y++ L+ SDEE
Sbjct: 396 KIKKTDSAEKALKCIEDAFYKPEEIKDTILD-------NWLLWFADYLERLIQENTSDEE 448
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
R LMNSVNPKYVLRNY+ Q AIDAA+ D+ + L +L+++PYDEQP EK+ P
Sbjct: 449 RIKLMNSVNPKYVLRNYMAQLAIDAADKEDYSLINELYELLKKPYDEQPEHEKWFAKRPD 508
Query: 630 WAY-RPGVCMLSCSS 643
WA + G MLSCSS
Sbjct: 509 WARSKVGCSMLSCSS 523
>gi|389793943|ref|ZP_10197104.1| hypothetical protein UU9_07049 [Rhodanobacter fulvus Jip2]
gi|388433576|gb|EIL90542.1| hypothetical protein UU9_07049 [Rhodanobacter fulvus Jip2]
Length = 519
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 255/546 (46%), Positives = 333/546 (60%), Gaps = 37/546 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D++FVRELP DP + R+V A Y+ V P+ V P+L+A+S A L + +
Sbjct: 4 LRFDNAFVRELPADPERGARLRQVEGALYSLVEPT-PVAAPRLLAYSAETAALLGIRATD 62
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F F G L G P+A YGGHQFG W GQLGDGRA++LGE++N ERWELQ
Sbjct: 63 ITTLAFARVFGGNALLPGMQPFAANYGGHQFGNWVGQLGDGRALSLGEVINAAGERWELQ 122
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL L+ TG+ V RDMFYDG
Sbjct: 123 LKGAGRTPYSRSADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLIDTGEPVLRDMFYDG 182
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+ EPGAIVCRVA SF+RFG++++ ASRG D ++R L D+ IR F + + E+
Sbjct: 183 HAAPEPGAIVCRVAPSFIRFGNFELPASRG--DTALLRQLVDFTIRRDFPELG--GQGEA 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L Y W +V ERTA +VA W VGF HGV+NTDNMSILGLTI
Sbjct: 239 L------------------YGEWFGQVCERTARMVAHWMRVGFVHGVMNTDNMSILGLTI 280
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA-KLIDDKEAN 463
DYGP+G++D FDP +TPNTTD RRY F QPD+ WN+++ + LA ++ +A
Sbjct: 281 DYGPYGWIDNFDPDWTPNTTDAQRRRYRFGQQPDVAWWNLSRLAGALAPLFSGVEPLQAG 340
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVK 520
++RY + +A + KLGL + ++ L +A +VD T +FR L +V
Sbjct: 341 --LDRYAATYAAADRANIAAKLGLLECRDDDVALMQSLHALLAQAEVDMTLWFRGLGDV- 397
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWI--SWVLSYIQELLSSGISDEERKALMNSVN 578
DP P L + D K R+ + W+ Y L + +R+ M +VN
Sbjct: 398 -DPEAPT---LAAMDDAFYDALKRREAERLLDDWLKRYAARLADDPQTVAQRRKRMRAVN 453
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVC 637
P+YVLRNYL Q+AID A+ GD G + LL +M PYD+QPG E +A+ P WA ++ G
Sbjct: 454 PRYVLRNYLVQNAIDQAQAGDAGGIHELLDVMRWPYDDQPGREAFAQKRPDWARHKAGCS 513
Query: 638 MLSCSS 643
MLSCSS
Sbjct: 514 MLSCSS 519
>gi|325923001|ref|ZP_08184705.1| hypothetical protein XGA_3737 [Xanthomonas gardneri ATCC 19865]
gi|325546509|gb|EGD17659.1| hypothetical protein XGA_3737 [Xanthomonas gardneri ATCC 19865]
Length = 518
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 257/552 (46%), Positives = 326/552 (59%), Gaps = 44/552 (7%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ D+ +D+ ++LPGDP R+V+ A ++ VSP+ V P+L+A+S +A L LD
Sbjct: 1 MTDIQFDNRLRQQLPGDPEEGPRRRDVV-AAWSSVSPTP-VAAPRLLAYSAEMAQQLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+
Sbjct: 59 EAELAGARFAEVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGVDGVRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LVTTG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVTTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG P+ EPGAIVCRVA SF+RFG++++ ++RG D ++R AD+ I F +E +
Sbjct: 179 YDGRPQREPGAIVCRVAPSFIRFGNFELPSARG--DSALLRQWADFTIARDFPELEGAGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
N YAAW +V ERTA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 --------------------NLYAAWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + L A L D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAL--APLFADAA 334
Query: 462 ANYVMERYGTKFMDEYQAI----MTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFR 514
+++ F D Y A KLGL + +I L M ++D T FR
Sbjct: 335 P---LQQGLDHFRDTYLACDRRDTAAKLGLADCRDEDLHLIDVLRELMHAAEMDMTLTFR 391
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKA 572
L ++ P P+ EL L+ D K A W+ Y L +S ++R+
Sbjct: 392 GL--IELSPEHPDPEL---LREAFYDQDKRLAHAGQLQEWLQRYATRLGQDTLSPDQRRE 446
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NP+YVLRNYL Q AID AE GD V+ LL++M RP D+QPG + +A P WA
Sbjct: 447 RMRLANPRYVLRNYLAQQAIDLAEQGDPSGVQELLEVMRRPCDDQPGRDAFAARRPEWAR 506
Query: 633 -RPGVCMLSCSS 643
R G MLSCSS
Sbjct: 507 DRAGCSMLSCSS 518
>gi|126661720|ref|ZP_01732719.1| hypothetical protein FBBAL38_00175 [Flavobacteria bacterium BAL38]
gi|126625099|gb|EAZ95788.1| hypothetical protein FBBAL38_00175 [Flavobacteria bacterium BAL38]
Length = 520
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 247/541 (45%), Positives = 324/541 (59%), Gaps = 36/541 (6%)
Query: 110 SFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPD 169
+F +LP D T + R+V A Y+ V+P NP V +E VA L L + + D
Sbjct: 9 TFTTQLPADQETANTRRQVYEAAYSFVTPRVP-SNPAFVHVAEEVAAFLGLSKEATKTDD 67
Query: 170 FPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAG 229
F SG+ PYA Y GHQFG WAGQLGDGRAI L E+++ ++R+ LQLKGAG
Sbjct: 68 FLKLVSGSMVYPNTTPYAMAYAGHQFGNWAGQLGDGRAINLFEVIH-NNQRFTLQLKGAG 126
Query: 230 KTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEE 289
TPYSR ADG AVLRSSIRE LCSEAM +LG+PTTR+L LVTTG V RD+ Y+GN E
Sbjct: 127 ATPYSRSADGFAVLRSSIREHLCSEAMCYLGVPTTRSLSLVTTGDKVLRDVLYNGNAAYE 186
Query: 290 PGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFST 349
GA+VCRVA +F+RFG++Q+ A+R +D+ ++ LADY I++ + I K + L F
Sbjct: 187 DGAVVCRVAPTFIRFGNFQLFAAR--KDIKNLKALADYTIQYFYPQITISGKEKYLQFYK 244
Query: 350 GDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPF 409
EV RT +V WQ VGF HGV+NTDNMSILGLTIDYGP+
Sbjct: 245 --------------------EVVNRTVEMVLHWQRVGFVHGVMNTDNMSILGLTIDYGPY 284
Query: 410 GFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVMER 468
G+L+ +DP +TPNTTD GRRY F NQPDI LWN+ Q L LI+D V+
Sbjct: 285 GWLEDYDPDWTPNTTDAEGRRYRFRNQPDIALWNLVQLGNALYP--LIEDIASMEQVLNS 342
Query: 469 YGTKFMDEYQAIMTKKLGL-PKYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIP 526
Y +F ++ I +KLGL +Y+ +L + + D T F+R L+NV K D S
Sbjct: 343 YSQQFDSQFPIIQQQKLGLQAEYDAHFQDELTTLLTASETDMTIFYRNLANVLKTDTS-- 400
Query: 527 EDELLVPLKAVLLD---IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVL 583
+E L + I K +W++W+ Y++++ + SDEERK MN VNPKYVL
Sbjct: 401 -EEALAKIILAFYQPDKIVTTLKTSWLNWMELYLEKIKAEVGSDEERKEAMNKVNPKYVL 459
Query: 584 RNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCMLSCS 642
RNY+ Q AI+AAE D+ + L++ PYDEQP EK+ P WA ++ G MLSCS
Sbjct: 460 RNYMAQLAIEAAEKQDYSVIDEFYTLLKNPYDEQPQYEKWFAKRPDWARHKVGCSMLSCS 519
Query: 643 S 643
S
Sbjct: 520 S 520
>gi|343087457|ref|YP_004776752.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342355991|gb|AEL28521.1| UPF0061 protein ydiU [Cyclobacterium marinum DSM 745]
Length = 529
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 247/550 (44%), Positives = 333/550 (60%), Gaps = 41/550 (7%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
+LN +F ELP DP R+V AC++ V PS P+L+ S+ + D+L L +
Sbjct: 11 NLNIQDTFTSELPEDPIMGKQRRQVTDACFSYVDPSPTAA-PKLIHVSKEMLDNLGLTIE 69
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
+ + +F F+G + L PYA YGGHQFG WAGQLGDGRAI L E+++ + ++W +
Sbjct: 70 DSKSTEFLKVFTGNSVLDKTKPYAMSYGGHQFGNWAGQLGDGRAINLFEVVH-QEKKWVV 128
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKGAG+TPYSR ADGLAVLRSSIRE+LCSEAMH LG+PTTRAL L TG V RD+ Y+
Sbjct: 129 QLKGAGETPYSRTADGLAVLRSSIREYLCSEAMHHLGVPTTRALSLALTGDKVMRDVLYN 188
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
GNP E GAIV RV+ SFLRFG+Y++ ASR +D ++TL D+ I+HHF H+ +K
Sbjct: 189 GNPAYEKGAIVSRVSPSFLRFGNYELFASR--QDTITLKTLVDFTIKHHFSHLGTPSKE- 245
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
Y A+ EV + T +L+ WQ VGF HGV+NTDNMSILGLT
Sbjct: 246 -------------------TYIAFFNEVVQSTLALIVHWQSVGFVHGVMNTDNMSILGLT 286
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL-----AAAKLID 458
IDYGP+G+L+ F+ +TPNTTDL +RY + NQP+IGLWN+ Q + L A L D
Sbjct: 287 IDYGPYGWLEGFEEGWTPNTTDLHQKRYRYGNQPNIGLWNLYQLANALYPLIEEVAPLED 346
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRA 515
+++Y + F +M +K+GL + +I +L + + D T FFR
Sbjct: 347 ------ALDQYRSGFPKAMVQMMREKIGLTTEKGKDIALIQELERLLQEAETDMTIFFRL 400
Query: 516 LSNV-KADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
LS + KAD S ++++ ++ +E W +W Y L +SD ERK +M
Sbjct: 401 LSKIEKADTSNGLEQVMEAFYTP-SELSSSLREDWQAWFQFYGNRLQEESLSDIERKKIM 459
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YR 633
N VNPKYVLRNY+ Q AID AE G++G + L L++ PY EQ EK+ P WA ++
Sbjct: 460 NLVNPKYVLRNYMAQLAIDDAENGNYGLLEELFDLLKNPYSEQADQEKWFAKRPEWARHK 519
Query: 634 PGVCMLSCSS 643
G MLSCSS
Sbjct: 520 VGCSMLSCSS 529
>gi|21231722|ref|NP_637639.1| hypothetical protein XCC2284 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768152|ref|YP_242914.1| hypothetical protein XC_1831 [Xanthomonas campestris pv. campestris
str. 8004]
gi|33517048|sp|Q8P8F8.1|Y2284_XANCP RecName: Full=UPF0061 protein XCC2284
gi|81305873|sp|Q4UVM9.1|Y1831_XANC8 RecName: Full=UPF0061 protein XC_1831
gi|21113425|gb|AAM41563.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66573484|gb|AAY48894.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 518
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 254/549 (46%), Positives = 325/549 (59%), Gaps = 44/549 (8%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ ELPGDP REVL A ++ V P+ V P L+A+S VA L L ++
Sbjct: 4 LQFDNRLRAELPGDPEEGPRRREVL-AAWSAVQPT-PVAAPTLLAYSADVAQRLGLRAED 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+ELQ
Sbjct: 62 LASPRFAEVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGVDGGRYELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH+LG+PTTRAL LV TG V RDMFYDG
Sbjct: 122 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHYLGVPTTRALSLVGTGDAVVRDMFYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGAIVCRVA SF+RFG++++ A+RG D+D++R D+ + F + +
Sbjct: 182 HPRREPGAIVCRVAPSFIRFGNFELPAARG--DVDLLRQWVDFTLARDFPDLPGSGE--- 236
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
++ A+W +V ERTA +VA W VGF HGV+NTDNMSILGLTI
Sbjct: 237 -----------------DRIASWLGQVCERTAVMVAHWMRVGFVHGVMNTDNMSILGLTI 279
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + L + L D
Sbjct: 280 DYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAL--SPLFGDAAP-- 335
Query: 465 VMERYGTKFMDEYQAI----MTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALS 517
++ +F D Y A KLGL + + +I L M ++D T FR L
Sbjct: 336 -LQAGLDQFRDTYLACDRRDTAAKLGLAECQDEDLHLIDDLRALMREAEMDMTLTFRGLV 394
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKE--AWISWVLSYIQELLSSGISDEERKALMN 575
++ P P+ + L+ D K + A +W+ Y L G SD R + M
Sbjct: 395 DLS--PQQPDASV---LREAFYDETKRAAQAPALGAWLQRYAARCLQDGASDAVRASRMR 449
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
+ NP+YVLRNYL Q AID AE GD V LL++M+RPYD+QP E +A P WA R
Sbjct: 450 AANPRYVLRNYLAQQAIDQAEQGDLSGVHALLEVMQRPYDDQPRRESFAAKRPDWARDRA 509
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 510 GCSMLSCSS 518
>gi|302879624|ref|YP_003848188.1| hypothetical protein Galf_2424 [Gallionella capsiferriformans ES-2]
gi|302582413|gb|ADL56424.1| protein of unknown function UPF0061 [Gallionella capsiferriformans
ES-2]
Length = 518
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 247/549 (44%), Positives = 328/549 (59%), Gaps = 44/549 (8%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
+D+ FV ELPGD R+ C+ V+P+ + P L+A+S + A L L ++
Sbjct: 4 FTFDNRFVSELPGDQSGSPHSRQTPDVCWAAVNPTPTAQ-PVLLAYSNAAACLLNLSHED 62
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+F FSG L G P+A CYGGHQFG WAGQLGDGRAI+LGE++NL+ ERWELQ
Sbjct: 63 VHSAEFLQAFSGNQLLPGMRPFAACYGGHQFGHWAGQLGDGRAISLGEVINLQGERWELQ 122
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSS+REFLCSEAMH LGIPTTRAL L+ TG V RDMFYDG
Sbjct: 123 LKGAGMTPYSRRADGRAVLRSSLREFLCSEAMHHLGIPTTRALSLIGTGDDVMRDMFYDG 182
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P +EPGAIVCR+A SF+RFG++++ A+RG+ +L +R L D+ I F+ I
Sbjct: 183 HPNDEPGAIVCRIAPSFIRFGNFELLAARGEHEL--LRRLVDFTIDRDFQEI-------- 232
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ + D + D W V ERTA LV +W VGF HGV+NTDNMSILGLT+
Sbjct: 233 ----SKEPDDYLSD--------WFSLVCERTAKLVVEWLRVGFVHGVMNTDNMSILGLTL 280
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D FDP +TPNTTD RRYC + QP + WN+ + + L+ K A
Sbjct: 281 DYGPYGWIDNFDPGWTPNTTDSEWRRYCLSQQPPVARWNLERLADALSTI-----KGARS 335
Query: 465 VMERYGTKFMDEYQAIMTKKLG-------LPKYNKQIISKLLNNMAVDKVDYTNFFRALS 517
+ ER F Q MT L + +++ + + M +VD T FFRAL+
Sbjct: 336 LRERGLKHFDATLQTSMTSMLAGKFGWLVWCDTDAELVETIFDLMQTAQVDMTQFFRALA 395
Query: 518 NVKADPSIPEDELLVPLKAVLLD--IGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
N++ + P+ L L++ + + W+ Y L SD+ R+ MN
Sbjct: 396 NIEQEA--PD---LAVLRSAFYQEALYHNHSTLFNDWLQRYAARLCLQQESDDTRRKRMN 450
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
VNP+++LRNYL Q AI+AA D + RL++ +RPYDE+ + A L P WA +P
Sbjct: 451 LVNPRFILRNYLAQQAIEAAMQNDMSFLERLMQAGQRPYDEEIDADLVA-LRPDWALNKP 509
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 510 GCSMLSCSS 518
>gi|86134526|ref|ZP_01053108.1| uncharacterized ACR, YdiU/UPF0061 family [Polaribacter sp. MED152]
gi|85821389|gb|EAQ42536.1| uncharacterized ACR, YdiU/UPF0061 family [Polaribacter sp. MED152]
Length = 518
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 245/547 (44%), Positives = 332/547 (60%), Gaps = 39/547 (7%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
LN H+F+ ELP D ++ R+V A Y+ V+P + + P+++ S+ +A+ L + +E
Sbjct: 3 LNLKHTFLNELPADSILENTRRQVSDAVYSFVNPK-KTQQPEILHVSQEMANELGITQEE 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F F+G PYA CYGGHQFG WAGQLGDGRAI L E+ + ++ W++Q
Sbjct: 62 TTSTLFKKIFTGNEVYPNTKPYAMCYGGHQFGNWAGQLGDGRAINLFEVEH-DNKNWKVQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADGLAVLRSSIRE+LC+EAM+ LG+PTTR+L L +G V RD+ YDG
Sbjct: 121 LKGAGETPYSRTADGLAVLRSSIREYLCAEAMYHLGVPTTRSLSLALSGDDVLRDVMYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
NP E GAIV R++ SFLRFG+++I ASR D ++ L DY I+HHF H+ N +K
Sbjct: 181 NPAYEKGAIVSRISPSFLRFGNFEIFASRN--DFKNLKILTDYTIKHHFSHLGNPSKETY 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ F EVA+RT +++ WQ VGF HGV+NTDNMSILGLTI
Sbjct: 239 IQFFG--------------------EVADRTLNMIIDWQRVGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-AN 463
DYGP+G+L+ FD +TPNTTD +RY + NQP+IGLWN+ Q + L LI+D
Sbjct: 279 DYGPYGWLEGFDFGWTPNTTDRQNKRYRYGNQPNIGLWNLYQLANALYP--LIEDASPLE 336
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSN-- 518
++ +Y T F + +M KLGL ++ ++I +L +N+ + + D T FFR LS+
Sbjct: 337 AILNKYKTDFERKSLQMMKSKLGLFVVDEDDLKLIQELEDNLQLVETDMTIFFRNLSDFS 396
Query: 519 -VKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
K I ED L I + K W SW Y L + +ERK M++V
Sbjct: 397 STKEGFKIIEDAFY-----DLESISDDVKIRWNSWFNKYEDRLAIERVPFDERKEKMDAV 451
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGV 636
NPKYVLRNY+ Q AIDAA D+ + L +L+++PY EQP EK+ P WA + G
Sbjct: 452 NPKYVLRNYMAQLAIDAANNKDYSLINELFELLKKPYSEQPNYEKWFAKRPEWARDKVGC 511
Query: 637 CMLSCSS 643
MLSCSS
Sbjct: 512 SMLSCSS 518
>gi|395804497|ref|ZP_10483735.1| hypothetical protein FF52_21553 [Flavobacterium sp. F52]
gi|395433384|gb|EJF99339.1| hypothetical protein FF52_21553 [Flavobacterium sp. F52]
Length = 522
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 242/555 (43%), Positives = 335/555 (60%), Gaps = 46/555 (8%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+++L ++ F ELP DP + R+V + ++ V+P+ + NP+L+ SE VA+ + +
Sbjct: 1 MKNLKINNRFTAELPADPDLTNEIRQVKNTLFSYVNPT-QPSNPKLIHASEEVAELVGIS 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E + +F FSG L PYA CY GHQFG WAGQLGDGRAI L E+ N + +
Sbjct: 60 KDEIQSEEFLNVFSGKEILPETKPYAMCYAGHQFGNWAGQLGDGRAINLTEVEN-NNRFY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
LQLKGAGKTPYSR ADGLAVLRSSIRE+LC+EAMH+LG+PTTR+L LV +G V RD+
Sbjct: 119 TLQLKGAGKTPYSRTADGLAVLRSSIREYLCAEAMHYLGVPTTRSLSLVLSGDQVLRDIL 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
Y+GNP E GA+VCRVA SF+RFGSY++ +R + L ++ ++ I+H+F I K
Sbjct: 179 YNGNPAYEKGAVVCRVAPSFIRFGSYEMLTARNE--LKNLKQFVEFTIKHYFPEITGEPK 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ L F +VA+ T ++ WQ VGF HGV+NTDNMSI G
Sbjct: 237 EQYLKFFQ--------------------KVADTTREMILHWQRVGFVHGVMNTDNMSIHG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
+TIDYGP+G+L+ +DP +TPNTTD RRY F NQP + WN+ Q + A LI++ E
Sbjct: 277 ITIDYGPYGWLENYDPDWTPNTTDSQNRRYRFGNQPHVAQWNLFQLAN--AIYPLINEAE 334
Query: 462 -ANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALS 517
+++ + T F +Y+ + KLG+ + + +II L + + + D T FFR LS
Sbjct: 335 PLEKILDTFITDFEKDYKTMFLSKLGIFTSSEADDKIIKGLEEILQLSETDMTIFFRNLS 394
Query: 518 NVKADPSIP------EDELLVP--LKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE 569
+K D S+ E +P +K +LD AW W Y++ L + +SD+E
Sbjct: 395 KIKKDDSVEQAFEKIEYAFYIPEEIKENILD-------AWQKWFTVYLKRLNAEELSDDE 447
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
R MN +NPKYVLRNY+ Q AIDAA+ D+ V L +L++ PYDEQP EK+ P
Sbjct: 448 RSEKMNQINPKYVLRNYMAQLAIDAADKEDYSLVDELFQLLKNPYDEQPESEKWFAKRPD 507
Query: 630 WAY-RPGVCMLSCSS 643
WA + G MLSCSS
Sbjct: 508 WARTKVGCSMLSCSS 522
>gi|325916973|ref|ZP_08179215.1| hypothetical protein XVE_3195 [Xanthomonas vesicatoria ATCC 35937]
gi|325536824|gb|EGD08578.1| hypothetical protein XVE_3195 [Xanthomonas vesicatoria ATCC 35937]
Length = 518
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 253/548 (46%), Positives = 320/548 (58%), Gaps = 36/548 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ DL++D+ ++LP DP REV A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTDLHFDNRLRQQLPADPEQGPRRREVA-AAWSSVLPTP-VAAPHLIAHSPEMAQLLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+
Sbjct: 59 AAELASARFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGVDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LVTTG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVTTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG P+ EPGAIVCRVA SF+RFG++++ + RG D ++R D+ I F +E +
Sbjct: 179 YDGRPQREPGAIVCRVAPSFIRFGNFELPSVRG--DTALLRQSVDFTIARDFPELEGTGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ YAAW +V ERTA +VAQW VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------IYAAWFAQVCERTAVMVAQWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FADAAP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSN 518
++R+ ++ + KLGL + Q+I L M ++D T FRAL
Sbjct: 336 LQQGLDRFRDTYLACDRNDTAAKLGLAECRDEDLQLIDALRALMREAEMDMTLTFRAL-- 393
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKALMNS 576
+ P P+ +L L+ D K A + W+ Y L + E+R+ M
Sbjct: 394 IDFTPEHPDPQL---LRDAFYDHDKRTATAPQLLDWLRRYATRLQQDSVLPEQRRERMRL 450
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPG 635
NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+QP +A P WA R G
Sbjct: 451 ANPRYVLRNYLAQQAIDKAEQGDPSGVQELLEVMRRPYDDQPDNAAFAARRPEWARDRAG 510
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 511 CSMLSCSS 518
>gi|294666448|ref|ZP_06731691.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292603754|gb|EFF47162.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 557
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 250/552 (45%), Positives = 320/552 (57%), Gaps = 36/552 (6%)
Query: 98 KLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADS 157
+L + L +D+ ++LPGDP S REV A ++ V P+ V P L+A S +A +
Sbjct: 36 RLAGMTHLRFDNRLRQQLPGDPEEGSRRREV-SAAWSAVLPTP-VAAPSLIAHSAEMAQA 93
Query: 158 LELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLK 217
L LD E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE +
Sbjct: 94 LGLDAAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTD 153
Query: 218 SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
R+ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V
Sbjct: 154 GGRYELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVV 213
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDMFYDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ ++R D+ I F +
Sbjct: 214 RDMFYDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLRQWVDFTIARDFPALA 271
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
++ YA W +V ERTA +VA W VGF HGV+NTDNM
Sbjct: 272 GAGEA--------------------LYADWFTQVCERTAVMVAHWLRVGFVHGVMNTDNM 311
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA
Sbjct: 312 SILGLTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FP 370
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFR 514
D + ++R+ ++ + KLGL + Q+I L M ++D T FR
Sbjct: 371 DQAPLQHGLDRFRDTYLACDRHDTAAKLGLAECRDEDLQLIDALRALMRESEMDMTLTFR 430
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKA 572
L ++ D P L+ D K +A W+ Y L + +ER
Sbjct: 431 GLIDLSPDHPDPAQ-----LREAFYDEDKRVADAPQLQQWLQRYAARLQQDPLPPDERHT 485
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+QPG + +A P WA
Sbjct: 486 RMRLANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRRPYDDQPGRDAFAARRPEWAR 545
Query: 633 -RPGVCMLSCSS 643
R G MLSCSS
Sbjct: 546 DRAGCSMLSCSS 557
>gi|119945733|ref|YP_943413.1| hypothetical protein Ping_2062 [Psychromonas ingrahamii 37]
gi|119864337|gb|ABM03814.1| hypothetical protein UPF0061 [Psychromonas ingrahamii 37]
Length = 533
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 251/549 (45%), Positives = 319/549 (58%), Gaps = 31/549 (5%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ LP D TD+ R V +A Y+ VSP + P+LVA S +A+ L +
Sbjct: 6 LKFDNRLRNNLPADSETDNYCRSVENAAYSLVSP-VKATAPKLVAVSNLLAEQLGFTTEA 64
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P+FP +G L G PYA CYGGHQFG WAGQLGDGRAI LGE++ LQ
Sbjct: 65 LNSPEFPQAMTGNLLLDGMQPYALCYGGHQFGQWAGQLGDGRAINLGELVTTNLGHQTLQ 124
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG+AVLRSSIREFLCSEAM LGI TTRAL L TG V RDM YDG
Sbjct: 125 LKGAGPTPYSRRADGMAVLRSSIREFLCSEAMFHLGISTTRALSLCLTGDQVVRDMMYDG 184
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
N EP AIVCRV+ SFLRFGS+Q+ ASRG E L I L + I+ + H
Sbjct: 185 NAALEPTAIVCRVSSSFLRFGSFQLPASRGDEQLLI--QLVQHCIKSDYPH--------- 233
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L+ ++G D V Y AW E+ ERT V W VGF HGV+NTDNMSI+G TI
Sbjct: 234 LAPASGVFDQQV-------YLAWFKEICERTCDTVVNWMRVGFVHGVMNTDNMSIMGETI 286
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-AN 463
DYGP+G++D FD ++TPNTTD +RY F Q +I WN+ Q + A LI + E
Sbjct: 287 DYGPYGWIDDFDLNWTPNTTDEGQKRYRFGGQGEISQWNLFQLAN--AIFPLIGEAEPLQ 344
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDYTNFFRALSNV 519
++ YGT + ++ +M +KLG Y + + L + + D T F+R L+N+
Sbjct: 345 KILNEYGTDYQRKWCDMMAEKLGFKHYRGETDLALFKSLEKLLGAVETDMTLFYRLLANI 404
Query: 520 KADPSIPEDEL----LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
D L P LLD+ + + W+ SY++ + G+S E R MN
Sbjct: 405 PNDLDTQTATQWMAKLGPCYYSLLDLNDQYIKDLTKWLASYLERVNLDGLSQELRATAMN 464
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
VNPKYV+RNYL Q AI+ AE GDF E+ L K+++ PYD+QP YA+ P WA +
Sbjct: 465 KVNPKYVIRNYLAQHAIELAEKGDFSEIATLQKILQNPYDDQPEHNSYAQKRPDWARDKA 524
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 525 GCSMLSCSS 533
>gi|325288029|ref|YP_004263819.1| hypothetical protein Celly_3131 [Cellulophaga lytica DSM 7489]
gi|324323483|gb|ADY30948.1| UPF0061 protein ydiU [Cellulophaga lytica DSM 7489]
Length = 520
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 239/547 (43%), Positives = 334/547 (61%), Gaps = 37/547 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
N F +LP DP ++ R+V +AC++ V+P + NP+++ S+ + +L L K+
Sbjct: 3 FNLKDRFTSQLPADPILENSRRQVSNACFSYVTPK-KTANPEIIHVSDDMLRTLGLTKKD 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+F F+G + + PYA CYGGHQFG WAGQLGDGRAI L E+ + ++ W LQ
Sbjct: 62 SATKEFLNVFTGNSVMPNTKPYAMCYGGHQFGNWAGQLGDGRAINLAEVEH-NNKIWALQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADGLAVLRSS+RE+LCSEAM+ LG+PTTRAL L TG V RDM Y+G
Sbjct: 121 LKGAGETPYSRSADGLAVLRSSVREYLCSEAMYHLGVPTTRALSLALTGDNVLRDMLYNG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
N E GA+V RVA SFLRFGS+Q+ A++ ED+ + TL +Y I++H+ H+ N +K
Sbjct: 181 NAAYEKGAVVTRVAPSFLRFGSFQLLAAK--EDISTLTTLVNYTIKNHYSHLGNPSKE-- 236
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
Y A+ EVAERT ++ WQ VGF HGV+NTDNMSILGLTI
Sbjct: 237 ------------------TYIAFFKEVAERTLEMIVHWQRVGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-AN 463
DYGP+G+LD ++P +TPNTTD RRY + NQP++GLWN+ Q + L L+++
Sbjct: 279 DYGPYGWLDDYNPDWTPNTTDAENRRYRYNNQPNVGLWNLFQLANALFP--LVNEAAPLE 336
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
+++ Y + +M K+GL + ++I +L N+ + D T F+R LS
Sbjct: 337 TILDDYKLGYDKASLKMMRSKIGLFTEFDTDYKLIEQLEENLQRIETDMTIFYRNLSTF- 395
Query: 521 ADPSIPEDELLVPLKAVLLD---IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
+ + P+ E L +K + + + K W +W SY L +D+ERK MN
Sbjct: 396 -NKNAPK-EALNSIKEAFYNTNTLTDDVKTHWNNWFTSYASRLKLEKTTDDERKVKMNLT 453
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGV 636
NPKYVLRNY+ Q AIDAA+ G++ + L +L++ PY EQP +K+ P WA ++ G
Sbjct: 454 NPKYVLRNYMAQLAIDAADNGNYAVLDELYQLLKNPYKEQPEHQKWFAKRPDWAKHKVGC 513
Query: 637 CMLSCSS 643
MLSCSS
Sbjct: 514 SMLSCSS 520
>gi|257092929|ref|YP_003166570.1| hypothetical protein CAP2UW1_1317 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257045453|gb|ACV34641.1| protein of unknown function UPF0061 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
Length = 517
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 259/547 (47%), Positives = 329/547 (60%), Gaps = 39/547 (7%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
LN+D+ F+R+LPGD + PR+V AC++ V P+ V P L+A S VA +L LD +
Sbjct: 2 LNFDNRFLRDLPGDTDRHNAPRQVFGACWSPVDPT-PVAAPTLLAHSREVAAALGLDEQA 60
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P+ +G L G YA CYGGHQFG WAGQLGDGRAI LGE +N + +R ELQ
Sbjct: 61 MAAPEMLAALAGNALLPGMAAYASCYGGHQFGQWAGQLGDGRAILLGEAVNRQGQRLELQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG
Sbjct: 121 LKGAGPTPYSRRADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVATGETVVRDMFYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P EPGA+VCRVA SF RFG +++ A+RG+ +L ++ L D+ I F +
Sbjct: 181 HPVAEPGAVVCRVAPSFTRFGHFELLAARGEREL--LQRLVDFTIARDFAEL-------- 230
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
TG E AAW EV ERTA L+ W VGF HGV+NTDNMSILGLTI
Sbjct: 231 ---VTGAE---------PSLAAWFGEVCERTARLMVHWMRVGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK--EA 462
DYGP+G++D FDP +TPNTTD RRYCFA QP I WN+ + + LA ++ + E
Sbjct: 279 DYGPYGWVDNFDPGWTPNTTDASSRRYCFARQPAIARWNLERLADALA---MLTPRPVEL 335
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNV 519
+ERY + E+ A KLGL ++ + ++ +L M ++D T FFR L+++
Sbjct: 336 AAGIERYDEVYSSEFCAAFAGKLGLCEWHHDDADLLEELFELMRQAEIDMTEFFRCLASL 395
Query: 520 KAD-PSIPEDELLVPLKAVLLDIGKERKEAWIS-WVLSYIQELLSSGISDEERKALMNSV 577
D P+I V A D + R A +S W+ Y + R A MN+
Sbjct: 396 DIDNPAID-----VVQSAFYRDDLRLRFSAPVSRWLTRYAARVRQDAQPAARRAARMNAA 450
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGV 636
NP+YVLRNYL Q AID AE GD + LL ++ PY EQ G ++ P WA +R G
Sbjct: 451 NPRYVLRNYLAQQAIDRAEQGDTQRIHDLLDVLRHPYVEQAGCAAFSAKRPDWARHRAGC 510
Query: 637 CMLSCSS 643
LSCSS
Sbjct: 511 STLSCSS 517
>gi|381189365|ref|ZP_09896913.1| hypothetical protein HJ01_03433 [Flavobacterium frigoris PS1]
gi|379648574|gb|EIA07161.1| hypothetical protein HJ01_03433 [Flavobacterium frigoris PS1]
Length = 521
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 241/550 (43%), Positives = 334/550 (60%), Gaps = 40/550 (7%)
Query: 104 DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPK 163
+L ++ F ELP D ++ R+V +AC++ V+P +P+L+ ++ V + L + K
Sbjct: 2 NLKINNRFSTELPADTNETNVTRQVKNACFSYVNPRIP-SSPKLIHVTDEVLELLGITKK 60
Query: 164 EFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWEL 223
E + +F FSG L PY+ Y GHQFG WAGQLGDGRAI L EI N + + L
Sbjct: 61 EAQSAEFTNIFSGKELLPNTRPYSMSYAGHQFGNWAGQLGDGRAIILTEIEN-NQQTYTL 119
Query: 224 QLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYD 283
QLKG+G TPYSR ADGLAVLRSSIRE LCSEAM LG+PTTR+L L+ TG V RD+ YD
Sbjct: 120 QLKGSGLTPYSRGADGLAVLRSSIREHLCSEAMFHLGVPTTRSLSLLLTGDQVLRDVMYD 179
Query: 284 GNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
G+P E GA+VCRVA SF+RFG++++ +S Q DL +++LAD+ I+++F I+++ K
Sbjct: 180 GHPAYEKGAVVCRVAPSFIRFGNFELFSS--QNDLKTLKSLADFTIKYYFPEIKSIGKES 237
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+ F EVA + ++ WQ VGF HGV+NTDNMSILGLT
Sbjct: 238 YIQFFQ--------------------EVANKNLEMIVHWQRVGFVHGVMNTDNMSILGLT 277
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK--- 460
IDYGP+G+L+ ++P +TPNTTD RRY F NQP+I LWN+ Q + L LI++
Sbjct: 278 IDYGPYGWLEDYNPEWTPNTTDRENRRYRFGNQPEIVLWNLYQLANALYP--LIEEAAPL 335
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALS 517
EA ++ + +K+ +Y +M KLGL + + Q+I L N+ + D T FFR LS
Sbjct: 336 EA--ILNSFQSKYEADYATMMRNKLGLFTKEENDNQLIHLLTENLQQTETDMTIFFRKLS 393
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGK---ERKEAWISWVLSYIQELLSSGISDEERKALM 574
+K S E+E + + I + + KE W+ W Y+ L +D +RK M
Sbjct: 394 QIKKVES--EEEAFLRIADSFYKINEVTGQLKETWLYWFTQYLNRLRQEEATDADRKKAM 451
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-R 633
N+VNPKYVLRNY+ Q AI+A+E DF + L L++ PY+EQP EK+ P WA +
Sbjct: 452 NAVNPKYVLRNYMSQLAIEASEKEDFSLIEELHLLLKNPYEEQPESEKWFAKRPDWAREK 511
Query: 634 PGVCMLSCSS 643
G MLSCSS
Sbjct: 512 IGSSMLSCSS 521
>gi|325928090|ref|ZP_08189303.1| hypothetical protein XPE_3352 [Xanthomonas perforans 91-118]
gi|325541588|gb|EGD13117.1| hypothetical protein XPE_3352 [Xanthomonas perforans 91-118]
Length = 518
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 252/545 (46%), Positives = 322/545 (59%), Gaps = 36/545 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L++D+ ++LPGDP + REV A ++ V P+ V P L+A S +A L L+ E
Sbjct: 4 LHFDNRLRQQLPGDPEEGARRREV-GAAWSSVLPT-PVAAPYLIAHSAEMAQVLGLEAAE 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + R+ELQ
Sbjct: 62 IASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTDGGRYELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG
Sbjct: 122 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEAVVRDMFYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ +++ D+ I F + SE+
Sbjct: 182 HPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLKQWVDFTIARDFPAL--AGASEA 237
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L YA W +V ERTA +VA W VGF HGV+NTDNMSILGLTI
Sbjct: 238 L------------------YADWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILGLTI 279
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD GRRY F Q + WN+ + + LA D Y
Sbjct: 280 DYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQAQVAYWNLGRLAQALAPL-FADQALLQY 338
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKA 521
++R+ ++ + KLGL + Q+I L M ++D T FR L ++
Sbjct: 339 GLDRFRDTYLACDRRDTAAKLGLAECRDEDLQLIDALRALMRESEMDMTLTFRGLIDLS- 397
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKALMNSVNP 579
PE L+ D K +A W+ Y L +S EER+A M NP
Sbjct: 398 ----PEHPDPAQLRDAFYDEDKRLADAPQLQQWLQRYAARLQQDPLSPEERRARMRLANP 453
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCM 638
+YVLRNYL Q AID AE GD V+ LL++M RPYD+QPG + +A P WA R G M
Sbjct: 454 RYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRRPYDDQPGRDAFAARRPDWARDRAGCSM 513
Query: 639 LSCSS 643
LSCSS
Sbjct: 514 LSCSS 518
>gi|294626033|ref|ZP_06704643.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292599703|gb|EFF43830.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 557
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 249/552 (45%), Positives = 319/552 (57%), Gaps = 36/552 (6%)
Query: 98 KLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADS 157
+L + L +D+ ++LPGDP S REV A ++ V P+ V P L+A S +A +
Sbjct: 36 RLAGMTHLRFDNRLRQQLPGDPEEGSRRREV-SAAWSAVLPTP-VAAPSLIAHSAEMAQA 93
Query: 158 LELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLK 217
L LD E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE +
Sbjct: 94 LGLDAAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTD 153
Query: 218 SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
R+ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG
Sbjct: 154 GGRYELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAAV 213
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDMFYDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ ++R D+ I F +
Sbjct: 214 RDMFYDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLRQWVDFTIARDFPALA 271
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
++ YA W +V ERTA +VA W VGF HGV+NTDNM
Sbjct: 272 GAGEA--------------------LYADWFTQVCERTAVMVAHWLRVGFVHGVMNTDNM 311
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA
Sbjct: 312 SILGLTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FP 370
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFR 514
D + ++R+ ++ + KLGL + Q+I L M ++D T FR
Sbjct: 371 DQAPLQHGLDRFRDTYLACDRHDTAAKLGLAECRDEDLQLIDALRALMRESEMDMTLTFR 430
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKA 572
L ++ D P L+ D K +A W+ Y L + +ER
Sbjct: 431 GLIDLSPDHPDPAQ-----LREAFYDEDKRVADAPQLQQWLQRYAARLQQDPLPPDERHT 485
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+QPG + +A P WA
Sbjct: 486 RMRLANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRRPYDDQPGRDAFAARRPEWAR 545
Query: 633 -RPGVCMLSCSS 643
R G MLSCSS
Sbjct: 546 DRAGCSMLSCSS 557
>gi|78048145|ref|YP_364320.1| hypothetical protein XCV2589 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036575|emb|CAJ24266.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 557
Score = 440 bits (1132), Expect = e-120, Method: Compositional matrix adjust.
Identities = 250/552 (45%), Positives = 323/552 (58%), Gaps = 36/552 (6%)
Query: 98 KLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADS 157
+L + L++D+ ++LPGDP + REV A ++ V P+ V P L+A S +A
Sbjct: 36 RLAGMTHLHFDNRLRQQLPGDPEEGARRREV-GAAWSSVLPTP-VAAPYLIAHSAEMAQV 93
Query: 158 LELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLK 217
L L+ E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE +
Sbjct: 94 LGLEAAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTD 153
Query: 218 SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
R+ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V
Sbjct: 154 GGRYELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEAVV 213
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDMFYDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ +++ D+ I F +
Sbjct: 214 RDMFYDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLKQWVDFTIARDFPALA 271
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
++ YA W +V ERTA +VA W VGF HGV+NTDNM
Sbjct: 272 GAGEA--------------------LYADWFAQVCERTAVMVAHWMRVGFVHGVMNTDNM 311
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA
Sbjct: 312 SILGLTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FA 370
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFR 514
D Y ++R+ ++ + KLGL + Q+I L M ++D T FR
Sbjct: 371 DQALLQYGLDRFRDTYLACDRRDTAAKLGLAECRDEDLQLIDALRALMRESEMDMTLTFR 430
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKA 572
L ++ PE L+ D K +A W+ Y L +S EER+A
Sbjct: 431 GLIDLS-----PEHPDPAQLRDAFYDEDKRLVDAPQLQQWLQRYAARLQQDPLSPEERRA 485
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+Q G + +A P WA
Sbjct: 486 RMRLANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRRPYDDQHGRDAFAARRPDWAR 545
Query: 633 -RPGVCMLSCSS 643
R G MLSCSS
Sbjct: 546 DRAGCSMLSCSS 557
>gi|372210199|ref|ZP_09498001.1| hypothetical protein FbacS_08775 [Flavobacteriaceae bacterium S85]
Length = 513
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 241/547 (44%), Positives = 329/547 (60%), Gaps = 44/547 (8%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
LN ++F +LP D ++ R+V +AC++ VSPS ++P+L+ + +A ++ +
Sbjct: 3 LNIQNTFTNQLPADENHENFTRQVNNACFSYVSPSP-TKSPKLLHVNPELAKTIGFTEEN 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+F +G + PYA CYGGHQFG WAGQLGDGRAI L ++ +S + LQ
Sbjct: 62 LGSKEFLNLVTGNSLHPNTKPYAMCYGGHQFGNWAGQLGDGRAINLFQVKTDQS--YTLQ 119
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAGKTPYSR ADGLAVLRSSIRE+LC+EAMH LGIPTTR+L L TG V RD+FY+G
Sbjct: 120 LKGAGKTPYSRTADGLAVLRSSIREYLCAEAMHHLGIPTTRSLSLSLTGDQVLRDVFYNG 179
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
N EPGA+VCRV+QSF+RFG++QI A+R D + L +Y IRH+F +++ +K
Sbjct: 180 NTAYEPGAVVCRVSQSFIRFGNFQIFAARN--DKANLAGLMNYTIRHYFPNLQENDK--- 234
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ YA E+ T +++ WQ VGF HGV+NTDNMSILG TI
Sbjct: 235 -----------------DSYAKLFQEIVNATVTMIVHWQRVGFVHGVMNTDNMSILGQTI 277
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD----K 460
DYGP+G+LD +DP +TPNTTD RRY + QP+IGLWN+ Q + T L +D +
Sbjct: 278 DYGPYGWLDNYDPDWTPNTTDSQNRRYRYGQQPNIGLWNLYQLANTFYT--LTEDAAPLE 335
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALS 517
EA + Y +F ++ +M K+G+ K NKQ +I L N+ D T F+R L+
Sbjct: 336 EA---LNSYRNQFETQHLKMMCAKIGIQKPNKQDAILIQALETNLKRVDTDMTIFYRLLA 392
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
+ D +P + E W +W Y + LL G+S ER A MN+V
Sbjct: 393 KARNIIDCI-DAFYIPES-----LEGEVLTEWQAWFEQYQERLLQEGLSSNERIAHMNAV 446
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGV 636
NPKY+LRNY+ Q AIDAAE G++ + L L+++PYDEQP +K+ P WA + G
Sbjct: 447 NPKYILRNYMAQLAIDAAEEGNYQLIDELYSLLKKPYDEQPEYQKWFAKRPDWAKNKAGC 506
Query: 637 CMLSCSS 643
MLSCSS
Sbjct: 507 SMLSCSS 513
>gi|376316029|emb|CCF99432.1| protein belonging to UPF0061 [uncultured Flavobacteriia bacterium]
Length = 516
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 244/538 (45%), Positives = 318/538 (59%), Gaps = 35/538 (6%)
Query: 111 FVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
F +LP DP ++ REVL A Y+ V P + NP L+ S+ + +L+ ++ + +F
Sbjct: 9 FTDQLPADPNLENTRREVLEAVYSFVRP-IKTSNPTLLHVSDEMQHTLKFSNEDIQSKEF 67
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
F +G + L + P+A CY GHQFG WAGQLGDGRAI LGEI N W +QLKG+G
Sbjct: 68 LEFVTGNSVLENSKPFAMCYAGHQFGNWAGQLGDGRAINLGEIKN-----WAVQLKGSGP 122
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSR ADGLAVLRSS+RE+LCSEAMH LG+P+TRAL L TG V RD+ Y+GNP E
Sbjct: 123 TPYSRTADGLAVLRSSVREYLCSEAMHHLGVPSTRALSLSLTGDRVLRDVMYNGNPAHEK 182
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
GAIV RVA+SFLRFG+++I A+R DL ++TL DY I+ HF H+ +K L F
Sbjct: 183 GAIVSRVAKSFLRFGNFEIFAARN--DLKNLKTLTDYTIKSHFSHLGKPSKEVYLQFFQ- 239
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
EV +T ++ WQ VGF HGV+NTDNMSILGLTIDYGP+G
Sbjct: 240 -------------------EVTNKTLEMIIHWQRVGFVHGVMNTDNMSILGLTIDYGPYG 280
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVMERY 469
+L+ FD +TPNTTD +RY + NQP IGLWN+ Q + +L LI++ ++E Y
Sbjct: 281 WLEGFDFGWTPNTTDKQHKRYRYGNQPTIGLWNLYQLANSLYP--LIEEVAPLEEILEGY 338
Query: 470 GTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
+ F + Q +M KLGL + II L NN+ + D T FFR LS+ K +
Sbjct: 339 KSNFEKKSQDMMRAKLGLTSAKETDIDIIQSLENNLQATETDMTIFFRTLSSFKKEQPEK 398
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
EL+ I + W W Y + L S +ER+ MN VNPKYVLRNY
Sbjct: 399 GVELIQDAFYTPDTIKGDVLNNWKQWFADYAKRLEDETTSVDERQQQMNKVNPKYVLRNY 458
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCMLSCSS 643
+ Q AID A+ GD + L L++ PY EQP E + P WA ++ G MLSCSS
Sbjct: 459 MAQLAIDKADKGDTSVLEELYLLLKEPYSEQPKFEHWFAKRPEWARHKVGCSMLSCSS 516
>gi|418523090|ref|ZP_13089115.1| hypothetical protein WS7_18991 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410700360|gb|EKQ58919.1| hypothetical protein WS7_18991 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 518
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 248/548 (45%), Positives = 316/548 (57%), Gaps = 36/548 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L +D+ ++LPGDP S REV A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTHLRFDNRLRQQLPGDPEEGSRRREV-SAAWSAVLPT-PVAAPSLIAHSAEMAQVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + R+
Sbjct: 59 AAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ ++R D+ I F + +
Sbjct: 179 YDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLRQWVDFTIARDFPALAGAGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ YA W +V ERTA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------LYAGWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA D
Sbjct: 277 LTIDYGPYGWVDGYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FPDQAP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSN 518
+ ++R+ ++ + KLGL + Q+I L M +D T FR L +
Sbjct: 336 LQHGLDRFRDTYLACGRHDTAAKLGLAECRDEDLQLIDALRALMRESGMDMTLTFRGLID 395
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKALMNS 576
+ D P L+ D K +A W+ Y L + + R+A M
Sbjct: 396 LSPDHPDPAQ-----LREAFYDEDKRVADAPQLQQWLQRYAARLQQDPLPPDARRARMRL 450
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPG 635
NP+YVLRNYL Q AID AE GD V+ LL++M PYD+QPG + +A P WA R G
Sbjct: 451 ANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRHPYDDQPGRDAFAARRPEWARDRAG 510
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 511 CSMLSCSS 518
>gi|289665685|ref|ZP_06487266.1| hypothetical protein XcampvN_22064 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 518
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 250/549 (45%), Positives = 319/549 (58%), Gaps = 38/549 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L++D+ ++LPGD S REVL A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTQLHFDNYLRQQLPGDSEEGSRRREVL-AAWSSVLPT-PVAAPYLIAHSAEMAHVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+
Sbjct: 59 TSEIASAQFVQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGIDGRRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG P+ EPGAIVCRVA SF+RFG++++ ++RG D ++R D+ I F + +
Sbjct: 179 YDGRPQREPGAIVCRVAPSFIRFGNFELPSARG--DSALLRQWVDFTIARDFPELAGAGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ +YA W +V ERTA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------RYADWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + +A D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAIAPL-FADQTP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRA--- 515
++R+ ++ + KLGL + ++I L M ++D T FR
Sbjct: 336 LQQGLDRFRATYLACDRRDTAAKLGLAECRDEDLELIDALRALMRDAEMDMTLTFRGLID 395
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
LS DP+ D K V G + + W+ Y L +S ER+A M
Sbjct: 396 LSPAHPDPAQLRDAFYDEDKRV---AGAPQLQEWLQ---RYAARLQQDALSPHERRARMR 449
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+QPG + +A P WA R
Sbjct: 450 LANPRYVLRNYLAQQAIDQAEQGDPSGVQELLEVMRRPYDDQPGRDAFAARRPEWARDRA 509
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 510 GCSMLSCSS 518
>gi|86143330|ref|ZP_01061732.1| hypothetical protein MED217_09110 [Leeuwenhoekiella blandensis
MED217]
gi|85830235|gb|EAQ48695.1| hypothetical protein MED217_09110 [Leeuwenhoekiella blandensis
MED217]
Length = 520
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 241/544 (44%), Positives = 324/544 (59%), Gaps = 31/544 (5%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
N ++ F +LP DP ++ R+V+ Y+ V+P E P+L+ S+ + ++L + +E
Sbjct: 3 FNLNNLFTDQLPADPNFENSRRQVMQGYYSFVTPK-ETAKPELIHISDEMLEALGISKEE 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+F F+G PYA YGGHQFG WAGQLGDGRAI L EI + + W +Q
Sbjct: 62 AHTEEFLNVFTGNAVWPETHPYAMLYGGHQFGHWAGQLGDGRAINLFEI-DHNDKHWAVQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADGLAVLRSSIRE+L SEAMH LGIPTTRAL L TG V RD+ YDG
Sbjct: 121 LKGAGETPYSRSADGLAVLRSSIREYLMSEAMHHLGIPTTRALSLALTGDSVLRDVMYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
NP E GA+VCRVA SFLRFG+YQI +R D+ ++ L D+ I+++F + +K
Sbjct: 181 NPAYEKGAVVCRVAPSFLRFGNYQIFTARN--DVAGLQKLVDFTIKNYFPELGAPSKETY 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L F EV+ RT ++ WQ VGF HGV+NTDNMSILGLTI
Sbjct: 239 LKF--------------------FAEVSARTLEMIIHWQRVGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-N 463
DYGP+G+L+ FD +TPNTTD +RY + NQP+IGLWN+ Q + L L++D E
Sbjct: 279 DYGPYGWLEGFDWGWTPNTTDRQHKRYRYGNQPNIGLWNLYQLANALFP--LVEDAEGFE 336
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
+++RY + + +M KLGL + + ++I+ L + + + D T FFR L+ K
Sbjct: 337 EILDRYKEDYAQKSFQMMADKLGLEAPQETDLKLIADLEDCLLATETDMTIFFRKLAAFK 396
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
D S+ L+ L + + K W W +Y L +DE R MN+ NPK
Sbjct: 397 KDASVDGWNLIEDALYDLENTSEAVKTQWKQWFEAYAARLQQDQQNDEARNKRMNATNPK 456
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCML 639
YVLRNY+ Q AIDAAE GDF + L +++++PYD QP EK+ P WA + G ML
Sbjct: 457 YVLRNYMAQLAIDAAEKGDFSLIDELYQVLKKPYDNQPEYEKWFAKRPEWARDKVGCSML 516
Query: 640 SCSS 643
SCSS
Sbjct: 517 SCSS 520
>gi|121957875|sp|Q3BSE3.2|Y2589_XANC5 RecName: Full=UPF0061 protein XCV2589
Length = 518
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 249/545 (45%), Positives = 320/545 (58%), Gaps = 36/545 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L++D+ ++LPGDP + REV A ++ V P+ V P L+A S +A L L+ E
Sbjct: 4 LHFDNRLRQQLPGDPEEGARRREV-GAAWSSVLPTP-VAAPYLIAHSAEMAQVLGLEAAE 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + R+ELQ
Sbjct: 62 IASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTDGGRYELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG
Sbjct: 122 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEAVVRDMFYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ +++ D+ I F + ++
Sbjct: 182 HPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLKQWVDFTIARDFPALAGAGEA-- 237
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
YA W +V ERTA +VA W VGF HGV+NTDNMSILGLTI
Sbjct: 238 ------------------LYADWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILGLTI 279
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA D Y
Sbjct: 280 DYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FADQALLQY 338
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKA 521
++R+ ++ + KLGL + Q+I L M ++D T FR L ++
Sbjct: 339 GLDRFRDTYLACDRRDTAAKLGLAECRDEDLQLIDALRALMRESEMDMTLTFRGLIDLS- 397
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKALMNSVNP 579
PE L+ D K +A W+ Y L +S EER+A M NP
Sbjct: 398 ----PEHPDPAQLRDAFYDEDKRLVDAPQLQQWLQRYAARLQQDPLSPEERRARMRLANP 453
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCM 638
+YVLRNYL Q AID AE GD V+ LL++M RPYD+Q G + +A P WA R G M
Sbjct: 454 RYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRRPYDDQHGRDAFAARRPDWARDRAGCSM 513
Query: 639 LSCSS 643
LSCSS
Sbjct: 514 LSCSS 518
>gi|418516473|ref|ZP_13082646.1| hypothetical protein MOU_06646 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410706752|gb|EKQ65209.1| hypothetical protein MOU_06646 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 518
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 247/548 (45%), Positives = 316/548 (57%), Gaps = 36/548 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L +D+ ++LPGDP S REV A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTHLRFDNRLRQQLPGDPEEGSRRREV-SAAWSAVLPT-PVAAPSLIAHSAEMAQVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + R+
Sbjct: 59 AAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ ++R D+ I F + +
Sbjct: 179 YDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLRQWVDFTIARDFPALAGAGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ YA W +V ERTA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------LYAGWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FPDQAP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSN 518
+ ++R+ ++ + KLGL + Q+I L M +D T FR L +
Sbjct: 336 LQHGLDRFRDTYLACDRHDTAAKLGLAECRDEDLQLIDALRALMRESGMDMTLTFRGLID 395
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKALMNS 576
+ D P L+ D K +A W+ Y + + + R+A M
Sbjct: 396 LSPDHPDPAQ-----LREAFYDEDKRVADAPQLQQWLQRYAARMQQDPLPPDARRARMRL 450
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPG 635
NP+YVLRNYL Q AID AE GD V+ LL++M PYD+QPG + +A P WA R G
Sbjct: 451 ANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRHPYDDQPGRDAFAARRPEWARDRAG 510
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 511 CSMLSCSS 518
>gi|289671302|ref|ZP_06492377.1| hypothetical protein XcampmN_23190 [Xanthomonas campestris pv.
musacearum NCPPB 4381]
Length = 518
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 249/549 (45%), Positives = 318/549 (57%), Gaps = 38/549 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L++D+ ++LPGD S REV A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTQLHFDNCLRQQLPGDSEEGSRRREV-RAAWSSVLPT-PVAAPYLIAHSAEMAHVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+
Sbjct: 59 TSEIASAQFVQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGIDGRRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG P+ EPGAIVCRVA SF+RFG++++ ++RG D ++R D+ I F + +
Sbjct: 179 YDGRPQREPGAIVCRVAPSFIRFGNFELPSARG--DSALLRQWVDFTIARDFPELAGAGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ +YA W +V ERTA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------RYADWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + +A D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAIAPL-FADQTP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRA--- 515
++R+ ++ + KLGL + ++I L M ++D T FR
Sbjct: 336 LQQGLDRFRATYLACDRRDTAAKLGLAECRDEDLELIDALRALMRDAEMDMTLTFRGLID 395
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
LS DP+ D K V G + + W+ Y L +S ER+A M
Sbjct: 396 LSPAHPDPAQLRDAFYDEDKRV---AGAPQLQEWLQ---RYAARLQQDALSPHERRARMR 449
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+QPG + +A P WA R
Sbjct: 450 LANPRYVLRNYLAQQAIDQAEQGDPSGVQELLEVMRRPYDDQPGRDAFAARRPEWARDRA 509
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 510 GCSMLSCSS 518
>gi|383451076|ref|YP_005357797.1| hypothetical protein KQS_09030 [Flavobacterium indicum GPTSA100-9]
gi|380502698|emb|CCG53740.1| Protein of unknown function [Flavobacterium indicum GPTSA100-9]
Length = 518
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 242/542 (44%), Positives = 327/542 (60%), Gaps = 38/542 (7%)
Query: 109 HSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERP 168
++F L D TD+ R V A ++ V+P + P L+ S+ VAD L L+ +
Sbjct: 8 NNFTSNLVADSITDNYVRLVPAAHFSYVNPITPTQ-PFLIHSSKEVADILNLNVDYIQSN 66
Query: 169 DFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGA 228
+F FSG + + P+A Y GHQFG WAGQLGDGRAI LGEI N W +QLKGA
Sbjct: 67 EFTSVFSGTSLGDNSKPFAMNYAGHQFGNWAGQLGDGRAINLGEINN-----WSIQLKGA 121
Query: 229 GKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKE 288
G TPYSR DG AVLRSSIRE+LCSEAMH+LGIPTTRAL L TG V RDM Y+GNP
Sbjct: 122 GPTPYSRRGDGFAVLRSSIREYLCSEAMHYLGIPTTRALALFLTGDDVMRDMLYNGNPAL 181
Query: 289 EPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
E GAIVCRVA SF+RFG++++ AS+G DLD ++ LADY I +F I + +K
Sbjct: 182 EKGAIVCRVAPSFIRFGNFELFASQG--DLDNLKKLADYTIDTYFPEITSQDKQ------ 233
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGP 408
+Y V ++T LV WQ VGF HGV+NTDNMSI G+TIDYGP
Sbjct: 234 --------------RYIDLLKLVTDKTLDLVIHWQRVGFVHGVMNTDNMSIHGITIDYGP 279
Query: 409 FGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVME 467
+G+L+ F+ +TPNTTD RRY F NQPDI LWN+ QF+ +L LI++ ++
Sbjct: 280 YGWLEDFNLEWTPNTTDRENRRYRFGNQPDIMLWNLYQFANSLYP--LIEETAPLESILT 337
Query: 468 RYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
+ + + + + +M K+G +N +++ +LL + + + D T FFR LS V
Sbjct: 338 SFASNYENRFLGMMCSKIGCENHNDSTHKLVYQLLECLQLSETDMTIFFRLLSTVNLQ-D 396
Query: 525 IPEDEL--LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
P+ L + P + +I KE W++W+ Y++++ S G+ DE RK MN++NPKYV
Sbjct: 397 YPDSALSKISPAFYLPNEIDGSIKERWLNWMEDYLKQINSQGVLDEVRKVKMNAINPKYV 456
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCMLSC 641
LRNY+ Q AID A G + + +L+++PY EQP MEK+ P WA + G MLSC
Sbjct: 457 LRNYMAQLAIDEANTGKYEMIDEFFELLKKPYAEQPEMEKWFAKRPDWARTKVGCSMLSC 516
Query: 642 SS 643
SS
Sbjct: 517 SS 518
>gi|21243126|ref|NP_642708.1| hypothetical protein XAC2392 [Xanthomonas axonopodis pv. citri str.
306]
gi|33517049|sp|Q8PJY5.1|Y2392_XANAC RecName: Full=UPF0061 protein XAC2392
gi|21108645|gb|AAM37244.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 518
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 247/548 (45%), Positives = 316/548 (57%), Gaps = 36/548 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L +D+ ++LPGDP S REV ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTHLRFDNRLRQQLPGDPEEGSRRREV-SVAWSAVLPT-PVAAPSLIAHSAEMAQVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + R+
Sbjct: 59 AAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ ++R D+ I F + +
Sbjct: 179 YDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLRQWVDFTIARDFPALAGAGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ YA W +V ERTA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------LYAGWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FPDQAP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSN 518
+ ++R+ ++ + KLGL + Q+I L M ++D T FR L +
Sbjct: 336 LQHGLDRFRDTYLACDRHDTAAKLGLAECRDEDLQLIDALRALMRESEMDMTLTFRGLID 395
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKALMNS 576
+ D P L+ D K +A W+ Y L + + R+A M
Sbjct: 396 LSPDHPDPAQ-----LREAFYDEDKRVADAPQLQQWLQRYAARLQQDPLPPDARRARMRL 450
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPG 635
NP+YVLRNYL Q AID AE GD V+ LL++M PYD+QPG + +A P WA R G
Sbjct: 451 ANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRHPYDDQPGRDAFAARRPEWARDRAG 510
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 511 CSMLSCSS 518
>gi|313202400|ref|YP_004041058.1| hypothetical protein MPQ_2682 [Methylovorus sp. MP688]
gi|312441716|gb|ADQ85822.1| conserved hypothetical protein [Methylovorus sp. MP688]
Length = 522
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 248/550 (45%), Positives = 326/550 (59%), Gaps = 41/550 (7%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L++D+ + ELPGDP + R+V A +++V + V P+++AWS +A +L L +
Sbjct: 3 LSFDNRLLNELPGDPIQGAQLRQVHGALWSRVD-ATPVSAPRMLAWSPEMATTLGLTAGD 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+ SG L G YA CYGGHQFG WAGQLGDGRAI LGE +N ERWELQ
Sbjct: 62 MQSDAMLQALSGNGLLPGMQHYATCYGGHQFGNWAGQLGDGRAIFLGETVNAAGERWELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSS+REFLCSEAM LGIPTTRAL LV TG V RDMFYDG
Sbjct: 122 LKGAGATPYSRRADGRAVLRSSLREFLCSEAMFHLGIPTTRALSLVATGDSVIRDMFYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGAIVCRVA SF+RFG +++ ASRG D+D++R L ++ ++ F
Sbjct: 182 HPEREPGAIVCRVAPSFIRFGHFELPASRG--DIDLLRRLTEFTMQRDF---------AD 230
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
++F H V + W E+ RTA L+A+W VGF HGV+NTDNMSILGLTI
Sbjct: 231 MAFPADMPLHERVPI-------WFGEICRRTALLMAEWMRVGFVHGVMNTDNMSILGLTI 283
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D FDP +TPNTTD GRRYCF QPDI WN+ + + LA
Sbjct: 284 DYGPYGWIDNFDPGWTPNTTDASGRRYCFGRQPDIARWNLERLAEALALLLPEPAPLVE- 342
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ + + + + + K GL + + ++S++ M +VD T FFR L ++
Sbjct: 343 SLGIFDSTYGQAWSQGLAAKFGLRDWQDDDAALMSEIFELMTRAEVDMTMFFRLLGDM-- 400
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAW-------ISWVLSYIQELLSSGISDEERKALM 574
D P+ E L+A R+E W SW+ Y + L +S E R+ M
Sbjct: 401 DMQAPKAE---ALRAAFY-----REELWQDFHPPLYSWLQRYSERLKHDNLSQEARRTAM 452
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YR 633
+ VNP++VLRNYL Q AID A GD +++L M +PYD+ P L P WA ++
Sbjct: 453 HKVNPRFVLRNYLAQQAIDQATEGDTTMLQQLFSAMRQPYDDLPQYAALYALRPDWARHK 512
Query: 634 PGVCMLSCSS 643
G MLSCSS
Sbjct: 513 AGCSMLSCSS 522
>gi|88802174|ref|ZP_01117702.1| hypothetical protein PI23P_05907 [Polaribacter irgensii 23-P]
gi|88782832|gb|EAR14009.1| hypothetical protein PI23P_05907 [Polaribacter irgensii 23-P]
Length = 518
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 242/547 (44%), Positives = 327/547 (59%), Gaps = 39/547 (7%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L+ ++F+ E P DP ++ R+V A ++ V P + NP+++ SE +A L + +E
Sbjct: 3 LHIKNTFIEENPADPVEENTRRQVEKAAFSYVLPK-KTSNPKVLHVSEEMAKELHISSEE 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F +G PYA CY GHQFG WAGQLGDGRAI L E+ + ++ W++Q
Sbjct: 62 TASEFFQDIVTGNQIYPDTKPYAMCYAGHQFGNWAGQLGDGRAINLFEVEH-QNRNWKVQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR ADGLAVLRSS+RE+LCSEAM LG+PTTRAL L +G V RDM YDG
Sbjct: 121 LKGAGETPYSRTADGLAVLRSSVREYLCSEAMFHLGVPTTRALSLSLSGDSVLRDMLYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P E GAIV R A SFLRFGS++I +R ED ++ L DY I+HHF H+ +K
Sbjct: 181 HPAYEKGAIVSRAAPSFLRFGSFEIFTAR--EDTKNLKNLVDYTIKHHFPHLNATSKENY 238
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+ F EV ERT ++ WQ +GF HGV+NTDNMSILGLTI
Sbjct: 239 IQFFK--------------------EVTERTLGMIIHWQRIGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL-AAAKLIDDKEAN 463
D+GP+G+L+ FD +TPNTTD +RY + NQP+IGLWN+ Q + L + + EA
Sbjct: 279 DFGPYGWLEGFDFGWTPNTTDNQHKRYRYGNQPNIGLWNLYQLANALYPIIEEVAPLEA- 337
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVK 520
V+ +Y T F + +M KLG +K+ +I L + + + + D T FFR LS
Sbjct: 338 -VLNQYKTDFESKSLQMMQSKLGFFSSDKKDIDLIQNLEDLLQLTETDMTIFFRNLSKFT 396
Query: 521 ADPS---IPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
+ S + ED LK + ++I K +W W Y + L +S +ER A MN+V
Sbjct: 397 EESSGLKLIEDA-FYDLKNISIEI----KSSWNLWFEKYAERLQKEPLSPKERTAKMNAV 451
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGV 636
NPKYVLRNY+ Q AIDAA+ GD+ + L +L+++PY EQP EK+ P WA + G
Sbjct: 452 NPKYVLRNYMSQMAIDAADEGDYALIDELFQLLKQPYSEQPDKEKWFAKRPEWARDKAGC 511
Query: 637 CMLSCSS 643
MLSCSS
Sbjct: 512 SMLSCSS 518
>gi|381171469|ref|ZP_09880614.1| YdiU protein [Xanthomonas citri pv. mangiferaeindicae LMG 941]
gi|380688104|emb|CCG37101.1| YdiU protein [Xanthomonas citri pv. mangiferaeindicae LMG 941]
Length = 518
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 247/548 (45%), Positives = 315/548 (57%), Gaps = 36/548 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L +D+ ++LPGDP S REV ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTHLRFDNRLRQQLPGDPEEGSRRREV-SVAWSAVLPT-PVAAPSLIAHSAEMAQVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + R+
Sbjct: 59 AAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ ++R D+ I F + +
Sbjct: 179 YDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLRQWVDFTIARDFPALAGAGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ YA W +V ERTA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------LYAGWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FPDQAP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSN 518
+ ++R+ ++ + KLGL + Q+I L M +D T FR L +
Sbjct: 336 LQHGLDRFRDTYLACDRHDTAAKLGLAECRDEDLQLIDALRALMRESGMDMTLTFRGLID 395
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKALMNS 576
+ D P L+ D K +A W+ Y L + + R+A M
Sbjct: 396 LSPDHPDPAQ-----LREAFYDEDKRVADAPQLQQWLQRYAARLQQDPLPPDARRARMRL 450
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPG 635
NP+YVLRNYL Q AID AE GD V+ LL++M PYD+QPG + +A P WA R G
Sbjct: 451 ANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRHPYDDQPGRDAFAARRPEWARDRAG 510
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 511 CSMLSCSS 518
>gi|402496152|ref|ZP_10842861.1| hypothetical protein AagaZ_17280 [Aquimarina agarilytica ZC1]
Length = 522
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 242/543 (44%), Positives = 331/543 (60%), Gaps = 41/543 (7%)
Query: 111 FVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
F +ELP D D+ R+V AC++ V+P +NP L+ S ++ +L L ++ +R +F
Sbjct: 11 FTKELPADKVLDNSRRQVEGACFSYVNPKLP-KNPSLLHVSTAMLRNLGLKEEDGQRTEF 69
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
SG L PYA CYGGHQFG WAGQLGDGRAI L EI + ++ W LQLKGAG+
Sbjct: 70 LYVVSGKVVLPNTKPYAMCYGGHQFGNWAGQLGDGRAINLTEIAH-NNKIWALQLKGAGE 128
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSR ADGLAVLRSSIRE+LCSEAM++LG+PTTRAL + +G V RD+ Y+GN E
Sbjct: 129 TPYSRTADGLAVLRSSIREYLCSEAMYYLGVPTTRALSIALSGSKVLRDVMYNGNSAYEK 188
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
GAIV RVA SFLRFG+Y+I ASRG D ++TL DY I +HF ++ +K+ L F
Sbjct: 189 GAIVSRVAPSFLRFGNYEIFASRG--DNATLKTLVDYTINNHFSYLGTPSKAVYLDFLR- 245
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
EVA+++ +V WQ VGF HGV+NTDNMSILGLTIDYGP+G
Sbjct: 246 -------------------EVAKKSMEMVIHWQRVGFVHGVMNTDNMSILGLTIDYGPYG 286
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERY 469
+L+ +D ++TPNTTD +RY + QP I LWN+ Q + L LI++ + ++E Y
Sbjct: 287 WLEGYDHNWTPNTTDSSHKRYRYGTQPQIVLWNLLQLARALYG--LIEEAASLEEILEEY 344
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADP--- 523
+ +M KLGL +++++ L + + + D T FFR L+++K +
Sbjct: 345 RINVKVAHLEMMRNKLGLNTKIDNDEKLVEDLEKVLQLTETDMTIFFRNLADLKKEQFHD 404
Query: 524 --SIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+I ED ++ KE WI W Y + L +D+ERK MN+VNPKY
Sbjct: 405 WFNIVEDAFYNH-----KEVSGTIKENWIKWFNDYGKRLSMEVWTDKERKITMNTVNPKY 459
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCMLS 640
VLRNY+ Q AI+AA+ GD+ + L +L+++PYDEQP K+ P WA ++ G MLS
Sbjct: 460 VLRNYMAQLAINAADDGDYTVLDELFELLKKPYDEQPNALKWFAKRPEWARHKVGCSMLS 519
Query: 641 CSS 643
CSS
Sbjct: 520 CSS 522
>gi|384419063|ref|YP_005628423.1| hypothetical protein XOC_2109 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353461976|gb|AEQ96255.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 518
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 249/549 (45%), Positives = 319/549 (58%), Gaps = 38/549 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L++D+ ++LPGD + REV A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTQLHFDNRLRQQLPGDQEEGARRREV-RAAWSAVMPT-PVAAPYLIAHSAEMAHVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+
Sbjct: 59 ASEVASAAFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGIDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGLTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG P+ EPGAIVCRVA SF+RFG++++ ++RG D ++R D+ I F + +
Sbjct: 179 YDGRPQREPGAIVCRVAPSFIRFGNFELPSARG--DNALLRQWVDFTIARDFPELAGTGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ +YA W +V ERTA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------RYADWFAQVCERTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + +A D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAVAPL-FADQAP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRA--- 515
++R+ ++ + KLGL + ++I L M ++D T FR
Sbjct: 336 LQQGLDRFRDTYLASDRRHTAAKLGLAECRDEDLELIDALRALMRDAEMDMTLTFRGLID 395
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
LS V DP+ D K V + +E W+ Y L +S +ER+ALM
Sbjct: 396 LSPVHPDPAQLHDAFYDDDKRVA--SASQLQE----WLQRYAARLQQDALSPDERRALMR 449
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RP 634
NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+Q G +A P WA R
Sbjct: 450 LANPRYVLRNYLAQQAIDQAEQGDPSGVQELLEVMRRPYDDQSGRAAFAARRPEWARDRA 509
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 510 GCSMLSCSS 518
>gi|390992318|ref|ZP_10262555.1| YdiU protein [Xanthomonas axonopodis pv. punicae str. LMG 859]
gi|372552934|emb|CCF69530.1| YdiU protein [Xanthomonas axonopodis pv. punicae str. LMG 859]
Length = 518
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 247/548 (45%), Positives = 316/548 (57%), Gaps = 36/548 (6%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L++D+ ++LPGDP S REV A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTHLHFDNRLRQQLPGDPEEGSRRREV-SAAWSAVLPT-PVAAPSLIAHSAEMAQVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + R+
Sbjct: 59 AAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ ++R D+ I F + +
Sbjct: 179 YDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLRQWVDFTIARDFPALAGAGE 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ YA W +V E TA +VA W VGF HGV+NTDNMSILG
Sbjct: 237 A--------------------LYAGWFAQVCECTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA D
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FPDQAP 335
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSN 518
+ ++R+ ++ + KLGL + Q+I L M +D T FR L +
Sbjct: 336 LQHGLDRFRDTYLACGRHDTAAKLGLAECRDEDLQLIDALRALMRESGMDMTLTFRGLID 395
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKALMNS 576
+ D P L+ D K +A W+ Y L + + R+A M
Sbjct: 396 LSPDHPDPAQ-----LREAFYDEDKRVADAPQLQQWLQRYAARLQQDPLPPDARRARMRL 450
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPG 635
NP+YVLRNYL Q AID AE GD V+ LL++M PYD+QPG + +A P WA R G
Sbjct: 451 ANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMRHPYDDQPGRDAFAARRPEWARDRAG 510
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 511 CSMLSCSS 518
>gi|254000441|ref|YP_003052504.1| hypothetical protein Msip34_2740 [Methylovorus glucosetrophus
SIP3-4]
gi|253987120|gb|ACT51977.1| protein of unknown function UPF0061 [Methylovorus glucosetrophus
SIP3-4]
Length = 521
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 247/550 (44%), Positives = 324/550 (58%), Gaps = 41/550 (7%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L++D+ + ELPGDP R+V A +++V + V P+++AWS +A +L L +
Sbjct: 2 LSFDNRLLNELPGDPIQGPQLRQVHGALWSRVD-ATPVSAPRMLAWSPEMATTLGLTAAD 60
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+ SG L G YA CYGGHQFG WAGQLGDGRAI LGE +N ERWELQ
Sbjct: 61 MQSDAMLQALSGNGLLPGMQHYATCYGGHQFGNWAGQLGDGRAIFLGETVNAAGERWELQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSS+REFLCSEAM LGIPTTRAL LV TG V RDMFYDG
Sbjct: 121 LKGAGATPYSRRADGRAVLRSSLREFLCSEAMFHLGIPTTRALSLVATGDSVIRDMFYDG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P+ EPGAIVCRVA SF+RFG +++ ASR D+D++R L ++ ++ F +
Sbjct: 181 HPEREPGAIVCRVAPSFIRFGHFELPASRA--DIDLLRRLTEFTMQRDF---------AN 229
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
++F H V + W E+ RTA L+A+W VGF HGV+NTDNMSILGLTI
Sbjct: 230 MAFPADMPLHERVPI-------WFGEICRRTALLMAEWMRVGFVHGVMNTDNMSILGLTI 282
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D FDP +TPNTTD GRRYCF QPDI WN+ + + LA
Sbjct: 283 DYGPYGWIDNFDPGWTPNTTDASGRRYCFGRQPDIARWNLERLAEALALLLPEPAPLVE- 341
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ + + + + + K GL + + ++S++ M +VD T FFR L ++
Sbjct: 342 SLGMFDSTYGQAWSQGLAAKFGLRDWQDDDAALMSEIFELMTRAEVDMTMFFRLLGDM-- 399
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAW-------ISWVLSYIQELLSSGISDEERKALM 574
D P+ E L+A R+E W SW+ Y + L +S E R+ M
Sbjct: 400 DMQAPKAE---ALRAAFY-----REELWEDFHPPLYSWLQRYGERLKRDNLSQEARQTAM 451
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
+ VNP++VLRNYL Q AID A GD +++L M +PYD+ P L P WA +
Sbjct: 452 HKVNPRFVLRNYLAQQAIDQATEGDTSMLQQLFSAMRQPYDDLPQHAALYALRPDWARQK 511
Query: 635 GVC-MLSCSS 643
C MLSCSS
Sbjct: 512 AGCSMLSCSS 521
>gi|254522103|ref|ZP_05134158.1| conserved hypothetical protein [Stenotrophomonas sp. SKA14]
gi|219719694|gb|EED38219.1| conserved hypothetical protein [Stenotrophomonas sp. SKA14]
Length = 521
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 249/544 (45%), Positives = 321/544 (59%), Gaps = 39/544 (7%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+ + LPGDP + REVL A ++ V P+ V P L+AWS VA L D E E
Sbjct: 9 DNRLLNALPGDPESGPRRREVLGAAWSPVMPT-PVAAPALLAWSPEVARMLGFDAAEVEG 67
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
F F G AG P+A YGGHQFG WAGQLGDGRAI+LGE++ WELQLKG
Sbjct: 68 EGFARVFGGNALYAGMQPWAANYGGHQFGHWAGQLGDGRAISLGELVAPDGRHWELQLKG 127
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR ADG AVLRSSIREFLCSEAMH LG+P+TRAL LV TG+ V RDMFYDG+P+
Sbjct: 128 AGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPSTRALSLVGTGEDVVRDMFYDGHPR 187
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
EPGAIVCRV+ SFLRFGS+++ ASRG+ L +R L D I F +E + E+L
Sbjct: 188 AEPGAIVCRVSPSFLRFGSFELPASRGETAL--LRQLVDACITRDFPELE--GQGEAL-- 241
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
Y W ++A RTA ++A W VGF HGV+NTDN+S+LGLT+DYG
Sbjct: 242 ----------------YGDWFAQIAVRTAEMIAHWMRVGFVHGVMNTDNLSVLGLTLDYG 285
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+G+++ FDP +TPNTTD GRRY F QP + WN+++ + LA D +
Sbjct: 286 PYGWVEDFDPDWTPNTTDAQGRRYRFGTQPQVAYWNLSRLAQALAPL-FADVAPLQAGLA 344
Query: 468 RYGTKFM---DEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y + F+ A + Q+ + M +D T +RAL ++ DP+
Sbjct: 345 AYQSTFVACTRRDAAAKLGLAAAADDDLQLYLRWQQLMQDGAMDMTLAWRAL--MRLDPA 402
Query: 525 IPEDELLVPLKAVLLDIGKERKEAWIS----WVLSYIQELLSSGISDEERKALMNSVNPK 580
P+ + L AV G+ R++A + W+ Y L + +S ER A M + NP
Sbjct: 403 APDAAV---LDAVY--YGETRQQAVQAPLQQWLQDYATRLRADPLSAGERMAKMAAANPL 457
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCML 639
YVLRN+L Q AID AE GD G V+ L +++ PY E+PG+ +A PAWA R G ML
Sbjct: 458 YVLRNWLAQEAIDRAEQGDLGGVQALQEVLRDPYTERPGLGHFAGKRPAWADNRAGCSML 517
Query: 640 SCSS 643
SCSS
Sbjct: 518 SCSS 521
>gi|374724542|gb|EHR76622.1| hypothetical protein MG2_1034 [uncultured marine group II
euryarchaeote]
Length = 507
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 250/552 (45%), Positives = 333/552 (60%), Gaps = 52/552 (9%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ L D W F+ E PGD ++D R+V AC++KV+P + P+L W++ V L
Sbjct: 1 MTPLNDCEWSTRFLDETPGDAQSDGPSRQVPGACWSKVTPF-QAPKPELRLWAKDVGAML 59
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L R D +F G L G YAQ YGGHQFG WAGQLGDGRAITLGE L
Sbjct: 60 GLS-----RGDEDVFAGGRLTL-GMAAYAQRYGGHQFGNWAGQLGDGRAITLGE-LKASQ 112
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+ELQLKGAG TPYSRFADG AVLRSS+RE+LCSEAMH LG+PTTRAL L TTG+ V R
Sbjct: 113 GTFELQLKGAGHTPYSRFADGKAVLRSSVREYLCSEAMHHLGVPTTRALSLCTTGESVMR 172
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
D+ Y+GN E GA+VCRVA SF+RFGS+QIHA+ G D +R L ++ +RHHF
Sbjct: 173 DVLYNGNKALELGAVVCRVAPSFIRFGSFQIHAATG--DQVTLRALVEHTVRHHF----- 225
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
HSV + AWA EVAE TA ++A W VGF HGV+NTDNMS
Sbjct: 226 -------------PTHSVAN--DAGIVAWANEVAESTALMIAHWMRVGFVHGVMNTDNMS 270
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
I GLTIDYGP+G+L+ ++P +TPNTTD RRY +A QP IG WN+A++ +L L++
Sbjct: 271 IHGLTIDYGPYGWLEDYNPGWTPNTTDASNRRYRYAQQPQIGAWNLARWLESL--IPLME 328
Query: 459 DKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
E V++ YG F + + + KLGL + ++++++ L + + ++D T FFR
Sbjct: 329 QPEQLEGVLDHYGEVFNEHHNRMWVAKLGLGSWVESDQKLVANLNSALQTIEIDMTIFFR 388
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWI-SWVLSYIQELLSSGISDEERKAL 573
LS + A P + L P + + ++ W+ +W++ + G D++
Sbjct: 389 LLSTLDA----PTLDQLSPSFYEPIGVAEQPLNEWLEAWMIR------TDGAPDQD---A 435
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK-YARLPPAWAY 632
M + NPKYVLRN++ Q AID AE GD+ L +L++ PYDEQP ME + + P WA
Sbjct: 436 MKAANPKYVLRNWMAQLAIDDAEKGDYATCEALEQLLKAPYDEQPEMEADWFQRRPEWAR 495
Query: 633 -RPGVCMLSCSS 643
R G MLSCSS
Sbjct: 496 NRVGCSMLSCSS 507
>gi|58582341|ref|YP_201357.1| hypothetical protein XOO2718 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58426935|gb|AAW75972.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 557
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 254/557 (45%), Positives = 325/557 (58%), Gaps = 46/557 (8%)
Query: 98 KLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADS 157
+L + L++D+ ++LPG + REV A ++ V P+ V P L+A S +A
Sbjct: 36 RLARMTQLHFDNRLRQQLPGYQEEGARRREV-RAAWSAVMPT-PVAAPYLIAHSAEMAHV 93
Query: 158 LELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLK 217
L LD E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + +
Sbjct: 94 LGLDASEVASAAFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGID 153
Query: 218 SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
R+ELQLKGAG TPYSR ADG AVLRSSIREFLCSE+MH LG+PTTRAL LV TG V
Sbjct: 154 GGRYELQLKGAGPTPYSRGADGRAVLRSSIREFLCSESMHHLGVPTTRALSLVGTGDAVV 213
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDMFYDG P+ EPGAIVCRVA SF+RFG++++ ++RG D ++R D+ I F E
Sbjct: 214 RDMFYDGRPQREPGAIVCRVAPSFIRFGNFELPSARG--DNALLRQWVDFTIARDF--PE 269
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ +E+L YA W +V +RTA +VA W VGF HGV+NTDNM
Sbjct: 270 LVGTAEAL------------------YADWFAQVCQRTAVMVAHWMRVGFVHGVMNTDNM 311
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + + A L
Sbjct: 312 SILGLTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAM--APLF 369
Query: 458 DDKEANYVMERYGTKFMDEYQAI----MTKKLGLPKYNK---QIISKLLNNMAVDKVDYT 510
D+ +++ +F D Y A KLGL + ++I L M ++D T
Sbjct: 370 ADQAP---LQQGLNRFRDTYLACDRRDTAAKLGLAECRDEDLELIDALRALMRDAEMDMT 426
Query: 511 NFFRA---LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD 567
FR LS V DP+ D K V + +E W+ Y L +S
Sbjct: 427 LTFRGLIDLSPVHPDPAQLHDAFYDDHKRVA--SASQLQE----WLQRYAARLQQDALSP 480
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLP 627
+ER+ALM NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+Q G +A
Sbjct: 481 DERRALMRLANPRYVLRNYLAQQAIDQAEQGDPSGVQELLEVMRRPYDDQSGRAAFAARR 540
Query: 628 PAWAY-RPGVCMLSCSS 643
P WA R G MLSCSS
Sbjct: 541 PEWARDRAGCSMLSCSS 557
>gi|146300543|ref|YP_001195134.1| hypothetical protein Fjoh_2793 [Flavobacterium johnsoniae UW101]
gi|189039770|sp|A5FG48.1|Y2793_FLAJ1 RecName: Full=UPF0061 protein Fjoh_2793
gi|146154961|gb|ABQ05815.1| protein of unknown function UPF0061 [Flavobacterium johnsoniae
UW101]
Length = 522
Score = 430 bits (1106), Expect = e-117, Method: Compositional matrix adjust.
Identities = 233/548 (42%), Positives = 330/548 (60%), Gaps = 32/548 (5%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+++L ++ F ELP DP + R+V + ++ V+P+ + NP+L+ SE A + +
Sbjct: 1 MKNLKINNRFTAELPADPDLTNETRQVKNTAFSYVNPT-KPSNPKLIHASEETAALVGIS 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+E +F FSG L PYA CY GHQFG WAGQLGDGRAI L E+ N + +
Sbjct: 60 KEEIHSEEFLNVFSGKEILPETQPYAMCYAGHQFGNWAGQLGDGRAINLTEVEN-NNTFY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
LQLKGAGKTPYSR ADGLAVLRSSIRE+LC+EAM+ LG+PTTR+L L+ +G V RD+
Sbjct: 119 TLQLKGAGKTPYSRTADGLAVLRSSIREYLCAEAMYHLGVPTTRSLSLILSGDQVLRDIL 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
Y+GNP E GA+VCRVA SF+RFGS+++ A+R + L ++ +Y I+H+F I K
Sbjct: 179 YNGNPAYEKGAVVCRVAPSFIRFGSFEMLAARNE--LKNLKQFVEYTIKHYFPEITGEPK 236
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ L F +VA+ T ++ WQ VGF HGV+NTDNMS+ G
Sbjct: 237 EQYLQFFK--------------------KVADTTREMILHWQRVGFVHGVMNTDNMSVHG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
+TIDYGP+G+L+ +DP++TPNTTD +RY F NQP + WN+ Q + A LI++ E
Sbjct: 277 ITIDYGPYGWLENYDPNWTPNTTDSQNKRYRFGNQPQVAHWNLYQLAN--AIYPLINETE 334
Query: 462 A-NYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALS 517
++E + F+ +Y+ + KLGL + + +I L + + + D T FFR LS
Sbjct: 335 GLEKILESFMDDFILDYKEMFLNKLGLFTSTETDNDLIDNLEAVLQLTETDMTIFFRNLS 394
Query: 518 NVKADPSIPEDELLVPLKAVLL-DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNS 576
+VK S+ + + + ++ E +AW W Y+ L + +SDE R MN
Sbjct: 395 SVKKTDSVEKAIEKIQFAFYKIEEVSGEILDAWKKWFSVYLDRLNAEVLSDEVRLQKMNL 454
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPG 635
+NPKYVLRNY+ Q AIDAA+ D+ V L L+++PYDEQP +K+ P WA + G
Sbjct: 455 INPKYVLRNYMAQLAIDAADKEDYSLVNELYTLLQKPYDEQPEYQKWFAKRPDWATSKVG 514
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 515 CSMLSCSS 522
>gi|163787345|ref|ZP_02181792.1| hypothetical protein FBALC1_02362 [Flavobacteriales bacterium
ALC-1]
gi|159877233|gb|EDP71290.1| hypothetical protein FBALC1_02362 [Flavobacteriales bacterium
ALC-1]
Length = 520
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 242/546 (44%), Positives = 328/546 (60%), Gaps = 35/546 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
LN +F RELP D T++ R+V A ++ V+P NP+L+ S +A+++ L+ K+
Sbjct: 3 LNIKDTFNRELPSDSNTENTRRKVFEATHSYVNPKVP-SNPKLLHASIEMANAIGLEEKD 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F FSGA PYA Y GHQFG WAGQLGDGRAI L E+ + K+ RW LQ
Sbjct: 62 INSKAFLELFSGAIVQPKTKPYAMAYAGHQFGNWAGQLGDGRAINLFEVEHHKN-RWALQ 120
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR DGLAVLRSSIRE+LCSEAMH LG+PTTRAL L+ +G V RDM Y+G
Sbjct: 121 LKGAGETPYSRQGDGLAVLRSSIREYLCSEAMHHLGVPTTRALSLMLSGDDVLRDMLYNG 180
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
N E GAIV R+A +F+RFG++++ A+R D ++ L DY I++ + + +K
Sbjct: 181 NADYEKGAIVSRLAPTFIRFGNFELFAARN--DHSNLKKLTDYTIKYFYPELGKPSKE-- 236
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
Y EVA +T ++ WQ VGF HGV+NTDNMSILGLTI
Sbjct: 237 ------------------IYIKLFQEVANKTLDMIVHWQRVGFVHGVMNTDNMSILGLTI 278
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-N 463
DYGP+G+L+ FD +TPNTTD +RY + NQP+IGLWN+ Q + L L+++ E
Sbjct: 279 DYGPYGWLEGFDFGWTPNTTDKQNKRYRYGNQPNIGLWNLLQLANALYP--LVEENEPFE 336
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVK 520
++++Y T F + A+M K+GL K K ++++ L + + V + D T FFR LSN +
Sbjct: 337 TILKQYQTDFETKSLAMMRSKIGLEKQEKDDAKLMADLEDCLLVWETDMTIFFRLLSNYR 396
Query: 521 ADPSIPEDELLVPLKAVL--LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVN 578
P + V KA I E W W +Y Q L ++D+ER MN VN
Sbjct: 397 TGN--PNSGIEVIKKAFYGSESIKDTILEQWKGWFTAYDQRLQLEELTDQERHVKMNLVN 454
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVC 637
PKYVLRNY+ Q AID A GD+ + +L +L+++PY EQP E + P WA ++ G
Sbjct: 455 PKYVLRNYMAQLAIDDANKGDYKLIDKLFQLLKQPYAEQPENESWFAKRPDWARHKVGCS 514
Query: 638 MLSCSS 643
MLSCSS
Sbjct: 515 MLSCSS 520
>gi|188576175|ref|YP_001913104.1| hypothetical protein PXO_00396 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|226706087|sp|B2SHR2.1|Y396_XANOP RecName: Full=UPF0061 protein PXO_00396
gi|188520627|gb|ACD58572.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 518
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 254/553 (45%), Positives = 323/553 (58%), Gaps = 46/553 (8%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L++D+ ++LPG + REV A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTQLHFDNRLRQQLPGYQEEGARRREV-RAAWSAVMPT-PVAAPYLIAHSAEMAHVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+
Sbjct: 59 ASEVASAAFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGIDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSE+MH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSESMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG P+ EPGAIVCRVA SF+RFG++++ ++RG D ++R D+ I F E +
Sbjct: 179 YDGRPQREPGAIVCRVAPSFIRFGNFELPSARG--DNALLRQWVDFTIARDF--PELVGT 234
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+E+L YA W +V +RTA +VA W VGF HGV+NTDNMSILG
Sbjct: 235 AEAL------------------YADWFAQVCQRTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + A A L D+
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQ--AVAPLFADQA 334
Query: 462 ANYVMERYGTKFMDEYQAI----MTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFR 514
+++ +F D Y A KLGL + ++I L M ++D T FR
Sbjct: 335 P---LQQGLNRFRDTYLACDRRDTAAKLGLAECRDEDLELIDALRALMRDAEMDMTLTFR 391
Query: 515 A---LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
LS V DP+ D K V + +E W+ Y L +S +ER+
Sbjct: 392 GLIDLSPVHPDPAQLHDAFYDDHKRVA--SASQLQE----WLQRYAARLQQDALSPDERR 445
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
ALM NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+Q G +A P WA
Sbjct: 446 ALMRLANPRYVLRNYLAQQAIDQAEQGDPSGVQELLEVMRRPYDDQSGRAAFAARRPEWA 505
Query: 632 Y-RPGVCMLSCSS 643
R G MLSCSS
Sbjct: 506 RDRAGCSMLSCSS 518
>gi|84624220|ref|YP_451592.1| hypothetical protein XOO_2563 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|121957871|sp|Q2P2A9.1|Y2563_XANOM RecName: Full=UPF0061 protein XOO2563
gi|121957879|sp|Q5GZ99.2|Y2718_XANOR RecName: Full=UPF0061 protein XOO2718
gi|84368160|dbj|BAE69318.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 518
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 253/553 (45%), Positives = 323/553 (58%), Gaps = 46/553 (8%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L++D+ ++LPG + REV A ++ V P+ V P L+A S +A L LD
Sbjct: 1 MTQLHFDNRLRQQLPGYQEEGARRREV-RAAWSAVMPT-PVAAPYLIAHSAEMAHVLGLD 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE + + R+
Sbjct: 59 ASEVASAAFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGIDGGRY 118
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREFLCSE+MH LG+PTTRAL LV TG V RDMF
Sbjct: 119 ELQLKGAGPTPYSRGADGRAVLRSSIREFLCSESMHHLGVPTTRALSLVGTGDAVVRDMF 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG P+ EPGAIVCRVA SF+RFG++++ ++RG D ++R D+ I F E +
Sbjct: 179 YDGRPQREPGAIVCRVAPSFIRFGNFELPSARG--DNALLRQWVDFTIARDF--PELVGT 234
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+E+L YA W +V +RTA +VA W VGF HGV+NTDNMSILG
Sbjct: 235 AEAL------------------YADWFAQVCQRTAVMVAHWMRVGFVHGVMNTDNMSILG 276
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + + A L D+
Sbjct: 277 LTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQAM--APLFADQA 334
Query: 462 ANYVMERYGTKFMDEYQAI----MTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFR 514
+++ +F D Y A KLGL + ++I L M ++D T FR
Sbjct: 335 P---LQQGLNRFRDTYLACDRRDTAAKLGLAECRDEDLELIDALRALMRDAEMDMTLTFR 391
Query: 515 A---LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
LS V DP+ D K V + +E W+ Y L +S +ER+
Sbjct: 392 GLIDLSPVHPDPAQLHDAFYDDHKRVA--SASQLQE----WLQRYAARLQQDALSPDERR 445
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
ALM NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+Q G +A P WA
Sbjct: 446 ALMRLANPRYVLRNYLAQQAIDQAEQGDPSGVQELLEVMRRPYDDQSGRAAFAARRPEWA 505
Query: 632 Y-RPGVCMLSCSS 643
R G MLSCSS
Sbjct: 506 RDRAGCSMLSCSS 518
>gi|190573990|ref|YP_001971835.1| hypothetical protein Smlt2024 [Stenotrophomonas maltophilia K279a]
gi|424668386|ref|ZP_18105411.1| UPF0061 protein [Stenotrophomonas maltophilia Ab55555]
gi|190011912|emb|CAQ45533.1| conserved hypothetical protein [Stenotrophomonas maltophilia K279a]
gi|401068648|gb|EJP77172.1| UPF0061 protein [Stenotrophomonas maltophilia Ab55555]
Length = 521
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 248/549 (45%), Positives = 319/549 (58%), Gaps = 49/549 (8%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+ + LPGDP + REVL A ++ V P+ V P L+AW+ VA L D E E
Sbjct: 9 DNRLLHTLPGDPESGPRRREVLGAAWSPVMPT-PVTAPTLLAWAPDVAAMLGFDTAEVES 67
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
F F G AG P+A YGGHQFG WAGQLGDGRAI+LGE++ WELQLKG
Sbjct: 68 EGFARVFGGNALYAGMQPWAANYGGHQFGHWAGQLGDGRAISLGELVAPDGRHWELQLKG 127
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR ADG AVLRSSIREFLCSEAMH L +PTTRAL LV TG+ V RDMFYDG+P+
Sbjct: 128 AGPTPYSRGADGRAVLRSSIREFLCSEAMHHLSVPTTRALSLVGTGEDVVRDMFYDGHPR 187
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
EPGAIVCRV+ SFLRFGS+++ ASRG+ L +R L D I F +E + E+L
Sbjct: 188 AEPGAIVCRVSPSFLRFGSFELPASRGETAL--LRQLVDACIARDFPELE--GQGEAL-- 241
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
Y W ++A RTA ++A W VGF HGV+NTDN+S+LGLT+DYG
Sbjct: 242 ----------------YGDWFAQIAVRTAEMIAHWMRVGFVHGVMNTDNLSVLGLTLDYG 285
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+G+++ FDP +TPNTTD GRRY F QP + WN+++ + L+ D + +
Sbjct: 286 PYGWVEDFDPDWTPNTTDAQGRRYRFGTQPQVAYWNLSRLAQALSPL-FADVAQLQAGLA 344
Query: 468 RYGTKFM----------DEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALS 517
Y + F+ A LGL + +Q+ M +D T +RAL
Sbjct: 345 AYQSTFVACTRRDAAAKLGLAAADDDDLGLYQRWQQL-------MQDGGMDMTLAWRAL- 396
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKALMN 575
++ DP+ P+ + L AV D +++ + W+ Y L + +S ER A M
Sbjct: 397 -MRVDPAAPDVGV---LDAVYYDESRQQAVQAPLQQWLQDYAARLQADPLSASERAAKMA 452
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRP 634
NP YVLRN+L Q AID AE GD G V L ++ PY E+ G+E +A PAWA R
Sbjct: 453 KANPLYVLRNWLAQEAIDRAEQGDLGGVHALQDVLRDPYTERAGLEHFAGKRPAWADNRA 512
Query: 635 GVCMLSCSS 643
G MLSCSS
Sbjct: 513 GCSMLSCSS 521
>gi|408824007|ref|ZP_11208897.1| hypothetical protein PgenN_12833 [Pseudomonas geniculata N1]
Length = 521
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 249/542 (45%), Positives = 320/542 (59%), Gaps = 35/542 (6%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+ ++ LPGDP + REVL A ++ V P+ V P L+AWS VA L D E E
Sbjct: 9 DNRLLQTLPGDPESGPRRREVLGAAWSPVMPT-PVTAPTLLAWSPDVAAMLGFDTAEVES 67
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
F F G AG P+A YGGHQFG WAGQLGDGRAI+LGE++ WELQLKG
Sbjct: 68 ESFAQVFGGNALYAGMQPWAANYGGHQFGHWAGQLGDGRAISLGELVAPDGRHWELQLKG 127
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG V RDMFYDG+P+
Sbjct: 128 AGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGDDVVRDMFYDGHPR 187
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
EPGAIVCRV+ SFLRFGS+++ ASRG+ L ++ L D I F + + E+L
Sbjct: 188 AEPGAIVCRVSPSFLRFGSFELPASRGETAL--LQHLVDACIARDFPELH--GQGEAL-- 241
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
Y W ++A RTA ++A W VGF HGV+NTDN+S+LGLT+DYG
Sbjct: 242 ----------------YGDWFAQIAVRTAEMIAHWMRVGFVHGVMNTDNLSVLGLTLDYG 285
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+G+++ FDP +TPNTTD GRRY F QP + WN+++ + L+ D + +
Sbjct: 286 PYGWVEDFDPDWTPNTTDAQGRRYRFGTQPQVAYWNLSRLAQALSPL-FADVEPLQAGLA 344
Query: 468 RYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y + F+ + KLGL + Q+ + M +D T +RAL ++ DP
Sbjct: 345 AYQSTFVACTRRDAAAKLGLAAADDDDLQLYLRWQQLMQDGGMDMTLAWRAL--MRIDPV 402
Query: 525 IPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
P+ L L AV D +++ + W+ Y L + +S ER A M + NP YV
Sbjct: 403 APDVAL---LDAVYYDEARQQAVQAPLQQWLQDYAVRLQADPLSASERLAKMTAANPLYV 459
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCMLSC 641
LRN+L Q AID AE GD G V L ++ PY E+ G+E +A PAWA R G MLSC
Sbjct: 460 LRNWLAQEAIDRAEQGDLGGVHALQDVLRNPYTERAGLEHFASKRPAWADNRAGCSMLSC 519
Query: 642 SS 643
SS
Sbjct: 520 SS 521
>gi|456734268|gb|EMF59090.1| Selenoprotein O [Stenotrophomonas maltophilia EPM1]
Length = 521
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 248/548 (45%), Positives = 316/548 (57%), Gaps = 47/548 (8%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+ + LPGDP + REVL A ++ V P+ V P L+AW+ VA L D E E
Sbjct: 9 DNRLLHTLPGDPESGPRRREVLGAAWSPVMPT-PVAAPTLLAWAPDVAAMLGFDTAEVES 67
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
F F G AG P+A YGGHQFG WAGQLGDGRAI+LGE++ WELQLKG
Sbjct: 68 EGFAQVFGGNALYAGMQPWAANYGGHQFGHWAGQLGDGRAISLGELVAPDGRHWELQLKG 127
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG+P+
Sbjct: 128 AGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEDVVRDMFYDGHPR 187
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
EPGAIVCRV+ SFLRFGS+++ ASRG+ L +R L D I F +E + E+L
Sbjct: 188 AEPGAIVCRVSPSFLRFGSFELPASRGETAL--LRQLVDACIARDFPELE--GQGEAL-- 241
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
Y W ++A RTA ++A W VGF HGV+NTDN+S+LGLT+DYG
Sbjct: 242 ----------------YGDWFAQIAVRTAEMIAHWMRVGFVHGVMNTDNLSVLGLTLDYG 285
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLA---------AAKLID 458
P+G+++ FDP +TPNTTD GRRY F QP + WN+++ + L+ A L
Sbjct: 286 PYGWVEDFDPDWTPNTTDAQGRRYRFGTQPQVAYWNLSRLAQALSPLFADVAPLQAGLAA 345
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRALSN 518
+ R A LGL + +Q+ M +D T + AL
Sbjct: 346 YQSTFVACTRRDAAAKLGLAAADDDDLGLYQRWQQL-------MQDGGMDMTLAWHAL-- 396
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKALMNS 576
++ DP+ P+ + L AV D +++ + W+ Y L + +S ER A M
Sbjct: 397 MRVDPAAPDVGV---LDAVYYDESRQQAVQAPLQQWLQDYAARLQADPLSASERAAKMAK 453
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPG 635
NP YVLRN+L Q AID AE GD G V L ++ PY E+ G+E +A PAWA R G
Sbjct: 454 ANPLYVLRNWLAQEAIDRAEQGDLGGVHALQDVLRDPYTERAGLEHFAGKRPAWADNRAG 513
Query: 636 VCMLSCSS 643
MLSCSS
Sbjct: 514 CSMLSCSS 521
>gi|386718215|ref|YP_006184541.1| hypothetical protein SMD_1821 [Stenotrophomonas maltophilia D457]
gi|384077777|emb|CCH12366.1| Selenoprotein O and cysteine-containing homologs [Stenotrophomonas
maltophilia D457]
Length = 521
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 246/544 (45%), Positives = 324/544 (59%), Gaps = 39/544 (7%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+ + LPGDP + REVL A ++ V P+ V P L+AW+ VA+ L D E E
Sbjct: 9 DNRLLHTLPGDPESGPRRREVLGAAWSPVMPT-PVTAPTLLAWAPDVAEMLGFDTAEVES 67
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
F F G AG P+A YGGHQFG WAGQLGDGRAI+LGE++ + WELQLKG
Sbjct: 68 EGFAQVFGGNALYAGMQPWAANYGGHQFGHWAGQLGDGRAISLGELVAPDGQHWELQLKG 127
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG+P+
Sbjct: 128 AGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEDVVRDMFYDGHPR 187
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
EPGAIVCRV+ SFLRFGS+++ ASRG+ L ++ L D I F ++ + E+L
Sbjct: 188 AEPGAIVCRVSPSFLRFGSFELPASRGETAL--LQQLVDACIARDFPALQ--GQGEAL-- 241
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
Y W ++A RTA ++A W VGF HGV+NTDN+S+LG+T+DYG
Sbjct: 242 ----------------YGDWFAQIAVRTAEMIAHWMRVGFVHGVMNTDNLSVLGVTLDYG 285
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+G+++ FDP +TPNTTD GRRY F QP + WN+++ + L+ D +
Sbjct: 286 PYGWVEDFDPDWTPNTTDAQGRRYRFGTQPQVAYWNLSRLAQALSPL-FADVAPLQAGLA 344
Query: 468 RYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y + F+ + KLGL + Q+ + M +D T +RAL ++ DP+
Sbjct: 345 AYQSTFVACTRRDAAAKLGLAAADDDDLQLYQRWQQLMQEGAMDMTLAWRAL--MRIDPA 402
Query: 525 IPEDELLVPLKAVLLDIGKERKEAWIS----WVLSYIQELLSSGISDEERKALMNSVNPK 580
+ + L AV D + R++A + W+ Y L ++ ER+A M + NP
Sbjct: 403 AADATV---LDAVYYD--EARRQAVQAPLRHWLQDYAARLRRDPLAASERQAKMAAANPL 457
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCML 639
YVLRN+L Q AID AE GD G V L ++ PY E+ G+E +A PAWA R G ML
Sbjct: 458 YVLRNWLAQEAIDRAEQGDLGGVHALQDVLRDPYTERAGLEHFAGKRPAWADNRAGCSML 517
Query: 640 SCSS 643
SCSS
Sbjct: 518 SCSS 521
>gi|344207085|ref|YP_004792226.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
gi|343778447|gb|AEM51000.1| UPF0061 protein ydiU [Stenotrophomonas maltophilia JV3]
Length = 521
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 247/544 (45%), Positives = 323/544 (59%), Gaps = 39/544 (7%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+ + LPGDP + R+VL A ++ V P+ V P L+AWS +A L D + +
Sbjct: 9 DNRLLHTLPGDPESGPRRRDVLGAAWSPVMPT-PVAAPTLLAWSPELATLLGFDAADVDS 67
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
F F G AG P+A YGGHQFG WAGQLGDGRAI+LGE++ WELQLKG
Sbjct: 68 EGFAQVFGGNALYAGMQPWAANYGGHQFGHWAGQLGDGRAISLGELVAPDGRHWELQLKG 127
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG+P+
Sbjct: 128 AGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEDVVRDMFYDGHPR 187
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
EPGAIVCRV+ SFLRFGS+++ ASRG+ L ++ L D I F ++ + E+L
Sbjct: 188 AEPGAIVCRVSPSFLRFGSFELPASRGETAL--LQQLVDTCIVRDFPELQ--GQGEAL-- 241
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
Y W +VA RTA ++A W VGF HGV+NTDN+S+LGLT+DYG
Sbjct: 242 ----------------YGDWFAQVAVRTAEMIAHWMRVGFVHGVMNTDNLSVLGLTLDYG 285
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+G+++ FDP +TPNTTD GRRY F QP + WN+++ + L+ D +
Sbjct: 286 PYGWVEDFDPDWTPNTTDAQGRRYRFGTQPQVAYWNLSRLAQALSPL-FADVAPLQAGLA 344
Query: 468 RYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y + F+ + KLGL + Q+ + M +D T +RAL ++ DP+
Sbjct: 345 VYQSTFVACTRRDAAAKLGLAAADDDDLQLYQRWQQLMQEGAMDMTLAWRAL--MRIDPA 402
Query: 525 IPEDELLVPLKAVLLDIGKERKEAWIS----WVLSYIQELLSSGISDEERKALMNSVNPK 580
+ + L AV D + R++A + W+ Y L +S ER+A M + NP
Sbjct: 403 AADATV---LDAVYYD--EARRQAVQAPLQHWLQDYAARLRRDPLSASERQAKMAAANPL 457
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCML 639
YVLRN+L Q AID AE GD G V L ++ PY E+PG+E +A PAWA R G ML
Sbjct: 458 YVLRNWLAQEAIDRAEQGDLGGVHALQDVLRDPYTERPGLEHFANKRPAWADNRAGCSML 517
Query: 640 SCSS 643
SCSS
Sbjct: 518 SCSS 521
>gi|89890220|ref|ZP_01201730.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
gi|89517135|gb|EAS19792.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
Length = 529
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 236/562 (41%), Positives = 334/562 (59%), Gaps = 53/562 (9%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ +++ D+SF LP DP T++ R+V Y+ P E + Q++ S+ +A L
Sbjct: 1 MHNIHIDNSFTDALPQDPITENYTRQVTGTAYSLAQP-VEFKKSQVIHVSK-LARELGFT 58
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+E + F +G G PYA Y GHQFG WAGQLGDGRAI L E+++ +RW
Sbjct: 59 DEEVQSLAFKNVVTGREFPDGVAPYAMVYAGHQFGNWAGQLGDGRAINLFEMVH-NDQRW 117
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
LQLKGAG TPYSR DG AVLRSSIRE LCSEAMH LG+PTTR+L L +G+ V RDM
Sbjct: 118 ALQLKGAGPTPYSRNGDGFAVLRSSIREHLCSEAMHHLGVPTTRSLSLSLSGQQVLRDML 177
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG+ E GAIVCRVA SF+RFG++++ A++G + D+++ L DY I+ + I K
Sbjct: 178 YDGHAAHEKGAIVCRVAPSFIRFGNFELAAAQG--NTDVLKQLTDYTIKTFYSQITTTGK 235
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
L F EV +RT ++ WQ +GF HGV+NTDNMSILG
Sbjct: 236 EAYLQFFK--------------------EVTDRTLEMIIHWQRIGFVHGVMNTDNMSILG 275
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
LTIDYGP+G+L+ +D +TPNTTD +RY + QP+IGLWN+ Q + L +LIDD
Sbjct: 276 LTIDYGPYGWLEPYDHGWTPNTTDRQNKRYRYGAQPEIGLWNLLQLANAL--YELIDDGP 333
Query: 462 A-----NYVMERYGTKFMDEYQAIMTKKLGL--PKYN-KQIISKLLNNMAVDKVDYTNFF 513
A N E Y TK +D +M K+GL P+ N +++I+ L +++ + + D T FF
Sbjct: 334 ALEKILNSYKENYQTKHLD----MMRSKMGLSRPQENDRELIATLEHHLQLHETDMTIFF 389
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLD---IGKERKEAWISWVLSYIQELLS----SGIS 566
R L+ V DP + D+ + + D + + + +W+ W+ SY++ L SG+
Sbjct: 390 RELAQV--DPQMDTDKAFLHISMAFYDLENLSEPHQWSWLEWLESYLKRLQKEQDESGLD 447
Query: 567 D----EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK 622
+ ++ MN+VNPKYV RNY+ Q ID A+ GD+ + + ++++RPYDEQP +K
Sbjct: 448 GIAFAKAKQQQMNAVNPKYVFRNYIAQLIIDDADKGDYTLLNEVYRMLQRPYDEQPEFDK 507
Query: 623 YARLPPAWAY-RPGVCMLSCSS 643
+ L P WA + G MLSCSS
Sbjct: 508 WYDLRPDWARTKVGCSMLSCSS 529
>gi|116781106|gb|ABK21967.1| unknown [Picea sitchensis]
Length = 247
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 191/247 (77%), Positives = 221/247 (89%)
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MS+LGLTIDYGPFGFLDAFDP FTPNTTDLPGRRYCFANQPD+G+WN+AQ ++TL++A L
Sbjct: 1 MSVLGLTIDYGPFGFLDAFDPKFTPNTTDLPGRRYCFANQPDVGMWNVAQLASTLSSANL 60
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKVDYTNFFRAL 516
I+D EA Y MERYG KFM+EYQ+IMTKK+GL KYNK++ISKLL+NMA DKVDYT FFRAL
Sbjct: 61 INDDEAKYGMERYGAKFMEEYQSIMTKKIGLKKYNKELISKLLSNMAFDKVDYTIFFRAL 120
Query: 517 SNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNS 576
SN+K + + ED+LL PLK VLLDI KERK+AWI W+ YI EL +SGISDEERKA M+S
Sbjct: 121 SNIKTNTDLSEDKLLSPLKPVLLDISKERKKAWIDWIHQYIHELTTSGISDEERKASMDS 180
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGV 636
+NPK+VLRNYLCQ+AIDAAE GD+ EVRRLLK+M++PYDE PGMEKYARLPPAWAYRPGV
Sbjct: 181 INPKFVLRNYLCQTAIDAAEQGDYSEVRRLLKVMQKPYDEHPGMEKYARLPPAWAYRPGV 240
Query: 637 CMLSCSS 643
CMLSCSS
Sbjct: 241 CMLSCSS 247
>gi|346725286|ref|YP_004851955.1| hypothetical protein XACM_2396 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346650033|gb|AEO42657.1| hypothetical protein XACM_2396 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 557
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 250/552 (45%), Positives = 322/552 (58%), Gaps = 36/552 (6%)
Query: 98 KLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADS 157
+L + L++D+ ++LPGDP + REV A ++ V P+ V P L+A S +A
Sbjct: 36 RLAGMTHLHFDNRLRQQLPGDPEEGARRREV-GAAWSSVLPT-PVAAPYLIAHSAEMAQV 93
Query: 158 LELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLK 217
L L+ E F F G G P+A YGGHQFG WAGQLGDGRAI+LGE +
Sbjct: 94 LGLEAAEIASAQFAQVFGGNALYPGMQPWAVNYGGHQFGHWAGQLGDGRAISLGEAIGTD 153
Query: 218 SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
R+ELQLKGAG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V
Sbjct: 154 GGRYELQLKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEAVV 213
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDMFYDG+P+ EPGAIVCRVA SF+RFG++++ ++RG D+ +++ D+ I F +
Sbjct: 214 RDMFYDGHPQREPGAIVCRVAPSFIRFGNFELPSARG--DIALLKQWVDFTIARDFPALA 271
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ YA W +V ERTA +VA W VGF HGV+NTDNM
Sbjct: 272 GAGDA--------------------LYADWFAQVCERTAVMVAHWMRVGFVHGVMNTDNM 311
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLTIDYGP+G++D +DP +TPNTTD GRRY F QP + WN+ + + LA
Sbjct: 312 SILGLTIDYGPYGWVDDYDPDWTPNTTDAQGRRYRFGTQPQVAYWNLGRLAQALAPL-FA 370
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFR 514
D Y ++R+ ++ + KLGL + Q+I L M ++D T FR
Sbjct: 371 DQALLQYGLDRFRDTYLACDRRDTAAKLGLAECRDEDLQLIDALRALMRESEMDMTLTFR 430
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA--WISWVLSYIQELLSSGISDEERKA 572
L ++ PE L+ D K +A W+ Y L + EER+A
Sbjct: 431 GLIDLS-----PEHPDPAQLRDAFYDEDKRLADASQLQQWLQRYAARLQQDPLLPEERRA 485
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NP+YVLRNYL Q AID AE GD V+ LL++M RPYD+QPG + +A P WA
Sbjct: 486 RMRRANPRYVLRNYLAQQAIDRAEQGDPSGVQELLEVMCRPYDDQPGRDAFAARRPDWAR 545
Query: 633 -RPGVCMLSCSS 643
R G MLSCSS
Sbjct: 546 ARAGCSMLSCSS 557
>gi|194365405|ref|YP_002028015.1| hypothetical protein Smal_1627 [Stenotrophomonas maltophilia
R551-3]
gi|194348209|gb|ACF51332.1| protein of unknown function UPF0061 [Stenotrophomonas maltophilia
R551-3]
Length = 521
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 245/542 (45%), Positives = 316/542 (58%), Gaps = 35/542 (6%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+ + LPGDP + REVL A ++ V P+ V P L+AW+ VA+ L D E E
Sbjct: 9 DNRLLHMLPGDPESGPRRREVLGAAWSPVMPT-PVTAPTLLAWAPDVAEMLGFDTAEVES 67
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
F F G AG P+A YGGHQFG WAGQLGDGRAI+LGE++ WELQLKG
Sbjct: 68 EGFAQVFGGNALYAGMQPWAANYGGHQFGHWAGQLGDGRAISLGELVAPDGRHWELQLKG 127
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR ADG AVLRSSIREFLCSEAMH LG+PTTRAL LV TG+ V RDMFYDG+P+
Sbjct: 128 AGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGVPTTRALSLVGTGEDVMRDMFYDGHPR 187
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
EPGAIVCRV+ SFLRFGS+++ ASRG+ L ++ L D I F +E + E+L
Sbjct: 188 AEPGAIVCRVSPSFLRFGSFELPASRGETAL--LQQLVDACIARDFPELE--GEGETL-- 241
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
Y W ++A RTA ++A W VGF HGV+NTDN+S+LGLT+DYG
Sbjct: 242 ----------------YGDWFAQIAVRTAEMIAHWMRVGFVHGVMNTDNLSVLGLTLDYG 285
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+G+++ FDP +TPNTTD GRRY F QP + WN+++ + L+ D +
Sbjct: 286 PYGWVEDFDPDWTPNTTDAQGRRYRFGTQPQVAYWNLSRLAQALSPL-FADVAPLQAGLA 344
Query: 468 RYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y + F+ + KLGL + Q+ + M +D T +RAL +
Sbjct: 345 AYQSTFVACTRRDAAAKLGLAAADDDDLQLYLRWQQLMQDGAMDMTLAWRALMRLDP--- 401
Query: 525 IPEDELLVPLKAVLLDIGKER--KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
L AV D +++ + W+ Y L + +S ER A M + NP YV
Sbjct: 402 --AAPDAALLDAVYYDEARQQAVQAPLQHWLQDYAARLQADPLSASERTAKMAAANPLYV 459
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVCMLSC 641
LRN+L Q AID AE GD G V L ++ PY E+PG+E +A P+WA R G MLSC
Sbjct: 460 LRNWLAQEAIDRAEQGDLGGVHALQDVLRDPYTERPGLEHFAGKRPSWADNRAGCSMLSC 519
Query: 642 SS 643
SS
Sbjct: 520 SS 521
>gi|28199858|ref|NP_780172.1| hypothetical protein PD1992 [Xylella fastidiosa Temecula1]
gi|386083945|ref|YP_006000227.1| hypothetical protein XFLM_04465 [Xylella fastidiosa subsp.
fastidiosa GB514]
gi|33516998|sp|Q87A39.1|Y1992_XYLFT RecName: Full=UPF0061 protein PD_1992
gi|28057979|gb|AAO29821.1| conserved hypothetical protein [Xylella fastidiosa Temecula1]
gi|307578892|gb|ADN62861.1| hypothetical protein XFLM_04465 [Xylella fastidiosa subsp.
fastidiosa GB514]
Length = 519
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 243/544 (44%), Positives = 313/544 (57%), Gaps = 33/544 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +++ F+ LP DP R+VL A +++V+P+ V P L+A+S VA L D +E
Sbjct: 4 LRFNNRFIDVLPCDPEVSLRSRQVLEA-WSRVAPTP-VPMPCLLAYSSEVAAILNFDAEE 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F FSG G PYA YGGHQFG W GQLGDGR ITLGE+L +ELQ
Sbjct: 62 LVTPRFVEVFSGNALYTGMQPYAVNYGGHQFGQWVGQLGDGRVITLGELLGADGVYYELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LGIPTTRAL L+ TG V RDM YDG
Sbjct: 122 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGIPTTRALSLIATGDTVIRDMLYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P EP AIVCRVA SF+RFG++++ ASRG D+D++R L ++ I + H+ ++
Sbjct: 182 HPAPEPSAIVCRVAPSFIRFGTFELPASRG--DIDLLRRLVEFTIIRDYPHLHGAGET-- 237
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
YA W E+ RTA LVA W VGF HGV+NTDNMSILGLTI
Sbjct: 238 ------------------LYADWFAEICTRTAELVAHWMRVGFVHGVMNTDNMSILGLTI 279
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D D +TPN TD+ RRY F QP + WN+ + LA D
Sbjct: 280 DYGPYGWIDNNDLDWTPNVTDVQSRRYRFGAQPQVAYWNLGCLARALAPL-FSDAASLQA 338
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ER+ ++ + KLG + ++ L M ++D T F L++
Sbjct: 339 GLERFRATYLAAERRDAAAKLGFAACFDEDLELFDALRTCMHQAEMDMTLTFLGLADW-- 396
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWI-SWVLSYIQELLSSGISDEERKALMNSVNPK 580
+P++P D L + +A + ++ + + W+ Y L + ER M NP+
Sbjct: 397 EPNMP-DSLSLWAEAFYDPVKRDAQAPMLRDWLQRYAARLSVDPLPVAERHERMRLANPR 455
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCML 639
YVLRNYL Q AI+ AE GD E+ LL++M RPYD Q G E YA P WA R G ML
Sbjct: 456 YVLRNYLTQQAIECAEQGDLTELHALLEVMRRPYDFQLGREAYAMRRPEWARSRIGCSML 515
Query: 640 SCSS 643
SCSS
Sbjct: 516 SCSS 519
>gi|182682609|ref|YP_001830769.1| hypothetical protein XfasM23_2097 [Xylella fastidiosa M23]
gi|417557463|ref|ZP_12208500.1| hypothetical protein XFEB_00277 [Xylella fastidiosa EB92.1]
gi|182632719|gb|ACB93495.1| protein of unknown function UPF0061 [Xylella fastidiosa M23]
gi|338179958|gb|EGO82867.1| hypothetical protein XFEB_00277 [Xylella fastidiosa EB92.1]
Length = 525
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 243/544 (44%), Positives = 313/544 (57%), Gaps = 33/544 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +++ F+ LP DP R+VL A +++V+P+ V P L+A+S VA L D +E
Sbjct: 10 LRFNNRFIDVLPCDPEVSLRSRQVLEA-WSRVAPTP-VPMPCLLAYSSEVAAILNFDAEE 67
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F FSG G PYA YGGHQFG W GQLGDGR ITLGE+L +ELQ
Sbjct: 68 LVTPRFVEVFSGNALYTGMQPYAVNYGGHQFGQWVGQLGDGRVITLGELLGADGVYYELQ 127
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LGIPTTRAL L+ TG V RDM YDG
Sbjct: 128 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGIPTTRALSLIATGDTVIRDMLYDG 187
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P EP AIVCRVA SF+RFG++++ ASRG D+D++R L ++ I + H+ ++
Sbjct: 188 HPAPEPSAIVCRVAPSFIRFGTFELPASRG--DIDLLRRLVEFTIIRDYPHLHGAGET-- 243
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
YA W E+ RTA LVA W VGF HGV+NTDNMSILGLTI
Sbjct: 244 ------------------LYADWFAEICTRTAELVAHWMRVGFVHGVMNTDNMSILGLTI 285
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D D +TPN TD+ RRY F QP + WN+ + LA D
Sbjct: 286 DYGPYGWIDNNDLDWTPNVTDVQSRRYRFGAQPQVAYWNLGCLARALAPL-FSDAASLQA 344
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ER+ ++ + KLG + ++ L M ++D T F L++
Sbjct: 345 GLERFRATYLAAERRDAAAKLGFAACFDEDLELFDALRTCMHQAEMDMTLTFLGLADW-- 402
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWI-SWVLSYIQELLSSGISDEERKALMNSVNPK 580
+P++P D L + +A + ++ + + W+ Y L + ER M NP+
Sbjct: 403 EPNMP-DSLSLWAEAFYDPVKRDAQAPMLRDWLQRYAARLSVDPLPVAERHERMRLANPR 461
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCML 639
YVLRNYL Q AI+ AE GD E+ LL++M RPYD Q G E YA P WA R G ML
Sbjct: 462 YVLRNYLTQQAIECAEQGDLTELHALLEVMRRPYDFQLGREAYAMRRPEWARSRIGCSML 521
Query: 640 SCSS 643
SCSS
Sbjct: 522 SCSS 525
>gi|71730289|gb|EAO32373.1| Protein of unknown function UPF0061 [Xylella fastidiosa Ann-1]
Length = 525
Score = 417 bits (1071), Expect = e-113, Method: Compositional matrix adjust.
Identities = 243/544 (44%), Positives = 312/544 (57%), Gaps = 33/544 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +++ F+ LP DP R+VL A +++V P+ V P L+A+S VA L D +E
Sbjct: 10 LRFNNRFIDVLPCDPEVSLRSRQVLEA-WSRVEPTP-VPMPCLLAYSSEVAAILNFDAEE 67
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F FSG G PYA YGGHQFG W GQLGDGR ITLGE+L +ELQ
Sbjct: 68 LVTPRFVEVFSGNALYPGMQPYAVNYGGHQFGQWVGQLGDGRVITLGELLGADGVYYELQ 127
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LGIPTTRAL L+ TG V RDM YDG
Sbjct: 128 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGIPTTRALSLIATGDTVIRDMLYDG 187
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P EP AIVCRVA SF+RFG++++ ASRG D+D++R L ++ I + H+ ++
Sbjct: 188 HPAPEPSAIVCRVAPSFIRFGTFELPASRG--DIDLLRRLVEFTIMRDYPHLHGAGET-- 243
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
YA W E+ RTA LVA W VGF HGV+NTDNMSILGLTI
Sbjct: 244 ------------------LYADWFAEICTRTAELVAHWMRVGFVHGVMNTDNMSILGLTI 285
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D D +TPN TD+ RRY F QP + WN+ + LA D
Sbjct: 286 DYGPYGWIDNNDLDWTPNVTDVQSRRYRFGAQPQVAYWNLGCLARALAPL-FSDAASLQA 344
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ER+ ++ + KLG + ++ L M ++D T F L++
Sbjct: 345 GLERFRATYLAAERRDAAAKLGFAACFDEDLELFDALRTCMHQAEMDMTLTFLGLAD--W 402
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWI-SWVLSYIQELLSSGISDEERKALMNSVNPK 580
+P++P D L + +A + ++ + + W+ Y L + ER M NP+
Sbjct: 403 EPNMP-DSLSLWAEAFYDPVKRDAQAPMLRDWLQRYAARLSVDPLPVAERHERMRLANPR 461
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCML 639
YVLRNYL Q AI+ AE GD E+ LL++M RPYD Q G E YA P WA R G ML
Sbjct: 462 YVLRNYLTQQAIECAEQGDLTELHALLEVMRRPYDFQLGREVYAMRRPEWARSRIGCSML 521
Query: 640 SCSS 643
SCSS
Sbjct: 522 SCSS 525
>gi|291336343|gb|ADD95902.1| hypothetical protein PM8797T_16308 [uncultured organism
MedDCM-OCT-S01-C5]
Length = 456
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 236/497 (47%), Positives = 303/497 (60%), Gaps = 48/497 (9%)
Query: 154 VADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
+ + L L P E + G P+AG PYAQ YGGHQFG WAGQLGDGRAITLGE+
Sbjct: 1 MGEELNLTPTE----ETGEVLGGGAPVAGMKPYAQRYGGHQFGNWAGQLGDGRAITLGEV 56
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
++ ELQLKGAG+TPYSR ADG AVLRSSIRE+LCSEAMH LG+PTTRAL LVTTG
Sbjct: 57 -ETENGFLELQLKGAGRTPYSRTADGKAVLRSSIREYLCSEAMHHLGVPTTRALSLVTTG 115
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHF 333
+ + RD+ Y+GNP EPGA+VCRVA SF+RFGS+QIH S G +RTL D+ +RHHF
Sbjct: 116 EAIMRDVLYNGNPAPEPGAVVCRVAPSFIRFGSFQIHMSDGHH--QTLRTLLDHTVRHHF 173
Query: 334 RHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLN 393
DH V T + AW EVAE TA+++A W VGF HGV+N
Sbjct: 174 ------------------PDHDVS--TDDGIIAWLSEVAETTATMIAHWMRVGFVHGVMN 213
Query: 394 TDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAA 453
TDNMSI GLTIDYGP+G+L+ FD +TPNTTD RRY + NQP IG WN+A+ ++
Sbjct: 214 TDNMSIHGLTIDYGPYGWLEPFDVDWTPNTTDAGRRRYRYGNQPHIGAWNVARLLESM-- 271
Query: 454 AKLIDD-KEANYVMERYGTKFMDEYQAIMTKKLG---LPKYNKQIISKLLNNMAVDKVDY 509
A L+DD V++ Y M+ KLG L + ++ +++ LL + +VD
Sbjct: 272 APLLDDVARLQPVLDHYMEYAMNAQSETWADKLGLGVLQESDEPLVNDLLTLLGATEVDM 331
Query: 510 TNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE 569
T FFR L ++ P I L + + + AW +W+ + + + +E
Sbjct: 332 TIFFRLLCSI-TQPDITH------LSDAFYEGDEPSETAWNAWLGRWWER-----VEEEP 379
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEVRRLLKLMERPYDEQPGM-EKYARLP 627
+ M NPKYVLRN++ Q AID+A E GDF L +L++RPYDEQP EK+ +
Sbjct: 380 DRDTMRKTNPKYVLRNWMAQLAIDSAEEHGDFSIAEELHELLKRPYDEQPEHEEKWFQKR 439
Query: 628 PAWA-YRPGVCMLSCSS 643
P WA +R G MLSCSS
Sbjct: 440 PEWARHRVGCSMLSCSS 456
>gi|443244460|ref|YP_007377685.1| UPF0061 protein [Nonlabens dokdonensis DSW-6]
gi|442801859|gb|AGC77664.1| UPF0061 protein [Nonlabens dokdonensis DSW-6]
Length = 565
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 230/566 (40%), Positives = 327/566 (57%), Gaps = 41/566 (7%)
Query: 92 ESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWS 151
+S+++ ++ L+ ++SF LP DP ++ R+V Y++ +P L+ S
Sbjct: 27 DSRLSITFASMHKLHINNSFTNALPEDPIKENFTRQVTGVAYSQATPLT-FRKASLIHVS 85
Query: 152 ESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLG 211
E +A L D +E +F F+G YA Y GHQFG WAGQLGDGRAI L
Sbjct: 86 E-LAKELGFDQEEIASAEFLQLFTGQVLYPKTQSYAMAYAGHQFGNWAGQLGDGRAINLF 144
Query: 212 EILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVT 271
EI+ + RW QLKGAG TPYSR DGLAVLRSSIRE LCSEAMH LGIPTTR+L L
Sbjct: 145 EIVE-NNNRWAFQLKGAGPTPYSRRGDGLAVLRSSIREHLCSEAMHHLGIPTTRSLSLSL 203
Query: 272 TGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRH 331
+G+ V RDM Y+GN E GAIVCRVA SF+RFG++++ A++G+++L ++ L DY I
Sbjct: 204 SGEEVLRDMMYNGNAAHEKGAIVCRVAPSFIRFGNFELAAAQGEKEL--LKKLTDYTIST 261
Query: 332 HFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGV 391
+++I K + F EV +RT ++ WQ VGF HGV
Sbjct: 262 FYKNITTSGKEAYIQFFQ--------------------EVTDRTLEMIMHWQRVGFVHGV 301
Query: 392 LNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL 451
+NTDNMSILGLTIDYGP+G+L+ +D +TPNTTD +RY + QP+IGLWN+ Q + L
Sbjct: 302 MNTDNMSILGLTIDYGPYGWLEPYDHGWTPNTTDRQNKRYRYGAQPEIGLWNLLQLANAL 361
Query: 452 AAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKV 507
LI+D +++ Y T + +Y M KLG+ K ++ +I +L + + +
Sbjct: 362 FP--LIEDAAPLQEILDSYRTNYQVQYLETMMNKLGIYHTHKDDRDLIQQLEEILHLHET 419
Query: 508 DYTNFFRALSNVKADPSIPEDELLVPLKAVLLD-IGKERKEAWISWVLSYIQEL-----L 561
D T F+R LS + + + ++ + LD + ++ W+ W+ SYI L +
Sbjct: 420 DMTIFYRELSKINSKTDKIDAFEVISIAFYHLDQLSDAHRKEWLDWLESYILRLELDVKM 479
Query: 562 SSG---ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQP 618
+G + R MN+ NPKYVLRNY+ Q ID A+ GD+ + + ++++PYDEQP
Sbjct: 480 EAGDIITFAKARIQKMNATNPKYVLRNYIAQLVIDDADKGDYSLLNEIYTMLQKPYDEQP 539
Query: 619 GMEKYARLPPAWAY-RPGVCMLSCSS 643
EK+ L P WA + G MLSCSS
Sbjct: 540 EFEKWYALRPEWARSKVGCSMLSCSS 565
>gi|374594854|ref|ZP_09667858.1| UPF0061 protein ydiU [Gillisia limnaea DSM 15749]
gi|373869493|gb|EHQ01491.1| UPF0061 protein ydiU [Gillisia limnaea DSM 15749]
Length = 516
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 243/550 (44%), Positives = 324/550 (58%), Gaps = 44/550 (8%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ D + + F PGD D PR+ Y+K P+ +V +P+L+A++E +A + +D
Sbjct: 3 ITDKKFTNLFTSAFPGDNSGDLSPRQTPGVLYSKAIPT-KVSDPKLLAFTEELAAEMGMD 61
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
E D + +G PYA CY GHQFG WAGQLGDGRAITLGE + W
Sbjct: 62 SPGAE--DLKIL-AGNKVTETMQPYAACYAGHQFGNWAGQLGDGRAITLGEWEH-NGGSW 117
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
E+QLKGAG T YSR ADG AVLRSS+RE+L SEAM LG+PTTRAL LVTTG + RDMF
Sbjct: 118 EMQLKGAGPTAYSRMADGRAVLRSSVREYLMSEAMFHLGVPTTRALSLVTTGDKILRDMF 177
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
Y+GN EPGAIV RV++SFLRFG+++I A+R +++ ++ L D+ I HF H +K
Sbjct: 178 YNGNAAYEPGAIVMRVSESFLRFGNFEILAARKEKE--NLQHLVDWTIEKHFPH----HK 231
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
E N+ W EV ++TA+L+ +W VGF HGV+NTDNMSILG
Sbjct: 232 GE------------------NRIINWFREVIDKTAALMVEWHRVGFVHGVMNTDNMSILG 273
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
TIDYGPF FLD +DPSFTPNTTDLPGRRY F NQP I LWN+++ +T L L D E
Sbjct: 274 QTIDYGPFSFLDDYDPSFTPNTTDLPGRRYAFGNQPSIALWNLSRLATALTP--LFKDTE 331
Query: 462 -ANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALS 517
+ Y F + Y +M KLGL K +K++IS+L +A K D T +R L
Sbjct: 332 LLEEALNSYEDNFWNRYYEMMGNKLGLDKITAEDKKMISQLEELLAKVKPDMTILYRLLI 391
Query: 518 NVKADPSIPE--DELLVPLK-AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
++ PSI D L + LK A + E K ++ ++SY + + IS E +M
Sbjct: 392 DL---PSISAEGDMLFIYLKPAFYTEPSGELKVEFLKLIISYAERRKKNSISTEASAEIM 448
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YR 633
NP+++LRNYL AI+ E+G+ +L +++PY E E + P WA +
Sbjct: 449 KKTNPRFILRNYLLHQAIEELEMGERSLFDKLRAALKQPYTEDD--EDLLKKRPDWATQK 506
Query: 634 PGVCMLSCSS 643
PG MLSCSS
Sbjct: 507 PGCSMLSCSS 516
>gi|15839208|ref|NP_299896.1| hypothetical protein XF2619 [Xylella fastidiosa 9a5c]
gi|33517142|sp|Q9PA99.1|Y2619_XYLFA RecName: Full=UPF0061 protein XF_2619
gi|9107844|gb|AAF85416.1|AE004068_12 conserved hypothetical protein [Xylella fastidiosa 9a5c]
Length = 519
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 242/544 (44%), Positives = 310/544 (56%), Gaps = 33/544 (6%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +++ F+ LP DP R+VL A ++ V+P+ V P L+A+S VA L D +E
Sbjct: 4 LRFNNRFIAVLPCDPEVSLRSRQVLEA-WSGVAPT-PVPVPCLLAYSSEVAAILNFDAEE 61
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F FSG G PYA YGGHQFG W GQLGDGR ITLGE+L +ELQ
Sbjct: 62 LVTPRFVEVFSGNALYPGMQPYAVNYGGHQFGQWVGQLGDGRVITLGELLGADGVYYELQ 121
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LGIPTTRAL L+ TG V RDM YDG
Sbjct: 122 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGIPTTRALSLIATGDTVIRDMLYDG 181
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P EP AIVCRVA SF+RFG++++ ASRG D+D++R L ++ I + H+ ++
Sbjct: 182 HPAPEPSAIVCRVAPSFVRFGTFELPASRG--DIDLLRRLVEFTIMRDYPHLHGAGET-- 237
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
Y W E+ RTA LVA W VGF HGV+NTDNMSILGLTI
Sbjct: 238 ------------------LYVDWFAEICTRTAELVAHWMRVGFVHGVMNTDNMSILGLTI 279
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D D +TPN TD RRY F QP + WN+ + LA D
Sbjct: 280 DYGPYGWIDNNDLDWTPNVTDAQSRRYRFGAQPQVAYWNLGCLARALAPL-FSDAASLQA 338
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ER+ ++ + KLG + ++ L M ++D T F L++
Sbjct: 339 GLERFRATYLAAERRDAAAKLGFAACFDEDLELFDALRTCMHQAEMDMTLTFLGLADW-- 396
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWI-SWVLSYIQELLSSGISDEERKALMNSVNPK 580
+P++P D L + +A + ++ + + W+ Y L + ER M NP+
Sbjct: 397 EPNMP-DSLSLWAEAFYDPVKRDAQAPMLRDWLQRYAARLSVDPLPVAERHERMRLANPR 455
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCML 639
YVLRNYL Q AI+ AE GD E+ LL++M RPYD Q G E YA P WA R G ML
Sbjct: 456 YVLRNYLTQQAIECAEQGDLIELHALLEVMRRPYDFQLGREAYAMRRPEWARSRIGCSML 515
Query: 640 SCSS 643
SCSS
Sbjct: 516 SCSS 519
>gi|71275238|ref|ZP_00651525.1| Protein of unknown function UPF0061 [Xylella fastidiosa Dixon]
gi|170731235|ref|YP_001776668.1| hypothetical protein Xfasm12_2185 [Xylella fastidiosa M12]
gi|71164047|gb|EAO13762.1| Protein of unknown function UPF0061 [Xylella fastidiosa Dixon]
gi|71730670|gb|EAO32745.1| Protein of unknown function UPF0061 [Xylella fastidiosa Ann-1]
gi|167966028|gb|ACA13038.1| conserved hypothetical protein [Xylella fastidiosa M12]
Length = 525
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 242/547 (44%), Positives = 303/547 (55%), Gaps = 39/547 (7%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +++ F+ LP DP R+VL A ++ V+P+ V P L+A+S VA L D +E
Sbjct: 10 LRFNNRFIDVLPCDPEVSLRSRQVLEA-WSGVAPT-PVPVPCLLAYSSEVAAILNFDAEE 67
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
P F FSG G PYA YGGHQFG W GQLGDGR ITLGE+L +ELQ
Sbjct: 68 LVTPRFVEVFSGNALYPGMQPYAVNYGGHQFGQWVGQLGDGRVITLGELLGADGVYYELQ 127
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR ADG AVLRSSIREFLCSEAMH LGIPTTRAL L+ TG V RDM YDG
Sbjct: 128 LKGAGPTPYSRGADGRAVLRSSIREFLCSEAMHHLGIPTTRALSLIATGDTVIRDMLYDG 187
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+P EP AIVCRVA SF+RFG++++ ASRG D+D++R L ++ I + H+ ++
Sbjct: 188 HPAPEPSAIVCRVAPSFIRFGTFELPASRG--DIDLLRRLVEFTIMRDYPHLHGAGET-- 243
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
YA W E+ RTA LVA W VGF HGV+NTDNMSILGLTI
Sbjct: 244 ------------------LYADWFAEICTRTAELVAHWMRVGFVHGVMNTDNMSILGLTI 285
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+G++D D +TPN TD+ RRY F QP + WN+ + LA D
Sbjct: 286 DYGPYGWIDNNDLDWTPNVTDVQSRRYRFGAQPQVAYWNLGCLARALAPL-FSDAASLQA 344
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALS---- 517
+ER+ ++ + KLG + + L M ++D T F L+
Sbjct: 345 GLERFRATYLAAERRDAAAKLGFAACFDEDLALFDALRTCMHQAEMDMTLTFLGLADWEP 404
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
N+ S+ D P+K + W+ Y L + ER M
Sbjct: 405 NMLDSLSLWADAFYDPVKR------DAQAPMLRDWLQRYAARLSVDPLPVAERHERMRLA 458
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGV 636
NP+YVLRNYL Q AI+ AE GD E+ LL++M RPYD Q G E Y P WA R G
Sbjct: 459 NPRYVLRNYLTQQAIECAEQGDLTELHALLEVMRRPYDFQLGREAYGMRRPEWARSRIGC 518
Query: 637 CMLSCSS 643
MLSCSS
Sbjct: 519 SMLSCSS 525
>gi|383315869|ref|YP_005376711.1| hypothetical protein [Frateuria aurantia DSM 6220]
gi|379042973|gb|AFC85029.1| hypothetical protein Fraau_0547 [Frateuria aurantia DSM 6220]
Length = 518
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 243/550 (44%), Positives = 320/550 (58%), Gaps = 40/550 (7%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+ L +D+ ++RELP DP + PREV A Y++V P+ V+ P+ +A S A L LD
Sbjct: 1 MSRLEFDNRWLRELPADPLAELAPREVAGAMYSRVQPT-RVQAPRWLAASADAAALLGLD 59
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ P++ SG L+G P+A YGGHQFG WAGQLGDGRAI+LGE + RW
Sbjct: 60 LAALQTPEWLQALSGNALLSGMEPWASNYGGHQFGHWAGQLGDGRAISLGEAVVADGRRW 119
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG TPYSR ADG AVLRSSIREF+CSEAM LG+PTTRAL LV + V RDMF
Sbjct: 120 ELQLKGAGPTPYSRSADGRAVLRSSIREFICSEAMQHLGVPTTRALSLVGSTDSVWRDMF 179
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNK 341
YDG + EP AIVCR+A SF+RFG +++ ASRG D +VR LAD+ I F + +
Sbjct: 180 YDGRAQREPLAIVCRMAPSFVRFGHFELPASRG--DTALVRQLADFVIDRDFPELSGHGE 237
Query: 342 SESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILG 401
+ +YAAW + RTA +V WQ VGF HGV+NTDNMSILG
Sbjct: 238 A--------------------RYAAWFETICRRTAVMVMHWQRVGFVHGVMNTDNMSILG 277
Query: 402 LTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE 461
L++DYGP+G+++ FDP +TPNTTD RRY + QP + WN+ + + LA+ L D
Sbjct: 278 LSLDYGPYGWMEPFDPRWTPNTTDAGQRRYRYEQQPAVAYWNLGRLAGALAS--LFGDMA 335
Query: 462 ANYVMERYGTKFMDEY----QAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFR 514
++ F+DE+ +A + KLGL + +++++LL M ++D T FR
Sbjct: 336 P---LQAALDAFVDEWRLQERANIRAKLGLEHDRDDDAELMAELLQVMEAARLDMTLLFR 392
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
LS + DP++ D LL A D + +W+ Y Q L R M
Sbjct: 393 LLS--RHDPAM--DSLLHFSPAFYADAPADAMARLSTWLARYRQRLADETRPQAARWQAM 448
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-R 633
NP Y+ RNYL Q I+ AE GD + LL ++ +PY EQPG E +A P WA R
Sbjct: 449 QQANPCYIPRNYLVQQVIEQAEAGDSSGIGDLLDVLRQPYVEQPGREAWAARRPDWAASR 508
Query: 634 PGVCMLSCSS 643
G MLSCSS
Sbjct: 509 EGCGMLSCSS 518
>gi|347756644|ref|YP_004864207.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
B]
gi|347589161|gb|AEP13690.1| Uncharacterized conserved protein [Candidatus Chloracidobacterium
thermophilum B]
Length = 493
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 242/551 (43%), Positives = 329/551 (59%), Gaps = 67/551 (12%)
Query: 100 KALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
+ LE L +D+++ LP D Y++V+P+ + +LVA++ A L+
Sbjct: 3 RTLETLVFDNTYT-TLPED-------------YYSRVAPTP-LRGARLVAFNPEAAALLD 47
Query: 160 LDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE 219
LDP E RPDF +F+G L GA P A Y GHQFG++ QLGDGRA+ LGE+ N + E
Sbjct: 48 LDPSEAARPDFVAYFNGEKALPGAEPLAALYAGHQFGVYVPQLGDGRALLLGEVRNARGE 107
Query: 220 RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRD 279
RW+LQ+KG+G+TPYSR DG AVLRS+IRE+L SEAMH LGIPTTRALC++ + + V R+
Sbjct: 108 RWDLQVKGSGRTPYSRMGDGRAVLRSTIREYLGSEAMHALGIPTTRALCIIGSDEPVYRE 167
Query: 280 MFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENM 339
E GA++ R+A + +RFGS+++ R + L V LADY I F ++ +
Sbjct: 168 TV-------ERGALLVRLAPTHVRFGSFEVFFHRRR--LADVARLADYVIGQFFPELQAL 218
Query: 340 NKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSI 399
G+ED ++AA+ EV RTA LVAQWQ VGF HGVLNTDNMSI
Sbjct: 219 ----------GEED---------RFAAFLQEVVNRTARLVAQWQAVGFAHGVLNTDNMSI 259
Query: 400 LGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTT----LAAAK 455
LGLT+DYGPFGFLD +DP F N +D+ G RY F QP I LWN+ + T + +
Sbjct: 260 LGLTLDYGPFGFLDDYDPHFICNHSDVTG-RYAFNQQPGIALWNLRCLAQTFLPWVPRER 318
Query: 456 LIDDKEANYVMERYGTKFMDEYQAIMTKKLGL--PK-YNKQIISKLLNNMAVDKVDYTNF 512
L+D A + F DEY+ +M KLGL P+ + ++++ L +A ++ DYT
Sbjct: 319 LVDSLNA------FRDVFFDEYERLMFAKLGLHHPQPGDAELLADWLELLAQNRADYTLA 372
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKA 572
FR L+ ++PED + P A L D+ +R EA +W+ Y L G+ ER+A
Sbjct: 373 FRRLAE-----TVPEDPVH-PANARLQDLFVDR-EAVAAWLTKYGCRLAQEGVPSSERQA 425
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M SVNPKY+LRNYL Q AI+ AE GDF E+ RLL ++ +PY EQP +YA PP W
Sbjct: 426 RMRSVNPKYILRNYLAQIAIERAEEGDFSEIERLLTVLRQPYAEQPEAARYAEPPPDWGR 485
Query: 633 RPGVCMLSCSS 643
R +SCSS
Sbjct: 486 R---LEISCSS 493
>gi|374287709|ref|YP_005034794.1| hypothetical protein BMS_0937 [Bacteriovorax marinus SJ]
gi|301166250|emb|CBW25825.1| conserved hypothetical protein [Bacteriovorax marinus SJ]
Length = 523
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 225/553 (40%), Positives = 325/553 (58%), Gaps = 41/553 (7%)
Query: 100 KALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
+ L++L ++++FV G+ + P E L + YT+ P+ V P+L+A+S +A ++
Sbjct: 3 RKLDELEFENNFVNNFKGNDQVSRTPSETLDSLYTRAMPTP-VSGPRLIAYSSELASAMG 61
Query: 160 LDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE 219
+D R + SG +PYA CYGG QFG WA QLGDGRAITLGEI + ++
Sbjct: 62 IDQGAETRESVEIL-SGNRVNRTMIPYAACYGGFQFGHWANQLGDGRAITLGEI-SKGNQ 119
Query: 220 RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRD 279
+ELQLKGAG+T YSR DG AVLRSS+REFL SEAM +LG+PTTRAL LV TG V RD
Sbjct: 120 IFELQLKGAGQTAYSRRGDGRAVLRSSVREFLMSEAMFYLGVPTTRALSLVDTGDKVLRD 179
Query: 280 MFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENM 339
MFYDGN + E GAIV RVA SFLRFG++QI +RG+ + + L +++++ + I+
Sbjct: 180 MFYDGNSEYENGAIVSRVAPSFLRFGNFQILYARGE--VSNLEDLLNWSVQKFYPEIKEQ 237
Query: 340 NKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSI 399
+ +SF EV++RT+ ++++W VGF HGV+NTDNMSI
Sbjct: 238 GDQKIISFFR--------------------EVSKRTSRMISEWMRVGFVHGVMNTDNMSI 277
Query: 400 LGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL----AAAK 455
LGLTIDYGPF FLD FDP+FTPNTTDLPGRRY FA QP I LWN+ +F+ +L
Sbjct: 278 LGLTIDYGPFSFLDNFDPNFTPNTTDLPGRRYAFAKQPSIALWNLQRFAESLMPLMQETN 337
Query: 456 LIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAV----DKVDYTN 511
L++D+ +N+ E Y T +Y +M++K GL + + L+ M KVD T
Sbjct: 338 LLEDEVSNF-KEYYTT----DYYQMMSRKYGLSNLKTEEGEEFLDQMRSLLYDCKVDMTL 392
Query: 512 FFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
FF+ L ++ + E+ + + ++ + + + + + Y L ++ E +
Sbjct: 393 FFQYLIDLARGEASREEVMNHFNECFYRELSESEQREFYNLIKVYKSFLEKDSLTTSESR 452
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
+M+ NP+++LRNYL Q A + E GD L ++ PY + G +++ P WA
Sbjct: 453 QIMSEANPRFILRNYLLQKASEELEAGDDTLFNELFTALKNPYSK--GSDRFFCKRPKWA 510
Query: 632 -YRPGVCMLSCSS 643
+ G MLSCSS
Sbjct: 511 ENKAGSSMLSCSS 523
>gi|394988292|ref|ZP_10381130.1| hypothetical protein SCD_00694 [Sulfuricella denitrificans skB26]
gi|393792750|dbj|GAB70769.1| hypothetical protein SCD_00694 [Sulfuricella denitrificans skB26]
Length = 489
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 240/553 (43%), Positives = 319/553 (57%), Gaps = 72/553 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ L+ LN+ ++F R LP E H +++ P+ E P LV+++ + A+ +
Sbjct: 1 MMKLDQLNFQNTFAR-LP----------ETFH---SRLHPTPLPE-PYLVSFNANAAELI 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E DF +F G L G+ P A Y GHQFG + QLGDGRAI LGE+ N
Sbjct: 46 DLDPDEVMCADFAEYFIGNRLLPGSDPLAMLYAGHQFGHFVPQLGDGRAILLGEVKNRAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+LQLKGAG TP+SR DG AVLRSSIRE+LCSEAMH LGIPTTRALC+V + + + R
Sbjct: 106 EHWDLQLKGAGATPFSRSGDGRAVLRSSIREYLCSEAMHGLGIPTTRALCIVGSDEEIWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A+V R+A S +RFGS+++ R Q + IVR LADY I HF + +
Sbjct: 166 ETV-------ESAAVVTRIAPSHVRFGSFEVFFYRDQPE-PIVR-LADYVIDKHFPELAD 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+KY + EV RTA L+A+WQ VGF+HGV+NTDNMS
Sbjct: 217 ---------------------APDKYPRFLNEVVIRTARLMAKWQAVGFSHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILGLT DYGPFGF+DA++P + N +D G RY F QP IGLWN+ + L +I
Sbjct: 256 ILGLTFDYGPFGFMDAYNPGYVCNHSDH-GGRYAFDRQPQIGLWNLTCLAQAL--TPIIP 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRA 515
+EA V+ YG + + Y +M +KLGL + +I LL M ++VDYTN FR+
Sbjct: 313 VEEARAVLGHYGPTYAEHYVDLMGQKLGLTHAGQDDVPLIEALLGLMHANQVDYTNLFRS 372
Query: 516 LSNVKADP----SIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
L + K++ S+ D+ + + A+ +W +Y L + +DEERK
Sbjct: 373 LGHFKSEAGEQNSVVRDQFI-------------DRPAFDAWAETYRARLQNEPGTDEERK 419
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELG-DFGEVRRLLKLMERPYDEQPGMEKYARLPPAW 630
M+ VNPKY+LRNYL Q AI+ AE DF EV RLLKL+ P+DEQP M YA PP W
Sbjct: 420 VRMDKVNPKYILRNYLAQVAIEKAEKERDFSEVDRLLKLLGCPFDEQPEMANYAAPPPDW 479
Query: 631 AYRPGVCMLSCSS 643
A V SCSS
Sbjct: 480 AQHISV---SCSS 489
>gi|156359336|ref|XP_001624726.1| predicted protein [Nematostella vectensis]
gi|156211523|gb|EDO32626.1| predicted protein [Nematostella vectensis]
Length = 522
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 221/544 (40%), Positives = 320/544 (58%), Gaps = 49/544 (9%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWS-ESVADSLELDPKEF---ERPDF 170
P DP T + R+V ++ V P+ P LVA S E +AD L+++P+ R F
Sbjct: 13 FPIDPETRNYVRQVRRYVFSYVKPTPLRARPSLVAVSSEVLADILDINPESVTMESRDRF 72
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
SG + +VP A YGGHQFG W+GQLGDGRA+ LGE +N K ERWELQLKG+GK
Sbjct: 73 VRLVSGTEVASQSVPLAHRYGGHQFGDWSGQLGDGRAVMLGEYVNSKGERWELQLKGSGK 132
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSR DG AV RSS+REFL SEAMH+LG+PT+R LV + + V RD FYDG+P E
Sbjct: 133 TPYSRHGDGRAVFRSSVREFLASEAMHYLGVPTSRVASLVVSDEQVWRDQFYDGHPIREK 192
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
A+V R+A+S+ R GS +I + G+ DL +R + D+ I HF I++
Sbjct: 193 AAVVLRLAKSWFRIGSLEILTNNGETDL--LRKVVDFVIEQHFNKIKD------------ 238
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
+ KY + +V +TA ++A WQ +GF HGV NTDN S+L +TIDYGPFG
Sbjct: 239 ---------SKEKYLEFFSQVVTKTAHMIAIWQALGFAHGVCNTDNFSLLSMTIDYGPFG 289
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQ----FSTTLAAAKLIDDKEANYVM 466
F+D ++ F PNT+D G RY F+NQP G +N+A+ S + A+ + K+ ++
Sbjct: 290 FMDTYNSDFVPNTSDDEG-RYSFSNQPSAGQYNLAKLLDALSPIIDLARYLAGKK---IL 345
Query: 467 ERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADP 523
+RY +F + + + +KLGL + +I L M + D+T FR L N+
Sbjct: 346 QRYAAEFNNCFMDLHRQKLGLVGRRDEDDMLIKSFLQIMESSQADFTMTFRQLGNLTLGH 405
Query: 524 SIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG--ISDEERKALMNSVNPKY 581
++ ++P A L+ K++K W W+ Y + L +G +DE+R+ M++VNP+Y
Sbjct: 406 I---EQGVIPPGAWALEKLKQQKN-WRDWLGRYQERLGRNGGHDTDEKRRIRMHAVNPRY 461
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK--YARLPPAWAYRPGVCML 639
VLRN++ Q+AID A GD+ E+R LL +++RP++ Q E+ YA PP W+ + +
Sbjct: 462 VLRNWMAQTAIDKANRGDYTEIRHLLDVLQRPFNYQESAERAGYAAPPPPWSTK---LRV 518
Query: 640 SCSS 643
SCSS
Sbjct: 519 SCSS 522
>gi|340370931|ref|XP_003383999.1| PREDICTED: selenoprotein O-like [Amphimedon queenslandica]
Length = 615
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 232/603 (38%), Positives = 328/603 (54%), Gaps = 108/603 (17%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
+LE L +D+ ++ LP D ++ R V ACY+ V+P+ V+NPQLV+ S + L L
Sbjct: 2 SLESLQFDNRVLKSLPVDEEKENYVRSVSGACYSLVNPTP-VKNPQLVSASADALNLLGL 60
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
D KE +RP+F +FSG + G+ P A CY GHQFG ++GQLGDG A+ LGE++N ER
Sbjct: 61 DIKEIQRPEFIEYFSGNKVIPGSEPAAHCYCGHQFGHFSGQLGDGCALYLGEVINSNGER 120
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
WELQLKG+GKTPYSR ADG VLRSSIREFLCSEAMH+LGIPTTRA +T+ V RD+
Sbjct: 121 WELQLKGSGKTPYSRHADGRKVLRSSIREFLCSEAMHYLGIPTTRAGSCITSESLVARDI 180
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR-----------GQEDLDIVRTLADYAI 329
FY+GN +E ++ R+A +F+RFGS++I +R G++ DI L DY
Sbjct: 181 FYNGNVIQEQATVISRIAPTFIRFGSFEIFKTRDATTGRIGPSVGRD--DIFHLLLDYVT 238
Query: 330 RHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTH 389
H + I S +D + A + E+ T LVA WQ VGF H
Sbjct: 239 EHFYPEIYK----------------SHLDDIEARTAGFFNEICRLTGRLVAMWQCVGFCH 282
Query: 390 GVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFST 449
GVLNTDNMSI+G+TIDYGPFGFLD +DP+ N +D G RY F+ QP + WN+ + S
Sbjct: 283 GVLNTDNMSIVGVTIDYGPFGFLDRYDPAHICNKSD-DGGRYAFSKQPSVCKWNLRKLSE 341
Query: 450 TLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVD 505
L+ + ++A+ +E Y +F Y + + +KLGL + ++ + L+ +
Sbjct: 342 ALSPC--LSTEKADEGLELYEMEFQQTYLSKIREKLGLVNKAFPEDSDLVEQFLDTLHET 399
Query: 506 KVDYTNFFRALSNVK----ADPSIPE-------------DELLVPLK------------- 535
D+TN FR L+ V DP E DEL+ K
Sbjct: 400 GCDFTNGFRKLNKVVLSHLNDPGHLEMVCDSLLDECATPDELVKSFKPIMPIHQLMMFAS 459
Query: 536 ------AVLLDIG-------------------------KERK---EAWISWVLSYIQELL 561
+L+ +G ++RK E W++W+ Y L
Sbjct: 460 LGEQSPMILMSLGLSPEMIKNELTKINNMEKVKKTTVEEKRKTDRETWLTWLALYRSRLG 519
Query: 562 SSGISD-------EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPY 614
D E+R +MN+ NP++VLRN++ QSAI AE GDF EV ++L+L+++PY
Sbjct: 520 REYTDDMEIDKLQEKRVEVMNNANPRFVLRNHIAQSAISLAEDGDFSEVNKVLQLLQKPY 579
Query: 615 DEQ 617
D++
Sbjct: 580 DDE 582
>gi|115373116|ref|ZP_01460418.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
gi|310824332|ref|YP_003956690.1| hypothetical protein STAUR_7107 [Stigmatella aurantiaca DW4/3-1]
gi|115369872|gb|EAU68805.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
gi|309397404|gb|ADO74863.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
Length = 488
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 223/513 (43%), Positives = 297/513 (57%), Gaps = 49/513 (9%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
+V P A + +LV+ S L+L+ E RP+F +GA L G P A Y GH
Sbjct: 22 VRVRP-APLAEARLVSVSPEALRLLDLEDAEAHRPEFVEVMNGARLLPGMEPTATVYSGH 80
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG++ +LGDGRA+ LGE+ N ERWE+QLKG+G TP+SR DG AVLRS++RE+LCS
Sbjct: 81 QFGVYVPRLGDGRALLLGEVRNAAGERWEVQLKGSGPTPFSRMGDGRAVLRSTVREYLCS 140
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAMH LGIPTTRALC++ + + V R+ + E GAI+ R+A S +RFG+++ A
Sbjct: 141 EAMHALGIPTTRALCVIGSPEAVYRE-------EVETGAILVRMAPSHVRFGTFEYFAH- 192
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E + V LA++ I HF H+ +++A EVA
Sbjct: 193 -TEQTEHVALLAEHVIARHFPHLAG---------------------APDRHARLFAEVAG 230
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTASLVAQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD F+P F N +D G RY F
Sbjct: 231 RTASLVAQWQAVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDFEPGFICNHSDHSG-RYAF 289
Query: 434 ANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ 493
QP I LWN++ + L + L+ + +E + F + A M +KLGL + ++
Sbjct: 290 DQQPRIALWNLSCLAQALLS--LVPEDALRATLESFAPTFSAHWLARMREKLGLREAREE 347
Query: 494 ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWI 550
++ LL MA + DYT FFRAL + A P + PL+A+ R E +
Sbjct: 348 DRGLLEMLLTRMAESRTDYTRFFRALGHFDASPQARNE----PLRALF-----SRPEGFD 398
Query: 551 SWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLM 610
+W Y L + G D ER M VNPKYVLRNYL Q+AI A+ GDF EV RL ++
Sbjct: 399 AWATLYRTRLAAEGSVDAERPERMARVNPKYVLRNYLAQTAILRAQQGDFSEVDRLRTVL 458
Query: 611 ERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RP++EQPG E YA PP+W +SCSS
Sbjct: 459 SRPFEEQPGSEAYAAPPPSWGRH---LEVSCSS 488
>gi|195999240|ref|XP_002109488.1| hypothetical protein TRIADDRAFT_21587 [Trichoplax adhaerens]
gi|190587612|gb|EDV27654.1| hypothetical protein TRIADDRAFT_21587 [Trichoplax adhaerens]
Length = 626
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 238/605 (39%), Positives = 327/605 (54%), Gaps = 110/605 (18%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
LE LN+D+S +R LP + T+ PR V AC++ V P+ V+NPQLVA S S L+L
Sbjct: 4 TLETLNFDNSCLRCLPVENNTEVYPRNVAGACFSYVQPTP-VDNPQLVAVSPSAMALLDL 62
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
E ER +F +FSG P+ G+ A CY GHQFG ++GQLGDG A+ +GE++N K ER
Sbjct: 63 SQYELERSEFVHYFSGNLPIKGSRTAAHCYCGHQFGYFSGQLGDGAAMYIGEVVNHKDER 122
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
WE+Q KG+G TPYSR ADG VLRSSIREFLCSEAMH LGIPTTRA +T+ V RD+
Sbjct: 123 WEIQFKGSGLTPYSRHADGRKVLRSSIREFLCSEAMHHLGIPTTRAGSCITSDSEVLRDI 182
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQEDLDIVRTLADYAI 329
+Y GNP +E ++ R+A +FLRFGS++I S G++ DI+ L +Y I
Sbjct: 183 YYSGNPIKEKATVILRIAPTFLRFGSFEIFKPLDKITGSMGPSVGRK--DILIQLLEYTI 240
Query: 330 RHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTH 389
HF H+ + + D++ +Y A+ EV + TA LVA WQ VGF H
Sbjct: 241 NTHFPHV-------AAKYPDSDKE---------RYLAFFEEVVKATAKLVALWQCVGFCH 284
Query: 390 GVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFST 449
GVLNTDNMSI G+TIDYGPFGFLD +DP + N +D G RY F NQP+ WN+++ +
Sbjct: 285 GVLNTDNMSIAGITIDYGPFGFLDVYDPDYVCNASD-DGGRYAFINQPEACKWNLSKLAE 343
Query: 450 TLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIIS-----KLLN 500
LA+ + D +N V+E+Y F Y M KLGL + ++ I+S KLL
Sbjct: 344 ALASVLPLAD--SNPVLEKYNELFHKFYLEKMRLKLGLIRKQLPGDEYILSVVHQNKLLF 401
Query: 501 NMAVDKVDYTNFFRALSNV----------------------------KADPSIPEDELLV 532
D+TN FR L+ + ++ PS+P+ +L +
Sbjct: 402 VFFWVGADFTNSFRCLNKLRISEPDRSFSELKACLLSQCTSLKDLKKRSKPSMPQSQLNM 461
Query: 533 PLKAV-----------------------------LLDIGKERKEA-----WISWVLSYIQ 558
+ + L D+ ++ K W W+ Y
Sbjct: 462 LISMIQANPNLITQMGQTALRIKNDLEKLEKLRDLNDLTEDEKRQSDNLIWDGWLKKYQC 521
Query: 559 EL------LSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMER 612
L L ER +MNS NP+++LRNY+ +AI AE GD+ E+RR+LKL++
Sbjct: 522 RLHIEVEHLDVDAIKTERIEVMNSNNPRFILRNYIAHNAIIQAEKGDYSEIRRVLKLLQN 581
Query: 613 PYDEQ 617
PY Q
Sbjct: 582 PYSSQ 586
>gi|90417428|ref|ZP_01225352.1| hypothetical protein GB2207_07562 [gamma proteobacterium HTCC2207]
gi|90330762|gb|EAS46037.1| hypothetical protein GB2207_07562 [gamma proteobacterium HTCC2207]
Length = 502
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 219/518 (42%), Positives = 301/518 (58%), Gaps = 67/518 (12%)
Query: 144 NPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLG 203
+P +V+ ++ +A+ L +DP + P+ SG A P A Y GHQFG+WAGQLG
Sbjct: 34 DPVVVSSNKLLAEELGIDPDNLDSPEMLELMSGNFMTANIKPIALVYSGHQFGVWAGQLG 93
Query: 204 DGRAITLGEILNLKS---------------ERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
DGRA+TLGE+ KS E W++QLKGAG TPYSRFADG AVLRSSIR
Sbjct: 94 DGRAMTLGELPVAKSALGEDELGETEVPHSELWDIQLKGAGPTPYSRFADGRAVLRSSIR 153
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E+LCSEAMH LGI TTRAL LV + V R+ + E GA VCRVA+S +RFGS++
Sbjct: 154 EYLCSEAMHGLGIATTRALSLVDSKTQVYRE-------EVESGATVCRVARSHIRFGSFE 206
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R Q + VR LADY ++ HF T D D + + +
Sbjct: 207 HFHYRNQP--ESVRALADYVVQRHFPQW------------TEDSDRFIKLFKNTVF---- 248
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+TA ++AQWQ VGF HGV+NTDNMSILG T+D+GPFGFLD ++P F N +D G
Sbjct: 249 -----KTAKMIAQWQSVGFNHGVMNTDNMSILGDTLDFGPFGFLDNYNPDFICNHSDTNG 303
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F NQP +GLWN+ +T+L + L+ E V+++Y +F+++++ IM KLGL
Sbjct: 304 -RYAFKNQPSVGLWNLNALATSLTS--LLSSDELIDVLKQYEPEFLNQFRGIMASKLGLE 360
Query: 489 KYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+Y + + ++LL+ M + VDYT FR+L + A D+ +
Sbjct: 361 QYQAEDELLSNELLDLMQTNNVDYTILFRSLCDFTATNHTVRDQFI-------------D 407
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+E + W + Y+ L +SD +R+ M ++NPKYVLRNY+ Q AI+ A+ GD+ EV
Sbjct: 408 REGFDQWAVKYLARLEQQRLSDAQRRDNMRAINPKYVLRNYMAQGAIEKAQTGDYSEVNL 467
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LLK+++ P +E P + YA LPP WA V SCSS
Sbjct: 468 LLKVLQSPREEHPEAQHYAGLPPDWAETISV---SCSS 502
>gi|313206613|ref|YP_004045790.1| hypothetical protein Riean_1123 [Riemerella anatipestifer ATCC
11845 = DSM 15868]
gi|383485919|ref|YP_005394831.1| hypothetical protein RA0C_1391 [Riemerella anatipestifer ATCC 11845
= DSM 15868]
gi|312445929|gb|ADQ82284.1| protein of unknown function UPF0061 [Riemerella anatipestifer ATCC
11845 = DSM 15868]
gi|380460604|gb|AFD56288.1| hypothetical protein RA0C_1391 [Riemerella anatipestifer ATCC 11845
= DSM 15868]
Length = 510
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 226/540 (41%), Positives = 305/540 (56%), Gaps = 46/540 (8%)
Query: 111 FVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
F+ + PGD D++ R+ + V P A N + + +++ +++ + L E P+
Sbjct: 10 FLDQFPGDFSGDTMQRQTPKMLFATVEP-ALFTNYKTITFNQELSNDIGLGSFE---PED 65
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
F + YA Y GHQFG WAGQLGDGRAI GEI N E E+Q KGAG
Sbjct: 66 EAFLAAQDLPKNIRTYATAYAGHQFGQWAGQLGDGRAILAGEIQNTSGETTEIQWKGAGA 125
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSRFADG AVLRSS+RE+L SEAMH LG+PTTRAL L TG+ VTRD+ Y+GNPK+E
Sbjct: 126 TPYSRFADGRAVLRSSVREYLMSEAMHHLGVPTTRALSLAETGEMVTRDILYNGNPKQEK 185
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
GA+V R A SF+RFG +Q+ A+ Q ++D ++ LAD+ I+ +FR I+
Sbjct: 186 GAVVIRTAPSFIRFGHFQLLAA--QNEIDTLKNLADFCIQRYFREIKT------------ 231
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
DE S Y + ++AE TA+L+ +WQ VGFTHGV+NTDNMSILGL+IDYGPF
Sbjct: 232 DE--------SQPYHQFFKKIAETTANLMVEWQRVGFTHGVMNTDNMSILGLSIDYGPFS 283
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERY 469
LD +D +FTPNTTDLPGRRY F Q ++ WN+ Q L LI+D + +E +
Sbjct: 284 MLDEYDLNFTPNTTDLPGRRYAFGRQAEMAQWNLWQLGNALFP--LINDVDFIEQTLEDF 341
Query: 470 GTKFMDEYQAIMTKKLGLPKYNK----QIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
GT F ++Y +M K+GL + K + N M K+DYT FF AL
Sbjct: 342 GTDFWNQYDQMMCSKMGLDTFMKDTDVDFFTDWQNLMTSLKLDYTLFFNALE-------- 393
Query: 526 PEDELLVPLKAV-LLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLR 584
+D L+ + + + E + W+ SY L + IS ER ALM+ NPK+ LR
Sbjct: 394 -KDVHLINWQDISYQSLHTEDLQRLNQWINSYQNRLALNKISPNERLALMSQNNPKFTLR 452
Query: 585 NYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQ-PGMEKYARLPPAWAYRPGVCMLSCSS 643
NYL I G+ +LL +++PY E P E + P + G LSCSS
Sbjct: 453 NYLLHECIKELNKGNISYFNQLLSALKKPYQETFP--EWSVKRPKKYDEVVGCSTLSCSS 510
>gi|452824255|gb|EME31259.1| hypothetical protein Gasu_14990 [Galdieria sulphuraria]
Length = 596
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 209/424 (49%), Positives = 276/424 (65%), Gaps = 30/424 (7%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPS--AEVEN-PQLVAWSESVADSL 158
LE L H+FV ELP DP+ ++ R V +CY+ V+P+ E EN P++VAW VA+ L
Sbjct: 13 LEQLPLQHTFVCELPQDPQQENFTRTVRRSCYSLVAPAFLRERENRPRVVAWCPWVAEEL 72
Query: 159 ELDPKEFER-PDFPL-FFSGATPLAGA--VPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
LD ++ ER +F F G L + YAQCYGGHQFG WAGQLGDGRAI +GE +
Sbjct: 73 -LDLEQDERYKEFSAEVFGGFRVLDSSKNFTYAQCYGGHQFGNWAGQLGDGRAICIGEHI 131
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGK 274
N + ERW++QLKGAGKTPY RFADG AVLRS IREFL SEA+ +GIPTTRALC+V TG+
Sbjct: 132 NQRGERWDIQLKGAGKTPYGRFADGFAVLRSCIREFLASEALASIGIPTTRALCVVETGR 191
Query: 275 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFR 334
V RD+FYDGN K E GA++ R+A SF+RFG++++ A D + +R LADY I+H+F
Sbjct: 192 EVLRDLFYDGNVKPERGAVLTRLAPSFIRFGNFELFAYYN--DFETLRKLADYCIKHYFP 249
Query: 335 HIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
E + + + S DE+ N+YA +A V E A LVA+WQ VGF HGV+NT
Sbjct: 250 --EFLEATSTFS----DEN--------NRYALFATRVVELNAELVAKWQAVGFVHGVMNT 295
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA 454
DN SILGLT+DYGPFGFLD +DP +TPN+TDLPGRRYC+ NQ + WN +F +L +
Sbjct: 296 DNFSILGLTLDYGPFGFLDRYDPLYTPNSTDLPGRRYCYLNQAQVARWNCQKFVQSLIS- 354
Query: 455 KLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYN----KQIISKLLNNMAVDKVDYT 510
L +ME++ + KLGL +N K+++ L+ + D++DYT
Sbjct: 355 -LYGGATVFNIMEKFDETYSSSLSTCYQNKLGLLTWNEETDKELVDTFLDILQTDQLDYT 413
Query: 511 NFFR 514
N +R
Sbjct: 414 NTWR 417
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%)
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AE G+F EV LL+++ PY+E+P + Y+ PP WA GVC+ SCSS
Sbjct: 548 AETGNFDEVENLLQVISNPYEERPELSIYSEEPPEWANVVGVCVNSCSS 596
>gi|74317037|ref|YP_314777.1| hypothetical protein Tbd_1019 [Thiobacillus denitrificans ATCC
25259]
gi|121957653|sp|Q3SEY2.1|Y1019_THIDA RecName: Full=UPF0061 protein Tbd_1019
gi|74056532|gb|AAZ96972.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
25259]
Length = 488
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 235/548 (42%), Positives = 307/548 (56%), Gaps = 63/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LE L +D+ F R LP Y +V P+ V +P LV +S L
Sbjct: 1 MATLESLTFDNGFAR-LP-------------ETYYARVCPT-PVPDPYLVCYSPEALSLL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LD E +RP+ +G L G A Y GHQFG + QLGDGRAI LGE+ N
Sbjct: 46 DLDATELKRPETIETLAGNRLLPGMDAIAALYAGHQFGHYVPQLGDGRAILLGEVRNRAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E WE+QLKGAG+TPYSR DG AVLRSSIREFLCSEAMH L IPTTRAL +V + V R
Sbjct: 106 EGWEIQLKGAGRTPYSRGGDGRAVLRSSIREFLCSEAMHALDIPTTRALAVVGSDHPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ EE A+V R+A SF+RFGS+++ R Q ++ +R LADY I ++ ++
Sbjct: 166 E-------DEETAALVTRLAPSFVRFGSFEVFYYRNQ--VEPIRHLADYVIARYYPELKT 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ ++ Y + +V+ RTA L+AQWQ VGF+HGV+NTDNMS
Sbjct: 217 L---------------------ADPYPEFLRQVSLRTAELMAQWQAVGFSHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILGLT+DYGPFGFLDAFDP F N +D G RY F QPD+ WN+ + + L L+
Sbjct: 256 ILGLTLDYGPFGFLDAFDPGFVCNHSDT-GGRYAFDQQPDVAAWNLTKLAQAL--VPLMS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQI--ISKLLNNMAVDKVDYTNFFRAL 516
+ A+ + Y F Y A M K GL + + I+ L +A ++VDYT F R L
Sbjct: 313 VETASQAISEYPQAFGRAYLARMAAKFGLAPGDDTVPLITDALQLLAGNRVDYTIFLRKL 372
Query: 517 SNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNS 576
+ D PL+ + LD + A+ +W + Y L G D ER A M +
Sbjct: 373 CAFDSQ----ADAGNAPLRDLFLD-----RAAFDAWAVRYGAALRQHGQPDAERAATMRT 423
Query: 577 VNPKYVLRNYLCQSAI-DAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
NPKY+LRNYL ++AI AA+L D+ EV RL +L+ RP+DEQP E YA PP WA R
Sbjct: 424 RNPKYILRNYLAENAIRRAADLRDYSEVERLHRLLARPFDEQPAFEAYAAEPPDWAKRIE 483
Query: 636 VCMLSCSS 643
V SCSS
Sbjct: 484 V---SCSS 488
>gi|302039647|ref|YP_003799969.1| hypothetical protein NIDE4384 [Candidatus Nitrospira defluvii]
gi|300607711|emb|CBK44044.1| conserved protein of unknown function UPF0061 [Candidatus
Nitrospira defluvii]
Length = 491
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 228/546 (41%), Positives = 306/546 (56%), Gaps = 62/546 (11%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
+LE L +D+S+ R LP A Y KV+P+ P L++ + + + L+L
Sbjct: 5 SLETLTFDNSYAR-LP-------------EAFYAKVNPTPFSAAPFLISANRAAMELLDL 50
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
DP E RP+F F G+ + G P A Y GHQFG++ QLGDGRAI L E+ N + ER
Sbjct: 51 DPTEAARPEFAGVFGGSLLIPGMEPLAMLYSGHQFGVYVPQLGDGRAILLAEVKNGRGER 110
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
W+L LKGAG TP+SR DG +VLRS+IRE+LC EAMH LGIPTTRALCLV + V R+
Sbjct: 111 WDLHLKGAGMTPFSRDGDGRSVLRSAIREYLCCEAMHGLGIPTTRALCLVGSDDKVYRE- 169
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMN 340
+ E GA + R+A S +RFG+++I R Q + ++ LADY I HF +
Sbjct: 170 ------QVETGATIVRMAPSHVRFGTFEIFYYRKQHEH--LQRLADYVIEMHFPDLAP-- 219
Query: 341 KSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSIL 400
++KYA + V ERTA L+A WQ VG++HGVLNTDNMSIL
Sbjct: 220 -------------------AADKYARFFAGVVERTAKLIAHWQAVGWSHGVLNTDNMSIL 260
Query: 401 GLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK 460
GLT+DYGP+GF+D +DP F N +D G RY F QP IGLWN++ + TL +
Sbjct: 261 GLTLDYGPYGFMDDYDPGFICNHSDYNG-RYAFNQQPYIGLWNLSCLAQTL--LPFAPKE 317
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALS 517
E ++ Y T Y M KLGL + ++ ++ +L + M +VDYT F+R L
Sbjct: 318 ELKAALDGYQTSVDRHYHNNMRAKLGLVEDRAEDEALLQELKSLMVGSRVDYTIFWRELG 377
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
+D + L + ER +AW Y L DEER+ M+ V
Sbjct: 378 TFSSDAGAKNERLREHF------LNPERFDAWAG---QYRDRLQGEQSRDEERRIRMDRV 428
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVC 637
NPKY+LRNYL Q AI+ A+ D+ E+ RLL L+++PY EQPGM+ YA PP W V
Sbjct: 429 NPKYILRNYLAQGAIEKAQQKDYSEIERLLTLLQQPYTEQPGMDSYAAAPPNWGKHLSV- 487
Query: 638 MLSCSS 643
SCSS
Sbjct: 488 --SCSS 491
>gi|427789073|gb|JAA59988.1| Putative selenoprotein o [Rhipicephalus pulchellus]
Length = 620
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 240/643 (37%), Positives = 351/643 (54%), Gaps = 121/643 (18%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LE L +D+ +R LP D T + R V A +++V P A +E+P++V +SE L
Sbjct: 1 MSTLETLRFDNLALRTLPVDKETRNYVRTVSGAVFSRVLP-APLESPEMVVFSEDAMMLL 59
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P E +R D +FSG L G+ A CY GHQFG +AGQLGDG A+ LGE++N K
Sbjct: 60 DLPPSELQRKDAAEYFSGNKLLPGSETAAHCYCGHQFGYFAGQLGDGAAMYLGEVINRKG 119
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERWE+QLKGAG TPYSR ADG VLRSS+REFLCSEAMH+LG+PTTRA VT+ V+R
Sbjct: 120 ERWEIQLKGAGLTPYSRSADGRKVLRSSLREFLCSEAMHYLGVPTTRAGTCVTSSTTVSR 179
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQEDLDIVRTLADY 327
DMFYDG+PK E +++ R+A +FLRFGS++I S G++D I+ L +Y
Sbjct: 180 DMFYDGHPKNEKCSVILRIAPTFLRFGSFEIFKTLDSFTGRVGPSVGRKD--ILLQLLNY 237
Query: 328 AIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGF 387
AI F + S GD+ + Y + +V ++TA LVA+WQ VGF
Sbjct: 238 AIETFFPEVYR---------SCGDDKEQM-------YIEFFKDVVKKTAHLVAKWQCVGF 281
Query: 388 THGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF 447
HGVLNTDNMSILGLTIDYGPFGF++ FDP NT+D G RY + QP+I LWN+ +F
Sbjct: 282 CHGVLNTDNMSILGLTIDYGPFGFMERFDPDHICNTSD-DGGRYTYIKQPEICLWNLRKF 340
Query: 448 STTLAAA----------------------------------KLIDDKEANY----VMERY 469
+ + +A +L++DK+ ME+
Sbjct: 341 AEAIQSAVPLSKTSPCLDLYASEYETCFLGGMRRKLGLLKKELVEDKDLVTSFYDTMEKT 400
Query: 470 GTKFMDEYQAIMTKKLGLPKY------NKQIISKLLNNMAVDKVDYTNFFRALSNV---- 519
G F ++ + T L +P + + ++SKL++ + + T+ +A ++
Sbjct: 401 GADFTRSFRCLST--LAVPGHPDHEPSKESLLSKLMSCCS-SHAELTDHLKAQTSSRDFQ 457
Query: 520 ------KADPSIPE---------DELLVPLKAV--LLDIGKERKEA-----WISWVLSYI 557
K +P + E + ++ ++ L ++ E EA W W+ +Y
Sbjct: 458 MFLILSKNNPELLEQLGKGALAKERIMAQIEKTKELKEMSAENFEARNKGMWTDWIEAYC 517
Query: 558 QELLS--SGISD-----EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLM 610
+ L + G+ D ++R +MNS NP++VLRNY+ Q AIDAAE GD+ E +++LK++
Sbjct: 518 KRLTADVEGVKDLQALQDDRVHVMNSSNPRFVLRNYIAQQAIDAAEKGDYSEAQKVLKIL 577
Query: 611 ERPYDEQPGMEKYARLPPA----------WAYRPGVCMLSCSS 643
+RP+ + P K ++ PA +A +SCSS
Sbjct: 578 QRPFSDDPLELKGKQVCPAVFDEGFYEGRYALSAKALRVSCSS 620
>gi|110638543|ref|YP_678752.1| hypothetical protein CHU_2147 [Cytophaga hutchinsonii ATCC 33406]
gi|121957851|sp|Q11T54.1|Y2147_CYTH3 RecName: Full=UPF0061 protein CHU_2147
gi|110281224|gb|ABG59410.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 515
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 227/540 (42%), Positives = 307/540 (56%), Gaps = 40/540 (7%)
Query: 109 HSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERP 168
++F PGD ++ R+ Y V P+ V +PQL+AWS VA+ L L E P
Sbjct: 11 NTFTETFPGDLSMNNTTRQTPGVLYCSVLPTP-VHHPQLLAWSADVAEMLGL---ESPVP 66
Query: 169 DFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGA 228
+ L G T PYA CY GHQFG WAGQLGDGRAI+LG S +ELQLKGA
Sbjct: 67 EDVLILGGNTVNPTMKPYASCYAGHQFGNWAGQLGDGRAISLGFCSGKDSMEYELQLKGA 126
Query: 229 GKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKE 288
G TPYSR +DG AVLRSS+RE+L SEAMH+LG+PTTRAL LV+TG V RDMFY+G+
Sbjct: 127 GPTPYSRNSDGRAVLRSSLREYLMSEAMHYLGVPTTRALSLVSTGDAVLRDMFYNGHAAY 186
Query: 289 EPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
EPGA+V RVA SF+RFG+++I A R DL + L D+ I ++ I ++
Sbjct: 187 EPGAVVLRVAPSFIRFGNFEILAERNNRDLS--QQLCDWVITRYYPEIRGEDR------- 237
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGP 408
VV L VAERTA +V QW VGF HGV+NTDNMSILG+TIDYGP
Sbjct: 238 -------VVQLFQ--------AVAERTADMVVQWLRVGFVHGVMNTDNMSILGVTIDYGP 282
Query: 409 FGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMER 468
+ F+D +D FTPNTTDLPGRRY F NQ + WN+ + + LA DK V++
Sbjct: 283 YSFVDEYDARFTPNTTDLPGRRYAFGNQAAVAYWNLGRLANALAFLVPETDKLVA-VLKN 341
Query: 469 YGTKFMDEYQAIMTKKLG---LPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
Y + +Y +M KLG L + ++ +I + K D T F++ L ++ ADP
Sbjct: 342 YQDVYETKYYTMMANKLGFDALREDDRLLIDSFEEMLRTVKPDMTMFYQLLIDLPADPGT 401
Query: 526 PEDELLVPLKAVLLD-IGKERKEAWI-SWVLSYIQELLSSGISDEERKALMNSVNPKYVL 583
D +K E EA + + + +Y + + ++ S E M + NP++VL
Sbjct: 402 AAD-----VKQFFQSCFYTEADEALLHTCIAAYSKRIKTNTCSKEVSAEKMRAANPRFVL 456
Query: 584 RNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RNY+ AI+ E GD +++L + +++PY + E + + P A + G MLSCSS
Sbjct: 457 RNYILHEAIEKLEKGDDALLKKLEEYIKQPYSKNAD-EYFIKRPDWAAQKAGCSMLSCSS 515
>gi|407451543|ref|YP_006723267.1| hypothetical protein B739_0767 [Riemerella anatipestifer RA-CH-1]
gi|403312528|gb|AFR35369.1| hypothetical protein B739_0767 [Riemerella anatipestifer RA-CH-1]
Length = 510
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 224/540 (41%), Positives = 304/540 (56%), Gaps = 46/540 (8%)
Query: 111 FVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
F+ + PGD D++ R+ + V P A N + + +++ +++ + L E P+
Sbjct: 10 FLDQFPGDFSDDTMQRQTPKMLFATVEP-ALFTNYKTITFNQELSNDIGLGSFE---PED 65
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
F + YA Y GHQFG WAGQLGDGRAI GEI N E E+Q KGAG
Sbjct: 66 EAFLAAQDLPKNIRTYATAYAGHQFGQWAGQLGDGRAILAGEIQNTSGETTEIQWKGAGA 125
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSRFADG AVLRSS+RE+L SEAMH LG+PTTRAL L TG+ VTRD+ Y+GNPK+E
Sbjct: 126 TPYSRFADGRAVLRSSVREYLMSEAMHHLGVPTTRALSLAETGEMVTRDILYNGNPKQEK 185
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
GA+V R A SF+RFG +Q+ + Q ++D ++ LAD+ I+ +FR I+
Sbjct: 186 GAVVIRTAPSFIRFGHFQLLTA--QNEIDTLKNLADFCIQRYFREIKT------------ 231
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
DE Y + ++AE TA+L+ +WQ VGFTHGV+NTDNMSILGL+IDYGPF
Sbjct: 232 DEPQP--------YHQFFKKIAETTANLMVEWQRVGFTHGVMNTDNMSILGLSIDYGPFS 283
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERY 469
LD +D +FTPNTTDLPGRRY F Q ++ WN+ Q L LI+D + +E +
Sbjct: 284 MLDEYDLNFTPNTTDLPGRRYAFGRQAEMAQWNLWQLGNALFP--LINDVDFIEQTLEDF 341
Query: 470 GTKFMDEYQAIMTKKLGLPKYNK----QIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
GT F ++Y +M K+GL + K + N MA K+DYT FF AL
Sbjct: 342 GTDFWNQYDQMMCSKMGLDTFMKDTDVDFFTDWQNLMASLKLDYTLFFNALE-------- 393
Query: 526 PEDELLVPLKAV-LLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLR 584
+D L+ + + + E + W+ SY L + I+ ER ALM+ NPK+ LR
Sbjct: 394 -KDVHLINWQDISYQSLHTEDLQRLNQWINSYQNRLALNKIAPNERLALMSQNNPKFTLR 452
Query: 585 NYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQ-PGMEKYARLPPAWAYRPGVCMLSCSS 643
NYL I+ G+ +LL ++ PY E P E + P + G LSCSS
Sbjct: 453 NYLLHECIEELNNGNTNYFHQLLTALKNPYQETFP--EWSVKRPKKYDEVVGCSTLSCSS 510
>gi|167537910|ref|XP_001750622.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770918|gb|EDQ84595.1| predicted protein [Monosiga brevicollis MX1]
Length = 2462
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 236/605 (39%), Positives = 329/605 (54%), Gaps = 104/605 (17%)
Query: 100 KALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
+AL L +D+S +RELP DP T + R V A Y++V P A VENPQ+VA S + L
Sbjct: 55 EALAQLRFDNSALRELPVDPETKNFTRRVSGAFYSRVEP-APVENPQVVALSWPALELLG 113
Query: 160 LDPKEFE-RPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L + DF F+G P+ GA A CY GHQFG ++GQLGDG A+ LGE++N ++
Sbjct: 114 LTEATVQVDDDFVAAFAGNVPIPGAEYAAHCYCGHQFGYFSGQLGDGAAMYLGEVVNERN 173
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERWELQ KGAG TP+SR ADG VLRSSIREFLCSEAMH L IPTTRA L+T+ V R
Sbjct: 174 ERWELQFKGAGLTPFSRQADGRKVLRSSIREFLCSEAMHALNIPTTRAGSLITSDTRVVR 233
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI-----------HASRGQEDLDIVRTLADY 327
D+FY G+ +E ++ R+A SFLRFGS+++ +S GQ +++ + L DY
Sbjct: 234 DIFYTGSLIQERATVITRLAPSFLRFGSFEVVKEKDPKTMQEGSSPGQ--VELTKKLLDY 291
Query: 328 AIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGF 387
+ HHF I + + S +K+A + EV RTA+LVAQWQ VG+
Sbjct: 292 LLAHHFADIWSQDSS-----------------PEDKFAEFLAEVTRRTAALVAQWQCVGW 334
Query: 388 THGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF 447
HGVLNTDNMS+LGLTIDYGPFGF++ +DP+F N +D G RY + +QP+I WN+ +
Sbjct: 335 CHGVLNTDNMSVLGLTIDYGPFGFMEQYDPNFICNRSD-DGGRYDYQSQPEICRWNLHRL 393
Query: 448 STTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL--PK-YNKQIISKLLNNMAV 504
+ L L ++ + + Y F Y M KLGL P+ ++++I L MA
Sbjct: 394 ADVL-VPHLPLERARDIIDRHYTRTFEQAYMDGMRAKLGLLYPQGEDQELIKALFTVMAK 452
Query: 505 DKVDYTNFFRALSNVKAD-------------------------PSIPEDE----LLVPLK 535
D+TN FR LS D P IPED+ L +P
Sbjct: 453 TSADFTNTFRLLSRFSIDDQGRALWPALREQLYPLDVQRLLSKPRIPEDQMRQLLAMPQL 512
Query: 536 AVLLDIG------KERKEA--------------------WISWVLSYI------------ 557
A ++ +G ++RK A W W Y
Sbjct: 513 AEMIGLGAGVLNVEQRKSARFKELQQQTQEAMDEDNLTHWQLWFNKYAARLQVDNDTALK 572
Query: 558 QELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQ 617
Q+ ++ + R+ +M+ NP +VLRN++ Q+AI AE GDF EV+R+L+ + RP+ E+
Sbjct: 573 QDQMARDAVESRRRQVMDEHNPSFVLRNHVAQTAIAKAEQGDFSEVQRVLEELRRPFAER 632
Query: 618 PGMEK 622
+++
Sbjct: 633 EDLQR 637
>gi|225010070|ref|ZP_03700542.1| protein of unknown function UPF0061 [Flavobacteria bacterium
MS024-3C]
gi|225005549|gb|EEG43499.1| protein of unknown function UPF0061 [Flavobacteria bacterium
MS024-3C]
Length = 559
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 231/582 (39%), Positives = 322/582 (55%), Gaps = 75/582 (12%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
DH F++ LP DP D PR V A Y+ P + PQ + + ++ +L + KE +
Sbjct: 7 DH-FIQSLPQDPSLDEYPRAVQGALYSFTQPK-KTAFPQKIHLNTNLLKTLGI--KE-DD 61
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI--------LNLKS- 218
P+ +G G +P+A YGGHQFG WAGQLGDGRAI LG + LN S
Sbjct: 62 PELVQQLTGNKISEGHIPFAMNYGGHQFGHWAGQLGDGRAIHLGGLKISGDTKDLNWNSP 121
Query: 219 ERW-ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
W ++QLKGAG TPYSR ADGLAVLRSSIRE+LCSEAM+ LG+PTTRAL L +G V
Sbjct: 122 SNWAQIQLKGAGPTPYSRSADGLAVLRSSIREYLCSEAMYHLGVPTTRALSLCLSGDLVN 181
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIE 337
RDM Y+GNP E GAIV RVA +F+RFGS+++ ASRG+ + +++TL I++++ I+
Sbjct: 182 RDMLYNGNPGLEQGAIVARVAPNFIRFGSFELPASRGE--IGLLKTLIKQTIKYYYPEIK 239
Query: 338 N-MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ ++ +L F +V E TA ++A WQ VGF HGVLNTDN
Sbjct: 240 APLKEATTLFFK---------------------KVCEDTAKVIAAWQRVGFVHGVLNTDN 278
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MS+LGLTIDYGP+G+++ +D +TPNTTD RY F NQ +GLWN+ Q + L +
Sbjct: 279 MSVLGLTIDYGPYGWMEPYDLDWTPNTTDAKESRYRFGNQHQVGLWNLYQLANALYPI-V 337
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNN----MAVDKVDYTNF 512
D ++ + + Y I +KLGL + N ++ L+ + +++ + D T F
Sbjct: 338 EDAAPLEAALDHFKETYETTYAQIRKEKLGLCQSNGVVLDALIEDLDPLLSLIETDMTLF 397
Query: 513 FRALSNVKADP---------------------------SIPEDELLVPLKAVLLD---IG 542
+R L+ K+D SI L L D +
Sbjct: 398 YRELALFKSDQFLEKIKTTPVHTNSDSSTHANTTHSTLSIDNHALFGSLIKAFYDPRALN 457
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
K WI W+ SY + L+ ++D+ MN+VNPKYVLRNY+ Q AI+AAE D+
Sbjct: 458 GTVKNKWILWLSSYAKIRLTQKLADQVVIEKMNAVNPKYVLRNYMAQMAIEAAENSDYSI 517
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCMLSCSS 643
+ L +L++ PY+EQ K+ P WA + G LSCSS
Sbjct: 518 IEELFQLLQNPYEEQHEFNKWYAKRPEWARNKIGCSQLSCSS 559
>gi|169234793|ref|NP_001108489.1| selenoprotein O [Gallus gallus]
Length = 652
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 254/626 (40%), Positives = 329/626 (52%), Gaps = 114/626 (18%)
Query: 76 LKNQRLDTET-ETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYT 134
L+ R DTE ET GG L L +D+ +R LP DP D PR V AC+
Sbjct: 8 LRRGRADTERGETGGG----------WLSALRFDNLAMRSLPVDPFEDCAPRAVPGACFA 57
Query: 135 KVSPSAEVENPQLVAWSESVADSLELD---PKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
+V P+ + NP+LVA S L L+ P+ + L+FSG L G+ P A CY
Sbjct: 58 RVRPTP-LRNPRLVAMSAPALALLGLEAGGPEAEREAEAALYFSGNRLLPGSEPAAHCYC 116
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG +AGQLGDG AI LGE+ + RWELQLKGAG TP+SR ADG VLRSSIREFL
Sbjct: 117 GHQFGSFAGQLGDGAAIYLGEVRGPRGARWELQLKGAGITPFSRQADGRKVLRSSIREFL 176
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI-- 309
CSEAM LGIPTTRA VT+ V RD+FYDGNPK+E +V R+A +F+RFGS++I
Sbjct: 177 CSEAMFHLGIPTTRAGTCVTSDSEVVRDIFYDGNPKKERCTVVLRIASTFIRFGSFEIFK 236
Query: 310 ----HASRGQEDL---DIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSN 362
+ R + DI + DY I + I+ + S+
Sbjct: 237 PPDEYTGRKGPSVNRNDIRIQMLDYVIGTFYPEIQEAHADNSI----------------Q 280
Query: 363 KYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPN 422
+ AA+ E+ +RTA LVA+WQ VGF HGVLNTDNMSI+GLTIDYGPFGF+D +DP N
Sbjct: 281 RNAAFFKEITKRTARLVAEWQCVGFCHGVLNTDNMSIVGLTIDYGPFGFMDRYDPEHICN 340
Query: 423 TTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMT 482
+D G RY + QP+I WN+ + + L +L + + E Y +F Y M
Sbjct: 341 GSDNTG-RYAYNRQPEICKWNLGKLAEAL-VPELPLEISELILEEEYDAEFEKHYLQKMR 398
Query: 483 KKLGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALS--NVKADPSIPED-------- 528
KKLGL + + +++S+LL M + D+TN F LS +V DPS ED
Sbjct: 399 KKLGLIQLELEEDSKLVSELLETMHLTGGDFTNIFYLLSSFSVDTDPSRLEDFLEKLISQ 458
Query: 529 -----ELLVPLK--------AVLL-----------------DIGKE-------------- 544
EL V K +++L +I KE
Sbjct: 459 CASVEELRVAFKPQMDPRQLSMMLMLAQSNPQLFALIGTKANINKELERIEQFSKLQQLT 518
Query: 545 -------RKEAWISWVLSYIQELLS--SGISD-----EERKALMNSVNPKYVLRNYLCQS 590
K W W+ Y L ISD ER +MNS NP+Y+LRNY+ Q+
Sbjct: 519 AADLLSRNKRHWTEWLEKYRVRLHKEVESISDVDAWNTERVKVMNSNNPRYILRNYIAQN 578
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDE 616
AI+AAE GDF EVR +LKL+E P+ E
Sbjct: 579 AIEAAENGDFSEVRNVLKLLENPFQE 604
>gi|383452769|ref|YP_005366758.1| hypothetical protein COCOR_00752 [Corallococcus coralloides DSM
2259]
gi|380727688|gb|AFE03690.1| hypothetical protein COCOR_00752 [Corallococcus coralloides DSM
2259]
Length = 488
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 227/552 (41%), Positives = 303/552 (54%), Gaps = 71/552 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ +LE L +D+S+ R PG +V+P + Q+V+ + + L
Sbjct: 1 MASLEQLVFDNSYARLPPG--------------FAARVAP-VPFPDAQVVSVNPAALRLL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
LD +E RP+F F GATPL G P A Y GHQFG++ +LGDGRA+ LGE+
Sbjct: 46 GLDAEEAARPEFARVFGGATPLPGMEPLAMVYAGHQFGVYVPRLGDGRALLLGEVRAPDG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+W+L LKG G TP+SR DG AVLRS++RE+L EA+H LGIPTTRALC++ + V R
Sbjct: 106 GKWDLHLKGGGPTPFSRGGDGRAVLRSTVREYLAGEALHALGIPTTRALCILGSRTPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG+++ H + E V TLAD+ I HF H+
Sbjct: 166 E-------EVETGAMLVRLAPSHVRFGTFEYFHHT---EQPGHVATLADHVIAAHFPHL- 214
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
G E ++A + EV ERTA LVA+WQ VGF HGV+NTDNM
Sbjct: 215 -----------AGQE---------GRHARFFAEVVERTAELVARWQAVGFAHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLT+DYGP+GFLD FDP F N +D G RY F QP + LWN+A L LI
Sbjct: 255 SILGLTLDYGPYGFLDDFDPGFVCNHSDHQG-RYAFDQQPRVALWNLACLGEAL--LTLI 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ EA + + F + A M +KLGL + ++ ++ L MA VDYT FFR
Sbjct: 312 TEDEARATLTLFQPTFARHFLARMREKLGLKEARDEDRSLLEDLFALMASSHVDYTRFFR 371
Query: 515 ALSNVKADPSIPEDEL---LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
AL+ + P D L +P E + W Y L + G D ER
Sbjct: 372 ALNRFDSSPGARNDALRDHFLP------------PEGFDGWAERYRARLEAEGSVDAERH 419
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
A ++ VNPKYVLRN++ Q AI A+ GDF EV R+L L+ P+DE PG E YA PPAW
Sbjct: 420 ASLDRVNPKYVLRNWVAQQAIARAQEGDFAEVDRVLALVSAPFDEHPGQEAYAASPPAWG 479
Query: 632 YRPGVCMLSCSS 643
++SCSS
Sbjct: 480 RH---LVVSCSS 488
>gi|291227954|ref|XP_002733947.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 584
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 224/531 (42%), Positives = 308/531 (58%), Gaps = 49/531 (9%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADS-LELDPKEFERPDFPLFFSGATPLAGAV 184
R+V + ++KV P+ +LVA S + ++ L+LD E F F SG T L G++
Sbjct: 90 RQVKNVLFSKVLPTPLQTTVKLVAVSSDLLENVLDLDKSISETEHFLTFVSGNTILPGSI 149
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P + YGGHQFG W+ QLGDGRA LGE +N +RWELQLKG+G TPYSR DG AVLR
Sbjct: 150 PISHRYGGHQFGEWSDQLGDGRAHLLGEYVNRNGDRWELQLKGSGLTPYSRRGDGRAVLR 209
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSEAM+ LGIPT+RAL ++ +G V RD FYDG+ K E A+V R+A+S+ R
Sbjct: 210 SSIREFLCSEAMYHLGIPTSRALSVIVSGDPVWRDQFYDGHAKTEKAAVVLRLAKSWFRI 269
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
GS +I A + ++ ++R L D+ I ++F I+ DE NKY
Sbjct: 270 GSLEILAMK--REIKLLRRLTDFVIENYFPSID-----------ISDE---------NKY 307
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
+ E+ +TA L+A+W VGF HGV+NTDN S+L +TIDYGPFGFLD ++PSF PNT+
Sbjct: 308 LSLFSEIVSQTADLMARWMSVGFAHGVMNTDNFSLLSITIDYGPFGFLDDYNPSFIPNTS 367
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTL-----AAAKLIDDKEANYVMERYGTKFMDEYQA 479
D G Y + NQPDIG +N+ + L K + + ++ Y T+FM+
Sbjct: 368 DDEG-MYSYENQPDIGHFNMNRLRAALWPLWNNKQKQLSEMILQGYIDIYKTRFME---- 422
Query: 480 IMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPED--ELLVPL 534
I KLG + + II LL M + D+T FR L N+ + + L L
Sbjct: 423 IFRGKLGFLSTDDKDEYIIGLLLKMMEDTRTDFTMTFRQLGNLTFQHIQNNNVSDALWAL 482
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
K + L E W +W+ Y + S +D +R MN+VNPKY+LR ++ +SAI
Sbjct: 483 KTLQL------HEKWNNWLQLYYARITSEDDTDVKRMNRMNNVNPKYILRYWMAESAIRK 536
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGME--KYARLPPAWAYRPGVCMLSCSS 643
AE DF EV++LL++++ PY EQ ME YA PP W+ + V SCSS
Sbjct: 537 AEDNDFSEVQKLLEILQAPYTEQLDMEPTGYADRPPEWSKKLKV---SCSS 584
>gi|152980384|ref|YP_001353238.1| hypothetical protein mma_1548 [Janthinobacterium sp. Marseille]
gi|151280461|gb|ABR88871.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
Length = 559
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 231/537 (43%), Positives = 305/537 (56%), Gaps = 60/537 (11%)
Query: 120 RTDSIPREVLHAC-----YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFF 174
RT+++P E A YT + P+ + +P LV S S A + LD E +F F
Sbjct: 70 RTNTLPLENSFATLPPAHYTALMPTP-LPDPYLVCASASTAAMIGLDFAETGGTEFIETF 128
Query: 175 SGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE---RWELQLKGAGKT 231
+G L + P + Y GHQFG+WA QLGDGRAI LG++ + E R ELQLKGAG T
Sbjct: 129 TGNRLLLNSKPLSAVYSGHQFGVWASQLGDGRAILLGDVPAPEIEPSGRLELQLKGAGLT 188
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR DG AVLRSSIREFLCSEAM LG+PTTRALC+ + + V R+ + E
Sbjct: 189 PYSRMGDGRAVLRSSIREFLCSEAMAALGVPTTRALCVTGSDQLVMRE-------QAETA 241
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
A+ RVAQSF+RFGS++ E D ++TLADY I + + N
Sbjct: 242 AVATRVAQSFVRFGSFEHWFY--NEKHDELKTLADYVIDRFYPYFRN------------- 286
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
+ N Y EV RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGPFGF
Sbjct: 287 --------SENPYKDLLTEVTLRTAHMIAHWQAVGFMHGVMNTDNMSILGLTLDYGPFGF 338
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVMERYG 470
++AF+ + N TD GR Y +A QP IG WN ++ A LI D E + Y
Sbjct: 339 MEAFNATHICNHTDQQGR-YSYARQPQIGEWNC--YALGQALLPLIGDVDETQAALRIYK 395
Query: 471 TKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVK-ADPSIP 526
F ++++ +M KLGL ++Q+ L + VD+T FFR L N++ A+
Sbjct: 396 PAFAEKFEELMHAKLGLKTRQSDDRQLFDSLFGILQDSHVDFTTFFRQLGNLQPANSDSH 455
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
ED L+ + +D + A+ +W L Y L D ERK M++VNPKY+LRNY
Sbjct: 456 ED-----LRDLFID-----RAAFDAWALQYGARLQQENSIDSERKLAMDAVNPKYILRNY 505
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L Q AI+ A+ DF EV +LL+++E+P+DEQPG EKYA LPP WA +SCSS
Sbjct: 506 LAQIAIEKAQNKDFSEVAKLLQVLEKPFDEQPGNEKYAALPPDWA---NDLEVSCSS 559
>gi|220934366|ref|YP_002513265.1| hypothetical protein Tgr7_1192 [Thioalkalivibrio sulfidophilus
HL-EbGr7]
gi|254799974|sp|B8GQ83.1|Y1192_THISH RecName: Full=UPF0061 protein Tgr7_1192
gi|219995676|gb|ACL72278.1| protein of unknown function UPF0061 [Thioalkalivibrio sulfidophilus
HL-EbGr7]
Length = 492
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 237/554 (42%), Positives = 306/554 (55%), Gaps = 71/554 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LEDL + +S+ R LP A + + P A P VA++E A +
Sbjct: 1 MHKLEDLKFINSYAR-LP-------------EAFHDRPMP-APFPQPYRVAFNEKAAALI 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L P+E R +F F+G PL G P + Y GHQFG++ QLGDGRA+ LGE+ +
Sbjct: 46 GLHPEEASRAEFVNAFTGQIPLTGMEPVSMIYAGHQFGVYVPQLGDGRALVLGEVQTPEG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
RWELQLKG+G T +SR ADG AVLRS+IRE+L SEAMH LG+PTTRAL ++ + V R
Sbjct: 106 ARWELQLKGSGPTRFSRGADGRAVLRSTIREYLASEAMHALGVPTTRALTILGSDMPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ + E AI+ R+A S +RFGS++ A G ++ LADY I HH+ +
Sbjct: 166 E-------RVETAAILVRMAPSHVRFGSFEYFAHGGYPAR--LKELADYVIAHHYPELAE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A V RTA L+A+WQ VGF HGV+NTDNMS
Sbjct: 217 RYQP---------------------YLALLETVIRRTADLIARWQAVGFAHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILGLTIDYGP+GFLDA+ P F N +D G RY F QP I WN+A + L L+
Sbjct: 256 ILGLTIDYGPYGFLDAYQPGFICNHSDHRG-RYAFDQQPRIAWWNLACLAQAL--LPLLH 312
Query: 459 DKEANYV------MERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDY 509
+ EA V ++R+ +F + A+M KLGL + ++ +I +LL MA VDY
Sbjct: 313 EDEAAGVELARAALDRFNGQFASCWTALMGAKLGLLETRREDLDLIERLLGLMAGSAVDY 372
Query: 510 TNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE 569
T FFRAL +P+ L+A D EA+ +W+ Y L G D
Sbjct: 373 TRFFRALGRFHDPAWLPD------LRAAFRD-----PEAFDAWLADYRARLGHEGREDAA 421
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
R A M +VNPKYVLRNYL Q AI AE DF EV RL +L+ERP+DEQP ME YA LPP
Sbjct: 422 RLADMLAVNPKYVLRNYLAQMAIAKAEQKDFSEVERLQRLLERPFDEQPEMEAYAALPPD 481
Query: 630 WAYRPGVCMLSCSS 643
WA V SCSS
Sbjct: 482 WAEEIAV---SCSS 492
>gi|260794380|ref|XP_002592187.1| hypothetical protein BRAFLDRAFT_88076 [Branchiostoma floridae]
gi|229277402|gb|EEN48198.1| hypothetical protein BRAFLDRAFT_88076 [Branchiostoma floridae]
Length = 567
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 234/552 (42%), Positives = 312/552 (56%), Gaps = 60/552 (10%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LE LN+D+ +R LP D +++PR+V AC++K VA+S L
Sbjct: 1 MATLETLNFDNLVLRSLPIDNSGENVPRQVPGACFSKT-----------VAFSAQALQLL 49
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P E RP+F FSG+ L G+ A CY GHQFG ++GQLGDG A+ LGE++N
Sbjct: 50 DLPPAELTRPEFAQHFSGSKLLPGSETAAHCYCGHQFGHFSGQLGDGAAMYLGEVVNKSG 109
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERWE+QLKGAG TPYSR ADG VLRSSIREFLCSEAMH LGIPTTRA VT+ V R
Sbjct: 110 ERWEIQLKGAGLTPYSRTADGRKVLRSSIREFLCSEAMHHLGIPTTRAGSCVTSDSKVLR 169
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQEDLDIVRTLADY 327
D++Y+GN E IV R+AQ+FLRFGS++I S G+ DI+ T+ DY
Sbjct: 170 DVYYNGNASYERCTIVLRIAQTFLRFGSFEIFKPTDEITGRKGPSVGRN--DILITMLDY 227
Query: 328 AIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGF 387
AI+ F I+ + + +Y A+ E+ RTA LVA+WQ VGF
Sbjct: 228 AIKTFFPEIQEAHAD-----------------SEERYLAFFREIVHRTARLVAEWQCVGF 270
Query: 388 THGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF 447
HGVLNTDNMSILGLTIDYGPFGFLD +D N +D G RY + NQP++ WN +F
Sbjct: 271 CHGVLNTDNMSILGLTIDYGPFGFLDRYDADNICNGSD-DGARYSYRNQPEMCKWNCEKF 329
Query: 448 STTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAVDKV 507
S ++ A + + V+E + KF + Y + M KKLGL K +L M +
Sbjct: 330 SEAISEA--LPTVLSKPVLEEFDPKFSEHYLSKMRKKLGLLKKELPEDKQLQMLMLLLST 387
Query: 508 DYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK-----EAWISWVLSY------ 556
+ + + K E L +K D+ +E+K + W W+ Y
Sbjct: 388 NPSLLMQLGGQGKIMREFERMEKLEEIK----DLTQEQKATADAQKWTEWLEKYTARLKL 443
Query: 557 -IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYD 615
QE + ++ER MNS NPK++LRNY+ Q+AI AAE GDF EV+R+L+L+E PY
Sbjct: 444 ETQEAGNVEQLNKERVVTMNSNNPKFILRNYIAQNAITAAEEGDFTEVQRVLRLLEHPYS 503
Query: 616 EQPGMEKYARLP 627
E + + A P
Sbjct: 504 EDVDLGELAVAP 515
>gi|413962688|ref|ZP_11401915.1| hypothetical protein BURK_022290 [Burkholderia sp. SJ98]
gi|413928520|gb|EKS67808.1| hypothetical protein BURK_022290 [Burkholderia sp. SJ98]
Length = 530
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 231/526 (43%), Positives = 299/526 (56%), Gaps = 65/526 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPD---FPLFFSGATPL---AGAVPYAQCYG 191
P+A V +P LV S +A++L DP+ P+ F FF+G A A+PYA Y
Sbjct: 50 PAAPVPDPYLVGMSREMAETLGFDPQVATGPEKDAFAAFFAGNPTRDWPADALPYAAVYS 109
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGRA+TLGE + R E+QLKGAG+TPYSR DG AVLRSSIREFL
Sbjct: 110 GHQFGVWAGQLGDGRALTLGEAEH-DGARLEVQLKGAGRTPYSRMGDGRAVLRSSIREFL 168
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
CSEAMH LGIPTTRAL ++ + V R++ E AIV RV+ SF+RFG ++
Sbjct: 169 CSEAMHHLGIPTTRALTVIGSDLPVRREIV-------ETAAIVTRVSPSFVRFGHFEHFY 221
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
S + +D ++TLAD+ I + H + + + Y A E
Sbjct: 222 S--NDRIDELKTLADHVIDRFYPHCRDAD---------------------DPYLALLDEA 258
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
TA L+A+WQGVGF HGV+NTDNMSILGLTIDYGPFGF+DAF+ N +D G RY
Sbjct: 259 VRSTADLMAEWQGVGFCHGVMNTDNMSILGLTIDYGPFGFMDAFNAHHVCNHSDTQG-RY 317
Query: 432 CFANQPDIGLWN---IAQFSTTLAAAKLIDD-------KEANYVMERYGTKFMDEYQAIM 481
+ QP + WN +AQ L A L ++ +EA VMERY +F A M
Sbjct: 318 SYGRQPQVAYWNLFCLAQALVPLFGANLPEEGRAERVVEEAQKVMERYKDRFGPALVAKM 377
Query: 482 TKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDELLVPLKAV 537
KLGL + + ++ + L M ++ D+T FR LS + K+D S P++ +
Sbjct: 378 RAKLGLDIEREGDDKLANGLFEIMHANRADFTLTFRNLSKLSKSDASRD-----APVRDL 432
Query: 538 LLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
LD + A+ +W Y + L D ER A MN VNPKYVLRN+L ++AI A
Sbjct: 433 FLD-----RAAFDAWAAQYRERLAHEPRDDAERAAAMNRVNPKYVLRNHLAENAIRRAAE 487
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF EV RLL ++ PYDEQP E YA LPP WA +SCSS
Sbjct: 488 KDFSEVARLLDVLRHPYDEQPEYEAYAGLPPDWA---SDLEVSCSS 530
>gi|321463811|gb|EFX74824.1| hypothetical protein DAPPUDRAFT_306992 [Daphnia pulex]
Length = 517
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 224/539 (41%), Positives = 307/539 (56%), Gaps = 47/539 (8%)
Query: 114 ELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADS-LELDPKEFERPDFPL 172
+ P DP ++ R V ++ +P+ QLV+ S V ++ L+L+P E P F
Sbjct: 17 QFPIDPIKENYIRRVPGCVFSHATPTPLKTQLQLVSASHDVLENILDLNPIEEANPVFAK 76
Query: 173 FFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTP 232
F +G L G+V A YGG+QFG WA QLGDGRAITLGE +N K RWELQLKGAGKTP
Sbjct: 77 FIAGNQLLPGSVTIAHRYGGYQFGYWADQLGDGRAITLGEYVNSKGNRWELQLKGAGKTP 136
Query: 233 YSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGA 292
YSR DG AVLRSSIRE+LCSEAMH LGIPT+RA +V + V RD FY+G K EP A
Sbjct: 137 YSRNGDGRAVLRSSIREYLCSEAMHALGIPTSRAAAIVVSKDMVVRDQFYNGRMKYEPTA 196
Query: 293 IVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDE 352
+V R+A ++ R GS +I ++++ ++ + D+ I HH I N
Sbjct: 197 VVLRLAPTWFRIGSLEILTR--EKEIKNLKQVVDFTIEHHMPTIPQGN------------ 242
Query: 353 DHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFL 412
Y + V E++A+LV+ W GFTHGVLNTDNMS+L +TIDYGPFGFL
Sbjct: 243 -----------YLKFLETVLEQSAALVSLWMAHGFTHGVLNTDNMSLLSITIDYGPFGFL 291
Query: 413 DAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVMERYGT 471
D+++PSF PN +D G RY + NQP I WN+A+ + L ++ KEA + R+
Sbjct: 292 DSYNPSFVPNHSDDEG-RYSYLNQPKIFKWNMARLADALQPLLSAEEQKEAAATIGRFDE 350
Query: 472 KFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPED 528
+ ++ +I +KLGL K K +++ LL+ M + D+T FR L + D +I
Sbjct: 351 IYQQQFISIFRRKLGLSKAAKDEDKLVQLLLDMMQQRRADFTQTFRQLGAIHLD-NIELG 409
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
E L ++ +IS +QE +GISDEER +MN VNP+YVL N++
Sbjct: 410 EEHWALHSI---TTHPSFSEFISLYQKIVQE---TGISDEERCRVMNGVNPRYVLHNWMA 463
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK--YARLPPAWAYRPGVCML--SCSS 643
++AI AE DF L K++ +PYD+ E ++ PP WA C L SCSS
Sbjct: 464 EAAIRQAEKDDFHLTHLLSKVLSKPYDKDDEAESLGFSNPPPDWA-----CSLRVSCSS 517
>gi|239815911|ref|YP_002944821.1| hypothetical protein Vapar_2935 [Variovorax paradoxus S110]
gi|259646924|sp|C5CNS8.1|Y2935_VARPS RecName: Full=UPF0061 protein Vapar_2935
gi|239802488|gb|ACS19555.1| protein of unknown function UPF0061 [Variovorax paradoxus S110]
Length = 494
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 231/518 (44%), Positives = 302/518 (58%), Gaps = 55/518 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF-FSGATPLAGAVPYAQC 189
A T++ P+ + P V SE+ A L L P ++ + + L +G P+AG +P+A
Sbjct: 27 AFLTELRPTPLPDPPYWVGHSEAAARLLGL-PADWRQSEGTLAALTGNLPVAGTLPFATV 85
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG+WAGQLGDGRAI LGE E+QLKGAG+TPYSR ADG AVLRSSIRE
Sbjct: 86 YSGHQFGVWAGQLGDGRAIMLGET----EGGLEVQLKGAGRTPYSRGADGRAVLRSSIRE 141
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
FLCSEAMH LGIPTTRALC+ + V R+M E A+V RVA SF+RFG ++
Sbjct: 142 FLCSEAMHGLGIPTTRALCVTGSDARVYREM-------PETAAVVTRVAPSFIRFGHFE- 193
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
H S Q D ++ R LADY I ++ + ++ N YAA+
Sbjct: 194 HFSASQRDAEL-RALADYVIDRYYPDCRSTSR-----------------FNGNAYAAFLE 235
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
V+ERTA+L+AQWQ VGF HGV+NTDNMSILGLTIDYGPF FLD FDP N +D G
Sbjct: 236 AVSERTAALLAQWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDGFDPRHICNHSDTSG- 294
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F QP++ WN+ F A LI D+E A +E Y T F E+++ M KLGL
Sbjct: 295 RYAFNQQPNVAYWNL--FCLAQALLPLIGDQEIAVAALESYKTVFPREFESRMRAKLGLA 352
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ ++ +I +L MA +KVDYT F+R LS A + P++ + LD
Sbjct: 353 EPAEGDRALIEGVLKLMAAEKVDYTIFWRRLSQHMAGGNAE------PVRDLFLD----- 401
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+ + +W+LS+ + + + + LM NPKYVLRN+L Q AI+AA DF V
Sbjct: 402 RAGFDAWLLSFSER--HAQLPRAQAADLMLRSNPKYVLRNHLGQQAIEAASQKDFSAVAT 459
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LL L+E P++E PG + YA PP WA +SCSS
Sbjct: 460 LLALLETPFEEHPGADAYAGFPPDWA---STIEISCSS 494
>gi|392950468|ref|ZP_10316023.1| hypothetical protein WQQ_00950 [Hydrocarboniphaga effusa AP103]
gi|392950655|ref|ZP_10316210.1| hypothetical protein WQQ_02820 [Hydrocarboniphaga effusa AP103]
gi|391859430|gb|EIT69958.1| hypothetical protein WQQ_00950 [Hydrocarboniphaga effusa AP103]
gi|391859617|gb|EIT70145.1| hypothetical protein WQQ_02820 [Hydrocarboniphaga effusa AP103]
Length = 498
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/514 (42%), Positives = 296/514 (57%), Gaps = 56/514 (10%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFER-PDFPLFFSGATPLAGAVPYAQCYGGHQFG 196
P +EV +L+ + +A L LD R PDF +G + G A Y GHQFG
Sbjct: 33 PLSEV---RLLHLNAQLAGQLGLDAGAAARDPDFVAAMAGNRKIVGGAYVASVYAGHQFG 89
Query: 197 MWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAM 256
QLGDGRA +GE+L E++ELQLKG+G+TP+SRFADG AVLRSSIRE+LCSEAM
Sbjct: 90 TLVPQLGDGRANLIGEVLTPSGEQFELQLKGSGQTPFSRFADGRAVLRSSIREYLCSEAM 149
Query: 257 HFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQE 316
H LGIPTTRAL LV V R+ F E A+VCRVA SF+RFG ++ R +
Sbjct: 150 HALGIPTTRALSLVGASDPVQRERF-------ERAAVVCRVAPSFVRFGHFEYFYFRNRH 202
Query: 317 DLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTA 376
+ +R LAD+ I H+ H+ + +YAAW E+ +RTA
Sbjct: 203 EE--IRQLADHVIEAHYPHLAGFPE---------------------RYAAWLSEIVQRTA 239
Query: 377 SLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQ 436
L+AQWQ VGF HGV+NTDNMS+LGLTIDYGP+GFLD FD N +D G RY + Q
Sbjct: 240 RLMAQWQSVGFCHGVMNTDNMSVLGLTIDYGPYGFLDGFDAHHICNHSD-EGGRYAYDRQ 298
Query: 437 PDIGLWNIAQ-FSTTLAAAKLIDDKE---ANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
P IG WN ++ TL D+ AN ++ RY +M++ ++ +KLGL +
Sbjct: 299 PVIGQWNCSKLLQATLPLLHEDPDQSVEIANAILTRYPADYMNQMMSLWRRKLGLVSEQE 358
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
++++I++ LN + K D+T FRALSN++ P ++ LLD + A+
Sbjct: 359 EDRELINRFLNLLDKGKSDFTRTFRALSNLRDGDDKP------AMRDELLD-----QAAF 407
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
+W+ Y L G + ER+ M +VNPKYVLRN+L Q+AI+ AE D E+ RL ++
Sbjct: 408 DAWLPDYRARLAQDGQPEAERQQAMRAVNPKYVLRNHLAQAAIEKAEASDASEIDRLFRV 467
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++RPYDEQP + YA PP A V SCSS
Sbjct: 468 LQRPYDEQPEFDAYAAEPPPEARHISV---SCSS 498
>gi|315139008|ref|NP_001186712.1| selenoprotein O [Taeniopygia guttata]
Length = 641
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 242/603 (40%), Positives = 320/603 (53%), Gaps = 104/603 (17%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ +R LP D +S PR V AC+ +V PS ++NP+LVA S L L+ E
Sbjct: 14 LRFDNLALRSLPVDASEESGPRAVPGACFARVRPSP-LQNPRLVAMSLPALALLGLEAPE 72
Query: 165 FERPDFP----LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
+ LFFSG LAGA P A CY GHQFG +AGQLGDG A+ LGE+L + ER
Sbjct: 73 ADPAAAEAEAALFFSGNRVLAGAEPAAHCYCGHQFGSFAGQLGDGAAMYLGEVLGPRGER 132
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
WE+QLKGAG TP+SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+ V RD+
Sbjct: 133 WEIQLKGAGITPFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGTCVTSDSKVVRDI 192
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTLADYAIRH 331
FYDGNPK E +V R+A +F+RFGS++I + R + DI + DY I
Sbjct: 193 FYDGNPKNERCTVVLRIASTFIRFGSFEIFKPPDEYTGRKGPSVNRNDIRIQMLDYVIST 252
Query: 332 HFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGV 391
+ I+ + D T + AA+ E+ +RTA LVA+WQ VGF HGV
Sbjct: 253 FYPEIQ----------------EAYSDNTVQRNAAFFKEITKRTARLVAEWQCVGFCHGV 296
Query: 392 LNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL 451
LNTDNMSI+GLTIDYGPFGF+D +DP N +D GR Y + QP+I WN+ + + L
Sbjct: 297 LNTDNMSIVGLTIDYGPFGFMDRYDPEHVCNGSDNTGR-YAYNKQPEICKWNLGKLAEAL 355
Query: 452 AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMA---- 503
++ + + E Y +F Y M KKLGL + + +++S+LL M
Sbjct: 356 VPELPLEISQP-ILEEEYDAEFEKHYLQKMRKKLGLIQLELEEDSKLVSELLETMHLTAG 414
Query: 504 ---------------VDKVDYTNFFRALSNVKA-------------DP-----------S 524
+D + +F L++ A DP S
Sbjct: 415 DFTNIFYLLSSFSVDIDHSKFEDFLEELTSQCASVEELKVVFKPQMDPRQLSMMLMLAQS 474
Query: 525 IPEDELLVPLKAVLL------------------DIGKERKEAWISWVLSY----IQELLS 562
P+ L+ KA + D+ K W W+ Y +E+ S
Sbjct: 475 NPQLFALIGTKANINKELERIEQFSKLQQLTADDVLSRNKRQWKEWLEKYRVRLQKEIES 534
Query: 563 SGISDE---ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPG 619
G +D ER +MNS NPKY+LRNY+ Q+AI+AAE GDF EVR +LKL+E PY E G
Sbjct: 535 VGNADTWNTERVKVMNSNNPKYILRNYIAQNAIEAAENGDFSEVRNVLKLLEHPYQEAEG 594
Query: 620 MEK 622
++
Sbjct: 595 FQE 597
>gi|365875841|ref|ZP_09415366.1| hypothetical protein EAAG1_06167 [Elizabethkingia anophelis Ag1]
gi|442587563|ref|ZP_21006379.1| hypothetical protein D505_07018 [Elizabethkingia anophelis R26]
gi|365756353|gb|EHM98267.1| hypothetical protein EAAG1_06167 [Elizabethkingia anophelis Ag1]
gi|442562734|gb|ELR79953.1| hypothetical protein D505_07018 [Elizabethkingia anophelis R26]
Length = 512
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 222/541 (41%), Positives = 304/541 (56%), Gaps = 47/541 (8%)
Query: 111 FVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
F PGD ++ PR+ Y V E P+L+ ++E + L + D
Sbjct: 11 FKETFPGDNTYNNYPRQTPGVLYALVE-LMEFPKPELILFNEELGKELMISK------DN 63
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
FFSG G YA Y GHQFG WAGQLGDGRAI +GE+ +L + ELQ KGAG
Sbjct: 64 IGFFSGQILPEGIETYATAYAGHQFGNWAGQLGDGRAINIGEVESLSGKNIELQYKGAGS 123
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TP+SR ADG AV RSS+RE+L SEAM+ LG+ TTRAL LV TG+ V RDMFY+G+P+ E
Sbjct: 124 TPFSRNADGRAVFRSSLREYLMSEAMYHLGVSTTRALSLVKTGENVIRDMFYNGHPEAEN 183
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
GA++ R A+SF+RFG +++ A+R ++ + ++ L D+ I +F I+ G
Sbjct: 184 GAVIIRTAESFIRFGHFELLAAR--QETETLKQLMDWVIERYFPEIK------------G 229
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
D D + KY W EVA+RTA + W VGF HGV+NTDNMSILGLTIDYGPF
Sbjct: 230 DAD-------TEKYLNWFREVAQRTADTIVDWFRVGFVHGVMNTDNMSILGLTIDYGPFS 282
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERY 469
LD + +FTPNTTDLPGRRY F Q +I WN+ Q + A +I+D+E ++ +
Sbjct: 283 MLDEYSLNFTPNTTDLPGRRYAFGKQANIAHWNLFQLAN--AIFPVINDQEGLEEILNDF 340
Query: 470 GTKFMDEYQAIMTKKLGLP--KYNKQII----SKLLNNMAVDKVDYTNFFRALSNVKADP 523
F EY +M +KLGL K + Q + KL++ + K+DYT FF L A
Sbjct: 341 SKYFWTEYDKMMAEKLGLDAVKESDQALLLEWQKLMDEL---KLDYTLFFSLLEKTDAQT 397
Query: 524 SIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVL 583
++ +L + + + + +V YI + IS EE M NPK++L
Sbjct: 398 NV----ILHFEPCFYYGLTQFQAQQLEGFVQHYIDRKAQNTISAEESLQKMQRTNPKFIL 453
Query: 584 RNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY-RPGVCMLSCS 642
RNYL I+ + GDF + +LLK +E PY+E +++ P WA +PG LSCS
Sbjct: 454 RNYLLFQCIEETDNGDFTLLNKLLKALENPYEEL--YPEFSVKRPDWAGDQPGCSTLSCS 511
Query: 643 S 643
S
Sbjct: 512 S 512
>gi|354597105|ref|ZP_09015122.1| UPF0061 protein ydiU [Brenneria sp. EniD312]
gi|353675040|gb|EHD21073.1| UPF0061 protein ydiU [Brenneria sp. EniD312]
Length = 483
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/532 (41%), Positives = 296/532 (55%), Gaps = 52/532 (9%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFF 174
+P P + + L YT++ P+ ++ +L+ +S +AD L L + F R + +
Sbjct: 1 MPQKPSFINHYHQQLPGFYTELQPTP-LQGARLLYYSRGLADELGLSAQWFTR-QYDAVW 58
Query: 175 SGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
G L G P AQ Y GHQFGMWAGQLGDGR I LGE + LKGAG TPYS
Sbjct: 59 RGEALLPGMKPLAQAYSGHQFGMWAGQLGDGRGILLGEQQLADGRSMDWHLKGAGLTPYS 118
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRS IREFL SEAMH LGIPTTRAL +VT+ + + R+ +EEPGA++
Sbjct: 119 RMGDGRAVLRSVIREFLASEAMHHLGIPTTRALTIVTSEQAIARE-------REEPGAML 171
Query: 295 CRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RVA+S +RFG ++ R + + VR LAD+ I H+ +
Sbjct: 172 LRVAESHVRFGHFEHFYYR--REGERVRQLADFVIARHWPQWRD---------------- 213
Query: 355 SVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDA 414
+YA W +V ERTA L+A WQ VGF HGVLNTDNMSILGLTIDYGPFGFLD
Sbjct: 214 -----DPRRYALWLGDVVERTARLIAHWQSVGFAHGVLNTDNMSILGLTIDYGPFGFLDD 268
Query: 415 FDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFM 474
+ P + N +D G RY F NQP +GLWN+ + + +L+ L+D +E + RY M
Sbjct: 269 YQPDYICNHSDHQG-RYAFDNQPAVGLWNLHRLAQSLSG--LMDTEELETALARYEPALM 325
Query: 475 DEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELL 531
+Y +M KLGL + + I+ +LL M ++ DYT FR L++ + + + L
Sbjct: 326 QKYGELMRAKLGLFTADAEDNAILVELLRLMRQERRDYTRTFRLLADGE------KSDAL 379
Query: 532 VPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSA 591
PL+ +D + A+ W +Y + L D ER+ M NP Y+LRNYL Q A
Sbjct: 380 SPLRDEFID-----RPAFDRWFAAYRKRLAQEPQHDAERRQRMKGANPNYILRNYLAQQA 434
Query: 592 IDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
I+ AE D + RL + + RPY+EQP M+ A LPP W +SCSS
Sbjct: 435 IERAEKEDISVLARLHQALCRPYEEQPEMDDLAALPPEWGKH---LEISCSS 483
>gi|407939383|ref|YP_006855024.1| hypothetical protein C380_13425 [Acidovorax sp. KKS102]
gi|407897177|gb|AFU46386.1| hypothetical protein C380_13425 [Acidovorax sp. KKS102]
Length = 493
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 232/543 (42%), Positives = 305/543 (56%), Gaps = 68/543 (12%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L WDH F P +T++ P+ + +P V S +VA L LD
Sbjct: 15 LAWDHRFAALGPD--------------FFTELRPT-PLPSPHWVGTSPAVAQLLGLDEAA 59
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+ F+G LAG+ P A Y GHQFG+WAGQLGDGRAI LGE + WE+Q
Sbjct: 60 LHSDEALQAFTGNRLLAGSRPLASVYSGHQFGVWAGQLGDGRAILLGE----TASGWEVQ 115
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR DG AVLRSSIREFLCSEAMH LG+PT+RALC+ + V R+
Sbjct: 116 LKGAGRTPYSRMGDGRAVLRSSIREFLCSEAMHGLGVPTSRALCITGSPGPVRRE----- 170
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+ E A+V RVA+SF+RFG ++ A+ GQED ++TLADY I ++ +
Sbjct: 171 --EIETAAVVTRVARSFVRFGHFEHFAANGQED--ALQTLADYVIDRYYPECRD------ 220
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
TG + N YAA V+ERTA L+AQWQ VGF HGV+NTDNMSILGLTI
Sbjct: 221 ---GTG--------MAGNPYAALLQAVSERTARLMAQWQAVGFCHGVMNTDNMSILGLTI 269
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-AN 463
DYGPF FLDAF P N +D G RY + QP++ WN+ F A LI D++ A
Sbjct: 270 DYGPFQFLDAFVPGHVCNHSDSQG-RYAYNRQPNVAYWNL--FCLAQALLPLIGDQDLAK 326
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
+E Y T F + + A M KLGL + + +I +L +A + VDY F+R LS+
Sbjct: 327 QALESYKTVFPESFMAQMRAKLGLVEASDGDGALIDGILLLLAQNGVDYPIFWRRLSHAV 386
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
+ P++ + D + W+L Y + S + + LM NPK
Sbjct: 387 GTQDME------PVRDLFAD-----RAGCDQWLLLYSEH--SRHMDVAHQADLMLKTNPK 433
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLS 640
+VLRN+L + AI AA+LGDFGE++ L +L+ERP+DE PG + YA PP WA +S
Sbjct: 434 FVLRNHLGEQAIRAAKLGDFGELQTLQRLLERPFDEHPGHDAYAAFPPDWA---SSIEIS 490
Query: 641 CSS 643
CSS
Sbjct: 491 CSS 493
>gi|260794897|ref|XP_002592443.1| hypothetical protein BRAFLDRAFT_113831 [Branchiostoma floridae]
gi|229277663|gb|EEN48454.1| hypothetical protein BRAFLDRAFT_113831 [Branchiostoma floridae]
Length = 454
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 208/481 (43%), Positives = 289/481 (60%), Gaps = 35/481 (7%)
Query: 170 FPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAG 229
F F SG T L G+ P + YGGHQF W+GQLGDGRAI LGE +N + ERWELQLKG+G
Sbjct: 2 FQAFVSGNTILYGSTPLSHRYGGHQFASWSGQLGDGRAIMLGEYVNRRGERWELQLKGSG 61
Query: 230 KTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEE 289
TPYSR DG AVLRSS+REFLCSEAM+ LGIPT+RA L+ + V RD FY+G+PK+E
Sbjct: 62 LTPYSRRGDGRAVLRSSVREFLCSEAMYHLGIPTSRAATLIVSDDPVIRDQFYNGHPKKE 121
Query: 290 PGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFST 349
GA+V R+A+S+ R GS +I A+ ++ +++ L D+ I+ +F I + S
Sbjct: 122 RGAVVLRLAKSWFRIGSLEILAA--NQETQLLKQLVDFTIQQYFTDIYE-------TLSE 172
Query: 350 GDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPF 409
GD +Y + +V +TA ++A WQ VGF HGV NTDN S+L +TIDYGPF
Sbjct: 173 GD-----------RYLTFFSDVVSQTAEMIALWQSVGFAHGVCNTDNFSLLSITIDYGPF 221
Query: 410 GFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK-EANYVMER 468
GF+D++DP F PNT+D G Y + NQPD+GL+N+ + LA+ + + ++E
Sbjct: 222 GFMDSYDPEFVPNTSDDTG-MYSYENQPDVGLFNLDKLREALASLLTEQQRFQMTKILEL 280
Query: 469 YGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
Y + +Y I+ +K+G+ + + I + L MA K D+T FR LS + +
Sbjct: 281 YPDIYKTKYMEILRRKMGMLGEEEDDAMIAAVLFKMMADTKADFTMTFRQLSELSLEQM- 339
Query: 526 PEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI-SDEERKALMNSVNPKYVLR 584
E+ + P + + + E + W+ Y Q L SD ERKA M++ NP+YVLR
Sbjct: 340 -ENAAIPPHLWAIRTL--QPHEYFTRWLQVYTQRLKHHNKDSDVERKARMDTTNPQYVLR 396
Query: 585 NYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK--YARLPPAWAYRPGVCMLSCS 642
N++ +SAI AE DF EV+ LLK+++ PY +Q EK Y PP WA V SCS
Sbjct: 397 NWMAESAIKKAEKDDFSEVKLLLKVLQNPYVKQEEAEKQGYGSPPPEWAKELRV---SCS 453
Query: 643 S 643
S
Sbjct: 454 S 454
>gi|196009079|ref|XP_002114405.1| hypothetical protein TRIADDRAFT_58177 [Trichoplax adhaerens]
gi|190583424|gb|EDV23495.1| hypothetical protein TRIADDRAFT_58177 [Trichoplax adhaerens]
Length = 609
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 219/552 (39%), Positives = 305/552 (55%), Gaps = 44/552 (7%)
Query: 95 MTKKLKALEDLNWDHS----FVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAW 150
+ K L+ L NW S LP + + R+V +A ++ P+ + P+LVA
Sbjct: 50 INKPLQTLR--NWQFSKHNLLYHHLPIEAEKRNFVRQVKNAIFSTCYPTPLSQPPKLVAA 107
Query: 151 SESVADS---LELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRA 207
S+ V ++ L+ + F FF+G G+ P + YGGHQFG WAGQLGDGRA
Sbjct: 108 SKEVLENALDLKYSDSLIQSKYFLDFFAGQVLPNGSTPISHRYGGHQFGHWAGQLGDGRA 167
Query: 208 ITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRAL 267
+ LGE ++ + RW LQLKG+GKTPYSR DG AVLRSSIRE+L SEAM+ LGIPTTRA
Sbjct: 168 VMLGEYISNEGIRWALQLKGSGKTPYSRDGDGRAVLRSSIREYLVSEAMYHLGIPTTRAA 227
Query: 268 CLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADY 327
+VT+ + + RD FYDG+P+ E IV R+A S+ RFGS +I ++ ++ L D
Sbjct: 228 SIVTSDEPIWRDQFYDGHPRAEKAGIVLRLAPSWFRFGSIEI--LHYNQEFHLLNRLVDV 285
Query: 328 AIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGF 387
I H+ H+ + N+ KY + E+ TASL+AQWQ VGF
Sbjct: 286 IINLHYPHLSDDNR---------------------KYIKFYAEIINTTASLIAQWQSVGF 324
Query: 388 THGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF 447
THGV NTDN SIL LTIDYGPFGFLD ++ F NT+D G RY F QP++ +N+ +
Sbjct: 325 THGVCNTDNFSILSLTIDYGPFGFLDEYNDDFISNTSDDDG-RYRFRFQPNVAYFNLDKL 383
Query: 448 STTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAV 504
L++ LI + + + Y + Y IM KKLGL NK ++I+++L M
Sbjct: 384 RIALSS--LISEVDGQKELSNYKRIYRRHYLHIMRKKLGLKGSNKKDTKLITQMLKMMKN 441
Query: 505 DKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG 564
K D+T FR LS + SI ++ R W W+ +Y++ L +
Sbjct: 442 QKADFTMTFRELSEIDIQ-SINNGFQSENIQKSWSLSKVMRDNEWPKWIQNYLERLNVTN 500
Query: 565 ---ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGME 621
D++R+ M VNP+Y+LRNY+ Q AI+ A +GDF EVR L + P+ +Q E
Sbjct: 501 WKLYDDQDRQLRMQEVNPRYILRNYMAQIAINKANIGDFSEVRNLQNTLLNPFSKQRNAE 560
Query: 622 K--YARLPPAWA 631
+ YA PP WA
Sbjct: 561 RLGYAAPPPVWA 572
>gi|427404636|ref|ZP_18895376.1| UPF0061 protein [Massilia timonae CCUG 45783]
gi|425716807|gb|EKU79776.1| UPF0061 protein [Massilia timonae CCUG 45783]
Length = 464
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 224/505 (44%), Positives = 288/505 (57%), Gaps = 52/505 (10%)
Query: 144 NPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLG 203
+P +A S A + LD + RPDF F+G A + P + Y GHQFG+WAGQLG
Sbjct: 7 SPHFIAASSPAAALIGLDAADLARPDFVDVFTGNKVAARSQPLSAVYSGHQFGVWAGQLG 66
Query: 204 DGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPT 263
DGRAITLG+I ELQLKGAG+TPYSR DG AVLRSSIREFLCSEAM LGIPT
Sbjct: 67 DGRAITLGDIATPNGP-MELQLKGAGRTPYSRMGDGRAVLRSSIREFLCSEAMAALGIPT 125
Query: 264 TRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRT 323
TRAL + + + V R+ E A+V R+A +F+RFGS++ ASRG+E ++T
Sbjct: 126 TRALMVTGSPQQVARETM-------ESTAVVTRMAPTFVRFGSFEHWASRGREAE--LKT 176
Query: 324 LADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQ 383
LADY IR + E L +N Y EV RTA ++A WQ
Sbjct: 177 LADYVIRQFY--------PEFLG-------------AANPYKELLAEVTRRTARMIAHWQ 215
Query: 384 GVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWN 443
VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N TD G RY +ANQ IG WN
Sbjct: 216 AVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDAKHICNHTD-QGGRYSYANQVPIGHWN 274
Query: 444 IAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNM 502
L LI + E A ++ Y +F + ++ KLGL K + + L +NM
Sbjct: 275 CYALGNALL--PLIGEPEVAEEALDVYRPEFGRQLDTLLHAKLGL-KETRDGDAALFDNM 331
Query: 503 AV----DKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ 558
+ D+T FFR L +K + ++ PL+ + +D + A+ +W Y
Sbjct: 332 FTLLQDNHADFTLFFRRLGELKLEEPAADE----PLRDLFID-----RAAFDAWAGEYRA 382
Query: 559 ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQP 618
L G SD R+ M+ VNPKY+LRNYL Q AI+ A+ GDFG V +LL ++ERP+DEQP
Sbjct: 383 RLRQEGSSDAARREAMHGVNPKYILRNYLAQIAIEQAQNGDFGGVHKLLAVLERPFDEQP 442
Query: 619 GMEKYARLPPAWAYRPGVCMLSCSS 643
YA LPP WA +SCSS
Sbjct: 443 ENASYAALPPDWAAH---LEVSCSS 464
>gi|443723409|gb|ELU11840.1| hypothetical protein CAPTEDRAFT_95444 [Capitella teleta]
Length = 582
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 234/592 (39%), Positives = 318/592 (53%), Gaps = 89/592 (15%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ AL +L +D+S +R LP DP PR+V AC++KV+P+ VENPQLV+ + L
Sbjct: 1 MTALNNLTFDNSVLRSLPIDPEEKVFPRQVKGACFSKVTPTP-VENPQLVSAALPALQLL 59
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + E DF +FSG L G+ A CY GHQFG +AGQLGDG AI LGEI+N +
Sbjct: 60 DLGEDDIEHKDFTEYFSGNKLLKGSETAAHCYCGHQFGHFAGQLGDGAAIYLGEIINKRG 119
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERWELQ+KGAG TPYSR ADG VLRSSIREFLCSEAMH LGIPTTRA VT+ +V R
Sbjct: 120 ERWELQVKGAGLTPYSRQADGRKVLRSSIREFLCSEAMHHLGIPTTRAATCVTSDSYVVR 179
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQED---------LDIVRTLADYAI 329
D+FY GNP E IV R+A SFLRFGS+QI +E D++ L ++ I
Sbjct: 180 DVFYSGNPVNERCTIVSRIAPSFLRFGSFQICKPPDRETGREGPSVCLPDVLSKLTNFTI 239
Query: 330 RHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTH 389
+F I M+ + D++ ++ + + EV RTA LVA+WQ +GF H
Sbjct: 240 EKYFPEIWEMH--------SNDKETAI--------SEFFKEVVLRTARLVAEWQCIGFCH 283
Query: 390 GVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDI---------- 439
GVLNTDNMSILGL+IDYGPFGF+D FD F N +D GR Y + QP+I
Sbjct: 284 GVLNTDNMSILGLSIDYGPFGFMDRFDEDFICNGSDDRGR-YTYKKQPEICKWNCQKLCD 342
Query: 440 GLWNIAQFSTTLAAAKLIDDKEANYVMERYGTK---------FMDEYQ---AIMTKKL-- 485
L + L + +L D + ME+ K F+D Q A T
Sbjct: 343 ALMELIPLEKLLPSVELFDVEYQRCYMEKMRKKVGDRDLVASFLDTMQKTGADFTNCFRL 402
Query: 486 --GLPKYNKQIISKLLNNMAVD----------KVDYTNFFRALSNVKADPSIPEDELLVP 533
G+ N + I + L + ++D L+ + +P + ++ +
Sbjct: 403 LSGVRDDNTETILEELMKQSCSIEELRAANQPRMDVRQLQMLLTLAETNPGLL-GQMGMA 461
Query: 534 LKAVLLDIG-----------------KERKEAWISWVLSYIQELLSSG---ISDEE---- 569
+ ++ ++ K+ + W W+L Y L +S E+
Sbjct: 462 ARGLMQELSRLEKLKELKEKTEDWKRKQDQTMWSQWILKYQDRLKRESDPSLSQEDIRLK 521
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPY-DEQPGM 620
R +MNS NPK+VLRNY+ Q+AI+AAE GDF EV R+L L++ P+ D GM
Sbjct: 522 RTQVMNSNNPKFVLRNYMAQNAIEAAEKGDFSEVNRVLSLLQNPFIDLDNGM 573
>gi|335423984|ref|ZP_08553002.1| hypothetical protein SSPSH_14879 [Salinisphaera shabanensis E1L3A]
gi|334890735|gb|EGM28997.1| hypothetical protein SSPSH_14879 [Salinisphaera shabanensis E1L3A]
Length = 505
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 217/515 (42%), Positives = 295/515 (57%), Gaps = 50/515 (9%)
Query: 137 SPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFG 196
+PSA + P + +++ VA L+LD + + SG P A YGGHQFG
Sbjct: 33 TPSA-LPAPYPIVFNDDVAALLDLDTEAVRHAGYAHVLSGNDLPDACHPVAHRYGGHQFG 91
Query: 197 MWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAM 256
+WAGQLGDGRAIT+G+I N + + +E+QLKGAGKTP+SRFADG AVLRS +RE+L SEA+
Sbjct: 92 VWAGQLGDGRAITIGDIRNARGQAYEIQLKGAGKTPFSRFADGRAVLRSVVREYLGSEAL 151
Query: 257 HFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQE 316
LGIPTTRAL +V + V R+ E A++ R+A S +RFGS++I Q
Sbjct: 152 AALGIPTTRALAIVGSDAPVYRETV-------EHAAVMTRIAPSLVRFGSFEILFENRQ- 203
Query: 317 DLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTA 376
D + LAD+ I HF I + ++ + +Y AW V + TA
Sbjct: 204 -FDALAPLADHVIGEHFPRI------------------AAIEGANTRYRAWGERVIDLTA 244
Query: 377 SLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQ 436
SL+A WQ VGF HGV+NTDNMS+LGLT+DYGP+GF+D+FDP + N TD G RY + Q
Sbjct: 245 SLIADWQAVGFCHGVMNTDNMSVLGLTLDYGPYGFMDSFDPHWICNHTDAGG-RYAYDQQ 303
Query: 437 PDIGLWNIAQFSTTLAAAKLIDDKE-----ANYVMERYGTKFMDEYQAIMTKKLGLPKYN 491
P +GLWN+ +F + L DD + ++ERY F Y M KLGL +
Sbjct: 304 PHVGLWNLGRFVQAILPL-LSDDPDTAVEIGQGLLERYRRSFDAAYMQRMRAKLGLVDTH 362
Query: 492 ---KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ ++ LL MA D D+T FRAL +V ADP+ P +D ++A
Sbjct: 363 DDDRDLVDDLLKTMAADGADFTRTFRALGHVSADPAASN----APFVDEFVD-----RDA 413
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+W+ + + L+ + D R M NPKYVLRNYL Q+AID A+ GD+ E+ RL
Sbjct: 414 AGAWLARWRERLVDTAADDTARAERMRLTNPKYVLRNYLAQAAIDRADEGDYSEIERLHA 473
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ P+DEQP E YA+LPP WA +LSCSS
Sbjct: 474 ILRHPFDEQPEHEAYAKLPPDWARG---LVLSCSS 505
>gi|227111716|ref|ZP_03825372.1| hypothetical protein PcarbP_02067 [Pectobacterium carotovorum
subsp. brasiliensis PBR1692]
Length = 483
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 222/514 (43%), Positives = 286/514 (55%), Gaps = 52/514 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P+ + +L+ SE +A L L F P+ +SG L G P AQ Y G
Sbjct: 19 YTALQPTP-LHGARLLYHSEGLAAELGLSSDWFT-PEQDAVWSGERLLPGMEPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFGMWAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS+IREFL
Sbjct: 77 HQFGMWAGQLGDGRGILLGEQQLPDGRTMDWHLKGAGLTPYSRMGDGRAVLRSAIREFLA 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL +V + V R+ +EE GA++ RVA+S +RFG ++
Sbjct: 137 SEAMHHLGIPTTRALTIVASAHPVQRE-------QEEKGAMLLRVAESHVRFGHFEHFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + + VR LA+Y I H+ EN DE N+Y W +V
Sbjct: 190 R--REPEKVRQLAEYVIARHWPQWEN------------DE---------NRYELWFGDVV 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+ WQ VGF HGV+NTDNMSILGLTIDYGP+GFLDA+ P F N +D G RY
Sbjct: 227 ERTARLITHWQAVGFAHGVMNTDNMSILGLTIDYGPYGFLDAYQPGFICNHSDHRG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F NQP +GLWN+ + + L+ L+D + + RY M Y +M KLGL
Sbjct: 286 FDNQPAVGLWNLHRLAQALSG--LMDTETLERALARYEPALMQHYGTLMRAKLGLFTASA 343
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ ++ LL M + DYT+ FR L++ + S PL+ +D + A+
Sbjct: 344 EDNDVLVGLLRLMQQEGSDYTHTFRLLADSEKQASHS------PLRDEFID-----RTAF 392
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
SW +Y Q L+ DEER+ LMN+ NPKY+LRNYL Q AI+ AE D + RL +
Sbjct: 393 DSWFATYRQRLMQEEQGDEERRRLMNATNPKYILRNYLAQMAIERAENDDISVLARLHQT 452
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ +P+DEQP A LPP W +SCSS
Sbjct: 453 LCQPFDEQPEKNDLAALPPEWGKH---LEISCSS 483
>gi|108762089|ref|YP_629124.1| hypothetical protein MXAN_0863 [Myxococcus xanthus DK 1622]
gi|121957918|sp|Q1DDZ9.1|Y863_MYXXD RecName: Full=UPF0061 protein MXAN_0863
gi|108465969|gb|ABF91154.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
Length = 488
Score = 369 bits (946), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 226/552 (40%), Positives = 294/552 (53%), Gaps = 71/552 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LE L +D+++ R LP +V PS + +LV+ + + L
Sbjct: 1 MATLEQLRFDNTYAR-LPA-------------GFGARVHPS-PFPDAKLVSVNPAALKLL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P+E +RP+F GA PL G P+A Y GHQFG++ +LGDGRA+ LGE+ +
Sbjct: 46 DLTPEEAQRPEFVAAMGGAKPLPGMEPFAMVYAGHQFGVYVPRLGDGRALLLGEVRDAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+W+L LKG G TP+SR DG AVLRS+IRE+LC EAMH LGIPTTR L ++ + V R
Sbjct: 106 AKWDLHLKGGGPTPFSRGGDGRAVLRSTIREYLCGEAMHGLGIPTTRGLGILGSQAPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E GA++ R+A S +RFG+++ E + V TLAD+ I HF +
Sbjct: 166 EAV-------ETGAMLVRMAPSHVRFGTFEFFHY--TEQTEHVATLADHVITEHFPQL-- 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
G E +YA + EV ERTA L+AQWQ VGF HGV+NTDNMS
Sbjct: 215 ----------AGQE---------GRYARFYTEVVERTARLIAQWQAVGFAHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILGLT+DYGPFGFLD F+P F N +D G RY F QP IGLWN+A L LI
Sbjct: 256 ILGLTLDYGPFGFLDDFEPGFICNHSDDRG-RYAFDQQPRIGLWNLACLGEAL--LTLIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
+ EA + Y + + M KLGL + +++++S L MA VDYT FFRA
Sbjct: 313 EDEARAALATYQPAYNAHFMDRMRAKLGLRETRDEDRELVSDLFARMAEAHVDYTRFFRA 372
Query: 516 LSNVK----ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
L + AD D P E + +W Y L + G D ER
Sbjct: 373 LGHFASADGADTRPVRDMFPAP-------------EGFDAWAGRYRARLAAEGSVDAERH 419
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
A M VNPKYVLRN++ Q AI AE GDF V RLL ++ P+ E P E YA PP W
Sbjct: 420 ARMTRVNPKYVLRNWVAQEAISRAEAGDFSLVDRLLGVLSDPFAEHPDAEPYAAAPPTWG 479
Query: 632 YRPGVCMLSCSS 643
V SCSS
Sbjct: 480 RHLAV---SCSS 488
>gi|227327012|ref|ZP_03831036.1| hypothetical protein PcarcW_06704 [Pectobacterium carotovorum
subsp. carotovorum WPP14]
Length = 483
Score = 369 bits (946), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 223/516 (43%), Positives = 289/516 (56%), Gaps = 56/516 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P+ + +L+ SE +A L L F P+ +SG L G P AQ Y G
Sbjct: 19 YTALQPTP-LHGARLLYHSEGLAAELGLSSDWFT-PEQDAVWSGERLLPGMAPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGE--ILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
HQFG+WAGQLGDGR I LGE + + +S W LKGAG TPYSR DG AVLRS+IREF
Sbjct: 77 HQFGVWAGQLGDGRGILLGEQQLADGRSVDW--HLKGAGLTPYSRMGDGRAVLRSAIREF 134
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH LGIPTTRAL +VT+ V R+ +EE GA++ RVA+S +RFG ++
Sbjct: 135 LASEAMHHLGIPTTRALTIVTSTHPVQRE-------QEEKGAMLLRVAESHVRFGHFEHF 187
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
R + + VR L +Y I H+ EN DE +Y W +
Sbjct: 188 YYRRES--EKVRQLVEYVIARHWPQWEN------------DE---------RRYELWFGD 224
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V ERTA L+ WQ VGF HGV+NTDNMSILGLTIDYGP+GFLDA+ P F N +D G R
Sbjct: 225 VVERTARLITHWQAVGFAHGVMNTDNMSILGLTIDYGPYGFLDAYQPDFICNHSDHRG-R 283
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL--- 487
Y F NQP +GLWN+ + + L+ L+D + + RY M Y +M KLGL
Sbjct: 284 YAFDNQPAVGLWNLHRLAQALSG--LMDTETLERALARYEPALMQHYGTLMRAKLGLFTA 341
Query: 488 PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ LL M + DYT FR L++ + S PL+ +D +
Sbjct: 342 SSEDNDVLVGLLRLMQQEGSDYTRTFRLLADSEKQASRS------PLRDEFID-----RA 390
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
A+ SW +Y Q L+ SDEER+ LMN+ NPKY+LRNYL Q AI+ AE D + RL
Sbjct: 391 AFDSWFATYRQRLMQEEQSDEERRRLMNATNPKYILRNYLAQMAIERAESDDISVLARLH 450
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + +P+DEQP A LPP W +SCSS
Sbjct: 451 QALCQPFDEQPEKNDLAALPPEWGKH---LEISCSS 483
>gi|377820677|ref|YP_004977048.1| hypothetical protein BYI23_A012330 [Burkholderia sp. YI23]
gi|357935512|gb|AET89071.1| hypothetical protein BYI23_A012330 [Burkholderia sp. YI23]
Length = 508
Score = 369 bits (946), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 233/526 (44%), Positives = 293/526 (55%), Gaps = 65/526 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELD---PKEFERPDFPLFFSGATP---LAGAVPYAQCYG 191
P+A VE+P LV S A+SL D E+ F +F+G A ++PYA Y
Sbjct: 28 PAAPVEDPYLVGLSRETAESLGFDSDVATGAEKHAFAAYFAGNPTRDWAADSLPYAAVYS 87
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGRA+TLGE+ ER E+QLKGAG+TPYSR DG AVLRSSIREFL
Sbjct: 88 GHQFGVWAGQLGDGRALTLGEVAR-DGERLEVQLKGAGRTPYSRMGDGRAVLRSSIREFL 146
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
CSEAMH LGIPTTRAL ++ V R+ E AIV RVA SF+RFG ++
Sbjct: 147 CSEAMHHLGIPTTRALAVIGADLPVRRETI-------ETAAIVTRVAPSFVRFGHFEHFY 199
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
S + +D +R LAD+ I + H N + Y A E
Sbjct: 200 S--NDRIDDLRKLADHVIDRFYPHCRN---------------------AEDPYLALLDEA 236
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
TA L+AQWQGVGF HGV+NTDNMSILGLTIDYGPFGF+DAF+ N +D G RY
Sbjct: 237 VRTTADLMAQWQGVGFCHGVMNTDNMSILGLTIDYGPFGFMDAFNAHHVCNHSDTQG-RY 295
Query: 432 CFANQPDIGLWN---IAQFSTTLAAAKLIDD-------KEANYVMERYGTKFMDEYQAIM 481
+ QP + WN +AQ L A L ++ +EA V+ERY +F A M
Sbjct: 296 SYGRQPQVAYWNLFCLAQALVPLFGANLPEEGRAERVVEEAQKVLERYKERFGPALVATM 355
Query: 482 TKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDELLVPLKAV 537
KLGL + + ++ + L M ++ D+T FR LS + K+D S P + +
Sbjct: 356 RAKLGLATELEGDDKLANGLFEIMHANRADFTLTFRNLSKLSKSDASGD-----APARDL 410
Query: 538 LLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
LD + A+ +W Y + L D R A MN VNPKYVLRN+L + AI A
Sbjct: 411 FLD-----RAAFDAWAALYRERLAHEPRDDAARAAAMNRVNPKYVLRNHLAEQAIRRANE 465
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF EV RLL ++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 466 KDFSEVARLLDVLRRPFDEQPENEAYAGLPPDWA---GALEVSCSS 508
>gi|403059011|ref|YP_006647228.1| hypothetical protein PCC21_025720 [Pectobacterium carotovorum
subsp. carotovorum PCC21]
gi|402806337|gb|AFR03975.1| hypothetical protein PCC21_025720 [Pectobacterium carotovorum
subsp. carotovorum PCC21]
Length = 483
Score = 368 bits (944), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 222/514 (43%), Positives = 285/514 (55%), Gaps = 52/514 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P+ + +L+ SE +A L L F P+ +SG L G P AQ Y G
Sbjct: 19 YTALQPTP-LHGARLLYHSEGLAAELGLSSDWFT-PEQDAVWSGERLLPGMEPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFGMWAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS+IREFL
Sbjct: 77 HQFGMWAGQLGDGRGILLGEQQLPDGRSMDWHLKGAGLTPYSRMGDGRAVLRSAIREFLA 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL +VT+ V R+ +EE GA++ RVA+S +RFG ++
Sbjct: 137 SEAMHHLGIPTTRALTIVTSTHPVQRE-------QEEKGAMLLRVAESHVRFGHFEHFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + + VR LA+Y I H+ EN DE +Y W +V
Sbjct: 190 R--REPEKVRQLAEYVIARHWPQWEN------------DE---------RRYELWFGDVV 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+ WQ VGF HGV+NTDNMSILGLTIDYGP+GFLDA+ P F N +D G RY
Sbjct: 227 ERTARLITHWQAVGFAHGVMNTDNMSILGLTIDYGPYGFLDAYQPGFICNHSDHRG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F NQP +GLWN+ + + L+ L+D + + RY M Y +M KLGL
Sbjct: 286 FDNQPAVGLWNLHRLAQALSG--LMDTETLERALARYEPALMQHYGTLMRAKLGLFTASA 343
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ ++ LL M + DYT FR L++ + S PL+ +D + A+
Sbjct: 344 EDNDVLVGLLRLMQQEGSDYTRAFRLLADSEKQASHS------PLRDEFID-----RTAF 392
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
SW +Y Q L+ DEER+ LMN+ NPKY+LRNYL Q AI+ AE D + RL +
Sbjct: 393 DSWFATYRQRLMQEEQGDEERRRLMNATNPKYILRNYLAQMAIERAENDDISVLARLHQT 452
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ +P+DEQP A LPP W +SCSS
Sbjct: 453 LCQPFDEQPEKNDLAALPPEWGKH---LEISCSS 483
>gi|395007708|ref|ZP_10391421.1| hypothetical protein PMI14_04115 [Acidovorax sp. CF316]
gi|394314344|gb|EJE51274.1| hypothetical protein PMI14_04115 [Acidovorax sp. CF316]
Length = 495
Score = 367 bits (943), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 231/520 (44%), Positives = 300/520 (57%), Gaps = 59/520 (11%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPL-FFSGATPLAGAVPYAQC 189
A +T++ P+ + +P V S SVA L LD + + R D L F+G L G+ P A
Sbjct: 28 AFFTELQPT-PLPSPHWVGTSASVARLLGLD-EAWLRSDAALQAFAGNALLPGSRPLASV 85
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG+WAGQLGDGRAI LGE + E+QLKGAG+TPYSR DG AVLRSSIRE
Sbjct: 86 YSGHQFGIWAGQLGDGRAILLGETVGGH----EIQLKGAGRTPYSRMGDGRAVLRSSIRE 141
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
FLCSEAM LG+PTTRALC+ + V R+ + E A+V RVA SF+RFG ++
Sbjct: 142 FLCSEAMQGLGVPTTRALCITGSPAPVRRE-------EVETAAVVARVAPSFVRFGHFE- 193
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
H S D D ++ LADY I ++ + +L N YAA
Sbjct: 194 HFSANDMD-DELQALADYVIDRYYPDCRGRS-----------------ELAGNPYAALLQ 235
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
V+ERTA L+AQWQ VGF HGV+NTDNMSILGLTIDYGPF FLD+F P N +D G
Sbjct: 236 AVSERTAVLMAQWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDSFVPGHVCNHSDTQG- 294
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY + QP++ WN+ F A LI D+E A +E Y T F E+ A M KLGL
Sbjct: 295 RYAYNRQPNVAYWNV--FCLAQALLPLIGDQELAMAALESYKTVFPAEFMARMRDKLGLG 352
Query: 489 KY----NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ + ++I LL +A VDY F+R LS+ +VP +A
Sbjct: 353 ERAEEGDAELIDGLLVVLAKGGVDYPIFWRRLSHAVGSGEFEPVRGMVPDQA-------- 404
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKA-LMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW +W+ Y+ E ++D E+ + M + NPK+VLRN+LC+ AI AA+LGDF +
Sbjct: 405 ---AWDAWLAKYLAE---PRLADREKASRAMLATNPKFVLRNHLCEEAIRAAKLGDFSAL 458
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ L +L+ERP++E PG E YA PPAWA +SCSS
Sbjct: 459 QTLQRLLERPFEEHPGHESYAAFPPAWA---STIEISCSS 495
>gi|442317883|ref|YP_007357904.1| hypothetical protein MYSTI_00871 [Myxococcus stipitatus DSM 14675]
gi|441485525|gb|AGC42220.1| hypothetical protein MYSTI_00871 [Myxococcus stipitatus DSM 14675]
Length = 480
Score = 367 bits (943), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 222/549 (40%), Positives = 298/549 (54%), Gaps = 73/549 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LE L +D+++ R PG +V P A + N +LV+ + S L
Sbjct: 1 MSTLEQLRFDNTYARLPPG--------------FGARVEPRA-LSNTRLVSANPSALRLL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L P+E RP+F G PL G P+A Y GHQFG++ +LGDGRA+ LGE+
Sbjct: 46 GLTPEEARRPEFLEAMGGGRPLPGMEPFAMVYAGHQFGVYVPRLGDGRAMLLGEVRAPSG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E+W+L LKG G TP+SR DG AVLRSSIRE+LC EAMH LGIPTTRALCL+ + V R
Sbjct: 106 EKWDLHLKGGGPTPFSRGGDGRAVLRSSIREYLCGEAMHGLGIPTTRALCLLGSDAPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG+++ H + ++ + + R LAD+ I HF H+
Sbjct: 166 E-------EVETGAMIVRMAPSHVRFGTFEFFHYT--EQHVHVAR-LADHVIDAHFPHLS 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
++ + EV ERTA LVAQWQ VGF HGV+NTDNM
Sbjct: 216 G---------------------APERHVRFYAEVVERTARLVAQWQAVGFAHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLT+DYGPFGFLD F+P F N +D G RY F QP I LWN+A L LI
Sbjct: 255 SILGLTLDYGPFGFLDEFEPGFICNHSDHRG-RYAFDQQPRIALWNLACLGEALLT--LI 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ +A + + F + M KLGL + ++ ++ L MA +VDYT FFR
Sbjct: 312 SEDDARAALATFEPSFSAHFLTRMRAKLGLAESKEEDRALVCDLFALMAEARVDYTRFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
ALS V A + D + R +AW Y L + G D ER+A M
Sbjct: 372 ALSRVDAVAEMFPD--------------RARFQAWAE---RYRARLTAEGSVDLERQARM 414
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
VNP+YVLRN++ Q AI A+ GDF +V RLL +E P+ E+ + R PP+W
Sbjct: 415 ERVNPRYVLRNWMAQDAITQAQRGDFSQVERLLAALEDPFTERSEHAELMREPPSWGRH- 473
Query: 635 GVCMLSCSS 643
++SCSS
Sbjct: 474 --LVVSCSS 480
>gi|206560344|ref|YP_002231108.1| hypothetical protein BCAL1981 [Burkholderia cenocepacia J2315]
gi|444358522|ref|ZP_21159918.1| hypothetical protein BURCENBC7_2246 [Burkholderia cenocepacia BC7]
gi|226701087|sp|B4EBK8.1|Y1944_BURCJ RecName: Full=UPF0061 protein BceJ2315_19440
gi|198036385|emb|CAR52281.1| conserved hypothetical protein [Burkholderia cenocepacia J2315]
gi|443603877|gb|ELT71855.1| hypothetical protein BURCENBC7_2246 [Burkholderia cenocepacia BC7]
Length = 522
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 225/536 (41%), Positives = 296/536 (55%), Gaps = 71/536 (13%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPY 186
A +T++ P+A + P +V +S+ VA L+L P +P F F+G P A A+PY
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDEVAQLLDLPPTLAAQPGFAELFTG-NPTRDWPANAMPY 92
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSS
Sbjct: 93 ASVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGGGRTPYSRMGDGRAVLRSS 152
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG
Sbjct: 153 IREFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGH 205
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ S + DL +R LAD+ I D H + Y A
Sbjct: 206 FEHFFSNDRPDL--LRQLADHVI---------------------DRFHPACRDADDPYLA 242
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 243 LLEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDT 302
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGT 471
G RY + QP I WN + L A + +DD +A V+ ++
Sbjct: 303 GG-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPE 359
Query: 472 KFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPE 527
+F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 360 RFGPALERAMRAKLGLALEREGDAELANKLLETMHASHADFTLTFRRLAQISKHDASRD- 418
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
P++ + +D +EA+ +W Y L D R MN NPKYVLRN+L
Sbjct: 419 ----APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAVAMNRANPKYVLRNHL 469
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 470 AEVAIRRAKEKDFSEVERLAQILRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|422832814|ref|ZP_16880882.1| hypothetical protein ESOG_00483 [Escherichia coli E101]
gi|371610830|gb|EHN99357.1| hypothetical protein ESOG_00483 [Escherichia coli E101]
Length = 478
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 222/521 (42%), Positives = 298/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + P + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGPGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFL+ ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLNDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFITEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|386704566|ref|YP_006168413.1| hypothetical protein P12B_c1378 [Escherichia coli P12b]
gi|383102734|gb|AFG40243.1| hypothetical protein P12B_c1378 [Escherichia coli P12b]
Length = 478
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 222/521 (42%), Positives = 298/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEVLRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|365091116|ref|ZP_09328623.1| hypothetical protein KYG_07680 [Acidovorax sp. NO-1]
gi|363416234|gb|EHL23354.1| hypothetical protein KYG_07680 [Acidovorax sp. NO-1]
Length = 494
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 234/520 (45%), Positives = 300/520 (57%), Gaps = 64/520 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P+ + P V S +VA + LD +R F+G T LAG+ P A Y G
Sbjct: 30 FTELRPT-PLPAPHWVGTSTAVAQLIGLDADWLQRDAALQAFTGNTLLAGSRPLASVYSG 88
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI LGE + E+QLKGAG+TPYSR DG AVLRSSIREFLC
Sbjct: 89 HQFGVWAGQLGDGRAILLGE----TAAGLEIQLKGAGRTPYSRMGDGRAVLRSSIREFLC 144
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPT+RALC+ + V R+ + E ++V RVA SF+RFG ++ A+
Sbjct: 145 SEAMHGLGIPTSRALCITGSPAPVRRE-------EVETASVVTRVAPSFVRFGHFEHFAA 197
Query: 313 RGQEDLDI-VRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
DL ++TLADY I ++ E D N YAA V
Sbjct: 198 ---NDLQAQLKTLADYVINRYY-----------------PECRDTRDFGGNAYAALLQAV 237
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
+ERTA L+AQWQ VGF HGV+NTDNMSILGLTIDYGPF FLDAF P N +D G RY
Sbjct: 238 SERTAHLMAQWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFMPGHVCNHSDHQG-RY 296
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
+ QP++ WN+ F A LI D E A +E Y T F + + A M KLGL +
Sbjct: 297 AYNRQPNVAYWNL--FCLAQALLPLIGDPELAKAALESYKTVFPEAFMARMRSKLGLAQA 354
Query: 491 NKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+Q +I +L +A + VDYT F+R LS+ + EL+ L A +
Sbjct: 355 REQDAELIDGILVLLAQNGVDYTIFWRRLSHAV---QTSDFELVRDLFA--------DRS 403
Query: 548 AWISWVLSYIQELLSSGISDEERKAL----MNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
A+ W+LSY + L G KAL M + NPK+VLRN+L + AI AA+LGDFGE+
Sbjct: 404 AFDDWMLSYSELLALDG------KALAANFMLNTNPKFVLRNHLGEQAIRAAKLGDFGEL 457
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
R L +L+ERP++E PG + YA PP WA +SCSS
Sbjct: 458 RTLQRLLERPFEEHPGHDAYAAFPPDWA---SSIEISCSS 494
>gi|421866880|ref|ZP_16298542.1| Selenoprotein O and cysteine-containing homologs [Burkholderia
cenocepacia H111]
gi|358073044|emb|CCE49420.1| Selenoprotein O and cysteine-containing homologs [Burkholderia
cenocepacia H111]
Length = 522
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 225/536 (41%), Positives = 296/536 (55%), Gaps = 71/536 (13%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPY 186
A +T++ P+A + P +V +S+ VA L+L P +P F F+G P A A+PY
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDEVAQLLDLPPTLAAQPGFAELFAG-NPTRDWPANAMPY 92
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSS
Sbjct: 93 ASVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGGGRTPYSRMGDGRAVLRSS 152
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG
Sbjct: 153 IREFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGH 205
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ S + DL +R LAD+ I D H + Y A
Sbjct: 206 FEHFFSNDRPDL--LRQLADHVI---------------------DRFHPACRDADDPYLA 242
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 243 LLEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDT 302
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGT 471
G RY + QP I WN + L A + +DD +A V+ ++
Sbjct: 303 GG-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPE 359
Query: 472 KFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPE 527
+F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 360 RFGPALERAMRAKLGLALEREGDAELANKLLETMHASHADFTLTFRRLAQISKHDASRD- 418
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
P++ + +D +EA+ +W Y L D R MN NPKYVLRN+L
Sbjct: 419 ----APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAVAMNRANPKYVLRNHL 469
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 470 AEVAIRRAKEKDFSEVERLAQILRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|340787584|ref|YP_004753049.1| selenoprotein O-like protein [Collimonas fungivorans Ter331]
gi|340552851|gb|AEK62226.1| Selenoprotein O-like protein [Collimonas fungivorans Ter331]
Length = 501
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 230/553 (41%), Positives = 308/553 (55%), Gaps = 75/553 (13%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
+E L + +SF P A YT+++P+ + P LVA SE A + L
Sbjct: 13 IEHLRFANSFANAFADSP-----------AAYTRLAPT-PLPAPYLVAASEQAAQLIGLT 60
Query: 162 PKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
P DF FSG A + A Y GHQFG+WAGQLGDGRAI LG++ R
Sbjct: 61 PAACGSDDFIQTFSGNRAAADSQSLAAVYSGHQFGVWAGQLGDGRAILLGDVAASDGGRL 120
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKG+G TPYSR DG AVLRSSIRE+LCSEAM LGIPT+RAL ++ + + R+
Sbjct: 121 ELQLKGSGSTPYSRMGDGRAVLRSSIREYLCSEAMAALGIPTSRALSVIGSDQLAMRE-- 178
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHIENM 339
+ E A+V R+A SF+RFGS++ + +R ++ ++TLADY I + ++
Sbjct: 179 -----RPETTAVVTRMAPSFVRFGSFEHWYYNNRPEQ----LKTLADYVIAGFYPELQ-- 227
Query: 340 NKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSI 399
+N Y A EV RTA L+AQWQ VGF HGV+NTDNMSI
Sbjct: 228 -------------------AAANPYQALLAEVTRRTAHLMAQWQAVGFMHGVMNTDNMSI 268
Query: 400 LGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD 459
LGLT+DYGPFGF++A+DP N TD G RY + QP IG WN F+ A LI
Sbjct: 269 LGLTLDYGPFGFMEAYDPRHICNHTDQQG-RYAYNQQPQIGHWNC--FALGQALLPLIGS 325
Query: 460 KE------ANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYT 510
E +NY YG K +DE ++ KLGL + + +++ + M VD+T
Sbjct: 326 VEQTEAALSNY-QALYGAK-LDE---LLHAKLGLLTHQADDDKLLDAMFALMQGSHVDFT 380
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEER 570
FFR L N++ D S ++ L+ + +D + A+ +W L Y L D ER
Sbjct: 381 LFFRRLGNLRLDGSGGDET----LRDLFID-----RAAFDAWALQYRARLKLENSQDHER 431
Query: 571 KALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAW 630
K +M++ NPKYVLRNYL Q+AI+ A+ DF EVR+L +++E P+DEQP +YA LPP W
Sbjct: 432 KLVMDASNPKYVLRNYLAQTAIERAQEKDFSEVRKLQQILENPFDEQPQHAQYAELPPDW 491
Query: 631 AYRPGVCMLSCSS 643
A V SCSS
Sbjct: 492 ARGLEV---SCSS 501
>gi|113675269|ref|NP_001038333.1| uncharacterized protein LOC558542 [Danio rerio]
Length = 612
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 231/583 (39%), Positives = 322/583 (55%), Gaps = 71/583 (12%)
Query: 94 KMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSES 153
+M + L LE L +++ ++ LP D + R V AC++ V P A ++ P +VA S
Sbjct: 15 RMDQSLTPLERLKFNNVALKALPVDSSLEPGSRTVKAACFSLVKPQALIK-PTIVALSGP 73
Query: 154 VADSLELDPKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGE 212
L L ++ + P + SG+ + G+ P A CY GHQFG +AGQLGDG LGE
Sbjct: 74 ALALLGLKVEDVLQDPHAAEYLSGSRLIQGSEPAAHCYCGHQFGQFAGQLGDGAVCYLGE 133
Query: 213 I-LNLKSE------------RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFL 259
+ + + +E RWE+Q+KGAG TPYSR +DG VLRSSIREFLCSEAM L
Sbjct: 134 VEVEVGAEQTTDPNRTSPCGRWEIQVKGAGLTPYSRLSDGRKVLRSSIREFLCSEAMFAL 193
Query: 260 GIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH--------- 310
GIPTTRA LVT+ +V RD FY GNPK E ++V R+A +F+RFGS++I
Sbjct: 194 GIPTTRAGSLVTSDLYVQRDEFYSGNPKPERCSVVLRIAPTFIRFGSFEIFHPLDDFTGR 253
Query: 311 --ASRGQEDLDIVRTLADYAIRHHFRHIE--NMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
S G+ DI L DY I + I+ ++++ E + AA
Sbjct: 254 QGPSVGRP--DIRAGLLDYVIETFYPEIQRGHLDRKE-------------------RNAA 292
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
+ EV RTA LVA WQ VGF HGVLNTDNMSILGLTIDYGPFGF+D FDP F N +D
Sbjct: 293 FFREVTVRTAKLVALWQSVGFCHGVLNTDNMSILGLTIDYGPFGFMDRFDPEFVCNASDK 352
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG 486
G RY + QP + WN+A+ + L A I +A +++ + + + D Y M KKLG
Sbjct: 353 KG-RYTYEAQPYVCRWNLARLAEALGAE--IQSIKAGVILDEFMSLYEDFYLGNMRKKLG 409
Query: 487 LPKYNK----QIISKLLNNMAVDKVDYTNFFRALSNVKA---DPSIPED-----ELLVPL 534
L + + ++++ +L M + D+TN FR LS++ + DP+ ++ EL+V
Sbjct: 410 LLRKQEPEDGELVADMLKTMHITGADFTNTFRLLSDISSPVGDPAEKDNTDSVVELIVDQ 469
Query: 535 KAVL--LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAI 592
A+L L + + + ++ER MNS NP VLRNY+ Q+AI
Sbjct: 470 CALLEELKVANHPTMQPGKRLAFECNQASDPASVEKERVRFMNSTNPAVVLRNYIAQNAI 529
Query: 593 DAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
DAAE GDF EV+R+L+++E PY P +E P W+ G
Sbjct: 530 DAAEKGDFSEVQRVLRVLENPYSVSPDLEC-----PVWSAGKG 567
>gi|329901819|ref|ZP_08272911.1| Selenoprotein O and cysteine-like protein [Oxalobacteraceae
bacterium IMCC9480]
gi|327549002|gb|EGF33614.1| Selenoprotein O and cysteine-like protein [Oxalobacteraceae
bacterium IMCC9480]
Length = 493
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 223/537 (41%), Positives = 299/537 (55%), Gaps = 54/537 (10%)
Query: 115 LPGDPRTDSI----PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
LP RTD++ L A ++ + P LV S + A + LDP EF +F
Sbjct: 3 LPTLKRTDTLDIGNTFAALPAAFSTRLLPTPLATPYLVCASPTAAALIHLDPAEFTTDNF 62
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
F+G A + P A Y GHQFG+WAGQLGDGRAI LG++ ++ R ELQLKGAG
Sbjct: 63 IETFTGNRIPADSTPLAAVYSGHQFGVWAGQLGDGRAILLGDVPSVAG-RMELQLKGAGP 121
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSR DG AVLRSSIREFLCSEAM LGIPTTRALC+ + + R+ E
Sbjct: 122 TPYSRGGDGRAVLRSSIREFLCSEAMAGLGIPTTRALCVTGSDQRAMRE-------APET 174
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
A+ R+A SF+RFGS++ + Q +L +R LAD+ I H+
Sbjct: 175 TAVTTRMAPSFIRFGSFEHWYQKDQPEL--LRALADHVIDQHYPQARA------------ 220
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
+N YAA V RTA +VA WQ VGF HGV+NTDNMSILGLT+DYGPFG
Sbjct: 221 ---------DANPYAALLTSVTRRTAQMVAHWQAVGFMHGVMNTDNMSILGLTLDYGPFG 271
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI-AQFSTTLAAAKLIDDKEANYVMERY 469
F+D FDPS N TD G RY ++ QP I WN A L ++D EA + +
Sbjct: 272 FMDGFDPSHICNHTDQQG-RYAYSMQPQIAHWNCYALGQALLPLIGTVEDTEA--ALANF 328
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
+ + A++ KLGL + ++ L + +VD+T FFR L +++ P
Sbjct: 329 KPDYDSKMAALLQAKLGLLSVLPDDAALVDSLFAILQAGRVDFTLFFRRLGDLQT--GRP 386
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
E + PL+ + +D + A+ +W +Y Q L D ER+ M++VNPKY+LRN+
Sbjct: 387 ESD--APLRDLFID-----RPAFDAWAAAYRQRLQQEPRGDAERRLAMHAVNPKYILRNH 439
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L Q AI+ A+ DF EV RLL ++++P+D+QP + YA LPP WA + V SCSS
Sbjct: 440 LAQVAIEKAQDRDFSEVARLLAILDKPFDDQPEFDNYAALPPDWASQLEV---SCSS 493
>gi|351732228|ref|ZP_08949919.1| hypothetical protein AradN_20737 [Acidovorax radicis N35]
Length = 494
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 230/517 (44%), Positives = 308/517 (59%), Gaps = 58/517 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P+ + +P V S +VA + LD +R + F+G T LAG+ P A Y G
Sbjct: 30 FTELRPT-PLPDPHWVGTSTAVAQLIGLDTDWLQRDEALQAFTGNTLLAGSRPLASVYSG 88
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI LGE +E E+QLKGAG+TPYSR DG AVLRSSIREFLC
Sbjct: 89 HQFGVWAGQLGDGRAILLGE----TAEGLEIQLKGAGRTPYSRMGDGRAVLRSSIREFLC 144
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPT+RALC+ + V R+ + E ++V RVA SF+RFG ++ A+
Sbjct: 145 SEAMHGLGIPTSRALCITGSPAPVRRE-------EVETASVVTRVAPSFVRFGHFEHFAA 197
Query: 313 RGQEDLD-IVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
DL ++TLADY I ++ + + D N YAA V
Sbjct: 198 ---NDLQPQLKTLADYVIDRYYPECRDNH-----------------DFGGNPYAALLQAV 237
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
+ERTA L+AQWQ VGF HGV+NTDNMSILGLTIDYGPF FLDAF P N +D G RY
Sbjct: 238 SERTARLMAQWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFVPGHVCNHSDNQG-RY 296
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
+ QP++ WN+ F A LI D+E A +E Y T F + + A M KLGL
Sbjct: 297 AYNRQPNVAYWNL--FCLAQALLPLIGDQELAKGALESYKTVFPEAFMARMRAKLGLASA 354
Query: 491 NK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++I +L +A + VDYT F+R LS+ ++ +D+ P + + D +
Sbjct: 355 REGDGELIDGILMLLAQNGVDYTIFWRRLSH-----AVQQDD-FEPARDLFAD-----RT 403
Query: 548 AWISWVLSYIQELLSSGISDEERKA-LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
A+ +W+LSY ELL+ + ++ A LM NPK+VLRN+L + AI AA+LGDF E++ L
Sbjct: 404 AFDNWLLSY-SELLA--LDNKALAANLMLKTNPKFVLRNHLGEQAIRAAKLGDFSELQTL 460
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+L+E P+DE PG + YA PP WA +SCSS
Sbjct: 461 QRLLEHPFDEHPGHDAYAAFPPDWA---SSIEISCSS 494
>gi|332525963|ref|ZP_08402104.1| hypothetical protein RBXJA2T_08925 [Rubrivivax benzoatilyticus JA2]
gi|332109514|gb|EGJ10437.1| hypothetical protein RBXJA2T_08925 [Rubrivivax benzoatilyticus JA2]
Length = 494
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 219/475 (46%), Positives = 275/475 (57%), Gaps = 49/475 (10%)
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
L A P G + A Y GHQFG+WAGQLGDGRA+ LGE + ELQLKG+G T
Sbjct: 66 LLAGNAQPAGGTL--ATVYSGHQFGVWAGQLGDGRALLLGEA-DTPLGPLELQLKGSGLT 122
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR DG AVLRSSIRE+L SEAMH LGIPTTRAL LV + V R+ + E
Sbjct: 123 PYSRMGDGRAVLRSSIREYLGSEAMHALGIPTTRALALVGSPLPVRRE-------RVETA 175
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
A+V RVA SFLRFG ++ H + D +R LAD AI +F ++E+
Sbjct: 176 AVVTRVAPSFLRFGHFE-HFAHTAADNAALRRLADDAIERYF-----PAQAEA------- 222
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
+N+YAA EVA RTA LVAQWQ VGF HGV+NTDNMS+LGLTIDYGPFGF
Sbjct: 223 ---------ANRYAALLEEVARRTARLVAQWQAVGFCHGVMNTDNMSLLGLTIDYGPFGF 273
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGT 471
LDAFDP N +D G RY +A QP++ WN+ + L ++D A +E Y +
Sbjct: 274 LDAFDPGHVCNHSDHQG-RYAYARQPNVAFWNLHALAQALLPL-IVDPDAAVAALEPYKS 331
Query: 472 KFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPED 528
+F+ Q M KLGL + ++ LL MA D DYT FR L+ + P D
Sbjct: 332 EFLAALQTAMRAKLGLRDERPEDGALVDDLLRRMAADGADYTISFRRLARFDSTPGATHD 391
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
L+ + LD +EA+ +W L Y + L + D ER+ M NPKYVLRN+L
Sbjct: 392 A----LRDLFLD-----REAFDAWALRYAERLRAEASVDAERRLRMERTNPKYVLRNHLA 442
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++AI AE GDFGEV RLL +++ P+DEQP E A PP WA + +SCSS
Sbjct: 443 ETAIRQAEAGDFGEVSRLLAVLQHPFDEQPEHEALAGFPPDWARQ---LEISCSS 494
>gi|223461567|gb|AAI41294.1| RIKEN cDNA 1300018J18 gene [Mus musculus]
Length = 667
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 250/630 (39%), Positives = 325/630 (51%), Gaps = 117/630 (18%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +RELP G + + PR V AC+++ P A + P+LVA SE
Sbjct: 46 LAGLRFDNRALRELPVETPPPGPEDSLATPRPVPGACFSRARP-APLRRPRLVALSEPAL 104
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L+ E + LFFSG L G P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 105 ALLGLEASEEAEVEAEAALFFSGNALLPGTEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 164
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+
Sbjct: 165 CTAAGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSE 224
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V R+A +F+RFGS++I H R + DI L
Sbjct: 225 STVMRDVFYDGNPKYEKCTVVLRIAPTFIRFGSFEIFKPPDEHTGRAGPSVGRDDIRVQL 284
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + T D D+ + AA+ EV +RTA +VA+WQ
Sbjct: 285 LDYVISSFYPEIQAAH--------TCDTDNI------QRNAAFFREVTQRTARMVAEWQC 330
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D GR Y ++ QP + WN+
Sbjct: 331 VGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHICNASDNAGR-YTYSKQPQVCKWNL 389
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKL-- 498
+ + L + EA + E + T+F Y M KKLGL + K+ +++KL
Sbjct: 390 QKLAEALEPELPLAAAEA-ILKEEFDTEFQRHYLQKMRKKLGLIRVEKEEDGTLVAKLLE 448
Query: 499 ---------------LNNMAVDKVDYTNFFRALSNVKA-------------DP------- 523
L++ D D F L++ A DP
Sbjct: 449 TMHLTGADFTNTFCVLSSFPADLSDSAEFLSRLTSQCASLEELRLAFRPQMDPRQLSMML 508
Query: 524 ----SIPEDELLVPLKAVLL------------------DIGKERKEAWISWVLSYIQEL- 560
S P+ L+ +A + D+ ++ ++ W +W+ Y L
Sbjct: 509 MLAQSNPQLFALIGTQANVTKELERVEHQSRLEQLSPSDLQRKNRDHWEAWLQEYRDRLD 568
Query: 561 -LSSGISD-----EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPY 614
G+ D ER +M + NPKYVLRNY+ Q AI+AAE GDF EVRR+LKL+E PY
Sbjct: 569 KEKEGVGDTAAWQAERVRVMRANNPKYVLRNYIAQKAIEAAENGDFSEVRRVLKLLESPY 628
Query: 615 ---DEQPGMEKYAR----------LPPAWA 631
+E G E AR PP WA
Sbjct: 629 HSEEEATGPEAVARSTEEQSSYSNRPPLWA 658
>gi|421080538|ref|ZP_15541456.1| UPF0061 fanily protein YdiU [Pectobacterium wasabiae CFBP 3304]
gi|401704550|gb|EJS94755.1| UPF0061 fanily protein YdiU [Pectobacterium wasabiae CFBP 3304]
Length = 483
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 226/517 (43%), Positives = 285/517 (55%), Gaps = 58/517 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P+ + +L+ SE +A L L F P +SG L+G P AQ Y G
Sbjct: 19 YTALPPTP-LHGARLLYHSEGLAAELGLSSDWFT-PAQDNVWSGERLLSGMEPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFGMWAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS IREFL
Sbjct: 77 HQFGMWAGQLGDGRGILLGEQQLADGRSMDWHLKGAGFTPYSRMGDGRAVLRSVIREFLA 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH+LGIPTTRAL +VT+ V R+ +EE GA++ RVA+S +RFG ++
Sbjct: 137 SEAMHYLGIPTTRALTIVTSTHPVQRE-------QEEKGAMLLRVAESHVRFGHFEHFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + + VR LA+Y I H+ EN DE +Y W +V
Sbjct: 190 R--REPEKVRQLAEYVIARHWPQWEN------------DE---------RRYELWFGDVV 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+ WQ VGF+HGV+NTDNMSILGLTIDYGP+GFLDA+ P F N +D G RY
Sbjct: 227 ERTARLITHWQAVGFSHGVMNTDNMSILGLTIDYGPYGFLDAYQPDFICNHSDHRG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL----P 488
F NQP +GLWN+ + + L+ L+D + RY M Y +M KLG P
Sbjct: 286 FDNQPAVGLWNLHRLAQALSG--LMDTDALERALARYEPALMQHYGTLMRAKLGFFTASP 343
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRAL--SNVKADPSIPEDELLVPLKAVLLDIGKERK 546
N ++ +LL M + DYT FR L S +A S+ DE + +
Sbjct: 344 DDND-VLVELLRLMQKEGSDYTRTFRLLADSEKQASRSLLRDEFI-------------DR 389
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
A+ SW Y Q L+ SDEER+ LMN+ NPKY+LRNYL Q AI+ AE D + RL
Sbjct: 390 AAFDSWFAVYRQRLMQEDQSDEERRRLMNATNPKYILRNYLAQMAIERAENDDISVLARL 449
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + RP+DEQP A LPP W +SCSS
Sbjct: 450 HQALCRPFDEQPDNNDLAALPPDWGKH---LEISCSS 483
>gi|320170405|gb|EFW47304.1| UPF0061 protein [Capsaspora owczarzaki ATCC 30864]
Length = 635
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 238/625 (38%), Positives = 320/625 (51%), Gaps = 113/625 (18%)
Query: 100 KALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
+ LN+D++F R+LPGD + R+V CY+ P+ NP+LV + A L+
Sbjct: 43 RLFHQLNFDNTFARQLPGDGIEANYTRQVRGVCYSNAVPTPST-NPRLVHANAGAAALLD 101
Query: 160 LDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG-----------------------HQFG 196
L+P E P+F SG + A P A Y G HQFG
Sbjct: 102 LNPSELATPEFVDVVSGCALHSTAKPIALTYAGNNANCVNVPVMPQQLTAIPLRPGHQFG 161
Query: 197 MWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAM 256
+AGQLGDGRAI+LGE++N ERWE+QLKGAG TPYSRFADG AVLRSSIRE++CSEAM
Sbjct: 162 SFAGQLGDGRAISLGEVVNHHGERWEMQLKGAGMTPYSRFADGRAVLRSSIREYMCSEAM 221
Query: 257 HFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQE 316
+ LG+PT+RAL LV T + V R+ EPGAIVCR+AQS++RFGS++ Q
Sbjct: 222 NALGVPTSRALSLVVTDEKVVRETV-------EPGAIVCRLAQSWIRFGSFEHQFYFKQP 274
Query: 317 DLDIVRTLADYAIRHHF-RHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERT 375
+++ L DY I HHF ++E S DED +Y A+ EVA RT
Sbjct: 275 --KVLKRLVDYTITHHFPSYLETAMPGAS------DED---------RYLAFYREVARRT 317
Query: 376 ASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFAN 435
A +A WQ VGF GVLNTDN SILGL+IDYGPF F++AFD N TD G Y +
Sbjct: 318 AHTIALWQAVGFVGGVLNTDNFSILGLSIDYGPFAFMEAFDDDAVFNHTDSEG-MYAYGR 376
Query: 436 QPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL-------P 488
QPD+G WN+++ + +A + +++ + A V+ Y + F Y A M KLGL
Sbjct: 377 QPDVGHWNLSRLA--IALSPVLEVERAREVLLEYPSMFHKAYVAKMRSKLGLLAALPDKD 434
Query: 489 KYNKQIISKLLNNM-----AVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA---VLLD 540
+ + ++ +LL+ M D+T FFR LS S ++ ++A + L
Sbjct: 435 ESDAALVKELLDAMQSQPGTTSGADWTIFFRTLSEAAPSLSATDEASQQQIEADSNLKLA 494
Query: 541 IGKERK------------EAWISWVLSYIQELLSSGISDEE------------------- 569
+ RK W +W Y L + E
Sbjct: 495 TTRARKALECMFQDEKVSSKWSAWRQKYTARLAEDSTAVREHSKLGGGLLLPGLSSSLDA 554
Query: 570 ----------RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPG 619
R+ +M NPKY+LR ++ Q AIDAA DF V +L KL++RPYD+QP
Sbjct: 555 SSTALAIGLARRDVMKQHNPKYILRTWMAQKAIDAATANDFTVVDQLFKLLQRPYDDQPE 614
Query: 620 MEK-YARLPPAWAYRPGVCMLSCSS 643
+ YAR A G LSCSS
Sbjct: 615 FDDVYARQDTA----TGPVCLSCSS 635
>gi|419921041|ref|ZP_14439137.1| hypothetical protein ECKD2_23279 [Escherichia coli KD2]
gi|388383351|gb|EIL45130.1| hypothetical protein ECKD2_23279 [Escherichia coli KD2]
Length = 478
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 222/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|301026974|ref|ZP_07190364.1| SelO family protein [Escherichia coli MS 69-1]
gi|300395242|gb|EFJ78780.1| SelO family protein [Escherichia coli MS 69-1]
Length = 478
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 222/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQLVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|218695268|ref|YP_002402935.1| hypothetical protein EC55989_1874 [Escherichia coli 55989]
gi|407469456|ref|YP_006784102.1| hypothetical protein O3O_13935 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|407481882|ref|YP_006779031.1| hypothetical protein O3K_11700 [Escherichia coli O104:H4 str.
2011C-3493]
gi|410482432|ref|YP_006769978.1| hypothetical protein O3M_11665 [Escherichia coli O104:H4 str.
2009EL-2050]
gi|417667085|ref|ZP_12316633.1| hypothetical protein ECSTECO31_1889 [Escherichia coli STEC_O31]
gi|417805218|ref|ZP_12452174.1| hypothetical protein HUSEC_09624 [Escherichia coli O104:H4 str.
LB226692]
gi|417832942|ref|ZP_12479390.1| hypothetical protein HUSEC41_09222 [Escherichia coli O104:H4 str.
01-09591]
gi|417865475|ref|ZP_12510519.1| hypothetical protein C22711_2407 [Escherichia coli O104:H4 str.
C227-11]
gi|422987706|ref|ZP_16978482.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. C227-11]
gi|422994589|ref|ZP_16985353.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. C236-11]
gi|422999775|ref|ZP_16990529.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 09-7901]
gi|423003388|ref|ZP_16994134.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 04-8351]
gi|423009902|ref|ZP_17000640.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-3677]
gi|423019131|ref|ZP_17009840.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4404]
gi|423024297|ref|ZP_17014994.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4522]
gi|423030114|ref|ZP_17020802.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4623]
gi|423037946|ref|ZP_17028620.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C1]
gi|423043067|ref|ZP_17033734.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C2]
gi|423044806|ref|ZP_17035467.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C3]
gi|423053339|ref|ZP_17042147.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C4]
gi|423060305|ref|ZP_17049101.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C5]
gi|429719161|ref|ZP_19254101.1| hypothetical protein MO3_01886 [Escherichia coli O104:H4 str.
Ec11-9450]
gi|429724506|ref|ZP_19259374.1| hypothetical protein MO5_00493 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429776204|ref|ZP_19308189.1| hypothetical protein C212_00808 [Escherichia coli O104:H4 str.
11-02030]
gi|429780657|ref|ZP_19312604.1| hypothetical protein C213_00805 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429783244|ref|ZP_19315160.1| hypothetical protein C214_00808 [Escherichia coli O104:H4 str.
11-02092]
gi|429790422|ref|ZP_19322291.1| hypothetical protein C215_00806 [Escherichia coli O104:H4 str.
11-02093]
gi|429794384|ref|ZP_19326225.1| hypothetical protein C216_00806 [Escherichia coli O104:H4 str.
11-02281]
gi|429798037|ref|ZP_19329841.1| hypothetical protein C217_00806 [Escherichia coli O104:H4 str.
11-02318]
gi|429806457|ref|ZP_19338196.1| hypothetical protein C218_00805 [Escherichia coli O104:H4 str.
11-02913]
gi|429810902|ref|ZP_19342603.1| hypothetical protein C219_00807 [Escherichia coli O104:H4 str.
11-03439]
gi|429816342|ref|ZP_19348000.1| hypothetical protein C220_00806 [Escherichia coli O104:H4 str.
11-04080]
gi|429821029|ref|ZP_19352643.1| hypothetical protein C221_00805 [Escherichia coli O104:H4 str.
11-03943]
gi|429912704|ref|ZP_19378660.1| hypothetical protein MO7_00476 [Escherichia coli O104:H4 str.
Ec11-9941]
gi|429913574|ref|ZP_19379522.1| hypothetical protein O7C_00463 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429918616|ref|ZP_19384549.1| hypothetical protein O7E_00480 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429924422|ref|ZP_19390336.1| hypothetical protein O7G_01282 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429928361|ref|ZP_19394263.1| hypothetical protein O7I_00157 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429934914|ref|ZP_19400801.1| hypothetical protein O7K_01726 [Escherichia coli O104:H4 str.
Ec11-4988]
gi|429940584|ref|ZP_19406458.1| hypothetical protein O7M_02287 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429948217|ref|ZP_19414072.1| hypothetical protein O7O_04820 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|429950862|ref|ZP_19416710.1| hypothetical protein S7Y_02285 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|429954160|ref|ZP_19419996.1| hypothetical protein S91_00534 [Escherichia coli O104:H4 str.
Ec12-0466]
gi|432750162|ref|ZP_19984769.1| hypothetical protein WEQ_01579 [Escherichia coli KTE29]
gi|432765059|ref|ZP_19999498.1| hypothetical protein A1S5_02617 [Escherichia coli KTE48]
gi|254814080|sp|B7L6H9.1|YDIU_ECO55 RecName: Full=UPF0061 protein YdiU
gi|218352000|emb|CAU97732.1| conserved hypothetical protein [Escherichia coli 55989]
gi|340733824|gb|EGR62954.1| hypothetical protein HUSEC41_09222 [Escherichia coli O104:H4 str.
01-09591]
gi|340740121|gb|EGR74346.1| hypothetical protein HUSEC_09624 [Escherichia coli O104:H4 str.
LB226692]
gi|341918764|gb|EGT68377.1| hypothetical protein C22711_2407 [Escherichia coli O104:H4 str.
C227-11]
gi|354865664|gb|EHF26093.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. C236-11]
gi|354869833|gb|EHF30241.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. C227-11]
gi|354870921|gb|EHF31321.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 04-8351]
gi|354874338|gb|EHF34709.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 09-7901]
gi|354881270|gb|EHF41600.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-3677]
gi|354891573|gb|EHF51801.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4404]
gi|354894458|gb|EHF54652.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4522]
gi|354896740|gb|EHF56909.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C1]
gi|354899705|gb|EHF59849.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4623]
gi|354901864|gb|EHF61988.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C2]
gi|354914529|gb|EHF74513.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C5]
gi|354919021|gb|EHF78976.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C3]
gi|354919882|gb|EHF79821.1| UPF0061 protein ydiU [Escherichia coli O104:H4 str. 11-4632 C4]
gi|397785332|gb|EJK96182.1| hypothetical protein ECSTECO31_1889 [Escherichia coli STEC_O31]
gi|406777594|gb|AFS57018.1| hypothetical protein O3M_11665 [Escherichia coli O104:H4 str.
2009EL-2050]
gi|407054179|gb|AFS74230.1| hypothetical protein O3K_11700 [Escherichia coli O104:H4 str.
2011C-3493]
gi|407065491|gb|AFS86538.1| hypothetical protein O3O_13935 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|429347950|gb|EKY84722.1| hypothetical protein C212_00808 [Escherichia coli O104:H4 str.
11-02030]
gi|429350458|gb|EKY87189.1| hypothetical protein C213_00805 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429354631|gb|EKY91327.1| hypothetical protein C214_00808 [Escherichia coli O104:H4 str.
11-02092]
gi|429364750|gb|EKZ01369.1| hypothetical protein C215_00806 [Escherichia coli O104:H4 str.
11-02093]
gi|429372400|gb|EKZ08950.1| hypothetical protein C216_00806 [Escherichia coli O104:H4 str.
11-02281]
gi|429374350|gb|EKZ10890.1| hypothetical protein C217_00806 [Escherichia coli O104:H4 str.
11-02318]
gi|429380075|gb|EKZ16574.1| hypothetical protein C218_00805 [Escherichia coli O104:H4 str.
11-02913]
gi|429384455|gb|EKZ20912.1| hypothetical protein C219_00807 [Escherichia coli O104:H4 str.
11-03439]
gi|429386539|gb|EKZ22987.1| hypothetical protein C221_00805 [Escherichia coli O104:H4 str.
11-03943]
gi|429394158|gb|EKZ30539.1| hypothetical protein MO3_01886 [Escherichia coli O104:H4 str.
Ec11-9450]
gi|429394454|gb|EKZ30830.1| hypothetical protein MO5_00493 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429396463|gb|EKZ32815.1| hypothetical protein C220_00806 [Escherichia coli O104:H4 str.
11-04080]
gi|429407338|gb|EKZ43591.1| hypothetical protein O7C_00463 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429410169|gb|EKZ46392.1| hypothetical protein O7G_01282 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429418731|gb|EKZ54873.1| hypothetical protein O7K_01726 [Escherichia coli O104:H4 str.
Ec11-4988]
gi|429426329|gb|EKZ62418.1| hypothetical protein O7M_02287 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429426735|gb|EKZ62822.1| hypothetical protein O7I_00157 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429431299|gb|EKZ67348.1| hypothetical protein O7E_00480 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429440661|gb|EKZ76638.1| hypothetical protein O7O_04820 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|429444241|gb|EKZ80187.1| hypothetical protein S91_00534 [Escherichia coli O104:H4 str.
Ec12-0466]
gi|429449868|gb|EKZ85766.1| hypothetical protein S7Y_02285 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|429453731|gb|EKZ89599.1| hypothetical protein MO7_00476 [Escherichia coli O104:H4 str.
Ec11-9941]
gi|431297079|gb|ELF86737.1| hypothetical protein WEQ_01579 [Escherichia coli KTE29]
gi|431310820|gb|ELF99000.1| hypothetical protein A1S5_02617 [Escherichia coli KTE48]
Length = 478
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 222/521 (42%), Positives = 298/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|253688840|ref|YP_003018030.1| hypothetical protein PC1_2463 [Pectobacterium carotovorum subsp.
carotovorum PC1]
gi|259646851|sp|C6DKP3.1|Y2463_PECCP RecName: Full=UPF0061 protein PC1_2463
gi|251755418|gb|ACT13494.1| protein of unknown function UPF0061 [Pectobacterium carotovorum
subsp. carotovorum PC1]
Length = 483
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 220/514 (42%), Positives = 284/514 (55%), Gaps = 52/514 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P + +L+ SE +A L L F P+ +SG L G P AQ Y G
Sbjct: 19 YTALQPKP-LHGARLLYHSEGLAAELGLSSDWFT-PEQDAVWSGERLLPGMEPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFGMWAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS IREFL
Sbjct: 77 HQFGMWAGQLGDGRGILLGEQQLADGRSMDWHLKGAGLTPYSRMGDGRAVLRSVIREFLA 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL +VT+ V R+ +EE GA++ RVA+S +RFG ++
Sbjct: 137 SEAMHHLGIPTTRALTIVTSTHPVQRE-------QEEKGAMLMRVAESHVRFGHFEHFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + + VR L +Y I H+ EN DE +Y W +V
Sbjct: 190 R--REPEKVRQLVEYVIARHWPQWEN------------DE---------RRYELWFGDVV 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+ WQ VGF+HGV+NTDNMSILGLTIDYGP+GFLDA+ P+F N +D G RY
Sbjct: 227 ERTARLITHWQAVGFSHGVMNTDNMSILGLTIDYGPYGFLDAYQPNFICNHSDHRG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F NQP +GLWN+ + + L+ L+D + RY M Y +M KLGL
Sbjct: 286 FDNQPAVGLWNLHRLAQALSG--LMDTDTLERALARYEPALMQHYGTLMRAKLGLFTASA 343
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ ++ LL M + DYT+ FR L++ + S PL+ +D + A+
Sbjct: 344 EDNDVLVGLLRLMQQEGSDYTHTFRLLADSEKQAS------RAPLRDEFID-----RAAF 392
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
SW +Y Q L+ DEER+ LMN+ NPK++LRNYL Q AI+ AE D + RL +
Sbjct: 393 DSWFATYRQRLMQEEQGDEERRRLMNTTNPKFILRNYLAQMAIERAENDDISVLARLHQA 452
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ +P+DEQP A LPP W +SCSS
Sbjct: 453 LCQPFDEQPDKNDLAALPPEWGKH---LEISCSS 483
>gi|81295807|ref|NP_082181.2| selenoprotein O [Mus musculus]
gi|341942275|sp|Q9DBC0.4|SELO_MOUSE RecName: Full=Selenoprotein O; Short=SelO
Length = 667
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 250/630 (39%), Positives = 325/630 (51%), Gaps = 117/630 (18%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +RELP G + + PR V AC+++ P A + P+LVA SE
Sbjct: 46 LAGLRFDNRALRELPVETPPPGPEDSLATPRPVPGACFSRARP-APLRRPRLVALSEPAL 104
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L+ E + LFFSG L G P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 105 ALLGLEASEEAEVEAEAALFFSGNALLPGTEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 164
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+
Sbjct: 165 CTAAGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSE 224
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V R+A +F+RFGS++I H R + DI L
Sbjct: 225 STVMRDVFYDGNPKYEKCTVVLRIAPTFIRFGSFEIFKPPDEHTGRAGPSVGRDDIRVQL 284
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + T D D+ + AA+ EV +RTA +VA+WQ
Sbjct: 285 LDYVISSFYPEIQAAH--------TCDTDNI------QRNAAFFREVTQRTARMVAEWQC 330
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D GR Y ++ QP + WN+
Sbjct: 331 VGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHICNASDNAGR-YTYSKQPQVCKWNL 389
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKL-- 498
+ + L + EA + E + T+F Y M KKLGL + K+ +++KL
Sbjct: 390 QKLAEALEPELPLALAEA-ILKEEFDTEFQRHYLQKMRKKLGLIRVEKEEDGTLVAKLLE 448
Query: 499 ---------------LNNMAVDKVDYTNFFRALSNVKA-------------DP------- 523
L++ D D F L++ A DP
Sbjct: 449 TMHLTGADFTNTFCVLSSFPADLSDSAEFLSRLTSQCASLEELRLAFRPQMDPRQLSMML 508
Query: 524 ----SIPEDELLVPLKAVLL------------------DIGKERKEAWISWVLSYIQEL- 560
S P+ L+ +A + D+ ++ ++ W +W+ Y L
Sbjct: 509 MLAQSNPQLFALIGTQANVTKELERVEHQSRLEQLSPSDLQRKNRDHWEAWLQEYRDRLD 568
Query: 561 -LSSGISD-----EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPY 614
G+ D ER +M + NPKYVLRNY+ Q AI+AAE GDF EVRR+LKL+E PY
Sbjct: 569 KEKEGVGDTAAWQAERVRVMRANNPKYVLRNYIAQKAIEAAENGDFSEVRRVLKLLESPY 628
Query: 615 ---DEQPGMEKYAR----------LPPAWA 631
+E G E AR PP WA
Sbjct: 629 HSEEEATGPEAVARSTEEQSSYSNRPPLWA 658
>gi|432449719|ref|ZP_19691991.1| hypothetical protein A13W_00666 [Escherichia coli KTE193]
gi|433033444|ref|ZP_20221176.1| hypothetical protein WIC_02017 [Escherichia coli KTE112]
gi|430981295|gb|ELC98023.1| hypothetical protein A13W_00666 [Escherichia coli KTE193]
gi|431553434|gb|ELI27360.1| hypothetical protein WIC_02017 [Escherichia coli KTE112]
Length = 478
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 223/522 (42%), Positives = 297/522 (56%), Gaps = 57/522 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + P + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGPGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
++ + R E VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYCREPEK---VRQLADFAIRHYWSHLED------------DED---------KY 215
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +
Sbjct: 216 RLWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHS 275
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +K
Sbjct: 276 DHQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQK 332
Query: 485 LGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LG K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 333 LGFITEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID- 385
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD
Sbjct: 386 ----RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMT 441
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 442 ELHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|307310723|ref|ZP_07590369.1| protein of unknown function UPF0061 [Escherichia coli W]
gi|378712856|ref|YP_005277749.1| hypothetical protein [Escherichia coli KO11FL]
gi|386609094|ref|YP_006124580.1| hypothetical protein ECW_m1875 [Escherichia coli W]
gi|386701329|ref|YP_006165166.1| hypothetical protein KO11_14215 [Escherichia coli KO11FL]
gi|386709562|ref|YP_006173283.1| hypothetical protein WFL_09185 [Escherichia coli W]
gi|306908901|gb|EFN39397.1| protein of unknown function UPF0061 [Escherichia coli W]
gi|315061011|gb|ADT75338.1| conserved protein [Escherichia coli W]
gi|323378417|gb|ADX50685.1| protein of unknown function UPF0061 [Escherichia coli KO11FL]
gi|383392856|gb|AFH17814.1| hypothetical protein KO11_14215 [Escherichia coli KO11FL]
gi|383405254|gb|AFH11497.1| hypothetical protein WFL_09185 [Escherichia coli W]
Length = 478
Score = 366 bits (939), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|398812132|ref|ZP_10570907.1| hypothetical protein PMI12_05012 [Variovorax sp. CF313]
gi|398078760|gb|EJL69646.1| hypothetical protein PMI12_05012 [Variovorax sp. CF313]
Length = 493
Score = 366 bits (939), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 224/518 (43%), Positives = 303/518 (58%), Gaps = 56/518 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF-FSGATPLAGAVPYAQC 189
A +T++ P+ + +P V SE+VA L L P + D L +G P+AG+ P+A
Sbjct: 27 AFFTELRPT-PLPDPYWVGRSEAVARELGL-PAGWHSSDGTLAALTGNLPVAGSRPFATV 84
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG+WAGQLGDGRAIT+GE E+QLKGAG+TPYSR DG AVLRSSIRE
Sbjct: 85 YSGHQFGVWAGQLGDGRAITVGET----EGGLEVQLKGAGRTPYSRGGDGRAVLRSSIRE 140
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
FLCSEAMH LGIPTTRALC+ + V R+ + E A+V RVA SF+RFG ++
Sbjct: 141 FLCSEAMHGLGIPTTRALCVTGSDARVYRE-------EPESAAVVTRVAPSFIRFGHFEH 193
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
A+ +ED +R LADY I H+ + N YAA+
Sbjct: 194 FAANQREDE--LRALADYVIDRHYPACRTTGR-----------------FGGNAYAAFLE 234
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
V+ERTA+L+A+WQ VGF HGV+NTDNMSILGLTIDYGPF FLD FDP N +D G
Sbjct: 235 AVSERTAALLARWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDGFDPRHICNHSDTSG- 293
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F QP++ WN+ F A LI D+E A +E Y T F +E++ M KLGL
Sbjct: 294 RYAFNQQPNVAYWNL--FCLAQALLPLIGDQEVAVAALESYKTVFPNEFEGRMRAKLGLA 351
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ ++ +I +L +A KVDYT F+R LS AD ++ P++ + LD
Sbjct: 352 SPAEGDRALIEGVLKLLAAGKVDYTIFWRRLSTHMADGNVE------PVRDLFLD----- 400
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+E + +W+L++ + ++G + + LM NP++VLRN+L Q AI+A++ D V
Sbjct: 401 REGFDAWLLAFSERHTTTGRT--QAADLMLKSNPRFVLRNHLGQQAIEASQQKDHSGVAT 458
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LL ++E P++E P + A PP WA +SCSS
Sbjct: 459 LLAVLETPFEEHPDADALAGFPPDWA---STIEISCSS 493
>gi|16764696|ref|NP_460311.1| hypothetical protein STM1345 [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|167994361|ref|ZP_02575453.1| protein YdiU [Salmonella enterica subsp. enterica serovar
4,[5],12:i:- str. CVM23701]
gi|374980353|ref|ZP_09721683.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Typhimurium str. TN061786]
gi|378444775|ref|YP_005232407.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. D23580]
gi|378449849|ref|YP_005237208.1| hypothetical protein STM14_1633 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 14028S]
gi|378983902|ref|YP_005247057.1| hypothetical protein STMDT12_C13610 [Salmonella enterica subsp.
enterica serovar Typhimurium str. T000240]
gi|378988686|ref|YP_005251850.1| hypothetical protein STMUK_1312 [Salmonella enterica subsp.
enterica serovar Typhimurium str. UK-1]
gi|422025496|ref|ZP_16371926.1| hypothetical protein B571_06665 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|422030500|ref|ZP_16376699.1| hypothetical protein B572_06617 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|427549155|ref|ZP_18927236.1| hypothetical protein B576_06765 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|427564782|ref|ZP_18931939.1| hypothetical protein B577_06119 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|427584718|ref|ZP_18936736.1| hypothetical protein B573_06160 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|427607148|ref|ZP_18941550.1| hypothetical protein B574_06188 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|427632246|ref|ZP_18946497.1| hypothetical protein B575_06751 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|427655539|ref|ZP_18951255.1| hypothetical protein B578_06371 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|427660674|ref|ZP_18956162.1| hypothetical protein B579_06996 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|427666696|ref|ZP_18960932.1| hypothetical protein B580_06548 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|427754348|ref|ZP_18966052.1| hypothetical protein B581_07979 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
gi|33517081|sp|Q8ZPS5.1|YDIU_SALTY RecName: Full=UPF0061 protein YdiU
gi|16419864|gb|AAL20270.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|205327742|gb|EDZ14506.1| protein YdiU [Salmonella enterica subsp. enterica serovar
4,[5],12:i:- str. CVM23701]
gi|261246554|emb|CBG24364.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. D23580]
gi|267993227|gb|ACY88112.1| hypothetical protein STM14_1633 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 14028S]
gi|312912330|dbj|BAJ36304.1| hypothetical protein STMDT12_C13610 [Salmonella enterica subsp.
enterica serovar Typhimurium str. T000240]
gi|321223973|gb|EFX49036.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Typhimurium str. TN061786]
gi|332988233|gb|AEF07216.1| hypothetical protein STMUK_1312 [Salmonella enterica subsp.
enterica serovar Typhimurium str. UK-1]
gi|414020301|gb|EKT03888.1| hypothetical protein B571_06665 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|414020538|gb|EKT04117.1| hypothetical protein B576_06765 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|414022071|gb|EKT05572.1| hypothetical protein B572_06617 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|414034415|gb|EKT17342.1| hypothetical protein B577_06119 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|414035771|gb|EKT18627.1| hypothetical protein B573_06160 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|414039285|gb|EKT21962.1| hypothetical protein B574_06188 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|414048786|gb|EKT31020.1| hypothetical protein B578_06371 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|414050352|gb|EKT32528.1| hypothetical protein B575_06751 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|414054895|gb|EKT36821.1| hypothetical protein B579_06996 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|414060373|gb|EKT41888.1| hypothetical protein B580_06548 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|414066054|gb|EKT46686.1| hypothetical protein B581_07979 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm5]
Length = 480
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 297/521 (57%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YAR PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYARRPPEWGKRLEV---SCSS 480
>gi|330817253|ref|YP_004360958.1| hypothetical protein bgla_1g23750 [Burkholderia gladioli BSR3]
gi|327369646|gb|AEA61002.1| hypothetical protein bgla_1g23750 [Burkholderia gladioli BSR3]
Length = 521
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 227/544 (41%), Positives = 297/544 (54%), Gaps = 65/544 (11%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ VA L LDP P F F G
Sbjct: 24 PRDDAFLK--LGAAFLTRLPAAPLPAPYVVGFSDDVAAELGLDPAIRALPGFAELFCGNP 81
Query: 179 PL---AGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSR 235
A A+PY+ Y GHQFG+WAGQLGDGRA+ +GEI + + R+ELQLKGAG+TPYSR
Sbjct: 82 SRDWPAEALPYSSVYSGHQFGVWAGQLGDGRALNVGEIEH-EGRRFELQLKGAGRTPYSR 140
Query: 236 FADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVC 295
DG AVLRSSIREFLCSEAMH LGIPTTRAL + + + V R+ E A+V
Sbjct: 141 MGDGRAVLRSSIREFLCSEAMHHLGIPTTRALTVTGSDQTVMRETV-------ETAAVVT 193
Query: 296 RVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHS 355
RVA+SF+RFG ++ S + DL ++ LAD+ I D +
Sbjct: 194 RVAESFVRFGHFEHFFSNDRPDL--LKQLADHVI---------------------DRFYP 230
Query: 356 VVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAF 415
+ Y A V +RTA +VAQWQ VGF HGV+NTDNMSILGLT+DYGPFGF+DAF
Sbjct: 231 ACGEAEDPYLALLEAVMQRTAKMVAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFVDAF 290
Query: 416 DPSFTPNTTDLPGRRYCFANQPDIGLWN---IAQFSTTLAAAKLID----------DKEA 462
D N TD G RY + QP I WN +AQ L + +D ++A
Sbjct: 291 DAGHICNHTDQQG-RYAYRMQPRISHWNCFCLAQALLPLIGQQRVDLEDDPRTERAVEDA 349
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV 519
V+ R+ F + M KLGL + + + ++LL M D+T FR L+ +
Sbjct: 350 QAVLSRFPETFGPALEGAMRAKLGLALEQEGDAALANRLLEIMHGSHADFTLTFRRLAQL 409
Query: 520 KADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNP 579
+ + P++ + +D +EA+ W Y L D ER A MN VNP
Sbjct: 410 SKHDANSD----APVRDLFID-----REAFDGWAAQYRARLADETRDDAERAAAMNRVNP 460
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCML 639
KYVLRN+L ++AI A D+ EV RL ++ RP+DEQP E YA LPP WA G +
Sbjct: 461 KYVLRNHLAETAIRRAAEKDYSEVERLAAILRRPFDEQPEHEAYAALPPDWA---GTLEV 517
Query: 640 SCSS 643
SCSS
Sbjct: 518 SCSS 521
>gi|409406043|ref|ZP_11254505.1| hypothetical protein GWL_16580 [Herbaspirillum sp. GW103]
gi|386434592|gb|EIJ47417.1| hypothetical protein GWL_16580 [Herbaspirillum sp. GW103]
Length = 491
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 225/519 (43%), Positives = 297/519 (57%), Gaps = 53/519 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T++ P+ + P LV +SE+ A ++ L E F F+G G++P + Y
Sbjct: 20 AFHTRLQPTP-LPAPYLVGFSEAAAATVGLSRPAHEDDSFLDVFAGNRIAPGSLPLSAVY 78
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
GHQFG+WAGQLGDGRAITLG++ + R ELQLKGAG+TPYSR DG AVLRSSIRE
Sbjct: 79 SGHQFGVWAGQLGDGRAITLGDLPAADGQGRIELQLKGAGQTPYSRMGDGRAVLRSSIRE 138
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
FLCSEAM LGIPTTRAL ++ + + V R+ E A+V R+A SF+RFGS++
Sbjct: 139 FLCSEAMAALGIPTTRALTVIGSDQRVLRE-------TPETAAVVTRMAPSFIRFGSFE- 190
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
H Q D ++ LAD + + + +N Y A
Sbjct: 191 HWYYNQR-FDDLKILADTVLEQFYPQLLT---------------------EANPYQALLR 228
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EV RTA+L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N TD G
Sbjct: 229 EVTRRTATLMAQWQAVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDARHICNHTDSQG- 287
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY + QP IG WN F+ A LI E + Y F + A++ KLGL
Sbjct: 288 RYSYQMQPRIGQWNC--FALGQAMLPLIGSVEQTEAALADYEAIFQARHDALLHAKLGLN 345
Query: 489 KY---NKQIISKLLNNMAVDKVDYTNFFRALSNVK-ADPSIPEDELLVPLKAVLLDIGKE 544
+ Q+I L + + VD+T FFR L +++ +P +DE PL+ ++LD
Sbjct: 346 TRQPDDDQLIQALFAILQANHVDFTLFFRRLGDLRIGNPE--QDE---PLRDLILD---- 396
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
+ A+ +W Y Q L + DE R+ M +VNPKYVLRNYL Q AID A+ DF EV
Sbjct: 397 -RPAFDAWAAQYRQRLRAEDSDDEARRLAMQAVNPKYVLRNYLAQVAIDKAQQKDFSEVA 455
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +++ P+DEQP ++YA LPP WA V SCSS
Sbjct: 456 RLQQILRHPFDEQPEFDRYADLPPDWASHLEV---SCSS 491
>gi|383758286|ref|YP_005437271.1| hypothetical protein RGE_24310 [Rubrivivax gelatinosus IL144]
gi|381378955|dbj|BAL95772.1| hypothetical protein RGE_24310 [Rubrivivax gelatinosus IL144]
Length = 497
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 219/475 (46%), Positives = 274/475 (57%), Gaps = 49/475 (10%)
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
L A P G + A Y GHQFG+WAGQLGDGRA+ LGE + ELQLKG+G T
Sbjct: 69 LLAGNAQPAGGTL--ATVYSGHQFGVWAGQLGDGRALLLGEA-DTPLGPLELQLKGSGLT 125
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR DG AVLRSSIRE+L SEAMH LGIPTTRAL LV + V R+ + E
Sbjct: 126 PYSRMGDGRAVLRSSIREYLGSEAMHALGIPTTRALALVGSPLPVRRE-------RVETA 178
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
A+V RVA SFLRFG ++ H + D +R LAD I +F ++E+
Sbjct: 179 AVVTRVAPSFLRFGHFE-HFAHTAADEAALRRLADDTIERYF-----PAQAEA------- 225
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
+N+YAA EVA RTA LVAQWQ VGF HGV+NTDNMS+LGLTIDYGPFGF
Sbjct: 226 ---------ANRYAALLEEVARRTARLVAQWQAVGFCHGVMNTDNMSLLGLTIDYGPFGF 276
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGT 471
LDAFDP N +D G RY +A QP++ WN+ + L ++D A +E Y T
Sbjct: 277 LDAFDPGHVCNHSDHQG-RYAYARQPNVAFWNLHALAQALLPL-IVDSDAAVAALEPYKT 334
Query: 472 KFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPED 528
+F+ Q M KLGL + ++ LL MA D DYT FR L+ + P D
Sbjct: 335 EFLAALQTAMRAKLGLRDERPEDGTLVDDLLRRMAADGADYTISFRRLARFDSTPGARND 394
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
L+ + LD +EA+ +W L Y + L + D ER+ M NPKYVLRN+L
Sbjct: 395 A----LRDMFLD-----REAFDAWALRYAERLRAESSLDAERRLRMERSNPKYVLRNHLA 445
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++AI AE GDFGEV RLL +++ P+DEQP E A PP WA + +SCSS
Sbjct: 446 ETAIRQAETGDFGEVSRLLAVLQHPFDEQPEHEALAGFPPDWARQ---LEISCSS 497
>gi|405355559|ref|ZP_11024734.1| Selenoprotein O and cysteine-containing protein [Chondromyces
apiculatus DSM 436]
gi|397091266|gb|EJJ22084.1| Selenoprotein O and cysteine-containing protein [Myxococcus sp.
(contaminant ex DSM 436)]
Length = 493
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 221/517 (42%), Positives = 286/517 (55%), Gaps = 57/517 (11%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
+V PS + +LV+ + S L+L P+E RP+F GA PL G P+A Y GH
Sbjct: 27 ARVQPS-PFPDAKLVSVNPSALKLLDLTPEEALRPEFVAALGGAQPLPGMEPFAMVYAGH 85
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG++ +LGDGRAI LGE+ N +W+L LKG G TP+SR DG AVLRS+IRE+LC
Sbjct: 86 QFGVYVPRLGDGRAILLGEVRNAAGAKWDLHLKGGGPTPFSRGGDGRAVLRSTIREYLCG 145
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAMH LGIPTTR L ++ + V R+ E GA++ R+A S +RFG+++
Sbjct: 146 EAMHGLGIPTTRGLGILGSHAPVYREAV-------ETGAMLVRMAPSHVRFGTFEFFHY- 197
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E + V TLAD+ I HF H+ G E ++A + EV E
Sbjct: 198 -TEQTEHVATLADHVITEHFPHL------------AGQE---------GRFARFYAEVVE 235
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGF+D F+P F N +D G RY F
Sbjct: 236 RTARLIAQWQAVGFAHGVMNTDNMSILGLTLDYGPFGFMDDFEPGFICNHSDDRG-RYAF 294
Query: 434 ANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY--- 490
QP IGLWN+A L L+ + EA + Y F + +M KLGL +
Sbjct: 295 DQQPRIGLWNLACLGEAL--LTLLSEDEARATLGTYQPTFNAHFMDVMRAKLGLREAQDE 352
Query: 491 NKQIISKLLNNMAVDKVDYTNFFRALSNVKA----DPSIPEDELLVPLKAVLLDIGKERK 546
++ ++S L MA +VDYT FFRAL + + PS D P
Sbjct: 353 DRALVSDLFACMAEARVDYTRFFRALGGLASADGDGPSPVRDMFTAP------------- 399
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
E + +W Y L + G D ER+A M+ VNPKYVLRN++ Q AI AE GDF V RL
Sbjct: 400 EGFDAWAARYRARLAAEGSVDAERRARMDRVNPKYVLRNWVAQEAISRAEAGDFSVVDRL 459
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L ++ P+ E P E YA PP W V SCSS
Sbjct: 460 LGVLADPFAEHPDAEAYAAAPPVWGRHLAV---SCSS 493
>gi|444367143|ref|ZP_21167132.1| hypothetical protein BURCENK562V_3571 [Burkholderia cenocepacia
K56-2Valvano]
gi|443603421|gb|ELT71429.1| hypothetical protein BURCENK562V_3571 [Burkholderia cenocepacia
K56-2Valvano]
Length = 522
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 224/536 (41%), Positives = 295/536 (55%), Gaps = 71/536 (13%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPY 186
A +T++ P+A + P +V +S+ VA L+L P +P F F+G P A A+PY
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDEVAQLLDLPPTLAAQPGFAELFTG-NPTRDWPANAMPY 92
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSS
Sbjct: 93 ASVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGGGRTPYSRMGDGRAVLRSS 152
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V R ++SF+RFG
Sbjct: 153 IREFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRASESFVRFGH 205
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ S + DL +R LAD+ I D H + Y A
Sbjct: 206 FEHFFSNDRPDL--LRQLADHVI---------------------DRFHPACRDADDPYLA 242
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 243 LLEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDT 302
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGT 471
G RY + QP I WN + L A + +DD +A V+ ++
Sbjct: 303 GG-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPE 359
Query: 472 KFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPE 527
+F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 360 RFGPALERAMRAKLGLALEREGDAELANKLLETMHASHADFTLTFRRLAQISKHDASRD- 418
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
P++ + +D +EA+ +W Y L D R MN NPKYVLRN+L
Sbjct: 419 ----APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAVAMNRANPKYVLRNHL 469
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 470 AEVAIRRAKEKDFSEVERLAQILRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|148672432|gb|EDL04379.1| RIKEN cDNA 1300018J18, isoform CRA_c [Mus musculus]
Length = 664
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 250/630 (39%), Positives = 325/630 (51%), Gaps = 117/630 (18%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +RELP G + + PR V AC+++ P A + P+LVA SE
Sbjct: 46 LAGLRFDNRALRELPVETPPPGPEDSLATPRPVPGACFSRARP-APLRRPRLVALSEPAL 104
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L+ E + LFFSG L G P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 105 ALLGLEASEEAEVEAEAALFFSGNALLPGTEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 164
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+
Sbjct: 165 CTAAGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSE 224
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V R+A +F+RFGS++I H R + DI L
Sbjct: 225 STVMRDVFYDGNPKYEKCTVVLRIAPTFIRFGSFEIFKPPDEHTGRAGPSVGRDDIRVQL 284
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + T D D+ + AA+ EV +RTA +VA+WQ
Sbjct: 285 LDYVISSFYPEIQAAH--------TCDTDNI------QRNAAFFREVTQRTARMVAEWQC 330
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D GR Y ++ QP + WN+
Sbjct: 331 VGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHICNASDNAGR-YTYSKQPQVCKWNL 389
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKL-- 498
+ + L + EA + E + T+F Y M KKLGL + K+ +++KL
Sbjct: 390 QKLAEALEPELPLALAEA-ILKEEFDTEFQRHYLQKMRKKLGLIRVEKEEDGTLVAKLLE 448
Query: 499 ---------------LNNMAVDKVDYTNFFRALSNVKA-------------DP------- 523
L++ D D F L++ A DP
Sbjct: 449 TMHLTGADFTNTFCVLSSFPADLSDSAEFLSRLTSQCASLEELRLAFRPQMDPRQLSMML 508
Query: 524 ----SIPEDELLVPLKAVLL------------------DIGKERKEAWISWVLSYIQEL- 560
S P+ L+ +A + D+ ++ ++ W +W+ Y L
Sbjct: 509 MLAQSNPQLFALIGTQANVTKELERVEHQSRLEQLSPSDLQRKNRDHWEAWLQEYRDRLD 568
Query: 561 -LSSGISD-----EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPY 614
G+ D ER +M + NPKYVLRNY+ Q AI+AAE GDF EVRR+LKL+E PY
Sbjct: 569 KEKEGVGDTAAWQAERVRVMRANNPKYVLRNYIAQKAIEAAENGDFSEVRRVLKLLESPY 628
Query: 615 ---DEQPGMEKYAR----------LPPAWA 631
+E G E AR PP WA
Sbjct: 629 HSEEEATGPEAVARSTEEQSSYSNRPPLWA 658
>gi|50120772|ref|YP_049939.1| hypothetical protein ECA1842 [Pectobacterium atrosepticum SCRI1043]
gi|81645339|sp|Q6D646.1|Y1842_ERWCT RecName: Full=UPF0061 protein ECA1842
gi|49611298|emb|CAG74745.1| conserved hypothetical protein [Pectobacterium atrosepticum
SCRI1043]
Length = 483
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 223/515 (43%), Positives = 283/515 (54%), Gaps = 54/515 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P+ + +L+ SE +A L L F P+ +SG L G P AQ Y G
Sbjct: 19 YTALQPTP-LHGARLLYHSEGLASELGLSSDWFT-PEQDDVWSGTRLLPGMEPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS+IREFL
Sbjct: 77 HQFGSWAGQLGDGRGILLGEQQLADGRSMDWHLKGAGLTPYSRMGDGRAVLRSAIREFLA 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL +VT+ V R+ +EE GA++ RVA+S +RFG ++
Sbjct: 137 SEAMHHLGIPTTRALTIVTSQHPVQRE-------QEEKGAMLLRVAESHVRFGHFEHFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + + VR L +Y I H+ EN DE +Y W +V
Sbjct: 190 R--REPEKVRQLVEYVIARHWPQWEN------------DE---------RRYELWFGDVV 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+ WQ VGF+HGV+NTDNMSILGLTIDYGP+GFLDA+ P F N +D G RY
Sbjct: 227 ERTARLITHWQAVGFSHGVMNTDNMSILGLTIDYGPYGFLDAYQPDFICNHSDHRG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL----P 488
F NQP +GLWN+ + L+ L+D + RY M Y +M KLGL P
Sbjct: 286 FDNQPAVGLWNLHRLGQALSG--LMDTDTLERALARYEPALMQHYGTLMRAKLGLFTASP 343
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
N ++ LL M + DYT FR L++ + S PL+ +D + A
Sbjct: 344 DDNDVLVG-LLRLMQKEGSDYTRTFRLLADSEKQASRS------PLRDEFID-----RAA 391
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ SW +Y Q L+ DEER+ LMN+ NPKY+LRNYL Q AI+ AE D + RL +
Sbjct: 392 FDSWFATYRQRLMQEDQDDEERRRLMNATNPKYILRNYLAQMAIERAESDDTSALARLHQ 451
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RP+DEQP A LPP W +SCSS
Sbjct: 452 ALCRPFDEQPDSHDLAALPPDWGKH---LEISCSS 483
>gi|319793853|ref|YP_004155493.1| hypothetical protein Varpa_3196 [Variovorax paradoxus EPS]
gi|315596316|gb|ADU37382.1| protein of unknown function UPF0061 [Variovorax paradoxus EPS]
Length = 493
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 229/518 (44%), Positives = 298/518 (57%), Gaps = 56/518 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF-FSGATPLAGAVPYAQC 189
A T + P+ + +P V SE+VA L L P ++ + D L +G+ P +G P+A
Sbjct: 27 AFLTHLRPT-PLPDPYWVGHSEAVARELGL-PADWRQSDTTLAALTGSLPASGTNPFATV 84
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG+WAGQLGDGRAI LGE E+QLKGAG+TPYSR DG AVLRSSIRE
Sbjct: 85 YSGHQFGVWAGQLGDGRAIMLGE----TEGGLEVQLKGAGRTPYSRGGDGRAVLRSSIRE 140
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
FLCSEAMH LGIPTTRAL + + V R+ + E A+V RVA SF+RFG ++
Sbjct: 141 FLCSEAMHGLGIPTTRALSVTGSDARVYRE-------EPESAAVVARVAPSFIRFGHFEH 193
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
A+ +ED +R L DY I ++ ++ N YAA+
Sbjct: 194 FAANQREDE--LRALTDYVIDRYYPACRTTDR-----------------FNGNAYAAFLE 234
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
V+ERTA+L+AQWQ VGF HGV+NTDNMSILGLTIDYGPF FLD FDP N +D G
Sbjct: 235 AVSERTAALLAQWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDGFDPRHICNHSDTSG- 293
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F QP++ WN+ F A LI D+E A +E Y T F + ++A M KLGL
Sbjct: 294 RYAFNQQPNVAYWNL--FCLAQALLPLIGDQEVAVAALESYKTVFPNAFEARMRAKLGLA 351
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ ++ +I +L +A KVDYT F+R LS AD + P++ + LD
Sbjct: 352 DAAEADRALIEGVLKLLAAGKVDYTIFWRRLSQYMADGNAE------PVRDLFLD----- 400
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+ + +W+LS+ + S S E LM +NPKYVLRN+L Q AI+AA DF V
Sbjct: 401 RAGFDAWLLSFSERHAQSVRS--EAADLMLQLNPKYVLRNHLGQQAIEAAAQKDFSGVAT 458
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LL L+E P++E G + YA PP WA +SCSS
Sbjct: 459 LLTLLETPFEEHSGADAYAGFPPDWA---STIEISCSS 493
>gi|338530554|ref|YP_004663888.1| hypothetical protein LILAB_04445 [Myxococcus fulvus HW-1]
gi|337256650|gb|AEI62810.1| hypothetical protein LILAB_04445 [Myxococcus fulvus HW-1]
Length = 486
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 225/549 (40%), Positives = 295/549 (53%), Gaps = 72/549 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LE L +D+++ R LP +V PS + +LV+ + + L
Sbjct: 6 MATLEQLRFDNTYAR-LPA-------------GFGARVHPS-PFPDARLVSVNPAALKLL 50
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P+E RP+F G PL G P+A Y GHQFG++ +LGDGRA+ LGE+ N
Sbjct: 51 DLAPEEAARPEFVAAMGGERPLPGMEPFAMVYAGHQFGVYVPRLGDGRALLLGEVRNAAG 110
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+W+L LKG G TP+SR DG AVLRS++RE+LC EAMH LGIPTTR L ++ + V R
Sbjct: 111 AKWDLHLKGGGPTPFSRGGDGRAVLRSTVREYLCGEAMHGLGIPTTRGLGILGSQAPVYR 170
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ E GA++ R+A S +RFG+++ H + E + V TLAD+ I HF H+
Sbjct: 171 EAV-------ETGAMLVRMAPSHVRFGTFEYFHYT---EQTEHVATLADHVIAEHFPHL- 219
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
G E ++A + EV ERTA L+AQWQ VGF HGV+NTDNM
Sbjct: 220 -----------AGQE---------GRHARFYAEVVERTARLIAQWQAVGFAHGVMNTDNM 259
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILGLT+DYGPFGFLD F+P F N +D G RY F QP IGLWN+A L LI
Sbjct: 260 SILGLTLDYGPFGFLDDFEPGFICNHSDDRG-RYAFDQQPRIGLWNLACLGEAL--LTLI 316
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ EA + Y F + M KLGL + +++++S L +A +VDYT FFR
Sbjct: 317 SEDEARAALATYQPTFNAHFMDRMRAKLGLREARDEDRELVSDLFTRLAEARVDYTRFFR 376
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
AL + D D P E + +W Y L + G D ER A M
Sbjct: 377 ALGS---DVRPVRDMFPAP-------------EGFDAWAGRYRARLDAEGSVDAERHARM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
VNPKYVLRN++ Q AI AE GDF V RLL ++ P+ E P E YA PP W
Sbjct: 421 ARVNPKYVLRNWVAQEAISRAEAGDFSLVDRLLGVLADPFAEHPDAEPYAAAPPVWGRHL 480
Query: 635 GVCMLSCSS 643
V SCSS
Sbjct: 481 AV---SCSS 486
>gi|378699234|ref|YP_005181191.1| hypothetical protein SL1344_1279 [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|379700517|ref|YP_005242245.1| hypothetical protein STM474_1349 [Salmonella enterica subsp.
enterica serovar Typhimurium str. ST4/74]
gi|383496058|ref|YP_005396747.1| hypothetical protein UMN798_1401 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|301157882|emb|CBW17376.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. SL1344]
gi|323129616|gb|ADX17046.1| UPF0061 protein ydiU [Salmonella enterica subsp. enterica serovar
Typhimurium str. ST4/74]
gi|380462879|gb|AFD58282.1| hypothetical protein UMN798_1401 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
Length = 480
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------PEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL ID N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTLIPFIEID--ALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YAR PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYARRPPEWGKRLEV---SCSS 480
>gi|401676099|ref|ZP_10808085.1| YdiU Protein [Enterobacter sp. SST3]
gi|400216585|gb|EJO47485.1| YdiU Protein [Enterobacter sp. SST3]
Length = 480
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 214/514 (41%), Positives = 291/514 (56%), Gaps = 53/514 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P+ ++N +L+ +++ +A+ L + P+ +R + G T LAG P AQ Y G
Sbjct: 17 YTALKPTP-LQNSRLIWYNDRLAEELAIPPELLQRSGSAGVWGGETLLAGMQPLAQVYSG 75
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGR I LGE E + LKGAG TPYSR DG AVLRS+IRE L
Sbjct: 76 HQFGVWAGQLGDGRGILLGEQQLPNGETVDWHLKGAGLTPYSRMGDGRAVLRSTIRECLG 135
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+AQS LRFG ++
Sbjct: 136 SEAMHALGIPTTRALSIVTSDTPVARETV-------EKGAMLMRIAQSHLRFGHFEHFYY 188
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + D VR LAD+AIRHH+ H+++ ++KY W +V
Sbjct: 189 R--REPDKVRQLADFAIRHHWAHLQD---------------------DADKYVLWFRDVV 225
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA+L+A+WQ VGF HGV+NTDNMS+LGLT DYGPFGFLD + P + N +D G RY
Sbjct: 226 ARTAALIARWQTVGFAHGVMNTDNMSLLGLTFDYGPFGFLDDYQPGYICNHSDYQG-RYS 284
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F NQP +GLWN+ + + TL + ID N ++ Y + EY ++M KLGL K
Sbjct: 285 FDNQPAVGLWNLQRLAQTL--SPFIDVDALNDALDSYQAILLREYGSLMRNKLGLVTQEK 342
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ I++ L MA + DYT FR L + + PL+ +D ++A+
Sbjct: 343 GDNDILNGLFALMAREGSDYTRTFRMLGQTEQHSAAS------PLRDEFID-----RQAF 391
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
W SY L + D R+A MN+ NP VLRN+L Q AI+ AE G++ E+ RL
Sbjct: 392 DDWFASYRTRLQQEQVDDVTRQAQMNATNPAMVLRNWLAQRAIEQAEQGEYDELHRLHVA 451
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+ ++ + Y PP W R V SCSS
Sbjct: 452 LRTPFADRD--DDYVSRPPKWGKRLEV---SCSS 480
>gi|417240864|ref|ZP_12037031.1| hypothetical protein EC90111_0207 [Escherichia coli 9.0111]
gi|386212508|gb|EII22953.1| hypothetical protein EC90111_0207 [Escherichia coli 9.0111]
Length = 478
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417167881|ref|ZP_12000503.1| hypothetical protein EC970259_2007 [Escherichia coli 99.0741]
gi|419864460|ref|ZP_14386910.1| hypothetical protein ECO9340_14373 [Escherichia coli O103:H25 str.
CVM9340]
gi|386170907|gb|EIH42955.1| hypothetical protein EC970259_2007 [Escherichia coli 99.0741]
gi|388340113|gb|EIL06394.1| hypothetical protein ECO9340_14373 [Escherichia coli O103:H25 str.
CVM9340]
Length = 478
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|419278023|ref|ZP_13820281.1| hypothetical protein ECDEC10E_1975 [Escherichia coli DEC10E]
gi|419375571|ref|ZP_13916601.1| hypothetical protein ECDEC14B_2145 [Escherichia coli DEC14B]
gi|419380813|ref|ZP_13921774.1| hypothetical protein ECDEC14C_1970 [Escherichia coli DEC14C]
gi|419386166|ref|ZP_13927048.1| hypothetical protein ECDEC14D_1971 [Escherichia coli DEC14D]
gi|378130803|gb|EHW92166.1| hypothetical protein ECDEC10E_1975 [Escherichia coli DEC10E]
gi|378221445|gb|EHX81694.1| hypothetical protein ECDEC14B_2145 [Escherichia coli DEC14B]
gi|378229689|gb|EHX89825.1| hypothetical protein ECDEC14C_1970 [Escherichia coli DEC14C]
gi|378232641|gb|EHX92739.1| hypothetical protein ECDEC14D_1971 [Escherichia coli DEC14D]
Length = 478
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|393776995|ref|ZP_10365289.1| hypothetical protein MW7_1976 [Ralstonia sp. PBA]
gi|392716352|gb|EIZ03932.1| hypothetical protein MW7_1976 [Ralstonia sp. PBA]
Length = 523
Score = 365 bits (937), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 223/534 (41%), Positives = 296/534 (55%), Gaps = 72/534 (13%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P A + +P L+ +SE L LD + + DF F+G + A P A Y G
Sbjct: 39 FTRLPP-APLPDPVLIDFSEEAGTMLGLDRQAAQAQDFVEVFTGNRIPSWADPLATVYSG 97
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRA+ L E+ E+QLKGAG+TPYSR ADG AVLRSSIREFLC
Sbjct: 98 HQFGVWAGQLGDGRALRLAEVATADGP-LEVQLKGAGRTPYSRMADGRAVLRSSIREFLC 156
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGIPT+RALC+ + V R+ E A+V R+A SF+RFG ++ +
Sbjct: 157 SEAMAGLGIPTSRALCITGSNAPVRREEI-------ETAAVVTRLAPSFIRFGHFEHFGA 209
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R +D+ +R LAD+ I D + + YAA EV
Sbjct: 210 R--DDIAALRQLADFVI---------------------DRFYPQCRAAAQPYAALLREVT 246
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA L+A WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD F+ + N +D GR Y
Sbjct: 247 VRTADLMADWQAVGFCHGVMNTDNMSILGLTIDYGPFGFLDGFNANHICNHSDTQGR-YA 305
Query: 433 FANQPDIGLWNIAQFSTTL---------AAAKLIDD--KEANYVM---------ERYGTK 472
+ QP IG WN+ + + AA+ D+ +EA + ERY
Sbjct: 306 YQQQPQIGFWNLHCLAQAMLPLLLDPHGTAAESDDESRQEAAIALAHESLGAFRERYAAA 365
Query: 473 FMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDE 529
F+ Y+A KLGL ++Q+++++ + ++DYT FFR L+ + S +D
Sbjct: 366 FLARYRA----KLGLATTQDNDEQLLAEMFGMLHAQRIDYTLFFRNLAAI----SSTDDS 417
Query: 530 LLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQ 589
P++ + LD + AW +W SY Q L DE R M +VNPKY+LRN+L +
Sbjct: 418 QDAPVRDLFLD-----RSAWQAWAASYRQRLQLEHSVDEARSTAMRAVNPKYILRNHLAE 472
Query: 590 SAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AI A DF EV RL +L+ RP+DEQP M YA LPP WA G +SCSS
Sbjct: 473 IAIRRARENDFSEVARLRQLLSRPFDEQPDMAHYAALPPDWA---GGLEVSCSS 523
>gi|193065279|ref|ZP_03046351.1| conserved hypothetical protein [Escherichia coli E22]
gi|194429486|ref|ZP_03062008.1| conserved hypothetical protein [Escherichia coli B171]
gi|209919022|ref|YP_002293106.1| hypothetical protein ECSE_1831 [Escherichia coli SE11]
gi|260844011|ref|YP_003221789.1| hypothetical protein ECO103_1850 [Escherichia coli O103:H2 str.
12009]
gi|415794890|ref|ZP_11496637.1| hypothetical protein ECE128010_0294 [Escherichia coli E128010]
gi|417172178|ref|ZP_12002211.1| hypothetical protein EC32608_1368 [Escherichia coli 3.2608]
gi|417252002|ref|ZP_12043765.1| hypothetical protein EC40967_4966 [Escherichia coli 4.0967]
gi|417623394|ref|ZP_12273701.1| hypothetical protein ECSTECH18_2144 [Escherichia coli STEC_H.1.8]
gi|419289601|ref|ZP_13831696.1| hypothetical protein ECDEC11A_1952 [Escherichia coli DEC11A]
gi|419294891|ref|ZP_13836937.1| hypothetical protein ECDEC11B_1960 [Escherichia coli DEC11B]
gi|419300252|ref|ZP_13842254.1| hypothetical protein ECDEC11C_2126 [Escherichia coli DEC11C]
gi|419306349|ref|ZP_13848253.1| hypothetical protein ECDEC11D_1913 [Escherichia coli DEC11D]
gi|419311372|ref|ZP_13853240.1| hypothetical protein ECDEC11E_1904 [Escherichia coli DEC11E]
gi|419322800|ref|ZP_13864513.1| hypothetical protein ECDEC12B_2297 [Escherichia coli DEC12B]
gi|419334400|ref|ZP_13875944.1| hypothetical protein ECDEC12D_2163 [Escherichia coli DEC12D]
gi|419869345|ref|ZP_14391549.1| hypothetical protein ECO9450_17681 [Escherichia coli O103:H2 str.
CVM9450]
gi|419930400|ref|ZP_14448004.1| hypothetical protein EC5411_18985 [Escherichia coli 541-1]
gi|420391385|ref|ZP_14890642.1| hypothetical protein ECEPECC34262_2214 [Escherichia coli EPEC
C342-62]
gi|422355554|ref|ZP_16436268.1| SelO family protein [Escherichia coli MS 117-3]
gi|432481050|ref|ZP_19723008.1| hypothetical protein A15U_02165 [Escherichia coli KTE210]
gi|226725730|sp|B6I8R1.1|YDIU_ECOSE RecName: Full=UPF0061 protein YdiU
gi|192927073|gb|EDV81695.1| conserved hypothetical protein [Escherichia coli E22]
gi|194412450|gb|EDX28750.1| conserved hypothetical protein [Escherichia coli B171]
gi|209912281|dbj|BAG77355.1| conserved hypothetical protein [Escherichia coli SE11]
gi|257759158|dbj|BAI30655.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009]
gi|323163443|gb|EFZ49269.1| hypothetical protein ECE128010_0294 [Escherichia coli E128010]
gi|324016459|gb|EGB85678.1| SelO family protein [Escherichia coli MS 117-3]
gi|345380035|gb|EGX11941.1| hypothetical protein ECSTECH18_2144 [Escherichia coli STEC_H.1.8]
gi|378131532|gb|EHW92889.1| hypothetical protein ECDEC11A_1952 [Escherichia coli DEC11A]
gi|378141978|gb|EHX03180.1| hypothetical protein ECDEC11B_1960 [Escherichia coli DEC11B]
gi|378149784|gb|EHX10904.1| hypothetical protein ECDEC11D_1913 [Escherichia coli DEC11D]
gi|378152222|gb|EHX13323.1| hypothetical protein ECDEC11C_2126 [Escherichia coli DEC11C]
gi|378159029|gb|EHX20043.1| hypothetical protein ECDEC11E_1904 [Escherichia coli DEC11E]
gi|378169456|gb|EHX30354.1| hypothetical protein ECDEC12B_2297 [Escherichia coli DEC12B]
gi|378186613|gb|EHX47236.1| hypothetical protein ECDEC12D_2163 [Escherichia coli DEC12D]
gi|386179876|gb|EIH57350.1| hypothetical protein EC32608_1368 [Escherichia coli 3.2608]
gi|386217577|gb|EII34062.1| hypothetical protein EC40967_4966 [Escherichia coli 4.0967]
gi|388342550|gb|EIL08584.1| hypothetical protein ECO9450_17681 [Escherichia coli O103:H2 str.
CVM9450]
gi|388400254|gb|EIL61006.1| hypothetical protein EC5411_18985 [Escherichia coli 541-1]
gi|391313150|gb|EIQ70743.1| hypothetical protein ECEPECC34262_2214 [Escherichia coli EPEC
C342-62]
gi|431007707|gb|ELD22518.1| hypothetical protein A15U_02165 [Escherichia coli KTE210]
Length = 478
Score = 365 bits (936), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|418043902|ref|ZP_12682054.1| hypothetical protein ECW26_42850 [Escherichia coli W26]
gi|419391621|ref|ZP_13932436.1| hypothetical protein ECDEC15A_2220 [Escherichia coli DEC15A]
gi|419396618|ref|ZP_13937394.1| hypothetical protein ECDEC15B_1917 [Escherichia coli DEC15B]
gi|419402025|ref|ZP_13942750.1| hypothetical protein ECDEC15C_1937 [Escherichia coli DEC15C]
gi|419407168|ref|ZP_13947859.1| hypothetical protein ECDEC15D_1870 [Escherichia coli DEC15D]
gi|419412703|ref|ZP_13953359.1| hypothetical protein ECDEC15E_2207 [Escherichia coli DEC15E]
gi|378238345|gb|EHX98346.1| hypothetical protein ECDEC15A_2220 [Escherichia coli DEC15A]
gi|378246774|gb|EHY06694.1| hypothetical protein ECDEC15B_1917 [Escherichia coli DEC15B]
gi|378247884|gb|EHY07799.1| hypothetical protein ECDEC15C_1937 [Escherichia coli DEC15C]
gi|378255418|gb|EHY15276.1| hypothetical protein ECDEC15D_1870 [Escherichia coli DEC15D]
gi|378259568|gb|EHY19380.1| hypothetical protein ECDEC15E_2207 [Escherichia coli DEC15E]
gi|383473319|gb|EID65346.1| hypothetical protein ECW26_42850 [Escherichia coli W26]
Length = 478
Score = 365 bits (936), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417628826|ref|ZP_12279066.1| hypothetical protein ECSTECMHI813_1742 [Escherichia coli
STEC_MHI813]
gi|345374040|gb|EGX05993.1| hypothetical protein ECSTECMHI813_1742 [Escherichia coli
STEC_MHI813]
Length = 478
Score = 364 bits (935), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSTAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|191167848|ref|ZP_03029653.1| conserved hypothetical protein [Escherichia coli B7A]
gi|309793476|ref|ZP_07687903.1| SelO family protein [Escherichia coli MS 145-7]
gi|190902107|gb|EDV61851.1| conserved hypothetical protein [Escherichia coli B7A]
gi|308123063|gb|EFO60325.1| SelO family protein [Escherichia coli MS 145-7]
Length = 478
Score = 364 bits (935), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 222/521 (42%), Positives = 298/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + P + D ++ G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSI-PSSLFKNDAGVW-GGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFITEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432602227|ref|ZP_19838471.1| hypothetical protein A1U5_02062 [Escherichia coli KTE66]
gi|431140801|gb|ELE42566.1| hypothetical protein A1U5_02062 [Escherichia coli KTE66]
Length = 478
Score = 364 bits (935), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGISSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|291282836|ref|YP_003499654.1| hypothetical protein G2583_2103 [Escherichia coli O55:H7 str.
CB9615]
gi|387506951|ref|YP_006159207.1| hypothetical protein ECO55CA74_10330 [Escherichia coli O55:H7 str.
RM12579]
gi|416773539|ref|ZP_11873746.1| hypothetical protein ECO5101_07502 [Escherichia coli O157:H7 str.
G5101]
gi|416785348|ref|ZP_11878644.1| hypothetical protein ECO9389_09243 [Escherichia coli O157:H- str.
493-89]
gi|416796340|ref|ZP_11883559.1| hypothetical protein ECO2687_03735 [Escherichia coli O157:H- str. H
2687]
gi|416818198|ref|ZP_11892898.1| hypothetical protein ECO7815_12670 [Escherichia coli O55:H7 str.
3256-97]
gi|416827313|ref|ZP_11897478.1| hypothetical protein ECO5905_08594 [Escherichia coli O55:H7 str.
USDA 5905]
gi|416828610|ref|ZP_11898098.1| hypothetical protein ECOSU61_21343 [Escherichia coli O157:H7 str.
LSU-61]
gi|419075557|ref|ZP_13621089.1| hypothetical protein ECDEC3F_2588 [Escherichia coli DEC3F]
gi|419114841|ref|ZP_13659863.1| hypothetical protein ECDEC5A_2008 [Escherichia coli DEC5A]
gi|419120466|ref|ZP_13665432.1| hypothetical protein ECDEC5B_2280 [Escherichia coli DEC5B]
gi|419126312|ref|ZP_13671201.1| hypothetical protein ECDEC5C_2142 [Escherichia coli DEC5C]
gi|419131634|ref|ZP_13676475.1| hypothetical protein ECDEC5D_2384 [Escherichia coli DEC5D]
gi|419136453|ref|ZP_13681254.1| hypothetical protein ECDEC5E_1947 [Escherichia coli DEC5E]
gi|420280910|ref|ZP_14783157.1| hypothetical protein ECTW06591_2160 [Escherichia coli TW06591]
gi|425144095|ref|ZP_18544156.1| hypothetical protein EC100869_2390 [Escherichia coli 10.0869]
gi|425249155|ref|ZP_18642151.1| hypothetical protein EC5905_2800 [Escherichia coli 5905]
gi|425261218|ref|ZP_18653306.1| hypothetical protein ECEC96038_2481 [Escherichia coli EC96038]
gi|425267254|ref|ZP_18658939.1| hypothetical protein EC5412_2534 [Escherichia coli 5412]
gi|445012291|ref|ZP_21328432.1| hypothetical protein ECPA48_2000 [Escherichia coli PA48]
gi|209768958|gb|ACI82791.1| hypothetical protein ECs2413 [Escherichia coli]
gi|209768964|gb|ACI82794.1| hypothetical protein ECs2413 [Escherichia coli]
gi|290762709|gb|ADD56670.1| UPF0061 protein ydiU [Escherichia coli O55:H7 str. CB9615]
gi|320641921|gb|EFX11289.1| hypothetical protein ECO5101_07502 [Escherichia coli O157:H7 str.
G5101]
gi|320647378|gb|EFX16186.1| hypothetical protein ECO9389_09243 [Escherichia coli O157:H- str.
493-89]
gi|320652672|gb|EFX20941.1| hypothetical protein ECO2687_03735 [Escherichia coli O157:H- str. H
2687]
gi|320653054|gb|EFX21250.1| hypothetical protein ECO7815_12670 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
gi|320658740|gb|EFX26417.1| hypothetical protein ECO5905_08594 [Escherichia coli O55:H7 str.
USDA 5905]
gi|320668730|gb|EFX35535.1| hypothetical protein ECOSU61_21343 [Escherichia coli O157:H7 str.
LSU-61]
gi|374358945|gb|AEZ40652.1| hypothetical protein ECO55CA74_10330 [Escherichia coli O55:H7 str.
RM12579]
gi|377923828|gb|EHU87789.1| hypothetical protein ECDEC3F_2588 [Escherichia coli DEC3F]
gi|377962046|gb|EHV25509.1| hypothetical protein ECDEC5A_2008 [Escherichia coli DEC5A]
gi|377968673|gb|EHV32064.1| hypothetical protein ECDEC5B_2280 [Escherichia coli DEC5B]
gi|377976367|gb|EHV39678.1| hypothetical protein ECDEC5C_2142 [Escherichia coli DEC5C]
gi|377977037|gb|EHV40338.1| hypothetical protein ECDEC5D_2384 [Escherichia coli DEC5D]
gi|377985641|gb|EHV48853.1| hypothetical protein ECDEC5E_1947 [Escherichia coli DEC5E]
gi|390782851|gb|EIO50485.1| hypothetical protein ECTW06591_2160 [Escherichia coli TW06591]
gi|408165576|gb|EKH93253.1| hypothetical protein EC5905_2800 [Escherichia coli 5905]
gi|408183799|gb|EKI10221.1| hypothetical protein ECEC96038_2481 [Escherichia coli EC96038]
gi|408184700|gb|EKI11017.1| hypothetical protein EC5412_2534 [Escherichia coli 5412]
gi|408594556|gb|EKK68837.1| hypothetical protein EC100869_2390 [Escherichia coli 10.0869]
gi|444626562|gb|ELW00354.1| hypothetical protein ECPA48_2000 [Escherichia coli PA48]
Length = 478
Score = 364 bits (935), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + D VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPDKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFL+ ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLNDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|110805485|ref|YP_689005.1| hypothetical protein SFV_1518 [Shigella flexneri 5 str. 8401]
gi|110615033|gb|ABF03700.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
Length = 496
Score = 364 bits (935), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 28 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 84
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 85 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 144
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 145 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 197
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 198 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 234
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 235 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 294
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 295 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 351
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 352 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLREEFID-- 403
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 404 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 460
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 461 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 496
>gi|385872312|gb|AFI90832.1| UPF0061 protein ydiU [Pectobacterium sp. SCC3193]
Length = 483
Score = 364 bits (935), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 223/515 (43%), Positives = 285/515 (55%), Gaps = 54/515 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P+ + +L+ SE +A L L F P + G L+G P AQ Y G
Sbjct: 19 YTALPPTP-LHGARLLYHSEGLAAELGLSSDWFT-PAQDNVWGGERLLSGMEPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFGMWAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS IREFL
Sbjct: 77 HQFGMWAGQLGDGRGILLGEQQLADGRSVDWHLKGAGLTPYSRMGDGRAVLRSVIREFLA 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH+LGIPTTRAL +VT+ V R+ +EE GA++ RVA+S +RFG ++
Sbjct: 137 SEAMHYLGIPTTRALTIVTSTHLVQRE-------QEEKGAMLLRVAESHVRFGHFEHFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + + VR L +Y I H+ EN DE +Y W +V
Sbjct: 190 R--REPEKVRQLVEYVIARHWPQWEN------------DE---------RRYELWFGDVV 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+ WQ VGF+HGV+NTDNMSILGLTIDYGP+GFLDA+ P F N +D G RY
Sbjct: 227 ERTARLITHWQAVGFSHGVMNTDNMSILGLTIDYGPYGFLDAYQPDFICNHSDHRG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL----P 488
F NQP +GLWN+ + + L+ L+D + RY M Y +M KLGL P
Sbjct: 286 FDNQPAVGLWNLHRLAQALSG--LMDTDTLERALARYEPALMQHYGTLMRAKLGLFTASP 343
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
N +++ LL M + DYT FR L++ + S +A L D +R A
Sbjct: 344 DDND-VLAGLLRLMQKEGSDYTRTFRLLADSEKQAS----------RASLRDEFIDRA-A 391
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ +W +Y Q L+ DEER+ LMN+ NPKY+LRNYL Q AI+ AE D + RL +
Sbjct: 392 FDNWFAAYRQRLMQEDQGDEERRRLMNATNPKYILRNYLAQMAIERAENDDISVLARLHQ 451
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RP+DEQP A LPP W +SCSS
Sbjct: 452 ALCRPFDEQPDNNDLAALPPDWGKH---LEISCSS 483
>gi|170733267|ref|YP_001765214.1| hypothetical protein Bcenmc03_1931 [Burkholderia cenocepacia MC0-3]
gi|226701083|sp|B1JTT5.1|Y1931_BURCC RecName: Full=UPF0061 protein Bcenmc03_1931
gi|169816509|gb|ACA91092.1| protein of unknown function UPF0061 [Burkholderia cenocepacia
MC0-3]
Length = 522
Score = 364 bits (935), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 223/536 (41%), Positives = 297/536 (55%), Gaps = 71/536 (13%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPY 186
A +T++ P+A + P +V +S+ VA L+L P +P F F+G P A A+PY
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDDVAQLLDLPPAIAAQPGFAELFAG-NPTRDWPAHAMPY 92
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSS
Sbjct: 93 ASVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGGGRTPYSRMGDGRAVLRSS 152
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG
Sbjct: 153 IREFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGH 205
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ S + DL +R LAD+ I + + + + Y A
Sbjct: 206 FEHFFSNDRPDL--LRQLADHVIDRFYPACRDAD---------------------DPYLA 242
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 243 LLEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDT 302
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGT 471
G RY + QP I WN + L A + +DD +A V+ ++
Sbjct: 303 SG-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPE 359
Query: 472 KFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPE 527
+F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 360 RFGPALERAMRAKLGLALEREGDAELANKLLETMHASHADFTLTFRRLAQISKHDASRD- 418
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
P++ + +D +EA+ +W Y L D R MN NPKYVLRN+L
Sbjct: 419 ----APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAVAMNRANPKYVLRNHL 469
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 470 AEVAIRRAKEKDFSEVERLAQILRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|331653107|ref|ZP_08354112.1| putative cytoplasmic protein [Escherichia coli M718]
gi|331049205|gb|EGI21277.1| putative cytoplasmic protein [Escherichia coli M718]
Length = 478
Score = 364 bits (935), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFL+ ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLNDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D+
Sbjct: 334 GFITEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFIDLA 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 388 -----AFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHGALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|332279143|ref|ZP_08391556.1| conserved hypothetical protein [Shigella sp. D9]
gi|332101495|gb|EGJ04841.1| conserved hypothetical protein [Shigella sp. D9]
Length = 478
Score = 364 bits (935), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLHRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|107028913|ref|YP_626008.1| hypothetical protein Bcen_6171 [Burkholderia cenocepacia AU 1054]
gi|116689929|ref|YP_835552.1| hypothetical protein Bcen2424_1908 [Burkholderia cenocepacia
HI2424]
gi|121957915|sp|Q1BH70.1|Y6171_BURCA RecName: Full=UPF0061 protein Bcen_6171
gi|166227489|sp|A0K832.1|Y1908_BURCH RecName: Full=UPF0061 protein Bcen2424_1908
gi|105898077|gb|ABF81035.1| protein of unknown function UPF0061 [Burkholderia cenocepacia AU
1054]
gi|116648018|gb|ABK08659.1| protein of unknown function UPF0061 [Burkholderia cenocepacia
HI2424]
Length = 522
Score = 364 bits (935), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 223/536 (41%), Positives = 297/536 (55%), Gaps = 71/536 (13%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPY 186
A +T++ P+A + P +V +S+ VA L+L P +P F F+G P A A+PY
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDDVAQLLDLPPSIAAQPGFAELFAG-NPTRDWPAHAMPY 92
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSS
Sbjct: 93 ASVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGGGRTPYSRMGDGRAVLRSS 152
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG
Sbjct: 153 IREFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGH 205
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ S + DL +R LAD+ I + + + + Y A
Sbjct: 206 FEHFFSNDRPDL--LRQLADHVIDRFYPACRDAD---------------------DPYLA 242
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 243 LLEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDT 302
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGT 471
G RY + QP I WN + L A + +DD +A V+ ++
Sbjct: 303 SG-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPE 359
Query: 472 KFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPE 527
+F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 360 RFGPALERAMRAKLGLELEREGDAELANKLLETMHASHADFTLTFRRLAQISKHDASRD- 418
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
P++ + +D +EA+ +W Y L D R MN NPKYVLRN+L
Sbjct: 419 ----APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAVAMNRANPKYVLRNHL 469
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 470 AEVAIRRAKEKDFSEVERLAQILRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|213428584|ref|ZP_03361334.1| hypothetical protein SentesTyphi_25491 [Salmonella enterica subsp.
enterica serovar Typhi str. E02-1180]
Length = 480
Score = 364 bits (934), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGIWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYRRES--EKVQQLADFAIRHYWPQWQDV---------------------AEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|415815820|ref|ZP_11507251.1| hypothetical protein ECLT68_5669 [Escherichia coli LT-68]
gi|417712683|ref|ZP_12361666.1| hypothetical protein SFK272_2413 [Shigella flexneri K-272]
gi|417717149|ref|ZP_12366067.1| hypothetical protein SFK227_1874 [Shigella flexneri K-227]
gi|420320215|ref|ZP_14822053.1| hypothetical protein SF285071_1831 [Shigella flexneri 2850-71]
gi|323170025|gb|EFZ55681.1| hypothetical protein ECLT68_5669 [Escherichia coli LT-68]
gi|333005950|gb|EGK25466.1| hypothetical protein SFK272_2413 [Shigella flexneri K-272]
gi|333018803|gb|EGK38096.1| hypothetical protein SFK227_1874 [Shigella flexneri K-227]
gi|391251255|gb|EIQ10471.1| hypothetical protein SF285071_1831 [Shigella flexneri 2850-71]
Length = 478
Score = 364 bits (934), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|260855529|ref|YP_003229420.1| hypothetical protein ECO26_2435 [Escherichia coli O26:H11 str.
11368]
gi|260868196|ref|YP_003234598.1| hypothetical protein ECO111_2176 [Escherichia coli O111:H- str.
11128]
gi|415791727|ref|ZP_11495499.1| hypothetical protein ECEPECA14_5139 [Escherichia coli EPECa14]
gi|415817495|ref|ZP_11507626.1| hypothetical protein ECOK1180_0320 [Escherichia coli OK1180]
gi|417195370|ref|ZP_12015784.1| hypothetical protein EC40522_1747 [Escherichia coli 4.0522]
gi|417212919|ref|ZP_12022315.1| hypothetical protein ECJB195_0888 [Escherichia coli JB1-95]
gi|417298659|ref|ZP_12085897.1| hypothetical protein EC900105_2265 [Escherichia coli 900105 (10e)]
gi|417591792|ref|ZP_12242491.1| hypothetical protein EC253486_2390 [Escherichia coli 2534-86]
gi|419197039|ref|ZP_13740432.1| hypothetical protein ECDEC8A_2140 [Escherichia coli DEC8A]
gi|419203164|ref|ZP_13746365.1| hypothetical protein ECDEC8B_2189 [Escherichia coli DEC8B]
gi|419209566|ref|ZP_13752656.1| hypothetical protein ECDEC8C_2771 [Escherichia coli DEC8C]
gi|419215596|ref|ZP_13758605.1| hypothetical protein ECDEC8D_2360 [Escherichia coli DEC8D]
gi|419221400|ref|ZP_13764335.1| hypothetical protein ECDEC8E_2202 [Escherichia coli DEC8E]
gi|419226734|ref|ZP_13769602.1| hypothetical protein ECDEC9A_2144 [Escherichia coli DEC9A]
gi|419249106|ref|ZP_13791695.1| hypothetical protein ECDEC9E_2330 [Escherichia coli DEC9E]
gi|419254913|ref|ZP_13797436.1| hypothetical protein ECDEC10A_2425 [Escherichia coli DEC10A]
gi|419261119|ref|ZP_13803547.1| hypothetical protein ECDEC10B_2701 [Escherichia coli DEC10B]
gi|419266957|ref|ZP_13809318.1| hypothetical protein ECDEC10C_2733 [Escherichia coli DEC10C]
gi|419272625|ref|ZP_13814927.1| hypothetical protein ECDEC10D_2377 [Escherichia coli DEC10D]
gi|419283982|ref|ZP_13826173.1| hypothetical protein ECDEC10F_2649 [Escherichia coli DEC10F]
gi|419876518|ref|ZP_14398243.1| hypothetical protein ECO9534_12407 [Escherichia coli O111:H11 str.
CVM9534]
gi|419892384|ref|ZP_14412406.1| hypothetical protein ECO9570_09333 [Escherichia coli O111:H8 str.
CVM9570]
gi|419896037|ref|ZP_14415799.1| hypothetical protein ECO9574_03311 [Escherichia coli O111:H8 str.
CVM9574]
gi|420091843|ref|ZP_14603579.1| hypothetical protein ECO9602_22159 [Escherichia coli O111:H8 str.
CVM9602]
gi|420094804|ref|ZP_14606372.1| hypothetical protein ECO9634_14721 [Escherichia coli O111:H8 str.
CVM9634]
gi|420102948|ref|ZP_14613873.1| hypothetical protein ECO9455_23615 [Escherichia coli O111:H11 str.
CVM9455]
gi|420109151|ref|ZP_14619328.1| hypothetical protein ECO9553_01969 [Escherichia coli O111:H11 str.
CVM9553]
gi|420114685|ref|ZP_14624317.1| hypothetical protein ECO10021_22657 [Escherichia coli O26:H11 str.
CVM10021]
gi|420118929|ref|ZP_14628238.1| hypothetical protein ECO10030_07988 [Escherichia coli O26:H11 str.
CVM10030]
gi|420129917|ref|ZP_14638432.1| hypothetical protein ECO10224_21965 [Escherichia coli O26:H11 str.
CVM10224]
gi|420136215|ref|ZP_14644276.1| hypothetical protein ECO9952_11535 [Escherichia coli O26:H11 str.
CVM9952]
gi|424752157|ref|ZP_18180163.1| hypothetical protein CFSAN001629_18435 [Escherichia coli O26:H11
str. CFSAN001629]
gi|424771337|ref|ZP_18198487.1| hypothetical protein CFSAN001632_13759 [Escherichia coli O111:H8
str. CFSAN001632]
gi|425379446|ref|ZP_18763560.1| hypothetical protein ECEC1865_2520 [Escherichia coli EC1865]
gi|257754178|dbj|BAI25680.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368]
gi|257764552|dbj|BAI36047.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
gi|323153056|gb|EFZ39325.1| hypothetical protein ECEPECA14_5139 [Escherichia coli EPECa14]
gi|323181024|gb|EFZ66562.1| hypothetical protein ECOK1180_0320 [Escherichia coli OK1180]
gi|345340452|gb|EGW72870.1| hypothetical protein EC253486_2390 [Escherichia coli 2534-86]
gi|378048351|gb|EHW10705.1| hypothetical protein ECDEC8A_2140 [Escherichia coli DEC8A]
gi|378052125|gb|EHW14435.1| hypothetical protein ECDEC8B_2189 [Escherichia coli DEC8B]
gi|378055431|gb|EHW17693.1| hypothetical protein ECDEC8C_2771 [Escherichia coli DEC8C]
gi|378064054|gb|EHW26216.1| hypothetical protein ECDEC8D_2360 [Escherichia coli DEC8D]
gi|378067960|gb|EHW30071.1| hypothetical protein ECDEC8E_2202 [Escherichia coli DEC8E]
gi|378076729|gb|EHW38731.1| hypothetical protein ECDEC9A_2144 [Escherichia coli DEC9A]
gi|378096479|gb|EHW58249.1| hypothetical protein ECDEC9E_2330 [Escherichia coli DEC9E]
gi|378101955|gb|EHW63639.1| hypothetical protein ECDEC10A_2425 [Escherichia coli DEC10A]
gi|378108450|gb|EHW70063.1| hypothetical protein ECDEC10B_2701 [Escherichia coli DEC10B]
gi|378112829|gb|EHW74402.1| hypothetical protein ECDEC10C_2733 [Escherichia coli DEC10C]
gi|378118001|gb|EHW79510.1| hypothetical protein ECDEC10D_2377 [Escherichia coli DEC10D]
gi|378135524|gb|EHW96835.1| hypothetical protein ECDEC10F_2649 [Escherichia coli DEC10F]
gi|386189412|gb|EIH78178.1| hypothetical protein EC40522_1747 [Escherichia coli 4.0522]
gi|386194595|gb|EIH88842.1| hypothetical protein ECJB195_0888 [Escherichia coli JB1-95]
gi|386257698|gb|EIJ13181.1| hypothetical protein EC900105_2265 [Escherichia coli 900105 (10e)]
gi|388343850|gb|EIL09750.1| hypothetical protein ECO9534_12407 [Escherichia coli O111:H11 str.
CVM9534]
gi|388347784|gb|EIL13434.1| hypothetical protein ECO9570_09333 [Escherichia coli O111:H8 str.
CVM9570]
gi|388359400|gb|EIL23720.1| hypothetical protein ECO9574_03311 [Escherichia coli O111:H8 str.
CVM9574]
gi|394381132|gb|EJE58829.1| hypothetical protein ECO10224_21965 [Escherichia coli O26:H11 str.
CVM10224]
gi|394382158|gb|EJE59810.1| hypothetical protein ECO9602_22159 [Escherichia coli O111:H8 str.
CVM9602]
gi|394395229|gb|EJE71702.1| hypothetical protein ECO9634_14721 [Escherichia coli O111:H8 str.
CVM9634]
gi|394407734|gb|EJE82513.1| hypothetical protein ECO9553_01969 [Escherichia coli O111:H11 str.
CVM9553]
gi|394408549|gb|EJE83191.1| hypothetical protein ECO10021_22657 [Escherichia coli O26:H11 str.
CVM10021]
gi|394409366|gb|EJE83905.1| hypothetical protein ECO9455_23615 [Escherichia coli O111:H11 str.
CVM9455]
gi|394418734|gb|EJE92392.1| hypothetical protein ECO9952_11535 [Escherichia coli O26:H11 str.
CVM9952]
gi|394432302|gb|EJF04404.1| hypothetical protein ECO10030_07988 [Escherichia coli O26:H11 str.
CVM10030]
gi|408298566|gb|EKJ16500.1| hypothetical protein ECEC1865_2520 [Escherichia coli EC1865]
gi|421938446|gb|EKT96020.1| hypothetical protein CFSAN001629_18435 [Escherichia coli O26:H11
str. CFSAN001629]
gi|421940688|gb|EKT98138.1| hypothetical protein CFSAN001632_13759 [Escherichia coli O111:H8
str. CFSAN001632]
Length = 478
Score = 364 bits (934), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|424837916|ref|ZP_18262553.1| hypothetical protein SF5M90T_1482 [Shigella flexneri 5a str. M90T]
gi|383466968|gb|EID61989.1| hypothetical protein SF5M90T_1482 [Shigella flexneri 5a str. M90T]
Length = 496
Score = 364 bits (934), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 28 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 84
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 85 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 144
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 145 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 197
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 198 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 234
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 235 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 294
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 295 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 351
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 352 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 403
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 404 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 460
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 461 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 496
>gi|423139769|ref|ZP_17127407.1| SelO family protein [Salmonella enterica subsp. houtenae str. ATCC
BAA-1581]
gi|379052323|gb|EHY70214.1| SelO family protein [Salmonella enterica subsp. houtenae str. ATCC
BAA-1581]
Length = 480
Score = 364 bits (934), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ ++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWHNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LAD+AIRH++ ++ T KY
Sbjct: 182 HFEHFYYR--REPKKVQQLADFAIRHYWPQWQD---------------------TPEKYE 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I++ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPAVALWNLQRLAQTL--TPFIENDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ +L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFTEQKDDNVLLHELFSLMAREGSDYTRTFRKLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M SVNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTETVDDALRQQQMQSVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEILRQPFIDRD--DDYASRPPEWGKRLAV---SCSS 480
>gi|375001552|ref|ZP_09725892.1| SelO family protein [Salmonella enterica subsp. enterica serovar
Infantis str. SARB27]
gi|353076240|gb|EHB42000.1| SelO family protein [Salmonella enterica subsp. enterica serovar
Infantis str. SARB27]
Length = 480
Score = 364 bits (934), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 215/521 (41%), Positives = 297/521 (57%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+M +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQREM-------QETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|432616680|ref|ZP_19852801.1| hypothetical protein A1UM_02113 [Escherichia coli KTE75]
gi|431154920|gb|ELE55681.1| hypothetical protein A1UM_02113 [Escherichia coli KTE75]
Length = 478
Score = 364 bits (934), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVALSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + YA PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYASRPPDWGKRLEV---SCSS 478
>gi|56413668|ref|YP_150743.1| hypothetical protein SPA1498 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|197362592|ref|YP_002142229.1| hypothetical protein SSPA1390 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gi|81360457|sp|Q5PH84.1|YDIU_SALPA RecName: Full=UPF0061 protein YdiU
gi|226725738|sp|B5BA30.1|YDIU_SALPK RecName: Full=UPF0061 protein YdiU
gi|56127925|gb|AAV77431.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|197094069|emb|CAR59569.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
Length = 480
Score = 364 bits (934), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 215/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------AEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|168263833|ref|ZP_02685806.1| protein YdiU [Salmonella enterica subsp. enterica serovar Hadar
str. RI_05P066]
gi|205347617|gb|EDZ34248.1| protein YdiU [Salmonella enterica subsp. enterica serovar Hadar
str. RI_05P066]
Length = 480
Score = 364 bits (934), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 215/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YAR PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYARRPPEWGKRLEV---SCSS 480
>gi|419232323|ref|ZP_13775104.1| hypothetical protein ECDEC9B_1840 [Escherichia coli DEC9B]
gi|419237854|ref|ZP_13780581.1| hypothetical protein ECDEC9C_2071 [Escherichia coli DEC9C]
gi|419243292|ref|ZP_13785933.1| hypothetical protein ECDEC9D_1865 [Escherichia coli DEC9D]
gi|378078816|gb|EHW40795.1| hypothetical protein ECDEC9B_1840 [Escherichia coli DEC9B]
gi|378085267|gb|EHW47160.1| hypothetical protein ECDEC9C_2071 [Escherichia coli DEC9C]
gi|378091900|gb|EHW53727.1| hypothetical protein ECDEC9D_1865 [Escherichia coli DEC9D]
Length = 478
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQLVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|301327434|ref|ZP_07220671.1| SelO family protein [Escherichia coli MS 78-1]
gi|417148606|ref|ZP_11988853.1| hypothetical protein EC12264_3360 [Escherichia coli 1.2264]
gi|417596830|ref|ZP_12247479.1| hypothetical protein EC30301_1967 [Escherichia coli 3030-1]
gi|419804411|ref|ZP_14329569.1| SelO family protein [Escherichia coli AI27]
gi|419949985|ref|ZP_14466211.1| hypothetical protein ECMT8_11512 [Escherichia coli CUMT8]
gi|422956937|ref|ZP_16969411.1| UPF0061 protein ydiU [Escherichia coli H494]
gi|432831684|ref|ZP_20065258.1| hypothetical protein A1YM_03470 [Escherichia coli KTE135]
gi|432967828|ref|ZP_20156743.1| hypothetical protein A15G_02927 [Escherichia coli KTE203]
gi|433092113|ref|ZP_20278388.1| hypothetical protein WK1_01747 [Escherichia coli KTE138]
gi|300845986|gb|EFK73746.1| SelO family protein [Escherichia coli MS 78-1]
gi|345355743|gb|EGW87952.1| hypothetical protein EC30301_1967 [Escherichia coli 3030-1]
gi|371599238|gb|EHN88028.1| UPF0061 protein ydiU [Escherichia coli H494]
gi|384472596|gb|EIE56649.1| SelO family protein [Escherichia coli AI27]
gi|386162264|gb|EIH24066.1| hypothetical protein EC12264_3360 [Escherichia coli 1.2264]
gi|388417954|gb|EIL77777.1| hypothetical protein ECMT8_11512 [Escherichia coli CUMT8]
gi|431375654|gb|ELG60977.1| hypothetical protein A1YM_03470 [Escherichia coli KTE135]
gi|431470945|gb|ELH50838.1| hypothetical protein A15G_02927 [Escherichia coli KTE203]
gi|431611095|gb|ELI80375.1| hypothetical protein WK1_01747 [Escherichia coli KTE138]
Length = 478
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLHRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|168463253|ref|ZP_02697184.1| protein YdiU [Salmonella enterica subsp. enterica serovar Newport
str. SL317]
gi|418761178|ref|ZP_13317323.1| hypothetical protein SEEN185_01236 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|418768735|ref|ZP_13324779.1| hypothetical protein SEEN199_18269 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|418769674|ref|ZP_13325701.1| hypothetical protein SEEN539_09408 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|418776086|ref|ZP_13332035.1| hypothetical protein SEEN953_12667 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|418780427|ref|ZP_13336316.1| hypothetical protein SEEN188_02797 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|418786142|ref|ZP_13341962.1| hypothetical protein SEEN559_05891 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|418802333|ref|ZP_13357960.1| hypothetical protein SEEN202_07014 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
gi|419787710|ref|ZP_14313417.1| hypothetical protein SEENLE01_15685 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|419792084|ref|ZP_14317727.1| hypothetical protein SEENLE15_22702 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|195633982|gb|EDX52334.1| protein YdiU [Salmonella enterica subsp. enterica serovar Newport
str. SL317]
gi|392619205|gb|EIX01590.1| hypothetical protein SEENLE01_15685 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|392619468|gb|EIX01852.1| hypothetical protein SEENLE15_22702 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|392730735|gb|EIZ87975.1| hypothetical protein SEEN199_18269 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|392739120|gb|EIZ96259.1| hypothetical protein SEEN539_09408 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|392740796|gb|EIZ97911.1| hypothetical protein SEEN185_01236 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|392746719|gb|EJA03725.1| hypothetical protein SEEN953_12667 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|392749156|gb|EJA06134.1| hypothetical protein SEEN559_05891 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|392749477|gb|EJA06454.1| hypothetical protein SEEN188_02797 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|392777346|gb|EJA34029.1| hypothetical protein SEEN202_07014 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
Length = 480
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 297/521 (57%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL ID N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTLTPFIEID--ALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|416346732|ref|ZP_11679823.1| hypothetical protein ECoL_04894 [Escherichia coli EC4100B]
gi|320197890|gb|EFW72498.1| hypothetical protein ECoL_04894 [Escherichia coli EC4100B]
Length = 478
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLANGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|157156707|ref|YP_001463002.1| hypothetical protein EcE24377A_1924 [Escherichia coli E24377A]
gi|166979597|sp|A7ZMH3.1|YDIU_ECO24 RecName: Full=UPF0061 protein YdiU
gi|157078737|gb|ABV18445.1| conserved hypothetical protein [Escherichia coli E24377A]
Length = 478
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLHRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|187732402|ref|YP_001880467.1| hypothetical protein SbBS512_E1910 [Shigella boydii CDC 3083-94]
gi|226725740|sp|B2U355.1|YDIU_SHIB3 RecName: Full=UPF0061 protein YdiU
gi|187429394|gb|ACD08668.1| conserved hypothetical protein [Shigella boydii CDC 3083-94]
Length = 478
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +LV + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLVWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|293396346|ref|ZP_06640624.1| SelO family protein [Serratia odorifera DSM 4582]
gi|291421135|gb|EFE94386.1| SelO family protein [Serratia odorifera DSM 4582]
Length = 480
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 215/528 (40%), Positives = 300/528 (56%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P+ ++ + L YT+++P+ ++ +L+ SE +A L LD F + P++ +G
Sbjct: 2 PQFENAYHQQLPGFYTELTPTP-LQGARLLYHSEPLAHELGLDDSWFTPDNVPVW-AGER 59
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR D
Sbjct: 60 LLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLPDGRSMDWHLKGAGLTPYSRMGD 119
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS +REFL SEAMH LGIPT+RAL +VT+ + V R+ + E GA++ R+A
Sbjct: 120 GRAVLRSVVREFLASEAMHHLGIPTSRALTIVTSDQPVYRE-------QPERGAMLMRIA 172
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q + VR LAD+ I H+ + +
Sbjct: 173 ESHVRFGHFEHFYYRKQPEQ--VRQLADFVIARHWPALAD-------------------- 210
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
+++KY W EV ERTA L+A WQ VGF HGV+NTDNMSILG+TIDYGP+GFLD + P
Sbjct: 211 -SADKYLLWFTEVVERTARLMADWQTVGFAHGVMNTDNMSILGITIDYGPYGFLDDYQPG 269
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
+ N +D G RY F NQP + LWN+ + + TL+ ++ EA + + M Y
Sbjct: 270 YICNHSDHQG-RYAFDNQPAVALWNLHRLAQTLSGLMRVEQLEA--ALAAFEPALMQAYG 326
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG KQ +++ LL+ M + DYT FR LS V+ + + PL+
Sbjct: 327 DKMRAKLGFFSQEKQDNDLLTGLLSLMTAEGRDYTRTFRLLSEVE------QLQTRSPLR 380
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D ++A+ W L Y Q LL +SDE+R+ M +VNPK +LRNYL Q AI+AA
Sbjct: 381 DEFID-----RDAFDRWYLQYRQRLLQEQVSDEQRQRAMKAVNPKLILRNYLAQEAIEAA 435
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ D G++ RL + + P+D+ P E +A LPP W +SCSS
Sbjct: 436 QKDDIGKLARLHQALLTPFDDDPRYEDFAALPPDWGKH---LEISCSS 480
>gi|16760549|ref|NP_456166.1| hypothetical protein STY1765 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29141690|ref|NP_805032.1| hypothetical protein t1226 [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|213161735|ref|ZP_03347445.1| hypothetical protein Salmoneentericaenterica_17734 [Salmonella
enterica subsp. enterica serovar Typhi str. E00-7866]
gi|213648789|ref|ZP_03378842.1| hypothetical protein SentesTy_16778 [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
gi|213855702|ref|ZP_03383942.1| hypothetical protein SentesT_17343 [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
gi|378959391|ref|YP_005216877.1| hypothetical protein STBHUCCB_13150 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
gi|33517077|sp|Q8Z6I8.1|YDIU_SALTI RecName: Full=UPF0061 protein YdiU
gi|25323659|pir||AF0704 conserved hypothetical protein STY1765 [imported] - Salmonella
enterica subsp. enterica serovar Typhi (strain CT18)
gi|16502845|emb|CAD02007.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi]
gi|29137318|gb|AAO68881.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|374353263|gb|AEZ45024.1| hypothetical protein STBHUCCB_13150 [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
Length = 480
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGIWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------AEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|193068900|ref|ZP_03049859.1| conserved hypothetical protein [Escherichia coli E110019]
gi|415826422|ref|ZP_11513560.1| hypothetical protein ECOK1357_0481 [Escherichia coli OK1357]
gi|417232050|ref|ZP_12033448.1| hypothetical protein EC50959_4685 [Escherichia coli 5.0959]
gi|432533955|ref|ZP_19770934.1| hypothetical protein A193_02392 [Escherichia coli KTE234]
gi|432674739|ref|ZP_19910214.1| hypothetical protein A1YU_01285 [Escherichia coli KTE142]
gi|192957695|gb|EDV88139.1| conserved hypothetical protein [Escherichia coli E110019]
gi|323186147|gb|EFZ71502.1| hypothetical protein ECOK1357_0481 [Escherichia coli OK1357]
gi|386205049|gb|EII09560.1| hypothetical protein EC50959_4685 [Escherichia coli 5.0959]
gi|431061441|gb|ELD70754.1| hypothetical protein A193_02392 [Escherichia coli KTE234]
gi|431215612|gb|ELF13298.1| hypothetical protein A1YU_01285 [Escherichia coli KTE142]
Length = 478
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|365970121|ref|YP_004951682.1| protein YdiU [Enterobacter cloacae EcWSU1]
gi|365749034|gb|AEW73261.1| YdiU [Enterobacter cloacae EcWSU1]
Length = 524
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 216/518 (41%), Positives = 289/518 (55%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ ++N +L+ ++ +AD L + P+ F+ D + G T LAG P AQ
Sbjct: 57 LPGFYTALKPTP-LQNSRLIWHNDRLADELAVPPEMFQPSDGAGVWGGETLLAGMQPLAQ 115
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE E + LKGAG TPYSR DG AVLRS+IR
Sbjct: 116 VYSGHQFGVWAGQLGDGRGILLGEQRLPNGETVDWHLKGAGLTPYSRMGDGRAVLRSTIR 175
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPTTRAL +VT+ V R+ E GA++ RVAQS LRFG ++
Sbjct: 176 ECLASEAMHALGIPTTRALSIVTSDTPVARETM-------EKGAMLMRVAQSHLRFGHFE 228
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + + VR LADYAIRHH+ H ++ ++KY W
Sbjct: 229 HFYYR--REPEKVRQLADYAIRHHWSHFQD---------------------EADKYILWF 265
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V RTA+++A+WQ VGF HGV+NTDNMS+LGLT DYGPFGFLD + P + N +D G
Sbjct: 266 RDVVARTATMIARWQTVGFAHGVMNTDNMSLLGLTFDYGPFGFLDDYQPGYICNHSDYQG 325
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + TL + ID N ++ Y + EY A+M KLGL
Sbjct: 326 -RYSFDNQPAVGLWNLQRLAQTL--SPFIDVDALNDALDSYQDILLREYGALMRNKLGLV 382
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ + I++ L M + DYT FR LS + S PL+ +D
Sbjct: 383 TQERGDNDILNALFALMEREGSDYTRTFRMLSQTEQHSSAS------PLRDEFID----- 431
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++ + W Y L + D R+A MN+ NP VLRN+L Q AI+ AE G++ E+ R
Sbjct: 432 RQGFDDWFALYRARLQQEQVDDATRQAQMNAANPAMVLRNWLAQRAIEQAEQGEYDELHR 491
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 492 LHVALRTPFADRD--DDYVSRPPDWGKRLEV---SCSS 524
>gi|300904562|ref|ZP_07122399.1| SelO family protein [Escherichia coli MS 84-1]
gi|300918080|ref|ZP_07134699.1| SelO family protein [Escherichia coli MS 115-1]
gi|301306651|ref|ZP_07212710.1| SelO family protein [Escherichia coli MS 124-1]
gi|415861386|ref|ZP_11535052.1| SelO family protein [Escherichia coli MS 85-1]
gi|417639210|ref|ZP_12289364.1| hypothetical protein ECTX1999_1917 [Escherichia coli TX1999]
gi|419170253|ref|ZP_13714144.1| hypothetical protein ECDEC7A_1906 [Escherichia coli DEC7A]
gi|419180906|ref|ZP_13724523.1| hypothetical protein ECDEC7C_2034 [Escherichia coli DEC7C]
gi|419186342|ref|ZP_13729859.1| hypothetical protein ECDEC7D_2074 [Escherichia coli DEC7D]
gi|419191627|ref|ZP_13735087.1| hypothetical protein ECDEC7E_1904 [Escherichia coli DEC7E]
gi|420385684|ref|ZP_14885045.1| hypothetical protein ECEPECA12_2048 [Escherichia coli EPECa12]
gi|427804841|ref|ZP_18971908.1| hypothetical protein BN16_22511 [Escherichia coli chi7122]
gi|427809399|ref|ZP_18976464.1| hypothetical protein BN17_19641 [Escherichia coli]
gi|432531077|ref|ZP_19768107.1| hypothetical protein A191_04326 [Escherichia coli KTE233]
gi|433130234|ref|ZP_20315679.1| hypothetical protein WKG_01966 [Escherichia coli KTE163]
gi|433134936|ref|ZP_20320290.1| hypothetical protein WKI_01870 [Escherichia coli KTE166]
gi|443617788|ref|YP_007381644.1| hypothetical protein APECO78_12355 [Escherichia coli APEC O78]
gi|300403475|gb|EFJ87013.1| SelO family protein [Escherichia coli MS 84-1]
gi|300414731|gb|EFJ98041.1| SelO family protein [Escherichia coli MS 115-1]
gi|300838113|gb|EFK65873.1| SelO family protein [Escherichia coli MS 124-1]
gi|315257489|gb|EFU37457.1| SelO family protein [Escherichia coli MS 85-1]
gi|345394062|gb|EGX23827.1| hypothetical protein ECTX1999_1917 [Escherichia coli TX1999]
gi|378016890|gb|EHV79767.1| hypothetical protein ECDEC7A_1906 [Escherichia coli DEC7A]
gi|378024274|gb|EHV86928.1| hypothetical protein ECDEC7C_2034 [Escherichia coli DEC7C]
gi|378030046|gb|EHV92650.1| hypothetical protein ECDEC7D_2074 [Escherichia coli DEC7D]
gi|378039570|gb|EHW02058.1| hypothetical protein ECDEC7E_1904 [Escherichia coli DEC7E]
gi|391306561|gb|EIQ64317.1| hypothetical protein ECEPECA12_2048 [Escherichia coli EPECa12]
gi|412963023|emb|CCK46941.1| hypothetical protein BN16_22511 [Escherichia coli chi7122]
gi|412969578|emb|CCJ44215.1| hypothetical protein BN17_19641 [Escherichia coli]
gi|431055018|gb|ELD64582.1| hypothetical protein A191_04326 [Escherichia coli KTE233]
gi|431647282|gb|ELJ14766.1| hypothetical protein WKG_01966 [Escherichia coli KTE163]
gi|431657799|gb|ELJ24761.1| hypothetical protein WKI_01870 [Escherichia coli KTE166]
gi|443422296|gb|AGC87200.1| hypothetical protein APECO78_12355 [Escherichia coli APEC O78]
Length = 478
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417689607|ref|ZP_12338838.1| hypothetical protein SB521682_1859 [Shigella boydii 5216-82]
gi|332090853|gb|EGI95945.1| hypothetical protein SB521682_1859 [Shigella boydii 5216-82]
Length = 481
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 297/521 (57%), Gaps = 52/521 (9%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DEDN------EDKYR 219
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF H V+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 220 LWFNDVVARTASLIAQWQTVGFAHRVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 279
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 280 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 336
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 337 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 388
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 389 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 445
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 446 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 481
>gi|312969735|ref|ZP_07783918.1| conserved hypothetical protein [Escherichia coli 1827-70]
gi|310338020|gb|EFQ03109.1| conserved hypothetical protein [Escherichia coli 1827-70]
Length = 478
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNAELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|425288575|ref|ZP_18679444.1| hypothetical protein EC3006_2053 [Escherichia coli 3006]
gi|408215153|gb|EKI39557.1| hypothetical protein EC3006_2053 [Escherichia coli 3006]
Length = 478
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTL-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|300821420|ref|ZP_07101567.1| SelO family protein [Escherichia coli MS 119-7]
gi|331668392|ref|ZP_08369240.1| putative cytoplasmic protein [Escherichia coli TA271]
gi|331677579|ref|ZP_08378254.1| putative cytoplasmic protein [Escherichia coli H591]
gi|417131992|ref|ZP_11976777.1| hypothetical protein EC50588_1906 [Escherichia coli 5.0588]
gi|417222717|ref|ZP_12026157.1| hypothetical protein EC96154_1889 [Escherichia coli 96.154]
gi|417266140|ref|ZP_12053509.1| hypothetical protein EC33884_4052 [Escherichia coli 3.3884]
gi|417602292|ref|ZP_12252862.1| hypothetical protein ECSTEC94C_2081 [Escherichia coli STEC_94C]
gi|418941437|ref|ZP_13494765.1| hypothetical protein T22_01951 [Escherichia coli O157:H43 str. T22]
gi|419370101|ref|ZP_13911223.1| hypothetical protein ECDEC14A_1844 [Escherichia coli DEC14A]
gi|422760958|ref|ZP_16814717.1| hypothetical protein ERBG_00881 [Escherichia coli E1167]
gi|423705695|ref|ZP_17680078.1| UPF0061 protein ydiU [Escherichia coli B799]
gi|425422406|ref|ZP_18803587.1| hypothetical protein EC01288_1763 [Escherichia coli 0.1288]
gi|432376858|ref|ZP_19619855.1| hypothetical protein WCQ_01731 [Escherichia coli KTE12]
gi|432809353|ref|ZP_20043246.1| hypothetical protein A1WM_00506 [Escherichia coli KTE101]
gi|432834703|ref|ZP_20068242.1| hypothetical protein A1YO_02056 [Escherichia coli KTE136]
gi|300525923|gb|EFK46992.1| SelO family protein [Escherichia coli MS 119-7]
gi|324119192|gb|EGC13080.1| hypothetical protein ERBG_00881 [Escherichia coli E1167]
gi|331063586|gb|EGI35497.1| putative cytoplasmic protein [Escherichia coli TA271]
gi|331074039|gb|EGI45359.1| putative cytoplasmic protein [Escherichia coli H591]
gi|345349958|gb|EGW82233.1| hypothetical protein ECSTEC94C_2081 [Escherichia coli STEC_94C]
gi|375323242|gb|EHS68959.1| hypothetical protein T22_01951 [Escherichia coli O157:H43 str. T22]
gi|378219561|gb|EHX79829.1| hypothetical protein ECDEC14A_1844 [Escherichia coli DEC14A]
gi|385713087|gb|EIG50023.1| UPF0061 protein ydiU [Escherichia coli B799]
gi|386149846|gb|EIH01135.1| hypothetical protein EC50588_1906 [Escherichia coli 5.0588]
gi|386202519|gb|EII01510.1| hypothetical protein EC96154_1889 [Escherichia coli 96.154]
gi|386232133|gb|EII59480.1| hypothetical protein EC33884_4052 [Escherichia coli 3.3884]
gi|408344995|gb|EKJ59341.1| hypothetical protein EC01288_1763 [Escherichia coli 0.1288]
gi|430899150|gb|ELC21255.1| hypothetical protein WCQ_01731 [Escherichia coli KTE12]
gi|431362121|gb|ELG48699.1| hypothetical protein A1WM_00506 [Escherichia coli KTE101]
gi|431385063|gb|ELG69050.1| hypothetical protein A1YO_02056 [Escherichia coli KTE136]
Length = 478
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417608252|ref|ZP_12258759.1| hypothetical protein ECSTECDG1313_2645 [Escherichia coli
STEC_DG131-3]
gi|345359793|gb|EGW91968.1| hypothetical protein ECSTECDG1313_2645 [Escherichia coli
STEC_DG131-3]
Length = 478
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 221/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQPLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|157161167|ref|YP_001458485.1| hypothetical protein EcHS_A1786 [Escherichia coli HS]
gi|188493468|ref|ZP_03000738.1| conserved hypothetical protein [Escherichia coli 53638]
gi|432485457|ref|ZP_19727373.1| hypothetical protein A15Y_01936 [Escherichia coli KTE212]
gi|432670784|ref|ZP_19906315.1| hypothetical protein A1Y7_02320 [Escherichia coli KTE119]
gi|433173566|ref|ZP_20358101.1| hypothetical protein WGQ_01828 [Escherichia coli KTE232]
gi|166979598|sp|A8A0P8.1|YDIU_ECOHS RecName: Full=UPF0061 protein YdiU
gi|157066847|gb|ABV06102.1| conserved hypothetical protein [Escherichia coli HS]
gi|188488667|gb|EDU63770.1| conserved hypothetical protein [Escherichia coli 53638]
gi|431015854|gb|ELD29401.1| hypothetical protein A15Y_01936 [Escherichia coli KTE212]
gi|431210858|gb|ELF08841.1| hypothetical protein A1Y7_02320 [Escherichia coli KTE119]
gi|431693832|gb|ELJ59226.1| hypothetical protein WGQ_01828 [Escherichia coli KTE232]
Length = 478
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|425305248|ref|ZP_18694993.1| hypothetical protein ECN1_1676 [Escherichia coli N1]
gi|408229919|gb|EKI53344.1| hypothetical protein ECN1_1676 [Escherichia coli N1]
Length = 478
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F++ + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFKKG--AGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRHKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNTLLNELFRLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+A E GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAVEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432372083|ref|ZP_19615133.1| hypothetical protein WCO_01108 [Escherichia coli KTE11]
gi|430898412|gb|ELC20547.1| hypothetical protein WCO_01108 [Escherichia coli KTE11]
Length = 478
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ ++ +A++L + FE + G T L G P
Sbjct: 10 RDELPATYTSLSPTP-LNNARLIWYNAELANTLGIPSSLFESG--AGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA+S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDTPVYRETV-------ESGAMLMRVARSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+++ DE NKY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWPHLQD------------DE---------NKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIANWQTVGFAHGVMNTDNMSILGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + +Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFISVD--ALNEALDSYQQVLLSQYGQRMRRKL 333
Query: 486 GLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K+ ++S+L + MA ++ DYT FR LS + PL+ +D
Sbjct: 334 GFMTEQKEDNVLLSELFSLMARERSDYTRTFRMLSLTGQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDNWFARYRARLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEQGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|74311975|ref|YP_310394.1| hypothetical protein SSON_1453 [Shigella sonnei Ss046]
gi|383178228|ref|YP_005456233.1| hypothetical protein SSON53_08415 [Shigella sonnei 53G]
gi|414575798|ref|ZP_11432998.1| hypothetical protein SS323385_1639 [Shigella sonnei 3233-85]
gi|415843943|ref|ZP_11523766.1| hypothetical protein SS53G_0459 [Shigella sonnei 53G]
gi|418264871|ref|ZP_12885122.1| hypothetical protein SSMOSELEY_1933 [Shigella sonnei str. Moseley]
gi|420358329|ref|ZP_14859321.1| hypothetical protein SS322685_2127 [Shigella sonnei 3226-85]
gi|420363169|ref|ZP_14864071.1| hypothetical protein SS482266_1575 [Shigella sonnei 4822-66]
gi|121957930|sp|Q3Z253.1|YDIU_SHISS RecName: Full=UPF0061 protein YdiU
gi|73855452|gb|AAZ88159.1| conserved hypothetical protein [Shigella sonnei Ss046]
gi|323169289|gb|EFZ54965.1| hypothetical protein SS53G_0459 [Shigella sonnei 53G]
gi|391285145|gb|EIQ43731.1| hypothetical protein SS322685_2127 [Shigella sonnei 3226-85]
gi|391287029|gb|EIQ45563.1| hypothetical protein SS323385_1639 [Shigella sonnei 3233-85]
gi|391295286|gb|EIQ53455.1| hypothetical protein SS482266_1575 [Shigella sonnei 4822-66]
gi|397901724|gb|EJL18065.1| hypothetical protein SSMOSELEY_1933 [Shigella sonnei str. Moseley]
Length = 478
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFITEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSTAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDGWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHGALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417121325|ref|ZP_11970753.1| hypothetical protein EC970246_4775 [Escherichia coli 97.0246]
gi|386148177|gb|EIG94614.1| hypothetical protein EC970246_4775 [Escherichia coli 97.0246]
Length = 478
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--VLNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|424756850|ref|ZP_18184640.1| hypothetical protein CFSAN001630_04528 [Escherichia coli O111:H11
str. CFSAN001630]
gi|421949483|gb|EKU06430.1| hypothetical protein CFSAN001630_04528 [Escherichia coli O111:H11
str. CFSAN001630]
Length = 478
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 297/521 (57%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + +KGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHVKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|124266958|ref|YP_001020962.1| hypothetical protein Mpe_A1768 [Methylibium petroleiphilum PM1]
gi|124259733|gb|ABM94727.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 507
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 222/523 (42%), Positives = 295/523 (56%), Gaps = 60/523 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQCY 190
+T+++ A + P VA S+S A L ER D+ SG G+ P A Y
Sbjct: 33 HTRLAAQA-LPQPHWVATSDSAARLLGWPGDWAERADWQALEVLSGGRTWPGSEPLATVY 91
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA+ LGEI + + ELQLKGAG+TPYSR DG AVLRSSIREF
Sbjct: 92 SGHQFGVWAGQLGDGRALLLGEI-DTPNGPMELQLKGAGRTPYSRMGDGRAVLRSSIREF 150
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAMHFLGIPTTRAL +V + V R+ E A+V RVA SF+RFG ++
Sbjct: 151 LCSEAMHFLGIPTTRALAVVGSPLPVRRETV-------ETAAVVTRVAPSFVRFGHFEHF 203
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
A G + +RTLAD+ I D+ H +N YAA
Sbjct: 204 AHHGLPE--ALRTLADFVI---------------------DQHHPACREAANPYAALLET 240
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
VA RTA+L+A WQ VGF HGV+NTDN+SILGLTIDYGPFGFLD FDP N +D G R
Sbjct: 241 VARRTATLLADWQAVGFCHGVMNTDNLSILGLTIDYGPFGFLDGFDPGHVCNHSDHQG-R 299
Query: 431 YCFANQPDIGLWNIAQFSTTL----AAAKLIDDKEANYVMER---YGTKFMDEYQAIMTK 483
Y ++ QP + WN+ + + A + + + +E Y F + A +
Sbjct: 300 YAYSRQPSVAFWNLHALAQAMLPLIAMGGEVTEATGDLALEAIEPYKHTFSEAMAARLRA 359
Query: 484 KLGLPKYNKQIIS---KLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
KLGL + ++ L MA ++ D+T +R L+ + P+ P+ ++ + LD
Sbjct: 360 KLGLAGERDEDVALADDWLQLMATERADHTITWRRLA--QWSPAEPQ-----AVRDLFLD 412
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+ A+ +W Y + L G ++ ER+ M+ NPKYVLRN+LC++AI AA+ GDF
Sbjct: 413 -----RPAFDAWADRYARRLALDGRAEAERRLQMDRANPKYVLRNHLCENAIRAAQGGDF 467
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GE +RLLK++ERP+DEQP YA PP WA +SCSS
Sbjct: 468 GETQRLLKVLERPFDEQPEHSAYAEFPPDWAQ---TLEVSCSS 507
>gi|194444535|ref|YP_002040602.1| hypothetical protein SNSL254_A1456 [Salmonella enterica subsp.
enterica serovar Newport str. SL254]
gi|198243364|ref|YP_002215781.1| hypothetical protein SeD_A2000 [Salmonella enterica subsp. enterica
serovar Dublin str. CT_02021853]
gi|375119261|ref|ZP_09764428.1| protein YdiU [Salmonella enterica subsp. enterica serovar Dublin
str. SD3246]
gi|418795806|ref|ZP_13351507.1| hypothetical protein SEEN449_13615 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19449]
gi|418808882|ref|ZP_13364435.1| hypothetical protein SEEN550_04195 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21550]
gi|418813038|ref|ZP_13368559.1| hypothetical protein SEEN513_05772 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22513]
gi|418816882|ref|ZP_13372370.1| hypothetical protein SEEN538_05988 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|418820323|ref|ZP_13375756.1| hypothetical protein SEEN425_08994 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|418824204|ref|ZP_13379576.1| hypothetical protein SEEN462_12269 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22462]
gi|418832750|ref|ZP_13387684.1| hypothetical protein SEEN486_06698 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N18486]
gi|418835358|ref|ZP_13390253.1| hypothetical protein SEEN543_14163 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N1543]
gi|418839780|ref|ZP_13394612.1| hypothetical protein SEEN554_00974 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21554]
gi|418846426|ref|ZP_13401195.1| hypothetical protein SEEN443_15597 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|418855412|ref|ZP_13410068.1| hypothetical protein SEEN593_04439 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
gi|418868589|ref|ZP_13423030.1| hypothetical protein SEEN176_02324 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
gi|445142276|ref|ZP_21385962.1| hypothetical protein SEEDSL_014597 [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
gi|445158833|ref|ZP_21393117.1| hypothetical protein SEEDHWS_018442 [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
gi|226725734|sp|B5FJ96.1|YDIU_SALDC RecName: Full=UPF0061 protein YdiU
gi|226725737|sp|B4T4P0.1|YDIU_SALNS RecName: Full=UPF0061 protein YdiU
gi|194403198|gb|ACF63420.1| protein YdiU [Salmonella enterica subsp. enterica serovar Newport
str. SL254]
gi|197937880|gb|ACH75213.1| protein YdiU [Salmonella enterica subsp. enterica serovar Dublin
str. CT_02021853]
gi|326623528|gb|EGE29873.1| protein YdiU [Salmonella enterica subsp. enterica serovar Dublin
str. SD3246]
gi|392758334|gb|EJA15209.1| hypothetical protein SEEN449_13615 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19449]
gi|392774264|gb|EJA30959.1| hypothetical protein SEEN513_05772 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22513]
gi|392775565|gb|EJA32257.1| hypothetical protein SEEN550_04195 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21550]
gi|392789050|gb|EJA45570.1| hypothetical protein SEEN538_05988 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21538]
gi|392792592|gb|EJA49046.1| hypothetical protein SEEN425_08994 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22425]
gi|392796820|gb|EJA53148.1| hypothetical protein SEEN486_06698 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N18486]
gi|392803768|gb|EJA59952.1| hypothetical protein SEEN543_14163 [Salmonella enterica subsp.
enterica serovar Newport str. CVM N1543]
gi|392810299|gb|EJA66319.1| hypothetical protein SEEN443_15597 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|392812224|gb|EJA68219.1| hypothetical protein SEEN554_00974 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21554]
gi|392821470|gb|EJA77294.1| hypothetical protein SEEN593_04439 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19593]
gi|392824537|gb|EJA80322.1| hypothetical protein SEEN462_12269 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 22462]
gi|392837279|gb|EJA92849.1| hypothetical protein SEEN176_02324 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
gi|444845099|gb|ELX70311.1| hypothetical protein SEEDHWS_018442 [Salmonella enterica subsp.
enterica serovar Dublin str. HWS51]
gi|444849701|gb|ELX74810.1| hypothetical protein SEEDSL_014597 [Salmonella enterica subsp.
enterica serovar Dublin str. SL1438]
Length = 480
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------AEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDHYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|417827856|ref|ZP_12474419.1| conserved protein [Shigella flexneri J1713]
gi|335575689|gb|EGM61966.1| conserved protein [Shigella flexneri J1713]
Length = 478
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IR+ L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRKSLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|418858426|ref|ZP_13413040.1| hypothetical protein SEEN470_01780 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|418862916|ref|ZP_13417454.1| hypothetical protein SEEN536_18505 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|392832397|gb|EJA88017.1| hypothetical protein SEEN470_01780 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|392832784|gb|EJA88399.1| hypothetical protein SEEN536_18505 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
Length = 480
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------AEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEVDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDHYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|417287323|ref|ZP_12074610.1| hypothetical protein ECTW07793_1794 [Escherichia coli TW07793]
gi|425300480|ref|ZP_18690424.1| hypothetical protein EC07798_2337 [Escherichia coli 07798]
gi|386249656|gb|EII95827.1| hypothetical protein ECTW07793_1794 [Escherichia coli TW07793]
gi|408216627|gb|EKI40941.1| hypothetical protein EC07798_2337 [Escherichia coli 07798]
Length = 478
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 219/515 (42%), Positives = 294/515 (57%), Gaps = 55/515 (10%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
YT +SP+ + N +L+ + +A++L + F+ + + G T L G P AQ Y
Sbjct: 16 TYTALSPTP-LNNARLIWHNAELANTLSIPSSLFK--NGAGVWGGETLLPGMSPLAQVYS 72
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS+IRE L
Sbjct: 73 GHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESL 132
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG ++
Sbjct: 133 ASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFGHFEHFY 185
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
R + + VR LAD+AIRH++ H+E+ DED KY W +V
Sbjct: 186 YRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYRLWFSDV 222
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D G RY
Sbjct: 223 VARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSDHQG-RY 281
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---P 488
F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KLG
Sbjct: 282 SFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKLGFMTEQ 339
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
K + ++++L + MA ++ DYT FR LS + + PL+ +D + A
Sbjct: 340 KEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-----RAA 388
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E+ RL +
Sbjct: 389 FDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTELHRLHE 448
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+ ++ + Y PP W R V SCSS
Sbjct: 449 ALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|418788483|ref|ZP_13344277.1| hypothetical protein SEEN447_20836 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19447]
gi|418798544|ref|ZP_13354221.1| hypothetical protein SEEN567_15616 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19567]
gi|392762785|gb|EJA19597.1| hypothetical protein SEEN447_20836 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19447]
gi|392767201|gb|EJA23973.1| hypothetical protein SEEN567_15616 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19567]
Length = 480
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------AEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDHYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLHQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|170019944|ref|YP_001724898.1| hypothetical protein EcolC_1925 [Escherichia coli ATCC 8739]
gi|189041160|sp|B1IQ50.1|YDIU_ECOLC RecName: Full=UPF0061 protein YdiU
gi|169754872|gb|ACA77571.1| protein of unknown function UPF0061 [Escherichia coli ATCC 8739]
Length = 478
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|161614246|ref|YP_001588211.1| hypothetical protein SPAB_01991 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|189041162|sp|A9N229.1|YDIU_SALPB RecName: Full=UPF0061 protein YdiU
gi|161363610|gb|ABX67378.1| hypothetical protein SPAB_01991 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
Length = 480
Score = 362 bits (930), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------AEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|300818345|ref|ZP_07098555.1| SelO family protein [Escherichia coli MS 107-1]
gi|415873497|ref|ZP_11540717.1| SelO family protein [Escherichia coli MS 79-10]
gi|432805760|ref|ZP_20039699.1| hypothetical protein A1WA_01664 [Escherichia coli KTE91]
gi|432934326|ref|ZP_20133864.1| hypothetical protein A13E_03016 [Escherichia coli KTE184]
gi|433193681|ref|ZP_20377681.1| hypothetical protein WGU_01996 [Escherichia coli KTE90]
gi|300528985|gb|EFK50047.1| SelO family protein [Escherichia coli MS 107-1]
gi|342930704|gb|EGU99426.1| SelO family protein [Escherichia coli MS 79-10]
gi|431355454|gb|ELG42162.1| hypothetical protein A1WA_01664 [Escherichia coli KTE91]
gi|431453858|gb|ELH34240.1| hypothetical protein A13E_03016 [Escherichia coli KTE184]
gi|431717508|gb|ELJ81605.1| hypothetical protein WGU_01996 [Escherichia coli KTE90]
Length = 478
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLHRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|168233530|ref|ZP_02658588.1| protein YdiU [Salmonella enterica subsp. enterica serovar Kentucky
str. CDC 191]
gi|194468948|ref|ZP_03074932.1| protein YdiU [Salmonella enterica subsp. enterica serovar Kentucky
str. CVM29188]
gi|194455312|gb|EDX44151.1| protein YdiU [Salmonella enterica subsp. enterica serovar Kentucky
str. CVM29188]
gi|205332347|gb|EDZ19111.1| protein YdiU [Salmonella enterica subsp. enterica serovar Kentucky
str. CDC 191]
Length = 480
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|432868907|ref|ZP_20089702.1| hypothetical protein A313_00511 [Escherichia coli KTE147]
gi|431410823|gb|ELG93966.1| hypothetical protein A313_00511 [Escherichia coli KTE147]
Length = 478
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|293446080|ref|ZP_06662502.1| hypothetical protein ECCG_00226 [Escherichia coli B088]
gi|417155363|ref|ZP_11993492.1| hypothetical protein EC960497_1882 [Escherichia coli 96.0497]
gi|417581176|ref|ZP_12231981.1| hypothetical protein ECSTECB2F1_1832 [Escherichia coli STEC_B2F1]
gi|291322910|gb|EFE62338.1| hypothetical protein ECCG_00226 [Escherichia coli B088]
gi|345339799|gb|EGW72224.1| hypothetical protein ECSTECB2F1_1832 [Escherichia coli STEC_B2F1]
gi|386168452|gb|EIH34968.1| hypothetical protein EC960497_1882 [Escherichia coli 96.0497]
Length = 478
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLHRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432369826|ref|ZP_19612915.1| hypothetical protein WCM_03773 [Escherichia coli KTE10]
gi|430885453|gb|ELC08324.1| hypothetical protein WCM_03773 [Escherichia coli KTE10]
Length = 478
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE + SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESVASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|440896682|gb|ELR48546.1| hypothetical protein M91_07113 [Bos grunniens mutus]
Length = 527
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 217/546 (39%), Positives = 314/546 (57%), Gaps = 51/546 (9%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV-ADSLELDPKEFERPDFPLF 173
LP DP ++ R+V + ++ P+ +LVA S+ V D L+LD E DF
Sbjct: 16 LPTDPVKENYVRKVKNCVFSIAFPTPFQSRVRLVAVSKEVLEDILDLDLSVSETDDFIQL 75
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
SG + G++P A YGGHQFG+WA QLGDGRA +G +N + E+WELQLKG+GKTPY
Sbjct: 76 VSGGKIVFGSIPLAHRYGGHQFGIWADQLGDGRAHLIGIYMNRQGEKWELQLKGSGKTPY 135
Query: 234 SR-----FADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKE 288
SR DG A+LRSS+REFLCSEAMH+LGIPT+RA LV + V RD FY+GN +
Sbjct: 136 SRDILVLNGDGRAILRSSLREFLCSEAMHYLGIPTSRAASLVVSDDVVWRDQFYNGNLAK 195
Query: 289 EPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
E GA+V RVA+S+ R GS +I G+ LD++R L D+ I+ +F
Sbjct: 196 ERGAVVLRVAKSWFRIGSLEILTHSGE--LDLLRMLLDFIIQEYF--------------- 238
Query: 349 TGDEDHSVVDLTS-NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
+VD+ N+Y + V TA L+A W VGF HGV NTDN S+L +TIDYG
Sbjct: 239 ------PLVDVKEPNRYVDFFSIVVFETAQLIALWMSVGFAHGVCNTDNFSLLSITIDYG 292
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE---ANY 464
PFGF++A++P F PNT+D RRY NQ +IG++N+ + L L++ ++ A
Sbjct: 293 PFGFMEAYNPDFVPNTSD-DERRYKIGNQANIGMFNLNKLLQALNP--LLNPRQKQLATQ 349
Query: 465 VMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+++ Y + ++ + KLGL + + +I+ LL+ M + D+T FR LS +
Sbjct: 350 ILKEYPVLYYTRFRELFKAKLGLLGKSEGDDDLIAFLLHLMEKTEADFTMTFRQLSEITQ 409
Query: 522 DPSIPEDELLVPLKAVLLDIGKERK--EAWISWVLSYIQELLSSGISDEERKALMNSVNP 579
EL++P + L + + K AW+S LS ++ +S SD ER+ M +VNP
Sbjct: 410 SQL---QELVIPQEFWALKMISKHKLFPAWVSQYLSRLKSNISD--SDSERRKRMTAVNP 464
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK--YARLPPAWAYRPGVC 637
+YVL+N++ +SA+ AE DF EV L +++ P+ + E+ Y+ P+WA V
Sbjct: 465 RYVLKNWMAESAVQKAERNDFSEVHLLQQVLRHPFQKHSAAERAGYSSPTPSWARDLRV- 523
Query: 638 MLSCSS 643
SCSS
Sbjct: 524 --SCSS 527
>gi|432947582|ref|ZP_20142738.1| hypothetical protein A153_02495 [Escherichia coli KTE196]
gi|433043305|ref|ZP_20230806.1| hypothetical protein WIG_01831 [Escherichia coli KTE117]
gi|431457560|gb|ELH37897.1| hypothetical protein A153_02495 [Escherichia coli KTE196]
gi|431556636|gb|ELI30411.1| hypothetical protein WIG_01831 [Escherichia coli KTE117]
Length = 478
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYC 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFL+ ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLNDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|168239539|ref|ZP_02664597.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. SL480]
gi|194734876|ref|YP_002114362.1| hypothetical protein SeSA_A1440 [Salmonella enterica subsp.
enterica serovar Schwarzengrund str. CVM19633]
gi|226725739|sp|B4TUG2.1|YDIU_SALSV RecName: Full=UPF0061 protein YdiU
gi|194710378|gb|ACF89599.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. CVM19633]
gi|197287763|gb|EDY27153.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Schwarzengrund str. SL480]
Length = 480
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RTAFDAWFERYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|419175201|ref|ZP_13719046.1| hypothetical protein ECDEC7B_1893 [Escherichia coli DEC7B]
gi|378034732|gb|EHV97296.1| hypothetical protein ECDEC7B_1893 [Escherichia coli DEC7B]
Length = 478
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLTQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFITEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DNYVSRPPDWGKRLEV---SCSS 478
>gi|416507505|ref|ZP_11735453.1| hypothetical protein SEEM031_00835 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|416523649|ref|ZP_11741284.1| hypothetical protein SEEM710_08798 [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|416562996|ref|ZP_11762582.1| hypothetical protein SEEM42N_13162 [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
gi|363549802|gb|EHL34135.1| hypothetical protein SEEM710_08798 [Salmonella enterica subsp.
enterica serovar Montevideo str. ATCC BAA710]
gi|363553515|gb|EHL37763.1| hypothetical protein SEEM031_00835 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB31]
gi|363572200|gb|EHL56093.1| hypothetical protein SEEM42N_13162 [Salmonella enterica subsp.
enterica serovar Montevideo str. 42N]
Length = 480
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RTAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|238910839|ref|ZP_04654676.1| hypothetical protein SentesTe_06847 [Salmonella enterica subsp.
enterica serovar Tennessee str. CDC07-0191]
Length = 480
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|167551695|ref|ZP_02345449.1| protein YdiU [Salmonella enterica subsp. enterica serovar Saintpaul
str. SARA29]
gi|205323604|gb|EDZ11443.1| protein YdiU [Salmonella enterica subsp. enterica serovar Saintpaul
str. SARA29]
Length = 480
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------AEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|419345262|ref|ZP_13886642.1| hypothetical protein ECDEC13A_1821 [Escherichia coli DEC13A]
gi|419349678|ref|ZP_13891029.1| hypothetical protein ECDEC13B_1624 [Escherichia coli DEC13B]
gi|419355019|ref|ZP_13896287.1| hypothetical protein ECDEC13C_2053 [Escherichia coli DEC13C]
gi|419360158|ref|ZP_13901379.1| hypothetical protein ECDEC13D_1930 [Escherichia coli DEC13D]
gi|419365129|ref|ZP_13906297.1| hypothetical protein ECDEC13E_1959 [Escherichia coli DEC13E]
gi|378188297|gb|EHX48903.1| hypothetical protein ECDEC13A_1821 [Escherichia coli DEC13A]
gi|378203056|gb|EHX63481.1| hypothetical protein ECDEC13B_1624 [Escherichia coli DEC13B]
gi|378203458|gb|EHX63881.1| hypothetical protein ECDEC13C_2053 [Escherichia coli DEC13C]
gi|378205088|gb|EHX65503.1| hypothetical protein ECDEC13D_1930 [Escherichia coli DEC13D]
gi|378215052|gb|EHX75352.1| hypothetical protein ECDEC13E_1959 [Escherichia coli DEC13E]
Length = 478
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR L D+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLVDFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|395762314|ref|ZP_10442983.1| hypothetical protein JPAM2_11285 [Janthinobacterium lividum PAMC
25724]
Length = 492
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 218/518 (42%), Positives = 288/518 (55%), Gaps = 53/518 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT + P+ + VA S A + LD PDF SG + P + Y
Sbjct: 23 AFYTHLMPT-PLPAAYFVAASAQAASLVGLDCARLAEPDFVALLSGNVVAERSRPLSAVY 81
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRAI LG++ ELQLKGAG TPYSR DG AVLRSSIREF
Sbjct: 82 SGHQFGVWAGQLGDGRAILLGDLATADGP-LELQLKGAGATPYSRMGDGRAVLRSSIREF 140
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPT+RAL ++ + + + R+ E A+V R+A SF+RFGS++
Sbjct: 141 LCSEAMAALGIPTSRALSIMGSQQGIMRETV-------ETAAVVTRMAPSFVRFGSFEHW 193
Query: 311 ASRGQ-EDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
R + E+L I LADY I + H+ +N Y A
Sbjct: 194 FYRKKPEELKI---LADYVIDGFYPHLRA---------------------AANPYQALLH 229
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EV RTA ++AQWQ VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N TD G
Sbjct: 230 EVCVRTAHMIAQWQAVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDAQHICNHTDQQG- 288
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY +ANQP +G WN L LI + EA ++ Y F D+ ++ KLGL
Sbjct: 289 RYSYANQPQVGHWNCHALGQAL--LPLIGEVAEAQAALDAYQPAFADKMNGLLRAKLGLQ 346
Query: 489 KY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ + + M + VD+T+FFR L+ ++ + PE + PL+ + +D
Sbjct: 347 TQQDDDTTLFDSMFALMQANSVDFTHFFRTLATLQV--AAPEHD--TPLRDMFID----- 397
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+ + +W +Y LL G D +R+ M+ VNPKYVLRNYL Q AI+ A+ D+ EV
Sbjct: 398 RPGFDAWAATYRARLLQEGSVDAQRQVAMHQVNPKYVLRNYLAQVAIEKAQQQDYTEVTT 457
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LL+++++P+DEQP YA LPP WA V SCSS
Sbjct: 458 LLEILQKPFDEQPEHHHYAALPPDWASHLEV---SCSS 492
>gi|421884910|ref|ZP_16316115.1| hypothetical protein SS209_02075 [Salmonella enterica subsp.
enterica serovar Senftenberg str. SS209]
gi|379985624|emb|CCF88388.1| hypothetical protein SS209_02075 [Salmonella enterica subsp.
enterica serovar Senftenberg str. SS209]
Length = 480
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|254247984|ref|ZP_04941305.1| hypothetical protein BCPG_02802 [Burkholderia cenocepacia PC184]
gi|124872760|gb|EAY64476.1| hypothetical protein BCPG_02802 [Burkholderia cenocepacia PC184]
Length = 611
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 222/536 (41%), Positives = 297/536 (55%), Gaps = 71/536 (13%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPY 186
A +T++ P+A + P +V +S+ VA L+L P +P F F+G P A A+PY
Sbjct: 124 AFHTRL-PAAPLAAPYVVGFSDDVAQLLDLPPAVAAQPGFAELFAG-NPTRDWPAHAMPY 181
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSS
Sbjct: 182 ASVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGGGRTPYSRMGDGRAVLRSS 241
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG
Sbjct: 242 IREFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGH 294
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ S + DL +R LAD+ I + + + + Y A
Sbjct: 295 FEHFFSNDRPDL--LRQLADHVIDRFYPACRDAD---------------------DPYLA 331
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 332 LLEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDT 391
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGT 471
G RY + QP I WN + L A + +DD +A V+ ++
Sbjct: 392 SG-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPE 448
Query: 472 KFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPE 527
+F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 449 RFGPALERAMRAKLGLELEREGDAELANKLLETMHASHADFTLTFRRLAQLSKHDASRD- 507
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
P++ + +D ++A+ +W Y L D R MN NPKYVLRN+L
Sbjct: 508 ----APVRDLFID-----RDAFDAWANLYRARLSEETRDDAARAVAMNRANPKYVLRNHL 558
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 559 AEVAIRRAKEKDFSEVERLAQILRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 611
>gi|12836702|dbj|BAB23774.1| unnamed protein product [Mus musculus]
Length = 664
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 249/630 (39%), Positives = 324/630 (51%), Gaps = 117/630 (18%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +RELP G + + PR V AC+++ P A + P+LVA SE
Sbjct: 46 LAGLRFDNRALRELPVETPPPGPEDSLATPRPVPGACFSRARP-APLRRPRLVALSEPAL 104
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L+ E + LFFSG L G P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 105 ALLGLEASEEAEVEAEAALFFSGNALLPGTEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 164
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+
Sbjct: 165 CTAAGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSE 224
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V R+A +F+RFGS++I H R + DI L
Sbjct: 225 STVMRDVFYDGNPKYEKCTVVLRIAPTFIRFGSFEIFKPPDEHTGRAGPSVGRDDIRVQL 284
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + T D D+ + AA+ EV +RTA +VA+WQ
Sbjct: 285 LDYVISSFYPEIQAAH--------TCDTDNI------QRNAAFFREVTQRTARMVAEWQC 330
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D GR Y ++ QP + WN+
Sbjct: 331 VGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHICNASDNAGR-YTYSKQPQVCKWNL 389
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKL-- 498
+ + L + EA + E + T+F Y M KKLGL + K+ +++KL
Sbjct: 390 QKLAEALEPELPLALAEA-ILKEEFDTEFQRHYLQKMRKKLGLIRVEKEEDGTLVAKLLE 448
Query: 499 ---------------LNNMAVDKVDYTNFFRALSNVKA-------------DP------- 523
L++ D D F L++ A DP
Sbjct: 449 TMHLTGADFTNTFCVLSSFPADLSDSAEFLSRLTSQCASLEELRLAFRPQMDPRQLSMML 508
Query: 524 ----SIPEDELLVPLKAVLL------------------DIGKERKEAWISWVLSYIQEL- 560
S P+ L+ +A + D+ ++ ++ W +W+ Y L
Sbjct: 509 MLAQSNPQLFALIGTQANVTKELERVEHQSRLEQLSPSDLQRKNRDHWEAWLQEYRDRLD 568
Query: 561 -LSSGISD-----EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPY 614
G+ D ER +M + NPKYVLRNY+ Q AI+AAE GDF EVR +LKL+E PY
Sbjct: 569 KEKEGVGDTAAWQAERVRVMRANNPKYVLRNYIAQKAIEAAENGDFSEVRLVLKLLESPY 628
Query: 615 ---DEQPGMEKYAR----------LPPAWA 631
+E G E AR PP WA
Sbjct: 629 HSEEEATGPEAVARSTEEQSSYSNRPPLWA 658
>gi|168822205|ref|ZP_02834205.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Weltevreden str. HI_N05-537]
gi|409250347|ref|YP_006886158.1| UPF0061 protein ydiU [Salmonella enterica subsp. enterica serovar
Weltevreden str. 2007-60-3289-1]
gi|205341292|gb|EDZ28056.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Weltevreden str. HI_N05-537]
gi|320086175|emb|CBY95949.1| UPF0061 protein ydiU [Salmonella enterica subsp. enterica serovar
Weltevreden str. 2007-60-3289-1]
Length = 480
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVLRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|452120485|ref|YP_007470733.1| hypothetical protein CFSAN001992_04875 [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
gi|451909489|gb|AGF81295.1| hypothetical protein CFSAN001992_04875 [Salmonella enterica subsp.
enterica serovar Javiana str. CFSAN001992]
Length = 480
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVATRTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|418513897|ref|ZP_13080118.1| hypothetical protein SEEPO729_00320 [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
gi|366080811|gb|EHN44768.1| hypothetical protein SEEPO729_00320 [Salmonella enterica subsp.
enterica serovar Pomona str. ATCC 10729]
Length = 480
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RTAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|386614256|ref|YP_006133922.1| hypothetical protein UMNK88_2169 [Escherichia coli UMNK88]
gi|332343425|gb|AEE56759.1| conserved hypothetical protein [Escherichia coli UMNK88]
Length = 478
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|194438491|ref|ZP_03070580.1| conserved hypothetical protein [Escherichia coli 101-1]
gi|251785157|ref|YP_002999461.1| hypothetical protein B21_01664 [Escherichia coli BL21(DE3)]
gi|253773338|ref|YP_003036169.1| hypothetical protein ECBD_1939 [Escherichia coli
'BL21-Gold(DE3)pLysS AG']
gi|254161766|ref|YP_003044874.1| hypothetical protein ECB_01675 [Escherichia coli B str. REL606]
gi|254288554|ref|YP_003054302.1| hypothetical protein ECD_01675 [Escherichia coli BL21(DE3)]
gi|297517829|ref|ZP_06936215.1| hypothetical protein EcolOP_09357 [Escherichia coli OP50]
gi|300930820|ref|ZP_07146191.1| SelO family protein [Escherichia coli MS 187-1]
gi|422786291|ref|ZP_16839030.1| hypothetical protein ERGG_01441 [Escherichia coli H489]
gi|422789606|ref|ZP_16842311.1| hypothetical protein ERHG_00089 [Escherichia coli TA007]
gi|432580450|ref|ZP_19816876.1| hypothetical protein A1SK_04222 [Escherichia coli KTE56]
gi|442598271|ref|ZP_21016043.1| Selenoprotein O and cysteine-containing homologs [Escherichia coli
O5:K4(L):H4 str. ATCC 23502]
gi|194422501|gb|EDX38499.1| conserved hypothetical protein [Escherichia coli 101-1]
gi|242377430|emb|CAQ32181.1| conserved protein [Escherichia coli BL21(DE3)]
gi|253324382|gb|ACT28984.1| protein of unknown function UPF0061 [Escherichia coli
'BL21-Gold(DE3)pLysS AG']
gi|253973667|gb|ACT39338.1| hypothetical protein ECB_01675 [Escherichia coli B str. REL606]
gi|253977861|gb|ACT43531.1| hypothetical protein ECD_01675 [Escherichia coli BL21(DE3)]
gi|300461334|gb|EFK24827.1| SelO family protein [Escherichia coli MS 187-1]
gi|323962090|gb|EGB57686.1| hypothetical protein ERGG_01441 [Escherichia coli H489]
gi|323973913|gb|EGB69085.1| hypothetical protein ERHG_00089 [Escherichia coli TA007]
gi|431105281|gb|ELE09616.1| hypothetical protein A1SK_04222 [Escherichia coli KTE56]
gi|441653011|emb|CCQ03971.1| Selenoprotein O and cysteine-containing homologs [Escherichia coli
O5:K4(L):H4 str. ATCC 23502]
Length = 478
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417184843|ref|ZP_12010377.1| hypothetical protein EC930624_1180 [Escherichia coli 93.0624]
gi|386183312|gb|EIH66061.1| hypothetical protein EC930624_1180 [Escherichia coli 93.0624]
Length = 478
Score = 362 bits (928), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG T YSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTSYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|450215073|ref|ZP_21895409.1| hypothetical protein C202_08121 [Escherichia coli O08]
gi|449319291|gb|EMD09344.1| hypothetical protein C202_08121 [Escherichia coli O08]
Length = 478
Score = 362 bits (928), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLQPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLHRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|261822020|ref|YP_003260126.1| hypothetical protein Pecwa_2765 [Pectobacterium wasabiae WPP163]
gi|261606033|gb|ACX88519.1| protein of unknown function UPF0061 [Pectobacterium wasabiae
WPP163]
Length = 483
Score = 361 bits (927), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 224/517 (43%), Positives = 288/517 (55%), Gaps = 58/517 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT + P+ + +L+ SE +A L L F P + G L+G P AQ Y G
Sbjct: 19 YTALPPTP-LHGARLLYHSEGLAAELGLSSDWFT-PAQDNVWGGERLLSGMEPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGE--ILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
HQFGMWAGQLGDGR I LGE + + +S W LKGAG TPYSR DG AVLRS IREF
Sbjct: 77 HQFGMWAGQLGDGRGILLGEQQLADGRSVDW--HLKGAGLTPYSRMGDGRAVLRSVIREF 134
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH+LGIPTTRAL +VT+ V R+ +EE GA++ RVA+S +RFG ++
Sbjct: 135 LASEAMHYLGIPTTRALTIVTSTHLVQRE-------QEEKGAMLLRVAESHVRFGHFEHF 187
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
R + + VR L +Y I H+ EN DE +Y W +
Sbjct: 188 YYR--REPEKVRQLVEYVIARHWPQWEN------------DE---------RRYELWFGD 224
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V ERTA L+ WQ VGF+HGV+NTDNMSILGLTIDYGP+GFLDA+ P F N +D G R
Sbjct: 225 VVERTARLITHWQAVGFSHGVMNTDNMSILGLTIDYGPYGFLDAYQPDFICNHSDHRG-R 283
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL--- 487
Y F NQP +GLWN+ + + L+ L+D + RY M Y +M KLGL
Sbjct: 284 YAFDNQPAVGLWNLHRLAQALSG--LMDTDTLERALARYEPALMQHYGTLMRAKLGLFTA 341
Query: 488 -PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
P N +++ LL M + DYT FR L++ + S +A L D +R
Sbjct: 342 SPDDND-VLAGLLRLMQKEGSDYTRTFRLLADSEKQAS----------RASLRDEFIDRA 390
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
A+ +W +Y Q L+ DEER+ LMN+ NPKY+LRNYL Q AI+ AE D + RL
Sbjct: 391 -AFDNWFAAYRQRLMQEDQGDEERRRLMNATNPKYILRNYLAQMAIERAENDDISVLARL 449
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + RP+DEQ A LPP W +SCSS
Sbjct: 450 HQALCRPFDEQSDNNDLAALPPDWGKH---LEISCSS 483
>gi|442593389|ref|ZP_21011340.1| Selenoprotein O and cysteine-containing homologs [Escherichia coli
O10:K5(L):H4 str. ATCC 23506]
gi|441606875|emb|CCP96667.1| Selenoprotein O and cysteine-containing homologs [Escherichia coli
O10:K5(L):H4 str. ATCC 23506]
Length = 478
Score = 361 bits (927), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEYFYYRRES--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|419316722|ref|ZP_13858536.1| hypothetical protein ECDEC12A_2026 [Escherichia coli DEC12A]
gi|419328843|ref|ZP_13870460.1| hypothetical protein ECDEC12C_2049 [Escherichia coli DEC12C]
gi|419339966|ref|ZP_13881443.1| hypothetical protein ECDEC12E_2097 [Escherichia coli DEC12E]
gi|378171419|gb|EHX32286.1| hypothetical protein ECDEC12A_2026 [Escherichia coli DEC12A]
gi|378172600|gb|EHX33451.1| hypothetical protein ECDEC12C_2049 [Escherichia coli DEC12C]
gi|378191432|gb|EHX52008.1| hypothetical protein ECDEC12E_2097 [Escherichia coli DEC12E]
Length = 478
Score = 361 bits (927), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGD R I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDERGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLIRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|422774398|ref|ZP_16828054.1| ydiU [Escherichia coli H120]
gi|323948103|gb|EGB44094.1| ydiU [Escherichia coli H120]
Length = 478
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AI H++ ++E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIHHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLHRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|300924745|ref|ZP_07140689.1| SelO family protein [Escherichia coli MS 182-1]
gi|300419079|gb|EFK02390.1| SelO family protein [Escherichia coli MS 182-1]
Length = 478
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ ++E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSYLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLHRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+ Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWWAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVSRPPDWGKRLEV---SCSS 478
>gi|194434790|ref|ZP_03067040.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|416281734|ref|ZP_11646042.1| hypothetical protein SGB_01581 [Shigella boydii ATCC 9905]
gi|417672217|ref|ZP_12321690.1| hypothetical protein SD15574_1851 [Shigella dysenteriae 155-74]
gi|194416959|gb|EDX33078.1| conserved hypothetical protein [Shigella dysenteriae 1012]
gi|320181264|gb|EFW56183.1| hypothetical protein SGB_01581 [Shigella boydii ATCC 9905]
gi|332093952|gb|EGI99005.1| hypothetical protein SD15574_1851 [Shigella dysenteriae 155-74]
Length = 478
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF H V+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHRVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432861834|ref|ZP_20086594.1| hypothetical protein A311_02326 [Escherichia coli KTE146]
gi|431405581|gb|ELG88814.1| hypothetical protein A311_02326 [Escherichia coli KTE146]
Length = 478
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAASHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432416926|ref|ZP_19659537.1| hypothetical protein WGI_02431 [Escherichia coli KTE44]
gi|430940288|gb|ELC60471.1| hypothetical protein WGI_02431 [Escherichia coli KTE44]
Length = 478
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGISP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFAQYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|218705206|ref|YP_002412725.1| hypothetical protein ECUMN_1997 [Escherichia coli UMN026]
gi|293405205|ref|ZP_06649197.1| hypothetical protein ECGG_00544 [Escherichia coli FVEC1412]
gi|298380848|ref|ZP_06990447.1| ydiU protein [Escherichia coli FVEC1302]
gi|300898509|ref|ZP_07116844.1| SelO family protein [Escherichia coli MS 198-1]
gi|432353618|ref|ZP_19596892.1| hypothetical protein WCA_02591 [Escherichia coli KTE2]
gi|432401969|ref|ZP_19644722.1| hypothetical protein WEK_02152 [Escherichia coli KTE26]
gi|432426142|ref|ZP_19668647.1| hypothetical protein A139_01528 [Escherichia coli KTE181]
gi|432460761|ref|ZP_19702912.1| hypothetical protein A15I_01628 [Escherichia coli KTE204]
gi|432537870|ref|ZP_19774773.1| hypothetical protein A195_01483 [Escherichia coli KTE235]
gi|432631442|ref|ZP_19867371.1| hypothetical protein A1UW_01815 [Escherichia coli KTE80]
gi|432641088|ref|ZP_19876925.1| hypothetical protein A1W1_01949 [Escherichia coli KTE83]
gi|432666074|ref|ZP_19901656.1| hypothetical protein A1Y3_02673 [Escherichia coli KTE116]
gi|433053212|ref|ZP_20240407.1| hypothetical protein WIK_02020 [Escherichia coli KTE122]
gi|433067990|ref|ZP_20254791.1| hypothetical protein WIQ_01872 [Escherichia coli KTE128]
gi|433178350|ref|ZP_20362762.1| hypothetical protein WGM_01991 [Escherichia coli KTE82]
gi|226725729|sp|B7N544.1|YDIU_ECOLU RecName: Full=UPF0061 protein YdiU
gi|218432303|emb|CAR13193.1| conserved hypothetical protein [Escherichia coli UMN026]
gi|291427413|gb|EFF00440.1| hypothetical protein ECGG_00544 [Escherichia coli FVEC1412]
gi|298278290|gb|EFI19804.1| ydiU protein [Escherichia coli FVEC1302]
gi|300357817|gb|EFJ73687.1| SelO family protein [Escherichia coli MS 198-1]
gi|430875859|gb|ELB99380.1| hypothetical protein WCA_02591 [Escherichia coli KTE2]
gi|430926799|gb|ELC47386.1| hypothetical protein WEK_02152 [Escherichia coli KTE26]
gi|430956482|gb|ELC75156.1| hypothetical protein A139_01528 [Escherichia coli KTE181]
gi|430989474|gb|ELD05928.1| hypothetical protein A15I_01628 [Escherichia coli KTE204]
gi|431069784|gb|ELD78104.1| hypothetical protein A195_01483 [Escherichia coli KTE235]
gi|431170910|gb|ELE71091.1| hypothetical protein A1UW_01815 [Escherichia coli KTE80]
gi|431183353|gb|ELE83169.1| hypothetical protein A1W1_01949 [Escherichia coli KTE83]
gi|431201449|gb|ELF00146.1| hypothetical protein A1Y3_02673 [Escherichia coli KTE116]
gi|431571608|gb|ELI44478.1| hypothetical protein WIK_02020 [Escherichia coli KTE122]
gi|431585682|gb|ELI57629.1| hypothetical protein WIQ_01872 [Escherichia coli KTE128]
gi|431704714|gb|ELJ69339.1| hypothetical protein WGM_01991 [Escherichia coli KTE82]
Length = 478
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + P + D ++ G T L G P
Sbjct: 10 RDELPGTYTALSPTP-LNNARLIWHNTELANTLSI-PSSLFKNDAGVW-GGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|24112898|ref|NP_707408.1| hypothetical protein SF1525 [Shigella flexneri 2a str. 301]
gi|30063027|ref|NP_837198.1| hypothetical protein S1642 [Shigella flexneri 2a str. 2457T]
gi|415856440|ref|ZP_11531426.1| hypothetical protein SF2457T_2418 [Shigella flexneri 2a str. 2457T]
gi|417702094|ref|ZP_12351215.1| hypothetical protein SFK218_2369 [Shigella flexneri K-218]
gi|417723077|ref|ZP_12371894.1| hypothetical protein SFK304_2129 [Shigella flexneri K-304]
gi|417733314|ref|ZP_12381974.1| hypothetical protein SF274771_1862 [Shigella flexneri 2747-71]
gi|417736824|ref|ZP_12385438.1| hypothetical protein SF434370_0140 [Shigella flexneri 4343-70]
gi|417743173|ref|ZP_12391714.1| conserved protein [Shigella flexneri 2930-71]
gi|418255751|ref|ZP_12880032.1| hypothetical protein SF660363_1844 [Shigella flexneri 6603-63]
gi|420341628|ref|ZP_14843128.1| hypothetical protein SFK404_2215 [Shigella flexneri K-404]
gi|33516996|sp|Q83L33.1|YDIU_SHIFL RecName: Full=UPF0061 protein YdiU
gi|24051844|gb|AAN43115.1| conserved hypothetical protein [Shigella flexneri 2a str. 301]
gi|30041276|gb|AAP17005.1| hypothetical protein S1642 [Shigella flexneri 2a str. 2457T]
gi|313649272|gb|EFS13706.1| hypothetical protein SF2457T_2418 [Shigella flexneri 2a str. 2457T]
gi|332758672|gb|EGJ88991.1| hypothetical protein SF274771_1862 [Shigella flexneri 2747-71]
gi|332762554|gb|EGJ92819.1| hypothetical protein SF434370_0140 [Shigella flexneri 4343-70]
gi|332767231|gb|EGJ97426.1| conserved protein [Shigella flexneri 2930-71]
gi|333004328|gb|EGK23859.1| hypothetical protein SFK218_2369 [Shigella flexneri K-218]
gi|333018249|gb|EGK37551.1| hypothetical protein SFK304_2129 [Shigella flexneri K-304]
gi|391269664|gb|EIQ28564.1| hypothetical protein SFK404_2215 [Shigella flexneri K-404]
gi|397898593|gb|EJL14976.1| hypothetical protein SF660363_1844 [Shigella flexneri 6603-63]
Length = 478
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQF +WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFVVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLREEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|300958592|ref|ZP_07170719.1| SelO family protein [Escherichia coli MS 175-1]
gi|300314755|gb|EFJ64539.1| SelO family protein [Escherichia coli MS 175-1]
Length = 478
Score = 361 bits (927), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFNNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|222156457|ref|YP_002556596.1| hypothetical protein LF82_2886 [Escherichia coli LF82]
gi|387617046|ref|YP_006120068.1| hypothetical protein NRG857_08550 [Escherichia coli O83:H1 str. NRG
857C]
gi|222033462|emb|CAP76203.1| UPF0061 protein ydiU [Escherichia coli LF82]
gi|312946307|gb|ADR27134.1| hypothetical protein NRG857_08550 [Escherichia coli O83:H1 str. NRG
857C]
Length = 478
Score = 361 bits (927), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 218/515 (42%), Positives = 293/515 (56%), Gaps = 55/515 (10%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
YT +SP+ + N +L+ + +A++L + F+ + + G T L G P AQ Y
Sbjct: 16 TYTALSPTP-LNNARLIWHNAELANTLSIPSSLFK--NGAGVWGGETLLPGMSPLAQVYS 72
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS+IRE L
Sbjct: 73 GHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESL 132
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG ++
Sbjct: 133 ASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFGHFEHFY 185
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
R + + VR LAD+AIRH++ H+E+ DED KY W +V
Sbjct: 186 YRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYRLWFSDV 222
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D G RY
Sbjct: 223 VARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSDHQG-RY 281
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---P 488
F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KLG
Sbjct: 282 SFDNQPAVALWNLQRLAQTLSPFVAVDG--LNEALDSYQQVLLTHYGQRMRQKLGFMTEQ 339
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
K + ++++L + MA ++ DYT FR LS + + PL+ +D + A
Sbjct: 340 KEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-----RAA 388
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E+ RL +
Sbjct: 389 FDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTELHRLHE 448
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+ ++ + Y PP W R V SCSS
Sbjct: 449 ALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|384543144|ref|YP_005727206.1| hypothetical protein SFxv_1708 [Shigella flexneri 2002017]
gi|281600929|gb|ADA73913.1| hypothetical protein SFxv_1708 [Shigella flexneri 2002017]
Length = 496
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 28 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 84
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQF +WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 85 LAQVYSGHQFVVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 144
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 145 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 197
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 198 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 234
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 235 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 294
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 295 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 351
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 352 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLREEFID-- 403
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 404 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 460
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 461 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 496
>gi|197264163|ref|ZP_03164237.1| protein YdiU [Salmonella enterica subsp. enterica serovar Saintpaul
str. SARA23]
gi|378954891|ref|YP_005212378.1| hypothetical protein SPUL_1161 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|421358156|ref|ZP_15808454.1| hypothetical protein SEEE3139_08904 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|421364579|ref|ZP_15814811.1| hypothetical protein SEEE0166_18252 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|421366632|ref|ZP_15816834.1| hypothetical protein SEEE0631_05568 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|421373546|ref|ZP_15823686.1| hypothetical protein SEEE0424_17649 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|421377069|ref|ZP_15827168.1| hypothetical protein SEEE3076_12583 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|421381568|ref|ZP_15831623.1| hypothetical protein SEEE4917_12333 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|421385248|ref|ZP_15835270.1| hypothetical protein SEEE6622_08149 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|421390424|ref|ZP_15840399.1| hypothetical protein SEEE6670_11432 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|421393684|ref|ZP_15843628.1| hypothetical protein SEEE6426_05124 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|421398270|ref|ZP_15848178.1| hypothetical protein SEEE6437_06046 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|421404082|ref|ZP_15853926.1| hypothetical protein SEEE7246_12520 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|421409593|ref|ZP_15859383.1| hypothetical protein SEEE7250_17622 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|421413316|ref|ZP_15863070.1| hypothetical protein SEEE1427_13541 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|421418628|ref|ZP_15868329.1| hypothetical protein SEEE2659_17626 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|421422304|ref|ZP_15871972.1| hypothetical protein SEEE1757_13409 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|421426459|ref|ZP_15876087.1| hypothetical protein SEEE5101_11612 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|421432790|ref|ZP_15882358.1| hypothetical protein SEEE8B1_20782 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|421434794|ref|ZP_15884340.1| hypothetical protein SEEE5518_07585 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|421442314|ref|ZP_15891774.1| hypothetical protein SEEE1618_22719 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|421444604|ref|ZP_15894034.1| hypothetical protein SEEE3079_11177 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|421448107|ref|ZP_15897502.1| hypothetical protein SEEE6482_06111 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|436596487|ref|ZP_20512552.1| hypothetical protein SEE22704_04155 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|436809054|ref|ZP_20528434.1| hypothetical protein SEEE1882_11499 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|436815190|ref|ZP_20532741.1| hypothetical protein SEEE1884_10388 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|436844613|ref|ZP_20538371.1| hypothetical protein SEEE1594_16098 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|436854056|ref|ZP_20543690.1| hypothetical protein SEEE1566_20189 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|436857546|ref|ZP_20546066.1| hypothetical protein SEEE1580_09505 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|436864719|ref|ZP_20550686.1| hypothetical protein SEEE1543_10290 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|436873717|ref|ZP_20556441.1| hypothetical protein SEEE1441_16927 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|436878085|ref|ZP_20558940.1| hypothetical protein SEEE1810_06832 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|436888374|ref|ZP_20564703.1| hypothetical protein SEEE1558_13209 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|436895842|ref|ZP_20568598.1| hypothetical protein SEEE1018_09957 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|436901724|ref|ZP_20572634.1| hypothetical protein SEEE1010_07769 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|436912236|ref|ZP_20578065.1| hypothetical protein SEEE1729_12680 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|436922168|ref|ZP_20584393.1| hypothetical protein SEEE0895_21875 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|436927095|ref|ZP_20586921.1| hypothetical protein SEEE0899_11659 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|436936187|ref|ZP_20591627.1| hypothetical protein SEEE1457_12741 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|436943377|ref|ZP_20596323.1| hypothetical protein SEEE1747_13882 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|436951135|ref|ZP_20600190.1| hypothetical protein SEEE0968_10534 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|436961540|ref|ZP_20604914.1| hypothetical protein SEEE1444_11555 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|436970866|ref|ZP_20609259.1| hypothetical protein SEEE1445_10726 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|436983531|ref|ZP_20614120.1| hypothetical protein SEEE1559_12742 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|436994385|ref|ZP_20618856.1| hypothetical protein SEEE1565_13877 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|437007113|ref|ZP_20623164.1| hypothetical protein SEEE1808_13068 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|437023983|ref|ZP_20629192.1| hypothetical protein SEEE1811_20724 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|437030305|ref|ZP_20631275.1| hypothetical protein SEEE0956_08331 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|437040684|ref|ZP_20634819.1| hypothetical protein SEEE1455_03345 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|437053939|ref|ZP_20642738.1| hypothetical protein SEEE1575_20881 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|437058707|ref|ZP_20645554.1| hypothetical protein SEEE1725_12514 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|437070470|ref|ZP_20651648.1| hypothetical protein SEEE1745_20543 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|437076397|ref|ZP_20654760.1| hypothetical protein SEEE1791_13397 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|437081241|ref|ZP_20657693.1| hypothetical protein SEEE1795_05531 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|437091596|ref|ZP_20663196.1| hypothetical protein SEEE6709_10832 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|437101809|ref|ZP_20666258.1| hypothetical protein SEEE9058_03379 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|437121039|ref|ZP_20671679.1| hypothetical protein SEEE0816_08086 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|437131001|ref|ZP_20677131.1| hypothetical protein SEEE0819_12840 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|437138753|ref|ZP_20681235.1| hypothetical protein SEEE3072_10757 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|437145608|ref|ZP_20685515.1| hypothetical protein SEEE3089_09532 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|437156887|ref|ZP_20692423.1| hypothetical protein SEEE9163_21702 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|437158751|ref|ZP_20693509.1| hypothetical protein SEEE151_04298 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|437165982|ref|ZP_20697767.1| hypothetical protein SEEEN202_03231 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|437177758|ref|ZP_20704228.1| hypothetical protein SEEE3991_13361 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|437186098|ref|ZP_20709367.1| hypothetical protein SEEE3618_16824 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|437244007|ref|ZP_20714577.1| hypothetical protein SEEE1831_20768 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|437258828|ref|ZP_20716748.1| hypothetical protein SEEE2490_05054 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|437268397|ref|ZP_20721867.1| hypothetical protein SEEEL909_08413 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|437277236|ref|ZP_20726755.1| hypothetical protein SEEEL913_10280 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|437293343|ref|ZP_20732058.1| hypothetical protein SEEE4941_14592 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|437312314|ref|ZP_20736422.1| hypothetical protein SEEE7015_14045 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|437409733|ref|ZP_20752517.1| hypothetical protein SEEE2217_04287 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|437452188|ref|ZP_20759669.1| hypothetical protein SEEE4018_17935 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|437460691|ref|ZP_20761645.1| hypothetical protein SEEE6211_04737 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|437473526|ref|ZP_20765827.1| hypothetical protein SEEE4441_03109 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|437514470|ref|ZP_20777833.1| hypothetical protein SEEE9845_18965 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|437525481|ref|ZP_20779790.1| hypothetical protein SEEE9317_05778 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|437560882|ref|ZP_20786166.1| hypothetical protein SEEE0116_15275 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|437577778|ref|ZP_20791127.1| hypothetical protein SEEE1117_17344 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|437601211|ref|ZP_20797534.1| hypothetical protein SEEE0268_04143 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|437613790|ref|ZP_20801670.1| hypothetical protein SEEE0316_02194 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|437633654|ref|ZP_20806732.1| hypothetical protein SEEE0436_05026 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|437657994|ref|ZP_20811325.1| hypothetical protein SEEE1319_04738 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|437683396|ref|ZP_20818787.1| hypothetical protein SEEE4481_20299 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|437696946|ref|ZP_20822609.1| hypothetical protein SEEE6297_15965 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|437704709|ref|ZP_20824765.1| hypothetical protein SEEE4220_04010 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|437728026|ref|ZP_20830370.1| hypothetical protein SEEE1616_09290 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|437789182|ref|ZP_20837091.1| hypothetical protein SEEE2651_21023 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|437808116|ref|ZP_20839952.1| hypothetical protein SEEE3944_10563 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|437945559|ref|ZP_20851804.1| hypothetical protein SEEE5621_24765 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|438091983|ref|ZP_20861200.1| hypothetical protein SEEE2625_18611 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|438099916|ref|ZP_20863660.1| hypothetical protein SEEE1976_07969 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|438110546|ref|ZP_20867944.1| hypothetical protein SEEE3407_06926 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|438125829|ref|ZP_20872756.1| hypothetical protein SEEP9120_04350 [Salmonella enterica subsp.
enterica serovar Pullorum str. ATCC 9120]
gi|445170612|ref|ZP_21395785.1| hypothetical protein SEE8A_016289 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|445194704|ref|ZP_21400271.1| hypothetical protein SE20037_11790 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|445224013|ref|ZP_21403512.1| hypothetical protein SEE10_017640 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|445353061|ref|ZP_21420953.1| hypothetical protein SEE13_019630 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|445357183|ref|ZP_21422103.1| hypothetical protein SEE23_009276 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|197242418|gb|EDY25038.1| protein YdiU [Salmonella enterica subsp. enterica serovar Saintpaul
str. SARA23]
gi|357205502|gb|AET53548.1| hypothetical protein SPUL_1161 [Salmonella enterica subsp. enterica
serovar Gallinarum/pullorum str. RKS5078]
gi|395984068|gb|EJH93258.1| hypothetical protein SEEE0166_18252 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|395988460|gb|EJH97616.1| hypothetical protein SEEE3139_08904 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|395989287|gb|EJH98421.1| hypothetical protein SEEE0631_05568 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|395996665|gb|EJI05710.1| hypothetical protein SEEE0424_17649 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|396000691|gb|EJI09705.1| hypothetical protein SEEE3076_12583 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|396001531|gb|EJI10543.1| hypothetical protein SEEE4917_12333 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|396014234|gb|EJI23120.1| hypothetical protein SEEE6670_11432 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|396016685|gb|EJI25552.1| hypothetical protein SEEE6622_08149 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|396017567|gb|EJI26432.1| hypothetical protein SEEE6426_05124 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|396024890|gb|EJI33674.1| hypothetical protein SEEE7250_17622 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|396027162|gb|EJI35926.1| hypothetical protein SEEE7246_12520 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|396031343|gb|EJI40070.1| hypothetical protein SEEE6437_06046 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|396037906|gb|EJI46550.1| hypothetical protein SEEE2659_17626 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|396040404|gb|EJI49028.1| hypothetical protein SEEE1427_13541 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|396041619|gb|EJI50242.1| hypothetical protein SEEE1757_13409 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|396049006|gb|EJI57549.1| hypothetical protein SEEE8B1_20782 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|396053966|gb|EJI62459.1| hypothetical protein SEEE5101_11612 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|396059175|gb|EJI67630.1| hypothetical protein SEEE5518_07585 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|396062991|gb|EJI71402.1| hypothetical protein SEEE1618_22719 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|396067035|gb|EJI75395.1| hypothetical protein SEEE3079_11177 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|396073707|gb|EJI82007.1| hypothetical protein SEEE6482_06111 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|434942516|gb|ELL48793.1| hypothetical protein SEEP9120_04350 [Salmonella enterica subsp.
enterica serovar Pullorum str. ATCC 9120]
gi|434966871|gb|ELL59706.1| hypothetical protein SEEE1882_11499 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|434973306|gb|ELL65694.1| hypothetical protein SEEE1884_10388 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|434976961|gb|ELL69134.1| hypothetical protein SEE22704_04155 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|434979199|gb|ELL71191.1| hypothetical protein SEEE1594_16098 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|434982859|gb|ELL74667.1| hypothetical protein SEEE1566_20189 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|434989698|gb|ELL81248.1| hypothetical protein SEEE1580_09505 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|434995754|gb|ELL87070.1| hypothetical protein SEEE1543_10290 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|434998474|gb|ELL89695.1| hypothetical protein SEEE1441_16927 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|435008022|gb|ELL98849.1| hypothetical protein SEEE1810_06832 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|435010084|gb|ELM00870.1| hypothetical protein SEEE1558_13209 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|435015731|gb|ELM06257.1| hypothetical protein SEEE1018_09957 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|435021158|gb|ELM11547.1| hypothetical protein SEEE1010_07769 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|435024486|gb|ELM14692.1| hypothetical protein SEEE0895_21875 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|435026481|gb|ELM16612.1| hypothetical protein SEEE1729_12680 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|435036936|gb|ELM26755.1| hypothetical protein SEEE0899_11659 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|435039025|gb|ELM28806.1| hypothetical protein SEEE1457_12741 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|435043576|gb|ELM33293.1| hypothetical protein SEEE1747_13882 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|435050679|gb|ELM40183.1| hypothetical protein SEEE1444_11555 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|435051602|gb|ELM41104.1| hypothetical protein SEEE0968_10534 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|435057155|gb|ELM46524.1| hypothetical protein SEEE1445_10726 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|435064544|gb|ELM53672.1| hypothetical protein SEEE1565_13877 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|435065969|gb|ELM55074.1| hypothetical protein SEEE1559_12742 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|435070029|gb|ELM59028.1| hypothetical protein SEEE1808_13068 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|435073790|gb|ELM62645.1| hypothetical protein SEEE1811_20724 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|435082070|gb|ELM70695.1| hypothetical protein SEEE0956_08331 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|435087140|gb|ELM75657.1| hypothetical protein SEEE1455_03345 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|435088953|gb|ELM77408.1| hypothetical protein SEEE1575_20881 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|435090441|gb|ELM78843.1| hypothetical protein SEEE1745_20543 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|435094520|gb|ELM82859.1| hypothetical protein SEEE1725_12514 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|435105694|gb|ELM93731.1| hypothetical protein SEEE1791_13397 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|435111860|gb|ELM99748.1| hypothetical protein SEEE1795_05531 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|435112502|gb|ELN00367.1| hypothetical protein SEEE6709_10832 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|435123788|gb|ELN11279.1| hypothetical protein SEEE9058_03379 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|435124975|gb|ELN12431.1| hypothetical protein SEEE0819_12840 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|435126117|gb|ELN13523.1| hypothetical protein SEEE0816_08086 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|435132275|gb|ELN19473.1| hypothetical protein SEEE3072_10757 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|435135494|gb|ELN22603.1| hypothetical protein SEEE9163_21702 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|435137069|gb|ELN24140.1| hypothetical protein SEEE3089_09532 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|435150555|gb|ELN37222.1| hypothetical protein SEEE151_04298 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|435153339|gb|ELN39947.1| hypothetical protein SEEEN202_03231 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|435154606|gb|ELN41185.1| hypothetical protein SEEE3991_13361 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|435158972|gb|ELN45342.1| hypothetical protein SEEE3618_16824 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|435166075|gb|ELN52077.1| hypothetical protein SEEE2490_05054 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|435173422|gb|ELN58932.1| hypothetical protein SEEEL913_10280 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|435174576|gb|ELN60018.1| hypothetical protein SEEEL909_08413 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|435176880|gb|ELN62230.1| hypothetical protein SEEE1831_20768 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|435180782|gb|ELN65887.1| hypothetical protein SEEE4941_14592 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|435183446|gb|ELN68421.1| hypothetical protein SEEE7015_14045 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|435204732|gb|ELN88396.1| hypothetical protein SEEE2217_04287 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|435208508|gb|ELN91917.1| hypothetical protein SEEE4018_17935 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|435220983|gb|ELO03257.1| hypothetical protein SEEE6211_04737 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|435225046|gb|ELO06979.1| hypothetical protein SEEE4441_03109 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|435229469|gb|ELO10830.1| hypothetical protein SEEE9845_18965 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|435238208|gb|ELO18857.1| hypothetical protein SEEE0116_15275 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|435242720|gb|ELO23024.1| hypothetical protein SEEE1117_17344 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|435248337|gb|ELO28223.1| hypothetical protein SEEE9317_05778 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|435261493|gb|ELO40648.1| hypothetical protein SEEE0268_04143 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|435264265|gb|ELO43197.1| hypothetical protein SEEE0316_02194 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|435269329|gb|ELO47874.1| hypothetical protein SEEE4481_20299 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|435270689|gb|ELO49174.1| hypothetical protein SEEE1319_04738 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|435276534|gb|ELO54536.1| hypothetical protein SEEE6297_15965 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|435282083|gb|ELO59721.1| hypothetical protein SEEE0436_05026 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|435290910|gb|ELO67801.1| hypothetical protein SEEE1616_09290 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 16-16]
gi|435292881|gb|ELO69621.1| hypothetical protein SEEE4220_04010 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|435295310|gb|ELO71821.1| hypothetical protein SEEE2651_21023 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 76-2651]
gi|435300458|gb|ELO76549.1| hypothetical protein SEEE3944_10563 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|435307827|gb|ELO82868.1| hypothetical protein SEEE5621_24765 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|435315567|gb|ELO88799.1| hypothetical protein SEEE2625_18611 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|435325514|gb|ELO97379.1| hypothetical protein SEEE1976_07969 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|435331753|gb|ELP02851.1| hypothetical protein SEEE3407_06926 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|444862237|gb|ELX87096.1| hypothetical protein SEE8A_016289 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|444866059|gb|ELX90811.1| hypothetical protein SE20037_11790 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|444868759|gb|ELX93374.1| hypothetical protein SEE10_017640 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|444873238|gb|ELX97539.1| hypothetical protein SEE13_019630 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|444886783|gb|ELY10528.1| hypothetical protein SEE23_009276 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
Length = 480
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 296/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPLVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|16129662|ref|NP_416221.1| conserved protein, UPF0061 family [Escherichia coli str. K-12
substr. MG1655]
gi|170081365|ref|YP_001730685.1| hypothetical protein ECDH10B_1842 [Escherichia coli str. K-12
substr. DH10B]
gi|238900921|ref|YP_002926717.1| hypothetical protein BWG_1520 [Escherichia coli BW2952]
gi|300951303|ref|ZP_07165149.1| SelO family protein [Escherichia coli MS 116-1]
gi|301027845|ref|ZP_07191148.1| SelO family protein [Escherichia coli MS 196-1]
gi|301647894|ref|ZP_07247673.1| SelO family protein [Escherichia coli MS 146-1]
gi|331642304|ref|ZP_08343439.1| putative cytoplasmic protein [Escherichia coli H736]
gi|386280771|ref|ZP_10058435.1| UPF0061 protein ydiU [Escherichia sp. 4_1_40B]
gi|386595482|ref|YP_006091882.1| hypothetical protein [Escherichia coli DH1]
gi|387612195|ref|YP_006115311.1| hypothetical protein ETEC_1739 [Escherichia coli ETEC H10407]
gi|387621424|ref|YP_006129051.1| hypothetical protein ECDH1ME8569_1650 [Escherichia coli DH1]
gi|388477780|ref|YP_489968.1| hypothetical protein Y75_p1681 [Escherichia coli str. K-12 substr.
W3110]
gi|415773583|ref|ZP_11486178.1| conserved hypothetical protein [Escherichia coli 3431]
gi|417261217|ref|ZP_12048705.1| hypothetical protein EC23916_2512 [Escherichia coli 2.3916]
gi|417271675|ref|ZP_12059024.1| hypothetical protein EC24168_1910 [Escherichia coli 2.4168]
gi|417277020|ref|ZP_12064346.1| hypothetical protein EC32303_1856 [Escherichia coli 3.2303]
gi|417292688|ref|ZP_12079969.1| hypothetical protein ECB41_1895 [Escherichia coli B41]
gi|417613071|ref|ZP_12263533.1| hypothetical protein ECSTECEH250_2125 [Escherichia coli STEC_EH250]
gi|417618253|ref|ZP_12268674.1| hypothetical protein ECG581_2058 [Escherichia coli G58-1]
gi|417634615|ref|ZP_12284829.1| hypothetical protein ECSTECS1191_2528 [Escherichia coli STEC_S1191]
gi|417943376|ref|ZP_12586624.1| hypothetical protein IAE_00195 [Escherichia coli XH140A]
gi|417974802|ref|ZP_12615603.1| hypothetical protein IAM_00640 [Escherichia coli XH001]
gi|418302966|ref|ZP_12914760.1| hypothetical protein UMNF18_2153 [Escherichia coli UMNF18]
gi|418957936|ref|ZP_13509859.1| SelO family protein [Escherichia coli J53]
gi|419142341|ref|ZP_13687088.1| hypothetical protein ECDEC6A_1984 [Escherichia coli DEC6A]
gi|419148294|ref|ZP_13692971.1| hypothetical protein ECDEC6B_2319 [Escherichia coli DEC6B]
gi|419153805|ref|ZP_13698376.1| hypothetical protein ECDEC6C_1964 [Escherichia coli DEC6C]
gi|419159197|ref|ZP_13703706.1| hypothetical protein ECDEC6D_2002 [Escherichia coli DEC6D]
gi|419164415|ref|ZP_13708872.1| hypothetical protein ECDEC6E_2131 [Escherichia coli DEC6E]
gi|419809848|ref|ZP_14334732.1| hypothetical protein UWO_04941 [Escherichia coli O32:H37 str. P4]
gi|419941789|ref|ZP_14458447.1| hypothetical protein EC75_20699 [Escherichia coli 75]
gi|421774060|ref|ZP_16210673.1| SelO family protein [Escherichia coli AD30]
gi|422766271|ref|ZP_16819998.1| ydiU [Escherichia coli E1520]
gi|422772418|ref|ZP_16826106.1| ydiU [Escherichia coli E482]
gi|422817012|ref|ZP_16865226.1| UPF0061 protein ydiU [Escherichia coli M919]
gi|425115082|ref|ZP_18516890.1| hypothetical protein EC80566_1738 [Escherichia coli 8.0566]
gi|425119806|ref|ZP_18521512.1| hypothetical protein EC80569_1702 [Escherichia coli 8.0569]
gi|425272807|ref|ZP_18664241.1| hypothetical protein ECTW15901_2034 [Escherichia coli TW15901]
gi|425283291|ref|ZP_18674352.1| hypothetical protein ECTW00353_1902 [Escherichia coli TW00353]
gi|432563899|ref|ZP_19800490.1| hypothetical protein A1SA_02539 [Escherichia coli KTE51]
gi|432627292|ref|ZP_19863272.1| hypothetical protein A1UQ_02130 [Escherichia coli KTE77]
gi|432660939|ref|ZP_19896585.1| hypothetical protein A1WY_02352 [Escherichia coli KTE111]
gi|432685493|ref|ZP_19920795.1| hypothetical protein A31A_02343 [Escherichia coli KTE156]
gi|432691642|ref|ZP_19926873.1| hypothetical protein A31G_03860 [Escherichia coli KTE161]
gi|432704459|ref|ZP_19939563.1| hypothetical protein A31Q_02328 [Escherichia coli KTE171]
gi|432737196|ref|ZP_19971962.1| hypothetical protein WGE_02441 [Escherichia coli KTE42]
gi|432955140|ref|ZP_20147080.1| hypothetical protein A155_02357 [Escherichia coli KTE197]
gi|450244246|ref|ZP_21900209.1| hypothetical protein C201_07630 [Escherichia coli S17]
gi|3183285|sp|P77649.1|YDIU_ECOLI RecName: Full=UPF0061 protein YdiU
gi|226725728|sp|B1XG13.1|YDIU_ECODH RecName: Full=UPF0061 protein YdiU
gi|259710234|sp|C4ZYG8.1|YDIU_ECOBW RecName: Full=UPF0061 protein YdiU
gi|1742787|dbj|BAA15475.1| conserved hypothetical protein [Escherichia coli str. K12 substr.
W3110]
gi|1787999|gb|AAC74776.1| conserved protein, UPF0061 family [Escherichia coli str. K-12
substr. MG1655]
gi|169889200|gb|ACB02907.1| conserved protein [Escherichia coli str. K-12 substr. DH10B]
gi|238860321|gb|ACR62319.1| conserved protein [Escherichia coli BW2952]
gi|260449171|gb|ACX39593.1| protein of unknown function UPF0061 [Escherichia coli DH1]
gi|299879045|gb|EFI87256.1| SelO family protein [Escherichia coli MS 196-1]
gi|300449438|gb|EFK13058.1| SelO family protein [Escherichia coli MS 116-1]
gi|301073989|gb|EFK88795.1| SelO family protein [Escherichia coli MS 146-1]
gi|309701931|emb|CBJ01243.1| conserved hypothetical protein [Escherichia coli ETEC H10407]
gi|315136347|dbj|BAJ43506.1| hypothetical protein ECDH1ME8569_1650 [Escherichia coli DH1]
gi|315618903|gb|EFU99486.1| conserved hypothetical protein [Escherichia coli 3431]
gi|323937309|gb|EGB33588.1| ydiU [Escherichia coli E1520]
gi|323940627|gb|EGB36818.1| ydiU [Escherichia coli E482]
gi|331039102|gb|EGI11322.1| putative cytoplasmic protein [Escherichia coli H736]
gi|339415064|gb|AEJ56736.1| hypothetical protein UMNF18_2153 [Escherichia coli UMNF18]
gi|342364702|gb|EGU28801.1| hypothetical protein IAE_00195 [Escherichia coli XH140A]
gi|344195411|gb|EGV49480.1| hypothetical protein IAM_00640 [Escherichia coli XH001]
gi|345363537|gb|EGW95679.1| hypothetical protein ECSTECEH250_2125 [Escherichia coli STEC_EH250]
gi|345378560|gb|EGX10490.1| hypothetical protein ECG581_2058 [Escherichia coli G58-1]
gi|345388106|gb|EGX17917.1| hypothetical protein ECSTECS1191_2528 [Escherichia coli STEC_S1191]
gi|359332185|dbj|BAL38632.1| conserved protein [Escherichia coli str. K-12 substr. MDS42]
gi|377995810|gb|EHV58922.1| hypothetical protein ECDEC6B_2319 [Escherichia coli DEC6B]
gi|377996650|gb|EHV59758.1| hypothetical protein ECDEC6A_1984 [Escherichia coli DEC6A]
gi|377999227|gb|EHV62311.1| hypothetical protein ECDEC6C_1964 [Escherichia coli DEC6C]
gi|378009241|gb|EHV72197.1| hypothetical protein ECDEC6D_2002 [Escherichia coli DEC6D]
gi|378010497|gb|EHV73442.1| hypothetical protein ECDEC6E_2131 [Escherichia coli DEC6E]
gi|384379545|gb|EIE37413.1| SelO family protein [Escherichia coli J53]
gi|385157410|gb|EIF19402.1| hypothetical protein UWO_04941 [Escherichia coli O32:H37 str. P4]
gi|385539683|gb|EIF86515.1| UPF0061 protein ydiU [Escherichia coli M919]
gi|386121954|gb|EIG70567.1| UPF0061 protein ydiU [Escherichia sp. 4_1_40B]
gi|386224344|gb|EII46679.1| hypothetical protein EC23916_2512 [Escherichia coli 2.3916]
gi|386235375|gb|EII67351.1| hypothetical protein EC24168_1910 [Escherichia coli 2.4168]
gi|386240509|gb|EII77433.1| hypothetical protein EC32303_1856 [Escherichia coli 3.2303]
gi|386255010|gb|EIJ04700.1| hypothetical protein ECB41_1895 [Escherichia coli B41]
gi|388399676|gb|EIL60460.1| hypothetical protein EC75_20699 [Escherichia coli 75]
gi|408194475|gb|EKI19953.1| hypothetical protein ECTW15901_2034 [Escherichia coli TW15901]
gi|408203219|gb|EKI28276.1| hypothetical protein ECTW00353_1902 [Escherichia coli TW00353]
gi|408460690|gb|EKJ84468.1| SelO family protein [Escherichia coli AD30]
gi|408569500|gb|EKK45487.1| hypothetical protein EC80566_1738 [Escherichia coli 8.0566]
gi|408570747|gb|EKK46703.1| hypothetical protein EC80569_1702 [Escherichia coli 8.0569]
gi|431094886|gb|ELE00514.1| hypothetical protein A1SA_02539 [Escherichia coli KTE51]
gi|431163985|gb|ELE64386.1| hypothetical protein A1UQ_02130 [Escherichia coli KTE77]
gi|431200055|gb|ELE98781.1| hypothetical protein A1WY_02352 [Escherichia coli KTE111]
gi|431222528|gb|ELF19804.1| hypothetical protein A31A_02343 [Escherichia coli KTE156]
gi|431227117|gb|ELF24254.1| hypothetical protein A31G_03860 [Escherichia coli KTE161]
gi|431243765|gb|ELF38093.1| hypothetical protein A31Q_02328 [Escherichia coli KTE171]
gi|431284296|gb|ELF75154.1| hypothetical protein WGE_02441 [Escherichia coli KTE42]
gi|431467811|gb|ELH47817.1| hypothetical protein A155_02357 [Escherichia coli KTE197]
gi|449321599|gb|EMD11610.1| hypothetical protein C201_07630 [Escherichia coli S17]
Length = 478
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|134094941|ref|YP_001100016.1| hypothetical protein HEAR1735 [Herminiimonas arsenicoxydans]
gi|166234794|sp|A4G5V4.1|Y1735_HERAR RecName: Full=UPF0061 protein HEAR1735
gi|133738844|emb|CAL61891.1| conserved hypothetical protein [Herminiimonas arsenicoxydans]
Length = 500
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 224/520 (43%), Positives = 290/520 (55%), Gaps = 53/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT + P+ + P LV S S A + LD + + F F+G G+ P + Y
Sbjct: 27 AHYTALMPT-PLPAPYLVCASASAAALIGLDFSDIDSAAFIETFTGNRIPDGSRPLSAVY 85
Query: 191 GGHQFGMWAGQLGDGRAITLGEI---LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
GHQFG+WAGQLGDGRAI LG++ + S R ELQLKGAG TPYSR DG AVLRSSI
Sbjct: 86 SGHQFGVWAGQLGDGRAILLGDVPAPTMIPSGRLELQLKGAGLTPYSRMGDGRAVLRSSI 145
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAM LGIPTTRALC+ + + V R+ + E A+ R+AQSF+RFGS+
Sbjct: 146 REFLCSEAMAALGIPTTRALCVTGSDQIVLRE-------QRETAAVATRMAQSFVRFGSF 198
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ E D ++TLADY I + F T + N Y A
Sbjct: 199 EHWFY--NEKHDELKTLADYVIAQFYPQ-----------FKTAE----------NPYKAL 235
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
EV RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGPFGF++AF+ + N TD
Sbjct: 236 LTEVTLRTAQMIAHWQAVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFNATHICNHTDQQ 295
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVMERYGTKFMDEYQAIMTKKLG 486
GR Y +A QP IG WN TL LI D E + Y + +++ +M KLG
Sbjct: 296 GR-YSYARQPQIGEWNCYALGQTLL--PLIGDVDETQNALRIYKPAYAEKFAELMRAKLG 352
Query: 487 LPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L + ++ L + D+T FFR L ++ + + L + + LD
Sbjct: 353 LQTQQPDDGKLFDALFAVLQGSHADFTLFFRRLGELRIGQAASREAL----RDLFLD--- 405
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
+ A+ W L Y L D+ RK M++VNPKYVLRNYL Q AI+ A+ DF EV
Sbjct: 406 --RAAFDDWALQYELRLQLENSDDDARKLAMHAVNPKYVLRNYLAQIAIEKAQNKDFSEV 463
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+LL+++E+P+DEQP EKYA LPP WA V SCSS
Sbjct: 464 AKLLQVLEKPFDEQPENEKYAALPPDWANDLEV---SCSS 500
>gi|168240849|ref|ZP_02665781.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL486]
gi|194449047|ref|YP_002045351.1| hypothetical protein SeHA_C1474 [Salmonella enterica subsp.
enterica serovar Heidelberg str. SL476]
gi|386591197|ref|YP_006087597.1| Selenoprotein O [Salmonella enterica subsp. enterica serovar
Heidelberg str. B182]
gi|419729076|ref|ZP_14256037.1| hypothetical protein SEEH1579_06796 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|419734511|ref|ZP_14261401.1| hypothetical protein SEEH1563_06124 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|419740933|ref|ZP_14267648.1| hypothetical protein SEEH1573_19569 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|419744987|ref|ZP_14271633.1| hypothetical protein SEEH1566_17571 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|419749222|ref|ZP_14275707.1| hypothetical protein SEEH1565_14650 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|421570788|ref|ZP_16016473.1| hypothetical protein CFSAN00322_11383 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00322]
gi|421576011|ref|ZP_16021617.1| hypothetical protein CFSAN00325_14373 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|421580704|ref|ZP_16026258.1| hypothetical protein CFSAN00326_14877 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|421586511|ref|ZP_16031992.1| hypothetical protein CFSAN00328_21014 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
gi|226725736|sp|B4TGI2.1|YDIU_SALHS RecName: Full=UPF0061 protein YdiU
gi|194407351|gb|ACF67570.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL476]
gi|205339415|gb|EDZ26179.1| protein YdiU [Salmonella enterica subsp. enterica serovar
Heidelberg str. SL486]
gi|381293400|gb|EIC34563.1| hypothetical protein SEEH1573_19569 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|381297364|gb|EIC38456.1| hypothetical protein SEEH1563_06124 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|381297779|gb|EIC38865.1| hypothetical protein SEEH1579_06796 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|381307194|gb|EIC48058.1| hypothetical protein SEEH1566_17571 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|381311712|gb|EIC52523.1| hypothetical protein SEEH1565_14650 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|383798241|gb|AFH45323.1| Selenoprotein O [Salmonella enterica subsp. enterica serovar
Heidelberg str. B182]
gi|402519199|gb|EJW26562.1| hypothetical protein CFSAN00326_14877 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00326]
gi|402519964|gb|EJW27319.1| hypothetical protein CFSAN00325_14373 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00325]
gi|402523368|gb|EJW30686.1| hypothetical protein CFSAN00322_11383 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00322]
gi|402527910|gb|EJW35168.1| hypothetical protein CFSAN00328_21014 [Salmonella enterica subsp.
enterica serovar Heidelberg str. CFSAN00328]
Length = 480
Score = 361 bits (926), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 213/521 (40%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGF D +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFFDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|300774718|ref|ZP_07084581.1| protein of hypothetical function UPF0061 [Chryseobacterium gleum
ATCC 35910]
gi|300506533|gb|EFK37668.1| protein of hypothetical function UPF0061 [Chryseobacterium gleum
ATCC 35910]
Length = 515
Score = 361 bits (926), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 204/537 (37%), Positives = 301/537 (56%), Gaps = 35/537 (6%)
Query: 111 FVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
F+ PGD + + R + + P A + P+L+A++E++++ + L ++E D
Sbjct: 10 FIENFPGDFSNNPMQRNTPKVLFATIRP-AGFDKPELIAFNEALSEEIGLG--KYEDKDL 66
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
P YA Y GHQFG WAGQLGDGRAI GEI N K ++ E+Q KGAG
Sbjct: 67 DFLVGNNLP-ENVQSYATAYAGHQFGNWAGQLGDGRAILAGEITNEKGKKTEIQWKGAGA 125
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSR ADG AVLRSS+RE+L SEAM+ LG+PTTRAL L TG+ V RD+ Y+GNP+ E
Sbjct: 126 TPYSRHADGRAVLRSSVREYLMSEAMYHLGVPTTRALSLAFTGEDVMRDIMYNGNPELEK 185
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
GA+V R A+SFLRFG +++ ++ Q + + ++ LAD+ I +++ I + +
Sbjct: 186 GAVVIRTAESFLRFGHFELMSA--QREYNSLQELADFTIENYYPEITSTD---------- 233
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
S KY + + RTA L+ +W VGF HGV+NTDNMS+LGLTIDYGP+
Sbjct: 234 ----------SKKYKDFFERICTRTADLMVEWFRVGFVHGVMNTDNMSVLGLTIDYGPYS 283
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYG 470
+D +D +FTPNTTDLPGRRY F Q I WN+ Q + L + ++K + +G
Sbjct: 284 MMDEYDLNFTPNTTDLPGRRYAFGKQGQIAQWNLWQLANALHPL-IKNEKFLEDTLNNFG 342
Query: 471 TKFMDEYQAIMTKKLG---LPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
T F + + ++ KK G L K +++ + M ++DYT FF L + + +I E
Sbjct: 343 TYFWEAHDRMLCKKFGFDQLKKEDEEFFTNWQGLMQELQLDYTLFFNQLEKINQNTNIIE 402
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
+ + +++ +E+ ++ +Y + + IS E A+M NPK++LRNYL
Sbjct: 403 H--FKDISYININLNEEKIAKLEHFIRNYETRIALNSISKEASLAMMEKSNPKFILRNYL 460
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDE-QPGMEKYARLPPAWAYRPGVCMLSCSS 643
I+ G + +L+K +E PY E P E A+ P + G LSCSS
Sbjct: 461 LYQCIEEISNGKRDMLEKLIKALENPYRELYP--EFSAKRPSDYDDIAGCSTLSCSS 515
>gi|15802118|ref|NP_288140.1| hypothetical protein Z2735 [Escherichia coli O157:H7 str. EDL933]
gi|15831667|ref|NP_310440.1| hypothetical protein ECs2413 [Escherichia coli O157:H7 str. Sakai]
gi|168756706|ref|ZP_02781713.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gi|168762231|ref|ZP_02787238.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gi|168770466|ref|ZP_02795473.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gi|168774995|ref|ZP_02800002.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gi|168782120|ref|ZP_02807127.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gi|168789842|ref|ZP_02814849.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
gi|168800114|ref|ZP_02825121.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gi|195937390|ref|ZP_03082772.1| hypothetical protein EscherichcoliO157_13232 [Escherichia coli
O157:H7 str. EC4024]
gi|208810379|ref|ZP_03252255.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208816870|ref|ZP_03257990.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208818405|ref|ZP_03258725.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209398355|ref|YP_002270776.1| hypothetical protein ECH74115_2424 [Escherichia coli O157:H7 str.
EC4115]
gi|217328902|ref|ZP_03444983.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254793323|ref|YP_003078160.1| hypothetical protein ECSP_2273 [Escherichia coli O157:H7 str.
TW14359]
gi|261227849|ref|ZP_05942130.1| hypothetical protein EscherichiacoliO157_25072 [Escherichia coli
O157:H7 str. FRIK2000]
gi|261258418|ref|ZP_05950951.1| hypothetical protein EscherichiacoliO157EcO_21707 [Escherichia coli
O157:H7 str. FRIK966]
gi|387882810|ref|YP_006313112.1| hypothetical protein CDCO157_2247 [Escherichia coli Xuzhou21]
gi|416312206|ref|ZP_11657407.1| hypothetical protein ECoA_03141 [Escherichia coli O157:H7 str.
1044]
gi|416322921|ref|ZP_11664530.1| hypothetical protein ECoD_04892 [Escherichia coli O157:H7 str.
EC1212]
gi|416327179|ref|ZP_11667186.1| hypothetical protein ECF_02059 [Escherichia coli O157:H7 str. 1125]
gi|419045463|ref|ZP_13592409.1| hypothetical protein ECDEC3A_2295 [Escherichia coli DEC3A]
gi|419051232|ref|ZP_13598113.1| hypothetical protein ECDEC3B_2522 [Escherichia coli DEC3B]
gi|419057230|ref|ZP_13604045.1| hypothetical protein ECDEC3C_2807 [Escherichia coli DEC3C]
gi|419062608|ref|ZP_13609347.1| hypothetical protein ECDEC3D_2394 [Escherichia coli DEC3D]
gi|419069515|ref|ZP_13615151.1| hypothetical protein ECDEC3E_2588 [Escherichia coli DEC3E]
gi|419080745|ref|ZP_13626202.1| hypothetical protein ECDEC4A_2340 [Escherichia coli DEC4A]
gi|419086379|ref|ZP_13631749.1| hypothetical protein ECDEC4B_2298 [Escherichia coli DEC4B]
gi|419092698|ref|ZP_13637991.1| hypothetical protein ECDEC4C_2384 [Escherichia coli DEC4C]
gi|419098446|ref|ZP_13643659.1| hypothetical protein ECDEC4D_2300 [Escherichia coli DEC4D]
gi|419104005|ref|ZP_13649146.1| hypothetical protein ECDEC4E_2314 [Escherichia coli DEC4E]
gi|419109558|ref|ZP_13654625.1| hypothetical protein ECDEC4F_2371 [Escherichia coli DEC4F]
gi|420269543|ref|ZP_14771916.1| hypothetical protein ECPA22_2500 [Escherichia coli PA22]
gi|420275457|ref|ZP_14777758.1| hypothetical protein ECPA40_2698 [Escherichia coli PA40]
gi|420287077|ref|ZP_14789274.1| hypothetical protein ECTW10246_2735 [Escherichia coli TW10246]
gi|420292439|ref|ZP_14794571.1| hypothetical protein ECTW11039_2563 [Escherichia coli TW11039]
gi|420298226|ref|ZP_14800289.1| hypothetical protein ECTW09109_2690 [Escherichia coli TW09109]
gi|420304423|ref|ZP_14806430.1| hypothetical protein ECTW10119_2796 [Escherichia coli TW10119]
gi|420309909|ref|ZP_14811853.1| hypothetical protein ECEC1738_2546 [Escherichia coli EC1738]
gi|420315323|ref|ZP_14817206.1| hypothetical protein ECEC1734_2423 [Escherichia coli EC1734]
gi|421812373|ref|ZP_16248121.1| hypothetical protein EC80416_2155 [Escherichia coli 8.0416]
gi|421818405|ref|ZP_16253918.1| hypothetical protein EC100821_2289 [Escherichia coli 10.0821]
gi|421823976|ref|ZP_16259371.1| hypothetical protein ECFRIK920_2392 [Escherichia coli FRIK920]
gi|421830917|ref|ZP_16266215.1| hypothetical protein ECPA7_3060 [Escherichia coli PA7]
gi|423710859|ref|ZP_17685192.1| hypothetical protein ECPA31_2378 [Escherichia coli PA31]
gi|424077536|ref|ZP_17814591.1| hypothetical protein ECFDA505_2511 [Escherichia coli FDA505]
gi|424083910|ref|ZP_17820472.1| hypothetical protein ECFDA517_2767 [Escherichia coli FDA517]
gi|424090315|ref|ZP_17826345.1| hypothetical protein ECFRIK1996_2536 [Escherichia coli FRIK1996]
gi|424096853|ref|ZP_17832276.1| hypothetical protein ECFRIK1985_2660 [Escherichia coli FRIK1985]
gi|424103193|ref|ZP_17838070.1| hypothetical protein ECFRIK1990_2663 [Escherichia coli FRIK1990]
gi|424109916|ref|ZP_17844236.1| hypothetical protein EC93001_2662 [Escherichia coli 93-001]
gi|424115626|ref|ZP_17849557.1| hypothetical protein ECPA3_2443 [Escherichia coli PA3]
gi|424121992|ref|ZP_17855406.1| hypothetical protein ECPA5_2501 [Escherichia coli PA5]
gi|424128105|ref|ZP_17861083.1| hypothetical protein ECPA9_2608 [Escherichia coli PA9]
gi|424134256|ref|ZP_17866803.1| hypothetical protein ECPA10_2599 [Escherichia coli PA10]
gi|424140945|ref|ZP_17872924.1| hypothetical protein ECPA14_2606 [Escherichia coli PA14]
gi|424147370|ref|ZP_17878833.1| hypothetical protein ECPA15_2731 [Escherichia coli PA15]
gi|424153308|ref|ZP_17884324.1| hypothetical protein ECPA24_2416 [Escherichia coli PA24]
gi|424235485|ref|ZP_17889776.1| hypothetical protein ECPA25_2280 [Escherichia coli PA25]
gi|424313388|ref|ZP_17895681.1| hypothetical protein ECPA28_2622 [Escherichia coli PA28]
gi|424449729|ref|ZP_17901505.1| hypothetical protein ECPA32_2558 [Escherichia coli PA32]
gi|424455899|ref|ZP_17907128.1| hypothetical protein ECPA33_2550 [Escherichia coli PA33]
gi|424462200|ref|ZP_17912779.1| hypothetical protein ECPA39_2540 [Escherichia coli PA39]
gi|424468602|ref|ZP_17918517.1| hypothetical protein ECPA41_2556 [Escherichia coli PA41]
gi|424475185|ref|ZP_17924596.1| hypothetical protein ECPA42_2702 [Escherichia coli PA42]
gi|424480933|ref|ZP_17929975.1| hypothetical protein ECTW07945_2498 [Escherichia coli TW07945]
gi|424487114|ref|ZP_17935742.1| hypothetical protein ECTW09098_2585 [Escherichia coli TW09098]
gi|424493493|ref|ZP_17941417.1| hypothetical protein ECTW09195_2598 [Escherichia coli TW09195]
gi|424500375|ref|ZP_17947376.1| hypothetical protein ECEC4203_2519 [Escherichia coli EC4203]
gi|424506529|ref|ZP_17953043.1| hypothetical protein ECEC4196_2486 [Escherichia coli EC4196]
gi|424514015|ref|ZP_17958799.1| hypothetical protein ECTW14313_2463 [Escherichia coli TW14313]
gi|424520305|ref|ZP_17964500.1| hypothetical protein ECTW14301_2404 [Escherichia coli TW14301]
gi|424526215|ref|ZP_17970000.1| hypothetical protein ECEC4421_2492 [Escherichia coli EC4421]
gi|424532377|ref|ZP_17975783.1| hypothetical protein ECEC4422_2622 [Escherichia coli EC4422]
gi|424538382|ref|ZP_17981400.1| hypothetical protein ECEC4013_2721 [Escherichia coli EC4013]
gi|424544347|ref|ZP_17986873.1| hypothetical protein ECEC4402_2504 [Escherichia coli EC4402]
gi|424550614|ref|ZP_17992562.1| hypothetical protein ECEC4439_2457 [Escherichia coli EC4439]
gi|424556862|ref|ZP_17998340.1| hypothetical protein ECEC4436_2441 [Escherichia coli EC4436]
gi|424563207|ref|ZP_18004266.1| hypothetical protein ECEC4437_2593 [Escherichia coli EC4437]
gi|424569279|ref|ZP_18009931.1| hypothetical protein ECEC4448_2483 [Escherichia coli EC4448]
gi|424575409|ref|ZP_18015583.1| hypothetical protein ECEC1845_2435 [Escherichia coli EC1845]
gi|424581266|ref|ZP_18020988.1| hypothetical protein ECEC1863_2166 [Escherichia coli EC1863]
gi|425098113|ref|ZP_18500908.1| hypothetical protein EC34870_2686 [Escherichia coli 3.4870]
gi|425104291|ref|ZP_18506657.1| hypothetical protein EC52239_2706 [Escherichia coli 5.2239]
gi|425110121|ref|ZP_18512119.1| hypothetical protein EC60172_2709 [Escherichia coli 6.0172]
gi|425125909|ref|ZP_18527174.1| hypothetical protein EC80586_2724 [Escherichia coli 8.0586]
gi|425131755|ref|ZP_18532660.1| hypothetical protein EC82524_2426 [Escherichia coli 8.2524]
gi|425138136|ref|ZP_18538606.1| hypothetical protein EC100833_2630 [Escherichia coli 10.0833]
gi|425150164|ref|ZP_18549846.1| hypothetical protein EC880221_2475 [Escherichia coli 88.0221]
gi|425156008|ref|ZP_18555336.1| hypothetical protein ECPA34_2603 [Escherichia coli PA34]
gi|425162516|ref|ZP_18561456.1| hypothetical protein ECFDA506_2958 [Escherichia coli FDA506]
gi|425168191|ref|ZP_18566738.1| hypothetical protein ECFDA507_2637 [Escherichia coli FDA507]
gi|425174283|ref|ZP_18572455.1| hypothetical protein ECFDA504_2593 [Escherichia coli FDA504]
gi|425180223|ref|ZP_18578005.1| hypothetical protein ECFRIK1999_2698 [Escherichia coli FRIK1999]
gi|425186457|ref|ZP_18583817.1| hypothetical protein ECFRIK1997_2725 [Escherichia coli FRIK1997]
gi|425193328|ref|ZP_18590178.1| hypothetical protein ECNE1487_2961 [Escherichia coli NE1487]
gi|425199718|ref|ZP_18596036.1| hypothetical protein ECNE037_2895 [Escherichia coli NE037]
gi|425206167|ref|ZP_18602048.1| hypothetical protein ECFRIK2001_2963 [Escherichia coli FRIK2001]
gi|425211903|ref|ZP_18607389.1| hypothetical protein ECPA4_2684 [Escherichia coli PA4]
gi|425218031|ref|ZP_18613077.1| hypothetical protein ECPA23_2561 [Escherichia coli PA23]
gi|425224546|ref|ZP_18619110.1| hypothetical protein ECPA49_2667 [Escherichia coli PA49]
gi|425230780|ref|ZP_18624909.1| hypothetical protein ECPA45_2687 [Escherichia coli PA45]
gi|425236931|ref|ZP_18630691.1| hypothetical protein ECTT12B_2572 [Escherichia coli TT12B]
gi|425242994|ref|ZP_18636375.1| hypothetical protein ECMA6_2733 [Escherichia coli MA6]
gi|425254923|ref|ZP_18647517.1| hypothetical protein ECCB7326_2550 [Escherichia coli CB7326]
gi|425294709|ref|ZP_18684996.1| hypothetical protein ECPA38_2459 [Escherichia coli PA38]
gi|425311402|ref|ZP_18700648.1| hypothetical protein ECEC1735_2557 [Escherichia coli EC1735]
gi|425317327|ref|ZP_18706181.1| hypothetical protein ECEC1736_2445 [Escherichia coli EC1736]
gi|425323431|ref|ZP_18711865.1| hypothetical protein ECEC1737_2454 [Escherichia coli EC1737]
gi|425329591|ref|ZP_18717561.1| hypothetical protein ECEC1846_2417 [Escherichia coli EC1846]
gi|425335758|ref|ZP_18723249.1| hypothetical protein ECEC1847_2428 [Escherichia coli EC1847]
gi|425342185|ref|ZP_18729166.1| hypothetical protein ECEC1848_2616 [Escherichia coli EC1848]
gi|425347997|ref|ZP_18734570.1| hypothetical protein ECEC1849_2371 [Escherichia coli EC1849]
gi|425354298|ref|ZP_18740444.1| hypothetical protein ECEC1850_2605 [Escherichia coli EC1850]
gi|425360268|ref|ZP_18746002.1| hypothetical protein ECEC1856_2436 [Escherichia coli EC1856]
gi|425366393|ref|ZP_18751682.1| hypothetical protein ECEC1862_2429 [Escherichia coli EC1862]
gi|425372818|ref|ZP_18757553.1| hypothetical protein ECEC1864_2607 [Escherichia coli EC1864]
gi|425385641|ref|ZP_18769289.1| hypothetical protein ECEC1866_2283 [Escherichia coli EC1866]
gi|425392332|ref|ZP_18775531.1| hypothetical protein ECEC1868_2619 [Escherichia coli EC1868]
gi|425398487|ref|ZP_18781276.1| hypothetical protein ECEC1869_2615 [Escherichia coli EC1869]
gi|425404519|ref|ZP_18786850.1| hypothetical protein ECEC1870_2360 [Escherichia coli EC1870]
gi|425411092|ref|ZP_18792936.1| hypothetical protein ECNE098_2715 [Escherichia coli NE098]
gi|425417399|ref|ZP_18798745.1| hypothetical protein ECFRIK523_2559 [Escherichia coli FRIK523]
gi|425428655|ref|ZP_18809350.1| hypothetical protein EC01304_2667 [Escherichia coli 0.1304]
gi|428947000|ref|ZP_19019389.1| hypothetical protein EC881467_2572 [Escherichia coli 88.1467]
gi|428953250|ref|ZP_19025100.1| hypothetical protein EC881042_2632 [Escherichia coli 88.1042]
gi|428959172|ref|ZP_19030553.1| hypothetical protein EC890511_2553 [Escherichia coli 89.0511]
gi|428965626|ref|ZP_19036483.1| hypothetical protein EC900091_2819 [Escherichia coli 90.0091]
gi|428971343|ref|ZP_19041764.1| hypothetical protein EC900039_2353 [Escherichia coli 90.0039]
gi|428978052|ref|ZP_19047942.1| hypothetical protein EC902281_2607 [Escherichia coli 90.2281]
gi|428983868|ref|ZP_19053325.1| hypothetical protein EC930055_2541 [Escherichia coli 93.0055]
gi|428989996|ref|ZP_19059044.1| hypothetical protein EC930056_2598 [Escherichia coli 93.0056]
gi|428995770|ref|ZP_19064452.1| hypothetical protein EC940618_2419 [Escherichia coli 94.0618]
gi|429001874|ref|ZP_19070118.1| hypothetical protein EC950183_2514 [Escherichia coli 95.0183]
gi|429008138|ref|ZP_19075744.1| hypothetical protein EC951288_2373 [Escherichia coli 95.1288]
gi|429014627|ref|ZP_19081597.1| hypothetical protein EC950943_2670 [Escherichia coli 95.0943]
gi|429020504|ref|ZP_19087080.1| hypothetical protein EC960428_2447 [Escherichia coli 96.0428]
gi|429026540|ref|ZP_19092636.1| hypothetical protein EC960427_2572 [Escherichia coli 96.0427]
gi|429032617|ref|ZP_19098225.1| hypothetical protein EC960939_2486 [Escherichia coli 96.0939]
gi|429038762|ref|ZP_19103953.1| hypothetical protein EC960932_2608 [Escherichia coli 96.0932]
gi|429044660|ref|ZP_19109428.1| hypothetical protein EC960107_2516 [Escherichia coli 96.0107]
gi|429050210|ref|ZP_19114813.1| hypothetical protein EC970003_2330 [Escherichia coli 97.0003]
gi|429055473|ref|ZP_19119876.1| hypothetical protein EC971742_2046 [Escherichia coli 97.1742]
gi|429061123|ref|ZP_19125192.1| hypothetical protein EC970007_1997 [Escherichia coli 97.0007]
gi|429067220|ref|ZP_19130767.1| hypothetical protein EC990672_2511 [Escherichia coli 99.0672]
gi|429073221|ref|ZP_19136513.1| hypothetical protein EC990678_2327 [Escherichia coli 99.0678]
gi|429078548|ref|ZP_19141713.1| hypothetical protein EC990713_2375 [Escherichia coli 99.0713]
gi|429826466|ref|ZP_19357604.1| hypothetical protein EC960109_2680 [Escherichia coli 96.0109]
gi|429832739|ref|ZP_19363222.1| hypothetical protein EC970010_2547 [Escherichia coli 97.0010]
gi|444924911|ref|ZP_21244318.1| hypothetical protein EC09BKT78844_2611 [Escherichia coli
09BKT078844]
gi|444930761|ref|ZP_21249847.1| hypothetical protein EC990814_2171 [Escherichia coli 99.0814]
gi|444936048|ref|ZP_21254890.1| hypothetical protein EC990815_2043 [Escherichia coli 99.0815]
gi|444941688|ref|ZP_21260262.1| hypothetical protein EC990816_2127 [Escherichia coli 99.0816]
gi|444947243|ref|ZP_21265599.1| hypothetical protein EC990839_2131 [Escherichia coli 99.0839]
gi|444952877|ref|ZP_21271019.1| hypothetical protein EC990848_2183 [Escherichia coli 99.0848]
gi|444958378|ref|ZP_21276281.1| hypothetical protein EC991753_2238 [Escherichia coli 99.1753]
gi|444963606|ref|ZP_21281270.1| hypothetical protein EC991775_2129 [Escherichia coli 99.1775]
gi|444969432|ref|ZP_21286839.1| hypothetical protein EC991793_2365 [Escherichia coli 99.1793]
gi|444974775|ref|ZP_21291959.1| hypothetical protein EC991805_2039 [Escherichia coli 99.1805]
gi|444980266|ref|ZP_21297210.1| hypothetical protein ECATCC700728_2108 [Escherichia coli ATCC
700728]
gi|444985586|ref|ZP_21302402.1| hypothetical protein ECPA11_2205 [Escherichia coli PA11]
gi|444990874|ref|ZP_21307557.1| hypothetical protein ECPA19_2154 [Escherichia coli PA19]
gi|444996077|ref|ZP_21312616.1| hypothetical protein ECPA13_1878 [Escherichia coli PA13]
gi|445001703|ref|ZP_21318123.1| hypothetical protein ECPA2_2265 [Escherichia coli PA2]
gi|445007159|ref|ZP_21323444.1| hypothetical protein ECPA47_2092 [Escherichia coli PA47]
gi|445018028|ref|ZP_21334024.1| hypothetical protein ECPA8_2169 [Escherichia coli PA8]
gi|445023673|ref|ZP_21339533.1| hypothetical protein EC71982_2347 [Escherichia coli 7.1982]
gi|445028914|ref|ZP_21344629.1| hypothetical protein EC991781_2331 [Escherichia coli 99.1781]
gi|445034362|ref|ZP_21349925.1| hypothetical protein EC991762_2315 [Escherichia coli 99.1762]
gi|445040067|ref|ZP_21355474.1| hypothetical protein ECPA35_2374 [Escherichia coli PA35]
gi|445045199|ref|ZP_21360491.1| hypothetical protein EC34880_2156 [Escherichia coli 3.4880]
gi|445050821|ref|ZP_21365917.1| hypothetical protein EC950083_2143 [Escherichia coli 95.0083]
gi|445056604|ref|ZP_21371494.1| hypothetical protein EC990670_2418 [Escherichia coli 99.0670]
gi|452971142|ref|ZP_21969369.1| hypothetical protein EC4009_RS21420 [Escherichia coli O157:H7 str.
EC4009]
gi|33517063|sp|Q8X5W3.1|YDIU_ECO57 RecName: Full=UPF0061 protein YdiU
gi|226725726|sp|B5YPZ4.1|YDIU_ECO5E RecName: Full=UPF0061 protein YdiU
gi|12515717|gb|AAG56693.1|AE005394_2 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933]
gi|13361880|dbj|BAB35836.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|187769470|gb|EDU33314.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4196]
gi|189000263|gb|EDU69249.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4076]
gi|189356199|gb|EDU74618.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4401]
gi|189360609|gb|EDU79028.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4486]
gi|189367420|gb|EDU85836.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4501]
gi|189370587|gb|EDU89003.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC869]
gi|189377541|gb|EDU95957.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC508]
gi|208724895|gb|EDZ74602.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208731213|gb|EDZ79902.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208738528|gb|EDZ86210.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209159755|gb|ACI37188.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|209768960|gb|ACI82792.1| hypothetical protein ECs2413 [Escherichia coli]
gi|209768962|gb|ACI82793.1| hypothetical protein ECs2413 [Escherichia coli]
gi|209768966|gb|ACI82795.1| hypothetical protein ECs2413 [Escherichia coli]
gi|217318249|gb|EEC26676.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254592723|gb|ACT72084.1| conserved protein [Escherichia coli O157:H7 str. TW14359]
gi|320188394|gb|EFW63056.1| hypothetical protein ECoD_04892 [Escherichia coli O157:H7 str.
EC1212]
gi|326342073|gb|EGD65854.1| hypothetical protein ECoA_03141 [Escherichia coli O157:H7 str.
1044]
gi|326343626|gb|EGD67388.1| hypothetical protein ECF_02059 [Escherichia coli O157:H7 str. 1125]
gi|377895060|gb|EHU59473.1| hypothetical protein ECDEC3A_2295 [Escherichia coli DEC3A]
gi|377895556|gb|EHU59967.1| hypothetical protein ECDEC3B_2522 [Escherichia coli DEC3B]
gi|377906511|gb|EHU70753.1| hypothetical protein ECDEC3C_2807 [Escherichia coli DEC3C]
gi|377911845|gb|EHU76010.1| hypothetical protein ECDEC3D_2394 [Escherichia coli DEC3D]
gi|377914573|gb|EHU78695.1| hypothetical protein ECDEC3E_2588 [Escherichia coli DEC3E]
gi|377928227|gb|EHU92138.1| hypothetical protein ECDEC4A_2340 [Escherichia coli DEC4A]
gi|377932799|gb|EHU96645.1| hypothetical protein ECDEC4B_2298 [Escherichia coli DEC4B]
gi|377943987|gb|EHV07696.1| hypothetical protein ECDEC4C_2384 [Escherichia coli DEC4C]
gi|377944762|gb|EHV08464.1| hypothetical protein ECDEC4D_2300 [Escherichia coli DEC4D]
gi|377949818|gb|EHV13449.1| hypothetical protein ECDEC4E_2314 [Escherichia coli DEC4E]
gi|377958765|gb|EHV22277.1| hypothetical protein ECDEC4F_2371 [Escherichia coli DEC4F]
gi|386796268|gb|AFJ29302.1| hypothetical protein CDCO157_2247 [Escherichia coli Xuzhou21]
gi|390645490|gb|EIN24667.1| hypothetical protein ECFDA517_2767 [Escherichia coli FDA517]
gi|390645571|gb|EIN24743.1| hypothetical protein ECFRIK1996_2536 [Escherichia coli FRIK1996]
gi|390646202|gb|EIN25328.1| hypothetical protein ECFDA505_2511 [Escherichia coli FDA505]
gi|390663799|gb|EIN41285.1| hypothetical protein EC93001_2662 [Escherichia coli 93-001]
gi|390665276|gb|EIN42587.1| hypothetical protein ECFRIK1985_2660 [Escherichia coli FRIK1985]
gi|390666225|gb|EIN43421.1| hypothetical protein ECFRIK1990_2663 [Escherichia coli FRIK1990]
gi|390681395|gb|EIN57188.1| hypothetical protein ECPA3_2443 [Escherichia coli PA3]
gi|390684861|gb|EIN60465.1| hypothetical protein ECPA5_2501 [Escherichia coli PA5]
gi|390685874|gb|EIN61329.1| hypothetical protein ECPA9_2608 [Escherichia coli PA9]
gi|390702022|gb|EIN76239.1| hypothetical protein ECPA10_2599 [Escherichia coli PA10]
gi|390703233|gb|EIN77272.1| hypothetical protein ECPA15_2731 [Escherichia coli PA15]
gi|390703967|gb|EIN77957.1| hypothetical protein ECPA14_2606 [Escherichia coli PA14]
gi|390715745|gb|EIN88581.1| hypothetical protein ECPA22_2500 [Escherichia coli PA22]
gi|390727056|gb|EIN99476.1| hypothetical protein ECPA25_2280 [Escherichia coli PA25]
gi|390727554|gb|EIN99962.1| hypothetical protein ECPA24_2416 [Escherichia coli PA24]
gi|390729645|gb|EIO01805.1| hypothetical protein ECPA28_2622 [Escherichia coli PA28]
gi|390745412|gb|EIO16219.1| hypothetical protein ECPA32_2558 [Escherichia coli PA32]
gi|390746250|gb|EIO17009.1| hypothetical protein ECPA31_2378 [Escherichia coli PA31]
gi|390747806|gb|EIO18351.1| hypothetical protein ECPA33_2550 [Escherichia coli PA33]
gi|390759238|gb|EIO28636.1| hypothetical protein ECPA40_2698 [Escherichia coli PA40]
gi|390770106|gb|EIO38995.1| hypothetical protein ECPA41_2556 [Escherichia coli PA41]
gi|390771649|gb|EIO40305.1| hypothetical protein ECPA39_2540 [Escherichia coli PA39]
gi|390771980|gb|EIO40627.1| hypothetical protein ECPA42_2702 [Escherichia coli PA42]
gi|390791257|gb|EIO58652.1| hypothetical protein ECTW10246_2735 [Escherichia coli TW10246]
gi|390796767|gb|EIO64033.1| hypothetical protein ECTW07945_2498 [Escherichia coli TW07945]
gi|390798238|gb|EIO65434.1| hypothetical protein ECTW11039_2563 [Escherichia coli TW11039]
gi|390808416|gb|EIO75255.1| hypothetical protein ECTW09109_2690 [Escherichia coli TW09109]
gi|390810034|gb|EIO76810.1| hypothetical protein ECTW09098_2585 [Escherichia coli TW09098]
gi|390817109|gb|EIO83569.1| hypothetical protein ECTW10119_2796 [Escherichia coli TW10119]
gi|390829577|gb|EIO95177.1| hypothetical protein ECEC4203_2519 [Escherichia coli EC4203]
gi|390832782|gb|EIO97992.1| hypothetical protein ECTW09195_2598 [Escherichia coli TW09195]
gi|390834194|gb|EIO99160.1| hypothetical protein ECEC4196_2486 [Escherichia coli EC4196]
gi|390849288|gb|EIP12729.1| hypothetical protein ECTW14301_2404 [Escherichia coli TW14301]
gi|390850974|gb|EIP14310.1| hypothetical protein ECTW14313_2463 [Escherichia coli TW14313]
gi|390852378|gb|EIP15538.1| hypothetical protein ECEC4421_2492 [Escherichia coli EC4421]
gi|390863925|gb|EIP26054.1| hypothetical protein ECEC4422_2622 [Escherichia coli EC4422]
gi|390868258|gb|EIP30016.1| hypothetical protein ECEC4013_2721 [Escherichia coli EC4013]
gi|390873809|gb|EIP34979.1| hypothetical protein ECEC4402_2504 [Escherichia coli EC4402]
gi|390880791|gb|EIP41459.1| hypothetical protein ECEC4439_2457 [Escherichia coli EC4439]
gi|390885351|gb|EIP45591.1| hypothetical protein ECEC4436_2441 [Escherichia coli EC4436]
gi|390896758|gb|EIP56138.1| hypothetical protein ECEC4437_2593 [Escherichia coli EC4437]
gi|390900811|gb|EIP60023.1| hypothetical protein ECEC4448_2483 [Escherichia coli EC4448]
gi|390901356|gb|EIP60540.1| hypothetical protein ECEC1738_2546 [Escherichia coli EC1738]
gi|390909024|gb|EIP67825.1| hypothetical protein ECEC1734_2423 [Escherichia coli EC1734]
gi|390921077|gb|EIP79300.1| hypothetical protein ECEC1863_2166 [Escherichia coli EC1863]
gi|390922349|gb|EIP80448.1| hypothetical protein ECEC1845_2435 [Escherichia coli EC1845]
gi|408066959|gb|EKH01402.1| hypothetical protein ECPA7_3060 [Escherichia coli PA7]
gi|408071364|gb|EKH05716.1| hypothetical protein ECFRIK920_2392 [Escherichia coli FRIK920]
gi|408076625|gb|EKH10847.1| hypothetical protein ECPA34_2603 [Escherichia coli PA34]
gi|408082296|gb|EKH16283.1| hypothetical protein ECFDA506_2958 [Escherichia coli FDA506]
gi|408084701|gb|EKH18464.1| hypothetical protein ECFDA507_2637 [Escherichia coli FDA507]
gi|408093498|gb|EKH26587.1| hypothetical protein ECFDA504_2593 [Escherichia coli FDA504]
gi|408099358|gb|EKH32007.1| hypothetical protein ECFRIK1999_2698 [Escherichia coli FRIK1999]
gi|408107075|gb|EKH39163.1| hypothetical protein ECFRIK1997_2725 [Escherichia coli FRIK1997]
gi|408110968|gb|EKH42747.1| hypothetical protein ECNE1487_2961 [Escherichia coli NE1487]
gi|408117917|gb|EKH49091.1| hypothetical protein ECNE037_2895 [Escherichia coli NE037]
gi|408123827|gb|EKH54556.1| hypothetical protein ECFRIK2001_2963 [Escherichia coli FRIK2001]
gi|408129512|gb|EKH59731.1| hypothetical protein ECPA4_2684 [Escherichia coli PA4]
gi|408140876|gb|EKH70356.1| hypothetical protein ECPA23_2561 [Escherichia coli PA23]
gi|408142892|gb|EKH72236.1| hypothetical protein ECPA49_2667 [Escherichia coli PA49]
gi|408148182|gb|EKH77086.1| hypothetical protein ECPA45_2687 [Escherichia coli PA45]
gi|408156351|gb|EKH84554.1| hypothetical protein ECTT12B_2572 [Escherichia coli TT12B]
gi|408163569|gb|EKH91432.1| hypothetical protein ECMA6_2733 [Escherichia coli MA6]
gi|408177011|gb|EKI03838.1| hypothetical protein ECCB7326_2550 [Escherichia coli CB7326]
gi|408220656|gb|EKI44696.1| hypothetical protein ECPA38_2459 [Escherichia coli PA38]
gi|408230097|gb|EKI53520.1| hypothetical protein ECEC1735_2557 [Escherichia coli EC1735]
gi|408241464|gb|EKI64110.1| hypothetical protein ECEC1736_2445 [Escherichia coli EC1736]
gi|408245433|gb|EKI67821.1| hypothetical protein ECEC1737_2454 [Escherichia coli EC1737]
gi|408249898|gb|EKI71807.1| hypothetical protein ECEC1846_2417 [Escherichia coli EC1846]
gi|408260273|gb|EKI81402.1| hypothetical protein ECEC1847_2428 [Escherichia coli EC1847]
gi|408262396|gb|EKI83345.1| hypothetical protein ECEC1848_2616 [Escherichia coli EC1848]
gi|408267913|gb|EKI88349.1| hypothetical protein ECEC1849_2371 [Escherichia coli EC1849]
gi|408277820|gb|EKI97600.1| hypothetical protein ECEC1850_2605 [Escherichia coli EC1850]
gi|408280119|gb|EKI99699.1| hypothetical protein ECEC1856_2436 [Escherichia coli EC1856]
gi|408291733|gb|EKJ10317.1| hypothetical protein ECEC1862_2429 [Escherichia coli EC1862]
gi|408293734|gb|EKJ12155.1| hypothetical protein ECEC1864_2607 [Escherichia coli EC1864]
gi|408310841|gb|EKJ27882.1| hypothetical protein ECEC1868_2619 [Escherichia coli EC1868]
gi|408311206|gb|EKJ28216.1| hypothetical protein ECEC1866_2283 [Escherichia coli EC1866]
gi|408323447|gb|EKJ39409.1| hypothetical protein ECEC1869_2615 [Escherichia coli EC1869]
gi|408328293|gb|EKJ43903.1| hypothetical protein ECNE098_2715 [Escherichia coli NE098]
gi|408328826|gb|EKJ44365.1| hypothetical protein ECEC1870_2360 [Escherichia coli EC1870]
gi|408339288|gb|EKJ53900.1| hypothetical protein ECFRIK523_2559 [Escherichia coli FRIK523]
gi|408348921|gb|EKJ62999.1| hypothetical protein EC01304_2667 [Escherichia coli 0.1304]
gi|408551952|gb|EKK29184.1| hypothetical protein EC52239_2706 [Escherichia coli 5.2239]
gi|408552830|gb|EKK29993.1| hypothetical protein EC34870_2686 [Escherichia coli 3.4870]
gi|408553374|gb|EKK30495.1| hypothetical protein EC60172_2709 [Escherichia coli 6.0172]
gi|408574558|gb|EKK50327.1| hypothetical protein EC80586_2724 [Escherichia coli 8.0586]
gi|408582786|gb|EKK57995.1| hypothetical protein EC100833_2630 [Escherichia coli 10.0833]
gi|408583426|gb|EKK58594.1| hypothetical protein EC82524_2426 [Escherichia coli 8.2524]
gi|408598525|gb|EKK72480.1| hypothetical protein EC880221_2475 [Escherichia coli 88.0221]
gi|408602459|gb|EKK76174.1| hypothetical protein EC80416_2155 [Escherichia coli 8.0416]
gi|408614052|gb|EKK87336.1| hypothetical protein EC100821_2289 [Escherichia coli 10.0821]
gi|427207838|gb|EKV78000.1| hypothetical protein EC881042_2632 [Escherichia coli 88.1042]
gi|427209578|gb|EKV79608.1| hypothetical protein EC890511_2553 [Escherichia coli 89.0511]
gi|427210925|gb|EKV80771.1| hypothetical protein EC881467_2572 [Escherichia coli 88.1467]
gi|427226515|gb|EKV95104.1| hypothetical protein EC900091_2819 [Escherichia coli 90.0091]
gi|427226837|gb|EKV95421.1| hypothetical protein EC902281_2607 [Escherichia coli 90.2281]
gi|427229788|gb|EKV98090.1| hypothetical protein EC900039_2353 [Escherichia coli 90.0039]
gi|427245111|gb|EKW12413.1| hypothetical protein EC930056_2598 [Escherichia coli 93.0056]
gi|427245838|gb|EKW13113.1| hypothetical protein EC930055_2541 [Escherichia coli 93.0055]
gi|427248085|gb|EKW15130.1| hypothetical protein EC940618_2419 [Escherichia coli 94.0618]
gi|427263818|gb|EKW29569.1| hypothetical protein EC950943_2670 [Escherichia coli 95.0943]
gi|427264669|gb|EKW30340.1| hypothetical protein EC950183_2514 [Escherichia coli 95.0183]
gi|427266547|gb|EKW31980.1| hypothetical protein EC951288_2373 [Escherichia coli 95.1288]
gi|427279127|gb|EKW43578.1| hypothetical protein EC960428_2447 [Escherichia coli 96.0428]
gi|427282894|gb|EKW47135.1| hypothetical protein EC960427_2572 [Escherichia coli 96.0427]
gi|427285452|gb|EKW49436.1| hypothetical protein EC960939_2486 [Escherichia coli 96.0939]
gi|427294501|gb|EKW57680.1| hypothetical protein EC960932_2608 [Escherichia coli 96.0932]
gi|427301634|gb|EKW64489.1| hypothetical protein EC960107_2516 [Escherichia coli 96.0107]
gi|427302115|gb|EKW64951.1| hypothetical protein EC970003_2330 [Escherichia coli 97.0003]
gi|427316274|gb|EKW78234.1| hypothetical protein EC971742_2046 [Escherichia coli 97.1742]
gi|427317977|gb|EKW79861.1| hypothetical protein EC970007_1997 [Escherichia coli 97.0007]
gi|427322633|gb|EKW84262.1| hypothetical protein EC990672_2511 [Escherichia coli 99.0672]
gi|427330405|gb|EKW91676.1| hypothetical protein EC990678_2327 [Escherichia coli 99.0678]
gi|427330825|gb|EKW92086.1| hypothetical protein EC990713_2375 [Escherichia coli 99.0713]
gi|429255409|gb|EKY39738.1| hypothetical protein EC960109_2680 [Escherichia coli 96.0109]
gi|429257274|gb|EKY41365.1| hypothetical protein EC970010_2547 [Escherichia coli 97.0010]
gi|444539855|gb|ELV19562.1| hypothetical protein EC990814_2171 [Escherichia coli 99.0814]
gi|444542994|gb|ELV22319.1| hypothetical protein EC09BKT78844_2611 [Escherichia coli
09BKT078844]
gi|444548952|gb|ELV27286.1| hypothetical protein EC990815_2043 [Escherichia coli 99.0815]
gi|444559914|gb|ELV37107.1| hypothetical protein EC990839_2131 [Escherichia coli 99.0839]
gi|444561649|gb|ELV38752.1| hypothetical protein EC990816_2127 [Escherichia coli 99.0816]
gi|444566361|gb|ELV43196.1| hypothetical protein EC990848_2183 [Escherichia coli 99.0848]
gi|444575772|gb|ELV51999.1| hypothetical protein EC991753_2238 [Escherichia coli 99.1753]
gi|444580004|gb|ELV55967.1| hypothetical protein EC991775_2129 [Escherichia coli 99.1775]
gi|444581572|gb|ELV57410.1| hypothetical protein EC991793_2365 [Escherichia coli 99.1793]
gi|444595780|gb|ELV70876.1| hypothetical protein ECPA11_2205 [Escherichia coli PA11]
gi|444595983|gb|ELV71078.1| hypothetical protein ECATCC700728_2108 [Escherichia coli ATCC
700728]
gi|444598419|gb|ELV73344.1| hypothetical protein EC991805_2039 [Escherichia coli 99.1805]
gi|444609368|gb|ELV83826.1| hypothetical protein ECPA13_1878 [Escherichia coli PA13]
gi|444609758|gb|ELV84213.1| hypothetical protein ECPA19_2154 [Escherichia coli PA19]
gi|444617820|gb|ELV91927.1| hypothetical protein ECPA2_2265 [Escherichia coli PA2]
gi|444626927|gb|ELW00716.1| hypothetical protein ECPA47_2092 [Escherichia coli PA47]
gi|444632246|gb|ELW05822.1| hypothetical protein ECPA8_2169 [Escherichia coli PA8]
gi|444641540|gb|ELW14770.1| hypothetical protein EC71982_2347 [Escherichia coli 7.1982]
gi|444644591|gb|ELW17701.1| hypothetical protein EC991781_2331 [Escherichia coli 99.1781]
gi|444647775|gb|ELW20738.1| hypothetical protein EC991762_2315 [Escherichia coli 99.1762]
gi|444656336|gb|ELW28866.1| hypothetical protein ECPA35_2374 [Escherichia coli PA35]
gi|444662665|gb|ELW34917.1| hypothetical protein EC34880_2156 [Escherichia coli 3.4880]
gi|444668149|gb|ELW40173.1| hypothetical protein EC950083_2143 [Escherichia coli 95.0083]
gi|444671321|gb|ELW43149.1| hypothetical protein EC990670_2418 [Escherichia coli 99.0670]
Length = 478
Score = 361 bits (926), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + D VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPDKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFL+ ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLNDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LW + + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWILQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417728247|ref|ZP_12376966.1| hypothetical protein SFK671_1911 [Shigella flexneri K-671]
gi|332759240|gb|EGJ89549.1| hypothetical protein SFK671_1911 [Shigella flexneri K-671]
Length = 478
Score = 361 bits (926), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 220/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQF +WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFVVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLREEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|207857148|ref|YP_002243799.1| hypothetical protein SEN1699 [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|436793694|ref|ZP_20521838.1| hypothetical protein SEECHS44_01013 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|437332518|ref|ZP_20742209.1| hypothetical protein SEEE7927_20508 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|437343769|ref|ZP_20745937.1| hypothetical protein SEEECHS4_16505 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|445242934|ref|ZP_21407866.1| hypothetical protein SEE436_012381 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
gi|445326393|ref|ZP_21412557.1| hypothetical protein SEE18569_007121 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|226725735|sp|B5QVV6.1|YDIU_SALEP RecName: Full=UPF0061 protein YdiU
gi|206708951|emb|CAR33281.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|434963151|gb|ELL56276.1| hypothetical protein SEECHS44_01013 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|435188496|gb|ELN73209.1| hypothetical protein SEEE7927_20508 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|435191546|gb|ELN76103.1| hypothetical protein SEEECHS4_16505 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|444881574|gb|ELY05612.1| hypothetical protein SEE18569_007121 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|444890784|gb|ELY14086.1| hypothetical protein SEE436_012381 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
Length = 480
Score = 360 bits (925), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLAYGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------PEKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPLVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|200390121|ref|ZP_03216732.1| protein YdiU [Salmonella enterica subsp. enterica serovar Virchow
str. SL491]
gi|199602566|gb|EDZ01112.1| protein YdiU [Salmonella enterica subsp. enterica serovar Virchow
str. SL491]
Length = 480
Score = 360 bits (925), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 213/521 (40%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGF D +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFFDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|419925117|ref|ZP_14442965.1| hypothetical protein EC54115_18757 [Escherichia coli 541-15]
gi|388387356|gb|EIL48974.1| hypothetical protein EC54115_18757 [Escherichia coli 541-15]
Length = 478
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPG ++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGTMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSQFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRA--DDYVIRPPDWGKRLEV---SCSS 478
>gi|222111219|ref|YP_002553483.1| hypothetical protein Dtpsy_2027 [Acidovorax ebreus TPSY]
gi|221730663|gb|ACM33483.1| protein of unknown function UPF0061 [Acidovorax ebreus TPSY]
Length = 495
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 222/505 (43%), Positives = 292/505 (57%), Gaps = 51/505 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T + P+ + P V V L L +R D F+G T L G+ P A Y
Sbjct: 29 AFFTPLRPT-PLPQPHWVGTCAEVGALLGLPEAWQQRDDALQAFTGNTLLPGSQPLASVY 87
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRAI LGE + E+QLKG+G+TPYSR DG AVLRSSIREF
Sbjct: 88 SGHQFGVWAGQLGDGRAILLGETATGQ----EVQLKGSGRTPYSRMGDGRAVLRSSIREF 143
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAMH LGIPTTRALC+ + V R+ + E A+V RVA SF+RFG ++
Sbjct: 144 LCSEAMHALGIPTTRALCVTGSPAPVQRE-------EVETAAVVTRVAPSFIRFGHFEHF 196
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
A+RGQE +R LADY I R+ N +S+ + N YAA
Sbjct: 197 AARGQEA--ELRALADYVID---RYYPNCRRSQ--------------EWEGNAYAALLHA 237
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V+ERTA+L+AQWQ VGF HGV+NTDNMSILGLT+DYGPF FLDAFDP N +D+ G R
Sbjct: 238 VSERTAALLAQWQAVGFCHGVMNTDNMSILGLTMDYGPFQFLDAFDPGHICNHSDVRG-R 296
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y F QP + WN+ + L LI + + A ++ Y F ++ A + KLGL +
Sbjct: 297 YAFDRQPSVAYWNLLCLAQAL--LPLIGEVDTARAALQSYEGSFGRQFLARIRAKLGLQQ 354
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++ LL +A D+VDY F+R LS A E P++ + LD +
Sbjct: 355 AREGDAALVDGLLRLLAADRVDYPIFWRRLSGAVA------TEDFEPVRDLFLD-----R 403
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
A +W+L Y + L G + LM+ NP++VLRN+L + AI AA+LGDF E++ L
Sbjct: 404 AALDAWLLQYKELLALDGWAIA--ADLMHKTNPRFVLRNHLGEQAIRAAKLGDFSELQIL 461
Query: 607 LKLMERPYDEQPGMEKYARLPPAWA 631
+L+ RP+D+ PG E YA PP WA
Sbjct: 462 QRLLARPFDDHPGHEAYAGFPPDWA 486
>gi|121594048|ref|YP_985944.1| hypothetical protein Ajs_1677 [Acidovorax sp. JS42]
gi|120606128|gb|ABM41868.1| protein of unknown function UPF0061 [Acidovorax sp. JS42]
Length = 495
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 221/505 (43%), Positives = 290/505 (57%), Gaps = 51/505 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T + P+ + P V S V L L +R D F+G T L G+ P A Y
Sbjct: 29 AFFTPLRPT-PLPQPHWVGTSAEVGALLGLPEAWQQRDDALQAFTGNTLLPGSQPLASVY 87
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRAI LGE + E+QLKG+G+TPYSR DG AVLRSSIREF
Sbjct: 88 SGHQFGVWAGQLGDGRAILLGETATGQ----EVQLKGSGRTPYSRMGDGRAVLRSSIREF 143
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAMH LGIPTTRALC+ + V R+ + E A+V RVA SF+RFG ++
Sbjct: 144 LCSEAMHALGIPTTRALCVTGSPAPVQRE-------EVETAAVVTRVAPSFIRFGHFEHF 196
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
A+RGQE +R LADY I ++ + E N YAA
Sbjct: 197 AARGQEA--ELRALADYVIDRYYPDCRRSQEWEG-----------------NAYAALLHA 237
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V+ERTA+L+AQWQ VGF HGV+NTDNMSILGLT+DYGPF FLDAFDP N +D+ G R
Sbjct: 238 VSERTAALLAQWQAVGFCHGVMNTDNMSILGLTMDYGPFQFLDAFDPGHICNHSDVRG-R 296
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y F QP + WN+ + L LI + + A ++ Y F ++ A + KLGL +
Sbjct: 297 YAFDRQPSVAYWNLLCLAQAL--LPLIGEVDTARAALQSYEGSFGRQFLARIRAKLGLQQ 354
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++ LL +A D+VDY F+R LS A E P++ + LD +
Sbjct: 355 AREGDAALVDGLLRLLAADRVDYPIFWRRLSGAVA------TEDFEPVRDLFLD-----R 403
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
A +W+L Y + L G + LM+ NP++VLRN+L + AI AA+LGDF E++ L
Sbjct: 404 AALDAWLLQYKELLALDGWALA--ADLMHKTNPRFVLRNHLGEQAIRAAKLGDFSELQTL 461
Query: 607 LKLMERPYDEQPGMEKYARLPPAWA 631
+L+ RP+D+ PG E YA PP WA
Sbjct: 462 QRLLARPFDDHPGHEAYAGFPPDWA 486
>gi|432475883|ref|ZP_19717883.1| hypothetical protein A15Q_02067 [Escherichia coli KTE208]
gi|432517772|ref|ZP_19754964.1| hypothetical protein A17U_00734 [Escherichia coli KTE228]
gi|432774796|ref|ZP_20009078.1| hypothetical protein A1SG_02881 [Escherichia coli KTE54]
gi|432886649|ref|ZP_20100738.1| hypothetical protein A31C_02453 [Escherichia coli KTE158]
gi|432912746|ref|ZP_20118556.1| hypothetical protein A13Q_02166 [Escherichia coli KTE190]
gi|433018665|ref|ZP_20206911.1| hypothetical protein WI7_01711 [Escherichia coli KTE105]
gi|433158737|ref|ZP_20343585.1| hypothetical protein WKU_01812 [Escherichia coli KTE177]
gi|431005824|gb|ELD20831.1| hypothetical protein A15Q_02067 [Escherichia coli KTE208]
gi|431051820|gb|ELD61482.1| hypothetical protein A17U_00734 [Escherichia coli KTE228]
gi|431318511|gb|ELG06206.1| hypothetical protein A1SG_02881 [Escherichia coli KTE54]
gi|431416694|gb|ELG99165.1| hypothetical protein A31C_02453 [Escherichia coli KTE158]
gi|431440175|gb|ELH21504.1| hypothetical protein A13Q_02166 [Escherichia coli KTE190]
gi|431533603|gb|ELI10102.1| hypothetical protein WI7_01711 [Escherichia coli KTE105]
gi|431679425|gb|ELJ45337.1| hypothetical protein WKU_01812 [Escherichia coli KTE177]
Length = 478
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPGTYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|404375066|ref|ZP_10980255.1| UPF0061 protein ydiU [Escherichia sp. 1_1_43]
gi|404291322|gb|EJZ48210.1| UPF0061 protein ydiU [Escherichia sp. 1_1_43]
Length = 478
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTELKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|82543926|ref|YP_407873.1| hypothetical protein SBO_1422 [Shigella boydii Sb227]
gi|417681883|ref|ZP_12331254.1| hypothetical protein SB359474_1591 [Shigella boydii 3594-74]
gi|420325413|ref|ZP_14827178.1| hypothetical protein SFCCH060_1738 [Shigella flexneri CCH060]
gi|421682362|ref|ZP_16122175.1| hypothetical protein SF148580_1714 [Shigella flexneri 1485-80]
gi|121957929|sp|Q321G3.1|YDIU_SHIBS RecName: Full=UPF0061 protein YdiU
gi|81245337|gb|ABB66045.1| conserved hypothetical protein [Shigella boydii Sb227]
gi|332096072|gb|EGJ01077.1| hypothetical protein SB359474_1591 [Shigella boydii 3594-74]
gi|391253258|gb|EIQ12439.1| hypothetical protein SFCCH060_1738 [Shigella flexneri CCH060]
gi|404340668|gb|EJZ67087.1| hypothetical protein SF148580_1714 [Shigella flexneri 1485-80]
Length = 478
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNIELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ L+ SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLIQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|350544465|ref|ZP_08914069.1| Selenoprotein O and cysteine-containing homologs [Candidatus
Burkholderia kirkii UZHbot1]
gi|350527753|emb|CCD37427.1| Selenoprotein O and cysteine-containing homologs [Candidatus
Burkholderia kirkii UZHbot1]
Length = 530
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 226/526 (42%), Positives = 298/526 (56%), Gaps = 65/526 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEF---ERPDFPLFFSGATPL---AGAVPYAQCYG 191
P+A V +P L+ S +A+SL DP E+ +F +F G + A+PYA Y
Sbjct: 50 PAAPVPDPYLIGLSREMAESLGFDPDVAVGQEKNEFAGYFVGNPTRDWPSDALPYAAVYS 109
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGRA+TLGE+ + R E+QLKGAG+TPYSR DG AVLRSSIREFL
Sbjct: 110 GHQFGVWAGQLGDGRALTLGEVEH-DGARLEVQLKGAGRTPYSRMGDGRAVLRSSIREFL 168
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
CSEAMH LGIPTTRAL ++ + V R+ E AIV RVA SF+RFG ++
Sbjct: 169 CSEAMHHLGIPTTRALTVIGSDLPVRRETI-------ETAAIVTRVAPSFVRFGHFEHFY 221
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
S + +D ++ LAD+ I + H + + Y A E
Sbjct: 222 S--NDRVDDLKKLADHVIDRFYPHCRD---------------------AEDPYLALLDEA 258
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
TA L+AQWQGVGF HGV+NTDNMSI+GLTIDYGPFGF+DAF+ N +D G RY
Sbjct: 259 VRSTADLMAQWQGVGFCHGVMNTDNMSIIGLTIDYGPFGFIDAFNAHHICNHSDTQG-RY 317
Query: 432 CFANQPDIGLWN---IAQFSTTLAAAKLIDD-------KEANYVMERYGTKFMDEYQAIM 481
++ QP + WN +AQ L +L ++ +EA ++E Y +F A M
Sbjct: 318 SYSRQPQVAYWNLFCLAQALVPLFGQELPEEGRGERVVQEAQKLLEHYRERFAPALVAKM 377
Query: 482 TKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDELLVPLKAV 537
KLGL + + ++ + L M ++ D+T FR LS + K+D S +D P++ +
Sbjct: 378 RAKLGLEVEREGDDKLANGLFEIMHANRTDFTLTFRNLSKLSKSDAS--QD---APVRDL 432
Query: 538 LLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
LD + A+ +W Y + L D R A MN VNPKYVLRN+L ++AI A
Sbjct: 433 FLD-----RAAFDAWTAQYRERLTHEPRDDAARAAAMNRVNPKYVLRNHLAENAIRRASE 487
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF EV RLL ++ RPYDEQP E YA LPP WA +SCSS
Sbjct: 488 KDFAEVARLLDVLRRPYDEQPAYEAYAGLPPDWA---SALEVSCSS 530
>gi|432718821|ref|ZP_19953790.1| hypothetical protein WCK_02434 [Escherichia coli KTE9]
gi|431262633|gb|ELF54622.1| hypothetical protein WCK_02434 [Escherichia coli KTE9]
Length = 478
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNAELANTLGISSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|242239069|ref|YP_002987250.1| hypothetical protein Dd703_1631 [Dickeya dadantii Ech703]
gi|242131126|gb|ACS85428.1| protein of unknown function UPF0061 [Dickeya dadantii Ech703]
Length = 483
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 213/542 (39%), Positives = 295/542 (54%), Gaps = 66/542 (12%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ + R+LPG YT++ P+ ++ +L+ S +A L LD
Sbjct: 5 LQFDNHYHRQLPG--------------FYTELQPTP-LQGARLLYHSAPLARDLSLDQHW 49
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
FE D +SG L G P AQ Y GHQFG+WAGQLGDGR I LG+ ++
Sbjct: 50 FE-GDNQRIWSGEISLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGQQRREDGYTYDWH 108
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR DG AVLRS +REFL SEA+H LGIPTTRAL +VT+ V R+
Sbjct: 109 LKGAGLTPYSRMGDGRAVLRSVVREFLASEALHHLGIPTTRALTIVTSDHPVQRE----- 163
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+EE GA++ RVA+S +RFG ++ R + + VR LADY I HH+ H++
Sbjct: 164 --QEERGAMLLRVAESHVRFGHFEHFYYR--REPERVRQLADYVIAHHWPHLQT------ 213
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+KYA W EV RTA L+AQWQ VGF HGV+NTDNMSILG+T+
Sbjct: 214 ---------------DVDKYAVWFGEVVVRTAQLIAQWQAVGFAHGVMNTDNMSILGMTL 258
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGPFGF+D + P + N +D G RY F NQP + LWN+ + + +L ++LI +
Sbjct: 259 DYGPFGFMDDYQPGYVCNHSDHQG-RYAFDNQPAVALWNLQRLAQSL--SELIPVAQLQQ 315
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ Y M + +M KLG + Q ++ +LL M + DY++ FR LS +
Sbjct: 316 GLAGYEPALMQRFGELMRAKLGFMTADSQDNALLVELLQLMHKESADYSSVFRLLSETE- 374
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+ L PL+ V +D + A+ W +Y + L + G D+ R+ +M NP++
Sbjct: 375 -----QQSALTPLQDVFID-----RPAFDVWFSAYRRRLAADGCDDDRRQRVMRQANPRF 424
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
LRNYL Q I+ AE D ++RL + + PYDEQP A P W ++SC
Sbjct: 425 TLRNYLAQQVIEHAERDDVAPLQRLHQALMHPYDEQPDASDLAVPSPDWGKH---LVISC 481
Query: 642 SS 643
SS
Sbjct: 482 SS 483
>gi|432543160|ref|ZP_19780011.1| hypothetical protein A197_01743 [Escherichia coli KTE236]
gi|432548642|ref|ZP_19785423.1| hypothetical protein A199_02110 [Escherichia coli KTE237]
gi|432621907|ref|ZP_19857941.1| hypothetical protein A1UO_01778 [Escherichia coli KTE76]
gi|432815401|ref|ZP_20049186.1| hypothetical protein A1Y1_01802 [Escherichia coli KTE115]
gi|431075915|gb|ELD83435.1| hypothetical protein A197_01743 [Escherichia coli KTE236]
gi|431081871|gb|ELD88198.1| hypothetical protein A199_02110 [Escherichia coli KTE237]
gi|431159606|gb|ELE60150.1| hypothetical protein A1UO_01778 [Escherichia coli KTE76]
gi|431364457|gb|ELG50988.1| hypothetical protein A1Y1_01802 [Escherichia coli KTE115]
Length = 478
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVATSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFAHYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|422332972|ref|ZP_16413984.1| UPF0061 protein ydiU [Escherichia coli 4_1_47FAA]
gi|432770670|ref|ZP_20005014.1| hypothetical protein A1S9_03468 [Escherichia coli KTE50]
gi|432961724|ref|ZP_20151514.1| hypothetical protein A15E_02432 [Escherichia coli KTE202]
gi|433063098|ref|ZP_20250031.1| hypothetical protein WIO_01918 [Escherichia coli KTE125]
gi|373246101|gb|EHP65562.1| UPF0061 protein ydiU [Escherichia coli 4_1_47FAA]
gi|431315870|gb|ELG03769.1| hypothetical protein A1S9_03468 [Escherichia coli KTE50]
gi|431474680|gb|ELH54486.1| hypothetical protein A15E_02432 [Escherichia coli KTE202]
gi|431582932|gb|ELI54942.1| hypothetical protein WIO_01918 [Escherichia coli KTE125]
Length = 478
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSESPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFIDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + +A ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLLARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|416897621|ref|ZP_11927269.1| hypothetical protein ECSTEC7V_2068 [Escherichia coli STEC_7v]
gi|417114985|ref|ZP_11966121.1| hypothetical protein EC12741_2140 [Escherichia coli 1.2741]
gi|422798994|ref|ZP_16847493.1| hypothetical protein ERJG_00157 [Escherichia coli M863]
gi|323968476|gb|EGB63882.1| hypothetical protein ERJG_00157 [Escherichia coli M863]
gi|327252823|gb|EGE64477.1| hypothetical protein ECSTEC7V_2068 [Escherichia coli STEC_7v]
gi|386140404|gb|EIG81556.1| hypothetical protein EC12741_2140 [Escherichia coli 1.2741]
Length = 478
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LA++AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLAEFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N +E Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFITVD--ALNEALESYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPSLVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R +SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKR---LQVSCSS 478
>gi|331683213|ref|ZP_08383814.1| putative cytoplasmic protein [Escherichia coli H299]
gi|450189100|ref|ZP_21890421.1| hypothetical protein A364_08916 [Escherichia coli SEPT362]
gi|331079428|gb|EGI50625.1| putative cytoplasmic protein [Escherichia coli H299]
gi|449322134|gb|EMD12135.1| hypothetical protein A364_08916 [Escherichia coli SEPT362]
Length = 478
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|293410022|ref|ZP_06653598.1| hypothetical protein ECEG_00973 [Escherichia coli B354]
gi|291470490|gb|EFF12974.1| hypothetical protein ECEG_00973 [Escherichia coli B354]
Length = 478
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVATSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGCICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM S+NP VLRN+L Q AI AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSINPALVLRNWLAQRAIGAAEKGDMKE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|261339527|ref|ZP_05967385.1| SelO family protein [Enterobacter cancerogenus ATCC 35316]
gi|288318340|gb|EFC57278.1| SelO family protein [Enterobacter cancerogenus ATCC 35316]
Length = 480
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 212/518 (40%), Positives = 295/518 (56%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT ++P+ ++N +L+ +E++ADSL + P F+ + + G T L G P AQ
Sbjct: 13 LPGFYTALNPTP-LDNARLIWHNETLADSLAIPPALFQPSEGAGVWGGETLLPGMRPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE E + LKGAG TPYSR DG AVLRS+IR
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQQLPNGETVDWHLKGAGLTPYSRMGDGRAVLRSTIR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPT+RAL +VT+ V+R+ E GA++ RVAQS LRFG ++
Sbjct: 132 ESLASEAMHALGIPTSRALSIVTSDTPVSRETI-------EQGAMLIRVAQSHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + + VR LAD+A+RHH+ H+++ ++KY W
Sbjct: 185 HFYYR--REPEKVRQLADFALRHHWPHLQD---------------------EADKYLLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ RTAS++A+WQ VGF HGV+NTDNMS+LGLT DYGPFGFLD + P + N +D G
Sbjct: 222 RDIVARTASMIARWQTVGFAHGVMNTDNMSLLGLTFDYGPFGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID + N ++ Y + EY ++M KLGL
Sbjct: 282 -RYSFDNQPAVGLWNLQRLAQSL--SPFIDVEGLNDALDSYQEVLLREYGSLMRSKLGLL 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + +++ L + MA + DYT FR L + + PL+ +D
Sbjct: 339 TQDKGDNALLNTLFSLMAREGSDYTRTFRMLGQTEQQSAAS------PLRDEFID----- 387
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ W +Y L I D R+ MN+VNP VLRN+L Q AI+ AE G + E+ R
Sbjct: 388 RQAFDDWFTAYRTRLQREQIDDVTRQEKMNAVNPAMVLRNWLAQRAIEQAEQGQYDELHR 447
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 448 LHAALRTPFADRE--DDYVSRPPDWGKRLEV---SCSS 480
>gi|331647198|ref|ZP_08348292.1| putative cytoplasmic protein [Escherichia coli M605]
gi|417662295|ref|ZP_12311876.1| hypothetical protein ECAA86_01870 [Escherichia coli AA86]
gi|330911513|gb|EGH40023.1| hypothetical protein ECAA86_01870 [Escherichia coli AA86]
gi|331043981|gb|EGI16117.1| putative cytoplasmic protein [Escherichia coli M605]
Length = 478
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432881943|ref|ZP_20098023.1| hypothetical protein A317_04309 [Escherichia coli KTE154]
gi|431411449|gb|ELG94560.1| hypothetical protein A317_04309 [Escherichia coli KTE154]
Length = 478
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTT AL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTHALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFITEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DNYVSRPPDWGKRLEV---SCSS 478
>gi|197250990|ref|YP_002146692.1| hypothetical protein SeAg_B1828 [Salmonella enterica subsp.
enterica serovar Agona str. SL483]
gi|440765231|ref|ZP_20944251.1| hypothetical protein F434_19746 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|440767689|ref|ZP_20946665.1| hypothetical protein F514_08567 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
gi|440774138|ref|ZP_20953026.1| hypothetical protein F515_17103 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|226725733|sp|B5F7F0.1|YDIU_SALA4 RecName: Full=UPF0061 protein YdiU
gi|197214693|gb|ACH52090.1| protein YdiU [Salmonella enterica subsp. enterica serovar Agona
str. SL483]
gi|436413656|gb|ELP11589.1| hypothetical protein F515_17103 [Salmonella enterica subsp.
enterica serovar Agona str. SH10GFN094]
gi|436414355|gb|ELP12285.1| hypothetical protein F434_19746 [Salmonella enterica subsp.
enterica serovar Agona str. SH11G1113]
gi|436419598|gb|ELP17473.1| hypothetical protein F514_08567 [Salmonella enterica subsp.
enterica serovar Agona str. SH08SF124]
Length = 480
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 213/521 (40%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AI H++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIHHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|300311562|ref|YP_003775654.1| hypothetical protein Hsero_2247 [Herbaspirillum seropedicae SmR1]
gi|300074347|gb|ADJ63746.1| conserved hypothetical protein [Herbaspirillum seropedicae SmR1]
Length = 495
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 221/522 (42%), Positives = 298/522 (57%), Gaps = 51/522 (9%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPY 186
E+ A +T++ P+ + P LV +SE A S+ L + + DF F+G G+ P
Sbjct: 20 ELPPAFHTRLQPTP-LPAPYLVGFSEDAAASIALPRPQADDGDFLDIFAGNRIAPGSTPL 78
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLK-SERWELQLKGAGKTPYSRFADGLAVLRS 245
+ Y GHQFG+WAGQLGDGRAITLG++ + R ELQLKGAG TPYSR DG AVLRS
Sbjct: 79 SAVYSGHQFGVWAGQLGDGRAITLGDLPAADGAGRIELQLKGAGPTPYSRMGDGRAVLRS 138
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIREFLCSEAM LGIPTTRAL ++ + + V R+ E A+V R+A SF+RFG
Sbjct: 139 SIREFLCSEAMAALGIPTTRALTVIGSDQRVLRE-------TAETAAVVTRMAPSFIRFG 191
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
S++ H Q D ++ LAD + + + N YA
Sbjct: 192 SFE-HWYYNQR-FDDLKLLADTVLEQFYPELLQ---------------------AGNPYA 228
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
A EV RTA+L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N TD
Sbjct: 229 ALLKEVTRRTATLMAQWQAVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDARHICNHTD 288
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVMERYGTKFMDEYQAIMTKK 484
G RY + QP IG WN F+ A LI +E + Y F +++A++ K
Sbjct: 289 SQG-RYSYQMQPRIGQWNC--FALGQAMLPLIGTVEETEAALADYEAIFQAQHEALLRAK 345
Query: 485 LGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL ++Q+I + + + VD+T FFR L +++ + ++ L+ ++LD
Sbjct: 346 LGLRTRQPEDEQLIEAMFAILQANHVDFTLFFRRLGDLQIGNAAHDEG----LRDLILD- 400
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+ A+ +W Y L + D+ R+ M++VNPKYVLRNYL Q AI+ A+ DF
Sbjct: 401 ----RPAFDAWATQYRARLRAEDSDDQARRLAMHAVNPKYVLRNYLAQVAIERAQQKDFS 456
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
EV RL ++ P+DEQP +KYA LPP WA V SCSS
Sbjct: 457 EVARLQSILRHPFDEQPEHDKYADLPPDWASHLEV---SCSS 495
>gi|149278787|ref|ZP_01884922.1| hypothetical protein PBAL39_06411 [Pedobacter sp. BAL39]
gi|149230406|gb|EDM35790.1| hypothetical protein PBAL39_06411 [Pedobacter sp. BAL39]
Length = 516
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 221/546 (40%), Positives = 299/546 (54%), Gaps = 51/546 (9%)
Query: 109 HSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL-DPKEFER 167
+ F GD ++ R+ Y V P+ V P L+ W+ +A+ L + DP +
Sbjct: 11 NEFTAHFDGDHSDNAARRQTPGMFYCTVQPTP-VSQPSLITWNTPLAEELGISDPDD--- 66
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
D + G +PYA CY GHQFG WAGQLGDGRAITLGE WELQLKG
Sbjct: 67 QDLQVL-GGNVTTPSMLPYAACYAGHQFGNWAGQLGDGRAITLGEWPMSSGSSWELQLKG 125
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR ADG AVLRSS+RE+L SEAM +LG+PTTRAL LV TG V RD FYDG
Sbjct: 126 AGPTPYSRRADGRAVLRSSVREYLMSEAMFYLGVPTTRALSLVATGDAVMRDPFYDGRTA 185
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
EPGA+V R A SFLRFG++++ A+R ++ + +R LAD+ I ++ +
Sbjct: 186 YEPGAVVMRAAPSFLRFGNFEMLAAR--KEYEQLRQLADWTISRYYPEV----------- 232
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
+TG Y W V ++T +++ +W VGF HGV+NTDNMSILGLTIDYG
Sbjct: 233 TTG-------------YLDWFRAVVDKTTTMIVEWLRVGFVHGVMNTDNMSILGLTIDYG 279
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY-VM 466
PF FLDA+D F+PNTTD PGRRY F Q I WN+ + A A L +D +
Sbjct: 280 PFSFLDAYDRDFSPNTTDHPGRRYAFGKQHHIAYWNLGCLAN--AVAPLFNDTAPLVEAL 337
Query: 467 ERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAV---DKVDYTNFFRAL-----SN 518
E +G F + + A+ K+GL + I + AV + D T F++ L SN
Sbjct: 338 EGFGDLFYERFYAMKAGKMGLDLVGAEEIELVEQFEAVLFALQPDMTIFYQLLITLPESN 397
Query: 519 VKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVN 578
+ A+ + + +A D+G+ K+ + SY + IS EE A M + N
Sbjct: 398 LNAESTTAHFK-----EAFYHDLGESEKQQLQECIRSYQDRKNKNTISPEESIANMKANN 452
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA-YRPGVC 637
P+++LRNY+ AI E GD R+L ++ PY + +++ R P WA +PG
Sbjct: 453 PRFILRNYMLYEAIQDLEKGDNTRFRKLEHALQTPYADT--HDEFFRRRPQWADEQPGSA 510
Query: 638 MLSCSS 643
LSCSS
Sbjct: 511 TLSCSS 516
>gi|432489315|ref|ZP_19731196.1| hypothetical protein A171_01234 [Escherichia coli KTE213]
gi|432839330|ref|ZP_20072817.1| hypothetical protein A1YQ_02288 [Escherichia coli KTE140]
gi|433203283|ref|ZP_20387064.1| hypothetical protein WGY_01864 [Escherichia coli KTE95]
gi|431021351|gb|ELD34674.1| hypothetical protein A171_01234 [Escherichia coli KTE213]
gi|431389482|gb|ELG73193.1| hypothetical protein A1YQ_02288 [Escherichia coli KTE140]
gi|431722351|gb|ELJ86317.1| hypothetical protein WGY_01864 [Escherichia coli KTE95]
Length = 478
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVATSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432792912|ref|ZP_20026997.1| hypothetical protein A1US_02125 [Escherichia coli KTE78]
gi|432798870|ref|ZP_20032893.1| hypothetical protein A1UU_03609 [Escherichia coli KTE79]
gi|431339656|gb|ELG26710.1| hypothetical protein A1US_02125 [Escherichia coli KTE78]
gi|431343737|gb|ELG30693.1| hypothetical protein A1UU_03609 [Escherichia coli KTE79]
Length = 478
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNAELANTLGISSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVALSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|161503546|ref|YP_001570658.1| hypothetical protein SARI_01624 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|189041161|sp|A9MEQ9.1|YDIU_SALAR RecName: Full=UPF0061 protein YdiU
gi|160864893|gb|ABX21516.1| hypothetical protein SARI_01624 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 480
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 215/521 (41%), Positives = 292/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVQRE-------TQEAGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ ++ KY
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQD---------------------APEKYD 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A WQ +GF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIADWQTIGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL ID N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTLTPFIEID--ALNRALDRYQDALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L + + D R+ M SVNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDGWFDRYRARLRTEAVDDALRQQQMQSVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEILRQPFIDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|293415025|ref|ZP_06657668.1| ydiU protein [Escherichia coli B185]
gi|291432673|gb|EFF05652.1| ydiU protein [Escherichia coli B185]
Length = 478
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRLEP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFL+ ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLNDYEPGFICNYSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|299471650|emb|CBN76872.1| selenoprotein O homolog [Ectocarpus siliculosus]
Length = 672
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 250/638 (39%), Positives = 335/638 (52%), Gaps = 98/638 (15%)
Query: 71 SVTHDLKNQRLDT-----ETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIP 125
SV+H +N R+ T T ++ T L+ L +D+ +RELP DP TD+
Sbjct: 68 SVSHSNRNDRVVTARPASRTAMSTAVDAAATCSSSTLDTLPFDNRVIRELPVDPITDNYV 127
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R V +AC++ V+P V+ P +VA S S L L +E +R D +FSG + GA P
Sbjct: 128 RRVENACFSIVAPDPVVK-PVMVAASNSALGLLGLAAEEGQREDAAEYFSGNKLMPGAQP 186
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
+A Y GHQFG +AGQLGDG A+ LGE+ S RWE+Q KGAG TPYSR ADG VLRS
Sbjct: 187 HAHAYCGHQFGSFAGQLGDGAAMYLGEVEG-PSGRWEIQFKGAGLTPYSRSADGRKVLRS 245
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIREFLCSEAMHFLGIPTTRA LVT+ V RD+FY GN +E +IV R+A +FLRFG
Sbjct: 246 SIREFLCSEAMHFLGIPTTRAAALVTSDTKVRRDVFYTGNVIQERASIVTRLAPTFLRFG 305
Query: 306 SYQIHASR-----------GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
S++I R G + L + + +YAI F + + G E
Sbjct: 306 SFEIFKPRDPRTGRDGPSAGNDALRL--QMLEYAIGRFFPG----------AAAAGPEG- 352
Query: 355 SVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDA 414
+ +Y A E TA LVA+WQ VGFTHGVLNTDNMSILGLTIDYGP+GF+D
Sbjct: 353 -----SKARYLAMYEEAVRSTAELVAKWQCVGFTHGVLNTDNMSILGLTIDYGPYGFMDF 407
Query: 415 FDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFM 474
FDP F PN +D G RY + QP++ WN+ +F+ +A A + D A +E+Y F
Sbjct: 408 FDPKFVPNGSD-GGGRYSYERQPEMCKWNLHKFAEAVAPALPLSDSTA--ALEKYDGLFK 464
Query: 475 DEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSN------------- 518
Y+ M +KLGL + + + L MA D+T FR L+
Sbjct: 465 GYYEEGMRRKLGLFSVEEDDDGLFESLFATMADTSADFTGTFRELAQLVPGGDVDAVSKA 524
Query: 519 ---------VKAD----------PSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQE 559
+KA PSIP +L L + +E EA ++ S ++
Sbjct: 525 LAAQCAGPKIKAKALRRAVDIGRPSIPPQQL-----QGLWAMAQENPEA-LAQRFSAPKD 578
Query: 560 LLSSGISDEERKALMNSVNPKYVLRNYLC-----QSAIDAAELGDFGEVRRLLKLMERPY 614
+ + + +E +K L N + L++ AI+ AE GDF V+R+L+L+E PY
Sbjct: 579 AVIAELREEMQK-LSNYDAAQQRLKDMEALEEDGXEAIEDAEKGDFSGVQRVLRLLESPY 637
Query: 615 D---------EQPGMEKYARLPPAWAYRPGVCMLSCSS 643
D PG + Y R P WA VC +CSS
Sbjct: 638 DPPADDGEGSSSPGGKDYLRATPDWAADL-VC--TCSS 672
>gi|422973805|ref|ZP_16975973.1| UPF0061 protein ydiU [Escherichia coli TA124]
gi|371596226|gb|EHN85065.1| UPF0061 protein ydiU [Escherichia coli TA124]
Length = 478
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVATSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGCICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM S+NP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSINPALVLRNWLAQRAIEAAEKGDMKE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417308166|ref|ZP_12095020.1| hypothetical protein PPECC33_15920 [Escherichia coli PCN033]
gi|338770242|gb|EGP25008.1| hypothetical protein PPECC33_15920 [Escherichia coli PCN033]
Length = 478
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVALSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFIDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTCTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM S+NP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSINPALVLRNWLAQRAIEAAEKGDMKE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432392114|ref|ZP_19634954.1| hypothetical protein WE9_02427 [Escherichia coli KTE21]
gi|430919931|gb|ELC40851.1| hypothetical protein WE9_02427 [Escherichia coli KTE21]
Length = 478
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVALSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+ AE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEVAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|386619276|ref|YP_006138856.1| hypothetical protein ECNA114_1754 [Escherichia coli NA114]
gi|387829620|ref|YP_003349557.1| hypothetical protein ECSF_1567 [Escherichia coli SE15]
gi|432421971|ref|ZP_19664519.1| hypothetical protein A137_02388 [Escherichia coli KTE178]
gi|432500066|ref|ZP_19741826.1| hypothetical protein A177_02156 [Escherichia coli KTE216]
gi|432558793|ref|ZP_19795471.1| hypothetical protein A1S7_02439 [Escherichia coli KTE49]
gi|432694457|ref|ZP_19929664.1| hypothetical protein A31I_01929 [Escherichia coli KTE162]
gi|432710619|ref|ZP_19945681.1| hypothetical protein WCG_03948 [Escherichia coli KTE6]
gi|432919131|ref|ZP_20123262.1| hypothetical protein A133_02174 [Escherichia coli KTE173]
gi|432926938|ref|ZP_20128478.1| hypothetical protein A135_02523 [Escherichia coli KTE175]
gi|432981117|ref|ZP_20169893.1| hypothetical protein A15W_02241 [Escherichia coli KTE211]
gi|433096532|ref|ZP_20282729.1| hypothetical protein WK3_01734 [Escherichia coli KTE139]
gi|433105896|ref|ZP_20291887.1| hypothetical protein WK7_01763 [Escherichia coli KTE148]
gi|281178777|dbj|BAI55107.1| conserved hypothetical protein [Escherichia coli SE15]
gi|333969777|gb|AEG36582.1| Hypothetical protein ECNA114_1754 [Escherichia coli NA114]
gi|430944730|gb|ELC64819.1| hypothetical protein A137_02388 [Escherichia coli KTE178]
gi|431028936|gb|ELD41968.1| hypothetical protein A177_02156 [Escherichia coli KTE216]
gi|431091844|gb|ELD97552.1| hypothetical protein A1S7_02439 [Escherichia coli KTE49]
gi|431234656|gb|ELF30050.1| hypothetical protein A31I_01929 [Escherichia coli KTE162]
gi|431249411|gb|ELF43566.1| hypothetical protein WCG_03948 [Escherichia coli KTE6]
gi|431444445|gb|ELH25467.1| hypothetical protein A133_02174 [Escherichia coli KTE173]
gi|431445165|gb|ELH26092.1| hypothetical protein A135_02523 [Escherichia coli KTE175]
gi|431491872|gb|ELH71475.1| hypothetical protein A15W_02241 [Escherichia coli KTE211]
gi|431616793|gb|ELI85816.1| hypothetical protein WK3_01734 [Escherichia coli KTE139]
gi|431629120|gb|ELI97486.1| hypothetical protein WK7_01763 [Escherichia coli KTE148]
Length = 478
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWPHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVDG--LNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTELKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDNERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|213626329|gb|AAI71618.1| Si:dkey-14d8.2 protein [Danio rerio]
Length = 674
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 242/643 (37%), Positives = 330/643 (51%), Gaps = 129/643 (20%)
Query: 94 KMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSES 153
+M + L LE L +++ ++ LP D + R V AC++ V P A ++ P +VA S
Sbjct: 15 RMDQSLTPLERLKFNNVALKALPVDSSLEPGSRTVKAACFSLVKPQALIK-PTIVALSGP 73
Query: 154 VADSLELDPKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGE 212
L L ++ + P + SG+ + G+ P A CY GHQFG +AGQLGDG LGE
Sbjct: 74 ALALLGLKVEDVLQDPHAAEYLSGSRLIQGSEPAAHCYCGHQFGQFAGQLGDGAVCYLGE 133
Query: 213 I-LNLKSE------------RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFL 259
+ + + +E RWE+Q+KGAG TPYSR +DG VLRSSIREFLCSEAM L
Sbjct: 134 VEVEVGAEQTTDPNRTSPCGRWEIQVKGAGLTPYSRLSDGRKVLRSSIREFLCSEAMFAL 193
Query: 260 GIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH--------- 310
GIPTTRA LVT+ +V RD FY GNPK E ++V R+A +F+RFGS++I
Sbjct: 194 GIPTTRAGSLVTSDLYVQRDEFYSGNPKPERCSVVLRIAPTFIRFGSFEIFHPLDDFTGR 253
Query: 311 --ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
S G+ DI L DY I + I+ G D + AA+
Sbjct: 254 QGPSVGRP--DIRAGLLDYVIETFYPEIQR-----------GHLDR------KERNAAFF 294
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
EV RTA LVA WQ VGF HGVLNTDNMSILGLTIDYGPFGF+D FDP F N +D G
Sbjct: 295 REVTVRTAKLVALWQSVGFCHGVLNTDNMSILGLTIDYGPFGFMDRFDPEFVCNASDKKG 354
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY + QP + WN+A+ + L A I +A +++ + + + D Y M KKLGL
Sbjct: 355 -RYTYEAQPYVCRWNLARLAEALGAE--IQSIKAGVILDEFMSLYEDFYLGNMRKKLGLL 411
Query: 489 KYNK----QIISKLLNNMAVDKVDYTNFFRALSNVKA---DPSIPED-----ELLVPLKA 536
+ + ++++ +L M + D+TN FR LS++ + DP+ ++ EL+V A
Sbjct: 412 RKQEPEDGELVADMLKTMHITGADFTNTFRLLSDISSPVGDPAEKDNTDSVVELIVDQCA 471
Query: 537 VLLD-------------------------------------------IGKERK------- 546
+L + IG+ R+
Sbjct: 472 LLEELKVANHPTMQPGELEMILSMAETNPEMFNMVANQPEVTKQLEKIGRLRELLMISEA 531
Query: 547 -------EAWISWVLSYIQELL--SSGISD-----EERKALMNSVNPKYVLRNYLCQSAI 592
E W WV Y + L + SD +ER MNS NP VLRNY+ Q+AI
Sbjct: 532 ELKVKQREHWQRWVKQYRKRLAFECNQASDPASVEKERVRFMNSTNPAVVLRNYIAQNAI 591
Query: 593 DAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
DAAE GDF EV+R+L+++E PY P +E P W+ G
Sbjct: 592 DAAEKGDFSEVQRVLRVLENPYSVSPDLE-----CPVWSAGKG 629
>gi|204927655|ref|ZP_03218856.1| protein YdiU [Salmonella enterica subsp. enterica serovar Javiana
str. GA_MM04042433]
gi|204322997|gb|EDZ08193.1| protein YdiU [Salmonella enterica subsp. enterica serovar Javiana
str. GA_MM04042433]
Length = 480
Score = 359 bits (921), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 212/521 (40%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVATRTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W +SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWG---KWLEVSCSS 480
>gi|420380158|ref|ZP_14879626.1| hypothetical protein SD22575_2009 [Shigella dysenteriae 225-75]
gi|391302674|gb|EIQ60528.1| hypothetical protein SD22575_2009 [Shigella dysenteriae 225-75]
Length = 478
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAELGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|448241960|ref|YP_007406013.1| hypothetical protein, UPF0061 family [Serratia marcescens WW4]
gi|445212324|gb|AGE17994.1| hypothetical protein, UPF0061 family [Serratia marcescens WW4]
Length = 480
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 210/528 (39%), Positives = 298/528 (56%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P+ D+ + L YT ++P+ +++ +L+ SE +A L LD F + P++ +G T
Sbjct: 2 PQFDNAYYQQLPGFYTALNPTP-LKDTRLLYHSEPLARELGLDESWFTQDKTPIW-AGET 59
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 60 LLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQVMADGSHRDWHLKGAGLTPYSRMGD 119
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS +REFL SEA+H LGIPTTRAL +VT+ + V R+ + E GA++ RVA
Sbjct: 120 GRAVLRSVVREFLASEALHHLGIPTTRALTIVTSQQPVYRE-------QPERGAMLLRVA 172
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q + VR LAD+ I H+ +++
Sbjct: 173 ESHVRFGHFEHFYYRKQPEQ--VRQLADFVIARHWPQLQDQ------------------- 211
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
+++Y W +V ERTA L+A WQ VGF HGV+NTDNMSILG+TIDYGP+GFLD + P
Sbjct: 212 --ADRYLLWFTDVVERTARLIAHWQTVGFAHGVMNTDNMSILGITIDYGPYGFLDDYQPG 269
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
+ N +D G RY F NQP + LWN+ + + TL+ L+ ++ + Y M Y
Sbjct: 270 YICNHSDHQG-RYAFDNQPAVALWNLHRLAQTLSG--LMTTEQLQQALAAYEPALMRAYG 326
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + +++ LL+ MA + DYT FR LS+ + + + PL+
Sbjct: 327 EQMRAKLGFFTPTAQDNDVLTGLLSLMAQEGRDYTRTFRLLSDTE------QQQAQSPLR 380
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + A+ +W Y + L +SD ER+ M +VNP+ +LRNYL Q AI+ A
Sbjct: 381 DEFID-----RAAFDAWYQQYRRRLQQEQVSDAERQRAMKAVNPRLILRNYLAQQAIEDA 435
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D G ++RL + + RP+D+ P + A LPP W +SCSS
Sbjct: 436 EKDDVGRLQRLHQALLRPFDDAPEYDDLAALPPDWGKH---LEISCSS 480
>gi|306815040|ref|ZP_07449196.1| hypothetical protein ECNC101_23398 [Escherichia coli NC101]
gi|432381380|ref|ZP_19624325.1| hypothetical protein WCU_01522 [Escherichia coli KTE15]
gi|432387134|ref|ZP_19630025.1| hypothetical protein WCY_02383 [Escherichia coli KTE16]
gi|432513947|ref|ZP_19751173.1| hypothetical protein A17M_01799 [Escherichia coli KTE224]
gi|432611449|ref|ZP_19847612.1| hypothetical protein A1UG_01802 [Escherichia coli KTE72]
gi|432646213|ref|ZP_19882003.1| hypothetical protein A1W5_01958 [Escherichia coli KTE86]
gi|432655791|ref|ZP_19891497.1| hypothetical protein A1WE_01902 [Escherichia coli KTE93]
gi|432699067|ref|ZP_19934225.1| hypothetical protein A31M_01809 [Escherichia coli KTE169]
gi|432745691|ref|ZP_19980360.1| hypothetical protein WGG_01792 [Escherichia coli KTE43]
gi|432904879|ref|ZP_20113785.1| hypothetical protein A13Y_02151 [Escherichia coli KTE194]
gi|432937895|ref|ZP_20136272.1| hypothetical protein A13C_00691 [Escherichia coli KTE183]
gi|432971870|ref|ZP_20160738.1| hypothetical protein A15O_02441 [Escherichia coli KTE207]
gi|432985399|ref|ZP_20174123.1| hypothetical protein A175_01848 [Escherichia coli KTE215]
gi|433038635|ref|ZP_20226239.1| hypothetical protein WIE_01979 [Escherichia coli KTE113]
gi|433082579|ref|ZP_20269044.1| hypothetical protein WIW_01721 [Escherichia coli KTE133]
gi|433101170|ref|ZP_20287267.1| hypothetical protein WK5_01725 [Escherichia coli KTE145]
gi|433144244|ref|ZP_20329396.1| hypothetical protein WKO_01777 [Escherichia coli KTE168]
gi|433188445|ref|ZP_20372548.1| hypothetical protein WGS_01516 [Escherichia coli KTE88]
gi|305851688|gb|EFM52141.1| hypothetical protein ECNC101_23398 [Escherichia coli NC101]
gi|430907116|gb|ELC28615.1| hypothetical protein WCY_02383 [Escherichia coli KTE16]
gi|430908383|gb|ELC29776.1| hypothetical protein WCU_01522 [Escherichia coli KTE15]
gi|431042545|gb|ELD53033.1| hypothetical protein A17M_01799 [Escherichia coli KTE224]
gi|431148873|gb|ELE50146.1| hypothetical protein A1UG_01802 [Escherichia coli KTE72]
gi|431180250|gb|ELE80137.1| hypothetical protein A1W5_01958 [Escherichia coli KTE86]
gi|431191849|gb|ELE91223.1| hypothetical protein A1WE_01902 [Escherichia coli KTE93]
gi|431244316|gb|ELF38624.1| hypothetical protein A31M_01809 [Escherichia coli KTE169]
gi|431291828|gb|ELF82324.1| hypothetical protein WGG_01792 [Escherichia coli KTE43]
gi|431433179|gb|ELH14851.1| hypothetical protein A13Y_02151 [Escherichia coli KTE194]
gi|431463979|gb|ELH44101.1| hypothetical protein A13C_00691 [Escherichia coli KTE183]
gi|431482571|gb|ELH62273.1| hypothetical protein A15O_02441 [Escherichia coli KTE207]
gi|431500836|gb|ELH79822.1| hypothetical protein A175_01848 [Escherichia coli KTE215]
gi|431552095|gb|ELI26057.1| hypothetical protein WIE_01979 [Escherichia coli KTE113]
gi|431602906|gb|ELI72333.1| hypothetical protein WIW_01721 [Escherichia coli KTE133]
gi|431620300|gb|ELI89177.1| hypothetical protein WK5_01725 [Escherichia coli KTE145]
gi|431662790|gb|ELJ29558.1| hypothetical protein WKO_01777 [Escherichia coli KTE168]
gi|431706488|gb|ELJ71058.1| hypothetical protein WGS_01516 [Escherichia coli KTE88]
Length = 478
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVDG--LNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432850692|ref|ZP_20081387.1| hypothetical protein A1YY_01516 [Escherichia coli KTE144]
gi|431400014|gb|ELG83396.1| hypothetical protein A1YY_01516 [Escherichia coli KTE144]
Length = 478
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 296/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDR--YDDYVSRPPDWGKRLEV---SCSS 478
>gi|402566293|ref|YP_006615638.1| hypothetical protein GEM_1519 [Burkholderia cepacia GG4]
gi|402247490|gb|AFQ47944.1| hypothetical protein GEM_1519 [Burkholderia cepacia GG4]
Length = 522
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 222/536 (41%), Positives = 297/536 (55%), Gaps = 71/536 (13%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPY 186
A +T++ P+A + P +V +S+ VA L L +P F F+G P A A+PY
Sbjct: 35 AFHTRL-PAAPLPAPYVVGFSDEVAQLLGLPASLAAQPGFAELFAG-NPTRDWPAHAMPY 92
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRA+T+GE+ +R+ELQLKG G+TPYSR DG AVLRSS
Sbjct: 93 ASVYSGHQFGVWAGQLGDGRALTIGELSGADGQRYELQLKGGGRTPYSRMGDGRAVLRSS 152
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG
Sbjct: 153 IREFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGH 205
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ S + DL +R LAD+ I D + + Y A
Sbjct: 206 FEHFFSNDRPDL--LRQLADHVI---------------------DRFYPACRDADDPYLA 242
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 243 LLEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGMTIDYGPFGFVDAFDANHICNHSDT 302
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGT 471
G RY + QP I WN + L A + ++D +A V+ ++
Sbjct: 303 SG-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVEDAQA--VLAKFPE 359
Query: 472 KFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPE 527
+F + M KLGL + + ++ +KLL M + D+T FR L+ + K D S
Sbjct: 360 RFGPALERAMRAKLGLELERENDAELANKLLETMHASRADFTLTFRRLAQLSKHDASRD- 418
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
P++ + +D ++A+ +W Y L D R MN VNPKYVLRN+L
Sbjct: 419 ----APVRDLFID-----RDAFDAWANLYRARLSEETRDDVARATAMNRVNPKYVLRNHL 469
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 470 AEVAIRRAKEKDFSEVERLAQVLRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|420347358|ref|ZP_14848758.1| hypothetical protein SB96558_2303 [Shigella boydii 965-58]
gi|391271307|gb|EIQ30182.1| hypothetical protein SB96558_2303 [Shigella boydii 965-58]
Length = 478
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RT SL+AQWQ VGF H V+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTTSLIAQWQTVGFAHRVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQWAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417138042|ref|ZP_11981775.1| hypothetical protein EC990741_1840 [Escherichia coli 97.0259]
gi|386158027|gb|EIH14364.1| hypothetical protein EC990741_1840 [Escherichia coli 97.0259]
Length = 478
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVALSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFIDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTCTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM S+NP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSINPALVLRNWLAQRAIEAAEKGDMKE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|300938961|ref|ZP_07153661.1| SelO family protein [Escherichia coli MS 21-1]
gi|432680286|ref|ZP_19915663.1| hypothetical protein A1YW_02030 [Escherichia coli KTE143]
gi|300456119|gb|EFK19612.1| SelO family protein [Escherichia coli MS 21-1]
gi|431221216|gb|ELF18537.1| hypothetical protein A1YW_02030 [Escherichia coli KTE143]
Length = 478
Score = 358 bits (919), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 217/515 (42%), Positives = 292/515 (56%), Gaps = 55/515 (10%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
YT +SP+ + N +L+ + +A++L + F+ + + G T L G P AQ Y
Sbjct: 16 TYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSPLAQVYS 72
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS+IRE L
Sbjct: 73 GHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESL 132
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG ++
Sbjct: 133 ASEAMHYLGIPTTRALSIVTSDSPVYRETM-------EPGAMLMRVALSHLRFGHFEHFY 185
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
R + + VR LAD+AIRH++ H+E+ DED KY W +V
Sbjct: 186 YRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYRLWFSDV 222
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D G RY
Sbjct: 223 VARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSDHQG-RY 281
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---P 488
F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KLG
Sbjct: 282 SFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKLGFMTEQ 339
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
K + ++++L + MA ++ DYT FR LS + + PL+ +D + A
Sbjct: 340 KEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-----RAA 388
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E+ RL +
Sbjct: 389 FDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTELHRLHE 448
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+ ++ + Y PP W R V SCSS
Sbjct: 449 ALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|420335986|ref|ZP_14837586.1| hypothetical protein SFK315_1743 [Shigella flexneri K-315]
gi|391264592|gb|EIQ23584.1| hypothetical protein SFK315_1743 [Shigella flexneri K-315]
Length = 478
Score = 358 bits (919), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ R+A S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRMAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W +SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWG---KWLEVSCSS 478
>gi|422368519|ref|ZP_16448931.1| SelO family protein [Escherichia coli MS 16-3]
gi|432898624|ref|ZP_20109316.1| hypothetical protein A13U_02072 [Escherichia coli KTE192]
gi|433028578|ref|ZP_20216440.1| hypothetical protein WIA_01671 [Escherichia coli KTE109]
gi|315299738|gb|EFU58978.1| SelO family protein [Escherichia coli MS 16-3]
gi|431426276|gb|ELH08320.1| hypothetical protein A13U_02072 [Escherichia coli KTE192]
gi|431543687|gb|ELI18653.1| hypothetical protein WIA_01671 [Escherichia coli KTE109]
Length = 478
Score = 358 bits (919), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 YQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVDG--LNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|218689651|ref|YP_002397863.1| hypothetical protein ECED1_1908 [Escherichia coli ED1a]
gi|416337690|ref|ZP_11674053.1| hypothetical protein EcoM_03504 [Escherichia coli WV_060327]
gi|432801865|ref|ZP_20035846.1| hypothetical protein A1W3_02120 [Escherichia coli KTE84]
gi|254814081|sp|B7MVI5.1|YDIU_ECO81 RecName: Full=UPF0061 protein YdiU
gi|218427215|emb|CAR08101.2| conserved hypothetical protein [Escherichia coli ED1a]
gi|320194582|gb|EFW69213.1| hypothetical protein EcoM_03504 [Escherichia coli WV_060327]
gi|431348842|gb|ELG35684.1| hypothetical protein A1W3_02120 [Escherichia coli KTE84]
Length = 478
Score = 358 bits (919), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVDG--LNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|422781439|ref|ZP_16834224.1| hypothetical protein ERFG_01679 [Escherichia coli TW10509]
gi|323978157|gb|EGB73243.1| hypothetical protein ERFG_01679 [Escherichia coli TW10509]
Length = 478
Score = 358 bits (919), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LA++AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLAEFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPSLVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417586576|ref|ZP_12237348.1| hypothetical protein ECSTECC16502_2203 [Escherichia coli
STEC_C165-02]
gi|345338079|gb|EGW70510.1| hypothetical protein ECSTECC16502_2203 [Escherichia coli
STEC_C165-02]
Length = 478
Score = 358 bits (918), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 219/521 (42%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPATYTTLSPTP-LNNARLIWHNAELANTLGIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRHEP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LYRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|437835065|ref|ZP_20845200.1| hypothetical protein SEEERB17_016684 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
gi|435300677|gb|ELO76741.1| hypothetical protein SEEERB17_016684 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SARB17]
Length = 480
Score = 358 bits (918), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 215/522 (41%), Positives = 294/522 (56%), Gaps = 55/522 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
++ + R E V+ LAD+AIRH++ +++ + KY
Sbjct: 182 HFEHFYYCREPEK---VQQLADFAIRHYWPQWQDVPE---------------------KY 217
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
W EVA RT L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGF D +DP F N +
Sbjct: 218 DLWFEEVAARTGRLIAEWQTVGFAHGVMNTDNMSILGLTIDYGPFGFFDDYDPGFIGNHS 277
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F NQP + LWN+ + + TL ID N ++RY + Y M +K
Sbjct: 278 DHQG-RYRFDNQPSVALWNLQRLAQTLTPFIEID--ALNRALDRYQDALLTHYGQRMRQK 334
Query: 485 LGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LG K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 335 LGFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID- 387
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD
Sbjct: 388 ----RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMA 443
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 444 ELHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|416528395|ref|ZP_11743845.1| hypothetical protein SEEM010_01872 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|416535713|ref|ZP_11747967.1| hypothetical protein SEEM030_08803 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB30]
gi|416554020|ref|ZP_11758048.1| hypothetical protein SEEM29N_20083 [Salmonella enterica subsp.
enterica serovar Montevideo str. 29N]
gi|416571495|ref|ZP_11766729.1| hypothetical protein SEEM41H_12771 [Salmonella enterica subsp.
enterica serovar Montevideo str. 4441 H]
gi|363553712|gb|EHL37958.1| hypothetical protein SEEM010_01872 [Salmonella enterica subsp.
enterica serovar Montevideo str. LQC 10]
gi|363562206|gb|EHL46312.1| hypothetical protein SEEM29N_20083 [Salmonella enterica subsp.
enterica serovar Montevideo str. 29N]
gi|363565921|gb|EHL49945.1| hypothetical protein SEEM030_08803 [Salmonella enterica subsp.
enterica serovar Montevideo str. SARB30]
gi|363574025|gb|EHL57898.1| hypothetical protein SEEM41H_12771 [Salmonella enterica subsp.
enterica serovar Montevideo str. 4441 H]
Length = 480
Score = 358 bits (918), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 213/521 (40%), Positives = 295/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KY
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYD 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL ID N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTLTPFIEID--ALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AI+AAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAINAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|387607327|ref|YP_006096183.1| hypothetical protein EC042_1873 [Escherichia coli 042]
gi|284921627|emb|CBG34699.1| conserved hypothetical protein [Escherichia coli 042]
Length = 478
Score = 358 bits (918), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSESPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++V RTASL+AQWQ V F HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFIDVVARTASLIAQWQTVSFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + +A ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLLARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|420255528|ref|ZP_14758415.1| hypothetical protein PMI06_08879 [Burkholderia sp. BT03]
gi|398045033|gb|EJL37810.1| hypothetical protein PMI06_08879 [Burkholderia sp. BT03]
Length = 518
Score = 358 bits (918), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 217/522 (41%), Positives = 288/522 (55%), Gaps = 60/522 (11%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P +V ++ VA L D P F FFSG T A ++PYA Y GHQ
Sbjct: 41 PAAPLPAPYVVGFAPDVAAMLGFDASLASAPGFAEFFSGNTTRDWPAASLPYASVYSGHQ 100
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+TLGE+ + +R+ELQLKGAG+TPYSR DG AVLRSSIRE+LCSE
Sbjct: 101 FGVWAGQLGDGRALTLGEVEH-DGKRFELQLKGAGRTPYSRMGDGRAVLRSSIREYLCSE 159
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC+ + + V R+ + E A+V RV+ SF+RFG ++ +
Sbjct: 160 AMHHLGIPTTRALCVTGSDQPVRRE-------EMETAAVVTRVSPSFVRFGHFEHFYA-- 210
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ +D +R LAD I + + + + Y A E
Sbjct: 211 NDRVDALRALADQVIDRFYPSCRDAD---------------------DPYLALLNEAVLS 249
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA L+AQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 250 TADLIAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDANHICNHSDSQG-RYAYR 308
Query: 435 NQPDIGLWNIAQFSTTLAA--AKLIDD--------KEANYVMERYGTKFMDEYQAIMTKK 484
QP I WN+ + L + DD ++A V+E + +F +A M K
Sbjct: 309 MQPQIAYWNLFCLAQGLLPLFGERYDDAQRSERAVQDAQRVLEGFKARFAPALEARMRAK 368
Query: 485 LGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + I +KL M ++ D+T FR LS + + + ++ + LD
Sbjct: 369 LGLDTQREGDDAIANKLFEIMNANRADFTLTFRNLSKLSKHDASGD----TSVRDLFLD- 423
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+ A+ +W Y L+ D R MN VNPKYVLRN+L ++AI A+ DF
Sbjct: 424 ----RAAFDAWATDYRARLVHETRDDAARAEAMNRVNPKYVLRNHLAEAAIRQAKEKDFS 479
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
EV RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 480 EVERLATVLRRPFDEQPDYEAYAGLPPDWA---SSLEVSCSS 518
>gi|416422303|ref|ZP_11690207.1| hypothetical protein SEEM315_14043 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|416431080|ref|ZP_11695362.1| hypothetical protein SEEM971_00760 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|416441197|ref|ZP_11701409.1| hypothetical protein SEEM973_11935 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|416446483|ref|ZP_11705073.1| hypothetical protein SEEM974_02490 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|416452084|ref|ZP_11708751.1| hypothetical protein SEEM201_17041 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|416458903|ref|ZP_11713412.1| hypothetical protein SEEM202_00540 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|416467995|ref|ZP_11717742.1| hypothetical protein SEEM954_01233 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|416479638|ref|ZP_11722447.1| hypothetical protein SEEM054_20381 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|416489514|ref|ZP_11726278.1| hypothetical protein SEEM675_18375 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gi|416497533|ref|ZP_11729801.1| hypothetical protein SEEM965_06881 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gi|416542891|ref|ZP_11751891.1| hypothetical protein SEEM19N_11448 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|416576161|ref|ZP_11768848.1| hypothetical protein SEEM801_02696 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|416583458|ref|ZP_11773310.1| hypothetical protein SEEM507_13566 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gi|416590874|ref|ZP_11778049.1| hypothetical protein SEEM877_21334 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|416598911|ref|ZP_11783262.1| hypothetical protein SEEM867_19539 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|416608010|ref|ZP_11789004.1| hypothetical protein SEEM180_03790 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|416611276|ref|ZP_11790706.1| hypothetical protein SEEM600_04842 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|416624360|ref|ZP_11798016.1| hypothetical protein SEEM581_17987 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|416630444|ref|ZP_11800744.1| hypothetical protein SEEM501_01421 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|416638707|ref|ZP_11804102.1| hypothetical protein SEEM460_07669 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|416650877|ref|ZP_11810642.1| hypothetical protein SEEM020_008110 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|416662643|ref|ZP_11815978.1| hypothetical protein SEEM6152_01972 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|416665871|ref|ZP_11817022.1| hypothetical protein SEEM0077_04569 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|416682047|ref|ZP_11823908.1| hypothetical protein SEEM0047_21193 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gi|416702488|ref|ZP_11829547.1| hypothetical protein SEEM0055_09078 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|416707117|ref|ZP_11832215.1| hypothetical protein SEEM0052_11622 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|416714413|ref|ZP_11837731.1| hypothetical protein SEEM3312_01564 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|416717151|ref|ZP_11839432.1| hypothetical protein SEEM5258_21629 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|416725096|ref|ZP_11845466.1| hypothetical protein SEEM1156_19024 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|416729593|ref|ZP_11848139.1| hypothetical protein SEEM9199_00060 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|416738568|ref|ZP_11853358.1| hypothetical protein SEEM8282_01406 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|416750514|ref|ZP_11859751.1| hypothetical protein SEEM8283_22199 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|416759126|ref|ZP_11864054.1| hypothetical protein SEEM8284_10058 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gi|416762010|ref|ZP_11866060.1| hypothetical protein SEEM8285_03315 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|416768096|ref|ZP_11870373.1| hypothetical protein SEEM8287_15860 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|418485817|ref|ZP_13054799.1| hypothetical protein SEEM906_19179 [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|418491316|ref|ZP_13057840.1| hypothetical protein SEEM5278_02023 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|418495547|ref|ZP_13061989.1| hypothetical protein SEEM5318_12088 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|418499159|ref|ZP_13065568.1| hypothetical protein SEEM5320_21403 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|418503037|ref|ZP_13069406.1| hypothetical protein SEEM5321_07435 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|418510242|ref|ZP_13076528.1| hypothetical protein SEEM5327_06213 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|418527139|ref|ZP_13093096.1| hypothetical protein SEEM8286_12742 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
gi|322616730|gb|EFY13639.1| hypothetical protein SEEM315_14043 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315996572]
gi|322620010|gb|EFY16883.1| hypothetical protein SEEM971_00760 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-1]
gi|322622321|gb|EFY19166.1| hypothetical protein SEEM973_11935 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-3]
gi|322627845|gb|EFY24635.1| hypothetical protein SEEM974_02490 [Salmonella enterica subsp.
enterica serovar Montevideo str. 495297-4]
gi|322633057|gb|EFY29800.1| hypothetical protein SEEM201_17041 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-1]
gi|322636697|gb|EFY33400.1| hypothetical protein SEEM202_00540 [Salmonella enterica subsp.
enterica serovar Montevideo str. 515920-2]
gi|322641277|gb|EFY37918.1| hypothetical protein SEEM954_01233 [Salmonella enterica subsp.
enterica serovar Montevideo str. 531954]
gi|322645266|gb|EFY41795.1| hypothetical protein SEEM054_20381 [Salmonella enterica subsp.
enterica serovar Montevideo str. NC_MB110209-0054]
gi|322650207|gb|EFY46621.1| hypothetical protein SEEM675_18375 [Salmonella enterica subsp.
enterica serovar Montevideo str. OH_2009072675]
gi|322655781|gb|EFY52083.1| hypothetical protein SEEM965_06881 [Salmonella enterica subsp.
enterica serovar Montevideo str. CASC_09SCPH15965]
gi|322660107|gb|EFY56346.1| hypothetical protein SEEM19N_11448 [Salmonella enterica subsp.
enterica serovar Montevideo str. 19N]
gi|322665326|gb|EFY61514.1| hypothetical protein SEEM801_02696 [Salmonella enterica subsp.
enterica serovar Montevideo str. 81038-01]
gi|322669584|gb|EFY65732.1| hypothetical protein SEEM507_13566 [Salmonella enterica subsp.
enterica serovar Montevideo str. MD_MDA09249507]
gi|322673510|gb|EFY69612.1| hypothetical protein SEEM877_21334 [Salmonella enterica subsp.
enterica serovar Montevideo str. 414877]
gi|322677436|gb|EFY73500.1| hypothetical protein SEEM867_19539 [Salmonella enterica subsp.
enterica serovar Montevideo str. 366867]
gi|322679899|gb|EFY75938.1| hypothetical protein SEEM180_03790 [Salmonella enterica subsp.
enterica serovar Montevideo str. 413180]
gi|322687371|gb|EFY83343.1| hypothetical protein SEEM600_04842 [Salmonella enterica subsp.
enterica serovar Montevideo str. 446600]
gi|323192489|gb|EFZ77719.1| hypothetical protein SEEM581_17987 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609458-1]
gi|323198656|gb|EFZ83757.1| hypothetical protein SEEM501_01421 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556150-1]
gi|323204084|gb|EFZ89098.1| hypothetical protein SEEM460_07669 [Salmonella enterica subsp.
enterica serovar Montevideo str. 609460]
gi|323209950|gb|EFZ94860.1| hypothetical protein SEEM6152_01972 [Salmonella enterica subsp.
enterica serovar Montevideo str. 556152]
gi|323217679|gb|EGA02394.1| hypothetical protein SEEM0077_04569 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB101509-0077]
gi|323220084|gb|EGA04551.1| hypothetical protein SEEM0047_21193 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB102109-0047]
gi|323223501|gb|EGA07827.1| hypothetical protein SEEM0055_09078 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB110209-0055]
gi|323229481|gb|EGA13604.1| hypothetical protein SEEM0052_11622 [Salmonella enterica subsp.
enterica serovar Montevideo str. MB111609-0052]
gi|323232704|gb|EGA16800.1| hypothetical protein SEEM3312_01564 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009083312]
gi|323240257|gb|EGA24301.1| hypothetical protein SEEM5258_21629 [Salmonella enterica subsp.
enterica serovar Montevideo str. 2009085258]
gi|323242755|gb|EGA26776.1| hypothetical protein SEEM1156_19024 [Salmonella enterica subsp.
enterica serovar Montevideo str. 315731156]
gi|323249071|gb|EGA32990.1| hypothetical protein SEEM9199_00060 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2009159199]
gi|323252790|gb|EGA36627.1| hypothetical protein SEEM8282_01406 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008282]
gi|323255317|gb|EGA39091.1| hypothetical protein SEEM8283_22199 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008283]
gi|323260111|gb|EGA43736.1| hypothetical protein SEEM8284_10058 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008284]
gi|323267125|gb|EGA50610.1| hypothetical protein SEEM8285_03315 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008285]
gi|323271551|gb|EGA54972.1| hypothetical protein SEEM8287_15860 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008287]
gi|366055707|gb|EHN20042.1| hypothetical protein SEEM906_19179 [Salmonella enterica subsp.
enterica serovar Montevideo str. 80959-06]
gi|366059403|gb|EHN23677.1| hypothetical protein SEEM5318_12088 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035318]
gi|366062766|gb|EHN26994.1| hypothetical protein SEEM5278_02023 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035278]
gi|366071694|gb|EHN35788.1| hypothetical protein SEEM5320_21403 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035320]
gi|366074761|gb|EHN38823.1| hypothetical protein SEEM5321_07435 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035321]
gi|366077102|gb|EHN41127.1| hypothetical protein SEEM5327_06213 [Salmonella enterica subsp.
enterica serovar Montevideo str. CT_02035327]
gi|366827759|gb|EHN54657.1| hypothetical protein SEEM020_008110 [Salmonella enterica subsp.
enterica serovar Montevideo str. 507440-20]
gi|372204608|gb|EHP18135.1| hypothetical protein SEEM8286_12742 [Salmonella enterica subsp.
enterica serovar Montevideo str. IA_2010008286]
Length = 480
Score = 357 bits (917), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 212/521 (40%), Positives = 294/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AI H++ +++ + KY
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIHHYWPQWQDVPE---------------------KYD 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|312796405|ref|YP_004029327.1| hypothetical protein RBRH_01599 [Burkholderia rhizoxinica HKI 454]
gi|312168180|emb|CBW75183.1| Hypothetical cytosolic protein [Burkholderia rhizoxinica HKI 454]
Length = 516
Score = 357 bits (917), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 211/515 (40%), Positives = 283/515 (54%), Gaps = 59/515 (11%)
Query: 144 NPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQFGMWAG 200
+P +VA S +A L L P F +F G A+P+A Y GHQFG+WAG
Sbjct: 46 DPYVVAVSTDLAHELGLGATALTDPAFADYFCGNLTQYLEHAALPFASVYSGHQFGVWAG 105
Query: 201 QLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLG 260
QLGDGRA+TLGE + + +R E+Q+KG G+TPYSR DG AVLRSSIREFLCSEAMH LG
Sbjct: 106 QLGDGRALTLGETEH-RGQRQEIQIKGGGRTPYSRTGDGRAVLRSSIREFLCSEAMHCLG 164
Query: 261 IPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDI 320
IPTTRALC++ + V R+ E A+ RVA +F+RFG ++ S GQ ++
Sbjct: 165 IPTTRALCVIGSDTPVYRETV-------ETAAVTTRVAPTFIRFGHFEHFYSTGQ--VEA 215
Query: 321 VRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVA 380
+R LAD+ I F + + Y A V ERTA+L+A
Sbjct: 216 LRRLADHVIEREFPSCRDAQ---------------------DPYLALLTAVCERTAALIA 254
Query: 381 QWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIG 440
WQ VGF HGV+NTDNMSI+GLTIDYGPFGF+D FD + N +D G RY + QP +G
Sbjct: 255 HWQAVGFCHGVMNTDNMSIIGLTIDYGPFGFIDGFDANHICNHSDTSG-RYAYQQQPHVG 313
Query: 441 LWNIAQFSTTLA---------AAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY- 490
WN+ + L A +A ++ Y F +A KLGL
Sbjct: 314 RWNLICLAQALVPLIGAHRGTAGDERAIADARDALQGYQAHFGPALEARFRAKLGLATAE 373
Query: 491 --NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ +I++LL M + D+T FR ++ V + + P++ + +D + A
Sbjct: 374 PDDVALINRLLALMHANHADFTLTFRRMAGVCQHDASGD----APVRDLFVD-----RAA 424
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ +W +Y Q L + D R+A MN VNPKYVLRN+L + A+ AA DF E+ RLL+
Sbjct: 425 FDAWAATYRQRLKTEPADDATRRAAMNRVNPKYVLRNHLAEQAVRAANGKDFTEIARLLQ 484
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ RP+DEQP E YA LPP WA V SCSS
Sbjct: 485 VLSRPFDEQPEYEAYAALPPDWAASLSV---SCSS 516
>gi|78066678|ref|YP_369447.1| hypothetical protein Bcep18194_A5209 [Burkholderia sp. 383]
gi|77967423|gb|ABB08803.1| protein of unknown function UPF0061 [Burkholderia sp. 383]
Length = 540
Score = 357 bits (917), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 225/533 (42%), Positives = 298/533 (55%), Gaps = 65/533 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S VA L+L P +P F F+G A A+PYA
Sbjct: 53 AFHTRL-PAAPLAAPYVVGFSGEVAQLLDLPPSIAAQPGFAELFAGNPTRDWPANAMPYA 111
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE R+ELQLKG+G+TPYSR DG AVLRSSI
Sbjct: 112 SVYSGHQFGVWAGQLGDGRALTIGERTGTDGRRYELQLKGSGRTPYSRMGDGRAVLRSSI 171
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 172 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGHF 224
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I + + + Y A
Sbjct: 225 EHFFSNDRPDL--LRQLADHVIDRFYPECRRAD---------------------DPYLAL 261
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 262 LEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTS 321
Query: 428 GRRYCFANQPDIGLWN---IAQFSTTLAAAKL-IDD---------KEANYVMERYGTKFM 474
G RY + QP I WN +AQ L + IDD ++A V+ ++ +F
Sbjct: 322 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIDDDDARAERAVEDAQAVLAKFPERFG 380
Query: 475 DEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDEL 530
+ M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 381 PALERAMRAKLGLELERESDAELANKLLETMHASHADFTLTFRRLAQISKHDASRD---- 436
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
P++ + +D ++A+ +W Y L D R A MN VNPKYVLRN+L +
Sbjct: 437 -APVRDLFID-----RDAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLAEV 490
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 491 AIRRAKEKDFSEVERLAQILRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 540
>gi|317420116|emb|CBN82152.1| Uncharacterized protein [Dicentrarchus labrax]
Length = 531
Score = 357 bits (917), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 217/537 (40%), Positives = 299/537 (55%), Gaps = 39/537 (7%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADS-LELDPKEFERPDFPLF 173
P D + R V + ++K P+ +L A S+ V + L++D + +F +
Sbjct: 26 FPVDEVDGNFVRTVKNCIFSKSIPTPLKGPLRLAAVSKDVVEGILDVDVAVTQSEEFLHY 85
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
SG L G+VP A YGGHQFG WAGQLGDGRA +LG+ N E WELQLKG+GKTPY
Sbjct: 86 ASGGRLLQGSVPLAHRYGGHQFGYWAGQLGDGRAHSLGQYTNRNGEVWELQLKGSGKTPY 145
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AV+RSS+REFLCSEAMHFLG+PT+RA L+ + + V RD FY GN K E GA+
Sbjct: 146 SRSGDGRAVIRSSVREFLCSEAMHFLGVPTSRAASLIVSDEPVLRDQFYSGNVKTERGAV 205
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
V R+A+S+ R GS +I A G+ +D++R L ++ I HF ++ + D D
Sbjct: 206 VLRLAKSWFRIGSLEILAQSGE--IDLLRKLLNFVIGEHFASVD-----------SDDPD 252
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
KY + V TA L+AQW VGF HGV NTDN S+L +TIDYGPFGF++
Sbjct: 253 ---------KYLVFYSTVVNETAHLIAQWMSVGFAHGVCNTDNFSLLSITIDYGPFGFME 303
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLA-AAKLIDDKEANYVMERYGTK 472
+++P+F PNT+D G RY Q +IGL+N+ + L+ KEA +++ Y
Sbjct: 304 SYNPNFVPNTSDDEG-RYSVGAQANIGLFNLEKLLMALSPVLSEKQQKEAKMILKGYVDI 362
Query: 473 FMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDE 529
+ + KLGL ++ +I+ LL M + D+T FR LS V A D
Sbjct: 363 YQMRIHQLFKAKLGLLGEEEEDGYLIAFLLKMMEDTQSDFTMTFRQLSEVSARQLHNSD- 421
Query: 530 LLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKALMNSVNPKYVLRNYLC 588
L D+ + + W+ Y+ L SD +R+ M +VNP+YVLRN++
Sbjct: 422 --FTQMWALEDLSSHK--LFSDWLSMYLLRLSRQRDNSDLDRQHRMKNVNPRYVLRNWMA 477
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK--YARLPPAWAYRPGVCMLSCSS 643
+SAI AE+ DF EV L ++ P+ Q E+ YA PP WA R V SCSS
Sbjct: 478 ESAIGKAEMNDFSEVELLHHILSFPFVTQETAEEAGYAARPPVWAKRLKV---SCSS 531
>gi|420352639|ref|ZP_14853776.1| hypothetical protein SB444474_1719 [Shigella boydii 4444-74]
gi|391281574|gb|EIQ40215.1| hypothetical protein SB444474_1719 [Shigella boydii 4444-74]
Length = 472
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 214/511 (41%), Positives = 290/511 (56%), Gaps = 52/511 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNIELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ L+ SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLIQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
+ RL + + P+ ++ + Y PP W R
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKR 471
>gi|110641828|ref|YP_669558.1| hypothetical protein ECP_1654 [Escherichia coli 536]
gi|121957927|sp|Q0THC2.1|YDIU_ECOL5 RecName: Full=UPF0061 protein YdiU
gi|110343420|gb|ABG69657.1| putative cytoplasmic protein [Escherichia coli 536]
Length = 478
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NSAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|390571714|ref|ZP_10251951.1| hypothetical protein WQE_25182 [Burkholderia terrae BS001]
gi|389936328|gb|EIM98219.1| hypothetical protein WQE_25182 [Burkholderia terrae BS001]
Length = 505
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 217/522 (41%), Positives = 288/522 (55%), Gaps = 60/522 (11%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P +V ++ VA L D P F FFSG T A ++PYA Y GHQ
Sbjct: 28 PAAPLPAPYVVGFAPDVAAMLGFDASLASAPGFAEFFSGNTTRDWPAASLPYASVYSGHQ 87
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+TLGE+ + +R+ELQLKGAG+TPYSR DG AVLRSSIRE+LCSE
Sbjct: 88 FGVWAGQLGDGRALTLGEVEH-DGKRFELQLKGAGRTPYSRMGDGRAVLRSSIREYLCSE 146
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC+ + + V R+ + E A+V RV+ SF+RFG ++ +
Sbjct: 147 AMHHLGIPTTRALCVTGSDQPVRRE-------EMETAAVVTRVSPSFVRFGHFEHFYA-- 197
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ +D +R LAD I + + + + Y A E
Sbjct: 198 NDRVDALRALADQVIDRFYPSCRDAD---------------------DPYLALLNEAVLS 236
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 237 TADLVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDANHICNHSDSQG-RYAYR 295
Query: 435 NQPDIGLWNIAQFSTTLAA--AKLIDD--------KEANYVMERYGTKFMDEYQAIMTKK 484
QP I WN+ + L + DD ++A V+E + +F +A M K
Sbjct: 296 MQPQIAYWNLFCLAQGLLPLFGERYDDAQRSERAVQDAQRVLEGFKARFAPALEARMRAK 355
Query: 485 LGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + + +KL M ++ D+T FR LS + + + ++ + LD
Sbjct: 356 LGLDTQRDGDDALANKLFEIMNANRADFTLTFRNLSKLSKHDASGD----TSVRDLFLD- 410
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+ A+ +W Y L+ D R MN VNPKYVLRN+L ++AI A+ DF
Sbjct: 411 ----RAAFDAWATDYRARLVHETRDDAARAEAMNRVNPKYVLRNHLAEAAIRQAKEKDFS 466
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
EV RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 467 EVERLATVLRRPFDEQPDYEAYAGLPPDWA---SSLEVSCSS 505
>gi|332529850|ref|ZP_08405803.1| hypothetical protein HGR_08019 [Hylemonella gracilis ATCC 19624]
gi|332040692|gb|EGI77065.1| hypothetical protein HGR_08019 [Hylemonella gracilis ATCC 19624]
Length = 512
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 224/551 (40%), Positives = 294/551 (53%), Gaps = 57/551 (10%)
Query: 110 SFVRELPGDPRTDSIPREV----------LHACY-TKVSPSAEVEN--PQLVAWSESVAD 156
S V + P R D+ P + L A Y T ++P + P V S +V D
Sbjct: 2 SAVLDTPAHARNDAAPVQTGLRWINRYAQLGASYATALAPQTLPADHPPYWVGQSRAVGD 61
Query: 157 SLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNL 216
L L P D +G PLAG+ P A Y GHQFG+WAGQLGDGRA+ LGE+L+
Sbjct: 62 WLGLAPDWTTSSDLLAALTGNAPLAGSAPVATVYSGHQFGVWAGQLGDGRALLLGEVLSE 121
Query: 217 KSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFV 276
E+QLKGAG+TPYSR DG AVLRSSIREFL SEAMH +G+PTTRALC+ + V
Sbjct: 122 TGSGLEIQLKGAGRTPYSRMGDGRAVLRSSIREFLASEAMHAMGVPTTRALCVTGSDAPV 181
Query: 277 TRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHI 336
R+ E A+V RVA SF+RFG ++ ASR E D +R LADY I ++
Sbjct: 182 RRETI-------ETAAVVTRVASSFIRFGHFEHFASR--EQFDELRVLADYVIDRYYPEC 232
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ + N YAA V+ERTA L+A WQ VGF HGV+NTDN
Sbjct: 233 RATDVYQ-----------------GNAYAALLAAVSERTAVLLAHWQAVGFCHGVMNTDN 275
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILGLT+DYGP+ FLD +DP N +D G RY +A QP++ WN+ + L L
Sbjct: 276 MSILGLTLDYGPYQFLDGYDPGHICNHSDTQG-RYAYARQPNVAYWNLHALAQAL--LPL 332
Query: 457 IDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNF 512
I+D+ A ++ Y +F E A KLGL + ++ ++ L +A ++ DYT F
Sbjct: 333 IEDERLAQAAVDVYRERFPLELDARYRAKLGLATHQPDDRALLEATLRLLAQERTDYTIF 392
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKA 572
+R LS A + E L+ + +D A+ W+ Y L + D
Sbjct: 393 WRRLSEHVAASARGETRAQA-LRDLFID-----STAFDDWLSRYEARLTQEPLQDSANT- 445
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M VNP++VLRN+L + AI A D+ V RLL L+ERP+DE PG E A PP WA
Sbjct: 446 -MLGVNPRFVLRNWLGEQAIRQARDKDYSGVARLLALLERPFDEHPGFEAEAGFPPDWA- 503
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 504 --ASIEISCSS 512
>gi|419700504|ref|ZP_14228110.1| hypothetical protein OQA_08101 [Escherichia coli SCI-07]
gi|422381721|ref|ZP_16461885.1| SelO family protein [Escherichia coli MS 57-2]
gi|432732402|ref|ZP_19967235.1| hypothetical protein WGK_02244 [Escherichia coli KTE45]
gi|432759486|ref|ZP_19993981.1| hypothetical protein A1S1_01603 [Escherichia coli KTE46]
gi|324007069|gb|EGB76288.1| SelO family protein [Escherichia coli MS 57-2]
gi|380348280|gb|EIA36562.1| hypothetical protein OQA_08101 [Escherichia coli SCI-07]
gi|431275589|gb|ELF66616.1| hypothetical protein WGK_02244 [Escherichia coli KTE45]
gi|431308659|gb|ELF96938.1| hypothetical protein A1S1_01603 [Escherichia coli KTE46]
Length = 478
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 295/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD++IRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFSIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|406672877|ref|ZP_11080102.1| hypothetical protein HMPREF9700_00644 [Bergeyella zoohelcum CCUG
30536]
gi|405587421|gb|EKB61149.1| hypothetical protein HMPREF9700_00644 [Bergeyella zoohelcum CCUG
30536]
Length = 510
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 211/542 (38%), Positives = 293/542 (54%), Gaps = 57/542 (10%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFF 174
PGD + R+ + Y+ V+P + P L+ ++ ++ + L E+ D P
Sbjct: 13 FPGDTSLNPYQRQTPNVLYSLVTPEI-FKKPTLLIFNTKLSQEIGLG--EYSEQDLPFLV 69
Query: 175 SGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P PY+ Y GHQFG WAGQLGDGRAI GEI N K + ELQ KGAG TPYS
Sbjct: 70 GNHLP-QNIRPYSTAYAGHQFGNWAGQLGDGRAIFAGEIQNKKGKTHELQWKGAGATPYS 128
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R ADG AV RSS+RE+L SEAM+ LGIPTTRAL L TG+ V RD+ Y+GNP+EE GA+V
Sbjct: 129 RHADGKAVFRSSLREYLMSEAMYHLGIPTTRALSLCFTGEKVIRDILYNGNPQEENGAVV 188
Query: 295 CRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RV++SFLRFG ++ + Q D ++++ LAD+ I H +
Sbjct: 189 MRVSESFLRFGHFEF--ASLQSDKNLLKDLADFTITHFYPE------------------- 227
Query: 355 SVVDLTS-NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
VD+ S +KYA W ++ E+T L+ +W VGF HGV+NTDNMSI+G TIDYGPFG L+
Sbjct: 228 --VDIHSPDKYALWFEKITEKTLHLIIEWLRVGFVHGVMNTDNMSIIGETIDYGPFGMLE 285
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTK 472
++ +FTPNTTDLPGRRY F Q I WN+ Q + L A LI+D + ++ +G
Sbjct: 286 EYNLNFTPNTTDLPGRRYAFGKQGQIAQWNLWQLANALYA--LINDADFLQNTLDNFGKN 343
Query: 473 FMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPS----- 524
F ++ ++ KK GL K ++ M +K+DYT FF L + +
Sbjct: 344 FWKKHDEMLAKKFGLDKVLPSDEDFFVHWQKLMTSEKLDYTLFFTELERARTHHTPQWAN 403
Query: 525 ---IPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+P +E + LK + + Y+ L + EE M + NPK+
Sbjct: 404 VSYLPNEE--INLKKI------------NDFYTQYLIRLEQNNCPKEESIQWMKTHNPKF 449
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
+LRNYL I+ E GD + L+ +E PY+ + + R P + G MLSC
Sbjct: 450 ILRNYLLYDCIEKVEAGDTEMLHLLIHALENPYETKYEHFQKKR-PTQYDDVSGCSMLSC 508
Query: 642 SS 643
SS
Sbjct: 509 SS 510
>gi|254252170|ref|ZP_04945488.1| hypothetical protein BDAG_01385 [Burkholderia dolosa AUO158]
gi|124894779|gb|EAY68659.1| hypothetical protein BDAG_01385 [Burkholderia dolosa AUO158]
Length = 600
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 221/529 (41%), Positives = 295/529 (55%), Gaps = 70/529 (13%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPYAQCYGGH 193
P+A + P +V +S+ VA L L +P F F+G P A A+PYA Y GH
Sbjct: 119 PAAPLPAPYVVGFSDDVARLLGLPESIAAQPAFAELFAG-NPTRDWPADAMPYASVYSGH 177
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+T+GE+ R+ELQLKG+G+TPYSR DG AVLRSSIREFLCS
Sbjct: 178 QFGVWAGQLGDGRALTIGELAGTDGRRYELQLKGSGRTPYSRMGDGRAVLRSSIREFLCS 237
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAMH LGIPTTRAL +V + V R+ E A+V RV++SF+RFG ++ S
Sbjct: 238 EAMHHLGIPTTRALTVVGSDHPVVREEI-------ETAAVVTRVSESFVRFGHFEHFFSN 290
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
+ DL +R LAD+ I + + + + Y A V
Sbjct: 291 DRPDL--LRALADHVIDRFYPACRDAD---------------------DPYLALLEAVTL 327
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA LVAQWQ VGF HGV+NTDNMSILG+T+DYGPFGF+DAFD + N +D G RY +
Sbjct: 328 RTADLVAQWQAVGFCHGVMNTDNMSILGVTLDYGPFGFVDAFDANHICNHSDTSG-RYAY 386
Query: 434 ANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTKFMDEYQ 478
QP I WN + L A + +DD +A V+ ++ +F +
Sbjct: 387 RMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPERFGPALE 444
Query: 479 AIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDELLVPL 534
M KLGL +++ ++ ++LL M + D+T FR L+ + K D S P+
Sbjct: 445 RAMRAKLGLELEREHDAELANQLLETMHASRADFTLTFRRLAQLSKHDASRD-----APV 499
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ + +D ++A+ +W Y L D R A MN VNPKYVLRN+L + AI
Sbjct: 500 RDLFID-----RDAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLAEVAIRR 554
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 555 AKEKDFSEVERLAQVLRRPFDEQPEHEAYAALPPDWA---GSLAVSCSS 600
>gi|419913917|ref|ZP_14432326.1| hypothetical protein ECKD1_12189 [Escherichia coli KD1]
gi|433198276|ref|ZP_20382188.1| hypothetical protein WGW_01820 [Escherichia coli KTE94]
gi|388387945|gb|EIL49543.1| hypothetical protein ECKD1_12189 [Escherichia coli KD1]
gi|431722942|gb|ELJ86904.1| hypothetical protein WGW_01820 [Escherichia coli KTE94]
Length = 478
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVDG--LNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|121957908|sp|Q39FG3.2|Y5209_BURS3 RecName: Full=UPF0061 protein Bcep18194_A5209
Length = 522
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 225/533 (42%), Positives = 298/533 (55%), Gaps = 65/533 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S VA L+L P +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSGEVAQLLDLPPSIAAQPGFAELFAGNPTRDWPANAMPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE R+ELQLKG+G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGERTGTDGRRYELQLKGSGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I + + + Y A
Sbjct: 207 EHFFSNDRPDL--LRQLADHVIDRFYPECRRAD---------------------DPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTS 303
Query: 428 GRRYCFANQPDIGLWN---IAQFSTTLAAAKL-IDD---------KEANYVMERYGTKFM 474
G RY + QP I WN +AQ L + IDD ++A V+ ++ +F
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIDDDDARAERAVEDAQAVLAKFPERFG 362
Query: 475 DEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDEL 530
+ M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 363 PALERAMRAKLGLELERESDAELANKLLETMHASHADFTLTFRRLAQISKHDASRD---- 418
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
P++ + +D ++A+ +W Y L D R A MN VNPKYVLRN+L +
Sbjct: 419 -APVRDLFID-----RDAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLAEV 472
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 473 AIRRAKEKDFSEVERLAQILRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|423704828|ref|ZP_17679251.1| UPF0061 protein ydiU [Escherichia coli H730]
gi|433047983|ref|ZP_20235353.1| hypothetical protein WII_01924 [Escherichia coli KTE120]
gi|385705471|gb|EIG42536.1| UPF0061 protein ydiU [Escherichia coli H730]
gi|431566366|gb|ELI39402.1| hypothetical protein WII_01924 [Escherichia coli KTE120]
Length = 478
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y P W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPLDWGKRLEV---SCSS 478
>gi|289825931|ref|ZP_06545090.1| hypothetical protein Salmonellentericaenterica_11140 [Salmonella
enterica subsp. enterica serovar Typhi str. E98-3139]
Length = 479
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 213/521 (40%), Positives = 295/521 (56%), Gaps = 54/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGIWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+I E L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TI-ESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 180
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 181 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDV---------------------AEKYA 217
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 218 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 277
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 278 HQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 334
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 335 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 386
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 387 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAIVLRNWLAQRAIDAAEQGDMAE 443
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 444 LHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 479
>gi|191171729|ref|ZP_03033276.1| conserved hypothetical protein [Escherichia coli F11]
gi|300987708|ref|ZP_07178320.1| SelO family protein [Escherichia coli MS 200-1]
gi|422377237|ref|ZP_16457480.1| SelO family protein [Escherichia coli MS 60-1]
gi|432471009|ref|ZP_19713056.1| hypothetical protein A15M_01890 [Escherichia coli KTE206]
gi|432713420|ref|ZP_19948461.1| hypothetical protein WCI_01785 [Escherichia coli KTE8]
gi|433077790|ref|ZP_20264341.1| hypothetical protein WIU_01661 [Escherichia coli KTE131]
gi|190908059|gb|EDV67651.1| conserved hypothetical protein [Escherichia coli F11]
gi|300306062|gb|EFJ60582.1| SelO family protein [Escherichia coli MS 200-1]
gi|324011469|gb|EGB80688.1| SelO family protein [Escherichia coli MS 60-1]
gi|430998227|gb|ELD14468.1| hypothetical protein A15M_01890 [Escherichia coli KTE206]
gi|431257223|gb|ELF50147.1| hypothetical protein WCI_01785 [Escherichia coli KTE8]
gi|431597461|gb|ELI67367.1| hypothetical protein WIU_01661 [Escherichia coli KTE131]
Length = 478
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432636928|ref|ZP_19872804.1| hypothetical protein A1UY_02283 [Escherichia coli KTE81]
gi|431171917|gb|ELE72068.1| hypothetical protein A1UY_02283 [Escherichia coli KTE81]
Length = 478
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 218/521 (41%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE-------TAEPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRRES--EKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGERMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD E + LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSECQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|417707618|ref|ZP_12356663.1| hypothetical protein SFVA6_2427 [Shigella flexneri VA-6]
gi|420331066|ref|ZP_14832741.1| hypothetical protein SFK1770_2282 [Shigella flexneri K-1770]
gi|333003782|gb|EGK23318.1| hypothetical protein SFVA6_2427 [Shigella flexneri VA-6]
gi|391254557|gb|EIQ13718.1| hypothetical protein SFK1770_2282 [Shigella flexneri K-1770]
Length = 467
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 214/506 (42%), Positives = 289/506 (57%), Gaps = 52/506 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPP 628
+ RL + + P+ ++ + Y PP
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPP 466
>gi|403353926|gb|EJY76508.1| Selenoprotein O [Oxytricha trifallax]
Length = 624
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 233/630 (36%), Positives = 328/630 (52%), Gaps = 123/630 (19%)
Query: 107 WDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFE 166
++H + E PG+ R+V Y+KV+P+ ++NP +V+ S + L+L +
Sbjct: 25 FNHFEIDENPGNK-----IRQVPGYVYSKVTPTP-LKNPCIVSLSPKCLELLDLKYDDIM 78
Query: 167 RPD-----FPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERW 221
+ D + FSG L G++P + Y GHQFG++AGQLGDGRAITLG+I N K E W
Sbjct: 79 QNDKFKKLYAELFSGNKLLQGSIPISHNYCGHQFGVFAGQLGDGRAITLGDIRNNKQETW 138
Query: 222 ELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMF 281
ELQLKGAG+TPYSR ADG AVLRSSIRE+LCSEAM FLG+PT+RA L+ + V RD
Sbjct: 139 ELQLKGAGQTPYSRHADGRAVLRSSIREYLCSEAMFFLGVPTSRAASLIVSDTKVQRDPL 198
Query: 282 YDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQEDLDIVRTLADYAIR 330
Y GN E A+V R+A +F RFGS++I S G ++ +++ + ++ +
Sbjct: 199 YSGNVINEKCAVVMRLAPTFFRFGSFEIFKEKDKYSGSKGPSHGMQE-EMMPQMLEFLFK 257
Query: 331 HHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHG 390
+++ I G+++ ++ A+ E+ RT LVA WQ VG+ HG
Sbjct: 258 NYYPEI-----------YYGEQN------LQDQTRAYFHEITRRTVDLVALWQTVGYVHG 300
Query: 391 VLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTT 450
VLNTDNMS LGLTIDYGP+GF++ F+P F PN +D G RY + NQP I WN+ + +
Sbjct: 301 VLNTDNMSALGLTIDYGPYGFMEHFNPKFIPNYSDKEG-RYSYENQPSICKWNLGKLAEA 359
Query: 451 LAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLG-----------LPKYNKQIISKL 498
L+ +D++E+ Y+ E Y + + IM+KKLG + Q I +
Sbjct: 360 LSP--FLDEEESKQYLEENYDKLYSARFLEIMSKKLGFLIEGQNEKVEIVDQEYQCIQSI 417
Query: 499 LNNMAVDKVDYTNFFRALSNVKADPSIPED-----ELLV--------------------- 532
M D+TN FR L+ V + + E ELLV
Sbjct: 418 FTAMEQTMGDFTNTFRILALVSREIELKETDQKAIELLVKHSAPVEHVIALNKPKYSAAA 477
Query: 533 --PLKAVL-----------LDIGKERKE------------------------AWISWVLS 555
+K++L LD + +KE W WV S
Sbjct: 478 LEKIKSILETNPNVLHMFGLDPEEAKKEIEKIENSKSQGTLTQDQKSVKDREVWTKWVQS 537
Query: 556 YIQEL--LSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERP 613
Y Q L + I+DE RK MN VNPK++LRNYL + AI AE DF +V LLK+ P
Sbjct: 538 YKQSLGQMDKSITDEIRKQSMNKVNPKFILRNYLMEEAIRKAEDEDFSKVDELLKMCYDP 597
Query: 614 YDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
Y+E+ E + PP WA +C +SCSS
Sbjct: 598 YNEENISEASTQPPPQWA--QDLC-VSCSS 624
>gi|218699726|ref|YP_002407355.1| hypothetical protein ECIAI39_1347 [Escherichia coli IAI39]
gi|386624330|ref|YP_006144058.1| hypothetical protein CE10_1986 [Escherichia coli O7:K1 str. CE10]
gi|226725727|sp|B7NTS5.1|YDIU_ECO7I RecName: Full=UPF0061 protein YdiU
gi|218369712|emb|CAR17481.1| conserved hypothetical protein [Escherichia coli IAI39]
gi|349738068|gb|AEQ12774.1| conserved protein, UPF0061 family [Escherichia coli O7:K1 str.
CE10]
Length = 478
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 216/515 (41%), Positives = 291/515 (56%), Gaps = 55/515 (10%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
YT +SP+ + +L+ + +A++L + F+ + + G T L G P AQ Y
Sbjct: 16 TYTALSPTP-LNKARLIWHNAELANTLSIPSSLFK--NGAGVWGGETLLPGMSPLAQVYS 72
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS+IRE L
Sbjct: 73 GHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESL 132
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG ++
Sbjct: 133 ASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFGHFEHFY 185
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
R + + VR LAD+AIRH++ H+ + DED KY W +V
Sbjct: 186 YR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYRLWFSDV 222
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D G RY
Sbjct: 223 VARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSDHQG-RY 281
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---P 488
F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KLG
Sbjct: 282 SFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKLGFMTEQ 339
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
K + ++++L + MA ++ DYT FR LS + + PL+ +D + A
Sbjct: 340 KEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-----RAA 388
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E+ RL +
Sbjct: 389 FDDWFARYRRRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTELHRLHE 448
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+ ++ + Y PP W R V SCSS
Sbjct: 449 ALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|121608765|ref|YP_996572.1| hypothetical protein Veis_1800 [Verminephrobacter eiseniae EF01-2]
gi|121553405|gb|ABM57554.1| protein of unknown function UPF0061 [Verminephrobacter eiseniae
EF01-2]
Length = 476
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 220/515 (42%), Positives = 288/515 (55%), Gaps = 57/515 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ PS + V S +VA L LD F+G PLAGA P A YGG
Sbjct: 15 FTELRPS-PLPAAHWVGRSSAVARLLGLDAAWLHSDAALQAFTGNGPLAGARPLASVYGG 73
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI LGE + WE+QLKGAG+TPYSR DG AVLRSSIREFLC
Sbjct: 74 HQFGVWAGQLGDGRAIMLGE----TAAGWEIQLKGAGRTPYSRMGDGRAVLRSSIREFLC 129
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRALC+ + V R+ + E A+V RVA SF+RFG ++ +
Sbjct: 130 SEAMHGLGIPTTRALCITGSPAPVRRE-------ETETAAVVTRVAPSFVRFGHFEHFCA 182
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
Q ++ LADY I ++ +N YAA V+
Sbjct: 183 --QRQTPQLQALADYVIARYYPQCRAG--------------------AANPYAALLQAVS 220
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPF FLDAF P N +D G RY
Sbjct: 221 ERTARLMAQWQAVGFCHGVMNTDNMSILGLTMDYGPFQFLDAFIPEHRCNHSDTQG-RYA 279
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPKY- 490
+ QPD+ WN+ + L LI +++ A + Y F E+ A M KLGL +
Sbjct: 280 YQRQPDVAYWNLLCLAQAL--LPLIGERDGALAALASYPGVFSAEFMAGMRAKLGLLQAR 337
Query: 491 --NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ +I +L +A +VDYT F+R LS P++A+ +R +A
Sbjct: 338 DGDAALIDGVLMLLARHRVDYTIFWRRLSQAVGCGDFE------PVRALF----AQRADA 387
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
W+L + + ++ + + +M NPK+VLRN+L + AI AA+ GDFG + LL+
Sbjct: 388 -ERWLLLFSEH--TTHMDHAQMAGMMLKTNPKFVLRNHLGEQAIRAAQQGDFGAIETLLR 444
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L+ERP+DE PG + YA PP WA +SCSS
Sbjct: 445 LLERPFDEHPGHDAYAAFPPDWA---ATIAISCSS 476
>gi|215486881|ref|YP_002329312.1| hypothetical protein E2348C_1791 [Escherichia coli O127:H6 str.
E2348/69]
gi|312966860|ref|ZP_07781078.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|417755706|ref|ZP_12403790.1| hypothetical protein ECDEC2B_2023 [Escherichia coli DEC2B]
gi|418997092|ref|ZP_13544692.1| hypothetical protein ECDEC1A_1808 [Escherichia coli DEC1A]
gi|419007617|ref|ZP_13555060.1| hypothetical protein ECDEC1C_1923 [Escherichia coli DEC1C]
gi|419018302|ref|ZP_13565616.1| hypothetical protein ECDEC1E_2004 [Escherichia coli DEC1E]
gi|419028906|ref|ZP_13576080.1| hypothetical protein ECDEC2C_1943 [Escherichia coli DEC2C]
gi|419034501|ref|ZP_13581592.1| hypothetical protein ECDEC2D_1903 [Escherichia coli DEC2D]
gi|419039603|ref|ZP_13586645.1| hypothetical protein ECDEC2E_1916 [Escherichia coli DEC2E]
gi|254814079|sp|B7US45.1|YDIU_ECO27 RecName: Full=UPF0061 protein YdiU
gi|215264953|emb|CAS09339.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
gi|312288324|gb|EFR16226.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|377845709|gb|EHU10731.1| hypothetical protein ECDEC1A_1808 [Escherichia coli DEC1A]
gi|377847434|gb|EHU12435.1| hypothetical protein ECDEC1C_1923 [Escherichia coli DEC1C]
gi|377863244|gb|EHU28050.1| hypothetical protein ECDEC1E_2004 [Escherichia coli DEC1E]
gi|377875957|gb|EHU40565.1| hypothetical protein ECDEC2B_2023 [Escherichia coli DEC2B]
gi|377881113|gb|EHU45677.1| hypothetical protein ECDEC2C_1943 [Escherichia coli DEC2C]
gi|377881571|gb|EHU46128.1| hypothetical protein ECDEC2D_1903 [Escherichia coli DEC2D]
gi|377894433|gb|EHU58854.1| hypothetical protein ECDEC2E_1916 [Escherichia coli DEC2E]
Length = 478
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|302845399|ref|XP_002954238.1| hypothetical protein VOLCADRAFT_106324 [Volvox carteri f.
nagariensis]
gi|300260443|gb|EFJ44662.1| hypothetical protein VOLCADRAFT_106324 [Volvox carteri f.
nagariensis]
Length = 672
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 227/610 (37%), Positives = 318/610 (52%), Gaps = 118/610 (19%)
Query: 100 KALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
+ LE LN+D+ +R LP DP P +VA E++A L+
Sbjct: 17 RKLEHLNFDNLTLRALPLDPIKG---------------------GPLVVASPEALA-LLD 54
Query: 160 LDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE 219
+DP E +RPDF +F G L GA A CY GHQFG ++GQLGDG A+ LGE++N + E
Sbjct: 55 VDPAEIDRPDFAEYFCGNKLLPGAEAAAHCYCGHQFGYFSGQLGDGAAMYLGEVVNSRGE 114
Query: 220 RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRD 279
RWELQ KGAGKTPYSR ADG VLRSS+REFLCSEAM+ LG+PTTRA VT+ V RD
Sbjct: 115 RWELQFKGAGKTPYSRQADGRKVLRSSLREFLCSEAMYHLGVPTTRAGTCVTSDTRVVRD 174
Query: 280 MFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH-----------ASRGQEDLDIVRTLADYA 328
+FYDGN E I+ R+A +FLRFGS++I +S GQE + ++ TL +
Sbjct: 175 VFYDGNAILEKATIITRIAPTFLRFGSFEIFKPVDAFTGRRGSSAGQE-VAMLPTLLHHT 233
Query: 329 IRHHFRHIENMNKSESLSFSTG-------------DEDHSVVDLTSNKYAAWAVEVAERT 375
IR +F I ++ +++S G + V Y W +EV RT
Sbjct: 234 IRTYFPDIWASHQGDAISAGVGVASDGSGGAPWPPEGGLEVEARLQAMYLDWLIEVTRRT 293
Query: 376 ASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFAN 435
ASLVA WQ VG+ HGVLNTDNMS++G+T+DYGPFGFLD +DP N +D G RY + +
Sbjct: 294 ASLVAAWQCVGWCHGVLNTDNMSVVGVTLDYGPFGFLDRYDPDHICNGSDDSG-RYDYKS 352
Query: 436 QPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL--PK---- 489
QPDI WN + + + L + + V E + + Y +M +KLGL P+
Sbjct: 353 QPDICRWNCEKLAEAIRTV-LPEARGKRAVAETFDPVYRRTYLGLMRRKLGLATPREGLE 411
Query: 490 --------------YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE-------- 527
++ ++S+LL M D+TN FR + +A+PS
Sbjct: 412 GEMADVGDIDADAGEDEMLVSELLTVMEETGADFTNTFRQ-AQGRAEPSSTAVVEAGGVL 470
Query: 528 DELLVPLKAVLLDIGKE------------------------------------RKEAWIS 551
D +L + A+L E +E W +
Sbjct: 471 DYILTQMLAMLAGKNPELLHQMGLTPQMLNAEMARLKRSEQLAKQNDEDKRLRDRERWAA 530
Query: 552 WVLSY---IQELLSSGISDEE-RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
W+ SY +Q +++G D R A+MN+ NP+++LRN++ Q AI+ AE GDF EV R+
Sbjct: 531 WLASYGARLQRAMAAGRLDAGMRPAVMNATNPRFILRNWIAQQAIEKAEKGDFSEVTRVY 590
Query: 608 KLMERPYDEQ 617
L+ P+ ++
Sbjct: 591 ALLRNPFSDE 600
>gi|331663186|ref|ZP_08364096.1| putative cytoplasmic protein [Escherichia coli TA143]
gi|331058985|gb|EGI30962.1| putative cytoplasmic protein [Escherichia coli TA143]
Length = 478
Score = 356 bits (913), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 294/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSESPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFSDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + +A ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLLARERSDYTRTFRMLSLTEQYSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|419002103|ref|ZP_13549640.1| hypothetical protein ECDEC1B_2001 [Escherichia coli DEC1B]
gi|377850034|gb|EHU15002.1| hypothetical protein ECDEC1B_2001 [Escherichia coli DEC1B]
Length = 478
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFRLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|423315675|ref|ZP_17293580.1| hypothetical protein HMPREF9699_00151 [Bergeyella zoohelcum ATCC
43767]
gi|405585779|gb|EKB59582.1| hypothetical protein HMPREF9699_00151 [Bergeyella zoohelcum ATCC
43767]
Length = 510
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 210/542 (38%), Positives = 291/542 (53%), Gaps = 57/542 (10%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFF 174
PGD + R+ + Y V+P +NP L+ ++ ++ + L E+ D P
Sbjct: 13 FPGDTSLNPYQRQTPNVLYNLVTPEV-FKNPTLLIFNTKLSQEIGLG--EYSEQDLPFLV 69
Query: 175 SGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P PY+ Y GHQFG WAGQLGDGRAI GEI N K + ELQ KGAG TPYS
Sbjct: 70 GNNLP-QNIRPYSTAYAGHQFGNWAGQLGDGRAIFAGEIQNKKGKTHELQWKGAGATPYS 128
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R ADG AV RSS+RE+L SEAM+ LGIPT RAL L TG+ V RD+ Y+GNP+EE GA+V
Sbjct: 129 RHADGRAVFRSSLREYLMSEAMYHLGIPTIRALSLCFTGEKVIRDILYNGNPQEENGAVV 188
Query: 295 CRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RV++SFLRFG ++ + Q D ++++ LAD+ I H +
Sbjct: 189 MRVSESFLRFGHFEF--ASLQSDKNLLKDLADFTITHFYPE------------------- 227
Query: 355 SVVDLTS-NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
VD+ S +KYA W ++ E+T L+ +W VGF HGV+NTDNMSI+G TIDYGPFG L+
Sbjct: 228 --VDIHSPDKYALWFEKITEKTLHLIIEWLRVGFVHGVMNTDNMSIIGETIDYGPFGMLE 285
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTK 472
++ +FTPNTTDLPGRRY F Q I WN+ Q + L LI+D + ++ +G
Sbjct: 286 EYNLNFTPNTTDLPGRRYAFGKQGQIAQWNLWQLANALYT--LINDADFLQNTLDNFGKN 343
Query: 473 FMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPS----- 524
F ++ ++ KK GL K ++ M +K+DYT FF L + +
Sbjct: 344 FWKKHDEMLAKKFGLDKVLPSDEDFFVHWQKLMTSEKLDYTLFFTELERARTHHTPQWAN 403
Query: 525 ---IPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+P +E + LK + + Y+ L + EE M + NPK+
Sbjct: 404 VSYLPNEE--INLKKI------------NDFYTQYLIRLEQNNCPKEESIQWMKTYNPKF 449
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
+LRNYL I+ E GD + L+ +E PY+ + + R P + G MLSC
Sbjct: 450 ILRNYLLYDCIEKVEAGDTEMLYLLIHALENPYETKYEHFQKKR-PTQYDDVSGCSMLSC 508
Query: 642 SS 643
SS
Sbjct: 509 SS 510
>gi|330825807|ref|YP_004389110.1| hypothetical protein Alide2_3253 [Alicycliphilus denitrificans
K601]
gi|329311179|gb|AEB85594.1| UPF0061 protein ydiU [Alicycliphilus denitrificans K601]
Length = 495
Score = 355 bits (912), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 227/518 (43%), Positives = 295/518 (56%), Gaps = 56/518 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T++ P+ + P V S+ VA L L +R D F+G G+ P A Y
Sbjct: 29 AFFTELRPT-PLPAPHWVGASDDVAALLGLPEGWQQRDDALQSFTGNALPPGSRPLASVY 87
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRAI LGE+ ELQLKG G+TPYSR DG AVLRSSIREF
Sbjct: 88 SGHQFGVWAGQLGDGRAILLGEVETPAHGGQELQLKGCGRTPYSRMGDGRAVLRSSIREF 147
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAMH LGIPTTRALC+ + V R+ + E A+V RVA SF+RFG ++
Sbjct: 148 LCSEAMHALGIPTTRALCVTGSPAPVARE-------EIETAAVVTRVAPSFIRFGHFEHF 200
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
A+RGQ+ +R LADY I ++ + +N AA
Sbjct: 201 AARGQQ--AELRRLADYVIDRYYPECRD---------------------GANPCAALLRA 237
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V+ERTA+L+A+WQ VGF HGV+NTDNMSILGLTIDYGPF FLDAFDP N +D G R
Sbjct: 238 VSERTAALMARWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFDPGHICNHSDAQG-R 296
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y F QP + WN+ A LI + E A + Y F E+ M KLGL +
Sbjct: 297 YAFDRQPGVAWWNL--LCLAQAMLPLIGEVETARAALSTYEGVFAAEFLRRMRAKLGLQQ 354
Query: 490 ---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++ LL +A +VDYT F+R LS+ A P++ + D +
Sbjct: 355 PREGDGALVDALLRLLAAGRVDYTIFWRRLSHAVAAGDFE------PVRDLFAD-----R 403
Query: 547 EAWISWVLSYIQELLSSGISDEERKA-LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
A+ +W+LSY +ELL+ + D+ A LM + NP +VLRN+L + AI AA+LGDF E++
Sbjct: 404 AAFDAWLLSY-EELLA--LEDQALVADLMLNTNPGFVLRNHLGEQAIRAAKLGDFSELQT 460
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L KL+ RP+DE PG E +A PP WA +SCSS
Sbjct: 461 LQKLLARPFDEHPGHEAHAGFPPEWA---STISISCSS 495
>gi|307729673|ref|YP_003906897.1| hypothetical protein [Burkholderia sp. CCGE1003]
gi|307584208|gb|ADN57606.1| protein of unknown function UPF0061 [Burkholderia sp. CCGE1003]
Length = 518
Score = 355 bits (912), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 214/524 (40%), Positives = 289/524 (55%), Gaps = 64/524 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+ + P +V +S A L L+P + P+F FSG A+PYA Y GHQ
Sbjct: 41 PATPLSAPYVVGFSAQTAALLGLEPGLEKDPEFAELFSGNATREWPTEALPYASVYSGHQ 100
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+ LGE+ + +R+ELQLKGAG+TPYSR DG AVLRSSIREFLCSE
Sbjct: 101 FGVWAGQLGDGRALGLGEVEH-AGQRYELQLKGAGRTPYSRMGDGRAVLRSSIREFLCSE 159
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC++ + + V R+ E A+V RVA SF+RFG ++ S
Sbjct: 160 AMHHLGIPTTRALCVIGSDQPVRREEI-------ETAAVVTRVAPSFVRFGHFEHFYS-- 210
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ D +R LAD+ I + H + + Y A E
Sbjct: 211 NDRTDALRALADHVIERFYPHCREAD---------------------DPYLALLNEAVVS 249
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA L+ +WQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 250 TADLLVEWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDAGYICNHSDSQG-RYAYR 308
Query: 435 NQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMT 482
QP I WN+ ++ ++ A K I+D A V+ + +F + M
Sbjct: 309 MQPQIAYWNLFCLAQGLLPLLGERYEESVRADKSIED--AQRVLAGFKDRFGPGLERRMM 366
Query: 483 KKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL 539
KLGL + + + ++L + M ++ D+T FR L+ V + + P++ + L
Sbjct: 367 AKLGLAAEREGDAALANRLFDVMHANRADFTLTFRNLARVSRHDASGD----APVRDLFL 422
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
D + A+ +W Y L SD ER MN VNPK+VLRN+L ++AI A+ D
Sbjct: 423 D-----RAAFDAWANDYRARLSHETRSDAERAIAMNRVNPKFVLRNHLAETAIRRAKEKD 477
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
F E+ RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 478 FSELERLAAVLRRPFDEQPEHEAYAGLPPDWA---SSLEVSCSS 518
>gi|357631787|gb|EHJ79256.1| hypothetical protein KGM_15405 [Danaus plexippus]
Length = 538
Score = 355 bits (911), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 209/544 (38%), Positives = 304/544 (55%), Gaps = 46/544 (8%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSE-SVADSLELDPKEFERPDFPLF 173
LP D D + V + Y++V+P +N +LV +SE ++ + L++ P+ +F F
Sbjct: 26 LPIDENHDQVKNNVKNVIYSEVTPHPLEKNLRLVCFSEDALTNILDMSPEIVNTGEFLEF 85
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
G G++P A YGGHQ+G+W GQLGDGRA +GE +N ERW++QLKG+G TPY
Sbjct: 86 VGGRRLPCGSLPVAHRYGGHQYGLWVGQLGDGRAHLIGEYVNRLCERWQVQLKGSGLTPY 145
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG VLR++IRE + SEAM LG+PTTR +V + V RD++Y GNP E AI
Sbjct: 146 SRLYDGRCVLRAAIREMVASEAMFHLGVPTTRTAAVVASDDTVVRDLYYSGNPHREKTAI 205
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ R++QS+ RFGS +I A G+ L I++ L D+ I+ HF I DE
Sbjct: 206 LLRLSQSWFRFGSLEILAKGGE--LAILKQLTDFIIKEHFPDIH-----------LSDE- 251
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
N++ E+A R+ LVA+WQG+GFTHG+LNTDNMSILG+T+DYGPFGF+D
Sbjct: 252 --------NRFIRLFSEMAHRSLDLVAKWQGLGFTHGLLNTDNMSILGVTMDYGPFGFVD 303
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEAN--YVMERYGT 471
++D F N++D G RY + QPDI +WNI Q + L L ++ + ++++ T
Sbjct: 304 SYDGGFVSNSSDGEG-RYSLSKQPDIVVWNIGQLANALKPL-LSSSQQVHMTHILKTLDT 361
Query: 472 KFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPED 528
++ K+GL K +++++ KLL+ M D+T FR LS ++ +
Sbjct: 362 YCKNKILETFLMKIGLKKERWGDEELVEKLLDMMQHTGADFTTTFRQLSELEPHEMVTGS 421
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-------LSSGISDEERKALMNSVNPKY 581
+L L +W W+ Y + L SS + ER M VNP Y
Sbjct: 422 KLEEKWSLKRLS----SHSSWGCWLDQYRERLDKESVDSSSSCVFSVERVRRMRLVNPAY 477
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK--YARLPPAWAYRPGVCML 639
V R +L Q AI AE DF ++R LL++++ PY+ QP E ++ PP WAY +
Sbjct: 478 VPRTWLIQEAIQDAERDDFTKLRFLLQVIQNPYEVQPEAEARGFSNQPPQWAY---ALKI 534
Query: 640 SCSS 643
SCSS
Sbjct: 535 SCSS 538
>gi|392978693|ref|YP_006477281.1| hypothetical protein A3UG_09220 [Enterobacter cloacae subsp.
dissolvens SDM]
gi|392324626|gb|AFM59579.1| hypothetical protein A3UG_09220 [Enterobacter cloacae subsp.
dissolvens SDM]
Length = 480
Score = 355 bits (911), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 211/518 (40%), Positives = 289/518 (55%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ +++ +LV ++S+A+ L + P+ F+ D + G T LAG P AQ
Sbjct: 13 LPGFYTALKPTP-LQHSRLVWHNDSLAEDLAIPPEMFQPSDGAGVWGGETLLAGMQPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE E + LKGAG TPYSR DG AVLRS+IR
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQQLPGGETMDWHLKGAGLTPYSRMGDGRAVLRSTIR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+AQS LRFG ++
Sbjct: 132 ESLASEAMHALGIPTTRALSIVTSDTPVVRETV-------EKGAMLMRIAQSHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + + VR LADYAIR H+ +++ ++KY W
Sbjct: 185 HFYYR--REPEKVRQLADYAIRRHWPQLQD---------------------EADKYHLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ RTA+++A+WQ VGF HGV+NTDNMSILGLT DYGPFGFLD + P + N +D G
Sbjct: 222 RDIVARTATMIARWQTVGFAHGVMNTDNMSILGLTFDYGPFGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID N ++ Y + EY A+M KLGL
Sbjct: 282 -RYSFDNQPAVGLWNLQRLAQSL--SPFIDVDALNDALDGYQETLLREYGALMRNKLGLM 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + I++ L MA + DYT FR L + + PL+ +D
Sbjct: 339 TQEKGDNAILNGLFALMAREGSDYTRTFRMLGQTEQHSAAS------PLRDEFID----- 387
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++ + W +Y L + D R+ MN+ NP VLRN+L Q AI+ AE G++ E+ R
Sbjct: 388 RQGFDDWFATYRARLQQEQVDDAARQTQMNAANPAMVLRNWLAQRAIEQAERGEYDELHR 447
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 448 LHVALRTPFADRD--DDYVSRPPDWGKRLEV---SCSS 480
>gi|186475791|ref|YP_001857261.1| hypothetical protein Bphy_1026 [Burkholderia phymatum STM815]
gi|184192250|gb|ACC70215.1| protein of unknown function UPF0061 [Burkholderia phymatum STM815]
Length = 505
Score = 355 bits (911), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 218/522 (41%), Positives = 287/522 (54%), Gaps = 60/522 (11%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P +V ++ VA L D P F FFSG T + A+PYA Y GHQ
Sbjct: 28 PAAPLPAPYVVGFAPDVASMLGFDASLASAPGFSEFFSGNTTRDWPSTALPYASVYSGHQ 87
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+TLGE + R+ELQLKG G+TPYSR DG AVLRSSIRE+LCSE
Sbjct: 88 FGVWAGQLGDGRALTLGEAEH-NGRRFELQLKGGGRTPYSRMGDGRAVLRSSIREYLCSE 146
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC++ + + V R+ E A+V RV+ SF+RFG ++ +
Sbjct: 147 AMHHLGIPTTRALCVIGSDQPVRREEI-------ETAAVVTRVSPSFVRFGHFEHFYA-- 197
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ +D +R+LAD+ I D + + Y A E
Sbjct: 198 NDRVDALRSLADHVI---------------------DRFYPACRDADDPYLALLNEAVLS 236
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA L+ QWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 237 TADLIVQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDANHICNHSDSQG-RYAYR 295
Query: 435 NQPDIGLWN---IAQ-----FSTTLAAAKLIDD--KEANYVMERYGTKFMDEYQAIMTKK 484
QP I WN +AQ F A+ + ++A V+E + +F +A M K
Sbjct: 296 MQPQIAYWNLFCLAQGLLPLFGERYGEAERSERAVQDAQRVLEGFKARFAPALEARMRAK 355
Query: 485 LGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + + Q+ +KL M ++ D+T FR LS + + + P + + LD
Sbjct: 356 LGLDTEREGDDQLANKLFEIMHANRADFTLTFRNLSKLSRHDANGD----APARDLFLD- 410
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+ A+ +W Y L D ER MN VNPKYVLRN+L ++AI A+ DF
Sbjct: 411 ----RAAFDAWATEYRARLSHETRDDAERAEAMNRVNPKYVLRNHLAENAIRRAKEKDFS 466
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
EV RL ++ P+DEQP E YA LPP WA +SCSS
Sbjct: 467 EVERLAAVLRHPFDEQPEHEAYAGLPPDWA---SSLEVSCSS 505
>gi|170701225|ref|ZP_02892194.1| protein of unknown function UPF0061 [Burkholderia ambifaria
IOP40-10]
gi|170133854|gb|EDT02213.1| protein of unknown function UPF0061 [Burkholderia ambifaria
IOP40-10]
Length = 522
Score = 355 bits (911), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 223/535 (41%), Positives = 296/535 (55%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S+ VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLPAPYVVGFSDEVAQLLGLPASFATQPGFAELFAGNPTRDWPANALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ +R+ELQ+KG G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGQRYELQIKGGGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNDRPDL--LRQLADHVI---------------------DRFYPACREADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFVDAFDANHICNHSDTS 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ ++ +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 361 FGPALEHAMRAKLGLELERENDAELANKLLETMHASHADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D ++A+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----RDAFDAWANLYRTRLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 EVAIRRAKEKDFSEVERLAQVLRRPFDEQPEHETYAALPPDWA---GSLEVSCSS 522
>gi|432465697|ref|ZP_19707788.1| hypothetical protein A15K_01635 [Escherichia coli KTE205]
gi|432583799|ref|ZP_19820200.1| hypothetical protein A1SM_03021 [Escherichia coli KTE57]
gi|433072818|ref|ZP_20259484.1| hypothetical protein WIS_01774 [Escherichia coli KTE129]
gi|433120248|ref|ZP_20305927.1| hypothetical protein WKC_01672 [Escherichia coli KTE157]
gi|433183267|ref|ZP_20367533.1| hypothetical protein WGO_01706 [Escherichia coli KTE85]
gi|430994178|gb|ELD10509.1| hypothetical protein A15K_01635 [Escherichia coli KTE205]
gi|431116969|gb|ELE20241.1| hypothetical protein A1SM_03021 [Escherichia coli KTE57]
gi|431589381|gb|ELI60596.1| hypothetical protein WIS_01774 [Escherichia coli KTE129]
gi|431644006|gb|ELJ11693.1| hypothetical protein WKC_01672 [Escherichia coli KTE157]
gi|431708157|gb|ELJ72681.1| hypothetical protein WGO_01706 [Escherichia coli KTE85]
Length = 478
Score = 355 bits (910), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 215/515 (41%), Positives = 291/515 (56%), Gaps = 55/515 (10%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
YT +SP+ + N +L+ + +A++L + F+ + + G T L G P AQ Y
Sbjct: 16 TYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSPLAQVYS 72
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS+IRE L
Sbjct: 73 GHQFGIWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESL 132
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG ++
Sbjct: 133 ASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFGHFEHFY 185
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
R + + VR LAD+AIRH++ H++ DE+ +KY W +V
Sbjct: 186 YRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYRLWFTDV 222
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D G RY
Sbjct: 223 VARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSDHQG-RY 281
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---P 488
F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KLG
Sbjct: 282 SFDNQPAVALWNLQRLAQTLSPFVAVDG--VNEALDSYQQVLLTHYGQRMRQKLGFMTEQ 339
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
K + ++++L + MA ++ DYT FR LS + + PL+ +D + A
Sbjct: 340 KEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-----RAA 388
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE D E+ RL +
Sbjct: 389 FDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKDDMTELHRLHE 448
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+ ++ + Y PP W R V SCSS
Sbjct: 449 ALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|149017530|gb|EDL76534.1| hypothetical LOC315216 (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 663
Score = 355 bits (910), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 244/631 (38%), Positives = 324/631 (51%), Gaps = 120/631 (19%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G + S PR V AC+++ P A + P+LVA SE
Sbjct: 46 LARLRFDNRALRALPVETPPPGPEDSLSTPRPVPGACFSRARP-APLRQPRLVALSEPAL 104
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L+ E + LFFSG L G P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 105 ALLGLEVSEEAEVEAEAALFFSGNALLPGTEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 164
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG T +SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+
Sbjct: 165 CTAAGERWELQLKGAGPTAFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSE 224
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQEDLDIVR 322
V RD+FYDGNPK E +V R+A +F+RFGS++I S G+ D+ +
Sbjct: 225 STVMRDVFYDGNPKYEKCTVVLRIAPTFIRFGSFEIFKPPDELTGRAGPSVGRNDIRV-- 282
Query: 323 TLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQW 382
+ DY I + I+ + T D D+ + AA+ EV RTA +VA+W
Sbjct: 283 QMLDYVISSFYPEIQAAH--------TCDTDNI------QRNAAFFREVTRRTARMVAEW 328
Query: 383 QGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLW 442
Q VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D GR Y ++ QP + W
Sbjct: 329 QCVGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHVCNASDNAGR-YTYSKQPQVCRW 387
Query: 443 NIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISK-- 497
N+ + + L + EA + E + T+F Y M KKLGL K ++ +++K
Sbjct: 388 NLQKLAEALEPELPLVLAEA-ILKEEFDTEFQRHYLQKMRKKLGLVRVEKEDETLVAKLL 446
Query: 498 ---------------LLNNMAVDKVDYTNFFRALSNVKA-------------DP------ 523
+L++ + D F L++ A DP
Sbjct: 447 ETMHQTGADFTNTFCVLSSFPAEPSDTAEFLTQLTSQCASLEELKLAFRPQMDPRQLSMM 506
Query: 524 -----SIPEDELLVPLKAVLL------------------DIGKERKEAWISWVLSYIQEL 560
S P+ L+ +A + ++ + ++ W +W+ Y + L
Sbjct: 507 LMLAQSNPQLFALIGTQANVTKELERVEHQSRLEQLSPSELQSKNRDHWETWLQEYRERL 566
Query: 561 --LSSGISD-----EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERP 613
G+ D ER +M++ NPKYVLRNY+ Q AI+AAE GDF EVRR+LKL+E P
Sbjct: 567 DKEKEGVGDIAAWQAERVRIMHANNPKYVLRNYIAQKAIEAAENGDFSEVRRVLKLLESP 626
Query: 614 Y---DEQPGMEKYARL----------PPAWA 631
Y +E G E AR PP WA
Sbjct: 627 YHSEEEATGPEAVARTTDEQSSYSSRPPLWA 657
>gi|432894530|ref|ZP_20106351.1| hypothetical protein A31K_03493 [Escherichia coli KTE165]
gi|431422443|gb|ELH04635.1| hypothetical protein A31K_03493 [Escherichia coli KTE165]
Length = 478
Score = 355 bits (910), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 292/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT + P+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALFPTP-LNNARLIWHNSELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWPHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE D E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDNERQQLMQSVNPALVLRNWLAQRAIEAAEKDDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|432406723|ref|ZP_19649432.1| hypothetical protein WEO_01907 [Escherichia coli KTE28]
gi|430929482|gb|ELC49991.1| hypothetical protein WEO_01907 [Escherichia coli KTE28]
Length = 478
Score = 355 bits (910), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 217/521 (41%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWPHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL + + N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTL--SPFVAVYGLNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|148283739|ref|NP_001078954.1| selenoprotein O [Rattus norvegicus]
gi|183986296|gb|AAI66588.1| Selenoprotein O [Rattus norvegicus]
Length = 666
Score = 354 bits (909), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 244/631 (38%), Positives = 321/631 (50%), Gaps = 120/631 (19%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G + S PR V AC+++ P A + P+LVA SE
Sbjct: 46 LARLRFDNRALRALPVETPPPGPEDSLSTPRPVPGACFSRARP-APLRQPRLVALSEPAL 104
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L+ E + LFFSG L G P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 105 ALLGLEVSEEAEVEAEAALFFSGNALLPGTEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 164
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG T +SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+
Sbjct: 165 CTAAGERWELQLKGAGPTAFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSE 224
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQEDLDIVR 322
V RD+FYDGNPK E +V R+A +F+RFGS++I S G+ D+ +
Sbjct: 225 STVMRDVFYDGNPKYEKCTVVLRIAPTFIRFGSFEIFKPPDELTGRAGPSVGRNDIRV-- 282
Query: 323 TLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQW 382
+ DY I + I+ + T D D+ + AA+ EV RTA +VA+W
Sbjct: 283 QMLDYVISSFYPEIQAAH--------TCDTDNI------QRNAAFFREVTRRTARMVAEW 328
Query: 383 QGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLW 442
Q VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D GR Y ++ QP + W
Sbjct: 329 QCVGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHVCNASDNAGR-YTYSKQPQVCRW 387
Query: 443 NIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISK-- 497
N+ + + L + EA + E + T+F Y M KKLGL K ++ +++K
Sbjct: 388 NLQKLAEALEPELPLVLAEA-ILKEEFDTEFQRHYLQKMRKKLGLVRVEKEDETLVAKLL 446
Query: 498 ---------------LLNNMAVDKVDYTNFFRALSNVKAD---------PSIPEDEL--- 530
+L++ + D F L++ A P + +L
Sbjct: 447 ETMHQTGADFTNTFCVLSSFPAEPSDTAEFLTQLTSQCASLEELKLAFRPQMDPRQLSMM 506
Query: 531 ---------LVPLKAVLLDIGKE---------------------RKEAWISWVLSYIQEL 560
L L ++ KE ++ W +W+ Y + L
Sbjct: 507 LMLAQSNPQLFALIGTQANVTKELERVEHQSRLEQLSPSELQSKNRDHWETWLQEYRERL 566
Query: 561 --LSSGISD-----EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERP 613
G+ D ER +M++ NPKYVLRNY+ Q AI+AAE GDF EVRR+LKL+E P
Sbjct: 567 DKEKEGVGDIAAWQAERVRIMHANNPKYVLRNYIAQKAIEAAENGDFSEVRRVLKLLESP 626
Query: 614 Y---DEQPGMEKYARL----------PPAWA 631
Y +E G E AR PP WA
Sbjct: 627 YHSEEEATGPEAVARTTDEQSSYSSRPPLWA 657
>gi|345874709|ref|ZP_08826509.1| SelO family protein [Neisseria weaveri LMG 5135]
gi|343970068|gb|EGV38266.1| SelO family protein [Neisseria weaveri LMG 5135]
Length = 492
Score = 354 bits (909), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 216/506 (42%), Positives = 280/506 (55%), Gaps = 53/506 (10%)
Query: 145 PQLVAWSESVADSLELDPKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLG 203
P VA + +A+ + L P E F+ D L+ +G+ P A Y GHQFG++ QLG
Sbjct: 33 PYWVAQNHVLAEEMGLRPSEIFDNADNLLYLAGSAKQYDPAPIASVYSGHQFGVYVRQLG 92
Query: 204 DGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPT 263
DGRA+ +G+ + RWE QLKGAGKTPYSRFADG AVLRSSIRE+LCSEAMH LGIPT
Sbjct: 93 DGRAVLIGDSVGSDGLRWEWQLKGAGKTPYSRFADGRAVLRSSIREYLCSEAMHGLGIPT 152
Query: 264 TRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRT 323
TRAL + + V R+ + E A+V R+A SF+RFG ++ GQ +
Sbjct: 153 TRALAITGSNDAVYRE-------EAETAAVVTRIAPSFIRFGHFEYMYHTGQH--HNLPV 203
Query: 324 LADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQ 383
LAD+ I HF N Y A+ V+ RTA LVA WQ
Sbjct: 204 LADFLIDRHFPECRE---------------------AENPYLAFFQTVSRRTAELVAAWQ 242
Query: 384 GVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWN 443
VGF HGVLNTDNMS LGLTIDYGPFGFLDA+D N +D G RY + QP + WN
Sbjct: 243 SVGFCHGVLNTDNMSALGLTIDYGPFGFLDAYDRRHVCNHSDTGG-RYAYNEQPYVVHWN 301
Query: 444 IAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLN 500
+++F++ L DD A +ER+ F Y M KLGL K ++I+ +
Sbjct: 302 LSRFASCLLPLVPQDDLVAE--LERFPDMFQTAYLQKMRAKLGLQTQEKGDDELIADMFT 359
Query: 501 NMAVDKVDYTNFFRALS---NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYI 557
+ KVD+T FFR LS NV +P +PE LL + EA+ +W+ Y
Sbjct: 360 ALQSRKVDFTLFFRYLSEVGNVHGEP-LPEK---------LLALFHGPTEAFTAWIGRYR 409
Query: 558 QELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQ 617
L + + ER MN+VNP YVLRNYL + AI A+ GDF E+ RL + M+ P+ E+
Sbjct: 410 GRLRAENSNPAERAERMNAVNPLYVLRNYLLEQAIQLAKSGDFREIERLHRCMQNPFVER 469
Query: 618 PGMEKYARLPPAWAYRPGVCMLSCSS 643
+A LPP WA G+C +SCSS
Sbjct: 470 KEFADFAELPPQWA--EGIC-VSCSS 492
>gi|334122274|ref|ZP_08496314.1| SelO family protein [Enterobacter hormaechei ATCC 49162]
gi|333392205|gb|EGK63310.1| SelO family protein [Enterobacter hormaechei ATCC 49162]
Length = 480
Score = 354 bits (909), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 210/518 (40%), Positives = 289/518 (55%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ ++N +L+ +E++ADSL + F+ + G T L G P AQ
Sbjct: 13 LPGFYTALKPTP-LQNARLIWHNEALADSLGIPATLFQPEKGAGVWGGETLLPGMKPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE + E + LKGAG TPYSR DG AVLRS++R
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQVLPNGETLDWHLKGAGLTPYSRMGDGRAVLRSTLR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPT+RAL +VT+ V R+ E GA++ RVA+S LRFG ++
Sbjct: 132 ESLASEAMHALGIPTSRALSIVTSDTPVARETM-------ERGAMLIRVAESHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + D VR LADYA+R H+ H++N ++Y W
Sbjct: 185 HFYYR--REPDKVRQLADYALRRHWPHLQN---------------------EPDRYVLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ RTAS++A+WQ VGF HGV+NTDNMS+LGLT DYGP+GFLD + P + N +D G
Sbjct: 222 RDIVARTASMIARWQAVGFAHGVMNTDNMSLLGLTFDYGPYGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID N ++ Y + EY +M KLGL
Sbjct: 282 -RYRFDNQPAVGLWNLQRLAQSL--SPFIDVDALNDALDSYQEVLLREYGVLMRNKLGLM 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + ++++ L MA + DYT FR LS + PL+ +D
Sbjct: 339 TQEKGDNELLNGLFAIMAREGSDYTRTFRMLSQTAQQSASS------PLRDEFID----- 387
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ W +Y L I D+ R+ M +VNP VLRN+L Q AI+ AE GD+ E+ R
Sbjct: 388 RQAFDDWFAAYRARLQQEQIDDDTRQTQMKAVNPAMVLRNWLAQRAIEQAEQGDYTELHR 447
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 448 LHIALRTPFADRE--DDYVSRPPDWGKRLEV---SCSS 480
>gi|26247957|ref|NP_753997.1| hypothetical protein c2102 [Escherichia coli CFT073]
gi|91210920|ref|YP_540906.1| hypothetical protein UTI89_C1899 [Escherichia coli UTI89]
gi|117623883|ref|YP_852796.1| hypothetical protein APECO1_781 [Escherichia coli APEC O1]
gi|218558576|ref|YP_002391489.1| hypothetical protein ECS88_1757 [Escherichia coli S88]
gi|227885872|ref|ZP_04003677.1| protein YdiU [Escherichia coli 83972]
gi|237705654|ref|ZP_04536135.1| ydiU [Escherichia sp. 3_2_53FAA]
gi|300994622|ref|ZP_07180946.1| SelO family protein [Escherichia coli MS 45-1]
gi|301050960|ref|ZP_07197807.1| SelO family protein [Escherichia coli MS 185-1]
gi|386599505|ref|YP_006101011.1| hypothetical protein ECOK1_1826 [Escherichia coli IHE3034]
gi|386604323|ref|YP_006110623.1| hypothetical protein UM146_08620 [Escherichia coli UM146]
gi|386629398|ref|YP_006149118.1| hypothetical protein i02_1924 [Escherichia coli str. 'clone D i2']
gi|386634318|ref|YP_006154037.1| hypothetical protein i14_1924 [Escherichia coli str. 'clone D i14']
gi|386639236|ref|YP_006106034.1| putative cytoplasmic protein YdiU [Escherichia coli ABU 83972]
gi|417084642|ref|ZP_11952281.1| hypothetical protein i01_02248 [Escherichia coli cloneA_i1]
gi|419946528|ref|ZP_14462925.1| hypothetical protein ECHM605_20698 [Escherichia coli HM605]
gi|422359784|ref|ZP_16440421.1| SelO family protein [Escherichia coli MS 110-3]
gi|422366809|ref|ZP_16447266.1| SelO family protein [Escherichia coli MS 153-1]
gi|422748938|ref|ZP_16802850.1| hypothetical protein ERKG_01165 [Escherichia coli H252]
gi|422755043|ref|ZP_16808868.1| hypothetical protein ERLG_02166 [Escherichia coli H263]
gi|422838368|ref|ZP_16886341.1| hypothetical protein ESPG_01027 [Escherichia coli H397]
gi|432358046|ref|ZP_19601275.1| hypothetical protein WCC_01996 [Escherichia coli KTE4]
gi|432362671|ref|ZP_19605842.1| hypothetical protein WCE_01691 [Escherichia coli KTE5]
gi|432411926|ref|ZP_19654592.1| hypothetical protein WG9_02405 [Escherichia coli KTE39]
gi|432436121|ref|ZP_19678514.1| hypothetical protein A13M_01829 [Escherichia coli KTE188]
gi|432441122|ref|ZP_19683463.1| hypothetical protein A13O_01943 [Escherichia coli KTE189]
gi|432446244|ref|ZP_19688543.1| hypothetical protein A13S_02280 [Escherichia coli KTE191]
gi|432456737|ref|ZP_19698924.1| hypothetical protein A15C_02523 [Escherichia coli KTE201]
gi|432495728|ref|ZP_19737527.1| hypothetical protein A173_02887 [Escherichia coli KTE214]
gi|432504437|ref|ZP_19746167.1| hypothetical protein A17E_01490 [Escherichia coli KTE220]
gi|432523813|ref|ZP_19760945.1| hypothetical protein A17Y_01925 [Escherichia coli KTE230]
gi|432568704|ref|ZP_19805222.1| hypothetical protein A1SE_02284 [Escherichia coli KTE53]
gi|432573743|ref|ZP_19810225.1| hypothetical protein A1SI_02437 [Escherichia coli KTE55]
gi|432587970|ref|ZP_19824326.1| hypothetical protein A1SO_02320 [Escherichia coli KTE58]
gi|432592879|ref|ZP_19829198.1| hypothetical protein A1SS_02298 [Escherichia coli KTE60]
gi|432597693|ref|ZP_19833969.1| hypothetical protein A1SW_02406 [Escherichia coli KTE62]
gi|432607534|ref|ZP_19843723.1| hypothetical protein A1U7_02532 [Escherichia coli KTE67]
gi|432651145|ref|ZP_19886902.1| hypothetical protein A1W7_02146 [Escherichia coli KTE87]
gi|432754454|ref|ZP_19989005.1| hypothetical protein WEA_01429 [Escherichia coli KTE22]
gi|432778584|ref|ZP_20012827.1| hypothetical protein A1SQ_02247 [Escherichia coli KTE59]
gi|432783589|ref|ZP_20017770.1| hypothetical protein A1SY_02428 [Escherichia coli KTE63]
gi|432787530|ref|ZP_20021662.1| hypothetical protein A1U3_01640 [Escherichia coli KTE65]
gi|432820966|ref|ZP_20054658.1| hypothetical protein A1Y5_02560 [Escherichia coli KTE118]
gi|432827110|ref|ZP_20060762.1| hypothetical protein A1YA_03825 [Escherichia coli KTE123]
gi|432978312|ref|ZP_20167134.1| hypothetical protein A15S_04227 [Escherichia coli KTE209]
gi|432995371|ref|ZP_20183982.1| hypothetical protein A17A_02454 [Escherichia coli KTE218]
gi|432999947|ref|ZP_20188477.1| hypothetical protein A17K_02281 [Escherichia coli KTE223]
gi|433005163|ref|ZP_20193593.1| hypothetical protein A17S_02730 [Escherichia coli KTE227]
gi|433007661|ref|ZP_20196079.1| hypothetical protein A17W_00361 [Escherichia coli KTE229]
gi|433013847|ref|ZP_20202209.1| hypothetical protein WI5_01672 [Escherichia coli KTE104]
gi|433023479|ref|ZP_20211480.1| hypothetical protein WI9_01645 [Escherichia coli KTE106]
gi|433058095|ref|ZP_20245154.1| hypothetical protein WIM_01864 [Escherichia coli KTE124]
gi|433087242|ref|ZP_20273626.1| hypothetical protein WIY_01690 [Escherichia coli KTE137]
gi|433115560|ref|ZP_20301364.1| hypothetical protein WKA_01749 [Escherichia coli KTE153]
gi|433125197|ref|ZP_20310772.1| hypothetical protein WKE_01693 [Escherichia coli KTE160]
gi|433139260|ref|ZP_20324531.1| hypothetical protein WKM_01541 [Escherichia coli KTE167]
gi|433149208|ref|ZP_20334244.1| hypothetical protein WKQ_01859 [Escherichia coli KTE174]
gi|433153781|ref|ZP_20338736.1| hypothetical protein WKS_01709 [Escherichia coli KTE176]
gi|433163491|ref|ZP_20348236.1| hypothetical protein WKW_01696 [Escherichia coli KTE179]
gi|433168612|ref|ZP_20353245.1| hypothetical protein WKY_01850 [Escherichia coli KTE180]
gi|433212513|ref|ZP_20396116.1| hypothetical protein WI3_01692 [Escherichia coli KTE99]
gi|433324134|ref|ZP_20401452.1| hypothetical protein B185_011564 [Escherichia coli J96]
gi|442604369|ref|ZP_21019214.1| Selenoprotein O and cysteine-containing homologs [Escherichia coli
Nissle 1917]
gi|33517034|sp|Q8FH30.1|YDIU_ECOL6 RecName: Full=UPF0061 protein YdiU
gi|121957928|sp|Q1RB89.1|YDIU_ECOUT RecName: Full=UPF0061 protein YdiU
gi|166227578|sp|A1ABP2.1|YDIU_ECOK1 RecName: Full=UPF0061 protein YdiU
gi|226723585|sp|B7MAR7.1|YDIU_ECO45 RecName: Full=UPF0061 protein YdiU
gi|26108360|gb|AAN80562.1|AE016761_137 Hypothetical protein ydiU [Escherichia coli CFT073]
gi|91072494|gb|ABE07375.1| hypothetical protein YdiU [Escherichia coli UTI89]
gi|115513007|gb|ABJ01082.1| conserved hypothetical protein [Escherichia coli APEC O1]
gi|218365345|emb|CAR03066.1| conserved hypothetical protein [Escherichia coli S88]
gi|226900411|gb|EEH86670.1| ydiU [Escherichia sp. 3_2_53FAA]
gi|227837445|gb|EEJ47911.1| protein YdiU [Escherichia coli 83972]
gi|294494107|gb|ADE92863.1| conserved hypothetical protein [Escherichia coli IHE3034]
gi|300297370|gb|EFJ53755.1| SelO family protein [Escherichia coli MS 185-1]
gi|300406205|gb|EFJ89743.1| SelO family protein [Escherichia coli MS 45-1]
gi|307553728|gb|ADN46503.1| putative cytoplasmic protein YdiU [Escherichia coli ABU 83972]
gi|307626807|gb|ADN71111.1| hypothetical protein UM146_08620 [Escherichia coli UM146]
gi|315286398|gb|EFU45834.1| SelO family protein [Escherichia coli MS 110-3]
gi|315290513|gb|EFU49887.1| SelO family protein [Escherichia coli MS 153-1]
gi|323952214|gb|EGB48087.1| hypothetical protein ERKG_01165 [Escherichia coli H252]
gi|323956608|gb|EGB52346.1| hypothetical protein ERLG_02166 [Escherichia coli H263]
gi|355351817|gb|EHG01004.1| hypothetical protein i01_02248 [Escherichia coli cloneA_i1]
gi|355420297|gb|AER84494.1| hypothetical protein i02_1924 [Escherichia coli str. 'clone D i2']
gi|355425217|gb|AER89413.1| hypothetical protein i14_1924 [Escherichia coli str. 'clone D i14']
gi|371614292|gb|EHO02777.1| hypothetical protein ESPG_01027 [Escherichia coli H397]
gi|388412583|gb|EIL72640.1| hypothetical protein ECHM605_20698 [Escherichia coli HM605]
gi|430878030|gb|ELC01462.1| hypothetical protein WCC_01996 [Escherichia coli KTE4]
gi|430887210|gb|ELC10037.1| hypothetical protein WCE_01691 [Escherichia coli KTE5]
gi|430935152|gb|ELC55474.1| hypothetical protein WG9_02405 [Escherichia coli KTE39]
gi|430964543|gb|ELC81990.1| hypothetical protein A13M_01829 [Escherichia coli KTE188]
gi|430966963|gb|ELC84325.1| hypothetical protein A13O_01943 [Escherichia coli KTE189]
gi|430972517|gb|ELC89485.1| hypothetical protein A13S_02280 [Escherichia coli KTE191]
gi|430982619|gb|ELC99308.1| hypothetical protein A15C_02523 [Escherichia coli KTE201]
gi|431024271|gb|ELD37436.1| hypothetical protein A173_02887 [Escherichia coli KTE214]
gi|431039420|gb|ELD50240.1| hypothetical protein A17E_01490 [Escherichia coli KTE220]
gi|431052915|gb|ELD62551.1| hypothetical protein A17Y_01925 [Escherichia coli KTE230]
gi|431100555|gb|ELE05525.1| hypothetical protein A1SE_02284 [Escherichia coli KTE53]
gi|431108454|gb|ELE12426.1| hypothetical protein A1SI_02437 [Escherichia coli KTE55]
gi|431120303|gb|ELE23301.1| hypothetical protein A1SO_02320 [Escherichia coli KTE58]
gi|431128664|gb|ELE30846.1| hypothetical protein A1SS_02298 [Escherichia coli KTE60]
gi|431130560|gb|ELE32643.1| hypothetical protein A1SW_02406 [Escherichia coli KTE62]
gi|431138632|gb|ELE40444.1| hypothetical protein A1U7_02532 [Escherichia coli KTE67]
gi|431191014|gb|ELE90399.1| hypothetical protein A1W7_02146 [Escherichia coli KTE87]
gi|431302655|gb|ELF91834.1| hypothetical protein WEA_01429 [Escherichia coli KTE22]
gi|431326737|gb|ELG14082.1| hypothetical protein A1SQ_02247 [Escherichia coli KTE59]
gi|431329457|gb|ELG16743.1| hypothetical protein A1SY_02428 [Escherichia coli KTE63]
gi|431337247|gb|ELG24335.1| hypothetical protein A1U3_01640 [Escherichia coli KTE65]
gi|431367813|gb|ELG54281.1| hypothetical protein A1Y5_02560 [Escherichia coli KTE118]
gi|431372359|gb|ELG58021.1| hypothetical protein A1YA_03825 [Escherichia coli KTE123]
gi|431480484|gb|ELH60203.1| hypothetical protein A15S_04227 [Escherichia coli KTE209]
gi|431507084|gb|ELH85370.1| hypothetical protein A17A_02454 [Escherichia coli KTE218]
gi|431509964|gb|ELH88211.1| hypothetical protein A17K_02281 [Escherichia coli KTE223]
gi|431515068|gb|ELH92895.1| hypothetical protein A17S_02730 [Escherichia coli KTE227]
gi|431524194|gb|ELI01141.1| hypothetical protein A17W_00361 [Escherichia coli KTE229]
gi|431531833|gb|ELI08488.1| hypothetical protein WI5_01672 [Escherichia coli KTE104]
gi|431537130|gb|ELI13278.1| hypothetical protein WI9_01645 [Escherichia coli KTE106]
gi|431570738|gb|ELI43646.1| hypothetical protein WIM_01864 [Escherichia coli KTE124]
gi|431606962|gb|ELI76333.1| hypothetical protein WIY_01690 [Escherichia coli KTE137]
gi|431635086|gb|ELJ03301.1| hypothetical protein WKA_01749 [Escherichia coli KTE153]
gi|431646582|gb|ELJ14074.1| hypothetical protein WKE_01693 [Escherichia coli KTE160]
gi|431661638|gb|ELJ28450.1| hypothetical protein WKM_01541 [Escherichia coli KTE167]
gi|431671872|gb|ELJ38145.1| hypothetical protein WKQ_01859 [Escherichia coli KTE174]
gi|431675238|gb|ELJ41383.1| hypothetical protein WKS_01709 [Escherichia coli KTE176]
gi|431688578|gb|ELJ54096.1| hypothetical protein WKW_01696 [Escherichia coli KTE179]
gi|431688936|gb|ELJ54453.1| hypothetical protein WKY_01850 [Escherichia coli KTE180]
gi|431734795|gb|ELJ98171.1| hypothetical protein WI3_01692 [Escherichia coli KTE99]
gi|432347393|gb|ELL41853.1| hypothetical protein B185_011564 [Escherichia coli J96]
gi|441714626|emb|CCQ05191.1| Selenoprotein O and cysteine-containing homologs [Escherichia coli
Nissle 1917]
Length = 478
Score = 354 bits (908), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE D E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKDDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|397168311|ref|ZP_10491749.1| hypothetical protein Y71_2328 [Enterobacter radicincitans DSM
16656]
gi|396089846|gb|EJI87418.1| hypothetical protein Y71_2328 [Enterobacter radicincitans DSM
16656]
Length = 480
Score = 354 bits (908), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 212/521 (40%), Positives = 288/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A L ++ F + G L G P
Sbjct: 10 RDELPEFYTALSPTP-LHNARLIWHNAPLAQELGVEDALFHPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLPDGTTRDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A+S LRFG
Sbjct: 129 TIRESLASEAMHHLGIPTTRALSIVTSDTPVMRE-------SREQGAMLMRIAESHLRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + VR LAD+AIRHH+ H++N S+KY
Sbjct: 182 HFEHFYYR--REPQKVRQLADFAIRHHWPHLQN---------------------ESDKYV 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ R A+L+A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD + PSF N +D
Sbjct: 219 LWFRDIVRRIATLIARWQAVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYQPSFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + +L + ID + N ++ Y + EY +M KL
Sbjct: 279 YQG-RYSFDNQPAVALWNLQRLAQSL--SPFIDIEALNSALDDYQHLLLTEYGVLMRGKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G + + Q++++L MA + DYT FR LS + + + PL+ +D
Sbjct: 336 GFLTQQQGDNQLLTELFALMAREGSDYTRTFRLLSQTE------QQSVSSPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L + D +R+ LM+ VNP VLRN+L Q IDAAE GD E
Sbjct: 388 ---RAAFDRWFAQYRMRLQQEQVDDAQRQQLMSGVNPALVLRNWLAQRVIDAAEKGDASE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ +L + + +P+ ++ + Y PP W R V SCSS
Sbjct: 445 LAQLHEALRQPFRDRN--DDYVSRPPDWGKRLEV---SCSS 480
>gi|170680793|ref|YP_001743542.1| hypothetical protein EcSMS35_1484 [Escherichia coli SMS-3-5]
gi|422828984|ref|ZP_16877153.1| hypothetical protein ESNG_01658 [Escherichia coli B093]
gi|226725731|sp|B1LE24.1|YDIU_ECOSM RecName: Full=UPF0061 protein YdiU
gi|170518511|gb|ACB16689.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
gi|371612085|gb|EHO00603.1| hypothetical protein ESNG_01658 [Escherichia coli B093]
Length = 478
Score = 354 bits (908), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 215/515 (41%), Positives = 291/515 (56%), Gaps = 55/515 (10%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
YT +SP+ + +L+ + +A++L + F+ + + G T L G P AQ Y
Sbjct: 16 TYTALSPTP-LNKARLIWHNAELANTLSIPSSLFK--NGAGVWGGETLLPGMSPLAQVYS 72
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS+IRE L
Sbjct: 73 GHQFGVWAGQLGDGRGILLGEQQLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESL 132
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
SEAMH+LGIPTTRAL +V++ V R+ EPGA++ RVA S LRFG ++
Sbjct: 133 ASEAMHYLGIPTTRALSIVSSDSPVYRETV-------EPGAMLMRVAPSHLRFGHFEHFY 185
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
R + + VR LAD+AIRH++ H+ + DED KY W +V
Sbjct: 186 YR--REPEKVRQLADFAIRHYWSHLAD------------DED---------KYRLWFSDV 222
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D G RY
Sbjct: 223 VARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSDHQG-RY 281
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---P 488
F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KLG
Sbjct: 282 SFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKLGFMTEQ 339
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
K + ++++L + MA ++ DYT FR LS + + PL+ +D + A
Sbjct: 340 KEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-----RAA 388
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ W Y + L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E+ RL +
Sbjct: 389 FDDWFARYRRRLQQDEVSDIERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTELHRLHE 448
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+ ++ + Y PP W R V SCSS
Sbjct: 449 ALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|415842189|ref|ZP_11522923.1| hypothetical protein ECRN5871_4719 [Escherichia coli RN587/1]
gi|417283522|ref|ZP_12070819.1| hypothetical protein EC3003_1821 [Escherichia coli 3003]
gi|425277948|ref|ZP_18669214.1| hypothetical protein ECARS42123_2062 [Escherichia coli ARS4.2123]
gi|323187000|gb|EFZ72317.1| hypothetical protein ECRN5871_4719 [Escherichia coli RN587/1]
gi|386243465|gb|EII85198.1| hypothetical protein EC3003_1821 [Escherichia coli 3003]
gi|408203319|gb|EKI28374.1| hypothetical protein ECARS42123_2062 [Escherichia coli ARS4.2123]
Length = 478
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVDG--LNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE D E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKDDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRG--DDYVSRPPDWGKRLEV---SCSS 478
>gi|377575902|ref|ZP_09804886.1| hypothetical protein YdiU [Escherichia hermannii NBRC 105704]
gi|377541934|dbj|GAB50051.1| hypothetical protein YdiU [Escherichia hermannii NBRC 105704]
Length = 481
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 212/529 (40%), Positives = 300/529 (56%), Gaps = 53/529 (10%)
Query: 118 DPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGA 177
+P+ + R+ L Y+++SP+ + N +L +E +A SL+L + F+ + G
Sbjct: 3 NPKFITTWRDELPGFYSELSPTP-LTNARLFWHNEPLAQSLQLPEELFDYQGSAGVWGGE 61
Query: 178 TPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFA 237
L G P AQ Y GHQFG+WAGQLGDGR I LGE R++ LKGAG TPYSR
Sbjct: 62 ALLPGMSPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLDDGRRYDWHLKGAGLTPYSRMG 121
Query: 238 DGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRV 297
DG AVLRS++RE L SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+
Sbjct: 122 DGRAVLRSTLRECLASEAMHSLGIPTTRALSIVTSDTPVYRE-------TAERGAMMIRI 174
Query: 298 AQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVV 357
A+S +RFG ++ R + + V+ LA+Y IRHHF V
Sbjct: 175 AESHVRFGHFEHFYYR--REPERVQQLAEYVIRHHFPQW--------------------V 212
Query: 358 DLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDP 417
D +++ A EV RTA+L+A+WQ VGF+HGV+NTDNMS+LGLT+DYGP+GF+D + P
Sbjct: 213 D-EADRLALLLEEVIVRTATLIARWQAVGFSHGVMNTDNMSVLGLTMDYGPYGFMDDWQP 271
Query: 418 SFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEY 477
F N +D G RY F NQP +GLWN+ + + T A + + N +++ Y T + EY
Sbjct: 272 RFICNHSDYQG-RYAFDNQPAVGLWNLQRLAQTF--APFVSAERLNALLDTYQTVLLREY 328
Query: 478 QAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
+M KLGL + + +++ LL M + DYT FR LS + + PL
Sbjct: 329 GGLMRAKLGLMTEQQGDNDLLNTLLEQMQREGSDYTRTFRMLSETEQHSAAS------PL 382
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ +D + ++ +W Y + L + DE R+ M +VNP VLRNYL Q AIDA
Sbjct: 383 RDEFID-----RASFDAWFARYRERLQRETVDDERRQQAMKAVNPAIVLRNYLAQRAIDA 437
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AE GD E++RL + + P+ ++ ++Y+R PP W R V SCSS
Sbjct: 438 AEQGDVSEMQRLHQALREPFADRN--DEYSRRPPDWGKRLEV---SCSS 481
>gi|313216687|emb|CBY37949.1| unnamed protein product [Oikopleura dioica]
Length = 600
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 228/596 (38%), Positives = 318/596 (53%), Gaps = 101/596 (16%)
Query: 94 KMTKKLKALEDLNWDHSFVRELPGDPRTDS-IPREVLHACYTKVSPSAEVENPQLVAWSE 152
+ +++ E LN+D+ +++LP D D I R V +AC+ +V P+ V+ P++VA SE
Sbjct: 7 RNVRRMTTFEKLNFDNQALKQLPVDSSPDYLIQRPVPNACFHRVKPT-RVDEPKIVAISE 65
Query: 153 SVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGE 212
+ LDP EF R D + SG + GA A CY GHQFG +AGQLGDG + +GE
Sbjct: 66 DALKLIGLDPSEFLRSDAAEYLSGNSNFPGADYAAHCYCGHQFGNFAGQLGDGATMYIGE 125
Query: 213 ILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTT 272
+L RWE+Q KGAGKTP+SR ADG VLRSSIREFLCSEAMH LG+PTTRA +V +
Sbjct: 126 VLKENGSRWEIQFKGAGKTPFSRTADGRKVLRSSIREFLCSEAMHNLGVPTTRAGSIVVS 185
Query: 273 -GKFVTRDMFYDGNPKE-EPGAIVCRVAQSFLRFGSYQIHASRGQE--DLDIVRTLADYA 328
V RD FYDGN E EP +I+ R+A + RFGS++I G L++ LADY
Sbjct: 186 FDTTVIRDKFYDGNAHEAEPTSIITRLAPT--RFGSFEIIRRGGPSAGRLELATQLADYT 243
Query: 329 IRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFT 388
I+ + IE+ T KY V+E+TA L+A+WQ +G+
Sbjct: 244 IKTCYPQIED---------------------TEEKYKQLIKAVSEKTAELIAKWQLIGWC 282
Query: 389 HGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG---RRYCFANQPDIGLWNIA 445
HGV+NTDNMSI G+T+DYGPFGF+D FDP F N +D RY ++NQP IG WN+
Sbjct: 283 HGVMNTDNMSIAGVTLDYGPFGFMDRFDPEFICNASDNRDGYQGRYTYSNQPLIGKWNLI 342
Query: 446 QFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNN 501
+++ T+ L+ EA + E Y +M + K+GL + +K++ LL
Sbjct: 343 KWAETM--EHLVPRLEARECIQESYDETYMAALISGARSKMGLFEELDGDKELYESLLTA 400
Query: 502 MAVDKVDYTNFFRALSNVK--ADPSIPEDELLVPLKAVLL--------------DIGKER 545
M D+TN FRAL+ V+ AD + D + K +L D G E+
Sbjct: 401 MLESGADFTNTFRALAGVELSADGEV-NDSTVEKTKEFILNNCCYSAEDCQASPDAGSEQ 459
Query: 546 KEAWISWVLSY----------IQELLS---------------SGISDEE----------- 569
+ + + +L + +QE L+ + D+E
Sbjct: 460 ELSMLRMMLRHGMLDPEQQAQLQENLAEYEVKMNKFKMTNEEKKVKDKEYWDAWFTVYKV 519
Query: 570 ----------RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYD 615
RK LMNS NPK++LRN++ + +I AE GDF EV RLL+L + PYD
Sbjct: 520 RLSREKSNEGRKRLMNSANPKFILRNHILEKSIQMAEDGDFSEVNRLLELFKDPYD 575
>gi|419013542|ref|ZP_13560897.1| hypothetical protein ECDEC1D_2390 [Escherichia coli DEC1D]
gi|377858526|gb|EHU23365.1| hypothetical protein ECDEC1D_2390 [Escherichia coli DEC1D]
Length = 478
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 215/521 (41%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y HQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSSHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|224584144|ref|YP_002637942.1| hypothetical protein SPC_2386 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|254814082|sp|C0Q635.1|YDIU_SALPC RecName: Full=UPF0061 protein YdiU
gi|224468671|gb|ACN46501.1| hypothetical protein SPC_2386 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
Length = 480
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 293/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTL-LKNARLIWYNDKLAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KY
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYV 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+ +WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIVEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL ID N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTLTPFIEID--ALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DY+ FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYSRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ L +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHWLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|398836684|ref|ZP_10594016.1| hypothetical protein PMI40_04270 [Herbaspirillum sp. YR522]
gi|398211165|gb|EJM97788.1| hypothetical protein PMI40_04270 [Herbaspirillum sp. YR522]
Length = 497
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 222/518 (42%), Positives = 289/518 (55%), Gaps = 51/518 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T + P+ + P LV +S+ A + L + P FSG AG+ P A Y
Sbjct: 26 AFHTHLQPT-PIPAPYLVGFSDDAAAGIGLPRAALDDPAVLDVFSGNRVAAGSRPLAAVY 84
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLK-SERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
GHQFG+WAGQLGDGRAITLG++ + R ELQLKG+GKTPYSR DG AVLRSSIRE
Sbjct: 85 SGHQFGVWAGQLGDGRAITLGDVAAADGTGRIELQLKGSGKTPYSRGGDGRAVLRSSIRE 144
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
FLCSEAM LGIPTTRAL + + V R+ E A+V R A SF+RFGS++
Sbjct: 145 FLCSEAMAALGIPTTRALMVTGSDLRVMRE-------SVETAAVVTRAAPSFIRFGSFE- 196
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
H Q D ++ LAD + + + N Y A
Sbjct: 197 HWYYNQRH-DELKVLADTVLAQFYPALLQQG---------------------NPYQALLA 234
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EV RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N TD G
Sbjct: 235 EVTRRTAHLMAQWQAVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDSRHICNHTDQQG- 293
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEAN-YVMERYGTKFMDEYQAIMTKKLGLP 488
RY +A QP IG WN F+ A LI EA + + + + +M KLGL
Sbjct: 294 RYSYAMQPRIGQWNC--FALGQALLPLIGTVEATEAALAGFEASYDQRHGELMRAKLGLA 351
Query: 489 KY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
++ +I L + + VD+T FFR L +++ D +I DE L+ +++D
Sbjct: 352 TMRAEDEALIDALFAILQANHVDFTLFFRRLGHLQID-NIGGDE---ALRDLVID----- 402
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+ A+ +W Y + L + D R+ MN+VNPKYVLRNYL Q+AI+ A DF EV R
Sbjct: 403 RPAFDAWATRYRERLRAEQSEDGARQLAMNAVNPKYVLRNYLAQTAIERAAHRDFSEVAR 462
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L ++ RP+DEQP ++YA LPP WA +SCSS
Sbjct: 463 LQAILRRPFDEQPEHQRYAELPPDWA---AGLEVSCSS 497
>gi|432397507|ref|ZP_19640288.1| hypothetical protein WEI_02426 [Escherichia coli KTE25]
gi|432723131|ref|ZP_19958051.1| hypothetical protein WE1_02160 [Escherichia coli KTE17]
gi|432727718|ref|ZP_19962597.1| hypothetical protein WE3_02162 [Escherichia coli KTE18]
gi|432741409|ref|ZP_19976128.1| hypothetical protein WEE_02090 [Escherichia coli KTE23]
gi|432990718|ref|ZP_20179382.1| hypothetical protein A179_02492 [Escherichia coli KTE217]
gi|433110929|ref|ZP_20296794.1| hypothetical protein WK9_01792 [Escherichia coli KTE150]
gi|430915611|gb|ELC36689.1| hypothetical protein WEI_02426 [Escherichia coli KTE25]
gi|431265685|gb|ELF57247.1| hypothetical protein WE1_02160 [Escherichia coli KTE17]
gi|431273407|gb|ELF64481.1| hypothetical protein WE3_02162 [Escherichia coli KTE18]
gi|431283100|gb|ELF73959.1| hypothetical protein WEE_02090 [Escherichia coli KTE23]
gi|431494800|gb|ELH74386.1| hypothetical protein A179_02492 [Escherichia coli KTE217]
gi|431628233|gb|ELI96609.1| hypothetical protein WK9_01792 [Escherichia coli KTE150]
Length = 478
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 293/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LR+G
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRYG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+ + DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWPHLAD------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL + + N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTL--SPFVAVYGLNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRVRLQQDEVTDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|115351947|ref|YP_773786.1| hypothetical protein Bamb_1896 [Burkholderia ambifaria AMMD]
gi|122322962|sp|Q0BEH1.1|Y1896_BURCM RecName: Full=UPF0061 protein Bamb_1896
gi|115281935|gb|ABI87452.1| protein of unknown function UPF0061 [Burkholderia ambifaria AMMD]
Length = 522
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 223/535 (41%), Positives = 295/535 (55%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V S+ VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLPAPYVVGCSDEVAQLLGLPASFATQPGFAELFAGNPTRDWPAHALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQ+KG G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQIKGGGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNDRPDL--LRQLADHVI---------------------DRFYPACREADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFVDAFDANHICNHSDTS 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ ++ +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLELERENDAELANKLLETMHASHADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D ++A+ +W Y + L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----RDAFDAWANLYRERLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 EVAIRRAKEKDFSEVERLAQVLRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|172060873|ref|YP_001808525.1| hypothetical protein BamMC406_1826 [Burkholderia ambifaria MC40-6]
gi|226696090|sp|B1YRN5.1|Y1826_BURA4 RecName: Full=UPF0061 protein BamMC406_1826
gi|171993390|gb|ACB64309.1| protein of unknown function UPF0061 [Burkholderia ambifaria MC40-6]
Length = 522
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 223/535 (41%), Positives = 295/535 (55%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S+ VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLPAPYVVGFSDEVAQLLGLPASFAAQPGFAELFAGNPTRDWPAHALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQ+KG G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQIKGGGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNDRPDL--LRQLADHVI---------------------DRFYPACRDADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAAMLRTADLVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFVDAFDANHICNHSDTS 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ ++ +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLELERENDAELANKLLETMHASHADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D ++A+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----RDAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 EVAIRRAKEKDFSEVERLAQVLRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|453065567|gb|EMF06528.1| hypothetical protein F518_06754 [Serratia marcescens VGH107]
Length = 480
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 211/528 (39%), Positives = 297/528 (56%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P+ D+ + L YT ++P+ +++ +L+ SE +A L LD F + P++ +G T
Sbjct: 2 PQFDNAYYQQLPGFYTALNPTP-LKDTRLLYHSEPLARELGLDESWFTQDKTPIW-AGET 59
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 60 LLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQVMADGSHRDWHLKGAGLTPYSRMGD 119
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS +REFL SEA+H LGIPTTRAL +VT+ + V R+ + E GA++ RVA
Sbjct: 120 GRAVLRSVVREFLASEALHHLGIPTTRALTIVTSQQPVYRE-------QPERGAMLLRVA 172
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q + VR LAD+ I H+ +++
Sbjct: 173 ESHVRFGHFEHFYYRKQPEQ--VRQLADFVIARHWPQLQDQ------------------- 211
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
+++Y W +V ERTA L+A WQ VGF HGV+NTDNMSILG+TIDYGP+GFLD + P
Sbjct: 212 --ADRYQLWFTDVVERTARLIAHWQTVGFAHGVMNTDNMSILGITIDYGPYGFLDDYQPG 269
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
+ N +D G RY + NQP + LWN+ + + TL+ L+ ++ + Y M Y
Sbjct: 270 YICNHSDHQG-RYAYDNQPAVALWNLHRLAQTLSG--LMTTEQLQQALAAYEPALMRAYG 326
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + +++ LL+ MA + DYT FR LS + + + PL+
Sbjct: 327 EQMRAKLGFFTPTAQDNDVLTGLLSLMAQEGRDYTRTFRLLSETE------QQQAQSPLR 380
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + A+ +W Y Q L +SD +R+ M +VNP+ +LRNYL Q AI+ A
Sbjct: 381 DEFID-----RAAFDAWYQQYRQRLQQEQVSDADRQRSMKAVNPRLILRNYLAQQAIEDA 435
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D G +RRL + + RP+DE P + A LPP W +SCSS
Sbjct: 436 EKDDVGRLRRLHQALLRPFDEAPEYDDLAALPPDWGKH---LEISCSS 480
>gi|187923914|ref|YP_001895556.1| hypothetical protein Bphyt_1924 [Burkholderia phytofirmans PsJN]
gi|226701080|sp|B2T421.1|Y1924_BURPP RecName: Full=UPF0061 protein Bphyt_1924
gi|187715108|gb|ACD16332.1| protein of unknown function UPF0061 [Burkholderia phytofirmans
PsJN]
Length = 518
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 214/524 (40%), Positives = 285/524 (54%), Gaps = 64/524 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P LV +S A L L+P P F FSG A A+PYA Y GHQ
Sbjct: 41 PAAPLSAPYLVGFSAETAALLGLEPGLENDPGFAELFSGNLTREWPAEALPYASVYSGHQ 100
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+ LGE+ + +R+ELQLKGAG+TPYSR DG AVLRSSIRE+LCSE
Sbjct: 101 FGVWAGQLGDGRALGLGEVEH-NGQRFELQLKGAGRTPYSRMGDGRAVLRSSIREYLCSE 159
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC++ + + V R+ E A+V RVA SF+RFG ++ S
Sbjct: 160 AMHHLGIPTTRALCVIGSDQPVRRETV-------ETAAVVTRVAPSFVRFGHFEHFYS-- 210
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ D +R LAD+ I + H + + Y A E
Sbjct: 211 NDRTDALRALADHVIERFYPHCREAD---------------------DPYLALLNEAVIS 249
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA L+ WQ VGF HGV+NTDNMSI+GLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 250 TADLMVDWQAVGFCHGVMNTDNMSIVGLTIDYGPFGFMDGFDAGYICNHSDSQG-RYAYK 308
Query: 435 NQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMT 482
QP I WN+ + ++ K I+D A V+ + +F + M
Sbjct: 309 MQPQIAYWNLFCLAQGLLPLLGEKHEESVRGDKAIED--AQRVLGGFKNRFAPALERRMR 366
Query: 483 KKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL 539
KLGL + + ++++L M ++ D+T FR L+ V + + P++ + L
Sbjct: 367 AKLGLEIEREGDDGLVNRLFEVMHANRADFTLTFRNLARVSKHDASGD----APVRDLFL 422
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
D + A+ +WV Y L D R MN VNPK+VLRN+L ++AI A+ D
Sbjct: 423 D-----RAAFDAWVNDYRARLSEETRDDAARAIAMNRVNPKFVLRNHLAETAIRRAKEKD 477
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
F EV RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 478 FSEVERLAAILRRPFDEQPEHEAYAGLPPDWA---SSLEVSCSS 518
>gi|120611610|ref|YP_971288.1| hypothetical protein Aave_2947 [Acidovorax citrulli AAC00-1]
gi|120590074|gb|ABM33514.1| protein of unknown function UPF0061 [Acidovorax citrulli AAC00-1]
Length = 498
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 219/515 (42%), Positives = 281/515 (54%), Gaps = 54/515 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P+ + P+ VA SE+ A + L+P SG L G P A Y G
Sbjct: 34 FTELVPT-PLPGPRWVAGSEATARLIGLEPDWLGSDAAVQVLSGNALLRGMRPLASVYSG 92
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI LGE +E+QLKG+G+TPYSR DG AVLRSSIREFLC
Sbjct: 93 HQFGVWAGQLGDGRAILLGE----TDTGYEVQLKGSGRTPYSRMGDGRAVLRSSIREFLC 148
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + V R+ + E A+V RVA SF+RFG ++ A+
Sbjct: 149 SEAMHALGIPTTRALALTASPAPVVRE-------EIETAAVVTRVAPSFVRFGHFEHFAA 201
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R Q + +R LADY I ++ + + +N YAA V
Sbjct: 202 RDQ--VRELRALADYVIDRYYPGCRDAGGAPG----------------ANPYAALLQAVG 243
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA+L+AQWQ VGF HGV+NTDNMSILGLTIDYGPF FLDAF P N +D G RY
Sbjct: 244 ARTAALLAQWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFVPGHICNHSDSQG-RYA 302
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGL---P 488
F QP + WN+ F A LI D E A +E Y T F EY A M KLGL
Sbjct: 303 FNRQPQVAYWNL--FCLGQALMPLIGDTELAQAALEPYRTAFPAEYMARMRAKLGLVSAA 360
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ + ++ LL +A D VDYT F+ LS A P++ + +D +
Sbjct: 361 EGDAALVDDLLGLLAADAVDYTIFWHRLSQAVASGD------FTPVRDLFID-----RAG 409
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
W +W Y Q L G ++ LM NP++VLRN+L + I AA+ GDF + L
Sbjct: 410 WEAWSARYRQRL---GSGSDQAAGLMERTNPRFVLRNHLGEQTIRAAKSGDFAPLHALQA 466
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ RP+DE P ++A PP WA +SCSS
Sbjct: 467 VLARPFDEHPAHAEWAGFPPDWA---SSIEISCSS 498
>gi|347540772|ref|YP_004848197.1| hypothetical protein NH8B_2992 [Pseudogulbenkiania sp. NH8B]
gi|345643950|dbj|BAK77783.1| protein of unknown function [Pseudogulbenkiania sp. NH8B]
Length = 488
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 211/517 (40%), Positives = 286/517 (55%), Gaps = 51/517 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A Y +V P+ + +P VA S +A L + + D SG+ P A Y
Sbjct: 19 AFYRRVDPTP-LPDPYPVAVSRPLAAELGVAGESLLGADAVGVLSGSALRPDMRPVAAIY 77
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ QLGDGRA+ LG+ E Q+KGAG TP+SR DG AVLRSSIREF
Sbjct: 78 SGHQFGVYVPQLGDGRALLLGDTKAPDGRLMEWQIKGAGLTPFSRMGDGRAVLRSSIREF 137
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAMH LGIPTTRAL ++ + + V R+ E A+V RVA+SFLRFGS+++
Sbjct: 138 LCSEAMHHLGIPTTRALAIMGSDEPVYRE-------TTETAAVVTRVAESFLRFGSFELF 190
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
RG D +R LADY IRHH+ + +N Y A E
Sbjct: 191 YHRGMHDE--IRVLADYVIRHHYPACQE---------------------AANPYLALFAE 227
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+AQWQ VGF HGV+N+DNMSILGLTIDYGPFGF+D F+ + N +D G R
Sbjct: 228 VTRRTAELIAQWQAVGFCHGVMNSDNMSILGLTIDYGPFGFIDGFNAAHICNHSDHAG-R 286
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y + QP IGLWN+ ++ L L+ ++E V+ Y F + + KLGL
Sbjct: 287 YAYNQQPQIGLWNLHCLASAL--LPLVSEEELVAVLGSYRDTFEAAHLMRLRAKLGLTAE 344
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ +I+ L + + D+T FFR+L+ + D D + P++ + ++ +E
Sbjct: 345 HDDDADLINSLFLTLHAHRTDFTIFFRSLAGFRQD-----DAVNAPVRDLFVE-----RE 394
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAI-DAAELGDFGEVRRL 606
+ +W Y + L G D ER MN VNPKY+LRNYL ++AI A + D+ E+ L
Sbjct: 395 QFDAWARRYRERLAWEGSVDAERAVRMNRVNPKYILRNYLAEAAIAKARDERDYSEIEHL 454
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ +E+P+DEQP E YA PP WA + V SCSS
Sbjct: 455 GRCLEKPFDEQPEFEAYAGFPPEWAEQISV---SCSS 488
>gi|295096100|emb|CBK85190.1| Uncharacterized conserved protein [Enterobacter cloacae subsp.
cloacae NCTC 9394]
Length = 480
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 210/518 (40%), Positives = 287/518 (55%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ ++N +L+ ++++ADSL + F+ + G T L G P AQ
Sbjct: 13 LPGFYTALKPTP-LQNARLIWHNDALADSLGIPSTLFQPEKGAGVWGGETLLPGMKPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE L E + LKGAG TPYSR DG AVLRS+IR
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQLLPNGETLDWHLKGAGLTPYSRMGDGRAVLRSTIR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPT+RAL +VT+ V R+ E GA++ RVA+S LRFG ++
Sbjct: 132 EGLASEAMHALGIPTSRALSIVTSDTPVARETM-------EQGAMLIRVAESHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + D VR LADYA+R H+ H++N ++Y W
Sbjct: 185 HFYYR--REPDKVRQLADYALRRHWPHLQN---------------------EPDRYVLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ RTA+++A+WQ VGF HGV+NTDNMS+LGLT DYGP+GFLD + P + N +D G
Sbjct: 222 RDIVARTAAMIARWQAVGFAHGVMNTDNMSLLGLTFDYGPYGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID N ++ Y + EY +M KLGL
Sbjct: 282 -RYRFDNQPAVGLWNLQRLAQSL--SPFIDVDALNDALDSYQEILLREYGVLMRNKLGLM 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + +++ L MA + DYT FR LS + PL+ +D
Sbjct: 339 TQEKSDNALLNGLFAIMAREGSDYTRTFRMLSQTAQQSAAS------PLRDEFID----- 387
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ W Y L I D+ R+ M +VNP VLRN+L Q AI+ AE GD+ E+ R
Sbjct: 388 RQAFDDWFAVYRTRLQQEQIDDDTRQTRMKAVNPAMVLRNWLAQRAIEQAEQGDYTELHR 447
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 448 LHIALRTPFADRE--DDYVSRPPDWGKRLEV---SCSS 480
>gi|429100196|ref|ZP_19162170.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
turicensis 564]
gi|426286845|emb|CCJ88283.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
turicensis 564]
Length = 482
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 216/528 (40%), Positives = 290/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPGFYTELTPTP-LNNSRLFFHNAPLAQALELPQTLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYRRES--ESVRELAQYVIEHHFAHLAQ------------EED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVTRTAHLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPVLLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSETEQRSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGVEDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+D++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRHPFDDRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|161524539|ref|YP_001579551.1| hypothetical protein Bmul_1366 [Burkholderia multivorans ATCC
17616]
gi|189350705|ref|YP_001946333.1| hypothetical protein BMULJ_01877 [Burkholderia multivorans ATCC
17616]
gi|226696161|sp|A9AJS7.1|Y1877_BURM1 RecName: Full=UPF0061 protein Bmul_1366/BMULJ_01877
gi|160341968|gb|ABX15054.1| protein of unknown function UPF0061 [Burkholderia multivorans ATCC
17616]
gi|189334727|dbj|BAG43797.1| conserved hypothetical protein [Burkholderia multivorans ATCC
17616]
Length = 522
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 223/535 (41%), Positives = 294/535 (54%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S+ VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDEVARLLGLPASLAAQPGFAELFAGNPTRDWPAEALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG+G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGSGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNNRPDL--LRALADHVI---------------------DRFYPACRDADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTG 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ + +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLATFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++LL M + D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLALERDGDAALANQLLETMHASRADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D +EA+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 ELAIRRAKEKDFSEVERLAQVLRRPFDEQPEHESYAALPPDWA---GSLEVSCSS 522
>gi|296102753|ref|YP_003612899.1| hypothetical protein ECL_02407 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
gi|295057212|gb|ADF61950.1| hypothetical protein ECL_02407 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
Length = 480
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 212/518 (40%), Positives = 287/518 (55%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ + + +LV ++S+A+ L + P+ F+ D + G T L G P AQ
Sbjct: 13 LPGFYTALKPTP-LHHSRLVWHNDSLANDLAIPPEMFQPSDGAGVWGGETLLDGMQPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE E + LKGAG TPYSR DG AVLRS+IR
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQQLPGGETVDWHLKGAGLTPYSRMGDGRAVLRSTIR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+AQS LRFG ++
Sbjct: 132 ESLASEAMHALGIPTTRALTIVTSDTPVVRETV-------EKGAMLMRIAQSHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + + VR LADYAIR H+ +++ ++KY W
Sbjct: 185 HFYYR--REPENVRQLADYAIRRHWPQLQD---------------------EADKYHLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V RTA ++A+WQ VGF HGV+NTDNMSILGLT DYGPFGFLD + P + N +D G
Sbjct: 222 RDVVARTAIMIARWQSVGFAHGVMNTDNMSILGLTFDYGPFGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID N ++ Y + EY +M KLGL
Sbjct: 282 -RYSFDNQPAVGLWNLQRLAQSL--SPFIDVDALNDALDGYQETLLREYGTLMRNKLGLM 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + I++ L MA + DYT FR L + + PL+ +D
Sbjct: 339 TQEKGDNTILNGLFALMAREGSDYTRTFRMLGQTEQHSAAS------PLRDEFID----- 387
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ W +Y L + D R+A MN+ NP VLRN+L Q AI+ AE G++ E+ R
Sbjct: 388 RQAFDDWFATYRARLQQEQVDDATRQAQMNAANPAMVLRNWLAQRAIEQAEQGEYAELHR 447
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 448 LHVALRTPFADRD--DDYVSRPPDWGKRLEV---SCSS 480
>gi|238026991|ref|YP_002911222.1| hypothetical protein [Burkholderia glumae BGR1]
gi|237876185|gb|ACR28518.1| Hypothetical protein bglu_1g13690 [Burkholderia glumae BGR1]
Length = 521
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 220/530 (41%), Positives = 290/530 (54%), Gaps = 73/530 (13%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P ++ +S+ +A L LDP P F F G A A+PYA Y GHQ
Sbjct: 41 PAAPLPAPYVIGFSDELARELGLDPSIRALPGFAELFCGNPTRDWPAAALPYATVYSGHQ 100
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+T+GE L R E QLKGAG+TPYSR DG AVLRSSIREFLCSE
Sbjct: 101 FGVWAGQLGDGRALTIGE-LEHAGRRVEFQLKGAGRTPYSRMGDGRAVLRSSIREFLCSE 159
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRAL L+ + + VTR+ E A+V RVA SF+RFG ++ +
Sbjct: 160 AMHHLGIPTTRALALIGSDQPVTREEI-------ETAAVVTRVADSFVRFGHFEHFFAND 212
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ DL ++ LAD+ I + D D + Y A V +R
Sbjct: 213 RPDL--LKQLADHVIARFY------------------PDCRAAD---DPYLALLEAVMQR 249
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA ++AQWQ VGF HGV+NTDNMSILGLT+DYGPFGF+D FD S N TD G RY +
Sbjct: 250 TARMLAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFIDGFDASHICNHTDTQG-RYAYR 308
Query: 435 NQPDIGLWNIAQFSTTL------AAAKLIDD-------KEANYVMERYGTKFMDEYQAIM 481
QP I WN + L +L DD ++A V+ R+ F +A M
Sbjct: 309 MQPRIAHWNCFCLAQALLPLIGQQRTELDDDPRTERAVEDAQAVLARFPETFGPALEAAM 368
Query: 482 TKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-----KADPSIPEDELLVP 533
KLGL + + + ++LL M + D+T FR L+++ +AD ++
Sbjct: 369 RAKLGLALELEGDAALANRLLEIMNGSRADFTLTFRRLAHLSKHDARADGAV-------- 420
Query: 534 LKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAID 593
+ + +D + A+ W Y + L + D R MN VNPKYVLRN+L ++AI
Sbjct: 421 -RDLFID-----RAAFDGWAAQYRERLAAEPRDDAARAEAMNRVNPKYVLRNHLAETAIR 474
Query: 594 AAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A DF E+ RL +++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 475 RAAEKDFSELERLARILRRPFDEQPEYEAYAALPPDWA---STLEVSCSS 521
>gi|241763909|ref|ZP_04761952.1| protein of unknown function UPF0061 [Acidovorax delafieldii 2AN]
gi|241366804|gb|EER61236.1| protein of unknown function UPF0061 [Acidovorax delafieldii 2AN]
Length = 494
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 221/517 (42%), Positives = 292/517 (56%), Gaps = 54/517 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T++ P+ + P V S SVA+ L+LD + + F+G G+ P A Y
Sbjct: 28 AFFTRLDPT-PLPQPYWVGISSSVAELLDLDAQWMASDEALQVFTGNACPVGSRPLASVY 86
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRAI LGE +E E+QLKG+G+TPYSR DG AVLRSSIREF
Sbjct: 87 SGHQFGVWAGQLGDGRAILLGE----TTEGLEVQLKGSGRTPYSRMGDGRAVLRSSIREF 142
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAMH LGIPT+RALC+ + V R+ + E A+V RVA SF+RFG ++
Sbjct: 143 LCSEAMHALGIPTSRALCVTGSPAPVRRE-------ETETAAVVTRVAPSFVRFGHFEHF 195
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
A+R + + LADY I ++ + SN YAA
Sbjct: 196 AARDMQTE--LHALADYVIERYYPACRTAPQP-----------------ASNAYAALLQA 236
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V+ERTA+L+A WQ VGF HGV+NTDNMSILGLTIDYGPF FLDAF P N +D G R
Sbjct: 237 VSERTATLMAHWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFVPGHVCNHSDTQG-R 295
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLI-DDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP++ WN+ F A LI D+ A +E Y T F + A M +KLGL
Sbjct: 296 YAYNRQPNVAYWNL--FCLAQALLPLIGDEGVARTALESYKTVFSTNFMAQMRRKLGLAD 353
Query: 490 ---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++I +L +A + VD+T F+R LS+ A P++ + LD
Sbjct: 354 AAPADGELIDAILLLLAREGVDHTIFWRRLSHAVARHD------FAPVRDLFLD-----G 402
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
W W+LSY + + + + LM NPK+VLRN+L + AI AA+LGDF V+ L
Sbjct: 403 AGWDRWLLSYSERIAQT--DKAQSGDLMLKTNPKFVLRNHLGEQAIRAAKLGDFSPVQTL 460
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L L+E P+DE PG + +A PP WA +SCSS
Sbjct: 461 LHLLEHPFDEHPGHDAWADFPPDWA---SSIEISCSS 494
>gi|407713393|ref|YP_006833958.1| hypothetical protein BUPH_02205 [Burkholderia phenoliruptrix
BR3459a]
gi|407235577|gb|AFT85776.1| hypothetical protein BUPH_02205 [Burkholderia phenoliruptrix
BR3459a]
Length = 518
Score = 352 bits (904), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 214/524 (40%), Positives = 286/524 (54%), Gaps = 64/524 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P LV +S A L L+P P F FSG + A+PYA Y GHQ
Sbjct: 41 PAAPLNAPYLVGFSADTAAMLGLEPGLETDPGFAELFSGNATREWPSEALPYASVYSGHQ 100
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+ LGE+ + + R+ELQLKGAG+TPYSR DG AVLRSSIREFLCSE
Sbjct: 101 FGVWAGQLGDGRALGLGEVEH-EGRRYELQLKGAGRTPYSRMGDGRAVLRSSIREFLCSE 159
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC++ + + V R+ E A+V RVA SF+RFG ++ S
Sbjct: 160 AMHHLGIPTTRALCVIGSDQPVRREEI-------ETAAVVTRVAPSFVRFGHFEHFYS-- 210
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ D +R LAD+ I + H + + Y A E
Sbjct: 211 NDRTDALRALADHVIERFYPHCREAD---------------------DPYLALLNEAVMS 249
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA L+ +WQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 250 TADLMVEWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDAGYICNHSDSQG-RYAYR 308
Query: 435 NQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMT 482
QP I WN+ ++ T+ K I+D A V+ + +F + M
Sbjct: 309 MQPQIAYWNLFCLAQGLLPLLGERYEDTVRGDKSIED--AQQVLAGFKDRFGPALERRML 366
Query: 483 KKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL 539
KLGL + + + ++L + M ++ D+T FR L+ + + + P++ + L
Sbjct: 367 AKLGLEDAREGDAALANRLFDVMHANRADFTLTFRNLARLSKHDASGD----APVRDLFL 422
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
D + A+ +W Y L D R MN VNPK+VLRN+L ++AI A+ D
Sbjct: 423 D-----RAAFDAWANDYRARLSHETRDDAARAIAMNRVNPKFVLRNHLAETAICRAKEKD 477
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
F EV RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 478 FSEVERLAAVLRRPFDEQPEHEAYAGLPPDWA---SSLEVSCSS 518
>gi|121604738|ref|YP_982067.1| hypothetical protein Pnap_1836 [Polaromonas naphthalenivorans CJ2]
gi|120593707|gb|ABM37146.1| protein of unknown function UPF0061 [Polaromonas naphthalenivorans
CJ2]
Length = 497
Score = 352 bits (904), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 215/502 (42%), Positives = 284/502 (56%), Gaps = 52/502 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT+++PS + +P V + ++A L L + E + +G PLAG+ P A Y G
Sbjct: 34 YTELAPS-PLPSPYWVGRNRALARELGLHDQWLESAETLAALTGNQPLAGSRPLASVYAG 92
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI LGE+ + + E+QLKGAGKTPYSR DG AVLRSSIREFLC
Sbjct: 93 HQFGVWAGQLGDGRAILLGELETPRGPQ-EIQLKGAGKTPYSRMGDGRAVLRSSIREFLC 151
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGI TTRALC+ + V R+ E A+V R A SF+RFG ++ +
Sbjct: 152 SEAMHGLGIATTRALCVTGSDAAVRREEI-------ETAAVVTRTAPSFIRFGHFEHFSY 204
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + ++ LADY I + + YAA V+
Sbjct: 205 RNKPAQ--LKALADYVIARFYPDCREARQ---------------------PYAALLQAVS 241
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA ++A WQ VGF HGV+NTDNMSILGLTIDYGPF FLDAFDP N +D G RY
Sbjct: 242 ERTAHMMAAWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFDPGHICNHSDDHG-RYA 300
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLP--- 488
+ QP++ WN+ F A LI+++E A +E Y T F QA M KLGLP
Sbjct: 301 YNKQPNMAYWNL--FCLGQALLPLIENQEDALAALESYKTVFPQALQARMRAKLGLPDEH 358
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ + Q+I +A +KVDYT F+R L A P++ + D+ E+
Sbjct: 359 ESDGQLIESTFRLLASNKVDYTIFWRRLCGFTAQSGHE------PVRDLFFDL-----ES 407
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ +W L Y + L I+ ++ LM NPKYVLRN+L + AI AA+L DF +V LL
Sbjct: 408 FNAWALQYSERLAPVDIA--QKADLMLKSNPKYVLRNHLGEEAIQAAKLKDFSQVDTLLT 465
Query: 609 LMERPYDEQPGMEKYARLPPAW 630
L++ P+DE PG + +A PP W
Sbjct: 466 LLQAPFDEHPGQDSFAGFPPDW 487
>gi|399016945|ref|ZP_10719148.1| hypothetical protein PMI16_00045 [Herbaspirillum sp. CF444]
gi|398104464|gb|EJL94599.1| hypothetical protein PMI16_00045 [Herbaspirillum sp. CF444]
Length = 505
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 222/523 (42%), Positives = 286/523 (54%), Gaps = 54/523 (10%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPY 186
E+ A +T + P+ + P LV S AD + LDP F F+G + P
Sbjct: 31 ELPPAFHTHLQPT-PLRAPYLVGVSADAADLIGLDPAMANSSSFVDVFTGNAVARDSKPL 89
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRAI LG++ R ELQLKGAG+TPYSR DG AVLRSS
Sbjct: 90 AAVYSGHQFGVWAGQLGDGRAILLGDLPARDGGRMELQLKGAGQTPYSRMGDGRAVLRSS 149
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAM LGIPTTRALC+ + + V R+ E A+V R++ SF+RFGS
Sbjct: 150 IREFLCSEAMAALGIPTTRALCVTGSDQQVRRETM-------ETTAVVTRMSPSFIRFGS 202
Query: 307 YQ--IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
++ ++ R E ++ LAD I + + G E N Y
Sbjct: 203 FEHWYYSKRHDE----LKLLADNVIANFYPEF------------LGAE---------NPY 237
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N T
Sbjct: 238 RELLAEVTRRTAHLMAHWQAVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDARHICNHT 297
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD-KEANYVMERYGTKFMDEYQAIMTK 483
D G RY + QP IG WN F+ A LI +E + +Y +F + A++
Sbjct: 298 DQQG-RYSYQMQPRIGQWNC--FALGQALLPLIGSVEETEAALAQYEAEFAAKNDALLHA 354
Query: 484 KLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
KLGL + ++ + + VD+T FFR LS+++A ++ L D
Sbjct: 355 KLGLATRQPDDDKLFEAMFAILQAGHVDFTLFFRRLSDIQAGSDAGDE--------ALRD 406
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+ ER A+ +W Y L D RK M++ NPKYVLRNYL Q AID A+ DF
Sbjct: 407 LFIERP-AFDAWAAQYRARLQQENSLDAPRKLAMDASNPKYVLRNYLAQVAIDKAQNKDF 465
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
EV +LL ++ RP+DEQP +KYA LPP WA V SCSS
Sbjct: 466 SEVAKLLDILRRPFDEQPEHDKYADLPPDWASHLEV---SCSS 505
>gi|221215074|ref|ZP_03588041.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
gi|221165010|gb|EED97489.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
Length = 522
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 223/535 (41%), Positives = 294/535 (54%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S+ VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDEVARLLGLPASLAAQPGFAELFAGNPTRDWPAEALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG+G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGSGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNNRPDL--LRALADHVI---------------------DRFYPACRDADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTG 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ + +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLATFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++LL M + D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLELERDGDAALANQLLETMHASRADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D +EA+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 ELAIRRAKEKDFSEVERLAQVLRRPFDEQPEHESYAALPPDWA---GSLEVSCSS 522
>gi|432431859|ref|ZP_19674291.1| hypothetical protein A13K_02144 [Escherichia coli KTE187]
gi|432844524|ref|ZP_20077423.1| hypothetical protein A1YS_02163 [Escherichia coli KTE141]
gi|433207805|ref|ZP_20391488.1| hypothetical protein WI1_01571 [Escherichia coli KTE97]
gi|430953408|gb|ELC72306.1| hypothetical protein A13K_02144 [Escherichia coli KTE187]
gi|431394851|gb|ELG78364.1| hypothetical protein A1YS_02163 [Escherichia coli KTE141]
gi|431730817|gb|ELJ94376.1| hypothetical protein WI1_01571 [Escherichia coli KTE97]
Length = 478
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 292/521 (56%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NGAGVWGGENLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------ESGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H++ DE+ +KY
Sbjct: 180 HFEHFYYRREP--EKVRQLADFAIRHYWSHLD-------------DEE--------DKYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFTDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N + Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEAPDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D ER+ LM SVNP VLRN+L Q AI+AAE D E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEITDSERQQLMQSVNPALVLRNWLAQRAIEAAEKDDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVSRPPDWGKRLEV---SCSS 478
>gi|170692428|ref|ZP_02883591.1| protein of unknown function UPF0061 [Burkholderia graminis C4D1M]
gi|170142858|gb|EDT11023.1| protein of unknown function UPF0061 [Burkholderia graminis C4D1M]
Length = 518
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 215/534 (40%), Positives = 291/534 (54%), Gaps = 66/534 (12%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVP 185
L + + P+ + P +V +S A L L+P + P F FSG A A+P
Sbjct: 32 LGSTFVTRLPATPLNAPYVVGFSSETAAMLGLEPGLEKDPGFAELFSGNATREWPADALP 91
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
YA Y GHQFG+WAGQLGDGRA+ LGE+ +R+ELQLKGAG+TPYSR DG AVLRS
Sbjct: 92 YASVYSGHQFGVWAGQLGDGRALGLGEV-EQDGQRFELQLKGAGRTPYSRMGDGRAVLRS 150
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIREFLCSEAMH LGIPTTRALC++ + + V R+ + E A+V RVA SF+RFG
Sbjct: 151 SIREFLCSEAMHHLGIPTTRALCVIGSDQPVRRE-------EVETAAVVTRVAPSFVRFG 203
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ S + D +R LAD+ I + H + + Y
Sbjct: 204 HFEHFYS--NDRTDALRALADHVIERFYPHCREAD---------------------DPYL 240
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
A E TA L+ +WQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D
Sbjct: 241 ALLNEAVLSTADLMVEWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDAGYICNHSD 300
Query: 426 LPGRRYCFANQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKF 473
G RY + QP I WN+ ++ ++ K I+D A V+ + +F
Sbjct: 301 SQG-RYAYRMQPQIAYWNLFCLAQGLLPLLGERYEESVRGDKSIED--AQRVLAGFKDRF 357
Query: 474 MDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDE 529
+ M+ KLGL + ++++L + M ++ D+T FR L+ + K D S
Sbjct: 358 GPALERRMSAKLGLEIERDGDAALVNRLFDVMHANRADFTLTFRNLARLSKRDASGD--- 414
Query: 530 LLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQ 589
P++ + LD + A+ +W Y L D R MN VNPK+VLRN+L +
Sbjct: 415 --APVRDLFLD-----RAAFDAWANDYRARLSHETRDDAARAIAMNRVNPKFVLRNHLAE 467
Query: 590 SAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+AI A+ DF E+ RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 468 TAIRRAKEKDFSELERLAAVLRRPFDEQPEHEAYAGLPPDWA---SSLEVSCSS 518
>gi|419957388|ref|ZP_14473454.1| hypothetical protein PGS1_04945 [Enterobacter cloacae subsp.
cloacae GS1]
gi|388607546|gb|EIM36750.1| hypothetical protein PGS1_04945 [Enterobacter cloacae subsp.
cloacae GS1]
Length = 480
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 209/518 (40%), Positives = 289/518 (55%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ ++N +L+ ++++ADSL + F+ + G T L G P AQ
Sbjct: 13 LPGFYTALKPTP-LQNARLIWHNDALADSLGIPSTLFQPEKGAGVWGGETLLPGMKPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE + E + LKGAG TPYSR DG AVLRS+IR
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQVLPNGETLDWHLKGAGLTPYSRMGDGRAVLRSTIR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPT+RAL +VT+ V R+ E GA++ RVA+S LRFG ++
Sbjct: 132 EGLASEAMHALGIPTSRALSIVTSDTPVARETM-------EQGAMLVRVAESHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + D VR LADYA+R H+ H++N ++Y W
Sbjct: 185 HFYYR--REPDKVRQLADYALRRHWPHLQN---------------------EPDRYVLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ RTA+++A+WQ VGF HGV+NTDNMS+LGLT DYGP+GFLD + P + N +D G
Sbjct: 222 RDIVARTAAMIARWQAVGFAHGVMNTDNMSLLGLTFDYGPYGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID N ++ Y + EY +M +LGL
Sbjct: 282 -RYRFDNQPAVGLWNLQRLAQSL--SPFIDVDALNDALDSYQEVLLREYGVLMRTRLGLM 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + +++ L MA + DYT FR LS + PL+ +D
Sbjct: 339 TQEKGDNALLNGLFAIMAREGSDYTRTFRMLSQTAQQSAAS------PLRDEFVD----- 387
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ W +Y L I D+ R+A M +VNP VLRN+L Q AI+ AE GD+ E+ R
Sbjct: 388 RQAFDDWFAAYRARLQQEQIDDDTRQARMKAVNPAMVLRNWLAQRAIEQAEQGDYTELHR 447
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 448 LHIALRTPFADRE--DDYVSRPPDWGKRLEV---SCSS 480
>gi|417958050|ref|ZP_12600967.1| SelO family protein [Neisseria weaveri ATCC 51223]
gi|343967442|gb|EGV35687.1| SelO family protein [Neisseria weaveri ATCC 51223]
Length = 492
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 216/506 (42%), Positives = 279/506 (55%), Gaps = 53/506 (10%)
Query: 145 PQLVAWSESVADSLELDPKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLG 203
P VA + +A+ + L P E F+ D L+ +G+ P A Y GHQFG++ QLG
Sbjct: 33 PYWVAQNHVLAEEMGLRPSEIFDNADNLLYLAGSAKQYDPAPIASVYSGHQFGVYVRQLG 92
Query: 204 DGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPT 263
DGRA+ +G+ + RWE QLKGAGKTPYSRFADG AVLRSSIRE+LCSEAMH LGIPT
Sbjct: 93 DGRAVLIGDSVGSDGLRWEWQLKGAGKTPYSRFADGRAVLRSSIREYLCSEAMHGLGIPT 152
Query: 264 TRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRT 323
TRAL + + V R+ + E A+V R+A SF+RFG ++ GQ +
Sbjct: 153 TRALAITGSNDAVYRE-------EAETAAVVTRIAPSFIRFGHFEYMYHTGQH--HNLPV 203
Query: 324 LADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQ 383
LAD+ I HF K F T V+ RTA LVA WQ
Sbjct: 204 LADFLIDRHFPECREAEKPYLALFET---------------------VSRRTAELVAAWQ 242
Query: 384 GVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWN 443
VGF HGVLNTDNMS LGLTIDYGPFGFLDA+D N +D G RY + QP + WN
Sbjct: 243 SVGFCHGVLNTDNMSALGLTIDYGPFGFLDAYDRRHVCNHSDTGG-RYAYNEQPYVVHWN 301
Query: 444 IAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLN 500
+++F++ L DD A +ER+ F Y M KLGL K ++I+ +
Sbjct: 302 LSRFASCLLPLVSQDDLVAE--LERFPDIFQTAYLQKMRAKLGLQTQEKGDDELIADMFT 359
Query: 501 NMAVDKVDYTNFFRALS---NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYI 557
+ KVD+T FFR LS NV +P +PE LL + EA+ +W+ Y
Sbjct: 360 ALQSRKVDFTLFFRYLSEVGNVHGEP-LPEK---------LLALFHGPTEAFTAWIGRYR 409
Query: 558 QELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQ 617
L + + ER MN+VNP YVLRNYL + AI A+ GDF E+ RL + M+ P+ E+
Sbjct: 410 GRLRAENSNPAERAERMNAVNPLYVLRNYLLEQAIQLAKSGDFREIERLHRCMQNPFVER 469
Query: 618 PGMEKYARLPPAWAYRPGVCMLSCSS 643
+A LPP WA G+C +SCSS
Sbjct: 470 KEFADFAELPPQWA--EGIC-VSCSS 492
>gi|156406460|ref|XP_001641063.1| predicted protein [Nematostella vectensis]
gi|156228200|gb|EDO49000.1| predicted protein [Nematostella vectensis]
Length = 574
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 222/594 (37%), Positives = 312/594 (52%), Gaps = 111/594 (18%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LE L +D+ +R LP D T + R+V AC++ V P A V NP+ V +SES + L
Sbjct: 1 MATLETLTFDNLALRSLPIDKETKNYVRQVEGACFSLVEP-APVSNPKTVVFSESALELL 59
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L E ER +F +FSG L G P + CY GHQFG ++GQLGDG A+ LGE++N K
Sbjct: 60 DLHKAEIERQEFAQYFSGNKLLPGTRPASHCYCGHQFGYFSGQLGDGAAMYLGEVINSKG 119
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERWE+QLKG+G TPYSR ADG VLRSSIREFLCSEAM+ LGIPTTRA VT+ V R
Sbjct: 120 ERWEMQLKGSGLTPYSRQADGRKVLRSSIREFLCSEAMYHLGIPTTRAGSCVTSDTKVIR 179
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQEDLDIVRTLADY 327
D+FY+GN K E I+ R+A +F+RFGS++I S G++D I+ L +Y
Sbjct: 180 DIFYNGNAKSEKATIILRIAPTFIRFGSFEIFKPIDPVTGRKGPSTGRKD--ILLQLLEY 237
Query: 328 AIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGF 387
I+ + I +++ S +Y A+ ++ +TA LVAQWQ VGF
Sbjct: 238 TIKTFYPKIYDLHSS-----------------PEERYLAFYKDLVVKTARLVAQWQCVGF 280
Query: 388 THGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF 447
HGVLNTDNMSI+GLTIDYGPFGF+DAFDP N +D RY + QP+I WN+ +
Sbjct: 281 CHGVLNTDNMSIVGLTIDYGPFGFMDAFDPQHICNDSDADRGRYRYGAQPEICKWNLMKL 340
Query: 448 S----------TTLAAAKLIDDKEAN-----------------------------YVMER 468
+LAA + + DKE M +
Sbjct: 341 GEAIHDALPVDQSLAALEELYDKEYQGAFLSKMRLKLGLLNKQQPEDVDLIEALFETMHK 400
Query: 469 YGTKFMDEYQAIMTKKLGLPKYNKQ--------------IISKLLNNMAVDKVDYTNFFR 514
G F + ++A+ +LG P+ +KQ + L + +DY
Sbjct: 401 TGADFTNTFRAL--SRLGAPQVSKQERVEDVTEYIRQQCLSLDELKQASQPSMDYRQIQM 458
Query: 515 ALSNVKADPSI------------PEDELLVPLKAVLLDIGKERKEA-----WISWVLSY- 556
++++P + E E + LK L + +E K+A W +W+ Y
Sbjct: 459 FQMLMQSNPGLLDQLGGGISVIKKELEKVEKLKQ-LRETSQEEKDAKDTHLWNTWIERYR 517
Query: 557 ------IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
+ E+ + ++ R +M NP+++LRNY+ Q+AI AAE GDF EV+
Sbjct: 518 SRLSEDMDEVDNVDEANSTRVNIMKVNNPRFILRNYIAQNAITAAENGDFTEVK 571
>gi|126438842|ref|YP_001059332.1| hypothetical protein BURPS668_2297 [Burkholderia pseudomallei 668]
gi|126218335|gb|ABN81841.1| conserved hypothetical protein [Burkholderia pseudomallei 668]
Length = 525
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 226/547 (41%), Positives = 298/547 (54%), Gaps = 71/547 (12%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 28 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 85
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 86 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 143
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 144 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 196
Query: 295 CRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RVAQSF+RFG ++ + Q + +R LAD+ I E + D D
Sbjct: 197 TRVAQSFVRFGHFEHFFANDQPEQ--LRALADHVI-------------ERFYPACRDAD- 240
Query: 355 SVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDA 414
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+DA
Sbjct: 241 -------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFIDA 293
Query: 415 FDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDD 459
FD N +D G RY + QP I WN + L A + ++D
Sbjct: 294 FDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVED 352
Query: 460 KEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRAL 516
A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR L
Sbjct: 353 --AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRHL 410
Query: 517 SNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNS 576
+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 411 ARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMNR 461
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGV 636
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 462 VNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---ST 518
Query: 637 CMLSCSS 643
+SCSS
Sbjct: 519 LEVSCSS 525
>gi|308186658|ref|YP_003930789.1| hypothetical protein Pvag_1147 [Pantoea vagans C9-1]
gi|308057168|gb|ADO09340.1| UPF0061 protein [Pantoea vagans C9-1]
Length = 483
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 216/542 (39%), Positives = 297/542 (54%), Gaps = 69/542 (12%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF 165
++D+++ REL G CYT ++P+ + +L+ + +A S+ LD F
Sbjct: 7 SFDNTWFRELTG--------------CYTALNPTP-LAGGRLLYHNAPLATSMGLDSALF 51
Query: 166 ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQL 225
E ++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE + + L
Sbjct: 52 EGHGHDVW-HGAALLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLDDGSKLDWHL 110
Query: 226 KGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGN 285
KGAG TPYSR DG AV+RSS+REFL SEA+H LGIPTTRAL L + V R+
Sbjct: 111 KGAGLTPYSRMGDGRAVIRSSVREFLASEALHHLGIPTTRALTLSIGDEPVYRE------ 164
Query: 286 PKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
E GA++ R++ S LRFG ++ S+ QE V+ LADYAIRHH+ H+E
Sbjct: 165 -TTERGAMLMRISPSHLRFGHFEHFFYSQQQEK---VQQLADYAIRHHWPHLEE------ 214
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+++Y W ++ RTA L+A WQ VGF HGV+NTDNMSILGLTI
Sbjct: 215 ---------------EADRYQQWFTDIVLRTARLIALWQSVGFAHGVMNTDNMSILGLTI 259
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGPFGFLD + P F N +D G RY F NQP IG+WN+ + + L+ L+ ++
Sbjct: 260 DYGPFGFLDDYQPDFICNHSDYQG-RYSFENQPMIGMWNLNRLAHALSG--LLTTEQLRT 316
Query: 465 VMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ Y + M + M KLGL + QI++ LL M + DYT FR LS +
Sbjct: 317 ALSAYEPELMRVWGERMRAKLGLLTQQSSDNQILTDLLALMTQEHSDYTLTFRLLSETQ- 375
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+ E PL+ +D +EA+ W Y L+ +SDEER+ +M + NP
Sbjct: 376 -----QAESRSPLRDEFID-----REAFDGWYQRYRSRLMDEQVSDEERQTVMKAANPAV 425
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
+LRNYL Q I+ AE G+ G + RL + ++RP+ ++ E Y + PP W +SC
Sbjct: 426 ILRNYLAQQVIEEAERGEQGALARLHQALQRPFSDETAAE-YRQRPPDWG---KTLEVSC 481
Query: 642 SS 643
SS
Sbjct: 482 SS 483
>gi|171321058|ref|ZP_02910041.1| protein of unknown function UPF0061 [Burkholderia ambifaria MEX-5]
gi|171093672|gb|EDT38822.1| protein of unknown function UPF0061 [Burkholderia ambifaria MEX-5]
Length = 522
Score = 352 bits (902), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 222/535 (41%), Positives = 295/535 (55%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V S+ VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLPAPYVVGCSDEVAQLLGLPASFAAQPGFAELFAGNPTRDWPANALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ +R+ELQ+KG G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGQRYELQIKGGGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNDRPDL--LRQLADHVI---------------------DRFYPACRDADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATLRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFVDAFDANHICNHSDTS 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + ++D +A V+ ++ +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVEDAQA--VLAKFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++ +KLL M D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLELERENDAELANKLLETMHASHADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D ++A+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----RDAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 EVAIRRAKEKDFSEVERLAQVLRRPFDEQPEHEAYAALPPDWA---GSLEVSCSS 522
>gi|429093367|ref|ZP_19155963.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
dublinensis 1210]
gi|426741779|emb|CCJ82076.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
dublinensis 1210]
Length = 482
Score = 352 bits (902), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 213/529 (40%), Positives = 293/529 (55%), Gaps = 53/529 (10%)
Query: 118 DPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGA 177
+P + R+ L YT+++P+ + N +L+ + +A +LEL P F+ + G
Sbjct: 4 NPHFTATWRDELPGFYTELTPTP-LSNSRLLCHNAPLAQTLELPPALFDYQGPAGVWGGE 62
Query: 178 TPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFA 237
T L G P AQ Y GHQFG+WAGQLGDGR I LGE +++ LKGAG TPYSR
Sbjct: 63 TLLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKFDWHLKGAGLTPYSRMG 122
Query: 238 DGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRV 297
DG AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+
Sbjct: 123 DGRAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRI 175
Query: 298 AQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVV 357
A+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 176 AESHVRFGHFEHFYYR--REPERVRELAQYVIAHHFAHLAQ------------EED---- 217
Query: 358 DLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDP 417
++A W EV RTA L+A WQ VGF+HGV+NTDNMS+LGLT+DYGP+GFLD ++P
Sbjct: 218 -----RFALWFGEVVTRTAHLMASWQCVGFSHGVMNTDNMSVLGLTMDYGPYGFLDDYNP 272
Query: 418 SFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEY 477
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 273 GFICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDEYQPALLREW 329
Query: 478 QAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
M KLG + + + +LL MA + DYT FR LS + + PL
Sbjct: 330 GRQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSVTEQSSAAS------PL 383
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ +D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+A
Sbjct: 384 RDEFID-----RATFDAWFARYRARLQEEGVEDDARQRLMKSVNPALVLRNWLAQRAIEA 438
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AE D E+ RLL + P+ ++ + Y PP W V SCSS
Sbjct: 439 AERDDASELSRLLDALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|415939651|ref|ZP_11555544.1| hypothetical protein HFRIS_03809 [Herbaspirillum frisingense GSF30]
gi|407759285|gb|EKF69000.1| hypothetical protein HFRIS_03809 [Herbaspirillum frisingense GSF30]
Length = 491
Score = 352 bits (902), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 218/518 (42%), Positives = 288/518 (55%), Gaps = 51/518 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT++ P+ + +P LV +S+ A ++ L E F F+G G+ + Y
Sbjct: 20 AFYTRLQPT-PLPDPYLVGFSDEAAATIGLARPAPEDRGFLDIFAGNQLAPGSQALSAVY 78
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
GHQFG+WAGQLGDGRAITLG++ + R ELQLKGAGKTPYSR DG AVLRSSIRE
Sbjct: 79 SGHQFGVWAGQLGDGRAITLGDLPAATGQGRIELQLKGAGKTPYSRMGDGRAVLRSSIRE 138
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
FLCSEAM LGIPTTRAL ++ + + V R+ E A+V R+A SF+RFGS++
Sbjct: 139 FLCSEAMAALGIPTTRALTVIGSDQRVQRE-------TAETAAVVTRMAPSFIRFGSFE- 190
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
H Q D ++ L D + + + N Y A
Sbjct: 191 HWYYNQR-FDDLKVLGDAVLEQFYPELLR---------------------EENPYQALLK 228
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EV RTA+L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N TD G
Sbjct: 229 EVTRRTATLMAQWQAVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDARHICNHTDSQG- 287
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEAN-YVMERYGTKFMDEYQAIMTKKLGLP 488
RY + QP IG WN F+ A LI EA + Y F ++ A++ KLGL
Sbjct: 288 RYSYQMQPRIGQWNC--FALGQAMLPLIGSVEATEAALADYEAVFQAQHDALLHAKLGLR 345
Query: 489 KY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ Q+I + + VD+T FFR L +++ + ++ PL+ + +D
Sbjct: 346 TQRADDSQLIEAMFALLQAGHVDFTLFFRRLGDLQIGNAANDE----PLRDLFID----- 396
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+ A+ +W Y L D R+ M++VNPKYVLRNYL Q AID A+ DF EV R
Sbjct: 397 RPAFDAWATQYRARLRDEDSDDAGRRLAMHAVNPKYVLRNYLAQVAIDKAQQKDFTEVAR 456
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L ++ P+DEQP ++YA LPP WA V SCSS
Sbjct: 457 LQTILRHPFDEQPEFDRYADLPPDWASHLEV---SCSS 491
>gi|421468836|ref|ZP_15917347.1| hypothetical protein BURMUCF1_1780 [Burkholderia multivorans ATCC
BAA-247]
gi|400231085|gb|EJO60806.1| hypothetical protein BURMUCF1_1780 [Burkholderia multivorans ATCC
BAA-247]
Length = 522
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 222/535 (41%), Positives = 293/535 (54%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S+ VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDEVARLLGLPASLAAQPGFAELFAGNPTRDWPAEALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG+G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGSGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + + R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPIVREEI-------ETAAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNNRPDL--LRALADHVI---------------------DRFYPACRDADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTG 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ + +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLATFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++LL M D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLELERDSDAALANQLLETMHASHADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D +EA+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 ELAIRRAKEKDFSEVERLAQVLRRPFDEQPEHESYAALPPDWA---GSLEVSCSS 522
>gi|283785070|ref|YP_003364935.1| hypothetical protein ROD_13491 [Citrobacter rodentium ICC168]
gi|282948524|emb|CBG88113.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
Length = 480
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 292/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ ++N +L+ + ++A L + F+ + G + L G P
Sbjct: 10 RDELPATYTALSPTP-LKNARLIWHNSALAQQLNIPQTLFDADGPAGVWGGESLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQALPDGSILDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALTIVTSDTPVYRETV-------ESGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ H+ ++KY
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPHLHE---------------------ETDKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD ++P F N +D
Sbjct: 219 LWFRDVVARTATLIADWQTVGFAHGVMNTDNMSILGLTMDYGPFGFLDDYEPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + + Y M +KL
Sbjct: 279 HQG-RYRFDNQPAVGLWNLQRLAQSL--SPFIGVEALNNALDEYQQELLRRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++ L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 336 GFISEQKEDNELLNALFSLMARERSDYTRTFRMLSRTEQQSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y + LL G+ D R+ LM SVNP VLRN+L Q AI AA+ GD E
Sbjct: 388 ---RAAFDAWFARYRERLLRDGVDDAARQMLMLSVNPALVLRNWLAQRAISAADQGDMSE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y PP W V SCSS
Sbjct: 445 LHRLHAALRDPFTDRS--DDYVNRPPDWGRHLEV---SCSS 480
>gi|429120255|ref|ZP_19180939.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
sakazakii 680]
gi|426325321|emb|CCK11676.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
sakazakii 680]
Length = 482
Score = 351 bits (901), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 215/528 (40%), Positives = 290/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L+ + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPGFYTELTPTP-LNNSRLLCHNAPLAQALELPETLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYR--REPERVRELAQYVIEHHFAHL------------VQEED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVTRTAQLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPALLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLRELLTLMACEGSDYTRTFRMLSETEQHSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGVEDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+ ++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|345298923|ref|YP_004828281.1| hypothetical protein Entas_1755 [Enterobacter asburiae LF7a]
gi|345092860|gb|AEN64496.1| UPF0061 protein ydiU [Enterobacter asburiae LF7a]
Length = 480
Score = 351 bits (901), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 212/520 (40%), Positives = 287/520 (55%), Gaps = 57/520 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ ++N +L+ ++ +AD+L + P F + + G T L G P AQ
Sbjct: 13 LPGFYTALKPTP-LQNARLIWHNDQLADALGVPPALFRPSEGAGVWGGETLLPGMNPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE + ++ LKGAG TPYSR DG AVLRS+IR
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQQLPDGQSFDWHLKGAGLTPYSRMGDGRAVLRSTIR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPTTRAL +VT+ V R+ E GA++ RVAQS LRFG ++
Sbjct: 132 ECLASEAMHALGIPTTRALSIVTSDTPVARETM-------EQGAMLMRVAQSHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + D VR LADYAIR H+ +++ ++KY W
Sbjct: 185 HFYYR--REPDKVRQLADYAIRRHWPALKD---------------------EADKYRLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V RTAS++A+WQ VGF HGV+NTDNMSILGLT DYGP+GFLD + P + N +D G
Sbjct: 222 CDVVARTASMIARWQSVGFAHGVMNTDNMSILGLTFDYGPYGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID N ++ Y + EY +M KLGL
Sbjct: 282 -RYSFDNQPAVGLWNLQRLAQSL--SPFIDVDALNDALDTYQDVLLREYGKLMRGKLGLI 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVK--ADPSIPEDELLVPLKAVLLDIGK 543
K + I++ L MA + DYT FR L + + S+ DE +
Sbjct: 339 TQEKGDNDILNGLFALMAREGSDYTRTFRMLGQTEQHSSASVLRDEFI------------ 386
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
++A+ W Y L + D R+A MN+ NP VLRN+L Q AI+ AE G++ E+
Sbjct: 387 -DRQAFDDWYRQYRARLQRDNVDDATRQAQMNAANPAMVLRNWLAQRAIEQAEQGEYAEL 445
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL + P+ ++ + Y PP W R V SCSS
Sbjct: 446 HRLHLALRTPFADRD--DDYVSRPPDWGKRLEV---SCSS 480
>gi|326317156|ref|YP_004234828.1| hypothetical protein Acav_2349 [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323373992|gb|ADX46261.1| protein of unknown function UPF0061 [Acidovorax avenae subsp.
avenae ATCC 19860]
Length = 496
Score = 351 bits (901), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 222/515 (43%), Positives = 281/515 (54%), Gaps = 53/515 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P+ + +P+ VA SE A + LD SG L G P A Y G
Sbjct: 31 FTELVPT-PLPDPRWVAGSEVTARLIGLDTDWLGSDAAVQVLSGNALLRGMRPLASVYSG 89
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI LGE +E+QLKG+G+TPYSR DG AVLRSSIREFLC
Sbjct: 90 HQFGVWAGQLGDGRAILLGE----TETGYEVQLKGSGRTPYSRMGDGRAVLRSSIREFLC 145
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + V R+ + E A+V RVA SF+RFG ++ A+
Sbjct: 146 SEAMHALGIPTTRALALTASPAPVARE-------EIETAAVVTRVAPSFVRFGHFEHFAA 198
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R Q + +R LADY I ++ +GD N YAA V
Sbjct: 199 RDQ--VRELRALADYVIDRYYPGCRG----------SGDAP------GGNPYAALLQAVG 240
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA+L+AQWQ VGF HGV+NTDNMSILGLTIDYGPF FLDAF P N +D G RY
Sbjct: 241 ARTAALIAQWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFVPGHICNHSDSQG-RYA 299
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGL---P 488
F QP + WN+ F A LI+D A +E Y T F EY A M KLGL
Sbjct: 300 FNRQPQVAYWNL--FCLGQALMPLIEDTGLAQAALEPYRTAFPAEYMARMRAKLGLASAA 357
Query: 489 KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ + ++ LL +A D VDYT F+ LS A VP++ + +D +
Sbjct: 358 EGDAALVDDLLGLLATDAVDYTVFWHRLSQAVASGD------FVPVRDLFVD-----RAG 406
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
W +W Y Q L + D +LM NP++VLRN+L + AI AA+ GDF + L
Sbjct: 407 WDAWAARYRQRLGNEAAQDP--ASLMQRTNPRFVLRNHLGEQAIRAAKTGDFAPLHALQA 464
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ RP+DE P +A PP WA +SCSS
Sbjct: 465 VLARPFDEHPAHADWAGFPPDWA---SSIEISCSS 496
>gi|395233636|ref|ZP_10411875.1| hypothetical protein A936_08263 [Enterobacter sp. Ag1]
gi|394731850|gb|EJF31571.1| hypothetical protein A936_08263 [Enterobacter sp. Ag1]
Length = 481
Score = 351 bits (901), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 210/518 (40%), Positives = 298/518 (57%), Gaps = 54/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L Y++++P+ ++N +L+ S+ +AD L ++ F P ++ SG T L G P AQ
Sbjct: 15 LPGFYSELTPTP-LKNARLLYHSQPLADDLGINASFFAAPQQGIW-SGETLLPGMQPLAQ 72
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR DG AVLRS++R
Sbjct: 73 VYSGHQFGVWAGQLGDGRGILLGEQQLADGRKVDWHLKGAGLTPYSRMGDGRAVLRSTVR 132
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ RV++S LRFG ++
Sbjct: 133 EFLASEAMHALGIPTTRALTIVTSDTPVQRETV-------EQGAMLLRVSESHLRFGHFE 185
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + + V+ LADYAIRHH+ H++ + + +Y W
Sbjct: 186 HFYYR--REPEKVQQLADYAIRHHWPHLQGLEE---------------------RYELWF 222
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P F N +D G
Sbjct: 223 TDVVARTAALIASWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPEFICNHSDYQG 282
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + TL + I ++ N +++ Y M + M KLGL
Sbjct: 283 -RYAFDNQPAVGLWNLQRLAQTL--SPFITAEKLNAILDGYQPAIMRAFGQRMRAKLGLF 339
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ + I+S+L M+ + DYT FR LS + + PL+ +D
Sbjct: 340 TEQQADNLILSELFALMSKEGSDYTRTFRMLSVTEQLSAAS------PLRDEFID----- 388
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+ ++ +W Y + L + +SD ER+ M +VNP VLRN+L Q AI+AAE GD E+ +
Sbjct: 389 RASFDAWFGRYRERLQAEQVSDAERQQKMQAVNPALVLRNWLAQRAIEAAEKGDTRELAK 448
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + + +P+ ++ + + PP W R V SCSS
Sbjct: 449 LHEALLQPFSDRE--DDMTQRPPDWGKRLEV---SCSS 481
>gi|399023273|ref|ZP_10725337.1| hypothetical protein PMI13_01274 [Chryseobacterium sp. CF314]
gi|398083243|gb|EJL73962.1| hypothetical protein PMI13_01274 [Chryseobacterium sp. CF314]
Length = 532
Score = 351 bits (901), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 205/537 (38%), Positives = 300/537 (55%), Gaps = 37/537 (6%)
Query: 111 FVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
F++ GD + + R L ++ ++P A ++P+L+A++E +++ + L +F D
Sbjct: 29 FIKNFSGDFSGNPMQRATLKVLFSTINP-AGFDHPKLIAFNEKLSEEIGLG--KFNEQDL 85
Query: 171 PLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
P PYA Y GHQFG WAGQLGDGRAI GEI+N E+ E+Q KGAG
Sbjct: 86 DFLVGNNLP-ENVQPYATAYAGHQFGNWAGQLGDGRAILAGEIMNNAGEKTEIQWKGAGA 144
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSR ADG AVLRSS+RE+L SEAM L +PTTRAL L TG+ + RDM YDGNP E
Sbjct: 145 TPYSRHADGRAVLRSSVREYLMSEAMFHLKVPTTRALSLCFTGEDIIRDMMYDGNPGYEQ 204
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
GA++ R A+SFLRFG +++ ++ Q + +++ L D+ I+++F I S+G
Sbjct: 205 GAVIIRTAESFLRFGHFELISA--QREYKMLQDLVDFTIQNYFPEIT----------SSG 252
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
+++Y + V RTA L+ +W VGF HGV+NTDNMS+LGLTIDYGP+
Sbjct: 253 ----------TDRYKDFFKNVCTRTADLMTEWFRVGFVHGVMNTDNMSVLGLTIDYGPYS 302
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYG 470
+D +D +FTPNTTDLPGRRY F Q I WN+ Q + L + D+K + +G
Sbjct: 303 MMDEYDLNFTPNTTDLPGRRYAFGKQGQISQWNLWQLANALHPL-IKDEKFLEDTLNSFG 361
Query: 471 TKFMDEYQAIMTKKLG---LPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
F + + ++ +K G L + +++ S M ++D+T FF L ++ ++
Sbjct: 362 NYFWENHDKMLCRKFGFDQLQETDEEFFSNWQALMQELQLDHTLFFHQLEKLQDSTNLSS 421
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
V ++L + E +I Y + L ++ IS E+ LM NPK++LRNYL
Sbjct: 422 LFENVSY-SILTSDAIVKLENFIK---KYRERLSANQISQEDALELMKKNNPKFILRNYL 477
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDE-QPGMEKYARLPPAWAYRPGVCMLSCSS 643
I+ + G + +L +E PY+E P K R P + G MLSCSS
Sbjct: 478 LFECIEEIKEGKTKMLDKLTHALENPYEELYPEFSK--RRPSGYDDISGCSMLSCSS 532
>gi|62179934|ref|YP_216351.1| hypothetical protein SC1364 [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
gi|375114254|ref|ZP_09759424.1| UPF0061 protein ydiU [Salmonella enterica subsp. enterica serovar
Choleraesuis str. SCSA50]
gi|75483699|sp|Q57PU1.1|YDIU_SALCH RecName: Full=UPF0061 protein YdiU
gi|62127567|gb|AAX65270.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
gi|322714400|gb|EFZ05971.1| UPF0061 protein ydiU [Salmonella enterica subsp. enterica serovar
Choleraesuis str. SCSA50]
Length = 480
Score = 351 bits (901), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 293/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVCSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KY
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYV 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL ID N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPSVALWNLQRLAQTLTPFIEID--ALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DY+ FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNVLLNELFSLMAREGSDYSRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + + D R+ M VNP VLRN+L Q AIDAAE GD E
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ L +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHWLHEVLRQPFTDRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|421477665|ref|ZP_15925475.1| hypothetical protein BURMUCF2_1776 [Burkholderia multivorans CF2]
gi|400226126|gb|EJO56223.1| hypothetical protein BURMUCF2_1776 [Burkholderia multivorans CF2]
Length = 522
Score = 351 bits (901), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 223/535 (41%), Positives = 294/535 (54%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S+ VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSDEVARLLGLPASLAAQPGFAELFAGNPTREWPAEALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG+G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGSGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNNRPDL--LRALADHVI---------------------DRFYPACRDADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATLRTAELVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTG 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ + +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLATFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++LL M + D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLALERDGDAALANQLLETMHASRADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D +EA+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 ELAIRRAKEKDFSEVERLAQVLRRPFDEQPEHESYAALPPDWA---GSLEVSCSS 522
>gi|53719058|ref|YP_108044.1| hypothetical protein BPSL1422 [Burkholderia pseudomallei K96243]
gi|167738147|ref|ZP_02410921.1| hypothetical protein Bpse14_08775 [Burkholderia pseudomallei 14]
gi|167815334|ref|ZP_02447014.1| hypothetical protein Bpse9_09334 [Burkholderia pseudomallei 91]
gi|167823741|ref|ZP_02455212.1| hypothetical protein Bpseu9_08685 [Burkholderia pseudomallei 9]
gi|167910524|ref|ZP_02497615.1| hypothetical protein Bpse112_08520 [Burkholderia pseudomallei 112]
gi|217421896|ref|ZP_03453400.1| conserved hypothetical protein [Burkholderia pseudomallei 576]
gi|226197134|ref|ZP_03792711.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan
9]
gi|237812656|ref|YP_002897107.1| hypothetical protein GBP346_A2406 [Burkholderia pseudomallei
MSHR346]
gi|254189163|ref|ZP_04895674.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur
52237]
gi|254260168|ref|ZP_04951222.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a]
gi|386861443|ref|YP_006274392.1| hypothetical protein BP1026B_I1357 [Burkholderia pseudomallei
1026b]
gi|418382843|ref|ZP_12966768.1| hypothetical protein BP354A_1220 [Burkholderia pseudomallei 354a]
gi|418533714|ref|ZP_13099573.1| hypothetical protein BP1026A_0636 [Burkholderia pseudomallei 1026a]
gi|418540586|ref|ZP_13106114.1| hypothetical protein BP1258A_1031 [Burkholderia pseudomallei 1258a]
gi|418546830|ref|ZP_13112019.1| hypothetical protein BP1258B_1125 [Burkholderia pseudomallei 1258b]
gi|418553049|ref|ZP_13117890.1| hypothetical protein BP354E_0933 [Burkholderia pseudomallei 354e]
gi|52209472|emb|CAH35424.1| conserved hypothetical protein [Burkholderia pseudomallei K96243]
gi|157936842|gb|EDO92512.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur
52237]
gi|217395638|gb|EEC35656.1| conserved hypothetical protein [Burkholderia pseudomallei 576]
gi|225930513|gb|EEH26523.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan
9]
gi|237503465|gb|ACQ95783.1| conserved hypothetical protein [Burkholderia pseudomallei MSHR346]
gi|254218857|gb|EET08241.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a]
gi|385360674|gb|EIF66588.1| hypothetical protein BP1026A_0636 [Burkholderia pseudomallei 1026a]
gi|385361076|gb|EIF66974.1| hypothetical protein BP1258A_1031 [Burkholderia pseudomallei 1258a]
gi|385362859|gb|EIF68653.1| hypothetical protein BP1258B_1125 [Burkholderia pseudomallei 1258b]
gi|385372165|gb|EIF77290.1| hypothetical protein BP354E_0933 [Burkholderia pseudomallei 354e]
gi|385376962|gb|EIF81591.1| hypothetical protein BP354A_1220 [Burkholderia pseudomallei 354a]
gi|385658571|gb|AFI65994.1| hypothetical protein BP1026B_I1357 [Burkholderia pseudomallei
1026b]
Length = 525
Score = 351 bits (901), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 228/548 (41%), Positives = 298/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 28 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 85
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 86 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 143
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 144 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 196
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 197 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 240
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 241 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 292
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 293 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 351
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR
Sbjct: 352 D--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 409
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 410 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMN 460
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 461 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---S 517
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 518 TLEVSCSS 525
>gi|375261361|ref|YP_005020531.1| hypothetical protein KOX_22870 [Klebsiella oxytoca KCTC 1686]
gi|397658455|ref|YP_006499157.1| Selenoprotein O and cysteine-containing protein [Klebsiella oxytoca
E718]
gi|365910839|gb|AEX06292.1| hypothetical protein KOX_22870 [Klebsiella oxytoca KCTC 1686]
gi|394346754|gb|AFN32875.1| Selenoprotein O and cysteine-containing protein [Klebsiella oxytoca
E718]
Length = 480
Score = 351 bits (900), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 215/521 (41%), Positives = 288/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ +EN +LV + +A +L +D F + G T L G P
Sbjct: 10 RDELPDFYTALTPTP-LENARLVWHNAPLARTLGVDASLFSPQKGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +V + V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIREALASEAMHALGIPTTRALAIVASDTPVYRETV-------ERGAMLMRLAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ H +E L V+ LADY IRHH+ H++N ++KY
Sbjct: 182 HFE-HFYYRREPLK-VQQLADYVIRHHWPHLQN---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 LWFSDVVTRTAEMIACWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + TL + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFDNQPAVGLWNLQRLAQTL--SPFISAEALNDALDSYQHALLTAYGRRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + +++ L M + DYT FR LS + + + PL+ +D
Sbjct: 336 GLFTQQKGDNELLDGLFALMEREGSDYTRTFRMLSASEQESAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+E + SW +Y L + D +R+A M SVNP VLRN+L Q AI+ AE GD E
Sbjct: 388 ---RETFDSWFTAYRARLRDEQVDDAQRQARMRSVNPAIVLRNWLAQRAIEQAEQGDMSE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ ++Y + PP W R V SCSS
Sbjct: 445 LERLHSALSHPFADR--TDEYIQRPPDWGRRLEV---SCSS 480
>gi|76811875|ref|YP_333852.1| hypothetical protein BURPS1710b_2457 [Burkholderia pseudomallei
1710b]
gi|254297331|ref|ZP_04964784.1| conserved hypothetical protein [Burkholderia pseudomallei 406e]
gi|121957746|sp|Q63V22.2|Y1422_BURPS RecName: Full=UPF0061 protein BPSL1422
gi|121957866|sp|Q3JRF1.1|Y2457_BURP1 RecName: Full=UPF0061 protein BURPS1710b_2457
gi|76581328|gb|ABA50803.1| Uncharacterized conserved protein [Burkholderia pseudomallei 1710b]
gi|157807595|gb|EDO84765.1| conserved hypothetical protein [Burkholderia pseudomallei 406e]
Length = 521
Score = 351 bits (900), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 228/548 (41%), Positives = 298/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 24 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 81
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 82 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 139
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 140 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 192
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 193 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 236
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 237 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 288
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 289 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 347
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR
Sbjct: 348 D--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 405
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 406 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMN 456
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 457 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---S 513
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 514 TLEVSCSS 521
>gi|89901172|ref|YP_523643.1| hypothetical protein Rfer_2395 [Rhodoferax ferrireducens T118]
gi|121957861|sp|Q21VU1.1|Y2395_RHOFD RecName: Full=UPF0061 protein Rfer_2395
gi|89345909|gb|ABD70112.1| protein of unknown function UPF0061 [Rhodoferax ferrireducens T118]
Length = 496
Score = 351 bits (900), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 221/508 (43%), Positives = 282/508 (55%), Gaps = 66/508 (12%)
Query: 148 VAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRA 207
V S S A L L + P+ +G P+AG P A Y GHQFG WAGQLGDGRA
Sbjct: 43 VGRSTSTARELGLSESWLDSPELLQVLTGNQPMAGTQPLASVYSGHQFGQWAGQLGDGRA 102
Query: 208 ITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRAL 267
I LGE L E+QLKG+G TPYSR DG AVLRSSIREFLCSEAM LGI T+RAL
Sbjct: 103 ILLGETGGL-----EVQLKGSGLTPYSRMGDGRAVLRSSIREFLCSEAMQGLGIATSRAL 157
Query: 268 CLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADY 327
C+V + + R+ E A+V RVA SF+RFG ++ H S + + + LADY
Sbjct: 158 CVVGSDAPIRRETV-------ETAAVVTRVAPSFIRFGHFE-HFSHHDQHAQL-KVLADY 208
Query: 328 AIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGF 387
I + +K N YAA V+ERTA+LVAQWQ VGF
Sbjct: 209 VIDRFYPECRASDK-----------------FAGNPYAALLEAVSERTAALVAQWQAVGF 251
Query: 388 THGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF 447
HGVLNTDNMSILGLTIDYGPF FLDAF+P N +D G RY F QP+I WN+ F
Sbjct: 252 CHGVLNTDNMSILGLTIDYGPFQFLDAFNPGHVCNHSDQEG-RYAFDKQPNIAYWNL--F 308
Query: 448 STTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGL-------PKYNKQIISKLL 499
A LI ++E A +E Y T F ++ +M KLGL ++ ++ +L
Sbjct: 309 CLGQALLPLIGEQELAIAALESYKTVFPAAFERLMFAKLGLLDASDSTATVDRALLQDIL 368
Query: 500 NNMAVDKVDYTNFFRALSN--VKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYI 557
+A ++VDYT F+R LS+ V D D + +D + A +W+L Y
Sbjct: 369 QLLAREQVDYTIFWRRLSHCGVATDAQTVRD--------LFVD-----RSAADAWLLRYS 415
Query: 558 QEL--LSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYD 615
+ L + G++ + LM NPK+VLRNYL + AI AA+L DF +V LL L+E P++
Sbjct: 416 ERLEHIPQGLAAD----LMLKTNPKFVLRNYLGEQAIQAAKLKDFSQVETLLMLLESPFE 471
Query: 616 EQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E PG +KYA PP WA +SCSS
Sbjct: 472 EHPGFDKYADFPPDWA---SSIEISCSS 496
>gi|449308520|ref|YP_007440876.1| hypothetical protein CSSP291_10010 [Cronobacter sakazakii SP291]
gi|449098553|gb|AGE86587.1| hypothetical protein CSSP291_10010 [Cronobacter sakazakii SP291]
Length = 482
Score = 351 bits (900), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 215/528 (40%), Positives = 290/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L+ + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPGFYTELTPTP-LNNSRLLCHNAPLAQALELPETLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALTIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYR--REPERVRELAQYVIEHHFAHLAQ------------EED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVTRTARLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPALLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSETEQRSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGVEDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+ ++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|167902283|ref|ZP_02489488.1| hypothetical protein BpseN_08427 [Burkholderia pseudomallei NCTC
13177]
Length = 525
Score = 351 bits (900), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 228/548 (41%), Positives = 298/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 28 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 85
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 86 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 143
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 144 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 196
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 197 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 240
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 241 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 292
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 293 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 351
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR
Sbjct: 352 D--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 409
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 410 LARVSKHDARGD----APVRDLFVD-----RDAFDRWANLYRARLSEEARDDASRAAAMN 460
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 461 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---S 517
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 518 TLEVSCSS 525
>gi|146311392|ref|YP_001176466.1| hypothetical protein Ent638_1736 [Enterobacter sp. 638]
gi|166980212|sp|A4W9N5.1|Y1736_ENT38 RecName: Full=UPF0061 protein Ent638_1736
gi|145318268|gb|ABP60415.1| protein of unknown function UPF0061 [Enterobacter sp. 638]
Length = 480
Score = 351 bits (900), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 211/514 (41%), Positives = 285/514 (55%), Gaps = 53/514 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT ++P+ ++N +L+ + S+A+ L + F+ + G T L G P AQ Y G
Sbjct: 17 YTALNPTP-LKNARLIWHNASLANDLGVPASLFQPETGAGVWGGETLLPGMHPLAQVYSG 75
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS+IRE L
Sbjct: 76 HQFGVWAGQLGDGRGILLGEQQLENGHTVDWHLKGAGLTPYSRMGDGRAVLRSTIRESLA 135
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPT+RAL +VT+ V R+ E GA++ R+AQS +RFG ++
Sbjct: 136 SEAMHALGIPTSRALSIVTSDTQVARESM-------EQGAMLMRIAQSHVRFGHFEHFYY 188
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + + VR LAD+ I HH+ +N ++KY W +V
Sbjct: 189 R--REPEKVRQLADFVIEHHWPQWQN---------------------DADKYVLWFQDVV 225
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTASL+A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD + P F N +D G RY
Sbjct: 226 ARTASLMACWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDYQPDFICNHSDYQG-RYS 284
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F NQP +GLWN+ + + +L + I + N ++RY M EY +M +KLGL K
Sbjct: 285 FENQPAVGLWNLQRLAQSL--SPFIAVEALNDALDRYQDVLMQEYGKLMRRKLGLMTQEK 342
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ I++ L M+ + DYT FR L + + PL+ +D ++ +
Sbjct: 343 GDNDILNALFALMSREGSDYTRTFRMLGQTEKHSAAS------PLRDEFID-----RQGF 391
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
SW +Y L D+ R A MN+VNP VLRN+L Q AID AE GD+ E+ RL
Sbjct: 392 DSWFATYRARLQREETPDDARNAHMNAVNPAMVLRNWLAQRAIDQAEQGDYAELHRLHDA 451
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P++++ + Y PP W R V SCSS
Sbjct: 452 LRTPFNDRD--DDYVSRPPDWGKRLEV---SCSS 480
>gi|440759900|ref|ZP_20939022.1| Cysteine-containing selenoprotein O [Pantoea agglomerans 299R]
gi|436426374|gb|ELP24089.1| Cysteine-containing selenoprotein O [Pantoea agglomerans 299R]
Length = 487
Score = 350 bits (899), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 216/542 (39%), Positives = 298/542 (54%), Gaps = 69/542 (12%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF 165
+D+++ REL G CYT ++P+ + +L+ + +A S+ LDP+ F
Sbjct: 11 TFDNTWFRELTG--------------CYTALNPTP-LTGGRLLYHNAPLATSMGLDPELF 55
Query: 166 ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQL 225
++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE + + L
Sbjct: 56 AGNGHDVW-HGAALLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLDDGSKLDWHL 114
Query: 226 KGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGN 285
KGAG TPYSR DG AV+RSS+REFL SEA+H LGIPTTRAL L + V R+
Sbjct: 115 KGAGLTPYSRMGDGRAVIRSSVREFLASEALHHLGIPTTRALTLSIGDEPVYRE------ 168
Query: 286 PKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
E GA++ R++ S LRFG ++ S+ QE V+ LADYAIRHH+ H+E
Sbjct: 169 -TTERGAMLMRISPSHLRFGHFEHFFYSQQQEK---VQQLADYAIRHHWPHLEA------ 218
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+++Y W ++ RTA L+A WQ VGF HGV+NTDNMSILGLTI
Sbjct: 219 ---------------EADRYQQWFTDIVLRTARLIALWQSVGFAHGVMNTDNMSILGLTI 263
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGPFGFLD + P F N +D G RY F NQP IGLWN+ + + L+ L+ ++
Sbjct: 264 DYGPFGFLDDYQPDFICNHSDYQG-RYSFENQPMIGLWNLNRLAHALSG--LLTTEQLRT 320
Query: 465 VMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ Y + M + M KLGL + +I++ LL M + DYT FR LS +
Sbjct: 321 ALSAYEPELMRVWGERMRAKLGLLTQQSNDNEILTDLLALMTQEHSDYTLTFRLLSETQ- 379
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+ E PL+ +D +EA+ W Y L+ +SD ER+A+M + NP
Sbjct: 380 -----QAESRSPLRDEFID-----REAFDGWYQRYRSRLMDEQVSDTERQAVMKAANPAV 429
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
+LRNYL Q AI+ AE G+ G + RL + +++P+ ++ E Y + PP W +SC
Sbjct: 430 ILRNYLAQQAIEEAERGEQGALARLHQALQQPFSDETAAE-YRQRPPDWG---KTLEVSC 485
Query: 642 SS 643
SS
Sbjct: 486 SS 487
>gi|224825670|ref|ZP_03698774.1| protein of unknown function UPF0061 [Pseudogulbenkiania
ferrooxidans 2002]
gi|224601894|gb|EEG08073.1| protein of unknown function UPF0061 [Pseudogulbenkiania
ferrooxidans 2002]
Length = 488
Score = 350 bits (899), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 211/517 (40%), Positives = 284/517 (54%), Gaps = 51/517 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A Y +V P+ + P VA S +A L + + D SG+ P A Y
Sbjct: 19 AFYRRVDPTP-LPGPYPVAVSRPLAAELGVVGESLLGADAVGVLSGSALRPDMRPVAAIY 77
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ QLGDGRA+ LG+ E Q+KGAG TP+SR DG AVLRSSIREF
Sbjct: 78 SGHQFGVYVPQLGDGRALLLGDTKAPDGRLMEWQIKGAGLTPFSRMGDGRAVLRSSIREF 137
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAMH LGIPTTRAL ++ + + V R+ E A+V RVA+SFLRFGS+++
Sbjct: 138 LCSEAMHHLGIPTTRALAIMGSDEPVYRE-------TTETAAVVTRVAESFLRFGSFELF 190
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
RG D +R LADY IRHH+ + +N Y A E
Sbjct: 191 YHRGMHDE--IRVLADYVIRHHYPACQE---------------------AANPYLALFAE 227
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+AQWQ VGF HGV+N+DNMSILGLTIDYGPFGF+D F+ + N +D G R
Sbjct: 228 VTRRTAELIAQWQAVGFCHGVMNSDNMSILGLTIDYGPFGFIDGFNAAHICNHSDHAG-R 286
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y + QP IGLWN+ ++ L L+ ++E V+ Y F + + KLGL
Sbjct: 287 YAYNQQPQIGLWNLHCLASAL--LPLVSEEELVAVLGSYRDTFEAAHLMRLRAKLGLTAE 344
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ +I+ L + + D+T FFR L+ + D D + P++ + ++ +E
Sbjct: 345 HDDDADLINSLFLTLHAHRTDFTIFFRRLAGFRQD-----DAVNAPVRDLFVE-----RE 394
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAI-DAAELGDFGEVRRL 606
+ +W Y + L D ER MN VNPKY+LRNYL ++AI A + D+ E+ RL
Sbjct: 395 QFDAWARRYRERLAWEASVDAERAVRMNRVNPKYILRNYLAEAAIAKARDERDYSEIERL 454
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ +E+P+DEQP E YA PP WA + V SCSS
Sbjct: 455 GRCLEKPFDEQPEFEAYAGFPPEWAEQISV---SCSS 488
>gi|372273889|ref|ZP_09509925.1| hypothetical protein PSL1_02280 [Pantoea sp. SL1_M5]
gi|390433774|ref|ZP_10222312.1| hypothetical protein PaggI_03025 [Pantoea agglomerans IG1]
Length = 483
Score = 350 bits (899), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 216/542 (39%), Positives = 297/542 (54%), Gaps = 69/542 (12%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF 165
++D+++ REL G CYT ++P+ + +L+ + +A S+ LD F
Sbjct: 7 SFDNTWFRELTG--------------CYTALNPTP-LAGGRLLYHNAPLAASMGLDSALF 51
Query: 166 ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQL 225
++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE + + L
Sbjct: 52 ADKGHAVW-HGAALLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLEDGSKLDWHL 110
Query: 226 KGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGN 285
KGAG TPYSR DG AV+RSS+REFL SEA+H LGIPTTRAL L + V R+
Sbjct: 111 KGAGLTPYSRMGDGRAVIRSSVREFLASEALHHLGIPTTRALTLSIGDEPVYRE------ 164
Query: 286 PKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
E GA++ R++ S LRFG ++ S+ QE V+ LADYAIRHH+ H+
Sbjct: 165 -TTERGAMLMRISPSHLRFGHFEHFFYSQQQEK---VQQLADYAIRHHWPHL-------- 212
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
D +++Y W ++ RTA L+A WQ VGF HGV+NTDNMSILGLTI
Sbjct: 213 -------------DAEADRYQQWFTDIVLRTARLIALWQSVGFAHGVMNTDNMSILGLTI 259
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGPFGFLD + P F N +D G RY F NQP IG+WN+ + + L+ L+ ++
Sbjct: 260 DYGPFGFLDDYQPDFICNHSDYQG-RYSFENQPMIGMWNLNRLAHALSG--LLTTEQLRT 316
Query: 465 VMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ Y + M + M KLGL + +I++ LL M + DYT FR LS +
Sbjct: 317 ALSAYEPELMRVWGERMRAKLGLLTQQSSDNEILTDLLALMTQEHSDYTLTFRLLSETQQ 376
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
S PL+ +D +EA+ SW Y L+ +SD ER+A+M + NP
Sbjct: 377 ADSRS------PLRDEFID-----REAFDSWYQRYRSRLMDEQVSDAERQAVMKAANPAV 425
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
+LRNYL Q AI+ AE G+ G + RL + +++P+ +Q E Y + PP W +SC
Sbjct: 426 ILRNYLAQQAIEEAERGEQGALARLHQALQQPFSDQTAAE-YRQRPPDWG---KTLEVSC 481
Query: 642 SS 643
SS
Sbjct: 482 SS 483
>gi|260597652|ref|YP_003210223.1| hypothetical protein CTU_18600 [Cronobacter turicensis z3032]
gi|260216829|emb|CBA30326.1| UPF0061 protein ydiU [Cronobacter turicensis z3032]
Length = 482
Score = 350 bits (899), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 215/528 (40%), Positives = 289/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPGFYTELTPTP-LNNSRLFFHNAPLAQALELPQTLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYR--REPESVRELAQYVIEHHFAHLAQ------------EED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVRRTAHLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPVLLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSETEQRSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ M SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEDEGVEDDARQQRMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+D++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRHPFDDRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|424799351|ref|ZP_18224893.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
sakazakii 696]
gi|423235072|emb|CCK06763.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
sakazakii 696]
Length = 482
Score = 350 bits (899), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 213/528 (40%), Positives = 289/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L + YT+++P+ + N +L+ + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPSFYTELTPTP-LNNSRLLCHNAPLAQALELPETLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+
Sbjct: 177 ESHVRFGHFEHFYYR--REPERVRELAQYVIEHHFAHLVQ-------------------- 214
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
+++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 215 -EKDRFALWFGEVVTRTAQLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPALLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSETEQRSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGVEDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+ ++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|221198198|ref|ZP_03571244.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
gi|221208309|ref|ZP_03581312.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
gi|221171722|gb|EEE04166.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
gi|221182130|gb|EEE14531.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
Length = 522
Score = 350 bits (899), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 223/535 (41%), Positives = 292/535 (54%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S VA L L +P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLAAPYVVGFSGEVARLLGLPASLAAQPGFAELFAGNPTRDWPAEALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG+G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGTDGRRYELQLKGSGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REFLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETAAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I D + + Y A
Sbjct: 207 EHFFSNNRPDL--LRALADHVI---------------------DRFYPACRDADDPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTG 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ + +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLATFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL + + ++LL M D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLELERDSDAALANQLLETMHASHADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D +EA+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----REAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 ELAIRRAKEKDFSEVERLAQVLRRPFDEQPEHESYAALPPDWA---GSLEVSCSS 522
>gi|429086269|ref|ZP_19149001.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
universalis NCTC 9529]
gi|426506072|emb|CCK14113.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
universalis NCTC 9529]
Length = 482
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 216/528 (40%), Positives = 291/528 (55%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L+ + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPGFYTELTPTP-LNNSRLLWHNAPLAQALELPETLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYR--REPERVRELAQYVIDHHFAHLAQ------------EED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVTRTAHLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPVLLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLHELLTLMAREGSDYTRTFRMLSETEQRSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGVDDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+ ++ + Y PP W R V SCSS
Sbjct: 440 ERDDASELSRLLEALRYPFADRD--DDYTHRPPDWGKRLEV---SCSS 482
>gi|209517041|ref|ZP_03265889.1| protein of unknown function UPF0061 [Burkholderia sp. H160]
gi|209502572|gb|EEA02580.1| protein of unknown function UPF0061 [Burkholderia sp. H160]
Length = 518
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 214/525 (40%), Positives = 283/525 (53%), Gaps = 66/525 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSG----ATPLAGAVPYAQCYGGH 193
P+A ++ P LV +S A L L P F F G A P A A+PYA Y GH
Sbjct: 41 PAAPLDAPYLVGFSAETAAQLGLPAGIESDPGFVELFCGNATRAWP-ADALPYASVYSGH 99
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ LGE L E +ELQLKGAG+TPYSR DG AVLRSSIRE+LCS
Sbjct: 100 QFGVWAGQLGDGRALMLGE-LEHDGEHFELQLKGAGRTPYSRMGDGRAVLRSSIREYLCS 158
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAMH LGIPTTRALC++ + + V R+ E A+V RVA SF+RFG ++ +
Sbjct: 159 EAMHHLGIPTTRALCVIGSDQPVRRETI-------ETAAVVTRVAPSFVRFGHFEHFYA- 210
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
+ +D +R LAD+ I + H + + + Y A E
Sbjct: 211 -NDRVDALRALADHVIERFYPHCKEAD---------------------DPYLALLAEAVR 248
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
TA L+ WQGVGF HGV+NTDNMSILGLTIDYGPFGF+D FD N +D G RY +
Sbjct: 249 STADLMVDWQGVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDADHICNHSDTQG-RYAY 307
Query: 434 ANQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIM 481
QP I WN+ AQ ++ K ++D A V+ + +F + M
Sbjct: 308 RLQPQIAYWNLFCLAQGLLPLFGAQHDESVRGDKAVED--AQQVLAGFKDRFAPALENRM 365
Query: 482 TKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
KLGL + + ++++L M ++ D+T FR L+ + + + P + +
Sbjct: 366 RAKLGLEQARDGDDALVNRLFEVMHANRADFTLTFRNLARLSKHDASGD----APARDLF 421
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
LD + A+ +W Y L D R MN VNPK+VLRN+L ++AI A+
Sbjct: 422 LD-----RAAFDAWAHDYRARLAVESRDDAARAIAMNRVNPKFVLRNHLAETAIQRAKEK 476
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF EV RL ++ RP+DEQP YA LPP WA +SCSS
Sbjct: 477 DFSEVERLAAVLRRPFDEQPEYAAYAGLPPDWA---SSLEVSCSS 518
>gi|317047881|ref|YP_004115529.1| hypothetical protein Pat9b_1657 [Pantoea sp. At-9b]
gi|316949498|gb|ADU68973.1| protein of unknown function UPF0061 [Pantoea sp. At-9b]
Length = 479
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 211/542 (38%), Positives = 299/542 (55%), Gaps = 66/542 (12%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
+ + +S+ RELPG YT ++P+ ++ +L+ + +A ++ LDP
Sbjct: 1 MQFTNSWQRELPG--------------FYTALAPTP-LQGGRLLYHNAPLATTMALDPSL 45
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F ++F G L G P AQ Y GHQFG+WAGQLGDGR I LGE + +
Sbjct: 46 FSGDGHGVWF-GQALLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGRKLDWH 104
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR DG AV+RS++REFL SEA+H LGIPTTRAL L + V R+
Sbjct: 105 LKGAGLTPYSRMGDGRAVIRSTVREFLASEALHHLGIPTTRALSLAVGEEPVLRE----- 159
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+E GA++ R+A+S LRFG ++ H G E D VR LADYAIRHH+ ++
Sbjct: 160 --TQERGAMLMRIAESHLRFGHFE-HFYYGGEP-DKVRQLADYAIRHHWPMLQE------ 209
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+++Y W ++ +RTASL+AQWQ VGF HGV+NTDNMS+LGLTI
Sbjct: 210 ---------------EADRYLLWFTDIVKRTASLIAQWQSVGFAHGVMNTDNMSLLGLTI 254
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP+GFLD + P+F N +D G RY F NQP +GLWN+ + + L+ L+ ++
Sbjct: 255 DYGPYGFLDDYQPNFICNHSDYQG-RYAFDNQPAVGLWNLNRLAHALSG--LMSTEQLKQ 311
Query: 465 VMERYGTKFMDEYQAIMTKKLGL--PKYN-KQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ Y + M + M KLGL P+ N +I++ LL M + DYT FR LS +
Sbjct: 312 ALSHYEPELMRVWGERMRAKLGLLTPEANDNEILTGLLALMTQEHSDYTLTFRLLSETQ- 370
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+ + PL+ +D ++A+ W Y Q LL SDE R+ +M + NP
Sbjct: 371 -----QQQTRSPLRDEFID-----RDAFDRWYDGYRQRLLRDEASDETRQQVMKAANPAL 420
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
VLRNYL Q I+ E G+ + RL +++P+ ++ + + PP W +SC
Sbjct: 421 VLRNYLAQQVIEEVERGETAALERLHLALQQPFSDEAVSAELRQRPPEWG---KTLEVSC 477
Query: 642 SS 643
SS
Sbjct: 478 SS 479
>gi|297481447|ref|XP_002692159.1| PREDICTED: UPF0061 protein Fjoh_2793 [Bos taurus]
gi|296481430|tpg|DAA23545.1| TPA: predicted protein-like [Bos taurus]
Length = 573
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 204/503 (40%), Positives = 294/503 (58%), Gaps = 41/503 (8%)
Query: 110 SFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV-ADSLELDPKEFERP 168
+ + LP DP ++ R+V + ++ P+ +LVA S+ V D L+LD E
Sbjct: 100 NLIAVLPTDPVKENYVRKVKNCVFSIAFPTPFQSRVRLVAVSKEVLEDILDLDLSVSETD 159
Query: 169 DFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGA 228
DF SG + G++P A YGGHQFG+WA QLGDGRA +G +N + E+WELQLKG+
Sbjct: 160 DFIQLVSGGKIVFGSIPLAHRYGGHQFGIWADQLGDGRAHLIGIYMNRQGEKWELQLKGS 219
Query: 229 GKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKE 288
GKTPYSR DG A+LRSS+REFLCSEAMH+LGIPT+RA LV + V RD FY+GN +
Sbjct: 220 GKTPYSRNGDGRAILRSSLREFLCSEAMHYLGIPTSRAASLVVSDDVVWRDQFYNGNLTK 279
Query: 289 EPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
E GA+V RVA+S+ R GS +I G+ LD++R L D+ I+ +F
Sbjct: 280 ERGAVVLRVAKSWFRIGSLEILTHSGE--LDLLRMLLDFIIQEYF--------------- 322
Query: 349 TGDEDHSVVDLTS-NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
+VD+ N+Y + V TA L+A W VGF HGV NTDN S+L +TIDYG
Sbjct: 323 ------PLVDVKEPNRYVDFFSIVVFETAQLIALWMSVGFAHGVCNTDNFSLLSITIDYG 376
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYV-- 465
PFGF++A++P F PNT+D RRY NQ +IG++N+ + L L++ ++ V
Sbjct: 377 PFGFMEAYNPDFVPNTSD-DERRYKIGNQANIGMFNLNKLLQALNP--LLNPRQKQLVTQ 433
Query: 466 -MERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
++ Y + ++ + KLGL + + +I+ LL+ M + D+T FR LS +
Sbjct: 434 ILKEYPVLYYTRFRELFKAKLGLLGKSEGDDDLIAFLLHLMEKTEADFTMTFRQLSEITQ 493
Query: 522 DPSIPEDELLVPLKAVLLDIGKERK--EAWISWVLSYIQELLSSGISDEERKALMNSVNP 579
EL++P + L + + K AW+S LS ++ +S SD ER+ M +VNP
Sbjct: 494 SQL---QELVIPQEFWALKMISKHKLFPAWVSQYLSRLKSNISD--SDSERRKRMTAVNP 548
Query: 580 KYVLRNYLCQSAIDAAELGDFGE 602
+YVL+N++ +SA+ AE DF E
Sbjct: 549 RYVLKNWMAESAVQKAERNDFSE 571
>gi|167893832|ref|ZP_02481234.1| hypothetical protein Bpse7_08741 [Burkholderia pseudomallei 7894]
gi|167918552|ref|ZP_02505643.1| hypothetical protein BpseBC_08350 [Burkholderia pseudomallei
BCC215]
Length = 525
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 228/548 (41%), Positives = 297/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P P F F G
Sbjct: 28 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRAAPGFAELFCGNP 85
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 86 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 143
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 144 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 196
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 197 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 240
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 241 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 292
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 293 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 351
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR
Sbjct: 352 D--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 409
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 410 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMN 460
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 461 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---S 517
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 518 TLEVSCSS 525
>gi|323526031|ref|YP_004228184.1| hypothetical protein BC1001_1689 [Burkholderia sp. CCGE1001]
gi|323383033|gb|ADX55124.1| protein of unknown function UPF0061 [Burkholderia sp. CCGE1001]
Length = 518
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 213/524 (40%), Positives = 285/524 (54%), Gaps = 64/524 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P LV +S A L L+ P F FSG + A+PYA Y GHQ
Sbjct: 41 PAAPLNAPYLVGFSADTAAMLGLESGLETDPGFAELFSGNATREWPSEALPYASVYSGHQ 100
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+ LGE+ + + R+ELQLKGAG+TPYSR DG AVLRSSIREFLCSE
Sbjct: 101 FGVWAGQLGDGRALGLGEVEH-EGRRYELQLKGAGRTPYSRMGDGRAVLRSSIREFLCSE 159
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC++ + + V R+ E A+V RVA SF+RFG ++ S
Sbjct: 160 AMHHLGIPTTRALCVIGSDQPVRREEI-------ETAAVVTRVAPSFVRFGHFEHFYS-- 210
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ D +R LAD+ I + H + + Y A E
Sbjct: 211 NDRTDALRALADHVIERFYPHCREAD---------------------DPYLALLNEAVMS 249
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA L+ +WQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 250 TADLMVEWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDAGYICNHSDSQG-RYAYR 308
Query: 435 NQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMT 482
QP I WN+ ++ T+ K I+D A V+ + +F + M
Sbjct: 309 MQPQIAYWNLFCLAQGLLPLLGERYEDTVRGDKSIED--AQQVLAGFKDRFGPALERRML 366
Query: 483 KKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL 539
KLGL + + + ++L + M ++ D+T FR L+ + + + P++ + L
Sbjct: 367 AKLGLEDAREGDAALANRLFDVMHANRADFTLTFRNLARLSKHDASGD----APVRDLFL 422
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
D + A+ +W Y L D R MN VNPK+VLRN+L ++AI A+ D
Sbjct: 423 D-----RAAFDAWANDYRARLSHERRDDAARAIAMNRVNPKFVLRNHLAETAIRRAKEKD 477
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
F EV RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 478 FSEVERLAAVLRRPFDEQPEHEAYAGLPPDWA---SSLEVSCSS 518
>gi|283833379|ref|ZP_06353120.1| SelO family protein [Citrobacter youngae ATCC 29220]
gi|291071028|gb|EFE09137.1| SelO family protein [Citrobacter youngae ATCC 29220]
Length = 480
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 288/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ ++N L+ ++++A+ L + F+ D + G + L G P
Sbjct: 10 RDELPATYTALSPTP-LKNAHLIWHNDALAEQLAIPAALFDISDGSGVWGGESLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVAQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVYRETV-------EAGAMLVRVAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H + ++KY
Sbjct: 182 HFEHFYYR--REPEKVRQLADFAIRHYWPHWQE---------------------EADKYQ 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P F N +D
Sbjct: 219 LWFSDVVTRTANLIADWQAVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYVPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP LWN+ + + TL + I + N ++RY + Y M +KL
Sbjct: 279 HQG-RYSFDNQPAAALWNLQRLAQTL--SPFIPIEALNDALDRYQLALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + +++S+L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 336 GFFSEQKNDNELLSELFSLMARERSDYTRTFRMLSLTQ------QHSAHSPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L + D R+ M + NP VLRN+L Q AI AE GD+ E
Sbjct: 388 ---RAAFDDWFTRYRSRLQQDNVDDAVRQTQMQAANPAMVLRNWLAQRAISQAEQGDYAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 445 LHRLHQTLRTPFVDRD--DDYVSRPPDWGKRLEV---SCSS 480
>gi|295676533|ref|YP_003605057.1| hypothetical protein BC1002_1471 [Burkholderia sp. CCGE1002]
gi|295436376|gb|ADG15546.1| protein of unknown function UPF0061 [Burkholderia sp. CCGE1002]
Length = 518
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 213/525 (40%), Positives = 286/525 (54%), Gaps = 66/525 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFER-PDFPLFFSGATPL---AGAVPYAQCYGGH 193
P+A ++ P LV +S A L + P+ ER P F F G A A+PYA Y GH
Sbjct: 41 PAAPLDAPYLVGFSAETAARLGM-PEGIERDPGFLELFCGNATRDWPADALPYASVYSGH 99
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+TLGE L ER ELQLKGAG+TPYSR DG AVLRSSIRE+LCS
Sbjct: 100 QFGVWAGQLGDGRALTLGE-LEHDGERNELQLKGAGRTPYSRMGDGRAVLRSSIREYLCS 158
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAMH LGIPTTRALC++ + + V R+ E A+V RVA SF+RFG ++ +
Sbjct: 159 EAMHHLGIPTTRALCVIGSDQPVRRETI-------ETAAVVTRVAPSFVRFGHFEHFYA- 210
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
+ +D +R LAD+ I + H + + + Y A E
Sbjct: 211 -NDRVDALRALADHVIERFYPHCKEAD---------------------DPYLALLAEAVR 248
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
TA L+ WQ VGF HGV+NTDNMSILGLTIDYGPFGF++ FD N +D G RY +
Sbjct: 249 STADLMVDWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMNGFDAGHICNHSDTQG-RYAY 307
Query: 434 ANQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIM 481
QP I WN+ + ++ A K ++D A +V+ + +F + M
Sbjct: 308 RLQPQIAYWNLFCLAQGLLPLLGEKHDESVRADKAVED--AQHVLAGFKERFAPALENRM 365
Query: 482 TKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
KLGL + + ++++L M ++ D+T FR L+ + + + P++ +
Sbjct: 366 RAKLGLEQARDGDDALVNRLFEAMHANRADFTLTFRNLARLSKHDASGD----APVRDLF 421
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
LD + A+ W Y L D R MN VNPK+VLRN+L ++AI A+
Sbjct: 422 LD-----RAAFDVWANDYRARLAVESHDDAARAIAMNRVNPKFVLRNHLAETAIQRAKEK 476
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF EV RL ++ RP+DEQP YA LPP WA +SCSS
Sbjct: 477 DFSEVERLAAVLRRPFDEQPEYASYAGLPPDWA---SSLEVSCSS 518
>gi|299529225|ref|ZP_07042670.1| hypothetical protein CTS44_00619 [Comamonas testosteroni S44]
gi|298722848|gb|EFI63760.1| hypothetical protein CTS44_00619 [Comamonas testosteroni S44]
Length = 511
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 220/528 (41%), Positives = 293/528 (55%), Gaps = 60/528 (11%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSG----ATPLAGAVPY 186
A +T + P+ V P +A S S A + L+ + + SG G+ P
Sbjct: 29 AFFTYLQPT-PVPEPHWIAASVSTARWMGLNTEWLHSAEALQILSGNAVSGHGKGGSKPL 87
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRAI LGE + +E+QLKGAG+TPYSR DG AVLRSS
Sbjct: 88 ATVYSGHQFGVWAGQLGDGRAILLGE----TEQGFEVQLKGAGRTPYSRMGDGRAVLRSS 143
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAM LGIPTTRAL L + V R+ E A+V RVA+SF+RFG
Sbjct: 144 IREFLCSEAMAALGIPTTRALALTGSPLPVARETM-------ETAAVVTRVAESFIRFGH 196
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ A+R + ++TLAD I H+ E + V L N YA
Sbjct: 197 FEHFAARDMQTE--LKTLADLVIDQHY-----------------PECRTAVALKGNPYAN 237
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
+ V+ERTA L+AQWQGVGF HGV+NTDNMSILGLTIDYGPF FLDAFDP N +D
Sbjct: 238 FLQAVSERTARLMAQWQGVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFDPGHICNHSDS 297
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY-VMERYGTKFMDEYQAIMTKKL 485
G RY F QP + WN+ + A LI D+E +E Y T F Y M KL
Sbjct: 298 QG-RYAFNRQPQVAYWNL--YCLGQALLPLIGDEELTIAALESYKTVFPAAYARQMLAKL 354
Query: 486 GLPKYN----------KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
GLP+ Q+++ LL +A KVDYT FF L++ A + + PL+
Sbjct: 355 GLPENEAGTPATEGRFAQLVNPLLQILADSKVDYTIFFSRLTDAVAQRQETKID-FEPLR 413
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
++LD + ++ +W L+Y ++L + + + LM NP++VLRN+L ++ I AA
Sbjct: 414 DIILD-----RASFDAWSLTYSEQL--AQVDKAQTVDLMQKSNPRFVLRNHLGETVIRAA 466
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ GDF V+++L +++ PYD P +A PP WA +SCSS
Sbjct: 467 QAGDFAPVQQMLAVLQTPYDSHPDHADWAGFPPDWA---SSIEISCSS 511
>gi|387902461|ref|YP_006332800.1| hypothetical protein MYA_1708 [Burkholderia sp. KJ006]
gi|387577353|gb|AFJ86069.1| hypothetical protein MYA_1708 [Burkholderia sp. KJ006]
Length = 522
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 219/535 (40%), Positives = 294/535 (54%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S VA+ L L P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLPAPYVVGFSAEVAELLGLPPSLAAHAQFAELFAGNPTRDWPAHALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGSDGRRYELQLKGGGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REYLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I + + + Y A
Sbjct: 207 EHFFSNDRPDL--LRRLADHVIERFYPACREAD---------------------DPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA +VAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAAMLRTADMVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTS 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ ++ +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVDDAQA--VLAKFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL +++ ++ ++LL M D+T FR L+ + K D S
Sbjct: 361 FGPALEHAMRAKLGLALEREHDAELANQLLETMHTSHADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D + A+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----RAAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 EVAIRRAKDKDFSEVERLAQILRRPFDEQPEHEPYAALPPDWA---GSLEVSCSS 522
>gi|437486888|ref|ZP_20769780.1| hypothetical protein SEEE4647_00335, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 642046 4-7]
gi|435233110|gb|ELO14158.1| hypothetical protein SEEE4647_00335, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 642046 4-7]
Length = 445
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 207/493 (41%), Positives = 278/493 (56%), Gaps = 52/493 (10%)
Query: 154 VADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
+A L + F+ + + G T L G P AQ Y GHQFG+WAGQLGDGR I LGE
Sbjct: 2 LAQQLAIPASLFDATNGAGVWGGETLLPGMSPVAQVYSGHQFGVWAGQLGDGRGILLGEQ 61
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
L + LKGAG TPYSR DG AVLRS+IRE L SEAMH+LGIPTTRAL +V +
Sbjct: 62 LLADGSTLDWHLKGAGLTPYSRMGDGRAVLRSTIRESLASEAMHYLGIPTTRALSIVASD 121
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHF 333
V R+ +E GA++ R+AQS +RFG ++ R + + V+ LAD+AIRH++
Sbjct: 122 TPVQRE-------TQETGAMLMRLAQSHMRFGHFEHFYYR--REPEKVQQLADFAIRHYW 172
Query: 334 RHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLN 393
+++ + KYA W EVA RT L+A+WQ VGF+HGV+N
Sbjct: 173 PQWQDVPE---------------------KYALWFEEVAARTGRLIAEWQTVGFSHGVMN 211
Query: 394 TDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAA 453
TDNMSILGLTIDYGPFGFLD +DP F N +D G RY F NQP + LWN+ + + TL
Sbjct: 212 TDNMSILGLTIDYGPFGFLDDYDPGFIGNHSDHQG-RYRFDNQPLVALWNLQRLAQTLTP 270
Query: 454 AKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYT 510
ID N ++RY + Y M +KLG K + ++++L + MA + DYT
Sbjct: 271 FIEID--ALNRALDRYQDALLTHYGQRMRQKLGFFTEQKDDNALLNELFSLMAREGSDYT 328
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEER 570
FR LS+ + + PL+ +D + A+ +W Y L + + D R
Sbjct: 329 RTFRMLSHTEQQSASS------PLRDTFID-----RAAFDAWFDRYRARLRTEAVDDALR 377
Query: 571 KALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAW 630
+ M VNP VLRN+L Q AIDAAE GD E+ RL +++ +P+ ++ + YA PP W
Sbjct: 378 QQQMQRVNPAVVLRNWLAQRAIDAAEQGDMAELHRLHEVLRQPFTDRD--DDYASRPPEW 435
Query: 631 AYRPGVCMLSCSS 643
R V SCSS
Sbjct: 436 GKRLEV---SCSS 445
>gi|389841260|ref|YP_006343344.1| hypothetical protein ES15_2260 [Cronobacter sakazakii ES15]
gi|387851736|gb|AFJ99833.1| hypothetical protein ES15_2260 [Cronobacter sakazakii ES15]
Length = 482
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 215/528 (40%), Positives = 290/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L+ + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPGFYTELTPTP-LNNSRLLCHNAPLAQALELPETLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGIMLGEQQLSDGCKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYR--REPERVRELAQYVIEHHFAHLAQ------------EED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVTRTAQLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPALLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRTKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSETEQRSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGVEDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+ ++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|424816111|ref|ZP_18241262.1| hypothetical protein ECD227_1228 [Escherichia fergusonii ECD227]
gi|325497131|gb|EGC94990.1| hypothetical protein ECD227_1228 [Escherichia fergusonii ECD227]
Length = 480
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 210/523 (40%), Positives = 293/523 (56%), Gaps = 57/523 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A +T ++P+ + N +L+ + +A L + F + G T L G P
Sbjct: 10 RDELPATWTALNPTP-LHNARLIWHNAELAHELAIPQSLFADNKGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIP TR+L +VT+ V R+ E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPGTRSLAIVTSDTPVYRE-------TTETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R D++ V+ LAD+AIRH++ H++ +KYA
Sbjct: 182 HFEHFYYR--RDIEKVQLLADFAIRHYWPHLQE---------------------EQDKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+A WQ VGF HGV+NTDNMSI+GLT+DYGPFGFLD ++P F N +D
Sbjct: 219 IWFRDVVARTASLIAGWQTVGFAHGVMNTDNMSIMGLTLDYGPFGFLDDYNPQFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL + I N ++ Y + Y M +KL
Sbjct: 279 HQG-RYSFDNQPAVALWNLQRLAQTL--SPFIAVNALNDALDSYKQVLLAVYGKRMRQKL 335
Query: 486 GLPKYNKQ-----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
G Y +Q ++++L MA + DYT FR LS + + + PL+ +D
Sbjct: 336 GF--YTEQNNDNDLLNELFALMAREGSDYTRTFRMLSQTEQNSASS------PLRDEFID 387
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+ A+ SW Y + + ++D+ER+ M SVNP VLRN+L Q AI+ A+ GD
Sbjct: 388 -----RAAFDSWFSRYRARIQTEQVTDDERQLQMKSVNPAVVLRNWLAQRAINDAQKGDM 442
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ RL ++ P++++ + Y+R PP W R V SCSS
Sbjct: 443 EELHRLHDVLRNPFNDRD--DDYSRRPPEWGKRLEV---SCSS 480
>gi|238757764|ref|ZP_04618947.1| hypothetical protein yaldo0001_35210 [Yersinia aldovae ATCC 35236]
gi|238704007|gb|EEP96541.1| hypothetical protein yaldo0001_35210 [Yersinia aldovae ATCC 35236]
Length = 497
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 216/558 (38%), Positives = 304/558 (54%), Gaps = 66/558 (11%)
Query: 89 GGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLV 148
G K + K D+N+ +S+ ++L G YT + P+ ++ +L+
Sbjct: 3 GSKNVKSDNRPKFNHDVNFKNSYEQQLRG--------------FYTHLQPTP-LKGARLL 47
Query: 149 AWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAI 208
SE++A+ LELD F P ++ +G + L G +P AQ Y GHQFG+WAGQLGDGR I
Sbjct: 48 YHSEALANELELDASWFSAPKSTVW-AGESLLPGMMPLAQVYSGHQFGVWAGQLGDGRGI 106
Query: 209 TLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALC 268
LGE + LKGAG TPYSR DG AVLRS +REFL SEA+H LGIPT+RAL
Sbjct: 107 LLGEQQLSDGRSMDWHLKGAGLTPYSRMGDGRAVLRSVVREFLASEALHHLGIPTSRALT 166
Query: 269 LVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYA 328
+VT+ V R+ + E GA++ RVA+S +RFG ++ R Q + V+ LADY
Sbjct: 167 IVTSEHPVYRE-------QPERGAMLLRVAESHVRFGHFEHFYYRQQPEQ--VKQLADYV 217
Query: 329 IRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFT 388
I H+ H+ G+++ +Y W +V RTA L+AQWQ VGF
Sbjct: 218 IARHWPHL------------VGEQE---------RYLLWFTDVIMRTARLIAQWQTVGFA 256
Query: 389 HGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFS 448
HGV+NTDNMSILG+T+DYGPFGFLD + P + N +D G RY F NQP + LWN+ +
Sbjct: 257 HGVMNTDNMSILGITMDYGPFGFLDDYVPGYICNHSDHQG-RYAFDNQPAVALWNLHRLG 315
Query: 449 TTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVD 505
L+ L+ + ++ Y + M Y M KLGL + Q +++ LL+ M +
Sbjct: 316 QALSG--LMSVAQLQLALDAYEPELMAVYGQQMRAKLGLFASDSQDNDVLTGLLSLMIKE 373
Query: 506 KVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI 565
DYT FR LS V+ + PL+ +D + + SW Y L +
Sbjct: 374 GRDYTRTFRLLSEVEMHSAHS------PLRDDFID-----RAGFDSWFSRYRTRLQQEPV 422
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYAR 625
D +R+ M +VNPKY+LRNYL Q AID AE D ++RL + +++P+ +QP + A
Sbjct: 423 DDAQRQLAMKAVNPKYILRNYLAQLAIDHAEKDDILPLQRLHQALQQPFADQPEFDSLAD 482
Query: 626 LPPAWAYRPGVCMLSCSS 643
LPP W +SCSS
Sbjct: 483 LPPDWGKH---LEISCSS 497
>gi|423103472|ref|ZP_17091174.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5242]
gi|376386136|gb|EHS98853.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5242]
Length = 480
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 288/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ +EN +LV + +A +L +D F + G T L G P
Sbjct: 10 RDELPDFYTALTPTP-LENARLVWHNAPLARTLGVDASLFSPQKGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +V + V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIREALASEAMHALGIPTTRALAIVASDTPVYRETV-------ERGAMLMRLAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ H +E L V+ LADY IRHH+ H++N +++Y
Sbjct: 182 HFE-HFYYRREPLK-VQQLADYVIRHHWPHLQN---------------------EADRYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 LWFSDVVTRTAEMIACWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + TL + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYRFDNQPAVGLWNLQRLAQTL--SPFISAEALNGALDSYQQALLTAYGRRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + +++ L M + DYT FR LS + + + PL+ +D
Sbjct: 336 GLFTQQKGDNELLDGLFALMEREGSDYTRTFRMLSASEQESAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+E + SW +Y L + D +R+A M SVNP VLRN+L Q AI+ AE GD E
Sbjct: 388 ---RETFDSWFTAYRARLRDEQVEDAQRQARMRSVNPAIVLRNWLAQRAIEQAEQGDMSE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ ++Y + PP W R V SCSS
Sbjct: 445 LERLHSALSHPFADR--TDEYIQRPPDWGRRLEV---SCSS 480
>gi|121601004|ref|YP_993250.1| hypothetical protein BMASAVP1_A1931 [Burkholderia mallei SAVP1]
gi|126450377|ref|YP_001080758.1| hypothetical protein BMA10247_1204 [Burkholderia mallei NCTC 10247]
gi|166998728|ref|ZP_02264582.1| conserved hypothetical protein [Burkholderia mallei PRL-20]
gi|294862478|sp|A2SBI7.2|Y5674_BURM9 RecName: Full=UPF0061 protein BMA10229_A3374
gi|121229814|gb|ABM52332.1| conserved hypothetical protein [Burkholderia mallei SAVP1]
gi|126243247|gb|ABO06340.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247]
gi|243065082|gb|EES47268.1| conserved hypothetical protein [Burkholderia mallei PRL-20]
gi|261825980|gb|ABN01587.2| conserved hypothetical protein [Burkholderia mallei NCTC 10229]
Length = 525
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 227/548 (41%), Positives = 298/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 28 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 85
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 86 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 143
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 144 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 196
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 197 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 240
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 241 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 292
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 293 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 351
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + + KLGL + + + ++LL M D+T FR
Sbjct: 352 D--AHAVLGRFPEQFGPALERAIRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 409
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 410 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMN 460
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 461 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---S 517
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 518 TLEVSCSS 525
>gi|124384298|ref|YP_001029306.1| hypothetical protein BMA10229_A3374 [Burkholderia mallei NCTC
10229]
gi|254177967|ref|ZP_04884622.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399]
gi|254358212|ref|ZP_04974485.1| conserved hypothetical protein [Burkholderia mallei 2002721280]
gi|148027339|gb|EDK85360.1| conserved hypothetical protein [Burkholderia mallei 2002721280]
gi|160699006|gb|EDP88976.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399]
Length = 521
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 227/548 (41%), Positives = 298/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 24 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 81
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 82 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 139
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 140 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 192
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 193 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 236
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 237 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 288
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 289 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 347
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + + KLGL + + + ++LL M D+T FR
Sbjct: 348 D--AHAVLGRFPEQFGPALERAIRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 405
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 406 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMN 456
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 457 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---S 513
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 514 TLEVSCSS 521
>gi|429115273|ref|ZP_19176191.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
sakazakii 701]
gi|426318402|emb|CCK02304.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
sakazakii 701]
Length = 482
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 214/528 (40%), Positives = 290/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L+ + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPGFYTELTPTP-LNNSRLLCHNAPLAQALELPETLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYS+ D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKLDWHLKGAGLTPYSQMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALTIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYR--REPERVRELAQYVIEHHFAHLAQ------------EED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVTRTARLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPALLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSETEQRSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGVEDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+ ++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|254179448|ref|ZP_04886047.1| conserved hypothetical protein [Burkholderia pseudomallei 1655]
gi|184209988|gb|EDU07031.1| conserved hypothetical protein [Burkholderia pseudomallei 1655]
Length = 525
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 227/548 (41%), Positives = 298/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 28 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 85
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 86 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGHRYELQLKGAGRTPYS 143
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 144 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 196
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 197 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 240
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 241 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 292
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 293 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 351
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR
Sbjct: 352 D--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 409
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A +N
Sbjct: 410 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAVN 460
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 461 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---S 517
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 518 TLEVSCSS 525
>gi|429096028|ref|ZP_19158134.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
dublinensis 582]
gi|426282368|emb|CCJ84247.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
dublinensis 582]
Length = 482
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 212/529 (40%), Positives = 292/529 (55%), Gaps = 53/529 (10%)
Query: 118 DPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGA 177
+P + R+ L YT+++P+ + N +L+ + +A +LEL P F+ + G
Sbjct: 4 NPHFTATWRDELPGFYTELTPTP-LSNSRLLCHNAPLAQTLELPPALFDYQGPAGVWGGE 62
Query: 178 TPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFA 237
T L G P AQ Y GHQFG+WAGQLGDGR I LGE +++ LKGAG TPYSR
Sbjct: 63 TLLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKFDWHLKGAGLTPYSRMG 122
Query: 238 DGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRV 297
DG AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+
Sbjct: 123 DGRAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRI 175
Query: 298 AQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVV 357
A+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 176 AESHVRFGHFEHFYYR--REPERVRELAQYVIAHHFAHL------------VQEED---- 217
Query: 358 DLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDP 417
++A W EV RTA L+A WQ VGF HGV+NTDNMS+LGLT+DYGP+GFLD ++P
Sbjct: 218 -----RFALWFGEVVTRTAHLMASWQCVGFAHGVMNTDNMSVLGLTMDYGPYGFLDDYNP 272
Query: 418 SFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEY 477
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 273 GFICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDEYQPALLREW 329
Query: 478 QAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
M KLG + + + +LL MA + DYT FR LS + + + PL
Sbjct: 330 GRQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSVTEQNSAAS------PL 383
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ +D + + +W Y L G+ D+ + LM SVNP VLRN+L Q AI+A
Sbjct: 384 RDEFID-----RATFDAWFARYRARLQEEGVEDDVHQRLMKSVNPALVLRNWLAQRAIEA 438
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AE D E+ RLL + P+ ++ + Y PP W V SCSS
Sbjct: 439 AERDDASELSRLLDALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|319803072|ref|NP_001156665.1| selenoprotein O [Bos taurus]
Length = 680
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 234/607 (38%), Positives = 309/607 (50%), Gaps = 113/607 (18%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+++ P + P++VA SE
Sbjct: 45 LAGLRFDNRALRALPVETPPPGPEGAPSAPRPVPGACFSRARPEP-LRRPRVVALSEPAL 103
Query: 156 DSLELDPKEFERPDFPL-------FFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAI 208
L L FFSG L GA P A CY GHQFG +AGQLGDG A+
Sbjct: 104 ALLGLGAPPAAAAAREAREAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAM 163
Query: 209 TLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALC 268
LGE+ ERWELQLKGAG T +SR ADG VLRSSIREFLCSEAM LG+PTTRA
Sbjct: 164 YLGEVCTEAGERWELQLKGAGPTAFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGS 223
Query: 269 LVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---D 319
V++ V RD FYDGNP+ EP A+V R+A +FLRFGS++I H R + D
Sbjct: 224 CVSSQSTVVRDAFYDGNPRPEPCAVVLRLAPTFLRFGSFEIFKPRDEHTGRAGPSVGRDD 283
Query: 320 IVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLV 379
I + DY I + I+ + DH ++AA+ EV RTA LV
Sbjct: 284 IRLQMLDYVISTFYPEIQACHPG----------DHV------QRHAAFFREVTRRTARLV 327
Query: 380 AQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDI 439
A+WQ VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D GR Y ++ QP++
Sbjct: 328 AEWQCVGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHVCNASDTAGR-YSYSKQPEV 386
Query: 440 GLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----II 495
WN+ + + L A ++ EA + E + +F Y M +KLGL + ++ ++
Sbjct: 387 CKWNLQKLAEALDPALPLELAEA-ILAEEFDAEFGRHYLQKMRRKLGLVQTEQEGDGALV 445
Query: 496 SKLLNNM-------------------AVDKVDYTNFFRALSNVKA-------------DP 523
++LL M A + D F AL+ A DP
Sbjct: 446 AQLLETMHLTGADFTNSFYLLNSFPTAPESPDLDGFLAALTAQCASLEELRLAFRPQMDP 505
Query: 524 -----------SIPEDELLVPLKAVLL------------------DIGKERKEAWISWVL 554
S P+ L+ +A L ++ + + W +W+
Sbjct: 506 RQLSMMLMLAQSNPQLLALIGTRASLARELERVEQQSRLEQLSEAELHGKNRSRWAAWLH 565
Query: 555 SYI------QELLSSGIS-DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
+Y +E S ++ ER +M + NPKYVLRNY+ Q AI+AAE GDF EVRR+L
Sbjct: 566 NYRARLEKDREASSDAVTWQAERTRVMRANNPKYVLRNYIAQGAIEAAESGDFSEVRRVL 625
Query: 608 KLMERPY 614
KL+E PY
Sbjct: 626 KLLETPY 632
>gi|296486883|tpg|DAA28996.1| TPA: selenoprotein O [Bos taurus]
Length = 680
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 234/607 (38%), Positives = 309/607 (50%), Gaps = 113/607 (18%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+++ P + P++VA SE
Sbjct: 45 LAGLRFDNRALRALPVETPPPGPEGAPSAPRPVPGACFSRARPEP-LRRPRVVALSEPAL 103
Query: 156 DSLELDPKEFERPDFPL-------FFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAI 208
L L FFSG L GA P A CY GHQFG +AGQLGDG A+
Sbjct: 104 ALLGLGAPPAAAAAREAREAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAM 163
Query: 209 TLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALC 268
LGE+ ERWELQLKGAG T +SR ADG VLRSSIREFLCSEAM LG+PTTRA
Sbjct: 164 YLGEVCTEAGERWELQLKGAGPTAFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGS 223
Query: 269 LVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---D 319
V++ V RD FYDGNP+ EP A+V R+A +FLRFGS++I H R + D
Sbjct: 224 CVSSQSTVVRDAFYDGNPRPEPCAVVLRLAPTFLRFGSFEIFKPRDEHTGRAGPSVGRDD 283
Query: 320 IVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLV 379
I + DY I + I+ + DH ++AA+ EV RTA LV
Sbjct: 284 IRLQMLDYVISTFYPEIQACHPG----------DHV------QRHAAFFREVTRRTARLV 327
Query: 380 AQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDI 439
A+WQ VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D GR Y ++ QP++
Sbjct: 328 AEWQCVGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHVCNASDTAGR-YSYSKQPEV 386
Query: 440 GLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----II 495
WN+ + + L A ++ EA + E + +F Y M +KLGL + ++ ++
Sbjct: 387 CKWNLQKLAEALDPALPLELAEA-ILAEEFDAEFGRHYLQKMRRKLGLVQTEQEGDGALV 445
Query: 496 SKLLNNM-------------------AVDKVDYTNFFRALSNVKA-------------DP 523
++LL M A + D F AL+ A DP
Sbjct: 446 AQLLETMHLTGADFTNSFYLLNSFPTAPESPDLDGFLAALTAQCASLEELRLAFRPQMDP 505
Query: 524 -----------SIPEDELLVPLKAVLL------------------DIGKERKEAWISWVL 554
S P+ L+ +A L ++ + + W +W+
Sbjct: 506 RQLSMMLMLAQSNPQLLALIGTRASLARELERVEQQSRLEQLSEAELHGKNRSRWAAWLH 565
Query: 555 SYI------QELLSSGIS-DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
+Y +E S ++ ER +M + NPKYVLRNY+ Q AI+AAE GDF EVRR+L
Sbjct: 566 NYRARLEKDREASSDAVTWQAERTRVMRANNPKYVLRNYIAQGAIEAAESGDFSEVRRVL 625
Query: 608 KLMERPY 614
KL+E PY
Sbjct: 626 KLLETPY 632
>gi|385209671|ref|ZP_10036539.1| hypothetical protein BCh11DRAFT_06803 [Burkholderia sp. Ch1-1]
gi|385182009|gb|EIF31285.1| hypothetical protein BCh11DRAFT_06803 [Burkholderia sp. Ch1-1]
Length = 518
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 217/525 (41%), Positives = 283/525 (53%), Gaps = 66/525 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P +V +S A L L+P P F FSG A A+PYA Y GHQ
Sbjct: 41 PAAPLSAPYVVGFSAETAALLGLEPGIENDPAFAELFSGNATREWPAEALPYASVYSGHQ 100
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+ LGE+ + R+ELQLKGAG+TPYSR DG AVLRSSIRE+LCSE
Sbjct: 101 FGVWAGQLGDGRALGLGEVEH-GGRRFELQLKGAGRTPYSRMGDGRAVLRSSIREYLCSE 159
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC+V + + V R+ E A+V RVA SF+RFG ++ S
Sbjct: 160 AMHHLGIPTTRALCVVGSDQPVRRETV-------ETAAVVTRVAPSFVRFGHFEHFYS-- 210
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ D +R LAD+ I + H + + Y A E
Sbjct: 211 NDRTDALRALADHVIERFYPHCREAD---------------------DPYLALLNEAVLS 249
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA L+ +WQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 250 TADLMVEWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDAGYICNHSDSQG-RYAYR 308
Query: 435 NQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMT 482
QP I WN+ + ++ K I+D A V+ + +F + M
Sbjct: 309 MQPQIAYWNLFCLAQGLLPLLGEKHEESVRGDKAIED--AQRVLGGFKDRFAPALERRMR 366
Query: 483 KKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDELLVPLKAVL 538
KLGL + + ++L M ++ D+T FR L+ V K D S ++ +
Sbjct: 367 AKLGLETERAGDDALANRLFEVMHANRADFTLTFRNLARVSKHDASGD-----AAVRDLF 421
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
LD + A+ +WV Y L D R MN VNPK+VLRN+L ++AI A+
Sbjct: 422 LD-----RAAFDAWVNDYRARLSEETREDAARAIAMNRVNPKFVLRNHLAETAIRRAKEK 476
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF EV RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 477 DFSEVERLAAVLRRPFDEQPEHEAYAGLPPDWA---SSLEVSCSS 518
>gi|134295943|ref|YP_001119678.1| hypothetical protein Bcep1808_1840 [Burkholderia vietnamiensis G4]
gi|166225448|sp|A4JEZ0.1|Y1840_BURVG RecName: Full=UPF0061 protein Bcep1808_1840
gi|134139100|gb|ABO54843.1| protein of unknown function UPF0061 [Burkholderia vietnamiensis G4]
Length = 522
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 219/535 (40%), Positives = 293/535 (54%), Gaps = 69/535 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYA 187
A +T++ P+A + P +V +S VA L L P F F+G A A+PYA
Sbjct: 35 AFHTRL-PAAPLPAPYVVGFSAEVAQLLGLPPSLAAHAQFAELFAGNPTRDWPAHALPYA 93
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSSI
Sbjct: 94 SVYSGHQFGVWAGQLGDGRALTIGELPGSDGRRYELQLKGGGRTPYSRMGDGRAVLRSSI 153
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG +
Sbjct: 154 REYLCSEAMHHLGIPTTRALTVIGSDQPVVREEI-------ETSAVVTRVSESFVRFGHF 206
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ S + DL +R LAD+ I + + + Y A
Sbjct: 207 EHFFSNDRPDL--LRRLADHVIERFYPACREAD---------------------DPYLAL 243
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
RTA +VAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +D
Sbjct: 244 LEAAMLRTADMVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHSDTS 303
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTK 472
G RY + QP I WN + L A + +DD +A V+ ++ +
Sbjct: 304 G-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHGIGDDDARAERAVDDAQA--VLAKFPER 360
Query: 473 FMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPED 528
F + M KLGL +++ ++ ++LL M D+T FR L+ + K D S
Sbjct: 361 FGPALERAMRAKLGLELEREHDAELANQLLETMHASHADFTLTFRRLAQLSKHDASRD-- 418
Query: 529 ELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLC 588
P++ + +D + A+ +W Y L D R A MN VNPKYVLRN+L
Sbjct: 419 ---APVRDLFID-----RAAFDAWANLYRARLSEETRDDAARAAAMNRVNPKYVLRNHLA 470
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 471 EVAIRRAKDKDFSEVERLAQILRRPFDEQPEHEPYAALPPDWA---GSLEVSCSS 522
>gi|91783539|ref|YP_558745.1| hypothetical protein Bxe_A2276 [Burkholderia xenovorans LB400]
gi|121957852|sp|Q13YZ6.1|Y2155_BURXL RecName: Full=UPF0061 protein Bxeno_A2155
gi|91687493|gb|ABE30693.1| Conserved hypothetical protein UPF0061 [Burkholderia xenovorans
LB400]
Length = 518
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 213/524 (40%), Positives = 282/524 (53%), Gaps = 64/524 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQ 194
P+A + P +V +S A L L+P P F FSG A A+PYA Y GHQ
Sbjct: 41 PAAPLSAPYVVGFSAETAALLGLEPGIENDPAFAELFSGNATREWPAEALPYASVYSGHQ 100
Query: 195 FGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSE 254
FG+WAGQLGDGRA+ LGE+ + R+ELQLKGAG+TPYSR DG AVLRSSIRE+LCSE
Sbjct: 101 FGVWAGQLGDGRALGLGEVEH-GGRRFELQLKGAGRTPYSRMGDGRAVLRSSIREYLCSE 159
Query: 255 AMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG 314
AMH LGIPTTRALC++ + + V R+ E A+V RVA SF+RFG ++ S
Sbjct: 160 AMHHLGIPTTRALCVIGSDQPVRRETV-------ETAAVVTRVAPSFVRFGHFEHFYS-- 210
Query: 315 QEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
+ D +R LAD+ I + H + + Y A E
Sbjct: 211 NDRTDALRALADHVIERFYPHCREAD---------------------DPYLALLNEAVIS 249
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
TA L+ +WQ VGF HGV+NTDNMSILGLTIDYGPFGF+D FD + N +D G RY +
Sbjct: 250 TADLMVEWQAVGFCHGVMNTDNMSILGLTIDYGPFGFMDGFDAGYICNHSDSQG-RYAYR 308
Query: 435 NQPDIGLWNI------------AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMT 482
QP I WN+ + ++ K I+D A V+ + +F + M
Sbjct: 309 MQPQIAYWNLFCLAQGLLPLLGEKHEESVRGDKAIED--AQRVLGGFKDRFAPALERRMR 366
Query: 483 KKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL 539
KLGL + + ++L M ++ D+T FR L+ V + + ++ + L
Sbjct: 367 AKLGLETERAGDDALANRLFEVMHANRADFTLTFRNLARVSKHDASGD----AAVRDLFL 422
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
D + A+ +WV Y L D R MN VNPK+VLRN+L ++AI A+ D
Sbjct: 423 D-----RAAFDAWVNDYRARLSEETREDAARAIAMNRVNPKFVLRNHLAETAIRRAKEKD 477
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
F EV RL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 478 FSEVERLAAVLRRPFDEQPEHEAYAGLPPDWA---SSLEVSCSS 518
>gi|238765268|ref|ZP_04626196.1| hypothetical protein ykris0001_43160 [Yersinia kristensenii ATCC
33638]
gi|238696491|gb|EEP89280.1| hypothetical protein ykris0001_43160 [Yersinia kristensenii ATCC
33638]
Length = 486
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 212/528 (40%), Positives = 288/528 (54%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P+ ++ + L YT + P+ ++ +L+ SE +A LELD F P ++ +G T
Sbjct: 8 PQFNNSYGQQLSGFYTHLQPTP-LKGARLLYHSEPLARELELDASWFTAPKAAVW-AGET 65
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFGMWAGQLGDGR I LGE + LKGAG TPYSR D
Sbjct: 66 LLPGMEPLAQVYSGHQFGMWAGQLGDGRGILLGEQQLSDGRHMDWHLKGAGLTPYSRMGD 125
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS +REFL SEA+H LGIPT+RAL +VT+ V R+ + E GA++ RVA
Sbjct: 126 GRAVLRSVVREFLASEALHHLGIPTSRALTIVTSDHPVYRE-------QAERGAMLLRVA 178
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q V+ LADY I H+ + G ED
Sbjct: 179 ESHVRFGHFEHFYYRQQPAQ--VKQLADYVIARHWPQL------------VGQED----- 219
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
Y W +V +RTA L+A WQ VGF HGV+NTDNMSILG+T+DYGPFGFLD + P
Sbjct: 220 ----SYLLWFTDVVKRTARLMAHWQTVGFAHGVMNTDNMSILGITMDYGPFGFLDDYAPG 275
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
+ N +D G RY F NQP + LWN+ + L+ L+ ++ + Y + M Y
Sbjct: 276 YICNHSDHQG-RYAFDNQPAVALWNLHRLGQALSG--LLTAEQLQRGLAAYEPELMAAYG 332
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + Q +++ LL+ M + DYT FR LS V+ + PL+
Sbjct: 333 QQMRTKLGFSERDSQDNDLLTGLLSLMIKEGRDYTRTFRLLSEVEIHSAQS------PLR 386
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + A+ SW Y L I D +R+ +M +VNP Y+LRNYL Q AID A
Sbjct: 387 DDFID-----RAAFDSWYSRYRARLQQESIDDAQRQQMMKAVNPHYILRNYLAQQAIDHA 441
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D ++RL + +++P+ +QP A LPP W +SCSS
Sbjct: 442 EKDDIQLLQRLHQALQQPFADQPEFNDLAELPPEWGKH---LEISCSS 486
>gi|365106795|ref|ZP_09335208.1| UPF0061 protein ydiU [Citrobacter freundii 4_7_47CFAA]
gi|363641779|gb|EHL81154.1| UPF0061 protein ydiU [Citrobacter freundii 4_7_47CFAA]
Length = 480
Score = 348 bits (893), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 290/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ ++N +L+ ++++A+ L + F+ P + G + L G P
Sbjct: 10 RDELPATYTALSPTP-LKNARLIWHNDALAEQLAIPAALFDIPTGAGVWGGESLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE ++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGSTFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVAQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVYRETV-------EAGAMLIRVAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ + ++KY
Sbjct: 182 HFEHFYYR--REPEKVRQLADFAIRHYWPQWQE---------------------EADKYQ 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD + P + N +D
Sbjct: 219 LWFNDVVTRTATLIADWQAVGFAHGVMNTDNMSILGLTMDYGPFGFLDDYVPDYICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP LWN+ + + TL + I + N ++ Y + Y M +KL
Sbjct: 279 NQG-RYSFDNQPAAALWNLQRLAQTL--SPFIPVEALNDALDSYQLALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + +++S+L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 336 GFFSEQKDDNELLSELFSLMARERSDYTRTFRMLSETE------QHSAQSPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D R+ M + NP VLRN+L Q AI AE GD+ E
Sbjct: 388 ---RAAFDDWFTRYRSRLQQDNIADAVRQTQMKAANPAMVLRNWLAQRAISQAEQGDYAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHQALRTPFADRD--DDYASRPPDWGKRLEV---SCSS 480
>gi|160898743|ref|YP_001564325.1| hypothetical protein Daci_3302 [Delftia acidovorans SPH-1]
gi|160364327|gb|ABX35940.1| protein of unknown function UPF0061 [Delftia acidovorans SPH-1]
Length = 510
Score = 348 bits (893), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 219/522 (41%), Positives = 291/522 (55%), Gaps = 56/522 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T + P+ + P +A S A+ L LDP+ + +G L G+ P A Y G
Sbjct: 34 FTHLRPT-PLPEPHWIATSTGTAELLGLDPQWLASDEALQALTGNAVLPGSHPLASVYSG 92
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI LGE + E+QLKGAG+TPYSR DG AVLRSSIREFLC
Sbjct: 93 HQFGVWAGQLGDGRAILLGE----TASGHEIQLKGAGRTPYSRMGDGRAVLRSSIREFLC 148
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + + R+ + E A+V RVA SF+RFG ++ A+
Sbjct: 149 SEAMHALGIPTTRALSLTGSPAPIRRE-------EIETAAVVARVAPSFIRFGHFEHFAA 201
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R Q + +R LADY I H++ E + L N YA + V+
Sbjct: 202 RDQ--IAPLRQLADYVIDHYY-----------------PECRTAEALAGNAYANFLQAVS 242
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+A WQ VGF HGV+NTDNMSILGLTIDYGPF FLDAF+P N +D G RY
Sbjct: 243 ERTARLLAHWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFNPGHICNHSDTQG-RYA 301
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANY-VMERYGTKFMDEYQAIMTKKLGLPK-- 489
F QP + WN+ + A LI ++E +E Y F Y A+M +KLGLP+
Sbjct: 302 FNRQPQVAYWNL--YCLGQALLPLIGEEELTIAALESYKQVFPQAYGALMLRKLGLPEDA 359
Query: 490 --------YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
+++ LL MA + VDYT FF L++ A + + L P++ ++LD
Sbjct: 360 PGTPPAEGRFAALVNPLLQLMADNAVDYTIFFSRLTDAVAAGAGAGTD-LEPVRDLVLD- 417
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+EA+ W Y + L +G ALM NP++VLRN+L + AI AA+ GDF
Sbjct: 418 ----REAFDRWAALYARHL--AGTDAAAAAALMQESNPRFVLRNHLGEMAIRAAKAGDFA 471
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
VR+LL +++ P+ ++A PP WA +SCSS
Sbjct: 472 PVRQLLAVLQTPFAPHAEHAEWAGFPPDWA---SSIEISCSS 510
>gi|440287359|ref|YP_007340124.1| hypothetical protein D782_1951 [Enterobacteriaceae bacterium strain
FGI 57]
gi|440046881|gb|AGB77939.1| hypothetical protein D782_1951 [Enterobacteriaceae bacterium strain
FGI 57]
Length = 480
Score = 348 bits (893), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 208/521 (39%), Positives = 288/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L Y++++P+ ++N +L+ + +AD L + F + G L G P
Sbjct: 10 RDELPGFYSELNPTP-LQNARLIWHNTPLADELGIASSLFAPERGAGVWGGEALLPGMKP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGTSLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
++RE L SEAMH+LGIPTTRAL +VT+ + R+ E GA++ R+AQS +RFG
Sbjct: 129 TLRESLASEAMHYLGIPTTRALSIVTSDTPIQRE-------NVEQGAMLMRIAQSHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R ++D V+ LAD+ IRH++ H++ +++YA
Sbjct: 182 HFEHFYYR--REMDKVQQLADFVIRHYWPHLQQ---------------------EADRYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RT ++A+WQ VGF HGV+NTDNMSILGLTIDYGPFGFLD + P F N +D
Sbjct: 219 LWFRDVVTRTGQMIARWQTVGFAHGVMNTDNMSILGLTIDYGPFGFLDDYQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L+A ID N ++ Y EY M +KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSLSA--FIDVDTLNDALDGYQLALFSEYGTRMRQKL 335
Query: 486 GLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL + +++ L MA + DYT FR LS + + PL+ +D
Sbjct: 336 GLFTQEVGDNDLLNALFALMAREGSDYTRTFRMLSETEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y + GI D R+ M VNP VLRN+L Q AI+ AE GD+ E
Sbjct: 388 ---RAAFDSWFAQYRVRIQPEGIDDAIRQQAMKQVNPAMVLRNWLAQRAIETAEKGDYQE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + YA+ PP W R V SCSS
Sbjct: 445 LHRLHEALRNPWVDRD--DDYAQRPPDWGKRLEV---SCSS 480
>gi|386015649|ref|YP_005933931.1| hypothetical protein PAJ_1055 [Pantoea ananatis AJ13355]
gi|327393713|dbj|BAK11135.1| hypothetical UPF0061 protein YdiU [Pantoea ananatis AJ13355]
Length = 478
Score = 348 bits (893), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 213/539 (39%), Positives = 298/539 (55%), Gaps = 67/539 (12%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+S+ RELPG YT ++P+ + +L+ + +A ++ LD F
Sbjct: 4 DNSWFRELPG--------------SYTALNPTP-LAGGRLLYHNAPLAKAMALDSALFSG 48
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
+++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE R + LKG
Sbjct: 49 QGHGVWY-GAALLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQRQEDGRRLDWHLKG 107
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR DG AV+RS++REFL SEA+H LGIPTTRAL L + + V R+
Sbjct: 108 AGLTPYSRMGDGRAVVRSTVREFLASEALHHLGIPTTRALTLAVSDEPVYRE-------T 160
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
E GA++ R+A S LRFG ++ H Q+ + V+ LADYAIRHH+ +
Sbjct: 161 AERGAMLMRIAPSHLRFGHFE-HFFYSQQP-EQVKQLADYAIRHHWPQL----------- 207
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
VD +++Y W ++ RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYG
Sbjct: 208 ---------VD-EADRYQLWFADIVLRTARLIAQWQSVGFAHGVMNTDNMSILGLTLDYG 257
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+GFLD + P + N +D G RY F NQP IGLWN+ + + L+ L+ ++ +
Sbjct: 258 PYGFLDDYQPDYICNHSDYQG-RYSFENQPMIGLWNLNRLAHALSG--LMSTEQLKQALS 314
Query: 468 RYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y M + M KLGL + +I+++LL+ M+ ++ DYT FR LS+ +
Sbjct: 315 GYENALMRVWGERMRAKLGLLTADAGDNEILTELLSLMSQERSDYTLTFRLLSDTE---- 370
Query: 525 IPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLR 584
+ E PL+ +D + A+ W Y Q LL + D ER+ +M + NP VLR
Sbjct: 371 --QAESRSPLRDEFID-----RSAFDRWYQRYRQRLLQEQVGDAERQQVMKAANPAVVLR 423
Query: 585 NYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
NYL Q ID AE G+ G + RL + +++P+ + E Y + PP W +SCSS
Sbjct: 424 NYLAQQVIDEAEKGESGALARLHQALQQPFSDAAAAE-YRQRPPDWG---KTLEVSCSS 478
>gi|388568335|ref|ZP_10154755.1| hypothetical protein Q5W_3098 [Hydrogenophaga sp. PBC]
gi|388264535|gb|EIK90105.1| hypothetical protein Q5W_3098 [Hydrogenophaga sp. PBC]
Length = 496
Score = 348 bits (893), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 215/503 (42%), Positives = 285/503 (56%), Gaps = 53/503 (10%)
Query: 146 QLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDG 205
LV+ + +A +L LDP + D FSG+ P+ GA P A Y GHQFG+WAGQLGDG
Sbjct: 42 HLVSLNAPLAQALGLDPARLRQDDAVRAFSGSLPIEGARPLATVYSGHQFGVWAGQLGDG 101
Query: 206 RAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTR 265
RA+ LGE L+ + E+Q KGAG+TPYSR DG AVLRSSIRE+LCSEAMH LGIPTTR
Sbjct: 102 RALLLGE-LDTPAGPMEIQFKGAGRTPYSRMGDGRAVLRSSIREYLCSEAMHGLGIPTTR 160
Query: 266 ALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLA 325
AL + + + V R+ E ++V RVA SF+RFG ++ ++ G D +R LA
Sbjct: 161 ALIVTGSPQPVIRETV-------ESASVVTRVAPSFIRFGHFEHFSANGLAD--ELRRLA 211
Query: 326 DYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGV 385
D+ I +F G + N YA V+ RTA L+AQWQ V
Sbjct: 212 DFVID---------------AFYPG-----CREAGGNPYARLLEAVSARTADLLAQWQAV 251
Query: 386 GFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIA 445
GF HGV+NTDNMS+LGLTIDYGPF FLDAF+P+ N +D G RY + QP++ WN+
Sbjct: 252 GFCHGVMNTDNMSVLGLTIDYGPFQFLDAFNPAHICNHSD-HGGRYAYHRQPNVAYWNL- 309
Query: 446 QFSTTLAAAKLIDD-KEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNN 501
F A L+DD ++A +E Y T+F M KLGL + + +I +L+
Sbjct: 310 -FCLGQALLPLMDDQQQALDALEPYKTRFPAALTQRMGAKLGLADTREGDAALIEELMQL 368
Query: 502 MAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELL 561
MA D VD+T FR L + + P +L + ++E++ +W + + L
Sbjct: 369 MAKDAVDFTILFRRLCDALEGAAEPVRDLFL------------QRESFDAWAARWRERLQ 416
Query: 562 SS-GISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGM 620
+ G A M VNP+ VLRN+L Q AI AE GDFGEV RLLK + PYDE+ G
Sbjct: 417 AQPGFDAAATAAAMRRVNPRIVLRNHLAQIAIQRAEQGDFGEVDRLLKALSAPYDERKGE 476
Query: 621 EKYARLPPAWAYRPGVCMLSCSS 643
+ A PP WA + +SCSS
Sbjct: 477 DDLAAFPPDWAQQ---IEISCSS 496
>gi|445497018|ref|ZP_21463873.1| hypothetical protein UPF0061 [Janthinobacterium sp. HH01]
gi|444787013|gb|ELX08561.1| hypothetical protein UPF0061 [Janthinobacterium sp. HH01]
Length = 465
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 216/507 (42%), Positives = 282/507 (55%), Gaps = 57/507 (11%)
Query: 145 PQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGD 204
P LVA S A+ + L P + + A P A+P A Y GHQFG+WAGQLGD
Sbjct: 8 PYLVAVSAPAAELVGLTPAQVAD-SLDVLIGNAAP-ERALPLAAVYSGHQFGVWAGQLGD 65
Query: 205 GRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTT 264
GRA+ G++ ELQ KGAG TPYSR DG AVLRSSIREFLCSEAMH LGIPT+
Sbjct: 66 GRAMLFGDVATAVGPM-ELQWKGAGLTPYSRMGDGRAVLRSSIREFLCSEAMHGLGIPTS 124
Query: 265 RALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTL 324
RAL + + + V R+ E A+V R+A +F+RFGS++ R + D ++ L
Sbjct: 125 RALSVAGSDQGVMRETV-------ETSAVVVRMAPTFVRFGSFEHWFYRNKNDE--LKIL 175
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
ADY I + + +ED N Y A EV RTA ++A WQ
Sbjct: 176 ADYVIERFYPALR-------------EED--------NPYQALLAEVTRRTAHMIAHWQA 214
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N TD G RY +ANQP +G WN
Sbjct: 215 VGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDSDHICNHTDQQG-RYSYANQPQVGHWNC 273
Query: 445 AQFSTTLAAAKLIDDKEANY-VMERYGTKFMDEYQAIMTKKLGLPKY------NKQIISK 497
++ A LI + EA ++ Y F + ++ KLGL + ++ +
Sbjct: 274 --YALGQALLPLIGEVEATQAALDVYQPAFAAKMDELLRAKLGLSQLAHLADADRTLFDA 331
Query: 498 LLNNMAVDKVDYTNFFRALSNVK-ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSY 556
+ M + +D+T FFR LS +K AD S E PL+ + +D + A +W Y
Sbjct: 332 MFALMDANHIDFTLFFRRLSGLKAADASGDE-----PLRDLFID-----RPAIDAWATQY 381
Query: 557 IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDE 616
L + D R+ MN VNPK+VLRNYL Q AI+ A+ DF EV RLL +++RPYDE
Sbjct: 382 RARLQAEASDDSARQLAMNKVNPKFVLRNYLAQIAIEKAQNKDFTEVERLLSVLQRPYDE 441
Query: 617 QPGMEKYARLPPAWAYRPGVCMLSCSS 643
QP ++YA LPP WA V SCSS
Sbjct: 442 QPEHDQYAALPPDWASHLEV---SCSS 465
>gi|378767470|ref|YP_005195938.1| hypothetical protein PANA5342_2508 [Pantoea ananatis LMG 5342]
gi|365186951|emb|CCF09901.1| hypothetical protein PANA5342_2508 [Pantoea ananatis LMG 5342]
Length = 478
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 213/539 (39%), Positives = 298/539 (55%), Gaps = 67/539 (12%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+S+ RELPG YT ++P+ + +L+ + +A ++ LD F
Sbjct: 4 DNSWFRELPG--------------SYTALNPTP-LAGGRLLYHNAPLAKAMALDSALFSG 48
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
+++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE R + LKG
Sbjct: 49 QGHGVWY-GAALLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQRQEDGRRLDWHLKG 107
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR DG AV+RS++REFL SEA+H LGIPTTRAL L + + V R+
Sbjct: 108 AGLTPYSRMGDGRAVVRSTVREFLASEALHHLGIPTTRALTLAVSDEPVYRE-------T 160
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
E GA++ R+A S LRFG ++ H Q+ + V+ LADYAIRHH+ +
Sbjct: 161 AERGAMLMRIAPSHLRFGHFE-HFFYSQQP-EQVKQLADYAIRHHWPQL----------- 207
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
VD +++Y W ++ RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYG
Sbjct: 208 ---------VD-EADRYQLWFADIVLRTARLIAQWQSVGFAHGVMNTDNMSILGLTLDYG 257
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+GFLD + P + N +D G RY F NQP IGLWN+ + + L+ L+ ++ +
Sbjct: 258 PYGFLDDYQPDYICNHSDYQG-RYSFENQPMIGLWNLNRLAHALSG--LMTTEQLKQALS 314
Query: 468 RYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y M + M KLGL + +I+++LL+ M+ ++ DYT FR LS+ +
Sbjct: 315 GYENALMRVWGERMRAKLGLLTADAGDNEILTELLSLMSQERSDYTLTFRLLSDTQ---- 370
Query: 525 IPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLR 584
+ E PL+ +D + A+ W Y Q LL + D ER+ +M + NP VLR
Sbjct: 371 --QAESRSPLRDEFID-----RSAFDRWYQRYRQRLLQEQVGDAERQQVMKAANPAVVLR 423
Query: 585 NYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
NYL Q ID AE G+ G + RL + +++P+ + E Y + PP W +SCSS
Sbjct: 424 NYLAQQVIDEAEKGESGALARLHQALQQPFSDAAAAE-YRQRPPDWG---KTLEVSCSS 478
>gi|322833515|ref|YP_004213542.1| hypothetical protein Rahaq_2812 [Rahnella sp. Y9602]
gi|384258649|ref|YP_005402583.1| hypothetical protein Q7S_14005 [Rahnella aquatilis HX2]
gi|321168716|gb|ADW74415.1| protein of unknown function UPF0061 [Rahnella sp. Y9602]
gi|380754625|gb|AFE59016.1| hypothetical protein Q7S_14005 [Rahnella aquatilis HX2]
Length = 484
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 206/528 (39%), Positives = 294/528 (55%), Gaps = 48/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + + L YT++ P+ ++ +L+ SE +A L LD F+ ++ G
Sbjct: 2 PRFEHHYADQLPDFYTQLQPTP-LKGARLLYHSEPLARELGLDDSLFD-AQHREYWCGEK 59
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
G P AQ Y GHQFG WAGQLGDGR I LGE + +R++ LKGAG TPYSR D
Sbjct: 60 LFPGMQPLAQVYSGHQFGQWAGQLGDGRGILLGEQVLPSGKRFDWHLKGAGLTPYSRMGD 119
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS +REFL SEA+H L +PTTRAL + T+ + V R+ + E GA++ RVA
Sbjct: 120 GRAVLRSVVREFLASEALHHLSVPTTRALTIATSDEPVFRE-------QPERGAMLIRVA 172
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q + VR LADY I HH+ + +SE +
Sbjct: 173 ESHVRFGHFEHFYYRKQP--EHVRQLADYVIAHHW---PRLLESEPVD------------ 215
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
+++Y W V ERTA+L+AQWQ +GF HGV+NTDNMSILGLTIDYGP+GFLD + P
Sbjct: 216 --ASRYQQWFTSVVERTAALIAQWQSIGFAHGVMNTDNMSILGLTIDYGPYGFLDDYKPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
+ N +D G RY + NQP + WN+ + + TL+ L+ ++ + Y M Y
Sbjct: 274 YICNHSDHQG-RYSYDNQPAVAYWNLHRLAQTLSG--LMSTEQLQTALGEYEPALMRAYG 330
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
+M KLG NKQ +++ LL+ MA + D+T FR LS + + + PL+
Sbjct: 331 TLMRGKLGFFTENKQDNDLLTGLLSLMAKEGRDFTQTFRLLSQTE------QQQAASPLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D ++A+ SW +Y L + I D R+ M NP+ +LRNYL Q AI+ A
Sbjct: 385 DEFID-----RDAFDSWYQAYRHRLQTEDIDDATRQDAMKQSNPRIILRNYLAQKAIERA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ D + +L + + PY + P ++ A+LPP W +SCSS
Sbjct: 440 EVDDISALEQLHQALRDPYSDAPQYDEMAKLPPDWGKH---LEISCSS 484
>gi|424932965|ref|ZP_18351337.1| UPF0061 protein [Klebsiella pneumoniae subsp. pneumoniae KpQ3]
gi|407807152|gb|EKF78403.1| UPF0061 protein [Klebsiella pneumoniae subsp. pneumoniae KpQ3]
Length = 480
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 212/521 (40%), Positives = 282/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + S+A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNASLAQQLGVPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|386079605|ref|YP_005993130.1| SelO family protein YdiU [Pantoea ananatis PA13]
gi|354988786|gb|AER32910.1| SelO family protein YdiU [Pantoea ananatis PA13]
Length = 478
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 213/539 (39%), Positives = 298/539 (55%), Gaps = 67/539 (12%)
Query: 108 DHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFER 167
D+S+ RELPG YT ++P+ + +L+ + +A ++ LD F
Sbjct: 4 DNSWFRELPG--------------SYTALNPTP-LAGGRLLYHNAPLAKAMALDSALFSG 48
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
+++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE R + LKG
Sbjct: 49 QGHGVWY-GAALLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQRQEDGRRLDWHLKG 107
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSR DG AV+RS++REFL SEA+H LGIPTTRAL L + + V R+
Sbjct: 108 AGLTPYSRMGDGRAVVRSTVREFLASEALHHLGIPTTRALTLAVSDEPVYRE-------T 160
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
E GA++ R+A S LRFG ++ H Q+ + V+ LADYAIRHH+ +
Sbjct: 161 AERGAMLMRIAPSHLRFGHFE-HFFYSQQP-EQVKQLADYAIRHHWPQL----------- 207
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
VD +++Y W ++ RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYG
Sbjct: 208 ---------VD-EADRYQLWFADIVLRTARLIAQWQSVGFAHGVMNTDNMSILGLTLDYG 257
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVME 467
P+GFLD + P + N +D G RY F NQP IGLWN+ + + L+ L+ ++ +
Sbjct: 258 PYGFLDDYQPDYICNHSDYQG-RYSFENQPMIGLWNLNRLAHALSG--LMTTEQLKQALS 314
Query: 468 RYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y M + M KLGL + +I+++LL+ M+ ++ DYT FR LS+ +
Sbjct: 315 GYENALMRVWGERMRAKLGLLTADAGDNEILTELLSLMSQERSDYTLTFRLLSDTQ---- 370
Query: 525 IPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLR 584
+ E PL+ +D + A+ W Y Q LL + D ER+ +M + NP VLR
Sbjct: 371 --QAESRSPLRDEFID-----RSAFDRWYQRYRQRLLQERVGDAERQQVMKAANPAVVLR 423
Query: 585 NYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
NYL Q ID AE G+ G + RL + +++P+ + E Y + PP W +SCSS
Sbjct: 424 NYLAQQVIDEAEKGESGALARLHQALQQPFSDAAAAE-YRQRPPDWG---KTLEVSCSS 478
>gi|304397628|ref|ZP_07379505.1| protein of unknown function UPF0061 [Pantoea sp. aB]
gi|304354800|gb|EFM19170.1| protein of unknown function UPF0061 [Pantoea sp. aB]
Length = 483
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 215/542 (39%), Positives = 297/542 (54%), Gaps = 69/542 (12%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF 165
+D+++ REL G CYT ++P+ + +L+ + +A S+ LDP+ F
Sbjct: 7 TFDNTWFRELTG--------------CYTALNPTP-LTGGRLLYHNAPLATSMGLDPELF 51
Query: 166 ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQL 225
++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE + + L
Sbjct: 52 AGNGHDVW-HGAALLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLEDGSKLDWHL 110
Query: 226 KGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGN 285
KGAG TPYSR DG AV+RSS+REFL SEA+H LGIPTTRAL L + V R+
Sbjct: 111 KGAGLTPYSRMGDGRAVIRSSVREFLASEALHHLGIPTTRALTLSIGDEPVYRE------ 164
Query: 286 PKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
E GA++ R++ S LRFG ++ S+ QE V+ LADYAIRHH+ H+E
Sbjct: 165 -TTERGAMLMRISPSHLRFGHFEHFFYSQQQEK---VQQLADYAIRHHWPHLEA------ 214
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
+++Y W ++ RTA L+A WQ VGF HGV+NTDNMSILGLTI
Sbjct: 215 ---------------EADRYQQWFTDIVLRTARLIALWQSVGFAHGVMNTDNMSILGLTI 259
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGPFGFLD + P F N +D G RY F NQP IGLWN+ + + L+ L+ ++
Sbjct: 260 DYGPFGFLDDYQPDFICNHSDYQG-RYSFENQPMIGLWNLNRLAHALSG--LLTTEQLRT 316
Query: 465 VMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ Y + M + M KLGL + +I++ LL M + DYT F LS +
Sbjct: 317 ALSAYEPELMRVWGERMRAKLGLLTQQSNDNEILTDLLALMTQEHSDYTLTFLLLSETQ- 375
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+ E PL+ +D +EA+ W Y L+ +SD ER+A+M + NP
Sbjct: 376 -----QAESRSPLRDEFID-----REAFDGWYQRYRSRLMDEQVSDTERQAVMKAANPAV 425
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
+LRNYL Q AI+ AE G+ G + RL + +++P+ ++ E Y + PP W +SC
Sbjct: 426 ILRNYLAQQAIEEAERGEQGALARLHQALQQPFSDETAAE-YRQRPPDWG---KTLEVSC 481
Query: 642 SS 643
SS
Sbjct: 482 SS 483
>gi|365849728|ref|ZP_09390196.1| hypothetical protein HMPREF0880_03742 [Yokenella regensburgei ATCC
43003]
gi|364568053|gb|EHM45698.1| hypothetical protein HMPREF0880_03742 [Yokenella regensburgei ATCC
43003]
Length = 480
Score = 348 bits (892), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 212/523 (40%), Positives = 289/523 (55%), Gaps = 57/523 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ +EN +L+ +ES+A L ++P F + G T L G P
Sbjct: 10 RDELPGFYTALAPTP-LENARLIWHNESLAAELGVEPSLFVPSTGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE +R + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLANGKRVDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIREALASEAMHGLGIPTTRALSIVTSDTPVYRETV-------EQGAMLMRIAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+ IRHH+ + + +KY
Sbjct: 182 HFEHFYYR--REPEKVQQLADFVIRHHWPELAS---------------------REDKYV 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A+WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 TWFRDVVTRTAQMIARWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + ID N ++ Y EY M KL
Sbjct: 279 HQG-RYSFENQPAVGLWNLQRLAQSL--SPFIDVDALNDALDDYQRALFTEYGQRMRAKL 335
Query: 486 GLPKYNKQ-----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
G Y +Q +++ L M+ + D+T FR L + + PL+ +D
Sbjct: 336 GF--YTEQSGDNDLLNDLFALMSSEGSDFTRTFRQLGETEQLSAAS------PLRDEFID 387
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+ A+ +W Y + L G+SD ER+ M +VNP VLRN+L Q AI+ AE GD
Sbjct: 388 -----RAAFDAWFSRYRERLQLDGVSDAERQQRMQAVNPAMVLRNWLAQRAIEQAEKGDM 442
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ RL + + P+ ++ + Y R PP W R V SCSS
Sbjct: 443 QELYRLHEALRSPFADRD--DDYVRRPPDWGKRLEV---SCSS 480
>gi|238749459|ref|ZP_04610964.1| hypothetical protein yrohd0001_27760 [Yersinia rohdei ATCC 43380]
gi|238712114|gb|EEQ04327.1| hypothetical protein yrohd0001_27760 [Yersinia rohdei ATCC 43380]
Length = 504
Score = 348 bits (892), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 213/520 (40%), Positives = 282/520 (54%), Gaps = 52/520 (10%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPY 186
+ L YT + P+ ++ +L+ SE +A LELD F P ++ +G L G P
Sbjct: 34 QQLSGFYTPLQPTP-LQGARLLYHSEPLAQELELDASWFSAPKSAVW-AGERVLPGMKPL 91
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
AQ Y GHQFGMWAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 92 AQVYSGHQFGMWAGQLGDGRGILLGEQQLSDGRSMDWHLKGAGLTPYSRMGDGRAVLRSV 151
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFL SEA+H LGIPT+RAL +VT+ V R+ + E GA++ RVA+S +RFG
Sbjct: 152 IREFLASEALHHLGIPTSRALTIVTSDHPVYRE-------QAERGAMLLRVAESHVRFGH 204
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ R Q V+ LADY I H+ G ED Y
Sbjct: 205 FEHFYYRQQPAQ--VKQLADYVIARHWPQW------------AGQED---------GYLL 241
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
W +V +RTA L+A WQ VGF HGV+NTDNMSILG+T+DYGPFGFLD +DP + N +D
Sbjct: 242 WFTDVVKRTARLMAHWQTVGFAHGVMNTDNMSILGITMDYGPFGFLDDYDPGYICNHSDH 301
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG 486
G RY F NQP + LWN+ + L + L+ ++ + Y + M Y M KLG
Sbjct: 302 QG-RYAFDNQPAVALWNLHRLGQAL--SDLLSAEQLQQGLAAYEPELMAAYGQQMRAKLG 358
Query: 487 LPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
+ + Q +++ L+ M +K DYT FR LS V+ S L+ +D
Sbjct: 359 FSQSDSQDNDVLTGFLSLMIKEKRDYTRSFRLLSEVEMQSSHS------ALRDDFID--- 409
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
+ A+ SW Y L I D ER+ LM +VNP Y+LRNYL Q AID+AE D +
Sbjct: 410 --RAAFDSWYRRYRARLQQESIDDAERQQLMKAVNPHYILRNYLAQLAIDSAEKDDIQPL 467
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+RL + +++P+ + P A LPP W +SCSS
Sbjct: 468 QRLHQALQQPFADNPEFNDLAALPPDWGKH---LEISCSS 504
>gi|71909647|ref|YP_287234.1| hypothetical protein Daro_4038 [Dechloromonas aromatica RCB]
gi|121957897|sp|Q478G7.1|Y4038_DECAR RecName: Full=UPF0061 protein Daro_4038
gi|71849268|gb|AAZ48764.1| Protein of unknown function UPF0061 [Dechloromonas aromatica RCB]
Length = 499
Score = 347 bits (891), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 216/516 (41%), Positives = 284/516 (55%), Gaps = 43/516 (8%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT++ P E P +V S VAD L L + P F F+G L G+ P A Y
Sbjct: 24 AFYTRLEPHPLPE-PYVVGVSTEVADLLGLPAELMNSPQFAEIFAGNRLLPGSEPLAAVY 82
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LG + N + WE+QLKGAG+TPYSR ADG AVLRSSIREF
Sbjct: 83 SGHQFGVWAGQLGDGRAHLLGGLRNDQGH-WEIQLKGAGRTPYSRGADGRAVLRSSIREF 141
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LG+PTTRALC++ + V R+ E A+V RVA F+RFGS++
Sbjct: 142 LCSEAMAGLGVPTTRALCVIGADQPVRREEI-------ETAALVARVAPGFVRFGSFEHW 194
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
ASR + ++ LADY I +F D N Y A +
Sbjct: 195 ASRDRS--RELQQLADYVID---------------TFRPACRD------AENPYDALLRD 231
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
++ RT L+A W VGF HGV+NTDNMSILGLT+DYGPFGF++AFD N +D G R
Sbjct: 232 ISRRTGELIAHWMAVGFMHGVMNTDNMSILGLTLDYGPFGFMEAFDAGHICNHSDHQG-R 290
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y + NQP + WN+ + D V E YG F ++ +M KLGL
Sbjct: 291 YTYRNQPHVAQWNLYCLADAFLPLLKHPDISRVAVDETYGDAFAQTFERLMCAKLGLRHA 350
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
++ I + + + D+T FFR LS + + E + A L D+ +R
Sbjct: 351 LPDDENFIGETFGFLQQHRPDFTLFFRRLSRLSGG---LDGEAMAKADAPLRDLFVDRA- 406
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
A +W+ ++ L + D ER+A M + NPKYVLRN+L ++AI A+L D+ +V+RLL
Sbjct: 407 ACDAWLANWRARLAQTPWDDGERQASMLAANPKYVLRNWLAEAAIRKAKLKDYSDVQRLL 466
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RPYDEQP + A LPP WA +SCSS
Sbjct: 467 TCLRRPYDEQPEFDDLAALPPDWA---SGLEVSCSS 499
>gi|398806822|ref|ZP_10565721.1| hypothetical protein PMI15_04590 [Polaromonas sp. CF318]
gi|398087187|gb|EJL77784.1| hypothetical protein PMI15_04590 [Polaromonas sp. CF318]
Length = 501
Score = 347 bits (891), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 220/515 (42%), Positives = 289/515 (56%), Gaps = 55/515 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT++ P+ + +P V S + A L L E +G L GA P A Y G
Sbjct: 38 YTELQPTP-LPSPYWVGKSRAFARELGLADNWLESAGTLEALTGNRLLPGARPLASVYSG 96
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRA+ LGEI + + E+QLKGAGKTPYSR DG AVLRSSIREFLC
Sbjct: 97 HQFGVWAGQLGDGRALLLGEIDTPRGPQ-EIQLKGAGKTPYSRMGDGRAVLRSSIREFLC 155
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRALC+ + V R+ E A+V R+A SF+RFG ++ +
Sbjct: 156 SEAMHGLGIPTTRALCVTGSDAPVRREEI-------ETAAVVTRLAPSFIRFGHFEHFSY 208
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
GQ ++ LADY I D + YAA V+
Sbjct: 209 TGQHAQ--LKALADYVI---------------------DRFYPDCREAPQPYAALLEAVS 245
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+A WQ VGF HGV+NTDNMSILGLTIDYGPF FLDAFDP+ N +D G RY
Sbjct: 246 ERTAHLMAAWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFDPNHICNHSDAQG-RYA 304
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPKY- 490
+ QP++ WN+ F A +I ++E A +E Y T F D A M KLGL +
Sbjct: 305 YNRQPNMAYWNL--FCLGQALLPVIGEQELALAALEPYKTLFPDALYARMRTKLGLAEER 362
Query: 491 --NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+K ++ +A +KVDY+ F+R L+ P + P++ + D +E+
Sbjct: 363 PDDKALVDNCFKLLAANKVDYSIFWRRLNGFT--PQSGHE----PVRDLFFD-----RES 411
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ +W L Y ++L +G+ E+R LM+ NPK+VLRN+L + AI AA+L DF V LL
Sbjct: 412 FNAWALQYSEQL--AGVDPEQRAGLMHRSNPKFVLRNHLGEEAIRAAKLKDFSGVDTLLA 469
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L++ P +E PG E +A PP WA +SCSS
Sbjct: 470 LLQSPCEEHPGHESFAGFPPDWA---SSIEISCSS 501
>gi|419763546|ref|ZP_14289789.1| hypothetical protein UUU_22750 [Klebsiella pneumoniae subsp.
pneumoniae DSM 30104]
gi|397743475|gb|EJK90690.1| hypothetical protein UUU_22750 [Klebsiella pneumoniae subsp.
pneumoniae DSM 30104]
Length = 480
Score = 347 bits (891), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ ++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQG---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVTRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNVALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + ++ PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAVS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|218548721|ref|YP_002382512.1| hypothetical protein EFER_1358 [Escherichia fergusonii ATCC 35469]
gi|226725732|sp|B7LQ82.1|YDIU_ESCF3 RecName: Full=UPF0061 protein YdiU
gi|218356262|emb|CAQ88879.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469]
Length = 480
Score = 347 bits (891), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 209/523 (39%), Positives = 292/523 (55%), Gaps = 57/523 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A +T ++P+ + N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPATWTALNPTP-LHNARLIWHNAELAHELAIPQSLFADNKGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIP TR+L +VT+ V R+ E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPGTRSLAIVTSDTPVYRE-------TTETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R D++ V+ LAD+AIRH++ H++ +KYA
Sbjct: 182 HFEHFYYR--RDIEKVQLLADFAIRHYWPHLQE---------------------EQDKYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+A WQ VGF HGV+NTDNMSI+GLT+DYGPFGFLD ++P F N +D
Sbjct: 219 IWFRDVVARTASLIAGWQTVGFAHGVMNTDNMSIMGLTLDYGPFGFLDDYNPQFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL + I N ++ Y + Y M +KL
Sbjct: 279 HQG-RYSFDNQPAVALWNLQRLAQTL--SPFIAVNALNDALDSYKQVLLAVYGKRMRQKL 335
Query: 486 GLPKYNKQ-----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
G Y +Q ++++L MA + DYT FR LS + + + PL+ +D
Sbjct: 336 GF--YTEQNNDNDLLNELFALMAREGSDYTRTFRMLSQTEQNSASS------PLRDEFID 387
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+ A+ SW Y + + ++D+ER+ M SVNP VLRN+L Q AI+ A+ GD
Sbjct: 388 -----RAAFDSWFSRYRARIQTEQVTDDERQLQMKSVNPAVVLRNWLAQRAINDAQKGDM 442
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ RL ++ P++++ + Y+R PP W R V SCSS
Sbjct: 443 EELHRLHDVLRNPFNDRD--DDYSRRPPEWGKRLEV---SCSS 480
>gi|167569616|ref|ZP_02362490.1| hypothetical protein BoklC_07238 [Burkholderia oklahomensis C6786]
Length = 521
Score = 347 bits (891), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 224/547 (40%), Positives = 295/547 (53%), Gaps = 71/547 (12%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L + P+A + P +V +S+ A L LDP + P F F G
Sbjct: 24 PRDDAFLQ--LGTAFLTRLPAAPLPAPYVVGFSDEAARMLGLDPALRDAPGFAELFCG-N 80
Query: 179 PLAG----AVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P ++PYA Y GHQFG+WAGQLGDGRA+T+GEI + R+ELQLKGAG+TPYS
Sbjct: 81 PTRDWQPTSLPYASVYSGHQFGVWAGQLGDGRALTIGEIEH-GGRRYELQLKGAGRTPYS 139
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSS+REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 140 RMGDGRAVLRSSVREFLCSEAMHHLGIPTTRALAVIGSDQPVIREAI-------ETSAVV 192
Query: 295 CRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RVA+SF+RFG ++ + + DL +R LAD+ I + S D D
Sbjct: 193 TRVAESFVRFGHFEHFFANDRPDL--LRALADHVIDRFYP-------------SCRDAD- 236
Query: 355 SVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDA 414
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGFLDA
Sbjct: 237 -------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFLDA 289
Query: 415 FDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDD 459
FD N +D G RY + QP I WN + L A + ++D
Sbjct: 290 FDAKHICNHSDTHG-RYAYRMQPRIAHWNCFCLAQALLPLFGLHRDAPNEDARAERAVED 348
Query: 460 KEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRAL 516
A V+ R+ +F + M KLGL + + + ++LL M D+T FR L
Sbjct: 349 AHA--VLGRFPEQFGPALERAMRAKLGLELEREGDAALANQLLEIMDASHADFTLTFRRL 406
Query: 517 SNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNS 576
+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 407 ARVSKHDARGD----APVRDLFID-----RDAFDRWANLYHARLSDEARDDATRAAAMNR 457
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGV 636
NPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 458 ANPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEYDAYAALPPDWA---SA 514
Query: 637 CMLSCSS 643
+SCSS
Sbjct: 515 LEVSCSS 521
>gi|291617260|ref|YP_003520002.1| hypothetical protein PANA_1707 [Pantoea ananatis LMG 20103]
gi|291152290|gb|ADD76874.1| YdiU [Pantoea ananatis LMG 20103]
Length = 492
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 214/544 (39%), Positives = 301/544 (55%), Gaps = 67/544 (12%)
Query: 103 EDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDP 162
E + +D+S+ RELPG YT ++P+ + +L+ + +A ++ LD
Sbjct: 13 ELMIFDNSWFRELPG--------------SYTALNPTP-LAGGRLLYHNAPLAKAMALDS 57
Query: 163 KEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWE 222
F +++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE R +
Sbjct: 58 ALFSGQGHGVWY-GAALLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLEDGRRLD 116
Query: 223 LQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFY 282
LKGAG TPYSR DG AV+RS++REFL SEA+H LGIPTTRAL L + + V R+
Sbjct: 117 WHLKGAGLTPYSRMGDGRAVVRSTVREFLASEALHHLGIPTTRALTLAVSDEPVYRE--- 173
Query: 283 DGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKS 342
E GA++ R+A S LRFG ++ H Q+ + V+ LADYAIRHH+ +
Sbjct: 174 ----TAERGAMLMRIAPSHLRFGHFE-HFFYSQQP-EQVKQLADYAIRHHWPQL------ 221
Query: 343 ESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGL 402
VD +++Y W ++ RTA L+AQWQ VGF HGV+NTDNMSILGL
Sbjct: 222 --------------VD-EADRYQLWFADIVLRTARLIAQWQSVGFAHGVMNTDNMSILGL 266
Query: 403 TIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA 462
T+DYGP+GFLD + P + N +D G RY F NQP IGLWN+ + + L+ L+ ++
Sbjct: 267 TLDYGPYGFLDDYQPDYICNHSDYQG-RYSFENQPMIGLWNLNRLAHALSG--LMSTEQL 323
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNV 519
+ Y M + M KLGL + +I+++LL+ M+ ++ DYT FR LS+
Sbjct: 324 KQALSGYENALMRVWGERMRAKLGLLTADAGDNEILTELLSLMSQERSDYTLTFRLLSDT 383
Query: 520 KADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNP 579
+ + E PL+ +D + A+ W Y Q LL + D ER+ +M + NP
Sbjct: 384 Q------QAESRSPLRDEFID-----RSAFDRWYQRYRQRLLQEQVGDAERQQVMKAANP 432
Query: 580 KYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCML 639
VLRNYL Q ID AE G+ G + RL + +++P+ + E Y + PP W +
Sbjct: 433 AVVLRNYLAQQVIDEAEKGESGALARLHQALQQPFSDAAAAE-YRQRPPDWG---KTLEV 488
Query: 640 SCSS 643
SCSS
Sbjct: 489 SCSS 492
>gi|411011640|ref|ZP_11387969.1| hypothetical protein AaquA_18156 [Aeromonas aquariorum AAK1]
Length = 475
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 207/468 (44%), Positives = 259/468 (55%), Gaps = 53/468 (11%)
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
PL G P AQ Y GHQFG ++ +LGDGRA+ LGE+L RW+L LKGAGKTP+SRF D
Sbjct: 58 PLPGMQPVAQVYAGHQFGGYSPRLGDGRALLLGELLAPDDSRWDLHLKGAGKTPFSRFGD 117
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRSSIRE+L SEA+H LGIPTTRAL LV + + V R+ + E GA V R A
Sbjct: 118 GRAVLRSSIREYLASEALHALGIPTTRALVLVGSQEPVYRE-------QVETGATVLRTA 170
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
S LRFG + A GQ + + L DYA+RHHF+ + N
Sbjct: 171 PSHLRFGHVEYFAWSGQG--EKIPALIDYALRHHFQELANG------------------- 209
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
A EV RTA L+A+WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P
Sbjct: 210 ------AELFAEVVRRTARLIAKWQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPD 263
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N +D PG RY QP +G WN+ + + LA +D + +Y + M Y
Sbjct: 264 FVCNHSD-PGGRYALDQQPAVGYWNLQKLAQALAGH--VDGDALASALAQYEHQLMLHYS 320
Query: 479 AIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
+M KLGL + + + +L +A KVDY F R L + A +D+ L
Sbjct: 321 ELMRAKLGLEVWEDDDPALFRELFRLLAAHKVDYHLFLRRLGELTA-----QDDWPASLL 375
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
A+L D AW W+ +Y L G D RK M +VNPKYVLRN L Q I+AA
Sbjct: 376 ALLPD-----PAAWQGWLEAYRARLAREGSEDAVRKGQMGAVNPKYVLRNALAQRVIEAA 430
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GD RL ++ PYDEQP E A PAW Y G LSCSS
Sbjct: 431 EQGDMAPFERLFTALQHPYDEQPEYEDLATPSPAW-YCGG--ELSCSS 475
>gi|405975916|gb|EKC40447.1| Selenoprotein O [Crassostrea gigas]
Length = 636
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 189/436 (43%), Positives = 258/436 (59%), Gaps = 41/436 (9%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
+LE LN+D+ +R LP D ++ R+V AC++KV P+ V NPQLVA S S +++
Sbjct: 5 SLESLNFDNLVLRSLPIDSEEENYIRQVSGACFSKVKPTP-VSNPQLVAASLSALSLIDI 63
Query: 161 DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER 220
DPK+ ER DF FFSG L G+ A CY GHQFG ++GQLGDG A+ LGEI+N R
Sbjct: 64 DPKQVERADFAEFFSGNKLLPGSETAAHCYCGHQFGYFSGQLGDGAAMYLGEIVNKSGTR 123
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
WE+QLKG+G TP+SR ADG VLRS+IREFLCSEA+H LGIPTTRA VT+ V RD+
Sbjct: 124 WEIQLKGSGLTPFSRSADGRKVLRSTIREFLCSEAIHHLGIPTTRAGSCVTSDSRVVRDI 183
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQED---------LDIVRTLADYAIRH 331
FYDG+P +E +IV R+A +FLRFGS++I + E DI++ + DY ++
Sbjct: 184 FYDGHPIQERCSIVLRIAPTFLRFGSFEIFKATDSETGRTGPSVGRNDILKQMLDYTVQT 243
Query: 332 HFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGV 391
+ I + ++ Y + E+ RTA LVA WQ VG+ HGV
Sbjct: 244 FYPEIWQAHSADK----------------ETAYVEFFKELTRRTARLVADWQSVGWCHGV 287
Query: 392 LNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL 451
LNTDNMSI+G+TIDYGPFGF+D +DP F N +D G RY + QP I WNI +F+ +
Sbjct: 288 LNTDNMSIVGVTIDYGPFGFMDKYDPDFICNASD-DGGRYTYIKQPQICKWNIKKFAEAI 346
Query: 452 AA----AKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMA 503
AK + + + + ++ D Y M KK GL + + ++ L+ +
Sbjct: 347 QGVVPLAKTVPETKI------FDEEYSDYYTKKMRKKFGLINTIEEQDGDLVGSFLDTLH 400
Query: 504 VDKVDYTNFFRALSNV 519
D+TN FR LS +
Sbjct: 401 KTGADFTNCFRCLSRL 416
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 55/84 (65%), Gaps = 7/84 (8%)
Query: 540 DIGKERKEAWISWVLSYIQELLSSG-------ISDEERKALMNSVNPKYVLRNYLCQSAI 592
D KE + W +W+ +Y++ L +++ RK +MN NP+++LRNY+ Q+AI
Sbjct: 502 DKKKENQAMWTAWLKTYVERLKKEADKVTDLTAANQRRKEVMNMTNPRFILRNYIAQNAI 561
Query: 593 DAAELGDFGEVRRLLKLMERPYDE 616
DAAE GDF EVRR+L++++ PY E
Sbjct: 562 DAAEKGDFSEVRRVLEILQTPYSE 585
>gi|264679099|ref|YP_003279006.1| hypothetical protein CtCNB1_2964 [Comamonas testosteroni CNB-2]
gi|262209612|gb|ACY33710.1| hypothetical conserved protein [Comamonas testosteroni CNB-2]
Length = 511
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 220/529 (41%), Positives = 294/529 (55%), Gaps = 62/529 (11%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSG----ATPLAGAVPY 186
A +T + P+ V P +A S S A + L+ + + SG G+ P
Sbjct: 29 AFFTYLQPT-PVPEPHWIAASVSTARWMGLNTEWLHSAEVLQILSGNAVSGHGKGGSKPL 87
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSER-WELQLKGAGKTPYSRFADGLAVLRS 245
A Y GHQFG+WAGQLGDGRAI LGE +ER +E+QLKGAG+TPYSR DG AVLRS
Sbjct: 88 ATVYSGHQFGVWAGQLGDGRAILLGE-----TERGFEVQLKGAGRTPYSRMGDGRAVLRS 142
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIREFLCSEAM LGIPTTRAL L + V R+ E A+V RVA+SF+RFG
Sbjct: 143 SIREFLCSEAMAALGIPTTRALALTGSPLPVARETM-------ETAAVVTRVAESFIRFG 195
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ A+R + ++ LAD I H+ E + V L N YA
Sbjct: 196 HFEHFAARDMQTE--LKALADLVIDQHY-----------------PECRTAVALNGNPYA 236
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
+ V+ERTA L+AQWQGVGF HGV+NTDNMSILGLTIDYGPF FLDAFDP N +D
Sbjct: 237 NFLQAVSERTARLMAQWQGVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFDPGHICNHSD 296
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY-VMERYGTKFMDEYQAIMTKK 484
G RY F QP + WN+ + A LI D+E +E Y T F Y M K
Sbjct: 297 SQG-RYAFNRQPQVAYWNL--YCLGQALLPLIGDEELTIAALESYKTVFPAAYARQMLSK 353
Query: 485 LGLPKYNKQ----------IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
LGLP+ +++ LL +A +KVDYT FF L++ A + + PL
Sbjct: 354 LGLPENETGTSATEGRFALLVNPLLQILADNKVDYTIFFSRLTDAVAQRQETKID-FEPL 412
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ ++LD + ++ +W L+Y ++L + + + LM NP++VLRN+L ++ I A
Sbjct: 413 RDIILD-----RASFDAWSLTYSEQL--AQVEKAQTVDLMQKSNPRFVLRNHLGETVIRA 465
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A+ GDF V+++L +++ PYD P +A PP WA +SCSS
Sbjct: 466 AQAGDFAPVQQMLAVLQTPYDSHPDHADWAGFPPDWA---SSIEISCSS 511
>gi|311105402|ref|YP_003978255.1| hypothetical protein AXYL_02217 [Achromobacter xylosoxidans A8]
gi|310760091|gb|ADP15540.1| hypothetical protein AXYL_02217 [Achromobacter xylosoxidans A8]
Length = 495
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 215/517 (41%), Positives = 293/517 (56%), Gaps = 46/517 (8%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A Y+++ P A + NP+L+ + A+ + LDP P+F FSGA PL G A Y
Sbjct: 21 AFYSRLEPQA-LNNPRLLHGNAQAAELIGLDPSALSTPEFLSVFSGAQPLPGGDTLAAVY 79
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGE+ + WELQLKG+G TPYSR DG AVLRSS+RE+
Sbjct: 80 SGHQFGVWAGQLGDGRAHLLGEVEGPQGN-WELQLKGSGMTPYSRMGDGRAVLRSSVREY 138
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L EAMH LG+PTTRAL LV + V R+ E AIV R++ SF+RFGS++
Sbjct: 139 LAGEAMHGLGVPTTRALALVVSDDPVMRETV-------ETAAIVTRMSPSFVRFGSFEHW 191
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+SR Q D+ ++TLADY I ++ E + G+ + V Y
Sbjct: 192 SSRRQPDM--LKTLADYVIDRYY--------PECRATGAGEVSNDVA-----PYVNLLRA 236
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF+D F N +D G R
Sbjct: 237 VTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGFMDGFRLGHICNHSDSEG-R 295
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP + LWN+ + +L A L+ D E+ V++ + F + M KLGL
Sbjct: 296 YSWNRQPSVALWNLYRLGGSLHA--LVQDVESLRAVLDEFEAVFTRAFHDRMGAKLGLAA 353
Query: 490 YN---KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ + ++ LL M ++ D+T +R L++ L +A D+ +R
Sbjct: 354 WQPADEALLDDLLKLMDANQADFTLTWRRLADA-----------LSGQRAAFADLFIDRP 402
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
A +W+ ++ G ++ A MN VNP YVLRN+L + AI AA+ GD GE+ L
Sbjct: 403 AAG-AWLDRLVERHAQDGRPVQDVTAGMNRVNPLYVLRNHLAEEAIRAAKSGDAGEIDTL 461
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+KL+ PY+ QPG E+YA LPP WA G +SCSS
Sbjct: 462 MKLLRNPYEAQPGHERYAALPPDWA---GSLEVSCSS 495
>gi|307131497|ref|YP_003883513.1| hypothetical protein Dda3937_03652 [Dickeya dadantii 3937]
gi|306529026|gb|ADM98956.1| conserved protein [Dickeya dadantii 3937]
Length = 483
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 207/516 (40%), Positives = 287/516 (55%), Gaps = 56/516 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT+++P+ ++ +L+ + ++A L L F+ D ++G L G VP AQ Y G
Sbjct: 19 YTELTPTP-LQGARLLYHNATLAQELGLSEDWFD-GDNSRIWAGEQLLLGMVPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLG--EILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
HQFG+WAGQLGDGR I LG ++ + +++ W LKGAG TPYSR DG AVLRS +REF
Sbjct: 77 HQFGVWAGQLGDGRGILLGQQQLADGRTQDW--HLKGAGLTPYSRMGDGRAVLRSVVREF 134
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEA+H LGIPTTRAL +V++ V R+ +EE GA++ RVA S +RFG ++
Sbjct: 135 LASEALHHLGIPTTRALTIVSSDHPVRRE-------QEERGAMLLRVADSHVRFGHFEHF 187
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
R + + VR LA+Y I H+ + +++Y W +
Sbjct: 188 YYR--REPEKVRQLAEYVIACHWPQWQQ---------------------ETDRYYLWFSD 224
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V ERTA L+A WQ VGF HGV+NTDNMSILGLTIDYGP+GF+D + P + N +D G R
Sbjct: 225 VVERTARLLAHWQAVGFAHGVMNTDNMSILGLTIDYGPYGFMDDYQPGYICNHSDHQG-R 283
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F NQP + LWN+ + + +L+ L+ ++RY M + +M KLG
Sbjct: 284 YAFDNQPAVALWNLHRLAQSLSG--LMSSDILQRALDRYEPALMQRFGELMRAKLGFDTP 341
Query: 491 NKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
Q ++ LL M + DYT+ FR LS + S PL+ V +D +
Sbjct: 342 QTQDNTLLVALLKLMQREPADYTHIFRLLSETERHSSHS------PLQDVFID-----RP 390
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
A+ W +Y Q L +SD ER+ M NP+YVLRNYL Q AI+ AE D G + RL
Sbjct: 391 AFDGWFSAYRQRLALENVSDAERQRRMKQSNPRYVLRNYLAQQAIEQAEREDVGLLGRLH 450
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + +PY +QP M A LPP W +SCSS
Sbjct: 451 QALRQPYADQPDMADLAALPPTWGKH---LEISCSS 483
>gi|333915082|ref|YP_004488814.1| hypothetical protein DelCs14_3467 [Delftia sp. Cs1-4]
gi|333745282|gb|AEF90459.1| UPF0061 protein ydiU [Delftia sp. Cs1-4]
Length = 510
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 219/522 (41%), Positives = 288/522 (55%), Gaps = 56/522 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T + P+ + P +A S A+ L LDP+ + +G L G+ P A Y G
Sbjct: 34 FTHLRPT-PLPEPHWIATSTGTAELLGLDPQWLASDEALQALTGNAVLPGSHPLASVYSG 92
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI LGE + E+QLKGAG+TPYSR DG AVLRSSIREFLC
Sbjct: 93 HQFGVWAGQLGDGRAILLGE----TASGHEIQLKGAGRTPYSRMGDGRAVLRSSIREFLC 148
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + + R+ + E A+V RVA SF+RFG ++ A+
Sbjct: 149 SEAMHALGIPTTRALSLTGSPAPIRRE-------EIETAAVVARVAPSFIRFGHFEHFAA 201
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R Q + +R LADY I ++ E + L N YA + V+
Sbjct: 202 RDQ--IAPLRQLADYVIDRYY-----------------PECRTAEALAGNAYANFLQAVS 242
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+A WQ VGF HGV+NTDNMSILGLTIDYGPF FLDAF+P N +D G RY
Sbjct: 243 ERTARLLAHWQAVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFNPGHICNHSDTQG-RYA 301
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANY-VMERYGTKFMDEYQAIMTKKLGLPK-- 489
F QP + WN+ + A LI ++E +E Y F Y A+M +KLGLP+
Sbjct: 302 FNRQPQVAYWNL--YCLGQALLPLIGEEELTIAALESYKQVFPQAYGALMLRKLGLPEDA 359
Query: 490 --------YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
+++ LL MA + VDYT FF L++ A D L P++ ++LD
Sbjct: 360 PGTPPAEGRFAALVNPLLQLMADNAVDYTIFFSRLTDAVAA-GAGTDLDLEPVRDLVLD- 417
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+EA+ W Y L +G ALM NP++VLRN+L + I AA+ GDF
Sbjct: 418 ----REAFDRWAALYAPHL--AGTDAAAAAALMQESNPRFVLRNHLGEMTIRAAKAGDFA 471
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
VR+LL +++ P+D ++A PP WA +SCSS
Sbjct: 472 PVRQLLAVLQTPFDPHAEHAEWAGFPPDWA---SSIEISCSS 510
>gi|254200039|ref|ZP_04906405.1| conserved hypothetical protein [Burkholderia mallei FMH]
gi|254206374|ref|ZP_04912726.1| conserved hypothetical protein [Burkholderia mallei JHU]
gi|121957753|sp|Q62JM7.2|Y1440_BURMA RecName: Full=UPF0061 protein BMA1440
gi|147749635|gb|EDK56709.1| conserved hypothetical protein [Burkholderia mallei FMH]
gi|147753817|gb|EDK60882.1| conserved hypothetical protein [Burkholderia mallei JHU]
Length = 521
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 224/538 (41%), Positives = 293/538 (54%), Gaps = 71/538 (13%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT----PLAGAV 184
L A + P+A + P +V +S+ A L L+P + P F F G P A ++
Sbjct: 32 LGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNPTRDWPQA-SL 90
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYSR DG AVLR
Sbjct: 91 PYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYSRMGDGRAVLR 149
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RVAQSF+RF
Sbjct: 150 SSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVVTRVAQSFVRF 202
Query: 305 GSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNK 363
G ++ A+ E L R LAD+ I E + D D +
Sbjct: 203 GHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD--------DP 238
Query: 364 YAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNT 423
Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+DAFD N
Sbjct: 239 YLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFIDAFDAKHVCNH 298
Query: 424 TDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMER 468
+D G RY + QP I WN + L A + ++D A+ V+ R
Sbjct: 299 SDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVED--AHAVLGR 355
Query: 469 YGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
+ +F + + KLGL + + + ++LL M D+T FR L+ V +
Sbjct: 356 FPEQFGPALERAIRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRHLARVSKHDAR 415
Query: 526 PEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRN 585
+ P++ + +D ++A+ W Y L D R A MN VNPKYVLRN
Sbjct: 416 GD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMNRVNPKYVLRN 466
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA +SCSS
Sbjct: 467 HLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---STLEVSCSS 521
>gi|297460434|ref|XP_002701071.1| PREDICTED: UPF0061 protein Fjoh_2793 [Bos taurus]
Length = 573
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 203/504 (40%), Positives = 293/504 (58%), Gaps = 41/504 (8%)
Query: 109 HSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV-ADSLELDPKEFER 167
+ + LP DP ++ R+V + ++ P+ +LVA S+ V D L+LD E
Sbjct: 99 ENLIAVLPTDPVKENYVRKVKNCVFSIAFPTPFQSRVRLVAVSKEVLEDILDLDLSVSET 158
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
DF SG + G++P A YGGHQFG+WA QLGDGRA +G +N + E+WELQLKG
Sbjct: 159 DDFIQLVSGGKIVFGSIPLAHRYGGHQFGIWADQLGDGRAHLIGIYMNRQGEKWELQLKG 218
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
+GKTPYSR DG A+LRSS+REFLCSEAMH+LGIPT+RA LV + V RD FY+GN
Sbjct: 219 SGKTPYSRNGDGRAILRSSLREFLCSEAMHYLGIPTSRAASLVVSDDVVWRDQFYNGNLT 278
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
+E GA+V RVA+S+ R GS +I G+ LD++R L D+ I+ +F
Sbjct: 279 KERGAVVLRVAKSWFRIGSLEILTHSGE--LDLLRMLLDFIIQEYF-------------- 322
Query: 348 STGDEDHSVVDLTS-NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDY 406
+VD+ N+Y + V TA L+A W VGF GV NTDN S+L +TIDY
Sbjct: 323 -------PLVDVKEPNRYVDFFSIVVFETAQLIALWMSVGFARGVCNTDNFSLLSITIDY 375
Query: 407 GPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYV- 465
GPFGF++A++P F PNT+D RRY NQ +IG++N+ + L L++ ++ V
Sbjct: 376 GPFGFMEAYNPDFVPNTSD-DERRYKIGNQANIGMFNLNKLLQALNP--LLNPRQKQLVT 432
Query: 466 --MERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
++ Y + ++ + KLGL + + +I+ LL+ M + D+T FR LS +
Sbjct: 433 QILKEYPVLYYTRFRELFKAKLGLLGKSEGDDDLIAFLLHLMEKTEADFTMTFRQLSEIT 492
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERK--EAWISWVLSYIQELLSSGISDEERKALMNSVN 578
EL++P + L + + K AW+S LS ++ +S SD ER+ M +VN
Sbjct: 493 QSQL---QELVIPQEFWALKMISKHKLFPAWVSQYLSRLKSNISD--SDSERRKRMTAVN 547
Query: 579 PKYVLRNYLCQSAIDAAELGDFGE 602
P+YVL+N++ +SA+ AE DF E
Sbjct: 548 PRYVLKNWMAESAVQKAERNDFSE 571
>gi|357631780|gb|EHJ79249.1| hypothetical protein KGM_15660 [Danaus plexippus]
Length = 529
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 210/530 (39%), Positives = 294/530 (55%), Gaps = 40/530 (7%)
Query: 123 SIPREVLHACYTKVSPSAEVENPQLVAWS-ESVADSLELDPKEFERPDFPLFFSGATPLA 181
+IPR V A + KV LV S +++ D L+LDP E +F F +G
Sbjct: 31 NIPRAVKDAVFVKVPTEPLTGKIDLVCVSNDALTDILDLDPVVAESEEFVEFINGKYLPQ 90
Query: 182 GAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLA 241
GA+ YGG+QFG WA QLGDGRA LGE +N K E W+LQLKG+G+TP+SRF DG A
Sbjct: 91 GALSVCHGYGGYQFGFWADQLGDGRAHILGEYVNSKGELWQLQLKGSGETPFSRFGDGRA 150
Query: 242 VLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKF-VTRDMFYDGNPKEEPGAIVCRVAQS 300
VLRSS+RE + SEA H LGIPTTRA LV + V RD Y G + E A++ R+A S
Sbjct: 151 VLRSSLREMVASEACHHLGIPTTRAAGLVASDSHKVLRDRSYSGLARPERAAVLLRLAPS 210
Query: 301 FLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLT 360
++R GS+++ R Q D+ + LAD+ I+H F HI+ +K
Sbjct: 211 WMRIGSFELMHRRQQTDMLV--ELADHVIKHFFSHIDLNDK------------------- 249
Query: 361 SNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFT 420
+KY + EVA + +VA WQG+GFTHGVLNTDN+SILGLTIDYGPFGF++ + ++
Sbjct: 250 -DKYVKFFTEVAHKNLDMVATWQGLGFTHGVLNTDNISILGLTIDYGPFGFIEHYYENYV 308
Query: 421 PNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDD--KEANYVMERYGTKFMDEYQ 478
PN++D G RY F QP+I LWN+ + + L L D+ K+ V++ D+
Sbjct: 309 PNSSDDMG-RYAFNKQPEILLWNLGKLAEALQLI-LCDESKKKIKDVIDTLELYVKDKIL 366
Query: 479 AIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
KLGL + K +++ L M D+T FR +S + + + ++ L K
Sbjct: 367 HTYILKLGLTEVRKGDDKLVKDFLEMMQQTSSDFTGSFRQISEISLNQLLDKETL--ESK 424
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
L + K + W W+ Y ++++ER M VNP YV RN++ Q AI A
Sbjct: 425 WALARLSKSKN--WDKWIQRYKDRCCQENVNEDERVKHMLKVNPLYVPRNWMLQEAIKDA 482
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEK--YARLPPAWAYRPGVCMLSCSS 643
E DF +VR LL++ +PY+ EK Y+ PP+W++ LSCSS
Sbjct: 483 ENNDFNKVRLLLEIFTKPYEANEEAEKLGYSSQPPSWSFG---LKLSCSS 529
>gi|53723639|ref|YP_103092.1| hypothetical protein BMA1440 [Burkholderia mallei ATCC 23344]
gi|67642000|ref|ZP_00440763.1| conserved hypothetical protein [Burkholderia mallei GB8 horse 4]
gi|52427062|gb|AAU47655.1| conserved hypothetical protein [Burkholderia mallei ATCC 23344]
gi|238523041|gb|EEP86482.1| conserved hypothetical protein [Burkholderia mallei GB8 horse 4]
Length = 525
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 224/538 (41%), Positives = 293/538 (54%), Gaps = 71/538 (13%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT----PLAGAV 184
L A + P+A + P +V +S+ A L L+P + P F F G P A ++
Sbjct: 36 LGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNPTRDWPQA-SL 94
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYSR DG AVLR
Sbjct: 95 PYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYSRMGDGRAVLR 153
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V RVAQSF+RF
Sbjct: 154 SSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVVTRVAQSFVRF 206
Query: 305 GSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNK 363
G ++ A+ E L R LAD+ I E + D D +
Sbjct: 207 GHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD--------DP 242
Query: 364 YAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNT 423
Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+DAFD N
Sbjct: 243 YLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFIDAFDAKHVCNH 302
Query: 424 TDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMER 468
+D G RY + QP I WN + L A + ++D A+ V+ R
Sbjct: 303 SDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVED--AHAVLGR 359
Query: 469 YGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
+ +F + + KLGL + + + ++LL M D+T FR L+ V +
Sbjct: 360 FPEQFGPALERAIRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRHLARVSKHDAR 419
Query: 526 PEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRN 585
+ P++ + +D ++A+ W Y L D R A MN VNPKYVLRN
Sbjct: 420 GD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMNRVNPKYVLRN 470
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA +SCSS
Sbjct: 471 HLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALPPDWA---STLEVSCSS 525
>gi|167845290|ref|ZP_02470798.1| hypothetical protein BpseB_08373 [Burkholderia pseudomallei B7210]
gi|403519027|ref|YP_006653160.1| hypothetical protein BPC006_I2379 [Burkholderia pseudomallei
BPC006]
gi|403074669|gb|AFR16249.1| hypothetical protein BPC006_I2379 [Burkholderia pseudomallei
BPC006]
Length = 525
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 227/548 (41%), Positives = 297/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 28 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 85
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 86 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 143
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 144 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 196
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 197 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 240
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 241 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 292
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 293 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 351
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR
Sbjct: 352 D--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 409
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 410 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMN 460
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQ + YA LPP WA
Sbjct: 461 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQLEHDAYAALPPDWA---S 517
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 518 TLEVSCSS 525
>gi|126454265|ref|YP_001066600.1| hypothetical protein BURPS1106A_2336 [Burkholderia pseudomallei
1106a]
gi|242316314|ref|ZP_04815330.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b]
gi|166227720|sp|A3NW79.1|Y2336_BURP0 RecName: Full=UPF0061 protein BURPS1106A_2336
gi|126227907|gb|ABN91447.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a]
gi|242139553|gb|EES25955.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b]
Length = 521
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 227/548 (41%), Positives = 297/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 24 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 81
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 82 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 139
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 140 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 192
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 193 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 236
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 237 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 288
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 289 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 347
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR
Sbjct: 348 D--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 405
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 406 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMN 456
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQ + YA LPP WA
Sbjct: 457 RVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQLEHDAYAALPPDWA---S 513
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 514 TLEVSCSS 521
>gi|300716471|ref|YP_003741274.1| hypothetical protein EbC_18930 [Erwinia billingiae Eb661]
gi|299062307|emb|CAX59424.1| conserved uncharacterized protein YdiU [Erwinia billingiae Eb661]
Length = 479
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 215/518 (41%), Positives = 288/518 (55%), Gaps = 52/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT ++P+ ++NP+L+ S +A L LD F D +SG + L G P AQ
Sbjct: 11 LEGFYTALTPTP-LKNPRLLYHSAGLAAELGLDDSWFA-ADKIGIWSGESLLPGMQPLAQ 68
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR DG AVLRSS+R
Sbjct: 69 VYSGHQFGVWAGQLGDGRGILLGEQRLEDGRKMDWHLKGAGLTPYSRMGDGRAVLRSSLR 128
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EFL SEAM+ LG+PT+RAL +VT+ + V R+ E GA++ RVA+S LRFG ++
Sbjct: 129 EFLASEAMYHLGVPTSRALTVVTSDEPVYRE-------TTERGAMLLRVAESHLRFGHFE 181
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
H Q+ + VR LADYAIRHH+ + DE+ ++Y W
Sbjct: 182 -HFFYNQQP-EKVRELADYAIRHHWPQWQ-------------DEE--------DRYRLWF 218
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P F N +D G
Sbjct: 219 TDVVRRTARLIAHWQSVGFAHGVMNTDNMSILGLTLDYGPYGFLDDYKPDFICNHSDYQG 278
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F NQP +GLWN+ + + L+ L+ ++ + Y + M + M KLG
Sbjct: 279 -RYSFENQPVVGLWNLNRLAHALSG--LMTTEQLKQALAEYEPELMRCWGQQMRAKLGFT 335
Query: 489 ---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K++ I++ LL M + DYT FR LS+ S PL+ +D
Sbjct: 336 TQGKHDNDILTGLLALMTKEGSDYTWTFRQLSDSVQQGSTS------PLRDEFID----- 384
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+EA+ SW + Q +L SDE+R+ M NP VLRNYL Q AI+ AE D + R
Sbjct: 385 REAFDSWYNIWRQRVLEEERSDEDRQQQMKQANPAIVLRNYLAQQAIEQAEKDDISVLSR 444
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + + +PY + P + PP W + V SCSS
Sbjct: 445 LHQALSQPYADAPEFADLMQRPPDWGKKLEV---SCSS 479
>gi|221066306|ref|ZP_03542411.1| protein of unknown function UPF0061 [Comamonas testosteroni KF-1]
gi|220711329|gb|EED66697.1| protein of unknown function UPF0061 [Comamonas testosteroni KF-1]
Length = 511
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 221/529 (41%), Positives = 292/529 (55%), Gaps = 62/529 (11%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT----PLAGAVPY 186
A +T + P+ V PQ +A S A ++LDP+ + SG G+ P
Sbjct: 29 AFFTYLQPT-PVPEPQWIATSTCAARWMDLDPEWLHSAEALQILSGNAVSDQGSGGSKPL 87
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRAI LGE + +E+QLKGAG+TPYSR DG AVLRSS
Sbjct: 88 ATVYSGHQFGVWAGQLGDGRAILLGE----TEQGFEIQLKGAGRTPYSRMGDGRAVLRSS 143
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAM LGIPTTRAL L + V R+ E A+V RVA+SF+RFG
Sbjct: 144 IREFLCSEAMAALGIPTTRALALTGSPLPVARETM-------ETAAVVTRVAESFIRFGH 196
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ A+R + +R LAD I H+ E + L N YA
Sbjct: 197 FEHFAARDMQAE--LRALADLVIDQHY-----------------PECRTATALNGNHYAN 237
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
V+ERTA L+A+WQGVGF HGV+NTDNMSILGLTIDYGPF FLDAFDP N +D
Sbjct: 238 LLQAVSERTAQLLARWQGVGFCHGVMNTDNMSILGLTIDYGPFQFLDAFDPGHICNHSDS 297
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY-VMERYGTKFMDEYQAIMTKKL 485
G RY F QP + WN+ + A LI D+E +E Y T F Y M KL
Sbjct: 298 QG-RYAFNRQPQVAYWNL--YCLGQALLPLIGDEELTIAALESYKTVFPAAYARQMLAKL 354
Query: 486 GLPKYNKQ----------IISKLLNNMAVDKVDYTNFFRALSN-VKADPSIPEDELLVPL 534
GLP+ +++ LL +A +KVDYT FF L++ V + P D PL
Sbjct: 355 GLPENEAGTPATEGRFALLVNPLLQILADNKVDYTIFFSRLTDAVAQGQARPID--FEPL 412
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ ++LD + ++ +W L+Y ++L + + + ALM NP++VLRN+L ++ I A
Sbjct: 413 RDIILD-----RASFDAWSLTYSEQL--AQVDRVQAMALMQESNPRFVLRNHLGETVIRA 465
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A GDF V+++L +++ P D P +A PP WA +SCSS
Sbjct: 466 ARDGDFAPVQQMLAVLQAPCDSHPDHADWAGFPPDWA---SSIEISCSS 511
>gi|365137811|ref|ZP_09344521.1| UPF0061 protein [Klebsiella sp. 4_1_44FAA]
gi|363655703|gb|EHL94510.1| UPF0061 protein [Klebsiella sp. 4_1_44FAA]
Length = 480
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVKQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVTRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|156934274|ref|YP_001438190.1| hypothetical protein ESA_02105 [Cronobacter sakazakii ATCC BAA-894]
gi|259646584|sp|A7MNZ6.1|Y2105_ENTS8 RecName: Full=UPF0061 protein ESA_02105
gi|156532528|gb|ABU77354.1| hypothetical protein ESA_02105 [Cronobacter sakazakii ATCC BAA-894]
Length = 482
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 215/528 (40%), Positives = 289/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L+ + +A +LEL F+ + G T
Sbjct: 5 PRFIATWRDELPGFYTELTPTP-LNNSRLLCHNAPLAQALELPETLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGCKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 GRAVLRSTVREFLASEAMHGLGIPTTRALTIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYR--REPERVRELAQYVIEHHFAHLAQ------------EED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVTRTAQLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPALLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSETEQHSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGEEDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+ ++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|237731281|ref|ZP_04561762.1| ydiU [Citrobacter sp. 30_2]
gi|226906820|gb|EEH92738.1| ydiU [Citrobacter sp. 30_2]
Length = 480
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 291/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ ++N +L+ ++++A+ L + F+ P + G + L G P
Sbjct: 10 RDELPATYTALSPTP-LKNARLIWHNDALAEQLAIPAALFDIPTGAGVWGGESLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE ++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGSTFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAM++LGIPTTRAL +VT+ V R+ E GA++ RVAQS +RFG
Sbjct: 129 TIRESLASEAMYYLGIPTTRALSIVTSDTPVYRETV-------EAGAMLIRVAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ + ++KY
Sbjct: 182 HFEHFYYR--REPEKVRELADFAIRHYWPQWQE---------------------EADKYQ 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD + P + N +D
Sbjct: 219 LWFNDVVTRTATLIADWQAVGFAHGVMNTDNMSILGLTMDYGPFGFLDDYVPDYICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP LWN+ + + TL + I + N ++ Y + Y M +KL
Sbjct: 279 NQG-RYSFDNQPAAALWNLQRLAQTL--SPFIPVEVLNDALDSYQLALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + +++S+L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 336 GFFSEQKDDNELLSELFSLMARERSDYTRTFRMLSETE------QHSAQSPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D R+ M +VNP VLRN+L Q AI AE GD+ E
Sbjct: 388 ---RAAFDDWFTRYRSRLQQDNIADAVRQTQMKAVNPAMVLRNWLAQRAISQAEQGDYAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHQALRTPFIDRD--DDYASRPPDWGKRLEV---SCSS 480
>gi|288934900|ref|YP_003438959.1| hypothetical protein Kvar_2027 [Klebsiella variicola At-22]
gi|288889609|gb|ADC57927.1| protein of unknown function UPF0061 [Klebsiella variicola At-22]
Length = 480
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 282/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPENGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ + R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPIYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDVVTRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDDYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|170768769|ref|ZP_02903222.1| conserved hypothetical protein [Escherichia albertii TW07627]
gi|170122317|gb|EDS91248.1| conserved hypothetical protein [Escherichia albertii TW07627]
Length = 478
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 213/521 (40%), Positives = 290/521 (55%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ + N +L+ + +A++L++ FE + + G L G P
Sbjct: 10 RDELPATYTALSPTP-LNNARLIWHNAELANTLDIPSSLFE--NGAGVWGGEALLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFGIWAGQLGDGRGILLGEQQLADGSTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ E GA++ RVAQS LRFG
Sbjct: 127 TIRESLASEAMHHLGIPTTRALSIVTSDTPVYRETV-------ESGAMLMRVAQSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR D+AIRH++ H+ N DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQWTDFAIRHYWPHLLN------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+A+WQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++ F N +D
Sbjct: 217 LWFTDVVARTASLIARWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYESGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFIAVD--ALNEALDSYQQVLLSHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTAQKEDNTLLNELFSLMARERSDYTRTFRMLSQTEQRSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ +W Y L + D ER+ M +VNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDNWFARYRARLQQDDVGDSERRQRMLNVNPALVLRNWLAQRAIEAAEQGDMTE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W + V SCSS
Sbjct: 443 LHRLHEALRNPFSDRD--DDYVCRPPDWGKQLEV---SCSS 478
>gi|167562434|ref|ZP_02355350.1| hypothetical protein BoklE_07719 [Burkholderia oklahomensis EO147]
Length = 521
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 224/547 (40%), Positives = 295/547 (53%), Gaps = 71/547 (12%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L + P+A + P +V +S A L LDP + P F F G
Sbjct: 24 PRDDAFLQ--LGTAFLTRLPAAPLPAPYVVGFSGEAARMLGLDPALRDAPGFAELFCG-N 80
Query: 179 PLAG----AVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P ++PYA Y GHQFG+WAGQLGDGRA+T+GEI + R+ELQLKGAG+TPYS
Sbjct: 81 PTRDWQPTSLPYASVYSGHQFGVWAGQLGDGRALTIGEIEH-GGRRYELQLKGAGRTPYS 139
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSS+REFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 140 RMGDGRAVLRSSVREFLCSEAMHHLGIPTTRALAVIGSDQPVIREAI-------ETSAVV 192
Query: 295 CRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RVA+SF+RFG ++ + + DL +R LAD+ I + S D D
Sbjct: 193 TRVAESFVRFGHFEHFFANDRPDL--LRALADHVIDRFYP-------------SCRDAD- 236
Query: 355 SVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDA 414
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGFLDA
Sbjct: 237 -------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFLDA 289
Query: 415 FDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDD 459
FD N +D G RY + QP I WN + L A + ++D
Sbjct: 290 FDAKHICNHSDTHG-RYAYRMQPRIAHWNCFCLAQALLPLFGLHRDAPNEDARAERAVED 348
Query: 460 KEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRAL 516
A V+ R+ +F + M KLGL + + + ++LL M D+T FR L
Sbjct: 349 AHA--VLGRFPEQFGPALERAMRAKLGLELEREGDAALANQLLEIMDASHADFTLTFRRL 406
Query: 517 SNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNS 576
++V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 407 AHVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSDEARDDATRAAAMNR 457
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGV 636
NPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LPP WA
Sbjct: 458 ANPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEYDAYAALPPDWA---SA 514
Query: 637 CMLSCSS 643
+SCSS
Sbjct: 515 LEVSCSS 521
>gi|123442444|ref|YP_001006423.1| hypothetical protein YE2183 [Yersinia enterocolitica subsp.
enterocolitica 8081]
gi|122089405|emb|CAL12253.1| conserved hypothetical protein [Yersinia enterocolitica subsp.
enterocolitica 8081]
Length = 499
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 211/533 (39%), Positives = 287/533 (53%), Gaps = 52/533 (9%)
Query: 114 ELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF 173
EL P+ + + L YT + P+ ++ +L+ SE +A LELD F P ++
Sbjct: 16 ELNNSPQFSNSYGQQLSGFYTHLQPTP-LKGARLLYHSEPLARELELDTSWFSDPKAAVW 74
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
+G L G P AQ Y GHQFG WAGQLGDGR I LGE + LKGAG TPY
Sbjct: 75 -AGEMLLPGMEPLAQVYSGHQFGQWAGQLGDGRGILLGEQKLSDGRHMDWHLKGAGLTPY 133
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AVLRS +REFL SEA+H LG+PT+RAL +VT+ V R+ + E GA+
Sbjct: 134 SRMGDGRAVLRSVVREFLASEALHHLGVPTSRALTIVTSDHPVYRE-------QAERGAM 186
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ RVA+S +RFG ++ R Q V+ LADY I H+ + +
Sbjct: 187 LLRVAESHVRFGHFEHFYYRQQPAQ--VKQLADYVIARHWPQWVGLEEC----------- 233
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
Y W +V +RTA L+A WQ +GF HGV+NTDNMSILG+T+DYGPFGFLD
Sbjct: 234 ----------YLLWFTDVVKRTARLMAHWQTIGFAHGVMNTDNMSILGITMDYGPFGFLD 283
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKF 473
+ P + N +D G RY F NQP + LWN+ + L+ L+ + +E Y +
Sbjct: 284 DYVPDYICNHSDHQG-RYAFDNQPAVALWNLHRLGQALSG--LLSTAQLQQALEAYEPEL 340
Query: 474 MDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
M Y M KLG + Q +++ LL+ M + DYT FR LS V+ +
Sbjct: 341 MAAYGQQMRAKLGFSDSDSQDNDLLTGLLSLMIKEGRDYTRTFRLLSEVETHSA------ 394
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
L PL+ +D + A+ SW Y L I D +R+ M++VNPKY+LRNYL Q
Sbjct: 395 LSPLRDDFID-----RAAFDSWYSRYRARLQQEQIDDAQRQQAMSAVNPKYILRNYLAQL 449
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AID AE D ++RL + +++P+ EQP + A LPP W +SCSS
Sbjct: 450 AIDQAEKDDIQPLQRLHQALQQPFAEQPELNDLAALPPDWGKH---LEISCSS 499
>gi|425082005|ref|ZP_18485102.1| hypothetical protein HMPREF1306_02756 [Klebsiella pneumoniae subsp.
pneumoniae WGLW2]
gi|428936186|ref|ZP_19009611.1| hypothetical protein MTE1_24983 [Klebsiella pneumoniae JHCK1]
gi|405601231|gb|EKB74385.1| hypothetical protein HMPREF1306_02756 [Klebsiella pneumoniae subsp.
pneumoniae WGLW2]
gi|426298830|gb|EKV61207.1| hypothetical protein MTE1_24983 [Klebsiella pneumoniae JHCK1]
Length = 480
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNVQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|429108513|ref|ZP_19170382.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
malonaticus 681]
gi|426295236|emb|CCJ96495.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
malonaticus 681]
Length = 482
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 214/528 (40%), Positives = 288/528 (54%), Gaps = 53/528 (10%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR + R+ L YT+++P+ + N +L + +A +LEL F+ + G T
Sbjct: 5 PRFTATWRDELPGFYTELTPTP-LNNSRLFFHNAPLAQALELPKTLFDYQGPAGVWGGET 63
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR D
Sbjct: 64 LLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRKLDWHLKGAGLTPYSRMGD 123
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
AVLRS++REFL SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A
Sbjct: 124 PAAVLRSTVREFLASEAMHGLGIPTTRALSIVTSDTPVRRE-------TTERGAMLMRIA 176
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R + + VR LA Y I HHF H+ +ED
Sbjct: 177 ESHVRFGHFEHFYYR--REPERVRELAQYVIEHHFAHLAQ------------EED----- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 ----RFALWFGEVVTRTAQLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQPG 273
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 274 FICNHTDYQG-RYAFDNQPGVGLWNLQRLAQAL--SPIIPAERLNALLDDYQPALLREWG 330
Query: 479 AIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + + +LL MA + DYT FR LS + S PL+
Sbjct: 331 RQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSETEQRSSAS------PLR 384
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+AA
Sbjct: 385 DEFID-----RATFDAWFARYRARLEEEGVEDDARQRLMKSVNPALVLRNWLAQRAIEAA 439
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D E+ RLL+ + P+ ++ + Y PP W V SCSS
Sbjct: 440 ERDDASELSRLLEALRNPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|429084451|ref|ZP_19147456.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
condimenti 1330]
gi|426546508|emb|CCJ73497.1| Selenoprotein O and cysteine-containing homologs [Cronobacter
condimenti 1330]
Length = 482
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 214/529 (40%), Positives = 291/529 (55%), Gaps = 53/529 (10%)
Query: 118 DPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGA 177
+PR + R+ L YT+++P+ + N +L+ + +A +L+L F+ G
Sbjct: 4 NPRFTATWRDELPGFYTELTPTP-LANSRLLCHNAPLAQALKLPDTLFDYQGPAGVLGGE 62
Query: 178 TPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFA 237
T L G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR
Sbjct: 63 TLLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLKDGRKVDWHLKGAGLTPYSRMG 122
Query: 238 DGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRV 297
DG AVLRS++REFL SEAMH L IPTTRAL +VT+ V R+ E GA++ R+
Sbjct: 123 DGRAVLRSTVREFLASEAMHGLRIPTTRALSIVTSDTPVRRE-------TTERGAMLIRI 175
Query: 298 AQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVV 357
A+S +RFG ++ R + + VR LA+Y I HHF H+ + DED
Sbjct: 176 AESHVRFGHFEHFYYR--REPEKVRELAEYVIAHHFAHLAH------------DED---- 217
Query: 358 DLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDP 417
++A W EV RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P
Sbjct: 218 -----RFALWFGEVVTRTAHLMASWQCVGFAHGVMNTDNMSILGLTMDYGPYGFLDDYQP 272
Query: 418 SFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEY 477
F N TD G RY F NQP +GLWN+ + + L + +I + N +++ Y + E+
Sbjct: 273 GFICNHTDHQG-RYAFDNQPGVGLWNLQRLAQAL--SPVIPAERLNALLDEYQPVLLREW 329
Query: 478 QAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
M KLG + + + +LL MA + DYT FR LS + S PL
Sbjct: 330 GKQMRAKLGFTVEKEGDNDYLRELLTLMAREGSDYTRTFRMLSVTEQRSSAS------PL 383
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ +D + + +W Y L G+ D+ R+ LM SVNP VLRN+L Q AI+A
Sbjct: 384 RDEFID-----RATFDAWFARYRARLAEEGVEDDARQTLMKSVNPALVLRNWLAQRAIEA 438
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AE D E+ RLL+ + P+ ++ + Y PP W V SCSS
Sbjct: 439 AERDDPSELTRLLEALRDPFADRD--DDYTHRPPDWGKHLEV---SCSS 482
>gi|455646323|gb|EMF25350.1| hypothetical protein H262_00220 [Citrobacter freundii GTC 09479]
Length = 480
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 291/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ ++N +L+ ++++A+ L + F+ P + G + L G P
Sbjct: 10 RDELPATYTALSPTP-LKNARLIWHNDALAEQLAIPAALFDIPTGAGVWGGESLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE ++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGSTFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVAQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVYRETV-------EAGAMLIRVAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ + ED ++KY
Sbjct: 182 HFEHFYYR--REPEKVRQLADFAIRHYWPQWQ--------------ED-------ADKYQ 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD + P + N +D
Sbjct: 219 LWFNDVVTRTATLIADWQAVGFAHGVMNTDNMSILGLTMDYGPFGFLDDYVPDYICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP LWN+ + + TL + I + N ++ Y + Y M +KL
Sbjct: 279 NQG-RYSFDNQPAAALWNLQRLAQTL--SPFIPVEALNDALDSYQLALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + +++S+L + M+ ++ DYT FR LS + + PL+ +D
Sbjct: 336 GFFSEQKDDNELLSELFSLMSRERSDYTRTFRMLSQTE------QHSAQSPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D R+ M + NP VLRN+L Q AI AE GD+ E
Sbjct: 388 ---RAAFDDWFTRYRSRLQQDNIADAVRQTQMKAANPAMVLRNWLAQRAISQAEQGDYAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 445 LHRLHQALRTPFADRD--DDYVSRPPDWGKRLEV---SCSS 480
>gi|251789270|ref|YP_003003991.1| hypothetical protein Dd1591_1659 [Dickeya zeae Ech1591]
gi|247537891|gb|ACT06512.1| protein of unknown function UPF0061 [Dickeya zeae Ech1591]
Length = 483
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 204/516 (39%), Positives = 286/516 (55%), Gaps = 56/516 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT+++P+ + +L+ ++ +A++L L FE D +SG L G P AQ Y G
Sbjct: 19 YTELTPTP-LHGARLLYYNAPLAETLGLSADYFE-GDNRRIWSGEKTLPGMAPLAQVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLG--EILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
HQFG+WAGQLGDGR I LG ++ + +++ W LKGAG TPYSR DG AVLRS +REF
Sbjct: 77 HQFGVWAGQLGDGRGILLGQQQLADGRTQDW--HLKGAGLTPYSRMGDGRAVLRSVVREF 134
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEA+H L IPTTRAL +VT+ V R+ +EE GA++ RVA S +RFG ++
Sbjct: 135 LASEALHHLNIPTTRALTIVTSDHPVQRE-------QEERGAMLLRVADSHVRFGHFEHF 187
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
R + + VR LA+Y I H+ H + ++++ W +
Sbjct: 188 YYR--REPEKVRQLAEYVIACHWPHWQQ---------------------ETDRFYLWFND 224
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V ERTA L+A WQ VGF HGV+NTDNMSILGLTIDYGPFGF+D + P + N +D G R
Sbjct: 225 VVERTARLIAHWQAVGFAHGVMNTDNMSILGLTIDYGPFGFMDDYQPGYICNHSDHQG-R 283
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK- 489
Y F NQP + LWN+ + + +L+ L+ + + RY M + +M KLG
Sbjct: 284 YAFDNQPAVALWNLHRLAQSLSG--LMSADKLQQALNRYEPALMQRFGELMRAKLGFTTP 341
Query: 490 --YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ LL M ++ DY++ FR LS + + PL+ V +D +
Sbjct: 342 LAQDNDVLVGLLQLMTREQADYSHIFRLLSETE------QHSRHSPLRDVFID-----RA 390
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
A+ W Y Q L+ D R+ M NP+Y+LRNYL Q AI+ AE GD G + RL
Sbjct: 391 AFDEWFSLYRQRLMLESTDDAVRQQQMKLANPRYILRNYLAQQAIERAETGDVGLLARLH 450
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + +PYD+QP A LPP W + SCSS
Sbjct: 451 QTLCQPYDDQPERADLAGLPPDWGKHLAI---SCSS 483
>gi|271500169|ref|YP_003333194.1| hypothetical protein Dd586_1623 [Dickeya dadantii Ech586]
gi|270343724|gb|ACZ76489.1| protein of unknown function UPF0061 [Dickeya dadantii Ech586]
Length = 483
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 211/546 (38%), Positives = 295/546 (54%), Gaps = 70/546 (12%)
Query: 103 EDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDP 162
DL +++ + ++LPG YT+++P+ + +L+ + S+A L L
Sbjct: 3 HDLPFNNHYHQQLPG--------------YYTELTPTP-LHGARLLYHNVSLAQELGLSA 47
Query: 163 KEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLG--EILNLKSER 220
FE D +SG L G P AQ Y GHQFG+WAGQLGDGR I LG ++ + +++
Sbjct: 48 DWFE-GDNQRIWSGERLLPGMAPLAQVYSGHQFGVWAGQLGDGRGILLGQQQLADGRTQD 106
Query: 221 WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDM 280
W LKGAG TPYSR DG AVLRS +REFL SEA+H LGIPTTRAL +V++ V R+
Sbjct: 107 W--HLKGAGLTPYSRMGDGRAVLRSVVREFLASEALHHLGIPTTRALTIVSSDHPVRRE- 163
Query: 281 FYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMN 340
+EE GA++ RVA S +RFG ++ R + + VR LA+Y I H+ +
Sbjct: 164 ------QEERGAMLLRVADSHVRFGHFEHFYYR--REPEQVRQLAEYVIACHWPQWQQ-- 213
Query: 341 KSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSIL 400
+++Y W +V RTA L+A WQ VGF HGV+NTDNMSIL
Sbjct: 214 -------------------DADRYYLWFSDVVARTARLIAHWQAVGFAHGVMNTDNMSIL 254
Query: 401 GLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDK 460
GLTIDYGPFGF+D + P + N +D G RY F NQP + LWN+ + + +L+ L+ +
Sbjct: 255 GLTIDYGPFGFMDDYQPDYICNHSDHQG-RYAFDNQPAVALWNLHRLAQSLSG--LMPVE 311
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALS 517
++ Y + M + +M KLG Q ++ LL M ++ DYT+ FR LS
Sbjct: 312 RLQQALKGYESALMQRFGELMRAKLGFDTPQAQDNDLLVGLLQLMKRERADYTHIFRLLS 371
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
+ S PL+ V +D+ A+ W +Y Q L+ D ER+ M
Sbjct: 372 ETERHSSHS------PLRDVFIDLA-----AFDGWFSAYRQRLMLESADDTERQQRMKQA 420
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVC 637
NP+Y+LRNYL Q AID AE D + RL + +++PY EQP A LPP W
Sbjct: 421 NPRYILRNYLAQQAIDLAEKEDVSALARLHQTLQQPYAEQPDKADLAALPPDWGKH---L 477
Query: 638 MLSCSS 643
+SCSS
Sbjct: 478 EISCSS 483
>gi|152970713|ref|YP_001335822.1| hypothetical protein KPN_02164 [Klebsiella pneumoniae subsp.
pneumoniae MGH 78578]
gi|378979316|ref|YP_005227457.1| hypothetical protein KPHS_31570 [Klebsiella pneumoniae subsp.
pneumoniae HS11286]
gi|425092045|ref|ZP_18495130.1| hypothetical protein HMPREF1308_02308 [Klebsiella pneumoniae subsp.
pneumoniae WGLW5]
gi|449052301|ref|ZP_21732197.1| hypothetical protein G057_10475 [Klebsiella pneumoniae hvKP1]
gi|166987597|sp|A6TAH1.1|Y2131_KLEP7 RecName: Full=UPF0061 protein KPN78578_21310
gi|150955562|gb|ABR77592.1| hypothetical protein KPN_02164 [Klebsiella pneumoniae subsp.
pneumoniae MGH 78578]
gi|364518727|gb|AEW61855.1| hypothetical protein KPHS_31570 [Klebsiella pneumoniae subsp.
pneumoniae HS11286]
gi|405612367|gb|EKB85124.1| hypothetical protein HMPREF1308_02308 [Klebsiella pneumoniae subsp.
pneumoniae WGLW5]
gi|448875959|gb|EMB10961.1| hypothetical protein G057_10475 [Klebsiella pneumoniae hvKP1]
Length = 480
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|354723168|ref|ZP_09037383.1| hypothetical protein EmorL2_09929 [Enterobacter mori LMG 25706]
Length = 480
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 207/518 (39%), Positives = 284/518 (54%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ + + +L+ + +AD L + P F+ + + G T LAG P AQ
Sbjct: 13 LPGFYTALKPTP-LHHSRLIWHNAPLADELAIPPDLFQPAEGAGVWGGETLLAGMQPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE E + LKGAG TPYSR DG AVLRS+IR
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQQLPNGETVDWHLKGAGLTPYSRMGDGRAVLRSTIR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A+S LRFG ++
Sbjct: 132 ESLASEAMHALGIPTTRALSIVTSDTPVARETM-------EQGAMLVRIAESHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R + + VR LADYAIR H+ ++ + KY W
Sbjct: 185 HFYYR--REPEKVRQLADYAIRRHWPQLQG---------------------EAEKYVLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ RTAS++A+WQ VGF HGV+NTDNMS+LGLT DYGP+GFLD + P + N +D G
Sbjct: 222 RDIVSRTASMIARWQTVGFAHGVMNTDNMSLLGLTFDYGPYGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID N ++ Y + EY A+M KLGL
Sbjct: 282 -RYSFDNQPAVGLWNLQRLAQSL--SPFIDVDALNDALDGYQEVLLREYGALMRNKLGLL 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + ++++ L MA + DYT FR LS + + + PL+ +D
Sbjct: 339 TQEKGDNELLNTLFALMAREGSDYTRTFRMLSQTEQNSAAS------PLRDEFID----- 387
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ W Y L + D R+ M + NP VLRN+L Q AI+ AE G + E+ R
Sbjct: 388 RQAFDDWFTLYRSRLQQEQVDDATRQEKMKAANPAMVLRNWLAQRAIEQAEQGQYDELHR 447
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 448 LHVALRTPFADRD--DDYVSRPPEWGKRLEV---SCSS 480
>gi|254197950|ref|ZP_04904372.1| conserved hypothetical protein [Burkholderia pseudomallei S13]
gi|169654691|gb|EDS87384.1| conserved hypothetical protein [Burkholderia pseudomallei S13]
Length = 525
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 226/548 (41%), Positives = 297/548 (54%), Gaps = 73/548 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L A + P+A + P +V +S+ A L L+P + P F F G
Sbjct: 28 PRDDAF--QQLGAAFVTRLPAAPLPAPYVVGFSDDAARMLGLEPALRDAPGFAELFCGNP 85
Query: 179 ----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYS
Sbjct: 86 TRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYS 143
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 144 RMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVVREEI-------ETSAVV 196
Query: 295 CRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
RVAQSF+RFG ++ A+ E L R LAD+ I E + D D
Sbjct: 197 TRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI-------------ERFYPACRDAD 240
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y A E RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+D
Sbjct: 241 --------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFID 292
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLID 458
AFD N +D G RY + QP I WN + L A + ++
Sbjct: 293 AFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSEDARAERAVE 351
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
D A+ V+ R+ +F + M KLGL + + + ++LL M D+T FR
Sbjct: 352 D--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRH 409
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L+ V + + P++ + +D ++A+ W Y L D R A MN
Sbjct: 410 LARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRARLSEEARDDASRAAAMN 460
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+NPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQ + YA LPP WA
Sbjct: 461 RMNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQLEHDAYAALPPDWA---S 517
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 518 TLEVSCSS 525
>gi|425076260|ref|ZP_18479363.1| hypothetical protein HMPREF1305_02170 [Klebsiella pneumoniae subsp.
pneumoniae WGLW1]
gi|425086893|ref|ZP_18489986.1| hypothetical protein HMPREF1307_02339 [Klebsiella pneumoniae subsp.
pneumoniae WGLW3]
gi|405591969|gb|EKB65421.1| hypothetical protein HMPREF1305_02170 [Klebsiella pneumoniae subsp.
pneumoniae WGLW1]
gi|405603617|gb|EKB76738.1| hypothetical protein HMPREF1307_02339 [Klebsiella pneumoniae subsp.
pneumoniae WGLW3]
Length = 480
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQKLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|422805734|ref|ZP_16854166.1| ydiU [Escherichia fergusonii B253]
gi|324113459|gb|EGC07434.1| ydiU [Escherichia fergusonii B253]
Length = 480
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 209/524 (39%), Positives = 293/524 (55%), Gaps = 59/524 (11%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A +T ++P+ + N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPATWTAINPTP-LHNARLIWHNAELAHELAIPQSLFADNKGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIP TR+L +VT+ V R+ E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPGTRSLAIVTSDTPVYRE-------TTETGAMLMRLAQSHMRFG 181
Query: 306 SYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
++ + R D++ V+ LAD+AIRH++ H++ +KY
Sbjct: 182 HFEHFYYLR---DIEKVQLLADFAIRHYWPHLQE---------------------AQDKY 217
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
A W +V RTASL+A WQ VGF HGV+NTDNMSI+GLT+DYGPFGFLD ++P F N +
Sbjct: 218 AIWFRDVVARTASLIAGWQTVGFAHGVMNTDNMSIMGLTLDYGPFGFLDDYNPQFICNHS 277
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F NQP + LWN+ + + TL + I N ++ Y + Y M +K
Sbjct: 278 DHQG-RYSFDNQPAVALWNLQRLAQTL--SPFIAVNALNDALDSYKQVLLAVYGKRMRQK 334
Query: 485 LGLPKYNKQ-----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL 539
LG Y +Q ++++L MA + DYT FR LS + + + PL+ +
Sbjct: 335 LGF--YTEQNNDNDLLNELFALMAREGSDYTRTFRMLSQTEQNSASS------PLRDEFI 386
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
D + A+ SW Y + + ++D+ER+ M SVNP VLRN+L Q AI+ A+ GD
Sbjct: 387 D-----RAAFDSWFSRYRARIQTEQVTDDERQLQMKSVNPAVVLRNWLAQRAINDAQKGD 441
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ RL ++ P++++ + Y+R PP W R V SCSS
Sbjct: 442 MEELHRLHDVLRNPFNDRD--DDYSRRPPEWGKRLEV---SCSS 480
>gi|392419487|ref|YP_006456091.1| hypothetical protein A458_02060 [Pseudomonas stutzeri CCUG 29243]
gi|390981675|gb|AFM31668.1| hypothetical protein A458_02060 [Pseudomonas stutzeri CCUG 29243]
Length = 486
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 219/549 (39%), Positives = 304/549 (55%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L L +D+ F R GD + T+VSP + +P+LV SE+ L
Sbjct: 1 MKSLTQLTFDNRFAR--LGDTFS------------TEVSPQP-LSDPRLVVVSEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E+P F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLDPAEAEQPLFAELFSGHKIWSTAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ + V R
Sbjct: 106 EYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTGSDSLVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG ++ + +R +L + L ++ I HF E
Sbjct: 166 E-------RPERGAMLLRLAPSHVRFGHFEFFYYTRQHAEL---KQLLEHVIEAHF--TE 213
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ E + A+ EV ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 214 LLEHPE-------------------PFHAFFREVLERTAALIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDARFICNHSDDAG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ K ME + + E+ +M ++LG + ++ ++ +LL M VDYTNFFR
Sbjct: 312 EVKVLRETMELFLPLYEAEWLDLMRRRLGFAQAEADDEALVRRLLQLMQASAVDYTNFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
LS A+ ++ L+ +D+ + + +W Y G ER+A M
Sbjct: 372 ELSESPAEQAVRR------LREDFVDL-----QGFDAWAADYCARTAREGSEPAERQARM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
+VNPKY+LRNYL Q AI+AAE GD+ VR L ++ RP++EQPGM++YA PP W
Sbjct: 421 QAVNPKYILRNYLAQQAIEAAEKGDYAPVRELHAVLSRPFEEQPGMQRYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|311279408|ref|YP_003941639.1| hypothetical protein Entcl_2101 [Enterobacter cloacae SCF1]
gi|308748603|gb|ADO48355.1| protein of unknown function UPF0061 [Enterobacter cloacae SCF1]
Length = 480
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 291/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L Y++++P A + N +L+ + +A L + F + + G L G P
Sbjct: 10 RDELPDFYSELAP-APLANARLIWHNAPLAQMLGIPDALFAPENGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
++RE L SEAMH LG+ TTRAL +VT+ V R+ E GA++ R+A+S +RFG
Sbjct: 129 TLRESLASEAMHHLGVATTRALSVVTSDTPVYRETV-------EQGAMLIRIAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ H+ VD +++KY
Sbjct: 182 HFEHFYYR--REPQKVQLLADYVIRHHWPHL--------------------VD-SADKYT 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V +TA +A+WQ +GF HGV+NTDNMSILGLT+DYGPFGFLD F PSF N +D
Sbjct: 219 LWLRDVVTKTAVAIARWQTLGFAHGVMNTDNMSILGLTLDYGPFGFLDDFQPSFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N +E Y ++EY + M +KL
Sbjct: 279 HQG-RYSFENQPAVALWNLQRLAQTLSPFIAVD--ALNQALEGYELALLEEYGSRMRRKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + +++ L M + DYT FR LS + + PL+ +D
Sbjct: 336 GLFTQEKGDNDLLNGLFALMEREGSDYTRTFRMLSATEQHSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+EA+ W Y + L I+D+ER+ +M NP VLRN+L Q AI+ AE GD+ E
Sbjct: 388 ---REAFDRWFSDYRRRLQQEQIADDERQRVMKQENPAIVLRNWLAQRAIEQAERGDYQE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+D++ + YA PP W R V SCSS
Sbjct: 445 LSRLHEALRTPFDDRS--DDYASRPPEWGKRLEV---SCSS 480
>gi|395230862|ref|ZP_10409161.1| UPF0061 protein ydiU [Citrobacter sp. A1]
gi|424732277|ref|ZP_18160856.1| protein ydiu [Citrobacter sp. L17]
gi|394715315|gb|EJF21137.1| UPF0061 protein ydiU [Citrobacter sp. A1]
gi|422893435|gb|EKU33283.1| protein ydiu [Citrobacter sp. L17]
Length = 480
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 290/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ ++N +L+ ++++A+ L + F+ P + G + L G P
Sbjct: 10 RDELPATYTALSPTP-LKNARLIWHNDALAEQLAIPAALFDIPTGAGVWGGESLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE ++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGSTFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVAQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVYRETV-------EAGAMLIRVAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ + ED ++KY
Sbjct: 182 HFEHFYYR--REPEKVRQLADFAIRHYWPQWQ--------------ED-------ADKYQ 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD + P + N +D
Sbjct: 219 LWFNDVVTRTATLIADWQAVGFAHGVMNTDNMSILGLTMDYGPFGFLDDYVPDYICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP LWN+ + + TL + I + N ++ Y + Y M +KL
Sbjct: 279 NQG-RYSFDNQPAAALWNLQRLAQTL--SPFIPVEALNDALDSYQLALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + +++S+L + M+ ++ DYT FR LS + PL+ +D
Sbjct: 336 GFFSEQKDDNELLSELFSLMSRERSDYTRTFRMLSQTEQHSGQS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D R+ M + NP VLRN+L Q AI AE GD+ E
Sbjct: 388 ---RAAFDDWFTRYRSRLQQDNIADAVRQTQMKAANPAMVLRNWLAQRAISQAEQGDYAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 445 LHRLHQALRTPFADRD--DDYVSRPPDWGKRLEV---SCSS 480
>gi|402843535|ref|ZP_10891930.1| PF02696 family protein [Klebsiella sp. OBRC7]
gi|402276953|gb|EJU26048.1| PF02696 family protein [Klebsiella sp. OBRC7]
Length = 480
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 212/521 (40%), Positives = 286/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ +EN +LV + + +L +D F + G T L G P
Sbjct: 10 RDELPDFYTALTPTP-LENARLVWHNAPLGRTLGVDASLFSPQKGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +V + V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIREALASEAMHALGIPTTRALAIVASDTPVYRETV-------ERGAMLMRLAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ H +E L V+ LADY IRHH+ H++N +++Y
Sbjct: 182 HFE-HFYYRREPLK-VQQLADYVIRHHWPHLQN---------------------EADRYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 LWFSDVVTRTAEMIACWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + TL + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYRFDNQPAVGLWNLQRLAQTL--SPFISAEALNGALDSYQQALLTAYGRRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + +++ L M + DYT FR LS + + + PL+ +D
Sbjct: 336 GLFTQQKGDNELLDGLFALMEREGSDYTRTFRMLSASEQESAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+E + SW +Y L + D +R+A M SVNP VLRN+L Q AI+ AE GD E
Sbjct: 388 ---RETFDSWFTAYRARLRDEQVEDAQRQARMRSVNPAIVLRNWLAQRAIEQAEQGDMSE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ L + P+ ++ ++Y + PP W R V SCSS
Sbjct: 445 LESLHSALSHPFADR--TDEYIQRPPDWGRRLEV---SCSS 480
>gi|420258400|ref|ZP_14761134.1| hypothetical protein YWA314_06637 [Yersinia enterocolitica subsp.
enterocolitica WA-314]
gi|404514126|gb|EKA27927.1| hypothetical protein YWA314_06637 [Yersinia enterocolitica subsp.
enterocolitica WA-314]
Length = 499
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 211/533 (39%), Positives = 286/533 (53%), Gaps = 52/533 (9%)
Query: 114 ELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF 173
EL P+ + + L YT + P+ ++ +L+ SE +A LELD F P ++
Sbjct: 16 ELDNSPQFSNSYGQQLSGFYTHLPPTP-LKGARLLYHSEPLARELELDTSWFSDPKAAVW 74
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
+G L G P AQ Y GHQFG WAGQLGDGR I LGE + LKGAG TPY
Sbjct: 75 -AGEMLLPGMEPLAQVYSGHQFGQWAGQLGDGRGILLGEQKLSDGRHMDWHLKGAGLTPY 133
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AVLRS +REFL SEA+H LG+PT+RAL +VT+ V R+ + E GA+
Sbjct: 134 SRMGDGRAVLRSVVREFLASEALHHLGVPTSRALTIVTSDHPVYRE-------QAERGAM 186
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ RVA+S +RFG ++ R Q V+ LADY I H+ + +
Sbjct: 187 LLRVAESHVRFGHFEHFYYRQQPAQ--VKQLADYVIARHWPQWVGLEEC----------- 233
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
Y W +V +RTA L+A WQ +GF HGV+NTDNMSILG+T+DYGPFGFLD
Sbjct: 234 ----------YLLWFTDVVKRTARLMAHWQTIGFAHGVMNTDNMSILGITMDYGPFGFLD 283
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKF 473
+ P + N +D G RY F NQP + LWN+ + L+ L+ + +E Y +
Sbjct: 284 DYVPDYICNHSDHQG-RYAFDNQPAVALWNLHRLGQALSG--LLSTAQLQQALEAYEPEL 340
Query: 474 MDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
M Y M KLG + Q +++ LL+ M + DYT FR LS V+ +
Sbjct: 341 MAAYGQQMRAKLGFSDSDSQDNDLLTGLLSLMIKEGRDYTRTFRLLSEVETHSA------ 394
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
L PL+ +D + A+ SW Y L I D +R+ M +VNPKY+LRNYL Q
Sbjct: 395 LSPLRDDFID-----RAAFDSWYSRYRARLQQEQIDDAQRQQAMRAVNPKYILRNYLAQL 449
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AID AE D ++RL + +++P+ EQP + A LPP W +SCSS
Sbjct: 450 AIDQAEKDDIQPLQRLHQALQQPFAEQPELNDLAALPPDWGKH---LEISCSS 499
>gi|383190686|ref|YP_005200814.1| hypothetical protein Rahaq2_2843 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
gi|371588944|gb|AEX52674.1| hypothetical protein Rahaq2_2843 [Rahnella aquatilis CIP 78.65 =
ATCC 33071]
Length = 484
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 207/541 (38%), Positives = 292/541 (53%), Gaps = 62/541 (11%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF 165
++H + +LPG YT++ P+ ++ +L+ SE +A L LD F
Sbjct: 3 QFEHHYADQLPG--------------FYTQLQPTP-LKGARLLYHSEPLARELGLDESLF 47
Query: 166 ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQL 225
+ ++ G G P AQ Y GHQFG WAGQLGDGR I LGE + +R++ L
Sbjct: 48 G-AEHRQYWCGEKFFPGMQPLAQVYSGHQFGQWAGQLGDGRGILLGEQVLPSGKRFDWHL 106
Query: 226 KGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGN 285
KGAG TPYSR DG AVLRS +REFL SEA+H L +PTTRAL +VT+ + V R+
Sbjct: 107 KGAGLTPYSRMGDGRAVLRSVVREFLASEALHHLSVPTTRALTIVTSDEPVFRE------ 160
Query: 286 PKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESL 345
+ E GA++ RVA+S +RFG ++ R Q + V+ LADY I HH+ + +L
Sbjct: 161 -QPERGAMLIRVAESHVRFGHFEHFYYRKQPEQ--VKQLADYVIAHHWPQLLESEPVAAL 217
Query: 346 SFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTID 405
+Y W V ERTA L+AQWQ +GF HGV+NTDNMSILGLTID
Sbjct: 218 -----------------RYQQWFTGVVERTARLMAQWQSIGFAHGVMNTDNMSILGLTID 260
Query: 406 YGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYV 465
YGP+GFLD + P + N +D G RY + NQP + WN+ + + TL+ L+ ++
Sbjct: 261 YGPYGFLDDYQPGYICNHSDHQG-RYSYDNQPAVAYWNLHRLAQTLSG--LMSAEQLQTA 317
Query: 466 MERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKAD 522
+ Y M Y +M KLG NKQ +++ LL+ MA + D+T FR LS +
Sbjct: 318 LGEYEPALMRAYGTLMRGKLGFFTENKQDNDLLTGLLSLMAKEGRDFTQTFRLLSQTE-- 375
Query: 523 PSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
+ + PL+ +D ++A+ SW +Y Q L + I D R+ M NP+ +
Sbjct: 376 ----QQQAASPLRDEFID-----RQAFDSWYQAYRQRLQTEDIGDATRQDAMKQSNPRII 426
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCS 642
LRNYL Q AI+ AE D + +L + + PY + P ++ A LPP W +SCS
Sbjct: 427 LRNYLAQKAIERAEADDISALEQLHQALRDPYSDAPQYDEMAALPPDWGKH---LEISCS 483
Query: 643 S 643
S
Sbjct: 484 S 484
>gi|419975172|ref|ZP_14490585.1| hypothetical protein KPNIH1_17518 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH1]
gi|419979625|ref|ZP_14494915.1| hypothetical protein KPNIH2_11070 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH2]
gi|419984197|ref|ZP_14499345.1| hypothetical protein KPNIH4_04985 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH4]
gi|419991823|ref|ZP_14506785.1| hypothetical protein KPNIH5_14214 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH5]
gi|419998242|ref|ZP_14513031.1| hypothetical protein KPNIH6_17333 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH6]
gi|420003235|ref|ZP_14517882.1| hypothetical protein KPNIH7_13507 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH7]
gi|420008731|ref|ZP_14523219.1| hypothetical protein KPNIH8_12011 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH8]
gi|420015187|ref|ZP_14529489.1| hypothetical protein KPNIH9_15259 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH9]
gi|420020488|ref|ZP_14534675.1| hypothetical protein KPNIH10_13282 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH10]
gi|420026177|ref|ZP_14540181.1| hypothetical protein KPNIH11_12622 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH11]
gi|420031965|ref|ZP_14545783.1| hypothetical protein KPNIH12_12844 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH12]
gi|420037801|ref|ZP_14551453.1| hypothetical protein KPNIH14_13590 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH14]
gi|420043387|ref|ZP_14556875.1| hypothetical protein KPNIH16_12854 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH16]
gi|420049392|ref|ZP_14562700.1| hypothetical protein KPNIH17_14089 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH17]
gi|420055002|ref|ZP_14568172.1| hypothetical protein KPNIH18_13662 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH18]
gi|420060472|ref|ZP_14573471.1| hypothetical protein KPNIH19_12571 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH19]
gi|420066604|ref|ZP_14579403.1| hypothetical protein KPNIH20_14434 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH20]
gi|420071946|ref|ZP_14584588.1| hypothetical protein KPNIH21_12408 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH21]
gi|420078270|ref|ZP_14590729.1| hypothetical protein KPNIH22_14952 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH22]
gi|420081636|ref|ZP_14593942.1| hypothetical protein KPNIH23_02831 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH23]
gi|428942695|ref|ZP_19015669.1| hypothetical protein MTE2_23668 [Klebsiella pneumoniae VA360]
gi|397343757|gb|EJJ36899.1| hypothetical protein KPNIH1_17518 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH1]
gi|397348446|gb|EJJ41546.1| hypothetical protein KPNIH2_11070 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH2]
gi|397354714|gb|EJJ47753.1| hypothetical protein KPNIH4_04985 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH4]
gi|397360838|gb|EJJ53509.1| hypothetical protein KPNIH6_17333 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH6]
gi|397362598|gb|EJJ55246.1| hypothetical protein KPNIH5_14214 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH5]
gi|397370219|gb|EJJ62810.1| hypothetical protein KPNIH7_13507 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH7]
gi|397376830|gb|EJJ69077.1| hypothetical protein KPNIH9_15259 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH9]
gi|397382922|gb|EJJ75076.1| hypothetical protein KPNIH8_12011 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH8]
gi|397387819|gb|EJJ79826.1| hypothetical protein KPNIH10_13282 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH10]
gi|397395803|gb|EJJ87503.1| hypothetical protein KPNIH11_12622 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH11]
gi|397398868|gb|EJJ90526.1| hypothetical protein KPNIH12_12844 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH12]
gi|397405040|gb|EJJ96519.1| hypothetical protein KPNIH14_13590 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH14]
gi|397413325|gb|EJK04542.1| hypothetical protein KPNIH17_14089 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH17]
gi|397414161|gb|EJK05363.1| hypothetical protein KPNIH16_12854 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH16]
gi|397422267|gb|EJK13244.1| hypothetical protein KPNIH18_13662 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH18]
gi|397429492|gb|EJK20206.1| hypothetical protein KPNIH20_14434 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH20]
gi|397433521|gb|EJK24168.1| hypothetical protein KPNIH19_12571 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH19]
gi|397439708|gb|EJK30141.1| hypothetical protein KPNIH21_12408 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH21]
gi|397445035|gb|EJK35290.1| hypothetical protein KPNIH22_14952 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH22]
gi|397452981|gb|EJK43045.1| hypothetical protein KPNIH23_02831 [Klebsiella pneumoniae subsp.
pneumoniae KPNIH23]
gi|426298153|gb|EKV60581.1| hypothetical protein MTE2_23668 [Klebsiella pneumoniae VA360]
Length = 480
Score = 345 bits (885), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGVGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|290509042|ref|ZP_06548413.1| hypothetical protein HMPREF0485_00813 [Klebsiella sp. 1_1_55]
gi|289778436|gb|EFD86433.1| hypothetical protein HMPREF0485_00813 [Klebsiella sp. 1_1_55]
Length = 480
Score = 345 bits (885), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFASENGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ + R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPIYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDVVTRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDIWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|387814901|ref|YP_005430388.1| hypothetical protein MARHY2499 [Marinobacter hydrocarbonoclasticus
ATCC 49840]
gi|381339918|emb|CCG95965.1| conserved hypothetical protein [Marinobacter hydrocarbonoclasticus
ATCC 49840]
Length = 484
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 201/514 (39%), Positives = 281/514 (54%), Gaps = 52/514 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y++V PS + P++V +++++A + + D+ +GA L G P A Y G
Sbjct: 20 YSRVQPSP-LSEPRMVCFNQALASDMGFLVRN--ENDWAAIGAGAELLEGMDPVAMKYTG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFGM+ +LGDGR + L E + RW+ LKGAG TPYSRF DG AVLRS+IRE+LC
Sbjct: 77 HQFGMYNPELGDGRGLLLWETVGPDGTRWDWHLKGAGTTPYSRFGDGRAVLRSTIREYLC 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL +++ V R+ E A + RVA+S +RFG ++ A
Sbjct: 137 SEAMHGLGIPTTRALFMISAKDPVRRESI-------ETAAALMRVAKSHIRFGHFEFAAH 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
E D ++TL ++ I HF H+ ++ + + +YA W EV
Sbjct: 190 --HEGPDALKTLLEHVIALHFPHLISLPEDQ-------------------RYARWFEEVV 228
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+A+WQ VGF HGV+N+DNMSI+G T DYGPF FLD FD F N +D G RY
Sbjct: 229 ERTARLIAKWQAVGFCHGVMNSDNMSIIGDTFDYGPFAFLDDFDAGFVCNHSDHEG-RYA 287
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
+ QP +G N + L ++D+ + RY T + + ++ M KLGL + +
Sbjct: 288 YNRQPQVGFINCQYLANALLP--IMDEDTVRRGLRRYETAYNEHFKHQMLAKLGLEEADG 345
Query: 493 Q---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+I N + +VDYT FFR LSN+ D P++ + D +
Sbjct: 346 SDMGLIMDTFNMLHEHRVDYTRFFRGLSNL-------HDHGTAPVRDLFAD-----RSVA 393
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
W+ Y L +ER+ M VNPKY+LRNYL Q I A+ GD+ ++ LLK+
Sbjct: 394 DEWLERYEARLQKETRGHDEREYAMRRVNPKYILRNYLAQQVILEAQNGDYEPMKELLKV 453
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+E+P+DEQP EKYA LPP W + SCSS
Sbjct: 454 LEKPFDEQPEYEKYAALPPDWGKHLNI---SCSS 484
>gi|374334316|ref|YP_005091003.1| hypothetical protein GU3_02480 [Oceanimonas sp. GK1]
gi|372984003|gb|AEY00253.1| hypothetical protein GU3_02480 [Oceanimonas sp. GK1]
Length = 462
Score = 345 bits (884), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 204/507 (40%), Positives = 289/507 (57%), Gaps = 56/507 (11%)
Query: 142 VENPQLVAWSESVADSL--ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWA 199
+++P L+ + +A+SL LD +++ SG L G P+AQ Y GHQFG ++
Sbjct: 7 LDSPSLLLVNYDLAESLGISLDDRQWLE-----ITSGHRLLPGMTPFAQVYAGHQFGGFS 61
Query: 200 GQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFL 259
+LGDGRA+ LGE++ RW+L LKGAGKTPYSRF DG AVLRSS+RE+L SEA+H+L
Sbjct: 62 PRLGDGRALLLGEVVAPGGARWDLHLKGAGKTPYSRFGDGRAVLRSSLREYLASEALHYL 121
Query: 260 GIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLD 319
GIPTTRALCLV +G+ V R+ + EPGA + R A S LRFG ++ GQ +
Sbjct: 122 GIPTTRALCLVGSGEPVYRE-------QVEPGAALLRAAPSHLRFGHFEYFYYSGQP--E 172
Query: 320 IVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLV 379
+ L DY I + +E Y A V RTA L+
Sbjct: 173 HIPALLDYLIDTQWPDLEK---------------------GPQGYGALFERVVTRTAELI 211
Query: 380 AQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDI 439
A+WQ VGF HGV+NTDNMS+LGLT+DYGP+GFLDA+DP N +D P RY + QP +
Sbjct: 212 ARWQAVGFCHGVMNTDNMSMLGLTLDYGPYGFLDAYDPGHICNHSD-PAGRYAYDQQPAV 270
Query: 440 GLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IIS 496
GLWN+ + + L+ +D + + + +Y + + Y M +KLGL ++++Q +
Sbjct: 271 GLWNLQRLAQALSGHIELDALQQS--LGQYEHQLLTAYSEHMRQKLGLEQWHEQDPALFR 328
Query: 497 KLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSY 556
+ + +A VDY+ +FR L+ + A+ +P + K +AW W Y
Sbjct: 329 DMFSLLAEHGVDYSCWFRRLALLDAEGDLPAPLAALLPK----------PDAWHDWFARY 378
Query: 557 IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDE 616
L+ + ER+A M++VNP YVLRN+L Q AI+ AE GD E LL+L+ RP+D+
Sbjct: 379 RARLVLESRTQAERRAAMDAVNPNYVLRNHLAQRAIERAEQGDMAEADTLLQLLARPFDD 438
Query: 617 QPGMEKYARLPPAWAYRPGVCMLSCSS 643
+P YA PAWA +C +SCSS
Sbjct: 439 RPEFNDYAEPAPAWA--ASLC-ISCSS 462
>gi|398801390|ref|ZP_10560633.1| hypothetical protein PMI17_04472 [Pantoea sp. GM01]
gi|398091947|gb|EJL82370.1| hypothetical protein PMI17_04472 [Pantoea sp. GM01]
Length = 479
Score = 345 bits (884), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 196/473 (41%), Positives = 269/473 (56%), Gaps = 50/473 (10%)
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
+SG L G P AQ Y GHQFG+WAGQLGDGR I LGE K + + LKGAG TPY
Sbjct: 54 WSGRELLPGMSPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSKGGKLDWHLKGAGLTPY 113
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AV+RSS+REFL SEA+H LGIPTTRAL L + V R+ +E GA+
Sbjct: 114 SRMGDGRAVIRSSVREFLASEALHHLGIPTTRALALAIGDEPVLRE-------TQERGAM 166
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ R+A+S LRFG ++ G++ D VR LADYAIRHH+ +++
Sbjct: 167 LMRIAESHLRFGHFEHVYYAGEQ--DKVRMLADYAIRHHWPQLQD--------------- 209
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+++Y W ++ +RTASL+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD
Sbjct: 210 ------EADRYQLWFTDIVKRTASLIAHWQSVGFAHGVMNTDNMSILGLTLDYGPYGFLD 263
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKF 473
+ P++ N +D G RY F NQP IGLWN+ + + L+ L+ ++ + +Y +
Sbjct: 264 DYQPNYICNHSDYQG-RYAFENQPMIGLWNLNRLAHALSG--LLSTEQLKQALGQYENEL 320
Query: 474 MDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
M + M KLGL + I++ LL+ M + DYT FR LS+ + + E
Sbjct: 321 MRVWGEKMRAKLGLLTADANDNTILTGLLSLMTAEHSDYTLTFRMLSDTQ------QQET 374
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
PL+ +D + A+ W Y Q LL SDE+R+A+M + NP VLRNYL Q
Sbjct: 375 RSPLRDEFID-----RAAFDRWYSDYRQRLLQDQASDEQRQAVMKAANPALVLRNYLAQQ 429
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
I+ E G+ + RL +++PY + + + PP W +SCSS
Sbjct: 430 VIEEVEKGETTALARLHNALQQPYSDAAVSAELRQRPPEWG---KTLEVSCSS 479
>gi|420372208|ref|ZP_14872517.1| hypothetical protein SF123566_2509, partial [Shigella flexneri
1235-66]
gi|391318491|gb|EIQ75630.1| hypothetical protein SF123566_2509, partial [Shigella flexneri
1235-66]
Length = 443
Score = 344 bits (883), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 207/481 (43%), Positives = 276/481 (57%), Gaps = 50/481 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ + N +L+ + +A++L + F+ + + G T L G P
Sbjct: 10 RDELPETYTALSPTP-LNNARLIWHNTELANTLSIPSSLFK--NAAGVWGGETLLPGMSP 66
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQF +WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 67 LAQVYSGHQFVVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRS 126
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ EPGA++ RVA S LRFG
Sbjct: 127 TIRESLASEAMHYLGIPTTRALSIVTSDSPVYRETV-------EPGAMLMRVAPSHLRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ H+E+ DED KY
Sbjct: 180 HFEHFYYR--REPEKVRQLADFAIRHYWSHLED------------DED---------KYR 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTASL+AQWQ VGF HGV+NTDNMS+LGLT+DYGPFGFLD ++P F N +D
Sbjct: 217 LWFNDVVARTASLIAQWQTVGFAHGVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 277 HQG-RYSFDNQPAVALWNLQRLAQTLSPFVAVD--ALNEALDSYQQVLLTHYGQRMRQKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA ++ DYT FR LS + + PL+ +D
Sbjct: 334 GFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAAS------PLREEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L +SD ER+ LM SVNP VLRN+L Q AI+AAE GD E
Sbjct: 386 ---RAAFDDWFARYRGRLQQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMME 442
Query: 603 V 603
+
Sbjct: 443 L 443
>gi|338721443|ref|XP_003364376.1| PREDICTED: LOW QUALITY PROTEIN: selenoprotein O [Equus caballus]
Length = 667
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 215/524 (41%), Positives = 275/524 (52%), Gaps = 99/524 (18%)
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
LFFSG L GA P A CY GHQFG +AGQLGDG A+ LGE+ ERWELQLKGAG T
Sbjct: 117 LFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEVCTAAGERWELQLKGAGPT 176
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
P+SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+ V RD FYDGNPK E
Sbjct: 177 PFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSQSTVVRDAFYDGNPKYEKC 236
Query: 292 AIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTLADYAIRHHFRHIENMNKS 342
+V R+A +FLRFGS++I H R + DI + DY I + I+ + S
Sbjct: 237 TVVLRIASTFLRFGSFEIFKSTDEHTGRAGPSVGRNDIRVQMLDYVIGSFYPEIQAAHAS 296
Query: 343 ESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGL 402
+S+ + AA+ EV RTA +VA+WQ VGF HGVLNTDNMSI+GL
Sbjct: 297 DSV----------------QRNAAFFREVTRRTARMVAEWQCVGFCHGVLNTDNMSIVGL 340
Query: 403 TIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA 462
TIDYGPFGFLD +DP N +D GR Y ++ QP++ WN+ + + L + EA
Sbjct: 341 TIDYGPFGFLDRYDPDHVCNASDNAGR-YTYSKQPEVCKWNLQKLAEALEPELPRELGEA 399
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDY--------- 509
+ E + +F Y M +KLGL + ++ +++KLL M + D+
Sbjct: 400 -ILAEEFDAEFHRHYLQKMRRKLGLVQAEQEEDAVLVAKLLETMHLTGADFTNTFYLLSS 458
Query: 510 ----------TNFFRALSNVKAD---------PSIPEDEL------------LVPLKAVL 538
T F AL+ A P + +L L L
Sbjct: 459 FPAGPESLGLTEFLAALTTQCASLEELRLAFRPQMDPRQLSMMLMLAQSNPQLFALIGTR 518
Query: 539 LDIGKERKEA---------------------WISWVLSYIQELLS--SGISD-----EER 570
++ KE + W W+ +Y L G D ER
Sbjct: 519 ANVTKELERVEQQSRLEQLSPAELLSRNRGHWADWLQAYRARLEQDKEGAGDPEAWQAER 578
Query: 571 KALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPY 614
+M++ NPKYVLRNY+ Q+AI+AAE GDF EVRR+LKL+E PY
Sbjct: 579 VRVMHANNPKYVLRNYIAQTAIEAAESGDFSEVRRVLKLLEAPY 622
>gi|381404726|ref|ZP_09929410.1| hypothetical protein S7A_10755 [Pantoea sp. Sc1]
gi|380737925|gb|EIB98988.1| hypothetical protein S7A_10755 [Pantoea sp. Sc1]
Length = 483
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 214/543 (39%), Positives = 297/543 (54%), Gaps = 69/543 (12%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L++D+++ REL G YT ++P+ + +L+ + +A S+ LD
Sbjct: 6 LSFDNTWFRELTG--------------GYTALNPTP-LAGGRLLYHNAPLAASMGLDNAL 50
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F ++ GA L G P AQ Y GHQFG+WAGQLGDGR I LGE E+ +
Sbjct: 51 FTGNGHDVW-HGAALLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQRTEDGEKLDWH 109
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSR DG AV+RSS+REFL SEA+H LGIPTTRAL L + V R+
Sbjct: 110 LKGAGLTPYSRMGDGRAVIRSSVREFLASEALHHLGIPTTRALTLSIGDEPVYRE----- 164
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSE 343
E GA++ R++ S LRFG ++ S+ QE V+ LADYAIRHH+ H+
Sbjct: 165 --TAERGAMLMRISPSHLRFGHFEHFFYSQQQEK---VQQLADYAIRHHWPHLVE----- 214
Query: 344 SLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLT 403
+++Y W +V RTA L+A WQ VGF HGV+NTDNMSILGLT
Sbjct: 215 ----------------EADRYQRWFTDVVVRTARLIALWQSVGFAHGVMNTDNMSILGLT 258
Query: 404 IDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEAN 463
IDYGP+GFLD + P F N +D G RY F NQP IG+WN+ + + L+ L+ ++
Sbjct: 259 IDYGPYGFLDDYQPDFICNHSDYQG-RYSFENQPMIGMWNLNRLAHALSG--LLTTEQLR 315
Query: 464 YVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
+ Y + M + M KLGL + QI++ LL M + DYT FR LS +
Sbjct: 316 SALSAYEPELMRVWGERMRAKLGLLTQQSSDNQILTDLLALMTQEHSDYTLTFRQLSETQ 375
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
+ E PL+ +D +EA+ W Y L+ +SD ER+A+M + NP
Sbjct: 376 ------QAESRSPLRDEFID-----REAFDRWYQRYRSRLMDEQVSDAERQAVMKAANPA 424
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLS 640
+LRNYL Q AI+ AE G+ G + RL + +++P+ ++ + Y + PP W +S
Sbjct: 425 VILRNYLAQQAIEEAERGEQGALARLHQALQQPFSDETAAD-YRQRPPDWG---KTLEVS 480
Query: 641 CSS 643
CSS
Sbjct: 481 CSS 483
>gi|53805169|ref|YP_113101.1| hypothetical protein MCA0585 [Methylococcus capsulatus str. Bath]
gi|81682800|sp|Q60B95.1|Y585_METCA RecName: Full=UPF0061 protein MCA0585
gi|53758930|gb|AAU93221.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath]
Length = 504
Score = 344 bits (882), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 209/508 (41%), Positives = 273/508 (53%), Gaps = 53/508 (10%)
Query: 144 NPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLG 203
P++V ++ ++A L P+ P +G P G A Y GHQFG W QLG
Sbjct: 42 EPRMVHFNAALAGELGFGPEAG--PQLLEILAGNRPWPGYASSASVYAGHQFGAWVPQLG 99
Query: 204 DGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPT 263
DGRA+ + E+ ER ELQLKGAG TPYSR DG AVLRSSIRE+L SEAMH LG+PT
Sbjct: 100 DGRALLIAEVRTPARERVELQLKGAGPTPYSRGLDGRAVLRSSIREYLASEAMHALGVPT 159
Query: 264 TRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRT 323
TR L LV + + V R+ E A+VCR A SF+RFG ++ A RGQ + +
Sbjct: 160 TRCLSLVASPQPVARETV-------ESAAVVCRAAASFVRFGQFEYFAGRGQT--EPMAR 210
Query: 324 LADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQ 383
LAD+ I HF H++ + ++AAW EV ERTA L+AQWQ
Sbjct: 211 LADHVIAEHFPHLQGHPE---------------------RHAAWLGEVIERTARLIAQWQ 249
Query: 384 GVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWN 443
+GF HGV+NTDN S+LGLT+DYGPFGF+D F N +D G RY + QP++G WN
Sbjct: 250 LLGFCHGVMNTDNFSVLGLTLDYGPFGFMDRFRWYHVCNHSDYEG-RYAYRAQPEVGRWN 308
Query: 444 IAQF----STTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IIS 496
+ S LA A + ++ RY + + KLGL + + +I
Sbjct: 309 CERLLQAVSPLLADAPGRAAEIGQDLLRRYASVYHRAVMRGWADKLGLREVRETDAGLID 368
Query: 497 KLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSY 556
+ L + + D+T FR L ++ D P ++ DI A+ +WV Y
Sbjct: 369 EFLGLLQRGRGDFTRSFRLLGRIRTDSDAPARG----VREAFADI-----NAFDAWVADY 419
Query: 557 IQELLS-SGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYD 615
L S + DE R MN VNPKYVLRN+L Q AID A LGD+ EV RL +L+ RPYD
Sbjct: 420 RTRLRSEQNVDDEARAGRMNRVNPKYVLRNHLAQIAIDKAMLGDYSEVARLAELLRRPYD 479
Query: 616 EQPGMEKYARLPPAWAYRPGVCMLSCSS 643
EQP ME YA PP + +SCSS
Sbjct: 480 EQPDMEAYAAEPPDYMRN---IEVSCSS 504
>gi|262044139|ref|ZP_06017213.1| SelO family protein [Klebsiella pneumoniae subsp. rhinoscleromatis
ATCC 13884]
gi|259038511|gb|EEW39708.1| SelO family protein [Klebsiella pneumoniae subsp. rhinoscleromatis
ATCC 13884]
Length = 480
Score = 344 bits (882), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 280/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGMPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ L + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LEHLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|238895219|ref|YP_002919954.1| hypothetical protein KP1_3267 [Klebsiella pneumoniae subsp.
pneumoniae NTUH-K2044]
gi|402780328|ref|YP_006635874.1| selenoprotein O-like protein [Klebsiella pneumoniae subsp.
pneumoniae 1084]
gi|238547536|dbj|BAH63887.1| hypothetical protein KP1_3267 [Klebsiella pneumoniae subsp.
pneumoniae NTUH-K2044]
gi|402541234|gb|AFQ65383.1| Selenoprotein O-like protein [Klebsiella pneumoniae subsp.
pneumoniae 1084]
Length = 480
Score = 344 bits (882), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGVGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RV++S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVSESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|206579419|ref|YP_002237990.1| hypothetical protein KPK_2154 [Klebsiella pneumoniae 342]
gi|226701195|sp|B5XQE2.1|Y2154_KLEP3 RecName: Full=UPF0061 protein KPK_2154
gi|206568477|gb|ACI10253.1| conserved hypothetical protein [Klebsiella pneumoniae 342]
Length = 480
Score = 344 bits (882), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT + P+ ++N +L+ + +A L + F + + G L G P
Sbjct: 10 RDELPDFYTSLLPTP-LDNARLIWRNAPLAQQLGVPDALFAPENGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ + R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPIYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDVVTRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|330009650|ref|ZP_08306543.1| hypothetical protein HMPREF9538_04237 [Klebsiella sp. MS 92-3]
gi|328534777|gb|EGF61332.1| hypothetical protein HMPREF9538_04237 [Klebsiella sp. MS 92-3]
Length = 480
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 281/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
++RE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TLRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 182 HFEHFYYR--REPQKVQKLADYVIRHHWPQLQD---------------------EADKYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDVGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|171058525|ref|YP_001790874.1| hypothetical protein Lcho_1842 [Leptothrix cholodnii SP-6]
gi|170775970|gb|ACB34109.1| protein of unknown function UPF0061 [Leptothrix cholodnii SP-6]
Length = 503
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 213/503 (42%), Positives = 273/503 (54%), Gaps = 51/503 (10%)
Query: 148 VAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRA 207
VA SE A L + P S G+ P A Y GHQFG WAGQLGDGRA
Sbjct: 45 VAVSEGAAAELGWAGDWWLHPQALAAHSAGPSWPGSTPMATVYSGHQFGSWAGQLGDGRA 104
Query: 208 ITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRAL 267
+ LGEI R E+QLKG+G TPYSR DG AVLRSSIREFLCSEAM LGIPTTRAL
Sbjct: 105 LLLGEIDTPSGPR-EIQLKGSGLTPYSRMGDGRAVLRSSIREFLCSEAMAGLGIPTTRAL 163
Query: 268 CLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADY 327
+ + V R+ E ++V R A SF+RFG ++ GQ +R L D+
Sbjct: 164 AITASPLQVRRE-------GPETTSVVTRTAPSFIRFGHFEHFCHHGQPA--ALRQLFDF 214
Query: 328 AIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGF 387
+ HH+ D H L + V+ RTA L+AQWQ VGF
Sbjct: 215 VLEHHYPECR-------------DAPHPAAALLES--------VSRRTAELMAQWQAVGF 253
Query: 388 THGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF 447
HGV+NTDNMSILGLTIDYGPFGFLD FDP N +D G RY +A QP + WN+
Sbjct: 254 CHGVMNTDNMSILGLTIDYGPFGFLDGFDPGHICNHSDHQG-RYAYARQPQVAYWNLHAL 312
Query: 448 STTLAAAKLIDDKEANYV----MERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLN 500
+ L D+E + + +E Y F + A+M KLGL ++ ++ LL
Sbjct: 313 AQALVPLVEGSDEEISEILGAALEPYRELFPHQMDALMGAKLGLQSRRDEDRALLDDLLG 372
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL 560
M+ +VDYT +R L AD S +D P++ + L+ + A+ +W Y L
Sbjct: 373 LMSATQVDYTLCWRQL----ADFSSADDGGTGPVRDLFLN-----RPAFDAWAARYRSRL 423
Query: 561 LSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGM 620
+ G D ER+A MN VNP+YVLRN+L + AI ++ GD EV+RL +++ERP+DEQP
Sbjct: 424 QAEGSVDAERRARMNHVNPRYVLRNHLAELAIRRSQAGDDSEVQRLARVLERPFDEQPEH 483
Query: 621 EKYARLPPAWAYRPGVCMLSCSS 643
YA LPP WA +SCSS
Sbjct: 484 AAYAALPPDWAQ---TLEISCSS 503
>gi|418531206|ref|ZP_13097123.1| hypothetical protein CTATCC11996_15985 [Comamonas testosteroni ATCC
11996]
gi|371451708|gb|EHN64743.1| hypothetical protein CTATCC11996_15985 [Comamonas testosteroni ATCC
11996]
Length = 503
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 217/528 (41%), Positives = 290/528 (54%), Gaps = 60/528 (11%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAVPY 186
A +T + P+ V P +A S S A + L+P+ + SG +G+ P
Sbjct: 21 AFFTYLHPT-PVSEPHWIAASVSTARWMGLNPQWLHSAEALQILSGNAVSDHGNSGSKPL 79
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG+WAGQLGDGRAI LGE + +E+QLKGAG+TPYSR DG AVLRSS
Sbjct: 80 ATVYSGHQFGVWAGQLGDGRAILLGE----TEQGFEVQLKGAGRTPYSRMGDGRAVLRSS 135
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IREFLCSEAM LGIPTTRAL L + V R+ E A+V RVA+SF+RFG
Sbjct: 136 IREFLCSEAMTALGIPTTRALALTGSPLPVARETM-------ETAAVVTRVAESFIRFGH 188
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ A+R + ++ LAD I H+ E + L N YA
Sbjct: 189 FEHFAARDMQAE--LKALADMVIDQHY-----------------PECRTAAALNGNPYAN 229
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
+ V+ERTA L+AQWQGVGF HGV+NTDNMSILGLTIDYGPF FLD FDP N +D
Sbjct: 230 FLQAVSERTARLLAQWQGVGFCHGVMNTDNMSILGLTIDYGPFQFLDVFDPGHICNHSDS 289
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY-VMERYGTKFMDEYQAIMTKKL 485
G RY F QP + WN+ + A LI D+E +E Y T F Y M KL
Sbjct: 290 QG-RYAFNRQPQVAYWNL--YCLGQALLPLIGDEELTIAALESYKTVFPAAYARQMLAKL 346
Query: 486 GLPKYNKQ----------IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
GLP+ +++ LL +A KVDYT FF L+ A + + PL+
Sbjct: 347 GLPENEAGTPATEGRFALLVNPLLQILADSKVDYTIFFTRLTAAVAQGQQRKID-FEPLR 405
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
++LD + ++ +W L+Y ++L + + + LM NP++VLRN+L ++ I AA
Sbjct: 406 DIILD-----RASFDAWSLTYSEQL--AQMDKAQTVDLMQKSNPRFVLRNHLGETVIRAA 458
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ GDF V+++L +++ PYD P +A PP WA +SCSS
Sbjct: 459 QAGDFAPVQQMLAVLQTPYDPHPDHADWAGFPPDWA---SSIEISCSS 503
>gi|386035301|ref|YP_005955214.1| hypothetical protein KPN2242_13795 [Klebsiella pneumoniae KCTC
2242]
gi|424831096|ref|ZP_18255824.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae Ecl8]
gi|339762429|gb|AEJ98649.1| hypothetical protein KPN2242_13795 [Klebsiella pneumoniae KCTC
2242]
gi|414708529|emb|CCN30233.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
pneumoniae Ecl8]
Length = 480
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 280/521 (53%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++ Y
Sbjct: 182 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADMYL 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 219 LWFRDIVTRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 388 ---RAAFDSWFAGYRARLCDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 480
>gi|91788443|ref|YP_549395.1| hypothetical protein Bpro_2581 [Polaromonas sp. JS666]
gi|121957872|sp|Q12AE5.1|Y2581_POLSJ RecName: Full=UPF0061 protein Bpro_2581
gi|91697668|gb|ABE44497.1| protein of unknown function UPF0061 [Polaromonas sp. JS666]
Length = 496
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 221/543 (40%), Positives = 294/543 (54%), Gaps = 69/543 (12%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L W +SF R PG YT++ P+ + +P V S+++A L L+
Sbjct: 19 LKWGNSFARLGPG--------------FYTELQPTP-LPSPYWVGRSQALARELGLEDHW 63
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
E + +G AG+ P A Y GHQFG+WAGQLGDGRAI LG+ L + E+Q
Sbjct: 64 LESAEALEVLTGNRSTAGSRPLASVYSGHQFGVWAGQLGDGRAILLGD-LQTPAGPQEIQ 122
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG+TPYSR DG AVLRSSIREFL SEAMH LGIPTTRALC+ + V R+
Sbjct: 123 LKGAGRTPYSRMGDGRAVLRSSIREFLASEAMHGLGIPTTRALCVTGSDAPVRREDI--- 179
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
E A+V R + SF+RFG ++ + Q D ++TLADY I
Sbjct: 180 ----ETAAVVTRTSPSFIRFGHFEHFSYSNQHDR--LKTLADYVI--------------- 218
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
D + YAA +ERTA L+A WQ +GF HGV+NTDNMSILGLTI
Sbjct: 219 ------DGFYPACREAKQPYAALLEAASERTARLMAAWQAIGFCHGVMNTDNMSILGLTI 272
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-AN 463
DYGPF FLDAFDP N +D P RY + QP+I WN+ F A LI+D+E A
Sbjct: 273 DYGPFQFLDAFDPGHICNHSD-PQGRYAYNKQPNIAYWNL--FCLGQALLPLIEDQEQAL 329
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
+E Y T F +A M KLGL + ++++I +A +KVDYT F+R L
Sbjct: 330 AALESYKTVFPQALEARMRDKLGLVETQAGDRELIESTFKLLASNKVDYTIFWRRLCGFT 389
Query: 521 ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPK 580
P + ++ + D +E++ +W L Y + + +G+ R LM NPK
Sbjct: 390 --PQSGHES----VRDLFFD-----RESFNAWALQYSERV--AGVDQGVRANLMLKSNPK 436
Query: 581 YVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLS 640
+VLRN+L + AI AA+L DF V LL L++ P+DE PG + ++ PP WA +S
Sbjct: 437 FVLRNHLGEEAIRAAKLKDFSGVNTLLGLLQAPFDEHPGHDSFSDFPPDWA---SSIEIS 493
Query: 641 CSS 643
CSS
Sbjct: 494 CSS 496
>gi|418293408|ref|ZP_12905316.1| hypothetical protein PstZobell_08917 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
gi|379064799|gb|EHY77542.1| hypothetical protein PstZobell_08917 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
Length = 486
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 218/549 (39%), Positives = 301/549 (54%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L L++D+ F R GD + T+VSP +E P+LV SE+ L
Sbjct: 1 MKTLTQLHFDNRFAR--LGDTFS------------TQVSPQP-LEAPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E+ F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLDPAEAEQALFAELFSGHKIWSTAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ ++ V R
Sbjct: 106 EYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTSSDTLVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG ++ + +R +L + L ++ I HF +
Sbjct: 166 E-------RPERGAMLLRLAPSHVRFGHFEFFYYTRQHAEL---KQLLEHVIAAHFSELL 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + F T V ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 216 EHPEPFHMFFRT---------------------VLERTAALIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDARFICNHSDDTG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
D K ME + + E+ +M ++LG + ++ ++ +LL M VDYTNFFR
Sbjct: 312 DVKVLRETMELFLPLYEAEWLDLMRRRLGFAQAQADDETLVRRLLQLMQASAVDYTNFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
LS A+ ++ L+ +D+ + + +W Y G R+A M
Sbjct: 372 ELSESPAEQAVRR------LREDFVDL-----QGFDAWAADYCARTALEGGDPAARQARM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
+VNPKY+LRNYL Q AI+AAE GD+ VR L ++ RP+DEQPGM++YA PP W
Sbjct: 421 QAVNPKYILRNYLAQQAIEAAEKGDYAPVRELHAVLSRPFDEQPGMQRYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|398845569|ref|ZP_10602598.1| hypothetical protein PMI38_01956 [Pseudomonas sp. GM84]
gi|398253428|gb|EJN38556.1| hypothetical protein PMI38_01956 [Pseudomonas sp. GM84]
Length = 486
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 293/550 (53%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L++D+ F R GD A T+V P + +P+LV SES L
Sbjct: 1 MKALDQLSFDNRFAR--LGD------------AFSTQVLPDP-IADPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+L
Sbjct: 46 DLDPAQAELPIFAELFSGQKLWEEADPRAMVYSGHQFGAYNPRLGDGRGLLLAEVLTDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIPT+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPTSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L DY + H+ +
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDYVLEQHYPECRD 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ IG WN++ + +L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIGHWNLSALAQSLTTVIEVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL + ++ +LL M VDY FF
Sbjct: 315 PLKEALGLFLPLYQAHYLD----LMRRRLGLATAEDDDMALVERLLQCMQSGGVDYNLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L + P L ++ +D+ + +W Y+ + E R+
Sbjct: 371 RKLGDQ------PVAAALTVVRDDFIDLA-----GFDAWGADYLARCEREAGNAEGRRER 419
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M +VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ RP++EQ GM+ YA PP W
Sbjct: 420 MQAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLSRPFEEQAGMQAYAERPPEWGKH 479
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 480 ---LEISCSS 486
>gi|339999185|ref|YP_004730068.1| hypothetical protein SBG_1197 [Salmonella bongori NCTC 12419]
gi|339512546|emb|CCC30286.1| conserved hypothetical protein [Salmonella bongori NCTC 12419]
Length = 480
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 209/521 (40%), Positives = 293/521 (56%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++++A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWFNDALAQQLAIPVSLFDTTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQILADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTAVQRE-------TQEAGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ ++ +Y
Sbjct: 182 HFEHFYYR--REPEKVKQLADFAIRHYWPQWQD---------------------APERYV 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EV RT +L+A+WQ GF HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVVIRTGTLIAEWQAAGFAHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I N +ERY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPAVALWNLQRLAQTL--TPFIAADVLNNALERYQEALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ + +D
Sbjct: 336 GFFTRQKDDNALLNELFSLMAREGSDYTLTFRMLSHTEQQSASS------PLRDMFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ + +W Y L + + D +R+ M SVNP VLRN+L Q AI+AAE D E
Sbjct: 388 ---RAGFDAWFDRYRARLRTEAVDDMQRQQQMQSVNPAVVLRNWLAQRAIEAAEQDDMSE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ + YA PP W R V SCSS
Sbjct: 445 LHRLHEILRQPFADRD--DDYASRPPEWGKRLEV---SCSS 480
>gi|385788260|ref|YP_005819369.1| hypothetical protein EJP617_28010 [Erwinia sp. Ejp617]
gi|310767532|gb|ADP12482.1| hypothetical protein EJP617_28010 [Erwinia sp. Ejp617]
Length = 479
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 213/518 (41%), Positives = 284/518 (54%), Gaps = 52/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L+ YT + P+ ++N +L+ + +A L LD + F + L+ SG G P AQ
Sbjct: 11 LNGFYTALQPTP-LKNARLLYHNAGLARELGLDERLFHAQNAGLW-SGERLPDGMQPLAQ 68
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE +++ LKGAG TPYSR DG AVLRS++R
Sbjct: 69 VYSGHQFGVWAGQLGDGRGILLGEQQLPDGRKFDWHLKGAGLTPYSRMGDGRAVLRSTLR 128
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EFL EAMH LGI T+RAL +V++ + V R+ E GA++ RVA+S +RFG ++
Sbjct: 129 EFLAGEAMHHLGIATSRALTVVSSDEPVYRE-------TTETGAMLLRVAESHVRFGHFE 181
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
+GQ + V LADY IRHH+ +KY W
Sbjct: 182 HFYYQGQP--EKVTQLADYVIRHHWPQWVQ---------------------ERDKYLLWF 218
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V +RTA L+A WQ +GF HGV+NTDNMSILGLT+DYGPFGFLD + P F N +D G
Sbjct: 219 SDVVQRTARLIAGWQSIGFAHGVMNTDNMSILGLTLDYGPFGFLDDYQPEFICNHSDHQG 278
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP IGLWN+ + + L+ L+ ++ + Y + M + M KLGL
Sbjct: 279 -RYSFENQPMIGLWNLNRLAHALSG--LMSPQQLEQALAGYEPELMRCWGEKMRAKLGLL 335
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + I++ LL+ M + DYT FR LS + S PL+ +D
Sbjct: 336 IPGKDDNHILTGLLSLMTREGSDYTRTFRQLSQSEQLQSRS------PLRDEFID----- 384
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ SW + Q LL SDEER+ M NP +LRNYL Q AI+ AE D + R
Sbjct: 385 RDAFDSWYNVWRQRLLKEECSDEERQRTMKLANPALILRNYLAQQAIERAEQEDISVLAR 444
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + + RPYDE P AR PP W + V SCSS
Sbjct: 445 LHQALSRPYDEAPEFADLARRPPDWGKKLEV---SCSS 479
>gi|120555480|ref|YP_959831.1| hypothetical protein Maqu_2569 [Marinobacter aquaeolei VT8]
gi|120555487|ref|YP_959838.1| hypothetical protein Maqu_2576 [Marinobacter aquaeolei VT8]
gi|120555494|ref|YP_959845.1| hypothetical protein Maqu_2583 [Marinobacter aquaeolei VT8]
gi|120325329|gb|ABM19644.1| protein of unknown function UPF0061 [Marinobacter aquaeolei VT8]
gi|120325336|gb|ABM19651.1| protein of unknown function UPF0061 [Marinobacter aquaeolei VT8]
gi|120325343|gb|ABM19658.1| protein of unknown function UPF0061 [Marinobacter aquaeolei VT8]
Length = 484
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 198/514 (38%), Positives = 284/514 (55%), Gaps = 52/514 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y++V PS + P++V +++++A + ++ D+ +GA L G P A Y G
Sbjct: 20 YSRVQPSP-LSEPRMVCFNQALASDMGFLVRD--ENDWAAIGAGAELLEGMDPVAMKYTG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFGM+ +LGDGR + L E + RW+ LKGAG TPYSRF DG AVLRS+IRE+LC
Sbjct: 77 HQFGMYNPELGDGRGLLLWETVGPDGTRWDWHLKGAGTTPYSRFGDGRAVLRSTIREYLC 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL +++ V R+ E A + RVA+S +RFG ++ A
Sbjct: 137 SEAMHGLGIPTTRALFMISAKDPVRRESI-------ETAAALMRVAKSHIRFGHFEFAAH 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
E + ++TL ++ I HF H+ ++ + + +YA W EV
Sbjct: 190 --HEGPEALKTLLEHVIALHFPHLISLPEEQ-------------------RYARWFEEVV 228
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA L+A+WQ VGF HGV+N+DNMSI+G T DYGPF FLD FD F N +D G RY
Sbjct: 229 ERTARLIAKWQAVGFCHGVMNSDNMSIIGDTFDYGPFAFLDDFDAGFVCNHSDHEG-RYA 287
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
+ QP +G N + L ++D+ + RY T + + ++ M KLGL + +
Sbjct: 288 YNRQPQVGFINCQYLANALLP--IMDEDTVRRGLRRYETAYNEHFKHQMLAKLGLEEADG 345
Query: 493 QIISKLLNNMAV---DKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ +++ ++ +VDYT FFR LSN+ D P++ + D +
Sbjct: 346 SDMGLIMDTFSMLHEHRVDYTRFFRGLSNL-------HDHGTAPVRDLFAD-----RSVA 393
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
W+ Y L +ER+ M VNPKY+LRNYL Q I A+ GD+ ++ LLK+
Sbjct: 394 DEWLERYEARLQKETRGHDEREYAMRRVNPKYILRNYLAQQVILEAQNGDYEPMKELLKV 453
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+E+P+DEQP EKYA LPP W + SCSS
Sbjct: 454 LEKPFDEQPEYEKYAALPPDWGKHLNI---SCSS 484
>gi|423123340|ref|ZP_17111019.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5250]
gi|376401971|gb|EHT14572.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5250]
Length = 480
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 215/521 (41%), Positives = 284/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ +EN +LV + +A SL + F + G T L G P
Sbjct: 10 RDELPDFYTALAPTP-LENARLVWHNAPLARSLGVADSLFSPEKGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGSWAGQLGDGRGILLGEQQLADGRRFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +V + V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIREGLASEAMHALGIPTTRALAIVASDTPVYRE-------TAERGAMLMRLAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ H +E L V+ LADY IRHH+ H++N ++KY
Sbjct: 182 HFE-HFYYRREPLK-VQQLADYVIRHHWPHLQN---------------------EADKYI 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 VWFSDVVTRTAEMIASWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + TL + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFDNQPAVGLWNLQRLAQTL--SPFISAELLNGALDGYQHALLTAYGRRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + +++ L MA + DYT FR LS + + PL+ +D
Sbjct: 336 GLFTQQKGDNELLDGLFALMAREGSDYTRTFRMLSASEQASAA------SPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+E + SW Y L + DE+R+ M SVNP VLRN+L Q I+ AE GD E
Sbjct: 388 ---RETFDSWFADYRARLRDELVDDEQRQVRMRSVNPALVLRNWLAQRTIELAEQGDMSE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + +P+ ++ + Y PP W R V SCSS
Sbjct: 445 LERLHNALSQPFIDR--TDDYVNRPPDWGRRLEV---SCSS 480
>gi|417475487|ref|ZP_12170285.1| Selenoprotein O and cysteine [Salmonella enterica subsp. enterica
serovar Rubislaw str. A4-653]
gi|353644109|gb|EHC88148.1| Selenoprotein O and cysteine [Salmonella enterica subsp. enterica
serovar Rubislaw str. A4-653]
Length = 506
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 214/547 (39%), Positives = 296/547 (54%), Gaps = 79/547 (14%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDELAQQLAIPASLFDVTNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRF--------- 236
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMRMGDGRAVL 128
Query: 237 ----ADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGA 292
DG AVLRS+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA
Sbjct: 129 YSRMGDGRAVLRSTIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGA 181
Query: 293 IVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDE 352
++ R+AQS +RFG ++ R + + V+ LAD+AIRH++ +++ +
Sbjct: 182 MLMRLAQSHMRFGHFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE----------- 228
Query: 353 DHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFL 412
KYA W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFL
Sbjct: 229 ----------KYALWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFL 278
Query: 413 DAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTK 472
D +DP F N +D G RY F NQP + LWN+ + + TL I+ N ++RY
Sbjct: 279 DDYDPGFIGNHSDHQG-RYRFDNQPSVALWNLQRLAQTL--TPFIEIDALNRALDRYQDA 335
Query: 473 FMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDE 529
+ Y M +KLG K + ++++L + MA + DYT FR LS+ + +
Sbjct: 336 LLTHYGQRMRQKLGFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS--- 392
Query: 530 LLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVL------ 583
PL+ +D + A+ +W Y L + + D R+ M VNP VL
Sbjct: 393 ---PLRDTFID-----RTAFDAWFDRYRARLRTEAVDDALRQQQMQRVNPAVVLRRAIWL 444
Query: 584 -------RNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGV 636
RN+L Q AIDAAE GD E+ RL +++ +P+ ++ + YA PP W R V
Sbjct: 445 AQRAIDARNWLAQRAIDAAEQGDMAELHRLHEVLRQPFTDRD--DDYASRPPEWGKRLEV 502
Query: 637 CMLSCSS 643
SCSS
Sbjct: 503 ---SCSS 506
>gi|238753662|ref|ZP_04615024.1| hypothetical protein yruck0001_13940 [Yersinia ruckeri ATCC 29473]
gi|238708214|gb|EEQ00570.1| hypothetical protein yruck0001_13940 [Yersinia ruckeri ATCC 29473]
Length = 480
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 212/541 (39%), Positives = 290/541 (53%), Gaps = 66/541 (12%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF 165
++D+S+ R+L G YT++SP+ + +L+ +SES+A LELD F
Sbjct: 3 HFDNSYARQLAG--------------FYTRLSPTP-LSGARLLYYSESLASELELDASWF 47
Query: 166 ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQL 225
++ +G LAG P AQ Y GHQFG+WAGQLGDGR I LGE + + L
Sbjct: 48 SGEKTGVW-TGEQLLAGMDPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRQLDWHL 106
Query: 226 KGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGN 285
KGAG TPYSR DG AVLRS IREFL SEA+H+LG+PT+RAL +VT+ V R+
Sbjct: 107 KGAGLTPYSRMGDGRAVLRSVIREFLASEALHYLGVPTSRALTIVTSEHPVFRE------ 160
Query: 286 PKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESL 345
+ E GA++ RVA+S +RFG ++ R Q D VR LADY I H+
Sbjct: 161 -QPERGAMLLRVAESHVRFGHFEHFYHRQQPDQ--VRQLADYVIARHWPQWVGQ------ 211
Query: 346 SFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTID 405
++ Y AW +V ERTA L+A WQ +GF HGV+NTDNMSILG+T+D
Sbjct: 212 ---------------AHVYLAWFTDVVERTARLIAHWQTLGFAHGVMNTDNMSILGITMD 256
Query: 406 YGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYV 465
YGPFGFLD + P + N +D G RY F NQP + WN+ + +L+ L+ E
Sbjct: 257 YGPFGFLDEYQPEYICNHSDHQG-RYAFDNQPAVAYWNLHRLGQSLSG--LLTSGELQQA 313
Query: 466 MERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKAD 522
++ Y M Y M KLG KQ +++ LL+ M +K DY+ FR LS V+
Sbjct: 314 LDVYEPTLMAAYGQQMRAKLGFFTAEKQDNDLLTDLLSLMQKEKQDYSRTFRRLSQVEQL 373
Query: 523 PSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
+ PL+ +D +EA+ W Y L D +R+ M +VNP +
Sbjct: 374 SAQS------PLRDDFID-----REAFDGWYRRYRLRLQQENRDDAQRQQAMKAVNPALI 422
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCS 642
LRNYL Q AI+ AE D ++RL +++P+ + E A LPP W + SCS
Sbjct: 423 LRNYLAQQAIERAEQEDVSVLKRLHLALQQPFADNADNEDLAALPPDWGKHLDI---SCS 479
Query: 643 S 643
S
Sbjct: 480 S 480
>gi|259908568|ref|YP_002648924.1| hypothetical protein EpC_19180 [Erwinia pyrifoliae Ep1/96]
gi|387871450|ref|YP_005802824.1| hypothetical protein EPYR_02073 [Erwinia pyrifoliae DSM 12163]
gi|224964190|emb|CAX55697.1| conserved uncharacterized protein YdiA [Erwinia pyrifoliae Ep1/96]
gi|283478537|emb|CAY74453.1| UPF0061 protein ECA1842 [Erwinia pyrifoliae DSM 12163]
Length = 479
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 211/518 (40%), Positives = 283/518 (54%), Gaps = 52/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L+ CYT + P+ ++N +L+ + +A L LD + F + L+ P G P AQ
Sbjct: 11 LNGCYTALQPTP-LKNARLLYHNAGLARELGLDERLFNAQNAGLWGGERLP-DGMQPLAQ 68
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR + LGE +++ LKGAG TPYSR DG AVLRS++R
Sbjct: 69 VYSGHQFGVWAGQLGDGRGMLLGEQQLPDGRKFDWHLKGAGLTPYSRMGDGRAVLRSTLR 128
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EF+ EAMH LGI T+RAL +V + + V R+ E GA++ RVA+S +RFG ++
Sbjct: 129 EFIAGEAMHHLGIATSRALTVVGSDEPVYRE-------TTETGAMLLRVAESHVRFGHFE 181
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
+GQ + V LADY IRHH+ +KY W
Sbjct: 182 HFYYQGQP--EKVTQLADYVIRHHWPQWVQ---------------------ERDKYLLWF 218
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V +RTA L+A WQ +GF HGV+NTDNMSILGLT+DYGPFGFLD + P F N +D G
Sbjct: 219 SDVVQRTARLIAGWQSIGFAHGVMNTDNMSILGLTLDYGPFGFLDDYQPEFICNHSDHQG 278
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP IGLWN+ + + L+ L+ ++ + Y + M + M KLGL
Sbjct: 279 -RYSFENQPMIGLWNLNRLAHALSG--LMSPQQLEQALAGYEPELMRCWGEKMRAKLGLL 335
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + I++ LL+ M + DYT FR LS + S PL+ +D
Sbjct: 336 IPGKDDNHILTGLLSLMTREGSDYTRTFRQLSQSEQLQSRS------PLRDEFID----- 384
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ SW + Q LL SDEER+ M NP +LRNYL Q AI+ AE D + R
Sbjct: 385 RDAFDSWYNVWRQRLLKEECSDEERQRTMKLANPALILRNYLAQQAIERAEQEDISVLAR 444
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + + RPYDE P AR PP W + V SCSS
Sbjct: 445 LHQALSRPYDEAPEFADLARRPPDWGKKLEV---SCSS 479
>gi|221116553|ref|XP_002164964.1| PREDICTED: selenoprotein O-like [Hydra magnipapillata]
Length = 634
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 191/433 (44%), Positives = 259/433 (59%), Gaps = 31/433 (7%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ +L+ LN+D+ +R LP D T + R V+ AC++ V P+ VENP +VA+S L
Sbjct: 31 MSSLKSLNFDNLALRTLPIDKETSNQTRTVVGACFSLVKPTP-VENPVVVAYSPEALALL 89
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+ K+ E DF +FSG L G+ A CY GHQFG ++GQLGDG A+ LGE++N
Sbjct: 90 GIKEKDLEADDFKDYFSGNQLLNGSQSAAHCYCGHQFGYFSGQLGDGAAMYLGEVVNDAG 149
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+RWELQLKGAG TPYSR ADG VLRSSIREFLCSEAM +LG+PTTRA +T+ V R
Sbjct: 150 QRWELQLKGAGLTPYSRNADGRKVLRSSIREFLCSEAMFYLGVPTTRAGSCITSDTRVVR 209
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQE---------DLDIVRTLADYAI 329
D+FYDGNP E IV R+A SF+RFGS++I +E DI+ TL +Y +
Sbjct: 210 DIFYDGNPIMERCTIVSRIAPSFIRFGSFEIFKPLDRETGRVGPSVGKDDILHTLLEYVV 269
Query: 330 RHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTH 389
+ I + +G+++ + +D E+ RTA +VA+WQ VGF H
Sbjct: 270 STFYPEIWQTH--------SGNKEKAYLDFFK--------EIVRRTAFMVAKWQCVGFCH 313
Query: 390 GVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFST 449
GVLNTDNMSI+G+TIDYGPFGF+D F+ F N +D G RY + QP+I WN+ + +
Sbjct: 314 GVLNTDNMSIIGVTIDYGPFGFMDYFNSDFICNASDTNG-RYSYKKQPEICKWNLLKLAE 372
Query: 450 TLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDK 506
+ A + DK + E Y ++F + Y M +KLGL N +++I LL M
Sbjct: 373 AIKNAVPL-DKTKEIINEIYDSEFRESYYKGMREKLGLKTNNVNDEKLIQNLLYTMQQSA 431
Query: 507 VDYTNFFRALSNV 519
D+TN F LS V
Sbjct: 432 SDFTNTFLILSGV 444
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 58/103 (56%), Gaps = 12/103 (11%)
Query: 546 KEAWISWVLSYIQELLSSGIS-------DEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+ W W+ SY + ++S +E RK LM SVNP+++LRN+L Q AI+ AE G
Sbjct: 531 RNLWKDWLKSYRERIMSESEGSVLLEEYEEHRKQLMFSVNPRFILRNHLAQEAIEDAERG 590
Query: 599 DFGEVRRLLKLMERP-----YDEQPGMEKYARLPPAWAYRPGV 636
D+ +VR LL+L+ +P Y+ Q KY PAWA R V
Sbjct: 591 DYTKVRELLQLLRKPYLKDLYESQIATNKYDNAAPAWACRLRV 633
>gi|423198735|ref|ZP_17185318.1| hypothetical protein HMPREF1171_03350 [Aeromonas hydrophila SSU]
gi|404629925|gb|EKB26650.1| hypothetical protein HMPREF1171_03350 [Aeromonas hydrophila SSU]
Length = 475
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 205/468 (43%), Positives = 257/468 (54%), Gaps = 53/468 (11%)
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
PL G P AQ Y GHQFG ++ +LGDGRA+ LGE L RW+L LKGAGKTP+SRF D
Sbjct: 58 PLPGMQPVAQVYAGHQFGGYSPRLGDGRALLLGEQLAPDGSRWDLHLKGAGKTPFSRFGD 117
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRSSIRE+L SEA+H LGIPTTRAL LV + + V R+ + E GA V R A
Sbjct: 118 GRAVLRSSIREYLASEALHALGIPTTRALVLVGSQEPVYRE-------QVETGATVLRTA 170
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
S LRFG ++ A GQ + + L DY +RHHF + +
Sbjct: 171 PSHLRFGHFEYFAWSGQG--EKIPALIDYLLRHHFPELADG------------------- 209
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
A EV RTA L+A+WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P
Sbjct: 210 ------AELFAEVVRRTARLIAKWQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPD 263
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N +D PG RY QP +G WN+ + + LA +D + +Y + M Y
Sbjct: 264 FVCNHSD-PGGRYALDQQPAVGYWNLQKLAQALAGH--VDGDALASALAQYEHQLMLHYS 320
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
+M KLGL + ++ + +L +A KVDY F R L + P L
Sbjct: 321 ELMRAKLGLAVWEEEDPALFRELFRLLAAHKVDYHLFLRRLGALTVQGDWPAS-----LL 375
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
A+L D AW W+ +Y L G D RK LM++VNPKYVLRN L Q I+AA
Sbjct: 376 ALLPD-----PAAWQGWLEAYRARLSREGSEDAVRKGLMDAVNPKYVLRNALAQRVIEAA 430
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GD RL ++ PYDEQP E A PAW Y G LSCSS
Sbjct: 431 ERGDMAPFERLFAALQHPYDEQPEYEDLATPQPAW-YCGG--ELSCSS 475
>gi|423108807|ref|ZP_17096502.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5243]
gi|376383001|gb|EHS95729.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5243]
Length = 480
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 283/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ +EN +LV + +A L + F + G L G P
Sbjct: 10 RDELPDFYTALAPTP-LENTRLVWHNAPLAQELGIPESLFNLDKGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGSWAGQLGDGRGILLGEQQLADGRRVDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +V + V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIREGLASEAMHALGIPTTRALAMVASDTPVYRETV-------EQGAMLMRLAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ H++N ++KY
Sbjct: 182 HFEHFYYR--REPQKVQLLADYVIRHHWPHLQN---------------------EADKYI 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 VWFRDVVTRTAEMIASWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F +QP +GLWN+ + + L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFDHQPAVGLWNLQRLAQAL--SPFISAEALNGALDDYQHALLTAYGRRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + +++ L M + DYT FR LS + D + PL+ +D
Sbjct: 336 GLFTEQKGDNELLDGLFTLMEREGNDYTRTFRMLSLSEQDSAA------TPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+E + SW +Y L I D +R+A M SVNP VLRN+L Q AI+ AE GD E
Sbjct: 388 ---RERFDSWFAAYRARLRDEQIDDAQRQAQMRSVNPAIVLRNWLAQRAIEQAEQGDMRE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + +P+ ++ ++Y++ PP W R V SCSS
Sbjct: 445 LERLHSALSQPFVDR--TDEYSQRPPDWGKRLEV---SCSS 480
>gi|167586949|ref|ZP_02379337.1| hypothetical protein BuboB_16527 [Burkholderia ubonensis Bu]
Length = 525
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 220/538 (40%), Positives = 291/538 (54%), Gaps = 70/538 (13%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL----AGAV 184
L A + P+A + P +V +S+ VA L L P F F+G P A A+
Sbjct: 35 LGAAFHTRLPAAPLPAPYVVGFSDEVARLLGLPAALAGHPQFAELFAG-NPTRDWPAEAM 93
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
YA Y GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG+G+TPYSR DG AVLR
Sbjct: 94 SYASVYSGHQFGVWAGQLGDGRALTIGELDGTDGRRYELQLKGSGRTPYSRMGDGRAVLR 153
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSEAMH LGIPTTRAL ++ + V R+ E A+V RV++SF+RF
Sbjct: 154 SSIREFLCSEAMHHLGIPTTRALTVIGSDAPVVREEI-------ETSAVVTRVSESFVRF 206
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ S + DL +R LAD+ I + + + + Y
Sbjct: 207 GHFEHFFSNDRPDL--LRALADHVIERFYPACRDAD---------------------DPY 243
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
A RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD + N +
Sbjct: 244 LALLEAATLRTADLVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFVDAFDANHICNHS 303
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERY 469
D G RY + QP I WN + L A + ++D +A V+ ++
Sbjct: 304 DTHG-RYAYRMQPRIAHWNCYCLAQALLPLIGLQHDIADDDARAERAVEDAQA--VLAKF 360
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSI 525
+F + +M KLGL + + ++LL M D+T FR LS + K D S
Sbjct: 361 PERFGPALERLMRAKLGLEAERDGDAALANQLLEVMHASHADFTLTFRHLSQLSKHDASR 420
Query: 526 PEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRN 585
P++ + +D ++A+ +W Y L D R A MN VNPKYVLRN
Sbjct: 421 D-----APVRDLFID-----RDAFDAWANLYRARLSEEARDDAARAAAMNRVNPKYVLRN 470
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+L + AI A+ DF EV RL +++ RP+DEQP YA LPP WA G +SCSS
Sbjct: 471 HLAEIAIRHAKEKDFSEVERLAQVLRRPFDEQPEYASYAALPPDWA---GSLEVSCSS 525
>gi|423114827|ref|ZP_17102518.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5245]
gi|376383702|gb|EHS96429.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5245]
Length = 480
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 211/521 (40%), Positives = 284/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ +EN +LV + +A L + F + G L G P
Sbjct: 10 RDELPDFYTALSPTP-LENARLVWHNAPLAQELGIPESLFNLDKGAGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGSWAGQLGDGRGILLGEQQLADGRRVDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +V + V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIREGLASEAMHALGIPTTRALAMVASDTPVYRETV-------EQGAMLMRLAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ H++N ++KY
Sbjct: 182 HFEHFYYR--REPQKVQLLADYVIRHHWPHLQN---------------------EADKYI 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 VWFRDVVTRTAEMIASWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F +QP +GLWN+ + + L + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFDHQPAVGLWNLQRLAQAL--SPFISAEALNGALDDYQHALLTAYGRRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + +++ L + M + DYT FR LS + + + PL+ +D
Sbjct: 336 GLLTQQKGDNELLDGLFSLMEREGSDYTRTFRMLSLSEQESAA------TPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+E + SW +Y L I D +R+A M SVNP VLRN+L Q AI+ AE GD E
Sbjct: 388 ---RERFDSWFAAYRARLRDEQIDDAQRQAQMRSVNPAIVLRNWLAQRAIEQAEQGDMRE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + +P+ ++ ++Y++ PP W R V SCSS
Sbjct: 445 LERLHSALSQPFVDR--TDEYSQRPPDWGKRLEV---SCSS 480
>gi|90579729|ref|ZP_01235538.1| hypothetical protein VAS14_02166 [Photobacterium angustum S14]
gi|90439303|gb|EAS64485.1| hypothetical protein VAS14_02166 [Photobacterium angustum S14]
Length = 487
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 206/513 (40%), Positives = 281/513 (54%), Gaps = 50/513 (9%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T V+P + NP L++ ++ +A LELD + DF FSG L+G P A Y GH
Sbjct: 22 TFVTPQP-LSNPYLISVNQHIAKLLELDINAIQSDDFINIFSGNDTLSGFDPIAMKYTGH 80
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG + LGDGR + LGE+ ++W++ LKG+G TPYSR DG AV+RSSIRE+L S
Sbjct: 81 QFGQYNPDLGDGRGLLLGEVQTSNGKKWDIHLKGSGLTPYSRMGDGRAVIRSSIREYLAS 140
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
AM LGIPT+ AL ++ + V R+ K+E GA + RV++S +RFG ++
Sbjct: 141 AAMAGLGIPTSHALAVIGSDTHVYRE-------KQEFGATLIRVSESHIRFGHFEYLFYT 193
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
Q D +R LADY I+HHF + + K YAA +V E
Sbjct: 194 QQHDQ--LRLLADYVIQHHFPECQQVEK---------------------PYAALFEQVCE 230
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
TA ++A WQ VGF HGV+NTDNMSILGLT DYGP+GFLD ++P + N +D G RY F
Sbjct: 231 NTAKMIAHWQAVGFAHGVMNTDNMSILGLTFDYGPYGFLDDYNPGYICNHSDYSG-RYAF 289
Query: 434 ANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKY 490
QP IGLWN++ LA +ID + + +E Y + Y +M +KLGL +
Sbjct: 290 NQQPSIGLWNLSALGYALAP--IIDKSDIEHALEIYQHQLQMHYSKLMRQKLGLFDSQEQ 347
Query: 491 NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWI 550
+ ++ +L N + +DYT FFR LS + D L A I +
Sbjct: 348 DNELFQQLFNLLKQQSIDYTQFFRTLSTLSQDELHNTSSHFSSLTANTTPIDE------- 400
Query: 551 SWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLM 610
W++ Y + + S +D++R ALM NPKY+LRNYL Q AID AE G+F V LL ++
Sbjct: 401 -WLVDYKKRI--SNTNDQQRLALMLKSNPKYILRNYLAQLAIDGAEQGNFTFVENLLTVL 457
Query: 611 ERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
P+DE P E A LPP W +SCSS
Sbjct: 458 HDPFDEHPNFEDLADLPPKWGKE---LEISCSS 487
>gi|350569951|ref|ZP_08938328.1| SelO family protein [Neisseria wadsworthii 9715]
gi|349797526|gb|EGZ51284.1| SelO family protein [Neisseria wadsworthii 9715]
Length = 489
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 206/517 (39%), Positives = 290/517 (56%), Gaps = 52/517 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y +V+ + + +P VA + +A +L L F+ P+ +G+ P A Y G
Sbjct: 19 YARVN-TEPLGDPYWVAQNHDLAAALNLLNDFFDAPETLAMLAGSAKKYVPQPLASVYSG 77
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ QLGDGRA+ LG + + + WE QLKGAGKTP+SRFADG AVLRSSIRE+LC
Sbjct: 78 HQFGVYVPQLGDGRAVLLGRSEDAQGKAWEWQLKGAGKTPFSRFADGRAVLRSSIREYLC 137
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM+ LGIPTTRALC+ + V R+ E A+V R+A SF+RFG ++
Sbjct: 138 SEAMYGLGIPTTRALCITGSNDAVFRE-------TPETAAVVTRIAPSFIRFGHFEYFYH 190
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
+G + ++ LAD+ IR+HF ++ Y A ++
Sbjct: 191 KGMH--EYLQPLADFLIRYHFPECTQADQ---------------------PYLALLQTIS 227
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA LVA WQ VGF HGVLNTDNMS LGLTIDYGPFGFLDA+D N +D G RY
Sbjct: 228 ERTADLVAAWQAVGFCHGVLNTDNMSALGLTIDYGPFGFLDAYDRRHVCNHSD-SGGRYA 286
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP---K 489
+ QP + WN+++ ++ L ++ V++ + + + Y M KLGL K
Sbjct: 287 YNEQPYVVHWNLSRLASCF--LPLCEEAGLVAVLDAFPNLYRNAYLKNMRAKLGLQTERK 344
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALS---NVKADPSIPEDELLVPLKAVLLDIGKERK 546
++++I+ + N + +VD+T FFR LS N +P VP K L G++
Sbjct: 345 EDEELITDMFNVLQGRRVDFTLFFRHLSETGNTHGEP--------VPPKLAAL-FGEQNM 395
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
E + SW+ Y L + + R A MNSVNP YVLRNYL + AI+ A+ G FGE+ RL
Sbjct: 396 EGFTSWLGGYRTRLRAENSGPQARAARMNSVNPLYVLRNYLAEQAIEQAKQGHFGEIERL 455
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + P++E+ +A+ P WA G+C +SCSS
Sbjct: 456 RRCLASPFEERAEFADFAQPAPEWA--AGIC-VSCSS 489
>gi|332161632|ref|YP_004298209.1| hypothetical protein YE105_C2010 [Yersinia enterocolitica subsp.
palearctica 105.5R(r)]
gi|386308250|ref|YP_006004306.1| selenoprotein O [Yersinia enterocolitica subsp. palearctica Y11]
gi|418241715|ref|ZP_12868239.1| hypothetical protein IOK_09973 [Yersinia enterocolitica subsp.
palearctica PhRBD_Ye1]
gi|433549711|ref|ZP_20505755.1| Selenoprotein O and cysteine-containing homologs [Yersinia
enterocolitica IP 10393]
gi|318605876|emb|CBY27374.1| selenoprotein O and cysteine-containing homologs [Yersinia
enterocolitica subsp. palearctica Y11]
gi|325665862|gb|ADZ42506.1| hypothetical protein YE105_C2010 [Yersinia enterocolitica subsp.
palearctica 105.5R(r)]
gi|330864109|emb|CBX74180.1| UPF0061 protein YpsIP31758_1734 [Yersinia enterocolitica W22703]
gi|351778834|gb|EHB20967.1| hypothetical protein IOK_09973 [Yersinia enterocolitica subsp.
palearctica PhRBD_Ye1]
gi|431788846|emb|CCO68795.1| Selenoprotein O and cysteine-containing homologs [Yersinia
enterocolitica IP 10393]
Length = 499
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 212/533 (39%), Positives = 287/533 (53%), Gaps = 52/533 (9%)
Query: 114 ELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF 173
EL P+ + + L YT + P+ ++ +L+ SE +A LELD F P ++
Sbjct: 16 ELDNSPQFSNSYGQQLSGFYTHLQPTP-LKGARLLYHSEPLARELELDTSWFSDPKAAVW 74
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
+G L G P AQ Y GHQFG WAGQLGDGR I LGE + LKGAG TPY
Sbjct: 75 -AGEMLLPGMEPLAQVYSGHQFGQWAGQLGDGRGILLGEQKLSDGRHMDWHLKGAGLTPY 133
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AVLRS +REFL SEA+H LG+PT+RAL +VT+ V R+ + E GA+
Sbjct: 134 SRMGDGRAVLRSVVREFLASEALHHLGVPTSRALTIVTSDHPVYRE-------QAERGAM 186
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ RVA+S +RFG ++ R Q V+ LADY I H+ G E+
Sbjct: 187 LLRVAESHVRFGHFEHFYYRQQPAQ--VKQLADYVIARHWPQW------------VGQEE 232
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
Y W +V +RTA L+A WQ +GF HGV+NTDNMSILG+T+DYGPFGFLD
Sbjct: 233 ---------CYLLWFTDVVKRTARLMAHWQTIGFAHGVMNTDNMSILGITMDYGPFGFLD 283
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKF 473
+ P + N +D G RY F NQP + LWN+ + L+ L+ + +E Y +
Sbjct: 284 DYVPDYICNHSDHQG-RYAFDNQPAVALWNLHRLGQALSG--LLSTTQLQQALEAYEPEL 340
Query: 474 MDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
M Y M KLG + + Q +++ LL+ M + DYT FR LS V+ +
Sbjct: 341 MAAYGQQMRAKLGFFESDSQDNELLTGLLSLMIKEGRDYTRTFRLLSEVETHSA------ 394
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
L PL+ + + A+ SW Y L I D +R+ M +VNPKY+LRNYL Q
Sbjct: 395 LSPLRDDFIG-----RAAFDSWYSRYRARLQQEQIDDAQRQQAMRAVNPKYILRNYLAQL 449
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AID AE D ++RL + +++P+ EQP + A LPP W +SCSS
Sbjct: 450 AIDHAEKDDIQPLQRLHQALQQPFAEQPELNDLAALPPDWGKH---LEISCSS 499
>gi|358448322|ref|ZP_09158826.1| hypothetical protein KYE_03545 [Marinobacter manganoxydans MnI7-9]
gi|357227419|gb|EHJ05880.1| hypothetical protein KYE_03545 [Marinobacter manganoxydans MnI7-9]
Length = 484
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 204/529 (38%), Positives = 289/529 (54%), Gaps = 52/529 (9%)
Query: 118 DPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGA 177
D R + E+ + YT+V PS +++P++V ++ +A+ + + D+ +G+
Sbjct: 5 DFRIEHRYLELPDSFYTRVQPSP-LKDPKMVCFNHKLAEQMGF--RADAESDWTGVGAGS 61
Query: 178 TPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFA 237
L G P A Y GHQFG++ +LGDGR + L E + RW+ LKGAG TPYSRF
Sbjct: 62 ELLEGMDPVAMKYTGHQFGVYNPELGDGRGLLLWETIGPDGRRWDWHLKGAGMTPYSRFG 121
Query: 238 DGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRV 297
DG AVLRS+IRE+LCSEAM+ LGIPTTRAL +V+ V R+ E A + RV
Sbjct: 122 DGRAVLRSTIREYLCSEAMYGLGIPTTRALFMVSARDPVRRESI-------ETAAALVRV 174
Query: 298 AQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVV 357
A++ +RFG ++ A E + V+TL ++ I HF H+ N+ E
Sbjct: 175 AETHIRFGHFEFAAH--HEGPETVKTLLEHVISLHFPHLINLPDDE-------------- 218
Query: 358 DLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDP 417
+Y+ W EV ERTA +A WQ VGF HGV+N+DNMSI+G T DYGP+ FLD FD
Sbjct: 219 -----RYSRWFEEVVERTARTIADWQAVGFCHGVMNSDNMSIIGDTFDYGPYAFLDDFDA 273
Query: 418 SFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEY 477
+ N TD G RY + QP +G N +T L ++++ + + RY + + +
Sbjct: 274 GYISNHTD-QGGRYAYNRQPQVGFENCRYLATALLP--VMEEDDVRRGLRRYEVAYNERF 330
Query: 478 QAIMTKKLGLPKYNKQIISKLLNN---MAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
M KLGL ++ +S +++ M VDYT FFRALSN+ + P +L V
Sbjct: 331 LQNMQDKLGLAIEDEADLSLIMDTFSMMHEHHVDYTAFFRALSNLHSHGHGPVRDLFVD- 389
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ W+ Y + LL + +ER+ M SVNPKYVLRNYL Q I
Sbjct: 390 -----------RSVADQWLERYEERLLYETRAHDEREFAMRSVNPKYVLRNYLAQQVIQE 438
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A+ GD+ ++ LLK++ERPYDEQP + YA LPP W + SCSS
Sbjct: 439 AQNGDYEPMKALLKVLERPYDEQPENDAYAALPPDWGKHLNI---SCSS 484
>gi|420366600|ref|ZP_14867437.1| hypothetical protein SF123566_7855 [Shigella flexneri 1235-66]
gi|391324116|gb|EIQ80727.1| hypothetical protein SF123566_7855 [Shigella flexneri 1235-66]
Length = 480
Score = 342 bits (876), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 286/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ ++N +++ ++++A L + F+ + G + L G P
Sbjct: 10 RDELPATYTALSPTP-LKNARIIWHNDALAAHLGIPAALFDVSGGAGVWGGESLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQLLANGTTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVAQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSETPVQRE-------TTEAGAMLIRVAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ + ++KY
Sbjct: 182 HFEHFYYR--REPEKVRQLADFAIRHYWPQWQE---------------------EADKYQ 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD + P F N +D
Sbjct: 219 LWFTDVVTRTATLMADWQAVGFAHGVMNTDNMSILGLTMDYGPFGFLDDYVPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQ LWN+ + + TL+ +D N ++ Y + Y M +KL
Sbjct: 279 HQG-RYSFDNQTAAALWNLQRLAQTLSPFIPVD--VLNAALDGYQQALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + +I+S+L + MA + DYT FR LS + + L PL+ +D
Sbjct: 336 GFFSEQKNDNEILSELFSLMAREGSDYTRTFRMLSQTE------QHSTLSPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L ++D R+A M + NP VLRN+L Q AI AE GD+ E
Sbjct: 388 ---RAAFDDWFTRYRTRLQQDNVADVVRQAQMKTANPAMVLRNWLAQRAISQAEQGDYTE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y PP W R V SCSS
Sbjct: 445 LHRLHAALRTPFIDRD--DDYISRPPDWGKRLEV---SCSS 480
>gi|421497328|ref|ZP_15944500.1| hypothetical protein B224_002628 [Aeromonas media WS]
gi|407183674|gb|EKE57559.1| hypothetical protein B224_002628 [Aeromonas media WS]
Length = 475
Score = 342 bits (876), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 213/525 (40%), Positives = 281/525 (53%), Gaps = 57/525 (10%)
Query: 122 DSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLA 181
++ E+ AC V+P + P+L+ ++ + L LD D+ PL
Sbjct: 5 NTFATELSWAC-EPVAPQP-LREPRLLHLNQGLLRELGLD--GIGEADWLACCGLGQPLP 60
Query: 182 GAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLA 241
G P AQ Y GHQFG ++ +LGDGRA+ LGE L +RW+L LKGAGKTP+SRF DG A
Sbjct: 61 GMQPVAQVYAGHQFGGYSPRLGDGRALLLGEQLAPDGQRWDLHLKGAGKTPFSRFGDGRA 120
Query: 242 VLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSF 301
VLRSSIRE+L SEA+H LGIPTTRAL LV + + V R+ + E GA V R A S
Sbjct: 121 VLRSSIREYLASEALHALGIPTTRALVLVGSDEPVYRE-------QVESGATVLRTAPSH 173
Query: 302 LRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTS 361
LRFG ++ A GQ + + L +Y +RHHF +E+
Sbjct: 174 LRFGHFEYFAWSGQG--EKIPALINYLLRHHFPELESG---------------------- 209
Query: 362 NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTP 421
A EV RTA L+A+WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P F
Sbjct: 210 ---AELFAEVVRRTARLIAKWQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPDFVC 266
Query: 422 NTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIM 481
N +D PG RY QP +G WN+ + + L A+ +D + +Y + M Y +M
Sbjct: 267 NHSD-PGGRYALDQQPAVGYWNLQKLAQAL--AEQVDGDALAAALAQYEHQLMLHYSELM 323
Query: 482 TKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
+LGL + + + +L +A +VDY F R L + P L A+L
Sbjct: 324 RARLGLETWEDEDPALFRQLFQLLAAHRVDYHLFLRRLGELTTQGEWP-----ASLLALL 378
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
D AW W+ +Y L+ G D RK M++VNPKYVLRN L Q IDAAE G
Sbjct: 379 PD-----PAAWQEWLETYRARLVREGSQDAARKVRMDAVNPKYVLRNALAQQVIDAAETG 433
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL ++RPYDEQP E A P W Y G LSCSS
Sbjct: 434 NMAPFERLFAALQRPYDEQPEYEDLATPVPQW-YCGG--ELSCSS 475
>gi|421745987|ref|ZP_16183813.1| hypothetical protein B551_04536 [Cupriavidus necator HPC(L)]
gi|409775504|gb|EKN56984.1| hypothetical protein B551_04536 [Cupriavidus necator HPC(L)]
Length = 515
Score = 342 bits (876), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 209/500 (41%), Positives = 276/500 (55%), Gaps = 67/500 (13%)
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
PDF F G A P A Y GHQFG+WAGQLGDGRAI + E WE+QLKG
Sbjct: 59 PDFAEIFIGNRVPDWADPLATVYSGHQFGVWAGQLGDGRAIRIAEAQTANGP-WEIQLKG 117
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
+GKTPYSR DG AVLRSSIRE+LCSEAM LGIPTTRALC+V + V R+
Sbjct: 118 SGKTPYSRMGDGRAVLRSSIREYLCSEAMAALGIPTTRALCIVGSDAPVRRETI------ 171
Query: 288 EEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSF 347
E A+V R+A +F+RFG ++ A+ +D+ +R LAD+ I + E++S
Sbjct: 172 -ETAAVVTRLAPTFIRFGHFEHFAA--HDDVAALRQLADFVIDRFMPECRDSAGGETIS- 227
Query: 348 STGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYG 407
Y A EV+ RTA L+AQWQ VGF HGV+NTDNMSILGLTIDYG
Sbjct: 228 ---------------PYQALLREVSLRTADLMAQWQAVGFCHGVMNTDNMSILGLTIDYG 272
Query: 408 PFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL-IDDKEA---- 462
PFGFLDAFD + N +D G RY ++ QP +G WN+ + L + +D +A
Sbjct: 273 PFGFLDAFDANHICNHSDTQG-RYAYSQQPQVGFWNLHCLAQALLPLWIEREDGQAPTEA 331
Query: 463 -------------NYVMERYGTKFMDEYQAIMTKKLGLPK------YNKQIISKLLNNMA 503
+ +RY +F Y+A KLGL ++ +++ L +
Sbjct: 332 AKEAAIEAAHAGLDPFRDRYAQRFFQLYRA----KLGLASADIDHAADEALLTDLFRLLH 387
Query: 504 VDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSS 563
+VDYT F+R L+ + S + P++ + +D + W +W Y L +
Sbjct: 388 TQRVDYTLFWRNLARI----SSADGSRDAPVRDLFMD-----RAGWDAWAERYRARLRAE 438
Query: 564 GISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKY 623
D R A M +VNPKYVLRN+L + AI A+ DF EV+RLL ++ RP+DEQP E Y
Sbjct: 439 NSDDAGRAASMLAVNPKYVLRNHLAEVAIQRAKEKDFSEVQRLLAVLSRPFDEQPEAESY 498
Query: 624 ARLPPAWAYRPGVCMLSCSS 643
A LPP WA G+ +SCSS
Sbjct: 499 AALPPDWA--SGI-EVSCSS 515
>gi|109900258|ref|YP_663513.1| hypothetical protein Patl_3959 [Pseudoalteromonas atlantica T6c]
gi|121957895|sp|Q15NS9.1|Y3959_PSEA6 RecName: Full=UPF0061 protein Patl_3959
gi|109702539|gb|ABG42459.1| protein of unknown function UPF0061 [Pseudoalteromonas atlantica
T6c]
Length = 480
Score = 342 bits (876), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 201/542 (37%), Positives = 293/542 (54%), Gaps = 65/542 (11%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
+N DHS+ L GD + P V NPQLV + ++ D+L+L
Sbjct: 1 MNLDHSYATHL-GDLGALTKP--------------LRVANPQLVEVNHTLRDALQLPASW 45
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F + G T +AQ YGGHQFG W LGDGR + LGE + + W+L
Sbjct: 46 FTQSSIMSMLFGNTSSFTTHSFAQKYGGHQFGGWNPDLGDGRGVLLGEAKDKFGKSWDLH 105
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSRFADG AVLRS++RE+L SEA+H +GIPT+RALCL+T+ + V R+
Sbjct: 106 LKGAGPTPYSRFADGRAVLRSTLREYLASEALHHMGIPTSRALCLITSDEPVYRE----- 160
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
K+E A++ RV+QS +RFG ++ G +LD ++ L DY HHF
Sbjct: 161 --KQEKAAMMIRVSQSHIRFGHFEYFYHNG--ELDKLKRLFDYCFEHHF----------- 205
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
S + + + A ++ TA+L+A+WQ GF HGV+NTDNMSI G+T
Sbjct: 206 ----------SACLHSESPHLAMLEKIVTDTATLIAKWQAYGFNHGVMNTDNMSIHGITF 255
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
D+GP+ FLD F+P F N +D G RY F QP +GLWN+ + A + ++
Sbjct: 256 DFGPYAFLDDFNPKFVCNHSDHRG-RYAFEQQPSVGLWNLNALAH--AFTPYLSVEQIKG 312
Query: 465 VMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ +Y M E+ +M +KLGL + +++++ L+ + DK DY FR L V
Sbjct: 313 ALSQYEASLMAEFSQLMRQKLGLYENTQNTAELVNRWLDLIYQDKRDYHISFRLLCEVDE 372
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
E++ LV + D K +W+ Y L++ G+ +ER+A M ++NP+Y
Sbjct: 373 H---GENQPLVD-HFIQRDTAK-------TWLEHYQNALITQGVKRQERQANMRNINPEY 421
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSC 641
VLRNY Q AIDAA+ GDF R+LL +++ P++ +P ++A+ PP W +SC
Sbjct: 422 VLRNYQAQLAIDAAQNGDFSRFRKLLHVLQHPFESKPEYAEFAKPPPNWGKH---MEISC 478
Query: 642 SS 643
SS
Sbjct: 479 SS 480
>gi|238791683|ref|ZP_04635320.1| hypothetical protein yinte0001_13960 [Yersinia intermedia ATCC
29909]
gi|238728787|gb|EEQ20304.1| hypothetical protein yinte0001_13960 [Yersinia intermedia ATCC
29909]
Length = 503
Score = 342 bits (876), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 209/533 (39%), Positives = 289/533 (54%), Gaps = 52/533 (9%)
Query: 114 ELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF 173
E P+ ++ + L YT + P+ + L+ S +A L LD F P ++
Sbjct: 20 EFEDAPQFNNSYGQQLSGFYTYLQPTP-LRGAHLLYHSAPLAQELGLDESWFSLPKAAIW 78
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
+G L+G P AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPY
Sbjct: 79 -AGEALLSGMEPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRSMDWHLKGAGLTPY 137
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AVLRS +REFL SEA+H LGIPT+RAL +VT+ V R+ + E GA+
Sbjct: 138 SRMGDGRAVLRSVVREFLASEALHHLGIPTSRALTIVTSEHPVYRE-------QAERGAM 190
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ RVA+S +RFG ++ R Q V+ LADY I H+ + + ++E
Sbjct: 191 LLRVAESHVRFGHFEHFYYRQQPAQ--VKQLADYVIARHWP--QCVGQAEC--------- 237
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
Y W +V +RTA L+AQWQ +GF HGV+NTDNMSILG+T+DYGPFGFLD
Sbjct: 238 ----------YLLWFTDVVKRTARLIAQWQTIGFAHGVMNTDNMSILGITMDYGPFGFLD 287
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKF 473
+ P + N +D G RY F NQP + LWN+ + L+ L+ ++ + Y +
Sbjct: 288 DYVPGYICNHSDHQG-RYAFDNQPAVALWNLHRLGQALSG--LMSVEQLQLALSAYEPEL 344
Query: 474 MDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
M Y M KLG + + Q ++++LL+ M + DYT FR LS V+ +
Sbjct: 345 MAAYGQQMRAKLGFVESSSQDNELLTELLSLMTQEGRDYTRTFRLLSQVEMHSAQS---- 400
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
PL+ +D + + SW Y L I D +R+ LM +VNPKY+LRNYL Q
Sbjct: 401 --PLRDDFID-----RAGFDSWYSRYRARLQQEPIDDAQRQYLMKAVNPKYILRNYLAQQ 453
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AID AE D ++RL + ++ P+ EQP + A+LPP W +SCSS
Sbjct: 454 AIDHAEKDDIQPLQRLHQALQHPFAEQPEFDDLAKLPPDWGKH---LEISCSS 503
>gi|330445879|ref|ZP_08309531.1| conserved hypothetical protein [Photobacterium leiognathi subsp.
mandapamensis svers.1.1.]
gi|328490070|dbj|GAA04028.1| conserved hypothetical protein [Photobacterium leiognathi subsp.
mandapamensis svers.1.1.]
Length = 487
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 214/514 (41%), Positives = 286/514 (55%), Gaps = 52/514 (10%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T V+P + NP L++ + +VA LELD DF FSG LAG P A Y GH
Sbjct: 22 TFVTPQP-LTNPYLISINPNVAKQLELDVNSLNNSDFINIFSGNDTLAGFDPIAMKYTGH 80
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG + LGDGR + LGE+ + ++W+L LKG+G TPYSR DG AV+RSSIRE+L S
Sbjct: 81 QFGQYNPDLGDGRGLLLGEVQTSQGKKWDLHLKGSGLTPYSRMGDGRAVIRSSIREYLAS 140
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHAS 312
AM LGIPTT AL ++ + V R+ K+E GA + RVA+S LRFG ++ + +
Sbjct: 141 AAMAGLGIPTTYALAVIGSDTHVYRE-------KQEFGATLIRVAESHLRFGHFEYLFYT 193
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
+ E L + LADY I+HHF ++ K YAA ++
Sbjct: 194 QQHEQLTL---LADYVIQHHFPELQQAEK---------------------PYAAMFEQIC 229
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
TA ++A WQ VGF HGV+NTDNMSILGLT DYGP+GFLD ++PSF N +D G RY
Sbjct: 230 SNTAEMIAHWQAVGFAHGVMNTDNMSILGLTFDYGPYGFLDDYNPSFICNHSDYSG-RYA 288
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
F QP IGLWN++ LA +ID + + +E Y + Y +M KLGL ++
Sbjct: 289 FNQQPSIGLWNLSALGYALAP--IIDKADIEHALEIYQHQLQISYSKLMRNKLGLFDSHE 346
Query: 493 Q---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
Q + +L + + + +DYT FFR LS +I + EL AV A
Sbjct: 347 QDTELFQQLFDLLKQNGMDYTLFFRTLS------AISQAEL--NTSAVRFSNLTTNTTAV 398
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKL 609
W+ +Y + + I D++R ALM NPKY+LRNYL Q AID+AE GDF V LL +
Sbjct: 399 DKWLQAYKKRV--ENIDDQQRLALMLKSNPKYILRNYLAQLAIDSAEQGDFTLVDNLLTI 456
Query: 610 MERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+DE P +E A LPP W +SCSS
Sbjct: 457 LHDPFDEHPELEDLADLPPKWGKE---LEISCSS 487
>gi|431804891|ref|YP_007231794.1| hypothetical protein B479_24810 [Pseudomonas putida HB3267]
gi|430795656|gb|AGA75851.1| hypothetical protein B479_24810 [Pseudomonas putida HB3267]
Length = 486
Score = 341 bits (875), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 212/551 (38%), Positives = 297/551 (53%), Gaps = 71/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV SES L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IADPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN +
Sbjct: 46 DLDPAQAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDQG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ E A++ R+AQS +RFG ++ + +R E R L D+ + H+
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTRQPEQ---QRVLIDHVLEQHYPECR 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + F T + ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 DAEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEV 313
Query: 458 DD-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNF 512
+ KEA + Y ++D +M ++LGL + ++ +LL M VDY+ F
Sbjct: 314 EPLKEALGLFLPLYQAHYLD----LMRRRLGLTTAEEDDMALVERLLQRMQSGGVDYSLF 369
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKA 572
FR L + P E L ++ +D+ + +W Y+ + E R+
Sbjct: 370 FRKLGDQ------PVAEALKMVRDDFIDLA-----GFDAWGADYLARCEREADNVEGRRE 418
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ +P++EQ GM+ YA PP W
Sbjct: 419 RMHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLSKPFEEQAGMQGYAERPPEWGK 478
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 479 H---LEISCSS 486
>gi|398791530|ref|ZP_10552254.1| hypothetical protein PMI39_00828 [Pantoea sp. YR343]
gi|398215021|gb|EJN01588.1| hypothetical protein PMI39_00828 [Pantoea sp. YR343]
Length = 479
Score = 341 bits (875), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 207/526 (39%), Positives = 288/526 (54%), Gaps = 53/526 (10%)
Query: 121 TDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL 180
T+S +E L YT + P+ + +L + +A + LD F ++ SG L
Sbjct: 4 TNSWQQE-LAGFYTALDPTP-LAGGRLFYHNAPLAQEMGLDDALFAGSGHGVW-SGRELL 60
Query: 181 AGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGL 240
G P AQ Y GHQFG+WAGQLGDGR I LGE + + LKGAG TPYSR DG
Sbjct: 61 PGMSPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLANGRKLDWHLKGAGLTPYSRMGDGR 120
Query: 241 AVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQS 300
AV+RSS+REFL SEA+H LGIPTTRAL L + V R+ +E GA++ R+A S
Sbjct: 121 AVIRSSVREFLASEALHHLGIPTTRALALAIGDEPVLRE-------TQERGAMLMRIADS 173
Query: 301 FLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLT 360
LRFG ++ H G E D VR LADYAIRHH+ ++
Sbjct: 174 HLRFGHFE-HFYYGGEQ-DKVRQLADYAIRHHWPQLKE---------------------E 210
Query: 361 SNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFT 420
+++Y W ++ +RTASL+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD + P +
Sbjct: 211 ADRYLLWFTDIVKRTASLIAHWQSVGFAHGVMNTDNMSILGLTLDYGPYGFLDDYQPDYI 270
Query: 421 PNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAI 480
N +D G RY F NQP IGLWN+ + + L+ L+ ++ + Y + M +
Sbjct: 271 CNHSDYQG-RYAFENQPMIGLWNLNRLAHALSG--LMTTEQLKLALGHYENELMRVWGEK 327
Query: 481 MTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAV 537
M KLGL + +++ LL+ M ++ DYT FR LS+ + +DE PL+
Sbjct: 328 MRAKLGLLTADANDNTLLTGLLSMMTAERSDYTLTFRMLSDTQ------QDESRSPLRDE 381
Query: 538 LLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
+D ++A+ W Y Q LL SD +R+A+M + NP VLRNYL Q I+ E
Sbjct: 382 FID-----RDAFDRWYSDYRQRLLQDNASDAQRQAVMKAANPALVLRNYLAQQVIEEVEN 436
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
G+ + RL +++P+ + + + PP W +SCSS
Sbjct: 437 GETTALARLHSALQQPFSDAAVSAELRQRPPEWG---KTLEVSCSS 479
>gi|317491950|ref|ZP_07950384.1| hypothetical protein HMPREF0864_01148 [Enterobacteriaceae bacterium
9_2_54FAA]
gi|316920071|gb|EFV41396.1| hypothetical protein HMPREF0864_01148 [Enterobacteriaceae bacterium
9_2_54FAA]
Length = 480
Score = 341 bits (875), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 207/518 (39%), Positives = 288/518 (55%), Gaps = 52/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT++ P+ +++ +++ S+ +A L LD EF + G + L G P AQ
Sbjct: 12 LPGFYTELKPTP-LKDARVLYHSQPLAAELGLD-AEFFSGESAAVLRGESLLEGMNPIAQ 69
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE +++ LKGAG TPYSR DG AVLRS IR
Sbjct: 70 VYSGHQFGVWAGQLGDGRGILLGEQQLPDGRKYDWHLKGAGLTPYSRMGDGRAVLRSVIR 129
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EFL SEA+H LGIP++RAL +VT+ + V R+ + E GA++ RVA+S LRFG ++
Sbjct: 130 EFLASEALHHLGIPSSRALSIVTSQQPVFRE-------QPERGAMLLRVAESHLRFGHFE 182
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R Q D VR LADYAIRHH+ H+ D+D +Y W
Sbjct: 183 HFYYREQP--DEVRKLADYAIRHHWPHL------------VDDKD---------RYVLWL 219
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ ERTA ++A WQ GF HGV+NTDNMSILGLTID+GP+ FLD + P F N +D G
Sbjct: 220 RDITERTARMIALWQSQGFAHGVMNTDNMSILGLTIDFGPYAFLDDYQPDFICNHSDYQG 279
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F NQP + WN+ + L+ LI + ++ Y M + M +KLG
Sbjct: 280 -RYAFDNQPAVAYWNLHRLGQALSG--LISADQIRGALDAYEPALMVAFGEQMRQKLGFF 336
Query: 489 KYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
Q ++++LL+ MA + DYT FRALS+V S + L+ +D
Sbjct: 337 SRQNQDNDLLTELLSLMAKEGRDYTRTFRALSDVVLSDST------MALRDDFID----- 385
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+ A+ W + L G+ D R+ M +VNPK +LRNYL Q+AI+AAE D + R
Sbjct: 386 RAAFDGWHQKWRLRLQQDGVDDVTRQTQMKAVNPKRILRNYLAQNAIEAAEKDDVSVLTR 445
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + ++ PY++ + + LPP W + +SCSS
Sbjct: 446 LHQGLQNPYEDDAAFDDLSALPPDWGKK---LEISCSS 480
>gi|423016786|ref|ZP_17007507.1| hypothetical protein AXXA_20157 [Achromobacter xylosoxidans AXX-A]
gi|338780214|gb|EGP44629.1| hypothetical protein AXXA_20157 [Achromobacter xylosoxidans AXX-A]
Length = 495
Score = 341 bits (875), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 215/517 (41%), Positives = 287/517 (55%), Gaps = 46/517 (8%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT++ P + NP+L+ + A + LDP P+F FSGA PL G A Y
Sbjct: 21 AFYTRLEPQ-PLNNPRLLHANADAAALIGLDPAALRTPEFLRVFSGAQPLPGGDTLAAVY 79
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGEI + WELQLKGAG TPYSR DG AVLRSS+RE+
Sbjct: 80 SGHQFGVWAGQLGDGRAHLLGEIQG-PAGAWELQLKGAGLTPYSRMGDGRAVLRSSVREY 138
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH LGIPTTRAL LV + V R+ E AIV R++ SF+RFGS++
Sbjct: 139 LASEAMHGLGIPTTRALALVASDDPVWRETV-------ETAAIVTRMSPSFVRFGSFEHW 191
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+SR Q DL ++TLADY I ++ + E+ S + Y E
Sbjct: 192 SSRRQPDL--LKTLADYVIDRYYPECRAVPAGEAPS-------------DTAPYVRLLRE 236
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF+D F N +D G R
Sbjct: 237 VTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGFMDGFRLGHVCNHSDSEG-R 295
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP + LWN+ + +L L+ D + V++ + F + M K+GL
Sbjct: 296 YSWNRQPSVALWNLYRLGGSL--HTLVQDVDGLRAVLDEFEGVFTRAFHDRMGAKMGLAA 353
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++ ++ LL M ++ D+T +R L++ + P +L I +E
Sbjct: 354 WRPADEPLLDDLLKLMDANQADFTLAWRRLADAVSGNRAPFQDLF---------IDREAA 404
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
AW+ +L+ + G E A MN VNP YVLRN+L + AI AA+ GD GE+ L
Sbjct: 405 AAWLDRLLARQAQ---DGRPATEVAAAMNRVNPLYVLRNHLAEEAIRAAKTGDAGEIETL 461
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ L+ P+ + G EKYA LPP WA +SCSS
Sbjct: 462 MTLLRDPFTARTGYEKYASLPPDWA---NGIEVSCSS 495
>gi|354725825|ref|ZP_09040040.1| hypothetical protein EmorL2_23478 [Enterobacter mori LMG 25706]
Length = 480
Score = 341 bits (874), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 207/518 (39%), Positives = 283/518 (54%), Gaps = 53/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT + P+ + + +L+ + +AD L + P F + + G T LAG P AQ
Sbjct: 13 LPGFYTALKPTP-LHHSRLIWHNAPLADELAIPPDLFPPAEGAGVWGGETLLAGMQPLAQ 71
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE E + LKGAG TPYSR DG AVLRS+IR
Sbjct: 72 VYSGHQFGVWAGQLGDGRGILLGEQQLPNGETVDWHLKGAGLTPYSRMGDGRAVLRSTIR 131
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E L SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A+S LRFG ++
Sbjct: 132 ESLASEAMHALGIPTTRALSIVTSDTPVARETM-------EQGAMLVRIAESHLRFGHFE 184
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
H +E + VR LADYAIR H+ ++ + KY W
Sbjct: 185 -HFYYHREP-EKVRQLADYAIRRHWPQLQG---------------------EAEKYVLWF 221
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ RTAS++A+WQ VGF HGV+NTDNMS+LGLT DYGP+GFLD + P + N +D G
Sbjct: 222 RDIVSRTASMIARWQTVGFAHGVMNTDNMSLLGLTFDYGPYGFLDDYQPGYICNHSDYQG 281
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP +GLWN+ + + +L + ID N ++ Y + EY A+M KLGL
Sbjct: 282 -RYSFDNQPAVGLWNLQRLAQSL--SPFIDVDALNDALDGYQEVLLREYGALMRNKLGLL 338
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + ++++ L MA + DYT R LS + + + PL+ +D
Sbjct: 339 TQEKGDNELLNTLFALMAREGSDYTRTIRMLSQTEQNSAAS------PLRDEFID----- 387
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ W Y L + D R+ M + NP VLRN+L Q AI+ AE G + E+ R
Sbjct: 388 RQAFDDWFTLYRSRLQQEQVDDATRQEKMKAANPAMVLRNWLAQRAIEQAEQGQYDELHR 447
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + P+ ++ + Y PP W R V SCSS
Sbjct: 448 LHVALRTPFADRD--DDYVSRPPEWGKRLEV---SCSS 480
>gi|298286503|ref|NP_001177241.1| selenoprotein O [Ciona intestinalis]
Length = 640
Score = 341 bits (874), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 195/434 (44%), Positives = 260/434 (59%), Gaps = 35/434 (8%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K EDL +D+ ++ LP D R+V AC++ P+ +ENP+LVA+SES L
Sbjct: 26 IKQPEDLQFDNLALKTLPVDESKVPGSRQVRGACFSLTDPTP-LENPKLVAFSESALRLL 84
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L F +F G L G+V + CY GHQFG ++GQLGDG AI LGE++N K
Sbjct: 85 DLKCNPDTEAKFSEYFCGNKLLPGSVTASHCYCGHQFGYFSGQLGDGAAIYLGEVINSKG 144
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+RWE+QLKGAG+TPYSR ADG VLRS+IREFLCSEA+ LGIPTTRA +V + V R
Sbjct: 145 DRWEIQLKGAGQTPYSRSADGRKVLRSTIREFLCSEAIFHLGIPTTRAGTVVVSDDKVVR 204
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH------ASRGQED---LDIVRTLADYAI 329
DMFYDG K E A+V R+A SFLRFGS++I RG I+ T+ YA+
Sbjct: 205 DMFYDGKAKLENCAVVLRLAPSFLRFGSFEIFKPIDPATGRGGPSTGMTGILPTMLQYAL 264
Query: 330 RHHFRHIEN-MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFT 388
+ F+ ++ + K E +Y A EV RTA+LVA+WQ VGF
Sbjct: 265 DNFFKEVDQALPKVE-------------------QYLAMYKEVCVRTAALVAKWQCVGFC 305
Query: 389 HGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFS 448
HGVLNTDNMS+LGLTIDYGPFGF+D FDP+F N +D G RY + QP+I WN+ +F+
Sbjct: 306 HGVLNTDNMSLLGLTIDYGPFGFMDRFDPNFQCNNSDNKG-RYVYKAQPEICQWNLKKFA 364
Query: 449 TTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVD 505
+ ++D + E Y ++ +Y + M KKLGL K ++ ++ LN M
Sbjct: 365 EAIQECLPLND-SLKVLEESYFPEYKQQYLSEMRKKLGLVKNLPEDEALVDSFLNTMEET 423
Query: 506 KVDYTNFFRALSNV 519
D+TN FR+LS V
Sbjct: 424 YADFTNSFRSLSVV 437
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 54/85 (63%), Gaps = 7/85 (8%)
Query: 540 DIGKERKEAWISWVLSYIQELLSS-------GISDEERKALMNSVNPKYVLRNYLCQSAI 592
D+ K KE W SW+ Y L + D +RK LMNS+NPKY+LRNY+ ++AI
Sbjct: 523 DLLKSNKEKWQSWLKKYCSRLKKEITLQQNLQVLDGQRKQLMNSINPKYILRNYIAENAI 582
Query: 593 DAAELGDFGEVRRLLKLMERPYDEQ 617
AE GDF EVR++L+++E P+ ++
Sbjct: 583 KKAENGDFSEVRKVLQMLENPFHDE 607
>gi|365834257|ref|ZP_09375703.1| hypothetical protein HMPREF0454_00522 [Hafnia alvei ATCC 51873]
gi|364569034|gb|EHM46657.1| hypothetical protein HMPREF0454_00522 [Hafnia alvei ATCC 51873]
Length = 501
Score = 341 bits (874), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 206/518 (39%), Positives = 290/518 (55%), Gaps = 52/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L YT++ P+ +++ +++ +S+ +A L L EF + G + L G P AQ
Sbjct: 33 LPGFYTELKPTP-LKDARVLYYSQPLAAELGLGA-EFFSGESAAVLRGESLLEGMNPIAQ 90
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE +++ LKGAG TPYSR DG AVLRS IR
Sbjct: 91 VYSGHQFGVWAGQLGDGRGILLGEQQLPDGRKYDWHLKGAGLTPYSRMGDGRAVLRSVIR 150
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EFL SEA+H LGIP++RAL +VT+ + V R+ + E GA++ RVA+S LRFG ++
Sbjct: 151 EFLASEALHHLGIPSSRALSIVTSQQPVFRE-------QPERGAMLLRVAESHLRFGHFE 203
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
R Q D VR LADYAIRHH+ H+ + D+D +Y W
Sbjct: 204 HFYYREQP--DEVRKLADYAIRHHWPHLVD------------DKD---------RYVLWL 240
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++ ERTA ++A WQ GF HGV+NTDNMSILGLTID+GP+ FLD + P F N +D G
Sbjct: 241 RDITERTARMIALWQSQGFAHGVMNTDNMSILGLTIDFGPYAFLDDYQPDFICNHSDYQG 300
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F NQP + WN+ + L+ LI + ++ Y M + M +KLG
Sbjct: 301 -RYAFDNQPAVAYWNLHRLGQALSG--LISADQIRGALDAYEPALMVAFGEQMRQKLGFF 357
Query: 489 KYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
Q ++++LL+ MA + DYT FRALS+V S + L+ +D
Sbjct: 358 SRQNQDNDLLTELLSLMAKEGRDYTRTFRALSDVVLSDST------MALRDDFID----- 406
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+ A+ +W + L G+ D R+ M +VNPK +LRNYL Q+AI+AAE D + R
Sbjct: 407 RAAFDAWHQKWRLRLQQDGVDDAARQTQMKAVNPKRILRNYLAQNAIEAAEKDDVSVLTR 466
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + ++ PY++ + + LPP W + +SCSS
Sbjct: 467 LHQGLQNPYEDDAAFDDLSALPPDWGKK---LEISCSS 501
>gi|339489792|ref|YP_004704320.1| hypothetical protein PPS_4913 [Pseudomonas putida S16]
gi|338840635|gb|AEJ15440.1| conserved hypothetical protein [Pseudomonas putida S16]
Length = 486
Score = 341 bits (874), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 212/551 (38%), Positives = 296/551 (53%), Gaps = 71/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV SES L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IADPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN +
Sbjct: 46 DLDPAQAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDQG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ E A++ R+AQS +RFG ++ + +R E R L D+ + H+
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTRQPEQ---QRVLIDHVLEQHYPECR 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + F T + ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 DAEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEV 313
Query: 458 DD-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNF 512
+ KEA + Y ++D +M ++LGL + ++ +LL M VDY F
Sbjct: 314 EPLKEALGLFLPLYQAHYLD----LMRRRLGLTTAEEDDMALVERLLQRMQSGGVDYNLF 369
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKA 572
FR L + P E L ++ +D+ + +W Y+ + E R+
Sbjct: 370 FRKLGDQ------PVAEALKVVRDDFIDLA-----GFDAWGADYLARCEREADNVEGRRE 418
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ +P++EQ GM+ YA PP W
Sbjct: 419 RMHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLSKPFEEQAGMQGYAERPPEWGK 478
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 479 H---LEISCSS 486
>gi|421844156|ref|ZP_16277315.1| hypothetical protein D186_03921 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
gi|411775063|gb|EKS58531.1| hypothetical protein D186_03921 [Citrobacter freundii ATCC 8090 =
MTCC 1658]
Length = 480
Score = 341 bits (874), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 208/521 (39%), Positives = 289/521 (55%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT +SP+ ++N +L+ ++++A+ L + F+ + G + L G P
Sbjct: 10 RDELPATYTALSPTP-LKNARLIWHNDALAEQLAIPAALFDISTGAGVWGGESLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE ++ LKGAG T YSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGVWAGQLGDGRGILLGEQQLADGSTFDWHLKGAGLTRYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA++ RVAQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVTSDTPVYRETV-------EAGAMLIRVAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + VR LAD+AIRH++ + ED ++KY
Sbjct: 182 HFEHFYYR--REPEKVRQLADFAIRHYWPQWQ--------------ED-------ADKYQ 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD + P + N +D
Sbjct: 219 LWFNDVVTRTATLIADWQAVGFAHGVMNTDNMSILGLTMDYGPFGFLDDYVPDYICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP LWN+ + + TL + I + N ++ Y + Y M +KL
Sbjct: 279 NQG-RYSFDNQPAAALWNLQRLAQTL--SPFIPVEALNDALDSYQMALLTRYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + +++S+L + M+ ++ DYT FR LS + + PL+ +D
Sbjct: 336 GFFSEQKDDNELLSELFSLMSRERSDYTRTFRMLSQTE------QHSAQSPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ W Y L I+D R+ M + NP VLRN+L Q AI AE GD+ E
Sbjct: 388 ---RAAFDDWFTRYRSRLQQDNIADAARQTQMKAANPAMVLRNWLAQRAISQAEQGDYAE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + + P+ ++ + Y PP W R V SCSS
Sbjct: 445 LHRLHQALRTPFADRD--DDYVSRPPDWGKRLEV---SCSS 480
>gi|167036107|ref|YP_001671338.1| hypothetical protein PputGB1_5118 [Pseudomonas putida GB-1]
gi|189040232|sp|B0KN22.1|Y5118_PSEPG RecName: Full=UPF0061 protein PputGB1_5118
gi|166862595|gb|ABZ01003.1| protein of unknown function UPF0061 [Pseudomonas putida GB-1]
Length = 486
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 210/548 (38%), Positives = 291/548 (53%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L++D+ F R GD A T+V P + P+LV SES L
Sbjct: 1 MKALDQLSFDNRFAR--LGD------------AFSTQVLPEP-IAEPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P F FSG A P A Y GHQFG + +LGDGR + L E+LN
Sbjct: 46 DLDPAHAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIPT+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPTSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L D+ + H+ +
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDHVLEQHYPECRD 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L +I+
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTT--VIE 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL + ++ +LL M VDY+ FFR
Sbjct: 313 VEPLKETLGLFLPLYQAHYLDLMRRRLGLTTAEEDDMALVERLLQCMQRGGVDYSLFFRK 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L P + L ++ +D+ A+ +W Y+ + E R+ M+
Sbjct: 373 LGEQ------PAADALKVVRDDFIDLA-----AFDAWGADYLARCDREPGNAEGRRERMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ P++EQPGM+ YA PP W
Sbjct: 422 AVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLSHPFEEQPGMQAYAERPPEWGKH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|255536675|ref|YP_003097046.1| hypothetical protein FIC_02554 [Flavobacteriaceae bacterium
3519-10]
gi|255342871|gb|ACU08984.1| protein of hypothetical function UPF0061 [Flavobacteriaceae
bacterium 3519-10]
Length = 514
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 206/518 (39%), Positives = 291/518 (56%), Gaps = 59/518 (11%)
Query: 115 LPGDPRTDSIPRE---VLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP 171
PGD ++ R+ VL A TK+ A N +L+ +++ ++D + L P E +
Sbjct: 14 FPGDTSGNTRQRQTPKVLFAS-TKIVGFA---NAELIHFNQKLSDEIGLGPIE---TNAD 66
Query: 172 LFFSGATPLAGAVP-YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGK 230
F AT L + YA Y GHQFG WAGQLGDGRAI GEI N ++ ELQ KGAG
Sbjct: 67 RDFLNATALPENIKTYATAYAGHQFGNWAGQLGDGRAIFAGEITNAAGKKTELQWKGAGA 126
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSR ADG AVLRSS+RE+L SEAM LG+PTTRAL L TG+ V RDM Y+GNP++E
Sbjct: 127 TPYSRHADGRAVLRSSVREYLMSEAMFHLGVPTTRALSLSLTGEQVERDMLYNGNPQDEK 186
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
GA+V R A+SFLRFG +Q+ A+ Q++++ +R LAD+ + +++ I+ +
Sbjct: 187 GAVVVRTAESFLRFGHFQLMAA--QDEIETLRQLADFTVSNYYPTIDPND---------- 234
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
KYA ++A RTA ++ +W VGF HGV+NTDNMS LGLTIDYGPF
Sbjct: 235 ----------PQKYAELFRQIASRTADMIVEWYRVGFVHGVMNTDNMSALGLTIDYGPFS 284
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERY 469
FLD + +FTPNTTDLPGRRY F NQ I WN+ Q ++ L L++D E ++ +
Sbjct: 285 FLDEYSLNFTPNTTDLPGRRYAFGNQAKIAQWNLWQLASALFP--LVNDVEILQNILNGF 342
Query: 470 GTKFMDEYQAIMTKKLGLPKYNKQII----------SKLLNNMAVDKVDYTNFFRALSNV 519
F ++ +M K G Q+I KL+ ++ K+DYT FF L
Sbjct: 343 SDDFWKKHDKMMASKFGF----DQLIEGDDSFFTAWQKLMEDL---KIDYTLFFSRL--- 392
Query: 520 KADPSIPEDELLVPLKAVLLD-IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVN 578
+ + D+L V + + + + +++ +Y L + I+ E+ LM N
Sbjct: 393 --EMTAGSDDLKTTFGDVFYSPVSDDSFKLFENFIETYRTRLTKNTITPEDSLQLMRKTN 450
Query: 579 PKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDE 616
P++VLRNY+ I E G ++L +E PY+E
Sbjct: 451 PRFVLRNYILFERIAELEQGKRDLFNKILTALESPYEE 488
>gi|386284608|ref|ZP_10061827.1| hypothetical protein SULAR_05148 [Sulfurovum sp. AR]
gi|385344011|gb|EIF50728.1| hypothetical protein SULAR_05148 [Sulfurovum sp. AR]
Length = 478
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 209/517 (40%), Positives = 286/517 (55%), Gaps = 62/517 (11%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
CYT+V P+ +EN L+ +E VA+ L++D +E F F +GA L G+ P+A CY
Sbjct: 19 CYTRVKPTP-LENVFLIHANEDVAELLDIDIEELYSDAFVEFVNGAWQLEGSDPFAMCYA 77
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG + +LGDGRAI +G I ++W LQLKGAG+T YSR DG AVLRSSIRE+L
Sbjct: 78 GHQFGHFVPRLGDGRAINIGTI-----KQWHLQLKGAGQTRYSRSGDGRAVLRSSIREYL 132
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--I 309
SEAMH LGI +TRAL L+ + V R+ + E GAIV RV+ S++RFG+++
Sbjct: 133 MSEAMHGLGIESTRALALIGSEHKVYREEW-------ETGAIVLRVSPSWVRFGTFEYFT 185
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
H R +E + LADYAI + H+ + +KY +
Sbjct: 186 HKKRYEE----LEALADYAIAESYPHLVEV---------------------PDKYLQFFT 220
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EV RTA L+A+WQ VGF HGV+NTDNMSI GLTIDYGP+ FLD +D + N TD G
Sbjct: 221 EVVSRTARLMAEWQAVGFNHGVMNTDNMSIAGLTIDYGPYAFLDDYDSQYICNHTD-QGG 279
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY F NQP+IG WN+ LA +++ + ++ Y + + Y +M KK+GL K
Sbjct: 280 RYSFGNQPNIGAWNLQALMHALAP--MVNSDKMEKALDDYARVYTERYLELMGKKIGLDK 337
Query: 490 YNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
++ +LL+ M +DYT FFR LS + + LL +G K
Sbjct: 338 LQDSDLELFKQLLSMMQGMSIDYTLFFRTLSRYDGE------------RTALLKLGLYHK 385
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
W+ SY + L ++ S +ER + M NPK+VL+NY+ Q AI AA GDF V L
Sbjct: 386 PM-NEWLDSYDERLKANTSSTKERHSAMLQTNPKFVLKNYMLQEAITAAVNGDFSVVDNL 444
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ + PY E E++A P LSCSS
Sbjct: 445 FEIAKDPYAEHETHERWAGATPEEFKNQK---LSCSS 478
>gi|397692969|ref|YP_006530849.1| hypothetical protein T1E_0199 [Pseudomonas putida DOT-T1E]
gi|397329699|gb|AFO46058.1| UPF0061 protein [Pseudomonas putida DOT-T1E]
Length = 486
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 293/550 (53%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV SES L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IADPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN
Sbjct: 46 DLDPAQAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGI T+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIATSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L D+ + H+ +
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDHVLEQHYPECRD 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL + ++ +LL M VDY+ FF
Sbjct: 315 PLKEALGLFLPLYQAHYLD----LMRRRLGLTNAEDDDMALVERLLQCMQRGGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P E L ++ +D+ + +W Y+ + E R+
Sbjct: 371 RKLGEQ------PVAEALKAVRDDFIDLA-----GFDAWGADYLARCGREPGNAEGRRER 419
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ RP++EQPGM+ YA PP W
Sbjct: 420 MHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLTRPFEEQPGMQAYAERPPEWGKH 479
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 480 ---LEISCSS 486
>gi|440230671|ref|YP_007344464.1| hypothetical protein D781_1995 [Serratia marcescens FGI94]
gi|440052376|gb|AGB82279.1| hypothetical protein D781_1995 [Serratia marcescens FGI94]
Length = 480
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 216/541 (39%), Positives = 294/541 (54%), Gaps = 66/541 (12%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF 165
+D+++ R+LPG YT ++P+ +E +L+ S +A L LD F
Sbjct: 3 QFDNAYYRQLPG--------------FYTALTPTP-LEGARLLYHSAPLAQQLGLDDSWF 47
Query: 166 ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQL 225
+ P++ SG L G P AQ Y GHQFG+WAGQLGDGR I LGE + L
Sbjct: 48 NAENTPVW-SGERLLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLPDGTHLDWHL 106
Query: 226 KGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGN 285
KGAG TPYSR DG AVLRS+IREFL SEAMH LGI TTRAL +VT+ + V R+
Sbjct: 107 KGAGLTPYSRMGDGRAVLRSAIREFLASEAMHHLGIATTRALTVVTSDQPVYRE------ 160
Query: 286 PKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESL 345
+ E GA++ RVA+S +RFG ++ R Q D VR LAD+ I H+ + +
Sbjct: 161 -QPERGAMLLRVAESHVRFGHFEHFYYRQQP--DQVRQLADFVIERHWPQLADQQ----- 212
Query: 346 SFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTID 405
+KY W +VAERTA L+A WQ VGF HGV+NTDNMSILGLTID
Sbjct: 213 ----------------DKYLLWFTDVAERTARLMADWQTVGFAHGVMNTDNMSILGLTID 256
Query: 406 YGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYV 465
YGP+GFLD + P + N +D G RY F NQP + LWN+ + + L + L+ ++
Sbjct: 257 YGPYGFLDDYQPGYICNHSDHQG-RYAFDNQPAVALWNLHRLAQAL--SPLMTPQQLQQA 313
Query: 466 MERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKAD 522
+ Y M Y M KLG +Q ++++LL+ MA + DYT FR LS V+
Sbjct: 314 LTAYEPALMRAYGDRMRAKLGFFSQQRQDNDLLTELLSLMAQEGRDYTRTFRLLSEVE-- 371
Query: 523 PSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
+ + PL+ +D +EA+ W Y + L + D +R+ M +VNPK +
Sbjct: 372 ----QQQAQTPLRDEFID-----REAFDGWYRRYRERLQQEQVGDAQRRQAMQAVNPKLI 422
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCS 642
LRNYL Q AI AAE D ++ L + + +PYD+ + A LPP W +SCS
Sbjct: 423 LRNYLAQEAIAAAEQDDASKLAHLHQALLKPYDDDARYDALAALPPEWGKH---LEISCS 479
Query: 643 S 643
S
Sbjct: 480 S 480
>gi|73541090|ref|YP_295610.1| hypothetical protein Reut_A1396 [Ralstonia eutropha JMP134]
gi|121957743|sp|Q472B7.1|Y1396_RALEJ RecName: Full=UPF0061 protein Reut_A1396
gi|72118503|gb|AAZ60766.1| Protein of unknown function UPF0061 [Ralstonia eutropha JMP134]
Length = 520
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 211/527 (40%), Positives = 290/527 (55%), Gaps = 61/527 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P+ + + LV+ + + A L + + PDF F G + A P A Y G
Sbjct: 39 FTRLRPT-PLPSAYLVSVAPNAAALLGMPVEAASEPDFIEAFVGNSVPDWADPLATVYSG 97
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI L + + WE+QLKGAG TPYSR ADG AVLRSSIRE+LC
Sbjct: 98 HQFGVWAGQLGDGRAIRLAQA-QTDTGPWEIQLKGAGLTPYSRMADGRAVLRSSIREYLC 156
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LG+PTTRAL ++ + V R+ E A+V R+A +F+RFG ++ A+
Sbjct: 157 SEAMAALGVPTTRALSIIGSDAPVRRETI-------ETAAVVTRLAPTFIRFGHFEHFAA 209
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
ED+ +R LAD+ I + + Y A EV+
Sbjct: 210 --HEDVAALRQLADFVINNFMPACRE---------------------AAQPYQALLREVS 246
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA +VA WQ +GF HGV+NTDNMSILGLTIDYGPFGFLDAFD + N +D G RY
Sbjct: 247 LRTADMVAHWQAIGFCHGVMNTDNMSILGLTIDYGPFGFLDAFDANHICNHSDTQG-RYA 305
Query: 433 FANQPDIGLWNIAQFSTTL----------AAAK---LIDDKEA-NYVMERYGTKFMDEYQ 478
++ QP + WN+ + L AA+ + +EA + +RY ++F Y+
Sbjct: 306 YSQQPQVAFWNLHCLAQALLPLWLEPGADEAARDGAVAQAREALDPFRDRYASEFFRHYR 365
Query: 479 AIMTKKL--GLPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA 536
A + + G K ++ +++ L + VDYT F+R L+ + S + P++
Sbjct: 366 AKLGIHMPAGGDKEDEPLLTSLFQLLHEQHVDYTLFWRNLARI----SSADGSGDAPVRD 421
Query: 537 VLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
+ LD + AW +W SY L + D R+ M +VNPKYVLRN+L + AI A
Sbjct: 422 LFLD-----RAAWDTWAESYRNRLRAEQSDDAARRVAMLAVNPKYVLRNHLAEIAIRRAR 476
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF EV RLL ++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 477 EKDFSEVDRLLAVLSRPFDEQPEAEAYAALPPDWA---GGLEVSCSS 520
>gi|387192963|gb|AFJ68681.1| selenoprotein o, partial [Nannochloropsis gaditana CCMP526]
Length = 572
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 194/448 (43%), Positives = 267/448 (59%), Gaps = 38/448 (8%)
Query: 93 SKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWS- 151
S+ K LE L +D+ +R LP DP+ ++ R V ++ Y++V P ++NP LVA S
Sbjct: 59 SRPQPKTYTLETLPFDNLALRSLPLDPQPENFIRPVPNSVYSRVEPEP-LKNPVLVALSP 117
Query: 152 ESVADSLELDPKEFERP-DFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITL 210
+++ D L LDP E +R D + G L G+ YA CY GHQFG ++GQLGDG AI+L
Sbjct: 118 DALTDLLSLDPSELKREEDLAAYLGGNKRLPGSETYAHCYAGHQFGAFSGQLGDGAAISL 177
Query: 211 GEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLV 270
GE++ + ER E+QLKGAG TPYSR ADG VLRSSIREFLCSEAM FLG+PTTRA L+
Sbjct: 178 GEVVGERGERCEIQLKGAGPTPYSRRADGRKVLRSSIREFLCSEAMSFLGVPTTRAGALI 237
Query: 271 TTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI---------HASRGQEDLDIV 321
T+ RD+FY+GN E ++V R+A SFLRFGS+++ A + +++
Sbjct: 238 TSDTLTQRDIFYNGNVINERCSVVTRLAPSFLRFGSFEVVKTQDAYTGRAGPSPGNTELL 297
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
R L D+ I+ +F H+ ++ D ++Y A+ EV +TA LVA
Sbjct: 298 RELLDFTIQTYFPHLGHLE-----------------DNKPDQYLAFYREVVAKTAGLVAA 340
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
WQ VGFTHGVLNTDNMS+LGLTIDYGP+GF+D FDP F PN +D G RY + QP+I
Sbjct: 341 WQAVGFTHGVLNTDNMSVLGLTIDYGPYGFMDFFDPDFIPNGSD-NGGRYTYVKQPEICK 399
Query: 442 WNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL-------PKYNKQI 494
WN+ +F+ L+ L D+ + Y ++ Y +M KKLGL K +++
Sbjct: 400 WNLEKFAEALSLL-LPLDRSLPLLSSLYDDEYSRAYFFLMRKKLGLREGGAEGGKEEEEL 458
Query: 495 ISKLLNNMAVDKVDYTNFFRALSNVKAD 522
+ KL M D+T F LS ++ D
Sbjct: 459 VEKLFKTMEETAADFTMTFVELSRLERD 486
>gi|293604642|ref|ZP_06687044.1| SelO family protein [Achromobacter piechaudii ATCC 43553]
gi|292816973|gb|EFF76052.1| SelO family protein [Achromobacter piechaudii ATCC 43553]
Length = 495
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 218/518 (42%), Positives = 286/518 (55%), Gaps = 48/518 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT+++P + NP+L+ + A + LDP + P+F FSG PL G A Y
Sbjct: 21 AFYTRLTPQG-LNNPRLLHANADAAALIGLDPAVLDSPEFLQVFSGGQPLPGGDTLAAVY 79
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGE+ WELQLKGAG TPYSR DG AVLRSS+RE+
Sbjct: 80 SGHQFGVWAGQLGDGRAHLLGEVQG-PDGGWELQLKGAGMTPYSRMGDGRAVLRSSVREY 138
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH LGIPTT+AL LV + V R+ E AIV R++ SF+RFGS++
Sbjct: 139 LASEAMHGLGIPTTQALALVVSDDPVMRETV-------ETAAIVTRMSPSFVRFGSFEHW 191
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+SR Q DL ++TLADY I + D + Y
Sbjct: 192 SSRRQPDL--LKTLADYVIDRFYPECR-------------DAPADPAQAEAAPYLNLLRV 236
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF+D F N +D G R
Sbjct: 237 VTHRTARLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGFMDGFRLGHVCNHSDSEG-R 295
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP + LWN+ + +L A L+ D +A V++ + F + M KLGL
Sbjct: 296 YSWNRQPSVALWNLYRLGGSLHA--LVQDVDALRAVLDEFEAVFTRAFHDRMGAKLGLAA 353
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSN-VKADPSIPEDELLVPLKAVLLDIGKER 545
+ ++ ++ LL M ++ D+T +R L++ V S ED L I +
Sbjct: 354 WQPADEPLLDDLLKLMDANQADFTLSWRRLADAVLGQRSAFED----------LFIDRPA 403
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
AW+ +L+ + G E R MN VNP YVLRN+L + AI AA+ GD E+
Sbjct: 404 AAAWLDRLLARQAQ---DGRPAEARADAMNRVNPLYVLRNHLAEEAIRAAKKGDASEIDT 460
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L+KL+ PY QPG E+YA LPP WA G +SCSS
Sbjct: 461 LMKLLRNPYQPQPGYERYAGLPPDWA---GSLEVSCSS 495
>gi|386014338|ref|YP_005932615.1| hypothetical protein PPUBIRD1_4857 [Pseudomonas putida BIRD-1]
gi|313501044|gb|ADR62410.1| Hypothetical protein, conserved [Pseudomonas putida BIRD-1]
Length = 486
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 292/550 (53%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV SES L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IADPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN
Sbjct: 46 DLDPAQAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGI T+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIATSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L D+ + H+ +
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDHVLEQHYPECRD 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL + ++ +LL M VDY+ FF
Sbjct: 315 PLKEALGLFLPLYQVHYLD----LMRRRLGLTTAEDDDMALVERLLQCMQRGGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P E L + +D+ + +W Y+ + E R+
Sbjct: 371 RTLGEQ------PVAEALKVARDDFIDLA-----GFDAWGADYLARCGREPDNAEGRRER 419
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ RP++EQPGM+ YA PP W
Sbjct: 420 MHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLTRPFEEQPGMQAYAERPPEWGKH 479
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 480 ---LEISCSS 486
>gi|406674903|ref|ZP_11082095.1| hypothetical protein HMPREF1170_00303 [Aeromonas veronii AMC35]
gi|404628411|gb|EKB25193.1| hypothetical protein HMPREF1170_00303 [Aeromonas veronii AMC35]
Length = 475
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 213/525 (40%), Positives = 280/525 (53%), Gaps = 57/525 (10%)
Query: 122 DSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLA 181
++ E+ AC V+P ++ P+L+ + ++ D L L D+ L
Sbjct: 5 NTFATELPWAC-EPVAPQP-LQQPRLLHLNRALLDELGLG--GVSEADWIACCGEGKVLP 60
Query: 182 GAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLA 241
G P AQ Y GHQFG ++ +LGDGRA+ LGE L +RW+L LKGAGKTP+SRF DG A
Sbjct: 61 GMQPVAQVYAGHQFGGYSPRLGDGRALLLGEQLAPDGQRWDLHLKGAGKTPFSRFGDGRA 120
Query: 242 VLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSF 301
VLRSSIRE+L SEA+H LGIPTTRAL LV + + V R+ E GA V R A S
Sbjct: 121 VLRSSIREYLASEALHALGIPTTRALVLVGSQEPVYREQV-------ETGATVLRTAPSH 173
Query: 302 LRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTS 361
LRFG + A GQ + + L DY +RHHF +E+
Sbjct: 174 LRFGHIEYFAWSGQG--EKIPPLIDYLLRHHFPELESG---------------------- 209
Query: 362 NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTP 421
A EV RTA L+A+WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P F
Sbjct: 210 ---AELFAEVVRRTARLIAKWQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPDFVC 266
Query: 422 NTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIM 481
N +D P RY QP +G WN+ + + LA +D + +Y + M Y +M
Sbjct: 267 NHSD-PAGRYALDQQPAVGYWNLQKLAQALAGH--VDGDALAAALAQYEQQLMLHYSELM 323
Query: 482 TKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
KLGL + + + +L +A KVDY F R L V + + P L A+L
Sbjct: 324 RAKLGLAVWEEDDPALFRELFRLLAAHKVDYHLFLRRLGEVTQEGAWPAS-----LLALL 378
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+ G W +W+ Y L+ G D RK M+++NPKYVLRN L Q IDAA++G
Sbjct: 379 SEPG-----VWQAWLERYRARLMREGSEDAVRKTQMDAINPKYVLRNALAQQVIDAADVG 433
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
D RL ++ PYDEQP E A PAW Y G LSCSS
Sbjct: 434 DMQPFERLFAALQHPYDEQPEYEDLATPTPAW-YCGG--ELSCSS 475
>gi|157370404|ref|YP_001478393.1| hypothetical protein Spro_2164 [Serratia proteamaculans 568]
gi|157322168|gb|ABV41265.1| protein of unknown function UPF0061 [Serratia proteamaculans 568]
Length = 480
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 208/528 (39%), Positives = 293/528 (55%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P+ ++ ++ L YT+++P+ + +L+ SE +A L LD F + P++ +G T
Sbjct: 2 PQFENAYQQQLAGFYTELNPTP-LTGTRLLYHSEPLARELGLDESWFTQDKTPIW-AGET 59
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR D
Sbjct: 60 LLPGMRPLAQVYSGHQFGVWAGQLGDGRGILLGEQRLADGRSMDWHLKGAGLTPYSRMGD 119
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS IREFL SEA+H LGIPTTRAL +VT+ + V R+ + E GA++ RVA
Sbjct: 120 GRAVLRSVIREFLASEALHHLGIPTTRALTIVTSDQPVYRE-------QAERGAMLLRVA 172
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q + V+ LAD+ I H+ ++
Sbjct: 173 ESHVRFGHFEHFYYRKQPEQ--VQQLADFVIARHWPQFKDQ------------------- 211
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
S+ Y W +V ERTA L+A WQ VGF HGV+NTDNMSILG+TIDYGP+GFLD + P
Sbjct: 212 --SDGYLLWFTDVVERTARLIAHWQTVGFAHGVMNTDNMSILGITIDYGPYGFLDDYKPD 269
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
+ N +D G RY + NQP + LWN+ + + TL+ L+ ++ + Y M Y
Sbjct: 270 YICNHSDHQG-RYAYDNQPAVALWNLHRLAQTLSG--LMSTEQLQNALAAYEPALMRAYG 326
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG ++Q +++ LL+ MA + DY+ FR LS + + + PL+
Sbjct: 327 EQMRAKLGFFTQSQQDNDLLTGLLSLMAQEGRDYSRTFRLLSQTE------QQQAQSPLR 380
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + A+ W Y Q L ISD +R+ M +VNPK +LRNYL Q AI++A
Sbjct: 381 DEFID-----RAAFDGWYQQYRQRLQQEQISDAQRQQAMKAVNPKLILRNYLAQQAIESA 435
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D ++ RL + + P+ + P + A LPP W +SCSS
Sbjct: 436 EQDDVSKLARLHQALLAPFADNPEYDDLAALPPDWGKH---LEISCSS 480
>gi|421908407|ref|ZP_16338249.1| Selenoprotein O and cysteine-containing homologs [Klebsiella
pneumoniae subsp. pneumoniae ST258-K26BO]
gi|410117668|emb|CCM80874.1| Selenoprotein O and cysteine-containing homologs [Klebsiella
pneumoniae subsp. pneumoniae ST258-K26BO]
Length = 482
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 211/523 (40%), Positives = 281/523 (53%), Gaps = 55/523 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGVGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAG--QLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVL 243
AQ Y GHQFG WAG QLGDGR I LGE R++ LKGAG TPYSR DG AVL
Sbjct: 69 LAQVYSGHQFGAWAGXXQLGDGRGILLGEQQLADXXRYDWHLKGAGLTPYSRMGDGRAVL 128
Query: 244 RSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLR 303
RS+IRE L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +R
Sbjct: 129 RSTIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVR 181
Query: 304 FGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNK 363
FG ++ R + V+ LADY IRHH+ +++ ++K
Sbjct: 182 FGHFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADK 218
Query: 364 YAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNT 423
Y W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N
Sbjct: 219 YLLWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNH 278
Query: 424 TDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTK 483
+D G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M
Sbjct: 279 SDYQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRD 335
Query: 484 KLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
KLGL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 336 KLGLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID 389
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD
Sbjct: 390 -----RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDM 444
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GE+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 445 GELERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 482
>gi|385330885|ref|YP_005884836.1| hypothetical protein HP15_1144 [Marinobacter adhaerens HP15]
gi|311694035|gb|ADP96908.1| protein belonging to uncharacterized protein family UPF0061
[Marinobacter adhaerens HP15]
Length = 484
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 205/520 (39%), Positives = 284/520 (54%), Gaps = 52/520 (10%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPY 186
E+ + YT+V PS +++ ++V ++ +A+ + + ++ +G+ L G P
Sbjct: 14 ELPDSFYTRVQPSP-LKDAKMVCFNHKLAEQMGF--RADSESEWTGVGAGSELLEGMEPV 70
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG + LGDGR + L E + RW+ LKGAG TPYSRF DG AVLRS+
Sbjct: 71 AMKYTGHQFGAYNPDLGDGRGLLLWETVGPDGRRWDWHLKGAGMTPYSRFGDGRAVLRST 130
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IRE+LCSEAMH LGIPTTRAL +V+ V R+ E A + RVAQS +RFG
Sbjct: 131 IREYLCSEAMHGLGIPTTRALFMVSAKDPVRRESI-------ETAATLVRVAQSHIRFGH 183
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ A E + V+TL ++ I H H+ N+ D+D +YA
Sbjct: 184 FEFAAH--HEGPESVKTLLEHVISLHSPHLINLP----------DDD---------RYAR 222
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
W EV ERTA +A WQ VGF HGV+N+DNMSI+G T DYGPF FLD FD + N TD
Sbjct: 223 WFEEVVERTARTIADWQAVGFCHGVMNSDNMSIIGDTFDYGPFAFLDDFDAGYISNHTD- 281
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG 486
G RY + QP +G N +T L ++++ + + RY + + + M KLG
Sbjct: 282 QGGRYAYNRQPQVGFENCRYLATALLP--VMEEDDVRRGLRRYEVAYNERFLQNMRDKLG 339
Query: 487 LPKYNKQIISKLLNN---MAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L ++ +S +++ M VDYT FFRALSN+ + P +L V
Sbjct: 340 LAIEDEADLSLIMDTFSMMHEHHVDYTAFFRALSNLHSHGPGPVRDLFVD---------- 389
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
+ W+ Y + LL+ + +ER+ M VNPKYVLRNYL Q I A+ GD+ +
Sbjct: 390 --RSVADQWLERYEERLLNESRAHDEREYAMRRVNPKYVLRNYLAQQVIQEAQNGDYEPM 447
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ LLK++ERPYDEQP E YA LPP W + SCSS
Sbjct: 448 KALLKVLERPYDEQPENEAYAALPPDWGKHLNI---SCSS 484
>gi|336249891|ref|YP_004593601.1| hypothetical protein EAE_17055 [Enterobacter aerogenes KCTC 2190]
gi|334735947|gb|AEG98322.1| hypothetical protein EAE_17055 [Enterobacter aerogenes KCTC 2190]
Length = 480
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 207/523 (39%), Positives = 282/523 (53%), Gaps = 57/523 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ ++N +L+ + ++A +L + F + G L G P
Sbjct: 10 RDRLPGFYTSLAPTP-LDNARLIWRNTALAQTLGVPETIFNPQHGAGVWGGEAVLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE +R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLPDGQRFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ +EE G ++ R+A+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------REERGTMLMRIAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LADY I HH+ ++ ++KY
Sbjct: 182 HFEHFYYR--REAEKVQQLADYVIEHHWPQLQQ---------------------EADKYI 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 LWFRDVVTRTAEMIASWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFDNQPAVGLWNLQRLAQSL--SPFISADALNAALDDYQPALLTTYGRRMRDKL 335
Query: 486 GLPKYNKQ-----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
G Y +Q ++ L + M + DYT FR LS + + PL+ +D
Sbjct: 336 GF--YTQQTGDNTLLDGLFSLMEREGSDYTRTFRMLSQSEQHSAAS------PLRDEFID 387
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+ A+ SW Y L I D ER+ M VNP VLRN+L Q AI+ AE GD
Sbjct: 388 -----RAAFDSWFADYRARLRDEQIDDSERQQRMQGVNPAVVLRNWLAQRAIEKAEDGDM 442
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GE+ RL + + +P+ ++ + YA PP W V SCSS
Sbjct: 443 GELERLHEALAQPFADR--TDDYANRPPDWGKHLEV---SCSS 480
>gi|424903806|ref|ZP_18327319.1| hypothetical protein A33K_15181 [Burkholderia thailandensis MSMB43]
gi|390931679|gb|EIP89080.1| hypothetical protein A33K_15181 [Burkholderia thailandensis MSMB43]
Length = 525
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 220/547 (40%), Positives = 290/547 (53%), Gaps = 71/547 (12%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR D+ + L + P+A + P +V +S+ A L LDP + P F F G
Sbjct: 28 PRGDAFAQ--LGGAFLTRLPAAPLPAPYVVGFSDEAARMLGLDPALRDAPGFADLFCGNP 85
Query: 179 PL---AGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSR 235
++PYA Y GHQFG+WAGQLGDGRA+T+GE+ + R+ELQLKGAG+TPYSR
Sbjct: 86 TRDWPPASLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-DGRRYELQLKGAGRTPYSR 144
Query: 236 FADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVC 295
DG AVLRSSIREFL SEAMH LGIPTTRAL ++ + + V R+ E A+V
Sbjct: 145 MGDGRAVLRSSIREFLGSEAMHHLGIPTTRALTVIGSDQPVIREEI-------ETSAVVT 197
Query: 296 RVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RVA+SF+RFG ++ A+ E L R LAD+ I D +
Sbjct: 198 RVAESFVRFGHFEHFFANDRPEQL---RALADHVI---------------------DRFY 233
Query: 355 SVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDA 414
+ Y A EV RTA LVAQWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DA
Sbjct: 234 PACRDADDPYLALLAEVTRRTAELVAQWQAVGFCHGVMNTDNMSILGVTIDYGPFGFIDA 293
Query: 415 FDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL---------------AAAKLIDD 459
FD N +D G RY + QP I WN + L A + ++D
Sbjct: 294 FDAKHVCNHSDTHG-RYAYRMQPRIAHWNCFCLAQALLPLFGLDRDAPSEDARAERAVED 352
Query: 460 KEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRAL 516
A V+ R+ +F + M KLGL + + + ++LL M D+T FR L
Sbjct: 353 AHA--VLGRFPEQFGPALERAMRAKLGLALEREGDAALANQLLEIMDASHADFTLTFRHL 410
Query: 517 SNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNS 576
+ V + + P + + +D ++A+ W Y L D R A MN
Sbjct: 411 ARVSKHDARGD----APARDLFID-----RDAFDRWANLYRARLSEEARDDAARAAAMNR 461
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGV 636
NPKYVLRN+L ++AI A+ DF E+ RL ++ RP+DEQP + YA LPP WA
Sbjct: 462 SNPKYVLRNHLAETAIRRAKEKDFSEIERLAAVLRRPFDEQPEHDAYAALPPDWA---ST 518
Query: 637 CMLSCSS 643
+SCSS
Sbjct: 519 LEVSCSS 525
>gi|323495070|ref|ZP_08100159.1| hypothetical protein VIBR0546_02384 [Vibrio brasiliensis LMG 20546]
gi|323310727|gb|EGA63902.1| hypothetical protein VIBR0546_02384 [Vibrio brasiliensis LMG 20546]
Length = 487
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 290/521 (55%), Gaps = 62/521 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YTKV P ++N + V W+ A+ L P++ + +G L P A Y G
Sbjct: 19 YTKVVPQP-LDNTRWVVWNSHFANQFGL-PQQAPDGELKRLLTGEKSLEN-TPLAMKYAG 75
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ LGDGR + +GE+ N + E ++L LKG G TPYSR DG AVLRS+IRE+LC
Sbjct: 76 HQFGVYNPDLGDGRGLLIGELTNHRDEIFDLHLKGCGVTPYSRAGDGRAVLRSTIREYLC 135
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGIPTTRAL ++ + V RD K E GA++ R++ + +RFG ++
Sbjct: 136 SEAMAGLGIPTTRALGMLVSDTLVYRD-------KSEQGALLLRMSPTHIRFGHFEHFFY 188
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
E D ++ LAD I HF S +D + Y A +V
Sbjct: 189 --SEQFDELKLLADKVIEWHFS--------------------SALD-SEQPYQAMFEQVI 225
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTA ++A WQ GFTHGV+NTDNMSI+G T DYGPF FLD ++P + N +D RRY
Sbjct: 226 ERTAEMIAYWQAYGFTHGVMNTDNMSIIGETFDYGPFAFLDDYNPDYVCNHSDYQ-RRYA 284
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F QP I LWN+ + +L++ LI + ++ R+ + +M KLG+ +
Sbjct: 285 FNQQPRIALWNLTALAHSLSS--LICREMLEQILARFEPCLGHHFSRLMRAKLGINSQQQ 342
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ ++ S + + + ++DYT F R LS++ D I +LD+ +R+ A
Sbjct: 343 SDTRLFSTMFDLLHKQQIDYTRFLRELSSIDID-GIDR----------VLDLFADRQLA- 390
Query: 550 ISWVLSYI----QELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
W+ Y+ QEL +SG ISD +R M VNPK++LRNYL Q AID AE GDF EV
Sbjct: 391 TQWLTHYLERCQQELTASGEVISDRQRCEAMRRVNPKFILRNYLAQIAIDQAEQGDFSEV 450
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
+RL L++ P+DEQP M KYA LPPAW G M LSCSS
Sbjct: 451 QRLSDLLKYPFDEQPEMSKYADLPPAW----GKDMSLSCSS 487
>gi|148550143|ref|YP_001270245.1| hypothetical protein Pput_4941 [Pseudomonas putida F1]
gi|395445926|ref|YP_006386179.1| hypothetical protein YSA_04247 [Pseudomonas putida ND6]
gi|167012990|sp|A5WAA1.1|Y4941_PSEP1 RecName: Full=UPF0061 protein Pput_4941
gi|148514201|gb|ABQ81061.1| protein of unknown function UPF0061 [Pseudomonas putida F1]
gi|388559923|gb|AFK69064.1| hypothetical protein YSA_04247 [Pseudomonas putida ND6]
Length = 486
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 292/550 (53%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV SES L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IADPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN
Sbjct: 46 DLDPAQAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDVG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGI T+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIATSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L D+ + H+ +
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDHVLEQHYPECRD 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL + ++ +LL M VDY+ FF
Sbjct: 315 PLKEALGLFLPLYQAHYLD----LMRRRLGLTTAEDDDMALVERLLQCMQRGGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P E L + +D+ + +W Y+ + E R+
Sbjct: 371 RKLGEQ------PVAEALKVARDDFIDLA-----GFDAWGADYLARCRREPGNAEGRRER 419
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ RP++EQPGM+ YA PP W
Sbjct: 420 MHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLTRPFEEQPGMQAYAERPPEWGKH 479
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 480 ---LEISCSS 486
>gi|117620918|ref|YP_858551.1| hypothetical protein AHA_4127 [Aeromonas hydrophila subsp.
hydrophila ATCC 7966]
gi|166227227|sp|A0KQK0.1|Y4127_AERHH RecName: Full=UPF0061 protein AHA_4127
gi|117562325|gb|ABK39273.1| YdiU family protein [Aeromonas hydrophila subsp. hydrophila ATCC
7966]
Length = 475
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 204/469 (43%), Positives = 256/469 (54%), Gaps = 55/469 (11%)
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
PL G P AQ Y GHQFG ++ +LGDGRA+ LGE RW+L LKGAGKTP+SRF D
Sbjct: 58 PLPGMQPVAQVYAGHQFGGYSPRLGDGRALLLGEQQAPDGSRWDLHLKGAGKTPFSRFGD 117
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRSSIRE+L SEA+H LGIPTTRAL LV + + V R+ + E GA V R A
Sbjct: 118 GRAVLRSSIREYLASEALHALGIPTTRALVLVGSQEPVYRE-------QVETGATVLRTA 170
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
S LRFG + A GQ + + L DY +RHHF + +
Sbjct: 171 PSHLRFGHVEYFAWSGQGER--IPALIDYLLRHHFPELADG------------------- 209
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
A EV RTA L+A+WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P
Sbjct: 210 ------AELFAEVVRRTARLIAKWQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPD 263
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N +D PG RY QP +G WN+ + + LA +D + +Y + M Y
Sbjct: 264 FVCNHSD-PGGRYALDQQPAVGYWNLQKLAQALAGH--VDGDALAEALAQYEHQLMLHYS 320
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL-LVPL 534
+M KLGL + + + +L +A VDY F R L V + + P L L+P
Sbjct: 321 ELMRAKLGLAVWEEDDPVLFRELFQLLAAHGVDYHLFLRRLGEVTREGAWPASLLALLPE 380
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
A AW W+ +Y L G D RK LM++VNPKYVLRN L Q I+A
Sbjct: 381 PA-----------AWQGWLEAYRARLAREGSEDGVRKGLMDAVNPKYVLRNALAQRVIEA 429
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AE GD RL ++ PYDEQP E+ A PAW Y G LSCSS
Sbjct: 430 AEQGDMAPFERLFTALQHPYDEQPEYEELATPQPAW-YCGG--ELSCSS 475
>gi|421523549|ref|ZP_15970178.1| hypothetical protein PPUTLS46_16968 [Pseudomonas putida LS46]
gi|402752535|gb|EJX13040.1| hypothetical protein PPUTLS46_16968 [Pseudomonas putida LS46]
Length = 486
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 212/550 (38%), Positives = 292/550 (53%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV SES L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IADPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN
Sbjct: 46 DLDPAQAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGI T+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIATSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L D+ + H+ +
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDHVLEQHYPECRD 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL + ++ +LL M VDY+ FF
Sbjct: 315 PLKEALGLFLPLYQAHYLD----LMRRRLGLTTAEDDDMALVERLLQCMQRGGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P E L ++ +D+ + +W Y+ + E R+
Sbjct: 371 RKLGEQ------PVAEALKAVRDDFIDLA-----GFDAWGADYLARCGREPGNAEGRRER 419
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP Y LRNYL Q AI+AAE GD+ EVRRL +++ RP++EQPGM+ YA PP W
Sbjct: 420 MHAVNPLYALRNYLAQKAIEAAEAGDYSEVRRLHQVLTRPFEEQPGMQAYAERPPEWGKH 479
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 480 ---LEISCSS 486
>gi|33596537|ref|NP_884180.1| hypothetical protein BPP1919 [Bordetella parapertussis 12822]
gi|33601090|ref|NP_888650.1| hypothetical protein BB2107 [Bordetella bronchiseptica RB50]
gi|412338727|ref|YP_006967482.1| hypothetical protein BN112_1410 [Bordetella bronchiseptica 253]
gi|427815206|ref|ZP_18982270.1| conserved hypothetical protein [Bordetella bronchiseptica 1289]
gi|427819480|ref|ZP_18986543.1| conserved hypothetical protein [Bordetella bronchiseptica D445]
gi|427825049|ref|ZP_18992111.1| conserved hypothetical protein [Bordetella bronchiseptica Bbr77]
gi|39932513|sp|Q7W954.1|Y1919_BORPA RecName: Full=UPF0061 protein BPP1919
gi|39932520|sp|Q7WKJ9.1|Y2107_BORBR RecName: Full=UPF0061 protein BB2107
gi|33566306|emb|CAE37219.1| conserved hypothetical protein [Bordetella parapertussis]
gi|33575525|emb|CAE32603.1| conserved hypothetical protein [Bordetella bronchiseptica RB50]
gi|408768561|emb|CCJ53327.1| conserved hypothetical protein [Bordetella bronchiseptica 253]
gi|410566206|emb|CCN23766.1| conserved hypothetical protein [Bordetella bronchiseptica 1289]
gi|410570480|emb|CCN18662.1| conserved hypothetical protein [Bordetella bronchiseptica D445]
gi|410590314|emb|CCN05398.1| conserved hypothetical protein [Bordetella bronchiseptica Bbr77]
Length = 495
Score = 338 bits (867), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 216/536 (40%), Positives = 285/536 (53%), Gaps = 50/536 (9%)
Query: 112 VRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP 171
+++LP D ++P E YT++ P P+L+ + A + LDP EF F
Sbjct: 6 LQDLPTDNSFAALPAEF----YTRLQPRPPAA-PRLLHANAEAAALIGLDPAEFSTQAFL 60
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
FSG PL G A Y GHQFG+WAGQLGDGRA LGE+ + WELQLKGAG T
Sbjct: 61 DVFSGHAPLPGGDTLAAVYSGHQFGVWAGQLGDGRAHLLGEVRG-PAGGWELQLKGAGMT 119
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR DG AVLRSS+RE+L SEAMH LGIPTTR+L LV + V R+ E
Sbjct: 120 PYSRMGDGRAVLRSSVREYLASEAMHGLGIPTTRSLALVVSDDPVMRETV-------ETA 172
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
A+V R+A SF+RFGS++ ++R Q + +R LADY I +
Sbjct: 173 AVVTRMAPSFVRFGSFEHWSARRQPEQ--LRVLADYVIDRFYPECRVAGAGR-------- 222
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
+D + V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF
Sbjct: 223 -----LDGEHGEILGLLAAVTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGF 277
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYG 470
+D F N +D G RY + QP +GLWN+ + +++L L D EA V++ Y
Sbjct: 278 MDTFQLGHICNHSDSEG-RYAWNRQPSVGLWNLYRLASSL--HTLAPDPEALRAVLDGYE 334
Query: 471 TKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
F + M KLGLP++ ++ ++ LL M D+T FR L P
Sbjct: 335 AVFTQAFHGRMAGKLGLPQFLPEDETLLDDLLQLMHQQGADFTLAFRRLGEAVRGQRQPF 394
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
++L + + A +W S G + + R A M+ VNP YVLRN+L
Sbjct: 395 EDLFID------------RAAAGAWYDRLAARHASDGRAAQARAAAMDEVNPLYVLRNHL 442
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI AA GD GE+ LLKL+ PY +QPG + YA L P WA +SCSS
Sbjct: 443 AEQAIRAAARGDAGEIDILLKLLRNPYKQQPGYDAYAGLAPDWA---AGLEVSCSS 495
>gi|452748829|ref|ZP_21948604.1| hypothetical protein B381_13751 [Pseudomonas stutzeri NF13]
gi|452007249|gb|EMD99506.1| hypothetical protein B381_13751 [Pseudomonas stutzeri NF13]
Length = 486
Score = 338 bits (867), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 216/549 (39%), Positives = 303/549 (55%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L L +D+ F R GD + T+VSP + +P+LV SE+ L
Sbjct: 1 MKSLTQLTFDNRFAR--LGDTFS------------TEVSPQP-LSDPRLVVVSEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E+P F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLDPAEAEQPLFVELFSGHKIWSTAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ + V R
Sbjct: 106 EYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTGSDSLVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG ++ + +R +L + L ++AI HF E
Sbjct: 166 E-------RPERGAMLLRLAPSHVRFGHFEFFYYTRQHAEL---KQLLEHAIEAHF--PE 213
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ E + A+ EV ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 214 LLEHPEP-------------------FHAFFREVLERTAALIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDARFICNHSDDAG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ K ME + + E+ +M ++LG + ++ ++ +LL M VDYTNFFR
Sbjct: 312 EVKVLRETMELFLPLYEAEWLDLMRRRLGFAQAEADDEALVRRLLQLMQASAVDYTNFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
LS A+ ++ L+ +++ + + +W Y ER+ M
Sbjct: 372 ELSESPAEQAVRR------LREDFVEL-----QGFDAWAADYCARTARESSDLGERQVRM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
+VNPKY+LRNYL Q I+AAE GD+ VR L +++ RP++EQPGM++YA PP W
Sbjct: 421 QAVNPKYILRNYLAQQVIEAAEKGDYAPVRELHQVLSRPFEEQPGMQRYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|33517006|sp|Q88CW2.2|Y5068_PSEPK RecName: Full=UPF0061 protein PP_5068
Length = 486
Score = 338 bits (867), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 290/550 (52%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV SES L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IADPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN
Sbjct: 46 DLDPAQAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGI T+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIATSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L D+ + H+
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDHVLEQHYPECRE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL ++ +LL M VDY+ FF
Sbjct: 315 PLKEALGLFLPLYQAHYLD----LMRRRLGLTTAEDDDMVLVERLLQCMQRGGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P E L + +D+ + +W Y+ + E R+
Sbjct: 371 RKLGEQ------PVAEALKVARDDFIDLA-----GFDAWGADYLARCGREPGNAEGRRER 419
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ RP++EQPGM+ YA PP W
Sbjct: 420 MHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLARPFEEQPGMQAYAERPPEWGKH 479
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 480 ---LEISCSS 486
>gi|410420711|ref|YP_006901160.1| hypothetical protein BN115_2929 [Bordetella bronchiseptica MO149]
gi|408448006|emb|CCJ59685.1| conserved hypothetical protein [Bordetella bronchiseptica MO149]
Length = 495
Score = 338 bits (866), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 216/536 (40%), Positives = 285/536 (53%), Gaps = 50/536 (9%)
Query: 112 VRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP 171
+++LP D ++P E YT++ P P+L+ + A + LDP EF F
Sbjct: 6 LQDLPTDNSFAALPAEF----YTRLQPRPPAA-PRLLHANAEAAALIGLDPAEFSTQAFL 60
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
FSG PL G A Y GHQFG+WAGQLGDGRA LGE+ + WELQLKGAG T
Sbjct: 61 DVFSGHAPLPGGDTLAAVYSGHQFGVWAGQLGDGRAHLLGEVRG-PAGGWELQLKGAGMT 119
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR DG AVLRSS+RE+L SEAMH LGIPTTR+L LV + V R+ E
Sbjct: 120 PYSRMGDGRAVLRSSVREYLASEAMHGLGIPTTRSLALVVSDDPVMRETV-------ETA 172
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
A+V R+A SF+RFGS++ ++R Q + +R LADY I +
Sbjct: 173 AVVTRMAPSFVRFGSFEHWSARRQPEQ--LRVLADYVIDRFYPECRVAGAGR-------- 222
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
+D + V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF
Sbjct: 223 -----LDGEHGEILGLLAAVTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGF 277
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYG 470
+D F N +D G RY + QP +GLWN+ + +++L L D EA V++ Y
Sbjct: 278 MDTFQLGHICNHSDSEG-RYAWNRQPSVGLWNLYRLASSL--HTLAPDPEALRAVLDGYE 334
Query: 471 TKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
F + M KLGLP++ ++ ++ LL M D+T FR L P
Sbjct: 335 AVFTQAFHGRMAGKLGLPQFLPEDETLLDDLLQLMHQQGADFTLAFRRLGEAVRGQRQPF 394
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
++L + + A +W S G + + R A M+ VNP YVLRN+L
Sbjct: 395 EDLFID------------RAAAGAWYDRLAVRHASDGRAAQARAAAMDEVNPLYVLRNHL 442
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI AA GD GE+ LLKL+ PY +QPG + YA L P WA +SCSS
Sbjct: 443 AEQAIRAAARGDAGEIDILLKLLRNPYKQQPGYDAYAGLAPDWA---AGLEVSCSS 495
>gi|83719782|ref|YP_442661.1| hypothetical protein BTH_I2140 [Burkholderia thailandensis E264]
gi|257138874|ref|ZP_05587136.1| hypothetical protein BthaA_06635 [Burkholderia thailandensis E264]
gi|121957850|sp|Q2SWN8.1|Y2140_BURTA RecName: Full=UPF0061 protein BTH_I2140
gi|83653607|gb|ABC37670.1| Uncharacterized ACR, YdiU/UPF0061 family superfamily [Burkholderia
thailandensis E264]
Length = 521
Score = 338 bits (866), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 226/556 (40%), Positives = 297/556 (53%), Gaps = 75/556 (13%)
Query: 115 LPGDPRTDSIPRE----VLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
LP T + PR+ L A + P+A + P +V +S+ A L LDP + P F
Sbjct: 14 LPDLAATLAAPRDGAFLQLGAAFLTRQPAAPLPAPYVVGFSDDAARMLGLDPALRDAPGF 73
Query: 171 PLFFSGAT----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLK 226
F G P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE L R+ELQLK
Sbjct: 74 AGLFCGNPTRDWPQA-SLPYASVYSGHQFGVWAGQLGDGRALTIGE-LEHDGRRYELQLK 131
Query: 227 GAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNP 286
GAG+TPYSR DG AVLRSSIRE+LCSEAMH LGIPTTRAL ++ + + V R+
Sbjct: 132 GAGRTPYSRMGDGRAVLRSSIREYLCSEAMHHLGIPTTRALAVIGSDQPVVREEI----- 186
Query: 287 KEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESL 345
E A+V RVA+SF+RFG ++ A+ E L R LAD+ I
Sbjct: 187 --ETSAVVTRVAESFVRFGHFEHFFANDRPEQL---RALADHVI---------------- 225
Query: 346 SFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTID 405
D + + Y A E RTA LVAQWQ VGF HGV+NTDNMSILG+TID
Sbjct: 226 -----DRFYPACRDADDPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTID 280
Query: 406 YGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL-------------- 451
YGPFGF+DAFD N +D G RY + QP I WN + L
Sbjct: 281 YGPFGFIDAFDAKHVCNHSDTHG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSED 339
Query: 452 -AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKV 507
A + ++D A+ V+ R+ +F + M KLGL + + + ++LL M +
Sbjct: 340 ARAERAVED--AHAVLGRFAEQFGPALERAMRAKLGLELEREGDAALANQLLEIMDASRA 397
Query: 508 DYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD 567
D+T FR L+ V + + P++ + +D ++A+ W Y L D
Sbjct: 398 DFTLTFRHLARVSKHDARGD----APVRDLFVD-----RDAFDRWANLYRARLSEEARDD 448
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLP 627
R A MN NPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LP
Sbjct: 449 AARAAAMNRANPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALP 508
Query: 628 PAWAYRPGVCMLSCSS 643
P WA +SCSS
Sbjct: 509 PDWA---STLEVSCSS 521
>gi|339495909|ref|YP_004716202.1| hypothetical protein PSTAB_3832 [Pseudomonas stutzeri ATCC 17588 =
LMG 11199]
gi|338803281|gb|AEJ07113.1| hypothetical protein PSTAB_3832 [Pseudomonas stutzeri ATCC 17588 =
LMG 11199]
Length = 486
Score = 338 bits (866), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 217/549 (39%), Positives = 300/549 (54%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L L++D+ F R GD + T+VSP +E P+LV SE+ L
Sbjct: 1 MKTLTQLHFDNRFAR--LGDTFS------------TQVSPQP-LEAPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E+ F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLDPAEAEQALFAELFSGHKIWSTAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ + V R
Sbjct: 106 EYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTGSDTLVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG ++ + +R +L + L ++ + HF +
Sbjct: 166 E-------RPERGAMLLRLAPSHVRFGHFEFFYYTRQHSEL---KQLFEHVVEAHFPELL 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + F T V ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 216 EHPEPFHMFFRT---------------------VLERTAALIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDARFICNHSDDTG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ K ME + + E+ +M ++LG + + ++I +LL M VDYT FFR
Sbjct: 312 EVKVLRETMELFLPLYEAEWLDLMRRRLGFSQAEDGDAELIRRLLQLMQGSAVDYTRFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
L P ++ + L+ +D+ + + +W Y S G R+A M
Sbjct: 372 ELGER------PVEQAVQRLREDFIDL-----QGFDAWAADYCARSASEGGDPVARQARM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNPKY+LRNYL Q AI+AAE GD+ VR L ++ RP+DEQPGME+YA PP W
Sbjct: 421 HAVNPKYILRNYLAQQAIEAAEKGDYAPVRELHAVLSRPFDEQPGMERYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|167619714|ref|ZP_02388345.1| hypothetical protein BthaB_25647 [Burkholderia thailandensis Bt4]
Length = 521
Score = 338 bits (866), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 226/556 (40%), Positives = 297/556 (53%), Gaps = 75/556 (13%)
Query: 115 LPGDPRTDSIPRE----VLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
LP T + PR+ L A + P+A + P +V +S+ A L LDP + P F
Sbjct: 14 LPDLAATLAAPRDGAFLQLGAAFLTRQPAAPLPAPYVVGFSDDAARMLGLDPALRDAPGF 73
Query: 171 PLFFSGAT----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLK 226
F G P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE L R+ELQLK
Sbjct: 74 AGLFCGNPTRDWPQA-SMPYASVYSGHQFGVWAGQLGDGRALTIGE-LEHDGRRYELQLK 131
Query: 227 GAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNP 286
GAG+TPYSR DG AVLRSSIRE+LCSEAMH LGIPTTRAL ++ + + V R+
Sbjct: 132 GAGRTPYSRMGDGRAVLRSSIREYLCSEAMHHLGIPTTRALAVIGSDQPVVREEI----- 186
Query: 287 KEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESL 345
E A+V RVA+SF+RFG ++ A+ E L R LAD+ I
Sbjct: 187 --ETSAVVTRVAESFVRFGHFEHFFANDRPEQL---RALADHVI---------------- 225
Query: 346 SFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTID 405
D + + Y A E RTA LVAQWQ VGF HGV+NTDNMSILG+TID
Sbjct: 226 -----DRFYPACRDADDPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTID 280
Query: 406 YGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL-------------- 451
YGPFGF+DAFD N +D G RY + QP I WN + L
Sbjct: 281 YGPFGFIDAFDAKHVCNHSDTHG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSED 339
Query: 452 -AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKV 507
A + ++D A+ V+ R+ +F + M KLGL + + + ++LL M +
Sbjct: 340 ARAERAVED--AHAVLGRFAEQFGPALERAMRAKLGLELEREGDAALANQLLEIMDASRA 397
Query: 508 DYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD 567
D+T FR L+ V + + P++ + +D ++A+ W Y L D
Sbjct: 398 DFTLTFRHLARVSKHDARGD----APVRDLFVD-----RDAFDRWANLYRARLSEEARDD 448
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLP 627
R A MN NPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LP
Sbjct: 449 AARAAAMNRANPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALP 508
Query: 628 PAWAYRPGVCMLSCSS 643
P WA +SCSS
Sbjct: 509 PDWA---STLEVSCSS 521
>gi|26991744|ref|NP_747169.1| hypothetical protein PP_5068 [Pseudomonas putida KT2440]
gi|24986851|gb|AAN70633.1|AE016707_3 conserved hypothetical protein [Pseudomonas putida KT2440]
Length = 540
Score = 338 bits (866), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 290/550 (52%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV SES L
Sbjct: 55 VKALDQLTFDNRFARL--GD------------AFSTQVLPEP-IADPRLVVASESAMALL 99
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN
Sbjct: 100 DLDPAQAELPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDAG 159
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGI T+RALC++ + V R
Sbjct: 160 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIATSRALCVIGSSTPVWR 219
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L D+ + H+
Sbjct: 220 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDHVLEQHYPECRE 270
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 271 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 309
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L ++
Sbjct: 310 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEVE 368
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL ++ +LL M VDY+ FF
Sbjct: 369 PLKEALGLFLPLYQAHYLD----LMRRRLGLTTAEDDDMVLVERLLQCMQRGGVDYSLFF 424
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P E L + +D+ + +W Y+ + E R+
Sbjct: 425 RKLGEQ------PVAEALKVARDDFIDLA-----GFDAWGADYLARCGREPGNAEGRRER 473
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ RP++EQPGM+ YA PP W
Sbjct: 474 MHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLARPFEEQPGMQAYAERPPEWGKH 533
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 534 ---LEISCSS 540
>gi|145297287|ref|YP_001140128.1| hypothetical protein ASA_0185 [Aeromonas salmonicida subsp.
salmonicida A449]
gi|418362040|ref|ZP_12962684.1| hypothetical protein IYQ_16989 [Aeromonas salmonicida subsp.
salmonicida 01-B526]
gi|166225454|sp|A4SHK8.1|Y185_AERS4 RecName: Full=UPF0061 protein ASA_0185
gi|142850059|gb|ABO88380.1| conserved hypothetical protein [Aeromonas salmonicida subsp.
salmonicida A449]
gi|356686675|gb|EHI51268.1| hypothetical protein IYQ_16989 [Aeromonas salmonicida subsp.
salmonicida 01-B526]
Length = 475
Score = 337 bits (865), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 203/468 (43%), Positives = 256/468 (54%), Gaps = 53/468 (11%)
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
PL G P AQ Y GHQFG ++ +LGDGRA+ LGE L +RW+L LKGAGKTP+SRF D
Sbjct: 58 PLPGMQPVAQVYAGHQFGGYSPRLGDGRALLLGEQLATDGQRWDLHLKGAGKTPFSRFGD 117
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRSSIRE+L SEA+H LGIPTTRAL LV + + V R+ +EE GA V R A
Sbjct: 118 GRAVLRSSIREYLASEALHALGIPTTRALVLVGSKEPVYRE-------QEETGATVLRTA 170
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
S LRFG + A GQ + + L DY +R+HF +EN
Sbjct: 171 PSHLRFGHIEYFAWSGQG--EKIPALIDYLLRYHFPELENG------------------- 209
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
A EV RTA L+A+WQ GF HGVLNTDNMS+LGLT+DYGP+GF+DA+ P
Sbjct: 210 ------AELFAEVVRRTARLIAKWQAAGFCHGVLNTDNMSLLGLTLDYGPYGFIDAYVPD 263
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N +D P RY QP +G WN+ + + LA +D + +Y + M Y
Sbjct: 264 FVCNHSD-PDGRYALDQQPAVGYWNLQKLAQALAGH--VDGDALATSLAQYEHQLMLHYS 320
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
+M KLGL ++ ++ + +L +A VDY F R L V P L +
Sbjct: 321 ELMRAKLGLTQWEEEDPALFRQLFQLLASQGVDYHLFLRRLGEVTGTGEWPASLLAL--- 377
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+ W W+ Y L G D RKA M+++NPKYVLRN L Q AIDAA
Sbjct: 378 -------LPDPDLWQGWLELYRVRLTREGGEDAVRKAQMDAINPKYVLRNALAQQAIDAA 430
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GD + RLL +++PYDEQP A P W Y G LSCSS
Sbjct: 431 EGGDMTQFERLLAALQQPYDEQPEYADLATPVPQW-YCGG--ELSCSS 475
>gi|187478767|ref|YP_786791.1| hypothetical protein BAV2277 [Bordetella avium 197N]
gi|121957857|sp|Q2KYJ8.1|Y2277_BORA1 RecName: Full=UPF0061 protein BAV2277
gi|115423353|emb|CAJ49887.1| conserved hypothetical protein [Bordetella avium 197N]
Length = 490
Score = 337 bits (865), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 222/517 (42%), Positives = 281/517 (54%), Gaps = 55/517 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
YT+++ + + P+L+ + A + LDP E F SG PL G A Y G
Sbjct: 23 YTRLA-AQPLGRPRLLHANAEAAALIGLDPAELHTQAFLEVASGQRPLPGGDTLAAVYSG 81
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRA LGE+ WELQLKGAG TPYSR DG AVLRSS+RE+L
Sbjct: 82 HQFGVWAGQLGDGRAHLLGEVRG-PGGSWELQLKGAGLTPYSRMGDGRAVLRSSVREYLA 140
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL LV + V R+ E AIV R++ SF+RFGS++ +S
Sbjct: 141 SEAMHGLGIPTTRALALVVSDDPVMRE-------TRETAAIVTRMSPSFVRFGSFEHWSS 193
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R D + +R LADY I + N E V+ L EV+
Sbjct: 194 R--RDGERLRILADYVIDRFYPQCREANG----------EHGDVLALLR--------EVS 233
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
+RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGF+DAF N +D G RY
Sbjct: 234 QRTAHLMADWQSVGFCHGVMNTDNMSILGLTLDYGPFGFMDAFQLGHVCNHSDSEG-RYA 292
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYN 491
+ QP + LWN+ + +L L+ D +A V+ Y T F + A M KLGL +
Sbjct: 293 WNRQPSVALWNLYRLGGSLHG--LVPDADALRGVLAEYETLFTQAFHARMGAKLGLSVWQ 350
Query: 492 K---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD--IGKERK 546
++ LL M + D+T FRAL+ + P LD I +E
Sbjct: 351 SDDEALLDDLLRLMHDSRADFTLTFRALAQAVRGQTQP-----------FLDYFIDREAA 399
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
+AW S + + G + R M+ VNP YVLRN+L + AI AA+ GD E+ RL
Sbjct: 400 QAWWSRLAA---RHACDGRAAAVRAEGMDRVNPLYVLRNHLAEQAIRAAQQGDASEIDRL 456
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L L+ RPYD QPG E YA LPP WA V SCSS
Sbjct: 457 LGLLRRPYDLQPGAEAYAALPPDWAAGLSV---SCSS 490
>gi|167581598|ref|ZP_02374472.1| hypothetical protein BthaT_25874 [Burkholderia thailandensis TXDOH]
Length = 521
Score = 337 bits (865), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 226/556 (40%), Positives = 297/556 (53%), Gaps = 75/556 (13%)
Query: 115 LPGDPRTDSIPRE----VLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDF 170
LP T + PR+ L A + P+A + P +V +S+ A L LDP + P F
Sbjct: 14 LPDLAATLAAPRDGAFLQLGAAFLTRQPAAPLPAPYVVGFSDDAARMLGLDPALRDAPGF 73
Query: 171 PLFFSGAT----PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLK 226
F G P A ++PYA Y GHQFG+WAGQLGDGRA+T+GE L R+ELQLK
Sbjct: 74 AGLFCGNPTRDWPQA-SMPYASVYSGHQFGVWAGQLGDGRALTIGE-LEHGGRRYELQLK 131
Query: 227 GAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNP 286
GAG+TPYSR DG AVLRSSIRE+LCSEAMH LGIPTTRAL ++ + + V R+
Sbjct: 132 GAGRTPYSRMGDGRAVLRSSIREYLCSEAMHHLGIPTTRALAVIGSDQPVVREEI----- 186
Query: 287 KEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESL 345
E A+V RVA+SF+RFG ++ A+ E L R LAD+ I
Sbjct: 187 --ETSAVVTRVAESFVRFGHFEHFFANDRPEQL---RALADHVI---------------- 225
Query: 346 SFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTID 405
D + + Y A E RTA LVAQWQ VGF HGV+NTDNMSILG+TID
Sbjct: 226 -----DRFYPACRDADDPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDNMSILGVTID 280
Query: 406 YGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL-------------- 451
YGPFGF+DAFD N +D G RY + QP I WN + L
Sbjct: 281 YGPFGFIDAFDAKHVCNHSDTHG-RYAYRMQPRIAHWNCFCLAQALLPLIGLHRDAPSED 339
Query: 452 -AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKV 507
A + ++D A+ V+ R+ +F + M KLGL + + + ++LL M +
Sbjct: 340 ARAERAVED--AHAVLGRFAEQFGPALERAMRAKLGLELEREGDAALANQLLEIMDASRA 397
Query: 508 DYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD 567
D+T FR L+ V + + P++ + +D ++A+ W Y L D
Sbjct: 398 DFTLTFRRLARVSKHDARGD----APVRDLFVD-----RDAFDRWANLYRARLSEEARDD 448
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLP 627
R A MN NPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP + YA LP
Sbjct: 449 AARAAAMNRANPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQPEHDAYAALP 508
Query: 628 PAWAYRPGVCMLSCSS 643
P WA +SCSS
Sbjct: 509 PDWA---STLEVSCSS 521
>gi|410472646|ref|YP_006895927.1| hypothetical protein BN117_1987 [Bordetella parapertussis Bpp5]
gi|408442756|emb|CCJ49320.1| conserved hypothetical protein [Bordetella parapertussis Bpp5]
Length = 495
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 216/536 (40%), Positives = 285/536 (53%), Gaps = 50/536 (9%)
Query: 112 VRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP 171
+++LP D ++P E YT++ P P+L+ + A + LDP EF F
Sbjct: 6 LQDLPTDNSFAALPAEF----YTRLQPRPPAV-PRLLHANAEAAALIGLDPAEFSTQAFL 60
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
FSG PL G A Y GHQFG+WAGQLGDGRA LGE+ + WELQLKGAG T
Sbjct: 61 DVFSGHAPLPGGDTLAAVYSGHQFGVWAGQLGDGRAHLLGEVRG-PAGGWELQLKGAGMT 119
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR DG AVLRSS+RE+L SEAMH LGIPTTR+L LV + V R+ E
Sbjct: 120 PYSRMGDGRAVLRSSVREYLASEAMHGLGIPTTRSLALVVSDDPVMRETV-------ETA 172
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
A+V R+A SF+RFGS++ ++R Q + +R LADY I +
Sbjct: 173 AVVTRMAPSFVRFGSFEHWSARRQPEQ--LRVLADYVIDRFYPECRVAGAGR-------- 222
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
+D + V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF
Sbjct: 223 -----LDGEHGEILGLLAAVTRRTAFLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGF 277
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYG 470
+D F N +D G RY + QP +GLWN+ + +++L L D EA V++ Y
Sbjct: 278 MDTFQLGHICNHSDSEG-RYAWNRQPSVGLWNLYRLASSL--HTLAPDPEALRAVLDGYE 334
Query: 471 TKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
F + M KLGLP++ ++ ++ LL M D+T FR L P
Sbjct: 335 AVFTQAFHGRMAGKLGLPQFLPEDETLLDDLLQLMHQQGADFTLAFRRLGEAVRGQRQPF 394
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
++L + + A +W S G + + R A M+ VNP YVLRN+L
Sbjct: 395 EDLFID------------RAAAGAWYDRLAARHASDGRAAQARAAAMDEVNPLYVLRNHL 442
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI AA GD GE+ LLKL+ PY +QPG + YA L P WA +SCSS
Sbjct: 443 AEQAIRAAARGDAGEIDILLKLLRNPYKQQPGYDAYAGLAPDWA---AGLEVSCSS 495
>gi|238796340|ref|ZP_04639849.1| hypothetical protein ymoll0001_21680 [Yersinia mollaretii ATCC
43969]
gi|238719785|gb|EEQ11592.1| hypothetical protein ymoll0001_21680 [Yersinia mollaretii ATCC
43969]
Length = 491
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 205/520 (39%), Positives = 280/520 (53%), Gaps = 52/520 (10%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPY 186
+ L YT + P+ ++ L+ SE +A L LD F P ++ +G T L G P
Sbjct: 21 QQLSGFYTHLQPTP-LKGAHLLYHSEPLAQELGLDASWFSGPKAAVW-AGETLLPGMEPL 78
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR DG AVLRS
Sbjct: 79 AQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRSMDWHLKGAGLTPYSRMGDGRAVLRSV 138
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
+REFL SEA+H LGIPT+RAL +VT+ V R+ + + GA++ RVA+S +RFG
Sbjct: 139 VREFLASEALHHLGIPTSRALTIVTSHHPVYRE-------QPDRGAMLLRVAESHVRFGH 191
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ R Q + V+ LADY I H+ + +Y
Sbjct: 192 FEHFYYRQQPEQ--VKQLADYVIARHWPQFVG---------------------HTEQYLL 228
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
W +V +RTA L+A WQ VGF HGV+NTDNMSILG+T+DYGPFGFLD + P + N +D
Sbjct: 229 WFTDVVKRTARLMAHWQTVGFAHGVMNTDNMSILGITMDYGPFGFLDDYVPGYICNHSDH 288
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG 486
G RY F NQP + LWN+ + L+ L+ ++ + Y + M Y M KLG
Sbjct: 289 QG-RYAFDNQPAVALWNLHRLGQALSG--LMSVEQLQLALNAYEPELMAAYGQQMRAKLG 345
Query: 487 LPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L Q ++++LL+ M + DYT FR LS V+ + PL+ +D
Sbjct: 346 LFDSGDQDNDLLTELLSLMIREGRDYTRTFRLLSEVEIHSAQS------PLRDDFVD--- 396
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
+ + SW Y L + D +R+ M +VNPKY+LRNYL Q AID AE D +
Sbjct: 397 --RAGFDSWYSRYRARLQQESVDDAQRQHAMKAVNPKYILRNYLAQLAIDHAEKDDIQPL 454
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+RL + ++ P+ +QP + A LPP W +SCSS
Sbjct: 455 QRLHQALQHPFADQPEFDDLAALPPDWGKH---LEISCSS 491
>gi|294872672|ref|XP_002766364.1| Selenoprotein O, putative [Perkinsus marinus ATCC 50983]
gi|239867169|gb|EEQ99081.1| Selenoprotein O, putative [Perkinsus marinus ATCC 50983]
Length = 628
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 235/611 (38%), Positives = 318/611 (52%), Gaps = 110/611 (18%)
Query: 100 KALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
+ LE L D +P PR V +A Y V P + PQ V S S L
Sbjct: 43 RVLEQLPVDRKLHEGVPNQPRP------VPNAIYAAV-PFQPLSKPQTVCISPSAFRLLG 95
Query: 160 ----LDPKEFERPDFPLFFSGATPLAGAV-PYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
+D E + F + SG+ + G+ P A Y GHQFG ++GQLGDG A+ LGE+
Sbjct: 96 VFHGIDYDELDEA-FAEYISGSRRIPGSPGPAAHVYCGHQFGYFSGQLGDGAAMLLGEVN 154
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCL-VTTG 273
+ E+QLKG+GKTP+SR ADG VLRS+IREFLCSE MH LGIPTTRA + V+
Sbjct: 155 GI-----EIQLKGSGKTPFSRSADGRKVLRSTIREFLCSEHMHALGIPTTRAAAVSVSFE 209
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS------RG---QEDLDIVRTL 324
V RD+ YDGN K EP A+V R+A++FLRFGS++I S RG D +++ L
Sbjct: 210 DQVIRDINYDGNAKLEPTAVVVRLAETFLRFGSFEIFKSTDSITGRGGPSAGDTALLQKL 269
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
D+ I +++ D + + V+ K + V ERTA LVA+WQ
Sbjct: 270 VDFVINNYYEA------------ECADIEETSVE---KKCEQFFQAVVERTAKLVAKWQC 314
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSI+G TIDYGP+GF++AF + NT+D G RY + QP I LWN
Sbjct: 315 VGFCHGVLNTDNMSIVGDTIDYGPYGFVEAFQRDYICNTSDT-GGRYTYEAQPRICLWNC 373
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNN 501
+ + LA L +K + + YG FM EY+ +M KLGL + + ++ +LL+
Sbjct: 374 TKLAEALAPI-LDPEKSTDILRSTYGRVFMKEYKRLMAMKLGLVEEREGDSDLVERLLDT 432
Query: 502 MAVDKVDYTNFFRALSNVKADPS----------------IPED-----------ELLVPL 534
M D+TN FRALS VK D +PE+ E+L L
Sbjct: 433 MENTAADFTNTFRALSTVKVDGDDTDYGDAIERIIESCLVPEELAARIKVPVRPEVLAQL 492
Query: 535 KAVL--------LDIGKER------------------------KEAWISWVLSYIQELLS 562
K V +D G R +EAW W+ SY++ L++
Sbjct: 493 KLVNPQTLPLYGIDEGALRRWEEELDKKRQYLNMDESTKRESDREAWSKWLESYVRRLIA 552
Query: 563 SG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGM 620
SD++R MN VNPK VLRN+L Q IDAAE G+F VR LL+++ P+ E+
Sbjct: 553 ETGRRSDKDRSDHMNRVNPKVVLRNHLAQKVIDAAEEGNFAPVRELLQVLVDPFSEKIP- 611
Query: 621 EKYARLPPAWA 631
E++ + PP A
Sbjct: 612 EEFTKPPPPGA 622
>gi|409396913|ref|ZP_11247856.1| hypothetical protein C211_15650 [Pseudomonas sp. Chol1]
gi|409118415|gb|EKM94814.1| hypothetical protein C211_15650 [Pseudomonas sp. Chol1]
Length = 485
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 216/549 (39%), Positives = 304/549 (55%), Gaps = 68/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L +L++D+ F R GD A T V P + +P+LV SE+ L
Sbjct: 1 MKTLTELHFDNRFAR--LGD------------AFSTAVEPQP-LADPRLVVVSEAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E+P F FSG + A P A Y GHQFG++ QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAEQPLFVELFSGHKLWSTAEPRAMVYSGHQFGVYNPQLGDGRGLLLGEVRNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SE + LGIP+TRALC+ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEHLAALGIPSTRALCVTASATPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ ++E GA++ R+A S LRFG ++ + +R +L + L DY++ HF +
Sbjct: 166 E-------RQERGAMLLRLAPSHLRFGHFEFFYYTRQHAEL---KQLLDYSLEAHFAPLR 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ Y A EV ERTA+LVA+WQ GF HGV+NTDNM
Sbjct: 216 EQPEP---------------------YLALFREVLERTAALVARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
S+LG+T+D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SLLGITLDFGPYAFLDDFDARFICNHSDDRG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ ME + + E+ +M ++LG + +++++ +LL M VDYT FFR
Sbjct: 312 EVTRLRETMELFLPLYEAEWLDLMRRRLGFTRAEADDERLVRRLLQLMQDSAVDYTRFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
L + A ++ L+ +D+ + +W Y + ++ +R+A M
Sbjct: 372 ELGDSPAPQAVQR------LREDFVDLA-----GFDAWAADYCAR-SARDATETDRQARM 419
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNPKY+LRNYL Q AI+AAE GD+G VR L ++ RP+DEQPGM++YA PP W
Sbjct: 420 HAVNPKYILRNYLAQQAIEAAEQGDYGPVRELHAVLGRPFDEQPGMQRYAERPPGWGKH- 478
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 479 --LEISCSS 485
>gi|386824765|ref|ZP_10111894.1| hypothetical protein Q5A_11171 [Serratia plymuthica PRI-2C]
gi|386378210|gb|EIJ19018.1| hypothetical protein Q5A_11171 [Serratia plymuthica PRI-2C]
Length = 480
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 211/528 (39%), Positives = 290/528 (54%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P+ ++ L YT++ P+ ++ +L+ SE +A L LD F + P++ SG T
Sbjct: 2 PQFENAYHHQLPGFYTELKPTP-LKGARLLYHSEPLARELGLDESWFTQDKSPIW-SGET 59
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR D
Sbjct: 60 LLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQKLADGRSMDWHLKGAGLTPYSRMGD 119
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS+IREFL SEA+H LGIPTTRAL LVT+ + V R+ + E GA++ RVA
Sbjct: 120 GRAVLRSAIREFLASEALHHLGIPTTRALTLVTSEQPVFRE-------QPERGAMLLRVA 172
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q + V+ LAD+ I H+ + +DH
Sbjct: 173 ESHVRFGHFEHFYYRKQPEQ--VQQLADFVIARHWPQL---------------KDH---- 211
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
+ Y W ++V ERTA L+A WQ VGF HGV+NTDNMSILG+TIDYGP+ FLD + P
Sbjct: 212 --DDGYLPWFIDVVERTARLIAHWQTVGFAHGVMNTDNMSILGITIDYGPYAFLDDYKPD 269
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N +D G RY F NQP + LWN+ + + L+ L+ ++ + Y M Y
Sbjct: 270 FICNHSDHQG-RYAFDNQPAVALWNLHRLAQALSG--LMTTEQLQQALAAYEPALMRAYG 326
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + Q +++ LL+ MA + DYT FR LS + + + PL+
Sbjct: 327 EQMRAKLGFFTQSTQDNDLLTGLLSLMAQEGRDYTRTFRLLSQTE------QQQAQSPLR 380
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D G A+ +W Y Q L +SD ER+ M + NPK +LRNYL Q AI++A
Sbjct: 381 DEFIDRG-----AFDAWYQQYRQRLQQEQVSDSERQQAMKAANPKLILRNYLAQQAIESA 435
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D ++ RL + + P+ + + A LPP W +SCSS
Sbjct: 436 EQDDVSKLARLHQALLTPFADAAEYDDLAALPPDWGKH---LEISCSS 480
>gi|431925603|ref|YP_007238637.1| hypothetical protein Psest_0396 [Pseudomonas stutzeri RCH2]
gi|431823890|gb|AGA85007.1| hypothetical protein Psest_0396 [Pseudomonas stutzeri RCH2]
Length = 486
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 215/549 (39%), Positives = 299/549 (54%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L L +D+ F R GD + T+VSP + +P+LV SE+ L
Sbjct: 1 MKSLTQLTFDNRFAR--LGDTFS------------TEVSPQP-LSDPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P E E+P F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLAPTEAEQPLFTKLFSGHKIWSTAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ ++ V R
Sbjct: 106 EYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTSSDSLVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG ++ + +R +L + L ++ I HF +
Sbjct: 166 E-------RPERGAMLLRLAPSHVRFGHFEFFYYTRQHGEL---KQLLEHVIAAHFAELL 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ F T V ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 216 EHPEPFHAFFRT---------------------VLERTAALIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDARFICNHSDDTG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFR 514
+ K ME + + E+ +M ++LG + + ++ +LL M VDYTNFFR
Sbjct: 312 EVKVLRETMELFLPLYEAEWLDLMRRRLGFAQAEATDDALVRRLLQLMQASAVDYTNFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
LS A+ ++ L+ +D+ + + +W Y G ER+ M
Sbjct: 372 ELSESPAEQAVRR------LREDFVDL-----QGFDAWAADYCTRTALEGGDPAERQTRM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
+VNPKY+LRNYL Q AI+AAE GD+ VR L ++ RP++EQPGM++YA PP W
Sbjct: 421 QAVNPKYILRNYLAQQAIEAAEKGDYAPVRELHTVLARPFEEQPGMQRYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|421482937|ref|ZP_15930516.1| hypothetical protein QWC_10019 [Achromobacter piechaudii HLE]
gi|400198741|gb|EJO31698.1| hypothetical protein QWC_10019 [Achromobacter piechaudii HLE]
Length = 495
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 217/517 (41%), Positives = 287/517 (55%), Gaps = 46/517 (8%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT+++P + +P+L+ + A + LDP P+F FSG+ PL G A Y
Sbjct: 21 AFYTRLTPQG-LNHPRLLHANAEAAALIGLDPAVLSTPEFLAVFSGSQPLPGGDTLAAVY 79
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGE+ WELQLKGAG TPYSR DG AVLRSS+RE+
Sbjct: 80 SGHQFGVWAGQLGDGRAHLLGEVEG-PDGGWELQLKGAGMTPYSRMGDGRAVLRSSVREY 138
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH LGIPTTRAL LV + V R+ E AIV R++ SF+RFGS++
Sbjct: 139 LASEAMHGLGIPTTRALALVGSDDPVMRETV-------ETAAIVTRMSPSFVRFGSFEHW 191
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+SR Q +L ++TLADY I + E LS E ++L
Sbjct: 192 SSRRQPEL--LKTLADYVIDRFYPECRESPTGEPLS-----ETAPYINLLR--------A 236
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF+D F N +D G R
Sbjct: 237 VTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGFMDGFRLGHVCNHSDSEG-R 295
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP + LWN+ + +L A L+ D E V++ + F + M KLGL
Sbjct: 296 YSWNRQPSVALWNLYRLGGSLHA--LVQDVEGLRAVLDEFEAVFTRAFHDRMGAKLGLAA 353
Query: 490 YN---KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ + ++ LL M ++ D+T +R L AD + E +A D+ +R+
Sbjct: 354 WRPADEALLDDLLKLMDANQADFTLSWRRL----ADAVLGE-------RAAFQDLFIDRQ 402
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
A +W+ + G EE MN VNP YVLRN+L + AI AA+ GD E+ L
Sbjct: 403 AA-SAWLDRLLARHAEDGRPAEETAQAMNRVNPLYVLRNHLAEEAIRAAKAGDVSEIDTL 461
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+KL+ P+ Q G E+YA LPP WA G +SCSS
Sbjct: 462 MKLLRAPFVAQAGYERYAGLPPDWA---GSLEVSCSS 495
>gi|89076698|ref|ZP_01162989.1| hypothetical protein SKA34_14565 [Photobacterium sp. SKA34]
gi|89047651|gb|EAR53257.1| hypothetical protein SKA34_14565 [Photobacterium sp. SKA34]
Length = 487
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 205/513 (39%), Positives = 278/513 (54%), Gaps = 50/513 (9%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T V+P + NP L++ + +A LELD + DF FSG LAG P A Y GH
Sbjct: 22 TFVTPQP-LSNPYLMSVNPHIAKLLELDINAIQSDDFINIFSGNDTLAGFDPIAMKYTGH 80
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG + LGDGR + LGE+ + ++W++ LKG+G TPYSR DG AV+RSSIRE+L S
Sbjct: 81 QFGQYNPDLGDGRGLLLGEVQTSQGKKWDIHLKGSGLTPYSRMGDGRAVIRSSIREYLAS 140
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
AM LGIPT+ AL ++ + V R+ K+E GA + RV++S +RFG ++
Sbjct: 141 AAMAGLGIPTSHALAVIGSDTHVYRE-------KQEFGATLIRVSESHIRFGHFEYLFYT 193
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
Q D +R LADY I+HHF + + K YAA +V E
Sbjct: 194 QQHDQ--LRLLADYVIQHHFPECQQVEK---------------------PYAALFEQVCE 230
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
TA ++A WQ VGF HGV+NTDNMSILGLT DYGP+GFLD ++P + N +D G RY F
Sbjct: 231 NTAKMIAHWQAVGFAHGVMNTDNMSILGLTFDYGPYGFLDDYNPGYICNHSDYSG-RYAF 289
Query: 434 ANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKY 490
QP IGLWN++ LA +ID + + +E Y + Y +M +KLGL +
Sbjct: 290 NQQPSIGLWNLSALGYALAP--IIDKSDIEHALEIYQHQLQMHYSKLMRQKLGLFDSQEQ 347
Query: 491 NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWI 550
+ ++ +L N + +DYT FFR LS + D L A I +
Sbjct: 348 DNELFQQLFNLLKQQSIDYTQFFRTLSTLSQDELDNTSSHFSSLTANTTPIDE------- 400
Query: 551 SWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLM 610
W+ Y + + S D++R ALM NPKY+LRNYL Q AID AE G+F + LL ++
Sbjct: 401 -WLADYKKRI--SNTDDQQRLALMLKSNPKYILRNYLAQLAIDGAEQGNFTFIENLLTVL 457
Query: 611 ERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
P+ E P E A LPP W +SCSS
Sbjct: 458 HDPFGEHPNFEDLADLPPKWGKE---LEISCSS 487
>gi|146284193|ref|YP_001174346.1| hypothetical protein PST_3881 [Pseudomonas stutzeri A1501]
gi|166201477|sp|A4VRA3.1|Y3881_PSEU5 RecName: Full=UPF0061 protein PST_3881
gi|145572398|gb|ABP81504.1| conserved hypothetical protein [Pseudomonas stutzeri A1501]
Length = 486
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 217/549 (39%), Positives = 299/549 (54%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L L++D+ F R GD + T+VSP +E P+LV SE+ L
Sbjct: 1 MKTLTQLHFDNRFAR--LGDTFS------------TQVSPQP-LEAPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E+ F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLDPAEAEQALFAELFSGHKIWSTAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ + V R
Sbjct: 106 EYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTGSDTLVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG ++ + +R +L + L ++ I HF +
Sbjct: 166 E-------RPERGAMLLRLAPSHVRFGHFEFFYYTRQHGEL---KQLLEHVIEVHFPELL 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + F T V ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 216 EHPEPFHMFFRT---------------------VLERTAALIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDARFICNHSDDTG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ K ME + + E+ +M ++LG + + ++I +LL M VDYT FFR
Sbjct: 312 EVKVLRETMELFLPLYEAEWLDLMRRRLGFSQAEDGDAELIRRLLQLMQGSAVDYTRFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
L A+ ++ L+ +D+ + + +W Y G R+A M
Sbjct: 372 ELGERPAEQAVQR------LREDFIDL-----QGFDAWAADYCARSAREGGDPVARQARM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNPKY+LRNYL Q AI+AAE GD+ VR L ++ RP+DEQPGME+YA PP W
Sbjct: 421 HAVNPKYILRNYLAQQAIEAAEKGDYAPVRELHAVLSRPFDEQPGMERYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|238787108|ref|ZP_04630908.1| hypothetical protein yfred0001_5940 [Yersinia frederiksenii ATCC
33641]
gi|238724896|gb|EEQ16536.1| hypothetical protein yfred0001_5940 [Yersinia frederiksenii ATCC
33641]
Length = 503
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 209/533 (39%), Positives = 283/533 (53%), Gaps = 52/533 (9%)
Query: 114 ELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF 173
E P+ D+ + L YT + P+ ++ +L SE +A L LD F P ++
Sbjct: 20 EFDNAPQFDNSYGQQLSGFYTHLQPTP-LKGARLFYHSEPLAQELGLDASWFSTPKSAVW 78
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
+G L G P AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPY
Sbjct: 79 -AGERLLPGMEPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRSMDWHLKGAGLTPY 137
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AVLRS +REFL SEA+H LG+PT+RAL +VT+ V R+ + E GA+
Sbjct: 138 SRMGDGRAVLRSVVREFLASEALHHLGVPTSRALTIVTSDHPVYRE-------QPERGAM 190
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ RVA+S +RFG ++ R Q V+ LADY I H+ G E+
Sbjct: 191 LLRVAESHVRFGHFEHFYYRQQPAQ--VKQLADYVIARHWPQF------------VGQEE 236
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
Y W +V +RTA L+A WQ GF HGV+NTDNMSILG+T+DYGPFGFLD
Sbjct: 237 ---------CYLLWFTDVVKRTAGLMAHWQTKGFAHGVMNTDNMSILGITMDYGPFGFLD 287
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKF 473
+ P + N +D G RY F NQP + LWN+ + L+ L+ ++ +E Y +
Sbjct: 288 DYAPGYICNHSDHQG-RYAFDNQPAVALWNLHRLGQALSG--LMSTEQLQLALEAYEPEL 344
Query: 474 MDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
M Y M KLG + Q +++ LL+ M + DYT FR LS V+ +
Sbjct: 345 MAAYGQQMRAKLGFSHSDSQDNDLLTGLLSLMIKEGRDYTRTFRLLSEVEMHSTQS---- 400
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
PL+ +D + A+ SW Y L I D +R+ M + NPKY+LRNYL Q
Sbjct: 401 --PLRDDFID-----RAAFDSWFSRYRLRLQQESIDDVQRQQAMKAANPKYILRNYLAQL 453
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AID AE D ++RL + +++P+ +QP A LPP W +SCSS
Sbjct: 454 AIDHAEKDDIEFLQRLHQALQQPFADQPEFNDLAELPPDWGKH---LEISCSS 503
>gi|444351878|ref|YP_007388022.1| Selenoprotein O and cysteine-containing homologs [Enterobacter
aerogenes EA1509E]
gi|443902708|emb|CCG30482.1| Selenoprotein O and cysteine-containing homologs [Enterobacter
aerogenes EA1509E]
Length = 480
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 206/523 (39%), Positives = 281/523 (53%), Gaps = 57/523 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ ++N +L+ + ++A +L + F + G L G P
Sbjct: 10 RDRLPGFYTSLAPTP-LDNARLIWRNTALAQTLGVPETLFNPQHGAGVWGGEAVLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE +R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLPDGQRFDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ +EE G ++ R+A+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVYRE-------REERGTMLMRIAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LADY I HH+ ++ ++KY
Sbjct: 182 HFEHFYYR--REAEKVQQLADYVIEHHWPQLQQ---------------------EADKYI 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 LWFRDVVTRTAEMIASWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFDNQPAVGLWNLQRLAQSL--SPFISADALNAALDDYQPALLTAYGRRMRDKL 335
Query: 486 GLPKYNKQ-----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
G Y +Q ++ L + M + DYT FR LS + + PL+ +D
Sbjct: 336 GF--YTQQTGDNTLLDGLFSLMEREGSDYTRAFRMLSQSEQHSAAS------PLRDEFID 387
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+ A+ SW Y L I D ER+ M VNP VLRN+L Q AI+ AE GD
Sbjct: 388 -----RAAFDSWFADYRARLRDEQIDDSERQQRMQGVNPALVLRNWLAQRAIEQAEAGDM 442
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ RL + + +P+ ++ + YA PP W V SCSS
Sbjct: 443 RELERLHEALAQPFADR--TDDYASRPPDWGKHLEV---SCSS 480
>gi|238782552|ref|ZP_04626583.1| hypothetical protein yberc0001_22020 [Yersinia bercovieri ATCC
43970]
gi|238716479|gb|EEQ08460.1| hypothetical protein yberc0001_22020 [Yersinia bercovieri ATCC
43970]
Length = 485
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 206/533 (38%), Positives = 285/533 (53%), Gaps = 53/533 (9%)
Query: 115 LPGD-PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF 173
LP + P+ ++ + L YT + P+ + L+ SE +A L LD F P ++
Sbjct: 2 LPANTPQFNNSYGQQLSGFYTHLQPTP-LTGAHLLYHSEPLAQELGLDASWFSGPKAAIW 60
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
+G L G P AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPY
Sbjct: 61 -AGEALLPGMEPLAQVYSGHQFGVWAGQLGDGRGILLGEQQLSDGRSMDWHLKGAGLTPY 119
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AVLRS +REFL SEA+H LGIP++RAL +VT+ V R+ + E GA+
Sbjct: 120 SRMGDGRAVLRSVVREFLASEALHHLGIPSSRALTIVTSNHPVYRE-------QPERGAM 172
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ RVA+S +RFG ++ R Q + V+ LADY I H+ + +
Sbjct: 173 LLRVAESHVRFGHFEHFYYRQQPEQ--VKQLADYVIARHWPQLVGL-------------- 216
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
+ Y W +V +RTA L+A WQ VGF HGV+NTDNMSILG+T+DYGPFGFLD
Sbjct: 217 -------AEGYLLWFTDVVKRTARLMAHWQTVGFAHGVMNTDNMSILGITMDYGPFGFLD 269
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKF 473
+ P + N +D G RY F NQP + LWN+ + L+ +D + + Y +
Sbjct: 270 DYVPGYICNHSDHQG-RYAFDNQPAVALWNLHRLGQALSGLMSVD--QLQLALNAYEPEL 326
Query: 474 MDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
M Y M KLGL Q +++ LL+ M+ + DYT FR LS V+ +
Sbjct: 327 MAAYGQQMRAKLGLFDSGSQDNDLLTALLSLMSKEGRDYTRTFRLLSEVEIHSAQS---- 382
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
PL+ +D + A+ SW Y L + D +R+ M +VNPKY+LRNYL Q
Sbjct: 383 --PLRDDFVD-----RAAFDSWYSRYRARLQQESVDDAQRQQAMKAVNPKYILRNYLAQH 435
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
I AE D ++RL + +++P+ +QP + A LPP W +SCSS
Sbjct: 436 VISHAEKDDIQPLQRLHQALQQPFADQPEFDDLAALPPDWGKH---LEISCSS 485
>gi|187928542|ref|YP_001899029.1| hypothetical protein Rpic_1456 [Ralstonia pickettii 12J]
gi|187725432|gb|ACD26597.1| protein of unknown function UPF0061 [Ralstonia pickettii 12J]
Length = 529
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 217/529 (41%), Positives = 277/529 (52%), Gaps = 68/529 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGM 197
PS + P LV +S A SL + E + F+G + P A Y GHQFG+
Sbjct: 46 PSGAIGEPYLVGFSPDAAASLGITRAELDTAAGLAVFTGNAVATWSDPLATVYSGHQFGV 105
Query: 198 WAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMH 257
WAGQLGDGRA+ L E +E+QLKGAG+TPYSR DG AVLRSSIREFLCSEAM
Sbjct: 106 WAGQLGDGRALLLAEFQTADGP-YEVQLKGAGRTPYSRMGDGRAVLRSSIREFLCSEAMA 164
Query: 258 FLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQED 317
LGIPTTRALC+ V R+ + E A+V R+A SF+RFG ++ A+ E
Sbjct: 165 GLGIPTTRALCVTGADAPVRRE-------EIETAAVVTRLAPSFVRFGHFEHFAA--SEQ 215
Query: 318 LDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTAS 377
L +R LADY I D H Y A E+A RTA
Sbjct: 216 LPQLRALADYVI---------------------DRFHPASRSEPQPYLALLRELARRTAE 254
Query: 378 LVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQP 437
L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +A QP
Sbjct: 255 LMADWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAYAQQP 313
Query: 438 DIGLWNIAQFSTTL-----------------AAAKLIDDKEANYVM---ERYGTKFMDEY 477
IG WN+ + L A A+ D N ++ + YG F Y
Sbjct: 314 QIGYWNLFCLAQALLPLFGEDPDVFVNLSDEAQAQPAIDAAQNVLLTYRDVYGAAFYARY 373
Query: 478 QAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
+A KLGL ++ + L + + DYT FFR L++V+ D + E
Sbjct: 374 RA----KLGLSTAQDADEALFGDLFKLLHNQRADYTLFFRHLADVRRDDTPAAAE----- 424
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ D +R A + W+ +Y Q L + SD+ER A M VNPKYVLRN+L + AI
Sbjct: 425 ARTVRDFFFDRAAADV-WLAAYRQRLQAEPQSDDERAAAMQRVNPKYVLRNHLAEIAIRR 483
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A+ DF EV L ++ RP+D+ PG E YA+ P WA +SCSS
Sbjct: 484 AKEKDFSEVENLRAVLARPFDDHPGFEHYAQPAPDWA---SSLEVSCSS 529
>gi|428150498|ref|ZP_18998268.1| Selenoprotein O and cysteine-containing homologs [Klebsiella
pneumoniae subsp. pneumoniae ST512-K30BO]
gi|427539520|emb|CCM94406.1| Selenoprotein O and cysteine-containing homologs [Klebsiella
pneumoniae subsp. pneumoniae ST512-K30BO]
Length = 478
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 209/521 (40%), Positives = 279/521 (53%), Gaps = 55/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT +SP+ ++N +L+ + +A L + F + G L G P
Sbjct: 10 RDELPDFYTSLSPTP-LDNARLIWRNAPLAQQLGVPDALFAPESGVGVWGGEALLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I LGE R++ LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLGEQQLADGRRYDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+ E L SEAMH LGIPTTRAL +VT+ V R+ + EPGA++ RVA+S +RFG
Sbjct: 129 T--ESLASEAMHALGIPTTRALAMVTSDTPVYRE-------RVEPGAMLMRVAESHVRFG 179
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + V+ LADY IRHH+ +++ ++KY
Sbjct: 180 HFEHFYYR--REPQKVQQLADYVIRHHWPQLQD---------------------EADKYL 216
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W ++ RTA +A WQ VGF HGV+NTDNMSILGLTIDYGP+GFLD F P F N +D
Sbjct: 217 LWFRDIVMRTAQTIASWQTVGFAHGVMNTDNMSILGLTIDYGPYGFLDDFQPDFICNHSD 276
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L + I + N ++ Y + Y M KL
Sbjct: 277 YQG-RYSFENQPAVGLWNLQRLAQSL--SPFISAEALNAALDEYQHALLTAYGQRMRDKL 333
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M +K DYT FR LS+ + + PL+ +D
Sbjct: 334 GLFSQQKGDNDLLDGLFALMIREKSDYTRTFRLLSHSEQLSAAS------PLRDEFID-- 385
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L + D +R+ M VNP VLRN+L Q AI+ AE GD GE
Sbjct: 386 ---RAAFDSWFAGYRARLRDEQVDDAQRQQRMQGVNPALVLRNWLAQRAIEQAEAGDMGE 442
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + P+ ++ + Y R PP W R V SCSS
Sbjct: 443 LERLHAALADPFTDRE--DDYVRRPPDWGKRLEV---SCSS 478
>gi|325275714|ref|ZP_08141598.1| hypothetical protein G1E_20125 [Pseudomonas sp. TJI-51]
gi|324099154|gb|EGB97116.1| hypothetical protein G1E_20125 [Pseudomonas sp. TJI-51]
Length = 486
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 207/548 (37%), Positives = 289/548 (52%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + P+LV SE L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IAEPRLVVASEPAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F FSG A P A Y GHQFG + +LGDGR + L E+LN +
Sbjct: 46 DLDPAQAELPLFAELFSGHKLWDQADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVLNDAN 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+ W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H L IPT+RALC++ + V R
Sbjct: 106 QHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALHIPTSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ RVAQS +RFG ++ Q + R L D+ ++ H+
Sbjct: 166 E-------TRESAAMLTRVAQSHVRFGHFEYFYYTKQPEQQ--RVLLDHVLQQHYAECGT 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNADLIARWQACGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F+ N +D G RY +ANQ I WN++ + L +++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFSCNHSDDRG-RYSYANQVPIAHWNLSALAQALTT--VVE 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL + ++ +LL M VDY FFR
Sbjct: 313 VEPLKQALSLFLPLYQAHYLDLMRRRLGLTTAEDDDMALVERLLQCMQRGGVDYNLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L P E L ++ +D+ + +W Y+ + E R+ M+
Sbjct: 373 LGEQ------PVAEALKVVRNDFIDLA-----GFDAWGAEYLARCEREAGNAEGRRERMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ P++EQPGM+ YA PP W
Sbjct: 422 AVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLSNPFEEQPGMQAYAERPPEWGKH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|410909440|ref|XP_003968198.1| PREDICTED: UPF0061 protein azo1574-like [Takifugu rubripes]
Length = 584
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 214/549 (38%), Positives = 289/549 (52%), Gaps = 54/549 (9%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWS----------ESVADSLELDPKE 164
P DP + R V + +++ P+ +L A S + + L LD
Sbjct: 70 FPIDPVDGNFVRTVKNCVFSRSLPTPLKGPLRLAAVSTRASCQLFHQDVIGGILNLDVAA 129
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+F + SG + G+ P A YGGHQFG WAGQLGDGRA TLG+ N E WELQ
Sbjct: 130 ARSEEFLRYASGGALMVGSEPLAHRYGGHQFGYWAGQLGDGRAHTLGQFTNRNGEVWELQ 189
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKG+GKTPYSR DG AV+RSS+REFLCSEAMHFLG+PT+RA L+ + + V RD FYDG
Sbjct: 190 LKGSGKTPYSRSGDGRAVVRSSVREFLCSEAMHFLGVPTSRAASLIVSDEPVLRDQFYDG 249
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
N K E GA+V RVA+S+ R GS +I + G+ ++R L D+ I HF I
Sbjct: 250 NVKAERGAVVLRVARSWFRIGSLEILSESGE--FGLLRELMDFVIDEHFPSI-------- 299
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
S+ D D KY + V TA L+A+W VGF HGV NTDN S+L +TI
Sbjct: 300 ---SSDDPD---------KYLVFYSTVVNETAHLIARWTSVGFAHGVCNTDNFSLLSVTI 347
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI----DDK 460
DYGPFGF++++DPSF PN +D G RY Q +GL+N+ + LAA + + K
Sbjct: 348 DYGPFGFVESYDPSFVPNVSDDEG-RYSIGAQAGVGLFNLGKL---LAALRPVLTGEQQK 403
Query: 461 EANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALS 517
EA V+ Y + + KLGL + +I+ LL M + D+T FR LS
Sbjct: 404 EAQSVLNGYADVYQRRILQLFRAKLGLLGEEEDDGFLIALLLKLMEDTRSDFTLTFRQLS 463
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE-RKALMNS 576
A+ ++ + G + + W+ Y+ L D+ R+ M
Sbjct: 464 EASAEQLHGQN-----FTQMWALEGLSSHQLFPDWLGLYLPRLRRQQRDDDSGRRNRMKR 518
Query: 577 VNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARL--PPAWAYRP 634
VNP+YVLRN++ +SA+ AE DF EV L + + P+ Q E+ PPAWA
Sbjct: 519 VNPRYVLRNWMAESAVRKAERNDFSEVALLHRTLSSPFVTQEAAEEAGYAAKPPAWARGL 578
Query: 635 GVCMLSCSS 643
V SCSS
Sbjct: 579 KV---SCSS 584
>gi|333926961|ref|YP_004500540.1| hypothetical protein SerAS12_2106 [Serratia sp. AS12]
gi|333931915|ref|YP_004505493.1| hypothetical protein SerAS9_2106 [Serratia plymuthica AS9]
gi|386328784|ref|YP_006024954.1| hypothetical protein [Serratia sp. AS13]
gi|333473522|gb|AEF45232.1| UPF0061 protein ydiU [Serratia plymuthica AS9]
gi|333491021|gb|AEF50183.1| UPF0061 protein ydiU [Serratia sp. AS12]
gi|333961117|gb|AEG27890.1| UPF0061 protein ydiU [Serratia sp. AS13]
Length = 480
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 211/528 (39%), Positives = 289/528 (54%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P+ ++ L YT++ P+ ++ +L+ SE +A L LD F + P++ SG
Sbjct: 2 PQFENAYHHQLPGFYTELKPTP-LKGARLLYHSEPLARELGLDESWFTQDKTPIW-SGER 59
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR D
Sbjct: 60 LLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQKLADGRSMDWHLKGAGLTPYSRMGD 119
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS+IREFL SEA+H LGIPTTRAL LVT+ + V R+ + E GA++ RVA
Sbjct: 120 GRAVLRSAIREFLASEALHHLGIPTTRALTLVTSEQPVFRE-------QPERGAMLLRVA 172
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q + V+ LAD+ I H+ + +DH
Sbjct: 173 ESHVRFGHFEHFYYRKQPEQ--VQQLADFVIARHWPQL---------------KDH---- 211
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
+ Y W ++V ERTA L+A WQ VGF HGV+NTDNMSILG+TIDYGP+ FLD + P
Sbjct: 212 --DDGYLPWFIDVVERTARLIAHWQTVGFAHGVMNTDNMSILGITIDYGPYAFLDDYKPD 269
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N +D G RY F NQP + LWN+ + + L+ L+ ++ + Y M Y
Sbjct: 270 FICNHSDHQG-RYAFDNQPAVALWNLHRLAQALSG--LMTTEQLQQALAVYEPALMRAYG 326
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + Q +++ LL+ MA + DYT FR LS + + + PL+
Sbjct: 327 EQMRAKLGFFTQSTQDNDLLTGLLSLMAQEGRDYTRTFRLLSQTE------QQQAQSPLR 380
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D G A+ +W Y Q L +SD ER+ M + NPK +LRNYL Q AI+ A
Sbjct: 381 DEFIDRG-----AFDAWYQQYRQRLQQEQVSDSERQQAMKAANPKLILRNYLAQQAIERA 435
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D ++ RL + + P+ + P + A LPP W +SCSS
Sbjct: 436 EQDDVSKLARLHQALLTPFADVPEYDDLAALPPDWGKH---LEISCSS 480
>gi|153948973|ref|YP_001400709.1| hypothetical protein YpsIP31758_1734 [Yersinia pseudotuberculosis
IP 31758]
gi|166980210|sp|A7FHI1.1|Y1734_YERP3 RecName: Full=UPF0061 protein YpsIP31758_1734
gi|152960468|gb|ABS47929.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP
31758]
Length = 483
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 206/528 (39%), Positives = 280/528 (53%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P D+ L YT++ P+ ++ +L+ S+ +A L LD F P ++ +G
Sbjct: 5 PEFDNSYARQLSGFYTRLQPTP-LKGARLLYHSKPLAQELGLDAHWFTEPKTAVW-AGEA 62
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFGMWAGQLGDGR I LGE + LKGAG TPYSR D
Sbjct: 63 LLPGMEPLAQVYSGHQFGMWAGQLGDGRGILLGEQRLNDGRYMDWHLKGAGLTPYSRMGD 122
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS IREFL SEA+H LGIPT+RAL +VT+ + R+ + E GA++ RVA
Sbjct: 123 GRAVLRSVIREFLASEALHHLGIPTSRALTIVTSDHPIYRE-------QTERGAMLLRVA 175
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q V+ LADY I H+ +
Sbjct: 176 ESHIRFGHFEHFYYRQQPKQ--VQQLADYVIARHWPQWVGHQEC---------------- 217
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
Y W +V ERTA L+A WQ VGF HGV+NTDNMSILG+T+DYGPFGFLD + P
Sbjct: 218 -----YRLWFTDVVERTARLMAHWQTVGFAHGVMNTDNMSILGITMDYGPFGFLDDYVPG 272
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
+ N +D G RY + NQP + LWN+ + L+ L+ + +E Y M Y
Sbjct: 273 YICNHSDHQG-RYAYDNQPAVALWNLHRLGHALSG--LMSADQLQLALEAYEPALMVAYG 329
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + Q +++ LL+ M + DYT FR LS V+ + PL+
Sbjct: 330 EQMRAKLGFLERDSQDNDLLTGLLSLMIKEGRDYTRTFRLLSEVEVHSAQS------PLR 383
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + A+ W Y L I D++R+ M + NPKY+LRNYL Q AI A
Sbjct: 384 DDFID-----RAAFDDWYRRYRSRLQQESIDDDQRQQSMKAANPKYILRNYLAQQAITQA 438
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D ++RL + +++P+ +QP + A LPP W +SCSS
Sbjct: 439 EKDDIQPLQRLHQALQQPFTDQPEFDDLAALPPDWGKH---LEISCSS 483
>gi|423204849|ref|ZP_17191405.1| hypothetical protein HMPREF1168_01040 [Aeromonas veronii AMC34]
gi|404625725|gb|EKB22540.1| hypothetical protein HMPREF1168_01040 [Aeromonas veronii AMC34]
Length = 475
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 208/505 (41%), Positives = 268/505 (53%), Gaps = 55/505 (10%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
++ P+L+ + ++ D L L D+ L G P AQ Y GHQFG ++ +
Sbjct: 23 LQQPRLLHLNRALLDELGLG--GVSEADWIACCGEGKVLPGMQPVAQVYAGHQFGGYSPR 80
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGRA+ LGE + W+L LKGAGKTP+SRF DG AVLRSSIRE+L SEA+H LGI
Sbjct: 81 LGDGRALLLGEQQAPDGQHWDLHLKGAGKTPFSRFGDGRAVLRSSIREYLASEALHALGI 140
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
PTTRAL LV + + V R+ E GA V R A S LRFG + A GQ + +
Sbjct: 141 PTTRALVLVGSQEPVYREQV-------ETGATVLRTAPSHLRFGHIEYFAWSGQG--EKI 191
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
L DY +RHHF +E+ A EV RTA L+A+
Sbjct: 192 LPLIDYLLRHHFPELESG-------------------------AELFAEVVRRTARLIAK 226
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P F N +D P RY QP +G
Sbjct: 227 WQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPDFVCNHSD-PAGRYALDQQPAVGY 285
Query: 442 WNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKL 498
WN+ + + LA +D + +Y + M Y +M KLGL + + + +L
Sbjct: 286 WNLQKLAQALAGH--VDGDALAAALAQYEQQLMLHYSELMRAKLGLAVWEEDDPALFREL 343
Query: 499 LNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ 558
+A KVDY F R L V + P L A+L + G W W+ Y
Sbjct: 344 FRLLAAHKVDYHLFLRRLGEVTQEGGWP-----ASLLALLSEPG-----VWQEWLERYRA 393
Query: 559 ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQP 618
L+ G D RKA M+++NPKYVLRN L Q IDAA+ GD RL ++RPYDEQP
Sbjct: 394 RLMREGSEDAVRKAQMDAINPKYVLRNALAQQVIDAADAGDMRPFERLFAALQRPYDEQP 453
Query: 619 GMEKYARLPPAWAYRPGVCMLSCSS 643
E A PAW Y G LSCSS
Sbjct: 454 EYEDLATPTPAW-YCGG--ELSCSS 475
>gi|421783238|ref|ZP_16219689.1| hypothetical protein B194_2295 [Serratia plymuthica A30]
gi|407754678|gb|EKF64810.1| hypothetical protein B194_2295 [Serratia plymuthica A30]
Length = 480
Score = 336 bits (861), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 211/528 (39%), Positives = 289/528 (54%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P+ ++ L YT++ P+ ++ +L+ SE +A L LD F + P++ SG
Sbjct: 2 PQFENAYHHQLPGFYTELKPTP-LKGARLLYHSEPLARELGLDESWFTQDKTPIW-SGER 59
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFG+WAGQLGDGR I LGE + LKGAG TPYSR D
Sbjct: 60 LLPGMQPLAQVYSGHQFGVWAGQLGDGRGILLGEQKLADGRSMDWHLKGAGLTPYSRMGD 119
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS+IREFL SEA+H LGIPTTRAL LVT+ + V R+ + E GA++ RVA
Sbjct: 120 GRAVLRSAIREFLASEALHHLGIPTTRALTLVTSEQPVFRE-------QPERGAMLLRVA 172
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q + V+ LAD+ I H+ + +DH
Sbjct: 173 ESHVRFGHFEHFYYRKQPEQ--VQQLADFVIARHWPQL---------------KDH---- 211
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
+ Y W ++V ERTA L+A WQ VGF HGV+NTDNMSILG+TIDYGPF FLD + P
Sbjct: 212 --DDGYLPWFIDVVERTARLIAHWQTVGFAHGVMNTDNMSILGITIDYGPFAFLDDYKPD 269
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N +D G RY F NQP + LWN+ + + L+ L+ ++ + Y M Y
Sbjct: 270 FICNHSDHQG-RYAFDNQPAVALWNLHRLAQALSG--LMTTEQLQRALAAYEPALMRAYG 326
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + Q +++ LL+ MA + DYT FR LS + + + PL+
Sbjct: 327 EQMRAKLGFFTQSTQDNDLLTGLLSLMAQEGRDYTRTFRLLSQTE------QQQAQSPLR 380
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D G A+ +W Y Q L +SD ER+ M + NPK +LRNYL Q AI++A
Sbjct: 381 DEFIDRG-----AFDAWYQQYRQRLQQEQVSDSERQQAMKAANPKLILRNYLAQQAIESA 435
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D ++ RL + + P+ + + A LPP W +SCSS
Sbjct: 436 EQDDVSKLARLHQALLTPFADAAEYDDLAALPPDWGKH---LEISCSS 480
>gi|51596645|ref|YP_070836.1| hypothetical protein YPTB2321 [Yersinia pseudotuberculosis IP
32953]
gi|145598040|ref|YP_001162116.1| hypothetical protein YPDSF_0737 [Yersinia pestis Pestoides F]
gi|170024079|ref|YP_001720584.1| hypothetical protein YPK_1840 [Yersinia pseudotuberculosis YPIII]
gi|186895702|ref|YP_001872814.1| hypothetical protein YPTS_2396 [Yersinia pseudotuberculosis PB1/+]
gi|81639232|sp|Q66A11.1|Y2321_YERPS RecName: Full=UPF0061 protein YPTB2321
gi|166228851|sp|A4TIN1.1|Y737_YERPP RecName: Full=UPF0061 protein YPDSF_0737
gi|226696097|sp|B1JJ37.1|Y1840_YERPY RecName: Full=UPF0061 protein YPK_1840
gi|226701279|sp|B2K5K6.1|Y2396_YERPB RecName: Full=UPF0061 protein YPTS_2396
gi|51589927|emb|CAH21559.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP
32953]
gi|145209736|gb|ABP39143.1| hypothetical protein YPDSF_0737 [Yersinia pestis Pestoides F]
gi|169750613|gb|ACA68131.1| protein of unknown function UPF0061 [Yersinia pseudotuberculosis
YPIII]
gi|186698728|gb|ACC89357.1| protein of unknown function UPF0061 [Yersinia pseudotuberculosis
PB1/+]
Length = 487
Score = 336 bits (861), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 206/528 (39%), Positives = 280/528 (53%), Gaps = 52/528 (9%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
P D+ L YT++ P+ ++ +L+ S+ +A L LD F P ++ +G
Sbjct: 9 PEFDNSYARQLSGFYTRLQPTP-LKGARLLYHSKPLAQELGLDAHWFTEPKTAVW-AGEA 66
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
L G P AQ Y GHQFGMWAGQLGDGR I LGE + LKGAG TPYSR D
Sbjct: 67 LLPGMEPLAQVYSGHQFGMWAGQLGDGRGILLGEQRLNDGRYMDWHLKGAGLTPYSRMGD 126
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRS IREFL SEA+H LGIPT+RAL +VT+ + R+ + E GA++ RVA
Sbjct: 127 GRAVLRSVIREFLASEALHHLGIPTSRALTIVTSDHPIYRE-------QTERGAMLLRVA 179
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
+S +RFG ++ R Q V+ LADY I H+ +
Sbjct: 180 ESHIRFGHFEHFYYRQQPKQ--VQQLADYVIARHWPQWVGHQEC---------------- 221
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
Y W +V ERTA L+A WQ VGF HGV+NTDNMSILG+T+DYGPFGFLD + P
Sbjct: 222 -----YRLWFTDVVERTARLMAHWQTVGFAHGVMNTDNMSILGITMDYGPFGFLDDYVPG 276
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
+ N +D G RY + NQP + LWN+ + L+ L+ + +E Y M Y
Sbjct: 277 YICNHSDHQG-RYAYDNQPAVALWNLHRLGHALSG--LMSADQLQLALEAYEPALMVAYG 333
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M KLG + + Q +++ LL+ M + DYT FR LS V+ + PL+
Sbjct: 334 EQMRAKLGFLERDSQDNDLLTGLLSLMIKEGRDYTRTFRLLSEVEVHSAQS------PLR 387
Query: 536 AVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+D + A+ W Y L I D++R+ M + NPKY+LRNYL Q AI A
Sbjct: 388 DDFID-----RAAFDDWYRRYRSRLQQESIDDDQRQQSMKAANPKYILRNYLAQQAITQA 442
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E D ++RL + +++P+ +QP + A LPP W +SCSS
Sbjct: 443 EKDDIQPLQRLHQALQQPFTDQPEFDDLAALPPDWGKH---LEISCSS 487
>gi|170719585|ref|YP_001747273.1| hypothetical protein PputW619_0398 [Pseudomonas putida W619]
gi|226706096|sp|B1J2K5.1|Y398_PSEPW RecName: Full=UPF0061 protein PputW619_0398
gi|169757588|gb|ACA70904.1| protein of unknown function UPF0061 [Pseudomonas putida W619]
Length = 486
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 209/550 (38%), Positives = 291/550 (52%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL+ L +D+ F R GD A T+V P + +P+LV S+S L
Sbjct: 1 MKALDQLTFDNRFAR--LGD------------AFSTQVLPEP-IADPRLVIASKSAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + + P F FSG GA P A Y GHQFG + +LGDGR + L E++N
Sbjct: 46 DLDPAQADTPVFAELFSGHKLWEGADPRAMVYSGHQFGSYNPRLGDGRGLLLAEVVNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGI T+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIATSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+AQS +RFG ++ Q + R L D+ + H+
Sbjct: 166 E-------TRESAAMLTRLAQSHVRFGHFEYFYYTKQPEQQ--RVLIDHVLEQHYPECRE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIAHWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL + ++ +LL M VDY FF
Sbjct: 315 PLKEALGLFLPLYQAHYLD----LMRRRLGLTTAEDDDMALVERLLQRMQSGGVDYNLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L + P E L ++ +D+ + +W Y+ + + R+
Sbjct: 371 RRLGDQ------PVAEALKGVRDDFIDLA-----GFDAWGADYLARCEREAGNGDGRRER 419
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ P++EQ GM+ YA PPAW
Sbjct: 420 MHAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLSTPFEEQAGMQAYAERPPAWGKH 479
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 480 ---LEISCSS 486
>gi|419952938|ref|ZP_14469084.1| hypothetical protein YO5_17045 [Pseudomonas stutzeri TS44]
gi|387970214|gb|EIK54493.1| hypothetical protein YO5_17045 [Pseudomonas stutzeri TS44]
Length = 486
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 212/549 (38%), Positives = 301/549 (54%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L L +D+ F R GD A T V P + +P+LV S++ L
Sbjct: 1 MKSLTQLTFDNRFAR--LGD------------AFSTAVMPQP-LADPRLVVASDAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L+P E+P F FSG + A P A Y GHQFG++ QLGDGR + LGE+LN
Sbjct: 46 DLEPAVVEQPLFVELFSGHKLWSTAEPRAMVYSGHQFGVYNPQLGDGRGLLLGEVLNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SE + LGIP+TRALC+ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEHLAALGIPSTRALCVTASATPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ ++E GA++ R+A S LRFG ++ + +R +L + L DY++ HF +
Sbjct: 166 E-------RQERGAMLLRLAPSHLRFGHFEFFYYTRRHAEL---KQLLDYSLEAHFPQLR 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ Y A EV ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 216 EQPEP---------------------YLALFREVLERTAALIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
S+LG+T+D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SLLGITLDFGPYAFLDDFDARFICNHSDDRG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ ME + + E+ +M ++LG + +++++ +LL M VDYT FFR
Sbjct: 312 EVTRLRETMELFLPLYEAEWLDLMRRRLGFARAEADDERLVRRLLQLMQDSAVDYTRFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
L + A ++ L+ +D+ + +W Y + + + R+A M
Sbjct: 372 ELGDSPAPQAVRR------LREDFVDLA-----GFDAWAADYCARVAREDATQDSRQARM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNPKY+LRNYL Q I+AAE GD+G VR L ++ RP+DEQPGM++YA PP W
Sbjct: 421 HAVNPKYILRNYLAQQVIEAAEQGDYGPVRELHAVLGRPFDEQPGMQRYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|386284444|ref|ZP_10061666.1| hypothetical protein SULAR_04327 [Sulfurovum sp. AR]
gi|385344729|gb|EIF51443.1| hypothetical protein SULAR_04327 [Sulfurovum sp. AR]
Length = 476
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 211/518 (40%), Positives = 277/518 (53%), Gaps = 64/518 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
CY +V+P+ E P L+ + VA L++D E + F F +G G+ P+A CY
Sbjct: 18 VCYDRVTPTPLAE-PYLIHANTDVAKVLDIDETELQTEAFVKFLNGEYIAEGSEPFAMCY 76
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG + +LGDGRAI +G I +++ LQLKGAG T YSR DG AVLRSSIRE+
Sbjct: 77 AGHQFGYFVPRLGDGRAINIGTI-----DKYHLQLKGAGITEYSRHGDGRAVLRSSIREY 131
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH L IPTT L L+ + V RD K E GAIVCRV+ S++RFG+++ +
Sbjct: 132 LMSEAMHGLSIPTTLCLGLIGSEHDVRRD-------KIEKGAIVCRVSSSWVRFGTFEYY 184
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
A +G+ + LADY I +F H +G E N+Y +
Sbjct: 185 AHQGK--FKELAALADYVIEENFPH------------HSGKE---------NRYTLLFND 221
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V TA L+AQW VGF HGV+NTDNMSI GLTIDYGP+ FLD F N TD+ G R
Sbjct: 222 VLIITARLIAQWMSVGFNHGVMNTDNMSIAGLTIDYGPYAFLDDFRHENVCNQTDVEG-R 280
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP-- 488
Y FANQP+I WN+ L+ D E N M + ++ + M KKLG
Sbjct: 281 YSFANQPEIAKWNLKSLIMALSPLTDTDKMEKNLAM--FDKIYIRYFHYYMCKKLGFEGT 338
Query: 489 -KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG--KER 545
+ + ++I +L+ + VDYT FFR LS+ + D + LL G E
Sbjct: 339 IEGDPELIDDMLDMLEQLHVDYTLFFRTLSHYEGD------------RKALLSTGLYHEP 386
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
AW+ + I+ I ERK M S NPKYVL+NY+ Q AIDAAE GDF V
Sbjct: 387 MNAWLDRYDARIKT-----IDTTERKEQMLSSNPKYVLKNYMLQEAIDAAEKGDFSVVDD 441
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L ++ + P+DE P E++A P LSCSS
Sbjct: 442 LFQIAQNPFDEHPAFERWAEATPQEFKNK---RLSCSS 476
>gi|386022546|ref|YP_005940571.1| hypothetical protein PSTAA_3974 [Pseudomonas stutzeri DSM 4166]
gi|327482519|gb|AEA85829.1| conserved hypothetical protein [Pseudomonas stutzeri DSM 4166]
Length = 486
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 217/549 (39%), Positives = 298/549 (54%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L L++D+ F R GD + T+VSP +E P+LV SE+ L
Sbjct: 1 MKTLTQLHFDNRFAR--LGDTFS------------TQVSPQP-LEAPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E+ F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLDPAEAEQALFAELFSGHKIWSTAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ + V R
Sbjct: 106 EYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTGSDTLVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG ++ + +R +L + L ++ I HF +
Sbjct: 166 E-------RPERGAMLLRLAPSHVRFGHFEFFYYTRQHGEL---KQLLEHVIEAHFPELL 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + F T V ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 216 EHPEPFHMFFRT---------------------VLERTAALIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDARFICNHSDDTG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ K ME + + E+ +M ++LG + + ++I +LL M VDYT FFR
Sbjct: 312 EVKVLRETMELFLPLYEAEWLDLMRRRLGFSQAEDGDAELIRRLLQLMQGSAVDYTRFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
L P ++ L+ +D+ + + +W Y G R+A M
Sbjct: 372 ELGER------PAEQAAQRLREDFIDL-----QGFDAWAADYCARSAREGGDPVARQARM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNPKY+LRNYL Q AI+AAE GD+ VR L ++ RP+DEQPGME+YA PP W
Sbjct: 421 HAVNPKYILRNYLAQQAIEAAEKGDYAPVRELHAVLSRPFDEQPGMERYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|241663096|ref|YP_002981456.1| hypothetical protein Rpic12D_1497 [Ralstonia pickettii 12D]
gi|240865123|gb|ACS62784.1| protein of unknown function UPF0061 [Ralstonia pickettii 12D]
Length = 529
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 216/529 (40%), Positives = 280/529 (52%), Gaps = 68/529 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGM 197
P+ + P LV +S A SL + E + F+G + P A Y GHQFG+
Sbjct: 46 PAGAIGEPYLVGFSPDAAASLGISRAELDTAAGLAVFTGNAVATWSDPLATVYSGHQFGV 105
Query: 198 WAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMH 257
WAGQLGDGRA+ L E +E+QLKGAG+TPYSR DG AVLRSSIREFLCSEAM
Sbjct: 106 WAGQLGDGRALLLAEFQTADGP-YEVQLKGAGRTPYSRMGDGRAVLRSSIREFLCSEAMA 164
Query: 258 FLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQED 317
LGIPTTRALC+ V R+ + E A+V R+A SF+RFG ++ A+ E
Sbjct: 165 GLGIPTTRALCVTGADAPVRRE-------EIETAAVVTRLATSFVRFGHFEHFAA--SEQ 215
Query: 318 LDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTAS 377
L +R LADY I + ++SE Y A E+A RTA
Sbjct: 216 LPQLRALADYVIDRFY----PASRSEP-----------------QPYLALLREIARRTAE 254
Query: 378 LVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQP 437
L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +A QP
Sbjct: 255 LMADWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSD-SGGRYAYAQQP 313
Query: 438 DIGLWNIAQFSTTL-----------------AAAKLIDDKEANYVM---ERYGTKFMDEY 477
IG WN+ + L A A+ D N ++ + YG F Y
Sbjct: 314 QIGYWNLFCLAQALLPLFGEDPHVFVNLSDEAQAQPAIDAAQNVLLTYRDVYGAAFYARY 373
Query: 478 QAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
+A KLGL ++ + L + + DYT FFR L++V+ D + E
Sbjct: 374 RA----KLGLSTAQDADEALFGDLFKLLHNQRADYTLFFRHLADVRRDDTPAAAE----- 424
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ D +R A + W+ +Y Q L + SD+ER A M VNPKYVLRN+L + AI
Sbjct: 425 ARTVRDFFFDRAAADV-WLAAYRQRLQAEPQSDDERAAAMQRVNPKYVLRNHLAEIAIRR 483
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A+ DF EV L ++ RP+D+ PG E YA+ P WA +SCSS
Sbjct: 484 AKEKDFSEVENLRAVLARPFDDHPGFEHYAQPAPDWA---SSLEVSCSS 529
>gi|398939166|ref|ZP_10668385.1| hypothetical protein PMI27_02159 [Pseudomonas sp. GM41(2012)]
gi|398164802|gb|EJM52932.1| hypothetical protein PMI27_02159 [Pseudomonas sp. GM41(2012)]
Length = 487
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 219/551 (39%), Positives = 300/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD + T V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGDTFS------------THVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P+F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAETPEFAELFSGHKLWADAIPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA++ L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALYALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMVLRMAPSHVRFGHFEYFYYTKRPEKQ----KELGDHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T DYGPF FLD FD F N +D G RY ++NQ IG WN++ + L
Sbjct: 254 MSILGITFDYGPFAFLDDFDAHFICNHSDDQG-RYSYSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y + Y +M ++LG K ++ ++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYFPLYQAHYLDLMRRRLGFTKAEDEDQNLLEHLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKA 572
R L + ++ L+ +DI + + +W YI + G + E+R+
Sbjct: 371 RRLGEESPELAVAR------LRDDFVDI-----KGFDAWGELYIARVTREGEVDQEQRRK 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P+DEQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSNPFDEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|395799741|ref|ZP_10479021.1| hypothetical protein A462_30789 [Pseudomonas sp. Ag1]
gi|395336246|gb|EJF68107.1| hypothetical protein A462_30789 [Pseudomonas sp. Ag1]
Length = 487
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 218/548 (39%), Positives = 303/548 (55%), Gaps = 64/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD T V P ++ P+LV SE+ L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------GFSTHVLPEP-IDEPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVAETPVFAELFGGHKLWAEAEPRAMIYSGHQFGSYNPQLGDGRGLLLGEVYNQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSTTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R+A S +RFG ++ + +L + LA++ + HF E
Sbjct: 166 E-------KQERAAMVLRLAHSHVRFGHFEYFYYTKKPELQ--KQLAEHVLSLHF--PEC 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
M + E Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 215 MEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDAQFVCNHSDHEG-RYSFSNQVPIGQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
+ + Y F Y +M ++LGL +++++ +LL M VDYT FFR
Sbjct: 313 VEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLVERLLQLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L + A ++ L+ +D+ + + W + ++ S ++E+R+ M+
Sbjct: 373 LGDESAALAVAR------LRDDFVDL--KGFDGWADLYKARVERDASG--TEEQRRERMH 422
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNP Y+LRNYL Q+AI AAELGD+ EVRRL +++ +P++EQPGME+YA+ PP W
Sbjct: 423 GVNPLYILRNYLAQNAIQAAELGDYSEVRRLHEVLTKPFEEQPGMEQYAQRPPDWGKH-- 480
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 481 -LEISCSS 487
>gi|423203713|ref|ZP_17190281.1| hypothetical protein HMPREF1167_03864 [Aeromonas veronii AER39]
gi|404612491|gb|EKB09552.1| hypothetical protein HMPREF1167_03864 [Aeromonas veronii AER39]
Length = 475
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 213/525 (40%), Positives = 281/525 (53%), Gaps = 57/525 (10%)
Query: 122 DSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLA 181
++ E+ AC V+P ++ P+L+ + ++ D L L D+ L
Sbjct: 5 NTFATELPWAC-EPVAPQP-LQQPRLLHLNRALLDELGLG--GVSEADWIACCGEGKVLP 60
Query: 182 GAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLA 241
G P AQ Y GHQFG ++ +LGDGRA+ LGE L +RW+L LKGAGKTP+SRF DG A
Sbjct: 61 GMQPVAQVYAGHQFGGYSPRLGDGRALLLGEQLAPDGQRWDLHLKGAGKTPFSRFGDGRA 120
Query: 242 VLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSF 301
VLRSSIRE+L SEA+H LGIPTTRAL LV + + V R+ + E GA V R S
Sbjct: 121 VLRSSIREYLASEALHALGIPTTRALVLVGSQEPVYRE-------RVETGATVLRTTPSH 173
Query: 302 LRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTS 361
LRFG + A GQ + + L DY +R+HF +E +G E +
Sbjct: 174 LRFGHIEYFAWSGQG--EKIPPLIDYLLRYHFPELE-----------SGAELFA------ 214
Query: 362 NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTP 421
EV RTA L+A+WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P F
Sbjct: 215 --------EVVRRTARLIAKWQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPDFVC 266
Query: 422 NTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIM 481
N +D P RY QP +G WN+ + + LA +D + +Y + M Y +M
Sbjct: 267 NHSD-PAGRYALDQQPAVGYWNLQKLAQALAGH--VDGDALTAALAQYEQQLMLHYSELM 323
Query: 482 TKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
KLGL + + + +L +A KVDY F R L V + + P A L
Sbjct: 324 RAKLGLAVWEEDDPALFRELFRLLAAHKVDYHLFLRRLGEVTQEGAWP---------ASL 374
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
L + E W +W+ Y L+ G D RKA M+++NPKYVLRN L Q IDAA+ G
Sbjct: 375 LALLPE-PLGWQAWLERYRARLMREGSEDAVRKAQMDAINPKYVLRNALAQQVIDAADAG 433
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
D RL ++ PYDEQP E A PAW Y G LSCSS
Sbjct: 434 DMQPFERLFAALQHPYDEQPEYEDLATPTPAW-YCGG--ELSCSS 475
>gi|300691438|ref|YP_003752433.1| hypothetical protein RPSI07_1789 [Ralstonia solanacearum PSI07]
gi|299078498|emb|CBJ51151.1| conserved protein of unknown function, UPF0061 [Ralstonia
solanacearum PSI07]
Length = 529
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 213/537 (39%), Positives = 280/537 (52%), Gaps = 72/537 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L E + P F+G A + P A Y GH
Sbjct: 38 TRLPPMPMPASPDLVGFSPEAAAPLGLSRAELDTPAGLDVFAGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+Q+KGAG+TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQIKGAGRTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRREEI-------ETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + + Y A E A
Sbjct: 209 -NEKLPELRALADFVI---------------------DRFYPACRAEAQPYLALLRETAR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 RTAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNIAQFSTTL-------------AAAKLIDDKEANYVM-----------ERY 469
A QP I WN+ + L A L D+ +A + + Y
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLFGSRSDNDGAAFVDLSDEAQAQPAIDAAQEALLVYRDTY 365
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
G F Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 366 GAAFYARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTP 420
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
++ V D +++ +W+ +Y Q L + + D+ R M VNPKYVLRN+
Sbjct: 421 ALAQARTVRDVFFD-----RDSADAWLAAYRQRLQAEPVPDDARAEAMRRVNPKYVLRNH 475
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + AI A+ DF EV L ++ RP+D+ PG E+YA P WA +SCSS
Sbjct: 476 LAEIAIRRAKEKDFAEVENLRAVLARPFDDHPGFERYAGPAPDWA---ASLEVSCSS 529
>gi|421138728|ref|ZP_15598783.1| hypothetical protein MHB_05606 [Pseudomonas fluorescens BBc6R8]
gi|404510115|gb|EKA24030.1| hypothetical protein MHB_05606 [Pseudomonas fluorescens BBc6R8]
Length = 487
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 218/548 (39%), Positives = 303/548 (55%), Gaps = 64/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD T V P ++ P+LV SE+ L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------GFSTHVLPEP-IDEPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVAETPVFAELFGGHKLWAEAEPRAMIYSGHQFGSYNPQLGDGRGLLLGEVYNQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSTTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R+A S +RFG ++ + +L + LA++ + HF E
Sbjct: 166 E-------KQERAAMVLRLAHSHVRFGHFEYFYYTKKPELQ--KQLAEHVLSLHF--PEC 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
M + E Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 215 MEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDAQFVCNHSDHEG-RYSFSNQVPIGQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
+ + Y F Y +M ++LGL +++++ +LL M VDYT FFR
Sbjct: 313 VEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLVERLLQLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L + A ++ L+ +D+ + + W + ++ S ++E+R+ M+
Sbjct: 373 LGDESAALAVAR------LRDDFVDL--KGFDEWADLYKARVERDASG--TEEQRRERMH 422
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNP Y+LRNYL Q+AI AAELGD+ EVRRL +++ +P++EQPGME+YA+ PP W
Sbjct: 423 GVNPLYILRNYLAQNAIQAAELGDYSEVRRLHEVLSKPFEEQPGMEQYAQRPPDWGKH-- 480
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 481 -LEISCSS 487
>gi|398950655|ref|ZP_10673768.1| hypothetical protein PMI26_01507 [Pseudomonas sp. GM33]
gi|398157640|gb|EJM46019.1| hypothetical protein PMI26_01507 [Pseudomonas sp. GM33]
Length = 487
Score = 335 bits (859), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 220/551 (39%), Positives = 303/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFDR--LGD------------AFSAHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P+F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAETPEFAELFSGHKLWADAIPRAMVYSGHQFGSYNPQLGDGRGLLLGEVHNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA++ L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALNALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKRPEQQ----KVLGDHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLLENLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + + +I L+ +D+ + + +W Y++ + G D E+R+
Sbjct: 371 RRLGDEAPEQAITR------LRDDFVDL-----KGFDAWGERYVERVAREGALDQEQRRQ 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSNPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|167719145|ref|ZP_02402381.1| hypothetical protein BpseD_08982 [Burkholderia pseudomallei DM98]
Length = 458
Score = 335 bits (859), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 215/505 (42%), Positives = 278/505 (55%), Gaps = 69/505 (13%)
Query: 161 DPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLK 217
+P + P F F G ++PYA Y GHQFG+WAGQLGDGRA+T+GE+ +
Sbjct: 1 EPALRDAPGFAELFCGNPTRDWPQASLPYASVYSGHQFGVWAGQLGDGRALTIGELAH-D 59
Query: 218 SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
R+ELQLKGAG+TPYSR DG AVLRSSIREFLCSEAMH LGIPTTRAL ++ + + V
Sbjct: 60 GRRYELQLKGAGRTPYSRMGDGRAVLRSSIREFLCSEAMHHLGIPTTRALAVIGSDQPVV 119
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHI 336
R+ E A+V RVAQSF+RFG ++ A+ E L R LAD+ I
Sbjct: 120 REEI-------ETSAVVTRVAQSFVRFGHFEHFFANDRPEQL---RALADHVI------- 162
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + D D + Y A E RTA LVAQWQ VGF HGV+NTDN
Sbjct: 163 ------ERFYPACRDAD--------DPYLALLAEATRRTAELVAQWQAVGFCHGVMNTDN 208
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWN---IAQF------ 447
MSILGLTIDYGPFGF+DAFD N +D G RY + QP I WN +AQ
Sbjct: 209 MSILGLTIDYGPFGFIDAFDAKHVCNHSDTQG-RYAYRMQPRIAHWNCFCLAQALLPLIG 267
Query: 448 ------STTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKL 498
S + A + ++D A+ V+ R+ +F + M KLGL + + + ++L
Sbjct: 268 LHRDAPSEDVRAERAVED--AHAVLGRFPEQFGPALERAMRAKLGLALEREGDAALANQL 325
Query: 499 LNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ 558
L M D+T FR L+ V + + P++ + +D ++A+ W Y
Sbjct: 326 LEIMDASHADFTLTFRHLARVSKHDARGD----APVRDLFID-----RDAFDRWANLYRA 376
Query: 559 ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQP 618
L D R A MN VNPKYVLRN+L ++AI A+ DF EV RL ++ RP+DEQP
Sbjct: 377 RLSEEARDDASRAAAMNRVNPKYVLRNHLAETAIRRAKEKDFSEVERLAAVLRRPFDEQP 436
Query: 619 GMEKYARLPPAWAYRPGVCMLSCSS 643
+ YA LPP WA +SCSS
Sbjct: 437 EHDAYAALPPDWA---STLEVSCSS 458
>gi|330827841|ref|YP_004390793.1| hypothetical protein B565_0141 [Aeromonas veronii B565]
gi|423211487|ref|ZP_17198020.1| hypothetical protein HMPREF1169_03538 [Aeromonas veronii AER397]
gi|328802977|gb|AEB48176.1| hypothetical protein B565_0141 [Aeromonas veronii B565]
gi|404613567|gb|EKB10588.1| hypothetical protein HMPREF1169_03538 [Aeromonas veronii AER397]
Length = 475
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 214/525 (40%), Positives = 282/525 (53%), Gaps = 57/525 (10%)
Query: 122 DSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLA 181
++ E+ AC V+P ++ P+L+ + ++ D L L D+ L
Sbjct: 5 NTFATELPWAC-EPVAPQP-LQQPRLLHLNRALLDELGLG--GVSEADWIACCGEGKVLP 60
Query: 182 GAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLA 241
G P AQ Y GHQFG ++ +LGDGRA+ LGE L +RW+L LKGAGKTP+SRF DG A
Sbjct: 61 GMQPVAQVYAGHQFGGYSPRLGDGRALLLGEQLAPDGQRWDLHLKGAGKTPFSRFGDGRA 120
Query: 242 VLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSF 301
VLRSSIRE+L SEA+H LGIPTTRAL LV + + V R+ + E GA V R S
Sbjct: 121 VLRSSIREYLASEALHALGIPTTRALVLVGSQEPVYRE-------RVETGATVLRTTPSH 173
Query: 302 LRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTS 361
LRFG + A GQ + + L DY +R+HF +E +G E +
Sbjct: 174 LRFGHIEYFAWSGQG--EKIPPLIDYLLRYHFPELE-----------SGAELFA------ 214
Query: 362 NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTP 421
EV RTA L+A+WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P F
Sbjct: 215 --------EVVRRTARLIAKWQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPDFVC 266
Query: 422 NTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIM 481
N +D P RY QP +G WN+ + + LA +D + +Y + M Y +M
Sbjct: 267 NHSD-PAGRYALDQQPAVGYWNLQKLAQALAGH--VDGDALAAALAQYEQQLMLHYSELM 323
Query: 482 TKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
KLGL + + + +L +A KVDY F R L V + + P LLV L L
Sbjct: 324 RAKLGLAVWEEDDPALFRELFRLLAAHKVDYHLFLRRLGEVTQEGAWPAS-LLVLLPEPL 382
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
W +W+ Y L+ G D RKA M+++NPKYVLRN L Q I+AA+ G
Sbjct: 383 ---------GWQAWLERYRARLMREGSEDVVRKAQMDAINPKYVLRNALAQQVIEAADAG 433
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
D RL ++RPYDEQP E A PAW Y G LSCSS
Sbjct: 434 DMQPFGRLFAALQRPYDEQPEYEDLATPTPAW-YCGG--ELSCSS 475
>gi|423120703|ref|ZP_17108387.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5246]
gi|376396204|gb|EHT08847.1| UPF0061 protein ydiU [Klebsiella oxytoca 10-5246]
Length = 480
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 283/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ ++N +L+ + +A +L + F + G T L G P
Sbjct: 10 RDELPDFYTPLAPTP-LKNARLIWHNAPLAQTLGIPEALFHPAQGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG WAGQLGDGR I L E R + LKGAG TPYSR DG AVLRS
Sbjct: 69 LAQVYSGHQFGAWAGQLGDGRGILLAEQQLSDGRRLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +VT+ V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIRESLASEAMHALGIPTTRALAMVTSDTPVQRETL-------ESGAMLMRLAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LADY IRHH+ + VD ++KY
Sbjct: 182 HFEHFYYR--REPEKVQQLADYVIRHHWPEL--------------------VD-DADKYV 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 LWFRDVVTRTATLIASWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFKPDFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + +L+ +D N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFENQPAVGLWNLQRLAQSLSPFIAVD--ALNVALDDYQHALLTVYGRRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + ++ L M + DYT FR LS + + PL+ +D
Sbjct: 336 GLFTQQKGDNDLLDGLFALMIREGSDYTRTFRMLSVSEQHSAAS------PLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ A+ SW Y L I D +R+ M SVNP VLRN+L Q AI+ AE GD E
Sbjct: 388 ---RAAFDSWFAGYRARLRDEPIDDAQRQQQMQSVNPALVLRNWLAQRAIELAEQGDMSE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL +++ +P+ ++ ++Y PP W R V SCSS
Sbjct: 445 LARLHEVLSQPFADRD--DEYINRPPDWGRRLEV---SCSS 480
>gi|386333449|ref|YP_006029619.1| hypothetical protein RSPO_c01783 [Ralstonia solanacearum Po82]
gi|334195898|gb|AEG69083.1| Hypothetical cytosolic protein [Ralstonia solanacearum Po82]
Length = 529
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 216/537 (40%), Positives = 277/537 (51%), Gaps = 72/537 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L E P F G A + P A Y GH
Sbjct: 38 TRLPPLPMPASPYLVGFSPEAAAPLGLSRAGLETPAGLDVFVGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+QLKGAG TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQLKGAGLTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRREAI-------ETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + + Y A EVA
Sbjct: 209 -NEKLPELRALADFVI---------------------DRFYPACRAEAQPYLALLREVAR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 RTAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNI----------------------AQFSTTLAAAKLIDDKEANYVMER--Y 469
A QP I WN+ S A ID +A ++ R Y
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLFGSRSDNDGAAFVDLSDEAQAQPAIDAAQAALLVYRDTY 365
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
G F Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 366 GAAFYARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTP 420
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
D ++ V D +++ +W+ Y + L + + D+ R M VNPKYVLRN+
Sbjct: 421 ADAQARTVRDVFFD-----RDSADAWLADYRRRLQAEPLPDDARAEAMRHVNPKYVLRNH 475
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + AI A+ DF EV L ++ RP+D+ PG E+YA P WA +SCSS
Sbjct: 476 LAEIAIRRAKEKDFSEVEHLRTVLARPFDDHPGFERYAGPAPDWA---ASLEVSCSS 529
>gi|398858786|ref|ZP_10614472.1| hypothetical protein PMI36_02381 [Pseudomonas sp. GM79]
gi|398238359|gb|EJN24089.1| hypothetical protein PMI36_02381 [Pseudomonas sp. GM79]
Length = 487
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 222/551 (40%), Positives = 300/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P + P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-IAAPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAETPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNNAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI--HASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L ++ + HF H
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEFFYYTKRPEQQ----KELGEHVLAMHFPHC 214
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ + E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 215 --LEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLLEHLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + PE + L L+ +D+ + + +W YI + G+ D E+R+
Sbjct: 371 RRLGD-----ESPE-QTLARLRDDFVDL-----KGFDAWGELYIARVAREGVVDQEQRRT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME+YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYAEVRRLHAVLSNPFEEQPGMERYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|408416152|ref|YP_006626859.1| hypothetical protein BN118_2300 [Bordetella pertussis 18323]
gi|401778322|emb|CCJ63725.1| conserved hypothetical protein [Bordetella pertussis 18323]
Length = 495
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 215/536 (40%), Positives = 283/536 (52%), Gaps = 50/536 (9%)
Query: 112 VRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP 171
+++LP D ++P E YT++ P P+L+ + A + LDP EF F
Sbjct: 6 LQDLPTDNSFAALPAEF----YTRLQPRPPAA-PRLLHANAEAAALIGLDPAEFSTQAFL 60
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
FSG PL G A Y GHQFG+WAGQLGDGRA LGE+ + WELQLKGAG T
Sbjct: 61 DVFSGHAPLPGGDTLAAVYSGHQFGVWAGQLGDGRAHLLGEVRG-PAGGWELQLKGAGMT 119
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR DG AVLRSS+RE+L SEAMH LGIPTTR+L LV + V R+ E
Sbjct: 120 PYSRMGDGRAVLRSSVREYLASEAMHGLGIPTTRSLALVVSDDPVMRETV-------ETA 172
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
A+V R+A SF+RFGS++ ++R Q + +R LADY I +
Sbjct: 173 AVVTRMAPSFVRFGSFEHWSARRQPEQ--LRVLADYVIDRFYPECRVAGAGR-------- 222
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
+D + V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF
Sbjct: 223 -----LDGEHGEILGLLAAVTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGF 277
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYG 470
+D F N +D G RY + QP +GLWN+ + +++L L D EA V++ Y
Sbjct: 278 MDTFQLGHICNHSDSEG-RYAWNRQPSVGLWNLYRLASSL--HTLAPDPEALRAVLDGYE 334
Query: 471 TKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
F + M KLGLP++ ++ ++ LL M D+T FR L P
Sbjct: 335 AVFTQAFHGRMAGKLGLPQFLPEDETLLDDLLQLMHQQGADFTLAFRRLGEAVRGQRQPF 394
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
++ + + A +W S G + + R A M+ VNP YVLRN+L
Sbjct: 395 EDSFID------------RAAAGAWYDRLAARHASDGRAAQARAAAMDEVNPLYVLRNHL 442
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI AA GD GE+ LLKL+ PY QPG + YA L P WA +SCSS
Sbjct: 443 AEQAIRAAARGDAGEIDILLKLLRNPYKHQPGYDAYAGLAPDWA---AGLEVSCSS 495
>gi|157145977|ref|YP_001453296.1| hypothetical protein CKO_01731 [Citrobacter koseri ATCC BAA-895]
gi|157083182|gb|ABV12860.1| hypothetical protein CKO_01731 [Citrobacter koseri ATCC BAA-895]
Length = 431
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 200/473 (42%), Positives = 263/473 (55%), Gaps = 52/473 (10%)
Query: 174 FSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPY 233
+ G + L G P AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPY
Sbjct: 8 WGGESLLPGMSPLAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPY 67
Query: 234 SRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAI 293
SR DG AVLRS+IRE L SEAMH+LGIPTTRAL +VT+ V R+ E GA+
Sbjct: 68 SRMGDGRAVLRSTIRESLASEAMHYLGIPTTRALSIVTSDTPVYRETV-------ESGAM 120
Query: 294 VCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDED 353
+ R+AQS +RFG ++ R + D VR LAD+AIRH++ + +ED
Sbjct: 121 LMRLAQSHMRFGHFEHFYYR--REPDKVRQLADFAIRHYWPQFQ------------AEED 166
Query: 354 HSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLD 413
KYA W +V RTA L+A WQ VGF HGV+NTDNMS+LGLTIDYGPFGFLD
Sbjct: 167 ---------KYALWFRDVVARTARLIADWQTVGFAHGVMNTDNMSVLGLTIDYGPFGFLD 217
Query: 414 AFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKF 473
+ P F N +D G RY F NQP +GLWN+ + + TL+ +D N ++ Y
Sbjct: 218 DYQPGFICNHSDHQG-RYSFDNQPAVGLWNLQRLAQTLSPFMPVD--TLNDALDGYQLAL 274
Query: 474 MDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
+ Y M +KLG K + ++++L MA + DY+ FR LS + +
Sbjct: 275 LTHYGQRMRQKLGFFTEQKEDNALLNELFALMAREGSDYSRTFRMLSQTEQQSAAS---- 330
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
PL+ +D + A+ W Y L + D R+ M VNP VLRN+L Q
Sbjct: 331 --PLRDEFID-----RAAFDGWFSRYRARLQQEQMDDATRQQHMQRVNPAVVLRNWLAQR 383
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AI +AE GD GE+ +L +++ P+ ++ + Y PP W R V SCSS
Sbjct: 384 AIASAEQGDMGELHQLHQVLRDPFTDRN--DDYVSRPPDWGKRLEV---SCSS 431
>gi|289207204|ref|YP_003459270.1| hypothetical protein TK90_0017 [Thioalkalivibrio sp. K90mix]
gi|288942835|gb|ADC70534.1| protein of unknown function UPF0061 [Thioalkalivibrio sp. K90mix]
Length = 500
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 214/476 (44%), Positives = 269/476 (56%), Gaps = 59/476 (12%)
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
P+ G P A Y GHQFG++ QLGDGR LGE+ E WELQ+KGAG+T YSR AD
Sbjct: 73 PMEGPEPLASVYAGHQFGVFVPQLGDGRVKLLGEVRTATGEHWELQVKGAGRTRYSRGAD 132
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRSSIRE+L SEAM LG+PTTRA+ L + V R+ + EPGAIV R A
Sbjct: 133 GRAVLRSSIREYLISEAMAALGVPTTRAVALYGSSLQVLRE-------RVEPGAIVLRAA 185
Query: 299 QSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVV 357
SFLRFG ++ H S E L R L DYA+ H + + +
Sbjct: 186 PSFLRFGHFEYFHYSGYSERL---RELIDYALAHDYPELAD------------------- 223
Query: 358 DLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDP 417
+ AA +V TA ++A WQ VGF HGV+NTDNMS+LGLTIDYGPF FLDA+DP
Sbjct: 224 --AEDPVAAMLEQVIANTAEMIADWQAVGFCHGVMNTDNMSLLGLTIDYGPFAFLDAYDP 281
Query: 418 SFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEY 477
+ N TD G RY F QP I WN+ + + TL +EA +ER MD
Sbjct: 282 GYICNHTD-QGGRYAFDQQPAIAQWNLIRLAETLVIHFQDTTREA--AIERAKALLMDFM 338
Query: 478 ----QAIMTK---KLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
QA +T+ KLGL + ++ ++I LL MA + VDYT FFR L P + +
Sbjct: 339 PRFEQAWLTRMRTKLGLVEEHEGDLELIHDLLARMAEEGVDYTRFFRQL------PDLEQ 392
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
E+ L+A L D AW +W Y L + E R+A MN+VNPKY+LRN+L
Sbjct: 393 PEIREQLEAELEDAA-----AWRAWWSRYQARLEAEARPFEARRAAMNAVNPKYILRNHL 447
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
Q+AI+ AE GD E+ RL ++ RP+DEQP E YA LPPAWA LSCSS
Sbjct: 448 AQAAIEQAEAGDTSELLRLQAILARPFDEQPEFEAYADLPPAWA---AGIQLSCSS 500
>gi|309781983|ref|ZP_07676713.1| YdiU family protein [Ralstonia sp. 5_7_47FAA]
gi|404377676|ref|ZP_10982776.1| UPF0061 protein [Ralstonia sp. 5_2_56FAA]
gi|308919049|gb|EFP64716.1| YdiU family protein [Ralstonia sp. 5_7_47FAA]
gi|348611690|gb|EGY61330.1| UPF0061 protein [Ralstonia sp. 5_2_56FAA]
Length = 529
Score = 334 bits (857), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 216/529 (40%), Positives = 279/529 (52%), Gaps = 68/529 (12%)
Query: 138 PSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGM 197
P+ + P LV +S A SL + E + F+G + P A Y GHQFG+
Sbjct: 46 PAGAIGEPYLVGFSPDAAASLGISRAELDTAAGLAVFTGNAVATWSDPLATVYSGHQFGV 105
Query: 198 WAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMH 257
WAGQLGDGRA+ L E +E+QLKGAG+TPYSR DG AVLRSSIREFLCSEAM
Sbjct: 106 WAGQLGDGRALLLAEFQTADGP-YEVQLKGAGRTPYSRMGDGRAVLRSSIREFLCSEAMA 164
Query: 258 FLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQED 317
LGIPTTRALC+ V R+ + E A+V R+A SF+RFG ++ A+ E
Sbjct: 165 GLGIPTTRALCVTGADAPVRRE-------EIETAAVVTRLATSFVRFGHFEHFAA--SEQ 215
Query: 318 LDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTAS 377
L +R LADY I + ++SE Y A E+A RTA
Sbjct: 216 LPQLRALADYVIDRFY----PASRSEP-----------------QPYLALLREIARRTAE 254
Query: 378 LVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQP 437
L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +A QP
Sbjct: 255 LMADWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSD-SGGRYAYAQQP 313
Query: 438 DIGLWNIAQFSTTL-----------------AAAKLIDDKEANYVM---ERYGTKFMDEY 477
IG WN+ + L A A+ D N ++ + YG F Y
Sbjct: 314 QIGYWNLFCLAQALLPLFGEDPHVFVDLSDEAQAQPAIDAAQNVLLTYRDVYGAAFYARY 373
Query: 478 QAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPL 534
+A KLGL ++ + L + + DYT FFR L+ V+ D + E
Sbjct: 374 RA----KLGLSTAQDADEALFGDLFKLLHNQRADYTLFFRHLAEVRRDDTPAAAE----- 424
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ D +R A + W+ +Y Q L + SD+ER A M VNPKYVLRN+L + AI
Sbjct: 425 ARTVRDFFFDRAAADV-WLAAYRQRLQAEPQSDDERAAAMYRVNPKYVLRNHLAEIAIRR 483
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A+ DF EV L ++ RP+D+ PG E YA+ P WA +SCSS
Sbjct: 484 AKEKDFSEVENLRAVLARPFDDHPGFEHYAQPAPDWA---SSLEVSCSS 529
>gi|395496220|ref|ZP_10427799.1| hypothetical protein PPAM2_09135 [Pseudomonas sp. PAMC 25886]
Length = 487
Score = 334 bits (857), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 217/548 (39%), Positives = 304/548 (55%), Gaps = 64/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD T V P ++ P+LV SE+ L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------GFSTHVLPEP-IDEPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVAETPVFAELFGGHKLWAEAEPRAMIYSGHQFGSYNPQLGDGRGLLLGEVYNQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R+A S +RFG ++ + +L + LA++ + HF E
Sbjct: 166 E-------KQERAAMVLRLAHSHVRFGHFEYFYYTKKPELQ--KALAEHVLSLHF--PEC 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ + E Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 215 LEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDAQFICNHSDHEG-RYSFSNQVPIGQWNLSALAQAL--TPFIT 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
+ + Y F Y +M ++LGL +++++ +LL M VDYT FFR
Sbjct: 313 VEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLVERLLQLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L + A ++ L+ +D+ + + W + ++ S ++E+R+ M+
Sbjct: 373 LGDESAALAVAR------LRDDFVDL--KGFDEWADLYKARVEREASG--TEEQRRERMH 422
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP Y+LRNYL Q+AI AAELGD+ EVRRL +++ +P++EQPGME+YA+ PP W
Sbjct: 423 AVNPLYILRNYLAQNAIQAAELGDYSEVRRLHEVLTKPFEEQPGMEQYAQRPPDWGKH-- 480
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 481 -LEISCSS 487
>gi|410258674|gb|JAA17304.1| selenoprotein O [Pan troglodytes]
Length = 666
Score = 334 bits (856), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 199/446 (44%), Positives = 254/446 (56%), Gaps = 40/446 (8%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+T+V P+ + P+LVA SE
Sbjct: 45 LAGLRFDNRALRALPVETPPPGPEGAPSAPRPVPGACFTRVQPTP-LRQPRLVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L GA P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 104 ALLGLGAPPAREAEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LG+PTTRA VT+
Sbjct: 164 CTATGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSE 223
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V RVA +F+RFGS++I H R + DI L
Sbjct: 224 STVVRDVFYDGNPKYEQCTVVLRVASTFIRFGSFEIFKSADEHTGRAGPSVGRNDIRVQL 283
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S+S+ + AA+ EV +RTA +VA+WQ
Sbjct: 284 LDYVISSFYPEIQAAHASDSV----------------QRNAAFFREVTQRTARMVAEWQC 327
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 328 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCRWNL 386
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLN 500
+ + L ++ EA + E + +F Y M +KLGL + + ++SKLL
Sbjct: 387 RKLAEALQPELPLELGEA-ILAEEFDAEFQRHYLQKMRRKLGLVQVELEEDGALVSKLLE 445
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ + P
Sbjct: 446 TMHLTGADFTNTFSLLSSFPVELESP 471
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 54/106 (50%), Gaps = 23/106 (21%)
Query: 549 WISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
W W+ +Y L G D E +M++ NPKYVLRNY+ Q+AI+AAE GDF
Sbjct: 555 WADWLQAYRARLDKDLEGAGDAAAWQAEHVRVMHANNPKYVLRNYIAQNAIEAAERGDFS 614
Query: 602 EVRRLLKLMERPYDEQPG----------------MEKYARLPPAWA 631
EVRR+LKL+E PY + G Y+ PP WA
Sbjct: 615 EVRRVLKLLETPYHCEAGAATDAEATEANGADGRQRSYSSKPPLWA 660
>gi|119477338|ref|ZP_01617529.1| hypothetical protein GP2143_00152 [marine gamma proteobacterium
HTCC2143]
gi|119449264|gb|EAW30503.1| hypothetical protein GP2143_00152 [marine gamma proteobacterium
HTCC2143]
Length = 489
Score = 334 bits (856), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 211/533 (39%), Positives = 299/533 (56%), Gaps = 54/533 (10%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFF 174
+P D R + ++ ++ V P + P L++ + VA+ + LDP+ + F +F
Sbjct: 7 IPFDNRFSKLSNDL----FSDVKPQG-LAQPFLISANPVVAELIGLDPQALKTASFVEYF 61
Query: 175 SGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
SG L A P A Y GHQFG + QLGDGR + LGE+ + W+L LKGAG+TPYS
Sbjct: 62 SGNATLRNASPLAMVYSGHQFGSYNPQLGDGRGLLLGEVETASNGTWDLHLKGAGQTPYS 121
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
RFADG AVLRS+IRE+LCSEAM LGI TTR L ++ + V R+ E GA +
Sbjct: 122 RFADGRAVLRSTIREYLCSEAMAGLGIATTRGLGIIGSATPVYRE-------TPEMGATL 174
Query: 295 CRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RVAQS +RFGS++ + DIV+ LADY I +F +E +SE+
Sbjct: 175 VRVAQSHVRFGSFEYFHYNNRP--DIVKQLADYVITRNFPELE---QSET---------- 219
Query: 355 SVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDA 414
KYA + + V TA ++AQWQ VGF HGV+NTDNMSI+G T D+GPFGF+D
Sbjct: 220 --------KYADFLLAVVTSTAFMIAQWQAVGFAHGVMNTDNMSIIGDTFDFGPFGFMDD 271
Query: 415 FDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFM 474
++P+F N +D G RY F QP IGLWN+ + L+ LID + + +Y +
Sbjct: 272 YNPNFICNHSDHEG-RYAFNQQPGIGLWNLNALAHALST--LIDRESITQALSQYEQLLV 328
Query: 475 DEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELL 531
++Y I KLGL + + +++ LL+ + K DYTNFFR LS+ + S PE E L
Sbjct: 329 NQYNRIFRLKLGLREEKDADAELVGSLLDLLEDQKADYTNFFRLLSHCQH--SSPEFETL 386
Query: 532 VPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSA 591
L+ +D + ++ +W+L Y Q L++ R+ M + NPKY+LRNY+ Q
Sbjct: 387 --LRDRFVD-----RSSFDAWMLQYQQRLMAENSDPVLRRETMLATNPKYILRNYIAQQV 439
Query: 592 IDAAELG-DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
ID A D+ ++ LL +++ P++E P YA PP W R V SCSS
Sbjct: 440 IDKANSDQDYSDIGNLLTILQNPFEEHPQFSHYASDPPDWGKRLEV---SCSS 489
>gi|292488141|ref|YP_003531020.1| hypothetical protein EAMY_1662 [Erwinia amylovora CFBP1430]
gi|292899351|ref|YP_003538720.1| hypothetical protein EAM_1638 [Erwinia amylovora ATCC 49946]
gi|428785076|ref|ZP_19002567.1| UPF0061 protein [Erwinia amylovora ACW56400]
gi|291199199|emb|CBJ46313.1| conserved hypothetical protein [Erwinia amylovora ATCC 49946]
gi|291553567|emb|CBA20612.1| UPF0061 protein ECA1842 [Erwinia amylovora CFBP1430]
gi|312172275|emb|CBX80532.1| UPF0061 protein ECA1842 [Erwinia amylovora ATCC BAA-2158]
gi|426276638|gb|EKV54365.1| UPF0061 protein [Erwinia amylovora ACW56400]
Length = 479
Score = 334 bits (856), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 209/518 (40%), Positives = 284/518 (54%), Gaps = 52/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L+ YT P+ ++N +L+ + +A L+LD + F+ + L+ P G P AQ
Sbjct: 11 LNGFYTAQQPTP-LKNARLLYHNAGLARELKLDERLFQAQNVGLWNGERLP-EGMQPLAQ 68
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR I LGE +++ LKGAG TPYSR DG AVLRS++R
Sbjct: 69 VYSGHQFGVWAGQLGDGRGILLGEQQLPDGRKFDWHLKGAGLTPYSRMGDGRAVLRSTLR 128
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EFL EAMH LGI T+RAL +VT+ + V R+ E GA++ RVA+S +RFG ++
Sbjct: 129 EFLAGEAMHHLGIKTSRALTVVTSDEPVYRE-------TTETGAMLLRVAESHVRFGHFE 181
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
GQ + V LADY IRHH+ +KY W
Sbjct: 182 HFYYLGQP--EKVTQLADYVIRHHWPQWVQ---------------------ERDKYLLWF 218
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V +RTA L+A WQ +GF HGV+NTDNMSILGLT+DYGPFGFLD + P + N +D G
Sbjct: 219 SDVVQRTARLIAGWQSIGFAHGVMNTDNMSILGLTLDYGPFGFLDDYQPGYICNHSDYQG 278
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP IGLWN+ + + L+ L+ ++ + Y + M + M KLGL
Sbjct: 279 -RYSFENQPTIGLWNLNRLAHALSG--LMSPQQLKQALAGYEPELMRCWGEKMRAKLGLL 335
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + I++ LL+ M ++ DYT FR LS ++ S PL+ +D
Sbjct: 336 TPGKDDNHILTGLLSLMTRERSDYTRTFRQLSQIQQLQSRS------PLRDEFID----- 384
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ SW + Q LL SDEER+ M NP +LRNYL Q AI+ AE D + R
Sbjct: 385 RDAFDSWYNVWRQRLLKEECSDEERQRTMKLANPALILRNYLAQQAIERAEQDDISVLAR 444
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + + +PY + P AR PP W + V SCSS
Sbjct: 445 LHQALSQPYADAPEFADLARRPPDWGKKLEV---SCSS 479
>gi|410223380|gb|JAA08909.1| selenoprotein O [Pan troglodytes]
gi|410290304|gb|JAA23752.1| selenoprotein O [Pan troglodytes]
Length = 666
Score = 334 bits (856), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 199/446 (44%), Positives = 254/446 (56%), Gaps = 40/446 (8%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+T+V P+ + P+LVA SE
Sbjct: 45 LAGLRFDNRALRALPVETPPPGPEGAPSAPRPVPGACFTRVQPTP-LRQPRLVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L GA P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 104 ALLGLGAPPAREAEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LG+PTTRA VT+
Sbjct: 164 CTATGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSE 223
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V RVA +F+RFGS++I H R + DI L
Sbjct: 224 STVVRDVFYDGNPKYEQCTVVLRVASTFIRFGSFEIFKSADEHTGRAGPSVGRNDIRVQL 283
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S+S+ + AA+ EV +RTA +VA+WQ
Sbjct: 284 LDYVISSFYPEIQAAHASDSV----------------QRNAAFFREVTQRTARMVAEWQC 327
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 328 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCRWNL 386
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLN 500
+ + L ++ EA + E + +F Y M +KLGL + + ++SKLL
Sbjct: 387 RKLAEALQPELPLELGEA-ILAEEFDAEFQRHYLQKMRRKLGLVQVELEEDGALVSKLLE 445
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ + P
Sbjct: 446 TMHLTGADFTNTFSLLSSFPVELESP 471
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 54/106 (50%), Gaps = 23/106 (21%)
Query: 549 WISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
W W+ +Y L G D E +M++ NPKYVLRNY+ Q+AI+AAE GDF
Sbjct: 555 WADWLQAYRARLDKDLEGAGDAAAWQAEHVRVMHANNPKYVLRNYIAQNAIEAAERGDFS 614
Query: 602 EVRRLLKLMERPYDEQPG----------------MEKYARLPPAWA 631
EVRR+LKL+E PY + G Y+ PP WA
Sbjct: 615 EVRRVLKLLETPYHCEAGAATDAEATEANGADGRQRSYSSKPPLWA 660
>gi|269961052|ref|ZP_06175421.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
gi|269834271|gb|EEZ88361.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
Length = 489
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 210/551 (38%), Positives = 291/551 (52%), Gaps = 68/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ E +N+ H F ELP A +T V+P ++N + V W+ A
Sbjct: 1 MSVWEGVNFTHRF-SELPS-------------AFFTYVTPQL-LDNTRWVVWNGEFAQQF 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L E E + F+G A P A Y GHQFG++ LGDGR + L E+ +
Sbjct: 46 GLPATENE--ELLNVFAGQKEFAPFAPLAMKYAGHQFGVYNPDLGDGRGLLLAEMQHQDG 103
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+++ LKGAG TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++ + V R
Sbjct: 104 TWFDIHLKGAGLTPYSRMGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMDSDTPVYR 163
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K E GA++ RVA++ +RFG ++ Q L + LAD I HF
Sbjct: 164 E-------KMEYGALLIRVAETHIRFGHFEHFFYTNQ--LAEQKLLADKVIEWHFPECSQ 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
K YAA V E+TA ++A WQ GF HGV+NTDNMS
Sbjct: 215 AEKP---------------------YAAMFESVVEKTAEMIAYWQAYGFAHGVMNTDNMS 253
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG T DYGPFGFLD +DP++ N +D G RY F QP I LWN++ + +L+ L+
Sbjct: 254 ILGQTFDYGPFGFLDDYDPNYICNHSDYQG-RYAFEQQPRIALWNLSALAHSLSP--LVQ 310
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
++ + ++ + ++ +M KLGL + ++ + + +K DYT FFR
Sbjct: 311 REDLEVALGKFEVRLSQKFSELMRAKLGLHTKVDEDGRLFEAMFELLNQNKADYTRFFRE 370
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ-ELLSSG--ISDEERKA 572
LSN+ ++ +P + L I +E AW+ L+ + E+ G +S E R
Sbjct: 371 LSNL---------DVKLPQAVIDLFIDREAASAWVDLYLARCELEVDEHGERVSAETRCE 421
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NPKY+LRNYL Q AID AE GDF EV RL +L++RPYDEQP + YA+LPP W
Sbjct: 422 KMRRTNPKYILRNYLAQLAIDKAEEGDFSEVNRLAELLKRPYDEQPEFDDYAKLPPEWGK 481
Query: 633 RPGVCMLSCSS 643
+ +SCSS
Sbjct: 482 K---MEISCSS 489
>gi|167836286|ref|ZP_02463169.1| hypothetical protein Bpse38_07331 [Burkholderia thailandensis
MSMB43]
Length = 476
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 214/521 (41%), Positives = 279/521 (53%), Gaps = 69/521 (13%)
Query: 145 PQLVAWSESVADSLELDPKEFERPDFPLFFSGATPL---AGAVPYAQCYGGHQFGMWAGQ 201
P +V +S+ A L LDP + P F F G ++PYA Y GHQFG+WAGQ
Sbjct: 3 PYVVGFSDEAARMLGLDPALRDAPGFADLFCGNPTRDWPPASLPYASVYSGHQFGVWAGQ 62
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGRA+T+GE+ + R+ELQLKGAG+TPYSR DG AVLRSSIREFL SEAMH LGI
Sbjct: 63 LGDGRALTIGELAH-DGRRYELQLKGAGRTPYSRMGDGRAVLRSSIREFLGSEAMHHLGI 121
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDI 320
PTTRAL ++ + + V R+ E A+V RVA+SF+RFG ++ A+ E L
Sbjct: 122 PTTRALTVIGSDQPVIREEI-------ETSAVVTRVAESFVRFGHFEHFFANDRPEQL-- 172
Query: 321 VRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVA 380
R LAD+ I D + + Y A EV RTA LVA
Sbjct: 173 -RALADHVI---------------------DRFYPACRDADDPYLALLAEVTRRTAELVA 210
Query: 381 QWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIG 440
QWQ VGF HGV+NTDNMSILG+TIDYGPFGF+DAFD N +D G RY + QP I
Sbjct: 211 QWQAVGFCHGVMNTDNMSILGVTIDYGPFGFIDAFDAKHVCNHSDTHG-RYAYRMQPRIA 269
Query: 441 LWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
WN + L A + ++D A V+ R+ +F + M KL
Sbjct: 270 HWNCFCLAQALLPLFGLDRDAPSEDARAERAVEDAHA--VLGRFPEQFGPALERAMRAKL 327
Query: 486 GLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL + + + ++LL M D+T FR L+ V + + P + + +D
Sbjct: 328 GLALEREGDAALANQLLEIMDASHADFTLTFRHLARVSKHDARGD----APARDLFID-- 381
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
++A+ W Y L D R A MN NPKYVLRN+L ++AI A+ DF E
Sbjct: 382 ---RDAFDRWANLYRARLSEEARDDAARAAAMNRSNPKYVLRNHLAETAIRRAKEKDFSE 438
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL ++ RP+DEQP + YA LPP WA +SCSS
Sbjct: 439 IERLAAVLRRPFDEQPEHDAYAALPPDWA---STLEVSCSS 476
>gi|359798881|ref|ZP_09301450.1| hypothetical protein KYC_18090 [Achromobacter arsenitoxydans SY8]
gi|359363019|gb|EHK64747.1| hypothetical protein KYC_18090 [Achromobacter arsenitoxydans SY8]
Length = 495
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 213/518 (41%), Positives = 291/518 (56%), Gaps = 48/518 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT+++P + NP+L+ + A + LDP P+F FSGA PL G A Y
Sbjct: 21 AFYTRLAPQG-LNNPRLLHANADAAALIGLDPAALSTPEFLDVFSGARPLPGGDTLAAVY 79
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGE+ + WELQLKG+G TPYSR DG AVLRSS+RE+
Sbjct: 80 SGHQFGVWAGQLGDGRAHLLGEVQGPEGG-WELQLKGSGMTPYSRMGDGRAVLRSSVREY 138
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH LG+PTTRAL LV + V R+ E AIV R++ SF+RFGS++
Sbjct: 139 LASEAMHGLGVPTTRALALVVSDDPVMRETV-------ETAAIVTRMSPSFVRFGSFEHW 191
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+SR Q D+ ++TLADY I ++ + ES + + Y
Sbjct: 192 SSRRQPDM--LKTLADYVIDRYYPECRDAPAGESPA-------------DTAPYINLLRA 236
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF+D F N +D G R
Sbjct: 237 VTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGFMDGFRLGHVCNHSDSEG-R 295
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP + LWN+ + +L L+ D +A V++ + F + M K+GL
Sbjct: 296 YSWNRQPSVALWNLYRLGGSL--HMLVQDADALRAVLDEFEAVFTRAFHDRMGAKMGLAA 353
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSN-VKADPSIPEDELLVPLKAVLLDIGKER 545
+ ++ ++ LL M ++ D+T +R L++ V+ S ED L I +
Sbjct: 354 WLPEDEALLDDLLKLMDANQADFTLTWRRLADAVQGRRSAFED----------LFIDRPA 403
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
AW+ +++ + G +E A MN VNP YVLRN+L + AI AA+ GD E+
Sbjct: 404 ASAWLDRLVARHAQ---DGRLVQETVAGMNRVNPLYVLRNHLAEQAIRAAKTGDASEIDT 460
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L+KL+ P+ Q G E+YA LPP WA G +SCSS
Sbjct: 461 LMKLLRNPFVAQEGYERYATLPPDWA---GGIEVSCSS 495
>gi|397685525|ref|YP_006522844.1| hypothetical protein PSJM300_02030 [Pseudomonas stutzeri DSM 10701]
gi|395807081|gb|AFN76486.1| hypothetical protein PSJM300_02030 [Pseudomonas stutzeri DSM 10701]
Length = 486
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 216/548 (39%), Positives = 298/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L +LN+D+ F R GD T+V P E P+LV SE+ L
Sbjct: 1 MKTLTELNFDNRFAR--LGD------------VFSTEVMPQPLAE-PRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E +RP F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLDPTEADRPLFAELFSGHKLWSTAEPRAMVYSGHQFGAYNPQLGDGRGLLLGEVINDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+ W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ ++ V R
Sbjct: 106 DYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTSSQTPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ ++E GA++ R+A S +RFG ++ Q D +R L D+ I HF +
Sbjct: 166 E-------RQERGAMLLRLAPSHVRFGHFEFFYYTRQH--DALRQLLDHVIACHF--PDC 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ E Y ++ +V ERTA ++A+WQ GF HGV+NTDNMS
Sbjct: 215 LEHPEP-------------------YRSFFRQVLERTAGMIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDARFICNHSDDTG-RYSFENQVPIAHWNLAALAQAL--TPFVE 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
M+ + + ++ A+M +LG + ++ +I LL M VDYT FFR
Sbjct: 313 VGALRESMDLFLPLYEAQWLALMRGRLGFVQADDGDQALIQDLLKLMQGSAVDYTRFFRE 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L + P ++ L L+ +D+ + + W +Y Q GI R+ M
Sbjct: 373 LGDS------PAEQALSRLREDFVDL-----QGFDRWAQTYRQRSEREGIEQVARQTRMR 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+ NPKY+LRNYL Q AI+AAE G++ VR L ++ RP+DEQPGME+YA+ PP W
Sbjct: 422 AANPKYILRNYLAQQAIEAAEQGNYEPVRELHAVLSRPFDEQPGMERYAQRPPEWGKH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|344174697|emb|CCA86507.1| conserved hypothetical protein, UPF0061 [Ralstonia syzygii R24]
Length = 529
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 212/537 (39%), Positives = 280/537 (52%), Gaps = 72/537 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L E + P F+G A + P A Y GH
Sbjct: 38 TRLPPIPMPASPDLVGFSPEAAAPLGLSRAELDTPAGLDVFAGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+Q+KGAG+TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQIKGAGRTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRREEI-------ETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ + D + + Y A E A
Sbjct: 209 -NEKLPELRALADFVL---------------------DRFYPACRAEAQPYLALLRETAR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 RTAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNIAQFSTTL-------------AAAKLIDDKEANYVM-----------ERY 469
A QP I WN+ + L A L D+ +A + + Y
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLFGSRSDNDGTAFVDLSDEAQAQPAIDAAQEALLVYRDTY 365
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
G F Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 366 GAAFYARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTP 420
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
++ V D +++ +W+ +Y Q L + + D+ R M VNPKYVLRN+
Sbjct: 421 ALAQARTVRDVFFD-----RDSADAWLAAYRQRLQAEPVPDDARAEAMRRVNPKYVLRNH 475
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + AI A+ DF EV L ++ RP+D+ PG E+YA P WA +SCSS
Sbjct: 476 LAEIAIRRAKEKDFAEVENLRAVLARPFDDHPGFERYAGPAPDWA---ASLEVSCSS 529
>gi|426407294|ref|YP_007027393.1| hypothetical protein PputUW4_00380 [Pseudomonas sp. UW4]
gi|426265511|gb|AFY17588.1| hypothetical protein PputUW4_00380 [Pseudomonas sp. UW4]
Length = 487
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 220/551 (39%), Positives = 302/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFDR--LGD------------AFSAHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P+F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAETPEFAELFSGHKLWADAIPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA++ L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALNALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKRPEQQ----KVLGDHVLAMHFP-- 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLLENLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + + +I L+ +D+ + + +W Y+ + G D E+R+
Sbjct: 371 RRLGDEAPEQAITR------LRDDFVDL-----KGFDAWGELYVARVAREGAVDQEQRRQ 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSNPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|424047081|ref|ZP_17784642.1| hypothetical protein VCHENC03_2312 [Vibrio cholerae HENC-03]
gi|408884379|gb|EKM23123.1| hypothetical protein VCHENC03_2312 [Vibrio cholerae HENC-03]
Length = 489
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 210/551 (38%), Positives = 289/551 (52%), Gaps = 68/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ E +N+ H F ELP A +T V+P ++N + V W+ A
Sbjct: 1 MSVWEGVNFTHRF-SELPS-------------AFFTYVTPQF-LDNTRWVVWNGEFAQQF 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L E E + F+G A P A Y GHQFG++ LGDGR + L E+ +
Sbjct: 46 GLPATENE--ELLNVFAGQKEFAPFAPLAMKYAGHQFGVYNPDLGDGRGLLLAEMQHQDG 103
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+++ LKGAG TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++ + V R
Sbjct: 104 TWFDIHLKGAGLTPYSRMGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMDSDTPVYR 163
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K E GA++ RVA++ +RFG ++ Q L + LAD I HF
Sbjct: 164 E-------KMEYGALLIRVAETHIRFGHFEHFFYTNQ--LAEQKLLADKVIEWHFPECSQ 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
K YAA V E+TA ++A WQ GF HGV+NTDNMS
Sbjct: 215 AEKP---------------------YAAMFESVVEKTAEMIAYWQAYGFAHGVMNTDNMS 253
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG T DYGPFGFLD +DP++ N +D G RY F QP I LWN++ + +L+ +
Sbjct: 254 ILGQTFDYGPFGFLDDYDPNYICNHSDYQG-RYAFEQQPRIALWNLSALAHSLSPLVQRE 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
D EA + ++ + ++ +M KLGL + ++ + + +K DYT FFR
Sbjct: 313 DLEA--ALGKFEVRLSQKFSELMRAKLGLHTKVDEDGRLFEAMFELLNQNKADYTRFFRE 370
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ---ELLSSGISDEERKA 572
LSN+ ++ P + L I +E AW+ L+ + + L +S + R
Sbjct: 371 LSNL---------DVKAPQAVIDLFIDREAASAWVDLYLARCELEVDELGERVSAQTRCE 421
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NPKY+LRNYL Q AID AE GDF EV RL +L++RPYDEQP + YA+LPP W
Sbjct: 422 QMRRTNPKYILRNYLAQLAIDKAEEGDFSEVNRLAELLKRPYDEQPEFDDYAKLPPEWGK 481
Query: 633 RPGVCMLSCSS 643
+ +SCSS
Sbjct: 482 K---MEISCSS 489
>gi|348551636|ref|XP_003461636.1| PREDICTED: LOW QUALITY PROTEIN: selenoprotein O-like [Cavia
porcellus]
Length = 697
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 200/474 (42%), Positives = 263/474 (55%), Gaps = 38/474 (8%)
Query: 93 SKMTKKLKALEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQ 146
+ M + L L +D+ +R LP G S+PR V AC+++ P A + P+
Sbjct: 60 TAMDSAPRWLAGLRFDNQVLRALPVETPPPGSEDALSVPRTVAGACFSRARP-ARLRQPR 118
Query: 147 LVAWSESVADSLEL-DPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDG 205
+VA S L L +P + LFFSG L GA P A CY GHQFG +AGQLGDG
Sbjct: 119 VVALSGPALALLGLPEPDASVEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDG 178
Query: 206 RAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTR 265
A+ LGE+ ERWE+QLKGAG T +SR ADG VLRSSIREFLCSEAM LGIPTTR
Sbjct: 179 AAMYLGEVCTEAGERWEMQLKGAGPTAFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTR 238
Query: 266 ALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI---------HASRGQE 316
A VT+ V RD+FYDGNPK E +V R+A +F+RFGS++I A +
Sbjct: 239 AGACVTSESTVVRDVFYDGNPKYEKCTVVLRIAPTFIRFGSFEIFKPADEYTGRAGPSVQ 298
Query: 317 DLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTA 376
DI L DY I + I+ + +S + AA+ EV RTA
Sbjct: 299 RNDIRIQLLDYVISSFYPEIQAAHACDSDRVP--------------RNAAFFREVTRRTA 344
Query: 377 SLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQ 436
+VA+WQ VGF HGVLNTDNMSI+GLTIDYGPFGFLD +DP N +D G RY ++ Q
Sbjct: 345 RMVAEWQCVGFCHGVLNTDNMSIVGLTIDYGPFGFLDRYDPDHVCNASDNAG-RYTYSKQ 403
Query: 437 PDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ--- 493
P++ WN+ + + L + E V E + T+F Y M +KLGL + ++
Sbjct: 404 PEVCKWNLQKLAEALEPELPLALGE-TIVAEEFDTEFQKHYLQKMRRKLGLVQGEREEDG 462
Query: 494 -IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE-DELLVPLKAVLLDIGKER 545
+++KLL M + D+TN F LS+ A+P P DE L L + + + R
Sbjct: 463 ALVAKLLETMHLTGADFTNTFCLLSSFPAEPEAPGLDEFLTALTSQCASLEERR 516
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 38/45 (84%)
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQ 617
+M+S NPKYVLRNY+ Q+AI+AAE GDF EVRR+LKL+E PY +
Sbjct: 611 VMHSSNPKYVLRNYIAQNAIEAAENGDFSEVRRVLKLLESPYQHE 655
>gi|398841409|ref|ZP_10598630.1| hypothetical protein PMI18_04000 [Pseudomonas sp. GM102]
gi|398108499|gb|EJL98457.1| hypothetical protein PMI18_04000 [Pseudomonas sp. GM102]
Length = 487
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 221/551 (40%), Positives = 298/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P + P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-IAAPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P F F G A A P A Y GHQFG + QLGDGR + LGEI N
Sbjct: 46 DLDPAEAETPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEIYNNAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI--HASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L ++ + HF H
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEFFYYTKRPEQQ----KELGEHVLAMHFPHC 214
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ + E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 215 --LEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLLEHLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L N + +I L+ +D+ + + +W YI + G D ++R+
Sbjct: 371 RRLGNESPELAIAR------LRDDFVDL-----KGFDAWGELYIARVAREGNGDQQQRRK 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME+YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSNPFEEQPGMERYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|344169562|emb|CCA81922.1| conserved hypothetical protein, UPF0061 [blood disease bacterium
R229]
Length = 529
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 212/537 (39%), Positives = 280/537 (52%), Gaps = 72/537 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L E + P F+G A + P A Y GH
Sbjct: 38 TRLPPMPMPASPDLVGFSPEAAAPLGLSRAELDTPAGLDVFAGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+Q+KGAG+TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQIKGAGRTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRREEI-------ETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + + Y A E A
Sbjct: 209 -NEKLPELRALADFVI---------------------DRFYPACRAEAQPYLALLRETAR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 RTAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNIAQFSTTL-------------AAAKLIDDKEANYVM-----------ERY 469
A QP I WN+ + L A L D+ +A + + Y
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLFGSRSDNDGAAFVDLSDEAQAQPAIDAAQEALLVYRDTY 365
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
G F Y+A KLGL + ++ + L + + DYT FFR L++V+ + P
Sbjct: 366 GAAFYARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRN-DTP 420
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
++ V D +++ +W+ +Y Q L + + D+ R M VNPKYVLRN+
Sbjct: 421 ALAQARTVRDVFFD-----RDSADAWLAAYRQRLQAEPVPDDARAEAMRRVNPKYVLRNH 475
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + AI A+ DF EV L ++ RP+D+ PG E+YA P WA +SCSS
Sbjct: 476 LAEIAIRRAKEKDFAEVENLRAVLARPFDDHPGFERYAGPAPDWA---ASLEVSCSS 529
>gi|421505340|ref|ZP_15952278.1| hypothetical protein A471_18750 [Pseudomonas mendocina DLHK]
gi|400343749|gb|EJO92121.1| hypothetical protein A471_18750 [Pseudomonas mendocina DLHK]
Length = 487
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 217/550 (39%), Positives = 302/550 (54%), Gaps = 68/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L+ L +D+ F R GD A T+V P +E P+LV S L
Sbjct: 1 MKKLDQLTFDNRFAR--LGD------------AFSTEVLPEP-IEQPRLVVASSDAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E +R +F F+G A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLDPAEAQREEFAELFAGHKLWGEAEPRAMVYSGHQFGGYTPRLGDGRGLLLGEVVNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ T+ V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTTSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+A S +RFG ++ + +R E L + L ++ + +HF H
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTRQHEQLKV---LGEHVLANHFPHC- 214
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
T DE + A EV ERTA+++A WQ GF HGV+NTDNM
Sbjct: 215 ----------LTQDE----------PWLAMFREVLERTAAMIAHWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L ++
Sbjct: 255 SILGITFDYGPYAFLDDFDANHICNHSDDTG-RYSFSNQVPIAHWNLAALAQAL--TPMV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDK-VDYTNFF 513
+ ++ +E + + Y +M K+LGL ++ ++ +LL M K DY+ FF
Sbjct: 312 EVEKLRETLELFLPLYQAHYLDLMRKRLGLTSAEDDDEALVQRLLQLMQQGKATDYSLFF 371
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P D L V ++ +D+ + +W Y+ G ER+A
Sbjct: 372 RQLGE-----QAPADALQV-VRNDFVDLA-----GFDAWGRDYLARCEREGQQQAERRAR 420
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP Y+LRNYL Q I+AAE GD+G VR L ++ RP+DEQPGM++YA+ PP W
Sbjct: 421 MHAVNPLYILRNYLAQQVIEAAEAGDYGPVRELHAVLSRPFDEQPGMQRYAQRPPEWGKH 480
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 481 ---LEISCSS 487
>gi|451970174|ref|ZP_21923401.1| Selenoprotein O and cysteine-containing protein [Vibrio
alginolyticus E0666]
gi|451933688|gb|EMD81355.1| Selenoprotein O and cysteine-containing protein [Vibrio
alginolyticus E0666]
Length = 489
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 205/520 (39%), Positives = 285/520 (54%), Gaps = 56/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L P E + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGL-PAE-QSDELLAVFSGQSEFEPFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQDGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRALGMMVSDTPVYRE-------KTEFGAMLIRMAETHVRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA E
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFTDCASAEKP---------------------YAAMFGE 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GFTHGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVQKTADMIAYWQAYGFTHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L+ +D EA+ + ++ + ++ +M +KLGL
Sbjct: 285 YAFDQQPRIALWNLSALAHALSPLVEREDLEAS--LSQFEVRLSQQFSRLMREKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK 546
+ ++ + + + DYT FFR LSN+ D S +AV+ L + +E
Sbjct: 343 IAEDGRLFEAMFELLHQNNTDYTRFFRTLSNLDTDSS----------QAVIDLFLDREAA 392
Query: 547 EAWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW+ L+ + + L IS E+R M NPKY+LRNYL Q AID AE GDF E+
Sbjct: 393 RAWLDLYLARCELEVDELGELISAEQRCEQMRQANPKYILRNYLAQLAIDKAEEGDFSEL 452
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +L++RP+DEQP + YA+LPP W + + SCSS
Sbjct: 453 HRLAELLKRPFDEQPEFDDYAKLPPEWGKKMEI---SCSS 489
>gi|421888121|ref|ZP_16319233.1| conserved hypothetical protein, UPF0061 [Ralstonia solanacearum
K60-1]
gi|378966511|emb|CCF95981.1| conserved hypothetical protein, UPF0061 [Ralstonia solanacearum
K60-1]
Length = 529
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 215/537 (40%), Positives = 277/537 (51%), Gaps = 72/537 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L + P F G A + P A Y GH
Sbjct: 38 TRLPPLPMPASPYLVGFSPEAAAPLGLSHAGLDTPAGLDVFVGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+QLKGAG TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQLKGAGLTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRREAI-------ETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + + Y A EVA
Sbjct: 209 -NEKLPELRALADFVI---------------------DRFYPACRAEAQPYLALLREVAR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 RTAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNI----------------------AQFSTTLAAAKLIDDKEANYVMER--Y 469
A QP I WN+ S A ID +A ++ R Y
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLFGSRSDNDGAAFVDLSDETQAQPAIDAAQAALLVYRDTY 365
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
G F Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 366 GAAFYARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTP 420
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
D ++ V D +++ +W+ Y + L + + D+ R M VNPKYVLRN+
Sbjct: 421 ADAQARTVRDVFFD-----RDSADAWLADYRRRLQAEPLPDDARAEAMRRVNPKYVLRNH 475
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + AI A+ DF EV L ++ RP+D+ PG E+YA P WA +SCSS
Sbjct: 476 LAEIAIRRAKEKDFSEVEHLRTVLARPFDDHPGFERYAGPAPDWA---ASLEVSCSS 529
>gi|163857352|ref|YP_001631650.1| hypothetical protein Bpet3040 [Bordetella petrii DSM 12804]
gi|226703679|sp|A9IT50.1|Y3040_BORPD RecName: Full=UPF0061 protein Bpet3040
gi|163261080|emb|CAP43382.1| conserved hypothetical protein [Bordetella petrii]
Length = 497
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 218/519 (42%), Positives = 290/519 (55%), Gaps = 48/519 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT+++P + P+L+ +E A + L +F FSG PL G A Y
Sbjct: 21 AFYTRLAPQ-PLTAPRLLHANEQAAALIGLSADALRSDEFLRVFSGQQPLPGGQTLAAVY 79
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGE+ WELQLKGAG TPYSR DG AVLRSS+RE+
Sbjct: 80 SGHQFGVWAGQLGDGRAHLLGEVAGPDGN-WELQLKGAGMTPYSRMGDGRAVLRSSVREY 138
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH LGIPTTR+L LV + V R+ E AIV R++ SF+RFGS++
Sbjct: 139 LASEAMHGLGIPTTRSLALVVSDDPVMRETV-------ETAAIVTRMSPSFVRFGSFEHW 191
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+SR Q D +R LADY I + E+ D +++ + + E
Sbjct: 192 SSRRQP--DELRILADYVIDKFYPECREPRPGEAPG-----PDGALLRMLA--------E 236
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF+DAF N +D G R
Sbjct: 237 VTRRTAELMAGWQAVGFCHGVMNTDNMSILGLTLDYGPYGFMDAFRLDHICNHSDSEG-R 295
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP + LWN+ + +L A L+ D EA V++ Y F + M KLGL +
Sbjct: 296 YAWNRQPSVALWNLYRLGGSLHA--LVPDVEALRAVLDSYEVIFTRAFHQRMAAKLGLRE 353
Query: 490 YNK---QIISKLLNNMAVDKVDYTNFFRALSN-VKADPSIPEDELLVPLKAVLLDIGKER 545
+ ++ LL M ++ D+T FR L++ V+ P +D L I ++
Sbjct: 354 WRADDEPLLDDLLRLMHDNRADFTLTFRRLADAVRGRPQGLQD----------LFIDRDA 403
Query: 546 KEAWISWVLS-YIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
AW + + + QE +G + R A M++VNP YVLRN+L + AI AA+ GD GE+
Sbjct: 404 ALAWFERLAARHAQE--GAGNDAQARAAGMDAVNPLYVLRNHLAEQAIRAAKAGDAGEID 461
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LL L+ P EQPG + YA LPP WA G +SCSS
Sbjct: 462 TLLALLRDPCVEQPGRDAYAALPPDWA---GGIEVSCSS 497
>gi|422603852|ref|ZP_16675870.1| hypothetical protein PSYMO_01185 [Pseudomonas syringae pv. mori
str. 301020]
gi|330886272|gb|EGH20173.1| hypothetical protein PSYMO_01185 [Pseudomonas syringae pv. mori
str. 301020]
Length = 487
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 218/549 (39%), Positives = 305/549 (55%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTSVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL + ++Q++S+LL M VDYT FFR
Sbjct: 313 VEALRETIGLFLPLYQAHYLDLMRRRLGLTIAQEQDEQLVSQLLKLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI-SDEERKALM 574
L + P E L L+ +DI + + W +Y+ + G +++ER+ M
Sbjct: 373 LGDQ------PAVEALRTLRDDFVDI-----KGFDGWAEAYLARIAGEGKGTEQERQTRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGME YA+ PP W
Sbjct: 422 HAVNPLYILRNYLAQNAIAAAEKGDYAEVRRLHQVLCTPFTEQPGMEGYAQRPPDWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|226874893|ref|NP_001152883.1| selenoprotein O [Macaca mulatta]
Length = 669
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 197/446 (44%), Positives = 254/446 (56%), Gaps = 40/446 (8%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR+V AC+T+V P+ + P+LVA SE
Sbjct: 45 LAGLRFDNRALRALPVEAPPPGPEGAQSAPRQVPGACFTRVRPTP-LRQPRLVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L GA P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 104 ALLGLGAPPAREAEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LG+PTTRA VT+
Sbjct: 164 CTAAGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSE 223
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V R+A +F+RFGS++I H R + DI L
Sbjct: 224 STVVRDVFYDGNPKYEQCTVVLRIASTFIRFGSFEIFKSADEHTGRAGPSVGRNDIRVQL 283
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S+ + + AA+ EV RTA +VA+WQ
Sbjct: 284 LDYVISSFYPEIQAAHTSDRV----------------QRNAAFFREVTRRTAWMVAEWQC 327
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 328 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCKWNL 386
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLN 500
+ + L ++ EA + E + +F Y M +KLGL + ++ ++SKLL
Sbjct: 387 QKLAEALQPELPLELGEA-ILAEEFDAEFQRHYMQKMRRKLGLVQLELEEDRALVSKLLE 445
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ + P
Sbjct: 446 TMHLTGADFTNTFFLLSSFPVELESP 471
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 55/106 (51%), Gaps = 23/106 (21%)
Query: 549 WISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
W W+ +Y L G D ER +M++ NPKYVLRNY+ Q+AI+AAE GDF
Sbjct: 555 WAEWLQAYRARLDKDLEGAGDAAAWQAERVRVMHANNPKYVLRNYIAQNAIEAAERGDFS 614
Query: 602 EVRRLLKLMERPYDEQPG----------------MEKYARLPPAWA 631
EVRR+LKL+E PY + G Y+ PP WA
Sbjct: 615 EVRRVLKLLENPYHCEAGAATDPEATEADGADGRQRSYSSKPPLWA 660
>gi|319738592|ref|NP_001135537.2| selenoprotein O [Xenopus (Silurana) tropicalis]
Length = 651
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 195/456 (42%), Positives = 264/456 (57%), Gaps = 52/456 (11%)
Query: 105 LNWDHSFVRELPGDPRTDS-----IPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
L +D+ +R LP +P + PR+V AC+++V P+ + NP +VA S S L
Sbjct: 27 LTFDNLALRSLPVEPGDGTEEEARTPRQVPGACFSRVRPTPLL-NPTVVALSRSALSLLG 85
Query: 160 LDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE 219
L E E + +FSG L G+ P A CY GHQFG +AGQLGDG A+ LGE++N +
Sbjct: 86 LQVGE-EDEEATEYFSGNRLLPGSEPAAHCYCGHQFGNFAGQLGDGAAMYLGEVVNATGK 144
Query: 220 RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRD 279
RWE+QLKGAG TPYSR ADG VLRSSIREFLCSEAM LGIP+TRA VT V RD
Sbjct: 145 RWEIQLKGAGLTPYSRQADGRKVLRSSIREFLCSEAMSHLGIPSTRAGSCVTADSTVIRD 204
Query: 280 MFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQ---------EDLDIVRTLADYAIR 330
++YDGNPK+E +V R+A +FLRFGS++I + + DI + DY IR
Sbjct: 205 IYYDGNPKKEKCTVVSRIAPTFLRFGSFEIFKPTDEFTGRKGPSVDRNDIRIQMLDYVIR 264
Query: 331 HHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHG 390
+ I+ E H+ + + K AA+ E+ +RTA LVA+WQ VGF HG
Sbjct: 265 TFYPDIQ--------------EKHAGNN--TEKNAAFFREITKRTARLVAEWQCVGFCHG 308
Query: 391 VLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTT 450
VLNTDNMSI+GLTIDYGPFGF+D +DP + N +D G RY + QP+I WN+ + +
Sbjct: 309 VLNTDNMSIVGLTIDYGPFGFIDRYDPEYICNGSDNMG-RYAYNKQPEICKWNLGKLAEA 367
Query: 451 L-------AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLL 499
L + ++DD+ Y +F + Y M KKLGL + + ++S LL
Sbjct: 368 LIPELPLSISQSILDDE--------YDAEFQNHYMEKMRKKLGLVRLKLDDDSHLVSDLL 419
Query: 500 NNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLK 535
M + D+TN FR LS D + +D L + ++
Sbjct: 420 ETMNITGSDFTNTFRVLSKFSGDEAEIQDFLNIIIE 455
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 56/96 (58%), Gaps = 7/96 (7%)
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKAL-------MNSVNPKYVLRNYLCQSAI 592
D+ K+ K+ W W+ Y L S E+RKA M+S NP Y+LRNY+ Q+AI
Sbjct: 519 DLLKDNKKHWKEWLRKYSVRLEKERGSVEDRKAFHEEHVKTMDSNNPSYILRNYIAQNAI 578
Query: 593 DAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPP 628
D+AE GDF EV+R+L+++E PY E + A P
Sbjct: 579 DSAESGDFSEVKRVLQMLENPYQEGESCQSIADKSP 614
>gi|83749027|ref|ZP_00946034.1| Hypothetical cytosolic protein [Ralstonia solanacearum UW551]
gi|83724290|gb|EAP71461.1| Hypothetical cytosolic protein [Ralstonia solanacearum UW551]
Length = 529
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 214/537 (39%), Positives = 277/537 (51%), Gaps = 72/537 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L + P F G A + P A Y GH
Sbjct: 38 TRLPPLPMPASPYLVGFSPEAAAPLGLSRTGLDTPTGLDVFVGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+QLKGAG TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQLKGAGLTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRREAI-------ETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + + Y A EVA
Sbjct: 209 -NEKLPELRALADFVI---------------------DRFYPACRAEAQPYLALLREVAR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 RTAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNI----------------------AQFSTTLAAAKLIDDKEANYVMER--Y 469
A QP I WN+ S A ID +A ++ R Y
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLFGSRSDNGAAAFVDLSDEAQAQPAIDAAQAALLVYRDTY 365
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
G F Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 366 GAAFYARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTP 420
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
D ++ + D +++ +W+ Y + L + + D+ R M VNPKYVLRN+
Sbjct: 421 ADAQARTVRDLFFD-----RDSADTWLADYRRRLQAEPLPDDARAEAMRRVNPKYVLRNH 475
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + AI A+ DF EV L ++ RP+D+ PG E+YA P WA +SCSS
Sbjct: 476 LAEIAIRRAKEKDFSEVEHLRTVLARPFDDHPGFERYAGPAPDWA---ASLEVSCSS 529
>gi|330807153|ref|YP_004351615.1| hypothetical protein PSEBR_a466 [Pseudomonas brassicacearum subsp.
brassicacearum NFM421]
gi|327375261|gb|AEA66611.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp.
brassicacearum NFM421]
Length = 487
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 218/549 (39%), Positives = 299/549 (54%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K LE L +D+ F R GD T V P ++NP+LV S + L
Sbjct: 1 MKTLETLTFDNRFAR--LGD------------GLSTHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P F F G A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPVEAEAPLFAEIFGGHKLWAETEPRAMVYSGHQFGHYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H LGIPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALGIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R+A S +RFG ++ + +L LA++ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKKPELHA--ALAEHVLNLHFAECRE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
+ + Y + Y +M ++LGL + +++++ +LL M VDY+ FFR
Sbjct: 313 VEALRETLGLYLPLYQAHYLDLMRRRLGLTRAEEDDQKLLERLLQLMQNSGVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKALM 574
L + PE + + L+ +D+ + + +W YI + G I ++R+A M
Sbjct: 373 LGD-----QAPE-QAVASLRDDFVDL-----KGFDAWGELYIARVNREGAIDQDQRRARM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP YVLRNYL Q AIDAAE GD+ EVRRL ++ +P++EQPGM+ YA+ PP W
Sbjct: 422 HAVNPLYVLRNYLAQKAIDAAESGDYEEVRRLHTVLSKPFEEQPGMDSYAQRPPEWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|28897683|ref|NP_797288.1| hypothetical protein VP0909 [Vibrio parahaemolyticus RIMD 2210633]
gi|260364548|ref|ZP_05777157.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030]
gi|260879429|ref|ZP_05891784.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034]
gi|260895843|ref|ZP_05904339.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466]
gi|33517002|sp|Q87R88.1|Y909_VIBPA RecName: Full=UPF0061 protein VP0909
gi|28805896|dbj|BAC59172.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD
2210633]
gi|308086790|gb|EFO36485.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466]
gi|308093203|gb|EFO42898.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034]
gi|308115348|gb|EFO52888.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030]
Length = 489
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 199/519 (38%), Positives = 278/519 (53%), Gaps = 54/519 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGLPAAQ--NDELLAVFSGQSEFEPFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQNGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL +V + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRALGMVVSDTPVYRE-------KTEFGAMLIRMAETHVRFGHFEHL 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA E
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFGE 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVQKTADMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L + L++ ++ + ++ + ++ +M KLGL
Sbjct: 285 YAFEQQPRIALWNLSALAHAL--SPLVEREDLEQALSQFEGRLSQQFSRLMRSKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ + + + DYT FFRALSN+ P+ + + L I +E +
Sbjct: 343 IAEDGRLFESMFELLNQNHTDYTRFFRALSNLDKQPA---------QEVIDLFIDREAAQ 393
Query: 548 AWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
AW+ L+ + + + IS E+R M NPKY+LRNYL Q AID AE GDF EV
Sbjct: 394 AWLDLYLARCELEVDEIGEPISAEQRSEQMRQTNPKYILRNYLAQLAIDKAEEGDFSEVH 453
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +++ PYD QP E YA+LPP W + +SCSS
Sbjct: 454 RLAEILRHPYDSQPEFEAYAKLPPEWGKK---MEISCSS 489
>gi|207743083|ref|YP_002259475.1| hypothetical protein RSIPO_01250 [Ralstonia solanacearum IPO1609]
gi|206594480|emb|CAQ61407.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609]
Length = 537
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 214/537 (39%), Positives = 277/537 (51%), Gaps = 72/537 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L + P F G A + P A Y GH
Sbjct: 46 TRLPPLPMPASPYLVGFSPEAAAPLGLSRTGLDTPTGLDVFVGNAIAAWSDPLATVYSGH 105
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+QLKGAG TPYSR DG AVLRSSIREFLCS
Sbjct: 106 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQLKGAGLTPYSRMGDGRAVLRSSIREFLCS 164
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 165 EAMAGLGIPTTRALCVIGADAPVRREAI-------ETAAVVTRLAPSFVRFGHFEHFAA- 216
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + + Y A EVA
Sbjct: 217 -NEKLPELRALADFVI---------------------DRFYPACRAEAQPYLALLREVAR 254
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 255 RTAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 313
Query: 434 ANQPDIGLWNI----------------------AQFSTTLAAAKLIDDKEANYVMER--Y 469
A QP I WN+ S A ID +A ++ R Y
Sbjct: 314 AQQPQIAYWNLFCLAQALLPLFGSRSDNGAAAFVDLSDEAQAQPAIDAAQAALLVYRDTY 373
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
G F Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 374 GAAFYARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTP 428
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
D ++ + D +++ +W+ Y + L + + D+ R M VNPKYVLRN+
Sbjct: 429 ADAQARTVRDLFFD-----RDSADTWLADYRRRLQAEPLPDDARAEAMRRVNPKYVLRNH 483
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + AI A+ DF EV L ++ RP+D+ PG E+YA P WA +SCSS
Sbjct: 484 LAEIAIRRAKEKDFSEVEHLRTVLARPFDDHPGFERYAGPAPDWA---ASLEVSCSS 537
>gi|421897554|ref|ZP_16327922.1| conserved hypothetical protein [Ralstonia solanacearum MolK2]
gi|206588760|emb|CAQ35723.1| conserved hypothetical protein [Ralstonia solanacearum MolK2]
Length = 536
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 215/536 (40%), Positives = 278/536 (51%), Gaps = 71/536 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L + P F G A + P A Y GH
Sbjct: 46 TRLPPLPMPASPYLVGFSPEAAAPLGLSRAGLDTPAGLDVFVGNVIAAWSDPLATVYSGH 105
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+QLKGAG TPYSR DG AVLRSSIREFLCS
Sbjct: 106 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQLKGAGLTPYSRMGDGRAVLRSSIREFLCS 164
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 165 EAMAGLGIPTTRALCVIGADAPVRREAI-------ETAAVVTRLAPSFVRFGHFEHFAA- 216
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + + Y A EVA
Sbjct: 217 -NEKLPELRALADFVI---------------------DRFYPACRAEAQPYLALLREVAR 254
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 255 RTAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 313
Query: 434 ANQPDIGLWNIAQFSTTL---------------------AAAKLIDDKEANYVMER--YG 470
A QP I WN+ + L A ID +A ++ R YG
Sbjct: 314 AQQPQIAYWNLFCLAQALLPLFGSRSDNGAAFVDLSDEAQAQPAIDAAQAALLVYRDTYG 373
Query: 471 TKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
F Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 374 AAFYACYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTPA 428
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
D ++ + D +++ +W+ Y + L + + D+ R M VNPKYVLRN+L
Sbjct: 429 DAQARTVRDLFFD-----RDSADAWLADYRRRLQAEPLPDDARAEAMRRVNPKYVLRNHL 483
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI A+ DF EV L ++ RP+DE PG E+YA P WA +SCSS
Sbjct: 484 AEIAIRRAKEKDFSEVEHLRAVLARPFDEHPGFERYAGPAPDWA---ASLEVSCSS 536
>gi|423694983|ref|ZP_17669473.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
fluorescens Q8r1-96]
gi|388009400|gb|EIK70651.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
fluorescens Q8r1-96]
Length = 487
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 216/549 (39%), Positives = 298/549 (54%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K LE L +D+ F R GD T V P ++NP+LV S + L
Sbjct: 1 MKTLETLTFDNRFAR--LGD------------GLSTHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P F F G A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPVEAEAPLFAEIFGGHKLWAETEPRAMVYSGHQFGHYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H LGIPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALGIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R+A S +RFG ++ + +L LA++ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKKPELHA--ALAEHVLNLHFAECRE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
+ + Y + Y +M ++LGL + +++++ +LL M VDY+ FFR
Sbjct: 313 VEALRETLGLYLPLYQAHYLDLMRRRLGLTRAEEDDQKLLERLLQLMQNSGVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKALM 574
L + + +I L+ +D+ + + +W YI + G + ++R+A M
Sbjct: 373 LGDQAPEQAI------ATLRDDFVDL-----KGFDAWGELYIARVNRDGAVEQDQRRARM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP YVLRNYL Q AIDAAE GD+ EVRRL ++ +P++EQPGM+ YA+ PP W
Sbjct: 422 HAVNPLYVLRNYLAQKAIDAAESGDYEEVRRLHTVLSKPFEEQPGMDSYAQRPPEWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|444425239|ref|ZP_21220684.1| hypothetical protein B878_04816 [Vibrio campbellii CAIM 519 = NBRC
15631]
gi|444241527|gb|ELU53050.1| hypothetical protein B878_04816 [Vibrio campbellii CAIM 519 = NBRC
15631]
Length = 489
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 211/551 (38%), Positives = 291/551 (52%), Gaps = 68/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ E +N+ H F ELP A +T V+P ++N + V W+ A
Sbjct: 1 MSVWEGVNFTHRF-SELPS-------------AFFTYVTPQL-LDNTRWVVWNGEFAQQF 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L E E + F+G A P A Y GHQFG++ LGDGR + L E+ +
Sbjct: 46 GLPAAENE--ELLNVFAGQKEFAPFAPLAMKYAGHQFGVYNPDLGDGRGLLLAEMQHQDG 103
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+++ LKGAG TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++ + V R
Sbjct: 104 TWFDIHLKGAGLTPYSRMGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMDSDTPVYR 163
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K E GA++ RVA++ +RFG ++ Q L + LAD I HF
Sbjct: 164 E-------KTEYGALLIRVAETHIRFGHFEHFFYTNQ--LAEQKLLADKVIEWHFPECSQ 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ K YAA V E+TA ++A WQ GF HGV+NTDNMS
Sbjct: 215 VEKP---------------------YAAMFEFVVEKTAEMIAYWQAYGFAHGVMNTDNMS 253
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG T DYGPFGFLD +DP++ N +D G RY F QP I LWN++ + +L+ +
Sbjct: 254 ILGQTFDYGPFGFLDDYDPNYICNHSDYQG-RYAFEQQPRIALWNLSALAHSLSPLVQRE 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
D EA + ++ + ++ +M KLGL + ++ + + +K DYT FFR
Sbjct: 313 DLEA--ALGKFEVRLSQKFSELMRAKLGLHTKVDEDGRLFEAMFELLNQNKADYTRFFRE 370
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ-ELLSSG--ISDEERKA 572
LSN+ ++ P + L I +E AW+ L+ + E+ G +S + R
Sbjct: 371 LSNL---------DVKSPQAVIDLFIDREAASAWVDLYLARCELEVDECGERVSAQTRCE 421
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NPKY+LRNYL Q AID AE GDF EV RL +L++RPYDEQP + YA+LPP W
Sbjct: 422 KMRRTNPKYILRNYLAQIAIDKAEEGDFSEVNRLAELLKRPYDEQPEFDDYAKLPPEWGK 481
Query: 633 RPGVCMLSCSS 643
+ + SCSS
Sbjct: 482 KMEI---SCSS 489
>gi|146305595|ref|YP_001186060.1| hypothetical protein Pmen_0558 [Pseudomonas mendocina ymp]
gi|167013044|sp|A4XPR2.1|Y558_PSEMY RecName: Full=UPF0061 protein Pmen_0558
gi|145573796|gb|ABP83328.1| protein of unknown function UPF0061 [Pseudomonas mendocina ymp]
Length = 487
Score = 332 bits (851), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 217/550 (39%), Positives = 302/550 (54%), Gaps = 68/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L+ L +D+ F R GD A T+V P +E P+LV S L
Sbjct: 1 MKKLDQLTFDNRFAR--LGD------------AFSTEVLPEP-IEQPRLVVASSDAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E +R +F F+G A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLDPAEAQREEFAELFAGHKLWGEAEPRAMVYSGHQFGGYTPRLGDGRGLLLGEVVNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ T+ V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTTSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+A S +RFG ++ + +R E L + L ++ + +HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTRQHEQLKV---LGEHVLANHFPQC- 214
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
T DE + A EV ERTA+++A WQ GF HGV+NTDNM
Sbjct: 215 ----------LTQDE----------PWLAMFREVLERTAAMIAHWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +I
Sbjct: 255 SILGITFDYGPYAFLDDFDANHICNHSDDTG-RYSFSNQVPIAHWNLAALAQAL--TPMI 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDK-VDYTNFF 513
+ ++ +E + + Y +M K+LGL ++ ++ +LL M K DY+ FF
Sbjct: 312 EVEKLRETLELFLPLYQAHYLDLMRKRLGLTSAEDDDEALVQRLLQLMQQGKATDYSLFF 371
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P D L V ++ +D+ + +W Y+ G +ER+A
Sbjct: 372 RQLGE-----QAPADALQV-VRNDFVDLA-----GFDAWGRDYLARCEREGQQQDERRAR 420
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP Y+LRNYL Q I+AAE GD+G VR L ++ RP+DEQPGM++YA+ PP W
Sbjct: 421 MHAVNPLYILRNYLAQQVIEAAEAGDYGPVRELHAVLSRPFDEQPGMQRYAQRPPEWGKH 480
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 481 ---LEISCSS 487
>gi|388544653|ref|ZP_10147940.1| hypothetical protein PMM47T1_09706 [Pseudomonas sp. M47T1]
gi|388277350|gb|EIK96925.1| hypothetical protein PMM47T1_09706 [Pseudomonas sp. M47T1]
Length = 485
Score = 332 bits (851), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 218/548 (39%), Positives = 298/548 (54%), Gaps = 66/548 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++NP+LV S+ L
Sbjct: 1 MKALDELTFDNRFARL--GD------------AFSTSVLPDP-IDNPRLVVASDGAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L+P E P F FSG A A P A Y GHQFG ++ +LGDGR + LGE+ N
Sbjct: 46 DLEPTEAHSPVFAQLFSGHKLWAEAQPRAMVYSGHQFGGYSPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TPYSR DG AVLRSSIREFL SEA+H LGIPT+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGLTPYSRMGDGRAVLRSSIREFLASEALHALGIPTSRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+A S +RFG ++ Q + + L ++ + HF E
Sbjct: 166 E-------TQERAAMVLRLAPSHIRFGHFEYFYYTKQPEQ--AKVLGEHVLAMHFP--EC 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ + E Y A + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 215 LEQPE-------------------PYLAMFRAIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD +F N +D G RY F+NQ I WN++ + L I+
Sbjct: 256 ILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIAHWNLSALAQAL--TPFIN 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRA 515
+ +E + + Y +M ++LGL + +K +I LL M VDY F R
Sbjct: 313 VQALRETLELFLPLYEAHYLDLMRRRLGLAQGEDSDKALIEDLLRLMQNSSVDYNLFLRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L + P + + L+ +D ++ + W Y+Q L G D ER A M+
Sbjct: 373 LGDQ------PAAQAVATLRDDFID-----RDGFDHWSARYLQRLAVQG-DDPERTARMH 420
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP Y+LRNYL Q+AIDAA+ GD+ EVRRL ++ RP+DEQPGM+ YA+ PP W
Sbjct: 421 AVNPLYLLRNYLAQNAIDAAQQGDYEEVRRLHNVLTRPFDEQPGMQAYAQRPPEWGKH-- 478
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 479 -LEISCSS 485
>gi|424070379|ref|ZP_17807814.1| hypothetical protein Pav037_0491 [Pseudomonas syringae pv.
avellanae str. ISPaVe037]
gi|408000702|gb|EKG41049.1| hypothetical protein Pav037_0491 [Pseudomonas syringae pv.
avellanae str. ISPaVe037]
Length = 487
Score = 332 bits (851), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 221/551 (40%), Positives = 308/551 (55%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELIFDNRFAR--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFF 513
+EA + Y ++D +M ++LGL N +Q++S+LL M VDYT FF
Sbjct: 315 ALREAIGLFLPLYQAHYLD----LMRRRLGLTVANDQDEQLVSQLLKLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKA 572
R L + P DE L L+ +DI + + W +Y + L +++ER+
Sbjct: 371 RRLGDQ------PADEALRTLRDDFVDI-----KGFDGWAHAYQARIALEDNGTEQERQT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAIAAAEKGDYEEVRRLHQVLCTPFTEQPGMQGYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|320156984|ref|YP_004189363.1| hypothetical protein VVMO6_02138 [Vibrio vulnificus MO6-24/O]
gi|319932296|gb|ADV87160.1| selenoprotein O and cysteine-containing [Vibrio vulnificus
MO6-24/O]
Length = 490
Score = 332 bits (851), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 201/517 (38%), Positives = 282/517 (54%), Gaps = 53/517 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y V+P ++N + V W+ +A L PK + P FSGA P + P A Y G
Sbjct: 21 YRLVTPQP-LDNSRWVIWNGELAQGFAL-PKHADDPQLLAVFSGAEPFSAFKPLAMKYAG 78
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ LGDGR + L E+ N + + +++ LKGAG TP+SR DG AVLRS+IRE+LC
Sbjct: 79 HQFGVYNPDLGDGRGLLLAEMQNRQGQWFDIHLKGAGLTPFSRMGDGRAVLRSTIREYLC 138
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGI TTRAL ++ + V R+ + E GA + R+AQ+ +RFG ++
Sbjct: 139 SEAMAALGIETTRALGMMVSDTPVYRE-------QVEQGACLIRLAQTHIRFGHFEHFFY 191
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
E D +R LAD I + +K Y A +V
Sbjct: 192 --TEQYDELRLLADNVIEWYMPECTAHDKP---------------------YLAMFEQVV 228
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA+++AQWQ VGF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G RY
Sbjct: 229 ARTATMIAQWQAVGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-RYA 287
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F QP + LWN++ + L + LI+ + + +Y + +M +KLGL +
Sbjct: 288 FDQQPRVALWNLSALAHAL--SPLIERDDLELALAQYEPTLGKVFSQLMRQKLGLLSQQE 345
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ ++ + + +A + DYT FFR LS + ++ + +L V A W
Sbjct: 346 GDSELFNAMFALLAENHTDYTRFFRTLSQLDSEDAQTVIDLFVDRNAA---------RGW 396
Query: 550 ISWVLSYIQ-ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
+S L + E +SG S ++R M +VNPKY+LRNYL Q AID A+ GDF EV L
Sbjct: 397 LSRYLERVALEQTASGEAKSAQQRCEQMRAVNPKYILRNYLAQQAIDKAQQGDFSEVHTL 456
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
KL++ PYDEQ ME YA LPP W + ++SCSS
Sbjct: 457 AKLLKNPYDEQAEMEAYAHLPPEWGKK---MVISCSS 490
>gi|153837943|ref|ZP_01990610.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810]
gi|149748634|gb|EDM59493.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810]
Length = 489
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 199/519 (38%), Positives = 279/519 (53%), Gaps = 54/519 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L + + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGLPVAQ--NDELLVVFSGQSEFEPFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQNGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRALGMMVSDTPVYRE-------KTEFGAMLIRMAETHVRFGHFEHL 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA E
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFCE 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVQKTADMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L+ L++ ++ + ++ + ++ +M KLGL
Sbjct: 285 YAFEQQPRIALWNLSALAHALSP--LVEREDLEQALSQFEGRLSQQFSRLMRSKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ + + + DYT FFRALSN+ PS + + L I +E +
Sbjct: 343 IAEDGRLFESMFELLNQNHTDYTRFFRALSNLDKQPS---------QEVIDLFIDREAAQ 393
Query: 548 AWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
AW+ L+ + + + IS E+R M NPKY+LRNYL Q AID AE GDF EV
Sbjct: 394 AWLDLYLARCELEVDEIGEPISAEQRCEQMRQANPKYILRNYLAQLAIDKAEEGDFSEVH 453
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +++ PYD QP E YA+LPP W + + SCSS
Sbjct: 454 RLAEILRHPYDSQPEFEAYAKLPPEWGKKMEI---SCSS 489
>gi|339325679|ref|YP_004685372.1| hypothetical protein CNE_1c15480 [Cupriavidus necator N-1]
gi|338165836|gb|AEI76891.1| protein UPF061 [Cupriavidus necator N-1]
Length = 523
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 215/534 (40%), Positives = 287/534 (53%), Gaps = 72/534 (13%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P+ + +P LV + + A L D R DF F G A P A Y G
Sbjct: 39 FTRLRPT-PLPSPYLVGVAPAAAALLGWDANIGSREDFIETFVGNQVPDWADPLASVYSG 97
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI L + + WE+QLKGAG TPYSR ADG AVLRSSIRE+LC
Sbjct: 98 HQFGVWAGQLGDGRAIRLAQA-ETATGPWEVQLKGAGLTPYSRMADGRAVLRSSIREYLC 156
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LG+PTTRAL ++ + V R+ E A+V R++ +F+RFG ++ A+
Sbjct: 157 SEAMAALGVPTTRALSIMGSDAPVRRETI-------ETAAVVTRLSPTFIRFGHFEHFAA 209
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
+D+ +R LAD+ I + + + Y A EV+
Sbjct: 210 --HDDVAALRKLADFVIDNFMPACRD---------------------DTQPYQALLREVS 246
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA L+A WQ VGF HGV+NTDNMSILGLTIDYGPFGFLDAFD + N +D G RY
Sbjct: 247 LRTADLIAHWQAVGFCHGVMNTDNMSILGLTIDYGPFGFLDAFDANHICNHSDTQG-RYA 305
Query: 433 FANQPDIGLWN---IAQFSTTLAAAKLIDDKEA-------------NYVMERYGTKFMDE 476
++ QP + WN +AQ L A DKE + ERY F
Sbjct: 306 YSQQPQVAFWNLHCLAQALLPLWLAPEDADKEGARDAAVEAARAALDPFRERYAAAFFRH 365
Query: 477 YQAIMTKKLGL-------PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDE 529
Y+A KLGL K ++ +++ L + +VDYT F+R L + + + +
Sbjct: 366 YRA----KLGLRPPAGGDDKSDEPLLTSLFQLLHGQRVDYTLFWRKLCGISSTDAARD-- 419
Query: 530 LLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQ 589
P++ + LD + A+ +WV Y L + D R+ M +VNPKYVLRN+L +
Sbjct: 420 --APVRDLFLD-----RAAFDTWVADYRVRLRAEQSHDAARELEMLAVNPKYVLRNHLAE 472
Query: 590 SAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+AI A DF EV RLL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 473 TAIRHAGEKDFTEVDRLLAVLSRPFDEQPEAEHYAALPPDWA---SGLEVSCSS 523
>gi|71736351|ref|YP_272788.1| hypothetical protein PSPPH_0485 [Pseudomonas syringae pv.
phaseolicola 1448A]
gi|121957904|sp|Q48P81.1|Y485_PSE14 RecName: Full=UPF0061 protein PSPPH_0485
gi|71556904|gb|AAZ36115.1| SelO family protein [Pseudomonas syringae pv. phaseolicola 1448A]
Length = 487
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 216/549 (39%), Positives = 304/549 (55%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD + S+ E +E P+LV S+S L
Sbjct: 1 MKALDELIFDNRFAR--LGDAFSTSVLSE-------------PIETPRLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEAYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL + ++Q++S+LL M VDYT FFR
Sbjct: 313 VEALRETIGLFLPLYQAHYLDLMRRRLGLTIAQEQDEQLVSQLLKLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI-SDEERKALM 574
L + P E L L+ +DI + + W +Y+ + G +++ER+ M
Sbjct: 373 LGDQ------PAAEALRTLRDDFVDI-----KGFDGWAEAYLARIAGEGKGTEQERQTRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGME YA+ PP W
Sbjct: 422 HAVNPLYILRNYLAQNAIAAAEKGDYAEVRRLHQVLCTPFTEQPGMEGYAQRPPDWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|260902805|ref|ZP_05911200.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037]
gi|308108627|gb|EFO46167.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037]
Length = 489
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 199/519 (38%), Positives = 279/519 (53%), Gaps = 54/519 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L + + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGLPVAQ--NDELLVVFSGQSEFEPFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQNGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRALGMMVSDTPVYRE-------KTEFGAMLIRMAETHVRFGHFEHL 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA E
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFCE 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVQKTADMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L+ L++ ++ + ++ + ++ +M KLGL
Sbjct: 285 YAFEQQPRIALWNLSALAYALSP--LVEREDLEQALSQFEGRLSQQFSRLMRSKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ + + + DYT FFRALSN+ PS + + L I +E +
Sbjct: 343 IAEDGRLFESMFELLNQNHTDYTRFFRALSNLDKQPS---------QEVIDLFIDREAAQ 393
Query: 548 AWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
AW+ L+ + + + IS E+R M NPKY+LRNYL Q AID AE GDF EV
Sbjct: 394 AWLDLYLARCELEVDEIGEPISAEQRCEQMRQANPKYILRNYLAQLAIDKAEEGDFSEVH 453
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +++ PYD QP E YA+LPP W + + SCSS
Sbjct: 454 RLAEILRHPYDSQPEFEAYAKLPPEWGKKMEI---SCSS 489
>gi|90410397|ref|ZP_01218413.1| hypothetical protein P3TCK_20600 [Photobacterium profundum 3TCK]
gi|90328638|gb|EAS44922.1| hypothetical protein P3TCK_20600 [Photobacterium profundum 3TCK]
Length = 509
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 218/551 (39%), Positives = 296/551 (53%), Gaps = 48/551 (8%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L L +++++ ELP T IP+ + +P LV+ + VA+ L
Sbjct: 1 MKTLSQLVFNNTY-SELPTTFGTAVIPQPL--------------SDPFLVSVNPQVAEML 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
ELDP E + F F+G LAG P A Y GHQFG + LGDGR + LGE+L +
Sbjct: 46 ELDPLEAKTRLFIDTFTGNEELAGTTPLAMKYTGHQFGHYNPDLGDGRGLLLGEVLTSTN 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+W++ LKG+GKTPYSR DG AVLRSSIRE+L S A++ LGI TT AL L+ + V R
Sbjct: 106 TKWDIHLKGSGKTPYSRQGDGRAVLRSSIREYLGSAALNGLGIKTTHALALLGSTTLVFR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K E GA + RVA+S LRFG ++ Q ++ LADY I+HHF +
Sbjct: 166 E-------KMERGATLIRVAESHLRFGHFEYLFYTHQH--CELKLLADYLIKHHFPDL-- 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTS--NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
L+ G ED V N YA+ + + TA L+A WQ VGF HGV+NTDN
Sbjct: 215 ------LTTEGGQEDKQTVSANQHHNIYASMLTRIVKLTARLIAGWQSVGFAHGVMNTDN 268
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MS+LGLT DYGPFGFLD ++P + N +D G RY F QP I LWN++ L L
Sbjct: 269 MSVLGLTFDYGPFGFLDDYNPDYICNHSDYSG-RYAFNQQPSIALWNLSALGYALTP--L 325
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFF 513
ID ++ + +++ Y +Y A M KLGL + ++ + S L + VDYT FF
Sbjct: 326 IDKEDVDAILDSYHLTLQRDYSARMRNKLGLAEKREEDTVLFSSLFELLQSQMVDYTLFF 385
Query: 514 RALSNVKA-DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKA 572
R LS++ A D S+ +P D + W+ +Y L DE R
Sbjct: 386 RTLSSISATDLSVSA----LPNSIERFDDLFSCTQPLKKWLKAYAVRLNFEKDDDESRLE 441
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NPKY+LRNYL Q AID AE GDF + LL+++ P+DE P ++A PP W
Sbjct: 442 WMKQHNPKYILRNYLAQQAIDKAEDGDFAMIDELLQVLSSPFDEHPEFNQFADKPPYWGK 501
Query: 633 RPGVCMLSCSS 643
+ +SCSS
Sbjct: 502 K---LEISCSS 509
>gi|260773196|ref|ZP_05882112.1| UPF0061 domain-containing protein [Vibrio metschnikovii CIP 69.14]
gi|260612335|gb|EEX37538.1| UPF0061 domain-containing protein [Vibrio metschnikovii CIP 69.14]
Length = 489
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 207/523 (39%), Positives = 280/523 (53%), Gaps = 66/523 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQCY 190
Y +V P ++NPQ +AW+ A L ++PD L FSG P A Y
Sbjct: 21 YREVMPQP-LDNPQWIAWNAEFATQFGLP----DQPDQELLVCFSGLQMPESFKPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI +L E ++L LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGVLLAEITSLSGEVFDLHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGI TTRAL ++ + V R+ + E GA++ R++QS +RFG ++
Sbjct: 136 LCSEAMAGLGIATTRALGMMVSDTLVYRE-------QAEKGALLVRMSQSHVRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q ++ +R LAD I H+ + N YA W +
Sbjct: 189 FYTNQ--INELRLLADKVIEWHYPQCLQAD---------------------NPYADWFAQ 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V ERTA ++AQWQ VGF HGV+NTDNMSILG T DYGPFGFLD +D SF N +D G R
Sbjct: 226 VVERTAKMIAQWQAVGFAHGVMNTDNMSILGQTFDYGPFGFLDDYDSSFICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP IGLWN++ + L+ LID + + RY + +M +KLGL
Sbjct: 285 YAFNQQPRIGLWNLSALAHALSP--LIDRGDLEQALSRYEPLLNQYFSELMRQKLGLLTQ 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++ +L +A +VDYT F R LS + AD + ++D+ +R
Sbjct: 343 QPGDSELFDQLFTLLAKHRVDYTRFMRQLSCLDHAD------------EQSVIDLVADR- 389
Query: 547 EAWISWVLSYIQELLSSG------ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
+A W+ Y+Q +S +R A M NPKY+LRNYL Q AI+ AE GD+
Sbjct: 390 DAGQRWLEHYLQRCQQEKDAQGHLVSVSQRCATMRKHNPKYILRNYLAQIAIERAEQGDY 449
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E+ RL ++ P+ EQ E YA+LPP W +SCSS
Sbjct: 450 RELERLTNVLRDPFSEQVENEHYAQLPPDWG---KTLSISCSS 489
>gi|153834515|ref|ZP_01987182.1| conserved hypothetical protein [Vibrio harveyi HY01]
gi|148869101|gb|EDL68140.1| conserved hypothetical protein [Vibrio harveyi HY01]
Length = 489
Score = 331 bits (849), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 212/549 (38%), Positives = 290/549 (52%), Gaps = 72/549 (13%)
Query: 103 EDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDP 162
E +N+ H F ELP +T V+P ++N + V W+ A L
Sbjct: 5 EGVNFTHRF-SELPS-------------VFFTYVTPQL-LDNTRWVVWNGEFAQQFGLPA 49
Query: 163 KEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWE 222
E + FSG A P A Y GHQFG++ LGDGR + L E+ + ++
Sbjct: 50 TE--NDELLNVFSGQVDFAPFAPLAMKYAGHQFGVYNPDLGDGRGLLLAEMQHQDGTWFD 107
Query: 223 LQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFY 282
+ LKGAG TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++ + V R+
Sbjct: 108 IHLKGAGLTPYSRMGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMDSDTPVYRE--- 164
Query: 283 DGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKS 342
K E GA++ RVA++ +RFG ++ Q L + L D I HF
Sbjct: 165 ----KTEYGALLIRVAETHIRFGHFEHFFYTNQ--LAEQKLLTDKVIEWHF--------P 210
Query: 343 ESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGL 402
E L T YAA + E+TA ++A WQ GF HGV+NTDNMSILG
Sbjct: 211 ECLE-------------TEKPYAAMFESIVEKTAEMIAYWQAYGFAHGVMNTDNMSILGQ 257
Query: 403 TIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA 462
T DYGPFGFLD +DP++ N +D G RY F QP I LWN++ + +L+ +D EA
Sbjct: 258 TFDYGPFGFLDDYDPNYICNHSDYQG-RYAFEQQPRIALWNLSALAHSLSPLVQREDLEA 316
Query: 463 NYVMERYGTKFMDEYQAIMTKKLGLPKYNK-----QIISKLLNNMAVDKVDYTNFFRALS 517
+ ++ + ++ +M KLGL Y K ++ + + +K DYT FFR LS
Sbjct: 317 --ALGKFEVRLSQKFSELMRAKLGL--YTKVDEDGRLFEAMFELLNQNKADYTRFFRELS 372
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ-ELLSSG--ISDEERKALM 574
N+ + P + L I +E AW+ L+ + E+ G ++ + R M
Sbjct: 373 NLDVES---------PQAVIDLFIDREAASAWVDLYLARCELEVDEHGERVTVQLRCERM 423
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
VNPKY+LRNYL Q AID AE GDF EV RL +L++RPYDEQP ++YA+LPP W +
Sbjct: 424 RQVNPKYILRNYLAQLAIDKAEEGDFSEVNRLAELLKRPYDEQPEFDEYAKLPPEWGKK- 482
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 483 --MEISCSS 489
>gi|422631780|ref|ZP_16696961.1| hypothetical protein PSYPI_19296 [Pseudomonas syringae pv. pisi
str. 1704B]
gi|330941638|gb|EGH44418.1| hypothetical protein PSYPI_19296 [Pseudomonas syringae pv. pisi
str. 1704B]
Length = 487
Score = 331 bits (849), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 222/551 (40%), Positives = 307/551 (55%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELIFDNRFAR--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P + + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPGQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFF 513
+EA + Y T ++D +M ++LGL N +Q++S+LL M VDYT FF
Sbjct: 315 ALREAIGLFLPLYQTHYLD----LMRRRLGLTVANDQDEQLVSQLLKLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKA 572
R L P E L L+ +DI + + W +Y + L +++ER+A
Sbjct: 371 RRLGGQ------PAAEALRTLRDDFVDI-----KGFDGWAQAYQARIALEDNGTEQERQA 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAIAAAEKGDYEEVRRLHQVLCTPFTEQPGMQGYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|91227740|ref|ZP_01261967.1| hypothetical protein V12G01_00512 [Vibrio alginolyticus 12G01]
gi|91188387|gb|EAS74682.1| hypothetical protein V12G01_00512 [Vibrio alginolyticus 12G01]
Length = 489
Score = 331 bits (848), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 202/520 (38%), Positives = 283/520 (54%), Gaps = 56/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L P+E + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGL-PEE-QNDELLAVFSGLSEFEQFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + ++L LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQDGTWFDLHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LG+PTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGVPTTRALGMMVSDTPVYRE-------KTESGALLLRMAETHVRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFDA 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ +TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVTKTAEMIAYWQAFGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L+ L++ ++ + ++ ++ +M +KLGL
Sbjct: 285 YAFDQQPRIALWNLSALAHALSP--LVEREDLESSLSQFEVHLSQQFSRLMREKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK 546
+ ++ + + +K DYT FFR LSN+ PS +AV+ L + +E
Sbjct: 343 IAEDGRLFEAMFELLHQNKTDYTRFFRTLSNLDNAPS----------QAVIDLFLDREAA 392
Query: 547 EAWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW+ L+ + + L IS E+R M NPKY+LRNYL Q AID AE GDF E+
Sbjct: 393 RAWLDLYLARCELEVDELGGLISTEQRCKQMRQANPKYILRNYLAQLAIDKAEEGDFSEL 452
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +L++RP+DEQP + YA+LPP W + + SCSS
Sbjct: 453 HRLAELLKRPFDEQPEFDNYAKLPPEWGKKMEI---SCSS 489
>gi|254229913|ref|ZP_04923316.1| conserved hypothetical protein [Vibrio sp. Ex25]
gi|151937549|gb|EDN56404.1| conserved hypothetical protein [Vibrio sp. Ex25]
Length = 509
Score = 331 bits (848), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 204/520 (39%), Positives = 284/520 (54%), Gaps = 56/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L P E + + FSG + P A Y
Sbjct: 39 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGL-PAE-QSDELLAVFSGQSEFEPFRPLAMKY 95
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 96 AGHQFGVYNPDLGDGRGLLLAEIEHQDGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 155
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 156 LCSEAMVGLGIPTTRALGMMVSDTPVYRE-------KTEFGAMLIRMAETHVRFGHFEHF 208
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA E
Sbjct: 209 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFGE 245
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 246 IVQKTADMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 304
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L+ +D EA+ + ++ + ++ +M +KLGL
Sbjct: 305 YAFDQQPRIALWNLSALAHALSPLVEREDLEAS--LSQFEVRLSQQFSRLMREKLGLKTK 362
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK 546
+ ++ + + + DYT FFR LSN+ D S +AV+ L + +E
Sbjct: 363 IAEDGRLFEAMFELLHQNNTDYTRFFRTLSNLDTDSS----------QAVIDLFLDREAA 412
Query: 547 EAWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW+ L+ + + L IS E+R M NPKY+LRNYL Q AID AE GDF E+
Sbjct: 413 RAWLDLYLARCELEVDELGELISAEQRCEQMRQANPKYILRNYLAQLAIDKAEEGDFSEL 472
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +L++RP+DEQP + YA+LPP W + + SCSS
Sbjct: 473 HRLAELLKRPFDEQPEFDDYAKLPPEWGKKMEI---SCSS 509
>gi|422674926|ref|ZP_16734275.1| hypothetical protein PSYAR_19366 [Pseudomonas syringae pv. aceris
str. M302273]
gi|330972649|gb|EGH72715.1| hypothetical protein PSYAR_19366 [Pseudomonas syringae pv. aceris
str. M302273]
Length = 487
Score = 331 bits (848), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 221/551 (40%), Positives = 307/551 (55%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELIFDNRFAR--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTLHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L +D
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISVD 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFF 513
+EA + Y ++D +M ++LGL N +Q++S+LL M VDYT FF
Sbjct: 315 ALREAIGLFLPLYQAHYLD----LMRRRLGLTVANEQGEQLVSQLLKLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKA 572
R L + P E L L+ +DI + + W +Y + L +++ER+
Sbjct: 371 RRLGDQ------PAAEALRTLRDDFVDI-----KGFDGWAQAYQARIALEDNGTEQERQN 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAIAAAEKGDYAEVRRLHQVLCTPFTEQPGMQGYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|262394822|ref|YP_003286676.1| hypothetical protein VEA_004051 [Vibrio sp. Ex25]
gi|262338416|gb|ACY52211.1| UPF0061 domain-containing protein [Vibrio sp. Ex25]
Length = 489
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 204/520 (39%), Positives = 284/520 (54%), Gaps = 56/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L P E + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGL-PAE-QSDELLAVFSGQSEFEPFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQDGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMVGLGIPTTRALGMMVSDTPVYRE-------KTEFGAMLIRMAETHVRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA E
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFGE 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVQKTADMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L+ +D EA+ + ++ + ++ +M +KLGL
Sbjct: 285 YAFDQQPRIALWNLSALAHALSPLVEREDLEAS--LSQFEVRLSQQFSRLMREKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK 546
+ ++ + + + DYT FFR LSN+ D S +AV+ L + +E
Sbjct: 343 IAEDGRLFEAMFELLHQNNTDYTRFFRTLSNLDTDSS----------QAVIDLFLDREAA 392
Query: 547 EAWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW+ L+ + + L IS E+R M NPKY+LRNYL Q AID AE GDF E+
Sbjct: 393 RAWLDLYLARCELEVDELGELISAEQRCEQMRQANPKYILRNYLAQLAIDKAEEGDFSEL 452
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +L++RP+DEQP + YA+LPP W + + SCSS
Sbjct: 453 HRLAELLKRPFDEQPEFDDYAKLPPEWGKKMEI---SCSS 489
>gi|423097880|ref|ZP_17085676.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
fluorescens Q2-87]
gi|397884878|gb|EJL01361.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
fluorescens Q2-87]
Length = 487
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 214/549 (38%), Positives = 297/549 (54%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K LE L +D+ F R D VL P A ++NP+LV S + L
Sbjct: 1 MKTLETLTFDNRFAR------LGDGFSAHVL--------PEA-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P F F G A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAETPLFAEIFGGHKLWAETEPRAMVYSGHQFGHYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H LGIPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALGIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R++ S +RFG ++ + +L LA++ + HF
Sbjct: 166 E-------KQERAAMVLRLSPSHVRFGHFEYFYYTKKPELQA--ALAEHVLNLHFAECRE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ + Y F Y +M ++LGL + +++++ +LL M VDY+ FFR
Sbjct: 313 VEALRETLGLYLPLFQAHYLDLMRRRLGLTSAEEDDQKLLERLLQLMQNSGVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKALM 574
L + + ++ L+ +D+ + + +W Y+ + G + ++R+ M
Sbjct: 373 LGDQSPEQAV------ATLRDDFVDL-----KGFDAWGELYVARVNREGPVDQDQRRIRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP YVLRNYL Q AIDAAE GD+ EVRRL ++ RP++EQPGM+ YA+ PP W
Sbjct: 422 HAVNPLYVLRNYLAQKAIDAAESGDYDEVRRLHTVLSRPFEEQPGMDNYAQRPPEWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|432862552|ref|XP_004069912.1| PREDICTED: LOW QUALITY PROTEIN: selenoprotein O-like [Oryzias
latipes]
Length = 685
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 203/455 (44%), Positives = 260/455 (57%), Gaps = 51/455 (11%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
LE LN+++ +++LP DP +S R+V AC+++V P + NP+ VA S L L
Sbjct: 38 LERLNFENVVLKKLPVDPSEESGVRQVRGACFSRVKPQP-LTNPRFVAVSGEALSLLGLR 96
Query: 162 PKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL------ 214
+E P P + SG+ + G+ P A CY GHQFG +AGQLGDG A LGE+
Sbjct: 97 GREVLSDPLGPDYLSGSRVMPGSEPAAHCYCGHQFGQFAGQLGDGAACYLGEVRAPPGQD 156
Query: 215 -----NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCL 269
S RWE+Q+KGAG TPYSR ADG VLRSSIREFLCSEAM FLG+PTTRA +
Sbjct: 157 PEMLRENPSGRWEIQVKGAGLTPYSRQADGRKVLRSSIREFLCSEAMFFLGVPTTRAGSV 216
Query: 270 VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQEDL 318
VT+ V RD+FY G P+ E ++V R+A +FLRFGS++I S G E+
Sbjct: 217 VTSDSRVVRDVFYSGRPRHERCSVVLRIAPTFLRFGSFEIFKPADEFTGRQGPSYGHEE- 275
Query: 319 DIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASL 378
I + DY I + I+ + GD + A+ EV RTA L
Sbjct: 276 -IRGQMMDYVIGTFYPEIQQ---------NHGDR--------VERNVAFFREVMRRTARL 317
Query: 379 VAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPD 438
VAQWQ VGF HGVLNTDNMSILGLT+DYGPFGF+D FDP+F N +D GR Y + QP
Sbjct: 318 VAQWQCVGFCHGVLNTDNMSILGLTLDYGPFGFMDRFDPNFICNASDSSGR-YSYQAQPA 376
Query: 439 IGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK----QI 494
I WN+ + + LA D EA VM+ Y F Y A M KKLGL K + +
Sbjct: 377 ICRWNLVKLAEALAPEVPPDRAEA--VMDEYLDAFNSFYLANMRKKLGLLKKEEPEDAML 434
Query: 495 ISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDE 529
I++LL M D+TN FR+LS + P+ EDE
Sbjct: 435 ITELLQAMHNTGADFTNTFRSLSRISC-PAEGEDE 468
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 62/100 (62%), Gaps = 12/100 (12%)
Query: 544 ERKEAWISWVLSYIQELLSS--GISDE-----ERKALMNSVNPKYVLRNYLCQSAIDAAE 596
++ E W SW+ Y + L G SDE ER +M+ NP+ VLRNY+ Q+AI+AAE
Sbjct: 549 QQAEEWTSWIRLYRKRLALELEGQSDEQAVQEERARVMDGTNPRVVLRNYIAQNAIEAAE 608
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGV 636
GDF EV+R+L+++E+P+ QPG+E PAW + G
Sbjct: 609 KGDFSEVQRVLRVLEKPFSSQPGLEL-----PAWVHGGGA 643
>gi|398870845|ref|ZP_10626165.1| hypothetical protein PMI34_01352 [Pseudomonas sp. GM74]
gi|398207474|gb|EJM94223.1| hypothetical protein PMI34_01352 [Pseudomonas sp. GM74]
Length = 487
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 217/551 (39%), Positives = 301/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFDR--LGD------------AFSAHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P+F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAAAETPEFAELFSGHKLWADAIPRAMVYSGHQFGFYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R++ S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMLLRLSPSHVRFGHFEYFYYTKRPEQQ----KELGDHVLAMHFP-- 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLITAEDDDQKLLENLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + + +I L+ +D+ + + +W Y+ + G D E+R+
Sbjct: 371 RRLGDEAPEQAIAR------LRDDFIDL-----KGFDAWGELYVARVAREGTLDQEQRRQ 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAA+ GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAQSGDYTEVRRLHAVLSNPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|83405179|gb|AAI10867.1| Selenoprotein O [Homo sapiens]
Length = 669
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 199/446 (44%), Positives = 253/446 (56%), Gaps = 40/446 (8%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+T+V P+ + P+LVA SE
Sbjct: 45 LAGLRFDNRALRALPVEAPPPGPEGAPSAPRPVPGACFTRVQPTP-LRQPRLVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L GA P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 104 ALLGLGAPPAREAEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LG+PTTRA VT+
Sbjct: 164 CTANGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSE 223
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V RVA +F+RFGS++I H R + DI L
Sbjct: 224 STVVRDVFYDGNPKYEQCTVVLRVASTFIRFGSFEIFKSADEHTGRAGPSVGRNDIRVQL 283
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S+S+ + AA+ EV RTA +VA+WQ
Sbjct: 284 LDYVISSFYPEIQAAHASDSV----------------QRNAAFFREVTRRTARMVAEWQC 327
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 328 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCRWNL 386
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLN 500
+ + L ++ EA + E + +F Y M +KLGL + + ++SKLL
Sbjct: 387 RKLAEALQPELPLELGEA-ILAEEFDAEFQRHYLQKMRRKLGLVQVELEEDGALVSKLLE 445
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ + P
Sbjct: 446 TMHLTGADFTNTFYLLSSFPVELESP 471
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 54/106 (50%), Gaps = 23/106 (21%)
Query: 549 WISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
W W+ +Y L G D E +M++ NPKYVLRNY+ Q+AI+AAE GDF
Sbjct: 555 WADWLQAYRARLDKDLEGAGDAAAWQAEHVRVMHANNPKYVLRNYIAQNAIEAAERGDFS 614
Query: 602 EVRRLLKLMERPYDEQPG----------------MEKYARLPPAWA 631
EVRR+LKL+E PY + G Y+ PP WA
Sbjct: 615 EVRRVLKLLETPYHCEAGAATDAEATEADGADGRQRSYSSKPPLWA 660
>gi|156973707|ref|YP_001444614.1| hypothetical protein VIBHAR_01411 [Vibrio harveyi ATCC BAA-1116]
gi|166231362|sp|A7MV92.1|Y1411_VIBHB RecName: Full=UPF0061 protein VIBHAR_01411
gi|156525301|gb|ABU70387.1| hypothetical protein VIBHAR_01411 [Vibrio harveyi ATCC BAA-1116]
Length = 489
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 209/551 (37%), Positives = 289/551 (52%), Gaps = 68/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ E +N+ H F ELP A +T V+P ++N + V W+ A
Sbjct: 1 MSVWEGVNFTHRF-SELPS-------------AFFTYVTPQL-LDNTRWVVWNGEFAQQF 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L E E + F+G A P A Y GHQFG++ LGDGR + L E+ +
Sbjct: 46 GLPAAENE--ELLNVFAGQKEFAPFAPLAMKYAGHQFGVYNPDLGDGRGLLLAEMQHQDG 103
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+++ LKGAG TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++ + V R
Sbjct: 104 TWFDIHLKGAGLTPYSRMGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMDSDTPVYR 163
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K E GA++ RVA++ +RFG ++ Q L + LAD I HF
Sbjct: 164 E-------KTEYGALLIRVAETHIRFGHFEHFFYTNQ--LAEQKLLADKVIEWHFPECSE 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
K YAA + E+TA ++A WQ GF HGV+NTDNMS
Sbjct: 215 AEKP---------------------YAAMFESIVEKTAEMIAYWQAYGFAHGVMNTDNMS 253
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG T DYGPFGFLD +DP++ N +D G RY F QP I LWN++ + +L+ +
Sbjct: 254 ILGQTFDYGPFGFLDDYDPNYICNHSDYQG-RYAFEQQPRIALWNLSALAHSLSPLVQRE 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
D EA + ++ + ++ +M KLGL + ++ + + +K DYT FFR
Sbjct: 313 DLEA--ALGKFEVRLSQKFSELMRAKLGLHTKVDEDGRLFEAMFELLNQNKADYTRFFRE 370
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ-ELLSSG--ISDEERKA 572
LSN+ ++ P + L I +E AW+ L+ + E+ G +S + R
Sbjct: 371 LSNL---------DVKSPQAVIDLFIDREAASAWVDLYLARCELEVDECGERVSAQTRCE 421
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M NPKY+LRNYL Q AID AE GDF EV RL +L++RPYDEQP + Y +LPP W
Sbjct: 422 KMRRTNPKYILRNYLAQIAIDKAEEGDFSEVNRLAELLKRPYDEQPEFDDYTKLPPEWGK 481
Query: 633 RPGVCMLSCSS 643
+ +SCSS
Sbjct: 482 K---MEISCSS 489
>gi|402700189|ref|ZP_10848168.1| hypothetical protein PfraA_10191 [Pseudomonas fragi A22]
Length = 487
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 212/549 (38%), Positives = 300/549 (54%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S++ L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-IDAPRLVVASDAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + P F F G T A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLDPAVAQDPVFARLFGGHTLWADAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TP+SR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGMTPWSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R+AQS +RFG ++ Q + + L ++ + HF E
Sbjct: 166 E-------KQERAAMVLRLAQSHIRFGHFEYFYYTKQPEQQ--KQLGEHVLALHF--PEC 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ + E Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 215 LEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDANFICNHSDHEG-RYSFSNQVPIGQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL + ++++I LL M +DY+ FFR
Sbjct: 313 VEALRETLGLFLPLYQAHYTDLMRRRLGLTSAEEGDQKLIETLLQRMQGSAIDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE-RKALM 574
L + P + + L+ +D+ + + W Y+ + G++D+ R+ M
Sbjct: 373 LGDE------PAAQAVARLRDEFVDL-----KGFDEWAAQYVDRVARDGVNDQHARRERM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
+ VNP Y+LRNYL Q AIDAAE GD+ EVRRL +++ +P+ EQPGM+ YA PP W
Sbjct: 422 HGVNPLYILRNYLAQKAIDAAEAGDYSEVRRLHQVLTQPFTEQPGMQGYAERPPEWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|66043761|ref|YP_233602.1| hypothetical protein Psyr_0494 [Pseudomonas syringae pv. syringae
B728a]
gi|75503690|sp|Q4ZZ58.1|Y494_PSEU2 RecName: Full=UPF0061 protein Psyr_0494
gi|63254468|gb|AAY35564.1| Protein of unknown function UPF0061 [Pseudomonas syringae pv.
syringae B728a]
Length = 487
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 219/551 (39%), Positives = 308/551 (55%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELIFDNRFAR--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SE +H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEVLHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L +D
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISVD 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFF 513
+EA + Y ++D +M ++LGL + ++Q++S+LL M VDYT FF
Sbjct: 315 ALREAIGLFLPLYQAHYLD----LMRRRLGLTVAHEQDEQLVSQLLKLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKA 572
R L + P E L L+ +DI + + W +Y + L + +++ER+
Sbjct: 371 RRLGDQ------PAAEALRTLRDDFVDI-----KGFDGWAQAYQARIALENNGTEQERQT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAIAAAEKGDYAEVRRLHQVLCTPFTEQPGMQGYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|32880229|ref|NP_113642.1| selenoprotein O [Homo sapiens]
gi|172045770|sp|Q9BVL4.3|SELO_HUMAN RecName: Full=Selenoprotein O; Short=SelO
gi|32492907|gb|AAP85540.1| selenoprotein O [Homo sapiens]
Length = 669
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 199/446 (44%), Positives = 253/446 (56%), Gaps = 40/446 (8%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+T+V P+ + P+LVA SE
Sbjct: 45 LAGLRFDNRALRALPVEAPPPGPEGAPSAPRPVPGACFTRVQPTP-LRQPRLVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L GA P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 104 ALLGLGAPPAREAEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LG+PTTRA VT+
Sbjct: 164 CTATGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSE 223
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V RVA +F+RFGS++I H R + DI L
Sbjct: 224 STVVRDVFYDGNPKYEQCTVVLRVASTFIRFGSFEIFKSADEHTGRAGPSVGRNDIRVQL 283
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S+S+ + AA+ EV RTA +VA+WQ
Sbjct: 284 LDYVISSFYPEIQAAHASDSV----------------QRNAAFFREVTRRTARMVAEWQC 327
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 328 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCRWNL 386
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLN 500
+ + L ++ EA + E + +F Y M +KLGL + + ++SKLL
Sbjct: 387 RKLAEALQPELPLELGEA-ILAEEFDAEFQRHYLQKMRRKLGLVQVELEEDGALVSKLLE 445
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ + P
Sbjct: 446 TMHLTGADFTNTFYLLSSFPVELESP 471
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 54/106 (50%), Gaps = 23/106 (21%)
Query: 549 WISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
W W+ +Y L G D E +M++ NPKYVLRNY+ Q+AI+AAE GDF
Sbjct: 555 WADWLQAYRARLDKDLEGAGDAAAWQAEHVRVMHANNPKYVLRNYIAQNAIEAAERGDFS 614
Query: 602 EVRRLLKLMERPYDEQPG----------------MEKYARLPPAWA 631
EVRR+LKL+E PY + G Y+ PP WA
Sbjct: 615 EVRRVLKLLETPYHCEAGAATDAEATEADGADGRQRSYSSKPPLWA 660
>gi|334706298|ref|ZP_08522164.1| YdiU family protein [Aeromonas caviae Ae398]
Length = 475
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 203/469 (43%), Positives = 256/469 (54%), Gaps = 55/469 (11%)
Query: 179 PLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFAD 238
PL G P AQ Y GHQFG ++ +LGDGRA+ LGE L RW+L LKGAGKTP+SRF D
Sbjct: 58 PLPGMQPVAQVYAGHQFGGYSPRLGDGRALLLGEQLAPDGGRWDLHLKGAGKTPFSRFGD 117
Query: 239 GLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVA 298
G AVLRSSIRE+L SEA+H LGIPTTRAL L+ + + V R+ + E GA V R A
Sbjct: 118 GRAVLRSSIREYLASEALHALGIPTTRALVLLGSDEPVYRE-------QVESGATVLRTA 170
Query: 299 QSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD 358
S LRFG ++ A GQ + + L DYA +HF +
Sbjct: 171 PSHLRFGHFEYFAWSGQG--EKIPALIDYARCYHF-----------------------PE 205
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
LT A EV RTA L+A+WQ GF HGV+NTDNMS+LGLT+DYGP+GF+DA+ P
Sbjct: 206 LTDG--AELFAEVVRRTARLIAKWQAAGFCHGVMNTDNMSLLGLTLDYGPYGFIDAYVPD 263
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ 478
F N +D P RY QP +G WN+ + + LA +D + +Y + M Y
Sbjct: 264 FVCNHSD-PDGRYALDQQPAVGYWNLQKLAQALAGH--MDGDALASALAQYEHQLMLHYS 320
Query: 479 AIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL-LVPL 534
+M KLGL + ++ + +L +A KVDY F R L V + + P L L+P
Sbjct: 321 ELMRAKLGLAVWEEEDPALFRELFRLLAAHKVDYHLFLRRLGEVTREGAWPASLLALLPD 380
Query: 535 KAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
AV W W+ +Y L G D RK M++VNPKYVLRN L Q I+A
Sbjct: 381 SAV-----------WQGWLEAYRARLTREGSVDGVRKGQMDAVNPKYVLRNALAQQVIEA 429
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AE GD RL ++ PYDEQP E A P W Y G LSCSS
Sbjct: 430 AEQGDMAPFERLFAALQHPYDEQPEYEDLATPHPGW-YCGG--ELSCSS 475
>gi|422321783|ref|ZP_16402828.1| hypothetical protein HMPREF0005_02056 [Achromobacter xylosoxidans
C54]
gi|317403322|gb|EFV83836.1| hypothetical protein HMPREF0005_02056 [Achromobacter xylosoxidans
C54]
Length = 495
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 214/517 (41%), Positives = 283/517 (54%), Gaps = 46/517 (8%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT+++P + P+L+ + A + LDP P+F FSGA PL G A Y
Sbjct: 21 AFYTRLAPQ-PLNQPRLLHANADAAALIGLDPSALRTPEFLRVFSGAEPLPGGDTLAAVY 79
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGEI WELQLKG+G TPYSR DG AVLRSS+RE+
Sbjct: 80 SGHQFGVWAGQLGDGRAHLLGEIQG-PGGAWELQLKGSGLTPYSRMGDGRAVLRSSVREY 138
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH LGIPTTRAL LV + V R+ E AIV R++ SF+RFGS++
Sbjct: 139 LASEAMHGLGIPTTRALALVASDDPVWRETV-------ETAAIVTRMSPSFVRFGSFEHW 191
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+SR Q D+ +RTLADY I ++ E DE V L E
Sbjct: 192 SSRRQPDM--LRTLADYVIDRYYPECRAAPAGEPQ-----DEAAPYVGLLR--------E 236
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF+D F N +D G R
Sbjct: 237 VTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGFMDGFRLGHVCNHSDSEG-R 295
Query: 431 YCFANQPDIGLWNIAQFSTTL-AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP + LWN+ + +L A A +D A V++ + F + M K+GL
Sbjct: 296 YSWNRQPSVALWNLYRLGGSLHALAPDVDGLRA--VLDEFEGVFTRAFHDRMGAKMGLAA 353
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++ ++ LL M ++ D+T +R L++ + P +L I +
Sbjct: 354 WRPADEPLLDDLLKLMDANQADFTLTWRRLADAVSGDRAPFQDLF---------IDRAAA 404
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
AW+ +L+ + G E MN VNP YVLRN+L + AI AA+ GD E+ L
Sbjct: 405 SAWLDRLLARHAQ---DGRPAAEVAEAMNRVNPLYVLRNHLAEEAIRAAKAGDASEIDTL 461
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ L+ P+ + G EKYA LPP WA +SCSS
Sbjct: 462 MTLLRAPFTARVGYEKYAGLPPDWA---NGIEVSCSS 495
>gi|119593912|gb|EAW73506.1| selenoprotein O [Homo sapiens]
Length = 666
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 199/446 (44%), Positives = 253/446 (56%), Gaps = 40/446 (8%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+T+V P+ + P+LVA SE
Sbjct: 45 LAGLRFDNRALRALPVEAPPPGPEGAPSAPRPVPGACFTRVQPTP-LRQPRLVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L GA P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 104 ALLGLGAPPAREAEAEAALFFSGNALLPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LG+PTTRA VT+
Sbjct: 164 CTATGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSE 223
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V RVA +F+RFGS++I H R + DI L
Sbjct: 224 STVVRDVFYDGNPKYEQCTVVLRVASTFIRFGSFEIFKSADEHTGRAGPSVGRNDIRVQL 283
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S+S+ + AA+ EV RTA +VA+WQ
Sbjct: 284 LDYVISSFYPEIQAAHASDSV----------------QRNAAFFREVTRRTARMVAEWQC 327
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 328 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCRWNL 386
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLN 500
+ + L ++ EA + E + +F Y M +KLGL + + ++SKLL
Sbjct: 387 RKLAEALQPELPLELGEA-ILAEEFDAEFQRHYLQKMRRKLGLVQVELEEDGALVSKLLE 445
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ + P
Sbjct: 446 TMHLTGADFTNTFYLLSSFPVELESP 471
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 54/106 (50%), Gaps = 23/106 (21%)
Query: 549 WISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
W W+ +Y L G D E +M++ NPKYVLRNY+ Q+AI+AAE GDF
Sbjct: 555 WADWLQAYRARLDKDLEGAGDAAAWQAEHVRVMHANNPKYVLRNYIAQNAIEAAERGDFS 614
Query: 602 EVRRLLKLMERPYDEQPG----------------MEKYARLPPAWA 631
EVRR+LKL+E PY + G Y+ PP WA
Sbjct: 615 EVRRVLKLLETPYHCEAGAATDAEATEADGADGRQRSYSSKPPLWA 660
>gi|398996574|ref|ZP_10699427.1| hypothetical protein PMI22_04059 [Pseudomonas sp. GM21]
gi|398126445|gb|EJM15880.1| hypothetical protein PMI22_04059 [Pseudomonas sp. GM21]
Length = 487
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 213/551 (38%), Positives = 300/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R D+ VL ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFAR------LGDTFSAHVL---------PEPIDNPRLVVASPAAMKLL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + P+F FSG A AVP A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVAQTPEFAELFSGHKLWADAVPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKRPEKQ----KELGDHVLAMHFP-- 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A E+ ER A L+A+WQ GF HGV+N+DN
Sbjct: 213 ECLEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNSDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y + Y +M ++LG +++++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLYQAHYLDLMRRRLGFTTAEDDDQKLLEHLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + + ++ V L+ +DI + + +W Y+ + G+ D ++R+
Sbjct: 371 RRLGDESPELAV------VRLRDDFVDI-----KGFDAWGELYVARVAREGVVDQQQRRT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGM+ YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSNPFEEQPGMDTYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|402884645|ref|XP_003905786.1| PREDICTED: selenoprotein O-like [Papio anubis]
Length = 666
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 195/446 (43%), Positives = 253/446 (56%), Gaps = 40/446 (8%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR+V AC+T+V P+ + P++VA SE
Sbjct: 45 LAGLRFDNRALRALPVEAPPPGPEGAQSAPRQVPGACFTRVRPTP-LRQPRVVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L G P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 104 ALLGLGAPPAREAEAEAALFFSGNALLPGTEPAAHCYCGHQFGQFAGQLGDGAAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LG+PTTRA VT+
Sbjct: 164 CTAAGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSE 223
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V R+A +F+RFGS++I H R + DI L
Sbjct: 224 STVVRDVFYDGNPKYEQCTVVLRIASTFIRFGSFEIFKSADEHTGRAGPSVGRNDIRVQL 283
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S+ + + AA+ EV RTA +VA+WQ
Sbjct: 284 LDYVISSFYPEIQAAHASDRV----------------QRNAAFFQEVTRRTAWMVAEWQC 327
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 328 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCRWNL 386
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLN 500
+ + L ++ EA + E + +F Y M +KLGL + ++ ++SKLL
Sbjct: 387 QKLAEALQPELPLELGEA-ILAEEFDAEFQRHYMQKMRRKLGLVQLELEEDRALVSKLLE 445
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ + P
Sbjct: 446 TMHLTGADFTNTFFLLSSFPVELESP 471
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 59/117 (50%), Gaps = 23/117 (19%)
Query: 538 LLDIGKERKEAWISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQS 590
+ ++ + W W+ +Y L G D ER +M++ NPKYVLRNY+ Q+
Sbjct: 544 MAELQSRNQSHWADWLQAYRARLDKDLEGAGDAAAWQAERVRVMHANNPKYVLRNYIAQN 603
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPG----------------MEKYARLPPAWA 631
AI+AAE GDF EVRR+LKL+E PY + G Y+ PP WA
Sbjct: 604 AIEAAERGDFSEVRRVLKLLENPYHCEAGAATDPEATEADGADGRQRSYSSKPPLWA 660
>gi|419796616|ref|ZP_14322147.1| uncharacterized ACR protein, YdiU/UPF0061 family [Neisseria sicca
VK64]
gi|385699316|gb|EIG29622.1| uncharacterized ACR protein, YdiU/UPF0061 family [Neisseria sicca
VK64]
Length = 489
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 208/515 (40%), Positives = 277/515 (53%), Gaps = 48/515 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y++VSP + P VA++ +A L LD +F+ + SG P P A Y G
Sbjct: 19 YSRVSPEP-LTAPYWVAFNTDLAAELNLD-TDFQTTANLAYLSGNAPQYAPAPIASVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ +LGDGRAI +G+ ++ +R E QLKGAGKTPYSRFADG AVLRSSIRE+LC
Sbjct: 77 HQFGVYTPRLGDGRAILIGDSVDAAGQRQEWQLKGAGKTPYSRFADGRAVLRSSIREYLC 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + V R+ E A++ R+A +FLRFG ++
Sbjct: 137 SEAMHGLGIPTTRALALCGSNDPVYRETV-------ETAAVLTRIAPNFLRFGHFEYFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
G+E ++ LADY IRH++ + + N YAA ++
Sbjct: 190 TGREAE--IQQLADYLIRHYYPDCRDAD---------------------NPYAALLEQIR 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA VA WQ VGF HGV+NTDNMS LGLTIDYGPFGFLD +D N +D G RY
Sbjct: 227 NRTADTVAAWQSVGFCHGVMNTDNMSALGLTIDYGPFGFLDDYDRRHVCNHSDTQG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
+ QP + WN + ++ A L+ +++ + F Y M +KLGL + +K
Sbjct: 286 YNAQPFVAHWNFSALASCFDA--LVPHNTLEQLIDGWTEVFQTTYLEKMRRKLGLQQADK 343
Query: 493 Q----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ +I+ L + K D+T FFR LS V S E L P G A
Sbjct: 344 RDDESLIADLFTALQDQKTDFTLFFRNLSEV----SNTHGEPLPPKLEQTFKNGV--PPA 397
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+I W+ Y Q L + + ER MN NP Y+LRNYL + AI A GD+ E+ RL +
Sbjct: 398 FIRWLGRYRQRLRAENSNPAERAIRMNLTNPLYILRNYLAEQAIAQARNGDYREIERLRR 457
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RP+DEQ A PP + VC+ SCSS
Sbjct: 458 CLARPFDEQAEFADLAEPPPEGSI--PVCV-SCSS 489
>gi|390458938|ref|XP_003732203.1| PREDICTED: selenoprotein O [Callithrix jacchus]
Length = 665
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 197/446 (44%), Positives = 254/446 (56%), Gaps = 41/446 (9%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G + PR V AC+T+V P+ + P+LVA SE
Sbjct: 45 LAGLRFDNRALRALPVETPPAGPEGASTTPRLVPGACFTRVRPTP-LRQPRLVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L GA P A CY GHQFG +AGQLGDG A+ LGE+
Sbjct: 104 ALLGLGAPPAPEAEAEAALFFSGNALLPGAEPAAHCYCGHQFGHFAGQLGDGAAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR DG VLRSSIREFLCSEAM LG+PTTRA VT+
Sbjct: 164 CTAAGERWELQLKGAGPTPFSR-PDGRKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSE 222
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V R+A +F+RFGS++I H+ R + DI L
Sbjct: 223 STVARDVFYDGNPKYEKCTVVLRIASTFIRFGSFEIFKSTDEHSGRAGPSVGRNDIRVQL 282
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S+S+ + AA+ EV RTA +VA+WQ
Sbjct: 283 LDYVIGSFYPEIQAAHASDSV----------------QRNAAFFREVTRRTARMVAEWQC 326
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 327 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCKWNL 385
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLN 500
+ + L ++ EA + E + +F Y M +KLGL + + ++S+LL
Sbjct: 386 QKLAEALQPELPLELGEA-ILAEEFDAEFQRHYLQKMRRKLGLVQVELEEDGALVSRLLQ 444
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ DP P
Sbjct: 445 TMHLTGADFTNTFYLLSSFLVDPESP 470
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 54/106 (50%), Gaps = 23/106 (21%)
Query: 549 WISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
W +W+ Y L G D ER +M + NPKYVLRNY+ Q+AI+AAE GDF
Sbjct: 554 WATWLQEYRARLDKDLEGAGDAAAWQAERVRVMRASNPKYVLRNYIAQNAIEAAERGDFS 613
Query: 602 EVRRLLKLMERPYDEQP----------------GMEKYARLPPAWA 631
EVR++LKL+E PY + G Y+ PP WA
Sbjct: 614 EVRQVLKLLETPYQCEAGTATEPEATEARGATGGQHSYSSKPPLWA 659
>gi|423686545|ref|ZP_17661353.1| hypothetical protein VFSR5_1868 [Vibrio fischeri SR5]
gi|371494613|gb|EHN70211.1| hypothetical protein VFSR5_1868 [Vibrio fischeri SR5]
Length = 485
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 207/538 (38%), Positives = 287/538 (53%), Gaps = 58/538 (10%)
Query: 110 SFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPD 169
SF L R +PR +T V P+ ++N + + W+ +A +L +
Sbjct: 2 SFWNSLSITTRYSRLPR----CFFTYVQPTP-LDNSRWLIWNSELAKQFDLPENVHNHSE 56
Query: 170 FPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAG 229
FSG + P A Y GHQFG + LGDGR + L EI + K ++L LKGAG
Sbjct: 57 LLDAFSGEVVPSVFAPLAMKYAGHQFGSYNPDLGDGRGLLLAEIKDKKGNSFDLHLKGAG 116
Query: 230 KTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEE 289
TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++T+ V R+ + E
Sbjct: 117 LTPYSRSGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMTSDTPVFREGY-------E 169
Query: 290 PGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
GA++ R+A++ +RFG ++ + S E+L + LAD I HF +K
Sbjct: 170 TGALLIRMAETHIRFGHFEHLFYSNLLEEL---KLLADKVIEWHFPCCLGEDKP------ 220
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGP 408
Y A + +RTA ++AQWQ VGF HGV+NTDNMSI+G T DYGP
Sbjct: 221 ---------------YLAMFNNIVDRTAYMIAQWQAVGFAHGVMNTDNMSIIGQTFDYGP 265
Query: 409 FGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMER 468
FGFLD ++P + N +D G RY F QP IGLWN++ + +L+ LID + +E+
Sbjct: 266 FGFLDDYEPGYICNHSDYQG-RYAFNQQPRIGLWNLSALAHSLSP--LIDKPDLEKALEQ 322
Query: 469 YGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
Y K D + +M KKLGL + + ++ + ++ + VDYT F RALS++ +
Sbjct: 323 YEIKLHDYFSQLMRKKLGLLSKQEGDTRLFESMFELLSQNTVDYTRFMRALSDLDSQD-- 380
Query: 526 PEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRN 585
K ++D+ +R EA W Y+ S + R + M VNPKYVLRN
Sbjct: 381 ---------KQTVIDLFVDR-EAATLWTDLYLTRCKLEADSFDMRCSKMRKVNPKYVLRN 430
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
YL Q AI A GDF +V+ L L+ P+DE P E+YA LPP W R + SCSS
Sbjct: 431 YLAQQAIVKANEGDFSDVKILSTLLASPFDEHPDFERYAELPPEWGKRMEI---SCSS 485
>gi|416019138|ref|ZP_11566031.1| hypothetical protein PsgB076_24274 [Pseudomonas syringae pv.
glycinea str. B076]
gi|416024016|ref|ZP_11568195.1| hypothetical protein PsgRace4_06158 [Pseudomonas syringae pv.
glycinea str. race 4]
gi|320321966|gb|EFW78062.1| hypothetical protein PsgB076_24274 [Pseudomonas syringae pv.
glycinea str. B076]
gi|320330930|gb|EFW86904.1| hypothetical protein PsgRace4_06158 [Pseudomonas syringae pv.
glycinea str. race 4]
Length = 487
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 217/549 (39%), Positives = 304/549 (55%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTSVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL + ++Q++S+LL M VDYT FFR
Sbjct: 313 VEALRETIGLFLPLYQAHYLDLMRRRLGLTVAQEQDEQLVSQLLKLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI-SDEERKALM 574
L + P E L L+ +DI + + +W +Y + +++ER+ M
Sbjct: 373 LGDQ------PAAEALRTLRDDFVDI-----KGFDAWAEAYQARIAGEDKGTEQERQTRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGME YA+ PP W
Sbjct: 422 HAVNPLYILRNYLAQNAIAAAEKGDYAEVRRLHQVLCTPFTEQPGMEGYAQRPPDWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|298160544|gb|EFI01567.1| Selenoprotein O [Pseudomonas savastanoi pv. savastanoi NCPPB 3335]
Length = 487
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 217/549 (39%), Positives = 303/549 (55%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTSVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDTG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL ++Q++S+LL M VDYT FFR
Sbjct: 313 VEALRETIGLFLPLYQAHYLDLMRRRLGLTIAEDQDEQLVSQLLKLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI-SDEERKALM 574
L + P E L L+ +DI + + W +Y+ + +++ER+ M
Sbjct: 373 LGDQ------PAVEALRTLRDDFVDI-----KGFDGWAEAYLARIAGEDKGTEQERQTRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGME YA+ PP W
Sbjct: 422 HAVNPLYILRNYLAQNAIAAAEKGDYAEVRRLHQVLCTPFTEQPGMEGYAQRPPDWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|226946528|ref|YP_002801601.1| hypothetical protein Avin_45110 [Azotobacter vinelandii DJ]
gi|259647051|sp|C1DHP3.1|Y4511_AZOVD RecName: Full=UPF0061 protein Avin_45110
gi|226721455|gb|ACO80626.1| conserved hypothetical protein [Azotobacter vinelandii DJ]
Length = 487
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 218/550 (39%), Positives = 298/550 (54%), Gaps = 68/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L +L +D+ F R GD + T V+P + +P+LV S + L
Sbjct: 1 MKRLSELAFDNRFARL--GDTFS------------TAVTP-LPIASPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L+P + P + +G P GA P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLEPAVADDPQLVEYCAGQCPWPGAEPRAMAYSGHQFGFYNPQLGDGRGLLLGEVINAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERW+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIPT+RALC+ + V R
Sbjct: 106 ERWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPTSRALCVTASDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ EE A + R+A S +RFG ++ + SR E L R L DY I +F
Sbjct: 166 E-------TEERAATLLRLAPSHVRFGHFEFFYYSRQHEAL---RQLLDYVIGEYF---- 211
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
++ L+ + Y A+ V ERTA+L+A+WQ GF HGV+NTDNM
Sbjct: 212 ----ADCLA-------------QPDPYRAFFDRVLERTAALLARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FDP F N +D G RY F NQ I WN++ L +
Sbjct: 255 SILGITFDFGPYAFLDDFDPGFVCNHSDDTG-RYSFDNQVPIAHWNLSALGQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
D ++R+ F + +M ++LG +++ I +LL M VDY+ FFR
Sbjct: 312 DKDALLGSLKRFLPLFRGAWLELMRRRLGFTTAEADDRERIQRLLQLMQGSAVDYSRFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDE-ERKAL 573
L + P L L+ +D+ + +W ++ L +DE R+A
Sbjct: 372 ELGDR------PAAAALRRLREDFVDLA-----GFDAWAGDHLARLARENEADEAARRAR 420
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNPKY+LRNYL Q AI+AAE GD+ VR L ++ RP+DEQPGME+YA PP W
Sbjct: 421 MHAVNPKYILRNYLAQQAIEAAERGDYSPVRELHAVLSRPFDEQPGMERYAERPPEWGKH 480
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 481 ---LEISCSS 487
>gi|302189835|ref|ZP_07266508.1| hypothetical protein Psyrps6_25966 [Pseudomonas syringae pv.
syringae 642]
Length = 487
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 220/551 (39%), Positives = 307/551 (55%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFF 513
+EA + Y ++D +M ++LGL + ++Q++S+LL M VDYT FF
Sbjct: 315 ALREAIGLFLPLYQAHYLD----LMRRRLGLTVAHEQDEQLVSQLLKLMQSSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKA 572
R L + P E L L+ +DI + + W +Y + L + EER+
Sbjct: 371 RRLGDQ------PAVEALRTLRDDFVDI-----KGFDGWAEAYQARIGLEDNGTGEERQT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAIAAAEKGDYEEVRRLHQVLCTPFTEQPGMQGYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|398909678|ref|ZP_10654668.1| hypothetical protein PMI29_00479 [Pseudomonas sp. GM49]
gi|398187628|gb|EJM74962.1| hypothetical protein PMI29_00479 [Pseudomonas sp. GM49]
Length = 487
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 219/551 (39%), Positives = 301/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELIFDNRFDR--LGD------------AFSAHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P+F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPTVAETPEFAELFSGHKLWADAIPRAMVYSGHQFGFYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R++ S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMLLRLSPSHVRFGHFEYFYYTKRPEQQ----KELGDHVLAMHFP-- 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLLENLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + + +I L+ +D+ + + +W Y+ + G D E+R+
Sbjct: 371 RRLGDEAPEQAIAR------LRDDFVDL-----KGFDAWGELYVARVAREGALDQEQRRQ 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP YVLRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYVLRNYLAQKAIDAAESGDYLEVRRLHAVLSNPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|422638997|ref|ZP_16702427.1| hypothetical protein PSYCIT7_08384 [Pseudomonas syringae Cit 7]
gi|330951391|gb|EGH51651.1| hypothetical protein PSYCIT7_08384 [Pseudomonas syringae Cit 7]
Length = 487
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 218/551 (39%), Positives = 309/551 (56%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
+EA + Y ++D +M ++LGL + ++Q++S+LL M VDYT FF
Sbjct: 315 ALREAIGLFLPLYQAHYLD----LMRRRLGLTVAQEQDEQLVSQLLKLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKA 572
R L + P E L L+ +DI + + +W +Y + + +++ER+
Sbjct: 371 RRLGDQ------PAAEALRTLRDDFVDI-----KGFDAWAEAYQTRIAVEDNGTEQERQT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAIAAAEKGDYEEVRRLHQVLCTPFTEQPGMQGYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|424065676|ref|ZP_17803150.1| hypothetical protein Pav013_0366 [Pseudomonas syringae pv.
avellanae str. ISPaVe013]
gi|408003073|gb|EKG43286.1| hypothetical protein Pav013_0366 [Pseudomonas syringae pv.
avellanae str. ISPaVe013]
Length = 487
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 218/549 (39%), Positives = 303/549 (55%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELIFDNRFAR--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKYN---KQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL N +Q++S+LL M VDYT FFR
Sbjct: 313 VEALREAIGLFLPLYQAHYLDLMRRRLGLTVANDQDEQLVSQLLKLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKALM 574
L + P E L L+ +DI + + W +Y + L +++ER+ M
Sbjct: 373 LGDQ------PAAEALRTLRDDFVDI-----KGFDGWAHAYQARIALEDNGTEQERQTRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 422 HAVNPLYILRNYLAQNAIAAAEKGDYEEVRRLHQVLCTPFSEQPGMQGYAQRPPDWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|332304718|ref|YP_004432569.1| hypothetical protein Glaag_0332 [Glaciecola sp. 4H-3-7+YE-5]
gi|410639610|ref|ZP_11350156.1| hypothetical protein GCHA_0379 [Glaciecola chathamensis S18K6]
gi|332172047|gb|AEE21301.1| protein of unknown function UPF0061 [Glaciecola sp. 4H-3-7+YE-5]
gi|410140929|dbj|GAC08343.1| hypothetical protein GCHA_0379 [Glaciecola chathamensis S18K6]
Length = 480
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 207/543 (38%), Positives = 300/543 (55%), Gaps = 67/543 (12%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
+N DHS+ +L GD + P V NPQL+ + ++ ++L+L
Sbjct: 1 MNLDHSYATQL-GDLGALTKP--------------LSVANPQLIEVNHTLREALQLPASW 45
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F + G T +AQ YGGHQFG W LGDGR + LGE + + W+L
Sbjct: 46 FTQSSIMSMLFGNTSSLTKHSFAQKYGGHQFGGWNPDLGDGRGLLLGEAKDQQGNPWDLH 105
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSRFADG AVLRS++RE+L SEA+H +GIPT+RALCL+T+ + V R+
Sbjct: 106 LKGAGPTPYSRFADGRAVLRSTLREYLASEALHHMGIPTSRALCLITSDEPVYRE----- 160
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
K+E A++ RV+QS +RFG ++ G +LD + L DY HF S+
Sbjct: 161 --KQEKAAMMIRVSQSHIRFGHFEYFYHNG--ELDKLEKLFDYCFERHF--------SDC 208
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L T + + A ++ TA+L+A+WQ GF HGV+NTDNMSI G+T
Sbjct: 209 LQ-------------TESPHLAMLEKIVTDTATLIAKWQAFGFNHGVMNTDNMSIHGITF 255
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
D+GP+ FLD FDP F N +D G RY F QP IGLWN+ + I+ ++
Sbjct: 256 DFGPYAFLDDFDPKFVCNHSDHQG-RYAFEQQPGIGLWNLNALAHAFTPYLSIEQIKS-- 312
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ +Y + M E+ +M +KLGL + N +++++ L+ ++ DK DY FR L ++
Sbjct: 313 ALSQYEPRLMAEFSQLMRQKLGLYENNHTTAELVNRWLDLVSQDKRDYHISFRLLCDIDE 372
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+ P+ L+D +R EA +W+ Y Q + + G +ER+A M VNP Y
Sbjct: 373 QGAHPK----------LVDHFIQR-EAAQAWLTQYQQAIRAQGTDTQERQAQMRKVNPAY 421
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LS 640
VLRNY Q AIDAAE GDF R LL+++++P++ +P ++A+ PP W G M +S
Sbjct: 422 VLRNYQAQLAIDAAEQGDFTHFRMLLQVLQQPFESKPEYAEFAKPPPDW----GKHMEIS 477
Query: 641 CSS 643
CSS
Sbjct: 478 CSS 480
>gi|70733990|ref|YP_257630.1| hypothetical protein PFL_0486 [Pseudomonas protegens Pf-5]
gi|121957905|sp|Q4KJF3.1|Y486_PSEF5 RecName: Full=UPF0061 protein PFL_0486
gi|68348289|gb|AAY95895.1| conserved hypothetical protein [Pseudomonas protegens Pf-5]
Length = 487
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 216/550 (39%), Positives = 301/550 (54%), Gaps = 68/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++NP+LVA S L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-LDNPRLVAASPGAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVAETPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E GA+V R+A S +RFG ++ + ++ E + L ++ + HF +
Sbjct: 166 E-------KQERGAMVLRLAPSHVRFGHFEYFYYTKKPEQ---QKQLGEHVLALHFPECQ 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 ELPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L I
Sbjct: 255 SILGITFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPFI 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ + + + Y +M ++LG + +++++ +LL M VDY+ FFR
Sbjct: 312 SVEALRESLGLFLPLYQAHYLDLMRRRLGFTQAEDDDQKLVERLLQLMQNSGVDYSLFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE-RKAL 573
L PE + L L+ +D + + +W Y + + I ++ R+A
Sbjct: 372 RLGE-----HAPE-QALARLRDDFVD-----RNGFDAWAELYRERVARDPIQGQDLRRAR 420
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL +++ RP++EQPGM+ YA PP W
Sbjct: 421 MHAVNPLYILRNYLAQKAIDAAEAGDYSEVRRLHQVLSRPFEEQPGMDSYAERPPEWGKH 480
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 481 ---LEISCSS 487
>gi|386283589|ref|ZP_10060813.1| hypothetical protein SULAR_00015 [Sulfurovum sp. AR]
gi|385345132|gb|EIF51844.1| hypothetical protein SULAR_00015 [Sulfurovum sp. AR]
Length = 479
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 208/523 (39%), Positives = 290/523 (55%), Gaps = 58/523 (11%)
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAV 184
P L + + ++ +++P L++++ A ++LD + P F +G GA
Sbjct: 11 PYLSLDSEFYDMTEPTPLDDPYLISFNPKAAALIDLDDSVKDDPRFVALLNGTFIPKGAR 70
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
++ CY GHQFG +A +LGDGRAI LG I W LQ KG+G+T YSR +DG A L
Sbjct: 71 TFSMCYAGHQFGNYAPRLGDGRAINLGSI-----NGWHLQTKGSGETLYSRSSDGRAALP 125
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIRE+L SEAMH LGIPTTRAL ++ + + R+ E GAIV R++ S++RF
Sbjct: 126 SSIREYLMSEAMHHLGIPTTRALGIIGSQTKILRNQI-------ERGAIVMRMSPSWVRF 178
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G+++ ++ D +R+LADY I + H+++ DE N+Y
Sbjct: 179 GTFEYFYYF--KEYDKLRSLADYVITESYPHLQD------------DE---------NRY 215
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
+ EV ERTA+L+AQWQG+GF HGV+NTDNMSI+GLTIDYGP+ LD FD F N T
Sbjct: 216 YKFFCEVVERTANLIAQWQGIGFNHGVMNTDNMSIVGLTIDYGPYAMLDDFDYGFVCNKT 275
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGT-KFMDEYQAIMTK 483
D G RY + +QP++ WN+ S L LID ++ +G + D Y +M +
Sbjct: 276 DKAG-RYSYGDQPNVSYWNLTMLSKALTP--LIDKNRMQKKLDDFGNFLYPDAYIDVMRE 332
Query: 484 KLGLP-KYNK--QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
KLGL K N+ ++I++L+ + VDYT FFR LS D +P EL + V LD
Sbjct: 333 KLGLELKLNEDVELITELVGTLQEAYVDYTLFFRTLSRYDGD-RMPIFEL--AMNPVPLD 389
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
SW+ Y L S ER+ M NPKYVL+NY+ Q AI+ A+ GDF
Sbjct: 390 ----------SWLTLYDARLAKETRSQNERQKAMLKTNPKYVLKNYMLQEAIELAQKGDF 439
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
V LL + PYDE P E +A P A++ +C LSCSS
Sbjct: 440 SMVETLLYIAAHPYDELPEFEHFAEETPE-AHK-NIC-LSCSS 479
>gi|197334649|ref|YP_002156591.1| hypothetical protein VFMJ11_1896 [Vibrio fischeri MJ11]
gi|226696169|sp|B5FG68.1|Y1896_VIBFM RecName: Full=UPF0061 protein VFMJ11_1896
gi|197316139|gb|ACH65586.1| protein VV1_0039 [Vibrio fischeri MJ11]
Length = 485
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 206/538 (38%), Positives = 287/538 (53%), Gaps = 58/538 (10%)
Query: 110 SFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPD 169
SF L R +PR +T V P+ ++N + + W+ +A +L +
Sbjct: 2 SFWNSLSITTRYSRLPR----CFFTYVQPTP-LDNSRWLIWNSELAKQFDLPENVHNHSE 56
Query: 170 FPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAG 229
FSG + P A Y GHQFG + LGDGR + L EI + K ++L LKGAG
Sbjct: 57 LLDAFSGEVVPSVFAPLAMKYAGHQFGSYNPDLGDGRGLLLAEIKDKKGNSFDLHLKGAG 116
Query: 230 KTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEE 289
TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++T+ V R+ + E
Sbjct: 117 LTPYSRSGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMTSDTPVFREGY-------E 169
Query: 290 PGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
GA++ R+A++ +RFG ++ + S E+L + L+D I HF +K
Sbjct: 170 TGALLIRMAETHIRFGHFEHLFYSNLLEEL---KLLSDKVIEWHFPCCLGEDKP------ 220
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGP 408
Y A + +RTA ++AQWQ VGF HGV+NTDNMSI+G T DYGP
Sbjct: 221 ---------------YLAMFNNIVDRTAYMIAQWQAVGFAHGVMNTDNMSIIGQTFDYGP 265
Query: 409 FGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMER 468
FGFLD ++P + N +D G RY F QP IGLWN++ + +L+ LID + +E+
Sbjct: 266 FGFLDDYEPGYICNHSDYQG-RYAFNQQPRIGLWNLSALAHSLSP--LIDKSDLEKALEQ 322
Query: 469 YGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
Y K D + +M KKLGL + + ++ + ++ + VDYT F R LS++ +
Sbjct: 323 YEIKLHDYFSQLMRKKLGLLSKQEGDTRLFESMFELLSQNTVDYTRFMRVLSDLDSQD-- 380
Query: 526 PEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRN 585
K ++D+ +R EA WV Y+ S + R + M VNPKYVLRN
Sbjct: 381 ---------KQTVIDLFVDR-EAATLWVDLYLTRCKLEADSFDMRCSKMRKVNPKYVLRN 430
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
YL Q AI A GDF +V+ L L+ P+DE P E+YA LPP W R + SCSS
Sbjct: 431 YLAQQAIVKANEGDFSDVKILSTLLASPFDEHPDFERYAELPPEWGKRMEI---SCSS 485
>gi|194289568|ref|YP_002005475.1| hypothetical protein RALTA_A1459 [Cupriavidus taiwanensis LMG
19424]
gi|193223403|emb|CAQ69408.1| conserved hypothetical protein, UPF0061 [Cupriavidus taiwanensis
LMG 19424]
Length = 529
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 213/534 (39%), Positives = 288/534 (53%), Gaps = 72/534 (13%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P+ + +P LV+ + + A L D R DF F G A P A Y G
Sbjct: 45 FTRLLPT-PLPSPYLVSVAPAAAALLGWDASIGGRQDFVETFIGNQVPDWADPLATVYSG 103
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI L + + WE+QLKGAG TPYSR ADG AVLRSSIRE+LC
Sbjct: 104 HQFGVWAGQLGDGRAIRLAQA-QTDTGPWEIQLKGAGLTPYSRMADGRAVLRSSIREYLC 162
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LG+PTTRAL ++ + V R+ E A+V R++ +F+RFG ++ A+
Sbjct: 163 SEAMAALGVPTTRALSIMGSDAPVRRETI-------ETAAVVTRLSPTFIRFGHFEHFAA 215
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
+D+ +R LAD+ I + + S Y A EV+
Sbjct: 216 --HDDVAALRKLADFVIDNFMPACRD---------------------DSQPYQALLREVS 252
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA L+A WQ VGF HGV+NTDNMSILGLTIDYGPFGFLDAFD + N +D G RY
Sbjct: 253 LRTADLIAHWQAVGFCHGVMNTDNMSILGLTIDYGPFGFLDAFDANHICNHSDTQG-RYA 311
Query: 433 FANQPDIGLWN---IAQFSTTLAAAKLIDDKEA-------------NYVMERYGTKFMDE 476
++ QP + WN +AQ L D+E+ + +RY F
Sbjct: 312 YSQQPQVAFWNLHCLAQALLPLWLPPAQADQESARDAAVEAARAALDPFRDRYAAAFFRH 371
Query: 477 YQAIMTKKLGL-------PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDE 529
Y+A KLGL K ++ +++ L + +VDYT F+R L + + + +
Sbjct: 372 YRA----KLGLRPPVGGDDKADEPLLTSLFQLLHSQRVDYTLFWRRLCRISSTDASRDG- 426
Query: 530 LLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQ 589
P++ + LD + A+ +WV Y L + D R+ M +VNPKYVLRN+L +
Sbjct: 427 ---PVRDLFLD-----RAAFDAWVADYRVRLRTEQSHDAARELEMLAVNPKYVLRNHLAE 478
Query: 590 SAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+AI A DF EV RLL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 479 TAIRQARGKDFSEVERLLAVLSRPFDEQPEAEHYAALPPDWA---AGLEVSCSS 529
>gi|300704059|ref|YP_003745661.1| hypothetical protein RCFBP_11757 [Ralstonia solanacearum CFBP2957]
gi|299071722|emb|CBJ43046.1| conserved protein of unknown function, UPF0061 [Ralstonia
solanacearum CFBP2957]
Length = 529
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 212/537 (39%), Positives = 277/537 (51%), Gaps = 72/537 (13%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P +P LV +S A L L + P F G A + P A Y GH
Sbjct: 38 TRLPPLPMPASPYLVGFSPEAAAPLGLSRAGLDTPAGLDVFVGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+QLKGAG TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQLKGAGLTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ + E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRRE-------EIETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + + Y A EVA
Sbjct: 209 -NEKLPELRALADFVI---------------------DRFYPACRAEAQPYLALLREVAR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
TA L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 STAELIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNI----------------------AQFSTTLAAAKLIDDKEANYVMER--Y 469
A QP I WN+ S A ID +A ++ R Y
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLFGSRNDNDGTAFVDLSDEAQAQPAIDAAQAALLVYRDTY 365
Query: 470 GTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
G F Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 366 GATFYARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTP 420
Query: 527 EDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNY 586
D ++ + D +++ +W+ Y + L + + D+ R M VNPKYVLRN+
Sbjct: 421 ADAQARTVRDIFFD-----RDSADAWLADYRRRLQTEPLPDDARAEAMRRVNPKYVLRNH 475
Query: 587 LCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + AI A+ DF EV L ++ RP+D+ PG ++YA P WA +SCSS
Sbjct: 476 LAEIAIRRAKEKDFSEVEHLRTVLARPFDDHPGFQRYAGPAPDWA---ASLEVSCSS 529
>gi|416941360|ref|ZP_11934540.1| hypothetical protein B1M_21293, partial [Burkholderia sp. TJI49]
gi|325524396|gb|EGD02479.1| hypothetical protein B1M_21293 [Burkholderia sp. TJI49]
Length = 426
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 200/471 (42%), Positives = 264/471 (56%), Gaps = 65/471 (13%)
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGRA+T+GE+ R+ELQLKG G+TPYSR DG AVLRSSIREFL
Sbjct: 2 GHQFGVWAGQLGDGRALTVGELAGADGRRYELQLKGGGRTPYSRMGDGRAVLRSSIREFL 61
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
CSEAMH LGIPTTRAL ++ + + V R+ E A+V RV++SF+RFG ++
Sbjct: 62 CSEAMHHLGIPTTRALTVIGSDQPVIREEI-------ETSAVVTRVSESFVRFGHFEHFF 114
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
S + DL +R LAD+ I + + + + Y A
Sbjct: 115 SNDRPDL--LRQLADHVIDRFYPDCRDAD---------------------DPYLALLEAA 151
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
RTA LVAQWQ VGF HGV+NTDNMSILGLTIDYGPFGF+DAFD + N +D G RY
Sbjct: 152 TLRTADLVAQWQAVGFCHGVMNTDNMSILGLTIDYGPFGFVDAFDANHICNHSDTSG-RY 210
Query: 432 CFANQPDIGLWNIAQFSTTL---------------AAAKLIDDKEANYVMERYGTKFMDE 476
+ QP I WN + L A + ++D +A V+ ++ +F
Sbjct: 211 AYRMQPRIAHWNCYCLAQALLPLIGLQHGIADDDARAERAVEDAQA--VLAKFPERFGPA 268
Query: 477 YQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDELLV 532
+ M KLGL + + + ++LL M + D+T FR L+ + K D S
Sbjct: 269 LERAMRAKLGLELERENDAALANQLLETMHASRADFTLTFRRLAQLSKHDASRD-----A 323
Query: 533 PLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAI 592
P++ + +D ++A+ +W Y L D R A MN VNPKYVLRN+L + AI
Sbjct: 324 PVRDLFID-----RDAFDAWANLYRARLSDETRDDAARAAAMNRVNPKYVLRNHLAEVAI 378
Query: 593 DAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A+ DF EV RL +++ RP+DEQP E YA LPP WA G +SCSS
Sbjct: 379 RRAKEKDFSEVERLAQVLRRPFDEQPEHESYAALPPDWA---GSLEVSCSS 426
>gi|260219458|emb|CBA26303.1| UPF0061 protein Rfer_2395 [Curvibacter putative symbiont of Hydra
magnipapillata]
Length = 503
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 212/517 (41%), Positives = 280/517 (54%), Gaps = 53/517 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A Y + P+ + P V S S A LD + P+ +G L G+ P A Y
Sbjct: 36 AFYAPLEPT-PLPAPYWVGTSASAARWAGLDASHLDNPEVLQALTGNRLLQGSEPLASVY 94
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG WAGQLGDGRAI LGE+ L E+QLKGAG TP+SR DG AVLRSSIREF
Sbjct: 95 SGHQFGQWAGQLGDGRAILLGELNGL-----EVQLKGAGLTPFSRMGDGRAVLRSSIREF 149
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAM+ LGIPT+RALC+ + V R+ E A+V RVA SF+RFG ++
Sbjct: 150 LASEAMNGLGIPTSRALCVTGSDAPVRRETI-------ETAAVVTRVAPSFIRFGHFEHF 202
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
G ++ LAD+ I H++ + N Y +
Sbjct: 203 CHHGMPGE--LKILADFVIDHYYPDCRTDAR-----------------WNGNPYVSLLAA 243
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V ERTA +VA+WQ VGF HGV+NTDNMSILGLTIDYGPF F+DA+DP N +D G R
Sbjct: 244 VTERTAHMVARWQAVGFCHGVMNTDNMSILGLTIDYGPFQFMDAYDPGHICNHSDT-GGR 302
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y F QP++ WN+ F A LID++E A +E + + + M KLG +
Sbjct: 303 YAFYKQPNVAYWNL--FCLGQAMMPLIDEQEHAIAALETFKDIYPRAFAERMAAKLGFSE 360
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+K +I +L +A DKVD+T F+R LS+ D + + ++ + LD +
Sbjct: 361 VQEAHKPVIEGILKLLAADKVDFTIFWRRLSHWVRDEASAGNS----VRDLFLD-----R 411
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
+ +W+LSY ELL+ I LM NPK+VLRN+L + AI AA DF V L
Sbjct: 412 AGFDAWLLSY-SELLAH-IPRAPAANLMLKSNPKFVLRNHLGEQAIQAARQKDFSMVADL 469
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LK++E PYDE + +A PP WA + +SCSS
Sbjct: 470 LKVLEAPYDEHREFDAWAGFPPDWAAQ---ISISCSS 503
>gi|410996371|gb|AFV97836.1| hypothetical protein B649_07620 [uncultured Sulfuricurvum sp.
RIFRC-1]
Length = 478
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 203/516 (39%), Positives = 281/516 (54%), Gaps = 62/516 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y +V+P A ++NP+LV+ + L LDP + + +G G+ PYA CY G
Sbjct: 20 YHEVAP-APLKNPKLVSHNLEALKLLGLDPNDLNLTELEKLLNGTLQFKGSRPYAMCYAG 78
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + +LGDGRAI LG + + W LQLKG+G+T YSR DG AVLRSSIRE+L
Sbjct: 79 HQFGYYVQRLGDGRAINLGSV-----KGWNLQLKGSGQTRYSRQGDGRAVLRSSIREYLM 133
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IH 310
SEAM+ LGIPT+RAL ++++ + V R+ + E GAIV R+A S++RFGS++ H
Sbjct: 134 SEAMYGLGIPTSRALAIISSDEKVARERW-------EYGAIVLRLAPSWIRFGSFEYFFH 186
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+R +E + TLAD+ + ES G ED Y
Sbjct: 187 TNRHKE----LETLADFLLH------------ESFPEFVGVED---------PYLTMFGS 221
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ +RTA L+AQWQ VGF HGV+NTDNMS +G+TIDYGPF F+D F+ + N TD G R
Sbjct: 222 IVKRTAELIAQWQSVGFNHGVMNTDNMSAIGITIDYGPFAFMDTFESDYICNHTDTQG-R 280
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP-- 488
Y + NQP IG WN+ + + L+ L+ ++ ++RYG F ++ KLGL
Sbjct: 281 YSYNNQPRIGYWNLERLAHALSP--LVTPEKLKTELDRYGDYFTTRLMELLLAKLGLDTP 338
Query: 489 -KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
K + ++ L M ++D T FFR LS + D LL L + +
Sbjct: 339 HKNDSDLLRALFTLMENGRIDMTPFFRTLSRYDGN----RDTLLS------LTLAPNQLN 388
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
W+ Y + L + S E+R M NPKY+L+NY+ Q AI+AAE GDF V LL
Sbjct: 389 EWLD---QYDERLSLNSSSVEKRHQQMLRTNPKYILKNYILQEAIEAAEKGDFSLVNDLL 445
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
KL + PYDE ++YA + P LSCSS
Sbjct: 446 KLAQNPYDEHELFDRYAGITPP---EHKNLKLSCSS 478
>gi|254447804|ref|ZP_05061269.1| hypothetical protein GP5015_92 [gamma proteobacterium HTCC5015]
gi|198262584|gb|EDY86864.1| hypothetical protein GP5015_92 [gamma proteobacterium HTCC5015]
Length = 493
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 205/515 (39%), Positives = 280/515 (54%), Gaps = 52/515 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
++++ A V + +L W+ +A L L P + +G P P AQ Y G
Sbjct: 27 FSRIEFHAPVSS-RLAVWNSGLAADLGL-PSDSPDESLSRRLAGLEPWPAFTPIAQRYAG 84
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+W QLGDGRA L E+ +++ + ELQLKG G TPYSR DG AVLRS+IRE+LC
Sbjct: 85 HQFGVWVPQLGDGRAALLAELEDIRGQHQELQLKGGGPTPYSRMGDGRAVLRSTIREYLC 144
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + + V R+ E A + RVA S LRFGS++
Sbjct: 145 SEAMHGLGIPTTRALALFDSDEPVQREQI-------ETAATLVRVAPSHLRFGSFEYFYH 197
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG+ + ++TL ++A++H F E+L D D V + V
Sbjct: 198 RGEHEH--LKTLTEFALKHSF--------PEAL-----DSDEPVATMLQT--------VV 234
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ERTASL+A WQ VGF HGV+NTDNMS+LGLT+DYGPFGFLDA+DP N +D G RY
Sbjct: 235 ERTASLMADWQSVGFCHGVMNTDNMSLLGLTLDYGPFGFLDAYDPGHICNHSDHSG-RYA 293
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
++ QP +G WN+ + + ++ A +++ Y F Y + K G + +
Sbjct: 294 YSQQPAVGQWNLVALVSCFLPQ--LGEERARAILDHYPDAFDRAYGERLRGKFGFKQEQQ 351
Query: 493 ---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
Q+I++ M +VDYT FFR L D + P D+ V LD +R EA
Sbjct: 352 GDDQLIAQCFGVMQ-GRVDYTRFFRRLCEF--DENQPLDQQAV------LDECPDR-EAA 401
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAI-DAAELGDFGEVRRLLK 608
I W+ Y L + +R A M NP+YVLRNYL + AI A + GDF EV++L
Sbjct: 402 IEWLARYRSRLQAEHSDRPQRSASMKGHNPRYVLRNYLAEVAIRKATDEGDFSEVKKLAA 461
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ PY +Q + Y +LPP WA +SCSS
Sbjct: 462 VLSDPYRDQLNCDHYDQLPPDWA---ASLAVSCSS 493
>gi|421725344|ref|ZP_16164538.1| hypothetical protein KOXM_07128 [Klebsiella oxytoca M5al]
gi|410373885|gb|EKP28572.1| hypothetical protein KOXM_07128 [Klebsiella oxytoca M5al]
Length = 480
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 208/521 (39%), Positives = 283/521 (54%), Gaps = 53/521 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L YT ++P+ +EN +LV + +A S+ + F + G T L G +P
Sbjct: 10 RDELPDFYTALAPTP-LENARLVWHNAPLARSMGVAESLFSPEKGGGVWGGETVLPGKLP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
A + G FG WAG +GDGR + LGE +E LKGAG TPYSR DG AVLRS
Sbjct: 69 LAPVFRGPPFGFWAGPVGDGRGLLLGEPPVGDGCWFEWPLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH LGIPTTRAL +V + V R+ E GA++ R+A+S +RFG
Sbjct: 129 TIREGLASEAMHALGIPTTRALAIVASDTPVYRETV-------ERGAMLMRLAESHVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ H +E L V+ LADY IRHH+ H++N ++KY
Sbjct: 182 HFE-HFYYRREPLK-VQQLADYVIRHHWPHLQN---------------------EADKYI 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
AW +V RTA ++A WQ VGF HGV+NTDNMSILGLT+DYGP+GFLD F P F N +D
Sbjct: 219 AWYSDVVARTAEMIASWQTVGFAHGVMNTDNMSILGLTMDYGPYGFLDDFQPGFICNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP +GLWN+ + + TL + I + N ++ Y + Y M KL
Sbjct: 279 YQG-RYSFDNQPAVGLWNLQRLAQTL--SPFISAELLNGALDSYQHALLTAYGRRMRDKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL K + +++ L MA + DYT FR LS + ++ PL+ +D
Sbjct: 336 GLFTQQKGDNELLDGLFALMAREGSDYTRTFRMLSASE------QESAASPLRDEFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+E + SW Y L + D +R+A M SVNP VLRN+L Q AI+ AE GD E
Sbjct: 388 ---RETFDSWFADYRARLRDEQVDDAQRQARMRSVNPALVLRNWLAQRAIELAEQGDMSE 444
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RL + +P+ ++ + Y PP W R +SCSS
Sbjct: 445 LERLHNALSQPFIDR--TDDYVNRPPDWGRR---LEVSCSS 480
>gi|410637728|ref|ZP_11348299.1| hypothetical protein GLIP_2883 [Glaciecola lipolytica E3]
gi|410142696|dbj|GAC15504.1| hypothetical protein GLIP_2883 [Glaciecola lipolytica E3]
Length = 478
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 203/510 (39%), Positives = 279/510 (54%), Gaps = 66/510 (12%)
Query: 144 NPQLVAWSESVADSLELDPKEFERPD--FPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
P+L +++ +AD +E PKE + F F L AQ YGGHQFG W
Sbjct: 25 QPELALFNQKLADEIEF-PKELHQQHALFAELFEAEGKL-NQHAIAQKYGGHQFGGWNPD 82
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGR + L EI K +RW+L LKGAGKTPYSRF DG AVLRS+IRE+L SEA+H LGI
Sbjct: 83 LGDGRGLLLAEIETTKKQRWDLHLKGAGKTPYSRFGDGRAVLRSTIREYLASEALHHLGI 142
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
PT+RALCL+ + + V R+ K E GA++ R QS +RFG ++ Q LD +
Sbjct: 143 PTSRALCLIASNETVYRE-------KPETGAMLIRACQSHIRFGHFEYFFHSKQ--LDKL 193
Query: 322 RTLADYAIRHHF-RHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVA 380
L +Y +H+ + +++ N SL +H V+ +TA L+A
Sbjct: 194 EKLFNYTFHNHYPQFMDSQNPHYSLL------EHIVL----------------QTADLIA 231
Query: 381 QWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIG 440
+WQ GF HGV+NTDNMSI G+T D+GP+ FLDA+DP + N +D G RY F QP +
Sbjct: 232 KWQAFGFCHGVMNTDNMSIHGITFDFGPYAFLDAYDPEYICNHSD-HGGRYAFDQQPGVA 290
Query: 441 LWN---IAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QI 494
LWN +A T + +LI K+A + +Y + ++ +M +K G K N Q+
Sbjct: 291 LWNLNALAHAFTPYLSIELI--KQA---LGQYEIQLQSQFATLMGQKFGFKKINSDDMQL 345
Query: 495 ISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVL 554
++ LN ++ DK DYT FR L DE L K L D +R W
Sbjct: 346 VNGWLNLLSQDKRDYTQSFRLLC----------DEHLSTQK--LADHFIDRTNV-TQWHQ 392
Query: 555 SYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPY 614
Y+ + + ER +M NPKY+LRNYL Q+AI AE G F E +RLLK+++ P+
Sbjct: 393 LYLARIAKESLPKNERLIMMREANPKYILRNYLAQNAIQQAEGGSFEECKRLLKVLQNPF 452
Query: 615 DEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
+EQ + YA+ PP W G M +SCSS
Sbjct: 453 EEQHEYQHYAQTPPDW----GQSMEISCSS 478
>gi|417320372|ref|ZP_12106918.1| hypothetical protein VP10329_21700 [Vibrio parahaemolyticus 10329]
gi|328473335|gb|EGF44183.1| hypothetical protein VP10329_21700 [Vibrio parahaemolyticus 10329]
Length = 489
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 198/519 (38%), Positives = 278/519 (53%), Gaps = 54/519 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGLPAAQ--NDELLAVFSGQSEFEPFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQDGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRALGMMVSDTPVYRE-------KTEFGAMLIRMAETHVRFGHFEHL 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA E
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFGE 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVQKTADMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L + L++ ++ + ++ + ++ +M KLGL
Sbjct: 285 YAFEQQPRIALWNLSALAHAL--SPLVEREDLEQALSQFERRLSQQFSRLMRSKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ + + + DYT FFRALSN+ P+ + + L I +E +
Sbjct: 343 IAEDGRLFESMFELLNQNHTDYTRFFRALSNLDKQPA---------QEVIDLFIDREAAQ 393
Query: 548 AWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
AW+ L+ + + + IS E+R M NPKY+LRNYL Q AID AE GDF EV
Sbjct: 394 AWLDLYLARCELEVDEIGEPISAEQRCEQMCQANPKYILRNYLAQLAIDKAEEGDFSEVH 453
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +++ PYD QP E YA+LPP W + + SCSS
Sbjct: 454 RLAEILRHPYDSQPEFEAYAKLPPEWGKKMEI---SCSS 489
>gi|388602079|ref|ZP_10160475.1| hypothetical protein VcamD_19557 [Vibrio campbellii DS40M4]
Length = 489
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 210/551 (38%), Positives = 292/551 (52%), Gaps = 68/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ E +N+ H F ELP A +T V+P ++N + V W+ A
Sbjct: 1 MSVWEGVNFTHRF-SELPS-------------AFFTYVTPQL-LDNTRWVVWNGEFAQQF 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
L E E + F+G A P A Y GHQFG++ LGDGR + L E+ +
Sbjct: 46 GLPAAENE--ELLNVFAGQKEFAPFAPLAMKYAGHQFGVYNPDLGDGRGLLLAEMQHQDG 103
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+++ LKGAG TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++ + V R
Sbjct: 104 TWFDIHLKGAGLTPYSRMGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMDSDTPVYR 163
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K E GA++ RVA++ +RFG ++ Q L + LAD I HF
Sbjct: 164 E-------KMEYGALLIRVAETHIRFGHFEHFFYTNQ--LAEQKLLADKVIEWHF----- 209
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
E L T YAA + E+TA ++A WQ GF HGV+NTDNMS
Sbjct: 210 ---PECLE-------------TEKPYAAMFESIVEKTAEMIAYWQAYGFAHGVMNTDNMS 253
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
LG T DYGPFGFLD +DP++ N +D G RY F QP I LWN++ + +L+ +
Sbjct: 254 TLGQTFDYGPFGFLDDYDPNYICNHSDYQG-RYAFEQQPRIALWNLSALAHSLSPLVQRE 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
D EA + ++ + ++ +M KLGL + ++ + + +K DYT FFR
Sbjct: 313 DLEA--ALGKFEVRLSQKFSELMRAKLGLHTKVDEDGRLFEAMFELLNQNKADYTRFFRE 370
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ-ELLSSG--ISDEERKA 572
LS++ ++ P + L I +E AW+ L+ + E+ G +S + R
Sbjct: 371 LSSL---------DVKSPQAVIDLFIDREAASAWVDLYLARCELEVDECGERVSAQTRCE 421
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M +NPKY+LRNYL Q AID AE GDF EV RL +L++RPYDEQP + YA+LPP W
Sbjct: 422 KMRRMNPKYILRNYLAQIAIDKAEEGDFSEVNRLAELLKRPYDEQPEFDDYAKLPPEWGK 481
Query: 633 RPGVCMLSCSS 643
+ + SCSS
Sbjct: 482 KMEI---SCSS 489
>gi|290979991|ref|XP_002672716.1| UPF0061 domain-containing protein [Naegleria gruberi]
gi|284086295|gb|EFC39972.1| UPF0061 domain-containing protein [Naegleria gruberi]
Length = 701
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 219/556 (39%), Positives = 277/556 (49%), Gaps = 89/556 (16%)
Query: 153 SVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGE 212
+V ++ KE + +F SG + YA CYGG QFG WAGQLGDGRAI++G+
Sbjct: 170 TVEHLMKQQEKEHDLDNFVNILSGYDLVNSTKYYAHCYGGFQFGNWAGQLGDGRAISMGQ 229
Query: 213 ILN---------------------LKSER-WELQLKGAGKTPYSRFADGLAVLRSSIREF 250
+ +K +R WELQ KGAG TP+SR ADG AVLRSSIREF
Sbjct: 230 VETPFTDMDSSGFEFNNSRNSYNYIKPKRLWELQFKGAGHTPFSRHADGRAVLRSSIREF 289
Query: 251 LCSEAMHFLGIPTTRALCLVTTG-KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
L SE M LGI TTRA LV + K V RD FYD NPK E GAIV RVA +F+RFGS+ I
Sbjct: 290 LGSEFMDSLGIATTRAFSLVRSKEKAVLRDEFYDNNPKYEYGAIVLRVAPTFVRFGSFDI 349
Query: 310 HASRGQ---------EDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLT 360
R E+ + LA Y I++HF H+ + GD LT
Sbjct: 350 FNYRYHPINEKEKALEEKKNIEVLARYVIKNHFPHL----------WINGD-------LT 392
Query: 361 SNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFT 420
++ E+ RTA L A W VGF HGVLNTDNMSILGLTIDYGPFGF+D F F
Sbjct: 393 LELKEKFSKEIVRRTAKLCADWMSVGFVHGVLNTDNMSILGLTIDYGPFGFVDYFSEDFV 452
Query: 421 PNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAI 480
PN +D G RY + NQP I WN+ + L ++ A V+ Y F Y
Sbjct: 453 PNNSDSDG-RYRYKNQPAIVFWNLQKLMRAFTPTLLPEEYFAK-VLNVYAPHFEHYYLMN 510
Query: 481 MTKKLGLPKYNK--------------------------QIISKLLNNMAVDKVDYTNFFR 514
KKLGL + ++I L M ++ D+TNFFR
Sbjct: 511 FRKKLGLISSSTIIDTSEDVTNFDMFDGDSENLRNEDWELIEGFLAWMNENRADFTNFFR 570
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWIS----WVLSYIQELLSSGISDEER 570
LSNVK + + ELL L + E +S W+ Y + L S +SDEER
Sbjct: 571 LLSNVKKGAEVSQ-ELLDNLLQTRMHADHTPSETTVSELKNWLSIYTKRLESVPLSDEER 629
Query: 571 KALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPG--MEKYARLPP 628
K M+ NP+Y+LRNY+ Q I +AE D+G + ++ PYD EK+ P
Sbjct: 630 KTQMDKTNPRYILRNYIAQKVIKSAEEFDYGPLYEYYNVLRNPYDNHSTEFEEKFGGNAP 689
Query: 629 AWAYRPGVCM-LSCSS 643
C+ LSCSS
Sbjct: 690 L----SSRCLKLSCSS 701
>gi|398901918|ref|ZP_10650659.1| hypothetical protein PMI30_02537 [Pseudomonas sp. GM50]
gi|398179139|gb|EJM66759.1| hypothetical protein PMI30_02537 [Pseudomonas sp. GM50]
Length = 487
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 219/551 (39%), Positives = 295/551 (53%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P + P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-IAAPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAETPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNNAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALSIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI--HASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R++ S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMVLRLSPSHVRFGHFEFFYYTKRPEQQ----KELGEHVLAMHF--- 211
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 212 ------------------PLCLEQPEPYLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLLEHLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKA 572
R L + PE + L L+ +DI + + +W YI + G + E+R+
Sbjct: 371 RRLGD-----ESPE-QALARLRDDFVDI-----KGFDAWGELYIARVAREGEVDQEQRRT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSNPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|59712376|ref|YP_205152.1| hypothetical protein VF_1769 [Vibrio fischeri ES114]
gi|75353666|sp|Q5E3Y2.1|Y1769_VIBF1 RecName: Full=UPF0061 protein VF_1769
gi|59480477|gb|AAW86264.1| conserved protein [Vibrio fischeri ES114]
Length = 485
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 207/538 (38%), Positives = 288/538 (53%), Gaps = 58/538 (10%)
Query: 110 SFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPD 169
SF L R +PR +T V P+ ++N + + W+ +A +L +
Sbjct: 2 SFWNSLSITTRYSRLPR----CFFTYVQPTP-LDNSRWLIWNSELAKQFDLPENVHNHSE 56
Query: 170 FPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAG 229
FSG T + P A Y GHQFG + LGDGR + L EI + K ++L LKGAG
Sbjct: 57 LLDAFSGETVPSVFSPLAMKYAGHQFGCYNPDLGDGRGLLLAEIKDKKGNSFDLHLKGAG 116
Query: 230 KTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEE 289
TPYSR DG AVLRS+IRE+LCSEAM LGIPTTRAL ++T+ V R+ + E
Sbjct: 117 LTPYSRSGDGRAVLRSTIREYLCSEAMAGLGIPTTRALGMMTSDTPVFREGY-------E 169
Query: 290 PGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
GA++ R+A++ +RFG ++ + S E+L + L+D I HF +K
Sbjct: 170 TGALLIRMAETHIRFGHFEHLFYSNLLEEL---KLLSDKVIEWHFPCCLGEDKP------ 220
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGP 408
Y A + +RTA ++AQWQ VGF HGV+NTDNMSI+G T DYGP
Sbjct: 221 ---------------YLAMFNNIVDRTAYMIAQWQAVGFAHGVMNTDNMSIIGQTFDYGP 265
Query: 409 FGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMER 468
FGFLD ++P + N +D G RY F QP IGLWN++ + +L+ LID + +E+
Sbjct: 266 FGFLDDYEPGYICNHSDYQG-RYAFNQQPRIGLWNLSALAHSLSP--LIDKSDLEKALEQ 322
Query: 469 YGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
Y K D + +M KKLGL + + ++ + ++ + VDYT F RALS + +
Sbjct: 323 YEIKLHDYFSQLMRKKLGLLSKQEGDTRLFESMFELLSQNAVDYTRFMRALSYLDSQD-- 380
Query: 526 PEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRN 585
K ++D+ +R EA W+ Y+ S + R + M VNPKYVLRN
Sbjct: 381 ---------KQTVVDLFVDR-EAATLWIDLYLTRCKLEVDSFDMRCSKMRKVNPKYVLRN 430
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
YL Q AI A GDF +V+ L L+ P+DE P E+YA LPP W R + SCSS
Sbjct: 431 YLAQQAIVKANEGDFSDVKILSTLLASPFDEHPDFERYAELPPEWGKRMEI---SCSS 485
>gi|299066764|emb|CBJ37958.1| conserved protein of unknown function, UPF0061 [Ralstonia
solanacearum CMR15]
Length = 525
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 214/533 (40%), Positives = 272/533 (51%), Gaps = 68/533 (12%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P P LV +S A L L + P F G A + P A Y GH
Sbjct: 38 TRLPPVPMPAAPYLVGFSPEAAAPLGLSRAGLDTPAGLDVFVGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+QLKGAG TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQLKGAGLTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRRETI-------ETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + Y A EV
Sbjct: 209 -NEKLPELRALADFVI---------------------DRFYPACRAEPQPYLALLREVGR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA+L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 RTAALIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNI------------------AQFSTTLAAAKLIDDKEANYVMER--YGTKF 473
A QP I WN+ A S A ID + ++ R YG F
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLCGSDPTVFADLSDEAQAQPAIDAAQEALLVYRDTYGAAF 365
Query: 474 MDEYQAIMTKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 366 YARYRA----KLGLTQPHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTPAQAQ 420
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
++ V D +++ +W+ +Y Q L + D R M VNPKYVLRN+L +
Sbjct: 421 TRTVRDVFFD-----RDSADAWLTAYRQRLQAEPAPDAARAEAMRRVNPKYVLRNHLAEI 475
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AI A DF EV L ++ RP+D+ PG E+YA P WA +SCSS
Sbjct: 476 AIRRAGEKDFSEVENLRVVLARPFDDHPGFERYAGPAPDWA---ASLEVSCSS 525
>gi|28872142|ref|NP_794761.1| hypothetical protein PSPTO_5028 [Pseudomonas syringae pv. tomato
str. DC3000]
gi|33517004|sp|Q87VB1.1|Y5028_PSESM RecName: Full=UPF0061 protein PSPTO_5028
gi|28855396|gb|AAO58456.1| conserved hypothetical protein [Pseudomonas syringae pv. tomato
str. DC3000]
Length = 487
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 225/554 (40%), Positives = 306/554 (55%), Gaps = 76/554 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELVFDNRFAR--LGD------------AFSTHVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ E P F FSG A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQSELPLFAEIFSGHKLWAEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAACVVSSNTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+AQS +RFGS + + ++ E L +TLA++ + H+ H +
Sbjct: 166 E-------KQEYAAMVLRLAQSHVRFGSLEYLFYTKQPEHL---KTLAEHVLTMHYPHCQ 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 EQPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ L +
Sbjct: 255 SILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALGQALTPFVSV 313
Query: 458 DDKEANYVMERYGTKFMDEYQA----IMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYT 510
+ + E G F+ YQA +M ++LGL Q ++S+LL M VDYT
Sbjct: 314 EA-----LRETIGL-FLPLYQAHYLDLMRRRLGLTVAQDQDDKLVSQLLQLMQNSGVDYT 367
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE- 569
FFR L + P + L L+ +DI + + W +Y + + E+
Sbjct: 368 LFFRRLGDQ------PAAQALRALRDDFVDI-----KVFDDWAQAYQARIAAEENGTEQA 416
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
RK M++VNP Y+LRNYL Q+AI+AAE GD+ EVRRL +++ P+ EQPGME YA+ PP
Sbjct: 417 RKERMHAVNPLYILRNYLAQNAIEAAEKGDYEEVRRLHQVLCTPFTEQPGMEGYAQRPPD 476
Query: 630 WAYRPGVCMLSCSS 643
W +SCSS
Sbjct: 477 WGKH---LEISCSS 487
>gi|398968744|ref|ZP_10682484.1| hypothetical protein PMI25_04224 [Pseudomonas sp. GM30]
gi|398143280|gb|EJM32157.1| hypothetical protein PMI25_04224 [Pseudomonas sp. GM30]
Length = 487
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 216/551 (39%), Positives = 298/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSAHVLPEP-IDNPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E +F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPATAETQEFAELFGGHKLWADAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNNAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IP++RA C++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPSSRAACVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHIRFGHFEYFYYTKRPEQQ----KQLGEHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LG +++++ LL M VDYT FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGFITAEDDDQKLLEDLLQLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L A+ ++ L+ +DI + + +W Y+ + G SD E+R+
Sbjct: 371 RHLGEESAEQAVAR------LRDDFVDI-----KGFDAWGERYVARVARDGDSDQEQRRT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ +P+DEQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAEQGDYSEVRRLHAVLSKPFDEQPGMEGYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|289624142|ref|ZP_06457096.1| hypothetical protein PsyrpaN_03169 [Pseudomonas syringae pv.
aesculi str. NCPPB 3681]
gi|289647584|ref|ZP_06478927.1| hypothetical protein Psyrpa2_07497 [Pseudomonas syringae pv.
aesculi str. 2250]
gi|422580961|ref|ZP_16656105.1| hypothetical protein PSYAE_00890 [Pseudomonas syringae pv. aesculi
str. 0893_23]
gi|330865812|gb|EGH00521.1| hypothetical protein PSYAE_00890 [Pseudomonas syringae pv. aesculi
str. 0893_23]
Length = 487
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 217/549 (39%), Positives = 302/549 (55%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTSVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL ++Q++S+LL M VDYT FFR
Sbjct: 313 VEALRETIGLFLPLYQAHYLDLMRRRLGLTIAEDQDEQLVSQLLKLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI-SDEERKALM 574
L + P E L L+ +DI + + W +Y+ + +++ER+ M
Sbjct: 373 LGDQ------PAVEALRTLRDDFVDI-----KGFDGWAEAYLARIAGEDKGTEQERQTRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P EQPGME YA+ PP W
Sbjct: 422 HAVNPLYILRNYLAQNAIAAAEKGDYAEVRRLHQVLCTPVTEQPGMEGYAQRPPDWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|422618998|ref|ZP_16687692.1| hypothetical protein PSYJA_18136 [Pseudomonas syringae pv. japonica
str. M301072]
gi|440720817|ref|ZP_20901229.1| hypothetical protein A979_08408 [Pseudomonas syringae BRIP34876]
gi|440727728|ref|ZP_20907954.1| hypothetical protein A987_16688 [Pseudomonas syringae BRIP34881]
gi|443641221|ref|ZP_21125071.1| Hypothetical protein PssB64_0494 [Pseudomonas syringae pv. syringae
B64]
gi|330899372|gb|EGH30791.1| hypothetical protein PSYJA_18136 [Pseudomonas syringae pv. japonica
str. M301072]
gi|440363133|gb|ELQ00303.1| hypothetical protein A987_16688 [Pseudomonas syringae BRIP34881]
gi|440365187|gb|ELQ02301.1| hypothetical protein A979_08408 [Pseudomonas syringae BRIP34876]
gi|443281238|gb|ELS40243.1| Hypothetical protein PssB64_0494 [Pseudomonas syringae pv. syringae
B64]
Length = 487
Score = 328 bits (840), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 219/551 (39%), Positives = 307/551 (55%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELIFDNRFAR--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P + + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPGQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
+EA + Y ++D +M ++LGL + ++Q++S+LL M VDYT FF
Sbjct: 315 ALREAIGLFLPLYQAHYLD----LMRRRLGLTVAQEQDEQLVSQLLKLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKA 572
R L + P E L L+ +DI + + W +Y + L +++ER+
Sbjct: 371 RRLGDQ------PAAEALRTLRDDFVDI-----KGFDGWAQAYQARIALEDNGTEQERQT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAIAAAEKGDYEEVRRLHQVLCTPFTEQPGMQGYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|422650706|ref|ZP_16713508.1| hypothetical protein PSYAC_03971 [Pseudomonas syringae pv.
actinidiae str. M302091]
gi|330963791|gb|EGH64051.1| hypothetical protein PSYAC_03971 [Pseudomonas syringae pv.
actinidiae str. M302091]
Length = 487
Score = 328 bits (840), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 224/554 (40%), Positives = 306/554 (55%), Gaps = 76/554 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELVFDNRFAR--LGD------------AFSTHVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ E P F FSG A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQAELPLFAEIFSGHKLWAEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAACVVSSNTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+AQS +RFGS + + ++ E L +TLA++ + H+ H +
Sbjct: 166 E-------KQEYAAMVLRLAQSHVRFGSLEYLFYTKQPEHL---KTLAEHVLTMHYPHCQ 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 E---------------------QPEPYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ L +
Sbjct: 255 SILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALGQALTPFVSV 313
Query: 458 DDKEANYVMERYGTKFMDEYQA----IMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYT 510
+ + E G F+ YQA +M ++LGL Q ++S+LL M VDYT
Sbjct: 314 EA-----LRETIGL-FLPLYQAHYLDLMRRRLGLTVAQDQDDKLVSQLLQLMQNSGVDYT 367
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLS-SGISDEE 569
FFR L + P + L L+ +DI + + W +Y + + +D+
Sbjct: 368 LFFRRLGDQ------PAAQALRALRDDFVDI-----KGFDDWAHAYQARIAAEENGTDQA 416
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
RK M++V+P Y+LRNYL Q+AI+AAE GD+ EVRRL +++ P+ EQPGME YA+ PP
Sbjct: 417 RKERMHAVSPLYILRNYLAQNAIEAAEKGDYEEVRRLHQVLCTPFTEQPGMEGYAQRPPD 476
Query: 630 WAYRPGVCMLSCSS 643
W +SCSS
Sbjct: 477 WGKH---LEISCSS 487
>gi|422300416|ref|ZP_16387933.1| hypothetical protein Pav631_4583 [Pseudomonas avellanae BPIC 631]
gi|407987400|gb|EKG30213.1| hypothetical protein Pav631_4583 [Pseudomonas avellanae BPIC 631]
Length = 487
Score = 328 bits (840), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 224/554 (40%), Positives = 306/554 (55%), Gaps = 76/554 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELVFDNRFAR--LGD------------AFSTHVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ E P F FSG A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQAELPLFAEIFSGHKLWAEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAACVVSSNTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+AQS +RFGS + + ++ E L +TLA++ + H+ H +
Sbjct: 166 E-------KQEYAAMVLRLAQSHVRFGSLEYLFYTKQPEHL---KTLAEHVLTMHYPHCQ 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 EQPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ L +
Sbjct: 255 SILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALGQALTPFVSV 313
Query: 458 DDKEANYVMERYGTKFMDEYQA----IMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYT 510
+ + E G F+ YQA +M ++LGL Q ++S+LL M VDYT
Sbjct: 314 EA-----LRETIGL-FLPLYQAHYLDLMRRRLGLTVAQDQDDKLVSQLLQLMQNSAVDYT 367
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLS-SGISDEE 569
FFR L + P + L L+ +DI + + W Y + + +D+
Sbjct: 368 LFFRRLGDQ------PAAQALRALRDDFVDI-----KGFDDWAHVYQARIAAEENGTDQA 416
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
RK M++VNP Y+LRNYL Q+AI+AAE G++ EVRRL +++ P+ EQPGME YA+ PP
Sbjct: 417 RKERMHAVNPLYILRNYLAQNAIEAAEKGNYEEVRRLHQVLCTPFTEQPGMEGYAQRPPD 476
Query: 630 WAYRPGVCMLSCSS 643
W +SCSS
Sbjct: 477 WGKH---LEISCSS 487
>gi|449144456|ref|ZP_21775271.1| hypothetical protein D908_06188 [Vibrio mimicus CAIM 602]
gi|449079957|gb|EMB50876.1| hypothetical protein D908_06188 [Vibrio mimicus CAIM 602]
Length = 489
Score = 328 bits (840), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 204/524 (38%), Positives = 279/524 (53%), Gaps = 64/524 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP-----LFFSGATPLAGAVP 185
A YT + P +EN + W+ +A +EF P+ P SG A P
Sbjct: 19 AFYTSIHPQP-LENARWGMWNALLA-------QEFGLPEVPNSELLAALSGQHLPADFAP 70
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
A Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRS
Sbjct: 71 LAMKYAGHQFGVYNPDLGDGRGLLLAEMASKTGDVYDIHLKGAGLTPYSRMGDGRAVLRS 130
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIRE+LCSEAM LGI TTRAL L+ + V R+ +EE GA++ RVAQS +RFG
Sbjct: 131 SIREYLCSEAMAGLGIATTRALALMNSDTPVYRE-------REERGALLVRVAQSHIRFG 183
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ HF + E + + L+ + ++ YA
Sbjct: 184 HFE-----------------------HFYYTEQHTELKLLADKVIEWYFPTCAQSTKPYA 220
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 221 DWFHQVVERTALMIAQWQVYGFNHGVMNTDNMSILGQTFDYGPFAFLDDYDPNFICNHSD 280
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F QP IGLWN++ + L + LI+ + +E Y + +M KL
Sbjct: 281 YQG-RYAFDQQPRIGLWNLSALAHAL--SPLIEKADLEAALESYSEHLNRYFSQLMRAKL 337
Query: 486 GLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL + ++ + +A + DYT F R LS + + +L+V +A
Sbjct: 338 GLATQQEGDGELFTDFFALLANNHTDYTRFLRELSCLDRQSTEAVIDLVVDRQAA----- 392
Query: 543 KERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
+AW++ L +EL G IS ER +M VNPKY+LRNYL Q AI+ AE GD
Sbjct: 393 ----KAWLTRYLERAARELGQDGQPISQVERCQVMRQVNPKYILRNYLAQQAIELAERGD 448
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
F E++RL +++ PYDE P E YA+LPP W + +SCSS
Sbjct: 449 FQEMQRLAQVLATPYDEHPEFEHYAKLPPEWGKK---LEISCSS 489
>gi|262166059|ref|ZP_06033796.1| UPF0061 domain-containing protein [Vibrio mimicus VM223]
gi|262025775|gb|EEY44443.1| UPF0061 domain-containing protein [Vibrio mimicus VM223]
Length = 489
Score = 327 bits (839), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 207/524 (39%), Positives = 279/524 (53%), Gaps = 64/524 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP-----LFFSGATPLAGAVP 185
A YT + P +EN W+ +A +EF P+ P SG A P
Sbjct: 19 AFYTSIRPQP-LENVDWGMWNAPLA-------QEFGLPEVPNSELLAALSGQQLPADFAP 70
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
A Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRS
Sbjct: 71 LAMKYAGHQFGVYNPDLGDGRGLLLAEMASKTGDVYDIHLKGAGHTPYSRMGDGRAVLRS 130
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIRE+LCSEAM LGI TTRAL L+ + V R+ +EE GA++ RVAQS +RFG
Sbjct: 131 SIREYLCSEAMAGLGIATTRALALMNSDTPVYRE-------REERGALLVRVAQSHIRFG 183
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ H ++ ++ + LAD I HF ++ YA
Sbjct: 184 HFE-HFYYTEQHTEL-KLLADKVIEWHF---------------------PTCAQSAKPYA 220
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 221 DWFHQVVERTALMIAQWQVYGFNHGVMNTDNMSILGQTFDYGPFAFLDDYDPNFICNHSD 280
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F QP IGLWN++ + L + LI+ + +E Y + M KL
Sbjct: 281 YQG-RYAFDQQPRIGLWNLSALAHAL--SPLIEKADLEAALESYSEHLNRYFSQWMRAKL 337
Query: 486 GLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL + ++ + +A + DYT F R LS + + +L+V +A
Sbjct: 338 GLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQSTEAVIDLVVDRQAA----- 392
Query: 543 KERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
+AW++ L +EL G IS ER M VNPKY+LRNYL Q AI+ AE GD
Sbjct: 393 ----KAWLTRYLERAARELGQDGQPISQVERCQAMRQVNPKYILRNYLAQQAIELAERGD 448
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
F E++RL +++ PYDE P E YA+LPP W + +SCSS
Sbjct: 449 FQEMQRLAQVLATPYDEHPEFEHYAKLPPEWGKK---LEISCSS 489
>gi|258621294|ref|ZP_05716328.1| conserved hypothetical protein [Vibrio mimicus VM573]
gi|424807162|ref|ZP_18232570.1| hypothetical protein SX4_0211 [Vibrio mimicus SX-4]
gi|258586682|gb|EEW11397.1| conserved hypothetical protein [Vibrio mimicus VM573]
gi|342325104|gb|EGU20884.1| hypothetical protein SX4_0211 [Vibrio mimicus SX-4]
Length = 489
Score = 327 bits (839), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 206/524 (39%), Positives = 279/524 (53%), Gaps = 64/524 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP-----LFFSGATPLAGAVP 185
A YT + P +EN + W+ +A +EF P+ P SG A P
Sbjct: 19 AFYTSIRPQL-LENVRWGMWNAPLA-------QEFGLPEVPNSELLAALSGQQLPADFAP 70
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
A Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRS
Sbjct: 71 LAMKYAGHQFGVYNPDLGDGRGLLLAEMASKTGDVYDIHLKGAGLTPYSRMGDGRAVLRS 130
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIRE+LCSEAM LGI TTRAL L+ + V R+ +EE GA++ RVAQS +RFG
Sbjct: 131 SIREYLCSEAMAGLGIATTRALALMNSDTPVYRE-------REERGALLVRVAQSHIRFG 183
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ H ++ ++ + LAD I HF ++ YA
Sbjct: 184 HFE-HFYYTEQHTEL-KLLADKVIEWHF---------------------PTCAQSAKPYA 220
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 221 DWFHQVVERTALMIAQWQVYGFNHGVMNTDNMSILGQTFDYGPFAFLDDYDPNFICNHSD 280
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F QP IGLWN++ + L + LI+ + +E Y + M KL
Sbjct: 281 YQG-RYAFDQQPRIGLWNLSALAHAL--SPLIEKADLEAALESYSEHLNRYFSQWMRAKL 337
Query: 486 GLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL + ++ + +A + DYT F R LS + + +L+V +A
Sbjct: 338 GLTTQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGTEAVIDLVVDRQAA----- 392
Query: 543 KERKEAWISWVLSYIQELL---SSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
+AW++ L L S IS ER M VNPKY+LRNYL Q AI+ AE GD
Sbjct: 393 ----KAWLTRYLERAARELGQDSQPISQVERCQAMRQVNPKYILRNYLAQQAIELAERGD 448
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
F E++RL++++ PYDE P E YA+LPP W + +SCSS
Sbjct: 449 FQEMQRLVQVLATPYDEHPEFEHYAKLPPEWGKK---LEISCSS 489
>gi|422587866|ref|ZP_16662536.1| hypothetical protein PSYMP_05329 [Pseudomonas syringae pv.
morsprunorum str. M302280]
gi|330873912|gb|EGH08061.1| hypothetical protein PSYMP_05329 [Pseudomonas syringae pv.
morsprunorum str. M302280]
Length = 487
Score = 327 bits (839), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 224/554 (40%), Positives = 306/554 (55%), Gaps = 76/554 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELVFDNRFAR--LGD------------AFSTHVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWAEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAACVVSSNTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+AQS +RFGS + + ++ E L +TLA++ + H+ H +
Sbjct: 166 E-------KQEYAAMVLRLAQSHVRFGSLEYLFYTKQPEHL---KTLAEHVLTMHYPHCQ 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 EQPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ L +
Sbjct: 255 SILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALGQALTPFVSV 313
Query: 458 DDKEANYVMERYGTKFMDEYQA----IMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYT 510
+ + E G F+ YQA +M ++LGL Q ++S+LL M VDYT
Sbjct: 314 EA-----LRETIGL-FLPLYQAHYLDLMRRRLGLTVAQDQDDKLVSQLLQLMQNSSVDYT 367
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLS-SGISDEE 569
FFR L + P + L L+ +DI + + W Y + + +D+
Sbjct: 368 LFFRRLGDQ------PAAQALRALRDDFVDI-----KGFDDWAHVYQARIAAEENGTDQA 416
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
RK M++VNP Y+LRNYL Q+AI+AAE GD+ EVRRL +++ P+ EQPGME YA+ PP
Sbjct: 417 RKDRMHAVNPLYILRNYLAQNAIEAAEKGDYEEVRRLHQVLCTPFTEQPGMEGYAQRPPD 476
Query: 630 WAYRPGVCMLSCSS 643
W +SCSS
Sbjct: 477 WGKH---LEISCSS 487
>gi|421617149|ref|ZP_16058145.1| hypothetical protein B597_09969 [Pseudomonas stutzeri KOS6]
gi|409780880|gb|EKN60493.1| hypothetical protein B597_09969 [Pseudomonas stutzeri KOS6]
Length = 486
Score = 327 bits (839), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 218/549 (39%), Positives = 304/549 (55%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L L +D+ F R GD + T+VSP ++ P+LV SE+ L
Sbjct: 1 MKTLTQLTFDNRFAR--LGDTFS------------TEVSPQP-LDAPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E+P F FSG + A P A Y GHQFG + QLGDGR + LGE++N
Sbjct: 46 DLDPAKAEQPLFAELFSGHKIWSTAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVINEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAGKTPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ + V R
Sbjct: 106 EYWDLHLKGAGKTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTGSDTLVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ + E GA++ R+A S +RFG ++ + +R +L + L D+ I HF +
Sbjct: 166 E-------RPERGAMLLRLAPSHVRFGHFEFFYYTRQHAEL---KQLLDHVIEAHFADV- 214
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ E Y ++ EV ERTA+LVA+WQ GF HGV+NTDNM
Sbjct: 215 -LEHPEP-------------------YHSFFREVLERTAALVARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GP+ FLD FD F N +D G RY F NQ I WN+A + L +
Sbjct: 255 SILGITFDFGPYAFLDDFDARFICNHSDDTG-RYSFENQVPIAHWNLAALAQAL--TPFV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
D K ME + + E+ +M ++LG + ++ ++ +LL + VDYT+FFR
Sbjct: 312 DVKVLRETMELFLPLYEAEWLDLMRRRLGFDQAEAGDEALVRRLLQLLQTSAVDYTHFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
L A+ ++ L+ +D+ + + +W Y G + E R+A M
Sbjct: 372 ELGEGTAEQAVRR------LREEFVDL-----QGFDAWAEDYCARTAREGAAAEARQARM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
+VNPKY+LRNYL Q AI+AAE GD+G VR L ++ RP++EQPGM++YA PP W
Sbjct: 421 QAVNPKYILRNYLAQQAIEAAEKGDYGPVRELHAVLSRPFEEQPGMQRYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|408484210|ref|ZP_11190429.1| hypothetical protein PsR81_26783 [Pseudomonas sp. R81]
Length = 487
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 218/551 (39%), Positives = 301/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-LDAPRLVVASTAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P F F G T A A P A Y GHQFG + QLGDGR + LGE N
Sbjct: 46 DLDPAVAETPVFAELFGGHTLWAEAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEAYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGMTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E GA+V R+A S +RFG ++ + ++ E ++ H+
Sbjct: 166 E-------KQERGAMVLRMAHSHIRFGHFEYFYYTKKPEQQALLA-----------EHVL 207
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
N++ E Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 208 NLHYPECRE-------------QPEPYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L +
Sbjct: 255 SILGITFDFGPFAFLDDFDAHFICNHSDHEG-RYSFSNQVPIGQWNLSALAQALTPFISV 313
Query: 458 DD-KEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
D KEA + Y + Y +M ++LGL +++++ +LL M VDYT FF
Sbjct: 314 DALKEA---LGLYLPLYQANYLDLMRRRLGLTTAEDDDQKLVERLLQLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKA 572
R L + A ++ L+ +D+ + +W Y ++ G S E+R+
Sbjct: 371 RRLGDESAALAVTR------LRDDFVDLA-----GFDAWAEQYKARVVRDGEYSQEQRRE 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ +P++EQ GME+YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAITAAESGDYSEVRRLHEVLSKPFEEQAGMEQYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|340362031|ref|ZP_08684434.1| SelO family protein [Neisseria macacae ATCC 33926]
gi|339887917|gb|EGQ77424.1| SelO family protein [Neisseria macacae ATCC 33926]
Length = 489
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 207/516 (40%), Positives = 277/516 (53%), Gaps = 50/516 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y++VSP + P VA++ +A L LD +F+ + SG P P A Y G
Sbjct: 19 YSRVSPEP-LTAPYWVAFNTDLAAELNLD-TDFQTTSNLAYLSGNAPQYAPAPIAGVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ +LGDGRAI +G+ ++ +R E QLKGAGKTPYSRFADG AVLRSSIRE+LC
Sbjct: 77 HQFGVYTPRLGDGRAILIGDSVDAAGQRQEWQLKGAGKTPYSRFADGRAVLRSSIREYLC 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + V R+ E A++ R+A SFLRFG ++
Sbjct: 137 SEAMHGLGIPTTRALALCGSDDPVYRETV-------ETAAVLTRIAPSFLRFGHFEYFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
G+E ++ LADY IRH++ ++ + N YAA ++
Sbjct: 190 TGREAE--IQQLADYLIRHYYPGCQDAD---------------------NPYAALLEQIR 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
TA VA WQ VGF HGV+NTDNMS LGLTIDYGPFGFLD +D N +D G RY
Sbjct: 227 NHTADTVAAWQSVGFCHGVMNTDNMSALGLTIDYGPFGFLDDYDRRHVCNHSDTQG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
+ QP + WN + ++ A L+ +++ + F Y M +KLGL + +K
Sbjct: 286 YNAQPFVAHWNFSALASCFDA--LVPHNTLEQLIDGWTEVFQTTYLEKMRRKLGLQQADK 343
Query: 493 Q----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER-KE 547
+ +I+ L + K D+T FFR LS V PL + L K
Sbjct: 344 RDDESLIADLFAALQDQKTDFTLFFRNLSEVGNTHG-------EPLPSKLEQTFKNGVPP 396
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
A+I W+ Y Q L + ER MN NP Y+LRNYL + AI A+ GD+ E+ RL
Sbjct: 397 AFIRWLGRYRQRLRAENSVPAERAIHMNRTNPLYILRNYLAEQAIAQAQNGDYREIERLR 456
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + RP+DEQ A PP + VC +SCSS
Sbjct: 457 RCLARPFDEQAEFADLAEPPPEGSI--PVC-VSCSS 489
>gi|152993207|ref|YP_001358928.1| hypothetical protein SUN_1621 [Sulfurovum sp. NBC37-1]
gi|151425068|dbj|BAF72571.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
Length = 478
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 200/515 (38%), Positives = 280/515 (54%), Gaps = 58/515 (11%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
C+ +V PS + P L+ +E+VA+ L +D +E +F F +GA G+ +A CY
Sbjct: 19 CHDRVKPSP-LTKPFLIHANEAVAEMLGIDKEELYTDEFVDFVNGAYQPEGSDAFAMCYA 77
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG + +LGDGRAI +G + L +QLKGAG+T YSR DG AVLRSSIRE+L
Sbjct: 78 GHQFGFFVDRLGDGRAINIGTLNGL-----HMQLKGAGQTKYSRSGDGRAVLRSSIREYL 132
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
SEAMH LGI TTRAL L+ + V R + E GAIV RV+ S++RFG+++ A
Sbjct: 133 MSEAMHGLGIETTRALALIGSEHSVFRQEW-------EKGAIVLRVSPSWVRFGTFEYFA 185
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
+ + + L DYAI + H+ +D+ N YA + EV
Sbjct: 186 HK--KKFKELEALRDYAIAESYPHL--------------------IDV-ENAYARFFGEV 222
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
+RTA L+A+WQ VGF HGV+NTDNMSI GLTIDYGP+ FLD +D + N TD G RY
Sbjct: 223 VKRTARLMAEWQAVGFNHGVMNTDNMSIAGLTIDYGPYAFLDEYDAGYICNHTDQYG-RY 281
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY- 490
F NQP IG WN+ L+ ++ E N M +Y + + Y +M +K+G +
Sbjct: 282 SFGNQPSIGEWNLRALMAALSPLIQMEKMEEN--MTQYWKIYREHYLKLMCRKMGFDEVL 339
Query: 491 --NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ ++ +L + +DYT FFR LS D +A +L +G K
Sbjct: 340 DGDLDLVKHMLGTLQGLHIDYTLFFRTLSRYTGD------------RAGILKLGLYHKPM 387
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
W+ Y + L + + +ER+ M NPK+VL+NY+ Q IDAAE DF + RL +
Sbjct: 388 Q-DWLDDYDKRLAQNSSTQQEREERMLQTNPKFVLKNYMLQEVIDAAEKDDFSLIDRLFR 446
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+++ PY E P E++A P LSCSS
Sbjct: 447 IVQDPYAEHPAYERWAGATPD---ELKNTKLSCSS 478
>gi|17546467|ref|NP_519869.1| hypothetical protein RSc1748 [Ralstonia solanacearum GMI1000]
gi|33517070|sp|Q8XYL0.1|Y1748_RALSO RecName: Full=UPF0061 protein RSc1748
gi|17428765|emb|CAD15450.1| conserved hypothetical protein [Ralstonia solanacearum GMI1000]
Length = 525
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 213/533 (39%), Positives = 272/533 (51%), Gaps = 68/533 (12%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T++ P P LV +S A L L + P F G A + P A Y GH
Sbjct: 38 TRLPPVPMPAAPYLVGFSPEAAAPLGLSRAGLDTPAGLDVFVGNAIAAWSDPLATVYSGH 97
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG+WAGQLGDGRA+ L E L E+QLKGAG TPYSR DG AVLRSSIREFLCS
Sbjct: 98 QFGVWAGQLGDGRALLLAE-LQTADGPCEVQLKGAGLTPYSRMGDGRAVLRSSIREFLCS 156
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
EAM LGIPTTRALC++ V R+ E A+V R+A SF+RFG ++ A+
Sbjct: 157 EAMAGLGIPTTRALCVIGADAPVRRETI-------ETAAVVTRLAPSFVRFGHFEHFAA- 208
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
E L +R LAD+ I D + Y A EV
Sbjct: 209 -NEKLPELRALADFVI---------------------DRFYPACRAEPQPYLALLREVGR 246
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
RTA+L+AQWQ VGF HGV+NTDNMSILGLT+DYGPFGFLD FD + N +D G RY +
Sbjct: 247 RTAALIAQWQAVGFCHGVMNTDNMSILGLTLDYGPFGFLDGFDANHICNHSDT-GGRYAY 305
Query: 434 ANQPDIGLWNIAQFSTTL---------AAAKLIDDKEANYVM-----------ERYGTKF 473
A QP I WN+ + L A L D+ +A + + YG F
Sbjct: 306 AQQPQIAYWNLFCLAQALLPLCGSDPTAFTDLSDEAQAQPAIDAAQEALLVYRDTYGEAF 365
Query: 474 MDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
Y+A KLGL + ++ + L + + DYT FFR L++V+ D P
Sbjct: 366 YARYRA----KLGLTQAHDGDEALFGDLFKLLHTQRADYTLFFRHLADVRRD-DTPAQAQ 420
Query: 531 LVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQS 590
++ V D +++ +W+ +Y Q L + D R M VNPKYVLRN+L +
Sbjct: 421 ARTVRDVFFD-----RDSADAWLAAYRQRLQTEPAPDAARAEAMRRVNPKYVLRNHLAEI 475
Query: 591 AIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AI A DF EV L ++ RP+D+ PG E YA P WA +SCSS
Sbjct: 476 AIRRAGEKDFSEVENLRAVLARPFDDHPGFEHYAGPAPDWA---ASLEVSCSS 525
>gi|440742946|ref|ZP_20922268.1| hypothetical protein A988_06140 [Pseudomonas syringae BRIP39023]
gi|440376797|gb|ELQ13460.1| hypothetical protein A988_06140 [Pseudomonas syringae BRIP39023]
Length = 487
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 217/551 (39%), Positives = 308/551 (55%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F GD A T V P ++ PQLV S+S L
Sbjct: 1 MKALDELTFDNRFAH--LGD------------AFSTSVLPEP-IDAPQLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
+EA + Y ++D +M ++LGL + ++Q++S+LL M VDYT FF
Sbjct: 315 ALREAIGLFLPLYQAHYLD----LMRRRLGLTVAQEQDEQLVSQLLKLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEERKA 572
R L + P E L L+ +DI + + +W +Y + + +++ER+
Sbjct: 371 RRLGDQ------PAAEALRTLRDDFVDI-----KGFDAWAEAYQTRIAVEDNGTEQERQT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAIAAAEKGDYEEVRRLHQVLCTPFTEQPGMQGYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|410647499|ref|ZP_11357930.1| hypothetical protein GAGA_3495 [Glaciecola agarilytica NO2]
gi|410132920|dbj|GAC06329.1| hypothetical protein GAGA_3495 [Glaciecola agarilytica NO2]
Length = 480
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 206/543 (37%), Positives = 299/543 (55%), Gaps = 67/543 (12%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
+N DHS+ +L GD + P V NPQL+ + ++ ++L+L
Sbjct: 1 MNLDHSYATQL-GDLGALTKP--------------LSVANPQLIEVNHTLREALQLPASW 45
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
F + G T +AQ YGGHQFG W LGDGR + LGE + + W+L
Sbjct: 46 FTQSSIMSMLFGNTSSLTKHSFAQKYGGHQFGGWNPDLGDGRGLLLGEAKDQQGTPWDLH 105
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKGAG TPYSRFADG AVLRS++RE+L SEA+H +GIPT+RALCL+T+ + V R+
Sbjct: 106 LKGAGPTPYSRFADGRAVLRSTLREYLASEALHHMGIPTSRALCLITSDEPVYRE----- 160
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
K+E A++ RV+QS +RFG ++ G +LD + L DY HF S+
Sbjct: 161 --KQEKAAMMIRVSQSHIRFGHFEYFYHNG--ELDKLEKLFDYCFERHF--------SDC 208
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
L + + A ++ TA+L+A+WQ GF HGV+NTDNMSI G+T
Sbjct: 209 LQ-------------AESPHLAMLEKIVTDTATLIAKWQAFGFNHGVMNTDNMSIHGITF 255
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
D+GP+ FLD FDP F N +D G RY F QP IGLWN+ + I+ ++
Sbjct: 256 DFGPYAFLDDFDPKFVCNHSDHQG-RYAFEQQPGIGLWNLNALAHAFTPYLSIEQIKS-- 312
Query: 465 VMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKA 521
+ +Y + M E+ +M +KLGL + N +++++ L+ ++ DK DY FR L ++
Sbjct: 313 ALSQYEPRLMAEFSQLMRQKLGLYENNHTTAELVNRWLDLVSQDKRDYHISFRLLCDIDE 372
Query: 522 DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKY 581
+ P+ L+D +R EA +W+ Y Q + + G +ER+A M VNP Y
Sbjct: 373 QGAHPK----------LVDHFIQR-EAAQAWLTQYQQAIRAQGTDTQERQAQMRKVNPAY 421
Query: 582 VLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LS 640
VLRNY Q AIDAAE GDF R LL+++++P++ +P ++A+ PP W G M +S
Sbjct: 422 VLRNYQAQLAIDAAEQGDFTHFRMLLQVLQQPFESKPEYAEFAKPPPDW----GKHMEIS 477
Query: 641 CSS 643
CSS
Sbjct: 478 CSS 480
>gi|257482499|ref|ZP_05636540.1| hypothetical protein PsyrptA_04473 [Pseudomonas syringae pv. tabaci
str. ATCC 11528]
gi|422594332|ref|ZP_16668623.1| hypothetical protein PLA107_06416 [Pseudomonas syringae pv.
lachrymans str. M301315]
gi|330984640|gb|EGH82743.1| hypothetical protein PLA107_06416 [Pseudomonas syringae pv.
lachrymans str. M301315]
Length = 487
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 217/549 (39%), Positives = 302/549 (55%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTSVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQADLPLFAEIFSGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAGCVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + ++TLA++ + H+ H +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEQ--LKTLAEHVLTMHYPHCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L I
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQAL--TPFIS 312
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ + + + Y +M ++LGL ++Q+ S+LL M VDYT FFR
Sbjct: 313 VEALRETIGLFLPLYQAHYLDLMRRRLGLTIAEDQDEQLASQLLKLMQNSGVDYTLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSS-GISDEERKALM 574
L + P E L L+ +DI + + +W +Y + +++ER+ M
Sbjct: 373 LGDQ------PAVEALRTLRDDFVDI-----KGFDAWAEAYQTRIAGEDNGTEQERQTRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGME YA+ PP W
Sbjct: 422 HAVNPLYILRNYLAQNAIAAAEKGDYAEVRRLHQVLCTPFTEQPGMEGYAQRPPDWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|255067030|ref|ZP_05318885.1| SelO family protein [Neisseria sicca ATCC 29256]
gi|255048626|gb|EET44090.1| SelO family protein [Neisseria sicca ATCC 29256]
Length = 489
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 209/515 (40%), Positives = 275/515 (53%), Gaps = 48/515 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y++VSP + P VA++ +A L LD +F+ + SG P P A Y G
Sbjct: 19 YSRVSPEP-LTAPYWVAFNTDLAAELNLD-TDFQTTANLAYLSGNAPQYAPAPIASVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ +LGDGRAI +G+ ++ +R E QLKGAGKTPYSRFADG AVLRSSIRE+LC
Sbjct: 77 HQFGVYTPRLGDGRAILIGDSVDAAGQRQEWQLKGAGKTPYSRFADGRAVLRSSIREYLC 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + V R+ E A++ R+A SFLRFG ++
Sbjct: 137 SEAMHGLGIPTTRALALCGSDDPVYRETV-------ETAAVLTRIAPSFLRFGHFEYFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
G+E ++ LADY IRH++ + + N YAA ++
Sbjct: 190 TGREAE--IQQLADYLIRHYYPDCRDAD---------------------NPYAALLEQIR 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA VA WQ VGF HGV+NTDNMS LGLTIDYGPFGFLD +D N +D G RY
Sbjct: 227 NRTADTVAAWQSVGFCHGVMNTDNMSALGLTIDYGPFGFLDDYDRRHVCNHSDTQG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
+ QP + WN A ++ A L+ +++ + F Y M +KLGL + +K
Sbjct: 286 YNAQPYVAHWNFAALASCFDA--LVPHDTLEQLIDGWTEVFQTTYLEKMRRKLGLQQADK 343
Query: 493 Q----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ +I+ L + K D+T FFR LS V S E L P G A
Sbjct: 344 RDDESLIADLFAALQDQKTDFTLFFRNLSEV----SNTHGEPLPPKLEQTFKNGV--PPA 397
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+I W+ Y Q L + ER MN NP Y+LRNYL + AI A G + E+ RL +
Sbjct: 398 FIRWLGRYRQRLRAESSVPAERAIRMNLTNPLYILRNYLAEQAIAQARNGVYREIERLRR 457
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RP+DEQ A PP + VC +SCSS
Sbjct: 458 CLARPFDEQAEFADLAEPPPEGSM--PVC-VSCSS 489
>gi|410092428|ref|ZP_11288954.1| hypothetical protein AAI_17061 [Pseudomonas viridiflava UASWS0038]
gi|409760199|gb|EKN45359.1| hypothetical protein AAI_17061 [Pseudomonas viridiflava UASWS0038]
Length = 491
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 221/558 (39%), Positives = 306/558 (54%), Gaps = 80/558 (14%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD + S+ E P AE P+LV S++ L
Sbjct: 1 MKALDELVFDNRFAR--LGDAFSTSVLPE----------PIAE---PRLVVASKAALSLL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + E P F F+G A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLDPSQAETPLFAEIFAGHKLWQEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LG+P++RA C++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGVPSSRAACVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ EE A+V R+A S +RFGS + + ++ E L + LA++ + H+ +
Sbjct: 166 E-------TEESAAMVLRLAHSHVRFGSLEYFYYTKQPEQL---KQLAEHVLTMHYPQCQ 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 E---------------------EPEPYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L +
Sbjct: 255 SILGITFDFGPFAFLDDFDQHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFISV 313
Query: 458 DDKEANYVMERYGTKFMDEYQA----IMTKKLGLPKYNKQ-------IISKLLNNMAVDK 506
D + E G F+ YQA +M ++LGL + N+Q +IS+LL M
Sbjct: 314 DA-----LRETIGL-FLPLYQAHYRDLMRRRLGLTQANEQDDEQDDILISRLLQLMQNSG 367
Query: 507 VDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLS-SGI 565
VDYT FFR L + P E L L+ +DI + + SW +Y+ +
Sbjct: 368 VDYTLFFRRLGDA------PAAEALRVLRDDFVDI-----KGFDSWGETYLARIAQEENT 416
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYAR 625
S++ERK M++VNP Y+LRNYL Q+AI AAE GD+ E+RRL +++ P+ EQP M++YA+
Sbjct: 417 SEDERKTRMHAVNPLYILRNYLAQNAIQAAEKGDYEEIRRLHEVLCNPFTEQPDMDRYAQ 476
Query: 626 LPPAWAYRPGVCMLSCSS 643
PP W +SCSS
Sbjct: 477 RPPDWGKH---LEISCSS 491
>gi|319738636|ref|NP_001188360.1| selenoprotein O [Sus scrofa]
Length = 672
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 199/464 (42%), Positives = 261/464 (56%), Gaps = 50/464 (10%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+++V P A + P++VA SE
Sbjct: 45 LVGLRFDNRALRALPVETPPPGPEGAPSAPRPVPGACFSRVRP-APLRQPRVVALSEPAL 103
Query: 156 DSLELDP-------KEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAI 208
L L +E + LFFSG L G+ P A CY GHQFG +AGQLGDG A+
Sbjct: 104 ALLGLGAPPADADAREAREAEAALFFSGNALLPGSEPAAHCYCGHQFGQFAGQLGDGAAM 163
Query: 209 TLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALC 268
LGE+ ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LGIPTTRA
Sbjct: 164 YLGEVCTAAGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGA 223
Query: 269 LVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA-----------SRGQED 317
V + V RD+ YDGNP+ E A+V R+A +FLRFGS++I S G+ D
Sbjct: 224 CVVSQSTVVRDVLYDGNPRPEKCAVVLRIAPTFLRFGSFEIFKPADELTGRAGPSVGRND 283
Query: 318 LDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTAS 377
+ + + DY I + + + +S+ ++AA+ EV RTA
Sbjct: 284 IRV--QMLDYVISSFYPETQAAHAGDSV----------------QRHAAFFREVTRRTAQ 325
Query: 378 LVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQP 437
LVA+WQ VGF HGVLNTDNMS++GLTIDYGPFGFLD +DP N +D G RY ++ QP
Sbjct: 326 LVAEWQCVGFCHGVLNTDNMSVVGLTIDYGPFGFLDRYDPDHVCNASDTAG-RYAYSKQP 384
Query: 438 DIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---- 493
++ WN+ + + L A ++ EA + E + +F Y M KKLGL + +
Sbjct: 385 EVCKWNLQKLAEALDPALPLELGEA-ILAEEFDAEFRRYYLQKMRKKLGLVRAELEEDGA 443
Query: 494 IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE-DELLVPLKA 536
+++KLL M + D+TN F LS+ A P P+ E L L A
Sbjct: 444 LVAKLLETMHLTGADFTNTFYLLSSFPAGPESPDLAEFLATLTA 487
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 34/71 (47%), Positives = 44/71 (61%), Gaps = 12/71 (16%)
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGME----------- 621
+M++ NPKYVLRNY+ Q+AI+AAE GDF EVRR+LKL+E PY +
Sbjct: 593 VMHANNPKYVLRNYIAQNAIEAAENGDFSEVRRVLKLLETPYHREGEAAEPAEPEAAEGR 652
Query: 622 -KYARLPPAWA 631
Y+ PP WA
Sbjct: 653 LSYSSKPPLWA 663
>gi|389686325|ref|ZP_10177646.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
chlororaphis O6]
gi|388549786|gb|EIM13058.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
chlororaphis O6]
Length = 487
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 214/552 (38%), Positives = 304/552 (55%), Gaps = 72/552 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L++D+ F R GD A T V P +++P+LV S + L
Sbjct: 1 MKALDELSFDNRFAR--LGD------------AFSTHVLPEP-IDHPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP+ E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPEAAESPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R++ S +RFG ++ + R ++ + L ++ + HF +
Sbjct: 166 E-------KQERAAMLLRMSPSHVRFGHFEYFYYTKRPEQQ----KQLGEHVLAMHF--L 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGVTFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
I + + + F Y +M ++LGL + +++++ +LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLFLPLFQAHYLDLMRRRLGLTSAEEEDQKLVERLLQLMQGSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLD--IGKERKEAWISWVLSYIQELLSSGISDEERK 571
R L + A+ ++ A L D + ++ +AW + + + E+R+
Sbjct: 371 RRLGDESAELAV----------ARLRDDFVDRQGFDAWADLYKARVAR--EQDDTQEQRR 418
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
A M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ +P+++Q GM+ YA PP W
Sbjct: 419 ARMHAVNPLYILRNYLAQKAIDAAEQGDYSEVRRLHAVLSKPFEQQAGMDSYAERPPEWG 478
Query: 632 YRPGVCMLSCSS 643
+SCSS
Sbjct: 479 KH---LEISCSS 487
>gi|213971491|ref|ZP_03399603.1| conserved hypothetical protein [Pseudomonas syringae pv. tomato T1]
gi|301385801|ref|ZP_07234219.1| hypothetical protein PsyrptM_24336 [Pseudomonas syringae pv. tomato
Max13]
gi|302062914|ref|ZP_07254455.1| hypothetical protein PsyrptK_23249 [Pseudomonas syringae pv. tomato
K40]
gi|302133691|ref|ZP_07259681.1| hypothetical protein PsyrptN_19974 [Pseudomonas syringae pv. tomato
NCPPB 1108]
gi|213923773|gb|EEB57356.1| conserved hypothetical protein [Pseudomonas syringae pv. tomato T1]
Length = 487
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 225/554 (40%), Positives = 306/554 (55%), Gaps = 76/554 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELVFDNRFAR--LGD------------AFSTHVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ E P F FSG A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQSELPLFAEIFSGHKLWAEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAACVVSSNTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+AQS +RFGS + + ++ E L +TLA++ + H+ H +
Sbjct: 166 E-------KQEYAAMVLRLAQSHVRFGSLEYLFYTKQPEHL---KTLAEHVLTMHYPHCQ 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 EQPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ L +
Sbjct: 255 SILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALGQALTPFVSV 313
Query: 458 DDKEANYVMERYGTKFMDEYQA----IMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYT 510
+ + E G F+ YQA +M ++LGL Q ++S+LL M VDYT
Sbjct: 314 EA-----LRETIGL-FLPLYQAHYLDLMRRRLGLTVAQDQDDKLVSQLLQLMQNSGVDYT 367
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE- 569
FFR L + P + L L+ +DI + + W +Y + + E+
Sbjct: 368 LFFRRLGDQ------PAAQALRALRDDFVDI-----KGFDDWGQAYQARIAAEENGTEQA 416
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
RK M++VNP Y+LRNYL Q+AI+AAE GD+ EVRRL +++ P+ EQPGME YA+ PP
Sbjct: 417 RKERMHAVNPLYILRNYLAQNAIEAAEKGDYEEVRRLHQVLCTPFTEQPGMEGYAQRPPD 476
Query: 630 WAYRPGVCMLSCSS 643
W +SCSS
Sbjct: 477 WGKH---LEISCSS 487
>gi|229588001|ref|YP_002870120.1| hypothetical protein PFLU0444 [Pseudomonas fluorescens SBW25]
gi|259647049|sp|C3KBV5.1|Y444_PSEFS RecName: Full=UPF0061 protein PFLU_0444
gi|229359867|emb|CAY46720.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25]
Length = 487
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 217/552 (39%), Positives = 302/552 (54%), Gaps = 72/552 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-LDAPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPSVAETPVFAELFGGHKLWAEAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGMTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E GA+V R+A S +RFG ++ + + ++ ++ H+
Sbjct: 166 E-------KQERGAMVLRMAHSHIRFGHFEYFYYTKKPEQQAELA------------EHV 206
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
N++ E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 207 LNLHYPECRE-------------QPEPYLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDHEG-RYSFSNQVPIGQWNLSALAQALTPFIG 312
Query: 457 IDD-KEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNF 512
+D KEA + Y + Y +M ++LGL +++++ +LL M VDYT F
Sbjct: 313 VDALKEA---LGLYLPLYQANYLDLMRRRLGLTTAEDDDQKLVERLLKLMQSSGVDYTLF 369
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERK 571
FR L + A ++ L+ +D+ + +W Y + G S+E+R+
Sbjct: 370 FRRLGDEPAALAVTR------LRDDFVDLA-----GFDAWAEQYKARVERDGDNSEEQRR 418
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
A M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ +P++EQ GME+YA+ PP W
Sbjct: 419 ARMHAVNPLYILRNYLAQNAIAAAESGDYSEVRRLHEVLSKPFEEQAGMEQYAQRPPDWG 478
Query: 632 YRPGVCMLSCSS 643
+SCSS
Sbjct: 479 KH---LEISCSS 487
>gi|269965587|ref|ZP_06179701.1| conserved hypothetical protein [Vibrio alginolyticus 40B]
gi|269829812|gb|EEZ84047.1| conserved hypothetical protein [Vibrio alginolyticus 40B]
Length = 489
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 201/520 (38%), Positives = 282/520 (54%), Gaps = 56/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L P+E + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGL-PEE-QNDELLAVFSGLSEFEQFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + ++L LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQNGTWFDLHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LG+PTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGVPTTRALGMMVSDTPVYRE-------KTESGALLLRMAETHVRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFDA 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ +TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVTKTAEMLAYWQAFGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L+ L++ ++ + ++ ++ +M +KLGL
Sbjct: 285 YAFDQQPRIALWNLSALAHALSP--LVEREDLESSLSQFEVHLSQQFSRLMREKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK 546
+ ++ + + +K DYT FFR LSN+ PS +AV+ L + +E
Sbjct: 343 IAEDGRLFEAMFELLHQNKTDYTRFFRTLSNLDNAPS----------QAVIDLFLDREAA 392
Query: 547 EAWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW+ L+ + + L IS E+R M NPKY+LRNYL Q AID AE GDF E+
Sbjct: 393 RAWLDLYLARCELEVDELGGLISTEQRCKQMRQANPKYILRNYLAQLAIDKAEEGDFSEL 452
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +L++RP+DEQ + YA+LPP W + + SCSS
Sbjct: 453 HRLAELLKRPFDEQTEFDDYAKLPPEWGKKMEI---SCSS 489
>gi|440638907|gb|ELR08826.1| hypothetical protein GMDG_03502 [Geomyces destructans 20631-21]
Length = 643
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 222/598 (37%), Positives = 311/598 (52%), Gaps = 90/598 (15%)
Query: 101 ALEDLNWDHSFVRELPGD------------PRTDSIPREVLHACYTKVSPSAEVENPQLV 148
AL+DL +F LP D PR D PR V A +T V P V +P+L+
Sbjct: 37 ALKDLPKSWNFTANLPADSAFPSPAISHKTPRDDLGPRMVKGALFTWVRPEEAV-DPELL 95
Query: 149 AWSESVADSLELDPKEFERPDFPLFFSGATPLA-------GAVPYAQCYGGHQFGMWAGQ 201
S L + P+E + +F +G L G P+AQCYGG QFG WAGQ
Sbjct: 96 GVSTEALRDLGIKPEEAQTDEFRQLVAGNRLLGWNEDKQEGGYPWAQCYGGWQFGSWAGQ 155
Query: 202 LGDGRAITLGEILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLG 260
LGDGRAI+L E N ++ R+ELQLKGAG TPYSRFADG AVLRSSIREF+ SEA++ L
Sbjct: 156 LGDGRAISLFETTNPDTKTRYELQLKGAGMTPYSRFADGKAVLRSSIREFVVSEALNALR 215
Query: 261 IPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDI 320
IPTTRAL L R + EPGAIV R AQS+LR G++ + +RG D D+
Sbjct: 216 IPTTRALSLTLLPHSKVR------RERTEPGAIVTRFAQSWLRIGTFDLLRARG--DRDL 267
Query: 321 VRTLADYAIRHHFRHIENM------NKSESLSFSTGDEDHSVVD----LTSNKYAAWAVE 370
VR LADY H F ++ ++ ++ + + +D L N+YA E
Sbjct: 268 VRKLADYTAEHVFSGWSSLPARLPDDQQDTAEPPSTPVEKDTIDGPTGLEENRYARLYRE 327
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ R A VA WQ FT+GVLNTDN S++GL++D+GPF FLD FDP++TPN D R
Sbjct: 328 ITRRNAKTVAAWQAYAFTNGVLNTDNTSLMGLSLDFGPFAFLDTFDPNYTPNHDD-GMLR 386
Query: 431 YCFANQPDIGLWNIAQFSTTL-----------------------AAAKLIDDKEA--NYV 465
Y + NQP I WN+ + TL A +L+ E
Sbjct: 387 YSYRNQPTIIWWNLVRLGETLGELIGAGAGVDAAEFVEKGVRQEGADELVSRAEGLITRT 446
Query: 466 MERYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKA 521
E Y F++EY+ +MT +LGL P + + S+LL+ M K+D+ FFR LS V
Sbjct: 447 GEEYKAVFLEEYKRLMTARLGLKVHKPDDFETLFSELLDTMEALKLDFNQFFRRLSGV-- 504
Query: 522 DPSIPEDE----------LLVPLKAVLLD--IGKERKEAWI-SWVLSYIQELLSSGIS-- 566
SI E E L + V+ D + +ER AW+ W +++ + ++
Sbjct: 505 --SIKEIETEEARKEKAGLFFHKEGVVGDEAVARERVGAWLDKWRTRVVEDWGAQEVTAQ 562
Query: 567 -DEERKALMNSVNPKYVLRNYLCQSAIDAAEL-GDFGEVRRLLKLMERPYDEQPGMEK 622
+EER+A M +VNP ++ R+++ I E G+ + R++K+ P++E G ++
Sbjct: 563 AEEERQAAMKAVNPNFIPRSWILDEVIRRVEKDGERDVLGRVMKMALNPFEETWGGDR 620
>gi|404403764|ref|ZP_10995348.1| hypothetical protein PfusU_28497 [Pseudomonas fuscovaginae UPB0736]
Length = 487
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 216/551 (39%), Positives = 299/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++NP+LVA S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-IDNPRLVAASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E F F G A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLDPASAEDAVFAQLFGGHKLWAEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++R LC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRTLCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ ++E A+V R+A S +RFG ++ Q + + L ++ + HF E
Sbjct: 166 E-------RQERAAMVLRLAPSHIRFGHFEYFYYTQQTEQH--KQLGEHVLAQHF--PEC 214
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ + E Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 215 LEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L ++
Sbjct: 256 ILGITFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQALTPVISVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
+EA + Y ++D +M ++LGL ++Q++ +LL M V VDYT FF
Sbjct: 315 ALREALGLFLPLYQAHYLD----LMRRRLGLTTAEDGDQQLVEELLQRMQVGGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L A ++ L+ +D+ + + +W Y + DE R+
Sbjct: 371 RRLGEQPAHLAVAR------LRDDFVDL-----KGFDAWAEHYTARVAREPDQDEARRTT 419
Query: 574 -MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AI+AAE GD+ EVR+L ++ RP++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQRAIEAAEKGDYAEVRQLHAVLSRPFEEQPGMEAYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|389872505|ref|YP_006379924.1| hypothetical protein TKWG_14400 [Advenella kashmirensis WT001]
gi|388537754|gb|AFK62942.1| hypothetical protein TKWG_14400 [Advenella kashmirensis WT001]
Length = 494
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 210/521 (40%), Positives = 278/521 (53%), Gaps = 51/521 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT++ + +P L+ + V L L ++ P F SG L G V + Y
Sbjct: 17 AFYTRLRMQG-LTDPTLLHVNPDVLALLGLTMEDARSPQFLSIMSGNADLPGGVTLSAVY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEIL----NLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
GHQFG+WAGQLGDGRA LG I N K WE+QLKG+GKTPYSR DG AVLRSS
Sbjct: 76 SGHQFGVWAGQLGDGRAHLLGAIRGTDGNGKPADWEIQLKGSGKTPYSRMGDGRAVLRSS 135
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
+RE+L S AM LGIPTT+ALCLV + V R+ E AIV RVA SF+RFGS
Sbjct: 136 VREYLASAAMTGLGIPTTQALCLVASDDPVYRETV-------ETAAIVARVAPSFVRFGS 188
Query: 307 YQ-IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ +A++ D +R L DY I F D +H++ D+
Sbjct: 189 FEHWYAAK---DPARLRELLDYVISSFFAD----------QIPLPDNEHTLNDVIEQ--- 232
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
+ V ERTA+L+A WQ VGF HGV+NTDNMS+LGLT+DYGP+GF+DAF + N TD
Sbjct: 233 -FVDVVIERTATLMADWQSVGFNHGVMNTDNMSVLGLTLDYGPYGFMDAFRINHVCNHTD 291
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY + QP +GLWN+ +F+ A D + +ERY F+ Y+ M KL
Sbjct: 292 TQG-RYAWNAQPSVGLWNLYRFANCFVALG-ADPERLKARLERYEGLFIAAYRDRMLAKL 349
Query: 486 GLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
GL + + ++I + D+T FR L+ + D S L+ + D
Sbjct: 350 GLQTWQEGDDELIDGWWRVLHEQSADFTLSFRYLAQIDNDES--------ALRRLFADTA 401
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ W+LSY + L + + R + M+ VNP YVLRNYL + AI AA GD
Sbjct: 402 GLEQ-----WLLSYRKRLQDNEGDAQARASRMDRVNPLYVLRNYLAEEAIQAAAKGDMSV 456
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LL+++ PY QPGME +A PP W V SCSS
Sbjct: 457 TDSLLQVLRDPYTAQPGMEHFAEPPPEWGRELEV---SCSS 494
>gi|332284548|ref|YP_004416459.1| hypothetical protein PT7_1295 [Pusillimonas sp. T7-7]
gi|330428501|gb|AEC19835.1| hypothetical protein PT7_1295 [Pusillimonas sp. T7-7]
Length = 491
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 216/516 (41%), Positives = 285/516 (55%), Gaps = 47/516 (9%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT++SP + P+L+ + VA L PK F PDF SG+ PL G A Y
Sbjct: 20 AFYTRLSPQP-LTQPRLLHANPDVAALLGWSPKVFNDPDFLDICSGSAPLPGGKTLAAVY 78
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGE++ L S WELQLKG+G+TPYSR DG AVLRSS+RE+
Sbjct: 79 SGHQFGVWAGQLGDGRAHLLGEVVAL-SGSWELQLKGSGRTPYSRMGDGRAVLRSSVREY 137
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAM LGIPTTRAL LV + V R+ E AIV RV+ SF+RFGS++ H
Sbjct: 138 LASEAMAGLGIPTTRALALVVSDDPVYRETV-------ETAAIVTRVSPSFIRFGSFE-H 189
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
S ++L R L +Y + + + ES+ ++D + L +
Sbjct: 190 WSGSPDNL---RALCNYVVDRFYPECRDAADGESVR----EQDVVLRFLRA--------- 233
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V ERTA L+A WQ GF HGV+NTDNMSILGLTIDYGP+GF+D F + N +D G R
Sbjct: 234 VVERTARLMADWQTAGFCHGVMNTDNMSILGLTIDYGPYGFMDDFQVNHVCNHSDTQG-R 292
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y + QP + WN+ + ++ L + D N + R+ F+ Y+ +++KLGL ++
Sbjct: 293 YAWNAQPSVANWNLYRLASALMGLDIPADALKNE-LGRFEAVFLQAYRGNLSRKLGLRQW 351
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ + D+T FR L+ V P E L G E +
Sbjct: 352 EDGDDELFDDWWRLLHTQSADFTLCFRGLAGV---PGQREPWL----------SGFEDQA 398
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
A +W+ Y+ L D ER MN NP YVLRN+L ++AI AA GD GE+ LL
Sbjct: 399 AANAWLDRYMARLARDKRPDHERIEQMNRANPVYVLRNHLAEAAIQAAAQGDAGEINTLL 458
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L+ PY E+PG E YA PP WA R V SCSS
Sbjct: 459 GLLREPYVEKPGFEAYASAPPDWASRLEV---SCSS 491
>gi|422659854|ref|ZP_16722275.1| hypothetical protein PLA106_20733 [Pseudomonas syringae pv.
lachrymans str. M302278]
gi|331018468|gb|EGH98524.1| hypothetical protein PLA106_20733 [Pseudomonas syringae pv.
lachrymans str. M302278]
Length = 487
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 224/554 (40%), Positives = 305/554 (55%), Gaps = 76/554 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV SES L
Sbjct: 1 MKALDELVFDNRFAR--LGD------------AFSTHVLPEP-IDAPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ E P F FSG A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLAPEQSELPLFAEIFSGHKLWAEAAPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAACVVSSNTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+AQS +RFGS + + ++ E L +TLA++ + H+ H +
Sbjct: 166 E-------KQEYAAMVLRLAQSHVRFGSLEYLFYTKQPEHL---KTLAEHVLTMHYPHCQ 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 EQPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ L +
Sbjct: 255 SILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALGQALTPFVSV 313
Query: 458 DDKEANYVMERYGTKFMDEYQA----IMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYT 510
+ + E G F+ YQA +M ++LGL Q ++S+LL M VDYT
Sbjct: 314 EA-----LRETIGL-FLPLYQAHYLDLMRRRLGLTVAQDQDDKLVSQLLQLMQNSGVDYT 367
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE- 569
FFR L + P + L L+ +DI + + W +Y + + E+
Sbjct: 368 LFFRRLGDQ------PAAQALRALRDDFVDI-----KGFDDWAQAYQARIAAEENGTEQA 416
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
RK M++VNP Y+LRNYL Q+AI+AAE GD+ VRRL +++ P+ EQPGME YA+ PP
Sbjct: 417 RKERMHAVNPLYILRNYLAQNAIEAAEKGDYEAVRRLHQVLCTPFTEQPGMEGYAQRPPD 476
Query: 630 WAYRPGVCMLSCSS 643
W +SCSS
Sbjct: 477 WGKH---LEISCSS 487
>gi|410627270|ref|ZP_11338012.1| hypothetical protein GMES_2486 [Glaciecola mesophila KMM 241]
gi|410153120|dbj|GAC24781.1| hypothetical protein GMES_2486 [Glaciecola mesophila KMM 241]
Length = 480
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 199/506 (39%), Positives = 290/506 (57%), Gaps = 52/506 (10%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
V NPQLV + ++ D+L+L F + G T +AQ YGGHQFG W
Sbjct: 23 VANPQLVEVNHTLRDALQLPASWFTQSSIMSMLFGNTSSFTTHSFAQKYGGHQFGGWNPD 82
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGR + LGE + + W+L LKGAG TPYSRFADG AVLRS++RE+L SEA+H +GI
Sbjct: 83 LGDGRGLLLGEAKDKFGKSWDLHLKGAGPTPYSRFADGRAVLRSTLREYLASEALHHMGI 142
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
PT+RALCL+T+ + V R+ K+E A++ RV+QS +RFG ++ G +LD +
Sbjct: 143 PTSRALCLITSDEPVYRE-------KQEKAAMMIRVSQSHIRFGHFEYFYHNG--ELDKL 193
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
+ L DY HHF S + + + A +V TA+L+A+
Sbjct: 194 KRLFDYCFEHHF---------------------SACLHSESPHLAMLEKVVTDTATLIAK 232
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
WQ GF HGV+NTDNMSI G+T D+GP+ FLD FDP F N +D G RY F QP +GL
Sbjct: 233 WQAYGFNHGVMNTDNMSIHGITFDFGPYAFLDDFDPKFVCNHSDHQG-RYAFEQQPGVGL 291
Query: 442 WNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKL 498
WN+ + A ++ ++ + +Y K M E+ +M +KLGL + + +++++
Sbjct: 292 WNLNALAH--AFTPYLNVEQIKGELSQYEPKLMAEFSQLMRQKLGLYENTQNTAELVNRW 349
Query: 499 LNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ 558
L+ ++ DK DY FR L + E++ LV D +R A + W+ Y Q
Sbjct: 350 LDLISQDKRDYHISFRLLCEIDEH---GENQPLV-------DHFMQRDTAKM-WLEHYQQ 398
Query: 559 ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQP 618
L++ + +ER+A M ++NP+YVLRNY Q AIDAA+ GDF RRLL++++ P++ +P
Sbjct: 399 ALITQNVKRQERQANMRNINPEYVLRNYQAQLAIDAAQDGDFSRFRRLLQVLQHPFEGKP 458
Query: 619 GMEKYARLPPAWAYRPGVCM-LSCSS 643
++A+ PP W G M +SCSS
Sbjct: 459 EYAEFAKPPPDW----GKHMEISCSS 480
>gi|269469310|gb|EEZ80812.1| hypothetical protein Sup05_0886 [uncultured SUP05 cluster
bacterium]
Length = 451
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 199/506 (39%), Positives = 273/506 (53%), Gaps = 72/506 (14%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
+ N L+ ++++ D L LD F+ SG G P A Y GHQFG + Q
Sbjct: 14 LNNTFLIHKNQALYDQLGLD---FDEKTLLKIASGEQKFEGTQPIASIYAGHQFGHFVPQ 70
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGR+ +G++ +EL LKGAG TPYSR ADG AVLRSSIRE+LCS AM L I
Sbjct: 71 LGDGRSCLIGQV-----SGYELSLKGAGTTPYSRGADGRAVLRSSIREYLCSIAMKGLNI 125
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
TT AL LV++ V R+ EPG+IV RVA S +RFG +++ ASRGQ V
Sbjct: 126 ATTEALTLVSSDTEVYRENI-------EPGSIVMRVAPSHVRFGHFELFASRGQTAQ--V 176
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
+ LAD+ I H++ H + ++Y + EV + TA ++A+
Sbjct: 177 KQLADFVIEHYYPHCQG----------------------ESRYVDFFNEVVKHTAVMIAR 214
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
WQ GF+HGV+NTDNMSILGLTIDYGPFGFL+ ++P F N +D G RY F QP I L
Sbjct: 215 WQAQGFSHGVMNTDNMSILGLTIDYGPFGFLETYNPKFVCNHSDHEG-RYAFEQQPGIAL 273
Query: 442 WNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKL 498
WN+A+ +L + LID K++ V++ Y + Y +M +K G K + Q +I +
Sbjct: 274 WNLARLGDSLES--LIDAKQSKAVLDNYQAYLVKAYSKLMRQKFGFIKKDDQDNVLIGQF 331
Query: 499 LNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ 558
+ ++ DYTN R LSN+ D+L + + W+ Y +
Sbjct: 332 FEVLYQNQKDYTNSLRQLSNI--------DQL-------------SKDTDFTDWIELYHK 370
Query: 559 ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG-DFGEVRRLLKLMERPYDEQ 617
+ SD R LMN VNPKY+LRNYL + AI AE D+ E+ L L+ +P+D
Sbjct: 371 RIDQEKSSD--RVELMNIVNPKYILRNYLAEVAIRKAEDDKDYSEIETLFDLLSQPFDTH 428
Query: 618 PGMEKYARLPPAWAYRPGVCMLSCSS 643
G++ YA P+WA V SCSS
Sbjct: 429 SGLDSYASKAPSWAQGLEV---SCSS 451
>gi|433657166|ref|YP_007274545.1| Selenoprotein O and cysteine-containing protein-like protein
[Vibrio parahaemolyticus BB22OP]
gi|432507854|gb|AGB09371.1| Selenoprotein O and cysteine-containing protein-like protein
[Vibrio parahaemolyticus BB22OP]
Length = 489
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 196/519 (37%), Positives = 277/519 (53%), Gaps = 54/519 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P ++N + VAW+ A L + + FSG + P A Y
Sbjct: 19 AFYTLVEPQP-LDNTRWVAWNGEFAQQFGLPAAQ--NDELLAVFSGQSEFEPFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEIEHQNGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTR+L ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRSLGMMVSDTPVYRE-------KTEFGAMLIRMAETHVRFGHFEHL 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF + K YAA +
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHFADCASAEKP---------------------YAAMFGD 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G R
Sbjct: 226 IVQKTADMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + L + L++ ++ + ++ + ++ +M KLGL
Sbjct: 285 YAFEQQPRIALWNLSALAHAL--SPLVEREDLEQALSQFEGRLSQQFSCLMRSKLGLKTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ + + + DYT FFRALSN+ P+ + + L I +E
Sbjct: 343 IAEDGRLFESMFELLNQNHTDYTRFFRALSNLDKQPA---------QEVIDLFIDREAAR 393
Query: 548 AWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
AW+ L+ + + + IS E+R M NPKY+LRNYL Q AID AE GDF EV
Sbjct: 394 AWLDLYLARCELEVDEIGEPISAEQRCEQMRQANPKYILRNYLAQLAIDKAEEGDFSEVH 453
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +++ PYD QP E YA+LPP W + +SCSS
Sbjct: 454 RLAEILRHPYDSQPEFEAYAKLPPEWGKK---MEISCSS 489
>gi|149191184|ref|ZP_01869442.1| hypothetical protein VSAK1_02354 [Vibrio shilonii AK1]
gi|148835022|gb|EDL52001.1| hypothetical protein VSAK1_02354 [Vibrio shilonii AK1]
Length = 485
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 204/515 (39%), Positives = 283/515 (54%), Gaps = 54/515 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPL-FFSGATPLAGAVPYAQCYG 191
YT+V P+ + NP+ VAW++ +A L P+ E L SG+ P A Y
Sbjct: 21 YTQVEPTP-LNNPRWVAWNQELAGELGF-PEMVEDEQALLDVLSGSVSSEHIKPLAMKYA 78
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG++ LGDGR + LGE++ + ++L LKGAG+TPYSR DG AVLRS+IRE+L
Sbjct: 79 GHQFGIYNPDLGDGRGLLLGEVVGKSGQTFDLHLKGAGQTPYSRMGDGRAVLRSTIREYL 138
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
CSEAM LGIPTTRAL ++ + V R+ + E GA++ R A + +RFG ++
Sbjct: 139 CSEAMAGLGIPTTRALAMMVSDTLVYRE-------QVEQGALLVRAADTHIRFGHFEHFF 191
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
GQ + +R LAD I HF + +K Y A+ EV
Sbjct: 192 YTGQHEQ--LRLLADKVIEWHFPDCLDADKP---------------------YVAFFAEV 228
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
TA ++A WQ GF HGV+NTDNMSILG T DYGPFGF+D ++P + N +D G RY
Sbjct: 229 VRLTAEMIAHWQAKGFAHGVMNTDNMSILGQTFDYGPFGFMDDYEPGYICNHSDYQG-RY 287
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYN 491
F NQP I +WN+ + L+ LID K+ ++ +E + EY M K GL
Sbjct: 288 AFDNQPSIAMWNLTALAHALSP--LIDRKDLDHGLETFTPILQTEYSCQMRDKFGLSTKQ 345
Query: 492 KQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ ++ + M +KVDYT FFRALSN+ + + P+ + +D K +
Sbjct: 346 SEDGDFFNRSFDLMESEKVDYTRFFRALSNIDSTG-------IAPVVDLFIDRAKAQ--- 395
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+WV SY+ + S+ ER M NPKY+LRNYL Q AI+ AE GDF V +L
Sbjct: 396 --AWVESYLLRCMLENDSEAERCRKMRLANPKYILRNYLAQQAIELAEKGDFSLVHQLAD 453
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L++ PY+EQ E++A+LPP W R + SCSS
Sbjct: 454 LLKFPYEEQAEHEEFAKLPPEWGKRMEI---SCSS 485
>gi|86146089|ref|ZP_01064415.1| hypothetical protein MED222_02913 [Vibrio sp. MED222]
gi|85836036|gb|EAQ54168.1| hypothetical protein MED222_02913 [Vibrio sp. MED222]
Length = 485
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 210/530 (39%), Positives = 291/530 (54%), Gaps = 62/530 (11%)
Query: 120 RTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATP 179
R ++PR YT + P+ + N Q +AW+ ++A+ L E + SG
Sbjct: 12 RFTALPR----LFYTPIQPTP-LSNVQWLAWNHNLANELGFPSFECTSEELLETLSGNVE 66
Query: 180 LAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADG 239
P A Y GHQFG + LGDGR + L +++ E ++L LKGAGKTPYSR DG
Sbjct: 67 PEQFSPVAMKYAGHQFGSYNPDLGDGRGLLLAQVVAKSGETFDLHLKGAGKTPYSRMGDG 126
Query: 240 LAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQ 299
AV+RS++RE+LCSEAM L IPTTRAL ++T+ V R+ K+E GA++ R A+
Sbjct: 127 RAVIRSTVREYLCSEAMAGLNIPTTRALAMMTSDTPVYRE-------KQEWGALLVRAAE 179
Query: 300 SFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDL 359
S +RFG ++ Q L + LAD I HF E L D+D
Sbjct: 180 SHIRFGHFEHLFYTNQ--LAEHKLLADKVIEWHF--------PECL-----DDD------ 218
Query: 360 TSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSF 419
YAA +V +RTA +VA WQ GF HGV+NTDNMSI+G T DYGPF FLD +DP
Sbjct: 219 --KPYAAMFNQVVDRTAEMVALWQANGFAHGVMNTDNMSIIGQTFDYGPFAFLDEYDPRL 276
Query: 420 TPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQA 479
N +D G RY F QP IGLWN++ + +L + L+D E +E+Y + +
Sbjct: 277 ICNHSDYQG-RYAFNQQPRIGLWNLSALAHSL--SPLVDKAELEGALEQYEPQMNGYFSQ 333
Query: 480 IMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA 536
+M +KLGL + + ++ + M+ +KVDY FFR LSN+ D +P+D
Sbjct: 334 MMRRKLGLLSKQEGDSRLFESMFELMSQNKVDYPRFFRTLSNL--DTLLPQD-------- 383
Query: 537 VLLDIGKERKEAWISWVLSYIQ--ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
++D+ +R+ A + WV +Y+Q EL S ++D K M VNPKY+LRNYL Q AID
Sbjct: 384 -VIDLVIDREAAKL-WVDNYLQRCELEDSSVADRCEK--MRQVNPKYILRNYLAQLAIDK 439
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
AE GD ++ L+ ++ PY E P E A LPP W G M +SCSS
Sbjct: 440 AERGDSSDIDALMVVLADPYAEHPDYEHLAALPPEW----GKAMEISCSS 485
>gi|398889754|ref|ZP_10643533.1| hypothetical protein PMI31_01349 [Pseudomonas sp. GM55]
gi|398189202|gb|EJM76485.1| hypothetical protein PMI31_01349 [Pseudomonas sp. GM55]
Length = 487
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 217/551 (39%), Positives = 300/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ + R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRYDR--LGD------------AFSAHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P+F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVAETPEFAELFSGHKLWADAIPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA++ L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALNALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R+A S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMILRLAPSHVRFGHFEYFYYTKRPEQQ----KVLGDHVLAMHFPQC 214
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 215 --LEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL + ++ ++ LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAQEDDQTLLESLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + PE + + L+ +D+ + + +W YI + G D E+R+
Sbjct: 371 RRLGD-----DAPE-QAITRLRDDFVDL-----KGFDAWGERYIARVAREGAHDQEQRRQ 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AI AAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAITAAESGDYSEVRRLHAVLSNPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|365538386|ref|ZP_09363561.1| hypothetical protein VordA3_01542 [Vibrio ordalii ATCC 33509]
Length = 489
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 213/529 (40%), Positives = 291/529 (55%), Gaps = 66/529 (12%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAV 184
E+ A Y+ V+P A ++N + VAW+ S+A L L + P+ L SG A
Sbjct: 15 ELPSAFYSPVNP-APLDNVRWVAWNASLAGDLSLPTQ----PNDELLHSLSGQVIPAQFK 69
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L EI + E ++L LKGAG TPYSR DG AVLR
Sbjct: 70 PLAMKYAGHQFGIYNPDLGDGRGLLLAEIESKTGEVYDLHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
S+IRE+LCSEAM LGI TTRAL ++++ V R+ K+E GA++ RVAQS +RF
Sbjct: 130 STIREYLCSEAMVGLGIATTRALAMMSSDTPVYRE-------KQERGALLVRVAQSHIRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNK- 363
G ++ Q L + LAD I H+ LT K
Sbjct: 183 GHFEHFFYTNQ--LAEQKQLADKVIEWHYPDC----------------------LTQEKP 218
Query: 364 YAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNT 423
YAA ++ ERTA ++A WQ VGF HGV+NTDNMSILG T DYGPF FLD ++P++ N
Sbjct: 219 YAAMFSQIVERTAKMIADWQAVGFAHGVMNTDNMSILGQTFDYGPFAFLDDYEPTYIGNH 278
Query: 424 TDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTK 483
+D G RY F QP + LWN++ + L + L++ + + ++ + + M +
Sbjct: 279 SDYQG-RYAFDQQPRVALWNLSALAHAL--SPLVERSDLEAALAQFEAQLGRYFSQQMRR 335
Query: 484 KLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLD 540
KLGL + + ++ + + DYT FFR LSN+ E E LV LD
Sbjct: 336 KLGLLTSLPGDSVLFEQMFELLTKNHTDYTRFFRQLSNLDR-----EGEQLV------LD 384
Query: 541 IGKERKEAWISWVLSYI----QELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDA 594
+ +R A SW+ Y +E+ SSG IS E+R A M VNPKY+LRNYL Q AID
Sbjct: 385 LFIDRAAAQ-SWLEQYQARCEREIDSSGNAISIEQRCAEMRKVNPKYILRNYLAQQAIDK 443
Query: 595 AELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
AE GD+ +V +L +L+ PY EQP +A+LPP W + +SCSS
Sbjct: 444 AEQGDYQQVHQLAQLLANPYAEQPEKSHFAQLPPEWGKK---MEISCSS 489
>gi|424035146|ref|ZP_17774456.1| hypothetical protein VCHENC02_0907 [Vibrio cholerae HENC-02]
gi|408898130|gb|EKM33673.1| hypothetical protein VCHENC02_0907 [Vibrio cholerae HENC-02]
Length = 489
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 202/520 (38%), Positives = 284/520 (54%), Gaps = 56/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T V+P ++N + V W+ +A L E + F+G A P A Y
Sbjct: 19 AFFTYVTPQP-LDNTRWVVWNGELAKQFGL--PESANEELLNVFAGQNEFASFAPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L E+ + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEMQHQNGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRALGMMDSDTPVYRE-------KMEYGALLIRIAETHIRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I +F + K YAA
Sbjct: 189 FYTNQ--LSEQKYLADKVIEWYFPDCLEVEKP---------------------YAAMFET 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ E+T+ ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD +DP++ N +D G R
Sbjct: 226 IVEKTSVMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYDPNYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + +L+ +D EA + ++ + ++ +M KLGL
Sbjct: 285 YAFEQQPRIALWNLSALAHSLSPLVQREDLEA--ALGKFEMRLSQKFSELMRAKLGLLTK 342
Query: 491 NKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK 546
++ + + + +K DYT FFR LSN+ + S +AV+ L I +E
Sbjct: 343 IEEDGCLFEAMFELLNQNKTDYTRFFRELSNLDVNSS----------QAVIDLFIDREAA 392
Query: 547 EAWISWVLSYIQ-ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW+ L+ + E+ G +S E R M NPKY+LRNYL Q AID AE GDF EV
Sbjct: 393 SAWVDLYLARCELEVDERGECVSAETRCEKMRRANPKYILRNYLAQLAIDKAEEGDFSEV 452
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +L++RPYDEQP ++ YA+LPP W + + SCSS
Sbjct: 453 SRLAELLKRPYDEQPELDDYAKLPPEWGKKMEI---SCSS 489
>gi|349609535|ref|ZP_08888925.1| hypothetical protein HMPREF1028_00900 [Neisseria sp. GT4A_CT1]
gi|348611728|gb|EGY61365.1| hypothetical protein HMPREF1028_00900 [Neisseria sp. GT4A_CT1]
Length = 489
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 208/515 (40%), Positives = 275/515 (53%), Gaps = 48/515 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y++VSP + P VA++ +A L LD +F+ + SG P P A Y G
Sbjct: 19 YSRVSPEP-LTAPYWVAFNTDLAAELNLD-TDFQTTANLAYLSGNAPQYAPAPIASVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ +LGDGRAI +G+ ++ +R E QLKGAGKTPYSRFADG AVLRSSIRE+LC
Sbjct: 77 HQFGVYTPRLGDGRAILIGDSVDAAGQRQEWQLKGAGKTPYSRFADGRAVLRSSIREYLC 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + V R+ E A++ R+A SFLRFG ++
Sbjct: 137 SEAMHGLGIPTTRALALCGSDDPVYRETV-------ETAAVLTRIAPSFLRFGHFEYFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
G+E ++ LADY IRH++ + + N YAA ++
Sbjct: 190 TGREAE--IQQLADYLIRHYYPDCRDAD---------------------NPYAALLEQIR 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA VA WQ VGF HGV+NTDNMS LGLTIDYGPFGFLD +D N +D G RY
Sbjct: 227 NRTADTVAAWQSVGFCHGVMNTDNMSALGLTIDYGPFGFLDDYDRRHVCNHSDTQG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
+ QP + WN + ++ L+ +++ + F Y M +KLGL + +K
Sbjct: 286 YNAQPFVAHWNFSALASCFDT--LVPHDTLEQLIDGWTEVFQTTYLEKMRRKLGLQQADK 343
Query: 493 Q----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ +I+ L + K D+T FFR LS V S E L P G A
Sbjct: 344 RDDESLIADLFAALQDQKTDFTLFFRNLSGV----SNTHGEPLPPKLEQTFKNGV--PPA 397
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+I W+ Y Q L + + ER MN NP Y+LRNYL + AI A GD+ E+ RL
Sbjct: 398 FIRWLGRYRQRLRAENSNPAERAIRMNLTNPLYILRNYLAEQAIAQARNGDYREIERLRC 457
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ RP+DEQ A PP + VC+ SCSS
Sbjct: 458 CLARPFDEQAEFADLAEPPPEGSI--PVCV-SCSS 489
>gi|424033229|ref|ZP_17772644.1| hypothetical protein VCHENC01_1463 [Vibrio cholerae HENC-01]
gi|408874963|gb|EKM14125.1| hypothetical protein VCHENC01_1463 [Vibrio cholerae HENC-01]
Length = 489
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 202/520 (38%), Positives = 284/520 (54%), Gaps = 56/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T V+P ++N + V W+ +A L E F+G A P A Y
Sbjct: 19 AFFTYVTPQP-LDNTRWVVWNGELAKQFGL--PESANEALLNVFAGQNEFASFAPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L E+ + +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEMQHQNGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRALGMMDSDTPVYRE-------KMEYGALLIRIAETHIRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I +F + K YAA
Sbjct: 189 FYTNQ--LSEQKYLADKVIEWYFPDCLEVEKP---------------------YAAMFET 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ E+T+ ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD +DP++ N +D G R
Sbjct: 226 IVEKTSVMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYDPNYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL--- 487
Y F QP I LWN++ + +L+ +D EA + ++ + ++ +M KLGL
Sbjct: 285 YAFEQQPRIALWNLSALAHSLSPLVQREDLEA--ALGKFEMRLSQKFSELMRAKLGLLTK 342
Query: 488 PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK 546
+ + ++ + + +K DYT FFR LSN+ + S +AV+ L I +E
Sbjct: 343 IEEDGRLFEAMFELLNQNKTDYTRFFRELSNLDVNSS----------QAVIDLFIDREAA 392
Query: 547 EAWISWVLSYIQ-ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW+ L+ + E+ G +S E R M NPKY+LRNYL Q AID AE GDF EV
Sbjct: 393 SAWVDLYLARCELEVDERGECVSAETRCEKMRRANPKYILRNYLAQLAIDKAEEGDFSEV 452
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +L++RPYDEQP ++ YA+LPP W + + SCSS
Sbjct: 453 SRLAELLKRPYDEQPELDDYAKLPPEWGKKMEI---SCSS 489
>gi|149374466|ref|ZP_01892240.1| hypothetical protein MDG893_10476 [Marinobacter algicola DG893]
gi|149361169|gb|EDM49619.1| hypothetical protein MDG893_10476 [Marinobacter algicola DG893]
Length = 532
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 193/520 (37%), Positives = 286/520 (55%), Gaps = 52/520 (10%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPY 186
E+ + YT+V P+ +++ ++V ++ +A ++ + D+ +G L G P
Sbjct: 62 ELPDSFYTRVQPTP-LKDARMVCFNHELAKTMGFHAQN--PADWTGIGAGTELLEGMDPV 118
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFGM+ LGDGR + L E + RW+ LKGAG TPYSRF DG AVLRS+
Sbjct: 119 AMKYTGHQFGMYNPDLGDGRGLLLWETVGPDGRRWDWHLKGAGMTPYSRFGDGKAVLRST 178
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IRE+LCSEAM LGIPTTRAL +V+ V R+ E A + RVA++ +RFG
Sbjct: 179 IREYLCSEAMAALGIPTTRALFMVSAKDPVRRESI-------ETAAALVRVAETHIRFGH 231
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ A E ++TL ++ I HF H+ ++ + E +Y
Sbjct: 232 FEFAAH--HEGEQALKTLIEHVIALHFPHLISLPEDE-------------------RYQR 270
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
W VEV ERTA ++A WQ VGF HGV+N+DNMSI+G T DYGP+ FLD FD + N TD
Sbjct: 271 WYVEVVERTARMIADWQAVGFCHGVMNSDNMSIIGDTFDYGPYAFLDDFDAGYICNHTD- 329
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG 486
G RY + QP+ G N + L ++++ + + RY + + + M KLG
Sbjct: 330 KGGRYAYNRQPNTGFVNCQYLANALLP--VMNEDDVRRGLRRYEIAYNERFLQNMRDKLG 387
Query: 487 LPKYNKQIISKLLNNMAV---DKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L + ++ +S +++ ++ +DYT FFR LSN+ + S P +L V ++V D
Sbjct: 388 LAQEDESDLSLIMDTFSMLHEHHIDYTLFFRGLSNLTSKGSSPIRDLFVD-RSVADD--- 443
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
W+ Y Q L S + +ER+ M VNPKY+LRNYL Q I A+ GD+ +
Sbjct: 444 --------WIERYEQRLQSETRAHDEREYHMRKVNPKYILRNYLAQQVILEAQNGDYEPM 495
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ LL+++++P+DEQP E+Y+ PP W + SCSS
Sbjct: 496 KELLEVLKKPFDEQPEFEQYSAPPPDWGKHLSI---SCSS 532
>gi|260768958|ref|ZP_05877892.1| UPF0061 domain-containing protein [Vibrio furnissii CIP 102972]
gi|260616988|gb|EEX42173.1| UPF0061 domain-containing protein [Vibrio furnissii CIP 102972]
Length = 489
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 204/521 (39%), Positives = 278/521 (53%), Gaps = 58/521 (11%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQ 188
A +T V P + + + VAW+ +A L P+ L SGA A P A
Sbjct: 19 AFFTPVQPQP-LSHVRWVAWNHDLAHQFGLP----HTPNDELLHSLSGAQLPAAFSPLAM 73
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLRSSIR
Sbjct: 74 KYAGHQFGVYNPDLGDGRGLLLAEMATRQGEVFDIHLKGAGLTPYSRMGDGRAVLRSSIR 133
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E+LCSEAM LGI TTRAL L+ + V R+ K+E GA++ RVAQS +RFG ++
Sbjct: 134 EYLCSEAMAGLGIATTRALALMRSDTPVYRE-------KQERGALLVRVAQSHIRFGHFE 186
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
Q D ++ LAD I + H T+ YAA
Sbjct: 187 YLFYTEQH--DELKLLADKVIEWYLPHCAK---------------------TAQPYAAMF 223
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+ +RTA ++AQWQ VGF HGV+NTDNMSILG T DYGP+GFLD ++P + N +D G
Sbjct: 224 DHIVDRTAKMIAQWQAVGFAHGVMNTDNMSILGQTFDYGPYGFLDDYEPGYICNHSDYQG 283
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F QP + LWN++ + L + LI+ + + Y T+ + +M KLGL
Sbjct: 284 -RYAFDQQPRVALWNLSALAHAL--SPLIEREALEAALSAYETQLNGYFSGLMRDKLGLT 340
Query: 489 ---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ + ++ L + VDYT F R LS + P+ +L + A L
Sbjct: 341 TRLEGDGELFHDLFELLETHHVDYTRFMRQLSALDTQPAQHVADLCLDRDAAL------- 393
Query: 546 KEAWISWVLSYI---QELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
AW++ L+ ++ +S ER A M VNPKY+LRNYL Q AID AE GD E
Sbjct: 394 --AWLTRYLNRCALERDEQGQVVSAHERCAKMRQVNPKYILRNYLAQIAIDKAEQGDDSE 451
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
V RL ++++ P+DEQP E YA+LPP W + +SCSS
Sbjct: 452 VLRLAQVLKHPFDEQPDAEAYAKLPPEWGKK---LEISCSS 489
>gi|89093059|ref|ZP_01166010.1| hypothetical protein MED92_03243 [Neptuniibacter caesariensis]
gi|89082709|gb|EAR61930.1| hypothetical protein MED92_03243 [Oceanospirillum sp. MED92]
Length = 488
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 219/550 (39%), Positives = 301/550 (54%), Gaps = 67/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ LE LN+D+S++R LP + Y +V P+ + +P L++++ +VA L
Sbjct: 1 MAQLESLNFDNSYLR-LP-------------ESFYQRVEPTP-LRDPHLISFNPAVAKLL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + +FSG L G+ P A Y GHQFG++ +LGDGR + LGE++N +
Sbjct: 46 DLDPCGIKPAQIADYFSGNALLPGSEPLAMKYTGHQFGVYNPELGDGRGLLLGEVVNKQG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERW+L LKGAGKT +SRF DG AVLRSSIRE+L SEAMH L IPTTRALCLV + + V R
Sbjct: 106 ERWDLHLKGAGKTAFSRFGDGRAVLRSSIREYLISEAMHGLNIPTTRALCLVGSEEMVMR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ EP A V RV Q +RFG ++ ++ +R D ++ LADYA+ F
Sbjct: 166 EGMM------EPCAAVLRVTQCHIRFGHFEHLYYTRQH---DALKELADYALERFF---- 212
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
E L Y A EV +R+ASLVA+WQ GF H VLNTDNM
Sbjct: 213 ----PEFLE-------------AEQPYLAMFTEVVQRSASLVAKWQAYGFVHAVLNTDNM 255
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
S++G T DYGPF FLD ++PS N D G RY FA QP I WN++ + L LI
Sbjct: 256 SLIGETFDYGPFSFLDTYNPSLISNHNDHQG-RYAFAQQPGIIHWNLSCLAQALLP--LI 312
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFR 514
+ ++ V++ Y ++ A K++GL + ++++I L A + VD FFR
Sbjct: 313 EREDLVKVLDSYPERYRLAELAEFRKRMGLQLEMEVDEELIRDLTKLFASEAVDMNRFFR 372
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWIS-WVLSYIQELLSSGISDEERKAL 573
LS+ +E L L +L R A ++ W+L Y L S R+A
Sbjct: 373 KLSDFDGS-----EESLANLMGLL------RNPAQLTPWLLKYEARLKDEPASFPIRRAQ 421
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M SVNP+++LRNY+ + AI A GDF V LL L+ P +E E YA PP WA
Sbjct: 422 MRSVNPEFILRNYMAEEAIQQATKGDFSLVNELLGLLRNPMEELENYEVYAEKPPEWA-- 479
Query: 634 PGVCMLSCSS 643
G+C L+CSS
Sbjct: 480 AGIC-LTCSS 488
>gi|343495773|ref|ZP_08733886.1| hypothetical protein VINI7043_22597 [Vibrio nigripulchritudo ATCC
27043]
gi|342822257|gb|EGU57006.1| hypothetical protein VINI7043_22597 [Vibrio nigripulchritudo ATCC
27043]
Length = 482
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 206/517 (39%), Positives = 291/517 (56%), Gaps = 61/517 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQCY 190
++KV+P+ +EN + V W+E +A L L E PD L SG + L P A Y
Sbjct: 21 FSKVTPTP-LENVRWVDWNEKLAVELGLP----ESPDGELLDLLSGNSVLDAFPPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG + LGDGR + L ++ + + +++ +KGAG TPYSR DG AVLRSSIRE+
Sbjct: 76 VGHQFGAYNPDLGDGRGLLLFQV-DTDDKSYDIHIKGAGLTPYSRQGDGRAVLRSSIREY 134
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-I 309
L SEA+H L IP+TRAL L+T+ V R+ E GAI RVA + +RFG ++ +
Sbjct: 135 LMSEALHGLAIPSTRALALLTSDTPVYREEI-------ETGAICVRVATTHIRFGHFEYL 187
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
+ + EDL + +D+ I HHF +I + N Y A
Sbjct: 188 YYTNQIEDL---KQFSDFVIDHHFPNITD---------------------EPNPYLAMFQ 223
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EV RTAS++A+WQ +GF HGV+NTDNMSILG T D+GPFG ++ FDPSF N +D G
Sbjct: 224 EVVSRTASMIAKWQSIGFAHGVMNTDNMSILGETFDFGPFGMMENFDPSFICNHSDYQG- 282
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY F NQP IGLWN+ + L+ LI ++ ++ Y T EY +M KLGL +
Sbjct: 283 RYAFDNQPSIGLWNLTALAQALSP--LIAKEDLQKTLDTYYTTLTKEYSVLMRNKLGLLE 340
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++ ++L M ++ DYTN FRALSN + + L LK
Sbjct: 341 SKPEDTELFNRLFALMKENQADYTNTFRALSNAD---KVGKSAFLAELK---------NS 388
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
++ + W SY + L S +++ R M + NPKY+LRNY+ Q+AI+ A+ GDF E++RL
Sbjct: 389 DSALDWFSSYQERLESEESNEKLRCNQMRTHNPKYILRNYMAQTAIERAQEGDFSEMKRL 448
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
KL++ P+DE G E+ + P WA G+ LSCSS
Sbjct: 449 KKLLDFPFDEDSGTEEDTKPAPEWA--QGLA-LSCSS 482
>gi|384424919|ref|YP_005634277.1| Selenoprotein O and cysteine-containing-like protein [Vibrio
cholerae LMA3984-4]
gi|327484472|gb|AEA78879.1| Selenoprotein O and cysteine-containing-like protein [Vibrio
cholerae LMA3984-4]
Length = 489
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 209/525 (39%), Positives = 281/525 (53%), Gaps = 64/525 (12%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYA 187
A YT V P ++N + W+ +A L E P+ L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNTRLAQQFGLP----EAPNDELLASLSGQHLPADFSPVA 72
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLRSS+
Sbjct: 73 MKYAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLRSSL 132
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG +
Sbjct: 133 REYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHF 185
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q ++ LAD I HF E TS YAAW
Sbjct: 186 EHFFYTDQH--ANLKLLADKVIEWHFPDCEQ---------------------TSKPYAAW 222
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 223 FSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQ 282
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 283 G-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGL 339
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 340 ATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD---- 388
Query: 545 RKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ AE G
Sbjct: 389 -REAAKTWLTRYLERAARELGQEGRPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERG 447
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 448 DFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|375264856|ref|YP_005022299.1| hypothetical protein VEJY3_04130 [Vibrio sp. EJY3]
gi|369840180|gb|AEX21324.1| hypothetical protein VEJY3_04130 [Vibrio sp. EJY3]
Length = 489
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 198/519 (38%), Positives = 270/519 (52%), Gaps = 54/519 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T + P + N Q V W+ A L P E FSG P A Y
Sbjct: 19 AFFTFIEPQPLL-NTQWVVWNGDFAQQFGLPPIADE--TLLEVFSGQANFDEFRPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG + LGDGR + L EI +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGTYNPDLGDGRGLLLAEIQRQDGTWFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ R+A++ +RFG ++
Sbjct: 136 LCSEAMQGLGIPTTRALGMMVSDTQVYRE-------KTENGAMLIRMAETHIRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I H T YAA
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHLPECAQ---------------------TEKPYAAMFAN 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ E+TA ++A+WQ GF HGV+NTDNMSILG T DYGPFGFLD +DP + N +D G R
Sbjct: 226 IVEKTADMIAKWQAFGFAHGVMNTDNMSILGQTFDYGPFGFLDDYDPGYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP + LWN++ + L+ L++ + + ++ + ++ +M KLGL
Sbjct: 285 YAFDQQPRVALWNLSALAHALSP--LVERTDLEAALGQFEVRLSQQFSRLMRSKLGLKNR 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ + + +K DYT FFR LS + D P+D + L I +E +
Sbjct: 343 IDEDSRLFESMFELLNQNKTDYTRFFRTLSTL--DKKSPQD-------VIDLFIDREAAQ 393
Query: 548 AWISWVLSYIQ---ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
AW+ L+ + + L ++ E+R M NPKY+LRNYL Q AID AE GDF EV
Sbjct: 394 AWLDLYLARCELEVDELGKPVTTEQRCEQMRRANPKYILRNYLAQLAIDKAEEGDFSEVN 453
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL L+ PYD QP E+YA+LPP W + +SCSS
Sbjct: 454 RLAALLRNPYDSQPEFEEYAKLPPEWGKK---MEISCSS 489
>gi|258626476|ref|ZP_05721316.1| conserved hypothetical protein [Vibrio mimicus VM603]
gi|258581187|gb|EEW06096.1| conserved hypothetical protein [Vibrio mimicus VM603]
Length = 489
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 207/535 (38%), Positives = 284/535 (53%), Gaps = 68/535 (12%)
Query: 120 RTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP-----LFF 174
R ++P+ A YT + P +EN + W+ +A +EF P+ P
Sbjct: 12 RFSALPK----AFYTSIRPQP-LENVRWGMWNAPLA-------QEFGLPEVPNSELLAAL 59
Query: 175 SGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYS 234
SG A P A Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYS
Sbjct: 60 SGQQLPADFAPLAMKYAGHQFGVYNPDLGDGRGLLLAEMASKTGDVYDIHLKGAGLTPYS 119
Query: 235 RFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIV 294
R DG AVLRSSIRE+LCSEAM LGI TTRAL L+ + V R+ EE GA++
Sbjct: 120 RMGDGRAVLRSSIREYLCSEAMAGLGIATTRALALMNSDTPVYRE-------HEERGALL 172
Query: 295 CRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDH 354
RVAQS +RFG ++ H ++ ++ + LAD I HF
Sbjct: 173 VRVAQSHIRFGHFE-HFYYTEQHTEL-KLLADKVIEWHF--------------------- 209
Query: 355 SVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDA 414
++ YA W +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD
Sbjct: 210 PTCAQSAKPYADWFHQVVERTALMIAQWQVYGFNHGVMNTDNMSILGQTFDYGPFAFLDD 269
Query: 415 FDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFM 474
+DP+F N +D G RY F QP IGLWN++ + L + LI+ + +E Y
Sbjct: 270 YDPNFICNHSDYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIEKADLEAALESYSEHLN 326
Query: 475 DEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELL 531
+ +M KLGL + ++ + +A + DYT F R LS + + E++
Sbjct: 327 RYFSQLMRAKLGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQST----EVV 382
Query: 532 VPLKAVLLDIGKERKEAWISWVLSYIQELL---SSGISDEERKALMNSVNPKYVLRNYLC 588
+ L I ++ +AW++ L L S IS ER M VNPKY+LRNYL
Sbjct: 383 IDLV-----IDRQAAKAWLTRYLERAARELGQDSQPISQVERCQAMRQVNPKYILRNYLA 437
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
Q AI+ AE GDF E++ L +++ PYDE P E YA+LPP W + +SCSS
Sbjct: 438 QQAIELAERGDFQEMQCLAQVLATPYDEHPEFEHYAKLPPEWGKK---LEISCSS 489
>gi|378948430|ref|YP_005205918.1| Selenoprotein O-like protein [Pseudomonas fluorescens F113]
gi|359758444|gb|AEV60523.1| Selenoprotein O-like protein [Pseudomonas fluorescens F113]
Length = 487
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 213/549 (38%), Positives = 297/549 (54%), Gaps = 66/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K LE L +D+ F R GD T V P ++NP+LV S + L
Sbjct: 1 MKTLETLTFDNRFAR--LGD------------GLSTHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E + P F F G A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAQAPLFAEIFGGHKLWAETEPRAMVYSGHQFGHYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H LGIP+TRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALGIPSTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R+A S +RFG ++ + +L LA++ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKKPELHA--ALAEHVLNLHFAECRE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L +D
Sbjct: 256 ILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQALTPFISVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ Y + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ALRETLGL--YLPLYQAHYLDLMRRRLGLTCAEEDDQTLLERLLQLMQNSGVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKALM 574
L + PE + + L+ +D+ + + +W Y+ + G + ++R+ M
Sbjct: 373 LGD-----QAPE-QAVATLRDDFVDL-----KGFDAWGELYVARVNREGPVDQDQRRTRM 421
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP YVLRNYL Q AIDAAE GD+ EVRRL ++ +P++EQPGM+ YA+ PP W
Sbjct: 422 HAVNPLYVLRNYLAQKAIDAAESGDYEEVRRLHTVLSKPFEEQPGMDSYAQRPPEWGKH- 480
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 481 --LEISCSS 487
>gi|316983151|ref|NP_001186909.1| selenoprotein O precursor [Pongo abelii]
Length = 669
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 197/446 (44%), Positives = 251/446 (56%), Gaps = 40/446 (8%)
Query: 102 LEDLNWDHSFVRELP------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVA 155
L L +D+ +R LP G S PR V AC+T+V P+ + P+LVA SE
Sbjct: 45 LAGLRFDNRALRALPVEAPPPGPEGAPSAPRPVPGACFTRVQPTP-LRQPRLVALSEPAL 103
Query: 156 DSLELDPKEFERPDFP--LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI 213
L L + LFFSG L GA P A CY GHQF AGQLG+G A+ LGE+
Sbjct: 104 ALLGLGAPPAREAEAEAELFFSGNAILPGAEPAAHCYWGHQFDQLAGQLGEGSAMYLGEV 163
Query: 214 LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTG 273
ERWELQLKGAG TP+SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+
Sbjct: 164 CTATGERWELQLKGAGPTPFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGACVTSE 223
Query: 274 KFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTL 324
V RD+FYDGNPK E +V RVA +F+RFGS++I H R + DI L
Sbjct: 224 STVVRDVFYDGNPKYEQCTVVLRVASTFIRFGSFEIFKSADEHTGRAGPSVGRNDIRVQL 283
Query: 325 ADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQG 384
DY I + I+ + S ++ + AA+ EV RTA +VA+WQ
Sbjct: 284 LDYVISSFYPEIQAAHASNNV----------------QRNAAFFREVTRRTARMVAEWQC 327
Query: 385 VGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNI 444
VGF HGVLNTDNMSILGLTIDYGPFGFLD +DP N +D G RY ++ QP++ WN+
Sbjct: 328 VGFCHGVLNTDNMSILGLTIDYGPFGFLDRYDPDHVCNASDNTG-RYAYSKQPEVCRWNL 386
Query: 445 AQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLN 500
+ + L ++ EA + + + +F Y M +KLGL + + ++SKLL
Sbjct: 387 RKLAEALQPELPLELGEA-ILADEFDAEFQRHYLQKMRRKLGLVQVELEEDGVLVSKLLE 445
Query: 501 NMAVDKVDYTNFFRALSNVKADPSIP 526
M + D+TN F LS+ +P P
Sbjct: 446 TMHLTGADFTNTFYLLSSFPVEPESP 471
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 58/115 (50%), Gaps = 23/115 (20%)
Query: 540 DIGKERKEAWISWVLSYIQELLS--SGISDE-----ERKALMNSVNPKYVLRNYLCQSAI 592
D+ + W W+ +Y L G D ER +M++ NPKYVLRNY+ Q+AI
Sbjct: 546 DLQSRNQGHWADWLQAYRARLDKDLEGARDAAAWQAERVRVMHANNPKYVLRNYIAQNAI 605
Query: 593 DAAELGDFGEVRRLLKLMERPYDEQPG----------------MEKYARLPPAWA 631
+AAE GDF EVRR+LKL+E PY + G Y+ PP WA
Sbjct: 606 EAAERGDFSEVRRVLKLLETPYHCEAGAATDAEATEADGADGRQRSYSSKPPLWA 660
>gi|452124908|ref|ZP_21937492.1| hypothetical protein F783_04955 [Bordetella holmesii F627]
gi|452128315|ref|ZP_21940892.1| hypothetical protein H558_05040 [Bordetella holmesii H558]
gi|451924138|gb|EMD74279.1| hypothetical protein F783_04955 [Bordetella holmesii F627]
gi|451925362|gb|EMD75500.1| hypothetical protein H558_05040 [Bordetella holmesii H558]
Length = 489
Score = 325 bits (833), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 214/518 (41%), Positives = 283/518 (54%), Gaps = 53/518 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT+V P A NP+L+ + A + LDP+ PDF SG PL G A Y
Sbjct: 20 AFYTRVLPQAP-GNPRLLHANADAAALIGLDPEALTTPDFLAVASGQMPLPGGDTLAAVY 78
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LGE+ WELQLKGAG TPYSR DG AVLRSS+RE+
Sbjct: 79 SGHQFGVWAGQLGDGRAHLLGEVAGPNGS-WELQLKGAGLTPYSRMGDGRAVLRSSVREY 137
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAMH LGIPTTRAL LV + V R+ E AIV R++ SF+RFGS++
Sbjct: 138 LASEAMHGLGIPTTRALALVVSDDPVMRE-------TRETAAIVTRMSPSFVRFGSFEHW 190
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+S D ++ L DY I + + D +H V A+ E
Sbjct: 191 SS--HRDPAHLQLLLDYVIDKFYPGCRD-----------ADGEHGAV-------LAFLGE 230
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V+ RTA+L+A WQ VGF HGV+NTDNMSILGLT+DYGPFGF+D F N +D G R
Sbjct: 231 VSRRTANLMADWQSVGFCHGVMNTDNMSILGLTLDYGPFGFMDGFQLDHVCNHSDTQG-R 289
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPK 489
Y + QP + LWN+ + + +L L+ D EA V+ +Y + F + A M K+G+
Sbjct: 290 YAWNRQPSVALWNLYRLAGSL--HMLVPDAEALRAVLGQYESIFTQAFHARMAAKMGVSG 347
Query: 490 Y---NKQIISKLLNNMAVDKVDYTNFFRALSN-VKADPSIPEDELLVPLKAVLLDIGKER 545
+ ++ ++ LL M + D+T FRAL+ V+ +P LLD +R
Sbjct: 348 WQAADEMLLDDLLRLMHDSRADFTLTFRALAQAVRGEP------------GQLLDFFIDR 395
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
+A +W + G + + R M++VNP YVLRN+L + AI AA GD E+ R
Sbjct: 396 -QATQAWWERQVARHAVDGRAAQVRAEAMDAVNPLYVLRNHLAEQAIRAAVQGDASEIER 454
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L+ L+ P+ + YA LPP WA V SCSS
Sbjct: 455 LMGLLRDPFRARADAGGYAALPPDWASDLSV---SCSS 489
>gi|443471166|ref|ZP_21061239.1| Selenoprotein O and cysteine-containing like protein [Pseudomonas
pseudoalcaligenes KF707]
gi|442901069|gb|ELS27068.1| Selenoprotein O and cysteine-containing like protein [Pseudomonas
pseudoalcaligenes KF707]
Length = 486
Score = 325 bits (833), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 219/553 (39%), Positives = 295/553 (53%), Gaps = 75/553 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L+ L +D+ F R GD A T+V P ++ P+LV S + L
Sbjct: 1 MKTLDTLTFDNRFAR--LGD------------AFSTEVLPEP-LDEPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E F FSG + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLDPTEAESTLFAELFSGHKLWSDAQPRAMVYSGHQFGAYNPRLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SE +H LGIPT+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEHLHALGIPTSRALCVIGSSTPVYR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K E GA+V R+A S +RFG ++ + +R E L + L ++ + HF
Sbjct: 166 E-------KRETGAMVMRLAPSHVRFGHFEYFYYTRQHEQLKV---LGEHVLACHFPDCL 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
K F T + ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 AAEKPWLAMFRT---------------------LVERNAELIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +
Sbjct: 255 SILGITFDYGPYAFLDDFDANHICNHSDDSG-RYSFSNQVPIAHWNLAALAQALTPFVEV 313
Query: 458 DDKEANYVMERYGTKFMDEYQA----IMTKKLGLPKY---NKQIISKLLNNMAVDKVDYT 510
DD + E G F+ YQA +M ++LG + ++ ++ LL M VDYT
Sbjct: 314 DD-----LRECLGL-FLPLYQAQWLDLMRRRLGFTQAEDGDEALVQALLKLMQGSAVDYT 367
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEER 570
FFR L + + D + L L+ +D+ + +W Y G + R
Sbjct: 368 QFFRRLGDQEVDAA------LARLREDFIDLA-----GFDAWGAQYKARTAREGEDQDAR 416
Query: 571 KALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAW 630
+A M+ +NP Y+LRNYL Q AI+AAE GD+ EVRRL ++ RP+DEQPGME YA PP W
Sbjct: 417 RARMHGLNPCYILRNYLAQRAIEAAEQGDYEEVRRLHAVLSRPFDEQPGMEAYAERPPEW 476
Query: 631 AYRPGVCMLSCSS 643
+SCSS
Sbjct: 477 GRH---LEISCSS 486
>gi|327273185|ref|XP_003221361.1| PREDICTED: LOW QUALITY PROTEIN: selenoprotein O-like [Anolis
carolinensis]
Length = 680
Score = 325 bits (833), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 196/443 (44%), Positives = 255/443 (57%), Gaps = 36/443 (8%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
L +D+ +R L +P + PR V AC+++V P+ P+LV S +
Sbjct: 55 LRFDNRALRALHLNPSERTCPRPVPGACFSRVRPTP-WRTPRLVTSSAPATSCCWAEGAA 113
Query: 165 F--ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWE 222
E PL+FSG LAGA P A CY GHQFG +AGQLGDG A+ LGE+LN + +RWE
Sbjct: 114 LCGEEGRGPLYFSGNRXLAGAEPAAHCYCGHQFGXFAGQLGDGAALYLGEVLNAEGQRWE 173
Query: 223 LQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFY 282
QL+GAG TP+SR ADG VLRSSIREFLCSEAM LGIPTTRA VT+ V RD+FY
Sbjct: 174 AQLRGAGLTPFSRQADGRKVLRSSIREFLCSEAMFHLGIPTTRAGTCVTSDSEVIRDIFY 233
Query: 283 DGNPKEEPGAIVCRVAQSFLRFGSYQI------HASRGQEDL---DIVRTLADYAIRHHF 333
DGNPK+E +V R+A +F+RFGS++I + R + DI + DY I +
Sbjct: 234 DGNPKKEKCTVVLRIAPTFIRFGSFEIFKPADEYTGRKGPSVNRNDIRIQMLDYVISTFY 293
Query: 334 RHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLN 393
I E HS D + A+ EV RTA +VA+WQ VGF HGVLN
Sbjct: 294 PEIL--------------EAHS--DNKVERNTAFFREVTRRTARMVAEWQCVGFCHGVLN 337
Query: 394 TDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAA 453
TDNMSI+GLTIDYGPFGF+D +DP N +D G RY + QP++ WN+ + + L
Sbjct: 338 TDNMSIVGLTIDYGPFGFMDRYDPEHICNGSDNTG-RYAYNKQPEVCKWNLGKLAEAL-D 395
Query: 454 AKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAVDKVDY 509
+L + + E Y T+F Y IM KKLGL + + +++S L M V D+
Sbjct: 396 PELPLEISIPILEEEYDTEFGKHYLQIMRKKLGLIQLQLADDDKLVSDFLETMQVTGADF 455
Query: 510 TNFFRALSN--VKADPSIPEDEL 530
TN F LS+ V++DP ED L
Sbjct: 456 TNTFHFLSSFPVESDPLKLEDFL 478
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 50/84 (59%), Gaps = 7/84 (8%)
Query: 540 DIGKERKEAWISWVLSY-------IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAI 592
D+ K W W+ Y ++ + + E +MNS NPKY+LRNY+ Q+AI
Sbjct: 547 DLLSRNKGHWKDWLQKYKARLEKDMEHVSNVDTWHAEHVKIMNSNNPKYILRNYIAQNAI 606
Query: 593 DAAELGDFGEVRRLLKLMERPYDE 616
+AAE GDF EV ++LK +E+PY+E
Sbjct: 607 EAAENGDFMEVEKVLKRLEKPYEE 630
>gi|398849651|ref|ZP_10606383.1| hypothetical protein PMI37_00443 [Pseudomonas sp. GM80]
gi|398250550|gb|EJN35863.1| hypothetical protein PMI37_00443 [Pseudomonas sp. GM80]
Length = 487
Score = 325 bits (833), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 214/551 (38%), Positives = 299/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSAHVLPEP-IDNPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E +F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPATAETQEFAELFGGHKLWADAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA++ L IP++RA C++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALYALNIPSSRAACVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHIRFGHFEYFYYTKRPEQQ----KELGEHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ +G WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPVGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++ G +++++ LL M VDYT FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRFGFITAEDDDQKLLEDLLQLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L A+ ++ V L+ +DI + + +W YI + G +D E+R+A
Sbjct: 371 RRLGEESAEQAV------VRLRDDFVDI-----KGFDAWGERYIARVARDGDADQEQRRA 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ +P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAEQGDYAEVRRLHAVLSKPFEEQPGMEGYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|350530695|ref|ZP_08909636.1| hypothetical protein VrotD_06218 [Vibrio rotiferianus DAT722]
Length = 489
Score = 325 bits (832), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 201/520 (38%), Positives = 285/520 (54%), Gaps = 56/520 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T V+P ++N + V W+ A L + F+G A P A Y
Sbjct: 19 AFFTHVAPQP-LDNTRWVVWNGEFAQQFGLPVAA--NDEVLNVFAGQADFAPFAPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L E+ + +++ LKGAG TPYSR DG AVLRS++RE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEMQHQDGTWFDIHLKGAGLTPYSRMGDGRAVLRSTVREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++ + V R+ K E GA++ RVA++ +RFG ++
Sbjct: 136 LCSEAMAGLGIPTTRALGMMDSDTPVYRE-------KMEYGALLIRVAETHIRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q L + LAD I HF E L YAA
Sbjct: 189 FYTNQ--LAEQKLLADKVIEWHF--------PECLK-------------AVKPYAAMFEL 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ++TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD +DP++ N +D G R
Sbjct: 226 IVDKTAVMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYDPNYICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP I LWN++ + +L+ L+ ++ + ++ + ++ +M KLGL
Sbjct: 285 YAFEQQPRIALWNLSALAHSLSP--LVAREDLEMALGKFEVRLSRKFSELMRAKLGLHTK 342
Query: 491 ---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK 546
+ ++ + + +K DY+ FFR LSN+ A PS +AV+ L I +E
Sbjct: 343 VDEDGRLFEAMFELLNQNKTDYSRFFRELSNLDAKPS----------QAVIDLFIDREAA 392
Query: 547 EAWISWVLSYIQ-ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
AW+ L+ + E+ +G ++ ++R M VNPKY+LRNYL Q AID AE GDF EV
Sbjct: 393 SAWVDLYLARCELEVDENGERVTVQQRCERMRQVNPKYILRNYLAQLAIDKAEEGDFSEV 452
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
RL +L++RPYDEQP + YA+LPP W + +SCSS
Sbjct: 453 NRLAELLKRPYDEQPEFDDYAKLPPEWGKK---MEISCSS 489
>gi|229529045|ref|ZP_04418435.1| hypothetical protein VCG_002138 [Vibrio cholerae 12129(1)]
gi|229332819|gb|EEN98305.1| hypothetical protein VCG_002138 [Vibrio cholerae 12129(1)]
Length = 489
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 209/523 (39%), Positives = 279/523 (53%), Gaps = 60/523 (11%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYA 187
A YT V P ++N + W+ +A L E P+ L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNTRLAQQFGLP----EAPNDELLASLSGQHLPADFSPVA 72
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLRSS+
Sbjct: 73 MKYAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLRSSL 132
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG +
Sbjct: 133 REYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHF 185
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q ++ LAD I HF E TS YAAW
Sbjct: 186 EHFFYTDQH--ANLKLLADKVIEWHFPDCEQ---------------------TSKPYAAW 222
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 223 FSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQ 282
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + LID + + Y + +M KLGL
Sbjct: 283 G-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSECLNLHFSRLMRAKLGL 339
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGK 543
+ ++ + +A + DYT F R LS + + +AV+ L + +
Sbjct: 340 ATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLVLDR 389
Query: 544 ERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF
Sbjct: 390 EAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERGDF 449
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 450 EEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|410617300|ref|ZP_11328271.1| hypothetical protein GPLA_1495 [Glaciecola polaris LMG 21857]
gi|410163137|dbj|GAC32409.1| hypothetical protein GPLA_1495 [Glaciecola polaris LMG 21857]
Length = 480
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 197/505 (39%), Positives = 278/505 (55%), Gaps = 50/505 (9%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
V NP+LV + ++ D+L+L F + T +AQ YGGHQFG W
Sbjct: 23 VANPKLVEVNHTLRDALQLPASGFTQSSIMSMLFDNTSSFTKHSFAQKYGGHQFGGWNPD 82
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGR + LG++ + +RW+L LKGAG TPYSRFADG AVLRS++RE+L SEA+H +GI
Sbjct: 83 LGDGRGLLLGDVKDKNGQRWDLHLKGAGPTPYSRFADGRAVLRSTLREYLASEALHHMGI 142
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
PT+RALCL+T+ + V R+ K+E A++ RV+QS +RFG ++ GQ LD +
Sbjct: 143 PTSRALCLITSDEPVYRE-------KQERAAMMIRVSQSHIRFGHFEYFYHSGQ--LDKL 193
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
L Y + HHF+ N N + A + TASL+A+
Sbjct: 194 EKLFAYCLEHHFKSCAN---------------------AKNPHLAMLERIVLDTASLIAK 232
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
WQ GF HGV+NTDNMSI G+T D+GP+ FLD FDP F N +D G RY F QP IGL
Sbjct: 233 WQAFGFNHGVMNTDNMSIHGITFDFGPYAFLDDFDPKFVCNHSDHQG-RYAFEEQPGIGL 291
Query: 442 WNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKL 498
WN+ + A + +E + Y ++ + E+ +M +KLGL ++++
Sbjct: 292 WNLNALAH--AFTPYLSIEEIKLALGNYESQLLSEFSQLMHQKLGLFTPSPSTAELVNGW 349
Query: 499 LNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQ 558
L+ ++ DK DY FR L + P+ L++ +R EA SW+ Y Q
Sbjct: 350 LDLVSQDKRDYHISFRLLCEINEHQLNPQ----------LVNHFIQR-EAAQSWLSQYQQ 398
Query: 559 ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQP 618
LL G+ EER+ M VNP+YVLRNY Q AIDAAE GDF + L ++++P++ +P
Sbjct: 399 TLLEQGVPVEERQNRMRQVNPEYVLRNYQAQLAIDAAEDGDFSRFKTFLHVLQQPFESKP 458
Query: 619 GMEKYARLPPAWAYRPGVCMLSCSS 643
+A+ PP W +SCSS
Sbjct: 459 EYADFAKPPPDWGKH---MEISCSS 480
>gi|54309205|ref|YP_130225.1| hypothetical protein PBPRA2020 [Photobacterium profundum SS9]
gi|46913637|emb|CAG20423.1| hypothetical protein PBPRA2020 [Photobacterium profundum SS9]
Length = 522
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 215/557 (38%), Positives = 300/557 (53%), Gaps = 51/557 (9%)
Query: 97 KKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVAD 156
+ +K L L +++++ ELP T IP+ + +P LV+ + VA+
Sbjct: 7 QSMKTLSQLVFNNTY-SELPTTFGTAVIPQPL--------------SDPFLVSVNPQVAE 51
Query: 157 SLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNL 216
LELDP E + F F+G LAG P A Y GHQFG + LGDGR + LGE+L
Sbjct: 52 MLELDPLEAKTRLFINSFTGNKELAGTAPLAMKYTGHQFGHYNPDLGDGRGLLLGEVLTS 111
Query: 217 KSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFV 276
+ +W++ LKG+GKTPYSR DG AVLRSSIRE+L S A++ LGI TT AL L+ + V
Sbjct: 112 TNAKWDIHLKGSGKTPYSRQGDGRAVLRSSIREYLGSAALNGLGIKTTHALALLGSTTLV 171
Query: 277 TRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRH- 335
+R+ K E GA + RVA+S LRFG ++ Q ++ LADY I+HHF
Sbjct: 172 SRE-------KMERGATLIRVAESHLRFGHFEYLFYTHQH--SELKLLADYLIKHHFPDL 222
Query: 336 IENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTD 395
+ ++ E ++ ++ H++ YA+ + E TA L+A WQ VGF HGV+NTD
Sbjct: 223 LTTESEQEDKQTASPNQHHNI-------YASMLTRIVELTAQLIAGWQSVGFAHGVMNTD 275
Query: 396 NMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAK 455
NMS+LGLT DYGPFGFLD ++P + N +D G RY F QP I LWN++ L
Sbjct: 276 NMSVLGLTFDYGPFGFLDDYNPDYICNHSDYSG-RYAFNQQPSIALWNLSALGYALTP-- 332
Query: 456 LIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNF 512
LID ++ + ++ RY +Y A M KLGL + ++ + S L + VDYT F
Sbjct: 333 LIDKEDVDAILNRYHLTLQRDYSARMRNKLGLIEKREEDTVLFSSLFELLQSQMVDYTLF 392
Query: 513 FRALSNVKA-DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-----LSSGIS 566
FR LS++ A D S+ +P D + W+ +Y L S
Sbjct: 393 FRTLSSISATDLSVTS----LPNSIERFDDLFTCTQPLEKWLKAYAVRLSFENDTSEKNG 448
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARL 626
D R M NPKY+LRNYL Q AID AE GDF + LL+++ P+DE ++A
Sbjct: 449 DTLRLTQMKLHNPKYILRNYLAQQAIDKAEDGDFTMIDELLQVLSSPFDEHLEFNQFADK 508
Query: 627 PPAWAYRPGVCMLSCSS 643
PP W + +SCSS
Sbjct: 509 PPYWGKK---LEISCSS 522
>gi|218708872|ref|YP_002416493.1| hypothetical protein VS_0872 [Vibrio splendidus LGP32]
gi|254807253|sp|B7VL54.1|Y872_VIBSL RecName: Full=UPF0061 protein VS_0872
gi|218321891|emb|CAV17878.1| Hypothetical protein VS_0872 [Vibrio splendidus LGP32]
Length = 485
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 206/528 (39%), Positives = 287/528 (54%), Gaps = 58/528 (10%)
Query: 120 RTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATP 179
R ++PR YT + P+ + N Q +AW+ ++A+ L E + SG
Sbjct: 12 RFTALPR----LFYTPIQPTP-LNNVQWLAWNHNLANELGFPSFECTSEELLETLSGNVE 66
Query: 180 LAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADG 239
P A Y GHQFG + LGDGR + L +++ E ++L LKGAGKTPYSR DG
Sbjct: 67 PEQFSPVAMKYAGHQFGSYNPDLGDGRGLLLAQVVAKSGETFDLHLKGAGKTPYSRMGDG 126
Query: 240 LAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQ 299
AV+RS++RE+LCSEAM L IPTTRAL ++T+ V R+ K+E GA++ R A+
Sbjct: 127 RAVIRSTVREYLCSEAMAGLNIPTTRALAMMTSDTPVYRE-------KQEWGALLVRAAE 179
Query: 300 SFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDL 359
S +RFG ++ Q L + LAD I HF E L D+D
Sbjct: 180 SHIRFGHFEHLFYTNQ--LAEHKLLADKVIEWHF--------PECL-----DDD------ 218
Query: 360 TSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSF 419
YAA ++ +RTA +VA WQ GF HGV+NTDNMSI+G T DYGPF FLD +DP
Sbjct: 219 --KPYAAMFNQIVDRTAEMVALWQANGFAHGVMNTDNMSIIGQTFDYGPFAFLDEYDPRL 276
Query: 420 TPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQA 479
N +D G RY F QP IGLWN++ + +L + L+D + +E+Y + +
Sbjct: 277 ICNHSDYQG-RYAFNQQPRIGLWNLSALAHSL--SPLVDKADLEAALEQYEPQMNGYFSQ 333
Query: 480 IMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA 536
+M +KLGL ++ ++ + M+ +KVDY FFR LSN+ D +P+D
Sbjct: 334 LMRRKLGLLSKHEGDSRLFESMFELMSQNKVDYPRFFRTLSNL--DTLLPQD-------- 383
Query: 537 VLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
++D+ +R+ A + WV +Y+Q S ER M VNPKY+LRNYL Q AID AE
Sbjct: 384 -VIDLVIDREAAKL-WVDNYLQRCELEESSVAERCEKMRQVNPKYILRNYLAQLAIDKAE 441
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
GD ++ L+ ++ PY E P E A LPP W G M +SCSS
Sbjct: 442 RGDSSDIDALMVVLADPYAEHPDYEHLAALPPEW----GKAMEISCSS 485
>gi|285026514|ref|NP_001038336.2| selenoprotein O [Danio rerio]
gi|172046215|sp|Q1LVN8.2|SELO_DANRE RecName: Full=Selenoprotein O; Short=SelO
Length = 692
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 195/458 (42%), Positives = 268/458 (58%), Gaps = 50/458 (10%)
Query: 89 GGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLV 148
G D+ ++ +LE L +D+ +++LP DP T+ R+V +C+++V P+ ++NP+ V
Sbjct: 28 GMDDMGVSLSRSSLERLEFDNVALKKLPLDPSTEPGVRQVRGSCFSRVQPTP-LKNPEFV 86
Query: 149 AWSESVADSLELDPKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRA 207
A S L LD +E + P P + SG+ + G+ P A CY GHQFG +AGQLGDG A
Sbjct: 87 AVSAPALALLGLDAEEVLKDPLGPEYLSGSKVMPGSEPAAHCYCGHQFGQFAGQLGDGAA 146
Query: 208 ITLGEILNLKSE-----------RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAM 256
LGE+ + RWE+Q+KGAG TPYSR ADG VLRSSIREFLCSEA+
Sbjct: 147 CYLGEVKAPAGQSPELLRENPTGRWEIQVKGAGLTPYSRQADGRKVLRSSIREFLCSEAV 206
Query: 257 HFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA----- 311
LG+PTTRA +VT+ V RD+FYDGNP+ E ++V R+A SF+RFGS++I
Sbjct: 207 FALGVPTTRAGSVVTSDSRVMRDIFYDGNPRMERCSVVLRIAPSFIRFGSFEIFKRADEF 266
Query: 312 ------SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
S G ++L + +Y I + + I + DLT +
Sbjct: 267 TGRQGPSYGHDELRT--QMLEYVIENFYPEIH----------------RNYPDLT-ERNT 307
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
A+ EV RTA LVAQWQ VGF HGVLNTDNMSILGLT+DYGPFGF+D FDP F N +D
Sbjct: 308 AFFKEVTVRTARLVAQWQCVGFCHGVLNTDNMSILGLTLDYGPFGFMDRFDPDFICNASD 367
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY + QP I WN+A+ + L L D+ A V++ Y + D Y + M KKL
Sbjct: 368 NSG-RYSYQAQPAICRWNLARLAEAL-EPDLPPDR-AEQVLDEYLPLYNDFYLSNMRKKL 424
Query: 486 GLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSNV 519
GL + ++ +I++L+ M D+TN FR+LS +
Sbjct: 425 GLLRKEEPEDEMLITELMQTMHNTGADFTNTFRSLSQI 462
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 63/113 (55%), Gaps = 17/113 (15%)
Query: 538 LLDIGKER-----KEAWISWVLSYIQEL---LSSGIS----DEERKALMNSVNPKYVLRN 585
L+D +E+ E W W+ Y Q L SG+ ER +MN+ NP VLRN
Sbjct: 543 LMDTTEEQLRVKHTEHWSDWIQKYRQRLARECESGVDVKDVQTERVRVMNNNNPHVVLRN 602
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM 638
Y+ Q+AI AAE GDF EV+R+LK++E+P+ Q G+E+ P W R G +
Sbjct: 603 YIAQNAIAAAENGDFSEVQRVLKVLEKPFSVQEGLEQ-----PGWMGRGGAAI 650
>gi|336124559|ref|YP_004566607.1| hypothetical protein VAA_02308 [Vibrio anguillarum 775]
gi|335342282|gb|AEH33565.1| Hypothetical cytosolic protein [Vibrio anguillarum 775]
Length = 489
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 209/524 (39%), Positives = 288/524 (54%), Gaps = 68/524 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQCY 190
Y++V+P A ++N + VAW+ S+A L L + P+ L SG A P A Y
Sbjct: 21 YSQVNP-APLDNVRWVAWNASLAGDLSLPTQ----PNDELLHSLSGQVIPAQFKPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L EI + E ++L LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 76 AGHQFGIYNPDLGDGRGLLLVEIESKTGEVYDLHLKGAGLTPYSRMGDGRAVLRSTIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGI TTRAL ++++ V R+ K+E GA++ RVAQS +RFG ++
Sbjct: 136 LCSEAMAGLGIATTRALAMMSSDTPVYRE-------KQERGALLVRVAQSHIRFGHFEHF 188
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNK-YAAWAV 369
Q L + LAD I H+ LT K YAA
Sbjct: 189 FYTNQ--LAEQKQLADKVIEWHYPDC----------------------LTKEKPYAAMFS 224
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
+ ERTA ++A WQ VGF HGV+NTDNMSILG T DYGPF FLD ++P++ N +D G
Sbjct: 225 HIVERTAKMIADWQAVGFAHGVMNTDNMSILGQTFDYGPFAFLDDYEPTYIGNHSDYQG- 283
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG--- 486
RY F QP + LWN++ + L + L++ + + ++ + + M KLG
Sbjct: 284 RYAFDQQPRVALWNLSALAHAL--SPLVERSDLEAALAQFEAQLGRYFSQQMRCKLGVLA 341
Query: 487 -LPKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
LP + + ++ + + DYT FFR LSN+ + P +LD+ +R
Sbjct: 342 SLPG-DSVLFEQMFELLTKNHTDYTRFFRQLSNLDREGEQP-----------VLDLFIDR 389
Query: 546 KEAWISWVLSYI----QELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
A SW+ Y+ +E+ SSG IS E+R A M VNPKY+LRNYL Q AID AE GD
Sbjct: 390 AAAQ-SWLEQYLARCEREIDSSGDAISIEQRCAEMRKVNPKYILRNYLAQQAIDKAEQGD 448
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ +V +L +L+ PY EQP +A+LPP W + +SCSS
Sbjct: 449 YQQVHQLAQLLANPYAEQPEKSHFAQLPPEWGKK---MEISCSS 489
>gi|395819536|ref|XP_003783138.1| PREDICTED: selenoprotein O-like [Otolemur garnettii]
Length = 630
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 189/431 (43%), Positives = 249/431 (57%), Gaps = 38/431 (8%)
Query: 125 PREVLHACYTKVSPSAEVENPQLVAWSESVADSL-----ELDPKEFERPDFPLFFSGATP 179
PR V AC+++V P A + P+LVA SE L + LFFSG
Sbjct: 37 PRPVPGACFSRVRP-APLREPRLVALSEPALALLGLAAPSAVATREAEAEAALFFSGNAL 95
Query: 180 LAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADG 239
L GA P A CY GHQFG +AGQLGDG A+ LGE+ ERWELQLKGAG TP+SR ADG
Sbjct: 96 LPGAEPAAHCYCGHQFGQFAGQLGDGAAMYLGEVCTAAGERWELQLKGAGPTPFSRQADG 155
Query: 240 LAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQ 299
VLRSSIREFLCSEAM LG+PTTRA VT+ V RD+FYDGNPK E +V R+A
Sbjct: 156 RKVLRSSIREFLCSEAMFHLGVPTTRAGACVTSESTVVRDVFYDGNPKYEKCTVVLRIAS 215
Query: 300 SFLRFGSYQI------HASRGQEDL---DIVRTLADYAIRHHFRHIENMNKSESLSFSTG 350
+FLRFGS++I H R + DI + DYA+ + I+ + S+S+
Sbjct: 216 TFLRFGSFEIFKPTDEHTGRAGPSVGRNDIRVQMLDYAVSSFYPDIQAAHASDSV----- 270
Query: 351 DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFG 410
+ AA+ EV RTA +VA+WQ VGF HGVLNTDNMSI+GLT+DYGPFG
Sbjct: 271 -----------QRNAAFFREVTRRTARMVAEWQCVGFCHGVLNTDNMSIVGLTLDYGPFG 319
Query: 411 FLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYG 470
FLD +DP N +D G RY ++ QP++ WN+ + + L ++ E + V E +
Sbjct: 320 FLDRYDPDHVCNASDTAG-RYAYSKQPEVCKWNLQKLAEALEPELPLELGE-SIVAEEFD 377
Query: 471 TKFMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
+F Y M +KLGL ++ ++++LL M + D+TN F +LS+ + P
Sbjct: 378 AEFQRHYLQKMRRKLGLVGMEQEEDVALVARLLETMHLTGADFTNTFYSLSSFPTERESP 437
Query: 527 E-DELLVPLKA 536
+ +E L L A
Sbjct: 438 DLEEFLAVLTA 448
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 56/112 (50%), Gaps = 21/112 (18%)
Query: 541 IGKERKEAWISWVLSYIQELLSSGISDEERKA-------LMNSVNPKYVLRNYLCQSAID 593
+ + +E W W+ Y L E+ A +M + NPKYVLRNY+ Q+AI+
Sbjct: 513 LQSQNREHWAGWLQQYRARLDKDMEYVEDMAAWQAEHIRVMRANNPKYVLRNYIAQTAIE 572
Query: 594 AAELGDFGEVRRLLKLMERPYDEQPGME--------------KYARLPPAWA 631
AAE GDF EV+R+LKL+E PYD G Y+ PP WA
Sbjct: 573 AAEGGDFSEVQRVLKLLETPYDNGGGAAAEPKDGSRAASRRPSYSSKPPLWA 624
>gi|113867529|ref|YP_726018.1| hypothetical protein H16_A1518 [Ralstonia eutropha H16]
gi|113526305|emb|CAJ92650.1| uncharacterized conserved protein [Ralstonia eutropha H16]
Length = 523
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 210/534 (39%), Positives = 285/534 (53%), Gaps = 72/534 (13%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T++ P+ + P LV + + A L D + DF F G A P A Y G
Sbjct: 39 FTRLRPT-PLPAPYLVGVAPAAAALLGWDAGIGSQQDFIETFIGNQVPDWADPLATVYSG 97
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG+WAGQLGDGRAI L + + WE+QLKGAG TPYSR ADG AVLRSSIRE+LC
Sbjct: 98 HQFGVWAGQLGDGRAIRLAQA-ETATGPWEIQLKGAGLTPYSRMADGRAVLRSSIREYLC 156
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LG+PTTRAL ++ + V R+ E A+V R++ +F+RFG ++ A+
Sbjct: 157 SEAMAALGVPTTRALSIMGSDAPVRRETI-------ETSAVVTRLSPTFIRFGHFEHFAA 209
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
+D+ +R LAD+ I + + + Y A EV+
Sbjct: 210 --HDDVAALRKLADFVIDNFMPACRD---------------------DAQPYQALLREVS 246
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA L+A WQ VGF HGV+NTDNMSILGLTIDYGPFGFLDAFD + N +D G RY
Sbjct: 247 LRTADLIAHWQAVGFCHGVMNTDNMSILGLTIDYGPFGFLDAFDANHICNHSDTQG-RYA 305
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLID---DKEA-------------NYVMERYGTKFMDE 476
++ QP + WN+ + L L D+E + +RY F
Sbjct: 306 YSQQPQVAFWNLHCLAQALLPLWLPPEDADQEGARDAAVAAARAALDPFRDRYAAAFFRH 365
Query: 477 YQAIMTKKLGL-------PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDE 529
Y+A KLGL K ++ +++ L + +VDYT F+R L + + + +
Sbjct: 366 YRA----KLGLRPPAGGDDKSDEPLLTSLFQLLHGQRVDYTLFWRKLCGISSTDASRD-- 419
Query: 530 LLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQ 589
P++ + LD + A+ +WV Y L + D R+ M +VNPKYVLRN+L +
Sbjct: 420 --APVRDLFLD-----RAAFDAWVADYRVRLRAEQSHDAARELEMLAVNPKYVLRNHLAE 472
Query: 590 SAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+AI A DF EV RLL ++ RP+DEQP E YA LPP WA +SCSS
Sbjct: 473 TAIRHAREKDFTEVDRLLAVLSRPFDEQPEAEHYAALPPDWA---SGLEVSCSS 523
>gi|395647847|ref|ZP_10435697.1| hypothetical protein Pext1s1_04713 [Pseudomonas extremaustralis
14-3 substr. 14-3b]
Length = 487
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 211/551 (38%), Positives = 301/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-LDEPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L+P E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLEPAVAETPVFAELFGGHKLWAEAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSNTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E GA+V R+A S +RFG ++ + + ++ + LA++ ++ H+
Sbjct: 166 E-------KQERGAMVLRLANSHIRFGHFEYFYYTKKPEQQAE----LAEHVLKLHYPEC 214
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ Y A E+ ER A ++A+WQ GF HGV+NTDN
Sbjct: 215 REQPEP---------------------YLAMFREIVERNAEMIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDHDG-RYSFSNQVPIGQWNLSALAQALTPFIS 312
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
+D + + Y + Y +M ++LGL +++++ +LL M VDYT FF
Sbjct: 313 VDALKETLGL--YLPLYQAHYLDLMRRRLGLTTAEDDDQKLVERLLKLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDE-ERKA 572
R L + A ++ L+ +D + + +W Y + G E +R+A
Sbjct: 371 RRLGDEPAALAVTR------LRDDFVD-----RAGFDAWAELYTARIARDGDDTEAQRRA 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ +P++EQ GME+YA+ PP W
Sbjct: 420 RMHAVNPLYILRNYLAQNAITAAESGDYSEVRRLHEVLSKPFEEQAGMEQYAQRPPDWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|262404283|ref|ZP_06080838.1| UPF0061 domain-containing protein [Vibrio sp. RC586]
gi|262349315|gb|EEY98453.1| UPF0061 domain-containing protein [Vibrio sp. RC586]
Length = 489
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 204/522 (39%), Positives = 278/522 (53%), Gaps = 64/522 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP-----LFFSGATPLAGAVPYA 187
YT + P ++N + W+ ++A +EF P+ P SG A P A
Sbjct: 21 YTLIRP-LPLQNVRWGMWNAALA-------QEFGLPEMPNDELLASLSGQRLSANFAPLA 72
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLRSSI
Sbjct: 73 MKYAGHQFGVYNPDLGDGRGLLLAEMATKRGEVFDIHLKGAGLTPYSRMGDGRAVLRSSI 132
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+AQ+ +RFG +
Sbjct: 133 REYLCSEAMAGLGIATTRALALMSSDTPVYRE-------REERGALLVRLAQTHVRFGHF 185
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q L ++ L D I HF D + SV YA W
Sbjct: 186 EHFYYTDQ--LAELKLLVDKIIEWHF----------------PDCNQSV-----KPYANW 222
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP F N +D
Sbjct: 223 FQQVVERTALMIAQWQVYGFNHGVMNTDNMSILGQTFDYGPFAFLDDYDPHFICNHSDYQ 282
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + +I+ + +E Y + +M KLGL
Sbjct: 283 G-RYAFDQQPRIGLWNLSALAHAL--SPIIEKMDLEIALESYSDHLNLHFSRLMRAKLGL 339
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ ++ + +A + DYT F R LS + + L + L + ++
Sbjct: 340 TTQQEGDGELFADFFALLANNHTDYTRFLRELSCL---------DRLGTEAVIDLVVDRQ 390
Query: 545 RKEAWISWVLSYIQELL---SSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+AW++ L L S IS ER M VNPKY+LRNYL Q AI+ AE GDF
Sbjct: 391 AAKAWLTRYLERAARELGQDSQPISQVERCQAMRQVNPKYILRNYLAQQAIELAERGDFQ 450
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E++RL +++ PYDE P E YA+LPP W + +SCSS
Sbjct: 451 EMQRLAQVLATPYDEHPEFEHYAKLPPEWGKK---LEISCSS 489
>gi|121957848|sp|Q6LQK3.2|Y2020_PHOPR RecName: Full=UPF0061 protein PBPRA2020
Length = 514
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 215/555 (38%), Positives = 299/555 (53%), Gaps = 51/555 (9%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L L +++++ ELP T IP+ + +P LV+ + VA+ L
Sbjct: 1 MKTLSQLVFNNTY-SELPTTFGTAVIPQPL--------------SDPFLVSVNPQVAEML 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
ELDP E + F F+G LAG P A Y GHQFG + LGDGR + LGE+L +
Sbjct: 46 ELDPLEAKTRLFINSFTGNKELAGTAPLAMKYTGHQFGHYNPDLGDGRGLLLGEVLTSTN 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+W++ LKG+GKTPYSR DG AVLRSSIRE+L S A++ LGI TT AL L+ + V+R
Sbjct: 106 AKWDIHLKGSGKTPYSRQGDGRAVLRSSIREYLGSAALNGLGIKTTHALALLGSTTLVSR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRH-IE 337
+ K E GA + RVA+S LRFG ++ Q ++ LADY I+HHF +
Sbjct: 166 E-------KMERGATLIRVAESHLRFGHFEYLFYTHQH--SELKLLADYLIKHHFPDLLT 216
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
++ E ++ ++ H++ YA+ + E TA L+A WQ VGF HGV+NTDNM
Sbjct: 217 TESEQEDKQTASPNQHHNI-------YASMLTRIVELTAQLIAGWQSVGFAHGVMNTDNM 269
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
S+LGLT DYGPFGFLD ++P + N +D G RY F QP I LWN++ L LI
Sbjct: 270 SVLGLTFDYGPFGFLDDYNPDYICNHSDYSG-RYAFNQQPSIALWNLSALGYALTP--LI 326
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFR 514
D ++ + ++ RY +Y A M KLGL + ++ + S L + VDYT FFR
Sbjct: 327 DKEDVDAILNRYHLTLQRDYSARMRNKLGLIEKREEDTVLFSSLFELLQSQMVDYTLFFR 386
Query: 515 ALSNVKA-DPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-----LSSGISDE 568
LS++ A D S+ +P D + W+ +Y L S D
Sbjct: 387 TLSSISATDLSVTS----LPNSIERFDDLFTCTQPLEKWLKAYAVRLSFENDTSEKNGDT 442
Query: 569 ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPP 628
R M NPKY+LRNYL Q AID AE GDF + LL+++ P+DE ++A PP
Sbjct: 443 LRLTQMKLHNPKYILRNYLAQQAIDKAEDGDFTMIDELLQVLSSPFDEHLEFNQFADKPP 502
Query: 629 AWAYRPGVCMLSCSS 643
W + +SCSS
Sbjct: 503 YWGKK---LEISCSS 514
>gi|417825150|ref|ZP_12471738.1| hypothetical protein VCHE48_3095 [Vibrio cholerae HE48]
gi|340046635|gb|EGR07565.1| hypothetical protein VCHE48_3095 [Vibrio cholerae HE48]
Length = 489
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 206/521 (39%), Positives = 279/521 (53%), Gaps = 56/521 (10%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQC 189
A YT V P ++N + W+ +A L E + L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNSRLAQQFGL--PEAPNDELLLSLSGQHLPADFSPVAMK 74
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLRSS+RE
Sbjct: 75 YAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLRSSLRE 134
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG ++
Sbjct: 135 YLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHFEH 187
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
Q ++ LAD I +F TS YAAW
Sbjct: 188 FFYTDQH--ANLKLLADKVIEWYFPDCVQ---------------------TSKPYAAWFS 224
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D G
Sbjct: 225 QVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQG- 283
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 284 RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGLAT 341
Query: 490 YNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKER 545
+ ++ + +A + DYT+F R LS + + +AV+ L + +E
Sbjct: 342 QQEGDGELFADFFALLANNHTDYTSFLRELSCLDRQGN----------EAVIDLVLDREA 391
Query: 546 KEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF E
Sbjct: 392 AKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERGDFEE 451
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 452 MQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|261365768|ref|ZP_05978651.1| SelO family protein [Neisseria mucosa ATCC 25996]
gi|288565671|gb|EFC87231.1| SelO family protein [Neisseria mucosa ATCC 25996]
Length = 498
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 208/524 (39%), Positives = 276/524 (52%), Gaps = 57/524 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y++VSP + P VA++ +A L LD +F+ + SG P P A Y G
Sbjct: 19 YSRVSPEP-LTAPYWVAFNTDLAAELNLD-TDFQTTSNLAYLSGNAPQYAPAPIASVYSG 76
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ +LGDGRA+ +G+ ++ +R E QLKGAGKTPYSRFADG AVLRSSIRE+LC
Sbjct: 77 HQFGVYTPRLGDGRALLIGDSVDTAGQRQEWQLKGAGKTPYSRFADGRAVLRSSIREYLC 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTT AL L + V R+ E A++ R+A SFLRFG ++
Sbjct: 137 SEAMHGLGIPTTHALALCGSDDPVYRETV-------ETAAVLTRIAPSFLRFGHFEYFYY 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
G+E +R LADY IRH++ ++ T N YAA ++
Sbjct: 190 TGRE--AEIRQLADYLIRHYYPDCQD---------------------TDNPYAALLEQIR 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDP---------SFTPNT 423
RTA VA WQ VGF HGV+NTDNMS LGLTIDYGPFGFLD + P N
Sbjct: 227 NRTADTVAAWQSVGFCHGVMNTDNMSALGLTIDYGPFGFLDDYGPFGFLDDYDRRHVCNH 286
Query: 424 TDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTK 483
+D G RY + QP + WN A ++ A L+ +++ + F Y M +
Sbjct: 287 SDTQG-RYAYNAQPFVAHWNFAALASCFDA--LVPHDTLEQLIDGWTEVFQTTYLEKMRR 343
Query: 484 KLGLPKYNKQ----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL 539
KLGL + +K+ +I+ L + K D+T FFR LS V S E L P
Sbjct: 344 KLGLQQADKRDDESLIADLFAALQDQKTDFTLFFRNLSEV----SNTHGEPLPPALEQTF 399
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
G ++I W+ Y Q L + ER MN NP Y+LRNYL + AI A G+
Sbjct: 400 KNGV--PPSFIRWLGRYRQRLRAENSDPAERAIRMNRTNPLYILRNYLAEQAIAQARNGN 457
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ E+ RL + + RP+DEQ A PP + VC+ SCSS
Sbjct: 458 YREIERLRRCLARPFDEQAEFADLAEPPPEGSI--PVCV-SCSS 498
>gi|422307881|ref|ZP_16395035.1| hypothetical protein VCCP1035_2438 [Vibrio cholerae CP1035(8)]
gi|408618833|gb|EKK91891.1| hypothetical protein VCCP1035_2438 [Vibrio cholerae CP1035(8)]
Length = 489
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 207/521 (39%), Positives = 277/521 (53%), Gaps = 56/521 (10%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQC 189
A YT V P ++N + W+ +A L E + L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNSRLAQQFGL--PEAPNDELLLSLSGQHLPADFSPVAMK 74
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLRSS+RE
Sbjct: 75 YAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLRSSLRE 134
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
+LCSEAM LGI TTRAL L+++ V R+ EE GA++ R+A + +RFG ++
Sbjct: 135 YLCSEAMAGLGIATTRALALMSSETPVYREC-------EERGALLVRLAHTHVRFGHFEH 187
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
Q ++ LAD I HF TS YAAW
Sbjct: 188 FFYTDQ--YANLKLLADKVIEWHFPDCVQ---------------------TSKPYAAWFS 224
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D G
Sbjct: 225 QVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQG- 283
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 284 RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGLAT 341
Query: 490 YNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKER 545
+ ++ + +A + DYT F R LS + + +AV+ L + +E
Sbjct: 342 QQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLVLDREA 391
Query: 546 KEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF E
Sbjct: 392 AKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERGDFEE 451
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 452 MQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|118602378|ref|YP_903593.1| hypothetical protein Rmag_0346 [Candidatus Ruthia magnifica str. Cm
(Calyptogena magnifica)]
gi|118567317|gb|ABL02122.1| protein of unknown function UPF0061 [Candidatus Ruthia magnifica
str. Cm (Calyptogena magnifica)]
Length = 457
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 202/512 (39%), Positives = 283/512 (55%), Gaps = 78/512 (15%)
Query: 139 SAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMW 198
+ ++ P L+ ++++ D L+L K+ E + SG P A Y G+QFG +
Sbjct: 17 TQSLKQPFLIHKNQALQDRLKLSIKDNELLNIA---SGKNKFQCMQPIASIYAGYQFGHF 73
Query: 199 AGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHF 258
QLGDGR+ +G++ L EL LKGAG+TPYSR ADG AVLRSSIRE+LCS AM
Sbjct: 74 VPQLGDGRSCLIGQVQGL-----ELSLKGAGQTPYSRGADGRAVLRSSIREYLCSIAMKG 128
Query: 259 LGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDL 318
L IPTT AL LV + V R+ E GAIV R A S +RFG +++ A RGQ +
Sbjct: 129 LNIPTTEALTLVGSHSEVYRENI-------ETGAIVMRCAPSHIRFGHFELFAVRGQ--I 179
Query: 319 DIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASL 378
VR LAD+ I HH+++ + N+Y + EV ++TA +
Sbjct: 180 SQVRQLADFVIEHHYQYCQG----------------------ENQYIDFFNEVVQKTAIM 217
Query: 379 VAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPD 438
+A WQ GF HGV+NTDNMSILGLTIDYGPFGFL+ ++P F N +D G RY F QP+
Sbjct: 218 IAHWQAQGFVHGVMNTDNMSILGLTIDYGPFGFLETYNPKFICNHSDHEG-RYSFDQQPN 276
Query: 439 IGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---II 495
I LWN+++ + +L++ LI+ K+A V+++Y ++ Y +M +K GL + +KQ +I
Sbjct: 277 IALWNLSRLADSLSS--LINTKQAKLVLDKYQNYLVESYSVLMRQKFGLHEKDKQDHVLI 334
Query: 496 SKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLS 555
++ + + K D TN R LSNV D+L A+ D WI
Sbjct: 335 TQFFDMLYQHKKDRTNSLRQLSNV--------DKL-----AINTDFND-----WI----- 371
Query: 556 YIQELLSSGISDE---ERKALMNSVNPKYVLRNYLCQSAIDAAELG-DFGEVRRLLKLME 611
EL +S E R ++MNSVNP Y+LRNYL + AI AE D+ E+ L L+
Sbjct: 372 ---ELYDKRVSQENNRNRISMMNSVNPNYILRNYLAEVAIRKAEDDKDYTEIEILFDLLS 428
Query: 612 RPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+P++ ME Y P WA G+ +SCSS
Sbjct: 429 KPFEVHQDMEFYTYEAPDWA--QGLT-VSCSS 457
>gi|387891678|ref|YP_006321975.1| hypothetical protein PflA506_0436 [Pseudomonas fluorescens A506]
gi|387159431|gb|AFJ54630.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
fluorescens A506]
Length = 487
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 214/552 (38%), Positives = 299/552 (54%), Gaps = 72/552 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S++ L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-LDEPRLVVASKAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVAETSVFAELFGGHKLWAEAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+ W+L LKGAG TPYSR DG AVLRSSIREFL SEA+H LGIPT+RALC++ + V R
Sbjct: 106 KHWDLHLKGAGMTPYSRMGDGRAVLRSSIREFLASEALHALGIPTSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E GA+V R+A S +RFG ++ + + ++ ++ H+
Sbjct: 166 E-------KQERGAMVLRLAHSHIRFGHFEYFYYTKKPEQQAELA------------EHV 206
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
N++ E Y A E+ ER A ++A+WQ GF HGV+NTDN
Sbjct: 207 LNLHYPECRE-------------QPEPYLAMFREIVERNAEMIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDHEG-RYSFSNQVPIGQWNLSALAQALTPFIS 312
Query: 457 IDD-KEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNF 512
+D KEA + Y + Y +M ++LGL ++Q++ +LL M VDYT F
Sbjct: 313 VDALKEA---LGLYLPLYQAHYLDLMRRRLGLTTAEDADQQLVERLLKLMQNSGVDYTLF 369
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERK 571
R L + A ++ L+ +D+ + +W Y + G + E+R+
Sbjct: 370 LRHLGDEPAALAVAR------LRDDFVDLA-----GFDAWAEHYKARVARDGDYTQEQRR 418
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ RP++EQ GME+YA+ PP W
Sbjct: 419 ERMHAVNPLYILRNYLAQNAIAAAESGDYSEVRRLHEVLTRPFEEQAGMEQYAQRPPDWG 478
Query: 632 YRPGVCMLSCSS 643
+SCSS
Sbjct: 479 KH---LEISCSS 487
>gi|188584584|ref|YP_001928029.1| hypothetical protein Mpop_5402 [Methylobacterium populi BJ001]
gi|226707709|sp|B1ZBT6.1|Y5402_METPB RecName: Full=UPF0061 protein Mpop_5402
gi|179348082|gb|ACB83494.1| protein of unknown function UPF0061 [Methylobacterium populi BJ001]
Length = 498
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 199/504 (39%), Positives = 274/504 (54%), Gaps = 46/504 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +V+P+A VE P+LV + ++A L LDP E P+ SG GA P A Y G
Sbjct: 19 FARVAPTA-VEAPRLVRLNRTLALDLGLDPDRLESPEGLDVLSGRRVAEGAEPLAAAYAG 77
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE++ R ++QLKG+G TP+SR DG A L +RE+L
Sbjct: 78 HQFGQFVPQLGDGRAILLGEVVGRDGRRRDIQLKGSGPTPFSRRGDGRAALGPVLREYLV 137
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL VTTG+ V R+ PGA++ RVA S +R GS+Q A+
Sbjct: 138 SEAMHALGIPTTRALAAVTTGEPVIRETVL-------PGAVLTRVASSHIRVGSFQFFAA 190
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D++ +R LAD+AI H + +E+ N Y A V
Sbjct: 191 RG--DVEGLRALADHAIARH-----DPEAAEA----------------ENPYRALLEGVI 227
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
R A LVA+W G+GF HGV+NTDNMSI G TIDYGP FLDA+DP+ ++ D G RY
Sbjct: 228 RRQAELVARWLGIGFIHGVMNTDNMSIAGETIDYGPCAFLDAYDPATAFSSIDRHG-RYA 286
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDD----KEANYVMERYGTKFMDEYQAIMTKKLGLP 488
+ NQP I LWN+ + + L D+ EA + + F Y ++ +KLGL
Sbjct: 287 YGNQPRIALWNLTRLAEALLPLLSEDETKAVAEAEAALTGFAGLFEAAYHGLLNRKLGLT 346
Query: 489 KY---NKQIISKLLNNMAVDKVDYTNFFRALSNVK-ADPSIPEDELLVPLKAVLLDIGKE 544
+ + LL MA + D+T FR LS PE E + ++++ +D
Sbjct: 347 TMRDGDPALAGDLLKTMAENGADFTLTFRRLSAAAPGSGPAPEPEAVEAVRSLFID---- 402
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEV 603
++ +W + + L S RKALM SVNP ++ RN+ ++ I+AA E DF
Sbjct: 403 -PTSFDAWAERWRRRLDEEPGSAAGRKALMRSVNPAFIPRNHRVEAMIEAAVERQDFVPF 461
Query: 604 RRLLKLMERPYDEQPGMEKYARLP 627
LL ++ RPYD+QP ++A P
Sbjct: 462 ETLLTVLSRPYDDQPDFAQFAEAP 485
>gi|399519207|ref|ZP_10760015.1| conserved hypothetical protein [Pseudomonas pseudoalcaligenes CECT
5344]
gi|399113031|emb|CCH36573.1| conserved hypothetical protein [Pseudomonas pseudoalcaligenes CECT
5344]
Length = 487
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 215/551 (39%), Positives = 299/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+ L++D+ F R GD A T++ P +E P+LV SES L
Sbjct: 1 MKSLDTLSFDNRFAR--LGD------------AFSTEILPEP-IEQPRLVVASESALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L E +RP+F F+G A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLATSEAQRPEFAELFAGHKLWEEAEPRAMVYSGHQFGGYTPRLGDGRGLLLGEVVNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+ W+L LKGAG TPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ + V R
Sbjct: 106 QHWDLHLKGAGMTPYSRMGDGRAVLRSSIREFLASEHLHALGIPSSRALCVTGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+AQS +RFG ++ + +R E L +TL ++ + HF
Sbjct: 166 E-------KQESAAMVLRLAQSHVRFGHFEYFYYTRQHEHL---KTLGEHVMACHFPQCL 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
++ + A EV ERTA+++A WQ GF HGV+NTDNM
Sbjct: 216 EQDE---------------------PWLALLREVIERTAAMIAYWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +
Sbjct: 255 SILGITFDYGPYAFLDDFDANHICNHSDDTG-RYSFSNQVPIAHWNLAALAQALTPFAAV 313
Query: 458 DD-KEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDK-VDYTNF 512
+ +EA +E + + Y +M K+LG ++ +I +LL M K DYT F
Sbjct: 314 EQLREA---LELFLPLYQAHYLDLMRKRLGFTSAEDEDEALIQRLLQLMQQGKATDYTLF 370
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKA 572
FR L P + L + ++ +D+ + +W Y+ G ER
Sbjct: 371 FRHLGE-----QAPAEALKI-VREDFVDLA-----GFDAWSRDYLARCEREGGEQAERLV 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M+SVNPKY+LRNYL Q AI+AAE GD+G VR L ++ RP+DEQP M++YA PP W
Sbjct: 420 RMHSVNPKYILRNYLAQQAIEAAEQGDYGPVRELHAVLSRPFDEQPDMQRYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|84387713|ref|ZP_00990729.1| hypothetical cytosolic protein [Vibrio splendidus 12B01]
gi|84377396|gb|EAP94263.1| hypothetical cytosolic protein [Vibrio splendidus 12B01]
Length = 485
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 205/528 (38%), Positives = 281/528 (53%), Gaps = 58/528 (10%)
Query: 120 RTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATP 179
R ++PR YT + P+ + N Q +AW+ ++A+ L E + SG
Sbjct: 12 RFTALPR----LFYTPIQPTP-LSNVQWLAWNHNLANELGFPSFENASEELLETLSGNVD 66
Query: 180 LAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADG 239
P A Y GHQFG + LGDGR + L +++ E ++L LKGAGKTPYSR DG
Sbjct: 67 PEQFSPLAMKYAGHQFGSYNPDLGDGRGLLLAQVVAKSGETFDLHLKGAGKTPYSRMGDG 126
Query: 240 LAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQ 299
AV+RS++RE+LCSEAM L IPTTRAL ++T+ V R+ K+E GA++ R A+
Sbjct: 127 RAVIRSTVREYLCSEAMAGLNIPTTRALAMMTSDTPVYRE-------KQEWGALLVRAAE 179
Query: 300 SFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDL 359
S +RFG ++ Q L + LAD I HF E L DED
Sbjct: 180 SHIRFGHFEHLFYTNQ--LSEHKLLADKVIEWHF--------PECL-----DED------ 218
Query: 360 TSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSF 419
YAA E+ +RTA +VA WQ GF HGV+NTDNMSI+G T DYGPF FLD +DP
Sbjct: 219 --KPYAAMFNEIVDRTAEMVALWQANGFAHGVMNTDNMSIIGQTFDYGPFAFLDEYDPRL 276
Query: 420 TPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQA 479
N +D G RY F QP IGLWN++ + +L + L++ + +E+Y + +
Sbjct: 277 ICNHSDYQG-RYAFNQQPRIGLWNLSALAHSL--SPLVNKADLEAALEQYEPQMNGYFSQ 333
Query: 480 IMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA 536
+M +KLGL + + ++ + M+ +KVDY FFR LSN+ P P +
Sbjct: 334 MMRRKLGLLSKQEGDSRLFESMFELMSQNKVDYPRFFRTLSNLDTLP---------PQEV 384
Query: 537 VLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
+ L I +E + W+ +Y+Q S ER M VNPKY+LRNYL Q AID AE
Sbjct: 385 IDLIIDREAAKLWMD---NYLQRCELEDSSVAERCEKMRQVNPKYILRNYLAQLAIDKAE 441
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
GD ++ L ++ PY E P E A LPP W G M +SCSS
Sbjct: 442 RGDSSDIEALTVVLADPYAEHPDYEHLAALPPEW----GKAMEISCSS 485
>gi|254286864|ref|ZP_04961816.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
gi|150423014|gb|EDN14963.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
Length = 508
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 207/523 (39%), Positives = 279/523 (53%), Gaps = 60/523 (11%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYA 187
A YT V P ++N + W+ +A L E P+ L SG A P A
Sbjct: 37 QAFYTPVHPQP-LQNVRWGMWNTRLAQQFGLP----EAPNDELLASLSGQQLPADFSPVA 91
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSS+
Sbjct: 92 MKYAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSL 151
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG +
Sbjct: 152 REYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHF 204
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q ++ LAD I HF TS YAAW
Sbjct: 205 EHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPYAAW 241
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 242 FSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQ 301
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 302 G-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGL 358
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGK 543
+ ++ + +A + DYT F R LS + + +AV+ L + +
Sbjct: 359 ATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLVLDR 408
Query: 544 ERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF
Sbjct: 409 EAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERGDF 468
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 469 EEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 508
>gi|417821282|ref|ZP_12467896.1| hypothetical protein VCHE39_2785 [Vibrio cholerae HE39]
gi|423956443|ref|ZP_17734997.1| hypothetical protein VCHE40_2086 [Vibrio cholerae HE-40]
gi|423985231|ref|ZP_17738548.1| hypothetical protein VCHE46_2093 [Vibrio cholerae HE-46]
gi|340038913|gb|EGQ99887.1| hypothetical protein VCHE39_2785 [Vibrio cholerae HE39]
gi|408657617|gb|EKL28695.1| hypothetical protein VCHE40_2086 [Vibrio cholerae HE-40]
gi|408664132|gb|EKL34972.1| hypothetical protein VCHE46_2093 [Vibrio cholerae HE-46]
Length = 489
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 205/521 (39%), Positives = 279/521 (53%), Gaps = 56/521 (10%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQC 189
A YT V P ++N + W+ +A L E + L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNSRLAQQFGL--PEAPNDELLLSLSGQHLPADCSPVAMK 74
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSS+RE
Sbjct: 75 YAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSLRE 134
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG ++
Sbjct: 135 YLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHFEH 187
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
Q ++ LAD I +F TS YAAW
Sbjct: 188 FFYTDQH--ANLKLLADKVIEWYFPDCVQ---------------------TSKPYAAWFS 224
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D G
Sbjct: 225 QVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQG- 283
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 284 RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGLAT 341
Query: 490 YNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKER 545
+ ++ + +A + DYT+F R LS + + +AV+ L + +E
Sbjct: 342 QQEGDGELFADFFALLANNHTDYTSFLRELSCLDRQGN----------EAVIDLVLDREA 391
Query: 546 KEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF E
Sbjct: 392 AKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERGDFEE 451
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 452 MQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|451981719|ref|ZP_21930067.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
gi|451761067|emb|CCQ91332.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
Length = 495
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 201/527 (38%), Positives = 292/527 (55%), Gaps = 65/527 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
++ LE LN+ + FVR L + + P V NP VA + VA L
Sbjct: 1 MQTLETLNFQNRFVR---------------LGGEFYQYKPPTPVSNPFPVAKNPDVAGLL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP+EFERP+F F G L GA P A Y G QFG + QLGDGR + LGE+ N +
Sbjct: 46 DLDPQEFERPEFWQHFGGNRVLPGAQPLAMVYSGFQFGSYNPQLGDGRGLLLGEVQNEQG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W++ LKG G+T + R DG A LRSSIRE+LC EAM LGIPTTR+L +V + + R
Sbjct: 106 EFWDVYLKGCGQTRFCRGFDGRATLRSSIREYLCGEAMAGLGIPTTRSLAVVGIQELIQR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
++ EP A++ R+A++ +RFG++ H + E V LAD+ I H+F +E
Sbjct: 166 EL-------PEPAAVLVRIARTHVRFGNFDYFHYTNRPEK---VAELADHVIHHYFPELE 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ +KYA +V ++TA ++A WQ VGF HGV+NTDNM
Sbjct: 216 S---------------------APDKYAQMFAQVVDKTAWMIACWQAVGFGHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG T DYGP+GF+D ++P F PN +D+ G RY +A QP IG WN+A+ TL L+
Sbjct: 255 SILGETFDYGPYGFMDRYNPIFVPNHSDIHG-RYSYAQQPQIGHWNLAKLGETL--THLV 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFR 514
+ + +E+Y +F + +M +KLGL + + ++S L+ ++ K D+TNFFR
Sbjct: 312 EPERLQKELEQYAARFNHYNRTMMGRKLGLSVLDSEFDNLVSGLIQLLSRHKPDHTNFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
LS + ++ P LD W+ Y + L +S EE+K M
Sbjct: 372 TLSGFRCG-ALDALRTYFPNNPDELD----------GWLDRYTRLLEREDVSPEEQKEAM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAA-ELGDFGEVRRLLKLMERPYDEQPGM 620
++VNPK++LRNYL Q AID A + D+ E+ RL +++ P+ +QP +
Sbjct: 421 DAVNPKFILRNYLAQQAIDRALKENDYSEIERLRVILKHPFGDQPEL 467
>gi|218894122|ref|YP_002442991.1| hypothetical protein PLES_54131 [Pseudomonas aeruginosa LESB58]
gi|416860084|ref|ZP_11914142.1| hypothetical protein PA13_19855 [Pseudomonas aeruginosa 138244]
gi|226707710|sp|B7V3B6.1|Y5413_PSEA8 RecName: Full=UPF0061 protein PLES_54131
gi|218774350|emb|CAW30167.1| conserved hypothetical protein [Pseudomonas aeruginosa LESB58]
gi|334837793|gb|EGM16540.1| hypothetical protein PA13_19855 [Pseudomonas aeruginosa 138244]
gi|453046535|gb|EME94251.1| hypothetical protein H123_09687 [Pseudomonas aeruginosa PA21_ST175]
Length = 486
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 302/548 (55%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R L G T+ +P P AE P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR-LGGAFSTEVLP-----------DPIAE---PRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|334347697|ref|XP_003341968.1| PREDICTED: LOW QUALITY PROTEIN: selenoprotein O-like [Monodelphis
domestica]
Length = 699
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 194/458 (42%), Positives = 261/458 (56%), Gaps = 63/458 (13%)
Query: 102 LEDLNWDHSFVRELPGD---PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV---- 154
L L +D+ +R LP + P DS PR V AC+++V PS + P+LVA+S
Sbjct: 54 LSGLRFDNRALRALPVEEPPPGGDSAPRPVPGACFSRVRPSP-LRQPRLVAFSAPALALL 112
Query: 155 ---------ADSLELDPKEF-ERP---------DFPLFFSGATPLAGAVPYAQCYGGHQF 195
A + +P+E E P + L+FSG L G+ P A CY GHQF
Sbjct: 113 GLDPPPPLGAGPDQEEPEEAGETPSRRVSSAEAELELYFSGNALLPGSEPAAHCYCGHQF 172
Query: 196 GMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEA 255
G +AGQLGDG A+ LGE+L +RWELQLKGAG TP+SR ADG VLRSSIREFLCSEA
Sbjct: 173 GSFAGQLGDGAAVYLGEVLGAAGQRWELQLKGAGLTPFSRQADGRKVLRSSIREFLCSEA 232
Query: 256 MHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI------ 309
M LGIPTTRA VT+ V RD++YDGNPK E A+V R+A +FLRFGS++I
Sbjct: 233 MFHLGIPTTRAGSCVTSESKVIRDIYYDGNPKYESCAVVLRIASTFLRFGSFEIFKPPDE 292
Query: 310 HASR-----GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
H R G+ D+ + + DY I + I+ + +S+ +
Sbjct: 293 HTGRKGPSVGRNDIRV--QMLDYVIGSFYPEIQAAHARDSM----------------QRN 334
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
A+ E+ RTA LVA WQ VGF HGVLNTDNMSI+GLTIDYGPFGF+D +DP N++
Sbjct: 335 LAFFREITRRTARLVADWQCVGFCHGVLNTDNMSIVGLTIDYGPFGFMDRYDPDHVCNSS 394
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY ++ QP++ WN+ + + L ++ E V+E Y +F Y M +K
Sbjct: 395 DTTG-RYAYSKQPEVCKWNLRKLAEALVPELPLELSEP--VLEEYDAEFDKRYLHKMRQK 451
Query: 485 LGLPKY----NKQIISKLLNNMAVDKVDYTNFFRALSN 518
LGL + ++++ + LL M + D+TN F LS+
Sbjct: 452 LGLVQLQLEEDRELAAALLETMRLTGADFTNTFCLLSS 489
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 62/119 (52%), Gaps = 27/119 (22%)
Query: 540 DIGKERKEAWISWVLSYIQEL----LSSGIS---DEERKALMNSVNPKYVLRNYLCQSAI 592
++ + +E W +W+ +Y L S+G + D ER +M + NP+ VLRNY+ Q+AI
Sbjct: 572 ELIRRNREHWDAWLQTYRARLERDRQSAGSASGWDTERVRVMRANNPRIVLRNYIAQNAI 631
Query: 593 DAAELGDFGEVRRLLKLMERPYDE--------------------QPGMEKYARLPPAWA 631
+AAE GDF EV+R+L+L+E+PY E Y R PP WA
Sbjct: 632 EAAEQGDFSEVQRVLRLLEKPYGEPWEDDADGLLAAAAAADSGEAESRRSYGRKPPLWA 690
>gi|419830404|ref|ZP_14353889.1| hypothetical protein VCHC1A2_2790 [Vibrio cholerae HC-1A2]
gi|419834083|ref|ZP_14357538.1| hypothetical protein VCHC61A2_2728 [Vibrio cholerae HC-61A2]
gi|422917786|ref|ZP_16952104.1| hypothetical protein VCHC02A1_2092 [Vibrio cholerae HC-02A1]
gi|423822690|ref|ZP_17716700.1| hypothetical protein VCHC55C2_2090 [Vibrio cholerae HC-55C2]
gi|423856431|ref|ZP_17720507.1| hypothetical protein VCHC59A1_2144 [Vibrio cholerae HC-59A1]
gi|423882958|ref|ZP_17724095.1| hypothetical protein VCHC60A1_2088 [Vibrio cholerae HC-60A1]
gi|423998215|ref|ZP_17741467.1| hypothetical protein VCHC02C1_2117 [Vibrio cholerae HC-02C1]
gi|424020033|ref|ZP_17759819.1| hypothetical protein VCHC59B1_2117 [Vibrio cholerae HC-59B1]
gi|424625404|ref|ZP_18063865.1| hypothetical protein VCHC50A1_2112 [Vibrio cholerae HC-50A1]
gi|424629889|ref|ZP_18068176.1| hypothetical protein VCHC51A1_2010 [Vibrio cholerae HC-51A1]
gi|424633933|ref|ZP_18072033.1| hypothetical protein VCHC52A1_2111 [Vibrio cholerae HC-52A1]
gi|424637013|ref|ZP_18075021.1| hypothetical protein VCHC55A1_2110 [Vibrio cholerae HC-55A1]
gi|424640922|ref|ZP_18078805.1| hypothetical protein VCHC56A1_2189 [Vibrio cholerae HC-56A1]
gi|424648989|ref|ZP_18086652.1| hypothetical protein VCHC57A1_2003 [Vibrio cholerae HC-57A1]
gi|443527908|ref|ZP_21093957.1| hypothetical protein VCHC78A1_02032 [Vibrio cholerae HC-78A1]
gi|341636668|gb|EGS61362.1| hypothetical protein VCHC02A1_2092 [Vibrio cholerae HC-02A1]
gi|408012399|gb|EKG50180.1| hypothetical protein VCHC50A1_2112 [Vibrio cholerae HC-50A1]
gi|408018140|gb|EKG55602.1| hypothetical protein VCHC52A1_2111 [Vibrio cholerae HC-52A1]
gi|408023420|gb|EKG60587.1| hypothetical protein VCHC56A1_2189 [Vibrio cholerae HC-56A1]
gi|408023980|gb|EKG61122.1| hypothetical protein VCHC55A1_2110 [Vibrio cholerae HC-55A1]
gi|408032827|gb|EKG69398.1| hypothetical protein VCHC57A1_2003 [Vibrio cholerae HC-57A1]
gi|408055084|gb|EKG90029.1| hypothetical protein VCHC51A1_2010 [Vibrio cholerae HC-51A1]
gi|408620177|gb|EKK93189.1| hypothetical protein VCHC1A2_2790 [Vibrio cholerae HC-1A2]
gi|408634666|gb|EKL06901.1| hypothetical protein VCHC55C2_2090 [Vibrio cholerae HC-55C2]
gi|408640719|gb|EKL12505.1| hypothetical protein VCHC59A1_2144 [Vibrio cholerae HC-59A1]
gi|408641082|gb|EKL12863.1| hypothetical protein VCHC60A1_2088 [Vibrio cholerae HC-60A1]
gi|408648905|gb|EKL20222.1| hypothetical protein VCHC61A2_2728 [Vibrio cholerae HC-61A2]
gi|408852570|gb|EKL92392.1| hypothetical protein VCHC02C1_2117 [Vibrio cholerae HC-02C1]
gi|408867127|gb|EKM06489.1| hypothetical protein VCHC59B1_2117 [Vibrio cholerae HC-59B1]
gi|443453780|gb|ELT17598.1| hypothetical protein VCHC78A1_02032 [Vibrio cholerae HC-78A1]
Length = 489
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 205/521 (39%), Positives = 277/521 (53%), Gaps = 56/521 (10%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQC 189
A YT V P ++N + W+ +A L E + L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNARLAQQFGL--PEAPNDELLLSLSGQHLPADFSPVAMK 74
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSS+RE
Sbjct: 75 YAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSLRE 134
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG ++
Sbjct: 135 YLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHFEH 187
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
Q ++ L D I HF TS YAAW
Sbjct: 188 FFYTDQH--ANLKLLTDKVIEWHFPDCVQ---------------------TSKPYAAWFS 224
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D G
Sbjct: 225 QVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQG- 283
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 284 RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGLAT 341
Query: 490 YNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKER 545
+ ++ + +A + DYT F R LS + + +AV+ L + +E
Sbjct: 342 QQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLVLDREA 391
Query: 546 KEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF E
Sbjct: 392 AKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERGDFEE 451
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 452 MQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|421170857|ref|ZP_15628774.1| hypothetical protein PABE177_5543 [Pseudomonas aeruginosa ATCC
700888]
gi|404522144|gb|EKA32672.1| hypothetical protein PABE177_5543 [Pseudomonas aeruginosa ATCC
700888]
Length = 486
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 302/548 (55%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R GD A T+V P A + P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR--LGD------------AFSTEVLP-APIAEPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|424659638|ref|ZP_18096887.1| hypothetical protein VCHE16_1800 [Vibrio cholerae HE-16]
gi|408051911|gb|EKG86981.1| hypothetical protein VCHE16_1800 [Vibrio cholerae HE-16]
Length = 489
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 206/525 (39%), Positives = 278/525 (52%), Gaps = 64/525 (12%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYA 187
A YT V P ++N + W+ +A L E P+ L SG LA P A
Sbjct: 18 QAFYTPVQPQP-LQNVRWGMWNTRLAQQFGLP----EAPNDELLASLSGQQLLADFSPVA 72
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSSI
Sbjct: 73 MKYAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSI 132
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG +
Sbjct: 133 REYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHF 185
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q ++ LAD I +F TS YAAW
Sbjct: 186 EHFFYTDQH--ANLKLLADKVIEWYFPDCVQ---------------------TSKPYAAW 222
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 223 FSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQ 282
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + LID + + Y + +M KLGL
Sbjct: 283 G-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSDHLNLHFSRLMRAKLGL 339
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ ++ + +A + DYT F R LS + + ++D+ +
Sbjct: 340 ATQQEGDGELFADFFALLASNHTDYTRFLRELSCLDRQGN-----------EAVIDLVLD 388
Query: 545 RKEAWISWVLSYI----QELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
R+ A I W+ Y+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE G
Sbjct: 389 REAAKI-WLTRYLDRAARELGQEGGPISSSERCQAMRQVNPKYILRNYLAQQAIEFAERG 447
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 448 DFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|229515315|ref|ZP_04404775.1| hypothetical protein VCB_002972 [Vibrio cholerae TMA 21]
gi|229348020|gb|EEO12979.1| hypothetical protein VCB_002972 [Vibrio cholerae TMA 21]
Length = 489
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 206/523 (39%), Positives = 281/523 (53%), Gaps = 60/523 (11%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQC 189
A YT V P ++N + W+ +A L E + L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNSRLAQQFGL--PEAPNDELLLSLSGQHLPADFSPVAMK 74
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSS+RE
Sbjct: 75 YAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSLRE 134
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG ++
Sbjct: 135 YLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHFEH 187
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
Q ++ LAD I +F TS YAAW
Sbjct: 188 FFYTDQH--ANLKLLADKVIEWYFPDCVQ---------------------TSKPYAAWFS 224
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D G
Sbjct: 225 QVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQG- 283
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 284 RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGLAT 341
Query: 490 YNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ ++ + L +A + DYT F R LS + +E ++ L +LD +
Sbjct: 342 QQEGDGELFADLFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD-----R 389
Query: 547 EAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF
Sbjct: 390 EAAKTWLTRYLERAARELGQEGRPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERGDF 449
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E++RL+ ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 450 EEMQRLVTVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|242046688|ref|XP_002400867.1| selenoprotein O, putative [Ixodes scapularis]
gi|215498714|gb|EEC08208.1| selenoprotein O, putative [Ixodes scapularis]
Length = 620
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 220/615 (35%), Positives = 311/615 (50%), Gaps = 122/615 (19%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+ E L +D+ +R LP D + + R V AC+++V P+ +++P++V SE L
Sbjct: 1 MTTFETLKFDNLALRRLPIDTESRNYVRTVRGACFSRVMPTP-LKSPEMVVVSEDAMLLL 59
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LD +FER D +FSG L G+ P A CY GHQFG ++GQLGDG A+ LGE++N K
Sbjct: 60 DLDRAQFERSDAAEYFSGNKLLPGSEPAAHCYCGHQFGYFSGQLGDGAAMYLGEVINQKG 119
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
ERWE+QLKGAG TPYSR ADG VLRSSIREFLCSEAMH LGIPTTRA +++ V+R
Sbjct: 120 ERWEIQLKGAGLTPYSRSADGRKVLRSSIREFLCSEAMHHLGIPTTRAGTCISSETLVSR 179
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQ---------EDLDIVRTLADYAI 329
DMFYDG+PK+E +++ R+A +FLRFGS++I + Q DI+ L DY++
Sbjct: 180 DMFYDGHPKDEKCSVILRIAPTFLRFGSFEIFKTLDQFTGRVGPSVGRKDILIQLLDYSM 239
Query: 330 RHHFR-HIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFT 388
+ ++E+ N E + Y + EV + TASLVA+WQ VGF
Sbjct: 240 SIFMQIYLEHGNDKEKM------------------YIEFFKEVIKSTASLVAKWQCVGFC 281
Query: 389 HGVLNTD---NMSIL------GLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDI 439
HGV+N +M+ L L I GF+ + T + D G RY + QP+I
Sbjct: 282 HGVVNCKFKKHMTCLLCHRFPSLNI----IGFISSVIYLHTFLSDD--GGRYTYIKQPEI 335
Query: 440 GLWNIAQFSTTLAAA-------KLIDDKEANY---------------------------- 464
LWN+ +F+ + A L+D Y
Sbjct: 336 CLWNLRKFAEAIQGAVPLSKTLPLLDAYSLEYETCFLAEIRNKFGLFQKDPAEDKVLITS 395
Query: 465 ---VMERYGTKFMDEYQAIMTKKLGLPKYNKQIISK-----LLNNMAVDKVDYTNFFRAL 516
ME G F ++ + T L +P +++ SK L + D FR L
Sbjct: 396 FYDAMEATGADFTRSFRCLST--LCVPGHDQHESSKDALKAALLSCCSTHSDLMTHFRTL 453
Query: 517 SNVKADP-----SIPEDELLVPL-KAVL--------LDIGKERK------------EAWI 550
S+ + S ELL L K VL ++ GKE K + W
Sbjct: 454 SSTRDFQLFLILSQSNPELLEQLGKGVLAKERIMAQIEKGKELKDMTAEEMEKKNAQVWT 513
Query: 551 SWVLSYIQELLSSGIS-------DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
W+ Y + L + E+R LMNS NP++VLRN++ Q AID AE GD+ EV
Sbjct: 514 EWIEKYSRRLAAEAKDHSDLQGLQEQRVQLMNSHNPRFVLRNHVAQRAIDMAEKGDYSEV 573
Query: 604 RRLLKLMERPYDEQP 618
R++LK+++ PY + P
Sbjct: 574 RKVLKILQHPYSDNP 588
>gi|330501550|ref|YP_004378419.1| hypothetical protein [Pseudomonas mendocina NK-01]
gi|328915836|gb|AEB56667.1| hypothetical protein MDS_0636 [Pseudomonas mendocina NK-01]
Length = 487
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 218/552 (39%), Positives = 300/552 (54%), Gaps = 72/552 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+ L +D+ F R GD A T+V P +E P+LV SES L
Sbjct: 1 MKSLDQLIFDNRFAR--LGD------------AFSTEVLPEP-IEQPRLVVASESAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P E +R +F F+G A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLAPDEAQRSEFAELFAGHKLWEEAEPRAMVYSGHQFGGYTPRLGDGRGLLLGEVINDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TPYSR DG AVLRSSIRE L SE +H LGIP++RALC+ + V R
Sbjct: 106 EHWDLHLKGAGMTPYSRMGDGRAVLRSSIRELLASEHLHALGIPSSRALCVTGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+AQS +RFG ++ + +R E L +TL ++ + HF
Sbjct: 166 E-------KKESAAMVLRLAQSHVRFGHFEYFYYTRQHEHL---KTLGEHVMACHFPACL 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
++ + A EV ERTAS++A WQ GF HGV+NTDNM
Sbjct: 216 EQDE---------------------PWLALLREVIERTASMIAHWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +
Sbjct: 255 SILGITFDYGPYAFLDDFDANHICNHSDDTG-RYSFSNQVPIAHWNLAALAQALTPFASV 313
Query: 458 DD-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKV-DYTN 511
+ +EA + + Y ++D +M K+LG + +I +LL M K DY+
Sbjct: 314 EKLREALDLFLPLYQAHYLD----LMRKRLGFTSAEDEDDALIQRLLQLMQQGKASDYSL 369
Query: 512 FFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
FFR L P + L V ++ +D+ + +W Y+ G ER+
Sbjct: 370 FFRRLGE-----QAPAEALKV-VRDDFVDLA-----GFDAWGHDYLARCELEGGEQSERQ 418
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
A M++VNPKY+LRNYL Q AI+AAE GD+G VR L ++ RP+DEQPGM++YA PP W
Sbjct: 419 ARMHAVNPKYILRNYLAQHAIEAAEAGDYGPVRELHAVLSRPFDEQPGMQRYAERPPEWG 478
Query: 632 YRPGVCMLSCSS 643
+SCSS
Sbjct: 479 KH---LEISCSS 487
>gi|149909012|ref|ZP_01897671.1| hypothetical protein PE36_19190 [Moritella sp. PE36]
gi|149808023|gb|EDM67966.1| hypothetical protein PE36_19190 [Moritella sp. PE36]
Length = 520
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 204/530 (38%), Positives = 277/530 (52%), Gaps = 62/530 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y K P + +P++V+ + V L LD + SG +G P A Y G
Sbjct: 34 YNKQMPDG-ISDPKMVSLNPQVLALLGLDNVVADSDALLQLCSGNYLPSGFDPLAMKYTG 92
Query: 193 HQFGMWAGQLGDGRAITLGEI------------LNLKSERWELQLKGAGKTPYSRFADGL 240
HQFG + LGDGR + L ++ + K+ W+L LKGAGKTPYSR DG
Sbjct: 93 HQFGHYNPDLGDGRGLLLAQVKGNDASGDGSNSNSSKNTTWDLHLKGAGKTPYSRQGDGR 152
Query: 241 AVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQS 300
AVLRSSIRE+LCS AM LGIPTT+AL +V V R+ + E A+V RVA+S
Sbjct: 153 AVLRSSIREYLCSAAMQGLGIPTTQALSVVVGSDAVMRE-------QVEQAAMVVRVAES 205
Query: 301 FLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLT 360
+RFG ++ H Q+ LD ++ + DY + HF + LT
Sbjct: 206 HVRFGHFE-HFYYTQQ-LDDLKLMLDYTLTKHFPDV----------------------LT 241
Query: 361 SN-KYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSF 419
+ Y A+ +V TA L+A WQ VGF HGV+NTDNMSILG T DYGPF F D FDP++
Sbjct: 242 AEVPYLAFYKQVMTTTAELMAHWQAVGFVHGVMNTDNMSILGQTFDYGPFAFQDNFDPAY 301
Query: 420 TPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQA 479
N TD G RY F QP +G WN+ L +D + N V++ Y F+ +++
Sbjct: 302 VCNHTDYSG-RYAFNQQPQVGYWNLMALGRALTP--FMDVEPLNTVLQTYDDIFLAKFRE 358
Query: 480 IMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI---PEDELLVP 533
+M KLGL + + ++I LL +A VDYT FFR+LS+ + +E
Sbjct: 359 LMRGKLGLQQVQDTDGELIKNLLEILAGSAVDYTYFFRSLSDFDSAEDAENSTNNEKNSA 418
Query: 534 LKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAID 593
++ +D +EA+ W + Y Q L+ DE RK MN VNPKY+LRNYL Q AI
Sbjct: 419 IRDQFID-----REAFDGWAVKYQQRLVLESSVDEVRKVRMNQVNPKYILRNYLAQQAIT 473
Query: 594 AAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
A D+ V LL+++ P+ E P E A LPP W + +LSCSS
Sbjct: 474 QATDYDYSLVNELLEVLTNPFSEHPEFETLAALPPEWGRK---MVLSCSS 520
>gi|424017109|ref|ZP_17756938.1| hypothetical protein VCHC55B2_2294 [Vibrio cholerae HC-55B2]
gi|408859795|gb|EKL99449.1| hypothetical protein VCHC55B2_2294 [Vibrio cholerae HC-55B2]
Length = 487
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 205/521 (39%), Positives = 277/521 (53%), Gaps = 56/521 (10%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQC 189
A YT V P ++N + W+ +A L E + L SG A P A
Sbjct: 16 QAFYTPVHPQP-LQNVRWGMWNARLAQQFGL--PEAPNDELLLSLSGQHLPADFSPVAMK 72
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSS+RE
Sbjct: 73 YAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSLRE 132
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG ++
Sbjct: 133 YLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHFEH 185
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
Q ++ L D I HF TS YAAW
Sbjct: 186 FFYTDQH--ANLKLLTDKVIEWHFPDCVQ---------------------TSKPYAAWFS 222
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D G
Sbjct: 223 QVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQG- 281
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 282 RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGLAT 339
Query: 490 YNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKER 545
+ ++ + +A + DYT F R LS + + +AV+ L + +E
Sbjct: 340 QQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLVLDREA 389
Query: 546 KEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF E
Sbjct: 390 AKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERGDFEE 449
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 450 MQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 487
>gi|47225785|emb|CAF98265.1| unnamed protein product [Tetraodon nigroviridis]
Length = 660
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 200/444 (45%), Positives = 253/444 (56%), Gaps = 46/444 (10%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
+LE L++D+ +R+LP DP + R+V AC+++V P + P+ VA S L L
Sbjct: 9 SLERLDFDNIALRKLPLDPSEEPGVRQVKGACFSRVKPQP-LTKPRFVAVSHEALKLLGL 67
Query: 161 DPKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEI------ 213
D +E P P + SG+ + G+ P A CY GHQFG +AGQLGDG A LGE+
Sbjct: 68 DGEEVLHDPLGPEYLSGSKVMPGSDPAAHCYCGHQFGQFAGQLGDGAACYLGEVKVPPDQ 127
Query: 214 -----LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALC 268
S RWE+Q+KGAG TPYSR ADG VLRSSIREFLCSEAM FLGIPTTRA
Sbjct: 128 DPELLRENPSGRWEIQVKGAGLTPYSRQADGRKVLRSSIREFLCSEAMFFLGIPTTRAGS 187
Query: 269 LVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH-------ASRGQE-DLDI 320
+VT+ V RD++Y GNP E ++V R+A +FLRFGS++I RG LD
Sbjct: 188 VVTSDSRVVRDVYYSGNPCYEKCSVVLRIAPTFLRFGSFEIFKPPDELTGRRGPSCGLDE 247
Query: 321 VR-TLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLV 379
+R + DY I + I+ + D T + A+ EV RTA LV
Sbjct: 248 IRGQMMDYVIELFYPEIQ----------------QNFPDRT-ERNVAFFREVMVRTARLV 290
Query: 380 AQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDI 439
AQWQ VGF HGVLNTDNMSILGLT+DYGP+GF+D FDP F N +D G RY + QP I
Sbjct: 291 AQWQCVGFCHGVLNTDNMSILGLTLDYGPYGFMDRFDPDFICNASDNSG-RYSYQAQPAI 349
Query: 440 GLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQ----II 495
WN+ + + LA D EA VM+ Y F Y M KKLGL K ++ +I
Sbjct: 350 CRWNLVKLAEALAPELPPDRAEA--VMDEYLALFNGFYLQNMRKKLGLLKKDEPEDEILI 407
Query: 496 SKLLNNMAVDKVDYTNFFRALSNV 519
S LL M D+TN FR LS +
Sbjct: 408 SDLLQTMHGTGADFTNTFRCLSQI 431
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 62/98 (63%), Gaps = 12/98 (12%)
Query: 540 DIGKERKEAWISWVLSYIQELLSS--GISD-----EERKALMNSVNPKYVLRNYLCQSAI 592
++ + EAW W+ Y + L G SD EER LM++ NP+ +LRNY+ Q+AI
Sbjct: 519 ELKARQAEAWRGWIARYRKRLAGELEGQSDAHTVQEERVRLMDAANPRVILRNYIAQNAI 578
Query: 593 DAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAW 630
+AAE GDF EVRR+L+++E+PY QPG+E PAW
Sbjct: 579 EAAENGDFSEVRRVLQVLEKPYSWQPGLEF-----PAW 611
>gi|15600216|ref|NP_253710.1| hypothetical protein PA5023 [Pseudomonas aeruginosa PAO1]
gi|418587697|ref|ZP_13151723.1| hypothetical protein O1O_23438 [Pseudomonas aeruginosa MPAO1/P1]
gi|418591034|ref|ZP_13154937.1| hypothetical protein O1Q_10511 [Pseudomonas aeruginosa MPAO1/P2]
gi|421519589|ref|ZP_15966260.1| hypothetical protein A161_25085 [Pseudomonas aeruginosa PAO579]
gi|33517097|sp|Q9HUE6.1|Y5023_PSEAE RecName: Full=UPF0061 protein PA5023
gi|9951311|gb|AAG08408.1|AE004915_3 conserved hypothetical protein [Pseudomonas aeruginosa PAO1]
gi|375041635|gb|EHS34323.1| hypothetical protein O1O_23438 [Pseudomonas aeruginosa MPAO1/P1]
gi|375050113|gb|EHS42597.1| hypothetical protein O1Q_10511 [Pseudomonas aeruginosa MPAO1/P2]
gi|404345508|gb|EJZ71860.1| hypothetical protein A161_25085 [Pseudomonas aeruginosa PAO579]
Length = 486
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 301/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R L G T+ +P P AE P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR-LGGAFSTEVLP-----------DPIAE---PRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + + ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDHALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|229593872|ref|XP_001026305.3| hypothetical protein TTHERM_00852990 [Tetrahymena thermophila]
gi|225567248|gb|EAS06060.3| hypothetical protein TTHERM_00852990 [Tetrahymena thermophila
SB210]
Length = 634
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 180/421 (42%), Positives = 246/421 (58%), Gaps = 40/421 (9%)
Query: 115 LPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF---ERPDFP 171
LP + D+ P +V A Y+KV P +NP++V+ SES + L+L +E E+
Sbjct: 36 LPVEENKDNTPHQVRGAFYSKVKPQVR-KNPKIVSLSESALNLLDLSKEEVLKDEKESAE 94
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
+ P + A P A CY GHQFG WA QLGDGRAI+ G+I N K E ELQLKG+G T
Sbjct: 95 ILTGNVIP-SNAQPIAHCYCGHQFGSWAAQLGDGRAISYGDIRNQKGEIIELQLKGSGIT 153
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSRFADG AVLRSSIRE+LCSEAMHFL IPTTRA + T RD Y+ E
Sbjct: 154 PYSRFADGNAVLRSSIREYLCSEAMHFLNIPTTRAASITITEDQAMRDPLYNQQIVYEKC 213
Query: 292 AIVCRVAQSFLRFGSYQIHASRG-QEDL--DIVRTLADYAIRHHFRHIENMNKSESLSFS 348
A+V R++ +F+RFGS+QI +G E L ++ L D+ I++H+
Sbjct: 214 AVVLRLSPTFIRFGSFQICNKQGPSEGLGEQMIPELLDFIIKNHYPEF------------ 261
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGP 408
G ED KY + E+ +RTA LVA+WQ VGF HGVLNTDNMSI+G+TIDYGP
Sbjct: 262 NGKED---------KYMLFLQEITKRTAQLVAKWQSVGFCHGVLNTDNMSIVGVTIDYGP 312
Query: 409 FGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMER 468
FGF++ FD N +D G YC+ NQP WN+ + + A + +++ YV++
Sbjct: 313 FGFMEHFDKKHICNHSDKEG-YYCYQNQPSACKWNLLRLIEGIKWA-VNEEQAKEYVIQN 370
Query: 469 YGTKFMDEYQAIMTKKLGL---------PKYNKQIISKLLNNMAVDKVDYTNFFRALSNV 519
+ + D Y +M +K+GL + +K+II+ L++ M ++TNFFR LS +
Sbjct: 371 FDKIYYDHYYTLMRRKIGLFREDLYEKNLQLDKKIINNLMDYMDSSGSEFTNFFRKLSQI 430
Query: 520 K 520
K
Sbjct: 431 K 431
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 49/78 (62%), Gaps = 6/78 (7%)
Query: 569 ERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYD---EQPGMEKYAR 625
ERK M+SVNP VLRNY+ Q I+ AE GD+ + +LLK++ RPY+ E K +
Sbjct: 560 ERKQKMDSVNPAVVLRNYMAQQVIEQAEKGDYSGIEKLLKVLSRPYEDVKENDQEIKICK 619
Query: 626 LPPAWAYRPGVCMLSCSS 643
+ P WA + +C +SCSS
Sbjct: 620 ITPGWASK--LC-VSCSS 634
>gi|33592228|ref|NP_879872.1| hypothetical protein BP1090 [Bordetella pertussis Tohama I]
gi|384203531|ref|YP_005589270.1| hypothetical protein BPTD_1082 [Bordetella pertussis CS]
gi|39932509|sp|Q7VZ47.1|Y1090_BORPE RecName: Full=UPF0061 protein BP1090
gi|33571873|emb|CAE41388.1| conserved hypothetical protein [Bordetella pertussis Tohama I]
gi|332381645|gb|AEE66492.1| hypothetical protein BPTD_1082 [Bordetella pertussis CS]
Length = 487
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 210/536 (39%), Positives = 277/536 (51%), Gaps = 58/536 (10%)
Query: 112 VRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP 171
+++LP D ++P E YT++ P P+L+ + A + LDP EF F
Sbjct: 6 LQDLPTDNSFAALPAEF----YTRLQPRPPAA-PRLLHANAEAAALIGLDPAEFSTQAFL 60
Query: 172 LFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKT 231
FSG PL G A Y GHQFG+WAGQLG+ R G WELQLKGAG T
Sbjct: 61 DVFSGHAPLPGGDTLAAVYSGHQFGVWAGQLGEVRGPAGG---------WELQLKGAGMT 111
Query: 232 PYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPG 291
PYSR DG AVLRSS+RE+L SEAMH LGIPTTR+L LV + V R+ E
Sbjct: 112 PYSRMGDGRAVLRSSVREYLASEAMHGLGIPTTRSLALVVSDDPVMRETV-------ETA 164
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
A+V R+A SF+RFGS++ ++R Q + +R LADY I +
Sbjct: 165 AVVTRMAPSFVRFGSFEHWSARRQPEQ--LRVLADYVIDRFYPECRVAGAGR-------- 214
Query: 352 EDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGF 411
+D + V RTA L+A WQ VGF HGV+NTDNMSILGLT+DYGP+GF
Sbjct: 215 -----LDGEHGEILGLLAAVTRRTALLMADWQAVGFCHGVMNTDNMSILGLTLDYGPYGF 269
Query: 412 LDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYG 470
+D F N +D G RY + QP +GLWN+ + +++L L D EA V++ Y
Sbjct: 270 MDTFQLGHICNHSDSEG-RYAWNRQPSVGLWNLYRLASSL--HTLAPDPEALRAVLDGYE 326
Query: 471 TKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
F + M KLGLP++ ++ ++ LL M D+T FR L P
Sbjct: 327 AVFTQAFHGRMAGKLGLPQFLPEDETLLDDLLQLMHQQGADFTLAFRRLGEAVRGQRQPF 386
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
++ + + A +W S G + + R A M+ VNP YVLRN+L
Sbjct: 387 EDSFID------------RAAAGAWYDRLAARHASDGRAAQARAAAMDEVNPLYVLRNHL 434
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ AI AA GD GE+ LLKL+ PY QPG + YA L P WA +SCSS
Sbjct: 435 AEQAIRAAARGDAGEIDILLKLLRNPYKHQPGYDAYAGLAPDWA---AGLEVSCSS 487
>gi|421351670|ref|ZP_15802035.1| hypothetical protein VCHE25_2913 [Vibrio cholerae HE-25]
gi|395952115|gb|EJH62729.1| hypothetical protein VCHE25_2913 [Vibrio cholerae HE-25]
Length = 489
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 207/525 (39%), Positives = 280/525 (53%), Gaps = 64/525 (12%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYA 187
A YT V P ++N + W+ +A L E P+ L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNTRLAQQFGLP----EAPNDELLASLSGQQLPADFSPVA 72
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSS+
Sbjct: 73 MKYAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSL 132
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG +
Sbjct: 133 REYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHF 185
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q ++ LAD I HF TS YAAW
Sbjct: 186 EHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPYAAW 222
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 223 FSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQ 282
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 283 G-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGL 339
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 340 ATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD---- 388
Query: 545 RKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ AE G
Sbjct: 389 -REAAKTWLTRYLERAARELGQEGRPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERG 447
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 448 DFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|262171075|ref|ZP_06038753.1| UPF0061 domain-containing protein [Vibrio mimicus MB-451]
gi|261892151|gb|EEY38137.1| UPF0061 domain-containing protein [Vibrio mimicus MB-451]
Length = 489
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 205/525 (39%), Positives = 281/525 (53%), Gaps = 66/525 (12%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFP-----LFFSGATPLAGAVP 185
A YT + P +EN + W+ +A +EF P+ P SG A P
Sbjct: 19 AFYTSIRPQL-LENVRWGMWNAPLA-------QEFGLPEVPNSELLAALSGQQLPADFAP 70
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
A Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRS
Sbjct: 71 LAMKYAGHQFGVYNPDLGDGRGLLLAEMASKTGDVYDIHLKGAGLTPYSRMGDGRAVLRS 130
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIRE+LCSEAM LGI TTRAL L+ + V R+ +EE GA++ RVA S +RFG
Sbjct: 131 SIREYLCSEAMAGLGIATTRALALMNSDTPVYRE-------REERGALLVRVAPSHIRFG 183
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ H ++ ++ + LAD I HF ++ YA
Sbjct: 184 HFE-HFYYTEQHTEL-KLLADKVIEWHF---------------------PTCAQSAKPYA 220
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 221 DWFHQVVERTALMIAQWQVYGFNHGVMNTDNMSILGQTFDYGPFAFLDDYDPNFICNHSD 280
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F QP IGLWN++ + L + LI+ + +E Y + +M KL
Sbjct: 281 YQG-RYAFDQQPRIGLWNLSALAHAL--SPLIEKADLEAALESYSEHLNRYFSQLMRAKL 337
Query: 486 GLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDI 541
GL + ++ + +A + DYT F R LS + + +AV+ L +
Sbjct: 338 GLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQST----------EAVIDLVV 387
Query: 542 GKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
++ +AW++ L +EL G IS ER M VNPKY+LRNYL Q AI+ AE G
Sbjct: 388 DRQAAKAWLTRYLERAARELGQDGQPISQVERCQAMRQVNPKYILRNYLAQQAIELAERG 447
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF E++ L +++ PYDE P E YA+LPP W + +SCSS
Sbjct: 448 DFQEMQCLAQVLATPYDEHPEFEHYAKLPPEWGKK---LEISCSS 489
>gi|114320205|ref|YP_741888.1| hypothetical protein Mlg_1045 [Alkalilimnicola ehrlichii MLHE-1]
gi|121957660|sp|Q0A9T9.1|Y1045_ALHEH RecName: Full=UPF0061 protein Mlg_1045
gi|114226599|gb|ABI56398.1| protein of unknown function UPF0061 [Alkalilimnicola ehrlichii
MLHE-1]
Length = 494
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 198/503 (39%), Positives = 273/503 (54%), Gaps = 50/503 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +V P+ V P LV +E +A++L L+ F+G GA P A Y G
Sbjct: 21 FARVRPTP-VAQPGLVRLNEPLAEALGLEVAALRGKAGLAMFAGNRLPEGAEPIALAYAG 79
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG W QLGDGRA+ LGE+++ R ++QLKG+G TP+SR DG A + +RE+L
Sbjct: 80 HQFGQWVPQLGDGRAVLLGEVVDRDGRRRDIQLKGSGITPFSRGGDGRAPIGPVVREYLA 139
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTR+L VTTG+ V R+ + EPG I+ RVA S +R G+++
Sbjct: 140 SEAMHALGIPTTRSLAAVTTGEPVLRE-------RVEPGGILTRVAHSHVRVGTFEYFHW 192
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R ED+D +RTLADY I H+ + + + + A V
Sbjct: 193 R--EDVDALRTLADYVIARHYPELAD---------------------DARPHLALLKAVI 229
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
+RTA LVA W VGF HGV+NTDN S++G T+DYGPFGFLDA+ P + D+ RY
Sbjct: 230 DRTAELVAHWISVGFIHGVMNTDNTSLVGETLDYGPFGFLDAYHPRTCYSAIDIEN-RYA 288
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE----ANYVMERYGTKFMDEYQAIMTKKLGLP 488
F QP I WN+ + + TL D+ E A + + +F + A + KLGL
Sbjct: 289 FDQQPRIAHWNLTRLAETLLPLLHEDEDEAVARAGEALNGFLPRFEACHHARLRAKLGLA 348
Query: 489 KYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ + + +LL+ MA + D+T FRALS+ + D D P + R
Sbjct: 349 ESRRGDIDLAHELLDLMARQQADFTQVFRALSDERMD-----DPDEGPARRCF-----AR 398
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEVR 604
EA W +IQ L G + R+A M +VNPK++LRN+L Q A+DAA E GDFG +
Sbjct: 399 PEALDGWRARWIQRLRQEGRPEPARQAAMRAVNPKFILRNHLAQWAVDAATERGDFGPMD 458
Query: 605 RLLKLMERPYDEQPGMEKYARLP 627
RLL+++ RPYD QP E A P
Sbjct: 459 RLLQVLTRPYDPQPEAEALAAPP 481
>gi|456063293|ref|YP_007502263.1| hypothetical protein D521_0960 [beta proteobacterium CB]
gi|455440590|gb|AGG33528.1| hypothetical protein D521_0960 [beta proteobacterium CB]
Length = 488
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 208/515 (40%), Positives = 274/515 (53%), Gaps = 70/515 (13%)
Query: 148 VAWSESVADSLELD------PKEFERPDFPLFFSGATPLAGAV----PYAQCYGGHQFGM 197
VA+S SVA L L+ PK+ P++ +G G + P + Y GHQFG
Sbjct: 25 VAFSPSVAKLLNLELGDDGLPKD---PEWLEVLAGNQLNVGELIFSDPISTAYSGHQFGS 81
Query: 198 WAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMH 257
WAGQLGDGRAI LG+I L ELQLKGAG+T YSR DG AVLRSSIREFLCSEAMH
Sbjct: 82 WAGQLGDGRAILLGDINQL-----ELQLKGAGRTHYSRMGDGRAVLRSSIREFLCSEAMH 136
Query: 258 FLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQED 317
LG+PT+RAL +V + + V R+ E A+ RVA SF+R G ++
Sbjct: 137 ALGLPTSRALAVVGSKQAVRRETI-------ETAAVCSRVAPSFIRIGHFE--------- 180
Query: 318 LDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTAS 377
HF ++N+ + + L+ + + T Y E++ R A
Sbjct: 181 --------------HFASLQNLTRLQELADLLIAKFYPECASTKEPYLNLFKEISARNAK 226
Query: 378 LVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQP 437
LVA WQ VGF HGVLN+DN+S LGLTIDYGPFGFLD F+ N +D GR Y + QP
Sbjct: 227 LVAGWQAVGFCHGVLNSDNISALGLTIDYGPFGFLDQFEIDHICNHSDHSGR-YSYHRQP 285
Query: 438 DIGLWNIAQF-STTLAAAKLIDDKEANYVM-----ERYGTKFMDEYQAIMTKKLGLPKYN 491
I WN+A S L +L E + + E + + E+Q KLGL
Sbjct: 286 QIMHWNMACLASAMLPLLELEHSAEESQALLRSALEEFPIIYAAEWQRAFRLKLGLQSQQ 345
Query: 492 KQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+I +LL M KVD+TNFFR+L VK D E + + +D ++
Sbjct: 346 DSDISLIERLLQAMHDSKVDFTNFFRSLGKVKKDSKSVE----ISQRDEFVD-----RKN 396
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
W Y+ L S +SD +RK LM+ VNPKY+LRNYL Q+AI+ A+ DF EV LL
Sbjct: 397 IDQWFADYLNRLQSEALSDVDRKTLMDKVNPKYILRNYLAQTAIEKAQHDDFSEVDALLT 456
Query: 609 LMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ P+DEQ ++Y++ PP R V SCSS
Sbjct: 457 ILSNPFDEQMEFDRYSKPPPLDMQRVAV---SCSS 488
>gi|398879325|ref|ZP_10634422.1| hypothetical protein PMI33_04158 [Pseudomonas sp. GM67]
gi|398196796|gb|EJM83790.1| hypothetical protein PMI33_04158 [Pseudomonas sp. GM67]
Length = 487
Score = 321 bits (823), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 214/551 (38%), Positives = 301/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F D D+ VL ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRF------DRLGDTFSAHVL---------PEPIDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P+F FSG A AVP A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAETPEFAELFSGHKLWADAVPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R++ S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMILRLSPSHVRFGHFEYFYYTKRPEKQ----KELGEHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LG +++++ +LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGFTTAEDDDQKLLEQLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKA 572
R L + + ++ L+ +DI + + +W Y+ + G + ++R+
Sbjct: 371 RRLGDESPELAVAR------LRDDFVDI-----KGFDAWAERYVARVAREGEVDQQQRRQ 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSNPFEEQPGMEGYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|424921036|ref|ZP_18344397.1| hypothetical protein I1A_000464 [Pseudomonas fluorescens R124]
gi|404302196|gb|EJZ56158.1| hypothetical protein I1A_000464 [Pseudomonas fluorescens R124]
Length = 487
Score = 321 bits (823), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 212/551 (38%), Positives = 299/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +++ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFENRFAR--LGD------------AFSAHVLPEP-MDNPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L+P + +F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLEPTTADTQEFAELFGGHKLWADAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNNAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IP++RA C++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPSSRAACVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L ++ + H+
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKRPEQQ----KQLGEHVLAMHY--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LG +++++ LL M VDYT FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGFTIAEDDDQKLLEDLLQLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + A+ ++ L+ +DI + + +W Y+ + G SD E+R+
Sbjct: 371 RRLGDQSAEQAVAR------LRDDFVDI-----KGFDAWGERYVARVARDGDSDQEQRRT 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ +P+DEQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAEQGDYSEVRRLHAVLSKPFDEQPGMEGYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|212538009|ref|XP_002149160.1| YdiU domain protein [Talaromyces marneffei ATCC 18224]
gi|210068902|gb|EEA22993.1| YdiU domain protein [Talaromyces marneffei ATCC 18224]
Length = 647
Score = 321 bits (823), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 221/591 (37%), Positives = 304/591 (51%), Gaps = 88/591 (14%)
Query: 109 HSFVRELPGDP------RTDSIPREVLH------ACYTKVSPSAEVENPQLVAWSESVAD 156
++F +LP DP ++ PRE L A YT V P E P+L+ S +
Sbjct: 45 NTFTSKLPPDPAFETPKQSHDAPRETLGPRIVKGAMYTYVRPET-AEEPELLGVSPRAME 103
Query: 157 SLELDPKEFERPDFPLFFSGATPL-----AGAVPYAQCYGGHQFGMWAGQLGDGRAITLG 211
L L P E + DF +G L G P+AQCYGG QFG WAGQLGDGRAI+L
Sbjct: 104 DLGLQPGEEKTEDFVSLVAGNKILWNEEEGGVYPWAQCYGGWQFGAWAGQLGDGRAISLC 163
Query: 212 EILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLV 270
E+ N + R+ELQLKGAG+TPYSRFADG AVLRSSIRE++ SEA+ LGIPTTRAL L
Sbjct: 164 ELTNPSTNVRYELQLKGAGRTPYSRFADGKAVLRSSIREYVVSEALDALGIPTTRALSLT 223
Query: 271 TTGKF-VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAI 329
K V R+ EPGAIV R AQS+LR GS+ I SR + DL VR LA Y
Sbjct: 224 LLPKSKVLRERI-------EPGAIVARFAQSWLRIGSFDILHSRNERDL--VRQLATYIA 274
Query: 330 RHHFRHIENMNKSESL---SFSTGD-------------EDHSVVDLTSNKYAAWAVEVAE 373
F E++ +L S+GD E N++ E+
Sbjct: 275 EDVFPGWESLPGVVNLPNEGSSSGDVNVDDPPRGIPAAELQGKEGQEENRFTRLYREIVR 334
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
R A VA WQ GF +GVLNTDN SI GL++D+GPF F+D FDPS+TPN D RY +
Sbjct: 335 RNAKTVAAWQAYGFMNGVLNTDNTSIFGLSLDFGPFAFMDNFDPSYTPNHDD-HYLRYSY 393
Query: 434 ANQPDIGLWNIAQ----FSTTLAAAKLIDDKE---------------------ANYVMER 468
NQP + WN+ + F + A+ +DD+E N E
Sbjct: 394 KNQPSVIWWNLVRLGEAFGELIGGAERVDDEEFITKGVTEEFGQILIKRAETIINRTCEE 453
Query: 469 YGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y F +EY +M+++LGL + + S+LL+ M ++D+ +FFR LS++ +
Sbjct: 454 YRAVFKNEYVRLMSRRLGLLTSKESDFETLFSELLDTMEHLELDFNHFFRRLSDIGTEEL 513
Query: 525 IPEDELLVPLKAVLLDIGK-----------ERKEAWIS-WVLSYIQELLSSGISDEERKA 572
+++ K + G +R AW+S W ++ G +D+ERK
Sbjct: 514 ETDEQRQAIAKRFFHNEGVGGVGNTEESTCKRIAAWLSLWKDRIHEDWKQDGRTDQERKE 573
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAE-LGDFGEVRRLLKLMERPYDEQPGMEK 622
LM SVNPK++ R+++ I+ E GD + R+++ P+ E G++K
Sbjct: 574 LMKSVNPKFIPRSWILDEVIERVEHKGDRQILGRVMQYALNPFQEDWGVDK 624
>gi|355643207|ref|ZP_09053150.1| hypothetical protein HMPREF1030_02236 [Pseudomonas sp. 2_1_26]
gi|392986700|ref|YP_006485287.1| hypothetical protein PADK2_26610 [Pseudomonas aeruginosa DK2]
gi|419751732|ref|ZP_14278142.1| hypothetical protein CF510_01856 [Pseudomonas aeruginosa
PADK2_CF510]
gi|420142229|ref|ZP_14649850.1| hypothetical protein PACIG1_5363 [Pseudomonas aeruginosa CIG1]
gi|421163635|ref|ZP_15622334.1| hypothetical protein PABE173_5864 [Pseudomonas aeruginosa ATCC
25324]
gi|421183104|ref|ZP_15640569.1| hypothetical protein PAE2_5055 [Pseudomonas aeruginosa E2]
gi|424944179|ref|ZP_18359942.1| conserved hypothetical protein [Pseudomonas aeruginosa NCMG1179]
gi|346060625|dbj|GAA20508.1| conserved hypothetical protein [Pseudomonas aeruginosa NCMG1179]
gi|354829867|gb|EHF13928.1| hypothetical protein HMPREF1030_02236 [Pseudomonas sp. 2_1_26]
gi|384401808|gb|EIE48161.1| hypothetical protein CF510_01856 [Pseudomonas aeruginosa
PADK2_CF510]
gi|392322205|gb|AFM67585.1| hypothetical protein PADK2_26610 [Pseudomonas aeruginosa DK2]
gi|403245003|gb|EJY58838.1| hypothetical protein PACIG1_5363 [Pseudomonas aeruginosa CIG1]
gi|404528228|gb|EKA38338.1| hypothetical protein PABE173_5864 [Pseudomonas aeruginosa ATCC
25324]
gi|404540804|gb|EKA50193.1| hypothetical protein PAE2_5055 [Pseudomonas aeruginosa E2]
Length = 486
Score = 321 bits (822), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 301/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R D+ EVL P AE P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR------LGDAFSTEVL------PDPIAE---PRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|386061196|ref|YP_005977718.1| hypothetical protein PAM18_5138 [Pseudomonas aeruginosa M18]
gi|347307502|gb|AEO77616.1| hypothetical protein PAM18_5138 [Pseudomonas aeruginosa M18]
Length = 486
Score = 321 bits (822), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 213/548 (38%), Positives = 301/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R GD A T+V P + P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR--LGD------------AFSTEVLPDP-IAEPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQVG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|398881963|ref|ZP_10636935.1| hypothetical protein PMI32_00613 [Pseudomonas sp. GM60]
gi|398199682|gb|EJM86617.1| hypothetical protein PMI32_00613 [Pseudomonas sp. GM60]
Length = 487
Score = 321 bits (822), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 214/551 (38%), Positives = 302/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F D D+ VL ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRF------DRLGDTFSAHVL---------PEPIDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E + P+F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEADTPEFAELFSGHKLWADAIPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R++ S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMILRLSPSHVRFGHFEYFYYTKRPEKQ----KELGEHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ +LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLLEQLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDE-ERKA 572
R L + ++ L+ +DI + + +W Y+ + G D+ +R+
Sbjct: 371 RRLGEESPELAVAR------LRDDFVDI-----KGFDAWAERYVARVAREGEVDQPQRRQ 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ +P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSKPFEEQPGMEGYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|407366891|ref|ZP_11113423.1| hypothetical protein PmanJ_23957 [Pseudomonas mandelii JR-1]
Length = 487
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 212/551 (38%), Positives = 295/551 (53%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L++L +D+ F R D+ VL ++NP+LV S + L
Sbjct: 1 MKPLDELTFDNRFAR------LGDTFSAHVL---------PEPIDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + +F FSG A AVP A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVADTREFAELFSGHKLWADAVPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA++ L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALNALSIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKRPEKQ----KELGEHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQALTPFIS 312
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
+D + Y + Y +M ++LG +++++ LL M VDYT FF
Sbjct: 313 VDALRETLGL--YLPLYQAHYLDLMRRRLGFTTAEDDDQKLLEHLLQLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKA 572
R L + + ++ L+ +DI + + +W Y+ + G I E+R+
Sbjct: 371 RRLGDESPELAVAR------LRDDFVDI-----KGFDAWAELYVARVAREGEIDQEQRRK 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P+DEQ GME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSHPFDEQAGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|237798193|ref|ZP_04586654.1| hypothetical protein POR16_05064 [Pseudomonas syringae pv. oryzae
str. 1_6]
gi|331021045|gb|EGI01102.1| hypothetical protein POR16_05064 [Pseudomonas syringae pv. oryzae
str. 1_6]
Length = 487
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 220/553 (39%), Positives = 303/553 (54%), Gaps = 74/553 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++ P+LV S+S L
Sbjct: 1 MKALDELIFDNRFAR--LGD------------AFSAHVLPEP-IDAPRLVVASQSALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L P++ + P F FSG A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLVPEQADLPLFAEIFSGHKLWAEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVYNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RA C+V++ V R
Sbjct: 106 EHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAACVVSSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ +E A+V R+AQS +RFGS + Q + + TLA++ + H+ +
Sbjct: 166 E-------TQEHAAMVLRLAQSHVRFGSLEYFFYTKQPEH--LNTLAEHVLTMHYPQCQE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ Y A E+ ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 QPE---------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GPF FLD FD F N +D G RY F+NQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPFAFLDDFDEHFICNHSDHEG-RYSFSNQVPIAQWNLSALAQALTPFVSVE 314
Query: 459 DKEANYVMERYGTKFMDEYQA----IMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTN 511
+ E G F+ YQA +M ++LGL +Q +IS+LL M VDYT
Sbjct: 315 A-----LRETIGL-FLPLYQAHYLDLMRRRLGLTGAEEQDDKLISQLLQLMQNSGVDYTL 368
Query: 512 FFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQEL-LSSGISDEER 570
FFR L + P E L L+ +DI + + W Y + L ++++R
Sbjct: 369 FFRRLGDQ------PAAEALRSLRDDFVDI-----KGFDGWAEKYQARIALEESGTEQDR 417
Query: 571 KALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAW 630
+A M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ P+ EQPGM+ YA+ PP W
Sbjct: 418 QARMHAVNPLYILRNYLAQNAIAAAEKGDYEEVRRLHQVLCTPFTEQPGMQGYAQRPPDW 477
Query: 631 AYRPGVCMLSCSS 643
+SCSS
Sbjct: 478 GKH---LEISCSS 487
>gi|107104123|ref|ZP_01368041.1| hypothetical protein PaerPA_01005196 [Pseudomonas aeruginosa PACS2]
Length = 486
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 212/548 (38%), Positives = 301/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R GD A T+V P + P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR--LGD------------AFSTEVLPDP-IAEPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQVG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ E+R L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEIRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|115385943|ref|XP_001209518.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114187965|gb|EAU29665.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 619
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 217/597 (36%), Positives = 314/597 (52%), Gaps = 88/597 (14%)
Query: 101 ALEDLNWDHSFVRELPGDP------------RTDSIPREVLHACYTKVSPSAEVENPQLV 148
+L DL + F +LP DP R PR V A YT V P E P+L+
Sbjct: 13 SLGDLPKSNVFTSKLPADPAFETPEDSHRAPRETLGPRMVKGALYTFVRPEP-AEEPELL 71
Query: 149 AWSESVADSLELDPKEFERPDFPLFFSGATPL-----AGAVPYAQCYGGHQFGMWAGQLG 203
S + L L P E E P+F +G G P+AQCYGG QFG WAGQLG
Sbjct: 72 GVSPKAMEDLGLKPGEEETPEFKELVAGNKMFWDEERGGIYPWAQCYGGWQFGTWAGQLG 131
Query: 204 DGRAITLGEILNLKSER-WELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIP 262
DGRAI+L E N +++R +ELQLKGAG+TPYSRFADG AVLRSSIRE++ SEA+ LG+P
Sbjct: 132 DGRAISLFESTNPETKRRYELQLKGAGRTPYSRFADGKAVLRSSIREYIVSEALSALGVP 191
Query: 263 TTRALCLVTTGKF-VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
TTRAL L K V R+ EPGAIV R A++++R G++ I +RG D D++
Sbjct: 192 TTRALSLTLLPKSKVLRERI-------EPGAIVARFAETWIRIGTFDILRARG--DRDLI 242
Query: 322 RTLADYAIRHHFRHIENMNKSESLSF------STGDEDHSVV--------DLTSNKYAAW 367
R LA + E + + +L+ + + D + D+ N++A
Sbjct: 243 RKLATFVAEDVLGGWEALPSAVTLAKDQLQPEAVDNPDRGLAWDHIQKHEDVEENRFARL 302
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
E+A R A VA WQ GF +GVLNTDN SI GL++DYGPF F+D FDP +TPN D
Sbjct: 303 YREIARRNAKTVAAWQAYGFMNGVLNTDNTSIYGLSLDYGPFAFMDNFDPQYTPNHDD-H 361
Query: 428 GRRYCFANQPDIGLWNIAQFSTTL-----AAAKLIDD----------------KEANYVM 466
RY + NQP I WN+ + +L A AK+ ++ K A ++
Sbjct: 362 MLRYSYKNQPTIIWWNLVRLGESLGELIGAGAKVDEETFVKEGLTEEAAPAVIKLAEDII 421
Query: 467 ERYG----TKFMDEYQAIMTKKLGLPKYN----KQIISKLLNNMAVDKVDYTNFFRALSN 518
+R G T F++EY+ +M ++LGL +++ S+LL+ + ++D+ +FFR LSN
Sbjct: 422 DRTGNEFRTVFLNEYKRLMNRRLGLKTQKESDFQELYSELLDTLEALELDFNHFFRRLSN 481
Query: 519 VKADPSIPEDEL--LVPL---------KAVLLDIGKERKEAWI-SWVLSYIQELLSSGIS 566
V ED+ + P D +ER W+ SW + +++ + +
Sbjct: 482 VPLSELDTEDKRKEVAPRFFHAEGFGGIGYTEDSARERIAKWLESWRVRVLEDWGQN--N 539
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAEL-GDFGEVRRLLKLMERPYDEQPGMEK 622
DEER+ M VNP ++ R ++ I+ E GD + R++++ P++E+ G+ K
Sbjct: 540 DEERQKAMKGVNPNFIPRGWILDEVIERVERKGDRAVLGRVMQMALNPFEEEWGLNK 596
>gi|451982889|ref|ZP_21931189.1| Selenoprotein O and cysteine-containing homologs [Pseudomonas
aeruginosa 18A]
gi|451759445|emb|CCQ83712.1| Selenoprotein O and cysteine-containing homologs [Pseudomonas
aeruginosa 18A]
Length = 486
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 300/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R D+ EVL P AE P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR------LGDAFSTEVL------PDPIAE---PRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGHTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|254244093|ref|ZP_04937415.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
gi|421156542|ref|ZP_15615987.1| hypothetical protein PABE171_5369 [Pseudomonas aeruginosa ATCC
14886]
gi|126197471|gb|EAZ61534.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
gi|404518977|gb|EKA29771.1| hypothetical protein PABE171_5369 [Pseudomonas aeruginosa ATCC
14886]
Length = 486
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 300/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R D+ EVL P AE P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR------LGDAFSTEVL------PDPIAE---PRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + + ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDHALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|313110060|ref|ZP_07795963.1| hypothetical protein PA39016_002220009 [Pseudomonas aeruginosa
39016]
gi|386063460|ref|YP_005978764.1| hypothetical protein NCGM2_0489 [Pseudomonas aeruginosa NCGM2.S1]
gi|310882465|gb|EFQ41059.1| hypothetical protein PA39016_002220009 [Pseudomonas aeruginosa
39016]
gi|348032019|dbj|BAK87379.1| hypothetical protein NCGM2_0489 [Pseudomonas aeruginosa NCGM2.S1]
Length = 486
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 301/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R D+ EVL P AE P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR------LGDAFSTEVL------PDPIAE---PRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A++ R+A S +RFG ++ Q D ++ LA + HHF +
Sbjct: 166 E-------KKESAAMLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVQEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|94310802|ref|YP_584012.1| hypothetical protein Rmet_1864 [Cupriavidus metallidurans CH34]
gi|121957843|sp|Q1LM83.1|Y1864_RALME RecName: Full=UPF0061 protein Rmet_1864
gi|93354654|gb|ABF08743.1| conserved hypothetical protein [Cupriavidus metallidurans CH34]
Length = 544
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 214/538 (39%), Positives = 286/538 (53%), Gaps = 80/538 (14%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFER----PDFPLFFSGATPLAGAVPYAQ 188
+T++SP+ + +P LV+ + + A L + + + P F F G A P A
Sbjct: 60 FTRLSPT-PLPSPYLVSVAPAAAALLGWNETDLQDAVKDPAFIDSFVGNAVPDWADPLAT 118
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGRAI L E WE+QLKG G TPYSR ADG AVLRSSIR
Sbjct: 119 VYSGHQFGVWAGQLGDGRAIRLAEA-QTPGGPWEIQLKGGGLTPYSRMADGRAVLRSSIR 177
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E+LCSEAM+ LG+PTTRAL ++ + V R+ E A+V R+A SF+RFG ++
Sbjct: 178 EYLCSEAMYALGVPTTRALSIIGSDAPVRRETI-------ETSAVVTRLAPSFIRFGHFE 230
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
A+R ED +R LAD+ I + + N +N Y A
Sbjct: 231 HFAAR--EDHASLRQLADFVIDNFYPACRN---------------------AANPYQALL 267
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V+ TA +VA WQ VGF HGV+NTDNMSILGLTIDYGPFGFLDAFD + N +D G
Sbjct: 268 RDVSLLTADMVAHWQAVGFCHGVMNTDNMSILGLTIDYGPFGFLDAFDANHICNHSDQQG 327
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYV--------------------MER 468
RY ++ QP + WN+ LA A L ++AN +R
Sbjct: 328 -RYAYSQQPQVAFWNL----HCLAQALLPLWRDANAADPEAEKAAAVEAAREALDPFRDR 382
Query: 469 YGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
Y F Y+A KLGL +Q +++ L + ++VDYT+F+R LS V S
Sbjct: 383 YAEAFFRHYRA----KLGLRSEQEQDETLMTNLFRVLHENRVDYTSFWRNLSRV----SS 434
Query: 526 PEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRN 585
++ ++ + LD A Y L S D R M + NPKYVLRN
Sbjct: 435 LDNSHDAAVRDLFLDRAAWDAWA-----AEYRARLQSEQSDDAARTTAMLATNPKYVLRN 489
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ ++AI AA DF EV RL+ ++ +P+DEQP E YA+LPP WA +SCSS
Sbjct: 490 HMAETAIRAARDKDFSEVDRLMAVLSKPFDEQPEAESYAKLPPDWA---SGLEVSCSS 544
>gi|421354603|ref|ZP_15804935.1| hypothetical protein VCHE45_1953 [Vibrio cholerae HE-45]
gi|395953728|gb|EJH64341.1| hypothetical protein VCHE45_1953 [Vibrio cholerae HE-45]
Length = 489
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 205/525 (39%), Positives = 276/525 (52%), Gaps = 64/525 (12%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYA 187
A YT V P ++N + W+ +A L E P+ L SG A P A
Sbjct: 18 QAFYTPVQPQP-LQNVRWGMWNTRLAQQFGLP----EAPNDELLASLSGQQLPADFSPVA 72
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSSI
Sbjct: 73 MKYAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSI 132
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG +
Sbjct: 133 REYLCSEAMAGLGIATTRALALISSETPVYRE-------REERGALLVRLAHTHVRFGHF 185
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q ++ LAD I HF TS YA W
Sbjct: 186 EHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPYAVW 222
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 223 FSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQ 282
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + LID + + Y + +M KLGL
Sbjct: 283 G-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSDHLNLHFSRLMRAKLGL 339
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ ++ + +A + DYT F R LS + + ++D+ +
Sbjct: 340 ATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN-----------EAVIDLVLD 388
Query: 545 RKEAWISWVLSYI----QELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
R+ A I W+ Y+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE G
Sbjct: 389 REAAKI-WLTRYLDRAARELGQEGGPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERG 447
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 448 DFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|261210570|ref|ZP_05924863.1| UPF0061 domain-containing protein [Vibrio sp. RC341]
gi|260840355|gb|EEX66926.1| UPF0061 domain-containing protein [Vibrio sp. RC341]
Length = 489
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 199/522 (38%), Positives = 277/522 (53%), Gaps = 64/522 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQCY 190
YT P ++N + W+ ++A L E P+ L SG G P A Y
Sbjct: 21 YTSSRPQP-LKNVRWGMWNAALAQDFALP----EVPNDELLASLSGQQLAVGFAPLAMKY 75
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + L E++ + E +++ LKGAG TPYSR DG AVLRSSIRE+
Sbjct: 76 AGHQFGVYNPDLGDGRGLLLAEMVTKQGEVFDIHLKGAGLTPYSRMGDGRAVLRSSIREY 135
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGI TTRAL L+ + V R+ +EE GA++ R+AQS +RFG ++ H
Sbjct: 136 LCSEAMAGLGIATTRALALMVSDTPVYRE-------REERGALLVRLAQSHIRFGHFE-H 187
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
++ ++ + LAD I HF ++ YA W +
Sbjct: 188 LFYTEQHTEL-KLLADKVIEWHFPDCAK---------------------SAKPYANWFQQ 225
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D G R
Sbjct: 226 IVERTALMIAQWQVYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQG-R 284
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
Y F QP IGLWN++ + L + L++ + + Y + +M KLGL
Sbjct: 285 YAFDQQPRIGLWNLSALAHAL--SPLVEKADLETALASYSDHLNVHFSQLMRAKLGLATQ 342
Query: 491 NK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
+ ++ + + + DYT F R LS + + +L++ +E
Sbjct: 343 QEGDGELFADFFALLTNNHTDYTRFLRELSCLDRQGNEAVTDLVLD------------RE 390
Query: 548 AWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
A +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ AE GDF
Sbjct: 391 AAKTWLTRYLERAARELGQEGRPISSSERCQAMRQVNPKYILRNYLAQQAIEFAERGDFE 450
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 451 EMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|410633034|ref|ZP_11343681.1| hypothetical protein GARC_3594 [Glaciecola arctica BSs20135]
gi|410147203|dbj|GAC20548.1| hypothetical protein GARC_3594 [Glaciecola arctica BSs20135]
Length = 483
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 201/546 (36%), Positives = 292/546 (53%), Gaps = 78/546 (14%)
Query: 109 HSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFE-R 167
HSF +EL A ++V P V N +L ++ ++A L L P E++
Sbjct: 5 HSFAQELT--------------ALGSEVKPIKLV-NSRLAVFNHNLAAELNL-PFEWQLE 48
Query: 168 PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKG 227
D + AQ YGGHQFG W +LGDGR + L E+++ +++ W+L LKG
Sbjct: 49 ADLFKALYADNGVLNKCTVAQKYGGHQFGHWNPELGDGRGLLLAEVIDEQNQPWDLHLKG 108
Query: 228 AGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPK 287
AG TPYSRFADG AVLRS+IRE+L SEA+H+LGIPT+RALCL+T+ + V R+ K
Sbjct: 109 AGPTPYSRFADGRAVLRSTIREYLASEALHYLGIPTSRALCLITSDEPVYRE-------K 161
Query: 288 EEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESL 345
+E A + RV QS LRFG ++ H+ + Q+ ++ L DY ++HF+ K++S
Sbjct: 162 QEQAAKMIRVCQSHLRFGHFEYFYHSKQPQK----LQNLFDYCFKYHFKEC---TKADS- 213
Query: 346 SFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTID 405
Y A ++ TA L+A+WQ GF HGV+NTDNMSI G+T D
Sbjct: 214 -----------------PYLAMLEKIVHDTAKLIAKWQAFGFNHGVMNTDNMSIHGITFD 256
Query: 406 YGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYV 465
YGP+ FLD F+P+F N +D P RY F +QP +GLWN+ + A ++ ++
Sbjct: 257 YGPYAFLDDFEPTFICNHSD-PQGRYSFDSQPGVGLWNLNALAQ--AFTPYLEIEQIKQA 313
Query: 466 MERYGTKFMDEYQAIMTKKLGL------PKYNKQIISKLLNNMAVDKVDYTNFFRALS-- 517
+ Y + EY +M KLGL + N II+ L+ +AV+K DY+ FR LS
Sbjct: 314 LSNYEPTLLKEYSRLMHNKLGLLPGSSNGEANTHIINTWLDILAVEKKDYSATFRQLSQF 373
Query: 518 NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
++ +D D+ I +ER + W Y L+ GIS R+A M
Sbjct: 374 DIFSDNQSLRDQF----------INRERFDEWAK---HYTLALMEQGISQTLRQAKMRRH 420
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVC 637
NP +LRNYL Q ID AE G+F + + +++PY+E +K++ PP W +
Sbjct: 421 NPHILLRNYLTQQVIDRAEEGNFDMFHQFIAALKKPYEEIEEYQKFSAPPPDWGKQ---L 477
Query: 638 MLSCSS 643
+SCSS
Sbjct: 478 EISCSS 483
>gi|383813981|ref|ZP_09969404.1| hypothetical protein SPM24T3_06493 [Serratia sp. M24T3]
gi|383297179|gb|EIC85490.1| hypothetical protein SPM24T3_06493 [Serratia sp. M24T3]
Length = 480
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 210/541 (38%), Positives = 290/541 (53%), Gaps = 66/541 (12%)
Query: 106 NWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEF 165
++H + +LPG YT++ P+ ++ +L+ S +A+ L L+ F
Sbjct: 3 QFEHQYFDQLPG--------------FYTELQPTP-LQGARLLYHSAPLAEELGLESSLF 47
Query: 166 ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQL 225
+ ++SG G P AQ Y GHQFG WAGQLGDGR + LGE + L
Sbjct: 48 TVEN-SAYWSGEKLFPGMRPLAQVYSGHQFGQWAGQLGDGRGLLLGEQKLADGSSLDWHL 106
Query: 226 KGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGN 285
KGAG TPYSR DG AVLRS +REFL SEA+H+LG+PTTRAL +VT+ + V R+
Sbjct: 107 KGAGLTPYSRMGDGRAVLRSVVREFLASEALHYLGVPTTRALSIVTSNEPVYRE------ 160
Query: 286 PKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESL 345
+ E GA++ RVA S +RFG ++ R Q + V LADY I H + + +
Sbjct: 161 -QAERGAMLVRVAPSHIRFGHFEHFYYRKQPEQ--VAMLADYCIEHFWPQLRD------- 210
Query: 346 SFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTID 405
+++Y W +V ERTA L+AQWQ VGF HGV+NTDNMSILGLTID
Sbjct: 211 --------------GADRYLQWFTDVVERTARLMAQWQSVGFAHGVMNTDNMSILGLTID 256
Query: 406 YGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYV 465
YGP+GFLD + P F N TD G RY F NQP + WN+ + + +L+ L+ +E
Sbjct: 257 YGPYGFLDDYKPDFICNHTDSQG-RYSFDNQPSVAYWNLHRLAQSLSG--LLSTEELQQA 313
Query: 466 MERYGTKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKAD 522
+ Y M EY +M KLG NKQ +++ LL+ MA + DYT FR LS ++
Sbjct: 314 LAAYEPALMIEYGKLMRAKLGFFTENKQDNSVLTGLLSLMANEGRDYTRTFRLLSEIRL- 372
Query: 523 PSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
DE ++ +D +EA+ W SY Q LL D R+ M NP+ +
Sbjct: 373 -----DEERSAMRDEFID-----REAFDLWYQSYRQRLLLEQQDDATRQQAMKKSNPRII 422
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCS 642
LRNYL Q AI+ AE D ++ L + ++ PY + +++A LPP W V SCS
Sbjct: 423 LRNYLAQQAIEGAEADDITRLQALHQALQDPYSDDSRFDEFAALPPDWGKHLEV---SCS 479
Query: 643 S 643
S
Sbjct: 480 S 480
>gi|398923018|ref|ZP_10660432.1| hypothetical protein PMI28_00006 [Pseudomonas sp. GM48]
gi|398175924|gb|EJM63662.1| hypothetical protein PMI28_00006 [Pseudomonas sp. GM48]
Length = 487
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 217/551 (39%), Positives = 301/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F D D+ VL ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRF------DRLGDAFSAHVL---------PEPIDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E P+F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPGVAETPEFAELFSGHKLWADAIPRAMVYSGHQFGFYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R++ S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMLLRLSPSHVRFGHFEYFYYTKRPEQQ----KELGDHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL +++++ +LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEDDDQKLLEQLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L + + +I L+ +D+ + + +W YI + G D E+R+
Sbjct: 371 RRLGDEAPEQAITR------LRDDFVDL-----KGFDAWGELYIARVAREGAPDQEQRRQ 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYTEVRRLHAVLCNPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|422923235|ref|ZP_16956393.1| hypothetical protein VCBJG01_1958 [Vibrio cholerae BJG-01]
gi|341644327|gb|EGS68552.1| hypothetical protein VCBJG01_1958 [Vibrio cholerae BJG-01]
Length = 489
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 207/525 (39%), Positives = 279/525 (53%), Gaps = 64/525 (12%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYA 187
A YT V P ++N + W+ +A L E P+ L SG A P A
Sbjct: 18 QAFYTPVHPQP-LQNVRWGMWNTRLAQQFGLP----EAPNDELLASLSGQHLPADFSPVA 72
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLRSS+
Sbjct: 73 MKYAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLRSSL 132
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG +
Sbjct: 133 REYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRFGHF 185
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q ++ LAD I HF TS YAAW
Sbjct: 186 EHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPYAAW 222
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +D +F N +D
Sbjct: 223 FSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDLNFICNHSDYQ 282
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + LID + + Y + + +M KLGL
Sbjct: 283 G-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAKLGL 339
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 340 ATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD---- 388
Query: 545 RKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ AE G
Sbjct: 389 -REAAKTWLTRYLERAARELGQEGRPISSSERCQAMRQVNPKYILRNYLAQQAIEFAERG 447
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 448 DFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|153829250|ref|ZP_01981917.1| conserved hypothetical protein [Vibrio cholerae 623-39]
gi|148875288|gb|EDL73423.1| conserved hypothetical protein [Vibrio cholerae 623-39]
Length = 508
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 193/468 (41%), Positives = 262/468 (55%), Gaps = 57/468 (12%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ +LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 89 PVAMKYAGHQFGVYNPELGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 148
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIRE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 149 SSIREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 201
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ H + ++ + LAD I HF TS Y
Sbjct: 202 GHFE-HFFYTDQHANL-KLLADKVIEWHFPDCVQ---------------------TSKPY 238
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 239 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 298
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 299 DYQG-RYAFDKQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 355
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 356 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD- 407
Query: 542 GKERKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ A
Sbjct: 408 ----REAAKTWLTRYLERAARELGQEGRPISTRERCQAMRQVNPKYILRNYLAQQAIEFA 463
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 464 ERGDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 508
>gi|116053171|ref|YP_793492.1| hypothetical protein PA14_66410 [Pseudomonas aeruginosa UCBPP-PA14]
gi|416876598|ref|ZP_11919337.1| hypothetical protein PA15_14686 [Pseudomonas aeruginosa 152504]
gi|421177277|ref|ZP_15634933.1| hypothetical protein PACI27_5496 [Pseudomonas aeruginosa CI27]
gi|122256814|sp|Q02EZ4.1|Y6641_PSEAB RecName: Full=UPF0061 protein PA14_66410
gi|115588392|gb|ABJ14407.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14]
gi|334840587|gb|EGM19237.1| hypothetical protein PA15_14686 [Pseudomonas aeruginosa 152504]
gi|404529921|gb|EKA39941.1| hypothetical protein PACI27_5496 [Pseudomonas aeruginosa CI27]
Length = 486
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 300/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R D+ EVL P AE P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR------LGDAFSTEVL------PDPIAE---PRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + F F G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLPAETSDEALFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVINQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A + R+A S +RFG ++ Q D ++ LA + HHF +
Sbjct: 166 E-------KKESAATLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVQEHHF---AD 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
N +E YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 214 CNAAE------------------RPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDAG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ ++ + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LDLFLPLYQAHYLDLMRRRLGLGVAAENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE L L+ +D +EA+ W +Y + + G E R+ M+
Sbjct: 373 LGE-----ETPE-RALASLRDDFVD-----REAFDRWAEAYRRRVEEEGGDQESRRRRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +++ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHQVLSRPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|104779648|ref|YP_606146.1| hypothetical protein PSEEN0369 [Pseudomonas entomophila L48]
gi|166232630|sp|Q1IG73.1|Y369_PSEE4 RecName: Full=UPF0061 protein PSEEN0369
gi|95108635|emb|CAK13329.1| conserved hypothetical protein [Pseudomonas entomophila L48]
Length = 486
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 210/550 (38%), Positives = 293/550 (53%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+ L +D+ F R GD A T+V P + +P+LV SE+ L
Sbjct: 1 MKSLDQLVFDNRFAR--LGD------------AFSTQVLPDP-IADPRLVVASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + + P F FSG A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLDPAQADLPVFAELFSGHKLWEEADPRAMVYSGHQFGSYNPRLGDGRGLLLGEVVNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSSATVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+A S +RFG ++ Q + R L D+ + H+
Sbjct: 166 E-------TRETAAMLLRLAHSHVRFGHFEYFYYTQQPEQQ--RLLIDHVLEQHYPECRE 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ F T + ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 217 AEQPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T D+GP+ FLD FD +F N +D G RY +ANQ I WN++ + L ++
Sbjct: 256 ILGITFDFGPYAFLDDFDANFICNHSDDRG-RYSYANQVPIAHWNLSALAQALTTVIEVE 314
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
KEA + Y ++D +M ++LGL + ++ +LL M VDYT FF
Sbjct: 315 PLKEALGLFLPLYQAHYLD----LMRRRLGLTTAEDDDMALVERLLQRMQSGGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L P E L ++ +D+ + +W + Y+ + E R+
Sbjct: 371 RKLGER------PVAEALKVVRDDFVDLA-----GFDAWGVEYLARCEREPGNAEGRRER 419
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M +VNP YVLRNYL Q AI+AAE GD+ EVRRL +++ RP++EQPGM+ YA PP W
Sbjct: 420 MQAVNPLYVLRNYLAQKAIEAAEAGDYSEVRRLHQVLSRPFEEQPGMQAYAERPPEWGKH 479
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 480 ---LEISCSS 486
>gi|425897143|ref|ZP_18873734.1| PF02696 family protein [Pseudomonas chlororaphis subsp.
aureofaciens 30-84]
gi|397884021|gb|EJL00507.1| PF02696 family protein [Pseudomonas chlororaphis subsp.
aureofaciens 30-84]
Length = 487
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 211/551 (38%), Positives = 298/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-IDRPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP+ + P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPEVAQSPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R++ S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMLLRMSPSHVRFGHFEYFYYTKRPEQQ----KQLGEHVLAMHFPAC 214
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 215 --LEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGVTFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + + F Y +M ++LGL +++++ +LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLFLPLFQAHYLDLMRRRLGLTSAEDEDQKLVERLLQLMQGSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE-RKA 572
R L N A+ ++ L+ +D ++ + +W Y + + +E R+
Sbjct: 371 RHLGNESAELAVAR------LRDDFVD-----RQGFDAWADLYKARVARDPVQGQELRRE 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ +P+++Q GM+ YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAEQGDYSEVRRLHAVLSKPFEQQAGMDSYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|398987504|ref|ZP_10692024.1| hypothetical protein PMI23_02453 [Pseudomonas sp. GM24]
gi|398150648|gb|EJM39230.1| hypothetical protein PMI23_02453 [Pseudomonas sp. GM24]
Length = 487
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 208/551 (37%), Positives = 295/551 (53%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNHFAH--LGD------------AFSAHVLPEP-IDNPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E +F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPTTAETNEFAELFGGHKLWADAEPRAMIYSGHQFGGYTPQLGDGRGLLLGEVYNTAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA++ L IP++RA C++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALYALNIPSSRAACVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHIRFGHFEYFYYTKRPEQQ----KQLGEHVLAMHFPEC 214
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 215 REQPEP---------------------YLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ +G WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPVGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++ G + +++++ LL M VDYT FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRFGFTTAEEDDQKLLEDLLQLMQNSGVDYTLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKA 572
R L A+ ++ L+ +DI + + +W Y+ + G +D E+R+A
Sbjct: 371 RRLGEESAEQAVAR------LRDDFVDI-----KGFDAWGERYVARVARDGDADQEQRRA 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ +P+++QPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAEQGDYSEVRRLHAVLSKPFEQQPGMEAYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|152984013|ref|YP_001351079.1| hypothetical protein PSPA7_5760 [Pseudomonas aeruginosa PA7]
gi|167016712|sp|A6VDE4.1|Y5760_PSEA7 RecName: Full=UPF0061 protein PSPA7_5760
gi|150959171|gb|ABR81196.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7]
Length = 486
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 300/548 (54%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K+L+DL++D+ F R D+ EVL P AE P+LV S + L
Sbjct: 1 MKSLDDLDFDNRFAR------LGDAFSTEVL------PDPIAE---PRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L + + P F F G + A P A Y GHQFG + +LGDGR + LGE+LN
Sbjct: 46 DLPAEASDEPVFAELFGGHKLWSEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVLNQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RA C++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRAACVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A++ R+A S +RFG ++ Q D ++ LA + + HHF
Sbjct: 166 E-------KKESAAMLLRLAPSHVRFGHFEYFYYTRQHDQ--LKQLAAFVLEHHF----- 211
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ YAA +V ER A L+A+WQ GF HGV+NTDNMS
Sbjct: 212 ----------------ADCGAAERPYAAMFRQVVERNAELIARWQAYGFCHGVMNTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD + N +D G RY F+NQ I WN+A + L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDANHICNHSDDSG-RYSFSNQVPIAHWNLAALAQALTPLVEVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLP---KYNKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ +E + + Y +M ++LGL + ++ ++ +LL M VDY+ FFR
Sbjct: 315 ELRAS--LELFLPLYQAHYLDLMRRRLGLGVAVENDQALVQELLQRMQGSAVDYSLFFRR 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L PE + L L+ +D +EA+ W +Y + + + G R+ M+
Sbjct: 373 LGE-----DAPE-QALARLRDDFVD-----REAFDRWGEAYRRRVEAEGGEQAARRQRMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
+VNP YVLRNYL Q AI+AAE GD+ EVR L +L+ RP++EQPGME++ R PP W
Sbjct: 422 AVNPLYVLRNYLAQQAIEAAEQGDYTEVRLLHRLLARPFEEQPGMERFTRRPPDWGRH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|153826372|ref|ZP_01979039.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
gi|149739850|gb|EDM54041.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
Length = 508
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 193/465 (41%), Positives = 261/465 (56%), Gaps = 51/465 (10%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ +LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 89 PVAMKYAGHQFGVYNPELGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 148
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 149 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 201
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ H + ++ + LAD I HF TS Y
Sbjct: 202 GHFE-HFFYTDQHANL-KLLADKVIEWHFPDCVQ---------------------TSKPY 238
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 239 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 298
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + +M K
Sbjct: 299 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSEHLNLHFSRLMRAK 355
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 356 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQ----RNEAVIDL---VLD- 407
Query: 542 GKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE G
Sbjct: 408 -REAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAERG 466
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 467 DFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 508
>gi|15641933|ref|NP_231565.1| hypothetical protein VC1931 [Vibrio cholerae O1 biovar El Tor str.
N16961]
gi|121587816|ref|ZP_01677574.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
gi|121727850|ref|ZP_01680917.1| conserved hypothetical protein [Vibrio cholerae V52]
gi|153818732|ref|ZP_01971399.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
gi|153822507|ref|ZP_01975174.1| conserved hypothetical protein [Vibrio cholerae B33]
gi|227082061|ref|YP_002810612.1| hypothetical protein VCM66_1855 [Vibrio cholerae M66-2]
gi|254849018|ref|ZP_05238368.1| conserved hypothetical protein [Vibrio cholerae MO10]
gi|298498032|ref|ZP_07007839.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
gi|360035814|ref|YP_004937577.1| hypothetical protein Vch1786_I1422 [Vibrio cholerae O1 str.
2010EL-1786]
gi|9656468|gb|AAF95079.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor
str. N16961]
gi|121547917|gb|EAX58000.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
gi|121629886|gb|EAX62300.1| conserved hypothetical protein [Vibrio cholerae V52]
gi|126510695|gb|EAZ73289.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
gi|126519981|gb|EAZ77204.1| conserved hypothetical protein [Vibrio cholerae B33]
gi|227009949|gb|ACP06161.1| conserved hypothetical protein [Vibrio cholerae M66-2]
gi|254844723|gb|EET23137.1| conserved hypothetical protein [Vibrio cholerae MO10]
gi|297542365|gb|EFH78415.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
gi|356646968|gb|AET27023.1| conserved hypothetical protein [Vibrio cholerae O1 str.
2010EL-1786]
Length = 508
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 193/468 (41%), Positives = 259/468 (55%), Gaps = 57/468 (12%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLR
Sbjct: 89 PVAMKYAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLR 148
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 149 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 201
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I HF TS Y
Sbjct: 202 GHFEHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPY 238
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 239 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 298
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 299 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 355
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 356 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD- 407
Query: 542 GKERKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ A
Sbjct: 408 ----REAAKTWLTRYLERAARELGQEGRPISTRERCQAMRQVNPKYILRNYLAQQAIEFA 463
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 464 ERGDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 508
>gi|229507975|ref|ZP_04397480.1| hypothetical protein VCF_003207 [Vibrio cholerae BX 330286]
gi|229511789|ref|ZP_04401268.1| hypothetical protein VCE_003198 [Vibrio cholerae B33]
gi|229518926|ref|ZP_04408369.1| hypothetical protein VCC_002953 [Vibrio cholerae RC9]
gi|229607520|ref|YP_002878168.1| hypothetical protein VCD_002432 [Vibrio cholerae MJ-1236]
gi|255745312|ref|ZP_05419261.1| UPF0061 domain-containing protein [Vibrio cholera CIRS 101]
gi|262156036|ref|ZP_06029156.1| UPF0061 domain-containing protein [Vibrio cholerae INDRE 91/1]
gi|379741762|ref|YP_005333731.1| hypothetical protein O3Y_09345 [Vibrio cholerae IEC224]
gi|417813975|ref|ZP_12460628.1| hypothetical protein VCHC49A2_2982 [Vibrio cholerae HC-49A2]
gi|417817712|ref|ZP_12464341.1| hypothetical protein VCHCUF01_2966 [Vibrio cholerae HCUF01]
gi|418334951|ref|ZP_12943865.1| hypothetical protein VCHC06A1_2282 [Vibrio cholerae HC-06A1]
gi|418338567|ref|ZP_12947461.1| hypothetical protein VCHC23A1_2927 [Vibrio cholerae HC-23A1]
gi|418346485|ref|ZP_12951247.1| hypothetical protein VCHC28A1_2271 [Vibrio cholerae HC-28A1]
gi|418350247|ref|ZP_12954978.1| hypothetical protein VCHC43A1_2911 [Vibrio cholerae HC-43A1]
gi|419826909|ref|ZP_14350408.1| hypothetical protein VCCP10336_2525 [Vibrio cholerae CP1033(6)]
gi|421317738|ref|ZP_15768306.1| hypothetical protein VCCP10325_2825 [Vibrio cholerae CP1032(5)]
gi|421321702|ref|ZP_15772255.1| hypothetical protein VCCP103811_2978 [Vibrio cholerae CP1038(11)]
gi|421325502|ref|ZP_15776026.1| hypothetical protein VCCP104114_2721 [Vibrio cholerae CP1041(14)]
gi|421329163|ref|ZP_15779673.1| hypothetical protein VCCP104215_2937 [Vibrio cholerae CP1042(15)]
gi|421333071|ref|ZP_15783548.1| hypothetical protein VCCP104619_2947 [Vibrio cholerae CP1046(19)]
gi|421336660|ref|ZP_15787121.1| hypothetical protein VCCP104821_2834 [Vibrio cholerae CP1048(21)]
gi|421340090|ref|ZP_15790522.1| hypothetical protein VCHC20A2_2452 [Vibrio cholerae HC-20A2]
gi|421348070|ref|ZP_15798447.1| hypothetical protein VCHC46A1_2761 [Vibrio cholerae HC-46A1]
gi|422897037|ref|ZP_16934487.1| hypothetical protein VCHC40A1_2064 [Vibrio cholerae HC-40A1]
gi|422903239|ref|ZP_16938215.1| hypothetical protein VCHC48A1_2047 [Vibrio cholerae HC-48A1]
gi|422907123|ref|ZP_16941927.1| hypothetical protein VCHC70A1_2113 [Vibrio cholerae HC-70A1]
gi|422913970|ref|ZP_16948476.1| hypothetical protein VCHFU02_2271 [Vibrio cholerae HFU-02]
gi|422926176|ref|ZP_16959190.1| hypothetical protein VCHC38A1_1998 [Vibrio cholerae HC-38A1]
gi|423145495|ref|ZP_17133089.1| hypothetical protein VCHC19A1_2274 [Vibrio cholerae HC-19A1]
gi|423150171|ref|ZP_17137485.1| hypothetical protein VCHC21A1_1944 [Vibrio cholerae HC-21A1]
gi|423153991|ref|ZP_17141172.1| hypothetical protein VCHC22A1_1979 [Vibrio cholerae HC-22A1]
gi|423157075|ref|ZP_17144168.1| hypothetical protein VCHC32A1_2271 [Vibrio cholerae HC-32A1]
gi|423160645|ref|ZP_17147585.1| hypothetical protein VCHC33A2_1979 [Vibrio cholerae HC-33A2]
gi|423165466|ref|ZP_17152195.1| hypothetical protein VCHC48B2_2075 [Vibrio cholerae HC-48B2]
gi|423731482|ref|ZP_17704785.1| hypothetical protein VCHC17A1_2144 [Vibrio cholerae HC-17A1]
gi|423768496|ref|ZP_17712910.1| hypothetical protein VCHC50A2_2041 [Vibrio cholerae HC-50A2]
gi|423895373|ref|ZP_17727120.1| hypothetical protein VCHC62A1_2274 [Vibrio cholerae HC-62A1]
gi|423930811|ref|ZP_17731514.1| hypothetical protein VCHC77A1_2056 [Vibrio cholerae HC-77A1]
gi|424002926|ref|ZP_17746001.1| hypothetical protein VCHC17A2_2424 [Vibrio cholerae HC-17A2]
gi|424006715|ref|ZP_17749685.1| hypothetical protein VCHC37A1_2184 [Vibrio cholerae HC-37A1]
gi|424024696|ref|ZP_17764347.1| hypothetical protein VCHC62B1_2239 [Vibrio cholerae HC-62B1]
gi|424027581|ref|ZP_17767184.1| hypothetical protein VCHC69A1_2107 [Vibrio cholerae HC-69A1]
gi|424586854|ref|ZP_18026433.1| hypothetical protein VCCP10303_2010 [Vibrio cholerae CP1030(3)]
gi|424595502|ref|ZP_18034823.1| hypothetical protein VCCP1040_2024 [Vibrio cholerae CP1040(13)]
gi|424599419|ref|ZP_18038599.1| hypothetical protein VCCP104417_2010 [Vibrio Cholerae CP1044(17)]
gi|424602140|ref|ZP_18041282.1| hypothetical protein VCCP1047_1965 [Vibrio cholerae CP1047(20)]
gi|424607109|ref|ZP_18046053.1| hypothetical protein VCCP1050_2026 [Vibrio cholerae CP1050(23)]
gi|424610933|ref|ZP_18049772.1| hypothetical protein VCHC39A1_2120 [Vibrio cholerae HC-39A1]
gi|424613745|ref|ZP_18052533.1| hypothetical protein VCHC41A1_2028 [Vibrio cholerae HC-41A1]
gi|424617725|ref|ZP_18056397.1| hypothetical protein VCHC42A1_2118 [Vibrio cholerae HC-42A1]
gi|424622506|ref|ZP_18061013.1| hypothetical protein VCHC47A1_2154 [Vibrio cholerae HC-47A1]
gi|424645469|ref|ZP_18083205.1| hypothetical protein VCHC56A2_2297 [Vibrio cholerae HC-56A2]
gi|424653238|ref|ZP_18090618.1| hypothetical protein VCHC57A2_2008 [Vibrio cholerae HC-57A2]
gi|424657059|ref|ZP_18094344.1| hypothetical protein VCHC81A2_2010 [Vibrio cholerae HC-81A2]
gi|440710133|ref|ZP_20890784.1| hypothetical protein VC4260B_15290 [Vibrio cholerae 4260B]
gi|443504293|ref|ZP_21071251.1| hypothetical protein VCHC64A1_02269 [Vibrio cholerae HC-64A1]
gi|443508191|ref|ZP_21074954.1| hypothetical protein VCHC65A1_02258 [Vibrio cholerae HC-65A1]
gi|443512033|ref|ZP_21078671.1| hypothetical protein VCHC67A1_02269 [Vibrio cholerae HC-67A1]
gi|443515591|ref|ZP_21082102.1| hypothetical protein VCHC68A1_01983 [Vibrio cholerae HC-68A1]
gi|443519385|ref|ZP_21085781.1| hypothetical protein VCHC71A1_01970 [Vibrio cholerae HC-71A1]
gi|443524275|ref|ZP_21090488.1| hypothetical protein VCHC72A2_02277 [Vibrio cholerae HC-72A2]
gi|443531872|ref|ZP_21097886.1| hypothetical protein VCHC7A1_03018 [Vibrio cholerae HC-7A1]
gi|443535670|ref|ZP_21101548.1| hypothetical protein VCHC80A1_01955 [Vibrio cholerae HC-80A1]
gi|443539216|ref|ZP_21105070.1| hypothetical protein VCHC81A1_02784 [Vibrio cholerae HC-81A1]
gi|449055640|ref|ZP_21734308.1| Selenoprotein O and cysteine-containing protein [Vibrio cholerae O1
str. Inaba G4222]
gi|33517106|sp|Q9KQR7.2|Y1931_VIBCH RecName: Full=UPF0061 protein VC_1931
gi|229343615|gb|EEO08590.1| hypothetical protein VCC_002953 [Vibrio cholerae RC9]
gi|229351754|gb|EEO16695.1| hypothetical protein VCE_003198 [Vibrio cholerae B33]
gi|229355480|gb|EEO20401.1| hypothetical protein VCF_003207 [Vibrio cholerae BX 330286]
gi|229370175|gb|ACQ60598.1| hypothetical protein VCD_002432 [Vibrio cholerae MJ-1236]
gi|255737142|gb|EET92538.1| UPF0061 domain-containing protein [Vibrio cholera CIRS 101]
gi|262030214|gb|EEY48858.1| UPF0061 domain-containing protein [Vibrio cholerae INDRE 91/1]
gi|340036461|gb|EGQ97437.1| hypothetical protein VCHC49A2_2982 [Vibrio cholerae HC-49A2]
gi|340037435|gb|EGQ98410.1| hypothetical protein VCHCUF01_2966 [Vibrio cholerae HCUF01]
gi|341621330|gb|EGS47076.1| hypothetical protein VCHC70A1_2113 [Vibrio cholerae HC-70A1]
gi|341621473|gb|EGS47218.1| hypothetical protein VCHC48A1_2047 [Vibrio cholerae HC-48A1]
gi|341622398|gb|EGS48061.1| hypothetical protein VCHC40A1_2064 [Vibrio cholerae HC-40A1]
gi|341637631|gb|EGS62309.1| hypothetical protein VCHFU02_2271 [Vibrio cholerae HFU-02]
gi|341646382|gb|EGS70496.1| hypothetical protein VCHC38A1_1998 [Vibrio cholerae HC-38A1]
gi|356417660|gb|EHH71275.1| hypothetical protein VCHC06A1_2282 [Vibrio cholerae HC-06A1]
gi|356418531|gb|EHH72128.1| hypothetical protein VCHC21A1_1944 [Vibrio cholerae HC-21A1]
gi|356423105|gb|EHH76566.1| hypothetical protein VCHC19A1_2274 [Vibrio cholerae HC-19A1]
gi|356428551|gb|EHH81777.1| hypothetical protein VCHC22A1_1979 [Vibrio cholerae HC-22A1]
gi|356430209|gb|EHH83418.1| hypothetical protein VCHC23A1_2927 [Vibrio cholerae HC-23A1]
gi|356433564|gb|EHH86753.1| hypothetical protein VCHC28A1_2271 [Vibrio cholerae HC-28A1]
gi|356439732|gb|EHH92697.1| hypothetical protein VCHC32A1_2271 [Vibrio cholerae HC-32A1]
gi|356444743|gb|EHH97552.1| hypothetical protein VCHC43A1_2911 [Vibrio cholerae HC-43A1]
gi|356445742|gb|EHH98544.1| hypothetical protein VCHC33A2_1979 [Vibrio cholerae HC-33A2]
gi|356450987|gb|EHI03692.1| hypothetical protein VCHC48B2_2075 [Vibrio cholerae HC-48B2]
gi|378795272|gb|AFC58743.1| hypothetical protein O3Y_09345 [Vibrio cholerae IEC224]
gi|395915996|gb|EJH26826.1| hypothetical protein VCCP10325_2825 [Vibrio cholerae CP1032(5)]
gi|395917340|gb|EJH28168.1| hypothetical protein VCCP104114_2721 [Vibrio cholerae CP1041(14)]
gi|395918696|gb|EJH29520.1| hypothetical protein VCCP103811_2978 [Vibrio cholerae CP1038(11)]
gi|395927697|gb|EJH38460.1| hypothetical protein VCCP104215_2937 [Vibrio cholerae CP1042(15)]
gi|395928473|gb|EJH39226.1| hypothetical protein VCCP104619_2947 [Vibrio cholerae CP1046(19)]
gi|395931759|gb|EJH42503.1| hypothetical protein VCCP104821_2834 [Vibrio cholerae CP1048(21)]
gi|395939373|gb|EJH50055.1| hypothetical protein VCHC20A2_2452 [Vibrio cholerae HC-20A2]
gi|395942649|gb|EJH53325.1| hypothetical protein VCHC46A1_2761 [Vibrio cholerae HC-46A1]
gi|395958838|gb|EJH69301.1| hypothetical protein VCHC56A2_2297 [Vibrio cholerae HC-56A2]
gi|395959414|gb|EJH69848.1| hypothetical protein VCHC57A2_2008 [Vibrio cholerae HC-57A2]
gi|395962126|gb|EJH72428.1| hypothetical protein VCHC42A1_2118 [Vibrio cholerae HC-42A1]
gi|395970808|gb|EJH80532.1| hypothetical protein VCHC47A1_2154 [Vibrio cholerae HC-47A1]
gi|395973307|gb|EJH82871.1| hypothetical protein VCCP10303_2010 [Vibrio cholerae CP1030(3)]
gi|395975700|gb|EJH85180.1| hypothetical protein VCCP1047_1965 [Vibrio cholerae CP1047(20)]
gi|408007191|gb|EKG45287.1| hypothetical protein VCHC39A1_2120 [Vibrio cholerae HC-39A1]
gi|408013052|gb|EKG50803.1| hypothetical protein VCHC41A1_2028 [Vibrio cholerae HC-41A1]
gi|408032204|gb|EKG68795.1| hypothetical protein VCCP1040_2024 [Vibrio cholerae CP1040(13)]
gi|408041745|gb|EKG77841.1| hypothetical protein VCCP104417_2010 [Vibrio Cholerae CP1044(17)]
gi|408043179|gb|EKG79191.1| hypothetical protein VCCP1050_2026 [Vibrio cholerae CP1050(23)]
gi|408053560|gb|EKG88566.1| hypothetical protein VCHC81A2_2010 [Vibrio cholerae HC-81A2]
gi|408607699|gb|EKK81102.1| hypothetical protein VCCP10336_2525 [Vibrio cholerae CP1033(6)]
gi|408624104|gb|EKK97056.1| hypothetical protein VCHC17A1_2144 [Vibrio cholerae HC-17A1]
gi|408633777|gb|EKL06078.1| hypothetical protein VCHC50A2_2041 [Vibrio cholerae HC-50A2]
gi|408654243|gb|EKL25385.1| hypothetical protein VCHC77A1_2056 [Vibrio cholerae HC-77A1]
gi|408655173|gb|EKL26298.1| hypothetical protein VCHC62A1_2274 [Vibrio cholerae HC-62A1]
gi|408845323|gb|EKL85439.1| hypothetical protein VCHC37A1_2184 [Vibrio cholerae HC-37A1]
gi|408846096|gb|EKL86208.1| hypothetical protein VCHC17A2_2424 [Vibrio cholerae HC-17A2]
gi|408870402|gb|EKM09682.1| hypothetical protein VCHC62B1_2239 [Vibrio cholerae HC-62B1]
gi|408878884|gb|EKM17877.1| hypothetical protein VCHC69A1_2107 [Vibrio cholerae HC-69A1]
gi|439974356|gb|ELP50533.1| hypothetical protein VC4260B_15290 [Vibrio cholerae 4260B]
gi|443431238|gb|ELS73790.1| hypothetical protein VCHC64A1_02269 [Vibrio cholerae HC-64A1]
gi|443435133|gb|ELS81277.1| hypothetical protein VCHC65A1_02258 [Vibrio cholerae HC-65A1]
gi|443439016|gb|ELS88731.1| hypothetical protein VCHC67A1_02269 [Vibrio cholerae HC-67A1]
gi|443443001|gb|ELS96303.1| hypothetical protein VCHC68A1_01983 [Vibrio cholerae HC-68A1]
gi|443446803|gb|ELT03459.1| hypothetical protein VCHC71A1_01970 [Vibrio cholerae HC-71A1]
gi|443449609|gb|ELT09900.1| hypothetical protein VCHC72A2_02277 [Vibrio cholerae HC-72A2]
gi|443457262|gb|ELT24659.1| hypothetical protein VCHC7A1_03018 [Vibrio cholerae HC-7A1]
gi|443461210|gb|ELT32283.1| hypothetical protein VCHC80A1_01955 [Vibrio cholerae HC-80A1]
gi|443465316|gb|ELT39976.1| hypothetical protein VCHC81A1_02784 [Vibrio cholerae HC-81A1]
gi|448264679|gb|EMB01916.1| Selenoprotein O and cysteine-containing protein [Vibrio cholerae O1
str. Inaba G4222]
Length = 489
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 193/468 (41%), Positives = 259/468 (55%), Gaps = 57/468 (12%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLR
Sbjct: 70 PVAMKYAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 130 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I HF TS Y
Sbjct: 183 GHFEHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPY 219
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 220 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 279
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 280 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 336
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 337 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD- 388
Query: 542 GKERKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ A
Sbjct: 389 ----REAAKTWLTRYLERAARELGQEGRPISTRERCQAMRQVNPKYILRNYLAQQAIEFA 444
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 445 ERGDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|430809394|ref|ZP_19436509.1| hypothetical protein D769_24048 [Cupriavidus sp. HMR-1]
gi|429498203|gb|EKZ96717.1| hypothetical protein D769_24048 [Cupriavidus sp. HMR-1]
Length = 516
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 214/536 (39%), Positives = 284/536 (52%), Gaps = 76/536 (14%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFER----PDFPLFFSGATPLAGAVPYAQ 188
+T+++P+ + +P LV+ + + A L + + + P F F G A P A
Sbjct: 32 FTRLTPT-PLPSPYLVSVAPAAAALLGWNETDLQDAVKDPAFIDSFVGNAVPDWADPLAT 90
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGRAI L E WE+QLKG G TPYSR ADG AVLRSSIR
Sbjct: 91 VYSGHQFGVWAGQLGDGRAIRLAEA-QTPGGPWEIQLKGGGLTPYSRMADGRAVLRSSIR 149
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E+LCSEAM+ LG+PTTRAL ++ + V R+ E A+V R+A SF+RFG ++
Sbjct: 150 EYLCSEAMYALGVPTTRALSIIGSDAPVRRETI-------ETSAVVTRLAPSFIRFGHFE 202
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
A+R ED +R LAD+ I + + + +N Y A
Sbjct: 203 HFAAR--EDHASLRQLADFVIDNFYPACRD---------------------AANPYQALL 239
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
EV+ TA +VA WQ VGF HGV+NTDNMSILGLTIDYGPFGFLDAFD + N +D G
Sbjct: 240 REVSLLTADMVAHWQAVGFCHGVMNTDNMSILGLTIDYGPFGFLDAFDANHICNHSDQQG 299
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA------------------NYVMERYG 470
RY ++ QP I WN+ + L L D A + +RY
Sbjct: 300 -RYAYSQQPQIAFWNLHCLAQAL--LPLWRDTNAADPEVEKAAAVEAAREALDPFRDRYA 356
Query: 471 TKFMDEYQAIMTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE 527
F Y+A KLGL +Q +++ L + ++VDYT F+R LS V S +
Sbjct: 357 EAFFRHYRA----KLGLRSEQEQDETLMTNLFRVLHENRVDYTLFWRNLSRV----SSLD 408
Query: 528 DELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYL 587
+ P++ + LD A Y L S D R M + NPKYVLRN++
Sbjct: 409 NSHDAPVRDLFLDRAAWDAWA-----AEYRARLQSEQSDDAARTTGMLATNPKYVLRNHM 463
Query: 588 CQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++AI AA DF EV RLL ++ +P+DEQP E YA+LPP WA +SCSS
Sbjct: 464 AETAIRAARDKDFSEVDRLLAVLSKPFDEQPEAEPYAKLPPDWA---SGLEVSCSS 516
>gi|19115652|ref|NP_594740.1| UPF0061 family protein [Schizosaccharomyces pombe 972h-]
gi|3183368|sp|O13890.1|YE35_SCHPO RecName: Full=UPF0061 protein C20G4.05c
gi|2330761|emb|CAB11255.1| UPF0061 family protein [Schizosaccharomyces pombe]
Length = 568
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 216/600 (36%), Positives = 312/600 (52%), Gaps = 87/600 (14%)
Query: 95 MTKKLKALEDLNWDHSFVRELPGDP-------------RTDSIPREVLHA-CYTKVSPSA 140
M+KKLK DL +F LP DP R +PR V +T ++PS
Sbjct: 1 MSKKLK---DLPVSSTFTSNLPPDPLVPTVQAMKKADDRILHVPRFVEGGGLFTYLTPSL 57
Query: 141 EVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGA-TPLAGAVPYAQCYGGHQFGMWA 199
+ N QL+A+S S SL L+ E + F G+ + P+AQCYGG+QFG WA
Sbjct: 58 KA-NSQLLAYSPSSVKSLGLEESETQTEAFQQLVVGSNVDVNKCCPWAQCYGGYQFGDWA 116
Query: 200 GQLGDGRAITLGEILNLKS-ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHF 258
GQLGDGR ++L E+ N ++ +R+E+Q+KGAG+TPYSRFADG AVLRSSIRE+LC EA++
Sbjct: 117 GQLGDGRVVSLCELTNPETGKRFEIQVKGAGRTPYSRFADGKAVLRSSIREYLCCEALYA 176
Query: 259 LGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDL 318
LGIPTT+AL + V + EP A+VCR+A S++R G++ + Q +
Sbjct: 177 LGIPTTQALAISNLEGVVAQ------RETVEPCAVVCRMAPSWIRIGTFDLQGINNQ--I 228
Query: 319 DIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASL 378
+ +R LADY + + F GD T N+Y +VA R A
Sbjct: 229 ESLRKLADYCLNFVLKD----------GFHGGD--------TGNRYEKLLRDVAYRNAKT 270
Query: 379 VAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPD 438
VA+WQ GF +GVLNTDN SILGL+IDYGPFGFLD ++PSFTPN D+ RY + NQPD
Sbjct: 271 VAKWQAYGFMNGVLNTDNTSILGLSIDYGPFGFLDVYNPSFTPNHDDV-FLRYSYRNQPD 329
Query: 439 IGLWNIAQFSTTL----AAAKLIDD--------------KEA--------NYVMERYGTK 472
I +WN+++ ++ L A +DD K+A ++E Y
Sbjct: 330 IIIWNLSKLASALVELIGACDKVDDLQYMEQLHNSTDLLKKAFAYTSEVFEKIVEEYKNI 389
Query: 473 FMDEYQAIMTKKLGLP--KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL 530
+++ +M K++GLP NK +I+ LL + ++D N F LS + PS E+E
Sbjct: 390 VQNDFYDLMFKRVGLPSDSSNKILITDLLQILEDYELDMPNCFSFLS--RNSPSSMENEE 447
Query: 531 LVP---LKAVLLDIGKER-----KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
+ L+ ER +A+ +WV Y + + D R A M VNP +
Sbjct: 448 YAAKLMQACICLNPNNERVRNESVKAFTNWVGRYSEATKTQ--EDSSRLASMKKVNPHFT 505
Query: 583 LRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCS 642
LRN++ + I A +G F +++ K+ P+++ G K + P + CS
Sbjct: 506 LRNWVLEEVIKEAYIGKFELFKKVCKMAACPFEDTWGFSKEEEDYLCYNTTPSKSQIQCS 565
>gi|418355414|ref|ZP_12958133.1| hypothetical protein VCHC61A1_2816 [Vibrio cholerae HC-61A1]
gi|356451912|gb|EHI04591.1| hypothetical protein VCHC61A1_2816 [Vibrio cholerae HC-61A1]
Length = 487
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 193/468 (41%), Positives = 259/468 (55%), Gaps = 57/468 (12%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLR
Sbjct: 68 PVAMKYAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLR 127
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 128 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 180
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I HF TS Y
Sbjct: 181 GHFEHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPY 217
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 218 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 277
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 278 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 334
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 335 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD- 386
Query: 542 GKERKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ A
Sbjct: 387 ----REAAKTWLTRYLERAARELGQEGRPISTRERCQAMRQVNPKYILRNYLAQQAIEFA 442
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 443 ERGDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 487
>gi|260776397|ref|ZP_05885292.1| UPF0061 domain-containing protein [Vibrio coralliilyticus ATCC
BAA-450]
gi|260607620|gb|EEX33885.1| UPF0061 domain-containing protein [Vibrio coralliilyticus ATCC
BAA-450]
Length = 490
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 209/523 (39%), Positives = 285/523 (54%), Gaps = 61/523 (11%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A YT V P A ++N VAW+ A L P + + F +G A Y
Sbjct: 19 AFYTHVQPQA-LDNSHWVAWNSEFARQFGL-PLQAPQGSLKSFLAGELKPMPTPCLAMKY 76
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG++ LGDGR + LGEI N +++ LKGAG TPYSR DG AVLRS+IRE+
Sbjct: 77 AGHQFGIYNPDLGDGRGLLLGEISNQSGTLFDIHLKGAGLTPYSRMGDGRAVLRSTIREY 136
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
LCSEAM LGIPTTRAL ++T+ V R+ K E GA++ R++QS +RFG ++
Sbjct: 137 LCSEAMAGLGIPTTRALAMLTSDTLVYRE-------KAEQGALLLRMSQSHIRFGHFEHF 189
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
Q + ++ LAD I ++ +K Y A
Sbjct: 190 FYTNQ--IAELKLLADKVIEWYWPDCIETDKP---------------------YLAMFEH 226
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V + TA+L+A WQ GF HGV+NTDNMSILG T DYGPFGFLD +DPS+ N +D G R
Sbjct: 227 VVKGTANLIAHWQAYGFAHGVMNTDNMSILGETFDYGPFGFLDDYDPSYISNHSDYEG-R 285
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP-- 488
Y F QP +GLWN++ + L LI+ + V+E+Y + +M KLGL
Sbjct: 286 YAFDQQPRVGLWNLSALAHALTP--LIEKNDLESVLEKYEGILGKSFSRLMRSKLGLQSK 343
Query: 489 -KYNKQIISKLLNNMAVDKVDYTNFFRALSNV-KADPSIPEDELLVPLKAVLLDIGKERK 546
+ + ++ + + ++VDYT F R +SN+ + DP V++D+ +R
Sbjct: 344 REKDSELFQSMFELLEQNQVDYTRFMREISNLDRTDPQ------------VVIDLFADR- 390
Query: 547 EAWISWVLSYI----QELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
EA W+ Y+ QE +G I ER M VNPKY+LRNYL Q AID AE GDF
Sbjct: 391 EAVKVWLTDYLARCEQEADEAGSPIEASERCEAMRRVNPKYILRNYLAQLAIDKAEEGDF 450
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
EV R+ +L++ PYDEQP M++YA+LPP W + +SCSS
Sbjct: 451 SEVNRVAELLKYPYDEQPEMDEYAKLPPEWGKK---MEISCSS 490
>gi|254504578|ref|ZP_05116729.1| Uncharacterized ACR, YdiU/UPF0061 family [Labrenzia alexandrii
DFL-11]
gi|222440649|gb|EEE47328.1| Uncharacterized ACR, YdiU/UPF0061 family [Labrenzia alexandrii
DFL-11]
Length = 493
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 198/534 (37%), Positives = 283/534 (52%), Gaps = 70/534 (13%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
+D+++ RELPG Y + A V +P+LV + +A L L+P
Sbjct: 8 FQFDNTYARELPG--------------FYVEWQ-GASVPDPKLVLLNTPLAGELGLEPTA 52
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
+ F+G+ GA P AQ Y GHQFG ++ QLGDGRA+ +GE+++ + R ++Q
Sbjct: 53 LSAAEMAAVFAGSASPEGASPLAQVYAGHQFGGFSPQLGDGRALLIGEVIDQEGHRRDIQ 112
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKG+G+TP+SR DG AV+ +RE++ EAMH LG+PTTRAL VTTG+ + R+
Sbjct: 113 LKGSGRTPFSRGGDGKAVIGPVLREYILGEAMHALGVPTTRALAAVTTGEMIQREGL--- 169
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
+PGA++ RVA S LR G++Q A+R D D VR LADYAI H
Sbjct: 170 ----KPGAVLTRVASSHLRVGTFQFFAAR--SDTDKVRQLADYAIARH------------ 211
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
D D + D +++ + V +R A LV++W +GF HGV+NTDN +I G TI
Sbjct: 212 ------DPDLADAD---DRHLRFLARVVDRQAQLVSKWMLIGFVHGVMNTDNTTISGETI 262
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANY 464
DYGP FLD +DP+ ++ D G RY F QP I WN+A+ + L L D + +
Sbjct: 263 DYGPCAFLDGYDPAAVFSSID-HGGRYAFGRQPTIMQWNLARLAEAL--LPLFDPADLDR 319
Query: 465 VME---RYGTKFMDEYQAI----MTKKLGLPKYNKQ---IISKLLNNMAVDKVDYTNFFR 514
+E + KF D Y++ M+KKLGL + + LL MA DYT FR
Sbjct: 320 AVELATQELNKFPDLYRSAWLNGMSKKLGLTDVQDEDVTLFEDLLGAMAASGADYTLVFR 379
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
LSN + + P +L E K +WV+ + Q S G EE M
Sbjct: 380 RLSNAVSGNTAPLFDLF------------EDKAGISAWVIRWEQRRSSEGRPAEEISRGM 427
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPP 628
N VNP Y+ RN+ + A+DA+E GD+ V LL +++ PY+E+ G+E Y P
Sbjct: 428 NRVNPIYIPRNHKVEEALDASEAGDYHLVEELLDVLKDPYEERAGLEAYGTPAP 481
>gi|399000637|ref|ZP_10703361.1| hypothetical protein PMI21_01935 [Pseudomonas sp. GM18]
gi|398129477|gb|EJM18842.1| hypothetical protein PMI21_01935 [Pseudomonas sp. GM18]
Length = 487
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 218/551 (39%), Positives = 297/551 (53%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P + P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-IAAPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP E E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAEAETPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI--HASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R++ S +RFG ++ + R ++ + L ++ + HF H
Sbjct: 166 E-------KQERAAMVLRLSPSHVRFGHFEFFYYTKRPEQQ----KELGEHVLAMHFPHC 214
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ + E Y A E+ ER A L+A+WQ GF HGV+NTDN
Sbjct: 215 --LEQPE-------------------PYLAMFREIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LG +++++ +LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGFTTAEDDDQKLLEQLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE-RKA 572
R L + + +I L+ +DI + + +W YI + G +D+ R+
Sbjct: 371 RRLGDESPEQAISR------LRDDFVDI-----KGFDAWGERYIARVTREGEADQALRRE 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSNPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|353231624|emb|CCD78042.1| Selenoprotein O-like [Schistosoma mansoni]
Length = 706
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 197/475 (41%), Positives = 271/475 (57%), Gaps = 68/475 (14%)
Query: 107 WDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWS-ESVA---------- 155
+D+ ++ LP D ++SI R V +AC+T+VSP+ +++NP+LV +S +++A
Sbjct: 70 FDNIQLKSLPIDNGSNSI-RSVPNACFTRVSPT-KIDNPRLVLFSPDALALLNICHKINH 127
Query: 156 -DSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
D K E + SG G+ P A CY G+QFG +AGQLGDG AI+LGE++
Sbjct: 128 LDKQNCKGKTEETNCLVEYLSGNKLWPGSNPTAHCYCGYQFGSFAGQLGDGAAISLGEVV 187
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGK 274
N + ERWELQLKGAG TP+SR DG VLRSS+REFLCSEAM++LGIPTTRA ++T+
Sbjct: 188 NEQGERWELQLKGAGLTPFSRQGDGRKVLRSSLREFLCSEAMYYLGIPTTRAASIITSDT 247
Query: 275 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQ---------EDLDIVRTLA 325
V RDMFY G+ E +I RVA++F+RFGS++I S +L IV L
Sbjct: 248 LVERDMFYTGDSITEKASITSRVAKTFIRFGSFEISKSPDSITGRFGPSVGNLTIVSQLT 307
Query: 326 DYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGV 385
+Y I+ + HI D + ++ N Y + EV +RTA+LVA WQ V
Sbjct: 308 NYVIQQFYPHI------------WSDYSNDIM----NCYLEFFKEVVKRTANLVALWQTV 351
Query: 386 GFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIA 445
GF HGVLNTDNMSI+GLTIDYGPFGF+D F NT+D P RY +A QP+I WN A
Sbjct: 352 GFCHGVLNTDNMSIIGLTIDYGPFGFMDQFTWDHISNTSD-PDGRYSYAQQPNICAWNCA 410
Query: 446 QFSTTLAAA----------KLIDDKEANYVMERY----GTKFMDEYQAI----MTKKLGL 487
+ + L A K ID + N + ++ T +M ++++ M KKLGL
Sbjct: 411 RLAECLIQALIDQQKYSSDKTIDKEFVNNLTRKFTNVLDTTYMSYFKSVYLERMRKKLGL 470
Query: 488 --PK--YNKQIISKLLNNMAVDKVDYTNFFRALSNV------KADPSIPEDELLV 532
PK + +I L N M D+TN F AL + + D + E +L+V
Sbjct: 471 FYPKDEIDADLIENLFNTMEKTGADFTNTFLALEDTLFQLFNENDSDLLEPDLIV 525
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 67/151 (44%), Gaps = 21/151 (13%)
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER-KEAWISWVLSYIQELL-------- 561
N L K S +++L L+ + + +ER K W W+ +Y L
Sbjct: 559 NIIDQLEETKILKSKEKEKLYKELEHMTEEEYQERNKRLWSIWLRAYKTRLKIDFERNND 618
Query: 562 SSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYD--EQPG 619
++ E LM SVNP+ VLRNYL + AI +A+ GD+ ++L + P+ +
Sbjct: 619 NAKTQISECLNLMQSVNPRVVLRNYLAEEAIKSADKGDYTVAQQLFDSLTTPFKNPDTSS 678
Query: 620 MEKYARL-------PPAWAYRPGVCMLSCSS 643
+ RL PP W+ + V SCSS
Sbjct: 679 NNESCRLVSRIKYRPPNWSRKLRV---SCSS 706
>gi|147674783|ref|YP_001217463.1| hypothetical protein VC0395_A1520 [Vibrio cholerae O395]
gi|227118379|ref|YP_002820275.1| hypothetical protein VC395_2046 [Vibrio cholerae O395]
gi|146316666|gb|ABQ21205.1| conserved hypothetical protein [Vibrio cholerae O395]
gi|227013829|gb|ACP10039.1| conserved hypothetical protein [Vibrio cholerae O395]
Length = 508
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 193/468 (41%), Positives = 261/468 (55%), Gaps = 57/468 (12%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLR
Sbjct: 89 PVAMKYAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLR 148
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 149 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 201
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ H + ++ + LAD I HF TS Y
Sbjct: 202 GHFE-HFFYTDQHANL-KLLADKVIEWHFPDCVQ---------------------TSKPY 238
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 239 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 298
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 299 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 355
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 356 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD- 407
Query: 542 GKERKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ A
Sbjct: 408 ----REAAKTWLTRYLERAARELGQEGRPISIRERCQAMRQVNPKYILRNYLAQQAIEFA 463
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 464 ERGDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 508
>gi|296272402|ref|YP_003655033.1| hypothetical protein [Arcobacter nitrofigilis DSM 7299]
gi|296096576|gb|ADG92526.1| protein of unknown function UPF0061 [Arcobacter nitrofigilis DSM
7299]
Length = 485
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 201/517 (38%), Positives = 278/517 (53%), Gaps = 57/517 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y K++P+ + NP L+++++ + D + LD E DF F +G L G+ PYA Y G
Sbjct: 20 YQKINPTP-LNNPHLISYNKLMFDEIALDYDEANSKDFLKFINGEKLLIGSEPYASAYAG 78
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LG++ W LQ KG+G T YSR DG AVLRSSIRE++
Sbjct: 79 HQFGYFVPQLGDGRAINLGKV-----GTWHLQTKGSGLTRYSRQGDGRAVLRSSIREYII 133
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH L IPTTR L L+ + V R Y G E G+IV R++ S++R G+++ A
Sbjct: 134 SEAMHALNIPTTRVLALIGSTHPVHR---YYGVV--ETGSIVLRMSPSWIRIGTFEYFA- 187
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R + + V+ LADY I++ + H+ N DE NKY E+
Sbjct: 188 RSKGAKENVKQLADYVIKNSYAHLIN------------DE---------NKYEKMYYEMV 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
++TA L+A+WQ GF HGV+NTDN S+ GL+IDYGPF F+D F+ + N TD G RY
Sbjct: 227 DKTAILMAKWQAYGFMHGVMNTDNFSMAGLSIDYGPFAFMDYFNINQICNHTDSEG-RYS 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL----- 487
+ NQP + WN+ + +L +D + N ++ Y EY +MT++LGL
Sbjct: 286 YLNQPYVAKWNLEVLANSLKIICELD--KLNEYLKTYFHIQEKEYLTLMTQRLGLDIHKS 343
Query: 488 -PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERK 546
Y +IS LL + K DY FF LS K I + ++DI R
Sbjct: 344 SDSYATLVIS-LLKVLQTSKTDYNQFFYELSKCKNSEEIRK----------VIDISIYR- 391
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
+A W+ YI+ E+ + M VNPKYV++NY+ Q AID AE GDF V L
Sbjct: 392 QALDKWLEDYIELREFENEDFEKVQERMKKVNPKYVIKNYMLQEAIDKAEEGDFTLVNEL 451
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + + PYDE E+Y++ P LSCSS
Sbjct: 452 LNIAQNPYDEHKEYERYSKATPL---EFSNIKLSCSS 485
>gi|153217047|ref|ZP_01950811.1| conserved hypothetical protein [Vibrio cholerae 1587]
gi|124113937|gb|EAY32757.1| conserved hypothetical protein [Vibrio cholerae 1587]
Length = 508
Score = 318 bits (816), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 192/466 (41%), Positives = 260/466 (55%), Gaps = 53/466 (11%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ +LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 89 PVAMKYAGHQFGVYNPELGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 148
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 149 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 201
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ H + ++ + LAD I HF TS Y
Sbjct: 202 GHFE-HFFYTDQHANL-KLLADKVIEWHFPDCVQ---------------------TSKPY 238
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 239 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 298
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + +M K
Sbjct: 299 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSEHLNLHFSRLMRAK 355
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LD 540
LGL + ++ + +A + DYT F R LS + + +AV+ L
Sbjct: 356 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLV 405
Query: 541 IGKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
+ +E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 406 LDREAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAER 465
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 466 GDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 508
>gi|449300226|gb|EMC96238.1| hypothetical protein BAUCODRAFT_33584 [Baudoinia compniacensis UAMH
10762]
Length = 624
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 221/607 (36%), Positives = 309/607 (50%), Gaps = 88/607 (14%)
Query: 88 DGGDESKMTKKLKALEDLNWDHSFVRELPGDP------------RTDSIPREVLHACYTK 135
DGG + + + DL ++F ++LP DP R+ PR V A YT
Sbjct: 11 DGGHQQSFS-----IRDLPKSNNFTQKLPPDPQYPTPASSHKAERSKLGPRLVREAAYTY 65
Query: 136 VSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLA---------GAVPY 186
V P + +LV S++ L +DP E DF +G + P+
Sbjct: 66 VRPDS-FPKTELVGVSKAALRDLAIDPASVETDDFKDTVAGKKIITLQGDEPNDTDIYPW 124
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRS 245
AQCYGG+QFG WAGQLGDGRAI+L E N S R+ELQLKGAGKTPYSRFADG AV+RS
Sbjct: 125 AQCYGGYQFGQWAGQLGDGRAISLFETTNPTSHTRYELQLKGAGKTPYSRFADGRAVVRS 184
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIREF+ SEA++ LGIP+TRAL L + R EPGAIV R AQS+LRFG
Sbjct: 185 SIREFVVSEALNALGIPSTRALSLTLAPEARVR------RETTEPGAIVARFAQSWLRFG 238
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENM-------NKSESLSFSTGDEDHSVVD 358
++ + SRG D ++R LADYA F + + + E + + DE +
Sbjct: 239 TFDLPRSRG--DRAMIRKLADYAAEEVFGGWDKLPGKTGSDDLVEPGTSVSRDELQGENE 296
Query: 359 LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
N+Y E+A R A +VA WQ FT+GVLNTDN SI GL+ID+GPF FLD FDP+
Sbjct: 297 HQQNRYTRLYREIARRNARMVAYWQAYAFTNGVLNTDNTSIFGLSIDFGPFAFLDNFDPN 356
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL----AAAKLIDDKE----------ANY 464
+TPN D RY + NQP I WN+ + L A +D+KE A+
Sbjct: 357 YTPNHDDHM-LRYAYKNQPSIIWWNLVRLGEALGELIGAGDRVDEKEFVEEGVSKDWADE 415
Query: 465 VMER-----------YGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDY 509
+++R Y FMDEY+ +MT +LGL + + + S LL+ M ++D+
Sbjct: 416 LVKRAETLIEATGEEYKAVFMDEYKRLMTARLGLKQCKSEDFESLYSDLLDTMEALELDF 475
Query: 510 TNFFRALSNVKADPSIPEDELLVPLKAVLLD-------IGKE-----RKEAWI-SWVLSY 556
+ FR LS + + I DE + +G E R W+ W
Sbjct: 476 NHTFRRLSYISFE-EIDTDEKRKEVAGRFFHHEGLSGLVGSEEDARARVAKWLEKWRARV 534
Query: 557 IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE-LGDFGEVRRLLKLMERPYD 615
+++ +S + EER M +VNPK++ R+++ I+ E G+ + ++ + P+
Sbjct: 535 VEDWPNSTEAKEERFRAMRAVNPKFIPRSWILDELIERVEKKGEREILGHVMDMALNPFQ 594
Query: 616 EQPGMEK 622
E G K
Sbjct: 595 ESWGWSK 601
>gi|262167890|ref|ZP_06035590.1| UPF0061 domain-containing protein [Vibrio cholerae RC27]
gi|262023617|gb|EEY42318.1| UPF0061 domain-containing protein [Vibrio cholerae RC27]
Length = 489
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 193/468 (41%), Positives = 259/468 (55%), Gaps = 57/468 (12%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLR
Sbjct: 70 PVAMKYAGHQFGVYNPDLGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 130 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I HF TS Y
Sbjct: 183 GHFEHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPY 219
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 220 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 279
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 280 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 336
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 337 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD- 388
Query: 542 GKERKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ A
Sbjct: 389 ----REAAKTWLTRYLERAARELGQEGRPISIRERCQAMRQVNPKYILRNYLAQQAIEFA 444
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 445 ERGDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|398865175|ref|ZP_10620698.1| hypothetical protein PMI35_02581 [Pseudomonas sp. GM78]
gi|398243914|gb|EJN29491.1| hypothetical protein PMI35_02581 [Pseudomonas sp. GM78]
Length = 487
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 216/551 (39%), Positives = 302/551 (54%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFDR--LGD------------AFSAHVLPEP-IDNPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP+ E +F FSG A A+P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPEVAETQEFAELFSGHKLWADAIPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+ L IPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALQALNIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A+V R+A S +RFG ++ + R ++ + L D+ + HF
Sbjct: 166 E-------KQERAAMVLRLAPSHVRFGHFEYFYYTKRPEQQ----KVLGDHVLAMHFPQC 214
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
+ + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 215 --LEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD +F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFF 513
I + + Y F Y +M ++LGL + +++++ +LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLYLPLFQAHYLDLMRRRLGLTTAEEDDQKLLEQLLQLMQNSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERKA 572
R L + ++ L+ +D+ + + +W Y+ + G + E+R+A
Sbjct: 371 RRLGEESPEAAVGR------LRDDFVDL-----KGFDAWGELYVARVAREGEVDQEQRRA 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ +P++EQPGME YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIDAAESGDYSEVRRLHAVLSKPFEEQPGMESYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|424591597|ref|ZP_18031024.1| hypothetical protein VCCP103710_2369 [Vibrio cholerae CP1037(10)]
gi|408031404|gb|EKG68030.1| hypothetical protein VCCP103710_2369 [Vibrio cholerae CP1037(10)]
Length = 489
Score = 318 bits (815), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 191/466 (40%), Positives = 258/466 (55%), Gaps = 53/466 (11%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ +LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 70 PIAMKYAGHQFGVYNPELGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 130 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I HF TS Y
Sbjct: 183 GHFEHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPY 219
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW ++V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 220 AAWFLQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 279
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + +M K
Sbjct: 280 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSEHLNLHFSRLMRAK 336
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LD 540
LGL + ++ + +A + DYT F R LS + + +AV+ L
Sbjct: 337 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLV 386
Query: 541 IGKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
+ +E +AWI L+ +E G IS ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 387 LDREAAKAWIERYLTRAAREFGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAER 446
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 447 GDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|229523950|ref|ZP_04413355.1| hypothetical protein VCA_001529 [Vibrio cholerae bv. albensis
VL426]
gi|229337531|gb|EEO02548.1| hypothetical protein VCA_001529 [Vibrio cholerae bv. albensis
VL426]
Length = 489
Score = 318 bits (815), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 192/466 (41%), Positives = 259/466 (55%), Gaps = 53/466 (11%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ +LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 70 PVAMKYAGHQFGVYNPELGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 130 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I +F TS Y
Sbjct: 183 GHFEHFFYTDQH--ANLKLLADKVIEWYFPDCVQ---------------------TSKPY 219
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 220 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 279
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 280 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 336
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LD 540
LGL + ++ + +A + DYT F R LS + + KAV+ L
Sbjct: 337 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------KAVIDLV 386
Query: 541 IGKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
+ +E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 387 LDREAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAER 446
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 447 GDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|194227089|ref|XP_001496125.2| PREDICTED: UPF0061 protein Fjoh_2793-like [Equus caballus]
Length = 571
Score = 318 bits (815), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 211/556 (37%), Positives = 297/556 (53%), Gaps = 73/556 (13%)
Query: 110 SFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESV-ADSLELDPKEFERP 168
+F+ LP DP ++ R+V + ++ P+ +LVA S+ V D L+LD E
Sbjct: 67 NFIAMLPVDPVKENYVRKVKNCVFSIAFPTPFKSRVRLVAVSKEVLEDILDLDLSVSETD 126
Query: 169 DFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGA 228
DF SG L G+VP A YGGHQFG+WA QLGDGRA +G +N
Sbjct: 127 DFIQLVSGEKILFGSVPLAHRYGGHQFGIWADQLGDGRAHLIGIYMN------------- 173
Query: 229 GKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKE 288
DG AVLRSS+REFL SEA+H LGIPT+RA LV + V RD FYDGN +
Sbjct: 174 ------SHGDGRAVLRSSVREFLGSEAVHHLGIPTSRAASLVVSDDEVWRDQFYDGNVVK 227
Query: 289 EPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS 348
E A+V RVA+S+ R GS +I A G+ LD++RTL D+ I+ HF ++
Sbjct: 228 ERAAVVLRVAKSWFRIGSLEILAHYGE--LDLLRTLLDFIIQEHFPSVD----------- 274
Query: 349 TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTH------------GVLNTDN 396
G+ N+Y + V TA L+A W VGF H GV NTDN
Sbjct: 275 VGE---------PNRYVDFFSVVVSETAQLIALWTSVGFAHVTTMYPYLCILEGVCNTDN 325
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
S+L +TIDYGPFGF++A++P F PNT+D RRY NQ +IG++N+ + L L
Sbjct: 326 FSLLSITIDYGPFGFMEAYNPDFVPNTSD-DERRYKIGNQANIGMFNLNKLLQALNP--L 382
Query: 457 IDDKE---ANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYT 510
+D ++ A ++E Y + ++ + KLGL K ++ +I+ LL+ M D+T
Sbjct: 383 LDPRQKQLAALILEGYPDLYYTRFRELFKAKLGLLGERKGDEDLIAFLLHLMEKTAADFT 442
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSS-GISDEE 569
FR LS + EL +P + L + E + +WV Y+ L S+ SD E
Sbjct: 443 MTFRQLSEITQSQL---QELNIPQQFWALQT-ISKHELFPAWVSQYLLRLKSNMNDSDSE 498
Query: 570 RKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEK--YARLP 627
R+ M +VNP+YVL+N++ +SA+ AE DF EVR L ++++ P+ + EK YA
Sbjct: 499 RRKRMMTVNPRYVLKNWMAESAVRKAERNDFSEVRLLQQVLQHPFQKHSTAEKAGYASPT 558
Query: 628 PAWAYRPGVCMLSCSS 643
P+WA V SCSS
Sbjct: 559 PSWAKNLRV---SCSS 571
>gi|183179526|ref|ZP_02957737.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
gi|183012937|gb|EDT88237.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
Length = 508
Score = 318 bits (814), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 192/468 (41%), Positives = 259/468 (55%), Gaps = 57/468 (12%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 89 PVAMKYAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 148
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 149 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 201
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I HF TS Y
Sbjct: 202 GHFEHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPY 238
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 239 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 298
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 299 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 355
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 356 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD- 407
Query: 542 GKERKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ A
Sbjct: 408 ----REAAKTWLTRYLERAARELGQEGRPISSSERCQAMRQVNPKYILRNYLAQQAIEFA 463
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 464 ERGDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 508
>gi|254226480|ref|ZP_04920065.1| conserved hypothetical protein [Vibrio cholerae V51]
gi|125620986|gb|EAZ49335.1| conserved hypothetical protein [Vibrio cholerae V51]
Length = 508
Score = 318 bits (814), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 192/466 (41%), Positives = 260/466 (55%), Gaps = 53/466 (11%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ +LGDGR + L E+ + E +++ LKGAG TPYSR DG AVLR
Sbjct: 89 PVAMKYAGHQFGVYNPELGDGRGLLLAEMATKQGEVFDIHLKGAGLTPYSRMGDGRAVLR 148
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 149 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 201
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ H + ++ + LAD I HF TS Y
Sbjct: 202 GHFE-HFFYTDQHANL-KLLADKVIEWHFPDCVQ---------------------TSKPY 238
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 239 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 298
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 299 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 355
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LD 540
LGL + ++ + +A + DYT F R LS + + +AV+ L
Sbjct: 356 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLV 405
Query: 541 IGKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
+ +E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 406 LDREAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAER 465
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GDF E++RL ++ PY E P E+YA+L P W + +SCSS
Sbjct: 466 GDFEEMQRLATVLTSPYAEHPEFERYAKLSPEWGKK---LEISCSS 508
>gi|296135964|ref|YP_003643206.1| hypothetical protein Tint_1494 [Thiomonas intermedia K12]
gi|295796086|gb|ADG30876.1| protein of unknown function UPF0061 [Thiomonas intermedia K12]
Length = 513
Score = 318 bits (814), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 203/505 (40%), Positives = 270/505 (53%), Gaps = 60/505 (11%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELD----PKEFERPDFPLFFSGATPLAGAVPYAQC 189
VSP + +P LVA S A + L P++ + D+ F G A
Sbjct: 37 VAVSP---LPDPVLVASSADAAALVGLTAPTTPQDEQ--DWARAFGGHVAAISGGSRATV 91
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKS-------ERWELQLKGAGKTPYSRFADGLAV 242
Y GHQFG WAGQLGDGRA+ LG+ + RWE+Q KG+G+TP+SR DG AV
Sbjct: 92 YAGHQFGNWAGQLGDGRALLLGDWPDASGGRHSGGYARWEVQFKGSGRTPFSRMGDGWAV 151
Query: 243 LRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFL 302
LRSSIREFLCSEAM LGIPTTRALCLV + + V R+ + E A+V R++ SF+
Sbjct: 152 LRSSIREFLCSEAMAALGIPTTRALCLVGSSRPVRRE-------RIETAAMVTRLSPSFV 204
Query: 303 RFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSN 362
RFG ++ + GQ + +R L D+ I + N +
Sbjct: 205 RFGHFEHFSYSGQTEQ--LRALTDWVIAQYCPDCANAPQPALALLQ-------------- 248
Query: 363 KYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPN 422
W V RTA L+A+WQ VGF HGV+NTDNMSILG TIDYGPF FLDA+DP TPN
Sbjct: 249 ----W---VVARTARLIARWQAVGFIHGVMNTDNMSILGWTIDYGPFAFLDAYDPLHTPN 301
Query: 423 TTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIM 481
TTD G RY + QP + WN+ L LID E A ++++ +++ Q +
Sbjct: 302 TTDR-GGRYAYGRQPAVAHWNLLALGQAL--LPLIDKPESALAAVDQFRPQYVQAMQQQL 358
Query: 482 TKKLGL--PKYNK-QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
KLGL P+ N + LL+ MA ++ D+T FR L+ + AD P +P A+
Sbjct: 359 AAKLGLTAPQPNDGDLFQDLLDTMAANRSDWTLSFRHLAQLAADAHAP-----IP-PALA 412
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+E + + WV Y + L + G D R MN+VNP VLR++L Q+AI AE+G
Sbjct: 413 AQFAREPQR-FGDWVARYRERLRAEGRDDAARAVAMNAVNPLVVLRHHLAQAAIAQAEVG 471
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKY 623
DF EV RLL + RP+D Y
Sbjct: 472 DFSEVHRLLHALTRPFDAHAAPPHY 496
>gi|384171544|ref|YP_005552921.1| hypothetical protein [Arcobacter sp. L]
gi|345471154|dbj|BAK72604.1| conserved hypothetical protein [Arcobacter sp. L]
Length = 485
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 193/516 (37%), Positives = 283/516 (54%), Gaps = 55/516 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y K++ + ++NP+LV++++ D + LD +E E +F F +G L G+VPY+ Y G
Sbjct: 20 YQKLNATP-LKNPKLVSFNKEACDLIGLDYEECETQEFLEFMNGEKTLNGSVPYSMVYAG 78
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LG I W LQ KG+G T YSR DG AVLRSSIRE+L
Sbjct: 79 HQFGYFVPQLGDGRAINLGSI-----NGWHLQTKGSGLTRYSRQGDGRAVLRSSIREYLI 133
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM+ LGIPTTRAL ++ + F R+ +E AIV R++ S++R G+++ A
Sbjct: 134 SEAMYALGIPTTRALAIIDSETFAHREW------NQESCAIVLRMSPSWIRIGTFEFFAR 187
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
+ ++ LADY I+ + +EN ED KY ++
Sbjct: 188 TKENSQKNLKQLADYVIKQSYPELEN-------------EDE--------KYEKMFYKLV 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
+RTA L+A WQ GF HGV+NTDN S+ GLTIDYGP+ F+D F+ + N TD+ G RY
Sbjct: 227 DRTAQLLALWQVYGFQHGVMNTDNFSMAGLTIDYGPYAFMDYFEKNAICNHTDVEG-RYS 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP---- 488
+ NQP + WN+ F K+ D+++ M+ Y + Y +M K++GL
Sbjct: 286 YNNQPFVARWNL--FVLINVLKKICDEEKLENYMKFYLSIHKKIYLDMMNKRVGLDASKS 343
Query: 489 -KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
N+ +I +LL + K+DY FF L+N+K+ + + +LDI ++
Sbjct: 344 GDANQFLIIELLGALESSKMDYNVFFYRLTNLKSFDDL----------SSILDIAV-FQD 392
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
W SY + + S E R +M VNPKY+L+NY+ Q AI+ A+ GD+ V LL
Sbjct: 393 PLRKWFDSYKRACVEQNSSFESRFEIMKKVNPKYILKNYMLQEAIEKADEGDYTLVNELL 452
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
K+ + P+DE E++A+ P + LSCSS
Sbjct: 453 KIAQNPFDEHEEFERFAQPTPM---KFANIKLSCSS 485
>gi|419837660|ref|ZP_14361098.1| hypothetical protein VCHC46B1_2841 [Vibrio cholerae HC-46B1]
gi|421344225|ref|ZP_15794628.1| hypothetical protein VCHC43B1_2806 [Vibrio cholerae HC-43B1]
gi|423735612|ref|ZP_17708809.1| hypothetical protein VCHC41B1_2388 [Vibrio cholerae HC-41B1]
gi|424009952|ref|ZP_17752889.1| hypothetical protein VCHC44C1_2439 [Vibrio cholerae HC-44C1]
gi|395940305|gb|EJH50986.1| hypothetical protein VCHC43B1_2806 [Vibrio cholerae HC-43B1]
gi|408629795|gb|EKL02464.1| hypothetical protein VCHC41B1_2388 [Vibrio cholerae HC-41B1]
gi|408856208|gb|EKL95903.1| hypothetical protein VCHC46B1_2841 [Vibrio cholerae HC-46B1]
gi|408863747|gb|EKM03221.1| hypothetical protein VCHC44C1_2439 [Vibrio cholerae HC-44C1]
Length = 489
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 191/466 (40%), Positives = 257/466 (55%), Gaps = 53/466 (11%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 70 PVAMKYAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 130 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ L D I HF TS Y
Sbjct: 183 GHFEHFFYTDQH--ANLKLLTDKVIEWHFPDCVQ---------------------TSKPY 219
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 220 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 279
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 280 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 336
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LD 540
LGL + ++ + +A + DYT F R LS + + +AV+ L
Sbjct: 337 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLV 386
Query: 541 IGKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
+ +E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 387 LDREAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAER 446
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 447 GDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|357405193|ref|YP_004917117.1| hypothetical protein MEALZ_1837 [Methylomicrobium alcaliphilum 20Z]
gi|351717858|emb|CCE23523.1| conserved hypothetical protein [Methylomicrobium alcaliphilum 20Z]
Length = 492
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 194/504 (38%), Positives = 279/504 (55%), Gaps = 54/504 (10%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGH 193
T+++P+ V++P+L+ + ++AD L LD E + FSG GA P A Y GH
Sbjct: 20 TRLNPTP-VQSPRLIKLNRNLADQLGLDLDELDNKTAAALFSGNLVPEGAEPLAMAYAGH 78
Query: 194 QFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCS 253
QFG + QLGDGRAI LGE+++ RW++QLKG+G+TP+SR DG A L +RE+L S
Sbjct: 79 QFGNFVPQLGDGRAILLGEVIDRAGRRWDIQLKGSGQTPFSRRGDGRAALGPVLREYLIS 138
Query: 254 EAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR 313
+AMH LGIPTTRAL VT+G+ V R+ PGA++ RVA S +R G++Q A R
Sbjct: 139 DAMHALGIPTTRALAAVTSGEPVFRE-------TPLPGAVLTRVASSHIRIGTFQYFAMR 191
Query: 314 GQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAE 373
ED + V+ LADYAI H+ +++ N Y+A V E
Sbjct: 192 --EDREAVKLLADYAIGRHYPDLKS---------------------APNPYSALLTTVQE 228
Query: 374 RTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCF 433
R ASL+A+W VGF HGV+NTDNM+I G TIDYGP F+D ++P ++ D G RY F
Sbjct: 229 RQASLIARWMHVGFIHGVMNTDNMTISGETIDYGPCAFMDQYNPDTVFSSIDDFG-RYAF 287
Query: 434 ANQPDIGLWNIAQFSTTLAAAKLIDDKE------ANYVMERYGTKFMDEYQAIMTKKLGL 487
NQP I WN+A+F+ TL L+ D++ A ++ R+ F + + M +KLGL
Sbjct: 288 GNQPRIAQWNLARFAETL--LPLLHDEQDSAIAIAVEIINRFSDIFDNFWLTGMRRKLGL 345
Query: 488 P---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ +KQ+I LL + K DYTN FRALS++ P+ + L D +
Sbjct: 346 AIEQQDDKQLIDSLLQLLQQHKADYTNVFRALSHIAEGPAT---------EPALNDYLPQ 396
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEV 603
+ + +W+ + L S +R M VNP Y+ RN+ + A+ AA + DF +
Sbjct: 397 TPD-FDNWLERWQTRLDQEPGSPAQRAEAMRQVNPAYIPRNHKVEQALSAAVQDEDFSKF 455
Query: 604 RRLLKLMERPYDEQPGMEKYARLP 627
LL ++ +P+ EQPG Y P
Sbjct: 456 EALLDVLNKPFTEQPGCRHYQEPP 479
>gi|422909699|ref|ZP_16944342.1| hypothetical protein VCHE09_1188 [Vibrio cholerae HE-09]
gi|341634459|gb|EGS59217.1| hypothetical protein VCHE09_1188 [Vibrio cholerae HE-09]
Length = 489
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 204/525 (38%), Positives = 277/525 (52%), Gaps = 64/525 (12%)
Query: 130 HACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYA 187
A YT V P ++N + W+ +A L E P+ L SG A P A
Sbjct: 18 QAFYTPVQPQP-LQNVRWGMWNTRLAQQFGLP----EAPNDELLASLSGQQLPADFSPVA 72
Query: 188 QCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSI 247
Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLRSSI
Sbjct: 73 MKYAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLRSSI 132
Query: 248 REFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSY 307
RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RFG +
Sbjct: 133 REYLCSEAMAGLGIATTRALALISSETPVYRE-------REERGALLVRLAHTHVRFGHF 185
Query: 308 QIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAW 367
+ Q ++ LAD I HF ++K YAA
Sbjct: 186 EHFFYTDQH--ANLKLLADKVIEWHFPDCVQISKP---------------------YAAL 222
Query: 368 AVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLP 427
+V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +D
Sbjct: 223 FSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHSDYQ 282
Query: 428 GRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL 487
G RY F QP IGLWN++ + L + LID + + Y + +M KLGL
Sbjct: 283 G-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSDHLNLHFSRLMRAKLGL 339
Query: 488 PKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ ++ + +A + DYT F R LS + + ++D+ +
Sbjct: 340 ATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN-----------EAVIDLVLD 388
Query: 545 RKEAWISWVLSYI----QELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
R+ A I W+ Y+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE G
Sbjct: 389 REAAKI-WLTRYLDRAARELGQEGGPISSSERCQAMRQVNPKYILRNYLAQQAIEFAERG 447
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
DF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 448 DFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|154245115|ref|YP_001416073.1| hypothetical protein Xaut_1167 [Xanthobacter autotrophicus Py2]
gi|154159200|gb|ABS66416.1| protein of unknown function UPF0061 [Xanthobacter autotrophicus
Py2]
Length = 494
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 200/535 (37%), Positives = 283/535 (52%), Gaps = 69/535 (12%)
Query: 107 WDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFE 166
+D+S+ R+LPG Y +P+ V P LV + +A+ L LDP+
Sbjct: 7 FDNSYARDLPG--------------FYAPATPT-PVTAPGLVKVNAPLAEELGLDPEALA 51
Query: 167 RPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLK 226
P F+G GA P A Y GHQFG + QLGDGRAI LGE+++ R ++QLK
Sbjct: 52 TPHAVEMFAGQHVPEGADPIALAYAGHQFGQFTPQLGDGRAILLGEVVDRAGRRRDIQLK 111
Query: 227 GAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNP 286
G+G TP+SR DG A L +RE++ SEAM LGIPTTRAL VTTG+ V RD
Sbjct: 112 GSGPTPFSRRGDGRAALGPVLREYIVSEAMAALGIPTTRALAAVTTGEPVLRD------- 164
Query: 287 KEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLS 346
+ PGA++ RVA S +R G++Q A+R + D VR LADY I H+ +
Sbjct: 165 RPLPGAVLARVAASHIRIGTFQFFAAR--KATDAVRQLADYTIARHYPELAG-------- 214
Query: 347 FSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDY 406
T Y A V R A+LVA+W VGF HGV+NTDNMS+ G TIDY
Sbjct: 215 -------------TPEPYLALLNGVIGRQAALVARWLLVGFIHGVMNTDNMSVSGETIDY 261
Query: 407 GPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE----- 461
GP F+DA+DP ++ D G RY + NQPDI WN+A+ + L L +DKE
Sbjct: 262 GPCAFMDAYDPETVFSSIDQMG-RYAYGNQPDIAHWNLARLAECL-IPLLGEDKEAAVAA 319
Query: 462 ANYVMERYGTKFMDEYQAIMTKKLGLP-------KYNKQIISKLLNNMAVDKVDYTNFFR 514
AN ++ + +F Y + + K+GL + + + +LL+ MA K D+T FR
Sbjct: 320 ANGALKEFPARFRAAYHSGLVAKIGLAGPDGEASEEDTTLALELLSVMAESKADFTLTFR 379
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
L + ADP E P++ + LD ++A+ +W + + L ++G +A M
Sbjct: 380 RLGALAADP-----EAGGPVRDLFLD-----RDAFDAWTTRWRERLAATGRDGAATRAAM 429
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
+ VNP ++ RN+L + I +A GDF L ++ P++EQP YA LPP+
Sbjct: 430 DRVNPLFIPRNHLVEQVIASATEGDFAPFETLNTVLAHPFEEQPAFAAYAGLPPS 484
>gi|118591066|ref|ZP_01548465.1| hypothetical protein SIAM614_15607 [Stappia aggregata IAM 12614]
gi|118436142|gb|EAV42784.1| hypothetical protein SIAM614_15607 [Stappia aggregata IAM 12614]
Length = 493
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 203/539 (37%), Positives = 292/539 (54%), Gaps = 69/539 (12%)
Query: 105 LNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKE 164
+D+S+ R+LPG + A+V P+LV ++ +A L LD
Sbjct: 8 FQFDNSYARDLPG---------------FYVAWEGAKVPAPELVLFNRDLATELNLDADL 52
Query: 165 FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQ 224
E P+ F+G GA P AQ Y GHQFG ++ QLGDGRA+ LGEI++ R ++Q
Sbjct: 53 LETPEGAEIFAGVRQPDGASPLAQVYAGHQFGGFSPQLGDGRALLLGEIIDSAGNRKDIQ 112
Query: 225 LKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDG 284
LKG+G TP+SR DG AV+ +RE++ EAMH LGIPTTRAL VTTG+ + RD
Sbjct: 113 LKGSGPTPFSRGGDGKAVVGPVLREYILGEAMHALGIPTTRALAAVTTGETIYRD----- 167
Query: 285 NPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSES 344
PK PGA++ RVA S LR G++Q A+RG+ D +R LADYAI RH N+
Sbjct: 168 GPK--PGAVLTRVAASHLRVGTFQYFAARGET--DKLRQLADYAIA---RHAPNLAGQ-- 218
Query: 345 LSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTI 404
S+ Y V ER A+L+A+W VGF HGV+NTDN +I G TI
Sbjct: 219 ----------------SDNYLRLFRGVVERQAALMAKWVLVGFVHGVMNTDNTTISGETI 262
Query: 405 DYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE--- 461
DYGP F+DA+DP+ ++ D G RY F QP I WN+A+ + TL DD++
Sbjct: 263 DYGPCAFIDAYDPAAVFSSID-HGGRYAFGRQPVIMQWNLARLAETLLPLIQPDDQDKAV 321
Query: 462 --ANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRAL 516
A+ + R+ + + + M K GL + ++ + +L+ + VDYT FFR L
Sbjct: 322 DLASTELARFPNLYRSAWLSGMRSKTGLQSEAEDDQDLFEAMLSALQEQSVDYTLFFRHL 381
Query: 517 SNVK-ADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
++ P D + P++ +D W+ + Q L G + E KA M+
Sbjct: 382 ADAAVGTPQKLRDLFMSPVQ---ID----------GWLERWRQRLEREGKAVAEIKAGMD 428
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
SVNP Y+ RN+L + A+ +AE+G++ V +LL +++ PY+E+ G E YA LP A+ P
Sbjct: 429 SVNPVYIPRNHLVEEALQSAEVGEYHLVNKLLDVLQSPYEEKSGFEAYA-LPAPAAFGP 486
>gi|453087159|gb|EMF15200.1| UPF0061-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 633
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 214/570 (37%), Positives = 293/570 (51%), Gaps = 84/570 (14%)
Query: 101 ALEDLNWDHSFVRELPGDP------------RTDSIPREVLHACYTKVSPSAEVENPQLV 148
++ DL ++F +LP D R PR V +A YT V P +LV
Sbjct: 21 SIRDLPKSNNFTSKLPADAEFPTPAASHRAERKALGPRLVRNAAYTYVRPEP-FSQSELV 79
Query: 149 AWSESVADSLELDPKEFERPDFPLFFSGA--TPLAG-------AVPYAQCYGGHQFGMWA 199
A S++ L +DP DF +G L G P+AQCYGG+QFG WA
Sbjct: 80 AVSKAALRDLAIDPASVTTDDFKKTVAGEHIVTLDGDEPSDKDIYPWAQCYGGYQFGSWA 139
Query: 200 GQLGDGRAITLGEILN-LKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHF 258
GQLGDGRAI+L E N + R+E+QLKGAGKTPYSRFADG AV+RSSIREF+ SEA++
Sbjct: 140 GQLGDGRAISLFETTNPVTGRRYEIQLKGAGKTPYSRFADGKAVVRSSIREFVVSEALNA 199
Query: 259 LGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDL 318
LGIP+TRAL L + R EP AIV R A+S++RFG++ + SRG D
Sbjct: 200 LGIPSTRALSLTLGPEERIR------RETTEPAAIVARFAESWIRFGTFDLPRSRG--DR 251
Query: 319 DIVRTLADYAIRHHFRHIENM-------NKSESLSFSTG---DEDHSVVDLTSNKYAAWA 368
D++R LADY F +N+ + + S G +E ++ N+YA
Sbjct: 252 DMLRKLADYVAEDVFAGWQNLPGRVPTTEAKDVVEVSRGVAKEEVQGEAEVAENRYARLF 311
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
EVA R A VA WQ GF +GVLNTDN SI GL+ID+GPF FLD FDP++TPN D
Sbjct: 312 REVARRNAKTVAAWQAYGFMNGVLNTDNTSIYGLSIDFGPFAFLDNFDPNYTPNHDD-HM 370
Query: 429 RRYCFANQPDIGLWNIAQFSTTL----AAAKLIDDKE---------------------AN 463
RY + NQP I WN+ + + L A +DD+E +
Sbjct: 371 LRYSYKNQPSIIWWNLIRLAEALGELIGAGSWVDDEEFVTQGVRKERADELIKRAETIID 430
Query: 464 YVMERYGTKFMDEYQAIMTKKLGLPKYN----KQIISKLLNNMAVDKVDYTNFFRALSN- 518
V E Y F+ EY+ +MT +LGL + K + S+LL+ + ++D+ + FR LSN
Sbjct: 431 RVKEEYEAVFLAEYKRLMTARLGLKQTKESDFKDLYSQLLDTLEALELDFNHTFRRLSNI 490
Query: 519 --VKADPSIPEDEL---------LVPLKAVLLDIGKERKEAWIS-WVLSYIQELLSSGIS 566
V D E+ L L ++ D ++R W++ W +Q+ SS +
Sbjct: 491 SMVDIDSHAKAKEVAGRFFHHEGLGSLVSLTEDQARDRLATWLTRWRTRILQDWPSSEEA 550
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
ER A M +VNP +V R+++ I E
Sbjct: 551 RTERIAAMKAVNPNFVPRSWILDEVITEVE 580
>gi|262192159|ref|ZP_06050319.1| UPF0061 domain-containing protein [Vibrio cholerae CT 5369-93]
gi|262031948|gb|EEY50526.1| UPF0061 domain-containing protein [Vibrio cholerae CT 5369-93]
Length = 489
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 191/466 (40%), Positives = 259/466 (55%), Gaps = 53/466 (11%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ +LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 70 PVAMKYAGHQFGVYNPELGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 130 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I +F TS Y
Sbjct: 183 GHFEHFFYTDQH--ANLKLLADKVIEWYFPDCVQ---------------------TSKPY 219
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 220 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 279
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + + +M K
Sbjct: 280 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSERLNLHFSRLMRAK 336
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LD 540
LGL + ++ + +A + DYT F R LS + + +AV+ L
Sbjct: 337 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLV 386
Query: 541 IGKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
+ +E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 387 LDREAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAER 446
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 447 GDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|359782969|ref|ZP_09286187.1| hypothetical protein PPL19_17965 [Pseudomonas psychrotolerans L19]
gi|359369115|gb|EHK69688.1| hypothetical protein PPL19_17965 [Pseudomonas psychrotolerans L19]
Length = 486
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 217/552 (39%), Positives = 291/552 (52%), Gaps = 73/552 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L DL +D+ F R L D T PR + ++P+LV S + L
Sbjct: 1 MKQLLDLTFDNRFAR-LGDDFSTRIDPRPL--------------DDPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP P+F F+GA A P A Y GHQFG + +LGDGR + LGE++N +
Sbjct: 46 DLDPAVAATPEFTQVFAGAQLWDSAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVVNDRG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TP+SR DG AVLRSSIREFL SEA+H LGIPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGLTPFSRMGDGRAVLRSSIREFLASEALHALGIPTTRALCVIGSSTQVVR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ + E GA + R+A S +RFG ++ Q L + L ++ + HF +
Sbjct: 166 E-------RLETGATLLRMAPSHVRFGHFEYFYYTRQHSL--LEQLGEHVLAAHFPDLLG 216
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
T +AA EV ER A L+A WQ GF HGV+NTDN S
Sbjct: 217 ---------------------TPEPWAALFREVVERNARLIAYWQAYGFCHGVMNTDNFS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILGLT D+GPF FLD FD F N +D G RY ++NQ I WN+A A A+ +
Sbjct: 256 ILGLTFDFGPFAFLDDFDAQFICNHSDHTG-RYSYSNQVPIAHWNLA------ALAQALT 308
Query: 459 DKEANYVMERYGTKFMDEYQA----IMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTN 511
K A ++ F+ YQA +M ++LGL + +++++ +LL M VDY
Sbjct: 309 PKVAVETLQESIALFLPLYQAHYLDLMRRRLGLQEARDEDQELVERLLALMQQGGVDYHL 368
Query: 512 FFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERK 571
FFR L EDE L V D R + W +Y L + G + ER
Sbjct: 369 FFRQLG---------EDEPAAALARVREDFIDLR--GFDDWSQAYRTRLDAEGGAAAERT 417
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
MN+VNP +VLRNYL Q AI+ AE GD+ EVR L +++ RP++ QPG E++A PP W
Sbjct: 418 TRMNAVNPLFVLRNYLAQQAIEQAEAGDYSEVRLLHEVLSRPFEAQPGRERFALRPPDWG 477
Query: 632 YRPGVCMLSCSS 643
+SCSS
Sbjct: 478 KH---LEISCSS 486
>gi|393757698|ref|ZP_10346522.1| hypothetical protein QWA_01235 [Alcaligenes faecalis subsp.
faecalis NCIB 8687]
gi|393165390|gb|EJC65439.1| hypothetical protein QWA_01235 [Alcaligenes faecalis subsp.
faecalis NCIB 8687]
Length = 488
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 209/519 (40%), Positives = 278/519 (53%), Gaps = 56/519 (10%)
Query: 131 ACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCY 190
A +T V P + N +L+ ++++A L LD P+F SG +PL G + + Y
Sbjct: 20 AFHTAVPPQP-LANARLLHVNQALAAQLGLDVSRLGEPEFLDVVSGQSPLPGGLTVSAVY 78
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG+WAGQLGDGRA LG+I + + ELQLKGAGKTPYSR DG AVLRSS+RE+
Sbjct: 79 SGHQFGVWAGQLGDGRAHLLGQIDTPEGPQ-ELQLKGAGKTPYSRMGDGRAVLRSSVREY 137
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH 310
L SEAM LGI T+RAL LVT+ V R+ E GAIV RVA SF+RFGS++
Sbjct: 138 LASEAMAGLGIATSRALALVTSDTPVYRESV-------ETGAIVTRVAPSFVRFGSFEHW 190
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
A+ D + +R L DY +R + + SE + + E
Sbjct: 191 AN----DAERLRELLDYVLRDFYPELRQDGDSE-----------------QERVCRFLQE 229
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
V R+A +VA WQ VGF HGV+NTDNMSILGLTIDYGP+GF+D F + N +D G R
Sbjct: 230 VTRRSAEMVADWQTVGFCHGVMNTDNMSILGLTIDYGPYGFMDRFRVNHVCNHSDNQG-R 288
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGT---KFMDEYQAIMTKKLGL 487
Y + QP I WN+ + LA+A ++ + V ER T F++ Y A + K GL
Sbjct: 289 YAWNAQPAIVHWNLYR----LASALMVLGLDVEVVKERLQTFEASFLNRYHANLQAKFGL 344
Query: 488 PKY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKE 544
+ + Q++ + D+T FRAL+ P E + L D ++
Sbjct: 345 RTWRADDAQLVDDWWRLLHNSGADFTLSFRALAQASKAP-----EAFLS----LFDGSED 395
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR 604
+ AW +Y Q L G E++ MN VNP YVLRN+L + AI AA D E+
Sbjct: 396 QARAWWQ---AYSQRLTLDGSDTPEQREAMNRVNPLYVLRNHLAEQAIQAAAKDDASEID 452
Query: 605 RLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
LL L+ PY E+ G E YA PP + V SCSS
Sbjct: 453 TLLMLLRDPYTERAGFEAYAMPPPEGSAELAV---SCSS 488
>gi|77456672|ref|YP_346177.1| hypothetical protein Pfl01_0444 [Pseudomonas fluorescens Pf0-1]
gi|121957903|sp|Q3KJ68.1|Y444_PSEPF RecName: Full=UPF0061 protein Pfl01_0444
gi|77380675|gb|ABA72188.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1]
Length = 487
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 296/550 (53%), Gaps = 68/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSAHVLPEP-IDNPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + +F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAMADTQEFAELFGGHKLWADAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IP++RA C++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPSSRAACVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+A S +RFG ++ + ++ E ++ H+
Sbjct: 166 E-------KQERAAMVLRLAPSHIRFGHFEYFYYTKRPEQQKLLG-----------EHVL 207
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
M+ E L Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 208 AMHYPECLE-------------QPEPYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD +F N +D G RY F+NQ +G WN++ + L LI
Sbjct: 255 SILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPVGQWNLSTLAQAL--TPLI 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFR 514
+ + Y F Y +M ++LG ++ ++ +LL M VDYT FFR
Sbjct: 312 SVEALRETLGLYLPLFQAHYLDLMRRRLGFTTAEDDDQMLLEQLLQLMQNSGVDYTLFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKAL 573
L A+ ++ L+ +DI + + +W Y+ + G +D E+R+A
Sbjct: 372 RLGEESAEQAVAR------LRDDFVDI-----KGFDAWGERYVARVARDGATDQEQRRAR 420
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 421 MHAVNPLYILRNYLAQKAIDAAEQGDYSEVRRLHAVLSNPFEEQPGMESYAERPPEWGKH 480
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 481 ---LEISCSS 487
>gi|407071650|ref|ZP_11102488.1| hypothetical protein VcycZ_18994 [Vibrio cyclitrophicus ZF14]
Length = 485
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 201/528 (38%), Positives = 282/528 (53%), Gaps = 58/528 (10%)
Query: 120 RTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATP 179
R ++PR YT + P+ + N Q ++W+ ++A+ L E + SG
Sbjct: 12 RFTALPR----LFYTPIQPTP-LSNVQWLSWNHNLANELGFPSFEDASEELLETLSGNVE 66
Query: 180 LAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADG 239
P A Y GHQFG + LGDGR + L +++ E ++L LKGAGKTPYSR DG
Sbjct: 67 PDQFSPVAMKYAGHQFGSYNPDLGDGRGLLLAQVVAKCGETFDLHLKGAGKTPYSRMGDG 126
Query: 240 LAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQ 299
AV+RS++RE+LCSEAM L IPTTRAL ++T+ V R+ K+E GA++ R A+
Sbjct: 127 RAVIRSTVREYLCSEAMAGLNIPTTRALAMMTSDTPVYRE-------KQEWGALLVRAAE 179
Query: 300 SFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDL 359
S +RFG ++ Q L + LAD I HF + K
Sbjct: 180 SHIRFGHFEHLFYTNQ--LAEHKLLADKVIEWHFPECLDAEKP----------------- 220
Query: 360 TSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSF 419
YAA + +RTA +VA WQ GF HGV+NTDNMSI+G T DYGPF FLD +DP
Sbjct: 221 ----YAAMFNLIVDRTAEMVALWQANGFAHGVMNTDNMSIIGQTFDYGPFAFLDEYDPRL 276
Query: 420 TPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQA 479
N +D G RY F QP IGLWN++ + +L+ L+D + +E+Y + +
Sbjct: 277 ICNHSDYQG-RYAFNQQPRIGLWNLSALAHSLSP--LVDKADLEAALEQYEPQMNGYFSQ 333
Query: 480 IMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA 536
+M +KLGL + + ++ + M+ +KVDY FFR LSN+ D P+D
Sbjct: 334 MMRRKLGLLSKQEGDSRLFESMFELMSQNKVDYPRFFRTLSNL--DTLKPQD-------- 383
Query: 537 VLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
++D+ +R+ A + WV +Y+Q S +R M VNPKY+LRNYL Q AID AE
Sbjct: 384 -VIDLVIDREAAKL-WVDNYLQRCELEESSVVKRCENMRQVNPKYILRNYLAQLAIDKAE 441
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
GD ++ L+ ++ PY E P E A LPP W G M +SCSS
Sbjct: 442 RGDSSDIDALMVVLANPYAEHPDYEHLAALPPEW----GKAMEISCSS 485
>gi|27363526|ref|NP_759054.1| hypothetical protein VV1_0039 [Vibrio vulnificus CMCP6]
gi|33517021|sp|Q8DG12.1|Y039_VIBVU RecName: Full=UPF0061 protein VV1_0039
gi|27359642|gb|AAO08581.1| Selenoprotein O and cysteine-containing protein [Vibrio vulnificus
CMCP6]
Length = 490
Score = 315 bits (808), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 204/517 (39%), Positives = 285/517 (55%), Gaps = 53/517 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y V+P ++N + V W+ +A L PK + P FSGA P + P A Y G
Sbjct: 21 YRLVTPQP-LDNNRWVIWNGELAQGFAL-PKHADDPQLLSVFSGAEPFSAFKPLAMKYAG 78
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ LGDGR + LGE+ N + + +++ LKGAG TP+SR DG AVLRS++RE+LC
Sbjct: 79 HQFGVYNPDLGDGRGLLLGEMQNQQGQWFDIHLKGAGLTPFSRMGDGRAVLRSTLREYLC 138
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGI TTRAL ++ + V R+ + E GA + R+AQ+ +RFG ++
Sbjct: 139 SEAMAALGIETTRALGMMVSDTPVYRE-------QVEQGACLIRLAQTHIRFGHFEHFFY 191
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
E D +R LAD I + +K Y A +V
Sbjct: 192 T--EQYDELRLLADNVIEWYMPECTAHDKP---------------------YLAMFEQVV 228
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA+++AQWQ VGF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G RY
Sbjct: 229 ARTATMIAQWQAVGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-RYA 287
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F QP + LWN++ + L + LI+ + + +Y + +M +KLGL +
Sbjct: 288 FDQQPRVALWNLSALAHAL--SPLIERDDLELALAQYEPTLGKVFSQLMRQKLGLLSQQE 345
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ ++ + + +A + DYT FFR LS + EDE V + L I ++ W
Sbjct: 346 GDSELFNAMFALLAENHTDYTRFFRTLSQLDR-----EDEQTV----IDLFIDRDAAHGW 396
Query: 550 ISWVLSYI-QELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
+S L + E +SG S ++R M +VNPKY+LRNYL Q AID A+ GDF EV L
Sbjct: 397 LSRYLERVAMEQTASGEAKSAQQRCEQMRAVNPKYILRNYLAQQAIDKAQQGDFSEVHTL 456
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
KL++ PYDEQ ME YA LPP W + ++SCSS
Sbjct: 457 AKLLKNPYDEQAEMEAYAHLPPEWGKK---MVISCSS 490
>gi|298370130|ref|ZP_06981446.1| YdiU family protein [Neisseria sp. oral taxon 014 str. F0314]
gi|298281590|gb|EFI23079.1| YdiU family protein [Neisseria sp. oral taxon 014 str. F0314]
Length = 504
Score = 315 bits (807), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 202/517 (39%), Positives = 277/517 (53%), Gaps = 53/517 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y+ V+P + P VA++ +A++L LD ++F+ + SG+ P A Y G
Sbjct: 35 YSSVNPEP-LNRPYWVAFNPCLAEALGLD-EDFQTASNLAYLSGSAERYRPQPLATVYSG 92
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + +LGDGRA+ LG+ + RWE QLKGAGKTPYSRFADG AVLRSSIRE+LC
Sbjct: 93 HQFGAYTPRLGDGRALLLGDSEDRHGRRWEWQLKGAGKTPYSRFADGRAVLRSSIREYLC 152
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL L + V R+ ++E A++ R+A SF+RFG ++
Sbjct: 153 SEAMHGLGIPTTRALALCGSQDPVYRE-------RQETAAVLTRIAPSFIRFGHFEYLFY 205
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
+G+E ++ LAD+ IRHH+ + +N YA ++
Sbjct: 206 QGRE--AELKLLADFLIRHHYPDCR---------------------VAANPYAELLHQIG 242
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTASL A WQ VGF HGVLNTDNMS LGLTIDYGPFGF+DA+D N +D G RY
Sbjct: 243 LRTASLAAAWQSVGFCHGVLNTDNMSALGLTIDYGPFGFMDAYDRHHVSNHSDGKG-RYA 301
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
+ QP I WN + + + L+ ++ N +E++ F Y M KLGL
Sbjct: 302 YNAQPYIAHWNFSALANCFES--LVPEEFINQTLEQWPDVFQTAYLHKMRGKLGLQHAES 359
Query: 493 QIISKLLNNMAVDK---VDYTNFFRAL---SNVKADPSIPEDELLVPLKAVLLDIGKERK 546
+ + + +A + VD+T FFR L S+V DP E E L G
Sbjct: 360 GDDALIADLLAALQDGNVDFTLFFRHLAKISHVHGDPLPIELENL---------FGGNVT 410
Query: 547 EAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
+ W+ Y + L ER MN NP YVLRN+L + I A+ GD+ E+ RL
Sbjct: 411 PVFNLWLGLYRRRLRGENSRSAERAERMNRTNPLYVLRNHLAEQVIVLAQSGDYREIERL 470
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ +E P++E+ +A PPA + +SCSS
Sbjct: 471 RRCLENPFEERAEFADFAEPPPAGS---TPVRVSCSS 504
>gi|255931617|ref|XP_002557365.1| Pc12g05180 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211581984|emb|CAP80145.1| Pc12g05180 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 615
Score = 315 bits (807), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 221/622 (35%), Positives = 320/622 (51%), Gaps = 95/622 (15%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDS------IPREVLH------ACYTKVSPSAEVENPQLV 148
+L +L + F +LP DP D+ PRE L A +T V P + + P+L+
Sbjct: 10 SLAELPKSNVFTSKLPPDPAFDTPESSHKAPRETLGPRMVKGALFTYVRPE-QTDEPELL 68
Query: 149 AWSESVADSLELDPKEFERPDFPLFFSGAT-----PLAGAVPYAQCYGGHQFGMWAGQLG 203
S L L P E + F +G G P+AQCYGG QFG WAGQLG
Sbjct: 69 GVSSKAMKDLGLKPGEEQTSRFKALVAGNEIWWNEEQGGVYPWAQCYGGWQFGSWAGQLG 128
Query: 204 DGRAITLGEILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIP 262
DGRAI+L E N +++ R+ELQLKGAG+TPYSRFADG AVLRSSIRE++ SEA+ LGIP
Sbjct: 129 DGRAISLFECTNPQTDTRYELQLKGAGRTPYSRFADGKAVLRSSIREYVVSEALSALGIP 188
Query: 263 TTRALCL-VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
TTRAL L + V R+ EPGAIV R A+S+LR G++ + RG D +++
Sbjct: 189 TTRALSLTLIPNAKVLRERL-------EPGAIVARFAESWLRIGTFDLLRVRG--DRELI 239
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFST-------------GDEDHSVVDLTSNKYAAWA 368
R LA Y F E++ SL GD+ D+ N++A
Sbjct: 240 RKLATYVAEDVFNGWESLPAVVSLRDQQSSTQIDNPQRGIPGDQVQEHEDVQENRFARLY 299
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
E+A R A VA WQ GF +GVLNTDN SI GL++DYGPF F+D FDP +TPN D
Sbjct: 300 REIARRNAKTVAAWQAYGFMNGVLNTDNTSIYGLSLDYGPFAFMDNFDPQYTPNHDD-HM 358
Query: 429 RRYCFANQPDIGLWNIAQFSTTL----AAAKLIDD-----------------KEANYVME 467
RY + NQP I WN+ + +L A +DD K A ++E
Sbjct: 359 LRYAYRNQPSIIWWNLVRLGESLGELIGAGNRVDDESFVNDGVTEEFEPELIKRAEKIIE 418
Query: 468 RYGTK----FMDEYQAIMTKKLGLPKYN----KQIISKLLNNMAVDKVDYTNFFR----- 514
R G + F++EY+ +M ++LGL + + S++L+ + ++D+ +FFR
Sbjct: 419 RVGEEFKAVFLNEYKRLMGQRLGLKTQAESDFQNLFSEMLDTLETLELDFNHFFRRLSGL 478
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIG------KERKEAWI-SWVLSYIQELLSSGISD 567
LSN++++ E + IG ++R W+ SW L +++ + +D
Sbjct: 479 TLSNLESEEGRREAASVFFHAEGFGGIGYTEATARDRIAKWLDSWRLRVLEDWGPA--ND 536
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAEL-GDFGEVRRLLKLMERPYDEQPGM-----E 621
EER+ M SVNP +V R ++ I+ E GD + R++++ P++++ G+ E
Sbjct: 537 EERQKAMKSVNPNFVPRGWILDEVIERVERKGDRDILDRIMQMSLNPFNDEWGLHRQEEE 596
Query: 622 KYARLPPAWAYRPGVCMLSCSS 643
++ P + M SCSS
Sbjct: 597 RFCGDVPKYKR---AMMCSCSS 615
>gi|195539627|gb|AAI68007.1| Unknown (protein for MGC:184811) [Xenopus (Silurana) tropicalis]
Length = 422
Score = 315 bits (807), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 183/425 (43%), Positives = 246/425 (57%), Gaps = 52/425 (12%)
Query: 105 LNWDHSFVRELPGDPRTDS-----IPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
L +D+ +R LP +P + PR+V AC+++V P+ + NP +VA S S L
Sbjct: 16 LTFDNLALRSLPVEPGDGTEEEARTPRQVPGACFSRVRPTPLL-NPTVVALSRSALSLLG 74
Query: 160 LDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE 219
L E E + +FSG L G+ P A CY GHQFG +AGQLGDG A+ LGE++N +
Sbjct: 75 LQVGE-EDEEATEYFSGNRLLPGSEPAAHCYCGHQFGNFAGQLGDGAAMYLGEVVNATGK 133
Query: 220 RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRD 279
RWE+QLKGAG TPYSR ADG VLRSSIREFLCSEAM LGIP+TRA VT V RD
Sbjct: 134 RWEIQLKGAGLTPYSRQADGRKVLRSSIREFLCSEAMSHLGIPSTRAGSCVTADSTVIRD 193
Query: 280 MFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQ---------EDLDIVRTLADYAIR 330
++YDGNPK+E +V R+A +FLRFGS++I + + DI + DY IR
Sbjct: 194 IYYDGNPKKEKCTVVSRIAPTFLRFGSFEIFKPTDEFTGRKGPSVDRNDIRIQMLDYVIR 253
Query: 331 HHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHG 390
+ I+ + + + K AA+ E+ +RTA LVA+WQ VGF HG
Sbjct: 254 TFYPDIQEKHAGNN----------------TEKNAAFFREITKRTARLVAEWQCVGFCHG 297
Query: 391 VLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTT 450
VLNTDNMSI+GLTIDYGPFGF+D +DP + N +D G RY + QP+I WN+ + +
Sbjct: 298 VLNTDNMSIVGLTIDYGPFGFIDRYDPEYICNGSDNMG-RYAYNKQPEICKWNLGKLAEA 356
Query: 451 L-------AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLL 499
L + ++DD+ Y +F + Y M KKLGL + + ++S LL
Sbjct: 357 LIPELPLSISQSILDDE--------YDAEFQNHYMEKMRKKLGLVRLKLDDDSHLVSDLL 408
Query: 500 NNMAV 504
M +
Sbjct: 409 ETMNI 413
>gi|429887958|ref|ZP_19369462.1| Selenoprotein O and cysteine-containing-like protein [Vibrio
cholerae PS15]
gi|429224957|gb|EKY31255.1| Selenoprotein O and cysteine-containing-like protein [Vibrio
cholerae PS15]
Length = 489
Score = 315 bits (806), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 192/468 (41%), Positives = 258/468 (55%), Gaps = 57/468 (12%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQFG++ LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 70 PVAMKYAGHQFGVYNPDLGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 130 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I HF TS Y
Sbjct: 183 GHFEHFFYTDQH--ANLKLLADKVIEWHFPDCVQ---------------------TSKPY 219
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 220 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 279
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + +M K
Sbjct: 280 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSECLNLHFSRLMRAK 336
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
LGL + ++ + +A + DYT F R LS + +E ++ L +LD
Sbjct: 337 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQG----NEAVIDL---VLD- 388
Query: 542 GKERKEAWISWVLSYIQ----ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAA 595
+EA +W+ Y++ EL G IS ER M VNPKY+LRNYL Q AI+ A
Sbjct: 389 ----REAAKTWLTRYLERAARELGQEGRPISTRERCQAMRQVNPKYILRNYLAQQAIEFA 444
Query: 596 ELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
E GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 445 ERGDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|443718092|gb|ELU08841.1| hypothetical protein CAPTEDRAFT_193573 [Capitella teleta]
Length = 418
Score = 315 bits (806), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 188/458 (41%), Positives = 261/458 (56%), Gaps = 46/458 (10%)
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIRE 249
Y GHQFG + QLGDGR + LGE++N + ++W+L LKGAG+TPYSRF DG AVLRS IRE
Sbjct: 3 YSGHQFGAYNPQLGDGRGLLLGELVNTEGDKWDLHLKGAGQTPYSRFGDGRAVLRSCIRE 62
Query: 250 FLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
+L SEA+H LGIPTTRALC+VT+ V R+ E G+ + R+A+S +RFG ++
Sbjct: 63 YLASEALHHLGIPTTRALCVVTSDTPVYRET-------TEAGSTLLRLARSHIRFGHFEY 115
Query: 310 HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
+ + ++ LADY I +F + M + V T Y +
Sbjct: 116 FFYNKR--YEALKELADYTIEQNFYDLPGMKE---------------VAGTDQGYQCFYS 158
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EV RTA+L+AQWQ GF HGV+NTDNMSILG T DYGP+GF+D F+ + N +D G
Sbjct: 159 EVIRRTATLIAQWQAAGFAHGVMNTDNMSILGDTFDYGPYGFIDDFNWHYICNHSDHSG- 217
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F+ QP+IG WN + L L DD E +++Y + Y +M KLGL
Sbjct: 218 RYAFSQQPEIGYWNCGRLGQALTP--LFDDGELIQKALDQYPQIYTQAYTRLMLDKLGLE 275
Query: 489 KYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
+ ++ ++S LL + DYT FFR LSN ++ + + + LV ++
Sbjct: 276 EEVEEDATLVSDLLQLLHDSHCDYTLFFRTLSNFPSNQTSEQLQQLVNHPSL-------- 327
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
W+ +Y + L + + D+ R+ M VNPKY+LRNYL Q AI+ AE GD+ E+
Sbjct: 328 ----APWLNTYQERLKKNPLDDQTRQKRMKQVNPKYILRNYLAQQAIEKAEKGDYQEIEH 383
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L+ ++ PYDE P E YA PP W + V SCSS
Sbjct: 384 LMNVLVSPYDEHPDFEHYAEKPPEWGKKLEV---SCSS 418
>gi|398350598|ref|YP_006396062.1| hypothetical protein USDA257_c07120 [Sinorhizobium fredii USDA 257]
gi|390125924|gb|AFL49305.1| UPF0061 protein R00982 [Sinorhizobium fredii USDA 257]
Length = 501
Score = 315 bits (806), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 202/505 (40%), Positives = 274/505 (54%), Gaps = 55/505 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y +V P++ V P L+ + +A+ L LD ER D FSG T AGA P A Y G
Sbjct: 29 YARVEPTS-VAEPWLIKLNRPLAEELGLDIAALER-DGAAIFSGNTVPAGAEPLAMAYAG 86
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE+++ +R ++QLKGAG+TPYSR DG A L +RE++
Sbjct: 87 HQFGTFVPQLGDGRAILLGEVVDRNGKRRDIQLKGAGQTPYSRRGDGRAALGPVLREYIV 146
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LG+PTTRAL TG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 147 SEAMHALGVPTTRALAATVTGQPVYREQIL-------PGAVFTRVASSHIRVGTFQFFAA 199
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D+D V+TLADY I H+ ++ DE N Y VA
Sbjct: 200 RG--DMDSVKTLADYVIDRHYPELK------------ADE---------NPYLGLLKAVA 236
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ER A+L+A+W +GF HGV+NTDNM+I G TID+GP F+DA+DP ++ D G RY
Sbjct: 237 ERQAALIARWLHIGFIHGVMNTDNMTISGETIDFGPCAFMDAYDPKKVFSSIDQFG-RYA 295
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE------ANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ + TL L D AN + YGT F + + M +K+G
Sbjct: 296 YANQPAIGQWNLARLAETL--VTLFDPTADVAVNLANDALGEYGTIFQNHWLDGMRRKIG 353
Query: 487 LPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L + + LL M D+T FRAL++ A+ + + E A L
Sbjct: 354 LSTAEDGDLERVQGLLALMHKGGADFTLAFRALAS-SAENAGGDVEF-----AKLF---- 403
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGE 602
+ EA W+ + + L ER A M +VNP ++ RN+ + AI+AA E DF
Sbjct: 404 QEPEALSPWLEDWRRRLEREARQPAERAAAMRAVNPAFIPRNHRVEQAIEAAIENADFSL 463
Query: 603 VRRLLKLMERPYDEQPGMEKYARLP 627
LL + RPY++QPG YA P
Sbjct: 464 FEALLDVTSRPYEDQPGHAAYAAPP 488
>gi|188533967|ref|YP_001907764.1| hypothetical protein ETA_18310 [Erwinia tasmaniensis Et1/99]
gi|259646501|sp|B2VEL5.1|Y1831_ERWT9 RecName: Full=UPF0061 protein ETA_18310
gi|188029009|emb|CAO96877.1| Conserved hypothetical protein YdiU [Erwinia tasmaniensis Et1/99]
Length = 479
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 206/518 (39%), Positives = 288/518 (55%), Gaps = 52/518 (10%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L+ +T + P ++N +L+ +S +A L LD + F+ + L+ SG G P AQ
Sbjct: 11 LNGFHTTLRPMP-LKNARLLYYSAELAQDLGLDERLFDAQNVGLW-SGERLAEGMQPLAQ 68
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG+WAGQLGDGR + LGE +++ LKGAG TPYSR DG AVLRS++R
Sbjct: 69 VYSGHQFGVWAGQLGDGRGLLLGEQQLPDGRKFDWHLKGAGLTPYSRMGDGRAVLRSTLR 128
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
EFL EAM+ LGIPT+RAL +VT+ + V R+ E GA++ RVA+S +RFG ++
Sbjct: 129 EFLAGEAMYHLGIPTSRALTVVTSDEPVYRE-------TTEAGAMLLRVAESHVRFGHFE 181
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
+ +GQ + V LADY IRHH+ + ++Y W
Sbjct: 182 HYYYQGQT--EKVTQLADYVIRHHWPELVQ---------------------EKDRYLLWF 218
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
+V +RTA ++A WQ VGF HGV+NTDNMSILGLT DYGP+GFLD + P N +D G
Sbjct: 219 SDVVQRTARMIAGWQSVGFAHGVMNTDNMSILGLTFDYGPYGFLDDYRPDLICNHSDHQG 278
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL- 487
RY F NQP IGLWN+ + + L+ L+ ++ + Y + M + M KLGL
Sbjct: 279 -RYSFENQPMIGLWNLNRLAHALSG--LMSPQQLKQALAGYEPELMRCWGEKMRAKLGLL 335
Query: 488 --PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER 545
K + I++ LL+ M + DYT FR LS + + +L P++ +D
Sbjct: 336 TPAKDDNNILTGLLSLMTKEGSDYTRTFRQLSQSE------QLQLRSPMRDEFID----- 384
Query: 546 KEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
++A+ SW + Q +L SDEER+ M NP VLRNYL Q AI+ AE D + R
Sbjct: 385 RDAFDSWYNVWRQRVLQEERSDEERQQTMKLANPALVLRNYLAQQAIERAEQDDISVLAR 444
Query: 606 LLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + + RP+D+ P A+ PP W + V SCSS
Sbjct: 445 LHQALSRPFDDAPEYADLAQRPPDWGKKLEV---SCSS 479
>gi|256073786|ref|XP_002573209.1| Crumbs complex protein; MAGUK homolog; cell polarity protein;
serine/threonine kinase [Schistosoma mansoni]
Length = 1461
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 196/475 (41%), Positives = 271/475 (57%), Gaps = 68/475 (14%)
Query: 107 WDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWS-ESVA---------- 155
+D+ ++ LP D ++SI R V +AC+T+VSP+ +++NP+LV +S +++A
Sbjct: 825 FDNIQLKSLPIDNGSNSI-RSVPNACFTRVSPT-KIDNPRLVLFSPDALALLNICHKINH 882
Query: 156 -DSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEIL 214
D K E + SG G+ P A CY G+QFG +AGQLGDG AI+LGE++
Sbjct: 883 LDKQNCKGKTEETNCLVEYLSGNKLWPGSNPTAHCYCGYQFGSFAGQLGDGAAISLGEVV 942
Query: 215 NLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGK 274
N + ERWELQLKGAG TP+SR DG VLRSS+REFLCSEAM++LGIPTTRA ++T+
Sbjct: 943 NEQGERWELQLKGAGLTPFSRQGDGRKVLRSSLREFLCSEAMYYLGIPTTRAASIITSDT 1002
Query: 275 FVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQ---------EDLDIVRTLA 325
V RDMFY G+ E +I RVA++F+RFGS++I S +L I+ L
Sbjct: 1003 LVERDMFYTGDSITEKASITSRVAKTFIRFGSFEISKSPDSITGRFGPSVGNLTILSQLT 1062
Query: 326 DYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGV 385
+Y I+ + HI D + ++ N Y + EV +RTA+LVA WQ V
Sbjct: 1063 NYVIQQFYPHI------------WSDYSNDIM----NCYLEFFKEVVKRTANLVALWQTV 1106
Query: 386 GFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIA 445
GF HGVLNTDNMSI+GLTIDYGPFGF+D F NT+D P RY +A QP+I WN A
Sbjct: 1107 GFCHGVLNTDNMSIIGLTIDYGPFGFMDQFTWDHISNTSD-PDGRYSYAQQPNICAWNCA 1165
Query: 446 QFSTTLAAA----------KLIDDKEANYVMERY----GTKFMDEYQAI----MTKKLGL 487
+ + L A K ID + N + ++ T +M ++++ M KKLGL
Sbjct: 1166 RLAECLIQALIDQQKYSSDKTIDKEFVNNLTRKFTNVLDTTYMSYFKSVYLERMRKKLGL 1225
Query: 488 --PK--YNKQIISKLLNNMAVDKVDYTNFFRALSNV------KADPSIPEDELLV 532
PK + +I L N M D+TN F AL + + D + E +L+V
Sbjct: 1226 FYPKDEIDADLIENLFNTMEKTGADFTNTFLALEDTLFQLFNENDSDLLEPDLIV 1280
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 67/151 (44%), Gaps = 21/151 (13%)
Query: 511 NFFRALSNVKADPSIPEDELLVPLKAVLLDIGKER-KEAWISWVLSYIQELL-------- 561
N L K S +++L L+ + + +ER K W W+ +Y L
Sbjct: 1314 NIIDQLEETKILKSKEKEKLYKELEHMTEEEYQERNKRLWSIWLRAYKTRLKIDFERNND 1373
Query: 562 SSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYD--EQPG 619
++ E LM SVNP+ VLRNYL + AI +A+ GD+ ++L + P+ +
Sbjct: 1374 NAKTQISECLNLMQSVNPRVVLRNYLAEEAIKSADKGDYTVAQQLFDSLTTPFKNPDTSS 1433
Query: 620 MEKYARL-------PPAWAYRPGVCMLSCSS 643
+ RL PP W+ + V SCSS
Sbjct: 1434 NNESCRLVSRIKYRPPNWSRKLRV---SCSS 1461
>gi|37679273|ref|NP_933882.1| hypothetical protein VV1089 [Vibrio vulnificus YJ016]
gi|39932480|sp|Q7MMI2.1|Y1089_VIBVY RecName: Full=UPF0061 protein VV1089
gi|37198016|dbj|BAC93853.1| conserved hypothetical protein [Vibrio vulnificus YJ016]
Length = 490
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 200/517 (38%), Positives = 283/517 (54%), Gaps = 53/517 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y V+P ++N + V W+ +A L PK + P FSGA P + P A Y G
Sbjct: 21 YRLVTPQP-LDNSRWVIWNGELAQGFAL-PKHADDPQLLSVFSGAEPFSAFKPLAMKYAG 78
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ LGDGR + LGE+ N + + +++ LKGAG TP+SR DG AVLRS+IRE+LC
Sbjct: 79 HQFGVYNPDLGDGRGLLLGEMQNQQGQWFDIHLKGAGLTPFSRMGDGRAVLRSTIREYLC 138
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGI TTRAL + + V R+ + E GA + R+AQ+ +RFG ++
Sbjct: 139 SEAMAALGIETTRALGMTVSDTPVYRE-------QVEQGACLIRLAQTHIRFGHFEHFFY 191
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
E D +R LAD I + +K Y A +V
Sbjct: 192 T--EQYDELRLLADNVIEWYMPECTAHDKP---------------------YLAMFEQVV 228
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
RTA+++AQWQ VGF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D G RY
Sbjct: 229 ARTATMIAQWQAVGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDYQG-RYA 287
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL---PK 489
F QP + LWN++ + L + L++ + + +Y + +M +KLGL +
Sbjct: 288 FDQQPRVALWNLSALAHAL--SPLVERDDLELALAQYEPTLGKVFSQLMRQKLGLLSQQE 345
Query: 490 YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ ++ + + +A + DYT FFR LS + ++ + + L I ++ W
Sbjct: 346 GDSELFNAMFTLLAENHTDYTRFFRTLSQLDSEGA---------QTVIDLFIDRDAARGW 396
Query: 550 ISWVLSYIQ-ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRL 606
+S L + E +SG S ++R M +VNPKY+LRNYL Q AID A+ GDF EV L
Sbjct: 397 LSRYLERVALEQTASGEAKSAQQRCEQMRAVNPKYILRNYLAQQAIDKAQQGDFSEVHTL 456
Query: 607 LKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
KL++ PYDEQ ME YA LPP W + ++SCSS
Sbjct: 457 AKLLKNPYDEQAEMEAYAHLPPEWGKK---MVISCSS 490
>gi|254507761|ref|ZP_05119892.1| conserved hypothetical protein [Vibrio parahaemolyticus 16]
gi|219549286|gb|EED26280.1| conserved hypothetical protein [Vibrio parahaemolyticus 16]
Length = 489
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 204/523 (39%), Positives = 283/523 (54%), Gaps = 53/523 (10%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPY 186
E+ +A Y+ V P +EN +AW+ +A+ L L P+ D F SG
Sbjct: 14 ELPNAFYSLVDPQP-LENSHWIAWNSVLAEQLGL-PENQPSGDLKYFLSGEGDYQTTPVL 71
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
A Y GHQFG + LGDGR + LGE+ + + +++ LKGAG TPYSR DG AVLRS+
Sbjct: 72 AMKYAGHQFGSYNPDLGDGRGLLLGEVTSPTGQMFDIHLKGAGLTPYSRMGDGRAVLRST 131
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
IRE+LCSEAM LGIPTTRAL ++T+ V RD K E GA++ RVA+S +RFG
Sbjct: 132 IREYLCSEAMAGLGIPTTRALGMLTSDTPVYRD-------KVESGALLLRVAESHIRFGH 184
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ Q L ++ LAD I ++ + + + YAA
Sbjct: 185 FEHFFYTNQ--LSELKLLADKVIEWYWPKCLD---------------------SESPYAA 221
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
+ E+TA ++A WQ GF HGV+NTDNMSILG T DYGPFGFLD ++P + N +D
Sbjct: 222 MFATIVEKTAHMIAYWQAYGFAHGVMNTDNMSILGQTFDYGPFGFLDDYEPGYICNHSDY 281
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG 486
G RY F QP IGLWN++ + L+ +ID E + ++ ++ +M KLG
Sbjct: 282 QG-RYAFDQQPRIGLWNLSALAHALSP--IIDKSELEQALSQFEVTLGKKFSRLMRAKLG 338
Query: 487 LP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L + + Q+ + + + ++VDYT F R LSN+ P +L I +
Sbjct: 339 LNCKLEQDSQLFNAMFELLHQNRVDYTRFMRELSNLDTQPVHNVSDLF---------IDR 389
Query: 544 ERKEAWISWVLSYIQ-ELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
E AW+ L+ + E+ +G I R M VNPKY+LRNYL Q AID AE GDF
Sbjct: 390 EAANAWLELYLARCECEVDEAGQAIPSTTRCEKMRQVNPKYILRNYLAQIAIDKAEQGDF 449
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
EV L +L++ P+DEQP + YA LPP W + +SCSS
Sbjct: 450 SEVEALAELLKHPFDEQPDKDAYANLPPEWGKK---MEISCSS 489
>gi|402486528|ref|ZP_10833359.1| hypothetical protein RCCGE510_02466 [Rhizobium sp. CCGE 510]
gi|401814651|gb|EJT06982.1| hypothetical protein RCCGE510_02466 [Rhizobium sp. CCGE 510]
Length = 500
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 196/497 (39%), Positives = 269/497 (54%), Gaps = 55/497 (11%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
V P L+ +E +A L LD + R D FSG GA P A Y GHQFG ++ Q
Sbjct: 36 VAEPWLIKLNEPLAAELGLDVEALRR-DGAAIFSGNLVPEGAEPLAMAYAGHQFGGFSPQ 94
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGRAI LGE+++ R+++QLKGAG TP+SR DG A + +RE++ SEAM LG+
Sbjct: 95 LGDGRAILLGEVIDRSGRRFDIQLKGAGPTPFSRRGDGRAAIGPVMREYIISEAMFALGV 154
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
P TRAL VTTG+ V R+ PGA+ RVA S +R G++Q A+RG D D V
Sbjct: 155 PATRALAAVTTGEPVYREEVL-------PGAVFTRVAASHIRVGTFQYFAARG--DTDGV 205
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
R LADY I H+ ++ + N Y A V+ER A+L+A+
Sbjct: 206 RALADYVIDRHYPALKAAD---------------------NPYLALFSAVSERQAALIAR 244
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
W VGF HGV+NTDNM++ G TID+GP F+DA+DP+ ++ D G RY +ANQP IG
Sbjct: 245 WLHVGFIHGVMNTDNMTVSGETIDFGPCAFMDAYDPATVFSSIDQQG-RYAYANQPGIGQ 303
Query: 442 WNIAQFSTTLAAAKLIDDK------EANYVMERYGTKFMDEYQAIMTKKLGLPKYNK--- 492
WN+A+ TL LID++ +AN V+ YG +F + A M K+GL
Sbjct: 304 WNLARLGETL--LPLIDEEPDGAVDKANAVIRAYGERFQAHWLAGMRDKIGLAGEEDGDL 361
Query: 493 QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISW 552
++ LL+ M D+T FR LS++ DE P A EA +W
Sbjct: 362 DLVQALLSLMQAQDADFTLTFRRLSDLAG------DETAKPTFAASF----REPEACGTW 411
Query: 553 VLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEVRRLLKLME 611
+ + + L + ER M SVNP ++ RN+ + AI+AA E GDF LL ++
Sbjct: 412 LTQWRERLSRDPQTGAERATAMRSVNPAFIPRNHRVEQAIEAAVENGDFSLFEALLSVLS 471
Query: 612 RPYDEQPGMEKYARLPP 628
+PY++QPG Y R PP
Sbjct: 472 KPYEDQPGFAAY-REPP 487
>gi|301120059|ref|XP_002907757.1| selenoprotein O, putative [Phytophthora infestans T30-4]
gi|301120061|ref|XP_002907758.1| selenoprotein O, putative [Phytophthora infestans T30-4]
gi|262106269|gb|EEY64321.1| selenoprotein O, putative [Phytophthora infestans T30-4]
gi|262106270|gb|EEY64322.1| selenoprotein O, putative [Phytophthora infestans T30-4]
Length = 637
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 217/642 (33%), Positives = 327/642 (50%), Gaps = 137/642 (21%)
Query: 107 WDHSFVRELPGDPRTDSIPREVLH-ACYTKVSPSAEVENPQLVAWSES--VADSLELDPK 163
+D++ +RELP D + R + AC+++V P+ + +P+LV S + + +EL+
Sbjct: 28 FDNAVLRELPIDTEPKNFVRSAVSGACFSRVDPTP-IASPELVVTSPNSLLLVGIELNES 86
Query: 164 EFERPDFPL---------------FFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAI 208
+ + D + +G T L GA AQCY GHQFG ++GQLGDG A+
Sbjct: 87 DSKSQDEGVNGEGDDLQPIETLVPILAGNTLLPGAETAAQCYCGHQFGFFSGQLGDGAAL 146
Query: 209 TLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALC 268
LGE++ + ERWELQLKG+G TPYSR ADG VLRS++REFLCSE MH LG+PTTRA
Sbjct: 147 YLGEVVAV-DERWELQLKGSGLTPYSRTADGRKVLRSTLREFLCSENMHALGVPTTRAGS 205
Query: 269 LVTTGKF-VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH------------ASRGQ 315
+VT+ + V RD+FY+G+ K EP A+V R+A+SFLRFGS++I ++ +
Sbjct: 206 VVTSKETQVLRDIFYNGDAKMEPTAVVTRIAKSFLRFGSFEIFKDEDKLTGLAGPSAHLE 265
Query: 316 EDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERT 375
+++R + D+ IR ++ I + KY + EV RT
Sbjct: 266 NKEEMMREMLDFTIRQYYSEISG----------------------ARKYEKFFQEVVRRT 303
Query: 376 ASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFAN 435
A LVA+WQ +GF HGVLNTDNMSI+G T+DYGPFGF++ FDP NT+D G RY +
Sbjct: 304 AMLVAKWQSIGFCHGVLNTDNMSIVGDTLDYGPFGFMEHFDPKHICNTSDDRG-RYRYEA 362
Query: 436 QPDIGLWNIAQFSTTLAAAKLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLP---KYN 491
QP++ WN + L L+ ++ ++E + + EY +M +KLGL K +
Sbjct: 363 QPEVCKWNCGVLADQLG---LVTERAGLEPILESFDAVYEAEYMRLMREKLGLSDEEKED 419
Query: 492 KQIISKLLNNMAVDKVDYTNFFRALSNVKA-DPSIPEDELLVPLKAVLLDIGKERK---- 546
K ++ L + +A D+T FR LS + + +++L L AV + ++++
Sbjct: 420 KMLVDTLFDVLAFTGADFTCTFRYLSELDVFETGDCREQVLNKLVAVSETLAQQKRKLEL 479
Query: 547 ------EAWISWVLSYIQE-------------------------LLSSGISDEERKALMN 575
+A V+ +QE L +DEER +
Sbjct: 480 DSGGVSDAQFDMVVMLLQENPVRARQYGITPALVAQIKANREAKKLLDATTDEERMDSIR 539
Query: 576 SV-------------------------------NPKYVLRNYLCQSAIDAAELGDFGEVR 604
+V NP +VLRN++ Q AID A GD+ V+
Sbjct: 540 TVWVDWIDVYISRVKEQGDAASDADRRRRMLDVNPLFVLRNHVAQKAIDFAHEGDYDAVQ 599
Query: 605 RLLKLMERPYDEQPGMEK---YARLPPAWAYRPGVCMLSCSS 643
+ +L+ P+DE P ++ YAR P + +C +SCSS
Sbjct: 600 HIFELVTNPFDE-PTDDRDLEYAR--PQDSSTAPLC-VSCSS 637
>gi|399007765|ref|ZP_10710265.1| hypothetical protein PMI20_03169 [Pseudomonas sp. GM17]
gi|398119312|gb|EJM09012.1| hypothetical protein PMI20_03169 [Pseudomonas sp. GM17]
Length = 487
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 210/551 (38%), Positives = 296/551 (53%), Gaps = 70/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-IDRPRLVVASPAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP+ + P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPEAAQSPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPQLGDGRGLLLGEVYNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIPTTRALC++ + V R
Sbjct: 106 EHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPTTRALCVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E A++ R++ S +RFG ++ + R ++ + L ++ + HF
Sbjct: 166 E-------KQERAAMLLRMSPSHVRFGHFEYFYYTKRPEQQ----KQLGEHVLAMHF--P 212
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
E + + E Y A EV ER A L+A+WQ GF HGV+NTDN
Sbjct: 213 ECLEQPEP-------------------YLAMFREVVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGVTFDFGPFAFLDDFDAHLICNHSDDQG-RYSFSNQVPIGQWNLSALAQAL--TPF 310
Query: 457 IDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFF 513
I + + + F Y +M ++LGL +++++ +LL M VDY+ FF
Sbjct: 311 ISVEALRETLGLFLPLFQAHYLDLMRRRLGLTSAEDEDQKLVERLLQLMQGSGVDYSLFF 370
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEE-RKA 572
R L + A+ ++ L+ +D ++ + +W Y I +E R+
Sbjct: 371 RRLGDEPAELAVAR------LRDDFVD-----RQGFDAWADLYKARGARDPIQGQELRRE 419
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AI AAE GD+ EVRRL ++ +P+++Q GM+ YA PP W
Sbjct: 420 RMHAVNPLYILRNYLAQKAIGAAEQGDYSEVRRLHAVLSKPFEQQAGMDSYAERPPEWGK 479
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 480 H---LEISCSS 487
>gi|114045811|ref|YP_736361.1| hypothetical protein Shewmr7_0299 [Shewanella sp. MR-7]
gi|121957887|sp|Q0I001.1|Y299_SHESR RecName: Full=UPF0061 protein Shewmr7_0299
gi|113887253|gb|ABI41304.1| protein of unknown function UPF0061 [Shewanella sp. MR-7]
Length = 484
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 197/525 (37%), Positives = 279/525 (53%), Gaps = 69/525 (13%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQCY 190
Y +V P + NP +AWSE VA ++L ++P L SG + GA YAQ Y
Sbjct: 15 YAQVYPQG-ISNPHWLAWSEDVAKLIDL-----QQPTDALLQGLSGNAAVEGASYYAQVY 68
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG + +LGDGR+I LGE L + W++ LKG G TPYSR DG AV+RS++REF
Sbjct: 69 SGHQFGGYTPRLGDGRSIILGEALGPQGA-WDVALKGGGPTPYSRHGDGRAVMRSAVREF 127
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI- 309
L SEA+H LG+PTTRAL ++ + V R+ +E AI R+A+S +RFG ++
Sbjct: 128 LVSEALHHLGVPTTRALAVIGSDMPVWRE-------SQETAAITVRLARSHIRFGHFEFF 180
Query: 310 -HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
H+ RGQ D + L ++ ++ H+ H+ DL Y AW
Sbjct: 181 CHSERGQAD--KLTQLLNFTLKQHYPHLS-------------------CDLAG--YKAWF 217
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++V + TA L+A WQ +GF HGV+NTDNMSILG + D+GPF FLD F F N +D P
Sbjct: 218 LQVVQDTAKLIAHWQAIGFAHGVMNTDNMSILGDSFDFGPFAFLDTFQEDFICNHSD-PE 276
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F QP IGLWN+ + + L DD A + +Y + Y +M KLGL
Sbjct: 277 GRYAFGQQPGIGLWNLQRLAQALTPVIPSDDLIA--ALNQYQHALVQHYLMLMRAKLGLA 334
Query: 489 ----------KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
+ + ++I + M +++DY+N +R + DPS L+
Sbjct: 335 ERADSTAEQDQQDLELIGRFTVLMEKNQLDYSNTWRRFGQL--DPSSAHSS----LRDDF 388
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+D+ + + +W +Y Q L E + NSVNPKY+LRNYL Q AI A E G
Sbjct: 389 IDLNE-----FDAWYQAY-QTRLGKVTDIEAWQQARNSVNPKYILRNYLAQEAIIAVEEG 442
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + RL +++ +P+ EQ E A+ PP W G+ M SCSS
Sbjct: 443 NLAPLERLHQVLRQPFAEQVEHEDLAKRPPDWG--QGLIM-SCSS 484
>gi|451846621|gb|EMD59930.1| hypothetical protein COCSADRAFT_100444 [Cochliobolus sativus
ND90Pr]
Length = 622
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 223/628 (35%), Positives = 317/628 (50%), Gaps = 91/628 (14%)
Query: 92 ESKMTKKLKALEDLNWDHSFVRELPGD------------PRTDSIPREVLHACYTKVSPS 139
E+ + +L L + + F LP D PR PR V A YT V P
Sbjct: 10 ENGSSSELHTLHSIPKSNVFTSNLPADAEFPTPKASHDAPREKLGPRMVKGALYTYVRPD 69
Query: 140 AEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT--------PLAGAVPYAQCYG 191
+ E +L+A S+ + L +E + DF +G P AG P+AQCYG
Sbjct: 70 PQGE-AELLAVSQRALHDIGLKEEEAKTDDFKDVVAGKKILTWDEKDPEAGIYPWAQCYG 128
Query: 192 GHQFGMWAGQLGDGRAITLGEILN-LKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
G+QFG WAGQLGDGRAI+L E N R+E+QLKGAG+TPYSRFADG AVLRSSIREF
Sbjct: 129 GYQFGQWAGQLGDGRAISLFETTNPTIGTRYEIQLKGAGRTPYSRFADGRAVLRSSIREF 188
Query: 251 LCSEAMHFLGIPTTRALCL-VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI 309
+ SE ++ +GIP+TRAL L + G + R+ EPGAIV R AQS++RFG++ +
Sbjct: 189 VVSEYLNAIGIPSTRALSLTLNKGSKIMRERI-------EPGAIVARFAQSWIRFGTFDL 241
Query: 310 HASRGQEDLDIVRTLADYAIRHHF----RHIENMNKSESLSFSTGDEDHSVVDLTS---- 361
RG D +RTLADY H + R + ++ D D+
Sbjct: 242 QRIRG--DRKTLRTLADYTAEHVYGGWDRLPSKLPAGDAKDVHAQTHDGVAKDIVEGEGE 299
Query: 362 ---NKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPS 418
N+Y + R A VA+WQ GF +GVLNTDN SILGL+ID+GPF FLD FDP+
Sbjct: 300 TAENRYVRLYRAILRRNAETVAKWQAYGFMNGVLNTDNTSILGLSIDFGPFAFLDTFDPT 359
Query: 419 FTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL----AAAKLIDD--------------- 459
+TPN D RY + NQP I WN+ + L A +DD
Sbjct: 360 YTPNHDDHM-LRYSYRNQPTIIWWNLVRLGEALGELFGAGNYVDDETFVEKGVTEEQAPG 418
Query: 460 --KEANYVMERYGTK----FMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDY 509
K A ++R G + F+ EY+ +MT +LGL + ++S+LL+ + ++D+
Sbjct: 419 VVKCAESAIDRAGEEYKAVFLAEYRRLMTLRLGLKTQKESDFDVLMSELLDCLEAYELDF 478
Query: 510 TNFFRALSNVK-ADPSIPEDELLVPLKAVLLDI-------GKERKEAWISWVLSYIQELL 561
+ FR L +++ AD + + + D G+ER W+ ++E
Sbjct: 479 HHAFRRLGDIRLADVDTEDKRIDTAGRFFRSDAAPRRESEGRERIAKWLGKWTERVREDW 538
Query: 562 SSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR-RLLKLMERPYDEQ--- 617
G DEERK M++VNPK+V R+++ I+ E ++ +++KL+ P+ E+
Sbjct: 539 GEG-KDEERKVAMDAVNPKFVPRSWILDELIERVEKKHERDILPQVMKLVLNPFQEEWKW 597
Query: 618 --PGMEKYARLPPAWAYRPGVCMLSCSS 643
E+Y P YR G+ SCSS
Sbjct: 598 NSDEEERYCGEVP--KYR-GMMQCSCSS 622
>gi|296424502|ref|XP_002841787.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295638035|emb|CAZ85978.1| unnamed protein product [Tuber melanosporum]
Length = 568
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 208/549 (37%), Positives = 298/549 (54%), Gaps = 58/549 (10%)
Query: 102 LEDLNWDHSFVRELP------------GDPRTDSIPREVLHACYTKVSPSAEVENPQLVA 149
L+DL + F +LP G R+ PR V A YT V P +NP+L+A
Sbjct: 18 LQDLPKSNVFTTKLPPDAQFPTPESSAGATRSQLGPRMVKAALYTYVRPDPVEDNPELLA 77
Query: 150 WSESVADSLELDPKEFERPDFPLFFSGATPLAG-AVPYAQCYGGHQFGMWAGQLGDGRAI 208
S S+ L E +P+F SG + P+AQCYGG QFG WAGQLGDGRAI
Sbjct: 78 VSPLALRSIGLASTEPTKPEFLRLVSGNGGFEDISYPWAQCYGGWQFGQWAGQLGDGRAI 137
Query: 209 TLGEILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRAL 267
+L E N +++ R+ELQLKGAG+TPYSRFADG AVLRSSIREF+ SE ++ +GIP+TRAL
Sbjct: 138 SLFEATNPETKIRYELQLKGAGQTPYSRFADGKAVLRSSIREFIVSEYLYSIGIPSTRAL 197
Query: 268 CL-VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLAD 326
L + G R+ E AIVCR A+S++R G++ + +RG D +R L+D
Sbjct: 198 SLTLLPGNQAIRENI-------ETCAIVCRFAESWIRIGTFDLLRARG--DRKNLRLLSD 248
Query: 327 YAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVG 386
Y + E ++ + S GD N+Y E+ R A VA+WQ G
Sbjct: 249 YVREEVLKTKERVDGEDGSSGVRGDG-------VRNRYEDMYREIVRRNALTVAKWQAYG 301
Query: 387 FTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQ 446
F +GVLNTDN SI+GL++D+GPF F+D+F+P FTPN D RYC+ NQP I WN+ +
Sbjct: 302 FMNGVLNTDNTSIMGLSLDFGPFSFMDSFNPKFTPNHDD-HTLRYCYKNQPTIIWWNLVR 360
Query: 447 FSTTL-----AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK----QIISK 497
+ L A +++D + V E Y + F+ EY+ +M +LG + + S
Sbjct: 361 LAEDLAELFAATPEMLDSE----VGEEYKSIFLAEYKQLMATRLGFTGLRETDMDDVYSP 416
Query: 498 LLNNMAVDKVDYTNFFRALSNVKADPSIPEDEL--------LVPLKAVLLDIGKERKEAW 549
LL+ + VD+ +FFR LS + + DE L+P KAVL + K E
Sbjct: 417 LLDILEDAHVDFGHFFRRLSELPIFELMESDEEAQLTAAEGLMP-KAVLTTVQKG-PEKI 474
Query: 550 ISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL-GDFGEVRRLLK 608
+ W+ Y + L D +R M VNPK++ +N++ + I E G+ G + ++K
Sbjct: 475 LKWLKLYAERLEEK--EDAKRMERMKKVNPKFIPKNWVLEEIIQRVEQKGERGVLGDVIK 532
Query: 609 LMERPYDEQ 617
L+E P+ ++
Sbjct: 533 LVENPFADR 541
>gi|399908970|ref|ZP_10777522.1| hypothetical protein HKM-1_05858 [Halomonas sp. KM-1]
Length = 492
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 194/494 (39%), Positives = 272/494 (55%), Gaps = 51/494 (10%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
V P LVA++ +A++L D F+ + ++FSG GA P AQ Y GHQFG + Q
Sbjct: 25 VREPHLVAFNRPLAEALGFDLAAFDAEEAAVWFSGNVVPHGAEPLAQAYAGHQFGGFVPQ 84
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGRA+ LGE+ + ++QLKGAG+TP+SR DG A L +RE+L SEAMH +GI
Sbjct: 85 LGDGRAVLLGEVTDRDGGLRDIQLKGAGRTPFSRGGDGRAPLGPVLREYLVSEAMHAMGI 144
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
PTTRAL VTTG+ V R G P EPGAI+ RVA S +R G++Q A+RG D+D V
Sbjct: 145 PTTRALAAVTTGERVMR-----GIP--EPGAILTRVASSHIRVGTFQYFAARG--DIDGV 195
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
R LA + I H+ +E+ E +Y V R A+L+A+
Sbjct: 196 RELAGHVIERHYPALESRQDGE-------------------RYLGLLEAVQARQAALIAK 236
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
W GVGF HGV+NTDN SI G TID+GP F++ +DP ++ D G RY ++NQP I
Sbjct: 237 WMGVGFIHGVMNTDNTSISGETIDFGPCAFMEQYDPKMVFSSID-EGGRYAYSNQPWIAQ 295
Query: 442 WNIAQFSTTLAAAKLIDD------KEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NK 492
WN+A+ + TL LIDD + A +++R+ ++ E+ A+M KLGL +K
Sbjct: 296 WNLARLAETL--LPLIDDDSERAVERATELLQRFPEQYEREWLAVMRAKLGLTSEKPGDK 353
Query: 493 QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISW 552
+I LL M + D+T FR L++V S + LV L ER E W
Sbjct: 354 ALIESLLAAMHRGRADFTLTFRRLADVAE--SAAAEASLVEL--------FERPEEIAGW 403
Query: 553 VLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEVRRLLKLME 611
+ + + L + ER M NP ++ RN+ Q A+ AA + D+G LL ++
Sbjct: 404 LEEWRERLAQEEQGESERAQRMRLANPAFIPRNHRVQQALTAAMDENDYGPFETLLDIIT 463
Query: 612 RPYDEQPGMEKYAR 625
P+D+QPG E+Y R
Sbjct: 464 HPFDDQPGREEYMR 477
>gi|348689837|gb|EGZ29651.1| hypothetical protein PHYSODRAFT_252691 [Phytophthora sojae]
Length = 642
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 230/673 (34%), Positives = 345/673 (51%), Gaps = 150/673 (22%)
Query: 85 TETDGGDESKMTKKL---KALEDLNWDHSFVRELPGDPRTDSIPREVLH-ACYTKVSPSA 140
T T+G +++++ L + L ++D++ +RELP D + R + AC+++V P+
Sbjct: 6 TATNG--RTRLSRSLSGWRRLPTAHFDNAVLRELPIDAEPKNFVRSAVSGACFSRVEPTP 63
Query: 141 EVENPQLVAWSESVADSLELDPKEFERPD---------------------FPLFFSGATP 179
+ +P+LV S +SL L E + D P+ +G
Sbjct: 64 -IASPELVVTS---PNSLLLAGIELIQGDDQDNSSDERGISDNLQPIDTLVPVL-AGNKL 118
Query: 180 LAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADG 239
L G+ AQCY GHQFG ++GQLGDG A+ LGEI+ + ERWELQLKG+G TPYSR ADG
Sbjct: 119 LPGSETAAQCYCGHQFGFFSGQLGDGAALYLGEIVT-EGERWELQLKGSGLTPYSRTADG 177
Query: 240 LAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKF-VTRDMFYDGNPKEEPGAIVCRVA 298
VLRS++REFLCSE M LG+PTTRA +V + + V RD+FY+GN K EP A+V R+A
Sbjct: 178 RKVLRSTLREFLCSENMFALGVPTTRAGSVVMSRETQVLRDIFYNGNAKMEPTAVVTRIA 237
Query: 299 QSFLRFGSYQIH------------ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLS 346
+SFLRFGS++I ++ ++ +++ + D+ IR +F
Sbjct: 238 KSFLRFGSFEIFKDEDEFTGMMGPSAHLEDKQEMMTKMLDFTIRQYFPEF---------- 287
Query: 347 FSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDY 406
G+E N Y + EV RTA LVA+WQ +GF HGVLNTDNMSI+G T+DY
Sbjct: 288 --FGEE---------NMYEKFFEEVVHRTAKLVAKWQTIGFCHGVLNTDNMSIVGDTLDY 336
Query: 407 GPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEA-NYV 465
GPFGF++ FDP NT+D G RY + +QPDI WN + L L+ D+ A
Sbjct: 337 GPFGFMEHFDPKHICNTSDDRG-RYRYESQPDICKWNCGVLADQLG---LVTDRAALEPA 392
Query: 466 MERYGTKFMDEYQAIMTKKLGL------PKYNKQIISKLLNNMAVDKVDYTNFFRALSNV 519
+E + + + +EY +M +KLGL K +K ++ L++ +A D+T+ FR LS +
Sbjct: 393 LEAFHSVYQEEYMRLMREKLGLTSQRGEEKEDKMLVDTLVDVLAHTGADFTSTFRYLSGL 452
Query: 520 KA-DPSIPEDELLVPLKAVLLDIGKERK----------EAWISWVLSYIQ---------- 558
A D + +L L V + ++++ +A ++ +Q
Sbjct: 453 DAVDSGDSRERVLNQLVGVSETLAQQKRKLEQEFGGVSDAQFDMIVMLLQENPVRARQYG 512
Query: 559 ---ELLSS------------GISDEER-------------------------------KA 572
EL++ ++DEER +
Sbjct: 513 ITHELVAQMKANRAAKEVLDAMTDEERMESIRTAWEDWIDVYISRIKEEGDAASYSERRQ 572
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDE--QPGMEKYARLPPAW 630
M VNP +VLRN++ Q AID A GD+ V+ + +L+ RP+D+ G +YAR P
Sbjct: 573 HMLKVNPLFVLRNHVAQKAIDLAYEGDYDGVQHIFELLTRPFDDPSDEGDLEYAR-PQDP 631
Query: 631 AYRPGVCMLSCSS 643
+ P +C +SCSS
Sbjct: 632 STAP-LC-VSCSS 642
>gi|229521850|ref|ZP_04411267.1| hypothetical protein VIF_002390 [Vibrio cholerae TM 11079-80]
gi|229340775|gb|EEO05780.1| hypothetical protein VIF_002390 [Vibrio cholerae TM 11079-80]
Length = 489
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 190/466 (40%), Positives = 257/466 (55%), Gaps = 53/466 (11%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A Y GHQF ++ +LGDGR + L E+ + + +++ LKGAG TPYSR DG AVLR
Sbjct: 70 PVAMKYAGHQFDVYNPELGDGRGLLLAEMATKQGDVFDIHLKGAGLTPYSRMGDGRAVLR 129
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SS+RE+LCSEAM LGI TTRAL L+++ V R+ +EE GA++ R+A + +RF
Sbjct: 130 SSLREYLCSEAMAGLGIATTRALALMSSETPVYRE-------REERGALLVRLAHTHVRF 182
Query: 305 GSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKY 364
G ++ Q ++ LAD I +F TS Y
Sbjct: 183 GHFEHFFYTDQH--ANLKLLADKVIEWYFPDCVQ---------------------TSKPY 219
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
AAW +V ERTA ++AQWQ GF HGV+NTDNMSILG T DYGPF FLD +DP+F N +
Sbjct: 220 AAWFSQVVERTALMIAQWQAYGFNHGVMNTDNMSILGETFDYGPFAFLDDYDPNFICNHS 279
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKK 484
D G RY F QP IGLWN++ + L + LID + + Y + +M K
Sbjct: 280 DYQG-RYAFDQQPRIGLWNLSALAHAL--SPLIDKDDLEAALGSYSEHLNLHFSRLMRAK 336
Query: 485 LGLPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LD 540
LGL + ++ + +A + DYT F R LS + + +AV+ L
Sbjct: 337 LGLATQQEGDGELFADFFALLANNHTDYTRFLRELSCLDRQGN----------EAVIDLV 386
Query: 541 IGKERKEAWISWVLSY-IQELLSSG--ISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL 597
+ +E +AWI L+ +EL G IS ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 387 LDREAAKAWIERYLTRAARELGQDGLPISTRERCQAMRQVNPKYILRNYLAQQAIEFAER 446
Query: 598 GDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
GDF E++RL ++ PY E P E+YA+LPP W + +SCSS
Sbjct: 447 GDFEEMQRLATVLASPYAEHPEFERYAKLPPEWGKK---LEISCSS 489
>gi|145589154|ref|YP_001155751.1| hypothetical protein Pnuc_0971 [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
gi|145047560|gb|ABP34187.1| protein of unknown function UPF0061 [Polynucleobacter necessarius
subsp. asymbioticus QLW-P1DMWA-1]
Length = 488
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 200/521 (38%), Positives = 281/521 (53%), Gaps = 70/521 (13%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAG----------AVPYAQCYG 191
+ +P VA+S S + + L E + P+ S LAG + P A Y
Sbjct: 19 IPDPYWVAFSPSASQLIHL---ELDASGLPVDSSWLEVLAGNQLKTSSHEFSNPIATAYS 75
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG+WAGQLGDGRAI LGEI ELQLKGAGKT YSR DG AVLRSSIREFL
Sbjct: 76 GHQFGVWAGQLGDGRAILLGEIAG-----QELQLKGAGKTQYSRMGDGRAVLRSSIREFL 130
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
CSEAMH LGIPT+RAL +V + V R+ E A+ R+A SFLR G ++ H
Sbjct: 131 CSEAMHALGIPTSRALSVVGSDMPVRRETI-------ETAAVCARLAPSFLRVGHFE-HY 182
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
+ Q + V+ LAD I+ H+ + LS + + Y ++
Sbjct: 183 AASQNQVR-VKELADLLIQEHY--------PDCLS-------------SKDPYLELFKQI 220
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
R A LVAQWQ VGF HGVLN+DN+S +G+T+DYGPFGFLD F N +D G RY
Sbjct: 221 CIRNAELVAQWQAVGFCHGVLNSDNISAIGITLDYGPFGFLDEFQIDHICNHSDQAG-RY 279
Query: 432 CFANQPDIGLWNIAQFSTT---LAAAKLIDDKEANYV---MERYGTKFMDEYQAIMTKKL 485
+ QP I WN+A ++T L + ++K + + +E + + +Q++ +KL
Sbjct: 280 AYHRQPQIMHWNMACLASTFIPLLENQYSEEKAQDILRDALEIFPKSYASTWQSLFRRKL 339
Query: 486 GLP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G + + +++ +LL M +VD+T FR LS++K + + L+ +D
Sbjct: 340 GFAIDHENDIKLVERLLQAMHDSRVDFTTLFRKLSDIKKTDCVDA----IALRDEFID-- 393
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ + W+ Y+ L D RK M+ VNPK++LRN+L Q AI+ A+ D+ E
Sbjct: 394 ---RVSIDQWLSDYLLRLQMELDDDATRKIKMDGVNPKFILRNHLAQEAINKAQQHDYAE 450
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
++ LL ++ RP+D+QP EKYA PP + V SCSS
Sbjct: 451 IKTLLNILSRPFDDQPQHEKYAIAPPKDLQKVDV---SCSS 488
>gi|409421941|ref|ZP_11259062.1| hypothetical protein PsHYS_07827 [Pseudomonas sp. HYS]
Length = 486
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 210/551 (38%), Positives = 300/551 (54%), Gaps = 71/551 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD + S+ E P AE P+LV SE+ L
Sbjct: 1 MKALDELTFDNRFAR--LGDAFSTSVLPE----------PIAE---PRLVIASEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L+P E P F F G A A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLEPTEAYSPVFAELFGGHKLWAEAEPRAMVYSGHQFGSYNPRLGDGRGLLLGEVRNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+ W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+ LGIP++RALC++ + V R
Sbjct: 106 QSWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALPALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ +E A++ R+A S +RFG ++ + +R E R LA++ + HF
Sbjct: 166 E-------TQERAAMLLRLAPSHVRFGHFEYFYYTRQPEQ---QRMLAEHVLNTHFAECR 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
+ + F T + ER A L+A+WQ GF HGV+NTDNM
Sbjct: 216 DAPEPYLAMFRT---------------------IVERNAELIARWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD +F N +D G RY ++NQ I WN++ + L +
Sbjct: 255 SILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSYSNQVPIAHWNLSALAQALTPFISV 313
Query: 458 D--DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNF 512
+ + + Y T ++D +M ++LGL + +K +I +LL M VDYT F
Sbjct: 314 EALKETLGLFLPLYETHYLD----LMRRRLGLTRAEDGDKLLIERLLQLMQPGAVDYTLF 369
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKA 572
FR L + P ++ L ++ +D+ + W Y+ L + E R+A
Sbjct: 370 FRQLGDQ------PAEQALQVVRDDFIDLA-----GFDLWSADYLARLQREPGNAEGRRA 418
Query: 573 LMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAY 632
M++VNP Y+LRNYL Q AI+AAE GD+ EVRRL +++ +P++EQ GM+ YA+ PP W
Sbjct: 419 RMHAVNPLYILRNYLAQRAIEAAEGGDYEEVRRLHQVLSKPFEEQAGMQAYAQRPPEWGK 478
Query: 633 RPGVCMLSCSS 643
+SCSS
Sbjct: 479 H---LEISCSS 486
>gi|437995034|ref|ZP_20853929.1| hypothetical protein SEEE5646_08432, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 50-5646]
gi|435336399|gb|ELP06344.1| hypothetical protein SEEE5646_08432, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 50-5646]
Length = 422
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 185/457 (40%), Positives = 257/457 (56%), Gaps = 48/457 (10%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
R+ L A YT + P+ ++N +L+ +++ +A L + F+ + + G T L G P
Sbjct: 10 RDELPATYTALLPTP-LKNARLIWYNDKLAQQLAIPASLFDATNGAGVWGGETLLPGMSP 68
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
AQ Y GHQFG+WAGQLGDGR I LGE L + LKGAG TPYSR DG AVLRS
Sbjct: 69 VAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGSTLDWHLKGAGLTPYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
+IRE L SEAMH+LGIPTTRAL +V + V R+ +E GA++ R+AQS +RFG
Sbjct: 129 TIRESLASEAMHYLGIPTTRALSIVASDTPVQRE-------TQETGAMLMRLAQSHMRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
++ R + + V+ LAD+AIRH++ +++ + KYA
Sbjct: 182 HFEHFYYR--REPEKVQQLADFAIRHYWPQWQDVPE---------------------KYA 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
W EVA RT L+A+WQ VGF+HGV+NTDNMSILGLTIDYGPFGFLD +DP F N +D
Sbjct: 219 LWFEEVAARTGRLIAEWQTVGFSHGVMNTDNMSILGLTIDYGPFGFLDDYDPGFIGNHSD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKL 485
G RY F NQP + LWN+ + + TL I+ N ++RY + Y M +KL
Sbjct: 279 HQG-RYRFDNQPLVALWNLQRLAQTL--TPFIEIDALNRALDRYQDALLTHYGQRMRQKL 335
Query: 486 GL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG 542
G K + ++++L + MA + DYT FR LS+ + + PL+ +D
Sbjct: 336 GFFTEQKDDNALLNELFSLMAREGSDYTRTFRMLSHTEQQSASS------PLRDTFID-- 387
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNP 579
+ A+ +W Y L + + D R+ M VNP
Sbjct: 388 ---RAAFDAWFDRYRARLRTEAVDDALRQQQMQRVNP 421
>gi|398975211|ref|ZP_10685359.1| hypothetical protein PMI24_01473 [Pseudomonas sp. GM25]
gi|398140435|gb|EJM29397.1| hypothetical protein PMI24_01473 [Pseudomonas sp. GM25]
Length = 487
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 209/550 (38%), Positives = 289/550 (52%), Gaps = 68/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A V P ++NP+LV S + L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSAHVLPEP-IDNPRLVVASPAALALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + +F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVADTQEFAELFGGHKLWADAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNAAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG+TP+SR DG AVLRSSIREFL SEA+H L IP++RA C++ + V R
Sbjct: 106 EHWDLHLKGAGQTPFSRMGDGRAVLRSSIREFLASEALHALNIPSSRAACVIGSDTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ K+E A+V R+A S +RFG ++ + ++ E ++ H+
Sbjct: 166 E-------KQERAAMVLRLAPSHIRFGHFEYFYYTKRPEQQKLLG-----------EHVL 207
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
M+ E L Y A E+ ER A L+A+WQ GF HGV+NTDNM
Sbjct: 208 AMHYPECLE-------------QPEPYLAMFREIVERNAELIAKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD +F N +D G RY F+NQ +G WN++ + L I
Sbjct: 255 SILGITFDFGPFAFLDDFDANFICNHSDDQG-RYSFSNQVPVGQWNLSALAQAL--TPFI 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNNMAV---DKVDYTNFFR 514
+ + Y F Y +M ++ G L + + VDYT FFR
Sbjct: 312 SVEALRETLGLYLPLFQAHYLDLMLRRFGFTTAEDDDQQLLEQLLQLMQNSGVDYTLFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISD-EERKAL 573
L A+ ++ L+ +DI + + +W Y+ + G +D E+R+A
Sbjct: 372 RLGEQSAEQAVAR------LRDDFVDI-----KGFDAWGERYVARVARDGAADQEQRRAR 420
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP Y+LRNYL Q AIDAAE GD+ EVRRL ++ P++EQPGME YA PP W
Sbjct: 421 MHAVNPLYILRNYLAQKAIDAAEQGDYSEVRRLHAVLSNPFEEQPGMESYAERPPEWGKH 480
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 481 ---LEISCSS 487
>gi|116251123|ref|YP_766961.1| hypothetical protein RL1355 [Rhizobium leguminosarum bv. viciae
3841]
gi|121957728|sp|Q1MJK8.1|Y1355_RHIL3 RecName: Full=UPF0061 protein RL1355
gi|115255771|emb|CAK06852.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 500
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 196/506 (38%), Positives = 276/506 (54%), Gaps = 56/506 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +P+A V P L+ +E++A L LD + R D FSG GA P A Y G
Sbjct: 28 FAAQAPTA-VAEPWLIKLNEALAAELGLDVEALRR-DGAAIFSGNLVPEGAEPLAMAYAG 85
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG ++ QLGDGRAI LGE+++ +R+++QLKGAG TP+SR DG A + +RE++
Sbjct: 86 HQFGGFSPQLGDGRAILLGEVVDRSGKRYDIQLKGAGPTPFSRRGDGRAAIGPVLREYII 145
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGIP TRAL VTTG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 146 SEAMFALGIPATRALAAVTTGEPVYREEVL-------PGAVFTRVAASHIRVGTFQYFAA 198
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D D VR LADY I H+ ++ N Y A V
Sbjct: 199 RG--DTDGVRALADYVIDRHYPALKE---------------------AENPYLALFDAVC 235
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ER A+L+A+W VGF HGV+NTDNM++ G TID+GP F+DA+DP+ ++ D G RY
Sbjct: 236 ERQAALIARWLHVGFIHGVMNTDNMTVSGETIDFGPCAFMDAYDPATVFSSIDQHG-RYA 294
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDK------EANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ TL LID + +AN V++ YG +F + A M +K+G
Sbjct: 295 YANQPGIGQWNLARLGETL--LPLIDAEPDSAVDKANAVIKSYGERFQAHWLAGMLEKIG 352
Query: 487 LPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L ++ LL+ M D+T FR LS++ D + E E +
Sbjct: 353 LAGEEDGDLDLVQALLSLMQAQGADFTLTFRRLSDLAGDDAA-EPEFAASFR-------- 403
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGE 602
+A +W+ + + L + ER M SVNP ++ RN+ + AI+AA + GDF
Sbjct: 404 -EPDACGAWLTQWRERLSRDPQTASERAIAMRSVNPAFIPRNHRVEQAIEAAVDNGDFSL 462
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPP 628
LL ++ +PY++QPG Y R PP
Sbjct: 463 FEALLSVLSKPYEDQPGFAAY-REPP 487
>gi|159483357|ref|XP_001699727.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281669|gb|EDP07423.1| predicted protein [Chlamydomonas reinhardtii]
Length = 622
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 189/441 (42%), Positives = 252/441 (57%), Gaps = 32/441 (7%)
Query: 95 MTKKLKA--LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSE 152
MT + +A LE LN+D+ +R LP DP R+V AC+++V P+ V+ PQLV S
Sbjct: 1 MTAQAEARTLETLNFDNLSLRALPVDPVEGGPVRQVEGACFSRVKPT-PVKGPQLVVASP 59
Query: 153 SVADSLELDPKEFER--PDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITL 210
L++ E L+FSG L GA P A CY GHQFG ++GQLGDG + L
Sbjct: 60 EALALLDIPASEVGEGGKKAALYFSGNKLLPGADPAAHCYCGHQFGYFSGQLGDGATMYL 119
Query: 211 GEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLV 270
GE++N + ERWELQ KGAGKTPYSR ADG VLRSS+REFLCSEAM+ LGIPTTRA V
Sbjct: 120 GEVVNGRGERWELQFKGAGKTPYSRQADGRKVLRSSLREFLCSEAMYNLGIPTTRAGTCV 179
Query: 271 TTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIH-------ASRG---QEDLDI 320
T+ V RD+ YDGN E + R+A +FLRFGS++I RG + I
Sbjct: 180 TSDSKVVRDIKYDGNAILERATTITRIAPTFLRFGSFEIFKPTDNFTGRRGPSAGHEAAI 239
Query: 321 VRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVA 380
+ + +AIR ++ I + + ++ G Y W EV RTASLVA
Sbjct: 240 LPVMLHHAIRTYYPAIWAAHDGDRIAAGVG-----------AMYLDWIKEVTRRTASLVA 288
Query: 381 QWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIG 440
WQ VG+ HGVLNTDNMSI+G+TIDYGPFGFLD +DP F N +D G RY + +QPDI
Sbjct: 289 AWQCVGWCHGVLNTDNMSIVGVTIDYGPFGFLDRYDPDFICNGSDDSG-RYDYKSQPDIC 347
Query: 441 LWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMD-EYQAIMTKKLGLPKY---NKQIIS 496
WN + + + A L + + V E + + ++ + LG + ++ + +
Sbjct: 348 RWNCERLAEAVRAV-LPEGRGKRAVAEVFDAVYRKCVWRGALVCTLGAGRAAVEDEGLAA 406
Query: 497 KLLNNMAVDKVDYTNFFRALS 517
LL+ M D+TN FR LS
Sbjct: 407 ALLSVMEATGADFTNTFRCLS 427
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 53/96 (55%), Gaps = 13/96 (13%)
Query: 549 WISWVLSY---IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRR 605
W SW+ Y +Q ++G D R A+MN+ NP+++LRN++ Q AI AE GDF EV R
Sbjct: 521 WRSWLAQYGAVLQRRAAAGADDSRRVAVMNATNPRFILRNWIAQQAIQRAEQGDFSEVAR 580
Query: 606 LLKLMERPYDEQPG----------MEKYARLPPAWA 631
+ L+ P+ E PG + Y LPP WA
Sbjct: 581 VFALLRNPFSEAPGPAAASGVSCALPVYDGLPPTWA 616
>gi|190890927|ref|YP_001977469.1| hypothetical protein RHECIAT_CH0001310 [Rhizobium etli CIAT 652]
gi|226695919|sp|B3PTN1.1|Y1310_RHIE6 RecName: Full=UPF0061 protein RHECIAT_CH0001310
gi|190696206|gb|ACE90291.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
Length = 500
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 194/498 (38%), Positives = 269/498 (54%), Gaps = 55/498 (11%)
Query: 141 EVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAG 200
+V P L+ +E +A L LD + R D FSG GA P A Y GHQFG ++
Sbjct: 35 QVAEPWLIKLNEPLAAELGLDVEALRR-DGAAIFSGNLVPEGAQPLAMAYAGHQFGGFSP 93
Query: 201 QLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLG 260
QLGDGRAI LGE+++ R+++QLKGAG TP+SR DG A + +RE++ SEAM LG
Sbjct: 94 QLGDGRAILLGEVIDRSGRRFDIQLKGAGPTPFSRRGDGRAAIGPVLREYIISEAMFALG 153
Query: 261 IPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDI 320
IP TRAL VTTG+ V R+ PGA+ RVA S +R G++Q A+RG D D
Sbjct: 154 IPATRALAAVTTGEPVYREEVL-------PGAVFTRVATSHIRVGTFQYFAARG--DTDG 204
Query: 321 VRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVA 380
VR L +Y I H+ ++ + N Y A V+ER A+L+A
Sbjct: 205 VRALTNYVIDRHYPALKEAD---------------------NPYLALFEAVSERQAALIA 243
Query: 381 QWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIG 440
+W VGF HGV+NTDNM++ G TID+GP F+DA+DP+ ++ D G RY +ANQP IG
Sbjct: 244 RWLHVGFIHGVMNTDNMTVSGETIDFGPCAFMDAYDPATVFSSIDQHG-RYAYANQPGIG 302
Query: 441 LWNIAQFSTTLAAAKLIDDK------EANYVMERYGTKFMDEYQAIMTKKLGLPKYNK-- 492
WN+A+ TL LIDD+ +AN V+ YG +F + A M +K+GL +
Sbjct: 303 QWNLARLGETL--LPLIDDEPDAAVDKANAVIRAYGERFQTHWLAGMREKIGLAREEDGD 360
Query: 493 -QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWIS 551
+++ LL+ M D+T FR LS++ D + D EA +
Sbjct: 361 LELVQTLLSLMQAQGADFTLTFRRLSDLAGDEAAEPD----------FAASFREAEASRN 410
Query: 552 WVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEVRRLLKLM 610
W+ + + L + R A M VNP ++ RN+ + AI+AA E GDF LL ++
Sbjct: 411 WLSRWRERLSRDPQTAGARAAAMRKVNPAFIPRNHRVEQAIEAAVENGDFSLFEALLTVL 470
Query: 611 ERPYDEQPGMEKYARLPP 628
RPYD+QP Y R PP
Sbjct: 471 ARPYDDQPDFAPY-REPP 487
>gi|297538638|ref|YP_003674407.1| hypothetical protein M301_1447 [Methylotenera versatilis 301]
gi|297257985|gb|ADI29830.1| protein of unknown function UPF0061 [Methylotenera versatilis 301]
Length = 505
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 208/549 (37%), Positives = 296/549 (53%), Gaps = 74/549 (13%)
Query: 91 DESKMTKKLKALE-DLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVA 149
D ++ KK+ A N+D+S+ R +P+ A + K P+ V+ P +V
Sbjct: 7 DLNEALKKISATSLGWNFDNSYTR----------LPK----AFFVKQKPT-PVKAPHIVL 51
Query: 150 WSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAIT 209
+++ +A +L L+ + + L FSG T GA P AQ Y GHQFG LGDGRAI
Sbjct: 52 FNQPLAATLGLNAEAILEDEASLAFSGNTIPVGAEPIAQAYAGHQFGHL-NMLGDGRAIL 110
Query: 210 LGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCL 269
LGE L ++ R+++QLKGAG T YSR DG A L +RE++ SEAMH LGIPTTR+L +
Sbjct: 111 LGEHLTPEANRYDIQLKGAGVTAYSRRGDGRAALGPMLREYIISEAMHALGIPTTRSLAV 170
Query: 270 VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAI 329
VTTG+ V RD PGAI+ RVA S +R G++Q AS +D +I+RTLADY +
Sbjct: 171 VTTGESVYRDSIL-------PGAILTRVASSHIRVGTFQFAAS--HDDPEIIRTLADYTL 221
Query: 330 RHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTH 389
HF E + T NKY + V + A L+AQW VGF H
Sbjct: 222 NRHF--------PECIG-------------TENKYLSLLNAVIDHQAKLIAQWMQVGFIH 260
Query: 390 GVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFST 449
GV+NTDNMSI G +ID+GP F+D++DP+ ++ D G RY F NQP I WN+ +F+
Sbjct: 261 GVMNTDNMSICGESIDFGPCAFMDSYDPATVFSSIDQQG-RYAFGNQPPIAQWNLTRFAE 319
Query: 450 TLAAAKLIDDKEANYVMERYGTKFMDEYQ----AIMTKKLGL---PKYNKQIISKLLNNM 502
TL D +EA + E+ F D+YQ A M KLGL + ++ +LL+ M
Sbjct: 320 TLLPLIHQDVEEAIRLAEKALRAFADKYQQYWLAGMRAKLGLFTEAPDDLALVEELLSCM 379
Query: 503 AVDKVDYTNFFRALS---NVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQE 559
+++DYTN FR LS N A P+ + + P +I+W + +
Sbjct: 380 KKNRMDYTNTFRGLSSSLNANA-PTAAQGNIDTP--------------DFITWQQQWNKR 424
Query: 560 LLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPG 619
L S S ++ ALM NP + RN+ ++A+ AAE GDF +LL+++ +P+ E
Sbjct: 425 LSSQTKSLDDAIALMLKTNPAVIPRNHQVEAALSAAESGDFTVQEKLLEVLSQPFKEDAS 484
Query: 620 MEKYARLPP 628
Y R+PP
Sbjct: 485 RASY-RMPP 492
>gi|398407583|ref|XP_003855257.1| hypothetical protein MYCGRDRAFT_99340 [Zymoseptoria tritici IPO323]
gi|339475141|gb|EGP90233.1| hypothetical protein MYCGRDRAFT_99340 [Zymoseptoria tritici IPO323]
Length = 627
Score = 312 bits (799), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 217/599 (36%), Positives = 311/599 (51%), Gaps = 91/599 (15%)
Query: 102 LEDLNWDHSFVRELPGDP------------RTDSIPREVLHACYTKVSPSAEVENPQLVA 149
+ DL ++F ++LP D R + PR V +A YT V P + +LV
Sbjct: 19 IRDLPKSNNFTQKLPPDAEYPTPASSHKADRKNLGPRLVKNAAYTFVRPEP-FKKSELVG 77
Query: 150 WSESVADSLELDPKEFERPDFPLFFSGATPLA----------GAVPYAQCYGGHQFGMWA 199
S++ L +DP + DF F+G + P+AQCYGG+QFG WA
Sbjct: 78 VSKTALRDLAIDPAAVKTEDFKGTFAGNRIITLEADKEPGEKDVYPWAQCYGGYQFGQWA 137
Query: 200 GQLGDGRAITLGEILNLKS-ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHF 258
GQLGDGRAI+L E N + +R+E+QLKGAGKTPYSRFADG AV+RSSIREF+ SEA++
Sbjct: 138 GQLGDGRAISLFETTNPNTNKRYEIQLKGAGKTPYSRFADGKAVVRSSIREFVVSEALNA 197
Query: 259 LGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDL 318
L IPTTRAL L + R EP AIV R A+++LRFG++ + SRG D
Sbjct: 198 LKIPTTRALSLTLGPEETVR------RETTEPAAIVARFAETWLRFGTFDLARSRG--DR 249
Query: 319 DIVRTLADYAIRHHFRHIENM-------NKSESLSFSTG---DEDHSVVDLTSNKYAAWA 368
++VR LA+YA F E++ + + + S G +E ++ N+YA
Sbjct: 250 NLVRKLANYAAEEVFPGWESLPGKVASNEEKDVVDPSRGVAKEEIQGEGEVAENRYARLF 309
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
E+A R A +VA WQ FT+GVLNTDN SI GL+ID+GPF FLD FDPS+TPN D
Sbjct: 310 REIARRNAKMVAHWQAYAFTNGVLNTDNTSIFGLSIDFGPFAFLDNFDPSYTPNHDD-HM 368
Query: 429 RRYCFANQPDIGLWNIAQ----FSTTLAAAKLIDDKE----------ANYVMER------ 468
RY + NQP I WN + F + +DD+E A+ +++R
Sbjct: 369 LRYAYKNQPSIIWWNCVRLAEAFGEVIGGGPWVDDEEFVEKGVRQERADELIKRAETIID 428
Query: 469 -----YGTKFMDEYQAIMTKKLGLPKYNK----QIISKLLNNMAVDKVDYTNFFRALSNV 519
Y FM EY+ +MT +LGL + + ++ S+LL+ + ++D+ + FR LS+V
Sbjct: 429 QVSAEYKAVFMAEYKRLMTARLGLKQCKQTDFDELYSELLDTLFALELDFNHTFRRLSSV 488
Query: 520 KADPSIPED------------ELLVPLKAVLLDIGKERKEAWI-SWVLSYIQELLSSGIS 566
D E+ E L + D + R W+ W + ++ S +
Sbjct: 489 VMDDLATEEKRKEVAGRFFHHEGLSGMAGSEAD-ARARIAKWLEKWRVRVFEDWEDSHEA 547
Query: 567 DEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLME---RPYDEQPGMEK 622
+ER A M +VNPK++ R+++ I+ E GE L +ME P+ E+ G K
Sbjct: 548 RDERLAAMKAVNPKFIPRSWVLDELIERVEKK--GEREILDHVMEMALNPFQEEWGWNK 604
>gi|343513306|ref|ZP_08750414.1| hypothetical protein VIS19158_03821 [Vibrio scophthalmi LMG 19158]
gi|342793402|gb|EGU29198.1| hypothetical protein VIS19158_03821 [Vibrio scophthalmi LMG 19158]
Length = 489
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 199/522 (38%), Positives = 283/522 (54%), Gaps = 63/522 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T VSP +EN + V+W+ S+A L P + + SG +P A Y G
Sbjct: 20 FTAVSPQP-LENTRWVSWNASLAAQFGL-PDQAPIGELKQQLSGELSHPQFMPLAMKYAG 77
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ +LGDGR + L E+ N + + +++ LKGAG TPYSR DG AVLRS+IRE+LC
Sbjct: 78 HQFGVYNPELGDGRGLLLCELENKQGKIFDVHLKGAGLTPYSRMGDGRAVLRSTIREYLC 137
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGI TTRAL ++ + V R+ K+E GA++ R+A+S +RFG ++
Sbjct: 138 SEAMAGLGIATTRALGMLASDSPVYRE-------KQEQGALLLRMAESHIRFGHFEHFFY 190
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
Q L ++ LAD I ++ + +S YAA +V
Sbjct: 191 TNQ--LSELKLLADKVIEWYWPELAEAEQS---------------------YAAMFEQVV 227
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
+ TA ++AQWQ +GF HGV+NTDNMSILG T DYGPF FLD +D S+ N +D G RY
Sbjct: 228 DNTALMIAQWQAIGFCHGVMNTDNMSILGQTFDYGPFAFLDDYDASYICNHSDYQG-RYA 286
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
F QP I LWN++ L + LID + + ++ + Y M KLGL K +
Sbjct: 287 FNQQPRIALWNLSALGHAL--SPLIDKAQIEAALAQFEPRLQQYYSQQMRAKLGLHKKLE 344
Query: 493 Q---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
Q + L + + K DYT F R LSN+ S P +L I ++ +AW
Sbjct: 345 QDGELFVMLFDLLEQHKPDYTRFMRDLSNIDRHGSQPVIDLF---------IDRDAAKAW 395
Query: 550 ISWVLSYI-------QELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
+ L+ ++++++ I R M + NPKYVLRNYL Q AID AE GD+ +
Sbjct: 396 LDLYLARCELEVDEDEQIVTAAI----RCEAMRANNPKYVLRNYLLQLAIDKAEQGDYSD 451
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
V +L +++ P+DEQ ME+ A+LPP W G M +SCSS
Sbjct: 452 VEQLARVLVTPFDEQRHMEELAKLPPEW----GKGMEISCSS 489
>gi|423689547|ref|ZP_17664067.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
fluorescens SS101]
gi|388000795|gb|EIK62124.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
fluorescens SS101]
Length = 487
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 213/552 (38%), Positives = 300/552 (54%), Gaps = 72/552 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S++ L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-LDEPRLVVASKAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP + P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDPAVAQTPVFAELFGGHKLWAEAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TPYSR DG AVLRSSIREFL SEA+H LGIPT+RALC++ + V R
Sbjct: 106 EHWDLHLKGAGMTPYSRMGDGRAVLRSSIREFLASEALHALGIPTSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E GA+V R+A S +RFG ++ + + ++ ++ H+
Sbjct: 166 E-------KQERGAMVLRLAHSHIRFGHFEYFYYTKKPEQQAELA------------EHV 206
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
N++ E Y A E+ ER A ++A+WQ GF HGV+NTDN
Sbjct: 207 LNLHYPECRE-------------QPEPYLAMFREIVERNAEMIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDHEG-RYSFSNQVPIGQWNLSALAQALTPFIS 312
Query: 457 IDD-KEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNF 512
+D KEA + Y + Y +M ++LGL + ++ ++ LL M VDYT F
Sbjct: 313 VDALKEA---LGLYLPLYQAHYLDLMRRRLGLTTAEEDDQTLVEGLLKLMQNSGVDYTLF 369
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERK 571
FR L + A ++ L+ +D+ + +W Y + G + E+R+
Sbjct: 370 FRRLGDESATLAVAR------LRDDFVDMA-----GFDAWAERYKARVARDGDYTQEQRR 418
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
M++VNP Y+LRNYL Q+AI AAE GD+ E+RRL +++ +P++EQ GME+YA+ PP W
Sbjct: 419 ERMHAVNPLYILRNYLAQNAIAAAEAGDYSEIRRLHEVLSKPFEEQAGMEQYAQRPPDWG 478
Query: 632 YRPGVCMLSCSS 643
+SCSS
Sbjct: 479 KH---LEISCSS 487
>gi|169605071|ref|XP_001795956.1| hypothetical protein SNOG_05551 [Phaeosphaeria nodorum SN15]
gi|160706702|gb|EAT86615.2| hypothetical protein SNOG_05551 [Phaeosphaeria nodorum SN15]
Length = 621
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 221/612 (36%), Positives = 312/612 (50%), Gaps = 97/612 (15%)
Query: 111 FVRELPGD------------PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
F + LP D PR PR V A YT V P + E +L+A S+ L
Sbjct: 28 FTQNLPADDAFPTPKESHDSPRQKLGPRMVKDALYTYVRPDPQGE-AELLAVSQRALQDL 86
Query: 159 ELDPKEFERPDFPLFFSG--------ATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITL 210
L +E + +F SG + P G P+AQCYGG+QFG WAGQLGDGRAI+L
Sbjct: 87 GLSEEEAKSDEFKEVVSGKKILTWDESKPDEGIYPWAQCYGGYQFGQWAGQLGDGRAISL 146
Query: 211 GEILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCL 269
E N ++ R+E+QLKGAG+TPYSRFADG AVLRSSIREF+ SE ++ + IPTTRAL L
Sbjct: 147 FETTNPSTKTRYEIQLKGAGRTPYSRFADGRAVLRSSIREFVVSEYLNAINIPTTRALSL 206
Query: 270 -VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYA 328
+ G + R+ EPGAIV R AQS++RFG++ + RG D + +RT+ADY
Sbjct: 207 TLNNGSKIMRERI-------EPGAIVARFAQSWIRFGTFDLQRMRG--DRNTLRTIADYT 257
Query: 329 IRHHFRHIENMNKSESLSFSTGDEDHSVVDL-------------TSNKYAAWAVEVAERT 375
H + + + L E HS + N+YA +
Sbjct: 258 AEHVYGGWDKL--PSKLLPGDAKEVHSKTTTGIAKETLEGEGTDSENRYARLYRAILRAN 315
Query: 376 ASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFAN 435
A VA+WQ GF +GVLNTDN SILGL+ID+GPF FLD FDP++TPN D RY + N
Sbjct: 316 ALTVAKWQAYGFMNGVLNTDNTSILGLSIDFGPFAFLDTFDPTYTPNHDD-HMLRYSYRN 374
Query: 436 QPDIGLWNIAQFSTTL-----AAAKLIDD----------------KEANYVM----ERYG 470
QP I WN+ + L A AK+ D+ K A V+ E Y
Sbjct: 375 QPTIIWWNLVRLGEALGELMGAGAKVDDEVFVEKGVHADDADELVKRAETVIDAAGEEYK 434
Query: 471 TKFMDEYQAIMTKKLGLPKYN----KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIP 526
F+ EY+ +MT +LGL ++++S+LL+ + ++D+ + FR LS+VK I
Sbjct: 435 AVFLAEYRRLMTLRLGLKTEKDGDFEELMSELLDCLEAFELDFHHAFRRLSSVKL-SEID 493
Query: 527 EDELLVPLKAVLLDIG---------KERKEAWISWVLSYIQELLSSGISDEERKALMNSV 577
+E + G +ER W+ + ++E G DEER+ M+ +
Sbjct: 494 TEEQRKDVAGRFFRAGEAPRQEADSRERIGKWLGKWAARVKEDWGEG-KDEERRTAMDKI 552
Query: 578 NPKYVLRNYLCQSAIDAAELGDFGEVR-RLLKLMERPYDE-----QPGMEKYARLPPAWA 631
NPK+V R+++ ID E E+ +++KL P++E + E++ P +
Sbjct: 553 NPKFVPRSWILDELIDRVEKKGEREILPQIMKLALNPFEEHWAWDEAEEERFCGDVPKYK 612
Query: 632 YRPGVCMLSCSS 643
G+ SCSS
Sbjct: 613 ---GMMQCSCSS 621
>gi|310794557|gb|EFQ30018.1| hypothetical protein GLRG_05162 [Glomerella graminicola M1.001]
Length = 633
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 211/559 (37%), Positives = 289/559 (51%), Gaps = 71/559 (12%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSG-- 176
PR PR V +A +T V P E+P+L+A S + + + + E +F +G
Sbjct: 46 PRDQIAPRGVRNAAFTWVRPET-AEDPELLAVSPAAMRDIGIKEGDEETEEFRQTVAGNR 104
Query: 177 -----ATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE-RWELQLKGAGK 230
L G P+AQCYGG QFG WAGQLGDGRAI+L E N +S+ R+ELQLKGAG
Sbjct: 105 LHGWDEEKLEGGYPWAQCYGGFQFGQWAGQLGDGRAISLFETTNPESKVRYELQLKGAGI 164
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSRFADG AVLRSSIREF+ SEA+H LGIP+TRAL L K R EP
Sbjct: 165 TPYSRFADGKAVLRSSIREFVVSEALHALGIPSTRALALTLLPKSKVR------RETVEP 218
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENM--------NKS 342
GAIV R AQS++R G++ + +RG D ++RTLA Y F E + +
Sbjct: 219 GAIVLRFAQSWIRLGNFDLPRARG--DRAMIRTLATYVAEDVFGGWETLPARLASPDKPA 276
Query: 343 ESLSFSTG---DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSI 399
E L + G E D + N++ EVA R A VA+WQ GF +GVLNTDN S+
Sbjct: 277 ECLEPARGVPATEVQGPEDSSENRFTRLFREVARRNALTVAKWQAYGFMNGVLNTDNTSV 336
Query: 400 LGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL-----AAA 454
GL+ID+GPF F+D FDP++TPN D RY + NQP I WN+ +F L A A
Sbjct: 337 AGLSIDFGPFAFMDNFDPAYTPNHDDHL-LRYSYRNQPTIIWWNLVRFGEALGELIGAGA 395
Query: 455 KLIDDKEAN--------------------YVMERYGTKFMDEYQAIMTKKLGLPKYNK-- 492
+ +D N V E Y FM EY+ +M ++LGL + +
Sbjct: 396 GVDEDAFVNNGVEESQAEALVARAEKLIMQVGEEYKALFMAEYKRLMAQRLGLKTFKESD 455
Query: 493 --QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPED-------ELLVPLKAVLLDIGK 543
++ S LL+ M ++D+ +FFR LS+VK ED V K
Sbjct: 456 FDELFSNLLDTMESHELDFNHFFRRLSSVKLSDIADEDACRKTAARFFHAEDGVAGGEAK 515
Query: 544 ERKE--AWIS-WVLSYIQELLSSGIS--DEERKALMNSVNPKYVLRNYLCQSAIDAAEL- 597
R + AW+ W +++ S D ER+ M +VNP +V R ++ I E
Sbjct: 516 GRADVGAWLGKWRARVVEDWGEDDASRGDAEREKAMKAVNPNFVPRGWVLDEIIKRVEKD 575
Query: 598 GDFGEVRRLLKLMERPYDE 616
G+ +RR++ + P+++
Sbjct: 576 GERDVLRRVMHMALHPFED 594
>gi|424874405|ref|ZP_18298067.1| hypothetical protein Rleg5DRAFT_5958 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393170106|gb|EJC70153.1| hypothetical protein Rleg5DRAFT_5958 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 500
Score = 311 bits (797), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 197/507 (38%), Positives = 276/507 (54%), Gaps = 58/507 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +P+A V P L+ +E++A L LD + R D FSG GA P A Y G
Sbjct: 28 FAAQAPTA-VAEPWLIKLNEALAAELGLDVEALRR-DGAAIFSGNLVPEGAEPLAMAYAG 85
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG ++ QLGDGRAI LGE+++ +R+++QLKGAG TP+SR DG A + +RE++
Sbjct: 86 HQFGGFSPQLGDGRAILLGEVVDRSGKRYDIQLKGAGPTPFSRRGDGRAAVGPVLREYII 145
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGIP TRAL VTTG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 146 SEAMFALGIPATRALAAVTTGEPVYREEVL-------PGAVFTRVAASHVRVGTFQYFAA 198
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D D VR LADY I H+ ++ N Y A+ V
Sbjct: 199 RG--DTDGVRALADYVIDRHYPALKE---------------------AENPYLAFFDAVC 235
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ER A+L+A+W VGF HGV+NTDNM++ G TID+GP F+DA+DP+ ++ D G RY
Sbjct: 236 ERQAALIARWLHVGFIHGVMNTDNMTVSGETIDFGPCAFMDAYDPATVFSSIDQHG-RYA 294
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDK------EANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ TL LID + +AN V++ YG +F + A M +K+G
Sbjct: 295 YANQPGIGQWNLARLGETL--LPLIDAEPDSAVDKANVVIKSYGERFQAHWLAGMREKIG 352
Query: 487 LPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSI-PEDELLVPLKAVLLDIG 542
L ++ LL+ M D+T FR LS++ D + PE
Sbjct: 353 LAGEEDGDLDLVQALLSLMQAQGADFTLAFRRLSDLAGDDAAGPE-----------FAAS 401
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFG 601
EA +W+ + + L + ER M +VNP ++ RN+ + AI+AA E GDF
Sbjct: 402 FREPEACGAWLTQWRERLSRDPQTASERAIAMRNVNPAFIPRNHRVEQAIEAAVENGDFS 461
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPP 628
LL ++ +PY++QPG Y R PP
Sbjct: 462 LFEALLSVLSKPYEDQPGFVAY-REPP 487
>gi|284991852|ref|YP_003410406.1| hypothetical protein Gobs_3434 [Geodermatophilus obscurus DSM
43160]
gi|284065097|gb|ADB76035.1| protein of unknown function UPF0061 [Geodermatophilus obscurus DSM
43160]
Length = 512
Score = 311 bits (797), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 205/565 (36%), Positives = 301/565 (53%), Gaps = 81/565 (14%)
Query: 76 LKNQRLDTETETDGGDESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTK 135
L + R T G + + +++D F RELP ++P +
Sbjct: 5 LAHHRPAVHGNTSGTGRAVHRVSVAPAPTVSFDDRFARELP----EMAVPWQ-------- 52
Query: 136 VSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQF 195
+ E +P+L+ ++++A L LDP RPD G GA P AQ Y GHQF
Sbjct: 53 ---ADEAPDPRLLVLNDALATELGLDPGALRRPDGVRLLVGTAVPDGAKPVAQAYAGHQF 109
Query: 196 GMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEA 255
G + +LGDGRA+ LGE+ +++ +L LKG+G+TP+SR DGLA + +RE++ SEA
Sbjct: 110 GGFVPRLGDGRALLLGELTDVEGRLRDLHLKGSGRTPFSRGGDGLAAVGPMLREYVVSEA 169
Query: 256 MHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQ 315
MH LGIPTTR+L +V TG+ V R+ PGA++ RVA S LR GS+Q +R
Sbjct: 170 MHALGIPTTRSLAVVATGRPVRRETLL-------PGAVLARVASSHLRVGSFQY--ARAT 220
Query: 316 EDLDIVRTLADYAI-RHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAER 374
D+D++R LAD+AI RHH +T D + + L AA
Sbjct: 221 GDVDLLRRLADHAIARHH--------------PATADAEQPYLALFEAVVAA-------- 258
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
ASLVA+W VGF HGV+NTDN +I G TIDYGP FLDA+DP+ ++ D+ G RY +
Sbjct: 259 QASLVARWMLVGFVHGVMNTDNTTISGETIDYGPCAFLDAYDPATVYSSIDI-GGRYAYG 317
Query: 435 NQPDIGLWNIAQFSTTLAAAKLIDDKE-----ANYVMERYGTKFMDEYQAIMTKKLGLPK 489
NQP + WN+A+F+ TL DD+E A +ER+ ++ + A M KLGLP
Sbjct: 318 NQPIVAEWNLARFAETL-LPLFSDDQEQAVALAVEALERFRPQYNAAWSAGMRAKLGLPD 376
Query: 490 -YNKQIISKLLNN----MAVDKVDYTNFFRAL-SNVKADPSIPEDELLVPLKAVLLDIGK 543
+ ++ + L+ + M VD T+F RAL + + D P + V++D+
Sbjct: 377 GLDDEVATALVEDLHALMQESHVDLTSFSRALGAAARGDAE--------PARLVVMDLA- 427
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
R +AW+ E + D E LM+ NP Y+ RN+L + A+ AA GD +
Sbjct: 428 -RFDAWL--------ERWRALGPDAE---LMDRTNPVYIPRNHLVEEALTAATDGDLAPL 475
Query: 604 RRLLKLMERPYDEQPGMEKYARLPP 628
+RLL+++ PY+E+PG+E+YA P
Sbjct: 476 QRLLEVLAGPYEERPGLERYAAPAP 500
>gi|429331614|ref|ZP_19212367.1| hypothetical protein CSV86_07511 [Pseudomonas putida CSV86]
gi|428763775|gb|EKX85937.1| hypothetical protein CSV86_07511 [Pseudomonas putida CSV86]
Length = 486
Score = 311 bits (796), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 206/549 (37%), Positives = 298/549 (54%), Gaps = 67/549 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L++L +D+ F R GD A T+V P +++P+LV SE+ L
Sbjct: 1 MKGLDELTFDNRFAR--LGD------------AFSTQVLPEP-IDDPRLVVVSEAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+L+P E P F F G + A P A Y GHQFG + +LGDGR + LGE+ N
Sbjct: 46 DLEPTEAHSPVFAELFGGHKLWSEADPRAMVYSGHQFGSYNPRLGDGRGLLLGEVRNDAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+ W+L LKGAG+TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 QHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSNTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRHIE 337
+ +E A++ R+A S +RFG ++ + +R E R LA++ + H+ +
Sbjct: 166 E-------TKESAAMLLRLAPSHIRFGHFEYFYYTRQPEQ---QRQLAEHVLDLHYPECK 215
Query: 338 NMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
DE Y A + ER A L+ +WQ GF HGV+NTDNM
Sbjct: 216 -----------AADE----------PYLAMFRSIVERNAELIGKWQAYGFCHGVMNTDNM 254
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLI 457
SILG+T D+GPF FLD FD F N +D G RY ++NQ I WN++ + L I
Sbjct: 255 SILGITFDFGPFAFLDDFDAGFICNHSDDQG-RYSYSNQVPIAHWNLSALAQAL--TPFI 311
Query: 458 DDKEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFR 514
+ + + + Y +M ++LGL + +K +I +LL+ M VDY+ FFR
Sbjct: 312 SVEALQEALGLFLPLYEAHYLDLMRRRLGLTTAEEGDKVLIQRLLSLMQPGAVDYSLFFR 371
Query: 515 ALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALM 574
L + P ++ L +++ +D+ + +W Y+ + + + R+ M
Sbjct: 372 KLGDQ------PVEQALGVVRSDFVDLA-----GFDNWSQDYLARVQREPGNADGRRERM 420
Query: 575 NSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
++VNP YVLRNYL Q AI+AA+ GD+ EVRRL ++ RP++EQPGME YA PP W
Sbjct: 421 HAVNPLYVLRNYLAQRAIEAAQSGDYSEVRRLHAVLARPFEEQPGMEAYAERPPEWGKH- 479
Query: 635 GVCMLSCSS 643
+SCSS
Sbjct: 480 --LEISCSS 486
>gi|410907992|ref|XP_003967475.1| PREDICTED: LOW QUALITY PROTEIN: selenoprotein O-like [Takifugu
rubripes]
Length = 666
Score = 311 bits (796), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 197/454 (43%), Positives = 258/454 (56%), Gaps = 46/454 (10%)
Query: 91 DESKMTKKLKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAW 150
D+ ++ +LE LN+D+ +++LP DP D R+V AC+++V P + P+ VA
Sbjct: 2 DDMGISVSRSSLERLNFDNVALKKLPLDPSEDPGVRQVKGACFSRVKPQP-LTKPRFVAV 60
Query: 151 SESVADSLELDPKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAIT 209
S + L L E P P + SG+ + G+ P A CY GHQFG +AGQLGDG A
Sbjct: 61 SYKALELLGLVGDEVINDPLGPEYLSGSKIMPGSEPAAHCYCGHQFGQFAGQLGDGAACY 120
Query: 210 LGEI-----------LNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHF 258
LGE+ S RWE+Q+KGAG TPYSR ADG VLRSSIREFLCSEAM F
Sbjct: 121 LGEVKVPPDQDPELLRENPSSRWEIQVKGAGLTPYSRQADGRKVLRSSIREFLCSEAMFF 180
Query: 259 LGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS------ 312
LGIPTTRA +VT+ V RD++Y G+P+ E ++V R+A +FLRFGS++I S
Sbjct: 181 LGIPTTRAGSVVTSDSSVVRDVYYSGHPRHEKCSVVLRIAPTFLRFGSFEIFKSPDEYTG 240
Query: 313 -RGQE-DLDIVR-TLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAV 369
RG LD +R + DY I + I+ +F E + A+
Sbjct: 241 RRGPSCGLDEIRGQMIDYVIEMFYPEIQQ-------NFPDRME----------RNVAFFR 283
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
EV RTA LVAQWQ VGF HGVLNTDNMSILGLT+DYGP+GF+D FDP F + +D G
Sbjct: 284 EVMVRTARLVAQWQCVGFCHGVLNTDNMSILGLTLDYGPYGFMDRFDPDFICSASDNSG- 342
Query: 430 RYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPK 489
RY + QPDI WN+ + + LA D EA V++ Y + Y M KLGL K
Sbjct: 343 RYSYQAQPDICRWNLVKLAEALAPELPPDRAEA--VLDEYLALYNGFYLQNMRNKLGLLK 400
Query: 490 Y----NKQIISKLLNNMAVDKVDYTNFFRALSNV 519
++ ++S LL M D+TN FR LS +
Sbjct: 401 KEEPEDEILMSDLLQTMHSTGADFTNTFRCLSQI 434
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/106 (42%), Positives = 64/106 (60%), Gaps = 17/106 (16%)
Query: 538 LLDIGKE-----RKEAWISWVLSYIQELL--SSGISD-----EERKALMNSVNPKYVLRN 585
L++I +E + E W W++ Y + L G SD EER +M NP+ +LRN
Sbjct: 515 LMEISQEALKSKQAEDWRGWIVRYRKRLALEMEGQSDAQAVQEERLRVMEGTNPRVILRN 574
Query: 586 YLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
Y+ Q+AI+AAE GDF EV+R+LK++E+PY QPG+E PAW
Sbjct: 575 YIAQNAIEAAENGDFSEVQRVLKVLEKPYCSQPGLEF-----PAWV 615
>gi|424888115|ref|ZP_18311718.1| hypothetical protein Rleg10DRAFT_2169 [Rhizobium leguminosarum bv.
trifolii WSM2012]
gi|393173664|gb|EJC73708.1| hypothetical protein Rleg10DRAFT_2169 [Rhizobium leguminosarum bv.
trifolii WSM2012]
Length = 500
Score = 311 bits (796), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 196/496 (39%), Positives = 265/496 (53%), Gaps = 54/496 (10%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
V P L+ +E +A L LD R D FSG GA P A Y GHQFG ++ Q
Sbjct: 36 VAEPWLIKLNEPLAAELGLDVAALRR-DGAAIFSGNLVPEGAEPLAMAYAGHQFGGFSPQ 94
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGRAI LGE+++ R+++QLKGAG TP+SR DG A + +RE++ SEAM LGI
Sbjct: 95 LGDGRAILLGEVVDRSGRRFDIQLKGAGPTPFSRRGDGRAAIGPVLREYIVSEAMFALGI 154
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
P TRAL VTTG+ V R+ PGA+ RVA S +R G++Q A+RG D D V
Sbjct: 155 PATRALAAVTTGEPVYREEVL-------PGAVFTRVAASHIRVGTFQFFAARG--DTDGV 205
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
R LADY I H+ ++ + N Y A V+ER ASL+A+
Sbjct: 206 RALADYVIDRHYPALKEAD---------------------NPYLALFSAVSERQASLIAR 244
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
W VGF HGV+NTDNM++ G TID+GP F+DA+DP+ ++ D G RY +ANQP IG
Sbjct: 245 WLHVGFIHGVMNTDNMTVSGETIDFGPCAFVDAYDPATVFSSIDQHG-RYAYANQPGIGQ 303
Query: 442 WNIAQFSTTLAAAKLIDDK------EANYVMERYGTKFMDEYQAIMTKKLGLPKYNK--- 492
WN+A+ TL LID++ +AN V+ YG +F + A M K+GL
Sbjct: 304 WNLARLGETL--LPLIDEEPDGAVDKANGVIRSYGERFQTHWLAGMLGKIGLAGEEDGDL 361
Query: 493 QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISW 552
+++ LL+ M D+T FR LS++ DE P A EA W
Sbjct: 362 ELVQALLSLMQAQGADFTLTFRRLSDLAG------DETAEPSFAASF----REPEACAPW 411
Query: 553 VLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEVRRLLKLME 611
+ + L + ER M SVNP ++ RN+ + AI+AA E GDF LL ++
Sbjct: 412 LAQWHGRLSRDPQTAAERSMAMRSVNPAFIPRNHRIEQAIEAAVENGDFSLFEALLTVLA 471
Query: 612 RPYDEQPGMEKYARLP 627
+PY++QPG Y P
Sbjct: 472 KPYEDQPGFAAYMEPP 487
>gi|148976461|ref|ZP_01813167.1| hypothetical protein VSWAT3_01588 [Vibrionales bacterium SWAT-3]
gi|145964284|gb|EDK29540.1| hypothetical protein VSWAT3_01588 [Vibrionales bacterium SWAT-3]
Length = 485
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 200/528 (37%), Positives = 279/528 (52%), Gaps = 58/528 (10%)
Query: 120 RTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATP 179
R ++PR YT + P+ + N Q +AW++S+A L E + SG
Sbjct: 12 RFTALPR----LFYTPIQPTP-LSNVQWLAWNQSLATELGFPSFESASEELLDTLSGNVE 66
Query: 180 LAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADG 239
P A Y GHQFG + LGDGR + L +++ E ++L LKGAGKTPYSR DG
Sbjct: 67 PEQFSPLAMKYAGHQFGAYNPDLGDGRGLLLAQVVAKSGETFDLHLKGAGKTPYSRMGDG 126
Query: 240 LAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQ 299
AV+RS++RE+LCSEAM L IPTTRAL ++T+ V R+ K+E GA++ R A+
Sbjct: 127 RAVIRSTVREYLCSEAMAGLNIPTTRALAMMTSDTPVYRE-------KQEWGALLVRAAE 179
Query: 300 SFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDL 359
S +RFG ++ Q L + LAD I HF E L DE+
Sbjct: 180 SHIRFGHFEHLFYTNQ--LVEHKLLADKVIEWHF--------PECL-----DEE------ 218
Query: 360 TSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSF 419
YAA ++ +RTA ++A WQ GF HGV+NTDNMSI+G T DYGPF FLD +DP
Sbjct: 219 --KPYAAMFNQIVDRTAEMIALWQANGFAHGVMNTDNMSIIGQTFDYGPFAFLDEYDPRL 276
Query: 420 TPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQA 479
N +D G RY F QP IG+WN++ + +L + L++ + +E+Y + +
Sbjct: 277 ICNHSDYQG-RYAFNQQPRIGMWNLSALAHSL--SPLVERADLEAALEQYEPQMNGYFSQ 333
Query: 480 IMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA 536
+M +KLGL + + ++ + M+ +KVDY FFR LSN+ P+ +L++ A
Sbjct: 334 LMRRKLGLLSKQEGDSRLFESMFELMSQNKVDYPRFFRTLSNLDTLPAQEVIDLVIDRDA 393
Query: 537 VLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
L W+ +Y Q S ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 394 AKL------------WLDNYFQRCELEESSATERCEKMRQVNPKYILRNYLAQLAIEKAE 441
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
GD +V L+ ++ PY E E A LPP W G M +SCSS
Sbjct: 442 RGDSSDVDALMVVLADPYAEHSDYEYLAALPPEW----GKGMEISCSS 485
>gi|241203720|ref|YP_002974816.1| hypothetical protein Rleg_0982 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240857610|gb|ACS55277.1| protein of unknown function UPF0061 [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 500
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 196/506 (38%), Positives = 275/506 (54%), Gaps = 56/506 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +P+A V P L+ +E++A L LD + R D FSG GA P A Y G
Sbjct: 28 FAAQAPTA-VAEPWLIKLNEALAAELGLDVEALRR-DGAAIFSGNLVPEGAEPLAMAYAG 85
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG ++ QLGDGRAI LGE++ +R+++QLKGAG TP+SR DG A + +RE++
Sbjct: 86 HQFGGFSPQLGDGRAILLGEVVGRSGKRYDIQLKGAGPTPFSRRGDGRAAIGPVLREYII 145
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGIP TRAL VTTG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 146 SEAMFALGIPATRALAAVTTGEPVYREEVL-------PGAVFTRVAASHVRVGTFQYFAA 198
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D D VR LADY I H+ ++ N Y A V+
Sbjct: 199 RG--DTDGVRALADYVIDRHYPALKE---------------------AENPYLALFEAVS 235
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ER A+L+A+W VGF HGV+NTDNM++ G TID+GP F+DA+DP+ ++ D G RY
Sbjct: 236 ERQAALIARWLHVGFIHGVMNTDNMTVSGETIDFGPCAFMDAYDPATVFSSIDQHG-RYA 294
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDK------EANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ TL LID + +AN V++ YG +F + A M +K+G
Sbjct: 295 YANQPGIGQWNLARLGETL--LPLIDAEPDGAVDKANIVIKSYGERFQAHWLAGMREKIG 352
Query: 487 LPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L ++ LL+ M D+T FR LS++ D + E E +
Sbjct: 353 LAGEEDGDLDLVQALLSLMQAQGADFTLTFRRLSDLAGDDAA-EPEFAASFR-------- 403
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGE 602
+A +W+ + + L + ER M VNP ++ RN+ + AI+AA E GDF
Sbjct: 404 -EPDARGAWLTQWRERLSRDPQTATERAIAMRRVNPAFIPRNHRVEQAIEAAVENGDFSL 462
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPP 628
LL ++ +PY++QPG Y R PP
Sbjct: 463 FEALLSVLSKPYEDQPGFVAY-REPP 487
>gi|349575194|ref|ZP_08887115.1| SelO family protein [Neisseria shayeganii 871]
gi|348013202|gb|EGY52125.1| SelO family protein [Neisseria shayeganii 871]
Length = 486
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 200/516 (38%), Positives = 269/516 (52%), Gaps = 50/516 (9%)
Query: 132 CYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYG 191
C T V P A + P+L +S +A L + F + D SG+ P A Y
Sbjct: 17 CET-VRPEA-LSRPELPVFSSELAAELGIPDSVFVQADTVAQLSGSAAHYDPAPTATVYS 74
Query: 192 GHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFL 251
GHQFG++ QLGDGRA+ LG+++ RWE+QLKG+GKTP+SRFADG AVLRS+IRE+L
Sbjct: 75 GHQFGVYVPQLGDGRAMLLGDLVAPDGSRWEIQLKGSGKTPFSRFADGRAVLRSTIREYL 134
Query: 252 CSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHA 311
SEAMH LGIPTTRAL + + V R+ + E A++ R A SFLRFG ++
Sbjct: 135 ASEAMHALGIPTTRALAITVSPDPVYRE-------QPETAAVLTRAAPSFLRFGHFEYFY 187
Query: 312 SRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEV 371
R Q + LADY I H+ N + A V
Sbjct: 188 HRRQH--QHLAPLADYLIAEHYPECRA---------------------AENPHLALFEAV 224
Query: 372 AERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRY 431
RTA+L+AQWQ VGF HGV+NTDNMS+LGLTIDYGP+GFLD F+ N +D G RY
Sbjct: 225 TRRTAALIAQWQAVGFCHGVMNTDNMSLLGLTIDYGPYGFLDGFNRHHVCNHSD-AGGRY 283
Query: 432 CFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY- 490
+ QP + WN+ + + A L + E V+E + + Y M +KLGL
Sbjct: 284 AYKEQPYVAQWNLLKLGS--AFLPLAAEAELIAVIESFVGHYQTGYLNAMRQKLGLSHSQ 341
Query: 491 --NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEA 548
+ +++ LL+ + + DYT FFR L+ + + P + L+ L E
Sbjct: 342 PDDAELVHDLLDVLQQAEADYTLFFRRLAEMPTEHQAPLPDSLLRLFP--------HAER 393
Query: 549 WISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAI-DAAELGDFGEVRRLL 607
I W Y + L + ERK M++VNP YV RNYL + AI A + GDF VRRL
Sbjct: 394 LIHWSGRYKRRLRQENLPPAERKRQMDAVNPLYVPRNYLLEQAIAQARDHGDFDGVRRLQ 453
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ P+ E+ A PP WA +C +SCSS
Sbjct: 454 ACWQDPFTERAEYADLADTPPDWA--ADIC-ISCSS 486
>gi|384047815|ref|YP_005495832.1| Luciferase family protein [Bacillus megaterium WSH-002]
gi|345445506|gb|AEN90523.1| Luciferase family protein [Bacillus megaterium WSH-002]
Length = 486
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 197/515 (38%), Positives = 286/515 (55%), Gaps = 59/515 (11%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPY 186
E+ + +T + P+ V +P++V +++S+A SL L ++ + + +G + GA P
Sbjct: 17 ELPNIFFTPLDPNP-VSSPKIVKFNDSLAASLGLQKEQLQSQEGVSILAGNSVPKGAFPL 75
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
AQ YGGHQFG + LGDGRA+ +GE + E+ +LQLKG+G+TPYSR DG A L
Sbjct: 76 AQAYGGHQFGHF-NMLGDGRAMLIGEQVTPSGEKVDLQLKGSGRTPYSRGGDGRAALGPM 134
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
+RE++ SEAMH LGIPTTR+L +V TG+ + R+ KE PGAI+ RVA S LRFG+
Sbjct: 135 LREYIISEAMHALGIPTTRSLAVVITGESIVRE-------KELPGAILTRVASSHLRFGT 187
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
+Q A G ++ ++ LADYA+ HF HIE K KY +
Sbjct: 188 FQFAAKWG--TVENLQALADYALERHFSHIEKNEK---------------------KYLS 224
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
EV +R A+LVA+WQ +GF HGV+NTDNM+I G TIDYGP F+D +DP ++ D+
Sbjct: 225 LLQEVIKRHATLVAKWQLIGFIHGVMNTDNMTISGETIDYGPCAFMDTYDPETVFSSIDV 284
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQ----AIMT 482
G RY + NQP I WN+A+F+ L D ++A + + T+F Y+ A M
Sbjct: 285 QG-RYAYQNQPGITGWNLARFAEALLPLLDQDIEKAVEIAQSAVTEFPKFYRENWLAGMQ 343
Query: 483 KKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL 539
KLGL K ++ + +LL M K DYTN FRAL+ K S +L
Sbjct: 344 AKLGLFNEEKEDEALFQELLTIMKTYKADYTNTFRALTFDKLGNS----DLF-------- 391
Query: 540 DIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGD 599
E + W + + L S E + LM + NP + RN+ + A+DAA+ GD
Sbjct: 392 -----ESEEFAQWQELWQKRLGRQQQSKAESQELMKNNNPAVIPRNHRVEEALDAAQKGD 446
Query: 600 FGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRP 634
+ + LL+++ PY E PG +Y +PPA + +P
Sbjct: 447 YSVMETLLQVLSSPY-ESPGQSEYC-VPPAPSNQP 479
>gi|333898683|ref|YP_004472556.1| hypothetical protein Psefu_0480 [Pseudomonas fulva 12-X]
gi|333113948|gb|AEF20462.1| UPF0061 protein ydiU [Pseudomonas fulva 12-X]
Length = 487
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 202/507 (39%), Positives = 281/507 (55%), Gaps = 53/507 (10%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
+ P+LV S+S L+LDP+E +R F FSG + A P A Y GHQFG ++ +
Sbjct: 29 IAEPRLVVVSDSAMALLDLDPREAQREVFAELFSGNQLWSDAEPRAMVYSGHQFGGYSPR 88
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGR + LGE+LN E W+L LKGAG TPYSR DG AVLRSSIREFL SEA+H LGI
Sbjct: 89 LGDGRGLLLGEVLNDAGEHWDLHLKGAGMTPYSRMGDGRAVLRSSIREFLASEALHALGI 148
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDI 320
P++RALC+ + V R+ ++E A++ R+A S +RFG ++ + +R E L
Sbjct: 149 PSSRALCVTGSSTPVWRE-------RQETAAMLVRLAPSHIRFGHFEYFYYTRQHEQL-- 199
Query: 321 VRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVA 380
+ LADY I HH+ +AA V ERTA ++A
Sbjct: 200 -KQLADYVIEHHY---------------------PACLEQPQPHAALLKAVLERTAEMIA 237
Query: 381 QWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIG 440
WQ GF HGV+NTDNMSILG+T D+GP+ FLD FD N +D G RY F+NQ I
Sbjct: 238 WWQAYGFCHGVMNTDNMSILGITFDFGPYAFLDDFDAKHICNHSDDTG-RYSFSNQVPIA 296
Query: 441 LWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISK 497
WN++ + L L++ ++ + + Y +M K+LG ++++I +
Sbjct: 297 HWNLSALAQAL--TPLVEIDTLRETLDLFLPIYQAHYHDLMRKRLGFTTAEDGDEELIQR 354
Query: 498 LLNNMAVDK-VDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSY 556
LL M K DY+ FFR L + P + L V ++ +D+ + +W Y
Sbjct: 355 LLTLMQAGKATDYSLFFRHLGD-----QAPSEALKV-VRNDFVDL-----TGFDAWAADY 403
Query: 557 IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDE 616
+ G+ ER+A M++VNP YVLRNYL Q AI AAE GD+G VR L +++ RP++E
Sbjct: 404 QARVEREGLEQSERQARMHAVNPLYVLRNYLAQEAIAAAEQGDYGPVRELHQVLTRPFEE 463
Query: 617 QPGMEKYARLPPAWAYRPGVCMLSCSS 643
QPG + YA+ PP W +SCSS
Sbjct: 464 QPGKQHYAQRPPDWGKH---LEISCSS 487
>gi|94263788|ref|ZP_01287594.1| Protein of unknown function UPF0061 [delta proteobacterium MLMS-1]
gi|93455799|gb|EAT05966.1| Protein of unknown function UPF0061 [delta proteobacterium MLMS-1]
Length = 517
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 196/513 (38%), Positives = 279/513 (54%), Gaps = 41/513 (7%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L A + + V P+L+ + ++A L L + + + F+G AGA P A
Sbjct: 22 LPAAFYRFCNPTPVAAPRLLKLNAALAGELGLQLEGLDEQELAEIFAGNRLPAGAQPLAM 81
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG QLGDGRAI LGE+L+ +S RW++QLKGAGKTP+SR DG A L IR
Sbjct: 82 AYAGHQFGSLVPQLGDGRAILLGEVLDGQSRRWDIQLKGAGKTPFSRGGDGRAPLGPVIR 141
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E+L SEAMH LGIPTTRAL V++G+ V R+ PGA++ RVA S +R G+++
Sbjct: 142 EYLVSEAMHALGIPTTRALAAVSSGEQVRRERLL-------PGAVITRVAASHIRVGTFE 194
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIEN--MNKSESLSFS-TGDEDHSVVDLTSNKYA 365
A RG D +RTLADY I H+ I +N E + +G E H +Y
Sbjct: 195 FFARRG--DFASLRTLADYVIPRHYSEINGPEINGPEIIGPEISGAEGH-------RRYL 245
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
A V R A LVAQW +GF HGV+NTDN +I G TIDYGP FLD + P + D
Sbjct: 246 ALLAAVIARQAELVAQWMSIGFIHGVMNTDNTTISGETIDYGPCAFLDHYHPETVFSAID 305
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-----ANYVMERYGTKFMDEYQAI 480
G RY + QP I WN+A+F+ +L L DD+E A +++ + ++ +
Sbjct: 306 T-GGRYAYHMQPRIAQWNLARFAESLLPL-LHDDQEQAIALATALLQDFMPRYEKAWLTR 363
Query: 481 MTKKLGL--PKY-NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAV 537
M K+GL P+ ++++I LL MA ++VD+T FFR L+N +P+ E + + PL
Sbjct: 364 MGNKIGLTAPQPDDRKLIEGLLAAMADNEVDFTLFFRRLANAVENPT--EADGIRPL--- 418
Query: 538 LLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAID-AAE 596
R E W W + + L + + ER M SVNP + RN+ + AI A E
Sbjct: 419 -----FNRPETWEHWAEGWHKRLAADPLPPAERAKRMRSVNPAIIPRNHRIEQAISKATE 473
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
DF + +L + + P+++ P +++ PPA
Sbjct: 474 AADFSDFTKLNQALNHPWEDNPERDRWL-APPA 505
>gi|380495958|emb|CCF31998.1| hypothetical protein CH063_00739 [Colletotrichum higginsianum]
Length = 636
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 209/557 (37%), Positives = 290/557 (52%), Gaps = 70/557 (12%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSG-- 176
PR PR V +A +T V P E+P+L+A S + + + + + +F +G
Sbjct: 52 PRDQIAPRGVRNAAFTWVRPET-AEDPELLAVSPAAMRDIGIQEGDEKTEEFRQTVAGNR 110
Query: 177 -----ATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE-RWELQLKGAGK 230
L G P+AQCYGG QFG WAGQLGDGRAI+L E N + R+ELQLKGAG
Sbjct: 111 LHGWDEEKLEGGYPWAQCYGGFQFGQWAGQLGDGRAISLFETRNPDTNVRYELQLKGAGM 170
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEP 290
TPYSRFADG AVLRSSIREF+ SEA+H L IP+TRAL L K R EP
Sbjct: 171 TPYSRFADGKAVLRSSIREFVVSEALHALKIPSTRALSLTLLPKSKVR------RETVEP 224
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHF-------RHIENMNK-S 342
GAIV R AQS++R G++ + +RG D ++RTLA Y +EN +K
Sbjct: 225 GAIVLRFAQSWIRLGNFDLPRARG--DRAMIRTLATYVAEDVLGGWETLPARLENPDKPG 282
Query: 343 ESLSFSTGDEDHSVV---DLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSI 399
E L + G V D N++ EVA R A VA+WQ GF +GVLNTDN SI
Sbjct: 283 ECLEPARGVPATDVQGPEDSAENRFTRLFREVARRNALTVAKWQAYGFMNGVLNTDNTSI 342
Query: 400 LGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL-----AAA 454
+GL+ID+GPF F+D FDP++TPN D RY + NQP I WN+ +F L A A
Sbjct: 343 MGLSIDFGPFAFMDNFDPAYTPNHDDHL-LRYSYRNQPTIIWWNLVRFGEALGELLGAGA 401
Query: 455 KLIDD--------------------KEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK-- 492
+ +D K V E Y FM EY+ +MT++LGL + +
Sbjct: 402 GVDEDAFVKNRVEESESETLIGRAEKLIMQVGEEYKAVFMAEYKRLMTQRLGLKNFKESD 461
Query: 493 --QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLL---------DI 541
++ S LL+ M ++D+ +FFR LS+VK I ++E A +
Sbjct: 462 FDELFSNLLDTMETHELDFNHFFRRLSSVKLS-DIADEEARRETAARFFHAEGAAGGENK 520
Query: 542 GKERKEAWIS-WVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAEL-GD 599
G+ AW+ W +++ D ER+ M +VNP +V R ++ I E G+
Sbjct: 521 GRADIGAWLGKWRARAVEDWGEGASQDVEREKAMKAVNPNFVPRGWVLDEIIKRVEKDGE 580
Query: 600 FGEVRRLLKLMERPYDE 616
+RR++++ P+++
Sbjct: 581 RDVLRRVMQMALYPFED 597
>gi|378728850|gb|EHY55309.1| hypothetical protein HMPREF1120_03451 [Exophiala dermatitidis
NIH/UT8656]
Length = 651
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 208/563 (36%), Positives = 284/563 (50%), Gaps = 79/563 (14%)
Query: 102 LEDLNWDHSFVRELPGDP------------RTDSIPREVLHACYTKVSPSAEVENPQLVA 149
L D+ ++F LP DP R PR V A YT V P E+P+L+A
Sbjct: 50 LADIPKSNNFTSHLPPDPQFPTPIDSHRAPRQKLGPRMVRGALYTYVRPEP-TEDPELLA 108
Query: 150 WSESVADSLELDPKEFERPDFPLFFSGAT-----PLAGAVPYAQCYGGHQFGMWAGQLGD 204
S + + L E + SG G P+AQCYGG QFG WAGQLGD
Sbjct: 109 VSNAALRDIGLAESEASSEELKQVVSGNKFYWDEEKGGIYPWAQCYGGFQFGQWAGQLGD 168
Query: 205 GRAITLGEILNLKSE-RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPT 263
GRAI+L E N +++ R+E+QLKGAGKTPYSRFADG AVLRSSIREF+ SE ++ +GIPT
Sbjct: 169 GRAISLFETTNPQTKVRYEIQLKGAGKTPYSRFADGKAVLRSSIREFVVSEYLNAIGIPT 228
Query: 264 TRALCLVTTGKF-VTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVR 322
TRAL L K V R+ EPGAIVCR+AQS+LR G++ + SRG D D++R
Sbjct: 229 TRALSLTLCPKSQVVRERL-------EPGAIVCRIAQSWLRLGTFDLMRSRG--DRDLIR 279
Query: 323 TLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVD--------LTSNKYAAWAVEVAER 374
A Y F E + + D + V N++ E+ R
Sbjct: 280 QTATYVAEEVFGGWETLPAALPADTPNADPERGVSKDEIQGKEGAEENRFTRLYREIVRR 339
Query: 375 TASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFA 434
A +V WQ GF +GVLNTDN SI GL++DYGPF F+D FDPS+TPN D RY +
Sbjct: 340 NAKVVGMWQAYGFMNGVLNTDNTSIYGLSMDYGPFAFMDNFDPSYTPNHDDYM-LRYSYR 398
Query: 435 NQPDIGLWNIAQFSTTL----AAAKLIDD-----------------KEANYVM----ERY 469
QP I WN+ + L A +D+ K A ++ E Y
Sbjct: 399 AQPSIIWWNLVRLGEALGELIGAGDRVDNDVFVEKGVEEDFAPVLIKRAETIIDQIGEEY 458
Query: 470 GTKFMDEYQAIMTKKLGLPKYN----KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSI 525
F+ EY+ +M+ +LGL ++ S+LL+ M ++D+ +FFR LS VK D
Sbjct: 459 KAVFLSEYRRLMSLRLGLKTQKDSDFDKLFSELLDTMEALELDFNHFFRRLSTVKLDDVS 518
Query: 526 PED------ELLVPLKAV-----LLDIGKERKEAWI-SWVLSYIQELLSSGISDEERKAL 573
++ E + V + G+ER W+ SW +++ S +D+ER+
Sbjct: 519 TKEGREQTAECFFHREGVTGLNETNESGRERVGKWLDSWRERIVEDWGSEPSADQEREKA 578
Query: 574 MNSVNPKYVLRNYLCQSAIDAAE 596
M +VNP +V R +L ID +
Sbjct: 579 MKAVNPNFVPRGWLLDDIIDRVQ 601
>gi|424894202|ref|ZP_18317776.1| hypothetical protein Rleg4DRAFT_0035 [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|393178429|gb|EJC78468.1| hypothetical protein Rleg4DRAFT_0035 [Rhizobium leguminosarum bv.
trifolii WSM2297]
Length = 500
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 197/506 (38%), Positives = 273/506 (53%), Gaps = 56/506 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y +P+ V P L+ +E +A L LD + R D FSG GA P A Y G
Sbjct: 28 YAGQAPT-PVAEPWLIKLNEPLAAELGLDVEALRR-DGAAIFSGNLVPEGAEPLAMAYAG 85
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG ++ QLGDGRAI LGE+++ +R+++QLKGAG TP+SR DG A + +RE++
Sbjct: 86 HQFGGFSPQLGDGRAILLGEVVDSSGKRFDIQLKGAGPTPFSRRGDGRAAIGPVLREYIV 145
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGIP TRAL VTTG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 146 SEAMFALGIPATRALAAVTTGEPVYREEVL-------PGAVFTRVAASHVRVGTFQFFAA 198
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D D VR LADY I H+ ++ + N Y A ++
Sbjct: 199 RG--DTDGVRALADYVIDRHYPELKAAD---------------------NPYLALFEAIS 235
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ER A+L+A+W VGF HGV+NTDNM++ G TID+GP F+DA+DP+ ++ D G RY
Sbjct: 236 ERQAALIARWLHVGFIHGVMNTDNMTVSGETIDFGPCAFVDAYDPATVFSSIDQHG-RYA 294
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDK------EANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ TL LID++ +AN V+ YG +F + A M K+G
Sbjct: 295 YANQPGIGQWNLAKLGETL--LPLIDEEPDGAVDKANAVIRAYGERFQAHWLAGMLGKIG 352
Query: 487 LPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L +++ LL+ M D+T FR LS++ D + A D
Sbjct: 353 LAGEEDGDLELVQALLSLMQAQGADFTLTFRRLSDLAGDETAEPS-----FAASFRD--- 404
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGE 602
EA W+ + + L + ER M SVNP ++ RN+ + AI AA E GDF
Sbjct: 405 --PEACGPWLTQWRERLSRDPQTAAERAIAMRSVNPAFIPRNHRIEQAIGAAVEDGDFSL 462
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPP 628
LL ++ +PY++QPG Y R PP
Sbjct: 463 FEALLTVLAKPYEDQPGFAAY-REPP 487
>gi|374704764|ref|ZP_09711634.1| hypothetical protein PseS9_15611 [Pseudomonas sp. S9]
Length = 486
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 202/548 (36%), Positives = 292/548 (53%), Gaps = 65/548 (11%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L +L +D+ F R GD A T V P + P+LV S++ + L
Sbjct: 1 MKQLSELTFDNRFAR--LGD------------AFSTHVLPEP-IAEPRLVVASQAAMELL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP+E F+G + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 46 DLDPEEANTEVLAQIFAGHKLWSDAEPRAMVYSGHQFGGYTPRLGDGRGLLLGEVVNQAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TPYSR DG AVLRSSIREFL SE +H LGI ++RALC+ + V R
Sbjct: 106 EHWDLHLKGAGATPYSRMGDGRAVLRSSIREFLASEHLHALGIASSRALCVTGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ K+E A+V R+AQS +RFG ++ Q L + LA++ + +HF E
Sbjct: 166 E-------KQETAAMVLRLAQSHIRFGHFEYFYYTQQHKL--LEQLAEHVLHNHF---EA 213
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
+ ++ Y+A ++ ERTA ++A WQ GF HGV+ TDNMS
Sbjct: 214 CLQEQA------------------PYSAMFRQIVERTAEMIAYWQAYGFCHGVMKTDNMS 255
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD N +D G RY F+NQ I WN+A L +D
Sbjct: 256 ILGITFDYGPYAFLDDFDAKHICNHSDDTG-RYSFSNQVPIAQWNLAALGQALTPLAGVD 314
Query: 459 DKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVDYTNFFRA 515
+ A+ +E + + Y +M ++LG ++ ++ +LL M +DY+ FFR
Sbjct: 315 ELSAS--LELFLPLYQSHYLDLMRRRLGFTSAKDDDQALVQELLQLMQNSAIDYSLFFRE 372
Query: 516 LSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMN 575
L +++P L L+ D+ + +W Y+ G S ++R+ M+
Sbjct: 373 LG--ESEPQAA----LARLRDDFTDLA-----GFDAWSQRYMDRDPLQGQSQQQRRERMH 421
Query: 576 SVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPG 635
VNPK++LRNYL Q AI+AAE GD+ VR L +++ P+ EQPG E++A+ PP W
Sbjct: 422 GVNPKFILRNYLAQQAIEAAEKGDYSVVRELHQVLSHPFAEQPGKERFAQRPPDWGKH-- 479
Query: 636 VCMLSCSS 643
+SCSS
Sbjct: 480 -LEISCSS 486
>gi|163854259|ref|YP_001642302.1| hypothetical protein Mext_4863 [Methylobacterium extorquens PA1]
gi|226707622|sp|A9W9J2.1|Y4863_METEP RecName: Full=UPF0061 protein Mext_4863
gi|163665864|gb|ABY33231.1| protein of unknown function UPF0061 [Methylobacterium extorquens
PA1]
Length = 497
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 191/504 (37%), Positives = 269/504 (53%), Gaps = 47/504 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +V+P+A VE P+L+ + ++A L LDP E P+ +G GA P A Y G
Sbjct: 19 FGRVAPTA-VEAPRLIRLNRALAVDLGLDPDRLESPEGVEVLAGQRVPEGAEPLAAAYAG 77
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE++ + R ++QLKG+G TP+SR DG A L +RE+L
Sbjct: 78 HQFGQFVPQLGDGRAILLGEVVG-RDGRRDIQLKGSGPTPFSRRGDGRAALGPVLREYLV 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL VTTG+ V R+ PGA++ RVA S +R GS+Q A+
Sbjct: 137 SEAMHALGIPTTRALAAVTTGEQVIRETAL-------PGAVLTRVASSHIRVGSFQFFAA 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D++ +R LAD+AI H D + + D N Y A V
Sbjct: 190 RG--DVEGLRALADHAIARH------------------DPEAARAD---NPYRALLDGVI 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
R A+LVA+W VGF HGV+NTDNMSI G TIDYGP FLD +DP+ ++ D G RY
Sbjct: 227 RRQAALVARWLTVGFIHGVMNTDNMSIAGETIDYGPCAFLDTYDPATAFSSIDRHG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDK----EANYVMERYGTKFMDEYQAIMTKKLGLP 488
+ NQP I LWN+ + + L D+ EA + + +F Y + +KLGL
Sbjct: 286 YGNQPRIALWNLTRLAEALLPLLSEDETQAVGEAEAALTGFAGQFEAAYHGGLNRKLGLA 345
Query: 489 KY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADP-SIPEDELLVPLKAVLLDIGKE 544
+ + LL MA ++ D+T FR L P P+ + ++++ +D
Sbjct: 346 TTRDGDPALAGDLLKTMAENEADFTLTFRRLGEAVPGPDGEPDPAAVEAVRSLFID---- 401
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEV 603
A+ W + + L R+ +M + NP ++LRN+ + I AA E DF
Sbjct: 402 -PTAYDRWAEGWRRRLKDEAGDAAARRQMMRAANPAFILRNHRVEEMITAAVERQDFAPF 460
Query: 604 RRLLKLMERPYDEQPGMEKYARLP 627
LL ++ RPY++QP +YA P
Sbjct: 461 ETLLTVLARPYEDQPDFARYAEPP 484
>gi|117922273|ref|YP_871465.1| hypothetical protein Shewana3_3841 [Shewanella sp. ANA-3]
gi|166232650|sp|A0L1Z0.1|Y3841_SHESA RecName: Full=UPF0061 protein Shewana3_3841
gi|117614605|gb|ABK50059.1| protein of unknown function UPF0061 [Shewanella sp. ANA-3]
Length = 484
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 196/525 (37%), Positives = 277/525 (52%), Gaps = 69/525 (13%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQCY 190
Y +V P + NP +AWSE A ++L ++P L SG + GA YAQ Y
Sbjct: 15 YAQVYPQG-ISNPHWLAWSEDAAKLIDL-----QQPTDVLLKGLSGNAAVEGASYYAQVY 68
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG + +LGDGR+I LGE L + W++ LKG G TPYSR DG AV+RS++REF
Sbjct: 69 SGHQFGGYTPRLGDGRSIILGEALGPQGA-WDVALKGGGPTPYSRHGDGRAVMRSAVREF 127
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI- 309
L SEA+H LG+PTTRAL ++ + V R+ +E AI R+A+S +RFG ++
Sbjct: 128 LVSEALHHLGVPTTRALAVIGSDMPVWRE-------SQETAAITVRLARSHIRFGHFEFF 180
Query: 310 -HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
H+ RGQ D + L ++ ++ H+ H+ DL Y AW
Sbjct: 181 CHSERGQAD--KLTQLLNFTLKQHYPHLS-------------------CDLAG--YKAWF 217
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++V + TA L+A WQ +GF HGV+NTDNMSILG + D+GPF FLD F F N +D P
Sbjct: 218 LQVVQDTAKLIAHWQAIGFAHGVMNTDNMSILGDSFDFGPFAFLDTFQEYFICNHSD-PE 276
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F QP IGLWN+ + + L DD A + +Y + Y +M KLGL
Sbjct: 277 GRYAFGQQPGIGLWNLQRLAQALTPVIPSDDLIA--ALNQYQHALVQHYLMLMRAKLGLA 334
Query: 489 ----------KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
+ + ++I + M +++DY+N +R + DPS L+
Sbjct: 335 ERADSTAEQDQQDLELIGRFTVLMEKNQLDYSNTWRRFGQL--DPSSAHSS----LRDDF 388
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+D+ + + W +Y Q L E + NSVNPKY+LRNYL Q AI A E G
Sbjct: 389 IDLNE-----FDVWYQAY-QVRLGKVTDVEAWQQARNSVNPKYILRNYLAQEAIIAVEEG 442
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + RL +++ +P+ EQ E A+ PP W G+ M SCSS
Sbjct: 443 NLAPLERLHQVLRQPFAEQVEHEDLAKRPPDWG--QGLIM-SCSS 484
>gi|417949937|ref|ZP_12593066.1| hypothetical protein VISP3789_05089 [Vibrio splendidus ATCC 33789]
gi|342807367|gb|EGU42556.1| hypothetical protein VISP3789_05089 [Vibrio splendidus ATCC 33789]
Length = 485
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 198/528 (37%), Positives = 283/528 (53%), Gaps = 58/528 (10%)
Query: 120 RTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATP 179
R ++PR YT + P+ + N Q ++W+ ++A E + SG
Sbjct: 12 RFTALPR----LFYTPIQPTP-LSNVQWLSWNHNLATEFGFPSFESASEELLDTLSGNVE 66
Query: 180 LAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADG 239
P A Y GHQFG + LGDGR + L +++ E ++L LKGAGKTPYSR DG
Sbjct: 67 PEQFSPLAMKYAGHQFGAYNPDLGDGRGLLLAQVVAKSGETFDLHLKGAGKTPYSRMGDG 126
Query: 240 LAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQ 299
AV+RS++RE+LCSEAM L IPTTRAL ++T+ V R+ K+E GA++ R ++
Sbjct: 127 RAVIRSTVREYLCSEAMAGLNIPTTRALAMMTSDTPVYRE-------KQEWGALLVRASE 179
Query: 300 SFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDL 359
S +RFG ++ Q L + LAD I HF E L DE+
Sbjct: 180 SHIRFGHFEHLFYTNQ--LVEHKLLADKVIEWHF--------PECL-----DEE------ 218
Query: 360 TSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSF 419
YAA ++ +RTA ++A WQ GF HGV+NTDNMSI+G T DYGPF FLD ++P
Sbjct: 219 --KPYAAMFNQIVDRTAEMIALWQANGFAHGVMNTDNMSIIGQTFDYGPFAFLDEYNPRL 276
Query: 420 TPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQA 479
N +D G RY F QP IG+WN++ + +L + L++ + +E+Y + +
Sbjct: 277 ICNHSDYQG-RYAFNQQPRIGMWNLSALAHSL--SPLVERADLEAALEQYEPQMNGYFSQ 333
Query: 480 IMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA 536
+M +KLGL + + ++ + M+ +KVDY FFR LSN+ ++P E
Sbjct: 334 LMRRKLGLLSKQEGDSRLFESMFELMSQNKVDYPRFFRTLSNLD---TVPAQE------- 383
Query: 537 VLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
++D+ +R A + WV +Y+Q S ER M VNPKY+LRNYL Q AI+ AE
Sbjct: 384 -VIDLVIDRDAAKL-WVDNYLQRCELEESSATERCEKMRQVNPKYILRNYLAQLAIEKAE 441
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
GD +V L+ ++ PY E P E A LPP W G M +SCSS
Sbjct: 442 RGDSSDVDALMVVLADPYAEHPDYEYLAALPPEW----GKGMEISCSS 485
>gi|113971973|ref|YP_735766.1| hypothetical protein Shewmr4_3645 [Shewanella sp. MR-4]
gi|121957893|sp|Q0HE08.1|Y3645_SHESM RecName: Full=UPF0061 protein Shewmr4_3645
gi|113886657|gb|ABI40709.1| protein of unknown function UPF0061 [Shewanella sp. MR-4]
Length = 484
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 195/525 (37%), Positives = 279/525 (53%), Gaps = 69/525 (13%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPYAQCY 190
Y++V P + NP +AWSE A ++L ++P L SG + GA YAQ Y
Sbjct: 15 YSQVYPQG-ISNPHWLAWSEDAAKLIDL-----QQPTDALLQGLSGNAAVEGASYYAQVY 68
Query: 191 GGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREF 250
GHQFG + +LGDGR+I LGE L + W++ LKG G TPYSR DG AV+RS++REF
Sbjct: 69 SGHQFGGYTPRLGDGRSIILGEALGPQGA-WDVALKGGGPTPYSRHGDGRAVMRSAVREF 127
Query: 251 LCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQI- 309
L SEA+H LG+PTTRAL ++ + V R+ +E AI R+A+S +RFG ++
Sbjct: 128 LVSEALHHLGVPTTRALAVIGSDMPVWRE-------SQETAAITVRLARSHIRFGHFEFF 180
Query: 310 -HASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWA 368
H+ RGQ D + L ++ ++ H+ ++ DL Y AW
Sbjct: 181 CHSERGQAD--KLTQLLNFTLKQHYPNLS-------------------CDLAG--YKAWF 217
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
++V + TA L+A WQ +GF HGV+NTDNMSILG + D+GPF FLD F F N +D P
Sbjct: 218 LQVVQDTAKLIAHWQAIGFAHGVMNTDNMSILGDSFDFGPFAFLDTFQEDFICNHSD-PE 276
Query: 429 RRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLP 488
RY F QP IGLWN+ + + L DD A + +Y + Y +M KLGL
Sbjct: 277 GRYAFGQQPGIGLWNLQRLAQALTPVIPSDDLIA--ALNQYQHALVQHYLMLMRVKLGLT 334
Query: 489 ----------KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
+ + ++I + M +++DY+N +R + DPS L+
Sbjct: 335 ERADSTAEQDQQDLELIGRFTVLMEKNQLDYSNTWRRFGQL--DPSSAHSS----LRDDF 388
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+D+ + + +W +Y Q L E + NSVNPKY+LRNYL Q AI A E G
Sbjct: 389 IDLNE-----FDAWYQAY-QARLGKVTDIEAWQQARNSVNPKYILRNYLAQEAIIAVEEG 442
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ + RL +++ +P+ EQ E A+ PP W G+ M SCSS
Sbjct: 443 NLAPLERLHQVLRQPFAEQVEHEDLAKRPPDWG--QGLIM-SCSS 484
>gi|425774260|gb|EKV12573.1| hypothetical protein PDIG_43270 [Penicillium digitatum PHI26]
gi|425778539|gb|EKV16663.1| hypothetical protein PDIP_34500 [Penicillium digitatum Pd1]
Length = 578
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 207/567 (36%), Positives = 289/567 (50%), Gaps = 77/567 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGA- 177
PR PR V A +T + P + P+L+ S L L P E + F +G
Sbjct: 3 PRETLGPRMVKGALFTYIRPE-RTDEPELLGVSSQAMKDLGLKPGEEKTSRFKALVAGNE 61
Query: 178 ----TPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE-RWELQLKGAGKTP 232
G P+AQCYGG QFG WAGQLGDGRAI+L E N ++ R+ELQLKGAGKTP
Sbjct: 62 IWWNKEHGGIYPWAQCYGGWQFGSWAGQLGDGRAISLFECTNPQTNMRYELQLKGAGKTP 121
Query: 233 YSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRAL--CLVTTGKFVTRDMFYDGNPKEEP 290
YSRFADG AVLRSSIRE++ SEA+ LGIPTTRAL LV K + + EP
Sbjct: 122 YSRFADGKAVLRSSIREYVVSEALFALGIPTTRALSLTLVPNAKVLRERI--------EP 173
Query: 291 GAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFS-- 348
GAIV R A+S+LR G++ + RG D +++R LA Y F E++ SL
Sbjct: 174 GAIVARFAESWLRIGTFDLLRVRG--DRELIRKLATYVAEDVFSGWESLPAIVSLRDQQS 231
Query: 349 -----------TGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNM 397
TGD+ D+ N++A E+A R A VA WQ GF +GVLNTDN
Sbjct: 232 STQIDNSQRGITGDQVQEHQDVQENRFARLYREIARRNAKTVAAWQAYGFMNGVLNTDNT 291
Query: 398 SILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTL----AA 453
SI GL++DYGPF F+D FDP +TPN D RY + NQP I WN+ + +L A
Sbjct: 292 SIYGLSLDYGPFAFMDNFDPHYTPNHDD-HMLRYAYRNQPSIIWWNLVRLGESLGELIGA 350
Query: 454 AKLIDD-----------------KEANYVMERYGTK----FMDEYQAIMTKKLGLPKYN- 491
+DD K A ++E G F++EY+ +M ++LGL
Sbjct: 351 GNRVDDESFVNDGVTNEFEPELIKRAEKIIEHVGEDFKAVFLNEYKRLMGQRLGLKTQTE 410
Query: 492 ---KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIG------ 542
+ + S+LL+ + ++D+ +FFR LS + ED G
Sbjct: 411 SDFQNLFSELLDTLEALELDFNHFFRRLSGLPLSSLETEDSRREAASVFFHAEGFGGIGY 470
Query: 543 -----KERKEAWI-SWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
++R W+ SW L +++ + +D+ER+ M SVNP +V R ++ ID E
Sbjct: 471 TEATARDRIAQWLDSWRLRILEDWGPA--NDDERRKAMKSVNPNFVPRGWILDEVIDRVE 528
Query: 597 L-GDFGEVRRLLKLMERPYDEQPGMEK 622
GD + R++++ P+ ++ + K
Sbjct: 529 RKGDRDILGRIMQMSLNPFKDEWDLHK 555
>gi|424914935|ref|ZP_18338299.1| hypothetical protein Rleg9DRAFT_2469 [Rhizobium leguminosarum bv.
trifolii WSM597]
gi|392851111|gb|EJB03632.1| hypothetical protein Rleg9DRAFT_2469 [Rhizobium leguminosarum bv.
trifolii WSM597]
Length = 500
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 193/505 (38%), Positives = 271/505 (53%), Gaps = 55/505 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +P+A V P L+ +E +A L LD + R D FSG GA P A Y G
Sbjct: 28 FAAQTPTA-VAEPWLIKLNEPLAVELGLDVETLRR-DGAAIFSGNLVPEGAEPLAMAYAG 85
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG ++ QLGDGRAI LGE+++ R+++QLKGAG TP+SR DG A + +RE++
Sbjct: 86 HQFGGFSPQLGDGRAILLGEVVDRSGRRYDIQLKGAGPTPFSRRGDGRAAIGPVLREYII 145
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGIP TRAL VTTG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 146 SEAMFALGIPATRALAAVTTGEPVYREEVL-------PGAVFTRVAASHIRVGTFQFFAA 198
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D D VR LADY I H+ +++ + N Y + V+
Sbjct: 199 RG--DTDGVRALADYVIDRHYSALKDAD---------------------NPYLSLFSAVS 235
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
ER A+L+A+W VGF HGV+NTDNM++ G TID+GP F+D +DP+ ++ D G RY
Sbjct: 236 ERQAALIARWLHVGFIHGVMNTDNMTVSGETIDFGPCAFMDNYDPATVFSSIDQHG-RYA 294
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDK------EANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ TL LID++ +AN V+ YG +F + A M K+G
Sbjct: 295 YANQPGIGQWNLARLGETL--LPLIDEEPDGAVDKANAVIRAYGERFQAHWLAGMRGKIG 352
Query: 487 LP---KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L + +++ LL+ M D+T FR LS++ D +
Sbjct: 353 LAGEEDSDLELVQALLSLMQAQGADFTLTFRRLSDLAGDAA----------AEPAFAASF 402
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGE 602
EA W+ + + L + ER M SVNP ++ RN+ + AI+AA E GDF
Sbjct: 403 REPEACGPWLAQWRERLSRDPQTAAERATAMCSVNPAFIPRNHRVEQAIEAAVENGDFSL 462
Query: 603 VRRLLKLMERPYDEQPGMEKYARLP 627
LL ++ +PYD+QPG Y P
Sbjct: 463 FEALLTVLAKPYDDQPGFAAYLEPP 487
>gi|322694898|gb|EFY86716.1| hypothetical protein MAC_07217 [Metarhizium acridum CQMa 102]
Length = 632
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 218/593 (36%), Positives = 306/593 (51%), Gaps = 88/593 (14%)
Query: 102 LEDLNWDHSFVRELPGD------------PRTDSIPREVLHACYTKVSPSAEVENPQLVA 149
L+DL F LP D PR +PR+V HA +T V P + ++P+L+A
Sbjct: 13 LQDLPKSWHFTESLPPDSVFPTPADSHKTPRDQILPRQVRHALFTWVRPERQ-KDPELLA 71
Query: 150 WSESVADSLELDPKEFERPDFPLFFSG-------ATPLAGAVPYAQCYGGHQFGMWAGQL 202
S + + + E + DF F +G L G P+AQCYGG QFG WAGQL
Sbjct: 72 VSPAALRDIGIKAGEDKTDDFRQFVAGNKLYGWDEEKLEGGYPWAQCYGGFQFGQWAGQL 131
Query: 203 GDGRAITLGEILNLKS-ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
GDGRAI+L E N + +++ELQLKGAG TPYSRFADG AVLRSSIREF+ SEA++ L I
Sbjct: 132 GDGRAISLFESRNPDTGKKYELQLKGAGLTPYSRFADGKAVLRSSIREFVVSEALNALRI 191
Query: 262 PTTRALCL-VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDI 320
P+TRAL L + V R+ EPGA+V R A+S+LR G++ I +RG D D+
Sbjct: 192 PSTRALSLTLLPHSKVLRESI-------EPGAVVLRFAESWLRLGNFDILRARG--DRDL 242
Query: 321 VRTLADYAIRHHFRHIENMN------KSESLSFSTG-----DEDHSVVDLTSNKYAAWAV 369
+R LA Y H F EN+ + S G E + N++A
Sbjct: 243 IRKLATYTAEHVFGGWENLPARLEDPERPQQSPVPGRRVPEKELQGPAETAENRFARLYR 302
Query: 370 EVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGR 429
E+A R A VA WQ GF +GVLNTDN S+ GL+ID+GPF F+D FDPS+TPN D
Sbjct: 303 EIARRNAKTVAAWQAYGFMNGVLNTDNTSVYGLSIDFGPFAFMDNFDPSYTPNHDDYT-L 361
Query: 430 RYCFANQPDIGLWNIAQFSTTL----AAAKLIDD------------------KEANYVME 467
RY + NQP I WN+ +F L AA L DD + +M+
Sbjct: 362 RYSYRNQPTIIWWNLVRFGEALGELMGAAGLADDATFISEGVKEDQQEEVISRAEKLIMQ 421
Query: 468 ---RYGTKFMDEYQAIMTKKLGL----PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVK 520
+ F+ EY+ +MT +LGL P ++ S L+ + ++D+ +FFR LSNVK
Sbjct: 422 TGDEFKEVFLGEYKRLMTLRLGLRELKPTDFNELFSPALDTLEALELDFNHFFRRLSNVK 481
Query: 521 -ADPSIPE---DELLV------PLKAVLLDIGKERKEAWI-SWVLSYIQELLS----SGI 565
A+ S PE ++ V P V D ++R W+ W +++ S
Sbjct: 482 LAEVSSPEGRREKAAVFFHAEGPPGTVGEDEARDRVAKWLEKWHARVVEDWKDGERVSEE 541
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAEL-GDFGEVRRLLKLMERPYDEQ 617
D+ER M VNP +V R+++ I E G+ + R++ + P++++
Sbjct: 542 RDQERIEAMKRVNPNFVPRSWVLDEVIRRVEKEGERDVLNRIMHMALNPFEDE 594
>gi|429210509|ref|ZP_19201676.1| Selenoprotein O-like protein [Pseudomonas sp. M1]
gi|428159283|gb|EKX05829.1| Selenoprotein O-like protein [Pseudomonas sp. M1]
Length = 488
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 211/550 (38%), Positives = 285/550 (51%), Gaps = 69/550 (12%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+K L++L +D+ F R D+ EVL P AE P+LV S + L
Sbjct: 3 VKQLDELTFDNRFAR------LGDAFSTEVL------PDPIAE---PRLVVASPAAMALL 47
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LDP P F FSG + A P A Y GHQFG + +LGDGR + LGE++N
Sbjct: 48 DLDPAVAGEPVFAEIFSGHKLWSEAEPRAMVYSGHQFGAYNPRLGDGRGLLLGEVVNDAG 107
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
+ W+L LKGAG+TPYSR DG AVLRSSIREFL SE +H LGIP++RALC+ + V R
Sbjct: 108 QHWDLHLKGAGQTPYSRMGDGRAVLRSSIREFLASEYLHALGIPSSRALCVTGSDTPVWR 167
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIEN 338
+ E A++ R+A S +RFG ++ Q D ++ L D+ + +HF
Sbjct: 168 E-------TRESAAMLLRLAPSHVRFGHFEYFYYTQQH--DKLKELGDFVLANHFPECLE 218
Query: 339 MNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
K YAA+ V E A L+A WQ GF HGV+NTDNMS
Sbjct: 219 QPK---------------------PYAAFFRAVVESNAELIAHWQAYGFCHGVMNTDNMS 257
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLID 458
ILG+T DYGP+ FLD FD N +D G RY F NQ I WN+A + L +D
Sbjct: 258 ILGITFDYGPYAFLDDFDAKHICNHSDDAG-RYSFNNQVPIAHWNLAALAQALTPFVEVD 316
Query: 459 D-KEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNK---QIISKLLNNMAVDKVDYTNFF 513
+ +EA + Y ++D +M ++LG ++ +LL M VDY+ FF
Sbjct: 317 ELREALGLFLPLYQAHYLD----LMRRRLGFTTAEDGDLDLVQRLLQAMQSGAVDYSLFF 372
Query: 514 RALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSGISDEERKAL 573
R L PE + L ++ +D+ + +W Y G S +ER+A
Sbjct: 373 RRLGE-----QAPE-QALAQVREDFVDLA-----GFDAWATDYRARAEREGGSQDERRAR 421
Query: 574 MNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYR 633
M++VNP YVLRNYL Q AI AAE GD+ VR L + + RP++EQPG E + R PP W R
Sbjct: 422 MHAVNPLYVLRNYLAQEAISAAEQGDYSVVRELHETLSRPFEEQPGREAFTRRPPDWGRR 481
Query: 634 PGVCMLSCSS 643
+SCSS
Sbjct: 482 ---LEISCSS 488
>gi|227821315|ref|YP_002825285.1| hypothetical protein NGR_c07390 [Sinorhizobium fredii NGR234]
gi|227340314|gb|ACP24532.1| gluconate permease [Sinorhizobium fredii NGR234]
Length = 501
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 195/505 (38%), Positives = 268/505 (53%), Gaps = 55/505 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y +V P+ V P L+ + + + L LD ER D FSG T +GA P A Y G
Sbjct: 29 YARVEPT-PVAEPWLIKLNRPLGEELRLDVAAIER-DGAAIFSGNTVPSGADPLAMAYAG 86
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE+++ +R ++QLKG+G+TPYSR DG A L +RE++
Sbjct: 87 HQFGTFVPQLGDGRAILLGEVIDRNGKRRDIQLKGSGQTPYSRRGDGRAALGPVLREYII 146
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LG+PTTRAL TG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 147 SEAMHALGVPTTRALAATVTGQPVYREQIL-------PGAVFTRVAASHIRVGTFQFFAA 199
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D+D V+ LADY I H+ ++ DE N Y V+
Sbjct: 200 RG--DMDSVKALADYVIDRHYPELK------------ADE---------NPYLGLLKAVS 236
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
R A+L+A+W VGF HGV+NTDNM+I G TID+GP F+DA+DP ++ D G RY
Sbjct: 237 ARQAALIARWLDVGFIHGVMNTDNMTISGETIDFGPCAFMDAYDPKKVFSSIDQFG-RYA 295
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE------ANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ + TL L D AN V+ YGT F + + M +K+G
Sbjct: 296 YANQPAIGQWNLARLAETL--VTLFDPTADVAVNLANDVLGEYGTIFQNHWLDGMRRKIG 353
Query: 487 LPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L +++ LL M D+T FR L++ D + EL +A
Sbjct: 354 LTTAEDGDLELVQALLALMHRGGADFTLTFRRLASSAEDAGA-DVELAKLFQA------- 405
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGE 602
EA W+ + + L ER A M VNP ++ RN+ + AI+AA E DF
Sbjct: 406 --PEALAPWLADWRRRLERESRQPAERAATMRGVNPAFIPRNHRVEQAIEAAIEEADFSL 463
Query: 603 VRRLLKLMERPYDEQPGMEKYARLP 627
L+ + +PY++QPG YA P
Sbjct: 464 FEALVDVTSKPYEDQPGHAAYAEPP 488
>gi|440225918|ref|YP_007333009.1| hypothetical protein RTCIAT899_CH05275 [Rhizobium tropici CIAT 899]
gi|440037429|gb|AGB70463.1| hypothetical protein RTCIAT899_CH05275 [Rhizobium tropici CIAT 899]
Length = 501
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 195/496 (39%), Positives = 266/496 (53%), Gaps = 55/496 (11%)
Query: 142 VENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQ 201
V PQL+ ++E +A L LD + ++ + FSG L G+ P A Y GHQFG + Q
Sbjct: 38 VTAPQLIKFNEVLARELGLDVETLKQ-NAAAIFSGNELLPGSQPIAMAYAGHQFGNFVPQ 96
Query: 202 LGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGI 261
LGDGRAI LGE+ + +R ++QLKG G TP+SR DG A L +RE++ SEAMH LGI
Sbjct: 97 LGDGRAILLGEVKDRSGKRRDIQLKGPGPTPFSRRGDGRAALGPVLREYIVSEAMHALGI 156
Query: 262 PTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIV 321
PTTRAL VT+G+ V R+ PGA+ RVA S +R G++Q A+RG D + V
Sbjct: 157 PTTRALAAVTSGEPVYREEVL-------PGAVFTRVAASHIRVGTFQFFAARG--DTESV 207
Query: 322 RTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQ 381
RTLAD+ I H+ I + N Y A VA+R ASL+A+
Sbjct: 208 RTLADHVIARHYPEIRDRK---------------------NPYLALLEAVADRQASLIAR 246
Query: 382 WQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGL 441
W VGF HGV+NTDNM++ G TID+GP F+DA+DP+ ++ D G RY +ANQP IG
Sbjct: 247 WLHVGFIHGVMNTDNMTVSGETIDFGPCAFMDAYDPATVFSSIDRTG-RYAYANQPAIGQ 305
Query: 442 WNIAQFSTTLAAAKLIDDKE------ANYVMERYGTKFMDEYQAIMTKKLGLPKY---NK 492
WN+A+ TL LID AN V++ YG +F + A M K+GL +
Sbjct: 306 WNLARLGETL--IPLIDPSVDVAIDLANTVIKAYGERFQACWLAGMRAKIGLVSEEDGDL 363
Query: 493 QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISW 552
+I LL M D+T FR L+ + AD + + A D +A W
Sbjct: 364 DLIQSLLATMHQQGADFTITFRRLAALAADEDVTD------FAAAFND-----PQAATLW 412
Query: 553 VLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEVRRLLKLME 611
+ + + L + R A M VNP ++ RN+ + AI+AA E GDF LLK++
Sbjct: 413 LGRWQERLARDPQTPAARSAAMRKVNPAFIPRNHRIEQAIEAAVEDGDFSLFEALLKVLA 472
Query: 612 RPYDEQPGMEKYARLP 627
PY +QP YA P
Sbjct: 473 TPYQDQPAFAPYAEPP 488
>gi|121604495|ref|YP_981824.1| hypothetical protein Pnap_1589 [Polaromonas naphthalenivorans CJ2]
gi|120593464|gb|ABM36903.1| protein of unknown function UPF0061 [Polaromonas naphthalenivorans
CJ2]
Length = 521
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 187/485 (38%), Positives = 258/485 (53%), Gaps = 51/485 (10%)
Query: 153 SVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGE 212
++AD + LDP+ + +G G P A Y GHQFG+W QLGDGR +TL E
Sbjct: 67 ALADQIGLDPRWCQSAAALPLLTGNAAWPGQTPSASVYAGHQFGVWVSQLGDGRVLTLAE 126
Query: 213 ILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTT 272
ELQLKGAG TPY+R +DG A L SS+RE L EA+H LG+PTTRAL L +
Sbjct: 127 WRAPDGSPVELQLKGAGPTPYARGSDGRATLASSVRELLACEALHALGVPTTRALSLAGS 186
Query: 273 GKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHH 332
V RD + A++ R A F+RFG ++ HA G + LAD+ I HH
Sbjct: 187 SLSVQRDEL-------DTAAVLGRTAPCFVRFGHFEFHARHGTPQQ--LALLADHVIEHH 237
Query: 333 FRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVL 392
F ++ N + ++AAW EV E TA+L A WQ +GF HGVL
Sbjct: 238 FPYLANQPQ---------------------RHAAWLAEVVELTAALFAHWQTLGFCHGVL 276
Query: 393 NTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQF----S 448
NTDN+S+LGLT+DYGP+GF++ F P N +D G RY + QP IG WN + +
Sbjct: 277 NTDNLSVLGLTLDYGPYGFMERFRPHHVCNASDHEG-RYAYTAQPAIGRWNCERLLGACA 335
Query: 449 TTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVD 505
LA ++A ++ RY + E KLGL + + ++++ L +
Sbjct: 336 GLLAPQPEAAREQAQALLARYDEVYRQEVMRRWRAKLGLREARAGDAGLLNRWLTLLQRG 395
Query: 506 KVDYTNFFRALSN-VKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG 564
K D+T FR L++ ++ DP+ E L+ L A + A W+ + L S G
Sbjct: 396 KADFTLAFRRLADAIQIDPA----EALICLPA-------GQDAALRDWLDDWRARLASEG 444
Query: 565 ISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYA 624
S R + M VNP+YVLRN+L Q+AI+ A+ G E+ RLL ++ RP+DEQPG E YA
Sbjct: 445 GSPAGRASAMRRVNPRYVLRNHLAQAAIEGAQRGSSVELHRLLAVLARPFDEQPGAEHYA 504
Query: 625 RLPPA 629
PPA
Sbjct: 505 -APPA 508
>gi|313682029|ref|YP_004059767.1| hypothetical protein Sulku_0903 [Sulfuricurvum kujiense DSM 16994]
gi|313154889|gb|ADR33567.1| protein of unknown function UPF0061 [Sulfuricurvum kujiense DSM
16994]
Length = 478
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 200/516 (38%), Positives = 280/516 (54%), Gaps = 62/516 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y KV+PS ++NP+L +++ A+ L LDP E +G L G+ PYA CY G
Sbjct: 20 YDKVAPSP-LKNPRLASFNPKAAELLGLDPALLETDKLEKLLNGTLLLNGSSPYAMCYSG 78
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + +LGDGRAI LG + W LQLKG+G+T YSR DG AVLRSSIRE+L
Sbjct: 79 HQFGYYVPRLGDGRAINLG-----SANGWNLQLKGSGQTLYSRQGDGRAVLRSSIREYLM 133
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IH 310
SEAM+ LGIPT+RAL ++++ + V R+ K E GA+V R+++S++ FGS++ H
Sbjct: 134 SEAMNALGIPTSRALAIISSDENVARE-------KWERGAVVLRLSRSWILFGSFEYFFH 186
Query: 311 ASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVE 370
+R +E + TLAD+ ++ F + G E+ Y
Sbjct: 187 TNRYKE----LETLADFLLQESFPEL------------IGAEE---------PYLKMYGL 221
Query: 371 VAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRR 430
+ +RTA L+AQWQ VGF HGV+NTDNMS +G+TIDYGPF F+D F+ + N TD G R
Sbjct: 222 IVKRTAELMAQWQSVGFNHGVMNTDNMSAVGITIDYGPFAFMDTFESGYICNHTDTQG-R 280
Query: 431 YCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGL--P 488
Y + NQP IG WN+ + + L+ L+ + +E+YG F ++ KLGL P
Sbjct: 281 YSYDNQPRIGYWNLERLAHALSP--LVTSDKLKNELEKYGEYFTARLMELLRAKLGLDTP 338
Query: 489 KYNK-QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
N + L + M ++D T FFR LS D PL A L + +
Sbjct: 339 DENDGNLFRALFSLMENGRIDMTPFFRTLSRY--------DGTREPLLAQTLAPNQLNE- 389
Query: 548 AWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLL 607
W+ Y L + S+ +R M NPKYVL+NY+ Q AID A+ DF + LL
Sbjct: 390 ----WLDRYDDRLSLNASSEAQRHVKMLRTNPKYVLKNYILQEAIDKAQNDDFTLINDLL 445
Query: 608 KLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L + PYDE E+Y++ P ++ LSCSS
Sbjct: 446 HLAQNPYDEHEAFERYSQSTP---HQFKNLKLSCSS 478
>gi|343517357|ref|ZP_08754363.1| hypothetical protein VIBRN418_14656 [Vibrio sp. N418]
gi|342793681|gb|EGU29471.1| hypothetical protein VIBRN418_14656 [Vibrio sp. N418]
Length = 489
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 198/524 (37%), Positives = 283/524 (54%), Gaps = 67/524 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+T VSP +EN + V+W+ S+A L P + + +G +P A Y G
Sbjct: 20 FTAVSPQP-LENTRWVSWNASLAAQFGL-PDQAPIGELKQQLAGELSHPQFMPLAMKYAG 77
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG++ +LGDGR + L E+ N + + +++ LKGAG TPYSR DG AVLRS+IRE+LC
Sbjct: 78 HQFGVYNPELGDGRGLLLCELENKQGKIFDVHLKGAGLTPYSRMGDGRAVLRSTIREYLC 137
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LGI TTRAL ++ + V R+ K+E GA++ R+A+S +RFG ++
Sbjct: 138 SEAMAGLGIATTRALGMLASDSPVYRE-------KQEQGALLLRMAESHIRFGHFEHFFY 190
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
Q L ++ LAD I ++ + +S YAA V
Sbjct: 191 TNQ--LSEIKLLADKVIEWYWPELAEAEQS---------------------YAAMFELVV 227
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
+ TA ++AQWQ +GF HGV+NTDNMSILG T DYGPF FLD +D S+ N +D G RY
Sbjct: 228 DNTALMIAQWQAIGFCHGVMNTDNMSILGQTFDYGPFAFLDDYDASYICNHSDYQG-RYA 286
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKYNK 492
F QP I LWN++ L + LID + + ++ + Y M KLGL +NK
Sbjct: 287 FNQQPRIALWNLSALGHAL--SPLIDKAQIEAALAQFEPRLQQYYSQQMRAKLGL--HNK 342
Query: 493 -----QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKE 547
++ L + + K DYT F R LSN+ S P +L I ++ +
Sbjct: 343 LEQDGELFVMLFDLLEQHKPDYTRFMRELSNIDRHGSQPIIDLF---------IDRDAAK 393
Query: 548 AWISWVLSYI-------QELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDF 600
AW+ L+ ++++++ I R M + NPKYVLRNYL Q AID AE GD+
Sbjct: 394 AWLDLYLARCELEVDEDEQIVTAAI----RCEAMRANNPKYVLRNYLLQLAIDKAEQGDY 449
Query: 601 GEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
+V +L +++ P+DEQ ME+ A+LPP W G M +SCSS
Sbjct: 450 SDVEQLARVLVTPFDEQSHMEELAKLPPEW----GKGMEISCSS 489
>gi|363421017|ref|ZP_09309106.1| hypothetical protein AK37_10071 [Rhodococcus pyridinivorans AK37]
gi|359734752|gb|EHK83720.1| hypothetical protein AK37_10071 [Rhodococcus pyridinivorans AK37]
Length = 502
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 188/500 (37%), Positives = 283/500 (56%), Gaps = 55/500 (11%)
Query: 139 SAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMW 198
AE +P+L+A +E +A SL LD D +GA AGA P A Y GHQFG +
Sbjct: 35 GAEAPDPELLALNEDLAVSLGLDVAALRSADGVAVLAGAEVPAGAKPVAMAYAGHQFGGY 94
Query: 199 AGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHF 258
A LGDGRA+ LGE+++ +R +L LKG+G TP+SR DG AV+ +RE+L SEAMH
Sbjct: 95 APLLGDGRALLLGELVDADGDRVDLHLKGSGPTPFSRGGDGFAVVGPMLREYLVSEAMHA 154
Query: 259 LGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDL 318
LGIPTTR+L +V TG+ V R+ EPGA++ RVA S LR G+++ A +G+
Sbjct: 155 LGIPTTRSLSVVATGRPVYRE-------GAEPGAVLARVAASHLRVGTFEFAARQGE--- 204
Query: 319 DIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASL 378
+VR LAD+AI H+ + ++ + TG+ +N+Y V E ASL
Sbjct: 205 -VVRALADHAIARHYPDLLDLPE-------TGE---------NNRYLGLFTAVVEAQASL 247
Query: 379 VAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPD 438
VAQW VGF HGV+NTDN +I G TIDYGP F+DAFDP+ ++ D G RY F NQP
Sbjct: 248 VAQWMLVGFVHGVMNTDNTTISGQTIDYGPCAFVDAFDPAAVFSSIDHSG-RYAFGNQPA 306
Query: 439 IGLWNIAQFSTTLAAAKLIDD------KEANYVMERYGTKFMDEYQAIMTKKLGLPK--Y 490
+ WN+A+F+ TL +L+D V++ + T++ Y++ + KLGLP+
Sbjct: 307 VLKWNLARFAETL--LRLVDSTPDAAIAAVTAVLDSFDTRYERHYRSGLAAKLGLPEDSL 364
Query: 491 NKQIISKLLNNMAVDKVDYTNFFRALSN-VKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+++++ LL + + D+T FRAL++ ++ +P+ P D L + +ER W
Sbjct: 365 DQELVDDLLTLLEEHRADWTVTFRALADELRGNPA-PLDGL----------VPRERSAPW 413
Query: 550 IS-WVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLK 608
+ W + ++ ++G ER M+ VNP Y+ RN+ +A+ AA GD +LL+
Sbjct: 414 LERWHAAAERDDRAAG----ERAEAMDRVNPLYIPRNHHVDAALKAATGGDLEPFAKLLE 469
Query: 609 LMERPYDEQPGMEKYARLPP 628
++ P++ + +Y P
Sbjct: 470 VVTHPFEARAEWNEYVSPAP 489
>gi|254564227|ref|YP_003071322.1| hypothetical protein METDI5920 [Methylobacterium extorquens DM4]
gi|254271505|emb|CAX27520.1| conserved hypothetical protein, UPF0061 protein [Methylobacterium
extorquens DM4]
Length = 497
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 192/504 (38%), Positives = 266/504 (52%), Gaps = 47/504 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +V+P+A VE P+L+ + ++A L LDP E P+ +G GA P A Y G
Sbjct: 19 FGRVAPTA-VEAPRLIRLNRALAVDLGLDPDRLESPEGVEVLAGRRVPEGAEPLAAAYAG 77
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE++ + R ++QLKG+G TP+SR DG A L +RE+L
Sbjct: 78 HQFGQFVPQLGDGRAILLGEVVG-RDGRRDIQLKGSGPTPFSRRGDGRAALGPVLREYLV 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL VTTG+ V R+ PGA++ RVA S +R GS+Q A+
Sbjct: 137 SEAMHALGIPTTRALAAVTTGERVIRETVL-------PGAVLTRVASSHIRVGSFQFFAA 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D++ +R+LAD+AI H D + + D N Y A V
Sbjct: 190 RG--DVEGLRSLADHAIARH------------------DPEAARAD---NPYRALLDGVI 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
R A LVA+W VGF HGV+NTDNMSI G TIDYGP FLD +DP+ ++ D G RY
Sbjct: 227 RRQAELVARWLTVGFIHGVMNTDNMSIAGETIDYGPCAFLDTYDPATAFSSIDRNG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDD----KEANYVMERYGTKFMDEYQAIMTKKLGLP 488
+ NQP I LWN+ + + L D+ EA + + +F Y + +KLGL
Sbjct: 286 YGNQPRIALWNLTRLAEALLPLLSEDETQAVAEAEAALTGFAGQFEAAYHGGLNRKLGLA 345
Query: 489 KY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLV-PLKAVLLDIGKE 544
+ + LL MA ++ D+T FR L P D V ++++ +D
Sbjct: 346 TTRDGDPALAGDLLKTMAENEADFTLTFRRLGEAVPGPDGESDPAAVEAVRSLFID---- 401
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEV 603
A W + + L R+ +M + NP ++ RN+ + I AA E DF
Sbjct: 402 -PTALDRWAEGWRRRLKDEAGDAAARRQMMRAANPAFIPRNHRVEEMITAAVERQDFAPF 460
Query: 604 RRLLKLMERPYDEQPGMEKYARLP 627
LL ++ RPYD+QP +YA P
Sbjct: 461 ETLLTVLARPYDDQPDFAQYAERP 484
>gi|358399652|gb|EHK48989.1| hypothetical protein TRIATDRAFT_129317 [Trichoderma atroviride IMI
206040]
Length = 634
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 212/565 (37%), Positives = 295/565 (52%), Gaps = 78/565 (13%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSG-- 176
PR PR+V A +T V PS E ++P+L+A S + L + E + F F +G
Sbjct: 42 PRDQITPRQVRDALFTWVRPS-EQKDPELLAVSPAALKDLGIKAGEEKTEAFRQFVAGNK 100
Query: 177 -----ATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE-RWELQLKGAGK 230
T L G P+AQCYGG QFG WAGQLGDGRAI+L E N +S R+ELQLKGAG
Sbjct: 101 LYGWDETKLEGGYPWAQCYGGFQFGQWAGQLGDGRAISLFETTNPESNVRYELQLKGAGL 160
Query: 231 TPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCL-VTTGKFVTRDMFYDGNPKEE 289
TPYSRFADG AVLRSS+REF+ SEA++ L IPTTRAL L + V R+ E
Sbjct: 161 TPYSRFADGKAVLRSSLREFVVSEALNALKIPTTRALSLTLLPHSKVLREA-------TE 213
Query: 290 PGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENM--------NK 341
PGAIV R+AQS+LR G++ + +RG D D++R LA Y F E +
Sbjct: 214 PGAIVLRLAQSWLRLGTFDLLRARG--DRDLIRKLATYIAEDVFGGWEKLPGRLESPDEP 271
Query: 342 SESLSFSTG---DEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMS 398
++S S G E D N++ E+ R A VA WQ GF +GVLNTDN S
Sbjct: 272 TKSPSPKRGVPASEVEGPSDAAENRFQRLYREIIRRNAVTVAHWQAYGFMNGVLNTDNTS 331
Query: 399 ILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAA----- 453
+ GL++DYGPF F+D FDP++TPN D RY + NQP I WN+ + TL
Sbjct: 332 VYGLSMDYGPFAFMDTFDPAYTPNHDDYT-LRYNYKNQPTIIWWNLVRLGETLGELLGIG 390
Query: 454 ---------AKLIDDKEANYVMER-----------YGTKFMDEYQAIMTKKLGLPKYNK- 492
AK I ++ ++ER Y F++EY+ +MT +LGL + +
Sbjct: 391 PQVDDETFIAKGIRQEQEKELVERAENLITQAGEEYKAVFLNEYKRLMTARLGLRHFKET 450
Query: 493 ---QIISKLLNNMAVDKVDYTNFFRALS-----NVKADPSIPEDELLV-----PLKAVLL 539
++ S+ L+ M ++D+ +FFR LS ++K E + P + V
Sbjct: 451 DFDELFSEGLDTMEALELDFNHFFRRLSTIILADIKTQEGRKEKAAIFFHKEGPSEVVGE 510
Query: 540 DIGKERKEAWI-SW---VLSYIQELLSSGISDE---ERKALMNSVNPKYVLRNYLCQSAI 592
+ KE+ W+ W VL +E S +S+E ER+ M VNP +V R ++ I
Sbjct: 511 ETAKEKIAQWLEKWRVRVLEDWKEESSHDLSEEKDAERRQAMKQVNPNFVPRGWILDEVI 570
Query: 593 DAAEL-GDFGEVRRLLKLMERPYDE 616
E GD + R+ + P+++
Sbjct: 571 RRVEKEGDRQVLDRITHMALHPFED 595
>gi|157960137|ref|YP_001500171.1| hypothetical protein Spea_0308 [Shewanella pealeana ATCC 700345]
gi|189039814|sp|A8GZ99.1|Y308_SHEPA RecName: Full=UPF0061 protein Spea_0308
gi|157845137|gb|ABV85636.1| protein of unknown function UPF0061 [Shewanella pealeana ATCC
700345]
Length = 483
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 201/535 (37%), Positives = 281/535 (52%), Gaps = 78/535 (14%)
Query: 127 EVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAV 184
E L Y++V P + NP +AWS+ AD +E+ ++P L SG + GA
Sbjct: 9 EQLPEFYSQVFPLG-ISNPHWLAWSQDAADLIEI-----KQPSDELLQGLSGNAHVDGAS 62
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
YAQ Y GHQFG ++ QLGDGR+I LGE L + W++ LKGAG TPYSR DG AV+R
Sbjct: 63 YYAQVYSGHQFGGYSPQLGDGRSIILGEALGPQGA-WDVALKGAGPTPYSRHGDGRAVMR 121
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
S++REFL SEA+H L IPTTRAL ++ + V R+ +E AI R+A+S +RF
Sbjct: 122 SAVREFLISEALHHLHIPTTRALAVIGSDLPVWRE-------SQETAAITVRLAKSHIRF 174
Query: 305 GSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSN 362
G ++ H+ RG ++ L D+ I+ H+ DL+ +
Sbjct: 175 GHFEYFCHSERGAPA--KLKQLLDFTIKQHY-----------------------PDLSCD 209
Query: 363 K--YAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFT 420
Y AW V TA ++A WQ +GF HGV+NTDNMSILG T D+GPF FLD F F
Sbjct: 210 AVGYKAWFTRVVADTAKMIANWQAIGFAHGVMNTDNMSILGDTFDFGPFAFLDTFKEGFI 269
Query: 421 PNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAI 480
N +D P RY F QP IGLWN+ + + L+ DD + + +Y + + Y +
Sbjct: 270 CNHSD-PEGRYAFGQQPGIGLWNLQRLAQALSPIIASDDLIES--LNQYQVELVKHYLLL 326
Query: 481 MTKKLGLP---------KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELL 531
M KLGL ++ +I M +++D+TN +R + DP+
Sbjct: 327 MRGKLGLKTSAAEAEQDDHDLALIGAFTGLMERNQLDHTNTWRRFGQL--DPNASHSS-- 382
Query: 532 VPLKAVLLDIGKERKEAWISWVLSYIQELLS---SGISDEERKALMNSVNPKYVLRNYLC 588
L+ +D+ + +W +Y L S G+ +ER N VNPKY+LRNYL
Sbjct: 383 --LRDDFVDL-----HGFDTWYQAYQVRLGSVDEVGLWQKER----NQVNPKYILRNYLA 431
Query: 589 QSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
Q AI A ELGD + L +L++ P+DEQ E A+ PP W G+ M SCSS
Sbjct: 432 QEAIIAVELGDLKPLHNLQRLLQNPFDEQLEFEDMAKRPPDWG--QGLIM-SCSS 483
>gi|407719848|ref|YP_006839510.1| hypothetical protein BN406_00639 [Sinorhizobium meliloti Rm41]
gi|407318080|emb|CCM66684.1| hypothetical protein BN406_00639 [Sinorhizobium meliloti Rm41]
Length = 490
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 200/505 (39%), Positives = 266/505 (52%), Gaps = 55/505 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y +V P+ V P L+ + +A L LD + ER D FSG GA P A Y G
Sbjct: 18 YARVQPT-PVAEPWLIKLNRPLAGELGLDAEALER-DGAAIFSGNLIPEGAEPLAMAYAG 75
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE+ + R ++QLKGAG+TPYSR DG A L +RE++
Sbjct: 76 HQFGTFVPQLGDGRAILLGEVTDAGGRRRDIQLKGAGQTPYSRRGDGRAALGPVLREYIV 135
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LG+PTTRAL TG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 136 SEAMHALGVPTTRALAATVTGQPVYREQIL-------PGAVFTRVAASHIRVGTFQFFAA 188
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D++ +RTLADY I H+ ++ K Y A VA
Sbjct: 189 RG--DMESIRTLADYVIGRHYPELKTDEK---------------------PYLALLKAVA 225
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
R A+L+A+W VGF HGV+NTDNM+I G TID+GP F+D +DP ++ D G RY
Sbjct: 226 ARQAALIARWLHVGFIHGVMNTDNMTISGETIDFGPCAFMDDYDPKTVFSSIDQFG-RYA 284
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE------ANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ + TL L D AN + YGT F + M +K+G
Sbjct: 285 YANQPAIGQWNLARLAETL--VTLFDPVADTAVNLANDALGEYGTIFQKHWLDGMRRKIG 342
Query: 487 L---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L + ++ LL M K D+T FR L+ A+ + + EL A L
Sbjct: 343 LLTDEDEDLDLVQSLLTLMQNGKADFTLTFRRLA-ASAENATADTEL-----ASLF---- 392
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGE 602
E +A W+ + + L ER A M SVNP ++ RN+ + AI AA E DF
Sbjct: 393 EEPQALSPWLEHWRRRLEREPQPATERAAAMRSVNPAFIPRNHRVELAIAAATEDADFSL 452
Query: 603 VRRLLKLMERPYDEQPGMEKYARLP 627
LL + RPY++QPG YAR P
Sbjct: 453 FEALLDVTSRPYEDQPGHAAYARPP 477
>gi|189195618|ref|XP_001934147.1| hypothetical protein PTRG_03814 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980026|gb|EDU46652.1| hypothetical protein PTRG_03814 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 622
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 222/622 (35%), Positives = 317/622 (50%), Gaps = 91/622 (14%)
Query: 98 KLKALEDLNWDHSFVRELPGDPR----TDSI--------PREVLHACYTKVSPSAEVENP 145
+L+ L+ L + F LP DP DS PR V A YT V P + E P
Sbjct: 16 ELQTLQSLPKSNVFTSNLPVDPAFPTPKDSHNAPLEALGPRMVKGALYTYVRPDPQGE-P 74
Query: 146 QLVAWSESVADSLELDPKEFERPDFPLFFSG--------ATPLAGAVPYAQCYGGHQFGM 197
+L+A S+ L L +E + +F +G + P G P+AQCYGG+QFG
Sbjct: 75 ELLAVSQRALQDLGLKEEEAKTEEFKELVAGKKILTWDESKPEQGIYPWAQCYGGYQFGQ 134
Query: 198 WAGQLGDGRAITLGEILN-LKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAM 256
WAGQLGDGRAI+L E N R+E+QLKGAG+TPYSRFADG AVLRSSIREF+ SE +
Sbjct: 135 WAGQLGDGRAISLFESTNPATGTRYEVQLKGAGRTPYSRFADGRAVLRSSIREFVVSEYL 194
Query: 257 HFLGIPTTRALCL-VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQ 315
+ +GIP+TRAL L + G + R+ + EPGAIV R AQS++RFG++ + RG
Sbjct: 195 NAIGIPSTRALALTLNKGSKIMRE-------RMEPGAIVTRFAQSWIRFGTFDLQRIRG- 246
Query: 316 EDLDIVRTLADYAIRHHFRHIENMNKS--ESLSFSTGDEDHSVV---------DLTSNKY 364
D +RT+ DY H + + + + + D+ H V + N+Y
Sbjct: 247 -DRKTLRTVVDYTAEHVYGGWDKLPSKLPDGDAKEVHDQTHEGVAKETVEGEAENEENRY 305
Query: 365 AAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTT 424
+ R AS VA+WQ GF +GVLNTDN SILGL+ID+GPF FLD FDP++TPN
Sbjct: 306 VRLYRAILRRNASTVAKWQAYGFMNGVLNTDNTSILGLSIDFGPFAFLDTFDPTYTPNHD 365
Query: 425 DLPGRRYCFANQPDIGLWNIAQFSTTL----AAAKLID----------DKEANYVM---- 466
D RY + NQP I WN+ + L A ++D + +A V+
Sbjct: 366 D-HMLRYSYRNQPTIIWWNLVRLGEALGELMGAGSIVDSDTFVEQGVTEAQAGEVVARGE 424
Query: 467 -------ERYGTKFMDEYQAIMTKKLGLPKYN----KQIISKLLNNMAVDKVDYTNFFRA 515
E Y F+ EY+ +MT +LGL Y + + S+LL+ + ++D+ + FR
Sbjct: 425 SAIDRAGEEYKAVFLAEYKRLMTLRLGLKTYKESDFEDLFSELLDCLEKYELDFHHAFRR 484
Query: 516 LSNVK-ADPSIPEDELLVPLKAVLLD-IGKERKEA------WISWVLSYIQELLSSGISD 567
L +V AD E K D + ++ E W+ ++E G
Sbjct: 485 LGSVTLADVDTEEKRKDAAGKFFRADNVPRQESEERARIARWLGTWAERVREDWGEG-KH 543
Query: 568 EERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVR-RLLKLMERPYDEQ-----PGME 621
EER+A M++VNPK+V R+++ ID E + ++ R++KL P+ E E
Sbjct: 544 EERRAAMDAVNPKFVPRSWVLDELIDRVEKKNERDILPRIMKLSLNPFQEHWDWDGDEEE 603
Query: 622 KYARLPPAWAYRPGVCMLSCSS 643
++ P + G+ SCSS
Sbjct: 604 RFCGDVPKYK---GMMQCSCSS 622
>gi|127511196|ref|YP_001092393.1| hypothetical protein Shew_0262 [Shewanella loihica PV-4]
gi|166228414|sp|A3Q9I6.1|Y262_SHELP RecName: Full=UPF0061 protein Shew_0262
gi|126636491|gb|ABO22134.1| protein of unknown function UPF0061 [Shewanella loihica PV-4]
Length = 484
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 196/527 (37%), Positives = 272/527 (51%), Gaps = 65/527 (12%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLF--FSGATPLAGAVPY 186
L Y++V+P + PQ +AWSE A + L ++PD L +G + GA Y
Sbjct: 11 LSGFYSQVTPQG-LPRPQWLAWSEDAAALIGL-----KQPDDELLQGLAGNQAIPGASYY 64
Query: 187 AQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSS 246
AQ Y GHQFG ++ QLGDGR+I LGE + W++ LKGAG TPYSR DG AV+RS+
Sbjct: 65 AQVYSGHQFGGYSPQLGDGRSIILGEAEGPQG-YWDVALKGAGMTPYSRHGDGRAVMRSA 123
Query: 247 IREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGS 306
+REFL SEA+H L IPTTRAL ++ + V R+ +E AI R+A+S +RFG
Sbjct: 124 VREFLVSEALHHLNIPTTRALAVIGSDLPVWRE-------TQETAAITVRLAKSHIRFGH 176
Query: 307 YQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
++ Q D ++ L D+ + H+ + Y A
Sbjct: 177 FEFFCHSEQGSKDKLKQLLDFTLSQHYPELSRDQAG---------------------YIA 215
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
W V TA L+A WQ VGF HGV+NTDNMSILG + D+GPF FLD F+ F N +D
Sbjct: 216 WFNRVVADTAKLIAHWQAVGFAHGVMNTDNMSILGDSFDFGPFAFLDTFEEDFICNHSD- 274
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG 486
P RY F QP +GLWN+ + + L DD A + Y + Y +M KLG
Sbjct: 275 PNGRYAFGQQPGVGLWNLQRLAQALVPIIASDDLIA--ALNTYQHHLVQAYLVLMRDKLG 332
Query: 487 LP----------KYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKA 536
+ + + Q+I M +++D+TN +R + + DP+ L+
Sbjct: 333 IKLVEPAGSERDEADLQLIGGFTLLMEANRLDHTNTWRRFAQL--DPNSQHSS----LRD 386
Query: 537 VLLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAE 596
+D+ + +W +Y QE L +A+ VNPKYVLRNYL Q AI A E
Sbjct: 387 DFIDLA-----GFDTWYQAY-QERLGQVSDVAGWQAVRAQVNPKYVLRNYLAQEAIIACE 440
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
G+ + L +L+ RP+DEQP E YA+ PP W G+ M SCSS
Sbjct: 441 EGNTQPLAELHQLLTRPFDEQPEKEAYAKRPPEWG--QGLIM-SCSS 484
>gi|240141718|ref|YP_002966198.1| hypothetical protein MexAM1_META1p5320 [Methylobacterium extorquens
AM1]
gi|240011695|gb|ACS42921.1| conserved hypothetical protein, UPF0061 protein [Methylobacterium
extorquens AM1]
Length = 497
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 191/504 (37%), Positives = 267/504 (52%), Gaps = 47/504 (9%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
+ +V+P+A VE P+L+ + ++A L LDP E P+ +G GA P A Y G
Sbjct: 19 FGRVAPTA-VEAPRLIRLNRALAVDLGLDPDRLESPEGVEVLAGRRVPEGAEPLAAAYAG 77
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE++ + R ++QLKG+G TP+SR DG A L +RE+L
Sbjct: 78 HQFGQFVPQLGDGRAILLGEVVG-RDGRRDIQLKGSGPTPFSRRGDGRAALGPVLREYLV 136
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LGIPTTRAL VTTG+ V R+ PGA++ RVA S +R GS+Q A+
Sbjct: 137 SEAMHALGIPTTRALAAVTTGERVIRETVL-------PGAVLTRVASSHIRVGSFQFFAA 189
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D++ +R LAD+AI H D + + D N Y A V
Sbjct: 190 RG--DVEGLRALADHAIARH------------------DPEAARAD---NPYRALLDGVI 226
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
R A+LVA+W VGF HGV+NTDNMSI G TIDYGP FLD +DP+ ++ D G RY
Sbjct: 227 RRQAALVARWLTVGFIHGVMNTDNMSIAGETIDYGPCAFLDTYDPATAFSSIDRNG-RYA 285
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDD----KEANYVMERYGTKFMDEYQAIMTKKLGLP 488
+ NQP I LWN+ + + L D+ EA + + +F Y + +KLGL
Sbjct: 286 YGNQPRIALWNLTRLAEALLPLLSEDETQAVAEAEAALTGFAGQFEAAYHGGLNRKLGLA 345
Query: 489 KY---NKQIISKLLNNMAVDKVDYTNFFRALSNVKADP-SIPEDELLVPLKAVLLDIGKE 544
+ + LL MA ++ D+T FR L P P+ + ++++ +D
Sbjct: 346 TTRDGDPALAGDLLKTMAENEADFTLTFRRLGEAVPGPDGEPDPAAVEAVRSLFID---- 401
Query: 545 RKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGEV 603
A+ W + + L R+ +M + NP ++ RN+ + I AA E DF
Sbjct: 402 -PTAYDRWAEGWRRRLKDEAGDAAARRQMMRAANPAFIPRNHRVEEMITAAVERQDFAPF 460
Query: 604 RRLLKLMERPYDEQPGMEKYARLP 627
LL ++ RPYD+QP YA P
Sbjct: 461 ETLLTVLARPYDDQPDFAHYAEPP 484
>gi|242807746|ref|XP_002485019.1| YdiU domain protein [Talaromyces stipitatus ATCC 10500]
gi|218715644|gb|EED15066.1| YdiU domain protein [Talaromyces stipitatus ATCC 10500]
Length = 596
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 205/528 (38%), Positives = 275/528 (52%), Gaps = 75/528 (14%)
Query: 119 PRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGAT 178
PR PR V A YT V P E+P+L+ S L L P E + +F +G
Sbjct: 67 PRETLGPRIVKGAMYTYVRPET-AEDPELLGVSPRAMTDLGLQPGEEKTDEFRDLVAGNK 125
Query: 179 PL-----AGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE-RWELQLKGAGKTP 232
G P+AQCYGG QFG WAGQLGDGRAI+L E+ N + R+ELQLKGAG+TP
Sbjct: 126 IFWNEQEGGVYPWAQCYGGWQFGAWAGQLGDGRAISLCELTNPSTNVRYELQLKGAGRTP 185
Query: 233 YSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKF-VTRDMFYDGNPKEEPG 291
YSRFADG AVLRSSIRE++ SEA++ LGIPTTRAL L K V R+ + EPG
Sbjct: 186 YSRFADGKAVLRSSIREYVVSEALNALGIPTTRALSLTLLPKSKVLRE-------RMEPG 238
Query: 292 AIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGD 351
AIV R AQS+LR GS+ I SR + DL +R LA Y F E++ +L G+
Sbjct: 239 AIVARFAQSWLRIGSFDILHSRNERDL--IRNLATYIAEDVFPGWESLPGVVTLPNGDGN 296
Query: 352 EDHSVVD----------------LTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTD 395
+ VD N++ E+ R A VA WQ GF +GVLNTD
Sbjct: 297 TANVNVDEPPRGIPAAELQGKEGQEENRFTRLYREIVRRNAKTVAAWQAYGFMNGVLNTD 356
Query: 396 NMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQ----FSTTL 451
N SI GL++D+GPF F+D FDPS+TPN D RY + NQP + WN+ + F +
Sbjct: 357 NTSIFGLSLDFGPFAFMDNFDPSYTPNHDD-HYLRYSYKNQPSVIWWNLVRLGEAFGELI 415
Query: 452 AAAKLIDDKE---------------------ANYVMERYGTKFMDEYQAIMTKKLGLPKY 490
AA+ +DD+E N E Y T F +EY +M+++LGL
Sbjct: 416 GAAERVDDEEFITKGVTEEFGQILIKRAETIINRTGEEYKTVFKNEYVRLMSRRLGLLTS 475
Query: 491 NKQ----IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK--- 543
+ + S+LL+ M ++D+ +FFR LS+V + +++ L K + G
Sbjct: 476 KESDFETLFSELLDTMEHLELDFNHFFRRLSDVGIEEIETDEQRLAIAKRFFHNEGISGV 535
Query: 544 --------ERKEAWI-SWVLSYIQELLSSGISDEERKALMNSVNPKYV 582
+R AW+ SW ++ G +D+ERK M VNPK +
Sbjct: 536 GNTEESACKRIAAWLSSWKDRINEDWKRDGRTDQERKERMKFVNPKVL 583
>gi|388469461|ref|ZP_10143670.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
synxantha BG33R]
gi|388006158|gb|EIK67424.1| protein of unknown function, YdiU/UPF0061 family [Pseudomonas
synxantha BG33R]
Length = 487
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 213/552 (38%), Positives = 300/552 (54%), Gaps = 72/552 (13%)
Query: 99 LKALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSL 158
+KAL++L +D+ F R GD A T V P ++ P+LV S++ L
Sbjct: 1 MKALDELTFDNRFAR--LGD------------AFSTHVLPEP-LDEPRLVVASKAAMALL 45
Query: 159 ELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKS 218
+LD E P F F G A A P A Y GHQFG + QLGDGR + LGE+ N
Sbjct: 46 DLDAAVAETPVFAELFGGHKLWAEAEPRAMVYSGHQFGGYTPQLGDGRGLLLGEVYNEAG 105
Query: 219 ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTR 278
E W+L LKGAG TPYSR DG AVLRSSIREFL SEA+H LGIP++RALC++ + V R
Sbjct: 106 EHWDLHLKGAGMTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRALCVIGSSTPVWR 165
Query: 279 DMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ--IHASRGQEDLDIVRTLADYAIRHHFRHI 336
+ K+E GA+V R+A S +RFG ++ + + ++ +++ H+
Sbjct: 166 E-------KQERGAMVLRLAHSHIRFGHFEYFYYTKKPEQQVELA------------EHV 206
Query: 337 ENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDN 396
N++ E Y A ++ ER A L+A+WQ GF HGV+NTDN
Sbjct: 207 LNLHYPECRE-------------QPEPYLAMFRKIVERNAELIAKWQAYGFCHGVMNTDN 253
Query: 397 MSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKL 456
MSILG+T D+GPF FLD FD F N +D G RY F+NQ IG WN++ + L
Sbjct: 254 MSILGITFDFGPFAFLDDFDAHFICNHSDHEG-RYSFSNQVPIGQWNLSALAQALTPFIS 312
Query: 457 IDD-KEANYVMERYGTKFMDEYQAIMTKKLGL---PKYNKQIISKLLNNMAVDKVDYTNF 512
+D KEA + Y + Y +M ++LGL + ++ ++ LL M VDYT F
Sbjct: 313 VDALKEA---LGLYLPLYQAHYLDLMRRRLGLTTAEEDDQTLVESLLKLMQNSGVDYTLF 369
Query: 513 FRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISWVLSYIQELLSSG-ISDEERK 571
FR L + A ++ L+ +D+ + +W Y + G + E+R+
Sbjct: 370 FRRLGDESAALAVAR------LRDDFVDMA-----GFDAWAELYKARVARDGDYTQEQRR 418
Query: 572 ALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYARLPPAWA 631
M++VNP Y+LRNYL Q+AI AAE GD+ EVRRL +++ +P++EQ GME+YA+ PP W
Sbjct: 419 ERMHAVNPLYILRNYLAQNAIAAAEAGDYSEVRRLHEVLCKPFEEQTGMEQYAQRPPDWG 478
Query: 632 YRPGVCMLSCSS 643
+SCSS
Sbjct: 479 RH---LEISCSS 487
>gi|336317640|ref|ZP_08572491.1| hypothetical protein Rhein_3927 [Rheinheimera sp. A13L]
gi|335877987|gb|EGM75935.1| hypothetical protein Rhein_3927 [Rheinheimera sp. A13L]
Length = 482
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 202/512 (39%), Positives = 276/512 (53%), Gaps = 55/512 (10%)
Query: 136 VSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQF 195
V+P AE P L+A+S A L+L F + D + SG AG+ P AQ Y GHQF
Sbjct: 22 VTPFAE---PTLLAFSADTAALLQLPTAFFSQTDAADYLSGKKLFAGSTPVAQKYAGHQF 78
Query: 196 GMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEA 255
G + +LGDGR + LG+IL ++L LKGAG+TPYSRF DG AVLRSSIREFL SEA
Sbjct: 79 GQYNPELGDGRGLLLGDILGSDGLHYDLHLKGAGRTPYSRFGDGRAVLRSSIREFLASEA 138
Query: 256 MHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQ 315
MH LGIPT+RAL LV + + V R+ E GA+V RV S +RFG ++ G
Sbjct: 139 MHHLGIPTSRALSLVGSAEPVQRETI-------EQGAMVIRVCPSHIRFGHFEHCFYTG- 190
Query: 316 EDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERT 375
D + ++ L D+ ++ HF N N A +V T
Sbjct: 191 -DKNQLQRLVDFTVQQHFPDCLN---------------------EKNPALAMLQQVVVHT 228
Query: 376 ASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFAN 435
A L++QWQ VGF HGV+NTDNMSILGL+ DYGP+ FLD + P + N +D G RY F
Sbjct: 229 AELISQWQAVGFNHGVMNTDNMSILGLSFDYGPYAFLDDYQPGYICNHSDHSG-RYAFDE 287
Query: 436 QPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NK 492
QP IGLWN+ + L + LI+ ++ + Y ++ Y +M KKLGL ++
Sbjct: 288 QPGIGLWNLNALAHAL--SPLIEVEDLRAALGLYEPTLVNHYMTLMGKKLGLTTQQPTDR 345
Query: 493 QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWISW 552
+I + L + + DY+ FR L++ D + ++ +LD+ A+ W
Sbjct: 346 ALIGQWLALLQQQQQDYSLSFRRLADFTDDATGSS------VRDHMLDVA-----AFDQW 394
Query: 553 VLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMER 612
Y L S ERK+ MN++NP Y+LRNYL Q I AAE GD + L+++++
Sbjct: 395 AELYRDRLALESASAVERKSQMNNINPLYILRNYLAQQVISAAEQGDTAPLHELMQVLQS 454
Query: 613 PYDEQPGMEKYARLPPAWAYRPGVCM-LSCSS 643
PY Q G E +A PP W G M +SCSS
Sbjct: 455 PYQLQAGKEAFAAPPPDW----GKGMDISCSS 482
>gi|302915521|ref|XP_003051571.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256732510|gb|EEU45858.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 641
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 217/594 (36%), Positives = 299/594 (50%), Gaps = 89/594 (14%)
Query: 101 ALEDLNWDHSFVRELPGD------------PRTDSIPREVLHACYTKVSPSAEVENPQLV 148
+LEDL F LP D PR PR+V A +T V P AE ++P+L+
Sbjct: 20 SLEDLPKSWHFTESLPADAVFPTPADSHKTPRDQITPRQVQKAIFTWVRP-AEQKDPELL 78
Query: 149 AWSESVADSLELDPKEFERPDFPLFFSG-------ATPLAGAVPYAQCYGGHQFGMWAGQ 201
A S + L + E + DF +G L G P+AQCYGG QFG WAGQ
Sbjct: 79 AVSPAALRDLGIKAGEEKTEDFRQLVAGNKLYGWDEEKLEGGYPWAQCYGGFQFGQWAGQ 138
Query: 202 LGDGRAITLGEILNLKS-ERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLG 260
LGDGRAI+L E N S ER+ELQLKGAG TPYSRFADG AVLRSSIREF+ SEA++ L
Sbjct: 139 LGDGRAISLFETTNPASGERYELQLKGAGLTPYSRFADGKAVLRSSIREFVVSEALNALK 198
Query: 261 IPTTRALCL-VTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLD 319
IPTTRAL L + V R+ + EPGAIV R AQS+LR G++ I +RG D D
Sbjct: 199 IPTTRALSLTLLPDSKVLRE-------RVEPGAIVLRFAQSWLRLGNFDILRARG--DRD 249
Query: 320 IVRTLADYAIRHHF-------RHIENMNKSES----LSFSTGDEDHSVVDLTSNKYAAWA 368
++R L+ Y F +EN ++ ++ D D N++
Sbjct: 250 LIRKLSTYIAEDVFGGWDELPARLENPDEPKTSPPPKRGVAKDTIEGPEDGEENRFTRLY 309
Query: 369 VEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPG 428
EV R A+ VA WQ GF +GVLNTDN SI GL+ID+GPF F+D FDP++TPN D
Sbjct: 310 REVVRRNATTVANWQAYGFMNGVLNTDNTSIYGLSIDFGPFAFMDNFDPTYTPNHDDY-A 368
Query: 429 RRYCFANQPDIGLWNIAQFSTTLA-----AAKLID----------DKEANYVM------- 466
RY + NQP I WN+ +F + AK+ D +EA V
Sbjct: 369 LRYSYRNQPTIIWWNLVRFGEAIGEMMGMGAKVDDPTFVEKGVTEGEEAAVVARAEKLIT 428
Query: 467 ---ERYGTKFMDEYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDYTNFFRALSNV 519
E + F++EY+ +MT +LGL + + S+ L+ + ++D+ +FFR LSN+
Sbjct: 429 QAGEEFKIVFLNEYKRLMTARLGLKTHKDSDFDVLFSEALDTLEALELDFHHFFRRLSNL 488
Query: 520 KADPSIPED----------ELLVPLKAVLLDIGKERKEAWI-SWVLSYIQELLSSGIS-- 566
K E+ P D +ER W+ SW +++ G +
Sbjct: 489 KLQDLATEEGRKEKASTFFHKEGPPTTGTEDGARERIAKWLASWRERIVEDWKDEGDNVP 548
Query: 567 ---DEERKALMNSVNPKYVLRNYLCQSAIDAAEL-GDFGEVRRLLKLMERPYDE 616
D ER M VNP +V R ++ I E G+ + R++++ P+++
Sbjct: 549 EEKDNERIKAMKKVNPNFVPRGWILDEVIKRVEKDGERDVLDRIMQMALHPFED 602
>gi|410693763|ref|YP_003624384.1| conserved hypothetical protein,ydiU [Thiomonas sp. 3As]
gi|294340187|emb|CAZ88559.1| conserved hypothetical protein,ydiU [Thiomonas sp. 3As]
Length = 513
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 203/505 (40%), Positives = 269/505 (53%), Gaps = 60/505 (11%)
Query: 134 TKVSPSAEVENPQLVAWSESVADSLELD----PKEFERPDFPLFFSGATPLAGAVPYAQC 189
VSP + +P LVA S A + L P++ + D+ F G A
Sbjct: 37 VAVSP---LPDPVLVASSADAAALVGLTAPATPQDEQ--DWARAFGGHVAAISGGSRATV 91
Query: 190 YGGHQFGMWAGQLGDGRAITLGEILNLKS-------ERWELQLKGAGKTPYSRFADGLAV 242
Y GHQFG WAGQLGDGRA+ LG+ + RWE+Q KG+G+TP+SR DG AV
Sbjct: 92 YAGHQFGNWAGQLGDGRALLLGDWPDASGGRHSCGYARWEVQFKGSGRTPFSRMGDGWAV 151
Query: 243 LRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFL 302
LRSSIREFLCSEAM LGIPTTRALCLV + + V R+ + E A+V R++ SF+
Sbjct: 152 LRSSIREFLCSEAMAALGIPTTRALCLVGSSRPVRRE-------RIETAAMVTRLSPSFV 204
Query: 303 RFGSYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSN 362
RFG ++ + GQ + +R L D+ I + D + L
Sbjct: 205 RFGHFEHFSYSGQTEQ--LRALTDWVIAQY-------------CPDCADAPQPALALLQ- 248
Query: 363 KYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPN 422
W V RTA L+AQWQ VGF HGV+NTDNMSILG TIDYGPF FLDA+DP TPN
Sbjct: 249 ----WVVA---RTARLIAQWQAVGFIHGVMNTDNMSILGWTIDYGPFAFLDAYDPLHTPN 301
Query: 423 TTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-ANYVMERYGTKFMDEYQAIM 481
TTD G RY + QP + WN+ L LID E A ++++ +++ Q +
Sbjct: 302 TTDR-GGRYAYGRQPAVAHWNLLALGQAL--LPLIDKPESALAAVDQFRPQYVQAMQQQL 358
Query: 482 TKKLGLPK---YNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL 538
KLGL + + LL+ MA ++ D+T FR L+ + AD P +P A+
Sbjct: 359 AAKLGLTAPQPGDGDLFQDLLDTMAANRSDWTLSFRHLAQLAADAHAP-----IP-PALA 412
Query: 539 LDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELG 598
+E + + WV Y + L + +D R MN+VNP VLR++L Q+AI AE G
Sbjct: 413 AQFAREPQR-FADWVARYRERLRAESRNDAARAVAMNAVNPLVVLRHHLAQAAIAQAEAG 471
Query: 599 DFGEVRRLLKLMERPYDEQPGMEKY 623
DF EVRRLL + RP+D Y
Sbjct: 472 DFSEVRRLLHALTRPFDAHAAPAHY 496
>gi|149926470|ref|ZP_01914731.1| hypothetical protein LMED105_13763 [Limnobacter sp. MED105]
gi|149824833|gb|EDM84047.1| hypothetical protein LMED105_13763 [Limnobacter sp. MED105]
Length = 522
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 197/505 (39%), Positives = 268/505 (53%), Gaps = 39/505 (7%)
Query: 147 LVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGR 206
LV + ++A+ + L+ +E + SG TP G A Y GHQFG + QLGDGR
Sbjct: 49 LVHLNTALANEVGLNAEELSKAQGIDVLSGNTPFPGYQSRASVYCGHQFGQFVPQLGDGR 108
Query: 207 AITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRA 266
A+ + EI K R +LQLKGAG TPYSR ADG AVLRSSIRE+L SEAMH LGIPTTRA
Sbjct: 109 ALLIAEIRKGKQYR-QLQLKGAGPTPYSRHADGRAVLRSSIREYLASEAMHALGIPTTRA 167
Query: 267 LCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDIVRTLAD 326
L L + V R+ E A+VCRV++SF+RFG + Q LD +R L
Sbjct: 168 LSLTASVDPVFRE-------TTETAAVVCRVSESFMRFGHVEFFCYTNQ--LDALRNLLS 218
Query: 327 YAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVG 386
+ I H I+ + +E+ SF G W V RTA + AQWQ VG
Sbjct: 219 WHIEQHHPDID-LGDTET-SFHAG-------------LLQWLGVVVARTARMAAQWQAVG 263
Query: 387 FTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQ 446
F HGV+NTDNMS+LGLTIDYGP+GF+D FD N +D G RY + NQP I WN+
Sbjct: 264 FCHGVMNTDNMSLLGLTIDYGPYGFMDGFDIDHICNHSDHQG-RYSYRNQPRIAHWNL-- 320
Query: 447 FSTTLAAAKLI-DDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYNKQIISKLLNN--- 501
++ A + LI D KE +++ + F E+ + +KLGL + L+ N
Sbjct: 321 YALAQALSPLIPDSKETLQNLLDGFADVFHAEHSTLFARKLGLAHEQGDAVDTLIENTLK 380
Query: 502 -MAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVL-LDIGKERK-EAWISWVLSYIQ 558
M +D+T FFR++S + ++ E+ L +G E + W+ +++
Sbjct: 381 FMHEHTLDFTRFFRSISALNPTATLEENFASWQQSPFFPLALGDEAQLNNSKLWLNEWLK 440
Query: 559 ELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQP 618
S E + ++ NP +VLRN+L Q AI+ A+ GDF EV RL + PY+
Sbjct: 441 ATSQPTSSVEAWRINLDQTNPAFVLRNHLLQHAIEQAQKGDFAEVNRLFAALSDPYNAAS 500
Query: 619 GMEKYARLPPAWAYRPGVCMLSCSS 643
+ Y PP WA +LSCSS
Sbjct: 501 LLGDYTAQPPDWAKS---LVLSCSS 522
>gi|209695647|ref|YP_002263576.1| hypothetical protein VSAL_I2210 [Aliivibrio salmonicida LFI1238]
gi|226701218|sp|B6EIM5.1|Y2210_ALISL RecName: Full=UPF0061 protein VSAL_I2210
gi|208009599|emb|CAQ79895.1| conserved hypothetical protein [Aliivibrio salmonicida LFI1238]
Length = 485
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 197/520 (37%), Positives = 274/520 (52%), Gaps = 64/520 (12%)
Query: 133 YTKVSPSAEVENPQLVAWSESVAD--SLELDPKEFERPDFPLF--FSGATPLAGAVPYAQ 188
+T V P + N + W+E +A +L LDP D L FSG P A
Sbjct: 21 FTHVPPQP-LNNVHWIMWNEKLAKRFNLPLDPA----ADAELLSGFSGEVVPPQFSPLAM 75
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG + LGDGR + L EI + +++ LKGAG+TPYSR DG AVLRS+IR
Sbjct: 76 KYAGHQFGSYNPDLGDGRGLLLAEIKDKAGASFDIHLKGAGRTPYSRSGDGRAVLRSTIR 135
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E+LCSEAM LGIPTTRAL ++ + V R+ + E GA++ RVA++ +RFG ++
Sbjct: 136 EYLCSEAMFGLGIPTTRALGMMGSDTPVYREGY-------ETGALLLRVAETHVRFGHFE 188
Query: 309 --IHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAA 366
+++ E + LAD I HF + N YA
Sbjct: 189 HLFYSNLLAEH----KLLADKVIEWHFPDCLD---------------------NENPYAV 223
Query: 367 WAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDL 426
E+ +RTA ++A WQ VGF HGV+NTDNMSI+G T DYGPFGFLD ++P + N +D
Sbjct: 224 MFNEIVDRTAKMIAHWQAVGFAHGVMNTDNMSIIGQTFDYGPFGFLDDYEPGYICNHSDY 283
Query: 427 PGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLG 486
G RY F QP IGLWN++ + L+ LID + + +E+Y + + +M +KLG
Sbjct: 284 QG-RYAFNQQPRIGLWNLSALAHALSP--LIDKADLDQALEQYEVQLHGYFSQLMRQKLG 340
Query: 487 L---PKYNKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L + ++ + ++ + VDYT F R LSNV + ++D+
Sbjct: 341 LITKQDGDSRLFESMFELLSQNSVDYTRFLRELSNVDTHN-----------EQAIIDLFI 389
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEV 603
+R A + WV YI + R M VNPKY+LRNYL Q AID A+ GD+ E+
Sbjct: 390 DRDAAKL-WVSLYITRCEKEHETVASRCKKMREVNPKYILRNYLAQQAIDKAQEGDYSEL 448
Query: 604 RRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
L L+ P+DE E YA LPP+W + +SCSS
Sbjct: 449 EALSLLLRSPFDEHIEFEHYANLPPSWGKK---MEISCSS 485
>gi|379736257|ref|YP_005329763.1| hypothetical protein BLASA_2861 [Blastococcus saxobsidens DD2]
gi|378784064|emb|CCG03732.1| conserved protein of unknown function [Blastococcus saxobsidens
DD2]
Length = 492
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 191/506 (37%), Positives = 268/506 (52%), Gaps = 73/506 (14%)
Query: 141 EVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAG 200
E P+L+A +E +A L LDP P+ G GA P AQ Y GHQFG +A
Sbjct: 30 EAPEPRLLALNEPLATGLGLDPAALRTPEGLRLLVGTGVPDGATPVAQAYAGHQFGGFAP 89
Query: 201 QLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLG 260
+LGDGRA+ LGE+++ + +L LKG+G+TP++R DGLA + +RE++ SEAMH LG
Sbjct: 90 RLGDGRALLLGELVDAEGRLRDLHLKGSGRTPFARGGDGLAAIGPMLREYVISEAMHALG 149
Query: 261 IPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLDI 320
IPTTR+L +V TG+ V R+ PGA++ RVA S LR GS+Q +R +DLD+
Sbjct: 150 IPTTRSLAVVATGRQVRRETLL-------PGAVLARVASSHLRVGSFQY--ARVTDDLDL 200
Query: 321 VRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVA 380
+R LAD+AI H G+E + + N Y A V ASLVA
Sbjct: 201 LRRLADHAIARH-------------RVGAGEEGAARAE---NPYLALFEAVVSAQASLVA 244
Query: 381 QWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIG 440
W VGF HGV+NTDNM+I G TIDYGP FLDAFDP+ ++ D G RY + NQP +
Sbjct: 245 SWMLVGFVHGVMNTDNMTISGETIDYGPCAFLDAFDPATVYSSIDT-GGRYAYGNQPLVA 303
Query: 441 LWNIAQFSTTLAAAKLIDDKEANYV------MERYGTKFMDEYQAIMTKKLGLPKYN--- 491
WN+A+ + L L+ D EA + + + ++ + A M KLGL +
Sbjct: 304 EWNLARLAEAL--LPLLHDDEAQAIPVAVEALRGFRPRYEAAWTAGMRAKLGLTAASDDD 361
Query: 492 --KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAW 549
+ LL + D VD T+FFR L++ + P L + L + + W
Sbjct: 362 TVASLAVDLLELLHRDHVDLTSFFRGLASAARGDAEPTRLLFLDLAGI---------DGW 412
Query: 550 IS-WVLSYIQELLSSGISDEERKAL------MNSVNPKYVLRNYLCQSAIDAAELGDFGE 602
++ W +AL M+ VNP Y+ RN+L + A+DAA GD G
Sbjct: 413 LARW------------------RALQPDPDGMDRVNPVYIPRNHLVEEALDAATGGDLGP 454
Query: 603 VRRLLKLMERPYDEQPGMEKYARLPP 628
+ RLL + PYD++PG+E+YA P
Sbjct: 455 LDRLLDAVTAPYDQRPGLERYAAPAP 480
>gi|409393023|ref|ZP_11244533.1| hypothetical protein GORBP_109_00290 [Gordonia rubripertincta NBRC
101908]
gi|403197204|dbj|GAB87767.1| hypothetical protein GORBP_109_00290 [Gordonia rubripertincta NBRC
101908]
Length = 501
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 188/492 (38%), Positives = 265/492 (53%), Gaps = 51/492 (10%)
Query: 140 AEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWA 199
AEV +PQL+ +E +A SL LD + D +GA A P A Y GHQFG +A
Sbjct: 35 AEVPDPQLLVVNEPLASSLGLDVEALRSVDGVAILAGAAVPADGRPVATAYSGHQFGGYA 94
Query: 200 GQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFL 259
LGDGRA+ LGE+L++ R +LQLKG+G TP+SR DG AV+ +RE+L SEAMH L
Sbjct: 95 PLLGDGRALLLGELLDVDGHRVDLQLKGSGPTPFSRGGDGFAVVGPMLREYLISEAMHAL 154
Query: 260 GIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRGQEDLD 319
G+PTTR+L +V TG+ V R+ EPGA++ R+A S LR G+++ A G D
Sbjct: 155 GVPTTRSLSVVATGRGVHRNGV-------EPGAVLARIAASHLRVGTFEFAARNG----D 203
Query: 320 IVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLV 379
I++ LADYAI H+ + ++ +TG N+YA V ER A LV
Sbjct: 204 ILQPLADYAITRHYPDLTDLP-------TTG---------AGNRYAKLLERVVERQARLV 247
Query: 380 AQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDI 439
AQW VGF HGV+NTDN +I G TIDYGP F+DAFDP+ ++ D G RY F NQP +
Sbjct: 248 AQWMLVGFVHGVMNTDNTTISGETIDYGPCAFIDAFDPAAVFSSID-QGGRYAFGNQPAV 306
Query: 440 GLWNIAQFSTTLAAAKLI----DD--KEANYVMERYGTKFMDEYQAIMTKKLGLPK--YN 491
WN+A+F+ TL +LI DD A + + + + ++ KLGL +
Sbjct: 307 LKWNLARFAETL--LRLISPTPDDAIATATATLSTFDSLYEQHLNEGLSAKLGLADTFVD 364
Query: 492 KQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGKERKEAWIS 551
+I LL MA + D+T FRAL++ + P D+LL +E
Sbjct: 365 HALIDDLLALMAEHRADWTGTFRALADELRGRTAPLDQLLA-------------REVSAP 411
Query: 552 WVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLME 611
W+ + + L G D M+ VNP Y+ RN++ +A+ AA GD +L ++
Sbjct: 412 WLARWRETLTQHGRDDATTADAMDRVNPLYIPRNHMVDAALRAAHEGDLAPFEEMLDVVT 471
Query: 612 RPYDEQPGMEKY 623
P++ + KY
Sbjct: 472 HPFERRVDWVKY 483
>gi|348524626|ref|XP_003449824.1| PREDICTED: selenoprotein O-like, partial [Oreochromis niloticus]
Length = 588
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 181/435 (41%), Positives = 247/435 (56%), Gaps = 38/435 (8%)
Query: 101 ALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLEL 160
L L + ++ +++LP D R V AC++++ + P VA S++ L L
Sbjct: 10 VLGRLPFKNTVLKKLPIDDSEQPGSRMVPEACFSRIRALQPLVRPVFVALSQTALSLLGL 69
Query: 161 DPKE-FERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSE 219
+E P P + SG+ L G+ P A CY GHQFG++A QLGDG + LGE+ +
Sbjct: 70 SAQEVLSDPLGPEYLSGSRLLPGSEPAAHCYSGHQFGLFAAQLGDGAVMYLGEVESCAHG 129
Query: 220 RWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRD 279
RWE+Q+KGAG TPYSR DG VLRSSIREFLCSEAM LGIP+TRA LVT+ +V+RD
Sbjct: 130 RWEIQVKGAGVTPYSRDGDGRKVLRSSIREFLCSEAMAALGIPSTRAASLVTSDLYVSRD 189
Query: 280 MFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASR-----------GQEDLDIVRTLADYA 328
+G E ++V RVA +F+RFGS++I R G++ DI L DY
Sbjct: 190 PLNNGQRILERCSVVLRVAPTFIRFGSFEIFLGRDEFSGLQGPSAGRD--DIRAQLLDYI 247
Query: 329 IRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFT 388
+ I+ + HS+ ++ A+ EV RTA LVAQWQ VGF
Sbjct: 248 GDTFYPQIQ--------------QAHSI---RKDRNLAFFREVMTRTARLVAQWQCVGFC 290
Query: 389 HGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFS 448
HGVLNTDNMSILGLT+DYGPFGF++ FDP F N +D RRY + QP + WN+A +
Sbjct: 291 HGVLNTDNMSILGLTLDYGPFGFMERFDPDFVSNASD-KKRRYSYQAQPSVCRWNLACLA 349
Query: 449 TTLAAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY----NKQIISKLLNNMAV 504
L + +D EA V++ + + Y +IM KKLGL + +++++S LL M
Sbjct: 350 EALGSE--LDPAEAGAVLDEFMPMYEAFYLSIMRKKLGLVRIEEAEDRELVSDLLRVMHN 407
Query: 505 DKVDYTNFFRALSNV 519
D+TN FR LS V
Sbjct: 408 TGADFTNTFRLLSRV 422
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 52/81 (64%), Gaps = 7/81 (8%)
Query: 540 DIGKERKEAWISWVLSYIQEL--LSSGISD-----EERKALMNSVNPKYVLRNYLCQSAI 592
D+ +++++ WI W+ Y + L SD +ER +MN +NP+ VLRNY+ Q+ I
Sbjct: 508 DLKRKQRDDWIYWIGQYRRRLGRECDSTSDLPVIIKERLKVMNGINPRVVLRNYIAQNVI 567
Query: 593 DAAELGDFGEVRRLLKLMERP 613
AAE GDF E+ R+LK++E+P
Sbjct: 568 QAAEKGDFSEIVRVLKVLEKP 588
>gi|254458812|ref|ZP_05072236.1| hypothetical protein CBGD1_1949 [Sulfurimonas gotlandica GD1]
gi|373867139|ref|ZP_09603537.1| protein containing UPF0061 domain [Sulfurimonas gotlandica GD1]
gi|207084578|gb|EDZ61866.1| hypothetical protein CBGD1_1949 [Sulfurimonas gotlandica GD1]
gi|372469240|gb|EHP29444.1| protein containing UPF0061 domain [Sulfurimonas gotlandica GD1]
Length = 481
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 187/522 (35%), Positives = 284/522 (54%), Gaps = 59/522 (11%)
Query: 126 REVLHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVP 185
RE+ + +V P+ +++P L++ S+ A L +D + + +G L G+
Sbjct: 15 RELDPIFFDEVEPTP-LKDPFLISVSKDAAKLLGVDEDITKDENLVGILNGTYSLEGSDT 73
Query: 186 YAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRS 245
+A CY GHQFG + +LGDGRAI LG++ LQLKG+G T YSR DG AVLRS
Sbjct: 74 FAMCYAGHQFGHFVYRLGDGRAINLGKV-----NGQNLQLKGSGLTLYSRMGDGRAVLRS 128
Query: 246 SIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFG 305
SIRE+L SEAMH LGI T+RAL L+ + VTR + E GAIV R++ +++RFG
Sbjct: 129 SIREYLMSEAMHGLGIETSRALALIGSDSDVTR-------QEREKGAIVLRLSPTWVRFG 181
Query: 306 SYQIHASRGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYA 365
+++ RG+ V+ LADY I F H++++ Y
Sbjct: 182 TFEYFNFRGEHAR--VQKLADYVIDESFEHLKDV---------------------EGMYV 218
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
E+ TA +A+WQ VGF HGV+NTDNMSI G TIDYGPF FLD ++ + N TD
Sbjct: 219 KMYEEIVRNTAITIARWQSVGFNHGVMNTDNMSIDGRTIDYGPFAFLDDYESGYICNHTD 278
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMER-YGTKFMDEYQAIMTKK 484
+ G RY F NQP I WN+ + + L++ +++ A ++++ +GT + +EY +IM KK
Sbjct: 279 VDG-RYSFKNQPGIAHWNLHKLAVALSS--IVNHDRALEILDKTFGTSYEEEYLSIMYKK 335
Query: 485 LGLPKYNKQIISK---LLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDI 541
+GL + +++ I +L ++ +DYT FFR LS D K +LD+
Sbjct: 336 MGLYERDEKDIELFKWMLGSLESATIDYTKFFRTLSAYDGD------------KKNILDM 383
Query: 542 GKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
+ W+ +Y + L +S+E+R M NPKY+L+N++ Q AI+ A+ G++
Sbjct: 384 AV-FQTPLSEWLDAYDERLKKETLSNEKRHTQMLKTNPKYILKNHILQEAIEKAQQGEYS 442
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLPPAWAYRPGVCMLSCSS 643
+ LL + P++E +E A+ P + LSCSS
Sbjct: 443 MIDELLIVAHSPFEEHLELEHLAKATPL---KSKNIKLSCSS 481
>gi|344244934|gb|EGW01038.1| Selenoprotein O [Cricetulus griseus]
Length = 533
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 170/353 (48%), Positives = 214/353 (60%), Gaps = 29/353 (8%)
Query: 185 PYAQCYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLR 244
P A CY GHQFG +AGQLGDG AI LGE+ ERWELQLKGAG TP+SR ADG VLR
Sbjct: 2 PAAHCYCGHQFGQFAGQLGDGAAIYLGEVCTAAGERWELQLKGAGPTPFSRQADGRKVLR 61
Query: 245 SSIREFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRF 304
SSIREFLCSEAM LGIPTTRA VT+ V RD+FYDGNPK E +V R+A +F+RF
Sbjct: 62 SSIREFLCSEAMFHLGIPTTRAGACVTSESKVIRDVFYDGNPKYEKCTVVLRIAPTFIRF 121
Query: 305 GSYQI------HASRGQEDL---DIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHS 355
GS++I H R + DI + DY I + I+ + T D D+
Sbjct: 122 GSFEIFKSPDEHTGRAGPSMGRNDIRVQMLDYVISSFYPEIQAAH--------TCDSDN- 172
Query: 356 VVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAF 415
+ AA+ EV RTA +VA+WQ VGF HGVLNTDNMSI+GLTIDYGPFGFLD +
Sbjct: 173 -----IQRNAAFFREVTRRTARMVAEWQCVGFCHGVLNTDNMSIVGLTIDYGPFGFLDRY 227
Query: 416 DPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKEANYVMERYGTKFMD 475
DP N +D G RY ++ QP + WN+ + + L + EA + E + T+F
Sbjct: 228 DPDHVCNASDSAG-RYTYSKQPQVCKWNLQKLAEALEPELPLALGEA-ILAEEFDTEFQR 285
Query: 476 EYQAIMTKKLGLPKYNKQ----IISKLLNNMAVDKVDYTNFFRALSNVKADPS 524
Y M KKLGL + ++ +++KLL M + D+TN F LS+ A+PS
Sbjct: 286 HYLQKMRKKLGLIRVEQEGDGALVAKLLETMHLTGADFTNTFYMLSSFPAEPS 338
Score = 72.0 bits (175), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 57/106 (53%), Gaps = 23/106 (21%)
Query: 549 WISWVLSY-------IQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFG 601
W +W+ Y +++ + ER +M++ NPKYVLRNY+ Q+AI+AAE GDF
Sbjct: 422 WETWLQEYRARLDKEKEDVGDTAAWQAERVRIMHTNNPKYVLRNYIAQNAIEAAENGDFA 481
Query: 602 EVRRLLKLMERPY------DEQPGMEKYARL----------PPAWA 631
EVRR+LKL+E PY E G E AR PP WA
Sbjct: 482 EVRRVLKLLESPYYSEGAATEATGPEAAARTTDEQCSYSSRPPLWA 527
>gi|389689564|ref|ZP_10178782.1| hypothetical protein MicloDRAFT_00008900 [Microvirga sp. WSM3557]
gi|388590054|gb|EIM30340.1| hypothetical protein MicloDRAFT_00008900 [Microvirga sp. WSM3557]
Length = 492
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 188/506 (37%), Positives = 267/506 (52%), Gaps = 56/506 (11%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y +V P A V P+LV + +A L LDP PD SG A P A Y G
Sbjct: 19 YARVEPEA-VAAPRLVRLNRDLALHLGLDPDRLSSPDGVELLSGNRVPDAAEPIAMAYAG 77
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE+++ S R ++QLKG+G TP+SR DG A L +RE+L
Sbjct: 78 HQFGQFVPQLGDGRAILLGEVVDQNSIRRDIQLKGSGPTPFSRRGDGRAALGPVLREYLL 137
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAM LG+PTTRAL V TG+ V R+ PGA++ RVA S +R G++Q A+
Sbjct: 138 SEAMAALGLPTTRALAAVLTGETVARETLL-------PGAVLTRVASSHIRVGTFQFFAA 190
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
R +D++ +R LADY I H+ ++ Y A+ +V
Sbjct: 191 R--QDVEGLRLLADYVIARHYPQAAESDR---------------------PYRAFLDQVI 227
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
A L+A+W +GF HGV+NTDNMSI G TIDYGP F+DA+DP+ ++ D G RY
Sbjct: 228 AAQADLIARWLHIGFIHGVMNTDNMSIAGETIDYGPCAFMDAYDPATVFSSIDRQG-RYA 286
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDD----KEANYVMERYGTKFMDEYQAIMTKKLGLP 488
+ NQP IGLWN+ + + TL +D+ +A+ +E + KF Y A + +KLGL
Sbjct: 287 YGNQPRIGLWNLTRLAETLLPLLFLDEDKAVADASEALEAFSGKFEAAYHAGLRRKLGLL 346
Query: 489 KYNKQ---IISKLLNNMAVDKVDYTNFFRALSNVKADPSIPE---DELLVPLKAVLLDIG 542
++ + LLN MA ++ D+T FR LS+ A P+ E + + PL
Sbjct: 347 TEREEDLTLAGDLLNAMAENQADFTLTFRRLSDAAAGPAGDEAVRNLFINPL-------- 398
Query: 543 KERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFG 601
A+ +W + + + L R+ M +VNP ++ RN+ ++ I AA E DF
Sbjct: 399 -----AYDAWAVRWRERLSLEPQDGASRQVAMRAVNPAFIPRNHRVEAMIQAAVERDDFA 453
Query: 602 EVRRLLKLMERPYDEQPGMEKYARLP 627
LL ++ PY +QP Y+ P
Sbjct: 454 PFEELLAVLSNPYQDQPAFAHYSEPP 479
>gi|378825270|ref|YP_005188002.1| hypothetical protein SFHH103_00678 [Sinorhizobium fredii HH103]
gi|365178322|emb|CCE95177.1| UPF0061 protein RL1355 [Sinorhizobium fredii HH103]
Length = 502
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 191/505 (37%), Positives = 268/505 (53%), Gaps = 54/505 (10%)
Query: 133 YTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQCYGG 192
Y +V P+ V P L+ + +A+ L LD ER D FSG T AGA P A Y G
Sbjct: 29 YARVEPT-PVAEPWLIKLNRPLAEELRLDIAALER-DGAAIFSGNTVPAGAEPLAMAYAG 86
Query: 193 HQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLC 252
HQFG + QLGDGRAI LGE++ +R ++QLKG+G+TPYSR DG A L +RE++
Sbjct: 87 HQFGTFVPQLGDGRAILLGEVIGRDGKRRDIQLKGSGQTPYSRRGDGRAALGPVLREYIV 146
Query: 253 SEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHAS 312
SEAMH LG+PTTRAL + TG+ V R+ PGA+ RVA S +R G++Q A+
Sbjct: 147 SEAMHALGVPTTRALAVTVTGQPVYREQIL-------PGAVFTRVAASHIRVGTFQFFAA 199
Query: 313 RGQEDLDIVRTLADYAIRHHFRHIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVA 372
RG D+D V+ LAD+ I H+ ++ ++ N Y V+
Sbjct: 200 RG--DMDSVKALADHVIDRHYPELKAADE--------------------NPYLGLLKAVS 237
Query: 373 ERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYC 432
R A+L+A+W +GF HGV+NTDNM+I G TID+GP F+DA+DP ++ D G RY
Sbjct: 238 ARQAALIARWLHIGFIHGVMNTDNMTISGETIDFGPCAFMDAYDPKKVFSSIDQFG-RYA 296
Query: 433 FANQPDIGLWNIAQFSTTLAAAKLIDDKE------ANYVMERYGTKFMDEYQAIMTKKLG 486
+ANQP IG WN+A+ + TL L D AN V+ YGT F + + M +K+G
Sbjct: 297 YANQPAIGQWNLARLAETL--VTLFDPTADVAVNLANDVLGEYGTIFQNHWLDGMRRKIG 354
Query: 487 LPKYNK---QIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAVLLDIGK 543
L +++ LL M D+T FR L++ D + EL +A
Sbjct: 355 LSTAEDGDLELVQALLALMHKGGADFTLTFRRLASSAEDAGA-DVELAKLFQA------- 406
Query: 544 ERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAIDAA-ELGDFGE 602
E W+ + + L ER + M +VNP ++ RN+ + AI+AA E DF
Sbjct: 407 --PETLSPWLADWRRRLARESRQPVERASAMRAVNPAFIPRNHRVEQAIEAAIEDADFSL 464
Query: 603 VRRLLKLMERPYDEQPGMEKYARLP 627
L+ + +PY+ QPG YA P
Sbjct: 465 FEALVDVTSKPYEGQPGHAAYAEPP 489
>gi|340500605|gb|EGR27471.1| selenoprotein o, putative [Ichthyophthirius multifiliis]
Length = 508
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 194/489 (39%), Positives = 274/489 (56%), Gaps = 43/489 (8%)
Query: 100 KALEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLE 159
++ +LN+ +S + +LP T + P+ V Y+KV P NP+++ S+ + L+
Sbjct: 5 QSFYNLNFINSAINKLPIQTPTTTNPQTVRGYFYSKVEPKIR-PNPKIIILSDPALNLLD 63
Query: 160 LDPKEF--ERPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNLK 217
L +E ++ F FF G VP A CY GHQFG WAGQLGDGRAI++G+I N K
Sbjct: 64 LTKEEILKDQNSFTQFFCGNLLNESQVPIAHCYCGHQFGSWAGQLGDGRAISIGDIRNKK 123
Query: 218 SERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFVT 277
+ ELQLKG+G TPYSRFADG AVLRSSIREFLCSE ++FL IPTTRA +V T
Sbjct: 124 GQIIELQLKGSGVTPYSRFADGNAVLRSSIREFLCSEFLYFLDIPTTRAASIVQTDDLAQ 183
Query: 278 RDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQIHASRG-QEDL--DIVRTLADYAIRHHFR 334
RD++Y+GN +E IV R+A +F+RFGS+QI G E L ++ L DY I +
Sbjct: 184 RDIYYNGNVIQEKCCIVLRLAPTFIRFGSFQICDKGGPSEGLGDQMIPELTDYVIDLFYE 243
Query: 335 HIENMNKSESLSFSTGDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
+++ +KY + ++ ++TA LVA+WQ V F HGVLNT
Sbjct: 244 GLKD---------------------KEDKYRLFFEDIVKKTAILVAKWQTVAFCHGVLNT 282
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIAQFSTTLAAA 454
DNMSILGLTID+GPFGF++ F+ N +D G Y + NQP WN+ + + +L
Sbjct: 283 DNMSILGLTIDFGPFGFMEHFNKEHICNHSDQDG-YYSYENQPKACKWNLLRLAESLKY- 340
Query: 455 KLIDDKEA-NYVMERYGTKFMDEYQAIMTKKLGLPKYN----KQIISKLLNNMAVDKVDY 509
++D E+ Y+ E + + Y IM +KLG+ N K+I+ +L+ M ++Y
Sbjct: 341 -VLDFGESKKYIEENFDVILQENYYNIMREKLGIYSQNQEDCKRIVDQLIEVMHELGLEY 399
Query: 510 TNFFRALSNVKADPSIPEDEL--LVPLKA---VLLDIGKERKEAWISWVLSYIQELLSSG 564
TNFFR LS V + ED L + KA +L+D K R + +L I EL S
Sbjct: 400 TNFFRKLSTVNILNTDIEDILNQFLLFKAPDNILMDRIKPR---FTQEMLEKISELYESN 456
Query: 565 ISDEERKAL 573
D + +
Sbjct: 457 PLDMQMRGF 465
>gi|94266486|ref|ZP_01290177.1| Protein of unknown function UPF0061 [delta proteobacterium MLMS-1]
gi|93452901|gb|EAT03412.1| Protein of unknown function UPF0061 [delta proteobacterium MLMS-1]
Length = 517
Score = 305 bits (782), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 195/513 (38%), Positives = 278/513 (54%), Gaps = 41/513 (7%)
Query: 129 LHACYTKVSPSAEVENPQLVAWSESVADSLELDPKEFERPDFPLFFSGATPLAGAVPYAQ 188
L A + + V P+L+ + ++A L L + + F+G AGA P A
Sbjct: 22 LPAAFYRFCNPTPVAAPRLLKLNAALAGELGLQLEGLDEQALAEIFAGNRLSAGAQPLAM 81
Query: 189 CYGGHQFGMWAGQLGDGRAITLGEILNLKSERWELQLKGAGKTPYSRFADGLAVLRSSIR 248
Y GHQFG QLGDGRAI LGE+L+ + RW++QLKGAGKTP+SR DG A L IR
Sbjct: 82 AYAGHQFGSLVPQLGDGRAILLGEVLDGRGRRWDIQLKGAGKTPFSRGGDGRAPLGPVIR 141
Query: 249 EFLCSEAMHFLGIPTTRALCLVTTGKFVTRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ 308
E+L SEAMH LGIPTTRAL V++G+ V R+ PGA++ RVA S +R G+++
Sbjct: 142 EYLVSEAMHALGIPTTRALAAVSSGEQVMRERLL-------PGAVITRVAASHIRVGTFE 194
Query: 309 IHASRGQEDLDIVRTLADYAIRHHFRHIEN--MNKSESLSFSTGDE-DHSVVDLTSNKYA 365
A RG D +RTLADY I H+ I +N E+ G HS +Y
Sbjct: 195 FFARRG--DFASLRTLADYVIPRHYPEINGPEINGPETNGPEIGGAGGHS-------RYL 245
Query: 366 AWAVEVAERTASLVAQWQGVGFTHGVLNTDNMSILGLTIDYGPFGFLDAFDPSFTPNTTD 425
A V R A LVA+W +GF HGV+NTDN +I G TIDYGP FLD + P + D
Sbjct: 246 ALLAAVIARQAELVARWMSIGFIHGVMNTDNTTISGETIDYGPCAFLDHYHPETVFSAID 305
Query: 426 LPGRRYCFANQPDIGLWNIAQFSTTLAAAKLIDDKE-----ANYVMERYGTKFMDEYQAI 480
G RY + QP I WN+A+F+ +L L DD+E A +++ + ++ +
Sbjct: 306 T-GGRYAYHMQPRIAQWNLARFAESLLPL-LHDDQEQAIALATALLQDFMPRYEKAWLTR 363
Query: 481 MTKKLGL--PKY-NKQIISKLLNNMAVDKVDYTNFFRALSNVKADPSIPEDELLVPLKAV 537
M K+GL P+ ++++I +LL MA ++VD+T FFR L+N +P+ E + + PL
Sbjct: 364 MGNKIGLTDPQPDDRKLIEELLAAMADNEVDFTLFFRRLANAVENPT--EADGIRPL--- 418
Query: 538 LLDIGKERKEAWISWVLSYIQELLSSGISDEERKALMNSVNPKYVLRNYLCQSAID-AAE 596
R EAW W + + L + + ER M SVNP + RN+ + AI A E
Sbjct: 419 -----FNRPEAWEHWAEGWHKRLAADPLPPAERAKRMRSVNPAIIPRNHRIEQAISKATE 473
Query: 597 LGDFGEVRRLLKLMERPYDEQPGMEKYARLPPA 629
DF + +L + + P+++ P +++ PPA
Sbjct: 474 AADFSDFTKLNQALNHPWEDNPERDRWL-APPA 505
>gi|92114613|ref|YP_574541.1| hypothetical protein Csal_2495 [Chromohalobacter salexigens DSM
3043]
gi|121957868|sp|Q1QUL6.1|Y2495_CHRSD RecName: Full=UPF0061 protein Csal_2495
gi|91797703|gb|ABE59842.1| protein of unknown function UPF0061 [Chromohalobacter salexigens
DSM 3043]
Length = 494
Score = 305 bits (782), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 214/558 (38%), Positives = 286/558 (51%), Gaps = 89/558 (15%)
Query: 102 LEDLNWDHSFVRELPGDPRTDSIPREVLHACYTKVSPSAEVENPQLVAWSESVADSLELD 161
L+ L +D+++ R LP D +T+VSP A +N +L+ S +L LD
Sbjct: 10 LDSLRFDNAWAR-LPED-------------FFTRVSP-ATWKNTRLLDISPRGCRALGLD 54
Query: 162 PKEFE-----RPDFPLFFSGATPLAGAVPYAQCYGGHQFGMWAGQLGDGRAITLGEILNL 216
P F+ R G T L G P AQ Y GHQFG++ LGDGR + +GE
Sbjct: 55 PACFDDDAPARETLRQLMGGETVLPGMAPLAQKYTGHQFGVYNPALGDGRGLLMGEA-QT 113
Query: 217 KSERWELQLKGAGKTPYSRFADGLAVLRSSIREFLCSEAMHFLGIPTTRALCLVTTGKFV 276
W+L LKGAG+TPYSRF DG AVLRSS+RE+L EAM LG+PTT AL L T + V
Sbjct: 114 ADGYWDLHLKGAGQTPYSRFGDGRAVLRSSVREYLAGEAMAGLGVPTTLALALATNDEKV 173
Query: 277 TRDMFYDGNPKEEPGAIVCRVAQSFLRFGSYQ-IHASRGQEDLDIVRTLADYAIRHHFRH 335
R+ + EPGA + R+A S +RFG ++ ++ SR +D+ R L D+ I RH
Sbjct: 174 QRE-------RVEPGATLLRLAPSHVRFGHFEWLYQSRRHDDM---RRLVDHVIE---RH 220
Query: 336 IENMNKSESLSFST-GDEDHSVVDLTSNKYAAWAVEVAERTASLVAQWQGVGFTHGVLNT 394
+ SES + + GD V RTA L+A WQ GF H V+NT
Sbjct: 221 RPALAASESPAEALFGD-------------------VVARTARLIAAWQAYGFVHAVMNT 261
Query: 395 DNMSILGLTIDYGPFGFLDAFDPSFTPNTTDLPGRRYCFANQPDIGLWNIA---QFSTTL 451
DNMSILGLT+DYGP+ F+DA+DP PN TD G RY F QP +GLWN++ Q T L
Sbjct: 262 DNMSILGLTLDYGPYAFMDAYDPRLVPNHTDANG-RYAFDQQPGVGLWNLSVLGQSLTPL 320
Query: 452 AAAKLIDDKEANYVMERYGTKFMDEYQAIMTKKLGLPKY---NKQIISKLLNNMAVDKVD 508
A + D+ + Y EY +M +LGL + Q++ L +A D
Sbjct: 321 AEPDALRDR-----LTEYEPALQQEYARLMRARLGLESVVEGDAQLVQDWLTLLAEAGAD 375
Query: 509 YTNFFRALSNVKADPSIPEDELL---VPLKAVLLDIGKERKEAWISWVLSYIQELLSSGI 565
Y FRAL D + E L VP++ + AW+S +QE
Sbjct: 376 YHRAFRALGEWAVD----DGEWLRQEVPVEGL---------SAWLSRYHERLQEEERDAA 422
Query: 566 SDEERKALMNSVNPKYVLRNYLCQSAIDAAELGDFGEVRRLLKLMERPYDEQPGMEKYAR 625
S R+ M +VNP YVLR +L Q I+AAE GD + +L+ P+ +PGME++A
Sbjct: 423 S---RRDAMQAVNPLYVLRTHLAQQVIEAAEAGDEAPLVEFRRLLADPFTARPGMERWAA 479
Query: 626 LPPAWAYRPGVCMLSCSS 643
PP A V LSCSS
Sbjct: 480 APPPQA---SVICLSCSS 494
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,656,219,544
Number of Sequences: 23463169
Number of extensions: 464302732
Number of successful extensions: 1173675
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2383
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 1159183
Number of HSP's gapped (non-prelim): 2787
length of query: 643
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 494
effective length of database: 8,863,183,186
effective search space: 4378412493884
effective search space used: 4378412493884
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)