BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781134|ref|YP_003065547.1| hypothetical protein CLIBASIA_05180 [Candidatus Liberibacter asiaticus str. psy62] (320 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781134|ref|YP_003065547.1| hypothetical protein CLIBASIA_05180 [Candidatus Liberibacter asiaticus str. psy62] gi|254040811|gb|ACT57607.1| hypothetical protein CLIBASIA_05180 [Candidatus Liberibacter asiaticus str. psy62] Length = 320 Score = 265 bits (677), Expect = 7e-69, Method: Composition-based stats. Identities = 320/320 (100%), Positives = 320/320 (100%) Query: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL Sbjct: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 Query: 61 GKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMIL 120 GKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMIL Sbjct: 61 GKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMIL 120 Query: 121 WEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFW 180 WEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFW Sbjct: 121 WEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFW 180 Query: 181 QGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHS 240 QGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHS Sbjct: 181 QGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHS 240 Query: 241 YPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVI 300 YPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVI Sbjct: 241 YPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVI 300 Query: 301 VSSGKSINDIQINNAKLLVE 320 VSSGKSINDIQINNAKLLVE Sbjct: 301 VSSGKSINDIQINNAKLLVE 320 >gi|227822437|ref|YP_002826409.1| hypothetical protein NGR_c18920 [Sinorhizobium fredii NGR234] gi|227341438|gb|ACP25656.1| hypothetical protein NGR_c18920 [Sinorhizobium fredii NGR234] Length = 536 Score = 258 bits (658), Expect = 9e-67, Method: Composition-based stats. Identities = 135/321 (42%), Positives = 194/321 (60%), Gaps = 9/321 (2%) Query: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 M+ GGSL GFSWS ESGDGLS LCVF+ST GE+AVY G +P +S F+LK +Y IG+PL Sbjct: 216 MKKGGSLITGFSWSLESGDGLSDLCVFVSTEGEIAVYAGSDPSSASDFALKGVYQIGKPL 275 Query: 61 GKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNL--PLSWPIQEQWEQAIMVAPTGWSM 118 GK A + V DV IAT++GL M + ++ + +S PI++ W QA +GW++ Sbjct: 276 GKNAWIRVGGDVLIATSDGLTPMSQVFQRDRQALSQVSISRPIEDDWRQAANATGSGWTI 335 Query: 119 ILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGT 178 W ++N++ ++ P NS++SDKTFV+NV + RW++ NW + LFFG +G Sbjct: 336 KPWAEQNLVFVAFPDNSVVSDKTFVLNVL-SGRWATISNWRASCFETLQGGLFFGSRDGY 394 Query: 179 FWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYD 238 WQGD G+D+ F YLS F + G + A +AH+ L+A RP ++LFARA++D Sbjct: 395 CWQGDTGGTDDGLTFAASYLSQFTPAG-QFGQRTAATMAHMYLRAKTRPKVQLFARADFD 453 Query: 239 HSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGC 298 + P F ++ + + WD LWD +W V++ + F QNV A G+ LAVGC Sbjct: 454 RTTPPFD--RVTEGDIASPEWDVGLWDVARWDG---VSQALRYDFRQNVRASGDMLAVGC 508 Query: 299 VIVSSGKSINDIQINNAKLLV 319 VI S G DI+++ A L V Sbjct: 509 VITSGGAVKLDIEVDLATLQV 529 >gi|117925017|ref|YP_865634.1| hypothetical protein Mmc1_1720 [Magnetococcus sp. MC-1] gi|117608773|gb|ABK44228.1| hypothetical protein Mmc1_1720 [Magnetococcus sp. MC-1] Length = 508 Score = 251 bits (641), Expect = 9e-65, Method: Composition-based stats. Identities = 71/314 (22%), Positives = 136/314 (43%), Gaps = 13/314 (4%) Query: 3 AGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGK 62 GG L A +W+ + GDGL VFL + G+V +Y G NP+D++S+SL IY +G P+G+ Sbjct: 198 NGGYLQAMGTWTRDGGDGLDDFAVFLMSEGDVLIYQGSNPNDATSWSLVGIYRLGAPIGR 257 Query: 63 RAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPT--GWSMIL 120 R ++ +D+ + T +G + + +L ++ LS IQ AI + + GW + Sbjct: 258 RCVMRYGSDLIVITQDGYLPLSKVL-SGSRSGEALSDRIQGALSVAIRNSASRFGWQALH 316 Query: 121 WEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFW 180 + N LL + P+ S + + N W F + V + L+FG G + Sbjct: 317 YPAGNYLLFNVPQASGRYE-QHIFN-MTTGAWCRFTGQNGICWAVFEDRLYFGAQNGVLY 374 Query: 181 QGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHS 240 + D D+ + + + F + R +R + L + + L ++ + Sbjct: 375 RADYGTHDDGQDIVTDGRTSFQYLGGRSST-KRFTMMRPLLSSDANLPVSLTLDTDFGEN 433 Query: 241 YPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVI 300 + S N + WD+ WD W + + Q+ + G C++V Sbjct: 434 --PPAPAISSPNQAEGAAWDSEQWDTATWGGAMQIRG-----QWQSCLGIGYCVSVRLRT 486 Query: 301 VSSGKSINDIQINN 314 ++ +++ ++ Sbjct: 487 ATANQNVRWHALDV 500 >gi|150397022|ref|YP_001327489.1| hypothetical protein Smed_1819 [Sinorhizobium medicae WSM419] gi|150028537|gb|ABR60654.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 538 Score = 248 bits (632), Expect = 1e-63, Method: Composition-based stats. Identities = 130/321 (40%), Positives = 186/321 (57%), Gaps = 9/321 (2%) Query: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 M+ GGSL GFSWS ESGDGLS LCVFLST GE+AVY G +P +S F+LK +Y IGRPL Sbjct: 218 MKKGGSLLTGFSWSLESGDGLSDLCVFLSTEGEIAVYAGSDPSSASDFALKGVYQIGRPL 277 Query: 61 GKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNL--PLSWPIQEQWEQAIMVAPTGWSM 118 GK + D+ IAT +GL M + ++ + +S PI++ W +A TGW++ Sbjct: 278 GKNGWIRAGADILIATTDGLTPMSQVFQRDRQALSLVSVSRPIEDDWRRAANATGTGWTL 337 Query: 119 ILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGT 178 W ++N++ ++ P+N++++D TFV+NV RWS+ NW Y LFFG +G Sbjct: 338 KQWPEQNLVFVAFPENTVVADTTFVLNVL-TGRWSTISNWQALCYETLQGGLFFGSLDGY 396 Query: 179 FWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYD 238 WQGD G+D+ F YLS F + G + A +AH+ +A P ++LFARA+YD Sbjct: 397 MWQGDAGGTDDGLSFSATYLSQFSP-ATQFGQRATATMAHMYFRAKSAPKVRLFARADYD 455 Query: 239 HSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGC 298 S P F+ + + WD LWD W V++ + + F QNV A G+ + VGC Sbjct: 456 RSTPTFTT--STEGEASSSEWDVGLWDRAIWDG---VSEVQRYDFRQNVRATGDMIGVGC 510 Query: 299 VIVSSGKSINDIQINNAKLLV 319 VI S G DI+++ A + V Sbjct: 511 VITSGGDFKLDIEVDLATVQV 531 >gi|316933870|ref|YP_004108852.1| hypothetical protein Rpdx1_2528 [Rhodopseudomonas palustris DX-1] gi|315601584|gb|ADU44119.1| hypothetical protein Rpdx1_2528 [Rhodopseudomonas palustris DX-1] Length = 558 Score = 243 bits (620), Expect = 3e-62, Method: Composition-based stats. Identities = 85/323 (26%), Positives = 153/323 (47%), Gaps = 13/323 (4%) Query: 2 QAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLG 61 + GGSL G SWS E+ D LS CVF++T GEVAVY G +P +S+++ Y IGRPLG Sbjct: 239 KRGGSLLFGASWSLET-DNLSEQCVFVTTEGEVAVYQGTDPSSASTWAKVGTYRIGRPLG 297 Query: 62 KRAIVFVKNDVWIATNNGLISMKNILLQEEKTNL--PLSWPIQEQWEQAI--MVAPTGWS 117 +AI+ D+ I T+ G + + + ++ +S+PI+ W + + A W Sbjct: 298 PKAIIPAGGDLVIGTDMGFVPLSQAVQRDISALSPAAISYPIETAWNETVEATAASGAWH 357 Query: 118 MILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEG 177 +W ++M +++ P + + FV N A W + W + + + FFG +G Sbjct: 358 CEVWPTKHMAIVAPPTPVVGFPQVFVAN-ARTGAWGRYTGWKARCLGLFRDRAFFGSDQG 416 Query: 178 TFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANY 237 + ++SG+D RP+ +++ F + A + ++QA +L +A+Y Sbjct: 417 LVIEAEVSGADRGRPYTAVWVPLFETFKAPASL-KTAGLMRATVQAASEVVPRLSLQADY 475 Query: 238 DHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVG 297 + + P + + + W ++W W + +K+FQ Q+V G +A G Sbjct: 476 NVNLPTPPDDA---GAVASNTWGAAIWGVSLW---SEPSGRKVFQRWQSVGGRGYAIAPG 529 Query: 298 CVIVSSGKSINDIQINNAKLLVE 320 I S S D+ + ++ + Sbjct: 530 LQITSGKASPPDVDVIKVEITYD 552 >gi|315122528|ref|YP_004063017.1| hypothetical protein CKC_03900 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495930|gb|ADR52529.1| hypothetical protein CKC_03900 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 532 Score = 237 bits (604), Expect = 2e-60, Method: Composition-based stats. Identities = 249/320 (77%), Positives = 280/320 (87%) Query: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 MQAGGSL AGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPD SF+LKAIYHIGRPL Sbjct: 213 MQAGGSLFAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDSIDSFALKAIYHIGRPL 272 Query: 61 GKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMIL 120 GK+AI+FVKNDVWIAT NGLISMKNILLQ E NLPLS IQE+W QAIM PTGWS+ L Sbjct: 273 GKKAIIFVKNDVWIATTNGLISMKNILLQGEGANLPLSSAIQEEWNQAIMEVPTGWSLTL 332 Query: 121 WEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFW 180 WEKRNMLL+SCP++SLLS KT VMNV +NN W+S HNWFTQSYV+AN+NLFFGDYEG+FW Sbjct: 333 WEKRNMLLVSCPQHSLLSSKTLVMNVDNNNYWASLHNWFTQSYVIANDNLFFGDYEGSFW 392 Query: 181 QGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHS 240 QGDISGSD+ RPF IYLSPF LG +R+AC AH+SLQAYQRPYLKLF+R++YD S Sbjct: 393 QGDISGSDDNRPFQAIYLSPFRESYSYLGVKRKACQAHISLQAYQRPYLKLFSRSDYDKS 452 Query: 241 YPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVI 300 YP+F KET++ NP+++G+WDNS+WDE +WTDN F+ KKKLF F QNVVAYGN LAVGCVI Sbjct: 453 YPDFFKETVNANPINSGLWDNSIWDESQWTDNFFIRKKKLFNFSQNVVAYGNFLAVGCVI 512 Query: 301 VSSGKSINDIQINNAKLLVE 320 VSSGK INDIQINN+KLLVE Sbjct: 513 VSSGKFINDIQINNSKLLVE 532 >gi|13470677|ref|NP_102246.1| hypothetical protein mll0452 [Mesorhizobium loti MAFF303099] gi|14021419|dbj|BAB48032.1| mll0452 [Mesorhizobium loti MAFF303099] Length = 528 Score = 225 bits (574), Expect = 6e-57, Method: Composition-based stats. Identities = 76/322 (23%), Positives = 138/322 (42%), Gaps = 10/322 (3%) Query: 2 QAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLG 61 Q GGS+ G +WS E+G G+ CV +S GEVAV+ G NP D++++SL Y + LG Sbjct: 208 QDGGSVLCGGTWSLETGGGIDAKCVIISDRGEVAVFQGTNPADATAWSLVGRYAMSPLLG 267 Query: 62 KRAIVFVKNDVWIATNNGLISMKNILLQEEKTNL--PLSWPIQEQWEQAIMVAPTG-WSM 118 RA++ D+ + +G++ + + ++ ++ I+ W + + W + Sbjct: 268 PRALMQAGGDLLMGMADGIVPISQAIQKDAAALALISVTRAIEPDWRKEVAARTVLPWEL 327 Query: 119 ILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGT 178 + W + NM +++ P + K FV+N W+ + W V+ +FG +G Sbjct: 328 LKWPQFNMGIVALPASPGQDRKVFVVN-LQTGAWADYTGWDACCLVLHAGWAYFGTSDGK 386 Query: 179 FWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYD 238 Q ++ G D+ P++ Y+ F + + +A A ++ KL NY Sbjct: 387 IMQCEVGGIDDGMPYVCTYVGLFDHMKAVART-KIIHMARTVFIASRKFVPKLSVSTNYQ 445 Query: 239 HSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGC 298 S P + +D WD+ LWD+ W K + +V G +A Sbjct: 446 VSLPSPPPSAPDDVTLD--TWDSGLWDKAVWDAGWV---KAVTTKWTSVGQSGFIIAPQV 500 Query: 299 VIVSSGKSINDIQINNAKLLVE 320 + S D ++ + E Sbjct: 501 QVTSGVTPTPDAELVEIDMTYE 522 >gi|152982947|ref|YP_001353888.1| hypothetical protein mma_2198 [Janthinobacterium sp. Marseille] gi|151283024|gb|ABR91434.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille] Length = 519 Score = 219 bits (557), Expect = 6e-55, Method: Composition-based stats. Identities = 71/318 (22%), Positives = 127/318 (39%), Gaps = 16/318 (5%) Query: 3 AGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGK 62 GG L + +WS ++G G+ VF+++ G+VAVY G +P +++++L IY IG P+G+ Sbjct: 208 KGGYLMSMGNWSLDAGYGMDDYAVFVTSEGQVAVYKGTDPASATTWALIGIYDIGSPIGR 267 Query: 63 RAIVFVKNDVWIATNNGLISMKNILLQEEKTNLP-LSWPIQEQWEQAI--MVAPTGWSMI 119 R ++ D+ + +GL + L+ ++ L+ IQ A GW Sbjct: 268 RCLMKYAGDLTLICKDGLAPLSKSLMSSRVNSMEMLTDKIQHVVSDYTTLYAANFGWETA 327 Query: 120 LWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTF 179 L+ + NMLL++ P + VMN + WS F W + + + LFFG G Sbjct: 328 LFPQENMLLVNVPTSPTT-SYQLVMNTI-SGAWSRFIGWNASCFELHGDELFFGSSGGVC 385 Query: 180 WQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDH 239 D +DN F+ R ++ + + + L ++D Sbjct: 386 KAWDTQ-ADNGTNIDFEAQQSFNYFGARSQL-KQVKMLRPLISTDGSSSVLLGVNTDFD- 442 Query: 240 SYPEFSKETISNNPMDNGIWDNSLWDE-VKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGC 298 + +P WD S+WD W+ +L + K Q G C++ Sbjct: 443 --TTAPTGIPTFSPTAGAAWDISVWDGMDTWSGDLAIKK-----DWQTAFGLGYCISAHI 495 Query: 299 VIVSSGKSINDIQINNAK 316 + + + Sbjct: 496 KGAAKNSKLRWAATDYLV 513 >gi|110632595|ref|YP_672803.1| hypothetical protein Meso_0234 [Mesorhizobium sp. BNC1] gi|110283579|gb|ABG61638.1| hypothetical protein Meso_0234 [Chelativorans sp. BNC1] Length = 632 Score = 219 bits (557), Expect = 6e-55, Method: Composition-based stats. Identities = 91/322 (28%), Positives = 137/322 (42%), Gaps = 14/322 (4%) Query: 3 AGGSLPAGFSWSTESGD--GLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 GG+L G SWS S GLS C F+ST GEV VY G +PD S +S IY IG+PL Sbjct: 315 QGGALLVGQSWSMSSSGQGGLSDQCTFVSTEGEVVVYQGSHPDVSGDWSKVGIYRIGKPL 374 Query: 61 GKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNL--PLSWPIQEQWEQAIMVAPTGWSM 118 G+RA + D+ AT G +++ + + +S+ I++ W + + W Sbjct: 375 GRRAFIRAGGDLVFATTIGFVALSTAIQVDVAALSPKAVSFAIEDVWNETVNTRSGDWVC 434 Query: 119 ILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGT 178 LW M+ I+ P D F ++ A W+ F NW V L+FG E Sbjct: 435 CLWPAGQMVAIAPP--VGDDDMLFWVSNARTGAWAPFTNWAASCMEVFEGRLYFGMGE-F 491 Query: 179 FWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYD 238 Q + G+D + PF G Y+ F + A +A + + KL R ++D Sbjct: 492 VMQAMVGGTDASEPFAGTYMPLFSDNGKPS-AHKYAKMARAEIISATVLEEKLSCRFDFD 550 Query: 239 HSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGC 298 + P WDN++WDE W+ + + + +V G ++ Sbjct: 551 TNLP---PAPNVAPVPVGNEWDNAIWDESLWSS---PRDQIVSKRRHSVSGDGYRISPVL 604 Query: 299 VIVSSGKSINDIQINNAKLLVE 320 I S D QI + LL E Sbjct: 605 QISSGAAVPLDAQIVSLDLLYE 626 >gi|126444876|ref|YP_001063338.1| hypothetical protein BURPS668_A2344 [Burkholderia pseudomallei 668] gi|126224367|gb|ABN87872.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 590 Score = 214 bits (545), Expect = 1e-53, Method: Composition-based stats. Identities = 72/313 (23%), Positives = 115/313 (36%), Gaps = 14/313 (4%) Query: 4 GGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKR 63 GG L +W+ ++ GL+ VF+++ GEV VY G +P SSSF + A + IG P+G+R Sbjct: 278 GGFLMGMATWNIDNSAGLNPYAVFVTSRGEVIVYQGSDPSQSSSFGISARFRIGPPVGRR 337 Query: 64 AIVFVKNDVWIATNNGLISMKN---ILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMIL 120 +D+ +GL + Q+ K L Q + GW L Sbjct: 338 FYEKYGSDIVFIGADGLTPLSQALLTDRQQRKITLTEKISPTVQADYFAYQGNFGWQCTL 397 Query: 121 WEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFW 180 + + N L+++ P +VMN W F W + N NL+FG Sbjct: 398 FPEGNKLVVNVPDQEDQLSHQYVMNTI-TGAWCKFTGWNAFCFTYFNGNLYFGGA-AFVA 455 Query: 181 QGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHS 240 Q D++ SD F+ R G ++ + +L ++ S Sbjct: 456 QADVTNSDAGMAISSDIEPAFNYFGMR-GQEKLFKMMRPVFITNAPLAPQLDIALDF-SS 513 Query: 241 YPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVI 300 P S T S + WD + WDEV W D V +++ G Sbjct: 514 APPTSTPTFSQAHFN--RWDITPWDEVPWNDGHIVQT-----DWESIDGLGYAATFRMRC 566 Query: 301 VSSGKSINDIQIN 313 + + Sbjct: 567 QTIDVEFALESFD 579 >gi|167907341|ref|ZP_02494546.1| hypothetical protein BpseN_34245 [Burkholderia pseudomallei NCTC 13177] Length = 590 Score = 214 bits (545), Expect = 1e-53, Method: Composition-based stats. Identities = 72/313 (23%), Positives = 115/313 (36%), Gaps = 14/313 (4%) Query: 4 GGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKR 63 GG L +W+ ++ GL+ VF+++ GEV VY G +P SSSF + A + IG P+G+R Sbjct: 278 GGFLMGMATWNIDNSAGLNPYAVFVTSRGEVIVYQGSDPSQSSSFGISARFRIGPPVGRR 337 Query: 64 AIVFVKNDVWIATNNGLISMKN---ILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMIL 120 +D+ +GL + Q+ K L Q + GW L Sbjct: 338 FYEKYGSDIVFIGADGLTPLSQALLTDRQQRKITLTEKISPTVQADYFAYQGNFGWQCTL 397 Query: 121 WEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFW 180 + + N L+++ P +VMN W F W + N NL+FG Sbjct: 398 FPEGNKLVVNVPDQEDQLSHQYVMNTI-TGAWCKFTGWNAFCFTYFNGNLYFGGA-AFVA 455 Query: 181 QGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHS 240 Q D++ SD F+ R G ++ + +L ++ S Sbjct: 456 QADVTNSDAGMAISSDIEPAFNYFGMR-GQEKLFKMMRPVFITNAPLAPQLDIALDF-SS 513 Query: 241 YPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVI 300 P S T S + WD + WDEV W D V +++ G Sbjct: 514 APPTSTPTFSQAHFN--RWDITPWDEVPWNDGHIVQT-----DWESIDGLGYAATFRMRC 566 Query: 301 VSSGKSINDIQIN 313 + + Sbjct: 567 QTIDVEFALESFD 579 >gi|254720058|ref|ZP_05181869.1| hypothetical protein Bru83_11056 [Brucella sp. 83/13] gi|265985065|ref|ZP_06097800.1| conserved hypothetical protein [Brucella sp. 83/13] gi|264663657|gb|EEZ33918.1| conserved hypothetical protein [Brucella sp. 83/13] Length = 481 Score = 203 bits (515), Expect = 4e-50, Method: Composition-based stats. Identities = 73/275 (26%), Positives = 129/275 (46%), Gaps = 11/275 (4%) Query: 3 AGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGK 62 GG L G +WST+SG G+ +CVF++ +GEVAV+ G NPD ++S Y IG+PLGK Sbjct: 213 KGGVLLFGGTWSTDSGSGMDDMCVFVTDMGEVAVFQGTNPDTIDTWSKVGTYVIGKPLGK 272 Query: 63 RAIVFVKNDVWIATNNGLISMKNILLQEEKTN--LPLSWPIQEQWEQAIMVAPT---GWS 117 A D+ I T++G++S+ I+ ++ +++PI+ W + +S Sbjct: 273 NAFFKTGGDIAIITDDGIVSISQIMQKDRAGLASAAITFPIETLWRDVVFSRAFAGYDFS 332 Query: 118 MILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEG 177 I+W ++LL+ P + + + V N W + W + VV ++ L+FG +G Sbjct: 333 AIVWPTESLLLVVVPSFAGGTKFSLVANT-KTGSWCRYTGWDARCAVVFDDKLYFGTSDG 391 Query: 178 TFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANY 237 QG+ GS+ P+ L F + + + A A +++ + +FA ++Y Sbjct: 392 RVIQGETGGSECGEPYAATILPRFDNMGSPVS--KSALHARCVMRSNNKFEPIMFANSDY 449 Query: 238 DHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDN 272 P + +W ++W E W Sbjct: 450 KVDLPNAGISY---GDASSNVWGGAIWGESTWGAG 481 >gi|306839390|ref|ZP_07472204.1| Hypothetical protein BROD_2246 [Brucella sp. NF 2653] gi|306405513|gb|EFM61778.1| Hypothetical protein BROD_2246 [Brucella sp. NF 2653] Length = 481 Score = 202 bits (514), Expect = 5e-50, Method: Composition-based stats. Identities = 73/275 (26%), Positives = 129/275 (46%), Gaps = 11/275 (4%) Query: 3 AGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGK 62 GG L G +WST+SG G+ +CVF++ +GEVAV+ G NPD ++S Y IG+PLGK Sbjct: 213 KGGVLLFGGTWSTDSGSGMDDMCVFVTDMGEVAVFQGTNPDTIDTWSKVGTYVIGKPLGK 272 Query: 63 RAIVFVKNDVWIATNNGLISMKNILLQEEKTN--LPLSWPIQEQWEQAIMVAPT---GWS 117 A D+ I T++G++S+ I+ ++ +++PI+ W + +S Sbjct: 273 NAFFKTGGDIAIITDDGIVSISQIMQKDRAGLASAAITFPIETLWRDVVFSRAFAGYDFS 332 Query: 118 MILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEG 177 I+W ++LL+ P + + + V N W + W + VV ++ L+FG +G Sbjct: 333 AIVWPTESLLLVVVPSFAGGTKFSLVANT-KTGSWCRYTGWDARCAVVFDDKLYFGTSDG 391 Query: 178 TFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANY 237 QG+ GS+ P+ L F + + + A A +++ + +FA ++Y Sbjct: 392 RVIQGETGGSECGEPYAATILPRFDNMGSPVS--KSALHARCVMRSNNKFEPIMFANSDY 449 Query: 238 DHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDN 272 P + +W ++W E W Sbjct: 450 KVDLPNAGISY---GDASSNVWGGAIWGESTWGAG 481 >gi|327191475|gb|EGE58495.1| hypothetical protein RHECNPAF_300005 [Rhizobium etli CNPAF512] Length = 607 Score = 196 bits (498), Expect = 3e-48, Method: Composition-based stats. Identities = 67/328 (20%), Positives = 132/328 (40%), Gaps = 20/328 (6%) Query: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 M+ GG+L A W+ GL V +S+ GEV V+ G +P +S++ L + +GRPL Sbjct: 287 MKYGGTLVAINVWTIPVSTGLQQCLVLISSEGEVIVFQGSDPSSASNWGLIGTFKLGRPL 346 Query: 61 G-KRAIVFVKNDVWIATNNGLISMKNILLQEE--KTNLPLSWPIQEQWEQAIMVAPT--- 114 G R ++ V D+ I T +G++ + + + + ++ I W + + T Sbjct: 347 GSDRCLLSVGADLAIMTTDGIVPITKAVQLDRGATSLGAITSRIGPTWRETVAATGTTSQ 406 Query: 115 GWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGD 174 W + + R M +++ P + +VMN W F + + LFFG Sbjct: 407 EWQLASFPARQMAIVNLPSSFGPY--QYVMNT-ETGAWCRFVGMPASCWATWQDRLFFGA 463 Query: 175 YEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFAR 234 +GT ++ ++ +DN + + + L + + + ++ Q + Sbjct: 464 ADGTLYEAEVGANDNGAAIDALMVGAWSRFGDGLST-KLSKLIGVTAQIGVSTLMYGGIS 522 Query: 235 ANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCL 294 +Y P ++ N+ W ++W K+ V K + G+ L Sbjct: 523 VDYQTKVPTALLSSVENS--AAAKWGTAVWGVSKFPGISLVRK------FASAGGAGSAL 574 Query: 295 AVGCVIVSSGK--SINDIQINNAKLLVE 320 A + SG S+++ + +L E Sbjct: 575 APTIRALISGSSGSVSEAAVVGGSVLYE 602 >gi|209548341|ref|YP_002280258.1| hypothetical protein Rleg2_0736 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534097|gb|ACI54032.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 607 Score = 196 bits (498), Expect = 3e-48, Method: Composition-based stats. Identities = 67/328 (20%), Positives = 131/328 (39%), Gaps = 20/328 (6%) Query: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 M+ GG+L A W+ GL V +S+ GEV V+ G +P +++ L + +GRPL Sbjct: 287 MKYGGTLVAINVWTIPVSTGLQQCLVLISSEGEVIVFQGSDPSSIANWGLIGTFKLGRPL 346 Query: 61 G-KRAIVFVKNDVWIATNNGLISMKNILLQEE--KTNLPLSWPIQEQWEQAIMVAPT--- 114 G R ++ V D+ I T +G++ + + + + ++ I W + + T Sbjct: 347 GSDRCLLSVGADLAIMTTDGIVPITKAVQLDRGATSLGAITARIGPTWRETVAATGTTSQ 406 Query: 115 GWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGD 174 W + + R M +++ P + +VMN W F + + LFFG Sbjct: 407 EWQLSSFPARQMAIVNLPSSFGPY--QYVMNT-ETGAWCRFVGMPASCWASWQDRLFFGA 463 Query: 175 YEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFAR 234 +GT ++ ++ +DN + + + L + + + ++ Q + Sbjct: 464 GDGTVYEAEVGANDNGVAIDALMVGAWSRYGDGLST-KLSKLIGVTAQIGVSSLMYGGIS 522 Query: 235 ANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCL 294 +Y P ++ NN W ++W K+ V K + G+ L Sbjct: 523 VDYQTKIPTALLSSVENN--AAAKWGTAVWGVSKFPGVSLVRK------FASAGGAGSAL 574 Query: 295 AVGCVIVSSGK--SINDIQINNAKLLVE 320 A + SG S+++ + +L E Sbjct: 575 APTIRALISGSSGSVSEAAVVGGSVLYE 602 >gi|116253670|ref|YP_769508.1| hypothetical protein RL3930 [Rhizobium leguminosarum bv. viciae 3841] gi|115258318|emb|CAK09420.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 607 Score = 196 bits (497), Expect = 5e-48, Method: Composition-based stats. Identities = 68/328 (20%), Positives = 133/328 (40%), Gaps = 20/328 (6%) Query: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 M+ GG+L A W+ GL V +S+ GEV VY G +P S+++ L + +GRPL Sbjct: 287 MKYGGTLIAIGVWTIPVSTGLQQCLVLMSSEGEVIVYQGSDPSSSTNWGLIGTFKLGRPL 346 Query: 61 GK-RAIVFVKNDVWIATNNGLISMKNILLQEE--KTNLPLSWPIQEQWEQAIM---VAPT 114 G R + V D+ I T +G++ + + + + ++ I W + + Sbjct: 347 GTERCFLSVGADLAIMTTDGIVPITKAVQLDRGATSLGAITAKIGPTWRETVATAGTTSD 406 Query: 115 GWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGD 174 W + + R M +++ P + +VMN W F + + LFFG Sbjct: 407 EWQLASFPARQMAIVNLPSSLGPY--QYVMNT-ETGAWCRFVGLAASCWGNWQDRLFFGA 463 Query: 175 YEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFAR 234 +GT ++ ++ +DN + + ++ + + + + ++ Q + Sbjct: 464 ADGTVYEAEVGANDNGAAIDALMVGAWNRYGEDM-ASKFSKLIGVTAQIGVSTLMYAGMS 522 Query: 235 ANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCL 294 +Y P T+ NN + W +++W K+ V K + G+ L Sbjct: 523 FDYQTKIPTALLSTVENN--ASAKWGSAIWGVSKFPGLSIVRK------FASAGGAGSAL 574 Query: 295 AVGCVIVSSGK--SINDIQINNAKLLVE 320 A + SG S+++ + +L E Sbjct: 575 APTIRALISGASGSVSEAAVIGGSVLYE 602 >gi|15320622|ref|NP_203466.1| hypothetical protein Mx8p52 [Myxococcus phage Mx8] gi|15281732|gb|AAK94387.1|AF396866_52 p52 [Myxococcus phage Mx8] Length = 536 Score = 184 bits (467), Expect = 2e-44, Method: Composition-based stats. Identities = 72/331 (21%), Positives = 130/331 (39%), Gaps = 30/331 (9%) Query: 2 QAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGR--P 59 +AGG L ++W+ + G G+ V +S G+VA++ G +P S++F L+ ++ +G P Sbjct: 217 RAGGHLVGLWNWTYDGGAGMDDSLVAISGGGDVAIWQGTDPASSATFGLRGVWSLGGSPP 276 Query: 60 LGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPT--GWS 117 G+R DV + + G+ + ++ E + ++ + + ++ + GWS Sbjct: 277 AGRRIATDYGGDVLVLSRLGVRPLSRLVAGEVDKDTYVTAKVSNLFSALMLTRASLPGWS 336 Query: 118 MILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEG 177 M L + N LL++ P + VM W + + S V L+FG +G Sbjct: 337 MQLHPEDNALLVTVPTYPGQPTEQLVMA-LAGRAWFRYRDLPIYSSAVWGGKLYFGTVDG 395 Query: 178 TFWQGDISGSD--------NARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQ-RPY 228 D D P LS F + Q++ + +L + P Sbjct: 396 RVCVND-GYVDGVLLSEPSAFTPVQWSLLSAFTNLG--SARQKQVQLLRPTLLSESATPS 452 Query: 229 LKLFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVV 288 ++ AR YD + +S +G WD S WD W+ +Q Q V Sbjct: 453 YEVQARYRYDFAE----LAPVSAMGGGSGTWDGSTWDVDVWSG--------EYQASQQVR 500 Query: 289 AYGNCLAVGCVIVSSGKSINDIQINNAKLLV 319 G + V I G ++ + +L Sbjct: 501 G-GTGVGVDLAIAIRGTAVARTVLVGIDILF 530 >gi|319783505|ref|YP_004142981.1| hypothetical protein Mesci_3814 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169393|gb|ADV12931.1| hypothetical protein Mesci_3814 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 617 Score = 180 bits (457), Expect = 2e-43, Method: Composition-based stats. Identities = 72/324 (22%), Positives = 130/324 (40%), Gaps = 10/324 (3%) Query: 2 QAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDS--SSFSLKAIYHIGRP 59 Q GGSL G +WS ++GDG+ CVF++T GE A++ G NP + + ++L Y + P Sbjct: 293 QRGGSLLFGATWSLDAGDGIDDKCVFVTTEGEAAIFEGSNPAGTTQAEWNLVGRYDLTDP 352 Query: 60 LGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLS--WPIQEQWEQAIMVAPTG-W 116 +GKR + D+ +AT GL+ + + ++ + I+ W++ + W Sbjct: 353 MGKRGTMRAGGDLIVATKEGLVPISAAINKDAAALSLAAISRNIEPDWKREAARRLSLPW 412 Query: 117 SMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYE 176 +I W N +++ P + + + W F W T+ + + L+FG + Sbjct: 413 EIIKWPDMNYAIVALPVTADGQEAWSFVVNLETGAWCKFVGWATRCIELHDSRLYFGTND 472 Query: 177 GTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARAN 236 GT ++ +I+G+DN P Y+ + LG + A + + K+ N Sbjct: 473 GTVFEAEIAGNDNGLPIYYTYVGNPDHMKS-LGGLKTVHQARPTFLSATPYNPKISFSVN 531 Query: 237 YDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAV 296 Y S P + +WD+ WD KW ++ G + Sbjct: 532 YSVSLPPAPPAA---DGGTADLWDSGQWDVAKWD-QAQPVASVGGGQWISIGKTGYVMQP 587 Query: 297 GCVIVSSGKSINDIQINNAKLLVE 320 I S + + E Sbjct: 588 QLQITSFLNQRPVTEFVQLDVTFE 611 >gi|218680122|ref|ZP_03528019.1| hypothetical protein RetlC8_15010 [Rhizobium etli CIAT 894] Length = 305 Score = 179 bits (455), Expect = 4e-43, Method: Composition-based stats. Identities = 62/312 (19%), Positives = 125/312 (40%), Gaps = 20/312 (6%) Query: 17 SGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLG-KRAIVFVKNDVWIA 75 GL V +S+ GEV V+ G +P +++++SL + +GRPLG R ++ V D+ I Sbjct: 1 VSTGLQQCLVLISSEGEVIVFQGSDPSNAANWSLIGTFKLGRPLGLDRCLLSVGADLAIM 60 Query: 76 TNNGLISMKNILLQEE--KTNLPLSWPIQEQWEQAIMVAPT---GWSMILWEKRNMLLIS 130 T +G++ + + + + ++ I W + + T W + + R M +++ Sbjct: 61 TTDGIVPITKAVQLDRGATSLGAITARIGPTWRETVAATGTTSGEWQLSSFPARQMAIVN 120 Query: 131 CPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFWQGDISGSDNA 190 P + +VMN W F + + LFFG +GT ++ ++ +DN Sbjct: 121 LPSSFGPY--QYVMNT-ETGAWCRFVGMPASCWASWQDRLFFGAADGTVYEAEVGANDNG 177 Query: 191 RPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKETIS 250 + + + L + + + ++ Q + +Y P ++ Sbjct: 178 VAIDALMVGAWSRYGDGLST-KLSKLIGVTAQIGVSTVMYAGISVDYQTKIPTALLSSVE 236 Query: 251 NNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVIVSSGK--SIN 308 NN W ++W K+ V K + G+ LA + SG S++ Sbjct: 237 NN--AAAKWGTAIWGVSKFPGISLVRK------FASAGGAGSALAPTIRALISGSSGSVS 288 Query: 309 DIQINNAKLLVE 320 + + +L E Sbjct: 289 EAAVVGGSVLYE 300 >gi|145642362|ref|ZP_01797923.1| Haemophilus-specific protein, uncharacterized [Haemophilus influenzae R3021] gi|145272931|gb|EDK12816.1| Haemophilus-specific protein, uncharacterized [Haemophilus influenzae 22.4-21] Length = 603 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. Identities = 44/330 (13%), Positives = 93/330 (28%), Gaps = 29/330 (8%) Query: 2 QAGGSLPAG-FSWSTESGD---GLSTLCVFLSTL-GEVAVY------GGDNPDDSSSFSL 50 Q G SL WS G GL LS+ G V + G + L Sbjct: 283 QLGHSLVGRPNQWSVLLGSEQLGLGDEITALSSTVGGVLIIGCRNKIAGLYGSGRDDWVL 342 Query: 51 KAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIM 110 K + IG + + + I + NG+ + + + + +++ Sbjct: 343 KEVSSIG--IVPETLQTTFVPLAI-SKNGITRIDQTEQFGDFRLSEVDANRKLGFDKQHY 399 Query: 111 VAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENL 170 + K N + + + + + + ++ + E + Sbjct: 400 NIAYTST---KPKSNQIRFYSREGR-HICMMLQADGSTRSTFFTYPEILCGVWQS-PEQV 454 Query: 171 FFGDYEGTFWQG-DISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYL 229 + +G ++ D S + +P I F+ + A L + L Sbjct: 455 YLAFDDGKVYRQSDKCYSFSGKPIDWIIKMAFNHCGSPT-LIKSWHSAELQATTEGKSKL 513 Query: 230 KLFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVA 289 +Y+ +Y + G W++S W++ W + + Sbjct: 514 SYRFDLDYNSNYHASTLSKDLQITGGGGRWNDSFWNDFLW-SAEDYSTPTFYL------- 565 Query: 290 YGNCLAVGCVIVSSGKSINDIQINNAKLLV 319 G + S +I+ L Sbjct: 566 SGYSRNIALSFSGSSIYSPQFEISGLILNY 595 >gi|319776218|ref|YP_004138706.1| hypothetical protein HICON_18210 [Haemophilus influenzae F3047] gi|317450809|emb|CBY87031.1| Conserved hypothetical protein [Haemophilus influenzae F3047] Length = 661 Score = 137 bits (344), Expect = 3e-30, Method: Composition-based stats. Identities = 39/318 (12%), Positives = 89/318 (27%), Gaps = 28/318 (8%) Query: 13 WSTESGD---GLSTLCVFLSTL-GEVAVY------GGDNPDDSSSFSLKAIYHIGRPLGK 62 WS G GL LS+ G V + G + LK + IG + Sbjct: 353 WSVLLGSEQLGLGDEITALSSTVGGVLIIGCRNKIAGLYGSGRDDWVLKEVSSIG--IVP 410 Query: 63 RAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWE 122 + + I + NG+ + + + + +++ + Sbjct: 411 ETLQTTFVPLAI-SKNGITRIDQTEQFGDFRLSEVDANRKLGFDKQHYNITYTST---KP 466 Query: 123 KRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFWQG 182 K N + + + + + + ++ + E ++ +G ++ Sbjct: 467 KSNQIRFYSKEGR-HICMMLQADGSTRSTFFTYPEILRGVWQS-PEQVYLAFDDGKVYRQ 524 Query: 183 -DISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHSY 241 D S + +P I F+ + + L + L +Y+ +Y Sbjct: 525 SDKCYSFSGKPIDWIIKMAFNHCGSPT-LIKSWHSSELQATTEGKSKLSYRFDLDYNSNY 583 Query: 242 PEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVIV 301 + G W++S W++ W + + G + Sbjct: 584 HASTLSKDLQIAGGGGRWNDSFWNDFLW-SAEDYSTPTFYL-------SGYSRNIALSFS 635 Query: 302 SSGKSINDIQINNAKLLV 319 S +I+ L Sbjct: 636 GSSIYSPQFEISGLILNY 653 >gi|218674218|ref|ZP_03523887.1| hypothetical protein RetlG_23247 [Rhizobium etli GR56] Length = 457 Score = 131 bits (329), Expect = 1e-28, Method: Composition-based stats. Identities = 40/174 (22%), Positives = 77/174 (44%), Gaps = 9/174 (5%) Query: 1 MQAGGSLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPL 60 M+ GG+L A W+ GL V +S+ GEV V+ G +P ++++ L + +GRPL Sbjct: 287 MKYGGTLVAINVWTIPVATGLQQCLVLISSEGEVIVFQGSDPSSAANWGLIGTFKLGRPL 346 Query: 61 G-KRAIVFVKNDVWIATNNGLISMKNILLQEE--KTNLPLSWPIQEQWEQAIMVAPT--- 114 G R ++ V D+ I T +G++ + + + + ++ I W + + + T Sbjct: 347 GSDRCLLSVGADLAIMTTDGIVPITKAVQLDRGATSLGAITARIGPTWRETVAASGTSSQ 406 Query: 115 GWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANE 168 W + + R M +++ P + +VMN W F + + Sbjct: 407 EWQLSSFPARQMAIVNLPSSYGPY--QYVMNT-ETGAWCRFVGMPASCWATWQD 457 >gi|170719091|ref|YP_001784242.1| hypothetical protein HSM_0910 [Haemophilus somnus 2336] gi|168827220|gb|ACA32591.1| Haemophilus-specific protein, uncharacterized [Haemophilus somnus 2336] Length = 659 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 40/329 (12%), Positives = 91/329 (27%), Gaps = 27/329 (8%) Query: 2 QAGGSLPAGFS-WSTESGD---GLSTLCVFLSTL-GEVAVY------GGDNPDDSSSFSL 50 Q G SL + W+ G G+ LS+ G V + + L Sbjct: 339 QFGHSLVGKPTHWAVLLGSEHFGVGDEITALSSTTGGVLLIGCRHKTTALYGSTRDDWVL 398 Query: 51 KAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIM 110 K I +G + + + I + +G+I + + + ++ Sbjct: 399 KDIAKVG--IKSGTLQATFIPIAI-SRHGIIRVDATEQFGDFKLSETDSGRKLGFKPIEN 455 Query: 111 VAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENL 170 + K N + +N+ + + + + ++ + + Sbjct: 456 NIVFSST---KAKANQVRFYS-ENAFHICMMLQPDGSTKSTYFTYPEKLKGVWQSHHNTF 511 Query: 171 FFGDYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLK 230 D + Q D S +P F+ + + A L A ++ Sbjct: 512 LAFDDGKVYRQSDDCFSFAGKPIEWTVKMAFNHCGSPVHI-KSWKSAELQATAQGVLSVQ 570 Query: 231 LFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAY 290 +Y+ + + + G W+ S W++ W + L+ Sbjct: 571 YRFDLDYNADFHAVNLAQALSVIGYGGRWNESYWNDFLW-STQEYSTPVLYL-------S 622 Query: 291 GNCLAVGCVIVSSGKSINDIQINNAKLLV 319 G + S +++ L Sbjct: 623 GYSRNLSLSFAGSDLYAPQFELSGLVLNY 651 >gi|113461516|ref|YP_719585.1| hypothetical protein HS_1373 [Haemophilus somnus 129PT] gi|112823559|gb|ABI25648.1| hemophilus-specific protein, uncharacterized [Haemophilus somnus 129PT] Length = 659 Score = 112 bits (280), Expect = 7e-23, Method: Composition-based stats. Identities = 40/329 (12%), Positives = 92/329 (27%), Gaps = 27/329 (8%) Query: 2 QAGGSLPAGFS-WSTESGD---GLSTLCVFLSTL-GEVAVY------GGDNPDDSSSFSL 50 Q G SL + W+ G G+ LS+ G V + + L Sbjct: 339 QFGHSLVGKPTHWAVLLGSEHFGVGDEITALSSTTGGVLLIGCRHKTTALYGSTRDDWVL 398 Query: 51 KAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIM 110 K I +G + + + I + +G+I + + + ++ Sbjct: 399 KDIAKVG--IKSGTLQATFIPIAI-SRHGIIRVDATEQFGDFKLSETDSGRKLGFKPIEN 455 Query: 111 VAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENL 170 + K N + +N+ + + + + ++ + + Sbjct: 456 NIVFSST---KAKANQVRFYS-ENAFHICMMLQPDGSTKSTYFTYPEKLKGVWQSHHNTF 511 Query: 171 FFGDYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLK 230 D + Q D S +P F+ + + A L A ++ Sbjct: 512 LAFDDGKVYRQSDDCFSFAGKPIEWTVKMAFNHCGSPVHI-KSWKSAELQATAQGVLSVQ 570 Query: 231 LFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAY 290 + +Y+ + + + G W+ S W++ W + L+ Sbjct: 571 YRSDLDYNADFHAVNLARELSVIGHGGRWNESYWNDFLW-STQEYSTPVLYL-------S 622 Query: 291 GNCLAVGCVIVSSGKSINDIQINNAKLLV 319 G + S +++ L Sbjct: 623 GYSRNLSLSFAGSDLYAPQFELSGLVLNY 651 >gi|228937959|ref|ZP_04100586.1| hypothetical protein bthur0008_6350 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228970839|ref|ZP_04131479.1| hypothetical protein bthur0003_6260 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228977413|ref|ZP_04137808.1| hypothetical protein bthur0002_6280 [Bacillus thuringiensis Bt407] gi|228782390|gb|EEM30573.1| hypothetical protein bthur0002_6280 [Bacillus thuringiensis Bt407] gi|228788964|gb|EEM36903.1| hypothetical protein bthur0003_6260 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228821750|gb|EEM67751.1| hypothetical protein bthur0008_6350 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|326938437|gb|AEA14333.1| hypothetical protein CT43_CH0641 [Bacillus thuringiensis serovar chinensis CT-43] Length = 502 Score = 109 bits (272), Expect = 7e-22, Method: Composition-based stats. Identities = 30/312 (9%), Positives = 92/312 (29%), Gaps = 34/312 (10%) Query: 16 ESGDGLSTLCV-FLSTLGEVAVYG-----GDNPDDSSSFSLKAIYHIGRPLGKRAIVFVK 69 + + V ++ + ++ + + L I + +I V Sbjct: 202 DVASNENDEIVGLVTFRNSLIIFCRHSIWALYGKTTYDYELVKINTPTGCMAPESIQVVG 261 Query: 70 NDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLI 129 N ++ ++ + + + ++ I+ + + + L Sbjct: 262 NQIFYLSDTHVYGLFAND-FNMVSAQIITKQIESTMRAIPLTEKNKAVAGYF-EGKYYLS 319 Query: 130 SCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFG-DYEGTFWQGDISGSD 188 +L+ D+ V W+ + N +V + N +FG + + + D Sbjct: 320 FPNGKTLVYDELLVC-------WTVYSNIKADVFVNYDGNFYFGSNKNAYVFHPE--YHD 370 Query: 189 NARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKET 248 + +P + + ++ + L + Q +L + +++ + ++ Sbjct: 371 DGKPIPFRMETKYFDFGLMTQDKKI---HRIWLHSNQPNGYRLGVKLDFETKQVDGTRP- 426 Query: 249 ISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVIVSSGKSIN 308 N + WD ++WD+ + + + +I+ I Sbjct: 427 ---NAANVSNWDEAIWDQNTFDRIEMYINRLRVNNRTKKIG---------MIIEDVTHIR 474 Query: 309 DIQINNAKLLVE 320 + + E Sbjct: 475 PFVVYGIGVQYE 486 >gi|75762343|ref|ZP_00742221.1| hypothetical protein RBTH_08417 [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|74490168|gb|EAO53506.1| hypothetical protein RBTH_08417 [Bacillus thuringiensis serovar israelensis ATCC 35646] Length = 491 Score = 107 bits (267), Expect = 2e-21, Method: Composition-based stats. Identities = 30/312 (9%), Positives = 92/312 (29%), Gaps = 34/312 (10%) Query: 16 ESGDGLSTLCV-FLSTLGEVAVYG-----GDNPDDSSSFSLKAIYHIGRPLGKRAIVFVK 69 + + V ++ + ++ + + L I + +I V Sbjct: 191 DVASNENDEIVGLVTFRNSLIIFCRYSIWALYGKTTYDYELVKINTPTGCMAPESIKVVG 250 Query: 70 NDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLI 129 N ++ ++ + + + ++ I+ + + + + L Sbjct: 251 NQIFYLSDTHVYGLFAND-FNMVSAQIITKQIESTMRAIPLTEKSKAVAGYF-EGKYYLS 308 Query: 130 SCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFG-DYEGTFWQGDISGSD 188 +L+ D+ V W+ + N +V + N +FG + + + D Sbjct: 309 FPNGKTLVYDELLVC-------WTVYSNIKADVFVNYDGNFYFGSNKNAYVFHPE--YHD 359 Query: 189 NARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKET 248 + +P + + ++ + L + Q +L + +++ ++ Sbjct: 360 DGKPIPFRMETKYFDFGLMTQDKKI---HRIWLHSNQPNGYRLGVKLDFETKQVNGTRP- 415 Query: 249 ISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVIVSSGKSIN 308 N + WD ++WD+ + + + +I+ I Sbjct: 416 ---NAANVSNWDEAIWDQNTFDRIEMYINRLRVNNRTKKIG---------MIIEDVTHIR 463 Query: 309 DIQINNAKLLVE 320 + + E Sbjct: 464 PFVVYGIGVQYE 475 >gi|237748184|ref|ZP_04578664.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] gi|229379546|gb|EEO29637.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] Length = 478 Score = 105 bits (263), Expect = 6e-21, Method: Composition-based stats. Identities = 30/297 (10%), Positives = 78/297 (26%), Gaps = 42/297 (14%) Query: 25 CVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMK 84 V L G Y +P ++ L+ +++ V D++ ++ G S+ Sbjct: 218 LVVLMRDGAQVWYADPDP---TAMCLEETVENVGTSFPQSLATVAGDLYFLSDFGFRSIT 274 Query: 85 NILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVM 144 L +L + P+ + + + L + + + V Sbjct: 275 TQQLVSRLDDLDIGSPVDTLVRPVLQDVKGAPKAVYFYGTGQYLCAIDRQMFVYS---VS 331 Query: 145 NVAHNNRWSSF-HNWFTQSYVVANENLFFGDYEGTFWQGDISGSDNARPFIGIYLSPFHA 203 + WS + + N L+ + + + + +D+ + + + P+ Sbjct: 332 RTSRIAAWSRYDLPVTVDAMDELNGVLYIRSGDDVYKLDEEAHTDDGQEYEVVLELPYMN 391 Query: 204 IDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKETISNNPMDNGIWDNSL 263 G +R L +Q + + + + N Sbjct: 392 FKTP-GILKRVYGVDLVMQGE----CYFSMGFDVRNHEAVTDEVRVVGN----------- 435 Query: 264 WDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVIVSSGKSINDIQINNAKLLVE 320 + L + G + + Q++ + E Sbjct: 436 ----TYGGGLIPLEVA-----------GTEFSPRFR----NVTNQPFQLDALTIYYE 473 >gi|237746345|ref|ZP_04576825.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS] gi|229377696|gb|EEO27787.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS] Length = 464 Score = 103 bits (256), Expect = 4e-20, Method: Composition-based stats. Identities = 28/278 (10%), Positives = 75/278 (26%), Gaps = 39/278 (14%) Query: 44 DSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQE 103 D ++ SL +++ V D++ ++ G S+ L +L + P+ Sbjct: 220 DPTAMSLVETVENVGSSYPQSLATVAGDLYFLSDFGFRSITTQQLVARLDDLDIGSPVDM 279 Query: 104 QWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSF-HNWFTQS 162 + + + L + + + V + WS + + + Sbjct: 280 LVRPCLKALKGTPKAVYFYGTGQYLCAIDRQMFVYS---VSRTSKIAAWSRYELPFTVDA 336 Query: 163 YVVANENLFFGDYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQ 222 + L+ + + + + +D+ + + I P+ G +R L +Q Sbjct: 337 MDELDGVLYLRSGDDVYKLDEEAHTDDGKEYEVILELPYMNFKTP-GVLKRVYGVDLVMQ 395 Query: 223 AYQRPYLKLFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQ 282 + + + ++ N + L + Sbjct: 396 GE----CYFSMGFDVRNFEAVTDEVRVAGN---------------TYGGGLIPLEVA--- 433 Query: 283 FCQNVVAYGNCLAVGCVIVSSGKSINDIQINNAKLLVE 320 G A + Q++ + E Sbjct: 434 --------GTEFAPRFR----NVTDQPFQLDALTIYYE 459 >gi|228905773|ref|ZP_04069685.1| hypothetical protein bthur0014_68150 [Bacillus thuringiensis IBL 4222] gi|228853868|gb|EEM98614.1| hypothetical protein bthur0014_68150 [Bacillus thuringiensis IBL 4222] Length = 221 Score = 94.0 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 24/225 (10%), Positives = 67/225 (29%), Gaps = 27/225 (12%) Query: 97 LSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFH 156 ++ I+ + + + L +L+ D+ V W+ + Sbjct: 7 ITKQIESTMRAIPLTEKNKAVAGYF-EGKYYLSFPNGKTLVYDELLVC-------WTVYS 58 Query: 157 NWFTQSYVVANENLFFG-DYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRAC 215 N +V + N +FG + + + D+ +P + + ++ Sbjct: 59 NIKADVFVNYDGNFYFGSNKNAYVFHPE--YHDDGKPIPFRMETKYFDFGLMTQDKKI-- 114 Query: 216 VAHLSLQAYQRPYLKLFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFV 275 + L + Q +L + +++ ++ N + WD ++WD+ + Sbjct: 115 -HRIWLHSNQPNGYRLGVKLDFETKQVNGTRP----NAANVSNWDEAIWDQNTFDRIEMY 169 Query: 276 TKKKLFQFCQNVVAYGNCLAVGCVIVSSGKSINDIQINNAKLLVE 320 + + +I+ I + + E Sbjct: 170 INRLRVSNRTKKIG---------MIIEDVTHIRPFVVYGIGVQYE 205 >gi|134095452|ref|YP_001100527.1| hypothetical protein HEAR2272 [Herminiimonas arsenicoxydans] gi|133739355|emb|CAL62405.1| Hypothetical protein HEAR2272 [Herminiimonas arsenicoxydans] Length = 463 Score = 90.2 bits (222), Expect = 4e-16, Method: Composition-based stats. Identities = 21/278 (7%), Positives = 72/278 (25%), Gaps = 39/278 (14%) Query: 44 DSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQE 103 D ++ + ++ V D++ ++ G S+ L +++ ++ + PI Sbjct: 219 DPTNMAFDDNVENVGTSYPNSLSTVGGDLFFLSDFGFRSITTSRLIDKREDVDVGSPIDP 278 Query: 104 QWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFH-NWFTQS 162 + A+ + + + + + + + WS + + + Sbjct: 279 LVKGALAGMAGAPRAKYYYGTGQYMCAIDRTLFVYS---LSRTSKIAAWSRYRMPFDVDA 335 Query: 163 YVVANENLFFGDYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQ 222 L+ + + + +D+ + + P+ G +R + + Sbjct: 336 IAERLGVLYLRSGDDVYRFDEDIKTDDGVEYEVLLEMPYMDFKTP-GVLKRITGIDIVM- 393 Query: 223 AYQRPYLKLFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQ 282 + Y L + + + + Sbjct: 394 -EGQCYFSLGWDVRNALALTDEVRIIG-----------------NTRGGGMIP------- 428 Query: 283 FCQNVVAYGNCLAVGCVIVSSGKSINDIQINNAKLLVE 320 + G + ++++ L E Sbjct: 429 ----IECCGTEFSPRFR----NLDNKPFRLDSLTLYYE 458 >gi|209548752|ref|YP_002280669.1| hypothetical protein Rleg2_1149 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534508|gb|ACI54443.1| hypothetical protein Rleg2_1149 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 606 Score = 84.8 bits (208), Expect = 1e-14, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 71/258 (27%), Gaps = 27/258 (10%) Query: 30 TLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILL- 88 ++ G +P + F + ++ + ND T GL ++ + Sbjct: 319 GRLRTYIWQGYDPGEIGGFIPVKTLA-VGLIHQKLRQSMPNDEAFVTPFGLRTLSVQVQT 377 Query: 89 --------QEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDK 140 + S I+ +++ M + + Sbjct: 378 DGVEVTSDIGSTQDGEFSDKIRALLADDDSYRK--ARSFFYQRDGM--FGYKLDDTGL-Q 432 Query: 141 TFVMNVAHNNRWSSFHNW--FTQSYVVANEN--LFFGDYEGTFWQGDI------SGSDNA 190 +V+N W+ F + S+V +N + + + S +D+ Sbjct: 433 VYVLNDKAKG-WTEFQGYFADAASFVALGDNRLMILRGEQAYLYANGTDALVGESYADDE 491 Query: 191 RPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKETIS 250 P G++ P+ + R G + + L + T Sbjct: 492 EPIFGMWWLPWLSRSARWGNRA-VEILLEDTAPNVTLTLDRMIDFREQNVVTTQVTVTGG 550 Query: 251 NNPMDNGIWDNSLWDEVK 268 + D +WD +WD Sbjct: 551 GSKWDEALWDVGMWDSGT 568 >gi|153212099|ref|ZP_01947916.1| hypothetical protein A55_1878 [Vibrio cholerae 1587] gi|124116895|gb|EAY35715.1| hypothetical protein A55_1878 [Vibrio cholerae 1587] Length = 665 Score = 84.8 bits (208), Expect = 1e-14, Method: Composition-based stats. Identities = 48/326 (14%), Positives = 93/326 (28%), Gaps = 43/326 (13%) Query: 12 SWSTESGDGL----STLCVFLSTLGEVAVYGGDN------PDDSSSFSLKAIYHIGRPLG 61 SWS G + F S G + G + ++ +K + + Sbjct: 357 SWSGLLGAEQFSTGDEITGFKSIQGGAMIIAGSRSINALFGEVRDNWVIKRLTSKVG-VK 415 Query: 62 KRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWP-------IQEQWEQAIMVAPT 114 + V T NGLI + + + + + I+ + AI P Sbjct: 416 PGMLTSTFVPVA-QTTNGLIRVDSTDVFGDLIASEIQANKQAPKAAIRPYF--AIANIPE 472 Query: 115 GWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGD 174 + + ++ L++ ++ F W + +NLFF Sbjct: 473 SNQVRFYGEKAHLIVEIQPDAKTRTTEFTYPQPLRGVW------------QSPDNLFFAF 520 Query: 175 YEGTFWQGDIS-GSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFA 233 +G ++ + + S P + + A L + LK +A Sbjct: 521 DDGRIYRQNAAVNSFAGEPIFWDMRLAYSHCGTPT-VVKNWKSAELQMNVANALSLKFYA 579 Query: 234 RANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNC 293 +Y + G W+ S W++ WT + T V GN Sbjct: 580 DLDYGTPDKSQTLVNTGIAYGGGGRWNESDWNDFYWTAPDYATPI--------VSLSGNS 631 Query: 294 LAVGCVIVSSGKSINDIQINNAKLLV 319 + + S + I+ L Sbjct: 632 RNISISLTGSSDHEENFDISGFILNY 657 >gi|225155407|ref|ZP_03723899.1| hypothetical protein ObacDRAFT_9455 [Opitutaceae bacterium TAV2] gi|224803863|gb|EEG22094.1| hypothetical protein ObacDRAFT_9455 [Opitutaceae bacterium TAV2] Length = 592 Score = 54.4 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 35/97 (36%), Gaps = 3/97 (3%) Query: 26 VFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKN 85 + + G V +P ++S+ + I +V V D++ + G+ S+ Sbjct: 235 LGVFCAGSVWSIDSSDP-VVGNWSIMKVTDIAGCAEGNTVVAVGQDIYFLSQWGVCSLGA 293 Query: 86 ILLQEEK-TNLPLSWPIQEQWEQAI-MVAPTGWSMIL 120 + + +S PIQ ++ W+ + Sbjct: 294 LQTTDSISPASTISAPIQPFLDRINWSQIRNAWATVW 330 >gi|218673258|ref|ZP_03522927.1| hypothetical protein RetlG_17531 [Rhizobium etli GR56] Length = 94 Score = 42.4 bits (98), Expect = 0.080, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 31/91 (34%), Gaps = 10/91 (10%) Query: 232 FARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYG 291 +Y P ++ NN W ++W K+ V K + G Sbjct: 7 GISVDYQTKVPTALLSSVENN--AAAKWGTAVWGVAKFPGISLVRK------FASAGGAG 58 Query: 292 NCLAVGCVIVSSGK--SINDIQINNAKLLVE 320 + LA + SG S+++ + +L E Sbjct: 59 SALAPTIRALISGSSGSVSEAAVVGGSVLYE 89 >gi|115376424|ref|ZP_01463660.1| dipeptidyl peptidase IV [Stigmatella aurantiaca DW4/3-1] gi|310823227|ref|YP_003955585.1| peptidase, s9b (dipeptidyl peptidase iv) subfamily [Stigmatella aurantiaca DW4/3-1] gi|115366556|gb|EAU65555.1| dipeptidyl peptidase IV [Stigmatella aurantiaca DW4/3-1] gi|309396299|gb|ADO73758.1| Peptidase, S9B (Dipeptidyl peptidase IV) subfamily [Stigmatella aurantiaca DW4/3-1] Length = 720 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 40/120 (33%), Gaps = 14/120 (11%) Query: 18 GDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKA------IYHIGRPLGKRAIVFVKND 71 G G L +L + GG NP + + +K + G L A+ Sbjct: 374 GSGFQMLVNYLPKEDVLYFLGGPNPTERYLWRVKGGGEPERVTSGGPALEGAAVSGRGG- 432 Query: 72 VWIATNNGLISMKNI-------LLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKR 124 + + T G ++ E ++ PIQ ++E + W+ I+ + Sbjct: 433 LMVITTEGPTRLRRAYVHRSDGTRLGELPSVAREPPIQPRFELRQVGPERFWTAIIRPQN 492 >gi|297836494|ref|XP_002886129.1| hypothetical protein ARALYDRAFT_480680 [Arabidopsis lyrata subsp. lyrata] gi|297331969|gb|EFH62388.1| hypothetical protein ARALYDRAFT_480680 [Arabidopsis lyrata subsp. lyrata] Length = 1194 Score = 40.5 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 10/88 (11%), Positives = 29/88 (32%), Gaps = 1/88 (1%) Query: 21 LSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGL 80 + + + +V+ D S+ +G + + + + ++ + NGL Sbjct: 681 VDDTGCGIESSKWDSVFESFEQADPSTTRTHGGTGLGLCIVRNLVNKMGGEIKVVQKNGL 740 Query: 81 ISMKNILLQEEKTNLPLSWPIQEQWEQA 108 ++ + L IQ + + Sbjct: 741 GTLMRLYLILSTP-ETADQNIQPDFSKY 767 >gi|312116010|ref|YP_004013606.1| hypothetical protein Rvan_3322 [Rhodomicrobium vannielii ATCC 17100] gi|311221139|gb|ADP72507.1| hypothetical protein Rvan_3322 [Rhodomicrobium vannielii ATCC 17100] Length = 474 Score = 39.3 bits (90), Expect = 0.66, Method: Composition-based stats. Identities = 14/181 (7%), Positives = 45/181 (24%), Gaps = 7/181 (3%) Query: 25 CVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMK 84 + + ++ +F+ + + + V + + +NG++ Sbjct: 197 ALIALGEASLEIWENA-GTTPFAFAPIRADIDIGCIAEHTVASVADSLLFVDHNGVVR-- 253 Query: 85 NILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVM 144 L + + I+ + + + Sbjct: 254 --QLTGSEPQRISTHAIERAIAALSWDERRSLYAVYTHFNGHDFYAITSAYWTFEFDVAT 311 Query: 145 NVAHNNRWSSFHNWFTQSYVVANENLFFGD--YEGTFWQGDISGSDNARPFIGIYLSPFH 202 + H W + + N + G D S +++ +P++ + S Sbjct: 312 GLWHERASGDLEYWTARGHASFNGKVVIGSSLDASLHVLDDASYTESGQPYLFVAQSAPL 371 Query: 203 A 203 Sbjct: 372 H 372 >gi|18398532|ref|NP_565424.1| ATHK1 (histidine kinase 1); histidine phosphotransfer kinase/ osmosensor/ protein histidine kinase [Arabidopsis thaliana] gi|332278154|sp|Q9SXL4|AHK1_ARATH RecName: Full=Histidine kinase 1; AltName: Full=Arabidopsis histidine kinase 1; Short=AtHK1; AltName: Full=Protein AUTHENTIC HIS-KINASE 1 gi|330251597|gb|AEC06691.1| histidine kinase 1 [Arabidopsis thaliana] Length = 1207 Score = 39.3 bits (90), Expect = 0.68, Method: Composition-based stats. Identities = 10/88 (11%), Positives = 30/88 (34%), Gaps = 1/88 (1%) Query: 21 LSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGL 80 + + +V+ D S+ +G + + + + ++ + NGL Sbjct: 692 VDDTGCGIDPSKWDSVFESFEQADPSTTRTHGGTGLGLCIVRNLVNKMGGEIKVVQKNGL 751 Query: 81 ISMKNILLQEEKTNLPLSWPIQEQWEQA 108 ++ + L + + IQ + + Sbjct: 752 GTLMRLYLILSTPD-TVDQNIQPDFSKY 778 >gi|4586626|dbj|BAA32085.1| histidine kinase 1 [Arabidopsis thaliana] Length = 1207 Score = 39.3 bits (90), Expect = 0.68, Method: Composition-based stats. Identities = 10/88 (11%), Positives = 30/88 (34%), Gaps = 1/88 (1%) Query: 21 LSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGL 80 + + +V+ D S+ +G + + + + ++ + NGL Sbjct: 692 VDDTGCGIDPSKWDSVFESFEQADPSTTRTHGGTGLGLCIVRNLVNKMGGEIKVVQKNGL 751 Query: 81 ISMKNILLQEEKTNLPLSWPIQEQWEQA 108 ++ + L + + IQ + + Sbjct: 752 GTLMRLYLILSTPD-TVDQNIQPDFSKY 778 >gi|258573967|ref|XP_002541165.1| predicted protein [Uncinocarpus reesii 1704] gi|237901431|gb|EEP75832.1| predicted protein [Uncinocarpus reesii 1704] Length = 365 Score = 39.3 bits (90), Expect = 0.81, Method: Composition-based stats. Identities = 5/106 (4%), Positives = 22/106 (20%), Gaps = 1/106 (0%) Query: 129 ISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFWQGDISGSD 188 + + F+ + + + + D + + + +D Sbjct: 106 FEKDDDDSQTQYNFICGNDECGEYDDYDGIDARFDDTYYAQEYEEDSDDS-FADLYGNND 164 Query: 189 NARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFAR 234 + + S + + + + R Sbjct: 165 DKATNEASFESDMFTFTTDVRMPKVGDLIDPDGGPGGYVEAVFWLR 210 >gi|293375585|ref|ZP_06621859.1| nucleic acid-binding domain protein [Turicibacter sanguinis PC909] gi|292645802|gb|EFF63838.1| nucleic acid-binding domain protein [Turicibacter sanguinis PC909] Length = 1736 Score = 39.0 bits (89), Expect = 0.89, Method: Composition-based stats. Identities = 23/199 (11%), Positives = 54/199 (27%), Gaps = 18/199 (9%) Query: 2 QAGGSLPAGFSWSTESGD---GLSTLCVFLSTLGEVA-VYGGDNPDDSSSFSLKAIYHIG 57 + G + ++ + G V ++T + + G + + +Y Sbjct: 1082 KMNGYVTGNYAGNMNYVTELVGQEGGVVKINTETITSEILTGIDVLIITDPENDFVYSNE 1141 Query: 58 RPLGKRAIVFVKNDVWIATNNGLISMKNIL---LQEEKTNLPLSWPIQEQWEQAIMVAPT 114 + +A V D+ I + Q K + I+ +Q + Sbjct: 1142 EVVAIKAFVEAGGDLIITSKADYKDQTGQFGNAAQGNKVLEAIGATIRFNDDQVVDPVEN 1201 Query: 115 GWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLF--- 171 G + L N + S ++ S+ N L Sbjct: 1202 G-------GQAYRLYFDDYNEESPYTKGIDFGKIEQGNSKNTDYK-FSFYSGNSVLVPAD 1253 Query: 172 FGDYEGTFWQGDISGSDNA 190 + + + + +D+A Sbjct: 1254 ATNVDVVVKGHETTTNDDA 1272 >gi|315570638|gb|ADU33323.1| pectin methylesterase [Dendroctonus ponderosae] gi|332375344|gb|AEE62813.1| unknown [Dendroctonus ponderosae] Length = 388 Score = 39.0 bits (89), Expect = 0.94, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 36/124 (29%), Gaps = 15/124 (12%) Query: 45 SSSFSLKAIY-HIGRPLGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWP--- 100 S+ F L+ + H G + + D NN +S+++ + Sbjct: 177 SNDFQLQGVTIHNGATEAQALAIKTSGDKIHLMNNNFMSIQDTVGLGIDEGENEKAERVL 236 Query: 101 -----IQEQWEQA------IMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHN 149 I+ + + + T ++ EK ++ + V+N Sbjct: 237 IHMCYIEGELDYVIGGASAVFNMVTFNTVASVEKEQYVIFMPDTAPGRTYGFLVINSNIT 296 Query: 150 NRWS 153 S Sbjct: 297 GDAS 300 >gi|156743150|ref|YP_001433279.1| WD-40 repeat-containing protein [Roseiflexus castenholzii DSM 13941] gi|156234478|gb|ABU59261.1| WD-40 repeat protein [Roseiflexus castenholzii DSM 13941] Length = 1157 Score = 39.0 bits (89), Expect = 0.98, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 35/154 (22%), Gaps = 7/154 (4%) Query: 25 CVFLSTLGEVAVYGGDNPDDSSSFSLKAIY----HIGRPLGKRAIVFVKND---VWIATN 77 + LS G G +P + L + P G + + D + I N Sbjct: 820 LLALSRDGARLATAGADPTSLRLWDLAGTTLTARELAGPGGTNTAMALSRDGRRLAIGDN 879 Query: 78 NGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLL 137 G + ++ + + L + S Sbjct: 880 QGATFVYDLTNLSASPQRLSGLGGRSGIRSLTFSPDGRLLATGNDDSQARLYNLASGSGT 939 Query: 138 SDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLF 171 + + +SS W L+ Sbjct: 940 YTRERTRGALTSITFSSDGRWLAYGSAGGQVRLW 973 >gi|307545224|ref|YP_003897703.1| hypothetical protein HELO_2634 [Halomonas elongata DSM 2581] gi|307217248|emb|CBV42518.1| hypothetical protein HELO_2634 [Halomonas elongata DSM 2581] Length = 547 Score = 39.0 bits (89), Expect = 1.0, Method: Composition-based stats. Identities = 15/69 (21%), Positives = 32/69 (46%), Gaps = 3/69 (4%) Query: 17 SGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIAT 76 G ++ + + T G+ ++ G +P S L +Y + KR+IV + A+ Sbjct: 309 VGLASTSAGLVVLTTGQPSLVTGSSPAAISQMKL-DVYQ--SCVSKRSIVEMGGYALYAS 365 Query: 77 NNGLISMKN 85 +GL+++ Sbjct: 366 PDGLVAVGG 374 >gi|254488980|ref|ZP_05102185.1| C4-dicarboxylate transport sensor protein DctB [Roseobacter sp. GAI101] gi|214045849|gb|EEB86487.1| C4-dicarboxylate transport sensor protein DctB [Roseobacter sp. GAI101] Length = 413 Score = 39.0 bits (89), Expect = 1.0, Method: Composition-based stats. Identities = 7/112 (6%), Positives = 25/112 (22%), Gaps = 10/112 (8%) Query: 65 IVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKR 124 + + + + ++ S ++ + A+ T +S+I Sbjct: 94 VEEIGAASLMLYDIDGRTVAATDRN---RLGS-SHRLEPYFVDAVRANATVFSVIPRSTG 149 Query: 125 NMLLISCPKNSLLSDKTFVMN-----VAHNNRWSSFHNWFTQSYVVANENLF 171 + V+ W+ + + + Sbjct: 150 GYRFFYSRRIQEGGATLGVIAAEVDLQRFERAWAGISD-AVIVTDSTGDIIM 200 >gi|325836749|ref|ZP_08166216.1| nucleic acid-binding domain protein [Turicibacter sp. HGF1] gi|325491127|gb|EGC93416.1| nucleic acid-binding domain protein [Turicibacter sp. HGF1] Length = 1736 Score = 38.2 bits (87), Expect = 1.5, Method: Composition-based stats. Identities = 23/199 (11%), Positives = 54/199 (27%), Gaps = 18/199 (9%) Query: 2 QAGGSLPAGFSWSTESGD---GLSTLCVFLSTLGEVA-VYGGDNPDDSSSFSLKAIYHIG 57 + G + ++ + G V ++T + + G N + +Y Sbjct: 1082 KMNGYVTGNYAGNMNYVTELVGQEGGVVKINTETITSEILTGINVLIITDPENDFVYSNE 1141 Query: 58 RPLGKRAIVFVKNDVWIATNNGLISMKNIL---LQEEKTNLPLSWPIQEQWEQAIMVAPT 114 + +A V D+ + + Q K + I+ +Q + Sbjct: 1142 EVVAIKAFVEAGGDLIVTSKADYKDQTGQFGNAAQGNKVLEAIGATIRFNDDQVVDPVEN 1201 Query: 115 GWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLF--- 171 G + L N + S ++ S+ N L Sbjct: 1202 G-------GQAYRLYFDDYNEESPYTKGIDFGKIEQGNSKNTDYK-FSFYSGNSVLVPAD 1253 Query: 172 FGDYEGTFWQGDISGSDNA 190 + + + + +D+A Sbjct: 1254 ATNVDVVVKGHETTTNDDA 1272 >gi|81355156|ref|YP_392481.1| putative coat protein [Rosellinia necatrix partitivirus 1-W8] gi|63093873|dbj|BAD98238.1| putative coat protein [Rosellinia necatrix partitivirus 1-W8] Length = 686 Score = 38.2 bits (87), Expect = 1.6, Method: Composition-based stats. Identities = 23/246 (9%), Positives = 68/246 (27%), Gaps = 35/246 (14%) Query: 42 PDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPI 101 P+ ++ L + P+ + D + + ++ P Sbjct: 402 PNTGTTHHLASAVCDTDPVHPVTVTQPSADFVLL-----RDVTRTFPIADQHL----IPR 452 Query: 102 QEQWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQ 161 ++ ++++ + NS +S ++ Sbjct: 453 DTEFLLYDEDRDFAPTVLVLD--------PTTNSTVSAYM-----------ATLTGMVIY 493 Query: 162 SYVVANENLFFGDYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSL 221 S+ + + F + E + +D+A P+ +Y + + A +A + Sbjct: 494 SHEIDASAISFPNTEDINGLHNCQFADSATPYTMVYRPSHYTLGT------TAHIARKRI 547 Query: 222 QAYQRPYLKLFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKW-TDNLFVTKKKL 280 + + ++ Y T + + +G + D V + +++ Sbjct: 548 RVKDNRFKSALYLVDFAKIYIPRPILTTIDTLVSHGFPGFTFRDNVTFIGATTSALAQRV 607 Query: 281 FQFCQN 286 N Sbjct: 608 VSTRSN 613 >gi|83942489|ref|ZP_00954950.1| C4-dicarboxylate transport sensor protein DctB, putative [Sulfitobacter sp. EE-36] gi|83846582|gb|EAP84458.1| C4-dicarboxylate transport sensor protein DctB, putative [Sulfitobacter sp. EE-36] Length = 586 Score = 37.8 bits (86), Expect = 2.3, Method: Composition-based stats. Identities = 6/112 (5%), Positives = 24/112 (21%), Gaps = 10/112 (8%) Query: 65 IVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKR 124 + + + + ++ + + A+ T +S+I + Sbjct: 103 VEEIGAASLMLYDIDGRTVAATDRN---RLGASHRQ-EAYFVDALRSNSTVFSVIPRDTG 158 Query: 125 NMLLISCPKNSLLSDKTFVMN-----VAHNNRWSSFHNWFTQSYVVANENLF 171 + V+ W+ + + + Sbjct: 159 GYRFFYSRRIQEGGTNLGVIAAEVDLQRFERAWAGISD-AVIVTDSTGDIIM 209 >gi|186896503|ref|YP_001873615.1| N-acetylglucosamine-6-phosphate deacetylase [Yersinia pseudotuberculosis PB1/+] gi|186699529|gb|ACC90158.1| N-acetylglucosamine-6-phosphate deacetylase [Yersinia pseudotuberculosis PB1/+] Length = 388 Score = 37.8 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 31/98 (31%), Gaps = 3/98 (3%) Query: 20 GLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNG 79 G+ T + GE+ + P D+ L I + DV A+ Sbjct: 15 GIETGVAVIVEQGEIVAVTRELPADAEIVHLTGKTLIPGLIDIHIHGRQGADVMDASAEA 74 Query: 80 LISMKNILLQEEKTN---LPLSWPIQEQWEQAIMVAPT 114 L ++ L Q +S PIQ+ + V Sbjct: 75 LRTIARALPQTGVVAWVGTTVSAPIQDIFAALAQVRDF 112 >gi|153947379|ref|YP_001399920.1| N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pseudotuberculosis IP 31758] gi|152958874|gb|ABS46335.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pseudotuberculosis IP 31758] Length = 388 Score = 37.8 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 31/98 (31%), Gaps = 3/98 (3%) Query: 20 GLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNG 79 G+ T + GE+ + P D+ L I + DV A+ Sbjct: 15 GIETGVAVIVEQGEIVAVTRELPADAEIVHLTGKTLIPGLIDIHIHGRQGADVMDASAEA 74 Query: 80 LISMKNILLQEEKTN---LPLSWPIQEQWEQAIMVAPT 114 L ++ L Q +S PIQ+ + V Sbjct: 75 LRTIARALPQTGVVAWVGTTVSAPIQDIFAALAQVRDF 112 >gi|22127099|ref|NP_670522.1| deacetylase [Yersinia pestis KIM 10] gi|45443271|ref|NP_994810.1| 2-deoxy-D-gluconate 3-dehydrogenase [Yersinia pestis biovar Microtus str. 91001] gi|51597396|ref|YP_071587.1| acetylglucosamine-6-phosphate deacetylase [Yersinia pseudotuberculosis IP 32953] gi|108806429|ref|YP_650345.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Antiqua] gi|108813200|ref|YP_648967.1| acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Nepal516] gi|145597979|ref|YP_001162055.1| acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Pestoides F] gi|150260148|ref|ZP_01916876.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis CA88-4125] gi|162420948|ref|YP_001607713.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis Angola] gi|165924986|ref|ZP_02220818.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. F1991016] gi|165937229|ref|ZP_02225793.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. IP275] gi|166010394|ref|ZP_02231292.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Antiqua str. E1979001] gi|166212754|ref|ZP_02238789.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Antiqua str. B42003004] gi|167400119|ref|ZP_02305637.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167419118|ref|ZP_02310871.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167425364|ref|ZP_02317117.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167468775|ref|ZP_02333479.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis FV-1] gi|170023237|ref|YP_001719742.1| N-acetylglucosamine-6-phosphate deacetylase [Yersinia pseudotuberculosis YPIII] gi|218928014|ref|YP_002345889.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis CO92] gi|229837513|ref|ZP_04457675.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Pestoides A] gi|229840741|ref|ZP_04460900.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842700|ref|ZP_04462855.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. India 195] gi|229903647|ref|ZP_04518760.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Nepal516] gi|270487428|ref|ZP_06204502.1| N-acetylglucosamine-6-phosphate deacetylase [Yersinia pestis KIM D27] gi|294502985|ref|YP_003567047.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Z176003] gi|21960155|gb|AAM86773.1|AE013923_8 putative deacetylase [Yersinia pestis KIM 10] gi|45438139|gb|AAS63687.1| 2-deoxy-D-gluconate 3-dehydrogenase [Yersinia pestis biovar Microtus str. 91001] gi|51590678|emb|CAH22320.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pseudotuberculosis IP 32953] gi|108776848|gb|ABG19367.1| N-acetylglucosamine 6-phosphate deacetylase [Yersinia pestis Nepal516] gi|108778342|gb|ABG12400.1| N-acetylglucosamine 6-phosphate deacetylase [Yersinia pestis Antiqua] gi|115346625|emb|CAL19508.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis CO92] gi|145209675|gb|ABP39082.1| N-acetylglucosamine 6-phosphate deacetylase [Yersinia pestis Pestoides F] gi|149289556|gb|EDM39633.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis CA88-4125] gi|162353763|gb|ABX87711.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis Angola] gi|165914703|gb|EDR33316.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. IP275] gi|165923186|gb|EDR40337.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. F1991016] gi|165990880|gb|EDR43181.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Antiqua str. E1979001] gi|166206046|gb|EDR50526.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Antiqua str. B42003004] gi|166963112|gb|EDR59133.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167050827|gb|EDR62235.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167055764|gb|EDR65548.1| putative N-acetylgalactosamine-6-phosphate deacetylase [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169749771|gb|ACA67289.1| N-acetylglucosamine-6-phosphate deacetylase [Yersinia pseudotuberculosis YPIII] gi|229679417|gb|EEO75520.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Nepal516] gi|229691010|gb|EEO83064.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. India 195] gi|229697107|gb|EEO87154.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis biovar Orientalis str. PEXU2] gi|229704201|gb|EEO91212.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Pestoides A] gi|262361020|gb|ACY57741.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis D106004] gi|262364961|gb|ACY61518.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis D182038] gi|270335932|gb|EFA46709.1| N-acetylglucosamine-6-phosphate deacetylase [Yersinia pestis KIM D27] gi|294353444|gb|ADE63785.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis Z176003] gi|320014090|gb|ADV97661.1| putative acetylglucosamine-6-phosphate deacetylase [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 388 Score = 37.8 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 31/98 (31%), Gaps = 3/98 (3%) Query: 20 GLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNG 79 G+ T + GE+ + P D+ L I + DV A+ Sbjct: 15 GIETGVAVIVEQGEIVAVTRELPADAEIVHLTGKTLIPGLIDIHIHGRQGADVMDASAEA 74 Query: 80 LISMKNILLQEEKTN---LPLSWPIQEQWEQAIMVAPT 114 L ++ L Q +S PIQ+ + V Sbjct: 75 LRTIARALPQTGVVAWVGTTVSAPIQDIFAALAQVRDF 112 >gi|109899509|ref|YP_662764.1| protein of unknown function DUF940, membrane lipoprotein putative [Pseudoalteromonas atlantica T6c] gi|109701790|gb|ABG41710.1| protein of unknown function DUF940, membrane lipoprotein putative [Pseudoalteromonas atlantica T6c] Length = 709 Score = 37.4 bits (85), Expect = 2.5, Method: Composition-based stats. Identities = 28/285 (9%), Positives = 72/285 (25%), Gaps = 19/285 (6%) Query: 44 DSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKN--ILLQEEKTNLPLSWPI 101 S+F+ K + I N G I + + + + Sbjct: 424 SVSNFNPKNTSGFFTGIETFWIQTFGNPEEFYLYQGGIMVNGGYSFDGKTSVVGTIKANL 483 Query: 102 QEQWEQAIMVAPTGWSMILWEKRNML-LISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFT 160 + +++ S + + N+ +S SL + N +N ++ F Sbjct: 484 LQNFDKFNFTRDRAPSSLPRVRTNVREYVSGRDISLENLYLHWNNRIADNLYAQF----- 538 Query: 161 QSYVVANENLFFGDYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLS 220 Y E ++ G ++ S + + ++ A++ Sbjct: 539 --YGGYLETMYGGVGGELLYRPVDSNLAFGFDINYVQQRSWENDIDFYDYKVLTGHANVF 596 Query: 221 LQAYQRPYLKLFAR---ANYDHSYPEFSKETISNNPMDNGIW------DNSLWDEVKWTD 271 Q P L + ++ + G + + + E +T Sbjct: 597 WQPEFLPDTLLTIKVGQYLAKDKGVTIDFAKRFDSGIVVGAYAAITDVSSEEYGEGSFTK 656 Query: 272 NLFVTKKKLFQFCQNVVAYGNCLAVGCVIVSSGKSINDIQINNAK 316 +++ + G + I++ Sbjct: 657 GFYLSIPFDLFSLKPAKGSGKIPWIPISRDGGQALNRPIELFGVT 701 >gi|257413074|ref|ZP_04741901.2| putative bacterial extracellular solute-binding protein [Roseburia intestinalis L1-82] gi|257204740|gb|EEV03025.1| putative bacterial extracellular solute-binding protein [Roseburia intestinalis L1-82] Length = 976 Score = 37.4 bits (85), Expect = 2.7, Method: Composition-based stats. Identities = 35/262 (13%), Positives = 81/262 (30%), Gaps = 28/262 (10%) Query: 34 VAVYGGDNPDDSSSFSLKAIY-HIGRPLGKRA--IVFVKNDVWIATNN--GLISMKNILL 88 + VY G +PDD ++L +Y + + + + + +DV T I++ L Sbjct: 420 ILVYTGADPDDYRDYNLDQVYPEVIEAMDLESKRLYKIIDDVVAYTGQKAEKIAVAQTLA 479 Query: 89 QEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKRN-----MLLISCPKNSLLSDKTFV 143 Q+ + + I ++ + + IL +++S + + D Sbjct: 480 QQLERFVERPDKITVEFTTFKENITSLGTAILNMSETKLDIDYIVVSSKGSEVPED---- 535 Query: 144 MNVAHNNRWSSFHNWFTQSYVVAN--ENLFFGDYEGTFWQGDISGSDNARPFIGIYLSPF 201 W ++ V + +++ D ++G D + Sbjct: 536 RASGWAKAWHEIKSFAASFVVDYDAVGDVYDEDDNEVVRVWIVTGRDQGTILKTMV---D 592 Query: 202 HAIDPRLGFQRRACVAHLS-----LQAYQRPYLKLFARAN----YDHSYPEFSKETISNN 252 P G + + S + A Q P + L A+ Y + Sbjct: 593 DTFTPETGIKVNVEIVDASALLNAVVAGQGPDVVLSVGADQPVNYALRNAAEDLTQFEDY 652 Query: 253 PMDNGIWDNSLWDEVKWTDNLF 274 ++ S + ++ L+ Sbjct: 653 EDVLKVFYESAYRAYEYDGGLY 674 >gi|325924710|ref|ZP_08186148.1| Tfp pilus assembly protein, tip-associated adhesin PilY1 [Xanthomonas perforans 91-118] gi|325544916|gb|EGD16261.1| Tfp pilus assembly protein, tip-associated adhesin PilY1 [Xanthomonas perforans 91-118] Length = 1055 Score = 37.0 bits (84), Expect = 3.4, Method: Composition-based stats. Identities = 25/211 (11%), Positives = 59/211 (27%), Gaps = 12/211 (5%) Query: 95 LPLSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSS 154 L + + + +LW+K + + + ++ + W + Sbjct: 691 GTLGRGGRGAFALDVTDPGNIS--LLWDKTS----GEISSMGNNLGKPIIGQLASGAWFT 744 Query: 155 FHNWFTQSYVVANENLFFGDYEGTFWQGDISGSD-NARPFIGIYLSPFHAIDPR--LGFQ 211 S + G GT + + N + + S ++I R G Sbjct: 745 MIGNGPNSSNDGAGLVLIGVLTGTVTEIATGATGANGLSGVLGWSSTNNSIVDRIYAGDL 804 Query: 212 RRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTD 271 + + A + P + T + +P W + K+ Sbjct: 805 KGNMWRFAMVGTVATVTKLFTATYG-GQTQPITAAPTAAKDPATGLTW--LFFGTGKYIS 861 Query: 272 NLFVTKKKLFQFCQNVVAYGNCLAVGCVIVS 302 + ++ K + + V G +A + S Sbjct: 862 DGDLSNKDVQSWYGLVDRSGAAIANRSTLNS 892 >gi|146301154|ref|YP_001195745.1| hypothetical protein Fjoh_3412 [Flavobacterium johnsoniae UW101] gi|146155572|gb|ABQ06426.1| hypothetical protein Fjoh_3412 [Flavobacterium johnsoniae UW101] Length = 394 Score = 37.0 bits (84), Expect = 3.6, Method: Composition-based stats. Identities = 8/72 (11%), Positives = 15/72 (20%), Gaps = 1/72 (1%) Query: 201 FHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKETISNNPMDNGIWD 260 F I + +L++ + Y NN + Sbjct: 87 FDYIKDPELGVKAFIKNYLNVSQGGAVAIDY-PYYYYSTVNTGQPSPYDLNNDGKLNNFG 145 Query: 261 NSLWDEVKWTDN 272 N W + Sbjct: 146 NDAWGFGMYPGQ 157 >gi|281207113|gb|EFA81296.1| hypothetical protein PPL_05275 [Polysphondylium pallidum PN500] Length = 1590 Score = 37.0 bits (84), Expect = 3.6, Method: Composition-based stats. Identities = 11/147 (7%), Positives = 32/147 (21%), Gaps = 17/147 (11%) Query: 124 RNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWF--------TQSYVVANENLFFGDY 175 + I +++ +F V +W + + + + Sbjct: 349 TYFIFIYSDNSNVTITTSFGYGVF--FQWYETEGIKCPAGSNRNPFTNDAPFDTIHYWGT 406 Query: 176 EGTFWQGDISGSDNARPFIGIYLSPF----HAIDPRLGFQRRACVAHLSLQAYQRPYLKL 231 G + S F + L+ ++ KL Sbjct: 407 SGIPYFSYQITSFANTNVTRSLNLGFQNQTFDFTDNNNYSSPNMTIKLTTKSNSEFTNKL 466 Query: 232 FARANYDHSYPEFSKETISNNPMDNGI 258 + + + + ++ I Sbjct: 467 ---IDLNGITILQTSDPSTDITSALFI 490 >gi|156093264|ref|XP_001612672.1| variable surface protein Vir [Plasmodium vivax SaI-1] gi|148801498|gb|EDL42898.1| variable surface protein Vir, putative [Plasmodium vivax] Length = 396 Score = 37.0 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 5/54 (9%), Positives = 13/54 (24%) Query: 236 NYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVA 289 +Y + +F + T + + S + + F Sbjct: 313 DYANRRRDFDESTYGASTYGASTYGASTYGTSTFDGTSTFDGTSTFDGTSTFDG 366 >gi|134295287|ref|YP_001119022.1| hypothetical protein Bcep1808_1176 [Burkholderia vietnamiensis G4] gi|134138444|gb|ABO54187.1| hypothetical protein Bcep1808_1176 [Burkholderia vietnamiensis G4] Length = 471 Score = 37.0 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 29/256 (11%), Positives = 66/256 (25%), Gaps = 13/256 (5%) Query: 17 SGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIA- 75 G G++ V+L V+ L ++ +I + +V+ Sbjct: 183 VGVGVTKRYVYLLGEVTTEVWFNAGDSTFPYERLPGVFMQHGCAAAGSIAHMDGEVYWLA 242 Query: 76 -TNNGLISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWSMILWEKRNMLLISCPKN 134 + G+ + + E + G++ + E +++ P Sbjct: 243 QSQQGVCYVNRSQKFNATQISTFALD-NEIASYPRVDDAIGFTYQI-EGHFFYVLTFPSA 300 Query: 135 SLLSDKTFVMNVAHNNRWSS----FHNWFTQSYVVANENLFFGD-YEGTFWQGDI-SGSD 188 + W Y GD G + D+ + +D Sbjct: 301 DKTWQYDLSTEQWNELSWVDENGGLRRHRANCYASLYGQPIVGDWENGNLYLWDVNAYTD 360 Query: 189 NARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQ-AYQRPYLKLFARANYDHSYPEFSKE 247 N P S F + R +++ ++ + + + Sbjct: 361 NGNPIP-RIRS-FMHSVDDNSDRMRYREFIANMEVGNGSNNSPVWVFLRWSDTRGKSWGN 418 Query: 248 TISNNPMDNGIWDNSL 263 I + G + SL Sbjct: 419 AIKTSLGKEGEYATSL 434 >gi|330799381|ref|XP_003287724.1| hypothetical protein DICPUDRAFT_151857 [Dictyostelium purpureum] gi|325082285|gb|EGC35772.1| hypothetical protein DICPUDRAFT_151857 [Dictyostelium purpureum] Length = 1225 Score = 36.6 bits (83), Expect = 4.4, Method: Composition-based stats. Identities = 16/109 (14%), Positives = 35/109 (32%), Gaps = 11/109 (10%) Query: 13 WSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDV 72 W ++G G + G ++ + D+S+ +G + KR + D+ Sbjct: 595 WVIDNGIG-------IPEEGREKLFQTFSQYDASTTRKYGGSGLGLAISKRLTQLLGGDI 647 Query: 73 WIATNNG----LISMKNILLQEEKTNLPLSWPIQEQWEQAIMVAPTGWS 117 W + G + + + +T S Q + + S Sbjct: 648 WFESQKGKGSSFHFLVEVFFPDYQTIQQSSLQPQFNITNQLTSNDSSTS 696 >gi|315427984|dbj|BAJ49573.1| N-methylhydantoinase B [Candidatus Caldiarchaeum subterraneum] Length = 566 Score = 36.6 bits (83), Expect = 4.4, Method: Composition-based stats. Identities = 6/76 (7%), Positives = 18/76 (23%), Gaps = 3/76 (3%) Query: 134 NSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEG---TFWQGDISGSDNA 190 + + D + + WS S+ + ++ + + D Sbjct: 109 SPVFYDSQLIAFSVNKGHWSEVGGMRFGSWTTDSTEIYQEGLQFPLLKLYNAGEPNRDLF 168 Query: 191 RPFIGIYLSPFHAIDP 206 +P + Sbjct: 169 DLIRTNVRTPDMTLGD 184 >gi|294633768|ref|ZP_06712326.1| polysaccharide lyase family 8 [Streptomyces sp. e14] gi|292830410|gb|EFF88761.1| polysaccharide lyase family 8 [Streptomyces sp. e14] Length = 715 Score = 36.6 bits (83), Expect = 4.9, Method: Composition-based stats. Identities = 16/185 (8%), Positives = 39/185 (21%), Gaps = 12/185 (6%) Query: 131 CPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTFWQGDISGSDNA 190 S W++ + + + W + S + Sbjct: 503 TVDGVAQSSALGWNATFPRASWATIAGFGGYVFPGGASVKALREERTGSWHDINTLSGST 562 Query: 191 RPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKL--FARANYDHSYPEFSKET 248 P YL+ + A V ++ + Sbjct: 563 TPLTRRYLTLWFDHGTDPAAGSYAYVLLPGADPATTAARAAAAGWLTVLANTVNQQGVSV 622 Query: 249 ISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNCLAVGCVIVSSGKSIN 308 S + +V T + V ++ G+ + + ++ Sbjct: 623 PSLGLTAVNFYGGGTVGKVSATAGVSVMIRE----------SGSTATICVADPTRQRTTV 672 Query: 309 DIQIN 313 D+ N Sbjct: 673 DVTWN 677 >gi|148557351|ref|YP_001264933.1| hypothetical protein Swit_4457 [Sphingomonas wittichii RW1] gi|148502541|gb|ABQ70795.1| hypothetical protein Swit_4457 [Sphingomonas wittichii RW1] Length = 475 Score = 36.6 bits (83), Expect = 5.5, Method: Composition-based stats. Identities = 23/249 (9%), Positives = 62/249 (24%), Gaps = 32/249 (12%) Query: 33 EVAVYGGDNPDDSSSFSLKAIYHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILLQEEK 92 E + D + + I + ++ + +N GL++ + Sbjct: 200 ENRIVRMSYTADDAVWQFDEIATDVGCIAPWSLATYGKLTFFLSNKGLMACDGV-----S 254 Query: 93 TNLPLSWPIQEQWEQAIM-VAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNVAHNNR 151 + S + + + + S ++ + ++ ++S P S + Sbjct: 255 VSAIGSEKVDRTFLHMVDRSYLSAISAVVDPRNSLYMLSVPSASPANQVLLYNFALQRFT 314 Query: 152 WSSFHNWF-----TQSYVVANENLFFGDYEGTFWQGDISGS---------DNARPFIGIY 197 + + + + + +GD + D + + +G Sbjct: 315 TAPITSQRMFPSLALGMTLEDMDAVYGDLDAVALSLDSALFRGGYPLVMLFDGNGMLGTL 374 Query: 198 LSPFH--AIDPR-----LGFQRRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKETIS 250 + G + R A L +S + ET Sbjct: 375 SGDNMAATLVDGMREWFPGRRARITGVRPFTDATTATVSILG-----QNSLGDMPTETAF 429 Query: 251 NNPMDNGIW 259 G + Sbjct: 430 TARSGGGFY 438 >gi|330996831|ref|ZP_08320700.1| Fibrobacter succinogene major domain protein [Paraprevotella xylaniphila YIT 11841] gi|329572274|gb|EGG53933.1| Fibrobacter succinogene major domain protein [Paraprevotella xylaniphila YIT 11841] Length = 445 Score = 36.3 bits (82), Expect = 5.8, Method: Composition-based stats. Identities = 18/187 (9%), Positives = 52/187 (27%), Gaps = 26/187 (13%) Query: 101 IQEQWEQAIMVAPTGWSMILWEKRNMLLISCPKNSLLSDKTFVMNV---AHNNRWSSFH- 156 + E + + S +L K + I K+ ++ V N +N +++ Sbjct: 143 VDEAATYIVDDSELFVSAVLAPKTARMHIKGEKDKTVAVSGVVYNTSFSLTDNVFTTSTE 202 Query: 157 ---------NWFTQSYVVANENLFFGDYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPR 207 + Y +E + + + + + F+ S + + + Sbjct: 203 EISTQFESDGYTPYIYCSLDETHVITLDDLYYKY---TKTFDPTKFMAG-ESGYMVMPSQ 258 Query: 208 LGFQRRACVAHLSLQAYQRPYLKLFARANYDHSYPEFSKETISNNPMDNGIWDNSLWDEV 267 + + + + L L R + + W ++ + Sbjct: 259 VENK---GWTSVYYYPEEAVDLGLSVRW-----ASMNVGASNQYDDGIYTSWGDA-YGTN 309 Query: 268 KWTDNLF 274 T + + Sbjct: 310 TSTSSSY 316 >gi|296414372|ref|XP_002836875.1| hypothetical protein [Tuber melanosporum Mel28] gi|295632717|emb|CAZ81066.1| unnamed protein product [Tuber melanosporum] Length = 1821 Score = 36.3 bits (82), Expect = 6.2, Method: Composition-based stats. Identities = 12/128 (9%), Positives = 31/128 (24%), Gaps = 19/128 (14%) Query: 190 ARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPY--LKLFARANYDHSYPEFSKE 247 + G P++ + QR A A + +P NY+ + Sbjct: 138 GSGYNGSQHQPYNPAAYQAQPQRHATSAGFGVSPEYQPQRHATTAGVGNYNSGQFSPTTA 197 Query: 248 TISNNPMDNGIWDNSLWDEV--------------KWTDN---LFVTKKKLFQFCQNVVAY 290 T + + + + + + + + ++ Sbjct: 198 TYNPSTFNTASYPTGTYRTGAVPQTAPLQSHPSQSFGGGPGYSQYATRPQYATTSSLYGS 257 Query: 291 GNCLAVGC 298 G + G Sbjct: 258 GQPYSPGL 265 >gi|220914641|ref|YP_002489950.1| hypothetical protein Achl_3909 [Arthrobacter chlorophenolicus A6] gi|219861519|gb|ACL41861.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 1838 Score = 35.9 bits (81), Expect = 8.0, Method: Composition-based stats. Identities = 11/124 (8%), Positives = 27/124 (21%), Gaps = 1/124 (0%) Query: 174 DYEGTFWQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFA 233 ++ S S + + Y + P Q + + Sbjct: 787 GSPNGVYRYGASSSFPNQVYDAEYYWVDLSFTPSGAVQPAVSSVSP-INNAADVSTGIKP 845 Query: 234 RANYDHSYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTKKKLFQFCQNVVAYGNC 293 A ++ + S + + + +D T + G Sbjct: 846 SATFNQTVTATSVVFSVKSSAGANVAGTTTYDPATNTSTFTPAASLGYSTTYTATVSGAT 905 Query: 294 LAVG 297 + G Sbjct: 906 NSAG 909 >gi|195450617|ref|XP_002072561.1| GK13659 [Drosophila willistoni] gi|194168646|gb|EDW83547.1| GK13659 [Drosophila willistoni] Length = 3016 Score = 35.9 bits (81), Expect = 8.3, Method: Composition-based stats. Identities = 42/321 (13%), Positives = 88/321 (27%), Gaps = 38/321 (11%) Query: 6 SLPAGFSWSTESGDGLSTLCVFLSTLGEVAVYGGDNPDDSSSFSL--KAIYHIGRPLGKR 63 S+ G S + G GL V+ G + + S + G + + Sbjct: 1080 SVLGGLSKGIKGGLGLDG------------VFSGSSSTEMSQPQVNNSGKKVFGFGIASK 1127 Query: 64 AIVFVKNDVWIATNNGLISMKNILLQEEKTNLPLSWPI---QEQWEQAIMVAPTGWSMIL 120 + V + ++N + + + I + + ++ T S Sbjct: 1128 FVPSVGG--LLNSSNKFVP-TEPEKETSAIVATTTSAISHEETAFSNVQLLVATTAST-- 1182 Query: 121 WEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTF- 179 + + S N L + + + S +Y ++L FG+ + Sbjct: 1183 NQTSTAHINSNNSNDLPINFVTEVCPITKTQIS-----KEVAYHDNYDDLQFGNDLTSVD 1237 Query: 180 --WQGD-ISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARAN 236 ++ D DN I Y P+ ++ + + A LS + Sbjct: 1238 NSYEYDYTGYCDNGE-IIYSYEDPYKEVENNINVEMNAQYISLSSKPMGEATSLQKPLGL 1296 Query: 237 YDHSYPEFSKETISNNPMDNGIWDNSLWDEVK---WTDNLFVTKKKLFQFCQNVVAYGNC 293 P E + N S + L + Q V + Sbjct: 1297 RSSDLPPPPVEAPITSTSTNSASSGSGVSTATSGMFGSILGKAAAAVQSATQAVNQGASS 1356 Query: 294 LAVGC---VIVSSGKSINDIQ 311 +A+ + SS + + Sbjct: 1357 VALAVAQKAVSSSDQKVTAQS 1377 >gi|294933954|ref|XP_002780918.1| hypothetical protein Pmar_PMAR025346 [Perkinsus marinus ATCC 50983] gi|239891065|gb|EER12713.1| hypothetical protein Pmar_PMAR025346 [Perkinsus marinus ATCC 50983] Length = 713 Score = 35.9 bits (81), Expect = 8.6, Method: Composition-based stats. Identities = 15/197 (7%), Positives = 40/197 (20%), Gaps = 25/197 (12%) Query: 120 LWEKRNMLLISCPKNSLLSDKTFVMNVAHNNRWSSFHNWFTQSYVVANENLFFGDYEGTF 179 + + +++ + F+ S +G + T Sbjct: 363 IRPGGRIRIVAPSDGTEEGSFKFIRYTLEC-----VITLQQVSGDRGQLTTLWGSDDVTC 417 Query: 180 WQGDISGSDNARPFIGIYLSPFHAIDPRLGFQRRACVAHLSLQAYQRPYLKLFARANYDH 239 + + + I + A +++ RP + + Y Sbjct: 418 IVEEDAKNVANVNIIST---------------TKVAQAPVNVAPSSRPVQSIRSGLLYRV 462 Query: 240 SYPEFSKETISNNPMDNGIWDNSLWDEVKWTDNLFVTK-KKLFQFCQNVVAYGNCLAVGC 298 IW +D A G + V Sbjct: 463 VAV---VYNPQQANQQAAIWGIETYDNTFTNVGGNPDVFNSEAALLDAGTATGFPITVDL 519 Query: 299 -VIVSSGKSINDIQINN 314 + + +++ Sbjct: 520 YRFLITKGDGPQEALDS 536 >gi|320583430|gb|EFW97643.1| calcium-transporting ATPase, putative [Pichia angusta DL-1] Length = 1166 Score = 35.5 bits (80), Expect = 9.5, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 30 TLGEVAVYGGDNPDDSSSFSLKAI-YHIGRPLGKRAIVFVKNDVWIATNNGLISMKNILL 88 + GEV GD +D+ + L + + +G + + +D+ I T + S+ N + Sbjct: 784 SQGEVVAVTGDGTNDAPALKLADVGFSMG--IAGTEVAREASDI-ILTTDDFSSIVNAIK 840 Query: 89 QEEKTNLPLSWPIQEQWEQAIMV 111 + +Q Q + Sbjct: 841 WGRTVATSIRKFVQFQLTVNVTA 863 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.305 0.128 0.307 Lambda K H 0.267 0.0396 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,811,272,983 Number of Sequences: 14124377 Number of extensions: 45918789 Number of successful extensions: 198276 Number of sequences better than 10.0: 206 Number of HSP's better than 10.0 without gapping: 35 Number of HSP's successfully gapped in prelim test: 171 Number of HSP's that attempted gapping in prelim test: 197761 Number of HSP's gapped (non-prelim): 563 length of query: 320 length of database: 4,842,793,630 effective HSP length: 139 effective length of query: 181 effective length of database: 2,879,505,227 effective search space: 521190446087 effective search space used: 521190446087 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 81 (35.9 bits)