BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1064_1 (238 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done Results from round 1 >gi|315122526|ref|YP_004063015.1| hypothetical protein CKC_03890 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495928|gb|ADR52527.1| hypothetical protein CKC_03890 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 389 Score = 259 bits (661), Expect = 3e-67, Method: Compositional matrix adjust. Identities = 130/241 (53%), Positives = 174/241 (72%), Gaps = 3/241 (1%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGKQ++ L+PDPKA+AS+QLS N+ N+ N+ R N++ +TPDGI +Y GVDK+ID F Sbjct: 1 MGKQQSFLSPDPKAVASMQLSENINNSLFNSSRANMNEITPDGILRYTQEGVDKMIDPFS 60 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPH-HHTHVNTTKDFP 119 G+E+SIP Y +SY L P+ Q ++NR+N N++L S+LLTQR+Q+ +P + +N + Sbjct: 61 GQELSIPRYSRSYELSPVAQDLYNRRNANHILFSNLLTQRLQNFMPSPQNNSMNLQQPLA 120 Query: 120 PQQLRDNDVPEKPN--ASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKL 177 N +PE N + E+ + ILY+Y QQYE LDRLQ L+QDREDLET+L Sbjct: 121 IPDPAHNPIPEGTNHFSQPEQEEGILYDYGKNNGQQYENTLLDRLQPRLKQDREDLETRL 180 Query: 178 HNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237 NQGL+ GSV+WNR IDE NRKL+D RLAA+LK+S+EQERLDN++EK AYFHN AQAQ Sbjct: 181 SNQGLMPGSVSWNRTIDENNRKLNDARLAALLKSSEEQERLDNMREKQAYFHNFAQAQSH 240 Query: 238 Q 238 Q Sbjct: 241 Q 241 >gi|227822435|ref|YP_002826407.1| hypothetical protein NGR_c18900 [Sinorhizobium fredii NGR234] gi|227341436|gb|ACP25654.1| hypothetical protein NGR_c18900 [Sinorhizobium fredii NGR234] Length = 453 Score = 119 bits (298), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 88/240 (36%), Positives = 124/240 (51%), Gaps = 12/240 (5%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGK +A PDPKA A+ Q + N+ A AN Y N++++TPDG Y + K D Sbjct: 1 MGKSKAPTPPDPKATAAAQTATNIGTAVANGYMGNVNQVTPDGSLTYSYT-KQKWTDPLS 59 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQ---RIQDLL--PHHHTHVNTT 115 G +P + +L +Q +I + + +L L+ L T R+ DLL P + Sbjct: 60 GNVYDLPVATATQTLSEMQDKIKKQNDQASLNLATLATSQSSRLNDLLGKPMDISKAPAA 119 Query: 116 KDFP----PQQLRDNDVPEKPNASLEERKEILYNYPT-MGSQQYEKAFLDRLQSSLQQDR 170 D PQ + + PE S+ I +Y T + +YE A + RL L++DR Sbjct: 120 GDHSKLTLPQYQQFSAGPEL-QTSVGNAGNIARSYETDFDTSKYENALMARLNPQLERDR 178 Query: 171 EDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHN 230 LET+L NQGL GS A+NRAIDE NR +D R+AA+L A EQ RL N+ + A F N Sbjct: 179 AALETRLANQGLQPGSEAYNRAIDEANRTSNDARIAAVLNAGQEQTRLANLANQKASFEN 238 >gi|265985067|ref|ZP_06097802.1| conserved hypothetical protein [Brucella sp. 83/13] gi|264663659|gb|EEZ33920.1| conserved hypothetical protein [Brucella sp. 83/13] Length = 299 Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 75/247 (30%), Positives = 119/247 (48%), Gaps = 23/247 (9%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGK +A +PDPK ++ Q N+ A AN+Y N++++TPDG Y +G+ K D + Sbjct: 1 MGKSKAPKSPDPKETSAAQTGTNIGTAVANSYLNNVNQVTPDGSLTYSQTGMQKYYDPYT 60 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLL---TQRIQDLL---------PHH 108 G+ IP + + L QQ I ++++ NL L L + R+ DLL P Sbjct: 61 GKSYDIPQFTATQQLSQQQQAIKDQEDATNLNLGKLANSQSSRLNDLLGKPFDLSGAPAA 120 Query: 109 HTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQ 168 N T PQ + P+ + ++ + Q+ E A + R+ L+Q Sbjct: 121 GNAGNMTA---PQYQQYTGGPQLQTSYTDDF--------SADRQKVEDALMSRINPQLEQ 169 Query: 169 DREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYF 228 DR LE +L NQG++ GS A+ A+++ + +D R+ A+L EQ RL + A F Sbjct: 170 DRSALEQRLANQGIMPGSKAFETAMNQNAQASNDARMQAILAGGQEQSRLAGLSRDQATF 229 Query: 229 HNLAQAQ 235 N A Q Sbjct: 230 GNNANQQ 236 >gi|150397020|ref|YP_001327487.1| hypothetical protein Smed_1817 [Sinorhizobium medicae WSM419] gi|150028535|gb|ABR60652.1| hypothetical protein Smed_1817 [Sinorhizobium medicae WSM419] Length = 532 Score = 105 bits (262), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 82/269 (30%), Positives = 119/269 (44%), Gaps = 46/269 (17%) Query: 9 APDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPH 68 APDPK AS Q + N+ A AN N +++TPDG Y T K D G+E + Sbjct: 10 APDPKQTASAQTATNIGTAVANNVMGNANQVTPDGNLTY-TYNTQKWTDPLSGKEYDLKV 68 Query: 69 YLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDL-----------------------L 105 + +L P QQ I ++++ L L+ L + L L Sbjct: 69 PTATQTLSPAQQAIKDQEDAAQLNLATLANTQSGKLNGLLASKFDISGAPAAGKSDAIGL 128 Query: 106 PHHHTHVNTTK------------------DFPPQQLRDNDVPEKPNASLEERKEILYNYP 147 P + + + K Q+ D+ K SL I +Y Sbjct: 129 PQYQSFTSGPKLQTSLANAGNVQSSIAGAGSIQSQVADSG---KIQTSLGNAGNITESYD 185 Query: 148 -TMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLA 206 + + +YE+A +DRL +++DR LETKL NQGL GS A++RA+DE NR +D R+ Sbjct: 186 FDIDTSKYEQALMDRLSPQIERDRAALETKLTNQGLQPGSEAYDRAMDEANRAANDARIG 245 Query: 207 AMLKASDEQERLDNIQEKHAYFHNLAQAQ 235 A L A EQ R+ + + A F N AQ Q Sbjct: 246 ATLSAGQEQSRIAGLAQNQAQFQNSAQQQ 274 >gi|316933872|ref|YP_004108854.1| hypothetical protein Rpdx1_2530 [Rhodopseudomonas palustris DX-1] gi|315601586|gb|ADU44121.1| hypothetical protein Rpdx1_2530 [Rhodopseudomonas palustris DX-1] Length = 341 Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust. Identities = 47/185 (25%), Positives = 68/185 (36%), Gaps = 38/185 (20%) Query: 9 APDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPH 68 APDP A Q NL +D++TP G Y +G + + G+ ++P Sbjct: 9 APDPVKTAEAQGQMNLTTGVQQQLLNMVDQVTPTGSLTYSQNGTTSFVGA-DGKTYTVPR 67 Query: 69 YLQSYSLHPIQQQIHNRQNINNLLLSDLL---TQRIQDLLPHHHTHVNTTKDFPPQQLRD 125 + + +L P QQ + + N L + + +I LL + N Sbjct: 68 FTSTQTLTPAQQALLDLSNKTQANLGQIGVDQSAKIGSLLGTNLKLGN------------ 115 Query: 126 NDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSG 185 E A L E +GS RL Q E L T+L NQG+ G Sbjct: 116 ----EATEARLME----------LGSA--------RLDPKFAQSEEALRTRLANQGIQPG 153 Query: 186 SVAWN 190 S AWN Sbjct: 154 SAAWN 158 >gi|218510551|ref|ZP_03508429.1| hypothetical protein RetlB5_25766 [Rhizobium etli Brasil 5] Length = 271 Score = 45.4 bits (106), Expect = 0.006, Method: Compositional matrix adjust. Identities = 50/187 (26%), Positives = 86/187 (45%), Gaps = 18/187 (9%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGKSTMTDQN-GKTY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 ++P Y +L P Q I+++ L L+ L + Q + T+V+ + + + Sbjct: 65 NLPVYSAYQTLSPQNQAIYDQSQQTQLGLAKLANDQTQKVSGILGTNVDLSSGNVDKYVN 124 Query: 125 D------NDVPEKPNASLEER---KEILYNYPTMGSQQYEKA---FLDRLQSSLQQDRED 172 D ++ ++ ASL++ K I MGS Y+ A F R Q++ Q D Sbjct: 125 DHWRAGFDNQWDREQASLDQSLADKGI-----AMGSAAYDNAMRDFTTRKQAAADQYLGD 179 Query: 173 LETKLHN 179 + + N Sbjct: 180 MYSNAQN 186 >gi|319783503|ref|YP_004142979.1| hypothetical protein Mesci_3812 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169391|gb|ADV12929.1| hypothetical protein Mesci_3812 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 330 Score = 45.1 bits (105), Expect = 0.008, Method: Compositional matrix adjust. Identities = 25/84 (29%), Positives = 42/84 (50%) Query: 13 KAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQS 72 K ++ + N+ A ANA N++++TPDG Y +G K D + G+ IP Y + Sbjct: 13 KETSAASTATNVGTAIANANLGNVNQVTPDGSLNYSQTGTYKWNDPYTGKSYDIPTYTAT 72 Query: 73 YSLHPIQQQIHNRQNINNLLLSDL 96 +L Q I ++ + L L +L Sbjct: 73 QTLSGTGQAIKDQTDQAKLNLGEL 96 >gi|116253668|ref|YP_769506.1| hypothetical protein RL3928 [Rhizobium leguminosarum bv. viciae 3841] gi|115258316|emb|CAK09418.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 335 Score = 43.9 bits (102), Expect = 0.017, Method: Compositional matrix adjust. Identities = 51/187 (27%), Positives = 84/187 (44%), Gaps = 18/187 (9%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK SG + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVSGYQTMKDQN-GKTY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 +P Y + P Q I+++ L LS L ++ + T+V+ + + Sbjct: 65 QLPTYSAYQTYSPQNQAIYDQTQQTQLGLSKLANEQTGKISGILGTNVDLSAGNVDKYAN 124 Query: 125 D------NDVPEKPNASLEER---KEILYNYPTMGSQQYEKA---FLDRLQSSLQQDRED 172 D N+ ++ ASL++ K I +MGS+ Y A F R Q++ Q D Sbjct: 125 DHWQGGFNNQWDRDQASLDQSLADKGI-----SMGSEAYNNALRDFSTRKQAASDQFLGD 179 Query: 173 LETKLHN 179 + + N Sbjct: 180 MYSNAQN 186 >gi|218673260|ref|ZP_03522929.1| hypothetical protein RetlG_17541 [Rhizobium etli GR56] Length = 334 Score = 40.8 bits (94), Expect = 0.14, Method: Compositional matrix adjust. Identities = 49/187 (26%), Positives = 84/187 (44%), Gaps = 18/187 (9%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGYQTMKDQN-GKSY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 +P Y + P Q I+++ L L+ L + + T+V+ + + + Sbjct: 65 QLPTYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTGKISGILGTNVDLSAGNVDKYVN 124 Query: 125 D------NDVPEKPNASLEER---KEILYNYPTMGSQQYEKA---FLDRLQSSLQQDRED 172 + N+ ++ ASLE+ K I +MGS Y+ A F R Q++ Q D Sbjct: 125 NHWQSGFNNQWDRDQASLEQSLADKGI-----SMGSAAYDNAMRDFSTRKQAASDQYLGD 179 Query: 173 LETKLHN 179 + + N Sbjct: 180 MYSNAQN 186 >gi|86356745|ref|YP_468637.1| hypothetical protein RHE_CH01103 [Rhizobium etli CFN 42] gi|86280847|gb|ABC89910.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 334 Score = 40.8 bits (94), Expect = 0.15, Method: Compositional matrix adjust. Identities = 50/196 (25%), Positives = 79/196 (40%), Gaps = 41/196 (20%) Query: 9 APDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPH 68 APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ +P Sbjct: 10 APDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGYQTMTDQN-GKTYKLPT 68 Query: 69 YLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDV 128 Y + P Q I+++ L L+ L ND Sbjct: 69 YSAYQTYSPENQAIYDQTQQTQLGLARLA----------------------------NDQ 100 Query: 129 PEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQ----QDREDLETKLHNQGLVS 184 K + IL + + + +K D QS +D+ LE L ++G+ Sbjct: 101 TAKVSG-------ILGSNVDLSAGNVDKYVNDHWQSGFNNQWDRDQASLEQSLADKGIAI 153 Query: 185 GSVAWNRAI-DETNRK 199 GS A++ A+ D T RK Sbjct: 154 GSAAYDNAMRDFTTRK 169 >gi|152982946|ref|YP_001353886.1| hypothetical protein mma_2196 [Janthinobacterium sp. Marseille] gi|151283023|gb|ABR91433.1| Hypothetical protein mma_2196 [Janthinobacterium sp. Marseille] Length = 305 Score = 39.7 bits (91), Expect = 0.36, Method: Compositional matrix adjust. Identities = 20/61 (32%), Positives = 33/61 (54%) Query: 151 SQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLK 210 Q + A + R Q ++Q R+ L+ +L NQG+ GS A+N A+ + +D+R A L Sbjct: 138 GQTGQDALMARFQPQMEQSRKALDAQLANQGITQGSEAYNNAMRTQQQGENDLRSQAALN 197 Query: 211 A 211 Sbjct: 198 G 198 >gi|209548343|ref|YP_002280260.1| hypothetical protein Rleg2_0738 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534099|gb|ACI54034.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 334 Score = 39.3 bits (90), Expect = 0.45, Method: Compositional matrix adjust. Identities = 52/189 (27%), Positives = 86/189 (45%), Gaps = 22/189 (11%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA +++ TPDG +YK +G + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSYVNQYTPDGSLEYKVTGQQTMTDQN-GKTY 64 Query: 65 SIP--HYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQ 122 IP Q+YS P Q I+++ L L+ L + + T+V+ + + Sbjct: 65 QIPIRSAYQTYS--PQNQAIYDQTQQTQLGLAKLANDQTGKISGILGTNVDLSAGNVDKY 122 Query: 123 LRD------NDVPEKPNASLEER---KEILYNYPTMGSQQYEKA---FLDRLQSSLQQDR 170 + D N+ ++ ASL++ K I +MGS Y+ A F R Q++ Q Sbjct: 123 VNDHWQSGFNNQWDRDQASLDQSLADKGI-----SMGSAAYDNAMRDFSTRKQAASDQYL 177 Query: 171 EDLETKLHN 179 D+ + N Sbjct: 178 GDMYSNAQN 186 >gi|149247691|ref|XP_001528254.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL YB-4239] gi|146448208|gb|EDK42596.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL YB-4239] Length = 1961 Score = 37.4 bits (85), Expect = 1.6, Method: Compositional matrix adjust. Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 14/83 (16%) Query: 151 SQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSG------------SVAWNRAIDETNR 198 ++Q + F+DRL+S+++ DRED+ N G V G SV DE +R Sbjct: 1330 TKQDQDGFIDRLKSTIRIDREDIAFGAFN-GFVQGMAEFCSTLNYYASVPVGPEFDE-DR 1387 Query: 199 KLHDVRLAAMLKASDEQERLDNI 221 H ++L A +KA+ + E + Sbjct: 1388 MFHQIKLKAQIKAAGKPEYFNTF 1410 >gi|327191473|gb|EGE58493.1| hypothetical protein RHECNPAF_300003 [Rhizobium etli CNPAF512] Length = 335 Score = 36.6 bits (83), Expect = 3.1, Method: Compositional matrix adjust. Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 4/104 (3%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK + I+ G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTS-KSIMKDQNGKTY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLL---TQRIQDLL 105 +P Y + P Q I+++ L L+ L T +I +L Sbjct: 65 ELPVYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTGKISGIL 108 >gi|84688082|ref|ZP_01015940.1| Oligoendopeptidase F [Maritimibacter alkaliphilus HTCC2654] gi|84663910|gb|EAQ10416.1| Oligoendopeptidase F [Rhodobacterales bacterium HTCC2654] Length = 614 Score = 35.4 bits (80), Expect = 7.4, Method: Compositional matrix adjust. Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 10/71 (14%) Query: 156 KAFLDRLQSSL-QQDREDLETKLHNQGLVSGSVAWNRAIDETNRKL--------HDVRLA 206 K DRL++ Q ++LE LH+Q V GS AWN+ DET +L D+ L Sbjct: 155 KPIFDRLRAMRPYQLSDELEKFLHDQSAV-GSAAWNKLFDETMARLTFSPEGEDEDLSLE 213 Query: 207 AMLKASDEQER 217 A L EQ+R Sbjct: 214 ATLNLLSEQKR 224 Searching..................................................done Results from round 2 >gi|315122526|ref|YP_004063015.1| hypothetical protein CKC_03890 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495928|gb|ADR52527.1| hypothetical protein CKC_03890 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 389 Score = 309 bits (792), Expect = 2e-82, Method: Composition-based stats. Identities = 130/241 (53%), Positives = 174/241 (72%), Gaps = 3/241 (1%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGKQ++ L+PDPKA+AS+QLS N+ N+ N+ R N++ +TPDGI +Y GVDK+ID F Sbjct: 1 MGKQQSFLSPDPKAVASMQLSENINNSLFNSSRANMNEITPDGILRYTQEGVDKMIDPFS 60 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPH-HHTHVNTTKDFP 119 G+E+SIP Y +SY L P+ Q ++NR+N N++L S+LLTQR+Q+ +P + +N + Sbjct: 61 GQELSIPRYSRSYELSPVAQDLYNRRNANHILFSNLLTQRLQNFMPSPQNNSMNLQQPLA 120 Query: 120 PQQLRDNDVPEKPN--ASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKL 177 N +PE N + E+ + ILY+Y QQYE LDRLQ L+QDREDLET+L Sbjct: 121 IPDPAHNPIPEGTNHFSQPEQEEGILYDYGKNNGQQYENTLLDRLQPRLKQDREDLETRL 180 Query: 178 HNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237 NQGL+ GSV+WNR IDE NRKL+D RLAA+LK+S+EQERLDN++EK AYFHN AQAQ Sbjct: 181 SNQGLMPGSVSWNRTIDENNRKLNDARLAALLKSSEEQERLDNMREKQAYFHNFAQAQSH 240 Query: 238 Q 238 Q Sbjct: 241 Q 241 >gi|227822435|ref|YP_002826407.1| hypothetical protein NGR_c18900 [Sinorhizobium fredii NGR234] gi|227341436|gb|ACP25654.1| hypothetical protein NGR_c18900 [Sinorhizobium fredii NGR234] Length = 453 Score = 249 bits (634), Expect = 3e-64, Method: Composition-based stats. Identities = 88/240 (36%), Positives = 124/240 (51%), Gaps = 12/240 (5%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGK +A PDPKA A+ Q + N+ A AN Y N++++TPDG Y + K D Sbjct: 1 MGKSKAPTPPDPKATAAAQTATNIGTAVANGYMGNVNQVTPDGSLTYSYT-KQKWTDPLS 59 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQ---RIQDLL--PHHHTHVNTT 115 G +P + +L +Q +I + + +L L+ L T R+ DLL P + Sbjct: 60 GNVYDLPVATATQTLSEMQDKIKKQNDQASLNLATLATSQSSRLNDLLGKPMDISKAPAA 119 Query: 116 KDFP----PQQLRDNDVPEKPNASLEERKEILYNYPT-MGSQQYEKAFLDRLQSSLQQDR 170 D PQ + + PE S+ I +Y T + +YE A + RL L++DR Sbjct: 120 GDHSKLTLPQYQQFSAGPEL-QTSVGNAGNIARSYETDFDTSKYENALMARLNPQLERDR 178 Query: 171 EDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHN 230 LET+L NQGL GS A+NRAIDE NR +D R+AA+L A EQ RL N+ + A F N Sbjct: 179 AALETRLANQGLQPGSEAYNRAIDEANRTSNDARIAAVLNAGQEQTRLANLANQKASFEN 238 >gi|150397020|ref|YP_001327487.1| hypothetical protein Smed_1817 [Sinorhizobium medicae WSM419] gi|150028535|gb|ABR60652.1| hypothetical protein Smed_1817 [Sinorhizobium medicae WSM419] Length = 532 Score = 225 bits (574), Expect = 3e-57, Method: Composition-based stats. Identities = 82/274 (29%), Positives = 120/274 (43%), Gaps = 46/274 (16%) Query: 6 ASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREIS 65 A APDPK AS Q + N+ A AN N +++TPDG Y + K D G+E Sbjct: 7 APEAPDPKQTASAQTATNIGTAVANNVMGNANQVTPDGNLTYTYN-TQKWTDPLSGKEYD 65 Query: 66 IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDL--------------------- 104 + + +L P QQ I ++++ L L+ L + L Sbjct: 66 LKVPTATQTLSPAQQAIKDQEDAAQLNLATLANTQSGKLNGLLASKFDISGAPAAGKSDA 125 Query: 105 --LPHHHTHVNTTK------------------DFPPQQLRDNDVPEKPNASLEERKEILY 144 LP + + + K Q+ D+ K SL I Sbjct: 126 IGLPQYQSFTSGPKLQTSLANAGNVQSSIAGAGSIQSQVADSG---KIQTSLGNAGNITE 182 Query: 145 NYP-TMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDV 203 +Y + + +YE+A +DRL +++DR LETKL NQGL GS A++RA+DE NR +D Sbjct: 183 SYDFDIDTSKYEQALMDRLSPQIERDRAALETKLTNQGLQPGSEAYDRAMDEANRAANDA 242 Query: 204 RLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237 R+ A L A EQ R+ + + A F N AQ Q Sbjct: 243 RIGATLSAGQEQSRIAGLAQNQAQFQNSAQQQAY 276 >gi|265985067|ref|ZP_06097802.1| conserved hypothetical protein [Brucella sp. 83/13] gi|264663659|gb|EEZ33920.1| conserved hypothetical protein [Brucella sp. 83/13] Length = 299 Score = 218 bits (554), Expect = 5e-55, Method: Composition-based stats. Identities = 73/244 (29%), Positives = 117/244 (47%), Gaps = 17/244 (6%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 MGK +A +PDPK ++ Q N+ A AN+Y N++++TPDG Y +G+ K D + Sbjct: 1 MGKSKAPKSPDPKETSAAQTGTNIGTAVANSYLNNVNQVTPDGSLTYSQTGMQKYYDPYT 60 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLL---TQRIQDLLPH------HHTH 111 G+ IP + + L QQ I ++++ NL L L + R+ DLL Sbjct: 61 GKSYDIPQFTATQQLSQQQQAIKDQEDATNLNLGKLANSQSSRLNDLLGKPFDLSGAPAA 120 Query: 112 VNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDRE 171 N PQ + P+ + ++ + Q+ E A + R+ L+QDR Sbjct: 121 GNAGNMTAPQYQQYTGGPQLQTSYTDDF--------SADRQKVEDALMSRINPQLEQDRS 172 Query: 172 DLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNL 231 LE +L NQG++ GS A+ A+++ + +D R+ A+L EQ RL + A F N Sbjct: 173 ALEQRLANQGIMPGSKAFETAMNQNAQASNDARMQAILAGGQEQSRLAGLSRDQATFGNN 232 Query: 232 AQAQ 235 A Q Sbjct: 233 ANQQ 236 >gi|316933872|ref|YP_004108854.1| hypothetical protein Rpdx1_2530 [Rhodopseudomonas palustris DX-1] gi|315601586|gb|ADU44121.1| hypothetical protein Rpdx1_2530 [Rhodopseudomonas palustris DX-1] Length = 341 Score = 168 bits (424), Expect = 7e-40, Method: Composition-based stats. Identities = 48/216 (22%), Positives = 72/216 (33%), Gaps = 38/216 (17%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFI 60 M APDP A Q NL +D++TP G Y +G + + Sbjct: 1 MDTPEPPAAPDPVKTAEAQGQMNLTTGVQQQLLNMVDQVTPTGSLTYSQNGTTSFVGA-D 59 Query: 61 GREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLL---TQRIQDLLPHHHTHVNTTKD 117 G+ ++P + + +L P QQ + + N L + + +I LL + N Sbjct: 60 GKTYTVPRFTSTQTLTPAQQALLDLSNKTQANLGQIGVDQSAKIGSLLGTNLKLGN---- 115 Query: 118 FPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKL 177 E A L E RL Q E L T+L Sbjct: 116 ------------EATEARLMELGS------------------ARLDPKFAQSEEALRTRL 145 Query: 178 HNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASD 213 NQG+ GS AWN + + +D +L Sbjct: 146 ANQGIQPGSAAWNAEMKSFSEGKNDAYNQLLLSGRQ 181 >gi|319783503|ref|YP_004142979.1| hypothetical protein Mesci_3812 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169391|gb|ADV12929.1| hypothetical protein Mesci_3812 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 330 Score = 131 bits (328), Expect = 1e-28, Method: Composition-based stats. Identities = 47/219 (21%), Positives = 86/219 (39%), Gaps = 32/219 (14%) Query: 13 KAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQS 72 K ++ + N+ A ANA N++++TPDG Y +G K D + G+ IP Y + Sbjct: 13 KETSAASTATNVGTAIANANLGNVNQVTPDGSLNYSQTGTYKWNDPYTGKSYDIPTYTAT 72 Query: 73 YSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKP 132 +L Q I ++ + L L +L + L V+ + D Sbjct: 73 QTLSGTGQAIKDQTDQAKLNLGELAAGQSSFLKDWLAKPVDLSND-------------AT 119 Query: 133 NASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRA 192 L + + RLQ +L R+ E L N+G+ GS + +A Sbjct: 120 EGRLMDLG------------------MKRLQPALDARRQANEADLINRGIRPGSDNYAQA 161 Query: 193 IDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNL 231 + ++ +D ++L + Q + + + A +NL Sbjct: 162 QNIQDQGENDA-YNSLLLSGRGQAVQEALAQNSAPINNL 199 >gi|110632598|ref|YP_672806.1| hypothetical protein Meso_0237 [Mesorhizobium sp. BNC1] gi|110283582|gb|ABG61641.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 322 Score = 98.4 bits (243), Expect = 6e-19, Method: Composition-based stats. Identities = 52/238 (21%), Positives = 80/238 (33%), Gaps = 53/238 (22%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N A I + TP G Y+ +G I D G+ I Sbjct: 2 KAPKAPDPWQTAAAQGAWNSFTAQQQQSMNMIGQNTPWGSLDYQQTGSTWITDP-TGKRI 60 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 +P Y + +L P QQ I R L+ + Sbjct: 61 EMPTYTANVNLSPEQQAIFERTQAAEGNLAQIA--------------------------- 93 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSL----QQDREDLETKLHNQ 180 E E L ++ E+ D + +Q+++ L T+L N Sbjct: 94 --------QDQSEWLGEYLQEPFEFNNRDAEEWVWDLASPRILQQQEQNQQALRTQLINS 145 Query: 181 GLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGLQ 238 GL G+ AW D + + A+ +Q + + F N A AQ Q Sbjct: 146 GLRPGTTAW------------DAEMTRLTNANTDQMNQLALTGRQMAF-NEALAQRNQ 190 >gi|218673260|ref|ZP_03522929.1| hypothetical protein RetlG_17541 [Rhizobium etli GR56] Length = 334 Score = 97.7 bits (241), Expect = 1e-18, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 79/212 (37%), Gaps = 35/212 (16%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGYQTMKDQ-NGKSY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 +P Y + P Q I+++ L L+ L + + T+V+ + + Sbjct: 65 QLPTYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTGKISGILGTNVDLSAGNVDKY-- 122 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVS 184 + + +D+ LE L ++G+ Sbjct: 123 -----------------------------VNNHWQSGFNNQWDRDQASLEQSLADKGISM 153 Query: 185 GSVAWNRAIDE---TNRKLHDVRLAAMLKASD 213 GS A++ A+ + + D L M + Sbjct: 154 GSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQ 185 >gi|209548343|ref|YP_002280260.1| hypothetical protein Rleg2_0738 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534099|gb|ACI54034.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 334 Score = 96.5 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 81/212 (38%), Gaps = 35/212 (16%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA +++ TPDG +YK +G + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSYVNQYTPDGSLEYKVTGQQTMTDQ-NGKTY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 IP + P Q I+++ L L+ L + + T+V+ + + + Sbjct: 65 QIPIRSAYQTYSPQNQAIYDQTQQTQLGLAKLANDQTGKISGILGTNVDLSAGNVDKYVN 124 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVS 184 D+ + + +D+ L+ L ++G+ Sbjct: 125 DH-------------------------------WQSGFNNQWDRDQASLDQSLADKGISM 153 Query: 185 GSVAWNRAIDE---TNRKLHDVRLAAMLKASD 213 GS A++ A+ + + D L M + Sbjct: 154 GSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQ 185 >gi|327191473|gb|EGE58493.1| hypothetical protein RHECNPAF_300003 [Rhizobium etli CNPAF512] Length = 335 Score = 96.5 bits (238), Expect = 3e-18, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 78/212 (36%), Gaps = 35/212 (16%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK + + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTSKSIMKDQ-NGKTY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 +P Y + P Q I+++ L L+ L + + T+V+ + + Sbjct: 65 ELPVYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTGKISGILGTNVDLSAGNVDKY-- 122 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVS 184 + + +D+ LE L ++G+ Sbjct: 123 -----------------------------VNNHWQSGFDNQWNRDQASLEQSLADKGIAM 153 Query: 185 GSVAWNRAIDE---TNRKLHDVRLAAMLKASD 213 GS A++ A+ + + D L M + Sbjct: 154 GSAAYDNAMRDFSTRKQAASDQYLGDMYSNAQ 185 >gi|116253668|ref|YP_769506.1| hypothetical protein RL3928 [Rhizobium leguminosarum bv. viciae 3841] gi|115258316|emb|CAK09418.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 335 Score = 96.5 bits (238), Expect = 3e-18, Method: Composition-based stats. Identities = 46/212 (21%), Positives = 82/212 (38%), Gaps = 35/212 (16%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK SG + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVSGYQTMKDQ-NGKTY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 +P Y + P Q I+++ L LS L ++ + T+V+ + + Sbjct: 65 QLPTYSAYQTYSPQNQAIYDQTQQTQLGLSKLANEQTGKISGILGTNVDLSAGNVDKYAN 124 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVS 184 D+ + + +D+ L+ L ++G+ Sbjct: 125 DH-------------------------------WQGGFNNQWDRDQASLDQSLADKGISM 153 Query: 185 GSVAWNRAIDE---TNRKLHDVRLAAMLKASD 213 GS A+N A+ + + D L M + Sbjct: 154 GSEAYNNALRDFSTRKQAASDQFLGDMYSNAQ 185 >gi|86356745|ref|YP_468637.1| hypothetical protein RHE_CH01103 [Rhizobium etli CFN 42] gi|86280847|gb|ABC89910.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 334 Score = 92.7 bits (228), Expect = 4e-17, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 81/212 (38%), Gaps = 35/212 (16%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 + APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ Sbjct: 6 KPPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGYQTMTDQ-NGKTY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 +P Y + P Q I+++ L L+ L + + ++V+ + + + Sbjct: 65 KLPTYSAYQTYSPENQAIYDQTQQTQLGLARLANDQTAKVSGILGSNVDLSAGNVDKYVN 124 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVS 184 D+ + + +D+ LE L ++G+ Sbjct: 125 DH-------------------------------WQSGFNNQWDRDQASLEQSLADKGIAI 153 Query: 185 GSVAWNRAIDE---TNRKLHDVRLAAMLKASD 213 GS A++ A+ + + D L M + Sbjct: 154 GSAAYDNAMRDFTTRKQAASDQYLGDMHSNAQ 185 >gi|218510551|ref|ZP_03508429.1| hypothetical protein RetlB5_25766 [Rhizobium etli Brasil 5] Length = 271 Score = 91.1 bits (224), Expect = 1e-16, Method: Composition-based stats. Identities = 47/209 (22%), Positives = 92/209 (44%), Gaps = 7/209 (3%) Query: 5 RASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREI 64 +A APDP A+ Q + N+ A ANA + ++ TPDG +YK +G + D G+ Sbjct: 6 KAPKAPDPTQTAAAQTATNVDTAIANAGLSHTNQYTPDGSLEYKVTGKSTMTDQ-NGKTY 64 Query: 65 SIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLR 124 ++P Y +L P Q I+++ L L+ L + Q + T+V+ + + + Sbjct: 65 NLPVYSAYQTLSPQNQAIYDQSQQTQLGLAKLANDQTQKVSGILGTNVDLSSGNVDKYVN 124 Query: 125 DNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVS 184 D+ N E+ + + G A+ + ++ + + + L + Sbjct: 125 DHWRAGFDNQWDREQASLDQSLADKGIAMGSAAYDNAMRDFTTRKQAAADQYLGDM---- 180 Query: 185 GSVAWNRAIDETNRKLHDVRLAAMLKASD 213 S A N + E N+ L++ ++A++ S Sbjct: 181 YSNAQNSILTERNQPLNE--ISALMSGSQ 207 >gi|152982946|ref|YP_001353886.1| hypothetical protein mma_2196 [Janthinobacterium sp. Marseille] gi|151283023|gb|ABR91433.1| Hypothetical protein mma_2196 [Janthinobacterium sp. Marseille] Length = 305 Score = 83.4 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 47/213 (22%), Positives = 77/213 (36%), Gaps = 41/213 (19%) Query: 2 GKQRASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIG 61 G APD A NL A A A ++++TP G Y D Sbjct: 24 GSPSPPPAPDYAGAAQQTAQGNLEAARAAAEANRVNQVTPYGNLTYSRDPNASTPDGG-- 81 Query: 62 REISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQ---DLLPHHHTHVNTTKDF 118 + + +L P QQ + ++QN +L L+ L + + L ++ T + D Sbjct: 82 -------WTATQTLLPAQQALLDQQNKTSLGLAGLADRGLGYVDQALSNNITAADLPADM 134 Query: 119 PPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH 178 Q + A + R Q ++Q R+ L+ +L Sbjct: 135 V-----------------------------NAGQTGQDALMARFQPQMEQSRKALDAQLA 165 Query: 179 NQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKA 211 NQG+ GS A+N A+ + +D+R A L Sbjct: 166 NQGITQGSEAYNNAMRTQQQGENDLRSQAALNG 198 >gi|126443127|ref|YP_001063336.1| hypothetical protein BURPS668_A2342 [Burkholderia pseudomallei 668] gi|126222618|gb|ABN86123.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 408 Score = 69.5 bits (168), Expect = 4e-10, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 63/212 (29%), Gaps = 25/212 (11%) Query: 6 ASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREIS 65 A APDP A+A+ N A N + P G Q G D + Sbjct: 36 APAAPDPYAVANATTQTNNQTAQFNKALNLNNYSNPFGSQQSTQIGTDPATGA------- 88 Query: 66 IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRD 125 P Y + + Q + N S + + + + + + Sbjct: 89 -PIYNTNITASGPLQSLIN---------STMGSA--GNANSTVNNALFGLGGLTARYDAL 136 Query: 126 NDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSG 185 N ++ G A L Q + LE++L NQGL G Sbjct: 137 NGKLGALAGQID--PNAAQLAGQRGQNAAYAAQTQYLDPRFSQGQTSLESQLANQGLTPG 194 Query: 186 SVAWNRAID----ETNRKLHDVRLAAMLKASD 213 S A++ A+ N+ D ++L Sbjct: 195 SQAYDNAMKNFNLSKNQAYSDAANQSILTGQQ 226 >gi|167907339|ref|ZP_02494544.1| hypothetical protein BpseN_34235 [Burkholderia pseudomallei NCTC 13177] Length = 399 Score = 68.0 bits (164), Expect = 1e-09, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 63/212 (29%), Gaps = 25/212 (11%) Query: 6 ASLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREIS 65 A APDP A+A+ N A N + P G Q G D + Sbjct: 27 APAAPDPYAVANATTQTNNQTAQFNKALNLNNYSNPFGSQQSTQIGTDPATGA------- 79 Query: 66 IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRD 125 P Y + + Q + N S + + + + + + Sbjct: 80 -PIYNTNITASGPLQSLIN---------STMGSA--GNANSTVNNALFGLGGLTARYDAL 127 Query: 126 NDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSG 185 N ++ G A L Q + LE++L NQGL G Sbjct: 128 NGKLGALAGQID--PNAAQLAGQRGQNAAYAAQTQYLDPRFSQGQTSLESQLANQGLTPG 185 Query: 186 SVAWNRAID----ETNRKLHDVRLAAMLKASD 213 S A++ A+ N+ D ++L Sbjct: 186 SQAYDNAMKNFNLSKNQAYSDAANQSILTGQQ 217 >gi|13470675|ref|NP_102244.1| hypothetical protein mll0449 [Mesorhizobium loti MAFF303099] gi|14021417|dbj|BAB48030.1| mll0449 [Mesorhizobium loti MAFF303099] Length = 230 Score = 48.4 bits (113), Expect = 9e-04, Method: Composition-based stats. Identities = 17/52 (32%), Positives = 29/52 (55%) Query: 160 DRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKA 211 +RL L Q + L+T+L NQG+ GS A++RA+ + +D +L+ Sbjct: 22 ERLDPILAQQSDALQTQLSNQGIKLGSAAYDRAMTQQALHANDATDQLILQG 73 >gi|259488782|tpe|CBF88504.1| TPA: auxin efflux transporter family protein (Eurofung) [Aspergillus nidulans FGSC A4] Length = 584 Score = 46.4 bits (108), Expect = 0.003, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 54/177 (30%), Gaps = 16/177 (9%) Query: 5 RASLAPDPKAIASLQLSANLANASANA---YRENIDRMTPDGIWQYKTSGVDKIIDSFIG 61 R LAP + I + N A Y +N +++ PD D + G Sbjct: 214 RVLLAPKERYIEEGE-RDNGETVVAQGRERYTDNPEQVDPDEPLIRTRDSSDGSTEQATG 272 Query: 62 REIS---------IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHV 112 P +SYS + H ++ + L LP + Sbjct: 273 SNDDSDVFHSGEATPVNTRSYSYTKLPHSGH--EDQAHDQSPILGPPPSGPFLPRQDSRG 330 Query: 113 N-TTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQ 168 + +DN + + P + + L G ++ A RL SLQ+ Sbjct: 331 DILYFPNVEVNPQDNTLEQGPLSRFKTSMGRLRVRIASGWARWTSALYSRLPPSLQK 387 >gi|67517312|ref|XP_658534.1| hypothetical protein AN0930.2 [Aspergillus nidulans FGSC A4] gi|40746803|gb|EAA65959.1| hypothetical protein AN0930.2 [Aspergillus nidulans FGSC A4] Length = 644 Score = 46.4 bits (108), Expect = 0.003, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 54/177 (30%), Gaps = 16/177 (9%) Query: 5 RASLAPDPKAIASLQLSANLANASANA---YRENIDRMTPDGIWQYKTSGVDKIIDSFIG 61 R LAP + I + N A Y +N +++ PD D + G Sbjct: 274 RVLLAPKERYIEEGE-RDNGETVVAQGRERYTDNPEQVDPDEPLIRTRDSSDGSTEQATG 332 Query: 62 REIS---------IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHV 112 P +SYS + H ++ + L LP + Sbjct: 333 SNDDSDVFHSGEATPVNTRSYSYTKLPHSGH--EDQAHDQSPILGPPPSGPFLPRQDSRG 390 Query: 113 N-TTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQ 168 + +DN + + P + + L G ++ A RL SLQ+ Sbjct: 391 DILYFPNVEVNPQDNTLEQGPLSRFKTSMGRLRVRIASGWARWTSALYSRLPPSLQK 447 >gi|15320624|ref|NP_203468.1| hypothetical protein Mx8p54 [Myxococcus phage Mx8] gi|15281734|gb|AAK94389.1|AF396866_54 p54 [Myxococcus phage Mx8] Length = 333 Score = 46.4 bits (108), Expect = 0.003, Method: Composition-based stats. Identities = 18/52 (34%), Positives = 26/52 (50%) Query: 160 DRLQSSLQQDREDLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKA 211 RL Q+ + T+L NQGL GS A+ A E ++ +D +AM A Sbjct: 111 SRLDPQWQRREDAERTRLLNQGLSEGSEAYRNAQSEFGQQRNDAYTSAMASA 162 >gi|75909095|ref|YP_323391.1| hypothetical protein Ava_2885 [Anabaena variabilis ATCC 29413] gi|75702820|gb|ABA22496.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413] Length = 885 Score = 44.5 bits (103), Expect = 0.011, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 51/150 (34%), Gaps = 12/150 (8%) Query: 76 HPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNAS 135 P+Q + N+ + L+ T R + P D P+ + Sbjct: 465 TPVQLDLFNQ----HKNLALFATTRAGKSVLVSGILTQALAHGIPVVALDFPKPDGTSTF 520 Query: 136 LEERKEILYNYPTMGSQQ-----YEKAFLDRLQSSLQQDREDLETKLHNQGLVS---GSV 187 + + N + +E+ L L Q+DR T L++ GS Sbjct: 521 TDYAHFMEGNGAYFDISKQSNNLFEQPDLRSLDPEQQRDRLLDYTGFLESALMTMVLGSS 580 Query: 188 AWNRAIDETNRKLHDVRLAAMLKASDEQER 217 N+ + +T R L ++ L+A QER Sbjct: 581 NDNQLLTQTVRSLLNLALSAFFADKGIQER 610 >gi|17227965|ref|NP_484513.1| hypothetical protein all0469 [Nostoc sp. PCC 7120] gi|17129814|dbj|BAB72427.1| all0469 [Nostoc sp. PCC 7120] Length = 885 Score = 43.7 bits (101), Expect = 0.020, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 51/150 (34%), Gaps = 12/150 (8%) Query: 76 HPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNAS 135 P+Q + N+ + L+ T R + P D P+ + Sbjct: 465 TPVQLDLFNQ----HKNLALFATTRAGKSVLVSGILTQALAHGIPVVALDFPKPDGTSTF 520 Query: 136 LEERKEILYNYPTMGSQQ-----YEKAFLDRLQSSLQQDREDLETKLHNQGLVS---GSV 187 + + N + +E+ L L Q+DR T L++ GS Sbjct: 521 TDYAHFMEGNGAYFDISRQSNNLFEQPDLRSLDPEQQRDRLLDYTGFLESALMTMVLGSS 580 Query: 188 AWNRAIDETNRKLHDVRLAAMLKASDEQER 217 N+ + +T R L ++ L+A QER Sbjct: 581 NDNQLLTQTVRSLLNLALSAFFADQGIQER 610 >gi|190347468|gb|EDK39740.2| hypothetical protein PGUG_03838 [Meyerozyma guilliermondii ATCC 6260] Length = 661 Score = 42.2 bits (97), Expect = 0.064, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 43/151 (28%), Gaps = 13/151 (8%) Query: 52 VDKIIDSFIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTH 111 D S G+ + Y + N N + + + D +H+H Sbjct: 22 TDDRRSSLGGKSTDLSAYETADEGSEFA-------NDNEIEIKNYNEDETNDYKDQNHSH 74 Query: 112 VNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAF-----LDRLQSSL 166 + D + E + N + + E L+R+ S Sbjct: 75 AS-HSDSTIHNPIHAKSSSSSFSHPEPFDGTIKNGNEIDDEIAEDQLNRQQTLERISSRK 133 Query: 167 QQDREDLETKLHNQGLVSGSVAWNRAIDETN 197 + E ++ +G+ S+ W D N Sbjct: 134 SAEEVAREEQMAEEGIEPSSLDWEGPEDPLN 164 >gi|119511821|ref|ZP_01630922.1| hypothetical protein N9414_03748 [Nodularia spumigena CCY9414] gi|119463533|gb|EAW44469.1| hypothetical protein N9414_03748 [Nodularia spumigena CCY9414] Length = 885 Score = 41.4 bits (95), Expect = 0.095, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 53/167 (31%), Gaps = 17/167 (10%) Query: 76 HPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNAS 135 PI + N+ + L+ T R + P D P+ + Sbjct: 465 TPIHLDLFNQ----HKNLALFATTRAGKSVLVSGILTQALAHGIPVVALDFPKPDGSSTF 520 Query: 136 LEERKEILYNYPTMGSQQ-----YEKAFLDRLQSSLQQDREDLETKLHNQGLVS---GSV 187 + + + N + +E+ L L Q+DR T L++ GS Sbjct: 521 TDYTEFMEGNGAYFDISKQSNNLFEQPDLRSLSPEEQRDRLLDYTGFLESALMTMVLGSS 580 Query: 188 AWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQA 234 N+ + +T R L ++ L+A QER K A Sbjct: 581 TENQLLSQTVRSLINLALSAFFADVGIQERY-----KQAIAGGFGSQ 622 >gi|294921717|ref|XP_002778707.1| partner of ralbp-1, putative [Perkinsus marinus ATCC 50983] gi|239887427|gb|EER10502.1| partner of ralbp-1, putative [Perkinsus marinus ATCC 50983] Length = 554 Score = 41.4 bits (95), Expect = 0.11, Method: Composition-based stats. Identities = 36/172 (20%), Positives = 67/172 (38%), Gaps = 19/172 (11%) Query: 57 DSFIGREIS---IPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVN 113 D+ +GR +S +P ++ S P+QQ++ + + L D H + Sbjct: 230 DATLGRPMSTGVLPQAHETESASPVQQEVKQGKQRPS-----LAGSYKFDGSSHDIGFGS 284 Query: 114 TTKDFPPQQLRDNDVPEKPNASLEERKEILYNYP-TMGSQQYEKAFLDRLQSSLQQDRED 172 K D PE ++R E+ + + RL+ SL+ R D Sbjct: 285 KAKLDTGYDDNDAASPELEQDFRDQRAELARQLARKQDADRMLSEARARLE-SLRSKRRD 343 Query: 173 LETKLHN-----QGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLD 219 L+ + + +G ++ R +DE ++++DVR A+ E E L Sbjct: 344 LDKRYTDVSAALEGQQMAVLSTKRQLDEAVKEINDVRQLAI----QEGEPLA 391 >gi|156402881|ref|XP_001639818.1| predicted protein [Nematostella vectensis] gi|156226949|gb|EDO47755.1| predicted protein [Nematostella vectensis] Length = 1091 Score = 41.0 bits (94), Expect = 0.12, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 62/192 (32%), Gaps = 3/192 (1%) Query: 14 AIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSY 73 Q N + N ++ + Q S V +D+ +G ++ S Sbjct: 576 QTGDFQAQIETVEPIRNNTQNNSEQAENNNNKQ-PQSNVQTNVDASLGLQVQQVVVPLSS 634 Query: 74 SLHPIQQQIHNRQNINNLLLSDLLTQRIQDL-LPHHHTHVNTTKDFPPQQLRDNDVPEKP 132 ++ Q ++N+ + + S + Q D P +T N+ N E Sbjct: 635 LVNTGSQMMNNQTSSEQVN-SQVNPQAAGDTKQPQENTRGNSNTQINSPYPSVNMTTEGS 693 Query: 133 NASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGLVSGSVAWNRA 192 + + + Y + + Q T+++ S +WN + Sbjct: 694 QYAPGQSNSTYLTESAGNTTSYSNGYQSSAYPLINQTDNMNNTEMNKMINPDNSSSWNSS 753 Query: 193 IDETNRKLHDVR 204 ID ++++ + R Sbjct: 754 IDGSSQQKMNAR 765 >gi|118396978|ref|XP_001030825.1| FHA domain protein [Tetrahymena thermophila] gi|89285140|gb|EAR83162.1| FHA domain protein [Tetrahymena thermophila SB210] Length = 548 Score = 40.3 bits (92), Expect = 0.24, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 52/150 (34%), Gaps = 6/150 (4%) Query: 73 YSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKP 132 + + N ++LS+L Q Q ++ N K + Sbjct: 139 QTYSQYLKTFQEESKPNQIVLSEL-NQNSQQIMMIDKIKANNYKKM----QSFEETNVSN 193 Query: 133 NASLEERKEILYNYPTMGSQQYEKAFLDRLQSS-LQQDREDLETKLHNQGLVSGSVAWNR 191 S++ M + +++F D+ Q + +QD + L + N L S S Sbjct: 194 RQSIQNFANEDKYLFEMNTNNVKQSFNDKFQGNAYKQDNQRLRLPVCNSKLDSSSEEDKE 253 Query: 192 AIDETNRKLHDVRLAAMLKASDEQERLDNI 221 IDE + + L S++Q L N+ Sbjct: 254 NIDEYTQAVDQQNLEVNNNTSEDQTALKNV 283 >gi|88798529|ref|ZP_01114113.1| GGDEF domain protein [Reinekea sp. MED297] gi|88778629|gb|EAR09820.1| GGDEF domain protein [Reinekea sp. MED297] Length = 476 Score = 39.1 bits (89), Expect = 0.51, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 70/196 (35%), Gaps = 15/196 (7%) Query: 50 SGVDKIIDSFIGRE---ISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLP 106 + V+ + D G+ I L L Q I + N LL+DLL+ I + L Sbjct: 52 TQVETVYDELEGQTQHSIDTFRALFQQLLDEQGQTILSPLNKETPLLTDLLS--IAEPLA 109 Query: 107 HHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRL--QS 164 H ++ P DV ++ + E L + RL Sbjct: 110 RHFRTLSRFDPAEPGSPDLEDVRKRLTVRFKALIETLMLMG--DTSGTLTQLTSRLDENP 167 Query: 165 SLQQ-DREDLET-KLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQ 222 + D +T L N L + + E N KL R+ +++A + L ++ Sbjct: 168 NWSMLDELAQKTIHLLNDRLQEEKQQFEGYLSELNAKL--TRIQQIVEA--DSTTLAELK 223 Query: 223 EKHAYFHNLAQAQGLQ 238 E HA F++ AQ + Sbjct: 224 ELHASFNDSINAQMHE 239 >gi|170591765|ref|XP_001900640.1| jmjC domain containing protein [Brugia malayi] gi|158591792|gb|EDP30395.1| jmjC domain containing protein [Brugia malayi] Length = 1017 Score = 39.1 bits (89), Expect = 0.53, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 55/175 (31%), Gaps = 18/175 (10%) Query: 73 YSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKP 132 + P + ++ +++N+ L R ++ H P + D P+ P Sbjct: 501 QQISPKSSKHNSLRDLNHPS-QKLKATRSRNY---HKPDGPLIVGGIPPAIMPADAPQAP 556 Query: 133 NAS----------LEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLHNQGL 182 N L ++ T + + RLQ L +D + K+ Sbjct: 557 NPYGYDPLASVTPLGHKQLPSAYRRTPDMAKVPPSQKFRLQPQLYKDVKAASEKMETVSE 616 Query: 183 VSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFHNLAQAQGL 237 + +W+ + +D + A + +DN F++ A+ Sbjct: 617 KTSDKSWSTNVISAPGPSNDSKSAEYESRAATTTHIDN----ERSFYHSESARSY 667 >gi|189191692|ref|XP_001932185.1| conserved hypothetical protein [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187973791|gb|EDU41290.1| conserved hypothetical protein [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 663 Score = 37.2 bits (84), Expect = 1.7, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 52/165 (31%), Gaps = 23/165 (13%) Query: 7 SLAPDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISI 66 AP PKA + + N +++ + G + + +G IS+ Sbjct: 442 PQAPSPKAQVNGDGAQQNTNGTSSGMNNDASMEDNSAANLLDQYGNGSLSGTPVGN-ISL 500 Query: 67 PHYLQSYSLH------------PIQQQIHNRQNIN----------NLLLSDLLTQRIQDL 104 P Q S P Q ++Q+ N ++ +S + +D Sbjct: 501 PRLSQPPSQSHSTTGTPNVPNNPNQTATFDQQDTNLDVGADLLDLDVEMSGMAAAGEKDW 560 Query: 105 LPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYPTM 149 + + N + QQ N+ NA + I + +M Sbjct: 561 VMVNEQSGNAQQPGSGQQPTMNNNQPTTNAGVMPSSNIEADAGSM 605 >gi|118378742|ref|XP_001022545.1| Tubulin-tyrosine ligase family protein [Tetrahymena thermophila] gi|89304312|gb|EAS02300.1| Tubulin-tyrosine ligase family protein [Tetrahymena thermophila SB210] Length = 1245 Score = 36.8 bits (83), Expect = 2.4, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 66/172 (38%), Gaps = 12/172 (6%) Query: 62 REISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQR-IQDLLPHHHTHVNTTKDFPP 120 +E+ P S+H ++++ + L+ TQ +L H N Sbjct: 300 KEMRFPQ-SPINSIHNQHSRLYDYNQGYSQNLNGYTTQEDRNKILKKILAHFNVKVQQME 358 Query: 121 QQLRDN---------DVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDRE 171 ++ N + + + YN QQ + + S+L ++R+ Sbjct: 359 RKQFMNIFGLEIKQIPPQDISKSHRLNANQAYYNSLNTKRQQKINNLSEMINSNLAENRQ 418 Query: 172 DLETKLHNQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQE 223 ++ + +N+ + +GS N+ ++ + L D R A +SD+Q L I E Sbjct: 419 NVNLR-NNKQMAAGSNKLNQTMNNGQQGLVDKRQAYRSNSSDQQLILSGIAE 469 >gi|145516947|ref|XP_001444362.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124411773|emb|CAK76965.1| unnamed protein product [Paramecium tetraurelia] Length = 1059 Score = 36.8 bits (83), Expect = 2.4, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 63/187 (33%), Gaps = 35/187 (18%) Query: 14 AIASLQLSANLANASANAYRENIDRMTPDGIW-----QYKTSGVDKIIDSFI------GR 62 I S QL +N+ + + +N + + + ++I Sbjct: 871 QINSSQLHSNINSYADYQLPQNKQSIPTESSLYINNATQTQDHIKQMIGPISKDSINLNS 930 Query: 63 EISIPHYLQSYSLHPIQQQIHNRQNINN----------LLLSDLLTQ------------R 100 + + S+ P QQQI + N+ L L++LL Sbjct: 931 SYAYDQHYHQQSVDPTQQQIFQQTNLAKHHIRQIQVELLDLTNLLNSVKDQQIKDNVLRE 990 Query: 101 IQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEER--KEILYNYPTMGSQQYEKAF 158 I D L + + + + + N + E+ A L E K+I+ N + Y K Sbjct: 991 IDDDLNNLQKSIQLIRKQEQPEQQTNRMMEQSQAQLSEEDIKKIVINCSKEMFENYSKVL 1050 Query: 159 LDRLQSS 165 L++++ Sbjct: 1051 LEQIKGK 1057 >gi|117924321|ref|YP_864938.1| hypothetical protein Mmc1_1014 [Magnetococcus sp. MC-1] gi|117608077|gb|ABK43532.1| hypothetical protein Mmc1_1014 [Magnetococcus sp. MC-1] Length = 381 Score = 36.4 bits (82), Expect = 3.1, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 69/231 (29%), Gaps = 40/231 (17%) Query: 1 MGKQRASLAPDPKAIASLQLSANLANASANAY--RENIDRMTPDGIWQYKTSGVDKIIDS 58 MGK ++ P +A+ A + N + TP G+ + Sbjct: 1 MGKSAPAMPAAPDPVATANAQAAANKEAVNESAKVNQFRQETPYGVLDWS---------- 50 Query: 59 FIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTKDF 118 EI P +L Q+ + Q L+ + R+ + D Sbjct: 51 ---GEIGTPDRTMKVTLSEDAQRAYGDQQAIAANLAQIAMGRMG-----QIDAGPFSLDG 102 Query: 119 PPQQLRDNDVPEKPNASLEERKEILYNYPTMGSQQYEKAFLDRLQSSLQQDREDLETKLH 178 Q ASLE+ + Q+ L ++ L+ +L Sbjct: 103 VAQVPN--------GASLEQARNQAMQEYYAHGSQF-------LDKRTANEQSKLQDRLI 147 Query: 179 NQGLVSGSVAWNRAIDETNRKLHDVRLAAMLKASDEQERLDNIQEKHAYFH 229 QG+ S A+ +A+ + + H+ + + RL E + Sbjct: 148 QQGVGLDSRAYRQAMQDFQEQSHEA-----YAELESRARLAGSSEASQQYQ 193 >gi|71022299|ref|XP_761379.1| adenylate cyclase [Ustilago maydis 521] gi|1345878|sp|P49606|CYAA_USTMA RecName: Full=Adenylate cyclase; AltName: Full=ATP pyrophosphate-lyase; AltName: Full=Adenylyl cyclase gi|603940|gb|AAA57469.1| Uac1 [Ustilago maydis] gi|46097612|gb|EAK82845.1| CYAA_USTMA Adenylate cyclase (ATP pyrophosphate-lyase) (Adenylyl cyclase) [Ustilago maydis 521] Length = 2493 Score = 36.4 bits (82), Expect = 3.4, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 61/200 (30%), Gaps = 26/200 (13%) Query: 9 APDPKAIASLQLSANLANASANAYRENIDRMTPDGIWQYKTSGVDK---------IIDSF 59 +P P + ++ + + + + P G + + + D Sbjct: 499 SPSPSQTPIAERQTSVTSTVESPSHASEASLAPSGSLRTPSRTTASTSTSSASTVLSDRL 558 Query: 60 IGREISIPHYLQSY-SLHPIQ--QQIHNRQNINNLLLSDLLTQRIQDLLPHHHTHVNTTK 116 + +P SL +Q Q + N L D+ + + + + Sbjct: 559 PSQVNMLPRNSVPRLSLAEMQNYQSLRKLSN----NLIDMSQTQNPSAMGASYRRGSIAA 614 Query: 117 DFPPQQLRDNDVPEKPNASLEERKE---ILYNYPTMGSQQYEKAFLDRLQSSLQQDREDL 173 P V + ++ L +RK+ ++ + S K L S+ Q+R+ Sbjct: 615 AQAPVDAPQLGVAPRTDSELSDRKDSVVSTHSMRSNHSGISPKTSYANL-PSVIQERQKP 673 Query: 174 ETKLHNQGLVSGSVAWNRAI 193 T L + W +I Sbjct: 674 ATAL------PSAANWTSSI 687 >gi|293392510|ref|ZP_06636830.1| primosomal replication protein N prime [Serratia odorifera DSM 4582] gi|291424912|gb|EFE98121.1| primosomal replication protein N prime [Serratia odorifera DSM 4582] Length = 198 Score = 36.0 bits (81), Expect = 4.4, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 52/146 (35%), Gaps = 9/146 (6%) Query: 37 DRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQQIHNRQNINNLLLSDL 96 +TP G + D + + G + + +Q +H+ +N L++ Sbjct: 39 SEVTPRGDTPVSQARFDSALFANHGTRLRDYLAQVQKNFSQLQSAVHDNRNAQVAFLAEK 98 Query: 97 LTQRI---------QDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLEERKEILYNYP 147 L +I Q L + D + D + A +++R+ +L Sbjct: 99 LVAQIGALQRELATQTLRRQNQPKAPAQVDLYHKLAEHQDYERRLLAMIQDRESLLGRQT 158 Query: 148 TMGSQQYEKAFLDRLQSSLQQDREDL 173 T+ QQ + L L+ L + R+ L Sbjct: 159 TLAGQQKIQQELAALEGRLMRCRQAL 184 >gi|253732074|ref|ZP_04866239.1| possible cell wall associated fibronectin-binding protein [Staphylococcus aureus subsp. aureus USA300_TCH959] gi|253724190|gb|EES92919.1| possible cell wall associated fibronectin-binding protein [Staphylococcus aureus subsp. aureus USA300_TCH959] Length = 1136 Score = 34.9 bits (78), Expect = 8.5, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 76/235 (32%), Gaps = 31/235 (13%) Query: 21 SANLANASANAYRENIDRMTPDGIWQYKTSGVDKIIDSFIGREISIPHYLQSYSLHPIQQ 80 + N A AN+ NI++ T + + + I+ EI + + Q Sbjct: 197 AKNKAEELANSIINNINKATSNQAVSQVQTAGNHAIEQVHANEIPKAKIDANKDVDKQVQ 256 Query: 81 QIHNRQNINNLLL---SDLLTQRIQDLLPHHHTHVNTTKDFPPQQLRDNDVPEKPNASLE 137 + + + N L L RI +L H +N + + + + Sbjct: 257 ALIDEIDRNPNLTDKEKQALKDRINQILQQGHNDINNALTKEEIEQAKAQLAQALQDIKD 316 Query: 138 ERKEILYNYPTMGSQ----------------QYEKAFLDRLQSSLQQDREDLETKLHNQG 181 K + Q + ++A DR+ LQQ D+ + + Sbjct: 317 LVKAKEDAKQDVDKQVQALIDEIDQNPNLTDKEKQALKDRINQILQQGHNDINNAMTKEA 376 Query: 182 LVSGSVAWNRAID---------ETNRKLHDVRLAAMLKASDEQERLDNIQEKHAY 227 + +A+ E ++ D R+ A++ DE +R N+ +K Sbjct: 377 IEQAKERLAQALQEIKNLVKAKENAKQDVDKRVQALI---DEIDRNPNLTDKEKQ 428 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.293 0.115 0.275 Lambda K H 0.267 0.0347 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,198,234,823 Number of Sequences: 13984884 Number of extensions: 109262083 Number of successful extensions: 302494 Number of sequences better than 10.0: 478 Number of HSP's better than 10.0 without gapping: 66 Number of HSP's successfully gapped in prelim test: 509 Number of HSP's that attempted gapping in prelim test: 300310 Number of HSP's gapped (non-prelim): 1858 length of query: 238 length of database: 4,792,584,752 effective HSP length: 135 effective length of query: 103 effective length of database: 2,904,625,412 effective search space: 299176417436 effective search space used: 299176417436 T: 11 A: 40 X1: 16 ( 6.8 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (20.9 bits) S2: 78 (34.9 bits)