BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781205|ref|YP_003065618.1| hypothetical protein CLIBASIA_05560 [Candidatus Liberibacter asiaticus str. psy62] (171 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781205|ref|YP_003065618.1| hypothetical protein CLIBASIA_05560 [Candidatus Liberibacter asiaticus str. psy62] gi|254040882|gb|ACT57678.1| hypothetical protein CLIBASIA_05560 [Candidatus Liberibacter asiaticus str. psy62] Length = 171 Score = 346 bits (888), Expect = 6e-94, Method: Compositional matrix adjust. Identities = 171/171 (100%), Positives = 171/171 (100%) Query: 1 MSIGMDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDR 60 MSIGMDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDR Sbjct: 1 MSIGMDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDR 60 Query: 61 EDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAK 120 EDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAK Sbjct: 61 EDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAK 120 Query: 121 EAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS 171 EAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS Sbjct: 121 EAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS 171 >gi|317120671|gb|ADV02494.1| hypothetical protein SC1_gp050 [Liberibacter phage SC1] gi|317120815|gb|ADV02636.1| hypothetical protein SC1_gp050 [Candidatus Liberibacter asiaticus] Length = 167 Score = 338 bits (868), Expect = 1e-91, Method: Compositional matrix adjust. Identities = 167/167 (100%), Positives = 167/167 (100%) Query: 5 MDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQA 64 MDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQA Sbjct: 1 MDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQA 60 Query: 65 RREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGW 124 RREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGW Sbjct: 61 RREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGW 120 Query: 125 HRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS 171 HRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS Sbjct: 121 HRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS 167 >gi|315121930|ref|YP_004062419.1| hypothetical protein CKC_00900 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122892|ref|YP_004063381.1| hypothetical protein CKC_05740 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495332|gb|ADR51931.1| hypothetical protein CKC_00900 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496294|gb|ADR52893.1| hypothetical protein CKC_05740 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 148 Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 42/104 (40%), Positives = 70/104 (67%) Query: 25 ISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGV 84 + S +++ S+ + + YR+ LAEEN+ RAD+L+ +R ++ R+EG++D G+F+MKA +SG+ Sbjct: 18 VVSDMSTATSTSKSNRYRASLAEENSRRADILHSERAERLRKEGLLDAGLFQMKAEMSGL 77 Query: 85 SGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGWHRAN 128 SG S DL +GQ + + + A++ R TV RF KE W + N Sbjct: 78 SGISADLWIGQRYADTEREVEKAQSTRFSTVQRFLKEQQWLKQN 121 >gi|315122425|ref|YP_004062914.1| hypothetical protein CKC_03385 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495827|gb|ADR52426.1| hypothetical protein CKC_03385 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 83 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 29/51 (56%), Positives = 41/51 (80%) Query: 41 YRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGVSGASLDL 91 YRS LAEEN RA L+L+R ++A+REGI++ G+FRMK+ +SG+SG S D+ Sbjct: 33 YRSSLAEENVKRAGFLHLERMERAKREGILEAGLFRMKSEMSGLSGVSADI 83 >gi|296418189|ref|XP_002838724.1| hypothetical protein [Tuber melanosporum Mel28] gi|295634685|emb|CAZ82915.1| unnamed protein product [Tuber melanosporum] Length = 768 Score = 37.0 bits (84), Expect = 1.1, Method: Compositional matrix adjust. Identities = 30/107 (28%), Positives = 51/107 (47%), Gaps = 13/107 (12%) Query: 18 LVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRM 77 + G S HR +RD +R+++ R D+ Y DRE+ + G + + + Sbjct: 605 MFGLAFAYCSERLVHR--VRDRAFRTIM------RQDIAYFDREENS--TGALTSFLSTE 654 Query: 78 KAVLSGVSGASLDLLVGQNTRNAYK---GINTARTAREQTVARFAKE 121 LSG+SG +L L+ + ++ AY+ G TA +TVA +E Sbjct: 655 TTHLSGMSGVTLGTLLIRRSKKAYEKSAGFACEATAAIRTVASLTRE 701 >gi|317120713|gb|ADV02535.1| hypothetical protein SC2_gp065 [Liberibacter phage SC2] gi|317120774|gb|ADV02595.1| hypothetical protein SC2_gp065 [Candidatus Liberibacter asiaticus] Length = 118 Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust. Identities = 27/99 (27%), Positives = 46/99 (46%) Query: 8 LLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARRE 67 L G S+ S L + ++ ++ I E R LA++NA AD+ LD+ Q R+E Sbjct: 8 FLTGASVLSRFTKGILDYQADVSQAQAQIETDEQRKKLAQDNAHLADLETLDQISQKRKE 67 Query: 68 GIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINT 106 + + R + G+S + +L +GQ + I T Sbjct: 68 HVYLSSKMRSQMSARGLSPVTQELWLGQTLAEMEREIQT 106 >gi|164426746|ref|XP_960586.2| hypothetical protein NCU03584 [Neurospora crassa OR74A] gi|157071460|gb|EAA31350.2| hypothetical protein NCU03584 [Neurospora crassa OR74A] Length = 2083 Score = 34.3 bits (77), Expect = 6.4, Method: Compositional matrix adjust. Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 1/65 (1%) Query: 101 YKGINTARTAREQTVARFAKEAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMF 160 Y N + A Q V F K++GW+R N+ + + SV AI+GPP A ++G Sbjct: 28 YIFTNISEGAASQVVHDFNKKSGWNRVNRLYMSASSSQSV-AISGPPSSLKAFASGHNFG 86 Query: 161 RRYNV 165 + NV Sbjct: 87 GKTNV 91 Searching..................................................done Results from round 2 >gi|254781205|ref|YP_003065618.1| hypothetical protein CLIBASIA_05560 [Candidatus Liberibacter asiaticus str. psy62] gi|254040882|gb|ACT57678.1| hypothetical protein CLIBASIA_05560 [Candidatus Liberibacter asiaticus str. psy62] Length = 171 Score = 281 bits (719), Expect = 2e-74, Method: Composition-based stats. Identities = 171/171 (100%), Positives = 171/171 (100%) Query: 1 MSIGMDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDR 60 MSIGMDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDR Sbjct: 1 MSIGMDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDR 60 Query: 61 EDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAK 120 EDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAK Sbjct: 61 EDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAK 120 Query: 121 EAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS 171 EAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS Sbjct: 121 EAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS 171 >gi|317120671|gb|ADV02494.1| hypothetical protein SC1_gp050 [Liberibacter phage SC1] gi|317120815|gb|ADV02636.1| hypothetical protein SC1_gp050 [Candidatus Liberibacter asiaticus] Length = 167 Score = 274 bits (700), Expect = 4e-72, Method: Composition-based stats. Identities = 167/167 (100%), Positives = 167/167 (100%) Query: 5 MDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQA 64 MDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQA Sbjct: 1 MDDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQA 60 Query: 65 RREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGW 124 RREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGW Sbjct: 61 RREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGW 120 Query: 125 HRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS 171 HRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS Sbjct: 121 HRANKEAVKNNRWASVAAIAGPPMVESASSAGMKMFRRYNVKGKDAS 167 >gi|315121930|ref|YP_004062419.1| hypothetical protein CKC_00900 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122892|ref|YP_004063381.1| hypothetical protein CKC_05740 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495332|gb|ADR51931.1| hypothetical protein CKC_00900 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496294|gb|ADR52893.1| hypothetical protein CKC_05740 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 148 Score = 162 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 42/104 (40%), Positives = 70/104 (67%) Query: 25 ISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGV 84 + S +++ S+ + + YR+ LAEEN+ RAD+L+ +R ++ R+EG++D G+F+MKA +SG+ Sbjct: 18 VVSDMSTATSTSKSNRYRASLAEENSRRADILHSERAERLRKEGLLDAGLFQMKAEMSGL 77 Query: 85 SGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGWHRAN 128 SG S DL +GQ + + + A++ R TV RF KE W + N Sbjct: 78 SGISADLWIGQRYADTEREVEKAQSTRFSTVQRFLKEQQWLKQN 121 >gi|315122425|ref|YP_004062914.1| hypothetical protein CKC_03385 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495827|gb|ADR52426.1| hypothetical protein CKC_03385 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 83 Score = 84.8 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 30/66 (45%), Positives = 42/66 (63%) Query: 26 SSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGVS 85 S YRS LAEEN RA L+L+R ++A+REGI++ G+FRMK+ +SG+S Sbjct: 18 ISQYKEGTQRAEADRYRSSLAEENVKRAGFLHLERMERAKREGILEAGLFRMKSEMSGLS 77 Query: 86 GASLDL 91 G S D+ Sbjct: 78 GVSADI 83 >gi|317120713|gb|ADV02535.1| hypothetical protein SC2_gp065 [Liberibacter phage SC2] gi|317120774|gb|ADV02595.1| hypothetical protein SC2_gp065 [Candidatus Liberibacter asiaticus] Length = 118 Score = 53.2 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 27/99 (27%), Positives = 46/99 (46%) Query: 8 LLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARRE 67 L G S+ S L + ++ ++ I E R LA++NA AD+ LD+ Q R+E Sbjct: 8 FLTGASVLSRFTKGILDYQADVSQAQAQIETDEQRKKLAQDNAHLADLETLDQISQKRKE 67 Query: 68 GIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINT 106 + + R + G+S + +L +GQ + I T Sbjct: 68 HVYLSSKMRSQMSARGLSPVTQELWLGQTLAEMEREIQT 106 >gi|111221654|ref|YP_712448.1| hypothetical protein FRAAL2220 [Frankia alni ACN14a] gi|111149186|emb|CAJ60869.1| conserved hypothetical protein; putative coiled-coil domain [Frankia alni ACN14a] Length = 247 Score = 41.3 bits (95), Expect = 0.052, Method: Composition-based stats. Identities = 23/100 (23%), Positives = 42/100 (42%), Gaps = 2/100 (2%) Query: 33 RSSIRDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGVSGASLDLL 92 S RD + A N+ + L + ARR+G+++ + G+ G L Sbjct: 77 TRSARDSQRLESGAVTNSRELENLQAELASLARRQGVLEDDALEKMEAVEGLEGRLAAL- 135 Query: 93 VGQNTRNAYKGINTARTAREQTVARFAKEAGWHRANKEAV 132 Q + I+ A TAR++ A E+ R +++A+ Sbjct: 136 -DQRRADLQAEIDAAITARDKAYAEIDTESARMRQDRQAL 174 >gi|323447410|gb|EGB03332.1| hypothetical protein AURANDRAFT_55580 [Aureococcus anophagefferens] Length = 3609 Score = 38.2 bits (87), Expect = 0.43, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 65/175 (37%), Gaps = 22/175 (12%) Query: 9 LYGLSLASPLVGAGLRISSTLASHRSSIRDHEYR-----------SLLAEENALRADVLY 57 L + SP++ A + L+ +S Y+ E L +L+ Sbjct: 1965 LSEIEFKSPVLRAAIEKFMPLSFEMASQAAARYQLEEGRFVYLTPKSYLEMLGLYQTLLH 2024 Query: 58 LDREDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVAR 117 RE+ ++ G+ R++ S V +L + + + + A AR + + Sbjct: 2025 RRREENTTATVRLENGISRLRDAASAVGTLEQELTIMVQAADEKREHSEAIAARVAS-EK 2083 Query: 118 FAKEAGWHRANKEAVK----------NNRWASVAAIAGPPMVESASSAGMKMFRR 162 E +AN+EA K N A +A P +++A++A + RR Sbjct: 2084 MVVERETDKANEEAGKVATIQAEVQRQNEDAERDLVAAEPAIQAATAALDTLDRR 2138 >gi|258648893|ref|ZP_05736362.1| putative fibronectin type III domain protein [Prevotella tannerae ATCC 51259] gi|260850921|gb|EEX70790.1| putative fibronectin type III domain protein [Prevotella tannerae ATCC 51259] Length = 396 Score = 37.0 bits (84), Expect = 0.89, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 37/102 (36%), Gaps = 2/102 (1%) Query: 6 DDLLYGLSLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLD--REDQ 63 DD+ GLS AS A + +A + + + +EN L + RE Sbjct: 137 DDVAVGLSEASVKTLAPVSDEFNIAVSEITSNNAKIEITPKDENMRYYRFLVTEDVREQM 196 Query: 64 ARREGIMDTGVFRMKAVLSGVSGASLDLLVGQNTRNAYKGIN 105 R G + ++ SG LD + QNT+ + Sbjct: 197 IARNGSIQAADLAFWKEMAQQSGVELDQYITQNTKTGFSSFE 238 >gi|169620036|ref|XP_001803430.1| hypothetical protein SNOG_13219 [Phaeosphaeria nodorum SN15] gi|160703952|gb|EAT79546.2| hypothetical protein SNOG_13219 [Phaeosphaeria nodorum SN15] Length = 604 Score = 36.7 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 50/116 (43%), Gaps = 5/116 (4%) Query: 21 AGLRISSTLASHRSSIRDHEYRSLLAEE--NALRADVLYLDREDQARREGIMDTGVFRMK 78 A LR S + + S+ +DH +++ EE +A A +++R ++ + G RM Sbjct: 106 AELRKSKAMMNTPSTYQDHRFKTDYMEEMRSAYPASENHMERSNRIQAGLASILGPERMG 165 Query: 79 AVL---SGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGWHRANKEA 131 + SG SG D ++ ++ + + Q + R E HR +EA Sbjct: 166 LGVGLPSGHSGIDNDPVLAKHNTDKRIQDTKDVYEQYQAMLRQQCEQQQHREAEEA 221 >gi|168008663|ref|XP_001757026.1| predicted protein [Physcomitrella patens subsp. patens] gi|162691897|gb|EDQ78257.1| predicted protein [Physcomitrella patens subsp. patens] Length = 535 Score = 36.7 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 22/95 (23%), Positives = 35/95 (36%), Gaps = 5/95 (5%) Query: 32 HRSSIRDHEYRSLL---AEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGVSGAS 88 S+ R+H + A+ V L+ ++A+ +M F+M SG+ Sbjct: 324 AYSTARNHREEAKYFRPADRYHEEKGVNSLEVHERAKPSSVMKQSEFKMDVNRSGLMTTQ 383 Query: 89 LDLLVGQNTRNAYKGINTARTAREQTVARFAKEAG 123 DL +G+ K T R AR EA Sbjct: 384 NDLAMGRRKAEGEKA--TVEVRRADKTARRVMEAQ 416 >gi|322515202|ref|ZP_08068201.1| DNA-directed DNA polymerase IV [Actinobacillus ureae ATCC 25976] gi|322118812|gb|EFX91013.1| DNA-directed DNA polymerase IV [Actinobacillus ureae ATCC 25976] Length = 356 Score = 36.3 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 27/105 (25%), Positives = 43/105 (40%), Gaps = 13/105 (12%) Query: 36 IRDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRM-----KAVLSGVSGASLD 90 I + R LA EN L D+ +L +Q E + VFR+ K L ++ Sbjct: 237 IEVNRPRKSLAVENTLPTDIWHLSEAEQIVDE-LFKKLVFRLQRNWGKRSLQEFKKLAIK 295 Query: 91 LLVGQNTRNAYKGINTARTAREQTVARFAK--EAGWHRANKEAVK 133 L G T+ + RT ++ RF + + W R N +V+ Sbjct: 296 LKFGDFTQTTLE-----RTTDGLSLERFIELLQQVWQRTNHRSVR 335 >gi|208608058|emb|CAP09852.1| RNA dependent RNA polymerase [Infectious pancreatic necrosis virus] gi|208608072|emb|CAP09859.1| RNA dependent RNA polymerase [Infectious pancreatic necrosis virus] Length = 75 Score = 36.3 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 18/61 (29%), Positives = 27/61 (44%) Query: 99 NAYKGINTARTAREQTVARFAKEAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGMK 158 + KG A + Q +A + GW AN N WA++A I P +V +S M Sbjct: 10 DLEKGEANATKSHAQALAYYLLTRGWVGANGAPEFNATWATIAMIIAPALVVDSSCLFMN 69 Query: 159 M 159 + Sbjct: 70 L 70 >gi|85859159|ref|YP_461361.1| hypothetical protein SYN_02855 [Syntrophus aciditrophicus SB] gi|85722250|gb|ABC77193.1| hypothetical exported protein [Syntrophus aciditrophicus SB] Length = 510 Score = 35.9 bits (81), Expect = 2.0, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 55/129 (42%), Gaps = 9/129 (6%) Query: 24 RISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSG 83 R + S + + ++RSL AE N L A++ L+ R + + KA Sbjct: 91 RALTDKGSAVNDAKRKQHRSLTAERNRLAAEIKGLE-----ARIAFWQS---QTKAKTRS 142 Query: 84 VS-GASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGWHRANKEAVKNNRWASVAA 142 ++ +L ++ +NT+ A++ T RE+ R A + +EA K + S++ Sbjct: 143 LNDAVNLSAVISRNTKKAWQEKMTLEEEREELDQRIATLQAEMKNPREAAKADWEVSLSL 202 Query: 143 IAGPPMVES 151 + P S Sbjct: 203 LGLKPESRS 211 >gi|146292632|ref|YP_001183056.1| SMC domain-containing protein [Shewanella putrefaciens CN-32] gi|145564322|gb|ABP75257.1| SMC domain protein [Shewanella putrefaciens CN-32] Length = 1018 Score = 35.9 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 42/98 (42%), Gaps = 16/98 (16%) Query: 37 RDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQN 96 + H YR + + AD+ L ++ ++RR+GI+ + +G + D + Sbjct: 183 QTHIYRRIEDSLKSKAADIRALVKDQRSRRDGILQS------------AGLASDDELSCE 230 Query: 97 TRNAYKGINTARTAREQTVARFAKEAGWHRANKEAVKN 134 + TA++A+EQ + + W +A ++ Sbjct: 231 LAKLTPELETAQSAKEQALQ----QQQWVIKTSDAAQH 264 >gi|319425936|gb|ADV54010.1| dsDNA exonuclease, SbcC [Shewanella putrefaciens 200] Length = 1018 Score = 35.9 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 42/98 (42%), Gaps = 16/98 (16%) Query: 37 RDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQN 96 + H YR + + AD+ L ++ ++RR+GI+ + +G + D + Sbjct: 183 QTHIYRRIEDSLKSKAADIRALVKDQRSRRDGILQS------------AGLASDDELSCE 230 Query: 97 TRNAYKGINTARTAREQTVARFAKEAGWHRANKEAVKN 134 + TA++A+EQ + + W +A ++ Sbjct: 231 LAKLTPELETAQSAKEQALQ----QQQWVIKTSDAAQH 264 >gi|120599371|ref|YP_963945.1| SMC domain-containing protein [Shewanella sp. W3-18-1] gi|120559464|gb|ABM25391.1| SMC domain protein [Shewanella sp. W3-18-1] Length = 1018 Score = 35.9 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 42/98 (42%), Gaps = 16/98 (16%) Query: 37 RDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGVSGASLDLLVGQN 96 + H YR + + AD+ L ++ ++RR+GI+ + +G + D + Sbjct: 183 QTHIYRRIEDSLKSKAADIRALVKDQRSRRDGILQS------------AGLASDDELSCE 230 Query: 97 TRNAYKGINTARTAREQTVARFAKEAGWHRANKEAVKN 134 + TA++A+EQ + + W +A ++ Sbjct: 231 LAKLTPELETAQSAKEQALQ----QQQWVIKTSDAAQH 264 >gi|326780466|ref|ZP_08239731.1| lipid A biosynthesis acyltransferase [Streptomyces cf. griseus XylebKG-1] gi|326660799|gb|EGE45645.1| lipid A biosynthesis acyltransferase [Streptomyces cf. griseus XylebKG-1] Length = 307 Score = 34.7 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 25/88 (28%), Positives = 35/88 (39%), Gaps = 5/88 (5%) Query: 77 MKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVARFAKEAGWHRANKEAVK-NN 135 M A S + G D L G K A +T+A + W R K ++ + Sbjct: 1 MSAGASDLKGRLTDGLYGLGWGAVKKLPEPAAARLFRTIA----DQVWKRRGKSVLRLES 56 Query: 136 RWASVAAIAGPPMVESASSAGMKMFRRY 163 A V AGP + S AGM+ + RY Sbjct: 57 NLARVVPDAGPARLAELSRAGMRSYMRY 84 >gi|218304263|emb|CAN87025.1| pol protein [Simian immunodeficiency virus] Length = 1022 Score = 34.7 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 46/134 (34%), Gaps = 7/134 (5%) Query: 42 RSLLAEENALRADVLYLDREDQARREGIMDTGVFRMK-AVLSGVSGASLDLLVGQNTRNA 100 + ++ +N D+++ RED + E + +F G+ G+ + LV T + Sbjct: 887 QGVIESKNRRLKDIIHSIREDAEKLETALAMALFIHNFKEKGGLGGSPAERLVNMITSDL 946 Query: 101 YKGINTARTAREQTVA---RFAKEAGWHRANKEAVKNNRWASVAAIAGPPMVESASSAGM 157 + + + R W K K I P + + Sbjct: 947 ETQQTQQQKLKFKNFQVYYRTGANQQWQGPGKLPWKGE---GALVIETPEGIITVPRRKA 1003 Query: 158 KMFRRYNVKGKDAS 171 K+ + +N +G D S Sbjct: 1004 KLIKVWNGEGMDRS 1017 >gi|239907598|ref|YP_002954339.1| methyl-accepting chemotaxis protein [Desulfovibrio magneticus RS-1] gi|239797464|dbj|BAH76453.1| methyl-accepting chemotaxis protein [Desulfovibrio magneticus RS-1] Length = 677 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 3/104 (2%) Query: 13 SLASPLVGAGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARREGIMDT 72 +L+ L +++ + + + + R+ E ++ +A + RAD R +QA+REG++ Sbjct: 339 ALSDSLDAMAVKLRAMIETATAKTREAEEQTEIARQATARADEARS-RAEQAKREGML-A 396 Query: 73 GVFRMKAVLSGVSGASLDLLVGQNTRNAYKGINTARTAREQTVA 116 R++ V++ VS AS +L Q + + AR E A Sbjct: 397 AAGRLEGVVAVVSSASEEL-SSQVEESNHGAQEQARRTAETATA 439 >gi|313112697|ref|ZP_07798349.1| hypothetical protein HMPREF9436_00189 [Faecalibacterium cf. prausnitzii KLE1255] gi|310624987|gb|EFQ08290.1| hypothetical protein HMPREF9436_00189 [Faecalibacterium cf. prausnitzii KLE1255] Length = 1097 Score = 34.3 bits (77), Expect = 6.6, Method: Composition-based stats. Identities = 29/124 (23%), Positives = 52/124 (41%), Gaps = 19/124 (15%) Query: 29 LASHRSSIRDHEYRSL--LAEENALRADVLYLDREDQARREGIMDTGVFRMKAVLSGV-- 84 L++H ++ E R+L L EE + R L + AR+ ++ + R +++L+ + Sbjct: 320 LSAHGAASASGEGRALDALTEELSRRKTAL----DTAARKAEKAESALARTQSLLAVLRR 375 Query: 85 SGASLDLLVGQNTRNA-----------YKGINTARTAREQTVARFAKEAGWHRANKEAVK 133 SG + + + T + K + A A Q A KE RA +AV Sbjct: 376 SGFAAGIKAEELTADTLPQLTAWLSAQEKPLEEAYFAARQNTAALQKEQAGKRAELDAVS 435 Query: 134 NNRW 137 +W Sbjct: 436 GGKW 439 >gi|239905513|ref|YP_002952252.1| methyl-accepting chemotaxis protein [Desulfovibrio magneticus RS-1] gi|239795377|dbj|BAH74366.1| methyl-accepting chemotaxis protein [Desulfovibrio magneticus RS-1] Length = 601 Score = 34.0 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 30/116 (25%), Positives = 57/116 (49%), Gaps = 15/116 (12%) Query: 33 RSSIRDHEYRSLLAEENALRADVLYLD------REDQARREGIMDTGVFRMKAVLSGVSG 86 ++ I + E +++LAEE +A V + + D+A+REG++D +++AV+ ++ Sbjct: 258 KAKISEAESKTILAEEETKKAQVATKEAEEARRQADRAKREGMLDAA-NKLEAVVESITS 316 Query: 87 AS-----LDLLVGQNTRNAYKGINTARTARE---QTVARFAKEAGWHRANKEAVKN 134 AS + + +N+ + + + A TA E T+ AK A E KN Sbjct: 317 ASDQLNAQSIELSKNSDSQARRVTEAATAMEEMNSTIIEVAKNASQATLTSENAKN 372 >gi|227548602|ref|ZP_03978651.1| possible cadmium-exporting ATPase [Corynebacterium lipophiloflavum DSM 44291] gi|227079325|gb|EEI17288.1| possible cadmium-exporting ATPase [Corynebacterium lipophiloflavum DSM 44291] Length = 644 Score = 34.0 bits (76), Expect = 7.2, Method: Composition-based stats. Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 2/86 (2%) Query: 21 AGLRISSTLASHRSSIRDHEYRSLLAEENALRADVLYLDREDQARREGIMDTGVFRMKAV 80 +GLR L ++I H +A E + DVL L ++ +GI+ +G Sbjct: 117 SGLRALLALIPSTATIISHGVTRSVAVEELVPGDVLRLAAGERLATDGIIRSG--HSSLD 174 Query: 81 LSGVSGASLDLLVGQNTRNAYKGINT 106 +S ++G S+ + VG INT Sbjct: 175 VSAITGESIPVEVGPGDAVLAGSINT 200 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.311 0.128 0.339 Lambda K H 0.267 0.0392 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,718,579,248 Number of Sequences: 14124377 Number of extensions: 91392224 Number of successful extensions: 277066 Number of sequences better than 10.0: 52 Number of HSP's better than 10.0 without gapping: 15 Number of HSP's successfully gapped in prelim test: 54 Number of HSP's that attempted gapping in prelim test: 277046 Number of HSP's gapped (non-prelim): 74 length of query: 171 length of database: 4,842,793,630 effective HSP length: 129 effective length of query: 42 effective length of database: 3,020,748,997 effective search space: 126871457874 effective search space used: 126871457874 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 76 (33.9 bits)