BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781138|ref|YP_003065551.1| hypothetical protein CLIBASIA_05200 [Candidatus Liberibacter asiaticus str. psy62] (178 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781138|ref|YP_003065551.1| hypothetical protein CLIBASIA_05200 [Candidatus Liberibacter asiaticus str. psy62] gi|254040815|gb|ACT57611.1| hypothetical protein CLIBASIA_05200 [Candidatus Liberibacter asiaticus str. psy62] Length = 178 Score = 365 bits (938), Expect = 1e-99, Method: Compositional matrix adjust. Identities = 178/178 (100%), Positives = 178/178 (100%) Query: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL Sbjct: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 Query: 61 PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPP 120 PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPP Sbjct: 61 PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPP 120 Query: 121 LTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRSW 178 LTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRSW Sbjct: 121 LTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRSW 178 >gi|315122531|ref|YP_004063020.1| hypothetical protein CKC_03915 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495933|gb|ADR52532.1| hypothetical protein CKC_03915 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 191 Score = 313 bits (801), Expect = 8e-84, Method: Compositional matrix adjust. Identities = 149/177 (84%), Positives = 164/177 (92%) Query: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 MAKD+ASLLIDVSHY EK GFT LFK+FLHRIEVKINR LRLREME+K LSLVEGS L Sbjct: 1 MAKDYASLLIDVSHYTEKSGFTCLFKDFLHRIEVKINRELRLREMEKKAFLSLVEGSVIL 60 Query: 61 PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPP 120 PDD+IE+RFL D++GNKLDHLPLSISLQKNKGYIITADSICV P SSEDVVLYYYACIPP Sbjct: 61 PDDFIEMRFLTDHQGNKLDHLPLSISLQKNKGYIITADSICVSPPSSEDVVLYYYACIPP 120 Query: 121 LTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRS 177 LT+NNP+NWLL R+PDLYLYGL+EE+ALWEQKIDKA+TA ALFQESIK+LQ+NDIRS Sbjct: 121 LTDNNPINWLLQRSPDLYLYGLIEEVALWEQKIDKASTAGALFQESIKRLQKNDIRS 177 >gi|227822439|ref|YP_002826411.1| hypothetical protein NGR_c18940 [Sinorhizobium fredii NGR234] gi|227341440|gb|ACP25658.1| hypothetical protein NGR_c18940 [Sinorhizobium fredii NGR234] Length = 198 Score = 143 bits (361), Expect = 9e-33, Method: Compositional matrix adjust. Identities = 76/180 (42%), Positives = 103/180 (57%), Gaps = 6/180 (3%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPD 62 D+ASLL+D Y + T+LF FL E+K+NR +R+ +MER + + L +G L Sbjct: 4 SDYASLLVDAGDYSGRDDVTHLFPRFLALAELKLNRVMRVADMERMVTVPLTDGEGSLSA 63 Query: 63 DYIEIRFLEDNKGNKLDHLPL------SISLQKNKGYIITADSICVLPISSEDVVLYYYA 116 D++E R + G L LPL + S GY I +I V P +ED+ L YYA Sbjct: 64 DFLEARQVLAADGRVLRALPLQELGRHATSGGGPIGYAIVGSAIQVRPKGAEDIRLTYYA 123 Query: 117 CIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIR 176 IPPLT P NWL+ RAPD+YLY LVEEIA+WE+ KA A AL ++++ L D R Sbjct: 124 KIPPLTAVAPSNWLIERAPDVYLYALVEEIAIWERDAGKAGAAEALKRQAMAGLGLADER 183 >gi|150397025|ref|YP_001327492.1| hypothetical protein Smed_1822 [Sinorhizobium medicae WSM419] gi|150028540|gb|ABR60657.1| hypothetical protein Smed_1822 [Sinorhizobium medicae WSM419] Length = 200 Score = 127 bits (320), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 67/183 (36%), Positives = 96/183 (52%), Gaps = 8/183 (4%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPD 62 D+ SLL+D Y + + F FL E+K+NR LR+ +ME + L +G LP Sbjct: 4 SDYGSLLVDAGEYSGREDIAHNFPRFLGLAELKLNRGLRVADMEVTAAIPLADGDGTLPA 63 Query: 63 DYIEIRFLEDNKGNKLDHLPLSISLQKNK--------GYIITADSICVLPISSEDVVLYY 114 D++E R + G + +PL + GY I I V +S + + + Y Sbjct: 64 DFLEAREVRSGAGVSIRAVPLQQLTNGYRDRNGVQPAGYSIVGRRIKVRSVSDQGLTITY 123 Query: 115 YACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQND 174 YA IP LT +NP NWLL +APD+YL+ LV EIA+W + +D AT A L +I L+ D Sbjct: 124 YARIPALTPSNPTNWLLEKAPDVYLFALVNEIAIWGKDVDGATAAQQLMMLAISGLRIED 183 Query: 175 IRS 177 RS Sbjct: 184 ERS 186 >gi|170752055|ref|YP_001758315.1| hypothetical protein Mrad2831_5687 [Methylobacterium radiotolerans JCM 2831] gi|170658577|gb|ACB27632.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 217 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 53/166 (31%), Positives = 73/166 (43%), Gaps = 22/166 (13%) Query: 4 DFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSA---PL 60 D ASL V Y+ + F+ E N LR REME + L PL Sbjct: 7 DLASLRAAVLDYIARPDLADAVPGFIALAESHFNAILRAREMEAEASLGTAAAPVAVMPL 66 Query: 61 PDDYIE--------------IRFLE-DNKGNKLDHLPLSISLQKNKGYIITADSICVLPI 105 P D+IE F E D+ + H P + + I A ++ ++P Sbjct: 67 PADFIEWLAVSWAGSGRTARPTFAEADSPEARFRHRPGG----DPQYFAIRAGNVRMVPE 122 Query: 106 SSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQ 151 V L YYA IPPLT+ P NWLL +APD YLY ++ E L+++ Sbjct: 123 KPGAVTLAYYAAIPPLTDAAPSNWLLAKAPDAYLYAVLAEAYLFQK 168 >gi|148257049|ref|YP_001241634.1| hypothetical protein BBta_5781 [Bradyrhizobium sp. BTAi1] gi|146409222|gb|ABQ37728.1| hypothetical protein BBta_5781 [Bradyrhizobium sp. BTAi1] Length = 217 Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 56/191 (29%), Positives = 89/191 (46%), Gaps = 24/191 (12%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFK---NFLHRIEVKINRALRLREMERKIVLSLVEGSA- 58 +++SL VS Y+ + T L +F+ E K+NR L +R+MER+ + + GSA Sbjct: 5 SNYSSLQSAVSEYLARDQDTTLVSRIPSFIQLAEAKLNRQLFVRQMERRAIAVVDLGSAE 64 Query: 59 ----PLPDDYIEIRF-----------LEDNKGNKLDHLPLSIS--LQKNKGYIITADSIC 101 LP D+ +R LE G +LD I + + + + D I Sbjct: 65 PNFIALPSDFQSMRRIRLTSVRGQPPLEFKSGVQLDEYRSRICNVAGQPRYFSLFGDQIE 124 Query: 102 VLPI--SSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTA 159 + P ++ + + Y IPPL N+ NWLL APDLYLYG + E A + ++ + T Sbjct: 125 LAPRPDAAYTIEMVYRRTIPPLASND-SNWLLMLAPDLYLYGALLESAPYLKEDGRIQTW 183 Query: 160 SALFQESIKKL 170 F ++ L Sbjct: 184 GLGFSTALADL 194 >gi|307308930|ref|ZP_07588613.1| hypothetical protein SinmeBDRAFT_4497 [Sinorhizobium meliloti BL225C] gi|306900564|gb|EFN31177.1| hypothetical protein SinmeBDRAFT_4497 [Sinorhizobium meliloti BL225C] Length = 191 Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 36/138 (26%), Positives = 71/138 (51%), Gaps = 10/138 (7%) Query: 46 ERKIVLSLVEGSAPLPDDYIEIRFLEDNKGNKLDHL-PLSISLQKNK-GYIITADSICVL 103 E+ + LS+V+ +A LP D+IE+R + + G + P +L ++ GY ++ +++ + Sbjct: 44 EKTVTLSVVDNTAVLPSDFIELRSITGSNGYVYKPVNPAQATLYADEVGYYLSGNTLVFV 103 Query: 104 P--ISSEDVVLYYYA--CIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTA 159 + SE +LY+ A + P N WL R P +Y+ +++E WE+ + Sbjct: 104 GNYVESEVTILYWSAFEGLSPTQSN----WLFDRFPTVYVAAVMKEFYRWEKDAEGVAIE 159 Query: 160 SALFQESIKKLQQNDIRS 177 A QE + ++ ++D R Sbjct: 160 QATLQELLGRVAEDDRRG 177 >gi|160897393|ref|YP_001562975.1| constituent protein [Delftia acidovorans SPH-1] gi|160362977|gb|ABX34590.1| constituent protein [Delftia acidovorans SPH-1] Length = 205 Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 45/184 (24%), Positives = 83/184 (45%), Gaps = 14/184 (7%) Query: 5 FASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEG--SAPLPD 62 +A L ++ ++++ T +F+ E +I R LRLR+ K+ L G LPD Sbjct: 6 YAGLQASIARWLKRSDLTANIPDFISLAEGRIARDLRLRKQITKVTLVTTGGVQGVDLPD 65 Query: 63 DYIEIR--FLEDNKGNKLDHLPLSISLQKNKG--------YIITADSICVLPISSE--DV 110 +Y+E L+ + +L +P+ K Y + + + + P+ ++ Sbjct: 66 NYLEAENITLDGGEPRQLTFVPIEHLDTKYPANYTAMPGVYTLLGNQLLLGPVPDGVYNI 125 Query: 111 VLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKL 170 + YYA PL++ N NWLL P +YL+ + E L+ ++A ++ + I L Sbjct: 126 DVTYYARFVPLSDVNTTNWLLQNHPSIYLFSALAEAELFLVNDERAVLWASKATQEIGTL 185 Query: 171 QQND 174 Q D Sbjct: 186 QSVD 189 >gi|27476047|ref|NP_775249.1| constituent protein [Pseudomonas phage PaP3] gi|27414477|gb|AAL85563.1| ORF.09 [Pseudomonas phage PaP3] Length = 208 Score = 54.3 bits (129), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 24/153 (15%) Query: 28 FLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDYIEIRFLEDNK---GNKLDHLPLS 84 F+ ++NR LR+RE E L G +P+ ++ +R + + G L ++ Sbjct: 31 FIGNTTDRLNRLLRVRENEHFETLMAFGGGIEIPEHFVALRSITGDALIGGRTLQYITQD 90 Query: 85 I---------SLQKNKGYIITADSICVLPISSEDV--VLYYYACIPPLTENNPVNWLLHR 133 I Q Y + V P+ + ++ Y+ +P L+ +NP W L + Sbjct: 91 IFSHYVNYNYQPQGVTYYTRLGNFWRVYPVVPDGAPFIVNYWTVLPELSLDNPTTWALTK 150 Query: 134 APDLYLYGLVEEIAL----------WEQKIDKA 156 P +YLYG++E+I L W QK+++A Sbjct: 151 YPQIYLYGVLEQIYLYTMDEERSMFWGQKLERA 183 >gi|167600433|ref|YP_001671933.1| phage particle protein [Pseudomonas phage LUZ24] gi|161168296|emb|CAP45461.1| phage particle protein [Pseudomonas phage LUZ24] Length = 208 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 37/153 (24%), Positives = 67/153 (43%), Gaps = 24/153 (15%) Query: 28 FLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDYIEIRFLEDNK---GNKLDHLP-- 82 F+ ++NR LR+RE E L G +P+ ++ +R + + G L ++ Sbjct: 31 FIGNTTDRLNRLLRVRENEHFETLMAFGGGIEIPEHFVALRSITGDSLIGGRTLQYITQD 90 Query: 83 -------LSISLQKNKGYIITADSICVLPISSEDV--VLYYYACIPPLTENNPVNWLLHR 133 + Q Y + V P+ + ++ Y+ +P L+ NP W L + Sbjct: 91 TFTHYVNYNYQPQGVTYYTRLGNFWRVFPVVPDGAPFIVNYWTVLPELSLANPTTWALTK 150 Query: 134 APDLYLYGLVEEIAL----------WEQKIDKA 156 P +YLYG++E+I L W QK+++A Sbjct: 151 YPQIYLYGVLEQIYLYTMDEARSQFWGQKLERA 183 >gi|241760941|ref|ZP_04759030.1| constituent protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374560|gb|EER64021.1| constituent protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 219 Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 44/183 (24%), Positives = 79/183 (43%), Gaps = 14/183 (7%) Query: 5 FASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAP--LPD 62 + LL ++ ++E+ F+ E + R L E+E +I V P LPD Sbjct: 16 YDGLLKAIALWLERDDLADEIPYFVQLAEARFRRLLTNPELETEIT---VAAGCPVALPD 72 Query: 63 DYIEIRFLEDNKGNK---LDHLPLSISLQ---KNKG-YIITADSICVLPISSEDVV--LY 113 D+ +R L G + LP + Q +NK + + A + + P+ + L Sbjct: 73 DFEALRLLYPAGGRRDRAFLQLPPDVFQQGKHQNKSVFTLIAGQLWISPLPETETAFSLV 132 Query: 114 YYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQN 173 Y A +P L+ N NWLL PD+YL+G + + + + + E++ +L + Sbjct: 133 YRAALPSLSLNRQSNWLLTSHPDIYLFGSLLQAEFFGWNDSRLPVIKSALDEALGELNKA 192 Query: 174 DIR 176 +R Sbjct: 193 GLR 195 >gi|260753093|ref|YP_003225986.1| constituent protein [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258552456|gb|ACV75402.1| constituent protein [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 219 Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 44/184 (23%), Positives = 79/184 (42%), Gaps = 16/184 (8%) Query: 5 FASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAP--LPD 62 + LL ++ ++E+ F+ E + R L E+E +I V P LPD Sbjct: 16 YDGLLKAIALWLERDDLADEIPYFVQLAEARFRRLLTNPELETEIT---VAAGCPVALPD 72 Query: 63 DYIEIRFLEDNKGNKLDHLPLSIS---LQKNKG-----YIITADSICVLPISSEDVV--L 112 D+ +R L G + D L +S Q+ K + + A + + P+ + L Sbjct: 73 DFETLRLLYPAGGRR-DRAFLQLSPDVFQQGKHQNKSVFTLMAGQLWISPLPETETAFSL 131 Query: 113 YYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQ 172 Y A +P L+ N NWLL PD+YL+G + + + + + E++ +L + Sbjct: 132 VYRAALPSLSLNRQSNWLLASHPDIYLFGSLLQAEFFGWNDSRLPVIKSALDEALGELNK 191 Query: 173 NDIR 176 +R Sbjct: 192 AGLR 195 >gi|56551285|ref|YP_162124.1| hypothetical protein ZMO0389 [Zymomonas mobilis subsp. mobilis ZM4] gi|56542859|gb|AAV89013.1| constituent protein [Zymomonas mobilis subsp. mobilis ZM4] Length = 219 Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 44/184 (23%), Positives = 79/184 (42%), Gaps = 16/184 (8%) Query: 5 FASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAP--LPD 62 + LL ++ ++E+ F+ E + R L E+E +I V P LPD Sbjct: 16 YDGLLKAIALWLERDDLADEIPYFVQLAEARFRRLLTNPELETEIT---VAAGCPVTLPD 72 Query: 63 DYIEIRFLEDNKGNKLDHLPLSIS---LQKNKG-----YIITADSICVLPISSEDVV--L 112 D+ +R L G + D L +S Q+ K + + A + + P+ + L Sbjct: 73 DFEALRLLYPAGGRR-DRAFLQLSPDVFQQGKHQNKSVFTLMAGQLWISPLPETETAFSL 131 Query: 113 YYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQ 172 Y A +P L+ N NWLL PD+YL+G + + + + + E++ +L + Sbjct: 132 VYRAALPSLSLNRQSNWLLASHPDIYLFGSLLQAEFFGWNDSRLPVIKSALDEALGELNK 191 Query: 173 NDIR 176 +R Sbjct: 192 AGLR 195 >gi|221199515|ref|ZP_03572559.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221205584|ref|ZP_03578599.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221174422|gb|EEE06854.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221180800|gb|EEE13203.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 198 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 1/67 (1%) Query: 92 GYIITADSICVLPISSEDVV-LYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWE 150 G+ I D++ V+ + YY I PL++ N NWLL APD+YL+G + E A++ Sbjct: 100 GFQINGDTLAFGADQLGQVLRIDYYTVIEPLSDTNESNWLLEDAPDIYLFGALHEAAMYV 159 Query: 151 QKIDKAT 157 + +AT Sbjct: 160 RDDARAT 166 >gi|291334401|gb|ADD94057.1| hypothetical protein BBta_5781 [uncultured phage MedDCM-OCT-S01-C1] Length = 217 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 2/93 (2%) Query: 88 QKNKGYIITADSICVLPISSEDVV-LYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEI 146 ++ + Y + A+ I V P + D++ + Y+ +P LT + NWL + PD Y++GL EI Sbjct: 111 ERQRYYTVIANQIQVAPPTDNDMLEIVYFQRVPALTTDTDTNWLSEKHPDCYIFGLCAEI 170 Query: 147 ALWEQKIDKATTASALFQESIKKLQQND-IRSW 178 + + + F ES+ + +D I W Sbjct: 171 SAFAKDAMGFELYKMRFNESLGDIAMDDQITRW 203 >gi|167583568|ref|YP_001671758.1| conserved hypothetical protein [Enterobacteria phage phiEco32] gi|164375406|gb|ABY52814.1| conserved hypothetical protein [Enterobacteria phage phiEco32] Length = 250 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 44/174 (25%), Positives = 73/174 (41%), Gaps = 32/174 (18%) Query: 21 FTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDYIEIRFLEDNKGNKLDH 80 F NF++ E +I R LR+ +E+++ L + +G A +P DY+E +++ K + Sbjct: 31 FVNQIPNFINFAEKEIYRNLRIPPLEKEVYLDIKDGVAYIPPDYLEAQWMMRAKDGTIFQ 90 Query: 81 L--PLSISLQKNKGYI------------------------ITADSI------CVLPISSE 108 + P IS ++ G I I AD+ L + Sbjct: 91 VTSPEEISYRRQHGTINPSHWNNQPVNFARFGSRFIFYPSIEADTPYYPDDGSPLIPAEN 150 Query: 109 DVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASAL 162 V+L YYA P E+ + +L AP+L LY + L+ Q + SAL Sbjct: 151 SVILSYYADPPEFHEDTDTSAILTIAPELLLYFTLRHACLFVQDDNGVQKWSAL 204 >gi|291334590|gb|ADD94240.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C136] Length = 135 Score = 42.4 bits (98), Expect = 0.026, Method: Compositional matrix adjust. Identities = 27/101 (26%), Positives = 48/101 (47%), Gaps = 4/101 (3%) Query: 78 LDHLPLSISLQKNKGYIITADSICVLPI--SSEDVVLYYYACIPPLTENNPVNWLLHRAP 135 +D + S + Y I DS PI S+ L YYA L++ N N++L P Sbjct: 1 MDQIKGSSTTGMPSAYTILGDSFRFAPIPDSAYTGTLNYYAKFAALSDTNTSNYILASHP 60 Query: 136 DLYLYGLVEEIALWEQKID--KATTASALFQESIKKLQQND 174 +YLYG + A + +D + ++ ++++L++ND Sbjct: 61 AIYLYGSLYHAANFLGGVDPQRLQQWQGMYTTAMERLERND 101 >gi|291337048|gb|ADD96568.1| constituent protein [uncultured organism MedDCM-OCT-S11-C359] Length = 179 Score = 38.1 bits (87), Expect = 0.43, Method: Compositional matrix adjust. Identities = 25/105 (23%), Positives = 49/105 (46%), Gaps = 12/105 (11%) Query: 74 KGNKLDHLPLSISLQKNKGYIITADSICVLPI--SSEDVVLYYYACIPPLTENNPVNWLL 131 KG+ +P++ Y I D+ P S+ L YY L+++N N++L Sbjct: 50 KGSSTSGMPVA--------YTIIGDNFRFAPTPDSTYTGTLNYYKSFDALSDSNTTNYIL 101 Query: 132 HRAPDLYLYGLVEEIALWEQKIDKATTA--SALFQESIKKLQQND 174 P +YLYG + A + ++ A ++ ++++L++ND Sbjct: 102 TNHPAIYLYGSLYHAANFLGGVEPARLQQWQGMYTTALERLERND 146 >gi|291334634|gb|ADD94282.1| constituent protein [uncultured phage MedDCM-OCT-S04-C64] Length = 216 Score = 37.4 bits (85), Expect = 0.81, Method: Compositional matrix adjust. Identities = 45/160 (28%), Positives = 74/160 (46%), Gaps = 26/160 (16%) Query: 4 DFASLLIDVSHYVEKGGFTY--------LFKNFLHRIEVKINR----ALRLREMERKIVL 51 +++ L +S++ + FT L + +L R+ V + +R +++RK Sbjct: 5 NYSDLKNAISNWTARSDFTTSNLNDFIDLAERYLKRVPVPVEAPEIGGIR-GDIQRK-TG 62 Query: 52 SLVEGSA--PLPDDYIEI-RF-LEDNKGNKLDHL-PLSISLQKNKG-----YIITADSIC 101 +LV G+A LP D+ EI RF L + L L P +SL G + +D I Sbjct: 63 TLVAGTATLALPTDFQEIYRFTLTADTFTTLRFLAPNQLSLNHRSGSGKPAFFTISDVIE 122 Query: 102 --VLPISSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYL 139 V P S+ L YY + L+++N NW++ PD YL Sbjct: 123 FEVAPDSNYAYELSYYPMVTGLSDSNTTNWIIENYPDAYL 162 >gi|291334843|gb|ADD94483.1| hypothetical protein [uncultured phage MedDCM-OCT-S06-C1041] Length = 179 Score = 36.6 bits (83), Expect = 1.6, Method: Compositional matrix adjust. Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 13/108 (12%) Query: 44 EMERKIVLSLVEGSA--PLPDDYIEI-RF-LEDNKGNKLDHL-PLSISLQKNKG-----Y 93 +++RK +LV G+A LP D+ EI RF L + L L P +SL G + Sbjct: 19 DIQRK-TGTLVAGTATLALPTDFQEIYRFTLTADTFTTLRFLAPNQLSLNHRSGSGKPAF 77 Query: 94 IITADSIC--VLPISSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYL 139 +D I V P S+ L YY + L+++N NW++ PD YL Sbjct: 78 FTISDVIEFEVAPDSNYAYELSYYPMVTGLSDSNTTNWIIENYPDAYL 125 >gi|83764465|dbj|BAE54609.1| unnamed protein product [Aspergillus oryzae] Length = 695 Score = 36.6 bits (83), Expect = 1.6, Method: Compositional matrix adjust. Identities = 16/43 (37%), Positives = 23/43 (53%) Query: 92 GYIITADSICVLPISSEDVVLYYYACIPPLTENNPVNWLLHRA 134 GYI T ++ L I+ + V+YYY C+P N + LHR Sbjct: 257 GYICTGEAYVFLHIADDPTVVYYYLCVPNRDVNPEDKYHLHRT 299 >gi|317138817|ref|XP_001816611.2| hypothetical protein AOR_1_1268184 [Aspergillus oryzae RIB40] Length = 450 Score = 36.2 bits (82), Expect = 2.1, Method: Compositional matrix adjust. Identities = 16/43 (37%), Positives = 23/43 (53%) Query: 92 GYIITADSICVLPISSEDVVLYYYACIPPLTENNPVNWLLHRA 134 GYI T ++ L I+ + V+YYY C+P N + LHR Sbjct: 257 GYICTGEAYVFLHIADDPTVVYYYLCVPNRDVNPEDKYHLHRT 299 >gi|238504500|ref|XP_002383481.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] gi|220690952|gb|EED47301.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] Length = 551 Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust. Identities = 16/43 (37%), Positives = 23/43 (53%) Query: 92 GYIITADSICVLPISSEDVVLYYYACIPPLTENNPVNWLLHRA 134 GYI T ++ L I+ + V+YYY C+P N + LHR Sbjct: 113 GYICTGEAYVFLHIADDPTVVYYYLCVPNRDVNPEDKYHLHRT 155 >gi|224081162|ref|XP_002306315.1| predicted protein [Populus trichocarpa] gi|222855764|gb|EEE93311.1| predicted protein [Populus trichocarpa] Length = 278 Score = 35.0 bits (79), Expect = 3.8, Method: Compositional matrix adjust. Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 7/74 (9%) Query: 15 YVEKGGFTYLFKNFLHRIEV-KINRALRLREMERKIVLSLVEGSAPLPDDYIEIRFLEDN 73 YV KG T L+KN H+I K + + + I L L S PLPDD +E ++D Sbjct: 165 YVNKGDATLLYKN--HKIGTGKFPEVDQDADSSKSIELGLSGSSGPLPDD-VETS-IQDK 220 Query: 74 KGNKLDHLPLSISL 87 +G K H+ LSI + Sbjct: 221 RGKK--HVSLSIRM 232 >gi|291532915|emb|CBL06028.1| Predicted RNA-binding protein homologous to eukaryotic snRNP [Megamonas hypermegale ART12/1] Length = 242 Score = 34.7 bits (78), Expect = 5.7, Method: Compositional matrix adjust. Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 4/60 (6%) Query: 51 LSLVEGSAPLPDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDV 110 S GS+ +P DY+E R+L+ G K P + NK +T D ++PI ++D+ Sbjct: 186 FSKARGSSKIPVDYVECRYLKKPSGAK----PGFVIFTNNKTLYVTPDEDEIMPIINKDL 241 >gi|327480328|gb|AEA83638.1| oligopeptidase [Pseudomonas stutzeri DSM 4166] Length = 676 Score = 34.3 bits (77), Expect = 7.0, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 1/55 (1%) Query: 122 TENNPVNWLLHR-APDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDI 175 T +P WL +R A D+ Y E L EQ D+A ALFQE ++++ D+ Sbjct: 11 TGTDPYRWLENRDAEDVLAYLKAENAYLEEQLADQAELREALFQEIKSRIRETDL 65 >gi|159109251|ref|XP_001704891.1| Kinase, NEK-frag [Giardia lamblia ATCC 50803] gi|157432966|gb|EDO77217.1| Kinase, NEK-frag [Giardia lamblia ATCC 50803] Length = 778 Score = 34.3 bits (77), Expect = 7.7, Method: Compositional matrix adjust. Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 9/81 (11%) Query: 68 RFLEDNKGNKLDHLPLSISLQKNKGYIITADS----IC---VLPISSEDVVLYYYACIPP 120 R D+ +D P SIS+ ++ Y+++ S IC V PIS+ D +L++Y Sbjct: 140 RLKNDSPPVHMDIRPSSISVTRDGNYLLSVFSPNKFICQPSVEPISAHDYILFHYGPAED 199 Query: 121 LTENNPVNWLLHRAPDLYLYG 141 + EN ++ AP+ +L G Sbjct: 200 IKEN--AQQAIYTAPEWFLTG 218 Searching..................................................done Results from round 2 >gi|315122531|ref|YP_004063020.1| hypothetical protein CKC_03915 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495933|gb|ADR52532.1| hypothetical protein CKC_03915 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 191 Score = 247 bits (631), Expect = 4e-64, Method: Composition-based stats. Identities = 149/177 (84%), Positives = 164/177 (92%) Query: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 MAKD+ASLLIDVSHY EK GFT LFK+FLHRIEVKINR LRLREME+K LSLVEGS L Sbjct: 1 MAKDYASLLIDVSHYTEKSGFTCLFKDFLHRIEVKINRELRLREMEKKAFLSLVEGSVIL 60 Query: 61 PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPP 120 PDD+IE+RFL D++GNKLDHLPLSISLQKNKGYIITADSICV P SSEDVVLYYYACIPP Sbjct: 61 PDDFIEMRFLTDHQGNKLDHLPLSISLQKNKGYIITADSICVSPPSSEDVVLYYYACIPP 120 Query: 121 LTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRS 177 LT+NNP+NWLL R+PDLYLYGL+EE+ALWEQKIDKA+TA ALFQESIK+LQ+NDIRS Sbjct: 121 LTDNNPINWLLQRSPDLYLYGLIEEVALWEQKIDKASTAGALFQESIKRLQKNDIRS 177 >gi|254781138|ref|YP_003065551.1| hypothetical protein CLIBASIA_05200 [Candidatus Liberibacter asiaticus str. psy62] gi|254040815|gb|ACT57611.1| hypothetical protein CLIBASIA_05200 [Candidatus Liberibacter asiaticus str. psy62] Length = 178 Score = 246 bits (628), Expect = 8e-64, Method: Composition-based stats. Identities = 178/178 (100%), Positives = 178/178 (100%) Query: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL Sbjct: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 Query: 61 PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPP 120 PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPP Sbjct: 61 PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPP 120 Query: 121 LTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRSW 178 LTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRSW Sbjct: 121 LTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRSW 178 >gi|150397025|ref|YP_001327492.1| hypothetical protein Smed_1822 [Sinorhizobium medicae WSM419] gi|150028540|gb|ABR60657.1| hypothetical protein Smed_1822 [Sinorhizobium medicae WSM419] Length = 200 Score = 207 bits (527), Expect = 4e-52, Method: Composition-based stats. Identities = 67/183 (36%), Positives = 96/183 (52%), Gaps = 8/183 (4%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPD 62 D+ SLL+D Y + + F FL E+K+NR LR+ +ME + L +G LP Sbjct: 4 SDYGSLLVDAGEYSGREDIAHNFPRFLGLAELKLNRGLRVADMEVTAAIPLADGDGTLPA 63 Query: 63 DYIEIRFLEDNKGNKLDHLPLSISLQKNK--------GYIITADSICVLPISSEDVVLYY 114 D++E R + G + +PL + GY I I V +S + + + Y Sbjct: 64 DFLEAREVRSGAGVSIRAVPLQQLTNGYRDRNGVQPAGYSIVGRRIKVRSVSDQGLTITY 123 Query: 115 YACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQND 174 YA IP LT +NP NWLL +APD+YL+ LV EIA+W + +D AT A L +I L+ D Sbjct: 124 YARIPALTPSNPTNWLLEKAPDVYLFALVNEIAIWGKDVDGATAAQQLMMLAISGLRIED 183 Query: 175 IRS 177 RS Sbjct: 184 ERS 186 >gi|227822439|ref|YP_002826411.1| hypothetical protein NGR_c18940 [Sinorhizobium fredii NGR234] gi|227341440|gb|ACP25658.1| hypothetical protein NGR_c18940 [Sinorhizobium fredii NGR234] Length = 198 Score = 204 bits (518), Expect = 5e-51, Method: Composition-based stats. Identities = 77/183 (42%), Positives = 103/183 (56%), Gaps = 7/183 (3%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPD 62 D+ASLL+D Y + T+LF FL E+K+NR +R+ +MER + + L +G L Sbjct: 4 SDYASLLVDAGDYSGRDDVTHLFPRFLALAELKLNRVMRVADMERMVTVPLTDGEGSLSA 63 Query: 63 DYIEIRFLEDNKGNKLDHLPLS------ISLQKNKGYIITADSICVLPISSEDVVLYYYA 116 D++E R + G L LPL S GY I +I V P +ED+ L YYA Sbjct: 64 DFLEARQVLAADGRVLRALPLQELGRHATSGGGPIGYAIVGSAIQVRPKGAEDIRLTYYA 123 Query: 117 CIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIR 176 IPPLT P NWL+ RAPD+YLY LVEEIA+WE+ KA A AL ++++ L D R Sbjct: 124 KIPPLTAVAPSNWLIERAPDVYLYALVEEIAIWERDAGKAGAAEALKRQAMAGLGLADER 183 Query: 177 -SW 178 W Sbjct: 184 LRW 186 >gi|241760941|ref|ZP_04759030.1| constituent protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374560|gb|EER64021.1| constituent protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 219 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. Identities = 40/182 (21%), Positives = 75/182 (41%), Gaps = 10/182 (5%) Query: 5 FASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDY 64 + LL ++ ++E+ F+ E + R L E+E +I + LPDD+ Sbjct: 16 YDGLLKAIALWLERDDLADEIPYFVQLAEARFRRLLTNPELETEITV-AAGCPVALPDDF 74 Query: 65 IEIRFLEDNKGNK---LDHLPLSISLQKNKG----YIITADSICVLPISSEDVV--LYYY 115 +R L G + LP + Q + + A + + P+ + L Y Sbjct: 75 EALRLLYPAGGRRDRAFLQLPPDVFQQGKHQNKSVFTLIAGQLWISPLPETETAFSLVYR 134 Query: 116 ACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDI 175 A +P L+ N NWLL PD+YL+G + + + + + E++ +L + + Sbjct: 135 AALPSLSLNRQSNWLLTSHPDIYLFGSLLQAEFFGWNDSRLPVIKSALDEALGELNKAGL 194 Query: 176 RS 177 R Sbjct: 195 RK 196 >gi|160897393|ref|YP_001562975.1| constituent protein [Delftia acidovorans SPH-1] gi|160362977|gb|ABX34590.1| constituent protein [Delftia acidovorans SPH-1] Length = 205 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. Identities = 45/184 (24%), Positives = 83/184 (45%), Gaps = 14/184 (7%) Query: 5 FASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEG--SAPLPD 62 +A L ++ ++++ T +F+ E +I R LRLR+ K+ L G LPD Sbjct: 6 YAGLQASIARWLKRSDLTANIPDFISLAEGRIARDLRLRKQITKVTLVTTGGVQGVDLPD 65 Query: 63 DYIEIR--FLEDNKGNKLDHLPLSISLQKNKG--------YIITADSICVLPISSE--DV 110 +Y+E L+ + +L +P+ K Y + + + + P+ ++ Sbjct: 66 NYLEAENITLDGGEPRQLTFVPIEHLDTKYPANYTAMPGVYTLLGNQLLLGPVPDGVYNI 125 Query: 111 VLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKL 170 + YYA PL++ N NWLL P +YL+ + E L+ ++A ++ + I L Sbjct: 126 DVTYYARFVPLSDVNTTNWLLQNHPSIYLFSALAEAELFLVNDERAVLWASKATQEIGTL 185 Query: 171 QQND 174 Q D Sbjct: 186 QSVD 189 >gi|56551285|ref|YP_162124.1| hypothetical protein ZMO0389 [Zymomonas mobilis subsp. mobilis ZM4] gi|56542859|gb|AAV89013.1| constituent protein [Zymomonas mobilis subsp. mobilis ZM4] Length = 219 Score = 177 bits (448), Expect = 7e-43, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 73/182 (40%), Gaps = 10/182 (5%) Query: 5 FASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDY 64 + LL ++ ++E+ F+ E + R L E+E +I + LPDD+ Sbjct: 16 YDGLLKAIALWLERDDLADEIPYFVQLAEARFRRLLTNPELETEITV-AAGCPVTLPDDF 74 Query: 65 IEIRFLEDNKGNKLDH-------LPLSISLQKNKGYIITADSICVLPISSEDVV--LYYY 115 +R L G + + Q + + A + + P+ + L Y Sbjct: 75 EALRLLYPAGGRRDRAFLQLSPDVFQQGKHQNKSVFTLMAGQLWISPLPETETAFSLVYR 134 Query: 116 ACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDI 175 A +P L+ N NWLL PD+YL+G + + + + + E++ +L + + Sbjct: 135 AALPSLSLNRQSNWLLASHPDIYLFGSLLQAEFFGWNDSRLPVIKSALDEALGELNKAGL 194 Query: 176 RS 177 R Sbjct: 195 RK 196 >gi|260753093|ref|YP_003225986.1| constituent protein [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258552456|gb|ACV75402.1| constituent protein [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 219 Score = 176 bits (447), Expect = 9e-43, Method: Composition-based stats. Identities = 38/182 (20%), Positives = 73/182 (40%), Gaps = 10/182 (5%) Query: 5 FASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDY 64 + LL ++ ++E+ F+ E + R L E+E +I + LPDD+ Sbjct: 16 YDGLLKAIALWLERDDLADEIPYFVQLAEARFRRLLTNPELETEITV-AAGCPVALPDDF 74 Query: 65 IEIRFLEDNKGNKLDH-------LPLSISLQKNKGYIITADSICVLPISSEDVV--LYYY 115 +R L G + + Q + + A + + P+ + L Y Sbjct: 75 ETLRLLYPAGGRRDRAFLQLSPDVFQQGKHQNKSVFTLMAGQLWISPLPETETAFSLVYR 134 Query: 116 ACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDI 175 A +P L+ N NWLL PD+YL+G + + + + + E++ +L + + Sbjct: 135 AALPSLSLNRQSNWLLASHPDIYLFGSLLQAEFFGWNDSRLPVIKSALDEALGELNKAGL 194 Query: 176 RS 177 R Sbjct: 195 RK 196 >gi|148257049|ref|YP_001241634.1| hypothetical protein BBta_5781 [Bradyrhizobium sp. BTAi1] gi|146409222|gb|ABQ37728.1| hypothetical protein BBta_5781 [Bradyrhizobium sp. BTAi1] Length = 217 Score = 163 bits (413), Expect = 8e-39, Method: Composition-based stats. Identities = 57/200 (28%), Positives = 90/200 (45%), Gaps = 24/200 (12%) Query: 1 MAKDFASLLIDVSHYVEKGGFTYL---FKNFLHRIEVKINRALRLREMERKIVLSLVEGS 57 + +++SL VS Y+ + T L +F+ E K+NR L +R+MER+ + + GS Sbjct: 3 LISNYSSLQSAVSEYLARDQDTTLVSRIPSFIQLAEAKLNRQLFVRQMERRAIAVVDLGS 62 Query: 58 AP-----LPDDYIEIRF-----------LEDNKGNKLDHLPLSIS--LQKNKGYIITADS 99 A LP D+ +R LE G +LD I + + + + D Sbjct: 63 AEPNFIALPSDFQSMRRIRLTSVRGQPPLEFKSGVQLDEYRSRICNVAGQPRYFSLFGDQ 122 Query: 100 ICVLPISS--EDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKAT 157 I + P + + Y IPPL N+ NWLL APDLYLYG + E A + ++ + Sbjct: 123 IELAPRPDAAYTIEMVYRRTIPPLASND-SNWLLMLAPDLYLYGALLESAPYLKEDGRIQ 181 Query: 158 TASALFQESIKKLQQNDIRS 177 T F ++ L + S Sbjct: 182 TWGLGFSTALADLNTLGLSS 201 >gi|27476047|ref|NP_775249.1| constituent protein [Pseudomonas phage PaP3] gi|27414477|gb|AAL85563.1| ORF.09 [Pseudomonas phage PaP3] Length = 208 Score = 152 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 80/187 (42%), Gaps = 15/187 (8%) Query: 3 KDFASLLIDVSHYVEKGGFT-YLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLP 61 + L I + ++ T L F+ ++NR LR+RE E L G +P Sbjct: 5 NNVTDLAIAAIQWSDRQDLTQELLMLFIGNTTDRLNRLLRVRENEHFETLMAFGGGIEIP 64 Query: 62 DDYIEIRFLEDN---KGNKLDHLPLSI---------SLQKNKGYIITADSICVLPISSED 109 + ++ +R + + G L ++ I Q Y + V P+ + Sbjct: 65 EHFVALRSITGDALIGGRTLQYITQDIFSHYVNYNYQPQGVTYYTRLGNFWRVYPVVPDG 124 Query: 110 V--VLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESI 167 ++ Y+ +P L+ +NP W L + P +YLYG++E+I L+ +++ + ++ Sbjct: 125 APFIVNYWTVLPELSLDNPTTWALTKYPQIYLYGVLEQIYLYTMDEERSMFWGQKLERAV 184 Query: 168 KKLQQND 174 +LQ + Sbjct: 185 MELQNEE 191 >gi|167600433|ref|YP_001671933.1| phage particle protein [Pseudomonas phage LUZ24] gi|161168296|emb|CAP45461.1| phage particle protein [Pseudomonas phage LUZ24] Length = 208 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 39/187 (20%), Positives = 77/187 (41%), Gaps = 15/187 (8%) Query: 3 KDFASLLIDVSHYVEKGGFT-YLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLP 61 + L I + ++ T L F+ ++NR LR+RE E L G +P Sbjct: 5 NNVTDLAIAAIQWSDRQDLTQELLMLFIGNTTDRLNRLLRVRENEHFETLMAFGGGIEIP 64 Query: 62 DDYIEIRFLEDN---KGNKLDHLPLSI---------SLQKNKGYIITADSICVLPISSED 109 + ++ +R + + G L ++ Q Y + V P+ + Sbjct: 65 EHFVALRSITGDSLIGGRTLQYITQDTFTHYVNYNYQPQGVTYYTRLGNFWRVFPVVPDG 124 Query: 110 V--VLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESI 167 ++ Y+ +P L+ NP W L + P +YLYG++E+I L+ ++ + ++ Sbjct: 125 APFIVNYWTVLPELSLANPTTWALTKYPQIYLYGVLEQIYLYTMDEARSQFWGQKLERAV 184 Query: 168 KKLQQND 174 +LQ + Sbjct: 185 MELQNEE 191 >gi|170752055|ref|YP_001758315.1| hypothetical protein Mrad2831_5687 [Methylobacterium radiotolerans JCM 2831] gi|170658577|gb|ACB27632.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 217 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. Identities = 53/188 (28%), Positives = 76/188 (40%), Gaps = 22/188 (11%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSA---P 59 D ASL V Y+ + F+ E N LR REME + L P Sbjct: 6 TDLASLRAAVLDYIARPDLADAVPGFIALAESHFNAILRAREMEAEASLGTAAAPVAVMP 65 Query: 60 LPDDYIEIRFLE---------------DNKGNKLDHLPLSISLQKNKGYIITADSICVLP 104 LP D+IE + D+ + H P + + I A ++ ++P Sbjct: 66 LPADFIEWLAVSWAGSGRTARPTFAEADSPEARFRHRPG----GDPQYFAIRAGNVRMVP 121 Query: 105 ISSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQ 164 V L YYA IPPLT+ P NWLL +APD YLY ++ E L+++ + L Sbjct: 122 EKPGAVTLAYYAAIPPLTDAAPSNWLLAKAPDAYLYAVLAEAYLFQKDPAAVQAHTGLML 181 Query: 165 ESIKKLQQ 172 + L Sbjct: 182 SVLNALGI 189 >gi|307308930|ref|ZP_07588613.1| hypothetical protein SinmeBDRAFT_4497 [Sinorhizobium meliloti BL225C] gi|306900564|gb|EFN31177.1| hypothetical protein SinmeBDRAFT_4497 [Sinorhizobium meliloti BL225C] Length = 191 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 80/180 (44%), Gaps = 6/180 (3%) Query: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 M + + ++ + F+ E + + E+ + LS+V+ +A L Sbjct: 1 MTLTYDDFIATINSLTVRTD--APIPTFISLAEDSLRTLTKHYLAEKTVTLSVVDNTAVL 58 Query: 61 PDDYIEIRFLEDNKGNKLDHL-PLSISLQKNK-GYIITADSICVLPI-SSEDVVLYYYAC 117 P D+IE+R + + G + P +L ++ GY ++ +++ + +V + Y++ Sbjct: 59 PSDFIELRSITGSNGYVYKPVNPAQATLYADEVGYYLSGNTLVFVGNYVESEVTILYWSA 118 Query: 118 IPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRS 177 L+ NWL R P +Y+ +++E WE+ + A QE + ++ ++D R Sbjct: 119 FEGLSP-TQSNWLFDRFPTVYVAAVMKEFYRWEKDAEGVAIEQATLQELLGRVAEDDRRG 177 >gi|291334401|gb|ADD94057.1| hypothetical protein BBta_5781 [uncultured phage MedDCM-OCT-S01-C1] Length = 217 Score = 127 bits (319), Expect = 6e-28, Method: Composition-based stats. Identities = 45/203 (22%), Positives = 80/203 (39%), Gaps = 29/203 (14%) Query: 4 DFASLLIDVSHYVEK--GGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAP-- 59 ++ + YV++ T F +E K+N AL+ E + + L E Sbjct: 2 NYKQIKEAAQGYVDRYDDELTSAIPGFTKVVESKVNSALKTGEQSVRAQIWL-ERDIEYY 60 Query: 60 -LPDDYIEIRFLE----------------DNKGNKLDHLPLS-----ISLQKNKGYIITA 97 LP D+ R +E G L ++ ++ + Y + A Sbjct: 61 GLPSDWGGFRDVELVQKGSNSITPGQGIHPVGGMTLIYVNPEQMNGTHRRERQRYYTVIA 120 Query: 98 DSICVLPISSEDVV-LYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKA 156 + I V P + D++ + Y+ +P LT + NWL + PD Y++GL EI+ + + Sbjct: 121 NQIQVAPPTDNDMLEIVYFQRVPALTTDTDTNWLSEKHPDCYIFGLCAEISAFAKDAMGF 180 Query: 157 TTASALFQESIKKLQQND-IRSW 178 F ES+ + +D I W Sbjct: 181 ELYKMRFNESLGDIAMDDQITRW 203 >gi|221199515|ref|ZP_03572559.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221205584|ref|ZP_03578599.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221174422|gb|EEE06854.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221180800|gb|EEE13203.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 198 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 72/180 (40%), Gaps = 8/180 (4%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPD 62 ++ L + ++++ + L +F+ E NR + R + + + LP Sbjct: 4 ASYSELQSVMKRWLKRNDLSDLMPDFIMLAEQHFNRKIFTRARRATFQFTPSKFTVDLPP 63 Query: 63 DYIEIRFLEDNKGNKLDHLPLSISLQKNK-------GYIITADSICVLPISSEDVV-LYY 114 D+ + + H S G+ I D++ V+ + Y Sbjct: 64 DWDRLIAVYYGGHPLKQHAIGEESAYAGGQAQLMIGGFQINGDTLAFGADQLGQVLRIDY 123 Query: 115 YACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQND 174 Y I PL++ N NWLL APD+YL+G + E A++ + +AT +I +L D Sbjct: 124 YTVIEPLSDTNESNWLLEDAPDIYLFGALHEAAMYVRDDARATLWMQKRDLAIDELTTED 183 >gi|148557350|ref|YP_001264932.1| hypothetical protein Swit_4456 [Sphingomonas wittichii RW1] gi|148502540|gb|ABQ70794.1| hypothetical protein Swit_4456 [Sphingomonas wittichii RW1] Length = 310 Score = 91.7 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 68/201 (33%), Gaps = 33/201 (16%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALR--------LREMERKIVLSLV 54 ++ L+ ++ +++ ++F+ E +IN L +R M L++ Sbjct: 74 STYSELVASLAAWLDGSDLGGRERDFITLCEDEINARLAAAVGQGAVIRPMMVVDPLTID 133 Query: 55 EGSAPLPD--------------------DYIEIRFLED---NKGNKLDHLPLSISLQKNK 91 LPD DY+ L + + + I + Sbjct: 134 AELVDLPDGDTVKPISIELSGLDRPWSLDYVSPERLVGLRFGEAEQRSAMRAVIDGDPPR 193 Query: 92 GYIITADSICVLPISSE--DVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALW 149 Y + D + P + + IP L++ P NW+L + YLYG + + ++ Sbjct: 194 FYTVIGDQLRFHPAPEGSFGATIARFVKIPALSDAMPTNWVLASHRNAYLYGSLAQAEMF 253 Query: 150 EQKIDKATTASALFQESIKKL 170 + LF ++ L Sbjct: 254 GWNDSGTANWAELFGNAMDGL 274 >gi|291334634|gb|ADD94282.1| constituent protein [uncultured phage MedDCM-OCT-S04-C64] Length = 216 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 38/179 (21%), Positives = 66/179 (36%), Gaps = 22/179 (12%) Query: 3 KDFASLLIDVSHYVEKGGFTY-LFKNFLHRIEVKINRA-----------LRLREMERKIV 50 +++ L +S++ + FT +F+ E + R +R + Sbjct: 4 ANYSDLKNAISNWTARSDFTTSNLNDFIDLAERYLKRVPVPVEAPEIGGIRGDIQRKTGT 63 Query: 51 LSLVEGSAPLPDDYIEIRF--LEDNKGNKLDHL-PLSISLQ-----KNKGYIITADSIC- 101 L + LP D+ EI L + L L P +SL + +D I Sbjct: 64 LVAGTATLALPTDFQEIYRFTLTADTFTTLRFLAPNQLSLNHRSGSGKPAFFTISDVIEF 123 Query: 102 -VLPISSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTA 159 V P S+ L YY + L+++N NW++ PD YL + + Q A + Sbjct: 124 EVAPDSNYAYELSYYPMVTGLSDSNTTNWIIENYPDAYLTASLYHAYRFLQDFQSAQSY 182 >gi|167583568|ref|YP_001671758.1| conserved hypothetical protein [Enterobacteria phage phiEco32] gi|164375406|gb|ABY52814.1| conserved hypothetical protein [Enterobacteria phage phiEco32] Length = 250 Score = 82.1 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 77/206 (37%), Gaps = 34/206 (16%) Query: 5 FASLLIDVSHYVEKGG--FTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPD 62 + L V+ + ++ F NF++ E +I R LR+ +E+++ L + +G A +P Sbjct: 13 YGWLKKAVALWADRDDDEFVNQIPNFINFAEKEIYRNLRIPPLEKEVYLDIKDGVAYIPP 72 Query: 63 DYIEIR-----------FLEDNKGNKLDH----LPLSISLQKNKGYIITADSICVLP--- 104 DY+E + + + + S + + P Sbjct: 73 DYLEAQWMMRAKDGTIFQVTSPEEISYRRQHGTINPSHWNNQPVNFARFGSRFIFYPSIE 132 Query: 105 --------------ISSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWE 150 + V+L YYA P E+ + +L AP+L LY + L+ Sbjct: 133 ADTPYYPDDGSPLIPAENSVILSYYADPPEFHEDTDTSAILTIAPELLLYFTLRHACLFV 192 Query: 151 QKIDKATTASALFQESIKKLQQNDIR 176 Q + SAL + + ++ + + + Sbjct: 193 QDDNGVQKWSALGKAILDEIVEQNKK 218 >gi|294083941|ref|YP_003550698.1| hypothetical protein SAR116_0371 [Candidatus Puniceispirillum marinum IMCC1322] gi|292663513|gb|ADE38614.1| hypothetical protein SAR116_0371 [Candidatus Puniceispirillum marinum IMCC1322] Length = 203 Score = 79.8 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 71/180 (39%), Gaps = 12/180 (6%) Query: 4 DFASLLIDVSHYVEKGGFT-YLFKNFLHRIEVKINRALRLREMER--KIVLSLVEGSAPL 60 ++ L + + T L F+ + +I R LR ER +I +S + L Sbjct: 2 NYGDLKTHFQDVLNRSDITSDLTTRFIDQGIARIQRQLRTPMSERVLEIAISSQTATMTL 61 Query: 61 PDDYIEIRFLEDNKG-------NKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLY 113 P D++EI L + + L + K + A+ I + P ++ ++ Sbjct: 62 PSDFLEIISLYHSSNELERVSMRRFRELNDNNHSGKPIYFARQAEKIHLYPQPTDGTLVL 121 Query: 114 YYAC-IPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQ 172 YY P L+ ++ N L A DL +Y + + + +A F + + ++Q+ Sbjct: 122 YYHGEFPSLSADSDFNALTKAASDLVIYAGLTYASDYYLD-QRAELFEQKFNQFLTEIQE 180 >gi|291337048|gb|ADD96568.1| constituent protein [uncultured organism MedDCM-OCT-S11-C359] Length = 179 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 57/145 (39%), Gaps = 15/145 (10%) Query: 46 ERKIVLSLVEGSAPLPDDYIEIR-----------FLEDNKGNKLDHLPLSISLQKNKGYI 94 E S+ + LP ++++R L ++D + S + Y Sbjct: 3 ESDSSFSINAETVALPSGFLQVRDLFILSGGTKYALTYMTPPQMDQIKGSSTSGMPVAYT 62 Query: 95 ITADSICVLPISS--EDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALW--E 150 I D+ P L YY L+++N N++L P +YLYG + A + Sbjct: 63 IIGDNFRFAPTPDSTYTGTLNYYKSFDALSDSNTTNYILTNHPAIYLYGSLYHAANFLGG 122 Query: 151 QKIDKATTASALFQESIKKLQQNDI 175 + + ++ ++++L++ND Sbjct: 123 VEPARLQQWQGMYTTALERLERNDR 147 >gi|291334590|gb|ADD94240.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C136] Length = 135 Score = 74.8 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 44/102 (43%), Gaps = 4/102 (3%) Query: 78 LDHLPLSISLQKNKGYIITADSICVLPISS--EDVVLYYYACIPPLTENNPVNWLLHRAP 135 +D + S + Y I DS PI L YYA L++ N N++L P Sbjct: 1 MDQIKGSSTTGMPSAYTILGDSFRFAPIPDSAYTGTLNYYAKFAALSDTNTSNYILASHP 60 Query: 136 DLYLYGLVEEIALW--EQKIDKATTASALFQESIKKLQQNDI 175 +YLYG + A + + ++ ++++L++ND Sbjct: 61 AIYLYGSLYHAANFLGGVDPQRLQQWQGMYTTAMERLERNDR 102 >gi|291334843|gb|ADD94483.1| hypothetical protein [uncultured phage MedDCM-OCT-S06-C1041] Length = 179 Score = 63.2 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 30/125 (24%), Positives = 47/125 (37%), Gaps = 10/125 (8%) Query: 45 MERKIVLSLVEGSAPLPDDYIEIRF--LEDNKGNKLDHL-PLSISLQ-----KNKGYIIT 96 + L + LP D+ EI L + L L P +SL + Sbjct: 21 QRKTGTLVAGTATLALPTDFQEIYRFTLTADTFTTLRFLAPNQLSLNHRSGSGKPAFFTI 80 Query: 97 ADSIC--VLPISSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKID 154 +D I V P S+ L YY + L+++N NW++ PD YL + + Q Sbjct: 81 SDVIEFEVAPDSNYAYELSYYPMVTGLSDSNTTNWIIENYPDAYLTASLYHAYRFLQDFQ 140 Query: 155 KATTA 159 A + Sbjct: 141 SAQSY 145 >gi|316934289|ref|YP_004109271.1| hypothetical protein Rpdx1_2957 [Rhodopseudomonas palustris DX-1] gi|315602003|gb|ADU44538.1| hypothetical protein Rpdx1_2957 [Rhodopseudomonas palustris DX-1] Length = 207 Score = 61.7 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 58/158 (36%), Gaps = 19/158 (12%) Query: 4 DFASLLIDVSHYVEKGGFTYLFKN-FLHRIEVKINRALRLREMERKIVLS-----LVEGS 57 ++ L D +++ T + F+ + ++ R LR+ ME+ I + +G Sbjct: 2 NYGELKADFLGLLKRRDLTDAQADNFIRKAVNRVQRVLRIPPMEKSIAVIYDGEQFADGQ 61 Query: 58 APLPDDYIEIRFLE----DNKGNKLDH------LPLSISLQKNKGYIITADSICVLPISS 107 P+P DY+ + L +L L L I + ++ + PI Sbjct: 62 LPIPADYLRLISLSATPPGGSERELAQSDLQTVLDLRIRTGRPTNFVRRGGYWSLAPIPE 121 Query: 108 EDV--VLYYYACIPPLTENNPVNWLLHRAPDLYLYGLV 143 + YY P LT N+L + A D+ + Sbjct: 122 AGTKFRIDYYDEFPALTAGAD-NYLTNGAADMVTSAAL 158 >gi|221271435|dbj|BAH15188.1| hypothetical protein [Serratia phage KSP100] Length = 160 Score = 56.7 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 52/142 (36%), Gaps = 17/142 (11%) Query: 3 KDFASLLIDVSHYVEKGG--FTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 ++A L+ +V ++ + F+ E +I R L++ E + L++ +G + Sbjct: 4 TNYAELVAEVKRWLNRDDPDTLDAIPTFITFAEKEIYRVLKMPVYECLVSLNIKKGRCEI 63 Query: 61 PDDYIEIRFLEDNKGNK---------LDHLPLSISLQKNKGYIITADSICVLPISSED-- 109 P + +++ + + + H+ I ++ Y + + Sbjct: 64 PFNLVQLIEIYTSDNKVGRMTSHREMIRHIADKIKHEQF-YYSRVGNEYHFYGVDDTGEG 122 Query: 110 ---VVLYYYACIPPLTENNPVN 128 V +Y+ L+ NN V Sbjct: 123 KPEVFCHYFQDPEMLSYNNQVT 144 >gi|291334591|gb|ADD94241.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C136] Length = 68 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 29/61 (47%), Gaps = 1/61 (1%) Query: 5 FASLLIDVSHYVEKGGFTYLFK-NFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDD 63 ++ L +++++ + T +F+ E N LR+R+M +++ + +P D Sbjct: 6 YSDLKTSIANWLNRSDLTTEIAEDFIVLAEADFNSKLRIRKMNTSTSITIDSETESVPTD 65 Query: 64 Y 64 + Sbjct: 66 F 66 >gi|291337047|gb|ADD96567.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C359] Length = 75 Score = 42.4 bits (98), Expect = 0.022, Method: Composition-based stats. Identities = 9/43 (20%), Positives = 21/43 (48%), Gaps = 1/43 (2%) Query: 3 KDFASLLIDVSHYVEKGGFTYLFK-NFLHRIEVKINRALRLRE 44 +++ L V++++ + T +F+ E N LR+R+ Sbjct: 4 SNYSELQSSVANWLNRSDLTTEITGDFIVLTEKDFNSKLRIRK 46 >gi|242046494|ref|XP_002399601.1| sentrin/sumo-specific protease, putative [Ixodes scapularis] gi|215497553|gb|EEC07047.1| sentrin/sumo-specific protease, putative [Ixodes scapularis] Length = 341 Score = 38.6 bits (88), Expect = 0.31, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 49/150 (32%), Gaps = 36/150 (24%) Query: 64 YIEIRFLEDNKGNKLD-HLPLSISLQKNK------------GYIITADSICVLPISSEDV 110 ++ +R LED G + H+P N G + D + V+ + ED Sbjct: 61 FVALRRLEDLPGFRFQSHVPPETKSDANSSTSSRGLHRPEHGAQLNGDDVQVVGGTDEDS 120 Query: 111 V----------------------LYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIAL 148 + L Y + L NWL D Y+ GL+ E A Sbjct: 121 LRDRVRSTIRRSRHDPQRVLTRKLGYSVTVSDLRTLLGTNWLNDVVIDFYM-GLIVERAS 179 Query: 149 WEQKIDKATTASALFQESIKKLQQNDIRSW 178 EQ + + F ++ + +R W Sbjct: 180 LEQGGMRVHAVTTHFFNVLRSRGYDAVRRW 209 >gi|163783852|ref|ZP_02178831.1| hypothetical protein HG1285_12877 [Hydrogenivirga sp. 128-5-R1-1] gi|159880875|gb|EDP74400.1| hypothetical protein HG1285_12877 [Hydrogenivirga sp. 128-5-R1-1] Length = 188 Score = 37.4 bits (85), Expect = 0.81, Method: Composition-based stats. Identities = 23/105 (21%), Positives = 48/105 (45%), Gaps = 10/105 (9%) Query: 45 MERKIVLSLVEGSAPLPDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLP 104 + +++ L+L A LPDD++ + K L+ +P K Y I ++ Sbjct: 50 LIKEVDLTLTNNQAVLPDDFVREYSVVSGK-YILNSIPPDAETDKY-SYKIVGKTLY--- 104 Query: 105 ISSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALW 149 +++ V LYY+ P + + + P++++ L+ EIA + Sbjct: 105 STNDSVKLYYFYMYPSYSAVSDTVVI----PNVFV-NLLREIAAY 144 >gi|327308534|ref|XP_003238958.1| cleavage and polyadenylylation specificity factor [Trichophyton rubrum CBS 118892] gi|326459214|gb|EGD84667.1| cleavage and polyadenylylation specificity factor [Trichophyton rubrum CBS 118892] Length = 1024 Score = 37.0 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 18/72 (25%), Positives = 31/72 (43%), Gaps = 1/72 (1%) Query: 56 GSAPLPDDYIEIRFLEDNKGNK-LDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYY 114 SAPL ++ + + + L + S Q GY IT +LP + + + Y Sbjct: 105 ASAPLAATFLPSTSVTASDPSSGLTIQSATSSSQGPSGYEITGSGRILLPPPTNEDIARY 164 Query: 115 YACIPPLTENNP 126 ++ I PL + P Sbjct: 165 FSLIHPLKYSQP 176 >gi|312383817|gb|EFR28745.1| hypothetical protein AND_02903 [Anopheles darlingi] Length = 719 Score = 36.6 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 40/124 (32%), Gaps = 14/124 (11%) Query: 37 NRALRLREMERKIVLSLVEGSAPLPDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIIT 96 R + + E K ++ ++ PL D+ + + D P S + ++ Sbjct: 368 RRDPSISDTETKELIEALQQEEPLGSDW---SVVFPDVPMAGDEAPAGTSSSSGDAFSVS 424 Query: 97 ADSICVLPISSEDVVLYYYACIPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKA 156 + P+ + + P+ WL P++ + GL + +L + Sbjct: 425 GEQWLQEPLDAGQRSFCW-----------PMEWLPKDFPNIRVIGLNYDSSLSQWSATGC 473 Query: 157 TTAS 160 Sbjct: 474 PCEK 477 >gi|58584562|ref|YP_198135.1| outer membrane protein [Wolbachia endosymbiont strain TRS of Brugia malayi] gi|58418878|gb|AAW70893.1| Probable outer membrane protein [Wolbachia endosymbiont strain TRS of Brugia malayi] Length = 983 Score = 36.3 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 22/133 (16%) Query: 2 AKDFASLLIDVSHYVEKGGFTYLFKNFLHRIE---------------------VKINRAL 40 + + A +++S + K +++ E KIN + Sbjct: 313 SANHALFRVNISKFAAK-DLCTYVPDYIINSEFKSWYCDNIDGNILNVIVSFNGKINNLI 371 Query: 41 RLREMERKIVLSLVEGSAPLPDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSI 100 + +V + GS +D+ ++R L + K D L ++++ K + + I +I Sbjct: 372 NNDLSDVIVVADIENGSVKFDEDFEQVRELSGDLVLKNDSLKITVNSAKFQNFTINGCNI 431 Query: 101 CVLPISSEDVVLY 113 + + ED VL Sbjct: 432 EMNSLDREDSVLT 444 >gi|327313420|ref|YP_004328857.1| outer membrane efflux protein [Prevotella denticola F0289] gi|326946163|gb|AEA22048.1| outer membrane efflux protein [Prevotella denticola F0289] Length = 501 Score = 35.9 bits (81), Expect = 2.3, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 69/180 (38%), Gaps = 9/180 (5%) Query: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 +A L +DV+ + ++ + ++ + L E++ I LS EG A L Sbjct: 242 VATRADGLKVDVAVNTADLQIAH-IQSGVSLTKMALCELCGL-ELDGDIRLS-DEGDADL 298 Query: 61 ---PDDYIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYAC 117 P E+R + + + + L +N + ++ V + V+L Sbjct: 299 TPTPSAQFEVRDTTGSDSRETADVRPELRLLQNAIDMSRQNTKLVRSLYLPHVLLT---G 355 Query: 118 IPPLTENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKATTASALFQESIKKLQQNDIRS 177 L+ N N R DL+ G+ ++ +W K ++ SI +L+ +D R Sbjct: 356 GYSLSNPNLFNGFQKRFTDLWNIGITVQVPIWNWGEGKYKVRASKTATSIAQLEMDDARK 415 >gi|159185178|ref|NP_355326.2| UDP-hexose transferase [Agrobacterium tumefaciens str. C58] gi|159140446|gb|AAK88111.2| UDP-hexose transferase [Agrobacterium tumefaciens str. C58] Length = 314 Score = 35.5 bits (80), Expect = 3.3, Method: Composition-based stats. Identities = 23/113 (20%), Positives = 44/113 (38%), Gaps = 4/113 (3%) Query: 9 LIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDYIEIR 68 L +V G ++ +HR V +NR L R+ E I + + E+R Sbjct: 200 LAAAGRFVYLPG-CHVLDYRVHRTSVMMNRLLTFRDCEPAIEAVFSDPAIVAAISPQELR 258 Query: 69 FLEDNKGNKLDHLPLSISLQKNKG---YIITADSICVLPISSEDVVLYYYACI 118 L N ++ ++ + + + + AD+I P S V++ A + Sbjct: 259 RLRHRSENHMNAYTVAQAFRARRYREAFSALADTILNRPRQSLRAVVFSAAAL 311 >gi|326477880|gb|EGE01890.1| cleavage and polyadenylylation specificity factor [Trichophyton equinum CBS 127.97] Length = 1024 Score = 35.5 bits (80), Expect = 3.4, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 30/72 (41%), Gaps = 1/72 (1%) Query: 56 GSAPLPDDYIEIRFLEDNKGNKLDHLPLSISL-QKNKGYIITADSICVLPISSEDVVLYY 114 SAPL ++ + + + + + S Q GY T +LP + + + Y Sbjct: 105 ASAPLAATFLPSTSVTASDPSSGLTIQSATSPSQGPSGYETTGSGRILLPPPTNEDIARY 164 Query: 115 YACIPPLTENNP 126 ++ I PL + P Sbjct: 165 FSLIHPLKYSQP 176 >gi|85713742|ref|ZP_01044732.1| hypothetical protein NB311A_04359 [Nitrobacter sp. Nb-311A] gi|85699646|gb|EAQ37513.1| hypothetical protein NB311A_04359 [Nitrobacter sp. Nb-311A] Length = 334 Score = 35.1 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 23/65 (35%), Gaps = 1/65 (1%) Query: 18 KGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDYIEIRFLEDNKGNK 77 + ++ + R + LRL M R + S PLP D + N G Sbjct: 220 RSTLSHHTPALIARYQAACRAILRLGAMRRHVR-SDSGAGVPLPCDPMVQEVATGNAGLT 278 Query: 78 LDHLP 82 +D P Sbjct: 279 IDPKP 283 >gi|326473038|gb|EGD97047.1| cleavage and polyadenylylation specificity factor [Trichophyton tonsurans CBS 112818] Length = 1024 Score = 35.1 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 30/72 (41%), Gaps = 1/72 (1%) Query: 56 GSAPLPDDYIEIRFLEDNKGNKLDHLPLSISL-QKNKGYIITADSICVLPISSEDVVLYY 114 SAPL ++ + + + + + S Q GY T +LP + + + Y Sbjct: 105 ASAPLAATFLPSTSVTASDPSSGLTIQSATSPSQGPSGYETTGSGRILLPPPTNEDIARY 164 Query: 115 YACIPPLTENNP 126 ++ I PL + P Sbjct: 165 FSLIHPLKYSQP 176 >gi|254463059|ref|ZP_05076475.1| N-terminal double-transmembrane domain protein [Rhodobacterales bacterium HTCC2083] gi|206679648|gb|EDZ44135.1| N-terminal double-transmembrane domain protein [Rhodobacteraceae bacterium HTCC2083] Length = 911 Score = 35.1 bits (79), Expect = 4.2, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 7/128 (5%) Query: 10 IDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDDYIEIRF 69 + + + T+ + ++ + LR R I G+ L DD + R Sbjct: 251 SGAARILTRADITFEEGAKIAVQDMSLPAELRARITHFAISGVRSAGAITLSDDSLRRRE 310 Query: 70 LEDNKGNKLDH-----LPLSISLQ--KNKGYIITADSICVLPISSEDVVLYYYACIPPLT 122 + G + PL Q + ++ + + +LP S + ++L A + P Sbjct: 311 VALIAGREDREGLELLSPLHYLRQALQPNAELLEGNLVDILPASPDVMILADVATLAPAE 370 Query: 123 ENNPVNWL 130 E W+ Sbjct: 371 EAELTEWV 378 >gi|134096796|ref|YP_001102457.1| integral membrane indolylacetylinositol arabinosyltransferase EmbC (arabinosylindolylacetylinositol synthase) [Saccharopolyspora erythraea NRRL 2338] gi|133909419|emb|CAL99531.1| integral membrane indolylacetylinositol arabinosyltransferase EmbC (arabinosylindolylacetylinositol synthase) [Saccharopolyspora erythraea NRRL 2338] Length = 1043 Score = 35.1 bits (79), Expect = 4.2, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 47/134 (35%), Gaps = 16/134 (11%) Query: 7 SLLIDVSHYVEKG------GFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 LL + + G T + F+ ++ R LR+ +M ++ A Sbjct: 431 GLLGAAALWAGLSMAVNPMGITAIAP-FIVLA-GRLRRMLRVPDMVLLAGITSTGLVAMF 488 Query: 61 PDDYIE-IRFLEDNKGNKLDHLPLSISLQKNK---GYIITADSICVLPISSEDVVLYYYA 116 D + R G ++P + +Q+ + G+ + D P+ VVL + A Sbjct: 489 ADQSLAGAREATRMHGFYGPNVPWYLEIQRYQFLLGFDLQGDVARRAPVLLTAVVLLFTA 548 Query: 117 CIPPLTENNPVNWL 130 L + WL Sbjct: 549 ----LLLSRGARWL 558 >gi|291005160|ref|ZP_06563133.1| integral membrane indolylacetylinositol arabinosyltransferase EmbC (arabinosylindolylacetylinositol synthase) [Saccharopolyspora erythraea NRRL 2338] Length = 1080 Score = 34.7 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 47/134 (35%), Gaps = 16/134 (11%) Query: 7 SLLIDVSHYVEKG------GFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPL 60 LL + + G T + F+ ++ R LR+ +M ++ A Sbjct: 468 GLLGAAALWAGLSMAVNPMGITAIAP-FIVLA-GRLRRMLRVPDMVLLAGITSTGLVAMF 525 Query: 61 PDDYIE-IRFLEDNKGNKLDHLPLSISLQKNK---GYIITADSICVLPISSEDVVLYYYA 116 D + R G ++P + +Q+ + G+ + D P+ VVL + A Sbjct: 526 ADQSLAGAREATRMHGFYGPNVPWYLEIQRYQFLLGFDLQGDVARRAPVLLTAVVLLFTA 585 Query: 117 CIPPLTENNPVNWL 130 L + WL Sbjct: 586 ----LLLSRGARWL 595 >gi|113970248|ref|YP_734041.1| 2,4-dienoyl-CoA reductase [Shewanella sp. MR-4] gi|113884932|gb|ABI38984.1| 2,4-dienoyl-CoA reductase [Shewanella sp. MR-4] Length = 672 Score = 34.7 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 44/119 (36%), Gaps = 14/119 (11%) Query: 17 EKGGFTYLFKNFL-----HRIEVKINRALRLREMERKIVLSLVEGSAPLPDDYIEIRFLE 71 K F + FL H+I++++N L + +E + +V S +P E+ Sbjct: 421 GKEEFNETIRYFLNQIKLHKIDLRLNTRLDAKVLETETFDEIVIASGVVPR---ELSLPG 477 Query: 72 DNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPPLTENNPVNWL 130 + +D+ Q G + + ++ + +Y C + NP WL Sbjct: 478 FDSPKVVDY------QQVLTGQATIGNKVALIGAGGIGFDMAHYLCESESSTLNPEKWL 530 >gi|117920410|ref|YP_869602.1| 2,4-dienoyl-CoA reductase [Shewanella sp. ANA-3] gi|117612742|gb|ABK48196.1| 2,4-dienoyl-CoA reductase [Shewanella sp. ANA-3] Length = 672 Score = 34.7 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 44/119 (36%), Gaps = 14/119 (11%) Query: 17 EKGGFTYLFKNFL-----HRIEVKINRALRLREMERKIVLSLVEGSAPLPDDYIEIRFLE 71 K F + FL H+I++++N L + +E + +V S +P E+ Sbjct: 421 GKEEFNETIRYFLNQIKLHKIDLRLNTRLDAKVLETETFDEIVIASGVVPR---ELSLPG 477 Query: 72 DNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPPLTENNPVNWL 130 + +D+ Q G + + ++ + +Y C + NP WL Sbjct: 478 FDSPKVVDY------QQVLTGQATIGNKVALIGAGGIGFDMAHYLCESESSTLNPEKWL 530 >gi|302661813|ref|XP_003022569.1| hypothetical protein TRV_03308 [Trichophyton verrucosum HKI 0517] gi|291186522|gb|EFE41951.1| hypothetical protein TRV_03308 [Trichophyton verrucosum HKI 0517] Length = 1024 Score = 34.3 bits (77), Expect = 6.0, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 30/72 (41%), Gaps = 1/72 (1%) Query: 56 GSAPLPDDYIEIRFLEDNKGNK-LDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYY 114 SAPL ++ + + + L + S Q GY T +LP + + + Y Sbjct: 105 ASAPLAATFLPSTSVTASDPSSGLTIQSATSSAQGPSGYENTGSGRILLPPPTNEDIARY 164 Query: 115 YACIPPLTENNP 126 ++ I PL + P Sbjct: 165 FSLIHPLKYSQP 176 >gi|75762342|ref|ZP_00742220.1| Hypothetical protein RBTH_00565 [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|74490167|gb|EAO53505.1| Hypothetical protein RBTH_00565 [Bacillus thuringiensis serovar israelensis ATCC 35646] Length = 191 Score = 34.3 bits (77), Expect = 6.7, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 36 INRALR----LREMERKIVLSLVEGSAPLPDDYIEIRFLEDNKGNKLDHLPLSISLQKNK 91 +NRAL + +E+K L+ LP+D ++ + +P+ Q+ Sbjct: 27 LNRALDDITPIARVEKKATLTY---PYTLPNDVQDMERVMQMN-KVFTRIPVGEQNQQ-- 80 Query: 92 GYIITADSICVLPISSEDVVLYYYACIPPL 121 GY + + + + S + + +YYY + L Sbjct: 81 GYWVWGNELMLQGGSQQPIEVYYYKKLSHL 110 >gi|148725674|emb|CAM56722.2| novel NACHT domain containing protein [Danio rerio] Length = 687 Score = 34.3 bits (77), Expect = 7.1, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 42/136 (30%), Gaps = 2/136 (1%) Query: 4 DFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGSAPLPDD 63 ++ S Y FT ++ +E INR L+ E+E + E S D Sbjct: 6 EYESADSGDKTYRRHKSFTDNLQSIFQNLETVINRFLKN-ELENFRKILQEENSQEFVKD 64 Query: 64 YIEIRFLEDNKGNKLDHLPLSISLQKNKGYIITADSICVLPISSEDVVLYYYACIPPLTE 123 + E R + L L Q + + + Y + + + Sbjct: 65 FNENRSIITEAALDLTLFFLREMKQDQAADTLQG-ELFFTNQLKCSLKKKYQSVFEGIAQ 123 Query: 124 NNPVNWLLHRAPDLYL 139 L + DLY+ Sbjct: 124 QGDSTLLNNIYTDLYI 139 >gi|192291441|ref|YP_001992046.1| hypothetical protein Rpal_3065 [Rhodopseudomonas palustris TIE-1] gi|192285190|gb|ACF01571.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 212 Score = 33.9 bits (76), Expect = 9.3, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 54/200 (27%), Gaps = 23/200 (11%) Query: 1 MAKDFASLLIDVSHYVEKGGFTYLFKNFLHRIEVKINRALRLREMERKIVLSLVEGS--- 57 MAK ++L ++ + + ++ + + R V G Sbjct: 1 MAKTLQTMLDRIADDLTRSDLASQAQSAILDAVDHYAHDRFWFNVTRSKTFPTVAGQQAY 60 Query: 58 -----APLPD--DYIEIRFLEDNKGNKLDHLPLS----------ISLQKNKGYIITADSI 100 A +PD + + + G LD + + + Y + Sbjct: 61 GSAALAEIPDAIQFDGLFLKDGASGFFLDWVDADEAEWLISGVNTAPGRPTNYTFADAQL 120 Query: 101 CVLPISSEDVVL--YYYACIPPLT-ENNPVNWLLHRAPDLYLYGLVEEIALWEQKIDKAT 157 + P+ + + +P L + WL + + + A + + A Sbjct: 121 LLWPVPIAAFTIRPLMHYRLPALALPTDSSPWLNEAEQLIRAHAKMLLYANALEDDEGAA 180 Query: 158 TASALFQESIKKLQQNDIRS 177 A KL R Sbjct: 181 RMQAQIPAHRAKLDAETSRR 200 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.310 0.128 0.334 Lambda K H 0.267 0.0391 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,914,617,625 Number of Sequences: 14124377 Number of extensions: 104599541 Number of successful extensions: 272699 Number of sequences better than 10.0: 51 Number of HSP's better than 10.0 without gapping: 42 Number of HSP's successfully gapped in prelim test: 38 Number of HSP's that attempted gapping in prelim test: 272601 Number of HSP's gapped (non-prelim): 82 length of query: 178 length of database: 4,842,793,630 effective HSP length: 130 effective length of query: 48 effective length of database: 3,006,624,620 effective search space: 144317981760 effective search space used: 144317981760 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 76 (34.0 bits)