BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781130|ref|YP_003065543.1| hypothetical protein CLIBASIA_05160 [Candidatus Liberibacter asiaticus str. psy62] (378 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781130|ref|YP_003065543.1| hypothetical protein CLIBASIA_05160 [Candidatus Liberibacter asiaticus str. psy62] gi|254040807|gb|ACT57603.1| hypothetical protein CLIBASIA_05160 [Candidatus Liberibacter asiaticus str. psy62] Length = 378 Score = 788 bits (2035), Expect = 0.0, Method: Compositional matrix adjust. Identities = 378/378 (100%), Positives = 378/378 (100%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN Sbjct: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 Query: 61 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT Sbjct: 61 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 Query: 121 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGA 180 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGA Sbjct: 121 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGA 180 Query: 181 QFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRG 240 QFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRG Sbjct: 181 QFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRG 240 Query: 241 PGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR 300 PGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR Sbjct: 241 PGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR 300 Query: 301 YLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 YLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE Sbjct: 301 YLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 Query: 361 HQDVQDMSNTPLNSQKFD 378 HQDVQDMSNTPLNSQKFD Sbjct: 361 HQDVQDMSNTPLNSQKFD 378 >gi|315122525|ref|YP_004063014.1| hypothetical protein CKC_03885 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495927|gb|ADR52526.1| hypothetical protein CKC_03885 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 360 Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust. Identities = 258/360 (71%), Positives = 304/360 (84%), Gaps = 1/360 (0%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MPRI+EVYI+P+GS+AYPNSNISSSAYNNLL+DLALDNN PRPI AGGT A NA QAR+N Sbjct: 1 MPRIKEVYIIPSGSKAYPNSNISSSAYNNLLEDLALDNNQPRPICAGGTGAANASQARIN 60 Query: 61 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 LG DDA+NLTKGKIIA LLPFYPIQQGGG GQLDNKIYLGWNG LLLQVD VWT Sbjct: 61 LGMDDANNLTKGKIIATLLPFYPIQQGGGKGQLDNKIYLGWNGKNLLLQVDQQLQGVVWT 120 Query: 121 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGA 180 SQLAP ALQ+LV HA K NHIVYT DSN+YA+TPLSSF+R+LF+C N E LH+AIF+NGA Sbjct: 121 SQLAPLALQSLVSHAEKANHIVYTADSNKYASTPLSSFVRELFACKNNEELHKAIFKNGA 180 Query: 181 QFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALEST-KNCLYKTTVLNMGR 239 +FYNN+ +IA F++DGDI C+ R KT+WQGI+ AQ+TAN+A++ K + KTTVLNM R Sbjct: 181 KFYNNTNKIAEFMNDGDIFCHPRNKTMWQGIDSAQYTANLAIDLIPKQYIKKTTVLNMER 240 Query: 240 GPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSEL 299 GYIHFDTD GAVGC+YF+SDERLK++HG+SSAS+ ++IE+ KFIDF+YLP SGMDS L Sbjct: 241 SQGYIHFDTDMGAVGCNYFISDERLKEIHGKSSASSLDLIEKFKFIDFSYLPSSGMDSSL 300 Query: 300 RYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNE 359 RY IGFS NNL+EIN++ VD IG YL+PNPSVI+P+L KAIQEL EVKELR I +Q + Sbjct: 301 RYPIGFSANNLQEINDILVDKIGDYLSPNPSVILPYLCKAIQELSAEVKELRRRIHQQKD 360 >gi|150397017|ref|YP_001327484.1| hypothetical protein Smed_1814 [Sinorhizobium medicae WSM419] gi|150028532|gb|ABR60649.1| hypothetical protein Smed_1814 [Sinorhizobium medicae WSM419] Length = 499 Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 31/62 (50%), Positives = 39/62 (62%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 +PR VY PAG++ N+ I S YN L+ DL D N+ RPI+AGGT A +A AR N Sbjct: 5 LPRTGGVYSPPAGTKGVSNTTIQSVPYNALVDDLTADANAARPITAGGTGATSASVARTN 64 Query: 61 LG 62 LG Sbjct: 65 LG 66 Score = 36.2 bits (82), Expect = 9.7, Method: Compositional matrix adjust. Identities = 17/31 (54%), Positives = 22/31 (70%) Query: 44 ISAGGTAADNAVQARVNLGTDDASNLTKGKI 74 +S GGT A +A AR NLG ++ASNLT G + Sbjct: 237 VSEGGTGATDAATARSNLGANNASNLTTGTL 267 >gi|227821710|ref|YP_002825680.1| hypothetical protein NGR_c11420 [Sinorhizobium fredii NGR234] gi|227340709|gb|ACP24927.1| hypothetical protein NGR_c11420 [Sinorhizobium fredii NGR234] Length = 506 Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 46/163 (28%), Positives = 69/163 (42%), Gaps = 52/163 (31%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MPR +Y PAG++ P + I S+ YN+ + DL D N+ RPI+AGGT + +A AR Sbjct: 1 MPRTGGIYSAPAGTKGTPGNTIQSAPYNSFIDDLVDDLNAARPITAGGTGSTSASAARTA 60 Query: 61 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 LG +N+ + + Sbjct: 61 LGLAIGTNV-----------------------------------------------QAYD 73 Query: 121 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLF 163 + LA ++ L A+K ++YTT S+ YA T L+SF R L Sbjct: 74 AGLA--SIAGLTTAANK---MIYTTASDTYAVTDLTSFARTLL 111 >gi|302873697|ref|YP_003842330.1| phage minor structural protein [Clostridium cellulovorans 743B] gi|307686578|ref|ZP_07629024.1| phage minor structural protein [Clostridium cellulovorans 743B] gi|302576554|gb|ADL50566.1| phage minor structural protein [Clostridium cellulovorans 743B] Length = 1163 Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 44/127 (34%), Positives = 66/127 (51%), Gaps = 21/127 (16%) Query: 243 YIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYL 302 YI T GA G +Y+ SD+R K E+ +A + I ++K DF+Y ++G+ + Sbjct: 1040 YIEVSTIFGAKGLTYWDSDKRYKMNIEETDKTALDKIMKIKHYDFDY--KAGLK---HFD 1094 Query: 303 IGFSENNLKEINEVFVDT---------IGGYLAPNPSVIIPHLAKAIQELLQEVKELRDM 353 +G+ L EIN +V T + + PN + IIP+L+KAIQEL Q + Sbjct: 1095 VGYISQQLMEINPQWVITVPQEAANGEVSDFYIPNQTTIIPYLSKAIQELKQ-------L 1147 Query: 354 IDKQNEE 360 ID QN E Sbjct: 1148 IDNQNRE 1154 >gi|319783524|ref|YP_004143000.1| hypothetical protein Mesci_3833 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169412|gb|ADV12950.1| hypothetical protein Mesci_3833 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 403 Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 32/73 (43%), Positives = 40/73 (54%), Gaps = 3/73 (4%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MPR VY P G+ PN+ I S+ YN L+ DL D N+ RP++AGG+ A AV N Sbjct: 1 MPRAGGVYSAPPGTAGTPNTTIESAKYNALVADLVADANAARPLTAGGSGATTAVGGSDN 60 Query: 61 L---GTDDASNLT 70 GT AS T Sbjct: 61 FNAAGTSLASAAT 73 >gi|227822432|ref|YP_002826404.1| hypothetical protein NGR_c18870 [Sinorhizobium fredii NGR234] gi|227341433|gb|ACP25651.1| hypothetical protein NGR_c18870 [Sinorhizobium fredii NGR234] Length = 403 Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 32/67 (47%), Positives = 41/67 (61%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 +PR +Y PAG++ PN+ I S YN L+ DL D N+ RPI+AGGT A +A AR Sbjct: 5 LPRTGGIYSPPAGTKGVPNTTIQSVPYNALIDDLTADANAARPITAGGTGATSASAARTA 64 Query: 61 LGTDDAS 67 LG AS Sbjct: 65 LGAQGAS 71 >gi|330970689|gb|EGH70755.1| tail fiber protein H [Pseudomonas syringae pv. aceris str. M302273PT] Length = 780 Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 31/98 (31%), Positives = 52/98 (53%), Gaps = 8/98 (8%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 + + P+QQGGG GQ NK+Y+GW+ L L VD + M +WT Q N D A+K Sbjct: 549 IGYTPLQQGGGAGQKTNKVYIGWSDVGLKLTVDATDMGRIWTEQSF-----NPNDKANKS 603 Query: 139 NHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIF 176 + + ++ Y T ++S + + + G+++ A F Sbjct: 604 SSVAGYGITDCYTVTQVNSLLNQRIA---GDSVQTAGF 638 Score = 44.7 bits (104), Expect = 0.023, Method: Compositional matrix adjust. Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 2/48 (4%) Query: 77 NLLPFYPIQQGGGIGQLDNKIYLGW--NGTQLLLQVDLSSMREVWTSQ 122 N L F P+QQGGG Q N++ LGW NG + QVD S + +W Q Sbjct: 661 NRLGFTPVQQGGGANQASNQLRLGWATNGAGVRAQVDASDLGLLWGEQ 708 Score = 40.4 bits (93), Expect = 0.42, Method: Compositional matrix adjust. Identities = 17/37 (45%), Positives = 23/37 (62%) Query: 84 IQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 ++QGGG Q N+I +GW G L + VD S M +WT Sbjct: 440 VKQGGGPDQKANRINIGWTGAMLKVSVDGSDMGRIWT 476 >gi|301386028|ref|ZP_07234446.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato Max13] gi|302060587|ref|ZP_07252128.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato K40] gi|302129790|ref|ZP_07255780.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato NCPPB 1108] Length = 780 Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 5/81 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 + + P+QQGGG GQ NK+Y+GW+ L L VD + M +WT Q N + A+K Sbjct: 549 IGYTPLQQGGGAGQKTNKVYIGWSDVGLKLTVDATDMGRIWTEQSF-----NPNEKANKS 603 Query: 139 NHIVYTTDSNQYAATPLSSFM 159 N I ++ Y T +++ + Sbjct: 604 NSIAGYGITDCYTVTQVNALV 624 Score = 43.5 bits (101), Expect = 0.049, Method: Compositional matrix adjust. Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 2/48 (4%) Query: 77 NLLPFYPIQQGGGIGQLDNKIYLGW--NGTQLLLQVDLSSMREVWTSQ 122 N L F P+QQGGG Q N++ LGW NG + QVD + + +W Q Sbjct: 661 NRLGFVPVQQGGGANQATNQLRLGWATNGAGVRAQVDATDLGLLWGEQ 708 Score = 40.4 bits (93), Expect = 0.41, Method: Compositional matrix adjust. Identities = 16/37 (43%), Positives = 24/37 (64%) Query: 84 IQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 ++QGGG G L N+I +GW G +L + VD + +WT Sbjct: 440 VKQGGGPGMLSNRINIGWTGDRLKVSVDGTDGGRIWT 476 >gi|308187194|ref|YP_003931325.1| hypothetical protein Pvag_1688 [Pantoea vagans C9-1] gi|308057704|gb|ADO09876.1| hypothetical protein Pvag_1688 [Pantoea vagans C9-1] Length = 468 Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 20/45 (44%), Positives = 31/45 (68%) Query: 81 FYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAP 125 F +QQGGG G L NK+Y+GWNGT+++ QVD + + +++ P Sbjct: 126 FASVQQGGGDGMLANKVYMGWNGTKMIAQVDATRLGDLYYGNNPP 170 >gi|299068824|emb|CBJ40065.1| putative side tail fiber protein (Fragment) [Ralstonia solanacearum CMR15] Length = 359 Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 21/37 (56%), Positives = 28/37 (75%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSM 115 L F P+QQGGG GQ +NKI++GW+G +L QVD S + Sbjct: 57 LGFAPVQQGGGAGQFNNKIFIGWSGGKLKGQVDQSDL 93 >gi|330938404|gb|EGH42030.1| tail fiber protein H [Pseudomonas syringae pv. pisi str. 1704B] Length = 624 Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 20/41 (48%), Positives = 27/41 (65%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W Sbjct: 265 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDATDLGNIW 305 Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 25/41 (60%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L + P+QQG G GQ +N + +GW+ L L VD + + VW Sbjct: 385 LGYTPVQQGTGTGQFNNVVKIGWSDNGLKLTVDATDLGRVW 425 >gi|320332017|gb|EFW87953.1| tail fiber protein H, putative [Pseudomonas syringae pv. glycinea str. race 4] Length = 625 Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 20/41 (48%), Positives = 27/41 (65%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W Sbjct: 265 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDATDLGNIW 305 Score = 40.8 bits (94), Expect = 0.37, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 26/41 (63%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L + P+QQG G+GQL+N + +GW+ L VD + M +W Sbjct: 385 LLYAPVQQGTGLGQLNNVVKIGWSDNGLKATVDETDMGNLW 425 Score = 40.0 bits (92), Expect = 0.68, Method: Compositional matrix adjust. Identities = 18/33 (54%), Positives = 23/33 (69%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVD 111 L F P+QQG G+GQL N I +G++G Q L VD Sbjct: 499 LGFTPVQQGTGVGQLSNLIRIGYDGAQPRLTVD 531 >gi|330882145|gb|EGH16294.1| tail fiber protein H [Pseudomonas syringae pv. glycinea str. race 4] Length = 483 Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 20/41 (48%), Positives = 27/41 (65%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W Sbjct: 123 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDATDLGNIW 163 Score = 40.4 bits (93), Expect = 0.42, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 26/41 (63%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L + P+QQG G+GQL+N + +GW+ L VD + M +W Sbjct: 243 LLYAPVQQGTGLGQLNNVVKIGWSDNGLKATVDETDMGNLW 283 Score = 39.7 bits (91), Expect = 0.75, Method: Compositional matrix adjust. Identities = 18/33 (54%), Positives = 23/33 (69%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVD 111 L F P+QQG G+GQL N I +G++G Q L VD Sbjct: 357 LGFTPVQQGTGVGQLSNLIRIGYDGAQPRLTVD 389 >gi|28871149|ref|NP_793768.1| tail fiber protein H [Pseudomonas syringae pv. tomato str. DC3000] gi|28854399|gb|AAO57463.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato str. DC3000] gi|331017133|gb|EGH97189.1| tail fiber protein H, putative [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 808 Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust. Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 5/81 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 + + P+QQGGG GQ NK+++GW+ L L VD + +WT Q N D A++ Sbjct: 549 IGYTPLQQGGGAGQKTNKVFIGWSDVGLKLTVDTIDIGRIWTEQSF-----NPNDKANRS 603 Query: 139 NHIVYTTDSNQYAATPLSSFM 159 N I ++ Y ++S + Sbjct: 604 NSIAGYGITDCYTVNQVNSLL 624 Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust. Identities = 20/48 (41%), Positives = 27/48 (56%), Gaps = 2/48 (4%) Query: 77 NLLPFYPIQQGGGIGQLDNKIYLGW--NGTQLLLQVDLSSMREVWTSQ 122 N L F P+QQGGG Q N++ LGW N + QVD + + +W Q Sbjct: 689 NRLGFTPVQQGGGANQAGNQLRLGWATNAAGIRAQVDATDLGLLWGEQ 736 Score = 36.6 bits (83), Expect = 6.3, Method: Compositional matrix adjust. Identities = 14/37 (37%), Positives = 23/37 (62%) Query: 84 IQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 ++QGG G L N+I +GW G ++ + VD + +WT Sbjct: 440 VKQGGAPGMLGNRINIGWTGDRVKVSVDGTDGGRIWT 476 >gi|289623575|ref|ZP_06456529.1| tail fiber protein H, putative [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650819|ref|ZP_06482162.1| tail fiber protein H, putative [Pseudomonas syringae pv. aesculi str. 2250] gi|330868531|gb|EGH03240.1| tail fiber protein H [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 619 Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust. Identities = 21/43 (48%), Positives = 28/43 (65%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W S Sbjct: 265 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDNTDLGNLWYS 307 >gi|330898935|gb|EGH30354.1| tail fiber protein H [Pseudomonas syringae pv. japonica str. M301072PT] Length = 619 Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust. Identities = 21/43 (48%), Positives = 28/43 (65%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W S Sbjct: 265 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDNTDLGNLWYS 307 Score = 43.9 bits (102), Expect = 0.040, Method: Compositional matrix adjust. Identities = 19/43 (44%), Positives = 27/43 (62%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L + PIQQG GIGQL N + +GW+ + L VD + + +W S Sbjct: 379 LSYTPIQQGTGIGQLGNPVKIGWSSSGLKATVDETDLGTLWAS 421 >gi|330970637|gb|EGH70703.1| tail fiber protein H [Pseudomonas syringae pv. aceris str. M302273PT] Length = 689 Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust. Identities = 20/43 (46%), Positives = 29/43 (67%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N++ LGW+G+ L VD + + +W S Sbjct: 328 LGFVPVQQGTGAGQLNNQVRLGWSGSALKAAVDNTDLGNLWYS 370 Score = 37.0 bits (84), Expect = 5.0, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 6/75 (8%) Query: 48 GTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGT-QL 106 G AA++ Q + +D+ + + + L F P+QQG G GQL+N I +G+ Q+ Sbjct: 441 GLAANDPTQPYMRRASDNTTYFLQTR-----LGFVPVQQGTGAGQLNNLIKIGYTSAGQV 495 Query: 107 LLQVDLSSMREVWTS 121 L VD + +W S Sbjct: 496 KLAVDNTDFGNLWYS 510 >gi|301386568|ref|ZP_07234986.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato Max13] Length = 517 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/43 (46%), Positives = 29/43 (67%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N++ LGW+G+ L VD + + +W S Sbjct: 156 LGFVPVQQGTGAGQLNNQVRLGWSGSGLKAAVDNTDLGNLWYS 198 Score = 37.0 bits (84), Expect = 5.7, Method: Compositional matrix adjust. Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 1/44 (2%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGT-QLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N I +G+ Q+ L VD + +W S Sbjct: 295 LGFVPVQQGTGAGQLNNLIKIGYTSAGQVKLAVDNTDFGNLWYS 338 >gi|302063119|ref|ZP_07254660.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato K40] Length = 456 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/43 (46%), Positives = 29/43 (67%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N++ LGW+G+ L VD + + +W S Sbjct: 95 LGFVPVQQGTGAGQLNNQVRLGWSGSGLKAAVDNTDLGNLWYS 137 Score = 36.6 bits (83), Expect = 5.8, Method: Compositional matrix adjust. Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 1/44 (2%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGT-QLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N I +G+ Q+ L VD + +W S Sbjct: 234 LGFVPVQQGTGAGQLNNLIKIGYTSAGQVKLAVDNTDFGNLWYS 277 >gi|213972284|ref|ZP_03400352.1| tail fiber protein H [Pseudomonas syringae pv. tomato T1] gi|213922961|gb|EEB56588.1| tail fiber protein H [Pseudomonas syringae pv. tomato T1] Length = 689 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/43 (46%), Positives = 29/43 (67%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N++ LGW+G+ L VD + + +W S Sbjct: 328 LGFVPVQQGTGAGQLNNQVRLGWSGSGLKAAVDNTDLGNLWYS 370 Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust. Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 1/44 (2%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGT-QLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N I +G+ Q+ L VD + +W S Sbjct: 467 LGFVPVQQGTGAGQLNNLIKIGYTSAGQVKLAVDNTDFGNLWYS 510 >gi|28870562|ref|NP_793181.1| tail fiber protein H [Pseudomonas syringae pv. tomato str. DC3000] gi|28853810|gb|AAO56876.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato str. DC3000] Length = 689 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/43 (46%), Positives = 29/43 (67%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N++ LGW+G+ L VD + + +W S Sbjct: 328 LGFVPVQQGTGAGQLNNQVRLGWSGSGLKAAVDNTDLGNLWYS 370 Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust. Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 1/44 (2%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGT-QLLLQVDLSSMREVWTS 121 L F P+QQG G GQL+N I +G+ Q+ L VD + +W S Sbjct: 467 LGFVPVQQGTGAGQLNNLIKIGYTSAGQVKLAVDNTDFGNLWYS 510 >gi|320325918|gb|EFW81977.1| tail fiber protein H, putative [Pseudomonas syringae pv. glycinea str. B076] Length = 444 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 19/41 (46%), Positives = 26/41 (63%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L F P+QQGGG QLDNK+ +GW+ L VD + + +W Sbjct: 265 LGFTPVQQGGGTAQLDNKVKIGWSNNGLKAMVDATDLGNIW 305 Score = 40.8 bits (94), Expect = 0.35, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 26/41 (63%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L + P+QQG G+GQL+N + +GW+ L VD + M +W Sbjct: 385 LLYAPVQQGTGLGQLNNVVKIGWSDNGLKATVDETDMGNLW 425 >gi|237803435|ref|ZP_04591020.1| tail fiber protein H [Pseudomonas syringae pv. oryzae str. 1_6] gi|331025417|gb|EGI05473.1| tail fiber protein H [Pseudomonas syringae pv. oryzae str. 1_6] Length = 720 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/43 (46%), Positives = 28/43 (65%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F +QQG G GQLDN + +GW+GT L VD + + ++W S Sbjct: 328 LGFTAVQQGTGAGQLDNAVKIGWSGTDLKATVDKTDLGKLWYS 370 >gi|254720065|ref|ZP_05181876.1| hypothetical protein Bru83_11101 [Brucella sp. 83/13] gi|265985070|ref|ZP_06097805.1| predicted protein [Brucella sp. 83/13] gi|306839383|ref|ZP_07472197.1| Hypothetical protein BROD_2239 [Brucella sp. NF 2653] gi|264663662|gb|EEZ33923.1| predicted protein [Brucella sp. 83/13] gi|306405506|gb|EFM61771.1| Hypothetical protein BROD_2239 [Brucella sp. NF 2653] Length = 381 Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust. Identities = 28/62 (45%), Positives = 36/62 (58%), Gaps = 1/62 (1%) Query: 1 MPRIRE-VYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARV 59 MPR + VY LP +A I + +N L+D+A D N+ RPIS GGT +A QAR Sbjct: 1 MPRNGQGVYSLPPVYEAVTGETIEAQQHNVPLEDIASDLNNARPISTGGTGGQSATQARE 60 Query: 60 NL 61 NL Sbjct: 61 NL 62 >gi|330878350|gb|EGH12499.1| tail fiber protein H [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 625 Score = 45.1 bits (105), Expect = 0.021, Method: Compositional matrix adjust. Identities = 19/43 (44%), Positives = 27/43 (62%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQGGG GQL+N + +GW+ L VD + + +W S Sbjct: 265 LGFTPVQQGGGTGQLNNLVKIGWSNNGLKAMVDATDLGNLWYS 307 Score = 36.2 bits (82), Expect = 8.6, Method: Compositional matrix adjust. Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 1/54 (1%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGT-QLLLQVDLSSMREVWTSQLAPRALQNL 131 L + P++QGGG NK+ LG+NG L LQVD S ++ Q P L L Sbjct: 500 LGYTPVEQGGGPNMSANKVRLGYNGAGSLRLQVDSSDFGDLTNDQNLPAKLAGL 553 >gi|308480796|ref|XP_003102604.1| hypothetical protein CRE_03188 [Caenorhabditis remanei] gi|308261038|gb|EFP04991.1| hypothetical protein CRE_03188 [Caenorhabditis remanei] Length = 567 Score = 45.1 bits (105), Expect = 0.021, Method: Compositional matrix adjust. Identities = 37/115 (32%), Positives = 53/115 (46%), Gaps = 8/115 (6%) Query: 251 GAVGCSYFLSDERLK-KVHGESSASAREIIEQLKFIDFNYLPES----GMDSELRYLIGF 305 G V C SD RLK + G+ + A E +++L+ +D+ Y PE G+ + R G Sbjct: 86 GRVMCP---SDIRLKDNITGKEAKEALENLQKLRIVDYFYKPEVAEKWGLSEDQRKRTGV 142 Query: 306 SENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 L EI V IG YL N S + A QEL + +L ID + +E Sbjct: 143 IAQELAEIIPDAVRDIGDYLTVNESRVFYETVLATQELCRLAGDLDQKIDDKVDE 197 >gi|38640370|ref|NP_944293.1| Bcep22gp64 [Burkholderia phage Bcep22] gi|33860437|gb|AAQ54997.1| Bcep22gp64 [Burkholderia phage Bcep22] Length = 425 Score = 44.3 bits (103), Expect = 0.030, Method: Compositional matrix adjust. Identities = 36/130 (27%), Positives = 63/130 (48%), Gaps = 18/130 (13%) Query: 240 GPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFN-----YLPESG 294 G +++F D VG + +SDER KK SS A ++ L+F++F+ YLP+ Sbjct: 304 GANHLNFFVDGNLVG--FVVSDERAKKNITPSSTDALARVKALEFVEFDFIDSPYLPKKH 361 Query: 295 MDSELRYLIGFSENNLKEINEVFVDT-----IGGYLAPNPSVIIPHLAKAIQELLQEVKE 349 +D+ G + IN ++D YL N ++ +AIQ+L EV E Sbjct: 362 VDN------GVIAQQAQRINPNWIDKPPADHPDAYLGLNLQYLLMDAMRAIQQLSSEVDE 415 Query: 350 LRDMIDKQNE 359 L+ +++ ++ Sbjct: 416 LKAELERNSK 425 >gi|330963871|gb|EGH64131.1| tail fiber protein H [Pseudomonas syringae pv. actinidiae str. M302091] Length = 625 Score = 44.3 bits (103), Expect = 0.034, Method: Compositional matrix adjust. Identities = 19/43 (44%), Positives = 27/43 (62%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 L F P+QQGGG GQL+N + +GW+ L VD + + +W S Sbjct: 265 LGFTPVQQGGGSGQLNNLVKIGWSNNGLKAMVDATDLGNLWYS 307 Score = 36.2 bits (82), Expect = 8.1, Method: Compositional matrix adjust. Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 1/54 (1%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGT-QLLLQVDLSSMREVWTSQLAPRALQNL 131 L + P++QGGG NK+ LG+NG L LQVD S ++ Q P L L Sbjct: 500 LGYTPVEQGGGPNMSANKVRLGYNGAGSLRLQVDSSDFGDLTNDQNLPAKLAGL 553 >gi|169833508|ref|YP_001693486.1| hypothetical protein SPH_0062 [Streptococcus pneumoniae Hungary19A-6] gi|168996010|gb|ACA36622.1| PblB [Streptococcus pneumoniae Hungary19A-6] Length = 3038 Score = 43.1 bits (100), Expect = 0.070, Method: Compositional matrix adjust. Identities = 52/223 (23%), Positives = 91/223 (40%), Gaps = 46/223 (20%) Query: 138 PNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-FQNG--AQFYNNSQRIAAFLD 194 PN TT N Y C NG H + F N A FY N+ Sbjct: 2855 PNAWYVTTSGNMY--------------CRNGADFHGKVDFSNSSRANFYGNT-------- 2892 Query: 195 DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG 254 + R GIE+ +K+ L N G + + G+ Sbjct: 2893 -----TFSRSPVFSNGIELG----------SKDVLGDG--WNPKGGRNAVVWWNQVGSGS 2935 Query: 255 CSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLK 311 Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E + Sbjct: 2936 VKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAETIVP 2994 Query: 312 EINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2995 KIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 3037 >gi|148995279|ref|ZP_01824069.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP9-BS68] gi|147926790|gb|EDK77848.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP9-BS68] Length = 1602 Score = 42.7 bits (99), Expect = 0.092, Method: Compositional matrix adjust. Identities = 29/107 (27%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ ES E+ + +E Sbjct: 1496 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ESKKHEEIGLIAQEAE 1554 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1555 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1601 >gi|301799204|emb|CBW31717.1| pblB [Streptococcus pneumoniae OXC141] Length = 2101 Score = 42.7 bits (99), Expect = 0.096, Method: Compositional matrix adjust. Identities = 52/223 (23%), Positives = 90/223 (40%), Gaps = 46/223 (20%) Query: 138 PNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-FQNG--AQFYNNSQRIAAFLD 194 PN TT N Y C NG H + F N A FY N+ Sbjct: 1918 PNAWYVTTSGNMY--------------CRNGADFHGKVDFSNSSRANFYGNT-------- 1955 Query: 195 DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG 254 + R GIE+ +K+ L N G + + G+ Sbjct: 1956 -----TFSRSPVFSNGIELG----------SKDVLGDG--WNPKGGRNAVVWWNQVGSGS 1998 Query: 255 CSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLK 311 Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E + Sbjct: 1999 VKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAETIVP 2057 Query: 312 EINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2058 RIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2100 >gi|303259359|ref|ZP_07345336.1| PblB [Streptococcus pneumoniae SP-BS293] gi|303265734|ref|ZP_07351632.1| PblB [Streptococcus pneumoniae BS457] gi|302639293|gb|EFL69751.1| PblB [Streptococcus pneumoniae SP-BS293] gi|302644642|gb|EFL74891.1| PblB [Streptococcus pneumoniae BS457] Length = 1530 Score = 42.7 bits (99), Expect = 0.097, Method: Compositional matrix adjust. Identities = 52/223 (23%), Positives = 90/223 (40%), Gaps = 46/223 (20%) Query: 138 PNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-FQNG--AQFYNNSQRIAAFLD 194 PN TT N Y C NG H + F N A FY N+ Sbjct: 1347 PNAWYVTTSGNMY--------------CRNGADFHGKVDFSNSSRANFYGNT-------- 1384 Query: 195 DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG 254 + R GIE+ +K+ L N G + + G+ Sbjct: 1385 -----TFSRSPVFSNGIELG----------SKDVLGDG--WNPKGGRNAVVWWNQVGSGS 1427 Query: 255 CSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLK 311 Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E + Sbjct: 1428 VKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAETIVP 1486 Query: 312 EINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1487 RIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1529 >gi|303261114|ref|ZP_07347063.1| PblB [Streptococcus pneumoniae SP14-BS292] gi|303263442|ref|ZP_07349365.1| PblB [Streptococcus pneumoniae BS397] gi|303267835|ref|ZP_07353637.1| PblB [Streptococcus pneumoniae BS458] gi|302637951|gb|EFL68437.1| PblB [Streptococcus pneumoniae SP14-BS292] gi|302642531|gb|EFL72876.1| PblB [Streptococcus pneumoniae BS458] gi|302647215|gb|EFL77439.1| PblB [Streptococcus pneumoniae BS397] Length = 2105 Score = 42.7 bits (99), Expect = 0.097, Method: Compositional matrix adjust. Identities = 52/223 (23%), Positives = 90/223 (40%), Gaps = 46/223 (20%) Query: 138 PNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-FQNG--AQFYNNSQRIAAFLD 194 PN TT N Y C NG H + F N A FY N+ Sbjct: 1922 PNAWYVTTSGNMY--------------CRNGADFHGKVDFSNSSRANFYGNT-------- 1959 Query: 195 DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG 254 + R GIE+ +K+ L N G + + G+ Sbjct: 1960 -----TFSRSPVFSNGIELG----------SKDVLGDG--WNPKGGRNAVVWWNQVGSGS 2002 Query: 255 CSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLK 311 Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E + Sbjct: 2003 VKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAETIVP 2061 Query: 312 EINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2062 RIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2104 >gi|148996415|ref|ZP_01824133.1| phage infection protein [Streptococcus pneumoniae SP11-BS70] gi|147756990|gb|EDK64029.1| phage infection protein [Streptococcus pneumoniae SP11-BS70] Length = 1034 Score = 42.7 bits (99), Expect = 0.10, Method: Compositional matrix adjust. Identities = 52/223 (23%), Positives = 90/223 (40%), Gaps = 46/223 (20%) Query: 138 PNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-FQNG--AQFYNNSQRIAAFLD 194 PN TT N Y C NG H + F N A FY N+ Sbjct: 851 PNAWYVTTSGNMY--------------CRNGADFHGKVDFSNSSRANFYGNT-------- 888 Query: 195 DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG 254 + R GIE+ +K+ L N G + + G+ Sbjct: 889 -----TFSRSPVFSNGIELG----------SKDVLGDG--WNPKGGRNAVVWWNQVGSGS 931 Query: 255 CSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLK 311 Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E + Sbjct: 932 VKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAETIVP 990 Query: 312 EINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 991 RIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1033 >gi|303255532|ref|ZP_07341590.1| hypothetical protein CGSSpBS455_08580 [Streptococcus pneumoniae BS455] gi|302597510|gb|EFL64598.1| hypothetical protein CGSSpBS455_08580 [Streptococcus pneumoniae BS455] Length = 965 Score = 42.7 bits (99), Expect = 0.10, Method: Compositional matrix adjust. Identities = 52/223 (23%), Positives = 90/223 (40%), Gaps = 46/223 (20%) Query: 138 PNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-FQNG--AQFYNNSQRIAAFLD 194 PN TT N Y C NG H + F N A FY N+ Sbjct: 782 PNAWYVTTSGNMY--------------CRNGADFHGKVDFSNSSRANFYGNT-------- 819 Query: 195 DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG 254 + R GIE+ +K+ L N G + + G+ Sbjct: 820 -----TFSRSPVFSNGIELG----------SKDVLGDG--WNPKGGRNAVVWWNQVGSGS 862 Query: 255 CSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLK 311 Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E + Sbjct: 863 VKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAETIVP 921 Query: 312 EINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 922 RIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 964 >gi|332072362|gb|EGI82845.1| pblB [Streptococcus pneumoniae GA17570] Length = 2147 Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust. Identities = 29/107 (27%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ ES E+ + +E Sbjct: 2041 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ESKKHEEIGLIAQEAE 2099 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2100 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2146 >gi|183603147|ref|ZP_02712966.2| PblB [Streptococcus pneumoniae SP195] gi|183572670|gb|EDT93198.1| PblB [Streptococcus pneumoniae SP195] Length = 2135 Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust. Identities = 29/107 (27%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ ES E+ + +E Sbjct: 2029 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ESKKHEEIGLIAQEAE 2087 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2088 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2134 >gi|148986194|ref|ZP_01819146.1| phage-related protein; possible prophage LambdaBa01, minor structural protein [Streptococcus pneumoniae SP3-BS71] gi|147921808|gb|EDK72936.1| phage-related protein; possible prophage LambdaBa01, minor structural protein [Streptococcus pneumoniae SP3-BS71] Length = 1203 Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust. Identities = 52/223 (23%), Positives = 90/223 (40%), Gaps = 46/223 (20%) Query: 138 PNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-FQNG--AQFYNNSQRIAAFLD 194 PN TT N Y C NG H + F N A FY N+ Sbjct: 1020 PNAWYVTTSGNMY--------------CRNGADFHGKVDFSNSSRANFYGNT-------- 1057 Query: 195 DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG 254 + R GIE+ +K+ L N G + + G+ Sbjct: 1058 -----TFSRSPVFSNGIELG----------SKDVLGDG--WNPKGGRNAVVWWNQVGSGS 1100 Query: 255 CSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLK 311 Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E + Sbjct: 1101 VKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAETIVP 1159 Query: 312 EINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1160 RIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1202 >gi|330985191|gb|EGH83294.1| tail fiber protein H [Pseudomonas syringae pv. lachrymans str. M301315] Length = 665 Score = 42.4 bits (98), Expect = 0.14, Method: Compositional matrix adjust. Identities = 17/37 (45%), Positives = 26/37 (70%) Query: 84 IQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 I+QGGGIGQL +++ +GW+ T L L V+ + +WT Sbjct: 440 IKQGGGIGQLSSRLNIGWSSTGLRLTVEGEDLGRIWT 476 >gi|330980925|gb|EGH79028.1| tail fiber protein H [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 510 Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 26/41 (63%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L + P+QQG G+GQL+N + +GW+ L VD + M +W Sbjct: 271 LLYAPVQQGTGVGQLNNVVKIGWSDNGLKATVDATDMGTLW 311 >gi|330891239|gb|EGH23900.1| tail fiber protein H [Pseudomonas syringae pv. mori str. 301020] Length = 510 Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust. Identities = 17/41 (41%), Positives = 26/41 (63%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 L + P+QQG G+GQL+N + +GW+ L VD + M +W Sbjct: 271 LLYAPVQQGTGVGQLNNVVKIGWSDNGLKATVDATDMGTLW 311 >gi|149021849|ref|ZP_01835856.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP23-BS72] gi|147930085|gb|EDK81072.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP23-BS72] Length = 999 Score = 42.0 bits (97), Expect = 0.17, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 893 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 951 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ +I Sbjct: 952 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKII 998 >gi|149011808|ref|ZP_01833004.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP19-BS75] gi|147764239|gb|EDK71171.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP19-BS75] Length = 1352 Score = 41.6 bits (96), Expect = 0.22, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1246 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1304 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1305 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1351 >gi|321156928|emb|CBW38917.1| PblB-type protein [Streptococcus phage V22] Length = 1468 Score = 41.6 bits (96), Expect = 0.23, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1362 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1420 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1421 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1467 >gi|168494864|ref|ZP_02719007.1| PblB [Streptococcus pneumoniae CDC3059-06] gi|168495066|ref|ZP_02719209.1| PblB [Streptococcus pneumoniae CDC3059-06] gi|183575070|gb|EDT95598.1| PblB [Streptococcus pneumoniae CDC3059-06] gi|183575230|gb|EDT95758.1| PblB [Streptococcus pneumoniae CDC3059-06] Length = 1349 Score = 41.6 bits (96), Expect = 0.23, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1243 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1301 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1302 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1348 >gi|307066694|ref|YP_003875660.1| PblB, putative [Streptococcus pneumoniae AP200] gi|306408231|gb|ADM83658.1| PblB, putative [Streptococcus phage PhiSpn_200] Length = 3035 Score = 41.2 bits (95), Expect = 0.24, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 2929 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2987 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2988 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 3034 >gi|307126242|ref|YP_003878273.1| PblB [Streptococcus pneumoniae 670-6B] gi|306483304|gb|ADM90173.1| PblB [Streptococcus pneumoniae 670-6B] Length = 2699 Score = 41.2 bits (95), Expect = 0.25, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 2593 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2651 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2652 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2698 >gi|307128260|ref|YP_003880291.1| PblB [Streptococcus pneumoniae 670-6B] gi|306485322|gb|ADM92191.1| PblB [Streptococcus pneumoniae 670-6B] Length = 2426 Score = 41.2 bits (95), Expect = 0.25, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 2320 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2378 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2379 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2425 >gi|168494815|ref|ZP_02718958.1| PblB [Streptococcus pneumoniae CDC3059-06] gi|183575310|gb|EDT95838.1| PblB [Streptococcus pneumoniae CDC3059-06] Length = 2676 Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 2570 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2628 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2629 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2675 >gi|149004162|ref|ZP_01828959.1| hypothetical protein CGSSp14BS69_05677 [Streptococcus pneumoniae SP14-BS69] gi|147757824|gb|EDK64835.1| hypothetical protein CGSSp14BS69_05677 [Streptococcus pneumoniae SP14-BS69] Length = 602 Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 496 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 554 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 555 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 601 >gi|325188925|emb|CCA23454.1| Phosphatidylinositol 3kinase tor2 putative [Albugo laibachii Nc14] Length = 3153 Score = 41.2 bits (95), Expect = 0.28, Method: Compositional matrix adjust. Identities = 29/88 (32%), Positives = 48/88 (54%), Gaps = 3/88 (3%) Query: 262 ERLKKVH-GESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVDT 320 ERL V ++ AS R ++ L+ +D + P G+D LR L + + EI + + T Sbjct: 632 ERLLTVGIADTDASIR--LQVLESLDARFDPLLGLDDNLRCLFISLHDEVFEIRQRAMST 689 Query: 321 IGGYLAPNPSVIIPHLAKAIQELLQEVK 348 +G NPSVI+P L + + +LL E++ Sbjct: 690 LGRLAHHNPSVILPSLRQTLVQLLAELE 717 >gi|321157149|emb|CBW39134.1| PblB-type protein [Streptococcus phage 11865] Length = 1421 Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1315 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1373 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1374 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1420 >gi|321157252|emb|CBW39235.1| PblB-type protein [Streptococcus phage 2167] Length = 1469 Score = 41.2 bits (95), Expect = 0.31, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 59/107 (55%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1363 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1421 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + +I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1422 TIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1468 >gi|332201943|gb|EGJ16012.1| phage minor structural , N-terminal region domain protein [Streptococcus pneumoniae GA41317] Length = 1924 Score = 40.8 bits (94), Expect = 0.32, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1818 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1876 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1877 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1923 >gi|321157103|emb|CBW39089.1| PblB-type protein [Streptococcus phage 23782] Length = 1707 Score = 40.8 bits (94), Expect = 0.32, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1601 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1659 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1660 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1706 >gi|225855356|ref|YP_002736868.1| PblB [Streptococcus pneumoniae JJA] gi|225722631|gb|ACO18484.1| PblB [Streptococcus pneumoniae JJA] Length = 2108 Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 2002 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2060 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2061 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2107 >gi|332205033|gb|EGJ19096.1| pblB [Streptococcus pneumoniae GA47368] Length = 1137 Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1031 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1089 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1090 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1136 >gi|168490166|ref|ZP_02714365.1| PblB [Streptococcus pneumoniae SP195] gi|183571459|gb|EDT91987.1| PblB [Streptococcus pneumoniae SP195] Length = 3023 Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 2917 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2975 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2976 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 3022 >gi|321157197|emb|CBW39181.1| PblB-type protein [Streptococcus phage 8140] Length = 1179 Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1073 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1131 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1132 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1178 >gi|225857876|ref|YP_002739386.1| PblB [Streptococcus pneumoniae 70585] gi|225722145|gb|ACO17999.1| PblB [Streptococcus pneumoniae 70585] Length = 2970 Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 2864 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2922 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2923 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2969 >gi|148990394|ref|ZP_01821566.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP6-BS73] gi|147924349|gb|EDK75441.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP6-BS73] Length = 1053 Score = 40.8 bits (94), Expect = 0.35, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 947 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1005 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1006 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1052 >gi|225855800|ref|YP_002737311.1| hypothetical protein SPP_0076 [Streptococcus pneumoniae P1031] gi|225724622|gb|ACO20474.1| hypothetical protein SPP_0076 [Streptococcus pneumoniae P1031] Length = 140 Score = 40.8 bits (94), Expect = 0.36, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 34 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 92 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 93 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 139 >gi|172079550|ref|ZP_02708651.2| PblB [Streptococcus pneumoniae CDC1873-00] gi|172043007|gb|EDT51053.1| PblB [Streptococcus pneumoniae CDC1873-00] Length = 2654 Score = 40.8 bits (94), Expect = 0.36, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 2548 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2606 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2607 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2653 >gi|13470671|ref|NP_102240.1| hypothetical protein mll0443 [Mesorhizobium loti MAFF303099] gi|14021413|dbj|BAB48026.1| mll0443 [Mesorhizobium loti MAFF303099] Length = 349 Score = 40.8 bits (94), Expect = 0.36, Method: Compositional matrix adjust. Identities = 28/69 (40%), Positives = 34/69 (49%), Gaps = 3/69 (4%) Query: 1 MPRIRE---VYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQA 57 MP R+ V P A + I SS YN DL D N+ RPI+AGGT A+ A+ Sbjct: 1 MPMPRDGSGVMSWPPNGNAVTATPIDSSKYNARFADLLSDLNAARPITAGGTGANTAIGG 60 Query: 58 RVNLGTDDA 66 NL T A Sbjct: 61 NDNLNTAGA 69 >gi|71991056|ref|NP_496262.2| Prion-like-(Q/N-rich)-domain-bearing protein family member (pqn-47) [Caenorhabditis elegans] gi|50507473|emb|CAA88990.3| C. elegans protein F59B10.1, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|50507495|emb|CAA88602.3| C. elegans protein F59B10.1, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 931 Score = 40.4 bits (93), Expect = 0.41, Method: Compositional matrix adjust. Identities = 31/106 (29%), Positives = 50/106 (47%), Gaps = 5/106 (4%) Query: 260 SDERLKKVHGE-SSASAREIIEQLKFIDFNYLPES----GMDSELRYLIGFSENNLKEIN 314 SD RLK+ E +A A E + +L+ +D+ Y PE G+D + R+ G L+ + Sbjct: 483 SDIRLKEAITERETAEAIENLLKLRVVDYRYKPEVADIWGLDEQQRHRTGLIAQELQAVL 542 Query: 315 EVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 V IG YL + + A Q+L + +L ID++ E Sbjct: 543 PDAVRDIGDYLTIDEGRVFYETVMATQQLCRMTGDLDSKIDEKVAE 588 >gi|321156986|emb|CBW38974.1| PblB-type protein [Streptococcus phage 040922] Length = 2102 Score = 40.4 bits (93), Expect = 0.41, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1996 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 2054 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2055 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2101 >gi|321157044|emb|CBW39031.1| PblB-type protein [Streptococcus phage 34117] Length = 1633 Score = 40.4 bits (93), Expect = 0.42, Method: Compositional matrix adjust. Identities = 28/107 (26%), Positives = 58/107 (54%), Gaps = 4/107 (3%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F+++ E+ E+ + +E Sbjct: 1527 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDFI-ENKKHEEIGLIAQEAE 1585 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1586 TIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1632 >gi|268578415|ref|XP_002644190.1| Hypothetical protein CBG17173 [Caenorhabditis briggsae] gi|187025955|emb|CAP34948.1| hypothetical protein CBG_17173 [Caenorhabditis briggsae AF16] Length = 863 Score = 39.7 bits (91), Expect = 0.71, Method: Compositional matrix adjust. Identities = 36/115 (31%), Positives = 50/115 (43%), Gaps = 8/115 (6%) Query: 251 GAVGCSYFLSDERLKKVHGESSAS-AREIIEQLKFIDFNYLPES----GMDSELRYLIGF 305 G V C SD RLK E A A E +++L+ +D+ Y PE G+ + R G Sbjct: 410 GRVMCP---SDIRLKDNITEKEAKEALENLQKLRIVDYFYKPEVADKWGLSEDQRKRTGV 466 Query: 306 SENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 L E+ V +G YL N S + A QEL + +L ID + E Sbjct: 467 IAQELAEVIPDAVKDLGDYLTVNESRVFYETVLATQELCRLAGDLDQKIDDKVAE 521 >gi|291336087|gb|ADD95673.1| hypothetical protein [uncultured phage MedDCM-OCT-S11-C561] Length = 604 Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust. Identities = 45/184 (24%), Positives = 72/184 (39%), Gaps = 37/184 (20%) Query: 43 PISAGGTAADNAVQARVNLGT-------------DDASNLTKGKIIANLLPFYPIQQGGG 89 +S GGT A +A AR +LG DD + LT+ AN LPF+ Sbjct: 383 AVSDGGTGATSASAARTSLGVAIGSDVQAHNAILDDLAGLTQS---ANKLPFFSS----- 434 Query: 90 IGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAP------RALQNLVDHAHKPNHIVY 143 G + L LL + S+ R + + L ++ N ++Y Sbjct: 435 -GNAADTTDLTAFARTLLDDANASAARTTLGTAIGSDVQAYDAGLASIAGLTTAANKLIY 493 Query: 144 TTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-------FQNGAQFYNNSQRIAAFLDDG 196 TT S+ YA L++F R + + T+ + Q AQ N + +AA + DG Sbjct: 494 TTGSDTYAVADLTAFGRSILDDADAATVRATLGLVIGTDVQ--AQDSNLAGVVAAGVTDG 551 Query: 197 DIMC 200 + + Sbjct: 552 NFLV 555 >gi|291335641|gb|ADD95248.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C846] Length = 499 Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust. Identities = 45/184 (24%), Positives = 72/184 (39%), Gaps = 37/184 (20%) Query: 43 PISAGGTAADNAVQARVNLGT-------------DDASNLTKGKIIANLLPFYPIQQGGG 89 +S GGT A +A AR +LG DD + LT+ AN LPF+ Sbjct: 278 AVSDGGTGATSASAARTSLGVAIGSDVQAHNAILDDLAGLTQS---ANKLPFFSS----- 329 Query: 90 IGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAP------RALQNLVDHAHKPNHIVY 143 G + L LL + S+ R + + L ++ N ++Y Sbjct: 330 -GNAADTTDLTAFARTLLDDANASAARTTLGTAIGSDVQAYDAGLASIAGLTTAANKLIY 388 Query: 144 TTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-------FQNGAQFYNNSQRIAAFLDDG 196 TT S+ YA L++F R + + T+ + Q AQ N + +AA + DG Sbjct: 389 TTGSDTYAVADLTAFGRSILDDADAATVRATLGLVIGTDVQ--AQDSNLAGVVAAGVTDG 446 Query: 197 DIMC 200 + + Sbjct: 447 NFLV 450 >gi|291335566|gb|ADD95176.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C507] Length = 361 Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust. Identities = 45/184 (24%), Positives = 72/184 (39%), Gaps = 37/184 (20%) Query: 43 PISAGGTAADNAVQARVNLGT-------------DDASNLTKGKIIANLLPFYPIQQGGG 89 +S GGT A +A AR +LG DD + LT+ AN LPF+ Sbjct: 140 AVSDGGTGATSASAARTSLGVAIGSDVQAHNAILDDLAGLTQS---ANKLPFFSS----- 191 Query: 90 IGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAP------RALQNLVDHAHKPNHIVY 143 G + L LL + S+ R + + L ++ N ++Y Sbjct: 192 -GNAADTTDLTAFARTLLDDANASAARTTLGTAIGSDVQAYDAGLASIAGLTTAANKLIY 250 Query: 144 TTDSNQYAATPLSSFMRKLFSCHNGETLHQAI-------FQNGAQFYNNSQRIAAFLDDG 196 TT S+ YA L++F R + + T+ + Q AQ N + +AA + DG Sbjct: 251 TTGSDTYAVADLTAFGRSILDDADAATVRATLGLVIGTDVQ--AQDSNLAGVVAAGVTDG 308 Query: 197 DIMC 200 + + Sbjct: 309 NFLV 312 >gi|307710000|ref|ZP_07646446.1| hypothetical protein SMSK564_1275 [Streptococcus mitis SK564] gi|307619258|gb|EFN98388.1| hypothetical protein SMSK564_1275 [Streptococcus mitis SK564] Length = 694 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 2/95 (2%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVD 319 SD RLK ESS S ++I +LK ++Y E + E +++ K + F + Sbjct: 601 SDRRLKSNIQESSVSGVDVINRLK--TYSYRKEFNNEVEDISCGIMAQDVQKYAPDAFRE 658 Query: 320 TIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 G N ++P+L KAIQEL Q+++++ I Sbjct: 659 GPDGVYTYNTFALVPYLIKAIQELNQKIEKMEKTI 693 >gi|218531765|ref|YP_002422581.1| hypothetical protein Mchl_3837 [Methylobacterium chloromethanicum CM4] gi|218524068|gb|ACK84653.1| hypothetical protein Mchl_3837 [Methylobacterium chloromethanicum CM4] Length = 441 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 18/41 (43%), Positives = 26/41 (63%), Gaps = 1/41 (2%) Query: 84 IQQGGGIGQ-LDNKIYLGWNGTQLLLQVDLSSMREVWTSQL 123 ++QGGG+ Q N + LGW+ T LL QVD + +W S+L Sbjct: 303 VRQGGGVRQNASNMVRLGWSNTNLLAQVDALELGALWASRL 343 >gi|15895154|ref|NP_348503.1| hypothetical protein CA_C1879 [Clostridium acetobutylicum ATCC 824] gi|15024858|gb|AAK79843.1|AE007696_5 Uncharacterized, phage-related protein [Clostridium acetobutylicum ATCC 824] gi|325509293|gb|ADZ20929.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018] Length = 1403 Score = 38.5 bits (88), Expect = 1.9, Method: Compositional matrix adjust. Identities = 31/106 (29%), Positives = 52/106 (49%), Gaps = 10/106 (9%) Query: 254 GCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEI 313 G + + SD K +S+ +A +II + K F++ E G EL G+ L I Sbjct: 1286 GINVWNSDISFKYAIADSTENALDIINRFKHRQFHW-KEDGRHQEL----GYVSQELYAI 1340 Query: 314 NEVFVDTI-----GGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 N V ++ + P +IIP+L+K+IQE +V++L+ I Sbjct: 1341 NPALVLSVPQKNGTVIMQPRDDMIIPYLSKSIQEEDAKVEQLKSKI 1386 >gi|225026040|ref|ZP_03715232.1| hypothetical protein EUBHAL_00279 [Eubacterium hallii DSM 3353] gi|224956646|gb|EEG37855.1| hypothetical protein EUBHAL_00279 [Eubacterium hallii DSM 3353] Length = 415 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 29/112 (25%), Positives = 54/112 (48%), Gaps = 3/112 (2%) Query: 233 TVLNMGRGPGYIHFDTDK--GAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYL 290 T+ ++G G GY+ + K G G LS E+ KK + E +++ +D+ L Sbjct: 194 TLCDIGCGWGYLLIEAAKKYGIKGVGITLSKEQKKKFEERIKENHLEDQLEVRLMDYRDL 253 Query: 291 PESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQE 342 PESG + +G E+ ++ E+F+ ++ + P + + H A+QE Sbjct: 254 PESGFKFDRVVSVGMLEHVGRDNYELFMKSVKSIMNPG-GLFLLHYISALQE 304 >gi|25152210|ref|NP_741884.1| hypothetical protein F21A10.2 [Caenorhabditis elegans] gi|5824465|emb|CAA16508.2| C. elegans protein F21A10.2a, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 898 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 35/119 (29%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 247 DTDKGAVGCSYFLSDERLKKVHGESSAS-AREIIEQLKFIDFNYLPES----GMDSELRY 301 D D + G + SD RLK E A A E +++L+ +D+ Y PE G+ + R Sbjct: 436 DGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPEVASKWGLTEDQRK 495 Query: 302 LIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 G L + V +G YL N S + A QEL + +L ID + E Sbjct: 496 RTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTGDLDQKIDDKVAE 554 >gi|302144755|emb|CBW44372.1| C. elegans protein F21A10.2e, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 1009 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 35/122 (28%), Positives = 52/122 (42%), Gaps = 5/122 (4%) Query: 244 IHFDTDKGAVGCSYFLSDERLKKVHGESSAS-AREIIEQLKFIDFNYLPES----GMDSE 298 + D D + G + SD RLK E A A E +++L+ +D+ Y PE G+ + Sbjct: 544 LTVDGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPEVASKWGLTED 603 Query: 299 LRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQN 358 R G L + V +G YL N S + A QEL + +L ID + Sbjct: 604 QRKRTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTGDLDQKIDDKV 663 Query: 359 EE 360 E Sbjct: 664 AE 665 >gi|322704319|gb|EFY95915.1| putative PEP5 protein [Metarhizium anisopliae ARSEF 23] Length = 944 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 33/106 (31%), Positives = 43/106 (40%), Gaps = 10/106 (9%) Query: 239 RGPGYIHFDTDKGAVGC---SYFLSDERLKKVHGESSASAREIIEQLKFID--FNYL--- 290 R PG + FD D V C Y+ L K HGE+ A +IE K Y+ Sbjct: 432 RSPGDLKFDLDTAIVMCRQGGYYEQAAYLAKRHGETELVADILIEDSKSYGEALGYIWRQ 491 Query: 291 -PESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVI-IP 334 PE +Y EN KE ++F+D G P S + IP Sbjct: 492 DPEVAYPCMQKYARVLIENCPKEATKLFIDYYTGKYRPRKSTVAIP 537 >gi|25152212|ref|NP_741883.1| hypothetical protein F21A10.2 [Caenorhabditis elegans] gi|22265847|emb|CAD44121.1| C. elegans protein F21A10.2b, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 947 Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust. Identities = 35/119 (29%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 247 DTDKGAVGCSYFLSDERLKKVHGESSAS-AREIIEQLKFIDFNYLPES----GMDSELRY 301 D D + G + SD RLK E A A E +++L+ +D+ Y PE G+ + R Sbjct: 485 DGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPEVASKWGLTEDQRK 544 Query: 302 LIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 G L + V +G YL N S + A QEL + +L ID + E Sbjct: 545 RTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTGDLDQKIDDKVAE 603 >gi|302144754|emb|CBW44371.1| C. elegans protein F21A10.2d, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 944 Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust. Identities = 35/119 (29%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 247 DTDKGAVGCSYFLSDERLKKVHGESSAS-AREIIEQLKFIDFNYLPES----GMDSELRY 301 D D + G + SD RLK E A A E +++L+ +D+ Y PE G+ + R Sbjct: 482 DGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPEVASKWGLTEDQRK 541 Query: 302 LIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 G L + V +G YL N S + A QEL + +L ID + E Sbjct: 542 RTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTGDLDQKIDDKVAE 600 >gi|32566568|ref|NP_509709.2| hypothetical protein F21A10.2 [Caenorhabditis elegans] gi|22265848|emb|CAD44122.1| C. elegans protein F21A10.2c, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|22265862|emb|CAD44135.1| C. elegans protein F21A10.2c, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 949 Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust. Identities = 35/119 (29%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 247 DTDKGAVGCSYFLSDERLKKVHGESSAS-AREIIEQLKFIDFNYLPES----GMDSELRY 301 D D + G + SD RLK E A A E +++L+ +D+ Y PE G+ + R Sbjct: 487 DGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPEVASKWGLTEDQRK 546 Query: 302 LIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 G L + V +G YL N S + A QEL + +L ID + E Sbjct: 547 RTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTGDLDQKIDDKVAE 605 >gi|32469426|ref|NP_862890.1| PblB [Streptococcus phage SM1] gi|10732859|gb|AAG18640.1| PblB [Streptococcus phage SM1] Length = 1062 Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust. Identities = 29/92 (31%), Positives = 47/92 (51%), Gaps = 2/92 (2%) Query: 259 LSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFV 318 +SD R K +S S +IIEQLK ++Y E E +++ K + E F Sbjct: 969 ISDRRYKSNIKDSQVSGLDIIEQLK--TYSYRKEYDGKIEDISCGIMAQDVQKYVPEAFF 1026 Query: 319 DTIGGYLAPNPSVIIPHLAKAIQELLQEVKEL 350 + G + ++P+L KAIQEL Q++++L Sbjct: 1027 ENPDGAYSYRTFELVPYLIKAIQELNQKIQKL 1058 >gi|309364934|emb|CAP23547.2| CBR-PQN-47 protein [Caenorhabditis briggsae AF16] Length = 925 Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust. Identities = 36/138 (26%), Positives = 61/138 (44%), Gaps = 18/138 (13%) Query: 241 PGYIHFDTDK----------GAVGCSYFL---SDERLKK-VHGESSASAREIIEQLKFID 286 PG + TD+ G + CS + SD RLK+ + + +A A E + +L+ +D Sbjct: 432 PGNVAIGTDRQAETARLTVAGDIYCSGRVINPSDIRLKEGISEKETAEAIENLLKLRVVD 491 Query: 287 FNYLPES----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQE 342 + Y E G+D + R G L+ + V IG YL + + A Q+ Sbjct: 492 YRYKDEVANVWGLDEQQRQRTGLIAQELQAVLPDAVRDIGDYLTIDEGRVFYETVMATQQ 551 Query: 343 LLQEVKELRDMIDKQNEE 360 L + +L + ID++ E Sbjct: 552 LCRMTGDLDNKIDEKVAE 569 >gi|167647462|ref|YP_001685125.1| hypothetical protein Caul_3500 [Caulobacter sp. K31] gi|167349892|gb|ABZ72627.1| hypothetical protein Caul_3500 [Caulobacter sp. K31] Length = 899 Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust. Identities = 15/26 (57%), Positives = 18/26 (69%) Query: 77 NLLPFYPIQQGGGIGQLDNKIYLGWN 102 N L F P+QQG G+GQ N I LGW+ Sbjct: 350 NTLGFTPVQQGTGVGQTTNVIKLGWS 375 >gi|268532300|ref|XP_002631278.1| C. briggsae CBR-PQN-47 protein [Caenorhabditis briggsae] Length = 918 Score = 37.7 bits (86), Expect = 3.0, Method: Compositional matrix adjust. Identities = 36/138 (26%), Positives = 61/138 (44%), Gaps = 18/138 (13%) Query: 241 PGYIHFDTDK----------GAVGCSYFL---SDERLKK-VHGESSASAREIIEQLKFID 286 PG + TD+ G + CS + SD RLK+ + + +A A E + +L+ +D Sbjct: 442 PGNVAIGTDRQAETARLTVAGDIYCSGRVINPSDIRLKEGISEKETAEAIENLLKLRVVD 501 Query: 287 FNYLPES----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQE 342 + Y E G+D + R G L+ + V IG YL + + A Q+ Sbjct: 502 YRYKDEVANVWGLDEQQRQRTGLIAQELQAVLPDAVRDIGDYLTIDEGRVFYETVMATQQ 561 Query: 343 LLQEVKELRDMIDKQNEE 360 L + +L + ID++ E Sbjct: 562 LCRMTGDLDNKIDEKVAE 579 >gi|14590006|ref|NP_142070.1| hypothetical protein PH0053 [Pyrococcus horikoshii OT3] gi|3256438|dbj|BAA29121.1| 404aa long hypothetical protein [Pyrococcus horikoshii OT3] Length = 404 Score = 37.4 bits (85), Expect = 3.5, Method: Compositional matrix adjust. Identities = 21/62 (33%), Positives = 32/62 (51%) Query: 295 MDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 +D E +G S+ +K+I E T+G ++ I+ H A+ I L + KEL D I Sbjct: 189 VDEEKLRELGLSDEKIKKIIEAKEKTMGAWMDETDIAIVQHFAEEIDRLYKLRKELEDYI 248 Query: 355 DK 356 DK Sbjct: 249 DK 250 >gi|330994925|ref|ZP_08318847.1| hypothetical protein SXCC_04812 [Gluconacetobacter sp. SXCC-1] gi|329758186|gb|EGG74708.1| hypothetical protein SXCC_04812 [Gluconacetobacter sp. SXCC-1] Length = 359 Score = 37.0 bits (84), Expect = 4.5, Method: Compositional matrix adjust. Identities = 15/21 (71%), Positives = 16/21 (76%) Query: 81 FYPIQQGGGIGQLDNKIYLGW 101 F PIQQGGG Q NK+YLGW Sbjct: 89 FTPIQQGGGTDQGSNKVYLGW 109 >gi|332158218|ref|YP_004423497.1| hypothetical protein PNA2_0577 [Pyrococcus sp. NA2] gi|331033681|gb|AEC51493.1| hypothetical protein PNA2_0577 [Pyrococcus sp. NA2] Length = 398 Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust. Identities = 20/62 (32%), Positives = 32/62 (51%) Query: 295 MDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 +D E +G SE +K+I E T+G ++ ++ H A+ I L + +EL D I Sbjct: 186 IDEEKLRELGLSEEKIKKIVEAKEKTMGAWMDETDISVVQHFAEEIDRLYKLRRELEDYI 245 Query: 355 DK 356 DK Sbjct: 246 DK 247 >gi|308509976|ref|XP_003117171.1| CRE-PQN-47 protein [Caenorhabditis remanei] gi|308242085|gb|EFO86037.1| CRE-PQN-47 protein [Caenorhabditis remanei] Length = 932 Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust. Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 8/121 (6%) Query: 248 TDKGAVGCSYFL---SDERLKK-VHGESSASAREIIEQLKFIDFNYLPES----GMDSEL 299 T G + CS + SD RLK+ + + +A A E + +L+ +D+ Y E G+D + Sbjct: 459 TVTGDIYCSGRVINPSDIRLKEGISEKETAEAIENLLKLRVVDYRYKSEVADVWGLDEQQ 518 Query: 300 RYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNE 359 R G L+ + V IG YL + + A Q+L + +L + ID++ Sbjct: 519 RQRTGLIAQELQAVLPDAVRDIGDYLTIDEGRVFYETVMATQQLCRMTGDLDNKIDEKVA 578 Query: 360 E 360 E Sbjct: 579 E 579 >gi|293393498|ref|ZP_06637808.1| vitamin B12 transporter BtuB [Serratia odorifera DSM 4582] gi|291423833|gb|EFE97052.1| vitamin B12 transporter BtuB [Serratia odorifera DSM 4582] Length = 643 Score = 37.0 bits (84), Expect = 4.8, Method: Compositional matrix adjust. Identities = 46/195 (23%), Positives = 81/195 (41%), Gaps = 35/195 (17%) Query: 16 AYPNSN---------ISSSAYNNLLKDLALDNNS----------PRPISAGGTAADNAVQ 56 A+PN N S +A++ +D N+S P+P+S+ A +V Sbjct: 20 AFPNDNDKNTLLSVVTSVTAFSGWAQDNTTTNHSDKLVVTANRFPQPVSS--VLAPTSVV 77 Query: 57 ARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMR 116 R ++ A +LT + LP I Q GG+GQ + G N + +L+ +D + Sbjct: 78 TRTDIDRWQAKSLTD---VMRRLPGVDIAQNGGLGQNSSLFVRGTNSSHVLVLIDGIRLN 134 Query: 117 EVWTSQLA-----PRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETL 171 + S + P +L +++ P VY +D+ +++ +NG TL Sbjct: 135 QAGVSGSSDLSQIPISLVQKIEYIRGPRSAVYGSDAIGGVVNIITT------RENNGTTL 188 Query: 172 HQAIFQNGAQFYNNS 186 + NG Q Y+ S Sbjct: 189 SAGLGSNGYQAYDAS 203 >gi|325508527|gb|ADZ20163.1| hypothetical protein CEA_G1125 [Clostridium acetobutylicum EA 2018] Length = 1085 Score = 37.0 bits (84), Expect = 5.0, Method: Compositional matrix adjust. Identities = 29/112 (25%), Positives = 53/112 (47%), Gaps = 11/112 (9%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVD 319 SDERLK + + II Q+ +F+++ ++G E +G L +IN+ F Sbjct: 976 SDERLKTNIKPITTNCSSIINQINLKEFDFI-KTGEHIE----VGTIAQQLSKINKKFSK 1030 Query: 320 TI------GGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQDVQ 365 + + AP+ + I+P++ AIQEL + K L + Q+++ Sbjct: 1031 ILYTTEDGTNFFAPDYNSILPYIIGAIQELSAKSKSLETRVTTLENSIQELK 1082 >gi|270265152|ref|ZP_06193414.1| transporter [Serratia odorifera 4Rx13] gi|270040786|gb|EFA13888.1| transporter [Serratia odorifera 4Rx13] Length = 637 Score = 37.0 bits (84), Expect = 5.4, Method: Compositional matrix adjust. Identities = 42/179 (23%), Positives = 76/179 (42%), Gaps = 26/179 (14%) Query: 23 SSSAYNNLLKDLALDNNS----------PRPISAGGTAADNAVQARVNLGTDDASNLTKG 72 S +A++ +D NNS P+P+S+ A +V R ++ A +LT Sbjct: 28 SVTAFSGWAQDNTTANNSDNLVVTANRFPQPVSS--VLAPTSVVTRDDIDRWQAKSLTD- 84 Query: 73 KIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLA-----PRA 127 + LP + Q GG+GQ + G N + +L+ +D + + S + P + Sbjct: 85 --VMRRLPGVDVGQSGGLGQKSSLFIRGTNSSHVLVLIDGIRLNQAGVSGSSDLSQIPIS 142 Query: 128 LQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNS 186 L V++ P Y +D+ +++ ++ NG TL + NG Q Y+ S Sbjct: 143 LVQKVEYIRGPRSAAYGSDAIGGVVNIITTRVK------NGTTLSAGVGSNGYQAYDAS 195 >gi|255036291|ref|YP_003086912.1| hypothetical protein Dfer_2529 [Dyadobacter fermentans DSM 18053] gi|254949047|gb|ACT93747.1| hypothetical protein Dfer_2529 [Dyadobacter fermentans DSM 18053] Length = 456 Score = 37.0 bits (84), Expect = 5.7, Method: Compositional matrix adjust. Identities = 29/107 (27%), Positives = 55/107 (51%), Gaps = 11/107 (10%) Query: 257 YFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEV 316 Y SD+RLK SAS +I QL+ + +L + + LI +E+ ++ Sbjct: 336 YLTSDKRLKTDFKNLSASLAKI-SQLQGYSYRWLDTTRTQTLQTGLIA------QEVEKL 388 Query: 317 FVDTIG----GYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNE 359 F + + GY + N + +IPHL +A++EL + ++ ++ ++ NE Sbjct: 389 FPELVNTDDNGYKSMNYNGLIPHLIEAVKELKGQTAQIAELRNELNE 435 >gi|307308922|ref|ZP_07588605.1| hypothetical protein SinmeBDRAFT_4489 [Sinorhizobium meliloti BL225C] gi|306900556|gb|EFN31169.1| hypothetical protein SinmeBDRAFT_4489 [Sinorhizobium meliloti BL225C] Length = 663 Score = 36.6 bits (83), Expect = 5.8, Method: Compositional matrix adjust. Identities = 17/37 (45%), Positives = 24/37 (64%) Query: 44 ISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLP 80 +S GGT + A AR +LGTD+A+N+T G + LP Sbjct: 161 VSEGGTGSTTAAAARTSLGTDNANNITTGTLADARLP 197 >gi|323486251|ref|ZP_08091579.1| hypothetical protein HMPREF9474_03330 [Clostridium symbiosum WAL-14163] gi|323693958|ref|ZP_08108144.1| cyclopropane-fatty-acyl-phospholipid synthase [Clostridium symbiosum WAL-14673] gi|323400469|gb|EGA92839.1| hypothetical protein HMPREF9474_03330 [Clostridium symbiosum WAL-14163] gi|323502004|gb|EGB17880.1| cyclopropane-fatty-acyl-phospholipid synthase [Clostridium symbiosum WAL-14673] Length = 390 Score = 36.6 bits (83), Expect = 6.0, Method: Compositional matrix adjust. Identities = 36/167 (21%), Positives = 73/167 (43%), Gaps = 16/167 (9%) Query: 225 TKNCLYK-TTVLNMGRGPGYIHFDTDKG--AVGCSYFLSDERLKKVHGESSASAREIIEQ 281 +K CL + +++++G G GY+ K G LS+E+ ++ E + Sbjct: 160 SKLCLKEGMSLVDIGCGWGYLLIRAAKKYRVKGTGITLSEEQYREFSRRIREEGLEDLLT 219 Query: 282 LKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQ 341 +K +D+ LPE+G+ + +G E+ + ++F D + L P + + H ++ Sbjct: 220 VKLLDYRELPETGLTFDRAVSVGMVEHVGRGNYQLFTDCVNSVLKPG-GLFLLHFISGLK 278 Query: 342 ELLQE------------VKELRDMIDKQNEEHQDVQDMSNTPLNSQK 376 E + V LR+M+ +E++ D+ N L+ K Sbjct: 279 EYPGDPWIKKYIFPGGVVPSLREMLYCLSEDNFHTLDVENLRLHYNK 325 >gi|300727802|ref|ZP_07061184.1| Ser/Thr protein phosphatase family [Prevotella bryantii B14] gi|299774955|gb|EFI71565.1| Ser/Thr protein phosphatase family [Prevotella bryantii B14] Length = 284 Score = 36.6 bits (83), Expect = 6.1, Method: Compositional matrix adjust. Identities = 27/118 (22%), Positives = 52/118 (44%), Gaps = 2/118 (1%) Query: 126 RALQNLVD--HAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFY 183 + + LVD +A KP+ IV+T D +A + FM+ L H + + + + Y Sbjct: 58 KFIHRLVDCVNAQKPDIIVFTGDLVNVSANEVIPFMKTLNKMHATDGIFSVMGNHDYCEY 117 Query: 184 NNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGP 241 + + + + +I+ Y +K W+ + T + K+ +Y V N+ R P Sbjct: 118 GEDKSVGNIIRNQNILQYMEEKIGWKLLNNQNVTISRGEGQRKSAIYLIGVENISRPP 175 >gi|150018933|ref|YP_001311187.1| cyclopropane-fatty-acyl-phospholipid synthase [Clostridium beijerinckii NCIMB 8052] gi|149905398|gb|ABR36231.1| Cyclopropane-fatty-acyl-phospholipid synthase [Clostridium beijerinckii NCIMB 8052] Length = 393 Score = 36.6 bits (83), Expect = 6.4, Method: Compositional matrix adjust. Identities = 26/106 (24%), Positives = 54/106 (50%), Gaps = 2/106 (1%) Query: 233 TVLNMGRGPGYIHFDT-DKGAV-GCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYL 290 +VL++G G G++ + +K ++ G LS E+ KK + E + +++ +D+ L Sbjct: 168 SVLDVGCGWGFLLIEAAEKYSIHGIGITLSKEQYKKFNERIKERKLEKLLEVRLMDYRNL 227 Query: 291 PESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHL 336 ESG+ + +G E+ +E E+F+ I L P ++ ++ Sbjct: 228 AESGLSFDRIVSVGMVEHIGRENYELFIKNIDSVLKPKGLCLLDYI 273 >gi|313157451|gb|EFR56872.1| conserved hypothetical protein [Alistipes sp. HGB5] Length = 221 Score = 36.6 bits (83), Expect = 6.5, Method: Compositional matrix adjust. Identities = 51/227 (22%), Positives = 92/227 (40%), Gaps = 26/227 (11%) Query: 140 HIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNSQRIAAFLD--DGD 197 H TT+ N A + + + F N Q +QNG YN +R ++ +G Sbjct: 4 HTKITTNLNFKAIKKVLALIVLCFIAFNSHA--QLKYQNGYLVYNAMERYNGYMTNWEGW 61 Query: 198 IMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVGCSY 257 C+K + + ++ + ++ S K LY T G+I + Sbjct: 62 AHCWKFGNRMIK-FDLGPADSRVSSNSDKLVLYDT------ENGGFIDL------YARNV 108 Query: 258 FLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENN-----LKE 312 + + + K + +S SA + QL+ + F + ++ R G S +E Sbjct: 109 YTNSDAASKTNIQSLGSATATLTQLRPVSFEWADKAHYFKTSRRSTGVSNPKEMGFIAQE 168 Query: 313 INEVFVDTIG----GYLAPNPSVIIPHLAKAIQELLQEVKELRDMID 355 I +V D + G+ N S +IP L K+IQEL +++ L+ I+ Sbjct: 169 IEQVLPDIVAVDCEGHRVVNYSALIPLLTKSIQELNGQIETLKAEIE 215 >gi|15894398|ref|NP_347747.1| hypothetical protein CA_C1113 [Clostridium acetobutylicum ATCC 824] gi|15024032|gb|AAK79087.1|AE007628_1 Hypothetical protein CA_C1113 [Clostridium acetobutylicum ATCC 824] Length = 491 Score = 36.6 bits (83), Expect = 7.1, Method: Compositional matrix adjust. Identities = 29/112 (25%), Positives = 53/112 (47%), Gaps = 11/112 (9%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVD 319 SDERLK + + II Q+ +F+++ ++G E +G L +IN+ F Sbjct: 382 SDERLKTNIKPITTNCSSIINQINLKEFDFI-KTGEHIE----VGTIAQQLSKINKKFSK 436 Query: 320 TI------GGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQDVQ 365 + + AP+ + I+P++ AIQEL + K L + Q+++ Sbjct: 437 ILYTTEDGTNFFAPDYNSILPYIIGAIQELSAKSKSLETRVTTLENSIQELK 488 Searching..................................................done Results from round 2 >gi|254781130|ref|YP_003065543.1| hypothetical protein CLIBASIA_05160 [Candidatus Liberibacter asiaticus str. psy62] gi|254040807|gb|ACT57603.1| hypothetical protein CLIBASIA_05160 [Candidatus Liberibacter asiaticus str. psy62] Length = 378 Score = 627 bits (1616), Expect = e-177, Method: Composition-based stats. Identities = 378/378 (100%), Positives = 378/378 (100%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN Sbjct: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 Query: 61 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT Sbjct: 61 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 Query: 121 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGA 180 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGA Sbjct: 121 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGA 180 Query: 181 QFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRG 240 QFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRG Sbjct: 181 QFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRG 240 Query: 241 PGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR 300 PGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR Sbjct: 241 PGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR 300 Query: 301 YLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 YLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE Sbjct: 301 YLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 Query: 361 HQDVQDMSNTPLNSQKFD 378 HQDVQDMSNTPLNSQKFD Sbjct: 361 HQDVQDMSNTPLNSQKFD 378 >gi|315122525|ref|YP_004063014.1| hypothetical protein CKC_03885 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495927|gb|ADR52526.1| hypothetical protein CKC_03885 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 360 Score = 566 bits (1458), Expect = e-159, Method: Composition-based stats. Identities = 258/360 (71%), Positives = 304/360 (84%), Gaps = 1/360 (0%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MPRI+EVYI+P+GS+AYPNSNISSSAYNNLL+DLALDNN PRPI AGGT A NA QAR+N Sbjct: 1 MPRIKEVYIIPSGSKAYPNSNISSSAYNNLLEDLALDNNQPRPICAGGTGAANASQARIN 60 Query: 61 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 LG DDA+NLTKGKIIA LLPFYPIQQGGG GQLDNKIYLGWNG LLLQVD VWT Sbjct: 61 LGMDDANNLTKGKIIATLLPFYPIQQGGGKGQLDNKIYLGWNGKNLLLQVDQQLQGVVWT 120 Query: 121 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGA 180 SQLAP ALQ+LV HA K NHIVYT DSN+YA+TPLSSF+R+LF+C N E LH+AIF+NGA Sbjct: 121 SQLAPLALQSLVSHAEKANHIVYTADSNKYASTPLSSFVRELFACKNNEELHKAIFKNGA 180 Query: 181 QFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALEST-KNCLYKTTVLNMGR 239 +FYNN+ +IA F++DGDI C+ R KT+WQGI+ AQ+TAN+A++ K + KTTVLNM R Sbjct: 181 KFYNNTNKIAEFMNDGDIFCHPRNKTMWQGIDSAQYTANLAIDLIPKQYIKKTTVLNMER 240 Query: 240 GPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSEL 299 GYIHFDTD GAVGC+YF+SDERLK++HG+SSAS+ ++IE+ KFIDF+YLP SGMDS L Sbjct: 241 SQGYIHFDTDMGAVGCNYFISDERLKEIHGKSSASSLDLIEKFKFIDFSYLPSSGMDSSL 300 Query: 300 RYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNE 359 RY IGFS NNL+EIN++ VD IG YL+PNPSVI+P+L KAIQEL EVKELR I +Q + Sbjct: 301 RYPIGFSANNLQEINDILVDKIGDYLSPNPSVILPYLCKAIQELSAEVKELRRRIHQQKD 360 >gi|302873697|ref|YP_003842330.1| phage minor structural protein [Clostridium cellulovorans 743B] gi|307686578|ref|ZP_07629024.1| phage minor structural protein [Clostridium cellulovorans 743B] gi|302576554|gb|ADL50566.1| phage minor structural protein [Clostridium cellulovorans 743B] Length = 1163 Score = 153 bits (385), Expect = 6e-35, Method: Composition-based stats. Identities = 44/129 (34%), Positives = 66/129 (51%), Gaps = 21/129 (16%) Query: 241 PGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR 300 YI T GA G +Y+ SD+R K E+ +A + I ++K DF+Y ++G+ Sbjct: 1038 QRYIEVSTIFGAKGLTYWDSDKRYKMNIEETDKTALDKIMKIKHYDFDY--KAGLK---H 1092 Query: 301 YLIGFSENNLKEINEVFVDTI---------GGYLAPNPSVIIPHLAKAIQELLQEVKELR 351 + +G+ L EIN +V T+ + PN + IIP+L+KAIQEL Q Sbjct: 1093 FDVGYISQQLMEINPQWVITVPQEAANGEVSDFYIPNQTTIIPYLSKAIQELKQ------ 1146 Query: 352 DMIDKQNEE 360 +ID QN E Sbjct: 1147 -LIDNQNRE 1154 >gi|227821710|ref|YP_002825680.1| hypothetical protein NGR_c11420 [Sinorhizobium fredii NGR234] gi|227340709|gb|ACP24927.1| hypothetical protein NGR_c11420 [Sinorhizobium fredii NGR234] Length = 506 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 46/163 (28%), Positives = 69/163 (42%), Gaps = 52/163 (31%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MPR +Y PAG++ P + I S+ YN+ + DL D N+ RPI+AGGT + +A AR Sbjct: 1 MPRTGGIYSAPAGTKGTPGNTIQSAPYNSFIDDLVDDLNAARPITAGGTGSTSASAARTA 60 Query: 61 LGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWT 120 LG +N+ + + Sbjct: 61 LGLAIGTNV-----------------------------------------------QAYD 73 Query: 121 SQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLF 163 + LA ++ L A+K ++YTT S+ YA T L+SF R L Sbjct: 74 AGLA--SIAGLTTAANK---MIYTTASDTYAVTDLTSFARTLL 111 >gi|330970689|gb|EGH70755.1| tail fiber protein H [Pseudomonas syringae pv. aceris str. M302273PT] Length = 780 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 31/98 (31%), Positives = 52/98 (53%), Gaps = 8/98 (8%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 + + P+QQGGG GQ NK+Y+GW+ L L VD + M +WT Q N D A+K Sbjct: 549 IGYTPLQQGGGAGQKTNKVYIGWSDVGLKLTVDATDMGRIWTEQSF-----NPNDKANKS 603 Query: 139 NHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIF 176 + + ++ Y T ++S + + + G+++ A F Sbjct: 604 SSVAGYGITDCYTVTQVNSLLNQRIA---GDSVQTAGF 638 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/168 (24%), Positives = 64/168 (38%), Gaps = 17/168 (10%) Query: 33 DLALDNNSPRPISAGGTAADNAVQARVNLG--TDDASNLTKGKIIANLLPFYP------- 83 D A D N ++ G AA + VNL SN T + Sbjct: 380 DGASDTNITLTLADSGVAAGTYTKVAVNLKGLVTSGSNPTTLSGYGITDAYSKDDANSSF 439 Query: 84 IQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVY 143 ++QGGG Q N+I +GW G L + VD S M +WT N D A+K + Sbjct: 440 VKQGGGPDQKANRINIGWTGAMLKVSVDGSDMGRIWTETSF-----NPNDKANKATSLSG 494 Query: 144 TTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNSQRIAA 191 +N Y ++ + + +++ A F + Y +RI+ Sbjct: 495 YGITNAYTVEQVNDLVGRRV---LADSIVHAGFASNNTDYPYFRRISD 539 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/58 (41%), Positives = 31/58 (53%), Gaps = 2/58 (3%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGW--NGTQLLLQVDLSSMREVWTSQLAPRALQN 130 + N L F P+QQGGG Q N++ LGW NG + QVD S + +W Q R N Sbjct: 659 LQNRLGFTPVQQGGGANQASNQLRLGWATNGAGVRAQVDASDLGLLWGEQNFYRPDNN 716 >gi|150397017|ref|YP_001327484.1| hypothetical protein Smed_1814 [Sinorhizobium medicae WSM419] gi|150028532|gb|ABR60649.1| hypothetical protein Smed_1814 [Sinorhizobium medicae WSM419] Length = 499 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 32/69 (46%), Positives = 42/69 (60%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 +PR VY PAG++ N+ I S YN L+ DL D N+ RPI+AGGT A +A AR N Sbjct: 5 LPRTGGVYSPPAGTKGVSNTTIQSVPYNALVDDLTADANAARPITAGGTGATSASVARTN 64 Query: 61 LGTDDASNL 69 LG +N+ Sbjct: 65 LGLAIGTNV 73 Score = 42.9 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 38/90 (42%), Gaps = 5/90 (5%) Query: 35 ALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKG-----KIIANLLPFYPIQQGGG 89 AL + +S GGT A +A AR NLG ++ASNLT G +I + G Sbjct: 228 ALTLETALAVSEGGTGATDAATARSNLGANNASNLTTGTLPNARISGAYDGITTLSTSGK 287 Query: 90 IGQLDNKIYLGWNGTQLLLQVDLSSMREVW 119 I N+I + ++ + + W Sbjct: 288 ITTTGNEIEISGGSPRVRFSDTNTDAYDFW 317 >gi|301386028|ref|ZP_07234446.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato Max13] gi|302060587|ref|ZP_07252128.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato K40] gi|302129790|ref|ZP_07255780.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato NCPPB 1108] Length = 780 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 8/98 (8%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 + + P+QQGGG GQ NK+Y+GW+ L L VD + M +WT Q N + A+K Sbjct: 549 IGYTPLQQGGGAGQKTNKVYIGWSDVGLKLTVDATDMGRIWTEQSF-----NPNEKANKS 603 Query: 139 NHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIF 176 N I ++ Y T +++ + + +++ A F Sbjct: 604 NSIAGYGITDCYTVTQVNALVSARVAA---DSITAAGF 638 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 38/168 (22%), Positives = 65/168 (38%), Gaps = 30/168 (17%) Query: 33 DLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPF----------- 81 D A D N ++ G AA + VNL +T G L + Sbjct: 380 DGASDTNITLTLADSGVAAGTYTRVAVNL----KGLVTSGGNPTTLAGYGITDAYSKDDA 435 Query: 82 --YPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPN 139 ++QGGG G L N+I +GW G +L + VD + +WT N D A+K Sbjct: 436 NSSFVKQGGGPGMLSNRINIGWTGDRLKVSVDGTDGGRIWTDTSF-----NPNDKANKAT 490 Query: 140 HIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNSQ 187 + ++ + T ++ + + +++ A F + NN+ Sbjct: 491 SLAGYGITDAHTITQVNDLVGRRV---LADSIIHAGFAS-----NNTD 530 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 7/102 (6%) Query: 31 LKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGI 90 + L + I+A G A+DN ++ A + + N L F P+QQGGG Sbjct: 620 VNALVSARVAADSITAAGFASDNPEFPYFRRSSNGAVH-----YLQNRLGFVPVQQGGGA 674 Query: 91 GQLDNKIYLGW--NGTQLLLQVDLSSMREVWTSQLAPRALQN 130 Q N++ LGW NG + QVD + + +W Q R N Sbjct: 675 NQATNQLRLGWATNGAGVRAQVDATDLGLLWGEQNFYRPDNN 716 >gi|319783524|ref|YP_004143000.1| hypothetical protein Mesci_3833 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169412|gb|ADV12950.1| hypothetical protein Mesci_3833 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 403 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 33/85 (38%), Positives = 42/85 (49%), Gaps = 3/85 (3%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MPR VY P G+ PN+ I S+ YN L+ DL D N+ RP++AGG+ A AV N Sbjct: 1 MPRAGGVYSAPPGTAGTPNTTIESAKYNALVADLVADANAARPLTAGGSGATTAVGGSDN 60 Query: 61 L---GTDDASNLTKGKIIANLLPFY 82 GT AS T A + Sbjct: 61 FNAAGTSLASAATVNLANATGVAVT 85 >gi|28871149|ref|NP_793768.1| tail fiber protein H [Pseudomonas syringae pv. tomato str. DC3000] gi|28854399|gb|AAO57463.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato str. DC3000] gi|331017133|gb|EGH97189.1| tail fiber protein H, putative [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 808 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 52/121 (42%), Gaps = 26/121 (21%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLA-PRALQN------- 130 + + P+QQGGG GQ NK+++GW+ L L VD + +WT Q P N Sbjct: 549 IGYTPLQQGGGAGQKTNKVFIGWSDVGLKLTVDTIDIGRIWTEQSFNPNDKANRSNSIAG 608 Query: 131 ---------------LVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI 175 L D A++ N I ++ Y T ++S + + + G+ + A Sbjct: 609 YGITDCYTVNQVNSLLGDKANRSNSIAGYGITDCYTVTQVNSIVNQRVA---GDAIQTAG 665 Query: 176 F 176 F Sbjct: 666 F 666 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 55/140 (39%), Gaps = 20/140 (14%) Query: 44 ISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNG 103 +++GG A + + D +N + ++QGG G L N+I +GW G Sbjct: 412 VTSGGNPTTLAGYGITDAYSKDDANSSF------------VKQGGAPGMLGNRINIGWTG 459 Query: 104 TQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLF 163 ++ + VD + +WT N D A+K + +N Y ++ + + Sbjct: 460 DRVKVSVDGTDGGRIWTDTTF-----NPNDKANKATSLSGYGITNAYTVEQVNDLVNRRV 514 Query: 164 SCHNGETLHQAIFQNGAQFY 183 +++ A F + Y Sbjct: 515 ---QADSIIHAGFASNNTDY 531 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 2/58 (3%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGW--NGTQLLLQVDLSSMREVWTSQLAPRALQN 130 + N L F P+QQGGG Q N++ LGW N + QVD + + +W Q R N Sbjct: 687 LQNRLGFTPVQQGGGANQAGNQLRLGWATNAAGIRAQVDATDLGLLWGEQNFYRPDNN 744 >gi|227822432|ref|YP_002826404.1| hypothetical protein NGR_c18870 [Sinorhizobium fredii NGR234] gi|227341433|gb|ACP25651.1| hypothetical protein NGR_c18870 [Sinorhizobium fredii NGR234] Length = 403 Score = 99.5 bits (246), Expect = 7e-19, Method: Composition-based stats. Identities = 32/69 (46%), Positives = 41/69 (59%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 +PR +Y PAG++ PN+ I S YN L+ DL D N+ RPI+AGGT A +A AR Sbjct: 5 LPRTGGIYSPPAGTKGVPNTTIQSVPYNALIDDLTADANAARPITAGGTGATSASAARTA 64 Query: 61 LGTDDASNL 69 LG AS Sbjct: 65 LGAQGASAA 73 >gi|330938404|gb|EGH42030.1| tail fiber protein H [Pseudomonas syringae pv. pisi str. 1704B] Length = 624 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 27/104 (25%), Positives = 44/104 (42%), Gaps = 8/104 (7%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W + N A Sbjct: 265 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDATDLGNIWYANNF-----NPATKADWG 319 Query: 139 NHIVYTTDSNQYAATPL--SSFMRKLFSCH-NGETLHQAIFQNG 179 + ++ Y + S + S + +++ NG Sbjct: 320 TTLASYKITDAYTKAEVYAKSEVDTRLSSKASADSVTNVGLANG 363 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 5/77 (6%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + P+QQG G GQ +N + +GW+ L L VD + + VW + + Sbjct: 381 LQTRLGYTPVQQGTGTGQFNNVVKIGWSDNGLKLTVDATDLGRVWYAANF-----DPNGK 435 Query: 135 AHKPNHIVYTTDSNQYA 151 A+ + ++ Y Sbjct: 436 ANWGTTLAAYGITDAYT 452 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 1/54 (1%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQ-LLLQVDLSSMREVWTSQLAPRALQNL 131 L + P++QGGG NK+ LG+NG L LQVD + ++ Q P + L Sbjct: 499 LGYTPVEQGGGANMAANKVRLGYNGAGSLRLQVDSTDFGDMINDQNLPGKVAAL 552 >gi|330985191|gb|EGH83294.1| tail fiber protein H [Pseudomonas syringae pv. lachrymans str. M301315] Length = 665 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 32/149 (21%), Positives = 59/149 (39%), Gaps = 12/149 (8%) Query: 31 LKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGI 90 L D + S ++ A LG +T G I+QGGGI Sbjct: 391 LADSGVGAGSYTKVTVNAKGLVTAGSTPTTLG---GFGITDG-YTKTEAYTTFIKQGGGI 446 Query: 91 GQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQY 150 GQL +++ +GW+ T L L V+ + +WT N + A + + ++ Y Sbjct: 447 GQLSSRLNIGWSSTGLRLTVEGEDLGRIWTETAF-----NPNNKADRAATLAGYGIADAY 501 Query: 151 AATPLSSFMRKLFSCHNGETLHQAIFQNG 179 + +++ + + +T+ A F +G Sbjct: 502 TTSQVNNLLNQRV---LADTITTAGFASG 527 >gi|320332017|gb|EFW87953.1| tail fiber protein H, putative [Pseudomonas syringae pv. glycinea str. race 4] Length = 625 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 5/77 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W + N A Sbjct: 265 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDATDLGNIWYANNF-----NPATKADWG 319 Query: 139 NHIVYTTDSNQYAATPL 155 + ++ Y + Sbjct: 320 TTLASYKITDAYTKAEV 336 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + P+QQG G+GQL+N + +GW+ L VD + M +W ++ + Sbjct: 381 LQTKLLYAPVQQGTGLGQLNNVVKIGWSDNGLKATVDETDMGNLWYAKNF-----DPTTK 435 Query: 135 AHKPNHIVYTTDSNQYA 151 A+ + ++ Y Sbjct: 436 ANWGTTLASYRIADAYT 452 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 19/45 (42%), Positives = 26/45 (57%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQL 123 L F P+QQG G+GQL N I +G++G Q L VD + + T Sbjct: 499 LGFTPVQQGTGVGQLSNLIRIGYDGAQPRLTVDNTDWGRIMTENN 543 >gi|330882145|gb|EGH16294.1| tail fiber protein H [Pseudomonas syringae pv. glycinea str. race 4] Length = 483 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 5/77 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W + N A Sbjct: 123 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDATDLGNIWYANNF-----NPATKADWG 177 Query: 139 NHIVYTTDSNQYAATPL 155 + ++ Y + Sbjct: 178 TTLASYKITDAYTKAEV 194 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + P+QQG G+GQL+N + +GW+ L VD + M +W ++ + Sbjct: 239 LQTKLLYAPVQQGTGLGQLNNVVKIGWSDNGLKATVDETDMGNLWYAKNF-----DPTTK 293 Query: 135 AHKPNHIVYTTDSNQYA 151 A+ + ++ Y Sbjct: 294 ANWGTTLASYRIADAYT 310 Score = 46.0 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 19/45 (42%), Positives = 26/45 (57%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQL 123 L F P+QQG G+GQL N I +G++G Q L VD + + T Sbjct: 357 LGFTPVQQGTGVGQLSNLIRIGYDGAQPRLTVDNTDWGRIMTENN 401 >gi|289623575|ref|ZP_06456529.1| tail fiber protein H, putative [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650819|ref|ZP_06482162.1| tail fiber protein H, putative [Pseudomonas syringae pv. aesculi str. 2250] gi|330868531|gb|EGH03240.1| tail fiber protein H [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 619 Score = 77.2 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 36/78 (46%), Gaps = 5/78 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W S + A+ Sbjct: 265 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDNTDLGNLWYSNNF-----DPTTKANWG 319 Query: 139 NHIVYTTDSNQYAATPLS 156 + ++ Y + Sbjct: 320 TTLAAYRIADAYTKAEVD 337 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 36/82 (43%), Gaps = 5/82 (6%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + P+QQG GIGQ N + +GW+ + L VD + + +W S D Sbjct: 375 LQTKLGYTPVQQGTGIGQQGNSVKIGWSSSGLKATVDETDLGTLWMSSNFK-----PSDK 429 Query: 135 AHKPNHIVYTTDSNQYAATPLS 156 A K + + ++ Y + Sbjct: 430 ADKASTLSGYGITDGYTKAEID 451 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 16/45 (35%), Positives = 25/45 (55%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQL 123 L F P+QQG G GQ +N I +G++GT + VD + + + Sbjct: 493 LGFAPVQQGTGAGQGNNLIKIGYDGTNPRITVDATDFGWIMHEKN 537 >gi|237803435|ref|ZP_04591020.1| tail fiber protein H [Pseudomonas syringae pv. oryzae str. 1_6] gi|331025417|gb|EGI05473.1| tail fiber protein H [Pseudomonas syringae pv. oryzae str. 1_6] Length = 720 Score = 76.4 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 52/120 (43%), Gaps = 13/120 (10%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P + GG +A ++ + T A+ + L F +QQG G GQLDN + +GW+ Sbjct: 300 PTTLGGYGIADAY-SKTEIDTRVAA-------LQPKLGFTAVQQGTGAGQLDNAVKIGWS 351 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 GT L VD + + ++W S N A + + ++ Y + + + + Sbjct: 352 GTDLKATVDKTDLGKLWYSSNF-----NPDKKADRAETLSGYGINDAYTSAEVDRRVSER 406 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 40/105 (38%), Gaps = 8/105 (7%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + P+Q G G+GQ N I +G + + + VD + +W S + Sbjct: 476 LQTRLGYVPVQAGTGVGQQSNLIKIGHSAVGVKITVDETDFGNLWYSGNF-----DPSSK 530 Query: 135 AHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNG 179 A+ + + +N Y + + +++ F +G Sbjct: 531 ANVGSTLAAYGINNAYTKAEVDARDSSR---AIADSITHVGFASG 572 Score = 36.4 bits (82), Expect = 7.0, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 41/121 (33%), Gaps = 1/121 (0%) Query: 12 AGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTK 71 S+A S +++ NN +D A V S+ Sbjct: 527 PSSKANVGSTLAAYGINNAYTKAEVDARDSSRAIADSITHVGFASGDVAFPYMRRSSDGG 586 Query: 72 GKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQ-LLLQVDLSSMREVWTSQLAPRALQN 130 + L F P++QGGG + K+ LG+N + VD + ++ + + Sbjct: 587 IFYLQTRLGFSPVEQGGGANMGNFKLRLGYNNAGSVRFSVDGADYGDLISDINLATKVAG 646 Query: 131 L 131 L Sbjct: 647 L 647 >gi|330898935|gb|EGH30354.1| tail fiber protein H [Pseudomonas syringae pv. japonica str. M301072PT] Length = 619 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 37/78 (47%), Gaps = 5/78 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQGGG GQLDNK+ +GW+ L VD + + +W S + + A+ Sbjct: 265 LGFTPVQQGGGTGQLDNKVKIGWSNNGLKAMVDNTDLGNLWYSNNF-----DPTNKANWG 319 Query: 139 NHIVYTTDSNQYAATPLS 156 + ++ Y + Sbjct: 320 TTLAAYRIADAYTKAEVD 337 Score = 69.5 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 5/82 (6%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + PIQQG GIGQL N + +GW+ + L VD + + +W S D Sbjct: 375 LQTKLSYTPIQQGTGIGQLGNPVKIGWSSSGLKATVDETDLGTLWASSNFK-----PSDK 429 Query: 135 AHKPNHIVYTTDSNQYAATPLS 156 A K + + ++ Y + Sbjct: 430 ADKASTLSGYGITDGYTKAEID 451 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 16/45 (35%), Positives = 25/45 (55%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQL 123 L F P+QQG G GQ +N I +G++GT + VD + + + Sbjct: 493 LGFAPVQQGTGAGQGNNLIKIGYDGTNPRITVDATDFGWIMHEKN 537 >gi|301386568|ref|ZP_07234986.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato Max13] Length = 517 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 60/144 (41%), Gaps = 14/144 (9%) Query: 20 SNISSSAYNNL-LKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANL 78 + + S YN + + L P + GG +A ++ + A+ + Sbjct: 104 TGVLSGTYNRVTVDRRGLVMAGFNPTTLGGYGITDAY-SKTEIDARVAA-------LQPK 155 Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQG G GQL+N++ LGW+G+ L VD + + +W S + A+ Sbjct: 156 LGFVPVQQGTGAGQLNNQVRLGWSGSGLKAAVDNTDLGNLWYSGNF-----DPATKANVG 210 Query: 139 NHIVYTTDSNQYAATPLSSFMRKL 162 + + ++ Y + + + Sbjct: 211 STLFAYGITDAYTKAEVDQRVSER 234 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 6/84 (7%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGW-NGTQLLLQVDLSSMREVWTSQLAPRALQNLVD 133 + L F P+QQG G GQL+N I +G+ + Q+ L VD + +W S + Sbjct: 291 LQTKLGFVPVQQGTGAGQLNNLIKIGYTSAGQVKLAVDNTDFGNLWYSGNF-----DPSK 345 Query: 134 HAHKPNHIVYTTDSNQYAATPLSS 157 A+ + + ++ Y + Sbjct: 346 KANVGSTLFAYGITDAYTKNETDA 369 >gi|213972284|ref|ZP_03400352.1| tail fiber protein H [Pseudomonas syringae pv. tomato T1] gi|213922961|gb|EEB56588.1| tail fiber protein H [Pseudomonas syringae pv. tomato T1] Length = 689 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 52/120 (43%), Gaps = 13/120 (10%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P + GG +A ++ + A+ + L F P+QQG G GQL+N++ LGW+ Sbjct: 300 PTTLGGYGIADAY-SKTEIDARVAA-------LQPRLGFVPVQQGTGAGQLNNQVRLGWS 351 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 G+ L VD + + +W S + A+ + + ++ Y + + + Sbjct: 352 GSGLKAAVDNTDLGNLWYSGNF-----DPATKANVGSTLFAYGITDAYTKAEVDQRVSER 406 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 6/84 (7%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGW-NGTQLLLQVDLSSMREVWTSQLAPRALQNLVD 133 + L F P+QQG G GQL+N I +G+ + Q+ L VD + +W S + Sbjct: 463 LQTKLGFVPVQQGTGAGQLNNLIKIGYTSAGQVKLAVDNTDFGNLWYSGNF-----DPSK 517 Query: 134 HAHKPNHIVYTTDSNQYAATPLSS 157 A+ + + ++ Y + Sbjct: 518 KANVGSTLFAYGITDAYTKNETDA 541 >gi|28870562|ref|NP_793181.1| tail fiber protein H [Pseudomonas syringae pv. tomato str. DC3000] gi|28853810|gb|AAO56876.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato str. DC3000] Length = 689 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 52/120 (43%), Gaps = 13/120 (10%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P + GG +A ++ + A+ + L F P+QQG G GQL+N++ LGW+ Sbjct: 300 PTTLGGYGIADAY-SKTEIDARVAA-------LQPKLGFVPVQQGTGAGQLNNQVRLGWS 351 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 G+ L VD + + +W S + A+ + + ++ Y + + + Sbjct: 352 GSGLKAAVDNTDLGNLWYSGNF-----DPATKANVGSTLFAYGITDAYTKAEVDQRVSER 406 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 6/84 (7%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGW-NGTQLLLQVDLSSMREVWTSQLAPRALQNLVD 133 + L F P+QQG G GQL+N I +G+ + Q+ L VD + +W S + Sbjct: 463 LQTKLGFVPVQQGTGAGQLNNLIKIGYTSAGQVKLAVDNTDFGNLWYSGNF-----DPSK 517 Query: 134 HAHKPNHIVYTTDSNQYAATPLSS 157 A+ + + ++ Y + Sbjct: 518 KANVGSTLFAYGITDAYTKNETDA 541 >gi|320325918|gb|EFW81977.1| tail fiber protein H, putative [Pseudomonas syringae pv. glycinea str. B076] Length = 444 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 5/77 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQGGG QLDNK+ +GW+ L VD + + +W + N A Sbjct: 265 LGFTPVQQGGGTAQLDNKVKIGWSNNGLKAMVDATDLGNIWYANNF-----NPATKADWG 319 Query: 139 NHIVYTTDSNQYAATPL 155 + ++ Y + Sbjct: 320 TTLASYKITDAYTKAEV 336 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 18/68 (26%), Positives = 33/68 (48%), Gaps = 5/68 (7%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + P+QQG G+GQL+N + +GW+ L VD + M +W ++ + Sbjct: 381 LQTKLLYAPVQQGTGLGQLNNVVKIGWSDNGLKATVDETDMGNLWYAKNF-----DPTTK 435 Query: 135 AHKPNHIV 142 A+ + Sbjct: 436 ANWGTTLA 443 >gi|302063119|ref|ZP_07254660.1| tail fiber protein H, putative [Pseudomonas syringae pv. tomato K40] Length = 456 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 52/120 (43%), Gaps = 13/120 (10%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P + GG +A ++ + A+ + L F P+QQG G GQL+N++ LGW+ Sbjct: 67 PTTLGGYGITDAY-SKTEIDARVAA-------LQPRLGFVPVQQGTGAGQLNNQVRLGWS 118 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 G+ L VD + + +W S + A+ + + ++ Y + + + Sbjct: 119 GSGLKAAVDNTDLGNLWYSGNF-----DPATKANVGSTLFAYGITDAYTKAEVDQRVSER 173 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 6/84 (7%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGW-NGTQLLLQVDLSSMREVWTSQLAPRALQNLVD 133 + L F P+QQG G GQL+N I +G+ + Q+ L VD + +W S + Sbjct: 230 LQTKLGFVPVQQGTGAGQLNNLIKIGYTSAGQVKLAVDNTDFGNLWYSGNF-----DPSK 284 Query: 134 HAHKPNHIVYTTDSNQYAATPLSS 157 A+ + + ++ Y + Sbjct: 285 KANVGSTLFAYGITDAYTKNETDA 308 >gi|330970637|gb|EGH70703.1| tail fiber protein H [Pseudomonas syringae pv. aceris str. M302273PT] Length = 689 Score = 74.5 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 53/120 (44%), Gaps = 13/120 (10%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P + GG +A ++ ++ A+ + L F P+QQG G GQL+N++ LGW+ Sbjct: 300 PTTLGGYGIADAY-SKTDIDARVAA-------LQPKLGFVPVQQGTGAGQLNNQVRLGWS 351 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 G+ L VD + + +W S + A+ + + ++ Y + + + Sbjct: 352 GSALKAAVDNTDLGNLWYSGNF-----DPATKANVGSTLFAYGITDAYTKAEVDQRVSER 406 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/124 (22%), Positives = 50/124 (40%), Gaps = 7/124 (5%) Query: 35 ALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLD 94 ++ RP++ T A AS+ T + L F P+QQG G GQL+ Sbjct: 424 TDARDAQRPLADSITNIGLAANDPTQPYMRRASDNTT-YFLQTRLGFVPVQQGTGAGQLN 482 Query: 95 NKIYLGW-NGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAAT 153 N I +G+ + Q+ L VD + +W S + A+ + + ++ Y Sbjct: 483 NLIKIGYTSAGQVKLAVDNTDFGNLWYSGNF-----DPSKKANVGSTLFAYGITDAYTKA 537 Query: 154 PLSS 157 + + Sbjct: 538 EVDA 541 >gi|330878350|gb|EGH12499.1| tail fiber protein H [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 625 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 5/77 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQGGG GQL+N + +GW+ L VD + + +W S N A Sbjct: 265 LGFTPVQQGGGTGQLNNLVKIGWSNNGLKAMVDATDLGNLWYSNNF-----NPATKADWG 319 Query: 139 NHIVYTTDSNQYAATPL 155 + ++ Y + Sbjct: 320 TTLAAYRIADAYTKAEV 336 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 35/78 (44%), Gaps = 6/78 (7%) Query: 75 IANLLPFYPIQQGGGIGQLDNK-IYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVD 133 + L + P+QQG G+GQ + +GW+ + L VD++ M +W + + Sbjct: 381 LQTKLGYTPVQQGTGVGQNSGAAVKIGWSSSGLKATVDVTDMGNLWYANNF-----DPGS 435 Query: 134 HAHKPNHIVYTTDSNQYA 151 A+ + + +N Y Sbjct: 436 KANWGSTLAAYGITNAYT 453 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 50/125 (40%), Gaps = 1/125 (0%) Query: 12 AGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTK 71 GS+A S +++ N D + +A V+L ++ + Sbjct: 433 PGSKANWGSTLAAYGITNAYTKAESDARDSQQPNADSITILGFAGNDVSLPYMRRASDGQ 492 Query: 72 GKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQ-LLLQVDLSSMREVWTSQLAPRALQN 130 + L + P++QGGG NK+ LG+NG L LQVD S ++ Q P L Sbjct: 493 VYYLQPRLGYTPVEQGGGPNMSANKVRLGYNGAGSLRLQVDSSDFGDLTNDQNLPAKLAG 552 Query: 131 LVDHA 135 L A Sbjct: 553 LGLSA 557 >gi|330963871|gb|EGH64131.1| tail fiber protein H [Pseudomonas syringae pv. actinidiae str. M302091] Length = 625 Score = 71.8 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 5/77 (6%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKP 138 L F P+QQGGG GQL+N + +GW+ L VD + + +W S N + A Sbjct: 265 LGFTPVQQGGGSGQLNNLVKIGWSNNGLKAMVDATDLGNLWYSNNF-----NPANKADWG 319 Query: 139 NHIVYTTDSNQYAATPL 155 + ++ Y + Sbjct: 320 TTLAAYRIADAYTKAEV 336 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 35/78 (44%), Gaps = 6/78 (7%) Query: 75 IANLLPFYPIQQGGGIGQLDNK-IYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVD 133 + L + P+QQG G+GQ + +GW+ + L VD++ M +W + + Sbjct: 381 LQTKLGYTPVQQGTGVGQNSGAAVKIGWSSSGLKATVDVTDMGNLWYANNF-----DPGS 435 Query: 134 HAHKPNHIVYTTDSNQYA 151 A+ + + +N Y Sbjct: 436 KANWGSTLAAYGITNAYT 453 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 50/125 (40%), Gaps = 1/125 (0%) Query: 12 AGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTK 71 GS+A S +++ N D + +A V+L ++ + Sbjct: 433 PGSKANWGSTLAAYGITNAYTKAESDARDSQQPNADSITILGFAGNDVSLPYMRRASDGQ 492 Query: 72 GKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQ-LLLQVDLSSMREVWTSQLAPRALQN 130 + L + P++QGGG NK+ LG+NG L LQVD S ++ Q P L Sbjct: 493 VYYLQPRLGYTPVEQGGGPNMSANKVRLGYNGAGSLRLQVDSSDFGDLTNDQNLPAKLAG 552 Query: 131 LVDHA 135 L A Sbjct: 553 LGLSA 557 >gi|13470671|ref|NP_102240.1| hypothetical protein mll0443 [Mesorhizobium loti MAFF303099] gi|14021413|dbj|BAB48026.1| mll0443 [Mesorhizobium loti MAFF303099] Length = 349 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 28/75 (37%), Positives = 36/75 (48%), Gaps = 1/75 (1%) Query: 1 MPRIR-EVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARV 59 MPR V P A + I SS YN DL D N+ RPI+AGGT A+ A+ Sbjct: 3 MPRDGSGVMSWPPNGNAVTATPIDSSKYNARFADLLSDLNAARPITAGGTGANTAIGGND 62 Query: 60 NLGTDDASNLTKGKI 74 NL T A+ + + Sbjct: 63 NLNTAGANIASASTV 77 >gi|330891239|gb|EGH23900.1| tail fiber protein H [Pseudomonas syringae pv. mori str. 301020] Length = 510 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + P+QQG G+GQL+N + +GW+ L VD + M +W + + Sbjct: 267 LQTKLLYAPVQQGTGVGQLNNVVKIGWSDNGLKATVDATDMGTLWYANNF-----DPGSK 321 Query: 135 AHKPNHIVYTTDSNQYA 151 A+ + + +N Y Sbjct: 322 ANWGSTLAAYGITNAYT 338 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 49/125 (39%), Gaps = 1/125 (0%) Query: 12 AGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTK 71 GS+A S +++ N D + A + VNL ++ + Sbjct: 318 PGSKANWGSTLAAYGITNAYTKAESDARDLQRAMADSISYVGFAGNDVNLPYMRRASDGQ 377 Query: 72 GKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQ-LLLQVDLSSMREVWTSQLAPRALQN 130 + L F PI+QGGG NK+ LG+N L LQVD++ ++ P L Sbjct: 378 VYYLQPRLGFPPIEQGGGPNMSTNKVRLGYNSAGSLRLQVDVTDFGDLTNDYNLPTKLAG 437 Query: 131 LVDHA 135 L A Sbjct: 438 LGMSA 442 >gi|330980925|gb|EGH79028.1| tail fiber protein H [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 510 Score = 68.7 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 75 IANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDH 134 + L + P+QQG G+GQL+N + +GW+ L VD + M +W + + Sbjct: 267 LQTKLLYAPVQQGTGVGQLNNVVKIGWSDNGLKATVDATDMGTLWYANNF-----DPGSK 321 Query: 135 AHKPNHIVYTTDSNQYA 151 A+ + + +N Y Sbjct: 322 ANWGSTLAAYGITNAYT 338 Score = 45.2 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 48/125 (38%), Gaps = 1/125 (0%) Query: 12 AGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTK 71 GS+A S +++ N D + + A VN ++ + Sbjct: 318 PGSKANWGSTLAAYGITNAYTKAESDARDLQRVMADSITYVGFASNDVNFPYMRRASDGQ 377 Query: 72 GKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQ-LLLQVDLSSMREVWTSQLAPRALQN 130 + L F PI+QGGG NKI LG+N L LQVD++ ++ P L Sbjct: 378 VYFLQPRLGFPPIEQGGGPNMSTNKIRLGYNSAGSLRLQVDVTDFGDLTNDYNLPTKLAG 437 Query: 131 LVDHA 135 L A Sbjct: 438 LGMSA 442 >gi|38640370|ref|NP_944293.1| Bcep22gp64 [Burkholderia phage Bcep22] gi|33860437|gb|AAQ54997.1| Bcep22gp64 [Burkholderia phage Bcep22] Length = 425 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 54/253 (21%), Positives = 105/253 (41%), Gaps = 28/253 (11%) Query: 118 VWTSQLA-PRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIF 176 VW + P + Q ++A N++V + + + + + + G H F Sbjct: 190 VWDAGNFDPASKQAAGNYAKSTNNVV----TFDWGTHQVGQLGLTIDANYQGYLWHSGNF 245 Query: 177 QNGAQFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIA-LESTKNCLYKTT-- 233 N A + S DDG++ +VW G+ +A +++ N + T Sbjct: 246 -NPANYAIKS----VHQDDGNV-----NGSVWGGLLSDYLATQLAGKQASGNYMRGVTGN 295 Query: 234 --VLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLP 291 + G +++F D VG + +SDER KK SS A ++ L+F++F+++ Sbjct: 296 GFTVGWDSGANHLNFFVDGNLVG--FVVSDERAKKNITPSSTDALARVKALEFVEFDFID 353 Query: 292 ESGMDSELRYLIGFSENNLKEINEVFVDT-----IGGYLAPNPSVIIPHLAKAIQELLQE 346 + + G + IN ++D YL N ++ +AIQ+L E Sbjct: 354 SPYL-PKKHVDNGVIAQQAQRINPNWIDKPPADHPDAYLGLNLQYLLMDAMRAIQQLSSE 412 Query: 347 VKELRDMIDKQNE 359 V EL+ +++ ++ Sbjct: 413 VDELKAELERNSK 425 >gi|254720065|ref|ZP_05181876.1| hypothetical protein Bru83_11101 [Brucella sp. 83/13] gi|265985070|ref|ZP_06097805.1| predicted protein [Brucella sp. 83/13] gi|306839383|ref|ZP_07472197.1| Hypothetical protein BROD_2239 [Brucella sp. NF 2653] gi|264663662|gb|EEZ33923.1| predicted protein [Brucella sp. 83/13] gi|306405506|gb|EFM61771.1| Hypothetical protein BROD_2239 [Brucella sp. NF 2653] Length = 381 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/62 (45%), Positives = 35/62 (56%), Gaps = 1/62 (1%) Query: 1 MPRIR-EVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARV 59 MPR VY LP +A I + +N L+D+A D N+ RPIS GGT +A QAR Sbjct: 1 MPRNGQGVYSLPPVYEAVTGETIEAQQHNVPLEDIASDLNNARPISTGGTGGQSATQARE 60 Query: 60 NL 61 NL Sbjct: 61 NL 62 >gi|225855800|ref|YP_002737311.1| hypothetical protein SPP_0076 [Streptococcus pneumoniae P1031] gi|225724622|gb|ACO20474.1| hypothetical protein SPP_0076 [Streptococcus pneumoniae P1031] Length = 140 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 34 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 88 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 89 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 139 >gi|15895154|ref|NP_348503.1| hypothetical protein CA_C1879 [Clostridium acetobutylicum ATCC 824] gi|15024858|gb|AAK79843.1|AE007696_5 Uncharacterized, phage-related protein [Clostridium acetobutylicum ATCC 824] gi|325509293|gb|ADZ20929.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018] Length = 1403 Score = 64.1 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 29/125 (23%), Positives = 55/125 (44%), Gaps = 10/125 (8%) Query: 235 LNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESG 294 ++ G Y+ G + + SD K +S+ +A +II + K F++ Sbjct: 1267 MHNGSNGAYMEVANVDHLYGINVWNSDISFKYAIADSTENALDIINRFKHRQFHWK---- 1322 Query: 295 MDSELRYLIGFSENNLKEINEVFVDTIGG-----YLAPNPSVIIPHLAKAIQELLQEVKE 349 + +G+ L IN V ++ + P +IIP+L+K+IQE +V++ Sbjct: 1323 -EDGRHQELGYVSQELYAINPALVLSVPQKNGTVIMQPRDDMIIPYLSKSIQEEDAKVEQ 1381 Query: 350 LRDMI 354 L+ I Sbjct: 1382 LKSKI 1386 >gi|308480796|ref|XP_003102604.1| hypothetical protein CRE_03188 [Caenorhabditis remanei] gi|308261038|gb|EFP04991.1| hypothetical protein CRE_03188 [Caenorhabditis remanei] Length = 567 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 5/122 (4%) Query: 244 IHFDTDKGAVGCSYFLSDERLKKVH-GESSASAREIIEQLKFIDFNYLPES----GMDSE 298 + D D + G SD RLK G+ + A E +++L+ +D+ Y PE G+ + Sbjct: 76 LTVDGDIYSSGRVMCPSDIRLKDNITGKEAKEALENLQKLRIVDYFYKPEVAEKWGLSED 135 Query: 299 LRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQN 358 R G L EI V IG YL N S + A QEL + +L ID + Sbjct: 136 QRKRTGVIAQELAEIIPDAVRDIGDYLTVNESRVFYETVLATQELCRLAGDLDQKIDDKV 195 Query: 359 EE 360 +E Sbjct: 196 DE 197 >gi|332072362|gb|EGI82845.1| pblB [Streptococcus pneumoniae GA17570] Length = 2147 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++S+ IG Sbjct: 2041 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IESKKHEEIGLIA 2095 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2096 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2146 >gi|183603147|ref|ZP_02712966.2| PblB [Streptococcus pneumoniae SP195] gi|183572670|gb|EDT93198.1| PblB [Streptococcus pneumoniae SP195] Length = 2135 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++S+ IG Sbjct: 2029 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IESKKHEEIGLIA 2083 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2084 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2134 >gi|321156986|emb|CBW38974.1| PblB-type protein [Streptococcus phage 040922] Length = 2102 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1996 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2050 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2051 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2101 >gi|321157044|emb|CBW39031.1| PblB-type protein [Streptococcus phage 34117] Length = 1633 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1527 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1581 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1582 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1632 >gi|332205033|gb|EGJ19096.1| pblB [Streptococcus pneumoniae GA47368] Length = 1137 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1031 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1085 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1086 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1136 >gi|172079550|ref|ZP_02708651.2| PblB [Streptococcus pneumoniae CDC1873-00] gi|172043007|gb|EDT51053.1| PblB [Streptococcus pneumoniae CDC1873-00] Length = 2654 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2548 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2602 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2603 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2653 >gi|149004162|ref|ZP_01828959.1| hypothetical protein CGSSp14BS69_05677 [Streptococcus pneumoniae SP14-BS69] gi|147757824|gb|EDK64835.1| hypothetical protein CGSSp14BS69_05677 [Streptococcus pneumoniae SP14-BS69] Length = 602 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 496 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 550 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 551 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 601 >gi|321157252|emb|CBW39235.1| PblB-type protein [Streptococcus phage 2167] Length = 1469 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1363 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1417 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1418 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1468 >gi|148995279|ref|ZP_01824069.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP9-BS68] gi|147926790|gb|EDK77848.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP9-BS68] Length = 1602 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++S+ IG Sbjct: 1496 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IESKKHEEIGLIA 1550 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1551 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1601 >gi|148990394|ref|ZP_01821566.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP6-BS73] gi|147924349|gb|EDK75441.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP6-BS73] Length = 1053 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 947 GSGSLKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1001 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1002 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1052 >gi|321156928|emb|CBW38917.1| PblB-type protein [Streptococcus phage V22] Length = 1468 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1362 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1416 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1417 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1467 >gi|149021849|ref|ZP_01835856.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP23-BS72] gi|147930085|gb|EDK81072.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP23-BS72] Length = 999 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 58/111 (52%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 893 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 947 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ +I Sbjct: 948 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKII 998 >gi|321157149|emb|CBW39134.1| PblB-type protein [Streptococcus phage 11865] Length = 1421 Score = 61.4 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1315 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1369 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1370 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1420 >gi|321157103|emb|CBW39089.1| PblB-type protein [Streptococcus phage 23782] Length = 1707 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1601 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1655 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1656 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1706 >gi|225855356|ref|YP_002736868.1| PblB [Streptococcus pneumoniae JJA] gi|225722631|gb|ACO18484.1| PblB [Streptococcus pneumoniae JJA] Length = 2108 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2002 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2056 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2057 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2107 >gi|321157197|emb|CBW39181.1| PblB-type protein [Streptococcus phage 8140] Length = 1179 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1073 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1127 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1128 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1178 >gi|148986194|ref|ZP_01819146.1| phage-related protein; possible prophage LambdaBa01, minor structural protein [Streptococcus pneumoniae SP3-BS71] gi|147921808|gb|EDK72936.1| phage-related protein; possible prophage LambdaBa01, minor structural protein [Streptococcus pneumoniae SP3-BS71] Length = 1203 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1097 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1151 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1152 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1202 >gi|303255532|ref|ZP_07341590.1| hypothetical protein CGSSpBS455_08580 [Streptococcus pneumoniae BS455] gi|302597510|gb|EFL64598.1| hypothetical protein CGSSpBS455_08580 [Streptococcus pneumoniae BS455] Length = 965 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 859 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 913 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 914 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 964 >gi|307128260|ref|YP_003880291.1| PblB [Streptococcus pneumoniae 670-6B] gi|306485322|gb|ADM92191.1| PblB [Streptococcus pneumoniae 670-6B] Length = 2426 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2320 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2374 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2375 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2425 >gi|169833508|ref|YP_001693486.1| hypothetical protein SPH_0062 [Streptococcus pneumoniae Hungary19A-6] gi|168996010|gb|ACA36622.1| PblB [Streptococcus pneumoniae Hungary19A-6] Length = 3038 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2932 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2986 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2987 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 3037 >gi|168494815|ref|ZP_02718958.1| PblB [Streptococcus pneumoniae CDC3059-06] gi|183575310|gb|EDT95838.1| PblB [Streptococcus pneumoniae CDC3059-06] Length = 2676 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2570 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2624 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2625 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2675 >gi|148996415|ref|ZP_01824133.1| phage infection protein [Streptococcus pneumoniae SP11-BS70] gi|147756990|gb|EDK64029.1| phage infection protein [Streptococcus pneumoniae SP11-BS70] Length = 1034 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 928 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 982 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 983 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1033 >gi|303261114|ref|ZP_07347063.1| PblB [Streptococcus pneumoniae SP14-BS292] gi|303263442|ref|ZP_07349365.1| PblB [Streptococcus pneumoniae BS397] gi|303267835|ref|ZP_07353637.1| PblB [Streptococcus pneumoniae BS458] gi|302637951|gb|EFL68437.1| PblB [Streptococcus pneumoniae SP14-BS292] gi|302642531|gb|EFL72876.1| PblB [Streptococcus pneumoniae BS458] gi|302647215|gb|EFL77439.1| PblB [Streptococcus pneumoniae BS397] Length = 2105 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1999 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2053 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2054 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2104 >gi|307066694|ref|YP_003875660.1| PblB, putative [Streptococcus pneumoniae AP200] gi|306408231|gb|ADM83658.1| PblB, putative [Streptococcus phage PhiSpn_200] Length = 3035 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2929 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2983 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2984 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 3034 >gi|225857876|ref|YP_002739386.1| PblB [Streptococcus pneumoniae 70585] gi|225722145|gb|ACO17999.1| PblB [Streptococcus pneumoniae 70585] Length = 2970 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2864 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2918 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2919 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2969 >gi|168494864|ref|ZP_02719007.1| PblB [Streptococcus pneumoniae CDC3059-06] gi|168495066|ref|ZP_02719209.1| PblB [Streptococcus pneumoniae CDC3059-06] gi|183575070|gb|EDT95598.1| PblB [Streptococcus pneumoniae CDC3059-06] gi|183575230|gb|EDT95758.1| PblB [Streptococcus pneumoniae CDC3059-06] Length = 1349 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1243 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1297 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1298 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1348 >gi|149011808|ref|ZP_01833004.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP19-BS75] gi|147764239|gb|EDK71171.1| prophage LambdaSa2, PblB, putative [Streptococcus pneumoniae SP19-BS75] Length = 1352 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1246 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1300 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1301 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1351 >gi|303259359|ref|ZP_07345336.1| PblB [Streptococcus pneumoniae SP-BS293] gi|303265734|ref|ZP_07351632.1| PblB [Streptococcus pneumoniae BS457] gi|302639293|gb|EFL69751.1| PblB [Streptococcus pneumoniae SP-BS293] gi|302644642|gb|EFL74891.1| PblB [Streptococcus pneumoniae BS457] Length = 1530 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1424 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1478 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1479 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1529 >gi|301799204|emb|CBW31717.1| pblB [Streptococcus pneumoniae OXC141] Length = 2101 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1995 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2049 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2050 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2100 >gi|307126242|ref|YP_003878273.1| PblB [Streptococcus pneumoniae 670-6B] gi|306483304|gb|ADM90173.1| PblB [Streptococcus pneumoniae 670-6B] Length = 2699 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2593 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2647 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2648 QEAETIVPKIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 2698 >gi|168490166|ref|ZP_02714365.1| PblB [Streptococcus pneumoniae SP195] gi|183571459|gb|EDT91987.1| PblB [Streptococcus pneumoniae SP195] Length = 3023 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 2917 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 2971 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 2972 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 3022 >gi|332201943|gb|EGJ16012.1| phage minor structural , N-terminal region domain protein [Streptococcus pneumoniae GA41317] Length = 1924 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 12/111 (10%) Query: 251 GAVGCSYFL---SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 G+ Y++ SD RLK+ +++ A + I +L+ + F++ ++++ IG Sbjct: 1818 GSGSVKYWMEQKSDRRLKENITDTAVKALDKINRLRMVAFDF-----IENKKHEEIGLIA 1872 Query: 308 NNLKEINEVFV----DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + I V + GYL + + ++P+L KAIQEL Q+++++ I Sbjct: 1873 QEAETIVPRIVSRDPENPDGYLHIDYTALVPYLIKAIQELNQKIEKMEKTI 1923 >gi|308187194|ref|YP_003931325.1| hypothetical protein Pvag_1688 [Pantoea vagans C9-1] gi|308057704|gb|ADO09876.1| hypothetical protein Pvag_1688 [Pantoea vagans C9-1] Length = 468 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 65/163 (39%), Gaps = 11/163 (6%) Query: 81 FYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNH 140 F +QQGGG G L NK+Y+GWNGT+++ QVD + + +++ P V + Sbjct: 126 FASVQQGGGDGMLANKVYMGWNGTKMIAQVDATRLGDLYYGNNPPPY---PVTSVNTKTG 182 Query: 141 IVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNSQRIAAFLDD---GD 197 V S+ A + + LH + + ++ +D G+ Sbjct: 183 AVTLDKSDVGLAN-----VGNWPAVQANGGLHSSGNHRYYMDWGTDGKLHITVDSSDVGE 237 Query: 198 IMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRG 240 + + Q +Q+ N+ ES+ + T N G+ Sbjct: 238 LFTTQNPPNAAQTGAYSQYGGNLNEESSVTVISNTKNGNSGQS 280 >gi|32469426|ref|NP_862890.1| PblB [Streptococcus phage SM1] gi|10732859|gb|AAG18640.1| PblB [Streptococcus phage SM1] Length = 1062 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 40/179 (22%), Positives = 73/179 (40%), Gaps = 21/179 (11%) Query: 178 NGAQFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNM 237 N A F+NN + GDI + ++ N A+ +Y M Sbjct: 899 NTAHFWNNPKIHGDLEVTGDI--------------VYRYENNGAIYLKGYWIYSPQYKRM 944 Query: 238 GRGPGYIHFDTDKGA---VGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESG 294 Y++ D + + + +SD R K +S S +IIEQLK + + Sbjct: 945 EESNNYLYLYRDGSSYSWIPMNKEISDRRYKSNIKDSQVSGLDIIEQLKTYSYRKEYDGK 1004 Query: 295 MDSELRYLIGFSENNLKE-INEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRD 352 ++ G ++++ + E F + G + ++P+L KAIQEL Q++++L Sbjct: 1005 IED---ISCGIMAQDVQKYVPEAFFENPDGAYSYRTFELVPYLIKAIQELNQKIQKLEK 1060 >gi|238821363|ref|YP_002925179.1| hypothetical protein PH10_gp46 [Streptococcus phage PH10] gi|238804945|emb|CAY56539.1| hypothetical protein [Streptococcus phage PH10] Length = 1137 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 44/103 (42%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVD 319 SD RLK + A + I Q +F ++++ + + IG ++E++ V Sbjct: 1033 SDRRLKDNIVDCKHKALDYIHQFQFKEYDWKKQEDRPQQAHTKIGLIAQEVQEVDPTLVY 1092 Query: 320 TIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQ 362 G L + + KAIQEL E K+L ++ E + Sbjct: 1093 KNGDTLNLDNLRLTNIALKAIQELALENKKLTQRLENLENERR 1135 >gi|299068824|emb|CBJ40065.1| putative side tail fiber protein (Fragment) [Ralstonia solanacearum CMR15] Length = 359 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/49 (44%), Positives = 29/49 (59%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRA 127 L F P+QQGGG GQ +NKI++GW+G +L QVD S + S Sbjct: 57 LGFAPVQQGGGAGQFNNKIFIGWSGGKLKGQVDQSDLGSFVFSDSPTLT 105 >gi|291335566|gb|ADD95176.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C507] Length = 361 Score = 58.7 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 41/176 (23%), Positives = 70/176 (39%), Gaps = 25/176 (14%) Query: 44 ISAGGTAADNAVQARVNLGTDDASNLTKGKII----------ANLLPFYPIQQGGGIGQL 93 +S GGT A +A AR +LG S++ I AN LPF+ L Sbjct: 141 VSDGGTGATSASAARTSLGVAIGSDVQAHNAILDDLAGLTQSANKLPFFSSGNAADTTDL 200 Query: 94 D--NKIYLGWNGTQLLLQVDLSSMR---EVWTSQLAPRALQNLVDHAHKPNHIVYTTDSN 148 + L +++ + + + LA ++ L A+K ++YTT S+ Sbjct: 201 TAFARTLLDDANASAARTTLGTAIGSDVQAYDAGLA--SIAGLTTAANK---LIYTTGSD 255 Query: 149 QYAATPLSSFMRKLFSCHNGETLHQA-----IFQNGAQFYNNSQRIAAFLDDGDIM 199 YA L++F R + + T+ AQ N + +AA + DG+ + Sbjct: 256 TYAVADLTAFGRSILDDADAATVRATLGLVIGTDVQAQDSNLAGVVAAGVTDGNFL 311 >gi|291335641|gb|ADD95248.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C846] Length = 499 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/176 (23%), Positives = 70/176 (39%), Gaps = 25/176 (14%) Query: 44 ISAGGTAADNAVQARVNLGTDDASNLTKGKII----------ANLLPFYPIQQGGGIGQL 93 +S GGT A +A AR +LG S++ I AN LPF+ L Sbjct: 279 VSDGGTGATSASAARTSLGVAIGSDVQAHNAILDDLAGLTQSANKLPFFSSGNAADTTDL 338 Query: 94 D--NKIYLGWNGTQLLLQVDLSSMR---EVWTSQLAPRALQNLVDHAHKPNHIVYTTDSN 148 + L +++ + + + LA ++ L A+K ++YTT S+ Sbjct: 339 TAFARTLLDDANASAARTTLGTAIGSDVQAYDAGLA--SIAGLTTAANK---LIYTTGSD 393 Query: 149 QYAATPLSSFMRKLFSCHNGETLHQA-----IFQNGAQFYNNSQRIAAFLDDGDIM 199 YA L++F R + + T+ AQ N + +AA + DG+ + Sbjct: 394 TYAVADLTAFGRSILDDADAATVRATLGLVIGTDVQAQDSNLAGVVAAGVTDGNFL 449 >gi|291336087|gb|ADD95673.1| hypothetical protein [uncultured phage MedDCM-OCT-S11-C561] Length = 604 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/176 (23%), Positives = 70/176 (39%), Gaps = 25/176 (14%) Query: 44 ISAGGTAADNAVQARVNLGTDDASNLTKGKII----------ANLLPFYPIQQGGGIGQL 93 +S GGT A +A AR +LG S++ I AN LPF+ L Sbjct: 384 VSDGGTGATSASAARTSLGVAIGSDVQAHNAILDDLAGLTQSANKLPFFSSGNAADTTDL 443 Query: 94 D--NKIYLGWNGTQLLLQVDLSSMR---EVWTSQLAPRALQNLVDHAHKPNHIVYTTDSN 148 + L +++ + + + LA ++ L A+K ++YTT S+ Sbjct: 444 TAFARTLLDDANASAARTTLGTAIGSDVQAYDAGLA--SIAGLTTAANK---LIYTTGSD 498 Query: 149 QYAATPLSSFMRKLFSCHNGETLHQA-----IFQNGAQFYNNSQRIAAFLDDGDIM 199 YA L++F R + + T+ AQ N + +AA + DG+ + Sbjct: 499 TYAVADLTAFGRSILDDADAATVRATLGLVIGTDVQAQDSNLAGVVAAGVTDGNFL 554 >gi|116326465|ref|YP_803185.1| large distal tail fiber subunit [Enterobacteria phage RB32] gi|115344058|gb|ABI95067.1| large distal tail fiber subunit [Enterobacteria phage RB32] Length = 1027 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/131 (27%), Positives = 58/131 (44%), Gaps = 13/131 (9%) Query: 234 VLNMGRGPGYIHFDTDK---GAVGCS--YFLSDERLKKVHGESSASAREIIEQLKFIDFN 288 VL++G F+ D AV C+ Y SD RLK + +A + + +LK ++ Sbjct: 880 VLHVGGSDYAFAFNGDFTAGAAVYCNDVYIRSDRRLKINVEDYEENAVDKVNKLKVKTYD 939 Query: 289 YLPESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVII--------PHLAKAI 340 + + + IG +L+E+ V T NP I+ L KAI Sbjct: 940 KVKSLSDREVIGHEIGIIAQDLQEVLPEAVSTSSVGSQDNPEEILTISNSAVNALLIKAI 999 Query: 341 QELLQEVKELR 351 QE+ +E+KEL+ Sbjct: 1000 QEMSEEIKELK 1010 >gi|268578415|ref|XP_002644190.1| Hypothetical protein CBG17173 [Caenorhabditis briggsae] gi|187025955|emb|CAP34948.1| hypothetical protein CBG_17173 [Caenorhabditis briggsae AF16] Length = 863 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/122 (27%), Positives = 52/122 (42%), Gaps = 5/122 (4%) Query: 244 IHFDTDKGAVGCSYFLSDERLKKVH-GESSASAREIIEQLKFIDFNYLPES----GMDSE 298 + D D + G SD RLK + + A E +++L+ +D+ Y PE G+ + Sbjct: 400 LTVDGDIYSSGRVMCPSDIRLKDNITEKEAKEALENLQKLRIVDYFYKPEVADKWGLSED 459 Query: 299 LRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQN 358 R G L E+ V +G YL N S + A QEL + +L ID + Sbjct: 460 QRKRTGVIAQELAEVIPDAVKDLGDYLTVNESRVFYETVLATQELCRLAGDLDQKIDDKV 519 Query: 359 EE 360 E Sbjct: 520 AE 521 >gi|325508527|gb|ADZ20163.1| hypothetical protein CEA_G1125 [Clostridium acetobutylicum EA 2018] Length = 1085 Score = 56.4 bits (134), Expect = 6e-06, Method: Composition-based stats. Identities = 29/123 (23%), Positives = 54/123 (43%), Gaps = 16/123 (13%) Query: 254 GCSYFL-----SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSEN 308 G +Y+ SDERLK + + II Q+ +F++ + + +G Sbjct: 965 GTTYWAYNFNASDERLKTNIKPITTNCSSIINQINLKEFDF-----IKTGEHIEVGTIAQ 1019 Query: 309 NLKEINEVFVDTI------GGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQ 362 L +IN+ F + + AP+ + I+P++ AIQEL + K L + Q Sbjct: 1020 QLSKINKKFSKILYTTEDGTNFFAPDYNSILPYIIGAIQELSAKSKSLETRVTTLENSIQ 1079 Query: 363 DVQ 365 +++ Sbjct: 1080 ELK 1082 >gi|25152212|ref|NP_741883.1| hypothetical protein F21A10.2 [Caenorhabditis elegans] gi|22265847|emb|CAD44121.1| C. elegans protein F21A10.2b, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 947 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 5/132 (3%) Query: 234 VLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVH-GESSASAREIIEQLKFIDFNYLPE 292 V+ + D D + G + SD RLK + + A E +++L+ +D+ Y PE Sbjct: 472 VIGKSEPRAQLTVDGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPE 531 Query: 293 S----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVK 348 G+ + R G L + V +G YL N S + A QEL + Sbjct: 532 VASKWGLTEDQRKRTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTG 591 Query: 349 ELRDMIDKQNEE 360 +L ID + E Sbjct: 592 DLDQKIDDKVAE 603 >gi|25152210|ref|NP_741884.1| hypothetical protein F21A10.2 [Caenorhabditis elegans] gi|5824465|emb|CAA16508.2| C. elegans protein F21A10.2a, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 898 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 5/132 (3%) Query: 234 VLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVH-GESSASAREIIEQLKFIDFNYLPE 292 V+ + D D + G + SD RLK + + A E +++L+ +D+ Y PE Sbjct: 423 VIGKSEPRAQLTVDGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPE 482 Query: 293 S----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVK 348 G+ + R G L + V +G YL N S + A QEL + Sbjct: 483 VASKWGLTEDQRKRTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTG 542 Query: 349 ELRDMIDKQNEE 360 +L ID + E Sbjct: 543 DLDQKIDDKVAE 554 >gi|15894398|ref|NP_347747.1| hypothetical protein CA_C1113 [Clostridium acetobutylicum ATCC 824] gi|15024032|gb|AAK79087.1|AE007628_1 Hypothetical protein CA_C1113 [Clostridium acetobutylicum ATCC 824] Length = 491 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/123 (23%), Positives = 54/123 (43%), Gaps = 16/123 (13%) Query: 254 GCSYFL-----SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSEN 308 G +Y+ SDERLK + + II Q+ +F++ + + +G Sbjct: 371 GTTYWAYNFNASDERLKTNIKPITTNCSSIINQINLKEFDF-----IKTGEHIEVGTIAQ 425 Query: 309 NLKEINEVFVDTI------GGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQ 362 L +IN+ F + + AP+ + I+P++ AIQEL + K L + Q Sbjct: 426 QLSKINKKFSKILYTTEDGTNFFAPDYNSILPYIIGAIQELSAKSKSLETRVTTLENSIQ 485 Query: 363 DVQ 365 +++ Sbjct: 486 ELK 488 >gi|32566568|ref|NP_509709.2| hypothetical protein F21A10.2 [Caenorhabditis elegans] gi|22265848|emb|CAD44122.1| C. elegans protein F21A10.2c, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|22265862|emb|CAD44135.1| C. elegans protein F21A10.2c, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 949 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 5/132 (3%) Query: 234 VLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVH-GESSASAREIIEQLKFIDFNYLPE 292 V+ + D D + G + SD RLK + + A E +++L+ +D+ Y PE Sbjct: 474 VIGKSEPRAQLTVDGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPE 533 Query: 293 S----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVK 348 G+ + R G L + V +G YL N S + A QEL + Sbjct: 534 VASKWGLTEDQRKRTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTG 593 Query: 349 ELRDMIDKQNEE 360 +L ID + E Sbjct: 594 DLDQKIDDKVAE 605 >gi|302144754|emb|CBW44371.1| C. elegans protein F21A10.2d, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 944 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 5/132 (3%) Query: 234 VLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVH-GESSASAREIIEQLKFIDFNYLPE 292 V+ + D D + G + SD RLK + + A E +++L+ +D+ Y PE Sbjct: 469 VIGKSEPRAQLTVDGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPE 528 Query: 293 S----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVK 348 G+ + R G L + V +G YL N S + A QEL + Sbjct: 529 VASKWGLTEDQRKRTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTG 588 Query: 349 ELRDMIDKQNEE 360 +L ID + E Sbjct: 589 DLDQKIDDKVAE 600 >gi|138015|sp|P08231|VG37_BPM1 RecName: Full=Long tail fiber protein p37; Short=Protein Gp37; AltName: Full=Receptor-recognizing protein gi|15115|emb|CAA29160.1| unnamed protein product [Enterobacteria phage M1] Length = 251 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 11/116 (9%) Query: 240 GPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSEL 299 PG HF+ Y SD RLK E A E + +LK ++ + S + Sbjct: 126 SPGNGHFNDV-------YIRSDGRLKINKKELENGALEKVCRLKVYTYDKVKSIKDRSVI 178 Query: 300 RYLIGFSENNLKEINEVFVDTI----GGYLAPNPSVIIPHLAKAIQELLQEVKELR 351 + +G +L++ V + L + S + L KAIQE+ +E+KEL+ Sbjct: 179 KREVGIIAQDLEKELPEAVSKVEVDGSDVLTISNSAVNALLIKAIQEMSEEIKELK 234 >gi|302144755|emb|CBW44372.1| C. elegans protein F21A10.2e, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 1009 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 5/132 (3%) Query: 234 VLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVH-GESSASAREIIEQLKFIDFNYLPE 292 V+ + D D + G + SD RLK + + A E +++L+ +D+ Y PE Sbjct: 534 VIGKSEPRAQLTVDGDIYSSGRVMYPSDIRLKDNITEKGAKDALENLQKLRIVDYFYKPE 593 Query: 293 S----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVK 348 G+ + R G L + V +G YL N S + A QEL + Sbjct: 594 VASKWGLTEDQRKRTGVIAQELAAVLPDAVKDLGDYLTVNESRVFYETVLATQELCRLTG 653 Query: 349 ELRDMIDKQNEE 360 +L ID + E Sbjct: 654 DLDQKIDDKVAE 665 >gi|138016|sp|P08232|VG37_BPOX2 RecName: Full=Long tail fiber protein p37; Short=Protein Gp37; AltName: Full=Receptor-recognizing protein gi|15125|emb|CAA29157.1| unnamed protein product [Enterobacteria phage Ox2] Length = 251 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 4/99 (4%) Query: 257 YFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEV 316 Y SD RLK E A E + +LK ++ + S ++ +G +L++ Sbjct: 136 YIRSDGRLKINKKELENGALEKVCRLKVYTYDKVKSIKDRSVIKREVGIIAQDLEKELPE 195 Query: 317 FVDTI----GGYLAPNPSVIIPHLAKAIQELLQEVKELR 351 V + L + S + L KAIQE+ +E+KEL+ Sbjct: 196 AVSKVEVDGSDVLTISNSAVNALLIKAIQEMSEEIKELK 234 >gi|301381668|ref|ZP_07230086.1| tail fiber domain protein [Pseudomonas syringae pv. tomato Max13] Length = 425 Score = 53.7 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 68/329 (20%), Positives = 120/329 (36%), Gaps = 39/329 (11%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P++ GGT A + V A +LG + + + F +G +G +Y+GWN Sbjct: 110 PVAKGGTGATDGVLALTSLGMKGGA----YDALIKSVGF----RGAPVGYNVQGLYMGWN 161 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 G + + L A ++ ++ + S +S+ L Sbjct: 162 GNGDGGANYICNRG----GGLGGHAWWSVNSDNTAAGPVMTYSYSGVLTVKEVST---TL 214 Query: 163 FSCH--NGETLHQAIFQNGAQFYNN-SQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTAN 219 S + NG T A+ Q G + S R+A L G Q V+ G++IA ++ Sbjct: 215 VSTNQINGLTTPIALAQGGTGGKDQASARVALGLGAG-------QAPVFAGLDIAGRISS 267 Query: 220 IA--LESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG-CSYFLSDERLKKVHGESSASA- 275 + + TV N + D VG + F SD R+KK E + Sbjct: 268 YGNWCRTGFSGSKGGTVYNFNWTGNNVDVYIDNTYVGTMTLFTSDYRIKKFIKELKVPSF 327 Query: 276 REIIEQLKFIDFNY-LPESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPN------ 328 + I+ + + + + + R G + +E+N + V + N Sbjct: 328 LDRIDAYRLVTYERKIFGDVFRGDGRVYQGLIAHEAQEVNPLAVTGEKDGVDENGNPRIQ 387 Query: 329 ---PSVIIPHLAKAIQELLQEVKELRDMI 354 P +I L A++EL E+ L+ I Sbjct: 388 QLDPMALITDLMGAVKELRAELAALKASI 416 >gi|213970547|ref|ZP_03398674.1| tail fiber domain protein [Pseudomonas syringae pv. tomato T1] gi|213924718|gb|EEB58286.1| tail fiber domain protein [Pseudomonas syringae pv. tomato T1] Length = 425 Score = 53.3 bits (126), Expect = 5e-05, Method: Composition-based stats. Identities = 68/329 (20%), Positives = 120/329 (36%), Gaps = 39/329 (11%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P++ GGT A + V A +LG + + + F +G +G +Y+GWN Sbjct: 110 PVAKGGTGATDGVLALTSLGMKGGA----YDALIKSVGF----RGAPVGYNVQGLYMGWN 161 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 G + + L A ++ ++ + S +S+ L Sbjct: 162 GNGDGGANYICNRG----GGLGGHAWWSVNSDNTAAGPVMTYSYSGVLTVKEVST---TL 214 Query: 163 FSCH--NGETLHQAIFQNGAQFYNN-SQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTAN 219 S + NG T A+ Q G + S R+A L G Q V+ G++IA ++ Sbjct: 215 VSTNQINGLTTPIALAQGGTGGKDQASARVALGLGAG-------QAPVFAGLDIAGRISS 267 Query: 220 IA--LESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG-CSYFLSDERLKKVHGESSASA- 275 + + TV N + D VG + F SD R+KK E + Sbjct: 268 YGNWCRTGFSGSKGGTVYNFNWTGNNVDVYIDNTYVGTMTLFTSDYRIKKFIKELKVPSF 327 Query: 276 REIIEQLKFIDFNY-LPESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPN------ 328 + I+ + + + + + R G + +E+N + V + N Sbjct: 328 LDRIDAYRLVTYERKIFGDVFRGDGRVYQGLIAHEAQEVNPLAVTGEKDGVDENGNARIQ 387 Query: 329 ---PSVIIPHLAKAIQELLQEVKELRDMI 354 P +I L A++EL E+ L+ I Sbjct: 388 QLDPMALITDLMGAVKELRAELAALKASI 416 >gi|291545443|emb|CBL18551.1| hypothetical protein CK1_01850 [Ruminococcus sp. SR1/5] Length = 412 Score = 53.3 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 44/111 (39%), Gaps = 10/111 (9%) Query: 259 LSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFV 318 +SD RLK+ S A E + ++K F++ G IGF + L+EI+ Sbjct: 305 VSDIRLKENIENSETDALETVNRMKVRQFDWKERMG---GWHQNIGFVADELEEIDPNLA 361 Query: 319 DTIG-------GYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQ 362 G N ++ + KAIQEL +V E I + Q Sbjct: 362 LGGGYDENGEMDIKQINSPYLLNYAIKAIQELSAKVDEQEKHIKELERRLQ 412 >gi|71991056|ref|NP_496262.2| Prion-like-(Q/N-rich)-domain-bearing protein family member (pqn-47) [Caenorhabditis elegans] gi|50507473|emb|CAA88990.3| C. elegans protein F59B10.1, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|50507495|emb|CAA88602.3| C. elegans protein F59B10.1, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 931 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 5/106 (4%) Query: 260 SDERLKK-VHGESSASAREIIEQLKFIDFNYLPES----GMDSELRYLIGFSENNLKEIN 314 SD RLK+ + +A A E + +L+ +D+ Y PE G+D + R+ G L+ + Sbjct: 483 SDIRLKEAITERETAEAIENLLKLRVVDYRYKPEVADIWGLDEQQRHRTGLIAQELQAVL 542 Query: 315 EVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 V IG YL + + A Q+L + +L ID++ E Sbjct: 543 PDAVRDIGDYLTIDEGRVFYETVMATQQLCRMTGDLDSKIDEKVAE 588 >gi|312088535|ref|XP_003145899.1| hypothetical protein LOAG_10325 [Loa loa] gi|307758937|gb|EFO18171.1| hypothetical protein LOAG_10325 [Loa loa] Length = 873 Score = 52.9 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/151 (21%), Positives = 59/151 (39%), Gaps = 12/151 (7%) Query: 224 STKNCLYKTTVLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKK-VHGESSASAREIIEQL 282 ST + + R + D G + SD R+K+ +H + SA + Q+ Sbjct: 338 STLYYNSGSVAIGTDRAVAPLTVGGDIYCSGVVHRPSDRRMKEQIHEVDTKSALSHLAQI 397 Query: 283 KFIDFNYLPES----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVII----- 333 + + ++Y PE G+ E R+ +G L EI V G +L + S I Sbjct: 398 RVVGYSYKPEIALKWGLSEENRHRVGVIAQELAEILPDAVTDNGDFLQVDDSRIFYETVA 457 Query: 334 --PHLAKAIQELLQEVKELRDMIDKQNEEHQ 362 L + L +++ + + K + HQ Sbjct: 458 AATELCRLTGNLEHKIEAVEKLSHKLAKLHQ 488 >gi|324500951|gb|ADY40431.1| Myelin gene regulatory factor [Ascaris suum] Length = 992 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 57/143 (39%), Gaps = 12/143 (8%) Query: 232 TTVLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKK-VHGESSASAREIIEQLKFIDFNYL 290 + + M R + D G + SD R+K+ +H + A + Q++ + ++Y Sbjct: 490 SVAIGMDRAMAPLTVGGDIYCSGVVHRPSDRRVKEEIHEVDTKDAMSRLAQIRVVGYSYK 549 Query: 291 PES----GMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQE 346 PE G+ E R+ +G L EI V G +L + S I E + Sbjct: 550 PEIALQWGLSEENRHRVGVIAQELAEILPDAVTDNGDFLQVDDSRIF-------YETVAA 602 Query: 347 VKELRDMIDKQNEEHQDVQDMSN 369 EL + + + V+ +SN Sbjct: 603 ATELCRLTGNLEHKIEAVEKLSN 625 >gi|228861560|ref|YP_002854581.1| gp37 large distal tail fiber subunit [Enterobacteria phage RB14] gi|227438576|gb|ACP30889.1| gp37 large distal tail fiber subunit [Enterobacteria phage RB14] Length = 981 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 4/105 (3%) Query: 251 GAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNL 310 G+ Y SD RLK E A E + +LK ++ + S ++ +G +L Sbjct: 860 GSFSDVYIRSDSRLKINKEELEYGAVEKVCRLKVYIYDKVKSIKDRSVIKREVGIIAQDL 919 Query: 311 KEINEVFVDTI----GGYLAPNPSVIIPHLAKAIQELLQEVKELR 351 ++ V + L + S + L KAIQE+ +E+KEL+ Sbjct: 920 EKELPEAVSKVEVDGSDVLTISNSAVNALLIKAIQEMSEEIKELK 964 >gi|163644213|dbj|BAF95750.1| tail fiber protein [Enterobacteria phage KEP10] Length = 982 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/105 (26%), Positives = 47/105 (44%), Gaps = 4/105 (3%) Query: 251 GAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNL 310 G+ Y SD RLK E A E + +LK ++ + S ++ +G +L Sbjct: 861 GSFSDVYIRSDSRLKINKEELEYGAVEKVCRLKVYIYDKVKSIKDRSVIKREVGIIAQDL 920 Query: 311 KEINEVFVDTI----GGYLAPNPSVIIPHLAKAIQELLQEVKELR 351 ++ V + L + S + L KAIQE+ +E+KEL+ Sbjct: 921 EKELPEAVSKVEVDGSDVLTISNSAVNALLIKAIQEMSEEIKELK 965 >gi|289552035|gb|ADD10651.1| gp37 [Enterobacteria phage IP008] Length = 854 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 4/99 (4%) Query: 257 YFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEV 316 Y SD RLK E A E + +LK ++ + S ++ +G +L++ Sbjct: 739 YIRSDGRLKINKKELENGALEKVCRLKVYIYDKVKSIKDRSVIKREVGIIAQDLEKELPE 798 Query: 317 FVDTI----GGYLAPNPSVIIPHLAKAIQELLQEVKELR 351 V + L + S + L KAIQE+ +E+KEL+ Sbjct: 799 AVSKVEVDGSDVLTISNSAVNALLIKAIQEMSEEIKELK 837 >gi|167041094|gb|ABZ05855.1| hypothetical protein ALOHA_HF400048F7ctg1g22 [uncultured marine microorganism HF4000_48F7] Length = 665 Score = 51.0 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 47/133 (35%), Gaps = 52/133 (39%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 PIS GGT +A +AR LG S++ Sbjct: 312 PISLGGTGGTSASEARTLLGVTLGSDV--------------------------------- 338 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 + + + L +++ L A+K ++YTT S+ YA T L++F R + Sbjct: 339 --------------QAYDAGL--QSISGLTTAANK---MIYTTGSDTYAVTDLTAFARTI 379 Query: 163 FSCHNGETLHQAI 175 + T+ + Sbjct: 380 LDDSDAATVRTTL 392 >gi|330974326|gb|EGH74392.1| tail fiber domain-containing protein [Pseudomonas syringae pv. aceris str. M302273PT] Length = 425 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 63/328 (19%), Positives = 108/328 (32%), Gaps = 43/328 (13%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P+ GGT A + A +LG + + + F +G +G +Y+GWN Sbjct: 110 PVLKGGTGATDGASALTSLGMKGGA----YDALIKSIGF----RGAPVGYNVQGLYMGWN 161 Query: 103 GTQ-----LLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSS 157 G + W S + V + S +T + Sbjct: 162 GNGDGGANYICNRGAGLGGHAWWSVNSDNTAAGPVMTYSYGGLLTVQQVSTTLVSTNQIN 221 Query: 158 FMRKLFSCHNGETLHQAIFQNGAQFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHT 217 + + G T ++ A + R A L G Q V+ G++IA Sbjct: 222 GLTTPITLAQGGT----GGKDQA-----TARNALGLGAG-------QAPVFAGLDIAGRA 265 Query: 218 ANIALESTKNC--LYKTTVLNMGRGPGYIHFDTDKGAVG-CSYFLSDERLKKVHGESSA- 273 ++ + TV N + D VG + F SD R+KK E Sbjct: 266 SSYGIWCRTGFSGSKGGTVYNFNWTGNNVDVYIDSTYVGTMTLFTSDYRIKKFIKELKVP 325 Query: 274 SAREIIEQLKFIDFNYLPESGM-DSELRYLIGFSENNLKEINEVFVDTIGGYLAPN---- 328 S + I+ + + + + + R G + +E+N + V + N Sbjct: 326 SYLDRIDAYRLVTYERKVFGDVFRGDGRVYQGLIAHEAQEVNPLAVTGEKDGVDENGNAR 385 Query: 329 -----PSVIIPHLAKAIQELLQEVKELR 351 P +I L A++EL EV L+ Sbjct: 386 IQQLDPMALITDLMGAVKELRAEVIALK 413 >gi|308509976|ref|XP_003117171.1| CRE-PQN-47 protein [Caenorhabditis remanei] gi|308242085|gb|EFO86037.1| CRE-PQN-47 protein [Caenorhabditis remanei] Length = 932 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 5/106 (4%) Query: 260 SDERLKK-VHGESSASAREIIEQLKFIDFNYLPES----GMDSELRYLIGFSENNLKEIN 314 SD RLK+ + + +A A E + +L+ +D+ Y E G+D + R G L+ + Sbjct: 474 SDIRLKEGISEKETAEAIENLLKLRVVDYRYKSEVADVWGLDEQQRQRTGLIAQELQAVL 533 Query: 315 EVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 V IG YL + + A Q+L + +L + ID++ E Sbjct: 534 PDAVRDIGDYLTIDEGRVFYETVMATQQLCRMTGDLDNKIDEKVAE 579 >gi|28869296|ref|NP_791915.1| tail fiber domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|28871181|ref|NP_793800.1| tail fiber domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|28852537|gb|AAO55610.1| tail fiber domain protein [Pseudomonas syringae pv. tomato str. DC3000] gi|28854431|gb|AAO57495.1| tail fiber domain protein [Pseudomonas syringae pv. tomato str. DC3000] Length = 475 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 73/370 (19%), Positives = 117/370 (31%), Gaps = 65/370 (17%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 PI+ GGT A + A VNLG +N ++ + F +G +G +Y+GWN Sbjct: 110 PIAKGGTGATDGATALVNLGLRGGAN----DLLVKSIGF----RGTPVGYNIQGLYMGWN 161 Query: 103 GTQ-----LLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSS 157 G + + W S + V + S + +SS Sbjct: 162 GNGDGGANYICNRGGAGGGHSWWSVNSDNTAAGPVMTYSYAGVLSVAQLSVTASPIAISS 221 Query: 158 ---FMRKLFSCHNGETLHQAIFQNGAQFYNN--------SQRIAAFLDDGDIMCYK---- 202 L A +N S R L D+ ++ Sbjct: 222 GGTSANTAAGARTNLGLGSAAIENTVPITKGGTGGTDGPSARSGIGLGTTDVATFRALEL 281 Query: 203 RQKTVWQGIEIAQHTANIALESTKNCLYKTTV--------------LNMGRG-------- 240 T W A+ + + T+ LN RG Sbjct: 282 THTTPWIDFHFNNTAADYDVRLINDSAGTLTLDGRLASKGTWCRTGLNGSRGANIYNLNW 341 Query: 241 ----PGYIHFDTDKGAVG-CSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGM 295 GY+ D VG + SD R+K+ E SA E + + + F + Sbjct: 342 NTSNSGYVDVYIDASYVGALTLLQSDYRVKRQVEEFSAPFLERVNAYRIVTFKRAAYGEV 401 Query: 296 DSELRYLI-GFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLAK---------AIQELLQ 345 + LI G + ++EIN + + N ++ I L AI+EL Sbjct: 402 FKDGENLIQGLIAHEVQEINPLAATGKKDDVDANGNIWIQQLDSIALITDAFGAIKELSV 461 Query: 346 EVKELRDMID 355 +VK+LR +D Sbjct: 462 QVKKLRAELD 471 >gi|309364934|emb|CAP23547.2| CBR-PQN-47 protein [Caenorhabditis briggsae AF16] Length = 925 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 5/106 (4%) Query: 260 SDERLKK-VHGESSASAREIIEQLKFIDFNYLPES----GMDSELRYLIGFSENNLKEIN 314 SD RLK+ + + +A A E + +L+ +D+ Y E G+D + R G L+ + Sbjct: 464 SDIRLKEGISEKETAEAIENLLKLRVVDYRYKDEVANVWGLDEQQRQRTGLIAQELQAVL 523 Query: 315 EVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 V IG YL + + A Q+L + +L + ID++ E Sbjct: 524 PDAVRDIGDYLTIDEGRVFYETVMATQQLCRMTGDLDNKIDEKVAE 569 >gi|268532300|ref|XP_002631278.1| C. briggsae CBR-PQN-47 protein [Caenorhabditis briggsae] Length = 918 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 5/106 (4%) Query: 260 SDERLKK-VHGESSASAREIIEQLKFIDFNYLPES----GMDSELRYLIGFSENNLKEIN 314 SD RLK+ + + +A A E + +L+ +D+ Y E G+D + R G L+ + Sbjct: 474 SDIRLKEGISEKETAEAIENLLKLRVVDYRYKDEVANVWGLDEQQRQRTGLIAQELQAVL 533 Query: 315 EVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 V IG YL + + A Q+L + +L + ID++ E Sbjct: 534 PDAVRDIGDYLTIDEGRVFYETVMATQQLCRMTGDLDNKIDEKVAE 579 >gi|298486648|ref|ZP_07004706.1| Tail fiber domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298158863|gb|EFH99925.1| Tail fiber domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 425 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 64/329 (19%), Positives = 115/329 (34%), Gaps = 39/329 (11%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P++ GGT A + A +LG + + + F +G +G +Y+GWN Sbjct: 110 PVAKGGTGATDGASALTSLGMKGGA----YDALIKSVGF----RGAPVGYNVQGLYMGWN 161 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 G + + L A ++ ++ + + + +S+ L Sbjct: 162 GNGDGGANYICNRG----GGLGGHAWWSVNSDNTAAGPVMTYSYTGVLTVSQVST---TL 214 Query: 163 FSCH--NGETLHQAIFQNGAQFYNN-SQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTAN 219 S + NG T + Q G + + R A L G Q V+ G++I ++ Sbjct: 215 VSTNQINGLTTPITLAQGGTGGKDQATARNALGLGTG-------QAPVFAGLDIVGRVSS 267 Query: 220 IALESTKNCL--YKTTVLNMGRGPGYIHFDTDKGAVG-CSYFLSDERLKKVHGESSASA- 275 TV N + D VG + F SD R+KK E + Sbjct: 268 NGTWCRTGFTGSRGGTVYNFNWTGNNVDVYIDNTYVGTMTLFTSDYRIKKFIKELKVPSF 327 Query: 276 REIIEQLKFIDFNY-LPESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPN------ 328 + I+ + + + + + R G + +E+N + V + N Sbjct: 328 LDRIDAYRLVTYERKIFGDVFRGDGRVYQGLIAHEAQEVNPLAVTGEKDGVDENGNARIQ 387 Query: 329 ---PSVIIPHLAKAIQELLQEVKELRDMI 354 P +I L A++EL EV L+ I Sbjct: 388 QLDPMALITDLMGAVKELRAEVAALKASI 416 >gi|3676460|gb|AAC61976.1| large subunit distal tail fiber [Enterobacteria phage T6] Length = 993 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 4/105 (3%) Query: 251 GAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNL 310 G+ Y SD RL + A E + +LK ++ L S ++ +G +L Sbjct: 872 GSFSDVYIRSDSRLNINKQQLEYGAVEKVCRLKVYIYDKLKSIKDRSVIKREVGIIAQDL 931 Query: 311 KEINEVFVDTI----GGYLAPNPSVIIPHLAKAIQELLQEVKELR 351 ++ V + L + S + L KAIQE+ +E+KEL+ Sbjct: 932 EKELPEAVSKVEVDGSDVLTISNSAVNALLIKAIQEMSEEIKELK 976 >gi|307710000|ref|ZP_07646446.1| hypothetical protein SMSK564_1275 [Streptococcus mitis SK564] gi|307619258|gb|EFN98388.1| hypothetical protein SMSK564_1275 [Streptococcus mitis SK564] Length = 694 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 26/96 (27%), Positives = 46/96 (47%), Gaps = 4/96 (4%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINE-VFV 318 SD RLK ESS S ++I +LK + + ++ G ++++ F Sbjct: 601 SDRRLKSNIQESSVSGVDVINRLKTYSYRKEFNNEVED---ISCGIMAQDVQKYAPDAFR 657 Query: 319 DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 + G N ++P+L KAIQEL Q+++++ I Sbjct: 658 EGPDGVYTYNTFALVPYLIKAIQELNQKIEKMEKTI 693 >gi|315612415|ref|ZP_07887328.1| phage minor structural protein [Streptococcus sanguinis ATCC 49296] gi|315315396|gb|EFU63435.1| phage minor structural protein [Streptococcus sanguinis ATCC 49296] Length = 1036 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 44/224 (19%), Positives = 92/224 (41%), Gaps = 16/224 (7%) Query: 142 VYTTDSN--QYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNSQRIAAFL-DDGDI 198 + T D + Y T ++F S G +++ N +N A F+ + G++ Sbjct: 816 IGTHDKDPSTYWLTGQNNF-DVGLSSGKGRYNQTSLWVNWGDNWNKPGNKAWFVKNTGEM 874 Query: 199 MCYKR----QKTVWQG-IEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGA- 252 CY + + G ++++ + I S + +Y + GY++ + Sbjct: 875 HCYNTAHFWNEPIVHGDLQVSGNIKFINNGSLGHWIYSYNYKKIASNNGYLYLYDNYERY 934 Query: 253 --VGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNL 310 + + +SD R K S+ S +I LK + + ++ G ++ Sbjct: 935 DWIPMNKEISDRRYKHNIEASTVSGLAVINNLKTYSYRKEYDGKIED---IACGIMAQDV 991 Query: 311 KE-INEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDM 353 ++ + E F + G + ++P+L KAIQEL Q+V+ L+ + Sbjct: 992 QKYVPEAFYENPDGAYSYRTFELVPYLIKAIQELNQKVERLKKI 1035 >gi|291228384|ref|XP_002734150.1| PREDICTED: myelin gene regulatory factor-like [Saccoglossus kowalevskii] Length = 900 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 42/107 (39%), Gaps = 5/107 (4%) Query: 235 LNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVHGESS-ASAREIIEQLKFIDFNYLPE- 292 +N R + D G SDER+K+ + + +L+ + ++Y+PE Sbjct: 498 INTDRPEESLVVHGDIKVTGQVLQPSDERVKENIEDVDTKEQLRKVAKLRLVSYDYIPEY 557 Query: 293 ---SGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHL 336 SGM + G +L+++ V G + PN I L Sbjct: 558 IAHSGMSEQNSKTTGVLAQDLRDVMPDAVKESGDVILPNGHRIDKFL 604 >gi|167647462|ref|YP_001685125.1| hypothetical protein Caul_3500 [Caulobacter sp. K31] gi|167349892|gb|ABZ72627.1| hypothetical protein Caul_3500 [Caulobacter sp. K31] Length = 899 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 27/99 (27%), Positives = 39/99 (39%), Gaps = 15/99 (15%) Query: 33 DLALDN-NSPRP-ISAGGTAADNAVQARVNLGTDDASNLTKGKIIA----------NLLP 80 D D + +P ++ G + QA GT + N T N L Sbjct: 296 DFVNDAYTAAQPFLTVGRSGTTP--QAINLFGTSLSFNSTAITTATTLASGLATKQNTLG 353 Query: 81 FYPIQQGGGIGQLDNKIYLGWNGTQ-LLLQVDLSSMREV 118 F P+QQG G+GQ N I LGW+ L +D + + Sbjct: 354 FTPVQQGTGVGQTTNVIKLGWSNEGKLKATIDATDQGAI 392 >gi|92119284|ref|YP_579013.1| complement C1q protein [Nitrobacter hamburgensis X14] gi|91802178|gb|ABE64553.1| Complement C1q protein [Nitrobacter hamburgensis X14] Length = 781 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/133 (21%), Positives = 55/133 (41%), Gaps = 7/133 (5%) Query: 240 GPGY-IHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSE 298 P Y +H + VG LSD R KK + IE+L+ + F+++ + Sbjct: 639 APSYMLHVNGSVAGVGAYNALSDRRFKKNIHPAD-YGLAAIEKLRPVTFDWISPTSPQLH 697 Query: 299 LRYLIGFSENNLKEINEVFVDTIGG---YLAPNPSVIIPHLAKAIQELLQEVKELRDMID 355 R +G ++ + V ++ S ++P L KA+QEL + LR + Sbjct: 698 NR-QLGLIAQEVQPLVPEAVSVANDPSHTMSIAYSTLVPVLIKAVQELKADNDNLRAELR 756 Query: 356 KQNE-EHQDVQDM 367 + +H ++ + Sbjct: 757 TVRDTDHAAIESL 769 >gi|167383335|ref|XP_001736494.1| hypothetical protein [Entamoeba dispar SAW760] gi|165901104|gb|EDR27264.1| hypothetical protein EDI_344700 [Entamoeba dispar SAW760] Length = 637 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 27/114 (23%), Positives = 51/114 (44%), Gaps = 11/114 (9%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVD 319 SD+R K + ++ ++I QL+ + F Y GF LK++ V Sbjct: 184 SDQRTKTDIANLN-NSLDLIMQLRGVTFKY------KGTEERKYGFIAQELKQVIPDLVR 236 Query: 320 TIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQDVQDMSNTPLN 373 L + I+P L ++++EL Q V+E + QN E ++Q+ + ++ Sbjct: 237 EDEKGLYIDTQGILPILVESLKELNQNVEEYK----NQNNEILELQERVDKAMD 286 >gi|9634043|ref|NP_052117.1| tail fiber protein [Yersinia phage phiYeO3-12] gi|6599034|emb|CAB63638.1| tail fiber protein [Yersinia phage phiYeO3-12] Length = 645 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 34/125 (27%), Positives = 53/125 (42%), Gaps = 11/125 (8%) Query: 239 RGPGYIHFDTDKGAVGCSYFLSDERLKKVHG--ESSASAREIIE-----QLKFIDFNYLP 291 RG Y+ D AVG +F SD RLK+ S+ II K+ D +Y Sbjct: 518 RGGYYLEGRVDGTAVGFRWFQSDRRLKEDIKVVRSADDMLNIIRSYIPVSYKYKDASYTD 577 Query: 292 ESGMDSEL---RYLIGFSENNLKEINEVFVDTIGGYL-APNPSVIIPHLAKAIQELLQEV 347 G + + R GF +L + VD + + +P+P+ II L ++ L + Sbjct: 578 NRGRTNTIEGKRSRAGFITQDLIRLWPEAVDVMSDGMQSPDPNQIIGGLMLLVKNLDARI 637 Query: 348 KELRD 352 +EL Sbjct: 638 QELEK 642 >gi|326431896|gb|EGD77466.1| hypothetical protein PTSG_08561 [Salpingoeca sp. ATCC 50818] Length = 1363 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 23/119 (19%), Positives = 50/119 (42%), Gaps = 16/119 (13%) Query: 260 SDERLKKV-HGESSASAREIIEQLKFIDFN----YLPESGMDSELRYLIGFSENNLKEIN 314 SD+R+K+ H S+ + + ++ +++ + ++G S+ R+ +G L+++ Sbjct: 825 SDKRVKENIHAASTKRHLDNVNRMTLYEYDLKEEWARKAGRSSDDRHEMGVVAQELQQVL 884 Query: 315 EVFVDTIGG-----------YLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQ 362 V + G L N + A+QEL + + L I + E +Q Sbjct: 885 PDAVQSCGDVALDDGTEIDDLLVVNKDRVFMESVGAVQELSKMTQHLELRIKELEELNQ 943 >gi|67475526|ref|XP_653457.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|56470408|gb|EAL48071.1| hypothetical protein EHI_072090 [Entamoeba histolytica HM-1:IMSS] Length = 640 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/109 (26%), Positives = 53/109 (48%), Gaps = 15/109 (13%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFV- 318 SD+R K + ++ ++I QL+ + F Y + GF LK++ V Sbjct: 187 SDQRTKTDIANLN-NSLDLIMQLRGVTFKY------KGTEQRKYGFIAQELKQVLPDLVR 239 Query: 319 -DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQDVQD 366 DT G Y+ + I+P L +++++L Q V+E + QN E ++Q+ Sbjct: 240 EDTQGLYI--DTQGILPILVESLKQLNQNVEEYK----NQNNEILELQE 282 >gi|255036291|ref|YP_003086912.1| hypothetical protein Dfer_2529 [Dyadobacter fermentans DSM 18053] gi|254949047|gb|ACT93747.1| hypothetical protein Dfer_2529 [Dyadobacter fermentans DSM 18053] Length = 456 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 28/104 (26%), Positives = 52/104 (50%), Gaps = 5/104 (4%) Query: 257 YFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEV 316 Y SD+RLK SAS I QL+ + +L + + G ++++ Sbjct: 336 YLTSDKRLKTDFKNLSAS-LAKISQLQGYSYRWLDTTRTQT---LQTGLIAQEVEKLFPE 391 Query: 317 FVDTI-GGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNE 359 V+T GY + N + +IPHL +A++EL + ++ ++ ++ NE Sbjct: 392 LVNTDDNGYKSMNYNGLIPHLIEAVKELKGQTAQIAELRNELNE 435 >gi|326680883|ref|XP_002667695.2| PREDICTED: myelin gene regulatory factor-like, partial [Danio rerio] Length = 833 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 25/113 (22%), Positives = 49/113 (43%), Gaps = 7/113 (6%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E ++ + I Q++ ++++Y PE S M + + G ++E+ Sbjct: 493 SDRRAKQNIQEVDSTEQLKRIAQMRIVEYDYRPEFASRMGIDQCHETGIIAQEVRELLPS 552 Query: 317 FVDTIGGYLAPNPSVIIPHLAKAIQELLQE----VKELRDMIDKQNEEHQDVQ 365 V +G N I L +++ E VK+L + D Q+++ Sbjct: 553 AVREMGDITCINGETIDQFLMVDKEQIFMENVGAVKQLCKLTDNLESRIQELE 605 >gi|167377044|ref|XP_001734263.1| hypothetical protein [Entamoeba dispar SAW760] gi|165904344|gb|EDR29586.1| hypothetical protein EDI_210610 [Entamoeba dispar SAW760] Length = 585 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 47/212 (22%), Positives = 82/212 (38%), Gaps = 17/212 (8%) Query: 154 PLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEI 213 ++SF + L S + I Q S ++ ++ D++ + Sbjct: 107 KITSFEQTLSSAEQTNSQEVQIEMQQLQSPQQSNTLSLASNEWDVLINT--------VNT 158 Query: 214 AQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSA 273 +Q T + L S + N+ + +H + A + SDER K H + Sbjct: 159 SQSTTHQLLCSELQKGDFLRIYNISKTLHRLHVYGEVLAENGYFVRSDERTK-CHIRPLS 217 Query: 274 SAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFV--DTIGGYLAPNPSV 331 E I QL + Y + + +GF +KEI V D I G L+ + Sbjct: 218 DCLESISQLVGKQYRY------KNSPQLRLGFVAQEVKEILPDLVHTDEITGTLSVDVLG 271 Query: 332 IIPHLAKAIQELLQEVKELRDMIDKQNEEHQD 363 IIP L +++++L E+ L + D Q + QD Sbjct: 272 IIPFLVESLKQLNSEISMLDGVNDTQFKCLQD 303 >gi|291335155|gb|ADD94780.1| hypothetical protein [uncultured phage MedDCM-OCT-S11-C1836] Length = 249 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 69/176 (39%), Gaps = 25/176 (14%) Query: 44 ISAGGTAADNAVQARVNLGTDDASNLTK-GKIIANLLPFY------PIQQGGGIGQLDN- 95 +S GGT A +A AR LG S++ +A + P G G + + Sbjct: 29 VSDGGTGATSASAARTALGVAIGSDVQAFDAELAAIAGLTSAANKVPYFTGSGTADVADL 88 Query: 96 ----KIYLGWNGTQLLLQVDLSSMR---EVWTSQLAPRALQNLVDHAHKPNHIVYTTDSN 148 + L +++ + + + LA ++ L A+K ++YTT S+ Sbjct: 89 TAFARTLLDDANASAARTTLGTAIGSDVQAYDAGLA--SIAGLTTAANK---LIYTTGSD 143 Query: 149 QYAATPLSSFMRKLFSCHNGETLHQA-----IFQNGAQFYNNSQRIAAFLDDGDIM 199 YA L++F R + + T+ AQ N + +AA + DG+ + Sbjct: 144 TYAVADLTAFGRSILDDADAATVRATLGLVIGTDVQAQNSNLAGVVAAGVTDGNFL 199 >gi|328785444|ref|XP_393650.4| PREDICTED: myelin gene regulatory factor-like [Apis mellifera] Length = 1139 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 26/116 (22%), Positives = 43/116 (37%), Gaps = 16/116 (13%) Query: 260 SDERLKKVHGESS-ASAREIIEQLKFIDFNYLPE----SGMDSELRYLIGFSENNLKEIN 314 SD R K+ E ++QL+ + + Y PE SG+ + + G +++I Sbjct: 532 SDARAKQNVQEVDTREQLRNVQQLRVVRYRYAPEFAQHSGLGIKQQEDTGVIAQEVQQIL 591 Query: 315 EVFVDTIGGYLAPNPSVIIPHL-----------AKAIQELLQEVKELRDMIDKQNE 359 V G + PN I L A++EL + L ID+ Sbjct: 592 PEAVLPAGDIVLPNGQRIENFLMVNKERIFMENVGAVKELCKVTDSLETRIDQLER 647 >gi|183232159|ref|XP_653936.2| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|169802174|gb|EAL48549.2| hypothetical protein EHI_105260 [Entamoeba histolytica HM-1:IMSS] Length = 675 Score = 44.1 bits (102), Expect = 0.033, Method: Composition-based stats. Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 17/215 (7%) Query: 151 AATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNSQRIAAFLDDGDIMCYKRQKTVWQG 210 ++SF + L S + I Q S ++ ++ D++ + Sbjct: 194 TTPQITSFDQTLSSAEQTNSQEVKIEMQQLQSPQQSNTLSLASNEWDVLINTVNVS---- 249 Query: 211 IEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVHGE 270 + A H ++ E K + + N+ + +H + A + SDER K H Sbjct: 250 -QSATHQL-LSSELQKGDFLR--IYNISKTLHRLHVYGEVLAENGYFVRSDERTK-CHIR 304 Query: 271 SSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFV--DTIGGYLAPN 328 + E I QL + Y + + +GF +KE+ V D I G L+ + Sbjct: 305 PLSDCLESISQLVGKQYRY------KNSPQLRLGFVAQEVKEVLPDLVHTDEITGTLSVD 358 Query: 329 PSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQD 363 +IP L +++++L E+ L D Q + QD Sbjct: 359 VLGVIPFLVESLKQLNSEISMLDGANDTQFKCLQD 393 >gi|291335317|gb|ADD94934.1| hypothetical protein [uncultured phage MedDCM-OCT-S01-C29] Length = 604 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 69/176 (39%), Gaps = 25/176 (14%) Query: 44 ISAGGTAADNAVQARVNLGTDDASNLTK-GKIIANLLPFY------PIQQGGGIGQLDN- 95 +S GGT A +A AR LG S++ +A + P G G + + Sbjct: 384 VSDGGTGATSASAARTALGVAIGSDVQAFDAELAAIAGLTSAANKVPYFTGSGTADVADL 443 Query: 96 ----KIYLGWNGTQLLLQVDLSSMR---EVWTSQLAPRALQNLVDHAHKPNHIVYTTDSN 148 + L +++ + + + LA ++ L A+K ++YTT S+ Sbjct: 444 TAFARTLLDDANASAARTTLGTAIGSDVQAYDAGLA--SIAGLTTAANK---LIYTTGSD 498 Query: 149 QYAATPLSSFMRKLFSCHNGETLHQA-----IFQNGAQFYNNSQRIAAFLDDGDIM 199 YA L++F R + + T+ AQ N + +AA + DG+ + Sbjct: 499 TYAVADLTAFGRSILDDADAATVRATLGLVIGTDVQAQDSNLAGVVAAGVTDGNFL 554 >gi|255038797|ref|YP_003089418.1| hypothetical protein Dfer_5053 [Dyadobacter fermentans DSM 18053] gi|254951553|gb|ACT96253.1| hypothetical protein Dfer_5053 [Dyadobacter fermentans DSM 18053] Length = 533 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 30/121 (24%), Positives = 62/121 (51%), Gaps = 11/121 (9%) Query: 251 GAVGCSYFL--SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSEN 308 G G S + SD RLK+ E + +++Q+K + F Y ++G+++ + +G Sbjct: 293 GKPGSSLWAVASDSRLKQNITEFT-DGLALLKQIKPVWFQYNGKAGIETGEQKFVGIIAQ 351 Query: 309 NLKEINEVFVDTIGGYL----APNPSVIIPHLAKAIQE-LLQEVKELRDMIDKQNEEHQD 363 +++I T+G + N S + + A A+ L+ VKE +++I++++ E +D Sbjct: 352 EMQKIAPY---TVGSFTYQDSLGNKSEYLDYDANAVTYILINSVKEQQEVIEQKDAELKD 408 Query: 364 V 364 V Sbjct: 409 V 409 >gi|170581054|ref|XP_001895519.1| hypothetical protein [Brugia malayi] gi|158597502|gb|EDP35634.1| conserved hypothetical protein [Brugia malayi] Length = 487 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 11/109 (10%) Query: 265 KKVHGESSASAREIIEQLKFIDFNYLPES----GMDSELRYLIGFSENNLKEINEVFVDT 320 +++H + A + Q++ + ++Y PE G+ E R+ +G L EI V Sbjct: 3 EQIHEVDTKIALSHLAQIRVVGYSYKPEIALKWGLSEENRHRVGVIAQELAEILPDAVTD 62 Query: 321 IGGYLAPNPSVII-------PHLAKAIQELLQEVKELRDMIDKQNEEHQ 362 G YL + S I L + L +++ + + K H+ Sbjct: 63 NGDYLQVDDSRIFYETVAAATELCRLTGNLEHKIEAVEKLSHKLARLHR 111 >gi|124010598|ref|ZP_01695214.1| cell wall surface anchor family protein [Microscilla marina ATCC 23134] gi|123982213|gb|EAY23814.1| cell wall surface anchor family protein [Microscilla marina ATCC 23134] Length = 369 Score = 43.3 bits (100), Expect = 0.056, Method: Composition-based stats. Identities = 61/331 (18%), Positives = 115/331 (34%), Gaps = 27/331 (8%) Query: 50 AADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQ 109 N QA+ + T + G + N I G K+ L +G + Sbjct: 20 GMGNISQAQTSRLTIKKPGSSTGVTLDNETGGNTIFSIGS----SEKMRLTSDGKLGIGT 75 Query: 110 VDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGE 169 D + ++ L + V N + T N + + + + +G Sbjct: 76 TDPQDKLHIGSAGLVNIRVGKYVHLGETGNGLA-TVLGNNVRVSTTENVKMQFVNPIDGG 134 Query: 170 TLHQAIFQNGAQFYNNSQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCL 229 + +F G F+ A + GD++ + ++ + I N+ + +T Sbjct: 135 QAIRLMFNEGISFHT-----AGGVLTGDVITAGQNYAGYERMRI-DLNGNVGIGTTTPA- 187 Query: 230 YKTTVLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNY 289 YK V G IH A G SD R KK S + Q++ + + Sbjct: 188 YKLEV------DGNIHATERVYANGIE-LTSDIRYKKNIQPLSTDVVAKLSQVRGTSYKF 240 Query: 290 LPESGMDSELRYL----IGFSENNLKEINEVFVDTIGG-YLAPNPSVIIPHLAKAIQELL 344 E E R+L +G L ++ V Y + N +IP L +A+++L Sbjct: 241 RTEEF--KEKRFLKTKQVGIIAQELAQVYPELVMKGADGYYSVNYIGLIPILVEAVKDLR 298 Query: 345 QEVKELRDMIDKQNEEHQDVQDMSNTPLNSQ 375 + +L +++ +++ + T L SQ Sbjct: 299 KNNDKLDKQAQVLKAKNKKIKE-ALTGLKSQ 328 >gi|186472261|ref|YP_001859603.1| triple helix repeat-containing collagen [Burkholderia phymatum STM815] gi|184194593|gb|ACC72557.1| Collagen triple helix repeat [Burkholderia phymatum STM815] Length = 582 Score = 43.3 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 52/316 (16%), Positives = 92/316 (29%), Gaps = 42/316 (13%) Query: 47 GGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGI--------GQLDNKIY 98 G A + +G ++ K + I G G +IY Sbjct: 238 GPKGDTGAQGPQGPVGPAADTSTFVQKSGDTMTGQLRIAPASGDASLLLNAQGSNLPRIY 297 Query: 99 LGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSF 158 G + D +S+ + A N N + ++ + + S+ Sbjct: 298 FQKGGAGPSMLYDGTSIGFANATLNAWNMAINDSGSVSFRNTVNINNGTDLWLSAQSGSY 357 Query: 159 MRKLFSCHNGET--LHQAIFQNGAQFYNNSQRIAAFL--DDGDIMCYKR----------- 203 +L NG L + N++ F DDG + Sbjct: 358 SGRLMLNANGYAPFLRSNGANGNIEVVNSANSAVNFTIWDDGHVTARGNVYASSAFLQTN 417 Query: 204 ---QKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVG-CSYFL 259 + W G ++ NI + T YK + G G ++F D + Sbjct: 418 GDVNGSQWVGGWLSAGINNICMTRTTGNTYK---IGWDGGAGNLNFYVDGTFIAYLHSNA 474 Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVD 319 SD LK E + + +IEQ+ F ++ + +G L+ I + Sbjct: 475 SDSALKANVEEVTPDSLTLIEQIDFCSYDI-------AGRHVDMGIIAQQLQTITPRWA- 526 Query: 320 TIGGYLAPNPSVIIPH 335 Y P P P+ Sbjct: 527 ----YKPPTPDTPEPY 538 >gi|281206371|gb|EFA80558.1| NDT80/PhoG-like protein [Polysphondylium pallidum PN500] Length = 714 Score = 43.3 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 31/134 (23%), Positives = 52/134 (38%), Gaps = 18/134 (13%) Query: 248 TDKGAVGCSYFLSDERLKKVHGESS-ASAREIIEQLKFIDFNYLPE----SGMDSELRYL 302 T + G Y SD R+K + + + ++K D+ PE +GMD Sbjct: 565 TIYASEGV-YHPSDLRIKYDVKHVDTKNNLDNVNRMKIYDYKLHPEWVYMNGMDPYENAD 623 Query: 303 IGFSENNLKEINEVFVDTIGG-----------YLAPNPSVIIPHLAKAIQELLQEVKELR 351 G L EI V TIG + N S+I ++ A QEL ++V+ + Sbjct: 624 RGVIAQELYEILPRSVKTIGNRVIGGQEIENLMVIKNQSLIFENIG-ATQELSKQVENIN 682 Query: 352 DMIDKQNEEHQDVQ 365 ++ Q ++ Sbjct: 683 KKMESYESRLQSLE 696 >gi|332826387|gb|EGJ99230.1| hypothetical protein HMPREF9455_00554 [Dysgonomonas gadei ATCC BAA-286] Length = 840 Score = 43.3 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 14/108 (12%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEI------ 313 SD RLK + + I++LK +++ Y +G S +GF +L+++ Sbjct: 723 SDRRLKHDI-KPIGQGLDFIKKLKPVEYIYNTGNGKKS-----LGFIAQDLQQVMAEENM 776 Query: 314 --NEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNE 359 + V T G L + +IP L KAIQE ++ + IDKQ + Sbjct: 777 SGYSLVVPTQGDTLGITSTELIPVLTKAIQEQQVTIETQKQTIDKQQQ 824 >gi|332839997|ref|XP_522467.3| PREDICTED: myelin gene regulatory factor-like [Pan troglodytes] Length = 796 Score = 42.9 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + + I Q++ ++++Y PE S M + G ++EI Sbjct: 362 SDSRAKQNIQEVDTNEQLKRIAQMRIVEYDYKPEFASAMGINTAHQTGMIAQEMQEILPR 421 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 422 AVREVGDVTCGNGETLENFL 441 >gi|297262945|ref|XP_001108573.2| PREDICTED: myelin gene regulatory factor-like [Macaca mulatta] Length = 898 Score = 42.9 bits (99), Expect = 0.082, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + + I Q++ ++++Y PE S M + G ++EI Sbjct: 451 SDSRAKQNIQEVDTNEQLKRIAQMRIVEYDYKPEFASAMGINTAHQTGMIAQEVQEILPR 510 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 511 AVREVGDVTCENGETLENFL 530 >gi|254521513|ref|ZP_05133568.1| gp32, bacteriophage protein [Stenotrophomonas sp. SKA14] gi|219719104|gb|EED37629.1| gp32, bacteriophage protein [Stenotrophomonas sp. SKA14] Length = 693 Score = 42.9 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 68/379 (17%), Positives = 117/379 (30%), Gaps = 57/379 (15%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLAL---DNNSPRPISAGGTAADNAVQA 57 M + + G +++ S +N + D L D NS + NA Sbjct: 341 MDKTGGTFT---GHVGVNGNSLRSYGWNGVANDGVLVLGDANSYIFKNGSNFTFANAAGG 397 Query: 58 RVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMRE 117 + + T G F P + G L +Q S+ Sbjct: 398 YTAMLNAGGNVWTSGN-------FDPTSKVNKAGDKMT--------GPLKVQAGNGSLIR 442 Query: 118 VWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQ 177 + + +Q++ + T + +S + G + AI Sbjct: 443 IVNDGTSSTMIQSVNAAEDAFAKLFLTGEG--------ASLQGRGQQVDIGSNVDVAISA 494 Query: 178 NGAQFYNNSQRI--AAFLDDGDIMCYKRQKTVWQGIEI------AQHTANIA-LESTKN- 227 NG + YN+SQ F +G+I+ + G+ + + +++ N Sbjct: 495 NGLR-YNDSQGTNRVHFTANGEIVAKPINASPGNGVGSFGLISQGNYGGGLGFIDTGGNT 553 Query: 228 ----CLYKTTVLNMGRGPGYIHFDTDKGAVGC-----SYFLSDERLKKVHGESSASAREI 278 T+ PG + + G Y R K + Sbjct: 554 HGAIWTEYQTLQFGVNTPGALTPKMSLTSGGVLSTIGGYDFGSSRKLKNIEGTLPYGLAA 613 Query: 279 IEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSV------I 332 +EQ++ +Y PE D R + F L E+ VD G + Sbjct: 614 VEQMELAAGHYKPEYNDDGRRR--LFFVAEQLAELVPEAVDLEGVEFQGERVASVKLDQL 671 Query: 333 IPHLAKAIQELLQEVKELR 351 +P LAKAIQEL EV+ L+ Sbjct: 672 LPVLAKAIQELSAEVRALK 690 >gi|85701830|ref|NP_001028505.1| hypothetical protein LOC237558 [Mus musculus] gi|74190381|dbj|BAE25877.1| unnamed protein product [Mus musculus] gi|187956311|gb|AAI50916.1| Gene model 239, (NCBI) [Mus musculus] gi|187957126|gb|AAI50924.1| Gene model 239, (NCBI) [Mus musculus] Length = 904 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R+K+ E + I Q++ + ++Y PE S M + G ++EI Sbjct: 446 SDSRVKENIQEVDTNEQLRRIAQMRIVQYDYKPEFASAMGINTAHQTGMIAQEVQEILPR 505 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 506 AVREVGDVTGGNGETLENFL 525 >gi|74188453|dbj|BAE25858.1| unnamed protein product [Mus musculus] Length = 904 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R+K+ E + I Q++ + ++Y PE S M + G ++EI Sbjct: 446 SDSRVKENIQEVDTNEQLRRIAQMRIVQYDYKPEFASAMGINTAHQTGMIAQEVQEILPR 505 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 506 AVREVGDVTGGNGETLENFL 525 >gi|73968742|ref|XP_538281.2| PREDICTED: similar to CG3328-PA [Canis familiaris] Length = 1158 Score = 42.5 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + I Q++ ++++Y PE S M + G ++EI Sbjct: 469 SDSRAKQNIQEVDTNEQLRRIAQMRIVEYDYRPEFASSMGINTAHQTGMIAQEVQEILPR 528 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 529 AVREVGDVTCENGETLQNFL 548 >gi|163786013|ref|ZP_02180461.1| outer membrane protein, putative [Flavobacteriales bacterium ALC-1] gi|159877873|gb|EDP71929.1| outer membrane protein, putative [Flavobacteriales bacterium ALC-1] Length = 594 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 30/116 (25%), Positives = 49/116 (42%), Gaps = 7/116 (6%) Query: 246 FDTDKGAVGCSYF-LSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIG 304 F A +Y +SD +LKK + QL + +YL + D++ + G Sbjct: 457 FRGAFNATTGTYLQISDRKLKKDITTLEGGTLNKVLQLNPV--SYLMKDQTDTKRNH--G 512 Query: 305 FSENNLKEINEVFVDTI--GGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQN 358 +KEI + L + + +IP L KAIQE Q +K + I+ Q+ Sbjct: 513 LISQEVKEIFPSITHYVKESDLLTLSYTELIPILIKAIQEQQQIIKNQNEKIEAQS 568 >gi|239750374|ref|XP_001718110.2| PREDICTED: myelin gene regulatory factor [Homo sapiens] gi|310110393|ref|XP_001716702.2| PREDICTED: myelin gene regulatory factor [Homo sapiens] Length = 972 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + + I Q++ ++++Y PE S M + G ++EI Sbjct: 513 SDSRAKQNIQEVDTNEQLKRIAQMRIVEYDYKPEFASAMGINTAHQTGMIAQEVQEILPR 572 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 573 AVREVGDVTCGNGETLENFL 592 >gi|169204180|ref|XP_001718960.1| PREDICTED: myelin gene regulatory factor [Homo sapiens] Length = 972 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + + I Q++ ++++Y PE S M + G ++EI Sbjct: 513 SDSRAKQNIQEVDTNEQLKRIAQMRIVEYDYKPEFASAMGINTAHQTGMIAQEVQEILPR 572 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 573 AVREVGDVTCGNGETLENFL 592 >gi|119617649|gb|EAW97243.1| chromosome 12 open reading frame 28 [Homo sapiens] Length = 786 Score = 42.5 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + + I Q++ ++++Y PE S M + G ++EI Sbjct: 327 SDSRAKQNIQEVDTNEQLKRIAQMRIVEYDYKPEFASAMGINTAHQTGMIAQEVQEILPR 386 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 387 AVREVGDVTCGNGETLENFL 406 >gi|148689861|gb|EDL21808.1| mCG125060 [Mus musculus] Length = 353 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R+K+ E + I Q++ + ++Y PE S M + G ++EI Sbjct: 215 SDSRVKENIQEVDTNEQLRRIAQMRIVQYDYKPEFASAMGINTAHQTGMIAQEVQEILPR 274 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 275 AVREVGDVTGGNGETLENFL 294 >gi|291334968|gb|ADD94601.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C233] Length = 558 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 31/105 (29%), Positives = 45/105 (42%), Gaps = 15/105 (14%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVD 319 SDERLK+ E+S SA + I+ LK F++ + +D+ GF L I V Sbjct: 452 SDERLKENI-ENSGSALQDIKDLKVRQFDW--KDNIDT--HRDFGFVAQELHSIIPEAVS 506 Query: 320 TIGGYLAPNP----------SVIIPHLAKAIQELLQEVKELRDMI 354 L N S I+P L KA+QE ++ L + Sbjct: 507 VGSDELDDNGKPKQSWGVDYSHIVPRLVKAVQEQQTRIESLEAEV 551 >gi|150018933|ref|YP_001311187.1| cyclopropane-fatty-acyl-phospholipid synthase [Clostridium beijerinckii NCIMB 8052] gi|149905398|gb|ABR36231.1| Cyclopropane-fatty-acyl-phospholipid synthase [Clostridium beijerinckii NCIMB 8052] Length = 393 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 26/106 (24%), Positives = 49/106 (46%), Gaps = 2/106 (1%) Query: 233 TVLNMGRGPGY--IHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYL 290 +VL++G G G+ I G LS E+ KK + E + +++ +D+ L Sbjct: 168 SVLDVGCGWGFLLIEAAEKYSIHGIGITLSKEQYKKFNERIKERKLEKLLEVRLMDYRNL 227 Query: 291 PESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHL 336 ESG+ + +G E+ +E E+F+ I L P ++ ++ Sbjct: 228 AESGLSFDRIVSVGMVEHIGRENYELFIKNIDSVLKPKGLCLLDYI 273 >gi|332221363|ref|XP_003259831.1| PREDICTED: myelin gene regulatory factor-like [Nomascus leucogenys] Length = 564 Score = 42.1 bits (97), Expect = 0.12, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + + I Q++ ++++Y PE S M + G ++EI Sbjct: 370 SDSRAKQNIQEVDTNEQLKRIAQMRIVEYDYKPEFASAMGINTAHQTGMIAQEVQEILPR 429 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 430 AVREVGDVTCGNGETLENFL 449 >gi|300920084|ref|ZP_07136541.1| hypothetical protein HMPREF9540_03761 [Escherichia coli MS 115-1] gi|300412903|gb|EFJ96213.1| hypothetical protein HMPREF9540_03761 [Escherichia coli MS 115-1] Length = 1027 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 27/96 (28%), Positives = 43/96 (44%), Gaps = 7/96 (7%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLI---GFSENNLKEINEV 316 SD+RLK E +A + +E+L ++ + +E + + G LKE+ Sbjct: 931 SDKRLKSNFEEIE-NAVDKVEKLTGYVYD-KKSDLVKTEYSFEVREAGIIAQELKEVLPE 988 Query: 317 FVDTIG--GYLAPNPSVIIPHLAKAIQELLQEVKEL 350 V + G L N + + L AI+EL VKEL Sbjct: 989 AVSSFGPDEILGVNSAAVNALLVNAIKELSARVKEL 1024 >gi|313157451|gb|EFR56872.1| conserved hypothetical protein [Alistipes sp. HGB5] Length = 221 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 45/191 (23%), Positives = 79/191 (41%), Gaps = 24/191 (12%) Query: 176 FQNGAQFYNNSQRIAAFLD--DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTT 233 +QNG YN +R ++ +G C+K + + ++ + ++ S K LY T Sbjct: 38 YQNGYLVYNAMERYNGYMTNWEGWAHCWKFGNRMIK-FDLGPADSRVSSNSDKLVLYDT- 95 Query: 234 VLNMGRGPGYIHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPES 293 G+I D A Y SD K +S SA + QL+ + F + ++ Sbjct: 96 -----ENGGFI----DLYARNV-YTNSDAASKTNI-QSLGSATATLTQLRPVSFEWADKA 144 Query: 294 GMDSELRY--------LIGFSENNLKEINEVFV-DTIGGYLAPNPSVIIPHLAKAIQELL 344 R +GF ++++ V G+ N S +IP L K+IQEL Sbjct: 145 HYFKTSRRSTGVSNPKEMGFIAQEIEQVLPDIVAVDCEGHRVVNYSALIPLLTKSIQELN 204 Query: 345 QEVKELRDMID 355 +++ L+ I+ Sbjct: 205 GQIETLKAEIE 215 >gi|149371909|ref|ZP_01891228.1| putative phage tail protein [unidentified eubacterium SCB49] gi|149355049|gb|EDM43610.1| putative phage tail protein [unidentified eubacterium SCB49] Length = 412 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 57/143 (39%), Gaps = 32/143 (22%) Query: 246 FDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLK----FIDFNYLPESGMDSELRY 301 F D G G + SD+RLKK S +A I+ +K D P+ GM++ L Y Sbjct: 264 FINDLGYTGGFWSTSDKRLKKDINNIS-NALATIKSIKPVSYHFDIQKYPDMGMNTNLEY 322 Query: 302 LIGFSENNLK--------------------EINEVFVDTIGGYLAPNPSVIIPHLAKAIQ 341 GF LK EIN +L + + IIP L + I+ Sbjct: 323 --GFIAQELKNVLPNVVKEKMIPIKGARKSEINNNEPLKKELFLTVDYTRIIPILTQGIK 380 Query: 342 ELLQEVKELRDMIDKQNEEHQDV 364 E QE+ E + + EH +V Sbjct: 381 E-QQEIIETQ----NKKIEHLEV 398 >gi|320168176|gb|EFW45075.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864] Length = 1398 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 50/140 (35%), Gaps = 24/140 (17%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE----SGMDSELRYLIGFSENNLKEIN 314 SD R+K S A + ++ D++ PE S ++ G L E+ Sbjct: 824 SDRRVKDAIVPVDTSEALRNVNSMRLYDYSLRPEWMETSHRPADQSTDRGVLAQELFELM 883 Query: 315 EVFVDTIGG-----------YLAPNPSVIIPHLAKAIQELLQEVKELRDMID-------- 355 V+ IG +L N I+ A QEL ++ L + Sbjct: 884 PRAVNNIGDVPLASGETIPDFLVVNKDAILMETVAATQELSKQAAHLDSRLHAVSQRLHT 943 Query: 356 KQNEEHQDVQDMSNTPLNSQ 375 ++ +E ++ S ++S+ Sbjct: 944 RERDEEDNISTSSEDAVDSR 963 >gi|124002006|ref|ZP_01686860.1| cell wall surface anchor family protein [Microscilla marina ATCC 23134] gi|123992472|gb|EAY31817.1| cell wall surface anchor family protein [Microscilla marina ATCC 23134] Length = 502 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 24/98 (24%), Positives = 42/98 (42%), Gaps = 3/98 (3%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR--YLIGFSENNLKEINEVF 317 SD R KK ++ A + QL+ F + E + IGF + + Sbjct: 391 SDLRFKKDITPITSEAITKLGQLQGKTFQWRTEEFKEKNFSEGTKIGFIAQEMLNVYPEL 450 Query: 318 VDTIGG-YLAPNPSVIIPHLAKAIQELLQEVKELRDMI 354 V+ G Y + N S +IP L +A+++L + + L + Sbjct: 451 VNEGGDGYYSINYSGLIPILTEAVKDLNNKNEALEARV 488 >gi|326783687|ref|YP_004324081.1| fiber [Synechococcus phage S-SSM7] gi|310003699|gb|ADO98094.1| fiber [Synechococcus phage S-SSM7] Length = 2328 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 6/101 (5%) Query: 255 CSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRY--LIGFSENNLKE 312 + F SD RLK + A + +L + + E+G D +L +G S ++E Sbjct: 2223 ITAFASDMRLKTNIEKI-QGAVAKVCKLSGFTYEF-NETGRDLKLPAGKQLGVSAQQVQE 2280 Query: 313 INE--VFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELR 351 I V V I YL ++P L +AI+EL +E++ L+ Sbjct: 2281 IFPEAVAVRPIDEYLTVKYEKLVPVLIEAIKELKEEIESLK 2321 >gi|296487696|gb|DAA29809.1| hypothetical protein LOC781109 [Bos taurus] Length = 896 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + I Q++ ++++Y PE S M + G ++EI Sbjct: 449 SDSRAKQNVQEVDTNEQLRRIAQMRIVEYDYKPEFASAMGINTAHQTGMIAQEVREILPR 508 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 509 AVREVGDVTCENGETLENFL 528 >gi|156121291|ref|NP_001095793.1| hypothetical protein LOC781109 [Bos taurus] gi|151553530|gb|AAI48935.1| MGC139000 protein [Bos taurus] Length = 896 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + I Q++ ++++Y PE S M + G ++EI Sbjct: 449 SDSRAKQNVQEVDTNEQLRRIAQMRIVEYDYKPEFASAMGINTAHQTGMIAQEVREILPR 508 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 509 AVREVGDVTCENGETLENFL 528 >gi|194228569|ref|XP_001494410.2| PREDICTED: similar to Uncharacterized protein C12orf28, partial [Equus caballus] Length = 504 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + I Q++ ++++Y PE S M + G ++EI Sbjct: 57 SDSRAKQNVQEVDTNEQLRRIAQMRIVEYDYKPEFASAMGINAAHQTGMIAQEVREILPA 116 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 117 AVREVGDVTCENGETLEHFL 136 >gi|301773628|ref|XP_002922234.1| PREDICTED: myelin gene regulatory factor-like [Ailuropoda melanoleuca] Length = 897 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + I Q++ ++++Y PE S M + G ++EI Sbjct: 451 SDSRAKQNIQEVDTNEQLRRIAQMRIVEYDYKPEFASSMGINTAHQTGMIAQEVREILPG 510 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +G N + L Sbjct: 511 AVREVGDVTCENGETLQNFL 530 >gi|321464079|gb|EFX75090.1| hypothetical protein DAPPUDRAFT_306915 [Daphnia pulex] Length = 1243 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 49/128 (38%), Gaps = 9/128 (7%) Query: 260 SDERLKKVHGESS-ASAREIIEQLKFIDFNYLPE----SGMDSELRYLIGFSENNLKEIN 314 SD R K+ E+ ++ L+ + + Y E +G+ + R G ++ I Sbjct: 651 SDRRAKEGIEEADTKEQLRNVQALRVVHYKYTEEFAETAGLKEDERGDTGVIAQEVESII 710 Query: 315 EVFVDTIGGYLAPNPSVIIPHLA----KAIQELLQEVKELRDMIDKQNEEHQDVQDMSNT 370 V G + PN I L + E + VKEL + D +++ M++ Sbjct: 711 PDAVRPAGNIVLPNGRQIENFLVVNKERIFMENVGAVKELCKVTDNLETRIDELERMNHK 770 Query: 371 PLNSQKFD 378 ++ D Sbjct: 771 LAKLKRLD 778 >gi|328866338|gb|EGG14723.1| NDT80/PhoG-like protein [Dictyostelium fasciculatum] Length = 857 Score = 40.6 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 31/121 (25%), Positives = 48/121 (39%), Gaps = 9/121 (7%) Query: 248 TDKGAVGCSYFLSDERLK-KVHGESSASAREIIEQLKFIDFNYLPE----SGMDSELRYL 302 T + G Y SD R+K +H + + + +K D+ PE +GMD Sbjct: 652 TIYASEGV-YHPSDLRIKYDLHQVDTRTNLRNVNSMKIYDYKLHPEWVYMNGMDPYENQD 710 Query: 303 IGFSENNLKEINEVFVDTIGGYLAPNPSV---IIPHLAKAIQELLQEVKELRDMIDKQNE 359 G LKEI V TIG + + ++ + E + +EL ID+ E Sbjct: 711 RGIIAQELKEILPESVKTIGNRVINGREIDNLLVIKNNALMYENIGATQELSKQIDQLKE 770 Query: 360 E 360 E Sbjct: 771 E 771 >gi|192358796|ref|YP_001981106.1| putative purine nucleoside permease [Cellvibrio japonicus Ueda107] gi|190684961|gb|ACE82639.1| putative purine nucleoside permease [Cellvibrio japonicus Ueda107] Length = 347 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 44/188 (23%), Positives = 70/188 (37%), Gaps = 17/188 (9%) Query: 26 AYNNLLKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGK-IIANLLPFYPI 84 + L DL +++ I GG NA + + LG D +L+K IIA + P+ Sbjct: 62 PFPLGLYDLHMNDTGLLAICTGG-GVTNATASIMALGMDPRFDLSKAYWIIAGIGGGDPL 120 Query: 85 QQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYT 144 G G + G LL ++D + + W L P + +T Sbjct: 121 DVSLGTGVWAKHVVDG----DLLYEIDGREIPKDWQYGLIPLGAKKPNQE-----STGWT 171 Query: 145 TDSNQYAATP-LSSFMRKLFSCHN-GETLHQAIFQNGAQFYNNSQRIAAFLDDGDIMCYK 202 D+ YA P L ++ +L H +T F+ Y + R F+ GD + Sbjct: 172 VDTIHYALNPGLVNWAYQLTKDHPVADTPAMKAFREQYTGYPQALR-PPFVTIGDTI--- 227 Query: 203 RQKTVWQG 210 T W G Sbjct: 228 GSSTYWHG 235 >gi|120407025|ref|YP_956851.1| ORF1 [Mal de Rio Cuarto virus] gi|63033348|gb|AAY27969.1| ORF1 [Mal de Rio Cuarto virus] Length = 361 Score = 40.6 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 28/120 (23%), Positives = 49/120 (40%), Gaps = 4/120 (3%) Query: 191 AFLDDGDI-MCYKRQKTVW-QGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDT 248 AF+ DG C Q+T Q + + + +A KN L K V+N R +F Sbjct: 125 AFIRDGSYNFCQNCQETTSEQDLAALRFMSRLAKIIIKNSLTKDDVINAQRSLISYYFGR 184 Query: 249 DKGAVGCSYFLSDERLKKVHG-ESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSE 307 +V + + S L VHG +S ++ + ++K F L + + Y + + Sbjct: 185 VFKSVTLT-WDSKCNLPSVHGYSTSETSLDHYIRMKIDLFKALSHNNLVYGGNYQLVYQA 243 >gi|307308922|ref|ZP_07588605.1| hypothetical protein SinmeBDRAFT_4489 [Sinorhizobium meliloti BL225C] gi|306900556|gb|EFN31169.1| hypothetical protein SinmeBDRAFT_4489 [Sinorhizobium meliloti BL225C] Length = 663 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 3/56 (5%) Query: 28 NNLLKDLALD---NNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLP 80 N + D++ + + +S GGT + A AR +LGTD+A+N+T G + LP Sbjct: 142 NLKIGDMSANTLVLTTALAVSEGGTGSTTAAAARTSLGTDNANNITTGTLADARLP 197 >gi|291335615|gb|ADD95223.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C714] Length = 155 Score = 40.6 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 10/88 (11%) Query: 117 EVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQA-- 174 + + + LA ++ L A+K ++YTT S+ YA L++F R + + T+ Sbjct: 23 QAYDAGLA--SIAGLTTAANK---LIYTTGSDTYAVADLTAFGRSILDDADAATVRATLG 77 Query: 175 ---IFQNGAQFYNNSQRIAAFLDDGDIM 199 AQ N + +AA + DG+ + Sbjct: 78 LVIGTDVQAQDSNLAGVVAAGVTDGNFL 105 >gi|291334873|gb|ADD94512.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C1441] Length = 155 Score = 40.2 bits (92), Expect = 0.47, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 10/88 (11%) Query: 117 EVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQA-- 174 + + + LA ++ L A+K ++YTT S+ YA L++F R + + T+ Sbjct: 23 QAYDAGLA--SIAGLTTAANK---LIYTTGSDTYAVADLTAFGRSILDDADAATVRATLG 77 Query: 175 ---IFQNGAQFYNNSQRIAAFLDDGDIM 199 AQ N + +AA + DG+ + Sbjct: 78 LVIGTDVQAQNSNLAGVVAAGVTDGNFL 105 >gi|260641890|ref|ZP_05413892.2| putative outer membrane protein [Bacteroides finegoldii DSM 17565] gi|260624236|gb|EEX47107.1| putative outer membrane protein [Bacteroides finegoldii DSM 17565] Length = 1096 Score = 40.2 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 67/171 (39%), Gaps = 9/171 (5%) Query: 195 DGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLY--KTTVLNMGRGPGYIHFDTDKGA 252 D I ++ TV + ++ + +++ + + L K T LN+ DT Sbjct: 36 DSKITIQQKNMTVIEALKSVERQTKMSINYSDSQLKSKKLTNLNLKNALITTALDTILKG 95 Query: 253 VGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKE 312 G +Y + D + +S A + + K +D N P G++ + G S + + Sbjct: 96 TGFTYQIRDNYVIVTEQKSVAPSVVKNIRGKVVDENGEPLIGVNISVE---GSSTGTITD 152 Query: 313 INEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQD 363 +N F+ + I ++ A Q + K+ ++ KQ+ E D Sbjct: 153 MNGSFMIKASD----KSKLKISYIGYATQMIAVSNKDFYQVVLKQDAEVLD 199 >gi|330994925|ref|ZP_08318847.1| hypothetical protein SXCC_04812 [Gluconacetobacter sp. SXCC-1] gi|329758186|gb|EGG74708.1| hypothetical protein SXCC_04812 [Gluconacetobacter sp. SXCC-1] Length = 359 Score = 40.2 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 33/127 (25%), Positives = 50/127 (39%), Gaps = 21/127 (16%) Query: 1 MPRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVN 60 MP + P S+I ++ + ++ + + AG T R N Sbjct: 18 MPATG---TPGWATDGDPASSIPATDFPASHYNMTVAEMVQVILDAGLT------LDRTN 68 Query: 61 LGTDDASNLTKGKIIANLLP---FYPIQQGGGIGQLDNKIYLGW------NGTQLLLQVD 111 G A+ K+IA F PIQQGGG Q NK+YLGW + +L Q+D Sbjct: 69 WGQLSAA---IQKLIAKPFGGTIFTPIQQGGGTDQGSNKVYLGWEIINDLSTGRLRYQID 125 Query: 112 LSSMREV 118 + + Sbjct: 126 TDDIGAL 132 >gi|293348602|ref|XP_001080934.2| PREDICTED: myelin gene regulatory factor-like [Rattus norvegicus] Length = 966 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 34/80 (42%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R+K+ E + I Q++ + ++Y PE S M + + G ++EI Sbjct: 506 SDIRVKENVQEVDTNEQLRRIAQMRIVQYDYKPEFASAMGIDTAHQTGMIAQEVQEILPR 565 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +GG N + L Sbjct: 566 AVREVGGVTCGNGETLENFL 585 >gi|149066917|gb|EDM16650.1| rCG49167, isoform CRA_b [Rattus norvegicus] Length = 300 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 34/80 (42%), Gaps = 3/80 (3%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R+K+ E + I Q++ + ++Y PE S M + + G ++EI Sbjct: 160 SDIRVKENVQEVDTNEQLRRIAQMRIVQYDYKPEFASAMGIDTAHQTGMIAQEVQEILPR 219 Query: 317 FVDTIGGYLAPNPSVIIPHL 336 V +GG N + L Sbjct: 220 AVREVGGVTCGNGETLENFL 239 >gi|296102080|ref|YP_003612226.1| hypothetical protein ECL_01719 [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295056539|gb|ADF61277.1| hypothetical protein ECL_01719 [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 153 Score = 39.8 bits (91), Expect = 0.63, Method: Composition-based stats. Identities = 24/104 (23%), Positives = 47/104 (45%), Gaps = 5/104 (4%) Query: 260 SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMD--SELRYLIGFSENNLKEINEVF 317 SDERLK+V E + LK +++++ SG ++ R GF + +NE + Sbjct: 50 SDERLKEVISNQVDGYFERLSALKVVEYHWNDISGASPMAKARVRRGFIAQQVNAVNESY 109 Query: 318 V---DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQN 358 +T + + I+ L A+ EL +++ E+ + + Sbjct: 110 ALPPETEDDFWGIDDRAIVADLLLAVLELKEKLAEVTKKLTTEE 153 >gi|86356748|ref|YP_468640.1| hypothetical protein RHE_CH01106 [Rhizobium etli CFN 42] gi|86280850|gb|ABC89913.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 151 Score = 39.8 bits (91), Expect = 0.69, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 45/122 (36%), Gaps = 5/122 (4%) Query: 1 MPR--IREVYILPAGSQAYPNSNISSSAYNNLLKDLALD--NNSPRPISAGGTAA-DNAV 55 MPR VY PAG+ I + +N L DL + N+ PR SA TA A Sbjct: 1 MPRNPSTGVYSKPAGTTPSVGQVIDPAPWNALTTDLGNEITNSLPRDGSAPMTAPLKTAS 60 Query: 56 QARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSM 115 G A+N G + Q G +G +Y +G + D +++ Sbjct: 61 GTASAPGIGFATNPQTGLYLKGGGLLGFTQNGVDVGFDKASVYAAKSGDYTAVASDDNAV 120 Query: 116 RE 117 Sbjct: 121 HR 122 >gi|307565665|ref|ZP_07628138.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A] gi|307345628|gb|EFN90992.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A] Length = 312 Score = 39.8 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 50/117 (42%), Gaps = 22/117 (18%) Query: 257 YFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELR---------------- 300 Y SDER K ++ I L+ + +Y +SG+ S R Sbjct: 102 YNYSDERAKTNV-KTIDGGLNTILNLRPV--SYTWKSGIGSATRSSHLMTVANGPSCDKN 158 Query: 301 YLIGFSENNLKEINEVFV--DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMID 355 GF L+E+ V D G L N + IIP L K+IQ+L +V+E + M++ Sbjct: 159 LQFGFLAQELEEVIPDAVKTDEEGRKLI-NYTAIIPILVKSIQDLQGKVEEQQRMLE 214 >gi|152983189|ref|YP_001353882.1| hypothetical protein mma_2192 [Janthinobacterium sp. Marseille] gi|151283266|gb|ABR91676.1| Hypothetical protein mma_2192 [Janthinobacterium sp. Marseille] Length = 588 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 40/102 (39%), Gaps = 6/102 (5%) Query: 1 MPRIR-EVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARV 59 MPR VY LPAG+ ++ISS NN + D+A + + I+A G A Sbjct: 1 MPRNGSGVYTLPAGNPVISGTSISSVTQNNTMSDVAAELTNS--IAADGQKTPTADLPMG 58 Query: 60 NL---GTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIY 98 N G D + + + + G G ++ + Sbjct: 59 NFRHTGVSDGTARNQYASVGQTQDSALLWCGLATGTANDLTF 100 >gi|327193446|gb|EGE60343.1| hypothetical protein RHECNPAF_158002 [Rhizobium etli CNPAF512] Length = 452 Score = 39.4 bits (90), Expect = 0.82, Method: Composition-based stats. Identities = 28/122 (22%), Positives = 39/122 (31%), Gaps = 5/122 (4%) Query: 1 MPR--IREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAAD---NAV 55 MPR VY PAG+ I + +N L DL + + P A A Sbjct: 1 MPRNPSTGVYSKPAGTTPSVGQVIDPAPWNALTTDLGNEITNSLPRDGSAPMAAPLKAAS 60 Query: 56 QARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSM 115 G ASN G + Q G I +Y +G L D +++ Sbjct: 61 GTVSAPGVGFASNPQTGLYLKGGGVLGFTQNGVDITFDKASVYAAKSGDYTALATDDNAV 120 Query: 116 RE 117 Sbjct: 121 HR 122 >gi|46451896|gb|AAS98053.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 864 Score = 39.4 bits (90), Expect = 0.83, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 10/86 (11%) Query: 161 KLFSCHNGETLHQA-IFQNGAQFYNNSQRIAAFLDDGDIMCYKRQK---TVWQGIEIAQH 216 + F C+ + A +F NG YN++ I+ D DI R K +WQG+ A + Sbjct: 378 EFFYCNTSSLFNSAYVFNNGTYMYNSTNSIS----DSDITLQCRIKQIINMWQGVGQAMY 433 Query: 217 TANIALESTKNCLYKTTVLNMGRGPG 242 IA T C+ T L + R G Sbjct: 434 APPIAGNIT--CISNITGLLLTRDGG 457 >gi|134295382|ref|YP_001119117.1| hypothetical protein Bcep1808_1272 [Burkholderia vietnamiensis G4] gi|134138539|gb|ABO54282.1| hypothetical protein Bcep1808_1272 [Burkholderia vietnamiensis G4] Length = 407 Score = 39.4 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 3/65 (4%) Query: 69 LTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGW--NGTQLLLQVDLSSMR-EVWTSQLAP 125 G L + P+QQG G+GQ N + +GW +G+ L + VD + + + SQL+ Sbjct: 65 ADAGANYQPKLDYTPVQQGTGVGQGANTVKMGWAKDGSGLRVTVDTNDLGLLAFASQLSA 124 Query: 126 RALQN 130 A Q+ Sbjct: 125 YASQD 129 >gi|289450110|ref|YP_003474880.1| hypothetical protein HMPREF0868_0557 [Clostridiales genomosp. BVAB3 str. UPII9-5] gi|289184657|gb|ADC91082.1| conserved hypothetical protein [Clostridiales genomosp. BVAB3 str. UPII9-5] Length = 469 Score = 39.4 bits (90), Expect = 0.90, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 33/62 (53%) Query: 259 LSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFV 318 E ++ +G ++ S E++ Q+ I+ +LP +G+ +E + + ++++N + Sbjct: 157 DPHEPVEYFYGSTNKSIMELLSQIGNINLQHLPLNGLKAEAHFYVVPQSGQIQQVNVQII 216 Query: 319 DT 320 DT Sbjct: 217 DT 218 >gi|325188925|emb|CCA23454.1| Phosphatidylinositol 3kinase tor2 putative [Albugo laibachii Nc14] Length = 3153 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 35/104 (33%), Positives = 52/104 (50%), Gaps = 6/104 (5%) Query: 262 ERLKKV-HGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVDT 320 ERL V ++ AS R ++ L+ +D + P G+D LR L + + EI + + T Sbjct: 632 ERLLTVGIADTDASIR--LQVLESLDARFDPLLGLDDNLRCLFISLHDEVFEIRQRAMST 689 Query: 321 IGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEEHQDV 364 +G NPSVI+P L Q L+Q + EL D Q +E V Sbjct: 690 LGRLAHHNPSVILPSLR---QTLVQLLAELEYSGDTQTKEEGSV 730 >gi|313768280|ref|YP_004061960.1| hypothetical protein MpV1_077 [Micromonas sp. RCC1109 virus MpV1] gi|312598976|gb|ADQ91000.1| hypothetical protein MpV1_077 [Micromonas sp. RCC1109 virus MpV1] Length = 424 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 26/93 (27%), Positives = 42/93 (45%), Gaps = 9/93 (9%) Query: 240 GPGY-IHFDTDKGAVGCSYFLSDERLKKVHGE-SSASAREII--EQLKFIDFNYLPESGM 295 P Y + + + A G + SDER+KK + + A II Q K D+ +G Sbjct: 188 TPSYRLDVNGNFRAFGITN-SSDERIKKNITDINDGDALNIIRLLQPKRYDYIDTKTAGT 246 Query: 296 DSELRYLIGFSENNLKEINEVFVDTIGGYLAPN 328 + ++ GF L+++ VDTI + PN Sbjct: 247 EGQVW---GFIAQELEDVMPYAVDTISDF-IPN 275 >gi|149632301|ref|XP_001512136.1| PREDICTED: similar to chromosome 12 open reading frame 28 [Ornithorhynchus anatinus] Length = 826 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 28/130 (21%), Positives = 47/130 (36%), Gaps = 21/130 (16%) Query: 254 GCSYFLSDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNL 310 G + SD R K+ E + I Q++ ++++Y PE S M E + G + Sbjct: 390 GAVLYPSDSRAKQNIQEVDPNEQLRRIAQMRVVEYDYKPEFASVMGIEHVHQTGMIAQEV 449 Query: 311 KEINEVFVDTIGGYLAPNPSVI------------------IPHLAKAIQELLQEVKELRD 352 K+I V +G N + + L K +L + + EL Sbjct: 450 KDILPSAVREVGDVTCANGEKVENLLVVDKDQIFMENVGAVKQLCKLTNDLEERIGELEV 509 Query: 353 MIDKQNEEHQ 362 K N+ Q Sbjct: 510 WNQKLNKLQQ 519 >gi|330992661|ref|ZP_08316605.1| hypothetical protein SXCC_02564 [Gluconacetobacter sp. SXCC-1] gi|329760139|gb|EGG76639.1| hypothetical protein SXCC_02564 [Gluconacetobacter sp. SXCC-1] Length = 542 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 6/46 (13%) Query: 79 LPFYPIQQGGGIGQLDNKIYLGW------NGTQLLLQVDLSSMREV 118 L F P+QQGGG Q N++ LGW + +L Q+D + + + Sbjct: 206 LGFTPVQQGGGADQGSNRVCLGWEIINDASTGRLRYQIDATDIGAL 251 >gi|126339419|ref|XP_001369709.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 898 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 30/72 (41%), Gaps = 3/72 (4%) Query: 260 SDERLKKVHGESSAS-AREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD R K+ E + I +++ ++++Y PE S M + G ++EI Sbjct: 438 SDSRAKQNIQEVDTNEQLRRINRMRIVEYDYKPEFASVMGINSTHQTGMIAQEVQEILPR 497 Query: 317 FVDTIGGYLAPN 328 V +G N Sbjct: 498 AVREVGDVTCEN 509 >gi|300917271|ref|ZP_07133946.1| phage Tail Collar Domain protein [Escherichia coli MS 115-1] gi|300415469|gb|EFJ98779.1| phage Tail Collar Domain protein [Escherichia coli MS 115-1] Length = 323 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 22/71 (30%), Positives = 31/71 (43%), Gaps = 4/71 (5%) Query: 6 EVYILPAGSQAYPNSNIS----SSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVNL 61 + A + A N ++ S + L K L +N P++ GGT A NA A NL Sbjct: 100 GAWETFATTSANQNITVTINGTSLSIPALGKLLQKGSNGALPVNQGGTGATNAADASTNL 159 Query: 62 GTDDASNLTKG 72 G + S L G Sbjct: 160 GLGEGSALPVG 170 >gi|239626661|ref|ZP_04669692.1| cyclopropane-fatty-acyl-phospholipid synthase [Clostridiales bacterium 1_7_47_FAA] gi|239516807|gb|EEQ56673.1| cyclopropane-fatty-acyl-phospholipid synthase [Clostridiales bacterium 1_7_47FAA] Length = 392 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 49/106 (46%), Gaps = 2/106 (1%) Query: 234 VLNMGRGPGY--IHFDTDKGAVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLP 291 +L++G G G+ I + G G LS E+ + + + +K +D+ LP Sbjct: 172 LLDVGCGWGFLLIRAAKEYGVRGTGITLSREQYEGFTKRIEEEGLKELVSVKLMDYRDLP 231 Query: 292 ESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSVIIPHLA 337 ESGM + +G E+ +E F+D + L P+ ++ +++ Sbjct: 232 ESGMKFDRIVSVGMLEHVGRENYGRFMDCMDKVLKPHGVFLLHYIS 277 >gi|134295396|ref|YP_001119131.1| hypothetical protein Bcep1808_1286 [Burkholderia vietnamiensis G4] gi|134138553|gb|ABO54296.1| hypothetical protein Bcep1808_1286 [Burkholderia vietnamiensis G4] Length = 406 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 16/48 (33%), Positives = 25/48 (52%), Gaps = 2/48 (4%) Query: 69 LTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGW--NGTQLLLQVDLSS 114 G L + P+QQG G+GQ N + +GW +G+ L + VD + Sbjct: 65 ADAGTNYQPKLGYTPVQQGTGVGQGTNAVKIGWAKDGSGLRVTVDAND 112 >gi|47210713|emb|CAF90005.1| unnamed protein product [Tetraodon nigroviridis] Length = 974 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 3/67 (4%) Query: 260 SDERLKKVHGESSA-SAREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEINEV 316 SD+R K E + + I Q++ ++F+Y PE S + + + G +KE+ Sbjct: 576 SDQRAKCNIQEVDSEQQLKRINQMRIVEFDYKPEFASSLGIDHTHQTGVLAQEVKELLPS 635 Query: 317 FVDTIGG 323 V +G Sbjct: 636 AVTQVGD 642 >gi|209523965|ref|ZP_03272517.1| hypothetical protein AmaxDRAFT_1335 [Arthrospira maxima CS-328] gi|209495637|gb|EDZ95940.1| hypothetical protein AmaxDRAFT_1335 [Arthrospira maxima CS-328] Length = 616 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 64/172 (37%), Gaps = 32/172 (18%) Query: 204 QKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMGRGPGYIHFDTDKGAVGCSYFL---- 259 + VW+G ++ ++ N +Y + + G IH G +Y Sbjct: 412 ESNVWRGDPSDRY----PIDVYNNQMY--LIKDNGLSSSNIHTRFRLGGTNIAYINDSGD 465 Query: 260 -----SDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDS--ELRYLIGFSENNLKE 312 SD R K V E + ++ ++I++L I F Y + + + IG ++E Sbjct: 466 YVKGSSDIRFKTVLNE-NVNSLDLIKKLNVIKFKYNKLAELHGFCDKEAKIGLIAQEVQE 524 Query: 313 INEVFVDTIGGYLAPNPSV--------------IIPHLAKAIQELLQEVKEL 350 V+ + +P V ++P L IQEL +V+ L Sbjct: 525 YYPEAVEVVKNDHTDSPEVDDKKIDYLTILYEKLVPLLVAGIQELSAKVESL 576 >gi|194364667|ref|YP_002027277.1| hypothetical protein Smal_0889 [Stenotrophomonas maltophilia R551-3] gi|194347471|gb|ACF50594.1| hypothetical protein Smal_0889 [Stenotrophomonas maltophilia R551-3] Length = 656 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 54/262 (20%), Positives = 86/262 (32%), Gaps = 40/262 (15%) Query: 116 REVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAI 175 +WTS + +K ++ + N Y + + + F N T Sbjct: 406 GTIWTSGNF-----DPAAKVNKAGDVM-SGTLNMYVGQGSTQVVHQRFGWANNVTRWATA 459 Query: 176 FQNGAQF------YNN---SQRIAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTK 226 + A +N +QR A F + R+ T GI +A ++ Sbjct: 460 MEADATLSLYSYDFNGGSPTQR-ANFSN-----IQYRENTFVGGINSTGASAAFVIKDRG 513 Query: 227 NCLYKTTVLNMGR--------GPGYIHFD-TDKGAVGCS--YFLSDERLKKVHGESSASA 275 N + + N G T GA+ S Y R K + Sbjct: 514 NANREFHIYNTDNVLRIWSSTAQGVDRLTLTSAGALATSGGYDFGSSRKLKNIEGALPYG 573 Query: 276 REIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLKEINEVFVDTIGGYLAPNPSV---- 331 +EQ++ +Y PE D R + F L E+ VD G Sbjct: 574 LAAVEQMELAAGHYKPEYNDDGRRR--LFFVAEQLAEVVPEAVDLEGVEFQGERVASVKL 631 Query: 332 --IIPHLAKAIQELLQEVKELR 351 ++P +AKAIQEL EV+ L+ Sbjct: 632 DQLLPVMAKAIQELAAEVRALK 653 >gi|18466716|ref|NP_569523.1| putative phage tail protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|16506032|emb|CAD09918.1| putative phage tail protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18] Length = 850 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 47/116 (40%), Gaps = 7/116 (6%) Query: 251 GAVGCSYFLS--DERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLI--GFS 306 G C+ ++S D RLK H +A++ + L+ I + D Y I G Sbjct: 708 GNASCNQWISTSDIRLKA-HLNDIENAKDKVRTLRGITYYKRNNIVEDKYSYYEIEAGLV 766 Query: 307 ENNLKEINEVFVDTIGG--YLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 ++E+ V IG +L N ++ L AI E++ + I+ +E Sbjct: 767 AQEVQEVLPEAVRKIGDTEFLGVNYGGVVALLVNAINEMIDDEDVQSKRIEALEDE 822 >gi|313887448|ref|ZP_07821137.1| hypothetical protein HMPREF9294_0656 [Porphyromonas asaccharolytica PR426713P-I] gi|312923090|gb|EFR33910.1| hypothetical protein HMPREF9294_0656 [Porphyromonas asaccharolytica PR426713P-I] Length = 417 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 26/115 (22%), Positives = 43/115 (37%), Gaps = 15/115 (13%) Query: 260 SDERLKKVHGESSA-SAREIIEQLKFI------------DFNYLPESGMDSELRYLIGFS 306 SD+ K+ E A + +L+ + D L +S + + G Sbjct: 195 SDQSAKENVEEIDEDEASRALLKLRPVTYTLKEDNANASDLVSLDKSNFKEQNHHNYGLI 254 Query: 307 ENNLKEINEVFV--DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNE 359 + EI V DTI +IP L A+Q QE++ L++ I + E Sbjct: 255 AQEVLEIFPDIVEYDTISQQYGIRYMELIPILIVALQRQQQEIETLKEHIGRSTE 309 >gi|1513238|gb|AAB06761.1| ORFveg132; similar to Caenorhabditis elegans ORF F59B10.1 encoded by EMBL Accession Number Z49132 [Dictyostelium discoideum] Length = 608 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 51/121 (42%), Gaps = 19/121 (15%) Query: 248 TDKGAVGCSYFLSDERLKKVHGE-SSASAREIIEQLKFIDFNYLPE----SGMDSEL-RY 301 T + G Y SD R+K S S + + ++K D+ Y P+ +G D L Sbjct: 475 TIYASEGV-YHPSDLRIKYDLKSIDSKSNLDNVNRMKLYDYKYNPQWTHMNGRDPYLDNC 533 Query: 302 LIGFSENNLKEINEVFVDTIGG-----------YLAPNPSVIIPHLAKAIQELLQEVKEL 350 G +L+ I V TIG + N ++++ + A QEL +++ E+ Sbjct: 534 DRGVIAQDLQRILPKTVRTIGNKNVNGQEIENLLVIKNEALVMETIG-ATQELSKQMDEM 592 Query: 351 R 351 + Sbjct: 593 K 593 >gi|302061780|ref|ZP_07253321.1| tail fiber domain protein [Pseudomonas syringae pv. tomato K40] Length = 457 Score = 37.9 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 48/147 (32%), Gaps = 12/147 (8%) Query: 28 NNLLKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQG 87 N L N P++ GGT A ++ A +LG + + + F +G Sbjct: 75 NALAALGTTGNYDILPVAKGGTGATDSASALTSLGMKGGA----YDALIKSIGF----RG 126 Query: 88 GGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDS 147 +G +Y+GWNG + + L A ++ ++ + S Sbjct: 127 APVGYNVQGLYMGWNGNNDGGANYICNRG----GGLGGHAWWSVNSDNTAAGPVMTYSYS 182 Query: 148 NQYAATPLSSFMRKLFSCHNGETLHQA 174 + LS + G + A Sbjct: 183 GVLSVAQLSVVGSPIALSSGGTSATTA 209 >gi|42522290|ref|NP_967670.1| cell wall surface anchor family protein [Bdellovibrio bacteriovorus HD100] gi|39574821|emb|CAE78663.1| cell wall surface anchor family protein [Bdellovibrio bacteriovorus HD100] Length = 1365 Score = 37.9 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 55/126 (43%), Gaps = 13/126 (10%) Query: 252 AVGCSYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMDSELRYLIGFSENNLK 311 AV + SD+RLK + S + + L+ + + + D + GF L+ Sbjct: 1222 AVNGTIQTSDQRLKAEVQDLSQ-GLDFVMALQPKSYKWKSDQN-DPAAKTHWGFMAQELE 1279 Query: 312 E-----------INEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNEE 360 + + +T Y N S +I + KA+QEL +V + D +D+ +E Sbjct: 1280 AQVKRSTASAAPVGLIHHETNSDYYGVNYSELIAPVVKALQELYLKVLGVSDRMDQLEKE 1339 Query: 361 HQDVQD 366 +Q++++ Sbjct: 1340 NQNLKN 1345 >gi|301382058|ref|ZP_07230476.1| tail fiber domain protein [Pseudomonas syringae pv. tomato Max13] Length = 459 Score = 37.9 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 48/147 (32%), Gaps = 12/147 (8%) Query: 28 NNLLKDLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQG 87 N L N P++ GGT A ++ A +LG + + + F +G Sbjct: 95 NALAALGTTGNYDILPVAKGGTGATDSASALTSLGMKGGA----YDALIKSIGF----RG 146 Query: 88 GGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDS 147 +G +Y+GWNG + + L A ++ ++ + S Sbjct: 147 APVGYNVQGLYMGWNGNNDGGANYICNRG----GGLGGHAWWSVNSDNTAAGPVMTYSYS 202 Query: 148 NQYAATPLSSFMRKLFSCHNGETLHQA 174 + LS + G + A Sbjct: 203 GVLSVAQLSVVGSPIALSSGGTSATTA 229 >gi|213967367|ref|ZP_03395515.1| tail fiber domain protein [Pseudomonas syringae pv. tomato T1] gi|302134255|ref|ZP_07260245.1| tail fiber domain protein [Pseudomonas syringae pv. tomato NCPPB 1108] gi|213927668|gb|EEB61215.1| tail fiber domain protein [Pseudomonas syringae pv. tomato T1] Length = 477 Score = 37.9 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 45/132 (34%), Gaps = 12/132 (9%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P++ GGT A ++ A +LG + + + F +G +G +Y+GWN Sbjct: 110 PVAKGGTGATDSASALTSLGMKGGA----YDALIKSIGF----RGAPVGYNVQGLYMGWN 161 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKL 162 G + + L A ++ ++ + S + LS + Sbjct: 162 GNNDGGANYICNRG----GGLGGHAWWSVNSDNTAAGPVMTYSYSGVLSVAQLSVVGSPI 217 Query: 163 FSCHNGETLHQA 174 G + A Sbjct: 218 ALSSGGTSATTA 229 >gi|148734528|ref|YP_001285545.1| Stf [Enterobacteria phage TLS] gi|38046787|gb|AAR09286.1| Stf [Enterobacteria phage TLS] Length = 674 Score = 37.5 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 27/117 (23%), Positives = 46/117 (39%), Gaps = 3/117 (2%) Query: 33 DLALDNNSPRPISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQ 92 D + PI +GGT ADN QAR NLG ++T + A +Y Sbjct: 417 DRSAGVRKALPIVSGGTGADNVAQARRNLGVGIDQDVTFNAVWAK--AYYGQGGAWTAAV 474 Query: 93 LDNKIYLGWNGTQLLLQVDLSSMR-EVWTSQLAPRALQNLVDHAHKPNHIVYTTDSN 148 + K +G+ LL ++ S + + +A Q + ++V+ + N Sbjct: 475 IKEKSQQASDGSNLLTYIENSWFGIQNYLVGTPGQAYQYVWSFNTFGGNVVWYSMRN 531 >gi|254253772|ref|ZP_04947089.1| Large exoprotein involved in heme utilization or adhesion [Burkholderia dolosa AUO158] gi|124898417|gb|EAY70260.1| Large exoprotein involved in heme utilization or adhesion [Burkholderia dolosa AUO158] Length = 1531 Score = 37.5 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 59/189 (31%), Gaps = 10/189 (5%) Query: 2 PRIREVYILPAGSQAYPNSNISSSAYNNLLKDLALDNNSPRPISAGGTAADNAVQARVNL 61 P +V+ S SN S+ A + + ++ NSP P G + A N+ + + Sbjct: 154 PATAQVWHGTTSSDWTVGSNWSTGAAPSGTATVTINTNSPNPTVLGVSGAVNSTIGNLQM 213 Query: 62 GTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTS 121 GT ++ + + L +G + + ++ W+ Sbjct: 214 GTATGTSALTIQNGSTL----------SSSVTTGTNQIGQSSGATNATITVTGTGSSWSM 263 Query: 122 QLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQ 181 + L T A +S R + + NG TL A Sbjct: 264 ASSTIFGAALGSTGTLNVANGATASLASNLAVGQTSGGRGVLNVTNGSTLTTGGATIAAS 323 Query: 182 FYNNSQRIA 190 N++ I+ Sbjct: 324 NSNSTGSIS 332 >gi|218531765|ref|YP_002422581.1| hypothetical protein Mchl_3837 [Methylobacterium chloromethanicum CM4] gi|218524068|gb|ACK84653.1| hypothetical protein Mchl_3837 [Methylobacterium chloromethanicum CM4] Length = 441 Score = 37.5 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 18/41 (43%), Positives = 26/41 (63%), Gaps = 1/41 (2%) Query: 84 IQQGGGIGQ-LDNKIYLGWNGTQLLLQVDLSSMREVWTSQL 123 ++QGGG+ Q N + LGW+ T LL QVD + +W S+L Sbjct: 303 VRQGGGVRQNASNMVRLGWSNTNLLAQVDALELGALWASRL 343 >gi|157157303|ref|YP_001462549.1| hypothetical protein EcE24377A_1451 [Escherichia coli E24377A] gi|157079333|gb|ABV19041.1| hypothetical protein EcE24377A_1451 [Escherichia coli E24377A] Length = 263 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 15/37 (40%), Positives = 19/37 (51%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLL 79 P+S+GGT A A AR NLG D + + A L Sbjct: 153 PVSSGGTGATTASGARTNLGLGDVATQNLSTLDARYL 189 >gi|295096326|emb|CBK85416.1| hypothetical protein ENC_16990 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 420 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 2/34 (5%) Query: 38 NNSPRPISAGGTAADNAVQARVNLGTDDASNLTK 71 +N PIS GGT A A +R NLG S+ T+ Sbjct: 136 SNGAVPISQGGTGATTAEGSRTNLGL--GSSATR 167 >gi|124004568|ref|ZP_01689413.1| conserved hypothetical protein [Microscilla marina ATCC 23134] gi|123990140|gb|EAY29654.1| conserved hypothetical protein [Microscilla marina ATCC 23134] Length = 341 Score = 36.7 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 31/142 (21%), Positives = 55/142 (38%), Gaps = 25/142 (17%) Query: 257 YFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPE-------SGMD------------- 296 Y SD+RLK S + QLK + + Y E SG Sbjct: 113 YTTSDKRLKSNINPIEGS-LTKLLQLKGVSYQYTFELNKYGDLSGEKITEIKQKTIDADK 171 Query: 297 ---SELRYLIGFSENNLKEINEVFVDT-IGGYLAPNPSVIIPHLAKAIQELLQEVKELRD 352 S+ + +GF +L+++ V G+L+ N S ++P L +A++E ++ L + Sbjct: 172 PYTSKAKQRLGFIAQDLQQVLPEAVAKDEKGFLSVNYSEVVPLLVEAMKEQQAKIDALEN 231 Query: 353 MIDKQNEEHQDVQDMSNTPLNS 374 ++ NT +S Sbjct: 232 ALNDLKSGASKRSTERNTTTSS 253 >gi|66809965|ref|XP_638706.1| NDT80/PhoG-like protein [Dictyostelium discoideum AX4] gi|74854295|sp|Q54PT9|RCDK_DICDI RecName: Full=Protein rcdK gi|60467247|gb|EAL65280.1| NDT80/PhoG-like protein [Dictyostelium discoideum AX4] Length = 932 Score = 36.7 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 51/121 (42%), Gaps = 19/121 (15%) Query: 248 TDKGAVGCSYFLSDERLKKVHGE-SSASAREIIEQLKFIDFNYLPE----SGMDSEL-RY 301 T + G Y SD R+K S S + + ++K D+ Y P+ +G D L Sbjct: 756 TIYASEGV-YHPSDLRIKYDLKSIDSKSNLDNVNRMKLYDYKYNPQWTHMNGRDPYLDNC 814 Query: 302 LIGFSENNLKEINEVFVDTIGG-----------YLAPNPSVIIPHLAKAIQELLQEVKEL 350 G +L+ I V TIG + N ++++ + A QEL +++ E+ Sbjct: 815 DRGVIAQDLQRILPNAVRTIGNKNVNGQEIENLLVIKNEALVMETIG-ATQELSKQMDEM 873 Query: 351 R 351 + Sbjct: 874 K 874 >gi|328874731|gb|EGG23096.1| NDT80/PhoG-like protein [Dictyostelium fasciculatum] Length = 1272 Score = 36.7 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 27/121 (22%), Positives = 45/121 (37%), Gaps = 10/121 (8%) Query: 257 YFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPESGMD---------SELRYLIGFSE 307 Y SD R+K S++ + I++LK +++ G D + + GF Sbjct: 814 YKPSDRRIKSNIVRDSSNHWDKIDRLKIYNYDRKKMPGYDNLAASATSSATMVKEKGFLA 873 Query: 308 NNLKEINEVFVDTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQ-NEEHQDVQD 366 L+E+ V G + S I L + LL E I + EH + + Sbjct: 874 QELREVIPNAVSVAGEVRLQDGSTIPNLLVVNDRVLLLENIGATQQIGRTLRREHDHIIE 933 Query: 367 M 367 M Sbjct: 934 M 934 >gi|311106478|ref|YP_003979331.1| arginase [Achromobacter xylosoxidans A8] gi|310761167|gb|ADP16616.1| arginase [Achromobacter xylosoxidans A8] Length = 320 Score = 36.7 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 60/170 (35%), Gaps = 10/170 (5%) Query: 72 GKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQNL 131 G +A L + P Q G GQ I GW + VD R V + L ++ L Sbjct: 157 GMPVACLCGYGPAQITGLSGQ-TPDIEPGWVRQIGIRSVDQGEKRLVHEAGLEVFDMRYL 215 Query: 132 VDHAHKPNHIVYTTDSNQYAATPLS---SFMRKLFSCHNGETLHQAIFQNGAQFYNNSQR 188 + + ++ +S F+ + G T+ G Y +Q Sbjct: 216 DEMGMRATMEAALAGLDEDTHLHVSFDVDFLDPEIAPGVGTTVA------GGPTYREAQL 269 Query: 189 IAAFLDDGDIMCYKRQKTVWQGIEIAQHTANIALESTKNCLYKTTVLNMG 238 + D +M + ++ TA +A++ ++ K+T++ G Sbjct: 270 CMEMIADTGLMRSLDVVELNPAFDVRNKTAELAVDLIESLFGKSTLMRRG 319 >gi|9634179|ref|NP_037718.1| tail fiber [Enterobacteria phage HK97] gi|6901608|gb|AAF31111.1| tail fiber [Enterobacteria phage HK97] Length = 321 Score = 36.4 bits (82), Expect = 6.9, Method: Composition-based stats. Identities = 15/35 (42%), Positives = 19/35 (54%) Query: 38 NNSPRPISAGGTAADNAVQARVNLGTDDASNLTKG 72 +N ++ GGT A NA AR NLG + S L G Sbjct: 136 SNGAVTVADGGTGATNAADARTNLGLGEGSALPVG 170 >gi|42523973|ref|NP_969353.1| hypothetical protein Bd2548 [Bdellovibrio bacteriovorus HD100] gi|39576181|emb|CAE80346.1| hypothetical protein Bd2548 [Bdellovibrio bacteriovorus HD100] Length = 1660 Score = 36.4 bits (82), Expect = 7.2, Method: Composition-based stats. Identities = 19/114 (16%), Positives = 35/114 (30%) Query: 43 PISAGGTAADNAVQARVNLGTDDASNLTKGKIIANLLPFYPIQQGGGIGQLDNKIYLGWN 102 P++ GGT A A LG + A + + K I + G N G Sbjct: 530 PVTKGGTGLSAAGTANQLLGMNGAGSALEYKTIPTCGANQYVTFNGTSFSCVNGAPSGSA 589 Query: 103 GTQLLLQVDLSSMREVWTSQLAPRALQNLVDHAHKPNHIVYTTDSNQYAATPLS 156 G L +++ T+ + + + + TP++ Sbjct: 590 GGDLTGTYPNPTLKATGTAGTYTKVTTDAQGRVTSGAAVAAADITTALGYTPVN 643 >gi|124002007|ref|ZP_01686861.1| cell wall surface anchor family protein, putative [Microscilla marina ATCC 23134] gi|123992473|gb|EAY31818.1| cell wall surface anchor family protein, putative [Microscilla marina ATCC 23134] Length = 609 Score = 36.4 bits (82), Expect = 7.2, Method: Composition-based stats. Identities = 25/108 (23%), Positives = 47/108 (43%), Gaps = 5/108 (4%) Query: 256 SYFLSDERLKKVHGESSASAREIIEQLKFIDFNYLPE--SGMDSELRYLIGFSENNLKEI 313 + SD R KK + + QL+ + + E D +GF ++E+ Sbjct: 483 TQLTSDIRFKKDIMPITHEVIAKLGQLQGNTYQWRTEEFKNRDFAEGTQLGFIAQEMQEV 542 Query: 314 NEVFV--DTIGGYLAPNPSVIIPHLAKAIQELLQEVKELRDMIDKQNE 359 V D G L+ N + +IP L +A+++L + +E ++ +NE Sbjct: 543 YPELVNEDHQGD-LSINYTGLIPVLTEALKDLNNKSEEAAKDLNNKNE 589 >gi|126464512|ref|YP_001045625.1| trimethylamine-N-oxide reductase c-type cytochrome TorC [Rhodobacter sphaeroides ATCC 17029] gi|126106323|gb|ABN78853.1| trimethylamine-N-oxide reductase c-type cytochrome TorC [Rhodobacter sphaeroides ATCC 17029] Length = 392 Score = 36.4 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 43/116 (37%), Gaps = 8/116 (6%) Query: 71 KGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQN 130 K K LL P+Q G GW + + ++ + LAP A + Sbjct: 228 KAKADGRLLAATPMQVVDVKGDWVQVAVKGWQQEGAERVIYEKQGKRIFNAALAPTATGS 287 Query: 131 LVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNS 186 +V A +V + L++++R + +A++Q G Q YN + Sbjct: 288 IVAGA----SMVDPDTEQTWTDVSLTAWVRNRDLTDD----QEALWQYGKQMYNGA 335 >gi|332560657|ref|ZP_08414975.1| trimethylamine-N-oxide reductase c-type cytochrome TorC [Rhodobacter sphaeroides WS8N] gi|332274455|gb|EGJ19771.1| trimethylamine-N-oxide reductase c-type cytochrome TorC [Rhodobacter sphaeroides WS8N] Length = 392 Score = 36.4 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 24/116 (20%), Positives = 43/116 (37%), Gaps = 8/116 (6%) Query: 71 KGKIIANLLPFYPIQQGGGIGQLDNKIYLGWNGTQLLLQVDLSSMREVWTSQLAPRALQN 130 K K LL P+Q G GW + + ++ + LAP A + Sbjct: 228 KAKANGRLLAATPMQVVDVKGDWVQVAVKGWQQEGAERVIYEKQGKRIFNAALAPTATGS 287 Query: 131 LVDHAHKPNHIVYTTDSNQYAATPLSSFMRKLFSCHNGETLHQAIFQNGAQFYNNS 186 +V A +V + L++++R + +A++Q G Q YN + Sbjct: 288 IVAGA----SMVDPDTEQTWTDVSLTAWVRNRDLTDD----QEALWQYGKQMYNGA 335 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.309 0.120 0.313 Lambda K H 0.267 0.0362 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5,847,656,280 Number of Sequences: 14124377 Number of extensions: 218144902 Number of successful extensions: 679295 Number of sequences better than 10.0: 233 Number of HSP's better than 10.0 without gapping: 88 Number of HSP's successfully gapped in prelim test: 265 Number of HSP's that attempted gapping in prelim test: 678942 Number of HSP's gapped (non-prelim): 493 length of query: 378 length of database: 4,842,793,630 effective HSP length: 141 effective length of query: 237 effective length of database: 2,851,256,473 effective search space: 675747784101 effective search space used: 675747784101 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 81 (36.0 bits)