BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780448|ref|YP_003064861.1| hypothetical protein CLIBASIA_01665 [Candidatus Liberibacter asiaticus str. psy62] (311 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780448|ref|YP_003064861.1| hypothetical protein CLIBASIA_01665 [Candidatus Liberibacter asiaticus str. psy62] gi|254040125|gb|ACT56921.1| hypothetical protein CLIBASIA_01665 [Candidatus Liberibacter asiaticus str. psy62] Length = 311 Score = 443 bits (1139), Expect = e-122, Method: Composition-based stats. Identities = 311/311 (100%), Positives = 311/311 (100%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA 60 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA Sbjct: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA 60 Query: 61 TVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQT 120 TVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQT Sbjct: 61 TVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQT 120 Query: 121 GNISDATKASSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNL 180 GNISDATKASSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNL Sbjct: 121 GNISDATKASSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNL 180 Query: 181 LDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHEHIFCKPFTKDLL 240 LDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHEHIFCKPFTKDLL Sbjct: 181 LDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHEHIFCKPFTKDLL 240 Query: 241 NLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLNGSLHRNYGIKAEGILSRNA 300 NLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLNGSLHRNYGIKAEGILSRNA Sbjct: 241 NLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLNGSLHRNYGIKAEGILSRNA 300 Query: 301 PPELHFSSYVN 311 PPELHFSSYVN Sbjct: 301 PPELHFSSYVN 311 >gi|254780449|ref|YP_003064862.1| hypothetical protein CLIBASIA_01670 [Candidatus Liberibacter asiaticus str. psy62] gi|254040126|gb|ACT56922.1| hypothetical protein CLIBASIA_01670 [Candidatus Liberibacter asiaticus str. psy62] Length = 459 Score = 346 bits (888), Expect = 2e-93, Method: Composition-based stats. Identities = 118/384 (30%), Positives = 179/384 (46%), Gaps = 75/384 (19%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA 60 MIP+YGT +EFKKGNYGWG +G +SD ALL Y + LVRG SIA K T G Sbjct: 76 MIPIYGTYREFKKGNYGWGIVGAISDAALLIPVVGYGARAAINLVRGGSIALKAGTAGTM 135 Query: 61 TVVQEATVMTKTTQEGA---LLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQ-- 115 +EA + + T++ A L KEGI + +EG S IKSES+G K IS++ ++ Sbjct: 136 IAAKEACTIAQATEKTAKLTALTKEGITSIRTIEGSSVTIKSESIGTKASISSTNTAEKS 195 Query: 116 ---------TVTQTGNISDA-------------------------TKA------------ 129 + T+ G ++ TKA Sbjct: 196 AISQKITTNSTTEIGKTTEVVEESISKINSQLSKSTPQGIWTKALTKADPALESIYQRGK 255 Query: 130 --SSTIKDAQSIDR----SQAIFQKMPL------------EEYPRLQKIGINYFRDFK-L 170 S+TIK+ I++ ++AI +K+P E+ + FRDF+ L Sbjct: 256 IFSNTIKNNAFIEKLAHTTKAIDKKIPFIGNQWRDINTAHSEFKMVPLSDQTLFRDFQGL 315 Query: 171 LGTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHEHI 230 G N + +LD +RA+ FI +GKK+ D++ + +L F K+ +++QLISSYA++ I Sbjct: 316 CGKNIDNQFILDLNRAS-FIFNGKKLARDNSAEAIQKLMNQFAKNPKQLQLISSYANQSI 374 Query: 231 FCKPFTKDLLNLANKNIY-QLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLN---GSLHR 286 F + ++ Y S + TL + ++F AK V ++ G + Sbjct: 375 FADSVVHLMQSIPEFAKYASKSGSASKFTAKTLTNGEVAFTAKYTTKVQAVDKIAGKPLK 434 Query: 287 NYGIKAEGILSRNAPPELHFSSYV 310 YG+K GILS + EL S Y+ Sbjct: 435 EYGLKISGILSPDKATELQRSFYL 458 >gi|58384653|gb|AAW72673.1| hypothetical protein [Buchnera aphidicola (Cinara cedri)] Length = 303 Score = 179 bits (453), Expect = 6e-43, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 63/145 (43%), Gaps = 4/145 (2%) Query: 169 KLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHE 228 +LL + K+ + LD ++GKK+ D A +++ + P +F+ QLIS+Y H Sbjct: 159 ELLDSGKLEQKFLDHLHGKTVYLNGKKVLSDQAPDIMNAFRESVP-EFQTQQLISTYVHP 217 Query: 229 HIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYL---NGSLH 285 + + + N Y+Y+ + + + + + + + Sbjct: 218 EVLDVAWENLFKRHPGVINRTVDNEHYTYEIDEISPEMYKVAITKITDLQPSYSGDINEI 277 Query: 286 RNYGIKAEGILSRNAPPELHFSSYV 310 +G++A I++ N PE+ +S +V Sbjct: 278 HTHGMRAAMIITANFNPEMRYSFFV 302 >gi|116515293|ref|YP_802922.1| Yba3 [Buchnera aphidicola str. Cc (Cinara cedri)] gi|116257147|gb|ABJ90829.1| hypothetical protein BCc_379 [Buchnera aphidicola str. Cc (Cinara cedri)] Length = 303 Score = 177 bits (450), Expect = 1e-42, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 63/145 (43%), Gaps = 4/145 (2%) Query: 169 KLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHE 228 +LL + K+ + LD ++GKK+ D A +++ + P +F+ QLIS+Y H Sbjct: 159 ELLDSGKLEQKFLDHLHGKTVYLNGKKVLSDQAPDIMNAFRESVP-EFQTQQLISTYVHP 217 Query: 229 HIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYL---NGSLH 285 + + + N Y+Y+ + + + + + + + Sbjct: 218 EVLDVAWENLSKRHPGVINRTVDNEHYTYEIDEISPEMYKVAITKITDLQPSYSGDINEI 277 Query: 286 RNYGIKAEGILSRNAPPELHFSSYV 310 +G++A I++ N PE+ +S +V Sbjct: 278 HTHGMRAAMIITANFNPEMRYSFFV 302 >gi|315122104|ref|YP_004062593.1| hypothetical protein CKC_01770 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495506|gb|ADR52105.1| hypothetical protein CKC_01770 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 335 Score = 176 bits (447), Expect = 3e-42, Method: Composition-based stats. Identities = 125/308 (40%), Positives = 166/308 (53%), Gaps = 41/308 (13%) Query: 2 IPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIAT 61 IP+YGTIQ FK+ GWG LGI +DV L I Y K L+RGSS A A + Sbjct: 56 IPIYGTIQAFKEKESGWGILGITTDV-LTLIGIGYGIKGAAALIRGSSAAATAAMAAGTS 114 Query: 62 VVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTG 121 +EGG+ IKS +V L+ + + Sbjct: 115 T--------------------------AIEGGNALIKSSAV----LVESKNTLKHTDMGA 144 Query: 122 NISDATKASSTIKDAQSIDRSQAIFQKMPL-EEYPRLQKIGINYFRDFKLLGTNKVYKNL 180 N D+ + + S++I +P+ E PRL+KI NYF DFKLL NK YK L Sbjct: 145 NTIHI--------DSLAQNSSKSIGTIIPVVSENPRLKKIATNYFYDFKLLSPNKAYKGL 196 Query: 181 LDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHEHIFCKPFTKDLL 240 DAS+A+EFI++GKKINID+ + ML L +IFP DFEKVQLIS YAHE IF PFT L Sbjct: 197 RDASKASEFIVNGKKINIDTPEKMLENLKEIFPNDFEKVQLISCYAHEGIFDAPFTHKLF 256 Query: 241 NLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLNGSL-HRNYGIKAEGILSRN 299 ++ N Y SY+F+ L+D TI+F A +G + ++GS YG+K +GILS+ Sbjct: 257 SINNPKNYTGVKTYNSYKFDALEDGTINFSATYKGNFSPVDGSPSTHGYGVKVDGILSKQ 316 Query: 300 APPELHFS 307 + PELHF+ Sbjct: 317 SIPELHFT 324 >gi|315122103|ref|YP_004062592.1| hypothetical protein CKC_01765 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495505|gb|ADR52104.1| hypothetical protein CKC_01765 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 464 Score = 164 bits (415), Expect = 1e-38, Method: Composition-based stats. Identities = 97/393 (24%), Positives = 160/393 (40%), Gaps = 85/393 (21%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA 60 +IP+YGTIQ FKKG GWG G ++DV L Y K++ L RG + A KI+ G Sbjct: 67 LIPIYGTIQSFKKGEIGWGIFGAITDVLTLVPVVGYGAKMVGALARGGNAAIKISKAGAI 126 Query: 61 TVVQEATVMTKTTQEGALLAKEGIEATHIMEG------GSTAIKSES------------- 101 A+ T+ GA LA + + MEG GS +K+ S Sbjct: 127 AASATASTYAAATRGGAALADGALITKYAMEGESAFNAGSATVKATSSTLYESNVITKAA 186 Query: 102 --------------VGAKELISASQNSQTVTQTGNISDATKASSTIKDA----------- 136 + + ++ ++ + + + +T+ + Sbjct: 187 ESTHSTLDKAAILPISKNTIKTSVNTAEMDKIAAKVVEQSTKKTTLSNKKTIKQASKKFF 246 Query: 137 -------------------QSIDRSQAIFQKMPL-----------------EEYPRLQKI 160 +I +S++I +K+ EY + Sbjct: 247 IASLRAVDPGLELLYQGGKAAIRKSRSIPEKIMKTSHSLPKNTWKNIDSIPSEYKMVSLS 306 Query: 161 GINYFRDFKLLGTNK-VYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKV 219 + FR+FK L N+ + LLD +RA E+II+GK + + ++ LA L K F D +K+ Sbjct: 307 DESLFRNFKQLNKNELDQQFLLDLNRA-EYIINGKNMRDTNQKSQLAYLQKTFANDPQKL 365 Query: 220 QLISSYAHEHIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTY 279 Q+IS+YAH+ IF + + + N Y N + ++Q NTL ++ + AK + Sbjct: 366 QIISAYAHQGIFADGISYLMETIPNMLSYGSKNGKTTFQINTLGEEGVRLSAKYTASLVT 425 Query: 280 LNG---SLHRNYGIKAEGILSRNAPPELHFSSY 309 N + R YG+K + IL N P+ Y Sbjct: 426 ENQAIKNPLREYGLKIDTILFPNKAPQFTQYFY 458 >gi|257471634|ref|ZP_05635633.1| hypothetical protein BaphL_02905 [Buchnera aphidicola str. LSR1 (Acyrthosiphon pisum)] gi|311087470|gb|ADP67550.1| hypothetical protein CWS_03010 [Buchnera aphidicola str. JF99 (Acyrthosiphon pisum)] gi|311087954|gb|ADP68033.1| hypothetical protein CWU_03765 [Buchnera aphidicola str. JF98 (Acyrthosiphon pisum)] Length = 205 Score = 153 bits (387), Expect = 3e-35, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 80/164 (48%), Gaps = 5/164 (3%) Query: 151 LEEYPRLQKIGINYFRDFKLL-GTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELN 209 L + +Q ++ + ++ L +N + + ++ + F+++ I++++ ML + Sbjct: 42 LNSFNNIQSTIKHFQKKYEFLKSSNDLDNDFMNDTNPFLFVVNNGLISVNNRNKMLKDFK 101 Query: 210 KIFPKDFEKVQLISSYAHEHIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISF 269 I P + E QLIS+YA++ + + + + + YQ+ + R Y+ N L D ++ Sbjct: 102 TIVP-NVEFRQLISTYANQKFLRQSYLQLISEHPEIDQYQIKHSRNIYKINFLDDGSVKL 160 Query: 270 VAKEEGLVTYLNGSLHRNY---GIKAEGILSRNAPPELHFSSYV 310 VA + N + + Y GI+A IL NA P + +S ++ Sbjct: 161 VATNLSDLDVKNDNYIQKYKSFGIRATIILPPNASPIMKYSYFM 204 >gi|311086306|gb|ADP66388.1| hypothetical protein CWO_03065 [Buchnera aphidicola str. LL01 (Acyrthosiphon pisum)] gi|311086880|gb|ADP66961.1| hypothetical protein CWQ_03105 [Buchnera aphidicola str. TLW03 (Acyrthosiphon pisum)] Length = 205 Score = 152 bits (384), Expect = 5e-35, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 80/164 (48%), Gaps = 5/164 (3%) Query: 151 LEEYPRLQKIGINYFRDFKLL-GTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELN 209 L + +Q ++ + ++ L +N + + ++ + F+++ I++++ ML + Sbjct: 42 LNSFNNIQSTIKHFQKKYEFLKSSNDLDNDFMNDTNPFLFVVNDGLISVNNRNKMLKDFK 101 Query: 210 KIFPKDFEKVQLISSYAHEHIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISF 269 I P + E QLIS+YA++ + + + + + YQ+ + R Y+ N L D ++ Sbjct: 102 TIVP-NVEFRQLISTYANQKFLRQSYLQLISEHPEIDQYQIKHSRNIYKINFLDDGSVKL 160 Query: 270 VAKEEGLVTYLNGSLHRNY---GIKAEGILSRNAPPELHFSSYV 310 VA + N + + Y GI+A IL NA P + +S ++ Sbjct: 161 VATNLSDLDVKNDNYIQKYKSFGIRATIILPPNASPIMKYSYFM 204 >gi|219681925|ref|YP_002468311.1| hypothetical protein BUAP5A_577 [Buchnera aphidicola str. 5A (Acyrthosiphon pisum)] gi|219624768|gb|ACL30923.1| hypothetical protein BUAP5A_577 [Buchnera aphidicola str. 5A (Acyrthosiphon pisum)] Length = 367 Score = 151 bits (382), Expect = 9e-35, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 80/164 (48%), Gaps = 5/164 (3%) Query: 151 LEEYPRLQKIGINYFRDFKLL-GTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELN 209 L + +Q ++ + ++ L +N + + ++ + F+++ I++++ ML + Sbjct: 204 LNSFNNIQSTIKHFQKKYEFLKSSNDLDNDFMNDTNPFLFVVNNGLISVNNRNKMLKDFK 263 Query: 210 KIFPKDFEKVQLISSYAHEHIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISF 269 I P + E QLIS+YA++ + + + + + YQ+ + R Y+ N L D ++ Sbjct: 264 TIVP-NVEFRQLISTYANQKFLRQSYLQLISEHPEIDQYQIKHSRNIYKINFLDDGSVKL 322 Query: 270 VAKEEGLVTYLNGSLHRNY---GIKAEGILSRNAPPELHFSSYV 310 VA + N + + Y GI+A IL NA P + +S ++ Sbjct: 323 VATNLSDLDVKNDNYIQKYKSFGIRATIILPPNASPIMKYSYFM 366 >gi|219682480|ref|YP_002468864.1| hypothetical protein BUAPTUC7_578 [Buchnera aphidicola str. Tuc7 (Acyrthosiphon pisum)] gi|219622213|gb|ACL30369.1| hypothetical protein BUAPTUC7_578 [Buchnera aphidicola str. Tuc7 (Acyrthosiphon pisum)] Length = 367 Score = 151 bits (380), Expect = 2e-34, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 80/164 (48%), Gaps = 5/164 (3%) Query: 151 LEEYPRLQKIGINYFRDFKLL-GTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELN 209 L + +Q ++ + ++ L +N + + ++ + F+++ I++++ ML + Sbjct: 204 LNSFNNIQSTIKHFQKKYEFLKSSNDLDNDFMNDTNPFLFVVNDGLISVNNRNKMLKDFK 263 Query: 210 KIFPKDFEKVQLISSYAHEHIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISF 269 I P + E QLIS+YA++ + + + + + YQ+ + R Y+ N L D ++ Sbjct: 264 TIVP-NVEFRQLISTYANQKFLRQSYLQLISEHPEIDQYQIKHSRNIYKINFLDDGSVKL 322 Query: 270 VAKEEGLVTYLNGSLHRNY---GIKAEGILSRNAPPELHFSSYV 310 VA + N + + Y GI+A IL NA P + +S ++ Sbjct: 323 VATNLSDLDVKNDNYIQKYKSFGIRATIILPPNASPIMKYSYFM 366 >gi|15617174|ref|NP_240387.1| hypothetical protein BU584 [Buchnera aphidicola str. APS (Acyrthosiphon pisum)] gi|11387313|sp|P57644|Y584_BUCAI RecName: Full=Uncharacterized protein BU584; AltName: Full=yba3 gi|25373122|pir||A84998 hypothetical protein [imported] - Buchnera sp. (strain APS) gi|10039239|dbj|BAB13273.1| hypothetical protein [Buchnera aphidicola str. APS (Acyrthosiphon pisum)] Length = 367 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 80/164 (48%), Gaps = 5/164 (3%) Query: 151 LEEYPRLQKIGINYFRDFKLL-GTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELN 209 L + +Q ++ + ++ L +N + + ++ + F+++ I++++ ML + Sbjct: 204 LNSFNNIQSTIKHFQKKYEFLKSSNDLDNDFMNDTNPFLFVVNNALISVNNRNKMLKDFK 263 Query: 210 KIFPKDFEKVQLISSYAHEHIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISF 269 I P + E QLIS+YA++ + + + + + YQ+ + R Y+ N L D ++ Sbjct: 264 TIVP-NVEFRQLISTYANQKFLRQSYLQLISEHPEIDQYQIKHSRNIYKINFLDDGSVKL 322 Query: 270 VAKEEGLVTYLNGSLHRNY---GIKAEGILSRNAPPELHFSSYV 310 VA + N + + Y GI+A IL NA P + +S ++ Sbjct: 323 VATNLSDLDVKNDNYIQKYKSFGIRATIILPPNASPIMKYSYFM 366 >gi|27905000|ref|NP_778126.1| hypothetical protein bbp529 [Buchnera aphidicola str. Bp (Baizongia pistaciae)] gi|34098501|sp|Q89A26|Y529_BUCBP RecName: Full=Uncharacterized protein bbp_529; AltName: Full=yba3 gi|27904398|gb|AAO27231.1| hypothetical protein bbp_529 [Buchnera aphidicola str. Bp (Baizongia pistaciae)] Length = 398 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 73/157 (46%), Gaps = 7/157 (4%) Query: 160 IGINYFRDF-KLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEK 218 G +F F L T+++ + + + I+G++++ S + ML + P D Sbjct: 242 TGKKFFNRFMSLRNTSRLDDSFIFEGEHSILQINGQQVSKYSPKTMLDDFKTAIP-DLAS 300 Query: 219 VQLISSYAHEHIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISFVAKE----E 274 QLISS++H+ IF +P+ + + ++ + ++SY + ++D F A E Sbjct: 301 RQLISSFSHQGIFSQPYIELFSEHPDLVKFKPKDSQFSYVVHEVEDGVFQFTATSQADLE 360 Query: 275 GLVTYLNGSLHRNYGIKAEGILSRNAPPE-LHFSSYV 310 + + +G++ LS++ PE + +S Y+ Sbjct: 361 SSYETSDHKKYNAFGVQVSMTLSKDKSPEDVEYSYYL 397 >gi|320538773|ref|ZP_08038451.1| hypothetical protein SSYM_0353 [Serratia symbiotica str. Tucson] gi|320031162|gb|EFW13163.1| hypothetical protein SSYM_0353 [Serratia symbiotica str. Tucson] Length = 210 Score = 84.4 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 57/192 (29%), Positives = 86/192 (44%), Gaps = 15/192 (7%) Query: 126 ATKASSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLG--TNKVYKNLLDA 183 A+ SSTI + ++K P P L I I DF + T + LLD Sbjct: 26 ASNISSTIVKSADALEDFGKWKKFP-SSVPVLDTISI-LQDDFHKIKVTTQLSEQFLLDF 83 Query: 184 SRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHEHIFCKPFTKDLLNLA 243 RAT + +DGK I + K P D +K QLISSYA++ P + Sbjct: 84 DRAT-YKVDGKVIPRGD----YIQFEKAIP-DIKKRQLISSYANQMSLADPSIGAMSFFP 137 Query: 244 N-KNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLN----GSLHRNYGIKAEGILSR 298 + + N SY+ + D I +AK E +T ++ ++ +YG+KAE LS Sbjct: 138 DSFAKHGAHNSNVSYEIWNIPDSKIKLIAKVESKLTPVDLADGEKIYSSYGLKAEMTLSE 197 Query: 299 NAPPELHFSSYV 310 N PP+ ++S Y+ Sbjct: 198 NTPPKYNYSYYL 209 >gi|163800536|ref|ZP_02194437.1| inner membrane protein, putative [Vibrio sp. AND4] gi|159175979|gb|EDP60773.1| inner membrane protein, putative [Vibrio sp. AND4] Length = 1581 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 60/296 (20%), Positives = 112/296 (37%), Gaps = 41/296 (13%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA 60 ++PV+ T+++F+KGNY G+LG++SDV GK F TG Sbjct: 924 IVPVWATVEDFQKGNYYMGSLGLLSDVGFFLPI----GKAAFN-------------TGKM 966 Query: 61 TVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIK---SESVGAKELISASQNSQTV 117 T + + K + + + I + + G + I + + +S + Sbjct: 967 TGKVVSHSLPKKYKVSSAINVVQIGSNKRVVGETFDIDLQVARKQAKEAWLSLPKTVFDQ 1026 Query: 118 TQTGNISDATKA--SSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNK 175 I+D K S+++K Q R+Q I +K P K+ + R+F++ ++ Sbjct: 1027 MNVLPITDIIKGAKSASLKIKQ---RAQKIKRK---NTDPYAPKVYQS--RNFRISSKDR 1078 Query: 176 VYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISS--YAHEHIFCK 233 D + I + +I I + + + I D L+ S Y+ + + K Sbjct: 1079 PD---FDPDKVVG-IPNADRIAIANIADEENVIIGIRAVDPNNRSLLESGLYSSKSLLIK 1134 Query: 234 PFTKDLLNLANKN----IYQLSNPRYSY-QFNTLKDKTISFVAKEEGLVTYLNGSL 284 + D A Y ++ R +FN D I+ A + +T L Sbjct: 1135 SKSSDWGPHAGFIPVDQKYAKASARAQVDRFNQYSDNAINSGAAKPVQLTINEDRL 1190 >gi|297579269|ref|ZP_06941197.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297536863|gb|EFH75696.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 1590 Score = 45.5 bits (106), Expect = 0.010, Method: Composition-based stats. Identities = 59/320 (18%), Positives = 115/320 (35%), Gaps = 43/320 (13%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA 60 ++PV+ T+++F+KG+YG +LG++SD+ GK F TTG Sbjct: 941 IVPVWATVEDFQKGDYGMASLGLISDIGFFLPI----GKAAF-------------TTGKM 983 Query: 61 TVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIK---SESVGAKELISASQNSQTV 117 T + + + + + L I + + G + I + + +S + Sbjct: 984 TGKVISKSLPRKYKVSSALNVLEIGSNKRVVGEAFDIDLKVARKQAKEAWLSLPKTIFDQ 1043 Query: 118 TQTGNISDATKA--SSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNK 175 ++D K S+++K Q + + + P K+ + RDF++ ++ Sbjct: 1044 MNVLPVTDIVKGAKSASLKIKQRVQKIKRKNTD------PYAPKVYQS--RDFRISSKDR 1095 Query: 176 VYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISS--YAHEHIFCK 233 D + +K+ + A + I D L+ S Y+ + + K Sbjct: 1096 PD---FDPDKVVGIPSTDRKVIANIADEE-NVVIGIRAVDPNNRSLLESGLYSSKSLLIK 1151 Query: 234 PFTKDLLNLANKN----IYQLSNPRYSY-QFNTLKDKTISFVAKEEGLVTYLNGSLHRNY 288 + D A Y ++ R +FN D I+ A +T + L Sbjct: 1152 SKSSDWGPHAGFIPVDQKYAKASARAQVDRFNHYSDNAINSGAAVPVPLTIDDKRLTELQ 1211 Query: 289 GIKAEGILSRNAPPE--LHF 306 G I PE +H+ Sbjct: 1212 GGDNPLIPELKYNPETGMHY 1231 >gi|307206898|gb|EFN84744.1| Papilin [Harpegnathos saltator] Length = 2983 Score = 43.2 bits (100), Expect = 0.046, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 51/110 (46%), Gaps = 11/110 (10%) Query: 48 SSIATKIATTGIATVVQEATVMTKTTQ---EGALLAKE---GIEATHIMEGGST----AI 97 + +K TT T V E++ +T T+ EG+ E G ++ MEG + A Sbjct: 746 GATDSKAGTTEFGTEVTESSDITDETETTVEGSGTTIEEESGATSSTDMEGSTMEGTQAA 805 Query: 98 KSESVGAK-ELISASQNSQTVTQTGNISDATKASSTIKDAQSIDRSQAIF 146 +ES G K E+ S +S T++G + + + ST+ ++ +++ Sbjct: 806 TTESGGTKSEMTEMSTDSSATTESGGTTSSAEVESTVNESSEPTTTESSD 855 >gi|290994126|ref|XP_002679683.1| midasin [Naegleria gruberi] gi|284093301|gb|EFC46939.1| midasin [Naegleria gruberi] Length = 5497 Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 67/206 (32%), Gaps = 37/206 (17%) Query: 111 SQNSQTVTQTGNISDATKASSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKL 170 + + + + ++ D + S+ I D Q A+ + + + + I R L Sbjct: 2514 TNDVEKTIISADVMDTSSTSNLINDMQY-----ALEYFIDVSSFKDIDIRVIWLQRLLTL 2568 Query: 171 LGTNKVYKNLL-------DASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLIS 223 ++K+ + LL + + E +KI + Q + ++I ++ Sbjct: 2569 CKSDKILEELLTKGIELLNTRKGCEVSKKIQKIFKNMQQTVTQRFSEIIASQPMNLK--- 2625 Query: 224 SYAHEHIFCKPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLNGS 283 Y+ + + + N + +Y F+ A Sbjct: 2626 -YSEQLFARLQYLGLI------------NEQDNYVFD---------AATILTSQQKRKYE 2663 Query: 284 LHRNYGIKAEGILSRNAPPELHFSSY 309 ++ ++ +G + N LH S Y Sbjct: 2664 EQQSIAVEFQGTIVENKSSLLHQSYY 2689 >gi|78047500|ref|YP_363675.1| methyl-accepting chemotaxis protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78035930|emb|CAJ23621.1| methyl-accepting chemotaxis protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 801 Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 7/86 (8%) Query: 49 SIATKIATTGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELI 108 S A++ + GI V Q T M +TTQ+ A L +E A +E ++VG E + Sbjct: 693 SAASQEQSAGIEQVNQTVTQMDETTQQNAALVEEATAAARALE-------EQAVGLTEAV 745 Query: 109 SASQNSQTVTQTGNISDATKASSTIK 134 + + Q T T + KA+ ++ Sbjct: 746 AVFKTEQGTTSTVRTASVAKAAPAVR 771 >gi|261211805|ref|ZP_05926092.1| hypothetical protein VCJ_002068 [Vibrio sp. RC341] gi|260839155|gb|EEX65787.1| hypothetical protein VCJ_002068 [Vibrio sp. RC341] Length = 1480 Score = 41.2 bits (95), Expect = 0.17, Method: Composition-based stats. Identities = 15/42 (35%), Positives = 28/42 (66%), Gaps = 4/42 (9%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLF 42 ++PV+GT+++F++G++G +LG++SDV GK F Sbjct: 830 IVPVWGTVEDFQRGDHGMASLGLISDVGFFLPI----GKAAF 867 >gi|58615294|gb|AAW80256.1| adenylate cyclase-like protein [Vibrio cholerae] Length = 1046 Score = 41.2 bits (95), Expect = 0.18, Method: Composition-based stats. Identities = 15/42 (35%), Positives = 28/42 (66%), Gaps = 4/42 (9%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLF 42 ++PV+GT+++F++G++G +LG++SDV GK F Sbjct: 397 IVPVWGTVEDFQRGDHGMASLGLISDVGFFLPI----GKAAF 434 >gi|116331650|ref|YP_801368.1| TPR repeat-containing protein [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116125339|gb|ABJ76610.1| TPR-repeat protein [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 662 Score = 41.2 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 28/117 (23%), Positives = 44/117 (37%), Gaps = 5/117 (4%) Query: 174 NKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKV-QLISSYAHEHIFC 232 N ++ L R F + D A + LA ++ P E Q +S Y F Sbjct: 328 NHSFRRKLGEYRMQRFHSSKNSLLYDMASHHLAAARELIPGQPEVQFQTLSEYKRTGFFP 387 Query: 233 KPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLNGSLHRNYG 289 + + LL N +Y Y+ L + +A +EGL+ +L NYG Sbjct: 388 R-YLNLLLF---LRKKYPENQKYQYKIENLLNSMKQSIAYKEGLLEITGDNLVENYG 440 >gi|116327689|ref|YP_797409.1| TPR repeat-containing protein [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116120433|gb|ABJ78476.1| TPR-repeat protein [Leptospira borgpetersenii serovar Hardjo-bovis L550] Length = 662 Score = 41.2 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 28/117 (23%), Positives = 44/117 (37%), Gaps = 5/117 (4%) Query: 174 NKVYKNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKV-QLISSYAHEHIFC 232 N ++ L R F + D A + LA ++ P E Q +S Y F Sbjct: 328 NHSFRRKLGEYRMQRFHSSKNSLLYDMASHHLAAARELIPGQPEVQFQTLSEYKRTGFFP 387 Query: 233 KPFTKDLLNLANKNIYQLSNPRYSYQFNTLKDKTISFVAKEEGLVTYLNGSLHRNYG 289 + + LL N +Y Y+ L + +A +EGL+ +L NYG Sbjct: 388 R-YLNLLLF---LRKKYPENQKYQYKIENLLNSMKQSIAYKEGLLEITGDNLVENYG 440 >gi|312901179|ref|ZP_07760464.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0470] gi|311291698|gb|EFQ70254.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0470] Length = 835 Score = 41.2 bits (95), Expect = 0.21, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|315573637|gb|EFU85828.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0309B] Length = 835 Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|315151244|gb|EFT95260.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0012] Length = 835 Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|229547023|ref|ZP_04435748.1| glycosy hydrolase family protein [Enterococcus faecalis TX1322] gi|257421445|ref|ZP_05598435.1| glycosyl hydrolase [Enterococcus faecalis X98] gi|307276351|ref|ZP_07557477.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX2134] gi|312952546|ref|ZP_07771413.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0102] gi|229307951|gb|EEN73938.1| glycosy hydrolase family protein [Enterococcus faecalis TX1322] gi|257163269|gb|EEU93229.1| glycosyl hydrolase [Enterococcus faecalis X98] gi|306507016|gb|EFM76160.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX2134] gi|310629544|gb|EFQ12827.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0102] gi|315153441|gb|EFT97457.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0031] gi|315156348|gb|EFU00365.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0043] gi|315158457|gb|EFU02474.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0312] gi|323479344|gb|ADX78783.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis 62] Length = 835 Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|307278233|ref|ZP_07559314.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0860] gi|306505108|gb|EFM74297.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0860] Length = 849 Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 447 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 503 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 504 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 558 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 559 NGKKYNHNRVSKAIQRGNNAYYNDP 583 >gi|315171379|gb|EFU15396.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX1342] Length = 845 Score = 40.9 bits (94), Expect = 0.22, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 443 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 499 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 500 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 554 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 555 NGKKYNHNRVSKAIQRGNNAYYNDP 579 >gi|307274117|ref|ZP_07555326.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0855] gi|312903861|ref|ZP_07763033.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0635] gi|306509242|gb|EFM78303.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0855] gi|310632805|gb|EFQ16088.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0635] gi|315160839|gb|EFU04856.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0645] gi|315576998|gb|EFU89189.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0630] gi|327533960|gb|AEA92794.1| family 20 glycosyl hydrolase [Enterococcus faecalis OG1RF] Length = 849 Score = 40.9 bits (94), Expect = 0.23, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 447 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 503 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 504 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 558 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 559 NGKKYNHNRVSKAIQRGNNAYYNDP 583 >gi|29374765|ref|NP_813917.1| glycosy hydrolase family protein [Enterococcus faecalis V583] gi|227555794|ref|ZP_03985841.1| glycosy hydrolase family protein [Enterococcus faecalis HH22] gi|300862276|ref|ZP_07108356.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TUSoD Ef11] gi|29342223|gb|AAO79989.1| glycosyl hydrolase, family 20 [Enterococcus faecalis V583] gi|227175090|gb|EEI56062.1| glycosy hydrolase family protein [Enterococcus faecalis HH22] gi|295112425|emb|CBL31062.1| N-acetyl-beta-hexosaminidase [Enterococcus sp. 7L76] gi|300848801|gb|EFK76558.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TUSoD Ef11] gi|315143199|gb|EFT87215.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX2141] gi|315582029|gb|EFU94220.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0309A] Length = 835 Score = 40.9 bits (94), Expect = 0.23, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|293384888|ref|ZP_06630726.1| beta-N-acetylhexosaminidase [Enterococcus faecalis R712] gi|293388578|ref|ZP_06633079.1| beta-N-acetylhexosaminidase [Enterococcus faecalis S613] gi|312908700|ref|ZP_07767640.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis DAPTO 512] gi|312979161|ref|ZP_07790867.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis DAPTO 516] gi|291077806|gb|EFE15170.1| beta-N-acetylhexosaminidase [Enterococcus faecalis R712] gi|291082036|gb|EFE18999.1| beta-N-acetylhexosaminidase [Enterococcus faecalis S613] gi|310625299|gb|EFQ08582.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis DAPTO 512] gi|311288033|gb|EFQ66589.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis DAPTO 516] Length = 835 Score = 40.9 bits (94), Expect = 0.24, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|255971653|ref|ZP_05422239.1| predicted protein [Enterococcus faecalis T1] gi|255962671|gb|EET95147.1| predicted protein [Enterococcus faecalis T1] Length = 745 Score = 40.9 bits (94), Expect = 0.26, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 343 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 399 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 400 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 454 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 455 NGKKYNHNRVSKAIQRGNNAYYNDP 479 >gi|256761959|ref|ZP_05502539.1| predicted protein [Enterococcus faecalis T3] gi|256683210|gb|EEU22905.1| predicted protein [Enterococcus faecalis T3] Length = 745 Score = 40.9 bits (94), Expect = 0.26, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 343 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 399 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 400 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 454 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 455 NGKKYNHNRVSKAIQRGNNAYYNDP 479 >gi|255974618|ref|ZP_05425204.1| predicted protein [Enterococcus faecalis T2] gi|255967490|gb|EET98112.1| predicted protein [Enterococcus faecalis T2] Length = 745 Score = 40.9 bits (94), Expect = 0.26, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 343 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 399 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 400 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 454 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 455 NGKKYNHNRVSKAIQRGNNAYYNDP 479 >gi|307287067|ref|ZP_07567140.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0109] gi|306501846|gb|EFM71136.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0109] gi|315165753|gb|EFU09770.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX1302] Length = 835 Score = 40.5 bits (93), Expect = 0.29, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTKES---VATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|315026702|gb|EFT38634.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX2137] Length = 845 Score = 40.5 bits (93), Expect = 0.30, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 443 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTSAQVTTWQTEVQTALDNLKEKQTQ 499 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 500 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 554 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 555 NGKKYNHNRVSKAIQRGNNAYYNDP 579 >gi|315172603|gb|EFU16620.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX1346] Length = 849 Score = 40.5 bits (93), Expect = 0.31, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 447 AIEKAESYTPETIQALQTTKES---VATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 503 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 504 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 558 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 559 NGKKYNHNRVSKAIQRGNNAYYNDP 583 >gi|284008185|emb|CBA74448.1| conserved pentapeptide repeat protein [Arsenophonus nasoniae] Length = 1253 Score = 40.5 bits (93), Expect = 0.31, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 3/84 (3%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA 60 +IP Y TI E +KGN G + DV+ +L K R S + A G+ Sbjct: 617 LIPFYTTITEVRKGNTGQAVGAALFDVSGFFP---FLAKAAHISNRFSIAVGEAAVNGLQ 673 Query: 61 TVVQEATVMTKTTQEGALLAKEGI 84 T +++AT+ Q L K GI Sbjct: 674 TALKQATLRQALHQGAKQLLKSGI 697 >gi|38426864|gb|AAR20477.1| EndoE [Enterococcus faecalis] Length = 837 Score = 40.5 bits (93), Expect = 0.31, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTKES---VATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|229550597|ref|ZP_04439322.1| EndoE family protein [Enterococcus faecalis ATCC 29200] gi|229304316|gb|EEN70312.1| EndoE family protein [Enterococcus faecalis ATCC 29200] Length = 835 Score = 40.5 bits (93), Expect = 0.31, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAESYTPETIQALQTTKES---VATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|227519952|ref|ZP_03950001.1| glycosy hydrolase family protein [Enterococcus faecalis TX0104] gi|227072500|gb|EEI10463.1| glycosy hydrolase family protein [Enterococcus faecalis TX0104] Length = 835 Score = 40.1 bits (92), Expect = 0.41, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAENYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|256855086|ref|ZP_05560447.1| glycosyl hydrolase, family 20 [Enterococcus faecalis T8] gi|307291577|ref|ZP_07571453.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0411] gi|256709599|gb|EEU24646.1| glycosyl hydrolase, family 20 [Enterococcus faecalis T8] gi|306497338|gb|EFM66879.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0411] gi|315028710|gb|EFT40642.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX4000] Length = 835 Score = 40.1 bits (92), Expect = 0.41, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 433 AIEKAENYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 489 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 490 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 544 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 545 NGKKYNHNRVSKAIQRGNNAYYNDP 569 >gi|315168170|gb|EFU12187.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX1341] Length = 849 Score = 40.1 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 49/132 (37%), Gaps = 5/132 (3%) Query: 85 EATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASSTIKDAQSIDRSQA 144 E ++ + +E G + QT QT + K + +K SID + Sbjct: 457 ETIQALQTTKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQPLKSVFSIDAGRK 516 Query: 145 IFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNM 204 F LEE + K N + D +L+ N + +LD ++GKK N + Sbjct: 517 YFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNVNGKKYNHNRVSKA 571 Query: 205 LAELNKIFPKDF 216 + N + D Sbjct: 572 IQRGNNAYYNDP 583 >gi|315147044|gb|EFT91060.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX4244] Length = 835 Score = 40.1 bits (92), Expect = 0.48, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 49/132 (37%), Gaps = 5/132 (3%) Query: 85 EATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASSTIKDAQSIDRSQA 144 E ++ + +E G + QT QT + K + +K SID + Sbjct: 443 ETIQALQTTKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQPLKSVFSIDAGRK 502 Query: 145 IFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNM 204 F LEE + K N + D +L+ N + +LD ++GKK N + Sbjct: 503 YFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNVNGKKYNHNRVSKA 557 Query: 205 LAELNKIFPKDF 216 + N + D Sbjct: 558 IQRGNNAYYNDP 569 >gi|257081452|ref|ZP_05575813.1| glycosyl hydrolase [Enterococcus faecalis E1Sol] gi|256989482|gb|EEU76784.1| glycosyl hydrolase [Enterococcus faecalis E1Sol] Length = 545 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + + +E G + QT QT + K + Sbjct: 143 AIEKAESYTPETIQALQTTK---ETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 199 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 200 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 254 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 255 NGKKYNHNRVSKAIQRGNNAYYNDP 279 >gi|257417531|ref|ZP_05594525.1| lacto-N-biosidase [Enterococcus faecalis AR01/DG] gi|257159359|gb|EEU89319.1| lacto-N-biosidase [Enterococcus faecalis ARO1/DG] Length = 545 Score = 39.7 bits (91), Expect = 0.51, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 52/145 (35%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + +E G + QT QT + K + Sbjct: 143 AIEKAESYTPETIQALQT---TKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 199 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 200 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 254 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 255 NGKKYNHNRVSKAIQRGNNAYYNDP 279 >gi|256956911|ref|ZP_05561082.1| predicted protein [Enterococcus faecalis DS5] gi|307269030|ref|ZP_07550393.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX4248] gi|256947407|gb|EEU64039.1| predicted protein [Enterococcus faecalis DS5] gi|306514699|gb|EFM83251.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX4248] gi|315031235|gb|EFT43167.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0017] gi|315036438|gb|EFT48370.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis TX0027] Length = 849 Score = 39.7 bits (91), Expect = 0.51, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 49/132 (37%), Gaps = 5/132 (3%) Query: 85 EATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASSTIKDAQSIDRSQA 144 E ++ + +E G + QT QT + K + +K SID + Sbjct: 457 ETIQALQTTKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQPLKSVFSIDAGRK 516 Query: 145 IFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNM 204 F LEE + K N + D +L+ N + +LD ++GKK N + Sbjct: 517 YFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNVNGKKYNHNRVSKA 571 Query: 205 LAELNKIFPKDF 216 + N + D Sbjct: 572 IQRGNNAYYNDP 583 >gi|294781540|ref|ZP_06746878.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis PC1.1] gi|294451397|gb|EFG19861.1| glycosyl hydrolase family 20, catalytic domain protein [Enterococcus faecalis PC1.1] Length = 835 Score = 39.7 bits (91), Expect = 0.52, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 49/132 (37%), Gaps = 5/132 (3%) Query: 85 EATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASSTIKDAQSIDRSQA 144 E ++ + +E G + QT QT + K + +K SID + Sbjct: 443 ETIQALQTTKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQPLKSVFSIDAGRK 502 Query: 145 IFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNM 204 F LEE + K N + D +L+ N + +LD ++GKK N + Sbjct: 503 YFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNVNGKKYNHNRVSKA 557 Query: 205 LAELNKIFPKDF 216 + N + D Sbjct: 558 IQRGNNAYYNDP 569 >gi|256964082|ref|ZP_05568253.1| lacto-N-biosidase [Enterococcus faecalis HIP11704] gi|257088588|ref|ZP_05582949.1| lacto-N-biosidase [Enterococcus faecalis CH188] gi|257418744|ref|ZP_05595738.1| lacto-N-biosidase [Enterococcus faecalis T11] gi|256954578|gb|EEU71210.1| lacto-N-biosidase [Enterococcus faecalis HIP11704] gi|256997400|gb|EEU83920.1| lacto-N-biosidase [Enterococcus faecalis CH188] gi|257160572|gb|EEU90532.1| lacto-N-biosidase [Enterococcus faecalis T11] Length = 545 Score = 39.7 bits (91), Expect = 0.52, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 52/145 (35%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + +E G + QT QT + K + Sbjct: 143 AIEKAESYTPETIQALQT---TKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 199 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 200 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 254 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 255 NGKKYNHNRVSKAIQRGNNAYYNDP 279 >gi|256618424|ref|ZP_05475270.1| lacto-N-biosidase [Enterococcus faecalis ATCC 4200] gi|256597951|gb|EEU17127.1| lacto-N-biosidase [Enterococcus faecalis ATCC 4200] Length = 545 Score = 39.7 bits (91), Expect = 0.52, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 52/145 (35%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + +E G + QT QT + K + Sbjct: 143 AIEKAESYTPETIQALQT---TKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 199 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 200 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 254 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 255 NGKKYNHNRVSKAIQRGNNAYYNDP 279 >gi|89885977|ref|YP_516175.1| methylation [Rhodoferax ferrireducens T118] gi|89347975|gb|ABD72177.1| methylation [Rhodoferax ferrireducens T118] Length = 523 Score = 39.7 bits (91), Expect = 0.53, Method: Composition-based stats. Identities = 29/131 (22%), Positives = 45/131 (34%), Gaps = 12/131 (9%) Query: 4 VYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFG---LVRGSSIATKIATTGIA 60 ++GT + + + L + LA P G+ F V + T A+ G Sbjct: 192 IFGTYRTYSSSSVANYVLRGDTIDPALAGPLTVGGQTTFNGGTQVNNTLGVTGAASVGTT 251 Query: 61 TVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQT 120 V AT + TQ L G+ M G+ S+G + + TQ Sbjct: 252 LNVGGATTLNGATQVNGAL---GVSGATTMNAGA------SIGTTLNVGGATTLAGATQI 302 Query: 121 GNISDATKASS 131 N T A+S Sbjct: 303 NNTLGVTGAAS 313 >gi|256960714|ref|ZP_05564885.1| lacto-N-biosidase [Enterococcus faecalis Merz96] gi|256951210|gb|EEU67842.1| lacto-N-biosidase [Enterococcus faecalis Merz96] Length = 545 Score = 39.7 bits (91), Expect = 0.54, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 52/145 (35%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + +E G + QT QT + K + Sbjct: 143 AIEKAESYTPETIQALQT---TKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQ 199 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 200 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 254 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 255 NGKKYNHNRVSKAIQRGNNAYYNDP 279 >gi|294664506|ref|ZP_06729853.1| methyl-accepting chemotaxis protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292605717|gb|EFF49021.1| methyl-accepting chemotaxis protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 801 Score = 39.7 bits (91), Expect = 0.63, Method: Composition-based stats. Identities = 23/86 (26%), Positives = 39/86 (45%), Gaps = 7/86 (8%) Query: 49 SIATKIATTGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELI 108 S A++ + GI V Q T M +TTQ+ A L +E A +E ++VG E + Sbjct: 693 SAASQEQSAGIEQVNQTVTQMDETTQQNAALVEEATAAARSLE-------EQAVGLTEAV 745 Query: 109 SASQNSQTVTQTGNISDATKASSTIK 134 + + Q T + KA+ ++ Sbjct: 746 AVFKTEQGTASTVRTASVAKAAPAVR 771 >gi|294624077|ref|ZP_06702828.1| methyl-accepting chemotaxis protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292601625|gb|EFF45611.1| methyl-accepting chemotaxis protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 783 Score = 39.7 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 23/86 (26%), Positives = 39/86 (45%), Gaps = 7/86 (8%) Query: 49 SIATKIATTGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELI 108 S A++ + GI V Q T M +TTQ+ A L +E A +E ++VG E + Sbjct: 675 SAASQEQSAGIEQVNQTVTQMDETTQQNAALVEEATAAARSLE-------EQAVGLTEAV 727 Query: 109 SASQNSQTVTQTGNISDATKASSTIK 134 + + Q T + KA+ ++ Sbjct: 728 AVFKTEQGTASTVRTASVAKAAPAVR 753 >gi|257087932|ref|ZP_05582293.1| predicted protein [Enterococcus faecalis D6] gi|256995962|gb|EEU83264.1| predicted protein [Enterococcus faecalis D6] Length = 545 Score = 39.3 bits (90), Expect = 0.69, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 52/145 (35%), Gaps = 8/145 (5%) Query: 72 TTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASS 131 ++ E I+A + +E G + QT QT + K + Sbjct: 143 AIEKAESYTPETIQALQT---TKETVATELAGKTYTSAQVTTWQTEVQTALDNLKEKQTQ 199 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFII 191 +K SID + F LEE + K N + D +L+ N + +LD + Sbjct: 200 PLKSVFSIDAGRKYFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNV 254 Query: 192 DGKKINIDSAQNMLAELNKIFPKDF 216 +GKK N + + N + D Sbjct: 255 NGKKYNHNRVSKAIQRGNNAYYNDP 279 >gi|21242638|ref|NP_642220.1| chemotaxis protein [Xanthomonas axonopodis pv. citri str. 306] gi|21108104|gb|AAM36756.1| chemotaxis protein [Xanthomonas axonopodis pv. citri str. 306] Length = 689 Score = 39.3 bits (90), Expect = 0.80, Method: Composition-based stats. Identities = 22/86 (25%), Positives = 39/86 (45%), Gaps = 7/86 (8%) Query: 49 SIATKIATTGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELI 108 S A++ + GI V Q T M +TTQ+ A L +E A +E +++G E + Sbjct: 581 SAASQEQSAGIEQVNQTVTQMDETTQQNAALVEEATAAARSLE-------EQAIGLTEAV 633 Query: 109 SASQNSQTVTQTGNISDATKASSTIK 134 + + Q T + KA+ ++ Sbjct: 634 AVFKTEQGTASTVRTASVAKAAPAVR 659 >gi|257078584|ref|ZP_05572945.1| lacto-N-biosidase [Enterococcus faecalis JH1] gi|256986614|gb|EEU73916.1| lacto-N-biosidase [Enterococcus faecalis JH1] Length = 545 Score = 38.9 bits (89), Expect = 0.97, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 49/132 (37%), Gaps = 5/132 (3%) Query: 85 EATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASSTIKDAQSIDRSQA 144 E ++ + +E G + QT QT + K + +K SID + Sbjct: 153 ETIQALQTTKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQPLKSVFSIDAGRK 212 Query: 145 IFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNM 204 F LEE + K N + D +L+ N + +LD ++GKK N + Sbjct: 213 YFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNVNGKKYNHNRVSKA 267 Query: 205 LAELNKIFPKDF 216 + N + D Sbjct: 268 IQRGNNAYYNDP 279 >gi|257084106|ref|ZP_05578467.1| glycosyl hydrolase [Enterococcus faecalis Fly1] gi|256992136|gb|EEU79438.1| glycosyl hydrolase [Enterococcus faecalis Fly1] Length = 545 Score = 38.9 bits (89), Expect = 0.98, Method: Composition-based stats. Identities = 29/132 (21%), Positives = 49/132 (37%), Gaps = 5/132 (3%) Query: 85 EATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASSTIKDAQSIDRSQA 144 E ++ + +E G + QT QT + K + +K SID + Sbjct: 153 ETIQALQTTKETVATELAGKTYTAAQVTTWQTEVQTALDNLKEKQTQPLKSVFSIDAGRK 212 Query: 145 IFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINIDSAQNM 204 F LEE + K N + D +L+ N + +LD ++GKK N + Sbjct: 213 YFSVEQLEE--LVAKASQNGYTDVQLILGNDGLRFILDDM---SVNVNGKKYNHNRVSKA 267 Query: 205 LAELNKIFPKDF 216 + N + D Sbjct: 268 IQRGNNAYYNDP 279 >gi|160885510|ref|ZP_02066513.1| hypothetical protein BACOVA_03510 [Bacteroides ovatus ATCC 8483] gi|156109132|gb|EDO10877.1| hypothetical protein BACOVA_03510 [Bacteroides ovatus ATCC 8483] Length = 679 Score = 38.9 bits (89), Expect = 1.0, Method: Composition-based stats. Identities = 34/155 (21%), Positives = 57/155 (36%), Gaps = 7/155 (4%) Query: 14 GNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIATVVQEATVMTKTT 73 G GW G ++ L A+ GLV G+ + V + + + Sbjct: 521 GLVGWNVQGTITGCYSLMTITAFTAVNAGGLVGGNEGPVTASFAASEIVAKASGNIGGLV 580 Query: 74 QEGALLAKEGIEATHIMEG-GSTAIKSESVGAKELISASQNSQTVTQT-GNISDATKASS 131 + G L G +T ++ G S I S G+ S V+ GN+ +TK S Sbjct: 581 KNGGTLT--GCYSTSVLSGTASVTICGISTGSVTANECYFMSDGVSNPGGNLPTSTKVSD 638 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFR 166 A ID+ ++ Q + Y ++ G + R Sbjct: 639 A---AALIDKIASMNQAIAGSGYKYVENTGTDSAR 670 >gi|299147364|ref|ZP_07040429.1| putative filamentous hemagglutinin family protein [Bacteroides sp. 3_1_23] gi|298514642|gb|EFI38526.1| putative filamentous hemagglutinin family protein [Bacteroides sp. 3_1_23] Length = 679 Score = 38.9 bits (89), Expect = 1.0, Method: Composition-based stats. Identities = 34/155 (21%), Positives = 57/155 (36%), Gaps = 7/155 (4%) Query: 14 GNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIATVVQEATVMTKTT 73 G GW G ++ L A+ GLV G+ + V + + + Sbjct: 521 GLVGWNVQGTITGCYSLMTITAFTAVNAGGLVGGNEGPVTASFAASEIVAKASGNIGGLV 580 Query: 74 QEGALLAKEGIEATHIMEG-GSTAIKSESVGAKELISASQNSQTVTQT-GNISDATKASS 131 + G L G +T ++ G S I S G+ S V+ GN+ +TK S Sbjct: 581 KNGGTLT--GCYSTSVLSGTASVTICGISTGSVTANECYFMSDGVSNPGGNLPTSTKVSD 638 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFR 166 A ID+ ++ Q + Y ++ G + R Sbjct: 639 A---AALIDKIASMNQAIAGSGYKYVENTGTDSAR 670 >gi|301091602|ref|XP_002895982.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262095706|gb|EEY53758.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 704 Score = 38.5 bits (88), Expect = 1.1, Method: Composition-based stats. Identities = 24/109 (22%), Positives = 45/109 (41%), Gaps = 1/109 (0%) Query: 26 DVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIATVVQEATVMTKTTQEGALLAK-EGI 84 DV LL P G+ +VR + A +V V + T+Q A+L + E I Sbjct: 425 DVVLLIPPLRLGGRACLDVVRAVVHSAVRANAWGVVLVSSVFVESPTSQHSAILEQLEEI 484 Query: 85 EATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKASSTI 133 E+ G S I + + ++ S +T G++ + + +++ Sbjct: 485 ESCVKTSGVSYTIMRLPLFMEYFLALSNCDETSDNLGSLEEEKQMQASM 533 >gi|325928660|ref|ZP_08189838.1| methyl-accepting chemotaxis protein [Xanthomonas perforans 91-118] gi|325540972|gb|EGD12536.1| methyl-accepting chemotaxis protein [Xanthomonas perforans 91-118] Length = 237 Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 7/86 (8%) Query: 49 SIATKIATTGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELI 108 S A++ + GI V Q T M +TTQ+ A L +E A +E ++VG E + Sbjct: 129 SAASQEQSAGIEQVNQTVTQMDETTQQNAALVEEATAAARSLE-------EQAVGLTEAV 181 Query: 109 SASQNSQTVTQTGNISDATKASSTIK 134 + + Q T T + KA+ ++ Sbjct: 182 AVFKTEQGTTSTVRTASVAKAAPAVR 207 >gi|293371730|ref|ZP_06618141.1| conserved domain protein [Bacteroides ovatus SD CMC 3f] gi|292633427|gb|EFF51997.1| conserved domain protein [Bacteroides ovatus SD CMC 3f] Length = 679 Score = 37.8 bits (86), Expect = 2.0, Method: Composition-based stats. Identities = 34/155 (21%), Positives = 57/155 (36%), Gaps = 7/155 (4%) Query: 14 GNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIATVVQEATVMTKTT 73 G GW G ++ L A+ GLV G+ + V + + + Sbjct: 521 GLVGWNVQGTITGCYSLMTITAFTAVNAGGLVGGNEGPVTASFAAGEIVAKASGNIGGLV 580 Query: 74 QEGALLAKEGIEATHIMEG-GSTAIKSESVGAKELISASQNSQTVTQT-GNISDATKASS 131 + G L G +T ++ G S I S G+ S V+ GN+ +TK S Sbjct: 581 KNGGTLT--GCYSTSVLSGTASVTICGISTGSVTANECYFMSDGVSNPGGNLPTSTKVSD 638 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFR 166 A ID+ ++ Q + Y ++ G + R Sbjct: 639 A---AALIDKIASMNQAIAGSGYKYVENTGTDSAR 670 >gi|171685051|ref|XP_001907467.1| hypothetical protein [Podospora anserina S mat+] gi|170942486|emb|CAP68138.1| unnamed protein product [Podospora anserina S mat+] Length = 902 Score = 37.8 bits (86), Expect = 2.3, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 47/138 (34%), Gaps = 3/138 (2%) Query: 134 KDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNL-LDASRATEFIID 192 KDA +DR + +F+K + E PR+ + FR F+ G +L A R TE D Sbjct: 178 KDAAELDRMETLFKKHEMGEIPRVDWLDQLVFRGFERRGLQTARSSLKAKAIRQTEASRD 237 Query: 193 GKKINIDS--AQNMLAELNKIFPKDFEKVQLISSYAHEHIFCKPFTKDLLNLANKNIYQL 250 K + + F+ + + Y + F + +N + Sbjct: 238 DKSSAQNEKAKPEPSGYFLNVELPRFDFPVVFADYEYPAPPISSFQHLSSSQSNILLKPP 297 Query: 251 SNPRYSYQFNTLKDKTIS 268 + + D+ Sbjct: 298 PEVHFGPGISVDDDEGYE 315 >gi|307185838|gb|EFN71679.1| Papilin [Camponotus floridanus] Length = 2944 Score = 37.8 bits (86), Expect = 2.3, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 41/106 (38%), Gaps = 11/106 (10%) Query: 47 GSSIATKIAT--------TGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIK 98 G+S +T T + V E+ + ++T + A + EG +E Sbjct: 837 GASASTSAVTEETVEETQASTSAVTSESDISSETAESTASESSEGST---TIESSDMTTG 893 Query: 99 SESVGAKELISASQNSQTVTQTGNISDATKASSTIKDAQSIDRSQA 144 S + + + S+ +++ + +D T S+ + + +++ Sbjct: 894 SSDITTESIGSSDITTESTGSSDITTDLTTGSAETTISSITESTES 939 >gi|166711970|ref|ZP_02243177.1| chemotaxis protein [Xanthomonas oryzae pv. oryzicola BLS256] Length = 717 Score = 37.4 bits (85), Expect = 2.5, Method: Composition-based stats. Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 49 SIATKIATTGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELI 108 S A++ + GI V Q T M +TTQ+ A L +E A +E ++VG E + Sbjct: 609 SAASQEQSAGIEQVNQTVTQMDETTQQNAALVEEATAAARSLE-------EQAVGLTEAV 661 Query: 109 SASQNSQTVTQTGNISDATKASSTIK 134 + + T + K + ++ Sbjct: 662 AVFKTEHGTTSDARTAPVAKVAQAVR 687 >gi|260172169|ref|ZP_05758581.1| adhesin [Bacteroides sp. D2] gi|315920478|ref|ZP_07916718.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313694353|gb|EFS31188.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 679 Score = 37.4 bits (85), Expect = 2.5, Method: Composition-based stats. Identities = 34/155 (21%), Positives = 57/155 (36%), Gaps = 7/155 (4%) Query: 14 GNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIATVVQEATVMTKTT 73 G GW G ++ L A+ GLV G+ + V + + + Sbjct: 521 GLVGWNVQGTITGCYSLMTITAFTAVNAGGLVGGNEGPVTASFAAGEIVAKASGNIGGLV 580 Query: 74 QEGALLAKEGIEATHIMEG-GSTAIKSESVGAKELISASQNSQTVTQT-GNISDATKASS 131 + G L G +T ++ G S I S G+ S V+ GN+ +TK S Sbjct: 581 KNGGTLT--GCYSTSVLSGTASVTICGISTGSVTANECYFMSDGVSNPGGNLPTSTKVSD 638 Query: 132 TIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFR 166 A ID+ ++ Q + Y ++ G + R Sbjct: 639 A---AALIDKITSMNQAVAGSGYKYVENTGTDSAR 670 >gi|322705536|gb|EFY97121.1| phosphoinositide 3-kinase, putative [Metarhizium anisopliae ARSEF 23] Length = 1017 Score = 37.4 bits (85), Expect = 2.6, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 35/76 (46%), Gaps = 3/76 (3%) Query: 134 KDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKL--LGTNKVYKNLLDASRATEFII 191 KDA+ +DR + +F+K + E PR+ + FR F+ L K +L RA Sbjct: 286 KDAEELDRMEKLFKKHEMGEIPRVDWLDQMVFRSFEKKGLQAAKSSMKMLQRQRALNGDN 345 Query: 192 DGKKINIDSAQNMLAE 207 +G D+ + L + Sbjct: 346 NGDH-ASDTDNDSLND 360 >gi|229523778|ref|ZP_04413183.1| hypothetical protein VCA_001355 [Vibrio cholerae bv. albensis VL426] gi|229337359|gb|EEO02376.1| hypothetical protein VCA_001355 [Vibrio cholerae bv. albensis VL426] Length = 1486 Score = 37.4 bits (85), Expect = 2.8, Method: Composition-based stats. Identities = 57/313 (18%), Positives = 107/313 (34%), Gaps = 37/313 (11%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIA 60 ++PV+ T+++F+ GN+G LG+ SDV A A Sbjct: 838 IVPVWATVEDFQNGNHGMATLGLFSDVGFFLPI--------------GKAALSSAKLTGK 883 Query: 61 TVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKEL-ISASQNSQTVTQ 119 + + K + + +L + G + E +K+ AKE IS + Sbjct: 884 VIAKSLPRKYKVSSKLNIL-EIGSNKRVVGEAFDIDLKAARKQAKEAWISLPKTVFDQMN 942 Query: 120 TGNISDATKA--SSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVY 177 ++D K S+++K Q + + + P K+ + RDF++ ++ Sbjct: 943 VLPVTDIVKGAKSASLKIKQRVQKVKRRNTD------PYAPKVYQS--RDFRISSKDRPD 994 Query: 178 KNLLDASRATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISS--YAHEHIFCKPF 235 D + +K+ S + + + D L+ S Y+ + + K Sbjct: 995 ---FDPDKVVGIPSADRKVMA-SIADEENVVIGVRAVDPNNRSLLESGLYSSKSLLIKSK 1050 Query: 236 TKDLLNLANKN----IYQLSNPRYSY-QFNTLKDKTISFVAKEEGLVTYLNGSLHRNYGI 290 + D A Y ++ R +FN D I+ A +T + L G Sbjct: 1051 SSDWGPHAGFIPVDQKYAKASARAQVDRFNHYSDNAINSGAATPVPLTIDDKRLTELQGG 1110 Query: 291 KAEGILSRNAPPE 303 I PE Sbjct: 1111 DNPLIPELKYNPE 1123 >gi|213626993|gb|AAI70552.1| N-methyl-D-aspartate receptor subunit NR2A [Xenopus laevis] Length = 1451 Score = 37.4 bits (85), Expect = 2.9, Method: Composition-based stats. Identities = 34/148 (22%), Positives = 66/148 (44%), Gaps = 19/148 (12%) Query: 62 VVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTG 121 V EA V T+ Q L K+ +E +G G +E S+++N + + + G Sbjct: 991 VSAEAKVNTRPRQ----LWKKSVETLRQTQGSV-----NENGTEESKSSTKNQRFLPEDG 1041 Query: 122 NISDATKASSTI------KDAQSIDRSQAIFQKMPLE-EYPR-LQKIGINYFRDFKLLGT 173 + SD ++ASS ++ +S+ +K P+ +Y R ++ ++Y + G Sbjct: 1042 HFSDVSEASSRATCHIDSENNNKHRKSKDNLKKRPVSAKYARECSEVELSYLKIKH--GP 1099 Query: 174 NKVYKNLLDASRATEFIIDGKKINIDSA 201 N+ +D + FI+D K + +S Sbjct: 1100 NRDKVYTIDGDKEPSFIMDQPKYSENSP 1127 >gi|196047835|ref|ZP_03115013.1| NlpC/P60 family protein [Bacillus cereus 03BB108] gi|196021091|gb|EDX59820.1| NlpC/P60 family protein [Bacillus cereus 03BB108] Length = 446 Score = 37.4 bits (85), Expect = 3.1, Method: Composition-based stats. Identities = 46/236 (19%), Positives = 83/236 (35%), Gaps = 20/236 (8%) Query: 82 EGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQTGNISDATKAS-STIKDAQSID 140 EG HI + KS +GA++ I+ +V GN + + + TI + I+ Sbjct: 137 EGGTKVHISNLSNPYYKSRYLGARKFINNDNIVPSVVLDGNNTTIEQETKQTITIPKKIE 196 Query: 141 RSQAIFQKMPLEEYPRLQK-IGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINID 199 F P + + IG R+ K K+ LD + +T + + +ID Sbjct: 197 DMPKSFNVTPNGVHSKYPITIGDTLERNGK-----KIKNYSLDTNESTS---NNQNEDID 248 Query: 200 SAQNMLAELNKIFPKDFEKVQL-------ISSYAHEHIFCKPFTKDLLNLANKNIYQLSN 252 S M I P ++ QL + +Y H P ++ N ++L+ Sbjct: 249 STNVMEKLEETIAPASEKQKQLQIENEIQLFNYGHNTFLPTP---KFESITMINNFKLNR 305 Query: 253 PRYSYQFNTLKDKTISFVAKEEGLVTYLNGSLHRNYGIKAEGILSRNAPPELHFSS 308 + L +KT + + + + Y + +LS N S Sbjct: 306 EEIAVILEALINKTKNKSLLNDKPLIIKDVMSEDPYYKSIQFVLSHNFLTTNSKSF 361 >gi|290474339|ref|YP_003467218.1| hypothetical protein XBJ1_1298 [Xenorhabdus bovienii SS-2004] gi|289173651|emb|CBJ80431.1| hypothetical protein XBJ1_1298 [Xenorhabdus bovienii SS-2004] Length = 1532 Score = 37.0 bits (84), Expect = 3.7, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 40/97 (41%), Gaps = 6/97 (6%) Query: 57 TGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQT 116 G A V +AT + K + GAL K EA ++E + A K+ + A +I + Sbjct: 215 AGTAGSVTKATEIAKVAEAGALATKSR-EAAQVVEASTQAAKNANPAAGAIIKEKPATSP 273 Query: 117 VTQTGNISDATKASSTIKDAQSIDRSQAIFQKMPLEE 153 T+ ++ + +A S + + ++ E Sbjct: 274 KTEVKPTTE-----KPVDNATSSGKKKGNDNELSSTE 305 >gi|92117724|ref|YP_577453.1| TonB-dependent receptor [Nitrobacter hamburgensis X14] gi|91800618|gb|ABE62993.1| TonB-dependent receptor [Nitrobacter hamburgensis X14] Length = 736 Score = 37.0 bits (84), Expect = 3.7, Method: Composition-based stats. Identities = 55/318 (17%), Positives = 102/318 (32%), Gaps = 53/318 (16%) Query: 14 GNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTGIATVVQEATVMTKTT 73 G+ WG L + VA+L +A+ SS T + T T + + K Sbjct: 10 GSALWGVLA--TTVAILLPSSAFAQ---------SSSGTTLPTV-TVTAPKSSKPQPKRA 57 Query: 74 QEGALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQTVTQT-GNISDAT----- 127 A + + + + A+ + G+ + +A Q + + QT G ++ T Sbjct: 58 TRNAARSNQAPRHARVPSQAAVAVPAAVPGSLTVATAQQALREIQQTPGGVALVTADAYR 117 Query: 128 --KASSTIKDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASR 185 SSTIKD +D +F + + RL G + R+F R Sbjct: 118 NSTVSSTIKD--ILDYVPGVFAQPKWGDDTRLSIRGSSLSRNF--------------HLR 161 Query: 186 ATEFIIDGKKINIDSAQNMLAELNKIFPKDFEKVQLISSYAHEHIFCKP------FTKDL 239 + +DG I + + + +I P ++ V++ F Sbjct: 162 GVQLYMDGIPI---NTADGYGDFQEIDPTAYKYVEVFKGGNALRFGANSLGGAINFVTPS 218 Query: 240 LNLANKNIYQLSNPRYSY---QFNTL-KDKTISFVAKEEGLVTYLNGSLHRNYG--IKAE 293 N + + + Q NT + +G +YG + Sbjct: 219 GRDPFVNGVSVDAGAFGFRRLQANTGGSNGPWDGFVTAST--QSSDGFRDHSYGHSTRVS 276 Query: 294 GILSRNAPPELHFSSYVN 311 G + P+ Y+N Sbjct: 277 GNVGYQFSPDFETRFYLN 294 >gi|313889219|ref|ZP_07822873.1| Gram-positive signal peptide protein, YSIRK family [Peptoniphilus harei ACS-146-V-Sch2b] gi|312844773|gb|EFR32180.1| Gram-positive signal peptide protein, YSIRK family [Peptoniphilus harei ACS-146-V-Sch2b] Length = 805 Score = 37.0 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 70/180 (38%), Gaps = 25/180 (13%) Query: 47 GSSIATKIATTGIATVVQEAT-VMTKTTQEGALLAKEGIEATHIMEGGSTAIKSESVGAK 105 G + T+ +A+ ++ + K E A ++ + +M+ + IK + K Sbjct: 584 GGLEQLQAGTSELASGARQIDGELAKAMSEAAS----KLDTSQLMKLSDSLIKLDDATTK 639 Query: 106 ELISASQNSQTVTQ--------TGNISDATKASSTIKDAQSIDRSQAI----FQKMPLEE 153 +S+ + Q T I++ SS +K+ S D + + + +E Sbjct: 640 LKDGSSKLREGTEQSEDAVNKFTDAITELDSNSSKLKNGAS-DLTSGLLEFRDKSQDFKE 698 Query: 154 YPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINIDSA---QNMLAELNK 210 +PRL+ GI RD G K+ ++D S + DG + D+ L E + Sbjct: 699 FPRLKTEGIVPMRD----GIKKLDDGIIDLSAGAFKLKDGSNLIDDNMRIFAEKLQEFKQ 754 >gi|317476494|ref|ZP_07935743.1| TonB-dependent Receptor Plug domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|316907520|gb|EFV29225.1| TonB-dependent Receptor Plug domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 1095 Score = 36.6 bits (83), Expect = 4.5, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 68/179 (37%), Gaps = 13/179 (7%) Query: 19 GALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATTG--IATVVQEATVMTKTTQEG 76 GA+G+ +L P + L GLV G I T G ++ + + + Sbjct: 141 GAVGLADAESLKERPVLNATQALQGLVPGLQITQTNGTLGDTPTINIRGTGTIGEGSSGS 200 Query: 77 ALLAKEGIEATHIMEGGSTAIKSESVGAKELISASQNSQ---TVTQTGNISDATKASSTI 133 L+ +G MEG I + + + ++ + +S + G I TK S Sbjct: 201 PLILIDG------MEGDLNMINPQDIASVSVLKDAASSSIYGSRAPFGVILITTKTGS-- 252 Query: 134 KDAQSIDRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIID 192 KD SI+ + + P++ + + F + + + + + +LD E ++ Sbjct: 253 KDKISINYNNSFRWASPIKMMKMMNSVNFASFLNDAMTNSGRSPQFVLDHVNRMEDYMN 311 >gi|163838809|ref|YP_001623214.1| phage infection protein [Renibacterium salmoninarum ATCC 33209] gi|162952285|gb|ABY21800.1| phage infection protein [Renibacterium salmoninarum ATCC 33209] Length = 673 Score = 36.2 bits (82), Expect = 6.5, Method: Composition-based stats. Identities = 24/101 (23%), Positives = 40/101 (39%), Gaps = 2/101 (1%) Query: 34 AAYLGKVLFGLVRGSSIATKIATTGIATVVQEATVMTKTTQEGALLAKEGIEATHIMEGG 93 + K L G S A A G A + A + + Q+ + + ++ G Sbjct: 354 VSNGAKTLAGATPALSAAISQANDGTAKLATGAQTLASSEQQAVAGTNKLADGAKTLDDG 413 Query: 94 STAIKSESV--GAKELISASQNSQTVTQTGNISDATKASST 132 + + +V A+Q Q V Q N +DATKAS++ Sbjct: 414 AAKLDDGAVKLADGANTLATQLGQGVDQIPNPNDATKASAS 454 >gi|118395905|ref|XP_001030297.1| hypothetical protein TTHERM_01100420 [Tetrahymena thermophila] gi|89284595|gb|EAR82634.1| hypothetical protein TTHERM_01100420 [Tetrahymena thermophila SB210] Length = 1749 Score = 36.2 bits (82), Expect = 6.6, Method: Composition-based stats. Identities = 21/157 (13%), Positives = 51/157 (32%), Gaps = 7/157 (4%) Query: 140 DRSQAIFQKMPLEEYPRLQKIGINYFRDFKLLGTNKVYKNLLDASRATEFIIDGKKINID 199 ++ I +K EE L+ + + + Y+ L+ ++ + + K I+ Sbjct: 904 KKNSQISKKQRFEELNFLRMMSQDVTTYDYSQDSQFDYQ--LEQNKQSTMNSNNKNIHDQ 961 Query: 200 SAQNMLAELNKIFPKDFEKVQLISSYAHEHIFCKPFTKDLLNLANKNIYQLSNPR----- 254 S + + K F KD + S +++++ YQ+ N + Sbjct: 962 SVKQKNDDFQKQFHKDLKIQYNQSKFSNQYHIENSIISSEQAQITDYQYQIDNNKCKILT 1021 Query: 255 YSYQFNTLKDKTISFVAKEEGLVTYLNGSLHRNYGIK 291 +N + + ++ GI+ Sbjct: 1022 SQRMYNNENQNSFYLMKDQQSEQQLNEDDKQNEEGIQ 1058 >gi|284006360|emb|CBA71595.1| conserved hypothetical protein [Arsenophonus nasoniae] Length = 319 Score = 35.9 bits (81), Expect = 9.1, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 63/163 (38%), Gaps = 16/163 (9%) Query: 1 MIPVYGTIQEFKKGNYGWGALGIVSDVALLAIPAAYLGKVLFGLVRGSSIATKIATT--- 57 +IP Y I E ++GN G + D+A +L + + G + Sbjct: 3 LIPFYTVITEAQQGNTGKAVQAGLWDMA------GFLSFISLTIQIGGRFSIAAGEAALN 56 Query: 58 GIATVVQEATVMTKTTQEGALLAKEGIEATH-------IMEGGSTAIKSESVGAKELISA 110 G+ T +++AT +Q G L K GI + + G+ ++S G + L S Sbjct: 57 GLQTALKQATFRQALSQGGKQLLKSGIPHIANSLPPNVVAKLGTAFLRSADPGFELLASG 116 Query: 111 SQNSQTVTQTGNISDATKASSTIKDAQSIDRSQAIFQKMPLEE 153 S + + S K +++++ A F +P E+ Sbjct: 117 GIKSINALKKAASQSKIEISGLNKLIKALEKKAADFPVVPTEK 159 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.311 0.130 0.316 Lambda K H 0.267 0.0394 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,645,524,751 Number of Sequences: 14124377 Number of extensions: 42384527 Number of successful extensions: 211207 Number of sequences better than 10.0: 483 Number of HSP's better than 10.0 without gapping: 108 Number of HSP's successfully gapped in prelim test: 375 Number of HSP's that attempted gapping in prelim test: 209114 Number of HSP's gapped (non-prelim): 2223 length of query: 311 length of database: 4,842,793,630 effective HSP length: 138 effective length of query: 173 effective length of database: 2,893,629,604 effective search space: 500597921492 effective search space used: 500597921492 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 81 (35.9 bits)