BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022859
(291 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255548834|ref|XP_002515473.1| Protein C2orf4, putative [Ricinus communis]
gi|223545417|gb|EEF46922.1| Protein C2orf4, putative [Ricinus communis]
Length = 291
Score = 532 bits (1370), Expect = e-149, Method: Compositional matrix adjust.
Identities = 259/291 (89%), Positives = 276/291 (94%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
MEKIRRASHAGSWYTDN K+L EELDGWL+ A L KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1 MEKIRRASHAGSWYTDNPKKLDEELDGWLKAAALEKSPDVRGVIAPHAGYSYSGRAAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDP +ISR+FLLGPSHHYYTPKCALSTATVYKTPIGDLP+DLEVIEELKATGKFELM
Sbjct: 61 FGNIDPANISRIFLLGPSHHYYTPKCALSTATVYKTPIGDLPIDLEVIEELKATGKFELM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVFEGH +K+VPILVGA+NA+NEAMYG+L KYVDDP+NFF
Sbjct: 121 DLRVDEAEHSMEMHLPYLAKVFEGHQVKVVPILVGALNADNEAMYGKLLGKYVDDPTNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRFNYMHYDKK+G IHKSIEALDKMGMDIIETGDPDAFK+YL EYDNTIC
Sbjct: 181 SVSSDFCHWGSRFNYMHYDKKYGAIHKSIEALDKMGMDIIETGDPDAFKQYLQEYDNTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVF+HML NC+T IKIKFLRYEQSSQCKT RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFVHMLRNCATVIKIKFLRYEQSSQCKTTRDSSVSYASAAAKVDA 291
>gi|359475871|ref|XP_002285449.2| PREDICTED: protein MEMO1-like [Vitis vinifera]
gi|296082111|emb|CBI21116.3| unnamed protein product [Vitis vinifera]
Length = 291
Score = 527 bits (1358), Expect = e-147, Method: Compositional matrix adjust.
Identities = 256/291 (87%), Positives = 277/291 (95%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME IRRASHAGSWYTDN ++LAEELDGWLR +GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1 MEMIRRASHAGSWYTDNPRKLAEELDGWLRASGLAKSPDVRGVIAPHAGYSYSGRAAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDP+SISRVFLLGPSHHYYTPKCALS ATVYKTP+GDL +DLEV+EELKATGKFELM
Sbjct: 61 FGNIDPSSISRVFLLGPSHHYYTPKCALSRATVYKTPVGDLQIDLEVVEELKATGKFELM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVF+G+ +KI+PILVGA+NAENEAMYGR+ AKYVDDP+NFF
Sbjct: 121 DLDVDEAEHSMEMHLPYLAKVFDGYPVKIIPILVGALNAENEAMYGRMLAKYVDDPNNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRFNY+HYDKK G IHKSIEALD+MGMDIIETGDPDAFK+YLLE+DNTIC
Sbjct: 181 SVSSDFCHWGSRFNYVHYDKKFGAIHKSIEALDRMGMDIIETGDPDAFKRYLLEFDNTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVFLHML NCSTKIKI FLRYEQSSQCKT RDSSVSYASAAA+VDA
Sbjct: 241 GRHPISVFLHMLRNCSTKIKINFLRYEQSSQCKTMRDSSVSYASAAAQVDA 291
>gi|449441896|ref|XP_004138718.1| PREDICTED: protein MEMO1-like [Cucumis sativus]
gi|449493342|ref|XP_004159261.1| PREDICTED: protein MEMO1-like [Cucumis sativus]
Length = 291
Score = 523 bits (1348), Expect = e-146, Method: Compositional matrix adjust.
Identities = 256/291 (87%), Positives = 277/291 (95%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+KIRRASHAGSWYTDNSK+LAEELDGWL +GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1 MDKIRRASHAGSWYTDNSKKLAEELDGWLSASGLSKSPDVRGVIAPHAGYSYSGRAAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISR+FLLGPSHHYYTPKCALSTAT+Y+TP+GDLP+DLEVIEELKATGKFELM
Sbjct: 61 FGNIDPTNISRIFLLGPSHHYYTPKCALSTATIYQTPVGDLPIDLEVIEELKATGKFELM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVFEGH +K+VPILVGAV+AE+EA YGRL AKYVDDP NFF
Sbjct: 121 DMHVDEAEHSMEMHLPYLAKVFEGHPVKVVPILVGAVSAESEATYGRLLAKYVDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
S+SSDFCHWGSRF+YMHYDKKHG I+KSIEALD+MGMDIIETGD +AFKKYL EYDNTIC
Sbjct: 181 SISSDFCHWGSRFSYMHYDKKHGPIYKSIEALDRMGMDIIETGDAEAFKKYLQEYDNTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVFLHML + STKIKIKFLRYEQSSQCKT RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFLHMLKHSSTKIKIKFLRYEQSSQCKTTRDSSVSYASAAAKVDA 291
>gi|356552504|ref|XP_003544607.1| PREDICTED: protein MEMO1-like [Glycine max]
Length = 296
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 252/288 (87%), Positives = 273/288 (94%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K+RR SHAGSWYTDN K+L+EEL+GWL+ GL KS +VRGVIAPHAGYSYSGRAAAYAFG
Sbjct: 4 KVRRPSHAGSWYTDNPKQLSEELEGWLQSCGLTKSSDVRGVIAPHAGYSYSGRAAAYAFG 63
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
NIDP++I+RVFLLGPSHHYYTPKCAL+TATVYKTPIGDLP+DLEV EELKATGKFELMDI
Sbjct: 64 NIDPSNITRVFLLGPSHHYYTPKCALTTATVYKTPIGDLPIDLEVTEELKATGKFELMDI 123
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
VDEAEHSMEMHLPYLAKVFEGH +KIVPILVGAV+AENEAMYG++ AKYVDD +NFFS+
Sbjct: 124 RVDEAEHSMEMHLPYLAKVFEGHQVKIVPILVGAVSAENEAMYGQILAKYVDDSNNFFSI 183
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWGSRFNYMHYDKKHG I+KSIEALDKMGMDIIETGDPD+FK+YLLEYDNTICGR
Sbjct: 184 SSDFCHWGSRFNYMHYDKKHGPIYKSIEALDKMGMDIIETGDPDSFKQYLLEYDNTICGR 243
Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
HPISVFLHML NCSTKIKIKFLRYEQSSQCK+ RDSSVSYASAAAK D
Sbjct: 244 HPISVFLHMLRNCSTKIKIKFLRYEQSSQCKSNRDSSVSYASAAAKTD 291
>gi|224143207|ref|XP_002324881.1| predicted protein [Populus trichocarpa]
gi|118483368|gb|ABK93585.1| unknown [Populus trichocarpa]
gi|222866315|gb|EEF03446.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 516 bits (1330), Expect = e-144, Method: Compositional matrix adjust.
Identities = 254/291 (87%), Positives = 271/291 (93%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+KIRR SHAGSWYTDN K+L EEL+GWL GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1 MDKIRRPSHAGSWYTDNPKKLEEELEGWLGATGLTKSPDVRGVIAPHAGYSYSGRAAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+I+RVFLLGPSHHYYTPKCALSTATVYKTPIGDLP+DLEVIEELKATGKFELM
Sbjct: 61 FGNIDPTNITRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPIDLEVIEELKATGKFELM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYL K+FEGH +K+VPILVGA+N +NEAMYGRL AKYVDDP+NFF
Sbjct: 121 DLQVDEAEHSMEMHLPYLVKIFEGHPVKVVPILVGALNTDNEAMYGRLLAKYVDDPTNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRF+Y HYDKK G IHKSIEALDKMGMDIIETG+ DAFK+YL EYDNTIC
Sbjct: 181 SVSSDFCHWGSRFHYTHYDKKCGPIHKSIEALDKMGMDIIETGNADAFKQYLSEYDNTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVFLHM NCSTKIKIKFLRYEQSSQCKT RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFLHMSSNCSTKIKIKFLRYEQSSQCKTMRDSSVSYASAAAKVDA 291
>gi|255647372|gb|ACU24152.1| unknown [Glycine max]
Length = 296
Score = 516 bits (1330), Expect = e-144, Method: Compositional matrix adjust.
Identities = 250/288 (86%), Positives = 272/288 (94%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K+RR SHAGSWYTDN K+L+EEL+GWL+ GL KS +VRGVIAPHAGYSYSGRAAAYAFG
Sbjct: 4 KVRRPSHAGSWYTDNPKQLSEELEGWLQSCGLTKSSDVRGVIAPHAGYSYSGRAAAYAFG 63
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
NIDP++I+RVFLLGPSHHYYTPKCAL+TATVYKTPIGDLP+DLEV EELKATGKFELMDI
Sbjct: 64 NIDPSNITRVFLLGPSHHYYTPKCALTTATVYKTPIGDLPIDLEVTEELKATGKFELMDI 123
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
VDEAEHSME+HLPYLAKVFEGH +KIVPILVGAV+AENEAMYG++ AKYVDD +NFFS+
Sbjct: 124 RVDEAEHSMEVHLPYLAKVFEGHQVKIVPILVGAVSAENEAMYGQILAKYVDDSNNFFSI 183
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWGSRFNYMHYDKKHG I+KSIEALDKMGMDIIETGDPD+FK+YLLEYDNTICGR
Sbjct: 184 SSDFCHWGSRFNYMHYDKKHGPIYKSIEALDKMGMDIIETGDPDSFKQYLLEYDNTICGR 243
Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
HPISVFLHML NCSTKIKIKF RYEQSSQCK+ RDSSVSYASAAAK D
Sbjct: 244 HPISVFLHMLRNCSTKIKIKFFRYEQSSQCKSNRDSSVSYASAAAKTD 291
>gi|224092526|ref|XP_002309646.1| predicted protein [Populus trichocarpa]
gi|222855622|gb|EEE93169.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 252/291 (86%), Positives = 270/291 (92%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
MEKIRRASHAGSWYTD+ K+L EEL+GWL + GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1 MEKIRRASHAGSWYTDDPKKLEEELEGWLSDTGLTKSPDVRGVIAPHAGYSYSGRAAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+I+RVFLLGPSHHYYTPKCALSTA VYKTPIGDLP+DLEVIEELKATGKFELM
Sbjct: 61 FGNIDPTNITRVFLLGPSHHYYTPKCALSTAAVYKTPIGDLPIDLEVIEELKATGKFELM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDE EHSMEMHLPYLAK+FEGH +K+VPILVGAV+A+NEAMYG L AKYVDDP+NFF
Sbjct: 121 DLQVDEHEHSMEMHLPYLAKIFEGHPVKVVPILVGAVSADNEAMYGCLLAKYVDDPTNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRF+Y YDKK G IHKSIEALDKMGMDIIETG+ DA K+YL EY NTIC
Sbjct: 181 SVSSDFCHWGSRFHYTRYDKKCGPIHKSIEALDKMGMDIIETGNADALKQYLSEYGNTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVFLHML NCSTKIKIKFLRYEQSSQCKT RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFLHMLRNCSTKIKIKFLRYEQSSQCKTMRDSSVSYASAAAKVDA 291
>gi|326489895|dbj|BAJ94021.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326513924|dbj|BAJ92112.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 298
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 238/291 (81%), Positives = 267/291 (91%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME++RRASHAGSWYT+N+ +L EELDGWL A L KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1 MERVRRASHAGSWYTNNASKLEEELDGWLSAAALTKSPDVRAVIAPHAGYSYSGRCAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ ATVY TPIGDLP+DLEVIEELKATGKFE M
Sbjct: 61 FGNIDPTNISRVFLLGPSHHYYTPKCALTRATVYSTPIGDLPVDLEVIEELKATGKFEFM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYL+KVF+GH +K+VPILVGAVN+++EAMYG+L AKYVDDP NFF
Sbjct: 121 DLNVDEAEHSMEMHLPYLSKVFQGHNVKVVPILVGAVNSQSEAMYGQLLAKYVDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRF+Y +YDK HG IHKSIEALD+MGM+IIETGDPDAFK+YL EY+NTIC
Sbjct: 181 SVSSDFCHWGSRFSYTYYDKNHGAIHKSIEALDRMGMEIIETGDPDAFKQYLQEYENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISV LHML +C TKIK+ F+RYEQSSQCK+ RDSSVSYASAAAKVD+
Sbjct: 241 GRHPISVLLHMLKHCLTKIKVGFVRYEQSSQCKSMRDSSVSYASAAAKVDS 291
>gi|258644468|dbj|BAI39726.1| unknown protein [Oryza sativa Indica Group]
Length = 298
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 233/291 (80%), Positives = 268/291 (92%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME++RRASHAGSWYT+N+++L EELDGWLR AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1 MERVRRASHAGSWYTNNARKLDEELDGWLRAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ AT+Y TPIGDLP+D EVIEEL ATGKF+ M
Sbjct: 61 FGNIDPTNISRVFLLGPSHHYYTPKCALTRATIYSTPIGDLPVDHEVIEELNATGKFDFM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYL+KVF+GH +K+VPILVGA+N++NEAMYG+L +KY+DDP NFF
Sbjct: 121 DLSVDEAEHSMEMHLPYLSKVFQGHNVKVVPILVGALNSQNEAMYGQLLSKYLDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
S+SSDFCHWG+RF+Y +YDK HG IHKSIEALD+MGM+IIETG+PDAFK+YL EY+NTIC
Sbjct: 181 SISSDFCHWGTRFSYTYYDKSHGAIHKSIEALDRMGMEIIETGNPDAFKQYLQEYENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVFL ML +CSTKIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD
Sbjct: 241 GRHPISVFLSMLKHCSTKIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVDT 291
>gi|115475776|ref|NP_001061484.1| Os08g0299000 [Oryza sativa Japonica Group]
gi|50508226|dbj|BAD31730.1| unknown protein [Oryza sativa Japonica Group]
gi|113623453|dbj|BAF23398.1| Os08g0299000 [Oryza sativa Japonica Group]
gi|215678729|dbj|BAG95166.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215692469|dbj|BAG87889.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215708833|dbj|BAG94102.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222640303|gb|EEE68435.1| hypothetical protein OsJ_26808 [Oryza sativa Japonica Group]
Length = 298
Score = 490 bits (1262), Expect = e-136, Method: Compositional matrix adjust.
Identities = 232/291 (79%), Positives = 267/291 (91%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME++RRASHAGSWYT+N+++L EELDGWLR AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1 MERVRRASHAGSWYTNNARKLDEELDGWLRAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ AT+Y TPIGDLP+D EVIEEL ATGKF+ M
Sbjct: 61 FGNIDPTNISRVFLLGPSHHYYTPKCALTRATIYSTPIGDLPVDHEVIEELNATGKFDFM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYL+KVF+GH +K+VPILVGA+N++NEAMYG+L +KY+DDP NFF
Sbjct: 121 DLSVDEAEHSMEMHLPYLSKVFQGHNVKVVPILVGALNSQNEAMYGQLLSKYLDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
S+SSDFCHWG+RF+Y +YDK HG IHKSIEALD+MGM+IIETG+PDAFK+YL EY+NTIC
Sbjct: 181 SISSDFCHWGTRFSYTYYDKSHGAIHKSIEALDRMGMEIIETGNPDAFKQYLQEYENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVFL ML +C TKIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD
Sbjct: 241 GRHPISVFLSMLKHCLTKIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVDT 291
>gi|242055595|ref|XP_002456943.1| hypothetical protein SORBIDRAFT_03g046030 [Sorghum bicolor]
gi|241928918|gb|EES02063.1| hypothetical protein SORBIDRAFT_03g046030 [Sorghum bicolor]
Length = 299
Score = 489 bits (1258), Expect = e-136, Method: Compositional matrix adjust.
Identities = 234/291 (80%), Positives = 268/291 (92%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME++RRASHAGSWYT+N+++L EEL+GWL AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1 MERVRRASHAGSWYTNNARKLEEELNGWLGAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D EVIEEL ATG+FE M
Sbjct: 61 FGNIDPTNISRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQEVIEELSATGRFEFM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVF+GH +K+VPILVGA+++++EAMYG+L +KYVDDP NFF
Sbjct: 121 DLSVDEAEHSMEMHLPYLAKVFQGHTVKVVPILVGALSSQSEAMYGQLLSKYVDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRF+Y +Y++KHG IHKSIEALD+MGM+IIETGDP AFK+YL EY+NTIC
Sbjct: 181 SVSSDFCHWGSRFSYTYYEEKHGAIHKSIEALDRMGMEIIETGDPVAFKEYLQEYENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVFLHML +CSTKIKI F+RYEQSSQCK RDSSVSYASAAAKVDA
Sbjct: 241 GRHPISVFLHMLKHCSTKIKIGFVRYEQSSQCKNFRDSSVSYASAAAKVDA 291
>gi|297825573|ref|XP_002880669.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326508|gb|EFH56928.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 234/291 (80%), Positives = 264/291 (90%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
MEK+R+ +HAGSWYTDN +L+ +L+ WL GL KSP VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1 MEKVRQPTHAGSWYTDNPTKLSSDLEEWLSATGLTKSPHVRGVIAPHAGYSYSGRAAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
F NIDPT+ISR+FLLGPSHH+YTPKCALSTATVYKTPIGDLP+D+E+I+E++A GKF +M
Sbjct: 61 FANIDPTNISRIFLLGPSHHFYTPKCALSTATVYKTPIGDLPVDVEMIKEIRAMGKFGMM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVF+GH +K+VPILVGAV+AENEAMYG L AKYVDDP NFF
Sbjct: 121 DLRVDEAEHSMEMHLPYLAKVFKGHDVKVVPILVGAVSAENEAMYGELLAKYVDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRFNYMHYD HG IHKSIEALDK GMDIIETGDPDAFKKYLLE++NTIC
Sbjct: 181 SVSSDFCHWGSRFNYMHYDNTHGAIHKSIEALDKKGMDIIETGDPDAFKKYLLEFENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPIS+FLHML + S+KIKI FLRYEQSSQC+T RDSSVSYASAAAK++
Sbjct: 241 GRHPISIFLHMLKHSSSKIKINFLRYEQSSQCQTMRDSSVSYASAAAKLET 291
>gi|357145523|ref|XP_003573672.1| PREDICTED: protein MEMO1-like [Brachypodium distachyon]
Length = 298
Score = 486 bits (1252), Expect = e-135, Method: Compositional matrix adjust.
Identities = 231/291 (79%), Positives = 266/291 (91%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME++R+ASHAGSWYT+N+++L EELDGWLR AGL KSP+VR +IAPHAGYSYSGR AAYA
Sbjct: 1 MERVRKASHAGSWYTNNARKLEEELDGWLRAAGLTKSPDVRAIIAPHAGYSYSGRCAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISRVFLLGPSHHYYTPKCALS TVY TPIGDLP+D EVIEEL ATGKFE M
Sbjct: 61 FGNIDPTNISRVFLLGPSHHYYTPKCALSRTTVYSTPIGDLPVDQEVIEELSATGKFEFM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D VDEAEHSMEMHLPYL+KVF+G+ +K+VPILVGA+++ENEAMYG+L +KYVDDP NFF
Sbjct: 121 DHNVDEAEHSMEMHLPYLSKVFQGYNVKVVPILVGALSSENEAMYGQLLSKYVDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
S+SSDFCHWGSRF+Y +YDK HG IHKSIEALD++GM+IIE+GDPDAFK+YL EY+NTIC
Sbjct: 181 SISSDFCHWGSRFSYTYYDKSHGAIHKSIEALDRLGMEIIESGDPDAFKQYLREYENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISV LHML +CSTK K+ F+RYEQSSQCK+ RDSSVSYASAAAK+DA
Sbjct: 241 GRHPISVLLHMLKHCSTKSKVGFVRYEQSSQCKSMRDSSVSYASAAAKIDA 291
>gi|148910301|gb|ABR18230.1| unknown [Picea sitchensis]
Length = 291
Score = 486 bits (1250), Expect = e-135, Method: Compositional matrix adjust.
Identities = 225/291 (77%), Positives = 255/291 (87%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME+ RRA HAGSWYTDN+ +L EELD WLR G KSP+VRGVIAPHAGYSYSGR AA+A
Sbjct: 1 MERARRAVHAGSWYTDNAGQLGEELDRWLRACGQSKSPDVRGVIAPHAGYSYSGRCAAFA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
F NIDP SRVFLLGPSHHYYTPKCALS ATVYKTP+GDLP+DL+V +EL+A+GKFE M
Sbjct: 61 FANIDPQRTSRVFLLGPSHHYYTPKCALSKATVYKTPLGDLPIDLDVNDELRASGKFEDM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVF+G +KIVPILVGA++ E+EAMYGRL AKYVDDP NFF
Sbjct: 121 DLQVDEAEHSMEMHLPYLAKVFQGWPVKIVPILVGALSPESEAMYGRLLAKYVDDPRNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRFNY++YD+K G I+KSIE LD+ GMDIIE+GD D+FK YLLEYDNTIC
Sbjct: 181 SVSSDFCHWGSRFNYVYYDRKQGAIYKSIETLDRKGMDIIESGDADSFKDYLLEYDNTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPISVFLHML NC ++KI FL+YEQSSQC++ RDSSVSYASA AK +A
Sbjct: 241 GRHPISVFLHMLKNCGVRVKISFLQYEQSSQCRSMRDSSVSYASAVAKAEA 291
>gi|18400785|ref|NP_565590.1| uncharacterized protein [Arabidopsis thaliana]
gi|4567249|gb|AAD23663.1| expressed protein [Arabidopsis thaliana]
gi|16604563|gb|AAL24083.1| unknown protein [Arabidopsis thaliana]
gi|20259161|gb|AAM14296.1| unknown protein [Arabidopsis thaliana]
gi|21554336|gb|AAM63443.1| unknown [Arabidopsis thaliana]
gi|330252585|gb|AEC07679.1| uncharacterized protein [Arabidopsis thaliana]
Length = 291
Score = 486 bits (1250), Expect = e-135, Method: Compositional matrix adjust.
Identities = 233/291 (80%), Positives = 264/291 (90%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
MEKIR+ +HAGSWYTDN +L+ +L+ WL GL KSP+VRGVIAPHAGYSYSGRAAAYA
Sbjct: 1 MEKIRQPTHAGSWYTDNPTKLSSDLEEWLNATGLTKSPDVRGVIAPHAGYSYSGRAAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
F NIDPT+ISR+FLLGPSHH+YTPKCALSTATVYKTPIG+LP+D+E+I+E++A GKF +M
Sbjct: 61 FANIDPTNISRIFLLGPSHHFYTPKCALSTATVYKTPIGNLPVDVEMIKEIRAMGKFGMM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVFEG+ +K+VPILVGAV+ ENEAMYG L AKYVDDP NFF
Sbjct: 121 DLRVDEAEHSMEMHLPYLAKVFEGNNVKVVPILVGAVSPENEAMYGELLAKYVDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRFNYMHYD HG IHKSIEALDK GMDIIETGDPDAFKKYLLE++NTIC
Sbjct: 181 SVSSDFCHWGSRFNYMHYDNTHGAIHKSIEALDKKGMDIIETGDPDAFKKYLLEFENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
GRHPIS+FLHML + S+KIKI FLRYEQSSQC+T RDSSVSYASAAAK++
Sbjct: 241 GRHPISIFLHMLKHSSSKIKINFLRYEQSSQCQTMRDSSVSYASAAAKLET 291
>gi|226492415|ref|NP_001141056.1| uncharacterized protein LOC100273137 [Zea mays]
gi|194702440|gb|ACF85304.1| unknown [Zea mays]
gi|195654227|gb|ACG46581.1| hypothetical protein [Zea mays]
gi|413951291|gb|AFW83940.1| hypothetical protein ZEAMMB73_880994 [Zea mays]
Length = 300
Score = 482 bits (1241), Expect = e-134, Method: Compositional matrix adjust.
Identities = 221/277 (79%), Positives = 253/277 (91%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME+ RRASHAGSWYT+++++L EEL+GWL AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1 MERARRASHAGSWYTNDARKLEEELNGWLGAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D EVIEEL ATGKFE M
Sbjct: 61 FGNIDPTNISRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQEVIEELSATGKFEFM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVF+GH +K+VPILVGA++++NEA+YG+L +KYVDDP NFF
Sbjct: 121 DLNVDEAEHSMEMHLPYLAKVFQGHNVKVVPILVGALSSQNEALYGQLLSKYVDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRF+Y +Y+KKHG IHKSIEALD+MGM+IIETGDP AFK+YL EY+NTIC
Sbjct: 181 SVSSDFCHWGSRFSYTYYEKKHGAIHKSIEALDRMGMEIIETGDPVAFKEYLQEYENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
GRHPISVFLHML +CSTKIKI F+RYEQSSQCK RD
Sbjct: 241 GRHPISVFLHMLKHCSTKIKIGFVRYEQSSQCKNMRD 277
>gi|219887901|gb|ACL54325.1| unknown [Zea mays]
Length = 300
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 220/277 (79%), Positives = 252/277 (90%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME+ RRASHAGSWYT+++++L EEL+GWL AGL KSP+ R VIAPHAGYSYSGR AAYA
Sbjct: 1 MERARRASHAGSWYTNDARKLEEELNGWLGAAGLTKSPDARAVIAPHAGYSYSGRCAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D EVIEEL ATGKFE M
Sbjct: 61 FGNIDPTNISRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQEVIEELSATGKFEFM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVF+GH +K+VPILVGA++++NEA+YG+L +KYVDDP NFF
Sbjct: 121 DLNVDEAEHSMEMHLPYLAKVFQGHNVKVVPILVGALSSQNEALYGQLLSKYVDDPKNFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRF+Y +Y+KKHG IHKSIEALD+MGM+IIETGDP AFK+YL EY+NTIC
Sbjct: 181 SVSSDFCHWGSRFSYTYYEKKHGAIHKSIEALDRMGMEIIETGDPVAFKEYLQEYENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
GRHPISVFLHML +CSTKIKI F+RYEQSSQCK RD
Sbjct: 241 GRHPISVFLHMLKHCSTKIKIGFVRYEQSSQCKNMRD 277
>gi|294462702|gb|ADE76896.1| unknown [Picea sitchensis]
Length = 291
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 219/290 (75%), Positives = 254/290 (87%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME +RRA+HAGSWYT N+ L EEL+ WLR G KS +VRGVIAPHAGY YSGR AA+A
Sbjct: 1 MESVRRAAHAGSWYTSNASELGEELERWLRACGQSKSSDVRGVIAPHAGYRYSGRCAAFA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
F +IDP SRVFLLGPSHHYYTPKCALS+ATVYKTP+GDLP+DLEV EELKATGKFE+M
Sbjct: 61 FAHIDPQQTSRVFLLGPSHHYYTPKCALSSATVYKTPLGDLPIDLEVYEELKATGKFEIM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
DI VDE EHSMEMHLPYLAKVF+G +KIVPILVG+++ E+EAMYGR +KYV+DP+NFF
Sbjct: 121 DIQVDEEEHSMEMHLPYLAKVFQGWPVKIVPILVGSLSTESEAMYGRFLSKYVNDPANFF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
S+SSDFCHWGSRFNY++YDKK G I+KSI+ALDK GMDI+ +GDP+AFK YLL+Y NTIC
Sbjct: 181 SISSDFCHWGSRFNYVYYDKKEGAIYKSIKALDKKGMDIVASGDPNAFKDYLLQYRNTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
GRHPISVFLHML NCSTKIK+ F++YEQSS+ KT RDSSVSYASA A+V+
Sbjct: 241 GRHPISVFLHMLQNCSTKIKVAFVQYEQSSKSKTMRDSSVSYASAVARVE 290
>gi|125561007|gb|EAZ06455.1| hypothetical protein OsI_28693 [Oryza sativa Indica Group]
Length = 409
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 219/275 (79%), Positives = 252/275 (91%)
Query: 17 NSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLG 76
++++L EELDGWLR AGL KSP+VR VIAPHAGYSYSGR AAYAFGNIDPT+ISRVFLLG
Sbjct: 128 DTRKLDEELDGWLRAAGLTKSPDVRAVIAPHAGYSYSGRCAAYAFGNIDPTNISRVFLLG 187
Query: 77 PSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLP 136
PSHHYYTPKCAL+ AT+Y TPIGDLP+D EVIEEL ATGKF+ MD+ VDEAEHSMEMHLP
Sbjct: 188 PSHHYYTPKCALTRATIYSTPIGDLPVDHEVIEELNATGKFDFMDLSVDEAEHSMEMHLP 247
Query: 137 YLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM 196
YL+KVF+GH +K+VPILVGA+N++NEAMYG+L +KY+DDP NFFS+SSDFCHWG+RF+Y
Sbjct: 248 YLSKVFQGHNVKVVPILVGALNSQNEAMYGQLLSKYLDDPKNFFSISSDFCHWGTRFSYT 307
Query: 197 HYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS 256
+YDK HG IHKSIEALD+MGM+IIETG+PDAFK+YL EY+NTICGRHPISVFL ML +CS
Sbjct: 308 YYDKSHGAIHKSIEALDRMGMEIIETGNPDAFKQYLQEYENTICGRHPISVFLSMLKHCS 367
Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
TKIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD
Sbjct: 368 TKIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVDT 402
>gi|413951290|gb|AFW83939.1| hypothetical protein ZEAMMB73_880994 [Zea mays]
Length = 286
Score = 452 bits (1163), Expect = e-125, Method: Compositional matrix adjust.
Identities = 212/277 (76%), Positives = 240/277 (86%), Gaps = 14/277 (5%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME+ RRASHAGSWYT+++++L EEL+GWL AGL KSP+VR VIAPHAGYSYSGR AAYA
Sbjct: 1 MERARRASHAGSWYTNDARKLEEELNGWLGAAGLTKSPDVRAVIAPHAGYSYSGRCAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
FGNIDPT+ISRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D EVIEEL ATGKFE M
Sbjct: 61 FGNIDPTNISRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQEVIEELSATGKFEFM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ VDEAEHSMEMHLPYLAKVF+G ++NEA+YG+L +KYVDDP NFF
Sbjct: 121 DLNVDEAEHSMEMHLPYLAKVFQG--------------SQNEALYGQLLSKYVDDPKNFF 166
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SVSSDFCHWGSRF+Y +Y+KKHG IHKSIEALD+MGM+IIETGDP AFK+YL EY+NTIC
Sbjct: 167 SVSSDFCHWGSRFSYTYYEKKHGAIHKSIEALDRMGMEIIETGDPVAFKEYLQEYENTIC 226
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
GRHPISVFLHML +CSTKIKI F+RYEQSSQCK RD
Sbjct: 227 GRHPISVFLHMLKHCSTKIKIGFVRYEQSSQCKNMRD 263
>gi|302785594|ref|XP_002974568.1| hypothetical protein SELMODRAFT_232363 [Selaginella moellendorffii]
gi|300157463|gb|EFJ24088.1| hypothetical protein SELMODRAFT_232363 [Selaginella moellendorffii]
Length = 289
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 206/286 (72%), Positives = 245/286 (85%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ASHAGSWY+D++ L EELDGWL+ +GLPK P+ R VIAPHAGY YSGR AA+AF NI
Sbjct: 3 RQASHAGSWYSDDALELREELDGWLKGSGLPKKPDARAVIAPHAGYRYSGRCAAFAFANI 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP +SRVF+LGPSHH+YT KCALS +VYKTP+GDLP+DL+V ELK TG+FE M + V
Sbjct: 63 DPQVVSRVFILGPSHHHYTRKCALSRTSVYKTPLGDLPIDLDVYRELKETGQFEEMRVDV 122
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
DEAEHSMEMHLPYLAKVF G +KIVPILVGA+++E+EA+YGRL AKY+DDPSNFFSVSS
Sbjct: 123 DEAEHSMEMHLPYLAKVFSGVPVKIVPILVGALSSESEALYGRLLAKYLDDPSNFFSVSS 182
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWGSRF Y +YDKK G I+KSIEALDK GMDIIE+ +P+AFK YL EY+NTICGRHP
Sbjct: 183 DFCHWGSRFGYTYYDKKRGPIYKSIEALDKCGMDIIESKNPEAFKSYLDEYENTICGRHP 242
Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
I V LHM+ +C+TK++++FL+YEQSS CKT RDSSVSYASA V+
Sbjct: 243 IGVLLHMIQHCNTKLELRFLQYEQSSHCKTSRDSSVSYASAVITVE 288
>gi|302759687|ref|XP_002963266.1| hypothetical protein SELMODRAFT_438442 [Selaginella moellendorffii]
gi|300168534|gb|EFJ35137.1| hypothetical protein SELMODRAFT_438442 [Selaginella moellendorffii]
Length = 289
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 205/286 (71%), Positives = 244/286 (85%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ASHAGSWY+D++ L EELDGWL+ +GLPK P+ R VIAPHAGY YSGR AA+AF NI
Sbjct: 3 RQASHAGSWYSDDALELREELDGWLKGSGLPKKPDARAVIAPHAGYRYSGRCAAFAFANI 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP +SRVF+LGPSHH+YT KCALS +VYKTP+GDLP+DL+V ELK TG+FE M + V
Sbjct: 63 DPQVVSRVFILGPSHHHYTRKCALSRTSVYKTPLGDLPIDLDVYRELKETGQFEEMRVDV 122
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
DEAEHSMEMHLPYLAKVF G +KIVPILVG +++E+EA+YGRL AKY+DDPSNFFSVSS
Sbjct: 123 DEAEHSMEMHLPYLAKVFSGVPVKIVPILVGTLSSESEALYGRLLAKYLDDPSNFFSVSS 182
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWGSRF Y +YDKK G I+KSIEALDK GMDIIE+ +P+AFK YL EY+NTICGRHP
Sbjct: 183 DFCHWGSRFGYTYYDKKRGPIYKSIEALDKCGMDIIESKNPEAFKSYLDEYENTICGRHP 242
Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
I V LHM+ +C+TK++++FL+YEQSS CKT RDSSVSYASA V+
Sbjct: 243 IGVLLHMIQDCNTKLELRFLQYEQSSHCKTSRDSSVSYASAVITVE 288
>gi|168010446|ref|XP_001757915.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690792|gb|EDQ77157.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 290
Score = 442 bits (1137), Expect = e-122, Method: Compositional matrix adjust.
Identities = 201/281 (71%), Positives = 236/281 (83%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A HAG+WYTD+++ L +ELD WLR +GLPK R VIAPHAGY YSG AAYAF NI
Sbjct: 4 REAFHAGTWYTDDAEELTQELDQWLRSSGLPKLANARAVIAPHAGYRYSGHCAAYAFANI 63
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP + RVFLLGPSHH++T KC LS +VYKTP+GDLP+D EV EEL+A+G+FE MD+ V
Sbjct: 64 DPQRVQRVFLLGPSHHHFTRKCCLSKVSVYKTPLGDLPIDNEVYEELRASGRFEDMDVSV 123
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
DEAEHSMEMHLPY+AKVF+G +KIVPI+VGA EA YGRL AKY+DDP+NFFS+SS
Sbjct: 124 DEAEHSMEMHLPYIAKVFQGTSVKIVPIMVGATTPATEATYGRLLAKYIDDPANFFSISS 183
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWGSRF Y +Y+KK G I+KSIEALDK GMDIIETGDPDAF++YL E++NTICGRHP
Sbjct: 184 DFCHWGSRFGYQYYEKKRGPIYKSIEALDKCGMDIIETGDPDAFQEYLSEFENTICGRHP 243
Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
I VFLHML +CST+ +IKFL+YEQSSQCK+ RDSSVSYASA
Sbjct: 244 IGVFLHMLKHCSTRTRIKFLQYEQSSQCKSMRDSSVSYASA 284
>gi|168017790|ref|XP_001761430.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687436|gb|EDQ73819.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 290
Score = 433 bits (1114), Expect = e-119, Method: Compositional matrix adjust.
Identities = 196/281 (69%), Positives = 239/281 (85%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A+HAGSWY+D+++ L +ELD WLR +GLPK + R +IAPHAGY YSGR AAYAF NI
Sbjct: 4 REATHAGSWYSDDAEELVQELDYWLRSSGLPKLTDARAIIAPHAGYWYSGRCAAYAFANI 63
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
D + RVFLLGPSHH++T KCALSTA+VYKTP+GDLP+D EV +EL+A+G+FE M + V
Sbjct: 64 DSQRVQRVFLLGPSHHHFTRKCALSTASVYKTPLGDLPIDHEVYDELRASGRFEDMSMSV 123
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
DEAEHSMEMHLP+LAKVF+G +KIVPI++GA E EA YGRL AKY+DDP+NFFS+SS
Sbjct: 124 DEAEHSMEMHLPFLAKVFQGTNVKIVPIMIGACPPETEATYGRLLAKYIDDPNNFFSISS 183
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWGSRF Y +Y+KK G I+KSIEALD+ GMDIIE+GDPDAF++YL E+ NTICGRHP
Sbjct: 184 DFCHWGSRFGYQYYEKKRGPIYKSIEALDQRGMDIIESGDPDAFQEYLSEFKNTICGRHP 243
Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
I VF H+L +C+T+++IKFL+YEQSSQCK RDSSVSYASA
Sbjct: 244 IGVFFHILRHCATRMRIKFLQYEQSSQCKNMRDSSVSYASA 284
>gi|222631062|gb|EEE63194.1| hypothetical protein OsJ_18003 [Oryza sativa Japonica Group]
Length = 778
Score = 348 bits (892), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 176/274 (64%), Positives = 201/274 (73%), Gaps = 52/274 (18%)
Query: 18 SKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGP 77
+++L EELDGWLR AGL KSP+V VIAPHAGYSYSGR AAYAFGNIDPT+ISRVFLLGP
Sbjct: 3 ARKLDEELDGWLRAAGLTKSPDVSAVIAPHAGYSYSGRCAAYAFGNIDPTNISRVFLLGP 62
Query: 78 SHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPY 137
SHHYYTPKCAL+ AT+Y TPIGDLP+D E
Sbjct: 63 SHHYYTPKCALTRATIYSTPIGDLPVDHE------------------------------- 91
Query: 138 LAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
+NEAMYG+L ++Y+DDP NFFS+SSDFCHWG+RF+Y +
Sbjct: 92 ---------------------DQNEAMYGQLLSRYLDDPKNFFSISSDFCHWGTRFSYTY 130
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
YDK HG IHKSIEALD MGM+IIETG+PDAFK+YL EY+NTICGRHPISVFL ML +CST
Sbjct: 131 YDKSHGAIHKSIEALDHMGMEIIETGNPDAFKQYLQEYENTICGRHPISVFLSMLKHCST 190
Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
KIKI+F+RYEQSSQCK+ RDSSVSYASAAAKVD
Sbjct: 191 KIKIRFVRYEQSSQCKSMRDSSVSYASAAAKVDT 224
>gi|384250635|gb|EIE24114.1| UPF0103-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 294
Score = 343 bits (879), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 154/278 (55%), Positives = 203/278 (73%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RR +HAGSWY D+ RLA +++ W+ P+ R +I+PHAGYSY G A+A+ I
Sbjct: 6 RRPAHAGSWYEDDGTRLASQIENWIGAVPTPRGAHARAIISPHAGYSYCGHVMAHAYKQI 65
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
+P +SRVF+LGPSHHYYT +C LS A+ Y TP+G + + L++ +EL+ATG+F+++DI V
Sbjct: 66 NPERVSRVFILGPSHHYYTRRCCLSPASSYDTPLGSVSIALDIYQELQATGQFDVIDIDV 125
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
DEAEHS+E+ + YLA + G ++VPI+VG++ E EA YGRL A Y+DDPSN F +SS
Sbjct: 126 DEAEHSLELQMGYLAHIMRGRQFQLVPIMVGSLTPEREAAYGRLLAGYLDDPSNVFIISS 185
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWGSRF+Y D G I+K+IE LD GMDIIE GDP AF +YL Y NTICGRHP
Sbjct: 186 DFCHWGSRFSYTFNDSSKGPIYKAIEWLDTQGMDIIEQGDPKAFTEYLRRYGNTICGRHP 245
Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
ISVFL+M+ +C TK +I F Y+QS++ T RDSSVSY
Sbjct: 246 ISVFLNMVRSCRTKFEINFENYDQSARVATFRDSSVSY 283
>gi|332027147|gb|EGI67240.1| Protein MEMO1 [Acromyrmex echinatior]
Length = 296
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 163/293 (55%), Positives = 207/293 (70%), Gaps = 8/293 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IRRASHAGSWY+DN L+++L+GWL A L P R +IAPHAGYSY G A +A
Sbjct: 1 MALIRRASHAGSWYSDNGSELSKQLEGWLSAADLSHGP-ARAIIAPHAGYSYCGACAGFA 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I P + R+F+LGPSHH P CALS+A++Y+TP+ DL +D +V EL+ TG FE M
Sbjct: 60 YRQISPVVVRRIFILGPSHHVRLPGCALSSASIYRTPLYDLLIDQQVRRELEETGHFECM 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D+ DE EHS+EM LP++AKV EG I+PILVG+++ E EA+YGRL A Y+ DP
Sbjct: 120 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPEREALYGRLLAPYMADPQT 179
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDFCHWG RF Y +YD+ G IH+SI+ LDKMGMDIIET +P F +YL +Y NT
Sbjct: 180 LFVISSDFCHWGQRFRYTYYDRSCGPIHRSIQNLDKMGMDIIETLNPLMFTEYLKKYGNT 239
Query: 239 ICGRHPISVFLHML----GNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
ICGRHPI V L + GN + ++ +KFL+Y QSSQC DSSVSYASA+
Sbjct: 240 ICGRHPIGVLLQAIHSLKGNTNGQRMNLKFLKYAQSSQCNNMNDSSVSYASAS 292
>gi|322779500|gb|EFZ09692.1| hypothetical protein SINV_14380 [Solenopsis invicta]
Length = 310
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 163/293 (55%), Positives = 207/293 (70%), Gaps = 8/293 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IRRASHAGSWY+DN L+++L+GWL A L P R +IAPHAGYSY G A +A
Sbjct: 15 MALIRRASHAGSWYSDNGSELSKQLEGWLSAADLSHGP-ARAIIAPHAGYSYCGACAGFA 73
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I P + R+F+LGPSHH P CALS+A++Y+TP+ DL +D +V EL+ TG FE M
Sbjct: 74 YRQISPVVVRRIFILGPSHHVRLPGCALSSASIYRTPLYDLHIDQQVRRELEETGHFECM 133
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D+ DE EHS+EM LP++AKV EG I+PILVG+++ E EA+YGRL A Y+ DP
Sbjct: 134 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPEREALYGRLLAPYMADPQT 193
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDFCHWG RF Y +YD+ G IH+SI+ LDKMGMDIIET +P F +YL +Y NT
Sbjct: 194 LFIISSDFCHWGQRFRYTYYDRSCGPIHRSIQNLDKMGMDIIETLNPPMFTEYLKKYGNT 253
Query: 239 ICGRHPISVFLHML----GNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
ICGRHPI V L + GN + ++ +KFL+Y QSSQC DSSVSYASA+
Sbjct: 254 ICGRHPIGVLLQAIHSLKGNTNGQRMNLKFLKYAQSSQCNNMNDSSVSYASAS 306
>gi|125551770|gb|EAY97479.1| hypothetical protein OsI_19410 [Oryza sativa Indica Group]
Length = 234
Score = 340 bits (871), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 176/274 (64%), Positives = 200/274 (72%), Gaps = 52/274 (18%)
Query: 18 SKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGP 77
+++L EELDGWLR AGL KSP+V VIAPHAGYSYSGR AAYAFGNIDPT+ISRVFLLGP
Sbjct: 3 ARKLDEELDGWLRAAGLTKSPDVSAVIAPHAGYSYSGRCAAYAFGNIDPTNISRVFLLGP 62
Query: 78 SHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPY 137
SHHYYTPKCAL+ AT+Y TPIGDLP+D E
Sbjct: 63 SHHYYTPKCALTRATIYSTPIGDLPVDHE------------------------------- 91
Query: 138 LAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
+NEAMYG+L ++Y+DDP NFFS+SSDFCHWG+RF+Y +
Sbjct: 92 ---------------------DQNEAMYGQLLSRYLDDPKNFFSISSDFCHWGTRFSYTY 130
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
YDK HG IHKSIEALD MGM+IIETG+PDAFK+YL EY+NTICGRHPISVFL ML +CST
Sbjct: 131 YDKSHGAIHKSIEALDHMGMEIIETGNPDAFKQYLQEYENTICGRHPISVFLSMLKHCST 190
Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
KIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD
Sbjct: 191 KIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVDT 224
>gi|326427143|gb|EGD72713.1| MEMO1 protein [Salpingoeca sp. ATCC 50818]
Length = 291
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 162/290 (55%), Positives = 206/290 (71%), Gaps = 1/290 (0%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M RRA+HAGSWYTD+ K L+ +LD WL + G P R +IAPHAGYSYSG AAYA
Sbjct: 1 MSAHRRATHAGSWYTDDGKELSGQLDAWLGKVGEAIMPGARAIIAPHAGYSYSGPTAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I P ++ RVF+LGPSHH Y CAL++ T Y TPIG+L +D + ++L+ATG+FE M
Sbjct: 61 YKQIVPDNVRRVFVLGPSHHVYIQGCALTSTTTYDTPIGNLKIDRDTNKDLEATGQFEHM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
DI DE EHS+EMHLPY+AKV EG +VP+LVGA++++ E YG++FA Y++DPSN F
Sbjct: 121 DIQTDEDEHSIEMHLPYVAKVMEGKDFTVVPVLVGALDSKLEQEYGKIFAPYLEDPSNLF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF Y + H I+K IE LDK GMDIIE D F+ YL E +NTIC
Sbjct: 181 VISSDFCHWGRRFGYTYVPPGHDAIYKGIEHLDKQGMDIIEKQDAKGFRAYLQECENTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIK-IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
GR+PIS+ L+ L + K +KFL Y QS+ CK+ RDSSVSYASAA +
Sbjct: 241 GRNPISILLNALQVLKDRQKEVKFLHYAQSNPCKSTRDSSVSYASAAVLI 290
>gi|307166154|gb|EFN60403.1| Protein MEMO1 [Camponotus floridanus]
Length = 296
Score = 337 bits (864), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 161/293 (54%), Positives = 206/293 (70%), Gaps = 8/293 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IRRASHAGSWY+DN L+++L+GWL A L P R +IAPHAGYSY G A +A
Sbjct: 1 MALIRRASHAGSWYSDNGPELSKQLEGWLGAADLSHGP-ARAIIAPHAGYSYCGACAGFA 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I P + R+F+LGPSHH P CALS+ ++Y+TP+ DL +D +V EL+ TG FE M
Sbjct: 60 YRQISPVVVRRIFILGPSHHVRLPGCALSSTSIYRTPLYDLHIDQQVRRELEETGHFECM 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D+ DE EHS+EM LP++AKV EG I+PILVG+++ + EA+YGRL A Y+ DP
Sbjct: 120 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPDREALYGRLLAPYMADPQT 179
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDFCHWG RF Y +YD+ G IH+SI+ LDKMGMDIIET +P F +YL +Y NT
Sbjct: 180 LFVISSDFCHWGQRFRYTYYDRSCGPIHRSIQNLDKMGMDIIETLNPAMFTEYLKKYGNT 239
Query: 239 ICGRHPISVFLHML----GNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
ICGRHPI V L + GN + ++ +KFL+Y QSSQC DSSVSYASA+
Sbjct: 240 ICGRHPIGVLLQAIHSLKGNTNGQRMNLKFLKYAQSSQCNNMNDSSVSYASAS 292
>gi|346469415|gb|AEO34552.1| hypothetical protein [Amblyomma maculatum]
Length = 298
Score = 336 bits (861), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 162/292 (55%), Positives = 202/292 (69%), Gaps = 5/292 (1%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ASHAGSWYTD+++ L +LD WL G P R VIAPHAGY Y G AA+A+
Sbjct: 7 VRKASHAGSWYTDSARELRYQLDNWLSAVGPPTFGPARAVIAPHAGYQYCGACAAHAYKQ 66
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+DP+ + RVF+LGPSHH C LS A Y+TP DL +D EV EEL TG FE + I
Sbjct: 67 VDPSVVRRVFILGPSHHARLGGCGLSPAKQYRTPFYDLTIDQEVYEELYETGAFEEVSIH 126
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
VDE EHS+EMHLPY+AKV E IVPI+VG+++ ENEA YGRL +KY+ D N F +S
Sbjct: 127 VDENEHSLEMHLPYIAKVMENQEFTIVPIIVGSLSPENEAFYGRLLSKYLADADNLFVIS 186
Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
SDFCHWG+RF+Y +YDK G IH+SIE LDK GM IIE P AF YL +Y NTICGRH
Sbjct: 187 SDFCHWGARFHYQYYDKSWGNIHQSIEKLDKQGMSIIEELSPTAFTAYLKKYGNTICGRH 246
Query: 244 PISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
PI V L+ + + ++ +KFL+Y QSSQC + DSSVSYASAA +++
Sbjct: 247 PIGVLLNAVDTLQSSGNGHRMVLKFLKYAQSSQCMSMGDSSVSYASAALRLE 298
>gi|427788107|gb|JAA59505.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 298
Score = 336 bits (861), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 160/292 (54%), Positives = 202/292 (69%), Gaps = 5/292 (1%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ASHAGSWYTD+++ L +L+ WL G P R +IAPHAGY Y G AA+A+
Sbjct: 7 VRKASHAGSWYTDSARELRYQLENWLSAVGPPTFGPARAIIAPHAGYQYCGACAAHAYKQ 66
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+DPT++ RVF+LGPSHH C LS A Y+TP+ DL ++ EV EEL TG FE + I
Sbjct: 67 VDPTAVRRVFILGPSHHARIGGCGLSPAKTYRTPLYDLTINQEVYEELYETGAFEEVSIH 126
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
VDE EHS+EMHLPY+AKV E IVPI+VG+++ ENEA YGRL +KY+ D N F +S
Sbjct: 127 VDENEHSLEMHLPYIAKVMENQDFTIVPIIVGSLSPENEAFYGRLLSKYLADADNLFVIS 186
Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
SDFCHWG+RF+Y YDK G IH+SIE LDK GM IIE P AF YL +Y NTICGRH
Sbjct: 187 SDFCHWGARFHYQFYDKSWGNIHQSIEKLDKQGMSIIEELSPTAFTAYLKKYGNTICGRH 246
Query: 244 PISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
PI V L+ + ++ +KFL+Y QSSQC + DSSVSYASAA +++
Sbjct: 247 PIGVLLNAVDTLQNSGNGHRMALKFLKYAQSSQCMSMSDSSVSYASAALRLE 298
>gi|302853948|ref|XP_002958486.1| hypothetical protein VOLCADRAFT_99750 [Volvox carteri f.
nagariensis]
gi|300256214|gb|EFJ40486.1| hypothetical protein VOLCADRAFT_99750 [Volvox carteri f.
nagariensis]
Length = 293
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 156/291 (53%), Positives = 205/291 (70%), Gaps = 4/291 (1%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAA 58
M KIRR SHA SWY ++ L E+D W +A + P V R +I PHAGYSY G A
Sbjct: 1 MPKIRRPSHASSWYQGDADALNAEIDNW--KAAVQTEPRVSPRAIIGPHAGYSYCGHVMA 58
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
YA+ +ID +RVFLLGPSHH +T KC LS+ ++Y TP+G + +D EV +LKATG+FE
Sbjct: 59 YAYKHIDANKFTRVFLLGPSHHVFTRKCMLSSQSLYDTPLGSMEIDQEVYGQLKATGQFE 118
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+M VDEAEHS+E+H PY+ G K+VPI+VGA+ AE EA+YG+L Y+DDPSN
Sbjct: 119 VMSRDVDEAEHSLELHTPYIVHTMRGQSYKLVPIMVGALTAEGEALYGKLLGPYLDDPSN 178
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F VSSDFCHWGSRF+Y +Y+++ G I +SI+ LD++G+ IE G+P F YL +Y NT
Sbjct: 179 LFVVSSDFCHWGSRFSYTYYNREQGQIWQSIKWLDELGIQAIEGGNPADFTAYLEQYRNT 238
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
ICGRHPI VFL+ML + + KIKF +Y+QSS+C ++ SSVSYA+A V
Sbjct: 239 ICGRHPIGVFLNMLQHSKLRHKIKFTKYDQSSKCTSQSHSSVSYAAAIVTV 289
>gi|260837153|ref|XP_002613570.1| hypothetical protein BRAFLDRAFT_114006 [Branchiostoma floridae]
gi|229298955|gb|EEN69579.1| hypothetical protein BRAFLDRAFT_114006 [Branchiostoma floridae]
Length = 294
Score = 330 bits (846), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 155/292 (53%), Positives = 206/292 (70%), Gaps = 7/292 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRA+HAGSWYT + + L ++L+GWL A SP R +IAPHAGY+Y G A+A+
Sbjct: 3 VRRATHAGSWYTSSVQDLNDQLEGWLAAAQPTLSP-ARAIIAPHAGYAYCGACGAHAYRQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+DP +I RVF+LGPSHH CA++ VY TP+ DL +D + +EL TG+FE+MDI
Sbjct: 62 VDPRNIKRVFILGPSHHVRLSGCAVTATQVYHTPLYDLTIDTAINQELLQTGQFEVMDIN 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EMHLPY+AKV E +VP+LVG ++ + EA YGRLF+K++ DP N F
Sbjct: 122 TDEDEHSIEMHLPYVAKVMESKKGQFTVVPVLVGGLSVDQEAKYGRLFSKFLADPQNLFV 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF Y++YD+ G I++SIEALD+MGMDIIE +P F YL +Y+NTICG
Sbjct: 182 ISSDFCHWGQRFKYVYYDRSQGDIYQSIEALDRMGMDIIEHMNPAEFTSYLRKYENTICG 241
Query: 242 RHPISVFLHMLGNCST----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
RHPI+V L+ + T + ++FL Y QSS+C+ K DSSVSYA+AA V
Sbjct: 242 RHPIAVLLNAVKYLKTANSHTLSLRFLNYAQSSRCRRKEDSSVSYAAAALVV 293
>gi|289741463|gb|ADD19479.1| putative dioxygenase [Glossina morsitans morsitans]
Length = 294
Score = 329 bits (843), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 159/287 (55%), Positives = 199/287 (69%), Gaps = 6/287 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ASHAGSWYT ++ L +LD WL A L P R +IAPHAGY+Y G A+A+ +
Sbjct: 4 RKASHAGSWYTSSANELRRQLDNWLASACLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A Y+TP+ DL +D+EV EL+ TG F MD+
Sbjct: 63 SPAVVKRIFILGPSHHVRLRGCALSVAKKYETPLYDLKIDIEVNAELEKTGHFSWMDMKT 122
Query: 125 DEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E + IVP+LVG++N + EA+YGRLF KY +P N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEEFKNQFSIVPVLVGSLNPDQEALYGRLFTKYFLEPENLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RFNY +YDK G I+KSIE+LDK GM+IIET DP +F YL EY+NTICGR
Sbjct: 183 SSDFCHWGQRFNYTYYDKSSGPIYKSIESLDKRGMEIIETLDPASFTSYLREYNNTICGR 242
Query: 243 HPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L+ + + K KFL+Y QSSQCK DSSVSYA+ +
Sbjct: 243 HPIGVLLNAVKSLQEKGCNTNFKFLKYAQSSQCKHMDDSSVSYAAGS 289
>gi|383852946|ref|XP_003701986.1| PREDICTED: protein MEMO1-like [Megachile rotundata]
Length = 296
Score = 328 bits (842), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 157/297 (52%), Positives = 203/297 (68%), Gaps = 8/297 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IRRA+HAGSWY + L ++L+GWL A L P R +I+PHAGYSY G A +A
Sbjct: 1 MALIRRATHAGSWYAGSGLELNKQLEGWLGAADLSHGP-ARAIISPHAGYSYCGACAGFA 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I P + R+F+LGPSHH CALS+A++Y+TP+ DL +D +V EL+ +G FE M
Sbjct: 60 YRQISPVVVRRIFILGPSHHVRLAGCALSSASIYQTPLYDLHVDQQVCRELEESGHFEWM 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D+ DE EHS+EM LP++AKV EG I+PILVG+++ E EA YGRL A Y+ DP
Sbjct: 120 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPEREAFYGRLLAPYMADPQT 179
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDFCHWG RF Y +YD+ G IH+SI+ LDKMGMDIIET +P F YL +Y NT
Sbjct: 180 LFVISSDFCHWGQRFRYTYYDRSCGPIHRSIQNLDKMGMDIIETLNPAMFTDYLKKYGNT 239
Query: 239 ICGRHPISVFLHMLGNC-----STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
ICGRHPISV L ++ N ++ +KFL+Y QS+QC DSSVSYASA+ V+
Sbjct: 240 ICGRHPISVLLQIIHNLKGNTNGQRMNLKFLKYAQSNQCNNMSDSSVSYASASLVVE 296
>gi|307106589|gb|EFN54834.1| hypothetical protein CHLNCDRAFT_48888 [Chlorella variabilis]
Length = 318
Score = 328 bits (840), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 154/311 (49%), Positives = 200/311 (64%), Gaps = 26/311 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +IR +HAG+WY++ + +L ++L WL EA + R +IAPHAGY YSG AAYA
Sbjct: 1 MPRIRATTHAGTWYSNQAGQLQQQLKTWLSEAEAVEGQHARAIIAPHAGYRYSGHVAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ IDPT + RVFLLGPSHH+Y+ C LS A Y TP+G +D EV EL+ATGKF+ +
Sbjct: 61 YKQIDPTQVRRVFLLGPSHHFYSKHCLLSPADAYATPLGSATIDAEVYAELRATGKFQEL 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
DEAEHS+E+HLPY+ + G +VPI+VGA++AE+EA YGRL A Y+DD SN F
Sbjct: 121 KAAADEAEHSLELHLPYIVHMMAGRPFTLVPIVVGAISAESEAAYGRLLAPYLDDSSNLF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKH--------------------------GVIHKSIEALDK 214
VSSDFCHWG RF+Y YD G IH SIE LD+
Sbjct: 181 IVSSDFCHWGKRFSYTFYDPAQASAAGGGGQGRPGRGQQQQLSGGALLGPIHASIERLDR 240
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
GM +IE D F YL ++ NTICGRHP ++ LH L +C+TK ++F +Y+QS +C T
Sbjct: 241 QGMALIEAQDAAGFTAYLKQHGNTICGRHPTALLLHALRHCATKHSVRFTKYDQSHRCTT 300
Query: 275 KRDSSVSYASA 285
++DSSVSYASA
Sbjct: 301 EQDSSVSYASA 311
>gi|195108689|ref|XP_001998925.1| GI24231 [Drosophila mojavensis]
gi|193915519|gb|EDW14386.1| GI24231 [Drosophila mojavensis]
Length = 295
Score = 328 bits (840), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 161/289 (55%), Positives = 195/289 (67%), Gaps = 7/289 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRA+HAGSWYTD++ L+ +LD WL A L P R +IAPHAGY+Y G AYA+
Sbjct: 3 VRRATHAGSWYTDSASELSRQLDRWLGAADLSHGP-ARAIIAPHAGYAYCGACGAYAYRQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + R+F+LGPSHH CALS YKTP+ DL +D EV EL+ TG+F MDI
Sbjct: 62 VSPVVVKRIFILGPSHHVRLRSCALSVTKKYKTPLYDLKIDTEVNAELENTGQFTWMDIK 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EMHLPY+AKV E IVPILVG++N E EA YG L AKY DP+N F
Sbjct: 122 TDEDEHSIEMHLPYIAKVMEDFKDQFTIVPILVGSLNPEQEAQYGSLLAKYFMDPTNLFV 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF+Y +YD+ G IHK IE LDK GMDIIET P AF +YL +Y+NTICG
Sbjct: 182 ISSDFCHWGQRFSYTYYDRSCGPIHKCIEQLDKQGMDIIETLSPTAFTEYLRKYNNTICG 241
Query: 242 RHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
RHPI V L + + + KFL+Y QSSQC DSSVSYAS +
Sbjct: 242 RHPIGVMLGAVKALQDQGFKNMSFKFLKYAQSSQCLDMEDSSVSYASGS 290
>gi|195446372|ref|XP_002070750.1| GK10855 [Drosophila willistoni]
gi|194166835|gb|EDW81736.1| GK10855 [Drosophila willistoni]
Length = 295
Score = 327 bits (839), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 159/288 (55%), Positives = 197/288 (68%), Gaps = 7/288 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD+S L+ +LD WL A L P R +IAPHAGY+Y G A+A+ +
Sbjct: 4 RRATHAGSWYTDSSSELSRQLDRWLSAAELSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A YKTP+ DL +D ++ EL+ TG F MD+
Sbjct: 63 SPVVVKRIFILGPSHHVRLRGCALSVAKKYKTPLYDLKIDTQINAELEKTGHFTWMDMKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E + IVPILVG++N E EA YG L + Y+ DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSSYLMDPTNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF+Y +YD+ G IHKSIE LDK GMDIIET P AF +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGPIHKSIEQLDKQGMDIIETLTPSAFTEYLRKYNNTICGR 242
Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L + K+ KFL+Y QSSQC+ DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQEQGYNKMSFKFLKYAQSSQCQDMDDSSVSYASGS 290
>gi|390332252|ref|XP_791245.2| PREDICTED: protein MEMO1-like [Strongylocentrotus purpuratus]
Length = 299
Score = 327 bits (837), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 156/296 (52%), Positives = 207/296 (69%), Gaps = 11/296 (3%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +RRA+H+G+WY+ + K L+ EL+GWL A SP R +IAPHAGYSY G A+A
Sbjct: 1 MSTMRRATHSGTWYSSSGKTLSGELEGWLSNAERSGSP-ARAIIAPHAGYSYCGACGAHA 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ ++P +I RVF+LGPSHH Y P CAL+T T Y+TP+ DL +D +V ++L ++ FE M
Sbjct: 60 YKQVNPANIERVFILGPSHHAYLPGCALTTCTKYQTPLYDLDVDQKVNQDLMSSKMFETM 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ VDE EHS+EMHLPY+AKV E K IVP+LVGA++ E +GRLF+ Y+ DP N
Sbjct: 120 SVQVDEDEHSIEMHLPYIAKVMESKRGKFTIVPVLVGALDINKEQNFGRLFSTYLADPRN 179
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F VSSDFCHWG RF + +YDKK+ I++SIEA+D+MGMD+IE DP F YL +Y NT
Sbjct: 180 LFVVSSDFCHWGKRFRFTYYDKKYPKIYQSIEAVDRMGMDLIEQLDPQGFAAYLKKYSNT 239
Query: 239 ICGRHPISVFLHML--------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
ICGRHPI+V L+ + ++KFL+Y QS+QCK + DSSVSYA+AA
Sbjct: 240 ICGRHPIAVLLNAITIAISERQNGSGLHYEMKFLQYAQSNQCKLQSDSSVSYAAAA 295
>gi|195500657|ref|XP_002097466.1| GE24472 [Drosophila yakuba]
gi|194183567|gb|EDW97178.1| GE24472 [Drosophila yakuba]
Length = 295
Score = 326 bits (835), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 156/288 (54%), Positives = 198/288 (68%), Gaps = 7/288 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G A+A+ +
Sbjct: 4 RRATHAGSWYTDSGAELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A Y+TP+ DL +D ++ EL+ TG+F MD+
Sbjct: 63 SPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDTQINAELEKTGQFSWMDMKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E + IVPILVG++N E EA YG L + Y+ DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSSYLMDPTNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF+Y +YD+ G IHKSIE LDK GMDIIET P +F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGAIHKSIEKLDKQGMDIIETLSPHSFTEYLRKYNNTICGR 242
Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L + K+ KFL+Y+QSSQC+ DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLKYDQSSQCQDMEDSSVSYASGS 290
>gi|125774497|ref|XP_001358507.1| GA20778 [Drosophila pseudoobscura pseudoobscura]
gi|54638244|gb|EAL27646.1| GA20778 [Drosophila pseudoobscura pseudoobscura]
Length = 295
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 158/288 (54%), Positives = 196/288 (68%), Gaps = 7/288 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD L+ +LD WL A L P R +IAPHAGY+Y G +A+A+ I
Sbjct: 4 RRATHAGSWYTDAGSELSRQLDRWLGAAELSHGP-ARAIIAPHAGYTYCGACSAFAYRQI 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A YKTP+ DL +D+++ EL+ TG+F MD+
Sbjct: 63 SPVVVKRIFILGPSHHVRLRGCALSVAKKYKTPLYDLKIDIQINAELEKTGQFSWMDMKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E + IVPILVG++N E EA YG L + Y DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSTYFMDPTNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF+Y +YD+ G IHKSIE LDK GMD+IET P AF +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGQIHKSIEKLDKQGMDLIETLSPTAFTEYLRKYNNTICGR 242
Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L + K+ KFL+Y QSSQC DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLQYAQSSQCLDMEDSSVSYASGS 290
>gi|21357419|ref|NP_650252.1| CG8031 [Drosophila melanogaster]
gi|7299707|gb|AAF54889.1| CG8031 [Drosophila melanogaster]
gi|16769746|gb|AAL29092.1| LP04475p [Drosophila melanogaster]
gi|220944484|gb|ACL84785.1| CG8031-PA [synthetic construct]
gi|220954280|gb|ACL89683.1| CG8031-PA [synthetic construct]
Length = 295
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 157/288 (54%), Positives = 198/288 (68%), Gaps = 7/288 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G AA+A+ +
Sbjct: 4 RRATHAGSWYTDSGAELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACAAFAYRQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A Y+TP+ DL +D ++ EL+ TGKF MD+
Sbjct: 63 SPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDAQINSELEKTGKFSWMDMKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E + IVPILVG++N E EA YG L + Y+ DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSSYLMDPTNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF+Y +YD G IHKSIE LDK GMDIIE+ +P +F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGHRFSYTYYDSSCGAIHKSIEKLDKQGMDIIESLNPHSFTEYLRKYNNTICGR 242
Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L + K+ KFL+Y QSSQC+ DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDIEDSSVSYASGS 290
>gi|194901576|ref|XP_001980328.1| GG17083 [Drosophila erecta]
gi|190652031|gb|EDV49286.1| GG17083 [Drosophila erecta]
Length = 295
Score = 324 bits (830), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 156/288 (54%), Positives = 197/288 (68%), Gaps = 7/288 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G A+A+ +
Sbjct: 4 RRATHAGSWYTDSGTELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A Y+TP+ DL +D ++ EL+ TG+F MD+
Sbjct: 63 SPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDSQINAELEKTGQFSWMDMKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E + IVPILVG++N E EA YG L + Y+ DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSPYLMDPTNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF+Y +YD+ G IHKSIE LDK GMDIIET P +F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGAIHKSIEKLDKQGMDIIETLSPHSFTEYLRKYNNTICGR 242
Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L + K+ KFL+Y QSSQC+ DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDMEDSSVSYASGS 290
>gi|259155194|ref|NP_001158838.1| protein MEMO1 [Salmo salar]
gi|223647658|gb|ACN10587.1| MEMO1 [Salmo salar]
Length = 297
Score = 324 bits (830), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 160/290 (55%), Positives = 199/290 (68%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL +A SP R +IAPHAGYSY G AA+A+ I
Sbjct: 8 REASHAGSWYTASGSQLNTQLEGWLSQAQSTVSP-ARAIIAPHAGYSYCGACAAHAYKQI 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ RVF+LGPSHH +CALS+A +YKTP+ DL +D +V +L TG FE M +
Sbjct: 67 DPSVTRRVFILGPSHHVPLSRCALSSAEIYKTPLYDLRIDQKVYADLWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H + IVP+LVGA++ E YG+L ++Y+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDELSIVPVLVGALSESKEQDYGKLLSRYLADPSNLFII 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G IH+SIE LDKMGM IIET DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFCYTYYDEAQGEIHRSIEHLDKMGMGIIETLDPMSFTNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L T++ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAVSELRKSGTEMNFSFLNYAQSSQCRNWQDSSVSYAAGALIV 296
>gi|195037481|ref|XP_001990189.1| GH19198 [Drosophila grimshawi]
gi|193894385|gb|EDV93251.1| GH19198 [Drosophila grimshawi]
Length = 295
Score = 323 bits (829), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 159/292 (54%), Positives = 194/292 (66%), Gaps = 7/292 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G A+A+ +
Sbjct: 4 RRATHAGSWYTDSGSDLSRQLDRWLGAADLSHGP-ARAIIAPHAGYAYCGACGAFAYRQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS YKTP+ DL +D EV EL+ TG+F MDI
Sbjct: 63 SPVVVKRIFILGPSHHVRLRGCALSVTKKYKTPLYDLKIDTEVNAELEKTGQFTWMDIKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E IVPILVG++N E EA YG L A Y DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDFKDQFTIVPILVGSLNPEQEAQYGSLLATYFMDPTNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF+Y +YD+ G IHK IE LDK GMDIIET P AF +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSSGPIHKCIEQLDKQGMDIIETLSPTAFTEYLRKYNNTICGR 242
Query: 243 HPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
HPI V L + + + KFL+Y QSSQC DSSVSYAS + ++
Sbjct: 243 HPIGVMLSAIKALQDQGFKNMSFKFLKYAQSSQCLDMEDSSVSYASGSLIIE 294
>gi|194743160|ref|XP_001954068.1| GF18089 [Drosophila ananassae]
gi|190627105|gb|EDV42629.1| GF18089 [Drosophila ananassae]
Length = 295
Score = 322 bits (824), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 155/288 (53%), Positives = 195/288 (67%), Gaps = 7/288 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G A+A+ +
Sbjct: 4 RRATHAGSWYTDSGSELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A Y+TP+ DL +D ++ EL+ TG+F MD+
Sbjct: 63 SPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDTQINAELEKTGQFSWMDMKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E + IVPILVG++N + EA YG L + Y DP+N F +
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPDQEAQYGSLLSTYFMDPTNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF+Y +YD+ G IHKSIE LDK GMDIIE+ P AF +YL +Y NTICGR
Sbjct: 183 SSDFCHWGQRFSYTYYDRSCGPIHKSIEKLDKQGMDIIESLSPQAFTEYLRKYSNTICGR 242
Query: 243 HPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L + K+ KFL+Y QSSQC+ DSSVSYAS +
Sbjct: 243 HPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDMEDSSVSYASGS 290
>gi|340721535|ref|XP_003399175.1| PREDICTED: protein MEMO1-like [Bombus terrestris]
gi|350407005|ref|XP_003487952.1| PREDICTED: protein MEMO1-like [Bombus impatiens]
Length = 296
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 156/293 (53%), Positives = 201/293 (68%), Gaps = 8/293 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IRRA+HAGSWY + L ++L+GWL A L P R +IAPHAGYSY G A +A
Sbjct: 1 MALIRRATHAGSWYAGSGLDLNKQLEGWLGAADLSHGP-ARAIIAPHAGYSYCGACAGFA 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I P + R+F+LGPSHH CALS+A++Y+TP+ DL +D +V EL+ T FE M
Sbjct: 60 YRQISPVVVRRIFILGPSHHVRLAGCALSSASIYQTPLYDLHIDQQVYRELEETRHFEWM 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D+ DE EHS+EM LP++AKV EG I+PILVG+++ E EA+YGRL A Y+ DP
Sbjct: 120 DLNTDEEEHSIEMQLPFIAKVMEGFKDSFTIIPILVGSLSPEKEALYGRLLAPYMADPQT 179
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+SSDFCHWG RF Y +YD+ G I++SI+ LDKMGMDIIET +P F YL +Y NT
Sbjct: 180 LLVISSDFCHWGQRFRYTYYDRSCGPIYRSIQNLDKMGMDIIETLNPTMFTDYLKKYGNT 239
Query: 239 ICGRHPISVFLHML----GNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
ICGRHPISV L + GN + ++ +KFL+Y QS+QC DSSVSYASA+
Sbjct: 240 ICGRHPISVLLQAIHTLKGNTNGQRMNLKFLKYAQSNQCNNMNDSSVSYASAS 292
>gi|52345758|ref|NP_001004925.1| protein MEMO1 [Xenopus (Silurana) tropicalis]
gi|82183493|sp|Q6DJ03.1|MEMO1_XENTR RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
cell motility 1; Short=Memo-1
gi|49522376|gb|AAH75382.1| MGC89105 protein [Xenopus (Silurana) tropicalis]
gi|89269896|emb|CAJ83509.1| novel protein orthologous to C2orf4 [Xenopus (Silurana) tropicalis]
Length = 297
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 155/290 (53%), Positives = 197/290 (67%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L+ +LDGWL +A K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGSQLSAQLDGWLSQAQTSKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ RVF+LGPSHH +CALST +Y+TP+ DL +D +V +L TG FE M +
Sbjct: 67 DPSVTRRVFILGPSHHVALSRCALSTVDIYRTPLYDLHIDQKVYGDLWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H + IVP+LVGA++ E +G+LF+KY+ DP+N F +
Sbjct: 127 DEDEHSIEMHLPYTAKTMESHKDDLTIVPVLVGALSESKEQEFGKLFSKYLADPTNLFVI 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYTYYDESQGEIYRSIENLDKMGMGIIEQLDPVQFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAATELQKNGVNMSFSFLNYAQSSQCRSWQDSSVSYAAGALVV 296
>gi|242019097|ref|XP_002430002.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515060|gb|EEB17264.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 296
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/295 (54%), Positives = 197/295 (66%), Gaps = 11/295 (3%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HA SWY+D++K L +L WL +A L P R +IAPHAGY Y G AA+A+ I
Sbjct: 4 RRATHANSWYSDSAKELERQLQTWLGQAELTHGP-ARAIIAPHAGYQYCGSCAAFAYRQI 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH C LS A +TP DL +D +V +EL ATG+F +M I +
Sbjct: 63 SPAVVKRIFILGPSHHVRFSGCYLSPALKCETPFYDLTVDQQVHKELMATGEFNVMGISI 122
Query: 125 DEAEHSMEMHLPYLAKV---FEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DEAEHS+EMHLPY+AKV F+ + IVPILVG+ NA+ EA+YG + A Y+ DP N F
Sbjct: 123 DEAEHSIEMHLPYIAKVMAEFKDYF-TIVPILVGSTNAKEEALYGSILAPYLADPQNLFI 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
VSSDFCHWG RF Y YDK G IH+SI+ LDKMGMDIIE +P F YL +Y NTICG
Sbjct: 182 VSSDFCHWGQRFRYTFYDKSWGEIHQSIQTLDKMGMDIIEKLNPTEFTDYLKKYGNTICG 241
Query: 242 RHPISVFLHMLG------NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
RHPI V L + N KI +KFL+Y QSSQC DSSVSYASAA +
Sbjct: 242 RHPIGVLLQAIAYLKNSSNFGNKINLKFLKYVQSSQCCRMSDSSVSYASAAVTFE 296
>gi|156551830|ref|XP_001604210.1| PREDICTED: protein MEMO1-like [Nasonia vitripennis]
Length = 296
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 155/297 (52%), Positives = 199/297 (67%), Gaps = 8/297 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M RRASH+GSWY++N L ++L WL A L P R +I+PHAGYSY G AA+A
Sbjct: 1 MALARRASHSGSWYSENGFELNKQLGDWLNAADLSHGP-ARAIISPHAGYSYCGACAAFA 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I P + ++F+LGPSHH P CAL+ A VYKTP+ DL +D + +EL TG FE M
Sbjct: 60 YRQISPVVVRKIFILGPSHHVRLPGCALTQAQVYKTPLYDLHVDQRINKELADTGHFEWM 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D+ DE EHS+EM LPY+AKV E ++PILVG+++ E EAMYGRL A Y+ DP
Sbjct: 120 DMDTDEDEHSIEMQLPYIAKVMEDFKDSFTVIPILVGSLSPEREAMYGRLLAPYMADPQT 179
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDFCHWG RF Y +YD+ G IHKSI+ LD+MGM+IIE P +F +YL +Y NT
Sbjct: 180 LFIISSDFCHWGQRFRYTYYDRSCGAIHKSIKNLDQMGMNIIENMTPSSFTEYLRKYGNT 239
Query: 239 ICGRHPISVFLHMLGNC-----STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
ICGRHPI V L + + ++ +KFL+Y QSSQC DSSVSYASA+ V+
Sbjct: 240 ICGRHPIGVLLQAIQSLKETTNGQRMTLKFLKYAQSSQCNNMNDSSVSYASASMVVE 296
>gi|148223639|ref|NP_001085839.1| protein MEMO1 [Xenopus laevis]
gi|82184294|sp|Q6GNT9.1|MEMO1_XENLA RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
cell motility 1; Short=Memo-1
gi|49116700|gb|AAH73413.1| MGC80879 protein [Xenopus laevis]
Length = 297
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 154/290 (53%), Positives = 197/290 (67%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWY+ + +L+ +LDGWL +A K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYSASGSQLSAQLDGWLSQAQTSKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ RVF+LGPSHH +CALST +Y+TP+ DL +D +V EL TG FE M +
Sbjct: 67 DPSVTRRVFILGPSHHVALSRCALSTVDIYRTPLYDLHVDQKVYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H + IVP+LVGA++ E +G++F+KY+ DP+N F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDDLTIVPVLVGALSESKEQEFGKVFSKYLADPTNLFVI 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYTYYDESQGEIYRSIENLDKMGMSIIEQLDPVQFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAATELQKNGVNMSFSFLNYAQSSQCRSWQDSSVSYAAGALVV 296
>gi|195571271|ref|XP_002103627.1| GD20526 [Drosophila simulans]
gi|194199554|gb|EDX13130.1| GD20526 [Drosophila simulans]
Length = 295
Score = 320 bits (820), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 157/289 (54%), Positives = 197/289 (68%), Gaps = 7/289 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G AA+A+
Sbjct: 3 VRRATHAGSWYTDSGAELSGQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACAAFAYRQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + R+F+LGPSHH CALS A YKTP+ DL +D ++ EL+ TG+F M +
Sbjct: 62 VSPVVVKRIFILGPSHHVRLRVCALSAAKKYKTPLYDLKIDSQINAELEKTGEFTWMSMK 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EMHLPY+AKV E + IVPILVG++N E EA YG L A Y+ DP+N F
Sbjct: 122 TDEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLAPYLMDPTNLFV 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF+Y +YD G IHKSIE LDK GMDIIE+ P +F +YL +Y+NTICG
Sbjct: 182 ISSDFCHWGHRFSYTYYDSTCGAIHKSIEKLDKQGMDIIESLSPHSFTEYLRKYNNTICG 241
Query: 242 RHPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
RHPI V L + K+ KFL+Y QSSQC+ DSSVSYAS +
Sbjct: 242 RHPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDIDDSSVSYASGS 290
>gi|195329376|ref|XP_002031387.1| GM25966 [Drosophila sechellia]
gi|194120330|gb|EDW42373.1| GM25966 [Drosophila sechellia]
Length = 295
Score = 319 bits (818), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 157/289 (54%), Positives = 197/289 (68%), Gaps = 7/289 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G AA+A+
Sbjct: 3 VRRATHAGSWYTDSGAELSGQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACAAFAYRQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + R+F+LGPSHH CALS A YKTP+ DL +D ++ EL+ TG+F M +
Sbjct: 62 VSPVVVKRIFILGPSHHVRLRGCALSGAKKYKTPLYDLKIDSQINAELEKTGEFTWMSLK 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EMHLPY+AKV E + IVPILVG++N E EA YG L A Y+ DP+N F
Sbjct: 122 TDEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLAPYLMDPTNLFV 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF+Y +YD G IHKSIE LDK GMDIIE+ P +F +YL +Y+NTICG
Sbjct: 182 ISSDFCHWGHRFSYTYYDSTCGAIHKSIEKLDKQGMDIIESLSPHSFTEYLRKYNNTICG 241
Query: 242 RHPISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
RHPI V L + K+ KFL+Y QSSQC+ DSSVSYAS +
Sbjct: 242 RHPIGVMLGAVKALQDQGYDKMSFKFLKYAQSSQCQDIDDSSVSYASGS 290
>gi|443695939|gb|ELT96731.1| hypothetical protein CAPTEDRAFT_154277 [Capitella teleta]
Length = 297
Score = 319 bits (818), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 158/300 (52%), Positives = 208/300 (69%), Gaps = 15/300 (5%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAY 59
M ++R+ASHAGSWYT + + L +EL WL +A P E R +IAPHAGYSY G +A+
Sbjct: 1 MSRVRKASHAGSWYTGDGEGLNKELGKWLSKA--PAGGEHARAIIAPHAGYSYCGACSAH 58
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ IDP+ + RVF+LGPSHH CA++ A Y+TP+ DL +D ++ EEL +T FE
Sbjct: 59 AYKQIDPSRVHRVFILGPSHHVRLNGCAITAAEKYQTPLYDLTIDAQISEELYSTSMFEK 118
Query: 120 MDICVDEAEHSMEMHLPYLAKVFE---GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
M DE EHS+EMHLPY+AKV + GH ++P+LVG++N + EA YGR+F++Y+ DP
Sbjct: 119 MSNQTDEDEHSIEMHLPYVAKVMQSRQGHFT-VIPVLVGSLNPDKEAAYGRIFSRYLADP 177
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
+N F +SSDFCHWG RF Y +YDK G I+KSIE LD+MGM++IE DP AF YL +Y
Sbjct: 178 NNLFVISSDFCHWGQRFRYTYYDKSKGDIYKSIEHLDRMGMNLIEEMDPPAFTAYLEQYG 237
Query: 237 NTICGRHPISVFLHML-------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
NTICGRHPI V L+ + GN + + FL+Y QSSQC T +DSSVSYASAA +
Sbjct: 238 NTICGRHPIGVLLNSIKCLRESRGN-GHRYSLHFLKYAQSSQCLTNKDSSVSYASAALTI 296
>gi|281204139|gb|EFA78335.1| hypothetical protein PPL_08986 [Polysphondylium pallidum PN500]
Length = 289
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 153/290 (52%), Positives = 192/290 (66%), Gaps = 1/290 (0%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IR +HAGSWY+ N+ L+++LD WL E R +I PHAGYSYSGRAA+
Sbjct: 1 MATIRSCTHAGSWYSRNASELSKQLDNWLGETK-SIGENTRAIIVPHAGYSYSGRAASRG 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ N+D + R+F+LGPSHH Y C L+ T ++TPIG+L +D EVI++L T F
Sbjct: 60 YINLDASKYKRIFILGPSHHVYMKTCGLTKLTHFETPIGNLEVDTEVIKKLDDTSIFTWT 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
VDE EHS+E+HLPY+AK+ + IKIVPI+VG+++ + E YG++ A Y DDP NFF
Sbjct: 120 TKDVDEDEHSIELHLPYVAKMIQDKKIKIVPIMVGSLSKDTEEQYGKVLAPYFDDPENFF 179
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF Y HY++ G IHKSIE LD+ GM +IETGDP F YL E NTIC
Sbjct: 180 VISSDFCHWGKRFGYTHYNQSAGPIHKSIEELDRTGMSMIETGDPLQFSTYLKETKNTIC 239
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
GR+PI+V L N K IK L YEQSSQC DSSVSY A K D
Sbjct: 240 GRNPIAVMLWTAKNSRNKHSIKSLHYEQSSQCMNMNDSSVSYGVLAIKQD 289
>gi|91094281|ref|XP_970827.1| PREDICTED: similar to CG8031 CG8031-PA [Tribolium castaneum]
gi|270014397|gb|EFA10845.1| hypothetical protein TcasGA2_TC001622 [Tribolium castaneum]
Length = 295
Score = 318 bits (816), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 154/290 (53%), Positives = 200/290 (68%), Gaps = 8/290 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+A HAGSWYTD+ L+ +L+ WL +A L P R +IAPHAGY Y G +A+
Sbjct: 3 VRKAIHAGSWYTDSGPELSRQLEYWLHQADLIHGP-ARAIIAPHAGYQYCGACGGHAYRQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + R+F+LGPSHH CALSTA YKTP+ DL +D +V EL+ TG+FE MD+
Sbjct: 62 VSPVVVRRIFILGPSHHVRLSGCALSTARKYKTPLYDLHVDSQVNGELEMTGQFEWMDLD 121
Query: 124 VDEAEHSMEMHLPYLAKVFEG--HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EMHLPY+AKV E H IVPILVG+++ E EA YGR+ + Y+ DP N F
Sbjct: 122 TDENEHSIEMHLPYIAKVMENYKHQFTIVPILVGSLSPEREACYGRILSSYLGDPQNLFI 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF Y +YD+ +G I++SIE LD++GM+IIE +P F YL ++ NTICG
Sbjct: 182 ISSDFCHWGHRFRYTYYDRSYGSIYQSIEELDRLGMNIIENLNPTDFTNYLKKFGNTICG 241
Query: 242 RHPISVFLH----MLGNCSTK-IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
RHPI + L +L N S + +KFL+Y QSSQC+ DSSVSYASAA
Sbjct: 242 RHPIGILLQAVQELLRNDSRRNANLKFLKYAQSSQCRNMNDSSVSYASAA 291
>gi|320167720|gb|EFW44619.1| cell motility mediator [Capsaspora owczarzaki ATCC 30864]
Length = 300
Score = 317 bits (813), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 159/296 (53%), Positives = 192/296 (64%), Gaps = 9/296 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE---VRGVIAPHAGYSYSGRAAAYA 60
+R ASHAGSWY + RL +LDGWL A R +IAPHAGYSYSG AA+A
Sbjct: 5 VREASHAGSWYDEGGARLESQLDGWLAAANSDADSNHTPARAIIAPHAGYSYSGPTAAFA 64
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I P ++ RVF+LGPSHH Y CALS T Y TP+G+L LD +I EL AT FE M
Sbjct: 65 YKQIIPDNVQRVFILGPSHHVYLKGCALSRCTEYATPLGNLKLDAAIIAELNATQSFEFM 124
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
VDE EHS+EMHLPY+ KVF + IVPILVGA+ E EA+YG++ A Y+ +P+N
Sbjct: 125 TKSVDEEEHSIEMHLPYIRKVFNKNANATIVPILVGALATEKEAVYGKILAPYLANPANI 184
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
F VSSDFCHWGSRFNY + IH+SI LD GM +IE D D F YL NTI
Sbjct: 185 FVVSSDFCHWGSRFNYTYTTSSQVPIHESINILDHQGMALIERLDCDGFAAYLKSTKNTI 244
Query: 240 CGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
CGRHPI V L + T + +KF++Y QSS CKT+RDSSVSYASA+ +VD
Sbjct: 245 CGRHPIGVLLQAIRAVRTTDTAFQPNLKFVKYAQSSACKTQRDSSVSYASASLRVD 300
>gi|198433196|ref|XP_002131141.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 296
Score = 316 bits (810), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 148/288 (51%), Positives = 197/288 (68%), Gaps = 6/288 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRASHAGSWY + L +L WL +A R +I+PHAGYSY G AA+AF I
Sbjct: 4 RRASHAGSWYEGKGEDLDMQLSKWLSDASGSNHRPARAIISPHAGYSYCGSCAAHAFSQI 63
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP ++ +VF+LGPSHH+Y +CALS + Y TP+ +L +D EV +EL TGKF M V
Sbjct: 64 DPNTVEKVFILGPSHHFYLKECALSPTSTYITPLYNLKIDQEVYQELYGTGKFRTMSQSV 123
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EM LPY+AKV E I+PILVG+++ +E +YG + AKY+++P+N F +
Sbjct: 124 DEDEHSIEMQLPYIAKVMESKKGQFTIIPILVGSLDQASEKLYGEILAKYMNNPNNVFVI 183
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y YD++HG I+KSIEALD+ GM +IE DP+ F +YL Y NTICGR
Sbjct: 184 SSDFCHWGRRFQYTRYDEQHGEIYKSIEALDRKGMQLIENLDPEGFYEYLAAYKNTICGR 243
Query: 243 HPISVFLHMLGNCST----KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI+V L+ + +++ KFL Y+QSS+C+ DSSVSYA+ +
Sbjct: 244 HPIAVMLNAVKTAQNGNGLRLEFKFLDYKQSSKCRVHSDSSVSYAAGS 291
>gi|71361623|ref|NP_001025088.1| protein MEMO1 [Rattus norvegicus]
gi|81918134|sp|Q4QQR9.1|MEMO1_RAT RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
cell motility 1; Short=Memo-1
gi|67678458|gb|AAH98061.1| Mediator of cell motility 1 [Rattus norvegicus]
Length = 297
Score = 316 bits (809), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 152/290 (52%), Positives = 195/290 (67%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKSGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 296
>gi|19526994|ref|NP_598532.1| protein MEMO1 [Mus musculus]
gi|62899882|sp|Q91VH6.1|MEMO1_MOUSE RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
cell motility 1; Short=Memo-1
gi|17046305|gb|AAL34463.1|AF363447_1 C21orf19-like protein [Mus musculus]
gi|15489467|gb|AAH13819.1| Memo1 protein [Mus musculus]
Length = 297
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 152/290 (52%), Positives = 195/290 (67%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSVTRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 296
>gi|344288747|ref|XP_003416108.1| PREDICTED: protein MEMO1-like isoform 1 [Loxodonta africana]
Length = 297
Score = 315 bits (807), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 152/290 (52%), Positives = 195/290 (67%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 296
>gi|327262347|ref|XP_003215986.1| PREDICTED: protein MEMO1-like isoform 1 [Anolis carolinensis]
Length = 297
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 153/290 (52%), Positives = 194/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L+ +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLSAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D +V EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKVYAELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H IVP+LVGA++ E +G+LF+KY+ DP N F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIVPVLVGALSETKEQEFGKLFSKYLADPCNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 296
>gi|193697516|ref|XP_001942616.1| PREDICTED: protein MEMO1-like [Acyrthosiphon pisum]
Length = 291
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 152/290 (52%), Positives = 200/290 (68%), Gaps = 4/290 (1%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRASH+G+WY +++K L+ +L+ WL A L P R +IAPHAGY Y G AA+A+
Sbjct: 3 VRRASHSGTWYNESAKELSTQLENWLGAAELTHGP-ARAIIAPHAGYQYCGACAAFAYRQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I P + R+F+LGPSH+ CALS+A Y TP DL +D EV EEL++TG FE M +
Sbjct: 62 ISPFVVQRIFILGPSHNVRLSGCALSSAVKYSTPFYDLQIDNEVYEELESTGHFERMSMK 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
VDE EHS+EM LPY+AKV + + K IVP++VG+++ E E++YG +FA+Y+ DP N F
Sbjct: 122 VDEDEHSIEMQLPYIAKVMQDYKDKFTIVPVMVGSLSTEKESLYGHIFAQYLADPKNLFI 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF Y Y+K G IH+SI LD GMD IET P AF +YL + NTICG
Sbjct: 182 ISSDFCHWGQRFRYTFYEKSWGEIHQSITTLDHKGMDTIETLKPAAFVEYLKTFSNTICG 241
Query: 242 RHPISVFLHMLGNCST-KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
RHPISV L + T K+ +KFL+Y QSS+C DSSVSYASA+ ++
Sbjct: 242 RHPISVLLQAANHLRTQKLSLKFLKYAQSSKCYNLGDSSVSYASASLIIE 291
>gi|7705720|ref|NP_057039.1| protein MEMO1 isoform 1 [Homo sapiens]
gi|349585097|ref|NP_001231806.1| mediator of cell motility 1 [Sus scrofa]
gi|114576869|ref|XP_001164110.1| PREDICTED: protein MEMO1 isoform 3 [Pan troglodytes]
gi|126303134|ref|XP_001371460.1| PREDICTED: protein MEMO1-like isoform 1 [Monodelphis domestica]
gi|402890505|ref|XP_003908527.1| PREDICTED: protein MEMO1 isoform 1 [Papio anubis]
gi|402890507|ref|XP_003908528.1| PREDICTED: protein MEMO1 isoform 2 [Papio anubis]
gi|7388490|sp|Q9Y316.1|MEMO1_HUMAN RecName: Full=Protein MEMO1; AltName: Full=C21orf19-like protein;
AltName: Full=Hepatitis C virus NS5A-transactivated
protein 7; Short=HCV NS5A-transactivated protein 7;
AltName: Full=Mediator of ErbB2-driven cell motility 1;
Short=Mediator of cell motility 1; Short=Memo-1
gi|75076342|sp|Q4R6D9.1|MEMO1_MACFA RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
cell motility 1; Short=Memo-1
gi|4680693|gb|AAD27736.1|AF132961_1 CGI-27 protein [Homo sapiens]
gi|17046303|gb|AAL34462.1|AF363446_1 C21orf19-like protein [Homo sapiens]
gi|17511762|gb|AAH18733.1| Mediator of cell motility 1 [Homo sapiens]
gi|33328304|gb|AAQ09602.1| NS5ATP7 [Homo sapiens]
gi|56377669|dbj|BAD74066.1| C2orf4 [Homo sapiens]
gi|66267174|gb|AAH94681.1| Mediator of cell motility 1 [Homo sapiens]
gi|67969978|dbj|BAE01336.1| unnamed protein product [Macaca fascicularis]
gi|119620872|gb|EAX00467.1| chromosome 2 open reading frame 4, isoform CRA_a [Homo sapiens]
gi|119620874|gb|EAX00469.1| chromosome 2 open reading frame 4, isoform CRA_a [Homo sapiens]
gi|119620875|gb|EAX00470.1| chromosome 2 open reading frame 4, isoform CRA_a [Homo sapiens]
gi|119620877|gb|EAX00472.1| chromosome 2 open reading frame 4, isoform CRA_a [Homo sapiens]
gi|123982438|gb|ABM82960.1| chromosome 2 open reading frame 4 [synthetic construct]
gi|123997099|gb|ABM86151.1| chromosome 2 open reading frame 4 [synthetic construct]
gi|380785185|gb|AFE64468.1| protein MEMO1 isoform 1 [Macaca mulatta]
gi|383411277|gb|AFH28852.1| protein MEMO1 isoform 1 [Macaca mulatta]
gi|410209454|gb|JAA01946.1| mediator of cell motility 1 [Pan troglodytes]
gi|410249514|gb|JAA12724.1| mediator of cell motility 1 [Pan troglodytes]
gi|410298038|gb|JAA27619.1| mediator of cell motility 1 [Pan troglodytes]
gi|410352083|gb|JAA42645.1| mediator of cell motility 1 [Pan troglodytes]
gi|417398534|gb|JAA46300.1| Putative dioxygenase [Desmodus rotundus]
Length = 297
Score = 314 bits (805), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 152/290 (52%), Positives = 194/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296
>gi|195395150|ref|XP_002056199.1| GJ10804 [Drosophila virilis]
gi|194142908|gb|EDW59311.1| GJ10804 [Drosophila virilis]
Length = 295
Score = 314 bits (805), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 154/289 (53%), Positives = 192/289 (66%), Gaps = 7/289 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G A+A+
Sbjct: 3 VRRATHAGSWYTDSGSDLSRQLDRWLGAADLSHGP-ARAIIAPHAGYAYCGACGAFAYRQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + R+F+LGPSHH CALS A +TP+ DL +D +V EL+ TG+F MD+
Sbjct: 62 VSPVVVKRIFILGPSHHVRLRGCALSVAKKCRTPLYDLKIDTQVNAELEKTGQFSWMDMK 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EMHLPY+AKV E IVPI+VG++N E EA YG L A Y DP+N F
Sbjct: 122 SDEDEHSIEMHLPYIAKVMEDFKDQFTIVPIIVGSLNPEQEAQYGSLLANYFMDPTNLFV 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF+Y +YD+ G IHK IE LDK GMDIIE P AF +YL +Y+NTICG
Sbjct: 182 ISSDFCHWGQRFSYTYYDRSCGPIHKCIEQLDKQGMDIIEKLSPTAFTEYLRKYNNTICG 241
Query: 242 RHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
RHPI V L + + + KFL+Y QSSQC DSSVSYAS +
Sbjct: 242 RHPIGVMLGAIKALQDQGFKNMSFKFLKYAQSSQCLDMEDSSVSYASGS 290
>gi|387016908|gb|AFJ50572.1| Protein MEMO1-like [Crotalus adamanteus]
Length = 297
Score = 314 bits (805), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 152/290 (52%), Positives = 195/290 (67%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L+ +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLSAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D +V +L TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLQIDQKVYTDLWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H IVP+LVGA++ E +G+LF+KY+ DP N F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIVPVLVGALSETKEQEFGKLFSKYLADPCNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM+IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDETQGEIYRSIEHLDKMGMNIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 296
>gi|161172368|pdb|3BD0|A Chain A, Crystal Structure Of Memo, Form Ii
gi|161172369|pdb|3BD0|B Chain B, Crystal Structure Of Memo, Form Ii
gi|161172370|pdb|3BD0|C Chain C, Crystal Structure Of Memo, Form Ii
gi|161172371|pdb|3BD0|D Chain D, Crystal Structure Of Memo, Form Ii
Length = 293
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 152/290 (52%), Positives = 194/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 4 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 63 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 123 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 183 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 242
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 243 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 292
>gi|47123293|gb|AAH70046.1| MEMO1 protein [Homo sapiens]
Length = 297
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 152/290 (52%), Positives = 193/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPHLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296
>gi|306482629|ref|NP_001182352.1| protein MEMO1 [Gallus gallus]
Length = 297
Score = 313 bits (801), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 150/290 (51%), Positives = 194/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP ++F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPNITRKIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSEAKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM+IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMNIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHMLGNCS---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ + + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAINELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 296
>gi|428176181|gb|EKX45067.1| hypothetical protein GUITHDRAFT_71667 [Guillardia theta CCMP2712]
Length = 295
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/293 (52%), Positives = 209/293 (71%), Gaps = 9/293 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR---GVIAPHAGYSYSGRAAAYA 60
+R ASHAGSWY L ++LD WL++ + KS E++ G+IAPHAGYSYSG AAYA
Sbjct: 3 VRVASHAGSWYAGRGDELKKQLDRWLQD--VKKSDELQPSCGLIAPHAGYSYSGPTAAYA 60
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ +DP+ I R+F+LGPSHHYY CALS + Y TP+G+L +D +V++EL ++G FE M
Sbjct: 61 YRYMDPSRIKRIFVLGPSHHYYLTGCALSKHSAYATPLGNLEIDTDVVKELHSSGLFEYM 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
VDE EHS+EM +PY+ KV E KIVP+LVG ++ + E +YG LF++Y++DPS
Sbjct: 121 SEEVDEDEHSLEMQMPYIYKVMEQASKSYKIVPVLVGNLSPKKEDVYGSLFSRYLNDPST 180
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
FF +SSDFCHWGSRF Y +YD G I++SIEALD+ GM IE+ P+ F+ Y +++NT
Sbjct: 181 FFVISSDFCHWGSRFRYTYYDPSFGEIYQSIEALDRQGMQEIESLSPENFQNYQAKFNNT 240
Query: 239 ICGRHPISVFLHML--GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
ICGRHPI+V L+ L + +++ I+F+RY QSS C+ + DSSVSYASA K
Sbjct: 241 ICGRHPIAVLLNALKASSAASQASIQFVRYAQSSPCRCQADSSVSYASAVVKT 293
>gi|158254942|dbj|BAF83442.1| unnamed protein product [Homo sapiens]
Length = 297
Score = 312 bits (799), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 151/290 (52%), Positives = 193/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHRVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296
>gi|328774127|gb|EGF84164.1| hypothetical protein BATDEDRAFT_18471 [Batrachochytrium
dendrobatidis JAM81]
Length = 299
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 154/286 (53%), Positives = 192/286 (67%), Gaps = 5/286 (1%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ASHAGSWY+ + ++L+ +LD W+ A + +P VR +IAPHAGYSYSG AAA+A+G I
Sbjct: 9 RKASHAGSWYSASQEQLSSQLDSWISTADITPNPNVRAIIAPHAGYSYSGPAAAWAYGAI 68
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
D I RVF+LGPSHH Y CALS T+YKTP+GDL +D +V+ LKATG+FE + I
Sbjct: 69 DSHQIKRVFILGPSHHVYIGGCALSPCTIYKTPLGDLTIDTDVVASLKATGQFETLSIQA 128
Query: 125 DEAEHSMEMHLPYLAKVFEGHLI--KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPYL K+ IVPILVGA + ++ Y +FA Y+ D +N F V
Sbjct: 129 DEDEHSVEMHLPYLYKIMSKKTTPYTIVPILVGASRSSDQHNYATIFAPYLADSTNLFIV 188
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG F Y + IH SIE LD+ M +IETG+ F+ YL NTICGR
Sbjct: 189 SSDFCHWGPNFGYTYQFDPSLQIHASIEKLDREAMRVIETGNSKDFEAYLKRTKNTICGR 248
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+PI V LH L N K+ IKF Y QSS+ TKRD+SVSYASA
Sbjct: 249 NPIGVLLHALECLENMGKKVDIKFTHYAQSSRALTKRDNSVSYASA 294
>gi|160774203|gb|AAI55285.1| Mediator of cell motility 1 [Danio rerio]
Length = 297
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 154/290 (53%), Positives = 192/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL +A P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGSQLNAQLEGWLSQAQSIAGP-ARAIIAPHAGYTYCGACAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ RVF+LGPSHH +CALS A VY+TP+ DL +D +V +L TG FE M +
Sbjct: 67 DPSITRRVFILGPSHHVPLSRCALSPAEVYRTPLYDLRIDQKVYADLWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H IVP+LVGA++ E YG+L +KY+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMENHKDEFSIVPVLVGALSESKEQEYGKLLSKYLADPSNLFII 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPISFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ + + FL Y QSSQC+ DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAVAELKKNGIDMNFSFLNYAQSSQCRNWSDSSVSYAAGALIV 296
>gi|157167680|ref|XP_001661638.1| hypothetical protein AaeL_AAEL011394 [Aedes aegypti]
gi|108872303|gb|EAT36528.1| AAEL011394-PA [Aedes aegypti]
Length = 293
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 153/287 (53%), Positives = 195/287 (67%), Gaps = 6/287 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYTD+ L +L WL +A L P R +IAPHAGY Y G A+A+ I
Sbjct: 4 REASHAGSWYTDSGSDLNRQLSKWLDDADLTFGP-ARAIIAPHAGYRYCGACGAWAYRQI 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + RVF+LGPSHH +CALS A +TP+ DL +D EV EL+ATG F+ MD
Sbjct: 63 SPAVVKRVFILGPSHHVRLSRCALSAAQYCRTPLYDLKVDQEVNAELEATGHFKWMDQKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E IVPI+VG+++ + E YG++ A Y+ DP N F +
Sbjct: 123 DEDEHSIEMHLPYVAKVMEDFRDQFTIVPIMVGSISNDWEETYGKILAPYLADPQNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG+RF Y +Y+ HG I+K IE LDKMGMD+IET P++F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGARFRYTYYEDSHGPIYKWIEVLDKMGMDLIETLKPESFGEYLRKYNNTICGR 242
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L L ++ KFL+Y+QS+QC K+DSSVSYAS +
Sbjct: 243 HPIGVLLQSVEELKRRGYRMSFKFLKYDQSNQCCDKKDSSVSYASGS 289
>gi|159483599|ref|XP_001699848.1| hypothetical protein CHLREDRAFT_182241 [Chlamydomonas reinhardtii]
gi|158281790|gb|EDP07544.1| predicted protein [Chlamydomonas reinhardtii]
Length = 285
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 150/286 (52%), Positives = 193/286 (67%), Gaps = 2/286 (0%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAY 59
M KIRR SHA SWYT L E++ W LP P R +I PHAGYSY G A+
Sbjct: 1 MPKIRRPSHAHSWYTGEGDALDAEINKWKAAVTPLPGLP-ARAIIGPHAGYSYCGHVMAH 59
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ +I+P + RVFLLGPSHH ++ KC LS+ T Y+TP+G + +D EV +L+ +GKF+
Sbjct: 60 AYAHINPANFKRVFLLGPSHHVFSRKCLLSSQTHYETPLGSMEIDQEVYAQLRDSGKFDS 119
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
M VDE EHS+E+H PY+ G K+VPI+VGA+ + EA YG L Y+DDP+N
Sbjct: 120 MSQDVDEDEHSLELHTPYIVHTMRGRDFKLVPIMVGALTTDAEAAYGSLLGPYLDDPANL 179
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
F VSSDFCHWGSRF+Y YDK G I +SI+ LD+ GM IIE+ +P F YL +Y NTI
Sbjct: 180 FVVSSDFCHWGSRFSYTFYDKSQGQIWESIKWLDETGMKIIESAEPSKFASYLEQYKNTI 239
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
CGRHPI VFL+ML + S + +I F +Y+QSS+C T SSVSYA+A
Sbjct: 240 CGRHPIGVFLNMLQHSSLRHRISFTKYDQSSKCTTPSHSSVSYAAA 285
>gi|170037682|ref|XP_001846685.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880969|gb|EDS44352.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 293
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 151/287 (52%), Positives = 196/287 (68%), Gaps = 6/287 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYTD+ L +L WL ++ L P R +IAPHAGY Y G A+A+ I
Sbjct: 4 REASHAGSWYTDSGSELNRQLSKWLDDSDLTFGP-ARAIIAPHAGYRYCGACGAWAYRQI 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + RVF+LGPSHH +CALS A +TP+ DL +D +V EL+ATG F+ MD
Sbjct: 63 SPAVVKRVFILGPSHHVRLSRCALSAAQFCRTPLYDLKVDQQVNAELEATGHFKWMDQKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E IVPI+VG+++ + E YG++ A Y+ DP N F +
Sbjct: 123 DEDEHSIEMHLPYVAKVMEDFRDQFTIVPIMVGSISNDWEETYGKILAPYLADPQNLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG+RF Y +Y++ HG I+K IE LDKMGMD+IET P++F +YL +Y+NTICGR
Sbjct: 183 SSDFCHWGARFRYTYYEESHGPIYKWIEVLDKMGMDLIETLKPESFAEYLRKYNNTICGR 242
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V L L ++ KFL+Y+QS+QC K+DSSVSYAS +
Sbjct: 243 HPIGVLLQSVEELKRRGYRMSFKFLKYDQSNQCCDKKDSSVSYASGS 289
>gi|348507064|ref|XP_003441077.1| PREDICTED: protein MEMO1-like isoform 1 [Oreochromis niloticus]
Length = 297
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 153/290 (52%), Positives = 194/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWY+ + +L +L+GWL +A P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYSASGSQLNAQLEGWLSQAQSTIRP-ARAIIAPHAGYTYCGACAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ RVF+LGPSHH +CALS A +Y+TP+ DL +D +V +L TG FE M +
Sbjct: 67 DPSITRRVFILGPSHHVPLSRCALSPADIYRTPLYDLRIDQKVYADLWKTGLFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H IVP+LVGA++ E YG+L +KY+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFSIVPVLVGALSESKEQEYGKLLSKYLADPSNLFII 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPMSFTNYLKKYRNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L ++ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAVAELRKSGLEMNFTFLNYAQSSQCRNWQDSSVSYAAGALIV 296
>gi|343962447|dbj|BAK62811.1| protein C2orf4 [Pan troglodytes]
Length = 297
Score = 310 bits (794), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 151/290 (52%), Positives = 193/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQ + +DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQRRNWQDSSVSYAAGALTV 296
>gi|41054231|ref|NP_956088.1| protein MEMO1 [Danio rerio]
gi|82177098|sp|Q803S3.1|MEMO1_DANRE RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
cell motility 1; Short=Memo-1
gi|27882485|gb|AAH44360.1| Mediator of cell motility 1 [Danio rerio]
Length = 297
Score = 310 bits (793), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 153/290 (52%), Positives = 191/290 (65%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL +A P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGSQLNAQLEGWLSQAQSIAGP-ARAIIAPHAGYTYCGACAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ RVF+LGPSHH +CALS A VY+TP+ DL +D +V +L TG FE M +
Sbjct: 67 DPSITRRVFILGPSHHVPLSRCALSPAEVYRTPLYDLRIDQKVYADLWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H IVP+LVGA++ E YG+L +KY+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMENHKDEFSIVPVLVGALSGSKEQEYGKLLSKYLADPSNLFII 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
S DFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SPDFCHWGQRFRYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPISFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ + + FL Y QSSQC+ DSSVSYA+ A V
Sbjct: 247 HPIGVLLNAVAELKKNGIDMNFSFLNYAQSSQCRNWSDSSVSYAAGALIV 296
>gi|74198004|dbj|BAE35184.1| unnamed protein product [Mus musculus]
Length = 302
Score = 309 bits (792), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 152/295 (51%), Positives = 195/295 (66%), Gaps = 11/295 (3%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSVTRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHW-----GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
SSDFCHW G RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y N
Sbjct: 187 SSDFCHWGYLFQGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHN 246
Query: 238 TICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
TICGRHPI V L+ L + FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 247 TICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 301
>gi|328872880|gb|EGG21247.1| hypothetical protein DFA_01125 [Dictyostelium fasciculatum]
Length = 288
Score = 307 bits (787), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 145/279 (51%), Positives = 188/279 (67%), Gaps = 1/279 (0%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A H G+WY +S +L ++L GWL E K + +I PHAGY YSGRAA+ + N+
Sbjct: 4 RAAYHGGNWYASDSNKLGQQLSGWLGETKSIKK-NTKALIVPHAGYDYSGRAASRGYINL 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
T+ R+F+LGPSHH YT C LS T +TP+GDL +D+E+ +L TG+F M +
Sbjct: 63 SDTTFKRIFVLGPSHHVYTKSCGLSKHTHLETPVGDLKVDVEITNKLNLTGQFTWMTKEI 122
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
DE EHS+EMH PY+A+V +G I IVP+LVG V EN Y +FA Y+DDP NFF +SS
Sbjct: 123 DEEEHSLEMHFPYIAQVTKGRDITIVPVLVGNVPKENLKTYADIFAPYLDDPDNFFIISS 182
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWGSRF+Y +D G IH+ IE +DK GM IE+GDP F YL E NTICGR+P
Sbjct: 183 DFCHWGSRFDYSPHDPSCGAIHQFIEKMDKNGMKAIESGDPSVFADYLKETKNTICGRNP 242
Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
IS+FL ++ + K +I+ L YEQSS+C + RDSSVSY
Sbjct: 243 ISLFLWIIHSSKLKFEIQSLYYEQSSKCNSMRDSSVSYG 281
>gi|58392176|ref|XP_319155.2| AGAP010011-PA [Anopheles gambiae str. PEST]
gi|55236230|gb|EAA13930.2| AGAP010011-PA [Anopheles gambiae str. PEST]
Length = 295
Score = 307 bits (786), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 146/287 (50%), Positives = 195/287 (67%), Gaps = 6/287 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A+HAGSWYTD+ L +L+GWL +A L P R +IAPHAGY Y G A+A+ I
Sbjct: 6 RLATHAGSWYTDSGAELNRQLNGWLNDADLTFGP-ARAIIAPHAGYRYCGACGAWAYRQI 64
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + RVF+LGPSHH +CALS TP+ +L +D ++ ++L++TG F+ MD
Sbjct: 65 SPAVVKRVFILGPSHHVRLSRCALSAVQYCNTPLYNLKVDQQINQQLESTGHFKWMDQKT 124
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY+AKV E + IVP++VG++ E E YG++FA Y+ DP N F +
Sbjct: 125 DEDEHSIEMHLPYVAKVMEDYKDQFTIVPVMVGSITNEWEETYGKIFAPYLADPQNLFVI 184
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +Y++ G I+K IE LDKMGMD+IET D+F +YL +Y+NTICGR
Sbjct: 185 SSDFCHWGQRFRYTYYEEGSGPIYKWIETLDKMGMDLIETLKADSFSEYLRKYNNTICGR 244
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI V + L ++ KFL+Y+QS+QC K+DSSVSYAS +
Sbjct: 245 HPIGVLMQAVEELKRRGHRMSFKFLKYDQSNQCCDKKDSSVSYASGS 291
>gi|393248178|gb|EJD55685.1| UPF0103-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 305
Score = 304 bits (779), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 162/298 (54%), Positives = 187/298 (62%), Gaps = 17/298 (5%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD+S L +L WL AG + +IAPHAGYSYSG AAA+A+ I
Sbjct: 4 RRATHAGSWYTDDSDELQGQLTSWLASAGSAPIHGCKAIIAPHAGYSYSGPAAAWAYKCI 63
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
D T I RVF+LGPSHHYY P CALS Y TP+G+LP+D + EL+A+GKFE M
Sbjct: 64 DTTGIRRVFVLGPSHHYYLPGCALSVLDAYNTPVGNLPIDKTTVAELRASGKFEDMSRTT 123
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
DE EHS+EMHLPY+ K FEG I IVP+LVGA++ E E YG L A Y+ P F VSS
Sbjct: 124 DEDEHSLEMHLPYVRKAFEGLDIVIVPVLVGALDTEGERSYGALLAPYLSQPDTFTVVSS 183
Query: 185 DFCHWGSRFNY-MHYDK---------KHGV------IHKSIEALDKMGMDIIETGDPDAF 228
DFCHWG RF Y +HY G IHKSIEALD+ M +IE F
Sbjct: 184 DFCHWGIRFQYTLHYPNPPPSALAKLSRGARPPAHPIHKSIEALDREAMGVIEGRQHARF 243
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKI-KIKFLRYEQSSQCKTKRDSSVSYASA 285
YL NTICGRHPI V L L +IKF+RYEQSSQC DSSVSYASA
Sbjct: 244 MDYLRRTRNTICGRHPIGVLLAALETLGGDDHEIKFVRYEQSSQCVDVNDSSVSYASA 301
>gi|410901597|ref|XP_003964282.1| PREDICTED: protein MEMO1-like isoform 1 [Takifugu rubripes]
Length = 297
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 150/290 (51%), Positives = 193/290 (66%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASH+GSWY+ + +L +L+ WL +A + P R +IAPHAGYSY G AA+A+ +
Sbjct: 8 REASHSGSWYSASGSQLNAQLEDWLSKANSKRRP-ARAIIAPHAGYSYCGACAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DPT RVF+LGPSHH + CALS+A +Y+TP+ D+ +D +V EL TG F+ M+I
Sbjct: 67 DPTVTRRVFILGPSHHVHLTCCALSSAEIYRTPLYDMRIDQKVYAELWKTGLFDRMNIKT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E IVPILVGA++ E YG+L +KY+ DPSN F +
Sbjct: 127 DEDEHSIEMHLPYTAKAMESRKDDFSIVPILVGALSENKEHEYGKLLSKYLADPSNLFVI 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG+RF+Y +YD+ G I++SIE LDKMGM IIE DP +F Y+ +Y NTICGR
Sbjct: 187 SSDFCHWGNRFHYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPPSFSNYIKKYRNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QS QC+ SSVSYA+ A V
Sbjct: 247 HPIGVLLNAVAELRKSGLDMNFSFLNYAQSGQCRNWEQSSVSYAAGALYV 296
>gi|391341233|ref|XP_003744935.1| PREDICTED: protein MEMO1-like [Metaseiulus occidentalis]
Length = 300
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 151/292 (51%), Positives = 188/292 (64%), Gaps = 6/292 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A+HAGSWYT+ L +LDGWL+ G R +IAPHAGY YSG AA+A+ I
Sbjct: 6 RAATHAGSWYTEKENELRNQLDGWLQLVGNATHSPARAIIAPHAGYRYSGSTAAFAYKQI 65
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP + ++F+LGPSHH + CALS AT YKTP+ DL +D E+ +EL TG FE M +
Sbjct: 66 DPDRVRKIFILGPSHHVFLEGCALSPATTYKTPLMDLTIDKEIYDELYKTGLFEEMSLGT 125
Query: 125 DEAEHSMEMHLPYLAKVFEGHL---IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EM LPY+AKV E I+PI+VGA++ E EA+YGR AKY+ F
Sbjct: 126 DEDEHSIEMQLPYIAKVMERKTNGPFTIIPIMVGAISTEKEAIYGRALAKYLGREDVCFV 185
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
VSSDFCHWGSRF Y HYD+ G I +SI LD GM +IE P +F YL NTICG
Sbjct: 186 VSSDFCHWGSRFRYQHYDEAQGDIWESISKLDHDGMKLIEDLSPTSFASYLKSTGNTICG 245
Query: 242 RHPISVFLH---MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
RHPI V L+ L + +FLRY QS++ +++DSSVSYASAA VD
Sbjct: 246 RHPIGVLLNSIATLQQSPIRFNCEFLRYAQSNRVTSRQDSSVSYASAAFLVD 297
>gi|161172364|pdb|3BCZ|A Chain A, Crystal Structure Of Memo
gi|161172365|pdb|3BCZ|B Chain B, Crystal Structure Of Memo
gi|161172366|pdb|3BCZ|C Chain C, Crystal Structure Of Memo
gi|161172367|pdb|3BCZ|D Chain D, Crystal Structure Of Memo
Length = 293
Score = 302 bits (774), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 148/290 (51%), Positives = 189/290 (65%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 4 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE +
Sbjct: 63 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGXFERXSLQT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+E HLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 123 DEDEHSIEXHLPYTAKAXESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDK G IIE DP +F YL +Y NTICGR
Sbjct: 183 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKXGXSIIEQLDPVSFSNYLKKYHNTICGR 242
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 243 HPIGVLLNAITELQKNGXNXSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 292
>gi|227204473|dbj|BAH57088.1| AT2G25280 [Arabidopsis thaliana]
Length = 185
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 150/185 (81%), Positives = 168/185 (90%)
Query: 107 VIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG 166
+I+E++A GKF +MD+ VDEAEHSMEMHLPYLAKVFEG+ +K+VPILVGAV+ ENEAMYG
Sbjct: 1 MIKEIRAMGKFGMMDLRVDEAEHSMEMHLPYLAKVFEGNNVKVVPILVGAVSPENEAMYG 60
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
L AKYVDDP NFFSVSSDFCHWGSRFNYMHYD HG IHKSIEALDK GMDIIETGDPD
Sbjct: 61 ELLAKYVDDPKNFFSVSSDFCHWGSRFNYMHYDNTHGAIHKSIEALDKKGMDIIETGDPD 120
Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
AFKKYLLE++NTICGRHPIS+FLHML + S+KIKI FLRYEQSSQC+T RDSSVSYASAA
Sbjct: 121 AFKKYLLEFENTICGRHPISIFLHMLKHSSSKIKINFLRYEQSSQCQTMRDSSVSYASAA 180
Query: 287 AKVDA 291
AK++
Sbjct: 181 AKLET 185
>gi|380028966|ref|XP_003698154.1| PREDICTED: protein MEMO1-like [Apis florea]
Length = 303
Score = 302 bits (773), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 145/273 (53%), Positives = 190/273 (69%), Gaps = 8/273 (2%)
Query: 21 LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHH 80
L ++L+GWL A L P R +IAPHAGYSY G AA+A+ I P + R+F+LGPSHH
Sbjct: 28 LNKQLEGWLNAADLSHGP-ARAIIAPHAGYSYCGACAAFAYRQISPVVVRRIFILGPSHH 86
Query: 81 YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAK 140
CALS+A++Y+T + DL +D +V EL+ T FE MD+ DE EHS+EM LP++AK
Sbjct: 87 VRLAGCALSSASIYQTLLYDLHIDQQVYRELEETRHFEWMDLNTDEEEHSIEMQLPFIAK 146
Query: 141 VFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
V EG I+PILVG+++ E EA+YGRL A Y+ DP F +SSDFCHWG RF Y +Y
Sbjct: 147 VMEGFKDSFTIIPILVGSLSPEKEALYGRLLAPYMADPQTLFVISSDFCHWGQRFRYTYY 206
Query: 199 DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----GN 254
D+ G IH+SI+ LDKMGMDIIET +P F YL +Y NT+CGRHPIS+ L ++ GN
Sbjct: 207 DRSCGPIHRSIQNLDKMGMDIIETLNPTMFNDYLKKYGNTVCGRHPISILLQVIHSLKGN 266
Query: 255 CS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ ++ +KFL+Y QS+QC DSSVSYASA+
Sbjct: 267 TNGQRMNLKFLKYVQSNQCNNMNDSSVSYASAS 299
>gi|299755585|ref|XP_002912116.1| hypothetical protein CC1G_13648 [Coprinopsis cinerea okayama7#130]
gi|298411291|gb|EFI28622.1| hypothetical protein CC1G_13648 [Coprinopsis cinerea okayama7#130]
Length = 321
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 164/319 (51%), Positives = 202/319 (63%), Gaps = 35/319 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG-----LPKSPEVRGVIAPHAGYSYSGRAAA 58
IR ASHAGSWYT + + L +EL GWL++ P + + + VIAPHAGYSYSG AAA
Sbjct: 3 IREASHAGSWYTADGRELDQELSGWLQKVNPTDESFPIT-KSKAVIAPHAGYSYSGPAAA 61
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+A+ ID T I RVF+LGPSHH+Y CALS T Y+TPIG+LPLDLE I+ELKA+G+F
Sbjct: 62 WAYKAIDTTGIKRVFILGPSHHFYLNGCALSPCTEYETPIGNLPLDLETIQELKASGRFA 121
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
M DE EHS+EMHLPY+ K+F+G IKIVPI+VG++N EA +G L A Y++
Sbjct: 122 NMSRKADEDEHSLEMHLPYIRKIFQGKDIKIVPIVVGSINQRQEAEFGALLAPYLERQDT 181
Query: 179 FFSVSSDFCHWGSRFNYMHY----DKKHGV---------------IHKSIEALDKMGMDI 219
F VSSDFCHWG+RF+Y +Y D G IHKSIE LD M +
Sbjct: 182 FTVVSSDFCHWGTRFSYTYYYPEADTPVGSGVSLNRSTAPTSSHPIHKSIERLDHEAMQL 241
Query: 220 IE------TGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC----STKIKIKFLRYEQS 269
+ T +AF KYL + NTICGRHPI V L + S +K++RYEQS
Sbjct: 242 LSVPPGTATKAHEAFAKYLKQTGNTICGRHPIGVLFGALADLEKTKSLTPIVKWVRYEQS 301
Query: 270 SQCKTKRDSSVSYASAAAK 288
S C T RDSSVSYASA K
Sbjct: 302 SHCHTVRDSSVSYASAYIK 320
>gi|82802793|gb|ABB92446.1| rcC2orf4 [Gorilla gorilla]
Length = 297
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 146/290 (50%), Positives = 189/290 (65%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYAYCRSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CAL++ +Y+TP+ DL +D ++ EL TG F M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALASVDIYRTPLNDLLIDQKIYGELWKTGMFGRMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EH +EMHLPY AK E H I+P+ VGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHGIEMHLPYTAKAMESHKDEFTIIPLPVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HP V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPTGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296
>gi|239789321|dbj|BAH71291.1| ACYPI001056 [Acyrthosiphon pisum]
Length = 291
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 148/290 (51%), Positives = 194/290 (66%), Gaps = 4/290 (1%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRASH+G+WY +++K L+ +L+ WL A L P R +IAPHAGY Y G AA+A+
Sbjct: 3 VRRASHSGTWYNESAKELSTQLENWLGAAELTHGP-ARAIIAPHAGYQYCGACAAFAYRQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I P + R+F+LGPSH+ CALS+A Y TP DL +D EV EEL++TG FE M +
Sbjct: 62 ISPFVVQRIFILGPSHNVRLSGCALSSAVKYSTPFYDLQIDNEVYEELESTGHFERMSMK 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
VDE EHS+EM LPY+AKV + + K IVP++VG+++ E E++YG Y+ DP N F
Sbjct: 122 VDEDEHSIEMQLPYIAKVMQDYKDKFTIVPVMVGSLSTEKESLYGPYIRSYLADPKNLFI 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG F Y Y+K G IH+SI LD GMD IET P AF +YL + NTICG
Sbjct: 182 ISSDFCHWGQGFRYTFYEKSWGEIHQSITTLDHKGMDTIETLKPAAFVEYLKTFSNTICG 241
Query: 242 RHPISVFLHMLGNCST-KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
RHPISV L + T K+ +KFL+Y QSS+C DSSVSYA A+ ++
Sbjct: 242 RHPISVLLQAANHLRTQKLSLKFLKYAQSSKCYNLGDSSVSYAFASLIIE 291
>gi|156368571|ref|XP_001627766.1| predicted protein [Nematostella vectensis]
gi|156214686|gb|EDO35666.1| predicted protein [Nematostella vectensis]
Length = 296
Score = 301 bits (770), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 145/290 (50%), Positives = 192/290 (66%), Gaps = 9/290 (3%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWY+ +S+ L +LDGWL + + P R +IAPHAGYSY G AYA+ +
Sbjct: 4 RRATHAGSWYSSHSRDLNSQLDGWLSKVDVNYRP-ARAIIAPHAGYSYCGSCGAYAYKQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DPT + RVF+LGPSHH + CAL+ ++Y+TP+ +L +D V EL TG F+ M
Sbjct: 63 DPTVVKRVFILGPSHHVHLRGCALTKNSIYQTPLYNLEVDKAVNAELMGTGFFDEMSSKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+E+HLPY+AK E +VPILVG+++ + E YG+L + Y+ +P N F +
Sbjct: 123 DEDEHSIELHLPYIAKAMESKRGQYTVVPILVGSLSPDKEKKYGKLLSTYLMNPENLFVI 182
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF + +YD ++ IH+SIEALDK GMD IE DP F +YL +Y NTICGR
Sbjct: 183 SSDFCHWGKRFKFYYYDSRYDRIHQSIEALDKKGMDFIERLDPQGFSEYLEKYGNTICGR 242
Query: 243 HPISVFLH------MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
HPI VFL+ L S + +FL Y QS+QC+ DSSVSYA+
Sbjct: 243 HPIGVFLNALSALSQLNGMSPSMSFRFLHYSQSNQCEDFDDSSVSYAAGV 292
>gi|66811492|ref|XP_639926.1| hypothetical protein DDB_G0284869 [Dictyostelium discoideum AX4]
gi|74854058|sp|Q54NZ1.1|MEMO1_DICDI RecName: Full=Protein MEMO1 homolog
gi|60466854|gb|EAL64898.1| hypothetical protein DDB_G0284869 [Dictyostelium discoideum AX4]
Length = 290
Score = 300 bits (769), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 147/282 (52%), Positives = 190/282 (67%), Gaps = 3/282 (1%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR +HAGSWY DN+++L ++L WL EA ++ V+ +IAPHAGYSYSGRAAAYA+ N
Sbjct: 2 IRNCTHAGSWYLDNARKLEKQLSDWLSEAS-RQNQNVKSIIAPHAGYSYSGRAAAYAYIN 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + RVF+LGPSHH Y C L+ ++TPIG+L +D + +L TG F
Sbjct: 61 LIPENYKRVFILGPSHHVYMKTCGLTKLDTWETPIGNLKVDKDTTNKLFDTGSFIWNTKS 120
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
VDE EHS+E+ LPY+AKV E L++ IVPI+VG+++ + E +YG++ A Y DDP NFF
Sbjct: 121 VDEDEHSLELQLPYIAKVAEKRLLELSIVPIMVGSLSIDLEELYGKILAPYFDDPENFFV 180
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF Y Y+ K I+K IE LD+ GM+IIE+GDP F KYL E NTICG
Sbjct: 181 ISSDFCHWGERFGYTRYENKEVPIYKYIEELDRKGMEIIESGDPVQFSKYLKETKNTICG 240
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
R PI+V L + N K I L YEQSS+ DSSVSY
Sbjct: 241 RCPIAVMLWITKNSQFKYNISSLYYEQSSKVIKSSDSSVSYG 282
>gi|340374122|ref|XP_003385587.1| PREDICTED: protein MEMO1-like [Amphimedon queenslandica]
Length = 299
Score = 300 bits (768), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 148/279 (53%), Positives = 186/279 (66%), Gaps = 8/279 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAA 57
M R+ASHAGSWY+ + +L +L+ WL +A G +P R +IAPHAGYSYSG A
Sbjct: 1 MTSTRKASHAGSWYSGSGDKLTSQLEEWLGKATSSGAKHAP-ARAIIAPHAGYSYSGPCA 59
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
AYA+ I P I RVF+LGPSHH Y P CALS + Y+TP+ +L +D E+ L +TGKF
Sbjct: 60 AYAYNQIIPAGIKRVFILGPSHHKYMPDCALSACSHYETPLYNLEIDREIYHSLYSTGKF 119
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDD 175
MD+ DE EHS+EMHLPY+AKV E IVPILVGA+ + EA YG+L KY+ D
Sbjct: 120 SQMDVRTDEDEHSIEMHLPYIAKVMESKRGSFTIVPILVGALKPDKEAEYGQLLHKYLLD 179
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
N F +SSDFCHWG RF + +YDK+HG IH SIEALD+MGM +IE D F+KYL Y
Sbjct: 180 EQNLFVISSDFCHWGKRFQFTYYDKEHGPIHSSIEALDRMGMSLIEALDTKGFQKYLQTY 239
Query: 236 DNTICGRHPISVFLHML--GNCSTKIKIKFLRYEQSSQC 272
NTICGRHPI V L+ + +KF +Y QS++C
Sbjct: 240 RNTICGRHPIGVLLNAIDAAQGEKSFVLKFTQYAQSNRC 278
>gi|255075349|ref|XP_002501349.1| predicted protein [Micromonas sp. RCC299]
gi|226516613|gb|ACO62607.1| predicted protein [Micromonas sp. RCC299]
Length = 298
Score = 300 bits (768), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 158/296 (53%), Positives = 195/296 (65%), Gaps = 10/296 (3%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWY+ + RL++ELDG+L AG S R +I PHAGYSYSG AAA+ + N+
Sbjct: 4 REASHAGSWYSSDRARLSKELDGFLDAAGSHGSTP-RALIVPHAGYSYSGPAAAWGYKNV 62
Query: 65 DP-TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
D + RVFLLGPSHH + +CALS Y TP+GDL +D V EEL+ATG F MD+
Sbjct: 63 DAGAGVRRVFLLGPSHHVFLRRCALSKCAAYATPLGDLAVDRGVYEELRATGHFVDMDVD 122
Query: 124 VDEAEHSMEMHLPYLAKVF----EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
VDEAEHS+E+HLPY+ K F E +VPI+VGA++ EA YG + A Y+DD +N
Sbjct: 123 VDEAEHSLELHLPYIFKCFEVLPERDRPTLVPIMVGALSTNAEATYGEILAPYLDDDANL 182
Query: 180 FSVSSDFCHWGSRFNYMHYD--KKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
F VSSDFCHWG RF Y +D +K +H+ IE +D+ GM+IIE+ D DAF YL E N
Sbjct: 183 FIVSSDFCHWGKRFGYQPWDEGRKDVGLHEQIEEMDRRGMEIIESKDADAFAAYLKETGN 242
Query: 238 TICGRHPISVFLHMLGNCS--TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
TICGRHPI V L L K +++F RYEQSS DSSVSYASA DA
Sbjct: 243 TICGRHPIGVLLQTLRRSGGFDKTRVEFTRYEQSSAATGWNDSSVSYASAVVFRDA 298
>gi|390604307|gb|EIN13698.1| UPF0103-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 333
Score = 300 bits (767), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 162/327 (49%), Positives = 201/327 (61%), Gaps = 45/327 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----PEVRG---VIAPHAGYSYSGRA 56
+R ASHAGSWYTD+ +RL +L GWL++ + P + G +I PHAGYSYSG A
Sbjct: 3 VREASHAGSWYTDDRQRLNTQLSGWLQDVKPQEDDNFHPPIEGCKAIIGPHAGYSYSGPA 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA+A+ +ID I RVF+LGPSHH Y CALS Y+TPIGDLPLDLE I++L+ TGK
Sbjct: 63 AAWAYKSIDTVGIRRVFVLGPSHHVYLDGCALSRCNGYETPIGDLPLDLETIQDLRNTGK 122
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
F MD+ DE EHS+EMHLPY+ KVFEG IK+VPILVGA++ E E ++G L A Y+
Sbjct: 123 FSDMDLETDENEHSIEMHLPYIRKVFEGQNIKLVPILVGAIDTEKETLFGELIAPYLLRD 182
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDK-----------------------KHGVIHKSIEALD 213
F VSSDFCHWG+RF Y +Y +H IH SI ALD
Sbjct: 183 DTFCVVSSDFCHWGARFRYTNYYPDPALVPSSVRATQLSTSTKATMYQHHPIHASISALD 242
Query: 214 KMGMDII----ETGDPDA---FKKYLLEYDNTICGRHPISVFLHML--------GNCSTK 258
M+I+ T DA F YL + NTICGRHPI V L L G+
Sbjct: 243 HEAMNILTIPANTATKDAHNVFAAYLAKTKNTICGRHPIGVLLGALAFIDSTKSGDQHNS 302
Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASA 285
I ++++RYEQSS+C + RDSSVSYASA
Sbjct: 303 ITMRWVRYEQSSKCASPRDSSVSYASA 329
>gi|119630156|gb|EAX09751.1| hCG401083 [Homo sapiens]
Length = 295
Score = 300 bits (767), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 149/290 (51%), Positives = 190/290 (65%), Gaps = 8/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE E S+EMHLPY AK E H I+P+LVGA++ +N LF+KY+ DPSN F V
Sbjct: 127 DEDERSIEMHLPYTAKAMESHKDEFTIIPLLVGALSQKNRN--SELFSKYLADPSNLFVV 184
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF + +YD+ G I +SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 185 SSDFCHWGQRFRHSYYDESQGEICRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 244
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 245 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 294
>gi|388496392|gb|AFK36262.1| unknown [Lotus japonicus]
Length = 177
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 148/172 (86%), Positives = 162/172 (94%)
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+MDI VDEAEHSMEMHLPYLAKVFEGH +KIVPILVGAV+AENEAMYG + AKYVDDP+N
Sbjct: 1 MMDIRVDEAEHSMEMHLPYLAKVFEGHPVKIVPILVGAVSAENEAMYGEILAKYVDDPTN 60
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
FFS+SSDFCHWG+RFNY HYDKKHG I+KSIEALDKMGMDIIE GDPD+FK+YL EYDNT
Sbjct: 61 FFSISSDFCHWGARFNYQHYDKKHGPIYKSIEALDKMGMDIIEIGDPDSFKRYLSEYDNT 120
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
ICGRHPISVFL+ML NCSTKIKIKFLRYEQSSQC++ RDSSVSYASAAAK+D
Sbjct: 121 ICGRHPISVFLYMLRNCSTKIKIKFLRYEQSSQCQSTRDSSVSYASAAAKID 172
>gi|307197875|gb|EFN78974.1| Protein MEMO1 [Harpegnathos saltator]
Length = 285
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 144/278 (51%), Positives = 190/278 (68%), Gaps = 10/278 (3%)
Query: 21 LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHH 80
L ++L+ WL A L P R +IAPHAGYSY G A +A+ I P + R+F+LGPSHH
Sbjct: 10 LRKQLEDWLSAADLSHGP-ARAIIAPHAGYSYCGECAGFAYRQISPVVVRRIFILGPSHH 68
Query: 81 YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAK 140
CALS+ +VY+TP+ DL +D +V +EL+ TG F+ MD+ DE EHS+EM LP++AK
Sbjct: 69 VRLTSCALSSVSVYQTPLYDLRIDQKVRKELEETGHFDWMDVNTDEEEHSIEMQLPFIAK 128
Query: 141 VFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
V E I+PILVG+++ E EA+YGRL A Y+ DP F +SSDFCHWG RF Y++Y
Sbjct: 129 VMEEFKDSFTIIPILVGSLSPEKEALYGRLLAPYMADPQALFVISSDFCHWGQRFRYIYY 188
Query: 199 DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC--- 255
D+ G IH+SI+ LDKMGM+IIET +P F YL +Y NTICGRHPI + L ++ NC
Sbjct: 189 DRSCGPIHRSIQNLDKMGMNIIETLNPAMFTDYLKKYGNTICGRHPIGILLQLI-NCLKG 247
Query: 256 ---STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
++ +KFL+Y QSSQC DSSVSYASA+ V+
Sbjct: 248 RTNGHRMNLKFLKYAQSSQCNNMNDSSVSYASASLVVE 285
>gi|225718386|gb|ACO15039.1| MEMO1 [Caligus clemensi]
Length = 300
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/291 (52%), Positives = 192/291 (65%), Gaps = 10/291 (3%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYAFG 62
R ASHAGSWY+ + K L +E+ GWL + P S R +I PHAG+ YSG AAYA+
Sbjct: 7 RMASHAGSWYSGDGKVLQKEMSGWLDKVS-PDSATFPARAIIGPHAGFRYSGPTAAYAYK 65
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I P I RVF+LGPSHH C LS +VY+TP+ DL +D EV EL F + +
Sbjct: 66 QIQPQGIKRVFILGPSHHITVNGCLLSGCSVYETPLYDLLVDKEVNRELMDAKGFGMATL 125
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
DE EHS+E+ LPY+AK E IVPILVG+++ + EA YG++ AKY+ DPSN F
Sbjct: 126 SADEDEHSLELRLPYIAKAMESRKGEFTIVPILVGSLSPDKEAKYGKILAKYLMDPSNLF 185
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF+Y +YDK+ G IH+SI +LD GMDIIE D DAF YL +Y NTIC
Sbjct: 186 VISSDFCHWGERFDYTYYDKEAGEIHESISSLDHKGMDIIENLDHDAFAAYLKKYRNTIC 245
Query: 241 GRHPISVFLHMLGNCSTK-----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
GRHPI VFL M+ + + +KFL+Y QS +C+T DSSVSYASAA
Sbjct: 246 GRHPIGVFLGMVKAIRQRGDAPEMCLKFLKYAQSDKCRTLSDSSVSYASAA 296
>gi|426392964|ref|XP_004062805.1| PREDICTED: protein MEMO1-like isoform 1 [Gorilla gorilla gorilla]
gi|426392966|ref|XP_004062806.1| PREDICTED: protein MEMO1-like isoform 2 [Gorilla gorilla gorilla]
Length = 297
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 145/290 (50%), Positives = 188/290 (64%), Gaps = 6/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYAYCRSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+ +LGPSHH +CAL++ +Y+TP+ DL +D ++ EL TG F M +
Sbjct: 67 DPSITRRILILGPSHHVPLSRCALASVDIYRTPLNDLLIDQKIYGELWKTGMFGRMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EH +EMHLPY AK E H I+P+ VGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHGIEMHLPYTAKAMESHKDEFTIIPLPVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HP V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 HPTGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296
>gi|29841028|gb|AAP06041.1| similar to GenBank Accession Number BC018733 CGI-27 protein in Homo
sapiens [Schistosoma japonicum]
Length = 298
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 145/294 (49%), Positives = 193/294 (65%), Gaps = 9/294 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRASHAGSWY+ + RL +L WL E + + P R +I+PHAGY YSG AA+A+
Sbjct: 5 VRRASHAGSWYSGDRARLNNQLQNWLSEVTVIRQP-ARAIISPHAGYDYSGPCAAFAYKQ 63
Query: 64 IDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
IDP I RVF+LGP+H+ KCALSTA +++TPI L +D +V +L TG+F + +
Sbjct: 64 IDPRLIKRVFVLGPAHYMSLRGKCALSTADLFETPIYSLSIDRDVYRDLDKTGEFVSLSL 123
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
DE EHS+EM +PY+AKV E H +VPILVG ++ E EA+YG++FA+Y+ DP NFF
Sbjct: 124 DRDEEEHSIEMQMPYVAKVMERHQGGFSVVPILVGYLSPEREAVYGQIFARYLSDPENFF 183
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF Y +YD+ G I +SI+ALD+ GM IIE F YL +Y NTIC
Sbjct: 184 VISSDFCHWGKRFQYTYYDQSKGPIWQSIQALDEAGMQIIERLAASEFTSYLEQYGNTIC 243
Query: 241 GRHPISVFLHMLGNCSTKI-----KIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
GRHPI V L + + T +KF+RY QS +C SSVSYA+ ++
Sbjct: 244 GRHPIGVLLQTVASLRTNAPNNNYSMKFVRYTQSERCNNLNQSSVSYAAGVLQI 297
>gi|298714279|emb|CBJ27395.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 351
Score = 296 bits (758), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 137/283 (48%), Positives = 195/283 (68%), Gaps = 2/283 (0%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAFG 62
RRASHAG+WY+D+ L ++L WL + P VR +IAPHAG+SYS + AA A+
Sbjct: 61 RRASHAGTWYSDSGPLLGQQLTDWLGASAHTTKPGSAVRAIIAPHAGFSYSAQTAASAYA 120
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++DP+ ++RVF+LGPSHH + +CA+STA+ +TPIGDL +D +V +L ATG F+ M
Sbjct: 121 HMDPSKLARVFVLGPSHHVHMRRCAVSTASRLETPIGDLAIDQDVTRQLLATGDFDAMTA 180
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
+DE EHS+EMHLP++ KV EG +V I++GA++AE E YGR+ A Y++D +NFF +
Sbjct: 181 RMDEDEHSIEMHLPFIKKVMEGRPCTVVAIMIGALSAELEEHYGRILAPYLEDLTNFFII 240
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
S+DFCHWGSRF Y H IH+ I LD GM +I++ D F +YL E+ NTICGR
Sbjct: 241 STDFCHWGSRFRYQPLGAAHSAIHEHISWLDHEGMKLIQSQDAVGFHRYLAEHGNTICGR 300
Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
HPI+ F++ + + I + ++ Y QS++ + +DSSVSYASA
Sbjct: 301 HPIATFINAIEASTVCICVTWVEYAQSTKVVSPQDSSVSYASA 343
>gi|328857497|gb|EGG06613.1| hypothetical protein MELLADRAFT_48375 [Melampsora larici-populina
98AG31]
Length = 325
Score = 296 bits (757), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 150/316 (47%), Positives = 194/316 (61%), Gaps = 36/316 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS------------------PEVRGVIAP 46
R A+HAGSWYT + +L E+D WL EA PK+ + VIAP
Sbjct: 4 RSATHAGSWYTSSPNKLRNEIDTWL-EAAHPKAETTEETQDWRKVAMPVPISNCKAVIAP 62
Query: 47 HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
HAGYSYSG AA+A+ IDP+ I RVF+LGPSHH Y CALS Y+TP+GDL +D++
Sbjct: 63 HAGYSYSGSTAAWAYKTIDPSVIKRVFILGPSHHVYLDGCALSGCEAYETPLGDLEIDVQ 122
Query: 107 VIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG 166
V EL +G FE MDI DEAEHS+E+HLPY+ +F+ I IVPIL+G++++E E +YG
Sbjct: 123 VTSELVQSGLFEEMDIETDEAEHSIELHLPYVRHIFKDQTITIVPILIGSISSEKELIYG 182
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-----------------DKKHGVIHKSI 209
+ Y+ P N F +SSDFCHWGSRF+Y +Y I++SI
Sbjct: 183 EVLRPYLLSPENLFVISSDFCHWGSRFSYTYYKDPTQENEPARKLSSSASPPATPIYQSI 242
Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQS 269
EALD+ ++I+E+ F +YL + NTICGRHPI V L L + K+KF Y QS
Sbjct: 243 EALDREALNILESLSHSEFSRYLSQTKNTICGRHPIGVLLGALQTITEPAKLKFTCYAQS 302
Query: 270 SQCKTKRDSSVSYASA 285
SQC + RDSSVSYASA
Sbjct: 303 SQCVSARDSSVSYASA 318
>gi|82802790|gb|ABB92445.1| rcC2orf4 [Homo sapiens]
Length = 295
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/290 (50%), Positives = 189/290 (65%), Gaps = 8/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + ++ +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQVNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYFGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LG SHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGLSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE E S+EMHLPY AK E H I+P+LVGA++ +N LF+KY+ DPSN F V
Sbjct: 127 DEDERSIEMHLPYTAKAMESHKDEFTIIPLLVGALSQKNRN--SELFSKYLADPSNLFVV 184
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF + +YD+ G I +SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 185 SSDFCHWGQRFRHSYYDESQGEICRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 244
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 245 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 294
>gi|395828933|ref|XP_003787616.1| PREDICTED: protein MEMO1 [Otolemur garnettii]
Length = 324
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 145/285 (50%), Positives = 187/285 (65%), Gaps = 8/285 (2%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
GS+Y +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+
Sbjct: 42 TGSYY--QGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSIT 98
Query: 70 SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EH
Sbjct: 99 RRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEH 158
Query: 130 SMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFC 187
S+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFC
Sbjct: 159 SIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFC 218
Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
HWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V
Sbjct: 219 HWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGV 278
Query: 248 FLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 279 LLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 323
>gi|345781967|ref|XP_540144.3| PREDICTED: protein MEMO1 [Canis lupus familiaris]
Length = 332
Score = 294 bits (752), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 58 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 116
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 117 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 176
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 177 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 236
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 237 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 296
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 297 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 331
>gi|148706476|gb|EDL38423.1| RIKEN cDNA 0610016J10, isoform CRA_b [Mus musculus]
Length = 277
Score = 294 bits (752), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 184/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 3 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSVTRRIFILGPSH 61
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 62 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 121
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 122 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 181
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 182 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 241
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 242 NGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 276
>gi|324517153|gb|ADY46739.1| Unknown [Ascaris suum]
Length = 302
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 146/295 (49%), Positives = 193/295 (65%), Gaps = 8/295 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR A+HAGSWY+DN K L +L WL AG + R +I+PHAGYSY G AAYAF
Sbjct: 5 IRNATHAGSWYSDNPKELNRQLSEWLAAAGPRVAGPARAIISPHAGYSYCGSTAAYAFKQ 64
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I P I +F+LGPSH CALS+ + Y+TP+GDL +D V EL+ TG FE MD
Sbjct: 65 IVPDHIETIFVLGPSHVVCLSGCALSSCSRYRTPLGDLYIDQRVNAELRETGAFETMDFR 124
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLI---KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+EAEHS+EM LP++AKV E IVPILVG++++ + YG++FAKYV DP N F
Sbjct: 125 SEEAEHSLEMQLPFIAKVMEKQPAGSYNIVPILVGSLSSSRQISYGKIFAKYVADPRNLF 184
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
+SSDFCHWG+RF++ Y+ G+ IH+ I ALDK GM+ I +P AF +YL + NTI
Sbjct: 185 VISSDFCHWGNRFHFTPYEHNTGLPIHEQIAALDKQGMEAISRLEPSAFNEYLKKTQNTI 244
Query: 240 CGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
CGR+PI V L H + +++FL+Y QS+QC+ DSSVSYA+ A ++
Sbjct: 245 CGRNPICVMLQAAEHFRQMNNHSAELRFLKYSQSNQCRNLSDSSVSYAAGALFIN 299
>gi|355702140|gb|AES01834.1| mediator of cell motility 1 [Mustela putorius furo]
Length = 275
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 2 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 60
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 61 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 120
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 121 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 180
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 181 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 240
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 241 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 275
>gi|229595124|ref|XP_001019799.3| hypothetical protein TTHERM_00138170 [Tetrahymena thermophila]
gi|225566377|gb|EAR99554.3| hypothetical protein TTHERM_00138170 [Tetrahymena thermophila
SB210]
Length = 295
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 139/290 (47%), Positives = 193/290 (66%), Gaps = 5/290 (1%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAFG 62
R ASHA SWYT N K L +EL G+L +A + +V+ +I PHAGY YSG AA+ +
Sbjct: 6 RLASHANSWYTGNQKELDQELSGYLSKANKQQLNIQKVKALIGPHAGYYYSGPTAAWGYQ 65
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ P RVFLLGP HH Y C +S ++TP+GD+ LD + I+ELK TGKF+++
Sbjct: 66 YLQPIENLRVFLLGPCHHIYMNGCGISGLEKFQTPLGDITLDKQTIQELKNTGKFDVVSK 125
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
+E EHS+EMHLPY+ K+ K+VPI+VG ++ + E YG+L +KY DD + F +
Sbjct: 126 KNEEDEHSLEMHLPYIQKILGQQQFKLVPIMVGNLSYQREQEYGQLLSKYFDDDNTVFII 185
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWGSRF+Y +Y+K+ G I +SI+ LD GM++IE D + F KYL + +NTICGR
Sbjct: 186 SSDFCHWGSRFDYQYYNKQDGEIWQSIQKLDTAGMELIENHDCEKFNKYLEQTENTICGR 245
Query: 243 HPISVFLHMLGNCST---KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L ++ + ++K KF+ YEQS Q KT++ SSVSYA++ V
Sbjct: 246 HPIGVLLQVIKSTQKYAGQLKTKFMHYEQSEQVKTQKSSSVSYATSITYV 295
>gi|194220833|ref|XP_001917913.1| PREDICTED: protein MEMO1-like [Equus caballus]
Length = 282
Score = 293 bits (751), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 8 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 66
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 67 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 126
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 127 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 186
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 187 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 246
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 247 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 281
>gi|395507123|ref|XP_003757877.1| PREDICTED: protein MEMO1 [Sarcophilus harrisii]
Length = 299
Score = 293 bits (751), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 143/275 (52%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DPT R+F+LGPSH
Sbjct: 25 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPTITRRIFILGPSH 83
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 84 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 143
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 144 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 203
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 204 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 263
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 264 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 298
>gi|114576871|ref|XP_001164069.1| PREDICTED: protein MEMO1 isoform 2 [Pan troglodytes]
gi|397513836|ref|XP_003827213.1| PREDICTED: protein MEMO1-like [Pan paniscus]
gi|56377671|dbj|BAD74067.1| C2orf4 [Homo sapiens]
Length = 300
Score = 293 bits (751), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 26 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 84
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 85 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 144
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 145 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 204
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 205 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 264
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 265 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 299
>gi|281349207|gb|EFB24791.1| hypothetical protein PANDA_019259 [Ailuropoda melanoleuca]
gi|351701080|gb|EHB03999.1| Protein MEMO1, partial [Heterocephalus glaber]
gi|432096762|gb|ELK27340.1| Protein MEMO1, partial [Myotis davidii]
Length = 277
Score = 293 bits (751), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 3 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 61
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 62 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 121
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 122 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 181
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 182 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 241
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 242 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 276
>gi|109102576|ref|XP_001105478.1| PREDICTED: protein MEMO1-like [Macaca mulatta]
Length = 297
Score = 293 bits (751), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 23 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 81
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 82 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 141
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 142 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 201
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 202 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 261
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 262 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296
>gi|441662627|ref|XP_003262755.2| PREDICTED: protein MEMO1 [Nomascus leucogenys]
Length = 339
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 65 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 123
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 124 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 183
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 184 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 243
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 244 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 303
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 304 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 338
>gi|426226442|ref|XP_004007352.1| PREDICTED: protein MEMO1 [Ovis aries]
Length = 439
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 165 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 223
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 224 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 283
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 284 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 343
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 344 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 403
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 404 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 438
>gi|431911965|gb|ELK14109.1| Protein MEMO1 [Pteropus alecto]
Length = 297
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 141/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 23 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 81
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE +HS+EMHLPY A
Sbjct: 82 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDQHSIEMHLPYTA 141
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 142 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 201
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 202 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 261
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 262 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 296
>gi|403307071|ref|XP_003944034.1| PREDICTED: protein MEMO1 [Saimiri boliviensis boliviensis]
Length = 310
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 36 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 94
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 95 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 154
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 155 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 214
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 215 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 274
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 275 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 309
>gi|300676818|gb|ADK26694.1| mediator of cell motility 1 [Zonotrichia albicollis]
Length = 276
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 140/275 (50%), Positives = 182/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP ++F+LGPSH
Sbjct: 2 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPNITRKIFILGPSH 60
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 61 HVPLSRCALSSVDIYRTPLYDLRIDQKIYAELWKTGMFERMSLQTDEDEHSIEMHLPYTA 120
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 121 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 180
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ +
Sbjct: 181 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAINELQK 240
Query: 257 --TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 241 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 275
>gi|301787491|ref|XP_002929161.1| PREDICTED: protein MEMO1-like [Ailuropoda melanoleuca]
Length = 463
Score = 292 bits (747), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 142/279 (50%), Positives = 185/279 (66%), Gaps = 6/279 (2%)
Query: 16 DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLL 75
++ +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+L
Sbjct: 185 EDRPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFIL 243
Query: 76 GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHL 135
GPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHL
Sbjct: 244 GPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHL 303
Query: 136 PYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRF 193
PY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF
Sbjct: 304 PYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRF 363
Query: 194 NYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM-- 251
Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+
Sbjct: 364 RYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAIT 423
Query: 252 -LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 424 ELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 462
>gi|440908306|gb|ELR58341.1| Protein MEMO1, partial [Bos grunniens mutus]
Length = 277
Score = 292 bits (747), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 141/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 3 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 61
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 62 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 121
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ +PSN F VSSDFCHWG RF Y +
Sbjct: 122 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLAEPSNLFVVSSDFCHWGQRFRYSY 181
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 182 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 241
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 242 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 276
>gi|395334280|gb|EJF66656.1| hypothetical protein DICSQDRAFT_95267 [Dichomitus squalens LYAD-421
SS1]
Length = 326
Score = 292 bits (747), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 160/319 (50%), Positives = 190/319 (59%), Gaps = 38/319 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLRE-------AGLPKSPEVRGVIAPHAGYSYSGRAA 57
R+A+HA SWYT N +L +EL GWL E A P P + +IAPHAGYSYSGRAA
Sbjct: 4 RQATHADSWYTGNGSKLDQELSGWLGEVKPASEDAFNPPVPGTKAIIAPHAGYSYSGRAA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +ID T I RVF+LGPSHH Y CAL+ Y+TPIG LP+D+E +ELK TG+F
Sbjct: 64 AWAYKSIDTTGIKRVFVLGPSHHVYLDGCALTRCAQYETPIGALPIDVETTKELKQTGQF 123
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
M + DE EHS+EMHLPY+ K FEG IKIVPILVGA+N + E YG L A Y+
Sbjct: 124 SDMSLQTDEDEHSIEMHLPYIRKTFEGLDIKIVPILVGAINYDKEVSYGHLLAPYLARED 183
Query: 178 NFFSVSSDFCHWGSRFNYMHY----------------------DKKHGVIHKSIEALDKM 215
F VSSDFCHWG RF Y Y K IH SI ALD
Sbjct: 184 TIFVVSSDFCHWGLRFQYTFYYPQPPPSSAAPIRLSRSETPPASLKTHPIHASISALDHE 243
Query: 216 GMDIIETGDPDA------FKKYLLEYDNTICGRHPISVF---LHMLGNCSTKIKIKFLRY 266
GM+++ A F YL NTICGRHPI V L L T K+K++RY
Sbjct: 244 GMELLTLPPFTASQAHRTFADYLARTKNTICGRHPIGVLFGALAALEQQGTSPKLKWVRY 303
Query: 267 EQSSQCKTKRDSSVSYASA 285
EQSS C+T +DSSVSYASA
Sbjct: 304 EQSSACETVKDSSVSYASA 322
>gi|312084389|ref|XP_003144256.1| hypothetical protein LOAG_08678 [Loa loa]
gi|307760580|gb|EFO19814.1| hypothetical protein LOAG_08678 [Loa loa]
Length = 305
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 147/297 (49%), Positives = 198/297 (66%), Gaps = 12/297 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R ASHAGSWYT++ ++L EL WL A + R +I+PHAGYSYSGR AA+AF
Sbjct: 8 VRNASHAGSWYTNDPRKLHRELTEWLGAADSRHAQSARAIISPHAGYSYSGRIAAFAFKQ 67
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I P ++S +F+LGPSH CALST + Y+TPIGDL +D ELK TG F LMD+
Sbjct: 68 IIPETVSTIFVLGPSHVMSLDTCALSTCSRYRTPIGDLQIDQRTNTELKETGAFSLMDLR 127
Query: 124 VDEAEHSMEMHLPYLAKVFE---GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+EAEHS+EM LPY+AK+ E + IVP+LVG+++ +A YG++F+KY+ DP F
Sbjct: 128 SEEAEHSIEMQLPYIAKIMEKQSANGYSIVPVLVGSLSPSKQASYGKIFSKYLSDPKIVF 187
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
VSSDFCHWGSRF++M +D GV I++ I A+DK GMD I + +P AF +YL NTI
Sbjct: 188 VVSSDFCHWGSRFHFMPHDNTTGVPIYEQIAAMDKQGMDAISSLNPVAFGEYLKRTQNTI 247
Query: 240 CGRHPISVFLH------MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
CGR+PISV L+ + N + FL+Y QS+Q ++ DSSVSYA+ A ++
Sbjct: 248 CGRNPISVILYAAEYFRQMNN--HLAEFVFLKYAQSNQSRSLNDSSVSYAAGALFIN 302
>gi|443898721|dbj|GAC76055.1| predicted dioxygenase [Pseudozyma antarctica T-34]
Length = 347
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 164/336 (48%), Positives = 197/336 (58%), Gaps = 54/336 (16%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL-------------------REAGLPKS-PEVRGV 43
IR+ASHAGSWYT++ L E L WL R LP P R +
Sbjct: 8 IRQASHAGSWYTNDRAELDESLTEWLSDVKADQLPSPLAVCDHKPRSGSLPLPIPTCRAI 67
Query: 44 IAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
I PHAGY+YSG AAAYA+ ID +I RVF+LGPSHH Y CA+S + Y TP+G+L +
Sbjct: 68 IGPHAGYAYSGPAAAYAYKCIDIAAIERVFILGPSHHVYLDGCAVSRCSQYATPLGNLEV 127
Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA 163
D +V EELKATGKFE M DE EHSMEMHLPY+ KVFEG I+IVPILVGA+N + E
Sbjct: 128 DRQVTEELKATGKFETMTQSQDEDEHSMEMHLPYIYKVFEGRPIRIVPILVGAINTKTED 187
Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH------------------GVI 205
+YG+L A Y++D NFF VSSDFCHWG+RF Y +Y I
Sbjct: 188 VYGKLLAPYLNDSRNFFVVSSDFCHWGTRFRYTYYKPAEDSPATMLSARTPRSAYEGAPI 247
Query: 206 HKSIEALDKMGMDII-----ETGDP-------DAFKKYLLEYDNTICGRHPISVFLHMLG 253
H+SI ALD+ G+ I + G+ AF +YL NT+CGRHPI V L L
Sbjct: 248 HQSIRALDEQGIFAITYPWTQGGESKTAAEARKAFAEYLSSTKNTVCGRHPIGVLLAALA 307
Query: 254 NCSTKIKIK----FLRYEQSSQCKTKRDSSVSYASA 285
IK F RYEQSSQC + +DSSVSYASA
Sbjct: 308 ELEKAAGIKSECRFTRYEQSSQCTSPQDSSVSYASA 343
>gi|326915459|ref|XP_003204035.1| PREDICTED: protein MEMO1-like [Meleagris gallopavo]
Length = 290
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 140/275 (50%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP ++F+LGPSH
Sbjct: 16 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPNITRKIFILGPSH 74
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 75 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 134
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 135 KAMESHKDEFTIIPVLVGALSEAKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 194
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
YD+ G I++SIE LDKMGM+IIE DP +F YL +Y NTICGRHPI V L+ +
Sbjct: 195 YDESQGEIYRSIEHLDKMGMNIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAINELQK 254
Query: 257 --TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 255 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 289
>gi|449277723|gb|EMC85796.1| Protein MEMO1 [Columba livia]
Length = 287
Score = 291 bits (745), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 141/275 (51%), Positives = 182/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP +VF+LGPSH
Sbjct: 13 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPNITRKVFILGPSH 71
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 72 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 131
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 132 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 191
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ +
Sbjct: 192 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAINELQK 251
Query: 257 --TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 252 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 286
>gi|354480709|ref|XP_003502547.1| PREDICTED: protein MEMO1-like [Cricetulus griseus]
Length = 428
Score = 291 bits (745), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 183/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP R+F+LGPSH
Sbjct: 154 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPAITRRIFILGPSH 212
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 213 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 272
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 273 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 332
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGN 254
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ L
Sbjct: 333 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQK 392
Query: 255 CSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 393 NGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 427
>gi|449496796|ref|XP_002189221.2| PREDICTED: protein MEMO1 [Taeniopygia guttata]
Length = 360
Score = 290 bits (742), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 140/275 (50%), Positives = 182/275 (66%), Gaps = 6/275 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP ++F+LGPSH
Sbjct: 86 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPNITRKIFILGPSH 144
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 145 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 204
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 205 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 264
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+ +
Sbjct: 265 YDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAINELQK 324
Query: 257 --TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 325 NGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 359
>gi|393218450|gb|EJD03938.1| UPF0103-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 391
Score = 290 bits (741), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 165/339 (48%), Positives = 198/339 (58%), Gaps = 56/339 (16%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA--------GLPKSPEVRGVIAPHAGYSY 52
+ I RA+HAGSWYTD ++RL +EL WL +A LP + +IAPHAGYSY
Sbjct: 51 LNTITRATHAGSWYTDRAQRLNDELSTWLNDAQPDAEDGFALPVK-GCKAIIAPHAGYSY 109
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
SG AA +A+ +ID T I RVF+LGPSHH Y CALS Y TPIG+LPLD+E IEEL+
Sbjct: 110 SGPAAGWAYKSIDTTGIKRVFILGPSHHVYLDCCALSRFQEYATPIGNLPLDIETIEELR 169
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
TG FE M DE EHS+EMHLPY+ K FEG IKIVPILVGA++ E+E +G++ A Y
Sbjct: 170 KTGSFEDMRSDTDEDEHSIEMHLPYVRKAFEGVDIKIVPILVGAISKESEETFGKILAPY 229
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY----------DKKHGV----------------IH 206
F VSSDFCHWG+RF Y Y + G+ IH
Sbjct: 230 FTREDTFCVVSSDFCHWGTRFQYTFYYPSLPPTSGTVGESGIRLSRTNISSSLSPDHEIH 289
Query: 207 KSIEALDKMGMDIIETGDP-------DAFKKYLLEYDNTICGRHPISVFLHMLGNCST-- 257
+SI ALD MDI+ T P DAF YL NTICGRHPI V L L +
Sbjct: 290 ESISALDHQAMDIL-TLPPSTAQESHDAFSVYLARTKNTICGRHPIGVLLGALAVLESAV 348
Query: 258 -----------KIKIKFLRYEQSSQCKTKRDSSVSYASA 285
K +K++RYEQSSQC+T DSSVSYASA
Sbjct: 349 TESEVPNGEPRKATVKWVRYEQSSQCRTISDSSVSYASA 387
>gi|302697823|ref|XP_003038590.1| hypothetical protein SCHCODRAFT_47866 [Schizophyllum commune H4-8]
gi|300112287|gb|EFJ03688.1| hypothetical protein SCHCODRAFT_47866, partial [Schizophyllum
commune H4-8]
Length = 317
Score = 289 bits (740), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 160/317 (50%), Positives = 194/317 (61%), Gaps = 36/317 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL-----REAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
R ASHAGSWYT++ L E LD WL E P + VIAPHAGYSYSG AAA+
Sbjct: 1 RAASHAGSWYTNDGDELNERLDAWLAAVEPTEDAYPIKGS-KAVIAPHAGYSYSGPAAAW 59
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ +ID T I RVF+LGPSHH Y CALS Y TPIG+LPLDL+ I+EL TGKFE
Sbjct: 60 AYKSIDTTGIKRVFVLGPSHHIYLDGCALSRCKEYATPIGNLPLDLDAIKELGNTGKFEY 119
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+DI DEAEHS+EMHLPY+ K+F+G I+IVPI+VGA+N E +G L A Y+ P
Sbjct: 120 LDINDDEAEHSLEMHLPYVRKIFQGFRDIRIVPIMVGAINERTEKEFGALLAPYLAQPDT 179
Query: 179 FFSVSSDFCHWGSRFNYMHY-----------------DKKHGVIHKSIEALDKMGMDII- 220
F VSSDFCHWG+RF Y Y + IH+SI LD+ MDI+
Sbjct: 180 FCVVSSDFCHWGTRFQYTFYYPSPTAESGLRLSRSEPPSQDHPIHESITRLDREAMDILT 239
Query: 221 ------ETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC----STKIKIKFLRYEQSS 270
T P+ F +YL NTICGRHPI V + T+ +K++RYE SS
Sbjct: 240 IPPGKASTAHPE-FARYLSRTKNTICGRHPIGVLFGAIAKLEETQGTEATVKWVRYEHSS 298
Query: 271 QCKTKRDSSVSYASAAA 287
+C+T RDSSVSYASA A
Sbjct: 299 ECETIRDSSVSYASAWA 315
>gi|405968067|gb|EKC33170.1| Protein MEMO1 [Crassostrea gigas]
Length = 384
Score = 289 bits (740), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 146/280 (52%), Positives = 187/280 (66%), Gaps = 11/280 (3%)
Query: 16 DNSKR-LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFL 74
D +R L+ +LD WL +A SP R +IAPHAGY Y G +A+ IDP++I R+F+
Sbjct: 103 DQKRRELSSQLDEWLSKAQPSYSP-ARAIIAPHAGYVYCGACGGHAYRQIDPSNIKRIFI 161
Query: 75 LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMH 134
LGPSHH CAL+ + Y+TP+ DL +D ++ EEL AT FE M + DE EHS+EMH
Sbjct: 162 LGPSHHVRLSGCALTETSHYQTPLYDLTIDQKINEELFATKAFEKMKMSTDEDEHSIEMH 221
Query: 135 LPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSR 192
LPY+AKV E I+P+LVG+++A+ E +YG +F++Y+ DP NFF +SSDFCHWG R
Sbjct: 222 LPYIAKVMERRRGQFTIIPVLVGSLSADKEKLYGSIFSQYLADPENFFVISSDFCHWGQR 281
Query: 193 FNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
F Y YDK G I +SIEALDKMGMD IE DP F YL EY+NTICGR+PI V L +
Sbjct: 282 FRYTFYDKSCGDIWQSIEALDKMGMDAIEQMDPAKFSHYLKEYENTICGRYPIGVLLSAI 341
Query: 253 ------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
GN ++ KF+ Y QSS+CK DSSVSYAS A
Sbjct: 342 DALRRNGN-GRRMSFKFMNYAQSSRCKKMSDSSVSYASGA 380
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 52/104 (50%), Positives = 72/104 (69%), Gaps = 1/104 (0%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M K+RRA+HAGSWYT++ + L+ +LD WL +A SP R +IAPHAGY Y G +A
Sbjct: 1 MSKVRRATHAGSWYTNDGRELSSQLDEWLSKAQPSYSP-ARAIIAPHAGYVYCGACGGHA 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD 104
+ IDP++I R+F+LGPSHH CAL+ + Y+TP+ DL +D
Sbjct: 60 YRQIDPSNIKRIFILGPSHHVRLSGCALTETSHYQTPLYDLTID 103
>gi|343429785|emb|CBQ73357.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 344
Score = 289 bits (740), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 161/338 (47%), Positives = 193/338 (57%), Gaps = 53/338 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLRE---AGLP-------KSPEV---------- 40
M R ASHAGSWYTD++ L L WL + LP +SP
Sbjct: 1 MAPTRTASHAGSWYTDDADELDAALSSWLEAVTPSSLPAPLSICERSPTTEALPLPVAGS 60
Query: 41 RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
R +I PHAGYSYSG AAAYA+ ID T IS VF+LGPSHH Y CA+S Y TP+G
Sbjct: 61 RAIIGPHAGYSYSGPAAAYAYRTIDTTRISTVFILGPSHHVYLDGCAVSACAQYDTPLGS 120
Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
L +D ++ E+LKATGKF +M + DE EHS+EMHLPY+ KVFEG ++IVPILVGA+N
Sbjct: 121 LQIDRKITEDLKATGKFSVMSLAEDEDEHSIEMHLPYVFKVFEGRDVRIVPILVGAINMS 180
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------- 204
E YGRL A Y+ DPS FF VSSDFCHWG+RF Y HY
Sbjct: 181 KENEYGRLLAPYLRDPSTFFVVSSDFCHWGTRFRYTHYTPSGSSVPTHLSTRSPASLFEA 240
Query: 205 --IHKSIEALDKMGMDIIE---TGDPD---------AFKKYLLEYDNTICGRHPISVFLH 250
IH+SI LD+ G+ I T D AF YL NT+CGRHPI V L
Sbjct: 241 KPIHQSIRELDQAGILAITYPWTHDTQPKSAEHARLAFAHYLASTKNTVCGRHPIGVLLA 300
Query: 251 MLGNCST---KIKIKFLRYEQSSQCKTKRDSSVSYASA 285
L + + + +F RYEQSS+C +DSSVSYASA
Sbjct: 301 ALAELESNGVQSECRFTRYEQSSECVRVQDSSVSYASA 338
>gi|325184143|emb|CCA18601.1| memolike protein putative [Albugo laibachii Nc14]
gi|325186056|emb|CCA20558.1| memolike protein putative [Albugo laibachii Nc14]
Length = 302
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 139/290 (47%), Positives = 193/290 (66%), Gaps = 7/290 (2%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEVRGVIAPHAGYSYSGRAA 57
+IR+A+HA +WY D+ +L +LD WL +A +S V G+IAPHAG+ +SG A
Sbjct: 4 RIRKATHAATWYIDDPSKLGPQLDAWLEDAVHEGKDARESKRVNGIIAPHAGFRFSGSTA 63
Query: 58 AYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A+A+ ++ + T I RVF+LGPSHH Y CAL++A+ Y+TP G LP+D E+ E L TGK
Sbjct: 64 AHAYCHLLERTDIKRVFVLGPSHHVYLEGCALTSASHYETPFGMLPVDEEINEILMKTGK 123
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
F M + VDEAEHS+EMHLP++A+ +G + +VPILVG N N YGRL A++++D
Sbjct: 124 FRRMSMSVDEAEHSIEMHLPFIARTLKGQSLSLVPILVGNTNQNNNLEYGRLLAQFMNDQ 183
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
SNFF +SSDFCHWG+RF Y +D +G IH I+ LD + ++E + F YL
Sbjct: 184 SNFFVISSDFCHWGARFRYQPHDASYGEIHDYIKHLDHEAIKLLEDLNATGFATYLESTK 243
Query: 237 NTICGRHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
NTICG+HPIS+ + +L + I+F++Y QS CK K +SSVSYASA
Sbjct: 244 NTICGQHPISIIMQAVLALDGLQPAIRFVKYAQSGACKKKSESSVSYASA 293
>gi|388853823|emb|CCF52544.1| uncharacterized protein [Ustilago hordei]
Length = 353
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 158/336 (47%), Positives = 194/336 (57%), Gaps = 55/336 (16%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--------------------EVRGVI 44
R+ASHAGSWY+DN K L L WL + + P R +I
Sbjct: 12 RQASHAGSWYSDNQKDLDASLSEWLEDVDVKSLPTPLSVCEHKPSAESLPLPITNCRALI 71
Query: 45 APHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD 104
PHAGYSYSG AAAYA+ ID + I R+F+LGPSHH Y CA+S Y+TP+G+L +D
Sbjct: 72 GPHAGYSYSGPAAAYAYRCIDNSQIERIFILGPSHHVYLDGCAVSGCEEYETPLGNLKID 131
Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM 164
++ E LKATGKF M DE EHS+EMHLPY+ K+F+G I+IVPILVGA+N E
Sbjct: 132 RQITEALKATGKFGTMTQSQDEDEHSIEMHLPYIYKMFKGKSIQIVPILVGAINTSKEDA 191
Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK------------------KHGVIH 206
YG+L A Y+ DP NFF VSSDFCHWGSRF+Y +Y + IH
Sbjct: 192 YGKLLAPYLKDPRNFFVVSSDFCHWGSRFSYTYYRPAGSSLATTLSNRSPRSAYEQPPIH 251
Query: 207 KSIEALDKMGMDII--------ETGDPD-----AFKKYLLEYDNTICGRHPISVFLHMLG 253
+SI LD+ G++ I +TG AF +YL NT+CGRHPI V L L
Sbjct: 252 QSIRELDESGIEAITHPWTKGGKTGKSAEEARLAFAEYLSSTKNTVCGRHPIGVLLAALA 311
Query: 254 NCSTKIKI----KFLRYEQSSQCKTKRDSSVSYASA 285
KI +F RYEQSSQC T +DSSVSYASA
Sbjct: 312 ELEKTDKIQTECRFTRYEQSSQCLTPQDSSVSYASA 347
>gi|291222620|ref|XP_002731317.1| PREDICTED: C21orf19-like protein-like [Saccoglossus kowalevskii]
Length = 255
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 140/250 (56%), Positives = 176/250 (70%), Gaps = 5/250 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAFG 62
RRA+HAGSWYT + K L+ ELD WL+ A L SP R +IAPHAGY+Y G+ A+A+
Sbjct: 6 RRATHAGSWYTRSGKDLSAELDQWLQGAPLDSHYSP-ARAIIAPHAGYAYCGKCGAHAYK 64
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+DPT++ +VF+LGPSHH CALS Y TP+ +LP+D +V EL +TG+FE M I
Sbjct: 65 QVDPTNVKKVFILGPSHHVRLSGCALSRTVTYDTPLYNLPIDTQVYGELMSTGQFEKMSI 124
Query: 123 CVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
DE EHS+EMHLPY+AKV E IVPILVGA++ + E YGR+ AKY++DP N F
Sbjct: 125 ETDEDEHSIEMHLPYVAKVMESKRDQFTIVPILVGALSTDKEQKYGRILAKYLEDPHNLF 184
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF+Y +YD+ G I+KSI+ LD GM+ IE DP F KYL Y NTIC
Sbjct: 185 VISSDFCHWGERFHYTYYDESCGEIYKSIDVLDHKGMNHIENLDPSGFSKYLKMYGNTIC 244
Query: 241 GRHPISVFLH 250
GRHPI VFL+
Sbjct: 245 GRHPIGVFLN 254
>gi|115497104|ref|NP_001069863.1| protein MEMO1 [Bos taurus]
gi|118572779|sp|Q2HJH7.1|MEMO1_BOVIN RecName: Full=Protein MEMO1; AltName: Full=Mediator of ErbB2-driven
cell motility 1; Short=Memo-1
gi|88682865|gb|AAI05374.1| Mediator of cell motility 1 [Bos taurus]
Length = 285
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 136/269 (50%), Positives = 181/269 (67%), Gaps = 4/269 (1%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ +PSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLAEPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPISVFLHMLGNCSTK-IKIKFLRYEQSS 270
HPI V L+ + + + F ++Q+S
Sbjct: 247 HPIGVLLNAITELQKNGMNMSFSFFQQAS 275
>gi|157104420|ref|XP_001648401.1| hypothetical protein AaeL_AAEL014294 [Aedes aegypti]
gi|108869206|gb|EAT33431.1| AAEL014294-PA, partial [Aedes aegypti]
Length = 276
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 142/271 (52%), Positives = 183/271 (67%), Gaps = 6/271 (2%)
Query: 21 LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHH 80
L +L WL +A L P R +IAPHAGY Y G A+A+ I P + RVF+LGPSHH
Sbjct: 3 LNRQLSKWLDDADLTFGP-ARAIIAPHAGYRYCGACGAWAYRQISPAVVKRVFILGPSHH 61
Query: 81 YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAK 140
+CALS A +TP+ DL +D EV EL+ATG F+ MD DE EHS+EMHLPY+AK
Sbjct: 62 VRLSRCALSAAQYCRTPLYDLKVDQEVNAELEATGHFKWMDQKTDEDEHSIEMHLPYVAK 121
Query: 141 VFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
V E IVPI+VG+++ + E YG++ A Y+ DP N F +SSDFCHWG+RF Y +Y
Sbjct: 122 VMEDFRDQFTIVPIMVGSISNDWEETYGKILAPYLADPQNLFVISSDFCHWGARFRYTYY 181
Query: 199 DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNC 255
+ HG I+K IE LDKMGMD+IET P++F +YL +Y+NTICGRHPI V L L
Sbjct: 182 EDSHGPIYKWIEVLDKMGMDLIETLKPESFGEYLRKYNNTICGRHPIGVLLQSVEELKRR 241
Query: 256 STKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
++ KFL+Y+QS+QC K+DSSVSYAS +
Sbjct: 242 GYRMSFKFLKYDQSNQCCDKKDSSVSYASGS 272
>gi|225708900|gb|ACO10296.1| MEMO1 [Caligus rogercresseyi]
Length = 302
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 145/294 (49%), Positives = 196/294 (66%), Gaps = 9/294 (3%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYA 60
E+ R ASHAGSWY+ + + L E+ GWL ++ R +I PHAGY YSG AAYA
Sbjct: 5 EQRRCASHAGSWYSGDERVLQREMSGWLEAVKTDSETFPARAIIGPHAGYRYSGPTAAYA 64
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFEL 119
+ I+P I RVF+LGPSHH C LS +VY+TP+ DL +D E+ +EL ++ G F+
Sbjct: 65 YKQINPEGIKRVFILGPSHHIRLNGCLLSGCSVYETPLYDLVVDQELNKELMESKGGFDK 124
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
+ DE EHS+E+HLPY+AK E +VPILVG+++ + E YG++ AKY+ DPS
Sbjct: 125 ATLQADEEEHSIELHLPYIAKAMESRKGQFTVVPILVGSLSPDKEYKYGKILAKYLMDPS 184
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
N F +SSDFCHWG RF+Y +YD++ G IH+SI LD GM++IE+ + DAF YL + N
Sbjct: 185 NLFVISSDFCHWGERFDYTYYDEEAGEIHESISKLDHKGMELIESLNHDAFAAYLKKTSN 244
Query: 238 TICGRHPISVFLHMLGNC-----STKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
TICGRHPI VFL M+ ++++KFL+Y QS +C+ DSSVSYASAA
Sbjct: 245 TICGRHPIGVFLGMVKAILQHGEPPRMQLKFLKYAQSDKCRKIADSSVSYASAA 298
>gi|301122825|ref|XP_002909139.1| memo-like protein [Phytophthora infestans T30-4]
gi|262099901|gb|EEY57953.1| memo-like protein [Phytophthora infestans T30-4]
Length = 305
Score = 286 bits (733), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 143/289 (49%), Positives = 191/289 (66%), Gaps = 6/289 (2%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
++R A+HAGSWY+ + +L EEL GWL A K P +R +IAPHAG+ YSG AA
Sbjct: 11 RVRLATHAGSWYSHDEHKLEEELSGWLGAAEGNDTQEKQPSIRAIIAPHAGFRYSGPTAA 70
Query: 59 YAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
YA+ + + I RVF+LGPSHH+Y CA+STA Y+TP+G++ +D EV E+L +GKF
Sbjct: 71 YAYHQVLNLDRIKRVFILGPSHHFYLRGCAVSTAHEYETPLGNISIDHEVNEKLVDSGKF 130
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
M + VDE EHS+EMHLP++ KV G VPILVG ++ + YG++ A Y+++
Sbjct: 131 ATMSMDVDEDEHSIEMHLPFIFKVMNGRKFTAVPILVGNTKSKTDEEYGKILAPYLENDE 190
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
N F +SSDFCHWG RF Y +D +G IH+ I+ LD GM IE D F +YL E +N
Sbjct: 191 NLFVISSDFCHWGPRFRYQPHDSTYGEIHEYIKYLDHQGMGFIERLDAAGFTRYLDETNN 250
Query: 238 TICGRHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
TICGRHPIS+ L+ +L + K +KF++Y QSS C DSSVSYASA
Sbjct: 251 TICGRHPISLLLNTILASKKLKCTLKFVKYAQSSACMRHGDSSVSYASA 299
>gi|241122968|ref|XP_002403743.1| protein MEMO1, putative [Ixodes scapularis]
gi|215493518|gb|EEC03159.1| protein MEMO1, putative [Ixodes scapularis]
Length = 297
Score = 286 bits (732), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 142/291 (48%), Positives = 186/291 (63%), Gaps = 6/291 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ASHAGSWYTD+ + L +L+ WL AG+P R VIAPHAGY Y G AAYA+ +
Sbjct: 8 RKASHAGSWYTDSPRELKYQLENWLSAAGMPSFGPARAVIAPHAGYQYCGACAAYAYKQV 67
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP + F + ++ A + + + + EEL TG FE + + V
Sbjct: 68 DPDHVRFFFFVMTLDLLLRCLLLVNNANRQSAEYDAVSV-VPIYEELFETGAFEEVSLHV 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
DE EHS+EMHLPY+AKV E IVPI+VG++ ENEA YGRL +KY+ +P N F +SS
Sbjct: 127 DENEHSLEMHLPYIAKVMEDREFTIVPIIVGSLTPENEAFYGRLLSKYLAEPDNLFVISS 186
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWG+RF+Y +DK G IH+SIE LDK GM+IIE P AF YL +Y NTICGRHP
Sbjct: 187 DFCHWGARFHYQFHDKSWGNIHQSIERLDKQGMNIIEELCPTAFTAYLKKYGNTICGRHP 246
Query: 245 ISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
I V L+ + + ++ +KFL+Y QSSQC + DSSVSYASAA +++
Sbjct: 247 IGVLLNAVDTLQSSGNGHRMALKFLKYAQSSQCMSTSDSSVSYASAALRLE 297
>gi|71017747|ref|XP_759104.1| hypothetical protein UM02957.1 [Ustilago maydis 521]
gi|46098896|gb|EAK84129.1| hypothetical protein UM02957.1 [Ustilago maydis 521]
Length = 346
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 158/342 (46%), Positives = 198/342 (57%), Gaps = 58/342 (16%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------------------GLP 35
M R+A+HAGSWYTD+ + L E L WL + LP
Sbjct: 1 MALTRKATHAGSWYTDDPRELDECLTEWLSDVQPDNLLSPLSVCSDSSCSSSSSESLALP 60
Query: 36 KSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYK 95
S R +I PHAGYSYSG AAAYA+ ID ++I VF+LGPSHH Y CA+S + Y+
Sbjct: 61 ISSN-RAIIGPHAGYSYSGPAAAYAYRTIDTSAIKTVFILGPSHHVYLDGCAVSACSSYE 119
Query: 96 TPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
TP+G+LP++ V EL +TG+F M DE EHS+EMHLPY+ KVF+G I+IVPILVG
Sbjct: 120 TPLGNLPINRSVTHELLSTGRFSTMSKTEDEDEHSIEMHLPYIYKVFKGTGIQIVPILVG 179
Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK--------------- 200
A+N E +G+L AKY++DP NFF VSSDFCHWGSRF Y +Y
Sbjct: 180 AINTARENEFGKLLAKYLNDPENFFVVSSDFCHWGSRFRYTYYKPCGSNIAMNLTSRSSR 239
Query: 201 ---KHGVIHKSIEALDKMGM---------DIIETGDPD--AFKKYLLEYDNTICGRHPIS 246
+ IH+SI LD+ G+ D +T + AF KYL E NT+CGRHPI
Sbjct: 240 SMFEGKPIHQSIRELDEAGILAITYPWSRDRQKTAEDARLAFAKYLSETKNTVCGRHPIG 299
Query: 247 VFLHMLGNC---STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
V L L K + +F RYEQSSQC T ++SSVSYASA
Sbjct: 300 VLLAALAELERRGQKTECRFTRYEQSSQCTTPQESSVSYASA 341
>gi|331236660|ref|XP_003330988.1| hypothetical protein PGTG_12951 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309309978|gb|EFP86569.1| hypothetical protein PGTG_12951 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 335
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 195/332 (58%), Gaps = 45/332 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----------------EVRGVIAPH 47
R A+HAGSWY + RL ++L GWL +A + P + R +IAPH
Sbjct: 4 RSATHAGSWYPASKTRLEDDLKGWLSDAHANEQPVNHAQAWKSVCMPVPIKDCRAIIAPH 63
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
AGYSYSGRAAA+A+ ID I RVF+LGPSHH Y CALS T Y TP+GDLP+D +
Sbjct: 64 AGYSYSGRAAAWAYKLIDTDQIKRVFILGPSHHVYLDCCALSKCTQYATPLGDLPIDTSI 123
Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
EELKA +F M + D+ EHS+E+HLPY+ +F+ H IKIVPILVG++ E YG
Sbjct: 124 NEELKAKNRFGEMSLQTDQEEHSIELHLPYIRHIFKNHDIKIVPILVGSITFAEELEYGA 183
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV----------------IHKSIEA 211
L A Y+ DP N F VSSDFCHWG+RF+Y +Y IH+SIE
Sbjct: 184 LLAPYLADPENLFVVSSDFCHWGTRFSYTYYQDPRSTEPAKRLSPGSPPPEYPIHQSIEN 243
Query: 212 LDKMGMDIIETGDP----DAFKKYLLEYDNTICGRHPISVFL-HMLGNCSTK-------I 259
LDK ++ I DP + FK YL NTICGRHPI V L M+ T+
Sbjct: 244 LDKEAIEAISLQDPSQAHETFKGYLKRTKNTICGRHPIGVLLGSMIKLQQTRSTPPPPQQ 303
Query: 260 KIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
+++ +RYEQSSQC T DSSVSY SA ++ A
Sbjct: 304 QLQLVRYEQSSQCFTLSDSSVSYVSAFLRLPA 335
>gi|301089179|ref|XP_002894922.1| memo-like protein [Phytophthora infestans T30-4]
gi|262105088|gb|EEY63140.1| memo-like protein [Phytophthora infestans T30-4]
Length = 305
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 142/289 (49%), Positives = 191/289 (66%), Gaps = 6/289 (2%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
++R A+HAGSWY+ + +L EEL GWL A K P +R +IAPHAG+ YSG AA
Sbjct: 11 RVRLATHAGSWYSHDEHKLEEELSGWLGAAEGNDTQEKQPSIRAIIAPHAGFRYSGPTAA 70
Query: 59 YAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
YA+ + + I RVF+LGPSHH+Y CA+STA Y+TP+G++ +D EV E+L +GKF
Sbjct: 71 YAYHQVLNLDRIKRVFILGPSHHFYLRGCAVSTAHEYETPLGNISIDHEVNEKLVDSGKF 130
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
M + VDE EHS+EMHLP++ KV G VPILVG ++ + YG++ A Y+++
Sbjct: 131 ATMSMDVDEDEHSIEMHLPFIFKVMNGRKFTAVPILVGNTKSKTDEEYGKILAPYLENDE 190
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
N F +SSDFCHWG RF Y +D +G IH+ I+ LD GM IE D F +YL + +N
Sbjct: 191 NLFVISSDFCHWGPRFRYQPHDSTYGEIHEYIKYLDHQGMGFIERLDAAGFTRYLDKTNN 250
Query: 238 TICGRHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
TICGRHPIS+ L+ +L + K +KF++Y QSS C DSSVSYASA
Sbjct: 251 TICGRHPISLLLNTILASKKLKCTLKFVKYAQSSACMRHGDSSVSYASA 299
>gi|195997049|ref|XP_002108393.1| hypothetical protein TRIADDRAFT_37128 [Trichoplax adhaerens]
gi|190589169|gb|EDV29191.1| hypothetical protein TRIADDRAFT_37128 [Trichoplax adhaerens]
Length = 297
Score = 285 bits (728), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 147/289 (50%), Positives = 192/289 (66%), Gaps = 8/289 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+A+HAGSWY+ + L+++L GWL + +P + +IAPHAGY+Y G AAYA+ I
Sbjct: 4 RQATHAGSWYSSSGTELSKQLGGWLSKVANGNTP-AKAIIAPHAGYAYCGACAAYAYKQI 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP I R+F+LGPSHHYY KCA+S + TP+ L +D E+ +L TGKF+ M V
Sbjct: 63 DPRGIKRIFILGPSHHYYLTKCAISMTSQCNTPLYPLQVDTEINNQLLKTGKFDKMSKSV 122
Query: 125 DEAEHSMEMHLPYLAKVFEGHL---IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EMHLPY+A+V IVPILVGA+ E MYG +F++Y+ DP N F
Sbjct: 123 DEDEHSIEMHLPYIAQVMSSRSPDSFTIVPILVGALGYREELMYGEIFSQYLVDPENLFV 182
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF + +Y+K H I SIEALD+ GMD+IE DP F++YL +Y NTICG
Sbjct: 183 ISSDFCHWGKRFRFTYYEKSHKHIFSSIEALDRKGMDLIEQLDPAGFREYLDKYQNTICG 242
Query: 242 RHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
RHPI V L + K +K++FL Y QSSQC + DSSVSYA+
Sbjct: 243 RHPIGVLLQAIDAVRKKQRSNMKMRFLNYAQSSQCLSANDSSVSYAAGV 291
>gi|353237123|emb|CCA69103.1| hypothetical protein PIIN_03003 [Piriformospora indica DSM 11827]
Length = 323
Score = 283 bits (723), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 194/317 (61%), Gaps = 36/317 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVR---GVIAPHAGYSYSGRAAAY 59
R+ASHA SWY+ ++++L +LD W+ E+ SP +R +I PHAGYSYSG AA+
Sbjct: 3 RQASHADSWYSGDARKLNSQLDQWINAVESSETYSPPIRQSKAIIGPHAGYSYSGPNAAW 62
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ +ID T I RVF+LGPSHH Y CALS Y TP+G LPLD E IEEL TGKFE
Sbjct: 63 AYKSIDTTGIKRVFVLGPSHHVYLRSCALSACETYATPLGSLPLDRETIEELHNTGKFET 122
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
M + DE EHS+E+HLPY+ K+F+G IKIVPILVGA++ + EA YG+L A Y F
Sbjct: 123 MALSTDEDEHSIELHLPYVRKIFQGMDIKIVPILVGALDKQGEATYGKLLAPYFAREDTF 182
Query: 180 FSVSSDFCHWGSRFNYMHY-------------------DKKHGVIHKSIEALDKMGMDII 220
VSSDFCHWG RF+Y Y K IH+SI ALDK MDI+
Sbjct: 183 TVVSSDFCHWGRRFSYTFYYPSPPPCEGIKLSRSTPSSTFKSFPIHESITALDKEAMDIL 242
Query: 221 ETG-----DPD----AFKKYLLEYDNTICGRHPISVFLHMLGNCSTK---IKIKFLRYEQ 268
P+ F YL NTICGRHPI V L L + + + + I++++Y+
Sbjct: 243 TLPRESDVSPNEVHTQFADYLARTRNTICGRHPIGVLLGALSSVAEQGKSVTIQWVKYDH 302
Query: 269 SSQCKTKRDSSVSYASA 285
SS+C+T DSSVSYASA
Sbjct: 303 SSECETVEDSSVSYASA 319
>gi|389751361|gb|EIM92434.1| UPF0103-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 331
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 155/324 (47%), Positives = 194/324 (59%), Gaps = 43/324 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVRGVIAPHAGYSYSGRAA 57
R A+HA SWYT N +L EL GW ++ P + + +IAPHAGYSYSG AA
Sbjct: 4 RIATHADSWYTGNGTKLDSELKGWFNAVKPSAQDNYHPPIAKCKAIIAPHAGYSYSGPAA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +ID T I RVF+LGPSHH Y CALS Y+TPIGDLPLDLE I EL+++G+F
Sbjct: 64 AWAYKSIDTTGIKRVFILGPSHHVYLDGCALSECETYQTPIGDLPLDLETIAELRSSGEF 123
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
MD+ DE EHS+EMHLPY+ KVFEG IKIVPILVGA++ E E +GR A Y+
Sbjct: 124 SDMDLQTDEDEHSIEMHLPYVRKVFEGMDIKIVPILVGAIDKEKEVAFGRTLAPYLARDD 183
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHG---------------------VIHKSIEALDKMG 216
F VSSDFCHWG+RF+Y Y + I++SI +LD+
Sbjct: 184 TFCVVSSDFCHWGTRFSYTFYYPERPPTALDPIRLSRTSSPPNLHKFPIYESISSLDREA 243
Query: 217 MDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHML---------GNCSTKIKI 261
M+++ A F +YL NTICGRHPI V L L G K+
Sbjct: 244 MELLTLPPCTARQAHTYFAEYLARTKNTICGRHPIGVLLGALSAVEKGEVEGVEKRMAKL 303
Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
+++RYEQSSQC T +DSSVSYASA
Sbjct: 304 QWVRYEQSSQCLTVKDSSVSYASA 327
>gi|392571540|gb|EIW64712.1| UPF0103-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 325
Score = 282 bits (721), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 159/322 (49%), Positives = 189/322 (58%), Gaps = 39/322 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPK-----SPEVRG---VIAPHAGYSYSGRA 56
R A+HA SWY+ ++ +L EL GWL E P SP V G VIAPHAGY+YSGR
Sbjct: 4 RNATHADSWYSGDASQLDRELSGWL-EGVSPSAEDEFSPPVSGTKAVIAPHAGYAYSGRT 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA+A+ +ID T I RVF+LGPSHH Y CAL+ T Y+TP+G LP+D+E ELK TG+
Sbjct: 63 AAWAYKSIDTTGIKRVFVLGPSHHVYVDGCALTRCTQYETPLGVLPIDVETTRELKNTGQ 122
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
F MD DE EHS+EMHLPY+ KVFEG I IVP+LVGA+N E YG+L A Y+
Sbjct: 123 FTEMDQQTDEDEHSIEMHLPYVRKVFEGKDISIVPVLVGAINYGKEVTYGKLLAPYLARE 182
Query: 177 SNFFSVSSDFCHWGSRFNYM---------------------HYDKKHGVIHKSIEALDKM 215
FF VSSDFCHWG RF Y H K IH+SI ALD
Sbjct: 183 DTFFVVSSDFCHWGLRFQYTFYYPKPPPSSVSPVRLSRADSHTSVKENPIHESISALDHE 242
Query: 216 GMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI---KFLRY 266
M+I+ A F YL NTICGRHPI V L L + K+ K++RY
Sbjct: 243 AMEILTMPPCTAAQAHHDFADYLARTKNTICGRHPIGVLLGALTVLEKEGKVPKLKWVRY 302
Query: 267 EQSSQCKTKRDSSVSYASAAAK 288
EQSS C +DSSVSYASA K
Sbjct: 303 EQSSACVAIKDSSVSYASAFVK 324
>gi|402588404|gb|EJW82337.1| hypothetical protein WUBG_06754, partial [Wuchereria bancrofti]
Length = 283
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 142/276 (51%), Positives = 183/276 (66%), Gaps = 8/276 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R ASHAGSWYTD+ ++L EL WL AG S R +I+PHAGYSYSGR AA+AF
Sbjct: 8 VRNASHAGSWYTDDPRKLHRELTEWLGAAGSRHSQSARAIISPHAGYSYSGRVAAFAFKQ 67
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I P ++S +F+LGPSH CALST Y+TPIGDL +D ELK TG F LMD+
Sbjct: 68 IIPETVSIIFVLGPSHVMSLDTCALSTCWRYRTPIGDLQIDQRTNMELKETGAFSLMDLR 127
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIK---IVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+EAEHS+EM LPY+AK+ E IVP+LVG+++ +A YG++F+KY+ DP F
Sbjct: 128 SEEAEHSIEMQLPYIAKIMEKQSTNSYSIVPVLVGSLSPSKQASYGKIFSKYLSDPKIVF 187
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
VSSDFCHWGSRF++M +D GV I++ I A+DK GMD I + +P AF +YL NTI
Sbjct: 188 VVSSDFCHWGSRFHFMPHDNSTGVPIYEQIAAMDKQGMDAISSLNPAAFGEYLKRTQNTI 247
Query: 240 CGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQ 271
CGR+PISV L + S + FL+Y QS+Q
Sbjct: 248 CGRNPISVILYAAEYFRQMNSHLAEFVFLKYAQSNQ 283
>gi|348675910|gb|EGZ15728.1| hypothetical protein PHYSODRAFT_351598 [Phytophthora sojae]
Length = 305
Score = 281 bits (719), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 141/289 (48%), Positives = 190/289 (65%), Gaps = 6/289 (2%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
++R A++AGSWY+ + L EL GWL A K P +R +I PHAG+ YSG AA
Sbjct: 11 RVRLATYAGSWYSHDEHELEGELAGWLDAAEDNDTQEKQPSIRAIIGPHAGFRYSGPTAA 70
Query: 59 YAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
YA+ ++ + I RVF+LGPSHH+Y CA+STA Y+TP+G++ +D EV E+L +GKF
Sbjct: 71 YAYHHLLNLDRIKRVFILGPSHHFYLRGCAVSTAHEYETPLGNIVIDHEVNEKLVDSGKF 130
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
M + VDE EHS+EMHLP++ KV G VPILVG ++ + YG++ A Y+++
Sbjct: 131 ATMSMDVDEDEHSIEMHLPFIYKVMNGRKFTAVPILVGNTKSKMDEEYGKILAPYLENDE 190
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
N F +SSDFCHWG RF Y +D +G IH+ I+ LD GM IE D + F +YL E +N
Sbjct: 191 NLFVISSDFCHWGPRFRYQPHDSTYGEIHEYIKYLDHQGMGFIERLDAEGFTRYLEETNN 250
Query: 238 TICGRHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
TICGRHPIS+ LH +L + K +KF++Y QSS C DSSVSYASA
Sbjct: 251 TICGRHPISLLLHSILASKKLKCTLKFVKYAQSSACMRHGDSSVSYASA 299
>gi|296482698|tpg|DAA24813.1| TPA: protein MEMO1 [Bos taurus]
Length = 250
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 131/243 (53%), Positives = 169/243 (69%), Gaps = 3/243 (1%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ +PSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLAEPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 246
Query: 243 HPI 245
HPI
Sbjct: 247 HPI 249
>gi|336383353|gb|EGO24502.1| hypothetical protein SERLADRAFT_467923 [Serpula lacrymans var.
lacrymans S7.9]
Length = 348
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 154/322 (47%), Positives = 194/322 (60%), Gaps = 37/322 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR-----EAGLPKSPEVRGVIAPHAGYSYSGR 55
M+ RRA+HAGSWY+ + + L EL WL E P + +IAPHAGYSYSG
Sbjct: 23 MQPSRRATHAGSWYSSSGRTLDAELTQWLAAVTSDEDFAPPITGCKAIIAPHAGYSYSGP 82
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AAA+A+ +ID T I VF+LGPSHH Y CALS Y+TP+G+LPLDL+VI +L+ TG
Sbjct: 83 AAAWAYKSIDTTDIKCVFILGPSHHVYLDGCALSNCEQYETPLGELPLDLQVIRDLRDTG 142
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
+FE MDI DE EHS+EMHL Y+ KVFE I IVP+LVGA++ + EA YG++ A Y+
Sbjct: 143 EFEDMDIQTDEDEHSIEMHLSYVRKVFENSDISIVPVLVGAISKDKEAAYGQILAPYLAR 202
Query: 176 PSNFFSVSSDFCHWGSRFNYMHY------DKKHGV----------------IHKSIEALD 213
F +SSDFCHWG+RF+Y +Y G+ IH+SI LD
Sbjct: 203 EDTLFVISSDFCHWGTRFSYTYYYPEPLPSNTPGIRLSRSGPPPSTLSSRPIHESISDLD 262
Query: 214 KMGMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLG----NCSTKIKIKF 263
M ++ A F YL NTICGRHPI V L L + + +KIK+
Sbjct: 263 HEAMQLLTIPPGSAVNAHQEFASYLGRTKNTICGRHPIGVLLGALSVLEQDETHSVKIKW 322
Query: 264 LRYEQSSQCKTKRDSSVSYASA 285
+RYEQSSQC RDSSVSYASA
Sbjct: 323 VRYEQSSQCFNIRDSSVSYASA 344
>gi|195145810|ref|XP_002013883.1| GL23150 [Drosophila persimilis]
gi|194102826|gb|EDW24869.1| GL23150 [Drosophila persimilis]
Length = 269
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 141/286 (49%), Positives = 175/286 (61%), Gaps = 29/286 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD L+ +LD WL A L P R +IAPHAGY+Y G +A+A+ I
Sbjct: 4 RRATHAGSWYTDAGSELSRQLDRWLGAAELSHGP-ARAIIAPHAGYTYCGACSAFAYRQI 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A YKTP+ DL +D+++ EL+ T +F D
Sbjct: 63 SPVVVKRIFILGPSHHVRLRGCALSVAKKYKTPLYDLKIDIQINTELEKTPQFSYKD--- 119
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
IVPILVG++N E EA YG L + Y DP+N F +SS
Sbjct: 120 ---------------------QFTIVPILVGSLNPEQEAQYGSLLSTYFMDPTNLFVISS 158
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWG RF+Y +YD+ G IHKSIE LDK GMD+IET P AF +YL +Y+NTICGRHP
Sbjct: 159 DFCHWGQRFSYTYYDRSCGQIHKSIEKLDKQGMDLIETLSPTAFTEYLRKYNNTICGRHP 218
Query: 245 ISVFLHMLGNCS----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
I V L + K+ KFL+Y QSSQC DSSVSYAS +
Sbjct: 219 IGVMLGAVKALQDQGYDKMSFKFLQYAQSSQCLDMEDSSVSYASGS 264
>gi|344288749|ref|XP_003416109.1| PREDICTED: protein MEMO1-like isoform 2 [Loxodonta africana]
Length = 274
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 141/290 (48%), Positives = 179/290 (61%), Gaps = 29/290 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAP
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPR----------------- 49
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 50 ------RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 103
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 163
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 223
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 224 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 273
>gi|212286159|ref|NP_001131074.1| protein MEMO1 isoform 2 [Homo sapiens]
gi|114576873|ref|XP_001163989.1| PREDICTED: protein MEMO1 isoform 1 [Pan troglodytes]
gi|334312891|ref|XP_003339794.1| PREDICTED: protein MEMO1-like isoform 2 [Monodelphis domestica]
gi|410209452|gb|JAA01945.1| mediator of cell motility 1 [Pan troglodytes]
Length = 274
Score = 277 bits (708), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 141/290 (48%), Positives = 178/290 (61%), Gaps = 29/290 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAP
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPR----------------- 49
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 50 ------RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 103
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 163
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 223
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 224 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 273
>gi|327262349|ref|XP_003215987.1| PREDICTED: protein MEMO1-like isoform 2 [Anolis carolinensis]
Length = 274
Score = 277 bits (708), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 142/290 (48%), Positives = 178/290 (61%), Gaps = 29/290 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L+ +L+GWL + K P R +IAP
Sbjct: 8 REASHAGSWYTASGPQLSAQLEGWLSQVQSTKRP-ARAIIAPR----------------- 49
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
R+F+LGPSHH +CALS+ +Y+TP+ DL +D +V EL TG FE M +
Sbjct: 50 ------RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKVYAELWKTGMFERMSLQT 103
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H IVP+LVGA++ E +G+LF+KY+ DP N F V
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFTIVPVLVGALSETKEQEFGKLFSKYLADPCNLFVV 163
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 223
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 224 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALMV 273
>gi|449669566|ref|XP_004207060.1| PREDICTED: protein MEMO1-like [Hydra magnipapillata]
Length = 291
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 142/286 (49%), Positives = 193/286 (67%), Gaps = 4/286 (1%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
RRASHAGSWY+D+ +L ++L+ WL E + +P R +I+PHAGY+Y G AAYA+
Sbjct: 3 FRRASHAGSWYSDSGDQLNKQLEQWLSEVNVKSTP-ARALISPHAGYAYCGACAAYAYKQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I+P +I R+F+LGPSHH P CA++ T Y+TP+ +L +D + EL TGKF++M
Sbjct: 62 INPMTIKRIFILGPSHHVALPGCAVTQTTSYETPLYNLKIDNLINNELLGTGKFDIMSKE 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
DE EHS+EM LP++AKV E + +VPILVG+ + E E +YG +F+KY+ +P N F
Sbjct: 122 TDENEHSIEMQLPFIAKVMESNKDNFTVVPILVGSTSHEQERLYGVIFSKYLKNPENLFV 181
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF + YDK G I++SIE LDKMGM IE D ++F KYL ++ NTICG
Sbjct: 182 ISSDFCHWGKRFRFTPYDKSKGEIYESIEDLDKMGMKYIEQLDTNSFYKYLEKFSNTICG 241
Query: 242 RHPISVFLHMLGNC-STKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
RHPI + L+ + S KF Y QSS+C+ DSSVSYA+
Sbjct: 242 RHPIGILLNAVNEVRSADASFKFHYYSQSSKCRRFEDSSVSYAAGT 287
>gi|345307803|ref|XP_001509250.2| PREDICTED: protein MEMO1-like [Ornithorhynchus anatinus]
Length = 273
Score = 275 bits (702), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 132/256 (51%), Positives = 173/256 (67%), Gaps = 7/256 (2%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI 98
E++G+ HAGY+Y G AA+A+ +DP+ R+F+LGPSHH +CALS+ +Y+TP+
Sbjct: 19 EIKGL--SHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSHHVPLSRCALSSVDIYRTPL 76
Query: 99 GDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGA 156
DL +D ++ EL TG FE M + DE EHS+EMHLPY AK E H I+P+LVGA
Sbjct: 77 YDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGA 136
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
++ E +G+LF+KY+ DPSN F +SSDFCHWG RF Y +YD+ G I++SIE LDKMG
Sbjct: 137 LSESKEQEFGKLFSKYLADPSNLFVISSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMG 196
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCK 273
M+IIE DP +F YL +Y NTICGRHPI V L+ L + FL Y QSSQC+
Sbjct: 197 MNIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQKNGVNMSFSFLNYAQSSQCR 256
Query: 274 TKRDSSVSYASAAAKV 289
+DSSVSYA+ A V
Sbjct: 257 NWQDSSVSYAAGALTV 272
>gi|403415251|emb|CCM01951.1| predicted protein [Fibroporia radiculosa]
Length = 326
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 152/319 (47%), Positives = 188/319 (58%), Gaps = 38/319 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-------EAGLPKSPEVRGVIAPHAGYSYSGRAA 57
RRA+HAGSWY + +L EL L+ EA P + +IAPHAGYSYSG A
Sbjct: 4 RRATHAGSWYDADGGKLDTELSSNLQSVEPSLEEAFAPPVKGSKAIIAPHAGYSYSGSTA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +ID I RVF+LGP+HH Y CALST Y+TP+G LPLDL+ I+EL+ TGKF
Sbjct: 64 AWAYKSIDTAGIKRVFILGPAHHVYLDGCALSTCERYETPLGTLPLDLDTIQELRDTGKF 123
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
MD+ DE EHS+EMHLPY+ K+FEG I IVPILVGA+N EA +G + A Y+
Sbjct: 124 SDMDVDTDEDEHSIEMHLPYVRKIFEGLNISIVPILVGAINYNKEATFGTILAPYLARDD 183
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHG----------------------VIHKSIEALDKM 215
F VSSDFCHWG+RF Y Y K I+ SI ALD
Sbjct: 184 TFCVVSSDFCHWGTRFQYTFYYPKAPPTSIPAIRLSKADPSPPSLTTHPIYSSISALDHE 243
Query: 216 GMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI---KFLRY 266
M+++ A F +YL NTICGRHPI V L L + K+ K++RY
Sbjct: 244 AMELLTMPPSTAAQSHHEFAEYLARTKNTICGRHPIGVLLGALAVLQKQGKVPHLKWVRY 303
Query: 267 EQSSQCKTKRDSSVSYASA 285
EQSS+C T +DSSVSYASA
Sbjct: 304 EQSSECVTIKDSSVSYASA 322
>gi|339247511|ref|XP_003375389.1| protein MEMO1 [Trichinella spiralis]
gi|316971269|gb|EFV55071.1| protein MEMO1 [Trichinella spiralis]
Length = 301
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 136/291 (46%), Positives = 180/291 (61%), Gaps = 7/291 (2%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
E R SHAGSWY+ N + L +L+ WL P + +I+PHAGY Y G +AAYA+
Sbjct: 6 ESCRPCSHAGSWYSANEEDLRSQLEDWLSAVKTNHGP-AKAIISPHAGYCYCGASAAYAY 64
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
IDP ++ R+F+LGPSHH CA+S Y TP +L +D +V EL TG FE M+
Sbjct: 65 KQIDPNTVDRIFILGPSHHVCLSGCAVSKFNSYGTPFYNLTVDQQVNNELIETGLFEKME 124
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ D AEHS+EM LPY+A V + IVPILVG++ ++ YGR+ ++Y+ D N
Sbjct: 125 LLTDTAEHSIEMQLPYIAHVMQSRKGAFTIVPILVGSLTPSRQSAYGRILSRYLADDRNL 184
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
F +S+DFCHWG RF++ YDK G IH SIE LD+ GM IIE D AF YL NTI
Sbjct: 185 FIISTDFCHWGHRFHFTKYDKTGGEIHSSIERLDREGMKIIENLDSAAFNDYLKRTGNTI 244
Query: 240 CGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGRHPISV L H+ + + +FL Y QS+ C++ DSSVSYA+ +
Sbjct: 245 CGRHPISVLLQATEHLHEINNRRGDFQFLHYSQSNHCRSMHDSSVSYAAGS 295
>gi|341887659|gb|EGT43594.1| CBN-TAG-253 protein [Caenorhabditis brenneri]
Length = 302
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 138/289 (47%), Positives = 186/289 (64%), Gaps = 6/289 (2%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+ R ASHAGSWY+ N + L +L WL AG + R +IAPHAGYSY G AAYAF
Sbjct: 8 DHTRAASHAGSWYSGNQRDLDRQLTKWLDNAG-DRYGTARALIAPHAGYSYCGETAAYAF 66
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ +++ RVF+LGPSH CA++T + Y+TP+GD+ +D + EEL+AT F+LMD
Sbjct: 67 KQVVSSAVDRVFILGPSHVVALSGCAITTCSKYRTPLGDMTVDHTINEELRATRHFDLMD 126
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
+EAEHS+EM LP+LAKV IVPILVG++ YG +FA Y++DP N F
Sbjct: 127 RRDEEAEHSLEMQLPFLAKVMGSRRYTIVPILVGSLPGSRAQTYGGIFAHYMEDPRNLFV 186
Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF++ YD+ V I++ I +LDK GM+ IET +P F YL + NTIC
Sbjct: 187 ISSDFCHWGDRFSFSPYDRNSNVPIYEQITSLDKQGMNAIETLNPTVFNDYLKKTQNTIC 246
Query: 241 GRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
GR+PI + L H + + + KFL Y QS++ ++ DSSVSYAS
Sbjct: 247 GRNPILIMLQAAEHFRISNNHTHEFKFLHYTQSNKVRSPSDSSVSYASG 295
>gi|194386136|dbj|BAG59632.1| unnamed protein product [Homo sapiens]
Length = 274
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 140/290 (48%), Positives = 177/290 (61%), Gaps = 29/290 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAP
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPR----------------- 49
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 50 ------RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 103
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+ F+KY+ DPSN F V
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKPFSKYLADPSNLFVV 163
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 223
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 224 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 273
>gi|348507066|ref|XP_003441078.1| PREDICTED: protein MEMO1-like isoform 2 [Oreochromis niloticus]
Length = 274
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 142/290 (48%), Positives = 178/290 (61%), Gaps = 29/290 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWY+ + +L +L+GWL +A P R +IAP
Sbjct: 8 REASHAGSWYSASGSQLNAQLEGWLSQAQSTIRP-ARAIIAPR----------------- 49
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
RVF+LGPSHH +CALS A +Y+TP+ DL +D +V +L TG FE M +
Sbjct: 50 ------RVFILGPSHHVPLSRCALSPADIYRTPLYDLRIDQKVYADLWKTGLFERMSLQT 103
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H IVP+LVGA++ E YG+L +KY+ DPSN F +
Sbjct: 104 DEDEHSIEMHLPYTAKAMESHKDEFSIVPVLVGALSESKEQEYGKLLSKYLADPSNLFII 163
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGR
Sbjct: 164 SSDFCHWGQRFRYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPMSFTNYLKKYRNTICGR 223
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L ++ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 224 HPIGVLLNAVAELRKSGLEMNFTFLNYAQSSQCRNWQDSSVSYAAGALIV 273
>gi|336370572|gb|EGN98912.1| hypothetical protein SERLA73DRAFT_122776 [Serpula lacrymans var.
lacrymans S7.3]
Length = 327
Score = 273 bits (697), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 151/323 (46%), Positives = 193/323 (59%), Gaps = 38/323 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR-----EAGLPKSPEVRGVIAPHAGYSYSGR 55
M+ RRA+HAGSWY+ + + L EL WL E P + +IAPHAGYSYSG
Sbjct: 1 MQPSRRATHAGSWYSSSGRTLDAELTQWLAAVTSDEDFAPPITGCKAIIAPHAGYSYSGP 60
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AAA+A+ +ID T I VF+LGPSHH Y CALS Y+TP+G+LPLDL+VI +L+ TG
Sbjct: 61 AAAWAYKSIDTTDIKCVFILGPSHHVYLDGCALSNCEQYETPLGELPLDLQVIRDLRDTG 120
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYVD 174
+FE MDI DE EHS+EMHL Y+ K++ I IVP+LVGA++ + EA YG++ A Y+
Sbjct: 121 EFEDMDIQTDEDEHSIEMHLSYVRKIYTSSSDISIVPVLVGAISKDKEAAYGQILAPYLA 180
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHY------DKKHGV----------------IHKSIEAL 212
F +SSDFCHWG+RF+Y +Y G+ IH+SI L
Sbjct: 181 REDTLFVISSDFCHWGTRFSYTYYYPEPLPSNTPGIRLSRSGPPPSTLSSRPIHESISDL 240
Query: 213 DKMGMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLG----NCSTKIKIK 262
D M ++ A F YL NTICGRHPI V L L + + +KIK
Sbjct: 241 DHEAMQLLTIPPGSAVNAHQEFASYLGRTKNTICGRHPIGVLLGALSVLEQDETHSVKIK 300
Query: 263 FLRYEQSSQCKTKRDSSVSYASA 285
++RYEQSSQC RDSSVSYASA
Sbjct: 301 WVRYEQSSQCFNIRDSSVSYASA 323
>gi|146186006|ref|XP_001032869.2| hypothetical protein TTHERM_00486570 [Tetrahymena thermophila]
gi|146143075|gb|EAR85206.2| hypothetical protein TTHERM_00486570 [Tetrahymena thermophila
SB210]
Length = 296
Score = 272 bits (695), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 181/288 (62%), Gaps = 4/288 (1%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAY 59
++ R+A+ AGSWY N +L +EL+ +L+ A L SP +++ +I PHAGY YSG A+
Sbjct: 5 QQTRKAAFAGSWYEGNQAKLNQELNNYLQNAQLQISPIQKIKAMIGPHAGYRYSGPTQAW 64
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
++ I+P R+FLLGPSH C LS+ TP G + +D +VI+EL TG F
Sbjct: 65 SYKYINPQEHQRIFLLGPSHRQRFQGCGLSSCNSLDTPFGQIQVDTDVIQELIKTGSFVQ 124
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
D +EAEHS+EMHLPY+ K+ IVPI+VG + + E YG + +KY DD
Sbjct: 125 TDKQTEEAEHSLEMHLPYIKKIMGDKPFTIVPIMVGQTSYDLEQKYGGILSKYFDDEKTL 184
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
F +SSDFCHWG RFNY ++D + G I++SI+ LDK GMD IE+ F +YL E NTI
Sbjct: 185 FIISSDFCHWGERFNYQYHDPQAGKIYQSIQKLDKEGMDSIESHSSKKFAQYLNETSNTI 244
Query: 240 CGRHPISVFLHMLGNC--STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
CGR I + LH++ S +K KF++Y QSSQ + DSSVSYASA
Sbjct: 245 CGRRGIGILLHIIEQSALSKTLKTKFVQYAQSSQVVDEDDSSVSYASA 292
>gi|47229018|emb|CAG09533.1| unnamed protein product [Tetraodon nigroviridis]
Length = 308
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 183/304 (60%), Gaps = 35/304 (11%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+ WL +A + P R +IAPHAGYSY G AA+A+ +DPT RVF+LGPSH
Sbjct: 5 QLNAQLEDWLSKAQSTRRP-ARAIIAPHAGYSYCGACAAHAYKQVDPTVTRRVFILGPSH 63
Query: 80 HYYTPKCALSTATVYKTPIGDLPLD-----------------------------LEVIEE 110
H + CALS+A +Y+TP+ D+ +D L V E
Sbjct: 64 HVHLTCCALSSAEIYRTPLYDMRIDQKGLAHFYMPRERLSACAAKAVHAKLDCCLPVYAE 123
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRL 168
L TG F+ M+I DE EHS+EMHLPY AK E IVPILVGA++ E YG+L
Sbjct: 124 LWKTGLFDRMNIKTDEDEHSIEMHLPYTAKAMESRKDDFSIVPILVGALSENKEHEYGKL 183
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+KY+ DPSN F VSSDFCHWG+RF+Y +YD+ G I++SIE LDKMGM IIE DP +F
Sbjct: 184 LSKYLADPSNLFVVSSDFCHWGNRFHYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPPSF 243
Query: 229 KKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
Y+ +Y NTICGRHPI V L+ L ++ FL Y QS QC+ SSVSYA+
Sbjct: 244 SNYIKKYRNTICGRHPIGVLLNAVAELRKSGLEMNFSFLNYAQSGQCRNWEQSSVSYAAG 303
Query: 286 AAKV 289
A V
Sbjct: 304 ALYV 307
>gi|409051530|gb|EKM61006.1| hypothetical protein PHACADRAFT_180158 [Phanerochaete carnosa
HHB-10118-sp]
Length = 333
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 152/327 (46%), Positives = 187/327 (57%), Gaps = 45/327 (13%)
Query: 4 IRRASHAGSWYTDNSKRLA-----EELDGWL---------REAGLPKSPEVRGVIAPHAG 49
+RRA+HAGSWYTD+ K A + LD L E P + VIAPHAG
Sbjct: 3 VRRATHAGSWYTDDRKLTAHSCRWQTLDNELARNLAAVQSSEEFTPPVAGCKAVIAPHAG 62
Query: 50 YSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
YSYSG AAA+A+ ID T I RVF+LGPSHH Y C LS Y+TPIGDLPLD+E I
Sbjct: 63 YSYSGPAAAWAYKTIDTTGIKRVFILGPSHHVYLDGCDLSQCKTYQTPIGDLPLDIETIA 122
Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
ELK +GKF +M + DE EHS+E+HLPY+ KVFE I +VPILVGA++ + E YG L
Sbjct: 123 ELKKSGKFGVMSVETDEDEHSIELHLPYVRKVFERKDISVVPILVGAIDKDKEDAYGELL 182
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRF----------------------NYMHYDKKHGVIHK 207
A Y+ F VSSDFCHWG RF N D + +H+
Sbjct: 183 APYLAREDTIFVVSSDFCHWGQRFQYTYYYPEPPPTHRPAIRLSRSNASPADLANHPVHQ 242
Query: 208 SIEALDKMGMDIIETGDPDA------FKKYLLEYDNTICGRHPISVFLHMLGNCSTK--- 258
SI ALD+ M+++ A F YL NTICGRHPI V L L +
Sbjct: 243 SIRALDQEAMELLTMPPASATSAHYDFTDYLRRTKNTICGRHPIGVLLGALSALEQEGRV 302
Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASA 285
K++++RYEQSS C++ DSSVSYASA
Sbjct: 303 PKLQWVRYEQSSACQSVNDSSVSYASA 329
>gi|407922334|gb|EKG15436.1| UPF0103/Mediator of ErbB2-driven cell motility (Memo-related)
[Macrophomina phaseolina MS6]
Length = 327
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 188/324 (58%), Gaps = 41/324 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL----------REAG------LPKSPEVRGVIAPH 47
+R ASHAGSWY+ N +L+ +LD WL R A +P S R +IAPH
Sbjct: 3 VREASHAGSWYSANKSQLSAQLDQWLDAVPASATGIRPASQDDLIDIPSS-GARAIIAPH 61
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
AGY+YSG AAA+A+ +D + R+FLLGPSHHYY CALS Y+TP+GDL +D E
Sbjct: 62 AGYAYSGPAAAWAYKALDLSKAKRIFLLGPSHHYYLTGCALSKCDAYETPLGDLQIDKET 121
Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAEN 161
+ EL+ F+ M DE EHS+EMHLPY+ KV +VPILVGA +A
Sbjct: 122 VAELRKKASFDTMTQSQDEEEHSLEMHLPYIYKVLSRQFSDPSEFPTLVPILVGATSAAT 181
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG---------------VIH 206
E +G L A Y+ DP++ F +SSDFCHWGSRF Y +Y++ G IH
Sbjct: 182 ERQFGELLAPYLADPTSVFIISSDFCHWGSRFRYTYYEQAPGQVRNLRSSDKSPSSPAIH 241
Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKF 263
+SI A+D MD +ET + F L E NT+CGRHPI V + L K K +F
Sbjct: 242 ESIAAVDGYCMDAVETKSHEKFLGILEETGNTVCGRHPIGVVMAAVEALSLAEDKGKFRF 301
Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
+RYE+SS C + +DSSVSY SA A
Sbjct: 302 IRYERSSDCVSPKDSSVSYCSAVA 325
>gi|268555180|ref|XP_002635578.1| C. briggsae CBR-TAG-253 protein [Caenorhabditis briggsae]
Length = 391
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 135/290 (46%), Positives = 185/290 (63%), Gaps = 8/290 (2%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
E R ASHAGSWY N + L +L WL AG + R +I+PHAGYSY G AAYAF
Sbjct: 97 EHARSASHAGSWYNGNQRDLDRQLTKWLDNAG-DRFGTARALISPHAGYSYCGETAAYAF 155
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ +++ RVF+LGPSH CA++T + Y+TP+GDL +D ++ EEL+AT F+LMD
Sbjct: 156 KQVVSSAVERVFILGPSHIVALNGCAITTCSKYRTPLGDLSVDHKINEELRATRHFDLMD 215
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
+E+EHS+EM LP++AKV I+P+LVG++ + YG +FA Y++DP N F
Sbjct: 216 RRDEESEHSIEMQLPFIAKVMGNRRYTIIPVLVGSLPGSRQQTYGNIFAHYMEDPRNLFV 275
Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF++ YD+ + I++ I LDK GM IET +P F YL + NTIC
Sbjct: 276 ISSDFCHWGDRFSFSPYDRNSNLPIYEQITNLDKQGMSAIETLNPTVFNDYLKKTQNTIC 335
Query: 241 GRHPISVFLH-----MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
GR+PI + L L N T + +FL Y QS++ ++ DSSVSYAS
Sbjct: 336 GRNPILIMLQAAEGFRLSNNHTH-EFRFLHYTQSNKVRSPSDSSVSYASG 384
>gi|308506181|ref|XP_003115273.1| CRE-TAG-253 protein [Caenorhabditis remanei]
gi|308255808|gb|EFO99760.1| CRE-TAG-253 protein [Caenorhabditis remanei]
Length = 411
Score = 270 bits (691), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 138/295 (46%), Positives = 188/295 (63%), Gaps = 9/295 (3%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
E +R ASHAGSWY+ N + L +L WL AG + R +I+PHAGYSY G AAYAF
Sbjct: 116 EHVRSASHAGSWYSGNQRDLDRQLTKWLDSAG-ERFGTARALISPHAGYSYCGETAAYAF 174
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I P+++ RVF+LGPSH CA++T + Y+TP+GDL +D +V E+L+AT F+LMD
Sbjct: 175 KQIVPSAVERVFILGPSHVVALNGCAITTCSKYRTPLGDLIVDQKVTEDLRATRHFDLMD 234
Query: 122 ICVDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+E+EHS+EM LP++AKV IVP+LVG++ + YG +FA Y++DP N F
Sbjct: 235 RRDEESEHSIEMQLPFIAKVMGPTRRYTIVPVLVGSLPGSRQQTYGNIFAHYMEDPKNLF 294
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
+SSDFCHWG RF++ +D+ + I + I +DK GM IET +P F YL + NTI
Sbjct: 295 VISSDFCHWGDRFSFSPFDRNSNLPIFEQITNMDKQGMAAIETLNPTVFNDYLKKTQNTI 354
Query: 240 CGRHPISVFLHM-----LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
CGR+PI + L L N T + KFL Y QS++ ++ DSSVSYAS V
Sbjct: 355 CGRNPILIMLQAAEHFRLSNNHTH-EFKFLHYTQSNKVRSPSDSSVSYASGVLFV 408
>gi|345571372|gb|EGX54186.1| hypothetical protein AOL_s00004g219 [Arthrobotrys oligospora ATCC
24927]
Length = 330
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 193/328 (58%), Gaps = 41/328 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEV--RGVIAPHAGYSYS 53
M IR ASHAGSWY+ + L EL G+L + G+ + P R +IAPHAGYSYS
Sbjct: 1 MTSIRPASHAGSWYSKDKNTLDSELSGYLSQVPSTIDGIGEIPPAGARVIIAPHAGYSYS 60
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AAA+A+ +++ T+I RVF+LGPSHH Y CA+S+ Y TP+G LP+D + +L
Sbjct: 61 GPAAAWAYKSLNLTNIKRVFILGPSHHIYINGCAVSSHGAYATPLGPLPIDTKATLDLIK 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFE-----GHLIKIVPILVGAVNAENEAMYGRL 168
T F M+ VD EHS+EMHLPY K+ + IVPILVGA+N E E YG++
Sbjct: 121 TNSFSFMNQVVDSDEHSIEMHLPYTYKMLSTFFGPTDIPPIVPILVGAINTETEEKYGKI 180
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSIE 210
+ ++DDP N F VSSDFCHWGSRF+Y +Y + G I KSIE
Sbjct: 181 LSTHLDDPENAFIVSSDFCHWGSRFSYQYYIPQPGKDGMSLKSRGHRVPEGGQEIWKSIE 240
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-----------GNCSTKI 259
LD+ ++ IE+G + F YL NT+CGRHPI V + L + K
Sbjct: 241 GLDQAAIEAIESGKHENFVDYLSHTRNTVCGRHPIGVIMAGLEEVLKIRAERGEDVEGKG 300
Query: 260 KIKFLRYEQSSQCKTKRDSSVSYASAAA 287
KF+RYEQSSQCK+ DSSVSYASA A
Sbjct: 301 LFKFVRYEQSSQCKSVEDSSVSYASAFA 328
>gi|313221650|emb|CBY36133.1| unnamed protein product [Oikopleura dioica]
Length = 292
Score = 270 bits (689), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 130/288 (45%), Positives = 187/288 (64%), Gaps = 8/288 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRASHAGSWY+ + + L +L W+ + + + VI PHAGY+YS AA++F +
Sbjct: 4 RRASHAGSWYSGDPRELENQLKNWI--SKAKFEKKAKAVIVPHAGYAYSAPTAAWSFLQL 61
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
D + ++F++GPSHH Y P CAL +TP+G+L +D +++ EL ATG F MD+
Sbjct: 62 DAQTTKKIFVIGPSHHVYLPNCALPVVKECETPLGNLRIDKDIVTELHATGLFCTMDVPT 121
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EM LP+LA +F+ L + +VPI+VG++ E EA Y ++FAKY++DP F +
Sbjct: 122 DEEEHSIEMQLPFLAHIFKNRLEEVSVVPIMVGSIKQEKEADYAQVFAKYLNDPEVVFVI 181
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG RF Y + +++ I +SI+A+D+ GMD I + + F YL NTICGR
Sbjct: 182 SSDFCHWGKRFRYTYRLEEYEQIFESIDAVDREGMDHIASKNLSNFHNYLRRTKNTICGR 241
Query: 243 HPISVFLHMLGNCSTKI----KIKFLRYEQSSQCKTKRDSSVSYASAA 286
+PI + L + K+ +IKFL+Y QSSQ ++ DSSVSYASA
Sbjct: 242 NPICLLLATINLLEQKVNMQSQIKFLKYAQSSQVRSPNDSSVSYASAV 289
>gi|189201744|ref|XP_001937208.1| hypothetical protein PTRG_06875 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187984307|gb|EDU49795.1| hypothetical protein PTRG_06875 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 377
Score = 270 bits (689), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 191/329 (58%), Gaps = 47/329 (14%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
R ASHAGSWYTDN + L+++LDGWL E +P +P R +IAPH
Sbjct: 48 RSASHAGSWYTDNGQLLSQQLDGWLEAVPNSTTPIGTASSQQGEVTIP-TPNARAIIAPH 106
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
AG+SYSG AAA+A+ + D + RVFLLGPSHHYY A++ Y+TP+GDL +D E+
Sbjct: 107 AGFSYSGPAAAWAYKSADWANAKRVFLLGPSHHYYLTGAAITGCDKYRTPLGDLTIDTEL 166
Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVN 158
++E++ E M DE EHS+EMHLPY+ K+ + + ++PI+VG +
Sbjct: 167 VQEIQTEWDLETMSKRTDEDEHSLEMHLPYIYKMLSLKNASFQNDTTSVPLIPIMVGNTD 226
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-------------- 204
A E YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y +G
Sbjct: 227 AAAEVHYGSLLAPYLSDPANIFVISSDFCHWGSRFRYTYYQPANGSAATQLRTSSRVPSD 286
Query: 205 --IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----GNCSTK 258
IH+SI A+D+ MD +E+G F L + NT+CGRHPI +F+ + G
Sbjct: 287 YPIHESIAAVDQESMDAVESGSHRQFLDQLRKTGNTVCGRHPIGLFMAAVERAEGLGEGS 346
Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
K KFLRYE+SS + +DSSVSY SA A
Sbjct: 347 GKFKFLRYERSSLVEEVKDSSVSYCSAFA 375
>gi|330920737|ref|XP_003299128.1| hypothetical protein PTT_10063 [Pyrenophora teres f. teres 0-1]
gi|311327315|gb|EFQ92775.1| hypothetical protein PTT_10063 [Pyrenophora teres f. teres 0-1]
Length = 333
Score = 269 bits (688), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 190/329 (57%), Gaps = 47/329 (14%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
R+A+HAGSWYTDN + L+++LDGWL E +P +P R +IAPH
Sbjct: 4 RKATHAGSWYTDNGQLLSQQLDGWLEAVPSSTTPIGTTSSQQGEVTIP-TPNARAIIAPH 62
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
AG+SYSG AAA+A+ + D + RVFLLGPSHHYY A + Y+TP+GDL +D E+
Sbjct: 63 AGFSYSGPAAAWAYKSADWANAKRVFLLGPSHHYYLTGAATTGCDTYRTPLGDLAIDTEL 122
Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVN 158
+ E++A E M DE EHS+EMHLPY+ K+ + + ++PIL+G +
Sbjct: 123 VHEIQAAWDLETMTKRTDEDEHSLEMHLPYIYKMLSLNNTSFQNDTTSVPLIPILIGNTD 182
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-------------- 204
A EA YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y +
Sbjct: 183 AATEAHYGSLLAPYLSDPANVFVISSDFCHWGSRFRYTYYQPANASPATHLKTSSRVPSD 242
Query: 205 --IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----GNCSTK 258
IH+SI A+D MD +E+G F L + NT+CGRHPI +F+ + G
Sbjct: 243 YPIHESIAAVDHESMDAVESGSHRQFLDQLRKTGNTVCGRHPIGLFMAAVEQAEGLGQGS 302
Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
K KFLRYE+SS + +DSSVSY SA A
Sbjct: 303 GKFKFLRYERSSLVQDVKDSSVSYCSAFA 331
>gi|406864123|gb|EKD17169.1| DUF52 domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 330
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 152/327 (46%), Positives = 192/327 (58%), Gaps = 46/327 (14%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--------VRGVIAPHAGYSYSGRA 56
R ASHAGSWY+ +L+ ELD WL A +P S + R +IAPHAGYSYSG A
Sbjct: 4 RSASHAGSWYSSGKDKLSNELDQWL--AQVPDSIDGTQLPVAGARVIIAPHAGYSYSGPA 61
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA+A+ ++D ++ R+FLLGPSH Y P CA S + Y TP+GDL +D E++++L+ TGK
Sbjct: 62 AAWAYKSLDLSNAQRIFLLGPSHALYLPGCATSKHSRYATPLGDLVIDTEIVKKLQDTGK 121
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYGRLFA 170
FE M DE EHS+EMHLPY+ K+ IVPILVG +A +E YG + A
Sbjct: 122 FEKMSTDADETEHSLEMHLPYIYKLCSQSFNSPAEFPPIVPILVGNTSATSEKAYGDVLA 181
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------------DKKHGVIHKS 208
Y+ DP++ F VSSDFCHWG RF Y +Y D IH+S
Sbjct: 182 PYLADPTSVFIVSSDFCHWGLRFQYTYYLPASPSAAAGTAGGYSLKQRHDDPTDPPIHES 241
Query: 209 IEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML------GNCST--KIK 260
I LDK+ MD IETG D F L E NT+CGRHPI V + + G S K K
Sbjct: 242 IGRLDKLAMDAIETGKHDVFLGNLKETGNTVCGRHPIGVVMAAIEVLEKEGKVSPEGKGK 301
Query: 261 IKFLRYEQSSQCKTKRDSSVSYASAAA 287
KF+RYE+SS+ + DSSVSYASA A
Sbjct: 302 FKFVRYERSSEVEEISDSSVSYASAYA 328
>gi|402225258|gb|EJU05319.1| UPF0103-domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 327
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 185/321 (57%), Gaps = 36/321 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRG---VIAPHAGYSYSGR 55
M +RRA+HA SWY+ RL +L GWL E SP ++G +I PHAGY YSG
Sbjct: 2 MSGMRRATHANSWYSGARSRLDHQLKGWLSAVEPSSDYSPPIKGCKAIIGPHAGYDYSGP 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+A+ ID T I RVFLLGPSHH Y CAL+ Y TP+G+LP+DL ELKATG
Sbjct: 62 TAAWAYKAIDVTGIKRVFLLGPSHHVYLDGCALTPFATYDTPLGELPVDLATTAELKATG 121
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
KF +M DE EHS+E+HLPY+ K FEG IK+VPILVGA+N + EA +G + A+Y++
Sbjct: 122 KFSMMSNKTDENEHSIELHLPYVRKTFEGCDIKVVPILVGAINDKKEADFGGILAQYLEQ 181
Query: 176 PSNFFSVSSDFCHWGSRFNYMHY-----------------DKKHG--VIHKSIEALDKMG 216
F VSSDFCHWG RF++ Y D G IH+SI LD
Sbjct: 182 EDTMFVVSSDFCHWGERFDFTFYYPGAPSPTSSIRLEDDSDIPRGGQPIHESISRLDHEA 241
Query: 217 MDIIETGDPDAFKK--------YLLEYDNTICGRHPISVFLHMLGNCST----KIKIKFL 264
++++ + K YL NTICGRHPI V L L K +KF+
Sbjct: 242 INMLTLPTSTSSAKQAHASLSSYLRRTGNTICGRHPIGVLLGALATLEEDNKHKASLKFV 301
Query: 265 RYEQSSQCKTKRDSSVSYASA 285
+Y QSS CK DSSVSYASA
Sbjct: 302 KYAQSSPCKHIEDSSVSYASA 322
>gi|405978340|gb|EKC42740.1| Protein MEMO1 [Crassostrea gigas]
Length = 272
Score = 267 bits (683), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 132/248 (53%), Positives = 168/248 (67%), Gaps = 9/248 (3%)
Query: 47 HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
HAGY Y G +A+ IDP++I R+F+LGPSHH CAL+ + Y+TP+ DL +D +
Sbjct: 22 HAGYVYCGACGGHAYRQIDPSNIKRIFILGPSHHVRLSGCALTETSHYQTPLYDLTIDQK 81
Query: 107 VIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAM 164
+ EEL AT FE M + DE EHS+EMHLPY+AKV E I+P+LVG+++A+ E +
Sbjct: 82 INEELFATKAFEKMKMSTDEDEHSIEMHLPYIAKVMERRRGQFTIIPVLVGSLSADKEKL 141
Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
YG +F++Y+ DP NFF +SSDFCHWG RF Y YDK G I +SIEALDKMGMD IE D
Sbjct: 142 YGSIFSQYLADPENFFVISSDFCHWGQRFRYTFYDKSCGDIWQSIEALDKMGMDAIEQMD 201
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHML------GNCSTKIKIKFLRYEQSSQCKTKRDS 278
P F YL EY+NTICGR+PI V L + GN ++ KF+ Y QSS+CK DS
Sbjct: 202 PAKFSHYLKEYENTICGRYPIGVLLSAIDALRRNGN-GRRMSFKFMNYAQSSRCKKMSDS 260
Query: 279 SVSYASAA 286
SVSYAS A
Sbjct: 261 SVSYASGA 268
>gi|347836555|emb|CCD51127.1| similar to DUF52 domain protein [Botryotinia fuckeliana]
Length = 331
Score = 266 bits (681), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 146/331 (44%), Positives = 188/331 (56%), Gaps = 46/331 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
M R ASHAGSWYT N +L+ ELD WL + +P + P R +IAPHAGYSY
Sbjct: 1 MGSTREASHAGSWYTSNPSKLSSELDNWLSQ--VPSTISDTKLPIPGARVIIAPHAGYSY 58
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
SG AAA+A+ +D ++ RVFLLGPSH +Y +CALS + Y TP+GDL +D +++EL
Sbjct: 59 SGPAAAWAYATLDLSTAKRVFLLGPSHAWYLTECALSKHSKYATPLGDLTIDTALVQELS 118
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYG 166
ATG+F+ M DE EHS+E+HLPY+ K+ ++PILVG NA E +G
Sbjct: 119 ATGEFKRMSTDQDETEHSLELHLPYIYKLLSLNFPSPSSFPPLIPILVGNTNAATERKFG 178
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------DKKHGV------------ 204
+ A Y+ DPSN F +SSDFCHWGSRF Y +Y DK G
Sbjct: 179 SILAPYLSDPSNIFIISSDFCHWGSRFQYTYYLPASPSAEVRDKSGGYSLSRRDKAPTQP 238
Query: 205 -IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-------S 256
IH+SI LD++ M I G F L E NT+CGRHPI V + +
Sbjct: 239 PIHESIGRLDQLSMSAISGGVHQEFLDNLEETGNTVCGRHPIGVIMAAIEKVKEEKSLDG 298
Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
K K KF+RYE+S++ D SVSYASA A
Sbjct: 299 EKGKFKFVRYERSNEVTRVDDGSVSYASAYA 329
>gi|169596398|ref|XP_001791623.1| hypothetical protein SNOG_00959 [Phaeosphaeria nodorum SN15]
gi|160701301|gb|EAT92454.2| hypothetical protein SNOG_00959 [Phaeosphaeria nodorum SN15]
Length = 340
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/336 (43%), Positives = 193/336 (57%), Gaps = 54/336 (16%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
R ASHAGSWYT N K+L+++LDGWL + +P +P R +IAPH
Sbjct: 4 REASHAGSWYTSNGKQLSQQLDGWLEAVPSSTTPIGTASSEQGDVSIP-TPNARAIIAPH 62
Query: 48 AGYSYSGRAAAYAFGNIDPTSIS-------------RVFLLGPSHHYYTPKCALSTATVY 94
AGYSYSG AAA+A+ + D + S RVFLLGPSHH+Y A + Y
Sbjct: 63 AGYSYSGPAAAWAYKSADWANASGVYMNITDHKSSKRVFLLGPSHHHYLSGAATTACDKY 122
Query: 95 KTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH----LIKIV 150
TP+GDL +D +++E+K E M VDEAEHS+EMHLPY+ K+ H + +V
Sbjct: 123 ATPLGDLIIDTALVQEIKQEWGLETMSQDVDEAEHSLEMHLPYIYKMLSLHNNPSSVPLV 182
Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------ 204
PI++G + E+ YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y+ G
Sbjct: 183 PIMIGNTSPSTESKYGSLLAPYLSDPTNIFVISSDFCHWGSRFRYTYYESPDGASATQLT 242
Query: 205 ----------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGN 254
IH+SI+A+DK MD +E+G F + L E NT+CGRHPI VF+ + +
Sbjct: 243 RKSKIDEDWPIHESIKAVDKESMDAVESGHHKRFLEQLKETGNTVCGRHPIGVFMAAVES 302
Query: 255 CST---KIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
K + KF+RYE+SS + DSSVSY SA A
Sbjct: 303 ADVGEGKGRFKFVRYERSSLVEDYGDSSVSYCSAFA 338
>gi|440633615|gb|ELR03534.1| hypothetical protein GMDG_01285 [Geomyces destructans 20631-21]
Length = 327
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 191/324 (58%), Gaps = 43/324 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGRA 56
R ASHAGSWY+ L+ ELDGWL A +P S P R +IAPHAGYSYSG A
Sbjct: 4 RAASHAGSWYSSAPATLSSELDGWL--AQVPDSIDGNKLPVPGARVIIAPHAGYSYSGPA 61
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA+A+ ++D + + RVFLLGPSH Y CA+S+ Y TP+G+L LD I+EL+AT K
Sbjct: 62 AAWAYKSLDLSKVKRVFLLGPSHTLYLSGCAISSQEYYATPLGNLKLDQATIKELQATSK 121
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRLFA 170
F+ +EAEHS+EMHLPY+ KV ++VP+LVGA + E YG++FA
Sbjct: 122 FDPWKAKSEEAEHSLEMHLPYIYKVLANTFKSPDDFPQLVPVLVGATSGPTERSYGQIFA 181
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHY-----------------DK--KHGVIHKSIEA 211
Y+ DP++ F VSSDFCHWG RF Y +Y DK + IH+SI
Sbjct: 182 PYLADPTSVFVVSSDFCHWGERFQYTYYLPASPSANSDGYSLRRRDKTPTNPWIHESIGK 241
Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI--------KF 263
+DK+ MD IE G F L + NT+CGRHPI + + L + KI KF
Sbjct: 242 IDKLAMDAIEQGTHQGFLDNLEDTGNTVCGRHPIGLVMAALEVLKQEGKISAENGGRFKF 301
Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
+RYE+SS K +DSSVSYASA A
Sbjct: 302 VRYERSSDVKDIKDSSVSYASAVA 325
>gi|330797324|ref|XP_003286711.1| hypothetical protein DICPUDRAFT_94236 [Dictyostelium purpureum]
gi|325083309|gb|EGC36765.1| hypothetical protein DICPUDRAFT_94236 [Dictyostelium purpureum]
Length = 277
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 139/285 (48%), Positives = 175/285 (61%), Gaps = 16/285 (5%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR SHAG+WY DN+ RL ++L WL+ A S V+ VIAPHAGY+YSGR A++A+ N
Sbjct: 2 IRNCSHAGTWYFDNASRLEKQLSDWLKVAKRLNS-NVKSVIAPHAGYTYSGRTASHAYIN 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P RVF+LGPSHH Y C L+ ++TPIG+L +D E+ + L TG F
Sbjct: 61 LVPEKFKRVFILGPSHHVYMKTCGLTKLDTWETPIGNLKVDREISDTLYNTGDFVWNSKS 120
Query: 124 VDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
VDE EHS+EM LPY+AKV E L +KIVPI+VG ++ E YG++ A Y DDP NFF
Sbjct: 121 VDEDEHSLEMQLPYIAKVAENKLSDLKIVPIMVGNLSLNLEEKYGKILAPYFDDPDNFFV 180
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSDFCHWG RF Y Y+ + I++ IE LDK M IIETGDP F KYL I
Sbjct: 181 ISSDFCHWGERFGYTKYENQQVPIYQYIEELDKQAMSIIETGDPVQFDKYL-----KILK 235
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
H L + + I + YEQSS+ RDSSVSYA A
Sbjct: 236 IH--------LWSLPNRYDISSVHYEQSSKVVQPRDSSVSYAVLA 272
>gi|410901599|ref|XP_003964283.1| PREDICTED: protein MEMO1-like isoform 2 [Takifugu rubripes]
Length = 274
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 137/290 (47%), Positives = 177/290 (61%), Gaps = 29/290 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASH+GSWY+ + +L +L+ WL +A + P R +IAP
Sbjct: 8 REASHSGSWYSASGSQLNAQLEDWLSKANSKRRP-ARAIIAPR----------------- 49
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
RVF+LGPSHH + CALS+A +Y+TP+ D+ +D +V EL TG F+ M+I
Sbjct: 50 ------RVFILGPSHHVHLTCCALSSAEIYRTPLYDMRIDQKVYAELWKTGLFDRMNIKT 103
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E IVPILVGA++ E YG+L +KY+ DPSN F +
Sbjct: 104 DEDEHSIEMHLPYTAKAMESRKDDFSIVPILVGALSENKEHEYGKLLSKYLADPSNLFVI 163
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHWG+RF+Y +YD+ G I++SIE LDKMGM IIE DP +F Y+ +Y NTICGR
Sbjct: 164 SSDFCHWGNRFHYTYYDESQGEIYRSIEHLDKMGMGIIEQLDPPSFSNYIKKYRNTICGR 223
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QS QC+ SSVSYA+ A V
Sbjct: 224 HPIGVLLNAVAELRKSGLDMNFSFLNYAQSGQCRNWEQSSVSYAAGALYV 273
>gi|396468359|ref|XP_003838155.1| similar to DUF52 domain protein [Leptosphaeria maculans JN3]
gi|312214722|emb|CBX94676.1| similar to DUF52 domain protein [Leptosphaeria maculans JN3]
Length = 332
Score = 264 bits (674), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 139/327 (42%), Positives = 187/327 (57%), Gaps = 44/327 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLP----------------KSPEVRGVIAPHA 48
R SHAGSWY+DN + L+ +LDGWL P +P R +IAPHA
Sbjct: 4 RSPSHAGSWYSDNKELLSRQLDGWLEAVPSPARPIGTSSSQQGEVSIPTPNARAIIAPHA 63
Query: 49 GYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
GYSYSG AAA+A+ +D T+ RVFLLGPSHHYY A + Y TP+GDL +D ++
Sbjct: 64 GYSYSGPAAAWAYKTVDWTNAKRVFLLGPSHHYYLTGAATTGCDNYGTPLGDLIVDTALV 123
Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH---------LIKIVPILVGAVNA 159
+E++ + E+M VDE EHS+EMHLPY+ K+ + + +VPI++G +
Sbjct: 124 KEIQTKWQLEVMSKSVDEDEHSLEMHLPYIHKMLSLNNPSFQSSPSSVPLVPIMIGNTDP 183
Query: 160 ENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV--------------- 204
E+ YG L A Y+ DPSN F VSSDFCHWGSRF Y +Y G
Sbjct: 184 STESHYGALLAPYLSDPSNIFVVSSDFCHWGSRFRYTYYQPADGSAAMTLRSSSRVPPEY 243
Query: 205 -IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST---KIK 260
IH+SI A+D+ M +E+G+ F + L + NT+CGRHPI VF+ + K +
Sbjct: 244 PIHESIAAVDRESMAAVESGEHTRFLEQLRKTGNTVCGRHPIGVFMAAVEKAEMDGGKGR 303
Query: 261 IKFLRYEQSSQCKTKRDSSVSYASAAA 287
KFLRYE+S ++ +DSSVSY S A
Sbjct: 304 FKFLRYERSGLVESVKDSSVSYCSGFA 330
>gi|145514724|ref|XP_001443267.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410645|emb|CAK75870.1| unnamed protein product [Paramecium tetraurelia]
Length = 294
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 128/284 (45%), Positives = 179/284 (63%), Gaps = 4/284 (1%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
KIR ASHAGSWY + K+L +L+ +L +A P ++ +I PHAG+SYSG AA+A+
Sbjct: 7 KIREASHAGSWYIGDGKQLDAQLNDFLSKAKGETIPNIKAIIGPHAGFSYSGPTAAFAYQ 66
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK--FELM 120
++ RVFLLGP HH Y LS Y+TP+G++ LD I++L A K +
Sbjct: 67 HLVQKERMRVFLLGPCHHTYIKGIGLSELEQYETPLGNIELDQPTIKQLSAELKKNYVFT 126
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ ++E EHS+EMHLP++ K+F K++PI+VGA + E +A + KY DP+ F
Sbjct: 127 NKDIEEQEHSLEMHLPFIYKIFPK--CKLIPIMVGATSEEQDAQVASVLVKYFVDPNTVF 184
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF Y Y+K+HG IH+SI LD + +IE+ + F KYL E +NTIC
Sbjct: 185 VISSDFCHWGKRFQYTPYNKEHGEIHQSIAQLDGQAIKLIESHNIKEFYKYLDETENTIC 244
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
GRHPI V L+++ ++K + RY QSSQ DSSVSYA+
Sbjct: 245 GRHPICVLLNIINLSKLQLKTQLARYAQSSQVTKPNDSSVSYAA 288
>gi|451852596|gb|EMD65891.1| hypothetical protein COCSADRAFT_87269 [Cochliobolus sativus ND90Pr]
Length = 333
Score = 261 bits (666), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 187/329 (56%), Gaps = 47/329 (14%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
R+ASHAGSWYTDN + L+++LDGWL + +P +P R +IAPH
Sbjct: 4 RKASHAGSWYTDNGQLLSQQLDGWLGAVPASTTPIGSASSQKGQVSIP-TPSARAIIAPH 62
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
AGYSYSG AAA+A+ + D + RVFLLGPSHHYY A + Y TP+GDL +D +
Sbjct: 63 AGYSYSGPAAAWAYKSADWANAKRVFLLGPSHHYYLTGAATTGCDKYHTPLGDLIVDTTL 122
Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVN 158
++ ++ E+M DE EHS+EMHLPY+ K+ + + ++PI+VG +
Sbjct: 123 VQTIQQEWDLEIMSKRTDEDEHSLEMHLPYIYKMLSLKNSDFQSDSSSVPLIPIMVGNTD 182
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-------------- 204
A E YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y +G
Sbjct: 183 AAAETRYGSLLAPYLADPTNIFVISSDFCHWGSRFRYTYYQPPNGAAATQLKSASRVPSD 242
Query: 205 --IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST----K 258
IH+SI A+D MD IETG F L + NT+CGRHPI +F+ + + K
Sbjct: 243 YPIHESIAAVDHESMDAIETGSHRKFLDQLRKTGNTVCGRHPIGLFMAAVESTEALEEGK 302
Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
FLRYE+SS + +DSSVSY SA A
Sbjct: 303 GIFSFLRYERSSLVEDLQDSSVSYCSAFA 331
>gi|451997153|gb|EMD89618.1| hypothetical protein COCHEDRAFT_1194970 [Cochliobolus
heterostrophus C5]
Length = 333
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 188/329 (57%), Gaps = 47/329 (14%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-----------------EAGLPKSPEVRGVIAPH 47
R+ASHAGSWYTDN + L+++LDGWL + +P +P R +IAPH
Sbjct: 4 RKASHAGSWYTDNGQLLSQQLDGWLGAVPASTTPIGSASSQKGQVSIP-TPNARAIIAPH 62
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
AGYSYSG AAA+A+ + D + RVFLLGPSHHYY A + Y TP+GDL +D +
Sbjct: 63 AGYSYSGPAAAWAYKSADWANAKRVFLLGPSHHYYLTGAATTGCDKYHTPLGDLIVDTTL 122
Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVN 158
++ ++ E+M DE EHS+EMHLPY+ K+ + + +VPI+VG +
Sbjct: 123 VQTIQEEWDLEIMSKRTDEDEHSLEMHLPYIYKMLSLKNKDFQSDPSSVPLVPIMVGNTD 182
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--------------- 203
A EA YG L A Y+ DP+N F +SSDFCHWGSRF Y +Y +G
Sbjct: 183 AAAEARYGSLLAPYLADPANIFVISSDFCHWGSRFRYTYYQPPNGGAATQLKSASRVPSD 242
Query: 204 -VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST----K 258
IH+SI A+D MD IETG F L + NT+CGRHPI +F+ + + K
Sbjct: 243 YPIHESIAAVDHESMDAIETGSHRKFLDQLRKTGNTVCGRHPIGLFMAAVESAEALEEGK 302
Query: 259 IKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
FLRYE+SS + +DSSVSY SA A
Sbjct: 303 GIFSFLRYERSSLVEDLQDSSVSYCSAFA 331
>gi|32566861|ref|NP_741570.2| Protein TAG-253, isoform a [Caenorhabditis elegans]
gi|351059096|emb|CCD66949.1| Protein TAG-253, isoform a [Caenorhabditis elegans]
Length = 302
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 127/275 (46%), Positives = 177/275 (64%), Gaps = 6/275 (2%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
E R ASHAGSWY N + L +L WL AG P+ R +I+PHAGYSY G AAYAF
Sbjct: 8 EHTRSASHAGSWYNANQRDLDRQLTKWLDNAG-PRIGTARALISPHAGYSYCGETAAYAF 66
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ +++ RVF+LGPSH CA++T + Y+TP+GDL +D ++ EEL+AT F+LMD
Sbjct: 67 KQVVSSAVERVFILGPSHVVALNGCAITTCSKYRTPLGDLIVDHKINEELRATRHFDLMD 126
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
+E+EHS+EM LP++AKV IVP+LVG++ + YG +FA Y++DP N F
Sbjct: 127 RRDEESEHSIEMQLPFIAKVMGSKRYTIVPVLVGSLPGSRQQTYGNIFAHYMEDPRNLFV 186
Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF++ YD+ + I++ I +DK GM IET +P AF YL + NTIC
Sbjct: 187 ISSDFCHWGERFSFSPYDRHSSIPIYEQITNMDKQGMSAIETLNPAAFNDYLKKTQNTIC 246
Query: 241 GRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQ 271
GR+PI + L H + + + +FL Y QS++
Sbjct: 247 GRNPILIMLQAAEHFRISNNHTHEFRFLHYTQSNK 281
>gi|403333881|gb|EJY66071.1| Cell motility mediator [Oxytricha trifallax]
Length = 309
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 137/299 (45%), Positives = 182/299 (60%), Gaps = 16/299 (5%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLP--------KSPEVRGVIAPHAGYSYSGRA 56
R ASHAGSWY + K+L +L +L +A + ++R +IAPHAG YSG+
Sbjct: 7 RLASHAGSWYPSDKKKLDLQLTQYLEKAKIDLGDRIKDNSRGKIRAIIAPHAGIDYSGQV 66
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AAYA+ +DP RV LLGPSHH Y CAL+ + Y+TP+GDL +D E E+L
Sbjct: 67 AAYAYCQLDPQQYKRVILLGPSHHVYLESCALTLCSKYQTPLGDLVIDKEFNEQLLKENS 126
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV-DD 175
F+ M +DE EHS+EMHLPY+ KVF G I++VPILVG + + EA+YG+ AKY+ D
Sbjct: 127 FKQMPKQIDEEEHSLEMHLPYIKKVF-GDQIQLVPILVGNLTKDKEALYGKTLAKYLRDK 185
Query: 176 PSNFFSVSSDFCHWGSRFNYMHY----DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
N F +SSDFCHWG F+YM+ DK I K IE +DK G++ I D + F +Y
Sbjct: 186 DENLFIISSDFCHWGKDFDYMYLVEKVDKDQNTISKQIERIDKDGINHIIGQDVEKFNEY 245
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTK-IKIKFLRYEQSSQ-CKTKRDSSVSYASAAAK 288
L DNTICGRHPI V L L T+ + + +Y QS+ KR++SVSYAS +
Sbjct: 246 LDITDNTICGRHPIGVLLECLKEMDTRTFETELAQYGQSTNIIDDKRETSVSYASIVTR 304
>gi|25146594|ref|NP_741571.1| Protein TAG-253, isoform b [Caenorhabditis elegans]
gi|30316377|sp|Q22915.3|TG253_CAEEL RecName: Full=MEMO1 family protein tag-253
gi|351059097|emb|CCD66950.1| Protein TAG-253, isoform b [Caenorhabditis elegans]
Length = 350
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 127/275 (46%), Positives = 177/275 (64%), Gaps = 6/275 (2%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
E R ASHAGSWY N + L +L WL AG P+ R +I+PHAGYSY G AAYAF
Sbjct: 56 EHTRSASHAGSWYNANQRDLDRQLTKWLDNAG-PRIGTARALISPHAGYSYCGETAAYAF 114
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ +++ RVF+LGPSH CA++T + Y+TP+GDL +D ++ EEL+AT F+LMD
Sbjct: 115 KQVVSSAVERVFILGPSHVVALNGCAITTCSKYRTPLGDLIVDHKINEELRATRHFDLMD 174
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
+E+EHS+EM LP++AKV IVP+LVG++ + YG +FA Y++DP N F
Sbjct: 175 RRDEESEHSIEMQLPFIAKVMGSKRYTIVPVLVGSLPGSRQQTYGNIFAHYMEDPRNLFV 234
Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDFCHWG RF++ YD+ + I++ I +DK GM IET +P AF YL + NTIC
Sbjct: 235 ISSDFCHWGERFSFSPYDRHSSIPIYEQITNMDKQGMSAIETLNPAAFNDYLKKTQNTIC 294
Query: 241 GRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQ 271
GR+PI + L H + + + +FL Y QS++
Sbjct: 295 GRNPILIMLQAAEHFRISNNHTHEFRFLHYTQSNK 329
>gi|417398158|gb|JAA46112.1| Putative dioxygenase [Desmodus rotundus]
Length = 271
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 133/290 (45%), Positives = 171/290 (58%), Gaps = 32/290 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 66
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
DP+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M +
Sbjct: 67 DPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQT 126
Query: 125 DEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F V
Sbjct: 127 DEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVV 186
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSDFCHW E DP +F YL +Y NTICGR
Sbjct: 187 SSDFCHWXXX--------------------------XXEQLDPVSFSNYLKKYHNTICGR 220
Query: 243 HPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
HPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 221 HPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 270
>gi|340503207|gb|EGR29819.1| hypothetical protein IMG5_148030 [Ichthyophthirius multifiliis]
Length = 330
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 125/295 (42%), Positives = 180/295 (61%), Gaps = 7/295 (2%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYA 60
+IR +S+AGSWY++N L +++ WL A + V+ +I PHAG+S+SG AYA
Sbjct: 34 EIRESSYAGSWYSNNEHELNIQINVWLDMAKCEIQNIQAVKAIIVPHAGFSFSGPTQAYA 93
Query: 61 FGNIDP---TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+ + RVFL+GP H+ + +C LS VY+TP+G++ +D E I EL+ F
Sbjct: 94 YKYLKQYCHQKKLRVFLMGPFHYIFIRQCGLSGMKVYETPVGNIEIDQETINELRELAYF 153
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
E D +E EHS+EM L +L K IK++PI+VGA++ E +YG++F+K+VD
Sbjct: 154 ETTDKDAEEEEHSLEMQLCFLIKTLGAENIKLIPIMVGALDQSQETLYGQIFSKFVDQDD 213
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
F VS+DFCHWG ++ Y +++K G I++ IE LD+ GM IIE D F YL E +N
Sbjct: 214 TLFIVSTDFCHWGQKYKYTYFNKDDGEIYEQIEKLDQRGMTIIEKHDNQEFISYLKETEN 273
Query: 238 TICGRHPISVFLHMLGNC--STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
ICGRH +S+ LH N S I+ F+RY SS K K D SVSYASA + ++
Sbjct: 274 NICGRHALSILLHAFNNSKYSKDIQTHFVRYALSSLVKDKSDQSVSYASAVSIIE 328
>gi|358060843|dbj|GAA93461.1| hypothetical protein E5Q_00102 [Mixia osmundae IAM 14324]
Length = 339
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 153/330 (46%), Positives = 196/330 (59%), Gaps = 45/330 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG----------LPKSPEVRGVIAPHAGY 50
M R A+HAGSWY D+SK L E L WL E P P ++ +I+PHAGY
Sbjct: 1 MSGARPATHAGSWYDDDSKTLDELLSDWLGEVDSTPCQEDKDFAPPVPNLKAIISPHAGY 60
Query: 51 SYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
+YSG AAA+A+ I+P + RVF+LGPSHH Y CALS + YKTP+G+LPLDL+ I
Sbjct: 61 AYSGAAAAWAYACIEPFAYKRVFILGPSHHVYLNGCALSQRSSYKTPLGNLPLDLDTIAR 120
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
L+AT +FE M D+ EHS+EM LPY+AKVF+GH + IVPI+VG+++ E+ +G+L A
Sbjct: 121 LRATRQFEDMSPSADDDEHSIEMQLPYIAKVFQGHAVNIVPIMVGSISTSKESAFGKLLA 180
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEALD 213
Y+ D F +SSDFCHWG+RF Y +Y K IHKSIE +D
Sbjct: 181 PYLADEDTLFVISSDFCHWGTRFGYTYYVPKEDSSAPVQLAKTNSAHADFPIHKSIELID 240
Query: 214 KMGMDIIE-TGDP-----DA---FKKYLLEYDNTICGRHPISVFLHML---------GNC 255
+ GM +E T D DA F YL NTICGRHPI V L + G
Sbjct: 241 REGMRHVELTKDSGKSAQDAHRDFASYLTRTKNTICGRHPIGVLLATIASLQDSKTAGFA 300
Query: 256 STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+++KF+RYE+SS K DSSVSYASA
Sbjct: 301 QDDVRLKFVRYERSSLVKKLSDSSVSYASA 330
>gi|145497196|ref|XP_001434587.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401714|emb|CAK67190.1| unnamed protein product [Paramecium tetraurelia]
Length = 345
Score = 256 bits (654), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 118/289 (40%), Positives = 184/289 (63%), Gaps = 7/289 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+R A HAGSWY+ S L +++ WL +A + +++ ++ PHAGY+YSG AA+++
Sbjct: 51 VREAVHAGSWYSSKSNELKIQINCWLEQAKAEVTTVAQLKALVVPHAGYAYSGPTAAFSY 110
Query: 62 GNID---PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ P+ +VF+LGP H+ Y +C L+ +Y+TP+G++ +DLE +++L G FE
Sbjct: 111 KYLKKYPPSEKLKVFILGPCHYVYITQCCLTRQEIYETPLGNIKVDLETVKQLHEQGLFE 170
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D +E EHS+EM LP+LA + I+PI+VG+++A++E YGRL ++Y D
Sbjct: 171 QSDKDAEEEEHSIEMQLPFLAHILGTDNFTIIPIMVGSIDAKSEEYYGRLLSEYFDMDDT 230
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +S+DFCHWG++F Y +Y+ G I +SIE LD+ M+ IE D D F YL EY+N
Sbjct: 231 LFIISTDFCHWGTKFAYTYYNSADGEIFESIEKLDQKAMEHIELHDLDKFNDYLREYENN 290
Query: 239 ICGRHPISVFLHMLGNCSTK--IKIKFLRYEQSSQCKTKRDSSVSYASA 285
+CG+H I++ LH + ++ KF+RY QS + K+DSSVSYA+A
Sbjct: 291 VCGKHCIAILLHCIAMSQNTHMMETKFIRYAQSCLVRDKKDSSVSYAAA 339
>gi|145542007|ref|XP_001456691.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424504|emb|CAK89294.1| unnamed protein product [Paramecium tetraurelia]
Length = 344
Score = 256 bits (654), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 118/289 (40%), Positives = 184/289 (63%), Gaps = 7/289 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+R A HAGSWY+ S L +++ WL +A + +++ ++ PHAGY+YSG AA+++
Sbjct: 50 VREAVHAGSWYSSKSNELKIQINCWLEQAKAEVTTVAQLKALVVPHAGYAYSGPTAAFSY 109
Query: 62 GNID---PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ P+ +VF+LGP H+ Y +C L+ +Y+TP+G++ +DLE +++L G FE
Sbjct: 110 KYLKKYPPSEKLKVFILGPCHYVYITQCCLTRQEIYETPLGNIKVDLETVKQLHEQGLFE 169
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D +E EHS+EM LP+LA + I+PI+VG+++A++E YGRL ++Y D
Sbjct: 170 QSDKDAEEEEHSIEMQLPFLAHILGTDNFTIIPIMVGSIDAKSEEYYGRLLSEYFDMDDT 229
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +S+DFCHWG++F Y +Y+ G I +SIE LD+ M+ IE D D F YL EY+N
Sbjct: 230 LFIISTDFCHWGTKFAYTYYNSADGEIFESIEKLDQKAMEHIELHDLDKFNDYLREYENN 289
Query: 239 ICGRHPISVFLHMLGNCSTK--IKIKFLRYEQSSQCKTKRDSSVSYASA 285
+CG+H I++ LH + ++ KF+RY QS + K+DSSVSYA+A
Sbjct: 290 VCGKHCIAILLHCIAMSQNTHMMETKFIRYAQSCLVRDKKDSSVSYAAA 338
>gi|452988647|gb|EME88402.1| hypothetical protein MYCFIDRAFT_86111 [Pseudocercospora fijiensis
CIRAD86]
Length = 336
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/330 (43%), Positives = 182/330 (55%), Gaps = 44/330 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----------------PEVRGVIAPH 47
IR ASHAGSWY+D+ +L +LDGWL P S P R +IAPH
Sbjct: 7 IRDASHAGSWYSDSKSQLNSQLDGWLAAVDTPVSCIGPQSSQEVFPDVPVPSARVIIAPH 66
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
AGYSYSG AAA+A+ + D + RVFLLGPSHHYY K ALS T Y TPIG+L +D +
Sbjct: 67 AGYSYSGPAAAWAYKSWDVSKAERVFLLGPSHHYYLSKAALSKCTQYATPIGNLTVDRQT 126
Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-------LIKIVPILVGAVNAE 160
+L + +FE M VDE EHS+EMHLPY+ K+ H L +VPI+VG +A
Sbjct: 127 TAKLYTSAQFEWMTQSVDEQEHSLEMHLPYIYKMLAKHFGGNPAGLPPLVPIMVGNTSAS 186
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------I 205
E G++ A Y+ DP+N F VSSDF HWG RF Y +Y G I
Sbjct: 187 TERALGQILAPYLADPANVFVVSSDFAHWGLRFRYTYYRPSTGSALHLSSSAKSPQSPPI 246
Query: 206 HKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIK----- 260
H+SI+ +D M E+G + L E NT+CGRHPI V + +
Sbjct: 247 HESIKTVDFECMGACESGSHSKWLSTLEETGNTVCGRHPIGVIMAAIEQVRKDSGEQHAG 306
Query: 261 -IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
KF+RYE+SS+ K DSSVSYASA A +
Sbjct: 307 LFKFVRYERSSEVKRIGDSSVSYASAFAVI 336
>gi|154289990|ref|XP_001545597.1| hypothetical protein BC1G_15807 [Botryotinia fuckeliana B05.10]
Length = 323
Score = 255 bits (652), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 181/323 (56%), Gaps = 46/323 (14%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
M R ASHAGSWYT N +L+ ELD WL + +P + P R +IAPHAGYSY
Sbjct: 1 MGSTREASHAGSWYTSNPSKLSSELDNWLSQ--VPSTISDTKLPIPGARVIIAPHAGYSY 58
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
SG AAA+A+ +D ++ RVFLLGPSH +Y +CALS + Y TP+GDL +D +++EL
Sbjct: 59 SGPAAAWAYATLDLSTAKRVFLLGPSHAWYLTECALSKHSKYATPLGDLTIDTALVQELS 118
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYG 166
ATG+F+ M DE EHS+E+HLPY+ K+ ++PILVG NA E +G
Sbjct: 119 ATGEFKRMSTDQDETEHSLELHLPYIYKLLSLNFPSPSSFPPLIPILVGNTNAATERKFG 178
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------DKKHGV------------ 204
+ A Y+ DPSN F +SSDFCHWGSRF Y +Y DK G
Sbjct: 179 SILAPYLSDPSNIFIISSDFCHWGSRFQYTYYLPASPSAEVRDKSGGYSLSRRDKAPTQP 238
Query: 205 -IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-------S 256
IH+SI LD++ M I G F L E NT+CGRHPI V + +
Sbjct: 239 PIHESIGRLDQLSMSAISGGVHQEFLDNLEETGNTVCGRHPIGVIMAAIEKVKEEKSLDG 298
Query: 257 TKIKIKFLRYEQSSQCKTKRDSS 279
K K KF+RYE+S++ D S
Sbjct: 299 EKGKFKFVRYERSNEVTRVDDGS 321
>gi|388582149|gb|EIM22455.1| UPF0103-domain-containing protein [Wallemia sebi CBS 633.66]
Length = 318
Score = 254 bits (650), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 182/314 (57%), Gaps = 29/314 (9%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAA 57
M+ IR +HAG+WYTDNS +L ++L W +A G+ + + +IAPHAG S+SGR A
Sbjct: 1 MQNIREPTHAGAWYTDNSIKLNKQLQHWFDDAISDGIHNNSRSKVIIAPHAGLSFSGRTA 60
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
AYA+ + + R+ +LGPSHH Y CALS Y TP+G L +D E+ +EL +T F
Sbjct: 61 AYAYSAANISQYKRIIILGPSHHVYLKSCALSPFDAYDTPLGTLSIDKEINDELSSTKAF 120
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
+ M DE EHS EMH P++ K+ H IKIVPILVG ++ E E YG+ FAKY++DP
Sbjct: 121 KRMSKVTDEDEHSFEMHTPFIKKLANEHDIKIVPILVGNISHEREVEYGKYFAKYINDPE 180
Query: 178 NFFSVSSDFCHWGSRFNYMHY-----------------DKKHGVIHKSIEALDKMGMDII 220
F++SSDF HWGSRF Y +Y D+ IH+SI LD DI+
Sbjct: 181 TLFAISSDFSHWGSRFRYTYYRPSRNEPGGRLGKFDDVDETGDSIHESISGLDHEAWDIL 240
Query: 221 ETG--DP----DAFKKYLLEYDNTICGRHPISVFLHMLGNCS---TKIKIKFLRYEQSSQ 271
DP ++F KYL E NTICGRH + L L + + T +K+ Y QSS+
Sbjct: 241 SEALKDPKQTHESFTKYLSETGNTICGRHAFGILLGALHHANRDYTSGSVKWTHYTQSSK 300
Query: 272 CKTKRDSSVSYASA 285
DSSVSY++
Sbjct: 301 ALLPTDSSVSYSAG 314
>gi|398398762|ref|XP_003852838.1| hypothetical protein MYCGRDRAFT_70695 [Zymoseptoria tritici IPO323]
gi|339472720|gb|EGP87814.1| hypothetical protein MYCGRDRAFT_70695 [Zymoseptoria tritici IPO323]
Length = 338
Score = 254 bits (650), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 183/334 (54%), Gaps = 46/334 (13%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----------------PEVRGVIA 45
+ IR SHAGSWY+D+S +L ELDGWL P + P R +IA
Sbjct: 5 QTIRDCSHAGSWYSDSSSKLNAELDGWLAAVDAPVTCIGPRSEGEQVQQLPVPGARMIIA 64
Query: 46 PHAGYSYSGRAAAYAFGNIDPTSISR--VFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
PHAGYSYSG AAA+A+ D + +R VFLLGPSHH+Y K ALS T Y TPIG+L +
Sbjct: 65 PHAGYSYSGPAAAWAYKAWDVSKANRKKVFLLGPSHHHYLTKAALSRCTQYATPIGNLTV 124
Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF-------EGHLIKIVPILVGA 156
D E EL ATG FE M VDE EHS+EMHLPY+ K+ H +VP+++G
Sbjct: 125 DRETTAELHATGVFEWMSHSVDEQEHSLEMHLPYIYKMLSRTFGEDSAHFPPLVPLMIGN 184
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------ 204
+ E GR A Y+ DPSN F +SSDF HWG RF Y +Y G
Sbjct: 185 TSPSTEKALGRRLAPYLADPSNAFVISSDFAHWGLRFRYTYYRPSTGTAVDLTSSSRSPK 244
Query: 205 ---IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-----GNCS 256
IH SI+ +D M E+G D + L + NT+CGRHPI V + + G S
Sbjct: 245 EPAIHDSIKTVDFESMGACESGSHDEWLGQLEDTGNTVCGRHPIGVMMAAVEEVRKGAAS 304
Query: 257 TKI-KIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
KF+RYE+SS+ K DSSVSYASA A V
Sbjct: 305 QGTGAFKFVRYERSSEVKRVSDSSVSYASAFACV 338
>gi|56755259|gb|AAW25809.1| SJCHGC02434 protein [Schistosoma japonicum]
Length = 304
Score = 253 bits (645), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 121/295 (41%), Positives = 186/295 (63%), Gaps = 9/295 (3%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+ +R +SH GSWY+ + +L+ +L WL E + VR +I PHAGY +SG AA
Sbjct: 5 FKSVRVSSHCGSWYSADRTQLSSQLSTWLESCENSVLSGYSVRAIIVPHAGYRHSGFCAA 64
Query: 59 YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+A+ I+P I R+F+LGPSH CAL+ + Y+TP +L +D ++ +LK F
Sbjct: 65 HAYRQINPDKIERIFILGPSHRLDIGDTCALTCVSEYETPFCNLKIDTDIYSDLKKLSYF 124
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
+++ DEAEHS+EM LP++A + +G IVP++VG ++ E + ++G+L + Y+ D
Sbjct: 125 KVLTKNQDEAEHSVEMQLPFIAYIMKGKKDQYSIVPVVVGCLSTERQELFGKLLSNYLLD 184
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
N F +SSDFCHWG RF Y +YDK G I +SIE LD +G+ I++ P++F +YL ++
Sbjct: 185 EKNLFVISSDFCHWGKRFRYQYYDKSDGAIWQSIEKLDHLGLGAIQSLKPESFLQYLKKF 244
Query: 236 DNTICGRHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
NTICGR I + L M+ + K +++K L Y QS++C++ DSSVSYA+ A
Sbjct: 245 SNTICGRRSIGLLLFMIDSIRQKQLFNLELKVLYYTQSNRCQSMEDSSVSYAACA 299
>gi|353229019|emb|CCD75190.1| hypothetical protein Smp_097240.2 [Schistosoma mansoni]
Length = 304
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 127/292 (43%), Positives = 180/292 (61%), Gaps = 9/292 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
IR +SHAGSWY+ + L+ +LD WL E + VR +I PHAGY +SG AA+A+
Sbjct: 8 IRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAAHAY 67
Query: 62 GNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
I+P I +F++GPSH CAL+ + Y+TP +L ++ V LK+ F ++
Sbjct: 68 RLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCFSVL 127
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ DEAEHS+EM LPY+A + + + IVPI+VG+++ E + +G L + Y+ D N
Sbjct: 128 EKSQDEAEHSVEMQLPYIAHIMKRRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLDEKN 187
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDFCHWG RF Y +Y+K G I +SIE LD +G++ I + P +F KYL EY NT
Sbjct: 188 LFVISSDFCHWGKRFRYQYYNKSDGAIWQSIEKLDHLGLNAIRSLKPKSFIKYLEEYSNT 247
Query: 239 ICGRHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
ICGR I V L M+ K +++K L Y QSS C++ DSSVSYA+ A
Sbjct: 248 ICGRRSIGVLLFMIDCIRQKQPSNLELKILNYTQSSHCQSMEDSSVSYAACA 299
>gi|353229018|emb|CCD75189.1| hypothetical protein Smp_097240.1 [Schistosoma mansoni]
Length = 303
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 127/291 (43%), Positives = 179/291 (61%), Gaps = 8/291 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
IR +SHAGSWY+ + L+ +LD WL E + VR +I PHAGY +SG AA+A+
Sbjct: 8 IRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAAHAY 67
Query: 62 GNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
I+P I +F++GPSH CAL+ + Y+TP +L ++ V LK+ F ++
Sbjct: 68 RLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCFSVL 127
Query: 121 DICVDEAEHSMEMHLPYLAKVFE-GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ DEAEHS+EM LPY+A + + IVPI+VG+++ E + +G L + Y+ D N
Sbjct: 128 EKSQDEAEHSVEMQLPYIAHIMKRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLDEKNL 187
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
F +SSDFCHWG RF Y +Y+K G I +SIE LD +G++ I + P +F KYL EY NTI
Sbjct: 188 FVISSDFCHWGKRFRYQYYNKSDGAIWQSIEKLDHLGLNAIRSLKPKSFIKYLEEYSNTI 247
Query: 240 CGRHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR I V L M+ K +++K L Y QSS C++ DSSVSYA+ A
Sbjct: 248 CGRRSIGVLLFMIDCIRQKQPSNLELKILNYTQSSHCQSMEDSSVSYAACA 298
>gi|452821363|gb|EME28394.1| hypothetical protein Gasu_40900 [Galdieria sulphuraria]
Length = 312
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 127/291 (43%), Positives = 180/291 (61%), Gaps = 14/291 (4%)
Query: 9 HAGSWYTDNSK---RLAEELDGWLREA-----GLPKSPE--VRGVIAPHAGYSYSGRAAA 58
+ G+WY ++ K L +LD WL++A L + ++ VI PHAGYSYSG AA
Sbjct: 16 YEGTWYGEDEKDSIALKHQLDLWLQKAKENVGNLEQYSNGLLKAVIVPHAGYSYSGATAA 75
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+++ IDP+ RVF+LGPSH+Y C ++ +TP G L +D++ E L AT KF
Sbjct: 76 HSYARIDPSLYDRVFILGPSHYYRMSDCRVTPFERLETPFGPLVIDVQCCESLVATKKFT 135
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+D VD EHS+E+ P+L VF +++V I+VG ++ + YG+L A Y+ DP
Sbjct: 136 FLDKGVDTQEHSLELQFPFLKAVFGSSNVQLVNIMVGVLDKAAKEEYGKLLAPYMKDPRT 195
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDFCHWG RF Y ++D++ G I KSIE LD++GM IE F YL +Y NT
Sbjct: 196 LFVISSDFCHWGKRFGYQYHDEEDGEIFKSIEKLDRLGMTAIEHQSAKEFSSYLDKYQNT 255
Query: 239 ICGRHPISVFLHMLGNC----STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
ICGR+PI V LHM+ S +++ F+RY+QS +CKT +D SVSYA+A
Sbjct: 256 ICGRNPIGVLLHMIEYLKRTESMQLETLFVRYDQSMKCKTMQDLSVSYAAA 306
>gi|145514095|ref|XP_001442958.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410319|emb|CAK75561.1| unnamed protein product [Paramecium tetraurelia]
Length = 294
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 123/285 (43%), Positives = 178/285 (62%), Gaps = 4/285 (1%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+KIR A+HAGSWY + K+L +L+ +L +A P ++ +I PHAG+SYSG AA+A+
Sbjct: 6 QKIREATHAGSWYIGDRKQLDAQLNDFLSKAKAETIPNIKAIIGPHAGFSYSGPTAAFAY 65
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK--FEL 119
++ +VFLLGP HH Y LS +Y+TP+G++ LD I++L A K F
Sbjct: 66 QHLVQKEGMKVFLLGPCHHTYIKGIGLSELEIYETPLGNIELDQPTIKQLSAELKKNFIF 125
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ V+E EHS+EMHLP++ KVF K++PI+VGA + +A + KY DP+
Sbjct: 126 TNKEVEEEEHSLEMHLPFIYKVFPK--CKLIPIMVGATTEQQDAQVASVLVKYFVDPNTV 183
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
F +SSDFCHWG F Y Y+K+HG IH+SI LD + +IE+ + F KYL + NTI
Sbjct: 184 FVISSDFCHWGKSFRYTPYNKEHGEIHQSITQLDGQAIKLIESHNIPEFYKYLEDTKNTI 243
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CGRHPI V L+++ +++ + +Y QS+Q DSSVSYA+
Sbjct: 244 CGRHPICVLLNIINLSKLQLRTQLTKYNQSNQVTKPNDSSVSYAA 288
>gi|387219389|gb|AFJ69403.1| protein memo1 [Nannochloropsis gaditana CCMP526]
Length = 254
Score = 246 bits (629), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 123/233 (52%), Positives = 159/233 (68%), Gaps = 8/233 (3%)
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++DP+ I RVF+LGPSHH Y CA+S A+V +TP+G L +D V EL A F+ M
Sbjct: 2 HLDPSVIQRVFILGPSHHVYLRACAVSGASVCETPVGPLSVDEAVRAELLADPLFQTMAQ 61
Query: 123 CVDEAEHSMEMHLPYLAK-VFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
VDE EHS+EMHLPY+A V E + IVPILVG+++ EA+YGR+ AKY+ DP+N F
Sbjct: 62 RVDEDEHSIEMHLPYVAHLVGEVQGVSIVPILVGSLDPTQEAVYGRILAKYLSDPANLFI 121
Query: 182 VSSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
VS+DFCHWG RF Y +D+ +G IH+ IE LD+ GM +IE GDP AF YL +Y NTIC
Sbjct: 122 VSTDFCHWGRRFQYQPFDRAYGEDIHEYIEWLDRQGMALIEQGDPQAFTTYLRQYRNTIC 181
Query: 241 GRHPISVFLHML------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
GRHP+ VFL G ++++F+RY QSS+C + DSSVSYASA A
Sbjct: 182 GRHPLGVFLQAAQALREGGKEGLDVRVEFVRYAQSSRCMHETDSSVSYASAVA 234
>gi|171680871|ref|XP_001905380.1| hypothetical protein [Podospora anserina S mat+]
gi|27764330|emb|CAD60610.1| unnamed protein product [Podospora anserina]
gi|170940063|emb|CAP65290.1| unnamed protein product [Podospora anserina S mat+]
Length = 335
Score = 246 bits (629), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 143/333 (42%), Positives = 186/333 (55%), Gaps = 49/333 (14%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R ASHAGSWY D+ K L+ +LDG+L GLP P R +IAPHAGYSYSG A
Sbjct: 4 RDASHAGSWYDDDEKELSSQLDGFLSRVPDQLDDHGLP-VPGARVIIAPHAGYSYSGPCA 62
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +D + RVF+LGPSH YY CAL+T + Y TP GDL +D I EL+ TGKF
Sbjct: 63 AWAYKALDLRAAKRVFILGPSHTYYLRGCALTTFSKYATPFGDLVVDRNTINELRETGKF 122
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYGRLFA 170
+ D EHS+EMH+P+L K + IVPILVG +AE E +G+L +
Sbjct: 123 TDIPARRDVDEHSLEMHVPFLWKRLQQTFGDDSTKYPPIVPILVGDGSAEEEKAFGKLLS 182
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNY-----------MHYDKKHGV--------------- 204
Y+ DP+ + VSSDFCHWGSRF+Y M + GV
Sbjct: 183 SYLKDPTTAWIVSSDFCHWGSRFSYRPHFSDGAIRDMDAPRSKGVRHEVLNVTPPDWSKL 242
Query: 205 --------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS 256
IH I+ LD++ MD +E+G+ F K + + NT+CGRHPI V + L
Sbjct: 243 GISSGEPEIHDVIKVLDQLAMDAVESGEHSEFYKVVQDSHNTVCGRHPIGVIMAALEAVD 302
Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
K K KF++Y++S+ K D SVSYASA A V
Sbjct: 303 GKGKFKFVQYQRSNLVKKSFDFSVSYASAYAVV 335
>gi|296415955|ref|XP_002837648.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633527|emb|CAZ81839.1| unnamed protein product [Tuber melanosporum]
Length = 329
Score = 246 bits (628), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 185/321 (57%), Gaps = 40/321 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-----EAGLPKSP--EVRGVIAPHAGYSYSGRAA 57
R A+HA SWY + ++L +L+ W++ + G+P+ P R ++APHAGYSYSG A
Sbjct: 4 RIATHANSWYVADKRKLDNQLNAWIKIAGSEDCGIPEFPVSNARIIMAPHAGYSYSGATA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+A+ D + +R+F+LGP+HH Y C++S Y TP+G+L LD E++ EL +TGKF
Sbjct: 64 GFAYKAWDLSQATRIFILGPAHHVYINGCSISGCDSYDTPLGELTLDSELLRELASTGKF 123
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE----GHL-IKIVPILVGAVNAENEAMYGRLFAKY 172
+ + VDE EHS+EMHLPY+ K E H +KIVPI+VG+++ E YG L Y
Sbjct: 124 TWLSLKVDEDEHSIEMHLPYIYKAIERAGKSHKDVKIVPIVVGSISTAQEKAYGTLLKPY 183
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYD----------------KKHGV------IHKSIE 210
++DP N F +S+DFCHWG+RF+Y Y K+HGV +HKSI
Sbjct: 184 LEDPQNAFVISTDFCHWGARFSYTGYFRELPTHDNPNPEFIHLKEHGVQAIRHPLHKSIR 243
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH----MLG--NCSTKIKIKFL 264
ALD GM I TG F YL + NTICGRHPI V + M G S K ++L
Sbjct: 244 ALDHEGMVAIATGKHSEFSSYLSKTKNTICGRHPIGVIMAAAELMFGENKGSNLGKFRYL 303
Query: 265 RYEQSSQCKTKRDSSVSYASA 285
Y QS + SSVSY S
Sbjct: 304 YYAQSEALEDVNGSSVSYVSG 324
>gi|395731964|ref|XP_002812197.2| PREDICTED: protein MEMO1-like, partial [Pongo abelii]
Length = 240
Score = 246 bits (628), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 122/234 (52%), Positives = 155/234 (66%), Gaps = 7/234 (2%)
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
F + P+ R+F+LGPSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M
Sbjct: 8 FCCVSPSR--RIFILGPSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERM 65
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+ DPSN
Sbjct: 66 SLQTDEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSN 125
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F VSSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NT
Sbjct: 126 LFVVSSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNT 185
Query: 239 ICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
ICGRHPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 186 ICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 239
>gi|389644230|ref|XP_003719747.1| hypothetical protein MGG_04091 [Magnaporthe oryzae 70-15]
gi|351639516|gb|EHA47380.1| hypothetical protein MGG_04091 [Magnaporthe oryzae 70-15]
gi|440466506|gb|ELQ35770.1| hypothetical protein OOU_Y34scaffold00689g3 [Magnaporthe oryzae
Y34]
gi|440477091|gb|ELQ58235.1| hypothetical protein OOW_P131scaffold01683g71 [Magnaporthe oryzae
P131]
Length = 338
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 188/340 (55%), Gaps = 53/340 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWL-------REAGLP-KSPEVRGVIAPHAGYSY 52
M IRRASHAGSWYT +S +L ELD WL E+ +P K V +IAPHAGYSY
Sbjct: 1 MSGIRRASHAGSWYTHSSSKLNTELDEWLSEVPDKINESDVPIKGASV--IIAPHAGYSY 58
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
SG AA+A+ D ++ R+F+LGPSH YY CAL+T Y+TP+G+L +DL+ I++L+
Sbjct: 59 SGPCAAWAYKVFDVSAAKRIFVLGPSHTYYLSGCALTTYATYETPLGNLRVDLDTIKQLR 118
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK------------IVPILVGAVNAE 160
TGKF+ + DE EHS+EMHLPYLAK +VPIL+G +
Sbjct: 119 DTGKFKDIPRDNDEDEHSLEMHLPYLAKRLTQTFGGGSDGDGDASWPPVVPILIGDNKRD 178
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------- 204
E +G L ++ DP N F VSSDFCHWG+RF+Y Y V
Sbjct: 179 AEKAFGELLLPHLRDPDNAFIVSSDFCHWGNRFSYTKYTADGTVEGVRSLGRADRNLPVP 238
Query: 205 IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-----------G 253
IH+ I LD + MD IETG DAF L NT+CGRHPI V + L G
Sbjct: 239 IHEGIRVLDHLAMDAIETGSHDAFYDNLKATGNTVCGRHPIGVVMAALEMLKKERAGGDG 298
Query: 254 NCSTKIK----IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
S K F++Y++SS + DSSVSYA+A A +
Sbjct: 299 AASGSKKGDGVFTFVKYDRSSLVENVSDSSVSYAAAYAVI 338
>gi|156035813|ref|XP_001586018.1| hypothetical protein SS1G_13111 [Sclerotinia sclerotiorum 1980]
gi|154698515|gb|EDN98253.1| hypothetical protein SS1G_13111 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 335
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/332 (41%), Positives = 181/332 (54%), Gaps = 46/332 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
M IR ASHAGSWYT N L+ +LD WL + +P S P R +IAPHAGYSY
Sbjct: 1 MGSIREASHAGSWYTSNPSTLSTQLDEWLSQ--VPSSIDDTKIPIPGARVIIAPHAGYSY 58
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
SG AAA+A+ ++D +S R+FLLGPSH +Y +CALS + Y TP+GDL +D + EL
Sbjct: 59 SGPAAAWAYKSLDLSSAKRIFLLGPSHAWYLSECALSKHSTYATPLGDLRIDRATVSELA 118
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYG 166
TG+F+ M+ DE EHS+EMHLPY+ K+ ++PILVG NA E +G
Sbjct: 119 QTGEFKTMNSDQDETEHSLEMHLPYIYKMLSLNFDSPSDFPPLIPILVGNTNASTEKKFG 178
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------DKKHGV------------ 204
+ A Y+ D ++ F VSSDFCHWG RF Y +Y D G
Sbjct: 179 AILAPYLADETSVFIVSSDFCHWGPRFQYTYYLPSDSSTDANDTSTGYSLSRWDKSPTQP 238
Query: 205 -IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-------S 256
IH+SI LDK+ M+ I G F L E NT+CGRHPI + + + S
Sbjct: 239 PIHESIGKLDKLSMEAISGGKHQEFLNNLEETGNTVCGRHPIGITMAAIEKLKEDKLLQS 298
Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
K K KF+RYE+S++ D S S +
Sbjct: 299 DKGKFKFVRYERSNEVVKVDDGSFELYSKTTR 330
>gi|134076429|emb|CAK39657.1| unnamed protein product [Aspergillus niger]
Length = 315
Score = 243 bits (619), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 176/311 (56%), Gaps = 33/311 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R SHAGSWY+D+ L+ +LD WL+E G P R +IAPHAGY+YSGR A
Sbjct: 4 REDSHAGSWYSDDKSTLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
A+A+ +D + R+F++GPSHH+Y AL T Y TP+ D LPLD E I +L++T
Sbjct: 64 AFAYKALDLSQAKRIFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTR 123
Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
+F M VDEAEHS+E+HLPY+ ++ + +VPILVGAV
Sbjct: 124 AGSRNGLELQFTTMSRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSEYPPLVPILVGAVT 183
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
E +G L A Y+DDP N F +SSDFCHWG RF Y + IH+SI A+D M
Sbjct: 184 ESTEKAFGALLAPYIDDPENAFVISSDFCHWGQRFRYT---SREPPIHESISAVDLATMA 240
Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKT 274
I TG+ F L NT+CGRHPI V + + N K + F+RY++SS
Sbjct: 241 AITTGEYARFSTILKNTGNTVCGRHPIGVIMAGIEEIRKNEGEKGRFHFIRYDRSSNVIE 300
Query: 275 KRDSSVSYASA 285
DSSVSY SA
Sbjct: 301 VDDSSVSYVSA 311
>gi|426202099|gb|EKV52022.1| hypothetical protein AGABI2DRAFT_239501 [Agaricus bisporus var.
bisporus H97]
Length = 333
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 176/321 (54%), Gaps = 35/321 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGYSYSGRAA 57
R A+HAGSWYT N+ +L +L GWL + E + +IAPHAGYS+ G
Sbjct: 13 FRGAAHAGSWYTSNAPQLTADLSGWLSLVRPRREDEEFPVQGCKAIIAPHAGYSHCGETG 72
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +IDP++ RVF++GPSH + CALS Y TPIG LP+D +V+ L TG F
Sbjct: 73 AWAYKSIDPSTTRRVFVIGPSHFWRIDNCALSRCHTYDTPIGTLPVDTDVVVSLYETGLF 132
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
M++ DE EHS+EM LPYL +V EG IKIVPI+VG + E E YGR+ A Y+ +
Sbjct: 133 RWMNLTRDEREHSIEMQLPYLRRVCEGKDIKIVPIMVGDITKEQELEYGRILAPYLAEEG 192
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGV-------------------IHKSIEALDKMGMD 218
F SSDFCHWG + Y Y K I +S LD +GM
Sbjct: 193 TVFIASSDFCHWGPEYYYTFYFPKPNCLPSEGYQATRATPPEKGYKIWQSTTQLDHIGMG 252
Query: 219 IIETGDPDA------FKKYLLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQ 268
I+ D A F +YL+E DN+ICGRH I + H+ K +++Y+Q
Sbjct: 253 ILADQDRSALTAHRDFHRYLVETDNSICGRHAIGILYGALAHLEETTGRKKFCNWVKYDQ 312
Query: 269 SSQCKTKRDSSVSYASAAAKV 289
S+Q D+SVSYASA K+
Sbjct: 313 SAQVTKITDNSVSYASAWIKI 333
>gi|19115252|ref|NP_594340.1| UPF0103 family [Schizosaccharomyces pombe 972h-]
gi|1723280|sp|Q10212.1|YAY4_SCHPO RecName: Full=MEMO1 family protein C4H3.04c
gi|1184017|emb|CAA93343.1| UPF0103 family [Schizosaccharomyces pombe]
Length = 309
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 188/311 (60%), Gaps = 29/311 (9%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAAY 59
+ IR A+HAGSWY D+++ L ++L +++ +PE R VI+PHAGY YSG+ A+
Sbjct: 3 QAIREATHAGSWYLDDTELLTKQLKSFIKNP----TPETGKRFVISPHAGYMYSGKVASQ 58
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT-GKFE 118
F +D + I RVF+ GPSHH +T KC +S A++ TP+GDL +D ++ ++L A+ F+
Sbjct: 59 GFQQLDFSKIQRVFVFGPSHHIFTRKCLVSRASICSTPLGDLKVDEDLCQKLVASDNSFD 118
Query: 119 LMDICVDEAEHSMEMHLPYLA--KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVD 174
M + VDE+EHS+EM P LA + +G L +KIVPI++GA+ + + ++Y+
Sbjct: 119 SMTLDVDESEHSLEMQFPLLAFHLLKQGCLGKVKIVPIMIGALTSTTMMAAAKFLSQYIK 178
Query: 175 DPSNFFSVSSDFCHWGSRFNY--------------MHYDKKHG----VIHKSIEALDKMG 216
D SN F +SSDFCHWG RF Y + Y ++ G I++SI LD +G
Sbjct: 179 DESNSFVISSDFCHWGRRFGYTLYLNDTNQLEDAVLKYKRRGGPTSPKIYESISNLDHIG 238
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
M IIET D F +YL NTICGR+PI + + + + + KF+ Y QSS +
Sbjct: 239 MKIIETKSSDDFSEYLKTTQNTICGRYPIELIMKSMECANFSERFKFISYAQSSHVELVT 298
Query: 277 DSSVSYASAAA 287
DSSVSYA+A A
Sbjct: 299 DSSVSYATATA 309
>gi|367032250|ref|XP_003665408.1| hypothetical protein MYCTH_2309073 [Myceliophthora thermophila ATCC
42464]
gi|347012679|gb|AEO60163.1| hypothetical protein MYCTH_2309073 [Myceliophthora thermophila ATCC
42464]
Length = 335
Score = 240 bits (612), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 140/333 (42%), Positives = 183/333 (54%), Gaps = 49/333 (14%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLRE-------AGLPKSPEVRGVIAPHAGYSYSGRAA 57
R ASHAGSWY D+ + L+ +LD +L LP P R +IAPHAGYSYSG A
Sbjct: 4 REASHAGSWYEDDPEELSSQLDDFLHRVPAELDFTPLP-IPGARVIIAPHAGYSYSGPCA 62
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ ++D S RVF+LGPSH YY CAL+T Y+TP GDL +D EL+ TG+F
Sbjct: 63 AWAYKSLDLRSAKRVFILGPSHTYYLRGCALTTFDKYETPFGDLVVDKPTTSELRKTGRF 122
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYGRLFA 170
M + EHS+EMH+PYL K E IVPILVG + + E +G L +
Sbjct: 123 SDMPARREVEEHSLEMHIPYLWKRLEQTFGTDVSKYPSIVPILVGNASEQEEKSWGELLS 182
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNY---------------------------MHYDKKHG 203
Y+ DP + VSSDFCHWGSRF+Y D +
Sbjct: 183 PYLKDPETAWIVSSDFCHWGSRFSYRPEFHKGVVRDLDNKDDDLKVSPDWAQAAADPERP 242
Query: 204 VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-------GNCS 256
IH+ I+ LD+M MD +E+G F K + E NT+CGRHPI V + L G +
Sbjct: 243 EIHEVIKVLDQMAMDAVESGAHSEFYKVIRETRNTVCGRHPIGVTMAALETIAKEGGTNA 302
Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
K K +F++Y++S+ K +RD SVSYASA A V
Sbjct: 303 GKGKFRFVQYQRSNLVKKERDFSVSYASAYAVV 335
>gi|223999071|ref|XP_002289208.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974416|gb|EED92745.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 311
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 126/307 (41%), Positives = 185/307 (60%), Gaps = 24/307 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF-- 61
+RRA HAGSWY++ L E L +L +A + S + I+PHAG+ YSG AAY++
Sbjct: 5 VRRAHHAGSWYSEEPNELDEALANYLAKAMVASS---KTSISPHAGFQYSGPTAAYSYLA 61
Query: 62 ---GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
I S+ V +L PSHH Y CA+S A+ +TP+GDL +D + E++ +TGKF+
Sbjct: 62 LKEALISNPSLRTVVVLHPSHHIYLDGCAVSGASTLETPLGDLVVDNNLREQMLSTGKFD 121
Query: 119 LMDICVDEAEHSMEMHLPYLAKVF----------EGHLIKIVPILVGAVNAENEAMYGRL 168
+M+ VDE EHS EM P++AKV + IK++PI++G++ + E ++G+L
Sbjct: 122 VMEKRVDEQEHSGEMQYPFIAKVINDVKKHTPEHDNLSIKVLPIMIGSIKQKKEELFGKL 181
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH--GVIHKSIEALDKMGMDIIETGDPD 226
A ++ D F +SSDFCHWG RF+Y + +H+ IE LD+ GM +IE P
Sbjct: 182 LAPFLSDDGVFTVISSDFCHWGKRFSYTPQPSREVANEVHEYIEYLDRKGMSLIEMQRPG 241
Query: 227 AFKKYLLEYDNTICGRHPISVFLHML----GNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
AF YL EY NTICGRHPI+V+L+ + ++ ++F +Y QS + + RD SVSY
Sbjct: 242 AFADYLREYSNTICGRHPIAVWLNTVVTNREEGKERLDVRFAKYAQSGKATSLRDFSVSY 301
Query: 283 ASAAAKV 289
ASA A+
Sbjct: 302 ASAVART 308
>gi|426335211|ref|XP_004029126.1| PREDICTED: protein MEMO1 [Gorilla gorilla gorilla]
Length = 330
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 127/284 (44%), Positives = 168/284 (59%), Gaps = 26/284 (9%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LGPSH
Sbjct: 58 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSH 116
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 117 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 176
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 177 KAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 236
Query: 198 YDKKHGVIHKSIEALDKM-GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM----- 251
YD+ G I++SIE LDKM + I D K + + VFL M
Sbjct: 237 YDESQGEIYRSIEHLDKMVTIQKIILRDTSVLKIFKVN-----------KVFLDMVFSNI 285
Query: 252 ------LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 286 INAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 329
>gi|453089768|gb|EMF17808.1| UPF0103-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 336
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 184/332 (55%), Gaps = 44/332 (13%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----------------PEVRGVIA 45
+ +R ASHAGSWY+ + +L +L+GWL P S P R +IA
Sbjct: 5 QTVRHASHAGSWYSSSKSQLDGQLEGWLSAVKPPVSCIGPQSAGQTYSELPVPGARVIIA 64
Query: 46 PHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDL 105
PHAGYSYSG AAA+A+ D + R+FLLGPSHHY+ K ALS T Y TP+G+L +D
Sbjct: 65 PHAGYSYSGPAAAWAYQAWDVSKAKRIFLLGPSHHYHLTKAALSRCTHYATPLGNLVVDK 124
Query: 106 EVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE-------GHLIKIVPILVGAVN 158
+ +L A+G+F+ M VDE EHS+EMHLPY+ K+ +VPI+VG +
Sbjct: 125 DTTAKLHASGQFQWMSQSVDEDEHSLEMHLPYIYKMLSKTFGDDPTEFPPLVPIMVGNTS 184
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--------------- 203
E GR+ A+++ DP+N F +SSDF HWGSRF Y +Y G
Sbjct: 185 QSTERSLGRILAEHLADPTNAFVISSDFAHWGSRFRYTYYKPSTGSAVDLKSSGPVPTSP 244
Query: 204 VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML------GNCST 257
IH+SI+A+D M+ E+G + + L + NT+CGRHPI V + + G
Sbjct: 245 AIHESIKAVDFECMEACESGSHERWLHALEDTGNTVCGRHPIGVIMAAVEEVRKSGGKEG 304
Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
KF++Y++SS DSSVSYASA A +
Sbjct: 305 IGAFKFVQYDRSSLVIRVSDSSVSYASAFAVI 336
>gi|367047497|ref|XP_003654128.1| hypothetical protein THITE_2116866 [Thielavia terrestris NRRL 8126]
gi|347001391|gb|AEO67792.1| hypothetical protein THITE_2116866 [Thielavia terrestris NRRL 8126]
Length = 341
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 142/339 (41%), Positives = 182/339 (53%), Gaps = 55/339 (16%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R ASHAGSWY D++ L+ +LD +L LP P R +IAPHAGYSYSG A
Sbjct: 4 REASHAGSWYDDDADVLSSQLDDFLNRVPATLNNNNLP-IPGARVIIAPHAGYSYSGPCA 62
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +D S RVF+LGPSH YY CAL+T Y+TP GDL +D ELK TG+F
Sbjct: 63 AWAYKALDLRSAKRVFILGPSHTYYLRGCALTTFDKYETPFGDLVVDKATTSELKRTGRF 122
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYGRLFA 170
M D EHS+EMH+PYL K E IVPILVG + E E +G L +
Sbjct: 123 SNMPPGRDVDEHSLEMHIPYLWKRLEQTFGSDSTTYPPIVPILVGDGSEEEEKSFGELLS 182
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNY---------------------------------MH 197
+Y+ DP + VSSDFCHWGSRF+Y +
Sbjct: 183 RYLKDPETAWIVSSDFCHWGSRFSYRPEFHQGSIRNLDAPRSQGSKRDVVQAAPDWSKVA 242
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----- 252
D IH+ I+ALD+M MD +E+G F K + + NT+CGRHPI V + L
Sbjct: 243 EDPGAPQIHEVIKALDQMAMDAVESGTHSEFYKVIQDTHNTVCGRHPIGVIMAALELVAK 302
Query: 253 --GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
G K K +F++Y++S+ K DSSVSYASA A +
Sbjct: 303 EQGLQEGKGKFRFIQYQRSNLVKKHWDSSVSYASAYAVI 341
>gi|213403404|ref|XP_002172474.1| MEMO1 [Schizosaccharomyces japonicus yFS275]
gi|212000521|gb|EEB06181.1| MEMO1 [Schizosaccharomyces japonicus yFS275]
Length = 309
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 124/305 (40%), Positives = 178/305 (58%), Gaps = 22/305 (7%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R +HAG+WYTD+ L ++LDG++ AG P R +I+PHAGY YSG A +G
Sbjct: 7 NVRPLTHAGTWYTDDFSALKKQLDGFM--AGTQIVPGTRMIISPHAGYLYSGPTAGKCYG 64
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-FELMD 121
++D + RVF+LGPSHH YT C +S+ Y T +G+LP+D EV + L ++ K +L+
Sbjct: 65 SLDFSHCKRVFVLGPSHHMYTRDCLVSSFDAYATHLGNLPIDREVCDFLLSSSKHVQLLS 124
Query: 122 ICVDEAEHSMEMHLPYLAKVFEG----HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
DEAEHS+EM +P LA+ + + IVPI+VGA++ +E +G A +V +
Sbjct: 125 ENADEAEHSLEMQIPILAQALQNQNALQHVSIVPIMVGALSHSHEQAFGETLAPFVANEE 184
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHG---------------VIHKSIEALDKMGMDIIET 222
N F +SSDFCHWG RF Y Y G I +S+ LD++ M IE
Sbjct: 185 NVFIISSDFCHWGLRFGYTGYLNNDGKFEMLAASSHSPRAPKIFQSVHQLDRLVMSAIEE 244
Query: 223 GDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
GD + F + L + NT+CG+HPI+V L L + + +F+ Y+QSSQ DSSVSY
Sbjct: 245 GDYNHFVRILSKTKNTVCGQHPIAVILAALTKANIDKRFRFIAYDQSSQVVDIEDSSVSY 304
Query: 283 ASAAA 287
A+ A
Sbjct: 305 AAGLA 309
>gi|340931817|gb|EGS19350.1| hypothetical protein CTHT_0048080 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 337
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 185/335 (55%), Gaps = 51/335 (15%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVRGVIAPHAGYSYSGRAA 57
R ASHAGSWY D+ L+ +LD +L ++ LP P R +IAPHAGYSYSG A
Sbjct: 4 REASHAGSWYEDDPSVLSSQLDEFLGRVPASLNDSALP-IPGARVIIAPHAGYSYSGPCA 62
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +D ++ RVF+LGPSH YY CAL+T Y+TP GDL +D ELK TG+F
Sbjct: 63 AWAYKTLDLSNAKRVFILGPSHTYYLRGCALTTFDKYETPFGDLIVDKATTAELKRTGRF 122
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE-------GHLIKIVPILVGAVNAENEAMYGRLFA 170
M D EHS+EMH+PYL K E I+PILVG E E YG+L A
Sbjct: 123 SDMPARRDVDEHSLEMHIPYLWKRLEQTFGTDSAKYPPIIPILVGDGTEEQEQAYGQLLA 182
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYM------------------HYDKKHGV-------- 204
+Y+ DP+ + +SSDFCHWGSRF+Y H + + G
Sbjct: 183 QYLKDPTTAWIISSDFCHWGSRFSYRPVFDNGKIRNLDADRKDKHLELRPGWEQLVDNPD 242
Query: 205 ---IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST---- 257
IH+ IE LDKM M+ +E+G F + + NT+CGRHPI V + L +
Sbjct: 243 NMKIHEVIEMLDKMAMNAVESGAHHKFYHVIQQTQNTVCGRHPIGVIMAALEVLTKDQRP 302
Query: 258 ---KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ K+KF++Y++S+ + D SVSYASA A V
Sbjct: 303 EDGRGKLKFVQYQRSNLVEKVWDFSVSYASAYAIV 337
>gi|321258823|ref|XP_003194132.1| hypothetical protein CGB_E1360W [Cryptococcus gattii WM276]
gi|317460603|gb|ADV22345.1| conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 346
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 184/344 (53%), Gaps = 55/344 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-------PKSPEVRGVIAPHAGYSYS 53
M +R A+HAGSWYT + L +L L P + +IAPHAGYSYS
Sbjct: 1 MSGVREATHAGSWYTSSRSGLHNQLSQNLSAVKPISTLDYDPPVSNAKAIIAPHAGYSYS 60
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AAA+A+ + I RVFLLGPSHH Y P ALS Y+TP+GD+PLD++ I EL+A
Sbjct: 61 GPAAAWAYAAVPTEKIKRVFLLGPSHHAYLPGVALSKFETYETPLGDIPLDIDTINELRA 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEG-HLIKIVPILVGAVNAENEAMYGRLFAKY 172
T F M DE EHS+EMHLPY+ +F+G + +K+VPILVG NA A AKY
Sbjct: 121 TRIFSDMKSSTDEDEHSLEMHLPYIRLIFQGRNDLKLVPILVGHPNASTSAKLSEALAKY 180
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY------------------DKKHGV---------- 204
D FF +SSDFCHWGSRF+ Y G+
Sbjct: 181 WQDDETFFVISSDFCHWGSRFSCTPYYPNIPPLVNPVPPVKSSTSSTSGILSQPPELVKK 240
Query: 205 -----------IHKSIEALDKMGMDII----ETGDPDAFKKYLLEYDNTICGRHPISVFL 249
I KSIE +D GMD++ E G + + YL NTICGR+PI+V L
Sbjct: 241 FSSATSNPDVPIWKSIEYMDHEGMDLLRKPGEDGAVEKWHGYLERTKNTICGRNPITVLL 300
Query: 250 HMLG----NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+++ + K + F+RYEQSS+C T +DSSVSY S +V
Sbjct: 301 NLVQFVYKDQQVKPEFIFVRYEQSSRCVTGKDSSVSYVSGVLRV 344
>gi|317140845|ref|XP_001818439.2| hypothetical protein AOR_1_2650174 [Aspergillus oryzae RIB40]
Length = 366
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 139/363 (38%), Positives = 182/363 (50%), Gaps = 82/363 (22%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS---------PEVRGVIAPHAGYSYSGR 55
R ASHAGSWY+DN + L +LDGWL A +P S P R +IAPHAGYSYSG
Sbjct: 4 REASHAGSWYSDNQRTLTHQLDGWL--AQVPNSIEGIGSLPVPGARIIIAPHAGYSYSGP 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKA 113
AAYA+ +D + R+F+LGPSHH+Y AL T Y TP+ D LPLD E+I +L +
Sbjct: 62 CAAYAYKALDLSKAKRIFILGPSHHHYLSTLALPELTSYYTPLSDEPLPLDTELITKLLS 121
Query: 114 TG---------KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--------IVPILVGA 156
T F M +DE EHS+E+HLPY+ ++ + H +VPI+VG+
Sbjct: 122 TKAVKSNGSTISFTTMSRSIDEDEHSIELHLPYIHRLLQLHFPNKPTSQYPPLVPIMVGS 181
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------ 198
+A E+ +G L A Y+ DP+N F +SSDFCHWG RF Y +Y
Sbjct: 182 TSASTESAFGALLAPYLQDPTNAFVISSDFCHWGLRFRYTYYVPQAPKPGPKLPLSANSL 241
Query: 199 -----------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ IH+SI A D M I TG F
Sbjct: 242 PQPGDDLSSVEEKMESVSTGQALQRRDRISSREPAIHESISAFDIATMAAISTGATANFL 301
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ + NT+CGRHPI V + + ++ K K F+RYE+SS DSSVSY S
Sbjct: 302 ETIQTTGNTVCGRHPIGVIMAAIEEITSQEEGKKGKFHFVRYERSSDAVDVDDSSVSYVS 361
Query: 285 AAA 287
A A
Sbjct: 362 AFA 364
>gi|320589421|gb|EFX01882.1| duf52 domain containing protein [Grosmannia clavigera kw1407]
Length = 336
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 137/333 (41%), Positives = 178/333 (53%), Gaps = 47/333 (14%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG-------LPKSPEVRGVIAPHAGYSYS 53
+ R A HAGSWYT + + L+EEL+GWL EA LP + R VIAPHAGY YS
Sbjct: 3 LRGTRPAEHAGSWYTADPRNLSEELNGWLEEASCAINGSLLPVT-NARAVIAPHAGYRYS 61
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AA+A+ +D RVF+LGPSH YY CAL+T Y TP GD +D +V L+
Sbjct: 62 GPCAAWAYKCLDLNKARRVFVLGPSHTYYLRGCALTTFEKYATPFGDFQVDADVTARLRE 121
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL------IKIVPILVGAVNAENEAMYGR 167
+G F+ + D EHS+EM LPY+ K+ E L +VPI+VG + +E +G+
Sbjct: 122 SGFFKDIPRHNDVGEHSLEMQLPYIWKMLERTLGTGARWPSLVPIIVGDGSGADEREFGQ 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------DKK----HGVIHK 207
L Y++DP N F +SSDFCHWG F + HY D K +H+
Sbjct: 182 LLVPYLEDPENAFVISSDFCHWGRNFEFWHYCPSSDFSKIRSLSPRGDSKPSPTEPPVHE 241
Query: 208 SIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI-------- 259
I LD+ IETG F YL +NT+CGRHPISV + L T+
Sbjct: 242 FIRVLDEAAFRAIETGSQAKFVDYLHVTENTVCGRHPISVAMAALELMGTRAPTSTSTNS 301
Query: 260 -----KIKFLRYEQSSQCKTKRDSSVSYASAAA 287
K +F+RYE+S +T DSSVSY SA A
Sbjct: 302 PIGHPKFEFVRYERSDLAETTSDSSVSYGSAFA 334
>gi|405120620|gb|AFR95390.1| hypothetical protein CNAG_02444 [Cryptococcus neoformans var.
grubii H99]
Length = 346
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 182/344 (52%), Gaps = 55/344 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-------PKSPEVRGVIAPHAGYSYS 53
M +R A+HAGSWYT + L +L L P + +IAPHAGYSYS
Sbjct: 1 MSGVREATHAGSWYTSSRSGLRNQLSQNLSAVKPISTLDYDPPVSNAKAIIAPHAGYSYS 60
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AAA+A+ + I RVFLLGPSHH Y P ALS Y+TP+GD+PLD++ I+EL+A
Sbjct: 61 GPAAAWAYAAVPTEKIKRVFLLGPSHHAYLPGVALSKFEAYETPLGDIPLDIDTIDELRA 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-LIKIVPILVGAVNAENEAMYGRLFAKY 172
TG F M DE EHS+EMHLPY+ +F+G +K+VPILVG +A AKY
Sbjct: 121 TGIFSDMKSSTDEDEHSLEMHLPYIRLIFQGRDDLKLVPILVGHPSASTSVKLSEALAKY 180
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY---------------------------------- 198
D FF +SSDFCHWGSRF+ Y
Sbjct: 181 WQDDETFFVISSDFCHWGSRFSCTPYYPNVPPLANPVPPVKSSTSVTSSTLTQPLELVKK 240
Query: 199 -----DKKHGVIHKSIEALDKMGMDII----ETGDPDAFKKYLLEYDNTICGRHPISVFL 249
I KSIE +D GMD++ E G + + YL NTICGR+PI+V L
Sbjct: 241 FSSASSNPDVPIWKSIEYMDHEGMDLLRKPGEEGAVEKWHGYLDRTKNTICGRNPITVLL 300
Query: 250 HMLG----NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+++ + K + F+RYEQSS+C T +DSSVSY S ++
Sbjct: 301 NLVQLVYKDQPVKPEFTFVRYEQSSRCVTGKDSSVSYVSGVLRL 344
>gi|118376702|ref|XP_001021532.1| hypothetical protein TTHERM_00148970 [Tetrahymena thermophila]
gi|89303299|gb|EAS01287.1| hypothetical protein TTHERM_00148970 [Tetrahymena thermophila
SB210]
Length = 333
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 112/288 (38%), Positives = 171/288 (59%), Gaps = 16/288 (5%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+R ++ G WY+ L +L+ WL A + ++ +IAPHAG+S+SG AA+A+
Sbjct: 54 VREPTYQGLWYSSQDYELNIQLNVWLDMAKCDITSIKSIKAIIAPHAGFSFSGPTAAWAY 113
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
R P CALS Y+TP+G++ +D E I++L G FE+ D
Sbjct: 114 ---------RYLKQNPPTQ---KSCALSGMKTYETPLGNIEIDQETIQQLLKEGSFEVSD 161
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
+E EHS+EM L +L K IK++P++VG++++ E YG+LF+KY++ F
Sbjct: 162 KDAEEEEHSIEMQLCFLIKTLGLENIKLIPVMVGSIDSTQEEKYGKLFSKYINQEDTLFV 221
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+S+DFCHWG +F Y + K+ I++ IE LD+ GM+ IE D DAF++Y+ E +N ICG
Sbjct: 222 ISTDFCHWGQKFKYTYSTKEDCEIYEQIEKLDQKGMEHIEMNDIDAFQQYIRETENNICG 281
Query: 242 RHPISVFLHMLGNC--STKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
RH IS+ ++ + S +I KF+RY QSS + K DSSVSYA+A +
Sbjct: 282 RHAISILMNAIAKSKISNQIDTKFIRYSQSSLVRDKNDSSVSYAAAVS 329
>gi|358368861|dbj|GAA85477.1| DUF52 domain protein [Aspergillus kawachii IFO 4308]
Length = 365
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/358 (37%), Positives = 180/358 (50%), Gaps = 77/358 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R SHAGSWY+D+ L+ +LD WL+E G P R +IAPHAGY+YSGR A
Sbjct: 4 REDSHAGSWYSDDKSTLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
A+A+ +D + R+F++GPSHH+Y AL T Y TP+ D LPLD E I +L++T
Sbjct: 64 AFAYKALDLSQAKRIFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTK 123
Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
+F M VDEAEHS+E+HLPY+ ++ + +VPILVGAV
Sbjct: 124 AVSRNGLELQFTTMSRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSQYPPLVPILVGAVT 183
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD------------------- 199
E +G L A Y+DDP+N F +SSDFCHWG RF Y +Y
Sbjct: 184 ESTEKAFGTLLAPYIDDPANAFVISSDFCHWGQRFRYTYYTSQAPNPGPSIPVSAPGLPQ 243
Query: 200 ----------------------------KKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ IH+SI A+D M I TG+ F
Sbjct: 244 PGIDGDSVTDQMEMVSAGRSLKSRERIRSREPPIHESISAVDIATMAAIATGEYARFSTI 303
Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
L NT+CGRHPI V + + N K + +F+RY++SS DSSVSY SA
Sbjct: 304 LKNTGNTVCGRHPIGVIMAGIEDIRKNEGEKGRFRFIRYDRSSHVIEVDDSSVSYVSA 361
>gi|58267092|ref|XP_570702.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134111713|ref|XP_775392.1| hypothetical protein CNBE1080 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258051|gb|EAL20745.1| hypothetical protein CNBE1080 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57226936|gb|AAW43395.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 346
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 184/344 (53%), Gaps = 55/344 (15%)
Query: 1 MEKIRRASHAGSWYTDNS----KRLAEELDGWLREAGLPKSPEV---RGVIAPHAGYSYS 53
M +R A+HAGSWYT + K+L++ L + L P V + +IAPHAGYSYS
Sbjct: 1 MSGVREATHAGSWYTSSRPGLHKQLSQNLSAVKPISTLDYDPPVSNAKAIIAPHAGYSYS 60
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AAA+A+ + I RVFLLGPSHH Y P ALS Y+TP+GD+PLD + I EL+
Sbjct: 61 GPAAAWAYAAVPTEKIKRVFLLGPSHHAYLPGVALSKFEAYETPLGDIPLDTDTINELRD 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-LIKIVPILVGAVNAENEAMYGRLFAKY 172
TG F M DE EHS+EMHLPY+ +F+G +K+VPILVG +A A AKY
Sbjct: 121 TGIFSDMKSSTDEDEHSLEMHLPYIRLIFQGRDDLKLVPILVGHPSASTSAKLSEALAKY 180
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY---------------------------------- 198
D FF +SSDFCHWGSRF+ Y
Sbjct: 181 WQDGETFFVISSDFCHWGSRFSCTPYYPNIPPLANPVPPVKSSTSATPGTLTQPPELVKK 240
Query: 199 -----DKKHGVIHKSIEALDKMGMDII----ETGDPDAFKKYLLEYDNTICGRHPISVFL 249
I KSIE +D GMD++ E G + + YL NTICGR+PI+V L
Sbjct: 241 FSSASSNPDVPIWKSIEYMDHEGMDLLRKPGEDGAVEKWHGYLERTKNTICGRNPITVLL 300
Query: 250 HMLG----NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+++ N K + F+RYEQSS+C +DSSVSY S ++
Sbjct: 301 NLVQFVYKNQPVKPEFVFVRYEQSSKCVNGKDSSVSYVSGVLRL 344
>gi|432906940|ref|XP_004077602.1| PREDICTED: protein MEMO1 [Oryzias latipes]
Length = 394
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 114/228 (50%), Positives = 149/228 (65%), Gaps = 8/228 (3%)
Query: 45 APHAGYSYSGRAAA-----YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG 99
A HAG YS +A+ +DP+ RVF+LGPSHH +CALS A +Y+TP+
Sbjct: 10 ASHAGSWYSASXXXXXXXXHAYKQVDPSITRRVFILGPSHHVPLSRCALSPADIYRTPLY 69
Query: 100 DLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAV 157
DL +D +V +L TG FE + + +DE EHS+EMHLPY AK E H IVP+LVGA+
Sbjct: 70 DLRIDQKVYADLWKTGLFERISMQIDEDEHSIEMHLPYTAKAMESHKDEFSIVPVLVGAL 129
Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
+ E YG+L +KY+ DP+N F +SSDFCHWG RF Y +YD+ G I++SIE LDKMGM
Sbjct: 130 SESKEQEYGKLLSKYLADPTNLFIISSDFCHWGQRFRYTYYDESQGEIYRSIENLDKMGM 189
Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK-IKIKFL 264
IIE DP +F YL +Y NTICGRHPI V L +L T+ + + F+
Sbjct: 190 SIIEQLDPMSFTNYLKKYRNTICGRHPIGVLLALLSEVRTEDVNVTFV 237
>gi|425769569|gb|EKV08060.1| hypothetical protein PDIP_70230 [Penicillium digitatum Pd1]
gi|425771206|gb|EKV09656.1| hypothetical protein PDIG_60800 [Penicillium digitatum PHI26]
Length = 367
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 138/362 (38%), Positives = 180/362 (49%), Gaps = 79/362 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R+ASH+GSWY+D+++ LA +LDGWL + G +P R +IAPHAGYSYSG A
Sbjct: 4 RQASHSGSWYSDSARTLARQLDGWLAQVPDTMEKVGSLPTPGARVIIAPHAGYSYSGPCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
AYA+ +D + R+F+LGPSHH AL T T Y+TP+ D LPLD E+I +L
Sbjct: 64 AYAYKALDLSKAKRIFILGPSHHVSLATLALPTLTSYRTPLSDEPLPLDTELITQLFETQ 123
Query: 112 --KATG---KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
K G +F M VDE EHS+EMHLPY+ ++ + +VPI+VG +
Sbjct: 124 ATKPNGTKIRFTTMSRSVDEDEHSIEMHLPYIHRLLQLQYPDSSASEYPPLVPIMVGNTS 183
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
A E +G L A Y+ DP N F +SSDFCHWG RF Y +Y
Sbjct: 184 AATEQAFGALLAPYLADPENAFVISSDFCHWGLRFRYTYYVPQAPRPGPQLPLSGDILPQ 243
Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ IH+SI A D M I TG +F
Sbjct: 244 PGMAASSVHQAVDMVSLGHSLRQRDRISSRGPAIHESISAFDMATMAAITTGSAKSFLDI 303
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKI------KIKFLRYEQSSQCKTKRDSSVSYASA 285
+ NT+CGRHPI V + L + + FLRYE+SS + DSSVSY SA
Sbjct: 304 IERTGNTVCGRHPIGVIMAALEFIAASYAADQEGRFYFLRYERSSDVEEIDDSSVSYVSA 363
Query: 286 AA 287
A
Sbjct: 364 FA 365
>gi|449298413|gb|EMC94428.1| hypothetical protein BAUCODRAFT_149584 [Baudoinia compniacensis
UAMH 10762]
Length = 336
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 138/330 (41%), Positives = 183/330 (55%), Gaps = 44/330 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK---SPEVRG-------------VIAPH 47
+R ASHAGSWY+ ++ +L +L+G L P P+ G +I PH
Sbjct: 7 VRDASHAGSWYSASASQLDSQLEGHLAAVKPPLKCIGPQSEGQSVSELPISGARVIIGPH 66
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV 107
AGYSYSG AAA+A+ D + RVFLLGPSHH+YT + AL+ T Y TP+G L +D +
Sbjct: 67 AGYSYSGPAAAWAYRCWDVSKAKRVFLLGPSHHHYTSRAALTRCTHYATPLGHLTIDRAM 126
Query: 108 IEELKATGKFELMDICVDEAEHSMEMHLPYL----AKVFEGHLIK---IVPILVGAVNAE 160
EL +GKFE M VDE EHS+EMHLPY+ +K F G+ K +VP++VG +A
Sbjct: 127 TTELYKSGKFEWMSQSVDEDEHSLEMHLPYIHKIFSKTFSGNAAKFPLLVPVMVGNTSAS 186
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK---------------KHGVI 205
EA G+L A Y+ D +N F +SSDF HWG RF Y Y + I
Sbjct: 187 TEATLGKLLAPYLADETNAFVISSDFAHWGLRFRYTLYRSPGASQVNLAAGAKAPTNPAI 246
Query: 206 HKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKI--- 259
H+SI+A+D + E+GD + + L NT+CGRHPI V + L S K
Sbjct: 247 HESIKAVDFDCIASCESGDHSQWLETLESTGNTVCGRHPIGVMMAAVEELRRTSGKKDAG 306
Query: 260 KIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
KF+RYE+S + K SSVSYASA A +
Sbjct: 307 AFKFVRYERSEEVKKVSQSSVSYASAFAVI 336
>gi|350635890|gb|EHA24251.1| hypothetical protein ASPNIDRAFT_209410 [Aspergillus niger ATCC
1015]
Length = 365
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 136/358 (37%), Positives = 178/358 (49%), Gaps = 77/358 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R SHAGSWY+D+ L+ +LD WL+E G P R +IAPHAGY+YSGR A
Sbjct: 4 REDSHAGSWYSDDKSTLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
A+A+ +D + R+F++GPSHH+Y AL T Y TP+ D LPLD E I +L++T
Sbjct: 64 AFAYKALDLSQAKRIFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTR 123
Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
+F M VDEAEHS+E+HLPY+ ++ + +VPILVGAV
Sbjct: 124 AGSRNGLELQFTTMSRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSEYPPLVPILVGAVT 183
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD------------------- 199
E +G L A Y+DDP N F +SSDFCHWG RF Y +Y
Sbjct: 184 ESTEKAFGALLAPYIDDPENAFVISSDFCHWGQRFRYTYYTPQAPNPGPSIPVSAPGLPQ 243
Query: 200 ----------------------------KKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ IH+SI A+D M I TG+ F
Sbjct: 244 PGIDSDNVTDQMEMVSTGRSLKSRDRIRSREPPIHESISAVDLATMAAITTGEYARFSTI 303
Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
L NT+CGRHPI V + + N K + F+RY++SS DSSVSY SA
Sbjct: 304 LKNTGNTVCGRHPIGVIMAGIEEIRKNEGEKGRFHFIRYDRSSNVIEVDDSSVSYVSA 361
>gi|317029592|ref|XP_001391938.2| hypothetical protein ANI_1_1068064 [Aspergillus niger CBS 513.88]
Length = 365
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 136/358 (37%), Positives = 178/358 (49%), Gaps = 77/358 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R SHAGSWY+D+ L+ +LD WL+E G P R +IAPHAGY+YSGR A
Sbjct: 4 REDSHAGSWYSDDKSTLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
A+A+ +D + R+F++GPSHH+Y AL T Y TP+ D LPLD E I +L++T
Sbjct: 64 AFAYKALDLSQAKRIFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTR 123
Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
+F M VDEAEHS+E+HLPY+ ++ + +VPILVGAV
Sbjct: 124 AGSRNGLELQFTTMSRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSEYPPLVPILVGAVT 183
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD------------------- 199
E +G L A Y+DDP N F +SSDFCHWG RF Y +Y
Sbjct: 184 ESTEKAFGALLAPYIDDPENAFVISSDFCHWGQRFRYTYYTPQAPNPGPSIPVSAPGLPQ 243
Query: 200 ----------------------------KKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ IH+SI A+D M I TG+ F
Sbjct: 244 PGIDGDNVTDQMEMVSTGRSLKSRDRIRSREPPIHESISAVDLATMAAITTGEYARFSTI 303
Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
L NT+CGRHPI V + + N K + F+RY++SS DSSVSY SA
Sbjct: 304 LKNTGNTVCGRHPIGVIMAGIEEIRKNEGEKGRFHFIRYDRSSNVIEVDDSSVSYVSA 361
>gi|255943395|ref|XP_002562466.1| Pc18g06390 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587199|emb|CAP94863.1| Pc18g06390 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 367
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 136/362 (37%), Positives = 178/362 (49%), Gaps = 79/362 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R ASH+GSWY+D+++ L +LDGWL + G +P R +IAPHAGYSYSG A
Sbjct: 4 RHASHSGSWYSDSARTLTRQLDGWLAQVPDTMEKVGSLPTPGARVIIAPHAGYSYSGPCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
AYA+ +D + R+F+LGPSHH AL T T Y+TP+ D LPLD E+I +L T
Sbjct: 64 AYAYKALDLSKAKRIFILGPSHHVSLATLALPTLTSYRTPLSDEPLPLDTELITQLLKTQ 123
Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
F M VDE EHS+EMHLPY+ ++ + +VPI+VG +
Sbjct: 124 ATRPNGAKISFTTMSRSVDEDEHSIEMHLPYIHRLLQLQYPDSSASEYPPLVPIMVGNTS 183
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
A E +G L A Y+ DP+N F +SSDFCHWG RF Y +Y
Sbjct: 184 AATEQAFGALLAPYLADPANAFVISSDFCHWGLRFRYTYYVPQAARPGPELPLSGDILPQ 243
Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ IH+SI A D M I TG +F
Sbjct: 244 PGMDASSVDQVVEMVSAGHSLRQRDRISSREPAIHESISAFDMATMAAITTGSAKSFLDI 303
Query: 232 LLEYDNTICGRHPISVFLHMLGNCST------KIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ NT+CGRHPI V + L + + + FLRYE+SS + DSSVSY SA
Sbjct: 304 IERTGNTVCGRHPIGVIMAALEVVTASRAADQEGRFYFLRYERSSDVEDIDDSSVSYVSA 363
Query: 286 AA 287
A
Sbjct: 364 FA 365
>gi|237834219|ref|XP_002366407.1| hypothetical protein TGME49_027840 [Toxoplasma gondii ME49]
gi|211964071|gb|EEA99266.1| hypothetical protein TGME49_027840 [Toxoplasma gondii ME49]
gi|221486634|gb|EEE24895.1| conserved hypothetical protein [Toxoplasma gondii GT1]
gi|221508391|gb|EEE33978.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 302
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 125/294 (42%), Positives = 170/294 (57%), Gaps = 10/294 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
+RRA+HAGSWY+ + L + D L A K+ E V +I PHAGY+Y R AA+A+
Sbjct: 3 VRRAAHAGSWYSSDPGELTQLFDRCLATAE--KTEENVIALICPHAGYAYCARTAAWAWR 60
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATV--YKTPIGDLPLDLEVIEELKATGKFELM 120
+ P ++ RVF+LGPSHH + P CAL ++V Y TP+GD+ LD V+E L++ FE +
Sbjct: 61 QVSPENVRRVFVLGPSHHVFLPGCALPASSVRAYATPLGDISLDTAVLETLRSAKLFETL 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ DE EHS+EM LP+L + G +VPI+VG + A + Y N F
Sbjct: 121 ALRDDEEEHSIEMQLPFLRHILRGKSFTLVPIVVGDLRPSGHAAVAKALRPYFLQEGNLF 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV--IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSDFCHWG RF Y + I + I LDK G +IE DP F++Y NT
Sbjct: 181 VFSSDFCHWGRRFRYSYLPPATASLPIFERIGILDKEGAALIEQQDPAGFQEYYERTGNT 240
Query: 239 ICGRHPISVFLHML---GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
ICG +PIS+FLH+L G + K K L Y QSSQ + + SSVSYA+ A+ +
Sbjct: 241 ICGHNPISIFLHLLEASGRPRSAFKTKLLDYSQSSQVENESSSSVSYAAFASSL 294
>gi|70989359|ref|XP_749529.1| DUF52 domain protein [Aspergillus fumigatus Af293]
gi|66847160|gb|EAL87491.1| DUF52 domain protein [Aspergillus fumigatus Af293]
gi|159128940|gb|EDP54054.1| DUF52 domain protein [Aspergillus fumigatus A1163]
Length = 402
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 138/361 (38%), Positives = 178/361 (49%), Gaps = 78/361 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R SHAGSWY+DN L +LD WL G P R +IAPHAGY+YSG A
Sbjct: 40 REPSHAGSWYSDNRSTLTRQLDQWLAHVPNEIEGIGSLPVPGSRVIIAPHAGYAYSGPCA 99
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
AYA+ +D + R+F+LGPSHH+Y AL T Y TP+ D LPLD E+I +L
Sbjct: 100 AYAYRALDLSKAKRIFILGPSHHHYLSTLALPQLTSYYTPLSDEPLPLDTELIAKLLSAK 159
Query: 112 -----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE-GHLIK-------IVPILVGAVN 158
+T F M VDE EHS+E+HLPY+ ++ + H K +VPILVG+ +
Sbjct: 160 AVKPNGSTVSFTTMTRSVDEDEHSIELHLPYIHRLLQLQHPTKRTSQYPPLVPILVGSTS 219
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
A E +G L A Y++DPSN F +SSDFCHWG RF+Y +Y
Sbjct: 220 ASTEQAFGALLASYLEDPSNVFVISSDFCHWGLRFSYTYYVPQAPKPGPKLPLSADALPQ 279
Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ + IH+SI A D M I TG+ + F
Sbjct: 280 PSNDLDEFEERIELVSAGHSLQRRDRINSREPAIHESISAFDIATMAAIATGETENFLDV 339
Query: 232 LLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ NT+CGRHPI V + + T K F+RYE+SS DSSVSY SA
Sbjct: 340 IQRTGNTVCGRHPIGVIMAAIEATRTQEDGKKGAFHFIRYERSSDAVNVTDSSVSYVSAF 399
Query: 287 A 287
A
Sbjct: 400 A 400
>gi|119480005|ref|XP_001260031.1| hypothetical protein NFIA_080780 [Neosartorya fischeri NRRL 181]
gi|119408185|gb|EAW18134.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 366
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 138/361 (38%), Positives = 178/361 (49%), Gaps = 78/361 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R SHAGSWY+DN L +LD WL G P R +IAPHAGY+YSG A
Sbjct: 4 REPSHAGSWYSDNRSTLTRQLDQWLAHVPDEIEGIGSLPVPGSRVIIAPHAGYAYSGPCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
AYA+ +D + R+F+LGPSHH+Y AL T Y TP+ D LPLD E+I +L
Sbjct: 64 AYAYRALDLSKAKRIFILGPSHHHYLSTLALPQLTSYYTPLSDEPLPLDTELIAKLLSAK 123
Query: 112 -----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE-GHLIK-------IVPILVGAVN 158
+T F M VDE EHS+E+HLPY+ ++ + H K +VPILVG+ +
Sbjct: 124 AVKPNGSTVSFTTMTRSVDEDEHSIELHLPYIHRLLQLQHPTKRTSQYPPLVPILVGSTS 183
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
A E +G L A Y++DPSN F +SSDFCHWG RF+Y +Y
Sbjct: 184 ASTEQAFGALLASYMEDPSNVFVISSDFCHWGLRFSYTYYVPQAPKPGPKLPLSADVLPQ 243
Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ + IH+SI A D M I TG+ + F
Sbjct: 244 PSNDLDEFEEKIEQVSAGHSLQRRDRINSREPAIHESISAFDIATMAAIATGETENFLDA 303
Query: 232 LLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ NT+CGRHPI V + + T K F+RYE+SS DSSVSY SA
Sbjct: 304 IQRTGNTVCGRHPIGVIMAAIEATRTQEDGKKGAFHFIRYERSSDAVNVTDSSVSYVSAF 363
Query: 287 A 287
A
Sbjct: 364 A 364
>gi|443924045|gb|ELU43122.1| Memo-like domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 486
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/297 (45%), Positives = 167/297 (56%), Gaps = 44/297 (14%)
Query: 21 LAEELDGWLR-------EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVF 73
L +LDGWL E P + +IAPHAGYSYSG AAA+A+ +ID T I RVF
Sbjct: 66 LDSQLDGWLALVQDGALEGFRPPVTGCKAIIAPHAGYSYSGPAAAWAYKSIDTTGIKRVF 125
Query: 74 LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEM 133
+LGPSHH Y CALS Y+TPI I +L+ TG FE M + DE EHS+EM
Sbjct: 126 ILGPSHHVYLRSCALSKCQSYETPIA--------IAKLRETGAFEDMPLGTDEDEHSIEM 177
Query: 134 HLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHW---- 189
HLPY+ K+F G IK+VPILVGAV+ + EA +G L A Y +P VSSDFCH
Sbjct: 178 HLPYVRKIFAGLDIKVVPILVGAVDQDQEADFGALLAPYFAEPDTICIVSSDFCHCYTFY 237
Query: 190 ---------GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE--TGDP---------DAFK 229
++ + + I+KSI ALDK MDI+ T P DAF
Sbjct: 238 YPSPPPSATPTKLSRSIQPNEDYPIYKSITALDKQAMDILTLPTEAPDTKTAAQTHDAFA 297
Query: 230 KYLLEYDNTICGRHPISVFLHML-----GNCSTKIKIKFLRYEQSSQCKTKRDSSVS 281
KYL E NTICGRHPI V L L G+ +K++RYEQSS+C+ +DSS +
Sbjct: 298 KYLKETKNTICGRHPIGVLLGALAELERGDSPRASAVKWVRYEQSSECRKLQDSSTA 354
>gi|83766294|dbj|BAE56437.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 392
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/357 (37%), Positives = 177/357 (49%), Gaps = 82/357 (22%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS---------PEVRGVIAPHAGYSYSGR 55
R ASHAGSWY+DN + L +LDGWL A +P S P R +IAPHAGYSYSG
Sbjct: 4 REASHAGSWYSDNQRTLTHQLDGWL--AQVPNSIEGIGSLPVPGARIIIAPHAGYSYSGP 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKA 113
AAYA+ +D + R+F+LGPSHH+Y AL T Y TP+ D LPLD E+I +L +
Sbjct: 62 CAAYAYKALDLSKAKRIFILGPSHHHYLSTLALPELTSYYTPLSDEPLPLDTELITKLLS 121
Query: 114 TG---------KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--------IVPILVGA 156
T F M +DE EHS+E+HLPY+ ++ + H +VPI+VG+
Sbjct: 122 TKAVKSNGSTISFTTMSRSIDEDEHSIELHLPYIHRLLQLHFPNKPTSQYPPLVPIMVGS 181
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------ 198
+A E+ +G L A Y+ DP+N F +SSDFCHWG RF Y +Y
Sbjct: 182 TSASTESAFGALLAPYLQDPTNAFVISSDFCHWGLRFRYTYYVPQAPKPGPKLPLSANSL 241
Query: 199 -----------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ IH+SI A D M I TG F
Sbjct: 242 PQPGDDLSSVEEKMESVSTGQALQRRDRISSREPAIHESISAFDIATMAAISTGATANFL 301
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVS 281
+ + NT+CGRHPI V + + ++ K K F+RYE+SS DSS S
Sbjct: 302 ETIQTTGNTVCGRHPIGVIMAAIEEITSQEEGKKGKFHFVRYERSSDAVDVDDSSFS 358
>gi|67526663|ref|XP_661393.1| hypothetical protein AN3789.2 [Aspergillus nidulans FGSC A4]
gi|40740807|gb|EAA59997.1| hypothetical protein AN3789.2 [Aspergillus nidulans FGSC A4]
gi|259481656|tpe|CBF75380.1| TPA: DUF52 domain protein (AFU_orthologue; AFUA_2G03970)
[Aspergillus nidulans FGSC A4]
Length = 366
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 143/364 (39%), Positives = 182/364 (50%), Gaps = 78/364 (21%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRA 56
R ASHAGSWY+ N + LA +LD WL G +P R +IAPHAGYSYSG
Sbjct: 3 FRAASHAGSWYSGNRQTLALQLDQWLDRVPGTLEGLGSLPAPGARIIIAPHAGYSYSGPC 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKAT 114
AAYA+ +D + R+F+LGPSHH+Y AL T T Y TP+ D LPLD E+I +L++T
Sbjct: 63 AAYAYKALDLSKAKRIFVLGPSHHHYFTSLALPTLTGYYTPLSDDPLPLDTELIAKLRST 122
Query: 115 G---------KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAV 157
FE M VDE EHS+E+HLPY+ ++ + +VPILVG+
Sbjct: 123 PAAKSNGSTIAFEDMSQSVDEDEHSIELHLPYIHRLLQLQHPDKKAAEYPPLVPILVGST 182
Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------- 204
+ EA +G L A Y++DPSN F VSSDFCHWG RF Y +Y + V
Sbjct: 183 SKATEAAFGALLAPYLEDPSNAFVVSSDFCHWGLRFRYTYYVPQAPVPGPRLPLSSYDLP 242
Query: 205 ----------------------------------IHKSIEALDKMGMDIIETGDPDAFKK 230
IH+SI + D M I TG + F
Sbjct: 243 QPEDDETEIESKIEVACAGYHLKRRDPIRRGKPLIHESISSFDIATMAAIATGKTERFTD 302
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCST----KIKIKFLRYEQSSQC-KTKRDSSVSYASA 285
L + NT+CGRHPI V + + K K F RYE+SS + DSSVSY SA
Sbjct: 303 ILEKTGNTVCGRHPIGVIMAAIEAAGKQEGGKGKFHFARYERSSDIGEIVEDSSVSYVSA 362
Query: 286 AAKV 289
A V
Sbjct: 363 FAVV 366
>gi|449018288|dbj|BAM81690.1| probable CGI-27 protein [Cyanidioschyzon merolae strain 10D]
Length = 365
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 179/346 (51%), Gaps = 63/346 (18%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR---------------------------------- 30
R A HAGSWY +++RL++ L+ WLR
Sbjct: 6 REAQHAGSWYQSDTERLSQALEQWLRRVPSQLTPLRPADDAILFRNLWGKRDHFLDDSNV 65
Query: 31 EAGLP--KSPEV---------RG----VIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLL 75
EA L ++P+ RG ++ PHAGY+YSG AA+A+ +DP RV ++
Sbjct: 66 EASLSSEQNPQQVQTQTQVQGRGTLEILVVPHAGYAYSGETAAFAYAQVDPARFDRVVII 125
Query: 76 GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT--GKFELMDICVDEAEHSMEM 133
GPSHH Y KC LS A TP+GDL +D +++ FE++ DE EHS+EM
Sbjct: 126 GPSHHVYMQKCGLSGAERLATPLGDLLVDTALVQSWSQLYPELFEILSKSTDEDEHSIEM 185
Query: 134 HLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGS 191
LP+LAKVF+G L ++ VPI+ GA++ E GR A+ V P VS+DFCHWG
Sbjct: 186 QLPFLAKVFQGRLQQVRFVPIMCGALSPSKEKACGRFCAEQVLQPRTLLVVSTDFCHWGQ 245
Query: 192 RFNYM--------HYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
RF Y ++ GV + I ALD MGM +I D + F+ YL + NTICGR
Sbjct: 246 RFGYTPGCKDKLAQERQRLGVGLSDYIRALDSMGMYLISLKDAEGFRAYLQKTGNTICGR 305
Query: 243 HPISVFLHMLGNCS-TKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
HP++V L + + ++ FLRY QSS C DSSVSY + A
Sbjct: 306 HPLAVMLWAVHFAEPSAWRLVFLRYAQSSACVDAHDSSVSYVAGAG 351
>gi|403348996|gb|EJY73945.1| Cell motility mediator [Oxytricha trifallax]
Length = 323
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 124/300 (41%), Positives = 177/300 (59%), Gaps = 18/300 (6%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
IR+A+HAGSWYT++ L EL +L ++ + + +++ +I PHAG+ +SG AA+
Sbjct: 16 IRKANHAGSWYTEDPDDLNTELKQYLNQSQQTVDVNEQGQMKALIGPHAGFRFSGPTAAW 75
Query: 60 AFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
A+ NI +P RVFLLGPSH Y A + ++TP+G+L +D I EL GK
Sbjct: 76 AYINIKNPEQYKRVFLLGPSHKVYLDNIATTACDEWETPLGNLKIDHITITELIQNGKEQ 135
Query: 117 --FELMDICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYV 173
F+ + +E EHS+EMH+P++ K+FEG + +VP++VG + + Y L Y
Sbjct: 136 ELFQQISKKYEENEHSLEMHIPFIRKMFEGREDVTLVPLMVGQIPDDKFDRYAELLMPYF 195
Query: 174 DDPSNFFSVSSDFCHWGSRFNY-MHYDK--KHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
D F VSSDFCHWG RF + + Y K IHKSIE LD GM +I+ D + FK+
Sbjct: 196 LDQQTLFIVSSDFCHWGKRFQFTLRYQDQDKSEPIHKSIEKLDHEGMSLIQNHDFNGFKQ 255
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCST----KIKIKFLRYEQSSQCKT-KRDSSVSYASA 285
YL NTICGR+PI V L ++ K KF++Y QSS+ + + DSSVSYAS+
Sbjct: 256 YLKTLKNTICGRYPIQVLLAVINKVEQTQQYKCSTKFVKYAQSSEIQMDEDDSSVSYASS 315
>gi|336467325|gb|EGO55489.1| hypothetical protein NEUTE1DRAFT_131235 [Neurospora tetrasperma
FGSC 2508]
gi|350288045|gb|EGZ69281.1| UPF0103-domain-containing protein [Neurospora tetrasperma FGSC
2509]
Length = 355
Score = 224 bits (570), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 181/336 (53%), Gaps = 52/336 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYS 53
++ R ASHAGSWY N+ RL+ +LD ++ LP P R +IAPHAGYSYS
Sbjct: 17 FQRTRPASHAGSWYLGNAARLSSQLDEFMSRVPNKLDGRDLP-IPGARVIIAPHAGYSYS 75
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AA+A+ +D ++ RVFLLGPSH +Y CALST Y TP GDL +D + ++EL
Sbjct: 76 GPCAAWAYKILDLANVKRVFLLGPSHTFYLKGCALSTFGKYSTPFGDLVVDGKAVDELME 135
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYG 166
KF + + D EH +EMHLPYL K E L IVP+LVG ++A+ E G
Sbjct: 136 DQKFSPIPVEYDIREHCLEMHLPYLWKRLEQTLGSDSSQFPPIVPVLVGDLSADGEKAVG 195
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------------- 204
L A Y+ DP N F +SSDFCHWG ++Y K V
Sbjct: 196 SLLAPYLADPKNAFIISSDFCHWGKNYHYRPQLHKGLVRDQDAQWASLKIQPGFKTKADG 255
Query: 205 ---------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML--- 252
IH+ I+ALD + MD I+TGD + L +NT+CGRHPI V L L
Sbjct: 256 SEPRDAEVPIHEVIKALDDLVMDSIKTGDHSDYYSILKGTNNTVCGRHPIGVVLAALEKM 315
Query: 253 ---GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ +K K +F++Y++S+ + + D SVSY SA
Sbjct: 316 GGAQSGGSKGKFQFVQYQRSNLVEKQSDFSVSYVSA 351
>gi|355565594|gb|EHH22023.1| hypothetical protein EGK_05205, partial [Macaca mulatta]
Length = 240
Score = 224 bits (570), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 115/272 (42%), Positives = 156/272 (57%), Gaps = 37/272 (13%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH 79
+L +L+GWL + K P R +I HAGY+Y G A +A+ +DP+ R+F+LGPSH
Sbjct: 3 QLNAQLEGWLSQVQSTKRP-ARAII--HAGYTYCGSCATHAYKQVDPSVTRRIFILGPSH 59
Query: 80 HYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA 139
H +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLPY A
Sbjct: 60 HVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTA 119
Query: 140 KVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
K E H I+P+LVGA++ E + +LF+KY+ DPSN F VSSDFCHWG RF Y +
Sbjct: 120 KAMESHKDEFTIIPVLVGALSESKEQEFRKLFSKYLADPSNLFVVSSDFCHWGQRFRYSY 179
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
YD+ G I++SIE LDKM + ++
Sbjct: 180 YDESQGEIYRSIEHLDKMAITELQKN--------------------------------GM 207
Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 208 NMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 239
>gi|121710340|ref|XP_001272786.1| DUF52 domain protein [Aspergillus clavatus NRRL 1]
gi|119400936|gb|EAW11360.1| DUF52 domain protein [Aspergillus clavatus NRRL 1]
Length = 366
Score = 224 bits (570), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 134/361 (37%), Positives = 180/361 (49%), Gaps = 78/361 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R ASHAGSWY+D+ L +LD WL + G P R +IAPHAGY+YSG A
Sbjct: 4 REASHAGSWYSDSKTTLTRQLDQWLGQVPNTIDGIGDLPVPGSRVIIAPHAGYAYSGPCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATG 115
AYA+ +D + R+F+LGPSHH+Y A+ T Y TP+ D LPLD E+I +L +T
Sbjct: 64 AYAYKALDLSQAKRIFILGPSHHHYLSTLAVPQLTSYYTPLSDEPLPLDTELISQLLSTK 123
Query: 116 ---------KFELMDICVDEAEHSMEMHLPYLAKVFE-GHLIK-------IVPILVGAVN 158
F M +DE EHS+E+HLPY+ ++ + H K +VPILVG+ +
Sbjct: 124 AVKPNGSTVSFTTMTRSIDEDEHSIELHLPYIHRLLQLQHPTKRTSEYPLLVPILVGSTS 183
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------------- 198
E +G L A Y++DP+N F +SSDFCHWGSRF+Y +Y
Sbjct: 184 TTTEKAFGALLAPYLEDPTNAFIISSDFCHWGSRFSYTYYVPQAPKPGPKLPLSANALPQ 243
Query: 199 ---------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ IH+SI A D M I TG+ + F
Sbjct: 244 PGEKPDEFEESVEQVSAGHALQRRDRISSREPAIHESISAFDIATMAAIATGETENFLDV 303
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIK-----FLRYEQSSQCKTKRDSSVSYASAA 286
+ NT+CGRHPI V + + + + K F+RYE+SS DSSVSY SA
Sbjct: 304 IQRTGNTVCGRHPIGVIMAAIEETAAREDGKKGMFHFIRYERSSDAVDVTDSSVSYVSAF 363
Query: 287 A 287
A
Sbjct: 364 A 364
>gi|242808341|ref|XP_002485142.1| DUF52 domain protein [Talaromyces stipitatus ATCC 10500]
gi|218715767|gb|EED15189.1| DUF52 domain protein [Talaromyces stipitatus ATCC 10500]
Length = 354
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 134/333 (40%), Positives = 170/333 (51%), Gaps = 68/333 (20%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL--------REAGLPKSPEVRGVIAPHAGYSYSGRA 56
RR +HAGSWY D+S L ELDGWL + LP P R +I PHAGY+YSG
Sbjct: 4 RRPTHAGSWYEDDSTTLKAELDGWLDAVPNEIDKLGALPI-PGARIIIGPHAGYAYSGPC 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELK-A 113
AA+A+ +D + R+FLLGPSHH+ AL T Y TP+ D LPLD E+I +++ A
Sbjct: 63 AAWAYKALDLSKAKRIFLLGPSHHHPLATIALPEVTSYATPLSDEPLPLDTEIINKIRTA 122
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAK----VFEGHLIK----IVPILVGAVNAENEAMY 165
+ FE M VDE EHSME+HLPY+ + F G +VPI+VG+ NAE E
Sbjct: 123 SSAFETMSRRVDEREHSMELHLPYIHRKLQLTFPGRPASEYPPLVPIMVGSTNAETERAV 182
Query: 166 GRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK------------------------ 201
G L A Y+ DPSN F +SSDFCHWG RF Y +Y +
Sbjct: 183 GALLATYLADPSNAFVISSDFCHWGQRFGYTYYVRDAPQQKPPLPLSFDSLPQPPTTVSA 242
Query: 202 -------------------HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
H I+ SI A+D M I TG+ F K L NT+CGR
Sbjct: 243 AEDTLQKVPGRSLRRNSELHAEIYDSISAVDIATMKAITTGNASQFLKSLRTTGNTVCGR 302
Query: 243 HPISVFLHMLGNCSTKI-----KIKFLRYEQSS 270
HPI V + L ++ K F+RYE+SS
Sbjct: 303 HPIGVIMAALEEYASVAGEEVGKFHFIRYERSS 335
>gi|296818439|ref|XP_002849556.1| DUF52 domain-containing protein [Arthroderma otae CBS 113480]
gi|238840009|gb|EEQ29671.1| DUF52 domain-containing protein [Arthroderma otae CBS 113480]
Length = 371
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 135/367 (36%), Positives = 182/367 (49%), Gaps = 83/367 (22%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVRGVIAPHAGYSYSGRA 56
+RR SHAGSWY+D+ L ++D WL G P R +IAPHAGYSYSG
Sbjct: 3 VRRPSHAGSWYSDDGATLQAQIDKWLDLVPNELEGVGSLPVPGARVIIAPHAGYSYSGPC 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKAT 114
AA+A+ +D T R+FL+GPSHH++ AL T Y+TP+ LPLD +++ EL+A+
Sbjct: 63 AAFAYKCLDLTKAKRIFLIGPSHHHHLTTIALPELTGYRTPLSSSPLPLDTKLLSELRAS 122
Query: 115 G-----KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAEN 161
+F M VDEAEHSME+HLPY+ ++ + +VP++VG+ +A
Sbjct: 123 TASSAKRFTAMSPSVDEAEHSMELHLPYIHRLLQRLYPSEPTSAYPPLVPMMVGSTSAAT 182
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------------- 198
E +G + + Y+ D N F VSSDFCHWG+RF Y +Y
Sbjct: 183 ERAFGSILSPYLADEENAFVVSSDFCHWGTRFGYTYYTTHAPGAPFDMPLTYPSLPVPSD 242
Query: 199 -----------------------DKKH-----GVIHKSIEALDKMGMDIIETGDPDAFKK 230
DK + IH+SI A D M I +GD D F
Sbjct: 243 RLTAAKAAADVDAVSSGQMLREKDKSYRAADACAIHESISACDIACMSAIASGDTDVFLD 302
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCS----------TKIKIKFLRYEQSSQCKTKRDSSV 280
L NT+CGRHPI V + + K + F+RYE+SS C + RDSSV
Sbjct: 303 ALKRTGNTVCGRHPIGVVMAGMEGLQQEQAAEKEGVQKGRFYFMRYERSSDCFSVRDSSV 362
Query: 281 SYASAAA 287
SY SA A
Sbjct: 363 SYVSAFA 369
>gi|170575060|ref|XP_001893082.1| Hypothetical UPF0103 protein C37C3.8 in chromosome V [Brugia
malayi]
gi|158601091|gb|EDP38091.1| Hypothetical UPF0103 protein C37C3.8 in chromosome V, putative
[Brugia malayi]
Length = 261
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 113/233 (48%), Positives = 151/233 (64%), Gaps = 8/233 (3%)
Query: 66 PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVD 125
P +S +F+LGPSH CALST + Y+TPIGDL +D ELK TG F LMD+ +
Sbjct: 20 PKPVSIIFVLGPSHVMSLDTCALSTCSRYRTPIGDLQIDQRTNMELKETGAFSLMDLRSE 79
Query: 126 EAEHSMEMHLPYLAKVFEGHLIK---IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
EAEHS+EM LPY+AK+ E + IVP+LVG+++ +A YG++F+KY+ DP F V
Sbjct: 80 EAEHSIEMQLPYIAKIMEKKSMNSYSIVPVLVGSLSPSKQASYGKIFSKYLSDPKIVFVV 139
Query: 183 SSDFCHWGSRFNYMHYDKKHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSDFCHWG+RF++M YD GV I++ I A+DK GMD I + +P F +YL NTICG
Sbjct: 140 SSDFCHWGNRFHFMPYDNSTGVPIYEQIAAMDKQGMDAISSLNPAVFGEYLKRTQNTICG 199
Query: 242 RHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
R+PISV LH + FL+Y QS+Q + DSSVSYA+ A ++
Sbjct: 200 RNPISVILHAAEXFRQMNNHFAEFIFLKYAQSNQSRNLNDSSVSYAAGALFIN 252
>gi|85092720|ref|XP_959514.1| hypothetical protein NCU11386 [Neurospora crassa OR74A]
gi|28920949|gb|EAA30278.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 355
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 134/336 (39%), Positives = 181/336 (53%), Gaps = 52/336 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYS 53
++ R ASHAGSWY N+ RL+ +LD ++ LP P R +IAPHAGYSYS
Sbjct: 17 FQRTRPASHAGSWYLGNAARLSSQLDEFMSRVPNKLDGRDLP-IPGARVIIAPHAGYSYS 75
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AA+A+ +D ++ RVFLLGPSH +Y CALST Y TP GDL +D + ++EL
Sbjct: 76 GPCAAWAYKILDLANVKRVFLLGPSHTFYLKGCALSTFGKYSTPFGDLVVDGKAVDELME 135
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYG 166
KF + + D EH +EMHLPYL K E L IVP+LVG ++A+ E G
Sbjct: 136 DQKFSPIPVEYDIREHCLEMHLPYLWKRLEQTLGGDSSQFPPIVPVLVGDLSADGEKAVG 195
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---------------------- 204
+ A Y+ DP N F +SSDFCHWG ++Y K V
Sbjct: 196 SILAPYLADPKNAFIISSDFCHWGKNYHYRPQLHKGLVRDQDAQWASLKIQPGFKTKADG 255
Query: 205 ---------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML--- 252
IH+ I+ALD + MD I+TGD + L +NT+CGRHPI V L L
Sbjct: 256 SEPRDAEVPIHEVIKALDDLVMDSIKTGDHSDYYSILKGTNNTVCGRHPIGVVLAALEKM 315
Query: 253 ---GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ +K K +F++Y++S+ + + D SVSY SA
Sbjct: 316 GGAQSGESKGKFQFVQYQRSNLVEKQSDFSVSYVSA 351
>gi|348574319|ref|XP_003472938.1| PREDICTED: protein MEMO1-like [Cavia porcellus]
Length = 341
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 139/204 (68%), Gaps = 3/204 (1%)
Query: 17 NSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLG 76
N +L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+ R+F+LG
Sbjct: 120 NGPQLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILG 178
Query: 77 PSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLP 136
PSHH +CALS+ +Y+TP+ DL +D ++ EL TG FE M + DE EHS+EMHLP
Sbjct: 179 PSHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLP 238
Query: 137 YLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFN 194
Y AK E H I+P+LVGA++ E +G+LF+KY+ DPSN F VSSDFCHWG RF
Sbjct: 239 YTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFR 298
Query: 195 YMHYDKKHGVIHKSIEALDKMGMD 218
Y +YD+ G I++SIE LDKM ++
Sbjct: 299 YSYYDESQGEIYRSIEHLDKMVLE 322
>gi|336260613|ref|XP_003345100.1| hypothetical protein SMAC_07390 [Sordaria macrospora k-hell]
gi|380096552|emb|CCC06600.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 343
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 138/338 (40%), Positives = 183/338 (54%), Gaps = 53/338 (15%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSG 54
++ R ASHAGSWY ++ RL+ +LD ++ LP P R +IAPHAGYSYSG
Sbjct: 5 QETRPASHAGSWYLGSAARLSFQLDEYMSRVPNKLDGKDLP-IPGARVIIAPHAGYSYSG 63
Query: 55 RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+A+ +D ++ RVFLLGPSH +Y CALS Y +P GDL +D E I+EL T
Sbjct: 64 PCAAWAYKVLDLANVKRVFLLGPSHTFYLKGCALSAFGKYSSPFGDLAVDGEAIDELMKT 123
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL-------IKIVPILVGAVNAENEAMYGR 167
KF + + EH +EMHLPYL K E L IVPILVG ++A+ E G
Sbjct: 124 KKFSTIPADYEIQEHCLEMHLPYLWKRLEQTLGSDSSQFPPIVPILVGDLSADGEETIGA 183
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNY-----------------------MHYDKKHGV 204
L A Y+ DP N F +SSDFCHWG + Y + +K G
Sbjct: 184 LLAPYLADPKNAFIISSDFCHWGKNYRYRPQLYKGLVRDQDAQWATLKIQPIFENKVDGT 243
Query: 205 --------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML---- 252
IH+ I+ALD M +D I+TGD + L NT+CGRHPI V L L
Sbjct: 244 EPRDADVPIHEVIKALDDMVIDAIKTGDHSEYYSTLKGTHNTVCGRHPIGVVLAALEEFL 303
Query: 253 -GNCS--TKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
G+ S +K K +F++Y++S+ + + D SVSY SA A
Sbjct: 304 GGDQSGGSKGKFQFIQYQRSNLVERQSDFSVSYVSAYA 341
>gi|409083485|gb|EKM83842.1| hypothetical protein AGABI1DRAFT_103914 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 503
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 174/325 (53%), Gaps = 42/325 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGYSYSGRAAA 58
R A+HAGS + + +L +L GWL + E + +IAPHAGYS+ G A
Sbjct: 181 RGAAHAGSCLS--APQLTADLSGWLSLVRPRREDEEFPVQGCKAIIAPHAGYSHCGETGA 238
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+A+ +IDP++ RVF++GPSH + CALS Y TPIG LP+D +V+ L TG F
Sbjct: 239 WAYKSIDPSTTRRVFVIGPSHFWRIDNCALSRCHTYDTPIGTLPVDTDVVVSLYETGLFR 298
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHL-----IKIVPILVGAVNAENEAMYGRLFAKYV 173
M++ DE EHS+EM LPYL +V EG + IKIVPI+VG + E E YGR+ A Y+
Sbjct: 299 WMNLTRDEREHSIEMQLPYLRRVCEGRVMIRKDIKIVPIMVGDITKEQELEYGRILAPYL 358
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-------------------IHKSIEALDK 214
+ F SSDFCHWG + Y Y K I +S LD
Sbjct: 359 AEEGTVFIASSDFCHWGPDYYYTFYFPKPNCAPSEGYQTTRATPPEKGYKIWQSTTQLDH 418
Query: 215 MGMDIIETGDPDAFK------KYLLEYDNTICGRHPISVFL----HMLGNCSTKIKIKFL 264
+GM I+ D AF +YL+E DN+ICGRH I + H+ K ++
Sbjct: 419 IGMGILADQDRSAFTAHRDFHRYLVETDNSICGRHAIGILYGALAHLEETTGRKKFCNWV 478
Query: 265 RYEQSSQCKTKRDSSVSYASAAAKV 289
+Y+QS+Q D+SVSYASA K+
Sbjct: 479 KYDQSAQVTKITDNSVSYASAWIKI 503
>gi|50549243|ref|XP_502092.1| YALI0C21450p [Yarrowia lipolytica]
gi|49647959|emb|CAG82412.1| YALI0C21450p [Yarrowia lipolytica CLIB122]
Length = 319
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 117/312 (37%), Positives = 174/312 (55%), Gaps = 29/312 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAG--LPKS--PEVRGVIAPHAGYSYSGRAAAYA 60
R A+HAGSWY+ N+ +L E+D L L KS P R ++ PHAG +Y+G
Sbjct: 5 RPATHAGSWYSSNATKLGAEVDRHLANGASVLGKSAIPGARVLVGPHAGLAYAGPQLGET 64
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ D +I R+F+LGPSHH Y A S Y+TP G++ +D+E ++L +G + M
Sbjct: 65 YAAFDFKNIKRLFILGPSHHVYLEHAATSAFHSYETPFGNVNVDVETTQKLNDSGVTKYM 124
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ EHS EMH+P+L ++ +KIVPI+VG + E E +L YV+DP+N F
Sbjct: 125 SATTDKDEHSFEMHMPFLKRLVGDQNVKIVPIMVGQTSQEYEKRLAKLLLPYVEDPTNAF 184
Query: 181 SVSSDFCHWGSRFNYMHY--------------------DKKHGVIHKSIEALDKMGMDII 220
+S+DFCHWG+ F Y Y + + I+KSIE LDK GM++
Sbjct: 185 VISTDFCHWGNNFRYWGYADSENCDNVSQSREELRRALKRSNTPIYKSIEYLDKKGMEVA 244
Query: 221 ETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK-----IKIKFLRYEQSSQCKTK 275
D +K+Y + DNTICGR P+++ + ML N + + I ++++ Y QS+Q +
Sbjct: 245 SLTSYDKWKEYCKKTDNTICGRKPLAILISMLENYAIEKGDKPISLEWIGYSQSNQVRDI 304
Query: 276 RDSSVSYASAAA 287
+ SVSYAS A
Sbjct: 305 SEGSVSYASGFA 316
>gi|119178448|ref|XP_001240897.1| hypothetical protein CIMG_08060 [Coccidioides immitis RS]
gi|392867144|gb|EAS29654.2| hypothetical protein CIMG_08060 [Coccidioides immitis RS]
Length = 383
Score = 218 bits (555), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 140/378 (37%), Positives = 185/378 (48%), Gaps = 95/378 (25%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
R A+HAGSWY+DN+ L +LD W+ G+ P R +IAPHAGY+YSG A
Sbjct: 4 REATHAGSWYSDNAATLTRQLDEWMNRVPNEIEGIGSLPVAGARIIIAPHAGYAYSGPCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
A+A+ ++D + R+FLLGPSHH+ K AL + Y TP+ LPLD E+I+EL
Sbjct: 64 AFAYKSLDLSKAKRIFLLGPSHHHPFSKIALPELSSYSTPLSQEPLPLDREIIDELSTRT 123
Query: 112 -KATGKFELMDICVDEAEHSMEMHLPY----LAKVFEGHLI----KIVPILVGAVNAENE 162
T +F M+ +DEAEHS+E+HLPY L +++ G K+VP++VG+ +A E
Sbjct: 124 ENGTVRFTTMNQAIDEAEHSLELHLPYIHYLLQRLYPGEPAASYPKLVPMMVGSTSAPTE 183
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
+GR+ A Y+ +P N F VSSDFCHWG RF Y +Y
Sbjct: 184 QAFGRILAPYLANPENAFIVSSDFCHWGLRFAYAYYVPDAPAPGPVLPLSYAALPQPSEA 243
Query: 199 -------------------------DKKHGV--IHKSIEALDKMGMDIIETGDPDAFKKY 231
K GV IH+SI A D M I +G F
Sbjct: 244 LKLGAARPQISGMSKGRYLRAGDPLPKGAGVPSIHESISACDIACMSAIASGQTQTFLDA 303
Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIK------------------IKFLRYEQS 269
L NTICGRHPI V + +LG +K F+RYE+S
Sbjct: 304 LKSTGNTICGRHPIGVIMAAIEFVLGKDKENMKDLDIKAGDEAQTHLMRGAFNFVRYERS 363
Query: 270 SQCKTKRDSSVSYASAAA 287
S C + DSSVSY SA A
Sbjct: 364 SDCISVTDSSVSYVSAFA 381
>gi|406607120|emb|CCH41508.1| hypothetical protein BN7_1049 [Wickerhamomyces ciferrii]
Length = 317
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 174/314 (55%), Gaps = 32/314 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVRGVIAPHAGYSYSGRAAAYAFG 62
+R ASHAG+WY+ N +L ++LD +L+ A P+S R +I PHAGY+Y+G A +
Sbjct: 3 VRAASHAGAWYSANPLKLNKQLDEYLKAA--PQSVAGARLLIGPHAGYTYAGSTLAETYQ 60
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
D + + RVF+LGPSHH Y AL S Y+TP G+LP+D EV++ LK F+ M
Sbjct: 61 AWDTSKVKRVFILGPSHHVYFKNVALLSGVDEYETPFGNLPVDREVVDSLKNNKIFKEMS 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFEG---HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ VDE EHS EMH PY+ K+ + + IVPI++ + + ++Y+ D +N
Sbjct: 121 LEVDEDEHSFEMHAPYIYKLTQNIPQGIPSIVPIMISHSDENFNKKISSILSEYLKDEAN 180
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSIEALDKMGMDII 220
F +SSDFCHWGSRF Y Y K + IHKSIE LD + I
Sbjct: 181 TFVISSDFCHWGSRFGYTAYTGKGTLDDLKDLSYTTKVPSGGLSIHKSIEFLDILATRIA 240
Query: 221 ETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK-------IKIKFLRYEQSSQCK 273
TG +K Y+ NTICG+ PI++ L + + K +++ Y QSSQ
Sbjct: 241 STGSSKDWKDYIKVTGNTICGQRPIAILLETIEKVKEEGTYDDDWGKFEWIGYSQSSQAT 300
Query: 274 TKRDSSVSYASAAA 287
+ RDSSVSYASA A
Sbjct: 301 SARDSSVSYASAFA 314
>gi|303310391|ref|XP_003065208.1| hypothetical protein CPC735_020910 [Coccidioides posadasii C735
delta SOWgp]
gi|240104868|gb|EER23063.1| hypothetical protein CPC735_020910 [Coccidioides posadasii C735
delta SOWgp]
Length = 381
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 137/376 (36%), Positives = 184/376 (48%), Gaps = 93/376 (24%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
R A+HAGSWY+DN+ L +LD W+ G+ P R +IAPHAGY+YSG A
Sbjct: 4 REATHAGSWYSDNAATLTRQLDEWMNRVPNEIEGIGSLPVAGARIIIAPHAGYAYSGPCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL---- 111
A+A+ ++D + R+FLLGPSHH+ K AL + Y TP+ LPLD E+I+EL
Sbjct: 64 AFAYKSLDLSKAKRIFLLGPSHHHPFSKIALPELSSYSTPLSQEPLPLDREIIDELSTRT 123
Query: 112 -KATGKFELMDICVDEAEHSMEMHLPYLAKVFEG--------HLIKIVPILVGAVNAENE 162
T +F M+ +DEAEHS+E+HLPY+ + +G K+VP++VG+ +A E
Sbjct: 124 ENGTVRFTTMNQAIDEAEHSLELHLPYIHYLLQGLYPGEPAASYPKLVPMMVGSTSAPTE 183
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
+GR+ A Y+ +P N F VSSDFCHWG RF Y +Y
Sbjct: 184 QAFGRILAPYLANPENAFIVSSDFCHWGLRFAYAYYVPDAPAPGPVLPLSYAALPQPSEA 243
Query: 199 -------------------------DKKHGV--IHKSIEALDKMGMDIIETGDPDAFKKY 231
+ GV IH+SI A D M I +G F
Sbjct: 244 LKLGAARPQISGMSKGRYLRAGDQLPRGAGVPSIHESISACDIACMSAIASGHTQTFLDA 303
Query: 232 LLEYDNTICGRHPISVFL----HMLGNCSTKIK----------------IKFLRYEQSSQ 271
L NTICGRHPI V + +LG ++ F+RYE+SS
Sbjct: 304 LKSTGNTICGRHPIGVIMAAIEFVLGKDKENMRDLDIKAGDETHLMRGAFNFVRYERSSD 363
Query: 272 CKTKRDSSVSYASAAA 287
C + DSSVSY SA A
Sbjct: 364 CISVTDSSVSYVSAFA 379
>gi|212537757|ref|XP_002149034.1| DUF52 domain protein [Talaromyces marneffei ATCC 18224]
gi|210068776|gb|EEA22867.1| DUF52 domain protein [Talaromyces marneffei ATCC 18224]
Length = 365
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 132/332 (39%), Positives = 167/332 (50%), Gaps = 66/332 (19%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
RR +HAGSWY D++ L ELDGWL G P R +I PHAGY+YSG A
Sbjct: 15 RRPTHAGSWYEDDTTTLKAELDGWLDAVPGEIEPFGALPIPGARIIIGPHAGYAYSGPCA 74
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL-KAT 114
A+A+ +D + R+FLLGPSHHY AL T Y TP+ D LPLD+EVI+++ A+
Sbjct: 75 AWAYKALDLSKAKRIFLLGPSHHYPLATIALPEVTSYATPVSDEPLPLDIEVIDKICNAS 134
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAK----VFEGHLI----KIVPILVGAVNAENEAMYG 166
FE+M +DE EHSME+HLPY+ + F G +VPI+VG+ AE E G
Sbjct: 135 STFEIMSRRIDEREHSMELHLPYIHRKLQLTFPGRPAAEYPPLVPIMVGSTKAETERAVG 194
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---------------------------- 198
L A Y+ D SN F +SSDFCHWG RF Y +Y
Sbjct: 195 ALLATYLADSSNAFIISSDFCHWGQRFGYTYYVPDAPQQKPPLPLSFDNLPQPPTTVSEA 254
Query: 199 -DKKHGV--------------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
D V I+ SI A+D M I TG F K L NT+CGRH
Sbjct: 255 EDAIQKVSGRSLRSNKEIQTEIYDSISAVDIATMKAITTGSTTQFLKILRSTGNTVCGRH 314
Query: 244 PISVFLHMLGNCSTKI-----KIKFLRYEQSS 270
PI V + L ++ K F+RYE+SS
Sbjct: 315 PIGVIMAALEAYASASGEEVGKFHFIRYERSS 346
>gi|315041727|ref|XP_003170240.1| MEMO1 protein [Arthroderma gypseum CBS 118893]
gi|311345274|gb|EFR04477.1| MEMO1 protein [Arthroderma gypseum CBS 118893]
Length = 379
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 137/375 (36%), Positives = 178/375 (47%), Gaps = 91/375 (24%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL-----REAGLPK--SPEVRGVIAPHAGYSYSGRA 56
+RR +HAGSWY+DN L +LD WL GL K P R +IAPHAGYSYSG
Sbjct: 3 VRRPTHAGSWYSDNRATLQAQLDQWLDLVPGELEGLGKLPVPGARVIIAPHAGYSYSGPC 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELK-- 112
AA+A+ +D + R+FL+GPSHH + AL T Y TP+ LPLD +V+ EL+
Sbjct: 63 AAFAYKCLDLSKAKRIFLIGPSHHRHLTTIALPELTGYLTPLSSTPLPLDTKVLGELESH 122
Query: 113 ---ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAEN 161
+ +F M VDEAEHSME+HLPY+ ++ + +VP++VG+ +A
Sbjct: 123 ASSSAKRFTTMSPSVDEAEHSMELHLPYIHRLLQRLYPSQPTSAYPPLVPMMVGSTSAAT 182
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD---------------------- 199
E +G + Y+ D N F +SSDFCHWGSRF Y +Y
Sbjct: 183 ERAFGSILGPYLADEENAFVISSDFCHWGSRFAYTYYTTHAADAPYKMPLTYPSLPVPSD 242
Query: 200 ------------------------------KKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
K IH+SI A D M I +G+ D F
Sbjct: 243 ELTAAKAAEEVEVVSSGRMLGEKDKSYYQAKDACAIHESISACDIACMSAIASGNTDVFL 302
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCS-----------------TKIKIKFLRYEQSSQC 272
L NT+CGRHPI V + + K K FLRYE+SS C
Sbjct: 303 DALRRTGNTVCGRHPIGVVMAGMEGLQHDQAETTTTTTTEDGGVRKGKFYFLRYERSSDC 362
Query: 273 KTKRDSSVSYASAAA 287
+ RDSSVSY SA A
Sbjct: 363 FSLRDSSVSYVSAFA 377
>gi|310795369|gb|EFQ30830.1| memo-like protein [Glomerella graminicola M1.001]
Length = 325
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 176/320 (55%), Gaps = 36/320 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R G WY ++ RLA++LD +L + GLP P R +IAPHAGYSYSG A
Sbjct: 7 RPPGKQGEWYLGDADRLAKQLDEFLADVPDQIDGQGLP-IPGARVIIAPHAGYSYSGATA 65
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +D + RVFLLGPSH YY CA++T Y TP G+L +D E + ++
Sbjct: 66 AWAYKALDLSKAKRVFLLGPSHTYYLRGCAVTTYKKYGTPWGELRIDEETTDAIRRRDGV 125
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYGRLFAK 171
E M + ++ EHS+EMHLPYL K + + +VPILVG N E G+ A+
Sbjct: 126 EDMPVRNEDREHSLEMHLPYLFKRLQQTFGSPDNFPTLVPILVGDNNKAEEKEVGKWLAE 185
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNY-----------MHYDKKHGV------IHKSIEALDK 214
Y+ DP N F VSSDFCHWGS F+Y M + HG IH++++ +D
Sbjct: 186 YLRDPDNAFIVSSDFCHWGSHFDYTVYCPDGDVTKMQKLRPHGSAPDGPPIHETVKLVDD 245
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS-----TKIKIKFLRYEQS 269
+ ++ I+TG AF L + NT+CGRHPI V + L + + +F++Y++S
Sbjct: 246 LAIEAIKTGRHSAFYDNLKQTKNTVCGRHPIGVTMAALEELGGEGQEQQHRFRFIQYQRS 305
Query: 270 SQCKTKRDSSVSYASAAAKV 289
S DSSVSY SA A V
Sbjct: 306 SLVTGPGDSSVSYVSAYAVV 325
>gi|452847073|gb|EME49005.1| hypothetical protein DOTSEDRAFT_67898 [Dothistroma septosporum
NZE10]
Length = 276
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 120/271 (44%), Positives = 155/271 (57%), Gaps = 28/271 (10%)
Query: 43 VIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP 102
+IAPHAGYSYSG AAA+A+ D + +VFLLGPSHH+Y K ALS T Y TPIG+L
Sbjct: 2 IIAPHAGYSYSGPAAAWAYKAWDVSKAKKVFLLGPSHHHYLTKAALSKCTEYATPIGNLK 61
Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE-------GHLIKIVPILVG 155
++ EV EL +TG+FE M VDE EHS+EMHLPY+ K + +VPI+VG
Sbjct: 62 VNREVTAELHSTGQFEWMSRSVDEDEHSLEMHLPYIYKRLSQTFGEDAANFPTLVPIMVG 121
Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG------------ 203
+A E GR+ A Y+ D SN F VSSDF HWG RF Y +Y G
Sbjct: 122 NTSASTEKSLGRILAPYLADRSNVFVVSSDFAHWGLRFRYTYYQPATGNAIDLTSSAKLP 181
Query: 204 ---VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI- 259
IH+SI+ +D M E+G +A+ L + NT+CGRHPI V + + +
Sbjct: 182 KDPAIHESIKVVDFQCMGACESGSHEAWLNVLDKTGNTVCGRHPIGVMMAAVEKLREDLD 241
Query: 260 -----KIKFLRYEQSSQCKTKRDSSVSYASA 285
KF++Y++SS DSSVSYASA
Sbjct: 242 RLDAGAFKFVKYDRSSLVNRVGDSSVSYASA 272
>gi|322702109|gb|EFY93857.1| DUF52 domain-containing protein [Metarhizium acridum CQMa 102]
Length = 323
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 174/321 (54%), Gaps = 39/321 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGR 55
+R A AGSWY + L ELD +L + +P++ P R +IAPHAGY++SG
Sbjct: 3 LRSAGKAGSWYVASPNNLGSELDEYLSD--VPETVDGSTLPIPGARIIIAPHAGYTFSGP 60
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+A+ +D + RVF+LGPSH YY CA++T + Y TP G+L +D ++I+ +K
Sbjct: 61 CAAWAYKCLDLSKAKRVFVLGPSHTYYLDGCAVTTYSKYVTPFGNLTVDRDIIQRVKEAA 120
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRLF 169
+ + + D AEHS+EMHLPYL K E + IVPIL+G N + E G++
Sbjct: 121 QMDDIPPSRDSAEHSLEMHLPYLYKRCEQTFGSPENFPTIVPILIGDNNRDEEKAVGQVL 180
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-----------------VIHKSIEAL 212
Y+ D N F +SSDFCHWG+ F YM Y + IH++I L
Sbjct: 181 VPYLKDEENAFIISSDFCHWGAHFQYMVYSPDNDPARLINLHKGDKAPIGPPIHETIRLL 240
Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS------TKIKIKFLRY 266
DK MD +E+G DAF L NT+CGRHPI V + L S K + + +Y
Sbjct: 241 DKAAMDAVESGHHDAFVDNLKLTKNTVCGRHPIGVAMAALELLSGEVADVKKCRFRITQY 300
Query: 267 EQSSQCKTKRDSSVSYASAAA 287
++S+ + D SVSY SA A
Sbjct: 301 QRSNLVEKPSDFSVSYVSAYA 321
>gi|402078969|gb|EJT74234.1| hypothetical protein GGTG_08077 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 332
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 141/333 (42%), Positives = 179/333 (53%), Gaps = 45/333 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVRGVIAPHAGYSYS 53
M +R ASHAG WYT L +LD +L ++ LP R +IAPHAGYSYS
Sbjct: 1 MSGVRTASHAGDWYTAEPDGLEHDLDRYLDAVPEQINDSDLPVK-GARVIIAPHAGYSYS 59
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AA+A+ +D + RVF+LGPSH YY AL+T Y TP+GDL +D++ + L+A
Sbjct: 60 GPCAAWAYKALDLGAAKRVFILGPSHTYYLSGAALTTYAAYATPLGDLRVDVDTVAALRA 119
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL---------IKIVPILVGAVNAENEAM 164
TG+F + DE EHS+EMHLPYLAK +VPI+VG E E
Sbjct: 120 TGRFSDVPRQRDEDEHSLEMHLPYLAKRLAQTFGPQDTPAAWPPVVPIVVGDNGREAERA 179
Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---DKKHGV---------------IH 206
G L Y+ D + F VSSDFCHWGSRF+Y Y + GV IH
Sbjct: 180 LGELLVPYLRDRDSAFIVSSDFCHWGSRFSYTAYTPDGTEAGVRSLTRRDAAPDLPVPIH 239
Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC------STKIK 260
+SI ALD M M IE+G DAF L NT+CGRHPI V + L S+ +K
Sbjct: 240 ESISALDHMAMGAIESGSHDAFFDNLKATGNTVCGRHPIGVVMAGLEALRKEKAESSGLK 299
Query: 261 ----IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
F++Y++SS RDSSVSY SA A +
Sbjct: 300 GDGVFTFVKYDRSSLVNGLRDSSVSYGSAYAII 332
>gi|302409576|ref|XP_003002622.1| MEMO1 [Verticillium albo-atrum VaMs.102]
gi|261358655|gb|EEY21083.1| MEMO1 [Verticillium albo-atrum VaMs.102]
Length = 329
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/328 (39%), Positives = 174/328 (53%), Gaps = 42/328 (12%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYS 53
++R+ AGSWY ++KRLAE+L+G+L + +P P R +IAPHAGYSYS
Sbjct: 4 REVRKPGKAGSWYLADAKRLAEQLEGFLDD--VPSQINSSDVPIPGARVIIAPHAGYSYS 61
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AA+A+ ++D + RVFLLGPSH +Y CA++T Y TP G++ +D EV+ L+A
Sbjct: 62 GPTAAWAYKSLDLSQTRRVFLLGPSHTFYLKGCAVTTFKHYGTPFGNIRVDEEVVSTLRA 121
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYGR 167
M + D EHS+EMHLPYL +F +VPILVG E G
Sbjct: 122 ELSIPDMPLSNDNKEHSLEMHLPYLWTMFTKAFDSPDEFPTLVPILVGDGTKTAERAVGA 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSI 209
Y+ DP+N F VSSDFCHWG F+Y Y V IH++I
Sbjct: 182 WLLPYLRDPTNAFVVSSDFCHWGDNFSYTPYSPDAQVDDTLQHISSRSRVPAGRPIHETI 241
Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST--------KIKI 261
+ LD + I+TG+ F L NT+CGRHPI V + L + K +
Sbjct: 242 KELDDQAIAAIKTGEYAQFYDNLSLTKNTVCGRHPIGVMMAALELLAEEEGRGGEDKGRF 301
Query: 262 KFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+F+RYE+S+ + D SVSY SA A V
Sbjct: 302 QFVRYERSNLVEDADDMSVSYVSAYAVV 329
>gi|71747482|ref|XP_822796.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832464|gb|EAN77968.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 323
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 183/316 (57%), Gaps = 27/316 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG---LPKSPEVRGVIAPHAGYSYSGRAA 57
M RRA+HAGSWY + L +D A L +S ++ GVI+PHAG SYSG A
Sbjct: 1 MTYSRRATHAGSWYEGSPNALKALVDTLFSTASKSPLKESEKMIGVISPHAGISYSGNTA 60
Query: 58 AYAFGNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDL----E 106
++ + ++ SI+R+FLLGPSHH + A Y+TP G L ++ E
Sbjct: 61 SHVYVHLRDYIYGHKGRSITRIFLLGPSHHKGFDGVEVCAAQRYETPFGPLVVNAKVGQE 120
Query: 107 VIEELKATG-KFELMDICVDEAEHSMEMHLPYLAKVF----EGH-----LIKIVPILVGA 156
V +EL+A G M DE EHS+EM LP+++ + G+ +++VP+L+G
Sbjct: 121 VEKELRAAGVPVGTMHRMTDEDEHSIEMQLPFISHLLHYPPNGYKPAMDRVELVPLLIGG 180
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-KHGVIHKSIEALDK 214
N + E + G + +KY+ D NFF +SSDFCHWG+RF YM HY+K ++ I +I ++D
Sbjct: 181 TNRKMENLIGSVLSKYLKDNQNFFVISSDFCHWGARFQYMYHYEKAEYPDIGDAIISMDH 240
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
GM ++E D D + KYL +NTICGR PISV + L + + ++FL Y QS++CK
Sbjct: 241 EGMRLLEARDMDGWYKYLSTTNNTICGRRPISVLMAALDSKKEAV-VRFLHYSQSNRCKN 299
Query: 275 KRDSSVSYASAAAKVD 290
DSSVSYA A +D
Sbjct: 300 MSDSSVSYAGAIVTLD 315
>gi|342878232|gb|EGU79587.1| hypothetical protein FOXB_09870 [Fusarium oxysporum Fo5176]
Length = 324
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 125/324 (38%), Positives = 172/324 (53%), Gaps = 39/324 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
M R AS AGSWY + K L EL +L A +P+S P R ++APHAGY +
Sbjct: 1 MTTTRPASKAGSWYEGDQKLLRRELQNYL--AAVPESFDGVALPIPGARVIVAPHAGYKF 58
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
SG AA+A+ +D + RV +LGPSH YY CA +T Y TP GDL +D E+ EL+
Sbjct: 59 SGPCAAWAYKTLDLSRAKRVIVLGPSHTYYLEGCAATTFEKYATPFGDLEIDTELATELE 118
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYL----AKVFEG--HLIKIVPILVGAVNAENEAMYG 166
E M + EHS+EMH+PYL + F+ KIVP+LVG+ N + E + G
Sbjct: 119 DAVAMEPMSRRGEVNEHSLEMHMPYLYLRCQETFDSPDKFPKIVPVLVGSNNRQEEKVIG 178
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD----------------KKHG-VIHKSI 209
R Y+ DP N F +SSDFCHWG F+Y+ Y K +G IH++I
Sbjct: 179 RALLPYLRDPENVFIISSDFCHWGDNFSYLPYSSTKSPSDLTQLQKESPKPNGPPIHETI 238
Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV------FLHMLGNCSTKIKIKF 263
+D+ MD +++G DAF L + NT+CGRHPI V F+ K +
Sbjct: 239 RVIDEAAMDAVKSGSHDAFLDSLKQTRNTVCGRHPIGVVMAALEFIRQEDAFQDKGRFSI 298
Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
++Y++S+ + D SVSY S A
Sbjct: 299 IKYDRSNLVEYPNDMSVSYVSGYA 322
>gi|261332591|emb|CBH15586.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 323
Score = 213 bits (542), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 183/316 (57%), Gaps = 27/316 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG---LPKSPEVRGVIAPHAGYSYSGRAA 57
M RRA+HAGSWY + L +D A L +S ++ GVI+PHAG SYSG A
Sbjct: 1 MTYSRRATHAGSWYEGSPNALKALVDTLFSTASKSPLKESEKMIGVISPHAGISYSGNTA 60
Query: 58 AYAFGNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDL----E 106
++ + ++ SI+R+FLLGPSHH + A Y+TP G L ++ E
Sbjct: 61 SHVYVHLRDYIYGHKGRSITRIFLLGPSHHKGFDGVEVCAAQRYETPFGPLVVNAKVGQE 120
Query: 107 VIEELKATG-KFELMDICVDEAEHSMEMHLPYLAKVF----EGH-----LIKIVPILVGA 156
V +EL+A G M DE EHS+EM LP+++ + G+ +++VP+L+G
Sbjct: 121 VEKELRAAGVPVGTMHRMTDEDEHSIEMQLPFISHLLHYPPNGYKPAMDRVELVPLLIGG 180
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-KHGVIHKSIEALDK 214
N + E + G + +KY+ D NFF +SSDFCHWG+RF YM HY+K ++ I +I ++D
Sbjct: 181 TNRKMENLIGSVLSKYLKDNQNFFVISSDFCHWGARFQYMYHYEKAEYPDIGDAIISMDH 240
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
GM ++E D D + KYL +NTICGR PISV + L + + ++FL Y QS++CK
Sbjct: 241 EGMRLLEARDMDGWYKYLSTTNNTICGRRPISVLMAALDSKKEAV-VRFLHYSQSNRCKN 299
Query: 275 KRDSSVSYASAAAKVD 290
DSSVSYA A +D
Sbjct: 300 MSDSSVSYAGAIVTLD 315
>gi|378726099|gb|EHY52558.1| hypothetical protein HMPREF1120_00769 [Exophiala dermatitidis
NIH/UT8656]
Length = 388
Score = 213 bits (542), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 139/384 (36%), Positives = 182/384 (47%), Gaps = 100/384 (26%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPK--SPEVRGVIAPHAGYSYSG 54
+R ASHAGSWY+DNS++LA +LD WL G+ K P R VIAPHAGY+YSG
Sbjct: 3 VREASHAGSWYSDNSRQLASQLDLWLSRVPEKDILPGIEKLPVPGARAVIAPHAGYAYSG 62
Query: 55 RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELK 112
AA+A+ +D + R+F+L PSHH + AL Y+TP+ D LPLD E I EL
Sbjct: 63 PCAAWAYKCLDLSQAKRIFILHPSHHRHLRTAALPVVDAYETPLSDQPLPLDRETIHELS 122
Query: 113 A---------TGKFELMDICVDEAEHSMEMHLPYLAKVF--------EGHLIKIVPILVG 155
+ T KF M VDEAEHS EM LPY+ ++ E +VPI+VG
Sbjct: 123 SLSATNENGETVKFTTMSKSVDEAEHSAEMQLPYIHRLLQKLYPGQPESSYPPLVPIMVG 182
Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------- 198
+ E G++ A Y+ D N F +SSDFCHWGSRF Y +Y
Sbjct: 183 GTSVTTEQTLGKMLAPYIADEQNAFVISSDFCHWGSRFGYTYYVPDAPSPALSPLMLPNG 242
Query: 199 -------------DKKHGV---------------------IHKSIEALDKMGMDIIETGD 224
+ H V I++SI +D+ M I TG
Sbjct: 243 VRGEFTTAKESTNEDVHKVPTLGQGEELRSNTRIPKGGPQIYESIAHVDRACMCAIATGS 302
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCS---------------------TKIKIKF 263
F + L NT+CGRHPI VF+ + T+ K +F
Sbjct: 303 HSEFCRVLRITGNTVCGRHPIGVFMAGVEEVERNKRDKGEHAKADTDPDTTDPTRGKFRF 362
Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
+RYE+SS + +DSSVSY SA A
Sbjct: 363 MRYERSSDVISPKDSSVSYVSAFA 386
>gi|322711962|gb|EFZ03535.1| hypothetical protein MAA_00609 [Metarhizium anisopliae ARSEF 23]
Length = 323
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 123/321 (38%), Positives = 173/321 (53%), Gaps = 39/321 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGR 55
+R A AGSWY + L ELD +L + +P++ P R +IAPHAGY++SG
Sbjct: 3 LRTAGKAGSWYVASPDDLGSELDEYLSD--VPETVDDSTLPIPGARIIIAPHAGYTFSGP 60
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+A+ +D + RVF+LGPSH YY CA++T + Y TP G+L +D ++I+ +K
Sbjct: 61 CAAWAYKCLDLSKAKRVFVLGPSHTYYLDGCAVTTYSKYATPFGNLTVDRDIIQRVKEAA 120
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRLF 169
+ + + D AEHS+EMHLPYL K E + IVPIL+G N + E G++
Sbjct: 121 QMDDIPPSRDSAEHSLEMHLPYLYKRCEQTFGSPENFPTIVPILIGDNNRDEEKAIGQVL 180
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-----------------VIHKSIEAL 212
Y+ D N F +SSDFCHWG+ F YM Y + IH++I L
Sbjct: 181 VPYLKDEENAFIISSDFCHWGAHFQYMVYSPDNDPAQLINLRRGDKAPAGPPIHETIRLL 240
Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCS------TKIKIKFLRY 266
DK MD +E+G D F L NT+CGRHPI V + L S K + + +Y
Sbjct: 241 DKAAMDAVESGHHDVFVDNLKLTKNTVCGRHPIGVAMAALELLSGEVADARKCRFRITQY 300
Query: 267 EQSSQCKTKRDSSVSYASAAA 287
++S+ + D SVSY SA A
Sbjct: 301 QRSNLVEKPSDFSVSYVSAYA 321
>gi|340518150|gb|EGR48392.1| predicted protein [Trichoderma reesei QM6a]
Length = 323
Score = 213 bits (541), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 175/320 (54%), Gaps = 39/320 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGRA 56
R AS AGSWY +L + LD +L A +P++ P R +IAPHAGY+YSG
Sbjct: 4 RPASKAGSWYEKTPAKLMKTLDNYL--ADVPETVDGSSLPIPGARVIIAPHAGYAYSGPN 61
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA+A+ +D + RVF+LGPSH Y CA++T + + TP GD +D +V+E +K G
Sbjct: 62 AAWAYSCLDLSKAKRVFILGPSHTYGLSGCAVTTFSKWATPFGDFTVDRDVLERVKEAGG 121
Query: 117 FELMDICVDEAEHSMEMHLPYLAK----VFEG--HLIKIVPILVGAVNAENEAMYGRLFA 170
+ D AEHS+EMHLPYL K FE IVPI+VG+++ E GR+
Sbjct: 122 MGDVPPENDVAEHSLEMHLPYLYKRCQQTFESPEDFPTIVPIIVGSLSRSREKDVGRVLL 181
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEALD 213
Y+ D N F +SSDFCHWG+RF Y Y + I+++I LD
Sbjct: 182 SYLKDEENAFIISSDFCHWGTRFGYAVYSPNGDIQRLTTLHDYSPKPSGPPIYETIRLLD 241
Query: 214 KMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLG------NCSTKIKIKFLRYE 267
+ MD +++G DAF + L NT+CGRHPI V + L N K + K ++Y+
Sbjct: 242 EAAMDAVKSGSHDAFVENLRHTGNTVCGRHPIGVAMAALELYAKELNEEDKSRFKIVKYD 301
Query: 268 QSSQCKTKRDSSVSYASAAA 287
+SS+ DSSVSY SA A
Sbjct: 302 RSSEVVWPDDSSVSYVSAYA 321
>gi|358381478|gb|EHK19153.1| hypothetical protein TRIVIDRAFT_69842 [Trichoderma virens Gv29-8]
Length = 323
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 174/320 (54%), Gaps = 39/320 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGRA 56
R A+ AGSWY + + LA ELD +L A +P+ P R +IAPHAGY+YSG
Sbjct: 4 RPATKAGSWYEKDPEILAGELDEYL--AKVPEDVDGSSLPIPGARIIIAPHAGYAYSGPN 61
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA+A+ +D + RVF+LGPSH Y CA++T + Y TP GD +D + IE +K G
Sbjct: 62 AAWAYSCLDLSKAKRVFILGPSHTYGFSGCAVTTFSHYSTPFGDFTVDRDTIERVKEAGG 121
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRLFA 170
+ D AEHS+EMHLPYL K E K+VPI+VG+ + +E GRL
Sbjct: 122 MKNAPPRNDVAEHSLEMHLPYLYKRCEQTFKSPEEFPKVVPIIVGSTSRADEKDIGRLLL 181
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEALD 213
Y+ D N F +SSDFCHWGSRF+Y Y + I+++I LD
Sbjct: 182 SYLKDEENAFIISSDFCHWGSRFDYAVYTPDGNIGKLSSLHDRSPKPSGPPIYETIRLLD 241
Query: 214 KMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLG------NCSTKIKIKFLRYE 267
+ MD +++G DAF L NT+CGRHPI + + L N + K ++Y+
Sbjct: 242 EAAMDAVKSGSHDAFVDNLRLTGNTVCGRHPIGIAMAALELYAKELNDEGMSRFKVVKYD 301
Query: 268 QSSQCKTKRDSSVSYASAAA 287
+SS+ DSSVSY SA A
Sbjct: 302 RSSEVIWPDDSSVSYVSAYA 321
>gi|326482860|gb|EGE06870.1| DUF52 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 377
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 132/375 (35%), Positives = 182/375 (48%), Gaps = 88/375 (23%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWL-----REAGLPK--SPEVRGVIAPHAGYSYS 53
M IR +HAGSWY+D+ L +LD WL GL K P R +IAPHAGY+YS
Sbjct: 1 MMSIRSHTHAGSWYSDHGPTLEAQLDQWLDLVPGELEGLGKLPVPGARVIIAPHAGYAYS 60
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEEL 111
G AA+A+ +D + R+FL+GPSHH + A+ T Y TP+ LPLD E++ ++
Sbjct: 61 GPCAAFAYKCLDLSKAKRIFLIGPSHHLHLTTIAVPQLTEYLTPLSPMPLPLDTEMLAKI 120
Query: 112 K-----ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVN 158
+ A F M VDE EHSME+HLPY+ ++ + +VP++VG+ +
Sbjct: 121 EHEASLADRPFSKMSRSVDEQEHSMELHLPYIHRLLQRLYPTMPTSAYPPLVPMMVGSTS 180
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--------------- 203
A NE ++G + Y+ D N F +SSDFCHWGSRF Y +Y + G
Sbjct: 181 AANERVFGSILRPYLADEENAFVISSDFCHWGSRFAYTYYVQAPGPTTYKLPLTYPSLPV 240
Query: 204 -------------------------------------VIHKSIEALDKMGMDIIETGDPD 226
IH+S+ A D M I +G+ D
Sbjct: 241 PSDMLTADGAAEAVRAVSSGRMLRDNDKSYNKEKYMWAIHESVSACDMACMSAIASGNTD 300
Query: 227 AFKKYLLEYDNTICGRHPISVFL--------------HMLGNCSTKIKIKFLRYEQSSQC 272
F++ L NT+CGRHPI V + M + K K F+RYE+SS C
Sbjct: 301 VFQEALRRTGNTVCGRHPIGVVMAGIEGLQLQHQAEAAMTQDVVKKGKFYFIRYERSSDC 360
Query: 273 KTKRDSSVSYASAAA 287
+ +DSSVSY SA A
Sbjct: 361 FSVQDSSVSYVSAFA 375
>gi|146422342|ref|XP_001487111.1| hypothetical protein PGUG_00488 [Meyerozyma guilliermondii ATCC
6260]
gi|146388232|gb|EDK36390.1| hypothetical protein PGUG_00488 [Meyerozyma guilliermondii ATCC
6260]
Length = 328
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 170/320 (53%), Gaps = 41/320 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA--------GLPKSPEVRGVIAPHAGYSYSGR 55
+R A+HAGSWY+ N+ LA +++ ++ +A G P P R +I PHAGY+YSG
Sbjct: 3 VRPATHAGSWYSANNASLASQMERFISKAQNNLKKSHGGPHVPGARVLIGPHAGYTYSGT 62
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
A + D T + RVF+LGPSHH Y++ +S Y+TP G+L +D +V EL
Sbjct: 63 QLAETYEAWDTTGVKRVFILGPSHHVYFSSTAKVSKFDSYQTPFGNLDVDTKVCSELVDK 122
Query: 115 GKFELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAENEAMYGRL--- 168
G F M DE EHS EMH P++ K KIVPI++ A+N E +Y ++
Sbjct: 123 GAFSYMTEEEDENEHSFEMHAPFIRYKTKDLPHGSPKIVPIMISAMN---ERLYNKIVKA 179
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK----HGV-------------------I 205
Y D SN F+VSSDFCHWG+RF Y Y +K G+ I
Sbjct: 180 LEPYFADKSNTFAVSSDFCHWGARFGYTKYLQKIPDSEGITSQSLVSLKSSSQLVQSIPI 239
Query: 206 HKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLR 265
H+SIE LDK M I G + +Y+ E NTICG+ PISV L +L N ++
Sbjct: 240 HRSIEILDKEAMKIASKGTHTDWNRYIDETQNTICGQKPISVVLRLLDNSGAHPGFSWIG 299
Query: 266 YEQSSQCKTKRDSSVSYASA 285
Y QS + + +SSVSYAS
Sbjct: 300 YSQSGEVTSPTESSVSYASG 319
>gi|302923736|ref|XP_003053739.1| hypothetical protein NECHADRAFT_98907 [Nectria haematococca mpVI
77-13-4]
gi|256734680|gb|EEU48026.1| hypothetical protein NECHADRAFT_98907 [Nectria haematococca mpVI
77-13-4]
Length = 324
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 173/324 (53%), Gaps = 39/324 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
M R A+ AGSWY N+K L EL L A +P+S P R +IAPHAGY +
Sbjct: 1 MVATRPAAKAGSWYKSNAKTLQAELLNNL--AAVPESLDGASLPIPGARIIIAPHAGYEF 58
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
SG AA+A+ +D + RVF+LGPSH YY CA +T Y+TP G+L +D + ++ +
Sbjct: 59 SGPCAAWAYKTLDLSRAKRVFVLGPSHTYYLEGCAATTFDKYETPFGELTVDRALAKKFE 118
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYL----AKVF--EGHLIKIVPILVGAVNAENEAMYG 166
+LM + EHS+EMH+PYL + F KIVP+LVG+ N E G
Sbjct: 119 DAASMDLMPQRNEVLEHSLEMHMPYLYLRCQQTFGSPDKFPKIVPVLVGSNNGPEEKEVG 178
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-----------------VIHKSI 209
R Y+ DP N F +SSDFCHWG F+YM Y IH++I
Sbjct: 179 RALLPYLKDPENAFIISSDFCHWGHNFSYMVYSPTSSPGDLVKLRRQDRAPAGPPIHETI 238
Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVF---LHMLG---NCSTKIKIKF 263
A+D+ MD +++G DAF L + NT+CGRHPI V L +LG K + +
Sbjct: 239 RAIDEAAMDGVKSGSHDAFLATLKQTKNTVCGRHPIGVMMAALELLGQEPQYQGKGRFQV 298
Query: 264 LRYEQSSQCKTKRDSSVSYASAAA 287
++Y++S+ + D SVSY SA A
Sbjct: 299 IQYDRSNLVQEPDDMSVSYVSAYA 322
>gi|116200592|ref|XP_001226108.1| hypothetical protein CHGG_10841 [Chaetomium globosum CBS 148.51]
gi|88175555|gb|EAQ83023.1| hypothetical protein CHGG_10841 [Chaetomium globosum CBS 148.51]
Length = 298
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 126/324 (38%), Positives = 164/324 (50%), Gaps = 68/324 (20%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSW ++ HAGYSYSG AA+A+ +
Sbjct: 4 REASHAGSWMRNS-----------------------------HAGYSYSGPCAAWAYKAL 34
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
D S RVF+LGPSH YY CAL++ Y+TP GDL +D EL+ T +F M
Sbjct: 35 DLRSAKRVFILGPSHTYYLRGCALTSFDKYETPFGDLVVDNATTSELRQTNRFSEMPKRS 94
Query: 125 DEAEHSMEMHLPYLAKVFE-----GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
D +EHS+EMHLPYL K E G I+PILVG + + E +G+L +KY+ DP +
Sbjct: 95 DVSEHSLEMHLPYLRKRLEQTFGDGDYPPIIPILVGDASGDEEKSWGQLLSKYLKDPESA 154
Query: 180 FSVSSDFCHWGSRFNY---------MHYDKKHGV------------------IHKSIEAL 212
+ VSSDFCHWGSRF Y D G IH+ IE L
Sbjct: 155 WIVSSDFCHWGSRFTYRPKFRDGVIRDLDNDRGALQVTPDWSQVAEDPESPEIHEVIEVL 214
Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML-------GNCSTKIKIKFLR 265
D+M +D +E+G F K + + NT+CGRHPI V + L G K F++
Sbjct: 215 DQMAIDAVESGSHSEFSKVIRQTQNTVCGRHPIGVMMAALEAVAEERGLEGKNGKFSFVQ 274
Query: 266 YEQSSQCKTKRDSSVSYASAAAKV 289
Y++S+ K +D SVSYASA A V
Sbjct: 275 YQRSNLVKRDKDFSVSYASAYAVV 298
>gi|326474665|gb|EGD98674.1| hypothetical protein TESG_06154 [Trichophyton tonsurans CBS 112818]
Length = 376
Score = 209 bits (533), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 131/372 (35%), Positives = 180/372 (48%), Gaps = 88/372 (23%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL-----REAGLPK--SPEVRGVIAPHAGYSYSGRA 56
IR +HAGSWY+D+ L +LD WL GL K P R +IAPHAGY+YSG
Sbjct: 3 IRSHTHAGSWYSDHGPTLEAQLDQWLDLVPGELEGLGKLPVPGARVIIAPHAGYAYSGPC 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELK-- 112
AA+A+ D + R+FL+GPSHH + A+ T Y TP+ LPLD E++ +++
Sbjct: 63 AAFAYKCFDLSKAKRIFLIGPSHHLHLTTIAVPQLTEYLTPLSPMPLPLDTEMLAKIEHE 122
Query: 113 ---ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAEN 161
A F M VDE EHSME+HLPY+ ++ + +VP++VG+ +A N
Sbjct: 123 ASLADRPFSKMSRSVDEQEHSMELHLPYIHRLLQRLYPTMPTSAYPPLVPMMVGSTSAAN 182
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG------------------ 203
E ++G + Y+ D N F +SSDFCHWGSRF Y +Y + G
Sbjct: 183 ERVFGSILRPYLADEENAFVISSDFCHWGSRFAYTYYVQAPGPTTYKLPLTYPSLPVPSD 242
Query: 204 ----------------------------------VIHKSIEALDKMGMDIIETGDPDAFK 229
IH+S+ A D M I +G+ D F+
Sbjct: 243 MLTADGAAEAVRAVSSGRMLRDNDKSYNKEKYMWAIHESVSACDMACMSAIASGNTDVFQ 302
Query: 230 KYLLEYDNTICGRHPISVFL--------------HMLGNCSTKIKIKFLRYEQSSQCKTK 275
+ L NT+CGRHPI V + M + K K F+RYE+SS C +
Sbjct: 303 EALRRTGNTVCGRHPIGVVMAGIEGLQQQHQAEAAMTQDVVKKGKFYFIRYERSSDCFSV 362
Query: 276 RDSSVSYASAAA 287
+DSSVSY SA A
Sbjct: 363 QDSSVSYVSAFA 374
>gi|358253785|dbj|GAA53778.1| hypothetical protein CLF_111105, partial [Clonorchis sinensis]
Length = 729
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 101/217 (46%), Positives = 142/217 (65%), Gaps = 6/217 (2%)
Query: 76 GPSHHYYTPK-CALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMH 134
GPSHH K CA+ ++Y+TP+ DL +D + +E TG+F DEAEHS+EM
Sbjct: 508 GPSHHVDIGKYCAIPEVSIYQTPLADLEVDHSICDEFAKTGRFVKFSKAQDEAEHSIEMQ 567
Query: 135 LPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFN 194
LPY+AKV +G+ IVP++VG + E EA YG + A Y+ DP F +SSDFCHWG RF
Sbjct: 568 LPYIAKVMQGNPFTIVPLVVGTLTPEIEAAYGEILAPYLADPETVFVISSDFCHWGRRFR 627
Query: 195 YMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGN 254
+ +YD+ G I +SIE LD GM+ IE+ +P+ F YL +Y NTICGR I+V L+ +
Sbjct: 628 FQYYDQADGEIWQSIEKLDLQGMNAIESLNPEEFTSYLRKYGNTICGRRGINVLLNTIQK 687
Query: 255 CS-----TKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ + +++FL+Y QSSQC++ DSSVSYA+A+
Sbjct: 688 MNRDRSKSGHELRFLKYAQSSQCRSLDDSSVSYAAAS 724
>gi|357622477|gb|EHJ73941.1| hypothetical protein KGM_12231 [Danaus plexippus]
Length = 302
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 160/307 (52%), Gaps = 75/307 (24%)
Query: 21 LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHH 80
L+ +LD WL +A L P R +IAPHAGYSY G AA+A+ + P + R+F+LGPSHH
Sbjct: 26 LSRQLDLWLSKADLTHGP-ARAIIAPHAGYSYCGACAAFAYRQVSPLVVKRIFILGPSHH 84
Query: 81 YYTPKCALSTATVYKTPIGDLPLD------------------------------------ 104
CALS+ Y TP+ DL +D
Sbjct: 85 VRLRGCALSSLDKYATPLYDLTIDKQNYDRPDGRMAIPFHSLCGIRNLLLHTACWFRGVS 144
Query: 105 -LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAEN 161
+ + EL+ATG+FE MD+ DE EHS+EMHLPY+AKV E I+PILVG+++ +
Sbjct: 145 GVTLYAELEATGQFEFMDVQTDEDEHSIEMHLPYIAKVMEEFKTAFTIIPILVGSLSPDR 204
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDII- 220
EA YG + A Y+ DP N VSSDFCHWGSRF Y D IH++IE LD + +
Sbjct: 205 EAKYGAILAPYLADPQNLVVVSSDFCHWGSRFRYTWRDGNRN-IHQAIEWLDHQACEKLR 263
Query: 221 -ETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS 279
++G P ++ +KFL+Y QSSQC T RDSS
Sbjct: 264 CQSGAP--------------------------------QMSMKFLKYAQSSQCMTMRDSS 291
Query: 280 VSYASAA 286
VSYASA+
Sbjct: 292 VSYASAS 298
>gi|71029084|ref|XP_764185.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351139|gb|EAN31902.1| hypothetical protein, conserved [Theileria parva]
Length = 290
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 168/288 (58%), Gaps = 2/288 (0%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R A+HAGSWY+++S L E+ + ++ +IAPHAGY+YS + AA+A+
Sbjct: 1 MRVATHAGSWYSNSSGALESEIKSSFQGLTNAAYKTLKYIIAPHAGYAYSLKTAAHAYSQ 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
ID S +F+LGPSHH++ CA+ + +TP+G L +D++++++L F +++
Sbjct: 61 IDSASYKTIFVLGPSHHFFLRGCAVDRFSSLQTPLGPLQVDVDIVDKLSNLKGFSVINNE 120
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
E EHS+EMHLP L VF+ +K+VPI+VG + Y +D F S
Sbjct: 121 ASEDEHSIEMHLPLLRFVFKKEPVKVVPIMVGDFSESLADELTSALVPYFNDERTLFVFS 180
Query: 184 SDFCHWGSRFNY--MHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SDFCH+GSRF + Y+ ++ +++ IE LDK G+D+I D F YL E +NTICG
Sbjct: 181 SDFCHFGSRFQFSITGYESENKPLYEKIEMLDKRGIDLIVNHKHDDFLWYLTETENTICG 240
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
R+PI + L +L + + + L Y QSS+ DSSVSYA+ V
Sbjct: 241 RNPILLLLRLLAASNLNVTSRLLHYSQSSRITRVSDSSVSYAAIVGLV 288
>gi|358390568|gb|EHK39973.1| hypothetical protein TRIATDRAFT_152882 [Trichoderma atroviride IMI
206040]
Length = 323
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/322 (36%), Positives = 171/322 (53%), Gaps = 39/322 (12%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSG 54
+ R A AGSWY L ++D +L A +P++ P R +IAPHAGY YSG
Sbjct: 2 ETRLAKKAGSWYKAAEVDLMRDIDKFL--AKVPETVDDSQLPVPGARIIIAPHAGYEYSG 59
Query: 55 RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+A+ +D + RVF+LGPSH Y CA++T + Y TP GD +D + IE +K
Sbjct: 60 ETAAWAYSCLDLSRAKRVFILGPSHTYGLSGCAVTTFSQYATPFGDFTVDRDTIERVKEA 119
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEG------HLIKIVPILVGAVNAENEAMYGRL 168
+ + DEAEHS+EMHLP L K + +VPI+VG+ + NE GR+
Sbjct: 120 ASMQNVPRRNDEAEHSLEMHLPLLYKRCQQTFKSPEEFPMVVPIIVGSTSRANEKDIGRV 179
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEA 211
Y+ D N F +SSDFCHWG RF+Y Y + I++SI+
Sbjct: 180 LLSYIKDEENAFIISSDFCHWGERFDYTAYTANGDIDSLKHLQPYSSKPSGPPIYESIQL 239
Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLG------NCSTKIKIKFLR 265
+D+ MD +++G DAF + NT+CGRHPI + + L N + + + ++
Sbjct: 240 VDEAAMDAVKSGSHDAFVDNIKRTGNTVCGRHPIGIAMAALELYAEEVNDENRSRFRVVK 299
Query: 266 YEQSSQCKTKRDSSVSYASAAA 287
Y++SS+ DSSVSY SA A
Sbjct: 300 YDRSSEVVWPDDSSVSYVSAYA 321
>gi|380485591|emb|CCF39262.1| memo-like protein, partial [Colletotrichum higginsianum]
Length = 322
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 117/313 (37%), Positives = 170/313 (54%), Gaps = 32/313 (10%)
Query: 8 SHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
++ G WY ++K+L E+LDG+L + LP P R +IAPHAGY+YSG AA+A
Sbjct: 11 ANRGEWYLGDAKKLREQLDGFLADVPDQIDGNALP-VPGARVIIAPHAGYTYSGATAAWA 69
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ +D + RVFLLGPSH +Y CA++ + Y TP G+L +D +V++ L+ +
Sbjct: 70 YKALDLSKAKRVFLLGPSHTFYLADCAVTGYSNYGTPWGNLRVDRQVVDRLRDELDIPKI 129
Query: 121 DICVDEAEHSMEMHLPYLAKVFE-------GHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
D+ EHS+EMHLPYL E +VPIL+G + E G+ A+Y+
Sbjct: 130 PTANDDREHSLEMHLPYLWVRLEQTFGASPDAFPPVVPILIGDNDEAGEKAVGKWLAEYL 189
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-----------------IHKSIEALDKMG 216
DP N F VSSDFCHWG F+Y Y + IH++I+ +D +
Sbjct: 190 RDPDNAFIVSSDFCHWGRHFDYTVYSPDGTLDSLRKLRPVGPAPEGPPIHETIKLVDDLA 249
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
++ I TG +F L + NT+CGRHPI V + L + KF++Y++S+
Sbjct: 250 IEAIGTGRHGSFYNNLKQTKNTVCGRHPIGVTMAALEELGGGHRFKFIQYQRSNLVTVPG 309
Query: 277 DSSVSYASAAAKV 289
D+SVSY SA A V
Sbjct: 310 DTSVSYVSAYAVV 322
>gi|353229020|emb|CCD75191.1| hypothetical protein Smp_175570 [Schistosoma mansoni]
Length = 215
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 139/212 (65%), Gaps = 10/212 (4%)
Query: 85 KCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEG 144
KCALSTA ++TP+ L + V +L+ TG+F + + DE EHS+EM +PY+AK+ EG
Sbjct: 6 KCALSTADFFETPLYSLSI---VYRDLEETGEFVSLPLDRDEEEHSIEMQMPYIAKMMEG 62
Query: 145 HLIK--IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH 202
+ K +VPILVG + E EA+YG++F++Y+ +P NFF +SSDFCHWG RF Y +YD+
Sbjct: 63 YQGKFSVVPILVGYLTPEREAVYGQIFSRYLSNPENFFVISSDFCHWGKRFQYTYYDQSK 122
Query: 203 GVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-----ST 257
G I +SI+ALD+ GM++IE P F YL +Y NTICGRHPI V L ++ +
Sbjct: 123 GAIWQSIQALDETGMELIERLAPSEFTSYLEQYGNTICGRHPIGVLLQIVAYLRRNMPNN 182
Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+KF+RY QS C SSVSYA+ ++
Sbjct: 183 NFNMKFVRYAQSEHCNNMNQSSVSYAAGVLQI 214
>gi|393906536|gb|EJD74311.1| hypothetical protein, variant [Loa loa]
Length = 221
Score = 206 bits (525), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 106/215 (49%), Positives = 143/215 (66%), Gaps = 12/215 (5%)
Query: 86 CALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH 145
CALST + Y+TPIGDL +D ELK TG F LMD+ +EAEHS+EM LPY+AK+ E
Sbjct: 6 CALSTCSRYRTPIGDLQIDQRTNTELKETGAFSLMDLRSEEAEHSIEMQLPYIAKIMEKQ 65
Query: 146 LIK---IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH 202
IVP+LVG+++ +A YG++F+KY+ DP F VSSDFCHWGSRF++M +D
Sbjct: 66 SANGYSIVPVLVGSLSPSKQASYGKIFSKYLSDPKIVFVVSSDFCHWGSRFHFMPHDNTT 125
Query: 203 GV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH------MLGNC 255
GV I++ I A+DK GMD I + +P AF +YL NTICGR+PISV L+ + N
Sbjct: 126 GVPIYEQIAAMDKQGMDAISSLNPVAFGEYLKRTQNTICGRNPISVILYAAEYFRQMNN- 184
Query: 256 STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
+ FL+Y QS+Q ++ DSSVSYA+ A ++
Sbjct: 185 -HLAEFVFLKYAQSNQSRSLNDSSVSYAAGALFIN 218
>gi|346972214|gb|EGY15666.1| MEMO1 protein [Verticillium dahliae VdLs.17]
Length = 343
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/322 (38%), Positives = 168/322 (52%), Gaps = 42/322 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGR 55
+R+ AGSWY ++K+LAE+L+G+L + +P P R +IAPHAGYSYSG
Sbjct: 6 VRKPGKAGSWYLADAKQLAEQLEGFLDD--VPSQINSSDVPIPGARVIIAPHAGYSYSGP 63
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+A+ ++D + RVFLLGPSH +Y CA++T Y TP G++ +D EV L+
Sbjct: 64 TAAWAYKSLDLSQTKRVFLLGPSHTFYLKGCAVTTFKHYGTPFGNIRVDEEVTSTLRTAL 123
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVF------EGHLIKIVPILVGAVNAENEAMYGRLF 169
M D EHS+EMHLPYL +F +VPILVG E G
Sbjct: 124 SLPDMPPANDNKEHSLEMHLPYLWTMFAKTFGSPDAFPTLVPILVGDGTKTAERAVGAWL 183
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSIEA 211
Y+ DP+N F VSSDFCHWG F+Y Y V IH++I+
Sbjct: 184 LPYLRDPANAFVVSSDFCHWGDNFSYTPYSPHAKVDGSLSHISSRSRVPAGRPIHETIKE 243
Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST--------KIKIKF 263
LD + I+TG+ F L NT+CGRHPI V + L + K + +F
Sbjct: 244 LDDQAIAAIKTGEYARFYDNLSLTKNTVCGRHPIGVMMAALELLAEEEGRGGEDKGRFQF 303
Query: 264 LRYEQSSQCKTKRDSSVSYASA 285
+RYE+S+ + D SVSY SA
Sbjct: 304 VRYERSNLVEDADDMSVSYVSA 325
>gi|154419148|ref|XP_001582591.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121916827|gb|EAY21605.1| hypothetical protein TVAG_013700 [Trichomonas vaginalis G3]
Length = 292
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 112/294 (38%), Positives = 162/294 (55%), Gaps = 12/294 (4%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYA 60
K+ A HAGSWY + L E LD A + + + V+ +IAPHAGY YS A+YA
Sbjct: 4 KVVPAYHAGSWYPIG-QELKEMLDESFSNANVSQDKKGIVKAIIAPHAGYVYSVATASYA 62
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL--KATGKFE 118
+ IDP++ RV +LGPSH Y KC ++ A +TP G +P+D + +EL K F+
Sbjct: 63 YKAIDPSNFDRVVILGPSHRIYVKKCTIAAADGCETPYGTVPIDRKAADELLQKYPDSFQ 122
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
++ I EHS+EM LP L VF ++PI++G + + DP
Sbjct: 123 VLSIDQSAKEHSLEMQLPLLKYVFGDKPFSVIPIMIGDLKEAQHKQVVEALTPIISDPKT 182
Query: 179 FFSVSSDFCHWGSRFNYMHYDK---KHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
+SSDFCHWG+ F+Y + K K ++K IE LDKM + ++ DP F KY+ E
Sbjct: 183 LLVISSDFCHWGNNFDYFYLPKEIEKSEPVYKRIERLDKMAWEYVKDHDPKGFTKYISET 242
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+NTICG PI++ + +L N +F Y QSS K+K DSSVSY++ ++
Sbjct: 243 ENTICGYVPITMAMEILVNYDA----EFPHYSQSSHAKSKHDSSVSYSAGILRI 292
>gi|219119433|ref|XP_002180477.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407950|gb|EEC47885.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 356
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/325 (38%), Positives = 169/325 (52%), Gaps = 47/325 (14%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IRRA HAGSWY ++ L L +L A S +I PHAGYSYSG AAYA+
Sbjct: 21 IRRAHHAGSWYENDPVTLRTTLQQYLDHAASDSS-----LIVPHAGYSYSGPTAAYAYQP 75
Query: 64 IDPTSISRV-------FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL----- 111
+ +SRV +L PSHH Y CA+S + TP+G+L D + EEL
Sbjct: 76 LF-QELSRVDCPIQILLVLHPSHHVYLDGCAISNSHTINTPVGNLATDDGIREELLLLNH 134
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFE--GHL--------IKIVPILVGAVNAEN 161
F +M DE EHS EM PY+A + + G L I+++PI+ GA++ +
Sbjct: 135 NNKSIFTVMSQKEDEEEHSGEMQYPYIAHILQACGKLHNNGSNKPIRVLPIMCGALSNQQ 194
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------IHKSIEALDKM 215
EA YG L + + VSSDFCHWG RF Y K +H+ I++LD+
Sbjct: 195 EASYGHLLQRVIAREDVLTIVSSDFCHWGPRFRYQPIPTKEKSYKDSMPLHEFIKSLDRQ 254
Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI-------------KIK 262
GMD IE P AF YL NTICGRH I+V++ + T I +++
Sbjct: 255 GMDAIEAQQPGAFANYLARTRNTICGRHAIAVWMQAIAASETTIGNKDDTDPTGELLRVR 314
Query: 263 FLRYEQSSQCKTKRDSSVSYASAAA 287
F+RY QSS ++ RD+SVSYA+A A
Sbjct: 315 FVRYAQSSPAESLRDNSVSYAAALA 339
>gi|399217242|emb|CCF73929.1| unnamed protein product [Babesia microti strain RI]
Length = 290
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 113/291 (38%), Positives = 155/291 (53%), Gaps = 10/291 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE----VRGVIAPHAGYSYSGRAAAY 59
IR A+HAGSWY+ L + + L PE V+ +I PHAGY YS + A Y
Sbjct: 3 IRIATHAGSWYSSVGNVLKTRI-----SSELSTHPENEVLVKYIITPHAGYDYSLKTALY 57
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ I S++F+LGPSHH Y C L Y+TPIG+L +D E + L A F
Sbjct: 58 AYSKIRTFKYSKIFILGPSHHVYFEGCGLDKCIKYETPIGNLDIDTETVTNLLANDHFMN 117
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ E EHS+EM LP L + EG KI+PI++G ++ ++ KY +D N
Sbjct: 118 ISKGTSEDEHSIEMQLPILKLILEGFPDAKIIPIMIGDIDEQSTIGIANSLLKYFEDKDN 177
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDFCH+G+RF + IH+++E LDK G+D+I + F YL + NT
Sbjct: 178 LFVISSDFCHFGARFGFYKTPYPGIPIHEAVEKLDKDGIDLILKHNYKGFISYLDDTKNT 237
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
ICGR+PI V L +L I Y QS+ RDSSVSYA+ +
Sbjct: 238 ICGRNPIQVLLKLLTFSKLNIFSNLESYAQSNPATNIRDSSVSYAAVVGTI 288
>gi|225680304|gb|EEH18588.1| DUF52 domain protein [Paracoccidioides brasiliensis Pb03]
Length = 439
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 134/374 (35%), Positives = 178/374 (47%), Gaps = 91/374 (24%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R A+HAGSWY+D+ L+++LD WL + G P R +IAPHAGY+YSG A
Sbjct: 64 REATHAGSWYSDHEPTLSKQLDQWLAQVPDELPDIGPLPIPGARIIIAPHAGYAYSGPCA 123
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEEL---- 111
A+A+ +D ++ R+FL+GPSHH+ AL T Y TP+ LPLD E+I L
Sbjct: 124 AFAYKTLDLSNAERIFLIGPSHHHILSTLALPQLTSYLTPLSREPLPLDTELIAHLLTST 183
Query: 112 -KATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENE 162
+ +F M VD AEHS+E+HLPY+ + +VPI+VG+ E
Sbjct: 184 NASNHRFTTMSPAVDSAEHSLELHLPYIHHLLRRLYPTKPTAQYPSLVPIMVGSTLPATE 243
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
A +G + A Y+ DPSN F +SSDFCHWG RF + +Y
Sbjct: 244 AAFGSVLAPYLADPSNAFIISSDFCHWGLRFGHTYYVPNARTPAEPLPLSYPQLPLPSTS 303
Query: 199 ---------------------DK---KHGV-----IHKSIEALDKMGMDIIETGDPDAFK 229
DK + GV IH+SI A D M I +G+ F
Sbjct: 304 DAGVAAEAIEAVSGGISLKVKDKDRVRKGVAGVPLIHESISACDVACMTAIASGETRRFL 363
Query: 230 KYLLEYDNTICGRHPISVFLHML----GNCS------------TKIKIKFLRYEQSSQCK 273
L + NT+CGRHPI V + L GN K + F+RYE+SS
Sbjct: 364 DALRKTGNTVCGRHPIGVVIAGLEIVRGNKEGGGKVDAVDGEMQKGRFHFIRYERSSDVV 423
Query: 274 TKRDSSVSYASAAA 287
DSSVSY SA A
Sbjct: 424 AVGDSSVSYVSAFA 437
>gi|391870565|gb|EIT79745.1| putative dioxygenase [Aspergillus oryzae 3.042]
Length = 390
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 125/357 (35%), Positives = 169/357 (47%), Gaps = 84/357 (23%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS---------PEVRGVIAPHAGYSYSGR 55
R ASHAGSWY+DN + L +LDGWL A +P S P R +IAP+ + S
Sbjct: 4 REASHAGSWYSDNQRTLTHQLDGWL--AQVPNSIEGIGSLPVPGARIIIAPYVPENISAP 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEELKA 113
++ DP I R+F+LGPSHH+Y AL T Y TP+ D LPLD E+I +L +
Sbjct: 62 CTVNSYA--DPLDIKRIFILGPSHHHYLSTLALPELTSYYTPLSDEPLPLDTELITKLLS 119
Query: 114 TG---------KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--------IVPILVGA 156
T F M +DE EHS+E+HLPY+ ++ + H +VPI+VG+
Sbjct: 120 TKAVKSNGSTISFTTMSRSIDEDEHSIELHLPYIHRLLQLHFPNKPTSQYPPLVPIMVGS 179
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------ 198
+A E+ +G L A Y+ DP+N F +SSDFCHWG RF Y +Y
Sbjct: 180 TSASTESAFGALLAPYLQDPTNAFVISSDFCHWGLRFRYTYYVPQAPKPGPKLPLSANSL 239
Query: 199 -----------------------------DKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ IH+SI A D M I TG F
Sbjct: 240 PQPGDDLSSVEEKMESVSTGQALQRRDRISSREPAIHESISAFDIATMAAISTGATANFL 299
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVS 281
+ + NT+CGRHPI V + + ++ K K F+RYE+SS DSS S
Sbjct: 300 ETIQTTGNTVCGRHPIGVIMAAIEEITSQEEGKKGKFHFVRYERSSDAVDVDDSSFS 356
>gi|254585889|ref|XP_002498512.1| ZYRO0G12034p [Zygosaccharomyces rouxii]
gi|238941406|emb|CAR29579.1| ZYRO0G12034p [Zygosaccharomyces rouxii]
Length = 319
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/319 (37%), Positives = 173/319 (54%), Gaps = 36/319 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A+HAGSWY+D+ + L +L G+L GL P +++PHAGY Y G+ A+A+ ++
Sbjct: 3 RPATHAGSWYSDDPRLLKSQLLGYLEATGLRGEPGANIIVSPHAGYRYCGKTMAHAYASL 62
Query: 65 DPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
D T++SRVF++GPSHH Y+ K +S +TP+G L +D E+ +EL + F MD
Sbjct: 63 DLTNVSRVFIMGPSHHVYFKNKVLVSGFDSLETPLGTLKIDRELGDELITSRHFAEMDPG 122
Query: 124 VDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
VD EHS+EM LA+ + +K+VP++V + E + G + KY+ P
Sbjct: 123 VDMEEHSIEMQFSMLAQTLQWKNVPLDTVKVVPLMVSHNSKEVDWQIGNVLGKYL-GPGT 181
Query: 179 FFSVSSDFCHWGSRFNYMHY----DKKHGV----------------------IHKSIEAL 212
F +SSDFCHWG RF Y Y D+ + I KSIE L
Sbjct: 182 IFIISSDFCHWGRRFGYTGYVGSQDELNDAIAQETEIEMLTSRSKLSHHQLDIWKSIELL 241
Query: 213 DKMGMDIIE-TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSS 270
D+ MD+++ GD D +KKYL NTICG PI++ L +L + + +F Y QSS
Sbjct: 242 DRAAMDVLKLQGDKYDEWKKYLEVTGNTICGARPIALMLAILAQ-TRDARFQFPAYSQSS 300
Query: 271 QCKTKRDSSVSYASAAAKV 289
+ DSSVSY S A +
Sbjct: 301 HVRDLHDSSVSYGSGFATM 319
>gi|294910181|ref|XP_002777909.1| Protein C2orf4, putative [Perkinsus marinus ATCC 50983]
gi|239885888|gb|EER09704.1| Protein C2orf4, putative [Perkinsus marinus ATCC 50983]
Length = 341
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 115/282 (40%), Positives = 154/282 (54%), Gaps = 32/282 (11%)
Query: 37 SPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATV--Y 94
S + +I PHAGY Y G AA A+ I +++RVF+LGPSHH Y CAL ++ Y
Sbjct: 58 STTTKAIITPHAGYRYCGSVAANAYNTIS-DNVNRVFILGPSHHQYLDNCALPHPSIHQY 116
Query: 95 KTPIGDLPLDLEVIEELK-------ATGKF-ELMDICVDEAEHSMEMHLPYLAKVFEGHL 146
TP G L LD +V+ EL+ + G+F + DE EHS+EM LP + + L
Sbjct: 117 DTPFGPLKLDEDVLGELRGLSKSSNSGGEFGTTLTKEEDEDEHSIEMQLPLIYHQLKHRL 176
Query: 147 ----IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH----- 197
+ IVPILVG ++ E + GRL AKY DP F +SSDFCHWG+RF Y
Sbjct: 177 GIQDLTIVPILVGVLSPNVERVVGRLLAKYFKDPGTLFVISSDFCHWGTRFRYTQLQKDK 236
Query: 198 ---------YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVF 248
+D I+ IEA+D+ GM++I D + FK+YL NTICGRHPISV
Sbjct: 237 VKLAVKSIVFDPNTQPINAGIEAMDREGMELIVNQDSEGFKRYLQMTGNTICGRHPISVL 296
Query: 249 LHMLGNCSTKIKIKFLRYEQSS---QCKTKRDSSVSYASAAA 287
L +L N + IKF+ Y QS+ + DS V+YA+
Sbjct: 297 LEILRNVKEEYTIKFVDYNQSTLLPSVPCRNDSCVAYAAGVV 338
>gi|296004434|ref|XP_002808659.1| Memo-like protein [Plasmodium falciparum 3D7]
gi|225631643|emb|CAX63929.1| Memo-like protein [Plasmodium falciparum 3D7]
Length = 296
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 106/297 (35%), Positives = 164/297 (55%), Gaps = 8/297 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME RRA H+GSWYT+ + L +++ + A K V+ I PHAGY Y+ ++
Sbjct: 1 MENYRRAYHSGSWYTNKREVLKSKIEESFKRANAQKQ-NVKAAICPHAGYDYALETNSHV 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK---F 117
+ +ID ++ +F+LGP+HH Y Y+TP G L ++ ++I ++ + +
Sbjct: 60 YASIDVENVKNIFILGPNHHIYNKGFLFPRVEKYETPFGFLQINKQIISDIIKSDTHNLY 119
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA--MYGRLFAKYVDD 175
+D DE EHS+EM LP + + + IKIVPI VG++ + + ++ KY D
Sbjct: 120 SFIDPDDDEEEHSIEMQLPLIKYIIKDKDIKIVPIYVGSIGNDLKKIDLFANPLKKYFQD 179
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHG--VIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
N F SSDFCH+G RF + + +K+ I K IE +DK ++II D F YL
Sbjct: 180 QHNLFLFSSDFCHYGPRFRFTNILQKYSDTFIFKQIENMDKDAVNIISHHDLTGFVDYLN 239
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
E NTICG +PI + L++L + S + K + Y QS+ K++ DSSVSYA + V+
Sbjct: 240 ETHNTICGSNPIKIMLNLLQHYSASVSTKLMHYSQSNHAKSRSDSSVSYAGVISTVN 296
>gi|46108134|ref|XP_381125.1| hypothetical protein FG00949.1 [Gibberella zeae PH-1]
Length = 390
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 114/279 (40%), Positives = 152/279 (54%), Gaps = 33/279 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSYSGRA 56
R A AG WY + L EL ++ A +P+S P R VIAPHAGY YSG
Sbjct: 4 RPAKKAGYWYKSEAAELKSELQQYM--AAVPESINGAPVPVPGARVVIAPHAGYEYSGPC 61
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA+A+ +D + RVF+LGPSH YY CA + Y TP GDL +D+++ +EL+
Sbjct: 62 AAWAYKTLDLSCAKRVFVLGPSHTYYLEGCAATIFGKYATPFGDLEIDVDMAKELEDAIM 121
Query: 117 FELMDICVDEAEHSMEMHLPYL----AKVFEG--HLIKIVPILVGAVNAENEAMYGRLFA 170
E M + EHS+EMH+PYL + FE KIVP+LVG+ A+ E + GR
Sbjct: 122 MEKMPRQGEINEHSLEMHMPYLYLRCEETFETPDKFPKIVPVLVGSNTAKEEKVIGRALL 181
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-----------------VIHKSIEALD 213
Y+ DP N F +SSDFCHWGS F+Y+ Y + IH++I +D
Sbjct: 182 PYLRDPENAFIISSDFCHWGSGFSYLPYSPTNSPSDLTQLKKRDPKPDGPPIHETIRVID 241
Query: 214 KMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
+ MD +ETG DAF L + NT+CGRHPI V + L
Sbjct: 242 QAAMDAVETGSHDAFISTLKQTRNTVCGRHPIGVTMAAL 280
>gi|149050674|gb|EDM02847.1| similar to RIKEN cDNA 0610016J10 gene, isoform CRA_a [Rattus
norvegicus]
Length = 180
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 98/178 (55%), Positives = 120/178 (67%), Gaps = 5/178 (2%)
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVD 174
FE M + DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+
Sbjct: 2 FERMSLQTDEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLA 61
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
DPSN F VSSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +
Sbjct: 62 DPSNLFVVSSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKK 121
Query: 235 YDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
Y NTICGRHPI V L+ L + FL Y QSSQC++ +DSSVSYA+ A V
Sbjct: 122 YHNTICGRHPIGVLLNAITELQKSGMNMSFSFLNYAQSSQCRSWQDSSVSYAAGALTV 179
>gi|429853874|gb|ELA28918.1| duf52 domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 296
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 161/311 (51%), Gaps = 47/311 (15%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
RR G WYT + RL+++LDG+L + GLP P R +IAPHAGYSYSG A
Sbjct: 7 RRPGKQGEWYTGDPTRLSKQLDGFLSDVPDQIDGDGLP-VPGARVIIAPHAGYSYSGPTA 65
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ +D + RVFLLGPSH YY L V E++A
Sbjct: 66 AWAYKALDLSKAKRVFLLGPSHTYY--------------------LRGAVTAEIRARDGV 105
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF------EGHLIKIVPILVGAVNAENEAMYGRLFAK 171
M + D+ EHS+EMHLPYL K E + +VPILVG N E G+ A
Sbjct: 106 ADMPVRNDDKEHSLEMHLPYLFKRLQQTFGAEENFPTLVPILVGDNNKSEEKEVGKWLAP 165
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------IHKSIEALDKMGMDI 219
Y+ DP N F VSSDFCHWGS F+Y Y V IH++I+ +D + ++
Sbjct: 166 YLKDPENAFIVSSDFCHWGSHFDYTVYSPDGTVEGMQRLRGDSPPIHETIKMVDDLAIEA 225
Query: 220 IETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI-KFLRYEQSSQCKTKRDS 278
I+TG F L + NT+CGRHPI V + L + +F++Y++SS DS
Sbjct: 226 IKTGKHSNFYDNLKQTKNTVCGRHPIGVTMAALEELGEGHPVFRFVQYQRSSMVTEPSDS 285
Query: 279 SVSYASAAAKV 289
SVSY SA A V
Sbjct: 286 SVSYVSAYAVV 296
>gi|354548291|emb|CCE45027.1| hypothetical protein CPAR2_700310 [Candida parapsilosis]
Length = 328
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 120/326 (36%), Positives = 173/326 (53%), Gaps = 40/326 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYA 60
+R+A+HAGSWY+ + K L ++ + A S V G ++ PHAGY+YSG A
Sbjct: 3 VRQATHAGSWYSSDPKTLKAQVQTLIHVAQKSGSNGVSGARVLVGPHAGYTYSGERLGEA 62
Query: 61 FGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL------KA 113
F D +++ RVFLLGPSHH Y+ K LS Y+TP+GD+P+D E I +L K
Sbjct: 63 FNAWDTSTVKRVFLLGPSHHVYFKDKALLSPFDFYETPLGDIPVDRETINDLLKKRFKKK 122
Query: 114 TGK--FELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAENEAMYGRL 168
G+ F+LM +DE EHS EMH P++ + + + KIVPIL+ +++E +
Sbjct: 123 HGQPVFKLMSEEIDEDEHSFEMHAPFIYHQGRKAKHGVPKIVPILISGMDSELQTELVDA 182
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV----------------------IH 206
Y+ + N F +SSDFCHWGSRF Y ++
Sbjct: 183 LYPYIQNDENHFIISSDFCHWGSRFGYTKVLTNKNTTLNTLETDLHSLRFSSAPNDTPVY 242
Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST---KIKIKF 263
+SIE LD+M + I G +K+Y+ NTICG+ PI+V L +L + + +
Sbjct: 243 ESIEWLDRMALKIASKGSSAEWKEYIRISGNTICGQKPIAVILQLLEQYRSNDGQDVFNW 302
Query: 264 LRYEQSSQCKTKRDSSVSYASAAAKV 289
L Y QS+ K+ RDSSVSYAS K+
Sbjct: 303 LGYSQSNPAKSSRDSSVSYASGYVKL 328
>gi|295657135|ref|XP_002789140.1| DUF52 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226284526|gb|EEH40092.1| DUF52 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 379
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 133/374 (35%), Positives = 177/374 (47%), Gaps = 91/374 (24%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R A+HAGSWY+D+ L+++LD WL + G P R +IAPHAGY+YSG A
Sbjct: 4 REATHAGSWYSDHEPTLSKQLDQWLAQVPDELPDIGPLPIPGARIIIAPHAGYAYSGPCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEEL---- 111
A+A+ +D ++ R+FL+GPSH++ AL T Y TP+ LPLD E+I L
Sbjct: 64 AFAYKTLDLSNAERIFLIGPSHNHILSTLALPQLTCYLTPLSREPLPLDTELIAHLLTST 123
Query: 112 -KATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENE 162
+ +F M VD AEHS+E+HLPY+ + +VPI+VG+ E
Sbjct: 124 NASNHRFTTMSPAVDSAEHSLELHLPYIHHLLRRLYPTKPTAQYPPLVPIMVGSTLPAAE 183
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
A +G + A Y+ DPSN F +SSDFCHWG RF + +Y
Sbjct: 184 AAFGSVLAPYLADPSNAFIISSDFCHWGLRFGHTYYVPNARKPAESLPLSYPQLPLPSTS 243
Query: 199 ---------------------DK---KHGV-----IHKSIEALDKMGMDIIETGDPDAFK 229
DK + GV IH+SI A D M I +G+ F
Sbjct: 244 DAGVAAEAIEAVAGGISLKVKDKDRVRKGVAGVPLIHESISACDVACMTAIASGETRGFL 303
Query: 230 KYLLEYDNTICGRHPISVFLHML----------GNCST------KIKIKFLRYEQSSQCK 273
L + NTICGRHPI V + L G K + F+RYE+SS
Sbjct: 304 DALRKTGNTICGRHPIGVVMAGLEIVRGSKEGGGKLDAVDGEMQKCRFHFIRYERSSDVV 363
Query: 274 TKRDSSVSYASAAA 287
DSSVSY SA A
Sbjct: 364 AVGDSSVSYVSAFA 377
>gi|390474594|ref|XP_002757945.2| PREDICTED: protein MEMO1-like [Callithrix jacchus]
Length = 180
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 98/178 (55%), Positives = 119/178 (66%), Gaps = 5/178 (2%)
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVD 174
FE M + DE EHS+EMHLPY AK E H I+P+LVGA++ E +G+LF+KY+
Sbjct: 2 FERMSLQTDEDEHSIEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLA 61
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
DPSN F VSSDFCHWG RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +
Sbjct: 62 DPSNLFVVSSDFCHWGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKK 121
Query: 235 YDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
Y NTICGRHPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 122 YHNTICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 179
>gi|407834999|gb|EKF99095.1| hypothetical protein TCSYLVIO_010007 [Trypanosoma cruzi]
Length = 315
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/307 (38%), Positives = 172/307 (56%), Gaps = 27/307 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGW---LREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
RRASHA SWY ++K L +D +++ L + G+I+PHAG SYSG AA+ +
Sbjct: 5 RRASHANSWYEGSAKVLKATVDDLFERVKDFQLGGDETLIGIISPHAGISYSGPTAAHVY 64
Query: 62 GNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI----EE 110
+ ++++ +FLLGP HH +S A Y+TP G L +D V+ +
Sbjct: 65 KVLRNYIYGPKGSNVTHIFLLGPDHHKGFDGVEMSGAQQYETPFGALSIDTSVVSSVFQS 124
Query: 111 LKATG-KFELMDICVDEAEHSMEMHLPYLAKVFE---------GHLIKIVPILVGAVNAE 160
LKA G + M DE EHS+EM LP+++ + + I++VP+LVG + E
Sbjct: 125 LKAAGVQTTRMSQSTDEEEHSIEMQLPFISHILHHPPAGLTGAKNRIQLVPMLVGWTDRE 184
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNY-MHYDK-KHGVIHKSIEALDKMGMD 218
E G + + Y D N F +SSDFCHWGSRF Y HY K ++ I +I A+D GM+
Sbjct: 185 TEERIGAVLSSYSQDSHNIFVLSSDFCHWGSRFQYKYHYQKSEYPAIADAIIAMDHEGME 244
Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
+E D + + +YL + NTICGR PIS+ + + N ++FL Y QS++C + DS
Sbjct: 245 WLEKRDMNGWYEYLNKTHNTICGRRPISIGMEAVKN-DRSTTVRFLHYSQSNRCASLSDS 303
Query: 279 SVSYASA 285
SVSYASA
Sbjct: 304 SVSYASA 310
>gi|254570361|ref|XP_002492290.1| Putative protein of unknown function [Komagataella pastoris GS115]
gi|238032088|emb|CAY70010.1| Putative protein of unknown function [Komagataella pastoris GS115]
gi|328353703|emb|CCA40101.1| Protein MEMO1 [Komagataella pastoris CBS 7435]
Length = 314
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 115/317 (36%), Positives = 168/317 (52%), Gaps = 36/317 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAA 57
M K+R A+HAGSWY+ N L +L +L A V+G ++ PHAGYS++G+
Sbjct: 1 MVKVRPATHAGSWYSGNPNELNSQLSHYLSLAKSSGKTSVKGARVLVGPHAGYSFAGKTL 60
Query: 58 AYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A + + DPT I RVF+LGPSHH Y+ + + Y TP+G++P+D + I++L + +
Sbjct: 61 AQTYNSFDPTGIKRVFILGPSHHIYFKDEVRTTRYGAYATPLGNVPVDTDTIKDLVSNAR 120
Query: 117 -FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRL------- 168
+ M VDE EHS E+H+P ++ L K + + G+L
Sbjct: 121 HIDYMSSSVDENEHSFELHMPLY---YKACLDKGLSTPPPIIPILISGFPGQLADSLTST 177
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------IHKSIE 210
Y +D N F VS+DFCHWG RF Y Y + I++SIE
Sbjct: 178 LQPYFEDKENAFFVSTDFCHWGDRFGYTSYTPNGDLESMDDITVAFNGKSNSLKIYESIE 237
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSS 270
A+DK GM++I GD +++YL +NTICG +P++V L ++G T ++L Y QSS
Sbjct: 238 AVDKKGMELISKGDVSLWRQYLQATENTICGAYPLTVLLKLMGKADT---FEWLGYTQSS 294
Query: 271 QCKTKRDSSVSYASAAA 287
DSSVSYAS A
Sbjct: 295 HVLDPYDSSVSYASGYA 311
>gi|407397346|gb|EKF27707.1| hypothetical protein MOQ_008560 [Trypanosoma cruzi marinkellei]
Length = 315
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 120/311 (38%), Positives = 170/311 (54%), Gaps = 27/311 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAA 57
M RRASHA SWY ++K L +D A + G+IAPHAG SYSG A
Sbjct: 1 MAYSRRASHANSWYEGSAKVLKATVDELFERAKDFQWGGDETLIGIIAPHAGISYSGLTA 60
Query: 58 AYAFGNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI-- 108
A+ + + +++ +FLLGP HH +S A Y+TP G L +D V+
Sbjct: 61 AHVYKVLRNYIYGPKGGNVTHIFLLGPDHHKGFDGVEMSGAQQYETPFGALSIDTSVVSS 120
Query: 109 --EELKATG-KFELMDICVDEAEHSMEMHLPYLAKVFE---------GHLIKIVPILVGA 156
+ LKA G + M DE EHS+EM LP+++ + + I++VP+LVG
Sbjct: 121 VFQSLKAAGVQTTRMSQSTDEEEHSIEMQLPFISHILHYPPAGLTGAKNRIQLVPMLVGW 180
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNY-MHYDK-KHGVIHKSIEALDK 214
+ E E G + + Y +P N F +SSDFCHWGSRF Y HY K ++ I +I A+D
Sbjct: 181 TDRETEERIGAVLSSYSQNPHNIFVLSSDFCHWGSRFQYKYHYQKSEYPAIADAIIAMDH 240
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
GM+ +E D + + YL + NTICGR PIS+ + + N + ++FL Y QS++C +
Sbjct: 241 EGMEWLEKRDMNGWYDYLSKTHNTICGRRPISIGMEAVKNDKSA-SVRFLHYSQSNRCAS 299
Query: 275 KRDSSVSYASA 285
DSSVSYASA
Sbjct: 300 LSDSSVSYASA 310
>gi|448534928|ref|XP_003870861.1| hypothetical protein CORT_0G00430 [Candida orthopsilosis Co 90-125]
gi|380355217|emb|CCG24733.1| hypothetical protein CORT_0G00430 [Candida orthopsilosis]
Length = 328
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 118/326 (36%), Positives = 175/326 (53%), Gaps = 40/326 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYA 60
+R+A+HAGSWY+ + + L ++ + A + V G +I PHAGY+YSG A
Sbjct: 3 VRQATHAGSWYSSDPRTLKLQIQTLISAAQKSGNKSVNGARILIGPHAGYTYSGERLGEA 62
Query: 61 FGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL------KA 113
F D +++ R+FLLGPSHH Y+ K LS Y+TP+GD+P+D E I++L K
Sbjct: 63 FNVWDTSTVKRIFLLGPSHHVYFKDKALLSPFDFYETPLGDIPIDRETIDDLLKKRFKKK 122
Query: 114 TGK--FELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAENEAMYGRL 168
G+ F+LM VDE EHS EMH P++ + + + KIVPIL+ ++++ ++
Sbjct: 123 HGQTVFKLMSEDVDEDEHSFEMHAPFIYHQGQKSKHGVPKIVPILISGMDSDLQSELVDA 182
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------------DKKHGVIH 206
YV + N F +SSDFCHWGSRF Y K ++
Sbjct: 183 LFPYVQNEENHFIISSDFCHWGSRFGYTKVLTTKNATLDSLESDLYSLRFSSASKDIPVY 242
Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIK---IKF 263
+SIE LD+M + I G +K+Y+ NTICG+ PI+V L +L + +
Sbjct: 243 ESIEVLDRMALQIASKGSAAEWKEYIRISGNTICGQKPIAVILQLLEQYRSNKGEDVFNW 302
Query: 264 LRYEQSSQCKTKRDSSVSYASAAAKV 289
L Y QS+ K+ +DSSVSYA+ K+
Sbjct: 303 LGYSQSNPAKSSKDSSVSYAAGYVKL 328
>gi|340057164|emb|CCC51506.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 314
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 175/310 (56%), Gaps = 26/310 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR--GVIAPHAGYSYSGRAAA 58
M R ASHAGSWY +S L + L K + R GVI+PHAG YSG A
Sbjct: 1 MSYSRCASHAGSWYEGSSATLKSTVGALLAVVQDHKLGDERLVGVISPHAGIRYSGGTAG 60
Query: 59 YAFGNIDP-------TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI--- 108
+ F + +++R+FL+GPSH+ + A Y+TP G L + +V+
Sbjct: 61 HVFAAVRDYVYGPRGGALTRMFLIGPSHYKSFSGVEVCGAKEYETPFGPLSVSAQVLKQL 120
Query: 109 -EELKATG-KFELMDICVDEAEHSMEMHLPYLAKV-----FEGH----LIKIVPILVGAV 157
EE ++ G M DE EHS+E+ LP+++ + F G +++VP+L+G
Sbjct: 121 AEEFRSAGVPVGTMTRATDEEEHSIELQLPFISHILHFPCFGGSPAKDRVQLVPLLIGGT 180
Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-KHGVIHKSIEALDKM 215
+A + G + AKY DP+N F +SSDFCHWGSRF Y H+++ K+ I +I A+D
Sbjct: 181 DAAMDKAIGNVLAKYTRDPANLFVISSDFCHWGSRFQYTYHFERSKYPAIGDAIVAMDHA 240
Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTK 275
GM ++E + A+ KYL E +NTICGR PI+V + L + S+ ++F+ Y QS++C++
Sbjct: 241 GMQLLEQRNLQAWYKYLEETNNTICGRRPITVAMAALHHHSSA-AVRFVHYSQSNRCQSL 299
Query: 276 RDSSVSYASA 285
DSSVSYA A
Sbjct: 300 SDSSVSYAGA 309
>gi|71410526|ref|XP_807553.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871580|gb|EAN85702.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 315
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 118/307 (38%), Positives = 168/307 (54%), Gaps = 27/307 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
RRASHA SWY ++K L +D A L + G+I+PHAG SYSG AA+ +
Sbjct: 5 RRASHANSWYEGSAKVLKATVDDLFERAKDFQLGGDETLIGIISPHAGISYSGPTAAHVY 64
Query: 62 GNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI----EE 110
+ +++ +FLLGP HH +S A Y+TP G L +D V+ +
Sbjct: 65 KALRNYIYGPKGNNVTHIFLLGPDHHKGFDGVEMSGAQQYETPFGALSIDTSVVSSVFQS 124
Query: 111 LKATG-KFELMDICVDEAEHSMEMHLPYLAKVF---------EGHLIKIVPILVGAVNAE 160
LKA G + M DE EHS+EM LP+++ + + I++VP+LVG + +
Sbjct: 125 LKAAGVQTTRMSQSTDEEEHSIEMQLPFISHILYHPPAGLTGAKNRIQLVPMLVGWTDRK 184
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNY-MHYDK-KHGVIHKSIEALDKMGMD 218
E G + + Y D N F +SSDFCHWGSRF Y HY K ++ I +I A+D GM+
Sbjct: 185 TEERIGAVLSSYSQDSHNIFVLSSDFCHWGSRFQYKYHYQKSEYPAIADAIIAMDHEGME 244
Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
+E D + + YL + NTICGR PIS+ + + N ++FL Y QS++C + DS
Sbjct: 245 WLEKRDMNGWYDYLNKTHNTICGRRPISIGMEAVKN-DRSTTVRFLHYSQSNRCASLSDS 303
Query: 279 SVSYASA 285
SVSYASA
Sbjct: 304 SVSYASA 310
>gi|154345628|ref|XP_001568751.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066093|emb|CAM43881.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 328
Score = 196 bits (499), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 168/324 (51%), Gaps = 39/324 (12%)
Query: 1 MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGLPKSPEVRG------VIAPHAGY 50
M +R A+HA WY +RL +D +LRE E RG +I PHAG
Sbjct: 1 MHFVRSATHAAPHGCGWYEAIPERLKATIDDYLREVTHHYEEESRGGARMMGLIVPHAGM 60
Query: 51 SYSGRAAAYAFG---------NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
SYSGR A AF + + R+F+LGPSH C LS A+ Y+TP G L
Sbjct: 61 SYSGRTACEAFAVFREYLYAKGSKGSKVERIFILGPSHTKGFEGCELSAASAYETPFGPL 120
Query: 102 PLDLEVIEEL-----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
+D ++ + KA DEAEHS+EM PYL+ +
Sbjct: 121 KVDTATVDRVITALCKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPAATGTSTQPA 180
Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-K 201
+ IVPI+VG N ++E + Y+DD NFF SSDFCHWG RF+Y HY + +
Sbjct: 181 AARVSIVPIIVGWTNRQDEKAICDVLKPYMDDARNFFIFSSDFCHWGDRFSYTYHYQRSQ 240
Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
+ I SI A+D M+++E D + + YL NTICGR PIS+ + + ++K++I
Sbjct: 241 YPNIGDSIIAMDHAAMELLERRDLEGWYGYLRTTKNTICGRAPISIGMQYWMDEASKVQI 300
Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
KF+ Y QS++C + DSSVSYA+A
Sbjct: 301 KFVHYSQSNKCLSVEDSSVSYAAA 324
>gi|429329595|gb|AFZ81354.1| hypothetical protein BEWA_007630 [Babesia equi]
Length = 287
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 101/273 (36%), Positives = 156/273 (57%), Gaps = 9/273 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RR +H+GSWY + + L E+ L +++ +IAPHAGY+YS + AAY++
Sbjct: 1 MRRPTHSGSWYPSSPRVLGSEIRLALENVSDLTHHKLKYIIAPHAGYAYSLKTAAYSYAQ 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
ID T + R+F+LGPSH+ Y C + TP+G L +D E+I++L F +
Sbjct: 61 IDATQVKRIFILGPSHYLYLLGCGIDKFAKLDTPLGHLDVDTEIIDQLSKVEGFSTIKND 120
Query: 124 VDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
E EHS+EMHLP + + E IK+VPI+VG N + + Y +D SN F +
Sbjct: 121 CSEEEHSIEMHLPLVKYITNENKDIKVVPIIVGDFNNKLKDHIANTLLPYFNDESNLFII 180
Query: 183 SSDFCHWGSRFNYMH--YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSDFCH+GSRF + Y+K++ +H+SIE LD+ ++ I + D F+ Y+ E NTIC
Sbjct: 181 SSDFCHFGSRFQFTKTGYEKENRPLHESIEMLDRKSIEYITSHDLIGFENYIDETGNTIC 240
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCK 273
GR+PI + L + I++ F+ ++ K
Sbjct: 241 GRNPIQILLKV------SIQLPFMNFKDDISIK 267
>gi|340501035|gb|EGR27856.1| memo family protein, putative [Ichthyophthirius multifiliis]
Length = 220
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 86/198 (43%), Positives = 129/198 (65%), Gaps = 2/198 (1%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
KIR ASHA SWYT N ++L +EL+ +L+ + + +++G+I PHAG+ YSG AA++
Sbjct: 5 KIRLASHANSWYTGNKQKLDQELNEYLQNSQVEIQDIKQIKGIIGPHAGFYYSGPTAAWS 64
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ + P RVFLLGP HH Y C S Y TP+G + LD ++IE+LK T +F+++
Sbjct: 65 YKYLCPQDNLRVFLLGPCHHIYLNSCGTSDLDFYDTPLGSIRLDKQIIEQLKQTEQFQVL 124
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ +E EHS+EMHLPY+ K ++PI+VG +N + E YG++ +++ DD + F
Sbjct: 125 NKSDEEDEHSLEMHLPYIQKQLGSKPFTLIPIMVGNINFQQEKQYGQILSQFFDDENTVF 184
Query: 181 SVSSDFCHWGSRFNYMHY 198
+SSDFCHWGSR+ Y Y
Sbjct: 185 IISSDFCHWGSRYIYQFY 202
>gi|157876688|ref|XP_001686688.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129763|emb|CAJ09069.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 370
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 168/324 (51%), Gaps = 39/324 (12%)
Query: 1 MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGY 50
M +R A+HA WY +RL +D + EA S E + G+IAPHAG
Sbjct: 43 MRFVRPATHAAPHGRGWYEAIPERLKVTIDDYFSEATHCYSDEEQECARMTGLIAPHAGM 102
Query: 51 SYSGRAAAYAFG---------NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
SYSGR AA AF + + R+F+LGPSH C LS A+ Y+TP G L
Sbjct: 103 SYSGRTAAEAFAVFREYLYAKGSKGSELERIFILGPSHTRGFEGCELSAASAYETPFGPL 162
Query: 102 PLDLEVIEEL-----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
+D V++ + KA DEAEHS+EM PYL+ +
Sbjct: 163 RVDTAVVDRVITDLRKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPTTTGAPVQPA 222
Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-- 201
+ IVPI+VG N ++E + Y+DD NFF SSDFCHWG RF+Y ++ K+
Sbjct: 223 AGRVAIVPIIVGWTNRQDEKAICDVLKPYMDDARNFFICSSDFCHWGERFSYTYHYKRSE 282
Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
+ I SI A+D M+++E D + + YL NTICGR PIS+ + + K ++
Sbjct: 283 YPNIGDSIIAMDHAAMELLEKRDLERWYAYLQMTKNTICGRAPISIGMQRWASKGNKARV 342
Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
KF+ Y QS++CK DSSVSYA+A
Sbjct: 343 KFVHYSQSNKCKDADDSSVSYAAA 366
>gi|68481987|ref|XP_715011.1| hypothetical protein CaO19.7210 [Candida albicans SC5314]
gi|46436613|gb|EAK95972.1| hypothetical protein CaO19.7210 [Candida albicans SC5314]
gi|238878225|gb|EEQ41863.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 335
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 122/335 (36%), Positives = 176/335 (52%), Gaps = 51/335 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----------EVRGVIAPHAGYSY 52
IR A+HAGSWY++N +L +LD + +A K+P R +I PHAGY+Y
Sbjct: 3 IRPATHAGSWYSNNPTKLKNQLDQFFAKARSLKTPLSGEDASNGIDGARILIGPHAGYTY 62
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
SG A +F D + + RVFLLGPSHH Y+ LS Y+TP G++P+D+ I+EL
Sbjct: 63 SGERLAESFNAWDTSKVKRVFLLGPSHHVYFKNHALLSPFEYYETPFGNIPVDVSTIKEL 122
Query: 112 -----KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAEN 161
K F+ M +DE EHS EMH P++ ++G L KI+PIL+ ++ +
Sbjct: 123 LDEQCKHGPVFKYMSEEIDEEEHSFEMHAPFI--YYKGETAIHGLPKIIPILISGMDGKL 180
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---------------------DK 200
+ Y+D+ N F +SSDFCHWGSRF Y Y +
Sbjct: 181 QEGIVNSLLPYLDNEENHFIISSDFCHWGSRFGYTKYLSSGETTLDTLSDNITSLGSLHR 240
Query: 201 KHGV-IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML----GNC 255
G+ I+KSIE LDK+ + I G A+ KY+ NTICG+ PI + L +L N
Sbjct: 241 PKGLQIYKSIELLDKIALQIATNGSSAAWSKYIDISGNTICGQKPILIVLKLLEKYNNNS 300
Query: 256 STKIK-IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
S++ ++ Y QS+ + +SSVSYAS ++
Sbjct: 301 SSRGNTFNWIGYSQSNPVQRTSESSVSYASGYVRL 335
>gi|342184221|emb|CCC93702.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 316
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 177/317 (55%), Gaps = 29/317 (9%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWL---REAGLPKSPEVRGVIAPHAGYSYSGRAA 57
M RRA+HAGSWY +S L ++G+ ++ + ++ + GVI+PHAG SYSG A
Sbjct: 1 MAYSRRATHAGSWYEASSAALKSAIEGYFAVASKSSVNENESLIGVISPHAGISYSGMTA 60
Query: 58 AYAFGNI-------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI-- 108
A+A+ + + ++R+F+LGPSH + + A Y+TP G + ++ V
Sbjct: 61 AHAYVKMRDYIYGPKGSCVTRIFVLGPSHVKWFDGVEVCEARQYETPFGPVCVNSAVAGG 120
Query: 109 --EELKATG-KFELMDICVDEAEHSMEMHLPYLAKVFE---------GHLIKIVPILVGA 156
+ L++ G MD DE EHS+EM LP+LA V I++VP+L+G
Sbjct: 121 VGQALRSAGVPVGTMDRSTDEEEHSIEMQLPFLAYVLNYPPNGCPPAKDRIELVPLLIGD 180
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM-HYDK-KHGVIHKSIEALDK 214
E G + A++ D N F +SSDFCHWGSRF Y HY++ K+ I +I A+D
Sbjct: 181 TTRAMEEAIGVVLAEHFRDKQNLFVMSSDFCHWGSRFRYTYHYEREKYPEIGDAIIAMDH 240
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST-KIKIKFLRYEQSSQCK 273
GM ++E D + + +YL NTICGR PISV + L C+ K K+ F+ Y QS++C+
Sbjct: 241 EGMRLLEGRDVEGWYRYLSATKNTICGRRPISVAVAAL--CANKKAKVSFVHYSQSNRCR 298
Query: 274 TKRDSSVSYASAAAKVD 290
DSSVSYA+A D
Sbjct: 299 NLTDSSVSYAAAVITED 315
>gi|367014857|ref|XP_003681928.1| hypothetical protein TDEL_0E04740 [Torulaspora delbrueckii]
gi|359749589|emb|CCE92717.1| hypothetical protein TDEL_0E04740 [Torulaspora delbrueckii]
Length = 332
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 119/329 (36%), Positives = 175/329 (53%), Gaps = 43/329 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR A+HAGSWY+++ L+ +L G+L + G R +I+PHAGY Y G AY++ +
Sbjct: 3 IRPATHAGSWYSNHGNELSTQLRGYLDDTGKSAVRSARLIISPHAGYRYCGPTMAYSYAS 62
Query: 64 ID-PTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK---FE 118
+D +++ R+F+LGPSHH Y+ + ++ +TP+GDL +D + ++L F
Sbjct: 63 LDIDSNVKRIFILGPSHHVYFRNQIFVTKFESLETPLGDLQVDQALCQKLMKLQDGHLFV 122
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGRLFAKYV 173
MD D EHS+EM P L + + +K+VP+LV +AE + G++ Y+
Sbjct: 123 PMDHDADMGEHSLEMQFPMLVQTLKWRNVPLESVKVVPMLVSHNSAEVDLAVGQVLKTYM 182
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHGV-IHK 207
DDPSN F +SSDFCHWG RF Y Y H + I +
Sbjct: 183 DDPSNLFIISSDFCHWGRRFAYTGYVGSEDELKDALQQETEIEMLTARSKLSHHQIDIWQ 242
Query: 208 SIEALDKMGMDII-ETGDP---DAFKKYLLEYDNTICGRHPISVFLHMLGN---CSTKIK 260
SIE LDKM M ++ +T + +A+K+YL NTICG PI+V L L S KI
Sbjct: 243 SIEILDKMAMKVLSDTANEARYNAWKQYLDITGNTICGARPIAVILSTLSQIDEASHKID 302
Query: 261 IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
++ Y QSS + SSVSYA+ A +
Sbjct: 303 FQWPNYSQSSHVNSLDHSSVSYAAGYATL 331
>gi|255711188|ref|XP_002551877.1| KLTH0B01980p [Lachancea thermotolerans]
gi|238933255|emb|CAR21439.1| KLTH0B01980p [Lachancea thermotolerans CBS 6340]
Length = 322
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 117/322 (36%), Positives = 170/322 (52%), Gaps = 38/322 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R ASHAGSWY+ NS +L +L +L + GLP + R +I+PHAGY+Y G +
Sbjct: 3 VRPASHAGSWYSTNSGKLGLQLQQYLGQ-GLPVAG-ARVIISPHAGYAYCGATMGKVYAA 60
Query: 64 IDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+D T++ R+F+LGPSHH Y+ + +S TP+G LP+D+ ++EL FE MD
Sbjct: 61 LDLTNVKRIFVLGPSHHIYFKNRAHVSNFKEVATPLGSLPVDIATVKELVEDEAFEYMDP 120
Query: 123 CVDEAEHSMEMHLPYLAKVF-----EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
VD EHS+EM P L + + + +VPI++ + ++ G++ ++Y+ D S
Sbjct: 121 EVDVEEHSLEMQFPMLRALLKLRGVDTASVGVVPIMISHNSTRSDQHLGKVLSEYLADQS 180
Query: 178 NFFSVSSDFCHWGSRFNYMHY--DKK-----------------------HGV-IHKSIEA 211
F +SSDFCHWG RF++ Y DK H V I KSIE
Sbjct: 181 TVFIISSDFCHWGRRFSFTGYVGDKSEIQEALAEDTEIESLTARSKLSHHQVPIWKSIEI 240
Query: 212 LDKMGMDIIETGDP----DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYE 267
LD+ M I+ +P + +K YL NTICG ISV L L + + ++ Y
Sbjct: 241 LDQYAMSILSQTEPRDKYELWKHYLEVTGNTICGEKAISVMLCALSHQAEPAVFQWPAYA 300
Query: 268 QSSQCKTKRDSSVSYASAAAKV 289
QSSQ + DSSVSY K+
Sbjct: 301 QSSQVTSVMDSSVSYGGGYCKI 322
>gi|255728913|ref|XP_002549382.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240133698|gb|EER33254.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 335
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 112/291 (38%), Positives = 161/291 (55%), Gaps = 42/291 (14%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----------EVRGVIAPHAGYSY 52
+R A+HAG+WYT++ +L ++LD + +A L KSP R +I PHAGY+Y
Sbjct: 3 VRSATHAGTWYTNSPTKLRKQLDEFFHKAKLLKSPINGDDASNGVEGARILIGPHAGYAY 62
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
SG A F D T + RVFLLGPSHH Y+ LS+ Y+TP+GD+P+D E+I++L
Sbjct: 63 SGERLAETFNAWDTTKVKRVFLLGPSHHVYFKSYAMLSSFDYYETPLGDIPVDKEIIKDL 122
Query: 112 --KATGK---FELMDICVDEAEHSMEMHLPYLAKVFEGH---LIKIVPILVGAVNAENEA 163
K + F+ M +DE EHS EMH P++ E + KI+PIL+ ++ + ++
Sbjct: 123 INKKCSRGPIFKYMSDEIDEDEHSFEMHAPFIRYKGESSPHGVPKIIPILISGMDEKLQS 182
Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV------------------- 204
++D+ N F +SSDFCHWGSRF Y Y V
Sbjct: 183 DIVDSLLPFMDNEENHFIISSDFCHWGSRFGYTKYLSSSDVTLDNLEDEITSLGMFQRTK 242
Query: 205 ---IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
I+KSIE LDK+ ++I TG A+K+Y+ NTICG+ PIS+ L +L
Sbjct: 243 GLEIYKSIELLDKLALEIASTGSSAAWKQYIDTTGNTICGQKPISIVLKLL 293
>gi|240276562|gb|EER40073.1| DUF52 domain-containing protein [Ajellomyces capsulatus H143]
Length = 416
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 128/369 (34%), Positives = 174/369 (47%), Gaps = 88/369 (23%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
R A+HAGSWY+D+ L+ +L+ WL + G+ + P R +IAPHAGY+YSG A
Sbjct: 44 REATHAGSWYSDHEPTLSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPHAGYAYSGPCA 103
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEEL---- 111
A+A+ +D + R+FLLGPSHH++ AL T YKTP+ LPLD ++I L
Sbjct: 104 AWAYKALDLSKAKRIFLLGPSHHHHLSTLALPQLTSYKTPLSPDPLPLDTDLITHLLSTT 163
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENEA 163
F M VD AEHS+E+HLPY+ + ++VP++VG+ + EA
Sbjct: 164 STNTHFTTMSPPVDSAEHSLELHLPYIHHLLRTLYPTRAPAAYPRLVPMMVGSTSPATEA 223
Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------- 198
+G L A Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 224 AFGALLAPYLADDTNAFVVSSDFCHWGLRFGYTYYLPVPAASSTTTTTTRGTATGAAEAR 283
Query: 199 ---------------DKKHG------VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
+ G +IH+SI A D M I +G+ + F L N
Sbjct: 284 AAIDAVSGGGAFKARSRMLGGGAGVPLIHESISACDVACMRAIASGETELFLDALRRTGN 343
Query: 238 TICGRHPISVFLHMLGNCS---------------------TKIKIKFLRYEQSSQCKTKR 276
T+CGRHPI V + L K K F+RYE+SS
Sbjct: 344 TVCGRHPIGVVMAGLEAVRRSQGEGERDGDGDGDGDGDGLQKSKFYFVRYERSSDVVAAG 403
Query: 277 DSSVSYASA 285
+SSVSY SA
Sbjct: 404 ESSVSYVSA 412
>gi|365984735|ref|XP_003669200.1| hypothetical protein NDAI_0C02970 [Naumovozyma dairenensis CBS 421]
gi|343767968|emb|CCD23957.1| hypothetical protein NDAI_0C02970 [Naumovozyma dairenensis CBS 421]
Length = 335
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 118/333 (35%), Positives = 174/333 (52%), Gaps = 44/333 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M R A+HAGSWY+ N+ L+ +L +L + P R +I+PHAGY Y G AY+
Sbjct: 1 MSLSRPATHAGSWYSRNAAELSSQLQTYLNNSKKPIVKNSRIIISPHAGYRYCGSTMAYS 60
Query: 61 FGNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL---KATG 115
+ ++D +I R+F+LGPSHH +++ + +S TP+GDL +D ++ +L K +G
Sbjct: 61 YASLDLNENIKRIFILGPSHHIFFSNEIFVSAFDSISTPLGDLKVDKDLCSKLVRKKISG 120
Query: 116 K--FELMDICVDEAEHSMEMHLPYLAKVFEGHLI-----KIVPILVGAVNAENEAMYGRL 168
K F M+ VD EHS+EM L + + K++P++V E + G++
Sbjct: 121 KKIFSFMNQDVDTDEHSLEMQFSMLVQTLNWRKVPLDNVKVIPMMVSHNTKEFDMSVGKI 180
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHG 203
++Y++DPSN F +SSDFCHWG RF Y Y K H
Sbjct: 181 LSEYLNDPSNLFIISSDFCHWGRRFEYTGYVGSEDELKEATEEETEIEMLTSRSKLKHHQ 240
Query: 204 V-IHKSIEALDKMGM----DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC--S 256
+ I KSIE LDK M D + DA+K+YL NTICG PI V L L +
Sbjct: 241 IPIWKSIEILDKHAMRTLSDTQNSDRYDAWKQYLEITGNTICGERPIGVVLAALAYLPEN 300
Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+K+ ++ Y QSSQ ++ +SSVSY S +
Sbjct: 301 SKVTFEWPHYSQSSQVQSLEESSVSYTSGYVTI 333
>gi|260940813|ref|XP_002615246.1| hypothetical protein CLUG_04128 [Clavispora lusitaniae ATCC 42720]
gi|238850536|gb|EEQ40000.1| hypothetical protein CLUG_04128 [Clavispora lusitaniae ATCC 42720]
Length = 304
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 159/303 (52%), Gaps = 20/303 (6%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A+HAGSWYT + +RLA ++ + G P R +I PHAGY+Y G A +
Sbjct: 3 RPATHAGSWYTGDKRRLAAQIADFFAATG-SMVPGARVLIGPHAGYTYCGARLAETYAAW 61
Query: 65 DPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGK-FELMD 121
D T + RVF+LGPSHH Y+ +S Y+TP+G L +D EV ++L KA G F M
Sbjct: 62 DTTGVRRVFILGPSHHVYFRETAKVSPFARYETPLGSLRVDTEVCDQLVKAPGSAFTYMT 121
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLI--KIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
DE EHS EMH P++A + + IVPI++ + + Y+ DP+N
Sbjct: 122 EDEDEDEHSFEMHAPFVAFRAQKDNVAPTIVPIMISGMGPRLRDRLVAQLSPYMADPANT 181
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGV-------------IHKSIEALDKMGMDIIETGDPD 226
F VSSDFCHWG RF Y Y K + +H SIE LD+ GM G
Sbjct: 182 FVVSSDFCHWGRRFGYTQYTPKADLSLLAAYKHSSGQPLHASIEFLDRSGMRAASAGS-T 240
Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
++ Y+ NTICG+ P+SV LH++ +++ Y QS Q DSSVSYAS
Sbjct: 241 VWEDYIESTGNTICGQKPVSVVLHLVEKHVDGPLFEWIGYSQSGQAVDASDSSVSYASGY 300
Query: 287 AKV 289
++
Sbjct: 301 VRM 303
>gi|146103116|ref|XP_001469487.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134073857|emb|CAM72596.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 370
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 120/324 (37%), Positives = 167/324 (51%), Gaps = 39/324 (12%)
Query: 1 MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGY 50
M +R A+HA WY +RL +D + EA E + G+IAPHAG
Sbjct: 43 MRFVRPATHAAPHGRGWYEAIPERLKVTIDDYFSEATHCYDEEGHECARMTGLIAPHAGM 102
Query: 51 SYSGRAAAYAFG---------NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
SYSGR AA AF + + R+F+LGPSH C LS A+ Y+TP G L
Sbjct: 103 SYSGRTAAEAFAVFRDYLYAKGSKGSELERIFILGPSHTKGFEGCELSAASAYETPFGRL 162
Query: 102 PLDLEVIEEL-----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
+D V++ + KA DEAEHS+EM PYL+ +
Sbjct: 163 RVDTAVVDRVMTDLRKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPTTTGAPAQPA 222
Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-- 201
+ IVPI+VG + ++E + Y+DD NFF SSDFCHWG RF+Y ++ K+
Sbjct: 223 AGRVAIVPIIVGWTDRQDEKAICDVLKPYMDDARNFFIFSSDFCHWGERFSYTYHYKRSE 282
Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
+ I SI A+D M+++E D + + YL NTICGR PIS+ + + K ++
Sbjct: 283 YPNIGDSIIAMDHAAMELLEKRDLERWYAYLQMTKNTICGRAPISIGMQRWADEVNKARV 342
Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
KF+ Y QS++CK DSSVSYA+A
Sbjct: 343 KFVHYSQSNKCKDADDSSVSYAAA 366
>gi|392578607|gb|EIW71735.1| hypothetical protein TREMEDRAFT_56524, partial [Tremella
mesenterica DSM 1558]
Length = 304
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/293 (38%), Positives = 161/293 (54%), Gaps = 46/293 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPK---SP---EVRGVIAPHAGYSYSGRA 56
+RRASHAGSWY N +L EEL+ L + LP SP + + +I+PHAGY+YSG+A
Sbjct: 7 VRRASHAGSWYDANPIQLKEELESNLSQVVPLPTLDYSPPCQDAKAIISPHAGYTYSGQA 66
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA+A+ +I +VF++GPSHH ALS Y TP G++ L + I+ L+ TG+
Sbjct: 67 AAWAYASIPTDRYKKVFVIGPSHHQSFHGIALSPFKSYATPFGEILLCTDTIQALRETGE 126
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGH-LIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
F M DE EHS+EM +PYL +F+G +++VP++V A Y + A Y +D
Sbjct: 127 FTQMRSAGDEEEHSLEMQMPYLRLIFQGRDDLRLVPLIVSHPTAAQYESYASILAPYWND 186
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKH------------------------------GVI 205
P +FF S+DFCHWGSRF+Y +Y + G I
Sbjct: 187 PESFFVFSTDFCHWGSRFSYTNYYPQAPFPEDPVPPISSPGEKPAIPLTMRSTPGGGGQI 246
Query: 206 HKSIEALDKMGMDIIETGDP------DAFKKYLLEYDNTICGRHPISVFLHML 252
+SI+ +D G+ I+ DP D + YL NTICGR+ +S+FLH++
Sbjct: 247 WQSIQYMDHEGIAILR--DPTSPTAVDDWNAYLNRTGNTICGRYGLSLFLHLV 297
>gi|398024154|ref|XP_003865238.1| hypothetical protein, conserved [Leishmania donovani]
gi|322503475|emb|CBZ38560.1| hypothetical protein, conserved [Leishmania donovani]
Length = 370
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 120/324 (37%), Positives = 167/324 (51%), Gaps = 39/324 (12%)
Query: 1 MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGLPKSPE------VRGVIAPHAGY 50
M +R A+HA WY +RL +D + EA E + G+IAPHAG
Sbjct: 43 MRFVRPATHAAPHGRGWYEAIPERLKVTIDDYFSEATHCYDEEGHECARMTGLIAPHAGM 102
Query: 51 SYSGRAAAYAFG---------NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
SYSGR AA AF + + R+F+LGPSH C LS A+ Y+TP G L
Sbjct: 103 SYSGRTAAEAFAVFRDYLYAKGSKGSELERIFILGPSHTKGFEGCELSAASAYETPFGRL 162
Query: 102 PLDLEVIEEL-----KATGKFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
+D V++ + KA DEAEHS+EM PYL+ +
Sbjct: 163 RVDTAVVDRVITDLRKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPTTTGAPAQPA 222
Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-- 201
+ IVPI+VG + ++E + Y+DD NFF SSDFCHWG RF+Y ++ K+
Sbjct: 223 AGRVAIVPIIVGWTDRQDEKAICDVLKPYMDDARNFFIFSSDFCHWGERFSYTYHYKRSE 282
Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
+ I SI A+D M+++E D + + YL NTICGR PIS+ + + K ++
Sbjct: 283 YPNIGDSIIAMDHAAMELLEKRDLERWYAYLQMTKNTICGRAPISIGMQRWADEMNKARV 342
Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
KF+ Y QS++CK DSSVSYA+A
Sbjct: 343 KFVHYSQSNKCKDADDSSVSYAAA 366
>gi|320581608|gb|EFW95828.1| ATP-dependent helicase DHX8 , putative [Ogataea parapolymorpha
DL-1]
Length = 1522
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 105/267 (39%), Positives = 148/267 (55%), Gaps = 27/267 (10%)
Query: 12 SWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISR 71
SWYT ++ RL +LD WL +A P + R ++ PHAGY+Y+G A + +D + I R
Sbjct: 399 SWYTGSALRLESQLDKWLSQASGPVA-GARLLVGPHAGYAYAGETLAQTYSALDASGIKR 457
Query: 72 VFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEEL--KATGKFELMDICVDEAE 128
VF++GPSHH Y C + S Y+TP+G++P+D + I++L K + F LM +D E
Sbjct: 458 VFIMGPSHHVYFRGCVMTSNFDYYETPLGNVPVDKQTIKDLVAKDSKMFRLMSEEMDREE 517
Query: 129 HSMEMHLPYLAKVF---EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSD 185
HS EMH+P+L +V G + KI+PI++ A + E Y D N F +S+D
Sbjct: 518 HSFEMHMPFLYRVTSKARGSVPKIIPIMISATEEKFERKLAEHLTPYFQDHENAFVISTD 577
Query: 186 FCHWGSRFNYMHY----------DKKHGV----------IHKSIEALDKMGMDIIETGDP 225
FCHWGSRF Y Y D ++G I+KSIEALDK M I
Sbjct: 578 FCHWGSRFGYTSYTPSGKIADIQDLRYGTSVLKKSDALPIYKSIEALDKEAMKIASKTSY 637
Query: 226 DAFKKYLLEYDNTICGRHPISVFLHML 252
AF+ YL +NTICG P+SV L ++
Sbjct: 638 KAFRDYLAATENTICGAKPLSVLLLLM 664
>gi|327298557|ref|XP_003233972.1| hypothetical protein TERG_05841 [Trichophyton rubrum CBS 118892]
gi|326464150|gb|EGD89603.1| hypothetical protein TERG_05841 [Trichophyton rubrum CBS 118892]
Length = 375
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/320 (35%), Positives = 159/320 (49%), Gaps = 74/320 (23%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL-----REAGLPK--SPEVRGVIAPHAGYSYSGRA 56
IRR +HAGSWY+D+ L +LD WL + GL K P R +IAPHAGY+YSG
Sbjct: 3 IRRHTHAGSWYSDHGPTLEAQLDQWLDLVPGKLEGLGKLPVPGARVIIAPHAGYAYSGPC 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELK-- 112
AA+A+ +D + R+FL+GPSHH + A+ T Y TP+ LPLD +V+ +++
Sbjct: 63 AAFAYKCLDLSKAKRIFLIGPSHHLHLTTIAVPQLTEYLTPLSPMPLPLDTDVLAKIEQE 122
Query: 113 ---ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAEN 161
A F M VDE EHSME+HLPY+ ++ + +VP++VG+ +A N
Sbjct: 123 ASLADKPFSRMSRSVDEQEHSMELHLPYIHRLLQRLYPSMPTSAYPPLVPMMVGSTSAAN 182
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG------------------ 203
E ++G + Y+ D N F +SSDFCHWG+RF Y +Y + G
Sbjct: 183 ERVFGSILRPYLADEENAFVISSDFCHWGTRFAYTYYVQAPGPTTYKLPLTYPSLPVPSD 242
Query: 204 ----------------------------------VIHKSIEALDKMGMDIIETGDPDAFK 229
IH+S+ A D M I +G+ D F+
Sbjct: 243 MLTAEKAAEEVRAASSGRMLRENDKSYNKEKYMWAIHESVSACDMACMSAIASGNTDVFQ 302
Query: 230 KYLLEYDNTICGRHPISVFL 249
+ L NT+CGRHPI V +
Sbjct: 303 EALRRTGNTVCGRHPIGVVM 322
>gi|444316904|ref|XP_004179109.1| hypothetical protein TBLA_0B07740 [Tetrapisispora blattae CBS 6284]
gi|387512149|emb|CCH59590.1| hypothetical protein TBLA_0B07740 [Tetrapisispora blattae CBS 6284]
Length = 335
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/332 (36%), Positives = 180/332 (54%), Gaps = 46/332 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IRRA+HAGSWY+++S L+++L+ L +A K+P + R +++PHAGY Y G AY++
Sbjct: 3 IRRATHAGSWYSNHSLELSQQLESCLSKADTMKAPIDKARIIVSPHAGYRYCGPTMAYSY 62
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK--- 116
+++ T ++ R+F+LGPSHH Y+ + LS +TP+G+L +D E+ E+L GK
Sbjct: 63 ASLNLTQNVKRIFILGPSHHLYFRNEILLSKFNQLETPLGNLTIDNELNEKLIKDGKKHS 122
Query: 117 --FELMDICVDEAEHSMEMHLPYLAKVFEGHLI-----KIVPILVGAVNAENEAMYGRLF 169
F MD D EHS+EM P L + I KI+P+LV + + G++
Sbjct: 123 NIFNYMDKDTDLDEHSLEMQYPMLLQTLNWRKISPDKVKIIPMLVSHNTKDVDMSLGKIL 182
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-------------------------HGV 204
Y+ D N F +SSDFCHWG RF + Y H V
Sbjct: 183 LPYLKDEKNLFIISSDFCHWGRRFQFTGYVSDEKELEEAIEEETEIEMLTSRSKLPHHKV 242
Query: 205 -IHKSIEALDKMGMDII-ETGDPDAF---KKYLLEYDNTICGRHPISVFLHMLGNCSTK- 258
I +SIE LDK M ++ E+ + + + KKYL NT+CG +P SV L ++ +
Sbjct: 243 PIWQSIEILDKHAMKVLTESANIEKYSNWKKYLEISGNTVCGANPCSVILAIIALLKDER 302
Query: 259 -IKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
IK ++ +Y QSS+ DSSVSYAS AK+
Sbjct: 303 LIKFEWPKYSQSSKVTNVDDSSVSYASGFAKI 334
>gi|401884619|gb|EJT48772.1| spliceosome-related protein [Trichosporon asahii var. asahii CBS
2479]
gi|406694217|gb|EKC97549.1| spliceosome-related protein [Trichosporon asahii var. asahii CBS
8904]
Length = 1139
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 104/205 (50%), Positives = 129/205 (62%), Gaps = 7/205 (3%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSP---EVRGVIAPHAGYSYS 53
M +R ASHAGSWYT+NS RL EL G+L + P SP E + +IAPHAG+SYS
Sbjct: 850 MSGVREASHAGSWYTNNSSRLTAELSGYLDKVHPLNNPPFSPPVKEAKAIIAPHAGFSYS 909
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G AAA+A+ + I RV LLGPSHH Y ALS Y TPIGD+PLDLE I +L+
Sbjct: 910 GPAAAWAYAAVPVDKIKRVVLLGPSHHVYLSGIALSKFASYGTPIGDIPLDLEAIADLRK 969
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+ F M VDE EHS+EMHLPY+ +VF IK+VP+LVG + + +KY
Sbjct: 970 SRLFSDMKSSVDEDEHSLEMHLPYIRQVFGKRDIKLVPLLVGHPSEDKRKAISAALSKYW 1029
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHY 198
+D FF +SSDFCHWGSRF+ Y
Sbjct: 1030 NDDETFFIISSDFCHWGSRFSCTPY 1054
>gi|115390657|ref|XP_001212833.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114193757|gb|EAU35457.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 455
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 132/398 (33%), Positives = 174/398 (43%), Gaps = 116/398 (29%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R ASHAGSWY+D+ + L+ +LD WL + G P R +IAP+ GY S
Sbjct: 57 RDASHAGSWYSDSQRTLSRQLDQWLAQVPDQIEGLGSMPIPGARIIIAPY-GYDMSSSGY 115
Query: 58 AYAF-------------------------GNIDPTSIS------------RVFLLGPSHH 80
+ F N P+S + R+F+LGPSHH
Sbjct: 116 SADFLAMPATHTQVPAQLMPTKRWICRKRKNPKPSSFTTTTYGILPICSKRIFILGPSHH 175
Query: 81 YYTPKCALSTATVYKTPIGD--LPLDLEVIEELKATGK---------FELMDICVDEAEH 129
+Y AL T Y TP+ D LPLD E+I +L AT F M +DE EH
Sbjct: 176 HYLSTLALPELTSYSTPLSDDPLPLDTELINKLLATKATRPNGSSVGFTTMSRSIDEDEH 235
Query: 130 SMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
S+E+HLPY+ + + +VPI+VG+ +A E +G L A Y++DPSN F
Sbjct: 236 SIELHLPYIHRRLQLQYPDKPTSEYPPLVPIMVGSTSASTENAFGALLAPYLEDPSNAFV 295
Query: 182 VSSDFCHWGSRFNYMHY------------------------------------------- 198
+SSDFCHWGSRF Y +Y
Sbjct: 296 ISSDFCHWGSRFRYTYYVPQAPKPGPRLPLSAEDLPQPGDEAKEVEEKVEMVSAGQSLQR 355
Query: 199 ----DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGN 254
+ IH+SI A D M I TG F L + NT+CGRHPI V + + +
Sbjct: 356 GDRISSREPAIHESISAFDIATMAAITTGTTSKFLDILQKTGNTVCGRHPIGVIMAAIES 415
Query: 255 C-----STKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
S K + F+RYE+SS DSSVSY SA A
Sbjct: 416 VTSQEESKKGRFHFIRYERSSDAVDVTDSSVSYVSAVA 453
>gi|150866198|ref|XP_001385707.2| hypothetical protein PICST_36516 [Scheffersomyces stipitis CBS
6054]
gi|149387454|gb|ABN67678.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 345
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 176/346 (50%), Gaps = 58/346 (16%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-------PEVRGVIAPHAGYSYS 53
M IR A+HAGSWY++N +L +L+ + +A P R +I PHAG++YS
Sbjct: 1 MSYIRPATHAGSWYSNNPTKLGLQLEAYFHKAESHSGEDSRHIIPGARILIGPHAGFAYS 60
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G A F D + + R+F+LGPSHH Y+ +S Y+TP G++P+D E IE+L
Sbjct: 61 GERLAETFTVWDTSKVKRIFMLGPSHHVYFKNSVMVSQFEWYETPFGNIPVDTETIEKLL 120
Query: 113 AT--------------GKFELMDICVDEAEHSMEMHLPYLAKV---FEGHLIKIVPILVG 155
T F+ M +DE EHS EMH P++ + + KI+PIL+
Sbjct: 121 HTKPQSHGHSLTHAKDSVFKYMSEEMDEDEHSFEMHAPFIYQKTHDLPQGIPKIIPILIS 180
Query: 156 AVNAE-NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK------------- 201
++ + N+ + L Y+++ N F +SSDFCHWGSRF Y Y +
Sbjct: 181 GMDEKLNDEVVSALLP-YLENEENHFIISSDFCHWGSRFGYTKYVPQKVDSLQLLTENLS 239
Query: 202 ---HGV--------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH 250
H + I+KSIE LDK M+I +G +K Y+ + NTICG+ PI+V L
Sbjct: 240 SLGHSLRTKPNELPIYKSIEVLDKAAMEIASSGSYSDWKTYISQTGNTICGQKPIAVVLK 299
Query: 251 ML-------GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
++ G+ K++ Y QS+Q + DSSVSYAS +
Sbjct: 300 LIQKYRLAAGDTDKAAIFKWIGYSQSNQARRASDSSVSYASGYVTI 345
>gi|84996959|ref|XP_953201.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304197|emb|CAI76576.1| hypothetical protein, conserved [Theileria annulata]
Length = 297
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 102/245 (41%), Positives = 148/245 (60%), Gaps = 2/245 (0%)
Query: 47 HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
HAGY+YS + AA+A+ ID TSI +F+LGPSHH++ CA+ + +TP+G L +D++
Sbjct: 51 HAGYAYSLKTAAHAYSQIDATSIKTIFVLGPSHHFFLRGCAVDRFSSLQTPLGVLQVDVD 110
Query: 107 VIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG 166
++E+L F +++ E EHS+EMHLP L VF+ +K+VPI+VG +
Sbjct: 111 IVEKLSDLKGFSVINNEASEDEHSIEMHLPLLKFVFKKEHVKVVPIMVGEFSESLADELT 170
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNY--MHYDKKHGVIHKSIEALDKMGMDIIETGD 224
Y +D + F +SSDFCH+GSRF + Y+ ++ +++ IE LDK G+D+I
Sbjct: 171 GALVPYFNDENTLFVISSDFCHFGSRFQFSITGYESENKPLYEKIEMLDKRGIDLIVNHK 230
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
D F YL E +NTICGR+PI + L +L + I K L Y QSS+ DSSVSYAS
Sbjct: 231 YDDFLWYLNETENTICGRNPILLLLRLLAASNLNISSKLLHYSQSSRITRVSDSSVSYAS 290
Query: 285 AAAKV 289
V
Sbjct: 291 IVGLV 295
>gi|241950435|ref|XP_002417940.1| conserved hypothetical protein [Candida dubliniensis CD36]
gi|223641278|emb|CAX45658.1| conserved hypothetical protein [Candida dubliniensis CD36]
Length = 335
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/335 (34%), Positives = 174/335 (51%), Gaps = 51/335 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----------EVRGVIAPHAGYSY 52
IR A+HAGSWY++N +L +LD + +A K+P R +I PHAGY+Y
Sbjct: 3 IRPATHAGSWYSNNPAKLRNQLDQFFAKARSLKTPLSGEDASNGIDGARIIIGPHAGYTY 62
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
SG A +F D + + RVFLLGPSHH Y+ LS Y+TP G++P+D+ I++L
Sbjct: 63 SGERLAESFNAWDTSKVKRVFLLGPSHHVYFKNNALLSPFEYYETPFGNIPVDVSTIKDL 122
Query: 112 -----KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAEN 161
K F+ M +DE EHS EMH P++ ++G + KIVPIL+ ++++
Sbjct: 123 LDQKCKHGPVFKYMSEEIDEDEHSFEMHAPFI--YYKGETAVHGVPKIVPILISGMDSKL 180
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY----------------------D 199
+ Y+++ N F +SSDFCHWGSRF Y Y
Sbjct: 181 QDGIVNSLLPYLENEENHFIISSDFCHWGSRFGYTKYLLSGETTLDTLSDNITSLGSLHQ 240
Query: 200 KKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
K I+KSIE LDK+ + I G ++ KY+ NTICG+ PI + L +L + +
Sbjct: 241 PKGLEIYKSIELLDKIALQIATNGSSASWTKYIDISGNTICGQKPILIVLKLLEKYNNRS 300
Query: 260 KI-----KFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ ++ Y QS+ + +SSVSYAS ++
Sbjct: 301 LLHGNTFNWIDYSQSNPVQRTSESSVSYASGYVRL 335
>gi|294659081|ref|XP_461417.2| DEHA2F24794p [Debaryomyces hansenii CBS767]
gi|202953600|emb|CAG89832.2| DEHA2F24794p [Debaryomyces hansenii CBS767]
Length = 335
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/333 (35%), Positives = 170/333 (51%), Gaps = 48/333 (14%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA------GLPKSPEVRGVIAPHAGYSYSGRAA 57
IR ASHAG+WY +++ +L +L+G+ ++A G P R +I PHAG++YSG
Sbjct: 3 IRPASHAGTWYPNSTIKLGSQLEGFFKQAHNVSSNGKPVK-GARILIGPHAGFAYSGARL 61
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEEL----- 111
A F D + + RVF+LGPSHH Y AL S Y+TP G LP+D+EV ++L
Sbjct: 62 AETFTAWDTSKVKRVFILGPSHHVYFKSSALVSRYEYYETPFGKLPVDVEVTKQLTGIEG 121
Query: 112 -KATGKFELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAENEAMYGR 167
K + F+ M VD+ EHS EMH P++ +K + I+PI++ +
Sbjct: 122 SKGSRVFKYMSEDVDDDEHSFEMHTPFIYYKSKHLPQGVPSIIPIMISGMGDRLSTDIVD 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---------------DKKHGV-------- 204
Y+DD N F +SSDFCHWGSRF Y Y + +H
Sbjct: 182 ALLPYLDDEENTFIISSDFCHWGSRFGYTKYLLQKPTKGIQEDDLINLRHSTKMKSTDLP 241
Query: 205 IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIK---- 260
I++SIE LD+ M+I G + +++Y+ NTICG+ PI + L +L K
Sbjct: 242 IYQSIELLDREAMNIASAGSYNLWQQYISVTSNTICGQKPIGIILRLLEEYEKSGKSIVT 301
Query: 261 ----IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
++ Y QSS+ DSSVSYAS ++
Sbjct: 302 GSKVFNWINYSQSSEVHKHHDSSVSYASGYVQL 334
>gi|401842455|gb|EJT44665.1| YJR008W-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 338
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 119/332 (35%), Positives = 178/332 (53%), Gaps = 51/332 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
+R A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 VRPATHAGSWYSNRTQELSQQLHTYLTKSSL-KGPIRNARVIICPHAGYKYCGPTMAYSY 61
Query: 62 GNIDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-------K 112
++D +S + R+F+LGPSHH Y+ + +S + +TP+G+L +D ++ + L K
Sbjct: 62 ASLDISSDVKRIFILGPSHHIYFKNQILISAFSELETPLGNLKVDTDLCKALVKKEHSEK 121
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
F+LMD D AEHS+EM P L + + +KIVP++V + + + G
Sbjct: 122 GKKLFKLMDQDTDMAEHSLEMQFPMLVETLKWRGLSLDEVKIVPMIVSHNSIDIDHSIGD 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ +P+N F VSSDFCHWG RF Y Y H
Sbjct: 182 VLSEYIKNPNNLFIVSSDFCHWGRRFQYTGYVGSKEELDEAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDII-ETGDPD---AFKKYLLEYDNTICGRHPISVFLHMLGNC-- 255
V I +SIE +DK M + +T +P+ A+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDKYAMKTLADTANPERYGAWKQYLEITGNTICGEKPISVILSALSRIKD 301
Query: 256 --STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ IK + Y QSS + DSSVSY S
Sbjct: 302 ASALTIKFHWPHYSQSSHVTSIDDSSVSYTSG 333
>gi|365759923|gb|EHN01681.1| YJR008W-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 338
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 119/337 (35%), Positives = 180/337 (53%), Gaps = 51/337 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
+R A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 VRPATHAGSWYSNRTQELSQQLHTYLTKSSL-KGPIRNARVIICPHAGYKYCGPTMAYSY 61
Query: 62 GNIDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-------K 112
++D +S + R+F+LGPSHH Y+ + +S + +TP+G+L +D ++ + L K
Sbjct: 62 ASLDISSDVKRIFILGPSHHIYFKNQILISAFSELETPLGNLKVDTDLCKALVKKEHSEK 121
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
F+LMD D AEHS+EM P L + + +KIVP++V + + + G
Sbjct: 122 GKKLFKLMDQDTDMAEHSLEMQFPMLVETLKWRGLSLDEVKIVPMIVSHNSIDIDHSIGD 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ +P+N F VSSDFCHWG RF Y Y H
Sbjct: 182 VLSEYIKNPNNLFIVSSDFCHWGRRFQYTGYVGSKEELDEAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDII-ETGDPD---AFKKYLLEYDNTICGRHPISVFLHMLGNC-- 255
V I +SIE +DK M + +T +P+ A+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDKYAMKTLADTANPERYGAWKQYLEITGNTICGEKPISVILSALSRIKD 301
Query: 256 --STKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
+ IK + Y QSS + DSSVSY S ++
Sbjct: 302 ASALSIKFHWPHYSQSSHVTSIDDSSVSYTSGYVTLE 338
>gi|401419944|ref|XP_003874461.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490697|emb|CBZ25959.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 370
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/324 (37%), Positives = 163/324 (50%), Gaps = 39/324 (12%)
Query: 1 MEKIRRASHAG----SWYTDNSKRLAEELDGWLREAGL------PKSPEVRGVIAPHAGY 50
M +R A+HA WY +RL +D + EA + + +IAPHAG
Sbjct: 43 MRFVRPATHAAPHGRGWYEAIPERLKVTIDDYFSEATHCYDNEGKECARMTCLIAPHAGM 102
Query: 51 SYSGRAAAYAFGNIDP---------TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
SYSGR AA AF + R+F+LGPSH C LS A+ Y+TP G L
Sbjct: 103 SYSGRTAAEAFAVFREYLYAKGSKGCELERIFILGPSHTKGFEGCELSAASAYETPFGSL 162
Query: 102 PLDLE----VIEELKATG-KFELMDICVDEAEHSMEMHLPYLAKVFE------------- 143
+D VI +L+ G DEAEHS+EM PYL+ +
Sbjct: 163 KVDTAAVDLVITDLRKAGVGAATASRRTDEAEHSIEMETPYLSHILHYPPTTTGAPAQPA 222
Query: 144 GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG 203
+ IVPI+VG ++E + Y+DD NFF SSDFCHWG RF+Y ++ K+
Sbjct: 223 AGRVSIVPIIVGWTTRQDEKAIYDVLKPYMDDARNFFIFSSDFCHWGERFSYTYHYKRSE 282
Query: 204 VIH--KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
H SI A+D M+++E D + + YL NTICGR PIS+ + N K ++
Sbjct: 283 YPHIGDSIIAMDHAAMELLEKRDLEKWYAYLQITKNTICGRAPISIGMQRWANEVNKAQV 342
Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
KF+ Y QS++CK DSSVSYA+A
Sbjct: 343 KFVHYSQSNKCKDADDSSVSYAAA 366
>gi|344232722|gb|EGV64595.1| hypothetical protein CANTEDRAFT_120277 [Candida tenuis ATCC 10573]
Length = 326
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 118/322 (36%), Positives = 158/322 (49%), Gaps = 40/322 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A+H+GSWY+ + L ++ G+ P P R ++ PHAG+SYSG A F
Sbjct: 4 RPATHSGSWYSGSRHALHTQITGFFANKVNPI-PRARVLVGPHAGFSYSGHRLAETFSAW 62
Query: 65 DPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKA---TGK--FE 118
D + RVF+LGPSHH Y+ +S Y TPIG LP+D + L +GK F+
Sbjct: 63 DTDHVQRVFILGPSHHVYFRNHVKVSHYAFYDTPIGKLPVDTSINHHLTQGLHSGKRVFQ 122
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFE----GHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
MD DE EHS EMH P++ + + IVPI++ ++ R A Y
Sbjct: 123 YMDAETDEDEHSFEMHCPFIKHRLDIDQKSAGVTIVPIMISGMDGPCADAVCRALAPYFS 182
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKH------GV--------------------IHKS 208
DP+N F VSSDFCHWG RFNY Y K G+ IH+S
Sbjct: 183 DPANTFVVSSDFCHWGERFNYTQYVPKPLDYTAVGIDDVVAELMTVRSNKQLDTTEIHES 242
Query: 209 IEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGN---CSTKIKIKFLR 265
IE LD++ M I G +K+Y+ + NTICG P+ V + +L + +L
Sbjct: 243 IEILDRVAMHIASHGTWKDWKRYIEKTRNTICGEKPLMVVMRLLEGERAAAAGSAFHWLG 302
Query: 266 YEQSSQCKTKRDSSVSYASAAA 287
Y QSS DSSVSYAS A
Sbjct: 303 YSQSSSVINVHDSSVSYASGYA 324
>gi|353229017|emb|CCD75188.1| hypothetical protein Smp_097240.4 [Schistosoma mansoni]
Length = 240
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 135/220 (61%), Gaps = 5/220 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+ IR +SHAGSWY+ + L+ +LD WL E + VR +I PHAGY +SG AA
Sbjct: 5 LGSIRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAA 64
Query: 59 YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+A+ I+P I +F++GPSH CAL+ + Y+TP +L ++ V LK+ F
Sbjct: 65 HAYRLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCF 124
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDD 175
+++ DEAEHS+EM LPY+A + + + IVPI+VG+++ E + +G L + Y+ D
Sbjct: 125 SVLEKSQDEAEHSVEMQLPYIAHIMKRRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLD 184
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
N F +SSDFCHWG RF Y +Y+K G I +SIE LD +
Sbjct: 185 EKNLFVISSDFCHWGKRFRYQYYNKSDGAIWQSIEKLDHL 224
>gi|219564308|dbj|BAH03725.1| hypothetical protein [Saccharomyces pastorianus]
gi|219564312|dbj|BAH03728.1| hypothetical protein [Saccharomyces pastorianus]
Length = 338
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 122/337 (36%), Positives = 177/337 (52%), Gaps = 51/337 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ + L+++L +L ++ K P R +I PHAGY Y G A+++
Sbjct: 3 IRPATHAGSWYSNKPQELSQQLQTYLTKS-TTKGPIHNARIIICPHAGYRYCGHTMAHSY 61
Query: 62 GNIDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-----KAT 114
++D S R+F+LGPSHH Y+ + +S +TP+G L +D ++ + L A
Sbjct: 62 ASLDLNSKTKRIFILGPSHHVYFRNQILVSAFNGLETPLGSLNVDTDLCKALVKREHPAN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+LMD D AEHS+EM P L + + +K+VP++V + + + G
Sbjct: 122 GKKLFKLMDHDTDMAEHSLEMQFPMLVETLKWRGVSLEEVKVVPMMVSHNSTDIDQSIGE 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ +Y+ DP+N F VSSDFCHWG RF Y Y H
Sbjct: 182 ILFEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSREELNEAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDII-ETGDP---DAFKKYLLEYDNTICGRHPISVFLHML----G 253
V I +SIE +D+ M + ET + DA+K+YL NT+CG PISV L L G
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSETPNSERYDAWKQYLEITGNTVCGEKPISVILSALSKIEG 301
Query: 254 NCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
+T IK ++ Y QSS + DSSVSY S +D
Sbjct: 302 ARATNIKFQWPSYSQSSHVTSIDDSSVSYTSGYVTLD 338
>gi|448084446|ref|XP_004195606.1| Piso0_005007 [Millerozyma farinosa CBS 7064]
gi|359377028|emb|CCE85411.1| Piso0_005007 [Millerozyma farinosa CBS 7064]
Length = 344
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 119/340 (35%), Positives = 170/340 (50%), Gaps = 54/340 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS------------PEVRGVIAPHAGYS 51
+R A+H G WY+++S +LA +++ + ++A K R +I PHAGYS
Sbjct: 3 VRPATHGGLWYSNSSSKLAHQMEAYFKKAASLKEQLYGDDDITGPISGARILIGPHAGYS 62
Query: 52 YSGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
YSG A F D + + RVF+LGPSHH Y+ +S Y+TP+G+LP+D + +E
Sbjct: 63 YSGERLAETFTVWDTSKVKRVFILGPSHHVYFKNYAQISKYDYYETPLGNLPVDSSLSKE 122
Query: 111 LKA-----TGK--FELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAE 160
L + TG F+ M VDE EHS EMH PY+ +K + I+PI++ ++
Sbjct: 123 LVSYRGSRTGAPVFKYMSEEVDEDEHSFEMHAPYIYYRSKGLAQGVPSIIPIMISGMDDS 182
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK------------------- 201
+ Y+ D N F +SSDFCHWG RF + Y KK
Sbjct: 183 TCDDLASALSPYLADEENTFIISSDFCHWGYRFGFTEYLKKVPSGTNISSNDIITLTPST 242
Query: 202 ----HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
IH+SIE LDK+ M I G DA+KKY+ NTICG+ PI V L L +
Sbjct: 243 TTSPSMPIHRSIEHLDKLAMLIASKGSHDAWKKYIALTGNTICGQRPIYVILKTLDKYKS 302
Query: 258 --------KIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ ++ Y QSS + ++SSVSYAS K+
Sbjct: 303 SKPDLFQGQSVFNWINYSQSSPIRVFKESSVSYASGYVKL 342
>gi|290991406|ref|XP_002678326.1| predicted protein [Naegleria gruberi]
gi|284091938|gb|EFC45582.1| predicted protein [Naegleria gruberi]
Length = 309
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 113/302 (37%), Positives = 170/302 (56%), Gaps = 20/302 (6%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAG------LPKSPEVRGVIAPHAGYSYSGRAAA 58
RR + AG+WY D+ + L+ EL+ +L +A + +V+ +I+PHAGY YS +A
Sbjct: 5 RRDAFAGAWYQDDKEALSHELNDYLEKASSTSSTSIHHGKKVKAIISPHAGYRYSAPTSA 64
Query: 59 YAFGNIDPTS----ISRVFLLGPSHHYYTP-KCALSTATVYKTPIGDLPLDLEVIEEL-- 111
+ + + +S I VF+LGP H Y +C LS TP+G L ++ + EL
Sbjct: 65 HGYSCVLNSSNYDKIRTVFVLGPCHRLYLDDQCVLSKCDQCNTPLGALKVNKSIQSELVS 124
Query: 112 KATGKFE-LMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRL 168
K + F L D +E EHS+E+ P++A +F L I+IVPI+VG +++E+ +
Sbjct: 125 KYSDLFGYLKDKKNEEDEHSLELQFPFIAYLFREKLNQIQIVPIMVGDLSSESISRMAES 184
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDII--ETGDPD 226
Y+D + F +SSDFCHWG RF+Y + K + KSI+ LD +G+ I + +P+
Sbjct: 185 LTPYLDSEESLFVISSDFCHWGKRFSYQYVINKDISLSKSIQQLDMLGVSKITEHSKNPE 244
Query: 227 AFKKYLLEYDNTICGRHPISVFLHML--GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
F Y+ +Y NTICG++PI V L+ L N + FL Y QSS K+ DSSVSY S
Sbjct: 245 EFYNYMKKYKNTICGKNPILVMLNALQKSNNWKNHESLFLNYSQSSAAKSMEDSSVSYVS 304
Query: 285 AA 286
A
Sbjct: 305 IA 306
>gi|344302348|gb|EGW32653.1| hypothetical protein SPAPADRAFT_61718 [Spathaspora passalidarum
NRRL Y-27907]
Length = 341
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/339 (34%), Positives = 167/339 (49%), Gaps = 53/339 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-----------PEVRGVIAPHAGYSY 52
+R A+HAGSWYT++ +L +LD + +A + P R +I PHAGY+Y
Sbjct: 3 VRSATHAGSWYTNSPTKLRAQLDSFFEKAKRNEKTIHGPIANDSIPGARILIGPHAGYAY 62
Query: 53 SGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEEL 111
SG A F D + + RVFLLGPSHH Y AL S+ Y TP G++P+D E I +L
Sbjct: 63 SGERLAETFTAWDTSKVKRVFLLGPSHHVYFRNYALVSSFDSYATPFGEIPVDKETISKL 122
Query: 112 --------KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGH---LIKIVPILVGAVNAE 160
F+LM VDE EHS EMH P++ E + KI+PI++ ++ +
Sbjct: 123 LRKESKFDSRDAVFKLMSEEVDEDEHSFEMHAPFIHYRGEKAPLGMPKIIPIMISGMDEK 182
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY---------------------- 198
Y+D N F +SSDFCHWGSRF+Y Y
Sbjct: 183 LNEDIVECLLPYMDHEENHFIISSDFCHWGSRFSYTKYLNTEDTSLKTVGHDLYSLGTFN 242
Query: 199 DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK 258
K I+KSIE LD+M +DI G +++Y+ NTICG+ PIS+ L ++ K
Sbjct: 243 KPKELEIYKSIELLDRMALDIATKGSSRDWQEYIKITGNTICGQKPISIVLRLVEEYHKK 302
Query: 259 IK--------IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
++ Y QS+ + +SSVSYAS ++
Sbjct: 303 APRSSPVDGVFNWIGYSQSNPVRRSGESSVSYASGYVQL 341
>gi|448079961|ref|XP_004194508.1| Piso0_005007 [Millerozyma farinosa CBS 7064]
gi|359375930|emb|CCE86512.1| Piso0_005007 [Millerozyma farinosa CBS 7064]
Length = 344
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 172/344 (50%), Gaps = 62/344 (18%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS------------PEVRGVIAPHAGYS 51
+R A+H G WY+++S +LA +++ + +A K R +I PHAGYS
Sbjct: 3 VRPATHGGLWYSNSSSKLAHQMEAYFMKAASLKEQLYGDDDVTGPVSGARILIGPHAGYS 62
Query: 52 YSGRAAAYAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
YSG A F D + + RVF+LGPSHH Y+ +S Y+TP+G+LP+D V +
Sbjct: 63 YSGERLAETFTVWDTSKVKRVFILGPSHHVYFKNYAQISKYDYYETPLGNLPVDSTVSKA 122
Query: 111 LKA-----TGK--FELMDICVDEAEHSMEMHLPYL---AKVFEGHLIKIVPILVGAVNAE 160
L + TG F+ M VDE EHS EMH PY+ +K + I+PI++ ++
Sbjct: 123 LASYRGNRTGAPVFKYMSEEVDEDEHSFEMHAPYIYYRSKGLAQGVPSIIPIMISGMDDG 182
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK--HGV-------------- 204
+ Y+ D N F VSSDFCHWG RF + Y K+ G
Sbjct: 183 TCDDLASALSPYLADEENTFIVSSDFCHWGYRFGFTEYLKRVPSGTNISSNDIVTLTPST 242
Query: 205 -------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
IH+SIE LDK+ M I G DA+KKY+ NTICG+ PI V L ML
Sbjct: 243 ATSPSMPIHRSIEHLDKLAMLIASKGSHDAWKKYIALTGNTICGQRPIYVILKML----D 298
Query: 258 KIKIK------------FLRYEQSSQCKTKRDSSVSYASAAAKV 289
K KI+ ++ Y QSS + ++SSVSYAS K+
Sbjct: 299 KYKIQKPDLFQGQSVFNWINYSQSSPVRVFKESSVSYASGYVKL 342
>gi|320033881|gb|EFW15827.1| hypothetical protein CPSG_07454 [Coccidioides posadasii str.
Silveira]
Length = 274
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 133/217 (61%), Gaps = 24/217 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLRE--------AGLPKSPEVRGVIAPHAGYSYSGRA 56
R A+HAGSWY+DN+ L +LD W+ LP + R +IAPHAGY+YSG
Sbjct: 4 REATHAGSWYSDNAATLTRQLDEWMNRVPNEIEGIGSLPVA-GARIIIAPHAGYAYSGPC 62
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD--LPLDLEVIEEL--- 111
AA+A+ ++D + R+FLLGPSHH+ K AL + Y TP+ LPLD E+I+EL
Sbjct: 63 AAFAYKSLDLSKAKRIFLLGPSHHHPFSKIALPELSSYSTPLSQEPLPLDREIIDELSTR 122
Query: 112 --KATGKFELMDICVDEAEHSMEMHLPYLAKVFEG--------HLIKIVPILVGAVNAEN 161
T +F M+ +DEAEHS+E+HLPY+ + +G K+VP++VG+ +A
Sbjct: 123 TENGTVRFTTMNQAIDEAEHSLELHLPYIHYLLQGLYPGEPAASYPKLVPMMVGSTSAPT 182
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
E +GR+ A Y+ +P N F VSSDFCHWG RF Y +Y
Sbjct: 183 EQAFGRILAPYLANPENAFIVSSDFCHWGLRFAYAYY 219
>gi|50303155|ref|XP_451515.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640646|emb|CAH03103.1| KLLA0A11836p [Kluyveromyces lactis]
Length = 321
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 121/323 (37%), Positives = 163/323 (50%), Gaps = 42/323 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
M +R A+HAGSWY+ +S L L L R+ +P + + +I+PHAGY Y G A
Sbjct: 1 MSSVRPATHAGSWYS-SSPALITTLQQLLSRDEAVPGT---KHLISPHAGYKYCGDTMAL 56
Query: 60 AFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ +D T++ R ++GPSHH Y+ K +++ TP GD +D + EL GK
Sbjct: 57 GYSKLDFTNVKRCIIMGPSHHIYFKNKIQVTSFDEVNTPFGDFKVDGGLRNELLVGGKVV 116
Query: 119 LMDICVDEAEHSMEMHLPYL---AKV--FEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
LM VD EHS+EM P L AK+ + IKI+P+LV + + + G+L KYV
Sbjct: 117 LMSKEVDMDEHSLEMQFPMLWAAAKLRGVDPTNIKILPLLVSHNSTKVDFELGKLLQKYV 176
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHY---------------------------DKKHGV-I 205
DD F +SSDFCHWG RF Y Y H V I
Sbjct: 177 DDEETLFILSSDFCHWGRRFGYTGYVMESSDIDESLENDSEVEYLTQRSFNHKSSHNVPI 236
Query: 206 HKSIEALDKMGMDIIETGDPDAF---KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIK 262
+KSIE LD+ MDI+ + + KKY+ NTICG PI VFL + + +
Sbjct: 237 YKSIEILDQYAMDILGKNTENKYYDWKKYINVTGNTICGERPIGVFLCAMSLLNKPYQFN 296
Query: 263 FLRYEQSSQCKTKRDSSVSYASA 285
+ Y QSSQC DSSVSYAS
Sbjct: 297 WAHYSQSSQCVDLEDSSVSYASG 319
>gi|149239700|ref|XP_001525726.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451219|gb|EDK45475.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 356
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 174/355 (49%), Gaps = 68/355 (19%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLP----------------KSPEVRGVIAP 46
+R A+HAG WY+ ++ L ++LD + +A K P R +I P
Sbjct: 2 NVRPATHAGLWYSGDAGALKQQLDALILKAQRSANTTTATTATTTTKNGKVPGARVLIGP 61
Query: 47 HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTA-TVYKTPIGDLPLDL 105
HAGY+YSG A + D + + RVFLLGPSHH Y A+ T Y+TP+G+LP+D
Sbjct: 62 HAGYTYSGERLAETYNVWDTSKVKRVFLLGPSHHVYFKNYAMLTPFEFYETPLGNLPVDT 121
Query: 106 EVIEELKATGK--------FELMDICVDEAEHSMEMHLPYL----AKVFEGHLIKIVPIL 153
I++L +T F++M VDE EHS EMH P++ G+ KI+PI+
Sbjct: 122 ATIKKLLSTTSSSSISKPLFKVMAEDVDEDEHSFEMHAPFIYHQTVNTSGGNPPKIIPIM 181
Query: 154 VGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY--DKKHGV------- 204
+ ++ ++ ++ N F +SSDFCHWGSRF Y KH
Sbjct: 182 ISGMDHATQSDLAEALLPFIASEENHFIISSDFCHWGSRFGYTKVLPSNKHTTFNTLEKD 241
Query: 205 ---------------IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL 249
IH+S+ LDKM ++I G D +K+Y+ NTICG+ PIS+ L
Sbjct: 242 LVSLTSMSKLNSGTSIHESVANLDKMALEIATRGSSDMWKQYIKITGNTICGQKPISIVL 301
Query: 250 HML---GNCSTKIK------------IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+L G + IK +++ Y QS+ K+ RDSSVSYAS K+
Sbjct: 302 QLLEKYGRLNYNIKSEAGNTDSINNVFEWIGYSQSNPAKSVRDSSVSYASGYVKL 356
>gi|323308440|gb|EGA61685.1| YJR008W-like protein [Saccharomyces cerevisiae FostersO]
Length = 338
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 178/336 (52%), Gaps = 51/336 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGSTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D + + +E
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+ MD D AEHS+EM LP L + + +K++P++V + + + G
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ DP+N F VSSDFCHWG RF Y Y H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
V I +SIE +D+ M + G+ DA+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301
Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ IK ++ Y QSS + DSSVSY S +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSGYVTI 337
>gi|365764811|gb|EHN06331.1| YJR008W-like protein [Saccharomyces cerevisiae x Saccharomyces
kudriavzevii VIN7]
Length = 338
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 178/336 (52%), Gaps = 51/336 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-----KAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D ++ + L
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKVYPEN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+ MD D AEHS+EM LP L + + +K++P++V + + + G
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ DP+N F VSSDFCHWG RF Y Y H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
V I +SIE +D+ M + G+ DA+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301
Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ IK ++ Y QSS + DSSVSY S +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSGYVTI 337
>gi|151945084|gb|EDN63335.1| conserved protein [Saccharomyces cerevisiae YJM789]
gi|190409495|gb|EDV12760.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|207343905|gb|EDZ71221.1| YJR008Wp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256270208|gb|EEU05430.1| YJR008W-like protein [Saccharomyces cerevisiae JAY291]
gi|323332839|gb|EGA74242.1| YJR008W-like protein [Saccharomyces cerevisiae AWRI796]
gi|323354328|gb|EGA86168.1| YJR008W-like protein [Saccharomyces cerevisiae VL3]
Length = 338
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 178/336 (52%), Gaps = 51/336 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D + + +E
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+ MD D AEHS+EM LP L + + +K++P++V + + + G
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ DP+N F VSSDFCHWG RF Y Y H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
V I +SIE +D+ M + G+ DA+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301
Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ IK ++ Y QSS + DSSVSY S +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSGYVTI 337
>gi|6322467|ref|NP_012541.1| Mho1p [Saccharomyces cerevisiae S288c]
gi|1353049|sp|P47085.1|YJX8_YEAST RecName: Full=MEMO1 family protein YJR008W
gi|854581|emb|CAA60930.1| ORF YJR83.9 [Saccharomyces cerevisiae]
gi|1015634|emb|CAA89530.1| unnamed protein product [Saccharomyces cerevisiae]
gi|285812901|tpg|DAA08799.1| TPA: Mho1p [Saccharomyces cerevisiae S288c]
gi|392298436|gb|EIW09533.1| hypothetical protein CENPK1137D_1303 [Saccharomyces cerevisiae
CEN.PK113-7D]
Length = 338
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 119/336 (35%), Positives = 178/336 (52%), Gaps = 51/336 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D + + +E
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+ MD D AEHS+EM LP L + + +K+ P++V + + + G
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVFPMMVSHNSVDVDRCIGN 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ DP+N F VSSDFCHWG RF Y Y H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
V I +SIE +D+ M + G+ DA+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301
Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ IK ++ Y QSS + DSSVSYAS +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYASGYVTI 337
>gi|258577287|ref|XP_002542825.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237903091|gb|EEP77492.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 383
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 119/349 (34%), Positives = 167/349 (47%), Gaps = 77/349 (22%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
R A+H+GSWY+D++ L +LD W+ G+ P R +IAPHAG++YSGR A
Sbjct: 4 REAAHSGSWYSDDAATLTRQLDEWMGRVPNEIEGIGSLPVAGARVIIAPHAGFAYSGRCA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELKATG 115
A+A+ +D + R+FL+GPSHH+ K AL + Y TP+ LPLD EVI EL
Sbjct: 64 AFAYKCLDLSKAKRIFLIGPSHHHPFSKIALPEVSSYSTPLSPDPLPLDKEVIAELLNRA 123
Query: 116 -----KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENE 162
+F M +DEAEHS+E+HLPY+ + + K+VP++VG+ +A E
Sbjct: 124 ENGHVRFCTMSQAIDEAEHSLELHLPYIHYLLQRLYPDEPAASYPKLVPMMVGSTSAPTE 183
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
+GR+ A Y+ +P N F +SSDFCHWG RF Y +Y
Sbjct: 184 QAFGRILAPYLANPENAFIISSDFCHWGLRFAYAYYIDDVPSPGPVLPLSYDALPQPSEA 243
Query: 199 ----------------------DK--KHG---VIHKSIEALDKMGMDIIETGDPDAFKKY 231
D+ KH I++SI A D M I +G F
Sbjct: 244 LKLGSARRQITAVSSGRYLRAGDQLPKHADVPAIYESISACDIACMSAIASGHKQTFLDA 303
Query: 232 LLEYDNTICGRHPI----SVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
+ NTICGRHPI S F +LG I+ ++ +Q R
Sbjct: 304 IKSTGNTICGRHPIGLIMSAFEFVLGKDKENIRDLEIKAADETQTHLMR 352
>gi|349579198|dbj|GAA24361.1| K7_Yjr008wp [Saccharomyces cerevisiae Kyokai no. 7]
Length = 338
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 118/332 (35%), Positives = 177/332 (53%), Gaps = 51/332 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D + + +E
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+ MD D AEHS+EM LP L + + +K++P++V + + + G
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ DP+N F VSSDFCHWG RF Y Y H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
V I +SIE +D+ M + G+ DA+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301
Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ IK ++ Y QSS + DSSVSY S
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSG 333
>gi|145524772|ref|XP_001448213.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415747|emb|CAK80816.1| unnamed protein product [Paramecium tetraurelia]
Length = 258
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 80/207 (38%), Positives = 132/207 (63%), Gaps = 5/207 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+R A HAGSWY+ S L +++ WL +A + +++ ++ PHAGY+YSG AA+++
Sbjct: 51 VREAVHAGSWYSSKSNELKIQINCWLEQAKAEVTTVAQLKALVVPHAGYAYSGPTAAFSY 110
Query: 62 GNID---PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ P+ +VF+LGP H+ Y +C L+ +Y+TP+G++ +DLE +++L G FE
Sbjct: 111 KYLKKYPPSEKLKVFILGPCHYVYITQCCLTRQEIYETPLGNIKVDLETVKQLHEQGLFE 170
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D +E EHS+EM LP+LA + I+PI+VG+++A++E YG+L ++Y D
Sbjct: 171 QSDKDAEEEEHSIEMQLPFLAHILGTDNFTIIPIMVGSIDAKSEEYYGKLLSEYFDMDDT 230
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVI 205
F +S+DFCHWG++F Y +Y+ G I
Sbjct: 231 LFIISTDFCHWGTKFAYTYYNSADGEI 257
>gi|219564300|dbj|BAH03719.1| hypothetical protein [Saccharomyces pastorianus]
gi|219564304|dbj|BAH03722.1| hypothetical protein [Saccharomyces pastorianus]
Length = 338
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 177/336 (52%), Gaps = 51/336 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D + + +E
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+ MD D AEHS+EM LP L + + +K+ P++V + + + G
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVFPMMVSHNSVDVDRCIGN 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ DP+N F VSSDFCHWG RF Y Y H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHH 241
Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
V I +SIE +D+ M + G+ DA+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301
Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ IK ++ Y QSS + DSSVSY S +
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSYTSGYVTI 337
>gi|290771211|emb|CBK33739.1| EC1118_1J11_2674p [Saccharomyces cerevisiae EC1118]
Length = 336
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 115/329 (34%), Positives = 177/329 (53%), Gaps = 51/329 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D + + +E
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+ MD D AEHS+EM LP L + + +K++P++V + + + G
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGN 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY--------------------DKKHGVIH- 206
+ ++Y+ DP+N F VSSDFCHWG RF Y Y + ++H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLLHH 241
Query: 207 -----KSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS- 256
+SIE +D+ M + G+ DA+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRD 301
Query: 257 ---TKIKIKFLRYEQSSQCKTKRDSSVSY 282
+ IK ++ Y QSS + DSSVSY
Sbjct: 302 AGPSGIKFQWPNYSQSSHVTSIDDSSVSY 330
>gi|400595291|gb|EJP63096.1| memo-like protein [Beauveria bassiana ARSEF 2860]
Length = 373
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 170/317 (53%), Gaps = 43/317 (13%)
Query: 13 WYTDNSKRLAEELDGWLREA-----GLPKS-PEVRGVIAPHAGYSYSGRAAAYAFGNIDP 66
WY +L E D +L + G P P R VIAPHAGY+YSGR +A+AF +D
Sbjct: 56 WYELAPFQLRWEFDEYLEDVPQTVDGSPLPIPGARAVIAPHAGYAYSGRCSAWAFRCLDL 115
Query: 67 TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF----ELMDI 122
+ RVF+LGP+H++Y ALST + Y TP+G +D + + ++ A E+ +I
Sbjct: 116 SQAKRVFVLGPTHYFYFSGLALSTFSQYSTPLGAFSVDQQTLRDIAAAAASAGAPEVRNI 175
Query: 123 CVDEA--EHSMEMHLPYLAKVFEGHLIK------IVPILVGAVNAENEAMYGRLFAKYVD 174
EHS+EM + +L + E + IVP+LV NA++E GRL Y+
Sbjct: 176 PRRRELDEHSLEMEIAFLYQRCEATFSRPEDFPTIVPMLVSG-NADDEKAIGRLLLPYLR 234
Query: 175 DPSNFFSVSSDFCHWGSRF-NYMHY----DKKHGV-------------IHKSIEALDKMG 216
DP+N F VSSDFCHWG +F +Y Y D+K V IH+SI+ LD
Sbjct: 235 DPTNAFVVSSDFCHWGRQFGDYRPYFPGGDRKRRVNLRDGDAPPTSPPIHESIKMLDDEA 294
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVF---LHMLGNCST---KIKIKFLRYEQSS 270
++ +ETG DAF L E DN++CGRHPI L MLG + K K K +RY++S+
Sbjct: 295 IEAMETGVHDAFVANLEETDNSVCGRHPIGAMMAALEMLGENAPLEKKPKFKHVRYDRSA 354
Query: 271 QCKTKRDSSVSYASAAA 287
SSVSY SA A
Sbjct: 355 LLTDPTKSSVSYVSAYA 371
>gi|363752797|ref|XP_003646615.1| hypothetical protein Ecym_4787 [Eremothecium cymbalariae
DBVPG#7215]
gi|356890250|gb|AET39798.1| hypothetical protein Ecym_4787 [Eremothecium cymbalariae
DBVPG#7215]
Length = 329
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 120/327 (36%), Positives = 169/327 (51%), Gaps = 42/327 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYAF 61
R A+H+GSWY N L+++L + EA V+G +I+PHAGYSY G A A+
Sbjct: 3 RPATHSGSWYLSNPSSLSQQLGTFFTEAIKENGGPVKGTRAIISPHAGYSYCGETMAKAY 62
Query: 62 GNIDPT-SISRVFLLGPSHHYY-TPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-FE 118
+D T I RVF+LGPSHH+Y K +S V +TP+G L +D+E +E+L + + F
Sbjct: 63 AKLDITDQIKRVFILGPSHHFYFKNKGLISRYEVLETPLGSLKVDVETVEKLLSNPELFA 122
Query: 119 LMDICVDEAEHSMEMHLP--YLAKVFEG---HLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+D D+ EHS+EM P Y + G I +VPIL+ + + + G+ +Y+
Sbjct: 123 NLDQEADQDEHSLEMQYPMWYQTLIHNGLSPSKIGVVPILISHNSTKIDYTIGKCLLEYL 182
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--------------------------VIHK 207
+ F +SSDFCHWG RF Y Y H I+K
Sbjct: 183 KSGDSIFIISSDFCHWGRRFGYTGYVSSHEDIADAIAELTEIETLTTRSKLDHHCLKIYK 242
Query: 208 SIEALDKMGMDII--ETGDPD---AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIK 262
SIE LD+ MDI+ + G D A+K+YL NTICG P VFL +L +T K
Sbjct: 243 SIELLDRYAMDILAQKNGPKDTYAAWKEYLEVTGNTICGEKPTGVFLCLLSYLNTSHPFK 302
Query: 263 FLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ Y QSS ++ +SSVSYA+ V
Sbjct: 303 WSGYSQSSNVESLDESSVSYAAGYCVV 329
>gi|401409412|ref|XP_003884154.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325118572|emb|CBZ54123.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 254
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 154/294 (52%), Gaps = 52/294 (17%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
+RRA+HA SWY+++ L + D +L +A K+ E V ++ P
Sbjct: 3 VRRAAHASSWYSNDPSELTQLFDRFLAKAE--KTEENVIALVCPR--------------- 45
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATV--YKTPIGDLPLDLEVIEELKATGKFELM 120
RVF+LGPSHH Y P CAL +V Y TP+G++PLD EV+E L+ FE +
Sbjct: 46 --------RVFVLGPSHHIYLPGCALPPPSVRAYATPLGEIPLDAEVLEALRTQNVFETI 97
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ DE EHS+EM LP+L V G +VPI+VG +
Sbjct: 98 ALRDDEEEHSIEMQLPFLRYVLRGQTFTLVPIVVGDLRRSGH------------------ 139
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGV--IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
V+ DFCHWG RF Y + I++ I LDK G + IE DP AF++Y E NT
Sbjct: 140 -VAVDFCHWGRRFRYTYLPPAPASLPIYERIRILDKEGANTIEQQDPAAFQEYYEETGNT 198
Query: 239 ICGRHPISVFLHML---GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
ICG +PIS+FLH+L G + K K L Y QSSQ + + SSVSYA+ A+ +
Sbjct: 199 ICGHNPISIFLHLLEESGMPRSAFKTKLLHYSQSSQVEHESSSSVSYAAFASSL 252
>gi|256089011|ref|XP_002580612.1| hypothetical protein [Schistosoma mansoni]
Length = 225
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 128/210 (60%), Gaps = 5/210 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+ IR +SHAGSWY+ + L+ +LD WL E + VR +I PHAGY +SG AA
Sbjct: 5 LGSIRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAA 64
Query: 59 YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+A+ I+P I +F++GPSH CAL+ + Y+TP +L ++ V LK+ F
Sbjct: 65 HAYRLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCF 124
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDD 175
+++ DEAEHS+EM LPY+A + + + IVPI+VG+++ E + +G L + Y+ D
Sbjct: 125 SVLEKSQDEAEHSVEMQLPYIAHIMKRRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLD 184
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVI 205
N F +SSDFCHWG RF Y +Y+K G I
Sbjct: 185 EKNLFVISSDFCHWGKRFRYQYYNKSDGAI 214
>gi|256089009|ref|XP_002580611.1| hypothetical protein [Schistosoma mansoni]
Length = 224
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 87/209 (41%), Positives = 127/209 (60%), Gaps = 4/209 (1%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+ IR +SHAGSWY+ + L+ +LD WL E + VR +I PHAGY +SG AA
Sbjct: 5 LGSIRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAA 64
Query: 59 YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+A+ I+P I +F++GPSH CAL+ + Y+TP +L ++ V LK+ F
Sbjct: 65 HAYRLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCF 124
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE-GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+++ DEAEHS+EM LPY+A + + IVPI+VG+++ E + +G L + Y+ D
Sbjct: 125 SVLEKSQDEAEHSVEMQLPYIAHIMKRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLDE 184
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVI 205
N F +SSDFCHWG RF Y +Y+K G I
Sbjct: 185 KNLFVISSDFCHWGKRFRYQYYNKSDGAI 213
>gi|256089013|ref|XP_002580613.1| hypothetical protein [Schistosoma mansoni]
Length = 224
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 128/210 (60%), Gaps = 5/210 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+ IR +SHAGSWY+ + L+ +LD WL E + VR +I PHAGY +SG AA
Sbjct: 5 LGSIRVSSHAGSWYSADRTELSSQLDTWLESCENNVSNGSLVRAIIVPHAGYRHSGLCAA 64
Query: 59 YAFGNIDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+A+ I+P I +F++GPSH CAL+ + Y+TP +L ++ V LK+ F
Sbjct: 65 HAYRLINPDKIECIFIIGPSHRLDIGDTCALTCVSEYETPFYNLKINTNVYNSLKSLNCF 124
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIK--IVPILVGAVNAENEAMYGRLFAKYVDD 175
+++ DEAEHS+EM LPY+A + + + IVPI+VG+++ E + +G L + Y+ D
Sbjct: 125 SVLEKSQDEAEHSVEMQLPYIAHIMKRRKSQYSIVPIVVGSLSFEKQEFFGNLLSSYLLD 184
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVI 205
N F +SSDFCHWG RF Y +Y+K G I
Sbjct: 185 EKNLFVISSDFCHWGKRFRYQYYNKSDGAI 214
>gi|38047939|gb|AAR09872.1| similar to Drosophila melanogaster CG8031, partial [Drosophila
yakuba]
Length = 166
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/164 (52%), Positives = 111/164 (67%), Gaps = 3/164 (1%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWYTD+ L+ +LD WL A L P R +IAPHAGY+Y G A+A+ +
Sbjct: 4 RRATHAGSWYTDSGAELSRQLDRWLGAADLSHGP-ARAIIAPHAGYTYCGACGAFAYRQV 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
P + R+F+LGPSHH CALS A Y+TP+ DL +D ++ EL+ TG+F MD+
Sbjct: 63 IPVVVKRIFILGPSHHVRLRGCALSVAKKYRTPLYDLKIDTQINAELEKTGQFSWMDMKT 122
Query: 125 DEAEHSMEMHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYG 166
DE EHS+EMHLPY+AKV E + IVPILVG++N E EA YG
Sbjct: 123 DEDEHSIEMHLPYIAKVMEDYKDQFTIVPILVGSLNPEQEAQYG 166
>gi|374110238|gb|AEY99143.1| FAGR321Wp [Ashbya gossypii FDAG1]
Length = 330
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 118/324 (36%), Positives = 162/324 (50%), Gaps = 42/324 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYA 60
IR A+HAGSWY N RL+++L +L +A RG +I+PHAGYSY G A
Sbjct: 3 IRPATHAGSWYLSNPTRLSQQLSKFLNDATTEHGGVQRGTRAIISPHAGYSYCGETMAKC 62
Query: 61 FGNIDPT-SISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEELKATGK-F 117
+ +D RVF+LGPSHH++ AL S T +TP+G+L +D E + +L + F
Sbjct: 63 YAKLDVNKDTKRVFVLGPSHHFHFRNIALVSRYTALETPLGNLQVDEETVSKLLGHARLF 122
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF-----EGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
+D D+ EHS+EM P L + +KIVPIL+ + E + G++ +Y
Sbjct: 123 RPLDYDDDQKEHSLEMQYPMLHETLLQRGVRPSDVKIVPILISYNSTEVDYAIGKILIEY 182
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY-----DKKHGV---------------------IH 206
+ VSSDFCHWG RF Y Y D + V I+
Sbjct: 183 LQCNDTAVIVSSDFCHWGRRFGYTGYVSSVEDIQDAVADGTEIETLTTRSKMDHHSIQIY 242
Query: 207 KSIEALDKMGMDII-ETGDP----DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
KSIE LD+ MDI+ + D +K YL NTICG PI V L +T +
Sbjct: 243 KSIELLDRYAMDILGQMADSREKYQIWKDYLEVTGNTICGEKPIGVLLCAFSYVNTPLSF 302
Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
+ Y QSS+ K+ DSSVSYA+
Sbjct: 303 HWCGYSQSSEVKSLEDSSVSYAAG 326
>gi|302309543|ref|NP_986987.2| AGR321Wp [Ashbya gossypii ATCC 10895]
gi|299788408|gb|AAS54811.2| AGR321Wp [Ashbya gossypii ATCC 10895]
Length = 330
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 118/324 (36%), Positives = 162/324 (50%), Gaps = 42/324 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAYA 60
IR A+HAGSWY N RL+++L +L +A RG +I+PHAGYSY G A
Sbjct: 3 IRPATHAGSWYLSNPTRLSQQLSKFLNDATTEHGGVQRGTRAIISPHAGYSYCGETMAKC 62
Query: 61 FGNIDPT-SISRVFLLGPSHHYYTPKCAL-STATVYKTPIGDLPLDLEVIEELKATGK-F 117
+ +D RVF+LGPSHH++ AL S T +TP+G+L +D E + +L + F
Sbjct: 63 YAKLDVNKDTKRVFVLGPSHHFHFRNIALVSRYTALETPLGNLQVDEETVSKLLGHARLF 122
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF-----EGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
+D D+ EHS+EM P L + +KIVPIL+ + E + G++ +Y
Sbjct: 123 RPLDYDDDQKEHSLEMQYPMLHETLLQRGVRPSDVKIVPILISYNSTEVDYAIGKILIEY 182
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHY-----DKKHGV---------------------IH 206
+ VSSDFCHWG RF Y Y D + V I+
Sbjct: 183 LQCDDTAVIVSSDFCHWGRRFGYTGYVSSVEDIQDAVADGTEIETLTTRSKMDHHSIQIY 242
Query: 207 KSIEALDKMGMDII-ETGDP----DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
KSIE LD+ MDI+ + D +K YL NTICG PI V L +T +
Sbjct: 243 KSIELLDRYAMDILGQMADSREKYQIWKDYLEVTGNTICGEKPIGVLLCAFSYVNTPLSF 302
Query: 262 KFLRYEQSSQCKTKRDSSVSYASA 285
+ Y QSS+ K+ DSSVSYA+
Sbjct: 303 HWCGYSQSSEVKSLEDSSVSYAAG 326
>gi|238484865|ref|XP_002373671.1| DUF52 domain protein [Aspergillus flavus NRRL3357]
gi|220701721|gb|EED58059.1| DUF52 domain protein [Aspergillus flavus NRRL3357]
Length = 346
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 161/345 (46%), Gaps = 66/345 (19%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS---------PEVRGVIAPHAGYSYSGR 55
R ASHAGSWY+DN + L +LDGWL A +P S P R +IAP+ + S
Sbjct: 4 REASHAGSWYSDNQRTLTHQLDGWL--AQVPNSIEGIGSLPVPGARIIIAPYVPENISAP 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYK-----TPIGDLPLDL----- 105
++ DP + + LGP H +T + + T I PL
Sbjct: 62 CTVNSYA--DPLDMRAIHTLGPVPHTHTRLWIFRKPSEFSFLDHLTIITSRPLLFLNLLR 119
Query: 106 ----------EVIEELKATGK---FELMDICVDEAEHSMEMHLPYLAKVFEGHLIK---- 148
+ +K+ G F M +DE EHS+E+HLPY+ ++ + H
Sbjct: 120 TIPRFPTSLFHLTRTVKSNGSTISFTTMSRSIDEDEHSIELHLPYIHRLLQLHFPNKPTS 179
Query: 149 ----IVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------ 198
+VPI+VG+ +A E+ +G L A Y+ DP+N F +SSDFCHWG RF Y +Y
Sbjct: 180 QYPPLVPIMVGSTSASTESAFGALLAPYLQDPTNAFVISSDFCHWGLRFRYTYYVPQSVS 239
Query: 199 -----------DKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
+ IH+SI A D M I TG F + + NT+CGRHPI V
Sbjct: 240 TGQALQRRDRISSREPAIHESISAFDIATMAAISTGVTANFLETIQTTGNTVCGRHPIGV 299
Query: 248 FLHMLGNCST-----KIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
+ + ++ K K F+RYE+SS DSSVSY SA A
Sbjct: 300 IMAAIEEITSQEEGKKGKFHFVRYERSSDAVDVDDSSVSYVSAFA 344
>gi|221053714|ref|XP_002258231.1| hypothetical protein in Apicomplexan species [Plasmodium knowlesi
strain H]
gi|193808064|emb|CAQ38768.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 296
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 156/297 (52%), Gaps = 8/297 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M RRA HAGSWY++N L +D K V+ I PHAGY Y+ + ++
Sbjct: 1 MTNYRRAYHAGSWYSNNGTALKNSIDALFESINCQKQ-NVKAAICPHAGYDYALKTNSHV 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I+ ++ +F+LGP+HH Y C Y+TP G L ++ EVI ++ L
Sbjct: 60 YACINVENVKNIFILGPNHHIYNKGCLFPQVDKYETPFGFLQINKEVISDIMKNDVDHLF 119
Query: 121 DIC---VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM--YGRLFAKYVDD 175
D DE EHS+EM LP + + + IKI+PI VG + + + + + + KY D
Sbjct: 120 DFIGDEDDEEEHSIEMQLPLIKYIIKEKDIKIIPIYVGCIGNDIQKISRFCKPIKKYFQD 179
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH--KSIEALDKMGMDIIETGDPDAFKKYLL 233
N F SSDFCH+G RF++ + +K+ H K +E +DK +II F YL
Sbjct: 180 EENLFLFSSDFCHYGRRFSFTNILQKYNDTHIFKQVENMDKDAANIISRHSVADFIDYLN 239
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
+ NTICG +PI + L +L + + K + Y QS+Q ++ +SSVSYA + ++
Sbjct: 240 KTHNTICGSNPIKMMLRLLEDFPGNVSTKLMHYSQSNQARSTNESSVSYAGMVSFIN 296
>gi|156098107|ref|XP_001615086.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803960|gb|EDL45359.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 296
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 104/297 (35%), Positives = 158/297 (53%), Gaps = 8/297 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M R+A HAGSWY+ + + L +D E+ K V+ I PHAGY Y+ + ++
Sbjct: 1 MTSYRKAYHAGSWYSSSGRALKNSIDTHF-ESISCKKQSVKAAICPHAGYDYALQTNSHV 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I ++ +F+LGP+HH Y C Y+TP G L ++ EVI E+ L
Sbjct: 60 YACISVENVKNIFILGPNHHIYNKGCLFPHVEKYETPFGFLQINREVISEILQNDVDHLF 119
Query: 121 DIC---VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM--YGRLFAKYVDD 175
D DE EHS+EM LP + + + IKI+PI VG + + + + + KY D
Sbjct: 120 DFIGDEDDEEEHSIEMQLPLIKYIIKEKDIKIIPIYVGCIGNDIQKIDRFCNPLKKYFQD 179
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH--KSIEALDKMGMDIIETGDPDAFKKYLL 233
N F SSDFCH+G RF++ + +K+ H K +E +D+ II D F +YL
Sbjct: 180 EGNLFLFSSDFCHYGRRFSFTNILQKYNDTHIFKQVENMDRDAASIISRHDIADFIEYLN 239
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
+ NTICG +PI + L +L + K+ K + Y QS+Q ++ DSSVSYA + ++
Sbjct: 240 KTHNTICGSNPIKMMLQLLQDLPGKVSTKLMHYSQSNQARSANDSSVSYAGMVSFIN 296
>gi|156844360|ref|XP_001645243.1| hypothetical protein Kpol_1060p41 [Vanderwaltozyma polyspora DSM
70294]
gi|156115902|gb|EDO17385.1| hypothetical protein Kpol_1060p41 [Vanderwaltozyma polyspora DSM
70294]
Length = 331
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/330 (34%), Positives = 172/330 (52%), Gaps = 47/330 (14%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV----RGVIAPHAGYSYSGRAAAYA 60
R ASH+ SWY+D L+++L G+L G S E+ R +I+PHAGY Y G A++
Sbjct: 4 RPASHSSSWYSDKPDELSDQLQGYLN--GASSSAELKKDARLIISPHAGYKYCGSTMAHS 61
Query: 61 FGNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
+ ++D +I R+F+LGPSH+ Y+ + LS TP+GDL +D E L +
Sbjct: 62 YASLDLNENIERIFILGPSHNVYFRNQIFLSAFETLSTPLGDLNVDTAYCEMLINSNNSL 121
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHL-----IKIVPILVGAVNAENEAMYGRLFAK 171
F MD+ D +EHS+EM P L K + IK++P++V + + K
Sbjct: 122 FAYMDMETDISEHSLEMQFPMLVKTIQWRKLSLDKIKVIPMMVSHNTEDIDTKIAEFILK 181
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHY------------DK-------------KHGV-I 205
DP N F +SSDFCHWG RF + Y D+ +H V I
Sbjct: 182 KFRDPKNLFIISSDFCHWGRRFRFTGYVGSPEELQDSINDEVEIEMLTSRSKLPRHRVPI 241
Query: 206 HKSIEALDKMGMDIIE-TGDPDAF---KKYLLEYDNTICGRHPISVFLHM--LGNCSTKI 259
+SI+ LD+ MD+++ + +P+ + K+YL NTICG PI + L + L +I
Sbjct: 242 WESIKFLDEYAMDLLDKSENPEKYKDWKQYLELTGNTICGAKPIQLILKVIELEKYPERI 301
Query: 260 KIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ ++ +Y Q+SQ + DSSVSYAS A +
Sbjct: 302 RFEWPKYSQTSQVENVNDSSVSYASGYAII 331
>gi|366992726|ref|XP_003676128.1| hypothetical protein NCAS_0D01850 [Naumovozyma castellii CBS 4309]
gi|342301994|emb|CCC69766.1| hypothetical protein NCAS_0D01850 [Naumovozyma castellii CBS 4309]
Length = 331
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 115/314 (36%), Positives = 159/314 (50%), Gaps = 43/314 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR A+HAGSWY+ N+ L+ +L +L P R +I+PHAGY Y G AY+F +
Sbjct: 3 IRPATHAGSWYSKNATDLSHQLQTYLTSTKKPIVSNARVIISPHAGYRYCGHTMAYSFAS 62
Query: 64 IDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KAT-GK--F 117
+ T + RVF+LGPSHH Y+ + +S +TP+G+ +D +V ++L KA GK F
Sbjct: 63 LALTKKVKRVFILGPSHHIYFKNEVFVSGFDAIETPLGEFKVDKDVCKKLVKAKHGKKLF 122
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF-----EGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
M+ VD EHS+EM LA+ E +K++P++V + E + G AKY
Sbjct: 123 AFMNQDVDMDEHSLEMQFSMLAQTLKWRDIELEQVKVIPMMVSHNSTEVDMEVGEELAKY 182
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHGV-IH 206
+ DP N F +SSDFCHWG RF Y Y K H V I
Sbjct: 183 LVDPENLFILSSDFCHWGRRFEYTGYVGSEGELKEAMEEETEIEMLTARSKLKHHQVPIW 242
Query: 207 KSIEALDKMGMDIIETGDPD----AFKKYLLEYDNTICGRHPISVFLHMLGNC--STKIK 260
KSIE LD+ M + + A+K+YL NTICG PI V L L K+
Sbjct: 243 KSIEILDRAAMHTLSESTNENRYKAWKRYLEITGNTICGERPIGVILAALSYLPKDEKVT 302
Query: 261 IKFLRYEQSSQCKT 274
++ Y QSSQ +
Sbjct: 303 FEWPNYSQSSQVQN 316
>gi|50285415|ref|XP_445136.1| hypothetical protein [Candida glabrata CBS 138]
gi|49524439|emb|CAG58036.1| unnamed protein product [Candida glabrata]
Length = 332
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 117/329 (35%), Positives = 166/329 (50%), Gaps = 45/329 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IR A+HAG+WY+ ++K L+++L +L + + R +I+PHAGY Y G A+A
Sbjct: 1 MVSIRSATHAGTWYSAHAKDLSQQLLQYLESSSRECVSKARIIISPHAGYRYCGPTMAHA 60
Query: 61 FGNID-PTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL---KATG 115
+ ++D R+F+LGPSHH Y+ + +S +TP+G+L +D EV E L G
Sbjct: 61 YASLDLKKKPKRIFILGPSHHVYFKNEVLVSHFQEVETPLGNLNVDTEVCETLVGGSGDG 120
Query: 116 K--FELMDICVDEAEHSMEMHLPYLAKVFEGHLI-----KIVPILVGAVNAENEAMYGRL 168
+ F MD DE EHS+EM P LA+ I K+VP++V + + G
Sbjct: 121 RKLFRYMDKDTDEDEHSLEMQFPMLAQALLWRQIPLSEVKVVPLMVSHNSVNVDTSVGST 180
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHG 203
AKY+ D N F +SSDFCHWG RF+Y Y H
Sbjct: 181 LAKYMKDEENLFIISSDFCHWGRRFSYTGYVGSTEELENVIAEETEVEMLTTRSKLSHHQ 240
Query: 204 V-IHKSIEALDKMGMDIIETG--DPD----AFKKYLLEYDNTICGRHPISVFLHMLGNCS 256
V I KSIE +DK M ++ P+ +K+YL NTICG PISV L +
Sbjct: 241 VPIWKSIEIMDKYAMHVLSRNAKSPENLYSEWKQYLEISGNTICGEKPISVIACALASVG 300
Query: 257 TKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
K ++ Y QSS + +SSVSYA+
Sbjct: 301 YK-DFEWPAYSQSSNVNSINESSVSYAAG 328
>gi|2425141|gb|AAB70854.1| similar to C. elegans CEESS08F encoded by GenBank Accession Number
U64857 [Dictyostelium discoideum]
Length = 168
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 85/172 (49%), Positives = 116/172 (67%), Gaps = 5/172 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR +HAGSWY DN+++L ++L WL EA + V+ +IAPHAGYSYSGRAAAYA+ N
Sbjct: 2 IRNCTHAGSWYLDNARKLEKQLSDWLSEAS-RLNQNVKSIIAPHAGYSYSGRAAAYAYIN 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + RVF+LGPSHH Y C L+ ++TPIG+L +D + +L TG F
Sbjct: 61 LIPENYKRVFILGPSHHVYMKTCGLTKLDTWETPIGNLKVDKDTTNKLFDTGSFIWNTKS 120
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
VDE EHS+E+ LPY+AKV E IVPI+VG+++ + E +YG++ A Y DD
Sbjct: 121 VDEDEHSLELQLPYIAKVAEN----IVPIMVGSLSIDLEELYGKILAPYFDD 168
>gi|239615278|gb|EEQ92265.1| DUF52 domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327353638|gb|EGE82495.1| DUF52 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 393
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 95/223 (42%), Positives = 129/223 (57%), Gaps = 30/223 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYS 53
M R A+HAGSWY+D+ L+ +L+ WL + G P R +IAPHAGY+YS
Sbjct: 1 MSSSREATHAGSWYSDHEPTLSSQLNKWLSQVPNELPGLGRLPVPGARIIIAPHAGYAYS 60
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP--LDLEVIEEL 111
G AA+A+ +D + R+FL+GPSHH+Y AL T Y TP+ P LD ++I L
Sbjct: 61 GPCAAWAYKMLDLSKAKRIFLIGPSHHHYLSTIALPQLTRYLTPLSPDPLLLDTDLITHL 120
Query: 112 KATG---KFELMDICVDEAEHSMEMHLPYLAKVFEGHLI-------------KIVPILVG 155
+T +F M VD AEHS+E+HLPY+ HLI ++VP++VG
Sbjct: 121 LSTTTNPRFTTMSPAVDSAEHSLELHLPYI-----HHLIRTLYPTKPATAYPRLVPMMVG 175
Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
+ +A EA +G L A Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 176 STSAATEAAFGALLAPYLADETNAFIVSSDFCHWGLRFGYTYY 218
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 46/105 (43%), Gaps = 22/105 (20%)
Query: 205 IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST------- 257
IH+SI A D M I +G+ F L + NT+CGRHPI V + L
Sbjct: 287 IHESISACDVACMKAIASGETALFLDALRKTGNTVCGRHPIGVVMAGLEAVRKGSGEGEG 346
Query: 258 ---------------KIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
K K F+RYE+SS DSSVSY SA A
Sbjct: 347 EEDNESGGNGDRELEKGKFYFVRYERSSDVVAVADSSVSYVSAFA 391
>gi|261187552|ref|XP_002620196.1| DUF52 domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239594159|gb|EEQ76740.1| DUF52 domain-containing protein [Ajellomyces dermatitidis SLH14081]
Length = 340
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 95/223 (42%), Positives = 129/223 (57%), Gaps = 30/223 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYS 53
M R A+HAGSWY+D+ L+ +L+ WL + G P R +IAPHAGY+YS
Sbjct: 1 MSSSREATHAGSWYSDHEPTLSSQLNKWLSQVPNELPGLGRLPVPGARIIIAPHAGYAYS 60
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP--LDLEVIEEL 111
G AA+A+ +D + R+FL+GPSHH+Y AL T Y TP+ P LD ++I L
Sbjct: 61 GPCAAWAYKMLDLSKAKRIFLIGPSHHHYLSTIALPQLTRYLTPLSPDPLLLDTDLITHL 120
Query: 112 KATG---KFELMDICVDEAEHSMEMHLPYLAKVFEGHLI-------------KIVPILVG 155
+T +F M VD AEHS+E+HLPY+ HLI ++VP++VG
Sbjct: 121 LSTTTNPRFTTMSPAVDSAEHSLELHLPYI-----HHLIRTLYPTKPATAYPRLVPMMVG 175
Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
+ +A EA +G L A Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 176 STSAATEAAFGALLAPYLADETNAFIVSSDFCHWGLRFGYTYY 218
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 205 IHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPI 245
IH+SI A D M I +G+ F L + NT+CGRHPI
Sbjct: 287 IHESISACDVACMKAIASGETALFLDALRKTGNTVCGRHPI 327
>gi|225556304|gb|EEH04593.1| DUF52 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 320
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 92/215 (42%), Positives = 128/215 (59%), Gaps = 21/215 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
R A+HAGSWY+D+ L+ +L+ WL + G+ + P R +IAPHAGY+YSG A
Sbjct: 44 REATHAGSWYSDHEPTLSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPHAGYAYSGPCA 103
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELKATG 115
A+A+ +D + R+FLLGPSHH++ AL T YKTP+ LPLD ++I L +T
Sbjct: 104 AWAYKALDLSKAKRIFLLGPSHHHHLSTLALPQLTSYKTPLSPDPLPLDTDLITHLLSTT 163
Query: 116 K----FELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENEA 163
F M VD AEHS+E+HLPY+ + ++VP++VG+ + EA
Sbjct: 164 STNPHFTTMSPPVDSAEHSLELHLPYIHHLLRTLYPTRAPAAYPRLVPMMVGSTSPATEA 223
Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY 198
+G L A Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 224 AFGALLAPYLADDTNAFVVSSDFCHWGLRFGYTYY 258
>gi|123498609|ref|XP_001327441.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121910370|gb|EAY15218.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 286
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 93/256 (36%), Positives = 139/256 (54%), Gaps = 6/256 (2%)
Query: 35 PKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVY 94
P +V+ VI+PHAGY + A++AF IDP+ SRV ++GPSH C +S A +
Sbjct: 33 PIEGKVKAVISPHAGYRHCAETASHAFATIDPSLYSRVIIMGPSHRLPIDYCTISEAKSF 92
Query: 95 KTPIGDLPLDLEVIEEL--KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI 152
+TP L +D + EEL K F+ + I EHS+E+ LP++ +F+G + +VPI
Sbjct: 93 ETPTRSLEID-PIAEELTSKYGSIFKKLSIETSNREHSLELMLPWIDYIFKGKNVTVVPI 151
Query: 153 LVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEAL 212
+VG ++ Y++DPS +SSDF HWGSRF+Y + +K G I + I A+
Sbjct: 152 MVGHLDQTKLEQAVSALKPYINDPSTLLVISSDFTHWGSRFSYTYLPEKDGEIWEKISAI 211
Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQC 272
D M+ I T F+ Y+ TICGR+PI++ + ++ + Y QSS
Sbjct: 212 DHGAMEAISTCKAKNFQDYIKSTRATICGRNPITIAMMAFDK---PYRVDWPHYSQSSHA 268
Query: 273 KTKRDSSVSYASAAAK 288
K DSSVSYA +
Sbjct: 269 KNMADSSVSYAGGVIR 284
>gi|346320315|gb|EGX89916.1| DUF52 domain protein [Cordyceps militaris CM01]
Length = 370
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/314 (35%), Positives = 159/314 (50%), Gaps = 43/314 (13%)
Query: 13 WYTDNSKRLAEELDGWLREA-----GLPKS-PEVRGVIAPHAGYSYSGRAAAYAFGNIDP 66
WY +L D +L E G P P R +IAPHAGY+YSGR +A+AF +D
Sbjct: 56 WYELAPFQLRWNFDEYLDEVPETIQGSPLPIPGARAIIAPHAGYAYSGRCSAWAFRCLDL 115
Query: 67 TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-------KFEL 119
RVF+LGP+H++Y ALST Y TP+G +D + + +++A
Sbjct: 116 ARAKRVFVLGPTHYFYFRGLALSTFASYSTPLGAFRVDQQTLRDVRAAADALGGAPDVRN 175
Query: 120 MDICVDEAEHSMEMHLPYLAKVFE------GHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
M + EHS+EM + +L + E IVP+LV +A++E GRL Y+
Sbjct: 176 MPRRRELDEHSLEMEIAFLYQRCEAVFASPADFPAIVPMLVSG-DADDERAVGRLLLPYL 234
Query: 174 DDPSNFFSVSSDFCHWGSRF-NYMHY----DKKHGV------------IHKSIEALDKMG 216
DP+N F VSSDFCHWG +F +Y Y D+ V IH+SI+ LD
Sbjct: 235 RDPANAFVVSSDFCHWGRQFGDYRPYFRGGDRARRVNLRDADAPPSPPIHESIKMLDDEA 294
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML------GNCSTKIKIKFLRYEQSS 270
+ +ETG DAF L E +N++CGRHPI + L G + + K +RY++S+
Sbjct: 295 IAAMETGSHDAFVANLEETENSVCGRHPIGAMMAALELLGADGAPEDRPRFKHVRYDRSA 354
Query: 271 QCKTKRDSSVSYAS 284
SSVSY S
Sbjct: 355 LLTEPTKSSVSYNS 368
>gi|328853086|gb|EGG02227.1| hypothetical protein MELLADRAFT_38671 [Melampsora larici-populina
98AG31]
Length = 268
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 99/262 (37%), Positives = 142/262 (54%), Gaps = 34/262 (12%)
Query: 5 RRASHAGSWYTDNSK-RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
R A+HAGSWYT + + EE W + A + VIAPH GY YSG AA+A+
Sbjct: 4 RSATHAGSWYTSSRQAETTEETQDWRKVAMPVPISHCKAVIAPHVGYLYSGLTAAWAYKT 63
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL---- 119
IDP+ I P H Y CALS Y+ P+G+L +D++V + + F L
Sbjct: 64 IDPSVIY------PLSHVYLDGCALSGCEAYEIPLGNLEIDIQVSSQSVSIFIFLLGHLS 117
Query: 120 --------MDICVDEAEHSMEMHLPYLAKVFEG-HLIKIVPILVGAVNAENEAMYGRLFA 170
MDI D+AEHS+E+HLPY+ + + I IVPIL+G++++E E +YG++
Sbjct: 118 YLGIGAMEMDIKTDKAEHSIELHLPYVRHISKSFQTITIVPILIGSISSEEELIYGKVLR 177
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHY-DKKHGV----------IHKSIEALDKMGMDI 219
Y+ P N F +SSDFCH F+Y +Y D + I++SIEALD+ + I
Sbjct: 178 PYLLSPENLFVISSDFCHC---FSYTYYKDPARKLSSGASPPATPIYQSIEALDQEALKI 234
Query: 220 IETGDPDAFKKYLLEYDNTICG 241
+E+ F +YL + N ICG
Sbjct: 235 LESLSHSEFSRYLSQTKNNICG 256
>gi|323347913|gb|EGA82173.1| YJR008W-like protein [Saccharomyces cerevisiae Lalvin QA23]
Length = 312
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 105/295 (35%), Positives = 159/295 (53%), Gaps = 47/295 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGPTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-----KAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D ++ + L
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKXYPEN 121
Query: 115 GK--FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGR 167
GK F+ MD D AEHS+EM LP L + +K++P++V + + + G
Sbjct: 122 GKKLFKPMDHDTDMAEHSLEMQLPMLVETLXWREISLDTVKVIPMMVSHNSVDVDRCIGN 181
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKH 202
+ ++Y+ DP+N F VSSDFCHWG RF Y Y H
Sbjct: 182 ILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLXHH 241
Query: 203 GV-IHKSIEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHML 252
V I +SIE +D+ M + G+ DA+K+YL NTICG PISV L L
Sbjct: 242 QVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSAL 296
>gi|226288183|gb|EEH43696.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 407
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 161/374 (43%), Gaps = 114/374 (30%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-------GLPKSPEVRGVIAPHAGYSYSGRAA 57
R A+HAGSWY+D+ L+++LD WL + G P R +IAP
Sbjct: 55 REATHAGSWYSDHEPTLSKQLDQWLAQVPDELPDIGPLPIPGARIIIAPE---------- 104
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--DLPLDLEVIEELKATG 115
R+FL+GPSHH+ AL T Y TP+ LPLD E+I L +
Sbjct: 105 -------------RIFLIGPSHHHILSTLALPQLTSYLTPLSREPLPLDTELIAHLLTST 151
Query: 116 -----KFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENE 162
+F M VD AEHS+E+HLPY+ + +VPI+VG+ E
Sbjct: 152 NASNHRFTTMSPAVDSAEHSLELHLPYIHHLLRRLYPTKPTAQYPSLVPIMVGSTLPATE 211
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY------------------------ 198
A +G + A Y+ DPSN F +SSDFCHWG RF + +Y
Sbjct: 212 AAFGSVLAPYLADPSNAFIISSDFCHWGLRFGHTYYVPNARTPAEPLPLSYPQLPLPSTS 271
Query: 199 ---------------------DK---KHGV-----IHKSIEALDKMGMDIIETGDPDAFK 229
DK + GV IH+SI A D M I +G+ F
Sbjct: 272 DAGVAAEAIEAVSGGISLKVKDKDRVRKGVAGVPLIHESISACDVACMTAIASGETRRFL 331
Query: 230 KYLLEYDNTICGRHPISVFLHML----GNCS------------TKIKIKFLRYEQSSQCK 273
L + NT+CGRHPI V + L GN K + F+RYE+SS
Sbjct: 332 DALRKTGNTVCGRHPIGVVIAGLEIVRGNKEGGGKVDAVDGEMQKGRFHFIRYERSSDVV 391
Query: 274 TKRDSSVSYASAAA 287
DSSVSY SA A
Sbjct: 392 AVGDSSVSYVSAFA 405
>gi|358253784|dbj|GAA53777.1| hypothetical protein CLF_111104 [Clonorchis sinensis]
Length = 166
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 80/162 (49%), Positives = 105/162 (64%), Gaps = 8/162 (4%)
Query: 133 MHLPYLAKVFEGH--LIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWG 190
M LPY+AKV E IVPI+VG+++ EA+YGR+ A Y+ +P N F +SSDFCHWG
Sbjct: 1 MQLPYVAKVMERRRGAFTIVPIVVGSLSPAREAVYGRILAPYLLNPENLFVISSDFCHWG 60
Query: 191 SRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH 250
RF Y HYD+ G I +SI+ALD+ GM IE P F YL +Y NTICGRHPI + L
Sbjct: 61 RRFQYTHYDQHKGAIWQSIQALDEEGMHAIERLSPADFTAYLDQYGNTICGRHPIGILLQ 120
Query: 251 MLGNC------STKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
M+ + ST+ KF++Y QS C++ DSSVSYA+ +
Sbjct: 121 MVESLRKGSSPSTQFNFKFIKYTQSEHCQSMNDSSVSYAAGS 162
>gi|410079789|ref|XP_003957475.1| hypothetical protein KAFR_0E01860 [Kazachstania africana CBS 2517]
gi|372464061|emb|CCF58340.1| hypothetical protein KAFR_0E01860 [Kazachstania africana CBS 2517]
Length = 356
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/349 (33%), Positives = 174/349 (49%), Gaps = 63/349 (18%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
IR+ASHAG+WY NS +L +L L L + + R +I+PHAGY Y G AY++
Sbjct: 7 IRKASHAGTWYLQNSTQLQTQLQSNLNATKLVQPIKGSRIIISPHAGYRYCGNVMAYSYA 66
Query: 63 NIDPTSIS-------RVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
++D T RVF+LGPSHH Y+ K +S+ +TP+G+L +D EV++EL
Sbjct: 67 SMDLTKFKNASDDKIRVFILGPSHHIYFRDKILVSSFEEVETPLGNLTVDREVVDELIKK 126
Query: 115 GK-----FELMDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAM 164
FEL+D VD EHS+EM LA+ +K++P+LV + ++
Sbjct: 127 NHNDKKVFELLDKDVDLDEHSLEMQFSMLAQTLHWRSVDLSKVKVIPMLVSHNSETVDSK 186
Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-----DKKHGV--------------- 204
Y++D +FF +SSDFCHWG RF Y Y D +
Sbjct: 187 VAGFLKPYLNDGKSFFIISSDFCHWGRRFEYTGYVGNEDDLNEAMNEETEIEMLTSRSKL 246
Query: 205 ------IHKSIEALDKMGMDII---------ETGDPD-------AFKKYLLEYDNTICGR 242
I KSIE LDK M ++ ++ +P+ +K+YL NTICG+
Sbjct: 247 SHHQISIWKSIELLDKYAMKVLSGEEIKKETDSNEPNEDLTLYEKWKEYLDITGNTICGQ 306
Query: 243 HP--ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+P + + L N K + ++L Y QSS+ + +SSVSYAS ++
Sbjct: 307 NPIKLILKLLEELNEDEKFEFEWLNYSQSSKVSSIEESSVSYASGCVRI 355
>gi|313227292|emb|CBY22438.1| unnamed protein product [Oikopleura dioica]
Length = 176
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/173 (47%), Positives = 113/173 (65%), Gaps = 6/173 (3%)
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
MD DE EHS+EM LP+LA +F+ L + +VPI+VG++ E EA Y ++FAKY++DP
Sbjct: 1 MDAPTDEEEHSIEMQLPFLAHIFKNRLEEVSVVPIMVGSIKQEKEADYAQVFAKYLNDPE 60
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
F +SSDFCHWG RF Y + +++ I +SI+A+D+ GMD I + + F YL N
Sbjct: 61 VVFVISSDFCHWGKRFRYTYRLEEYEQIFESIDAVDREGMDHIASKNLSNFHNYLRRTKN 120
Query: 238 TICGRHPISVFLHMLGNCSTK----IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
TICGR+PI + L + K +IKFL+Y QSSQ ++ DSSVSYASA
Sbjct: 121 TICGRNPICLLLATINLLEQKGQMQSQIKFLKYAQSSQVRSPNDSSVSYASAV 173
>gi|403218503|emb|CCK72993.1| hypothetical protein KNAG_0M01400 [Kazachstania naganishii CBS
8797]
Length = 334
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 110/325 (33%), Positives = 159/325 (48%), Gaps = 43/325 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRA+HAGSWY + +L + + + G R V+APHAGY Y A+ ++
Sbjct: 10 RRATHAGSWYPSDGGQLRDTV---FKHRGQGPRAGARVVVAPHAGYKYCSSVLWPAYESL 66
Query: 65 DPT----SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFE 118
D + S RVF+LGPSHH Y+ +S + TP+G LP+D E E L + G F+
Sbjct: 67 DTSCPGSSHLRVFVLGPSHHAYFKDTFLVSGYSELDTPLGPLPVDTEYCESLVQGGGPFK 126
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLI-------KIVPILVGAVNAENEAMYGRLFAK 171
M VD AEHS+EM P L + E + ++VPI+V + E A + A+
Sbjct: 127 RMSEAVDLAEHSLEMQYPALRCLLEQRGLSSGARGAQVVPIMVSHASREKYASLAAVLAE 186
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHY--------------------DKKHGVIH----- 206
+ VSSDFCHWG RF Y Y + + H
Sbjct: 187 QLASQRAIVVVSSDFCHWGRRFEYTGYVGSQEELTTALAEETEVEELTARSKLAHHRLPI 246
Query: 207 -KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIK-IKFL 264
+S+E LD+ M ++E+ D D + +YL NT+CGR+P+ V L L +++
Sbjct: 247 WQSVELLDRGAMHVLESNDLDKWTEYLAITGNTVCGRNPLCVLLLALQQHRGGAPHWEWV 306
Query: 265 RYEQSSQCKTKRDSSVSYASAAAKV 289
+Y QSS+ + SSVSYASA K+
Sbjct: 307 KYAQSSKVQDITQSSVSYASAYIKL 331
>gi|366998487|ref|XP_003683980.1| hypothetical protein TPHA_0A04710 [Tetrapisispora phaffii CBS 4417]
gi|357522275|emb|CCE61546.1| hypothetical protein TPHA_0A04710 [Tetrapisispora phaffii CBS 4417]
Length = 341
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 108/339 (31%), Positives = 164/339 (48%), Gaps = 54/339 (15%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
R A+H+GSWY+D+ +RL EL L +A + R +I+PHAGY Y G AY++
Sbjct: 3 RPATHSGSWYSDDLRRLDAELSRHLVDASKNVNMVKGSRLIISPHAGYKYCGSTMAYSYA 62
Query: 63 NIDPTS-ISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKA-----TG 115
++D S + R+ ++GPSHH Y+ + +S + +TP+G+L +D + IE+ G
Sbjct: 63 SLDLNSNVKRIIIMGPSHHIYFKNQIYVSGFSEVQTPLGNLHVDKDFIEDKLMGNSIEKG 122
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLI-----KIVPILVGAVNAENEAMYGRLFA 170
F MD D +EHS+EM P + K + I K+VP++V + + +
Sbjct: 123 LFAYMDEETDLSEHSLEMQFPMVVKTLQFRNIDVDKVKVVPMIVSHNSEAVDTKLANVLK 182
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHY--------------------DKKHGVIHK--- 207
+P F VSSDFCHWG RF Y Y + +H+
Sbjct: 183 NEFMNPETIFIVSSDFCHWGRRFQYTGYVGNSEELKDSLEEQTEIEMLTSRSKSLHREVK 242
Query: 208 ---SIEALDKMGMDIIETGDPD----AFKKYLLEYDNTICGRHPISVFLHML-------- 252
SI +DK GM ++ D +K Y+ NT+CG +PI V L +
Sbjct: 243 IWESIMIIDKFGMKLLSDSDNSDKYKNWKNYIEISGNTVCGTNPIKVILKTIELIANEVV 302
Query: 253 --GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
G+ S+KI + Y QSS+ + +SSVSYAS A +
Sbjct: 303 SSGSDSSKIDFYWPSYSQSSKVTSVNESSVSYASGYAII 341
>gi|340502344|gb|EGR29045.1| mediator of cell motility 1, putative [Ichthyophthirius
multifiliis]
Length = 230
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/234 (35%), Positives = 121/234 (51%), Gaps = 38/234 (16%)
Query: 21 LAEELDGWLREAGL--PKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPS 78
L EL G+L + L +++ +I+PH G Y G AY++ I+P +I R+FLLGPS
Sbjct: 2 LEIELQGYLDKVNLNVQNIQKLKALISPHTGLKYCGAVQAYSYKYINPQNIQRIFLLGPS 61
Query: 79 HHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK--FELMDICVDEAEHSMEMHLP 136
H + C LS+ ++TP G++ +D+E+I EL K F +D ++ EHS+E+ LP
Sbjct: 62 HRIFIQGCGLSSMDFFETPFGNIEIDVEIINELYKNQKDYFVKLDQETEQNEHSLELQLP 121
Query: 137 YLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYM 196
L K ++PI+VG N E + YG++F+KY DD S F +SSDFCHWG
Sbjct: 122 MLKKQMGDKQFILIPIMVGQTNQELDVQYGKIFSKYFDDDSTLFIISSDFCHWG------ 175
Query: 197 HYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH 250
IE F+ YL NTICGR I++ L+
Sbjct: 176 -----------QIE-----------------FQNYLNSTKNTICGRRGINILLN 201
>gi|123427666|ref|XP_001307302.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121888923|gb|EAX94372.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 310
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 148/289 (51%), Gaps = 11/289 (3%)
Query: 4 IRRASHAGSWY-TDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR+A+HAG WY TD+ ++ E + ++GVI+PH+ Y R AAY+F
Sbjct: 4 IRKATHAGEWYPTDDQLKIMLEASFHYAQPKPNADLGLKGVISPHSCYQVCLRTAAYSFS 63
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT--GKFELM 120
I+P R+ +LG HH +S AT +TP G+L +D EV E+L + M
Sbjct: 64 CINPDKFERIIILGTCHHIALKAGLVSHATEVETPFGNLQVDTEVTEKLATEYGEAIQWM 123
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D VDE EHS+EM P + +++ +KI+P+L+G+++ E + + D FF
Sbjct: 124 DQKVDENEHSLEMQYPLIKYIWQDRPVKIIPMLIGSLSEPREIEIAEALSPIITDEKTFF 183
Query: 181 SVSSDFCHWGSRFNYMHYDK-KHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
+SSDF HWG F++ K + + ++ D+ + II + + F+ E +I
Sbjct: 184 IISSDFTHWGEIFHHTPIQSTKKKQLSQQLQIADERSIGIIHQFNYEHFRFICEEIHGSI 243
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQC--KTKRDSSVSYASAA 286
CG + I + L +L +F +++S C +T +D S+SY +A
Sbjct: 244 CGCYSICLMLRILAE-----GYEFEVFDRSELCELRTMKDFSISYVAAG 287
>gi|384490136|gb|EIE81358.1| hypothetical protein RO3G_06063 [Rhizopus delemar RA 99-880]
Length = 626
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/295 (35%), Positives = 147/295 (49%), Gaps = 65/295 (22%)
Query: 7 ASHAGSWYTDNSKRLAEELDGWLREA---------GLPKSPEVRGVIAPHAGYSYSGRAA 57
ASHAGSWYT++ K L +EL+ +++++ LP V+ +IAP
Sbjct: 381 ASHAGSWYTNDKKALDKELNLYIKKSVEEMTADGLQLPIK-HVKAIIAP----------- 428
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
RVF+LGPSH T +C L Y+TP+G+LP+D VIE+L TG F
Sbjct: 429 ------------KRVFILGPSHVCATRECLLPNYDTYETPLGNLPVDWSVIEDLHKTGLF 476
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE---GHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
L D + EHS+EMHLP++ K FE H + PI +V ++ ++
Sbjct: 477 GLADSEDEINEHSIEMHLPFIYKAFEKTQKHCLSSRPI---SVTDKDHGIH--------- 524
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
N + SS I++SI+ALD G+ IE+ D F+ YL E
Sbjct: 525 --LNRNNASSTIVR---------------PIYESIQALDHQGIKAIESLSFDQFENYLNE 567
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
DNTICGR+PI V L L + L+Y QSS+C+T DSSVSYAS +V
Sbjct: 568 TDNTICGRNPIGVLLACLEEMKGDRSARCLKYAQSSKCRTLEDSSVSYASIYIQV 622
>gi|82541169|ref|XP_724845.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23479636|gb|EAA16410.1| Protein of unknown function [Plasmodium yoelii yoelii]
Length = 264
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 89/257 (34%), Positives = 128/257 (49%), Gaps = 40/257 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M RRA H+GSWY+DNS L ++ + LPK +V+ I PHAGY+Y +++
Sbjct: 1 MPTYRRAYHSGSWYSDNSNVLKNSIESLFEKINLPKQ-QVKAAICPHAGYAYCLETSSHV 59
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG--------DLPLD-------- 104
+ I+ +I +F+LGP+HH Y C L Y+TP G +LPL
Sbjct: 60 YSCINVENIKNIFILGPNHHIYNKGCLLPQVDKYETPFGFLQINKDGNLPLATCHLPLAT 119
Query: 105 ---------------LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKI 149
+ I ++ +D DE EHS+EM LP + + + IKI
Sbjct: 120 YHLPLTTINVYIYMFISDIMNNDTQNLYDYIDEIDDEEEHSIEMQLPLIKYIIKEKDIKI 179
Query: 150 VPILVGA----VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG-- 203
VPI VG VN NE + KY D +N F SSDFCH+G RF++ + +K+
Sbjct: 180 VPIYVGCIGNDVNKINE--FSNPLKKYFQDKTNAFIFSSDFCHFGRRFSFTNILEKYNDK 237
Query: 204 VIHKSIEALDKMGMDII 220
IHK IE +DK G+++I
Sbjct: 238 YIHKKIENMDKDGINVI 254
>gi|385305757|gb|EIF49708.1| atp-dependent helicase dhx8 [Dekkera bruxellensis AWRI1499]
Length = 204
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 80/197 (40%), Positives = 111/197 (56%), Gaps = 9/197 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
IR A+HAG+WY L +LD +L G K + R ++ PHAGYSY+G A A+G
Sbjct: 10 IREATHAGTWYVGTPSVLDRQLDQFLN--GKTKIIDGARILLGPHAGYSYAGPTLAKAYG 67
Query: 63 NIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEEL--KATGKFEL 119
DP+ + RVF++GPSHH Y++ S Y TP+G +P+D E I L K F+
Sbjct: 68 XFDPSKVKRVFIMGPSHHVYFSGSVMTSKFDAYDTPLGKIPVDKETISXLMKKDPKMFKQ 127
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGH---LIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
M DE EH EMH+P+L KV E + KI+PI++ N + E + Y ++
Sbjct: 128 MSFQADEDEHCFEMHMPFLYKVTENSPNGVPKIIPIMISDSNEKFERKLAKDLQPYFENK 187
Query: 177 SNFFSVSSDFCHWGSRF 193
N F ++SDFCHWG RF
Sbjct: 188 ENAFIITSDFCHWGFRF 204
>gi|256089017|ref|XP_002580615.1| hypothetical protein [Schistosoma mansoni]
Length = 145
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 67/142 (47%), Positives = 95/142 (66%), Gaps = 2/142 (1%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RRASHAGSWY+ +L +L WL E + + P R +I+PHAGY YSG AA+A+
Sbjct: 5 VRRASHAGSWYSAERAKLNNQLQKWLSEVTVDRQP-TRAIISPHAGYDYSGPCAAFAYKQ 63
Query: 64 IDPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
IDP I R+F+LGP+H+ KCALSTA ++TP+ L + +V +L+ TG+F + +
Sbjct: 64 IDPRHIKRIFVLGPAHYMSLRGKCALSTADFFETPLYSLSIGKDVYRDLEETGEFVSLPL 123
Query: 123 CVDEAEHSMEMHLPYLAKVFEG 144
DE EHS+EM +PY+AK+ EG
Sbjct: 124 DRDEEEHSIEMQMPYIAKMMEG 145
>gi|403223866|dbj|BAM41996.1| predicted protein [Theileria orientalis strain Shintoku]
Length = 197
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 66/174 (37%), Positives = 105/174 (60%), Gaps = 4/174 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAF 61
+R A+HAGSWY+D+S+ L ++ GLP +P ++ +IAPHAGY+YS A +A+
Sbjct: 1 MRIATHAGSWYSDSSRALESQIKS--SSDGLPNVPNPTLKYIIAPHAGYAYSLSTAVHAY 58
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ID ++I VF+LGPSHH++ CA+ + +TP+G L +D+ +++ L F +++
Sbjct: 59 SHIDASTIKTVFVLGPSHHFFLRGCAVDQFSSLQTPLGQLQVDVNIVDSLSNLKGFSVVN 118
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
E EHS+EMHLP L +F+ +K+VPI+VG N Y +D
Sbjct: 119 KQASEDEHSIEMHLPLLKFMFKKENVKVVPIIVGDFNETMRDQIASSLLPYFND 172
>gi|156084952|ref|XP_001609959.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797211|gb|EDO06391.1| conserved hypothetical protein [Babesia bovis]
Length = 245
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/293 (31%), Positives = 139/293 (47%), Gaps = 58/293 (19%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RR+ HAGSWY+++++ L E++D L +A P ++ +I+P+
Sbjct: 1 MRRSIHAGSWYSNDAEILTEQIDQALSQAVEPFGSALKYIISPNT--------------- 45
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
VF+LGPSHH CA+ ++ +TP G+L +D ++ EL F+ +
Sbjct: 46 --------VFILGPSHHLPLKGCAVDVSSTLQTPFGELQVDNDITTELLKGKCFKELSKR 97
Query: 124 VDEAEHSMEMHLP---YLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
E EHS+EM LP Y+A IK+VPI+VG + E G+ Y + F
Sbjct: 98 NSEEEHSIEMQLPILHYVANKSNADHIKVVPIVVGYMLNEGLEDVGQALLPYFEKEDTIF 157
Query: 181 SVSSDFCHWGSRFNYMH--YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+SSDFCH+G RF + ++ + I K+IE+LD G+ +I D + KY
Sbjct: 158 VISSDFCHFGKRFGFTRTGFEDQDMPIWKAIESLDLDGVKLIVEHDLEVSNKY------- 210
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
IKI + QSS+C + DSSVSY S A + A
Sbjct: 211 --------------------IKITLV---QSSKCSERSDSSVSYCSIAGMLSA 240
>gi|300176356|emb|CBK23667.2| unnamed protein product [Blastocystis hominis]
Length = 195
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 103/167 (61%), Gaps = 1/167 (0%)
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
+E EHS+E+ LP++ +F G K+V ++VGAV++ ++ + A ++ DP F +SS
Sbjct: 4 EENEHSLEIELPFVYHLF-GSRAKVVLMMVGAVSSSYKSKIAPVLAPFLTDPKTVFVISS 62
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHP 244
DFCHWG F++++Y + I ++IE LDK G +IE DAF +Y+ + DNTICG +
Sbjct: 63 DFCHWGYNFDFLYYKDRSKPIWQNIEELDKRGAALIEQQSADAFAEYIEDTDNTICGHNC 122
Query: 245 ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
I +FL L + + LRY QSS+ ++ DSSVSY + ++A
Sbjct: 123 IEIFLRALAASGLQTTTRMLRYGQSSRAESMADSSVSYCALKTALNA 169
>gi|302507880|ref|XP_003015901.1| hypothetical protein ARB_06213 [Arthroderma benhamiae CBS 112371]
gi|291179469|gb|EFE35256.1| hypothetical protein ARB_06213 [Arthroderma benhamiae CBS 112371]
Length = 272
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 122/268 (45%), Gaps = 81/268 (30%)
Query: 101 LPLDLEVIEELK-----ATGKFELMDICVDEAEHSMEMHLPYLAKVFE--------GHLI 147
LPLD EV+ +++ A F M VDE EHSME+HLPY+ ++ +
Sbjct: 3 LPLDTEVLAKIEQEASSADRPFSRMSRSVDEQEHSMELHLPYIHRLLQRLYPTMPTSAYP 62
Query: 148 KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG---- 203
+VP++VG+ +A NE ++G + Y+ D N F +SSDFCHWG+RF Y +Y + G
Sbjct: 63 PLVPMMVGSTSAANERVFGSILRPYLADEENAFVISSDFCHWGTRFAYTYYVQAPGPTTY 122
Query: 204 ------------------------------------------------VIHKSIEALDKM 215
IH+S+ A D
Sbjct: 123 KLPLTYPSLPVPSDMLTAERAAEEVRAASSGRMLRENDKSYNKEKYMWAIHESVSACDMA 182
Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL-----------HMLGNCSTKIKIK-- 262
M I +G+ D F++ L NT+CGRHPI V + +G T+ +K
Sbjct: 183 CMSAIASGNTDVFQEALRRTGNTVCGRHPIGVVMAGIEGLQHQAETTMGAAVTQDGVKKG 242
Query: 263 ---FLRYEQSSQCKTKRDSSVSYASAAA 287
F+RYE+SS C + +DSSVSY SA A
Sbjct: 243 KFYFIRYERSSDCFSVQDSSVSYVSAFA 270
>gi|159108244|ref|XP_001704394.1| Hypothetical protein GL50803_112932 [Giardia lamblia ATCC 50803]
gi|157432456|gb|EDO76720.1| hypothetical protein GL50803_112932 [Giardia lamblia ATCC 50803]
Length = 285
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/292 (32%), Positives = 149/292 (51%), Gaps = 25/292 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYAF 61
+R +GSWY K ++ +LR+AG P+ VR +IAPHAG YSG AA+++
Sbjct: 1 MRLCLKSGSWY-PLPKEAKAQVAAFLRKAGEAVHPDANVRFIIAPHAGLVYSGLTAAHSY 59
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI---GDLPLDLEVIEELKATGKFE 118
+IDPT + V +LG H ++ + S + P+ G L +E I L + K
Sbjct: 60 SSIDPTRYTSVVMLGVCHAFHQRGLSTSPFASWANPLMEKGSPSLSMETIPGLPSCQK-- 117
Query: 119 LMDICVDEAEHSMEMHLPYLAKVF----EGHLIKIVPILV--GAVNAENEAMYGRLFAKY 172
D C E EHS+E+ +P+LA VF E +K + GA E +++ Y
Sbjct: 118 --DDC--EEEHSLELQIPFLAHVFANQIEAGTVKFSAVYCSYGATRTEIDSLM-----DY 168
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
V + ++ VSSDFCH+G RF + + ++++ LD ++ + G ++F++ L
Sbjct: 169 VTEHNSLIVVSSDFCHYGPRFQFTPMIQGK-TANETVTMLDNKCINGVMLG-ANSFEEAL 226
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
E NT+CG + I L +L +K + L Y +S + +T RD SVSY S
Sbjct: 227 KETQNTVCGHYTILTCLRVLEGLKLDVKKELLFYTKSDEPRTARDHSVSYVS 278
>gi|253744686|gb|EET00850.1| Hypothetical protein GL50581_1896 [Giardia intestinalis ATCC 50581]
Length = 285
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 149/292 (51%), Gaps = 25/292 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+R +GSWY + K ++ LR+A + P V+ +I+PHAG YSG AA+++
Sbjct: 1 MRLCLKSGSWY-PSPKEAKAQVAALLRKAEETVHPDPNVKFIISPHAGLVYSGLTAAHSY 59
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI---GDLPLDLEVIEELKATGKFE 118
+IDPT S V +LG H ++ + S + P+ G L +E + L + K +
Sbjct: 60 SSIDPTRYSSVVMLGVCHAFHQRGLSTSPFASWANPLTEKGSPSLSMEGVPGLSSCQKSD 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHL------IKIVPILVGAVNAENEAMYGRLFAKY 172
E EHS+E+ +P+LA VF + V GA AE +++ A+Y
Sbjct: 120 C------EEEHSLELQIPFLAHVFANQIEAGTVTFSAVYCSYGATKAEIDSL-----AEY 168
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ + ++ VSSDFCH+G RF + + ++++ LD ++ + G D+F++ L
Sbjct: 169 IVEHNSLIVVSSDFCHYGPRFQFTPLIQGK-TANETVTMLDNKCINGVMLG-ADSFEEAL 226
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
E NT+CG + I L +L + +K + L Y +S + KT RD SVSY S
Sbjct: 227 KETQNTVCGHYTIKTCLCILEGLNLNVKKELLFYTKSDEPKTTRDHSVSYVS 278
>gi|301063138|ref|ZP_07203692.1| memo-like protein [delta proteobacterium NaphS2]
gi|300442738|gb|EFK06949.1| memo-like protein [delta proteobacterium NaphS2]
Length = 362
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 148/291 (50%), Gaps = 15/291 (5%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP---EVRGVIAPHAGYSYSGRAA 57
++ + A+H G+WY + KRL L + +A PK+ + +I+PHAGY YSG A
Sbjct: 56 LKHVFTATHGGTWYPGDKKRLEALLTKFFDQAH-PKTTAKGRLMALISPHAGYPYSGPTA 114
Query: 58 AYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
++ + + ++ V ++G SH+ AL ++TP+G + + +++E+L
Sbjct: 115 SFGYKLLKQRDQNVKLVVIIGTSHYVPFHGMALPDWDGFRTPLGTVGVATKIVEKLDKEP 174
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
FE+ D EHS+EM +P L F +IVPI+ G++N E+ L ++
Sbjct: 175 PFEIFDQAFAR-EHSVEMQVPMLQ--FALKKFQIVPIVAGSLNMEDIQKAAELLRPLLNR 231
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P F +SSDF H+G + Y + + ++++ L + I+ D AF ++
Sbjct: 232 PDVLFVISSDFTHYGPNYGYAPFTDERP---ENLKKLADQALGTIKRLDLRAFMEHWQTT 288
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+TICG PISV L ML +T ++ L ++ S + ++SVSY S A
Sbjct: 289 GDTICGFFPISVLLAMLPQNTT---VEPLHFDTSGRITNNYENSVSYFSIA 336
>gi|308163228|gb|EFO65584.1| LigB protein [Giardia lamblia P15]
Length = 285
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/292 (31%), Positives = 149/292 (51%), Gaps = 25/292 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYAF 61
+R +GSWY K ++ +L++AG P+ VR ++APHAG +YSG AA+++
Sbjct: 1 MRLCLKSGSWY-PLPKEAKAQVAAFLQKAGETVHPDANVRFILAPHAGLAYSGLTAAHSY 59
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI---GDLPLDLEVIEELKATGKFE 118
+IDPT + V +LG H ++ + S + P G L +E I L + K
Sbjct: 60 SSIDPTRYTSVVMLGVCHAFHQRGLSTSPFASWANPFTEKGSPSLSMETIPGLPSCQK-- 117
Query: 119 LMDICVDEAEHSMEMHLPYLAKVF----EGHLIKIVPILV--GAVNAENEAMYGRLFAKY 172
D C E EHS+E+ +P+LA VF E +K + GA E +++ Y
Sbjct: 118 --DDC--EEEHSLELQIPFLAHVFANQIEAGTVKFSAVYCSYGATRTEIDSLM-----DY 168
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ + ++ VSSDFCH+G RF + + ++++ LD ++ + G D+F++ L
Sbjct: 169 ITEHNSLIVVSSDFCHYGPRFQFTPMIQGK-TANETVTMLDNKCINGVMLG-ADSFEEAL 226
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
E +T+CG + I L +L +K + L Y +S + KT RD SVSY S
Sbjct: 227 KETQSTVCGHYTILTCLRVLEGLKLDVKKELLFYAKSDEPKTARDHSVSYVS 278
>gi|70945450|ref|XP_742543.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521586|emb|CAH75500.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 187
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 70/183 (38%), Positives = 103/183 (56%), Gaps = 11/183 (6%)
Query: 47 HAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
HAGY+Y +++ + I+ +I +F+LGP+HH Y C L Y+TP+G L ++
Sbjct: 2 HAGYAYCLETSSHVYSCINVENIKNIFILGPNHHIYNKGCLLPQVDKYETPLGFLQINKN 61
Query: 107 VIEEL---KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA----VNA 159
VI ++ ++ +D DE EHS+EM LP + + + IKIVPI VG VN
Sbjct: 62 VISDIMNNDTQNLYDYIDELDDEEEHSIEMQLPLIKYIIKEKDIKIVPIYVGCIGNDVNK 121
Query: 160 ENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHG--VIHKSIEALDKMGM 217
NE + KY D +N F SSDFCH+G RF++ + +K+ IHK IE +DK G+
Sbjct: 122 INE--FCNPLKKYFQDETNAFIFSSDFCHYGRRFSFTNILEKYNDKYIHKKIENMDKDGI 179
Query: 218 DII 220
II
Sbjct: 180 SII 182
>gi|164661966|ref|XP_001732105.1| hypothetical protein MGL_0698 [Malassezia globosa CBS 7966]
gi|159106007|gb|EDP44891.1| hypothetical protein MGL_0698 [Malassezia globosa CBS 7966]
Length = 200
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 77/189 (40%), Positives = 102/189 (53%), Gaps = 27/189 (14%)
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
VDEAEHS+EMHLPY+ KV+ +KIVP+LVG + + Y FA+Y DP F +S
Sbjct: 5 VDEAEHSIEMHLPYIYKVWGERDVKIVPVLVGHLPEQMNFAYALCFAQYFADPRTLFVIS 64
Query: 184 SDFCHWGSRFNYMHYD---KKHGV---------------IHKSIEALDKMGMDIIETGD- 224
SDFCHWGSRF Y Y G+ I++SI+ LD GM I
Sbjct: 65 SDFCHWGSRFQYTWYQPTSTSKGIMLSSANKSCIEPKMPIYQSIQNLDAEGMSAISFNKH 124
Query: 225 -----PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKR 276
AF +L + NTICGR+PI + L +L + + +F Y+QS+ C +
Sbjct: 125 GSRRARQAFAMHLTKTGNTICGRNPILLLLTILEILEDRGAMFECRFTHYKQSNACCHVQ 184
Query: 277 DSSVSYASA 285
DSSVSYA+A
Sbjct: 185 DSSVSYATA 193
>gi|116748738|ref|YP_845425.1| hypothetical protein Sfum_1298 [Syntrophobacter fumaroxidans MPOB]
gi|116697802|gb|ABK16990.1| protein of unknown function DUF52 [Syntrophobacter fumaroxidans
MPOB]
Length = 276
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 132/286 (46%), Gaps = 18/286 (6%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYA 60
E+IR + AGSWY DN L E+ G+L+ A P + G++APHAGY YSG AA+A
Sbjct: 4 ERIRESVIAGSWYPDNPGALRNEIAGYLKRARPHPGRENLIGLVAPHAGYMYSGEVAAHA 63
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ ++ + RV ++ PSH P + ++TP+G +PLD+++++ +
Sbjct: 64 YRQLEGSRFERVLIVAPSHRTRFPSSTIYHLGGFRTPLGVVPLDVDLVQAFLEHPQLRGY 123
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
++ EHS+E+ LP+L V K+VPI++G E+ L K + +
Sbjct: 124 YPQAEDQEHSLEIQLPFLQVVLGE--FKLVPIIMGNQTYEHCTELAELIEKLCRNKNVLL 181
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSD H+ S Y + G++ + A D G+ F+ L + C
Sbjct: 182 IASSDLSHYHS---YREAKQLDGIVTARVAAFDCQGL----------FRD-LNDGACEAC 227
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
G P+ + G + + L Y S +R V Y +AA
Sbjct: 228 GAGPMVAVMLAAGKLGANVS-RVLHYANSGDVTGERSGVVGYMAAA 272
>gi|218200888|gb|EEC83315.1| hypothetical protein OsI_28694 [Oryza sativa Indica Group]
Length = 84
Score = 117 bits (294), Expect = 5e-24, Method: Composition-based stats.
Identities = 64/76 (84%), Positives = 71/76 (93%)
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
MGM+IIETG+PDAFK+YL EY+NTICGRHPISVFL ML +CSTKIKI F+RYEQSSQCK+
Sbjct: 1 MGMEIIETGNPDAFKQYLQEYENTICGRHPISVFLSMLKHCSTKIKIGFVRYEQSSQCKS 60
Query: 275 KRDSSVSYASAAAKVD 290
RDSSVSYASAAAKVD
Sbjct: 61 MRDSSVSYASAAAKVD 76
>gi|373457187|ref|ZP_09548954.1| AMMECR1-domain protein [Caldithrix abyssi DSM 13497]
gi|371718851|gb|EHO40622.1| AMMECR1-domain protein [Caldithrix abyssi DSM 13497]
Length = 502
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 114/209 (54%), Gaps = 15/209 (7%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR----GVIAPHAGYSYSGRAAA 58
KIR+ + AG++Y ++K L ++ +L++A +SPE++ G+IAPHAGY YSG AA
Sbjct: 33 KIRKPAVAGAFYPSSAKALQSMIESFLKKA---QSPEIKDAITGIIAPHAGYVYSGGVAA 89
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKF 117
YA+ + I RV ++ PSH Y A+ T Y+TP+G +P+D E +L K + +
Sbjct: 90 YAYKVLQGRKIKRVVVISPSHVEYFSGAAVYDGTHYETPLGLIPVDAEFARKLAKKSDRL 149
Query: 118 ELMD-----ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
EL + EH++E+ LP+L V + ++VPI++G + E GR A
Sbjct: 150 ELSEHGHTINFQGRGEHALEVQLPFLQVVLDD--FQLVPIVMGEQSYETARALGRALADL 207
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKK 201
++D SSD H+ S + DKK
Sbjct: 208 IEDDRTIIVASSDLSHFHSYQKAIELDKK 236
>gi|349804847|gb|AEQ17896.1| putative protein memo1 [Hymenochirus curtipes]
Length = 100
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 53/100 (53%), Positives = 71/100 (71%), Gaps = 1/100 (1%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWYT + +L+ +L+GWL +A K P R +IAPHAGY+Y G AA+A+ +
Sbjct: 1 REASHAGSWYTASGPQLSAQLEGWLSQAQTSKRP-ARAIIAPHAGYTYCGSCAAHAYKQV 59
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD 104
DP+ RVF+LGPSHH +CALST +Y+TP+ DL +D
Sbjct: 60 DPSVTRRVFILGPSHHVALSRCALSTVDLYRTPLYDLHVD 99
>gi|361130005|gb|EHL01881.1| putative protein MEMO1 [Glarea lozoyensis 74030]
Length = 201
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 96/200 (48%), Gaps = 59/200 (29%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ASHAGSWY N L+ EL+G+L A +P + + + + P A
Sbjct: 4 RSASHAGSWYKANRTALSSELEGYL--AAVPAAFDEQQLPIPKA---------------- 45
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
+ ++ ATV EELKATGKF+ + V
Sbjct: 46 --------------------RVIIAPATV---------------EELKATGKFDSIPRDV 70
Query: 125 DEAEHSMEMHLPYLAKVFEGHL------IKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
DEAEHS+EMH PY+ + H K+VP++VG E YG+LF+ Y+ DP++
Sbjct: 71 DEAEHSLEMHCPYIYTMLSRHFDSPADYPKLVPVMVGNTAPAAEKEYGKLFSSYLADPTS 130
Query: 179 FFSVSSDFCHWGSRFNYMHY 198
F +SSDFCHWG RF Y +Y
Sbjct: 131 VFVISSDFCHWGGRFRYTYY 150
>gi|390952709|ref|YP_006416468.1| putative dioxygenase [Thiocystis violascens DSM 198]
gi|390429278|gb|AFL76343.1| putative dioxygenase [Thiocystis violascens DSM 198]
Length = 261
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 129/286 (45%), Gaps = 31/286 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +R+ + A +Y + + L LDG+L A P++P + VI PHAGY YSG AA A
Sbjct: 1 MSLVRQPAVADRFYPGDPRELTRMLDGFLANAQ-PRTPPPKAVIVPHAGYIYSGPIAASA 59
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + P S++RV L+GPSH A S+A + TP+G +PLD I + A +
Sbjct: 60 YATLAPARESVTRVILMGPSHRVPFRGLATSSADWFDTPLGAVPLDRVAINQALALPQVR 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
MD EHS+E+ LP+L +V + P +VG + E+ A L P
Sbjct: 120 SMDEA-HALEHSLEVQLPFLQRVLVD--FSLAPFVVGDASPESVAEVLELLWG---GPET 173
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+SSD H+ H D +S LD IE+ PD + YD
Sbjct: 174 LIVISSDLSHY-------HDD-------RSARRLDAATSAAIESLRPDD-----IGYDQA 214
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CGR P++ L + + + LR S RD V Y +
Sbjct: 215 -CGRTPVNGLLTVAKRRGMQARTLDLR--NSGDTAGSRDQVVGYGA 257
>gi|333984577|ref|YP_004513787.1| hypothetical protein [Methylomonas methanica MC09]
gi|333808618|gb|AEG01288.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methylomonas methanica MC09]
Length = 259
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 130/269 (48%), Gaps = 41/269 (15%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
R + AGS+Y ++ L+ L+G+L R AG +P + +I PHAGY YSG AA A+
Sbjct: 3 RAPAVAGSFYPADAGTLSHMLNGFLDRTAGDEHAP--KAIIVPHAGYIYSGPIAASAYAR 60
Query: 64 IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ P S+SRV L+GPSH A+S A + TP+GD+ +D I++L+ ++
Sbjct: 61 LKPVRDSVSRVVLIGPSHRVAFQGLAVSAADTFSTPLGDIAIDQSAIQQLQQLPFVGYLE 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-----AVNAENEAMYGRLFAKYVDDP 176
EHS+E+HLP+L + K+VPI+ G V+ E ++G P
Sbjct: 121 EA-HALEHSLEVHLPFLQQTLSH--FKLVPIVAGDASPHQVSQVLEILWG--------GP 169
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
+SSD H+ +D + + + LDK ++IE D A
Sbjct: 170 ETLIVISSDLSHY--------HD------YATSQTLDKHTSELIEALDYQALSS------ 209
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLR 265
+ CGR P+S L +L S IK LR
Sbjct: 210 DAACGRVPVSGLLKLLREKSLAIKTIDLR 238
>gi|119719191|ref|YP_919686.1| hypothetical protein Tpen_0273 [Thermofilum pendens Hrk 5]
gi|119524311|gb|ABL77683.1| protein of unknown function DUF52 [Thermofilum pendens Hrk 5]
Length = 287
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 135/295 (45%), Gaps = 30/295 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSPE--------VRGVIAPHAGYSYSGR 55
R A HAG +Y + L E L + +L E G K PE + GV+APHAGY YSG
Sbjct: 6 RPAVHAGYFYPAEKRELIEALKNAFLHELGPGKLPEKPGSYTGNLIGVVAPHAGYIYSGH 65
Query: 56 AAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+A+ + + VF+LGP+HH AL V++TP+GD+ +D V +EL +
Sbjct: 66 VAAHAYLALAEGGKPDVVFILGPNHHALGAPIALDENEVWETPLGDVEVDFRVSKELASR 125
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+ D EHS+E+ +P+L VF G IVPI + E G A +
Sbjct: 126 EQIIRFDFQAHAYEHSIEVQVPFLQFVF-GEGFTIVPISMMLQTPEAARRVGEAIAGLIM 184
Query: 175 DPS--NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA-LDKMGMDIIETGDPDAFKKY 231
+ + SSD H+ + I A DK+ ++ + D +
Sbjct: 185 EKGLRAYVVASSDMSHY---------------VESRIAAEKDKVAIERVLFLDAEGLYNA 229
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
++E D ++CG P+ +++ ++ L+Y S + S V+YAS A
Sbjct: 230 VIEEDISMCGPGPVMALIYIAKKLGYA-NVRLLKYADSGDVTGDKSSVVAYASLA 283
>gi|91204537|emb|CAJ70765.1| conserved hypothetical protein [Candidatus Kuenenia
stuttgartiensis]
Length = 347
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 134/291 (46%), Gaps = 25/291 (8%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFGNIDPTS 68
AG +Y N L +++ +L E KS +I+PHAGY YSG+ AAY + I
Sbjct: 37 AGRFYPGNEAALKGQINEFLNEVSSQKSNGRPLAIISPHAGYVYSGQVAAYGYSAIKGHG 96
Query: 69 ISRVFLLGPSHHYYTPKCA-LSTATVYKTPIGDLPLDLEVIEELKATG---------KFE 118
RV +L PSH + A + AT YKTP+G + +D E + L T
Sbjct: 97 FKRVIVLSPSHSGRRYRGASILKATSYKTPLGKISIDQEACDYLLNTSFTAESKNKRNSS 156
Query: 119 LMDICVD-----EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+ + D + EHS+EM LP+L +VPI++G + + +
Sbjct: 157 PLKLFGDYDGAYKGEHSLEMQLPFLQMTLGD--FNLVPIMIGILIDNDFDKVAEAIRPLL 214
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
DD + VSSDF H+G + Y+ + + + ++I+ LD + I D D ++Y
Sbjct: 215 DDKT-LLVVSSDFTHYGDAYRYVPFREN---VEENIKILDYGAFEKILNKDFDGLREYRK 270
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ CG PIS+ L +L + + K + L Y+ S SVSYAS
Sbjct: 271 QTGINACGILPISILLKLLPDSA---KGQILHYDTSGHQTNNFVYSVSYAS 318
>gi|53802717|ref|YP_112612.1| hypothetical protein MCA0072 [Methylococcus capsulatus str. Bath]
gi|53756478|gb|AAU90769.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath]
Length = 267
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 140/290 (48%), Gaps = 39/290 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +R + AG +Y + ++L L G+L EA +P + +IAPHAGY YSG AA A
Sbjct: 1 MRSVRAPAVAGLFYPADRRQLHALLQGFLGEAKTAGTPP-KALIAPHAGYVYSGPVAASA 59
Query: 61 FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + P I RV LLGPSH ALS+ + TP+G++PLDLE L A
Sbjct: 60 YALLKPLRHRIRRVVLLGPSHRIAFRGLALSSTQSFATPLGEIPLDLEAQSVLAALPFVH 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD---- 174
++D EHS+E+ LP+L +V E K+VPI+VG +A+ G++ A+ +D
Sbjct: 120 VLDQA-HVLEHSLEVQLPFLQEVLED--FKLVPIVVG------DAVPGQV-AEAIDLVWG 169
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
VSSD H+ +D + + +D+ D IE+ P+ +
Sbjct: 170 GDETLIVVSSDLSHY--------HD------YATARRMDRATSDAIESLHPEE-----IG 210
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+++ CGR P++ L + + K + S +DS V Y +
Sbjct: 211 FEDA-CGRLPVAGLL--IAARRRGLSAKTVDLRNSGDTAGSKDSVVGYGA 257
>gi|323304343|gb|EGA58116.1| YJR008W-like protein [Saccharomyces cerevisiae FostersB]
Length = 210
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 100/209 (47%), Gaps = 39/209 (18%)
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGH-----LIKIVPILVGAVNAENEAMYGRLFAKYVD 174
MD D AEHS+EM LP L + + +K++P++V + + + G + ++Y+
Sbjct: 1 MDHDTDMAEHSLEMQLPMLVETLKWREISLDTVKVIPMMVSHNSVDVDRCIGNILSEYIK 60
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYD-------------------------KKHGV-IHKS 208
DP+N F VSSDFCHWG RF Y Y H V I +S
Sbjct: 61 DPNNLFIVSSDFCHWGRRFQYTGYVGSKEELNDAIXEETEVEMLTARSKLSHHQVPIWQS 120
Query: 209 IEALDKMGMDIIE---TGDP-DAFKKYLLEYDNTICGRHPISVFLHMLGNCS----TKIK 260
IE +D+ M + G+ DA+K+YL NTICG PISV L L + IK
Sbjct: 121 IEIMDRYAMKTLSDTPNGERYDAWKQYLEITGNTICGEKPISVILSALSKIRDAGPSGIK 180
Query: 261 IKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
++ Y QSS + DSSVSY S +
Sbjct: 181 FQWPNYSQSSHVTSIDDSSVSYTSGYVTI 209
>gi|84489659|ref|YP_447891.1| hypothetical protein Msp_0859 [Methanosphaera stadtmanae DSM 3091]
gi|84372978|gb|ABC57248.1| conserved hypothetical protein [Methanosphaera stadtmanae DSM 3091]
Length = 283
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/302 (30%), Positives = 142/302 (47%), Gaps = 38/302 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSP----EVRGVIAPHAGYSYSGRAA 57
IR+ S AG +Y N L E + + G +P +P + + I PHAGY YSG+ A
Sbjct: 2 IRKPSVAGLFYESNKSLLEENIKNIFNKTGQNIPTTPSNTYKTKAAIVPHAGYIYSGKTA 61
Query: 58 AYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
+YA+G+I + I V ++GP+H Y +L+T+ ++TPIGD+ +D E EL+
Sbjct: 62 SYAYGDIARSGICDTVVIIGPNHTGYGDDISLTTSNTWQTPIGDVCVDSEFNNELEKINS 121
Query: 117 FELMDICVDEAEHSMEMHLPYLAKV--FEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
EHS+E+ LP+L + + KIVPI++ + + A +
Sbjct: 122 NITFSPEAHIKEHSIEVELPFLQYISNIQKKSFKIVPIVI----TRQQKNFCVELAHSIY 177
Query: 175 DPS-----NFFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
D S N V S+D H+ + + + D+K I KSIE MDI D+
Sbjct: 178 DVSKKLNRNIMVVASTDLTHYENATSAKNKDEK---ILKSIE-----NMDI------DSL 223
Query: 229 KKYLLEYDNTICGRHP--ISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ +Y+ T+CG P ++ L N + I L Y S + +S V YASA
Sbjct: 224 LNNINKYNITMCGYGPTITAIIYSKLVNANRSI---ILNYSNSGEVYNDYESVVGYASAV 280
Query: 287 AK 288
K
Sbjct: 281 IK 282
>gi|294673326|ref|YP_003573942.1| hypothetical protein PRU_0573 [Prevotella ruminicola 23]
gi|294474323|gb|ADE83712.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 464
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 130/282 (46%), Gaps = 13/282 (4%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
+ ++R A+ A +YT +++ L E+D L++ + +I PHAGY +SG AA A
Sbjct: 3 LGQVRPATQANRFYTGDAEELENEVDSLLKQHNSQVYDNLAALIVPHAGYYFSGNVAASA 62
Query: 61 FGNIDP-TSISRVFLLGPSHHYYTPKCALST-ATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ ID R+FLLGPSHH + +++T A Y TP+G++ +D E EL
Sbjct: 63 YMTIDAKKEYKRIFLLGPSHHEWLNGASVNTEADYYATPLGNVKVDHETAIELTKADSIF 122
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
EH +E+ LP+L + G + IVPI++ + + Y D N
Sbjct: 123 AYHRSAHAQEHCLEVQLPFLQRKL-GEVPPIVPIIISTNDYYKLKRMADVLKPYFTD-EN 180
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SSDF H+ S + D K G K+IE D + ++ K+ L +
Sbjct: 181 LFVISSDFSHYPSYQDAYEVDAKTG---KAIETGDVEEFIAAIEANANSGKRNLA---TS 234
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSV 280
CG I + M+ ++ +K L Y+ S + S V
Sbjct: 235 ACGEFAIVTLMLMI---DSRYDVKHLMYQNSGDIDNRDLSRV 273
>gi|345878384|ref|ZP_08830100.1| putative dioxygenase [endosymbiont of Riftia pachyptila (vent
Ph05)]
gi|344224602|gb|EGV50989.1| putative dioxygenase [endosymbiont of Riftia pachyptila (vent
Ph05)]
Length = 271
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 140/291 (48%), Gaps = 33/291 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +R A+ AG +Y D L +D +L +AG + P + ++APHAG+ YSG AA
Sbjct: 8 MGGVRTAAVAGLFYPDAPAELRAMVDDYLGQAGEGRPP--KAIVAPHAGFVYSGPVAASV 65
Query: 61 FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + P + ISRV LLGPSH ALS+A ++TP+G++PL+ +V +EL +
Sbjct: 66 YHRLTPVADRISRVVLLGPSHRVAFRGLALSSAQSFQTPLGEIPLE-QVSDELLRLPQLS 124
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ D+ + EHS+E+ LP+L +V +++PI+VG + A + + D
Sbjct: 125 VNDMAHAQ-EHSLEVQLPFLQRVLGE--FRLLPIVVGDASGAEVA---EVLEQVWDGEET 178
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+SSD H+ +Y+ + + +D+ D I P+ L Y++
Sbjct: 179 LIVISSDLSHY---HDYL-----------TAQQMDRTTSDAILALQPER-----LHYEDA 219
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
CGR P+ L + ++ LR S +D V Y + A V
Sbjct: 220 -CGRIPLQGLLEVAQRKGLDGELVDLR--NSGDTAGLKDRVVGYGAYAFAV 267
>gi|320160154|ref|YP_004173378.1| hypothetical protein ANT_07440 [Anaerolinea thermophila UNI-1]
gi|319994007|dbj|BAJ62778.1| hypothetical protein ANT_07440 [Anaerolinea thermophila UNI-1]
Length = 281
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 117/223 (52%), Gaps = 9/223 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R + AG+WY+ +++RL +++ +L EA +P + EV V+APHAG+ YSGR A +AF
Sbjct: 7 VRPSPIAGTWYSGDAQRLRRQVETYLSEAHIPSLNGEVVAVVAPHAGHRYSGRTAGHAFA 66
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD---LEVIEELKATGKFEL 119
+ S V +L P H + + + Y TP+G +P+D L ++E+ A L
Sbjct: 67 AVRGQSPEVVAVLSPFHAMHPAELLTTAHQAYATPLGLIPVDHEALLILEQDLAEQHLSL 126
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ D +EHS+E+ LP+L +G +++P++V + + + G+L A+ + D
Sbjct: 127 TRVAYD-SEHSLEIELPFLQVALQGDF-RLLPLMVRSHSPQVLQTLGKLLAQVLKDRRAL 184
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIET 222
S+D H+ D + + + +EA D + M ET
Sbjct: 185 LVASTDLSHFYPEKIARQLDAE---MLRRLEAFDPLAMLEAET 224
>gi|313680677|ref|YP_004058416.1| hypothetical protein [Oceanithermus profundus DSM 14977]
gi|313153392|gb|ADR37243.1| protein of unknown function DUF52 [Oceanithermus profundus DSM
14977]
Length = 260
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 128/283 (45%), Gaps = 30/283 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AGS+Y +LA +D L A P + + V+APHAGY YSG AAY+F
Sbjct: 2 VRKPAVAGSFYPAEPMQLARMVDELLAMATRPPAHAPKAVVAPHAGYVYSGPVAAYSFRA 61
Query: 64 IDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
++P + VFL+GP+H+ + T T ++TP+G +P+D E + L
Sbjct: 62 LEPLAGKTPTVFLMGPAHYLAFEGVSTGTYTYWETPLGQVPVDTERVGALLERSALFTTA 121
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
+ EHS+E+ LP+L V G K+VP+L G V+ A+ G L A V P +
Sbjct: 122 DEPHQPEHSLEVELPFLQAVLGG--FKLVPLLFGLVD--PLAVAGHLEA--VLRPGDVLV 175
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
VS+D H+ D ++ ALD G+ E CG
Sbjct: 176 VSTDLSHYHPDAEARKLD---AATLETALALDARGLSHREA-----------------CG 215
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
RHP + + S + ++ L Y S R V YA+
Sbjct: 216 RHPWAALTALADRRSWRPEL--LAYATSGDTSGDRSRVVGYAA 256
>gi|119620873|gb|EAX00468.1| chromosome 2 open reading frame 4, isoform CRA_b [Homo sapiens]
Length = 201
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 55/103 (53%), Positives = 66/103 (64%), Gaps = 3/103 (2%)
Query: 190 GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL 249
G RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L
Sbjct: 98 GQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLL 157
Query: 250 HM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 158 NAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 200
>gi|154151753|ref|YP_001405371.1| hypothetical protein Mboo_2214 [Methanoregula boonei 6A8]
gi|154000305|gb|ABS56728.1| protein of unknown function DUF52 [Methanoregula boonei 6A8]
Length = 262
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 89/290 (30%), Positives = 138/290 (47%), Gaps = 37/290 (12%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K R + G +Y +S L + L+G+ P P + G+++PHAGY YSG+ AAYAF
Sbjct: 2 KTRSCAVCGMFYPRDSAHLTQILEGFFSRTA-PGEPAL-GIVSPHAGYIYSGQIAAYAFS 59
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
IDP ++GPSHH Y ++A ++TP+G + +D E I+ L DI
Sbjct: 60 RIDPGFSGTFVVIGPSHHGYLTS---ASAIPWETPLGLVEIDAEFIDAL---------DI 107
Query: 123 CVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR--LFAKYVDDP 176
VDE EHS+E+ LP++ F ++VPI++G + + A + A+ +
Sbjct: 108 PVDEPSHEEEHSIEVQLPFIRHRFP--RARVVPIMMGEQDPAHAAAVAEKIVAAQRLTKK 165
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
SSDF H+ K G ++ +IE L + +DI E F + + E
Sbjct: 166 EIRVVASSDFSHY-----VPEAQAKTGDLY-AIEPLAR--LDIKE------FYRRIREKR 211
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
T CG PIS + + C + + +RY S R+ V YA+ A
Sbjct: 212 VTACGYGPISAMVAVC-QCMGAGRGELIRYATSGDVTNDRNQVVGYAAIA 260
>gi|302348701|ref|YP_003816339.1| hypothetical protein ASAC_0902 [Acidilobus saccharovorans 345-15]
gi|302329113|gb|ADL19308.1| hypothetical protein ASAC_0902 [Acidilobus saccharovorans 345-15]
Length = 288
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 142/301 (47%), Gaps = 28/301 (9%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAG---LPKSPEVR-----GVIAPHAGYS 51
M++IR+ +HAGS+Y + L + ++ +L G LP++P R G + PHAGY
Sbjct: 1 MQEIRKPAHAGSFYPADRDDLVKSIESSFLHRLGPGKLPQAPPSRNRTSVGYLVPHAGYM 60
Query: 52 YSGRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
YSG AA+++ N+ +V ++ GP+H +L ++TP+G + +D EV +
Sbjct: 61 YSGPVAAHSYYNMAQEGAPKVVVVAGPNHTGLGEAASLWQGDAWETPLGTVEVDREVEKL 120
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
+ + K+ D EHS+E+ +P+L +F+ + K+VPI++ N E
Sbjct: 121 IISNTKYFTFDNEAHLYEHSVEVQVPFLQYIFDSNF-KLVPIVIKLQNPEVSKDLADALV 179
Query: 171 KYVDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
K +++ F SSD HY+ + K EAL +E+ DP+
Sbjct: 180 KIINENGVDLLFVASSDMN---------HYEPHDVTVSKDNEAL-----RYVESLDPEGL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+ L D T+CG P L+M G + L++ S ++D V YASA
Sbjct: 226 FRVLEADDITMCGPGPAMTLLYM-GRSLGATRAVILKHATSGDVTGEKDWVVGYASAVVP 284
Query: 289 V 289
+
Sbjct: 285 L 285
>gi|288940496|ref|YP_003442736.1| hypothetical protein Alvin_0757 [Allochromatium vinosum DSM 180]
gi|288895868|gb|ADC61704.1| protein of unknown function DUF52 [Allochromatium vinosum DSM 180]
Length = 267
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 127/293 (43%), Gaps = 35/293 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLRE-----AGLPKSPEVRGVIAPHAGYSYSGR 55
M IR+ + A +Y + L LD L + A P + + +I PHAGY YSG
Sbjct: 1 MPTIRQPAVADRFYPGDPVELGRMLDALLADSPASAASRPGTAPPKALIVPHAGYIYSGP 60
Query: 56 AAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA A+ + P I RV LLGPSH A ++A V+ TP+G +P+D +E
Sbjct: 61 IAATAYATLAPVHDQIHRVVLLGPSHRLPFMGLAATSADVFATPLGPVPIDRAAVERTLT 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+ L+D EHS+E+ LP+L +V + +VP++VG E+ A L
Sbjct: 121 LPQVRLLD-AAHAQEHSLEVQLPFLQRVLDD--FSLVPLVVGEAAPESVAEVLDLLWG-- 175
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
P VSSD H Y+ Y ++ + +D + IE P+A
Sbjct: 176 -GPETLILVSSDLTH------YLDY--------RTAQRIDAATSEAIEALRPEAIGS--- 217
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ CGR P++ L + ++ + L S RD V Y + A
Sbjct: 218 ---DQACGRAPLNGLLTL--ARRRGLQAETLDRRNSGDTAGSRDQVVGYGAYA 265
>gi|349605881|gb|AEQ00969.1| Protein MEMO1-like protein, partial [Equus caballus]
Length = 103
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 54/101 (53%), Positives = 65/101 (64%), Gaps = 3/101 (2%)
Query: 192 RFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM 251
RF Y +YD+ G I++SIE LDKMGM IIE DP +F YL +Y NTICGRHPI V L+
Sbjct: 2 RFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNA 61
Query: 252 ---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 62 ITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 102
>gi|344341515|ref|ZP_08772434.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thiocapsa marina 5811]
gi|343798635|gb|EGV16590.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thiocapsa marina 5811]
Length = 261
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 85/287 (29%), Positives = 130/287 (45%), Gaps = 33/287 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M RR + AG +Y + + L +DG+L EA +P + +IAPHAGY YSG AA A
Sbjct: 1 MTLSRRPAVAGLFYPGDPRELKHMVDGYLAEAPARDTPP-KALIAPHAGYIYSGSIAASA 59
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + P +I+RV LLGP+H A S+A + TP+G + LD I+ A +
Sbjct: 60 YATLLPAHAAITRVVLLGPAHRVPVRGLATSSAECFDTPLGSVDLDRAAIDSALALPQVR 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN-AENEAMYGRLFAKYVDDPS 177
+MD + EHS+E+ LP+L + + +VP +VG + AE + L+ P
Sbjct: 120 IMDEAHAQ-EHSLEVQLPFLQEALD--RFTLVPFVVGDTSPAEVAEVLDLLWGG----PE 172
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
VSSD H+ ++Y + +IEAL+ + Y+
Sbjct: 173 TLIVVSSDLSHY---YDYATAKRLDTATSTAIEALEPQD----------------IHYEQ 213
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CGR P++ L + + LR S RD V Y +
Sbjct: 214 A-CGRIPVNGLLELARRRGLHARTLDLR--NSGDTAGPRDQVVGYGA 257
>gi|386826852|ref|ZP_10113959.1| putative dioxygenase [Beggiatoa alba B18LD]
gi|386427736|gb|EIJ41564.1| putative dioxygenase [Beggiatoa alba B18LD]
Length = 264
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 132/286 (46%), Gaps = 36/286 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGR--AAAYAF 61
IR + AG +Y N L + + +L+ A P + + + +I PHAGY YSG A AYA+
Sbjct: 6 IRPPAVAGMFYPANPAELQQMVKQFLQVATPPSAIQPKALIVPHAGYLYSGAIAATAYAY 65
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+I RV LLGPSH Y ALSTA Y +P+G++ LD + E L +D
Sbjct: 66 WLSLARTIKRVVLLGPSHRYPLRGLALSTAQAYASPLGNVLLDHQTDETLLKLPFVHAID 125
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
++EHS+E+HLP+L ++F+ ++P++VG + + G++
Sbjct: 126 -QAHQSEHSLEVHLPFLQQLFQQ--FTLIPLVVGDASPQQ---VGQVLELLWGGEETVIV 179
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSD H Y Y + ++ +A++++ + L Y++ CG
Sbjct: 180 ISSDLSH------YHPYATAQAMDARTSQAIEQLNFNA-------------LHYEDA-CG 219
Query: 242 RHPISVFLHMLGNCSTK---IKIKFLRYEQSSQCKTKRDSSVSYAS 284
R+PI+ L C K + I + S D V Y +
Sbjct: 220 RNPIAGLL-----CVAKQRHLTITTVDLRNSGDTAGSHDKVVGYGA 260
>gi|336398113|ref|ZP_08578913.1| AMMECR1 domain protein [Prevotella multisaccharivorax DSM 17128]
gi|336067849|gb|EGN56483.1| AMMECR1 domain protein [Prevotella multisaccharivorax DSM 17128]
Length = 509
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 123/269 (45%), Gaps = 35/269 (13%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELD-------GWLREAGL--PKSPEVRGVIAPHAGYSY 52
+ +R+ + AG +Y + + L +L+ G L +G+ S V+ VI PHAGY +
Sbjct: 33 QNVRQMAVAGQFYAASRQDLQSDLEKCYAEGRGLLARSGVVVGDSARVQAVIVPHAGYVF 92
Query: 53 SGRAAAYAFGNI-DPTSISRVFLLGPSHHY-YTPKCALSTATVYKTPIGDLPLDLEVIEE 110
SG AA AF +I S +FLLGPSHH + S Y+TP+GD+ +D ++ +E
Sbjct: 93 SGSVAASAFASIPKDASYQHIFLLGPSHHVAFDGVSVCSVFDAYRTPLGDVAVDTKLCKE 152
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
L + EH +E+ LP+L L IVP+++G + +
Sbjct: 153 LMRRSPVFTYVSSAHDREHCLEVQLPFLQYRL-ASLPPIVPMIIGTQDFATLKKMAEVLM 211
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
Y P N F VSSDF H+ S + + D+ G + I TG D F +
Sbjct: 212 PYF-TPDNLFVVSSDFSHYPSYSDALTVDRNSG--------------EAITTGRLDKFVE 256
Query: 231 YLLE-----YDN---TICGRHPISVFLHM 251
L E DN + CG+ I+V L M
Sbjct: 257 ALRENECRGVDNLYTSACGQSAIAVLLMM 285
>gi|194333116|ref|YP_002014976.1| hypothetical protein Paes_0272 [Prosthecochloris aestuarii DSM 271]
gi|194310934|gb|ACF45329.1| protein of unknown function DUF52 [Prosthecochloris aestuarii DSM
271]
Length = 287
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 139/291 (47%), Gaps = 17/291 (5%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE---VRGVIAPHAGYSYSGRAAA 58
E +R + A S+Y + + L L+ L E+ + E +R ++ PHAGY++SGRA+A
Sbjct: 3 ENVRYPAVAESFYPSDREELDTFLESILSESTATNNSEKASIRALLVPHAGYAFSGRASA 62
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
A+ + VF+LG +H Y AL T ++++P+G +P++++ E+ +
Sbjct: 63 EAYSRLAGNQYRTVFILGNAHAYRFNGIALDTHHIWQSPLGRIPINMDAAEQFRTAAPRL 122
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+ +DI ++H +E+ LP+L K + I+PIL G NA++ ++ + P
Sbjct: 123 IDYLDIA-HHSDHVLEVQLPFLQKTLKTGF-SILPILFGE-NAKDISLKTARILSDILQP 179
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
+ SSD H+ S YD + + K+++A+ + M +E + K+ + D
Sbjct: 180 DDLLIASSDLSHYPS------YDDANAIDRKTLDAIVNIDMKGLERHVRNTMKQNVAHED 233
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTK-RDSSVSYASAA 286
CG + L + + L Y S + + RD+ V Y S A
Sbjct: 234 ALFCGPDGLKTLLRIA--TQRGWTAEELAYCNSGDAEWRDRDAVVGYGSVA 282
>gi|144900060|emb|CAM76924.1| protein containing DUF52 [Magnetospirillum gryphiswaldense MSR-1]
Length = 452
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 129/297 (43%), Gaps = 47/297 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDG-WLR-----EAGLPKSPEVRGVIAPHAGYSYSG 54
M IR ++ AG +Y L +LD W + E G+PK+ +IAPHAGY YSG
Sbjct: 1 MTSIRPSAVAGMFYPAEPAELMRQLDACWAQARPHPELGVPKA-----IIAPHAGYVYSG 55
Query: 55 RAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
AA A+ + +ISRV LLGPSH A+STA + +P+G + LD + +E K
Sbjct: 56 AVAASAYDLLKTARGTISRVVLLGPSHRVGFRGMAVSTADAWASPLGAVRLDRDGVERAK 115
Query: 113 A---TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
TG+ E EHS+E+HLP+L ++ +VP++VG E A L
Sbjct: 116 TVPLTGELE----AAHAQEHSLEVHLPFLQRMLGD--FTLVPVVVGDAAPEQVAA---LL 166
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
P +S+D H+ +Y +IEALD+ +D
Sbjct: 167 DALWGGPETLIVISTDLSHY---LDYAACQDMDSRTAAAIEALDEGALD----------- 212
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ CGR P+ L + + IK + S +D V Y + A
Sbjct: 213 ------RDQACGRIPVGGLLRT--AKARGMTIKTIDRRNSGDTAGPKDRVVGYGAWA 261
>gi|431932323|ref|YP_007245369.1| dioxygenase [Thioflavicoccus mobilis 8321]
gi|431830626|gb|AGA91739.1| putative dioxygenase [Thioflavicoccus mobilis 8321]
Length = 261
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 125/286 (43%), Gaps = 31/286 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M R + AG +Y + L +D +L +A + P + +IAPHAGY YSG AA A
Sbjct: 1 MSLARPPAVAGRFYPSDPPALKRMVDDYLAKAKV-DGPPPKALIAPHAGYIYSGLIAATA 59
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + +IS+V LLGP+H + A S+A ++TP+G +PLD I+ +
Sbjct: 60 YAPLVAARETISQVVLLGPAHRVFVRGLAASSAERFETPLGTVPLDRAAIDRTLTLPQVS 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ D E EHS+E+ LP+L V + K+VP +VG + E A L P
Sbjct: 120 ISDAAHAE-EHSLEVQLPFLQVVLDE--FKLVPFVVGDASPEEVAEVLELLWS---GPET 173
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
VSSD H+ +Y + K+IE L + YD
Sbjct: 174 LIVVSSDLSHY---HDYATAHRLDTATSKAIEGLHPEDIG----------------YDQA 214
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CGR P++ L + + K LR S RD V Y +
Sbjct: 215 -CGRQPVNGLLELARRRGLRAKTLDLR--NSGDTSGPRDQVVGYGA 257
>gi|386811290|ref|ZP_10098516.1| conserved hypothetical protein [planctomycete KSU-1]
gi|386406014|dbj|GAB61397.1| conserved hypothetical protein [planctomycete KSU-1]
Length = 529
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 143/304 (47%), Gaps = 33/304 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE---VRGVIAPHAGYSYSGRAA 57
++++ AG +Y++N L +++ + + +P+ P+ VI+PHAGY YSG+ A
Sbjct: 25 LQEVWEPQVAGRFYSENENVLKGQINTFFKN--IPEQPDKGKPLAVISPHAGYQYSGQVA 82
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPK---CALSTATVYKTPIGDLPLDLEVIEEL--- 111
AY + +I +RV LL PSH + ++ +KTP+G + +D E L
Sbjct: 83 AYGYKSIKGYEFTRVILLSPSHFSSGKRFRGVSILKTKNFKTPLGLITVDQEACNYLLTF 142
Query: 112 ----------KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN 161
K T F + EHS+E LP+L ++ G+ +VPI+VG + ++
Sbjct: 143 AKGSSPTAPQKTTTLFGSYEGAYS-GEHSLETQLPFL-QLSLGNFT-LVPIMVGILVDDD 199
Query: 162 EAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE 221
++D + VSSDF H+G + Y+ + I K+I+ALD + I
Sbjct: 200 FDQIAHAIRPLLNDKT-LVVVSSDFTHYGEGYGYVPFKTN---IEKNIKALDYGAFEKIL 255
Query: 222 TGDPDAFKKYLLEYDNTICGRHPISVFLHMLG-NCSTKIKIKFLRYEQSSQCKTKRDSSV 280
+ D + Y E CG PI++ L +L N +I L Y+ S + SV
Sbjct: 256 SKDFHGLRTYRKETGINTCGIIPIALLLKLLPENAQGEI----LNYDTSGRQSNNFSFSV 311
Query: 281 SYAS 284
SYAS
Sbjct: 312 SYAS 315
>gi|410696082|gb|AFV75150.1| putative dioxygenase [Thermus oshimai JL-2]
Length = 280
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 128/291 (43%), Gaps = 36/291 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME +RR + AG +Y + L E++G L A P P VRG +APHAGY YSGR AA A
Sbjct: 1 MELVRREAAAGVFYPREGETLRREVEGLLEAARTPPLPGVRGALAPHAGYLYSGRVAAEA 60
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
F + RV LLGPSH A ++TP+G+ P+DLE L A G F
Sbjct: 61 FRALSAWKGRARRVLLLGPSHFVPFLGVAFHPYRAWRTPLGETPVDLEGGRRLLARGHPF 120
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
++ E EHS+E+ LP+L I+P+L G V+ EA+ L
Sbjct: 121 RALEEPFRE-EHSLEVLLPFLQVALP--QTPILPLLFGEVDPLGVAEALEEEL------G 171
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P + SSD H+ H A + G +T + L
Sbjct: 172 PRDLVVASSDLSHY----------------HPEPVARARDG----KTLEKALALDPLALA 211
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
++ CGR P + + + ++ L Y S++ K +R V YA+ A
Sbjct: 212 ESEACGRLPWASLTALARGLGWRPRL--LAYATSAEAKGERARVVGYAAVA 260
>gi|153875832|ref|ZP_02003453.1| protein of unknown function DUF52 [Beggiatoa sp. PS]
gi|152067703|gb|EDN66546.1| protein of unknown function DUF52 [Beggiatoa sp. PS]
Length = 195
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 96/188 (51%), Gaps = 9/188 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG +Y DN+ L+ +D +L++ + P + +IAPHAGY YSG AA +
Sbjct: 5 VRTPAVAGMFYPDNAAELSNMVDQFLQDVKV-AGPVPKAIIAPHAGYIYSGPIAASVYAR 63
Query: 64 IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ P I RV LLGPSH A ST + TP+G +P+D I+ + + + +++
Sbjct: 64 LTPVHDQIQRVVLLGPSHQVPLKGLAASTMLQFATPLGKIPVDKPAIDNILSLPQVSVLE 123
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EHS+E+ LP+L KV ++P++VG + A + K P
Sbjct: 124 QA-HVNEHSLEVQLPFLQKVLGN--FSLIPLVVGEATPKQVA---EVLEKLWGGPETLIV 177
Query: 182 VSSDFCHW 189
VSSD H+
Sbjct: 178 VSSDLSHY 185
>gi|313227297|emb|CBY22443.1| unnamed protein product [Oikopleura dioica]
Length = 113
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 73/111 (65%), Gaps = 2/111 (1%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RRASHAGSWY+ + + L +L+ W+ +A + + VI PHAGY+YS AA++F +
Sbjct: 4 RRASHAGSWYSGDPRELENQLENWISKAK--FEKKAKAVIVPHAGYAYSAPTAAWSFLQL 61
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
D + ++F++GPSHH Y P CAL +TP+G+L +D +++ EL ATG
Sbjct: 62 DAQTTKKIFVIGPSHHVYLPNCALPVVKECETPLGNLRIDKDIVTELHATG 112
>gi|429217048|ref|YP_007175038.1| dioxygenase [Caldisphaera lagunensis DSM 15908]
gi|429133577|gb|AFZ70589.1| putative dioxygenase [Caldisphaera lagunensis DSM 15908]
Length = 279
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 142/293 (48%), Gaps = 36/293 (12%)
Query: 8 SHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+HAGS+Y L + ++ G L A + E G + PHAGY YSG AA
Sbjct: 3 AHAGSFYPSKKDDLIKSIESSYLHKLGPGMLPNAERERKKESIGYMVPHAGYIYSGPIAA 62
Query: 59 YAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+++ NI +VF++ GP+H ++ +++TP+G++ +D E+ + L + K+
Sbjct: 63 HSYFNISKEGKPKVFIIAGPNHTGLGENASVWKEGIWETPLGEVEVDSEISKLLVSYSKY 122
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-----NEAMYGRLFAKY 172
D EHS+E+ +P+L +F+ IKIVPI++ N E A+Y +
Sbjct: 123 FTFDEEAHIYEHSVEVQIPFLQYIFKD--IKIVPIVIKLQNYEVSLDLANALYRIITENG 180
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
VD + SSD H Y YD + +K D++ + IE+ D + K L
Sbjct: 181 VD---LIYIASSDMNH------YEPYD----ITYKK----DEIALKKIESLDTEGLFKVL 223
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
E + T+CG P+ +H+ ++K +I L++ S ++D V YASA
Sbjct: 224 DENNITMCGPGPVGALIHLSKKLNSKAEI--LKHATSGDITGEKDWVVGYASA 274
>gi|254458451|ref|ZP_05071876.1| ammecr1 domain protein [Sulfurimonas gotlandica GD1]
gi|373866308|ref|ZP_09602706.1| protein containing MEMO/AMMECR1 domain [Sulfurimonas gotlandica
GD1]
gi|207084759|gb|EDZ62046.1| ammecr1 domain protein [Sulfurimonas gotlandica GD1]
gi|372468409|gb|EHP28613.1| protein containing MEMO/AMMECR1 domain [Sulfurimonas gotlandica
GD1]
Length = 485
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 130/278 (46%), Gaps = 27/278 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AGS+YT+N L E++ LR+A + +I PHAGY +S A+ A+
Sbjct: 23 LRPTAVAGSFYTNNKPELQREVNTLLRDAKTFPKQNINALIVPHAGYVFSANVASTAYKT 82
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
++ +FL+G SH+ ++ YKTP+G++ ++ + +L K+ +
Sbjct: 83 LNK-KYKNIFLIGSSHYTSFDGASVYNIGDYKTPLGNVQVNRSIASKLIKDSKYFVFRAE 141
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
E EH++E+ LP+L ++ G +KIVPI++ N + + + Y +D N F +S
Sbjct: 142 AHEKEHTLEVQLPFLQTIY-GDDLKIVPIIIATSNLQTIISISKTLSPYFND-DNLFIIS 199
Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN------ 237
+D H+ + + +DK +D + P F + L + ++
Sbjct: 200 TDLSHYPT--------------FRDANIVDKNILDSLVKNSPKKFVEALAKNEDAKIEAL 245
Query: 238 --TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCK 273
+ CG + L+M + + K + L Y+ S+ K
Sbjct: 246 QTSACGWTSLLTLLYMTQDKAYKYET--LEYKNSAHSK 281
>gi|357406969|ref|YP_004918893.1| hypothetical protein MEALZ_3652 [Methylomicrobium alcaliphilum 20Z]
gi|351719634|emb|CCE25310.1| conserved protein of unknown function [Methylomicrobium
alcaliphilum 20Z]
Length = 259
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 128/286 (44%), Gaps = 39/286 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR----EAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
RR + AG +Y D+ +L +D +L EA +PK+ +IAPHAGY YSG AA A
Sbjct: 3 RRPAVAGLFYPDDPTKLRTMVDHYLNDAETEAKVPKA-----IIAPHAGYIYSGPVAASA 57
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + SI+RV L+GPSH A++ Y TP+G++ +D +E L E
Sbjct: 58 YARLKQARDSITRVVLIGPSHRVAFRGLAVTRTDYYTTPLGNVVIDRTAVEALIKLPFVE 117
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
++ EHS+E+HLP+L + K+VPI+ G +A+ R P
Sbjct: 118 YIEQA-HTHEHSLEVHLPFLQETLAD--FKLVPIVAGDASADQ---VSRALDLVWGGPET 171
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+SSD H+ + + + LD+ +IE +Y
Sbjct: 172 LIVISSDLSHYHD--------------YSTAKKLDQQTSTLIEQ------MQYEKLSSEA 211
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG+ P+S L +L S +IK LR S +D V Y +
Sbjct: 212 ACGKVPVSGLLKLLRQKSMRIKAVDLR--NSGDTAGDKDRVVGYGA 255
>gi|340781796|ref|YP_004748403.1| hypothetical protein Atc_1054 [Acidithiobacillus caldus SM-1]
gi|340555949|gb|AEK57703.1| protein containing DUF52 [Acidithiobacillus caldus SM-1]
Length = 266
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 128/287 (44%), Gaps = 32/287 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IR + AG +Y L +LD ++ + K + +IAPHAGY YSG A A
Sbjct: 1 MSHIRPPAVAGRFYPARVTELQHQLDAFIPSSVASKQLHPKVLIAPHAGYVYSGPVAGKA 60
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + ISRV LLGP+H+ A+ A + TP+G +P++ + + + +
Sbjct: 61 YAQLQGLRGQISRVVLLGPAHYVRLRGLAIPEADAFATPLGTVPVNRDALNAISDLPQVI 120
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAKYVDDPS 177
D+ EHS+E+HLP+L ++ +VP+LVG A E + RL+ P
Sbjct: 121 QSDVP-HNPEHSLEVHLPFLQRMLGD--FSVVPLLVGDATYTEVAEVLERLWG----GPE 173
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
+S+D H Y+ Y + + LD +IIE P+A + Y+
Sbjct: 174 TLIVISTDLSH------YLDY--------ATAQRLDARTAEIIEALQPEA-----IHYEQ 214
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CGR P+ L + +I+ LR S RD V Y +
Sbjct: 215 A-CGRIPLGGLLELARKRHMRIERVDLR--NSGDTTGPRDQVVGYGA 258
>gi|119946258|ref|YP_943938.1| hypothetical protein Ping_2620 [Psychromonas ingrahamii 37]
gi|119864862|gb|ABM04339.1| hypothetical protein DUF52 [Psychromonas ingrahamii 37]
Length = 282
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 103/192 (53%), Gaps = 14/192 (7%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K R A+ AGS+Y + ++ +EL +L A + + + +I PHAGY YSG A YA+
Sbjct: 24 KYRTAAVAGSFYPATADQIDQELSVFLN-APSESTTQAKALIVPHAGYCYSGAVAGYAYS 82
Query: 63 NIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ + I+RV LLGPSH CA+S+ + TPIG +P+D +L E +
Sbjct: 83 YLKNIAHNINRVILLGPSHRVALQGCAISSCDFFTTPIGPIPVDKSAYTQLLD----EKL 138
Query: 121 DICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
D+A EHS+E+ LP+L + + + +VPI+VG + ++ + + V++P
Sbjct: 139 VTINDQAHLLEHSLEVQLPFLQRSLQNFV--LVPIVVGQCSVQHVSQILEILK--VNEPG 194
Query: 178 NFFSVSSDFCHW 189
VSSD H+
Sbjct: 195 TLVVVSSDLSHY 206
>gi|345873097|ref|ZP_08825017.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thiorhodococcus drewsii AZ1]
gi|343917581|gb|EGV28376.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thiorhodococcus drewsii AZ1]
Length = 261
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 128/293 (43%), Gaps = 41/293 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +IR+ + A +Y N++ L LD +L P + +IAPHAGY YSG AA A
Sbjct: 1 MSQIRQPAVANRFYPGNAQELGVMLDLYLSSDRTGAQPP-KALIAPHAGYVYSGPIAASA 59
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + P +I RV LLGPSH A S A+ + TP+G + LD IE + +
Sbjct: 60 YATLAPVRDAIRRVVLLGPSHRLPFQGLAASMASDFATPLGLVRLDRAAIESALKLPQVQ 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYGRLFAKYV 173
++ EHS+E+ LP+L + K+VP +VG E+ +A++G
Sbjct: 120 PLE-SAHTFEHSLEVQLPFLQRALGD--FKLVPFVVGEATPESVAEVLDALWG------- 169
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
P +SSD H Y+ Y+ + LD+ IET DP+
Sbjct: 170 -GPETLILISSDLSH------YLDYETAN--------RLDRQTSTAIETLDPNGIGP--- 211
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ CG +P+ L + + + LR S RD V Y + A
Sbjct: 212 ---DQACGHYPLRGLLTLAKRRGLQAETLDLR--NSGDTAGSRDQVVGYGAYA 259
>gi|307594859|ref|YP_003901176.1| hypothetical protein Vdis_0732 [Vulcanisaeta distributa DSM 14429]
gi|307550060|gb|ADN50125.1| protein of unknown function DUF52 [Vulcanisaeta distributa DSM
14429]
Length = 286
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 139/299 (46%), Gaps = 29/299 (9%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPE-----VRGVIAPHAGY 50
M ++R+ + AG++Y + +L ++++ W L L K P V VI PHAGY
Sbjct: 1 MVRVRKPAVAGAFYEADRDKLIKQIE-WSISHPLGPGKLVKQPREGFKAVPIVIVPHAGY 59
Query: 51 SYSGRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
YSG AA +F I R F+L GP+H+ A+ ++TP+G L +D E+ +
Sbjct: 60 IYSGPVAAMSFAEIYRFHSPRTFILVGPNHYGIGAPVAIYPEGAWETPLGTLEIDSEIAK 119
Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
EL + K+ D+ EHS+E+ LP++ +F G+ +KIVPI++ E G
Sbjct: 120 ELMSRVKYLEPDVYAFAQEHSLEVQLPFIQYIF-GNNVKIVPIIIWRQTKEVARDLGNAI 178
Query: 170 AKYVD--DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
A+ + +P + V+S ++ HY+ K ++A+D + D D+
Sbjct: 179 AEVISSHEPGSIVYVASS--------DWNHYEPHEITTEKDMKAIDP-----VLRLDEDS 225
Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
F + YD + CG I+ + K + LR+ S +V YAS A
Sbjct: 226 FFDVIERYDVSACGYGAIATAIVTAKRLGVK-NVVLLRHATSGDTSGYTLETVGYASIA 283
>gi|383764689|ref|YP_005443671.1| hypothetical protein CLDAP_37340 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384957|dbj|BAM01774.1| hypothetical protein CLDAP_37340 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 270
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/288 (27%), Positives = 127/288 (44%), Gaps = 30/288 (10%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K+R+ + AG +Y + L + ++ +A +P VR VIAPHAGY SG A AF
Sbjct: 5 KVRKPAVAGRFYPASPATLHNAVRQYIEQASIPALQGVRAVIAPHAGYVCSGAVAGAAFC 64
Query: 63 NIDPTSISR--VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ S S ++LLGP+H+ A+S+A + TP+G +P+ ++ + L T +
Sbjct: 65 ALQTASASEPVIYLLGPAHYKIVRGVAVSSADAFATPLGPVPVAIDAVTRLATTNHLAHI 124
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D EH +E+ LP+L +F G + IVP+L+ A+ + L + + P
Sbjct: 125 DDWAHAPEHCLEVELPFLQYIF-GERLHIVPMLLDE-EADVTGVAEFLAQEVRERPDALI 182
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEAL--DKMGMDIIETGDPDAFKKYLLEYDNT 238
VSSD H+ H EA+ D+ ++ I GD
Sbjct: 183 VVSSDLSHY----------------HPYAEAIRRDRSLLEAILAGDLARVAA------GE 220
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR PI +H+ + + L Y S + V YA+ A
Sbjct: 221 ACGRLPILCLMHVAQQLNW--QAHRLAYANSGDTCGPKHEVVGYAALA 266
>gi|335436406|ref|ZP_08559201.1| hypothetical protein HLRTI_04897 [Halorhabdus tiamatea SARL4B]
gi|334897718|gb|EGM35847.1| hypothetical protein HLRTI_04897 [Halorhabdus tiamatea SARL4B]
Length = 283
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 141/301 (46%), Gaps = 38/301 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
M+ R+ + AG +Y N L E+++G A P S PE+ G++APHAG +
Sbjct: 1 MDSARQPAVAGRFYVRNEAPLREQVEGMFTHAVGPGSVPTATAGSPEIAGLVAPHAGLPF 60
Query: 53 SGRAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
SG AA+++ + + V +LGP+H A+ ++TP+G + +D ++ E++
Sbjct: 61 SGPVAAHSYAALAESGTPETVVILGPNHTGVGAAVAVPGDDEWRTPLGSVQIDADLREQI 120
Query: 112 KATGKFELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
+ D VD+ +EH+ E+ LP+L +++ + ++PI + +A+ G
Sbjct: 121 -----VDSTDATVDDRAHASEHAAEVQLPFLQYLYDD--LAVLPISLRRQDADVSRQLGE 173
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
A + D+ + + SSDF H+ + H V +IE D++ +D IE DP
Sbjct: 174 ALAAHTDEETVVIA-SSDFTHY----------EPHDV---AIE-RDELALDRIEANDPAG 218
Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
+ + ++CG I+ L G S ++ + S RD V YAS A
Sbjct: 219 LIETVEREGLSVCGYGAIAAMLWANGEDS---QVDVFAHATSGDTAGSRDDVVGYASVAV 275
Query: 288 K 288
+
Sbjct: 276 R 276
>gi|383783463|ref|YP_005468029.1| hypothetical protein LFE_0171 [Leptospirillum ferrooxidans C2-3]
gi|383082372|dbj|BAM05899.1| hypothetical protein LFE_0171 [Leptospirillum ferrooxidans C2-3]
Length = 269
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 140/294 (47%), Gaps = 37/294 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRG----VIAPHAGYSYSG 54
M +R + AG +Y L + ++G L EA +P + + RG +I PHAG YSG
Sbjct: 1 MTIVRSPAVAGVFYPKEPDILLDMVEGMLEEALGKIPPAIQARGTPRALIVPHAGLIYSG 60
Query: 55 RAAAYAFGNIDPTSIS-RVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA A+ ++P+ + R+F+LGPSH + A STAT + TP+G++P+D ++++++
Sbjct: 61 PIAASAYCLMNPSFLPKRIFVLGPSHRIFFKGIATSTATSFGTPLGEIPVDRTLVDQMET 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+ D + EHS+E+ LP+L KV + + I+P++ G +A + A R+ +
Sbjct: 121 LPGIDTYDPAHTQ-EHSIEIQLPFL-KV-QMDQVPIIPLIAGESSASSVA---RVIKTAL 174
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD-KMGMDIIETGDPDAFKKYL 232
P +S+D H Y+ D+ + LD + DII +Y
Sbjct: 175 SFPDTLVVISTDLSH------YLPPDEG--------QLLDARTASDIIS-------PEYR 213
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ CG P++ FL K +I LR S ++ V Y S A
Sbjct: 214 PLTNEMACGSQPLNGFLMAAQEAGLKGEILDLR--NSGDTAGPKNEVVGYGSFA 265
>gi|385811190|ref|YP_005847586.1| dioxygenase [Ignavibacterium album JCM 16511]
gi|383803238|gb|AFH50318.1| Putative dioxygenase [Ignavibacterium album JCM 16511]
Length = 272
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 127/286 (44%), Gaps = 19/286 (6%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVRGVIAPHAGYSYSGRAAAY 59
M+K+R AG +Y + L E+ L+EA KS ++ G+++PHAGY YSG+ AAY
Sbjct: 1 MKKVRPPHVAGYFYPADPVTLHNEIQRMLKEAKSEKSFDKIVGIVSPHAGYMYSGKTAAY 60
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ + + ++ PSH Y P ++ Y+TP+G + +D E +L K
Sbjct: 61 AYNTLKGKNYKTAIVISPSHAEYFPGISVYDGDAYETPLGIVEIDEERANKLVEGSKLIF 120
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
I EH++E+ +P+L V + KIVP+++G ++ R A+ +DD +
Sbjct: 121 RGIQGHRREHALEVQIPFLQSVLDD--FKIVPVVMGDQSSVFVNELARKLAEVMDDKT-I 177
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSD H+ +N D+ ++ + I D + +K L +
Sbjct: 178 IVASSDLSHF---YNASQADELDSIVEQRINNFDF-----------EQLQKDLDSHRCEA 223
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
CG PI V + K LR S + V Y SA
Sbjct: 224 CGGGPI-VAMMKAAALKNYDKATVLRRTDSGDVTGDKSEVVGYLSA 268
>gi|381159503|ref|ZP_09868735.1| putative dioxygenase [Thiorhodovibrio sp. 970]
gi|380877567|gb|EIC19659.1| putative dioxygenase [Thiorhodovibrio sp. 970]
Length = 267
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 131/298 (43%), Gaps = 45/298 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----EVRGVIAPHAGYSYSGR 55
M IR A AG++Y + L E+D + +P ++ +I PHAGY YSG
Sbjct: 1 MTTIRHACVAGTFYPADPAHLRAEIDACFAQRAPANAPGLAGERIKALIVPHAGYVYSGP 60
Query: 56 AAAYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA A+ ++ P + I RV LLGP+H ALS A ++TP+G +P+ E+L
Sbjct: 61 VAASAYASLQPQADEIRRVVLLGPAHRMPFQGLALSAADAWQTPLGSVPVARSACEQLLV 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYGRL 168
+ E +D D EHS+E+ LP+L ++ +++P+LVGA +A E+++G
Sbjct: 121 FPQVERLDAAFD-GEHSLEVQLPFLQRLLGD--FELIPLLVGAASAAEVAEVLESLWG-- 175
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+SSD H+ + S ALD I +P+
Sbjct: 176 ------GDETLILISSDLSHF--------------LDDTSARALDAQTAAAILRLEPE-- 213
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
L CGR+P+ L + +K + L SSQ D V Y + A
Sbjct: 214 ----LIGPEQACGRYPMGGLLRI--ARMRGLKPEQLDLRNSSQTAGSPDRVVGYGAYA 265
>gi|296272331|ref|YP_003654962.1| hypothetical protein [Arcobacter nitrofigilis DSM 7299]
gi|296096505|gb|ADG92455.1| protein of unknown function DUF52 [Arcobacter nitrofigilis DSM
7299]
Length = 265
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 129/285 (45%), Gaps = 37/285 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+ + +G++Y D+ K L + + + E + R +I PHAGY YSG A A+
Sbjct: 4 NKMNFSGTFYPDDKKELLKYFEVFTANEEKIKLDINPRAIIVPHAGYIYSGSVANVAYNL 63
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
T RV ++GPSH Y+ +++ Y+TP+GDL +D E +L K++ +
Sbjct: 64 SKDTKPKRVIVIGPSHRYFLQGASIAQFDSYETPLGDLVIDKEFGSKL--IEKYDFLHFV 121
Query: 124 VD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
D EHS E +P++ F+ +K++ I+ G ++A+ ++ K D NF +
Sbjct: 122 EDAHMEHSTETQIPFIKNYFD---VKVLEIVYGKIDAKELSLLVLELLK---DKDNFLVI 175
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
S+D H+ + LD + +D I+ D + K D CG+
Sbjct: 176 STDLSHFYTL--------------DEANILDGICIDAIKNQDIEKLTK-----DCEACGK 216
Query: 243 HPISVFLHMLGNCSTKI---KIKFLRYEQSSQCKTKRDSSVSYAS 284
+ L + K+ K+KFL Y+ S + + V YAS
Sbjct: 217 EGVKALLQ-----TAKVFDYKVKFLDYKTSYEITKDKSKVVGYAS 256
>gi|327311477|ref|YP_004338374.1| hypothetical protein TUZN_1595 [Thermoproteus uzoniensis 768-20]
gi|326947956|gb|AEA13062.1| hypothetical protein TUZN_1595 [Thermoproteus uzoniensis 768-20]
Length = 281
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 130/294 (44%), Gaps = 28/294 (9%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELD-------GWLREAGLPKSPEVRGVIAPHAGYSYS 53
M +IR+ + AG +Y + + L E + G E+ P+ + G +APHAGY YS
Sbjct: 1 MARIRKPAVAGYFYEADREALVERIRWSIYHDLGPKTESLSPEGAKALGAVAPHAGYIYS 60
Query: 54 GRAAAYAFGNIDPTS-ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+A+ + V ++GP+H+ A+ + +++TP+G L +D E E L
Sbjct: 61 GPVAAWAYAALRGYGRPDTVVVIGPNHYGVGAPVAIMKSGIWETPLGALEIDEEAAEFLS 120
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
+ + D EHS+E+H+P++ F +KIVP+ + G AK
Sbjct: 121 SEYRSLEDDFYAFSKEHSIEVHIPFIQYFFGD--VKIVPVALWRQTPSTAKELGAALAKM 178
Query: 173 VDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
V + + SSD H Y H H+ D++ + I GD + F
Sbjct: 179 VANFKKRIYIIASSDLNH------YEH--------HEVTAKKDELAISKIVAGDVEGFFD 224
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ YD + CG PI M + K K L++ S R+ +V YAS
Sbjct: 225 VISRYDISACGIGPIGAL--MKAAAALGFKAKLLKHATSGDTSGYREETVGYAS 276
>gi|328848465|gb|EGF97679.1| hypothetical protein MELLADRAFT_41191 [Melampsora larici-populina
98AG31]
Length = 144
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/136 (40%), Positives = 81/136 (59%), Gaps = 18/136 (13%)
Query: 120 MDICVDEAEHSMEMHLPY---LAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
MDI D+AEHS+E+HLPY ++K F+ I VPIL+G++++E E +YG++ Y+ P
Sbjct: 1 MDIKTDKAEHSIELHLPYVRHISKSFQT--ITNVPILIGSISSEKEFIYGKVLQPYLLSP 58
Query: 177 SNFFSVSSDFCHWGSRFNYMHY-----------DKKHGVIHKSIEALDKMGMDIIETGDP 225
N F +SSDF GSRF+Y +Y I++SIEALD+ + I+E+
Sbjct: 59 ENLFVISSDF--LGSRFSYTYYKDLARKLSSGASPPATPIYQSIEALDQEALKILESLSH 116
Query: 226 DAFKKYLLEYDNTICG 241
F +YL + N ICG
Sbjct: 117 SEFSRYLSQTKNNICG 132
>gi|386359785|ref|YP_006058030.1| dioxygenase [Thermus thermophilus JL-18]
gi|383508812|gb|AFH38244.1| putative dioxygenase [Thermus thermophilus JL-18]
Length = 263
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 122/291 (41%), Gaps = 36/291 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+ +R AG +Y L EE++ L A P P VRGV++PHAGY+Y+GR A A
Sbjct: 1 MDLVRPPQVAGYFYPGEKAALEEEVEALLAGARTPPLPGVRGVLSPHAGYAYAGRVMAEA 60
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
F + RVFLLGPSH P A ++TP+G++ +DLE L G F
Sbjct: 61 FRALSAWRGKARRVFLLGPSHFVAFPGVAFFPYRAWRTPLGEVAVDLEGGRRLLGQGAPF 120
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
E EHS+E+ LP+L I+P+L G V+ EA+ L
Sbjct: 121 RAYREPFLE-EHSLEVLLPFLQVALP--QTPILPLLFGEVDPGEVAEALLPEL------G 171
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P + SSD H+ D K K ALD G+
Sbjct: 172 PKDLVVASSDLSHYHPDPVARRLDAK---TLKRALALDAEGVA----------------- 211
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR P S + + K + L Y S++ + R+ V Y + A
Sbjct: 212 QGEACGRLPWSTLTAL--ARALGWKPRLLAYATSAEARGGRERVVGYGALA 260
>gi|373458175|ref|ZP_09549942.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Caldithrix abyssi DSM 13497]
gi|371719839|gb|EHO41610.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Caldithrix abyssi DSM 13497]
Length = 276
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 128/286 (44%), Gaps = 23/286 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
IR+ + AG +Y+ N L E+ +L + K+ + GV+APHAGY YSG AA A+
Sbjct: 6 IRKPAVAGMFYSGNRSTLEREVAVFLENSNQEKNVRHIYGVVAPHAGYMYSGGVAARAYR 65
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA-TGKFELMD 121
+ V ++ PSHH Y ++ Y+TP+G +P+D + +L + L
Sbjct: 66 QVMDFEYEVVVVIAPSHHVYFETVSIYDGDFYETPMGLIPVDKNLCRQLADFDQRLTLSS 125
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
I + EH++E+ LP+L +FE K+VPI++G + +N A +D+
Sbjct: 126 IGHEGEEHALEVQLPFLQHIFEE--FKLVPIVMGDQSMKNIQALANALAAVLDNKKTLIV 183
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H+ S ++ LD + + I D + L +CG
Sbjct: 184 ASSDLSHYHS--------------YEEAVRLDSVVISHINNFKEDNLYEDLQSGLCEMCG 229
Query: 242 RHPISVFLHMLGNC--STKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
P+ + M+ C S K K + Y S +R V Y +A
Sbjct: 230 GGPV---IAMMKACRKSGADKAKVVLYRNSGDVTGERAPVVGYLAA 272
>gi|148656531|ref|YP_001276736.1| hypothetical protein RoseRS_2409 [Roseiflexus sp. RS-1]
gi|148568641|gb|ABQ90786.1| protein of unknown function DUF52 [Roseiflexus sp. RS-1]
Length = 284
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 124/288 (43%), Gaps = 23/288 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
IR + AGSWY N L E+D +L +A P P +V GV+APHAG YSG AA+AF
Sbjct: 3 IRPSPIAGSWYPSNPAHLQHEIDRYLAQAEPPVLPGKVWGVLAPHAGVRYSGPIAAWAFA 62
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI----EELKATGKFE 118
+ + + + P H + T Y+TP+G +P+D I E L+ F
Sbjct: 63 CVRGRTPEIIVIASPWHRGGPTPLITTGHTAYETPLGIVPVDNNAIAQLDEALRRRAGFG 122
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
L D+ EH++E+ LP+L +VF ++P+++ +A A G A+ +
Sbjct: 123 LTPRRHDD-EHAVEIELPFLQRVFGSFW--LLPVMLADQSAVTSAALGAALAETLRGRDA 179
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSD H+ + V + LD + +E DPDA E
Sbjct: 180 LLVASSDLSHY----------EPEPVARR----LDAGLLRRVEAFDPDAVIAAEAEGVGY 225
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG I+ L ++ LRY S + V Y +AA
Sbjct: 226 ACGSGAIAAVLWACRELGAN-RVTILRYGTSGDVERAAPWVVGYGAAA 272
>gi|427428692|ref|ZP_18918732.1| Putative dioxygenase [Caenispirillum salinarum AK4]
gi|425881800|gb|EKV30484.1| Putative dioxygenase [Caenispirillum salinarum AK4]
Length = 470
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 133/292 (45%), Gaps = 33/292 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWL---REAGLPKSPEVRGVIAPHAGYSYSGRAA 57
M +R A+ AG++Y ++ L +D + R A ++P + +IAPHAGY +SG A
Sbjct: 1 MTHMRPAAVAGAFYPGDAATLTRTVDDLMAEARAATPAETPVPKAIIAPHAGYMFSGPTA 60
Query: 58 AYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A A+ + P + RV LLGP+H AL T ++TP+G + LD + I ++
Sbjct: 61 ALAYARLAPLKGRVKRVVLLGPAHRVPFRGLALPTVDSFETPLGPVALDTDAINDVAEHV 120
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA-ENEAMYGRLFAKYVD 174
+ EHS+E+HLP+L + G ++VP+ VG A E A+ L+
Sbjct: 121 PGAGFSDKAHQQEHSLEVHLPFLRAIL-GPEPEVVPVCVGDAKADEVAALLDHLWGGE-- 177
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
+S+D H ++ YD V ++ A++ + DP L
Sbjct: 178 --ETLIVISTDLSH------FLDYDTARRVDAGTVAAVEHL--------DPSR-----LT 216
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
++N CG P++ L L ++ I+ L S RD V YA+ A
Sbjct: 217 HENA-CGLTPMAGLL--LSAKKRRMSIETLDARSSGDTAGPRDRVVGYAAWA 265
>gi|218295903|ref|ZP_03496683.1| protein of unknown function DUF52 [Thermus aquaticus Y51MC23]
gi|218243641|gb|EED10169.1| protein of unknown function DUF52 [Thermus aquaticus Y51MC23]
Length = 264
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 88/295 (29%), Positives = 132/295 (44%), Gaps = 36/295 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+ +R A AG +Y + L EE+ L+ A P P VRGV+APHAGY YSG+ AA +
Sbjct: 1 MDLVRPAYVAGQFYPGEREALLEEVASLLKAAKTPPLPGVRGVLAPHAGYGYSGKVAAES 60
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
F + R LLGPSH A ++TP+G++ +DLE L G F
Sbjct: 61 FQALSGFRGQAQRAILLGPSHFVPFLGVAFYPYRAWRTPLGEVAVDLEGGRRLLERGFPF 120
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
++ E EHS+E+ LP+L G + I+P+L G V+ EA+ L
Sbjct: 121 LRLEEPFWE-EHSLEVLLPFLQVALPG--VPILPLLFGEVDPLRVAEALLPEL------S 171
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P + SSD H+ H D V + D+ ++I + D +A +
Sbjct: 172 PQDLVVASSDLSHY-------HPDP---VARER----DRRTLEIALSLDAEALSR----- 212
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
CGR P + + + ++ L Y S++ R V YA+ A +D
Sbjct: 213 -AEACGRLPWASLTALARELGWRPRL--LAYATSAEAFGDRRRVVGYAAVAYGLD 264
>gi|55981680|ref|YP_144977.1| hypothetical protein TTHA1711 [Thermus thermophilus HB8]
gi|55773093|dbj|BAD71534.1| conserved hypothetical protein [Thermus thermophilus HB8]
Length = 456
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 121/291 (41%), Gaps = 36/291 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+ +R AG +Y L EE+ L A P P VRGV++PHAGY+Y+GR A A
Sbjct: 1 MDLVRPPQVAGYFYPGEKAALKEEVKALLAGARTPPLPGVRGVLSPHAGYAYAGRVMAEA 60
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
F + RVFLLGPSH P A ++TP+G++ +DLE L G F
Sbjct: 61 FRALSAWRGKARRVFLLGPSHFVAFPGVAFFPYRAWRTPLGEVAVDLEGGRRLLGQGAPF 120
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
E EHS+E+ LP+L I+P+L G V+ EA+ L
Sbjct: 121 RAYREPFLE-EHSLEVLLPFLQVALPQ--TPILPLLFGEVDPGEVAEALLPEL------G 171
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P + SSD H+ D K K ALD G+
Sbjct: 172 PKDLVVASSDLSHYHPDPVARRLDAK---TLKRALALDAEGVA----------------- 211
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR P S + + K + L Y S++ + R+ V Y + A
Sbjct: 212 QGEACGRLPWSTLTAL--ARALGWKPRLLAYATSAEARGGRERVVGYGALA 260
>gi|452944125|ref|YP_007500290.1| putative dioxygenase [Hydrogenobaculum sp. HO]
gi|452882543|gb|AGG15247.1| putative dioxygenase [Hydrogenobaculum sp. HO]
Length = 262
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 136/282 (48%), Gaps = 24/282 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR + AG +Y + L + ++ + E + + + +GVI PHAGY YSG A +
Sbjct: 2 IREEAVAGLFYPGDKDILLQAIN-LVCENEVIQDVDAKGVIVPHAGYIYSGHTACSVYKR 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + ++GP+H P ++ ++ +KTP+G++P++ E+ +++ + G E+ +
Sbjct: 61 LKPKK--NIIMMGPNHTGQGPVISIDSSDAWKTPLGEVPINKELRDKIASFG-IEI-EPK 116
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
EHS+E+ LP+L F+ IVP+++G ++ E G+ + +VDD VS
Sbjct: 117 AHIKEHSLEVQLPFLQVYFKD--FTIVPMILGFLDYETIVKLGKFLSSFVDD-DTLILVS 173
Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
SD H+ S+ KK ++ +I LD + ++Y+ T+CG
Sbjct: 174 SDMSHYISQ---QEAQKKDSFLYDAISKLDS-----------KELYRRAIQYNITMCGFI 219
Query: 244 PISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
P +V + L + K+ + Y S + V+Y S
Sbjct: 220 PAAVTIESLKDKVKKVYP--VEYTNSGMVTKDFEKVVAYLSV 259
>gi|384431889|ref|YP_005641249.1| hypothetical protein [Thermus thermophilus SG0.5JP17-16]
gi|333967357|gb|AEG34122.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thermus thermophilus SG0.5JP17-16]
Length = 263
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 120/291 (41%), Gaps = 36/291 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+ +R AG +Y L EE+ L A P P VRGV++PHAGY+Y+GR A A
Sbjct: 1 MDLVRPPQVAGYFYPGEKAALKEEVKALLAGARTPPLPGVRGVLSPHAGYAYAGRVMAEA 60
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
F + RVFLLGPSH P A ++TP+G++ +DLE L G F
Sbjct: 61 FRALSAWRGKARRVFLLGPSHSVAFPGVAFFPYRAWRTPLGEVAVDLEGGRRLLGQGAPF 120
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
E EHS+E+ LP+L I+P+L G V+ EA+ L
Sbjct: 121 RAYREPFLE-EHSLEVLLPFLQVALP--QTPILPLLFGEVDPGEVAEALLPEL------G 171
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P + SSD H+ D K K ALD G+
Sbjct: 172 PKDLVVASSDLSHYHPDPVARRLDAK---TLKRALALDAEGVA----------------- 211
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR P S + + K + L Y S++ R+ V Y + A
Sbjct: 212 QGEACGRLPWSTLTAL--ARALGWKPRLLAYATSAEAGGGRERVVGYGALA 260
>gi|160902549|ref|YP_001568130.1| hypothetical protein Pmob_1088 [Petrotoga mobilis SJ95]
gi|160360193|gb|ABX31807.1| protein of unknown function DUF52 [Petrotoga mobilis SJ95]
Length = 274
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 78/260 (30%), Positives = 128/260 (49%), Gaps = 31/260 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV---RGVIAPHAGYSYSGRAA 57
MEK+R AG++Y N ++L L + ++ + ++ G I PHAGY YSG A
Sbjct: 1 MEKVRNPVVAGTFYPSNPEKLKNLLKSFFGDSFKVNTEKLIPPMGAIVPHAGYIYSGETA 60
Query: 58 AYAFGNIDPTSIS-RVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A A+ + I+ RVFLLGP+H K ++ T+ +KTP+G + +D K GK
Sbjct: 61 AKAYKKVFEKGIAKRVFLLGPNHTGLGSKISVFTSGSWKTPLGTINVD------GKTAGK 114
Query: 117 -FELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
+ +DI DE+ EHS+E+ LP+L G+ +IVPI + + E G + A
Sbjct: 115 ILKELDIYNDESAHSREHSLEVQLPFLQYAI-GNDFEIVPICMMDQSLETSKNLGEILAD 173
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+++ + SSD H+ S + D+ K IE L M + + D ++Y
Sbjct: 174 IIEE-GDLIIASSDMNHYESHEKTLLKDE------KVIETLKNMNLQEMY----DTIRRY 222
Query: 232 LLEYDNTICGRHPISVFLHM 251
+ ++CG P++ L +
Sbjct: 223 NI----SMCGYGPVAALLSI 238
>gi|406911945|gb|EKD51642.1| hypothetical protein ACD_62C00211G0005 [uncultured bacterium]
Length = 262
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/289 (29%), Positives = 128/289 (44%), Gaps = 41/289 (14%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLRE----AGLPKSPEVRGVIAPHAGYSYSGRAAA 58
KIR+ + AG +Y + L +++ +L + A +PK+ +I PHAGY YSG+ AA
Sbjct: 4 KIRKPAVAGMFYPAEPQVLKQDIQTYLSQVQDCALVPKA-----IIVPHAGYIYSGQVAA 58
Query: 59 YAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A+ + I RV LLGPSH AL+ A Y T +GD+ LD E L K
Sbjct: 59 QAYAQVSTIKDKIKRVVLLGPSHRVGFRGAALTNARCYNTSLGDVFLDTTAHEVLAGLPK 118
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+ D EHS+E+ LP+L V H +VP++VG E G + +
Sbjct: 119 VVVFDQA-HVPEHSLEVQLPFLQVVL--HDFVLVPVVVGDARPEE---VGEMLCRLWGGE 172
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
+S+D H+ + S A D ++E D+F E+D
Sbjct: 173 ETLIVISTDLSHFED--------------YASACAHDARTCRLVEDMAYDSF-----EHD 213
Query: 237 NTICGRHPISVFLHMLGNCSTK-IKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ CGR P+ ML S K ++IK L + S +D V Y +
Sbjct: 214 DA-CGRVPLG---GMLRAASQKGLRIKTLELKNSGDTAGTKDRVVGYGA 258
>gi|288869705|ref|ZP_05975843.2| conserved hypothetical protein [Methanobrevibacter smithii DSM
2374]
gi|288861209|gb|EFC93507.1| conserved hypothetical protein [Methanobrevibacter smithii DSM
2374]
Length = 285
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 139/298 (46%), Gaps = 30/298 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSPEVR---------GVIAPHAGYSYS 53
+R+ + AG +Y D+S+ L + + D +L G K P++ V+ PHAG+ YS
Sbjct: 5 LRKPAVAGMFYPDDSEELVKTIEDCFLHSFGPGKIPDIESFEGNDYPVNVMVPHAGFQYS 64
Query: 54 GRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+++ + VF++ GP+H + ++ + TP+G++ +D E + L
Sbjct: 65 GTIAAHSYCELAKNGFPEVFIIIGPNHTGLGSEVSVFNKGEWITPLGNIQIDEEFADTLI 124
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA--VNAENEAMYGRLFA 170
+ F D EHS+E+ LP+L + + KIVP+++G+ ++A N+ L A
Sbjct: 125 SFSDFASADFAAHMREHSIEVQLPFLQ--YFSNDFKIVPVVLGSQTISAANDLAAAILKA 182
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
D S SSD H+ ++ +K G + + IE +D+ + +
Sbjct: 183 GEKLDKSYCVIASSDLSHFNTQ---ERANKVDGFVLEDIENMDEFKL-----------LE 228
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+++Y+ T+CG P+ + + C + L Y+ S S V YAS K
Sbjct: 229 EIIQYNITMCGYGPVMTTMILSKMCGKNTS-EILAYKTSGDISGDLSSVVGYASGIFK 285
>gi|195953308|ref|YP_002121598.1| hypothetical protein HY04AAS1_0933 [Hydrogenobaculum sp. Y04AAS1]
gi|195932920|gb|ACG57620.1| protein of unknown function DUF52 [Hydrogenobaculum sp. Y04AAS1]
Length = 262
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 139/280 (49%), Gaps = 26/280 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG +Y + L + ++ + E + + + +GVI PHAGY YSG A +
Sbjct: 2 VREEAVAGLFYPGDKDILLQAIN-LVCENEVIQDVDAKGVIVPHAGYIYSGHTACSVYKR 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + ++GP+H P ++ ++ +KTP+G++P++ E+ +++ + G E+ +
Sbjct: 61 LKPKK--NIIMMGPNHTGQGPVISIDSSDAWKTPLGEVPINKELRDKIASFG-IEI-EPK 116
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
EHS+E+ LP+L F+ IVP+++G ++ E +G+ + +VDD VS
Sbjct: 117 AHIKEHSLEVQLPFLQVYFKD--FTIVPMILGFLDYETIVKFGKFLSSFVDD-DTLILVS 173
Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
SD H+ S+ KK ++ +I LD + + ++Y+ T+CG
Sbjct: 174 SDMSHYISQ---EEAQKKDSFLYDTISRLDSKEL-----------YRRAIQYNITMCGFI 219
Query: 244 PISVFLHMLGNCSTKIKIKF-LRYEQSSQCKTKRDSSVSY 282
P +V + L + K+K + + Y S + V+Y
Sbjct: 220 PAAVAIEALKD---KVKKAYAVEYTNSGMVTKDFEKVVAY 256
>gi|435851774|ref|YP_007313360.1| putative dioxygenase [Methanomethylovorans hollandica DSM 15978]
gi|433662404|gb|AGB49830.1| putative dioxygenase [Methanomethylovorans hollandica DSM 15978]
Length = 266
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 126/287 (43%), Gaps = 31/287 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+A+ AG +Y N + L +ELD + ++GV+ PHAGY YSGR AA ++
Sbjct: 1 MRQATVAGKFYPANPEALRKELDNCFSGTTIDPRSGIKGVVVPHAGYIYSGRVAARSYAV 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ +F GP+H Y ALST T + TP+G + D E+ E+L + IC
Sbjct: 61 LPEADTYVIF--GPNHTGYGSPVALSTET-WSTPLGKVETDHELAEKLAGS------IIC 111
Query: 124 VDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN- 178
DE EHS+E+ +P+L F+ H +I+P+ +G + E G A+ +
Sbjct: 112 QDEVAHRFEHSIEVQIPFLQYRFK-HDFRILPVCIGMQDEEIALEVGMEIARAITAIGRK 170
Query: 179 -FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
SSDF HY H E+ D+ + I + D L D
Sbjct: 171 AVIIASSDFT---------HYQPAH-----VAESNDRYLIKAILDMNVDDLYNRLYARDI 216
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ CG PI+ + K + L+Y S + V YAS
Sbjct: 217 SACGYGPIAATITAARKLGAS-KAELLKYATSGEVTGDDSGVVGYAS 262
>gi|148643498|ref|YP_001274011.1| dioxygenase [Methanobrevibacter smithii ATCC 35061]
gi|206558195|sp|A5UN65.1|Y1438_METS3 RecName: Full=MEMO1 family protein Msm_1438
gi|148552515|gb|ABQ87643.1| predicted dioxygenase [Methanobrevibacter smithii ATCC 35061]
Length = 282
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 139/298 (46%), Gaps = 30/298 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSPEVR---------GVIAPHAGYSYS 53
+R+ + AG +Y D+S+ L + + D +L G K P++ V+ PHAG+ YS
Sbjct: 2 LRKPAVAGMFYPDDSEELVKTIEDCFLHSFGPGKIPDIESFEGNDYPVNVMVPHAGFQYS 61
Query: 54 GRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+++ + VF++ GP+H + ++ + TP+G++ +D E + L
Sbjct: 62 GTIAAHSYCELAKNGFPEVFIIIGPNHTGLGSEVSVFNKGEWITPLGNIQVDEEFADTLI 121
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA--VNAENEAMYGRLFA 170
+ F D EHS+E+ LP+L + + KIVP+++G+ ++A N+ L A
Sbjct: 122 SFSDFASADFAAHMREHSIEVQLPFLQ--YFSNDFKIVPVVLGSQTISAANDLAAAILKA 179
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
D S SSD H+ ++ +K G + + IE +D+ + +
Sbjct: 180 GEKLDKSYCVIASSDLSHFNTQ---ERANKVDGFVLEDIENMDEFKL-----------LE 225
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+++Y+ T+CG P+ + + C + L Y+ S S V YAS K
Sbjct: 226 EIIQYNITMCGYGPVMTTMILSKMCGKNTS-EILAYKTSGDISGDLSSVVGYASGIFK 282
>gi|325295386|ref|YP_004281900.1| hypothetical protein Dester_1205 [Desulfurobacterium
thermolithotrophum DSM 11699]
gi|325065834|gb|ADY73841.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Desulfurobacterium thermolithotrophum DSM 11699]
Length = 267
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 124/248 (50%), Gaps = 22/248 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG +Y NS+ L L+ + R +PK + + +I PHAGY YSG+ A +
Sbjct: 2 VRYPAVAGQFYPGNSEELKLYLESFCRN-NIPKV-DAKAIIVPHAGYIYSGKVAGETYSR 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
++ S++ ++GP+H ++ + ++ TP+G++P++ + +L FE D
Sbjct: 60 VEIPSLN--IIMGPNHTGLGKSVSVYPSGIWITPLGEIPINEHITSKLLNNSPFE-ADTA 116
Query: 124 VDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EHS+E+ LP+L + G+ + IVPI ++ + G + AK +++ +
Sbjct: 117 AHIYEHSLEVQLPFL-QYCSGYREDLSIVPITYKYISYSDCIKAGEVLAKVLEEDNGLIV 175
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+S+DF H+ S+ E D + +D I + +P+ K + Y+ ++CG
Sbjct: 176 ISTDFSHYISQ--------------SEAEKYDSLAIDAILSLNPEELYKRVFTYNISMCG 221
Query: 242 RHPISVFL 249
P ++ L
Sbjct: 222 VIPATIGL 229
>gi|308050676|ref|YP_003914242.1| hypothetical protein Fbal_2966 [Ferrimonas balearica DSM 9799]
gi|307632866|gb|ADN77168.1| protein of unknown function DUF52 [Ferrimonas balearica DSM 9799]
Length = 257
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 96/188 (51%), Gaps = 9/188 (4%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+IR A+ AG +Y + LA E+D L + RG+I PHAGY YSG AA A+
Sbjct: 2 RIRPAAVAGYFYPGEANELAREVDRLLSQPKPALPAPARGLIVPHAGYVYSGAIAASAYL 61
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE-LMD 121
+ RV L+GP+H AL T Y+TP+G + LD + I L G++E +D
Sbjct: 62 TLPEQRFRRVILMGPAHTKAVRGIALPDWTHYQTPLGRVALDTDAINRLD--GQYESRID 119
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EH++E+ LP++ G L ++P++VGA+ +E A + P +
Sbjct: 120 NLAHAREHALEVQLPFIQVRLSGCL--LLPMVVGAIRSETLAECLTALMQ----PDDLLI 173
Query: 182 VSSDFCHW 189
VS+D H+
Sbjct: 174 VSTDMSHF 181
>gi|222445004|ref|ZP_03607519.1| hypothetical protein METSMIALI_00620 [Methanobrevibacter smithii
DSM 2375]
gi|222434569|gb|EEE41734.1| Memo-like protein [Methanobrevibacter smithii DSM 2375]
Length = 285
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 139/298 (46%), Gaps = 30/298 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSPEVR---------GVIAPHAGYSYS 53
+R+ + AG +Y D+S+ L + + D +L G K P++ V+ PHAG+ YS
Sbjct: 5 LRKPAVAGMFYPDDSEELVKTIEDCFLHSFGPGKIPDIESFEGNDYPVNVMVPHAGFQYS 64
Query: 54 GRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+++ + VF++ GP+H + ++ + TP+G++ +D E + L
Sbjct: 65 GTIAAHSYCELARNGFPEVFIIIGPNHTGLGSEVSVFNKGEWITPLGNIQVDEEFADTLI 124
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA--VNAENEAMYGRLFA 170
+ F D EHS+E+ LP+L + + KIVP+++G+ ++A N+ L A
Sbjct: 125 SFSDFASADFAAHMREHSIEVQLPFLQ--YFSNDFKIVPVVLGSQTISAANDLAAAILKA 182
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
D S SSD H+ ++ +K G + + IE +D+ + +
Sbjct: 183 GEKLDKSYCVIASSDLSHFNTQ---ERANKVDGFVLEDIENMDEFKL-----------LE 228
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+++Y+ T+CG P+ + + C + L Y+ S S V YAS K
Sbjct: 229 EIIQYNITMCGYGPVMTTMILSKMCGKNTS-EILAYKTSGDISGDLSSVVGYASGIFK 285
>gi|394987639|ref|ZP_10380478.1| hypothetical protein SCD_00035 [Sulfuricella denitrificans skB26]
gi|393792858|dbj|GAB70117.1| hypothetical protein SCD_00035 [Sulfuricella denitrificans skB26]
Length = 271
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 133/291 (45%), Gaps = 34/291 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEV-RGVIAPHAGYSYSGRAA 57
M++IR A+ AG++YT + + LA E+ L EA + KS + + VI PHAGY YSG A
Sbjct: 1 MKQIRPAAVAGAFYTGDPRSLASEVAAMLEEAEQHMEKSATLPKAVIVPHAGYIYSGPVA 60
Query: 58 AYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A A+ ++ RV LLGP H AL + + TP+G +PLDL+ + ++
Sbjct: 61 ASAYALLEQGRNIYRRVVLLGPVHRVPVHGLALPSTEGFATPLGVVPLDLDAMAKIAGLP 120
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
+ D EHS+E+ LP+L V + K+VP+ VG V + A + +
Sbjct: 121 QVVEYD-AAHALEHSLEVQLPFLQAVLDD--FKLVPLAVGDVTPQEVA---EVLERLWGG 174
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P +SSD H ++ Y + + + +A+ +D+ ++ + +
Sbjct: 175 PETLIVISSDLSH------FLTYREAQQIDSTTAQAI----LDL----------RWPVSH 214
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ CG P++ L L +K + L S R V Y + A
Sbjct: 215 EQA-CGGTPVNGLL--LAAQHHHLKPRLLNLRNSGDTAGDRSRVVGYGAFA 262
>gi|352683068|ref|YP_004893592.1| putative dioxygenase [Thermoproteus tenax Kra 1]
gi|350275867|emb|CCC82514.1| Predicted dioxygenase [Thermoproteus tenax Kra 1]
Length = 281
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 135/296 (45%), Gaps = 32/296 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR-------GVIAPHAGYSYS 53
M K+RR + AG +Y + L E + +R P S ++ G + PHAGY YS
Sbjct: 1 MHKVRRPAVAGYFYEAEREALLERIKWAVRHEVGPGSETLKLEGTRALGAVVPHAGYIYS 60
Query: 54 GRAAAYAFGNIDP-TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+A+ + + V ++GP+H+ A+ + V++TP+G L +D E+ E L
Sbjct: 61 GPVAAWAYAALRSFGAPDSVVIVGPNHYGVGSPVAIMRSGVWETPLGSLSVDEELAEGLI 120
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRL 168
K D EHS+E+ LP+L F +KIVPI + +V E + R+
Sbjct: 121 REYKALEDDFYAFSREHSIEVQLPFLQYFFGD--VKIVPIALWRQTPSVAKELGSALARV 178
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+K+ + + SSDF H Y H H+ D++ + I GD D F
Sbjct: 179 ISKH--NKKVYVLASSDFNH------YEH--------HEITVKKDEVAIKEIVEGDVDRF 222
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ L +YD + CG P +V M K + L++ S + +V YAS
Sbjct: 223 FEVLSKYDISACG--PGAVGALMRAASELGFKARLLKHATSGDTSGYVEETVGYAS 276
>gi|117925442|ref|YP_866059.1| hypothetical protein Mmc1_2152 [Magnetococcus marinus MC-1]
gi|117609198|gb|ABK44653.1| protein of unknown function DUF52 [Magnetococcus marinus MC-1]
Length = 481
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 71/224 (31%), Positives = 110/224 (49%), Gaps = 20/224 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPK---SPEVRGVIAPHAGYSYSGRAAAY 59
+IR A+ AG +Y + L + + L++A PK E R +APHAGY YSG AAY
Sbjct: 31 RIRPAAVAGMFYPAQADALRQLVRSLLQQA--PKRHDQGEPRAFVAPHAGYRYSGLTAAY 88
Query: 60 AFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A+ + R VFLLGPSH +L ++TP+G + +DL ++E + A
Sbjct: 89 AYNTLQAAPKERPRRVFLLGPSHRVALHGASLGNYDAFETPLGLVEVDLPLVERMAAQES 148
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
++D EHS+E+HLP+L + H ++VP++ G + A + AKY +
Sbjct: 149 DLVLDNAPHAQEHSLEVHLPFLQESL-AHF-RLVPMVFGRIEPSRVA---EILAKY-REA 202
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDII 220
+ SSD H + YD G+ H EA+ M + +
Sbjct: 203 DDLIVGSSDLSH------FYDYDTAVGLDHTCNEAVLNMDLQAM 240
>gi|170725419|ref|YP_001759445.1| hypothetical protein Swoo_1057 [Shewanella woodyi ATCC 51908]
gi|169810766|gb|ACA85350.1| protein of unknown function DUF52 [Shewanella woodyi ATCC 51908]
Length = 268
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 67/205 (32%), Positives = 106/205 (51%), Gaps = 14/205 (6%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRG-----VIAPHAGYSYSGRAAA 58
R+A+ AG +Y L + ++ +L R A L P+++ +I PHAGY YSG+ AA
Sbjct: 6 RQAAVAGRFYPAEPTALKQMINTYLGRCASLTNDPKLKSSPTKVIIVPHAGYIYSGQVAA 65
Query: 59 YAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AF + P I RV LLGP+H + CAL ++ ++TP+G +D + +L K
Sbjct: 66 SAFALLAPRRKQIKRVVLLGPAHRVHLKGCALPSSESFETPLGQQVIDKVALRQLSEHSK 125
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+ D+ +E EHS+E+ LP+L + +++PILVG ++ AM L + D
Sbjct: 126 ILISDLPHNE-EHSLEVELPFLQLCLDK--FELIPILVGDISP--HAMASILEQVWGGD- 179
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKK 201
VSSD H+ S D++
Sbjct: 180 ETLIVVSSDLSHYHSYQEAAELDRE 204
>gi|375262828|ref|YP_005025058.1| hypothetical protein VEJY3_18321 [Vibrio sp. EJY3]
gi|369843255|gb|AEX24083.1| hypothetical protein VEJY3_18321 [Vibrio sp. EJY3]
Length = 267
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 96/190 (50%), Gaps = 11/190 (5%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLRE-AGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
K R A+ AG +Y + L +LDG+ E A L ++P + +I PHAGY YSG AA AF
Sbjct: 2 KYREAAVAGRFYPATANELITQLDGYFAEQALLTEAP--KALIVPHAGYFYSGEVAAKAF 59
Query: 62 GNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
+ + + SRV L GPSHH CA+ + V+ TP G++ +D E +E+L +
Sbjct: 60 RLLSNSKNTFSRVVLFGPSHHVALDGCAVPDSDVFLTPTGEVMIDREGVEDLLSQDLVAT 119
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
D EHS+E+ LP+L ++PI+VG + R+ P+
Sbjct: 120 SDQA-HHWEHSLEVQLPFLQYCLND--FTLLPIVVG---RDLHGYVKRILEAVTQTPNTL 173
Query: 180 FSVSSDFCHW 189
VSSD H+
Sbjct: 174 IVVSSDLSHY 183
>gi|251773044|gb|EES53600.1| protein of unknown function DUF52 [Leptospirillum
ferrodiazotrophum]
Length = 469
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 105/209 (50%), Gaps = 40/209 (19%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG---------LPKSPEVRGVIAPHAGYS 51
M+++R + AGS+Y + L +L+ L A LPK+ +I PHAGY
Sbjct: 1 MDRVRYPAVAGSFYEGEASALRAKLESLLARARSEGSGGRKELPKA-----LIVPHAGYL 55
Query: 52 YSGRAAAYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
YSG AA A+ ++ ISRV LLGP+H Y AL A V++TP+GD+P+D E++
Sbjct: 56 YSGPVAARAYSLLEKAQGRISRVVLLGPAHRVYFHGLALPGADVFRTPLGDVPVDPELVH 115
Query: 110 ELKATGKFELMDIC----VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN---- 161
+++ L +C EHS+E+ LP+L +V ++P++VG +A
Sbjct: 116 RVES-----LPQVCERPEAHRQEHSLEVQLPFLQRVL--GRFGLLPLVVGEASAREVLEV 168
Query: 162 -EAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
EA+ G P VSSD H+
Sbjct: 169 LEAVRG--------GPETLLVVSSDLSHY 189
>gi|78358001|ref|YP_389450.1| mediator of ErbB2-driven cell motility-containing protein
[Desulfovibrio alaskensis G20]
gi|78220406|gb|ABB39755.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Desulfovibrio alaskensis G20]
Length = 298
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 126/302 (41%), Gaps = 37/302 (12%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+ IRR AG +Y D+ L L +L EA P + PHAGY +SG A
Sbjct: 19 QHIRRPVVAGRFYPDSPAELQSMLRAYLDEAAAPPQKPTLLAMVPHAGYVFSGAVAGCTL 78
Query: 62 GN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ P ++ F+LGP+H A+ V++TP+GD+P+D + E A
Sbjct: 79 AQAMLPQTL---FVLGPNHTGRGSGIAVWPEGVWRTPLGDVPVDNALAAEFCALCAPARP 135
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRLF--AKYVD 174
D AEHS+E+ LP+L ++IVP+ +G AV A ++ A
Sbjct: 136 DTLAHSAEHSLEVVLPFLQ--LRVPRVRIVPVSIGDPSLAVLTAAGAAMAQIIRRAAQTA 193
Query: 175 DPSN----FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
DP VSSD H ++ D+ LD M ++ + +P
Sbjct: 194 DPGGQNRIAMVVSSDMTH------FLPQDEAR--------RLDAMALEQVTALNPQGLYT 239
Query: 231 YLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
+ E ++CG P++ L +LG ++ RY S K DS V YA A
Sbjct: 240 TVREKRISMCGVLPMTAALEACRLLGATGARLT----RYATSGDITGKTDSVVGYAGALV 295
Query: 288 KV 289
V
Sbjct: 296 DV 297
>gi|345863536|ref|ZP_08815746.1| putative dioxygenase [endosymbiont of Tevnia jerichonana (vent
Tica)]
gi|345125318|gb|EGW55188.1| putative dioxygenase [endosymbiont of Tevnia jerichonana (vent
Tica)]
Length = 241
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 128/267 (47%), Gaps = 33/267 (12%)
Query: 25 LDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTS--ISRVFLLGPSHHYY 82
+D +L +AG + P + ++APHAG+ YSG AA + + P + ISRV L+GPSH
Sbjct: 2 VDDYLGQAGEGRPP--KAIVAPHAGFVYSGPVAASVYHRLTPVADRISRVVLVGPSHRVA 59
Query: 83 TPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF 142
ALS+A ++TP+G++PL+ +V +EL + + D+ EHS+E+ LP+L +V
Sbjct: 60 FRGLALSSAQSFQTPLGEIPLE-QVSDELLRLPQLSVNDMA-HAQEHSLEVQLPFLQRVL 117
Query: 143 EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKH 202
+++PI+VG + A + + D +SSD H+ +Y+
Sbjct: 118 GE--FRLLPIVVGDASG---AEVAEVLEQVWDGEETLIVISSDLSHY---HDYL------ 163
Query: 203 GVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIK 262
+ + +D+ D I P+ L Y++ CGR P+ L + ++
Sbjct: 164 -----TAQQMDRTTSDAILALQPER-----LHYEDA-CGRIPLQGLLEVAQRKGLDGELV 212
Query: 263 FLRYEQSSQCKTKRDSSVSYASAAAKV 289
LR S +D V Y + A V
Sbjct: 213 DLR--NSGDTAGLKDRVVGYGAYAFAV 237
>gi|302665144|ref|XP_003024185.1| hypothetical protein TRV_01684 [Trichophyton verrucosum HKI 0517]
gi|291188230|gb|EFE43574.1| hypothetical protein TRV_01684 [Trichophyton verrucosum HKI 0517]
Length = 205
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 89/203 (43%), Gaps = 68/203 (33%)
Query: 153 LVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHY-------------- 198
+VG+ +A NE ++G + Y+ + N F +SSDFCHWG+RF Y +Y
Sbjct: 1 MVGSTSAANERVFGSILRPYLANEENAFVISSDFCHWGTRFAYTYYVQAPDPTTYKLPLT 60
Query: 199 --------------------------------DKKHG------VIHKSIEALDKMGMDII 220
DK + IH+S+ A D M I
Sbjct: 61 YPSLPVPSDMLTAEKAAEEVRAASSGRMLRENDKSYNKEKYMWAIHESVSACDMACMSAI 120
Query: 221 ETGDPDAFKKYLLEYDNTICGRHPISVFL-----------HMLGNCSTKIKIK-----FL 264
+G+ D F++ L NT+CGRHPI V + +G T+ +K F+
Sbjct: 121 ASGNTDVFQEALRRTGNTVCGRHPIGVVMAGIEGLQHQAETTMGAAVTQDGVKKGKFYFI 180
Query: 265 RYEQSSQCKTKRDSSVSYASAAA 287
RYE+SS C + +DSSVSY SA A
Sbjct: 181 RYERSSDCFSVQDSSVSYVSAFA 203
>gi|116749427|ref|YP_846114.1| AMMECR1 domain-containing protein [Syntrophobacter fumaroxidans
MPOB]
gi|116698491|gb|ABK17679.1| AMMECR1 domain protein [Syntrophobacter fumaroxidans MPOB]
Length = 522
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/228 (28%), Positives = 109/228 (47%), Gaps = 17/228 (7%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVRGVIAPHAGYSYSGRAAAYA 60
E++R+ + AG+WY + L ++++G+L PK ++ +I+PHAG YSG+ AAY
Sbjct: 40 EQVRKPAIAGTWYPASPVELRKQIEGFLNRVPEPKPRGQLVALISPHAGTIYSGQVAAYG 99
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ ++ + V ++ PSH A ++TP+G +PLD ++IE L+ K
Sbjct: 100 YKLLEKQKFASVIVISPSHRARFEGVATYELGGFQTPLGIVPLDRDLIEALRRRDKRIAH 159
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
V EH++E+ LP+L V E K+VP+++G + A V +
Sbjct: 160 RPEVHSEEHALEIQLPFLQTVLEE--FKLVPLIMGEQDFATCKRLAEAIADTVREKRVLV 217
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
SSD H + Y++ +ALDK+ D + DP
Sbjct: 218 IASSDLSH------FHPYERA--------KALDKVAADRVGALDPQGL 251
>gi|344941525|ref|ZP_08780813.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methylobacter tundripaludum SV96]
gi|344262717|gb|EGW22988.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methylobacter tundripaludum SV96]
Length = 259
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 122/264 (46%), Gaps = 31/264 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
R+ + AGS+Y N ++L LD +L +A K P + +IAPHAGY YSG AA A+
Sbjct: 3 RQPAVAGSFYPANPEQLHLMLDQFLNDAATNEKVP--KAIIAPHAGYIYSGPIAASAYAR 60
Query: 64 IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ I+RV L+GPSH A+S + + TP+G +P+D ++ L E ++
Sbjct: 61 LKKAHDKITRVVLIGPSHRVAFRGLAVSRSETFTTPLGSVPVDQAAVQTLVQLPFVEYIE 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EHS+E+HLP+L ++ + +IVPI+ G A E + L A + D
Sbjct: 121 QA-HTYEHSLEVHLPFLQEMLDS--FEIVPIVAG--EASPEQVSQVLDALWGGD-ETLIV 174
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSD H+ DK ++IE L +Y + CG
Sbjct: 175 ISSDLSHYHDYATAQQMDKS---TSQAIEQL-----------------QYERLAPESACG 214
Query: 242 RHPISVFLHMLGNCSTKIKIKFLR 265
+ P+S L + S + LR
Sbjct: 215 KVPVSGLLKLARKKSLSVTTIDLR 238
>gi|114778372|ref|ZP_01453219.1| hypothetical protein SPV1_08141 [Mariprofundus ferrooxydans PV-1]
gi|114551335|gb|EAU53892.1| hypothetical protein SPV1_08141 [Mariprofundus ferrooxydans PV-1]
Length = 256
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/291 (28%), Positives = 134/291 (46%), Gaps = 45/291 (15%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R A+ AG +Y + L + + L + + SP R ++APHAGY YSG AAYA+
Sbjct: 2 SVRPAAVAGLFYPADRAELCDMIARELAQVDV-VSPCPRALVAPHAGYIYSGPTAAYAYA 60
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ + +SRV + GP+H A+++++ ++TP+GD+PL + EL +D
Sbjct: 61 LLSKSPVSRVVIAGPAHRVAFRGVAVASSSAFRTPLGDVPLAADG-RELAERFDCVAIDD 119
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYGRLFAKYVDDPS 177
EHS+E+ LP+L V +++P+ VGAV A E+++G D
Sbjct: 120 GAHAMEHSLEVQLPFLQTVLGS--FELLPLCVGAVEASELADVLESLWG--------DAQ 169
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA--LDKMGMDIIETGDPDAFKKYLLEY 235
F +SSD H+ H EA +D +D I G +++
Sbjct: 170 MLFVISSDLSHY----------------HPYAEARRMDLTTIDRILEGQS-------IDH 206
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ CG I+ +++L ++ + L Y S RD V YAS A
Sbjct: 207 EQA-CGATGINA-MNILAK-QHGLQAQLLDYRNSGDTAGDRDRVVGYASLA 254
>gi|119872612|ref|YP_930619.1| hypothetical protein Pisl_1105 [Pyrobaculum islandicum DSM 4184]
gi|166228883|sp|A1RTJ4.1|Y1105_PYRIL RecName: Full=MEMO1 family protein Pisl_1105
gi|119674020|gb|ABL88276.1| protein of unknown function DUF52 [Pyrobaculum islandicum DSM 4184]
Length = 281
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 137/295 (46%), Gaps = 33/295 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-------RGVIAPHAGYSYSGR 55
++R+ + AG +Y + ++L ++L+ ++ PK+P++ G +APHAGY YSG
Sbjct: 2 RVRKPAVAGYFYESDREKLLQQLEWAIKHELGPKAPQIPKLGAETLGGVAPHAGYIYSGP 61
Query: 56 AAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + + VF++ GP+H+ A+ + V++TP+G + +D E+ E++
Sbjct: 62 VAAWVYSALAGFGKPDVFIIIGPNHYGIGAPVAIMKSGVWETPLGRVEIDGELAEKIMRY 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
K D EHS+E+ +P++ F +KIVPI + G+ A +
Sbjct: 122 FKELEDDFHAFSREHSIEVQIPFIQYFFGD--VKIVPITIWRQTLSTSRELGKALANAIR 179
Query: 175 DPSN--FFSVSSDFCHWGSRFNYMHYD---KKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ + SSDF H Y H+D KK + I LD+ G+ + +
Sbjct: 180 EYGRRAYIIASSDFNH------YEHHDITTKKDEMAIGKILQLDETGLFEVAS------- 226
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
++D +ICG PI + K + L++ S R+ +V YAS
Sbjct: 227 ----KFDISICGIGPIGALIVAAKELGYK-NVTLLKHATSGDTSGYREETVGYAS 276
>gi|397779743|ref|YP_006544216.1| hypothetical protein BN140_0577 [Methanoculleus bourgensis MS2]
gi|396938245|emb|CCJ35500.1| UPF0103 protein [Methanoculleus bourgensis MS2]
Length = 262
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 132/293 (45%), Gaps = 45/293 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
+R S AG +Y + L + L+ + ++ K+P + RG++APHAGY YSG A AF
Sbjct: 3 MRPCSVAGMFYPAEPRHLEQLLETFFQK----KAPGIDARGIVAPHAGYVYSGETGACAF 58
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I P ++GPSH Y CA +A ++TP+G + +D E ++ L D
Sbjct: 59 STIPPDFDGTFVVIGPSHRGYM-TCA--SAVPWETPLGIVDVDTEFVDAL---------D 106
Query: 122 ICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
I +DEA EHS+E+ +P + F +I P+++G + E A + ++
Sbjct: 107 IEIDEASHRNEHSIEVQVPIIKYRFP--RARIAPVMMGDQSYEAAASLAEHLLRAIEHTG 164
Query: 178 NFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
+ SSDF H+ V ++ D +D ++T D F + L E
Sbjct: 165 RNVRIVASSDFSHY--------------VPDETARRQDLYAIDALKTLDIPEFYRRLRET 210
Query: 236 DNTICGRHPISVFLHMLGNCST--KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
T CG PI+ M C T + + LRY S + V YA+ A
Sbjct: 211 GATACGYGPIAA---MCITCRTLGATRGELLRYTTSGDVTGDFNQVVGYAAIA 260
>gi|384440528|ref|YP_005655252.1| hypothetical protein [Thermus sp. CCB_US3_UF1]
gi|359291661|gb|AEV17178.1| hypothetical protein TCCBUS3UF1_21410 [Thermus sp. CCB_US3_UF1]
Length = 263
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 127/290 (43%), Gaps = 34/290 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+ +R + AG +Y + L +E+ L EA P P +RG++ PHAGY YSGR AA
Sbjct: 1 MDLVRPPAVAGYFYPQEGRALEQEVARLLGEAASPPGPRIRGLLLPHAGYRYSGRVAAKG 60
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
F + P R LLGPSH A + TP+G++ +DLE L G
Sbjct: 61 FAALLPWRGRARRAILLGPSHFLAFRGVAFYPYRAWATPLGEVAVDLEGGRRLLRLGPPF 120
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
E + ++ EHS+E+ LP+L V H I+P+L G V+ EA G L P
Sbjct: 121 REYQEPFLE--EHSLEVLLPFLQAVLP-H-TPILPLLFGEVD-PGEAARGLL---ETLGP 172
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
+ SSD H+ D K + +++ ALD G+ E
Sbjct: 173 EDLVLASSDLSHYHPDPMARTLDAK--TLERAL-ALDAQGVAGAEA-------------- 215
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR P + L +L + + + L Y S++ V YA+ A
Sbjct: 216 ---CGRLPWAT-LTLLAR-AQGWRAQLLAYATSAEASGDAGRVVGYAAVA 260
>gi|90413323|ref|ZP_01221317.1| hypothetical protein P3TCK_17149 [Photobacterium profundum 3TCK]
gi|90325724|gb|EAS42187.1| hypothetical protein P3TCK_17149 [Photobacterium profundum 3TCK]
Length = 260
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 92/185 (49%), Gaps = 7/185 (3%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R + AG +Y ++ L ++L+ W + +R +I PHAGY YSG AA A+ +
Sbjct: 4 RPPAMAGRFYEKSASLLQQQLNDWCSPP-VTHRGVIRALIVPHAGYIYSGEVAAKAYCQL 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
+I +V L+GPSH Y CA+ + + TP+G++ +D + I+ L ++ D V
Sbjct: 63 QAKNIKKVILIGPSHRYAFHGCAVPNSRYFSTPLGNVSIDTQSIDNLIKIDDIKVSDQ-V 121
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
EH +E+ LP+L H ++P+L +++ N A R+ +SS
Sbjct: 122 HAQEHCLEVQLPFLQTCL--HHFTVLPLLTSDISSINVA---RIINDLWQQEDTLLVISS 176
Query: 185 DFCHW 189
D H+
Sbjct: 177 DLSHF 181
>gi|294139598|ref|YP_003555576.1| hypothetical protein SVI_0827 [Shewanella violacea DSS12]
gi|293326067|dbj|BAJ00798.1| conserved hypothetical protein [Shewanella violacea DSS12]
Length = 271
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/196 (34%), Positives = 100/196 (51%), Gaps = 13/196 (6%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-----RGVIAPHAGYSYSGR 55
+ IR + AG +Y + L+ EL ++ +A + +V + +I PHAGY YSG
Sbjct: 2 LSSIRPPAVAGLFYPADPITLSTELTSYMDQAVQAQHLQVSNKAPKAIIVPHAGYIYSGL 61
Query: 56 AAAYAFGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AAY F +I I RV LLGP+H Y CAL ++ + TP+GDLP+D +L A
Sbjct: 62 VAAYGFFHIQRYKHKIKRVVLLGPAHTVYLQGCALPESSHFCTPLGDLPIDRAGAAKLAA 121
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+ +I E EHS+E+ LP+L + +++PILVG + E M L +
Sbjct: 122 DDAVTISNIPHKE-EHSLEVELPFLQHCLDE--FELLPILVGEIAP--EPMASLLNLVWG 176
Query: 174 DDPSNFFSVSSDFCHW 189
D +SSD H+
Sbjct: 177 GD-ETLIVISSDLSHF 191
>gi|21227863|ref|NP_633785.1| hypothetical protein MM_1761 [Methanosarcina mazei Go1]
gi|23822345|sp|Q8PW40.1|Y1761_METMA RecName: Full=MEMO1 family protein MM_1761
gi|20906277|gb|AAM31457.1| conserved protein [Methanosarcina mazei Go1]
Length = 267
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/288 (27%), Positives = 128/288 (44%), Gaps = 32/288 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAF 61
++R+ + AG +Y + L EL GL + EV G + PHAGY YSG+ AA+ +
Sbjct: 2 EMRQPAVAGQFYPLRCENLENELKRCFE--GLEIREQEVLGAVCPHAGYMYSGKVAAHVY 59
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ +F GP+H Y ++S T +KTP+G++ +DLE+ + G D
Sbjct: 60 ATLPEADTYVIF--GPNHTGYGSPVSVSRET-WKTPLGNIDVDLELADGF--LGSIVDAD 114
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
+ EHS+E+ LP+L FE KI+PI +G + E G L A + +
Sbjct: 115 ELGHKYEHSIEVQLPFLQYRFERDF-KILPICMGMQDEETAVEVGNLLADLISESGKRAV 173
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSDF H+ + + + +D +D I D L + ++
Sbjct: 174 IIASSDFTHYET--------------AERAKEIDSEVIDSILNFDISGMYDRLYRRNASV 219
Query: 240 CGRHPISVFLHMLGNCSTKI---KIKFLRYEQSSQCKTKRDSSVSYAS 284
CG PI+ L S K+ + L+Y S +D+ V YA+
Sbjct: 220 CGYGPITAML----TASKKLGGSRATLLKYANSGDVSGDKDAVVGYAA 263
>gi|319955906|ref|YP_004167169.1| hypothetical protein Nitsa_0147 [Nitratifractor salsuginis DSM
16511]
gi|319418310|gb|ADV45420.1| protein of unknown function DUF52 [Nitratifractor salsuginis DSM
16511]
Length = 266
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/254 (31%), Positives = 119/254 (46%), Gaps = 28/254 (11%)
Query: 4 IRRASHAGSWYTDNS---KRLAEELDGWLREAGLPKS---PEVRGVIAPHAGYSYSGRAA 57
IR A + G++Y N +++ E +G L EA KS + R +I PHAGY YSG A
Sbjct: 2 IRLAGNMGAFYPANCSEIEKMIEHWNGILDEALTDKSVLDEKPRAIIVPHAGYIYSGFTA 61
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A + + RV ++GPSHH Y + S Y+TP G+LP+D IEEL
Sbjct: 62 NIAHRILANSRPKRVVVIGPSHHVYFEGVSASMQESYQTPCGNLPIDKAYIEELTKEYPL 121
Query: 118 ELMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
E V EA EHS E +P++ +E K+V ++ G ++ L +
Sbjct: 122 EF----VPEAHHKEHSTETQMPFIQH-YEPQA-KVVELIYGKIDYRQLV---PLIDTILS 172
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDK--KHGVIHKSIEALDK-------MGMD-IIETGD 224
DP N +SSD H+ + D GV +S+E L+K +G+ I+E
Sbjct: 173 DPENVVVISSDLSHFYTLEEAKQLDNICLAGVAEESVEILEKGCEACGIIGIKAIVEVAK 232
Query: 225 PDAFKKYLLEYDNT 238
+K LL+Y +
Sbjct: 233 KHGWKTQLLDYRTS 246
>gi|384263323|ref|YP_005418511.1| hypothetical protein RSPPHO_02915 [Rhodospirillum photometricum DSM
122]
gi|378404425|emb|CCG09541.1| Putative uncharacterized protein [Rhodospirillum photometricum DSM
122]
Length = 272
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/289 (29%), Positives = 126/289 (43%), Gaps = 36/289 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +R + AG +Y D LA +D L EA +P + +IAPHAGY YSG AA A
Sbjct: 1 MLPLRPTAVAGIFYPDAPTALARAVDDLLAEAPGSDAPAPKALIAPHAGYVYSGPTAAAA 60
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA---TG 115
+ + P +I RV LLGPSH AL+ A Y +P+G + +D + L+A G
Sbjct: 61 YARLAPGRDTIERVVLLGPSHRVPFRGLALTEAQGYASPLGPVAIDHAWADRLRALPYAG 120
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
E + EH++E+HLP+L ++ +VP++ G V+ A A+
Sbjct: 121 TLEDAHL----PEHALEVHLPFLHRMLT--RFTLVPVVCGQVSVAQMAAA---LAELWGG 171
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P VSSD H Y+ YD+ ALD IE D +A
Sbjct: 172 PETLIVVSSDLSH------YLPYDQA--------RALDDATAQTIEALDAEALGP----- 212
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG P++ L + +++ LR S R+ V Y +
Sbjct: 213 -REACGHIPVAGLLALARAHGGRLERLDLR--SSGDTAGSRERVVGYGA 258
>gi|377575424|ref|ZP_09804418.1| hypothetical protein MOPEL_130_01900 [Mobilicoccus pelagius NBRC
104925]
gi|377536001|dbj|GAB49583.1| hypothetical protein MOPEL_130_01900 [Mobilicoccus pelagius NBRC
104925]
Length = 283
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/223 (34%), Positives = 113/223 (50%), Gaps = 23/223 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPE-----VRGVIAPHAGYSYSGRA 56
+R + AG +Y D+ L +DG L +A +P +PE VR ++APHAGY YSG
Sbjct: 1 MRTPAVAGLFYDDDPDDLRASVDGLLDDARRRVPPAPEEDPATVRALLAPHAGYVYSGPT 60
Query: 57 AAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA + ++ ISRV LLGP H A A + TP G++P++ + EL+A+
Sbjct: 61 AAAGYLRVEACRDRISRVVLLGPVHRVPVRGLAHPEAAAFATPWGEVPVE-PLSPELRAS 119
Query: 115 GKFELMDICVDEA-EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+L+D + A EHS+E+H+P+L +V G ++P+ VG V ++ A
Sbjct: 120 FP-QLLDSRLAHAQEHSLEVHVPFLQRVL-GEF-TLLPLAVGGVGSKAVADVLDAVTSGA 176
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV---IHKSIEALD 213
DD + VSSD H Y+ YD V IEALD
Sbjct: 177 DDEATMVVVSSDLSH------YLPYDDARRVDAGTLAQIEALD 213
>gi|262198838|ref|YP_003270047.1| hypothetical protein [Haliangium ochraceum DSM 14365]
gi|262082185|gb|ACY18154.1| protein of unknown function DUF52 [Haliangium ochraceum DSM 14365]
Length = 275
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 124/264 (46%), Gaps = 25/264 (9%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYA 60
E IR + AG +Y ++ L+ +++ +L+ AG P P V V+ PHAG YSG+ A
Sbjct: 6 ETIREPAVAGRFYPHDAGELSTQVESFLQPAGFAPPQPAV-AVMVPHAGLVYSGKVAGQV 64
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
F ++ RV +L P+H P+ ++ + ++ P G++ +D + EL A
Sbjct: 65 FAEVEIP--ERVIILAPNHTGRGPQVSVMASGQWRMPGGEVAVDERLAAELLAAYPRAQA 122
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN-- 178
D EH++E+ LP+L G ++IVP+++G ++ G+ A + +
Sbjct: 123 DTEAHAGEHAIEVELPFLVARRPG--VRIVPVVLGMLSELEAVHLGQALATAIAAVGDEV 180
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSD H+ S + +D++ ++ + G+P+ + + E+ +
Sbjct: 181 LVVASSDMSHFLS--------------EQQARRIDRVALEPLLEGEPEGLYRTVQEHRIS 226
Query: 239 ICGRHPISVFL---HMLGNCSTKI 259
+CG P +V L LG + ++
Sbjct: 227 MCGFIPATVMLAYARQLGRGTPRL 250
>gi|157960725|ref|YP_001500759.1| hypothetical protein Spea_0897 [Shewanella pealeana ATCC 700345]
gi|157845725|gb|ABV86224.1| protein of unknown function DUF52 [Shewanella pealeana ATCC 700345]
Length = 278
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 134/303 (44%), Gaps = 52/303 (17%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG--LPKS-----------PEVRGVIAPHAGY 50
+R A+ AG +Y ++ L LDG+L +A L + P + +I PHAGY
Sbjct: 5 VRPAAVAGLFYPASADELKAALDGYLTQASSILARKTDCQNSQDESYPSPKVLIVPHAGY 64
Query: 51 SYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
YSG+ AAYA+ I P +I +V L+GP+H Y AL + ++TP+G +P+ + +
Sbjct: 65 LYSGQVAAYAYALIQPLADTIKKVLLIGPAHRVYLQGGALPLSRYFETPLGQIPIAPDSV 124
Query: 109 EELKATGKFELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM 164
E L IC+ E EHS+E+ LP+L + +++P+L+G + A+
Sbjct: 125 EILGCQ-----QCICISELAHQQEHSLEVQLPFLQHFLKE--FELLPLLIGESEPKEMAL 177
Query: 165 YGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
L + VS+D H + YD+ LD++ I G
Sbjct: 178 ---LLEQVWGGNETLIVVSTDLSH------FHQYDEAL--------RLDRVTCQKILAGQ 220
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ CG ++ L +L ++++ L Y+ S ++ V YAS
Sbjct: 221 GTILSE-------QACGCSSLNALLPLLKR--HQLRLTQLSYQNSGDTAGDKNRVVGYAS 271
Query: 285 AAA 287
A+
Sbjct: 272 FAS 274
>gi|384171487|ref|YP_005552864.1| hypothetical protein [Arcobacter sp. L]
gi|345471097|dbj|BAK72547.1| conserved hypothetical protein [Arcobacter sp. L]
Length = 263
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 103/190 (54%), Gaps = 11/190 (5%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVRGVIAPHAGYSYSGRAA--AYA 60
IR+A +GS+Y + + + ++ + + + +++ +I PHAGY YSG A AY
Sbjct: 3 IRKAVVSGSFYPQKKEEILKYINHFNSVKTNVESFEDIKAIIVPHAGYIYSGFTANLAYK 62
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I RV +LGPSH Y +++ Y+TP+G+L +D E ++L K++ +
Sbjct: 63 LVSFSKKDIKRVVVLGPSHRVYLKGASVALYDEYETPLGNLKVDKEFSQKL--IDKYDFL 120
Query: 121 DICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
D ++ E EHS E P++ FE +++V I+ G ++ ++ ++ + + D SNF
Sbjct: 121 DFNIECEFEHSTETQAPFIKHYFEN--VQLVEIVYGQIDYKD---LSKVIDEVLKDKSNF 175
Query: 180 FSVSSDFCHW 189
+S+D H+
Sbjct: 176 VVISTDLSHF 185
>gi|404496814|ref|YP_006720920.1| MEMO-like protein [Geobacter metallireducens GS-15]
gi|418065236|ref|ZP_12702610.1| protein of unknown function DUF52 [Geobacter metallireducens RCH3]
gi|78194419|gb|ABB32186.1| MEMO-like protein [Geobacter metallireducens GS-15]
gi|373562414|gb|EHP88625.1| protein of unknown function DUF52 [Geobacter metallireducens RCH3]
Length = 267
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 126/284 (44%), Gaps = 23/284 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ + AG +YTD KRL +L + + + V+G++APHAGY YSG A +G I
Sbjct: 3 RQPAVAGQFYTDEPKRLRGDLQRMIPAETMKQ--RVKGIVAPHAGYVYSGHVAGAVYGAI 60
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
+ S V +LGP+HH + AL + TP+G +P+D+ + + ++ ++
Sbjct: 61 EIPST--VVILGPNHHGFGAAAALYPEGEWLTPLGSVPIDVRLSQLVQKHVPLVEPEMSA 118
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFSV 182
EHS+E+ +P+L + IVPI +G + E + G A+ + +
Sbjct: 119 HRFEHSLEVQVPFLQYCRAD--VSIVPICLGFGDYERCRLLGEGIARAIREFGEDVLIVA 176
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSD H+ S + A D+ + + +P+ + + T+CG
Sbjct: 177 SSDMTHYES--------------ADAARAKDEQAIAEVLALNPEGLVQICRRKNITMCGV 222
Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
P +V L + K + + Y S V+YA+ A
Sbjct: 223 IPATVML-VTAKALGASKAELISYATSGDVTGDNRQVVAYAAMA 265
>gi|74317604|ref|YP_315344.1| hypothetical protein Tbd_1586 [Thiobacillus denitrificans ATCC
25259]
gi|74057099|gb|AAZ97539.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
25259]
Length = 464
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 99/194 (51%), Gaps = 13/194 (6%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLRE--AGLPK-SPEVRGVIAPHAGYSYSGRAA 57
M +R A+ AG +Y D+ L + LD L E AG + +P + VI PHAGY YSG A
Sbjct: 1 MPTVRPAAVAGLFYPDDPVVLTQTLDALLAEGEAGAERLTP--KAVIVPHAGYIYSGPIA 58
Query: 58 AYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A A+ + I RV LLGP+H + AL A + TP+G++ +D E IE L A
Sbjct: 59 AAAYARLGDLKGRIRRVVLLGPAHRVFVRGLALPEAERFVTPLGEVTIDREAIELLAALP 118
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
+ D + EHS+E+ LP+L +V +VP+ VG AE A L A + D
Sbjct: 119 QVTRSD-AAHQMEHSLEVQLPFLQRVLGD--FTLVPLAVGQARAEEVAEV--LDALWGGD 173
Query: 176 PSNFFSVSSDFCHW 189
+SSD H+
Sbjct: 174 -ETLIVISSDLSHF 186
>gi|13542189|ref|NP_111877.1| hypothetical protein TVN1358 [Thermoplasma volcanium GSS1]
gi|23822367|sp|Q978N2.1|Y1383_THEVO RecName: Full=MEMO1 family protein TV1383
gi|14325622|dbj|BAB60525.1| hypothetical protein [Thermoplasma volcanium GSS1]
Length = 269
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 74/289 (25%), Positives = 135/289 (46%), Gaps = 30/289 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGW-LREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR+ + AG +Y N L + + ++++ + P GV+ PHAG YSGR A Y++
Sbjct: 2 IRKPAVAGYFYPSNRNELLSLISSFHVQQSEVSCQP--IGVVVPHAGIVYSGRTAMYSYN 59
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ +SI ++GP+H TP ++ + ++TP+G+ ++ E+ EL ++ + D
Sbjct: 60 ALRNSSIRDFIIIGPNHRPMTPYASIFPSGSWETPLGNAIINEELASELYKNSQYIVKDE 119
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRLFAKYVDDPSN 178
EHS+E+ +P+L +F G+ VP+++G V + + RL Y+
Sbjct: 120 ESHSVEHSIEVQIPFLQYMF-GNSFTFVPVILGDQEKVVANDIASALMRLSKPYI----- 173
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSDF HY++ V K ++ + ++ +D+ D D F + + T
Sbjct: 174 -LIASSDFT---------HYERSDIVERKDMDLISRI-VDL----DIDGFYDTIERENVT 218
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
CG I++ + + KI + L + S D V Y+S A
Sbjct: 219 ACGYGAIAILMIIAKKIGAKISL--LNHSNSGDVTNDYDEVVGYSSIVA 265
>gi|71907600|ref|YP_285187.1| hypothetical protein Daro_1971 [Dechloromonas aromatica RCB]
gi|71847221|gb|AAZ46717.1| Protein of unknown function DUF52 [Dechloromonas aromatica RCB]
Length = 267
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 95/191 (49%), Gaps = 8/191 (4%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M R + AG++Y + + L+ +D +L +A + + +I PHAGY YSG AA A
Sbjct: 1 MINTRPPAVAGTFYPGDPETLSTTVDRFLADASPQSGQQPKALIVPHAGYIYSGSTAANA 60
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + P +I RV LLGP+H AL + TP+G + LD + I + +
Sbjct: 61 YATLKPWAKTIRRVILLGPTHRVAVEGIALPEVEAFSTPLGTIRLDAQAIASIAGLPQIV 120
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ V EHS+E+HLP+L ++ E +VP+ VG +A +A+ + P
Sbjct: 121 FSN-HVHAFEHSLEVHLPFLQRILEQ--FTLVPLAVG--DAAPDAV-AEILDLLWGGPET 174
Query: 179 FFSVSSDFCHW 189
VSSD H+
Sbjct: 175 LIVVSSDLSHF 185
>gi|305663860|ref|YP_003860148.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
gi|304378429|gb|ADM28268.1| protein of unknown function DUF52 [Ignisphaera aggregans DSM 17230]
Length = 299
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 122/255 (47%), Gaps = 21/255 (8%)
Query: 40 VRGVIAPHAGYSYSGRAAAYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYK-- 95
+ G + PHAGY YSG AA+ + + +I V ++G +H + K ++T T YK
Sbjct: 51 IIGGVVPHAGYGYSGFCAAWLYKELGEHIDNIDVVVIMGTNHTGFGGK--ITTTTYYKRW 108
Query: 96 -TPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV 154
TP+G++ +D+ I LK D EHS+E+ LP+L ++ G+ +VPI+V
Sbjct: 109 STPLGEIDVDINFINRLKNYYPNLDDDALAHTREHSVEVQLPFLQYLY-GNKFSLVPIVV 167
Query: 155 GAVNAENEAMYGRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEAL 212
+ E + + K +++ F V SSDF H G + Y + + I ++ L
Sbjct: 168 RDITEEEATTFSKALNKAINEDDRRFVVLASSDFTHHGDLYGYTIFYEN---IAANVRKL 224
Query: 213 DKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI---KIKFLRYEQS 269
D M ++ I D F + +Y+ T+CG I++ + +KI + L+Y S
Sbjct: 225 DLMFIEAIVNLDTRRFLSLIDKYNATVCGYGAIAILMEY-----SKINGARASLLKYYHS 279
Query: 270 SQCKTKRDSSVSYAS 284
+ D V YAS
Sbjct: 280 ADITGDEDIVVGYAS 294
>gi|88801036|ref|ZP_01116585.1| hypothetical protein MED297_05099 [Reinekea blandensis MED297]
gi|88776239|gb|EAR07465.1| hypothetical protein MED297_05099 [Reinekea sp. MED297]
Length = 261
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 88/162 (54%), Gaps = 10/162 (6%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGL---PKSPEVRGVIAPHAGYSYSGRAAAYA 60
IR + AG+ Y +++RL +++ WL A + +P+V +IAPH+G+ YSG +AA A
Sbjct: 3 IRTPAVAGTLYEADTQRLLVQMENWLESAPVKTTQSTPKV--LIAPHSGFHYSGESAARA 60
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ ++ I RV LLGP+H L V+ TP+G +PLD + L+
Sbjct: 61 YQTLNAVYDRIRRVILLGPAHRTTVDHLVLPEDDVFATPLGQVPLDKTAVNWLRRQPGV- 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
+ D + EHS+EM LP+L E +VPI+VG V+ +
Sbjct: 120 ITDNTLHAPEHSLEMQLPFLQTALEDFF--LVPIIVGQVDPD 159
>gi|116753790|ref|YP_842908.1| hypothetical protein Mthe_0477 [Methanosaeta thermophila PT]
gi|116665241|gb|ABK14268.1| protein of unknown function DUF52 [Methanosaeta thermophila PT]
Length = 269
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 130/284 (45%), Gaps = 22/284 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
+R + AG +Y ++ L EL R GL + P V G + PHAGY YSG AA +
Sbjct: 1 MRPPAAAGMFYPARAEALLAELSSAFR--GLKREPMPVIGAVVPHAGYIYSGYVAAEVYA 58
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ P + V L+GP+H A+S T ++TP+G +P+DLE+ + L+ G D
Sbjct: 59 RL-PERETYV-LIGPNHTGLGLPVAVSRET-WRTPLGSVPVDLELADALE--GTIVASDE 113
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
EHS+E+ LP+L + F G +I+PI +G + E G A + +V
Sbjct: 114 MAHMEEHSIEVQLPFLQRCFSG--FRILPICMGMQDQETAIEVGEAVANAILKLGRSCTV 171
Query: 183 --SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSD H Y+ Y+ + +E++ +M +D + + +Y Y + +C
Sbjct: 172 IASSDLTH------YLRYETARQLDTSILESVIRMDIDDLYSKIEMLDSRY---YGHGVC 222
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
G P++ + + K LRY S R V Y +
Sbjct: 223 GYGPMAATIAASRRLGASMG-KLLRYATSGDVSGDRGQVVGYGA 265
>gi|336476945|ref|YP_004616086.1| hypothetical protein [Methanosalsum zhilinae DSM 4017]
gi|335930326|gb|AEH60867.1| protein of unknown function DUF52 [Methanosalsum zhilinae DSM 4017]
Length = 265
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 128/283 (45%), Gaps = 24/283 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R AG +Y N K L +E+ + + S ++ G +APHAGYS+SG AA+ +
Sbjct: 1 MREPVVAGQFYPSNPKELKKEIRKCFKGIEV-SSHDLIGAVAPHAGYSFSGHVAAHVYAR 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I S ++GP+H Y + S A + TP+G++ D ++I+ L G +D
Sbjct: 60 I--PSADTYIIIGPNHTGYGSPISAS-ADTWSTPLGEIRTDGDLIKAL--AGSIIDVDEL 114
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV- 182
EHS+E+ LP+L + H +I+PI +G + E + G + V D +
Sbjct: 115 AHRFEHSIEVQLPFLQYTLD-HDFEILPICMGLQDEETASEVGHEIVRAVKDAGKKVVII 173
Query: 183 -SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSDF H+ S + YD + IEA++KM D + ++E + + CG
Sbjct: 174 ASSDFTHYQSA--DIAYDTDQYI----IEAIEKM--------DVSEIYRRIMEKNASACG 219
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
PI+ + K K L Y S S V Y +
Sbjct: 220 FGPIAAMI-TAAKLLDATKAKLLTYSTSGDVIGDMSSVVGYGA 261
>gi|381167018|ref|ZP_09876230.1| Predicted dioxygenase [Phaeospirillum molischianum DSM 120]
gi|380683833|emb|CCG41042.1| Predicted dioxygenase [Phaeospirillum molischianum DSM 120]
Length = 457
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 76/288 (26%), Positives = 121/288 (42%), Gaps = 29/288 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IR + AG +Y + + ++ L A ++ + +I PHAG+ YSG AA A
Sbjct: 1 MTAIRLPAFAGQFYPSDPIAIRHQIATMLATAPHARATTPKAIIVPHAGWVYSGPIAAAA 60
Query: 61 FGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + P + RV LLGPSH AL A + +P+G + LD + L
Sbjct: 61 YAQLVPARGRVRRVVLLGPSHRIALRGLALCDAEAWASPLGQVLLDRDSALRLTTLPDHS 120
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ D + EH++E+ LP+L V G +++PILVG +AE + D
Sbjct: 121 ISDPAHAQ-EHALEVQLPFLQAVL-GDDFRLLPILVGEASAE---IVAATLDTVWGDSET 175
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
VS+D H+ +Y ++ +IE D + + ++
Sbjct: 176 LIVVSTDLSHY---LDYESCRRQDSGTAAAIERFDGTELAL-----------------DS 215
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR+P++ L L +KI L S RD V Y + A
Sbjct: 216 ACGRNPLAGLL--LAAQRRGMKIDRLDLRNSGDTAGSRDRVVGYGAWA 261
>gi|218961087|ref|YP_001740862.1| hypothetical protein CLOAM0773 [Candidatus Cloacamonas
acidaminovorans]
gi|167729744|emb|CAO80656.1| conserved hypothetical protein [Candidatus Cloacamonas
acidaminovorans str. Evry]
Length = 272
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 124/285 (43%), Gaps = 21/285 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
IR HAG++Y +++ +++GW+ A P S E GVI PHAGY YSG A
Sbjct: 2 IRNPMHAGTFYPRFEQQIKRQIEGWISNAVTPLSSERALGVILPHAGYMYSGECATLGLH 61
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+I +I +L PSH ++S Y P+G L LD+E+ ++ + + +
Sbjct: 62 SISHENIDCFIILHPSHQANYFDFSVSPYQEYVNPLGTLDLDVELYNKIAPEAD-QNIPL 120
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAK-YVDDPSNFF 180
+ + EHSME+ LP L F KI+PI+ G + A + + L+ Y
Sbjct: 121 ILHQEEHSMEIQLPILNYFFPK--AKILPIMFGNQIPAVAKRLADILYETIYSSTKRIVI 178
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSD H+ D G++ K + AL DP+ + ++ D C
Sbjct: 179 LCSSDLSHYHRARTAEEMD---GILVKDVTAL-----------DPEHLWQDIIHSDCEAC 224
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
G I L+ T K+K ++Y S + V Y SA
Sbjct: 225 GIGGILCLLYY-AQHYTNAKMKVIQYTHSGKISGNDSQVVGYLSA 268
>gi|222056095|ref|YP_002538457.1| hypothetical protein Geob_3013 [Geobacter daltonii FRC-32]
gi|221565384|gb|ACM21356.1| protein of unknown function DUF52 [Geobacter daltonii FRC-32]
Length = 267
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 23/283 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR+ + AG +YTD+ +L +L + A P GVI+PHAGY YSG A FG
Sbjct: 2 IRQPAVAGQFYTDDPDKLRTQLKAMIAPA--PFKERALGVISPHAGYMYSGAVAGKLFGE 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ S V +LGP+HH + A+ ++TP+G P++ + +K D
Sbjct: 60 V--AVPSTVVILGPNHHGIGARAAIFPEGSWQTPLGTAPVNSSLATLIKKHSSIVEADQK 117
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
EHS+E+ LP+L F + IVP+ +G + ++ + G A + D +
Sbjct: 118 AHIFEHSLEVQLPFLQ--FLRPDVSIVPVCLGFRDFKSCLLLGSGIASAIRDYGDEVLIV 175
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H+ + + D++ + ALD G+ ++ + D T+CG
Sbjct: 176 ASSDMTHYETAASAKQKDEQA---INMVLALDAEGLLLV-----------CAQKDITMCG 221
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
P +V L + K + ++Y S V+YA+
Sbjct: 222 VVPATVML-VAAKGLGATKARLVQYATSGDVTGDNRQVVAYAA 263
>gi|171184544|ref|YP_001793463.1| hypothetical protein Tneu_0059 [Pyrobaculum neutrophilum V24Sta]
gi|170933756|gb|ACB39017.1| protein of unknown function DUF52 [Pyrobaculum neutrophilum V24Sta]
Length = 282
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 130/291 (44%), Gaps = 28/291 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-------EVRGVIAPHAGYSYSGRA 56
+R+ + AG +Y + +RL ++++ ++ PK+P E G +APHAGY YSG
Sbjct: 5 VRKPAVAGYFYEADRERLIQQIEWSIKHELGPKAPQLPKLGAEALGGVAPHAGYMYSGPV 64
Query: 57 AAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+ + + F ++GP+H+ A+ + ++TP+G + +D E+ EE+ +G
Sbjct: 65 AAWIYAALSGFGKPDAFVIVGPNHYGIGAPVAIMKSGAWETPLGRVEIDAELAEEIARSG 124
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
E D EHS+E+ +P++ F +KIVPI + G+ A +
Sbjct: 125 AAE-EDPHAFSREHSIEVQIPFIQYFFGD--VKIVPIALWRQTPSASRELGKAIAAALKK 181
Query: 176 PSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
+ SSDF H+ + H V K D+M + I D +
Sbjct: 182 RGGRIYVIASSDFNHY----------EPHDVTVKK----DEMAIGKILKLDEAGLFEVAS 227
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
++D +ICG PI V + L++ S RD +V YAS
Sbjct: 228 KFDISICGLGPIGVLITAAKELGYN-NATLLKHATSGDTSGYRDETVGYAS 277
>gi|452077450|gb|AGF93409.1| protein containing UPF0103/Mediator of ErbB2-driven cell motility
(Memo), related domain protein [uncultured organism]
Length = 283
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 102/210 (48%), Gaps = 10/210 (4%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVR-------GVIAPHAGYSY 52
M +IR + AG +Y L +L+ + E G K PEV+ G+I+PHAGY Y
Sbjct: 1 MPEIRNPAVAGRFYRGTKTGLTNQLEECFKHECGPGKIPEVQEGPRRMIGLISPHAGYPY 60
Query: 53 SGRAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
SG AA+ F + + V ++GP+H P ++ ++ +++TP+G + L+ E+ E +
Sbjct: 61 SGPVAAHGFSKLAEDGKPDSVVIIGPNHSGVGPDVSVDSSDLWETPLGQIDLNKEIRERI 120
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
+ +D EHS+E+ LP+L +F+ IVPI + A NAE G+
Sbjct: 121 LEESEVAELDSSSHLNEHSLEVQLPFLQYLFDEDF-DIVPICMKAQNAETSEDIGKAIQA 179
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKK 201
+ S+D H+ + D+K
Sbjct: 180 AANGEDILIIASTDLTHYEPQQMAETKDRK 209
>gi|397689697|ref|YP_006526951.1| dioxygenase [Melioribacter roseus P3M]
gi|395811189|gb|AFN73938.1| Putative dioxygenase [Melioribacter roseus P3M]
Length = 271
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 105/195 (53%), Gaps = 8/195 (4%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAY 59
M+K+R+ + AG +Y + LA+E++ L AG ++PE + G++ PHAGY YSG+ AA
Sbjct: 1 MKKVRQPAVAGMFYPSLPEELADEINEMLDNAGTEEAPENLYGIVVPHAGYIYSGQTAAK 60
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ + V ++ PSH Y P ++ Y TP+G++ +D E+ + + K
Sbjct: 61 AYKLLKDKKYKTVVVISPSHREYFPGISVFDGDAYLTPLGEIEIDSEMRDRIAEGSKIIF 120
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD--DPS 177
+ +EH++E+ LP+L +V G K++P+++G +++ L + D D +
Sbjct: 121 KGVNGHRSEHALEVQLPFL-QVALGDF-KLLPVVMG---DQSKLFVDELSERLSDVVDEN 175
Query: 178 NFFSVSSDFCHWGSR 192
SSD H+ S+
Sbjct: 176 TLIVSSSDLSHYYSK 190
>gi|206901742|ref|YP_002251218.1| hypothetical protein DICTH_1399 [Dictyoglomus thermophilum H-6-12]
gi|206740845|gb|ACI19903.1| conserved protein [Dictyoglomus thermophilum H-6-12]
Length = 269
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 129/283 (45%), Gaps = 23/283 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K R + AG +Y S+ L L +++E +P + G I PHAGY YSG+ A +
Sbjct: 2 KKRLPAVAGYFYPATSRELLNMLSQFIKE--VPNKVPLWGCIVPHAGYIYSGKVAGAVYS 59
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++ ++ ++GP+H+ A+ + ++ TP+GD+ +D + +++ + D
Sbjct: 60 LMEVPDVA--IIIGPNHNGLGYPSAIYSEGIWTTPLGDVEIDSVLSQKILQESDYLKEDA 117
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFF 180
+ EHS+E+ +P+L + IKIVPI V + E G ++ V D +
Sbjct: 118 LAHQPEHSLEVQVPFLQYINPN--IKIVPITVVDYSYEVLKNLGYAISQAVRDYEYNTVI 175
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSDF H+ H+ + D ++ I D F K ++E D +IC
Sbjct: 176 IASSDFSHYEP--------------HEIAKQKDLYAIESILNLDEKEFIKRVIEKDISIC 221
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
G PI + + + +K + K + Y+ S D V YA
Sbjct: 222 GVGPIVITMIAVKLLGSK-RAKLVEYQTSGNVTRDFDQVVGYA 263
>gi|153004135|ref|YP_001378460.1| hypothetical protein Anae109_1268 [Anaeromyxobacter sp. Fw109-5]
gi|152027708|gb|ABS25476.1| protein of unknown function DUF52 [Anaeromyxobacter sp. Fw109-5]
Length = 265
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 123/284 (43%), Gaps = 23/284 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR + AG +Y + LA E+D +L G + RGV+ PHAGY YSG A +
Sbjct: 2 IREPAVAGRFYPGDPTVLAGEVDRFLAAPG--ERLRARGVVVPHAGYVYSGAVAGAVYAR 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTAT-VYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++ +RV +LGP+H AL A+ ++TP+G +P+D E+ L A+ D
Sbjct: 60 VEIP--ARVLVLGPNHTGIGAAAALWPASGTWRTPLGQVPVDAELTAALSAS-PLVTADT 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
EHS+E+ +P+L + G + I + + + + GR A+
Sbjct: 117 RAHRLEHSLEVQVPFLERARPG--VSIAALCLTHLPYADCVEIGRTVAQAASAAGALVVA 174
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSD H+ D H++IE + + DP + + D ++CG
Sbjct: 175 SSDMSHYLPAPVARQKD------HRAIERILAL--------DPRGLYEVVHAEDISMCGI 220
Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
P +V L + + + +RY S + D V YA A
Sbjct: 221 VPATVML-VAARLMGATRAELVRYAHSGEVSGDEDQVVGYAGIA 263
>gi|307352919|ref|YP_003893970.1| hypothetical protein Mpet_0758 [Methanoplanus petrolearius DSM
11571]
gi|307156152|gb|ADN35532.1| protein of unknown function DUF52 [Methanoplanus petrolearius DSM
11571]
Length = 266
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 130/286 (45%), Gaps = 27/286 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RR + AG +Y L E+L + LP + G+++PHAG YSG+AAAY+F
Sbjct: 3 VRRVTLAGRFYPGEPASLKEQLAYFFSHVALPHPVDACGIVSPHAGTIYSGQAAAYSFSA 62
Query: 64 IDPTSISRVFL-LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I P+ + F+ + PSH Y P C ++ ++TP+G + D ++ L D
Sbjct: 63 I-PSDFNGTFIVVAPSHAGY-PDC--TSGLSWETPLGIIECDEALVSALDINRD----DF 114
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
+ + E+S+E+ +P++ F K+VP+L+G + + V + S+ +
Sbjct: 115 AMAQPENSLEVQMPFIKFRFPDA--KVVPVLMGRQTLQGAEAMAEAIVRTVRETSSDVRI 172
Query: 183 --SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSDF H Y+ ++ + +IEAL + D + F + + E + C
Sbjct: 173 VASSDFSH------YIPDEEARRLDGYAIEALFDL--------DSEEFFRRIREEGVSAC 218
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
G PI+V + K + L+Y S V YA+ A
Sbjct: 219 GYGPIAVMVEACRELFGAKKSELLKYMTSGDVSGDFSQVVGYAAVA 264
>gi|73668946|ref|YP_304961.1| hypothetical protein Mbar_A1422 [Methanosarcina barkeri str.
Fusaro]
gi|121722214|sp|Q46CL1.1|Y1422_METBF RecName: Full=MEMO1 family protein Mbar_A1422
gi|72396108|gb|AAZ70381.1| conserved protein [Methanosarcina barkeri str. Fusaro]
Length = 265
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 136/287 (47%), Gaps = 32/287 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R+ + AG +Y L +EL GL + + G + PHAGY YSGR AA+ +
Sbjct: 1 MRQPAVAGQFYPLRPDNLEKELKQCFE--GLEIRERNILGAVCPHAGYVYSGRVAAHVYA 58
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ P + + V L GP+H Y ++ST T +KTP+G + +D E+ E L TG +D
Sbjct: 59 -VLPKADTYV-LFGPNHTGYGSPVSVSTDT-WKTPLGIIEVDRELAEGL--TGSIVDVDE 113
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FF 180
EHS+E+ LP+L F+ +I+PI +G + E G L A V F
Sbjct: 114 IGHRYEHSIEVQLPFLQYRFDQDF-RILPICLGMQDEETVIEVGTLIANLVSKSGKKVAF 172
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSDF H Y + ++ IEA+ + +D+ P +++ L + ++C
Sbjct: 173 IASSDFTH------YQPANLARETDNEIIEAI--LNLDV-----PGIYER-LYRRNASVC 218
Query: 241 GRHPISVFLHMLGNCSTKI---KIKFLRYEQSSQCKTKRDSSVSYAS 284
G PIS L S K+ + + L Y S + ++ V YA+
Sbjct: 219 GYGPISAML----TASKKLGATRAELLNYSNSGEVSGDMNAVVGYAA 261
>gi|381153737|ref|ZP_09865606.1| putative dioxygenase [Methylomicrobium album BG8]
gi|380885709|gb|EIC31586.1| putative dioxygenase [Methylomicrobium album BG8]
Length = 260
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 129/281 (45%), Gaps = 43/281 (15%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
R+ + AG++Y ++L LD +L +A PK P + +I PHAGY YSG AA A+
Sbjct: 3 RQPAVAGTFYPAEPQQLHLLLDQYLNDADTAPKVP--KAIIVPHAGYIYSGPIAATAYAR 60
Query: 64 IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ ISRV L+GPSH A+++A + TP+G + +D + A + +D
Sbjct: 61 LLKAHDQISRVVLIGPSHRVGFHGLAITSAQNFVTPLGSIEVDQRAVH---AIAQLPFVD 117
Query: 122 ICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA-MYGRLFAKYVDDPS 177
+++A EHS+E+HLP+L ++ + KIVPI+ G AE A + G L+
Sbjct: 118 -YLEQAHVMEHSLEVHLPFLQEMLDD--FKIVPIVTGDAPAEQVAQVLGMLWGG----EE 170
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEAL--DKMGMDIIETGDPDAFKKYLLEY 235
VSSD H+ DK +IE L + +G+D
Sbjct: 171 TLIVVSSDLSHYHDYATCQELDK---ATSAAIEKLQYESLGID----------------- 210
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
+ CGR I L + + S +K LR + KR
Sbjct: 211 --SACGRVAIGGLLKLARDKSLAVKNIDLRNSGDTAGDKKR 249
>gi|126179754|ref|YP_001047719.1| hypothetical protein Memar_1811 [Methanoculleus marisnigri JR1]
gi|125862548|gb|ABN57737.1| protein of unknown function DUF52 [Methanoculleus marisnigri JR1]
Length = 262
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 129/293 (44%), Gaps = 45/293 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAAYAF 61
+R S AG +Y + L + L+ + R+ G P + RG+++PHAGY YSG A AF
Sbjct: 3 MRPCSVAGMFYPAEPRHLEQLLETFFRKRG----PGITSRGIVSPHAGYVYSGETGACAF 58
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I P ++GPSH Y ++A ++TP+G + +D E I+ MD
Sbjct: 59 STIPPDFDGTFLVIGPSHRGYMTS---ASAVPWETPLGIVDVDTEFIDA---------MD 106
Query: 122 ICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
I +DEA EHS+E+ +P++ F + P+L+G + E + V+
Sbjct: 107 IEIDEASHQSEHSIEVQMPFIKYRFP--RARAAPVLMGEQSYEAAESLAEHLLQAVEHTK 164
Query: 178 NFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
+ SSDF H+ V + D ++ ++T D F + L E
Sbjct: 165 RDVRIVASSDFSHY--------------VPDEVARKQDLYAIEALKTLDLPEFYRRLQET 210
Query: 236 DNTICGRHPISVFLHMLGNCST--KIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
T+CG PI+ M C + + + LRY S + V YA+ A
Sbjct: 211 RATVCGYGPIAT---MCIACRSLGATRGELLRYTTSGDVTEDYNQVVGYAAIA 260
>gi|56479181|ref|YP_160770.1| hypothetical protein ebA6549 [Aromatoleum aromaticum EbN1]
gi|56315224|emb|CAI09869.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1]
Length = 275
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 129/294 (43%), Gaps = 46/294 (15%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLP----KSPEVRGVIAPHAGYSYSGRAAA 58
IR A+ AG +Y D+ L ++ G + A +P +P + +I PHAGY YSG AA
Sbjct: 5 SIRPAAVAGYFYPDDPYALRAQI-GEMLAAAVPLETVSAP--KALIVPHAGYVYSGPVAA 61
Query: 59 YAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A+ N+ + RV LLGP+H AL A + TP+G +P+ E L+A
Sbjct: 62 SAYANLAGLRDVVRRVVLLGPAHRMAVRGFALPAAQFFSTPLGQVPMSRGGWETLQAR-- 119
Query: 117 FELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
D+ VD+ AEHS+E+ LP+L + ++VP+LVG NA EA+ +
Sbjct: 120 ---PDVVVDDRPHAAEHSLEVQLPFLQMALD--TFELVPLLVG--NASAEAV-AEILETL 171
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
P +SSD H+ S + ++ D+ + I T +
Sbjct: 172 WGGPETLIVISSDLSHYHS--------------YGQAQSCDRATVGQILT------LRTG 211
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+E+D CG P++ L ++ L S R V YAS A
Sbjct: 212 IEHDQA-CGATPVNGLLR--AAAQHGLEPHLLDLRNSGDTAGDRSRVVGYASIA 262
>gi|390960737|ref|YP_006424571.1| hypothetical protein CL1_0570 [Thermococcus sp. CL1]
gi|390519045|gb|AFL94777.1| hypothetical protein CL1_0570 [Thermococcus sp. CL1]
Length = 293
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 137/294 (46%), Gaps = 8/294 (2%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAY 59
M ++R + AGS+Y + + L E L+ + + G S + +APHAGY +SG A+
Sbjct: 1 MTEVRYPAVAGSFYPAD-ESLIEMLEEFFNDLGEEGSERRITAGVAPHAGYIFSGYTASR 59
Query: 60 AFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ I + F +LGP+H A+ + + TP+G++ +D E+ + +
Sbjct: 60 TYKAIFEDGLPETFVILGPNHTGLGSPIAVYPSGEWLTPLGEIEVDAEMAKAIARLSGIA 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+D EHS+E+ +P++ + E G ++IVPI +G + E GR + +
Sbjct: 120 DLDELAHRYEHSIEVQVPFIQYLAELAGKKVRIVPITLGIQDEEVSRALGRAIFEASQEL 179
Query: 177 SNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
V S+DF H+G+ + Y+ + + + I+ D + I D D + L E
Sbjct: 180 GRDVVVIASTDFMHYGAMYGYVPFRVRADELPHRIKEWDFRLVRRILDFDVDGLFRELRE 239
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
++T+CG + + + + ++ + L Y S + D+ V YAS +
Sbjct: 240 MNHTMCGPGAVGTAI-VYSRSAGALEAELLHYTTSYEVSRSTDAVVGYASIVMR 292
>gi|337283757|ref|YP_004623231.1| hypothetical protein PYCH_02600 [Pyrococcus yayanosii CH1]
gi|334899691|gb|AEH23959.1| hypothetical protein PYCH_02600 [Pyrococcus yayanosii CH1]
Length = 292
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/294 (25%), Positives = 138/294 (46%), Gaps = 14/294 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
IR + AG +Y + L + L + R+ G P ++ IAPHAGY +SG A+ +
Sbjct: 2 IRYPAVAGQFYP-LGEELVDMLKAFFRDLGEPGGERKITAGIAPHAGYVFSGYTASRTYK 60
Query: 63 NIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I + F++ GP+H A+ + TP+G++ +D E+ E+ +D
Sbjct: 61 AIYEDGLPETFVIFGPNHTGLGSPIAVYPEGKWVTPLGEVEVDGELAREIVKNSGIADLD 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAE-NEAMYGRLFAKYVDDPSN 178
+ EHS+E+ LP++ + E G ++IVPI + + E +E + G +F + +
Sbjct: 121 EIAHKYEHSIEVQLPFIQYIAERAGKTVRIVPITLSIQDEEVSEDLGGAVFEAARELGRD 180
Query: 179 FFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
+ S+DF H+G+ + Y+ + + + ++ D + I D + + + E D+
Sbjct: 181 VVVIASTDFMHYGTFYGYVPFRARANELPNLVKEWDLRVIRRILDFDVEGMFREIREMDH 240
Query: 238 TICGRHPIS---VFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
T+CG + VF ++G ++ + L Y S D+ V YAS +
Sbjct: 241 TMCGPGGVGASVVFSRLIG----AVEAELLHYTTSFDISRSTDAIVGYASIVMR 290
>gi|217970617|ref|YP_002355851.1| hypothetical protein Tmz1t_2215 [Thauera sp. MZ1T]
gi|217507944|gb|ACK54955.1| protein of unknown function DUF52 [Thauera sp. MZ1T]
Length = 274
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 125/293 (42%), Gaps = 40/293 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
+ IR A+ AG +Y + + L +L L A +P + +I PHAGY YSG AA
Sbjct: 3 LASIRPAAVAGQFYPRDERVLRTQLAEMLSTAVAFESAPAPKAIIVPHAGYMYSGPVAAS 62
Query: 60 AFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+ + P I RV +LGP+H +L A + TP+G++P+ E L+A
Sbjct: 63 AYALLAPLRERIRRVVMLGPTHRVAVRGFSLPAAQRFATPLGEVPVARGDWEALQARD-- 120
Query: 118 ELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
D+ VD+ EH +E+ LP+L E +IVP+LVG + E A L +
Sbjct: 121 ---DVLVDDRPHALEHCLEVQLPFLQVCLER--FEIVPVLVGDASPEATA---SLIERLW 172
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
P +SSD H Y+ Y + + D+ +D + P L
Sbjct: 173 GGPETLILISSDLSH------YLSY--------REAQWSDRAAVDQVLELRPG------L 212
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+++ CG PI+ L L ++ L S R V Y S A
Sbjct: 213 DHEQA-CGATPINGLL--LAARKHDLQAHLLDLRNSGDTAGDRSRVVGYTSIA 262
>gi|54308444|ref|YP_129464.1| hypothetical protein PBPRA1251 [Photobacterium profundum SS9]
gi|46912873|emb|CAG19662.1| conserved hypothetical protein [Photobacterium profundum SS9]
Length = 260
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 94/190 (49%), Gaps = 15/190 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG +Y +++ L ++LD W + +R +I PHAGY YSG AA A+
Sbjct: 3 MRLPAVAGRFYEESAPLLQKQLDDWCSPPTTHRD-LIRALIVPHAGYIYSGEVAAKAYCQ 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ +I +V L+GPSH Y CA+ + + TP+G + +D++ I+ L ++ DI
Sbjct: 62 LQAETIKKVILIGPSHRYAFHGCAVPNSDYFSTPLGSVSIDVQSIDNL-----IKIDDIK 116
Query: 124 VDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
V E EH +E+ LP+L H ++P+L V+ A ++
Sbjct: 117 VSEQVHAQEHCLEVQLPFLQTCL--HQFTLLPLLTSNVSFIKVA---KIIDALWQQDDTL 171
Query: 180 FSVSSDFCHW 189
+SSD H+
Sbjct: 172 LVISSDLSHF 181
>gi|399157091|ref|ZP_10757158.1| hypothetical protein SclubSA_09246 [SAR324 cluster bacterium SCGC
AAA001-C10]
Length = 279
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 125/279 (44%), Gaps = 52/279 (18%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLR----EAGLPKS-----PEVRGVIAPHAGYSYS 53
IR + A +Y + L +E+ +L E +PKS E+R +I PHAGY YS
Sbjct: 9 NIRPPAVASMFYPGKAAELRKEVQNYLSNAETEEDVPKSKKGETAELRTLIVPHAGYIYS 68
Query: 54 GRAAAYAFGNI--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
G+ AA A+ + + RV LLGP+H + A ++TP+G++ LD E+IE
Sbjct: 69 GKIAACAYQLLQENQNQFKRVLLLGPAHRVWLQGAAFPETDAFETPLGEITLDKELIE-- 126
Query: 112 KATGKFELMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRL 168
K +F + + DEA EH +E+ LP+L + +++P++VG E+ +
Sbjct: 127 KMLEEFSWISVS-DEAHSEEHCLEVQLPFLQETLGD--FELLPLVVGETKTEH---ITEI 180
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
K+ +D +S+D H + Y + ++ A++ + + I T D
Sbjct: 181 IQKFSEDTETLIVISTDLSH------FHDYQTAREIDARTANAIELLEQNRISTEDA--- 231
Query: 229 KKYLLEYDNTICGRHPISVFL----------HMLGNCST 257
CG +P+ L H LG C++
Sbjct: 232 -----------CGAYPLRGALLAASQNHWNVHRLGLCNS 259
>gi|288817556|ref|YP_003431903.1| putative dioxygenase [Hydrogenobacter thermophilus TK-6]
gi|384128321|ref|YP_005510934.1| hypothetical protein [Hydrogenobacter thermophilus TK-6]
gi|288786955|dbj|BAI68702.1| putative dioxygenase [Hydrogenobacter thermophilus TK-6]
gi|308751158|gb|ADO44641.1| protein of unknown function DUF52 [Hydrogenobacter thermophilus
TK-6]
Length = 264
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 113/218 (51%), Gaps = 19/218 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYAF 61
KI+R + AG +Y + K+L + + L+ + L P P G++APHAGY+YSG A +
Sbjct: 2 KIKRPNVAGIFYPSDPKKLRDTVLALLKNSKLFPLKP--VGLVAPHAGYTYSGAVAGAVY 59
Query: 62 GNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-- 116
++ +S+ V ++ PSH+++ + ++TP+G + +D + IE T K
Sbjct: 60 KQLENLDLSKDWMVVIIAPSHYFFFEGITFGSYQAFETPLGQVEVDRKAIERFIDTRKSL 119
Query: 117 -FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
DI D+ EHS+E+ LP+L + + ++VP+L +A+ E + L + +
Sbjct: 120 RVSFSDIPYDK-EHSLEVQLPFLQVLLKS--FRLVPVLYS--DAKPEEIKEVL--NFFEG 172
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD 213
+ F VSSD H+ S N Y K H IE+LD
Sbjct: 173 ENTLFVVSSDLSHYHSE-NTARY--KDSFCHAGIESLD 207
>gi|156937782|ref|YP_001435578.1| hypothetical protein Igni_0992 [Ignicoccus hospitalis KIN4/I]
gi|166227550|sp|A8AB69.1|Y992_IGNH4 RecName: Full=MEMO1 family protein Igni_0992
gi|156566766|gb|ABU82171.1| protein of unknown function DUF52 [Ignicoccus hospitalis KIN4/I]
Length = 285
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 138/292 (47%), Gaps = 26/292 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPEVR-----GVIAPHAGYSYS 53
+R + AG++Y +++ L ++ W L +P+ VR G + PHAGY YS
Sbjct: 3 VRPPAVAGTFYPADAEELIRLIE-WSFTHPLGPGEVPEVSPVRRKASVGYMVPHAGYIYS 61
Query: 54 GRAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+++ ++ + V ++GP+H P ++ ++++TP+G + D E I EL
Sbjct: 62 GPVAAWSYYHLAQEGAPETVVIIGPNHTGLGPAVSVMPPSIWETPLGGVKTDDEAISELL 121
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
D EHS+E+ LP+L +F G +IVPI++ RL +
Sbjct: 122 KVSNVVEEDYSAHAYEHSLEVQLPFLQYLF-GDSFRIVPIVMKV----QTPSVARLLMQS 176
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ + ++ D+ S + HY+ + K + AL+K I DP+ ++ L
Sbjct: 177 IKE--AMENLGRDYVVLSSS-DLNHYEPHDITVEKDMLALEK-----IVNLDPEGLQEVL 228
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
++YD ++CG P+ V +++ + I L++ S + + V YA+
Sbjct: 229 VKYDISMCGPGPVMVNMYLDKEYGAERAI-LLKHATSGDTSGDKSAVVGYAA 279
>gi|89093257|ref|ZP_01166207.1| hypothetical protein MED92_04232 [Neptuniibacter caesariensis]
gi|89082553|gb|EAR61775.1| hypothetical protein MED92_04232 [Neptuniibacter caesariensis]
Length = 260
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 96/198 (48%), Gaps = 7/198 (3%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R A+ AG +Y D+ + L EL G + + ++ PHAGY YSG AA A+ I
Sbjct: 4 RTAAVAGMFYPDDKECLLTELSGLISSQSEREGTPPSLLVVPHAGYQYSGTVAAQAYKQI 63
Query: 65 -DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
D + RV LLGPSH ALS A + +P+G+L LD E+I EL + +
Sbjct: 64 TDWSYYERVLLLGPSHRVPLRGMALSDADKFSSPLGELNLDTELIAELNSQ-DLAAYNSA 122
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
E EHS+E+ LP+L F + I+P++VG + A R+ + VS
Sbjct: 123 AHELEHSLEVQLPFLQ--FLNCDLPIIPVVVGVAPRDEVASLIRIVEQSY---RILVIVS 177
Query: 184 SDFCHWGSRFNYMHYDKK 201
+D H+ S D+K
Sbjct: 178 TDLSHFHSFEEACDIDRK 195
>gi|406904129|gb|EKD46005.1| hypothetical protein ACD_69C00050G0003 [uncultured bacterium]
Length = 260
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 127/285 (44%), Gaps = 46/285 (16%)
Query: 10 AGSWYTDNSKRLAEE----LDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID 65
AG +Y DN L ++ LD +A LP + +IAPHAGY YSG+ AA A+ +
Sbjct: 8 AGKFYPDNPAVLRQDILNMLDNAADKANLPLP---KAIIAPHAGYIYSGQVAASAYVCLA 64
Query: 66 P--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I RV L P+H Y A + A Y TP+G + +D E I L F +++
Sbjct: 65 KAKNQIKRVVLFAPAHQYPVDGIATTNAKSYLTPLGQIAIDQETISSL----SFPYLNV- 119
Query: 124 VDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAKYVDDPSNF 179
++EA EH++E+HLP+L L +VP+LVG A +E E + L+ P
Sbjct: 120 IEEAFSSEHAVEVHLPFLQLTLGSFL--LVPLLVGSAAVSEVENILEELWG----GPETL 173
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
+SSD H+ +K ++ DK D I +PD L Y++
Sbjct: 174 VVISSDLSHYHP--------------YKLAKSWDKDTADAIIALNPDG-----LSYEHA- 213
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG I L + K+ + L S +D+ V Y +
Sbjct: 214 CGATGIKALLGV--AAKKKMHVLQLDLRNSGDTAGSKDNVVGYGA 256
>gi|341582002|ref|YP_004762494.1| hypothetical protein GQS_04580 [Thermococcus sp. 4557]
gi|340809660|gb|AEK72817.1| hypothetical protein GQS_04580 [Thermococcus sp. 4557]
Length = 293
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 135/292 (46%), Gaps = 12/292 (4%)
Query: 1 MEKIRRASHAGSWYTDNSK--RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
M ++R + AGS+Y + EE G L E G + + +APHAGY +SG A+
Sbjct: 1 MAEMRYPAVAGSFYPSGEALIEMLEEFLGDLGEEGSER--RITAGVAPHAGYVFSGYTAS 58
Query: 59 YAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+ I + F +LGP+H A+ + ++TP+GD+ +D + + +
Sbjct: 59 RTYKAIFEDGLPETFVILGPNHTGLGSPIAIYPSGKWRTPLGDVDVDSGMTKAIAKLSGI 118
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
+D + EHS+E+ LP++ + E G ++IVPI +G + E GR + +
Sbjct: 119 ADLDELAHKYEHSIEVQLPFIQYLSELAGKEVRIVPIALGIQDEEVSEDLGRAVFEASQE 178
Query: 176 PSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
V S+DF H+G + Y+ + + + I+ D + I D + L
Sbjct: 179 LGRDVVVIASTDFMHYGPMYGYVPFRARASELPHRIKEWDFRVIRRILDFDVKGMFRELR 238
Query: 234 EYDNTICGRHPISVFLHML-GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
E ++T+CG P +V M+ + ++ + L Y S + D+ V YAS
Sbjct: 239 EMNHTMCG--PGAVGTAMVYSRSAGALEAELLHYTTSFEVSRSTDAIVGYAS 288
>gi|374628598|ref|ZP_09700983.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methanoplanus limicola DSM 2279]
gi|373906711|gb|EHQ34815.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methanoplanus limicola DSM 2279]
Length = 268
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 82/294 (27%), Positives = 127/294 (43%), Gaps = 39/294 (13%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLR---EAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
++RR++ AGS+Y +S L ++L + + L PE GV++PHAG +SG+ +AY
Sbjct: 2 ELRRSTLAGSFYPADSSVLRKQLKSFFENIPDKSLSALPEPFGVVSPHAGIIFSGQTSAY 61
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ I ++GPSH Y P C + ++TP+G L D I L
Sbjct: 62 AYSAIPKEFKGTFVVIGPSHAGY-PDCV--STDNWETPLGTLKNDNGFIRNL-------- 110
Query: 120 MDICVD-----EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
DI VD + E+S+E+ + ++ F +IVP+LVG+ N K ++
Sbjct: 111 -DIPVDNYAHRQRENSLEVQMQFIKYRFPD--CRIVPVLVGSQTRRNSLSLAEKLFKAIE 167
Query: 175 DPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
N + SSDF H+ + K LD +D + D F + +
Sbjct: 168 LSENPVKIVASSDFSHY--------------ISDKDARMLDNYAIDALLHLDTAEFYRRI 213
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ + CG PI V + K +K L Y S R V YAS A
Sbjct: 214 NDEGISACGYGPIMVMAEVSKRLGAKKAVK-LHYSTSGDINGDRSQVVGYASLA 266
>gi|219564289|dbj|BAH03713.1| hypoyhetical protein [Saccharomyces pastorianus]
Length = 139
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/138 (38%), Positives = 85/138 (61%), Gaps = 12/138 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAAYAF 61
IR A+HAGSWY++ ++ L+++L +L ++ L K P R +I PHAGY Y G AY++
Sbjct: 3 IRPATHAGSWYSNRAQELSQQLHTYLIKSTL-KGPIHNARIIICPHAGYRYCGSTMAYSY 61
Query: 62 GNIDPT-SISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLE-----VIEELKAT 114
++D ++ R+F+LGPSHH Y+ + +S + +TP+G+L +D + + +E
Sbjct: 62 ASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPEN 121
Query: 115 GK--FELMDICVDEAEHS 130
GK F+ MD D AEHS
Sbjct: 122 GKKLFKPMDHDTDMAEHS 139
>gi|219852949|ref|YP_002467381.1| hypothetical protein Mpal_2377 [Methanosphaerula palustris E1-9c]
gi|219547208|gb|ACL17658.1| protein of unknown function DUF52 [Methanosphaerula palustris
E1-9c]
Length = 262
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 133/291 (45%), Gaps = 39/291 (13%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
KIR + AG +Y L + L + G S G++APHAGY YSG AA A+
Sbjct: 2 KIRTCAVAGMFYPGEPSHLEQLLGKFFAVTGHETS--ASGIVAPHAGYIYSGAVAALAYA 59
Query: 63 NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ P+S F ++GPSH Y P C ++A ++TP+G + D +++ L D
Sbjct: 60 TL-PSSFDGTFVVIGPSHRGY-PTC--TSAVPWETPLGIVDTDTDLVSAL---------D 106
Query: 122 ICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
+ V+E+ EHS+E+ +P++ F +I P+L+G + + GR+ A+ +
Sbjct: 107 LPVNESYHADEHSLEVQMPFIKYRFP--RARIAPVLMGDQDLASSVRLGRMIARAARETR 164
Query: 178 NFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
V SSDF H Y+ G +IEAL + +++ F L E
Sbjct: 165 REIRVVASSDFSH------YVPETVAKGQDLYAIEALKSLEIEL--------FYNRLRER 210
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
++ CG I+ + + K + L+Y S R S V YA+ A
Sbjct: 211 GSSACGYGTIAAMITASIEAGAE-KGELLKYATSGDVTGDRASVVGYAAIA 260
>gi|126465721|ref|YP_001040830.1| hypothetical protein Smar_0821 [Staphylothermus marinus F1]
gi|126014544|gb|ABN69922.1| protein of unknown function DUF52 [Staphylothermus marinus F1]
Length = 289
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 132/299 (44%), Gaps = 40/299 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSG 54
+R AG +Y ++L E ++ G L + + + G +APHAGY YSG
Sbjct: 7 VRLPVVAGYFYPSEPEKLRELIEHSFKHPLGPGELPQVSSTRRKDALGYVAPHAGYIYSG 66
Query: 55 RAAAYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
AA+ + N+ P +I ++G +H ++ A + TP+G+L +D E+ +
Sbjct: 67 PIAAHVYYNLALDGKPETI---IIIGTNHTGLGSLVSVYPAGKWITPLGELEVDAELARD 123
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-----NEAMY 165
+ + +D EHS+E+ LP++ +F G+ +KIVPI++G + ++A+Y
Sbjct: 124 IVSNSDLAELDTYAHVEEHSVEVQLPFIQYLF-GNNVKIVPIVLGLHTPDVARDLSKAIY 182
Query: 166 GRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDP 225
L D SSDF H+ H A DK +D I D
Sbjct: 183 ESLQTMGRD---TIIIASSDFNHYDP--------------HDITVAKDKKALDRILALDT 225
Query: 226 DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
D K +LE TICG I + ++ K + K L+Y S + + V YA+
Sbjct: 226 DGLYKVILEEPVTICGPGGIMTLIELV-KLYGKGEAKLLKYATSGDVSGDKSAVVGYAA 283
>gi|383789560|ref|YP_005474134.1| putative dioxygenase [Spirochaeta africana DSM 8902]
gi|383106094|gb|AFG36427.1| putative dioxygenase [Spirochaeta africana DSM 8902]
Length = 281
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/303 (28%), Positives = 131/303 (43%), Gaps = 55/303 (18%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREA--------GLPKSPE-----VRGVIAPHAG 49
+R + AG +Y ++ L + + G L A GLP + VR VIAPHAG
Sbjct: 5 SVRNPAVAGKFYPRSAAELDQVVAGLLTRASTRLQDVSGLPDAANRTAGTVRAVIAPHAG 64
Query: 50 YSYSGR------AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
Y +SG AAA + + P RV LLGP+H+ AL A ++TP+GD P+
Sbjct: 65 YMFSGSIAALSFAAAAQYRDAPP---RRVILLGPAHYAAGITVALPQADSFRTPLGDHPV 121
Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA 163
D E ++ L TG E + AEHS+E+ LP++ V+ + I+P+L + E
Sbjct: 122 DGEAMQRLLQTGLAE-TNGAAHSAEHSLEVMLPFIMAVWGA--LPILPLLCSRIAPE--- 175
Query: 164 MYGRLFAKYVDD--PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE 221
RL V++ P + VSSD H+ + E LD ++ E
Sbjct: 176 ---RLAEILVEEIGPGDLLVVSSDLSHFHPA--------------ATAEQLDNALLEAWE 218
Query: 222 TGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVS 281
GDP + CG+ PI + + + ++ L Y S +S V
Sbjct: 219 QGDPHGVSQ------GEACGKLPILASMALARQQGWRHRV--LGYAHSGAAGIGSESVVG 270
Query: 282 YAS 284
Y +
Sbjct: 271 YGA 273
>gi|337288628|ref|YP_004628100.1| hypothetical protein TOPB45_1081 [Thermodesulfobacterium sp. OPB45]
gi|334902366|gb|AEH23172.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thermodesulfobacterium geofontis OPF15]
Length = 268
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/284 (26%), Positives = 128/284 (45%), Gaps = 28/284 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL--REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
R + AG +Y+DN + L L + RE + +G+I PHAGY YSG A +G
Sbjct: 3 REPAVAGYFYSDNPRELKFHLSTLIKFRENKI----RAKGIIVPHAGYMYSGWVAGKVYG 58
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+I P + ++G +H + +L + TP+G++ +D E+ E+L + D+
Sbjct: 59 SIIPPDTA--IIIGTNHTGLGERISLFPGEAFITPLGEVSVDKEISEDLIKSVPLLSRDV 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
EHS+E+ +P+L + +KIVPI + ++ E G+ A+ + S+ + +
Sbjct: 117 MAHLHEHSIEVQVPFLQYI--NPKVKIVPICLSKLSLEEVKELGKGIAEVIKKHSDKYIL 174
Query: 183 ---SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSDF H+ V H+ + D M + I + F K + E +
Sbjct: 175 IVGSSDFSHY--------------VPHEVAKKKDLMAIREILKLSEEDFLKVIYEEKISA 220
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
CG PI+V ++ N K + + Y+ S S V Y
Sbjct: 221 CGYIPIAVTINTCKNLGAS-KAELVEYQTSGDIIKDYSSVVGYG 263
>gi|218781792|ref|YP_002433110.1| AMMECR1 domain-containing protein [Desulfatibacillum alkenivorans
AK-01]
gi|218763176|gb|ACL05642.1| AMMECR1 domain protein [Desulfatibacillum alkenivorans AK-01]
Length = 499
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/296 (28%), Positives = 129/296 (43%), Gaps = 49/296 (16%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
+E +R + AGS+YT N L ++ +LREA P VR ++APHAGY YSG AA
Sbjct: 40 IENVREPAVAGSFYTANPSALKRQVKEFLREAKQTPVKGRVRALVAPHAGYVYSGVVAAA 99
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
++ I P + +V ++G SH ++ + Y+TP+G +P+ E ++ G L
Sbjct: 100 SYEKI-PRDVRKVIIIGASHRARYRGASIPNVSAYRTPLGLVPIAPEAGRLAESPGFVRL 158
Query: 120 MDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILV-GAVNAEN-----EAMYGRLFA 170
D+A EHS+E+ LP+L V + +PIL+ G VN E E + R F
Sbjct: 159 -----DQAHAQEHSIEVQLPFLQVVLGD--FEFIPILLGGGVNPEKLAEALEPLVQRGF- 210
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
F S+D H Y Y+ + + LD + I D +A K
Sbjct: 211 --------FLIASTDLSH------YYPYE--------TAQNLDNICCQAIPDLDYEAMK- 247
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ CG P + + N + L Y S +D V YA+ A
Sbjct: 248 -----NCKACGWVPTVAVMKVAENLGWTGTL--LDYRNSGDTAGSKDKVVGYAAIA 296
>gi|297527606|ref|YP_003669630.1| hypothetical protein Shell_1646 [Staphylothermus hellenicus DSM
12710]
gi|297256522|gb|ADI32731.1| protein of unknown function DUF52 [Staphylothermus hellenicus DSM
12710]
Length = 289
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 122/267 (45%), Gaps = 31/267 (11%)
Query: 27 GWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI----DPTSISRVFLLGPSHHYY 82
G L + + E G +APHAGY YSG AA+ + N+ P +I ++G +H
Sbjct: 39 GELPKVSNARRKETLGYVAPHAGYIYSGPIAAHVYYNLALDGKPETI---IIIGTNHSGL 95
Query: 83 TPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVF 142
++ A + TP+G+L +D E+ ++ +D+ EHS+E+ LP++ +F
Sbjct: 96 GSLVSVYPAGKWITPLGELEVDAELARDIVNNSDLAELDVYAHVEEHSIEVQLPFIQYLF 155
Query: 143 EGHLIKIVPILVGAVNAE-----NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMH 197
G +KIVPI++G + ++A+Y L + D + SSDF H+
Sbjct: 156 -GDNVKIVPIVLGLHTPDVARDLSKAIYESLQSTGRD---SIIIASSDFNHYDP------ 205
Query: 198 YDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCST 257
H A DK ++ I D D F K LLE TICG I + ++
Sbjct: 206 --------HDITVAKDKKALERILALDTDGFYKVLLEEHVTICGPGGIMTLIELV-KLYG 256
Query: 258 KIKIKFLRYEQSSQCKTKRDSSVSYAS 284
K + K L+Y S + + V YA+
Sbjct: 257 KGEAKLLKYATSGDVSGDKSTVVGYAA 283
>gi|374340342|ref|YP_005097078.1| dioxygenase [Marinitoga piezophila KA3]
gi|372101876|gb|AEX85780.1| putative dioxygenase [Marinitoga piezophila KA3]
Length = 268
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 137/291 (47%), Gaps = 35/291 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLR---EAGLP-KSPEVRGVIAPHAGYSYSGRAAAY 59
+R AG++Y + + L +++ + E +P KS ++ G+I PHAGY +SG+ AAY
Sbjct: 1 MRYPVVAGTFYPADYEELKKQISSFFERFEEIEIPEKSIDLTGLIVPHAGYVFSGQTAAY 60
Query: 60 AFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE----VIEELKAT 114
+ + + ++GP+H P ++ + TP G + +D E ++ EL+
Sbjct: 61 GYYELMKKGKVKTAIIIGPNHTGVGPSLSVYPEGSWLTPFGTIDVDTEISNFILSELRIK 120
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
G DI E EHS+E+ LP+L ++ G+ KIVPI++G E ++ K+
Sbjct: 121 G-----DIIAHEYEHSIEVQLPFLQYLY-GNDFKIVPIIMGVQTLEMSKKLSKVIKKFAK 174
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
D + SSD H+ N+ +K I IEAL + P+ +Y+ +
Sbjct: 175 DGVVIIA-SSDLNHYE---NHEVTLEKGNFI---IEALKEK--------SPEKVYEYIKK 219
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
Y+ T CG I+ L+ + ++ L + S + + +V Y SA
Sbjct: 220 YNITACGFGAINTLLY-----TGFENVRILHHTTSGEIFGDYERTVGYLSA 265
>gi|346224900|ref|ZP_08846042.1| AMMECR1 domain-containing protein [Anaerophaga thermohalophila DSM
12881]
Length = 496
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 72/237 (30%), Positives = 111/237 (46%), Gaps = 25/237 (10%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP----EVRGVIAPHAGYSYSGRAAA 58
K R + AG +Y +S L E L A KSP EV +I PHAGY YSG AA
Sbjct: 25 KDRVPAVAGMFYEGDSASLEEHLSSLFERA---KSPVCDGEVAALIVPHAGYVYSGGVAA 81
Query: 59 YAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
F I + T VFL+G SH ++ T Y TP+G + ++ E+ E+L +
Sbjct: 82 SGFNQIPEDTGYENVFLIGSSHRIAFNGASVYTKGDYLTPLGRVEVNRELAEKLVESSSV 141
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHL---IKIVPILVGAVNAENEAMYGRLFAKYVD 174
+ EHS+E+ LP+L + HL K++P+++G +A + Y
Sbjct: 142 ISDNTAPHIEEHSLEVQLPFL----QYHLKKPFKLIPVVMGPQDASGAEKTAEVLKPYF- 196
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHG---------VIHKSIEALDKMGMDIIET 222
P N F +S+DF H+ + DKK ++ K++E K+G++ + T
Sbjct: 197 KPGNLFVISTDFSHYPGYEDAKEVDKKTAEAILKNDPRLLLKTLEENRKLGIEGLAT 253
>gi|407776168|ref|ZP_11123457.1| dioxygenase [Thalassospira profundimaris WP0211]
gi|407280884|gb|EKF06451.1| dioxygenase [Thalassospira profundimaris WP0211]
Length = 487
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 126/293 (43%), Gaps = 33/293 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRA 56
M IR + AG++Y ++ L E+DG + A + + +I PHAG +SG
Sbjct: 1 MTIIRPPAVAGTFYPADADLLRSEIDGLMDAALHSDAVSGDASPKAIIVPHAGLMFSGSL 60
Query: 57 AAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA F + +I R+ ++GP+H AL+ A + TP+G++ DL +++ A
Sbjct: 61 AALGFATVRALKDTIKRIVIIGPAHRMAFQGIALARADQFATPLGNMRCDLPALQKALAL 120
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+++D EH +E+ LP++ ++F E I IVP+LV + + K
Sbjct: 121 PHVQMLDDA-HTLEHGLEIELPFIQRLFGEDADIGIVPLLVSRCSPRQ---VHEVIEKLW 176
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
P +SSD H + YD + +++ ++ + I+T D
Sbjct: 177 GGPETLIVISSDLSH------FHDYDTAQRMDNRTRTMIENFDAENIDTTD--------- 221
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG P++ L + N +KIK L S + V Y + A
Sbjct: 222 -----ACGALPVAGMLMVARN--RGMKIKTLGMRNSGDVTGDKSRVVGYGAWA 267
>gi|320451436|ref|YP_004203532.1| hypothetical protein TSC_c23830 [Thermus scotoductus SA-01]
gi|320151605|gb|ADW22983.1| conserved hypothetical protein [Thermus scotoductus SA-01]
Length = 270
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 127/290 (43%), Gaps = 34/290 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+ +R + AG +Y ++RL +E++G L++A P VRG+++PHAGY YSG+ AA
Sbjct: 1 MDLVRLPAVAGYFYPLEAERLRQEVEGLLQKAQASPDPRVRGLLSPHAGYFYSGKVAAEG 60
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
F + R+ LLGPSH A + TP+G++ +DLE L A G
Sbjct: 61 FKVLSAWRGRARRILLLGPSHFVPFLGAAFYPYRAWATPLGEVAVDLEGGSRLVAKGSPF 120
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDP 176
L EHS+E+ LP+L I+P+L G + + EA+ L
Sbjct: 121 LTYEEPFREEHSLEVLLPFLQVALPD--TPILPLLFGEADPKEVAEALLSEL------GE 172
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
+ SSD H+ + D+K + K++ ALD G+ E
Sbjct: 173 GDLVVASSDLSHYHPDPVARNRDQK--TLEKAL-ALDVEGVAQGEA-------------- 215
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG P + + G+ K + L Y S++ V YA+ A
Sbjct: 216 ---CGHLPWATLTALAGSLGWKPTL--LAYATSAEASGDTSRVVGYAAVA 260
>gi|46199650|ref|YP_005317.1| hypothetical protein TTC1348 [Thermus thermophilus HB27]
gi|46197276|gb|AAS81690.1| hypothetical conserved protein [Thermus thermophilus HB27]
Length = 280
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 119/291 (40%), Gaps = 36/291 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+ +R AG +Y L EE+ L A P P VRGV++PHAGY+Y+GR A A
Sbjct: 18 MDLVRLPRVAGYFYPGEKAALEEEVGALLAGARTPPLPGVRGVLSPHAGYAYAGRVMAEA 77
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG-KF 117
F + RVFLLGPSH A ++TP+G++ +DLE L G F
Sbjct: 78 FRALSAWRGKARRVFLLGPSHFVAFSGVAFFPYRAWRTPLGEVAVDLEGGRRLLGQGAPF 137
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDD 175
E EHS+E+ LP+L I+P+L G V+ EA+ L K
Sbjct: 138 RAYREPFLE-EHSLEVLLPFLQVALP--QTPILPLLFGEVDPGEVAEALLPELGQK---- 190
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
+ SSD H+ D K K ALD G+ E
Sbjct: 191 --DLVVASSDLSHYHPDPVARKLDAK---TLKRALALDAEGVAQGEA------------- 232
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR P + + K + L Y S++ R+ V Y + A
Sbjct: 233 ----CGRLPWGTLTAL--ARALGWKPRLLAYATSAEAGGGRERVVGYGALA 277
>gi|91774303|ref|YP_566995.1| hypothetical protein Mbur_2394 [Methanococcoides burtonii DSM 6242]
gi|121689012|sp|Q12TI1.1|Y2394_METBU RecName: Full=MEMO1 family protein Mbur_2394
gi|91713318|gb|ABE53245.1| memo-like protein of unknown function DUF52 [Methanococcoides
burtonii DSM 6242]
Length = 264
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 124/287 (43%), Gaps = 25/287 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y ++K L +E+ + S +V G + PHAGY YSG AA+AF
Sbjct: 1 MRQPTVAGKFYPLSTKALRKEIVKCFHGLEI-MSEDVIGAVVPHAGYVYSGPVAAHAFAR 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ +F GP+H Y A+S V+ TP GD+ D E+ + L G MD
Sbjct: 60 LPKADTYVIF--GPNHTGYGSPVAMSQ-DVWNTPFGDVETDRELGKLL--AGTIIDMDEV 114
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
EHS+E+ +P+L +F G K++PI +G + + G A+ V + F
Sbjct: 115 AHRYEHSVEVQIPFLQYLF-GSDFKVLPICMGMQDEDTAVEVGLEVARAVKESGKKVVFI 173
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H+ V + E D +D I D + E D T CG
Sbjct: 174 ASSDLSHY--------------VPQEKAEKSDNYLIDAILDMDVPEIYRRKYEKDITACG 219
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
PI+ L C K + ++Y S T V YA+ K
Sbjct: 220 YGPITAMLTAAKECGAK-NTELVKYGTSGDV-TGDPMVVGYAAIIVK 264
>gi|452965208|gb|EME70234.1| dioxygenase [Magnetospirillum sp. SO-1]
Length = 456
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 130/292 (44%), Gaps = 37/292 (12%)
Query: 1 MEKIRRASHAGSWY----TDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRA 56
M IR + AG +Y + +++L LDG + P + +IAPHAG+ YSG
Sbjct: 1 MTAIRPTAVAGQFYPADFAEANRQLTAFLDGAAPAPCAGRRP--KALIAPHAGWVYSGPV 58
Query: 57 AAYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA A+ + P +RV LLGPSH ALS+A + +P+G PLD + L
Sbjct: 59 AAGAYALLKPFRGGYARVVLLGPSHRLAFRGMALSSADQWASPLGPQPLDKD-WSRLAGV 117
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
++D + EHS+E+H+P+L ++P+++G +A E + G L A +
Sbjct: 118 AGVGVLDQAHAQ-EHSLEVHVPFLQATL--GEFTLLPVVIG--DATPEMVAGLLDALWGG 172
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
D +S+D H Y+ Y++ G ++ A+++M DP A +
Sbjct: 173 D-ETLIVISTDLSH------YLPYEQCRGTDGQTAAAIERM--------DPVAISR---- 213
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR P+ L ++I L S +D V Y + A
Sbjct: 214 --EGACGRLPVGGLLT--AAKRRGLEIVTLDVRNSGDTAGPKDRVVGYGAWA 261
>gi|350561283|ref|ZP_08930122.1| protein of unknown function DUF52 [Thioalkalivibrio thiocyanoxidans
ARh 4]
gi|349781390|gb|EGZ35698.1| protein of unknown function DUF52 [Thioalkalivibrio thiocyanoxidans
ARh 4]
Length = 267
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 95/192 (49%), Gaps = 10/192 (5%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAAY 59
E +R + AG +Y D+ L + + +A P + + +IAPHAGY YSG AA
Sbjct: 5 ETVRDPAVAGLFYPDDPAALRRAVQALMADATGPAAGRSLPKALIAPHAGYRYSGPVAAS 64
Query: 60 AFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+ + S I RV LLGPSH A + A Y+TP+G + +D + ++
Sbjct: 65 AYAALGNASDRIQRVVLLGPSHRVPFRGIAATGAGAYRTPLGTIAVDRPALASIRELPGV 124
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
+D+ EHS+E+HLP+L ++ G+ +VP++VG E A + + P
Sbjct: 125 VTLDLA-HGPEHSLEVHLPFL-QLLLGNF-NLVPLVVGDAGPEQVAA---VLERLWGGPE 178
Query: 178 NFFSVSSDFCHW 189
VSSD H+
Sbjct: 179 TLIVVSSDLSHY 190
>gi|167622889|ref|YP_001673183.1| hypothetical protein Shal_0949 [Shewanella halifaxensis HAW-EB4]
gi|167352911|gb|ABZ75524.1| protein of unknown function DUF52 [Shewanella halifaxensis HAW-EB4]
Length = 269
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/294 (24%), Positives = 130/294 (44%), Gaps = 38/294 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGL-------PKSPEVRGVIAPHAGYSYSGRA 56
IR A+ AG +Y L L+G+L A + + + + +I PHAGY YSG
Sbjct: 5 IRPAAVAGLFYPAAPDELRAMLEGYLARARMSLISQANTQHSQAKVIIVPHAGYIYSGLV 64
Query: 57 AAYAFGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+A+ I+ ++++V L+GP+H Y AL + ++TP+G + +D +E L++
Sbjct: 65 AAHAYVLIESMAATVNKVLLIGPAHRVYLQGGALPQSQYFETPLGQISIDKRSVEMLESN 124
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+ D+ + EH +E+ LP+L +++P+L+G + + A +L +
Sbjct: 125 PHITVSDL-PHQQEHCLEVQLPFLQHCLNQ--FELIPLLIGESDPQETA---KLLEQLWG 178
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
VS+D H+ H DK K ++ L+ +
Sbjct: 179 GEETLVVVSTDLSHFHRYAEATHLDKL--TCQKILQGLETI------------------- 217
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+ CG ++ ML ++ + L Y+ S ++ V YAS A++
Sbjct: 218 FPEQACGSFCLNAL--MLQLKRHQLCLTQLSYQNSGDTAGDKNRVVGYASFASR 269
>gi|389582562|dbj|GAB65300.1| hypothetical protein PCYB_053180 [Plasmodium cynomolgi strain B]
Length = 149
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 68/120 (56%)
Query: 171 KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
KY D N F SSDFCH+G RF++ + +K+ H + +D+ +II D F
Sbjct: 30 KYFQDEGNLFLFSSDFCHYGRRFSFTNILQKYSDTHIFKQNMDRDAANIISRHDIAGFID 89
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
YL + NTICG +PI + L +L + S + K + Y QS+Q ++ DSSVSYA + ++
Sbjct: 90 YLNKTHNTICGSNPIKMMLQLLQDFSGNVSTKLMHYSQSNQARSTNDSSVSYAGMVSFIN 149
>gi|116833213|gb|ABK29519.1| CG8031-PA-like protein [Helicoverpa armigera]
Length = 82
Score = 90.9 bits (224), Expect = 6e-16, Method: Composition-based stats.
Identities = 40/80 (50%), Positives = 52/80 (65%), Gaps = 1/80 (1%)
Query: 11 GSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSIS 70
GSWY +N L+ +LD WL +A L P R +IAPHAGYSY G AA+A+ + P +
Sbjct: 1 GSWYIENGNELSRQLDLWLSKADLTHGP-ARAIIAPHAGYSYCGACAAFAYRQVSPVVVK 59
Query: 71 RVFLLGPSHHYYTPKCALST 90
R+F+LGPSHH CALS+
Sbjct: 60 RIFILGPSHHVRLGGCALSS 79
>gi|297568555|ref|YP_003689899.1| protein of unknown function DUF52 [Desulfurivibrio alkaliphilus
AHT2]
gi|296924470|gb|ADH85280.1| protein of unknown function DUF52 [Desulfurivibrio alkaliphilus
AHT2]
Length = 270
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 127/284 (44%), Gaps = 24/284 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL--REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
IR + A +Y ++ L E L + R A P P R V+ PHAGY +SGR A
Sbjct: 2 IRTPAVAHQFYPGEAEVLRELLTRLIGRRAADQPGQP-ARAVVMPHAGYVFSGRVAGETV 60
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
++ V +LGP+HH AL ++ + P+G++ ++ E+ ++ G + D
Sbjct: 61 AQVEVPE--DVIVLGPNHHGLGAPAALMSSGSWAMPMGEVQINAELAGLIRRHGPEIVDD 118
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
AEHS+E+ LP+L G ++IVP+ + ++ + + G+ A ++D S
Sbjct: 119 EQAHRAEHSLEVLLPFLQFCQPG--LRIVPLCLARLSPADCRLVGQALAAAIEDFSRPVL 176
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
+ S+D H+ SR + A D + ++ + DP+ + + E ++
Sbjct: 177 LAASTDMSHYESR--------------SAATAKDHLAIERVLALDPEGLYRTVAEQQISM 222
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
CG P + L M + + +RY S + V YA
Sbjct: 223 CGVIPTVITL-MAALQLGATEAELVRYSDSGEASGDTAQVVGYA 265
>gi|161527723|ref|YP_001581549.1| hypothetical protein Nmar_0215 [Nitrosopumilus maritimus SCM1]
gi|229554364|sp|A9A2Y3.1|Y215_NITMS RecName: Full=MEMO1 family protein Nmar_0215
gi|160339024|gb|ABX12111.1| protein of unknown function DUF52 [Nitrosopumilus maritimus SCM1]
Length = 275
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 73/286 (25%), Positives = 121/286 (42%), Gaps = 21/286 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-----KSPEVRGVIAPHAGYSYSGRAAA 58
IR+ AG +Y L E ++ ++ P + + GVI+PHAGY YSG A
Sbjct: 2 IRKPVVAGQFYPGTKNELEEMINSCIQHKFGPGNQIQNNEGIYGVISPHAGYVYSGPTAC 61
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
Y++ I + V +LGP+H A ++TP+G + +D E +E+ K+
Sbjct: 62 YSYKAISSKNPELVIILGPNHFGVGKDVATMVNAQWETPLGLVDVDSEAAKEIANNSKYI 121
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+D +HS+E+ +P L +F KI+PI++ + E G A+ +
Sbjct: 122 EIDEFSHSRDHSLEVQIPMLQSIFSEKF-KILPIILRDQSLEMAKDVGNAVAQIAKSRNT 180
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSDF H+ ++ H + DK ++ I D + F L+E T
Sbjct: 181 MIVASSDFTHYEEN-SFAH-------------SQDKALIEPILEMDVEKFYSVLMEKRVT 226
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG ++ + N +K + L Y S S V Y +
Sbjct: 227 ACGYGAMASVMIACKNLGA-VKGELLSYTTSGDVMGDTSSVVGYGA 271
>gi|406883043|gb|EKD30701.1| hypothetical protein ACD_77C00487G0011 [uncultured bacterium]
Length = 476
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 132/285 (46%), Gaps = 19/285 (6%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVRGVIAPHAGYSYSGRAAAYAFGN 63
R+ + AG +Y +L + L+ L A LPK VR +I PHAGY +SG AA AF
Sbjct: 10 RKPAVAGMFYPAEPDKLQQTLES-LFAAALPKQCSNVRAIICPHAGYIFSGGVAASAFNQ 68
Query: 64 IDP-TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
ID T+ RVF++ SH A+ + P G +D E + L +G E+
Sbjct: 69 IDKNTNYKRVFIIASSHQVSFEGAAIYCDGDFLMPYGREKVDTEFGKML-VSGFPEIFTS 127
Query: 123 CVDE--AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
EHS+E+ LP+L V + + IVPI++GA + + Y P N F
Sbjct: 128 HTGPHINEHSLEVQLPFLHYVLKTNYC-IVPIIIGASDPRICNNIALVLKPYF-TPDNLF 185
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDF H Y +Y+ V ++ EA+ + ++T K + ++C
Sbjct: 186 IISSDFSH------YPNYEDAKKVDAETKEAILSNNPETLQTCLKKNTAKQISHLSTSLC 239
Query: 241 GRHPISVFLHM-LGNCSTKIKIKFLRYEQS--SQCKTKRDSSVSY 282
G + L+M GN S ++ + Y+ S ++C ++D V Y
Sbjct: 240 GWTSVLTMLYMTTGNDS--LEYSGIEYKNSGDAECYGEKDKVVGY 282
>gi|83312805|ref|YP_423069.1| dioxygenase [Magnetospirillum magneticum AMB-1]
gi|82947646|dbj|BAE52510.1| Predicted dioxygenase [Magnetospirillum magneticum AMB-1]
Length = 456
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 135/293 (46%), Gaps = 39/293 (13%)
Query: 1 MEKIRRASHAGSWY----TDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRA 56
M IR + AG +Y + +++L LDG + + P + +IAPHAG+ YSG
Sbjct: 1 MTAIRPTAVAGQFYPADFAEANRQLTAFLDGAVAAPCAGRRP--KALIAPHAGWVYSGPV 58
Query: 57 AAYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA A+ + P S SRV LLGPSH ALS+A + +P+G +PLD + L
Sbjct: 59 AAGAYALLKPFRGSWSRVVLLGPSHRVAFQGMALSSADQWASPLGAVPLDKD-WSRLAGV 117
Query: 115 GKFELMDICVDEAEHSMEMHLPYL-AKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
++D + EHS+E+H+P+L A + E ++P+++G ++ E + G L A +
Sbjct: 118 AGVGVLDQAHAQ-EHSLEVHVPFLQATIGE---FTLLPVVIG--DSSPEMVAGLLEALWG 171
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
D +S+D H Y+ Y++ +++ A++ M D A +
Sbjct: 172 GD-ETLIVISTDLSH------YLPYEQCRSTDGQTVAAIEHM--------DAAALSR--- 213
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ CGR P+ L ++I L S +D V Y + A
Sbjct: 214 ---DGACGRLPVGGLLTAAKR--RGLEIVTLDVRNSGDTAGPKDRVVGYGAWA 261
>gi|374855667|dbj|BAL58522.1| hypothetical conserved protein [uncultured candidate division OP1
bacterium]
Length = 290
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 127/280 (45%), Gaps = 53/280 (18%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--PEVRGVIAPHAGYSYSGRAAA 58
ME+IR A+ AGS+Y +S+RL + ++ L +A P++ +RG+I PHAGY YSG AA
Sbjct: 2 MERIRPAAVAGSFYPASSERLRKTVEALLAQAQ-PQTITGRLRGLIVPHAGYIYSGPVAA 60
Query: 59 YAF----------------GNIDPTSIS-----------RVFLLGPSHHYYTPKCALSTA 91
+ G P S R+ LLGP+H+ Y A
Sbjct: 61 TGYKLLKDLTPQPPSLKRKGEPSPPSSGEGPGERSAQKVRLLLLGPAHYVYFVGAATLDV 120
Query: 92 TVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVP 151
++TP+G + ++I G + + EHS+E+ LP++ V +I P
Sbjct: 121 DAWQTPLGLVECARDLIVRALERGDLQ-SNFDAHAPEHSLEVQLPFVQTVLPN--CEIFP 177
Query: 152 ILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA 211
IL G ++ A R+ + Y+ D +FF VSSD H+ + Y ++ + H++IE
Sbjct: 178 ILTGECDSRELA---RVLSLYL-DAIDFFIVSSDLSHY---YPYDEAVQRDALAHRAIEN 230
Query: 212 LDKMGMD-------------IIETGDPDAFKKYLLEYDNT 238
D M+ ++ +K LL+Y N+
Sbjct: 231 FDFTLMEDRVEACGKTAILTLMRIAHEQGWKAKLLDYRNS 270
>gi|152992570|ref|YP_001358291.1| hypothetical protein SUN_0977 [Sulfurovum sp. NBC37-1]
gi|151424431|dbj|BAF71934.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
Length = 267
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 116/249 (46%), Gaps = 27/249 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR--------EAGLPKSPEVRGVIAPHAGYSYSGRA 56
R + AG +Y ++ + E + + R E L P R VI PHAGY YS
Sbjct: 4 RETAVAGQFYPSDANEIKEMFEHYNRIIEEHLQNEELLHMKP--RAVIVPHAGYVYSAFT 61
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A A + T RV ++GPSH Y ++S Y TP+G LP+D E++ ELK+ +
Sbjct: 62 ANVAMRLLGNTEAKRVVVIGPSHRVYLKGTSISDYDSYNTPLGALPIDRELVNELKS--R 119
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
F L + EHS E+ +P++ K ++ +V ++ G E+ A + +DDP
Sbjct: 120 FGLQFVPDAHHEHSTEVQMPFV-KTYDTD-ASVVELVYG---DEDPARLAEVIDYLLDDP 174
Query: 177 SNFFSVSSDFCHW-----GSRFNYMHYDKKHGV----IHKSIEALDKMGMD-IIETGDPD 226
+S+D H+ + + + D + +H+ EA K+G++ ++ T
Sbjct: 175 DTVVVISTDLSHYYDINKAKQLDSICLDAVQKLDTAELHQGCEACGKIGVEAMLITAKRR 234
Query: 227 AFKKYLLEY 235
K LL+Y
Sbjct: 235 GLKPLLLDY 243
>gi|217967891|ref|YP_002353397.1| hypothetical protein Dtur_1509 [Dictyoglomus turgidum DSM 6724]
gi|217336990|gb|ACK42783.1| protein of unknown function DUF52 [Dictyoglomus turgidum DSM 6724]
Length = 269
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 127/283 (44%), Gaps = 23/283 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K R S AG +Y K L L +++E +P+ + G + PHAGY YSG+ A +
Sbjct: 2 KRRLPSVAGYFYPAREKELLNMLSQFIKE--VPQKTPLLGCVVPHAGYIYSGKVAGLIYS 59
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++ ++ ++GP+H+ A+ + V+ TP+GD+ +D + ++ K+ D
Sbjct: 60 IMEVPEVA--IIIGPNHNGLGYPSAIYSEGVWATPLGDVEIDSILSRKIINNSKYLKEDF 117
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFF 180
EHS+E+ +P+L + IKIVPI V + E G K + D +
Sbjct: 118 VAHHPEHSLEVQVPFLQ--YLNFNIKIVPITVVDYSYEVLKDLGYAIGKSLLDYEKNTVI 175
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSDF H Y Y+ +IEA+ + D F K ++E D +IC
Sbjct: 176 IASSDFSH------YEPYEIAKQKDFYAIEAILNL--------DEKEFIKRVVEKDISIC 221
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
G PI + + + + K + + Y+ S D V YA
Sbjct: 222 GVGPIVITIIAVKLLGGR-KAQLVAYQTSGDITKDIDQVVGYA 263
>gi|220935846|ref|YP_002514745.1| hypothetical protein Tgr7_2683 [Thioalkalivibrio sulfidophilus
HL-EbGr7]
gi|219997156|gb|ACL73758.1| conserved hypothetical protein [Thioalkalivibrio sulfidophilus
HL-EbGr7]
Length = 274
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 122/290 (42%), Gaps = 38/290 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLR---EAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
+R A+ AG +Y ++ L ++G L EA P P + +I PHAGY YSG AA A
Sbjct: 13 VRPAAVAGHFYPADAGLLRTVVEGLLSDFAEAEPPARPP-KALIVPHAGYVYSGAVAARA 71
Query: 61 FGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE---ELKATGK 116
+ + I RV L+GP+H A A ++TP+G +P+D +E EL + +
Sbjct: 72 YNRLRGANGIRRVVLVGPAHRVGFYGIAAPEAAWFETPLGRVPVDQPAVEQACELSSVFR 131
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
++ EH +E+ LP+L + +VP++VG + A R P
Sbjct: 132 YD----AAHAPEHCLEVQLPFLQRTLSEDF-TLVPLVVGGATTNDMA---RTLEAVWGGP 183
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
+SSD H ++ YD V ++ A++ + + IE G
Sbjct: 184 ETLVVISSDLSH------HLDYDAARIVDERTSRAIETLDEEGIEAG------------- 224
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR PI L + + + L S RD V Y + A
Sbjct: 225 -MACGRVPIQGLLRLARQ--RGLAAERLDLRCSGDTAGPRDRVVGYGAYA 271
>gi|54294113|ref|YP_126528.1| hypothetical protein lpl1177 [Legionella pneumophila str. Lens]
gi|53753945|emb|CAH15416.1| hypothetical protein lpl1177 [Legionella pneumophila str. Lens]
Length = 447
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 121/251 (48%), Gaps = 31/251 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y D+ L + + +L +A + K P + ++ PHAGY YSG AA A+ +
Sbjct: 3 VRQPAVAGYFYPDDPLTLKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGSVAASAYAS 61
Query: 64 I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
+ SI+++ +LGP+H Y A + TP+G++ D E++ + + + L
Sbjct: 62 LRDKKNSINKIIILGPAHRLYFKGIAYDPVDKFATPLGEIQQDKELLTQIIDLPYVYSLP 121
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ + EH +E+ LP+ +F KI+P++VG N ++ A RL A+ +
Sbjct: 122 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVVGETNPQDVA---RLIARVWGGDNTLL 174
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSD H+ DKK + SI+ L+ G +I+ + C
Sbjct: 175 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 214
Query: 241 GRHPISVFLHM 251
G P+ FL+
Sbjct: 215 GYFPLRGFLYF 225
>gi|18312203|ref|NP_558870.1| hypothetical protein PAE0818 [Pyrobaculum aerophilum str. IM2]
gi|23822329|sp|Q8ZYE1.1|Y818_PYRAE RecName: Full=MEMO1 family protein PAE0818
gi|18159641|gb|AAL63052.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
Length = 281
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 132/292 (45%), Gaps = 27/292 (9%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-------PEVRGVIAPHAGYSYSGR 55
++R+ + AG +Y + L ++++ ++ PK+ E G +APHAGY YSG
Sbjct: 2 RVRKPAVAGYFYESGREELLQQIEWAVKHELGPKALQMPKLGGEALGGVAPHAGYMYSGP 61
Query: 56 AAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + + VF++ GP+H+ A+ + ++TP+G + +D E+ E + +
Sbjct: 62 VAAWLYSALAGYGKPDVFVIVGPNHYGIGAPVAIMKSGAWETPLGRVEVDRELAEVITSH 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
K D EHS+E+ +P++ F +KIVPI++ GR AK +
Sbjct: 122 FKEVEDDFYAFSKEHSVEVQVPFIQYYFGD--VKIVPIVMWRQTLSTSRELGRAIAKALK 179
Query: 175 DPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ + SSDF H+ + H + + D+M + I D +
Sbjct: 180 EYGRKAYVIASSDFNHY----------EPHDITTRK----DEMAISKILKLDEAGLFEIS 225
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
++D +ICG PI V + I + L++ S +D +V YAS
Sbjct: 226 SKFDISICGIGPIGVLIAAAKELGY-INVTLLKHATSGDTSGYKDETVGYAS 276
>gi|409196691|ref|ZP_11225354.1| AMMECR1 domain-containing protein [Marinilabilia salmonicolor JCM
21150]
Length = 481
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 96/197 (48%), Gaps = 12/197 (6%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-----EVRGVIAPHAGYSYSGR 55
+++ R+ + AG +Y D+ A L G L+E P EV +I PHAGY YSG+
Sbjct: 6 LKRDRKPAVAGMFYEDD----ASLLHGHLKELFSRAKPRDTNDEVAAIIVPHAGYVYSGK 61
Query: 56 AAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA F I + I R+FL+G SH A+ + TP G + +D +V EL
Sbjct: 62 IAASGFNQIPEDAKIERIFLIGSSHRVAFDGAAVYMQGDFMTPFGPVEVDADVAGELVEQ 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
D V EHS+E+ +P+L + H K+VP+LVG +A + K+
Sbjct: 122 ESGIFSDPEVHAREHSLEVQVPFL-QFHLKHPFKLVPLLVGPHDASVTPRIAKTLEKWFT 180
Query: 175 DPSNFFSVSSDFCHWGS 191
N F +S+DF H+ S
Sbjct: 181 S-GNLFVISTDFSHYPS 196
>gi|16081384|ref|NP_393717.1| hypothetical protein Ta0237 [Thermoplasma acidophilum DSM 1728]
gi|13878848|sp|Q9HLJ1.1|Y237_THEAC RecName: Full=MEMO1 family protein Ta0237
gi|10639380|emb|CAC11382.1| conserved hypothetical protein [Thermoplasma acidophilum]
Length = 268
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 130/291 (44%), Gaps = 32/291 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPK---SPEVRGVIAPHAGYSYSGRAAAYAF 61
R+ + AG +Y + +EL L +P+ S + G + PHAG YSGR A Y++
Sbjct: 3 RKPAVAGYFYPERK----DELYSLLSSFAIPEQHVSGTIIGAVVPHAGIIYSGRTAMYSY 58
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I+ +++ ++GP+H TP +L + TP+GD ++ + E L + + D
Sbjct: 59 RAIEKSAVRDFVIIGPNHRPLTPYASLYPEGEWSTPLGDALINDRMAEALYRDSNYIVKD 118
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EHS+E+ +P+L +F G + VP+++G + G K ++DP F +
Sbjct: 119 EESHLMEHSVEVQIPFLQYLF-GDGFRFVPVILGDQEIDVARDIGEAIMK-IEDPFIFIA 176
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE---TGDPDAFKKYLLEYDNT 238
SSDF H+ + + ++K MD+I T D D F L + + T
Sbjct: 177 -SSDFTHY-----------------EDAKRVEKKDMDLISAILTLDLDKFYSVLEKENVT 218
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
CG I+ ++ ++ FL + S V YAS + +
Sbjct: 219 ACGYGAIAALMYYTKKRGG--RMIFLNHSNSGDVTGDYSEVVGYASLVSVI 267
>gi|298674293|ref|YP_003726043.1| hypothetical protein Metev_0320 [Methanohalobium evestigatum
Z-7303]
gi|298287281|gb|ADI73247.1| protein of unknown function DUF52 [Methanohalobium evestigatum
Z-7303]
Length = 265
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 79/287 (27%), Positives = 133/287 (46%), Gaps = 32/287 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG +Y + L +E+ + + + + G + PHAGY +SG AAYAF
Sbjct: 1 MRNPTVAGQFYALDPSGLEKEISRCFGDIDISQQ-NIHGAVIPHAGYMFSGPVAAYAFAQ 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + + + +LGP+H A+S T + TP+G++ D E+ + L + ++D
Sbjct: 60 L-PEADTYI-ILGPNHTGLGSAVAISQDT-WSTPLGNVETDKELGKSLAGS----ILD-- 110
Query: 124 VDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN- 178
VDE EHS+E+ +P+L F H K++PI +G + E G + + + +
Sbjct: 111 VDEMAHMYEHSIEVQIPFLQYRFS-HGFKVLPICMGLQDEETALEVGNQISSAITESNKK 169
Query: 179 -FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
F SSD H+ + V + + DK ++ IE D F + + E
Sbjct: 170 IFVIASSDLTHF----------EPENVANDT----DKYLIESIENMDVHEFYRRISERSA 215
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
T CG PIS L + + + K L+Y S+ S V YA+
Sbjct: 216 TACGYGPISAMLTVAKSLGGRTG-KLLKYGTSADVTGDSSSVVGYAA 261
>gi|372489024|ref|YP_005028589.1| putative dioxygenase [Dechlorosoma suillum PS]
gi|359355577|gb|AEV26748.1| putative dioxygenase [Dechlorosoma suillum PS]
Length = 268
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 129/297 (43%), Gaps = 40/297 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAY 59
M +R A+ AG +Y + +L ++D +L +A P + ++ +I PHAGY YS AA
Sbjct: 1 MPSLRPAAVAGLFYPADPAQLGADIDAYLAQARPPAAAGPIKALIVPHAGYLYSAPIAAT 60
Query: 60 AFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+ + P + I RV LLGP+H A+ T + TP+G LP+D E + +
Sbjct: 61 AYAALAPQAGRIRRVILLGPTHRVAVRGLAVPTVDSFATPLGPLPIDAEARQRIA----- 115
Query: 118 ELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+L + D+ EHS+E+ LP+L ++ ++P+ VGA + E A ++ +
Sbjct: 116 DLPQVVADDGAHALEHSLEVQLPFLQRLLGP--FTLLPLAVGAASGEAVA---QVLERLW 170
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
VSSD H Y Y + E LD+ I D +
Sbjct: 171 GGEETLILVSSDLSH------YQPYAEA--------ERLDQATCARIAAARADLAPQ--- 213
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
CG PI+ L L ++ + L S R+ V YA+ A + D
Sbjct: 214 ----EACGAIPINGLL--LAAARHGLRPRLLDRRNSGDTAGDRERVVGYAAFALEPD 264
>gi|307609923|emb|CBW99449.1| hypothetical protein LPW_12241 [Legionella pneumophila 130b]
Length = 447
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 120/251 (47%), Gaps = 31/251 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y D+ L + + +L +A + K P + ++ PHAGY YSG AA A+ +
Sbjct: 3 VRQPAVAGYFYPDDPLTLKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGSVAASAYAS 61
Query: 64 I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
+ SI+++ +LGP+H Y A + TP+G++ D E++ + + + L
Sbjct: 62 LRDKKNSINKIIILGPAHRLYFKGIAYDPVDKFATPLGEIQQDKELLTQIIDLPYVYSLP 121
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ + EH +E+ LP+ +F KI+P++VG N ++ A RL A+
Sbjct: 122 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVVGETNPQDVA---RLIARVWGGDDTLL 174
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSD H+ DKK + SI+ L+ G +I+ + C
Sbjct: 175 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 214
Query: 241 GRHPISVFLHM 251
G P+ FL+
Sbjct: 215 GYFPLRGFLYF 225
>gi|54297126|ref|YP_123495.1| hypothetical protein lpp1171 [Legionella pneumophila str. Paris]
gi|397663639|ref|YP_006505177.1| hypothetical protein LPO_1184 [Legionella pneumophila subsp.
pneumophila]
gi|397666836|ref|YP_006508373.1| hypothetical protein LPV_1323 [Legionella pneumophila subsp.
pneumophila]
gi|53750911|emb|CAH12322.1| hypothetical protein lpp1171 [Legionella pneumophila str. Paris]
gi|395127050|emb|CCD05235.1| conserved protein of unknown function [Legionella pneumophila
subsp. pneumophila]
gi|395130247|emb|CCD08485.1| conserved protein of unknown function [Legionella pneumophila
subsp. pneumophila]
Length = 447
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 120/251 (47%), Gaps = 31/251 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y D+ L + + +L +A + K P + ++ PHAGY YSG AA A+ +
Sbjct: 3 VRQPAVAGYFYPDDPLTLKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGSVAASAYAS 61
Query: 64 I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
+ SI+++ +LGP+H Y A + TP+G++ D E++ + + + L
Sbjct: 62 LRDKKNSINKIIILGPAHRLYFKGIAYDPVDKFATPLGEIQQDKELLTQIIDLPYVYSLP 121
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ + EH +E+ LP+ +F KI+P++VG N ++ A RL A+
Sbjct: 122 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVVGETNPQDVA---RLIARVWGGDDTLL 174
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSD H+ DKK + SI+ L+ G +I+ + C
Sbjct: 175 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 214
Query: 241 GRHPISVFLHM 251
G P+ FL+
Sbjct: 215 GYFPLRGFLYF 225
>gi|157374143|ref|YP_001472743.1| hypothetical protein Ssed_1004 [Shewanella sediminis HAW-EB3]
gi|157316517|gb|ABV35615.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3]
Length = 270
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 95/199 (47%), Gaps = 16/199 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWL-------REAGLPKSPEVR--GVIAPHAGYS 51
+ IR+A+ A +Y + L ++ +L E G+ + R +I PHAGY
Sbjct: 2 LSSIRQAAVADLFYPADPVELTAQIKQFLCATNDKCGEQGIKPGGKQRPKALIVPHAGYI 61
Query: 52 YSGRAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
YSG A AF + D I V LLGP+H Y CAL A + TP+G +P++ + E
Sbjct: 62 YSGAVAGRAFARLLDDNKIETVVLLGPAHRVYLQGCALPEANTFATPLGKIPIEQQNYEA 121
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
L A + D+ EH +E+ LP+L + K++PI+VG + A LF+
Sbjct: 122 LSALDAVTISDL-PHLDEHCLEVQLPFLQLCLQR--FKLLPIVVGECSPSVVADILELFS 178
Query: 171 KYVDDPSNFFSVSSDFCHW 189
+ + VS+D H+
Sbjct: 179 Q---RENTLIVVSTDLSHY 194
>gi|20089490|ref|NP_615565.1| hypothetical protein MA0601 [Methanosarcina acetivorans C2A]
gi|23822315|sp|Q8TT38.1|Y601_METAC RecName: Full=MEMO1 family protein MA_0601
gi|19914397|gb|AAM04045.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length = 267
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 131/288 (45%), Gaps = 32/288 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAF 61
++R+ + AG +Y + + L +EL GL + EV G + PHAGY YSG+ AA+ +
Sbjct: 2 EMRQPAVAGQFYPLHCENLEKELTRCFE--GLEIREREVFGAVCPHAGYIYSGKVAAHVY 59
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ P + + V L GP+H Y +LS T +KTP+G + +DLE+ + G D
Sbjct: 60 ATL-PEADTYV-LFGPNHTGYGSPVSLSRET-WKTPLGTIDVDLELADGF--LGSIVDTD 114
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
EHS+E+ LP+L F G KI+PI +G + + G L A V +
Sbjct: 115 ELGHTYEHSIEVQLPFLQYRF-GRDFKILPICMGMQDKDTAVEVGSLVADLVSESGKRAV 173
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSDF H+ + + D + + +I LD GM L + ++
Sbjct: 174 IIASSDFTHYETAEHARETDSE---VIDAILKLDVPGM-----------YDSLYRRNASV 219
Query: 240 CGRHPISVFLHMLGNCSTKI---KIKFLRYEQSSQCKTKRDSSVSYAS 284
CG PI+ L + S K+ + L Y S + V YA+
Sbjct: 220 CGYGPIAAML----SASQKLGGSRATLLEYANSGDVSGDMSAVVGYAA 263
>gi|312137117|ref|YP_004004454.1| hypothetical protein Mfer_0894 [Methanothermus fervidus DSM 2088]
gi|311224836|gb|ADP77692.1| protein of unknown function DUF52 [Methanothermus fervidus DSM
2088]
Length = 280
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 128/288 (44%), Gaps = 23/288 (7%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVRG------VIAPHAGYSYSGRAA 57
R+ + AG++Y + K L ++ + E G P+V G VIAPHAGY YSG A
Sbjct: 3 RKPAVAGTFYESDPKALKNRIEWCFKHELGPGDFPKVGGDRNIVGVIAPHAGYIYSGPIA 62
Query: 58 AYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A+A+ I F +L P+H +L ++TP+G + +D E EL +
Sbjct: 63 AHAYYRIVEDGFPETFIILCPNHTGMGSSVSLMAEGEWETPLGSVKIDEEFSSELFKKSE 122
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+D EHS E+H+P+L + F+ KIVPI + + E + +
Sbjct: 123 IIDIDDTAHSQEHSCEVHIPFL-QYFKKDF-KIVPICMWLQDLETTKEVAQAIVDTIK-- 178
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
S++ D S ++ HY+ + + E+ DK +D I D K + + +
Sbjct: 179 ----SLNKDIVLIAST-DFTHYEPQ-----EIAESTDKKIIDAILKLDEKVMYKRINDLN 228
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
T+CG P+SV L K + L+Y S R S V YAS
Sbjct: 229 ATMCGYGPVSVVLISSKKLGAN-KARLLKYGTSGDVTGDRSSVVGYAS 275
>gi|294496196|ref|YP_003542689.1| hypothetical protein Mmah_1548 [Methanohalophilus mahii DSM 5219]
gi|292667195|gb|ADE37044.1| protein of unknown function DUF52 [Methanohalophilus mahii DSM
5219]
Length = 265
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 125/287 (43%), Gaps = 24/287 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y K L +EL R+ L S + G++ PHAGY +SG AAYA+
Sbjct: 1 MRQPTVAGQFYPLGIKPLKKELGHCFRDLEL-SSEDAYGIVVPHAGYMFSGPVAAYAYSK 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ +F GP+H Y ALST + TP+GD+ D ++ E L G MD
Sbjct: 60 LPKADTYVIF--GPNHTGYGSAVALST-QAWSTPLGDVEADEKMGEML--AGSIIDMDEI 114
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
EHS+E+ LP+L F G KI+PI +G + + G AK + F
Sbjct: 115 AHMYEHSIEVQLPFLQYRF-GDSFKILPICMGLQDIDTAREVGLEVAKAAEKTGKKVVFI 173
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSDF H+ H++ + D +D + D + F + E T CG
Sbjct: 174 ASSDFTHYQP--------------HETAKETDMYLIDALLNMDVEEFDRRREERSATACG 219
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
PI + ++ + L+Y S + V YA+ K
Sbjct: 220 YGPI-IAMYTAAEYMGANTAELLKYATSGDMTGDYSAVVGYAAILVK 265
>gi|14520755|ref|NP_126230.1| hypothetical protein PAB0370 [Pyrococcus abyssi GE5]
gi|23822337|sp|Q9V189.1|Y539_PYRAB RecName: Full=MEMO1 family protein PYRAB05390
gi|5457971|emb|CAB49461.1| Predicted dioxygenase [Pyrococcus abyssi GE5]
gi|380741294|tpe|CCE69928.1| TPA: Dioxygenase [Pyrococcus abyssi GE5]
Length = 291
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/291 (25%), Positives = 133/291 (45%), Gaps = 8/291 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR + AG +Y + + L E L+ + R+ G S ++ +APHAGY +SG A+ +
Sbjct: 2 IRYPAVAGQFYPEG-ETLVEMLEEFFRDLGEQGNSRKITAGVAPHAGYVFSGYTASRTYK 60
Query: 63 NIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I + F++ GP+H A+ + TP+G + +D E+ +E+ K +D
Sbjct: 61 AIYEDGLPETFVIFGPNHTGLGSPIAVYPEGDWVTPLGKVKIDSELAKEIVKLSKIADLD 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ EHS+E+ LP++ + E G +IVPI +G + + GR + +
Sbjct: 121 DLAHKYEHSIEVQLPFIQYIAEKAGTDFRIVPITLGIQDEDVSEALGRAVFEAAEALGRD 180
Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
V S+DF H+GS + Y+ + + + ++ D + I D + + E D+
Sbjct: 181 VIVIASTDFMHYGSFYGYVPFRGRANELPNMVKEWDMRIIRRILDFDLKGMFEEIREMDH 240
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
T+CG + + + + + L Y S + D+ V YAS +
Sbjct: 241 TMCGPGGVGAGI-VYSRLMNAREAELLHYTTSFEVSRSTDAIVGYASIVMR 290
>gi|149188588|ref|ZP_01866880.1| hypothetical protein VSAK1_15917 [Vibrio shilonii AK1]
gi|148837498|gb|EDL54443.1| hypothetical protein VSAK1_15917 [Vibrio shilonii AK1]
Length = 267
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 21/197 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP------EVRGVIAPHAGYSYSGRAA 57
IR + AG +Y ++++L +++ L + + ++RG+I PHAGY +SG A
Sbjct: 3 IRPPAVAGRFYNKDTEQLTRQVNQLLTGTNVRDAETCNVQNDLRGLIVPHAGYVFSGETA 62
Query: 58 AYAFGNIDPTSIS--RVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A+ + + RV L+GPSH CAL + ++TP+G + +D + +E L
Sbjct: 63 GLAYHQLQSVAQQFLRVILVGPSHRVAFHGCALPSVGAFETPLGRVSIDRDCVELLADNS 122
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD- 174
+ D EHS+E+ LP+L V + +++PI+ G V+A AK ++
Sbjct: 123 MVSINDQ-AHAQEHSLEVQLPFLQTVLDD--FQLLPIVTGQVSALE-------IAKLIEP 172
Query: 175 --DPSNFFSVSSDFCHW 189
D +S+D H+
Sbjct: 173 IWDSKTLLVISTDLSHF 189
>gi|159042151|ref|YP_001541403.1| hypothetical protein Cmaq_1590 [Caldivirga maquilingensis IC-167]
gi|189038957|sp|A8M9U2.1|Y1590_CALMQ RecName: Full=MEMO1 family protein Cmaq_1590
gi|157920986|gb|ABW02413.1| protein of unknown function DUF52 [Caldivirga maquilingensis
IC-167]
Length = 282
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 136/294 (46%), Gaps = 30/294 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--------VRGVIAPHAGYSYSGRA 56
R+ + AG +Y + L ++ + PKS V VI PHAGY YSG
Sbjct: 4 RKPAVAGMFYEASKDGLINQIKWSIEHELGPKSTMNVKENRQYVLSVIVPHAGYVYSGPV 63
Query: 57 AAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+A+ + +VF ++GP+H+ A+ T+ ++TP+G + +D EV +++KA
Sbjct: 64 AAHAYVEVGKYIKPKVFVIIGPNHYGVGSPAAIMTSGTWETPLGQVEIDEEVAKQIKAKV 123
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
K D E EHS+E+ +P++ +F G +IVPI++ + G ++ +D
Sbjct: 124 KDLAEDPIAFEREHSIEVQVPFIQYLFPGS--RIVPIVLWNQTIDLSRRLGSAISEVIDG 181
Query: 176 PSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
+ V SSD HY+ K ++ +++ I D + F +
Sbjct: 182 RAGEVVVVASSDLN---------HYEPHEVTTDKDMKVIER-----ILNMDEEGFYTVMD 227
Query: 234 EYDNTICGRHPI-SVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+YD ++CG I + ++ ST +K+ L++ S +V YAS A
Sbjct: 228 KYDVSVCGFGAIMTAIVYSRKQGSTGVKL--LKHATSGDTSGYLLETVGYASIA 279
>gi|430760607|ref|YP_007216464.1| protein of unknown function DUF52 [Thioalkalivibrio nitratireducens
DSM 14787]
gi|430010231|gb|AGA32983.1| protein of unknown function DUF52 [Thioalkalivibrio nitratireducens
DSM 14787]
Length = 267
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/288 (27%), Positives = 127/288 (44%), Gaps = 34/288 (11%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
E +R + AG +Y D+ L E+ + A P+ P + +IAPHAGY YSG AA
Sbjct: 5 ETVREPAVAGLFYPDDPAALRREVRALMSGATAPAPERPAPKALIAPHAGYRYSGPVAAS 64
Query: 60 AFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+ + I RV LLGPSH A + A Y+TP+G +P+D ++
Sbjct: 65 AYAALGAAVAHIRRVVLLGPSHRVPFRGIAATGAGAYRTPLGAIPVDQAANASIRDLPGV 124
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-EAMYGRLFAKYVDDP 176
+D+ EHS+E+HLP+L + + +VP++VG + A+ RL+ P
Sbjct: 125 VTLDLA-HGPEHSLEVHLPFLQLLLDD--FALVPLVVGDADPNQVTAVLERLWGG----P 177
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
VSSD H+ HG +++ ++LD IE+ ++ +L D
Sbjct: 178 ETLIVVSSDLSHY------------HG--YRTAQSLDAETCRAIES-----MREDMLTPD 218
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG P++ L + + + L S R V Y +
Sbjct: 219 RA-CGCRPLAGLLR--AGRAHGLTVHTLDLRNSGDTAGPRTEVVGYGA 263
>gi|441504807|ref|ZP_20986799.1| Putative dioxygenase [Photobacterium sp. AK15]
gi|441427389|gb|ELR64859.1| Putative dioxygenase [Photobacterium sp. AK15]
Length = 222
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 79/161 (49%), Gaps = 8/161 (4%)
Query: 43 VIAPHAGYSYSGRAA--AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
+I PHAGY YSG A AY++ I RV L+GPSH Y+ CAL + TP+G
Sbjct: 1 MIVPHAGYIYSGEIASRAYSYLKQQADRIKRVILIGPSHRYFFQGCALPGVDYFATPLGQ 60
Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
+P+D + +E L+ E+ D EH +E+ LP+L + ++PIL V+ +
Sbjct: 61 IPIDTQSVEILRKIDDIEVSD-QAHAFEHCLEVQLPFLQRCLNE--FTLLPILTSDVSPK 117
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK 201
A R+ D + +SSD H+ S D++
Sbjct: 118 TVA---RIINSVWQDENTLLVISSDLSHYHSYTEAQTLDQQ 155
>gi|325298560|ref|YP_004258477.1| AMMECR1 domain-containing protein [Bacteroides salanitronis DSM
18170]
gi|324318113|gb|ADY36004.1| AMMECR1 domain protein [Bacteroides salanitronis DSM 18170]
Length = 496
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/282 (29%), Positives = 136/282 (48%), Gaps = 34/282 (12%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYA 60
+++R + AG++Y + L + + ++ +A + E++ VI PHAGY YSG A+ A
Sbjct: 27 KRVRMPAVAGAFYPAKADTLRKMIVSFMSDASSVEPDKEMQAVIVPHAGYVYSGAIASKA 86
Query: 61 FGNIDP-TSISRVFLLGPSHHYYTPKCALSTATV------YKTPIGDLPLDLEVIEE-LK 112
I + R+FLLGPSH + A A+V Y TP+G++ +D E + L+
Sbjct: 87 IKAIPADAAYKRIFLLGPSH-----RAAFDGASVNGAFGSYATPLGEVQVDKEACDALLR 141
Query: 113 ATGKFELMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
A F V EA EH +E+ LP L +V + IVPI++G + +
Sbjct: 142 ADTVFTY----VPEAHVREHCLEVQLPLL-QVHLRDVPSIVPIIIGTQDLAKLRRIAQAL 196
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE--ALDKMGMDIIETGDPDA 227
Y N F +SSDF H+ S + DK G ++IE +L K +D +E +
Sbjct: 197 QPYFTS-DNLFVISSDFSHYPSYEDANRVDKATG---EAIESGSLRKF-LDALEMNAREG 251
Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQS 269
+ L + CG+ PI+V L ++ ++++ L Y S
Sbjct: 252 VPQLL----TSACGQSPIAVLLSLM-EGRKDLQVRHLAYCNS 288
>gi|15606539|ref|NP_213919.1| hypothetical protein aq_1336 [Aquifex aeolicus VF5]
gi|7388474|sp|O67355.1|Y1336_AQUAE RecName: Full=MEMO1 family protein aq_1336
gi|2983762|gb|AAC07322.1| hypothetical protein aq_1336 [Aquifex aeolicus VF5]
Length = 374
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/289 (29%), Positives = 134/289 (46%), Gaps = 32/289 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAAYAFG 62
R SHAG Y +N +EL ++ E S + RG++ PH + +
Sbjct: 103 REPSHAGQAYPEN----PQELKNFIEETVNKNSEKFKARGILVPHMDLRVASGVYGSVYS 158
Query: 63 NIDPTSISRVFLLGPSHHYY-TPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM- 120
I V LLG SH+++ TP L +TP+GDL +D+E +EEL+ ++L
Sbjct: 159 AIKENEYDTVVLLGVSHYFHETPFSVLPLDL--RTPLGDLKVDIERVEELQKMFDYDLSH 216
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D+ + EHS+E +L +F +K++P +V + ++ K ++D N
Sbjct: 217 DVLAYKNEHSIEFQTIFLKYLFPE--VKVIPAIVSYGDTKSLKEIAHKITKVLEDSQNPL 274
Query: 181 SVSS-DFCHWGSRFNYMH-YDKKHGVIHKSIEALDKMGMDII-ETGDPDAFKKYLLEYDN 237
+SS DF H G +F H YD D+ ++++ E + +AF LL+ DN
Sbjct: 275 IISSVDFSHVGRKFGDPHSYDPS---------PRDREYINLLAELKNEEAFN--LLQSDN 323
Query: 238 T---ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
I G+ VFL +L N K + K L Y+ + T DS VSYA
Sbjct: 324 NRTRIDGQFTNFVFLEILKNLGVK-EGKLLDYDVYHEAPT--DSKVSYA 369
>gi|350534319|ref|ZP_08913260.1| hypothetical protein VrotD_24493 [Vibrio rotiferianus DAT722]
Length = 264
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 94/203 (46%), Gaps = 14/203 (6%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA--YAF 61
+R + AG +Y + ++L+ + W + + P +R ++ PHAGY YSG AA F
Sbjct: 3 VRPPAVAGRFYPHSPQQLSSMIQSWEKPSACRFQPVIRAMVVPHAGYIYSGEVAAGVCQF 62
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I RV LLGPSHH+ CAL + + TP+G ++EV E +A +
Sbjct: 63 LKSQADRIHRVILLGPSHHFSFLGCALPSVDWFLTPLG----NIEVDREWRAILAQQQDV 118
Query: 122 ICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ D+A EH +E+ LP L +I+PIL ++ A L D +
Sbjct: 119 VVSDDAHAPEHCLEVQLPLLQTYLGA--FRILPILTSRISP---ARMAELLDPVWLDEES 173
Query: 179 FFSVSSDFCHWGSRFNYMHYDKK 201
VSSD H+ S D++
Sbjct: 174 LLVVSSDLSHYHSYRQAQRLDQQ 196
>gi|291614303|ref|YP_003524460.1| hypothetical protein Slit_1844 [Sideroxydans lithotrophicus ES-1]
gi|291584415|gb|ADE12073.1| protein of unknown function DUF52 [Sideroxydans lithotrophicus
ES-1]
Length = 263
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 122/288 (42%), Gaps = 32/288 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +R + AG +Y L+ E+ +L EA P++ + +I PHAGY YSG AA A
Sbjct: 1 MLAVRPPAVAGMFYPAEPGELSHEILEFLTEAQ-PRTLIPKALIVPHAGYVYSGAIAASA 59
Query: 61 FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + S I RV LLGP+H AL + TP+G + LD E++ + + +
Sbjct: 60 YATLKSISARIRRVVLLGPTHRVAVRGLALPDVEAFDTPLGRVMLDTELMRIISSLPQVT 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ EHS+E+ LP+L V ++P+ VG AE A +
Sbjct: 120 -VSAAAHAQEHSLEVQLPFLQSVLND--FTLLPLAVGMATAEEVA---EVLELAWGGEET 173
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+SSD H Y+ Y V ++I +I+E P A ++
Sbjct: 174 LIVISSDLSH------YLPYAAAQSVDGRTIR-------NILELKQPIAHEEA------- 213
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG PIS + + ++K L S +++ V Y S A
Sbjct: 214 -CGGTPISGLI--IVARKHQLKPHLLDLRNSGDTAGSKNNVVGYTSIA 258
>gi|344200325|ref|YP_004784651.1| mediator of ErbB2-driven cell motility-containing protein
[Acidithiobacillus ferrivorans SS3]
gi|343775769|gb|AEM48325.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Acidithiobacillus ferrivorans SS3]
Length = 273
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 79/295 (26%), Positives = 134/295 (45%), Gaps = 40/295 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV------RGVIAPHAGYSYSG 54
M+ +R A+ AG++Y + L E+ L AG + E + +I PHAGY YSG
Sbjct: 1 MQFVRPAAVAGTFYPAKASVLRAEVARLL--AGAESADEAAAVAWPKAIIVPHAGYVYSG 58
Query: 55 RAAAYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
AA A+ + I RV LLGP+H AL +TP+G + +D + L
Sbjct: 59 AVAASAYALLAKGRGHIRRVVLLGPAHRVPFRGLALPGVQAMQTPLGVVAVDQAGVAAL- 117
Query: 113 ATGKFELMDICVDEA-EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
G +++++ A EH++E+ LP++ +V +G + ++P++VG +A EA+ + K
Sbjct: 118 -AGLPQVLEMPAAHAQEHALEVQLPFIQEVLDG--VSVIPLVVG--DARPEAV-AEVLEK 171
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+SSD H Y Y + + H ++E + + F
Sbjct: 172 LWGGEETVIVISSDLSH------YHPYTEARAIDHHTVEEILR-------------FDPL 212
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+E+D CG PI+ FL + ++ + + S RD+ V YA+ A
Sbjct: 213 PIEHDQA-CGATPINGFLPV--ARKHHLRPRLVDLCNSGDTAGSRDAVVGYAAVA 264
>gi|261403545|ref|YP_003247769.1| hypothetical protein Metvu_1434 [Methanocaldococcus vulcanius M7]
gi|261370538|gb|ACX73287.1| protein of unknown function DUF52 [Methanocaldococcus vulcanius M7]
Length = 287
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 133/301 (44%), Gaps = 33/301 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEVRG-------VIAPHAGYSY 52
M +R + AG +Y + L E ++ +L + G PKS VRG ++ PHAGY Y
Sbjct: 1 MTNVRYPAVAGLFYPSHPDELIEMIEQCYLHKFG-PKSMPVRGTYEKPIGLLCPHAGYVY 59
Query: 53 SGRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
SG A+++ +DP + +LGP+H ++ +++TP+GD+ D E +
Sbjct: 60 SGPIQAHSYYELSKRVDPLEETTAVILGPNHTGLGSGVSVCNG-IWRTPLGDVKCDEEFV 118
Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYG 166
EEL + +D EHS+E+ LP+L ++ KIVPI + + E G
Sbjct: 119 EELWRKCEIIDLDETAHMNEHSIEVQLPFLKHLELLNIAKFKIVPICMMFQDYETSVEVG 178
Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
AK + + SSD H+ + KK ++ K DI+E D
Sbjct: 179 YFVAKVAKELGRRVVIIASSDLTHYEPQ---EVASKKDAIVIK----------DILEM-D 224
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
A + ++ Y+ ++CG P+ L + K K L Y S S V YAS
Sbjct: 225 EKALYEDVINYNISMCGYGPVIAMLRAMKILGAK-NSKLLAYATSGDITGDYSSVVGYAS 283
Query: 285 A 285
A
Sbjct: 284 A 284
>gi|148358756|ref|YP_001249963.1| hypothetical protein LPC_0635 [Legionella pneumophila str. Corby]
gi|296106801|ref|YP_003618501.1| putative dioxygenase [Legionella pneumophila 2300/99 Alcoy]
gi|148280529|gb|ABQ54617.1| hypothetical protein LPC_0635 [Legionella pneumophila str. Corby]
gi|295648702|gb|ADG24549.1| Predicted dioxygenase [Legionella pneumophila 2300/99 Alcoy]
Length = 447
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 120/251 (47%), Gaps = 31/251 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y D+ L + + +L +A + K P + ++ PHAGY YSG AA A+ +
Sbjct: 3 VRQPAVAGYFYPDDPLILKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGSVAASAYAS 61
Query: 64 I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
+ SI+++ +LGP+H Y A + TP+G++ D E++ + + + L
Sbjct: 62 LRDKKNSINKIIILGPAHRLYFKGIAYDPVDKFATPLGEIQQDKELLTQIIDLPYVYSLP 121
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ + EH +E+ LP+ +F KI+P++VG N ++ A RL A+
Sbjct: 122 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVVGETNPQDVA---RLIARVWGGDDTLL 174
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSD H+ DKK + SI+ L+ G +I+ + C
Sbjct: 175 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 214
Query: 241 GRHPISVFLHM 251
G P+ FL+
Sbjct: 215 GYFPLRGFLYF 225
>gi|390939216|ref|YP_006402953.1| putative dioxygenase [Sulfurospirillum barnesii SES-3]
gi|390192323|gb|AFL67378.1| putative dioxygenase [Sulfurospirillum barnesii SES-3]
Length = 266
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/253 (28%), Positives = 115/253 (45%), Gaps = 30/253 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAA-- 57
M K R+AS AG +Y DN L + + + + R +I PHAGY YSG A
Sbjct: 1 MHKYRKASVAGMFYPDNCSELKHYISHFDKATPTFTQDLFPRALIVPHAGYIYSGYTANL 60
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
AYA+ I RV ++GPSH Y +++ Y TP GD+ +DL L+ KF
Sbjct: 61 AYAYTASRRCDIERVVVIGPSHRVYIAGASIAFFDAYHTPCGDITIDLSYAHALQK--KF 118
Query: 118 ELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR---LFAKYV 173
+D + EHS E+ +P++ F+ +V I+ G + YG L + +
Sbjct: 119 AFLDFHPNAHEEHSTEVQMPFIHHYFKK--ANVVEIVYGDIT------YGELSSLINEVL 170
Query: 174 DDPSNFFSVSSDFCHWGSR----------FNYMHYDKKHGVIHKSIEALDKMGMD-IIET 222
D +S+D H+ + N ++ H + ++ EA K+G+ ++ +
Sbjct: 171 KDDKTLIVISTDLSHFHTEEEANRRDTFCINAINTLDLHAL--ENCEACGKLGVKALVHS 228
Query: 223 GDPDAFKKYLLEY 235
AFK Y L+Y
Sbjct: 229 ARLHAFKPYFLDY 241
>gi|315655805|ref|ZP_07908703.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333]
gi|315489869|gb|EFU79496.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333]
Length = 488
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 103/218 (47%), Gaps = 28/218 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------GLPKSPEVRGVIAPH 47
M +R+ + AGS+Y +S L + +D L++A G P ++ GV PH
Sbjct: 1 MSFVRKPAVAGSFYPQDSGELQKTVDELLKKATALITNDVSRDSQGRPLR-QLMGVQVPH 59
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
AGY YSG AA F + RV + GP+H ALS AT + TP+G++P+D
Sbjct: 60 AGYIYSGLIAAVGFQLVSALRPRRVIIAGPTHRVGCHGIALSEATAFATPLGEVPVDQVA 119
Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--E 162
+ + E + KF VD EH++E+ LP+L VF IVP+ VG E+ +
Sbjct: 120 QQYLVEHSSVAKF-FEPSHVD--EHALEVQLPFLQTVFRDIDFAIVPLAVGDCTPEDVCD 176
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK 200
++ L + VSSD H Y+ YD+
Sbjct: 177 SLQTLLSIPGENPDDTLIIVSSDLSH------YLAYDE 208
>gi|403178136|ref|XP_003336578.2| hypothetical protein PGTG_17889 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375173354|gb|EFP92159.2| hypothetical protein PGTG_17889 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 293
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 79/167 (47%), Gaps = 29/167 (17%)
Query: 147 IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGV-- 204
IKIV ILVG + E Y L A Y+ DP N F VSS G F + +Y H +
Sbjct: 86 IKIVLILVGLIMFAEELKYSALLAPYLADPENLFVVSSVLSLGGHDF-HTYYQDPHLIEL 144
Query: 205 --------------IHKSIEALDKMGMDIIETGDP----DAFKKYLLEYDNTICGRHPIS 246
IH+SIE L K + I DP FK YL NTI RHPIS
Sbjct: 145 AKQLSPGSPLPEYPIHQSIENLYKEAITAISLQDPKQAHKTFKSYLKRKKNTIGSRHPIS 204
Query: 247 VFL----HMLGNCST----KIKIKFLRYEQSSQCKTKRDSSVSYASA 285
V L + C T + +++ + YEQSS C T RDSSVSY SA
Sbjct: 205 VLLGSMIKLQQTCLTPPPHEQQLQLVCYEQSSPCFTVRDSSVSYVSA 251
>gi|15678074|ref|NP_275188.1| hypothetical protein MTH45 [Methanothermobacter thermautotrophicus
str. Delta H]
gi|7388365|sp|O26151.1|Y045_METTH RecName: Full=MEMO1 family protein MTH_45
gi|2621080|gb|AAB84552.1| conserved protein [Methanothermobacter thermautotrophicus str.
Delta H]
Length = 277
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 124/291 (42%), Gaps = 29/291 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPEVR---GVIAPHAGYSYSGR 55
IRR + AG++Y + L ++ W L LP +R GVIAPHAGY YSG
Sbjct: 2 IRRPAVAGAFYERDPAALRRRIE-WCFEHELGPGTLPAVGSMRRIKGVIAPHAGYMYSGP 60
Query: 56 AAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+A+ + I + ++ P+H +L ++TP+G + +D E+ E +
Sbjct: 61 VAAHAYHELVSDGIPGTLVIICPNHTGMGSGVSLMQQGAWETPLGTVEIDSELAEAIVRE 120
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+D AEHS E+H+P++ + +IVP+ + E A G A +
Sbjct: 121 SGIIDLDETAHLAEHSCEVHVPFIQYFTDN--FRIVPVTMWMQGHETAADVGHAVASAIR 178
Query: 175 DPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ +V S+DF H+ + EA D+ +D I D +
Sbjct: 179 ETGRDAAVIASTDFTHYSPQ--------------DIAEATDRRIIDRITAMDDTGMYGVI 224
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
E + T+CG P++ + + + LRY S R S V YA
Sbjct: 225 SELNATMCGYGPVAATI-IASRILGATECDLLRYATSGDVTGDRSSVVGYA 274
>gi|52841403|ref|YP_095202.1| hypothetical protein lpg1169 [Legionella pneumophila subsp.
pneumophila str. Philadelphia 1]
gi|378777079|ref|YP_005185516.1| hypothetical protein lp12_1149 [Legionella pneumophila subsp.
pneumophila ATCC 43290]
gi|52628514|gb|AAU27255.1| hypothetical protein lpg1169 [Legionella pneumophila subsp.
pneumophila str. Philadelphia 1]
gi|364507893|gb|AEW51417.1| hypothetical protein lp12_1149 [Legionella pneumophila subsp.
pneumophila ATCC 43290]
Length = 453
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 130/284 (45%), Gaps = 33/284 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y D+ L + + +L +A + K P + ++ PHAGY YSG AA A+ +
Sbjct: 9 VRQPAVAGYFYPDDPLTLKQTVLNFLDQAPIHK-PAPKAILVPHAGYVYSGAVAASAYAS 67
Query: 64 I--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELM 120
+ +I+++ LLGP+H Y A + TP+G++ D E++ + + + L
Sbjct: 68 LRDKKDTINKIILLGPAHRLYFKGIAYDPVDKFATPLGEIDQDKELLTQIIDLPYVYSLP 127
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ + EH +E+ LP+ +F KI+P+++G N ++ A RL A+
Sbjct: 128 E--AHQNEHCLEVQLPFCQMIFSK--FKILPLVIGETNPQDVA---RLIARIWGGDDTLL 180
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSD H+ DKK + SI+ L+ G +I+ + C
Sbjct: 181 IISSDLSHYLPYHIAQREDKKTCL---SIDTLN--GEEILH---------------DAAC 220
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
G P+ FL+ + + L S ++ V YA+
Sbjct: 221 GYFPLRGFLYF--GRQNHVYSRLLDLRNSGDTAGDKERVVGYAA 262
>gi|206890214|ref|YP_002248185.1| hypothetical protein THEYE_A0338 [Thermodesulfovibrio yellowstonii
DSM 11347]
gi|206742152|gb|ACI21209.1| conserved protein [Thermodesulfovibrio yellowstonii DSM 11347]
Length = 269
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/288 (25%), Positives = 129/288 (44%), Gaps = 32/288 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR--GVIAPHAGYSYSGRAAAYAFG 62
RR S AG +Y NSK L +E++ ++ P + ++ G I PHAGY YSG A +
Sbjct: 4 RRPSVAGYFYPSNSKELLQEIEEYM-----PTTLKIDAFGAICPHAGYVYSGSVAGAVYS 58
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ P I L+GP+H Y AL T ++ P+G + + E+ +++ DI
Sbjct: 59 KLKPKEI--FILIGPNHTGYGENVALMTEGEWEIPLGSIKIHSELAKKIIEKSPLAKDDI 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF--- 179
EHS+E+ LP++ K+ +IVPI + ++ ++ + V++ NF
Sbjct: 117 HAHLHEHSLEVQLPFIYKL--NPEAQIVPITLKMLSLKDCLSLAQGIVLAVNE-LNFRDK 173
Query: 180 --FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
S+D H+ + + +D + ++ I DP+ +LE+
Sbjct: 174 VVVVASTDMSHY--------------LPDELARKVDSLAVEKIRHFDPEGLYNTVLEHGI 219
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
++CG P L+ K +I+ ++Y S++ D V Y A
Sbjct: 220 SMCGFVPTVTMLYTTKLLGAK-EIQVIKYTTSAEVSRDYDKVVGYLGA 266
>gi|148264297|ref|YP_001231003.1| hypothetical protein Gura_2247 [Geobacter uraniireducens Rf4]
gi|146397797|gb|ABQ26430.1| protein of unknown function DUF52 [Geobacter uraniireducens Rf4]
Length = 267
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 128/285 (44%), Gaps = 27/285 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR+ + AG +YT++ K L +L+ + + +V GVIAPHAGY YSG A +G+
Sbjct: 2 IRQPAVAGQFYTNDPKELRAQLERMVTVETFRE--KVLGVIAPHAGYMYSGAVAGRLYGS 59
Query: 64 ID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
++ PT V +LGP+H + A+ A + TP+G + ++ + +K +
Sbjct: 60 VEVPPT----VIILGPNHQGVGARAAIYPAGKWLTPLGAVSINSRLSALIKKHAPLVEEE 115
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNF 179
EHS+E+ +P+L F + IVP+ +G + + G A+ + +
Sbjct: 116 TSAHHFEHSLEVQVPFLQ--FVRSDVTIVPLCLGFGDYASCKSLGSGIARAIREYGEEVL 173
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSD H+ S M +K D+ ++ I +P+ + E D T+
Sbjct: 174 IVASSDMTHYESAV--MAKEK------------DEHAINEILALNPEGLLRVCREKDITM 219
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG P +V L + + K +RY S V+YA+
Sbjct: 220 CGVIPATVMLVAVNELGAT-QAKLVRYATSGDVTGDNRQVVAYAA 263
>gi|83591788|ref|YP_425540.1| hypothetical protein Rru_A0448 [Rhodospirillum rubrum ATCC 11170]
gi|386348477|ref|YP_006046725.1| hypothetical protein F11_02290 [Rhodospirillum rubrum F11]
gi|83574702|gb|ABC21253.1| Protein of unknown function DUF52 [Rhodospirillum rubrum ATCC
11170]
gi|346716913|gb|AEO46928.1| hypothetical protein F11_02290 [Rhodospirillum rubrum F11]
Length = 282
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 121/285 (42%), Gaps = 41/285 (14%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSP---EVRGVIAPHAGYSYSGRAAAYAFGNIDP 66
AG +Y + +LA+E+D L AG+ SP + V+APHAG +YSG AA+AF +
Sbjct: 22 AGRFYPGDPAQLAQEVDSLL--AGVKASPAPLRPKAVVAPHAGLAYSGPTAAHAFAPLAR 79
Query: 67 --TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD---LEVIEELKATGKFELMD 121
+I RV +LGPSH ALS Y +P+G +P+D + + +L G+ E
Sbjct: 80 HRDAIHRVVVLGPSHRLAFRGLALSGTRAYDSPLGPVPVDHSWQDRLSDLAFVGRLEQ-- 137
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EH++E+ LP+L ++ IVPI+ G AE A + P
Sbjct: 138 --AHGPEHALEVELPFLQRILG--PFSIVPIVCGEAAAEEVAQA---LERVWGGPETLIV 190
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
VSSD H+ Y ++ G +I LD + E CG
Sbjct: 191 VSSDLSHY---LEYEDARRQDGRTATAIARLDGAPIGPREA-----------------CG 230
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
P+ L + + + + L S RD V Y + A
Sbjct: 231 HVPLGGLLCL--ARAAGMTTRCLDLRNSGDTAGPRDRVVGYGAWA 273
>gi|163796279|ref|ZP_02190240.1| Predicted dioxygenase [alpha proteobacterium BAL199]
gi|159178421|gb|EDP62963.1| Predicted dioxygenase [alpha proteobacterium BAL199]
Length = 467
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 93/188 (49%), Gaps = 8/188 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R AG +Y + + L+ +L A + + ++APHAG+ +SG A A+
Sbjct: 12 VRAPQVAGRFYPNGHNACIDLLERYLSAARPSGGVDAKVMVAPHAGWIFSGPIAGTAYAA 71
Query: 64 IDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
P + ISRV LLGP+H A ++A + TP+G LP+D + + A +MD
Sbjct: 72 FLPKAEQISRVVLLGPAHRVPFKGIATTSADGWATPLGTLPVDWDALRTALAIPGVAVMD 131
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
E EHS+E+H P++ +VF IVP+LVG A ++ + + P
Sbjct: 132 EAFAE-EHSLEVHAPFVQRVFP--RAAIVPLLVGDAPA---SLVSGVVERLWGGPETLIL 185
Query: 182 VSSDFCHW 189
VSSD H+
Sbjct: 186 VSSDLSHF 193
>gi|443695207|gb|ELT96159.1| hypothetical protein CAPTEDRAFT_96193, partial [Capitella teleta]
Length = 156
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 84/154 (54%), Gaps = 6/154 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR+ + +G++Y D+ L + + +A P+ + +I PHAG+ YSG AA A+
Sbjct: 3 IRQPAVSGTFYPDSPSALDHFVKELVNDAK-PRDFSPKALIVPHAGFIYSGPVAASAYRL 61
Query: 64 IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+D +I RV LLGPSH AL + TP+G +PLD +EE+K + +++D
Sbjct: 62 LDSMKENIRRVVLLGPSHRVPLQGMALPACEAFSTPLGTIPLDTAAMEEIKTFSQMQILD 121
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
EHS+E+ P++ + K++PI+VG
Sbjct: 122 KA-HAYEHSLEVQCPFIQTCLDD--FKLIPIVVG 152
>gi|39996929|ref|NP_952880.1| MEMO-like protein [Geobacter sulfurreducens PCA]
gi|39983817|gb|AAR35207.1| MEMO-like protein [Geobacter sulfurreducens PCA]
Length = 267
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 114/252 (45%), Gaps = 30/252 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +YTD+ +RL EEL +R P+ V GVIAPHAGY YSG A +G+
Sbjct: 2 VRQPAVAGQFYTDDPRRLREELGHLIRPVPAPR--RVTGVIAPHAGYMYSGAIAGAVYGS 59
Query: 64 IDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I I R V +LGP+HH +L + +P+G++P++ + + D+
Sbjct: 60 I---VIPRTVVILGPNHHGLGAAASLYPDGTWLSPLGEVPIEQRLSSLVLEHVPQAEPDV 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA-----VNAENEAMYGRLFAKYVDDPS 177
EHS+E+ +P+L + + IVP+ +G E + R A Y ++
Sbjct: 117 IAHRFEHSLEVQVPFLR--YLNSDVAIVPMCLGGGGYGWCRQVGEGL-ARAIAAYGEE-- 171
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
SSD H+ S +S D+ + + D + K +
Sbjct: 172 VLIVASSDMTHYES--------------AESARLKDEAALSCVLALDAEGLLKVCRQRGI 217
Query: 238 TICGRHPISVFL 249
T+CG P +V L
Sbjct: 218 TMCGVIPSTVML 229
>gi|220923217|ref|YP_002498519.1| hypothetical protein Mnod_3293 [Methylobacterium nodulans ORS 2060]
gi|219947824|gb|ACL58216.1| protein of unknown function DUF52 [Methylobacterium nodulans ORS
2060]
Length = 274
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 94/194 (48%), Gaps = 21/194 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA------GLPKSPEVRGVIAPHAGYSYSGRAAA 58
R A+ AG +Y D+S RL ++ L EA G PK+ +IAPHAGYSYSG AA
Sbjct: 11 RPAAAAGLFYPDDSHRLRSAVERLLAEAPRAAASGRPKA-----IIAPHAGYSYSGPVAA 65
Query: 59 YAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AF ++ I RV L+GP+H+ + ++TP+G +PLDL +
Sbjct: 66 AAFAPLEGAEQDIERVILIGPAHYIRFRGICIPMVEGFETPLGRVPLDLNALAVSAEMPG 125
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-EAMYGRLFAKYVDD 175
+ D EH++E+ LP+L + VP+LVG E A+ RL+
Sbjct: 126 VQAAD-APHAPEHALEVELPFLQVLLS--RFAAVPLLVGDARPEEVAAVLDRLWGG---- 178
Query: 176 PSNFFSVSSDFCHW 189
P VSSD H+
Sbjct: 179 PETVIVVSSDLSHF 192
>gi|409095141|ref|ZP_11215165.1| hypothetical protein TzilA_00680 [Thermococcus zilligii AN1]
Length = 292
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 72/292 (24%), Positives = 134/292 (45%), Gaps = 8/292 (2%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAF 61
++R + AGS+Y D+ + L L+ + R+ G S ++ +APHAGY +SG A+ +
Sbjct: 2 EVRYPAVAGSFYPDD-ETLIGMLERFFRDLGEEGSGRKITAGVAPHAGYVFSGYTASRTY 60
Query: 62 GNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
I + F +LGP+H A+ + TP+G + +D E+ + + +
Sbjct: 61 KAIFEDGLPETFVILGPNHTGLGSPIAVYPGGEWLTPLGGIEVDSEMAKAIARLSGIADL 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D EHS+E+ +P++ + E G +KIVPI +G + E G+ + +
Sbjct: 121 DKRAHTYEHSIEVQVPFIQYIAEQAGKDVKIVPITLGIQDEEVARDLGKAIFEASKELGR 180
Query: 179 FFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
V S+DF H+G + Y+ + + + I+ D + I D D + + D
Sbjct: 181 DVVVIASTDFTHYGVVYGYVPFRARADELPHRIKEWDFQVIRHILDFDVDGMFNEVRKLD 240
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+T+CG + + + + ++ + L Y S + D+ V YAS +
Sbjct: 241 HTMCGPGGVGTAI-VYSRLAGAVEAELLHYTTSFEVSRSTDAVVGYASIVMR 291
>gi|48478030|ref|YP_023736.1| hypothetical protein PTO0958 [Picrophilus torridus DSM 9790]
gi|48430678|gb|AAT43543.1| hypothetical conserved protein DUF52 [Picrophilus torridus DSM
9790]
Length = 268
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 124/284 (43%), Gaps = 19/284 (6%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR AG++Y D+ L E K ++ GV+ PHAGY YSG+ A ++
Sbjct: 2 IRDPYVAGAFYPDSESELLNYFKNLEPERFDIKFNKILGVVVPHAGYEYSGKIAWASYSI 61
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ + R ++GP+H+ Y A+ + ++TP+GD +D E+ E+L D
Sbjct: 62 LKEYNARRFLIIGPNHYGYPFYPAIYSNGSWRTPLGDSIIDNELSEQLIMKSGIIKNDPE 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVS 183
EHS+E+ LP+L +F+ VP+++G + E G P S
Sbjct: 122 THSTEHSIEVQLPFLQYIFKNQF-TFVPLILGDQSYEISRDLGETILSLDRIP--LIIAS 178
Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
SD H+ S YDK + E + + DII D F + +Y T CG
Sbjct: 179 SDLNHYES------YDKNN-------EKDEIIINDIINLRIKDFFND-IYKYRITACGFG 224
Query: 244 PISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
I+V +++ KI + LR+ S R V YA+ A
Sbjct: 225 AIAVLMYITKKNHGKIAL--LRHANSGDASGDRKRVVGYAAMLA 266
>gi|410670088|ref|YP_006922459.1| hypothetical protein Mpsy_0882 [Methanolobus psychrophilus R15]
gi|409169216|gb|AFV23091.1| hypothetical protein Mpsy_0882 [Methanolobus psychrophilus R15]
Length = 265
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 125/284 (44%), Gaps = 26/284 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y K L +EL ++ + K + G + PHAGY YSG AA+A+
Sbjct: 1 MRQTTVAGQFYPRGPKDLKKELSRCFKDIEI-KPRNIIGAVVPHAGYVYSGEVAAHAYAL 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ P + + VF GP+H Y ALS T + TP+G + D E+ + L G D
Sbjct: 60 L-PKADTYVF-FGPNHTGYGSAVALSQDT-WVTPLGVVDTDRELGKLL--AGSIVDYDEV 114
Query: 124 VDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FF 180
EHS+E+ +P+L F EG KI+P+ +G + E G A+ F
Sbjct: 115 AHRFEHSIEVQIPFLQHRFAEG--FKILPVCMGLQDEETAVEIGVEVARAARASGKKVIF 172
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSDF H+ S D H +I I +DI E F + E + T C
Sbjct: 173 IASSDFTHYQSEEKAS--DNDHYLIDPVIM------IDIPE------FYRRREERNITAC 218
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
G PI+ + G K L+Y S R V YA+
Sbjct: 219 GFGPIAATMAAAGEYGAA-KASLLKYATSGDVTGDRSQVVGYAA 261
>gi|407461723|ref|YP_006773040.1| hypothetical protein NKOR_00910 [Candidatus Nitrosopumilus
koreensis AR1]
gi|407045345|gb|AFS80098.1| hypothetical protein NKOR_00910 [Candidatus Nitrosopumilus
koreensis AR1]
Length = 275
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 106/243 (43%), Gaps = 20/243 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-----KSPEVRGVIAPHAGYSYSGRAAA 58
IR+ AG +Y + L E +D L+ P K +V G+I PHAGY YSG A
Sbjct: 2 IRKPVVAGQFYPETKDELEEMIDSCLQHKYGPGNQIQKDEKVYGIICPHAGYVYSGPTAC 61
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+++ I + V ++GP+H A ++TP+G + +D E +E+ K+
Sbjct: 62 HSYKAISSKNPELVIIIGPNHFGIGKDVATMIDAQWETPLGLVEVDSEAAQEIANNSKYI 121
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+D +HS+E+ +P L KI+PI++ + E G A+ +
Sbjct: 122 EIDEFSHSKDHSLEVQIPMLQSTLSNKF-KILPIILRDQSLEMAKDVGNAVAQIAKSRNA 180
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSDF H+ N Y + +I +E M++ + F L+E T
Sbjct: 181 MIIASSDFTHY--EENSFAYSQDKALIEPILE------MNV------EKFYSVLMEKKIT 226
Query: 239 ICG 241
CG
Sbjct: 227 ACG 229
>gi|149922583|ref|ZP_01911012.1| hypothetical protein PPSIR1_06693 [Plesiocystis pacifica SIR-1]
gi|149816539|gb|EDM76035.1| hypothetical protein PPSIR1_06693 [Plesiocystis pacifica SIR-1]
Length = 270
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 91/191 (47%), Gaps = 10/191 (5%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWL---REAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
+ R A+ AG +Y LA LD L A + + +I PHAGY YSG AA
Sbjct: 6 RTRPAALAGRFYPGRPGELARTLDAMLDAASRARAAPARRPKALIVPHAGYVYSGPIAAS 65
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
+ + + +V LLGP+H Y AL A + TP+G++P+D + L+ F
Sbjct: 66 GYAQLRGHGVDKVVLLGPAHRVYVEGLALPGADRFATPLGEVPVDAALAAALRPL-PFVE 124
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-EAMYGRLFAKYVDDPSN 178
EHS+E+HLP+L V +G ++P++VG E A+ R++ P
Sbjct: 125 TSAEAHALEHSLEVHLPFLQAVLDGPF-SLLPLVVGGATPEQVSAVLERVWGG----PET 179
Query: 179 FFSVSSDFCHW 189
+SSD H+
Sbjct: 180 LVVISSDLSHF 190
>gi|121998650|ref|YP_001003437.1| hypothetical protein Hhal_1871 [Halorhodospira halophila SL1]
gi|121590055|gb|ABM62635.1| protein of unknown function DUF52 [Halorhodospira halophila SL1]
Length = 268
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 95/193 (49%), Gaps = 14/193 (7%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG---VIAPHAGYSYSGRAAAY 59
K+R + AG +Y ++ L + L EAG P+ R ++ PHAGY +SG AAA
Sbjct: 7 KVRPPAVAGRFYPGEAETLRRAVRELLAEAGEPERDPTRAPHAMVLPHAGYPFSGAAAAR 66
Query: 60 AFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGK 116
+ I P + V LLGP+H AL A TP+G +P+ + E L+ G
Sbjct: 67 GYQRIVPIREQLRHVVLLGPAHFVDLSGIALPAADALATPLGTVPVSATLRERALEHPGV 126
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+D E EHS+E+HLP+L + + ++P++VG AE+ GRL + D
Sbjct: 127 H--IDDSAHEREHSLEVHLPFLQTLLDD--FDVLPLVVGRGPAES---CGRLIEQLWQD- 178
Query: 177 SNFFSVSSDFCHW 189
VSSD H+
Sbjct: 179 DTLVVVSSDLSHF 191
>gi|451947838|ref|YP_007468433.1| putative dioxygenase [Desulfocapsa sulfexigens DSM 10523]
gi|451907186|gb|AGF78780.1| putative dioxygenase [Desulfocapsa sulfexigens DSM 10523]
Length = 273
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 129/288 (44%), Gaps = 32/288 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLRE-AG---LPKSPE-VRGVIAPHAGYSYSGRAAA 58
IR + AG +Y + AE L +RE AG LP + + V ++PHAGY YSG AA
Sbjct: 5 IRSPAVAGRFYPGD----AESLRLAIRELAGNHPLPATKQKVLAALSPHAGYVYSGGVAA 60
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
G ++ V LLGP+H ALSTAT + P+G +P++ ++L
Sbjct: 61 ETLGRVEVP--ETVILLGPNHTGKGAPVALSTAT-WNMPMGSVPVNRHFAQDLLEATDII 117
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDP 176
D EHS+E+ +P+L V + IVPI + V+ + G A+ + +D
Sbjct: 118 EEDELAHTYEHSLEVQIPFLQMVQPN--LSIVPITISHVSYQILNELGLALAEVIKQNDS 175
Query: 177 SNFFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
N V S+D H+ R DKK + K I + DP+ + ++ +
Sbjct: 176 GNILIVASTDMTHYEPREA---ADKKDHYVLKKIADM-----------DPNILYRSVIGH 221
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
+ ++CG P++V L K + +RY S + V YA
Sbjct: 222 NISMCGIMPVTVALIAALELGAT-KTELVRYTDSGEISGDTKQVVGYA 268
>gi|308273082|emb|CBX29686.1| hypothetical protein N47_J06670 [uncultured Desulfobacterium sp.]
Length = 493
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 129/292 (44%), Gaps = 36/292 (12%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEVRGVIAPHAGYSYSGRA 56
+ +R S+AGS+Y L + ++ +A +P ++ +I PHAGY YSG
Sbjct: 27 DNVREPSYAGSFYPSAKYELEKTINDLTYQAKKTAFTVPSGKTLKALILPHAGYIYSGLT 86
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A++A S+V ++GP H C LS A Y+TP+G + + + +
Sbjct: 87 ASHASLVFSENQFSKVIIMGPDHRVGFNNCVLSDADAYETPLGYVRIHKDAAKLRSEKEL 146
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN--AENEAMYGRLFAKYVD 174
F + + D EHS+E+ LP+L + +I+P+++G+ + A EA+ L
Sbjct: 147 FRSLPVS-DINEHSIEVVLPFLQTYLKN--FEIIPVVIGSADYHAIAEAIDPLL------ 197
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
D SSD H Y+ Y +++E D +++I + + K
Sbjct: 198 DNKTLLVASSDLSH------YLKY-------RQAVEK-DNETINMIVNLENNKLSK---- 239
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
DN+ CG+ PI + + G K + + Y S + +R V YA A
Sbjct: 240 LDNSACGKIPILAIMDLAGKYGWKPVL--VHYSNSGETAGERSRVVGYAVIA 289
>gi|409912350|ref|YP_006890815.1| MEMO-like protein [Geobacter sulfurreducens KN400]
gi|298505941|gb|ADI84664.1| MEMO-like protein [Geobacter sulfurreducens KN400]
Length = 267
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 115/257 (44%), Gaps = 40/257 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +YTD+ +RL EEL +R P+ V GVIAPHAGY YSG A +G+
Sbjct: 2 VRQPAVAGQFYTDDPRRLREELGHLIRPVPAPR--RVTGVIAPHAGYMYSGAIAGAVYGS 59
Query: 64 IDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I I R V +LGP+HH +L + +P+G++P++ + + D+
Sbjct: 60 I---VIPRTVVILGPNHHGLGAAASLYPDGTWLSPLGEVPIEQRLSSLVLEHVAQAEPDV 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG----------RLFAKY 172
EHS+E+ +P+L + + IVP+ +G + YG R A Y
Sbjct: 117 IAHRFEHSLEVQVPFLR--YLNSDVAIVPMCLG------DGGYGWCRQVGEGLARAIAAY 168
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
++ SSD H+ S +S D+ + + D + K
Sbjct: 169 GEE--VLIVASSDMTHYES--------------AESARLKDEAALSCVLALDAEGLLKVC 212
Query: 233 LEYDNTICGRHPISVFL 249
+ T+CG P +V L
Sbjct: 213 RQRGITMCGVIPSTVML 229
>gi|188996255|ref|YP_001930506.1| hypothetical protein SYO3AOP1_0307 [Sulfurihydrogenibium sp.
YO3AOP1]
gi|188931322|gb|ACD65952.1| protein of unknown function DUF52 [Sulfurihydrogenibium sp.
YO3AOP1]
Length = 265
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 102/198 (51%), Gaps = 20/198 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAY 59
M +R+ + A +YT N +RL + +L +A L PE G++ PHAGY YSG AA
Sbjct: 1 MLTVRKPAVANMFYTGNKERLLYTIKNYLNKAPLYDYIPE--GIVVPHAGYIYSGPVAAV 58
Query: 60 AFG---NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATG 115
++ N+DP+ ++ L+GPSHH Y + ++TP+G + ++ E I + K
Sbjct: 59 SYKQLLNLDPSKHYKILLIGPSHHVYFNGVSYGFYDYWETPLGKVKVNKEAIIKFSKEHP 118
Query: 116 KFEL-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA---ENEAMYGRLFAK 171
F L ++ + EHS+E+ +P+L V + +I+P + G +++ EN +
Sbjct: 119 DFPLTLNTLPHQREHSLEVQVPFLQVVLKD--FEILPFVYGDIDSSIVEN-------IIE 169
Query: 172 YVDDPSNFFSVSSDFCHW 189
Y D +S+D H+
Sbjct: 170 YFKDNDTVVIISTDLSHY 187
>gi|386001676|ref|YP_005919975.1| hypothetical protein Mhar_0982 [Methanosaeta harundinacea 6Ac]
gi|357209732|gb|AET64352.1| hypothetical protein Mhar_0982 [Methanosaeta harundinacea 6Ac]
Length = 260
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 125/282 (44%), Gaps = 30/282 (10%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
AG +Y ++L EEL A + +RG + PHAGY YSG AA +G + P
Sbjct: 2 AGQFYPLQPQKLLEELKAAFSGASGGEPLPIRGAVVPHAGYIYSGAVAAEVYGRL-PERE 60
Query: 70 SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
+ V L+GP+HH A+S + + TP+G + D+E+ + L G +D EH
Sbjct: 61 TFV-LIGPNHHGLGLPVAISRDS-WMTPLGTVESDVELADAL--AGSILEVDESAHLYEH 116
Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV--SSDFC 187
S+E+ +P+L F G KI+PI +G + E G + ++ SSDF
Sbjct: 117 SLEVQIPFLQARFSG--FKILPIAMGLQDEETAVEVGEAVGEAAKSLGRDCTIIASSDFT 174
Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
H+ + D ++IEA+ +M D + + + + T CG PIS
Sbjct: 175 HYEPQEEARRKDA------QAIEAILRM--------DVPSVYRAVYGQNLTACGYGPISA 220
Query: 248 FL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ +LG + K+ LRY S V Y + A
Sbjct: 221 TITAARILGAETGKL----LRYSTSGDVLGDYSQVVGYGAIA 258
>gi|340344183|ref|ZP_08667315.1| dioxygenase [Candidatus Nitrosoarchaeum koreensis MY1]
gi|339519324|gb|EGP93047.1| dioxygenase [Candidatus Nitrosoarchaeum koreensis MY1]
Length = 277
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 123/289 (42%), Gaps = 24/289 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSP------EVRGVIAPHAGYSYSGR 55
+IR + AG +Y + K L + + + +L G K P ++ GVI PHAGY YSG
Sbjct: 2 QIRTPAVAGMFYPNEKKELKKVIKECFLHNFGPGKIPPSNIKKKIFGVICPHAGYVYSGP 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A +F I + ++GP+H A T T ++TP+G++ ++ E+ EE+
Sbjct: 62 IACNSFYEISSDLPNLFIIIGPNHWGIGSSVATMTDTKWETPLGEIEVNSEIAEEISKLT 121
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
+D EHS+E+ +P L ++ KIVPI + + E G AK
Sbjct: 122 DVIEIDNFSHSREHSLEVQIPILQEIAAN--FKIVPIALINQSKEIAIKVGIAVAKIAQK 179
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
SSDF H+ S N Y++ +I +E D D F L +
Sbjct: 180 NKVMIIGSSDFTHYES--NKFAYEQDAALIEPILEL------------DVDRFYDILHKK 225
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
D + CG I+ + K + L+Y S + S V Y S
Sbjct: 226 DISACGYGAIASTMIACKEIGAT-KGELLKYATSGDISGDKSSVVGYGS 273
>gi|383788631|ref|YP_005473200.1| hypothetical protein CSE_09710 [Caldisericum exile AZM16c01]
gi|381364268|dbj|BAL81097.1| hypothetical protein CSE_09710 [Caldisericum exile AZM16c01]
Length = 264
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 133/287 (46%), Gaps = 33/287 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF- 61
KIR + AG++Y ++ + L + ++ A L + ++ +IAPHAGY YSG A Y++
Sbjct: 2 KIRDTAVAGAFYPEDREELKSMIRRFVESAPLEDTDNIKAIIAPHAGYIYSGPIAGYSYK 61
Query: 62 --GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
NID +V ++ PSH+ Y ++ YKTP+G + + + + L+ +
Sbjct: 62 QLTNIDYLKNIKVIIIAPSHYAYFHGASVGLFDAYKTPLGFVNVSKDAQKLLQLEEFHFI 121
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+D ++ EHS+E+ LP+L +IVPIL + +E + G L + D +
Sbjct: 122 LDAHLE--EHSIEVQLPFLQYTLSH--FEIVPILYSEI-SEESLLRGIL---SIFDENTI 173
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
VS+D H+ + Y KK K+++ L+ KK++L +
Sbjct: 174 LVVSTDLSHY---YPYDIAIKKDSHCIKAVQELN---------------KKHILNCEA-- 213
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG+ I+ + + +K K L+Y S + V Y SA
Sbjct: 214 CGKTGIATVIDFAK--ANNLKSKVLKYATSGDTAGPKTQVVGYLSAV 258
>gi|304390953|ref|ZP_07372905.1| conserved hypothetical protein [Mobiluncus curtisii subsp. curtisii
ATCC 35241]
gi|304325836|gb|EFL93082.1| conserved hypothetical protein [Mobiluncus curtisii subsp. curtisii
ATCC 35241]
Length = 488
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 102/218 (46%), Gaps = 28/218 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------GLPKSPEVRGVIAPH 47
M +R+ + AGS+Y + L + +D L++A G P ++ GV PH
Sbjct: 1 MSFVRKPAVAGSFYPQDPGELQKTVDELLKKATALITNDVSRDSQGRPLR-QLMGVQVPH 59
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
AGY YSG AA F + RV + GP+H ALS AT + TP+G++P+D
Sbjct: 60 AGYIYSGLIAAVGFQLVSTLRPRRVIIAGPTHRVGCHGIALSEATAFATPLGEVPVDQVA 119
Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--E 162
+ + E + KF VD EH++E+ LP+L VF IVP+ VG E+ +
Sbjct: 120 QQYLVEHSSVAKF-FEPSHVD--EHALEVQLPFLQTVFRDVDFAIVPMAVGDCTPEDVCD 176
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK 200
++ L + VSSD H Y+ YD+
Sbjct: 177 SLQTLLSIPGENPDDTLIIVSSDLSH------YLGYDE 208
>gi|15668579|ref|NP_247377.1| hypothetical protein MJ_0403 [Methanocaldococcus jannaschii DSM
2661]
gi|2495960|sp|Q57846.1|Y403_METJA RecName: Full=MEMO1 family protein MJ0403
gi|1591108|gb|AAB98390.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
2661]
Length = 287
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 137/304 (45%), Gaps = 33/304 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEVRG-------VIAPHAGYSY 52
M KIR + AG +Y + L + ++ +L + G PKS V G ++ PHAGY Y
Sbjct: 1 MNKIRYPAVAGLFYPSHPDELIDMIEQCYLHKFG-PKSMPVHGTYEKPIGLLCPHAGYVY 59
Query: 53 SGRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
SG A+++ +D + V +LGP+H ++ +++TP+GD+ D E +
Sbjct: 60 SGPIQAHSYYELSKRVDALEETTVVILGPNHTGLGSGVSVMDG-IWRTPLGDVKCDEEFV 118
Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYG 166
EEL + +D EHS+E+ LP+L ++ KIVPI + + E G
Sbjct: 119 EELWRKCEIVDLDETAHLNEHSIEVQLPFLKHLELLNIAKFKIVPICMMFQDYETAVEVG 178
Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
AK + + V SSD H+ + KK ++ K DI+E +
Sbjct: 179 YFIAKIAKELNRRIVVIASSDLTHYEPQ---EIASKKDAIVIK----------DILEMNE 225
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ ++ ++ Y+ ++CG P+ L + + K K L Y S + V YAS
Sbjct: 226 KELYED-VVNYNISMCGYGPVIAMLKAMKTLGAE-KAKLLAYATSGDITGDYSAVVGYAS 283
Query: 285 AAAK 288
A +
Sbjct: 284 AIVE 287
>gi|296109608|ref|YP_003616557.1| protein of unknown function DUF52 [methanocaldococcus infernus ME]
gi|295434422|gb|ADG13593.1| protein of unknown function DUF52 [Methanocaldococcus infernus ME]
Length = 287
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 135/297 (45%), Gaps = 30/297 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEV-----RGVIAPHAGYSYSGRAA 57
+R + AG +Y N + L E ++ +L E G K P + G++ PHAGY YSG
Sbjct: 1 MRYPAVAGLFYPSNPEELLEMIESCYLHELGPKKLPSIGSYSILGLVCPHAGYIYSGPIQ 60
Query: 58 AYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
A+++ + DP+ +LGP+H ++S A +++TP+GD+ D E I+ L
Sbjct: 61 AHSYLALSKHADPSEEITAVILGPNHTGLGSGVSVS-ADIWRTPLGDMEPDKEFIDLLWR 119
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
+ +D EHS+E+ LP+L ++ E KIVPI + + E G AK
Sbjct: 120 ECEIIDLDETAHLQEHSIEVQLPFLKHLELLEIAKFKIVPICMMFQDYETAIEVGYFIAK 179
Query: 172 YVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ + V SSD HY+ + + K D++ ++ I D
Sbjct: 180 VAKELNRRVVVIASSDLS---------HYEPQEIAVEK-----DRIVINDIIKMDEKRLY 225
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ +++Y+ ++CG P+ + L + + K L Y S + V Y+S A
Sbjct: 226 EDVVKYNISMCGYGPVMAMIRALKELGGE-RAKLLAYATSGDVTGDYSAVVGYSSIA 281
>gi|14591400|ref|NP_143479.1| hypothetical protein PH1626 [Pyrococcus horikoshii OT3]
gi|3258055|dbj|BAA30738.1| 294aa long hypothetical protein [Pyrococcus horikoshii OT3]
Length = 294
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 135/296 (45%), Gaps = 11/296 (3%)
Query: 1 MEKIRR-ASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAA 58
M K+RR + AG +Y + L E L + ++ G + + +APHAGY +SG A+
Sbjct: 1 MAKMRRYPAVAGQFYPEGDA-LIEMLSSFFKDLGEEGTKRTITAGVAPHAGYVFSGFTAS 59
Query: 59 YAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+ I + VF++ GP+H AL + TP+G + +D + +E+
Sbjct: 60 RTYKAIYEDGLPEVFVIFGPNHTGLGSPIALYPEGEWITPMGSIKVDSKFAKEIVKRSGI 119
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGR-LF-AKYV 173
+D + EHS+E+ LP++ + E G +KIVPI +G + E GR +F A
Sbjct: 120 ADLDDLAHKYEHSIEVQLPFIQYIAEKAGVEVKIVPITLGIQDEEVSRSLGRSIFEASTS 179
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
S+DF H+GS + Y+ + + + + D + I D D +
Sbjct: 180 LGRDTIIIASTDFMHYGSFYGYVPFRGRPEELPNMVRDWDMRIIRRILDFDLDGMFSEIR 239
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIK-IKFLRYEQSSQCKTKRDSSVSYASAAAK 288
E ++T+CG P V ++ + K K + L Y S + D+ V YAS K
Sbjct: 240 EMNHTMCG--PGGVGAGIIYSRLMKAKEAELLHYTTSFEVSRSPDAIVGYASIIMK 293
>gi|315656270|ref|ZP_07909161.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii
ATCC 35242]
gi|315493272|gb|EFU82872.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii
ATCC 35242]
Length = 488
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 101/218 (46%), Gaps = 28/218 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------GLPKSPEVRGVIAPH 47
M +R+ + AGS+Y + L +D L++A G P ++ GV PH
Sbjct: 1 MSFVRKPAVAGSFYPQDPGELQRTVDELLKKATALITNDVSRDSQGRPLR-QLMGVQVPH 59
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
AGY YSG AA F + RV + GP+H ALS AT + TP+G++P+D
Sbjct: 60 AGYIYSGLIAAVGFQLVSTLRPRRVIIAGPTHRVGCHGIALSEATAFATPLGEVPVDQVA 119
Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--E 162
+ + E + KF VD EH++E+ LP+L VF IVP+ VG E+ +
Sbjct: 120 QQYLVEHSSVAKF-FEPSHVD--EHALEVQLPFLQTVFRDIDFAIVPLAVGDCTPEDVCD 176
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK 200
++ L + VSSD H Y+ YD+
Sbjct: 177 SLQTLLSIPGENPDDTLIIVSSDLSH------YLAYDE 208
>gi|304314295|ref|YP_003849442.1| hypothetical protein MTBMA_c05340 [Methanothermobacter marburgensis
str. Marburg]
gi|302587754|gb|ADL58129.1| conserved hypothetical protein [Methanothermobacter marburgensis
str. Marburg]
Length = 280
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 129/294 (43%), Gaps = 35/294 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGW-----LREAGLP---KSPEVRGVIAPHAGYSYSGRA 56
RR + AG++Y + L ++ W L GLP + +++GVIAPHAGY YSG
Sbjct: 3 RRPAVAGAFYERDPAALRRRIE-WCFNHELGPGGLPARGSARKIKGVIAPHAGYMYSGPV 61
Query: 57 AAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+A+ + I F ++ P+H +L ++TP+G + +D E+ E +
Sbjct: 62 AAHAYHELVSDGIPETFVIICPNHTGMGSGVSLMQRGAWETPLGVVDIDSELAEVIVRES 121
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD- 174
+D EHS E+H+P++ + F +IVPI + + E G A ++
Sbjct: 122 GIIDIDGTAHLGEHSCEVHVPFI-QYFTDRF-RIVPITMWMQDQETATDVGHAVAAAIER 179
Query: 175 -DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
+ S+DF H Y D + + IE + +M D +
Sbjct: 180 TERDAVVIASTDFTH------YSPADVAGSIDSRIIERITEM--------DDTGMYGVIT 225
Query: 234 EYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
E + T+CG P++ + MLG + + L+Y S + S V YAS
Sbjct: 226 ELNATMCGYGPVAASITASRMLGAS----ECRLLKYATSGDITGDQSSVVGYAS 275
>gi|432329928|ref|YP_007248071.1| putative dioxygenase [Methanoregula formicicum SMSP]
gi|432136637|gb|AGB01564.1| putative dioxygenase [Methanoregula formicicum SMSP]
Length = 263
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 124/294 (42%), Gaps = 44/294 (14%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLRE----AGLPKSPEVRGVIAPHAGYSYSGRAAA 58
K+R + AG +Y L + L+ + AG P G+++PHAGY YSG+ AA
Sbjct: 2 KMRACAVAGMFYPREPSHLEQLLEKFFSAVPDVAGRPL-----GIVSPHAGYIYSGQVAA 56
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
AFG I P ++GPSH Y C + ++TP+G + D E + L
Sbjct: 57 TAFGAIPPDFSGTFVIIGPSHRGYI-TCV--SEVPWETPLGVVDTDTEFVRAL------- 106
Query: 119 LMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG-RLFAKYV 173
D+ DE EHS+E+ +P++ F +I P+++G + G ++FA
Sbjct: 107 --DMETDELSHRDEHSIEVQVPFIKYRFP--RARIAPVMMGQQDYPGAVRLGEKIFAAIR 162
Query: 174 DDPSNFFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ V SSDF H+ V K + D +D + + D F + +
Sbjct: 163 RTKRDVRIVASSDFSHY--------------VPEKKAKEEDLFAIDALRSLDTREFYRRI 208
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
E T CG PI+ + K + +RY S + V YAS A
Sbjct: 209 AEKRITACGYGPITAMVTACTGLGAK-AAELIRYATSGETTGDYREVVGYASIA 261
>gi|375082610|ref|ZP_09729663.1| hypothetical protein OCC_01034 [Thermococcus litoralis DSM 5473]
gi|374742688|gb|EHR79073.1| hypothetical protein OCC_01034 [Thermococcus litoralis DSM 5473]
Length = 292
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/291 (24%), Positives = 132/291 (45%), Gaps = 8/291 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR + AGS+Y + L LD + + G L + ++ +APHAGY +SG A+ +
Sbjct: 2 IRYPAVAGSFYPSGGE-LKLMLDEFFSDLGELGEKRKITSGVAPHAGYIFSGYTASRTYK 60
Query: 63 NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I S+ F ++GP+H A+ +++TP+GD+ +D E+ + + +D
Sbjct: 61 AIYEDSLPETFVIIGPNHTGLGSPVAIYPEGIWRTPMGDVEVDAELAKTIAKHSSLADLD 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFEG--HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ EHS+E+ +P++ + E +KIVPI +G + E GR + +
Sbjct: 121 DFAHKYEHSLEVQVPFIQYISEKAEKEVKIVPIALGLQDEEVAEDLGRAIFEASQELGRD 180
Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
V S+D H+G + Y+ + + + I D + I D + + D+
Sbjct: 181 VVVIASTDMMHYGYAYGYVPFRARGDDLLGRIREWDFRVIQKILEFDYKGMFDEIRKMDH 240
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
T+CG ++ + + S ++ + L Y S + D+ V Y S +
Sbjct: 241 TMCGPGGVATAI-VFSRLSGAVEAEVLHYTTSFEVSRSTDAIVGYVSIVMR 290
>gi|297625218|ref|YP_003686981.1| hypothetical protein PFREUD_00190 [Propionibacterium freudenreichii
subsp. shermanii CIRM-BIA1]
gi|296920983|emb|CBL55520.1| Hypothetical protein PFREUD_00190 [Propionibacterium freudenreichii
subsp. shermanii CIRM-BIA1]
Length = 265
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/214 (31%), Positives = 103/214 (48%), Gaps = 15/214 (7%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPK---SP-EVRGVIAPHAGYSYSGRAAAYA 60
R + AGS+Y RL ELD +A P SP V+ ++ PHAGY YSG AA
Sbjct: 4 RPTAVAGSFYPGQRGRLTRELDTLFEQADRPPAQVSPARVKAIVVPHAGYVYSGTTAATG 63
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ + I+RV +LGP+H AL+ A + TP+G +P+D ++ +A +
Sbjct: 64 YELLRGRPINRVVVLGPTHRVGIRGMALAGADAFDTPLGSVPVDPDLTAIAEAV-PLVVT 122
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAKYVDDPSNF 179
V EHS+E+ LP++ V ++P+ VG A+ E A+ ++ +
Sbjct: 123 RPDVHAREHSLEVQLPFIRTVLP--QASVLPVAVGDALPDEVAALLDAVWGES----DTA 176
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD 213
+SSD H+ S + D G I K + ALD
Sbjct: 177 IVISSDLSHYHSYDDARKLDA--GTIDK-VLALD 207
>gi|18978010|ref|NP_579367.1| hypothetical protein PF1638 [Pyrococcus furiosus DSM 3638]
gi|397652713|ref|YP_006493294.1| hypothetical protein PFC_10415 [Pyrococcus furiosus COM1]
gi|23822355|sp|Q8U0F2.1|Y1638_PYRFU RecName: Full=MEMO1 family protein PF1638
gi|18893791|gb|AAL81762.1| hypothetical protein PF1638 [Pyrococcus furiosus DSM 3638]
gi|393190304|gb|AFN05002.1| hypothetical protein PFC_10415 [Pyrococcus furiosus COM1]
Length = 292
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 132/291 (45%), Gaps = 8/291 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IRRA AG +Y D+++ L E L + + G S + +APHAGY +SG A+ +
Sbjct: 2 IRRAVVAGQFYPDDAE-LVEMLKRFFTDLGEEGNSRRITAGVAPHAGYIFSGYTASRTYK 60
Query: 63 NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I + VF +LGP+H A+ ++TP+G + +D ++ + + +D
Sbjct: 61 AIYEDGLPEVFVILGPNHTGLGSPIAVYPKGEWETPLGRIKVDEKLARRITELSEIADLD 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ EHS+E+ LP++ + E G +KIVPI +G + E G+ + +
Sbjct: 121 DLAHKYEHSIEVQLPFIQYLAELSGKDVKIVPITLGIQDEEVSYALGKAIYEASQELGRD 180
Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
V S+DF H+G + Y+ + + + ++ D + I D + + + ++
Sbjct: 181 IVVIASTDFMHYGEFYGYVPFRARADELPNLVKEWDMRVIRRILDFDVEGMFEEINAMNH 240
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
T+C + + I+ + L Y S + D+ V YAS +
Sbjct: 241 TMC-GPGGVGVGIVYSKLAGAIEAELLHYTTSFEVSRSTDAIVGYASIVMR 290
>gi|223478425|ref|YP_002582779.1| phosphomevalonate decarboxylase [Thermococcus sp. AM4]
gi|214033651|gb|EEB74478.1| phosphomevalonate decarboxylase [Thermococcus sp. AM4]
Length = 291
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/291 (24%), Positives = 133/291 (45%), Gaps = 8/291 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
IR + AGS+Y + + L L+ + + G S + +APHAGY +SG A+ +
Sbjct: 2 IRYPAVAGSFYPADEE-LVLMLERFFSDLGEEGSERRITAGVAPHAGYIFSGYTASRTYK 60
Query: 63 NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I + F +LGP+H A+ + TP+G + +D E+ +E+ +D
Sbjct: 61 AIFEDGLPETFVILGPNHTGLGSPIAVYPEGEWLTPLGSIEVDAEMAKEIAKLSGIADLD 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ EHS+E+ +P++ + E G ++IVPI +G + E G+ + ++
Sbjct: 121 DLAHKYEHSIEVQVPFIQYLAELAGKEVRIVPITLGIQDEEVSRALGKAIFEASEELGRD 180
Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
V S+DF H+G + Y+ + + + I+ D + I D D + L E +
Sbjct: 181 VVVIASTDFMHYGPAYGYVPFRARADELPHRIKEWDFRLIRRILDFDVDGLFRELREMRH 240
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
T+CG + + + + ++ + L Y S + ++ V YAS +
Sbjct: 241 TMCGPGAVGTAI-VYSRLAGALEAELLHYTTSYEVSRSTEAIVGYASIVMR 290
>gi|298345398|ref|YP_003718085.1| dioxygenase [Mobiluncus curtisii ATCC 43063]
gi|298235459|gb|ADI66591.1| dioxygenase [Mobiluncus curtisii ATCC 43063]
Length = 488
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 101/218 (46%), Gaps = 28/218 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA-------------GLPKSPEVRGVIAPH 47
M +R+ + AGS+Y + L +D L++A G P ++ GV PH
Sbjct: 1 MSFVRKPAVAGSFYPQDPGELQRTVDELLKKATALITNDVSRNSQGRPLR-QLMGVQVPH 59
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
AGY YSG AA F + RV + GP+H ALS AT + TP+G++P+D
Sbjct: 60 AGYIYSGLIAAVGFQLVSTLRPRRVIIAGPTHRVGCHGIALSEATAFVTPLGEVPVDQVA 119
Query: 105 LEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--E 162
+ + E + KF VD EH++E+ LP+L VF IVP+ VG E+ +
Sbjct: 120 QQYLVEHSSVAKF-FEPSHVD--EHALEVQLPFLQTVFRDVDFAIVPMAVGDCTPEDVCD 176
Query: 163 AMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK 200
++ L + VSSD H Y+ YD+
Sbjct: 177 SLQTLLSIPGENPDDTLIIVSSDLSH------YLGYDE 208
>gi|325969093|ref|YP_004245285.1| Memo domain containing protein [Vulcanisaeta moutnovskia 768-28]
gi|323708296|gb|ADY01783.1| Memo domain containing protein [Vulcanisaeta moutnovskia 768-28]
Length = 283
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 132/297 (44%), Gaps = 31/297 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPE-----VRGVIAPHAGYSYS 53
+R+ + AG++Y + +L ++++ W L L + P V VI PHAGY YS
Sbjct: 1 MRKPAVAGAFYEADKDKLIKQIE-WSISHSLGPGQLIRQPREGYKAVPIVIVPHAGYIYS 59
Query: 54 GRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA +F I + F+L GP+H+ A+ ++TP+G + +D +V +EL
Sbjct: 60 GPVAAMSFVEIYRFHKPKTFILIGPNHYGVGAPVAIYPEGTWETPLGVVEVDSDVTKELM 119
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
+ K+ D+ EHS+E+ LP++ +F G+ +KIVPI++ E G A
Sbjct: 120 SRVKYLEPDVYAFTQEHSIEVQLPFIQYIF-GNDVKIVPIIIWRQTKEVARDLGNAIADV 178
Query: 173 VDD---PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ S + SSD+ H+ H+ D +D + D D+F
Sbjct: 179 ISGHELGSIVYVASSDWNHYEP--------------HEITTEKDMRAIDPVLRLDEDSFF 224
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ YD + CG I+ + + + LR+ S +V YAS A
Sbjct: 225 DAIERYDVSACGYGAIATAIIAAKKLGVR-NVILLRHATSGDTSGYTLETVGYASIA 280
>gi|257094199|ref|YP_003167840.1| hypothetical protein CAP2UW1_2625 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257046723|gb|ACV35911.1| protein of unknown function DUF52 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
Length = 272
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 95/189 (50%), Gaps = 10/189 (5%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R A+ AG +Y +S LA +L L ++ + + VI PHAGY YSG AA +
Sbjct: 8 VRPAAVAGMFYPGSSSVLARDLRDLLNDSRPAQGRRPKAVIVPHAGYIYSGPIAASVYAP 67
Query: 64 IDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I S+ RV LLGP+H A +T + TP+G +P+D + + ++A + + D
Sbjct: 68 IAALRESVRRVVLLGPTHRVAVDGLAFPASTAFATPLGVVPVDAQALALIEALPQVVVSD 127
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-AVNAENEAMYGRLFAKYVDDPSNFF 180
EHS+E+ LP+L V + +VP+ VG A AE + L+ D+
Sbjct: 128 -AAHALEHSLEVQLPFLQTVLDE--FTLVPLAVGRASPAEVAEVLECLWGG--DE--TLI 180
Query: 181 SVSSDFCHW 189
+SSD H+
Sbjct: 181 VISSDLSHY 189
>gi|212223278|ref|YP_002306514.1| hypothetical protein TON_0132 [Thermococcus onnurineus NA1]
gi|229557467|sp|B6YST0.1|Y132_THEON RecName: Full=MEMO1 family protein TON_0132
gi|212008235|gb|ACJ15617.1| hypothetical protein, conserved, DUF52 family [Thermococcus
onnurineus NA1]
Length = 291
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 132/287 (45%), Gaps = 8/287 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
+R + AGS+Y + + L E L+ + R+ G S ++ + PHAGY +SG A+ F
Sbjct: 2 VRYPAVAGSFYPGD-ETLIEMLEKFFRDLGEHGSERKITAGVVPHAGYVFSGYTASRTFK 60
Query: 63 NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I + F +LGP+H A+ + + TP+G++ +D E+ + + +D
Sbjct: 61 AIYEDGLPETFVILGPNHTGIGSPIAVYPSGSWLTPLGEIEVDSEMAKTIAKLSGIADLD 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKYVD--DPS 177
+ EHS+E+ LP++ + E ++IVPI +G + E G+ + + D
Sbjct: 121 ELAHKYEHSIEVQLPFIQYLAEKARTDVRIVPITLGIQDEEVVEDLGKAIYEAANELDRD 180
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
S+DF H+G + Y+ + + + ++ D + I D L + D+
Sbjct: 181 VVIIASTDFMHYGPAYGYVPFRARADELPHRVKEWDFRVIQKILDFDVKGMFGELRKMDH 240
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
T+CG + + + + ++ + L Y S + D+ V YAS
Sbjct: 241 TMCGPGGVGTAI-VYSRLAGALEAELLHYTTSFEVSRSTDAIVGYAS 286
>gi|371778140|ref|ZP_09484462.1| AMMECR1 domain-containing protein [Anaerophaga sp. HS1]
Length = 475
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 120/270 (44%), Gaps = 13/270 (4%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
K R+ S AG +Y L L EA G+ + +V +I PHAGY YSG AA
Sbjct: 4 KDRQPSVAGMFYAGEPVALQNHLKSLFEEAQPGVGDA-QVAALIVPHAGYLYSGGVAASG 62
Query: 61 FGNIDPTS-ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
F I+ + +FL+G SH ++ T Y TP+G + +D + ++L + +
Sbjct: 63 FAQINENAHYKTIFLIGSSHRMAFNGASVYTQGDYLTPLGRVDVDKALAQKLVESSPYIT 122
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ EHS+E+ LP+L + K+VPI++G +A M Y P N
Sbjct: 123 DIFAPHKDEHSLEVELPFLQYRLKNSF-KLVPIVMGPHDAVGARMVAEALKPYF-KPGNL 180
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
F +S+DF H Y Y+ V +++A+ K D++ K + ++
Sbjct: 181 FVISTDFSH------YPKYEDAKKVDAITVDAILKNDPDLLLNTLEQNRKLGVTNLVTSL 234
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQS 269
CG + L++ + T + + + Y S
Sbjct: 235 CGWTSVLTLLYITQD-DTCLDFRHIEYRNS 263
>gi|358637773|dbj|BAL25070.1| hypothetical protein AZKH_2764 [Azoarcus sp. KH32C]
Length = 275
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 124/291 (42%), Gaps = 42/291 (14%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAA--YA 60
IR A+ AG +Y D+ + L ++ L A L + ++ PHAGY YSG AA YA
Sbjct: 6 IRPAAVAGYFYPDDPRVLQMQISEMLSTAVPLEVVHAPKALVVPHAGYVYSGPVAASGYA 65
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ +V +LGP+H AL + + TP+G + L + L+A
Sbjct: 66 CATELRQRVRKVVMLGPAHRAEVRGFALPASQSFATPLGMVKLHHSDWQALQARP----- 120
Query: 121 DICVD----EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA-MYGRLFAKYVDD 175
D+ VD E EH +E+ LP+L V + +IVPILVG E A + G+L+
Sbjct: 121 DVVVDDRPHEFEHCLEVQLPFLQTVLDA--FEIVPILVGGATGEQVADLLGQLWGG---- 174
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P +SSD H Y YD+ +++ + +M K +L++
Sbjct: 175 PETLILISSDLSH------YHTYDQARSTDRHAVDQVLRM--------------KPVLDH 214
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG PI+ + + ++ L S R V YAS A
Sbjct: 215 QQA-CGATPINGLIR--AAAARGLEPHLLDLRNSGDTAGDRSRVVGYASIA 262
>gi|254489889|ref|ZP_05103084.1| conserved hypothetical protein [Methylophaga thiooxidans DMS010]
gi|224464974|gb|EEF81228.1| conserved hypothetical protein [Methylophaga thiooxydans DMS010]
Length = 262
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 102/204 (50%), Gaps = 20/204 (9%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
ME +R + AG +Y ++ +L +++ + + +I PHAGY YSG AA A
Sbjct: 1 MEIVRPNAVAGLFYPADAVKLNKQVAADMAACHADPLLSPKALIVPHAGYMYSGAVAASA 60
Query: 61 FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ ++ P I RV L+GPSH A+STA ++TP+G + +D + ++ A
Sbjct: 61 YQHLKPLKNIIKRVVLIGPSHRVPFDGLAISTADWFETPLGLIAVDRQAESQIIAIDGV- 119
Query: 119 LMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
I +++A EHS+E+ LP+L + + KIVPI+ G + + + ++ K
Sbjct: 120 ---IALEQAHAREHSLEVQLPFLQFLLDD--FKIVPIVAGHASPK---VIAKVLDKLWGG 171
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYD 199
P +SSD H Y+ YD
Sbjct: 172 PETLIVISSDLSH------YLDYD 189
>gi|339483947|ref|YP_004695733.1| hypothetical protein Nit79A3_2567 [Nitrosomonas sp. Is79A3]
gi|338806092|gb|AEJ02334.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Nitrosomonas sp. Is79A3]
Length = 260
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 127/292 (43%), Gaps = 40/292 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IR + AG +Y ++ +L +++ L A + + + +I PHAGY YSG AA A
Sbjct: 1 MTTIRSPAVAGLFYPADAWQLEQDVQHLLAMAEFHDT-KPKALIVPHAGYVYSGAIAATA 59
Query: 61 FGNI--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ +I +I RV LLGP+H AL V+ TP+G + LD +++ +
Sbjct: 60 YASIYSVAAAIRRVVLLGPAHRVAVQGLALPGVDVFDTPLGRVKLDTDLVNAIA-----H 114
Query: 119 LMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
L + V + EHS+E+ LP+L KV ++P+ VG +AE A + +
Sbjct: 115 LPQVTVSKEAHVLEHSLEVQLPFLQKVLSD--FTLLPLAVGMTSAEAVA---EVLEQLWG 169
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
+SSD H Y+ Y V K++++ I++ P ++
Sbjct: 170 GEETLIVISSDLSH------YLPYATAQRVDSKTVQS-------IVQLRQP-------ID 209
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+D CG I+ + + + L S RD V YA+ A
Sbjct: 210 HDQA-CGSTAINGLI--IAAQQHHLTPYLLDLRNSGDTAGSRDQVVGYAAIA 258
>gi|85858129|ref|YP_460331.1| cytoplasmic protein [Syntrophus aciditrophicus SB]
gi|85721220|gb|ABC76163.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
Length = 278
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 122/283 (43%), Gaps = 18/283 (6%)
Query: 4 IRRASHAGSWYTDNSKRLAEE-LDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
I+R+ AGSWY + + L+ + +D + G + G++APHAGY YSG+ AA+A+
Sbjct: 6 IKRSLIAGSWYPGSPRVLSRDIMDYFDNVPGKTVQGRILGLVAPHAGYMYSGQVAAHAYK 65
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I + VF++GPSH + +L Y+TP+G + + ++ L
Sbjct: 66 EIKGQTYDVVFVIGPSHRAFFRGVSLFKEGGYETPLGIVDVHEDMAARLLEQDPRIAFLP 125
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
V EHS+E+ LP+L +V G VP+++G + E + +
Sbjct: 126 DVHLQEHSVEIQLPFL-QVALGEF-SFVPLIMGDQDYETCRVLADAIVNCCGNKQVLIVG 183
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSD H Y Y++ + + +E L KM D + L CG
Sbjct: 184 SSDLSH------YHGYEQAVRMDSRILEHLRKM--------DECGLIRDLSSGTGEACGG 229
Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
P +V + M+ K L+Y S R V YA+A
Sbjct: 230 GPAAVTM-MVARQLGADKAAVLKYANSGDVTGDRSGVVGYAAA 271
>gi|374327733|ref|YP_005085933.1| putative dioxygenase [Pyrobaculum sp. 1860]
gi|356643002|gb|AET33681.1| putative dioxygenase [Pyrobaculum sp. 1860]
Length = 281
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 131/292 (44%), Gaps = 27/292 (9%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-------PEVRGVIAPHAGYSYSGR 55
++R+ + AG +Y + L +++D ++ PK+ E G +APHAGY YSG
Sbjct: 2 RVRKPAVAGYFYESEREELLQQIDWAVKHELGPKALQMPKLGAEALGGVAPHAGYMYSGP 61
Query: 56 AAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + + F ++GP+H+ A+ + V++TP+G L +D E+ + +
Sbjct: 62 VAAWLYSYLAGFGKPDAFVVVGPNHYGIGAPVAIMKSGVWETPLGRLEIDEELAGLIASH 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
K D EHS+E+ +P++ F +KIVPI + G A+ +
Sbjct: 122 YKELEDDFYAFAKEHSLEVQMPFIQYYFGD--VKIVPITIWRQTLSTSRELGVAIARALR 179
Query: 175 DPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ + SSDF H+ + H V + D+M + I D + +
Sbjct: 180 EYGRRVYVIASSDFNHY----------EPHDVTTRK----DEMAISKILKLDEASLFEVA 225
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
++D +ICG PI V + ++ + L++ S R+ +V YAS
Sbjct: 226 SKFDISICGLGPIGVLI-VVAKELGYLNATLLKHATSGDTSGYREETVGYAS 276
>gi|403253107|ref|ZP_10919412.1| hypothetical protein EMP_04985 [Thermotoga sp. EMP]
gi|402811869|gb|EJX26353.1| hypothetical protein EMP_04985 [Thermotoga sp. EMP]
Length = 277
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 129/296 (43%), Gaps = 39/296 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
R+ + AG +Y L E+ LD + LP E + G+++PHAGY YSG
Sbjct: 3 RKPAVAGLFYPSRKDELIEQIRMCFLDKRIGPGKLPGPVETKLQNPIGLVSPHAGYIYSG 62
Query: 55 RAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+ F + S V ++GP+H + ++TP+G +P++ +E + +
Sbjct: 63 PVAAWGFLEAVKFGEPSVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNERAVEIVLS 122
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV----GAVNAENEAMYGRLF 169
++ D EHS+E+ +P+L VFE + IVPI + AV + + +L
Sbjct: 123 NSRYAEKDFMSHIREHSIEVQIPFLQFVFEE--VPIVPICLMDQSPAVAEDLASALAKLV 180
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
A++ P S+D HY+ + + K D ++ IE DP
Sbjct: 181 AEF---PGVLIIASTDLN---------HYEDQRTTLRK-----DSYIIEAIEGMDPSLLY 223
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+YL+ D ++CG ++ L+M ++ L++ S V Y SA
Sbjct: 224 EYLVREDISMCGYGGVATLLNMDFE-----NVRILKHATSGDVSGDTLEVVGYLSA 274
>gi|451979656|ref|ZP_21928071.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
gi|451763184|emb|CCQ89268.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
Length = 267
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 121/282 (42%), Gaps = 23/282 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + +G +Y D + L +E+ L +G + GV+APHAG+ YSG A +
Sbjct: 2 LRQPAVSGRFYPDRPQALIKEVKSHL--SGDRTAVRALGVVAPHAGFMYSGDVAGSVYER 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I+ R+ L+GP+H A+ + + P+GD+ +D E+ ++ D
Sbjct: 60 IEIP--ERILLIGPNHTGRGRPVAIQSHGAWVMPMGDVQIDTELAAAIRKHLPGVEEDDD 117
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
E EHS+E LP+L + K VP+ + ++ + G A+ +++
Sbjct: 118 AHENEHSLETQLPFLQ--YFKQPFKFVPLCLKRLSLNDCQKVGHALARSIEEVGEPVLVV 175
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H+ S H+S D+ +D + D + T+CG
Sbjct: 176 ASSDMTHYES--------------HESASLKDRRAIDRMLDLDAAGLHTTVHNERITMCG 221
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
+P +V L + + K + RY S + DS V YA
Sbjct: 222 VNPATVMLTTVQDLGAKQAV-LSRYMTSGEVSGDMDSVVGYA 262
>gi|240103783|ref|YP_002960092.1| hypothetical protein TGAM_1726 [Thermococcus gammatolerans EJ3]
gi|259646472|sp|C5A7L6.1|Y1726_THEGJ RecName: Full=MEMO1 family protein TGAM_1726
gi|239911337|gb|ACS34228.1| Dioxygenase, putative [Thermococcus gammatolerans EJ3]
Length = 291
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 72/292 (24%), Positives = 131/292 (44%), Gaps = 10/292 (3%)
Query: 4 IRRASHAGSWYT--DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
IR + AGS+Y D + E+ L E G + + +APHAGY +SG A+ +
Sbjct: 2 IRYPAVAGSFYPADDELILMLEKFFSDLGEEGSER--RITAGVAPHAGYIFSGYTASRTY 59
Query: 62 GNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
I + F +LGP+H A+ + TP+G + +D E+ +E+ +
Sbjct: 60 KAIFEDGLPETFVILGPNHTGLGSPIAVYPEGEWLTPLGSIEVDAEMAKEIAKLSGIADL 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D + EHS+E+ +P++ + E G +KIVPI +G + + G+ + +
Sbjct: 120 DELAHKYEHSIEVQVPFIQYLTELAGKEVKIVPITLGIQDEDVSRALGKAIFEASEKLGR 179
Query: 179 FFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
V S+DF H+G + Y+ + + + I+ D + I D D + L E
Sbjct: 180 DVVVIASTDFMHYGPAYGYVPFRARADELPHRIKEWDFRLIRRILDFDVDGLFRELREMR 239
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+T+CG + + + + ++ + L Y S + ++ V YAS +
Sbjct: 240 HTMCGPGAVGTAI-VYSRLAGAVEAELLHYTTSYEISRSTEAVVGYASIVMR 290
>gi|225848005|ref|YP_002728168.1| hypothetical protein SULAZ_0171 [Sulfurihydrogenibium azorense
Az-Fu1]
gi|225643721|gb|ACN98771.1| conserved hypothetical protein [Sulfurihydrogenibium azorense
Az-Fu1]
Length = 265
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 102/195 (52%), Gaps = 14/195 (7%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAY 59
M +R+ + A +YT + +RL + ++ +A L P PE G+I+PHAGY YSG A
Sbjct: 1 MLTVRKPAVANLFYTGDKERLYYTVKSYIEKAPLYPYKPE--GLISPHAGYMYSGIVAGV 58
Query: 60 AFG---NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATG 115
++ N+D L+GPSH+ Y + ++TP+G++ + E IE +K+
Sbjct: 59 SYKQLLNLDLDKHYTFLLIGPSHYVYLQGISFGYYDFWQTPLGNVKVAKEKIEAFIKSYP 118
Query: 116 KFEL-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
F + ++ + EHS+E+ +P+L + + I+P++ G + +++Y + +
Sbjct: 119 NFPITLNTLPHQKEHSLEVQVPFLQVIMKN--FDIIPVVYGDI----DSIYVKKVIDFFK 172
Query: 175 DPSNFFSVSSDFCHW 189
D + +SSD H+
Sbjct: 173 DENTVVIISSDLSHY 187
>gi|127511864|ref|YP_001093061.1| hypothetical protein Shew_0931 [Shewanella loihica PV-4]
gi|126637159|gb|ABO22802.1| protein of unknown function DUF52 [Shewanella loihica PV-4]
Length = 266
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 92/193 (47%), Gaps = 18/193 (9%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV----RGVIAPHAGYSYSGRAAA 58
K R+A+ AG +Y L ++L + EA P V + +I PHAGY YSG AA
Sbjct: 2 KYRQAAVAGRFYPAEPGLLTQQLIHYFGEA-----PSVHIIPKALILPHAGYLYSGEVAA 56
Query: 59 YAFGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A + P RV LLGPSH+ CAL + + TP+G++P+D V EL +
Sbjct: 57 KAVNLLRNHPDGYRRVVLLGPSHYVGLNGCALPRSDSFITPLGEIPID-RVGIELLLNRR 115
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+ + EH++E+ LP L + + ++P++VG + E + V D
Sbjct: 116 LAIASDMAHQREHALEVELPLLQFCLDDFI--LLPVVVGGASPEAVCQ----LIQAVADS 169
Query: 177 SNFFSVSSDFCHW 189
VSSD H+
Sbjct: 170 DTLIVVSSDLSHY 182
>gi|23831516|sp|O59292.2|Y1626_PYRHO RecName: Full=MEMO1 family protein PH1626
Length = 291
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 131/291 (45%), Gaps = 10/291 (3%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFGN 63
R + AG +Y + L E L + ++ G + + +APHAGY +SG A+ +
Sbjct: 3 RYPAVAGQFYPEGDA-LIEMLSSFFKDLGEEGTKRTITAGVAPHAGYVFSGFTASRTYKA 61
Query: 64 IDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I + VF++ GP+H AL + TP+G + +D + +E+ +D
Sbjct: 62 IYEDGLPEVFVIFGPNHTGLGSPIALYPEGEWITPMGSIKVDSKFAKEIVKRSGIADLDD 121
Query: 123 CVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGR-LF-AKYVDDPSN 178
+ EHS+E+ LP++ + E G +KIVPI +G + E GR +F A
Sbjct: 122 LAHKYEHSIEVQLPFIQYIAEKAGVEVKIVPITLGIQDEEVSRSLGRSIFEASTSLGRDT 181
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
S+DF H+GS + Y+ + + + + D + I D D + E ++T
Sbjct: 182 IIIASTDFMHYGSFYGYVPFRGRPEELPNMVRDWDMRIIRRILDFDLDGMFSEIREMNHT 241
Query: 239 ICGRHPISVFLHMLGNCSTKIK-IKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+CG P V ++ + K K + L Y S + D+ V YAS K
Sbjct: 242 MCG--PGGVGAGIIYSRLMKAKEAELLHYTTSFEVSRSPDAIVGYASIIMK 290
>gi|387127441|ref|YP_006296046.1| dioxygenase [Methylophaga sp. JAM1]
gi|386274503|gb|AFI84401.1| dioxygenase [Methylophaga sp. JAM1]
Length = 262
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 122/286 (42%), Gaps = 30/286 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IR ++ AG +Y S L + L + + +I PH GY YSG+ AA A
Sbjct: 1 MISIRPSAVAGLYYPARSTELEDLLAKQFDKPSDHIHHHAKALIVPHGGYFYSGQVAAKA 60
Query: 61 FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ ++ + I RV L+GPSH A+S A + TPIG + +D ++ +L E
Sbjct: 61 YQSLMEVADDIERVILIGPSHRTDFKGIAMSEADYFATPIGSVAVDKKIYPQLSRIKGVE 120
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ D EH +E+ LP+L + + +IVP+L G N+ ++ + D +
Sbjct: 121 TYESPHDN-EHCLEVQLPFLQ--YSLNQFEIVPLLTGKANS---SLIADVLGTATQDVKS 174
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+SSD H Y+ Y+ + + +A+ M + D D F
Sbjct: 175 LIVISSDLSH------YLDYETARKIDQYTSQAIISM-----DNHDIDKFHA-------- 215
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG I FL N + +I L S K++S V Y +
Sbjct: 216 -CGCDAIRGFLEYARNANMSGQIMAL--SNSGDIVRKKESVVGYGA 258
>gi|150401805|ref|YP_001325571.1| hypothetical protein Maeo_1383 [Methanococcus aeolicus Nankai-3]
gi|150014508|gb|ABR56959.1| protein of unknown function DUF52 [Methanococcus aeolicus Nankai-3]
Length = 284
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 131/298 (43%), Gaps = 37/298 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD-GWLREAG---LPKSPEVR---GVIAPHAGYSYSGRA 56
+R AG++Y + L E ++ + + G +P E + GVIAPHAGY YSG
Sbjct: 3 VRNPVVAGAFYPSDPNELIEIIEYCYFHKLGPNTIPIGGEYKKPVGVIAPHAGYIYSGAP 62
Query: 57 AAYAFGNI------DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
AAY++ I D T+I ++GP+H ++ ++KTP+GD+ D E I++
Sbjct: 63 AAYSYSAISERVSGDITAI----IIGPNHTGLGEGVSVMDG-IWKTPLGDVSTDTEFIDK 117
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLI--KIVPILVGAVNAENEAMYGRL 168
L +D EHS+E+ LP+L + + KIVPI + + E G
Sbjct: 118 LWKECDVVELDELAHSREHSIEIQLPFLQHIALRQSVKFKIVPICMAMQDYETSMDVGYF 177
Query: 169 FAKYVDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
AK + + S+DF H+ + N KK +I K+I A+D+ + D
Sbjct: 178 IAKIAKELNRNVIIIASTDFSHYEPQEN---ASKKDALIIKNILAMDEKSL------YSD 228
Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
A + Y+ ++CG P + + K L Y S S V Y S
Sbjct: 229 A-----ITYNISMCGYGPTMAMIRAMKELGATTS-KLLSYCTSGDITGDYSSVVGYGS 280
>gi|317153856|ref|YP_004121904.1| hypothetical protein Daes_2152 [Desulfovibrio aespoeensis Aspo-2]
gi|316944107|gb|ADU63158.1| protein of unknown function DUF52 [Desulfovibrio aespoeensis
Aspo-2]
Length = 268
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 123/284 (43%), Gaps = 26/284 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ AG +Y + RL ++DG+L + G P+ + + PHAGY YSG
Sbjct: 3 RQPVVAGRFYDVDPARLFGQVDGFLSQ-GAPRGGQTLLAMVPHAGYVYSGAVCGQTLAQ- 60
Query: 65 DPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
S++R V LLGP+H AL + P G +P+D+E+ L D
Sbjct: 61 --ASLARTVLLLGPNHTGLGQPFALWPDGAWHIPGGSVPVDVELATALLGADPRIAPDTA 118
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS-- 181
EHS+E+ LP+L ++ L+ IVP+ + + E+ A GR + + + S
Sbjct: 119 AHLREHSLEVVLPFLYRLDP--LVSIVPLAIASHVFEDVAGVGRAIGRAIKAFGHPVSIV 176
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
VSSD H+ + H LD + ++ + DP + + ++CG
Sbjct: 177 VSSDMSHY--------------ISHDQARRLDALALEAAVSLDPRGLFDAVRAHSISMCG 222
Query: 242 RHPISVFLH-MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
P++ L+ L +T+ ++ Y S + D V YA
Sbjct: 223 VLPMTAGLYAALEMGATRGELA--AYATSGEVSGDFDQVVGYAG 264
>gi|148269972|ref|YP_001244432.1| hypothetical protein Tpet_0837 [Thermotoga petrophila RKU-1]
gi|166235097|sp|A5IKY3.1|Y837_THEP1 RecName: Full=MEMO1 family protein Tpet_0837
gi|147735516|gb|ABQ46856.1| protein of unknown function DUF52 [Thermotoga petrophila RKU-1]
Length = 277
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 127/296 (42%), Gaps = 39/296 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
R+ + AG +Y L E+ LD + LP E + G ++PHAGY YSG
Sbjct: 3 RKPAVAGLFYPSRRDELVEQIRICFLDKRIGPGELPDPSETKLQSPIGFVSPHAGYIYSG 62
Query: 55 RAAAYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
AA+ F + +P S V ++GP+H + ++TP+G +P++ E
Sbjct: 63 PVAAWGFLEVAKFGEP---SVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNQRAAEI 119
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI-LVGAVNAENEAMYGRLF 169
+ + ++ D EHS+E+ +P+L VF + IVPI L+ A E + L
Sbjct: 120 ILNSSRYAEEDFMSHIREHSIEVQIPFLQFVFGD--VSIVPICLMDQSPAVAEDLANALT 177
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ PS S+D HY+ + + K D M+ I DP
Sbjct: 178 KLVAEFPSVLIIASTDLN---------HYEDQRTTLRK-----DSYIMEAIRNKDPRLLY 223
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+YL++ D ++CG ++ L+M N + L++ S + V Y SA
Sbjct: 224 EYLVKEDISMCGYGGVATLLNM--NFKNA---RILKHATSGDVSGDKLEVVGYLSA 274
>gi|70605939|ref|YP_254809.1| hypothetical protein Saci_0089 [Sulfolobus acidocaldarius DSM 639]
gi|449066131|ref|YP_007433213.1| hypothetical protein SacN8_00420 [Sulfolobus acidocaldarius N8]
gi|449068407|ref|YP_007435488.1| hypothetical protein SacRon12I_00420 [Sulfolobus acidocaldarius
Ron12/I]
gi|76363480|sp|Q4JCG3.1|Y089_SULAC RecName: Full=MEMO1 family protein Saci_0089
gi|68566587|gb|AAY79516.1| conserved Archaeal protein [Sulfolobus acidocaldarius DSM 639]
gi|449034639|gb|AGE70065.1| hypothetical protein SacN8_00420 [Sulfolobus acidocaldarius N8]
gi|449036915|gb|AGE72340.1| hypothetical protein SacRon12I_00420 [Sulfolobus acidocaldarius
Ron12/I]
Length = 284
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 138/301 (45%), Gaps = 41/301 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD-GWLREAG---LPKSPEV-------RGVIAPHAGYSY 52
IR + AGS+Y + RL ++++ +L + G LP P+ R + PHAGY Y
Sbjct: 2 IRIPAVAGSFYEADPVRLRKQIEWSFLHDLGPKSLPSVPQNKPPQRSNRFFVVPHAGYMY 61
Query: 53 SGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV-IEE 110
SG AA+A+ ++ S V +LGP+H ++ +KTP+G++ +D E+ +E
Sbjct: 62 SGPVAAHAYYHLSLEGSPDTVIILGPNHTGLGSYVSIWHKGKWKTPLGEVSVDDEISLEL 121
Query: 111 LKATGKFELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG 166
+K T E++DI DE EHS+E+ +P+L +F G KIVPI++ +
Sbjct: 122 VKLT---EIIDI--DERAHLYEHSIEVQIPFLQYLF-GQNFKIVPIVIMMQTPDVAESLA 175
Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
K V V SSD H Y +D K+IE D + +D I+ D
Sbjct: 176 EGIYKLVSSGKKDIVVLASSDLNH------YEPHD-------KTIEK-DNLAIDEIQKLD 221
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ + E D T CG PI L + K + LR+ S + S V Y S
Sbjct: 222 YKGLFRVVEEKDVTACGYGPIMTVLILAKKLGKKPYV--LRHATSGDTSGDKSSVVGYLS 279
Query: 285 A 285
Sbjct: 280 V 280
>gi|408403728|ref|YP_006861711.1| hypothetical protein Ngar_c11120 [Candidatus Nitrososphaera
gargensis Ga9.2]
gi|408364324|gb|AFU58054.1| uncharacterized protein family UPF0103 [Candidatus Nitrososphaera
gargensis Ga9.2]
Length = 282
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 127/293 (43%), Gaps = 31/293 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSP------EVRGVIAPHAGYSYSGRA 56
R + AG +Y DN + L +D R G ++P + G+++PHAGY YSG
Sbjct: 4 RAPAVAGMFYPDNPRELRGLVDESFRNQRFGPGRAPPSANKRRIYGIVSPHAGYVYSGAV 63
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AA F + + V ++GP+H+ A ++TP+G++ ++ E+ E + A
Sbjct: 64 AANGFYEVSSINFQDVIMVGPNHYGIGSWVAAMKEGEWETPLGEVKVNSELAEGIAARSS 123
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYGRLFAK 171
D +H +E+ LP+L + + +IVPI++ + +E A+ + K
Sbjct: 124 TLDFDNFAHSRDHCLEVQLPFLQYIKQD--FRIVPIVLVSQRSETAYDLGNAISETIVEK 181
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
V D S SSDF H+ N H +K + K+I ALD + F
Sbjct: 182 RVAD-STLLIASSDFTHYEPN-NEAH--RKDAELIKAILALDV-----------NKFYAV 226
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
L D + CG I+ + N + + L+Y S + V Y+S
Sbjct: 227 LERLDVSACGYGAIATMMVAAKNLGAT-RGELLKYATSGDITGDTSAVVGYSS 278
>gi|379005390|ref|YP_005261062.1| putative dioxygenase [Pyrobaculum oguniense TE7]
gi|375160843|gb|AFA40455.1| putative dioxygenase [Pyrobaculum oguniense TE7]
Length = 281
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 131/295 (44%), Gaps = 33/295 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-------RGVIAPHAGYSYSGR 55
++R+ + AG +Y ++L +++D ++ PK+ ++ G + PHAGY YSG
Sbjct: 2 RVRKPAVAGYFYAAEKEKLIQQIDWSIKHELGPKALQMPKLGEKALGGVVPHAGYIYSGP 61
Query: 56 AAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+ + G P +I ++GP+H+ A+ + V++TP+G + +D ++ E +
Sbjct: 62 VAAWLYSALAGYGKPDAI---IIIGPNHYGIGAPVAVMKSGVWETPLGRVEVDGDLAELI 118
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
K D EHS+E+ +P++ F ++IVPI+V G+ A
Sbjct: 119 MRHYKGVEDDFYAFSKEHSVEVQIPFIQYYFGD--VRIVPIVVWRQTLSTSRELGKAVAT 176
Query: 172 YVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ + + V SSDF HY+ + K D M + I D
Sbjct: 177 AIREYGRYVYVLASSDFN---------HYEPHEVTVQK-----DDMAISKILKVDEAGLF 222
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ ++D +ICG PI + + + L++ S +D +V YAS
Sbjct: 223 EVASKFDISICGLGPIGALIVIAKELGFG-NVTLLKHATSGDTSGYKDETVGYAS 276
>gi|386283998|ref|ZP_10061221.1| hypothetical protein SULAR_02073 [Sulfurovum sp. AR]
gi|385344901|gb|EIF51614.1| hypothetical protein SULAR_02073 [Sulfurovum sp. AR]
Length = 268
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/289 (26%), Positives = 119/289 (41%), Gaps = 35/289 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEEL-------DGWLREAGLPKSPEVRGVIAPHAGYSYSGRAA 57
R AS AG +Y + + L D L+E + R VI PHAGY YSG A
Sbjct: 4 REASVAGQFYPSSPDEIHSMLEHYNQIIDEHLKEKDSVLHLKPRAVIVPHAGYVYSGFTA 63
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A + + + RV ++GPSH Y ++S Y TP+G L +D ++ +LK +F
Sbjct: 64 NIALRLLSNSDVKRVVIIGPSHRVYLTGTSISEFDTYHTPLGALLIDKPLVLDLKE--RF 121
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
++ + EHS E+ +P++ + +V ++ G E+ + +DDP
Sbjct: 122 DIGFVPEAHHEHSTEVQVPFVKTYTPD--VSVVELVYG---DESPQKLAEVIEYLLDDPE 176
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
+S+D H+ YD K ALD + +D +E L
Sbjct: 177 TAVVISTDLSHY--------YDIKKA------NALDSICLDAVEK-----LSTAELHEGC 217
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG+ I V +L ++ L Y S+ V Y SAA
Sbjct: 218 EACGK--IGVEAMLLAAKKNGLRSVLLDYRTSADASGDESQVVGYMSAA 264
>gi|333987636|ref|YP_004520243.1| hypothetical protein MSWAN_1428 [Methanobacterium sp. SWAN-1]
gi|333825780|gb|AEG18442.1| UPF0103 protein [Methanobacterium sp. SWAN-1]
Length = 286
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 125/295 (42%), Gaps = 29/295 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPE-------VRGVIAPHAGYSYSGR 55
IR+ + AG +Y + L E+++ + + G K P+ + GVIAPHAGY YSG
Sbjct: 2 IRKPAVAGLFYEVDPDSLNEQIEWCFKSQIGPGKVPKKIGNKRNITGVIAPHAGYIYSGP 61
Query: 56 AAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+++ I F +L P+H + T ++TP+G++ +D E + L
Sbjct: 62 VAAHSYYKIAEDGFPETFVILSPNHTGLGSGVSAMTEGAWETPLGNVEIDEEFAQNLVRN 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL---IKIVPILVGAVNAENEAMYGRLFAK 171
D EHS E+ +P+L + +L KIVP+ + + E G K
Sbjct: 122 TSIMDSDPSAHMQEHSAEVQIPFLQYLKNKYLEKDFKIVPVCMWMQDIETAMEVGISIQK 181
Query: 172 YVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+D V S+DF H+ K + H D +D I + D +
Sbjct: 182 TAEDLGRDVVVIASTDFTHY----------KPQDIAHDG----DMQVIDAITSMDENLMM 227
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ D T+CG P++ + + + +F +Y S +S V YAS
Sbjct: 228 ARVSALDVTMCGYGPVAATM-VASKLRGADRCEFEKYATSGDATEDYNSVVGYAS 281
>gi|284991034|ref|YP_003409588.1| hypothetical protein Gobs_2556 [Geodermatophilus obscurus DSM
43160]
gi|284064279|gb|ADB75217.1| protein of unknown function DUF52 [Geodermatophilus obscurus DSM
43160]
Length = 268
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 124/289 (42%), Gaps = 41/289 (14%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
++R + AG +Y + + L + +D L E G P+ V+APHAGY YSG AA A+
Sbjct: 2 RVRPPAVAGRFYPADPEELHDLVDRLLDEVCGQPRPVPPAAVVAPHAGYRYSGAVAATAY 61
Query: 62 GNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
++ P +++RV LLGP+H + A+ TP+G + +D + T L
Sbjct: 62 AHLTAAPHAVTRVVLLGPAHFWPLDGMAVPAVDALATPLGSVDVD-----DDARTVAAAL 116
Query: 120 MDICVD----EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
+ VD + EHS+E LP+L +V G + ++P+LVG E+ A VD
Sbjct: 117 PGVAVDDRPHDGEHSLETQLPFLQRVL-GPEVAVLPVLVGRTQPESVAA----VLTAVD- 170
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
VS+D H H D+ S D+ D + D A +
Sbjct: 171 -GALAVVSTDLSH--------HLDE------PSARERDRRTADAVLARDAAALRP----- 210
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ CG P+ LH + ++ LR S+ R V Y +
Sbjct: 211 -DDACGHQPLRGPLHH--AAERHLSVELLRMATSADSGAGRARVVGYGA 256
>gi|395645086|ref|ZP_10432946.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methanofollis liminatans DSM 4140]
gi|395441826|gb|EJG06583.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methanofollis liminatans DSM 4140]
Length = 261
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/253 (27%), Positives = 114/253 (45%), Gaps = 36/253 (14%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+ R + AG +Y L + L+ + P +P+ G+++PHAGY YSG A AF
Sbjct: 2 ETRPCTFAGMFYPGEPGHLEQFLE-TVTPVERP-APDALGIVSPHAGYPYSGAVGARAFS 59
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I P ++GPSH + ++A ++TP+G + +D++ + L DI
Sbjct: 60 AIRPDFDGTFVVIGPSHRGF---LTCTSAIPWETPLGIVDVDVDFVRAL---------DI 107
Query: 123 CVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVN-AENEAMYGRLF-AKYVDDP 176
VDE EHS+E+ P++ F +I PI++G + A E + G++ A
Sbjct: 108 RVDEVSMEDEHSLEVQTPFIKHFFP--RAQIAPIMMGDQSPASAELVAGKIVRAIRATGR 165
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
SSDF H+ + D M ++ IE D D + ++EYD
Sbjct: 166 EVRIVASSDFSHY--------------IPESQARKRDAMAIEAIERLDVDGLYRSIVEYD 211
Query: 237 NTICGRHPISVFL 249
+ CG PI+ +
Sbjct: 212 ISACGYGPIAAMI 224
>gi|347523464|ref|YP_004781034.1| hypothetical protein Pyrfu_0913 [Pyrolobus fumarii 1A]
gi|343460346|gb|AEM38782.1| protein of unknown function DUF52 [Pyrolobus fumarii 1A]
Length = 285
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 125/293 (42%), Gaps = 28/293 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSG 54
+RR + AG +Y + + L +++ G L G G ++PHAGY YSG
Sbjct: 5 VRRPAVAGMFYEADPEALRSQIEWCFTHTLGPGKLPPRGGGSERLSVGFVSPHAGYMYSG 64
Query: 55 RAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+ + + V ++GP+H ++ V++TP+G + +D E+ + +
Sbjct: 65 PVAAHVYYQLALEKKPDTVVIVGPNHTGLGTLVSVMVEGVWETPLGRVEIDSELAKLIVK 124
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+D EHS+E+ +P+L ++ G +IVPI++ G AK
Sbjct: 125 YSDLADIDDKAHLFEHSVEVQVPFLQYIY-GDEFRIVPIVMWDQTPRAARDLGEAVAKAA 183
Query: 174 DDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ + SSDF H+ + H V K D++ +D I DP+ +
Sbjct: 184 AELGRDVIYIASSDFTHY----------EPHEVAAKK----DRLAIDKILALDPEGLHEV 229
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ YD ++CG P+ L+ + L+Y S R S V YA+
Sbjct: 230 IQRYDISMCGPGPVMSMLY-YARARGASNAQLLKYATSGDVTGDRSSVVGYAA 281
>gi|407771035|ref|ZP_11118398.1| dioxygenase [Thalassospira xiamenensis M-5 = DSM 17429]
gi|407285924|gb|EKF11417.1| dioxygenase [Thalassospira xiamenensis M-5 = DSM 17429]
Length = 495
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 21/201 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS----PEVRGVIAPHAGYSYSGRA 56
M IR + AG++Y L E+DG + A ++ P + +I PHAG +SG
Sbjct: 1 MTIIRPPAIAGTFYPAAPDMLRSEIDGLIHAAVESQTENDRPIPKAIIVPHAGLMFSGAI 60
Query: 57 AAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
A F + ++ R+ ++GP+H A++ A + TP+GD+ DL ++ A
Sbjct: 61 AGLGFATVRALKGTVKRIVIIGPAHRMAFQGIAIARADGFATPLGDVRCDLAGLQAALAW 120
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAEN-----EAMYGRL 168
+ +++D EH +E+ LP++ ++F E I IVP+LV +A EA++G
Sbjct: 121 PQVQVLDEA-HRLEHGLEIELPFIQRLFGENADIGIVPLLVSRCSAWQVHEVIEALWG-- 177
Query: 169 FAKYVDDPSNFFSVSSDFCHW 189
P +SSD H+
Sbjct: 178 ------GPETLIVISSDLSHF 192
>gi|406892372|gb|EKD37739.1| hypothetical protein ACD_75C01015G0003 [uncultured bacterium]
Length = 267
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 120/282 (42%), Gaps = 22/282 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + A +Y +++ LA + L + + V++PHAGY YSG AA F +
Sbjct: 1 MRQPAVADRFYPGSAEALALAVKELLPAKSTSEKLKGLAVVSPHAGYVYSGALAAETFSS 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ V +LGP+H AL T T + P+GD+P+D EV + + + +D
Sbjct: 61 V--VVPETVIILGPNHRGQGAPVALGT-TSWNMPLGDVPIDREVADLILSHSPQIKVDEM 117
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE--NEAMYGRLFAKYVDDPSNFFS 181
EHS+E+ +P+L K+ E + IVP+++ ++ E A
Sbjct: 118 AHRYEHSLEVQVPFLQKLQER--LCIVPLVISHISYPLCEEVANALALAIRQSGKEILIV 175
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H+ SR +S E D++ + IE DP + ++ T+CG
Sbjct: 176 ASSDMSHYESR--------------RSAEKKDRLALQCIERLDPYDLYHTVRDHRITMCG 221
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
P+ + L C K + Y S V YA
Sbjct: 222 VIPVVIALLAAKACGAS-KYHLIGYTDSGFVSGDTAQVVGYA 262
>gi|145590329|ref|YP_001152331.1| hypothetical protein Pars_0062 [Pyrobaculum arsenaticum DSM 13514]
gi|166227788|sp|A4WH12.1|Y062_PYRAR RecName: Full=MEMO1 family protein Pars_0062
gi|145282097|gb|ABP49679.1| protein of unknown function DUF52 [Pyrobaculum arsenaticum DSM
13514]
Length = 281
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 131/295 (44%), Gaps = 33/295 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-------RGVIAPHAGYSYSGR 55
++R+ + AG +Y ++L +++D ++ PK+ ++ G + PHAGY YSG
Sbjct: 2 RVRKPAVAGYFYPAEKEKLIQQIDWSIKHELGPKALQMPKLGEKALGGVVPHAGYIYSGP 61
Query: 56 AAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+ + G P +I ++GP+H+ A+ + V++TP+G + +D ++ E +
Sbjct: 62 VAAWLYSALAGYGKPDAI---IIIGPNHYGIGAPVAVMKSGVWETPLGRVEVDGDLAELI 118
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
K D EHS+E+ +P++ F ++IVPI+V G+ A
Sbjct: 119 MRHYKGVEDDFYAFSKEHSVEVQIPFIQYYFGD--VRIVPIVVWRQTLSTSRELGKAVAT 176
Query: 172 YVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+ + + V SSDF HY+ + K D M + I D
Sbjct: 177 AIREYGRYVYVLASSDFN---------HYEPHEVTVQK-----DDMAISKILKVDEAGLF 222
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ ++D +ICG PI + + + L++ S +D +V YAS
Sbjct: 223 EVASKFDISICGLGPIGALIVIAKELGFG-NVTLLKHATSGDTSGYKDETVGYAS 276
>gi|410867018|ref|YP_006981629.1| MEMO_like multi-domain protein [Propionibacterium acidipropionici
ATCC 4875]
gi|410823659|gb|AFV90274.1| MEMO_like multi-domain protein [Propionibacterium acidipropionici
ATCC 4875]
Length = 275
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 123/301 (40%), Gaps = 54/301 (17%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-----RGVIAPHAGYSYSGR 55
+ IR + AG +Y + L E LD L A P +V R +I PHAGY YSG
Sbjct: 7 QDAIRPPAVAGFFYPADPAELTETLDRLL-AASEPDEEDVDPDRLRTLIVPHAGYIYSGP 65
Query: 56 AAAYAFGNIDPTSI----SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+ + + + RV +LGP+H AL A Y TP+G P+D
Sbjct: 66 TAAHGYRLLARLAAHHPPRRVAVLGPTHRVAIRGLALPAAGGYATPLGVCPVD------- 118
Query: 112 KATGKFELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
A+G +L + V A EHS+E+ +P+L ++ I I P+ VG ++ A
Sbjct: 119 -ASGLDDLPQVAVHAATHAEEHSLEVQVPFLQRILGD--IPITPLAVGLAEPDSVAQVIE 175
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
A DP F +SSD H+ + D I + +E D + D
Sbjct: 176 TLAA---DPDTFIVISSDLSHYHAHAEARRLDDH--TIARILERQDTIPAD--------- 221
Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS--VSYASA 285
CG HP++ L S ++ +++ T D S V YAS
Sbjct: 222 ----------RACGAHPLNGML----QASRRLGWAPHILARATSADTAGDPSRVVGYASL 267
Query: 286 A 286
A
Sbjct: 268 A 268
>gi|257054055|ref|YP_003131888.1| hypothetical protein Huta_2994 [Halorhabdus utahensis DSM 12940]
gi|256692818|gb|ACV13155.1| protein of unknown function DUF52 [Halorhabdus utahensis DSM 12940]
Length = 283
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 128/301 (42%), Gaps = 38/301 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--------PEVRGVIAPHAGYSY 52
M R + AG +Y L E+++ P S P + G++APHAG +
Sbjct: 1 MASTRHPAVAGRFYVRTEPSLREQIEAAFTHEIGPGSVPTATAGPPAITGLVAPHAGLPF 60
Query: 53 SGRAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
SG AA+++ + + V ++GP+H A+ ++TP+G +P+D + +
Sbjct: 61 SGPLAAHSYAALAKSGTPETVVIIGPNHTGVGAAVAVPGDDEWRTPLGSVPIDNGLRDRT 120
Query: 112 KATGKFELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
A D VD+ +EH+ E+ LP+L +++ ++PI + +A+ G
Sbjct: 121 VAE-----TDASVDDRAHASEHAAEVQLPFLQHLYDD--FAVLPISLRRQDADVAQALGD 173
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
+ + + + + SSDF H+ H + D++ +D I+ DP
Sbjct: 174 VLETHAGEGAVVIA-SSDFTHYEP--------------HDTAIGRDELALDRIDATDPAG 218
Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
+ + ++CG I+ L G S ++ L + S RD V YAS A
Sbjct: 219 LIETVEREGLSVCGYGAIAAMLWASGADS---QVDVLAHATSGDTAGSRDEVVGYASVAV 275
Query: 288 K 288
+
Sbjct: 276 R 276
>gi|327398420|ref|YP_004339289.1| hypothetical protein Hipma_0253 [Hippea maritima DSM 10411]
gi|327181049|gb|AEA33230.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Hippea maritima DSM 10411]
Length = 267
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 97/200 (48%), Gaps = 9/200 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR+ + A +Y + +L LD + + + P + +I PHAGY YSG AA A+
Sbjct: 3 IRKPAVADMFYPGSPSQLKAYLDSVMFQPDVKVKP--KALIVPHAGYVYSGEVAAKAYSL 60
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
ID S ++GP+H + ++ +Y+ P GD+ DLE+IE + A ++ +D
Sbjct: 61 ID--SFDTYIVMGPNHTGLGAEISIFDG-IYQMPFGDVEPDLELIESI-AKNEYAQIDYY 116
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFS 181
EHS+E+ LP++ + + KIVPI+VG N A+ + D
Sbjct: 117 AHLQEHSLEVQLPFIDYISKKPY-KIVPIVVGTHNVAKLYSMAETIAETIKDAGKDVLIV 175
Query: 182 VSSDFCHWGSRFNYMHYDKK 201
VS+D H+ + + DK
Sbjct: 176 VSTDMNHYEDQTTTLIKDKN 195
>gi|124028270|ref|YP_001013590.1| dioxygenase [Hyperthermus butylicus DSM 5456]
gi|123978964|gb|ABM81245.1| predicted dioxygenase [Hyperthermus butylicus DSM 5456]
Length = 301
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 120/261 (45%), Gaps = 24/261 (9%)
Query: 35 PKSPEVRGVIAPHAGYSYSGRAAAYAFGNI--DPTSISRVFLLGPSHHYYTPKCALSTAT 92
P+ V GV+ PHAGYSYSG AA+ + + + + V ++G +H Y +T T
Sbjct: 49 PRYTPVGGVV-PHAGYSYSGPCAAHLYLELAENAPEVDTVVVMGTNHTGY--GGVYTTTT 105
Query: 93 VYK---TPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKI 149
YK TP+G + D+E IE LK D EHS+E+ LP+L ++ G+ K+
Sbjct: 106 RYKAWATPLGTVETDIEFIELLKKLYPRLEDDYLAHMREHSIEVELPFLQYIY-GNNFKL 164
Query: 150 VPILVGAVNAENEAMYGRLFAKYVDDPSN------FFSVSSDFCHWGSRFNYMHYDKKHG 203
VPI+V E R A+ V + SSDF H G ++Y+ + +
Sbjct: 165 VPIVV----KEPSERMAREMAEAVKRAAEELGRRILVIASSDFTHHGYMYDYVLFTEN-- 218
Query: 204 VIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKF 263
+ +++ LD ++ I D F + + Y T+CG I+ + + K+
Sbjct: 219 -VRENVAKLDMAIIEHILRLDTKGFLETIYRYGATVCGYGAIATLIEYAKLEGARAKL-- 275
Query: 264 LRYEQSSQCKTKRDSSVSYAS 284
L+Y S+ + V YA+
Sbjct: 276 LKYYNSADVTGDEAAVVGYAA 296
>gi|325095393|gb|EGC48703.1| DUF52 domain-containing protein [Ajellomyces capsulatus H88]
Length = 164
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 52/87 (59%), Gaps = 8/87 (9%)
Query: 120 MDICVDEAEHSMEMHLPYLAKVFE--------GHLIKIVPILVGAVNAENEAMYGRLFAK 171
M VD AEHS+E+HLPY+ + ++VP++VG+ + EA +G L A
Sbjct: 1 MSPPVDSAEHSLELHLPYIHHLLRTLYPTRAPAAYPRLVPMMVGSTSPATEAAFGALLAP 60
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHY 198
Y+ D +N F VSSDFCHWG RF Y +Y
Sbjct: 61 YLADDTNAFVVSSDFCHWGLRFGYTYY 87
>gi|393795899|ref|ZP_10379263.1| hypothetical protein CNitlB_06112 [Candidatus Nitrosoarchaeum
limnia BG20]
Length = 277
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/289 (26%), Positives = 124/289 (42%), Gaps = 24/289 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSP------EVRGVIAPHAGYSYSGR 55
+IR + AG +Y+ K L + + D +L + G K P ++ GVI PHAGY+YSG
Sbjct: 2 QIRTPAVAGMFYSGEKKELKKSINDCFLHKFGPGKIPPSDNKKKIFGVICPHAGYAYSGP 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A ++F I ++GP+H A + ++TP+G++ +D E EE+
Sbjct: 62 IACHSFYEISSNLPELFIIVGPNHWGIGSSVATMKDSKWQTPLGEVEVDSETAEEISNIT 121
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
K +D EHS+E+ +P L ++ +I+PI++ + E G A
Sbjct: 122 KIIEVDNFSHSREHSLEVQIPILQEI--SKKFRILPIVLINQSKEVAIQLGSAIANIARK 179
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
SSDF H+ N +++ + IE + K+ D D F L +
Sbjct: 180 KKVMIIGSSDFTHY--EPNEFAHEQDAAL----IEPILKL--------DVDRFYDVLNKK 225
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
D + CG I+ + K + L+Y S S V Y S
Sbjct: 226 DISACGYGAIASTMIACKELGAT-KGELLKYATSGDITGDTSSVVGYGS 273
>gi|170290628|ref|YP_001737444.1| dioxygenase [Candidatus Korarchaeum cryptofilum OPF8]
gi|170174708|gb|ACB07761.1| Predicted dioxygenase [Candidatus Korarchaeum cryptofilum OPF8]
Length = 285
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 134/298 (44%), Gaps = 29/298 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--------EVRGVIAPHAGYSYSGR 55
+R+ + AGS+Y S L + LD R AG P + ++ PHAGY YSG+
Sbjct: 5 LRKPAVAGSFYPSRSDDLLKLLDSLFRGAGSGGIPSPNPAGERRIASMMVPHAGYIYSGK 64
Query: 56 AAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KA 113
AA A+ + I F ++GP+H + + + ++TP+G + +DLE+ E + +
Sbjct: 65 TAASAYSILAQDGIPETFVIIGPNHTGLGSAFSATKSKFWETPLGRVEVDLELAEAISRY 124
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
G + D+ EHS+E+ +P+L ++ G +KI+PI++ E G+ A
Sbjct: 125 FGDLDFDDLA-HTWEHSIEVQVPFLQSIY-GASLKILPIVMAIQEPEPSIDLGKSIALAS 182
Query: 174 DDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ SSD HY + I + D+ +D I D + +
Sbjct: 183 EKLGRDVLVIASSDMS---------HYLPEGEAIRR-----DRAAIDAILNMDVHSLFRI 228
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+ E D ++CG P+SV + K + +RY S++ + V YAS ++
Sbjct: 229 IQELDVSMCGPGPVSVAIS-FAKLKGVSKGELVRYSTSAETSGDKSFVVGYASVVFRL 285
>gi|329765804|ref|ZP_08257370.1| hypothetical protein Nlim_1150 [Candidatus Nitrosoarchaeum limnia
SFB1]
gi|329137647|gb|EGG41917.1| hypothetical protein Nlim_1150 [Candidatus Nitrosoarchaeum limnia
SFB1]
Length = 277
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/289 (26%), Positives = 124/289 (42%), Gaps = 24/289 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEEL-DGWLREAGLPKSP------EVRGVIAPHAGYSYSGR 55
+IR + AG +Y+ K L + + D +L + G K P ++ GVI PHAGY+YSG
Sbjct: 2 QIRTPAVAGMFYSGEKKELKKSINDCFLHKFGPGKIPPSDNKKKIFGVICPHAGYAYSGP 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A ++F I ++GP+H A + ++TP+G++ +D E EE+
Sbjct: 62 IACHSFYEISSNLPELFIIVGPNHWGIGSSVATMKDSKWQTPLGEVEVDSETAEEISNIT 121
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
K +D EHS+E+ +P L ++ +I+PI++ + E G A
Sbjct: 122 KIIEVDNFSHSREHSLEVQIPILQEI--SKKFRILPIVLINQSKEVAIQIGSAIANIARK 179
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
SSDF H+ N +++ + IE + K+ D D F L +
Sbjct: 180 KKVMIIGSSDFTHY--EPNEFAHEQDAAL----IEPILKL--------DVDRFYDVLNKK 225
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
D + CG I+ + K + L+Y S S V Y S
Sbjct: 226 DISACGYGAIASTMIACKELGAT-KGELLKYATSGDITGDTSSVVGYGS 273
>gi|126458656|ref|YP_001054934.1| hypothetical protein Pcal_0028 [Pyrobaculum calidifontis JCM 11548]
gi|166227773|sp|A3MS51.1|Y028_PYRCJ RecName: Full=MEMO1 family protein Pcal_0028
gi|126248377|gb|ABO07468.1| protein of unknown function DUF52 [Pyrobaculum calidifontis JCM
11548]
Length = 281
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 132/296 (44%), Gaps = 35/296 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-------EVRGVIAPHAGYSYSGR 55
+IR+ + AG +Y + L ++ ++ PK+P E G + PHAGY YSG
Sbjct: 2 RIRKPAVAGYFYESRADELQARIEWSIKHEIGPKAPVLPKLGTEALGGVVPHAGYMYSGP 61
Query: 56 AAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + + F ++GP+H+ A+ + ++TP G + +D E + L A+
Sbjct: 62 VAAWLYSALAGYGAPTTFVIIGPNHYGIGAPVAIMKSGAWETPFGRVEID-EELASLIAS 120
Query: 115 GKFELMDIC-VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
E+ D EHS+E+ LP++ F+ +K VPI V G+ AK +
Sbjct: 121 NYREIEDDAHAFSKEHSIEVQLPFIQYYFKN--VKFVPIAVWRQTLSTSRELGKAIAKAL 178
Query: 174 DDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ + SSDF HY+ + K A+ K+ +++ E G D +
Sbjct: 179 REYKRRVYLLASSDFN---------HYEPHDVTVKKDDMAIGKI-LELDEAGLFDVASR- 227
Query: 232 LLEYDNTICGRHPIS---VFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+D +ICG PI+ V LG + + L++ S RD +V YAS
Sbjct: 228 ---FDISICGIGPIASVIVAAKELGFSNAAL----LKHATSGDTSGYRDETVGYAS 276
>gi|156743011|ref|YP_001433140.1| hypothetical protein Rcas_3068 [Roseiflexus castenholzii DSM 13941]
gi|156234339|gb|ABU59122.1| protein of unknown function DUF52 [Roseiflexus castenholzii DSM
13941]
Length = 277
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/288 (26%), Positives = 122/288 (42%), Gaps = 23/288 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR + AGSWY N RL ++ +L +A P S + G++APHAG YSG AA+AF
Sbjct: 3 IRPSPIAGSWYPSNPARLRRAIEQYLAQATPPGLSGRIWGLLAPHAGLRYSGPVAAWAFA 62
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT----GKFE 118
I ++ + ++ P H + Y+TP+G + +D + I +L A +
Sbjct: 63 CIRGSTPDIIVIVSPWHRGGDTPLITTGHAAYETPLGVVEVDADAIVQLDAALCAHAGYG 122
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
L D+ EH++E+ LP+L V +++P+++ A G A+ V
Sbjct: 123 LARRRYDD-EHAIEIELPFLQCVLGS--FRLLPVMMADQRTTTAAALGAALAETVRGRDV 179
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSD H+ + V H+ LD + + DPDA E
Sbjct: 180 LLIASSDLSHY----------EPASVAHR----LDAELLRRVAAFDPDAVIAAEYEGAGY 225
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG I+ L + ++ +RY S + V Y +AA
Sbjct: 226 ACGSGAIAAVLWACRDLGAD-QVTIVRYGTSGDVEPDAPWVVGYGAAA 272
>gi|170288657|ref|YP_001738895.1| hypothetical protein TRQ2_0860 [Thermotoga sp. RQ2]
gi|229890255|sp|B1LA64.1|Y860_THESQ RecName: Full=MEMO1 family protein TRQ2_0860
gi|170176160|gb|ACB09212.1| protein of unknown function DUF52 [Thermotoga sp. RQ2]
Length = 277
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 128/293 (43%), Gaps = 33/293 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
R+ + AG +Y L E+ LD + LP E + G+++PHAGY YSG
Sbjct: 3 RKPAVAGLFYPSRRDELIEQIRMCFLDKRIGPGELPVPVETKLQNPIGLVSPHAGYIYSG 62
Query: 55 RAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+ F + S V ++GP+H + ++TP+G +P++ +E + +
Sbjct: 63 PVAAWGFLEAVKFGEPSVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNQRAVEVILS 122
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI-LVGAVNAENEAMYGRLFAKY 172
++ D EHS+E+ +P+L VF + +VPI L+ A E + L
Sbjct: 123 NSRYAEEDFMSHIREHSIEVQIPFLQFVFGE--VSMVPICLMDQSPAVAEDLASALMKLA 180
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ P S+D HY+ + + K D ++ I++ DP +YL
Sbjct: 181 AEFPGVLIIASTDLN---------HYEDQRTTLRK-----DSYIIEAIKSNDPRLLYEYL 226
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
++ D ++CG ++ L+M N + L++ S + V Y SA
Sbjct: 227 VKEDISMCGYGGVATLLNM--NFKNA---RILKHATSGDVSGDKLEVVGYLSA 274
>gi|152993349|ref|YP_001359070.1| hypothetical protein SUN_1766 [Sulfurovum sp. NBC37-1]
gi|151425210|dbj|BAF72713.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
Length = 273
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 97/195 (49%), Gaps = 14/195 (7%)
Query: 2 EKIRRASHAGSWYTDNSK---RLAEELDGWLREAGLPKSP-EVR--GVIAPHAGYSYSGR 55
+ IR+A+ AGS+Y + + R E + + K ++R +I PHAGY YSG
Sbjct: 3 QGIRKAAVAGSFYPERCREIRRYIREFNAAFDRLSIKKEILDIRPGAIIVPHAGYIYSGF 62
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A +A+ + T R+ ++GPSHHYY + ++TP G++ +D + L
Sbjct: 63 TANFAYRFLKHTKPKRIIVIGPSHHYYFKGISAGHFENFETPCGEIEIDNPYLFAL--AK 120
Query: 116 KFEL-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+F + D E EHS E+ +P++ F K++ ++ G V A+ A+ + +
Sbjct: 121 EFNIGFDPKAHEKEHSTEVQMPFIQHYFPK--AKVIELVYGDVPAKELAL---IITALLK 175
Query: 175 DPSNFFSVSSDFCHW 189
+P N +SSD H+
Sbjct: 176 NPDNAVVISSDLSHF 190
>gi|302878778|ref|YP_003847342.1| hypothetical protein Galf_1560 [Gallionella capsiferriformans ES-2]
gi|302581567|gb|ADL55578.1| protein of unknown function DUF52 [Gallionella capsiferriformans
ES-2]
Length = 266
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 120/288 (41%), Gaps = 32/288 (11%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IR + AG +Y ++ L +E+ +L A P + +I PHAGY YSG AA A
Sbjct: 1 MPVIRPPAVAGLFYPADAGELKQEVQEYL-AAAQPFDLSPKALIVPHAGYIYSGAIAATA 59
Query: 61 FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + + I RV LLGP+H AL + TP+G + +D+ + ++ +
Sbjct: 60 YATLRKLARKIRRVILLGPTHRVAIRGIALPGVDAFATPLGRIRIDVAAADAIRHLAQVT 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
EH++E+ LP+L V ++P+ VG +A A + D
Sbjct: 120 TSS-HAHAQEHALEVQLPFLQSVLSD--FTVLPLAVGNASA---AEVAEVLECLWDGEET 173
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+SSD H Y+ Y + +++++A I+ P A +
Sbjct: 174 LIVISSDLSH------YLPYATAQRIDYETVQA-------ILNLTQPIAHEHA------- 213
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG P+S ++ ++ L + S RD V YA+ A
Sbjct: 214 -CGGTPVSGL--IIAAAHHHLRPHLLALKNSGDTAGDRDRVVGYAALA 258
>gi|15642862|ref|NP_227903.1| hypothetical protein TM0087 [Thermotoga maritima MSB8]
gi|418046024|ref|ZP_12684118.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thermotoga maritima MSB8]
gi|7388377|sp|Q9WXU2.1|Y087_THEMA RecName: Full=MEMO1 family protein TM_0087
gi|4980576|gb|AAD35181.1|AE001695_7 conserved hypothetical protein [Thermotoga maritima MSB8]
gi|351675577|gb|EHA58737.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thermotoga maritima MSB8]
Length = 277
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 128/296 (43%), Gaps = 39/296 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
R+ + AG +Y L E+ LD + LP E + G+++PHAGY YSG
Sbjct: 3 RKPAVAGLFYPSRRDELIEQIRMCFLDKRIGPGKLPGPVETKLQNPIGLVSPHAGYIYSG 62
Query: 55 RAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+ F + S V ++GP+H + ++TP+G +P++ +E + +
Sbjct: 63 PVAAWGFLEAVKFGEPSVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNERAVEIVLS 122
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV----GAVNAENEAMYGRLF 169
++ D EHS+E+ +P+L VF + IVPI + AV + + +L
Sbjct: 123 NSRYAEEDFMSHIREHSIEVQIPFLQFVF--GEVSIVPICLMDQSPAVAEDLASALAKLV 180
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
A++ P S+D HY+ + + K D ++ IE DP
Sbjct: 181 AEF---PGVLIIASTDLN---------HYEDQRTTLRK-----DSYIIEAIEGMDPSLLY 223
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+YL+ D ++CG ++ L+M ++ L++ S V Y SA
Sbjct: 224 EYLVREDISMCGYGGVATLLNMDFE-----NVRILKHATSGDVSGDTLEVVGYLSA 274
>gi|158522440|ref|YP_001530310.1| hypothetical protein Dole_2429 [Desulfococcus oleovorans Hxd3]
gi|158511266|gb|ABW68233.1| protein of unknown function DUF52 [Desulfococcus oleovorans Hxd3]
Length = 484
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 71/288 (24%), Positives = 121/288 (42%), Gaps = 31/288 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
IR+ AG++Y + + L+ + R+A P + +I PHAGY YSG AA
Sbjct: 26 IRQPVAAGTFYPADPEALSRLITDLTRQAEKTAVKTPAEKPLCALILPHAGYVYSGLTAA 85
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + RV ++ P H +CA++ A +TPIG +P+ + LK + F
Sbjct: 86 HGALALKNRRFKRVVVMAPDHCAGIGQCAVTGADACRTPIGIIPVGTKTAARLKQSKLFV 145
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ EH++E+ LP+L + +VP++ G V + + G + V D +
Sbjct: 146 TTPPASENREHAVEVVLPFLQAWLKD--FSLVPVITGPVAPQR--LAGTI--DSVLDETT 199
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSD H+ S+ D++ +++I G + L+ +
Sbjct: 200 LLVASSDLSHYLSQARAREKDRET--------------INMILQGRTGS----LVSEPDR 241
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG PI LH+ + + + L Y S D V YA+ A
Sbjct: 242 ACGAIPIETVLHLAKRHNWQPVL--LHYSNSGDTAGPTDRVVGYAAVA 287
>gi|157363361|ref|YP_001470128.1| hypothetical protein Tlet_0497 [Thermotoga lettingae TMO]
gi|157313965|gb|ABV33064.1| protein of unknown function DUF52 [Thermotoga lettingae TMO]
Length = 279
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 124/293 (42%), Gaps = 35/293 (11%)
Query: 5 RRASHAGSWYTDNSKRLA----EELDGWLREAGLPKSPEV-----RGVIAPHAGYSYSGR 55
R A AGS+Y + +RL E L +P P+ VI PHAGY YSG
Sbjct: 3 RTAVAAGSFYPASPERLKFSIEEAFTSSLGPGHIPDKPDQPLKKHSSVIVPHAGYIYSGP 62
Query: 56 AAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+A+ I V L+GP+H Y + + TP G+L + + E
Sbjct: 63 VAAHAYAEISKLGKPELVVLVGPNHTGYGRPIGVYNRGTWVTPFGELHVSTDAAEIFLQY 122
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV---NAENEAMYGRLFAK 171
D +EHS+E+ LP+L +F+ +I+PI V + + E A+ A+
Sbjct: 123 CDEANADFKSHISEHSIEVQLPFLQYLFDD--FQILPIAVFPIFIKSCEKIAIALDHIAE 180
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ PS F ++DF HY+ + K D+M +D I T DP
Sbjct: 181 --NFPSTLFVFTTDF---------NHYESDEITVKK-----DQMVIDKILTKDPTGLYSV 224
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ + T+CG ++ FL M ++ L++ S +R V YAS
Sbjct: 225 VAKEKVTMCGLSVVTSFLFMKHFGLPRL----LKHATSGDVTGERSQVVGYAS 273
>gi|239618223|ref|YP_002941545.1| hypothetical protein Kole_1859 [Kosmotoga olearia TBF 19.5.1]
gi|239507054|gb|ACR80541.1| protein of unknown function DUF52 [Kosmotoga olearia TBF 19.5.1]
Length = 277
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 129/297 (43%), Gaps = 41/297 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEEL----DGWLREAGLPK-----SPEVRGVIAPHAGYSYSGR 55
R AG +Y N++ L +++ D + LP + + G+I PHAGY YSG
Sbjct: 3 RNPVFAGRFYASNAEELKKQITACYDHPVGPGELPGPVFSFALKNAGLITPHAGYMYSGP 62
Query: 56 AAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + + R + ++GP+H Y + ++ + TP+G L +D ++ EL
Sbjct: 63 VAAHGYLELSKIGKPRKIIIIGPNHTGYGARLSIWPEGSWHTPLGTLRIDESLVGELVRN 122
Query: 115 GKFELM-DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPIL-----VGAVNAENEAMYGRL 168
+ EL D EHS+E+ LP++ +F+ IVPI+ + AV EA+ L
Sbjct: 123 SQGELKPDTSAHLYEHSIEVQLPFIQHIFDNDP-TIVPIIMTDQSINAVRTLVEAISSIL 181
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+ F SSD HYD + K D++ + + T D D
Sbjct: 182 ----KKEEGIFVIASSDMN---------HYDNHETTLRK-----DELLIKALLTKDIDRI 223
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ E T CG PI+V L + ++K L++ S + +V Y SA
Sbjct: 224 YQVARENRITACGLGPIAVVLKLFD------ELKILKHATSGDVSGDKYHTVGYLSA 274
>gi|389860859|ref|YP_006363099.1| hypothetical protein TCELL_0536 [Thermogladius cellulolyticus 1633]
gi|388525763|gb|AFK50961.1| hypothetical protein TCELL_0536 [Thermogladius cellulolyticus 1633]
Length = 290
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 76/303 (25%), Positives = 134/303 (44%), Gaps = 38/303 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSP--------EVRGVIAPHAGYSYSGR 55
R + AG +Y + K L ++ + + G K P V G + PHAGY YSG
Sbjct: 7 RYPAVAGFFYPASRKELVVSIEQSFTHKLGPGKKPVVAASRNKSVVGFVVPHAGYMYSGP 66
Query: 56 AAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+A+ + + V +LG +H Y ++ + TP+G++ +D E+ ++L
Sbjct: 67 VAAHAYLELASAGVPETVVILGTNHTGYGALVSVYPGGTWVTPLGEVRVDSELAKKLVEF 126
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
F +D EHS+E+ +P++ ++E +KI+P+++G + K V
Sbjct: 127 SGFAELDEEAHIEEHSVEVQVPFIQYMYEDR-VKILPVVIGLHTVDVARDLALSLKKAVS 185
Query: 175 DPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ V SSDF HY+ +H + K +EA+ + I GD + F + +
Sbjct: 186 ELGRDVVVLASSDFN---------HYEPQHVTVAKDMEAI-----NYILRGDSEGFYRAI 231
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKI------KIKFLRYEQSSQCKTKRDSSVSYASAA 286
E + ++CG I + TK+ +I L++ S + V YAS
Sbjct: 232 TEKNISVCGPGGIMTLIEY-----TKLYEKYTPRITLLKHATSGDVTGDYSAVVGYASVK 286
Query: 287 AKV 289
+V
Sbjct: 287 FEV 289
>gi|313673595|ref|YP_004051706.1| hypothetical protein Calni_1636 [Calditerrivibrio nitroreducens DSM
19672]
gi|312940351|gb|ADR19543.1| protein of unknown function DUF52 [Calditerrivibrio nitroreducens
DSM 19672]
Length = 265
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 110/240 (45%), Gaps = 19/240 (7%)
Query: 44 IAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
I PHAGY YSG A I+ RV L+GP+H + + ++ ++TP GD+ +
Sbjct: 40 IVPHAGYIYSGEVAFRVLSAINIKR--RVILMGPNHTGFGQRVSIYPGGSWETPFGDIDI 97
Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA 163
D E++E G D EHS+E+ LP L + E KIVPI + + + E
Sbjct: 98 DEELVERFVNVGL--KTDTLAHLKEHSLEVILPILKYLNEN--CKIVPITISYLRYD-EC 152
Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
M + A+ + D N+F F S FN H++ + + K DK +D I
Sbjct: 153 M---MVAEKILDVVNYFKDEVTFV-ISSDFN--HFEDEKTTLLK-----DKYAIDAILRL 201
Query: 224 DPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
DP+ + + D ++CG P ++ L + N + K + L + S + D V YA
Sbjct: 202 DPEGLYNVVRDKDISMCGVIPSTIGLIVAKNLGAQ-KGELLIHTTSGRVSGDYDRVVGYA 260
>gi|319790446|ref|YP_004152079.1| hypothetical protein Theam_1477 [Thermovibrio ammonificans HB-1]
gi|317114948|gb|ADU97438.1| protein of unknown function DUF52 [Thermovibrio ammonificans HB-1]
Length = 268
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 129/286 (45%), Gaps = 30/286 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG +Y ++ L L + R +PK + + VI PHAGY YSG A +
Sbjct: 2 VRYPAVAGQFYPGTAQELKLYLSSFCRR-DVPKI-KAKAVIVPHAGYIYSGAVAGATYSR 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ ++ ++GP+H ++ V+ TP+G++P++ E+ EL + +E D
Sbjct: 60 VVIPEVN--VIMGPNHTGLGRPASVYPEGVWVTPLGEVPVNGEIASELTSRYPYE-ADPS 116
Query: 124 VDEAEHSMEMHLPYL--AKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN-FF 180
EHS+E+ +P+L F L IVP++ + + G A+ + D +
Sbjct: 117 AHIYEHSLEVQVPFLQFCSGFNEEL-SIVPVVFQHLPYRDCVAAGEALAQVLADREDALM 175
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+S+DF H+ S+ + + LD + +D I +P+ + + Y+ T+C
Sbjct: 176 VISTDFSHYVSQ--------------EEAKRLDSLAIDAILDLNPEELYRRVHAYNITMC 221
Query: 241 GRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
G P +V L +LG ++ + Y S D V YA
Sbjct: 222 GVIPATVGLVAAKLLGAQGAEL----VDYRTSGDVTGDYDQVVGYA 263
>gi|146304886|ref|YP_001192202.1| hypothetical protein Msed_2139 [Metallosphaera sedula DSM 5348]
gi|189039506|sp|A4YIM6.1|Y2139_METS5 RecName: Full=MEMO1 family protein Msed_2139
gi|145703136|gb|ABP96278.1| protein of unknown function DUF52 [Metallosphaera sedula DSM 5348]
Length = 282
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 73/291 (25%), Positives = 128/291 (43%), Gaps = 31/291 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSGR 55
RR + AGS+Y D+S +L + ++ G + G S I PHAGY YSG
Sbjct: 3 RRPAVAGSFYEDDSAQLRKRIEWAFHHPIGPGGIPSVGSTGSRSNPIFIVPHAGYIYSGP 62
Query: 56 AAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+++ + V +LGP+H Y + ++ ++TP+G ++ ++++EL +
Sbjct: 63 VAAHSYYYLAQEGKPDIVIILGPNHTGYGSQVSIWPGGDWETPLGSAQVNAQLVKELVSV 122
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+ +D EHS+E+ LP+L F+ + I+P+++ E +++
Sbjct: 123 SEVVDIDEKAHLYEHSIEVQLPFLQYFFDN--LSILPVVILMQTPEIAEFVAEGIWRFIQ 180
Query: 175 ---DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
D SSD H+ M D+ ++ + I+ +D G+ K
Sbjct: 181 RHSDKDIVVLASSDLNHYDPHDVTMTKDE---LVIRKIQDMDYKGL-----------YKV 226
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
+ EYD T+CG PI L + K I L++ S + S V Y
Sbjct: 227 VEEYDVTVCGYAPIMASLILAKKMHKKPYI--LKHATSGDTSGDKSSVVGY 275
>gi|20094399|ref|NP_614246.1| dioxygenase [Methanopyrus kandleri AV19]
gi|23822318|sp|Q8TWR9.1|Y963_METKA RecName: Full=MEMO1 family protein MK0963
gi|19887476|gb|AAM02176.1| Predicted dioxygenase [Methanopyrus kandleri AV19]
Length = 283
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 124/292 (42%), Gaps = 29/292 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVR-------GVIAPHAGYSYSGRA 56
R + AG +Y + + L + ++ R E G PE GV+APHAGY +SG
Sbjct: 3 RSPAVAGQFYPADPEELRKMIEWCFRHELGPGDLPETNDGPCTLPGVVAPHAGYQFSGPV 62
Query: 57 AAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+ + + + + V +LGP+H A T ++TP+G + +D E L
Sbjct: 63 AAHTYKVLAESGTPETVVILGPNHTGLGSAVATMTDGAWRTPLGSVEIDSEFATALVRKC 122
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD- 174
D+ EHS+E+ LP+L V+ G + VP+ + + + G +
Sbjct: 123 GVMDDDLTAHANEHSIEVQLPFLQYVY-GESFRFVPVCMAMHDLQTAREVGEAIVDVAEE 181
Query: 175 -DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM-DIIETGDPDAFKKYL 232
D + S+DF H+ D+K + + I ALD+ GM +I+E
Sbjct: 182 LDRNTVVIASTDFTHYEPHDQAQKKDRK---VIERITALDEAGMIEIVE----------- 227
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
Y+ ++CG P + + + + + L+Y S V YA+
Sbjct: 228 -RYNVSMCGVGPTAATIVAVKAMGAS-EGELLKYATSGDVSGDYSQVVGYAA 277
>gi|344345281|ref|ZP_08776135.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Marichromatium purpuratum 984]
gi|343803110|gb|EGV21022.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Marichromatium purpuratum 984]
Length = 260
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 103/220 (46%), Gaps = 15/220 (6%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M + R + AG +Y + + L E+DG+L A P + ++ PHAGY YSG A +A
Sbjct: 1 MPQTRPPAVAGRFYPASPEALGAEIDGYLAVAA--TGPAPKAIVVPHAGYRYSGAIAGHA 58
Query: 61 FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + I +V LLGP+H A S AT + TP+G++ LD I + A
Sbjct: 59 YAALGEARARIRQVVLLGPAHRVAVRGVATSAATAFATPLGEVALDQVAIAQALAFDWVH 118
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV-NAENEAMYGRLFAKYVDDPS 177
++ E EHS+E+ LP+L + + +VP++VG AE A+ L+
Sbjct: 119 TLEAAHAE-EHSLEVQLPFLQRALDD--FSLVPLVVGETPTAELVALLEALW----GGAE 171
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
VSSD H+ R D+ +IE LD +
Sbjct: 172 TLIVVSSDLSHYQLRAEAQALDRA---TSTAIERLDAAAI 208
>gi|149050676|gb|EDM02849.1| similar to RIKEN cDNA 0610016J10 gene, isoform CRA_c [Rattus
norvegicus]
Length = 79
Score = 80.9 bits (198), Expect = 5e-13, Method: Composition-based stats.
Identities = 41/78 (52%), Positives = 49/78 (62%), Gaps = 3/78 (3%)
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQ 271
MGM IIE DP +F YL +Y NTICGRHPI V L+ L + FL Y QSSQ
Sbjct: 1 MGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQKSGMNMSFSFLNYAQSSQ 60
Query: 272 CKTKRDSSVSYASAAAKV 289
C++ +DSSVSYA+ A V
Sbjct: 61 CRSWQDSSVSYAAGALTV 78
>gi|257076981|ref|ZP_05571342.1| hypothetical protein Faci_07956 [Ferroplasma acidarmanus fer1]
Length = 271
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 125/289 (43%), Gaps = 22/289 (7%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLP--KSPEVRGVIAPHAGYSYSGRAAA 58
M+ R +G +Y D+ + L + + G + + +P + ++ G I PH Y YS AA
Sbjct: 1 MDNTREPHVSGGFYPDDYEALMDSIKGSM-DITMPDIRYKKLIGAIVPHGQYMYSSHTAA 59
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
YA+ I + ++GP+H + A+ + TP+G ++ E+ E L
Sbjct: 60 YAYKIIQESKKRTFLIIGPNHSKFPAYPAIYPDGYWSTPLGYAKVNSELSERLLLKSDII 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D EHS+E+ LP+L +F+ + P+++G A + P
Sbjct: 120 HDDKEAHITEHSIEVQLPFLQYLFKSNF-TFTPLILGNQGAAIARNIAETIESLEEKP-- 176
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
F +SS+F H Y+ YDK + LD++ ++ I+ F + + +Y T
Sbjct: 177 FVLISSNFNH------YLPYDKNN--------ELDQILINDIQEMRTLKFYQDIEKYGIT 222
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAA 287
CG I++ ML + K KI L + S R V Y++ A
Sbjct: 223 ACGYGAIAIL--MLITKNMKGKIALLNHSNSGDYSKDRKRVVGYSAMIA 269
>gi|406894537|gb|EKD39328.1| hypothetical protein ACD_75C00439G0006 [uncultured bacterium]
Length = 267
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 124/284 (43%), Gaps = 26/284 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG--VIAPHAGYSYSGRAAAYAF 61
+R+ + A +Y ++K L + L +P S +++G V++PHAGY YSG AA F
Sbjct: 1 MRQPAVADRFYPGSAKALGLAVTELL--PAVPDSGKIKGLAVVSPHAGYVYSGALAAETF 58
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ V +LGP+HH ALST T + P+G +P+D E+ + + A + +D
Sbjct: 59 SAV--VIPETVIILGPNHHGQGAPVALST-TSWDMPMGSVPIDRELTDLILAHSQHIKVD 115
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
+ EHS+E+ +P+L ++ E HL IVP+++ + AK +
Sbjct: 116 ELAHKYEHSLEVQVPFLQRLQE-HL-SIVPLVISHITYPLCEDVAATLAKAIRISGRDIL 173
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSD H+ SR E D++ + IE +P + ++
Sbjct: 174 IVASSDMNHYESR--------------PKTEKKDRLALQCIEQLNPYDLYHTVHSQRISM 219
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
CG P+ + + C K + + S D V YA
Sbjct: 220 CGVIPVVIAMLAALACGAS-KYHLVGHTDSGYVSGDTDQVVGYA 262
>gi|325982175|ref|YP_004294577.1| hypothetical protein NAL212_1534 [Nitrosomonas sp. AL212]
gi|325531694|gb|ADZ26415.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Nitrosomonas sp. AL212]
Length = 271
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 94/207 (45%), Gaps = 17/207 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M IR + G +Y + +L +++ +L E + + +I PHAGY YSG AA A
Sbjct: 9 MATIRSPAVTGLFYPADEHQLRQDVQ-FLLEKANQHDLKPKALIVPHAGYQYSGAIAATA 67
Query: 61 FGN--IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + + +I RV LLGP+H AL V+ TP+G + +D E++ +
Sbjct: 68 YASLRVAAANIQRVVLLGPAHRLAMQGLALPGVDVFTTPLGGVNVDTELVNAIA-----N 122
Query: 119 LMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
L + ++ A EHS+E+ LP+L V +P+ +G +AE A +
Sbjct: 123 LPQVSINRAAHALEHSLEVQLPFLQSVLNE--FTFLPLAIGRASAEEVA---EVLDYLWG 177
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKK 201
+SSD H+ H D +
Sbjct: 178 GEETLIVISSDLSHFLPYATAQHIDNQ 204
>gi|296243113|ref|YP_003650600.1| hypothetical protein Tagg_1388 [Thermosphaera aggregans DSM 11486]
gi|296095697|gb|ADG91648.1| protein of unknown function DUF52 [Thermosphaera aggregans DSM
11486]
Length = 287
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 127/294 (43%), Gaps = 29/294 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGW----------LREAGLPKSPEVRGVIAPHAGYSYSG 54
RR ++ N K E L W L E + E G +APHAGY YSG
Sbjct: 3 RRHPAVAGYFYPNDKHELEALIKWSFTHKIGPGKLPETSPSRRRETIGYVAPHAGYVYSG 62
Query: 55 RAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+ + + + V ++G +H Y ++ V++TP+G L +D E+ + L
Sbjct: 63 PVAAHTYYQMALDGVPETVVVIGTNHTGYGALVSVYPGGVWETPLGLLEVDSELAKALAD 122
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+D EHS+E+ LP++ ++ G ++I+P+++G + R AK +
Sbjct: 123 KSSIAELDEYAHLEEHSVEVQLPFIQYIYGGK-VRILPVVMGLHTPD----VAREVAKSL 177
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
D ++ S+ D S ++ HYD + K EA+ +I D F LL
Sbjct: 178 LDAAS--SLKRDIIIVASS-DFNHYDPHDVTVRKDGEAV-----SMILKLDSVGFYNTLL 229
Query: 234 EYDNTICGRHPISVFLH---MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
D ++CG I V + + G + ++ L+Y S + V YAS
Sbjct: 230 REDVSVCGPGGIMVLIEYAKLAGGGGARAEL--LKYATSGDVSGDKSHVVGYAS 281
>gi|315425791|dbj|BAJ47445.1| conserved hypothetical protein [Candidatus Caldiarchaeum
subterraneum]
gi|343484630|dbj|BAJ50284.1| conserved hypothetical protein [Candidatus Caldiarchaeum
subterraneum]
Length = 284
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 127/293 (43%), Gaps = 26/293 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSP-------EVRGVIAPHAGYSY 52
M+ RR + AG +Y + + L +++ +L G K+P +I+PHAG Y
Sbjct: 1 MKPRRRPAVAGYFYEGSREALLRQVEHCFLSPHGPGKTPAREWGKRRAPALISPHAGLMY 60
Query: 53 SGRAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
SG AA+ + + ++ V + GP+H+ ++ + TP+G++ +D ++ E+
Sbjct: 61 SGPVAAHGYYELTKYAVPESVVVFGPNHYGVGTVVSIYPGGSWVTPLGEVKIDEKLAAEI 120
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
+F +D EHS+E+ LP+L ++ + VPI + + E G +
Sbjct: 121 AGQREFFYLDEVSHSREHSIEVQLPFLQYLYGD--FQFVPICINDQSLETCVEIGEAVGE 178
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
VD + S+DF H+ H+++ D++ ++ IE D +
Sbjct: 179 VVDGRNILMIASTDFTHYEP--------------HETVLKKDRLALERIERLDVEGLYGV 224
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ +D T+CG ++ L ++ L+ S S V YA+
Sbjct: 225 IERHDITMCGYGAVAALLTAAKKLGA-VEATVLKQATSGDTGGDYGSVVGYAA 276
>gi|281412147|ref|YP_003346226.1| hypothetical protein Tnap_0717 [Thermotoga naphthophila RKU-10]
gi|281373250|gb|ADA66812.1| protein of unknown function DUF52 [Thermotoga naphthophila RKU-10]
Length = 277
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 130/299 (43%), Gaps = 45/299 (15%)
Query: 5 RRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYSG 54
R+ + AG +Y L E+ LD + LP E + G+++PHAGY YSG
Sbjct: 3 RKPAVAGLFYPSRKDELIEQIRMCFLDKRIGPGKLPVPVETKLQNPIGLVSPHAGYIYSG 62
Query: 55 RAAAYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
AA+ F +P S V ++GP+H + ++TP+G +P++ +E
Sbjct: 63 PVAAWGFLEAAKFGEP---SVVVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNERAVEI 119
Query: 111 LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV----GAVNAENEAMYG 166
+ ++ D EHS+E+ +P+L VF + IVPI + AV + +
Sbjct: 120 ILNNSRYAEEDFMSHIREHSIEVQIPFLQFVFGE--VSIVPICLMDQSPAVAEDLASALA 177
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
+L A++ P S+D HY+ + + K D ++ IE DP
Sbjct: 178 KLVAEF---PGVLIIASTDLN---------HYEDQRTTLRK-----DSYIIEAIEGMDPS 220
Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+YL++ D ++CG ++ L+M N + L++ S + V Y SA
Sbjct: 221 LLYEYLVKEDISMCGYGGVATLLNM--NFKNA---RILKHATSGDVSGDKLEVVGYLSA 274
>gi|212702288|ref|ZP_03310416.1| hypothetical protein DESPIG_00299 [Desulfovibrio piger ATCC 29098]
gi|212674281|gb|EEB34764.1| Memo-like protein [Desulfovibrio piger ATCC 29098]
Length = 289
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 127/297 (42%), Gaps = 34/297 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--------EVRGVIAPHAGYSYSGR 55
+R+ + AG +YTD L E++ +L++ + + G++ PHAG+ YSGR
Sbjct: 3 LRQPAVAGRFYTDVPADLRAEVESFLKQGAALPASALAAEDAAHLAGLMLPHAGHVYSGR 62
Query: 56 AAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
++ + R VFLL P+H ++ A ++TP+G +P+D E+ L
Sbjct: 63 VIGATLAHV---RLPRTVFLLCPNHTGLGTPLSVWPAGAWQTPLGPVPVDGEMARLLCEA 119
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV- 173
+ D+ EHS+E+ LP+L + +VP+ VG A G+ A+ +
Sbjct: 120 DEDFSADVMGHVREHSIEVLLPFLQVCAGDAPLHVVPVCVGTGQAPVLRTAGQRLARVLE 179
Query: 174 -----DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
D VSSD HY+ + + K D + ++ GD DA
Sbjct: 180 RCRDRDGQRPLLLVSSDM---------NHYEDQQTTLRK-----DDLALEAALKGDADAL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCS--TKIKIKFLRYEQSSQCKTKRDSSVSYA 283
+ ++CG P+++ + L + ++ + +E S+ + V YA
Sbjct: 226 LATVARAGISMCGAAPLALAFYALHAWTPDATVRAALVMHETSAAVSHDTEHVVGYA 282
>gi|83648587|ref|YP_437022.1| dioxygenase [Hahella chejuensis KCTC 2396]
gi|83636630|gb|ABC32597.1| predicted dioxygenase [Hahella chejuensis KCTC 2396]
Length = 259
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 98/189 (51%), Gaps = 11/189 (5%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R+ + +G +Y N++ L+E + ++ + SP + +IAPHAGY YSG A A+
Sbjct: 3 VRKPAVSGLFYPANAEDLSETVSRYIATSPSFDHSP--KAIIAPHAGYVYSGAIAGVAYS 60
Query: 63 NIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ ++ IS+V LLGPSH A ++ + TP+G + +D + + +L + + +
Sbjct: 61 ALHNSAKRISKVVLLGPSHRVGFRGIAAPSSDAFSTPLGAIAIDADNLVKLASLPQVVTL 120
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
D EHS+E+HLP+L + + ++ P+++G +AE + +
Sbjct: 121 D-SAHAQEHSLEVHLPFLQQCLD--CFELTPLVIGDADAE---LVAEVLELLWGGDETLI 174
Query: 181 SVSSDFCHW 189
+S+D H+
Sbjct: 175 VISTDLSHY 183
>gi|198282863|ref|YP_002219184.1| hypothetical protein Lferr_0726 [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|198247384|gb|ACH82977.1| protein of unknown function DUF52 [Acidithiobacillus ferrooxidans
ATCC 53993]
Length = 282
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 73/289 (25%), Positives = 121/289 (41%), Gaps = 34/289 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL----REAGLPKSPEVRGVIAPHAGYSYSGRAAA- 58
+R A+ AG +Y + L E++ L ++ +P + +I PHAGY YSG AA
Sbjct: 11 VRPAAVAGMFYPGEAAVLRTEVERLLARAEQDGEAASAPWPKAIIVPHAGYIYSGAVAAS 70
Query: 59 -YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
YA I RV LLGP+H AL +TP+G + +D +E L +
Sbjct: 71 GYALLAKGRGHIRRVVLLGPAHRLPFRGLALPGVQAMQTPLGTVAVDQAGVEALAGLPEV 130
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
M EH++E+ LP++ +V + +VP++VG + A R+ K
Sbjct: 131 REMP-AAHAQEHALEVQLPFIQEVLGD--VSVVPLVVGDARPDEVA---RVLEKLWGGEE 184
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
+SSD H Y Y + + H ++E + + F ++++
Sbjct: 185 TVIVISSDLSH------YHPYAEARAIDHHTVEEILR-------------FDPTPIDHEQ 225
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG PI+ L + + + + S RD+ V YA+ A
Sbjct: 226 A-CGATPINGLLPV--ARKHHLHPRLVGLCNSGDTAGSRDAVVGYAAVA 271
>gi|57641412|ref|YP_183890.1| hypothetical protein TK1477 [Thermococcus kodakarensis KOD1]
gi|119391268|sp|Q5JJC3.1|Y1477_PYRKO RecName: Full=MEMO1 family protein TK1477
gi|57159736|dbj|BAD85666.1| hypothetical protein, conserved, DUF52 family [Thermococcus
kodakarensis KOD1]
Length = 291
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 129/288 (44%), Gaps = 10/288 (3%)
Query: 4 IRRASHAGSWYTDNSKR--LAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+R + AGS+Y + + + E L E G + +APHAGY +SG A+ +
Sbjct: 2 VRYPAVAGSFYPADEELVLMLERFFSDLGEEG--NDRRITAGVAPHAGYIFSGYTASRTY 59
Query: 62 GNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
I + F +LGP+H A+ + TP+G++ +D E+ E+ +
Sbjct: 60 KAIFEDGLPETFVILGPNHTGLGSPIAVHPPGTWITPLGEIEVDGELAREIAKISGIADL 119
Query: 121 DICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D + EHS+E+ +P++ + E G +KIVPI +G + + GR + +
Sbjct: 120 DDLAHKYEHSIEVQVPFIQYLAEKAGKEVKIVPITLGIQDEDVAEDLGRAIFEASRELGK 179
Query: 179 FFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
V S+DF H+G + Y+ + + + I+ D + I D + + E +
Sbjct: 180 DVVVIASTDFMHYGQIYGYVPFRARADELPHRIKEWDFRIIRRILDFDVRGMFEEIREMN 239
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+T+CG + + + + ++ + L Y S + ++ V YAS
Sbjct: 240 HTMCGPGGVGTAI-VYSRLAGAVEAELLHYTTSFEVSRSTEAVVGYAS 286
>gi|332157855|ref|YP_004423134.1| hypothetical protein PNA2_0212 [Pyrococcus sp. NA2]
gi|331033318|gb|AEC51130.1| hypothetical protein PNA2_0212 [Pyrococcus sp. NA2]
Length = 292
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 117/250 (46%), Gaps = 6/250 (2%)
Query: 44 IAPHAGYSYSGRAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLP 102
IAPHAGY +SG A+ + I I F++ GP+H A+ + + TP+G +
Sbjct: 42 IAPHAGYIFSGYTASRTYKAIYEDGIPETFVIFGPNHTGLGSPIAVYPSGEWVTPLGSVK 101
Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAE 160
+D ++ +E+ + +D + EHS+E+ LP++ + E G +KIVPI +G + +
Sbjct: 102 VDSKLAKEIVKVSRIADLDDLAHKYEHSIEVQLPFIQYIAEKAGVEVKIVPITLGIQDED 161
Query: 161 -NEAMYGRLF-AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
+E++ +F A S+DF H+GS + Y+ + + + + D +
Sbjct: 162 ISESLGKAVFEASRSLGRDVIILASTDFMHYGSFYGYVPFTGRPDELPNLVREGDMRVIR 221
Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
I D D + + ++T+CG + + + S + + L Y S + D+
Sbjct: 222 RILEFDLDGMFEEIRRMNHTMCGPGGVGAAI-VFSKLSGAERAELLHYTTSFEVSRSTDA 280
Query: 279 SVSYASAAAK 288
V YAS +
Sbjct: 281 IVGYASIVMR 290
>gi|222099579|ref|YP_002534147.1| hypothetical protein CTN_0605 [Thermotoga neapolitana DSM 4359]
gi|254806533|sp|B9K748.1|Y605_THENN RecName: Full=MEMO1 family protein CTN_0605
gi|221571969|gb|ACM22781.1| Hypothetical Protein CTN_0605 [Thermotoga neapolitana DSM 4359]
Length = 277
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 128/296 (43%), Gaps = 37/296 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEE-----LDGWLREAGLPKSPEVR-----GVIAPHAGYSYS 53
IR AG +Y L E+ LD + LP E G+++PHAGY YS
Sbjct: 2 IREPVVAGLFYPSRKDELIEQIRICFLDRRIGPGELPGPVEKNLQNPVGLVSPHAGYIYS 61
Query: 54 GRAAAYAFGNIDPTS-ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+ F + S V ++GP+H + ++TP+G +P++ E +E L
Sbjct: 62 GPVAAWGFLEVARIGKPSLVVIIGPNHTGLGKPVGIWPEGFWETPLGRVPVNEEAVEILL 121
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV---GAVNAENEAMYGRLF 169
+ ++ D EHS+E+ LP+L VF IVP+ + AE+ A R
Sbjct: 122 NSSRYAEEDTLSHLKEHSIEVQLPFLQFVFGD--FSIVPVCLMDQSPTVAEDLAFAVREL 179
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
K N ++S + HY+ + + K D + ++ IE D
Sbjct: 180 MKSF---RNVLIIAST--------DLNHYEDQKTTLKK-----DYLVVEAIEKRDSRLLY 223
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+YL++ D ++CG ++V L+ LG S +I L++ S V Y SA
Sbjct: 224 EYLVKEDISMCGYGGVAVLLN-LGFSSVRI----LKHATSGDVSGDTLEVVGYLSA 274
>gi|46200973|ref|ZP_00056132.2| COG1355: Predicted dioxygenase [Magnetospirillum magnetotacticum
MS-1]
Length = 456
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 128/292 (43%), Gaps = 37/292 (12%)
Query: 1 MEKIRRASHAGSWY----TDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRA 56
M IR + AG +Y + +++L+ LD + + P + +IAPHAG+ YSG
Sbjct: 1 MTAIRPTAVAGQFYPADFAEANRQLSAFLDHAVAAPCAGRRP--KALIAPHAGWVYSGPV 58
Query: 57 AAYAFGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA A+ + P S SRV LLGPSH AL ++ + +P+G + LD + L
Sbjct: 59 AAGAYALLRPFRGSWSRVVLLGPSHRVGFQGMALCSSDQWASPLGAVALDKD-WSRLAGV 117
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
++D + EHS+E+H+P+L ++P+++G +A + + G L A +
Sbjct: 118 AGVGVLDQAHAQ-EHSLEVHVPFLQATIGD--FTLLPVVIG--DASPDMVAGLLDALWGG 172
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
D +S+D H Y+ YD+ +++ A++ F +
Sbjct: 173 D-ETLIVISTDLSH------YLPYDQCRDTDGQTVAAIEH-------------FDALAIS 212
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
D CGR P+ L ++I L S +D V Y S A
Sbjct: 213 RDGA-CGRIPVGGLLTAAKR--RGLEIVTLDVRNSGDTAGPKDRVVGYGSWA 261
>gi|94970386|ref|YP_592434.1| hypothetical protein Acid345_3359 [Candidatus Koribacter versatilis
Ellin345]
gi|94552436|gb|ABF42360.1| protein of unknown function DUF52 [Candidatus Koribacter versatilis
Ellin345]
Length = 271
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/286 (24%), Positives = 119/286 (41%), Gaps = 24/286 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR + AG +Y N ++L ++ + K + G + PHAGY YSG A +
Sbjct: 5 IREPAVAGRFYPGNPEKLTADIGDYTTPTNAEKLAAI-GCVVPHAGYMYSGHVAGAVYER 63
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+D R +L P+H A+ ++TP+GD +D E+ ++L A D
Sbjct: 64 LDLPK--RFVILCPNHTGAGHPLAVMREGSWRTPLGDAAIDAELADQLLAAFPLTSEDAD 121
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV- 182
EH++E+ LP+L + + VP+ VG + + G AK V + V
Sbjct: 122 AHRTEHALEVQLPFLQILVPN--FRFVPVAVGTGRFDVLSALGESIAKVVQSAAERVMVI 179
Query: 183 -SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM-DIIETGDPDAFKKYLLEYDNTIC 240
SSD H+ + + D+ + + + ALD G+ D++ E + ++C
Sbjct: 180 ASSDMNHYENDADTRVKDR---LAIERLLALDAKGLYDVVH------------EKNISMC 224
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
G P L + + ++Y S RD V YA A
Sbjct: 225 GYGPAVAMLTAAKRVGAS-RAELIKYATSGDVSGDRDMVVGYAGIA 269
>gi|390939154|ref|YP_006402892.1| putative dioxygenase [Desulfurococcus fermentans DSM 16532]
gi|390192261|gb|AFL67317.1| putative dioxygenase [Desulfurococcus fermentans DSM 16532]
Length = 283
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/301 (26%), Positives = 125/301 (41%), Gaps = 43/301 (14%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLRE---AGLPKSPEVR------GVIAPHAGYSYS 53
K R AG +Y D + L ++ + G P SP G +APHAGY YS
Sbjct: 2 KKRSPIVAGYFYPDKPEELRSVIEWSFKHRIGPGKPPSPSDTPTTNSIGYVAPHAGYIYS 61
Query: 54 GRAAAYAFG----NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
G AA+ + N P +I +LG +H ++ V++TP+GDL +D E+
Sbjct: 62 GPVAAHVYFDMALNKKPDTI---VILGTNHTGLGRPVSVYPEGVWETPLGDLVVDDEIGR 118
Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
+ + D EHS+E+ LP++ ++ G +KI PI++G ++ + A R
Sbjct: 119 LIVENSEIAEFDEYAHLEEHSIEVQLPFIVYIYGGD-VKITPIVIG-IHTPDVA---RDL 173
Query: 170 AKYVDDPSN------FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
AK + S SSDF H+ S H+ D M +D I
Sbjct: 174 AKSIHKASMSTGKRIIVIASSDFNHYES--------------HEETSRKDSMAIDRILKL 219
Query: 224 DPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
D D +L D +ICG I + + K ++ L+Y S V YA
Sbjct: 220 DTDGLYNVILHNDISICGPGGIMTLMEYIKKLGGKAQL--LKYATSGDTSGDYSHVVGYA 277
Query: 284 S 284
+
Sbjct: 278 A 278
>gi|332796340|ref|YP_004457840.1| hypothetical protein Ahos_0655 [Acidianus hospitalis W1]
gi|332694075|gb|AEE93542.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 284
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 135/299 (45%), Gaps = 39/299 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGW-----LREAGLPKSPEVRG------VIAPHAGYSYS 53
R + AG++Y + + L + + W + LP P+ +G I PHAGY YS
Sbjct: 3 RLPAVAGAFYEADPQELKDRI-VWSFTHPVGPGKLPTVPKEKGKRDNLFFIVPHAGYIYS 61
Query: 54 GRAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+++ + V +LGP+H Y +L ++TP+GD+ +D E+ EL
Sbjct: 62 GPVAAHSYYYLASEGKPDLVIILGPNHTGYGSYVSLWNKGCWETPLGDVKIDEEMAMELV 121
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-----NEAMYGR 167
+ +D EHS+E+ +P+L F+ IKI+PI++ E E +Y R
Sbjct: 122 KYSEVIDIDEQAHLYEHSVEVQIPFLQFFFDSE-IKIIPIVILYQTPEIAEYIAEGIY-R 179
Query: 168 LFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM-DIIETGDPD 226
L K+ + + SSD H+ + Y K I K IE LD G+ D++E
Sbjct: 180 LMQKHPEKDIVVLA-SSDMNHYDPYD--ITYKKDEMAIEK-IEQLDYKGLYDVVENK--- 232
Query: 227 AFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
D T+CG P+ V + + + K I L++ S + S V Y +A
Sbjct: 233 ---------DVTMCGYAPVMVAMILAKKFNKKPYI--LKHATSGDTSGDKSSVVGYLAA 280
>gi|292493574|ref|YP_003529013.1| hypothetical protein Nhal_3603 [Nitrosococcus halophilus Nc4]
gi|291582169|gb|ADE16626.1| protein of unknown function DUF52 [Nitrosococcus halophilus Nc4]
Length = 261
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 100/215 (46%), Gaps = 12/215 (5%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +R+ + AG +Y + L ++ + + +P + +IAPHAGY YSG AA A
Sbjct: 1 MATLRQPAVAGLFYPADLVSLQTQIQNFFKTNDARGNPP-KAIIAPHAGYRYSGPVAASA 59
Query: 61 FGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + I RV LLGPSH A S A + TP+G +P++ + +++ A +
Sbjct: 60 YACLRKVQGRIHRVILLGPSHRVPFYGIATSRANGFATPLGIVPVNQDDLQQALALPQVR 119
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+D EHS+E+ LP+L ++ +VP++VG + + LF
Sbjct: 120 ALDEP-HALEHSLEVQLPFLQEILGD--FSLVPLVVGHSSPQEVKEVLDLFW---GSEET 173
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD 213
VSSD H+ D+ K+IEAL
Sbjct: 174 LIIVSSDLSHYHDYTTAQQLDR---ATTKAIEALQ 205
>gi|289193137|ref|YP_003459078.1| protein of unknown function DUF52 [Methanocaldococcus sp. FS406-22]
gi|288939587|gb|ADC70342.1| protein of unknown function DUF52 [Methanocaldococcus sp. FS406-22]
Length = 287
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 132/301 (43%), Gaps = 33/301 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEVRG-------VIAPHAGYSY 52
M KIR + AG +Y + L + ++ +L + G PKS V G ++ PHAGY Y
Sbjct: 1 MTKIRYPAVAGLFYPSHPDELIDMIEQCYLHKFG-PKSMPVHGTYEKPIGLLCPHAGYVY 59
Query: 53 SGRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
SG A+++ +D +LGP+H ++ +++TP+GD+ D E +
Sbjct: 60 SGPIQAHSYYELSKRVDALEEITAVILGPNHTGLGSGVSVMDG-IWRTPLGDVRCDEEFV 118
Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYG 166
EEL + +D EHS+E+ LP+L ++ KIVPI + + E G
Sbjct: 119 EELWTKCEIVDLDETAHLNEHSIEVQLPFLKHLELLNVAKFKIVPICMMFQDYETAVEVG 178
Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
AK + + + SSD H+ + KK ++ K I+E +
Sbjct: 179 YFIAKIAKELNRRVVIIASSDLTHYEPQ---EIASKKDAIVIKH----------ILEMNE 225
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ ++ ++ Y+ ++CG P+ L + K K L Y S + V YAS
Sbjct: 226 KELYED-VINYNISMCGYGPVIAMLKAMKTLGAK-KANLLAYATSGDITGDYSAVVGYAS 283
Query: 285 A 285
A
Sbjct: 284 A 284
>gi|376294830|ref|YP_005166060.1| hypothetical protein DND132_0038 [Desulfovibrio desulfuricans
ND132]
gi|323457391|gb|EGB13256.1| protein of unknown function DUF52 [Desulfovibrio desulfuricans
ND132]
Length = 269
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 114/276 (41%), Gaps = 21/276 (7%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
AG +Y L +DG+L A ++ + + PHAGY +SG G D S
Sbjct: 8 AGRFYDAQPDELYAMVDGFLGLAEKRQAEQTLLAMVPHAGYVFSGAVCGKTLGTADLAST 67
Query: 70 SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
V LLGP+H + AL + P G L +D + E L A + D EH
Sbjct: 68 --VLLLGPNHTGRGERFALWPDGSWLIPGGSLAVDTGLAEALLAADPAIMADTAAHMGEH 125
Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS--VSSDFC 187
S+E+ LP+L ++ IVP+ V A + ++ GR + + D S VSSD
Sbjct: 126 SLEVVLPFLFRLNPA--TTIVPVCVSAPSLDSLERVGRAVGRVLADYPEPVSIVVSSDMS 183
Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
H+ + H+ +D M ++ + T P + ++CG P+++
Sbjct: 184 HY--------------ISHEDAREMDGMALEPMMTLAPARLYDTVRSRRISMCGVLPMTM 229
Query: 248 FLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
L G + + + + Y S + + V YA
Sbjct: 230 GLFAAGELGAE-RAELVAYATSGEVSGDFEQVVGYA 264
>gi|193213539|ref|YP_001999492.1| hypothetical protein Cpar_1900 [Chlorobaculum parvum NCIB 8327]
gi|193087016|gb|ACF12292.1| protein of unknown function DUF52 [Chlorobaculum parvum NCIB 8327]
Length = 287
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 74/287 (25%), Positives = 119/287 (41%), Gaps = 17/287 (5%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVRGVIAPHAGYSYSGRAAAYAF 61
IR + AG++Y + L L + P S P + VI PHAGY +SGR AA AF
Sbjct: 7 NIRNPAVAGAFYPADPSELNTLLAELFEKTSQPSSQPSPKAVIVPHAGYRFSGRVAARAF 66
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEV---IEELKATGKFE 118
++ I LLG +H A+ ++TP+G +PLD + + EL T +
Sbjct: 67 STLEGQHIRTAVLLGNAHAALFDSIAIDPHDGWRTPLGTVPLDKMMRIRLIELDPT-LYH 125
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
DI + +H +E+ LP L KI+P+L G A + D
Sbjct: 126 QSDIA-HQRDHVLEVQLPLLQHALASGF-KILPVLFGQNPAGTYWQCAEALLSLLSDDDL 183
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
+ SSD H Y Y + +++E L ++ + +E + + ++ + +
Sbjct: 184 LIA-SSDLSH------YPTYHDAQAIDTRTLEHLVELDIQGLEQHEQEVMQRRIPGLETV 236
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQC-KTKRDSSVSYAS 284
CG + L + K K L Y S + RD+ V Y +
Sbjct: 237 FCGPDAVKTVLEI--GSRLGWKGKLLSYRNSGDADQASRDAVVGYGA 281
>gi|320354378|ref|YP_004195717.1| hypothetical protein Despr_2282 [Desulfobulbus propionicus DSM
2032]
gi|320122880|gb|ADW18426.1| protein of unknown function DUF52 [Desulfobulbus propionicus DSM
2032]
Length = 268
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 116/285 (40%), Gaps = 30/285 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R + A +Y N +L +D + + + V+ PHAGY YSG A +
Sbjct: 3 RMPAVADRFYPGNPDQLRAAMDMLVPVVAEERKRDALAVVMPHAGYVYSGATAGATISRV 62
Query: 65 D-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
P + V ++GP+HH AL T ++ P+G +P+D ++ + + D
Sbjct: 63 RVPET---VLIMGPNHHGRGQALALGTEE-WRMPLGTVPIDRQLATAILQSSSIIREDQE 118
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFFS 181
+ EHS+E+ +P+L +V + IVP++V + + R A + S
Sbjct: 119 AHQYEHSLEVQVPFLQQVQPN--LAIVPLVVSMIPFDLCQTVARELAVAIGSLRRSVLMV 176
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H+ SR KK D++ +D I DP +L ++CG
Sbjct: 177 ASSDMSHYESR---QQASKK-----------DRLAIDRILAMDPQGLYATVLGQHISMCG 222
Query: 242 RHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
P ++ L LG + +RY S + V YA
Sbjct: 223 VIPATIALLAAQELGATTA----DLIRYTDSGEASGDTSQVVGYA 263
>gi|67470020|ref|XP_650981.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56467653|gb|EAL45595.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449704833|gb|EMD45002.1| Memo family protein [Entamoeba histolytica KU27]
Length = 284
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 104/249 (41%), Gaps = 30/249 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG----VIAPHAGYSYSGRAAAY 59
IR+A AG W++ N LA E+D ++ A L K P ++G I+PHAG+ YSG+ A Y
Sbjct: 6 IRKAIGAGRWFSANGNELANEVDHYINNA-LNKLPSIQGKILGCISPHAGFRYSGQTAGY 64
Query: 60 AFGNIDPTSISR-----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
F + S VF+LG SH A+ TPI +D E I
Sbjct: 65 DFAALKRDSEINGKPDVVFILGFSHSSRFDCAAVMDGKAISTPIATTEIDNEAITMFCEG 124
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+ EHS E LP++ + G +K+V +L+G +E +
Sbjct: 125 RNYLKCFYKPHNGEHSAENELPFVQRALPG--VKVVMVLIGTHKSEVLEQVSQGLQAVCS 182
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA-FKKYLL 233
+ SSD H D+ H ++ K+ DK + + E D K+
Sbjct: 183 KKKMYVIASSDMLH----------DESHNLVEKT----DKETIQLTEKMDIKGLLSKW-- 226
Query: 234 EYDNTI-CG 241
Y+N I CG
Sbjct: 227 SYENQIYCG 235
>gi|383320694|ref|YP_005381535.1| dioxygenase [Methanocella conradii HZ254]
gi|379322064|gb|AFD01017.1| putative dioxygenase [Methanocella conradii HZ254]
Length = 264
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 123/286 (43%), Gaps = 34/286 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG +Y + + L +++ PK + GV+ PHAGY+YSG AA + +
Sbjct: 1 MRNPAVAGMFYPGSKEALRQQIASLAPRQPTPKI-DAMGVVVPHAGYAYSGGVAARVYAS 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I+ +LGP H + A+ST +KTP+G + + + I EL G + +I
Sbjct: 60 IE--GAQTFIILGPRHGWEGSAIAVSTEP-WKTPLGVVDVAHDFI-ELLPPGIIDRDEIA 115
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE------NEAMYGRLFAKYVDDPS 177
EHS+E+ +P+L FEG +IVPI +G + E NE M KY
Sbjct: 116 -HSREHSLEVQVPFLQYFFEG--FRIVPIAIGLQDYETVSEVANELMMA--LEKY--PKK 168
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
SSDF H+ D + + + IE LD +G F +
Sbjct: 169 AVIVASSDFSHYEPVEAAKRKDSR---LIERIEKLDVVG-----------FYDEIARLSA 214
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
T CG PI+ + K + + L Y S R+ V YA
Sbjct: 215 TCCGYGPIAAMMLCCKRMGAK-RAELLAYATSGDVTGDRE-VVGYA 258
>gi|284162154|ref|YP_003400777.1| hypothetical protein Arcpr_1045 [Archaeoglobus profundus DSM 5631]
gi|284012151|gb|ADB58104.1| protein of unknown function DUF52 [Archaeoglobus profundus DSM
5631]
Length = 263
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 110/248 (44%), Gaps = 24/248 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+RR + AG +Y N + L L ++ V ++PHAGY YSGR A +
Sbjct: 1 MRRPAVAGMFYPSNRESLLAMLKEFV---DYHPDDSVIACVSPHAGYVYSGRTAGRVYSL 57
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I + ++GP+H Y ALST T + TP+G++ +D+E ++ + K +D
Sbjct: 58 I--PQVETYVIVGPNHTGYGSPVALSTDT-WITPLGEVEVDMEFVKAMPK--KIIDLDET 112
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDPSNFFS 181
EHS+E+ +P+L + G KIVPI +G + E E + AK
Sbjct: 113 AHRFEHSLEVQVPFLQYINMGRKFKIVPICMGMQDEETAREVAEEIITAKEETGKEIVVV 172
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H+ + + D++ ++ I + D F + ++CG
Sbjct: 173 ASSDMHHY--------------LPDEECRRRDRIVIEAILSMDVSKFYDTIYRMQASVCG 218
Query: 242 RHPISVFL 249
PI+V +
Sbjct: 219 YGPIAVAM 226
>gi|89899212|ref|YP_521683.1| hypothetical protein Rfer_0398 [Rhodoferax ferrireducens T118]
gi|89343949|gb|ABD68152.1| protein of unknown function DUF52 [Rhodoferax ferrireducens T118]
Length = 274
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/233 (26%), Positives = 101/233 (43%), Gaps = 35/233 (15%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPEVRGVIAPHAGYSYSGRAA 57
M +R + AG++Y + L ++ L +A + P + +I PHAGY YSG A
Sbjct: 1 MSLVRPPAVAGTFYPGQASVLLNDVTTLLTQARPGANLREPAPKALIVPHAGYIYSGATA 60
Query: 58 AYAFGNI--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A A+ + SI RV LLGP H AL + TP+G + +D + + +
Sbjct: 61 ARAYAELAGGRASIQRVVLLGPVHRVPVRGLALPGVDAFATPLGRIEVDQDAVAAIA--- 117
Query: 116 KFELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-----EAMYG 166
L + V A EHS+E+ LP+L V + K+VP+ VG EA++G
Sbjct: 118 --HLPQVVVSRAAHAQEHSLEVQLPFLQAVLDD--FKLVPLAVGDATPAQVAEVIEALWG 173
Query: 167 RLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDI 219
+SSD H ++ Y V ++++ + ++ I
Sbjct: 174 --------GDETLIVISSDLSH------FLPYGTAQAVDSETVQNMMQLNSSI 212
>gi|386875004|ref|ZP_10117208.1| memo-like protein [Candidatus Nitrosopumilus salaria BD31]
gi|386807164|gb|EIJ66579.1| memo-like protein [Candidatus Nitrosopumilus salaria BD31]
Length = 276
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 118/287 (41%), Gaps = 22/287 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR------GVIAPHAGYSYSGRAA 57
IR AG +Y L + + P S V GVI PHAGY YSG A
Sbjct: 2 IREPVVAGQFYPRRKDDLENMIKYCMEHKYGPGSKPVLSNEKIFGVICPHAGYVYSGPTA 61
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+++ + + V ++GP+H A +KTP+G + +D E ++ KF
Sbjct: 62 CHSYMSFASQNPELVIIIGPNHFGIGKDVATMVDAQWKTPLGMVQIDSESARKISEISKF 121
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
+D +HS+E+ +P L ++ +I+PI++ A + E G ++ +
Sbjct: 122 VEIDEYSHSQDHSLEVQIPMLQEILSSDF-QILPIILRAQDMETAMDVGNAVSEIAKRKN 180
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
SSDF H+ +++ HK +AL + +DI + + F + L
Sbjct: 181 TIIVASSDFTHY----------EENSFAHKQDKALIEPILDI----EVERFYQVLRNKRV 226
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
T CG ++ + N K + L Y S S V Y++
Sbjct: 227 TACGYGAMASTMIACKNLGAT-KGELLSYATSGDVSGDTKSVVGYSA 272
>gi|152990853|ref|YP_001356575.1| hypothetical protein NIS_1109 [Nitratiruptor sp. SB155-2]
gi|151422714|dbj|BAF70218.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2]
Length = 264
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 97/208 (46%), Gaps = 17/208 (8%)
Query: 41 RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
R +I+PHAGY YSG A A + T+ RV ++GPSH Y + S ++TP GD
Sbjct: 45 RAIISPHAGYVYSGFTANVAHRLLQNTAAKRVVVIGPSHRVYLSGISGSFYDAFETPCGD 104
Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
L +D +E L KF + + EHS E+ +P++ ++ ++ G V+
Sbjct: 105 LLIDTAYLEMLAQ--KFGIGFVPEAHQEHSTEVQMPFIHHYLPN--TSVIELVYGDVDPR 160
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHW--GSRFNYMHYDKKHGV-------IHKSIEA 211
N AM + + + DP N +S+D H+ S+ + GV +H EA
Sbjct: 161 NVAM---ICEEVLSDPHNVVVISTDLSHYYPLSKAKELDMHCLQGVAELDITELHNGCEA 217
Query: 212 LDKMGMDIIETGDPD-AFKKYLLEYDNT 238
K+G++ I D + +L+Y +
Sbjct: 218 CGKIGVEAIILAAKDLGLRPRILDYRTS 245
>gi|57233975|ref|YP_182025.1| hypothetical protein DET1313 [Dehalococcoides ethenogenes 195]
gi|57224423|gb|AAW39480.1| protein of unknown function /conserved hypothetical protein
TIGR00296 [Dehalococcoides ethenogenes 195]
Length = 438
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 75/288 (26%), Positives = 125/288 (43%), Gaps = 35/288 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+R AS AG +Y + +++ E+ G +E L G + PHAGY YSG A
Sbjct: 2 LRPASAAGRYYPASKEKIRSVIAPMEVKGVAKERYL-------GAVMPHAGYPYSGGVAM 54
Query: 59 YAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+D P + V +LGPSH + A+ + +++TP+G++ +D + + +
Sbjct: 55 AVASRLDLPET---VIILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSTLAHSIMKYCRH 111
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
D + EHS+E+ LP L + F+ IKIVP+ V AE A G A + +
Sbjct: 112 IKADPSAHQYEHSIEVQLPIL-QYFKPD-IKIVPLTVSFGKAETLAEIGHGIASALRETG 169
Query: 178 N--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
SSD H+ S+ + H+ D + +D I D + + +
Sbjct: 170 REAIIIASSDMTHYESQAD----------AHQK----DSLALDAIIKLDAAQMLERIQQN 215
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
T+CG P++ L + K + + + Y+ S D V YA
Sbjct: 216 HITMCGFAPVAAMLTAVKEMGAK-RARVVAYQTSGDVTHHLDQVVGYA 262
>gi|315229903|ref|YP_004070339.1| dioxygenase [Thermococcus barophilus MP]
gi|315182931|gb|ADT83116.1| hypothetical dioxygenase [Thermococcus barophilus MP]
Length = 292
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 129/291 (44%), Gaps = 8/291 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR + AGS+Y + L L+ + + G L ++ +APHAGY +SG A+ +
Sbjct: 2 IRYPAVAGSFYP-TGEELVIMLERFFSDLGELGSERKITAGVAPHAGYVFSGYTASRTYK 60
Query: 63 NIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I + F ++GP+H A+ + TP+G + +D E+ + + +D
Sbjct: 61 AIYEDGLPETFVIIGPNHTGLGSPVAVYPEGEWITPLGGVEVDAELAKAIVKNSSIADLD 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ EHS+E+ LP++ + + G IKIVPI +G + E G+ + +
Sbjct: 121 ELAHKYEHSIEVQLPFIQYIADKAGRKIKIVPITLGLQDEEVAEDLGKAIFEASQELGRD 180
Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
V S+D H+G + Y+ + + + I+ D + I D + + D+
Sbjct: 181 VVVIASTDMMHYGYMYGYIPFRARGEDLLGRIKEWDFRVIQRILEFDYKGMFDEIRKMDH 240
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
T+CG ++ + + + + + L Y S + D+ V YAS +
Sbjct: 241 TMCGPGGVAAGI-VFSRLADANEAELLHYTTSFEVSRSTDAIVGYASVVMR 290
>gi|406911104|gb|EKD50972.1| hypothetical protein ACD_62C00382G0002 [uncultured bacterium]
Length = 499
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 104/228 (45%), Gaps = 20/228 (8%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
E IR+ AG++Y ++ L ++ +L VR V+ PHAGY YSG AA AF
Sbjct: 44 EVIRQPIAAGTFYPNDPTALYHDIKRYLGAEPSLGYTNVRAVLVPHAGYVYSGAVAASAF 103
Query: 62 GNIDPTSISRVFLLGPSHH---YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ P +RVF+L SHH +L+ + Y P +PLD V++ELK F
Sbjct: 104 REVSP-DFTRVFILA-SHHNGDVTLSGVSLTEVSHYAIPGAKIPLD-GVVDELKKNPLF- 159
Query: 119 LMDICVDEAEHSMEM---HLPY---LAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
+ H+M+M LP+ L + E I+P++VGA++ ++ KY
Sbjct: 160 ----VHEPLAHTMQMIEVELPFLHALKGLSEKPEFTIIPMIVGALSKTQTEELAQILNKY 215
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKK--HGVIHKSIEALDKMGMD 218
+D F S D H+ + + D + ++ + ALD D
Sbjct: 216 -NDAKTLFVFSVDLSHYFNAKEALTLDARTIDALMGRDTTALDSATTD 262
>gi|88604124|ref|YP_504302.1| hypothetical protein Mhun_2891 [Methanospirillum hungatei JF-1]
gi|88189586|gb|ABD42583.1| protein of unknown function DUF52 [Methanospirillum hungatei JF-1]
Length = 264
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 132/294 (44%), Gaps = 39/294 (13%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K R ++ AG +Y RL + L R + S + GV++PHAGY YSG+ AA AF
Sbjct: 2 KSRASTVAGMFYPGEPGRLRQLLSELFRN--VTSSGDAAGVVSPHAGYVYSGKTAAKAFA 59
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
D ++GPSH + P C + ++TP+G L D+E+ + +
Sbjct: 60 AFDEKFDGTFLVIGPSHRGF-PTCI--SQVPWETPLGMLEPDIEL---------GSCIHL 107
Query: 123 CVDE------AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
VDE +E+S+E+ +P++ F +IVP+++G + + ++
Sbjct: 108 PVDERAMSYGSENSLEVQMPFIQYRFPRS--RIVPLMMGHQTLQEVHRISSIIITALEKY 165
Query: 177 SNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
+ SSDF H Y+ +K + H +IEAL +M + P+ F + +
Sbjct: 166 QKPVKIVASSDFSH------YVSAEKAYRDDHYAIEALLQMNV-------PEFFSR-IQS 211
Query: 235 YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
++ T CG PI + +L K + L Y S + D V YA+ A +
Sbjct: 212 HNITACGYGPIGCMMEVL-RTRGKTRCDLLEYTTSGETSGDYDQVVGYAALAVR 264
>gi|339499622|ref|YP_004697657.1| hypothetical protein Spica_0999 [Spirochaeta caldaria DSM 7334]
gi|338833971|gb|AEJ19149.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Spirochaeta caldaria DSM 7334]
Length = 291
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 117/278 (42%), Gaps = 44/278 (15%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR------GVIAPHAGYSYSGRAAA 58
R AG +Y D L L+ + +G+ + PE R G+I PHA + SG
Sbjct: 15 RSPVMAGMFYPDTDAALRIALNRYYEISGM-QGPETRTDCNPLGIIVPHAAWELSGLLLG 73
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-F 117
AF + I +V LLGP H +S + ++TPIGDL + ++EL + F
Sbjct: 74 RAFSLVRKRPIEQVVLLGPLHASTQEGIYVSESDSFETPIGDLWVHKNKVDELLSCSTIF 133
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP- 176
EL DI +EHS+E+ LP++A F + I+PIL+G + R D
Sbjct: 134 ELNDI-PHLSEHSLEVVLPWIADAFPQ--VSIIPILIGTSRTNHIHALSRALELCFSDEL 190
Query: 177 -SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALD--KMGMDIIETGDPDAFKKYLL 233
F +++ HYD EA + + +++++TGD +
Sbjct: 191 DKTLFIATTNVS--------AHYDND--------EAFEQGRRFLELVQTGDGAQLLQLAQ 234
Query: 234 EYDNTICGRHPISVFL-------------HMLGNCSTK 258
+ T CG ++ L H+LG S++
Sbjct: 235 QGHITACGAAGVAALLQCRFWEVAPLPKPHLLGQASSR 272
>gi|338730664|ref|YP_004660056.1| hypothetical protein Theth_0876 [Thermotoga thermarum DSM 5069]
gi|335365015|gb|AEH50960.1| protein of unknown function DUF52 [Thermotoga thermarum DSM 5069]
Length = 281
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 81/301 (26%), Positives = 134/301 (44%), Gaps = 47/301 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-----KSPEVR------GVIAPHAGYSY 52
IR AG++Y + +L + + + P K P + G+I PHAGY Y
Sbjct: 2 IRYPVVAGTFYPGSPTKLRQTIMDLITSPLGPGNLDFKVPVTQPLNKNVGIIVPHAGYVY 61
Query: 53 SGRAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
SG A +A+ + + + V L+GP+H K + + TP+G + +D + +
Sbjct: 62 SGPIAVHAY--VAAARLGKPNLVVLIGPNHTGRGAKVGVWDKGSWLTPLGKVEVDEQASK 119
Query: 110 ELKATGKFELMDICVDE-----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEA 163
L FE ++C + EHS+E+ LP+L F+ KI+PI + V+ + +
Sbjct: 120 LL-----FENCEVCKADFDSHLLEHSLEVQLPFLQFFFDE--FKILPISIFPVSIDLCKK 172
Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
+ L A + + F VS+DF HY+ + I K D+M ++ IE
Sbjct: 173 IAAGLDAIASEYKNTLFVVSTDF---------NHYENQDVTIKK-----DQMAIEKIEAK 218
Query: 224 DPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
DP + + +YD ++CG ++ FL+M T K K L + S V YA
Sbjct: 219 DPIGLVEVVDKYDISMCGVSAVASFLYM----KTFGKPKLLCHATSGDVSKDFLQVVGYA 274
Query: 284 S 284
S
Sbjct: 275 S 275
>gi|392374695|ref|YP_003206528.1| hypothetical protein DAMO_1639 [Candidatus Methylomirabilis
oxyfera]
gi|258592388|emb|CBE68697.1| conserved protein of unknown function [Candidatus Methylomirabilis
oxyfera]
Length = 267
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 118/289 (40%), Gaps = 37/289 (12%)
Query: 4 IRRASHAGSWYTDNSKRL---AEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
IRRA+ AGS+Y + L A +L W + + G + PHAGY YSGR A
Sbjct: 2 IRRAAVAGSFYPGTPELLRAQAADLITWDLQ-----TVRALGAVVPHAGYIYSGRVAGAV 56
Query: 61 FGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
+ + + VF +LGP+H + ++TP+G + +D E+ + +
Sbjct: 57 YARL---TFPDVFVILGPNHTGIGAGAGIMADGTWETPMGQVSIDTELARAILQNSRTIE 113
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPS 177
D EHS+E+ LP L G VPI + + G A+ + D S
Sbjct: 114 DDDLGHRREHSIEVQLPLLQA--HGRPFSFVPICLFSSEYAVCQDVGLAVAQAIAGSDRS 171
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
S+D H+ SR + +A D++ ++ I DP + + D
Sbjct: 172 VLMVASTDMSHYVSR--------------EQAKAKDRLAIEAIVACDPQRLYQVVRREDI 217
Query: 238 TICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
+CG HP + L LG S ++ + Y S+ S V YA
Sbjct: 218 AMCGFHPTTALLIAARELGATSGEL----ISYATSADVTRDDSSVVGYA 262
>gi|347755627|ref|YP_004863191.1| putative dioxygenase [Candidatus Chloracidobacterium thermophilum
B]
gi|347588145|gb|AEP12675.1| putative dioxygenase [Candidatus Chloracidobacterium thermophilum
B]
Length = 421
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 81/318 (25%), Positives = 134/318 (42%), Gaps = 40/318 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSPEVRGVIAPHAGYSYSGRAAA 58
+R A+HAG Y D + L + LD L G + ++ GV+APH RA A
Sbjct: 106 VRPAAHAGVSYPDEPEALRQRLDAILAHTPACPLGAARPEQLVGVVAPHIDLRVGERAYA 165
Query: 59 YAFGNID------PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
A+ I+ P F++ + HY +++ Y TP+G +P D E ++ L+
Sbjct: 166 PAYRLIEQLAATLPDGEPVTFVVLGTSHYGGDGLFVASRKDYATPLGAMPCDREFLDRLE 225
Query: 113 AT-GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL-----IKIVPILVGAVN---AENEA 163
G D EH++E +L +F+ HL +++VPIL +++ A +EA
Sbjct: 226 TRLGASISADDTPHRQEHAIEFQAVFLRHIFDRHLEAGRPVRMVPILCTSLHELYAADEA 285
Query: 164 MYGRLFAKYV------------DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA 211
R A+Y V D H G +F +D + +E
Sbjct: 286 CRERTQAEYRAFIEALQATLAEQTHRTLLLVGGDLAHVGPKFGD-RFDAQTRA--AELER 342
Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNT--ICGRHPISVFLHMLGNCSTKIKIKFLRYEQS 269
D +D + GD A ++ ++ +CG P+ +L L + L YEQ
Sbjct: 343 ADTELLDYVAAGDSAAVLDHIARDEDARRVCGFPPLMAYLDALAAFG-PTDGQVLHYEQW 401
Query: 270 SQCKTKRDSSVSYASAAA 287
+ T+ S+V+Y +AAA
Sbjct: 402 QERPTR--SAVTYGAAAA 417
>gi|256811124|ref|YP_003128493.1| hypothetical protein Mefer_1184 [Methanocaldococcus fervens AG86]
gi|256794324|gb|ACV24993.1| protein of unknown function DUF52 [Methanocaldococcus fervens AG86]
Length = 287
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 133/301 (44%), Gaps = 33/301 (10%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSPEVRG-------VIAPHAGYSY 52
M KIR + +G +Y + L + ++ +L + G PKS V G ++ PHAGY Y
Sbjct: 1 MNKIRYPAVSGLFYPSHPDELIDMIEQCYLHKYG-PKSMPVHGNYKKPVGLLCPHAGYIY 59
Query: 53 SGRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
SG A+++ +D +LGP+H ++ +++TP+GD+ D E I
Sbjct: 60 SGPIKAHSYYELSKRVDALEEITAVILGPNHTGLGSGVSVMDG-IWRTPLGDVKTDEEFI 118
Query: 109 EELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYG 166
+EL + +D EHS+E+ LP+L ++ +IVPI + + E G
Sbjct: 119 DELWKKCEIIDLDETAHLNEHSIEVQLPFLKHLELLNIAKFRIVPITMMLQDYETAVEVG 178
Query: 167 RLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
AK + + + SSD H+ + KK ++ K DI+E D
Sbjct: 179 YFIAKIAMELNRRVVIIASSDLTHYEPQEV---ASKKDAIVIK----------DILEM-D 224
Query: 225 PDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
A + ++ Y+ ++CG P+ L + + K K L Y S + V YAS
Sbjct: 225 ERALYEDVVNYNISMCGYGPVIAMLKAMKTLGAE-KSKLLAYATSGDMTGDYSAVVGYAS 283
Query: 285 A 285
A
Sbjct: 284 A 284
>gi|307721110|ref|YP_003892250.1| hypothetical protein Saut_1190 [Sulfurimonas autotrophica DSM
16294]
gi|306979203|gb|ADN09238.1| protein of unknown function DUF52 [Sulfurimonas autotrophica DSM
16294]
Length = 266
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 123/291 (42%), Gaps = 38/291 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLR--------EAGLPKSPEVRGVIAPHAGYSYSGR 55
IR+ + AG +Y + + + + + + EA L P R +I PHAGY YS
Sbjct: 3 IRKDAVAGQFYPASKEEIEKMFSHYNKIIDESIKDEAILNLKP--RAIIVPHAGYVYSAF 60
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A AF + + RV ++GPSH Y +++ Y+TP+G+L +D ++ +EL
Sbjct: 61 TANIAFRLLKNSHAKRVVVIGPSHRVYLNGTSVAEYDSYETPLGNLSIDKKLADEL--IE 118
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
KF+L +EHS E+ +P++ +V ++ G E+ G++ + D
Sbjct: 119 KFDLHFQADAHSEHSTEVQMPFVKNYLPN--ASVVELVYG---NEDPVNLGKVINYLLKD 173
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
+S+D H Y DK + + +EA+ K+ + G
Sbjct: 174 EDTTVVISTDLSH------YYDIDKANQLDSICLEAVAKLNPTELHQG------------ 215
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG+ I V ++ +K L Y S+ + V Y SAA
Sbjct: 216 -CEACGK--IGVEAMLIAAKENGLKPTLLDYRTSADASGDKSQVVGYMSAA 263
>gi|357632292|ref|ZP_09130170.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Desulfovibrio sp. FW1012B]
gi|357580846|gb|EHJ46179.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Desulfovibrio sp. FW1012B]
Length = 288
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 119/284 (41%), Gaps = 25/284 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIA--PHAGYSYSGRAAAYAFG 62
R+ AG +Y + L E +L EA +P + ++A PHAGY YSG A G
Sbjct: 20 RQPVVAGRFYPGAAPALRREAGAFLAEAEVPAEADGPTLLAMVPHAGYVYSGSVAGRTLG 79
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFELMD 121
V LLGP+H + A+ + + P D+P+ ++ +L +A + D
Sbjct: 80 AARLAGT--VLLLGPNHTGRGKRLAVWPSGAWAVPGCDVPVAADLARDLLRAEARLS-PD 136
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EHS+E+ LP+L + ++IVPI V + + A + S S
Sbjct: 137 AAAHLEEHSLEVLLPFL--CVKNPAVRIVPIAVAEPDPDVLRQVAGTMAGVLGQRSEPVS 194
Query: 182 --VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
VSSD H+ V H++ + D + +D + DPD + + E T+
Sbjct: 195 LVVSSDMSHY--------------VPHETAKRRDALALDRVLALDPDGLYRVVREAGITM 240
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
CG P+ + L++ + + Y S + + V YA
Sbjct: 241 CGVLPMVLGLYLAKALGAREAV-LAAYATSGEASGDYNQVVGYA 283
>gi|327400495|ref|YP_004341334.1| hypothetical protein Arcve_0598 [Archaeoglobus veneficus SNP6]
gi|327316003|gb|AEA46619.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Archaeoglobus veneficus SNP6]
Length = 261
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 112/248 (45%), Gaps = 26/248 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R AGS+Y N L L ++R + P + +APHAGY YSGR A +
Sbjct: 1 MRHPVVAGSFYPSNPDSLLAMLAEFVRPS---PDPTIAACVAPHAGYMYSGRTAGKIYSL 57
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I P + + V +LGP+H Y A+ST T + TP+G++ D+E IE + MD
Sbjct: 58 I-PKAETYV-ILGPNHTGYGSMVAVSTDT-WLTPLGEIEPDVEFIEAMPKV--IVDMDEI 112
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDPSNFFS 181
EHS+E+ LP+L V + KIVPI +G + E E + L A+
Sbjct: 113 AHRYEHSIEVQLPFLQYVHKD--FKIVPICLGMQDEETAREVAHEILTAEERTGRKIVVI 170
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H+ + ++ D +D I + D + + + + ++CG
Sbjct: 171 ASSDMHHY--------------LPDRTCREADAKVIDAILSMDVARYYQTIYDMQASVCG 216
Query: 242 RHPISVFL 249
I V +
Sbjct: 217 YGAIGVVM 224
>gi|407039725|gb|EKE39785.1| Memo family protein [Entamoeba nuttalli P19]
Length = 284
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 68/248 (27%), Positives = 101/248 (40%), Gaps = 28/248 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG----VIAPHAGYSYSGRAAAY 59
IR+A AG W++ N LA E+D ++ A K P ++G I+PHAG+ YSG+ A Y
Sbjct: 6 IRKAIGAGRWFSANGNELANEVDHYINNA-FNKLPSIQGKILGCISPHAGFRYSGQTAGY 64
Query: 60 AFGNIDPTSISR-----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
F + S VF+LG SH A+ TPI +D E I
Sbjct: 65 DFAALKRDSEMNGKPDVVFILGFSHSSRFDCAAVMDGKAISTPIATTEIDNEAITMFCEG 124
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+ EHS E LP++ + G +K+V +L+G +E +
Sbjct: 125 RNYLKCFYKPHNGEHSAENELPFVQRALPG--VKVVMVLIGTHKSEVLEQVSQGLQAVCS 182
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
+ SSD H D+ H ++ K+ DK + + E D
Sbjct: 183 KKKMYVIASSDMLH----------DESHSLVEKT----DKETIQLTEKMDIKGLLTR-WS 227
Query: 235 YDNTI-CG 241
Y+N I CG
Sbjct: 228 YENQIYCG 235
>gi|410995995|gb|AFV97460.1| hypothetical protein B649_05730 [uncultured Sulfuricurvum sp.
RIFRC-1]
Length = 266
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 116/251 (46%), Gaps = 30/251 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV---------RGVIAPHAGYSYSGR 55
R+ S AGS+Y ++S + +D + L P+V VI PHAG+ YSG
Sbjct: 4 RKMSVAGSFYPESSIEITTMIDYF--NTILESHPDVAARFDALHGNAVIVPHAGWVYSGF 61
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A AF + + + ++GPSH +++ + Y+TP+G+L +D ++EELK
Sbjct: 62 TANIAFRILSHSLPKTIIVIGPSHKVGFEGVSIADSEFYQTPLGELEIDTALVEELKK-- 119
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY-VD 174
+F L EHS E+ +P++ + +K+V + V A + + Y ++
Sbjct: 120 QFALTTFETAHHEHSTEVQMPFIKHYMDN--VKVVEL----VYAHADPLQISPIIDYLLN 173
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYD---------KKHGVIHKSIEALDKMGMD-IIETGD 224
P +S+D H+ S D + ++H+ EA K+G++ +++ +
Sbjct: 174 QPDTAVVISTDLSHYYSLDEAKELDLICLEAIRHENSTMLHQGCEACGKIGVEAMLDVAN 233
Query: 225 PDAFKKYLLEY 235
+ LL+Y
Sbjct: 234 KRTMEAILLDY 244
>gi|340359794|ref|ZP_08682267.1| hypothetical protein HMPREF9062_1392 [Actinomyces sp. oral taxon
448 str. F0400]
gi|339884084|gb|EGQ73906.1| hypothetical protein HMPREF9062_1392 [Actinomyces sp. oral taxon
448 str. F0400]
Length = 277
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 68/225 (30%), Positives = 99/225 (44%), Gaps = 20/225 (8%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWL---REAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
++ R + AG +Y RL +++ L R +I PHAGY YSG AA
Sbjct: 9 QQTRPPAVAGLFYPAEPARLRADVESMLTGARADADAAPTPPAALIVPHAGYVYSGPTAA 68
Query: 59 YAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
A+ + + RV LLGP+H AL TP+G P+ +EV +++A
Sbjct: 69 LAWARAESLRGRVRRVVLLGPAHRMGVRALALPGHRAMDTPLG--PVTVEVPAQIEALAS 126
Query: 117 FELMDICVD--EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
L+ D AEHS+E+ LP+L V +VP+ VG V+A+ A R F D
Sbjct: 127 SSLVVARPDVHAAEHSLEVQLPFLLTVLPEA--SVVPLAVGRVDADRVAEAIRPFLGGSD 184
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDI 219
VSSD H Y+ + V +IE + + DI
Sbjct: 185 ---TLVVVSSDLSH------YLPQAEARRVDDATIERILALRADI 220
>gi|322419450|ref|YP_004198673.1| hypothetical protein GM18_1934 [Geobacter sp. M18]
gi|320125837|gb|ADW13397.1| protein of unknown function DUF52 [Geobacter sp. M18]
Length = 266
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 71/290 (24%), Positives = 128/290 (44%), Gaps = 34/290 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +YT + RL EEL + + ++ + G+IAPHAGY YSG+ A +
Sbjct: 2 VRQPAVAGRFYTGDPVRLREELSAMVPQG---EAQKAIGIIAPHAGYVYSGKVAGKVYSA 58
Query: 64 ID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ P + V +LGP+H AL+ T + TP+G +P++ + + + D
Sbjct: 59 VRIPDT---VLILGPNHTGAGVAAALAPETQWLTPLGSVPVNRRLSKLILEHAPQVREDA 115
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV---GAVNAENEAMYGRLFAKYVDDPSNF 179
EHS+E+ +P+L G I + + + +++ E + R A Y ++
Sbjct: 116 AAHRFEHSLEVQVPFLQYRNPGVSIAAICLALPDFASISKIGEGI-ARAIAAYGEE--VL 172
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSD H+ S + + D + + +P+ + E + T+
Sbjct: 173 IVASSDMTHYES--------------AAAAKVKDDQALAQLTELNPEGLLRVCREKEITM 218
Query: 240 CGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG P + L +LG S ++ + Y S + S V+YA+ A
Sbjct: 219 CGVIPATALLVAAKVLGATSCRL----VDYATSGEVNGDLASVVAYAALA 264
>gi|432329137|ref|YP_007247281.1| putative dioxygenase [Aciduliprofundum sp. MAR08-339]
gi|432135846|gb|AGB05115.1| putative dioxygenase [Aciduliprofundum sp. MAR08-339]
Length = 271
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 123/292 (42%), Gaps = 37/292 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSP--------EVRGVIAPHAGYSYSG 54
+R + AG +Y K L L+ ++ G + P ++ G + PHAGY +SG
Sbjct: 1 MRYPAVAGQFYPGEKKELQIMLNQLFIHPLGPGEVPSLSPDGPRKIIGGVVPHAGYIFSG 60
Query: 55 RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + + F++ +HY T + TP+G + +D E+ +E+
Sbjct: 61 PVAAHFYAALAKDGFPDTFIIIGPNHYGIGSGVAVTLEDFITPLGRVQIDRELAKEIAR- 119
Query: 115 GKFELMDI--CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
E++DI EHS+E+ LP+L F IK VPI + E G++
Sbjct: 120 ---EMVDIDDYAHRYEHSIEVQLPFLQ--FFKREIKFVPISMLIQEYEIAVELGKIIKDA 174
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
++ SSDF H+ R N Y K D + ++ I GD +
Sbjct: 175 IEGKDVVVIASSDFSHYIPRQN--AYRK------------DALAIEKILKGDTRGLYDVI 220
Query: 233 LEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
++ T+CG PI+ L G +T L+Y S + +D V YA+
Sbjct: 221 AKHSITMCGYGPITAMLTATGGKAT-----LLKYATSGDVQPMQD-VVGYAA 266
>gi|300087467|ref|YP_003757989.1| hypothetical protein Dehly_0349 [Dehalogenimonas
lykanthroporepellens BL-DC-9]
gi|299527200|gb|ADJ25668.1| protein of unknown function DUF52 [Dehalogenimonas
lykanthroporepellens BL-DC-9]
Length = 440
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 112/248 (45%), Gaps = 28/248 (11%)
Query: 42 GVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
GV++PHAGY YSG AA FG I+ + ++GP+H ++ T ++TP+G++
Sbjct: 40 GVVSPHAGYIYSGGVAAAVFGRIE--TADTYVIIGPNHTGMGKPFSIMTVGAWQTPLGEV 97
Query: 102 PLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN 161
+D + + + A K+ D + EHS+E+ LP L + +KIVPI++ +
Sbjct: 98 AIDSSLAQNILANSKYLQEDRTAHQGEHSVEVQLPLLQ--YHQPQLKIVPIVLAVATLDI 155
Query: 162 EAMYGRLFAKYVD---DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
G A+ + D SSD H+ ++ D++ +SI ALD+ +
Sbjct: 156 YREIGASIAQAIQESPDKKVVIVASSDMTHYEAQDIATAKDQRA---IESIIALDEAQL- 211
Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTK 275
+ +++ ++CG P + LG S ++ +RY+ S
Sbjct: 212 ----------LERVIKERISMCGYAPTVTMMAAARALGARSAEL----VRYQTSGDASGD 257
Query: 276 RDSSVSYA 283
+ V YA
Sbjct: 258 YTAVVGYA 265
>gi|289432991|ref|YP_003462864.1| hypothetical protein DehalGT_1048 [Dehalococcoides sp. GT]
gi|452205478|ref|YP_007485607.1| hypothetical protein btf_1189 [Dehalococcoides mccartyi BTF08]
gi|288946711|gb|ADC74408.1| protein of unknown function DUF52 [Dehalococcoides sp. GT]
gi|452112534|gb|AGG08265.1| hypothetical protein btf_1189 [Dehalococcoides mccartyi BTF08]
Length = 438
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 124/287 (43%), Gaps = 33/287 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+R+AS AG +Y + +++ E+ G +E L G + PHAGY YSG A
Sbjct: 2 LRQASAAGRYYPASKEKIRSVITPMEVKGIAKERYL-------GAVMPHAGYPYSGGVAM 54
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+D I+ +LGPSH + A+ + +++TP+G++ +D + + +
Sbjct: 55 AVASRLDLPEIA--IILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHI 112
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D + EHS+E+ +P L + F+ IKIVPI V +E A G A + +
Sbjct: 113 KADPSAHQYEHSIEVQIPIL-QYFKPD-IKIVPITVSFGKSETLADIGYGIASALRETGR 170
Query: 179 --FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
SSD H+ S+ + H D + +D I D + + +
Sbjct: 171 EAIIIASSDMTHYESQSD-AHLK-------------DSLALDAIIKLDAAEMLERIQQNH 216
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
T+CG P++ L + K + + + Y+ S D V YA
Sbjct: 217 ITMCGYAPVAAMLTAVKEMGAK-RARVVAYQTSGDVTHHLDQVVGYA 262
>gi|355570784|ref|ZP_09042054.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methanolinea tarda NOBI-1]
gi|354826066|gb|EHF10282.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Methanolinea tarda NOBI-1]
Length = 263
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 76/288 (26%), Positives = 130/288 (45%), Gaps = 36/288 (12%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLR--EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
R+ + AG +Y + L + L+ + R ++GL + RG+++PHAGY YSG +A A+G
Sbjct: 4 RKCAVAGMFYPRDPDHLEQLLEKFFRNKDSGL----DARGIVSPHAGYPYSGEVSAVAYG 59
Query: 63 NIDPTSISRVFL-LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I P++ S F+ +GPSH + C + ++TP+G + D +L + E+ +
Sbjct: 60 AI-PSTFSGTFIVIGPSHRGF---CTCISLVPWETPLGLVDND----SQLGSLLGVEVDE 111
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF-- 179
+ E+S+E+ +P++ F +IVP+L+G + ++ A +
Sbjct: 112 VSHQYQENSLEVQIPFIKYRFP--RARIVPVLMGDQDLQSAADLAEKIVAGIRKSGRSDV 169
Query: 180 -FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSDF HY K I + A++ + T D + F + L E +
Sbjct: 170 RIVASSDFS---------HYVPKATAIENDLYAIEPL-----LTLDVEGFYRRLRERRVS 215
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG PI+ + + K LRY S T V YA+ A
Sbjct: 216 ACGYGPIATMCLACRSLGAS-EGKLLRYATSGDV-TGDPEVVGYAAIA 261
>gi|229579952|ref|YP_002838351.1| hypothetical protein YG5714_2180 [Sulfolobus islandicus Y.G.57.14]
gi|259646598|sp|C3N8S4.1|Y2180_SULIY RecName: Full=MEMO1 family protein YG5714_2180
gi|228010667|gb|ACP46429.1| protein of unknown function DUF52 [Sulfolobus islandicus Y.G.57.14]
Length = 284
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 35/297 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRG------VIAPHAGYSYSG 54
R + AGS+Y + K+L +++ R +PK + + PHAGY YSG
Sbjct: 3 RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62
Query: 55 RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+++ + S R V +LGP+H + ++TP+G + +D E++ +L
Sbjct: 63 PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
+ +D EHS+E+ LP+L F+ KIVPI++ E E + ++
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQHFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179
Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+P V SSD HYD + K +EA++K I+ D
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDVEAIEK-----IQQLDYKGL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ + D T+CG PI V L + K I L++ S +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280
>gi|154272045|ref|XP_001536875.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150408862|gb|EDN04318.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 137
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 56/85 (65%), Gaps = 7/85 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAGYSYSGRAA 57
R A+HAGSWY+D+ L+ +L+ WL + G+ + P R +IAPHAGY+YSG A
Sbjct: 44 REATHAGSWYSDHEPTLSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPHAGYAYSGPCA 103
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYY 82
A+A+ +D + +FLLGPSHH++
Sbjct: 104 AWAYKALDLSKAKSIFLLGPSHHHH 128
>gi|430813553|emb|CCJ29100.1| unnamed protein product [Pneumocystis jirovecii]
Length = 130
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 56/110 (50%), Gaps = 21/110 (19%)
Query: 148 KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDK------- 200
KIVPILVGA+ E E YG+L A Y+ D N F VS G RF+Y +Y +
Sbjct: 18 KIVPILVGAIGTEQEEWYGQLLAPYIGDTKNRFIVS------GQRFSYTYYAEAGEKGRR 71
Query: 201 --------KHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
+ I+KSIE D+ GM IE G F +YL NTICGR
Sbjct: 72 LWREEEPLRGEPIYKSIERCDREGMQKIEEGSHKGFVEYLSRTRNTICGR 121
>gi|197118676|ref|YP_002139103.1| MEMO-like protein [Geobacter bemidjiensis Bem]
gi|197088036|gb|ACH39307.1| MEMO-like protein [Geobacter bemidjiensis Bem]
Length = 266
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 131/288 (45%), Gaps = 34/288 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR+ + AG +Y+ + ++L EEL + RE + + GV+APHAGY YSG AA +
Sbjct: 2 IRQPAVAGKFYSADPEQLREELSRMIPRE----EPAKAIGVVAPHAGYVYSGGAAGKVYA 57
Query: 63 NID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
++ P + V +LGP+H AL+ + + TP+G +P++ + + + D
Sbjct: 58 AVEVPDA---VIVLGPNHTGMGAAAALAPSGEWLTPLGPVPVNDRLSQLIMKYAPLVRED 114
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
EHS+E+ +P+L + + I + + + ++ + G A + + +
Sbjct: 115 SAAHRFEHSLEVQVPFLQ--YRNPQVSIAALCLSLPDFDSISRVGTGIAHAIAEYGSEVL 172
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSD H+ S +A D++ ++ + DP+ + + D ++
Sbjct: 173 IVASSDMTHYQS--------------AAEAKAKDELALERLANMDPEGLLRVCRDKDISM 218
Query: 240 CGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG P + L LG S+++ + Y S + + V+YA+
Sbjct: 219 CGVIPATAMLVAAKTLGATSSRL----ICYTNSGEVNGDLKAVVAYAA 262
>gi|336121517|ref|YP_004576292.1| hypothetical protein Metok_0534 [Methanothermococcus okinawensis
IH1]
gi|334856038|gb|AEH06514.1| UPF0103 protein [Methanothermococcus okinawensis IH1]
Length = 284
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 126/296 (42%), Gaps = 33/296 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSG 54
IR + AG++Y L E ++ G L G+ K P GV+ PHAGY YSG
Sbjct: 3 IRNPAVAGAFYPAEPNNLIEMIEYCYLHKIGPGTLPSKGVFKKP--VGVVCPHAGYIYSG 60
Query: 55 RAAAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
AA+++ I + + ++GP+H + + +++TP+G++ D E I+ L
Sbjct: 61 PVAAHSYNAISKKTDGTITAVIIGPNHTGLGSGVS-TMKGIWRTPLGNMSSDDEFIDRLW 119
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEAMYGRLFA 170
+D EHS+E+ LP+L ++ KIVPI + + E G A
Sbjct: 120 NECDILDLDETAHIREHSIEVQLPFLQHLELLNAVKFKIVPICMMMQDYETAVEIGYFIA 179
Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
K + + V SSDF H+ + KK ++ K DI+ + + +
Sbjct: 180 KISKELNRRVVVIASSDFTHYEPQ---EIASKKDAIVIK----------DILNMNEKELY 226
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
++ + T+CG P+ + + + K + L Y S S V Y S
Sbjct: 227 AD-VINNNITMCGYGPVIAMIKAMKELNAK-NSRLLAYSTSGDITGNYSSVVGYGS 280
>gi|336324129|ref|YP_004604096.1| hypothetical protein Flexsi_1893 [Flexistipes sinusarabici DSM
4947]
gi|336107710|gb|AEI15528.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Flexistipes sinusarabici DSM 4947]
Length = 262
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 115/253 (45%), Gaps = 27/253 (10%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTP 97
E + +I PHAGY +SG A I P +I LLGP+H + A+ + P
Sbjct: 34 EAKMIIVPHAGYVFSGATAVKTISRIKLPKNI---ILLGPNHTGTGGRIAVYPGGKWSCP 90
Query: 98 IGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV 157
+GD+P++ ++E+L G FE D EHS+E+ LP L K F L IVPI +
Sbjct: 91 LGDVPVNENMVEKLIDKG-FE-SDQPAHVKEHSLEVQLPIL-KYFRDDL-NIVPIAFKGL 146
Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
++ G++ VD+ + VSSDF H++ K +A++++ +
Sbjct: 147 GFDDCRNAGKVLKDLVDETDSMIVVSSDFN---------HFEDIETTNEKDFDAINRI-L 196
Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
D+ D +L + ++CG P V L L K +K E ++ +T D
Sbjct: 197 DL----DSKGLYDTVLSKNISMCGIIPTVVALESL---EKKENLKASLVEHTTSAETSGD 249
Query: 278 SS--VSYASAAAK 288
+S V YA K
Sbjct: 250 ASQVVGYAGIIIK 262
>gi|229581387|ref|YP_002839786.1| hypothetical protein YN1551_0739 [Sulfolobus islandicus Y.N.15.51]
gi|259647115|sp|C3NMP4.1|Y739_SULIN RecName: Full=MEMO1 family protein YN1551_0739
gi|228012103|gb|ACP47864.1| protein of unknown function DUF52 [Sulfolobus islandicus Y.N.15.51]
Length = 284
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 35/297 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLRE----AGLPKSPEVRG------VIAPHAGYSYSG 54
R + AGS+Y + K+L +++ R +PK + + PHAGY YSG
Sbjct: 3 RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62
Query: 55 RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+++ + S R V +LGP+H + ++TP+G + +D E++ +L
Sbjct: 63 PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
+ +D EHS+E+ LP+L F+ KIVPI++ E E + ++
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179
Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+P V SSD HYD + K +EA++K I+ D
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDVEAIEK-----IQQLDYKGL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ + D T+CG PI V L + K I L++ S +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280
>gi|14601616|ref|NP_148156.1| hypothetical protein APE_1771 [Aeropyrum pernix K1]
gi|7388527|sp|Q9YB24.1|Y1771_AERPE RecName: Full=MEMO1 family protein APE_1771
gi|5105461|dbj|BAA80774.1| conserved hypothetical protein [Aeropyrum pernix K1]
Length = 281
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 128/297 (43%), Gaps = 35/297 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYS 53
KIR +HAG++Y + L + ++ G L + G + I PHAGY YS
Sbjct: 2 KIRNPAHAGTFYPATREELVKSIESSFTHPLGPGRLPQRGGGSGEQAIAYIPPHAGYMYS 61
Query: 54 GRAAAYAFGNIDPTSISR----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
G AA+ + ++ S+ R V LLGP+H +L V++TP+G++ +D E
Sbjct: 62 GPIAAHVYYDM---SLGRKPDVVVLLGPNHTGLGLAASLWDEGVWRTPLGEVEVDSEAGR 118
Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
+ D EHS+E+ LP+L ++ G +IVPI+V + R +
Sbjct: 119 LVVEYSGIVAPDDEGHIYEHSLEVQLPFLQYLYGGDF-RIVPIVVLHQTLDISIRIARAY 177
Query: 170 AKYVDDP--SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDA 227
+ ++ + +SD H Y Y++ + D + + IE GDP+A
Sbjct: 178 HRLREENGVNAVLVATSDLNH------YEPYEEN--------KRKDLLLLKAIEEGDPEA 223
Query: 228 FKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
K + + + CG PI+ + +K + L Y S ++ V Y +
Sbjct: 224 VFKTIEAHAISACGPSPIAAAVE--AGRLAGVKPRVLAYANSGDVTGEKAWVVGYPA 278
>gi|218884668|ref|YP_002429050.1| putative dioxygenase [Desulfurococcus kamchatkensis 1221n]
gi|254800053|sp|B8D6F2.1|Y1357_DESK1 RecName: Full=MEMO1 family protein DKAM_1357
gi|218766284|gb|ACL11683.1| Predicted dioxygenase [Desulfurococcus kamchatkensis 1221n]
Length = 283
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 120/301 (39%), Gaps = 43/301 (14%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSP------EVRGVIAPHAGYSYS 53
K R AG +Y D L ++ + G P SP G +APHAGY YS
Sbjct: 2 KKRSPIVAGYFYPDKPGELRSVIEWSFKHGIGPGKPPSPSDIPATNSIGYVAPHAGYIYS 61
Query: 54 GRAAAYAFG----NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
G AA+ + N P +I +LG +H ++ V++TP+GDL +D E+
Sbjct: 62 GPVAAHVYFDMALNKKPDTI---VILGTNHTGLGRPVSVYPEGVWETPLGDLVVDSEIGR 118
Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
+ + D EHS+E+ LP++ ++ G +KI PI++G + R
Sbjct: 119 LIVENSEIAEFDEYAHLEEHSIEVQLPFIVYIY-GEDVKITPIVIGIHTPD----IARDL 173
Query: 170 AKYVDDPSN------FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
AK + + S SSDF H+ H+ D M +D I
Sbjct: 174 AKSIYEASMSTGKRIIVIASSDFNHYEP--------------HEETSRKDSMAIDRILKL 219
Query: 224 DPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
D D +L D +ICG I + K ++ L+Y S V YA
Sbjct: 220 DTDGLYNVILHNDISICGPGGIMTLMEYTKKLGGKAQL--LKYATSGDTSGDYSHVVGYA 277
Query: 284 S 284
+
Sbjct: 278 A 278
>gi|218667506|ref|YP_002425064.1| hypothetical protein AFE_0572 [Acidithiobacillus ferrooxidans ATCC
23270]
gi|218519719|gb|ACK80305.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC
23270]
Length = 264
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 114/275 (41%), Gaps = 36/275 (13%)
Query: 15 TDNSKRLAE-ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA--YAFGNIDPTSISR 71
T+ + LA E DG A PK+ +I PHAGY YSG AA YA I R
Sbjct: 12 TEVERLLARAEQDGEAASAPWPKA-----IIVPHAGYIYSGAVAASGYALLAKGRGHIRR 66
Query: 72 VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSM 131
V LLGP+H AL +TP+G + +D +E L + M EH++
Sbjct: 67 VVLLGPAHRLPFRGLALPGVQAMQTPLGTVAVDQAGVEALAGLPEVREMP-AAHAQEHAL 125
Query: 132 EMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGS 191
E+ LP++ +V + +VP++VG + A R+ K +SSD H
Sbjct: 126 EVQLPFIQEVLGD--VSVVPLVVGDARPDEVA---RVLEKLWGGEETVIVISSDLSH--- 177
Query: 192 RFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM 251
Y Y + + H ++E + + F ++++ CG PI+ L +
Sbjct: 178 ---YHPYAEARAIDHHTVEEILR-------------FDPTPIDHEQA-CGATPINGLLPV 220
Query: 252 LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ + + S RD+ V YA+ A
Sbjct: 221 --ARKHHLHPRLVGLCNSGDTAGSRDAVVGYAAVA 253
>gi|227831071|ref|YP_002832851.1| hypothetical protein LS215_2219 [Sulfolobus islandicus L.S.2.15]
gi|259646604|sp|C3MJQ3.1|Y2219_SULIL RecName: Full=MEMO1 family protein LS215_2219
gi|227457519|gb|ACP36206.1| protein of unknown function DUF52 [Sulfolobus islandicus L.S.2.15]
Length = 284
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 35/297 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLRE----AGLPKSPEVRG------VIAPHAGYSYSG 54
R + AGS+Y + K+L +++ R +PK + + PHAGY YSG
Sbjct: 3 RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62
Query: 55 RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+++ + S R V +LGP+H + ++TP+G + +D E++ +L
Sbjct: 63 PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
+ +D EHS+E+ LP+L F+ KIVPI++ E E + ++
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTLEIAEFLADAIYN 179
Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+P V SSD HYD + K +EA++K I+ D
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDVEAIEK-----IQQLDYKGL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ + D T+CG PI V L + K I L++ S +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280
>gi|406896361|gb|EKD40670.1| hypothetical protein ACD_74C00263G0001 [uncultured bacterium]
Length = 267
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 105/245 (42%), Gaps = 23/245 (9%)
Query: 42 GVIAPHAGYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
V++PHAGY YSG A F ++ P I LLGP+HH Y +L + P+G+
Sbjct: 38 AVVSPHAGYIYSGSVAGETFAAVEIPKDI---VLLGPNHHGYGAPVSLMHTGSWNMPLGE 94
Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
+ ++ + + L A D EHS+E+ +P+L + F + + P+++ ++ +
Sbjct: 95 IQINTVLAQTLLAQTDLIEPDTLAHRFEHSLEVQVPFL-QYFRPDM-TLTPMVISHLSFK 152
Query: 161 NEAMYGRLFAKYVD--DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
G A+ + + SSD H+ SR S A D + +
Sbjct: 153 ACQEVGETLAEAIRKYNKPVLIVASSDMTHYESRV--------------SATAKDSLAIH 198
Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
+E DP+ +L+ T+CG P ++ L + + +RY S + +
Sbjct: 199 RLEALDPEGLYNTVLDKGITMCGIMPTTIALVAALRLGAT-QARLIRYTDSGEASGDTNQ 257
Query: 279 SVSYA 283
V YA
Sbjct: 258 VVGYA 262
>gi|148658050|ref|YP_001278255.1| hypothetical protein RoseRS_3955 [Roseiflexus sp. RS-1]
gi|148570160|gb|ABQ92305.1| protein of unknown function DUF52 [Roseiflexus sp. RS-1]
Length = 405
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 112/273 (41%), Gaps = 23/273 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
R+ + AG Y N L LDG++ G P P RGV++PH Y GR A +
Sbjct: 114 RQPALAGQSYPANPADLRRLLDGYVAAVGQVTPAPPTGRGVLSPHIDYERGGRVYAQVWQ 173
Query: 63 NIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK-ATG-KFE 118
+ V L+G H Y+P+ T Y TP G LP D +++ L A G +
Sbjct: 174 RAAEMVRAAEIVILIGTDH--YSPEPITLTRQRYATPFGVLPTDAAIVDALAHALGEEAA 231
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA----VNAENEAMYGRLFA---- 170
EHS+E+ +L V ++PILVG+ + E+ A RL A
Sbjct: 232 FAGELYHRVEHSLELAAVWLQYVRGNPPCPVIPILVGSFARYIGGEDPATDRRLEALMTA 291
Query: 171 --KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+ +D +S D H G F + ++ D+ +D + GD F
Sbjct: 292 LRQVMDTRRTLVVISGDMSHVGPAFGGAPLSESD---KAALRRSDEQVIDRMRAGDAAGF 348
Query: 229 KKYLLEYDN--TICGRHPISVFLHMLGNCSTKI 259
+ + + + ICG P V L ++G ++
Sbjct: 349 FRVIADTGDRQNICGLPPTYVALRLMGAVEGEL 381
>gi|436841594|ref|YP_007325972.1| conserved protein of unknown function [Desulfovibrio hydrothermalis
AM13 = DSM 14728]
gi|432170500|emb|CCO23871.1| conserved protein of unknown function [Desulfovibrio hydrothermalis
AM13 = DSM 14728]
Length = 270
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 127/287 (44%), Gaps = 32/287 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL----REAGLPKSPEVRGVIAPHAGYSYSGR--AAA 58
R+ AG +YTDN ++L +EL ++ ++A P R V+ PHAGY +SG
Sbjct: 3 RKPVVAGRFYTDNPQQLKQELKQFIGPLNKKAAAPYD---RLVMLPHAGYMFSGEPCGKT 59
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
A N+ PT + LLGP+H ++ ++ P G L +D ++ ++L +G
Sbjct: 60 LAGANLAPT----IILLGPNHTGLGSPLSVWDCGSWEFPGGKLDVDEDLAQQLIDSGTGF 115
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
+ + EHS+E+ +P+L + I+IVP+ V + G A +D S
Sbjct: 116 VENQAAHSREHSLEVIVPFLH--YLNPEIRIVPVCVSESAPKVLHKAGEAIADIIDAYSK 173
Query: 179 FFS--VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
S VSSD H ++ DK + +D M ++ I DP +
Sbjct: 174 PVSIVVSSDMSH------FIKADKA--------KKMDSMALEAIIRMDPADLYSIVSSNQ 219
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
++CG P+++ ++ + + Y S + +S V+YA
Sbjct: 220 ISMCGVLPMTMGMYAAKKLGANAG-RLIEYTNSGKVTGDFESVVAYA 265
>gi|15922382|ref|NP_378051.1| hypothetical protein ST2062 [Sulfolobus tokodaii str. 7]
gi|23822365|sp|Q96YW6.1|Y2062_SULTO RecName: Full=MEMO1 family protein STK_20620
gi|15623171|dbj|BAB67160.1| hypothetical protein STK_20620 [Sulfolobus tokodaii str. 7]
Length = 284
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 130/300 (43%), Gaps = 39/300 (13%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG---------VIAPHAGYSYS 53
IR + AG++Y +L ++++ +L G K P+V I PHAGY YS
Sbjct: 2 IRLPAVAGAFYEGEEDKLKKQIEWSFLHPLGPGKIPQVPPQKSKRNNLFFIVPHAGYMYS 61
Query: 54 GRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+A+ + I V +LGP+H +L +KTP+G++ +D ++ +L
Sbjct: 62 GPVAAHAYYYLASEGIPDTVIILGPNHTGLGSYVSLWPKGKWKTPLGEIEIDEQIAMDL- 120
Query: 113 ATGKFELMDICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRL 168
+ E++DI DE EHS+E+ +P+L F+ KIVPI++ E
Sbjct: 121 -VRESEVIDI--DEKAHLYEHSIEVQVPFLQYFFDSK-TKIVPIVIMMQTPEISEYLAEG 176
Query: 169 FAKYVD---DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDP 225
+K + D SSD H+ K+IE D M ++ I + D
Sbjct: 177 ISKIMQKYKDKDIVVIASSDMNHYEPH-------------EKTIEK-DNMAIEKILSLDY 222
Query: 226 DAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ E D T CG P+ L + + K + L++ S + S V Y S
Sbjct: 223 KGLFNVVEEKDVTACGFGPMMTVLMLAKKFNKKPYV--LKHATSGDTSGDKSSVVGYLSV 280
>gi|386393059|ref|ZP_10077840.1| putative dioxygenase [Desulfovibrio sp. U5L]
gi|385733937|gb|EIG54135.1| putative dioxygenase [Desulfovibrio sp. U5L]
Length = 288
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 119/284 (41%), Gaps = 25/284 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIA--PHAGYSYSGRAAAYAFG 62
R+ AG +Y + L E+ +L EA P + ++A PHAGY YSG A G
Sbjct: 20 RQPVVAGRFYPGAAPALRREVGAFLAEAEAPAEADGPTLLAMVPHAGYVYSGSVAGRTLG 79
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE-LKATGKFELMD 121
V LLGP+H + A+ + + P D+P+ ++ + L+A + D
Sbjct: 80 AARLADT--VLLLGPNHTGRGKRLAVWPSGAWAVPGCDVPVAADLARDFLRAEARLS-PD 136
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EHS+E+ LP+L + ++IVPI V + E A + S S
Sbjct: 137 AAAHREEHSLEVLLPFL--CVKNPAVRIVPIAVAEPDPEALRQVAGAMAGVLGQRSEPVS 194
Query: 182 --VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
VSSD H+ V H++ + D + +D + DPD + + E T+
Sbjct: 195 IVVSSDMSHY--------------VPHETAKRRDALALDRVLALDPDGLYRVVREAGITM 240
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
CG P+ + LH+ + + Y S + + V YA
Sbjct: 241 CGVLPMVLGLHLAKALGARQAV-LAAYATSGEASGDYNQVVGYA 283
>gi|95928913|ref|ZP_01311659.1| protein of unknown function DUF52 [Desulfuromonas acetoxidans DSM
684]
gi|95135258|gb|EAT16911.1| protein of unknown function DUF52 [Desulfuromonas acetoxidans DSM
684]
Length = 267
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 90/188 (47%), Gaps = 8/188 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR+ + +G +YTD+ L ++++ +L P P GV+ PHAGY YSG A
Sbjct: 2 IRQPAVSGQFYTDDPYELRQQVELFLT-TDQPGKP-AYGVMMPHAGYMYSGAIAGETLAG 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+D V LLGP+H CAL + +KTP+G++P+ + + L ++
Sbjct: 60 VDVPDT--VLLLGPNHRGIGHPCALYSQGSWKTPLGEVPIAETMAQRLLDAVPHLTGEMQ 117
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPSNFFS 181
+ EHS+E+ LP+L + + I+P+++G ++ G + D
Sbjct: 118 SHQGEHSLEVLLPFLQ--VKNPNLSIIPLMLGPLSFSILQQLGAGIGAVLKEDGGRVLIV 175
Query: 182 VSSDFCHW 189
SSD H+
Sbjct: 176 ASSDMTHY 183
>gi|148706475|gb|EDL38422.1| RIKEN cDNA 0610016J10, isoform CRA_a [Mus musculus]
Length = 77
Score = 75.9 bits (185), Expect = 2e-11, Method: Composition-based stats.
Identities = 39/76 (51%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCK 273
M IIE DP +F YL +Y NTICGRHPI V L+ L + FL Y QSSQC+
Sbjct: 1 MSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCR 60
Query: 274 TKRDSSVSYASAAAKV 289
+ +DSSVSYA+ A V
Sbjct: 61 SWQDSSVSYAAGALTV 76
>gi|385805502|ref|YP_005841900.1| putative dioxygenase [Fervidicoccus fontis Kam940]
gi|383795365|gb|AFH42448.1| putative dioxygenase [Fervidicoccus fontis Kam940]
Length = 287
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 133/292 (45%), Gaps = 24/292 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD-------GWLREAGLPKSPEVR--GVIAPHAGYSYSG 54
+R AG +Y ++L E++ G +EA S G IAPHAGY YSG
Sbjct: 6 VRDPVVAGEFYELRKEKLLEQIRWAFLHPIGPGKEAIFSNSITKNNFGYIAPHAGYIYSG 65
Query: 55 RAAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+ + NI F++ GP+H + +L+ ++TP+G + +D+E+ + L
Sbjct: 66 PVAAHTYYNISLEKKPDTFIIIGPNHTGLGAQVSLAPWKQWRTPLGLINVDIELRDYLIT 125
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
K + D EHS+E+ LP+L +F+ +I+PI+ + + + ++ + +
Sbjct: 126 KSKVLVPDYNAHLYEHSIEVQLPFLQYIFQKDF-QILPIV---LMRQTPKVVEKIAEELL 181
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
D + + DF S + HY+ K ++A ++I+ D + F YL
Sbjct: 182 DATEH---LGRDFTIIASS-DMSHYEPYDVAKRKDLKAF-----EMIKKRDINLFYNYLE 232
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ ++CG + + L + + + L+Y S +D+ V Y SA
Sbjct: 233 TENVSMCGPGGVMI-LMTISKMKNGREPEILKYATSGDLTGDKDAVVGYLSA 283
>gi|167379219|ref|XP_001735045.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165903085|gb|EDR28738.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 284
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 116/289 (40%), Gaps = 36/289 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG----VIAPHAGYSYSGRAAAY 59
+R+A AG W+ N LA E+D ++ A L K P ++G I+PHAG+ YSG A Y
Sbjct: 6 VRKAVGAGRWFNANGNELANEIDHYINNA-LNKLPPIQGKILGCISPHAGFRYSGPTAGY 64
Query: 60 AFGNIDPTSISR-----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
F + S VF+LG SH A+ TPI +D E I
Sbjct: 65 DFAALKRDSEINGKPDVVFILGFSHSSRFDYAAIMDGKAISTPIATTEIDNEAITIFCEG 124
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+ EHS E LP++ + G +K+V +L+G +E +
Sbjct: 125 RNYLKCFYKPHNGEHSAENELPFVQRALPG--VKVVMVLIGTHKSEVFEQVAQGLQAVAS 182
Query: 175 DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE 234
+ SSD H D+ H ++ K+ DK + + E D
Sbjct: 183 KKKMYVIASSDMLH----------DESHNLVEKT----DKETILLTEKMDIKGLLTR-WS 227
Query: 235 YDNTI-CGRH---PISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS 279
Y+N I CG P ++ +G C+ I + S+ TKR +S
Sbjct: 228 YENQIYCGITAIIPTMMYAQSIG-CAKAITLDL----TDSETVTKRMNS 271
>gi|404493275|ref|YP_006717381.1| MEMO-like protein [Pelobacter carbinolicus DSM 2380]
gi|77545333|gb|ABA88895.1| MEMO-like protein [Pelobacter carbinolicus DSM 2380]
Length = 267
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 116/281 (41%), Gaps = 23/281 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R + AG +Y + K L ++ +L +A +S G++ PHAGY +SG A FG +
Sbjct: 3 RMPAVAGQFYPADPKSLRSMVETYLEKAT--QSHPAIGLMVPHAGYVFSGAIAGQTFGCV 60
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
D S+V ++GP+H Y AL + TP+G++P+ + + + + D
Sbjct: 61 DIP--SKVLVIGPNHTGYGESLALFAKGSWVTPLGEVPIAEGLADRVLQAHPRLMADDLA 118
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPSNFFSV 182
EHS+E+ +P+L ++IVP+ + V E G + + +
Sbjct: 119 HRFEHSLEVQIPFLQ--VRAPDVQIVPLCLAPVPYEELLALGNAIGQVLAAEKEPVLLVA 176
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGR 242
SSD HY+ K D M + + DP+ + + + ++CG
Sbjct: 177 SSDM---------THYEPGAAARKK-----DHMALKHVCDLDPEGLYRTVASENISMCGV 222
Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
V L C K + + Y S + + V YA
Sbjct: 223 VSTVVMLAA-AQCLGAQKGRLVCYGNSGEVTGDQSEVVGYA 262
>gi|385776635|ref|YP_005649203.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|323475383|gb|ADX85989.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
Length = 284
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 129/297 (43%), Gaps = 35/297 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRG------VIAPHAGYSYSG 54
R + AGS+Y + K+L +++ R +PK + + PHAGY YSG
Sbjct: 3 RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62
Query: 55 RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+++ + S R V +LGP+H + ++TP+G + +D E++ EL
Sbjct: 63 PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMEL 120
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
+ +D EHS+E+ LP+L F+ KIVPI++ E E + ++
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179
Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+P V SSD HYD + K EA++K I+ D
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDEEAIEK-----IQQLDYKGL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ + D T+CG PI V L + K I L++ S +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280
>gi|239907803|ref|YP_002954544.1| hypothetical protein DMR_31670 [Desulfovibrio magneticus RS-1]
gi|239797669|dbj|BAH76658.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length = 292
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 119/288 (41%), Gaps = 31/288 (10%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR+ AG +Y + LA E +L EAG P +APHAG YSG A G
Sbjct: 24 SIRQPVVAGRFYPADPAALARETAAYLAEAGAPSDKPTLLAMAPHAGAVYSGPVAGKTLG 83
Query: 63 --NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
N+ T + LLGP+H + A+ + ++ P D+P++ + E L A
Sbjct: 84 AANLADT----LLLLGPNHTGRGGRLAVWSDGAWRIPGRDVPVEAALAEALLAAAPGLSP 139
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRLFAKYVDDP 176
D EHS+E+ LP++ V G +IVP+ V V AE + A + P
Sbjct: 140 DRAAHLGEHSLEVLLPFVTAVRPG--CRIVPVAVAESRPGVLAETAQAMAEVLAAW-SGP 196
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG-MDIIETGDPDAFKKYLLEY 235
+ VSSD H+ K+ + + ALD G +D++
Sbjct: 197 VSII-VSSDMSHY---LPEAAAKKRDSLALARVLALDPQGLLDVVR------------RE 240
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
D ++CG P+++ L ++ + Y S + D V YA
Sbjct: 241 DISMCGVLPMTLGL-LIAQALGATSARLAAYATSGEVSGDYDQVVGYA 287
>gi|302341544|ref|YP_003806073.1| hypothetical protein Deba_0100 [Desulfarculus baarsii DSM 2075]
gi|301638157|gb|ADK83479.1| protein of unknown function DUF52 [Desulfarculus baarsii DSM 2075]
Length = 267
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 72/137 (52%), Gaps = 4/137 (2%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+ + AG +Y ++ LA E+D L + P P + VI PHAGY +SGR A FG
Sbjct: 2 VRQPAVAGRFYPAQAEALAAEVDRLL-QTDQPPQPAL-AVICPHAGYVFSGRVAGQVFGQ 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
+ + RV L+GP+H AL + ++TP+G + LD ++ + + + D
Sbjct: 60 V--SVPRRVLLMGPNHSGMGRPAALMSRGQWRTPLGLINLDDQLGQAIMDRAAYVEQDDL 117
Query: 124 VDEAEHSMEMHLPYLAK 140
EHS+E+ P+L +
Sbjct: 118 AHRNEHSLEVQTPFLQR 134
>gi|333910057|ref|YP_004483790.1| hypothetical protein [Methanotorris igneus Kol 5]
gi|333750646|gb|AEF95725.1| UPF0103 protein [Methanotorris igneus Kol 5]
Length = 286
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/303 (26%), Positives = 135/303 (44%), Gaps = 44/303 (14%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRGV-------IAPHAGYSYSGR 55
IR + AG++Y + L E ++ +L G + P G+ + PHAGY YSG
Sbjct: 2 IRHPAVAGAFYPADPNELIETIEYCYLHNLGPKELPSEGGIFEKPIGLVCPHAGYIYSGP 61
Query: 56 AAAYAFG----NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+++ +D +LGP+H A + ++KTP+GDL +D E + L
Sbjct: 62 IAAHSYNALSKRVDVEEEITAIILGPNHTGLGTGVA-TMKGIWKTPLGDLEIDDEFADRL 120
Query: 112 KATGKFELMDICVDEA----EHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMY 165
+ ++MD+ DE EHS+E+ LP+L + ++ K VPI + + E A
Sbjct: 121 WK--ECDIMDL--DETSHLHEHSIEVQLPFLQHLSMLNIAKFKFVPISMLLQDYETAADI 176
Query: 166 GRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM--DIIE 221
G AK + + + S+DF H+ + KK + K+I +D+ G+ D++
Sbjct: 177 GYFIAKIAKELNRRVVIIASTDFTHYEPQEVAA---KKDAIAIKNILNMDEEGLYSDVV- 232
Query: 222 TGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVS 281
Y+ ++CG P+ + + K + K L Y S S V
Sbjct: 233 ------------NYNISMCGCGPVMAMIKAIKLLGGK-EAKLLAYATSGDITKDYSSVVG 279
Query: 282 YAS 284
YAS
Sbjct: 280 YAS 282
>gi|15897032|ref|NP_341637.1| hypothetical protein SSO0066 [Sulfolobus solfataricus P2]
gi|284173376|ref|ZP_06387345.1| hypothetical protein Ssol98_01802 [Sulfolobus solfataricus 98/2]
gi|384433532|ref|YP_005642890.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|7388414|sp|P95994.1|Y066_SULSO RecName: Full=MEMO1 family protein SSO0066
gi|1707828|emb|CAA69536.1| orf c05005 [Sulfolobus solfataricus P2]
gi|13813197|gb|AAK40427.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
gi|261601686|gb|ACX91289.1| protein of unknown function DUF52 [Sulfolobus solfataricus 98/2]
Length = 284
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 35/297 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG----------VIAPHAGYSYSG 54
R + AGS+Y + K+L +++ R P+ + I PHAGY YSG
Sbjct: 3 RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQSYEKKKRDNLFFIVPHAGYIYSG 62
Query: 55 RAAAYAFGNI-DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+++ + V +LGP+H + ++TP+G + +D EV+ +L
Sbjct: 63 PVAAHSYYYLASEGKPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKVDEEVLMQLVM 122
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-----NEAMYGRL 168
+ ++ EHS+E+ LP+L F+ + KIVPI++ E +A+Y ++
Sbjct: 123 ESEVIDLEEKSHLYEHSIEVQLPFLQYFFDDNF-KIVPIVIMMQTPEIAEFLADAIY-KV 180
Query: 169 FAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
KY D + SSD HYD + K EA++K I+ D
Sbjct: 181 IQKYSDKDIVVLA-SSDMN---------HYDPHEITMKKDEEAIEK-----IQQLDYRGL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ + D T+CG PI V L + K I L++ S +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVSLILAKKLGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280
>gi|284998567|ref|YP_003420335.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284446463|gb|ADB87965.1| protein of unknown function DUF52 [Sulfolobus islandicus L.D.8.5]
Length = 284
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 129/297 (43%), Gaps = 35/297 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRG------VIAPHAGYSYSG 54
R + AGS+Y + K+L +++ R +PK + + PHAGY YSG
Sbjct: 3 RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62
Query: 55 RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+++ + S R V +LGP+H + ++TP+G + +D E++ +L
Sbjct: 63 PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
+ +D EHS+E+ LP+L F+ KIVPI++ E E + ++
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179
Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+P V SSD HYD + K +E ++K I+ D
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDVEVIEK-----IQQLDYKGL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ + D T+CG PI V L + K I L++ S +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280
>gi|410721675|ref|ZP_11361005.1| putative dioxygenase [Methanobacterium sp. Maddingley MBC34]
gi|410598421|gb|EKQ52994.1| putative dioxygenase [Methanobacterium sp. Maddingley MBC34]
Length = 280
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 127/298 (42%), Gaps = 32/298 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL--REAGLPKSP-------EVRGVIAPHAGYSYSG 54
IR+ + AG +Y + L + + W E G + P ++G+I+PHAGYSYSG
Sbjct: 2 IRKPAVAGMFYESDEDSLRKRIK-WCYQHELGPGRLPGKMGNKRSIKGLISPHAGYSYSG 60
Query: 55 RAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA ++ + + V +L P+H + T + TP+G++ +D + EL
Sbjct: 61 PVAACSYMELAEDGMPETVVILCPNHTGIGSGLSTMTEGSWLTPLGEVEIDHQFASELLN 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
D EHS E+ LP+L ++ ++VP+ + + E G +
Sbjct: 121 YYPLLDDDPSAHLKEHSCEVQLPFLQEISSD--FQMVPVCMMMQDLETSRELGEAISHTA 178
Query: 174 DDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
V S+DF H+ H+ EA DK ++ I D +
Sbjct: 179 RKLGRDLVVIASTDFTHYQP--------------HEVAEAQDKKVLEAISAMDELEMMRT 224
Query: 232 LLEYDNTICGRHPISVFLHML-GNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+ E++ T+CG P++ + G +T++ + L+Y S S V YASA K
Sbjct: 225 IQEFNVTMCGYGPVAATIEASKGMGATEVSV--LQYATSGDTGGDYTSVVGYASAIFK 280
>gi|149193804|ref|ZP_01870902.1| hypothetical protein CMTB2_01928 [Caminibacter mediatlanticus TB-2]
gi|149135757|gb|EDM24235.1| hypothetical protein CMTB2_01928 [Caminibacter mediatlanticus TB-2]
Length = 263
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 78/149 (52%), Gaps = 8/149 (5%)
Query: 41 RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
+ +I PHAG+ YSG A +A+ T+ RV ++GPSH + + + VY+TP G
Sbjct: 46 KALIVPHAGWMYSGFTANFAYRIASNTNPKRVVVIGPSHRFPIKGISTTLEDVYETPCGL 105
Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
LP+D+E +EL F++ ++ + EHS E+ +P++ F I +V ++ G E
Sbjct: 106 LPIDIEFAKEL--IKNFDVQNLEMVHQEHSTEVQMPFIYHYFGK--IPVVELIYGDYAPE 161
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
+ KY + ++ +SSD H+
Sbjct: 162 KL----KEIIKYAIEDNSLVVISSDLSHY 186
>gi|218781935|ref|YP_002433253.1| hypothetical protein Dalk_4100 [Desulfatibacillum alkenivorans
AK-01]
gi|218763319|gb|ACL05785.1| protein of unknown function DUF52 [Desulfatibacillum alkenivorans
AK-01]
Length = 282
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 114/251 (45%), Gaps = 33/251 (13%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVR--GVIAPHAGYSYSGRAAAY 59
+IRRA GSWY + E+++ ++ A LP+ PEV+ G I PHAG+ YSG+ AA
Sbjct: 2 EIRRAQFMGSWYPASENECREKIEDFI--ADLPQVKPEVKRIGGIVPHAGWVYSGKIAAQ 59
Query: 60 AFGNI---DPTSISRVFL--LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
+ +P + VF LGP H + + + ++TP G L + +V +L
Sbjct: 60 VIAALKKNNPPDVIAVFGMHLGPRHPNF-----IMASGAWETPFGPLAIAEDVAGQLAKG 114
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN---AENEAMYGRLFAK 171
+FE+ ++++E+ LP++ F ++IVPI V V+ A EA
Sbjct: 115 FRFEIESPDRHVQDNTIELQLPFIKYFFPD--VRIVPIGVPPVSRSLAIGEA-------- 164
Query: 172 YVDDPSNFFSV-----SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
+VD +F S+D H+G+ + + I DK ++ IE DP
Sbjct: 165 FVDIARHFGYTAKVIGSTDLTHYGANYGFSPVGSGPKAIQWVKNENDKKLVERIEAMDPQ 224
Query: 227 AFKKYLLEYDN 237
+E N
Sbjct: 225 GVIDQAMENSN 235
>gi|147921497|ref|YP_684688.1| hypothetical protein LRC400 [Methanocella arvoryzae MRE50]
gi|110620084|emb|CAJ35362.1| conserved hypothetical protein [Methanocella arvoryzae MRE50]
Length = 264
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 126/294 (42%), Gaps = 49/294 (16%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAFG 62
+RR + AG +Y + + + P+ + G++ PHAGY YSG AA +
Sbjct: 1 MRRPAVAGQFYPGSETEVRSLIARLAPGTSEPRDKIKACGIVVPHAGYVYSGGVAADVYS 60
Query: 63 NID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+I+ PT + LLGPSH+ A+ST ++TP+G + +D + + L+
Sbjct: 61 SIEGAPTFV----LLGPSHYGVGSPVAVSTQP-WETPLGKVEVDQDFV---------RLL 106
Query: 121 DICVD------EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKY 172
D +D ++EHS+E+ +P+L F+ +IVPI +G + ++ E + A
Sbjct: 107 DGIIDRDEIAHQSEHSIEVQIPFLQYFFKD--FRIVPICLGMQDYDSVREVATELITALE 164
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYD---KKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
D SSDF H Y H KK + K IE LD F
Sbjct: 165 KYDGQVVLVASSDFTH------YEHISVAKKKDMTLIKDIEKLDV-----------PTFY 207
Query: 230 KYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
L + T CG PI+ + + G K + +RY S T V YA
Sbjct: 208 DDLYRLNATACGYGPIATTMMVCGARGAS-KAQLVRYATSGDV-TGDSQVVGYA 259
>gi|149173820|ref|ZP_01852449.1| hypothetical protein PM8797T_05265 [Planctomyces maris DSM 8797]
gi|148847350|gb|EDL61684.1| hypothetical protein PM8797T_05265 [Planctomyces maris DSM 8797]
Length = 1051
Score = 74.7 bits (182), Expect = 4e-11, Method: Composition-based stats.
Identities = 68/286 (23%), Positives = 128/286 (44%), Gaps = 28/286 (9%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+IR A AG++Y + R+ +L G L + P ++ PHAG+ YSG+ AA
Sbjct: 784 EIRPAGVAGTFYPADPARMTAQL-GELFHDQVDAQPWAAAMV-PHAGWKYSGKIAARVLN 841
Query: 63 NID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I P++I ++GP H A++ ++ P G+L D + ++L +D
Sbjct: 842 RIQLPSTI---IVIGPKHTREGVDWAVAPHQAWQLPGGNLNSDRALAQKLAEQIPGLELD 898
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN----AENEAMYGRLFAKYVDDPS 177
+EH++E+ LP + ++ K++ I++G+ N E A R+ + + P
Sbjct: 899 AAAHRSEHAIEVELPLIQRLAPD--AKVIGIVIGSGNLPRCEEFAAGLARVIQEMPEPP- 955
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN 237
+SSD H+ + DK++ LD++ ++ + + DP+ + + EY
Sbjct: 956 -LLLISSDMNHFAT-------DKEN-------RRLDELALEKMRSLDPEGLLETVREYHI 1000
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
++CG P + + L +I+ + Y S V YA
Sbjct: 1001 SMCGVLPAVIVMKTLQKMGKLNQIEQVGYATSGDVTGDSSRVVGYA 1046
>gi|330834015|ref|YP_004408743.1| hypothetical protein Mcup_0150 [Metallosphaera cuprina Ar-4]
gi|329566154|gb|AEB94259.1| conserved hypothetical protein [Metallosphaera cuprina Ar-4]
Length = 263
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 110/246 (44%), Gaps = 22/246 (8%)
Query: 44 IAPHAGYSYSGRAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP 102
I PHAGY YSG A+ ++ + V +LGP+H Y ++ ++TP+G
Sbjct: 32 IVPHAGYIYSGPIASNSYYYLVQEGKPDLVIILGPNHTGYGSWVSIWPEGEWRTPLGSYN 91
Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENE 162
+D ++++EL + + +D EHS+E+ LP+L F ++I+PI+V E
Sbjct: 92 VDQKLVKELVSVSEVIDIDEKAHLYEHSIEVQLPFLQYFFND--LRILPIVVLMQTPEIA 149
Query: 163 AMYGRLFAKYVDDPSNFFSV---SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDI 219
++++ S+ V SSD HYD H+ A D + +
Sbjct: 150 ESIAEGIWRFINRHSDLDIVVLASSDLN---------HYDP-----HEVTMAKDDLVIRK 195
Query: 220 IETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS 279
+E D + + E D T+CG PI V L + + + I LR+ S + S
Sbjct: 196 VEELDHKGLYRVIEEQDVTVCGYAPIMVSLILAKKMNKRPYI--LRHATSGDTSGDKSSV 253
Query: 280 VSYASA 285
V Y +A
Sbjct: 254 VGYLAA 259
>gi|158520316|ref|YP_001528186.1| hypothetical protein Dole_0299 [Desulfococcus oleovorans Hxd3]
gi|158509142|gb|ABW66109.1| protein of unknown function DUF52 [Desulfococcus oleovorans Hxd3]
Length = 282
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 100/226 (44%), Gaps = 9/226 (3%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR A++AGSWY D + ++ +L E L P E G I PHAG+ YSG A A
Sbjct: 3 IRPATYAGSWYPDTAAECRRQIKAFLAEKTLTPPDREPCGAIVPHAGWIYSGGIACRALA 62
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ + + ++ H Y + + TP+G L +D + +L F+ +
Sbjct: 63 LLKGSQVPDTIVIYGHHLYPKSSPRILSEGSMATPLGMLAVDEALAAKLMDRFTFQAESV 122
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
++++E+ LP++ +F +KIVP VG GR + D V
Sbjct: 123 HHFAPDNTIELQLPFIRYLFGD--VKIVP--VGVPPDAMAVEIGRFTVQAARDLGLSVKV 178
Query: 183 --SSDFCHWGSRFNYMHYDKKHGVIHKSIEA-LDKMGMDIIETGDP 225
S+D H+G+ F HY H+ + A D+ +D + T DP
Sbjct: 179 VGSTDLTHYGAHFGLSHYGLGEAA-HRQVRANEDRRIIDRMLTLDP 223
>gi|340624697|ref|YP_004743150.1| hypothetical protein GYY_07765 [Methanococcus maripaludis X1]
gi|339904965|gb|AEK20407.1| hypothetical protein GYY_07765 [Methanococcus maripaludis X1]
Length = 284
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 125/298 (41%), Gaps = 30/298 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
R AG +Y L E ++ +L G + P RG +++PHAGY YSG
Sbjct: 3 RNPVVAGMFYPAEYHELLEMIEYCYLNPRGPKELPSKRGKYTKPLGIVSPHAGYIYSGPV 62
Query: 57 AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + I + +LGP+H A T +KTP GD+ +D E + L
Sbjct: 63 AAHGYKKISENISGEITAIILGPNHTGLGSGIATMKGT-WKTPFGDMEIDNEFADRLWKE 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
MD EHS+E+ LP+L + + ++ K VPI + + E+ G + AK
Sbjct: 122 CDILDMDENSHLREHSIEVQLPFLKHLEDLNIAKFKFVPISMMMQDYESCIDVGYVIAKV 181
Query: 173 VDDPS--NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
+ + + S+DF H+ + KK V+ K DI+E D + F
Sbjct: 182 TRELNRKSVIIASTDFSHYEPQ---EQASKKDAVVIK----------DILELNDEEIFTD 228
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
++ ++ ++CG P+ + + + K L Y S V YAS K
Sbjct: 229 -VVTHNISMCGYGPVIAMVKAMKDLGAKTSY-LLYYSTSGDVTKDYSEVVGYASLLIK 284
>gi|282164310|ref|YP_003356695.1| hypothetical protein MCP_1640 [Methanocella paludicola SANAE]
gi|282156624|dbj|BAI61712.1| conserved hypothetical protein [Methanocella paludicola SANAE]
Length = 263
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 85/289 (29%), Positives = 130/289 (44%), Gaps = 35/289 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG +Y +++ L ++ L + GV+ PHAGY YSG AA + +
Sbjct: 1 MRNPAVAGMFYPGSNEALRRQIAELAPSVSL--KLDAMGVVVPHAGYMYSGGVAAKVYAS 58
Query: 64 ID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I+ PT I +LGP H + A+ST +KTP+G + +D + I +L G + D
Sbjct: 59 IEGAPTFI----ILGPRHSWEGSAVAVSTVP-WKTPLGIVDVDHDFI-DLLPPGIID-HD 111
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN-EAMYGRLFAKYVDDPSNFF 180
+ EHS+E+ +P+L F+ +IVPI +G + E + G + P
Sbjct: 112 ETAHQREHSLEVQVPFLQYFFKD--FRIVPIALGLQDYETVREVAGEITMAIEKYPKKVV 169
Query: 181 SV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
V SSDF H+ + K H +I K IE LD G F + + T
Sbjct: 170 IVASSDFTHYEP--VEVAKRKDHILIEK-IEKLDVPG-----------FYDEVANLNATC 215
Query: 240 CGRHPISVFLHMLGNCSTK--IKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CG PI+ M+ +C + K + L Y S R+ V YA A
Sbjct: 216 CGYGPIAA---MMLSCKRRGAKKAELLAYATSGDVTGDRE-VVGYAGLA 260
>gi|268323602|emb|CBH37190.1| conserved hypothetical protein, Memo-like protein family
[uncultured archaeon]
Length = 266
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 72/285 (25%), Positives = 120/285 (42%), Gaps = 27/285 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR--GVIAPHAGYSYSGRAAAYAF 61
+RR AG++Y L +L+ AG+ + + R G + PHAGY +S AA A+
Sbjct: 1 MRRPIVAGAFYEGERAGLERQLNDCF--AGITREEDERVIGAVVPHAGYMHSCSVAASAY 58
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ I +LGP+H A+ST T ++TP+G + +D +E L + +D
Sbjct: 59 AKLPSADI--FVILGPNHQGIGSLVAVSTDT-WETPLGAVDIDEAFVEALPK--RIIDLD 113
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPSNF 179
EH++E+ LP+L +F VPI + + + G A + D
Sbjct: 114 ENAHRYEHAIEVQLPFLQFLFHEQF-TFVPICMCLSDEDTAKEVGNELADTIAKTDKKVV 172
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSDF H+ + G+ D+ ++ I+ D F + + T
Sbjct: 173 MLASSDFTHY----------EPEGIARDK----DEYVIEAIKELDVSKFYNRVYTRNVTA 218
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
CG PI+ +H + + + + Y S R S V YA+
Sbjct: 219 CGIGPIAAVMHAAKRLGAR-EGELVTYATSGDSTGDRSSVVGYAA 262
>gi|317052576|ref|YP_004113692.1| hypothetical protein Selin_2421 [Desulfurispirillum indicum S5]
gi|316947660|gb|ADU67136.1| protein of unknown function DUF52 [Desulfurispirillum indicum S5]
Length = 264
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 74/287 (25%), Positives = 119/287 (41%), Gaps = 35/287 (12%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR+++ AG +Y DN + + LD + A P + V+ PHAG+ YSG+ AA
Sbjct: 2 TIRQSAVAGMFYPDNPREIRAFLDHHMAAATAPV--DACAVVVPHAGWIYSGQLAAQVLA 59
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ V ++GP+H ++ +++TP+GD+P+ L A D
Sbjct: 60 RVRVPDT--VVIIGPNHTGLGSAISIFPPGLWRTPVGDIPVSPHA-RTLAARLGLTC-DT 115
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
+ EHS+E+ LP L + ++I+ I V + E+
Sbjct: 116 AAHQREHSLEVLLPMLH--YRNPSLQIIAITVAGGSPEDARELAHTLEAEFQGEDVLLLA 173
Query: 183 SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDN----T 238
SSD H+ S+ D+K+ + + I ALD G LLE N +
Sbjct: 174 SSDMNHFESQAV---SDRKNALAMERIAALDSPG---------------LLEVVNRHRIS 215
Query: 239 ICGRHPISVFLH--MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
+CG P++V L +L S + Y S+ DS V YA
Sbjct: 216 MCGAFPVAVALENALLRGAS---HASLVGYTDSAAVSGDHDSVVGYA 259
>gi|227828313|ref|YP_002830093.1| hypothetical protein M1425_2054 [Sulfolobus islandicus M.14.25]
gi|229585543|ref|YP_002844045.1| hypothetical protein M1627_2134 [Sulfolobus islandicus M.16.27]
gi|238620505|ref|YP_002915331.1| hypothetical protein M164_2061 [Sulfolobus islandicus M.16.4]
gi|385773993|ref|YP_005646560.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|259646573|sp|C3MZ11.1|Y2054_SULIM RecName: Full=MEMO1 family protein M1425_2054
gi|259646574|sp|C4KJ99.1|Y2061_SULIK RecName: Full=MEMO1 family protein M164_2061
gi|259646592|sp|C3N060.1|Y2134_SULIA RecName: Full=MEMO1 family protein M1627_2134
gi|227460109|gb|ACP38795.1| protein of unknown function DUF52 [Sulfolobus islandicus M.14.25]
gi|228020593|gb|ACP56000.1| protein of unknown function DUF52 [Sulfolobus islandicus M.16.27]
gi|238381575|gb|ACR42663.1| protein of unknown function DUF52 [Sulfolobus islandicus M.16.4]
gi|323478108|gb|ADX83346.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 284
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 129/297 (43%), Gaps = 35/297 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRG------VIAPHAGYSYSG 54
R + AGS+Y + K+L +++ R +PK + + PHAGY YSG
Sbjct: 3 RLPAVAGSFYESDPKKLKMQIEWSFRHNIGPRDIPKQTYEKKKRDNLFFVVPHAGYIYSG 62
Query: 55 RAAAYAFGNIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL 111
AA+++ + S R V +LGP+H + ++TP+G + +D E++ +L
Sbjct: 63 PVAAHSYYYL--VSEGRPDVVIILGPNHTGLGSYVSAWPKGEWETPLGSVKIDEEILMQL 120
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE-NEAMYGRLFA 170
+ +D EHS+E+ LP+L F+ KIVPI++ E E + ++
Sbjct: 121 VKESEVIDLDEKSHLYEHSIEVQLPFLQYFFDDDF-KIVPIVIMMQTPEIAEFLADAIYN 179
Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+P V SSD HYD + K EA++K I+ D
Sbjct: 180 VMQKNPDKDIVVLASSDMN---------HYDPHEITVKKDEEAIEK-----IQQLDYKGL 225
Query: 229 KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ + D T+CG PI V L + K I L++ S +DS V Y +A
Sbjct: 226 YEVVEGKDVTLCGYGPIMVNLILAKKFGKKAYI--LKHATSGDTSGPKDSVVGYLAA 280
>gi|134045236|ref|YP_001096722.1| hypothetical protein MmarC5_0191 [Methanococcus maripaludis C5]
gi|166227494|sp|A4FWD2.1|Y191_METM5 RecName: Full=MEMO1 family protein MmarC5_0191
gi|132662861|gb|ABO34507.1| protein of unknown function DUF52 [Methanococcus maripaludis C5]
Length = 284
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 125/298 (41%), Gaps = 30/298 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
R AG +Y L E ++ +L G + P RG +++PHAGY YSG
Sbjct: 3 RNPVVAGMFYPAEYHELLEMIEYCYLSPRGPKELPSKRGNYTKPLGIVSPHAGYIYSGPV 62
Query: 57 AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + I V +LGP+H + + ++KTP GD+ +D E + L
Sbjct: 63 AAHGYKKISENINGEVTAIILGPNHTGLGSGIS-TMKGIWKTPFGDMEIDNEFADRLWKE 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
+D EHS+E+ LP+L + + ++ K VPI + + E G AK
Sbjct: 122 CDVLDLDENSHLREHSIEVQLPFLKHLEDLNIAKFKFVPICMMMQDYETSMDVGYFIAKV 181
Query: 173 VDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
+ + S+DF H+ S+ + KK ++ K DI+E D + F
Sbjct: 182 AKEMNRKIIIIASTDFSHYESQES---ASKKDALVIK----------DILELKDEEIFTD 228
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
++ ++ ++CG P+ + + + K L Y S V YAS K
Sbjct: 229 -VVTHNISMCGYGPVIAMIKAMKDLGAK-NSNLLYYSTSGDVTKDYSEVVGYASILVK 284
>gi|330508374|ref|YP_004384802.1| hypothetical protein MCON_2560 [Methanosaeta concilii GP6]
gi|328929182|gb|AEB68984.1| Protein of unknown function (DUF52) [Methanosaeta concilii GP6]
Length = 237
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 73/227 (32%), Positives = 111/227 (48%), Gaps = 27/227 (11%)
Query: 38 PEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTP 97
P V GV+ PHAGY YSGR AA + + P + V +LGP+HH ALS + ++T
Sbjct: 8 PIVGGVV-PHAGYVYSGRIAAMVYSRL-PKRETYV-ILGPNHHGLGSPVALSRDS-WRTA 63
Query: 98 IGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV 157
+G + DLE+ + L TG D + EHS+E+ +P+L K FE KI+ I +G
Sbjct: 64 LGVVTPDLELADLL--TGSIIDHDEAAHQHEHSIEVQIPFLQKRFED--FKILAISMGMQ 119
Query: 158 NAENEAMYGRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
+ E G A+ +V SSDF H+ R + V + IEA+ +
Sbjct: 120 DEETAVEVGEEIARAALKLGRKCTVIASSDFTHYEPR------EVARKVDSQVIEAI--L 171
Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKI 259
MDI P + + + ++ + CG PI+ + +LG S K+
Sbjct: 172 NMDI-----PQIYSR-VYRFNASSCGIGPIAATIAASRLLGASSGKL 212
>gi|452203979|ref|YP_007484112.1| hypothetical protein dcmb_1170 [Dehalococcoides mccartyi DCMB5]
gi|452111038|gb|AGG06770.1| hypothetical protein dcmb_1170 [Dehalococcoides mccartyi DCMB5]
Length = 438
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 122/287 (42%), Gaps = 33/287 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+R+AS AG +Y + +++ E+ G +E L G + PHAGY YSG A
Sbjct: 2 LRQASAAGRYYPASKEKIRSVITPMEVKGIAKERYL-------GAVMPHAGYPYSGGVAM 54
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+D I+ +LGPSH + A+ + +++TP+G++ +D + + +
Sbjct: 55 AVGSRLDLPEIA--IILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHI 112
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D + EHS+E+ +P L + F+ IKIVPI V +E A G A + +
Sbjct: 113 KADPSAHQYEHSIEVQIPIL-QYFKPD-IKIVPITVSFGKSETLADIGYGIASALRETGR 170
Query: 179 --FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
SSD H+ + + H D + +D I D + +
Sbjct: 171 EAIIIASSDMTHYEPQAD-AHLK-------------DSLALDAIIKLDAAEMLERIQANH 216
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
T+CG P++ L + K + + + Y+ S D V YA
Sbjct: 217 ITMCGYAPVAAMLTAVKELGAK-RARVVAYQTSGDITHHLDQVVGYA 262
>gi|147669761|ref|YP_001214579.1| hypothetical protein DehaBAV1_1122 [Dehalococcoides sp. BAV1]
gi|146270709|gb|ABQ17701.1| protein of unknown function DUF52 [Dehalococcoides sp. BAV1]
Length = 438
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 122/287 (42%), Gaps = 33/287 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+R+AS AG +Y + +++ E+ G +E L G + PHAGY YSG A
Sbjct: 2 LRQASAAGRYYPASKEKIRSVITPMEVKGIAKERYL-------GAVMPHAGYPYSGGVAM 54
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+D + +LGPSH + A+ + +++TP+G++ +D + + +
Sbjct: 55 AVASRLDLPETA--IILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHI 112
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN 178
D + EHS+E+ +P L + F+ IKIVPI V +E A G A + +
Sbjct: 113 KADPSAHQYEHSVEVQIPIL-QYFKPD-IKIVPITVSFGKSETLADIGYGIASALRETGR 170
Query: 179 --FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
SSD H+ S+ + H D + +D I D + +
Sbjct: 171 EAIIIASSDMTHYESQAD-AHLK-------------DSLALDAIIKLDAAEMLERIQANH 216
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
T+CG P++ L + K + + + Y+ S D V YA
Sbjct: 217 ITMCGYAPVAAMLTAVKELGAK-RARVVAYQTSGDITHHLDQVVGYA 262
>gi|124027421|ref|YP_001012741.1| hypothetical protein Hbut_0536 [Hyperthermus butylicus DSM 5456]
gi|123978115|gb|ABM80396.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
Length = 297
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 122/293 (41%), Gaps = 30/293 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELD---------GWLREAGLPKSPEVRGVIAPHAGYSYSGR 55
R + AG +Y +++ L +++ G++ EA + E G + PHAGY YSG
Sbjct: 18 RLPAVAGLFYEADAEALKAQIEAAFRHPLGPGYVPEASSERRRESLGFLVPHAGYMYSGP 77
Query: 56 AAAYAFGNIDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
A +A+ + + +++ GP+H ++ T + TP+G+L +D E+ L
Sbjct: 78 IAVHAYAKMALEGSAETYVIVGPNHTGLGASVSVYPGTAWSTPLGELQVDTELARVLVKA 137
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+ +D EHS+E+ LP+L +F ++I+P++V E + V+
Sbjct: 138 SSYAELDEKAHLYEHSVEVQLPFLQYLFNAR-VRILPVVVYEQTPEVARDIAEALVEAVE 196
Query: 175 DPSN---FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
F + S N HY+ + K DK +D I + + + +
Sbjct: 197 KTGRDVVFIATS----------NLTHYEPYEAAVEK-----DKKVIDAIISLNAEELYRI 241
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ ++CG ++ ++ L+Y S + S V YA+
Sbjct: 242 VTSTPVSMCGPAAAMALIYY-ARLRDARGVELLKYATSGDVAGDKKSVVGYAA 293
>gi|452851477|ref|YP_007493161.1| conserved protein of unknown function [Desulfovibrio piezophilus]
gi|451895131|emb|CCH48010.1| conserved protein of unknown function [Desulfovibrio piezophilus]
Length = 268
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 102/248 (41%), Gaps = 23/248 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R AG +Y + RL LDG+L A P + ++ PHAGY YSG G
Sbjct: 3 REPVVAGRFYEADPTRLNTVLDGYLSGAVRRTEPTILAMV-PHAGYVYSGALCGQTLGT- 60
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
+ S V LLGP+H + +L + P G++P+D + L + D
Sbjct: 61 -ASLASTVLLLGPNHTGRGKQFSLWPEGEWAIPGGNVPIDTSLATVLLESNSMLSADTEA 119
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV---DDPSNFFS 181
EHS+E+ LP+L + VPI + A E GR + + P +
Sbjct: 120 HIGEHSLEVVLPFLKHMNPD--TTFVPIALSANRFEAMEEIGRTIGDVIQAFERPVSII- 176
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
VSSD H+ + H+ + +D M ++ DP A + E ++CG
Sbjct: 177 VSSDMSHY--------------ISHEKAQEMDSMALEACVELDPLALFTTVREKRISMCG 222
Query: 242 RHPISVFL 249
P++ L
Sbjct: 223 VLPMTCGL 230
>gi|297619510|ref|YP_003707615.1| hypothetical protein Mvol_0985 [Methanococcus voltae A3]
gi|297378487|gb|ADI36642.1| protein of unknown function DUF52 [Methanococcus voltae A3]
Length = 286
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 119/261 (45%), Gaps = 47/261 (18%)
Query: 42 GVIAPHAGYSYSGRAAAYAFGNI-----DPTSISRVFLLGPSHHYYTPKCALSTATVYKT 96
G+I PHAGY YSG A+Y++ + D T+I LL P+H + A+S +++T
Sbjct: 51 GLICPHAGYEYSGITASYSYYELSKRLGDETTI---ILLAPNHTGMGARVAISNE-IWET 106
Query: 97 PIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILV 154
P+GD+ DLE+I+EL + FEL DI E+S+E+ LP++ ++ KIVPI
Sbjct: 107 PLGDIKPDLELIDELISHDLFELDDIA-HLQEYSVEVQLPFIKHLELLNISKFKIVPICC 165
Query: 155 GAVNAENEAMYGRLFAKYVDDPSNF-----FSVSSDFCHWGSRFNYMHYDKKHGVIHKSI 209
++ ++ Y + A + + S+DF H+ + + D K + K+I
Sbjct: 166 QSMEYDD---YVNMGASIYESATKLNKKVVIIASTDFSHYEPQETTIKKDAK---VIKNI 219
Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDNTI--CGRHPISVFLHMLGNCSTKI----KIKF 263
+D+ + Y YDN + CG + L C+ K+ +
Sbjct: 220 LEMDEKAI-------------YEAIYDNNVSMCGYGQVITML-----CALKLFGAESAEL 261
Query: 264 LRYEQSSQCKTKRDSSVSYAS 284
L Y S + S V Y S
Sbjct: 262 LNYMTSGDLTQEYTSVVGYGS 282
>gi|389852931|ref|YP_006355165.1| hypothetical protein Py04_1519 [Pyrococcus sp. ST04]
gi|388250237|gb|AFK23090.1| hypothetical protein Py04_1519 [Pyrococcus sp. ST04]
Length = 291
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 122/290 (42%), Gaps = 8/290 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
R + AG +Y + + L E L+ + + G + +APHAGY +SG A+ +
Sbjct: 3 RYPAVAGQFYPEGDE-LIEMLENFFSDLGEEGDRRRITAGVAPHAGYVFSGYTASRTYKA 61
Query: 64 IDPTSISRVFLL-GPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I VF++ GP+H A+ ++TP+G + +D ++ + +D
Sbjct: 62 IYEDGFPEVFVIFGPNHTGMGSPIAVYPEGEWETPLGSVKVDSQMARTIVELSGIADLDD 121
Query: 123 CVDEAEHSMEMHLPYLAKVFE--GHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
+ EHS+E+ LP++ + E G IVPI +G + + GR +
Sbjct: 122 LAHKYEHSIEVQLPFIQYLAELAGKDFMIVPITLGIQDEDVANTLGRAVYEAAQSLGRDV 181
Query: 181 SV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
V S+DF H+G + Y+ + + I + D + I D + + E ++T
Sbjct: 182 VVIASTDFMHYGYFYGYVPFTGRAEDIPNMVREWDMRVIRRILDFDVEGMFDEIREMNHT 241
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+C + S ++ + L Y S + D+ V YAS K
Sbjct: 242 MC-GPGGVGVGIVYSKLSGAVEAELLHYTTSYEVSKSTDAIVGYASIVMK 290
>gi|23466270|ref|NP_696873.1| hypothetical protein BL1725 [Bifidobacterium longum NCC2705]
gi|239621655|ref|ZP_04664686.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis CCUG 52486]
gi|23327019|gb|AAN25509.1| widely conserved hypothetical protein with duf51 [Bifidobacterium
longum NCC2705]
gi|239515530|gb|EEQ55397.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis CCUG 52486]
Length = 596
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 120/282 (42%), Gaps = 64/282 (22%)
Query: 4 IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
IR ++ AGS+Y D ++L ++L+ L AG+P R VI PHAG
Sbjct: 48 IRPSAVAGSFYPADRTALKQLINQQLDYGRKLLQQLEPTL-PAGVP-----RAVIVPHAG 101
Query: 50 YSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
Y YSG AAA A+ ++ S++R ++GP+H A STA ++TP+G +P+D
Sbjct: 102 YIYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 161
Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
L V E L+ A +++ EH++E+ +P+L V G + IV
Sbjct: 162 ERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPDLTIV 220
Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
P+ G + G + P +SSD H+ H+
Sbjct: 221 PLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 263
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
ALD + D +L + CG +PI+ L +L
Sbjct: 264 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 298
>gi|419847558|ref|ZP_14370727.1| AMMECR1 family protein [Bifidobacterium longum subsp. longum 1-6B]
gi|386410763|gb|EIJ25536.1| AMMECR1 family protein [Bifidobacterium longum subsp. longum 1-6B]
Length = 599
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 120/282 (42%), Gaps = 64/282 (22%)
Query: 4 IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
IR ++ AGS+Y D ++L ++L+ L AG+P R VI PHAG
Sbjct: 51 IRPSAVAGSFYPADRTALKQLINQQLDYGRKLLQQLEPTL-PAGVP-----RAVIVPHAG 104
Query: 50 YSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
Y YSG AAA A+ ++ S++R ++GP+H A STA ++TP+G +P+D
Sbjct: 105 YIYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 164
Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
L V E L+ A +++ EH++E+ +P+L V G + IV
Sbjct: 165 ERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPDLTIV 223
Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
P+ G + G + P +SSD H+ H+
Sbjct: 224 PLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 266
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
ALD + D +L + CG +PI+ L +L
Sbjct: 267 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 301
>gi|124485503|ref|YP_001030119.1| hypothetical protein Mlab_0680 [Methanocorpusculum labreanum Z]
gi|124363044|gb|ABN06852.1| protein of unknown function DUF52 [Methanocorpusculum labreanum Z]
Length = 279
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 80/154 (51%), Gaps = 14/154 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R ++ +G +Y N + L + L L A + G++ PHAGY YSG+ AAY + I
Sbjct: 17 RLSTLSGRFYPKNEQEL-DALLSALFAATETSVSDPYGILVPHAGYVYSGKTAAYGYAAI 75
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTAT---VYKTPIGDLPLDLEVIEELKATGKFELMD 121
P LLGPSH L T+T +++TP+G++ D IE L A + + +
Sbjct: 76 SPAFNGTFVLLGPSH------AGLETSTADMIWETPLGNVFPDSAFIEALSA--QIPVRN 127
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
+ E+S+E+ LP++ F +IVPIL+G
Sbjct: 128 DLISAEENSLEVQLPFIRYRFPKA--RIVPILMG 159
>gi|150402556|ref|YP_001329850.1| hypothetical protein MmarC7_0632 [Methanococcus maripaludis C7]
gi|166228803|sp|A6VGX3.1|Y632_METM7 RecName: Full=MEMO1 family protein MmarC7_0632
gi|150033586|gb|ABR65699.1| protein of unknown function DUF52 [Methanococcus maripaludis C7]
Length = 284
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 125/298 (41%), Gaps = 30/298 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
R AG +Y + L E ++ +L G + P RG +++PHAGY YSG
Sbjct: 3 RNPVVAGMFYPADYHELLEMIEYCYLSPRGPKELPSKRGNYTKPLGIVSPHAGYIYSGPV 62
Query: 57 AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + I V +LGP+H + + ++KTP GD+ +D E ++L
Sbjct: 63 AAHGYKKISENVSGEVTAIILGPNHTGLGSGIS-TMKGIWKTPFGDMEIDNEFADKLWKE 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
+D EHS+E+ LP+L + E ++ K VPI + + E G AK
Sbjct: 122 CDVLDLDENSHLREHSIEVQLPFLKHLEELNIAKFKFVPICMMMQDYETSMDVGYFIAKV 181
Query: 173 VDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
+ + S+DF H+ + + KK ++ K DI+E D + F
Sbjct: 182 AKEMNRKIIIIASTDFSHYEPQES---ASKKDALVIK----------DILELNDEEIFTD 228
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
++ ++ ++CG P+ + + + L Y S V YAS K
Sbjct: 229 -VVTHNISMCGYGPVIAMVKAMKDLGAT-NSNLLYYSTSGDVTKDYSEVVGYASILVK 284
>gi|154249943|ref|YP_001410768.1| hypothetical protein Fnod_1264 [Fervidobacterium nodosum Rt17-B1]
gi|154153879|gb|ABS61111.1| protein of unknown function DUF52 [Fervidobacterium nodosum
Rt17-B1]
Length = 267
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 127/286 (44%), Gaps = 31/286 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--PEVRGVIAPHAGYSYSGRAAAYAFG 62
R+ +G +Y +L + + ++ E K + G+I PHAGY YSG+ A G
Sbjct: 3 RKPVVSGKFYPGTPIQLEKTCEAFVGERATNKDFFSQPVGLILPHAGYVYSGKTA--GMG 60
Query: 63 NIDPTSISR---VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
T R + + GP+H Y ++ + +++TP+G++ ++ E+ ++L
Sbjct: 61 IKKATEYGRPKNIIIFGPNHTGYGELVSVWSEGIWQTPLGNIEVNSEIADKL-IDNTVIF 119
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI-LVGAVNAENEAMYGRLFAKYVDDPSN 178
D EHS+E+ LP L F G KI+P+ ++ + + +L + P
Sbjct: 120 SDEMAHLYEHSIEVQLPLLQYAF-GEF-KIIPVCMMDQRLSTVSKIVDKLKQIIKEYPDT 177
Query: 179 FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
SSDF HYD + K DK+ ++ I GD + + + +++ T
Sbjct: 178 LVVASSDFN---------HYDPHEITLEK-----DKLAIEKILEGDIEGLYERIKKHNIT 223
Query: 239 ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+CG P++V + N ++ + + S++ +V YAS
Sbjct: 224 MCGPGPVAVVRSLFSN------VELVYHTTSAEFSQDYSYTVGYAS 263
>gi|15606223|ref|NP_213601.1| hypothetical protein aq_890 [Aquifex aeolicus VF5]
gi|7388441|sp|O67039.1|Y890_AQUAE RecName: Full=MEMO1 family protein aq_890
gi|2983422|gb|AAC07005.1| hypothetical protein aq_890 [Aquifex aeolicus VF5]
Length = 267
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 116/246 (47%), Gaps = 21/246 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP-EVRGVIAPHAGYSYSGRAAAYAF 61
++R + AG++Y + + L + +D L PK + + ++ PHAGY YSG+ A +
Sbjct: 2 RVREPAVAGTFYPKDKEELNKLMD--LLCGFEPKEKIKPKAILVPHAGYIYSGKTACEVY 59
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I+ +V LLGP+H ++ + ++TP G + +D E+ E++ + D
Sbjct: 60 KRIEIP--EKVVLLGPNHTGLGKPISVYSGDAWETPYGVVEIDGELREKILKY-PYANPD 116
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EHS+E+ LP+L + + KI+PI+V V E +GR + + +
Sbjct: 117 EYAHLYEHSLEVQLPFLQR-YARREFKILPIVVTFVEYEVAKDFGRFLGEVLKEEDALIV 175
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+SSD H+ KK ++ ++E L+ + FK ++Y+ T+CG
Sbjct: 176 ISSDMSHY---VPAEEARKKDEILISAMERLNTEEL---------YFKA--VQYNITMCG 221
Query: 242 RHPISV 247
P V
Sbjct: 222 VVPAVV 227
>gi|242280444|ref|YP_002992573.1| hypothetical protein Desal_2982 [Desulfovibrio salexigens DSM 2638]
gi|242123338|gb|ACS81034.1| protein of unknown function DUF52 [Desulfovibrio salexigens DSM
2638]
Length = 270
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 123/286 (43%), Gaps = 30/286 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGW--LREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
R+ AG +Y D+ +L +EL + + AG + P R ++ PHAGY +SG
Sbjct: 3 RQPVVAGRFYPDSPDQLRKELQMYSGTQNAGGDR-PVDRLIMVPHAGYMFSGEPCGKTLA 61
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ VFLLGP+H ++ ++ P G L +D E+ +L + + +
Sbjct: 62 QSKLSDT--VFLLGPNHTGLGSPLSVWDKGSWEFPGGKLDVDEELAAKLIESRTGFVSNE 119
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS- 181
EHS+E+ +P+L + +KIVP+ V + N G A+ + D S S
Sbjct: 120 KAHSREHSLEVIVPFLH--YLNPAMKIVPVCVSEASPSNLRKAGEAIAEIMKDSSQPISM 177
Query: 182 -VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
VSSD H+ V + LD M ++ I DP + ++C
Sbjct: 178 VVSSDMSHF--------------VSADQAKKLDSMALEAIIRMDPSDLYSIVSSNQISMC 223
Query: 241 GRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
G P+++ + LG S K+ ++Y S Q + V+YA
Sbjct: 224 GVLPMTMAMFAAKKLGATSGKL----VQYTNSGQATGDYERVVAYA 265
>gi|310778291|ref|YP_003966624.1| hypothetical protein [Ilyobacter polytropus DSM 2926]
gi|309747614|gb|ADO82276.1| protein of unknown function DUF52 [Ilyobacter polytropus DSM 2926]
Length = 277
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 123/294 (41%), Gaps = 38/294 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR-----GVIAPHAGYSYSGRAAA 58
+R+ AGS+Y ++ + + E + L++ E++ G ++PHAGY Y R A
Sbjct: 3 VRKCGVAGSFYPNSPEEIKEIFEEALQKESDNIKTELKNKNIIGGVSPHAGYVYCVREAV 62
Query: 59 YAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
+ F + V L+ P+H Y ++ + ++TP G + LD E +EL
Sbjct: 63 HLFEILREKGEKYDTVVLVNPNHTGYGEAVSIDSNKAWETPFGSIELDSEFGDELSFP-- 120
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
++ EHS E+ LPYL F KI+PI + + + A+ + +
Sbjct: 121 ---VEPMAQRFEHSGEVMLPYLY-YFIKKGFKILPICM----MRQDLKTAKNIAEKIKNA 172
Query: 177 SN------FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
S +SSDF H+ S D +IE+L KM D F+
Sbjct: 173 SEKLNRKILILISSDFTHFHSSEEGAKLDS------YAIESLLKM--------DSGEFQD 218
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+LE D +ICG PI V L K++ L+ S + D V Y S
Sbjct: 219 RILEKDISICGMGPIMVLLEYSKMILKNPKLEILKRGHSGEV-YPSDEVVDYVS 271
>gi|118580112|ref|YP_901362.1| hypothetical protein Ppro_1691 [Pelobacter propionicus DSM 2379]
gi|118502822|gb|ABK99304.1| protein of unknown function DUF52 [Pelobacter propionicus DSM 2379]
Length = 267
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 4/134 (2%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ AG +Y N +L +L + +A PK VRG+IAPHAGY YSG A A+G I
Sbjct: 3 RQPVVAGQFYPGNPDQLRNDLARMIPQAD-PKR-RVRGIIAPHAGYVYSGAIAGQAYGAI 60
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
+ + ++GP+HH AL A + TP+G + ++ + + + F +D
Sbjct: 61 QIPATA--LIIGPNHHGAGDPAALFPAGQWLTPLGPVTINSRLNDLIATHVPFVHLDSLA 118
Query: 125 DEAEHSMEMHLPYL 138
EHS+E+ LP+L
Sbjct: 119 HRFEHSLEVQLPFL 132
>gi|374302050|ref|YP_005053689.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
gi|332554986|gb|EGJ52030.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Desulfovibrio africanus str. Walvis Bay]
Length = 269
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 116/283 (40%), Gaps = 25/283 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RR AG +Y + +L + +L +AG + PHAGY +SG A G
Sbjct: 3 RRPVVAGQFYPGQADQLESMVGEYLSKAGAKAQERTILAMTPHAGYVFSGSVAGQTLGRA 62
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
+ V LLGP+H + A+ + +++ P G L ++ + + + + D
Sbjct: 63 NLAKT--VLLLGPNHTGMGSRFAVWSDGIWELPGGGLNVNEGLAKAIIKADARLVADQTA 120
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVG----AVNAENEAMYGRLFAKYVDDPSNFF 180
EHS+E+ LP+L + IVPI V V E A GR+ A + P +
Sbjct: 121 HAREHSLEVVLPFLRAIDPE--TTIVPIAVAEPRLEVLLEVGAAIGRVLASW-KHPVSMV 177
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
VSSD H+ V H+ + D M ++ I +P + + E ++C
Sbjct: 178 -VSSDMSHY--------------VTHEEAKRWDSMALEPILALNPSEAYRVVRESGISMC 222
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
G P++V L + K + + Y S + V YA
Sbjct: 223 GIMPMTVGLTAAESLEAS-KAELVAYATSGETSGDYTQVVGYA 264
>gi|86159016|ref|YP_465801.1| hypothetical protein Adeh_2594 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85775527|gb|ABC82364.1| protein of unknown function DUF52 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 265
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 84/163 (51%), Gaps = 9/163 (5%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG++Y + LA E+DGWL G P +P + GV+ PHAGY YSG A F
Sbjct: 2 VREPAVAGAFYDARAATLAAEVDGWLSAGGAP-APAL-GVLVPHAGYVYSGAVAGATFAR 59
Query: 64 IDPTSISRVFLLGPSHHYYT-PKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ R +LGP+H AL ++TP+G +P+D E+ L A D
Sbjct: 60 V--AVPPRAIVLGPNHTGLGHAAAALWPGGAWRTPLGTVPVDAELTAALAAAPGVA-GDR 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN-AENEAM 164
EHS+E+ +P+L + G + I + +G ++ AE EA+
Sbjct: 117 LAHLREHSLEVEVPFLQRARPG--VAIAALCLGPLSFAECEAL 157
>gi|91203620|emb|CAJ71273.1| conserved hypothetical protein [Candidatus Kuenenia
stuttgartiensis]
Length = 267
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 123/285 (43%), Gaps = 29/285 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR+ + AGS+Y+ + RL E+D ++ + +S G ++PHAGY YSG A + +
Sbjct: 2 IRQPAVAGSFYSGDKSRLQHEIDTFIIKDCEKQS--ALGAVSPHAGYMYSGSIAGSLYSH 59
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
I T V +L P+H Y ++ + TP G++ ++ E ++EL + D
Sbjct: 60 I--TIPDLVVILSPNHTGYGKPYSIWPGGSWITPFGEIAVNEEAVDELVNSCHLIERDKE 117
Query: 124 VDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFFS 181
EH+ E+ +P++ + +IV + + + ++ G+ ++ + P
Sbjct: 118 AHLYEHAAEVQIPFIQ--YFNQKTEIVVMTIASRKIQDLKTIGKCMSQMLQKLHPDALVV 175
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSD H + + +KK + I AL++ D + E ++CG
Sbjct: 176 ASSDMTHHEPQAS---ANKKDNIAINEILALNE-----------DGLYNKVRELRISMCG 221
Query: 242 RHPISVFLHMLGNCSTKIKIK---FLRYEQSSQCKTKRDSSVSYA 283
+P + L CS + K +RY S D V YA
Sbjct: 222 IYPAVIML----TCSKERGAKEAILVRYATSGDVTGDYDQVVGYA 262
>gi|291287744|ref|YP_003504560.1| hypothetical protein Dacet_1840 [Denitrovibrio acetiphilus DSM
12809]
gi|290884904|gb|ADD68604.1| protein of unknown function DUF52 [Denitrovibrio acetiphilus DSM
12809]
Length = 260
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 89/185 (48%), Gaps = 11/185 (5%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+A+ AG +Y ++ + E + L + G P+ + GV+ PHAGY YSG A +
Sbjct: 3 RQAAVAGVFYPADAGSVTEFIKSSLPD-GAPE--KAVGVVVPHAGYIYSGATAVRTLARV 59
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
V L+GP+H P ++ + TP+GD+P+D ++++ F D
Sbjct: 60 KIPD--TVILVGPNHTGAGPSISVYPEGSWATPLGDVPVDSVLVDKFCENPLFN-KDTTA 116
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
+EHS+E+ LP L K F +K+V + V +N E + A D F +SS
Sbjct: 117 HHSEHSLEVILPIL-KYFNPD-VKVVCVTVKYINLETAETAAKHIADVTD---ALFVISS 171
Query: 185 DFCHW 189
D H+
Sbjct: 172 DLNHF 176
>gi|320101532|ref|YP_004177124.1| hypothetical protein [Desulfurococcus mucosus DSM 2162]
gi|319753884|gb|ADV65642.1| protein of unknown function DUF52 [Desulfurococcus mucosus DSM
2162]
Length = 283
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 123/309 (39%), Gaps = 59/309 (19%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEVR--------GVIAPHAGYSYS 53
K R A AG +Y D + L + ++ + P S P V G + PHAGY YS
Sbjct: 2 KRRTAVVAGYFYPDKREELVKVIEWSFKHGVGPGSTPSVSEAPVSNIIGYVVPHAGYIYS 61
Query: 54 GRAAAYAF----GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
G AA+A+ N P +I +LG +H ++ +++TPIGDL +D EV E
Sbjct: 62 GPVAAHAYYDMAVNGRPDTI---VILGTNHTGMGRIVSVYPGGLWETPIGDLTVDGEVAE 118
Query: 110 ELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEG-----------HLIKIVPILVGAVN 158
+ D EHS+E+ LP++A ++ G H I L A+
Sbjct: 119 GIVKHSSVAEFDEYAHLEEHSVEVQLPFIAYLYNGRVRIVPVVIRVHTPDIARDLAEAIR 178
Query: 159 AENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
E+ ++ SSDF HY+ + K + A+++
Sbjct: 179 RVTESSGKKI----------IILASSDFN---------HYEPHDETVRKDMAAINE---- 215
Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTK 275
I D D F +L D +ICG I + MLG K L++ S
Sbjct: 216 -ILKLDSDGFYNTMLRDDISICGPGGIMTLIEYTRMLGG-----KALLLKHATSGDTSGD 269
Query: 276 RDSSVSYAS 284
R V YAS
Sbjct: 270 RSHVVGYAS 278
>gi|254167087|ref|ZP_04873940.1| conserved hypothetical protein [Aciduliprofundum boonei T469]
gi|197623943|gb|EDY36505.1| conserved hypothetical protein [Aciduliprofundum boonei T469]
Length = 271
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 128/291 (43%), Gaps = 35/291 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSP--------EVRGVIAPHAGYSYSG 54
+R + AG +Y K L L+ ++ G P +++G + PHAGY +SG
Sbjct: 1 MRYPAVAGQFYPAERKELEIMLNELFIHPIGPGSVPSLIPDGKRKIKGGVVPHAGYMFSG 60
Query: 55 RAAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+ + + F ++GP+H+ A++ + TP G + +D E+ +++ A
Sbjct: 61 PVAAHFYHALALDGFPESFIIIGPNHYGVGSGVAIALED-FLTPFGKVKVDRELAKDI-A 118
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
G ++ D EHS+E+ LP+L F IK VPI + + E G + + +
Sbjct: 119 KGVIDI-DGYAHRYEHSIEVQLPFLQ--FFKKEIKFVPITMLLQDYEIAIEVGEIIKEAI 175
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
D SSDF H+ + N + + D M ++ I GD +
Sbjct: 176 ADKDVVIIASSDFSHYVPK-NKAYTN-------------DYMAIERIVNGDVKGLYDVIY 221
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+++ T+CG P++ L +G +T L+Y S + + V YA+
Sbjct: 222 KHNITMCGYGPVTAMLTAIGGKAT-----LLKYATSGDIQP-MNEVVGYAA 266
>gi|291303740|ref|YP_003515018.1| hypothetical protein Snas_6306 [Stackebrandtia nassauensis DSM
44728]
gi|290572960|gb|ADD45925.1| protein of unknown function DUF52 [Stackebrandtia nassauensis DSM
44728]
Length = 254
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 106/232 (45%), Gaps = 32/232 (13%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-RGVIAPHAGYSYSGRAAAY 59
M +R A+ AG +Y ++ L ++ + +P S E+ R + PHAGY YSG AA+
Sbjct: 1 MSDVRPAAVAGRFYPGAAESLRTMIESMVDSIEVPDSDELARAYVVPHAGYRYSGPTAAH 60
Query: 60 AFGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+ + + RV L+GPSH A S A ++TP+G + E + A
Sbjct: 61 VYARLRHHAARVKRVVLVGPSHFVPLQGLATSPAAGWQTPLGT--VTTPAAEGIPA---- 114
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
+ E EHS+E+ LP+L +V G +++ PI+VG E+ A VDD +
Sbjct: 115 ---EEAPHEREHSLEVQLPFL-QVCVGD-VEVTPIVVGKSTIEDAAT---AINGLVDDQT 166
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
S+D H+ +D+ ++ + LD+ D I +P K
Sbjct: 167 VLL-CSTDLSHY--------HDE------ETAKRLDRATADAILAAEPRRIK 203
>gi|163782830|ref|ZP_02177826.1| hypothetical protein HG1285_15881 [Hydrogenivirga sp. 128-5-R1-1]
gi|159881951|gb|EDP75459.1| hypothetical protein HG1285_15881 [Hydrogenivirga sp. 128-5-R1-1]
Length = 266
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 124/281 (44%), Gaps = 24/281 (8%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
R+ + AG++Y ++L++ +D E K + + ++ PHAG YSG+ A + +
Sbjct: 3 RKPAVAGTFYPAEVEKLSKLVDLLCGEESAEKM-KAKAILVPHAGLIYSGKTACAVYKRV 61
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
R+ LLGP+H ++ +++P G++ +D E + ++ + D
Sbjct: 62 --YIPERIVLLGPNHTGMGTDISVYPGEAWESPYGEVQID-EELRDMVLKYPYASPDESA 118
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS 184
EHS+E+ LP+L + E +I+PI++G ++ + +GR + +SS
Sbjct: 119 HIYEHSLEVQLPFLFRYAE-RPFRILPIILGLLDYDRARDFGRFLGSLLQGIDALIVISS 177
Query: 185 DFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL--LEYDNTICGR 242
D H Y+ D+ I A++++ D + YL ++Y+ T+CG
Sbjct: 178 DMSH------YIPADEARRKDEILISAMERLQTD----------ELYLKRVQYNITMCGF 221
Query: 243 HPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
P + + I + Y S + D V+YA
Sbjct: 222 IPAVAGIEASKVLGARQGI-LIDYTNSGEVTGDYDRVVAYA 261
>gi|410465125|ref|ZP_11318489.1| putative dioxygenase [Desulfovibrio magneticus str. Maddingley
MBC34]
gi|409981753|gb|EKO38278.1| putative dioxygenase [Desulfovibrio magneticus str. Maddingley
MBC34]
Length = 278
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 118/288 (40%), Gaps = 31/288 (10%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R+ AG +Y + LA E +L EAG P +APHAG YSG A G
Sbjct: 10 SLRQPVVAGRFYPADPAALARETAAYLAEAGKPSERPTILAMAPHAGAVYSGPVAGKTLG 69
Query: 63 --NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
N+ T + LLGP+H A+ ++ P D+P++ + E L A
Sbjct: 70 AANLADT----LLLLGPNHTGRGAPMAVWPDGAWRIPGRDVPVEAALAEALLAAAPGLSP 125
Query: 121 DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILV-----GAVNAENEAMYGRLFAKYVDD 175
D EHS+E+ LP+L V G +IVP+ V G + AM G L A +
Sbjct: 126 DQAAHLGEHSLEVLLPFLTAVRPG--CRIVPVAVAEPRPGVLAETARAMAGVLAAW--PE 181
Query: 176 PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
P + VSSD H+ K+ + ALD G+ + G+
Sbjct: 182 PISII-VSSDMSHY---LPQATAQKRDSAALARVLALDPQGLLEVVRGE----------- 226
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
D ++CG P+++ L ++ + Y S + + V YA
Sbjct: 227 DISMCGVLPMTLGL-LIARALGATSARLAAYATSGEVSGDYEQVVGYA 273
>gi|374633849|ref|ZP_09706214.1| putative dioxygenase [Metallosphaera yellowstonensis MK1]
gi|373523637|gb|EHP68557.1| putative dioxygenase [Metallosphaera yellowstonensis MK1]
Length = 282
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 128/294 (43%), Gaps = 29/294 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRGV--------IAPHAGYSYSG 54
IRR + AGS+Y D+ K L + ++ +L G + PE I PHAGY YSG
Sbjct: 2 IRRPAVAGSFYEDDPKELRKRIEWSFLHHVGPGRLPEPSNAARRTSPFFIVPHAGYIYSG 61
Query: 55 RAAAYAFGN-IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+++ + + V ++GP+H Y ++ ++TP+G +P+ +E ++E+ +
Sbjct: 62 PVAAHSYYHLVREGRPDLVIMVGPNHTGYGSSVSVWPGGHWETPLGKVPVPIEFVKEIVS 121
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYG--RLFAK 171
+ + EHS+E+ +P+L F I + IL+ G R +
Sbjct: 122 LSEVLDPEERAHLYEHSLEVQVPFLQYFFTEIPIIPITILLQTPETAEFVAEGIWRFMGR 181
Query: 172 YVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ D SSD HYD I K +A+ K IIE D +
Sbjct: 182 H-KDLDIVVLASSDMN---------HYDPHEMTIEKDEKAIKK----IIEL-DYRGLYRV 226
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASA 285
+ E D T+CG P+ V + + K I L++ S + S V Y +A
Sbjct: 227 VEEEDVTMCGYAPVMVSMILARKAGKKPYI--LKHATSGDTSGDKSSVVGYLAA 278
>gi|45358950|ref|NP_988507.1| hypothetical protein MMP1387 [Methanococcus maripaludis S2]
gi|48428699|sp|P61652.1|Y1387_METMP RecName: Full=MEMO1 family protein MMP1387
gi|45047816|emb|CAF30943.1| Protein of unknown function DUF52 [Methanococcus maripaludis S2]
Length = 284
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 124/298 (41%), Gaps = 30/298 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
R AG +Y L E ++ +L G + P RG +++PHAGY YSG
Sbjct: 3 RNPVVAGMFYPAEYHELLEMIEYCYLNPRGPRELPSKRGKYTKPLGIVSPHAGYIYSGPV 62
Query: 57 AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + I + +LGP+H A T +KTP GD+ +D E + L
Sbjct: 63 AAHGYKKISENISGEITAIILGPNHTGLGSGIATMKGT-WKTPFGDMEIDNEFADRLWKE 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
MD EHS+E+ LP+L + + ++ K VPI + + E+ G + AK
Sbjct: 122 CDILDMDENSHLREHSIEVQLPFLKHLEDLNIAKFKFVPISMMMQDYESCIDVGYVIAKV 181
Query: 173 VDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
+ + + S+DF H+ + KK V+ K DI+E D + F
Sbjct: 182 ARELNRKIVIIASTDFSHYEPQ---EQASKKDAVVIK----------DILELNDEEIFTD 228
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
++ + ++CG P+ + + + + L Y S V YAS K
Sbjct: 229 -VVTHTISMCGYGPVIAMVKAMKDLGARTSY-LLYYSTSGDVTKDYSEVVGYASLLIK 284
>gi|254167561|ref|ZP_04874412.1| conserved hypothetical protein [Aciduliprofundum boonei T469]
gi|289596975|ref|YP_003483671.1| protein of unknown function DUF52 [Aciduliprofundum boonei T469]
gi|197623370|gb|EDY35934.1| conserved hypothetical protein [Aciduliprofundum boonei T469]
gi|289534762|gb|ADD09109.1| protein of unknown function DUF52 [Aciduliprofundum boonei T469]
Length = 271
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 128/291 (43%), Gaps = 35/291 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSP--------EVRGVIAPHAGYSYSG 54
+R + AG +Y K L L+ ++ G P +++G + PHAGY +SG
Sbjct: 1 MRYPAVAGQFYPAERKELEIMLNELFIHPIGPGSVPSLIPDGKRKIKGGVVPHAGYMFSG 60
Query: 55 RAAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA+ + + F ++GP+H+ A++ + TP G + +D E+ +++ A
Sbjct: 61 PVAAHFYHALALDGFPESFIIIGPNHYGVGSGVAIALED-FLTPFGKVKVDRELAKDI-A 118
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
G ++ D EHS+E+ LP+L F IK VPI + + E G + + +
Sbjct: 119 KGVIDI-DGYAHRYEHSIEVQLPFLQ--FFKKEIKFVPITMLLQDYEIAIEVGEIIKEAI 175
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
D SSDF H+ + N + + D M ++ I GD +
Sbjct: 176 VDKDVVIIASSDFSHYVPK-NKAYTN-------------DYMAIERIVNGDVKGLYDVIY 221
Query: 234 EYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+++ T+CG P++ L +G +T L+Y S + + V YA+
Sbjct: 222 KHNITMCGYGPVAAMLTAIGGKTT-----LLKYATSGDIQP-MNEVVGYAA 266
>gi|374635034|ref|ZP_09706639.1| protein of unknown function DUF52 [Methanotorris formicicus
Mc-S-70]
gi|373563436|gb|EHP89630.1| protein of unknown function DUF52 [Methanotorris formicicus
Mc-S-70]
Length = 286
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 81/303 (26%), Positives = 130/303 (42%), Gaps = 44/303 (14%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD-GWLR---------EAGLPKSPEVRGVIAPHAGYSYS 53
IR AG +Y +S L E ++ +L E G+ K P G+I PHAGY YS
Sbjct: 2 IRNPVVAGVFYPADSNELIELIEYCYLHKLGPKEIPSEGGIFKKP--VGLICPHAGYIYS 59
Query: 54 GRAAAYAFGNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIE 109
G AA+++ + D ++GP+H A + ++KTP+G+L +D E ++
Sbjct: 60 GPIAAHSYNALSKRADIEDEITAVIIGPNHTGLGTGVA-TMKGIWKTPLGNLEIDNEFVD 118
Query: 110 ELKATGKFELMDICVDEA----EHSMEMHLPYLA--KVFEGHLIKIVPILVGAVNAENEA 163
L + ++MD+ DE EHS+E+ LP+L + K VPI + + E
Sbjct: 119 RLWK--ECDIMDL--DETSHLHEHSIEVQLPFLQHLSILNIAKFKFVPISMLFQDYETSV 174
Query: 164 MYGRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIE 221
G AK + + V S+DF HY+ + K D + + I
Sbjct: 175 DVGYFIAKIAKELNRRVVVIASTDFT---------HYEPQEVAAKK-----DAIAIKNIL 220
Query: 222 TGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVS 281
D + ++ Y+ ++CG P+ + + K + K L Y S S V
Sbjct: 221 NMDEEELYSDVVNYNISMCGCGPVMAMIKAMKLLGGK-EAKLLSYATSGDITKDYSSVVG 279
Query: 282 YAS 284
YAS
Sbjct: 280 YAS 282
>gi|296454703|ref|YP_003661846.1| dioxygenase [Bifidobacterium longum subsp. longum JDM301]
gi|296184134|gb|ADH01016.1| dioxygenase [Bifidobacterium longum subsp. longum JDM301]
Length = 595
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 121/282 (42%), Gaps = 64/282 (22%)
Query: 4 IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
IR + AGS+Y D S+ + ++L+ L AG+PK+ VI PHAG
Sbjct: 48 IRPPAVAGSFYPADRTTLKQLITHQLDYSREVLQQLEPTL-PAGVPKA-----VIVPHAG 101
Query: 50 YSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
Y YSG AAA A+ ++ S++R ++GP+H A STA ++TP+G +P+D
Sbjct: 102 YVYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 161
Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
L V E L+ A+ +++ EH++E+ +P+L V G + IV
Sbjct: 162 ERKALGLSVNEPLRSGTHARPGASAPAMIVNGPTHAQEHAVEVQIPFLQTVL-GPDLTIV 220
Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
P+ G + G + P +SSD H+ H+
Sbjct: 221 PLNAGDATPQE---AGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 263
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
ALD + D +L + CG +PI+ L +L
Sbjct: 264 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 298
>gi|159905669|ref|YP_001549331.1| hypothetical protein MmarC6_1286 [Methanococcus maripaludis C6]
gi|229554358|sp|A9A9S6.1|Y1286_METM6 RecName: Full=MEMO1 family protein MmarC6_1286
gi|159887162|gb|ABX02099.1| protein of unknown function DUF52 [Methanococcus maripaludis C6]
Length = 284
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 125/298 (41%), Gaps = 30/298 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELD-GWLREAGLPKSPEVRG-------VIAPHAGYSYSGRA 56
R AG +Y L E ++ +L G + P RG +++PHAGY YSG
Sbjct: 3 RNPVVAGMFYPAEYHELLEMIEYCYLSPRGPKELPSKRGNYTKPLGIVSPHAGYIYSGPV 62
Query: 57 AAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT 114
AA+ + I V +LGP+H + + ++KTP GD+ +D E ++L
Sbjct: 63 AAHGYKKISENVSGEVTAIILGPNHTGLGSGIS-TMKGIWKTPFGDMEIDNEFADKLWKE 121
Query: 115 GKFELMDICVDEAEHSMEMHLPYLAKVFEGHL--IKIVPILVGAVNAENEAMYGRLFAKY 172
+D EHS+E+ LP+L + + ++ K VPI + + E G AK
Sbjct: 122 CDVLDIDENSHLREHSIEVQLPFLKHLEDLNIAKFKFVPISMMMQDYETSIDVGYFIAKV 181
Query: 173 VDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
+ + S+DF H+ + + KK ++ K DI+E D + F
Sbjct: 182 AKEMNRKIIIIASTDFSHYEPQES---ASKKDAIVIK----------DILELKDEEIFID 228
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
++ ++ ++CG P+ + + + K L Y S V YAS K
Sbjct: 229 -VVTHNISMCGYGPVIAMVKAMKDLGAKTA-NLLYYSTSGDVTKDYSEVVGYASILVK 284
>gi|407464049|ref|YP_006774931.1| hypothetical protein NSED_00885 [Candidatus Nitrosopumilus sp. AR2]
gi|407047237|gb|AFS81989.1| hypothetical protein NSED_00885 [Candidatus Nitrosopumilus sp. AR2]
Length = 276
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 109/248 (43%), Gaps = 29/248 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLR-----EAGLPKSPE-----VRGVIAPHAGYSYS 53
IR AG +Y + E+L +R E G +P+ + G+I PHAGY YS
Sbjct: 2 IREPVVAGQFYPEKE----EDLKNMIRYCIDHEYGPGNNPKTSNDKIYGIICPHAGYVYS 57
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G A +++ I + ++GP+H A + ++TP+G + +D E +++
Sbjct: 58 GPTACHSYKAISSQNPELAIIIGPNHFGVGKDAATMIDSQWQTPLGMVSIDSESAKQVAE 117
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
+F +D +HS+E+ +P L ++ +I+PI++ A + + G +
Sbjct: 118 ISEFIEIDEYSHSQDHSLEVQIPMLQEMLSNEF-QILPIILRAQDMKTAMDVGNAVYEIA 176
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
+ SSDF H+ +++ H+ DK ++ I + + F + L
Sbjct: 177 KRKNAIIVASSDFTHY----------EENSFAHQQ----DKALIEPILEMNVERFYQVLY 222
Query: 234 EYDNTICG 241
E T CG
Sbjct: 223 ERRVTACG 230
>gi|242399201|ref|YP_002994625.1| hypothetical protein TSIB_1222 [Thermococcus sibiricus MM 739]
gi|242265594|gb|ACS90276.1| hypothetical protein TSIB_1222 [Thermococcus sibiricus MM 739]
Length = 292
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 67/295 (22%), Positives = 134/295 (45%), Gaps = 16/295 (5%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R + AGS+Y + L L + R+ G L + ++ +APHAGY +SG A+ +
Sbjct: 2 LRYPTVAGSFYP-TGEELKMMLKEFFRDLGELGEERKITAGVAPHAGYVFSGFTASRTYK 60
Query: 63 NIDPTSISRVFL-LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
I + F+ +GP+H A+ + TP+G + +D ++ + + +D
Sbjct: 61 AIYEDGLPETFVVIGPNHTGLGSPVAIYPEGKWITPMGGIKVDEDLAKAIARHSGIADLD 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFEG--HLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
+ EHS+E+ +P++ + + ++IVPI +G + E G+ + +
Sbjct: 121 ELAHKYEHSIEVQIPFIQYISQKAEEEVRIVPITLGLQDEEVAEDLGKAIFEASQELGRD 180
Query: 180 FSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALD-KMGMDIIETGDPDAFKKYLLEYD 236
V S+D H+G + Y+ + + + I+ D ++ I+E F + + + +
Sbjct: 181 IVVIASTDMMHYGYAYGYVPFRARGDDLLGRIKEWDFRIIQKILEFDHKGMFDE-IRKMN 239
Query: 237 NTICGRHPIS---VFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+T+CG ++ VF + G T++ L Y S + D+ V Y S +
Sbjct: 240 HTMCGPGGVATAIVFSRLAGAVETEV----LHYTTSFEVSRSTDAIVGYVSIVMR 290
>gi|217076175|ref|YP_002333891.1| hypothetical protein THA_42 [Thermosipho africanus TCF52B]
gi|419760738|ref|ZP_14287007.1| hypothetical protein H17ap60334_08093 [Thermosipho africanus
H17ap60334]
gi|217036028|gb|ACJ74550.1| hypothetical protein THA_42 [Thermosipho africanus TCF52B]
gi|407514244|gb|EKF49089.1| hypothetical protein H17ap60334_08093 [Thermosipho africanus
H17ap60334]
Length = 262
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 117/284 (41%), Gaps = 32/284 (11%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN- 63
R +A +Y NS+ L + + + + + + G+I PHAGY YSG+ A YA
Sbjct: 3 REMYYADKFYPKNSELLIKFFNQFCKVKEIKNTNSNFGLILPHAGYVYSGKTALYAICKA 62
Query: 64 IDPTSISRVFLLGPSHHYYTPK-CALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+D R+ + G +H C+L + TP G + +D E+ L FE +
Sbjct: 63 LDFGKPERIIIFGTNHTGIAGNICSLWPRGKWNTPFGYVGIDEELNNYLLKNEIFE-ENY 121
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSV 182
EHS+E+ LP L F K VPI+ + EN L K +D +
Sbjct: 122 KAHILEHSIEVLLPILKYYFGS--FKFVPIIYNYQSYENTIKIVELLKK-IDLKNTLLVA 178
Query: 183 SSDFCHWGSRFNYMHYDKK--HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSDF H+ S + D+K ++ ++IE + Y L + C
Sbjct: 179 SSDFNHYESHEITLRKDEKLIEKILERNIEGI------------------YSLRKSISAC 220
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
G P+SV N +K + + S++ +V YAS
Sbjct: 221 GIGPMSVLTMYFDN------VKLVYHTTSAEYSKDYTFTVGYAS 258
>gi|268678675|ref|YP_003303106.1| hypothetical protein Sdel_0033 [Sulfurospirillum deleyianum DSM
6946]
gi|268616706|gb|ACZ11071.1| protein of unknown function DUF52 [Sulfurospirillum deleyianum DSM
6946]
Length = 263
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 89/195 (45%), Gaps = 13/195 (6%)
Query: 3 KIRRASHAGSWYTDNS---KRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAA-- 57
K R+++ AG +Y ++ K + ++E P + +I PHAGY YSG A
Sbjct: 2 KTRKSAVAGMFYPESCHDIKAFITHFESQIKEPTCTIVP--KALIVPHAGYVYSGYTANL 59
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
AY + + I V ++GPSH Y +++ Y TP G++ +DLE L+ F
Sbjct: 60 AYHYTASKRSDIHCVVVIGPSHRIYLEGASIALYESYHTPCGEIAIDLEYSHALQKRFSF 119
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
E EHS E+ +P++ F+ +K+V I+ G + L + + +
Sbjct: 120 LSFHPSAHE-EHSTEVQMPFIRHYFKH--VKVVEIVYGDI---THGELSNLMDEVLKEEG 173
Query: 178 NFFSVSSDFCHWGSR 192
VS+D H+ S
Sbjct: 174 RLLVVSTDLSHFHSE 188
>gi|303248253|ref|ZP_07334516.1| protein of unknown function DUF52 [Desulfovibrio fructosovorans JJ]
gi|302490391|gb|EFL50302.1| protein of unknown function DUF52 [Desulfovibrio fructosovorans JJ]
Length = 280
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 21/283 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS--PEVRGVIAPHAGYSYSGRAAAYAF 61
IR+ AG +Y + L E+ G+L +A P+S P + ++ PHAGY +SG A
Sbjct: 13 IRQPVVAGRFYPGDPAGLRREVAGYLAQAA-PRSEKPTLLAMV-PHAGYVFSGLVAGRTL 70
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
G R+ LLGP+H + A+ + P ++P+D + EL A D
Sbjct: 71 GAA--RLADRLLLLGPNHTGQGARLAVWPEGAWLVPGCEVPVDAGLAGELMAAAPRLASD 128
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
+ EHS+E+ LP+L V G +IVPI V AE + R A + +
Sbjct: 129 VAAHMGEHSLEVELPFLCAVNPGA--RIVPICV----AEPDPGVLREVAGAI------VA 176
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
V + H S + D H + H + + D M + + DP+ + T+CG
Sbjct: 177 VLRELSHPVSIV--VSSDMSHYISHTAAKKRDSMALSRVLALDPEGLCAVVRSEGITMCG 234
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
P+++ L + + + Y S + + V YA
Sbjct: 235 VLPMALGLFVAIGLGARDAL-LAAYATSGEVTGDMEQVVGYAG 276
>gi|227546624|ref|ZP_03976673.1| dioxygenase [Bifidobacterium longum subsp. longum ATCC 55813]
gi|227212941|gb|EEI80820.1| dioxygenase [Bifidobacterium longum subsp. infantis ATCC 55813]
Length = 596
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 119/282 (42%), Gaps = 64/282 (22%)
Query: 4 IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
IR ++ AGS+Y D ++L ++L+ L AG+P R VI PHAG
Sbjct: 48 IRPSAVAGSFYPADRTALKQLINQQLDYGRKLLQQLEPTL-PAGVP-----RAVIVPHAG 101
Query: 50 YSYSGRAAAYAFGNIDPTS--ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
Y YSG AAA A+ ++ ++R ++GP+H A STA ++TP+G +P+D
Sbjct: 102 YIYSGTAAALAYALLERGRGRVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 161
Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
L V E L+ A +++ EH++E+ +P+L V G + IV
Sbjct: 162 ERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPDLTIV 220
Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
P+ G + G + P +SSD H+ H+
Sbjct: 221 PLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 263
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
ALD + D +L + CG +PI+ L +L
Sbjct: 264 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 298
>gi|189345729|ref|YP_001942258.1| hypothetical protein Clim_0180 [Chlorobium limicola DSM 245]
gi|189339876|gb|ACD89279.1| protein of unknown function DUF52 [Chlorobium limicola DSM 245]
Length = 285
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 52/232 (22%), Positives = 113/232 (48%), Gaps = 19/232 (8%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGL--PKSPEVRGVIAPHAGYSYSGRAAAYA 60
+IR + A ++Y + + L + L++ K ++R ++ PH+GY YSG AA A
Sbjct: 4 EIRYPAVAEAFYPADIRELESLITSLLQQVKPEHKKRKKIRAIVTPHSGYLYSGSIAACA 63
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ I ++ VFL+G +H Y AL + + TP+G++P++ ++ L +
Sbjct: 64 YKAITGSTFKNVFLMGHAHAYLFEGVALDSHKAWHTPLGNVPVNTDMNARLIGLAPELVH 123
Query: 121 DICV-DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
++ + ++H++E+ +P+L + + IVP+L G NA + + + +P +
Sbjct: 124 ELNIAHHSDHTLEVQIPFLLRTLKPDF-SIVPMLFGE-NAIDVHLKTADLLLPLLNPDDL 181
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
VS+D H+ + ++ +D++ ++ I + D D +++
Sbjct: 182 LVVSTDLSHYPA--------------YRDANTIDRLTLEHIVSSDIDGLEEH 219
>gi|413951293|gb|AFW83942.1| hypothetical protein ZEAMMB73_880994, partial [Zea mays]
Length = 136
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 34/59 (57%), Positives = 41/59 (69%)
Query: 48 AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE 106
+G + R +A + SRVFLLGPSHHYYTPKCAL+ A+VY TPIGDLP+D E
Sbjct: 69 SGSIHGAREEGFARRLLVHQRCSRVFLLGPSHHYYTPKCALTRASVYCTPIGDLPVDQE 127
>gi|372270868|ref|ZP_09506916.1| dioxygenase [Marinobacterium stanieri S30]
Length = 267
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 94/195 (48%), Gaps = 18/195 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
+R+ + AG +Y L + +D L ++ LP+ P + PHAG YSG AA
Sbjct: 6 VRQPAVAGVFYPGTETDLNQLVDQLLVQSTEPVSLPRHP--CAFVVPHAGLVYSGPIAAL 63
Query: 60 AFGNID---PTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
+ I+ P + R V LLGP+H A T + TP+GD+ +DL++ L+A
Sbjct: 64 VYRQIERWMPLAGWRQVVLLGPNHRVPLSGFAGVAETNWHTPLGDVAIDLDLERSLQA-- 121
Query: 116 KFEL-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD 174
+F+L + + EH +E+ LP+L KV +++P+LVG E L A+
Sbjct: 122 RFDLPVRQDAHQLEHCLEVQLPFLQKVAPD--ARVLPLLVGHTPVET---VSALIAELWQ 176
Query: 175 DPSNFFSVSSDFCHW 189
VSSD H+
Sbjct: 177 REDVLVLVSSDLSHF 191
>gi|298529330|ref|ZP_07016733.1| protein of unknown function DUF52 [Desulfonatronospira
thiodismutans ASO3-1]
gi|298510766|gb|EFI34669.1| protein of unknown function DUF52 [Desulfonatronospira
thiodismutans ASO3-1]
Length = 266
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 115/276 (41%), Gaps = 24/276 (8%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
AG +Y ++ L E+L +L+ G S + PHAGY +SG A + S
Sbjct: 8 AGQFYPGSAAALDEKLSRYLQ--GEQASARTLLAMVPHAGYPFSGPVAGKVLARSNLAS- 64
Query: 70 SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
R+ LLGP+H + A+ ++ P G++ +D + + + D EH
Sbjct: 65 -RIILLGPNHTGQGQRIAVWGDGSWRLPGGEVMVDEDTASVIGKLPGYSF-DYQAHLREH 122
Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS--VSSDFC 187
S+E+ LP+LA+ G KIVPI V + + G A+ + D + S VS+D
Sbjct: 123 SLEVILPFLARAVPG--CKIVPISVAEPDLQVLMRAGEDLARTIRDLALDVSLVVSTDMS 180
Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISV 247
H+ V + LD M ++ I DP + + + ++CG P ++
Sbjct: 181 HF--------------VPQEQARRLDHMAIERILDLDPQGLHQVVKQNRISMCGVMPTTL 226
Query: 248 FLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
L + + +F+ Y S V YA
Sbjct: 227 GLACVREMGAR-STEFIDYATSGDAIGDYSQVVGYA 261
>gi|270308481|ref|YP_003330539.1| hypothetical protein DhcVS_1094 [Dehalococcoides sp. VS]
gi|270154373|gb|ACZ62211.1| hypothetical protein DhcVS_1094 [Dehalococcoides sp. VS]
Length = 438
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 120/288 (41%), Gaps = 35/288 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAE-----ELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAA 58
+R AS AG +Y + +++ E+ G +E L G + PHAGY YSG A
Sbjct: 2 LRPASAAGRYYPASKEKIRSVITPLEIKGVTKERYL-------GAVMPHAGYPYSGGVAM 54
Query: 59 YAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+D P + V +LGPSH + A+ + +++TP+G++ +D + + +
Sbjct: 55 AVATRLDLPET---VIILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRH 111
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
D + EHS+E+ +P L + F+ + IKIVPI + E G A + +
Sbjct: 112 IKADPSAHQYEHSIEVQIPIL-QYFKPN-IKIVPITLSFGKTETLEDIGHGIASALRETG 169
Query: 178 N--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
SSD H+ + H D M +D I D + +
Sbjct: 170 REAIIIASSDMTHYEPQTE-AHLK-------------DSMALDAIVKLDAADMLERIQAN 215
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
T+CG P++ L + K + + + Y+ S D V YA
Sbjct: 216 HITMCGYAPVAAMLTAVKEMGAK-RARVVAYQTSGDITHHMDQVVGYA 262
>gi|253700550|ref|YP_003021739.1| hypothetical protein GM21_1928 [Geobacter sp. M21]
gi|251775400|gb|ACT17981.1| protein of unknown function DUF52 [Geobacter sp. M21]
Length = 266
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 66/287 (22%), Positives = 128/287 (44%), Gaps = 34/287 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
IR+ + AG +Y+ + ++L EEL + RE + + GV+APHAGY YSG AA +
Sbjct: 2 IRQPAVAGKFYSADPEQLREELSRMIPRE----EPAKAIGVVAPHAGYVYSGGAAGKVYA 57
Query: 63 NID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
++ P + V +LGP+H AL+ + + TP+G + ++ + + + D
Sbjct: 58 AVEVPDA---VIVLGPNHTGMGAAAALAPSGEWLTPLGPVRVNDRLSQLIMKHAPLVRED 114
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--F 179
EHS+E+ +P+L + + I + + + ++ + G A + + +
Sbjct: 115 SVAHRFEHSLEVQVPFLQ--YRNPQVSIAALCLSLPDFDSISRVGTGIAHAIAEYGSEVL 172
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
SSD H+ S +A D + ++ + DP+ + + D ++
Sbjct: 173 IVASSDMTHYES--------------AAQAKAKDDLALERMACMDPEGLLRVCRDKDISM 218
Query: 240 CGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
CG P + L LG +++ + Y S + ++ V+YA
Sbjct: 219 CGVIPATAMLVAAKTLGASGSRL----ICYTNSGEVNGDLNAVVAYA 261
>gi|374316993|ref|YP_005063421.1| putative dioxygenase [Sphaerochaeta pleomorpha str. Grapes]
gi|359352637|gb|AEV30411.1| putative dioxygenase [Sphaerochaeta pleomorpha str. Grapes]
Length = 301
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 133/302 (44%), Gaps = 33/302 (10%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAG---LPKS---PEVRGVIAPHAGYSYSGRA 56
++R A AGSWY ++ L L+ L+EA L K PE R I PHAG S+S R
Sbjct: 9 RVRHAGFAGSWYPLDATELESFLEISLQEAKDRELQKGSFYPEGRFAILPHAGLSFSQRG 68
Query: 57 AAYAFGNIDPTSISRVFLLGPSHH--YYTPKCALSTATVYKTPIGDL---PLDLEVIEEL 111
A F + P + RV ++ PSH+ Y+ + ++ + Y+TP+G+L L +
Sbjct: 69 IAPFFLHFPPQA-ERVLIIAPSHYTSLYSDELSVGSFDEYETPLGNLKGFSLSFGIKGGE 127
Query: 112 KATGKFELMDICVDEAEHSMEMHLPYLAKVFEG--HLIKIVPILVGAVNAENEA------ 163
+A ++EH++EM +PYLA V + + + LV ++ ++A
Sbjct: 128 RAI-----------QSEHAVEMVMPYLAWVVKTRKQELAVSTALVSHLSDCSKAKEIAHR 176
Query: 164 MYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKK-HGVIHKSIEALDKMGMDIIET 222
+ L + + SSDF H+G RF + Y+ I ++ D ++
Sbjct: 177 LIDNLGEESLRSGKTLVIASSDFTHYGRRFAHTPYENNLPRAIFNRVKEDDLALSALLID 236
Query: 223 GDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
G+ + + +T+CG ++ + L C Y + + + D V+Y
Sbjct: 237 GNIKEALAFCENHHSTVCGLASGAI-VATLATCFNAKGTLSEYYTSNDVFENEEDEFVTY 295
Query: 283 AS 284
+S
Sbjct: 296 SS 297
>gi|256089007|ref|XP_002580610.1| hypothetical protein [Schistosoma mansoni]
Length = 80
Score = 70.5 bits (171), Expect = 9e-10, Method: Composition-based stats.
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNC-----STKIKIKFLRYEQSS 270
GM++IE P F YL +Y NTICGRHPI V L ++ + +KF+RY QS
Sbjct: 1 GMELIERLAPSEFTSYLEQYGNTICGRHPIGVLLQIVAYLRRNMPNNNFNMKFVRYAQSE 60
Query: 271 QCKTKRDSSVSYASAAAKV 289
C SSVSYA+ ++
Sbjct: 61 HCNNMNQSSVSYAAGVLQI 79
>gi|189424944|ref|YP_001952121.1| hypothetical protein Glov_1885 [Geobacter lovleyi SZ]
gi|189421203|gb|ACD95601.1| protein of unknown function DUF52 [Geobacter lovleyi SZ]
Length = 267
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 24/249 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR+A+ AG +Y + L EL L K + GVI+PHAGY YSG AA
Sbjct: 2 IRQAAVAGQFYPGSQSALIRELQILLPAVKTRK--KAIGVISPHAGYLYSGSAAGQLLAG 59
Query: 64 ID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I+ P + V +LGP+HH A+S ++TP+G +P++ + ++ D
Sbjct: 60 IELPRT---VLILGPNHHGTGALAAVSPDDGWQTPLGVVPIEKRLAGLIQQHVPRVEQDS 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFF 180
EHS+E+ +P+L + + IVP+ + + + G A + D
Sbjct: 117 SAHRHEHSLEVQVPFLQYLRPD--LNIVPLCLAFGDYAGCELLGNGLAAAIRDFGEEVLI 174
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSD H+ S +K + + I A D G+ +E + T+C
Sbjct: 175 LASSDMTHYES---VATARRKDSLALERILAFDAQGL--VEVCRTERI---------TMC 220
Query: 241 GRHPISVFL 249
G P +V L
Sbjct: 221 GVIPAAVML 229
>gi|383787285|ref|YP_005471854.1| putative dioxygenase [Fervidobacterium pennivorans DSM 9078]
gi|383110132|gb|AFG35735.1| putative dioxygenase [Fervidobacterium pennivorans DSM 9078]
Length = 266
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 72/283 (25%), Positives = 117/283 (41%), Gaps = 26/283 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
RR AG +Y +L + +L E + + G+I PHAGY YSG+ A
Sbjct: 3 RRPVVAGKFYPGTPNQLTSMCEAFLGSEKPKNRLEKPLGLILPHAGYIYSGKTAGLGIKK 62
Query: 64 -IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++ S + +LGP+H Y ++ + P G + ++ + EE+ GK D
Sbjct: 63 AVEFGKPSNIIILGPNHTGYGASVSVWKEGEWFVPNGSVRVNSIIAEEI-IDGKIIEEDE 121
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD-PSNFFS 181
EHS+E+ LP L F G +IVPI + K ++ P
Sbjct: 122 SAHLYEHSIEVQLPLLIHAF-GEF-QIVPICMMDQRLTKARFIAERIRKILEKYPDTLVV 179
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SSDF HYD + K D++ + I + D + + L +YD T+CG
Sbjct: 180 ASSDFN---------HYDPHELTMKK-----DEIAITRIISNDLEGLYEDLKKYDITMCG 225
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
P++V + N + + + S++ +V YAS
Sbjct: 226 PGPVAVVRSLFENA------ELVYHTTSAEFSQDFSYTVGYAS 262
>gi|325958574|ref|YP_004290040.1| hypothetical protein Metbo_0817 [Methanobacterium sp. AL-21]
gi|325330006|gb|ADZ09068.1| UPF0103 protein [Methanobacterium sp. AL-21]
Length = 279
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 132/296 (44%), Gaps = 37/296 (12%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-PEV------RGVIAPHAGYSYSGRA 56
IR+ + AG +Y + L ++++ + P S P++ +GV+ PHAGY YSG
Sbjct: 2 IRKPAVAGIFYEKDPDMLRKQIEWCFKHNLGPGSIPDIGNERTIKGVVTPHAGYIYSGPV 61
Query: 57 AAYAFGNIDPTSISRVFL-LGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
AA+++ +I F+ L P+H + ++TP+G++ +D E +L
Sbjct: 62 AAHSYHDIAEDGFPETFIVLCPNHTGLGSGVSTMNQGSWETPLGNVEID-ETFADL-LIE 119
Query: 116 KFELMDICVDE--AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
K +MD D EHS E+ LP+L + K VP+ + + + G A+
Sbjct: 120 KTGIMDSNPDAHLREHSAEVQLPFLQ--YLDPDFKFVPVTMWMQDLQTSLEIGVSIAQAA 177
Query: 174 D--DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
D S+DF H+ + D + + ++I+A+D+ M
Sbjct: 178 KELDRDVIVIASTDFTHYQPKKQAYMQDMQ---VIEAIKAMDEKRM-----------MNV 223
Query: 232 LLEYDNTICGRHPIS---VFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
+ E + T+CG P++ V + +G S+++K +Y S S V+YAS
Sbjct: 224 VAEQNVTMCGYGPVAATLVAVKQMGANSSELK----KYATSGDTTGDNSSVVAYAS 275
>gi|116620880|ref|YP_823036.1| hypothetical protein Acid_1761 [Candidatus Solibacter usitatus
Ellin6076]
gi|116224042|gb|ABJ82751.1| protein of unknown function DUF52 [Candidatus Solibacter usitatus
Ellin6076]
Length = 404
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 130/302 (43%), Gaps = 34/302 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLRE-AGLPKSPEVRG---VIAPHAGYSYSGRAAAY 59
+R +HAGS Y AEE++ +RE G + PE G + APH ++
Sbjct: 111 VREPAHAGSAYPGE----AEEVNAMMREYMGGVEPPEPDGLFAIAAPHVSPEGGWQSYRA 166
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG--KF 117
A+G + P R F++ + HY P+ T ++TP+GD D +++ L G
Sbjct: 167 AYGMLRPEHKDRTFVVLATSHYGEPEKFGLTRKAFRTPLGDARSDTALVDWLAKHGGAGV 226
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN--------AENEAMYGRLF 169
++ D C EH++E+ + +L + G +KI+P+L+G E++ F
Sbjct: 227 KMEDYC-HSFEHTVELQVIFLQHML-GADVKILPVLIGGFAHSIYEGGMPEDDDKVRAFF 284
Query: 170 -----AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD 224
+ + F+ + D H G+R+ + +++ + D+ ++ I D
Sbjct: 285 DALSELREREGDKLFWVMGVDMAHMGARYQDSFAARAGEGEMEAVASRDEQRIERINALD 344
Query: 225 PDAFKKYLLEYDNTI--CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSY 282
F + + E + + CG P FL + K + + L YE Q +S VS+
Sbjct: 345 AGGFWELVRERQDDLKWCGSSPFYTFL----KTAPKARGELLSYE---QWNIDENSVVSF 397
Query: 283 AS 284
A
Sbjct: 398 AG 399
>gi|322691725|ref|YP_004221295.1| hypothetical protein BLLJ_1536 [Bifidobacterium longum subsp.
longum JCM 1217]
gi|320456581|dbj|BAJ67203.1| conserved hypothetical protein [Bifidobacterium longum subsp.
longum JCM 1217]
Length = 596
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 119/282 (42%), Gaps = 64/282 (22%)
Query: 4 IRRASHAGSWYT--------------DNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAG 49
IR ++ AGS+Y D ++L ++L+ L A +P R VI PHAG
Sbjct: 48 IRPSAVAGSFYPADRTALKQLINQQLDYGRKLLQQLEPTL-PAWVP-----RAVIVPHAG 101
Query: 50 YSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD--- 104
Y YSG AAA A+ ++ S++R ++GP+H A STA ++TP+G +P+D
Sbjct: 102 YIYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAA 161
Query: 105 ------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIV 150
L V E L+ A +++ EH++E+ +P+L V G + IV
Sbjct: 162 ERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPDLTIV 220
Query: 151 PILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
P+ G + G + P +SSD H+ H+
Sbjct: 221 PLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------HEVAR 263
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
ALD + D +L + CG +PI+ L +L
Sbjct: 264 ALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 298
>gi|163785595|ref|ZP_02180153.1| hypothetical protein HG1285_07278 [Hydrogenivirga sp. 128-5-R1-1]
gi|159879135|gb|EDP73081.1| hypothetical protein HG1285_07278 [Hydrogenivirga sp. 128-5-R1-1]
Length = 250
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/252 (25%), Positives = 107/252 (42%), Gaps = 33/252 (13%)
Query: 41 RGVIAPHAGYSYSGRAAAYA---FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTP 97
VI+PHAGY YSG+ A Y+ F N+D +V L+GPSH + ++TP
Sbjct: 22 EAVISPHAGYIYSGQTATYSYKQFLNLDKNKHYKVLLIGPSHFVPFNGISFGYYDYWETP 81
Query: 98 IGDLPLDLEVIEE--LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
+G + +D E IE+ +K + + EHS+E+ +P+L V E I+P++ G
Sbjct: 82 LGKVKVDKEEIEKFAMKNPDLPITFNTIPHQREHSLEVQVPFLQMVLED--FSIIPVVYG 139
Query: 156 AVNAENEAMYGRLFAKYVDDPSNFFSV-SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDK 214
++ + ++ A D + V S+D H+ +++ +DK
Sbjct: 140 QIDFR---IVEKIIADIKGDRDDVVVVISTDLSHYYP--------------YETARQIDK 182
Query: 215 MGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
+ +E D +K CG ++ + K K K L Y S
Sbjct: 183 YCLLAVEEEDVSYLQK------CEACGEIGMAAIIDWASKA--KWKSKVLDYRTSGDTAG 234
Query: 275 KRDSSVSYASAA 286
+D+ V Y SA
Sbjct: 235 NKDAVVGYLSAV 246
>gi|255513873|gb|EET90138.1| protein of unknown function DUF52 [Candidatus Micrarchaeum
acidiphilum ARMAN-2]
Length = 278
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 119/289 (41%), Gaps = 26/289 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAF 61
+R +G++Y LA+ ++ + A + V+ ++APHAGY YSG AAY++
Sbjct: 1 MRAMQFSGTFYASGKSELAKFIEEAVSHADVKAKHGSAVKAIVAPHAGYVYSGSIAAYSY 60
Query: 62 GNI----DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+ + +LGP+H A+S + TP G + D ++ + + + K+
Sbjct: 61 KALMEDREHGKPDTAIILGPNHTGLGEPIAVSMED-WATPFGTVKNDKKLSKAIIGSSKY 119
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
D EHS+E+ LP+L +F I G + + + + +
Sbjct: 120 ISEDESAHSEEHSIEVQLPFLKYLFPD--TAACFICQGDQSLDASILLSKAVIAAAKESG 177
Query: 178 NFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEY 235
+V SSDF HY+ K D + IE D D F +LE
Sbjct: 178 KRIAVIASSDF---------NHYEPASAAKEK-----DSRLLAAIEKLDSDEFNSSVLET 223
Query: 236 DNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
++ CG PI+V + + K LRY S + S V+Y++
Sbjct: 224 GDSACGIGPITVAME-YSKANGVPKATLLRYGNSGEATMDYSSVVAYSA 271
>gi|78778042|ref|YP_394357.1| hypothetical protein Suden_1848 [Sulfurimonas denitrificans DSM
1251]
gi|78498582|gb|ABB45122.1| Protein of unknown function DUF52 [Sulfurimonas denitrificans DSM
1251]
Length = 262
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 68/143 (47%), Gaps = 5/143 (3%)
Query: 5 RRASHAGSWYTDNS---KRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
R S GS+Y + +R E E + + R VI PHAGY YSG +A A+
Sbjct: 3 REMSVVGSFYPARAVELERYFEHFSTTYDEENILPDIKSRVVIVPHAGYIYSGYSANVAY 62
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ + + + ++GPSH +L + Y+TP G +P L+++EEL T F L
Sbjct: 63 RVLKKSGVKKFLVIGPSHRVGFEGISLGDFSSYETPFGAIPASLDLVEELSNT--FLLSC 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFEG 144
EHS E+ P++ EG
Sbjct: 121 YRDTHFEHSTEVQFPFIKYYIEG 143
>gi|288560176|ref|YP_003423662.1| phosphomevalonate decarboxylase [Methanobrevibacter ruminantium M1]
gi|288542886|gb|ADC46770.1| phosphomevalonate decarboxylase [Methanobrevibacter ruminantium M1]
Length = 282
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 122/296 (41%), Gaps = 30/296 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEEL-DGWLREAG---------LPKSPEVRGVIAPHAGYSYS 53
IR A AG++Y +N K L E + D + G + K EV ++ PHAGY YS
Sbjct: 2 IREAVVAGTFYENNVKYLRESIEDCFTHRLGPGEIPKLSRVNKEKEVNAIMVPHAGYVYS 61
Query: 54 GRAAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
G AA+A+ + F ++ P+H + ++ + P G +D E+ +
Sbjct: 62 GPTAAHAYSKLVQDGYPETFVIIAPNHTGFGEHVSIFNEGSWIVPNGVADVDDELANAII 121
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
+ D EHS+E+ +P L K F+ + KIVPI + + + +
Sbjct: 122 NQSNYAKADFLAHRNEHSIEVQIPLL-KYFDSNF-KIVPICMMDQSPQASIDLANSIFEA 179
Query: 173 VDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKK 230
+D + ++ S+D H+ S+ K+IE D + + + G+ + +
Sbjct: 180 ANDLNRKITLIDSTDLSHFKSQ-------------EKTIEH-DNLVFNEVINGNTEGLYQ 225
Query: 231 YLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+ + T+CG P V + + K + L++ S S V Y S
Sbjct: 226 VVKKEQITMCGYGPTMVSMEYCKKLNQK-NFELLQHSTSGDITLDYASVVGYGSGV 280
>gi|88813479|ref|ZP_01128714.1| hypothetical protein NB231_00440 [Nitrococcus mobilis Nb-231]
gi|88789269|gb|EAR20401.1| hypothetical protein NB231_00440 [Nitrococcus mobilis Nb-231]
Length = 264
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 122/287 (42%), Gaps = 33/287 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWL-REAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
K+R + A +Y ++ L L + R P +P + +I PHAGY +SG AA A+
Sbjct: 4 KVREPAVAELFYPGEAESLRRLLGELMSRPVAEPHTP--KALIVPHAGYVFSGSTAAAAY 61
Query: 62 GNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
I P I RV LLGP+H A T + TP+G++ LD + L
Sbjct: 62 ARIAPAHARIERVVLLGPAHRVRFAGLAAHTGDAFLTPLGEVALDGSALGRLADLAYVSF 121
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF 179
D E EHS+E+ +P+L + + ++VP+ VG +AE A + +
Sbjct: 122 SDKAHAE-EHSLEVQVPFLQYLLDD--FRLVPLAVGDASAEAVAA---VLERLWGGAETL 175
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
+S+D H+ +D +++ LD + I P+A Y +
Sbjct: 176 VVISTDLSHY--------HD------YETARRLDNRTAENILGKRPEAIG-----YQDA- 215
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
CGR P+ L + +++I+ LR S R V Y + A
Sbjct: 216 CGRTPLVGLLEL--ARRRELRIEQLRLCSSGDTAGNRQRVVGYGAFA 260
>gi|119897971|ref|YP_933184.1| hypothetical protein azo1680 [Azoarcus sp. BH72]
gi|119670384|emb|CAL94297.1| conserved hypothetical protein [Azoarcus sp. BH72]
Length = 274
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 94/200 (47%), Gaps = 25/200 (12%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPHAGYSYSGRAAA 58
+ +R A+ AG +Y + L ++ G + + +P + + ++ PHAGY YSG AA
Sbjct: 3 IASVRPAAVAGQFYPADDLVLRTQI-GEMLSSAVPLNVVYAPKAIVVPHAGYIYSGPVAA 61
Query: 59 YAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
AF + P ISRV LLGP+H AL A + TP+G++ L
Sbjct: 62 SAFAAVAPLRREISRVVLLGPAHRMAVDGFALPAAQSFATPLGEV--------RLSRPDW 113
Query: 117 FELMD---ICVDEA----EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF 169
L D + VD+ EH +E+ LP+L V E ++VP+LVG ++E+ A L
Sbjct: 114 LTLQDHPGVVVDDRPHAFEHCLEVQLPFLQTVLES--FELVPLLVGNASSEDTAA---LL 168
Query: 170 AKYVDDPSNFFSVSSDFCHW 189
+SSD H+
Sbjct: 169 ETLWGGHETLVVISSDLSHY 188
>gi|218886422|ref|YP_002435743.1| hypothetical protein DvMF_1326 [Desulfovibrio vulgaris str.
'Miyazaki F']
gi|218757376|gb|ACL08275.1| protein of unknown function DUF52 [Desulfovibrio vulgaris str.
'Miyazaki F']
Length = 282
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 99/238 (41%), Gaps = 26/238 (10%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID-PTS 68
AG +Y D+ L + L A +P + ++ PHAGY YSG A G + P +
Sbjct: 19 AGMFYPDDPDLLRHTVREHLAAAPREAAPTLLAMV-PHAGYVYSGEVAGQTLGAANLPDT 77
Query: 69 ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAE 128
I LLGP+H ++ + TP+G +P+D E+ + L + D E
Sbjct: 78 I---VLLGPNHTGLGAPLSVWPGGHWHTPLGPVPVDEELADTLAESDAGFTRDTAAHMRE 134
Query: 129 HSMEMHLPYLAKVFEGHLIKIVPILVG-----AVNAENEAMYGRLFAKYVDDPSNFFSVS 183
HS+E+ LP+L + + IVP+ V A+ +A+ G L VS
Sbjct: 135 HSIEVVLPFLQEYTPD--LTIVPVAVAEPHPQALREAAQALAGALRRLEAAGRRVAMVVS 192
Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
SD H+ V H LD++ + IE DP + + ++CG
Sbjct: 193 SDMSHY--------------VTHADAARLDRLALARIEALDPLGLYAIVRDRGISMCG 236
>gi|347732159|ref|ZP_08865241.1| memo-like family protein [Desulfovibrio sp. A2]
gi|347518985|gb|EGY26148.1| memo-like family protein [Desulfovibrio sp. A2]
Length = 282
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 117/283 (41%), Gaps = 33/283 (11%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID-PTS 68
AG +Y D+ L E+ L A +P + ++ PHAGY YSG A G + P +
Sbjct: 19 AGMFYPDDPDLLRHEVRQHLAAAARDAAPTLLAMV-PHAGYVYSGEVAGQTLGAANLPDT 77
Query: 69 ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAE 128
I LLGP+H ++ + TP+G +P+D E+ + L + D E
Sbjct: 78 I---VLLGPNHTGLGAPLSVWPGGNWHTPLGPVPVDEELADALAESDAGFTRDTAAHVRE 134
Query: 129 HSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFA---KYVDDPSNFFS--VS 183
HS+E+ LP+L + + IVP+ V + + + A + V D + VS
Sbjct: 135 HSIEVILPFLQEYTPD--LSIVPVAVAEPHPQTLREAAQTLADVLRRVTDAGRRVAMVVS 192
Query: 184 SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRH 243
SD H+ V H D++ + IE DP + + ++CG
Sbjct: 193 SDMSHY--------------VTHAEAARRDRLALARIEALDPLGLYATVRDRGISMCGVL 238
Query: 244 PISVFLH---MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
P ++ L LG + ++ + Y S + + V YA
Sbjct: 239 PATLGLFAALALGAQTARV----VAYATSGEVSGDMERVVGYA 277
>gi|150021612|ref|YP_001306966.1| hypothetical protein Tmel_1742 [Thermosipho melanesiensis BI429]
gi|149794133|gb|ABR31581.1| protein of unknown function DUF52 [Thermosipho melanesiensis BI429]
Length = 262
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 130/292 (44%), Gaps = 50/292 (17%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAA------A 58
R + A +Y + ++L + + + + L S G+I PHAGY YSG+ A +
Sbjct: 3 RTMTFADKFYPKSPEKLKKFFEVYAEKVKLEFSNNNFGMILPHAGYVYSGKTALLTLKES 62
Query: 59 YAFGNIDPTSISRVFLLGPSHH-YYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
+ FG + RV +LG +H C++ ++TP G++ +D ++ E+L KF
Sbjct: 63 FRFGKPE-----RVIILGTNHTGTGIGICSVWERGEWETPFGNIAVDEKITEKLL---KF 114
Query: 118 ELM--DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRL--FAKYV 173
++ D EHS+E+ LP+L K + G IK+VPI+ N + R+ K +
Sbjct: 115 DIFVSDYAAHTMEHSIEVILPFL-KYYYGD-IKLVPIV---YNFPSHLYTMRIVEILKKL 169
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDK--KHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
+ S SSD H+ M D+ ++ +++E L K+ I
Sbjct: 170 ELKSTLIIASSDLNHYDPHDITMKKDEIFIENILKQNVEELFKLSKKI------------ 217
Query: 232 LLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
T CG PI+V L+ N +K + + S++ + +V YA
Sbjct: 218 ------TACGTGPIAVLLNYFSN------VKLVYHTTSAEFSNDYNFTVGYA 257
>gi|220905130|ref|YP_002480442.1| hypothetical protein Ddes_1868 [Desulfovibrio desulfuricans subsp.
desulfuricans str. ATCC 27774]
gi|219869429|gb|ACL49764.1| protein of unknown function DUF52 [Desulfovibrio desulfuricans
subsp. desulfuricans str. ATCC 27774]
Length = 298
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/312 (24%), Positives = 132/312 (42%), Gaps = 51/312 (16%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKS-----------PEVRGVIAPHAGYS 51
IR AG +Y ++++L +E+ WL G+P++ + G++ PHAGY
Sbjct: 2 SIRHPIAAGRFYPADAEQLKKEIRMWLMSGGVPENLVPGEIPREANARLLGLMLPHAGYV 61
Query: 52 YSGRA------AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDL 105
Y GR +A+ R+F+L P+H + T + TP+G + +D
Sbjct: 62 YCGRVIGGTLTSAWDGSTAGANLPERLFILCPNHTGQGRALGVWTGGHWLTPLGPVVVDE 121
Query: 106 EVIEEL-KATGKFELMDICVDEAEHSMEMHLPYLAKV----FEGHLIKIVPILVGAVNAE 160
++ + L +A G F ++C +EHS+E+ LP+L + G IVP+ VG + +
Sbjct: 122 KLGQALVQAAGGFFEDELC-HVSEHSIEVLLPFLQSLPLPRESGR--SIVPVCVGTRSPD 178
Query: 161 NEAMYGRLFAKYV-----DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
G A+ + + S VSSD HY + +HK D +
Sbjct: 179 ALRAAGLALARVIQAFEAEGHSVGIVVSSDL---------NHYQSQELTLHK-----DAL 224
Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQC 272
+ + DPD + T+CG +++ L H +G+ ++ + Y+ S+
Sbjct: 225 ALAQVLACDPDGLLTVVERESITMCGAGAMALALFTAHAMGSPWAELTL----YDTSAAA 280
Query: 273 KTKRDSSVSYAS 284
V YA
Sbjct: 281 SGDTSRVVGYAG 292
>gi|237757257|ref|ZP_04585659.1| conserved hypothetical protein [Sulfurihydrogenibium yellowstonense
SS-5]
gi|237690581|gb|EEP59787.1| conserved hypothetical protein [Sulfurihydrogenibium yellowstonense
SS-5]
Length = 121
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 63/112 (56%), Gaps = 6/112 (5%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGL-PKSPEVRGVIAPHAGYSYSGRAAAY 59
M +R+ + A +YT + +RL + +L +A L PE G++ PHAGY YSG AA
Sbjct: 1 MLTVRKPAVANMFYTGDKERLLYTIKNYLNKAPLYDYVPE--GIVVPHAGYMYSGPVAAV 58
Query: 60 AFG---NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
++ N+DP ++ L+GPSHH Y + ++TP+G + ++ E+I
Sbjct: 59 SYKQLLNLDPNKHYKILLIGPSHHVYFNGVSYGFYDYWETPLGKVKVNKEMI 110
>gi|325971983|ref|YP_004248174.1| hypothetical protein [Sphaerochaeta globus str. Buddy]
gi|324027221|gb|ADY13980.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Sphaerochaeta globus str. Buddy]
Length = 288
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 70/254 (27%), Positives = 111/254 (43%), Gaps = 23/254 (9%)
Query: 4 IRRASHAGSWYTDNSKRLA----EELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
IR+A AGSWY + + L E ++ L+ P R + PHAG YS +A
Sbjct: 8 IRKAHFAGSWYPADKQTLRALINESIEVVLKNTTYQHCP-YRFAVLPHAGLFYS-KAGIA 65
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATV--YKTPIGDLPLDLEVIEELKATGKF 117
F D +++ R+ +L PSH+ + AL +A + +TP+G L E A K+
Sbjct: 66 PFFAADLSNVQRLVVLAPSHYANLSQDALVSAPLDGIETPLGML----ESQNLSSAQAKY 121
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI----LVGAVNAENEAMYGRLFAKYV 173
++EH++EM LPY+A + + + + AV +A+ L + +
Sbjct: 122 ----FSAIQSEHALEMVLPYIASLSRPPTVTLALVSHFSQPQAVKTIADALVAELGEQEI 177
Query: 174 DDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
SSDF H+G RF Y Y + G K E + + E +AF +
Sbjct: 178 QAGRTALIASSDFTHYGPRFGYTPY--QSGAPLKVREDDLALSHLLCEGRIEEAF-AFCS 234
Query: 234 EYDNTICGRHPISV 247
+T+CG P V
Sbjct: 235 SKRSTVCGYAPAMV 248
>gi|11499891|ref|NP_071135.1| hypothetical protein AF2310 [Archaeoglobus fulgidus DSM 4304]
gi|7388528|sp|O27974.1|Y2310_ARCFU RecName: Full=MEMO1 family protein AF_2310
gi|2648211|gb|AAB88947.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
Length = 261
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 112/250 (44%), Gaps = 28/250 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLP-KSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R AGS+Y N E L LRE P K V ++PHAGY YSGR A
Sbjct: 1 MRHPRVAGSFYPANP----ESLLAMLREYTYPAKDESVIACVSPHAGYVYSGRTAG-KVH 55
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++ P + + V ++GP+H Y A+ST T + TP+G++ +D E +E + D
Sbjct: 56 SLLPDAETFV-IVGPNHTGYGLPVAVSTDT-WLTPLGEVEVDTEFVEAMPKI--ITAPDE 111
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDPSNFF 180
EHS+E+ +P+L + + KIVPI +G + E E L A+
Sbjct: 112 IAHRYEHSLEVQVPFLQYLHDD--FKIVPICLGMQDEETAMEVAEEILTAERETGRKVVV 169
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSD H+ + + LD + +D I + D + + + ++C
Sbjct: 170 IASSDMHHY--------------LPDEECRRLDSIVIDAILSMDVKKYYETIYRLQASVC 215
Query: 241 GRHPISVFLH 250
G I+V ++
Sbjct: 216 GYGCIAVAMY 225
>gi|389843600|ref|YP_006345680.1| dioxygenase [Mesotoga prima MesG1.Ag.4.2]
gi|387858346|gb|AFK06437.1| putative dioxygenase [Mesotoga prima MesG1.Ag.4.2]
Length = 275
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 93/215 (43%), Gaps = 25/215 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELD--------------GWLREAGLPKSPEVRGVIAPHAG 49
IRR +G +Y DN + L ++ GW + G G+I PHAG
Sbjct: 2 IRRPVVSGKFYADNEEELKSQIRNCFFHPVGPGSLPHGWEKNEG------PIGLIVPHAG 55
Query: 50 YSYSGRAAAYAFGNIDP-TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
Y SG AA+++ S V ++GP+H K +L ++TP+G + +D E
Sbjct: 56 YMASGPVAAWSYWRAGALVKPSTVLIVGPNHTGLGTKLSLWLEGAWETPLGSVEIDHEFG 115
Query: 109 EELKATGKFELM-DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
L A +M D EHS+E+ LP+L ++ KIVP+++ E
Sbjct: 116 RSLMAESNGMIMPDTSAHLYEHSIEVQLPFLQFLYSD--FKIVPLIMTDQRVEVALSLSE 173
Query: 168 LFAKYVDDPSNFFSV-SSDFCHWGSRFNYMHYDKK 201
+ K + ++ + SSD H+ + M D +
Sbjct: 174 IIKKELSRRNDVLVIASSDLNHYEAHEVTMEKDNE 208
>gi|225849677|ref|YP_002729911.1| hypothetical protein PERMA_0113 [Persephonella marina EX-H1]
gi|225645192|gb|ACO03378.1| conserved hypothetical protein [Persephonella marina EX-H1]
Length = 269
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 97/195 (49%), Gaps = 18/195 (9%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPK-SPEVRGVIAPHAGYSYSGRAAAYA-- 60
+R + + ++Y N + L + + +L A L PE V++PHAGY YSG AA
Sbjct: 5 VREPAVSNAFYPGNPEELRDLIRHYLSMAPLYDIKPE--AVVSPHAGYIYSGEVAAVGYK 62
Query: 61 -FGNIDPTSISRVFLLGPSHHYYTPKCALSTA--TVYKTPIGDLPLDLEVIEEL--KATG 115
F N+D + L+GPSH Y P +S ++TP+G++ ++ +VIE+ +
Sbjct: 63 QFLNLDRDKHYNILLIGPSH--YVPFAGISFGYYDFWETPLGEVRVNKKVIEDFVYRNPD 120
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
++ EHS+E+ +P+L V + I+P++ G V+ + + R+ DD
Sbjct: 121 IPVTLNTLPHSKEHSLEVQVPFLQVVLDN--FSIIPVIYGHVDYK---VLERVIDFIKDD 175
Query: 176 PSNFFSV-SSDFCHW 189
+ V SSD H+
Sbjct: 176 RDDVIVVISSDLSHY 190
>gi|251772877|gb|EES53436.1| conserved hypothetical protein [Leptospirillum ferrodiazotrophum]
Length = 269
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 92/199 (46%), Gaps = 15/199 (7%)
Query: 1 MEKIRRASHAGSWYTDNS----KRLAEELDGWLREAGLPK-SPE-VRGVIAPHAGYSYSG 54
M R + AG++Y ++ +R+A LD LP+ P+ +R ++ PH +G
Sbjct: 1 MTPERPPALAGNFYPEDPLELERRIAALLDSP-PPGNLPRLDPDRIRAIVVPHGDLLRAG 59
Query: 55 --RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
+A AY +R+ LLGP H + L T ++ P G P+D I +L
Sbjct: 60 EVQAIAYRLLQKRKKPPARILLLGPLHTGSSYGIVLPTHPSFRIPTGAFPVDRTTIRQLA 119
Query: 113 ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
A + D E EHS+E LP+L ++ G I IVP VG + ++ R+
Sbjct: 120 AFAETLFSDES-HEFEHSLETQLPFLMGIWGG--IPIVP--VGYSDITAHVLF-RVLEPL 173
Query: 173 VDDPSNFFSVSSDFCHWGS 191
++DP S+DF H+ S
Sbjct: 174 MNDPETVTICSADFSHYFS 192
>gi|386812428|ref|ZP_10099653.1| conserved hypothetical protein [planctomycete KSU-1]
gi|386404698|dbj|GAB62534.1| conserved hypothetical protein [planctomycete KSU-1]
Length = 267
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 136/283 (48%), Gaps = 25/283 (8%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR + AGS+Y+ ++ +L +++ + + G K P V G+I+PHAGY YSGR A +
Sbjct: 2 IREPAVAGSFYSGDAIQLKRDIEDFSIK-GCEKQP-VLGIISPHAGYIYSGRVAGNLYSR 59
Query: 64 ID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I+ P + V +L P+H Y ++ + TP+G++ +D EV+EEL D
Sbjct: 60 IEIPNT---VVILAPNHTGYGVPYSVWPDGSWNTPLGNVKVDEEVVEELVQASDLIKKDK 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD--PSNFF 180
+EH+ E+ +P++ + +KIV I++ + + N G+ ++ + P
Sbjct: 117 EAHFSEHAAEVQIPFIQ--YFNPSVKIVVIVISSGDIINLKDIGKNLSRVLQKLRPDALV 174
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSD H+ S+ + DK IH+ I +L++ + + + K++ T+C
Sbjct: 175 VASSDMTHYESQTSANRKDKI--AIHE-ILSLNESNLYY------EVYSKHI-----TMC 220
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
G +P L + K + +RYE S + V YA
Sbjct: 221 GIYPTIAMLVCSKDRGAK-NAELVRYETSGDITGDYNQVVGYA 262
>gi|386346075|ref|YP_006044324.1| hypothetical protein [Spirochaeta thermophila DSM 6578]
gi|339411042|gb|AEJ60607.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Spirochaeta thermophila DSM 6578]
Length = 293
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 84/196 (42%), Gaps = 7/196 (3%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR-GVIAPHAGYSYSGRAAAYA 60
E +R G +Y D + L LD L P +I PHA + Y G A
Sbjct: 13 ELVREPIVEGIFYPDAADTLEARLDALLSRPSPESPPPRPFALIVPHATWDYVGDLLGAA 72
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
F ++ P RV L GP+H P L +T++ TP+G L LDL +E A+
Sbjct: 73 FAHVLPWREHFRRVVLWGPTHREPFPTLFLPGSTLFATPLGPLALDLPATQEALASSTAF 132
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA-VNAENEAMYGRLFAKYVDD-P 176
+D EH +E+ P+L + I+P+LVGA + E + L+ Y +
Sbjct: 133 AVDDLSHMEEHCLEVVFPFLRSILPH--APILPVLVGAPLLTLAETLARALYLIYATEWE 190
Query: 177 SNFFSVSSDFCHWGSR 192
F VS++ + R
Sbjct: 191 HTLFVVSTNLTPFAPR 206
>gi|307718050|ref|YP_003873582.1| dioxygenase [Spirochaeta thermophila DSM 6192]
gi|306531775|gb|ADN01309.1| putative dioxygenase [Spirochaeta thermophila DSM 6192]
Length = 293
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 84/196 (42%), Gaps = 7/196 (3%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR-GVIAPHAGYSYSGRAAAYA 60
E +R G +Y D + L LD L P +I PHA + Y G A
Sbjct: 13 ELVREPIVEGIFYPDAADALEARLDALLSRPSPESPPPRPFALIVPHATWDYVGGLLGTA 72
Query: 61 FGNIDP--TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
F ++ P RV L GP+H P L +T++ TP+G L LDL +E A+
Sbjct: 73 FAHVVPWREHFRRVVLWGPTHREPFPSLFLPASTLFATPLGPLSLDLPATQEALASSTAF 132
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGA-VNAENEAMYGRLFAKYVDD-P 176
+D EH +E+ P+L + I+P+LVGA + E + L+ Y +
Sbjct: 133 AVDDLSHMEEHCLEVVFPFLRSILPH--APILPVLVGAPLFTLVETLARALYLIYAAEWE 190
Query: 177 SNFFSVSSDFCHWGSR 192
F VS++ + R
Sbjct: 191 HTLFVVSTNLTPFAPR 206
>gi|150399448|ref|YP_001323215.1| hypothetical protein Mevan_0697 [Methanococcus vannielii SB]
gi|166228835|sp|A6UQ31.1|Y697_METVS RecName: Full=MEMO1 family protein Mevan_0697
gi|150012151|gb|ABR54603.1| protein of unknown function DUF52 [Methanococcus vannielii SB]
Length = 284
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/249 (26%), Positives = 106/249 (42%), Gaps = 22/249 (8%)
Query: 42 GVIAPHAGYSYSGRAAAYAFGNIDPTSISRV--FLLGPSHHYYTPKCALSTATVYKTPIG 99
GVI+PHAGY YSG AA+++ I + ++GP+H + + ++KTP+G
Sbjct: 48 GVISPHAGYVYSGPIAAHSYKEISKKVSGNITAVIIGPNHSGMGSVVS-TMEGIWKTPLG 106
Query: 100 DLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA--KVFEGHLIKIVPILVGAV 157
+L +D E E L +D EHS+E+ LP+L ++ K VPI +
Sbjct: 107 NLEIDNEFSERLWKECDIIDLDETAHLKEHSIEVQLPFLKHLELLNIAKFKFVPISMSLQ 166
Query: 158 NAENEAMYGRLFAKYVDDPSN--FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
+ + G + AK + + SSDF H+ KK +I K
Sbjct: 167 DYDTAVGVGYMVAKVAKELNRKIIIIASSDFSHYEPE---EIASKKDAIIIK-------- 215
Query: 216 GMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTK 275
DI++ + + F ++ + T+CG PI + + K + K L Y S
Sbjct: 216 --DILKMEEEEIFTD-VVTNNVTMCGYGPIIAMIKAMKVLGAK-ESKLLSYSTSGDVTKD 271
Query: 276 RDSSVSYAS 284
V Y +
Sbjct: 272 YSEVVGYGA 280
>gi|307106373|gb|EFN54619.1| hypothetical protein CHLNCDRAFT_135146 [Chlorella variabilis]
Length = 337
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 85/202 (42%), Gaps = 33/202 (16%)
Query: 72 VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSM 131
V +LG +H P LS+A ++TP+G PL+ E+ +L A G +D EHS+
Sbjct: 112 VLILGTNHFTALPPACLSSAAAWRTPLGAAPLEPELCRQLAAAGL--PLDDAPHNLEHSI 169
Query: 132 EMHLPYLAKVF-------EGHLIKIVPILVGAVNAENEAMYGRL-------FAKYVDDPS 177
E LP+L V I P+ VG + +EA GRL A+Y
Sbjct: 170 ENQLPFLQHVLGSPGPSPGARPFAIAPVCVGWLG--SEAAAGRLARQLAAAVARYEQRRQ 227
Query: 178 N--FFSVSSDFCHWGSRFN------YMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFK 229
+SDF H G + +M ++H V D + I G P F
Sbjct: 228 RQVLLIATSDFTHGGPSYGEAPAAPWMSL-REHCVWR------DSPLLHAICGGSPRGFL 280
Query: 230 KYLLEYDNTICGRHPISVFLHM 251
+ ++CGR P++VF+ +
Sbjct: 281 QACDLVGASLCGRWPVAVFMQV 302
>gi|197121665|ref|YP_002133616.1| hypothetical protein AnaeK_1254 [Anaeromyxobacter sp. K]
gi|196171514|gb|ACG72487.1| protein of unknown function DUF52 [Anaeromyxobacter sp. K]
Length = 265
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 67/138 (48%), Gaps = 6/138 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R A+ AG++Y + LA E+DGWL V+ PHAGY YSG A F
Sbjct: 2 VREAAVAGAFYDARAATLAAEVDGWLSAGA--APAPALAVMVPHAGYVYSGAVAGATFAR 59
Query: 64 IDPTSISRVFLLGPSHHYYT-PKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ +R +LGP+H AL A ++TP+G +P+D E + A D
Sbjct: 60 V--AVPARAIVLGPNHTGLGHAGAALWPAGAWRTPLGAVPVDPE-LTAALAAAPGVAADR 116
Query: 123 CVDEAEHSMEMHLPYLAK 140
EHS+E+ +P+L +
Sbjct: 117 LAHLREHSLEVEVPFLQR 134
>gi|254456740|ref|ZP_05070168.1| conserved hypothetical protein [Sulfurimonas gotlandica GD1]
gi|373867875|ref|ZP_09604273.1| protein containing DUF52 [Sulfurimonas gotlandica GD1]
gi|207085532|gb|EDZ62816.1| conserved hypothetical protein [Sulfurimonas gotlandica GD1]
gi|372469976|gb|EHP30180.1| protein containing DUF52 [Sulfurimonas gotlandica GD1]
Length = 261
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 96/222 (43%), Gaps = 16/222 (7%)
Query: 5 RRASHAGSWYTDNS---KRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
R S AGS+Y +R E + E + R VI PHAGY YSG A A+
Sbjct: 3 RMMSVAGSFYPAREVEIERYFEHFNTVYEENFTLPDVKSRAVIVPHAGYIYSGYTANIAY 62
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMD 121
+ + + + ++GPSH ++ T Y+TP DL ++V + L+ KF L
Sbjct: 63 RVLQRSDVRKFVVIGPSHRVAFNGVSMCDFTTYETPFEDLKAAIDVAQRLRE--KFSLTR 120
Query: 122 ICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
EHS E+ P++ F KIV ++ A NE F +D
Sbjct: 121 FENAHQEHSTEVQFPFIK--FYMPDAKIVELVYSGAGA-NEISNIIDFVLSQNDCGVI-- 175
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETG 223
+S+D H+ +N +K + +EA+DK+ + TG
Sbjct: 176 ISTDLSHF---YNLEDANKLDNI---CLEAVDKLDTQKLHTG 211
>gi|220916461|ref|YP_002491765.1| hypothetical protein A2cp1_1355 [Anaeromyxobacter dehalogenans
2CP-1]
gi|219954315|gb|ACL64699.1| protein of unknown function DUF52 [Anaeromyxobacter dehalogenans
2CP-1]
Length = 265
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 67/138 (48%), Gaps = 6/138 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R + AG++Y + LA E+DGWL V+ PHAGY YSG A F
Sbjct: 2 VREPAVAGAFYDARAATLAAEVDGWLSAGA--APAPALAVMVPHAGYVYSGAVAGATFAR 59
Query: 64 IDPTSISRVFLLGPSHHYYT-PKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
+ +RV +LGP+H AL A ++TP+G +P+D E + A D
Sbjct: 60 V--ALPARVIVLGPNHTGLGHAGAALWPAGAWRTPLGSVPVDPE-LTAALAAAPGVAADR 116
Query: 123 CVDEAEHSMEMHLPYLAK 140
EHS+E+ +P+L +
Sbjct: 117 LAHLREHSLEVEVPFLQR 134
>gi|303249056|ref|ZP_07335299.1| protein of unknown function DUF52 [Desulfovibrio fructosovorans JJ]
gi|302489521|gb|EFL49463.1| protein of unknown function DUF52 [Desulfovibrio fructosovorans JJ]
Length = 401
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 108/279 (38%), Gaps = 31/279 (11%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPHAGYSYSGRAAAY 59
+ R AG Y DN + L LD L A P S + VIAPH A
Sbjct: 108 QDTREPLFAGPAYPDNPELLVPYLDAILAGAPEASPGSGRILAVIAPHIDPEAGKAGYAA 167
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
A+ + S RV +LG H L T Y TP+GD+P D + L+ G
Sbjct: 168 AYAPLRGLSPKRVVVLGVGHQILRGLYCL-TDKAYGTPLGDIPADAAAVARLRRAGG--- 223
Query: 120 MDICVDEA------EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
CVD EHS+E +L V G +VP+L G+ + A+ F Y
Sbjct: 224 --ACVDPGDLQHRDEHSVEFQAVFLRHVLAGDF-SLVPVLCGSPRSVLPALSRSAFLDYA 280
Query: 174 D-----------DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIET 222
DP F DF H G +F +DK + ++ A D + +
Sbjct: 281 GPFLDALRGMAADPDTLFVAGVDFSHIGPKFG---HDKPALELEQAAMAHDAALLAALAA 337
Query: 223 GDPDAFKKYLLEYDN--TICGRHPISVFLHMLGNCSTKI 259
GDP+AF D+ +CG ++ ++ C+ +
Sbjct: 338 GDPEAFWAESARVDDGYNVCGLASLATLAEIVPACALTV 376
>gi|145524770|ref|XP_001448212.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415746|emb|CAK80815.1| unnamed protein product [Paramecium tetraurelia]
Length = 79
Score = 65.5 bits (158), Expect = 2e-08, Method: Composition-based stats.
Identities = 32/73 (43%), Positives = 47/73 (64%), Gaps = 4/73 (5%)
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM----LGNCSTKIKIKFLRYEQSSQC 272
M+ IE D D F YL EY+N +CG+H I++ LH+ + + ++ KF+RY QS
Sbjct: 1 MEHIELHDLDKFNDYLREYENNVCGKHCIAILLHVICIAMSQNTHMMETKFIRYAQSCLV 60
Query: 273 KTKRDSSVSYASA 285
+ K+DSSVSYA+A
Sbjct: 61 RDKKDSSVSYAAA 73
>gi|440302594|gb|ELP94901.1| hypothetical protein EIN_249450 [Entamoeba invadens IP1]
Length = 284
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 101/246 (41%), Gaps = 26/246 (10%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA--GLPK-SPEVRGVIAPHAGYSYSGRAAAYAF 61
R + G W+ ++ L E++G++++A LPK +V G +APHAG+ YSG A ++
Sbjct: 7 RHTAGDGRWFQSSTNGLKNEVNGYIQDAYKALPKLEGKVLGSVAPHAGFRYSGPTAGFSV 66
Query: 62 GNIDPTSISR-----VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK 116
+ S + VF+LG SH + A+ K+PI +D + I
Sbjct: 67 AALLKDSETNGKPDVVFILGFSHSAHFEYAAIMDGAAIKSPICTSKIDTDAINIFMKDRP 126
Query: 117 FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP 176
+ EHS E +P++ F K+V +L+G A+ +
Sbjct: 127 NMKLKYNPHNGEHSAENEIPFVQVAFPE--TKVVMVLIGTHKAQVFKEVSDGLLEVSKTR 184
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
+ SSD H FN +E D++ D+ E D D K Y+
Sbjct: 185 KMYVVASSDMLH-DEDFNL-------------VEKTDRVTADLTENVDFDGLLKN-WSYE 229
Query: 237 NTI-CG 241
N I CG
Sbjct: 230 NQIFCG 235
>gi|119620876|gb|EAX00471.1| chromosome 2 open reading frame 4, isoform CRA_c [Homo sapiens]
Length = 82
Score = 65.5 bits (158), Expect = 3e-08, Method: Composition-based stats.
Identities = 32/74 (43%), Positives = 43/74 (58%), Gaps = 12/74 (16%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAP-----------HAGYSYS 53
R ASHAGSWYT + +L +L+GWL + K P R +IAP HAGY+Y
Sbjct: 8 REASHAGSWYTASGPQLNAQLEGWLSQVQSTKRP-ARAIIAPRDGVLPCWPGCHAGYTYC 66
Query: 54 GRAAAYAFGNIDPT 67
G AA+A+ +DP+
Sbjct: 67 GSCAAHAYKQVDPS 80
>gi|406880605|gb|EKD28916.1| Ammecr1 protein [uncultured bacterium]
Length = 300
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 6/105 (5%)
Query: 182 VSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
VSSDF H+GSRF + ++ I ++I+ +D IET D F Y E TICG
Sbjct: 2 VSSDFTHFGSRFGFKPFNSN---IKENIKNIDFKAFKYIETLDQKGFMDYCHESSITICG 58
Query: 242 RHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
++PIS+ L ML + K+ L+Y S + ++ VSY S
Sbjct: 59 KNPISILLAML---NKDFKVTLLKYSTSGELTGDYNNCVSYISCV 100
>gi|118576793|ref|YP_876536.1| dioxygenase [Cenarchaeum symbiosum A]
gi|118195314|gb|ABK78232.1| dioxygenase [Cenarchaeum symbiosum A]
Length = 273
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/256 (28%), Positives = 103/256 (40%), Gaps = 28/256 (10%)
Query: 35 PKSPEVRGVIA---PHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTA 91
P PE+ GVI PHAGY YSG A + +I TS + GP+H A A
Sbjct: 36 PDPPELDGVIGMICPHAGYRYSGPVACHGLLSIRHTSPRLFVMAGPNHWGLGLGIAGIGA 95
Query: 92 TVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVP 151
+ TP G + D EL+ G E D EHS+E+ +P L + F I+P
Sbjct: 96 CRWITPAGYVETDDAGSVELERCGIKE--DFFAHSKEHSLEVIVPMLQEFF--GEFGILP 151
Query: 152 ILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEA 211
IL+ E A G A+ + SSD H+ S N +++ + IEA
Sbjct: 152 ILLSEQGEEQAAKVGGAMARAAKGRDSMLIGSSDLTHYES--NAFAHEQDGAL----IEA 205
Query: 212 LDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQ 268
+ + MD ++ F L E + CG I+ + LG K L+Y
Sbjct: 206 I--LSMDTVK------FYAVLRERKVSACGYGAIAATMAASIELG----ATKGTLLKYAT 253
Query: 269 SSQCKTKRDSSVSYAS 284
S S V Y S
Sbjct: 254 SGDITGDEQSVVGYCS 269
>gi|291280086|ref|YP_003496921.1| hypothetical protein DEFDS_1709 [Deferribacter desulfuricans SSM1]
gi|290754788|dbj|BAI81165.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1]
Length = 262
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 104/247 (42%), Gaps = 29/247 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R+A+ G +Y D EEL + + + I PHAGY YSG+ A +
Sbjct: 2 LRKAAVKGYFYPDKK----EELIDFFEKNKTESKIDAFLAIVPHAGYIYSGKTAVKTLSS 57
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDIC 123
++ +V L+GP+H + + AL ++TP G + +D E+ +L + E DI
Sbjct: 58 LNLKK--KVLLIGPNHTGFGERVALYPDGEWETPFGFVRVDSELNRKLSLIPEIE-EDII 114
Query: 124 VDEAEHSMEMHLP---YLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
EHS+E+ LP Y K I ++P+ +Y K + D
Sbjct: 115 AHVREHSLEVILPILHYFKKDITFSAITMMPLKYNQCVKLASDIY-----KQIKDEDLNI 169
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+SSDF H+ ++ + D M ++ I + D + + D ++C
Sbjct: 170 IISSDFNHYEDA--------------ETTDKKDLMAIEKILSLDTKGLYDTVFDNDISMC 215
Query: 241 GRHPISV 247
G +P V
Sbjct: 216 GIYPAIV 222
>gi|408381879|ref|ZP_11179426.1| hypothetical protein A994_05481 [Methanobacterium formicicum DSM
3637]
gi|407815327|gb|EKF85912.1| hypothetical protein A994_05481 [Methanobacterium formicicum DSM
3637]
Length = 280
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 72/298 (24%), Positives = 125/298 (41%), Gaps = 32/298 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGW-----LREAGLP----KSPEVRGVIAPHAGYSYSG 54
IR+ + AG +Y + L + + W L LP ++G+IAPHAGY YSG
Sbjct: 2 IRKPAVAGMFYESDEDSLKKRIK-WCYQHKLGPGRLPGKIGSKRSIKGLIAPHAGYVYSG 60
Query: 55 RAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA ++ + + V +L P+H + T + TP+G++ +D + EL
Sbjct: 61 PVAACSYLELAEDGMPETVVILCPNHTGIGSGLSTMTEGSWLTPLGEVEIDGQFASELLN 120
Query: 114 TGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV 173
D EHS E+ LP+L ++ ++VP+ + + E G
Sbjct: 121 YYPLLDDDPSAHLNEHSCEVQLPFLQEISPD--FQLVPVCMMMQDLETSRELGEAITHTA 178
Query: 174 D--DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKY 231
S+DF H+ +D++ + ++I A+D++ M
Sbjct: 179 QKLKRDTVVIASTDFTHYQPHDVARAHDEE---VLEAIAAMDELEM-----------MNQ 224
Query: 232 LLEYDNTICGRHPISVFLHM-LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAK 288
+ +++ T+CG P++ + G +T+ I L+Y S S V YASA K
Sbjct: 225 IQKFNVTMCGYGPVAATIEASRGMGATEATI--LKYATSGDTGGDYTSVVGYASAIFK 280
>gi|383762695|ref|YP_005441677.1| hypothetical protein CLDAP_17400 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381382963|dbj|BAL99779.1| hypothetical protein CLDAP_17400 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 411
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 108/272 (39%), Gaps = 26/272 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP----EVRGVIAPHAGYSYSGRAAAYA 60
R + AG+ Y D+ K L L +L + + P G+++PH Y GR A+
Sbjct: 113 RPPALAGAGYPDDEKALWSLLQDYLEASDVAADPIDWSAAIGLLSPHIDYPRGGRVYAHV 172
Query: 61 FGNIDPTSISR--VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
+ + V LLG H+ P T Y TP G LP + +++ L A E
Sbjct: 173 WKRAAQAAREAELVILLGVDHYGADPFTL--TRQNYATPYGVLPTETSIVDRLAAVIGEE 230
Query: 119 --LMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAV-----NAENEAMYGRLFA- 170
EHS+E+ +L + EG + +VPIL G+ N + A + A
Sbjct: 231 AAFAGELRHRGEHSLELVAIWLHHLREGRPVPVVPILTGSFYPFMRNGDTPAADPTIGAV 290
Query: 171 -----KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDP 225
K + S D H G F D +++ A D+ ++ + GD
Sbjct: 291 LDVLRKAMAGRRTLIVASGDLAHVGPAFGGAPLDATG---RRTLRAADEELIEAMRRGDA 347
Query: 226 DAFKKYL--LEYDNTICGRHPISVFLHMLGNC 255
F + ++ N +CG PI + L +LG
Sbjct: 348 TGFFNAIRRVQNRNNVCGVAPIYLTLRLLGQS 379
>gi|46581301|ref|YP_012109.1| hypothetical protein DVU2897 [Desulfovibrio vulgaris str.
Hildenborough]
gi|387154520|ref|YP_005703456.1| hypothetical protein Deval_2676 [Desulfovibrio vulgaris RCH1]
gi|46450722|gb|AAS97369.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
Hildenborough]
gi|311234964|gb|ADP87818.1| protein of unknown function DUF52 [Desulfovibrio vulgaris RCH1]
Length = 329
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 99/255 (38%), Gaps = 32/255 (12%)
Query: 43 VIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP 102
V+ PHAGY YSGR A + + VF+LGP+H A+ + TP+G +P
Sbjct: 94 VMLPHAGYVYSGRVAGRTLSQVRLAPV--VFMLGPNHTGRGAPLAVWPEGDWLTPLGSVP 151
Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENE 162
+ L + E EHS+E+ LP L + I+P+ V +A
Sbjct: 152 VHERAAAALLDKDGGYTANRTAHEGEHSLEVLLPLLQVRHPA--LSIIPVAVSEQDAGAL 209
Query: 163 AMYGRLFAKYVDD------PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
G A+ + + PS+ +SSD H+ +R E D +
Sbjct: 210 QRAGASLARTMQELAAAGVPSSIV-LSSDMSHYVTRTQ--------------AEERDALA 254
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH---MLGNCSTKIKIKFLRYEQSSQCK 273
+ + DP+ + T+CG P V LH LG S + Y+ S+
Sbjct: 255 LGRMAALDPEGLYATVRHNRITMCGVLPAVVALHACRALGAESACLAA----YDTSASAS 310
Query: 274 TKRDSSVSYASAAAK 288
+ V YA AK
Sbjct: 311 GDHERVVGYAGMVAK 325
>gi|332295079|ref|YP_004437002.1| hypothetical protein Thena_0222 [Thermodesulfobium narugense DSM
14796]
gi|332178182|gb|AEE13871.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thermodesulfobium narugense DSM 14796]
Length = 264
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 24/211 (11%)
Query: 34 LPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTAT 92
+PK V+GV+ PHAGY++SG A + +I+ P + L+GP H + LS T
Sbjct: 29 IPKR-RVKGVVVPHAGYNFSGSIAGKVYSSIECPDTF---LLIGPKHSMESDGIFLS-QT 83
Query: 93 VYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPI 152
+ TP+G++ D ++ E L +F ++ + EHS+E+ +P++ V KIVP+
Sbjct: 84 SWATPLGEVMPDRDLGESLLHHCEFIHLNERIHANEHSLEVQVPFIKYVCPK--AKIVPV 141
Query: 153 LVGAVNAENEAMYGRLFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIE 210
V + + G+ A + + + V SSD H + + D+K + K++
Sbjct: 142 AVSTTSEGILSSTGKCIANVLKESNKSVVVVMSSDLNHHEPQEITLEKDEK---VIKNLM 198
Query: 211 ALDKMGMDIIETGDPDAFKKYLLEYDNTICG 241
+L DP K + E D ++CG
Sbjct: 199 SL-----------DPRGLLKTVYEEDVSMCG 218
>gi|224372878|ref|YP_002607250.1| protein of unknown function DUF52 [Nautilia profundicola AmH]
gi|223589628|gb|ACM93364.1| protein of unknown function DUF52 [Nautilia profundicola AmH]
Length = 263
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 83/183 (45%), Gaps = 24/183 (13%)
Query: 41 RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
+ +I PHAG+ YSG A +A+ T+ + ++GPSH + + + Y+TP G+
Sbjct: 46 KALIVPHAGWMYSGFTANFAYRIAQNTTPKAIAVIGPSHKFAFEGISTTLENEYETPCGN 105
Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
LP+D EL F++ ++ EHS E+ +P++ F +P++
Sbjct: 106 LPIDTATAYEL--INNFDVQNLEYVHVEHSTEVQMPFIKHYFNN-----IPVI------- 151
Query: 161 NEAMYGRLFAKYVDDPSNFF-------SVSSDFCHWGS--RFNYMHYDKKHGVIHKSIEA 211
E +Y K + + N+ +SSD H+ N + Y+ V ++ +
Sbjct: 152 -ELIYSNYSPKKLKEIINYLIQKDILVVISSDLSHYYDIKTANALDYNCLEAVNNQDVLQ 210
Query: 212 LDK 214
L+K
Sbjct: 211 LEK 213
>gi|120601519|ref|YP_965919.1| hypothetical protein Dvul_0469 [Desulfovibrio vulgaris DP4]
gi|120561748|gb|ABM27492.1| protein of unknown function DUF52 [Desulfovibrio vulgaris DP4]
Length = 329
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 99/255 (38%), Gaps = 32/255 (12%)
Query: 43 VIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLP 102
V+ PHAGY YSGR A + + VF+LGP+H A+ + TP+G +P
Sbjct: 94 VMLPHAGYVYSGRVAGRTLSQVRLAPV--VFMLGPNHTGRGAPLAVWPEGDWLTPLGSVP 151
Query: 103 LDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENE 162
+ L + E EHS+E+ LP L + I+P+ V +A
Sbjct: 152 VHERAAAALLDKDGGYTANRTAHEGEHSLEVLLPLLQ--VRHPALSIIPVAVSEQDAGAL 209
Query: 163 AMYGRLFAKYVDD------PSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
G A+ + + PS+ +SSD H+ +R E D +
Sbjct: 210 QRAGASLARTMQELAAAGVPSSIV-LSSDMSHYVTRTQ--------------AEERDALA 254
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH---MLGNCSTKIKIKFLRYEQSSQCK 273
+ + DP+ + T+CG P V LH LG S + Y+ S+
Sbjct: 255 LGRMAALDPEGLYATVRHNRITMCGVLPAVVALHACRALGAESACLAA----YDTSASAS 310
Query: 274 TKRDSSVSYASAAAK 288
+ V YA AK
Sbjct: 311 GDHERVVGYAGMVAK 325
>gi|156743567|ref|YP_001433696.1| hypothetical protein Rcas_3629 [Roseiflexus castenholzii DSM 13941]
gi|156234895|gb|ABU59678.1| protein of unknown function DUF52 [Roseiflexus castenholzii DSM
13941]
Length = 405
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 109/285 (38%), Gaps = 61/285 (21%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAG--LPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
R + AG Y + L LD ++ G P P RGV++PH Y+ GR A +
Sbjct: 114 RPPALAGQSYPADPAELRRLLDDFIAAVGPVAPAPPTGRGVLSPHIDYARGGRVYAQVWQ 173
Query: 63 NIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ V L+G H Y+P+ T Y TP+G LP D V++ L A
Sbjct: 174 RAAEMVRAAEIVLLIGTDH--YSPEPVTLTRQRYATPLGVLPTDTSVVDALAAA------ 225
Query: 121 DICVDEA---------EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAK 171
I D A EHS+E+ +L + ++PIL G+ FA+
Sbjct: 226 -IGEDAAFAGELYHRVEHSLELVAVWLQYIRGDAPCPVIPILAGS------------FAR 272
Query: 172 YV--DDPSN--------------------FFSVSSDFCHWGSRFNYMHYDKKHGVIHKSI 209
Y+ DDP+ +S D H G F +++
Sbjct: 273 YMDGDDPATDPRFEALITALRRIIASRHAVVIISGDMSHVGPAFGGAPLSNAD---KEAL 329
Query: 210 EALDKMGMDIIETGDPDAFKKYLLEYDN--TICGRHPISVFLHML 252
D++ +D + GD F + + E + ICG P + L ++
Sbjct: 330 RRADELVIDRMRAGDAAGFFRVIAETGDRQNICGLPPTYLALRLM 374
>gi|258404346|ref|YP_003197088.1| hypothetical protein Dret_0208 [Desulfohalobium retbaense DSM 5692]
gi|257796573|gb|ACV67510.1| protein of unknown function DUF52 [Desulfohalobium retbaense DSM
5692]
Length = 268
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 115/287 (40%), Gaps = 44/287 (15%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID-PTS 68
AG +Y +++ +L +A P+SP + ++ PHAGY +SG A G P +
Sbjct: 8 AGQFYPGTKTAWEQQVRTYLAQATAPQSPSLMAMV-PHAGYPFSGPVAGKTLGAASLPET 66
Query: 69 ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEA- 127
+ LLGP+H AL + + P L ++ E+L I DEA
Sbjct: 67 L---LLLGPNHTGLGKALALWSRGAWDIPGASLAIE----EKLAGAIVHGCPHIASDEAA 119
Query: 128 ---EHSMEMHLPYLAKVFEGHLIKIVPILVGA-------VNAEN-EAMYGRLFAKYVDDP 176
EHS+E+ LP+L + +IVP+ V AE+ + GR
Sbjct: 120 HLREHSLEVILPFLWAL--NPRTRIVPLAVAEHHLPTLLETAEHLHTVLGR------QQQ 171
Query: 177 SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYD 236
VSSD H+ S + KH LD M + I DP+ +
Sbjct: 172 QVGIVVSSDMSHFISE-----QEAKH---------LDDMALQRILDLDPEGLYNTVRNNR 217
Query: 237 NTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
++CG P+++ L ++ + K + Y S+ R V YA
Sbjct: 218 ISMCGVLPMTLGLQLVKRMGA-VSAKLVDYATSADATGDRSQVVGYA 263
>gi|288931699|ref|YP_003435759.1| hypothetical protein Ferp_1332 [Ferroglobus placidus DSM 10642]
gi|288893947|gb|ADC65484.1| protein of unknown function DUF52 [Ferroglobus placidus DSM 10642]
Length = 261
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 125/289 (43%), Gaps = 31/289 (10%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPE-VRGVIAPHAGYSYSGRAAAYAFG 62
+R + AG +Y N E L LRE +S E V +++PHAGY YSGR A A
Sbjct: 1 MRHPAVAGYFYPSNP----ESLLAMLREFAEKRSDEDVVAIVSPHAGYVYSGRTAG-AVH 55
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++ P + ++GP+H A+S + TP+G + D E I+ + K ++D
Sbjct: 56 SLLP-DVETYVIVGPNHTGLGAAVAVSYDD-WMTPLGVVESDREFIDAMPK--KICVVDE 111
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAEN--EAMYGRLFAKYVDDPSNFF 180
EHS+E+ +P+L + + KIV I +G + E E + L A+
Sbjct: 112 IAHREEHSLEVQVPFLQYLHKD--FKIVAICMGLQDKETAEEVVEEILEAQEKTGRRIAL 169
Query: 181 SVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
SSD H+ + D++ ++ I + D D + + + ++C
Sbjct: 170 VASSDMHHY--------------LPDDECRRRDEIVIEAILSMDVDKYYDTIYKLQASVC 215
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
G PI+V ML + K + + Y S K V YA A +V
Sbjct: 216 GYGPIAVA--MLYSKELGAKAELVHYSTSGDVADK-SQVVGYAGIAFRV 261
>gi|322689777|ref|YP_004209511.1| hypothetical protein BLIF_1595 [Bifidobacterium longum subsp.
infantis 157F]
gi|320461113|dbj|BAJ71733.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis 157F]
Length = 500
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 95/226 (42%), Gaps = 44/226 (19%)
Query: 46 PHAGYSYSGRAAAYAFGNIDPT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
PHAGY YSG AAA A+ ++ S++R ++GP+H A STA ++TP+G +P+
Sbjct: 2 PHAGYIYSGTAAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPV 61
Query: 104 D---------LEVIEELK--------ATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL 146
D L V E L+ A +++ EH++E+ +P+L V G
Sbjct: 62 DIAAERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL-GPD 120
Query: 147 IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH 206
+ IVP+ G + G + P +SSD H+ H
Sbjct: 121 LTIVPLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHYHP--------------H 163
Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHML 252
+ ALD + D +L + CG +PI+ L +L
Sbjct: 164 EVARALDDQTIA-------DIAALHLPIHPRRACGAYPINGLLDVL 202
>gi|430742361|ref|YP_007201490.1| dioxygenase [Singulisphaera acidiphila DSM 18658]
gi|430014081|gb|AGA25795.1| putative dioxygenase [Singulisphaera acidiphila DSM 18658]
Length = 413
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 133/313 (42%), Gaps = 38/313 (12%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREA---GLPKSPE-------VRGVIAPHAGYS 51
E++R A+ AG Y + L +L + A G+P+ PE +RGV++PH +
Sbjct: 106 EQVRPAALAGRSYAGTERALRSQLARYFAHADGSGVPR-PEPQTLPSRLRGVLSPHIDFQ 164
Query: 52 YSGRAAAYAFGNIDPTSISRVF-LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
G +++ + S + F +LG +H Y + AL T ++TP+G + D E ++
Sbjct: 165 RGGLVYTWSYKELVEQSDADTFVILGVAHQYCRNRFAL-TRKDFETPLGRVRTDREYVDR 223
Query: 111 LKATGKFELM-DICVDEAEHSMEMHLPYLAKVFEG-HLIKIVPILVGA----VNAENEAM 164
+ A +L D EHS+E + +L + G IVPILVG+ ++A + +
Sbjct: 224 IAALAGQDLFEDELSHRTEHSIEFQVVFLQYLLGGVRDFSIVPILVGSFHDLMDAGTDPI 283
Query: 165 YGRLFAKYVD-----DPSNFFSVS----SDFCHWGSRFNYMHYDKKHGVIHKSIEALDKM 215
++V+ + +N V+ D CH G F D I + + D
Sbjct: 284 ESDDVRRFVEALRASEAANGRKVAYIGGIDLCHVGPEFG--DPDPLDPEILAEVRSFDTA 341
Query: 216 GMDIIETGDPDAFKKYLLEYDN--TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCK 273
+ D + E N +CG LH +G + L+Y+Q+
Sbjct: 342 MLARAAAHDAAGWFGTAAEIGNRWRVCGLAAAYTMLHAMGPARGTL----LKYDQA--VD 395
Query: 274 TKRDSSVSYASAA 286
+R VS+AS A
Sbjct: 396 HERTCCVSFASLA 408
>gi|126728359|ref|ZP_01744175.1| Predicted dioxygenase [Sagittula stellata E-37]
gi|126711324|gb|EBA10374.1| Predicted dioxygenase [Sagittula stellata E-37]
Length = 450
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 74/172 (43%), Gaps = 13/172 (7%)
Query: 33 GLPKSPEVR-GVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTA 91
G P +PE VI+PHAGY +SGR A A + + +L PSH + A +
Sbjct: 33 GAPTAPEPPVAVISPHAGYRFSGRLTARALATTREAAPKSIAVLSPSHRHAFDGIAAPSQ 92
Query: 92 TVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVP 151
+ P G +D+ + A G + D D+ EH +E+ LP L + + ++P
Sbjct: 93 DAFALPTGTQRIDIATRAAMVAAGLIHVEDAAHDQ-EHGVEVQLPVLHALHPD--VPVLP 149
Query: 152 ILVGAVNAENEAMYGRLFAKYVD--DPSNFFSVSSDFCHWGSRFNYMHYDKK 201
+++G + A VD +SSD H+ +R + D +
Sbjct: 150 LVIGRTGNDR-------VAALVDALPEGTLIVLSSDLSHFLTRDDARAKDAR 194
>gi|116747526|ref|YP_844213.1| hypothetical protein Sfum_0075 [Syntrophobacter fumaroxidans MPOB]
gi|116696590|gb|ABK15778.1| protein of unknown function DUF52 [Syntrophobacter fumaroxidans
MPOB]
Length = 414
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 113/272 (41%), Gaps = 29/272 (10%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSP-------EVRGVIAPHAGYSYS 53
+ +RR HAG Y +++ L L+G+ E G P P V G++APH
Sbjct: 109 DPVRRMRHAGQSYPSDAELLNRRLEGFFSAEDGGPGLPGAARDNRPVLGLVAPHIDIQAG 168
Query: 54 GRAAAYAF-GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK 112
GR A+A+ D S +LG H + AL TA ++TP+G + D E L
Sbjct: 169 GRCFAHAYKAAADSVSPRTWIVLGTGHELVSNYFAL-TAKDFETPLGLVGHDEECCAHLV 227
Query: 113 ATGKFELMDICVDE----AEHSMEMHLPYLAKVFEGHLIKIVPILVGAV--NAENEAMYG 166
+ K DI E EH++E +LA V G KIVP+L + E + Y
Sbjct: 228 NSAK---RDILAGEYNHVREHTVEFQAVFLAYVQPG--AKIVPLLCSFSHEDLETDGEYI 282
Query: 167 RLFAKYVDDPSNFFSV----SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIET 222
FA + D SV S D H G R+ V K A D+ ++ +
Sbjct: 283 DHFAGLLRDLVLTRSVGILASVDLAHIGPRYGDRFQPTDSTV--KDHMASDRGLVESLRE 340
Query: 223 GDPDAFKKYLLEYDN--TICGRHPISVFLHML 252
D +AF + + N ICG P+ V L
Sbjct: 341 CDAEAFIRQIRLEGNRRKICGVAPLYVLAQAL 372
>gi|386391427|ref|ZP_10076208.1| putative dioxygenase [Desulfovibrio sp. U5L]
gi|385732305|gb|EIG52503.1| putative dioxygenase [Desulfovibrio sp. U5L]
Length = 406
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 123/311 (39%), Gaps = 48/311 (15%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPH-------AGY--SYS 53
+R A AG Y D L LD L A P + V VIAPH AGY +Y
Sbjct: 110 VREAVFAGQAYPDAPALLVPYLDAMLEAASARPAAGPVVAVIAPHIDPEAGKAGYGAAYG 169
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
AA A G P RV +LG H L T + TP+G++P D + L+
Sbjct: 170 ALRAALA-GRTPP---KRVLVLGVGHQVIKGLYCL-TDKAFATPLGEVPADGPAVAALRR 224
Query: 114 TGKFELMDICVDEA------EHSMEMHLPYLAKVFEGHLIKIVPILVGAV---------N 158
G+ VD A EHS+E +L V +VPIL G+
Sbjct: 225 AGQ-----ATVDPADLPHRSEHSVEFQAVFLRHVLGDAPFTLVPILCGSPLGALPRHSRQ 279
Query: 159 AENEAMYGRLFA--KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
A EA L A + DP D H G +F + + ++ A D
Sbjct: 280 AFREAAGPFLSALREMAADPDTLVVAGVDLFHIGPKFGHAEPAAA---LEEAALAHDTAL 336
Query: 217 MDIIETGDPDAF--KKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKT 274
+D + GD DAF + ++ +CG ++ +L + + L ++ + +T
Sbjct: 337 LDALARGDADAFWEESARVKDAYNVCGFTALATLAEILPPAT----MALLAHDVMREAET 392
Query: 275 KRDSSVSYASA 285
+ S+VS+A A
Sbjct: 393 R--SAVSFAGA 401
>gi|237808838|ref|YP_002893278.1| hypothetical protein Tola_2093 [Tolumonas auensis DSM 9187]
gi|237501099|gb|ACQ93692.1| protein of unknown function DUF52 [Tolumonas auensis DSM 9187]
Length = 258
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 121/299 (40%), Gaps = 56/299 (18%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV----RGVIAPHAGYSYSGRA 56
M+ I + + A +Y + RL + W++ + +PE R +I PHAGY +SG
Sbjct: 1 MQVIHQPAVADMFYPADPDRLRD----WMQH-HVKAAPETGRKPRMLILPHAGYRFSGAI 55
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTA-TVYKTPIGDLPLDLEVIEELKATG 115
AA + + RV +L P+H Y A+ TP G +PLD I L +
Sbjct: 56 AAQGYSLLQKGDFKRVIVLCPAHRVYLQGIAVPEHWDGEATPFGTIPLDKPAIASLLSLP 115
Query: 116 KFELMDICVDEA---EHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
I EA EH++E+ LP+L ++ ++P++VG AE +
Sbjct: 116 GV----IAATEAHRQEHAIEVQLPFLQ--YQLGDFSLIPLVVGDCPAET---VSHAIEQI 166
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
+ D VSSD H+ S+ DI E D D + +
Sbjct: 167 MTD-DTLIIVSSDLSHYLSK-------------------------DIAELQD-DYTIRQI 199
Query: 233 LEYDNTI-----CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
L++ T+ CG + ++ LH ++ IK L S ++ V YA+ A
Sbjct: 200 LQFSETLTGDQACGCYALNGALHWAHQ--QQLDIKLLAKCNSGDTSHDKERVVGYAAFA 256
>gi|337286681|ref|YP_004626154.1| hypothetical protein Thein_1323 [Thermodesulfatator indicus DSM
15286]
gi|335359509|gb|AEH45190.1| protein of unknown function DUF52 [Thermodesulfatator indicus DSM
15286]
Length = 267
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 86/189 (45%), Gaps = 10/189 (5%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
IR + AG +Y N + L +E++ +L PK P + G + PHAGY +SG A +
Sbjct: 2 IRMPAVAGRFYEANPELLRKEIEAYLDPTA-PKEPAI-GAVCPHAGYMFSGHVAGAVYSR 59
Query: 64 -IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
I P + V +LGP+H A+ ++ P G +P++ ++ + + D+
Sbjct: 60 LIIPDT---VVILGPNHTGLGHPAAIMAKGAWQMPFGTVPIEEKLAAFILQESQVLSHDV 116
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FF 180
EHS+E+ +P+L + + IVPI + + E G AK +
Sbjct: 117 EAHLYEHSLEVQVPFLQYLNPN--VAIVPICLSHLPYEALEDIGLAVAKGIAAYGGPVLI 174
Query: 181 SVSSDFCHW 189
S+D H+
Sbjct: 175 VASTDMSHY 183
>gi|403740291|ref|ZP_10952468.1| hypothetical protein AUCHE_22_00080 [Austwickia chelonae NBRC
105200]
gi|403190089|dbj|GAB79238.1| hypothetical protein AUCHE_22_00080 [Austwickia chelonae NBRC
105200]
Length = 271
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 74/153 (48%), Gaps = 9/153 (5%)
Query: 10 AGSWYTDNSKRLAEELDGWLREA----GLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNID 65
AG Y ++ L + + L E L P + V+ H+ Y + G +A A+ ++
Sbjct: 10 AGRHYPAEARELTDLMRSQLAEGRADMALDPEPPPKAVLVAHSAYVHCGHGSAAAYARVE 69
Query: 66 PT--SISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT-GKFELMDI 122
++ RV LLGP+ L T + TP+GD+P+D + + L A+ + +D
Sbjct: 70 TARGTVRRVVLLGPARQDPPRGITLPAWTTFTTPLGDVPVDQDAVRALTASMPEVVHVDD 129
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG 155
AE S+E+ LP+L +V G +VP+ VG
Sbjct: 130 GPHLAETSLEVQLPFLQEVL-GEF-TVVPLAVG 160
>gi|313682443|ref|YP_004060181.1| hypothetical protein Sulku_1318 [Sulfuricurvum kujiense DSM 16994]
gi|313155303|gb|ADR33981.1| protein of unknown function DUF52 [Sulfuricurvum kujiense DSM
16994]
Length = 266
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 54/98 (55%), Gaps = 2/98 (2%)
Query: 41 RGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
+I PHAG+ YSG A A+ ++ + V ++GPSH ++ T Y+TP+GD
Sbjct: 47 NAIIVPHAGWVYSGFTANVAYRILNNSEPKTVVVIGPSHRVGFDGVSICDLTHYQTPLGD 106
Query: 101 LPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYL 138
L +D ++ +EL+ +F L EHS E+ +P++
Sbjct: 107 LTIDPQLTKELRE--RFALPYFSEAHHEHSTEVQMPFI 142
>gi|193214202|ref|YP_001995401.1| hypothetical protein Ctha_0483 [Chloroherpeton thalassium ATCC
35110]
gi|193087679|gb|ACF12954.1| protein of unknown function DUF52 [Chloroherpeton thalassium ATCC
35110]
Length = 292
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 87/200 (43%), Gaps = 9/200 (4%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M+ IR + +D +++ L + LP+ ++ G + PHA + YSG+ AA
Sbjct: 10 MQFIREPQAMDLYPSDCGQQIFAFLKKYTPSTELPE--KLIGAVVPHAAWKYSGKVAART 67
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFEL 119
+ S + +L + H K L +KTP+G LP+ E+ L A K
Sbjct: 68 LHTLSERSSPDICVLLGADHIGLKKHTLLPDGTWKTPLGSLPIASELASALYSALPKILS 127
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYV--DDPS 177
D+ E EHS+E+ P +A + + I+PI++ V N G + + +
Sbjct: 128 NDMSAHELEHSLEVISPMIAYFWPK--LSILPIII--VPNANALEIGNALVQLLKASGKT 183
Query: 178 NFFSVSSDFCHWGSRFNYMH 197
F S+D H+G + H
Sbjct: 184 AIFVASTDLTHYGRFYGNAH 203
>gi|146332000|gb|ABQ22506.1| memo-like protein [Callithrix jacchus]
Length = 61
Score = 59.7 bits (143), Expect = 1e-06, Method: Composition-based stats.
Identities = 30/59 (50%), Positives = 36/59 (61%), Gaps = 3/59 (5%)
Query: 234 EYDNTICGRHPISVFLHM---LGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKV 289
+Y NTICGRHPI V L+ L + FL Y QSSQC+ +DSSVSYA+ A V
Sbjct: 2 KYHNTICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNWQDSSVSYAAGALTV 60
>gi|357634174|ref|ZP_09132052.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Desulfovibrio sp. FW1012B]
gi|357582728|gb|EHJ48061.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Desulfovibrio sp. FW1012B]
Length = 406
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 120/309 (38%), Gaps = 44/309 (14%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREA-GLPKSPEVRGVIAPH-------AGYSYSGR 55
+R A AG Y D L LD L A P + V VIAPH AGY +
Sbjct: 110 VRPAVFAGQAYPDAPALLVPYLDAMLEAASARPAAGPVVAVIAPHIDPEAGKAGYGAAYG 169
Query: 56 AAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATG 115
A A P RV +LG H L T V+ TP+G++P D + L+ G
Sbjct: 170 ALRAALDGRTPPK--RVVVLGVGHQVIKGLYCL-TDKVFATPLGEVPADGPAVAALRRAG 226
Query: 116 KFELMDICVDEA------EHSMEMHLPYLAKVFEGHLIKIVPILVGAV---------NAE 160
+ VD A EHS+E +L V +VPIL G+ A
Sbjct: 227 Q-----ATVDPADLPHRSEHSVEFQAVFLRHVLGDAPFTLVPILCGSPLGALPRHSRQAF 281
Query: 161 NEAMYGRLFA--KYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
EA L A + DP D H G +F + + + ++ A D +D
Sbjct: 282 REAAGPFLSALREMAADPDTLVVAGVDLFHIGPKFGHAEPAEA---LEEAALAHDTALLD 338
Query: 219 IIETGDPDAFKKYLLEYDNT--ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKR 276
+ GD DA + + +CG ++ +L + + L ++ + +T+
Sbjct: 339 ALARGDADAIWEESARVKDVYNVCGFTALATLAEILPPAT----MALLAHDVMREAETR- 393
Query: 277 DSSVSYASA 285
S+VS+A A
Sbjct: 394 -SAVSFAGA 401
>gi|337286172|ref|YP_004625645.1| hypothetical protein Thein_0803 [Thermodesulfatator indicus DSM
15286]
gi|335359000|gb|AEH44681.1| protein of unknown function DUF52 [Thermodesulfatator indicus DSM
15286]
Length = 390
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 89/199 (44%), Gaps = 18/199 (9%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVIAPHAGYSYSGRAAAYA 60
K+R +SHAG Y L L+ L PK P R +IAPH + + A A
Sbjct: 112 KVRPSSHAGHAYPLKPDELKTFLNQILNL--WPKRPNYNPRIIIAPHIDFRAGAQTFAAA 169
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM 120
+ + +RV +LG H TP +L+ ++TP+G + D E + EL +L
Sbjct: 170 YQGLSWPKGARVIVLGTGHFLETP-VSLAYKD-FETPLGLVKYDREFVAELSKKIDEDLR 227
Query: 121 D-ICVDEAEHSMEMHLPYLAKVFEGHLI---KIVPILVGAVNAENEAMYGRLFAKYVD-- 174
++EHS++ + VF HL+ +VPILV + + + +L D
Sbjct: 228 GHEWAHKSEHSIDFQV-----VFLKHLLGEFSLVPILVASPQG-HRNFFKKLAESLRDLL 281
Query: 175 DPSNFFSVSSDFCHWGSRF 193
D + V DFCH G R+
Sbjct: 282 DEKTYLVVGVDFCHLGLRY 300
>gi|406919694|gb|EKD57918.1| hypothetical protein ACD_57C00099G0002 [uncultured bacterium]
Length = 300
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 6/150 (4%)
Query: 40 VRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIG 99
V G I PH G + + + F I +I + L+GP+H ++ ++T G
Sbjct: 75 VTGAIIPHHGLA--AKYFSSFFNVISGKNIKTIILIGPNHKLVGDNPVYTSDLTWETEFG 132
Query: 100 DLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA 159
+ D E++++L TGK + D + E EHS+ +PY+AK G +K+VP++ +N
Sbjct: 133 QVNADYELVQKLTETGKVGIDDKII-EDEHSVATIMPYIAKYMPG--VKVVPLVCKEINL 189
Query: 160 ENEAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
+ + +P + DF H+
Sbjct: 190 NKIGELTEIITANL-EPGVIVISAVDFSHY 218
>gi|358340513|dbj|GAA48391.1| hypothetical protein CLF_101548 [Clonorchis sinensis]
Length = 256
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 71/163 (43%), Gaps = 20/163 (12%)
Query: 148 KIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHK 207
K+VP+ +G + + + R F Y+++P N F S+ CHWG + + I
Sbjct: 75 KVVPVYIGTLAHDQLTVVARRFRDYLEEPENAFIFSASLCHWGDIYGFSAQIPDAPTILD 134
Query: 208 SIEALDKMGMDIIETGDPDAFKKYLLE-----YDNTICG------RHPISVFLHMLGNCS 256
SI+A DK + + F ++LL+ DN I R ++V LH L +
Sbjct: 135 SIKAFDKRAIAAMRELRFRPFDEFLLDTKAKVLDNQIIALLLFMVRQLLNVNLHALISMD 194
Query: 257 TKIKI----KFLRYEQSSQC-----KTKRDSSVSYASAAAKVD 290
+ + K R++ Q T DS +SY SA ++D
Sbjct: 195 EEKVVNQMRKLARFKLMGQAWSYPDVTSADSCISYVSAVVELD 237
>gi|149390717|gb|ABR25376.1| unknown [Oryza sativa Indica Group]
Length = 53
Score = 58.2 bits (139), Expect = 4e-06, Method: Composition-based stats.
Identities = 38/45 (84%), Positives = 41/45 (91%)
Query: 246 SVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVD 290
SVFL ML +CSTKIKI F+RYEQSSQCK+ RDSSVSYASAAAKVD
Sbjct: 1 SVFLSMLKHCSTKIKIGFVRYEQSSQCKSMRDSSVSYASAAAKVD 45
>gi|41615137|ref|NP_963635.1| hypothetical protein NEQ347 [Nanoarchaeum equitans Kin4-M]
gi|40068861|gb|AAR39196.1| NEQ347 [Nanoarchaeum equitans Kin4-M]
Length = 266
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 60/236 (25%), Positives = 101/236 (42%), Gaps = 15/236 (6%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI 69
AG +Y N ++L L+ + GL K ++ I PHAGY YSG+A A + N D
Sbjct: 5 AGKFYPSNREQLIALLNELFK--GLKKEYSLKAGIVPHAGYIYSGKAMAEFWINSDRNKT 62
Query: 70 SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEH 129
+ +G +H +S + +TP+G +D E G F+ D EH
Sbjct: 63 YAI--IGTNHTGLGSLANVSLMPI-ETPLGIAKIDEEAAMIFMKNG-FDYDDRPF-LYEH 117
Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
S+E +P+L + + + + ++ E G+ A + + + SSDF H+
Sbjct: 118 SVENQIPFLQYLHGDNFLIVPSVMFNVYRFAKEV--GKQLALELPERVRLVA-SSDFTHY 174
Query: 190 GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPI 245
G + Y + + ++ LD + I D F K ++ T+CG PI
Sbjct: 175 GDIYGYKPFSDG-----RKVKELDMKLISYILKLDSLGFYKEIVRTGATVCGWGPI 225
>gi|256830987|ref|YP_003159715.1| hypothetical protein Dbac_3224 [Desulfomicrobium baculatum DSM
4028]
gi|256580163|gb|ACU91299.1| protein of unknown function DUF52 [Desulfomicrobium baculatum DSM
4028]
Length = 270
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 97/246 (39%), Gaps = 27/246 (10%)
Query: 44 IAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPL 103
+ PHAG+ +SG A T V LLGP+H A+ + P + +
Sbjct: 43 MVPHAGHVFSGGVAGQTLARAKLTDT--VLLLGPNHTGMGAPLAVWPDGKWLLPGAAMDV 100
Query: 104 DLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEA 163
D E+ L A D EHS+E+ LP+L + ++I+PI VG
Sbjct: 101 DAELAAALLAAEPALTADRVAHLQEHSLEVVLPFLWA--KNPQMRIIPIAVGDPRPHK-- 156
Query: 164 MYGRLFAKYVDDPSNF-----FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
G AK + S VSSD H+ S D+K VI D+ +D
Sbjct: 157 -LGGAAAKIAEALSALGREVSVVVSSDMNHFAS-------DEKTRVI-------DQHALD 201
Query: 219 IIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDS 278
I DP F + E + ++CG P+++ +H L N K + + Y S
Sbjct: 202 RILALDPMGFYGKVREENISMCGVLPMTLGMH-LANIQGAKKAELVAYATSGDVNGDMSR 260
Query: 279 SVSYAS 284
V YA
Sbjct: 261 VVGYAG 266
>gi|386347394|ref|YP_006045643.1| hypothetical protein [Spirochaeta thermophila DSM 6578]
gi|339412361|gb|AEJ61926.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Spirochaeta thermophila DSM 6578]
Length = 275
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 90/203 (44%), Gaps = 28/203 (13%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
K+R WY D + + EE+ GW R E LP +P V I PHAG+ +SG A
Sbjct: 2 KVRERMLPPGWYPDEKEEVLEEIAGWKRLERRLP-TPAV-ACIVPHAGWYFSGELAFMGI 59
Query: 62 GNIDPTSISRVFLLG----PSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
++DP + + G S Y P+ LS TP+GDLP++ ++E L+
Sbjct: 60 RSLDPEAEVVAVVGGHLSERSPLLYMPEDGLS------TPLGDLPVERPLVEALREHVPT 113
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLI---KIVPILVGAVNAENEAMYGRLFAKYVD 174
E D+ D +++E+ LP + F + + P+L E R+ Y
Sbjct: 114 E-PDVYPD---NTVEIQLPLVKYCFPHASVVGLRCPPLL------EVAERTARVLYDYGK 163
Query: 175 DPSNFFSV--SSDFCHWGSRFNY 195
+ V S+D H+G ++ +
Sbjct: 164 ETGKKLVVIGSTDLTHYGPQYGF 186
>gi|374814620|ref|ZP_09718357.1| hypothetical protein TpriZ_12205 [Treponema primitia ZAS-1]
Length = 283
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 76/161 (47%), Gaps = 12/161 (7%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRG--VIAPHAGYSYSGRAAAYA 60
K R +G +Y D+ E+ +R GL + R +IAPH + +SG AA A
Sbjct: 11 KGRSPVVSGLFYPDDRA----EMGTRIRALGLKRGVGGRASAIIAPHGAWDFSGSVAATA 66
Query: 61 FGNID--PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGK-F 117
F +SRV +LG H P S + + TP+G LP+D+E+ E L + F
Sbjct: 67 FSAAAGRANEVSRVVILGNVHRMEEPGVFFSDSHYFDTPLGRLPVDMELSESLASCSTLF 126
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN 158
E+ DI E ++E+ LP F G IVPIL+G
Sbjct: 127 EVNDIP-HLQEIAVEVLLPLTQFCFPGA--AIVPILMGGAQ 164
>gi|337288503|ref|YP_004627975.1| hypothetical protein TOPB45_0950 [Thermodesulfobacterium sp. OPB45]
gi|334902241|gb|AEH23047.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thermodesulfobacterium geofontis OPF15]
Length = 396
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 99/233 (42%), Gaps = 14/233 (6%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGN 63
+R +HA Y ++ ++ L+ S + +IAPH +A A ++
Sbjct: 111 LRPMAHANQAYPLSASEAKFFIEDILKLCNPDSSKPPKILIAPHIDIRAGAKAFAESYSR 170
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELM-DI 122
S SRV +LG HH P L+ TP G + D + L + K EL +
Sbjct: 171 FKIPSGSRVIILGVGHHLDLPWSVLTKDVA--TPFGVVKNDRGGLLYLSNSKKIELFPNH 228
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVD------DP 176
+ EHS+E + +L + + + ++P L+G + E L K+++ D
Sbjct: 229 IAHKLEHSIEFQVLFLHHLLKDEFV-VLPFLIGPLIVFFEEKTKELLEKFIEALSELIDD 287
Query: 177 SNFFSVSSDFCHWGSRF-NYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAF 228
+ + DFCH G R+ + + H I K +E DK +++ G+ + F
Sbjct: 288 RTYIVLGIDFCHLGLRYGDPFEVSETH--IKKILET-DKRLLELTFNGNSEEF 337
>gi|392411412|ref|YP_006448019.1| putative dioxygenase [Desulfomonile tiedjei DSM 6799]
gi|390624548|gb|AFM25755.1| putative dioxygenase [Desulfomonile tiedjei DSM 6799]
Length = 413
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/307 (24%), Positives = 128/307 (41%), Gaps = 35/307 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEV-------RGVIAPHAGYSYSGRA 56
+R SHAG+ Y + + L +L + G P S + G+IAPH G+
Sbjct: 112 VRPCSHAGASYEADPESLHRQLSSFFSGDGGPGSIQSGTDPRRPLGLIAPHIDVRSGGKC 171
Query: 57 AAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELK-ATG 115
A A+ + S ++++ + H +T ++TP+G + D E + L G
Sbjct: 172 FASAYHALAAGQPSDIYVIFGTGHSGVEGIFTATNLDFETPLGVVRTDREFLGHLTDRLG 231
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEG-HLIKIVPILVGAVNAENEAMYG------RL 168
+ + EH +E + +L +F H IVPIL +++ E G R
Sbjct: 232 HDPAAEEILHATEHVIEFQVIFLQHIFAARHPFTIVPILC-SLSPEMLHETGPFQDRKRK 290
Query: 169 FAKYVD---------DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDI 219
F + S F S+D H G R+ K G +++++E D+ +
Sbjct: 291 FDAFCSAVRSACQRIGKSVCFIASADLDHIGPRYGDGFIPHK-GTVNQTLEN-DRRMLGY 348
Query: 220 IETGDPDAFKKYLLEYDNT--ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
+E D F +++ T ICG PI+ H + +T+ ++ L Y Q ++
Sbjct: 349 LEALDLPGFVRHVSSDQETTRICGFSPIAAMFHCMD--ATEGRLLALDYAQVD----DKN 402
Query: 278 SSVSYAS 284
S VS+AS
Sbjct: 403 SFVSFAS 409
>gi|303325747|ref|ZP_07356190.1| hypothetical cytosolic protein [Desulfovibrio sp. 3_1_syn3]
gi|302863663|gb|EFL86594.1| hypothetical cytosolic protein [Desulfovibrio sp. 3_1_syn3]
Length = 298
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 68/310 (21%), Positives = 115/310 (37%), Gaps = 51/310 (16%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR---------------GVIAPHA 48
IR AG +Y L E+ L + P V+ G++ PHA
Sbjct: 3 IRNPVAAGRFYPAAPADLIREVRACLDAGAAIRRPGVQDTSAPAETEAVSRLWGLMLPHA 62
Query: 49 GYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE- 106
GY Y GR ++ P ++ +L P+H + + TP+ LP+D +
Sbjct: 63 GYVYCGRVLGTTLAGVELPRTL---VVLCPNHTGRGQALGVWPEGAWLTPLAPLPVDADL 119
Query: 107 ---VIEELKATGKFELMDICVDEAEHSMEMHLPYL-AKVFEGHLIKIVPILVG-----AV 157
+IE G F D EH++E+ LP+L E ++I P+ VG A+
Sbjct: 120 AAALIERGDGNGGFA-ADTLSHLGEHAVEVVLPFLQVAAGEERPLRITPVCVGTQQPEAL 178
Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
A +A+ L VSSD HY+ + + K D + +
Sbjct: 179 RAAGQALADVLAGCRSKGQEVGVIVSSDM---------NHYEDERRTVEK-----DALAL 224
Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKT 274
+ DP+ ++CG P+++ L LG + + + ++ S+
Sbjct: 225 ERALACDPEGLLAVAARERISMCGAGPLALALFAARQLG----RARAELTAHDTSATASG 280
Query: 275 KRDSSVSYAS 284
+ V YA
Sbjct: 281 DTEHVVGYAG 290
>gi|345892602|ref|ZP_08843421.1| hypothetical protein HMPREF1022_02081 [Desulfovibrio sp.
6_1_46AFAA]
gi|345047094|gb|EGW50963.1| hypothetical protein HMPREF1022_02081 [Desulfovibrio sp.
6_1_46AFAA]
Length = 298
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 68/310 (21%), Positives = 115/310 (37%), Gaps = 51/310 (16%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR---------------GVIAPHA 48
IR AG +Y L E+ L + P V+ G++ PHA
Sbjct: 3 IRNPVAAGRFYPAAPADLIREVRACLDAGAAIRRPGVQDASAPAETEAVSRPWGLMLPHA 62
Query: 49 GYSYSGRAAAYAFGNID-PTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE- 106
GY Y GR ++ P ++ +L P+H + + TP+ LP+D +
Sbjct: 63 GYVYCGRVLGTTLAGVELPRTL---VVLCPNHTGRGQALGVWPEGAWLTPLAPLPVDADL 119
Query: 107 ---VIEELKATGKFELMDICVDEAEHSMEMHLPYL-AKVFEGHLIKIVPILVG-----AV 157
+IE G F D EH++E+ LP+L E ++I P+ VG A+
Sbjct: 120 AAALIERGDGNGGFA-ADTLSHLGEHAVEVVLPFLQVAAGEERPLRITPVCVGTQQPEAL 178
Query: 158 NAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
A +A+ L VSSD HY+ + + K D + +
Sbjct: 179 RAAGQALADVLAGCRSKGQEVGVIVSSDM---------NHYEDERRTVEK-----DALAL 224
Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFL---HMLGNCSTKIKIKFLRYEQSSQCKT 274
+ DP+ ++CG P+++ L LG + + + ++ S+
Sbjct: 225 ERALACDPEGLLAVAARERISMCGAGPLALALFAARQLG----RARAELTAHDTSATASG 280
Query: 275 KRDSSVSYAS 284
+ V YA
Sbjct: 281 DTEHVVGYAG 290
>gi|307719287|ref|YP_003874819.1| dioxygenase [Spirochaeta thermophila DSM 6192]
gi|306533013|gb|ADN02547.1| putative dioxygenase [Spirochaeta thermophila DSM 6192]
Length = 299
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/201 (26%), Positives = 87/201 (43%), Gaps = 24/201 (11%)
Query: 3 KIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
K+R WY D + + EE+ GW R +P V I PHAG+ +SG A
Sbjct: 26 KVRERMLPPGWYPDEKEEVLEEIAGWKRLERRLSTPAV-ACIVPHAGWYFSGELAFMGIR 84
Query: 63 NIDPTSISRVFLLG----PSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
++DP + + G S Y P+ LS TP+GDLP++ ++E L+ E
Sbjct: 85 SLDPEAEVVAVVGGHLSERSPLLYMPEDGLS------TPLGDLPVERPLVETLREHVPTE 138
Query: 119 LMDICVDEAEHSMEMHLPYLAKVFEGHLI---KIVPIL-VGAVNAENEAMYGRLFAKYVD 174
D+ D +++E+ LP + F + + P+L V A YG+ K
Sbjct: 139 -PDVYPD---NTVEIQLPLVKYCFPHASVVGLRCPPLLEVAERTARVLYDYGKETGK--- 191
Query: 175 DPSNFFSVSSDFCHWGSRFNY 195
S+D H+G ++ +
Sbjct: 192 --KPVVIGSTDLTHYGPQYGF 210
>gi|94270265|ref|ZP_01291693.1| Protein of unknown function DUF52 [delta proteobacterium MLMS-1]
gi|93450875|gb|EAT01892.1| Protein of unknown function DUF52 [delta proteobacterium MLMS-1]
Length = 267
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 99/246 (40%), Gaps = 27/246 (10%)
Query: 43 VIAPHAGYSYSG--RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
V+ PHAGY +SG A A I P V +LGP+HH A+ ++ P G
Sbjct: 39 VVMPHAGYIFSGPVAGATVAAAQIPP----EVIVLGPNHHGLGATAAVMDQGAWEMPWGT 94
Query: 101 LPLDLEVIEE-LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA 159
+P++ + + L+ F+ D EHS+E+ +P+L + ++IVPI + +
Sbjct: 95 VPINASLAAKVLEHCPDFQ-ADELAHRREHSLEVLVPFLH--YRQPELQIVPICLSRSDY 151
Query: 160 E--NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
+ A G A + S+D H+ SR ++ D + +
Sbjct: 152 QFCQRAGAGLAAAIKAWPEPVLLAASTDMSHFESR--------------EATTTKDNLAI 197
Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
I DP + + E ++CG P + L + + + +RY S +
Sbjct: 198 GHILELDPAGLYRTVTEQRISMCGVIPTVISLIAALELGAR-RAELIRYADSGEASGDTA 256
Query: 278 SSVSYA 283
V YA
Sbjct: 257 QVVGYA 262
>gi|239908973|ref|YP_002955715.1| hypothetical protein DMR_43380 [Desulfovibrio magneticus RS-1]
gi|239798840|dbj|BAH77829.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length = 408
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 81/209 (38%), Gaps = 19/209 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA--GLPKSPEVRGVIAPH----AGYSYSGRAAA 58
R AG Y D LA LD L EA ++ VIAPH AG + A A
Sbjct: 113 RETVFAGQAYPDAGDELAAYLDAVLAEAPEACTAPGPIKAVIAPHIDPEAGRAAYAAAYA 172
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
I + RV +LG H L T + TP+GD+ D E + L+A G
Sbjct: 173 ALGAAIKAAAPKRVIVLGVGHQIIDGLFCL-TDKAFATPLGDVGADAEAVARLRAAGGRS 231
Query: 119 L-MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLF-------- 169
+ +AEHS+E +L + +VP+L G+ +A + F
Sbjct: 232 VDASTLPHKAEHSVEFQAVFLRHLLGDAPFAMVPVLCGSPAGVMDAPTRQAFRDCAGPFL 291
Query: 170 ---AKYVDDPSNFFSVSSDFCHWGSRFNY 195
A+ P DFCH G +F +
Sbjct: 292 DALAELASQPDTLIVAGVDFCHIGGKFGH 320
>gi|94265991|ref|ZP_01289714.1| Protein of unknown function DUF52 [delta proteobacterium MLMS-1]
gi|93453457|gb|EAT03873.1| Protein of unknown function DUF52 [delta proteobacterium MLMS-1]
Length = 267
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 99/246 (40%), Gaps = 27/246 (10%)
Query: 43 VIAPHAGYSYSG--RAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGD 100
V+ PHAGY +SG A A I P V +LGP+HH A+ ++ P G
Sbjct: 39 VVMPHAGYIFSGPVAGATVAAAQIPP----EVIVLGPNHHGLGATAAVMDQGAWEMPWGT 94
Query: 101 LPLDLEVIEE-LKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNA 159
+P++ + + L+ F+ D EHS+E+ +P+L + ++IVPI + +
Sbjct: 95 VPINASLAAKVLEHCPDFQ-ADELAHRREHSLEVLVPFLH--YRQPELQIVPICLSRSDY 151
Query: 160 E--NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGM 217
+ A G A + S+D H+ SR ++ D + +
Sbjct: 152 QFCQRAGAGLAAAIKAWPEPVLLAASTDMSHFESR--------------EATTTKDNLAI 197
Query: 218 DIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRD 277
I DP + + E ++CG P + L + + + +RY S +
Sbjct: 198 GHILELDPAGLYRTVTEQRISMCGVIPTVISLIAALELGAR-RAELIRYADSGEASGDTA 256
Query: 278 SSVSYA 283
V YA
Sbjct: 257 QVVGYA 262
>gi|328950699|ref|YP_004368034.1| hypothetical protein Marky_1184 [Marinithermus hydrothermalis DSM
14884]
gi|328451023|gb|AEB11924.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Marinithermus hydrothermalis DSM 14884]
Length = 381
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 98/215 (45%), Gaps = 32/215 (14%)
Query: 40 VRGVIAPHAGYSYSGRAAAYAFGNI-----DPTSISRVFLLGPSHHYYTPKCALSTATVY 94
VRGV+ PH R +G + RV ++G +H + A + +
Sbjct: 139 VRGVVMPH----LEPRRVPEVYGAVLEALAQTPPPERVLVVGVAHRPIQ-EIAAALPLAF 193
Query: 95 KTPIGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPIL 153
+TP+G + +DLE + L A FEL + + EHS+E +L V+ KI+P++
Sbjct: 194 ETPLGRVEVDLEALGALDALVPFELFNTPLAFREEHSVEFPAVFLKGVWPEASFKILPLI 253
Query: 154 VG----AVNAENEAMYGRLFAKYVDDPSNFFSVSS-DFCHWGSRFNYMHYDKKHGVIHKS 208
V A+ E A + AK D P F V+S D H G+RF HG + ++
Sbjct: 254 VSGSEEAIPLEELARAVEVLAK--DYP--FLPVASVDLSHVGARFG-------HGPLTRA 302
Query: 209 I----EALDKMGMDIIETGDPD-AFKKYLLEYDNT 238
+ +A+D+ ++++ G+ D AF L + T
Sbjct: 303 LATRAQAVDRRYLELVAGGEFDRAFLDVTLPKNPT 337
>gi|86159727|ref|YP_466512.1| hypothetical protein Adeh_3308 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85776238|gb|ABC83075.1| protein of unknown function DUF52 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 301
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 119/289 (41%), Gaps = 25/289 (8%)
Query: 10 AGSWYTDNSKRLAEELDGWLREAGLPKSPEV--RGVI-APHAGYSYSGRAAAYAFGNIDP 66
AG+ Y D L LD WL + RGV+ APH Y+ A+A+ ++
Sbjct: 25 AGAVYPDAPGALRRALDRWLALPAGAPAAPPAPRGVVVAPHIDYARGAAGYAHAYRALE- 83
Query: 67 TSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL-KATGKFELM-DICV 124
S + +F++ + H P+ T Y TP+G + D +++ L A G+ L+ D
Sbjct: 84 ASRADLFVIFGTAHATPPRPFTLTRLDYGTPLGPVRTDRALVDALCGALGEDALLGDELC 143
Query: 125 DEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN-----AENEAMYGRLFAKYVDDPSNF 179
EHS+E+ LA ++P+L A+ A A + A+ V S
Sbjct: 144 HRDEHSIELQAVVLAHRLR-RPFTVLPVLCSAIGHLADPAAATAPFLDALARAVAGRSVC 202
Query: 180 FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLE--YDN 237
+ +D H G R Y ++ A D+ + +E GDP F + +
Sbjct: 203 WVAGADLAHVGPR--YGDARPPAPAELAALAAADRRTLRYVEAGDPAGFHRDAVRDGARR 260
Query: 238 TICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAA 286
+CG PI L G + + L Y+Q T SVS+A+AA
Sbjct: 261 RLCGIAPIYAALRSAGAGA-----RLLHYQQ----WTDGVDSVSFAAAA 300
>gi|73748995|ref|YP_308234.1| hypothetical protein cbdb_A1253 [Dehalococcoides sp. CBDB1]
gi|73660711|emb|CAI83318.1| conserved hypothetical protein [Dehalococcoides sp. CBDB1]
Length = 385
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/213 (23%), Positives = 90/213 (42%), Gaps = 19/213 (8%)
Query: 73 FLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSME 132
+LGPSH + A+ + +++TP+G++ +D + + + D + EHS+E
Sbjct: 14 IILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHIKADPSAHQYEHSIE 73
Query: 133 MHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSN--FFSVSSDFCHWG 190
+ +P L + F+ IKIVPI V +E A G A + + SSD H+
Sbjct: 74 VQIPML-QYFKPD-IKIVPITVSFGKSETLADIGYGIASALRETGREAIIIASSDMTHYE 131
Query: 191 SRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLH 250
S+ + H D + +D I D + + + T+CG P++ L
Sbjct: 132 SQSD-AHLK-------------DSLALDAIIKLDAAEMLERIQQNHITMCGYAPVAAMLT 177
Query: 251 MLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYA 283
+ K + + + Y+ S D V YA
Sbjct: 178 AVKEMGAK-RARVVAYQTSGDVTHHLDQVVGYA 209
>gi|162452116|ref|YP_001614483.1| hypothetical protein sce3843 [Sorangium cellulosum So ce56]
gi|50403845|gb|AAT76677.1| Dor2 [Sorangium cellulosum So ce56]
gi|161162698|emb|CAN94003.1| hypothetical protein sce3843 [Sorangium cellulosum So ce56]
Length = 422
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/300 (24%), Positives = 117/300 (39%), Gaps = 42/300 (14%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVR------------GVIAPH---- 47
IR A+HAG Y + ++LA+ ++ + P+ R G+ APH
Sbjct: 111 IRMAAHAGGAYHGDRRKLADFIERKCLQVARPQGGATRAAASQGAPARMVGLCAPHMDLW 170
Query: 48 ---AGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLD 104
AGY ++ A A + LLG SH A+ T + TP+G L D
Sbjct: 171 RAAAGYGHAYAALEQALAGPGLEQVDTFVLLGTSHAAMRRPYAVCEKT-FATPLGPLEPD 229
Query: 105 LEVIEELKATGKFELM-DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVG-------- 155
E+I EL A +F++ D + + EHS+E ++ + G IVPIL G
Sbjct: 230 REMIAELAAASRFDVREDQYLHKNEHSIEFQAVFVRHLLGGRAASIVPILCGLSECQARR 289
Query: 156 ---AVNAENEAMYGRLFAKYVDDPSNFFSVS-SDFCHWGSRF-NYMHYDKKHGVIHKSIE 210
A + E+ L P ++ +D H G RF + D++ ++
Sbjct: 290 RDPAQDDGAESFLRALRDALAKRPGRVLVIAGADLAHVGPRFGDPAPLDERQ---RTALR 346
Query: 211 ALDKMGMDIIETGDPDAFKKYLLE--YDNTICGRHPISVFLHMLGNCSTKIKIKFLRYEQ 268
D ++ + D F + +CG PI L L S + + L YEQ
Sbjct: 347 DRDLASIERATSIDAPGFFVDVARDLASRRVCGLGPIYTLLRALPPSS---RGEMLHYEQ 403
>gi|242079151|ref|XP_002444344.1| hypothetical protein SORBIDRAFT_07g020505 [Sorghum bicolor]
gi|241940694|gb|EES13839.1| hypothetical protein SORBIDRAFT_07g020505 [Sorghum bicolor]
Length = 152
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/51 (68%), Positives = 41/51 (80%)
Query: 241 GRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYASAAAKVDA 291
G + ++ ML +CSTKIKI F+RYEQSSQCK RDSSVSYASAAAKVD+
Sbjct: 94 GGNGVAATNKMLKHCSTKIKIGFVRYEQSSQCKNFRDSSVSYASAAAKVDS 144
>gi|302037806|ref|YP_003798128.1| hypothetical protein NIDE2493 [Candidatus Nitrospira defluvii]
gi|300605870|emb|CBK42203.1| conserved protein of unknown function DUF52 [Candidatus Nitrospira
defluvii]
Length = 420
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 63/300 (21%), Positives = 126/300 (42%), Gaps = 28/300 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWL--REAGLPKSPE-----VRGVIAPHAGYSYSGRAA 57
R AG Y ++ +L +++G+ +E K E ++G++AP ++G
Sbjct: 125 RLPQFAGRSYEADAAKLRAQINGFFISKEGPEVKKSEHAGKLIKGLVAPTYELKHAGPIY 184
Query: 58 AYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A+A+ + +++L + + T ++TP+G + +D V + L+
Sbjct: 185 AWAYKELQEAQQPDLYVLIGTAYSGLEHPVAMTDKDFETPLGLVNVDRTVTDRLREHIPS 244
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVF-EGHLIKIVPILVG---------AVNAENEAMYGR 167
D EH++E LP+L + + IVPIL + + E
Sbjct: 245 AFTDELAHHNEHALEFQLPFLQESLGKDRPFTIVPILTSFSADSLRDPQIREQVETFLQA 304
Query: 168 LFAKYVDDPSNF-FSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPD 226
L V N+ V ++ H G R+ H+ ++ +D + +E DP+
Sbjct: 305 LKDALVATGRNYCVVVGAELAHIGMRYGDSAPPTDFS-FHRCMQ-IDLEMLKHVENRDPE 362
Query: 227 AFKKYLLEYDNT--ICGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSSVSYAS 284
F K++ + ++ I G PI L ++ + ++ LRY++ + +S+V+YAS
Sbjct: 363 EFAKFIQKENDQRRISGFSPIYSLLRLIQAETGQV----LRYDRG--ITDQYNSTVTYAS 416
>gi|406886416|gb|EKD33450.1| hypothetical protein ACD_76C00027G0005 [uncultured bacterium]
Length = 592
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 80/173 (46%), Gaps = 12/173 (6%)
Query: 72 VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSM 131
+ ++GP+H + + ++TP G + D V++ L A+G V EHS+
Sbjct: 133 IVIVGPNHESIGAEIQTMRGS-FETPFGRVSTDNSVVDALVASGAAS-ASPDVFRTEHSI 190
Query: 132 EMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD-PSNFFSVSS-DFCHW 189
H+PY+AK+F KIVP+L + E R+ + PSN F VSS DF H
Sbjct: 191 AFHVPYIAKLFPS--AKIVPVLYHSSVPSGEV--ARVLSNMRSALPSNSFFVSSIDFSHG 246
Query: 190 GSRFNYMHYDKKHGVI--HKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTIC 240
+ D+K I ++S + LD + + I++ P + YL D C
Sbjct: 247 LDSAHSNFNDRKTWNIMQNRSFDELDALQPEFIDS--PPSASAYLQAIDAQKC 297
>gi|392413871|ref|YP_006450478.1| putative dioxygenase [Desulfomonile tiedjei DSM 6799]
gi|390627007|gb|AFM28214.1| putative dioxygenase [Desulfomonile tiedjei DSM 6799]
Length = 277
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 62/261 (23%), Positives = 114/261 (43%), Gaps = 24/261 (9%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA 60
M +RA +G WY +++++ E++ +L P +G IAPHAG+ +SG+AAA
Sbjct: 1 MSTRKRALPSG-WYPESAEQCRREIEEFLTGFKAPAG-SWKGGIAPHAGWYFSGKAAAKV 58
Query: 61 FGNIDPTS-ISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
+ ++ R+ L G H + + ++TP G PLD + + ++G
Sbjct: 59 MKTLAESAHPDRIVLFG--GHLPSGSPIIYADDEWETPFGMQPLDSHLAKTAVSSGNAVF 116
Query: 120 MDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP--S 177
+ A++++E+ LP + F I I V + + A+ +++ S
Sbjct: 117 AG--RNFADNTVEIQLPMIRMFFP----HIPVIAVHSPATKQAAILAESIHNLLNEKGLS 170
Query: 178 NFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGD-----PDAFKKYL 232
+ S+D H+G + + + E D+ +D TGD DAF+K+
Sbjct: 171 AIYIGSADLTHYGPNYGFSPKGTGASAVTWVREENDRSLIDKALTGDVQGVIDDAFQKH- 229
Query: 233 LEYDNTICGRHPISVFLHMLG 253
NT C PI+ + +G
Sbjct: 230 ----NT-CSAGPIASVMASVG 245
>gi|384431098|ref|YP_005640458.1| hypothetical protein [Thermus thermophilus SG0.5JP17-16]
gi|333966566|gb|AEG33331.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Thermus thermophilus SG0.5JP17-16]
Length = 370
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 91/223 (40%), Gaps = 12/223 (5%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
E R ++ PH S A ++ T R++L+G +H + A + ++TP
Sbjct: 136 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKERAA-ALPVPFQTP 194
Query: 98 IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
G DL ++ L A FEL + + EHS+E+ L +L F +++P+LV
Sbjct: 195 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPKA--RVLPLLVAR 252
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
+AE G + D ++ D H G RF + + + D
Sbjct: 253 RSAE----LGEALKVVLRDHPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 305
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
++ + G+P+A +L I G ++ L +L K+
Sbjct: 306 LERLAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRERKGKV 348
>gi|313679973|ref|YP_004057712.1| hypothetical protein [Oceanithermus profundus DSM 14977]
gi|313152688|gb|ADR36539.1| protein of unknown function DUF52 [Oceanithermus profundus DSM
14977]
Length = 384
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 80/173 (46%), Gaps = 9/173 (5%)
Query: 70 SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVD-EAE 128
+R +LG +H + + A + T +TP+G LP+DLE ++ L A FEL + + E
Sbjct: 169 ARAVVLGVAHRGLS-EVAAAWGTPLETPLGPLPVDLEALQALDALLPFELFNTPLAFREE 227
Query: 129 HSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNF---FSVSSD 185
HS+E +L + +I+P++V + E +A L + S D
Sbjct: 228 HSIEFPAVFLKAAWSDDAPRILPLIVSG-DPERKAELDELAQALALLAERYPLWPLASVD 286
Query: 186 FCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT 238
H G+RF + D++ +++ D+ ++++ G +A L+ N+
Sbjct: 287 LSHVGARFGHPPLDRERVTQARTV---DRRYLELVAAGSFEAAWASLIPAGNS 336
>gi|320104323|ref|YP_004179914.1| hypothetical protein Isop_2799 [Isosphaera pallida ATCC 43644]
gi|319751605|gb|ADV63365.1| protein of unknown function DUF52 [Isosphaera pallida ATCC 43644]
Length = 318
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 58/230 (25%), Positives = 95/230 (41%), Gaps = 24/230 (10%)
Query: 10 AGSWYTDNSKRLAEELDGWLREA--GLPK-SPEVRGVIAPHAGYSYSGRAAA---YAFGN 63
AG Y ++ ++LA +LD W +A LP S R + PH ++ G A A +
Sbjct: 15 AGLSYPNDPRQLAAQLDTWFNQALDNLPALSTTWRAALCPHIDFARGGAVYATVHRALNH 74
Query: 64 IDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL-MDI 122
++ S S + G SH + +T + TP+G + D ++ L+ EL +D
Sbjct: 75 LNQPS-STYLIYGVSHRVWCRHRFAATRRDFATPLGLVRTDQRFLDHLERVFGPELFVDA 133
Query: 123 CVDEAEHSMEMHLPYLAKVFEGHL-IKIVPILVGA--------VNAENEAMYGRLFAKYV 173
+ E S+E +L + G IVPILVG+ V+ + + R+ V
Sbjct: 134 DAHDPEWSIEFQAVWLQHLLGGRRDFTIVPILVGSFHDLMVRGVDPLEDPLVQRMVEAIV 193
Query: 174 D-----DPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMD 218
D F S DF H G F + H + + + D+ +D
Sbjct: 194 RAERDHDRPVFHLASVDFSHIGPEFGDPEPLRDHDL--EQLRQFDRAMLD 241
>gi|410463514|ref|ZP_11317025.1| putative dioxygenase [Desulfovibrio magneticus str. Maddingley
MBC34]
gi|409983368|gb|EKO39746.1| putative dioxygenase [Desulfovibrio magneticus str. Maddingley
MBC34]
Length = 408
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 84/209 (40%), Gaps = 19/209 (9%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSP--EVRGVIAPH----AGYSYSGRAAA 58
R AG Y D+ LA LD L EA + V+ VIAPH AG + A A
Sbjct: 113 RETVFAGQAYPDDGAELAAYLDAVLAEAPEACAAPGPVKAVIAPHIDPEAGRAAYAAAYA 172
Query: 59 YAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFE 118
I + RV +LG H L T + TP+GD+ D E L+ G
Sbjct: 173 ALGAAIKAAAPKRVIVLGVGHQIIDGLYCL-TDKAFATPLGDVAADAEASARLRTAGGQA 231
Query: 119 LMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPIL----VGAVNAENEAMYGRLFAKYV 173
+ + +AEHS+E +L ++ +VPIL VG V+A + ++
Sbjct: 232 VDASSLPHKAEHSVEFQAVFLRRLLGDAPFAMVPILCGSPVGVVDAPTRRAFRAYAGPFL 291
Query: 174 D-------DPSNFFSVSSDFCHWGSRFNY 195
D P DFCH G +F +
Sbjct: 292 DALAEIAAAPDTLIVAGVDFCHIGGKFGH 320
>gi|410696913|gb|AFV75981.1| putative dioxygenase [Thermus oshimai JL-2]
Length = 367
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 70/159 (44%), Gaps = 9/159 (5%)
Query: 43 VIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
++ PH S A A +++ T R++LLG +H K A + + TP G
Sbjct: 137 LLMPHLEPSRVPEAYGAALASLEKTPPPERIYLLGVAHRPLKEKAA-ALPVPFATPFGPA 195
Query: 102 PLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
DL+ ++ L A FEL + + EHS+E+ L +L V+ K++P+L+G + E
Sbjct: 196 EPDLKALQALDALLPFELFNTPLAFREEHSLELPLFFLKGVYPEA--KVLPLLIGERSPE 253
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYD 199
G + + ++ D H G RF D
Sbjct: 254 ----LGEALRVLMGEFPGLLVLAVDLSHVGPRFGDRPLD 288
>gi|386360570|ref|YP_006058815.1| dioxygenase [Thermus thermophilus JL-18]
gi|383509597|gb|AFH39029.1| putative dioxygenase [Thermus thermophilus JL-18]
Length = 370
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 90/223 (40%), Gaps = 12/223 (5%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
E R ++ PH S A ++ T R++L+G +H + A + ++TP
Sbjct: 136 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKERAA-ALPVPFQTP 194
Query: 98 IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
G DL ++ L A FEL + + EHS+E+ L +L F +++P+LV
Sbjct: 195 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPEA--RVLPLLVAR 252
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
+ E G + D ++ D H G RF + + + D
Sbjct: 253 RSPE----LGEALKVVLRDHPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 305
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
++ I G+P+A +L I G ++ L +L K+
Sbjct: 306 LERIAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRGRKGKV 348
>gi|55980893|ref|YP_144190.1| hypothetical protein TTHA0924 [Thermus thermophilus HB8]
gi|7388540|sp|Q56419.2|Y924_THET8 RecName: Full=MEMO1 family protein TTHA0924
gi|295400|gb|AAA27496.1| The first start codon in ORFD is located at position 2046 [Thermus
thermophilus HB8]
gi|55772306|dbj|BAD70747.1| conserved hypothetical protein [Thermus thermophilus HB8]
Length = 326
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 90/223 (40%), Gaps = 12/223 (5%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
E R ++ PH S A ++ T R++L+G +H K A + ++TP
Sbjct: 92 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKEKAA-ALPVPFQTP 150
Query: 98 IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
G DL ++ L A FEL + + EHS+E+ L +L F +++P+LV
Sbjct: 151 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPEA--RVLPLLVAR 208
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
+ E G + D ++ D H G RF + + + D
Sbjct: 209 RSPE----LGEALKVVLRDFPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 261
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
++ + G+P+A +L I G ++ L +L K+
Sbjct: 262 LERLAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRERKGKV 304
>gi|281495|pir||S27738 hypothetical protein D - Thermus aquaticus
Length = 339
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 90/223 (40%), Gaps = 12/223 (5%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
E R ++ PH S A ++ T R++L+G +H K A + ++TP
Sbjct: 105 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKEKAA-ALPVPFQTP 163
Query: 98 IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
G DL ++ L A FEL + + EHS+E+ L +L F +++P+LV
Sbjct: 164 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPEA--RVLPLLVAR 221
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
+ E G + D ++ D H G RF + + + D
Sbjct: 222 RSPE----LGEALKVVLRDFPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 274
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
++ + G+P+A +L I G ++ L +L K+
Sbjct: 275 LERLAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRERKGKV 317
>gi|46198863|ref|YP_004530.1| hypothetical protein TTC0555 [Thermus thermophilus HB27]
gi|46196486|gb|AAS80903.1| hypothetical conserved protein [Thermus thermophilus HB27]
Length = 370
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 90/223 (40%), Gaps = 12/223 (5%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTP 97
E R ++ PH S A ++ T R++L+G +H K A + ++TP
Sbjct: 136 EARVLLMPHLEPSRVPEVYGAALAALEKTPPPERIYLVGVAHRPLKEKAA-ALPVPFQTP 194
Query: 98 IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
G DL ++ L A FEL + + EHS+E+ L +L F +++P+LV
Sbjct: 195 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRFPEA--RVLPLLVAR 252
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
+ E G + D ++ D H G RF + + + D
Sbjct: 253 RSPE----LGEALKVVLRDFPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 305
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
++ + G+P+A +L I G ++ L +L K+
Sbjct: 306 LERLAEGEPEAALAFLGANPTRIDGVEVVASLLPLLRERKGKV 348
>gi|381189704|ref|ZP_09897229.1| hypothetical protein RLTM_01015 [Thermus sp. RL]
gi|380452281|gb|EIA39880.1| hypothetical protein RLTM_01015 [Thermus sp. RL]
Length = 370
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/223 (22%), Positives = 89/223 (39%), Gaps = 12/223 (5%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSIS-RVFLLGPSHHYYTPKCALSTATVYKTP 97
E R ++ PH S A ++ T R++L+G +H A + ++TP
Sbjct: 136 EARXLLMPHLEPSRVPEVYGAALAALEKTPPPDRIYLVGVAHRPLKDXAA-ALPVPFQTP 194
Query: 98 IGDLPLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGA 156
G DL ++ L A FEL + + EHS+E+ L +L F +++P+LV
Sbjct: 195 FGPALPDLPALQALDALLPFELFNTPLAFREEHSLELPLFFLKGRF--PXARVLPLLVAR 252
Query: 157 VNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMG 216
+AE G + D ++ D H G RF + + + D
Sbjct: 253 RSAE----LGEALKVVLRDXPGLLVLAVDLSHVGPRFGDTPLTR---TLAEEARRRDLGF 305
Query: 217 MDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKI 259
++ + G+P+A +L G ++ L +L K+
Sbjct: 306 LERLAEGEPEAALAFLGANPTRXDGVEVVASLLPLLRERKGKV 348
>gi|333999164|ref|YP_004531776.1| hypothetical protein TREPR_1172 [Treponema primitia ZAS-2]
gi|333739456|gb|AEF84946.1| conserved hypothetical protein [Treponema primitia ZAS-2]
Length = 285
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 75/166 (45%), Gaps = 14/166 (8%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYA- 60
EK R AG +Y + L +L E+G+ + +IAPH ++ SG AA A
Sbjct: 10 EKGRHPVVAGLFYPEEKIDLESQLCSLGLESGV--GGKAAAIIAPHGAWNLSGAPAAAAF 67
Query: 61 -------FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
+ ISRV +LG H+ S + ++TP+G + +D EV E L +
Sbjct: 68 SAAAGRAWQRNGGEGISRVVILGTIHNKEDQGIFFSDSQFFETPLGRMWIDSEVSETLAS 127
Query: 114 TGK-FELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN 158
FE DI E S+E+ LP + F G IVP+L+G
Sbjct: 128 CSTLFETNDI-PHLKETSVEVLLPLVQFCFPGA--AIVPVLMGGAQ 170
>gi|302338446|ref|YP_003803652.1| hypothetical protein Spirs_1932 [Spirochaeta smaragdinae DSM 11293]
gi|301635631|gb|ADK81058.1| protein of unknown function DUF52 [Spirochaeta smaragdinae DSM
11293]
Length = 283
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/227 (22%), Positives = 103/227 (45%), Gaps = 20/227 (8%)
Query: 2 EKIRRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAF 61
+K+R+ WY D + + + + W + I PHAG+++SG A
Sbjct: 4 KKVRKRGLPVGWYPDVEREVLKRITDWEATDTESHLDKAFASIIPHAGWTFSGALAWKGI 63
Query: 62 GNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLE----VIEELKATGKF 117
++ + S + + G H + ++ + TP+G +P D+ ++EEL +GK
Sbjct: 64 RLLNEDAESVIVVGG--HLFAGSGILMALEDAFDTPLGLIPADIGFRDLLLEEL--SGKV 119
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDP- 176
+ + E ++S+E+HLP + F K++ + VG + + + G A+ D
Sbjct: 120 SVREDT--ETDNSVEIHLPLIKYHFPK--AKLIWLRVG--SGDESLVLGECAARLAMDEG 173
Query: 177 -SNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIET 222
+ S+D H+G +++M G ++ +EA D ++IET
Sbjct: 174 GATVMIGSTDLTHYGPDYHFMPAGTG-GEAYRWVEANDA---EMIET 216
>gi|389580823|ref|ZP_10170850.1| putative dioxygenase [Desulfobacter postgatei 2ac9]
gi|389402458|gb|EIM64680.1| putative dioxygenase [Desulfobacter postgatei 2ac9]
Length = 294
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/207 (24%), Positives = 87/207 (42%), Gaps = 22/207 (10%)
Query: 4 IRRASHAGSWY---TDNSKRLAEELDGWLREAGLPKSPE--VRGVIAPHAGYSYSGRAAA 58
+++ + AGSWY D + E + LR K+ V G++ PHAG+ YSG+ A
Sbjct: 3 LKKMAFAGSWYPGSADQCRSALERFNDDLRAEADTKTLNNPVAGIV-PHAGWIYSGKLAC 61
Query: 59 YAFGNI--DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEEL---KA 113
F + ++ + L G H +P L T TP+G + +D L A
Sbjct: 62 RVFAALAHGRRAVETIVLFGVHMHADSPSFVLDC-TAVDTPLGAIEIDKAFTGRLVKQAA 120
Query: 114 TGKFELMDICVDE--AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAM-YGRLFA 170
+L + E+++E+ P++ F I+V AV + A G +
Sbjct: 121 AANVDLKQLTPARFPEENTLELQYPFIKYFFPK-----ARIVVCAVPPSDAAQALGIIVV 175
Query: 171 KYVDDPSNFFSV--SSDFCHWGSRFNY 195
+ + +V S+D H+G RF +
Sbjct: 176 EVAQELDRSVAVVGSTDMTHYGPRFGF 202
>gi|395731960|ref|XP_003775991.1| PREDICTED: protein MEMO1-like, partial [Pongo abelii]
Length = 100
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 32/48 (66%), Gaps = 1/48 (2%)
Query: 20 RLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPT 67
+L +L+GWL + K P R +IAPHAGY+Y G AA+A+ +DP+
Sbjct: 52 QLNAQLEGWLSQVQSTKRP-ARAIIAPHAGYTYCGSCAAHAYKQVDPS 98
>gi|384439816|ref|YP_005654540.1| hypothetical protein [Thermus sp. CCB_US3_UF1]
gi|359290949|gb|AEV16466.1| hypothetical protein TCCBUS3UF1_14250 [Thermus sp. CCB_US3_UF1]
Length = 370
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 9/154 (5%)
Query: 43 VIAPHAGYSYSGRAAAYAFGNIDPTSI-SRVFLLGPSHHYYTPKCALSTATVYKTPIGDL 101
++ PH S A A ++ T RV+L+G +H + A + ++TP G
Sbjct: 140 LLLPHLEPSRVPEAYGAALAALEKTPPPERVYLVGVAHRPLKERAA-ALPVPFQTPFGPA 198
Query: 102 PLDLEVIEELKATGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAE 160
DLE ++ L A FEL + + EHS+E+ L +L + + +P+LVG + E
Sbjct: 199 LPDLEALQALDALLPFELFNTPLAFREEHSLELPLLFLKGAYPEA--RFLPLLVGRRSPE 256
Query: 161 NEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFN 194
G + D ++ D H G RF
Sbjct: 257 ----LGEALKVVLRDYPGLLVLAVDLSHVGPRFG 286
>gi|114319687|ref|YP_741370.1| dioxygenase-like protein [Alkalilimnicola ehrlichii MLHE-1]
gi|114226081|gb|ABI55880.1| dioxygenase-like protein [Alkalilimnicola ehrlichii MLHE-1]
Length = 225
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 83/185 (44%), Gaps = 9/185 (4%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLR-EAGLPKSPEVRGVIAPHAGYSYSGRAAAY 59
M ++RR AG +Y + + L + LD LR L P R ++ P G + +G A A
Sbjct: 1 MSEVRRPVAAGLFYPGHPELLRQSLDTLLRAHPTLGTRP--RALVLPPGGQARAGAAIAA 58
Query: 60 AFGNIDPTSI--SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKF 117
A + + +RV+LL + H A S + TP+G L LD IE L+
Sbjct: 59 ACSLLQAAAAPPNRVYLLATTPHRTAEGPAFSGKRQFATPLGRLTLDAAGIERLQDDAGG 118
Query: 118 ELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPS 177
L D EH +E LPYL +V ++VP+L+ + A GR+ ++D +
Sbjct: 119 ALDDRA-HALEHRLEAPLPYLQRVLPP--FQLVPVLLPEAGTTSAAC-GRILQLALEDRA 174
Query: 178 NFFSV 182
V
Sbjct: 175 GLLVV 179
>gi|224370218|ref|YP_002604382.1| putative dioxygenase [Desulfobacterium autotrophicum HRM2]
gi|223692935|gb|ACN16218.1| putative dioxygenase [Desulfobacterium autotrophicum HRM2]
Length = 324
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 95/247 (38%), Gaps = 18/247 (7%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
+ A+ G+WY + + E+ +L E + + G I PHAG+ +SG A ++
Sbjct: 34 KDAAFKGAWYPADGQACEREIVRFLNERTGTLTGDYVGGIVPHAGWVFSGSIACRVIASL 93
Query: 65 DPTS-------ISRVFLLGPSHHYYTPKCALSTA-TVYKTPIGDLPLDLEVIEEL-KATG 115
T + V L G HH + L ++TP GDL + V L KA G
Sbjct: 94 AGTGSGGAHGKVDLVLLFG--HHMHPLDAPLVMGRGAWETPFGDLVVHQGVAGALAKAVG 151
Query: 116 KFELMDICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD 175
L + E+++E+ LP++ F IVP+ G + + G L +
Sbjct: 152 ATTLGPDAFPD-ENTIELQLPFVKYFFPD--AAIVPM--GVPPSAHAERIGILAVEAALK 206
Query: 176 PSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLL 233
V S+D H+G + + K + D + + D D L
Sbjct: 207 SGLSLRVIGSTDMTHYGPNYGFSPAGKGQAALDWVKNTNDSRAIKAMVEMDVDTILAQGL 266
Query: 234 EYDNTIC 240
E+ N C
Sbjct: 267 EHHNLCC 273
>gi|220918543|ref|YP_002493847.1| hypothetical protein A2cp1_3451 [Anaeromyxobacter dehalogenans
2CP-1]
gi|219956397|gb|ACL66781.1| protein of unknown function DUF52 [Anaeromyxobacter dehalogenans
2CP-1]
Length = 299
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 72/160 (45%), Gaps = 7/160 (4%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAGLP---KSPEVRGVIAPHAGYSYSGRAAAYA 60
R + AG+ Y D L LDGWL +P V+APH Y+ A+A
Sbjct: 17 FRPPACAGAVYPDAPGALRRALDGWLALPAGAPAASTPPPGVVVAPHIDYARGAAGYAHA 76
Query: 61 FGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKAT-GKFEL 119
+ ++ S + +F++ + H P+ T Y TP+G + D +++ L T G+ L
Sbjct: 77 YRALE-ASRADLFVIFGTAHATPPRPFTLTRLDYGTPLGPVRTDRALVDALCTTLGEDAL 135
Query: 120 M-DICVDEAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVN 158
+ D V EHS+E+ LA ++P+L A+
Sbjct: 136 LGDELVHRDEHSVELQAVILAHRLR-RPFTVLPVLCSAIG 174
>gi|189218196|ref|YP_001938838.1| dioxygenase [Methylacidiphilum infernorum V4]
gi|189185054|gb|ACD82239.1| Predicted dioxygenase [Methylacidiphilum infernorum V4]
Length = 320
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 60/265 (22%), Positives = 109/265 (41%), Gaps = 23/265 (8%)
Query: 39 EVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPI 98
E ++APH + S +A YAF + + F+L H+ + ++ Y TP+
Sbjct: 58 ETLAILAPHIDFQVSPKAYTYAFSPWFSMAEADFFILLGVGHHSRIEWSIDPRD-YITPL 116
Query: 99 GDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK----IVPILV 154
G +++EE+ F L D + EHS+E + ++ + I+ +P+L
Sbjct: 117 GRAYNKRDLVEEINRKVDFCLTDPLAHQKEHSIEFPIVFMQALRYWMGIQKPLCFLPVLC 176
Query: 155 GAV-------NAENEAMYGRLFAKYVDDPSNFFS------VSSDFCHWGSRFNYMHYDKK 201
G + N E + + A + + +F+ +S D CH G RF H K
Sbjct: 177 GGLHELIIYDNGREELLRMKKLASILREILSFYKKKVGLVLSIDGCHIGPRFG--HPFKV 234
Query: 202 HGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKI 261
+ K+ + +K +E + + F +L + N ++ L M K+ +
Sbjct: 235 TEEMLKNTDLWEKELWRHVEEQNFEGFISHLQKEKNIRFFDGVGAIALTMEIFKDQKVSL 294
Query: 262 KFLRYEQSSQCKTKRDSSVSYASAA 286
+ YEQ + RDSSV S+
Sbjct: 295 RRTYYEQWFE---GRDSSVVTFSSG 316
>gi|415987739|ref|ZP_11559786.1| hypothetical protein GGI1_14396, partial [Acidithiobacillus sp.
GGI-221]
gi|339834966|gb|EGQ62687.1| hypothetical protein GGI1_14396 [Acidithiobacillus sp. GGI-221]
Length = 209
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 43/200 (21%), Positives = 79/200 (39%), Gaps = 28/200 (14%)
Query: 87 ALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHL 146
AL +TP+G + +D +E L + M EH++E+ LP++ +V
Sbjct: 27 ALPGVQAMQTPLGTVAVDQAGVEALAGLPEVREMP-AAHAQEHALEVQLPFIQEVLGD-- 83
Query: 147 IKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH 206
+ +VP++VG + A R+ K +SSD H Y Y + + H
Sbjct: 84 VSVVPLVVGDARPDEVA---RVLEKLWGGEETVIVISSDLSH------YHPYAEARAIDH 134
Query: 207 KSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFLHMLGNCSTKIKIKFLRY 266
++E + + F ++++ CG PI+ L + + + +
Sbjct: 135 HTVEEILR-------------FDPTPIDHEQA-CGATPINGLLPVARK--HHLHPRLVGL 178
Query: 267 EQSSQCKTKRDSSVSYASAA 286
S RD+ V YA+ A
Sbjct: 179 CNSGDTAGSRDAVVGYAAVA 198
>gi|218296601|ref|ZP_03497319.1| protein of unknown function DUF52 [Thermus aquaticus Y51MC23]
gi|218242914|gb|EED09447.1| protein of unknown function DUF52 [Thermus aquaticus Y51MC23]
Length = 370
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 72/171 (42%), Gaps = 11/171 (6%)
Query: 70 SRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVD-EAE 128
R++L+G +H K A + + TP G + DL ++ L A +EL + + E
Sbjct: 168 ERIYLVGVAHRPLEAKAA-ALPVPFHTPFGPMEPDLPALQALDALLPYELFNTPLAFREE 226
Query: 129 HSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCH 188
HS+E+ L +L + ++VP+LVG E G + D ++ D H
Sbjct: 227 HSLELPLFFLKGRYPK--ARLVPLLVGRRGPE----LGEALRVVLKDHPGLVVLAVDLSH 280
Query: 189 WGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
G RF + + + D ++ + G+P+A +L E I
Sbjct: 281 VGPRFGDGPLSRP---LAEEARRRDLGFLERLAQGEPEAALAFLGENPTRI 328
>gi|328953314|ref|YP_004370648.1| hypothetical protein Desac_1616 [Desulfobacca acetoxidans DSM
11109]
gi|328453638|gb|AEB09467.1| UPF0103/Mediator of ErbB2-driven cell motility-containing protein
[Desulfobacca acetoxidans DSM 11109]
Length = 281
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 43/201 (21%), Positives = 84/201 (41%), Gaps = 17/201 (8%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDGWLREAG-LPKSPEVRGVIAPHAGYSYSGRAAAY 59
M K+R + WY ++ + L ++ L ++ G + PHAG+ +SGR AA
Sbjct: 1 MMKVRPRTLPIGWYPESGRECQNRLTAFIAGVKPLAAGAKIYGGVMPHAGWYFSGRLAAQ 60
Query: 60 AFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFEL 119
F + +V + H L ++TP+G++ L E+ + L+
Sbjct: 61 VFYLAAQSGQPQVVCIFGGHLGSGSPPLLVMEDAWETPLGEITLAKEMYQPLQEK----- 115
Query: 120 MDICVDE--AEHSMEMHLPYLAKVF-EGHLIKIVPILVGAVNAENEAMYGRLFAKYVDD- 175
C E ++++E+ LP + F + L+ + A +++ + G+ +D
Sbjct: 116 -ITCAREYPGDNTIEIQLPMVKHFFPQAKLLALR-----APHSQQAVLLGQAVVAVANDL 169
Query: 176 -PSNFFSVSSDFCHWGSRFNY 195
S SSD H+G + +
Sbjct: 170 NLSLLAFGSSDLTHYGPNYGW 190
>gi|333997968|ref|YP_004530580.1| hypothetical protein TREPR_2499 [Treponema primitia ZAS-2]
gi|333739632|gb|AEF85122.1| protein of unknown function DUF52 [Treponema primitia ZAS-2]
Length = 303
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 114/274 (41%), Gaps = 20/274 (7%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGW-LREAGLPKSPEVRGVIAPHAGYSYSGRAAAYAFG 62
+R+ WY ++ +A+ L G L P +P +APHAG+ YSG AA A
Sbjct: 3 LRKQRLQAGWYPFSASGVADFLGGISLDGEHYPSAP---AAVAPHAGWFYSGAIAARAVL 59
Query: 63 NIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDI 122
++D + + + G P A A +TP+G+LP+D E L +D
Sbjct: 60 SLDRDAETVAIIGGHLPAGMPPLIAQEDA--LETPLGELPVDREFALALG-----RALDC 112
Query: 123 CVDE-AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFS 181
D ++++E+ LP + F + + + + A A EA GR A+ +
Sbjct: 113 RGDRYTDNTVEVLLPMVHYFFP--RAQALWLRLPAEAASLEA--GREIARIAKKIGRRVA 168
Query: 182 V--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTI 239
V S+D H+G + + K + D+ ++ + +GDP A + E + +
Sbjct: 169 VLGSTDLTHYGENYEFSPRGKGEQALAWVRGVNDRRFIEAVLSGDPRAVLER-AEEERSA 227
Query: 240 CGRHPISVFLHMLGNCSTKIKIKFLRYEQSSQCK 273
C + + G S K + L Y S+ +
Sbjct: 228 CSAGAVLGAIGFAGE-SGAPKGELLAYGTSADAR 260
>gi|384914669|ref|ZP_10015421.1| Predicted dioxygenase [Methylacidiphilum fumariolicum SolV]
gi|384527286|emb|CCG91289.1| Predicted dioxygenase [Methylacidiphilum fumariolicum SolV]
Length = 314
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 118/281 (41%), Gaps = 31/281 (11%)
Query: 27 GWLREAGLPKSPEVR-GVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFL-LGPSHHYYTP 84
W ++ + K + R V++PH + S +A +AF + + + F+ LG HH
Sbjct: 39 AWPKKNTIGKEKKERLAVLSPHIDFQVSPKAYTHAFSHWFSRAEADFFIILGVGHHSRLE 98
Query: 85 KCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLA--KVF 142
S Y TP+G +E++E ++ + F L D + EHS+E + ++ + +
Sbjct: 99 WSIDSRD--YITPLGRAYNRIEIVERIERSVNFSLADPDGHQREHSIEFPIVFMQALRYW 156
Query: 143 EG--HLIKIVPILVGAV------NAENEAM-YGRLFAKYVDDPSNFFS------VSSDFC 187
G ++ VPIL G + N EA+ R+ A + + + +S D C
Sbjct: 157 MGIEKPLEFVPILCGGLHDLIIYNNGKEALSMMRMLASALREVLTAYGEKAALIISIDGC 216
Query: 188 HWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNT--ICGRHPI 245
H G RF H + K E +K +E + + F +L + N G I
Sbjct: 217 HIGPRFG--HPLDLTKALLKDTELWEKELWKNVEEQNLEGFLAHLQKEKNIRFFDGVGAI 274
Query: 246 SVFLHMLGNCSTKIKIKFLRYEQSSQCKTKRDSS-VSYASA 285
++ + + K K F + Q RDSS V+++S
Sbjct: 275 ALLMEIF-----KDKPFFFKRTYYEQWFEPRDSSAVTFSSG 310
>gi|312134017|ref|YP_004001356.1| protein [Bifidobacterium longum subsp. longum BBMN68]
gi|311773323|gb|ADQ02811.1| Hypothetical protein BBMN68_1758 [Bifidobacterium longum subsp.
longum BBMN68]
Length = 462
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 70/183 (38%), Gaps = 42/183 (22%)
Query: 87 ALSTATVYKTPIGDLPLD---------LEVIEELK--------ATGKFELMDICVDEAEH 129
A STA ++TP+G +P+D L V E L+ A +++ EH
Sbjct: 7 ACSTAAAFETPLGTVPVDIAAERKALGLSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEH 66
Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSSDFCHW 189
++E+ +P+L V G + IVP+ G + G + P +SSD H+
Sbjct: 67 AVEVQIPFLQTVL-GPDLTIVPLNAGDATPQE---VGDVLRALWGGPETVIVISSDLSHY 122
Query: 190 GSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYLLEYDNTICGRHPISVFL 249
H+ ALD + D +L + CG +PI+ L
Sbjct: 123 HP--------------HEVARALDDQTIA-------DIAALHLPIHPRRACGAYPINGLL 161
Query: 250 HML 252
+L
Sbjct: 162 DVL 164
>gi|88802626|ref|ZP_01118153.1| hypothetical protein PI23P_08550 [Polaribacter irgensii 23-P]
gi|88781484|gb|EAR12662.1| hypothetical protein PI23P_08550 [Polaribacter irgensii 23-P]
Length = 173
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 60/123 (48%), Gaps = 12/123 (9%)
Query: 35 PKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH----HYYTPKCALST 90
P S + VI+PH Y Y+G+ + I+ + V L+G +H + K +
Sbjct: 56 PNSKTWKTVISPHDDYKYAGKLYHTSLTGINAKT---VILIGVAHKARNYKLQDKIIFGS 112
Query: 91 ATVYKTPIGDLPL-DLEV-IEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLIK 148
T +KTP G L + DL I+E + F + D + EHS+E +P+L K + ++K
Sbjct: 113 FTHWKTPYGKLKVSDLNTKIQEKLSKESFVVHD-GMQTLEHSLEAIVPFLQK--KTAMLK 169
Query: 149 IVP 151
+ P
Sbjct: 170 LYP 172
>gi|21674191|ref|NP_662256.1| hypothetical protein CT1369 [Chlorobium tepidum TLS]
gi|21647354|gb|AAM72598.1| conserved hypothetical protein [Chlorobium tepidum TLS]
Length = 369
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 103/254 (40%), Gaps = 54/254 (21%)
Query: 35 PKSPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSH----HYYTPKCALST 90
P R I PH Y+Y+G NI + V + G +H ++ + +
Sbjct: 73 PAGTAWRAAICPHDDYTYAGWLYPAVLQNIKAKT---VIIFGVAHTAWRYHLENQLIFDS 129
Query: 91 ATVYKTPIGDL---PLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPYLAKVFEGHLI 147
+ ++ P G++ PL E++E L G + D E E+S+E +P+L ++ +
Sbjct: 130 FSSWRGPYGNVKVSPLRDEILERLP-RGMAIVHDPMQSE-EYSVEALIPFLQ--YQNRNV 185
Query: 148 KIVPILVGAVNAENEAMYGRLFAK---YVDDPSNF-------FSVSSDFCHWGSR----F 193
+I+ ILV ++ E + + FAK V +N +SSD H+G
Sbjct: 186 EIISILVPFMDFERMQIVSQHFAKALFAVMKKNNLRWGKDVALLISSDAVHYGDEDWDGR 245
Query: 194 NYMHYDKKHGVIHKSIEALDKMGMDII------------------ETGDPDAFKKYLLEY 235
N+ +Y G + EA D+ +II T DP+ FKK +
Sbjct: 246 NFAYYGTG-GKANALAEAHDR---EIISHSFESELTEKNIARFYASTTDPNDFKKSRWSW 301
Query: 236 DNTICGRHPISVFL 249
CGR I V L
Sbjct: 302 ----CGRFAIPVGL 311
>gi|406968882|gb|EKD93650.1| hypothetical protein ACD_28C00105G0005 [uncultured bacterium]
Length = 285
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 71/173 (41%), Gaps = 18/173 (10%)
Query: 27 GWLREAGLPK--SPEVRGVIAPHAGYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYY-- 82
G L AG P S V G I PH A Y ++R+ L+ P+H Y
Sbjct: 17 GLLVSAGKPPTISERVAGAILPHHLLVEGYLEAFYQHLREQIPGVTRIILISPNHFNYGY 76
Query: 83 -----TPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDEAEHSMEMHLPY 137
T L++ TP PLD+ VI + + + + + EH + +HLP+
Sbjct: 77 FKIRTTDGVFLNSELFPSTP----PLDMAVIRAVTDSSPLSIENTDFTK-EHGVMVHLPF 131
Query: 138 LAKVFEGHLIKIVPILVGAVNAENEA-MYGRLFAKYVDDPSNFFSVSSDFCHW 189
+A+ F +VPI++ +E + M Y DD + S DF H+
Sbjct: 132 IAEHFPE--ASVVPIILKEGTSEAQLDMLASALIPYRDDHTLVLG-SIDFTHY 181
>gi|406913932|gb|EKD53215.1| hypothetical protein ACD_61C00108G0001, partial [uncultured
bacterium]
Length = 479
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 45/191 (23%), Positives = 78/191 (40%), Gaps = 24/191 (12%)
Query: 16 DNSKRLAEELDGWLREAGLPKSPEVRGVIAPH--AGYSYSGRAAAYAFGNIDPTSISRVF 73
DN L + +R +G P R +I PH S R + G+ T V
Sbjct: 42 DNRSFLPAVAENRIRPSGSPP----RVMILPHHLMASSIIARGISLLTGSPPKT----VI 93
Query: 74 LLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICVDE----AEH 129
+L P+H +S+ + TP G + +D E+++ G +C+D+ +EH
Sbjct: 94 ILSPNHANIGQCDIISSKNKWDTPFGRITIDEELLDLFLKDGS-----VCLDDQAMNSEH 148
Query: 130 SMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKYVDDPSNFFSVSS-DFCH 188
+ LP++ KIVP+++ + + K + P + +++S DF H
Sbjct: 149 GIAGLLPFIKYYLPN--TKIVPLILR--KEIDPVLLTNFTQKLISSPEDVITIASIDFSH 204
Query: 189 WGSRFNYMHYD 199
SR M D
Sbjct: 205 GLSRAKAMEAD 215
>gi|310287811|ref|YP_003939069.1| hypothetical protein BBIF_1290 [Bifidobacterium bifidum S17]
gi|313140612|ref|ZP_07802805.1| predicted protein [Bifidobacterium bifidum NCIMB 41171]
gi|421734885|ref|ZP_16173933.1| hypothetical protein B216_08181 [Bifidobacterium bifidum LMG
13195]
gi|421735802|ref|ZP_16174687.1| hypothetical protein B217_02108 [Bifidobacterium bifidum IPLA
20015]
gi|309251747|gb|ADO53495.1| conserved hypothetical protein [Bifidobacterium bifidum S17]
gi|313133122|gb|EFR50739.1| predicted protein [Bifidobacterium bifidum NCIMB 41171]
gi|407077199|gb|EKE50057.1| hypothetical protein B216_08181 [Bifidobacterium bifidum LMG
13195]
gi|407296907|gb|EKF16404.1| hypothetical protein B217_02108 [Bifidobacterium bifidum IPLA
20015]
Length = 96
Score = 40.4 bits (93), Expect = 1.00, Method: Composition-based stats.
Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG------LPKSPE--VRGVIAPHAGYSYSGR 55
+R A+ AG++Y N+ L +D + A P+ P+ + VI PHAGY YSG
Sbjct: 15 VRPAAVAGAFYPANAGELQAMIDRQMNHARRLVSSLAPRLPQGAPKAVIVPHAGYVYSGS 74
Query: 56 AAAYAFGNIDPT 67
AA A+ ++ T
Sbjct: 75 VAALAYALLERT 86
>gi|320450023|ref|YP_004202119.1| hypothetical protein TSC_c09450 [Thermus scotoductus SA-01]
gi|320150192|gb|ADW21570.1| conserved hypothetical protein [Thermus scotoductus SA-01]
Length = 371
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 14/200 (7%)
Query: 54 GRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKA 113
G A A G +P R++L+G +H K A + ++TP G DL ++ L A
Sbjct: 156 GAALAALEGTPEP---QRIYLVGVAHRPLKEKAA-ALPVPFQTPFGPAEPDLAALQALDA 211
Query: 114 TGKFELMDICVD-EAEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGRLFAKY 172
+EL + + EHS+E+ L +L VF K++P+LVG + E G
Sbjct: 212 LLPYELFNTPLAFREEHSLELPLFFLKGVFPKS--KVLPLLVGRRSPE----LGEALKVV 265
Query: 173 VDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDPDAFKKYL 232
D ++ D H G RF + + D ++ + G+P+A +L
Sbjct: 266 RKDYPGLLVLAVDLSHVGPRFGDPPLSR---TLAAEARRRDLGFLERLAEGEPEAALAFL 322
Query: 233 LEYDNTICGRHPISVFLHML 252
I G ++ L +L
Sbjct: 323 GANPTRIDGVEVVASLLPLL 342
>gi|431793613|ref|YP_007220518.1| dioxygenase [Desulfitobacterium dichloroeliminans LMG P-21439]
gi|430783839|gb|AGA69122.1| putative dioxygenase [Desulfitobacterium dichloroeliminans LMG
P-21439]
Length = 299
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 89/201 (44%), Gaps = 13/201 (6%)
Query: 6 RASHAGSWYTDNSKRLAEELDGWLRE-AGLPKSPEVRGVIAPHAGYSYSGRAAAYAFGNI 64
RA +A + S +EL L +G PK E R V A + +GR A +
Sbjct: 34 RAQNAPTVAIHPSFVTQQELTTILASGSGKPKITERRAVSAVLPHHLLAGRLLVDAIEAL 93
Query: 65 DPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEELKATGKFELMDICV 124
+ V L+GP+H + + +S + ++TP G + +D EV++ L A G + D V
Sbjct: 94 ALQEPNLVILVGPNHFNHGGRI-ISGFSGWQTPEGVMEVDAEVVDHLLAKG-LVVTDEEV 151
Query: 125 DEAEHSMEMHLPYLAKVF-EGHLIKIV---PILVGAVNAENEAMYGRLFAKYVDDPSNFF 180
EHS+ P L E ++ ++ I + V+A E + +VDD +
Sbjct: 152 LAKEHSIGALAPMLKHFLPEADIVPLILHHDISLKEVDALLEGL-----EVFVDDKAILI 206
Query: 181 SVSSDFCHWGSRFNYMHYDKK 201
+ S DF H+ +R D++
Sbjct: 207 A-SVDFSHYLTRSEAQFKDQE 226
>gi|315427636|dbj|BAJ49233.1| conserved hypothetical protein, partial [Candidatus Caldiarchaeum
subterraneum]
Length = 119
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 59/119 (49%), Gaps = 9/119 (7%)
Query: 1 MEKIRRASHAGSWYTDNSKRLAEELDG-WLREAGLPKSP-------EVRGVIAPHAGYSY 52
M+ RR + AG +Y + + L +++ +L G K+P +I+PHAG Y
Sbjct: 1 MKPRRRPAVAGYFYEGSREALLRQVEHCFLSPHGPGKTPAREWGKRRAPALISPHAGLMY 60
Query: 53 SGRAAAYAFGNIDPTSISR-VFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVIEE 110
SG AA+ + + ++ V + GP+H+ ++ + TP+G++ +D ++ E
Sbjct: 61 SGPVAAHGYYELTKYAVPESVVVFGPNHYGVGTVVSIYPGGSWVTPLGEVKIDEKLAAE 119
>gi|325095395|gb|EGC48705.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 112
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 31/52 (59%), Gaps = 7/52 (13%)
Query: 5 RRASHAGSWYTDNSKRLAEELDGWLREA-----GLPKSP--EVRGVIAPHAG 49
R A+HAGSWY+D+ L+ +L+ WL + G+ + P R +IAP+A
Sbjct: 44 REATHAGSWYSDHEPTLSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPYAS 95
>gi|311064695|ref|YP_003971420.1| hypothetical protein BBPR_1333 [Bifidobacterium bifidum PRL2010]
gi|310867014|gb|ADP36383.1| Conserved hypothetical protein [Bifidobacterium bifidum PRL2010]
Length = 94
Score = 38.9 bits (89), Expect = 2.9, Method: Composition-based stats.
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 8/72 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG------LPKSPE--VRGVIAPHAGYSYSGR 55
+R A+ AG++Y N+ L +D + A P+ P+ + VI PHAGY YSG
Sbjct: 13 VRPAAVAGAFYPANAGELQAMIDRQMNHARRLVSSLAPRLPQGAPKAVIVPHAGYVYSGS 72
Query: 56 AAAYAFGNIDPT 67
A A+ ++ T
Sbjct: 73 VATLAYALLERT 84
>gi|390937225|ref|YP_006394784.1| hypothetical protein BBB_1313 [Bifidobacterium bifidum BGN4]
gi|389890838|gb|AFL04905.1| hypothetical protein BBB_1313 [Bifidobacterium bifidum BGN4]
Length = 94
Score = 38.5 bits (88), Expect = 3.1, Method: Composition-based stats.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 4 IRRASHAGSWYTDNSKRLAEELDGWLREAG------LPKSPE--VRGVIAPHAGYSYSGR 55
+R A+ AG++Y N+ L +D + A P+ P+ + +I PHAGY YSG
Sbjct: 13 VRPAAVAGAFYPVNAGELQAMIDRQMNHARRLVSSLAPRLPQGAPKALIVPHAGYVYSGS 72
Query: 56 AAAYAFGNIDPT 67
AA A+ ++ T
Sbjct: 73 VAALAYALLERT 84
>gi|374813154|ref|ZP_09716891.1| hypothetical protein TpriZ_04731 [Treponema primitia ZAS-1]
Length = 283
Score = 38.1 bits (87), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 44/195 (22%), Positives = 85/195 (43%), Gaps = 15/195 (7%)
Query: 49 GYSYSGRAAAYAFGNIDPTSISRVFLLGPSHHYYTPKCALSTATVYKTPIGDLPLDLEVI 108
G+ YSG AA AF ++ + V ++G H + ++ +TP+GD+P+D E
Sbjct: 46 GWRYSGAIAARAFLSLK-RDVDTVLVIG-GHIPGGMRPLMAEEDAVETPLGDIPVDTEFR 103
Query: 109 EELKATGKFELMDICVDE-AEHSMEMHLPYLAKVFEGHLIKIVPILVGAVNAENEAMYGR 167
L + +D D+ A++++E+ LP + F K++ + A + E+ G+
Sbjct: 104 IALGQS-----LDCLPDQYADNTVEVQLPMVRYFFPAA--KLLWLRFPADESSFES--GK 154
Query: 168 LFAKYVDDPSNFFSV--SSDFCHWGSRFNYMHYDKKHGVIHKSIEALDKMGMDIIETGDP 225
+ + V S+D H+G+ + Y + + D + +++GDP
Sbjct: 155 EITRIAEKIGRNLVVLGSTDLTHYGNNYGYSPQGGGKKALAWVRDVNDANFIAAVKSGDP 214
Query: 226 DAFKKYLLEYDNTIC 240
A E D + C
Sbjct: 215 RAILDR-AEKDRSAC 228
>gi|407940365|ref|YP_006856006.1| hypothetical protein C380_18370 [Acidovorax sp. KKS102]
gi|407898159|gb|AFU47368.1| hypothetical protein C380_18370 [Acidovorax sp. KKS102]
Length = 191
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 7/85 (8%)
Query: 170 AKYVDDPSNFFSVSSDFCHWGSRFNYMHYDKKHGVIH-----KSIEALDKMGMDIIETGD 224
A Y DP++ F+ S + H+G+ N +DKK G + KS +A + + TG
Sbjct: 24 ANYAIDPTHTFA-SFEISHFGAAVNRGRFDKKEGTVQFDKAGKSGKAEITFDVTSVNTGT 82
Query: 225 PDAFKKYLLEYDNTICGRHPISVFL 249
P AF K+L D +HP F+
Sbjct: 83 P-AFDKHLQSADLFDAAKHPTMKFV 106
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.410
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,658,532,019
Number of Sequences: 23463169
Number of extensions: 196891449
Number of successful extensions: 468330
Number of sequences better than 100.0: 908
Number of HSP's better than 100.0 without gapping: 645
Number of HSP's successfully gapped in prelim test: 263
Number of HSP's that attempted gapping in prelim test: 465485
Number of HSP's gapped (non-prelim): 993
length of query: 291
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 150
effective length of database: 9,050,888,538
effective search space: 1357633280700
effective search space used: 1357633280700
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)